redis

mirror of https://github.com/fluencelabs/redis synced 2025-07-03 10:51:33 +00:00

Author	SHA1	Message	Date
Jan-Erik Rediger	9b0a47cbc8	Do not attempt to lock on Solaris	2015-07-17 11:04:24 +02:00
antirez	d815289d54	Don't try to bind the source address for MIGRATE Related to issues #2609 and #2612.	2015-06-11 13:00:52 +02:00
antirez	3468cd3664	Cluster: redirection refactoring + handling of blocked clients. There was a bug in Redis Cluster caused by clients blocked in a blocking list pop operation, for keys no longer handled by the instance, or in a condition where the cluster became down after the client blocked. A typical situation is: 1) BLPOP <somekey> 0 2) <somekey> hash slot is resharded to another master. The client will block forever int this case. A symmentrical non-cluster-specific bug happens when an instance is turned from master to slave. In that case it is more serious since this will desynchronize data between slaves and masters. This other bug was discovered as a side effect of thinking about the bug explained and fixed in this commit, but will be fixed in a separated commit.	2015-03-24 16:16:44 +01:00
antirez	1641f41cfc	Two cluster.c comments improved.	2015-03-21 18:23:10 +01:00
antirez	b37b2b5c14	Cluster: TAKEOVER option for manual failover.	2015-03-21 18:23:06 +01:00
antirez	b64c861171	Cluster: non-conditional steps of slave failover refactored into a function.	2015-03-21 18:22:46 +01:00
antirez	47bbaa17b0	Cluster: separate unknown master check from the rest. In no case we should try to attempt to failover if myself->slaveof is NULL.	2015-03-21 18:22:39 +01:00
antirez	0595420b1e	Cluster: refactoring around configEpoch handling. This commit moves the process of generating a new config epoch without consensus out of the clusterCommand() implementation, in order to make it reusable for other reasons (current target is to have a CLUSTER FAILOVER option forcing the failover when no master majority is reachable). Moreover the commit moves other functions which are similarly related to config epochs in a new logical section of the cluster.c file, just for clarity.	2015-03-21 18:22:33 +01:00
antirez	62893f5b9f	Cluster: better cluster state transiction handling. Before we relied on the global cluster state to make sure all the hash slots are linked to some node, when getNodeByQuery() is called. So finding the hash slot unbound was checked with an assertion. However this is fragile. The cluster state is often updated in the clusterBeforeSleep() function, and not ASAP on state change, so it may happen to process clients with a cluster state that is 'ok' but yet certain hash slots set to NULL. With this commit the condition is also checked in getNodeByQuery() and reported with a identical error code of -CLUSTERDOWN but slightly different error message so that we have more debugging clue in the future. Root cause of issue #2288.	2015-03-20 10:06:11 +01:00
antirez	d8236ea262	Cluster: more robust slave check in CLUSTER REPLICATE. There are rare conditions where node->slaveof may be NULL even if the node is a slave. To check by flag is much more robust.	2015-03-18 12:09:39 +01:00
Michel Martens	f36482dd5f	Add command CLUSTER MYID	2015-03-18 11:29:32 +01:00
antirez	938dfdc1ea	Migrate: replace conditional with pre-computed value.	2015-02-27 22:34:18 +01:00
antirez	53659404e4	Improvements to PR #2425 1. Remove useless "cs" initialization. 2. Add a "select" var to capture a condition checked multiple times. 3. Avoid duplication of the same if (!copy) conditional. 4. Don't increment dirty if copy is given (no deletion is performed), otherwise we propagate MIGRATE when not needed.	2015-02-26 10:29:02 +01:00
Tommy Wang	97c4167aa1	Add last_dbid to migrateCachedSocket to avoid redundant SELECT Avoid redundant SELECT calls when continuously migrating keys to the same dbid within a target Redis instance.	2015-02-25 13:08:35 -06:00
antirez	55f2bc646a	Cluster: some bias towwards FAIL/PFAIL nodes in gossip sections. This improves PFAIL -> FAIL switch. Too late at this point in the RC releases to add proper PFAIL/FAIL separate dictionary to do this in a less randomized way. Tested in practice with experiments that this helps. PFAIL -> FAIL average with 20 nodes and node-timeout set to 5 seconds takes 2.5 seconds without this commit, 1 second with this commit.	2015-01-30 12:18:42 +01:00
antirez	0f1b9c3db1	More correct wanted / maxiterations values in clusterSendPing().	2015-01-30 12:18:42 +01:00
antirez	2553f6c9e5	Cluster: initialized not used fileds in gossip section. Otherwise we risk sending not initialized data to other nodes, that may contain anything. This was actually not possible only because the initialization of the buffer where the cluster packets header is created was larger than the 3 gossip sections we use, so the memory was already all filled with zeroes by the memset().	2015-01-29 15:52:17 +01:00
antirez	2616d6f6dc	Cluster: magical 10% of nodes explained in comments.	2015-01-29 15:44:54 +01:00
antirez	92f29b8904	CLUSTER count-failure-reports command added.	2015-01-29 15:44:49 +01:00
antirez	8dd3263216	Cluster: use a number of gossip sections proportional to cluster size. Otherwise it is impossible to receive the majority of failure reports in the node_timeout*2 window in larger clusters. Still with a 200 nodes cluster, 20 gossip sections are a very reasonable amount of bytes to send. A side effect of this change is also fater cluster nodes joins for large clusters, because the cluster layout makes less time to propagate.	2015-01-29 15:44:46 +01:00
Matt Stancliff	ebb07a0b48	Fix cluster migrate memory leak Fixes valgrind error: 48 bytes in 1 blocks are definitely lost in loss record 196 of 373 at 0x4910D3: je_malloc (jemalloc.c:944) by 0x42807D: zmalloc (zmalloc.c:125) by 0x41FA0D: dictGetIterator (dict.c:543) by 0x41FA48: dictGetSafeIterator (dict.c:555) by 0x459B73: clusterHandleSlaveMigration (cluster.c:2776) by 0x45BF27: clusterCron (cluster.c:3123) by 0x423344: serverCron (redis.c:1239) by 0x41D6CD: aeProcessEvents (ae.c:311) by 0x41D8EA: aeMain (ae.c:455) by 0x41A84B: main (redis.c:3832)	2015-01-22 10:35:40 +01:00
Matt Stancliff	98faed3a3f	Fix potential invalid read past end of array If array has N elements, we can't read +1 if we are already at N. Also, we need to move elements by their storage size in the array, not just by individual bytes.	2015-01-22 10:35:36 +01:00
Matt Stancliff	97ffeb7c09	Fix cluster reset memory leak [maybe] Fixes valgrind errors: 32 bytes in 4 blocks are definitely lost in loss record 107 of 228 at 0x80EA447: je_malloc (jemalloc.c:944) by 0x806E59C: zrealloc (zmalloc.c:125) by 0x80A9AFC: clusterSetMaster (cluster.c:801) by 0x80AEDC9: clusterCommand (cluster.c:3994) by 0x80682A5: call (redis.c:2049) by 0x8068A20: processCommand (redis.c:2309) by 0x8076497: processInputBuffer (networking.c:1143) by 0x8073BAF: readQueryFromClient (networking.c:1208) by 0x8060E98: aeProcessEvents (ae.c:412) by 0x806123B: aeMain (ae.c:455) by 0x806C3DB: main (redis.c:3832) 64 bytes in 8 blocks are definitely lost in loss record 143 of 228 at 0x80EA447: je_malloc (jemalloc.c:944) by 0x806E59C: zrealloc (zmalloc.c:125) by 0x80AAB40: clusterProcessPacket (cluster.c:801) by 0x80A847F: clusterReadHandler (cluster.c:1975) by 0x30000FF: ??? 80 bytes in 10 blocks are definitely lost in loss record 148 of 228 at 0x80EA447: je_malloc (jemalloc.c:944) by 0x806E59C: zrealloc (zmalloc.c:125) by 0x80AAB40: clusterProcessPacket (cluster.c:801) by 0x80A847F: clusterReadHandler (cluster.c:1975) by 0x2FFFFFF: ???	2015-01-22 10:35:32 +01:00
Matt Stancliff	4a36350d9f	Fix sending uninitialized bytes Fixes valgrind error: Syscall param write(buf) points to uninitialised byte(s) at 0x514C35D: ??? (syscall-template.S:81) by 0x456B81: clusterWriteHandler (cluster.c:1907) by 0x41D596: aeProcessEvents (ae.c:416) by 0x41D8EA: aeMain (ae.c:455) by 0x41A84B: main (redis.c:3832) Address 0x5f268e2 is 2,274 bytes inside a block of size 8,192 alloc'd at 0x4932D1: je_realloc (jemalloc.c:1297) by 0x428185: zrealloc (zmalloc.c:162) by 0x4269E0: sdsMakeRoomFor.part.0 (sds.c:142) by 0x426CD7: sdscatlen (sds.c:251) by 0x4579E7: clusterSendMessage (cluster.c:1995) by 0x45805A: clusterSendPing (cluster.c:2140) by 0x45BB03: clusterCron (cluster.c:2944) by 0x423344: serverCron (redis.c:1239) by 0x41D6CD: aeProcessEvents (ae.c:311) by 0x41D8EA: aeMain (ae.c:455) by 0x41A84B: main (redis.c:3832) Uninitialised value was created by a stack allocation at 0x457810: nodeUpdateAddressIfNeeded (cluster.c:1236)	2015-01-22 10:35:27 +01:00
antirez	0a3edcbe51	Cluster: node deletion cleanup / centralization.	2015-01-22 10:35:12 +01:00
antirez	5130c2536b	Cluster: set the slaves->slaveof filed to NULL when master is freed. Related to issue #2289.	2015-01-22 10:35:08 +01:00
antirez	df1a7fc4fe	Cluster: fetch my IP even if msg is not MEET for the first time. In order to avoid that misconfigured cluster nodes at some time may force an IP update on other nodes, it is required that nodes update their own address only on MEET messages. However it does not make sense to do this the first time a node is contacted and yet does not have an IP, we just risk that myself->ip remains not assigned if there are messages lost or cluster creation procedures that don't make sure everybody is targeted by at least one incoming MEET message. Also fix the logging of the IP switch avoiding the :-1 tail.	2015-01-13 16:23:48 +01:00
antirez	45e2a26ded	Cluster: clusterMsgDataGossip structure, explict padding + minor stuff. Also explicitly set version to 0, add a protocol version define, improve comments in the gossip structure. Note that the structure layout is the same after the change, we are just making the padding explicit with an additional not used 16 bits field. So this commit is still able to talk with the previous versions of cluster nodes.	2015-01-13 16:23:44 +01:00
antirez	799a3ccac1	Suppress valgrind error about write sending uninitialized data. Valgrind checks that the buffers we transfer via syscalls are all composed of bytes actually initialized. This is useful, it makes we able to avoid leaking informations in non initialized parts fo messages transferred to other hosts. This commit fixes one of such issues.	2015-01-13 16:23:40 +01:00
antirez	1584c7a31b	Cluster: initialize mf_end. Can't be initialized by resetManualFailover() since it's actual state the function uses, so we need to initialize it at startup time. Not really a bug in practical terms, but showed up into valgrind and is not technically correct anyway.	2015-01-12 15:40:30 +01:00
Matt Stancliff	8bce654246	Cluster: Notify user on accept error If we woke up to accept a connection, but we can't accept it, inform the user of the error going on with their networking. (The previous message was the same for success or error!)	2014-12-17 17:48:55 +01:00
antirez	86213b4e03	Fix comment in clusterHandleSlaveFailover().	2014-12-16 15:03:23 +01:00
antirez	2c6dc9f15f	Make sure buffer is enough in clusterSendPing().	2014-12-15 10:18:29 +01:00
antirez	73996c8615	Cluster PUBLISH message: fix totlen count. bulk_data field size was not removed from the count. It is not possible to declare it simply as 'char bulk_data[]' since the structure is nested into another structure.	2014-12-09 13:01:47 +01:00
Matt Stancliff	75e68625f1	Parse cluster state file in IPv6 compatible way We need to pick the port based on the _last_ colon, not the first one.	2014-10-31 10:39:33 +01:00
Matt Stancliff	f1a6f78024	Networking: add more outbound IP binding fixes Same as the original bind fixes (we just missed these the first time around). This helps Redis not automatically send connections from the first IP on an interface if we are bound to a specific IP address (e.g. with multiple IP aliases on one interface, you want to send from _your_ IP, not from the first IP on the interface).	2014-10-31 10:02:42 +01:00
antirez	c6226f262f	Cluster: process gossip section only for known nodes. With the exception of nodes sending MEET packets: we have to trust them since they can send us MEET packets only when the cluster is initially created or because sysadmin manual action.	2014-10-09 10:52:28 +02:00
antirez	419eb18505	Cluster: fix logic to detect we are among a minority. In the cluster evaluation function we are supposed to set the cluster state as "fail" if we are among a minority, however the code was not detecting to be into a minority partition if exactly half the masters were reachable, which is a minority.	2014-10-09 10:52:28 +02:00
antirez	9a867b686a	Cluster: more chatty slaves when failover is stalled.	2014-10-08 09:12:43 +02:00
Matt Stancliff	bd62c95200	Clean up text throughout project - Remove trailing newlines from redis.conf - Fix comment misspelling - Clarifies zipEncodeLength usage and a C API mention (#1243, #1242) - Fix cluster typos (inspired by @papanikge #1507) - Fix rewite -> rewrite in a few places (inspired by #682) Closes #1243, #1242, #1507	2014-10-06 10:07:01 +02:00
antirez	015cbf3015	Cluster: claim ping_sent time even if we can't connect. This fixes a potential bug that was never observed in practice since what happens is that the asynchronous connect returns ok (to fail later, calling the handler) every time, so a ping is queued, and sent_ping happens to always be populated. Howver technically connect(2) with a non blocking socket may return an error synchronously, so before this fix the code was not correct.	2014-09-19 14:22:00 +02:00
antirez	d4c3c1248f	Cluster: new option to work with partial slots coverage.	2014-09-19 14:22:00 +02:00
Matt Stancliff	3e22384193	Cluster: Fix segfault if cluster config corrupt This commit adds a size check after initial config line parsing to make sure we have at least 8 arguments per line. Also, instead of asserting for cluster->myself, we just test and error out normally (since the error does a hard exit anyway). Closes #1597	2014-08-26 10:41:03 +02:00
Matt Stancliff	29ff27d430	Fix memory leak in cluster config parsing The continue stop us from triggering the free after the long line for loop, so add it earlier.	2014-08-26 10:41:03 +02:00
Matt Stancliff	d409b5acd3	Clarify existing slot wording on cluster start	2014-08-26 10:41:02 +02:00
antirez	d34fade2da	Remove warnings and improve integer sign correctness.	2014-08-26 10:41:02 +02:00
antirez	990ec8dfc1	representRedisNodeFlags() moved into right code section. The funciton was also modified in order to be more standalone and produce an output without trailing spaces, making the reuse simpler. The global variable was renamed in cammel case as most other Redis globals, except the main ones we refer too many times, like 'server'.	2014-08-26 10:41:02 +02:00
charsyam	2c2204e050	Refactor cluster flag printing Less copy/paste code duplication. Closes #952	2014-08-26 10:41:02 +02:00
SungBin_Hong	987127c6e7	Free memory in clusterLoadConfig error handler Closes #1327	2014-08-26 10:41:02 +02:00
antirez	c5bd330f9e	Cluster: don't migrate to a master that never had slaves. Replica migration algorithm modified so that slaves never try to migrate to masters that were never configured to have slaves in the past. We want the algorithm to take care of masters that remained without working slaves, but that used to have slaves according to the cluster configuration.	2014-07-28 14:55:10 +02:00

1 2 3 4 5 ...

451 Commits