redis

mirror of https://github.com/fluencelabs/redis synced 2025-06-17 19:21:21 +00:00

Author	SHA1	Message	Date
antirez	6bb42b5bfe	Sentinel test: check that role matches at end of 00-base.	2014-02-20 12:28:18 +01:00
antirez	37be5987e6	Sentinel test: ODOWN and agreement.	2014-02-20 12:28:18 +01:00
antirez	b037c897ae	Sentinel test: check reconfig of slaves and old master.	2014-02-20 12:28:18 +01:00
antirez	f7cb83778b	Sentinel test: basic failover tested. Framework improvements.	2014-02-20 12:28:18 +01:00
antirez	a11749f9f6	Sentinel test: basic tests for MONITOR and auto-discovery.	2014-02-20 12:28:18 +01:00
antirez	8292dd50cd	Sentinel test: info fields, master-slave setup, fixes.	2014-02-20 12:28:18 +01:00
antirez	68a3d5602e	Prefix test file names with numbers to force exec order.	2014-02-20 12:28:18 +01:00
antirez	bbbe68aa0d	Sentinel test: provide basic commands to access instances.	2014-02-20 12:28:18 +01:00
antirez	6e4662e479	Sentinel: SENTINEL_SLAVE_RECONF_RETRY_PERIOD -> RECONF_TIMEOUT Rename define to match the new meaning.	2014-02-18 10:30:47 +01:00
antirez	bd31fcf16e	Sentinel: fix slave promotion timeout. If we can't reconfigure a slave in time during failover, go forward as anyway the slave will be fixed by Sentinels in the future, once they detect it is misconfigured. Otherwise a failover in progress may never terminate if for some reason the slave is uncapable to sync with the master while at the same time it is not disconnected.	2014-02-18 10:30:47 +01:00
antirez	2a35abcf12	Sentinel: initial testing framework. Nothing tested at all so far... Just the infrastructure spawning N Sentinels and N Redis instances that the test will use again and again.	2014-02-17 17:39:23 +01:00
antirez	c1786693b1	Test: colorstr moved to util.tcl.	2014-02-17 17:39:19 +01:00
antirez	48af4d4f26	Test: code to test server availability refactored. Some inline test moved into server_is_up procedure. Also find_available_port was moved into util since it is going to be used for the Sentinel test as well.	2014-02-17 17:39:15 +01:00
antirez	58b6dd9beb	Get absoulte config file path before processig 'dir'. The code tried to obtain the configuration file absolute path after processing the configuration file. However if config file was a relative path and a "dir" statement was processed reading the config, the absolute path obtained was wrong. With this fix the absolute path is obtained before processing the configuration while the server is still in the original directory where it was executed.	2014-02-17 17:39:12 +01:00
antirez	c36a5dce54	Sentinel: better specify startup errors due to config file. Now it logs the file name if it is not accessible. Also there is a different error for the missing config file case, and for the non writable file case.	2014-02-17 17:39:09 +01:00
antirez	3c1672da7d	Update cached time in rdbLoad() callback. server.unixtime and server.mstime are cached less precise timestamps that we use every time we don't need an accurate time representation and a syscall would be too slow for the number of calls we require. Such an example is the initialization and update process of the last interaction time with the client, that is used for timeouts. However rdbLoad() can take some time to load the DB, but at the same time it did not updated the time during DB loading. This resulted in the bug described in issue #1535, where in the replication process the slave loads the DB, creates the redisClient representation of its master, but the timestamp is so old that the master, under certain conditions, is sensed as already "timed out". Thanks to @yoav-steinberg and Redis Labs Inc for the bug report and analysis.	2014-02-13 15:13:38 +01:00
antirez	116617c5e7	Log when CONFIG REWRITE goes bad.	2014-02-13 14:33:53 +01:00
antirez	767846dc5b	Test: regression for issue #1549 . It was verified that reverting the commit that fixes the bug, the test no longer passes.	2014-02-13 12:27:10 +01:00
antirez	14143fbede	Fix script cache bug in the scripting engine. This commit fixes a serious Lua scripting replication issue, described by Github issue #1549. The root cause of the problem is that scripts were put inside the script cache, assuming that slaves and AOF already contained it, even if the scripts sometimes produced no changes in the data set, and were not actaully propagated to AOF/slaves. Example: eval "if tonumber(KEYS[1]) > 0 then redis.call('incr', 'x') end" 1 0 Then: evalsha <sha1 step 1 script> 1 0 At this step sha1 of the script is added to the replication script cache (the script is marked as known to the slaves) and EVALSHA command is transformed to EVAL. However it is not dirty (there is no changes to db), so it is not propagated to the slaves. Then the script is called again: evalsha <sha1 step 1 script> 1 1 At this step master checks that the script already exists in the replication script cache and doesn't transform it to EVAL command. It is dirty and propagated to the slaves, but they fail to evaluate the script as they don't have it in the script cache. The fix is trivial and just uses the new API to force the propagation of the executed command regardless of the dirty state of the data set. Thank you to @minus-infinity on Github for finding the issue, understanding the root cause, and fixing it.	2014-02-13 12:16:31 +01:00
antirez	0296aab6da	AOF write error: retry with a frequency of 1 hz.	2014-02-12 16:57:08 +01:00
antirez	dd73a7bf43	AOF: don't abort on write errors unless fsync is 'always'. A system similar to the RDB write error handling is used, in which when we can't write to the AOF file, writes are no longer accepted until we are able to write again. For fsync == always we still abort on errors since there is currently no easy way to avoid replying with success to the user otherwise, and this would violate the contract with the user of only acknowledging data already secured on disk.	2014-02-12 16:57:04 +01:00
antirez	910b6d34f2	Redis 2.9.50 (Redis 3.0.0 beta-1) 3.0.0-beta1	2014-02-11 10:47:54 +01:00
antirez	0725988a07	Cluster: clusterDelNode(): remove node from master's slaves.	2014-02-11 10:34:14 +01:00
antirez	4513d8fcd4	Cluster: UPDATE messages are the norm and verbose. Logging them at WARNING level was of little utility and of sure disturb.	2014-02-11 10:22:05 +01:00
antirez	9721255178	Cluster: redis-trib fix: handling of another trivial case.	2014-02-11 10:22:02 +01:00
antirez	6d550f2de4	Cluster: configEpoch assignment in SETNODE improved. Avoid to trash a configEpoch for every slot migrated if this node has already the max configEpoch across the cluster. Still work to do in this area but this avoids both ending with a very high configEpoch without any reason and to flood the system with fsyncs.	2014-02-11 10:21:58 +01:00
antirez	585e9fb886	Cluster: clusterSetStartupEpoch() made more generally useful. The actual goal of the function was to get the max configEpoch found in the cluster, so make it general by removing the assignment of the max epoch to currentEpoch that is useful only at startup.	2014-02-11 10:21:55 +01:00
antirez	8b5196addf	Cluster: always increment the configEpoch in SETNODE after import. Removed a stale conditional preventing the configEpoch from incrementing after the import in certain conditions. Since the master got a new slot it should always claim a new configuration.	2014-02-11 10:21:52 +01:00
antirez	2e3f6b0fb3	Cluster: on resharding upgrade version of receiving node. The node receiving the hash slot needs to have a version that wins over the other versions in order to force the ownership of the slot. However the current code is far from perfect since a failover can happen during the manual resharding. The fix is a work in progress but the bottom line is that the new version must either be voted as usually, set by redis-trib manually after it makes sure can't be used by other nodes, or reserved configEpochs could be used for manual operations (for example odd versions could be never used by slaves and are always used by CLUSTER SETSLOT NODE).	2014-02-11 00:39:24 +01:00
antirez	a221ae5ce2	Cluster: fsync at every SETSLOT command puts too pressure on disks. During slots migration redis-trib can send a number of SETSLOT commands. Fsyncing every time is a bit too much in production as verified empirically. To make sure configs are fsynced on all nodes after a resharding redis-trib may send something like CLUSTER CONFSYNC. In this case fsyncs were not providing too much value since anyway processes can crash in the middle of the resharding of an hash slot, and redis-trib should be able to recover from this condition anyway.	2014-02-11 00:39:20 +01:00
antirez	77c6fa65f1	Cluster: conditions to clear "migrating" on slot for SETSLOT ... NODE changed. If the slot is manually assigned to another node, clear the migrating status regardless of the fact it was previously assigned to us or not, as long as we no longer have keys for this slot. This avoid a race during slots migration that may leave the slot in migrating status in the source node, since it received an update message from the destination node that is already claiming the slot. This way we are sure that redis-trib at the end of the slot migration is always able to close the slot correctly.	2014-02-11 00:39:14 +01:00
antirez	f406a09495	Cluster: remove debugging xputs from redis-trib.	2014-02-11 00:39:11 +01:00
antirez	01de246843	Cluster: redis-trib fix: cover new case of open slot. The case is the trivial one a single node claiming the slot as migrating, without nodes claiming it as importing.	2014-02-11 00:39:08 +01:00
antirez	8f25428772	redis-trib: log event after we have reference to 'master'.	2014-02-11 00:39:05 +01:00
antirez	cc97305ec3	Cluster: don't update slave's master if we don't know it. There is no way we can update the slave's node->slaveof pointer if we don't know the master (no node with such an ID in our tables).	2014-02-11 00:39:02 +01:00
antirez	fa6f4f21c3	Cluster: ignore slot config changes if we are importing it.	2014-02-11 00:38:59 +01:00
antirez	30214fff3e	Cluster: update configEpoch after manually messing with slots.	2014-02-11 00:38:56 +01:00
antirez	cfcb09bf76	Cluster: redis-trib, more info about open slots error.	2014-02-11 00:38:53 +01:00
antirez	8e12fae05e	Cluster: fixed inverted arguments in logging function call.	2014-02-10 17:21:17 +01:00
antirez	6a01545744	Cluster: clear the FAIL status for masters without slots. Masters without slots don't participate to the cluster but just do redirections, no need to take them in FAIL state if they are back reachable.	2014-02-10 17:19:16 +01:00
antirez	969a4f1db3	Cluster: replica migration should only work for masters serving slots.	2014-02-10 17:08:47 +01:00
antirez	cb92a1ef08	Cluster: redis-trib del-node variable typo fixed.	2014-02-10 16:59:17 +01:00
antirez	6987a95952	Cluster: clusterReadHandler() fixed to work with new message header.	2014-02-10 16:28:44 +01:00
antirez	ad85f520f8	Cluster: don't propagate PUBLISH two times. PUBLISH both published messages via Cluster bus and replication when cluster was enabled, resulting in duplicated message in the slave.	2014-02-10 16:05:26 +01:00
antirez	b82b66b51d	Cluster: signature changed to "RCmb" (Redis Cluster message bus). Sounds better after all.	2014-02-10 16:05:22 +01:00
antirez	b6e04f5584	Cluster: discard bus messages with version != 0.	2014-02-10 16:05:18 +01:00
antirez	0ee1a78c86	Cluster: added signature + version in bus packets.	2014-02-10 16:05:15 +01:00
antirez	ff6a75a0c1	3.0 release notes added.	2014-02-10 15:36:02 +01:00
antirez	b1c3c6e518	Old Changelog file removed from unstable branch.	2014-02-10 15:19:12 +01:00
antirez	dca95f241c	Cluster: redis-trib: options table entry for add-node fixed.	2014-02-10 12:34:21 +01:00

1 2 3 4 5 ...

3896 Commits