redis

mirror of https://github.com/fluencelabs/redis synced 2025-06-20 04:26:31 +00:00

Author	SHA1	Message	Date
antirez	a6d560db64	Sentinel test: test majority crashing Sentinels. The test was previously performed by removing the master from the Sentinel monitored masters. The test with the Sentinels crashed is more similar to real-world partitions / failures.	2014-02-25 10:30:35 +01:00
antirez	3f16fee39a	Sentinel test: restart_instance should refresh pid attrib. Also kill_instance was modified to warn when a test will try to kill the same instance multiple times for error.	2014-02-25 10:30:35 +01:00
antirez	e8e2ac072c	Sentinel test: more stuff mored from 00-base to init. The area a number of mandatory tests to craete a stable setup for testing that is not too sensitive to timing issues. All those tests moved to includes/init-tests, and marked as (init).	2014-02-25 10:30:35 +01:00
antirez	85fa77e02e	Sentinel: log quorum with +monitor event.	2014-02-25 10:30:35 +01:00
antirez	2d48d7c2fc	Sentinel test: removed useless code to set SDOWN timeout. The new common initialization code used to start a new unit already set the timeout to 2000 milliseconds.	2014-02-25 10:30:35 +01:00
antirez	6e6106797e	Sentinel: generate +monitor events at startup.	2014-02-25 10:30:35 +01:00
antirez	96162c0c58	Sentinel: log +monitor and +set events. Now that we have a runtime configuration system, it is very important to be able to log how the Sentinel configuration changes over time because of API calls.	2014-02-25 10:30:35 +01:00
antirez	39eacde136	Sentinel: added missing exit(1) after checking for config file.	2014-02-25 10:30:35 +01:00
antirez	28aa15a6ea	Sentinel test: tmp dir and gitignore added.	2014-02-25 10:30:35 +01:00
antirez	3917224f7c	Sentinel test: minor fixes to --pause-on-error.	2014-02-25 10:30:35 +01:00
antirez	001ab29915	Sentinel test: --pause-on-error option added. Pause the test with running instances available for state inspection on error.	2014-02-25 10:30:35 +01:00
antirez	d31a394148	Sentinel test: added empty units to fill later.	2014-02-25 10:30:35 +01:00
antirez	d83ab8f9d4	Sentinel: IDONTKNOW error removed. This error was conceived for the older version of Sentinel that worked via master redirection and that was not able to get configuration updates from other Sentinels via the Pub/Sub channel of masters or slaves. This reply does not make sense today, every Sentinel should reply with the best information it has currently. The error will make even more sense in the future since the plan is to allow Sentinels to update the configuration of other Sentinels via gossip with a direct chat without the prerequisite that they have at least a monitored instance in common.	2014-02-25 10:30:35 +01:00
antirez	1dfab3d107	Sentinel test: framework improved and conf-update unit added. It is now possible to kill and restart sentinel or redis instances for more real-world testing. The 01 unit tests the capability of Sentinel to update the configuration of Sentinels rejoining the cluster, however the test is pretty trivial and more tests should be added.	2014-02-25 10:30:35 +01:00
antirez	aee165e084	Sentinel test: move init tests as includes. Most units will start with these two basic tests to create an environment where the real tests are ran.	2014-02-25 10:30:35 +01:00
antirez	73b290cc5b	Sentinel test: ability to run just a subset of test files.	2014-02-25 10:30:35 +01:00
Matt Stancliff	3ced78dc6c	Auto-enter slaveMode when SYNC from redis-cli If someone asks for SYNC or PSYNC from redis-cli, automatically enter slaveMode (as if they ran redis-cli --slave) and continue printing the replication stream until either they Ctrl-C or the master gets disconnected.	2014-02-25 10:30:35 +01:00
antirez	6441a41f94	Sentinel: report instances role switch events. This is useful mostly for debugging of issues.	2014-02-20 12:28:03 +01:00
antirez	05dde71919	Sentinel test: some reliability fixes to 00-base tests.	2014-02-20 12:28:03 +01:00
antirez	c499d82f28	Sentinel test: check that role matches at end of 00-base.	2014-02-20 12:28:03 +01:00
antirez	c01aff1234	Sentinel test: ODOWN and agreement.	2014-02-20 12:28:03 +01:00
antirez	56cb037a86	Sentinel test: check reconfig of slaves and old master.	2014-02-20 12:28:03 +01:00
antirez	d4edf63bc4	Sentinel test: basic failover tested. Framework improvements.	2014-02-20 12:28:02 +01:00
antirez	722d4f0c41	Sentinel test: basic tests for MONITOR and auto-discovery.	2014-02-20 12:28:02 +01:00
antirez	b26660e130	Sentinel test: info fields, master-slave setup, fixes.	2014-02-20 12:28:02 +01:00
antirez	ac5fd5a519	Prefix test file names with numbers to force exec order.	2014-02-20 12:28:02 +01:00
antirez	890559ea1c	Sentinel test: provide basic commands to access instances.	2014-02-20 12:28:02 +01:00
antirez	905c55d5ae	Sentinel: SENTINEL_SLAVE_RECONF_RETRY_PERIOD -> RECONF_TIMEOUT Rename define to match the new meaning.	2014-02-18 10:30:29 +01:00
antirez	1b345ec3f0	Sentinel: fix slave promotion timeout. If we can't reconfigure a slave in time during failover, go forward as anyway the slave will be fixed by Sentinels in the future, once they detect it is misconfigured. Otherwise a failover in progress may never terminate if for some reason the slave is uncapable to sync with the master while at the same time it is not disconnected.	2014-02-18 10:30:28 +01:00
antirez	351e790032	Sentinel: initial testing framework. Nothing tested at all so far... Just the infrastructure spawning N Sentinels and N Redis instances that the test will use again and again.	2014-02-17 17:38:58 +01:00
antirez	f71c7de88e	Test: colorstr moved to util.tcl.	2014-02-17 17:38:54 +01:00
antirez	0886c98fef	Test: code to test server availability refactored. Some inline test moved into server_is_up procedure. Also find_available_port was moved into util since it is going to be used for the Sentinel test as well.	2014-02-17 12:29:54 +01:00
antirez	4237f14a8a	Get absoulte config file path before processig 'dir'. The code tried to obtain the configuration file absolute path after processing the configuration file. However if config file was a relative path and a "dir" statement was processed reading the config, the absolute path obtained was wrong. With this fix the absolute path is obtained before processing the configuration while the server is still in the original directory where it was executed.	2014-02-17 12:14:19 +01:00
antirez	5efee4f0bb	Sentinel: better specify startup errors due to config file. Now it logs the file name if it is not accessible. Also there is a different error for the missing config file case, and for the non writable file case.	2014-02-17 12:10:12 +01:00
antirez	1d4d9e7b14	Redis 2.8.6. 2.8.6	2014-02-13 15:43:40 +01:00
antirez	85492dcfef	Update cached time in rdbLoad() callback. server.unixtime and server.mstime are cached less precise timestamps that we use every time we don't need an accurate time representation and a syscall would be too slow for the number of calls we require. Such an example is the initialization and update process of the last interaction time with the client, that is used for timeouts. However rdbLoad() can take some time to load the DB, but at the same time it did not updated the time during DB loading. This resulted in the bug described in issue #1535, where in the replication process the slave loads the DB, creates the redisClient representation of its master, but the timestamp is so old that the master, under certain conditions, is sensed as already "timed out". Thanks to @yoav-steinberg and Redis Labs Inc for the bug report and analysis.	2014-02-13 15:13:35 +01:00
antirez	eb10bb31ac	Log when CONFIG REWRITE goes bad.	2014-02-13 14:33:50 +01:00
antirez	ebdb37cea7	Test: regression for issue #1549 . It was verified that reverting the commit that fixes the bug, the test no longer passes.	2014-02-13 12:27:08 +01:00
antirez	9b73a274ac	Fix script cache bug in the scripting engine. This commit fixes a serious Lua scripting replication issue, described by Github issue #1549. The root cause of the problem is that scripts were put inside the script cache, assuming that slaves and AOF already contained it, even if the scripts sometimes produced no changes in the data set, and were not actaully propagated to AOF/slaves. Example: eval "if tonumber(KEYS[1]) > 0 then redis.call('incr', 'x') end" 1 0 Then: evalsha <sha1 step 1 script> 1 0 At this step sha1 of the script is added to the replication script cache (the script is marked as known to the slaves) and EVALSHA command is transformed to EVAL. However it is not dirty (there is no changes to db), so it is not propagated to the slaves. Then the script is called again: evalsha <sha1 step 1 script> 1 1 At this step master checks that the script already exists in the replication script cache and doesn't transform it to EVAL command. It is dirty and propagated to the slaves, but they fail to evaluate the script as they don't have it in the script cache. The fix is trivial and just uses the new API to force the propagation of the executed command regardless of the dirty state of the data set. Thank you to @minus-infinity on Github for finding the issue, understanding the root cause, and fixing it.	2014-02-13 12:16:34 +01:00
antirez	96973a7c33	AOF write error: retry with a frequency of 1 hz.	2014-02-12 16:57:17 +01:00
antirez	fadbbdd3f4	AOF: don't abort on write errors unless fsync is 'always'. A system similar to the RDB write error handling is used, in which when we can't write to the AOF file, writes are no longer accepted until we are able to write again. For fsync == always we still abort on errors since there is currently no easy way to avoid replying with success to the user otherwise, and this would violate the contract with the user of only acknowledging data already secured on disk.	2014-02-12 16:57:13 +01:00
antirez	688d32e16b	Don't count time to feed MONITORs in SLOWLOG.	2014-02-07 18:29:26 +01:00
antirez	3e4968339b	Sentinel: allow SHUTDOWN command in Sentinel mode.	2014-02-07 11:22:30 +01:00
antirez	301a0cfc69	Check for EAGAIN in sendBulkToSlave(). Sometime an osx master with a Linux server over a slow link caused a strange error where osx called the writable function for the socket but actually apparently there was no room in the socket buffer to accept the write: write(2) call returned an EAGAIN error, that was not checked, so we considered write(2) == 0 always as a connection reset, which was unfortunate since the bulk transfer has to start again. Also more errors are logged with the WARNING level in the same code path now.	2014-02-05 16:41:04 +01:00
antirez	4e809a9a19	Redis 2.8.5. 2.8.5	2014-02-04 11:17:21 +01:00
antirez	ddcf160309	Move mstime_t define outside sentinel.c. The define is now used in other parts of Redis 2.8 tree instead of long long. A nice side effect is that now 2.8 and unstable sentinel.c files are identical as it should be.	2014-02-03 16:34:46 +01:00
antirez	c5bc592650	Scripting: expire keys in scripts only at first access. Keys expiring in the middle of the execution of Lua scripts are to create inconsistencies in masters and / or AOF files. See the following example: if redis.call("exists",KEYS[1]) == 1 then redis.call("incr","mycounter") end if redis.call("exists",KEYS[1]) == 1 then return redis.call("incr","mycounter") end The script executes two times the same if key exists then incrementcounter logic. However the two executions will work differently in the master and the slaves, provided some unlucky timing happens. In the master the first time the key may still exist, while the second time the key may no longer exist. This will result in the key incremented just one time. However as a side effect the master will generate a synthetic `DEL` command in the replication channel in order to force the slaves to expire the key (given that key expiration is master-driven). When the same script will run in the slave, the key will no longer be there, so the script will not increment the key. The key idea used to implement the expire-at-first-lookup semantics was provided by Marc Gravell.	2014-02-03 16:29:25 +01:00
antirez	5201ca0ca1	Allow CONFIG and SHUTDOWN while in stale-slave state.	2014-02-03 15:51:07 +01:00
antirez	3da5cbe5bb	Scripting: use mstime() and mstime_t for lua_time_start. server.lua_time_start is expressed in milliseconds. Use mstime_t instead of long long, and populate it with mstime() instead of ustime()/1000. Functionally identical but more natural.	2014-02-03 15:46:47 +01:00
PatrickJS	0be31e2d22	update copyright year	2014-02-03 11:19:25 +01:00

1 2 3 4 5 ...

3528 Commits