flexnbd-c

Author	SHA1	Message	Date
nick	d87af93cec	tests: Add a migration test with many clients connecting in two waves	2013-09-24 10:11:40 +01:00
nick	71036730c4	Fix a warning in a test	2013-09-23 10:17:50 +01:00
nick	9770bbe42b	tests: Fix for the previous commit	2013-09-20 16:53:30 +01:00
nick	6ffa10bf89	flexnbd: Make a test a bit stricter	2013-09-20 16:00:56 +01:00
nick	0172eb1cba	flexnbd: Some comments and a minor fix in client.c to do with the event stream	2013-09-13 15:17:15 +01:00
nick	efdd613968	listen: Turn off CLIENT_MAX_WAIT_SECS The idea behind this feature was to avoid the client thread in a listen server getting stuck forever if the mirroring thread in the source died. However, it breaks any sane implementation of max_Bps in that thread, and there are lingering concerns over how it might operate under normal conditions anyway. Specifically, if iterating over the bitmap takes a long time, or even just reading the requisite 8MB from the disc in order to send it, then the 5-second timeout could be hit, causing mirroring to fail unnecessarily.	2013-08-14 16:09:55 +01:00
nick	197c1131bf	tests: Tell us which offset fails	2013-06-18 15:35:24 +01:00
Alex Young	574d44f17f	Add a trivial read buffer to flexnbd-proxy. Since the vast majority (something like 94% on boot) are sequential small reads, and since network latency is a major factor in determining how fast the exposed device appears to the client, it makes sense for us to try to minimise the number of network requests where we safely can. This patch implements the simplest possible read cache in flexnbd-proxy. When it receives a read request, if it's a small request then flexnbd-proxy will double the length of data requested. On receiving the data from the upstream server, flexnbd-proxy will return the first half to the downstream as normal, and stash the second half in a buffer. If the very next request is a read, and the offset and length match those of what we have stored, that second request will be satisfied from the buffer without going out over the network. The cache is invalidated by any non-read request, or by a disconnection.	2013-04-29 14:50:42 +01:00
nick	5257e93cb7	flexnbd: Split the proxy mode out into its own binary. "flexnbd-proxy ..." should be identical in operation to "flexnbd proxy ..."	2013-03-19 13:13:37 +00:00
nick	6842864e74	Automated merge with file:///home/lupine/Development/bigv-repos/flexnbd-c-sockutil	2013-02-15 16:53:18 +00:00
nick	98d8fbeaf0	flexnbd: Add a proxy mode This lets us proxy connections between NBD clients and servers, resiliently.	2013-02-15 16:52:16 +00:00
nick	9b67d30608	serve: Make some error conditions non-fatal, test them. We don't want flexnbd serve to fall over and die if the client sends an invalid request.	2013-02-15 16:51:28 +00:00
nick	91085b87fc	flexnbd: Add valgrind suppressions for a bug in glibc-2.11	2013-02-15 13:35:21 +00:00
nick	0fcbe04f80	flexnbd: Remove some obsolete 'rebind' options They steal short options that I want for other things	2013-02-13 13:11:20 +00:00
Alex Young	00d7237f66	Remove an errant debug output from test_happy_path.rb	2012-11-21 09:26:12 +00:00
Alex Young	ed70dacf2f	Don't skip parts of a file when calling fiemap A mis-incremented offset in the fiemap-processing code meant that non-sparse portions of files were missed.	2012-11-20 17:24:19 +00:00
Alex Young	22bea81445	Don't open the control socket until after the server socket is bound This makes it easier for the tests (and supervisor) to guarantee to be able to connect to the server socket. Also this patch moves freeing the mirror supervisor into the server thread.	2012-10-09 17:35:20 +01:00
Alex Young	161d2fccf1	Rename serve->has_control to serve->success. This makes the use of this variable to signal an unexpected SIGTERM while migrating less confusing.	2012-10-09 17:20:39 +01:00
Alex Young	a039ceffcb	Merge	2012-10-08 16:02:37 +01:00
Alex Young	062ecca1fd	Backed out changeset c25e7d82e56e This causes test failures under valgrind, and we don't need the reordering with a background allocation map builder.	2012-10-08 16:01:25 +01:00
Alex Young	1fa8ba82a5	Merge	2012-10-04 14:51:54 +01:00
Alex Young	f3e0d61323	Quit with an error status on SIGTERM during migration This prevents the supervisor from thinking that the migration completed successfully. In order to do this, I've introduced a new lock around the start (and finish) of the migration so that we avoid a race between the signal handler in the server_accept loop and the control thread mirror startup. Without that, we'd risk successfully starting a migration after the SIGTERM handler fired, which would be Bad.	2012-10-04 14:41:55 +01:00
nick	32cae67a75	flexnbd: Move building the allocation map to before server socket bind() Building the allocation map takes time, which scales with the size of the disc being presented. By building that map in the space between bind() and accept(), we leave the process in a useless state after the only good signal we have for "we are ready" and the state where it is actually ready. This was breaking migrations of large files.	2012-09-25 11:47:44 +01:00
Alex Young	ddc57e76d1	Remove an unneeded sanity check from the tests	2012-09-13 15:13:20 +01:00
Alex Young	1d9c88d4ca	Add the write-during-migration test to the acceptance test run	2012-09-13 14:41:50 +01:00
Alex Young	8b43321ef2	Fix for deadlocks when writing while migrating	2012-09-13 12:21:43 +01:00
nick	13328910c8	Add a test case that tickles a deadlock bug when migrating active source discs	2012-09-12 17:13:33 +01:00
nick	ee652a2965	Fix some races in the acceptance tests	2012-09-11 16:21:35 +01:00
Alex Young	33f95e1986	Add the --unlink option to mirror This deletes the local file before tearing down the mirror connection, allowing us to avoid an ambiguous recovery situation.	2012-07-23 13:39:27 +01:00
Alex Young	fd935ce4c9	Simplify the migration handover protocol The three-way hand-off has a problem: there's no way to arrange for the state of the migration to be unambiguous in case of failure. If the final "disconnect" message is lost (as in, the destination never receives it whether it is sent by the sender or not), the destination has no option but to quit with an error status and let a human sort it out. However, at that point we can either arrange to have a .INCOMPLETE file still on disc or not - and it doesn't matter which we choose, we can still end up with dataloss by picking a specific calamity to have befallen the sender. Given this, it makes sense to fall back to a simpler protocol: just send all the data, then send a "disconnect" message. This has the same downside that we need a human to sort out specific failure cases, but combined with --unlink before sending "disconnect" (see next patch) it will always be possible for a human to disambiguate, whether the destination quit with an error status or not.	2012-07-23 10:22:25 +01:00
Alex Young	4790912750	Remove listen mode Changing behaviour so that instead of rebinding after a successful migration and continuing as an ordinary server, we simply quit with a 0 exit code and let our caller restart us as a server if they want to. This means that everything in listen.c, listen.h, and anything making reference to a rebind address is unneeded.	2012-07-23 09:48:50 +01:00
Alex Young	314c0c2a2a	Added the `flexnbd break` command to stop mirroring	2012-07-17 16:30:49 +01:00
Alex Young	1caa3d4e27	Make an EADDRINUSE on server bind fatal. This is important because if we try to rebind after a migration and someone else is in the way, any clients trying to reconnect to us will instead be connecting to the squatter.	2012-07-16 12:34:39 +01:00
Alex Young	8814894874	Test setting an ACL	2012-07-16 11:38:01 +01:00
Alex Young	d0b39cce08	Flush bad write data from the client socket. If the client makes a write that's out of range, by the time we get to validate the message at the server end the client has already stuffed the socket with data we can't use, so we have to flush it. This patch also fixes a potential problem in the acceptance tests where the error field was being returned as an array rather than a value.	2012-07-15 23:19:12 +01:00
Alex Young	f5850e5aaf	Switch from expecting a reconnection to not doing do If we're aborting mirror operations early, a couple of specs need to change sense.	2012-07-15 22:07:00 +01:00
Alex Young	10625e402b	Move the mirror commit state mbox to struct control The mirror_super signals the commit state to the control thread via an mbox, and this mbox is moved to control. It was owned by mirror_super, but the problem with that is that mirror_super can free the mbox before the control client has been scheduled to receive the message. If it's owned by the control object, that can't happen.	2012-07-15 21:57:36 +01:00
Alex Young	b20fbc6a66	Don't retry a mirror which failed on the first attempt If the mirror attempt failed and we were able to report an error to the user, it makes no sense to attempt a retry. We don't have a way to abort a mirror attempt yet, so if the user got a setting wrong and it's failing for that reason, the only recourse they'd have would be to restart the server.	2012-07-15 20:07:17 +01:00
Alex Young	e77234c6b1	Close the mirror client socket on rejection If the mirror attempt connects ok, but is rejected (say, for reporting the wrong size), the client socket needs to be closed. The destination end can't close its socket and accept another connection attempt unless it does.	2012-07-15 18:30:20 +01:00
Alex Young	e0a61e91e6	Simplify acceptance test launching Get rid of checking for --verbose, since it's always there now	2012-07-15 17:14:22 +01:00
Alex Young	69ad6d6b7a	Only copy constants from C to Ruby once This avoids unnecessary duplicate constant warnings for C constants that are defined in two legs of an #ifdef.	2012-07-14 17:25:26 +01:00
Alex Young	e4d2b9a667	Make test sockets less dependent on enviroment It seems that ruby in a default wheezy VM can't handle a source address of nil.	2012-07-14 10:04:55 +01:00
Alex Young	2e4e592c08	Enable writing after the 2G boundary This patch fixes a bug in readwrite.c which truncated the 'from' field in nbd requests. It was casting them down from an off64_t to an int.	2012-07-12 18:01:10 +01:00
Alex Young	c6a084ce82	Add a --quiet command-line option --quiet will suppress all log lines except FATAL. Conceptually it's exclusive with --verbose, but this isn't checked - last one wins.	2012-07-12 14:45:55 +01:00
Alex Young	10b46beeea	Retry failed rebind attempts When we receive a migration, if rebinding to the new listen address and port fails for a reason which might be fixable, rather than killing the server we retry once a second. Also in this patch: non-overlapping log messages and a fix for the client going away halfway through a sendfile loop.	2012-07-12 14:14:46 +01:00
Alex Young	71b7708964	Minor tidy	2012-07-12 10:22:31 +01:00
Alex Young	eb90308b6e	Handle a failed disconnect correctly If the sender disconnects its socket before sending the disconnect message, the destination should restart the migration process. This patch makes sure that happens.	2012-07-12 09:39:39 +01:00
Alex Young	f3cebcdcd5	Test a source crashing after an entrust. This adds a test for destination behaviour, in that if a source crashes after sending an entrust message but before the destination can reply, the destination must allow the source to reconnect and retry the mirror.	2012-07-11 15:19:50 +01:00
Alex Young	84dd052465	Fix a test broken by stdout/stderr reshuffle	2012-07-11 10:12:10 +01:00
Alex Young	f3f017a87d	Free all possibly held mutexes in error handlers Now that we have 3 mutexes lying around, it's important that we check and free these if necessary if error() is called in any thread that can hold them. To do this, we now have flexthread.c, which defines a flexthread_mutex struct. This is a wrapper around a pthread_mutex_t and a pthread_t. The idea is that in the error handler, the thread can check whether it holds the mutex and can free it if and only if it does. This is important because pthread fast mutexes can be freed by any thread, not just the thread which holds them. Note: it is only ever safe for a thread to check if it holds the mutex itself. It is never safe to check if another thread holds a mutex without first locking that mutex, which makes the whole operation rather pointless.	2012-07-11 09:43:16 +01:00

1 2

60 Commits