why not add an (optional) sync before exiting? #13

RJVB · 2019-01-07T15:42:47Z

I was discussing EMD with some data reliability experts, found we agreed it would be a good idea to do a single sync() before exiting and we wondered why this wasn't yet implemented.

Of course sync() will flush all filesystems making it potentially more expensive than necessary but most of the time it shouldn't cost a lot while making EMD a lot safer in theory. Evidently one could make such a final sync optional, or allow to deactivate it (via an env. variable for instance).

A library destructor function (attribute(destructor)) appears to be unreliable for this, but exit() can be overloaded exactly like the other functions.

I can make a PR if there's interest for this.

The text was updated successfully, but these errors were encountered:

laurynas-biveinis · 2019-01-08T09:06:40Z

Since one of the use cases for libeatmydata is speeding up database testing, I wonder how much extra syncing this would result with PostgreSQL process-per-connection design

RJVB · 2019-01-08T09:17:43Z

I have no idea, but probably one sync per connection. You'd have to test how costly that is I've forked and branched the code and done an implementation of my idea, plus some unrelated changes to get the code to build and run properly on Mac (OS X 10.9). One addition I made to my initial idea: sync only when anything has actually been "swallowed", i.e. keep a counter that tracks the number of times a sync operation or mode has been avoided. I will do a PR later today or tomorrow. R.

dermoth · 2020-05-28T05:42:37Z

Why not run sync after the application has finished? It could also be implemented as a wrapper script, but please don't make this the default. I've seen systems take hours to complete a single sync() because of the sheer amount of dirty data and the nonstop writes - surely if I used libeatmydata for an app on them that would be to avoid impact to the block devices and I would never want to flush GB's of dirty buffers at the end!

dermoth · 2021-10-09T06:42:54Z

Another option that would be quite nice, allowing to disable libeatmydata using env. For fun I added libeatmydata.so /etc/ld.so.preload (including on initrd) on a very slow test system so the entire system runs libeatmydata. but I would like to be able to tell it when to sync the fs anyway (ex before a reboot in case it would spend too much time syncing and timeout before everything is flushed).

RJVB mentioned this issue Jan 9, 2019

Usability and Mac improvements #14

Open

RingsC mentioned this issue Sep 30, 2019

Can not act as we expected ON ARM platform. #16

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

why not add an (optional) sync before exiting? #13

why not add an (optional) sync before exiting? #13

RJVB commented Jan 7, 2019

laurynas-biveinis commented Jan 8, 2019

RJVB commented Jan 8, 2019 via email

dermoth commented May 28, 2020

dermoth commented Oct 9, 2021

why not add an (optional) sync before exiting? #13

why not add an (optional) sync before exiting? #13

Comments

RJVB commented Jan 7, 2019

laurynas-biveinis commented Jan 8, 2019

RJVB commented Jan 8, 2019 via email

dermoth commented May 28, 2020

dermoth commented Oct 9, 2021