Streaming / continuous backups? #1030
Comments
We thought about system.part_log + LIVE VIEW (https://clickhouse.com/docs/en/sql-reference/statements/create/view#live-view-deprecated), but unfortunately LIVE VIEW is deprecated now. We also thought about using https://github.com/fsnotify/fsnotify to watch for new data parts, make hard links in an upload folder, and upload the data parts to remote storage, as a workaround. In any case, we are open to pull requests.
OK, thanks. We hoped we were missing something more convenient. We did consider trying to abuse WINDOW VIEW (https://clickhouse.com/docs/en/sql-reference/statements/create/view#window-view-experimental). Have you looked at that?
We don't need aggregation (the goal of WINDOW VIEW); we need a proper way to watch for new data parts. Try the workaround I suggested above.
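For anyone who wants to try the workaround, here is a rough sketch of the "hard-link new parts into an upload folder" step. It is not part of clickhouse-backup, and it uses plain directory polling from the Python standard library instead of fsnotify. The function name `link_new_parts`, the `seen` set, and the directory layout are all assumptions for illustration: it assumes each part is a directory directly under the table's data directory, that in-progress parts carry a `tmp_` prefix, and that `detached` should be skipped. Hard links also require the upload folder to be on the same filesystem as the data.

```python
import os


def link_new_parts(table_dir, upload_dir, seen):
    """Hard-link part directories not yet in `seen` into upload_dir.

    Returns the names of the parts linked during this call.
    Assumed layout (hypothetical): table_dir/<part_name>/<files...>
    """
    new_parts = []
    for entry in sorted(os.scandir(table_dir), key=lambda e: e.name):
        name = entry.name
        # Skip plain files, detached parts, and parts still being written.
        if not entry.is_dir() or name == "detached" or name.startswith("tmp_"):
            continue
        if name in seen:
            continue
        # Walk the part so nested directories are mirrored as well.
        for root, _dirs, files in os.walk(entry.path):
            rel = os.path.relpath(root, table_dir)
            os.makedirs(os.path.join(upload_dir, rel), exist_ok=True)
            for fname in files:
                dst = os.path.join(upload_dir, rel, fname)
                if not os.path.exists(dst):
                    # Hard link: no data copy, and the file survives
                    # even if the original part is later removed.
                    os.link(os.path.join(root, fname), dst)
        seen.add(name)
        new_parts.append(name)
    return new_parts
```

Calling this in a loop every few seconds and handing the returned names to a separate uploader would approximate the watcher. Note that merged parts show up as new directories too, so the uploader would need to decide how to handle parts that a later merge replaces.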
What we'd like to do is accept up to N minutes of loss (i.e. much less than the retention window of our queues) and run XXL single-node CH instances with sharding / merging managed externally, since CH ReplicatedMergeTree is much slower than large nodes with plain MergeTree, and according to the docs replicated inserts cost >10x more operations.
However, this requires being able to bring up new nodes pretty quickly without a long rebuild process to handle failover / node replacement. We can then replay inserts very easily from the last row.
Right now backups seem too heavy to run every 60s / every N rows committed. We are self-hosters but I notice this is a feature gap in CH cloud too: backups are only daily.
Any plans to make this more efficient?