Add new tantivy directory for merging combined segments. #111

Open
ChillFish8 opened this issue Oct 7, 2022 · 0 comments

@ChillFish8
Collaborator

Currently, we can only combine two or more segments into a single segment. This works, but it can make our index inefficient.

We should create a directory that can read from the combined segment and split it out into a temp directory (this can be done with the writer directory). We can then tell tantivy to process all of the deletes marked within the index (provided it is safe to do so, see below) and re-export the directory as a new segment.

Issue notes

  • Deletes are not attached to the index that actually contains the documents they are intended to delete, so we cannot blindly remove deletes.
  • Some deletes may occur after a new document has been inserted and should not affect the new documents.
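
The ordering concern in the notes above can be sketched in plain Rust. This is only an illustration of one reading of the second note (a delete should remove documents inserted before it, not documents re-inserted later), and it assumes every operation carries a monotonically increasing sequence number. The `Doc`, `Delete`, `inserted_at`, `issued_at` and `apply_deletes` names are hypothetical and are not part of tantivy or this project.

```rust
/// Hypothetical model of a document held in a segment.
#[derive(Debug, Clone, PartialEq)]
struct Doc {
    key: String,
    /// Assumed sequence number assigned when the document was inserted.
    inserted_at: u64,
}

/// Hypothetical model of a delete recorded alongside the segments.
#[derive(Debug, Clone)]
struct Delete {
    key: String,
    /// Assumed sequence number assigned when the delete was issued.
    issued_at: u64,
}

/// Keeps every document that no delete applies to.
/// A delete applies only to documents with the same key that were
/// inserted before the delete was issued.
fn apply_deletes(docs: &[Doc], deletes: &[Delete]) -> Vec<Doc> {
    docs.iter()
        .filter(|doc| {
            !deletes
                .iter()
                .any(|d| d.key == doc.key && d.issued_at > doc.inserted_at)
        })
        .cloned()
        .collect()
}

/// Small usage example: "a" is inserted, deleted, then inserted again.
fn demo() -> Vec<Doc> {
    let docs = vec![
        Doc { key: "a".to_string(), inserted_at: 1 },
        Doc { key: "a".to_string(), inserted_at: 5 }, // re-inserted after the delete
        Doc { key: "b".to_string(), inserted_at: 2 },
    ];
    let deletes = vec![Delete { key: "a".to_string(), issued_at: 3 }];
    apply_deletes(&docs, &deletes)
}
```

In `demo`, only the first copy of `a` is dropped; the later copy of `a` and the untouched `b` survive. The first note still applies on top of this: a delete could only be discarded once every segment that might hold a matching older document has been processed.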