Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Query length limit #12

Open
graceblackwell opened this issue Aug 3, 2020 · 6 comments
Open

Query length limit #12

graceblackwell opened this issue Aug 3, 2020 · 6 comments

Comments

@graceblackwell
Copy link

Would it be possible to increase the query length limit? I am wanting to query sequences up to 300kb and it would be good to avoid having to split the sequences up into chunks.

@bingmann
Copy link
Owner

bingmann commented Aug 3, 2020

Yes, this is possible by copying some of the query code.
Will do.

@graceblackwell
Copy link
Author

Oh great! Thanks

@shenwei356
Copy link

Hi @bingmann , how about canceling length limit?

@bingmann
Copy link
Owner

bingmann commented Aug 4, 2020

What do you mean with cancel?
The score counters can be 16-bit (max 65 Ki query length), or 32-bit (max 3 million query length), 64-bit would also be possible, but expensive memory-wise.

@shenwei356
Copy link

I see, I just figure out that 65535 is the maximum 16bit uint, where you use _mm_add_epi16 for parallelizing k-mer count for 8 documents. So replacing _mm_add_epi16 with _mm_add_epi64 can break the limit, in cost of little more memory usage.

@bingmann
Copy link
Owner

bingmann commented Aug 13, 2020

This limitation has been removed in 05588df

Please tell me if the new master version works for you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants