Replies: 7 comments 13 replies
-
Hi @arita37! Thanks for your interest in the project!
There is, currently, no direct configuration for this in the service. You can try to limit RAM usage using docker. It will force the service to switch vector storage to the mmaped files, starting from the given number of vectors per segment. The downside is slower retrieval speed - check out our benchmark for details: https://github.com/qdrant/benchmark/tree/master/memory_benchmark
Looks like data of this size would require distributed deployment, please follow our roadmap https://github.com/qdrant/qdrant/pull/158/files for updates on this feature
We haven't published binaries yet, but it should be possible to extract them from docker container. PS: I am going to convert this issue into a discussion |
Beta Was this translation helpful? Give feedback.
-
Sure,
performance: https://github.com/qdrant/qdrant/blob/v0.4.2/config/production.yaml Is your docker regularly in line with latest config ?
2021-12-31T16:35:40Z INFO actix_server::builder] Starting 32 workers
|
Beta Was this translation helpful? Give feedback.
-
Or maybe the network part…. |
Beta Was this translation helpful? Give feedback.
-
Thx, gd news,
It seems coming from underlying network….
Will try with persistence of 60sec.
Also, how to spill out the the logs on disk ?
(through docker command?)
… On Jan 2, 2022, at 21:52, Andrey Vasnetsov ***@***.***> wrote:
I can see from your log, from the service side response was pretty quick:
[2022-01-02T04:29:50Z INFO actix_web::middleware::logger] 100.XXXX:23619 "POST /collections/test/points/search HTTP/1.1" 200 319 "-" "-" 0.000647
0.000647 - is time in seconds
But 10 seconds seems too long even for establishing 1st time connection, are you using some sort of satellite connection?
but, think if the connection is persistent after 1st connection (?(
Default client uses this configuration for keep-alive:
KEEPALIVE_EXPIRY = 5.0 # seconds
Limits(max_connections=100, max_keepalive_connections=20)
But you can alter them by passing **kwargs to QdrantClient.
E.g.
# Disable keep-alive
client = QdrantClient(limits=httpx.Limits(max_connections=None, max_keepalive_connections=0))
—
Reply to this email directly, view it on GitHub, or unsubscribe.
Triage notifications on the go with GitHub Mobile for iOS or Android.
You are receiving this because you were mentioned.
|
Beta Was this translation helpful? Give feedback.
-
Sure.
Was rather thinking of the real time logs of qdrant
(now its jusr printing on screen).
Not sure, maybe some —logs mylog_dirhost/
on qdrant CLI
thx
… On Jan 2, 2022, at 22:41, Andrey Vasnetsov ***@***.***> wrote:
Do you mean something like this?
docker logs %container_id% > log.txt
—
Reply to this email directly, view it on GitHub, or unsubscribe.
Triage notifications on the go with GitHub Mobile for iOS or Android.
You are receiving this because you were mentioned.
|
Beta Was this translation helpful? Give feedback.
-
Using url directly, have those numbers,
Trying to figure how to workadound the network part. |
Beta Was this translation helpful? Give feedback.
-
Hi @generall, is there a method in Qdrant to limit memory usage without encountering out-of-vocabulary (OOV) errors? Additionally, is it possible to keep the most recently used collections in RAM, even when using memmap to store values in each segment? |
Beta Was this translation helpful? Give feedback.
-
Hi,
Few questions :
How to limit Maximum RAM usage for Qdrant ?
(ie storing the Index)
and use instead the ssd disk.
Or some reference of ram storage cost
of 1 million , 768-vectors ?
Is there any reference
For a given size of data:
10 million data points, vect 512
20k/requests ?
Is there a way to download directly binaries
on linux ?
Thanks
Beta Was this translation helpful? Give feedback.
All reactions