-
Notifications
You must be signed in to change notification settings - Fork 512
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Enhancement]: Option to set context in request in GPTCache Server #534
Comments
good ideas! |
Agree, because I have also encountered this issue, and now the same content cannot be separated from multiple sessions before But before making any changes, it can be distinguished as follows Each time a cache is added, an identification ID can be added in front of the content, and this identification ID can also be concatenated during queries For example, when adding: {ID} Hello, when querying: {ID} Hello |
I have tried this before. I started caching prompt and response like, I think since it has vector based(semantic matching), it cannot do strict matching, which results in false matches. I could be wrong though. |
we are maintaining our fork and have added multi-tenancy there: NumexaHQ@41aae69 |
What would you like to be added?
I am using GPTCache Server and use
/put
and/get
primarily . In my use case, there are multiple user utilizing this server. I want to add context to every request, it could be anything likeid
orrequest_id
so thatput
andget
adds or looks up according to that.example:
/put
body might look like this:below will return the answer because it is cached with same id
/get
below will not return any answer even if it was cached the id is different
/get
Why is this needed?
My application uses GPTCache server as it is, and it is multitenant. I can have multiple user/organisation/project and they don't want to share cache between them
Anything else?
No response
The text was updated successfully, but these errors were encountered: