Banana.dev CodeLlama-7B-Instruct-GPTQ starter template

This is a CodeLlama-7B-Instruct-GPTQ starter template from Banana.dev that allows on-demand serverless GPU inference.

You can fork this repository and deploy it on Banana as is, or customize it based on your own needs.

Running this app

Wait for the model to build after creating it.
Make an API request to it using one of the provided snippets in your Banana dashboard.

For more info, check out the Banana.dev docs.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
Dockerfile		Dockerfile
README.md		README.md
app.py		app.py
banana_config.json		banana_config.json
download.py		download.py
requirements.txt		requirements.txt