Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature Request: Add DeepSeek Coder LLM as a Model Option #637

Closed
jcraftsman opened this issue Jul 6, 2024 · 0 comments · Fixed by #638
Closed

Feature Request: Add DeepSeek Coder LLM as a Model Option #637

jcraftsman opened this issue Jul 6, 2024 · 0 comments · Fixed by #638

Comments

@jcraftsman
Copy link
Contributor

Describe the feature

I would like to request the addition of DeepSeek Coder LLM as an available model option within SWE-agent. This integration would allow users to select DeepSeek Coder for their benchmarking and experimental needs, leveraging its capabilities without promoting it over other models.

Link to DeepSeek Coder: DeepSeek Coder GitHub Repository

Proposed Change:

Model Integration: Add DeepSeek Coder to the list of selectable models in SWE-agent.

Benefits:

Provides users with a cost-effective alternative model for experiments and benchmarking.
Enhances the flexibility and diversity of models available within SWE-agent.

Thank you for considering this feature request.

Potential Solutions

Potential Implementation Steps:

Update the SWE-agent configuration to include DeepSeek Coder as a model option.
In sweagent/agent/models.py:

"deepseek-coder": {
            "max_context": 32_000,
            "cost_per_input_token": 1.4e-07,
            "cost_per_output_token": 2.8e-07,
        },

#...
    elif (
        args.model_name.startswith("gpt")
        or args.model_name.startswith("ft:gpt")
        or args.model_name.startswith("azure:gpt")
        or args.model_name.startswith("deepseek")
    ):
        return OpenAIModel(args, commands)

I tested it as OpenAIModel and it worked.
Here is an example of keys.cfg:

OPENAI_API_BASE_URL: 'https://api.deepseek.com/v1'
OPENAI_API_KEY: 'sk-yourp9iva3k3y'
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant