    Repositories list

    • mergekit

      Public
      Tools for merging pretrained large language models.
      Python
      GNU Lesser General Public License v3.0
      4484.9k18015 Updated Dec 2, 2024
    • fastmlx

      Public
      FastMLX is a high-performance, production-ready API to host MLX models.
      Python
      Other
      27227161 Updated Nov 29, 2024
    • Developer resources to work with Arcee models on AWS
      Jupyter Notebook
      Apache License 2.0
      1700 Updated Nov 27, 2024
    • Python
      0000 Updated Nov 26, 2024
    • Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
      TypeScript
      Other
      7.9k202 Updated Nov 11, 2024
    • DALM

      Public
      Domain Adapted Language Modeling Toolkit - E2E RAG
      Python
      Apache License 2.0
      4031265 Updated Nov 8, 2024
    • DAM

      Public
      Python
      74111 Updated Nov 6, 2024
    • optillm

      Public
      Optimizing inference proxy for LLMs
      Python
      Apache License 2.0
      131200 Updated Nov 5, 2024
    • Open-WebUI adaptation for Arcee model deployments
      Svelte
      MIT License
      6.1k002 Updated Nov 5, 2024
    • EvolKit

      Public
      EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Models (LLMs).
      Jupyter Notebook
      MIT License
      2218602 Updated Oct 30, 2024
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      1.9k000 Updated Oct 28, 2024
    • Optimizing inference proxy for LLMs
      Python
      Apache License 2.0
      131000 Updated Oct 25, 2024
    • tau-bench

      Public
      Code and Data for Tau-Bench
      Python
      MIT License
      27000 Updated Oct 22, 2024
    • entropix

      Public
      Entropy Based Sampling and Parallel CoT Decoding
      TypeScript
      Apache License 2.0
      313300 Updated Oct 16, 2024
    • The Arcee client for executing domain-adapted language model routines https://pypi.org/project/arcee-py/
      Python
      52672 Updated Oct 8, 2024
    • Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
      Python
      Apache License 2.0
      205001 Updated Sep 23, 2024
    • An Open Source Toolkit For LLM Distillation
      Python
      GNU Affero General Public License v3.0
      3936751 Updated Sep 17, 2024
    • Shell
      1000 Updated Sep 10, 2024
    • chat-ui

      Public
      TypeScript
      Apache License 2.0
      1.1k001 Updated Aug 30, 2024
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      4.8k001 Updated Jul 31, 2024
    • Ongoing research training transformer models at scale
      Python
      Other
      2.4k000 Updated Jul 19, 2024
    • axolotl

      Public
      Go ahead and axolotl questions
      Python
      Apache License 2.0
      887001 Updated Jul 18, 2024
    • The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
      Python
      Apache License 2.0
      105000 Updated Jul 12, 2024
    • Domain-adapted MoE training
      Python
      Other
      2.4k002 Updated Jul 1, 2024
    • A block pruning framework for LLMs.
      Python
      2100 Updated Jun 20, 2024
    • The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
      Python
      Apache License 2.0
      105100 Updated May 24, 2024
    • Python
      0500 Updated May 6, 2024
    • PruneMe

      Public
      Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
      Python
      2619800 Updated Apr 23, 2024
    • Automatically evaluate your LLMs in Google Colab
      Python
      MIT License
      93200 Updated Apr 15, 2024
    • The repository contains all the setup required to run Trainium training jobs.
      Python
      2400 Updated Mar 22, 2024