👋 Welcome to Athina AI

Athina is building monitoring and evaluation tools for LLM developers.

Sign Up | Website | Contact

  • Evals SDK: Open-source framework for evaluating LLMs (Python + CLI)
  • Platform: Monitor your production inferences, and automatically run evals

Open-Source SDK for Evals

athina-ai/athina-evals

Documentation | Quick Start | Running Evals

We have a library of preset evaluators, but you can also write custom evaluators within the Athina framework.

Example Preset Evals:

  • Context Contains Enough Information: Detect bad or insufficient retrievals.
  • Does Response Answer Query: Detect incomplete or irrelevant responses.
  • Response Faithfulness: Detect responses that deviate from the provided context.
  • Summarization Accuracy: Detect hallucinations and mistakes in summaries.
  • Grading Criteria: If X, then fail. Otherwise pass.
  • Custom Evals: Custom prompt for LLM-powered evaluation.
  • RAGAS: A set of evaluators that return RAGAS metrics.
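The "If X, then fail. Otherwise pass." pattern behind the Grading Criteria eval can be sketched in plain Python. This is a minimal illustration and not the athina-evals API: `EvalResult`, `grading_criteria_eval`, and `keyword_judge` are hypothetical names, and the keyword check stands in for the LLM call that would normally decide whether the condition holds.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class EvalResult:
    passed: bool
    reason: str


def grading_criteria_eval(
    response: str,
    fail_condition: str,
    judge: Callable[[str, str], bool],
) -> EvalResult:
    """Fail when the judge decides `fail_condition` holds for `response`."""
    if judge(response, fail_condition):
        return EvalResult(False, f"Failed: {fail_condition}")
    return EvalResult(True, "Passed: fail condition not met")


def keyword_judge(response: str, fail_condition: str) -> bool:
    # Hypothetical stand-in for an LLM judge: treats the condition
    # as a phrase and checks whether it occurs in the response.
    return fail_condition.lower() in response.lower()


result = grading_criteria_eval(
    response="I'm sorry, I cannot help with that.",
    fail_condition="I cannot help",
    judge=keyword_judge,
)
print(result)
```

Passing the judge in as a callable keeps the pass/fail logic separate from how the condition is checked, so the keyword stub could be swapped for a model-backed judge without changing the evaluator.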

Results can also be viewed and tracked on our platform.

Monitoring & Evaluations Platform for LLM Inferences

Documentation | Demo Video | Sign Up

  • UI for monitoring and visibility into your LLM inferences.
  • Run evals automatically against logged inferences in production.
  • Track cost, token usage, response times, feedback, pass rate and other eval metrics.
  • Analytics segmented by Customer ID, Model, Prompt, Environment, and more.
  • Topic Classification
  • Data Exports
  • ... and more
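As a rough illustration of segmented metrics such as pass rate and token usage, the sketch below aggregates a few logged inferences by model. The record fields (`model`, `passed`, `tokens`) and the `summarize_by` helper are assumptions made for this example; they do not reflect the platform's actual log schema or API.

```python
from collections import defaultdict

# Hypothetical logged inferences; field names are illustrative only.
logs = [
    {"model": "model-a", "passed": True, "tokens": 120},
    {"model": "model-a", "passed": False, "tokens": 300},
    {"model": "model-b", "passed": True, "tokens": 80},
]


def summarize_by(records, key):
    """Group records by `key` and compute pass rate and average tokens."""
    totals = defaultdict(lambda: {"count": 0, "passed": 0, "tokens": 0})
    for record in records:
        segment = totals[record[key]]
        segment["count"] += 1
        segment["passed"] += int(record["passed"])
        segment["tokens"] += record["tokens"]
    return {
        name: {
            "pass_rate": segment["passed"] / segment["count"],
            "avg_tokens": segment["tokens"] / segment["count"],
        }
        for name, segment in totals.items()
    }


summary = summarize_by(logs, "model")
print(summary)
```

The same helper could segment by any other field present in the records, for example a customer ID or environment name.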

Contact [email protected] if you have any questions.

Pinned

  1. athina-evals Public

    Python SDK for running evaluations on LLM-generated responses

    Python 242 14

Repositories

Showing 10 of 16 repositories
  • athina-evals Public

    Python SDK for running evaluations on LLM-generated responses

    Python 242 14 1 5 Updated Dec 23, 2024
  • athina-client Public

    A lightweight version of the Athina SDK

    Python 0 0 0 0 Updated Dec 21, 2024
  • athina-deploy Public
    Shell 4 0 0 0 Updated Dec 17, 2024
  • rag-cookbooks Public

    This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.

    Jupyter Notebook 901 MIT 51 0 1 Updated Dec 12, 2024
  • athina-logger-ts
    TypeScript 0 0 0 0 Updated Dec 2, 2024
  • athina-logger Public

    SDK to log LLM inference calls to Athina

    Python 3 2 0 1 Updated Oct 30, 2024
  • athina-docs Public
    MDX 1 MIT 0 0 2 Updated Oct 29, 2024
  • ai-research-papers Public

    Summaries of AI Research Papers

    10 2 0 0 Updated Jun 29, 2024
  • athina-evals-ci
    Python 2 0 0 0 Updated Feb 23, 2024
  • ragas Public Forked from explodinggradients/ragas

    Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

    Python 0 Apache-2.0 780 0 0 Updated Feb 5, 2024