Skip to content
@rungalileo

Galileo

Evaluate, observe, and protect your GenAI applications

Pinned Loading

  1. agent-leaderboard agent-leaderboard Public

    Ranking LLMs on agentic tasks

    Jupyter Notebook 217 22

  2. hallucination-index hallucination-index Public

    Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.

    116 9

  3. sdk-examples sdk-examples Public

    Examples on how to get started with the Galileo SDKs for AI Evaluation and Observability (both in Python and Typescript)

    Python 15 10

Repositories

Showing 10 of 56 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics