An open platform for training, serving, and evaluating large language model-based chatbots.
A CLI and library for evaluating, red-teaming, and comparing LLM prompts, agents, and RAG pipelines with simple declarative configs.
An open-source platform for debugging, evaluating, and monitoring LLM applications, RAG systems, and agentic workflows with tracing and automated evaluations.
An open-source platform for debugging, evaluating, and monitoring LLM applications, RAG systems, and agentic workflows.
A framework and open-source registry for evaluating large language models (LLMs) and LLM systems.
An open-source LLMOps platform unifying gateway, observability, evaluation, optimization, and experimentation for industrial-grade LLM applications.
A command-line tool for red-teaming and vulnerability scanning of large language models (LLMs).
An open-source Python framework to evaluate, test, and monitor ML and LLM systems with 100+ built-in metrics.
An open-source LLMOps platform for prompt management, evaluation, and observability to build reliable LLM applications faster.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.