LangWatch 🏰

Observe, Evaluate & Optimize your LLM performance

LangWatch is an end-to-end evaluation and observability platform, helping teams ship their AI agents reliably and >8x faster!

Get started (free!) | Documentation | LangEvals Documentation

Welcome to LangWatch, the all-in-one open-source LLM Ops platform

LangWatch allows you to track, monitor, guardrail and evaluate your LLM apps to measure quality and alert on issues.

For domain experts, it allows you to easily sift through conversations, see the topics being discussed, and annotate and score messages for improvement in collaboration with the development team.

For developers, it allows you to debug, build datasets, prompt engineer on the playground and run batch evaluations or DSPy experiments to continuously improve the product.

Finally, for the business, it allows you to track conversation metrics, get full user and quality analytics, track costs, build custom dashboards, and even integrate it back into your own platform for reporting to your customers.

LangWatch Optimization Studio

You can sign up and start the integration right away on our free tier by following the guides below:

🎬 Getting Started

Language of choice missing? 📧 Email us, or 💬 join our Discord and let us know!

Getting up and running

  1. Clone the relevant repository
  • git clone https://github.com/langwatch/langwatch.git
  • # or if you use the GitHub CLI
    gh repo clone langwatch/langwatch
  2. Follow the documentation for setup and usage
  3. Contribute by opening issues, submitting pull requests, or discussing ideas.

🔑 Key projects

  • LangWatch: The core platform for LLM Ops, integrating monitoring, analytics, and optimization tools.
  • LangEvals: A unified framework for evaluating language models, aggregating multiple scoring methods and LLM guardrails.
  • Docs: Comprehensive documentation to help users set up and utilize LangWatch tools.

🎸 Demo

📺 Short video (3 min) for a sneak peek of LangWatch and a brief introduction to the concepts.

🤝 Contributing

Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

Please read our Contribution Guidelines for details on our code of conduct, and the process for submitting pull requests.

🛟 Support

If you have questions or need help, join our community:

Popular repositories

  1. langwatch Public

    The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨

    TypeScript · 2.7k stars · 245 forks

  2. better-agents Public

    Standards for building agents, better

    TypeScript · 1.3k stars · 126 forks

  3. scenario Public

    Agentic testing for agentic codebases

    TypeScript · 665 stars · 42 forks

  4. langevals Public

    LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores and LLM guardrails, for you to protect and benchmark your LLM…

    Jupyter Notebook · 68 stars · 9 forks

  5. data-simulator Public

    Synthetic Data Generation

    Jupyter Notebook 8

  6. cookbooks Public

    Example projects that use LangWatch's features.

    Jupyter Notebook 8

Repositories

Showing 10 of 20 repositories
  • better-agents Public

    Standards for building agents, better

    TypeScript · 1,253 stars · MIT · 126 forks · 6 issues · 2 pull requests · Updated Dec 3, 2025
  • docs Public

    Docs for LangWatch LLM Ops Platform

    MDX · 3 stars · 3 forks · 0 issues · 4 pull requests · Updated Dec 3, 2025
  • langwatch Public

    The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨

    TypeScript · 2,656 stars · 245 forks · 58 issues (1 issue needs help) · 48 pull requests · Updated Dec 3, 2025
  • terrafied Public

    A little Slack CLI function to provide a groovy 💅 experience for reviewing infrastructure plans

    TypeScript · 1 star · 0 forks · 0 issues · 0 pull requests · Updated Dec 2, 2025
  • scenario Public

    Agentic testing for agentic codebases

    TypeScript · 665 stars · MIT · 42 forks · 10 issues · 15 pull requests · Updated Dec 2, 2025
  • langevals Public

    LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores and LLM guardrails, for you to protect and benchmark your LLM models and pipelines.

    Jupyter Notebook · 68 stars · MIT · 9 forks · 3 issues (1 issue needs help) · 13 pull requests · Updated Dec 1, 2025
  • langwatch-flagsmith-demo
    TypeScript · 0 stars · 1 fork · 0 issues · 0 pull requests · Updated Nov 25, 2025
  • better-mastra-agent
    TypeScript · 0 stars · 0 forks · 0 issues · 0 pull requests · Updated Nov 18, 2025
  • ksuid Public

    Generate prefixed, k-sorted globally unique identifiers...

    TypeScript · 0 stars · 0 forks · 0 issues · 0 pull requests · Updated Nov 13, 2025
  • vocs Public Forked from wevm/vocs

    Minimal Documentation Framework, powered by React + Vite.

    TypeScript · 0 stars · 90 forks · 0 issues · 0 pull requests · Updated Nov 8, 2025
