How we evaluate AI models and LLMs for GitHub Copilot

We share some of the GitHub Copilot team’s experience evaluating AI models, with a focus on our offline evaluations—the tests we run before making any change to our production environment.

The post How we evaluate AI models and LLMs for GitHub Copilot appeared first on The GitHub Blog.

How we evaluate models for GitHub Copilot

We share some of the GitHub Copilot team’s experience evaluating AI models, with a focus on our offline evaluations—the tests we run before making any change to our production environment.

The post How we evaluate models for GitHub Copilot appeared first on The GitHub Blog.

Try out OpenAI o1 in GitHub Copilot and Models

OpenAI o1-preview and o1-mini are now available in GitHub Copilot Chat in VS Code and in the GitHub Models playground.

The post Try out OpenAI o1 in GitHub Copilot and Models appeared first on The GitHub Blog.