Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
Another day, another Google AI model. Google has really been pumping out new AI tools lately, having just released Gemini 3 in November. Today, it’s bumping the flagship model to version 3.1. The new ...