Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
Open this folder in VS Code Click "Reopen in Container" when prompted (or use Command Palette: "Dev Containers: Reopen in Container") Wait for container to build and install dependencies ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results