Write Evals for Your AI Extension
Make your AI Extension more reliable by writing evals.
Last updated
Make your AI Extension more reliable by writing evals.
Last updated
We all know that AI is not always reliable. This is why it's important to write evals for your AI Extension. Evals allow you to test your AI Extension and make sure it behaves as expected.
The easiest way to add an eval is to first use your AI Extension. Then, once Raycast AI used your tools to finish it's response, you can use the Copy Eval action to copy the eval to your clipboard.
You can then paste the eval into the evals
array in the package.json
file.
To run your evals, you can use the npx ray evals
command. This will run the evals and print the results to the console. You get an overview of the evals that failed and the ones that passed. From here you can start improving the names and descriptions of your tools.
Visit Learn Core Concepts of AI Extensions to learn more about the different types of evals you can write.