Add support for the DeepEval dataset in LLM tests

validmind-library

2.9.5

documentation

enhancement

highlight

Published

October 7, 2025

This update enhances the integration between DeepEval and the ValidMind library by adding support for a new dataset type specific to large language models (LLMs). You can now use various LLM tests from the DeepEval library. We have introduced new row-level metrics that return arrays.

These metrics can be used in the assign_scores interface and stored in memory by the virtual machine (VM) dataset object. This enables you to use them in generalized plots and statistical functions, aiding in the documentation and interpretation of test results.

DeepEval Integration with ValidMind

Intro to Assign Scores