Add support for the DeepEval dataset in LLM tests

validmind-library
2.9.5
documentation
enhancement
highlight
Published

October 7, 2025

This update enhances the integration between DeepEval and the ValidMind library by adding support for a new dataset type specific to large language models (LLMs). You can now use various LLM tests from the DeepEval library. We have introduced new row-level metrics that return arrays.

These metrics can be used in the assign_scores interface and stored in memory by the virtual machine (VM) dataset object. This enables you to use them in generalized plots and statistical functions, aiding in the documentation and interpretation of test results.