Testing
How do the out-of-the-box tests developed by ValidMind work?
All the default tests are developed using open-source Python and R libraries.
The ValidMind Library1 test interface is a light wrapper that defines utility functions to agnostically interact with different dataset and record (model) backends, and contains functions to collect and post results to the ValidMind Platform2 using a generic results schema.
When do I use tests and test suites?
While you have the flexibility to decide when to use which ValidMind tests, here are a few typical scenarios:3
- Dataset testing — To document and validate your dataset.
- Model testing — To document and validate your record, such as a model.
- End-to-end testing — To document a binary classification model and the relevant dataset end-to-end.
Can we configure, customize, or add our own tests?
Yes, ValidMind allows tests to be manipulated at several levels:
- You can configure which tests are required to run programmatically depending on the record’s use case.4
- You can change the thresholds and parameters for default tests already available in the library — for instance, changing the threshold parameter for the class imbalance flag.5
- You can also connect your own custom tests with the ValidMind Library. These custom tests are configurable and are able to run programmatically, just like the rest of the library.6
- Personalize tests further for your use case by using ValidMind’s
RawDatafeature7 to customize the output of tests.
How do I log tests as a developer?
You use the ValidMind Library to run and log tests during development, the results of which are then inserted into your documentation within the ValidMind Platform.9 The library also automatically generates draft test descriptions for your test results — generations that can be modified for your custom use cases.10
To log tests as a developer with the ValidMind Library:
How do I log tests as a validator?
You use the ValidMind Library to run and log tests during validation, the results of which are then inserted into your validation report within the ValidMind Platform.13 The library also automatically generates draft test descriptions for your test results — generations that can be modified for your custom use cases.14
To log tests as a validator with the ValidMind Library:
- You must have the Validator role15 or another role with sufficient permissions to access records (models) for validation, to review documentation, and to work with validation reports and artifacts.
- You must be the record validator, but not the record owner or record developer,16 for the record you want to log tests and update documentation for.
Does ValidMind support using synthetic datasets?
- The ValidMind Library supports you bringing your own datasets, including synthetic datasets, for testing and benchmarking purposes such as for fair lending and bias testing.18
- If you are unable to share your real-world data with us, ValidMind is happy to work with you to generate custom synthetic datasets based on characteristics of your data, or provide scripts to assist with synthetic dataset generation if details cannot be shared.
Does ValidMind support monitoring records after deployment?
Yes, ValidMind offers ongoing monitoring support to help you regularly assess a record’s accuracy, stability, and robustness to ensure it remains reliable after deployment:
- You can enable monitoring for both new and existing records.19
- You use the ValidMind Library to automatically populate the monitoring template for your record with data, providing a comprehensive view of your record’s performance over time.
- You then access and examine these results within the ValidMind Platform, allowing you to identify any deviations from expected performance and take corrective actions as needed.20
- Once generated via the ValidMind Library, view and add metrics over time to your ongoing monitoring reports in the ValidMind Platform.21