• Documentation
    • About ​ValidMind
    • Get Started
    • Guides
    • Support
    • Releases

    • ValidMind Library
    • Python API
    • Public REST API

    • Training Courses
  • Log In
  1. ValidMind Library
  • ValidMind Library
  • Supported records and frameworks

  • Quickstart
  • Quickstart for documentation
  • Quickstart for validation
  • Install and initialize ValidMind
    • Install and initialize the library
    • Install and initialize the library for R
    • Use an HTTP proxy with the library
  • Store credentials in .env files

  • End-to-End Tutorials
  • Development
    • 1 — Set up ValidMind Library
    • 2 — Start the development process
    • 3 — Integrate custom tests
    • 4 — Finalize testing & documentation
  • Validation
    • 1 — Set up ValidMind Library for validation
    • 2 — Start the validation process
    • 3 — Developing a challenger
    • 4 — Finalize validation & reporting

  • How-To
  • Run tests & test suites
    • Explore tests
      • Explore tests
      • Explore test suites
    • Run tests
      • Run dataset-based tests
      • Run comparison tests
      • Configuring tests
        • Configure judge LLM and judge embeddings
        • Customize test result descriptions
        • Enable PII detection in tests
        • Dataset Column Filters when Running Tests
        • Run tests with multiple datasets
        • Understand and utilize RawData in ValidMind tests
      • Using tests in documentation
        • Document multiple results for the same test
        • Run individual documentation sections
        • Run documentation tests with custom configurations
    • Custom tests
      • Implement custom tests
      • Integrate external test providers
  • Use library features
    • Data and datasets
      • Introduction to ValidMind Dataset and Model Objects
      • Dataset inputs
        • Configure dataset features
        • Load dataset predictions
    • Metrics
      • Log metrics over time
      • Intro to Unit Metrics
    • Qualitative text
      • Generate qualitative text with the ValidMind library
    • Scoring
      • Intro to Assign Scores

  • Notebooks
  • Code samples
    • Agents
      • Document an agentic AI system
    • Capital markets
      • Quickstart for knockout option pricing model documentation
      • Quickstart for Heston option pricing model using QuantLib
    • Code explainer
      • Quickstart for model code documentation
    • Credit risk
      • Document an application scorecard model
      • Document an application scorecard model
      • Document a credit risk model
      • Document an application scorecard model
      • Document an Excel-based application scorecard model
    • NLP and LLM
      • Sentiment analysis of financial data using a large language model (LLM)
      • Summarization of financial data using a large language model (LLM)
      • Sentiment analysis of financial data using Hugging Face NLP models
      • Summarization of financial data using Hugging Face NLP models
      • Automate news summarization using LLMs
      • Prompt validation for large language models (LLMs)
      • RAG Model Benchmarking Demo
      • RAG Model Documentation Demo
    • Ongoing monitoring
      • Ongoing Monitoring for Application Scorecard
      • Quickstart for ongoing monitoring of models with ValidMind
    • Regression
      • Document a California Housing Price Prediction regression model
    • Time series
      • Document a time series forecasting model
      • Document a time series forecasting model
    • Validation
      • Validate an application scorecard model

  • Reference
  • ​ValidMind test sandbox
  • ValidMind Library Python API
  • ValidMind Public REST API

On this page

  • What is the ValidMind Library?
  • Quickstart
  • End-to-end tutorials
  • Learn how to use the ValidMind Library
  • Try code samples by use case
  • Work with documentation
  • ValidMind Library Python API reference
  • Edit this page
  • Report an issue

ValidMind Library

Published

May 26, 2026

The ValidMind Library streamlines development and validation by automating testing. Run tests, log those test results to the ValidMind Platform, and have fully supported drafts of documentation or reporting ready for you to fine-tune.

What is the ValidMind Library?

The ValidMind Library provides a rich collection of documentation tools and test suites, from documenting descriptions of your datasets to validation testing your records (such as models) for weak spots and overfit areas.

​ValidMind offers two primary methods for automating documentation:

  • Generate documentation1 — Through automation, the library extracts metadata from associated datasets and records for you and generates documentation based on a template. You can also add more documentation and tests manually using the documentation editing capabilities in the ValidMind Platform.

  • Run validation tests2 — The library provides a suite of validation tests for common financial services use cases. For cases where these tests do not cover everything you need, you can also extend existing test suites with your own proprietary tests or testing providers.

1 ​ValidMind for development

2 ​ValidMind for validation

The ValidMind Library is designed to be agnostic. For example, if you have a model built with Python, the library provides all the standard functionality you may need without requiring you to rewrite any functions.

ImportantKey ​ValidMind concepts
record
A tool tracked in the ValidMind Platform inventory, such as a model. Records include traditional statistical models, legacy systems, artificial intelligence/machine learning models, large language models (LLMs), agentic AI systems, and other documentable items that benefit from oversight, testing, and lifecycle management.
model
SR 26-2 (which supersedes SR 11-7) defines a model as a "complex quantitative method, system, or approach that applies statistical, economic, or financial theories to process input data into quantitative estimates." Simple arithmetic, deterministic rule-based processes, or software without statistical, economic, or financial theories underpinning their design or use are generally outside SR 26-2’s definition of a model.

Within ​ValidMind, a model is a type of record tracked in the inventory.

documentation, model documentation
A structured and detailed record pertaining to a record (such as a model), encompassing key components such as its underlying assumptions, methodologies, data sources, inputs, performance metrics, evaluations, limitations, and intended uses.
validation report
A formal document produced after a model validation process, outlining the artifacts, assessments, and recommendations related to a specific model's performance, appropriateness, and limitations. Provides a comprehensive review of the model's conceptual framework, data sources and integrity, calibration methods, and performance outcomes.
ongoing monitoring, ongoing monitoring report, ongoing monitoring plan, monitoring plan
A comprehensive and structured periodic report assessing the record's performance and compliance over time, ensuring it remains valid under changing conditions. Monitoring includes key elements such as data sources, inputs, performance metrics, and periodic evaluations, ensuring transparency and visibility of the record's performance in the production environment.
document template
Lays out the structure of documents, segmented into various sections and sub-sections, and functions as a test suite to help automate your development, validation, monitoring, and other risk management processes. Document templates are available for default ​ValidMind document types as well as custom document types.
documentation template
A default ​ValidMind document type that serves as a standardized framework for developing and documenting records (such as models), including sections designated for record details, data descriptions, test results, and performance metrics. By outlining required documentation and recommended analyses, documentation templates ensure consistency and completeness across documentation and help guide developers through a systematic development process while promoting comparability and traceability of development outcomes.
validation report template
A default ​ValidMind document type that serves as a standardized framework for conducting and documenting validation, including sections designated for attaching test results, evidence, or artifacts (findings). By outlining required documentation, recommended analyses, and expected validation tests, validation report templates ensure consistency and completeness across validation reports and help guide validators through a systematic review process while promoting comparability and traceability of validation outcomes.
monitoring template, monitoring report template
A default ​ValidMind document type that serves as a standardized framework for ongoing monitoring, including sections designated for test results, performance metrics, and drift analyses. By outlining required monitoring checks and expected routine tests, monitoring templates ensure consistency and completeness across monitoring reports and help guide owners through a systematic monitoring process while promoting early detection of performance degradation.
test
A function contained in the library, designed to run a specific quantitative test on the dataset or record (such as a model). Test results are logged to the ValidMind Platform, where they are attached to documents.

Tests are the building blocks of ​ValidMind, used to evaluate and document records and datasets, and can be run individually or as part of a suite defined by your templates.

test suite
A collection of tests designed to run together to automate and generate documentation end-to-end for specific use cases. (Learn more: test_suites)
metrics, custom metrics
Metrics are a subset of tests that do not have thresholds. Custom metrics are functions that you define to evaluate your record (such as a model) or dataset. These functions can be registered via the ValidMind Library to be used with the ValidMind Platform.

In the context of ​ValidMind's Jupyter Notebooks, metrics and tests can be thought of as interchangeable concepts.

inputs
Objects to be evaluated and documented in the ValidMind Library. They can be any of the following:
  • model: A single record (such as a model) that has been initialized in ​ValidMind. Despite the naming convention, model objects can be any type of record you want to test, document, validate, or monitor with ​ValidMind. Refer to the vm.init_model() function for more information.
  • dataset: A single dataset that has been initialized in ​ValidMind. Refer to the vm.init_dataset() function for more information.
  • models: A list of ​ValidMind records — usually this is used when you want to compare multiple records in your custom tests.
  • datasets: A list of ​ValidMind datasets — usually this is used when you want to compare multiple datasets in your custom tests. (Learn more: Run tests with multiple datasets)
parameters
Additional arguments that can be passed when running a ​ValidMind test, used to pass additional information to a test, customize its behavior, or provide additional context.
outputs
Custom tests can return elements like tables or plots. Tables may be a list of dictionaries (each representing a row) or a pandas DataFrame. Plots may be matplotlib or plotly figures.

Quickstart

After you sign up for ​ValidMind to get access, try our Jupyter Notebook quickstarts for documentation or validation:

Quickstart for documentation
Learn the basics of using ValidMind to document records as part of a development workflow. Set up the ValidMind Library in your environment, and generate a draft of documentation using ValidMind tests for a binary classification model.
Quickstart for validation
Learn the basics of using ValidMind to validate records as part of a validation workflow. Set up the ValidMind Library in your environment, and generate a draft of a validation report using ValidMind tests for a binary classification model.
No matching items

End-to-end tutorials

Learn how to use the ValidMind Library with our high-level Jupyter Notebook courses covering usage of ValidMind for specific roles or concepts.

​ValidMind for development

Learn how to use ValidMind for your end-to-end documentation process based on common development scenarios with our ValidMind for development series of four introductory notebooks:

1 — Set up the ValidMind Library

Get to know ​ValidMind by setting up the ValidMind Library in your own environment, and registering a sample binary classification model in the ValidMind Platform for use with this series of notebooks.

2 — Start the development process

Learn to run and log tests with a variety of methods and in different situations with the ValidMind Library, then add the results or evidence to your documentation for the sample model you registered.

3 — Integrate custom tests

After you become familiar with the basics of the ValidMind Library, learn how to supplement ValidMind tests with your own and include them as additional evidence in your documentation.

4 — Finalize testing and documentation

Wrap up by learning how to ensure that custom tests are included in your documentation template. By the end of this series, you will have a fully documented sample model ready for review.

No matching items

​ValidMind for validation

Learn how to use ValidMind for your end-to-end validation process based on common scenarios with our ValidMind for validation series of four introductory notebooks:

1 — Set up the ValidMind Library for validation

Get to know ​ValidMind by setting up the ValidMind Library in your own environment, and gaining access as a validator to a sample model in the ValidMind Platform for use with this series of notebooks.

2 — Start the validation process

Independently verify the data quality tests performed on datasets used to train the dummy champion using tests from the ValidMind Library, then add the results or evidence to your validation report.

3 — Developing a potential challenger

After you become familiar with the basics of the ValidMind Library, use it to develop a potential challenger and run thorough comparison tests, such as performance, diagnostic, and feature importance tests.

4 — Finalize validation and reporting

Wrap up by learning how to include custom tests and verifying that all tests conducted during development were run and reported accurately. By the end of this series, you will have a validation report complete with artifacts ready for review.

No matching items

Learn how to use the ValidMind Library

Learn how to use the comprehensive out-of-the-box tests and test suites, and other features in the ValidMind Library that make it easy for you to automate building, documenting, validating your records and more.

How to run tests and test suites
​ValidMind provides many built-in tests and test suites, which help you produce documentation during stages of your risk management lifecycle, where you need to ensure that your work satisfies regulatory and risk management requirements.
How to use ValidMind Library features
Browse our range of Jupyter Notebooks demonstrating how to use the core features of the ValidMind Library. Use these how-to notebooks to get familiar with the library's capabilities and apply them to your own use cases.
No matching items

Try code samples by use case

Try our Jupyter Notebook code samples that showcase the capabilities of the ValidMind Library and cover a variety of sample use cases.

Code samples by use case

Examples that you can build on and adapt for your own usage include:

Document an agentic AI system
Build and document an agentic AI system with the ValidMind Library. Construct a LangGraph-based banking agent, assign AI evaluation metric scores to your agent, and run accuracy, RAGAS, and safety tests, then log those test results to the ValidMind Platform.
Document an Excel-based application scorecard model
Build and document an Excel-based application scorecard model with the ValidMind Library. Learn how to load an Excel-based model, prepare your datasets and model for testing, run tests and log those test results to the ValidMind Platform.
Validate an application scorecard model
Learn how to independently assess an application scorecard model developed using the ValidMind Library as a validator. You'll evaluate the development of the model by conducting thorough testing and analysis, including the use of challenger models to benchmark performance.
No matching items

Work with documentation

After you have tried out the ValidMind Library, continue working with your documentation in the ValidMind Platform:3

3 Working with documentation

Work with test results
Once generated via the ValidMind Library, view and add the test results to your documentation in the ValidMind Platform.
Work with content blocks
Make edits to your documents by adding or removing content blocks directly in the online editor.
No matching items

ValidMind Library Python API reference

ValidMind Library API Reference

Supported records and frameworks
  • ValidMind Logo
    ©
    Copyright 2026 ValidMind Inc.
    All Rights Reserved.
    Cookie preferences
    Legal
  • Get started
    • Development
    • Validation
    • Setup & admin
  • Guides
    • Access
    • Configuration
    • Integrations
    • Workflows
    • Inventory
    • Documents & templates
    • Documentation
    • Validation
    • Reporting
    • Monitoring
    • Attestation
  • ValidMind Library
    • Quickstarts
    • Development tutorial
    • Validation tutorial
    • Run tests & test suites
    • Use library features
    • Code samples
    • Python API
    • Public REST API
  • Training
    • Learning paths
    • Courses
    • Videos
  • Support
    • Troubleshooting
    • FAQ
    • Get help
  • Edit this page
  • Report an issue
  • Community
    • GitHub
    • LinkedIn
    • Events
    • Blog