Giskard
Testing for Large Language Models: Meet Giskard, an automated quality manager for LLMs.
An open-source tool automatically tests language and tabular-data models for social biases and other common issues. Giskard is a software framework that evaluates models using a suite of heuristics along with tests that rely on GPT-4.
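To make the idea of a heuristic bias test concrete, here is a minimal sketch of one common kind of check: swap gendered words in an input and flag cases where the model's prediction changes. This is an illustration only, not Giskard's API or its actual heuristics. The names `SWAPS`, `swap_terms`, `invariance_failures`, and `toy_model` are hypothetical, and the toy model is deliberately biased so the check has something to catch.

```python
from typing import Callable, Dict, List

# Hypothetical word swaps used to perturb an input (an assumption for this sketch).
SWAPS: Dict[str, str] = {
    "he": "she", "she": "he",
    "his": "her", "her": "his",
    "man": "woman", "woman": "man",
}


def swap_terms(text: str) -> str:
    """Replace each whitespace-separated word that appears in SWAPS."""
    return " ".join(SWAPS.get(word.lower(), word) for word in text.split())


def invariance_failures(predict: Callable[[str], str], texts: List[str]) -> List[str]:
    """Return the inputs whose prediction changes after the swap."""
    return [t for t in texts if predict(t) != predict(swap_terms(t))]


def toy_model(text: str) -> str:
    """Stand-in classifier, intentionally biased on the word 'he'."""
    return "engineer" if "he" in text.lower().split() else "nurse"


if __name__ == "__main__":
    samples = ["he fixed the server", "the server was fixed"]
    print(invariance_failures(toy_model, samples))
```

Any callable that maps a string to a label can stand in for `toy_model`, so the same check could wrap a real classifier. A production tool would presumably use far richer perturbations and report results per issue type.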