Giskard

1 Post

Testing for Large Language Models: Meet Giskard, an automated quality manager for LLMs.

An open source tool automatically tests language and tabular-data models for social biases and other common issues. Giskard is a software framework that evaluates models using a suite of heuristics and tests based on GPT-4.

Giskard

Testing for Large Language Models: Meet Giskard, an automated quality manager for LLMs.

Subscribe to The Batch