Northwestern University
Bias by the Book: Researchers find bias in BookCorpus, an influential NLP dataset
Researchers found serious flaws in an influential language dataset, highlighting the need for better documentation of data used in machine learning.