UPDF AI

Leakage and the reproducibility crisis in machine-learning-based science

Sayash Kapoor,Arvind Narayanan

2023 · DOI: 10.1016/j.patter.2023.100804
Patterns · 378회 인용

TLDR

A survey of literature in fields that have adopted ML methods finds 17 fields where leakage has been found, collectively affecting 294 papers and, in some cases, leading to wildly overoptimistic conclusions.