UPDF AI

Collection and characterization of spelling errors in scientific and scholarly text

J. J. Pollock,Antonio Zamora

1983 · DOI: 10.1002/asi.4630340108
Journal of the American Society for Information Science · 91 Citations

TLDR

The SPEEDCOP (SPEIIing Error Detection correction Project) project recently completed at Chemical Abstracts Service (CAS) extracted over 50,000 misspellings from approximately 25,000,000 words of text from seven scientific and scholarly databases and showed that the expected incidence of misspelling is 0.2%, that 90–95% of spelling errors have only a single mistake, and that substitution is homogeneous while transposition is heterogeneous.