Abstract
The method described assumes that a word which cannot be found in a dictionary has at most one error, which might be a wrong, missing or extra letter or a single transposition. The unidentified input word is compared to the dictionary again, testing each time to see if the words match—assuming one of these errors occurred. During a test run on garbled text, correct identifications were made for over 95 percent of these error types.
- 1 BLAIR, CHARLES R. A program for correcting spelling errors. Informat. Contr. 3 (March 1960), 60-67.Google ScholarCross Ref
- 2 DAVIDSON, LEON. Retrieval of misspelled names in an airline passenger record system. Comm. ACM 5 (March 1962), 169-171. Google ScholarDigital Library
- 3 GLANTZ, HERBERT T. On the recognition of information with a digital computer. J. ACM 4 (April 1957), 178-188. Google ScholarDigital Library
Index Terms
- A technique for computer detection and correction of spelling errors
Recommendations
Context-aware correction of spelling errors in Hungarian medical documents
HighlightsWe propose two methods to automatically correct Hungarian clinical text.Method 1 generates a ranked list of correction candidates disregarding context.Method 2 uses an SMT decoder to implement context-aware error correction.Method 1 is ...
Context-aware correction of spelling errors in hungarian medical documents
SLSP'13: Proceedings of the First international conference on Statistical Language and Speech ProcessingIn our paper, we present a method for automated correction of spelling errors in Hungarian clinical records. We model the problem of spelling correction as a translation task, where the source language is the erroneous text and the target language is ...
Multilingual text induced spelling correction
MLR '04: Proceedings of the Workshop on Multilingual Linguistic RessourcesWe present TISC, a multilingual, language-independent and context-sensitive spelling checking and correction system designed to facilitate the automatic removal of non-word spelling errors in large corpora. Its lexicon is derived from raw text corpora, ...
Comments