Language Error Pattern Essay


Abstract—
Error pattern analysis of a language is helpful in language-related technologies such as spell checking and correction, Optical Character Recognition, Machine Translation, and Natural Language Interfaces. It includes analysis of the various error types (insertion, deletion, transposition, substitution, run-on, and split-word errors), positional analysis, word length effects, phonetic errors, first position error analysis, keyboard effects, etc. This paper focuses on the contribution of single/multi-error misspellings in Punjabi typed text. It also discusses previous analysis results about spelling error patterns found in other languages and offers new insights on them. This paper is based on the analysis done on 20000 misspelled words generated …

First Position Errors
The percentage of first position errors in Punjabi is considerable: 13.10% of single-error misspellings and 13.0% of multi-error misspellings are first position errors.
V. Spanish Error Pattern [16]
For Spanish, F. Ramirez Bustamante and E. López Díaz [16] found that the vast majority of errors in their corpus are single-error misspellings (over 89%), while multi-error misspellings account for less than 9%. The insignificant remaining percentage is noise: spaces in multiple locations, extreme multi-error words, and indecipherable strings of characters. The corpus contains 8 million words of edited and unedited text.
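The single/multi-error split used in these studies can be illustrated with an edit distance. The sketch below is not taken from either paper; it assumes that a misspelling counts as single-error when exactly one insertion, deletion, substitution, or adjacent transposition turns it into the intended word, and it measures this with the optimal string alignment distance. The names `osa_distance` and `error_class` are hypothetical, and the sketch compares Unicode code points, so a Gurmukhi vowel sign is counted as a character of its own, which may differ from how the authors counted characters.

```python
def osa_distance(a: str, b: str) -> int:
    """Optimal string alignment distance between a and b.

    Insertion, deletion, substitution and transposition of two
    adjacent characters each cost 1.
    """
    rows, cols = len(a) + 1, len(b) + 1
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i  # delete all of a[:i]
    for j in range(cols):
        d[0][j] = j  # insert all of b[:j]
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # substitution or match
            )
            # Adjacent transposition, e.g. "teh" -> "the"
            if (i > 1 and j > 1
                    and a[i - 1] == b[j - 2]
                    and a[i - 2] == b[j - 1]):
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[rows - 1][cols - 1]


def error_class(misspelling: str, intended: str) -> str:
    """Label a (misspelling, intended word) pair (assumed labeling scheme)."""
    n = osa_distance(misspelling, intended)
    if n == 0:
        return "correct"
    return "single" if n == 1 else "multi"
```

With illustrative English pairs (not data from the paper), `error_class("teh", "the")` gives `"single"` because one transposition repairs the word, while `error_class("acomodate", "accommodate")` gives `"multi"` because two insertions are needed.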
VI. Positional Analysis on Single/Multi-error Misspellings in Punjabi Language
Positional analysis is an important factor in the study of error patterns because it points to the zones where errors are most probable. The positional patterns are almost the same for single-error and multi-error misspellings: the largest share of mistakes occurs at the third position, and the error frequency decreases after the third position.

Figure 2. Position-wise distribution of Single/Multi-error
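A position-wise distribution of this kind can be computed from pairs of misspelled and intended words. The following is a minimal sketch under stated assumptions, not the authors' procedure: it takes the error position to be the 1-based index of the first character at which the two words differ, it records only that first position even for multi-error words, and the function names and sample pairs are hypothetical.

```python
from collections import Counter


def first_error_position(misspelling: str, intended: str) -> int:
    """1-based position of the first character where the words differ."""
    for i, (m, c) in enumerate(zip(misspelling, intended), start=1):
        if m != c:
            return i
    # One word is a prefix of the other: the error lies just past the shorter one.
    return min(len(misspelling), len(intended)) + 1


def position_distribution(pairs):
    """Percentage of misspellings whose first error falls at each position."""
    counts = Counter(
        first_error_position(m, c) for m, c in pairs if m != c
    )
    total = sum(counts.values())
    return {pos: 100.0 * n / total for pos, n in sorted(counts.items())}


# Illustrative English pairs only; not data from the paper.
sample = [("teh", "the"), ("hte", "the"), ("wrod", "word"), ("helo", "hello")]
print(position_distribution(sample))  # {1: 25.0, 2: 50.0, 4: 25.0}
```

The percentage reported for position 1 in such a table corresponds to the first position error rate, so the same routine covers both measurements.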
