Skew Detection Analysis

ABSTRACT
In this paper, a skew detection technique for printed and handwritten Devanagari scanned documents is proposed. It involves four processing steps: preprocessing, segmentation, skew detection and correction, and enhancement, which is used to identify the writer. Segmentation is used to extract text lines and words from handwritten and printed documents. Skew angle detection and correction is done in two steps: the first is dimension reduction and the second is skew estimation. After the skew is corrected, the document image is enhanced so that a clean image is available for future work. The algorithms for skew angle detection and correction with enhancement were tested on the benchmarking datasets of the handwriting segmentation contest and outperformed …

The skew of a scanned document image is the deviation of its text lines from the horizontal or vertical axis. Skew detection techniques involve four processing steps: preprocessing, segmentation, skew detection and correction, and enhancement. Skew refers to text that is neither parallel nor at right angles to a specified or implied line. Because character recognition is very sensitive to page skew, skew detection and correction in document images are critical steps before layout analysis. Skew detection is used for text line position determination in digitized documents, automated page orientation, skew angle detection for binary document images, skew detection in handwritten scripts, compensation for Internet audio applications, and the correction of scanned documents. In preprocessing, the image data is modified as needed. Preprocessing is the preliminary step; it transforms the data into a format that can be processed more easily and effectively. The main task in preprocessing is to decrease the variation in the captured data that reduces the recognition rate and increases complexity. The preprocessing stage is used to normalize and remove variations in the handwritten and printed text …
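To make the notion of a skew angle concrete, the following is a minimal sketch of one common estimation approach, a projection-profile search. It is not necessarily the method proposed in this paper. The function names, the angle range, the step size and the synthetic test page are all assumptions made for illustration. The idea is that when foreground pixels are projected along the true skew direction, they pile up in a few rows, so the sum of squared row counts is largest at the correct angle.

```python
import math

def projection_score(pixels, angle_deg):
    """Sum of squared row counts after projecting pixels along angle_deg."""
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    counts = {}
    for x, y in pixels:
        # Row the pixel would fall in if the page were rotated back by angle_deg.
        row = round(y * cos_t - x * sin_t)
        counts[row] = counts.get(row, 0) + 1
    return sum(c * c for c in counts.values())

def estimate_skew(image, max_angle=15.0, step=0.5):
    """Estimate the skew angle (degrees) of a binary image (rows of 0/1).

    Convention (an assumption of this sketch): a positive angle means text
    lines run downward to the right in image coordinates (y grows downward).
    """
    pixels = [(x, y) for y, row in enumerate(image)
              for x, value in enumerate(row) if value]
    best_angle, best_score = 0.0, -1
    for i in range(int(round(2 * max_angle / step)) + 1):
        angle = -max_angle + i * step
        score = projection_score(pixels, angle)
        if score > best_score:
            best_angle, best_score = angle, score
    return best_angle

def make_skewed_page(angle_deg, width=200, height=120, baselines=(20, 50, 80)):
    """Hypothetical helper: draw one-pixel-thick 'text lines' at a given skew."""
    image = [[0] * width for _ in range(height)]
    slope = math.tan(math.radians(angle_deg))
    for y0 in baselines:
        for x in range(width):
            y = round(y0 + x * slope)
            if 0 <= y < height:
                image[y][x] = 1
    return image

estimated = estimate_skew(make_skewed_page(5.0))
```

The exhaustive search over candidate angles is simple but slow on full-size scans, which is one reason a dimension reduction step before skew estimation, as the abstract describes, can be useful.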

Line extraction techniques may be categorized as projection-based, grouping, smearing and Hough-based [10]. The proposed algorithms that address the above-mentioned processing stages come mainly from the fields of image processing, computer vision, machine learning and pattern recognition. Some of these algorithms are very effective in processing machine-printed document images and have therefore been incorporated into the workflows of well-known OCR systems. Text line segmentation is a critical stage in layout analysis, upon which further tasks such as word segmentation, grouping of text lines into paragraphs, and characterization of text lines as titles, headings, footnotes, etc. may be developed. Segmentation is the process of partitioning a digital text image into multiple segments. Its goal is to simplify or change the representation of an image into something more meaningful and easier to analyze. Text segmentation is divided into three parts: line segmentation, word segmentation and character segmentation.
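As an illustration of the projection-based category mentioned above, the sketch below splits a deskewed binary image into text lines using its horizontal projection profile. It assumes the image is a list of rows of 0/1 values and that lines are separated by rows with no foreground pixels. The function name and the `min_pixels` threshold are hypothetical and not taken from the paper.

```python
def segment_lines(image, min_pixels=1):
    """Return inclusive (top, bottom) row ranges of text lines in a binary image."""
    # Horizontal projection profile: number of foreground pixels in each row.
    profile = [sum(row) for row in image]
    lines = []
    start = None
    for y, count in enumerate(profile):
        if count >= min_pixels and start is None:
            start = y                      # a text line begins
        elif count < min_pixels and start is not None:
            lines.append((start, y - 1))   # the text line ended on the previous row
            start = None
    if start is not None:                  # a line touching the bottom edge
        lines.append((start, len(profile) - 1))
    return lines

page = [
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
    [1, 0, 1, 1],
]
print(segment_lines(page))  # [(1, 2), (4, 4)]
```

This simple profile works best on printed text. Handwritten lines that touch or overlap, or pages with residual skew, generally need the grouping, smearing or Hough-based techniques instead.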
3.3.1 Line Segmentation
Line segmentation separates the individual lines from the text document. A text line segmentation algorithm first detects the probable text lines and then segments them in their actual order. Text line segmentation proposals commonly make two assumptions: Firstly,
