Challenges and Line Segmentation in Sindhi OCR
Shanky Goel Department of Computer Science Punjabi University, Patiala, India
Dr. Gurpreet Singh Lehal Department of Computer Science Punjabi University, Patiala, India
ABSTRACT: Arabic-script-based OCR systems lag far behind Latin-script OCR systems in accuracy. OCRs developed for many
Therefore, an OCR built for Arabic or Urdu cannot meet all the needs of Sindhi (Fig.1). Shaikh et al. [1] presented only character segmentation of Sindhi sub-words, by calculating the height profile vector of thinned primary strokes. Nizamani and Janjua [3] proposed a recognition system for isolated Sindhi characters written in a drawing panel using a specific font, “MB Lateefi”. Hakro et al. [5] presented the recognition issues in Sindhi OCR. A Sindhi OCR application that can convert printed Sindhi books into editable computer text files is therefore needed. It would help to strengthen the language and extend its life. At the same time, it would also increase the richness of the literature of Sindhi …
Challenges in Sindhi OCR
The Sindhi script poses additional challenges because of its complexity. Cursiveness and context sensitivity are the two major problems in developing any Arabic-script-based OCR. Sindhi adds a large character set on top of these, which makes building a recognition system more challenging still. The main challenges are:
Writing system:
Sindhi words are written from right to left, while numerals are written from left to right, so the script is bidirectional. This poses a challenge for Sindhi OCR because, at recognition time, if a number occurs between characters, the direction in which the output is written must be reversed [5]. In the example shown below (Fig.4), the sentence flows from right to left, while the date embedded in it (٦٢/٠١/٦١٠٦), set in Sindhi numerals, is written from left to right.
Fig.4: Bi-directional writing
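The direction switch described above can be illustrated with a minimal Python sketch. It is not the method of [5]; it only assumes that the recognizer emits characters in logical (reading) order and uses the bidirectional class that the Unicode standard assigns to each character. The helper names `direction` and `runs` and the sample string are hypothetical.

```python
import unicodedata
from itertools import groupby


def direction(ch: str) -> str:
    """Coarse direction label derived from the Unicode bidi class (illustrative helper)."""
    cls = unicodedata.bidirectional(ch)
    if cls in ("AL", "R"):
        return "rtl"      # Arabic-script letters, laid out right to left
    if cls in ("AN", "EN"):
        return "number"   # digits, laid out left to right
    return "neutral"      # spaces, separators, punctuation


def runs(text: str):
    """Group consecutive characters that share the same direction label."""
    return [(label, "".join(chars)) for label, chars in groupby(text, key=direction)]


# Hypothetical sample: the word "Sindh" followed by a two-digit number,
# written with escapes so the logical order is unambiguous.
sample = "\u0633\u0646\u068C \u0662\u0666"
for label, chunk in runs(sample):
    print(label, [f"U+{ord(c):04X}" for c in chunk])
```

An output stage could then write each "number" run left to right inside the surrounding right-to-left text. A production system would more likely follow the full Unicode Bidirectional Algorithm than this coarse three-way split.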
Segmentation challenge:
Segmentation is the most challenging step in Arabic-script-based OCR systems. Segmenting a text document into lines, words and characters is considered the crucial stage of Optical Character Recognition, because the output of the segmentation phase affects the overall recognition rate of the system. Segmentation is a big challenge in Sindhi OCR because of the cursive nature of the script. Arabic text segmentation methods can be classified into two approaches: the analytical approach and the holistic, or segmentation-free, approach.
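As an illustration of the first level of segmentation (splitting a page into lines), the following Python sketch uses a horizontal projection profile, a common baseline technique. This excerpt does not state which line segmentation method the authors use, so the technique, the function name `segment_lines`, the `min_ink` threshold and the toy image are all assumptions made for illustration.

```python
from typing import List, Tuple


def segment_lines(image: List[List[int]], min_ink: int = 1) -> List[Tuple[int, int]]:
    """Return (top, bottom) row ranges of text lines, bottom exclusive.

    `image` is a binarized page: 1 marks an ink pixel, 0 marks background.
    A row belongs to a text line when it holds at least `min_ink` ink pixels.
    """
    profile = [sum(row) for row in image]  # ink count per row
    lines = []
    start = None
    for y, ink in enumerate(profile):
        if ink >= min_ink and start is None:
            start = y                      # first inked row of a new line
        elif ink < min_ink and start is not None:
            lines.append((start, y))       # blank row closes the current line
            start = None
    if start is not None:                  # line that touches the bottom edge
        lines.append((start, len(profile)))
    return lines


# Toy 6x4 page with two text bands separated by a blank row.
page = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 1, 0],
    [0, 0, 0, 0],
]
bands = segment_lines(page)
print(bands)
```

A plain profile like this only works when a blank row separates neighbouring lines. In Arabic-script text, dots and diacritics above or below the main strokes may form thin bands of their own or bridge the gap between lines, so a practical system would likely need extra rules to merge or split bands.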