Optical Character Recognition for Printed Urdu Nastaliq Font

Thesis Info

Access Option

External Link

Author

Din, Israrud

Program

PhD

Institute

Bahria University

City

Islamabad

Province

Islamabad

Country

Pakistan

Thesis Completing Year

2019

Thesis Completion Status

Completed

Subject

Computer Science

Language

English

Link

http://prr.hec.gov.pk/jspui/bitstream/123456789/14662/1/Israrud-Din%20computer%20engg%202019%20bahria%20isb%20prr.pdf

Added

2021-02-17 19:49:13

Modified

2024-03-24 20:25:49

ARI ID

1676727801984

Optical Character Recognition (OCR) is one of the most investigated pattern classification problems and has received remarkable research attention for more than half a century. From the simplest systems recognizing isolated digits to end-to-end recognition systems, applications of OCR range from postal mail sorting to reading text in scene images to facilitate autonomous navigation or assist the visually impaired. Despite tremendous research endeavors and the availability of commercial recognition engines for many scripts, recognition of cursive scripts remains an open and challenging research problem, mainly due to the complexity of the script, segmentation issues and the large number of classes to recognize. Among these scripts, Urdu is the subject of our study. More specifically, this study investigates the recognition of printed Urdu text in the Nastaliq style, the most widely employed script for Urdu text, which is more complex than the Naskh style of Arabic. This work presents a holistic (segmentation-free) technique that exploits ligatures (partial words) as units of recognition. Urdu has more than 26,000 unique ligatures; many of them, however, share the same main body (primary ligature) and differ only in the number and position of dots and diacritics (secondary ligatures). We exploit this idea to recognize the primary and secondary ligatures separately and later re-associate the two to recognize the complete ligature. Recognition is carried out using two techniques. The first is based on hand-crafted statistical features and hidden Markov models (HMMs). Features extracted using sliding windows are used to train a separate model for each ligature class. The feature sequence of the query ligature is fed to all the models, and the ligature is assigned to the class whose model reports the maximum probability.
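The HMM decision rule described above (one model per ligature class, pick the class whose model scores the query highest) can be illustrated with a minimal sketch. It is not the thesis implementation: it assumes discrete observation symbols standing in for quantized sliding-window features, uses log-likelihoods instead of raw probabilities for numerical stability, and the two toy models and all names (`log_forward`, `classify`, `MODELS`) are hypothetical.

```python
import math

def log_forward(obs, start, trans, emit):
    """Log-likelihood of a non-empty discrete observation sequence under one HMM,
    computed with the scaled forward algorithm."""
    n_states = len(start)
    # alpha[s] is proportional to P(o_1..o_t, state_t = s); rescaled at every step
    alpha = [start[s] * emit[s][obs[0]] for s in range(n_states)]
    log_lik = 0.0
    for t in range(len(obs)):
        if t > 0:
            alpha = [
                sum(alpha[p] * trans[p][s] for p in range(n_states)) * emit[s][obs[t]]
                for s in range(n_states)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence is impossible under this model
        log_lik += math.log(scale)
        alpha = [a / scale for a in alpha]
    return log_lik

def classify(obs, models):
    """Return the label of the model that assigns the highest likelihood to obs."""
    return max(models, key=lambda label: log_forward(obs, *models[label]))

# Toy left-to-right models (assumed values): (start, transition, emission).
# Symbols 0 and 1 stand in for quantized sliding-window feature vectors.
MODELS = {
    "ligature_A": ([1.0, 0.0], [[0.7, 0.3], [0.0, 1.0]], [[0.9, 0.1], [0.8, 0.2]]),
    "ligature_B": ([1.0, 0.0], [[0.7, 0.3], [0.0, 1.0]], [[0.1, 0.9], [0.2, 0.8]]),
}

if __name__ == "__main__":
    print(classify([0, 0, 0, 1], MODELS))
```

In a real system the emission model would more likely be continuous (for example Gaussian mixtures over the window features), and the parameters would be trained per class rather than written by hand.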
The second technique employs Convolutional Neural Networks (CNNs) to automatically extract useful feature representations from the classes and recognize the ligatures. We investigated the performance of a number of pre-trained networks using transfer learning and also trained our own set of networks from scratch. The experimental study of the system is carried out on two benchmark datasets of Urdu text, the ‘Urdu Printed Text Images’ (UPTI) database and the ‘Center of Language Engineering’ (CLE) database. A number of experimental scenarios are considered for system evaluation, and the realized recognition rates are compared with state-of-the-art recognition systems for printed Urdu text. An interesting aspect of the experimental study is the combination of unique ligatures in the two datasets to generate a large set of around 2800 unique primary and secondary ligatures covering a major proportion of the Urdu corpus. The system reports high classification rates (88.10% and 94.78% on CLE and UPTI query ligatures respectively), demonstrating the effectiveness of the proposed recognition techniques, which can be adapted for other cursive scripts as well. The findings of this study are expected to be useful for the document recognition community in general and researchers targeting cursive scripts in particular.
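The idea of recognizing primary and secondary ligatures separately and then re-associating them can be sketched as a table lookup. This is an illustrative assumption rather than the method used in the thesis: the class names (`beh_body`, `dot1`, and so on), the coarse `above`/`below` positions and the functions `reassociate` and `count_classes` are hypothetical, and single letters are used as stand-ins for ligatures because they share bodies in the same way.

```python
# Hypothetical lookup: (primary class, sorted secondary marks) -> complete ligature.
# Each secondary mark is a (mark class, position) pair.
LIGATURE_TABLE = {
    ("beh_body", (("dot1", "below"),)): "ب",
    ("beh_body", (("dot2", "above"),)): "ت",
    ("beh_body", (("dot3", "above"),)): "ث",
    ("jeem_body", (("dot1", "below"),)): "ج",
    ("jeem_body", (("dot1", "above"),)): "خ",
    ("jeem_body", (("dot3", "below"),)): "چ",
    ("jeem_body", ()): "ح",
}

def reassociate(primary, secondaries):
    """Combine a recognized primary class with its recognized secondary marks.
    Returns the complete ligature, or None if the combination is unknown."""
    key = (primary, tuple(sorted(secondaries)))
    return LIGATURE_TABLE.get(key)

def count_classes(table):
    """Return (number of complete ligatures, number of primary + secondary classes)."""
    primaries = {primary for primary, _marks in table}
    secondaries = {mark for _primary, marks in table for mark, _pos in marks}
    return len(table), len(primaries) + len(secondaries)

if __name__ == "__main__":
    print(reassociate("beh_body", [("dot2", "above")]))
    print(count_classes(LIGATURE_TABLE))
```

Even in this toy table, seven complete ligatures reduce to five component classes; the same effect is what lets a vocabulary of more than 26,000 ligatures be covered by a much smaller set of primary and secondary classes.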

Restraint Has Bound the Frenzies

Restraint has bound the frenzies,
that is, it has bound the tears once more.

Who has bound the chains of pain
into all the links of life?

See, it is because of you that in my ghazals
I have bound the rhymes of pain.

The heart is not alight for no reason:
the fireflies of hope are bound within it.

Pain has struck up its instruments again,
and we too have tied on our ankle bells.

You could not understand the matters of the heart;
when did we ever bind philosophies?

With the thread of your tresses
we have bound all our sleepless nights.

Analysis of Students' Conceptual Understanding Ability on the Topic of Straight-Line Equations

Conceptual understanding is very useful in problem solving. The aim of this research is to describe the conceptual understanding ability of grade VIII students of SMP Negeri 1 Toma on the topic of straight-line equations. The research is qualitative with a descriptive approach. The informants were 33 grade VIII students of SMP Negeri 1 Toma. The data analysis techniques were data reduction, data presentation and conclusion drawing. The data collection techniques were tests and unstructured interviews. Based on the results and discussion, mathematical conceptual understanding ability fell in the Very Good (SB) category for 12% of students, Good (B) for 21%, Fair (C) for 27%, Poor (K) for 33% and Very Poor (SK) for 6%. Mathematical conceptual understanding was therefore predominantly in the Poor (K) category, at 33%. The researcher recommends that teachers foster students' mathematical conceptual understanding in learning activities.

Voting Behaviour in Pakistan: A Case Study of Khyber Pakhtunkhwa in 2008 General Elections

The core objective of this study is to provide a detailed analysis of voting behaviour in Khyber Pakhtunkhwa with reference to the 2008 general elections, along with a comparison with the 2002 and 2013 general elections. It focuses on the application of the theories of party identification, issue voting, clientelism, religious voting and ethnic voting in the electoral politics of Khyber Pakhtunkhwa. Regarding the application of these theories, the study argues that party identification theory is applicable to a limited extent (35.36%); issue voting (80.87%) and clientelism (73.01%) are applicable to a great extent; and religious voting (54.07%) and ethnic voting (52.2%) are applicable to some extent in the electoral politics of Khyber Pakhtunkhwa. The scope of the study is confined to Khyber Pakhtunkhwa. Data collection is based on both secondary and primary sources. The secondary data, in the form of books and journals, cover the theoretical frameworks including party identification, issue voting, clientelism, religious voting and ethnic voting. The primary data, in the form of a questionnaire, is the original contribution of this study and explores the extent to which the aforementioned theories of voting behaviour apply in Khyber Pakhtunkhwa. The research is based on quantitative, analytical and comparative approaches. It addresses the main research question: to what extent are the theories of party identification, issue voting, clientelism, religious voting and ethnic voting applicable? The study is based on a number of hypotheses. It has been hypothesized that issue voting and clientelism are relatively more important determinants, while party identification, religious voting and ethnic voting are relatively less important determinants, of voting behaviour in the electoral politics of Khyber Pakhtunkhwa.
The quantitative data answer the research questions and test the hypotheses related to the electoral politics of Khyber Pakhtunkhwa. The general elections of 2002, 2008 and 2013 have a unique significance in the electoral history of Khyber Pakhtunkhwa because each of them introduced a major electoral change. For example, in the 2002 elections, religious parties were victorious with a heavy mandate. In 2008, however, a Pakhtun ethnic party succeeded in winning the majority of the seats, thereby wiping out religious political parties from the political scene. Similarly, in the 2013 elections a new political party emerged in the political arena of Khyber Pakhtunkhwa. All these electoral changes are of great importance and need to be analysed.