Extraction of Caption and Animated Text from Videos/Images

Thesis Info

Access Option

External Link

Author

Tehsin, Samabia

Program

PhD

Institute

National University of Sciences & Technology

City

Islamabad

Province

Islamabad

Country

Pakistan

Thesis Completing Year

2014

Thesis Completion Status

Completed

Subject

Engineering

Language

English

Link

http://prr.hec.gov.pk/jspui/bitstream/123456789/2487/1/3081S.pdf

Added

2021-02-17 19:49:13

Modified

2024-03-24 20:25:49

ARI ID

1676727756802

Similar

Textual information embedded in multimedia can provide a vital tool for indexing and retrieval. Text extraction process has a lot of inherent problems due to the variation in font sizes, color, backgrounds and resolution. Text detection, localization and tracking are the most challenging phases of the text extraction process whereas text extraction results are highly dependent upon these phases. This dissertation focuses on the text detection, localization and tracking because of their very fundamental importance. A bio-inspired text detection, localization and tracking is developed and presented in the dissertation. Anthropocentric approach of text detection is studied and is mathematically modeled to design a text extraction process. A novel text segmentation method is proposed covering huge range of text scales, colors and font styles. Segmentation procedure consists of adopted K-means clustering and a fuzzy based perceptual merging process. Two effectual feature vectors are introduced for the classification of the text and non-text objects. First feature vector is based upon the human text detection system and is mathematically represented by the Radon transform of the text candidate objects. Second feature vector is derived from the detailed geometrical analysis of the text contents. Union of two feature vectors is used for the classification of text and non-text objects using Support vector machine (SVM). Fuzzy based text tracking mechanism is also introduced in the research that can handle static as well as dynamic text appearing in videos. The dynamic text includes the simple animations like vertical and horizontal scrolling, as well as the complex ones like random movement, scale change and zoom in/out. ii Text detection and localization results are evaluated on three publicly available datasets namely ICDAR 2011, ICDAR 2013 and IPC-Artificial text. Moreover, results are compared with state of the art techniques. Comparison demonstrates the superiority of the presented research. Text tracking dataset is also developed and proposed tracking algorithm is tested on the dataset that demonstrates the applicability of the proposed tracking technique.

Chapters

Show entries

Filter:

Showing 0 to 0 of 0 entries

Title	Author	Supervisor	Degree	Institute
No data available in table
Title	Author	Supervisor	Degree	Institute

Showing 0 to 0 of 0 entries

Similar Thesis

Show entries

Filter:

Showing 0 to 0 of 0 entries

Title	Author	Supervisor	Degree	Institute
No data available in table
Title	Author	Supervisor	Degree	Institute

Showing 0 to 0 of 0 entries

Similar News

Show entries

Filter:

Showing 0 to 0 of 0 entries

Headline	Date	News Paper	Country
No data available in table
Headline	Date	News Paper	Country

Showing 0 to 0 of 0 entries

Article Title	Authors	Journal	Vol Info	Language
No data available in table
Article Title	Authors	Journal	Vol Info	Language

Heading	Article Title	Authors	Journal	Vol Info
No data available in table
Heading	Article Title	Authors	Journal	Vol Info

Search or add a thesis