Using Text Processing Techniques for Linking News Stories for Digital Preservation

Thesis Info

Access Option: External Link
Author: Khan, Muzammil
Program: PhD
Institute: Preston University
City: Kohat
Province: KPK
Country: Pakistan
Thesis Completing Year: 2018
Thesis Completion Status: Completed
Subject: Computer Science
Language: English
Link: http://prr.hec.gov.pk/jspui/bitstream/123456789/12892/1/Muzammil%20Khan%20PhD%20Thesis%2c%20Reg%20No%201094-114043.pdf
Added: 2021-02-17 19:49:13
Modified: 2024-03-24 20:25:49
ARI ID: 1676727855105


News generation in the digital environment is no longer a periodic process with a single fixed output such as a printed newspaper. News is generated instantly and updated online continuously. However, because of the short lifespan of digital information and the speed at which it is generated, it has become vital to preserve digital news for the long term. Digital preservation includes various actions to ensure that digital information remains accessible and usable for as long as it is considered important. Libraries and archives preserve newspapers by carefully digitizing collections, as newspapers are a good source for understanding history. The lifespan of news stories published online varies from one newspaper to another, from one day to a month or even more. Although a newspaper may be backed up and archived by the news publisher or by national archives, in the future it will be difficult to access particular information published in various newspapers about the same news. The issue becomes more complicated if a story is to be tracked through a multilingual archive of many online newspapers, which requires different access technologies. Based on prior studies, a ten-step systematic approach to web preservation is introduced, which leads to an effective web archive; it was followed to create the intended digital news stories archive using the digital news stories extractor tool. Initially, the archive was populated from three English newspapers, then extended to ten online newspapers, and then upgraded to a dual-language (Urdu and English) news article archive extracted from fifteen online news sources. The news stories archive preserves about 360 Urdu and 850 English news articles, crawled every second day using the digital news stories extractor tool. The main goal of the dissertation was to link digital news stories during preservation using text processing techniques.
To achieve this goal, the formulated text-processing similarity measures are applied to link two types of news articles in the archive: English-to-English and dual-language (Urdu-to-English). To link English news articles, the study proposed five content-based similarity measures that compute similarity from news content features and link news articles during preservation. The measures compute a similarity value between news articles based on features such as the number of terms, named entities, named entity positions, title terms, and the position of terms in the titles. The measures are evaluated on the same sets of news articles, of different sizes, and compared with human judgment in order to evaluate accuracy and assess the effectiveness and significance of the designed similarity measures for linking English news articles. The results showed that the proposed measures are feasible for linking English news articles in the news stories archive. The choice of measure depends on its performance in a specific category; for example, one measure may perform better on the category “Opinion”. All the proposed measures are evaluated on six categories of news articles, and the results are compared with two known text-based similarity measures to assess the effectiveness and appropriateness of each proposed measure in its best-fitting scenario. The pre-processing step in any web preservation project is of utmost importance because the intention is to archive the targeted content, especially in a language that lacks sophisticated tools and techniques. To link dual-language news articles in the archive, Urdu news articles need extensive pre-processing, which led to the creation of an Urdu bag of words containing 50502 words and a dictionary containing 78739 pairs of Urdu words with English meanings.
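As a rough illustration of how a content-based measure of this kind could score two English articles, the sketch below combines term overlap in the titles with term overlap in the bodies. It is not one of the thesis's five measures: the Jaccard coefficient, the small stopword list, the `title_weight` parameter and all function names are assumptions made for this example, and named entities and term positions are left out for brevity.

```python
import re

# Hypothetical, deliberately tiny stopword list used only for this sketch.
STOPWORDS = {"the", "a", "an", "of", "in", "on", "to", "and", "for", "is"}


def terms(text):
    """Lowercase the text and return its set of non-stopword terms."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return {t for t in tokens if t not in STOPWORDS}


def jaccard(a, b):
    """Jaccard coefficient of two sets; 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def article_similarity(first, second, title_weight=0.4):
    """Weighted mix of title-term overlap and body-term overlap.

    Each article is a dict with "title" and "body" keys (assumed layout).
    """
    title_sim = jaccard(terms(first["title"]), terms(second["title"]))
    body_sim = jaccard(terms(first["body"]), terms(second["body"]))
    return title_weight * title_sim + (1 - title_weight) * body_sim
```

Two articles could then be linked when `article_similarity` exceeds a threshold chosen by comparison with human judgment, in the spirit of the evaluation described above.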
The study proposed five content-based similarity measures that compute similarity from news content features and link Urdu-to-English news articles during preservation. The measures are applied to the same sets of news articles, of different sizes, and compared with one another. The results showed that three of the proposed measures gave appreciable results for linking dual-language news articles in the archive, and that these results can be improved by improving the structure and contents of the Urdu dictionary. In summary, the performance of the different measures has been evaluated individually for linking digital news articles in the digital news story archive, to ensure that these news articles remain accessible in the future within this enormous collection.
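One way to picture dictionary-based Urdu-to-English linking is sketched below: Urdu terms are mapped to English meanings through a bilingual dictionary, and the mapped terms are compared with the terms of an English article. This is only an assumed, simplified scheme and not the thesis's method: the three-entry `URDU_TO_ENGLISH` dictionary, the whitespace tokenization and the overlap ratio are all placeholders chosen for the example.

```python
import re

# Hypothetical toy dictionary; the thesis reports 78739 Urdu-English pairs.
URDU_TO_ENGLISH = {
    "حکومت": ["government"],
    "بجٹ": ["budget"],
    "وزیر": ["minister"],
}


def translate_terms(urdu_text, dictionary):
    """Map whitespace-separated Urdu tokens to their English meanings."""
    english = set()
    for token in urdu_text.split():
        # Tokens missing from the dictionary are skipped.
        english.update(dictionary.get(token, []))
    return english


def cross_lingual_similarity(urdu_text, english_text, dictionary):
    """Share of translated Urdu terms that also occur in the English text."""
    translated = translate_terms(urdu_text, dictionary)
    if not translated:
        return 0.0
    english_terms = set(re.findall(r"[a-z]+", english_text.lower()))
    return len(translated & english_terms) / len(translated)
```

Because unmatched tokens are simply dropped, the score in such a scheme depends heavily on dictionary coverage, which is consistent with the observation above that a better Urdu dictionary should improve the results.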
Similar Article Headings

Sheikh Abdul Aziz Shadish (شیخ عبدالعزیز شادیش)

It is with regret that we note that Sheikh Abdul Aziz Shadish passed away in Egypt this month. He was one of the students of Mufti Muhammad Abduh and was by nature extremely passionate. In the days of the Young Turks' Committee of Union and Progress he was its ardent supporter; indeed, it should be said that he was its religious spirit. He was the right-hand man of the late Enver Pasha. After the Balkan war he launched from Constantinople a scholarly, religious and reformist journal in Arabic named "Al-Hidayah". During the Great War he was a preacher and propagator of jihad in Arabia against the Allies. He also took part in Turkey's present revolution and wanted the thread of moderate religious reform and Islamic unity not to slip from the hands of this revolution. For this reason he founded in Angora a scholarly and literary society of the Islamic world, in whose library books in all the Islamic languages were to be collected, so that the differing intellectual levels of the entire Islamic world could be seen at a glance and Islamic unity would take tangible form. However, he could not keep pace with the speed of Mustafa Kemal Pasha. When Mustafa Kemal cast off the robe of the caliphate and showed himself to everyone as he really was, the Sheikh had no choice but to leave Angora and come to Egypt. Renouncing politics entirely, he followed in the footsteps of his teacher: he was appointed inspector of primary education in the Egyptian education department.
Alongside this service, within just a few years he began, in a reformist capacity, the work of protecting Egyptian students from dangerous nationalist sentiments. First he established for them a society named Makarim al-Akhlaq, and in just a few of its sessions this society, with respect to the students and Western morals and civilization...

Remote Work Projects and Their Role in Achieving Sustainable Development

The study aimed to identify the role of remote work projects in achieving sustainable development. It examined their roles in knowledge and culture, training and support, putting benefits to use, and keeping pace with development. For sustainable development, its four dimensions were studied: technical, environmental, social and economic. The study followed the descriptive analytical method to process and analyze data from a sample of 155 workers in the field of remote work. The study reached a set of findings, the most important of which are the worker's sense of responsibility for the work toward the employer and the intermediary company, the provision of training and support to employees, and the ability of these projects to improve economic and living conditions. Technical development was the most fully achieved dimension of development, at 81.00%, followed by economic development at 79.40%. The study showed a positive relationship between the role of remote work projects and sustainable development. It recommended spreading awareness and education about remote work and its role in bringing about sustainable development, providing financial support to achieve the goals of the government strategy aimed at adopting the idea of remote work, and putting in place laws, legislation and regulations to serve and protect the Palestinian worker in dealings with the foreign employer.

Characterization and Utilization of Citrus Wastes As Value Added Fruit Leather

In the current research, locally grown citrus wastes (peel and bagasse) were characterized through compositional analysis, followed by preparation of citrus-waste-enriched fruit leather. A biokinetic trial was then carried out on Sprague Dawley rats to evaluate the prophylactic potential of citrus waste, with special reference to hypercholesterolemia and hyperglycemia. Chemical analysis showed that citrus peel is an excellent source of inorganic matter. In the phytochemical profiling, citrus peel and bagasse both showed the highest activity in the methanol extract, followed by ethanol and water. Grapefruit peel and bagasse had the highest polyphenol content, followed by oranges and musami. The methanolic extract of grapefruit peel showed maximum values of TPC 206.53±6.82 mg GAE/100 g, flavonoids 83.06±2.74 mg QE/100 g, DPPH 62.80±2.07%, antioxidant activity 58.13±1.92%, ABTS 10.35±0.34 µmole TE/g, iron chelation 18.54±0.61%, superoxide anion 34.62±1.14% and hydrogen peroxide 55.90±1.84%. The same trend was observed in the methanolic extract of grapefruit bagasse. Furthermore, HPLC quantification of the bioactive compounds hesperidin and nobiletin showed that hesperidin was highest in the methanolic extracts of grapefruit peel and bagasse, at 28.51 and 7.40 mg/g respectively. Similarly, nobiletin was highest at 9.92 mg/g in the methanolic extract of grapefruit peel and 2.78 mg/g in the methanolic extract of grapefruit bagasse. On the basis of the in vitro analyses and HPLC quantification, the methanolic extracts of grapefruit peel and bagasse were selected for preparing citrus-enriched fruit leather, which was then used in the bio-efficacy trial. Accordingly, two types of fruit leather were prepared using methanolic extract of citrus peel and bagasse at 5%, alongside a control. The prepared fruit leathers were assessed by physico-chemical analysis, antioxidant activity and sensory evaluation on a monthly basis during four months of storage.
During storage, pH, acidity, TPC and DPPH changed significantly, whereas all other parameters changed non-significantly. The hedonic response to the citrus-enriched fruit leather showed that the best results were obtained by T1 (fruit leather with 5% grapefruit peel extract), followed by T2 (fruit leather with 5% grapefruit bagasse extract) and T0 (control fruit leather). Hesperidin was then evaluated in a biokinetic trial using an experimental rat model. The biokinetic trial comprised three studies: a normal study (study I) fed on chow diet, a hypercholesterolemic study (study II) fed on chow diet with 1.5% cholesterol, and a hyperglycemic study (study III) fed on chow diet with 40% sucrose. Each study was further divided into three groups on the basis of diet: the first group was fed control fruit leather, the second fruit leather prepared with 5% grapefruit peel extract, and the third fruit leather prepared with 5% grapefruit bagasse extract. During the 60-day efficacy trials, feed intake, drink intake and body weight changed significantly. In the hypercholesterolemic study (study II), the cholesterol level decreased significantly, by 14.42% and 10.65% with grapefruit peel extract (T1) and grapefruit bagasse extract (T2) enriched fruit leather respectively. Likewise, the LDL level was 60.62±2.24 mg/dL in the control group and fell to 51.93±1.92 mg/dL in T1 and 54.05±2.00 mg/dL in T2. The grapefruit waste had a non-significant effect on HDL; however, the triglyceride level decreased by 11.34% with T1 and 10.40% with T2. The same trend was observed in the hyperglycemic study (study III), in which the glucose level decreased from 141.53±4.95 mg/dL to 116.90±4.09 mg/dL with grapefruit peel extract enriched fruit leather and to 128.52±4.15 mg/dL with grapefruit bagasse extract enriched fruit leather. Similarly, the insulin level increased by 10.32% and 5.07% in T1 and T2 respectively.
Liver function tests (AST, ALT and ALP) and kidney function tests (urea and creatinine levels) showed non-significant changes in all the studies with supplementation of either fruit leather. Moreover, neither grapefruit peel extract enriched fruit leather nor grapefruit bagasse extract enriched fruit leather had any harmful effect on blood biochemistry, as shown by the hematological analyses. In conclusion, fruit leathers based on citrus wastes are effective in mitigating health-related disorders.