Identification of Temporal Specificity and Focus Time Estimation in News Documents

Thesis Info

Access Option

External Link

Author

Khan, Shafiq Ur Rehman

Program

PhD

Institute

Capital University of Science & Technology

City

Islamabad

Province

Islamabad

Country

Pakistan

Thesis Completing Year

2019

Thesis Completion Status

Completed

Subject

Computer Science

Language

English

Link

http://prr.hec.gov.pk/jspui/bitstream/123456789/11804/1/Shafiq%20ur%20Rehman%20Khan%20CS%202019%20cust%20isb%20prr.pdf

Added

2021-02-17 19:49:13

Modified

2024-03-24 20:25:49

ARI ID

1676727770879


Time is a paramount aspect of Information Retrieval (IR): it profoundly influences both the interpretation of content and the user's intentions and expectations. The temporal patterns in a document or collection of documents play a central role in the effectiveness of IR systems, and discerning them accurately plays an important role in serving the time-based intention of a user. The web holds a plethora of documents, most of which contain divergent temporal patterns. Incorporating these temporal patterns into IR is referred to as Temporal Information Retrieval (TIR). An understanding of TIR systems is required to address the temporal intention of a user efficiently. For time-specific queries (i.e. queries about an event), the relevant document must relate to the time period of the event. To address this problem, IR systems must (1) determine whether a document is temporally specific (i.e. focuses on a single time period) and (2) determine the focus time of the document (the time to which its content refers). This thesis exploits the temporal features of news documents to improve the retrieval effectiveness of IR systems. To the best of our knowledge, this thesis is the first study to focus on the problem of temporal specificity in news documents. It defines and evaluates novel approaches to determining temporal specificity in news documents. These approaches are then used to classify news documents into three novel temporal classes. The study considers 24 implicit temporal features of news documents to classify them into a) High Temporal Specificity (HTS), b) Medium Temporal Specificity (MTS), and c) Low Temporal Specificity (LTS) classes. For this classification, rule-based and Temporal Specificity Score (TSS) based approaches are proposed. In the former approach, news documents are classified using a proposed set of rules based on temporal features.
The latter approach classifies news documents based on a TSS computed from the temporal features. The results of the proposed approaches are compared with four machine learning classification algorithms: Bayes Net, Support Vector Machine (SVM), Random Forest, and Decision Tree. The outcomes of the study indicate that the proposed rule-based classifier outperforms the four algorithms by achieving 82% accuracy, whereas TSS classification achieves 77% accuracy. In addition, to determine the focus time of news documents, the thesis considers the temporal nature of news documents. The type and structure of documents influence the performance of focus time detection methods. This thesis proposes four methods for splitting a news document into three logical sections, guided by the inverted pyramid news paradigm: the Paragraph Based Method (PBM), the Words Based Method (WBM), the Sentence Based Method (SBM), and the Semantic Based Method (SeBM). Temporal expressions in each section are assigned weights using a linear regression model. Finally, a scoring function is used to calculate the temporal score for each time expression appearing in the document. These temporal expressions are then ranked by their temporal score, so that the most suitable expression appears at the top. Two evaluation measures are used to evaluate the performance of the proposed framework: a) precision score (P@1, P@2) and b) average error years. The precision scores at position 1 (P@1) and position 2 (P@2) indicate whether the correct focus time appears within the top 1 and top 2 positions of the ranked list of focus times, whereas average error years is the distance between the estimated year and the actual focus year of a news document.
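The two evaluation measures described above can be illustrated with a minimal Python sketch. It assumes that each document comes with a ranked list of candidate focus years, that P@k counts a hit when the true focus year is among the top k candidates, and that the average error is computed from the top-ranked year only. The abstract does not spell out these details, so the function names, the toy data, and these choices are illustrative assumptions rather than the thesis's exact definitions.

```python
from typing import List, Sequence


def precision_at_k(ranked_lists: Sequence[Sequence[int]],
                   true_years: Sequence[int], k: int) -> float:
    """Fraction of documents whose true focus year is among the top-k candidates."""
    hits = sum(1 for ranked, truth in zip(ranked_lists, true_years)
               if truth in ranked[:k])
    return hits / len(true_years)


def average_error_years(ranked_lists: Sequence[Sequence[int]],
                        true_years: Sequence[int]) -> float:
    """Mean absolute distance (in years) between the top-ranked and true year."""
    errors: List[int] = [abs(ranked[0] - truth)
                         for ranked, truth in zip(ranked_lists, true_years)]
    return sum(errors) / len(errors)


# Toy data (invented for illustration): four documents, two candidates each.
ranked_lists = [[2001, 1999], [1990, 1991], [2010, 2008], [1945, 1939]]
true_years = [2001, 1991, 2005, 1939]

print(precision_at_k(ranked_lists, true_years, 1))   # 0.25
print(precision_at_k(ranked_lists, true_years, 2))   # 0.75
print(average_error_years(ranked_lists, true_years)) # 3.0
```

Under this reading, P@2 can never be lower than P@1, which is consistent with the pair of scores reported for SeBM.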
The effectiveness of the proposed method is evaluated on a diverse dataset of news related to popular events. The results reveal that the proposed splitting methods achieved an average error of less than 5.6 years, whereas SeBM achieved high precision scores of 0.35 and 0.77 at positions 1 and 2, respectively. The overall findings presented in this thesis demonstrate that the temporal insights of documents can be used to enhance the performance of IR systems. Time-aware information retrieval systems can adopt these findings to satisfy user expectations for temporal queries.
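As a rough illustration of the focus time pipeline summarized in the abstract (split the document into three sections, weight the temporal expressions per section, then score and rank them), the following Python sketch may help. It is a simplification under stated assumptions: the split is a plain paragraph-based division into thirds, only explicit four-digit years are treated as temporal expressions, the section weights are invented constants (the thesis fits them with a linear regression model), and the score is a simple weighted sum. None of the names or values below come from the thesis.

```python
import re
from collections import defaultdict
from typing import Dict, List, Tuple

# Assumption: only explicit four-digit years (1800-2099) count as temporal expressions.
YEAR_PATTERN = re.compile(r"\b(1[89]\d{2}|20\d{2})\b")

# Hypothetical weights for the lead, body, and tail sections.
# The thesis learns section weights with linear regression; these values are invented.
SECTION_WEIGHTS = (0.6, 0.3, 0.1)


def split_into_sections(text: str) -> List[List[str]]:
    """Divide the paragraphs into three consecutive groups (lead, body, tail)."""
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    n = len(paragraphs)
    first_cut = max(1, n // 3)
    second_cut = max(first_cut, (2 * n) // 3)
    return [paragraphs[:first_cut],
            paragraphs[first_cut:second_cut],
            paragraphs[second_cut:]]


def rank_focus_years(text: str) -> List[Tuple[int, float]]:
    """Score each year by the summed weight of the sections that mention it."""
    scores: Dict[int, float] = defaultdict(float)
    for weight, section in zip(SECTION_WEIGHTS, split_into_sections(text)):
        for paragraph in section:
            for year in YEAR_PATTERN.findall(paragraph):
                scores[int(year)] += weight
    # Highest score first; ties are broken by the earlier year.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Invented example article with three paragraphs.
article = (
    "The city marked the anniversary of the 2005 earthquake on Saturday.\n\n"
    "Survivors recalled how the 2005 disaster compared with floods in 2010.\n\n"
    "Officials said a new memorial would open in 2020."
)

print([year for year, _ in rank_focus_years(article)])  # [2005, 2010, 2020]
```

Giving the lead section the largest weight reflects the inverted pyramid paradigm mentioned in the abstract, in which the most important information appears first; the actual weights would have to be learned from data.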

Similar Article Headings

Maulvi Muhammad Abdur Rahman Khan

It is regrettable that these words of condolence on the passing of my respected Maulvi Muhammad Abdur Rahman Khan Sahib, President of the Hyderabad Academy, are being offered so late. The deceased was one of the great scholars of his time, a researcher in the modern sciences, an expert in astronomy, and the author of many valuable books in English and Urdu. He had a very deep and sincere connection with Nadwatul Musannifin and always favoured it with his valuable advice. In 1944, when a visit was made to Hyderabad to introduce the institution and widen its circles, his name headed the list of the elders who took the greatest part in that service. In addition, he handed over the manuscripts of some of his valuable works to "Nadwatul Musannifin" without any remuneration; thus "Quroon-e-Wusta ke Musalmanon ki Ilmi Khidmaat" (The Scholarly Contributions of the Muslims of the Middle Ages), "Tareekh-e-Islam par ek Tairana Nazar" (A Bird's-Eye View of the History of Islam), and "Tuhfat al-Nuzzar" (a summary of the travelogue of Ibn Battuta), which are among his most useful, scholarly, and important works, were published by this institution.
Few others can have had as large a practical share as the late Khan Sahib in realizing the dream of establishing "Jamia Osmania" that the men of learning of the Deccan had dreamed at the beginning of this century. For about twenty-five years he was not only the head of this magnificent institution but, through his sincerity, hard work and integrity, determination and courage, and boundless zeal for action, breathed the spirit of life into it. Then the time also came when this eminent scholar, who had made Urdu the medium of instruction in all the old and new departments of the university, fell victim to the intrigues of Hyderabad's feudal system and withdrew into seclusion, and his achievements were gradually consigned to oblivion. Yet the irony of the turning of night and day is also worth noting: that the revolution of '47...

Resemblance to Non-Muslim Nations: Discussions of Principle and the Contemporary Application of the Jurists' Inferences

Islam, as a religion, has a clearly distinct identity. It gives its followers evident rules, religious obligations, terms, and conditions, and every Muslim should follow these commands. The logic behind the traditions of the Holy Prophet (pbuh) that prohibit resemblance to polytheists and unbelievers is that a religious person must remain in touch with his own culture and civilization. From the traditions of the Holy Prophet, we conclude that he (pbuh) strictly disliked any Muslim adopting and copying the deeds of non-Muslims and thereby leaving Islamic culture and civilization, because all non-Muslims have their own religious obligations. However, Islam condemns all religious obligations, festivals, and conditions that are against Islam. If a Muslim performs such deeds against the teachings of Islam, what would be the result of this behavior and attitude according to Islam? This article discusses the relevant rules and conditions in order to distinguish the differences between the cultures of Islam and other religions and their impact. The author also presents many principles in this regard so that a Muslim may be able to distinguish his religious as well as his cultural values in an Islamic society.

Genetic Analysis for Desirable Attributes in Popcorn

Quantification of genetic attributes, including gene action, heterotic behavior, heritability estimates, and phenotypic and genotypic associations, is an important statistical tool in progressive breeding programs. The genetic material comprised eight diverse popcorn inbred lines, of which four were selected on the basis of phenotypic variation. Two of the four parents were developed from indigenous popcorn material, while the other two were exotic (from Bangladesh). The parental inbred lines were crossed to obtain four F1 crosses. These F1 crosses were selfed to generate F2 and backcrossed with their female and male parents to obtain BC1 and BC2, respectively. The total genetic material synthesized in this study thus included four each of parents, F1s, F2s, BC1s, and BC2s. These generations were evaluated at two locations, i.e. the Cereal Crops Research Institute (CCRI), Pirsabak, Nowshehra, and The University of Agriculture (UOA), Peshawar. At both locations the experimental material was replicated three times in a randomized complete block design. The recorded data were analyzed using combined analysis of variance. In all four crosses, the combined analysis of variance revealed non-significant genotype × location interactions for days to tasseling, silking, and anthesis, ear length, fresh ear weight, kernel rows ear⁻¹, grains row⁻¹, grains ear⁻¹, and % unpopped grains, while the remaining traits exhibited significant genotype × location interactions in some crosses. Significant genotype × location interactions were observed for ear diameter in all crosses. Generation mean analysis indicated that in all crosses dominant gene action was predominantly important in the inheritance of days to tasseling, anthesis, and silking. The additive gene effect was negative but significant in these crosses, except in PS-Pop-29-2-1 × BD-Pop-2-1-2 for anthesis.
Digenic non-allelic epistasis of the additive × dominance and dominance × dominance types was considered the major contributing factor in the inheritance of days to anthesis in crosses PS-1 × BD-2 and PS-2 × BD-2. Duplicate epistasis was observed in cross PS-1 × BD-1 for days to anthesis and silking, while complementary epistasis was observed in cross PS-1 × BD-2. Similarly, dominant gene actions for plant and ear height were positive and significant in all crosses except PS-2 × BD-1 (-9.16). Duplicate epistasis was observed for ear height in crosses PS-1 × BD-1 and PS-1 × BD-2, while complementary epistasis was found in crosses PS-2 × BD-1 and PS-2 × BD-2. On the basis of these results, it can be concluded that crosses in which additive gene effects were high could be improved effectively in subsequent generations. For grain yield and related traits, dominant gene action was positive and significant in all crosses, indicating that the parental populations could be effectively utilized in hybrid development programs. Broad sense heritability estimates for all the studied characters varied from moderate to high, while narrow sense heritability ranged from low to high in all crosses. Maximum broad sense heritability estimates for days to tasseling (0.76 and 0.76) and anthesis (0.84 and 0.83) were observed in crosses PS-1 × BD-2 and PS-2 × BD-2, respectively. Maximum narrow sense heritability estimates for days to anthesis (0.67 and 0.57) and silking (0.67 and 0.58) were observed in crosses PS-1 × BD-2 and PS-2 × BD-2, respectively. High broad sense (0.82) and narrow sense (0.62) heritability estimates for grain yield were observed in crosses PS-2 × BD-1 and PS-1 × BD-1, respectively. Grain yield related attributes exhibited high broad sense heritability, while narrow sense heritability estimates for these traits varied from moderate to high, which indicates a greater contribution of the genetic portion to their inheritance.
For the quality parameter expansion volume, the maximum broad sense heritability of 0.76 and narrow sense heritability of 0.50 were observed in crosses PS-1 × BD-1 and PS-2 × BD-1, respectively. Both mid-parent and better-parent heterosis were negative for flowering characters in all crosses. Maximum negative mid- and better-parent heterosis for days to tasseling, of -7.15 and -3.77, was observed in crosses PS-1 × BD-1 and PS-2 × BD-1, respectively. In cross PS-2 × BD-1, maximum negative mid- and better-parent heterosis was observed for days to anthesis (-6.74 and -3.65) and silking (-6.48 and -3.40). Both mid-parent and better-parent heterosis were positive and high in magnitude for grain yield and all related attributes. Cross PS-1 × BD-2 exhibited the maximum mid-parent (80.73) and better-parent (62.74) heterotic values for grain yield. The maximum mid- and better-parent heterotic values are an indication of dominance gene action in all four crosses. Genetic and phenotypic associations of flowering characteristics with grain yield were negative, while yield related attributes were positively correlated with grain yield. Grain yield was positively correlated with plant height but negatively correlated with ear height in two crosses. The quality trait expansion volume was negatively associated with grain yield. Based on these results, it would be wise to use in hybrid crosses those parents that exhibited both additive and dominant gene actions, coupled with high narrow sense heritability estimates, together with strong associations and high heterotic levels for most of the grain yield related traits. Although all the F1 hybrids were superior, PS-2 × BD-2 was outstanding for grain yield; therefore, this cross can be recommended for commercial cultivation. Popping expansion and percentage of unpopped grains were both positively and significantly controlled by dominant genes in two of the four crosses, while in the other two crosses dominant gene actions were negative.