Search or add a thesis

Advanced Search (Beta)
Home > Novel Disease Named Entity Recognition Dner & Hybrid Relation Extraction Hre Frameworks for Biomedical Text

Novel Disease Named Entity Recognition Dner & Hybrid Relation Extraction Hre Frameworks for Biomedical Text

Thesis Info

Access Option

External Link

Author

Muzaffar, Abdul Wahab

Program

PhD

Institute

National University of Sciences & Technology

City

Islamabad

Province

Islamabad

Country

Pakistan

Thesis Completing Year

2017

Thesis Completion Status

Completed

Subject

Applied Sciences

Language

English

Link

http://prr.hec.gov.pk/jspui/bitstream/123456789/8002/1/Abdul_Wahab_Muzaffar_Softwar_Engineering_2017_HSR_NUST_22.06.2017.pdf

Added

2021-02-17 19:49:13

Modified

2024-03-24 20:25:49

ARI ID

1676726766581

Asian Research Index Whatsapp Chanel
Asian Research Index Whatsapp Chanel

Join our Whatsapp Channel to get regular updates.

Similar


Biomedical knowledge is usually presented in the form of unstructured segments; making the extraction of such information a complex task. Although, manual information extraction often produces the best results, it is harder to manage biomedical data extraction manually, because its data size is rising exponentially. Thus, there is a need for automatic tools and techniques for information extraction and knowledge discovery in biomedical text mining. Named entity recognition and relation extraction are focused areas of research in biomedical information extraction systems. Relation Extraction hinders the known relationship between Named Entities and in some way these are dependent on each other yet research also takes both these steps in an independent manner also. A lot of work has been done on biomedical named entity recognition focusing mostly on supervised and semi supervised solutions but very less attention work is done on unsupervised methods. Due to limited availability of annotated corpora the researchers now directed their efforts towards achievement of unsupervised named entity recognition systems. Named Entity Recognition from annotated corpora has been matured and there is very less margin for performance optimization. The challenge is still alive for the named entity recognition from unannotated corpora in all domains generally and for biological and biomedical domain specifically. Biomedical text exhibits relationships between different entities which are important for practitioners and researchers. Relation extraction is a significant area in biomedical knowledge, which has gained much importance in the last two decades. A lot of work has been done on biomedical relation extraction and identification focusing on two major areas: 1) rule based technique and 2) machine learning technique. In the last decade, focus has changed to hybrid approaches which have shown better results. This research presents an unsupervised named entity recognition framework along with a hybrid feature set for classification of relations between biomedical entities. Our Named Entity Recognition uses UMLS concepts and creates signatures that automate signature vectors. The vectorization of UMLS concepts ensures application of the framework in a generic way. Our framework differs with previous un-supervised methods in a way that we rely on UMLS for vector space creation instead of corpus statistics. The Relation Extraction approach uses bag of word feature, along with Natural Language Processing (NLP) to identify the noun and verb phrases and semantic features based on UMLS concepts. This hybrid feature set is a better representation of the relation extraction task. The main contribution in this hybrid features is the addition of semantic feature xi | P a g e set where verb phrases are ranked using Unified Medical Language System (UMLS), and a ranking algorithm is designed to get the most suitable concepts as features for the classifier. For Named Entity Recognition, we used Arizona Disease Corpus (AZDC) a gold standard corpus for this task. Our framework achieved accuracy of 72.56% which is competitive with supervised techniques on the same corpus. Our Relation Extraction approach has been validated on standard biomedical text corpus obtained from MEDLINE 2001, an accuracy of 96.19%, 97.45%, 96.49% and F-measure of 98.05%, 93.55%, 88.89% has been achieved for the cure, prevent and side effect relations respectively.
Loading...
Loading...

Similar Books

Loading...

Similar Chapters

Loading...

Similar News

Loading...

Similar Articles

Loading...

Similar Article Headings

Loading...

اسم ِ استفہامیہ : أنّٰی کہاں؟

اسم ِ استفہامیہ :أنّٰی کہاں؟

ارشادِ ربانی ہے:

"اَنّٰى لَهُمُ الذِّكْرٰى وَقَدْ جَاءَهُمْ رَسُوْلٌ مُّبِيْنٌ"۔ [[1]]

"ان کے لئے نصیحت کہاں ہے؟ کھول کھول کر بیان کرنے والے پیغمبر ان کے پاس آچکے"۔

رسولِ مُبین کے دو مطلب ہیں ۔

ایک یہ کہ اس کا رسول ہونا اس کی سیرت، اس کے اخلاق و کردار اور اس کے کارناموں سے عیاں ہے۔

 دوسرا یہ کہ اس نے حقیقت کو کھول، کھول کر بیان کرنے میں کوئی کسر نہیں اٹھا رکھی ہے۔اُس وقت یہ ماننے کا کوئی فائدہ نہیں : سو ارشاد فرمایا گیا کہ " اس وقت ان کے لیے نصیحت کا کوئی موقع کہاں باقی رہا جبکہ اس سے پہلے آچکے انکے پاس کھول کر بیان کرنے والے ایک عظیم الشان رسول " ۔ ایسے عظیم الشان رسول جن کی صداقت و حقانیت روز روشن کی طرح واضح تھی۔ اور واضح ہے۔ مگر پھر بھی یہ لوگ ایمان نہیں لائے تو اس کے بعد اب کیسے اور کیا ایمان لائیں گے؟ سو اس وقت ان کی تذلیل و تخجیل کیلئے اللہ کی طرف سے انکو یہ جواب دیا جائے گا۔ بہرکیف ارشاد فرمایا گیا کہ اِعلانِ حق کے پہنچ جانے اور اس کے دیکھ لینے کے بعد ایمان لانے اور نصیحت قبول کرنے کا موقع کہاں باقی رہے گا۔ بالخصوص جبکہ انکے پاس اِتمامِ حجت کیلئے اللہ تعالیٰ کی طرف سے ایک ایسا عظیم الشان رسول بھی پہنچ گیا جس نے انکے سامنے حق کو پوری طرح واضح کرکے اور نکھار کر بیان کر دیا تھا۔ لیکن انہوں نے اس کی بات کو مان کر نہ دیا سو ایمان لانے کا وہ موقع جب گزر گیا تو...

Cartoons as A Tool of Religious Instruction for School-Going Children: A Case Study of Sheikhupura City

This research elaborates on utilization of cartoons as a tool of religious instruction for children. Cartoons can be used as an instrument for children’s mental development because a child’s mind is like a plain slate, what you write leaves a long-lasting impact. Cartoons have become a favorite activity of children and they spend most of their leisure time on watching cartoons. Children’s interest in cartoons also impacts their mind according to the content and concept of cartoons. For this aim, cartoons with religious instructions were shown to children and interview of some children’s parents and teachers were done in Sheikhupura. The data collected from the observations and interviews done qualitatively for this research shows that cartoons are a very effective source of religious instruction for children. Children not only understand these instructions easily but also try to follow them.

Matrix Based Hybrid Key Establishment Schemes for Securing the Communication in Group Based Wsn

Secure key distribution is extremely crucial in commercial and military applications of WSN and wireless sensor and Actor networks (WSAN) for providing confidentiality to messages shared among sensors. It becomes more challenging when two cluster heads cannot communicate directly due to communication range. In this case, an ordinary node located at cluster boundaries to play a role of gateway node that has established keys in both clusters. Entire communication between clusters is transmitted through these gateway nodes. The main problem is that compromised gateway node exposes all keys transmitted through that node and relevant links are compromised before establishment. We have proposed Key Distribution using Key Fragmentation (KDKF) scheme that solves the problem. Sender divides the actual key into fragments using key fragmentation algorithm and sends these using gateway nodes as intermediaries. Receiver Node assimilates these key fragments using XOR operation to interpret actual key. KDKF scheme provides deployment scenarios and detailed protocol description to elaborate the message structure to exchange security credentials between distant nodes. Moreover, formal modeling is performed using Rubin Logic to verify and analyze the proposed protocol. Performance and resilience of protocol is validated through simulations using ns-2.35. It proves that a compromised gateway node cannot retrieve the actual key and only gets the key fragment. To assimilate the single actual key, adversary needs to subvert exactly those gateway nodes that participated in key fragments transmission. In contemporary schemes, all future keys transmitted through compromised gateway nodes were exposed. Results proof that KDKF is much more resilient against compromising attack and keys in network are not exposed to adversary. To further improve the communication overheads and better connectivity, we have proposed a Matrix based Key Establishment Scheme (MKES) where actual key is never transmitted on network. In MKES, each node is pre-loaded with one row and one column from a matrix. After deployment, indices for row and column are exchanged between the two nodes and values at intersection of row and column index is used to calculate the key on each node. It can establish keys with neighboring nodes, cluster heads, SINK and even across different WSN using same matrix values. Results are produced for analyzing resilience, storage and communication costs to prove dominance of MKES. It is also tested by deploying on android for securing messaging. Moreover, MKES is used in Critical Data Reclamation (CDR) that provides aggregated data extraction in isolated clusters where cluster head (GH) is destroyed. GH receives the query from sink node to send towards the cluster members and then transmits the aggregated Ata Ullah 52-FBAS/PHDCS/F09 vAbstract response to sink node. In the absence of GH, gateway nodes located at common boundary of neighboring clusters communicate with the neighboring GHs to transmit critical data to sink in a timely manner until a new GH is not added in the cluster. A secure protocol scenario is discussed in a stepwise manner to add new GH to the network. Proposed schemes is simulated and evaluated for Density of cluster, Connectivity, Resilience, Storage and Communication overhead.