Investigating Protein Semantic Similarity Measurement and its Correlation With Sequence Similarity

Thesis Info

Access Option

External Link

Author

Ikram, Najmul.

Program

PhD

Institute

Capital University of Science & Technology

City

Islamabad

Province

Islamabad.

Country

Pakistan

Thesis Completing Year

2018

Thesis Completion Status

Completed

Subject

Computer Science

Language

English

Link

http://prr.hec.gov.pk/jspui/bitstream/123456789/9397/1/Najmul_Ikram_Semantic_Computing_HSR_2018_CUST_31.07.2018.pdf

Added

2021-02-17 19:49:13

Modified

2024-03-24 20:25:49

ARI ID

1676727781819




Protein sequence similarity is commonly used to compare proteins and to search for proteins similar to a query protein. With the growing use of biomedical ontologies, especially the Gene Ontology (GO), semantic similarity between ontology terms, proteins and genes is attracting the attention of researchers. Protein semantic similarity measurement has many applications in bioinformatics, including protein function prediction and protein-protein interactions. Semantic similarity measures were proposed by Resnik, Jiang and Conrath, and Lin. More recent measures include Wang and AIC. The question of whether semantic similarity correlates strongly with sequence similarity has been addressed by some authors. Such a correlation has been reported, and it has been used to evaluate semantic similarity computation methods as well as for protein function prediction. We investigate the correlation between semantic similarity and sequence similarity using graphs, Pearson's correlation coefficient and example proteins. We find that there is no strong correlation between the two similarity measures. Pearson's correlation coefficient is not sufficient to explain the nature of this relationship unless it is accompanied by graph analysis. We find that there are several pairs with low sequence similarity and high semantic similarity, but very few pairs with high sequence similarity and low semantic similarity. Interestingly, the correlation coefficient depends only on the number of common GO terms in the proteins under comparison.

We propose a novel method, SemSim, for semantic similarity measurement. It addresses the limitations of existing methods and computes similarity in two steps. In the first step, a SimGIC-like approach is used, in which the contribution of common ancestors is divided by the contribution of all ancestors. In the second step, we use two new factors: Specificity, computed from ontology-based information content, and Uniqueness, computed from annotation-based information content. The final result, after applying these two factors, makes a clear distinction between generalized and specialized terms. We conducted experiments on protein pairs having evidence of high similarity and on pairs having evidence of low similarity. The experiments show that SemSim performs better than the previous measures in both cases.

When semantic similarity is used for searching proteins in large databases, speed becomes a significant issue. To search a database of p proteins for those similar to a query protein having m annotations, p × m × n × g comparisons would be required. Here n is the average number of annotations per protein, g is the complexity of the GO term similarity computation algorithm, and it is assumed that each term of one protein is compared with each term of the other. We propose a method, SimExact, that is suitable for high-speed searching of semantically similar proteins. Although SimExact works on common terms only, our experiments show that it gives the correct results required for protein semantic searching. SimExact can be used as a preprocessor that generates a candidate list for the existing methods, which then carry out further computation. Such an arrangement gains speed while retaining the accuracy of the given method. We provide an online tool that generates a ranked list of the proteins similar to a query protein, with a response time of less than 8 seconds in our setup. We use SimExact to search for protein pairs having high disparity between semantic similarity and sequence similarity. SimExact makes such searches possible, which would be NP-hard otherwise.
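
The first step of SemSim is described in the abstract as a SimGIC-like ratio: the contribution of common ancestors divided by the contribution of all ancestors. The Python sketch below illustrates only that ratio and is not the thesis implementation. It assumes that "contribution" means the summed information content (IC) of the terms, as in SimGIC, and it leaves out the Specificity and Uniqueness factors of the second step. The toy ontology, the term names, the IC values and the function names are all hypothetical.

```python
from typing import Dict, Set

# Hypothetical toy ontology: each term maps to its direct parents.
PARENTS: Dict[str, Set[str]] = {
    "root": set(),
    "binding": {"root"},
    "catalysis": {"root"},
    "dna_binding": {"binding"},
    "kinase": {"catalysis"},
}

# Hypothetical information content values; more specific terms score higher.
IC: Dict[str, float] = {
    "root": 0.0,
    "binding": 1.0,
    "catalysis": 1.0,
    "dna_binding": 3.0,
    "kinase": 2.0,
}


def with_ancestors(terms: Set[str]) -> Set[str]:
    """Return the given terms together with all of their ancestors."""
    seen: Set[str] = set()
    stack = list(terms)
    while stack:
        term = stack.pop()
        if term not in seen:
            seen.add(term)
            stack.extend(PARENTS[term])
    return seen


def simgic_like(terms_a: Set[str], terms_b: Set[str]) -> float:
    """Summed IC of shared terms divided by summed IC of all terms."""
    a = with_ancestors(terms_a)
    b = with_ancestors(terms_b)
    union_ic = sum(IC[t] for t in a | b)
    if union_ic == 0.0:
        return 0.0  # nothing informative to compare
    return sum(IC[t] for t in a & b) / union_ic


if __name__ == "__main__":
    # Two proteins sharing one specific annotation.
    print(simgic_like({"dna_binding"}, {"dna_binding", "kinase"}))
    # Two proteins sharing only the uninformative root term.
    print(simgic_like({"dna_binding"}, {"kinase"}))
```

In this toy setting, two proteins that share only the root term score zero because the root carries no information content, while a shared specific term dominates the ratio.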

Similar Article Headings


A. K. Brohi

The scholarly circles of India and Pakistan received with great sorrow the news that Mr. A. K. Brohi, the internationally renowned jurist and scholar whose full name was Allah Bakhsh Brohi, passed away last September of a heart ailment. His body was brought from London to Karachi. He was 72 years old. It is entirely accurate to say of him that he was a jurist by profession, a philosopher by training and a devout man by temperament. His eloquence was of the highest order. In 1960 he came to India as Pakistan's High Commissioner. In those days an Indo-Pak Cultural Conference was held in Delhi, attended by Pakistan's leading writers, poets and intellectuals. Prime Minister Jawaharlal Nehru took part in the inaugural session; he seemed very tired, but when Mr. Brohi began to speak he became all ears.
Mr. Brohi established the International University in Islamabad and became its first Rector. He was Chairman of Pakistan's National Hijra Council, on account of which the government had given him the rank of ambassador. He was the author of several books in English. As Chairman of the National Hijra Council he was engaged in having one hundred high-quality books on Islam compiled. A committee has been appointed to select these books, one of whose members was the late Syed Sabahuddin Abdur Rahman. (Ziauddin Islahi, January 1988)

 

Improving the Quality of Learning to Listen to Short Stories by Using Recordings of Short Story Readings for Class XI IPA2 Students of SMA Negeri 1 Bontotiro, Bulukumba Regency

This study aims to describe the improvement in the quality of learning to listen to short stories when recordings of short story readings are used as media for students of class XI IPA2 at SMA Negeri 1 Bontotiro, Bulukumba Regency. The results show improvement at each stage. At the planning stage, the subject teacher's ability to plan lessons improved in cycle II. At the implementation stage, student activity during the learning process increased, as seen in the students' sincerity, discipline and self-confidence. At the evaluation stage, results on the short story listening test improved: 56.09% of students achieved mastery in the first cycle, and 97.56% achieved mastery in the second cycle. Based on these results, it was concluded that recordings of short story readings can improve the quality of learning to listen to short stories in class XI IPA2 at SMA Negeri 1 Bontotiro, Bulukumba Regency.

The Prevalence of Malaria and Assessment of the Uptake of Malaria Prevention Measures in Blood Donors in Two Regional Blood Transfusion Centres in Kenya

Transfusion-transmitted malaria is one of the most common transfusion-transmissible infections and is a threat to blood safety and malaria control in Sub-Saharan African countries where malaria is endemic. The majority of healthy adults living in malaria-endemic areas have some degree of immunity to the disease, and an asymptomatic low-level parasitaemia is known to exist in a subset of this population. Blood donors recruited from the population are screened using donor-selection criteria that include age, weight, self-declared well-being and measurement of vital signs, but not history of recent malaria infection or treatment. The Kenya National Blood Transfusion Services does not currently screen donated blood for malaria, opting instead for prophylactic antimalarial use. This policy is inconsistent with the current WHO guidelines for the prevention of transfusion-transmitted malaria and with the national policy guiding malaria treatment, which states that antimalarial use is reserved for laboratory-confirmed cases. A cross-sectional survey was conducted at two regional blood transfusion centres (RBTCs) of differing malaria endemicity to determine the prevalence of malaria in blood donors. Of the 1,100 donors who participated in this study, five tested positive for malaria antigen, three from the Mombasa RBTC and two from the Nairobi RBTC, giving an overall malaria antigen positivity of 0.5%. Only one peripheral blood film examined was positive for malaria, yielding a slide positivity of 0.1%. The prevalence of malaria in blood donors does not justify the routine use of prophylactic antimalarials with each transfusion. A blood donor malaria screening algorithm should be developed and implemented as an alternative to malaria prophylaxis in the prevention of transfusion-transmitted malaria.