
Grand: Graph Based Author Name Disambiguation Framework

Thesis Info

Access Option

External Link

Author

Hussain, Ijaz

Program

PhD

Institute

COMSATS University Islamabad

City

Islamabad

Province

Islamabad

Country

Pakistan

Thesis Completing Year

2019

Thesis Completion Status

Completed

Subject

Computer Science

Language

English

Link

http://prr.hec.gov.pk/jspui/bitstream/123456789/10804/1/Ijaz%20Hussain_CS_2019_Comsats_PRR.pdf

Added

2021-02-17 19:49:13

Modified

2024-03-24 20:25:49

ARI ID

1676727765937


Author name disambiguation is a challenging research area in bibliometric analysis, scientometrics, and informetrics. Author name ambiguity occurs in two ways: when multiple authors share a common name (homonyms), or when multiple variations of one author's name (synonyms) appear in bibliographic databases such as DBLP, ACM, and Google Scholar. In both scenarios, it is difficult to be certain about the accuracy of the retrieved results. Correctly separating one author's work from another's is necessary for many reasons. For example, on author ranking sites such as Arnetminer, author name ambiguity in citations leads to wrong metrics such as the h-index, g-index, and i-index. Author name ambiguity is one of the main sources of error in analyses based on these bibliographic databases, so improving the accuracy of the aforementioned metrics requires disambiguating the ambiguous authors. Similarly, these bibliographic databases provide input to visual bibliographic information retrieval systems that are currently used for finding experts (supervisors), searching specific literature, selecting reviewers, and detecting potential conflicts of interest. Existing author name disambiguation techniques require a representative labeled data set for training the model, or require the number of ambiguous authors to be known a priori, or require extra information from the Web, or need user feedback, and they are less scalable because thousands of models must be trained, one for each ambiguous author. In this dissertation, a complete author name disambiguation framework called “GRAND” is presented. It consists of four main algorithms, one each for the resolution of homonyms, synonyms, sole authors, and incremental author name ambiguity. The first algorithm is DISC, which exploits graph semantics, similarity measures, and community detection algorithms to disambiguate homonyms. The citation data set is preprocessed and ambiguous author blocks are created.
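The blocking step mentioned above can be pictured with a minimal sketch. The abstract does not say how blocks are keyed, so the key used here (surname plus first initial), the `block_key` and `build_blocks` helpers, and the sample records are all hypothetical illustrations rather than the thesis's method.

```python
from collections import defaultdict


def block_key(name: str) -> str:
    """Hypothetical blocking key: surname plus first initial.

    Assumes names are written as "First [Middle] Last".
    """
    parts = name.lower().replace(".", " ").split()
    return f"{parts[-1]} {parts[0][0]}"


def build_blocks(citations):
    """Group citation ids by the blocking key of each listed author."""
    blocks = defaultdict(list)
    for record in citations:
        for author in record["authors"]:
            blocks[block_key(author)].append(record["id"])
    return blocks


# Made-up sample records, for illustration only.
citations = [
    {"id": 1, "title": "Graph clustering for networks",
     "authors": ["Wei Wang", "Ijaz Hussain"]},
    {"id": 2, "title": "Protein folding simulation",
     "authors": ["W. Wang", "A. Khan"]},
    {"id": 3, "title": "Community detection in graphs",
     "authors": ["Wei Wang"]},
]

blocks = build_blocks(citations)
for key, ids in sorted(blocks.items()):
    print(key, ids)
```

Each block gathers every citation that could belong to the same ambiguous name, so later steps only need to compare records within a block rather than across the whole data set.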
DISC utilizes only two citation attributes, co-authors and titles, which are available in all bibliographic databases. The co-author graph of the citation data set is constructed, and “GSkeletonClu: A graph Structural Clustering Algorithm for networks” is used to identify hub vertices, outliers, and clusters of nodes in the co-author graph. Homonyms are resolved by splitting these clusters of nodes across the hub nodes if the similarity between their title feature vectors is less than a threshold. The second algorithm is SISTER, which uses the graph-based semantic similarity measure “SynGeo”. It preprocesses the citation data set and constructs its co-author graph. Synonyms are resolved by exploiting SynGeo, which is based on syntactic similarity and graph geodesics between the compared nodes. The third algorithm is GCLUSIM, which detects and disambiguates sole authors. In GCLUSIM, title feature vectors are constructed for sole authors and for already disambiguated authors, and the similarity between them is computed. On the basis of this similarity, a sole author may be merged with one of the disambiguated clusters. As our final contribution, the fourth algorithm is CAND, which exploits author name indices, author profiles, and a comparison function to solve incremental author name ambiguity. Author name indices enhance overall system performance, and author profile models help in disambiguating incremental insertions. The comparison function utilizes the strongest bibliometric features: co-authors, titles, and self-citations. The proposed algorithms are more effective than state-of-the-art methods in terms of clustering metrics. Furthermore, we believe that the algorithms proposed in this dissertation can serve as a baseline for future author name disambiguation studies.
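The idea of merging a sole author into an existing cluster by title similarity can be sketched as follows. The abstract does not specify the vector weighting, the similarity measure, or the threshold. This sketch therefore assumes bag-of-words counts, cosine similarity, and an arbitrary threshold of 0.3, and all function names and data are hypothetical.

```python
import math
from collections import Counter


def title_vector(titles):
    """Bag-of-words term counts over a list of titles (assumed weighting)."""
    return Counter(" ".join(titles).lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def assign_sole_author(sole_titles, clusters, threshold=0.3):
    """Return the id of the most similar cluster, or None if none is
    similar enough. The threshold value is an assumption."""
    sole_vec = title_vector(sole_titles)
    best_id, best_sim = None, 0.0
    for cluster_id, titles in clusters.items():
        sim = cosine(sole_vec, title_vector(titles))
        if sim > best_sim:
            best_id, best_sim = cluster_id, sim
    return best_id if best_sim >= threshold else None


# Made-up clusters of already disambiguated authors.
clusters = {
    "wang_1": ["graph clustering for citation networks",
               "community detection in graph data"],
    "wang_2": ["protein folding simulation",
               "molecular dynamics of protein"],
}

print(assign_sole_author(["graph based community detection"], clusters))
print(assign_sole_author(["quantum optics"], clusters))
```

With this sample data the first sole-authored paper is assigned to `wang_1`, while the second shares no title terms with either cluster and stays unmerged (`None`). The same thresholded title comparison could in principle be used in the opposite direction, to decide when a cluster should be split rather than merged.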
Similar Article Headings


Maulvi Muhammad Shafi

It is with regret that we record the death last month of Maulvi Muhammad Shafi, the renowned Pakistani scholar and former Principal of Oriental College, Lahore. His death is a great loss to the scholarly world. He was among the eminent scholars and researchers of India and Pakistan, and along with English he was an expert in Arabic and Persian. His scholarly standing was very high, and his scholarly and research achievements are highly varied. Besides many learned articles, he edited and published several important and rare Arabic and Persian books with corrections and annotations. During his principalship he greatly raised the scholarly standard of the Oriental College Magazine and instilled in his students a general taste for serious scholarly inquiry and research. As a result, most of the present scholars and researchers of Lahore were trained by him. In recent years, the compilation and publication of the Lahore University Urdu Encyclopaedia had begun under his supervision, and some of its parts have been published, but the work is still at an early stage. In this age of a dearth of great men, he was, in his scholarly taste and pursuit of knowledge, a model of the scholars of earlier generations. May Allah Almighty bless this devotee of knowledge with mercy and forgiveness. (Shah Moinuddin Nadvi, April 1963)

 

Allama Sajawandi on the Science of al-Waqf wa'l-Ibtida (Pausing and Resuming)

The Qur'an comprises the verses of Allah, whose preservation Allah Himself has guaranteed. There is no dispute that the Holy Qur'an remains preserved today just as the Prophet Muhammad صلى الله عليه وسلم taught it to his Companions. It is likewise obligatory, for earnest recitation of the Holy Qur'an, to observe the Qur'anic rules of pausing and pronunciation, as required by the Qur'an, the Hadith, and ijma-e-ummah. The Arabs were native speakers, so they understood the meaning and paused at the proper places, but for non-Arabs this was very difficult. Therefore, for the convenience of the people, Allama Sajawandi placed symbols at various points in the mus'haf so that a reader can pause at a suitable place and avoid errors of meaning. Allama Sajawandi not only wrote many books on this topic but was also the first to formulate pause (waqf) symbols for the Holy Qur'an. He divided the symbols into five categories and applied them to the mus'haf, where they are still written today.

Synthesis, Development and Characterization of Some Advance Matrix Materials

Phthalonitrile resins are high-temperature thermosetting polymers that are considered ideal materials for marine, aerospace, and electronic applications. The synthesis of phthalonitrile monomers that are self-catalyzing and have a large processing window (defined as the temperature range between the melting temperature of the monomer and the gelation temperature of the polymer network) is gaining importance because of ease of processing and high thermal stability. In the present study, some novel phthalonitrile resins were synthesized using ortho-linked phthalonitrile monomers and self-catalyzing phthalonitrile monomers with different linkages, such as ether, imide-ether, and amide-ether, between the reactive ends. All the synthesized monomers were characterized by spectroscopic techniques such as FT-IR, ¹H-NMR, and ¹³C-NMR. After post-curing, FT-IR absorption peaks around 1522 and 1355 cm⁻¹ and around 1010 cm⁻¹ indicated the formation of triazine and phthalocyanine rings (heterocyclic rings formed as a result of polymerization), respectively. The thermal analyses were carried out using DSC, TGA, DMA, and rheometry. DSC and rheometric studies showed that the monomers with ortho linkages have a low melting point and a high crosslinking temperature. Among the self-catalyzed monomers, those having an amino group at the ortho position, or 1,2-linked (ortho) monomers, have a broad processing window. The complex viscosity (η*) was very low (<1 Pa·s) between the melting and crosslinking temperatures, which is highly suitable for resin transfer molding, resin infusion molding, and filament winding. TGA studies revealed that the resin synthesized from the monomer with a heterocyclic ring shows high thermal stability and residual mass (char yield). The thermal stability of the polymers having ether or imide-ether linkages is nearly the same and higher than that of the polymer having an amide-ether linkage, indicating the effect of crosslinking density and structural changes. DMA measurements showed that the storage modulus (E') and glass transition temperature (Tg) increase with increasing curing temperature. These measurements also indicated that the polymers having imide-ether and amide-ether linkages have higher storage moduli than the polymers having only ether linkages.