Search or add a thesis

Advanced Search (Beta)
Home > Enhancing Accuracy of Urdu Sentiments Analysis, Using Lexicon-Based Approach

Enhancing Accuracy of Urdu Sentiments Analysis, Using Lexicon-Based Approach

Thesis Info

Access Option

External Link

Author

Chiragh, Neelam.

Program

PhD

Institute

University of Peshawar

City

Peshawar

Province

KPK

Country

Pakistan

Thesis Completing Year

2018

Thesis Completion Status

Completed

Subject

Social sciences

Language

English

Link

http://prr.hec.gov.pk/jspui/bitstream/123456789/9257/1/Neelam%20Chiragh_CS_2018_UoPeshawar.pdf

Added

2021-02-17 19:49:13

Modified

2024-03-24 20:25:49

ARI ID

1676724682133

Asian Research Index Whatsapp Chanel
Asian Research Index Whatsapp Chanel

Join our Whatsapp Channel to get regular updates.

Similar


In this research the accuracy of Urdu Sentiment Analysis in multiple domains is enhanced by using the Lexicon-based approach. In the lexicon, apart from the traditional approach that considers adjectives only, nouns and verbs are also included. An efficient Urdu Sentiment Analyzer is developed that applies rules and makes use of this new lexicon to perform Urdu Sentiment Analysis by classifying sentences as positive, negative or neutral. Negations, intensifiers and context-depentent words are effectively handled for enhancing accuracy of Urdu Sentiment Analyzer. Specific rules for handling negations, intensifiers and context-dependent words are incorporated in Urdu Sentiment Analyzer. For testing the Lexicon-based approach, a corpus of 6025 sentences from 151 blogs belonging to 14 different genres is collected and the sentences are annotated by three human annotators to classify each sentence as positive, negative and neutral. Evaluating this Urdu Sentiment Analyzer, by using sentences from the corpus, yields the most promising results so far in Urdu language (up to the knowledge of the author) with 89.03% accuracy, 0.86 precision, 0.90 recall and 0.88 f-measure. The comparison with the previous works in Urdu Sentiment Analysis shows that the combination of this Urdu Sentiment Lexicon and Urdu Sentiment Analyzer is much more effective than the previous such combinations. The main reason for increased efficiency is the development of wide coverage lexicon and effective handling of negations, intensifiers and context-dependent words by the Urdu Sentiment Analyzer. Although high accuracy is achieved by Lexicon-based approach in multiple domains for Urdu Sentiment Analysis, which is the main objective of this research, but for comparison, Supervised Machine Learning approach is also used. Three well known classifiers that are Support Vector Machine, Decision Tree and K Nearest Neighbor are tested; their outputs are compared and their results are ultimately improved in several iterations. It is further concluded that K Nearest Neighbor is performing better than Support Vector Machine and Decision Tree. For verification of this result, three evaluation measures i.e. McNemar’s Test, Kappa Statistic and Root Mean Squared Error are used. The result from all these three evaluation measures confirmed that K Nearest Neighbor is performing much better than the other two classifiers and achieved 67.02% accuracy, 0.68, 0.67 and 0.67 precision, recall and f-measure respectively. The results from both the approaches are compared. On the basis of experiments performed in this research, it is concluded that the Lexicon-based approach outperforms Supervised Machine Learning approach, when Urdu Sentiment Analysis is performed in multiple domains in terms of accuracy, precision, recall and f-measure, economy of time and effort.
Loading...
Loading...

Similar Books

Loading...

Similar Chapters

Loading...

Similar News

Loading...

Similar Articles

Loading...

Similar Article Headings

Loading...

منصف ہاشمی کی نثری نظمیں

منصف ہاشمی کی نثری نظمیں
نثر اور نظم میں کیا فرق ہے؟ نوراللغات میں ’’نثر‘‘ کی تشریح ان الفاظ میں کی گئی ہے۔ ’’وہ عبارت جو نظم نہ ہو‘‘۔ یعنی لفظ ’’نثر ‘‘ کی اپنی کوئی حقیقت نہیں ہے۔ اسے’’ نظم‘‘کے منفیانہ یا تنسیخی معانی سے ہی پہچانا جائے۔۔۔ نثر کے لغوی معنی ہیں :’’پراگندہ‘‘، ’’بکھرا ہوا‘‘ ۔اس کی صفات میں’’ خشک‘‘، ’’غیر شاعرانہ‘‘ وغیرہ الفاظ تقریباً ہر لغت میں پائے جاتے ہیں۔ نثر کو نظم سے قریب ترلانے کے لیے جو حربے استعمال کیے گئے ان میں جملوں کے آخری الفاظ کا مقفیٰ ہونا شرطِ اول تھی۔ گویا نثر پر نظم کو مسلط کرنا شرطِ اول تھی، نہ کہ نظم پر نثر کی فوقیت کو جتانا۔ درحقیقت نثر نگاری دوسرے درجے کی ادبی کاوش ہے جب کہ نظم گوئی سرِ فہرست تھی۔ بیسویں صدی کے آخری تیس چالیس برسوں میں ’’نثری نظم ‘‘ کو ادبی جریدوں میں جگہ ملنی شروع ہوئی۔ لیکن اردو نے کبھی اس بات کو تسلیم نہیں کیا کہ’’ غزل گو‘‘یا ’’نظم گو‘‘کی جگہ ’’غزل نویس‘‘یا ’’نظم نویس‘‘ بطور اصطلاح تسلیم کیا جائے۔ ’’سخن‘‘ کا مطلب ’’بات‘‘ نہیں بلکہ’’موزوں بات‘‘ تسلیم کیا گیا ۔ا س کے لوازمات میں آہنگ ، لہجہ (صوت) ،زحافات کو صفِ اول میں جگہ دی گئی۔
منصف ہاشمی کو فیس بک اور رسائل کی وساطت سے میں دو دہائیوں سے پڑھ رہا ہوں۔ ارکان اور زحافات سے معرا ہونے کے باوجود ان کا آہنگ ایسے بیانیہ پر مبنی ہے جس میں نظم کی خصوصیات موجود ہیں۔ ترصیع، تجنیس، سجع، آہنگ اور سب سے بڑھ کر امیجری، شعر یات کے تقاضوں کو پورا کرتی ہیں۔ مترنم نہ ہونے کے باوجود ان کا بیانیہ شاعرانہ غنایت کا حامل ہے۔ ان کی نظمیں مضمون نگاری کے حوالے سے خیال بندی اور معاملہ بندی کی شرائط پر بھی پورا اترتی ہیں۔
موضوعات کے...

Phase-dependent expression profiling and quantification of several growth factors in liver regeneration after partial hepatectomy

Growth factors are the potential operational members which control different phases of liver regeneration. Different growth factors have expression regulation in the whole process relating to different phases of liver regeneration. Objective: To assess the expression regulation of different growth factors and cytokines involved in liver regeneration in a phase-dependent manner. Methods: Blood and liver samples were collected and analyzed on 1st, 3rd, 5th, 7th and 14th postoperative days after 50% Partia hepatectomy (PHx). Results: Steady increase of liver regeneration rate was recorded from 90.8% (1st day) to 97.9% (7th day). Liver function tests further confirmed the steady liver recovery in PHx mice. Several growth factors such as HGF and VEGF exhibited an up-regulation till 5th day and later gradual decrease till 14th day compared to control mice. Albumin, CK18 and CK19 showed sequential expression increase from 1st to 14th day compared to AFP and HNF-4α upregulated until 5th and 1st day, respectively. Quantification of these growth factors further confirm our results. Conclusions: Conclusively, these results highlight a phase-dependent regulation and role of growth factors in liver regeneration and recovery

Ale Moving Mesh Generation and High Performance Implementation Using Openmp and Mpi Libraries for Fsi and Darcy Flow Problems

A high performance algorithm for the implementation of Arbitrary Lagrangian and Eulerian (ALE) moving mesh scheme for both 2D and 3D Fluid Structure Interaction (FSI) problems for the shared and distributed memory systems is discussed in the the- sis. OpenMP library is used to implement parallel programs on shared memory systems whereas message passing interface (MPI) is employed to write parallel programs on dis- tributed memory systems. Moving mesh techniques are the integral part of a wider class of fluid mechanics problems that involve moving and deforming spatial domains, namely, free-surface flows and FSI. The moving mesh technique adopted in this work is based on the notion of nodes relocation, subjected to certain evolution as well as constraint conditions. A conjugate gradient method augmented with a preconditioning is employed for the solution of the resulting system of equations. The proposed algorithm, firstly, re- orders and partitions the mesh using an efficient divide and conquer approach and then parallelizes the ALE moving mesh. Different mesh partitioning algorithms are discussed, which include the octree method, and k-way graph partitioning technique using Parmetis library. Numerical simulations are conducted on AMD Opteron and Intel Xeon processors, and unstructured triangular and tetrahedral meshes are used for the 2D and 3D prob- lems. The better results, in terms of the speedup, are obtained for the shared memory system than the distributed memory system for both the 2D and 3D problems. The quality of meshes is checked by comparing the element Jacobians in the reference and current meshes, and by keeping track of the change in the interior angles in triangles and tetrahedrons. The proposed parallel mesh reordering algorithm using sampling approach for work load re-distribution concluded 51% of average efficiency in term of the speedup for shared memory systems. The overall maximum speedup of 6.37, for the shared mem- ory system, is achieved using eight processing elements (PEs) as compared to 4.11 for the distributed memory system including twelve PEs. As a case study, the thesis also discusses the high performance implementation of a stabilized mixed finite element method for Darcy flow using MPI library. It has a lot of practical applications in the field of petroleum engineering and earth sciences especially, where the flow of fluid is of interest in a permeable porous medium. The maximum speedup of 12.24 is achieved using 28 PEs by incorporating the proposed mesh partition- ing algorithm. Outline Chapter 1 defines and introduces the problem statement and Chapter 2 gives the gen- eral introduction of the thesis. Chapter 3 presents the literature review of ALE moving mesh generation, stabilized mixed finite element methods, k-way graph partitioning algo- rithm and tree based spatial data structures. Chapter 4 mathematically formulates the ALE mesh generation problem and presents the serial algorithm for optimization using the preconditioned conjugate gradient method. Chapter 5 presents a mesh reordering algorithm based on quadtree/octree and quick sort techniques. Chapter 6 discusses the parallelization part of mesh reordering algorithm based on a sampling approach and also discusses the experimental results for the shared memory systems. Detailed discussion about the mesh partitioning and experimental results using MPI are given in Chapter 7. Chapter 8 briefly describes the stabilized finite element method for Darcy Flow and dis- cusses the results of 2D problems for a distributed memory system. Finally, conclusions are drawn in Chapter 9 and future work is presented in Chapter 10.