Modeling Multivariate Biomedical Data

Thesis Info

Access Option

External Link

Author

Nawaz, Uzma

Program

PhD

Institute

Bahauddin Zakariya University

City

Multan

Province

Punjab

Country

Pakistan

Thesis Completing Year

2013

Thesis Completion Status

Completed

Subject

Mathematics

Language

English

Link

http://prr.hec.gov.pk/jspui/handle/123456789/1546

Added

2021-02-17 19:49:13

Modified

2024-03-24 20:25:49

ARI ID

1676726657146



The study addresses the importance of having biomedical data analyzed by the statistical community in collaboration with experts from the biomedical field. Such data carry their own constraints and difficulties: they are privacy-sensitive, heterogeneous, and voluminous. The mathematical understanding of their patterns and structures, and the estimation procedures they require, may be fundamentally different from those for data collected in other fields. For this purpose, the complex genomic leukemia data of Golub et al. (1999) were selected for the study. This dataset comes from a study of gene expression in two types of acute leukemia, acute lymphoblastic leukemia (ALL) and acute myeloid leukemia (AML). The training data set consisted of 38 bone marrow samples, 27 of which were taken from ALL patients (19 B-ALL and 8 T-ALL) and 11 of which were taken from AML patients. Each gene expression value is the quantitative level of messenger RNA found in the cells. Understanding the genetic underpinnings of disease is important for screening, treatment, drug development, and basic biological insight. Exploring genomic data has therefore drawn on mathematical, statistical, and computational methods to discover meaningful genetic relationships from large-scale measurements of genes. This is a continuously growing area that is constantly being seeded with new approaches and interpretations. Most of this new material is easily accessible given a familiarity with basic genetics and multivariate statistics. The application of multivariate techniques needs a thorough study of the data in hand, and the primary objective of the study has been to “let the data speak for itself”. For the proper interpretation of these data, researchers in experimental and computational genomics need a firm grasp of statistical methodology, an aspect of prime importance that was keenly taken into consideration in the study.
For the multivariate genomic leukemia data, an initial exploratory data analysis was performed using histograms and box plots in conjunction with one another. This revealed that the data set is fitted well by extreme value distributions, a finding that, apart from the present study, has not been reported in the literature for this data type. The fitting of extreme value distributions opens many new avenues for researchers working with such data. Another outcome of the exploratory data analysis is the application of an appropriate transformation (the classical Box-Cox transformation) to deal with the sharp skewness of the data, rather than relying only on the traditionally used logarithmic transformation. The appropriate data transformation was another high point in the application of PCA for visualizing the clusters present in the data set. Previously, PCA and more complicated techniques such as SOM and SVM have been applied, and new adaptations of these are continuously being tried alongside the traditional clustering methodologies. Here the focus has not been merely on applying multivariate techniques to locate the clusters predefined by biological knowledge; rather, it is on finding the methodologically simple yet most appropriate technique to apply after a thorough look into the interior of the data set. The data set revealed a patterned correlation matrix, which in itself explained the number and configuration of clusters. This provided the groundwork for applying PCA to the Box-Cox transformed data, using the patterned correlation matrix as the interrelationship matrix. A comparison was also made with other interrelationship matrices. The resulting cluster structure was clear, showed no misclassification in the configuration of clusters, and coincided exactly with the prior biological knowledge.
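The transformation and PCA steps described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the thesis's actual procedure: the function names, the coarse lambda grid, the toy numbers, and the use of power iteration to extract the leading component are all hypothetical choices made for the example, and only the Python standard library is used.

```python
import math


def box_cox(values, lam):
    """Box-Cox transform of strictly positive values for a given lambda."""
    if abs(lam) < 1e-12:
        return [math.log(v) for v in values]  # limiting case lambda -> 0
    return [(v ** lam - 1.0) / lam for v in values]


def profile_log_likelihood(values, lam):
    """Box-Cox profile log-likelihood, up to an additive constant."""
    y = box_cox(values, lam)
    n = len(y)
    mean = sum(y) / n
    var = sum((t - mean) ** 2 for t in y) / n  # maximum-likelihood variance
    jacobian = (lam - 1.0) * sum(math.log(v) for v in values)
    return -0.5 * n * math.log(var) + jacobian


def choose_lambda(values, grid=None):
    """Pick the lambda on a coarse grid that maximises the profile likelihood."""
    if grid is None:
        grid = [i / 10.0 for i in range(-20, 21)]  # -2.0 .. 2.0, step 0.1 (assumed)
    return max(grid, key=lambda lam: profile_log_likelihood(values, lam))


def correlation_matrix(columns):
    """Pearson correlation matrix of a list of equal-length columns."""
    n = len(columns[0])
    centred = []
    for col in columns:
        m = sum(col) / n
        centred.append([v - m for v in col])
    norms = [math.sqrt(sum(v * v for v in c)) for c in centred]
    p = len(columns)
    return [
        [sum(a * b for a, b in zip(centred[i], centred[j])) / (norms[i] * norms[j])
         for j in range(p)]
        for i in range(p)
    ]


def leading_component(matrix, iterations=500):
    """Leading eigenvalue and eigenvector of a symmetric matrix by power iteration.

    The start vector must not be orthogonal to the leading eigenvector;
    the uneven start used here is a heuristic, not a guarantee.
    """
    p = len(matrix)
    vec = [1.0 / (i + 1) for i in range(p)]
    for _ in range(iterations):
        nxt = [sum(matrix[i][j] * vec[j] for j in range(p)) for i in range(p)]
        norm = math.sqrt(sum(v * v for v in nxt))
        vec = [v / norm for v in nxt]
    eigenvalue = sum(
        vec[i] * sum(matrix[i][j] * vec[j] for j in range(p)) for i in range(p)
    )
    return eigenvalue, vec


if __name__ == "__main__":
    # Hypothetical skewed "expression" values for three samples (not real data).
    samples = [
        [1.2, 3.5, 0.8, 15.0, 2.2, 40.0],
        [1.0, 4.1, 0.9, 12.5, 2.0, 55.0],
        [9.0, 0.7, 30.0, 1.1, 6.5, 0.5],
    ]
    transformed = [box_cox(s, choose_lambda(s)) for s in samples]
    corr = correlation_matrix(transformed)
    eigenvalue, loadings = leading_component(corr)
    print("leading eigenvalue:", eigenvalue)
    print("loadings:", loadings)
```

Working from the correlation matrix rather than the covariance matrix puts every sample on the same scale, which is one plausible reason for treating it as the interrelationship matrix; a full analysis would extract all components rather than only the leading one.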
As hoped, this introduction to prototypical methods for studying the data and interpreting them in the context of biological genomic knowledge proved a successful starting point. The next immediate issue in the study of the biomedical genomic data was to find genes that may be specific to one leukemia type or cluster. The initial exploratory data analysis exposed certain data values that were of prime biological significance and played a statistically significant role in specifying the genes for each defined cluster or leukemia type. As a result, a criterion was developed from the data set that classifies each gene into a single specific cluster, into two of the three clusters, or into all three clusters (the common genes). A classified data set of the most variant genes across all the samples was then taken as a training data set. Based on this classified grouping, a linear discriminant analysis was successfully performed to find the discriminating genes for each specific leukemia type, with a 99.97% probability of correct classification. The collections of discriminating genes from the three clusters then needed to be checked against the previously found, externally valid cluster structure. PCA was therefore applied in a new role, as a check on the discriminating genes. For the discriminating genes, the clusters formed by the sample expression profiles were expected to be distinctly clear if the genes were to be termed leukemia-type specific or cluster specific. The clusters formed were indeed clearly distinguishable from one another, in contrast to the clusters of sample expression profiles built from the genes common to all clusters. The latter showed no distinct cluster, only a single large mass that did not show any difference between the biologically different leukemia types. The two major issues of the biomedical genomic data have thus been addressed successfully with an appropriate proposed model for the data type.
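The discriminant step can be illustrated with a two-class Fisher linear discriminant. This is a simplified sketch, assuming equal priors and a shared within-class scatter; the thesis works with three clusters and real gene-expression values, whereas the function names and the two-feature toy points below are hypothetical.

```python
def mean_vector(rows):
    """Column-wise mean of a list of equal-length rows."""
    n = len(rows)
    return [sum(r[k] for r in rows) / n for k in range(len(rows[0]))]


def within_scatter(groups):
    """Pooled within-class scatter matrix over all groups."""
    p = len(groups[0][0])
    scatter = [[0.0] * p for _ in range(p)]
    for rows in groups:
        m = mean_vector(rows)
        for r in rows:
            d = [r[k] - m[k] for k in range(p)]
            for i in range(p):
                for j in range(p):
                    scatter[i][j] += d[i] * d[j]
    return scatter


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("singular within-class scatter matrix")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x


def fisher_lda(class0, class1):
    """Return (weights, threshold) of the two-class Fisher discriminant."""
    m0, m1 = mean_vector(class0), mean_vector(class1)
    sw = within_scatter([class0, class1])
    diff = [b - a for a, b in zip(m0, m1)]
    w = solve(sw, diff)  # w = Sw^{-1} (m1 - m0)
    midpoint = [(a + b) / 2.0 for a, b in zip(m0, m1)]
    threshold = sum(wi * mi for wi, mi in zip(w, midpoint))
    return w, threshold


def predict(w, threshold, x):
    """Assign class 1 when the projection exceeds the midpoint threshold."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) > threshold else 0


if __name__ == "__main__":
    # Hypothetical two-feature points for two well-separated groups.
    group0 = [[1.0, 2.0], [2.0, 1.0], [2.0, 3.0], [3.0, 2.0]]
    group1 = [[6.0, 7.0], [7.0, 6.0], [7.0, 8.0], [8.0, 7.0]]
    w, t = fisher_lda(group0, group1)
    print("weights:", w, "threshold:", t)
    print("predictions:", [predict(w, t, x) for x in group0 + group1])
```

The features with the largest absolute weights contribute most to separating the groups, which is the sense in which a discriminant analysis can point to discriminating variables; with more variables than observations, as is typical for gene-expression data, the scatter matrix becomes singular and some form of variable selection or regularisation is needed first.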
The study has thus been based on methodologically simple yet appropriate statistical techniques for this data type, filling, for the very first time, a gap left for the Pakistani statistical community in an internationally important field, the genomic biomedical field. The results are unequivocal: simplest is best! The approach can cluster genes, cell samples, or both. Yet the study has also opened many new dimensions that need to be explored in order to establish the relationship between an experiment-based leukemia class and its subclass and a clinical outcome. Because the data have many dimensions, concentrating precisely on a few of them was a difficult task, yet it was accomplished.

Sir Akbar Hydari

It is with regret that we record that Sir Akbar Hydari, former Prime Minister (Sadr-e-Azam) of Hyderabad State, passed away in Delhi on the evening of 8 January 1942. As Finance Minister and later Prime Minister of the largest Islamic state in India, Sir Akbar Hydari maintained a special patron's relationship with all Islamic institutions, and for this reason his death is a source of grief and sorrow for all of us. He was 72 years old at the time, yet even at that age the dedication, industry, and alertness of mind with which he carried out the duties entrusted to him were proof that he was a man of extraordinary heart and intellect. May Allah Almighty grant him forgiveness. (Syed Sulaiman Nadvi, January 1942)

 

DESAIN DAN ANALISIS PEKERJAAN DALAM MANAJEMEN PERSONALIA (Job Design and Analysis in Personnel Management)

Every job design influences the employee's attitude toward the task at hand. Job design also determines working relations among employees and between employees and superiors, the types of work performed, and socio-economic relations. Through job design, the content, functions, and relations of work are changed so that work can be completed in line with targets and to the satisfaction of superiors. Job design and analysis covers the authority of the office holder, the routine performance of duties, the identity of the holder, and the involvement of the incumbent. Job functions include the work methods used, work coordination, responsibility, information flow, and work authority. Employment relations, in turn, include the joint work activities of officials and all employees within one agency. Keywords: official management, design work, design

Preparedness of University Students for the Challenges of Globalization

This study focused on the preparedness of university students for the challenges of globalization. ICT, being the driving force of globalization, was taken as the main theme of the research. Information and Communication Technology is a vital part of the modern curriculum at all levels of education. In this age of modern technology, a deficiency of ICT skills among teachers and learners can weaken the whole education system of a country. The objectives of the study were: a) to examine the facilities and resources available at university campuses for developing ICT skills among prospective teachers; b) to investigate the ICT competencies of prospective teachers at university level; and c) to investigate the communication and collaboration skills of prospective teachers studying at university level. The population of the study comprised all the prospective teachers studying in B.Ed. Hons, M.A. Education, and M.Ed. programs, together with teacher educators, from all the public sector universities of KP, Punjab, and the Federal area of Pakistan. From 12 universities of the target area, 480 prospective teachers and 24 teacher educators were selected as the sample using random sampling. A mixed-method research design was used. A questionnaire probing the ICT-related resources, facilities, and competencies of prospective teachers was used to collect data from prospective teachers, and a semi-structured interview exploring the ICT-related problems and difficulties of teacher educators was also used. Frequencies, percentages, and the Chi-square test were used to analyze the data. The findings revealed that facilities such as personal computers, interactive whiteboards, laptops, printers, and multimedia were available to the majority of respondents at university campuses.
The majority of prospective teachers had access to the internet and an educational portal, and they had the facility of submitting and getting back their assignments. On the other hand, the majority of respondents lacked facilities such as virtual libraries, virtual classrooms, scanners, satellite cable, and many others. The findings indicated that the majority of respondents were able to use a computer, multimedia projector, laptop, and interactive whiteboard, while the majority of prospective teachers were not able to use an online library, virtual library, virtual classroom, satellite cable, virtual resource centre, scanner, bulletin board, or printer. The findings also indicated that the majority of respondents were not able to use educational portals or teachers' and students' websites. The majority of prospective teachers perceived that they were able to use MS Word completely. The majority were able to open, close, and save an Excel file. They were proficient at entering and editing a worksheet, changing column widths and row heights, and developing formulas and functions in an Excel sheet. The majority of prospective teachers perceived that they could open, close, and save files in PowerPoint. They were able to create a new presentation and to use the default template and many other functions. The majority perceived that they were able to use the internet, to search for information with a search engine such as Google, and to refresh a web page. They were able to download files from the internet, to send and receive email with and without attachments, and to organize websites. The findings revealed that the majority of the prospective teachers were able to communicate and collaborate with culturally distinct counterparts. They were able to be social and confident while interacting with people from different cultures, and they showed respect for and openness to the values of people from different cultures.
The findings showed that teacher educators faced various problems, such as the energy crisis, ethical issues, the lack of connection between university education and the world of work, global competition, students' low capabilities, and many others. It is concluded that prospective teachers were proficient or average in those areas of ICT for which facilities were available, while they were deficient in the skills for ICT applications that were missing at their institutions. The prospective teachers were not able to use the scanner, bulletin board, online lab, and virtual classroom. To equip students with ICT skills, virtual resource centers may be established at university campuses. Teacher educators may be trained in ICT skills. Workshops and seminars may be arranged on a regular basis.