Investigating Genomic Associations by Fusing Regression Methods on Cancer Profiles
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Kyung-Ah Sohn | - |
dc.contributor.author | VANGIMALLA, REDDY RANI | - |
dc.date.accessioned | 2018-11-08T08:18:46Z | - |
dc.date.available | 2018-11-08T08:18:46Z | - |
dc.date.issued | 2015-08 | - |
dc.identifier.other | 20200 | - |
dc.identifier.uri | https://dspace.ajou.ac.kr/handle/2018.oak/12754 | - |
dc.description | 학위논문(석사)--아주대학교 일반대학원 :컴퓨터공학과,2015. 8 | - |
dc.description.tableofcontents | TABLE OF CONTENTS Master Dissertation i ACKNOWLEDGEMENTS i ABSTRACT ii 1 Introduction 1 2 Summary Of Work 4 3 Materials and Methods 6 3.1 Data & Preprocessing 6 3.2 Methods 8 3.2.1 Least absolute shrinkage and selection operator (Lasso) 8 3.2.2 Graph Guided Fused Lasso (GFLasso) 9 3.2.3 Sparse Group Lasso (SGL) 11 3.2.4 Structured Input-Output Lasso (SIOL) 13 3.3 Fusion Method ? SNF 16 4 Results 20 4.1 Comparison Of Regression Methods 20 4.1.1 Identifying the performance of all four regression methods in terms of MSE and Density 20 4.1.2 Discovering common genomic features of all methods 24 4.2 Integrative Regression Network 28 4.2.1 Investigating combined benefits of all regression methods using similarity measurement 29 4.2.2 Genomic association network construction and study 34 4.3 Functional characterization of the affected genes using the tool DAVID 38 5 Discussion & Conclusion 41 6 Future Work 43 REFERENCES 44 | - |
dc.language.iso | eng | - |
dc.publisher | The Graduate School, Ajou University | - |
dc.rights | 아주대학교 논문은 저작권에 의해 보호받습니다. | - |
dc.title | Investigating Genomic Associations by Fusing Regression Methods on Cancer Profiles | - |
dc.type | Thesis | - |
dc.contributor.affiliation | 아주대학교 일반대학원 | - |
dc.contributor.department | 일반대학원 컴퓨터공학과 | - |
dc.date.awarded | 2015. 8 | - |
dc.description.degree | Master | - |
dc.identifier.localId | 705434 | - |
dc.identifier.url | http://dcoll.ajou.ac.kr:9080/dcollection/jsp/common/DcLoOrgPer.jsp?sItemId=000000020200 | - |
dc.subject.keyword | Computer Science | - |
dc.subject.keyword | Data Mining | - |
dc.description.alternativeAbstract | Cancer is eventually the result of cells that uncontrollably grow and do not die. Normal cells in the body follow an orderly path of growth, division, and death. When this process breaks down, cancer begins to form due to the mass abnormal cell growth. The ongoing study of gene expression with respect to multi layered genomic features is highly useful to overcome poor prognosis of cancer. Association analysis of gene expression traits with genomic features is crucial to identify the molecular mechanisms underlying cancer. Simple correlation based association tests are prone to identify more indirect genomic associations. In this study, sparse regression methods GFLasso, Lasso, SGL and SIOL were employed to discover genomic associations. The purpose of this study is to understand all pros and cons of sparse regression, structural information and grouping effects, to identify the significant cancer causing genomic associations, genomic features and expression traits. An extensive study is carried out and compared the results obtained by each regression method. The performance is analyzed for each regression method in terms of mean squared error, non-zero beta densities, computational time, etc. Association study between gene expressions and a genomic feature (methylation) was done using the regression coefficients obtained by each computational method. The study was carried out by analyzing the association pairs, strong influencing predicators (methylation features) and output variants (mRNA) of each method, on various cancer profiles, ? By combining the results of all regression types and fusing the results using similarity measurement i.e., similarity network fusion (SNF). The overall motivation is to suppress noise, but still consider the weaker genomic associations that are true positives for the study, though identifying stronger genomic associations is equally important. SNF is used for this study for fusing, as fused network captures both shared and complementary information from different data sources, using propagation effects on multiple iterations. | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.