2015 | OriginalPaper | Chapter
Discovering genomic associations on cancer datasets by applying sparse regression methods
Authors: Reddy Rani Vangimalla, Kyung-Ah Sohn
Published in: Information Science and Applications
Publisher: Springer Berlin Heidelberg
Association analysis of gene expression traits with genomic features is crucial for identifying the molecular mechanisms underlying cancer. In this study, we employ two sparse regression methods, Lasso and GFLasso, to discover genomic associations. Lasso penalizes a least squares regression by the sum of the absolute values of the coefficients, which leads to sparse solutions. GFLasso, an extension of Lasso, fuses regression coefficients across correlated outcome variables, which makes it especially suitable for gene expression traits that have an inherent network structure as output traits. Our study combines the benefits of these two computational methods and investigates the genomic associations they identify. Real genomic datasets from breast cancer and ovarian cancer patients are analyzed with the proposed approach. We show that combining the two methods has a significant impact on identifying crucial cancer-causing genomic features, including those with weaker as well as stronger associations.
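The two penalties described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The function names, the 0.5 scaling of the squared loss, the choice of proximal gradient descent (ISTA) as the Lasso solver, and the use of |r| as the edge weight in the fusion term are all assumptions made for illustration. The GFLasso function only evaluates one common form of the objective and does not optimize it.

```python
import numpy as np

def soft_threshold(z, t):
    """Proximal operator of t * ||.||_1: shrink each entry toward zero by t."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def lasso_objective(X, y, b, lam):
    """0.5 * ||y - X b||^2 + lam * ||b||_1 (the 0.5 scaling is an assumed convention)."""
    return 0.5 * np.sum((y - X @ b) ** 2) + lam * np.sum(np.abs(b))

def lasso_ista(X, y, lam, n_iter=500):
    """Approximately minimize the Lasso objective by proximal gradient descent (ISTA)."""
    step = 1.0 / (np.linalg.norm(X, 2) ** 2)  # 1 / Lipschitz constant of the gradient
    b = np.zeros(X.shape[1])
    for _ in range(n_iter):
        grad = X.T @ (X @ b - y)              # gradient of the squared-error term
        b = soft_threshold(b - step * grad, step * lam)
    return b

def gflasso_objective(X, Y, B, lam, gamma, edges):
    """Evaluate an assumed GFLasso-style objective for multiple output traits.

    B has one column of coefficients per output trait. `edges` is a list of
    (m, l, r) tuples: traits m and l are linked in the trait network with
    correlation r. The fusion term pulls B[:, m] toward sign(r) * B[:, l],
    weighted by |r|.
    """
    loss = 0.5 * np.sum((Y - X @ B) ** 2)
    l1 = lam * np.sum(np.abs(B))
    fusion = sum(
        abs(r) * np.sum(np.abs(B[:, m] - np.sign(r) * B[:, l]))
        for m, l, r in edges
    )
    return loss + l1 + gamma * fusion

if __name__ == "__main__":
    # Synthetic data (hypothetical, not from the study): 3 of 20 features are relevant.
    rng = np.random.default_rng(0)
    X = rng.standard_normal((100, 20))
    b_true = np.zeros(20)
    b_true[:3] = [2.0, -1.5, 1.0]
    y = X @ b_true + 0.1 * rng.standard_normal(100)
    b_hat = lasso_ista(X, y, lam=10.0)
    print("nonzero coefficients:", np.flatnonzero(b_hat))
```

Raising `lam` drives more coefficients to exactly zero, which is the sparsity the abstract refers to. Raising `gamma` increases the cost of coefficient differences between linked traits, which is the fusion effect.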