Random Forest Analysis of Age-Related Macular Degeneration
MetadataShow full item record
Nineteen genetic variants have been well-established as associated with risk for AMD; many of these are in genes related to atherosclerosis, angiogenesis, and the innate immune system. These known variants explain approximately 65% of the genetic basis of the disease .There is a need to develop statistical methods to identify additional genetic variants that contribute to AMD, to improve understanding of the disease etiology and enable treatments to be targeted to specific versions of the disease. For example, some evidence suggests that a treatment of antioxidants with zinc slows the progression of AMD in patients with the low-risk variant of the CFH gene, but not in patients with the high-risk variant. Our research goal is to find undiscovered genetic variants which may be associated with AMD using a two-phase approach. In phase 1 we will do a regression on previously known associated SNPs, and in phase 2 we will do a random forest on a different set of SNPs using residuals from the regression to find potential new associations. We will perform our statistical analysis first on a simulated data set and then apply this analysis to a set of real genetic data.
Department of Mathematics