RESEARCH ARTICLE


Genetic Studies: The Linear Mixed Models in Genome-wide Association Studies



Gengxin Li1, *, Hongjiang Zhu2
1 Department of Mathematics and Statistics, Wright State University, 201 MM, 3640 Colonel Glenn Highway, Dayton, OH 45435-0001
2 Division of Biostatistics, Coordinating Center for Clinical Trials, The University of Texas School of Public Health at Houston


Article Metrics

CrossRef Citations:
14
Total Statistics:

Full-Text HTML Views: 1372
Abstract HTML Views: 2180
PDF Downloads: 3330
Total Views/Downloads: 6882
Unique Statistics:

Full-Text HTML Views: 799
Abstract HTML Views: 1218
PDF Downloads: 2510
Total Views/Downloads: 4527



Creative Commons License
© 2013 Li and Zhu

open-access license: This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International Public License (CC-BY 4.0), a copy of which is available at: https://creativecommons.org/licenses/by/4.0/legalcode. This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

* Address correspondence to this author at the Department of Mathematics and Statistics, Wright State University, 259 MM, 3640 Colonel Glenn Highway, Dayton, OH 45435-0001; Tel: 937-775-4211: E-mail: gengxin.li@wright.edu


Abstract

With the availability of high-density genomic data containing millions of single nucleotide polymorphisms and tens or hundreds of thousands of individuals, genetic association study is likely to identify the variants contributing to complex traits in a genome-wide scale. However, genome-wide association studies are confounded by some spurious associations due to not properly interpreting sample structure (containing population structure, family structure and cryptic relatedness). The absence of complete genealogy of population in the genome-wide association studies model greatly motivates the development of new methods to correct the inflation of false positive. In this process, linear mixed model based approaches with the advantage of capturing multilevel relatedness have gained large ground. We summarize current literatures dealing with sample structure, and our review focuses on the following four areas: (i) The approaches handling population structure in genome-wide association studies; (ii) The linear mixed model based approaches in genome-wide association studies; (iii) The performance of linear mixed model based approaches in genome-wide association studies and (iv) The unsolved issues and future work of linear mixed model based approaches.

Keywords: Genetic similarity matrix, genome-wide association study (GWAS), linear mixed model (LMM), population stratification, sample structure, single nucleotide polymorphisms (SNPs).