Read more
This book presents the statistical aspects of designing, analyzing and interpreting the results of genome-wide association scans (GWAS studies) for genetic causes of disease using unrelated subjects. Particular detail is given to the practical aspects of employing the bioinformatics and data handling methods necessary to prepare data for statistical analysis. The goal in writing this book is to give statisticians, epidemiologists, and students in these fields the tools to design a powerful genome-wide study based on current technology. The other part of this is showing readers how to conduct analysis of the created study.
Design and Analysis of Genome-Wide Association Studies provides a compendium of well-established statistical methods based upon single SNP associations. It also provides an introduction to more advanced statistical methods and issues. Knowing that technology, for instance large scale SNP arrays, is quickly changing, this text has significant lessons for future use with sequencing data. Emphasis on statistical concepts that apply to the problem of finding disease associations irrespective of the technology ensures its future applications. The author includes current bioinformatics tools while outlining the tools that will be required for use with extensive databases from future large scale sequencing projects. The author includes current bioinformatics tools while outlining additional issues and needs arising from the extensive databases from future large scale sequencing projects.
List of contents
Introduction to Genome-Wide Association Studies.- Topics of Quantitative Genetics.- An Introduction to Association Studies.- Correcting for Hidden Population Structure in Single Marker Association Testing and Estimation.- Haplotype Imputation for Association Analysis.- SNP Imputation for Association Studies.- Design of Large-scale Genetic Association Studies, Sample Size and Power.- Post-GWAS Analyses.
About the author
Dr. Stram's interests center on the application of modern statistical methods to epidemiologic studies: his research includes methods in longitudinal analysis, meta-analysis, survival analysis, and the analysis of errors in exposure measurement and he is well known for his work on the Atomic Bomb survivors study, the Colorado Plateau Uranium Miners study, the Multiethnic Cohort study and on clinical trials of childhood cancer. He has been an investigator and collaborator on many large scale genetic association studies with an emphasis on multi-ethnic analyses, and has published widely on haplotype analysis, analysis of hidden population structure, and the design of multi-stage genotyping and genome-wide association studies.
Summary
This book presents the statistical aspects of designing, analyzing and interpreting the results of genome-wide association scans (GWAS studies) for genetic causes of disease using unrelated subjects. Particular detail is given to the practical aspects of employing the bioinformatics and data handling methods necessary to prepare data for statistical analysis. The goal in writing this book is to give statisticians, epidemiologists, and students in these fields the tools to design a powerful genome-wide study based on current technology. The other part of this is showing readers how to conduct analysis of the created study.
Design and Analysis of Genome-Wide Association Studies provides a compendium of well-established statistical methods based upon single SNP associations. It also provides an introduction to more advanced statistical methods and issues. Knowing that technology, for instance large scale SNP arrays, is quickly changing, this text has significant lessons for future use with sequencing data. Emphasis on statistical concepts that apply to the problem of finding disease associations irrespective of the technology ensures its future applications. The author includes current bioinformatics tools while outlining the tools that will be required for use with extensive databases from future large scale sequencing projects. The author includes current bioinformatics tools while outlining additional issues and needs arising from the extensive databases from future large scale sequencing projects.