CDCV

Visscher PM, Brown MA, McCarthy MI & Yang J 2012 Five years of GWAS discovery. Am J Hum Genet 90:7-24.

  • from McCLellan and King, Cell 20101:
  • odds ratios of the magnitude generally detected by GWAS (<1.5) can frequently be explained by cryptic population stratification, regardless of the p value associated with them
  • the common disease-common variant model has been the primary focus of human genomics over the last decade
  • if common alleles influenced common diseases, many would have been found by now
  • the issue is not how to develop still larger studies, or how to parse the data still further, but rather whether the common disease-common variant hypothesis has now been tested and found not to apply to most complex human diseases
  • from Nicholas Wade in the New York Times, March 20 2011:
  • about 2,000 sites on the human genome have been statistically linked with various diseases
  • in many cases the sites are not inside working genes, suggesting there may be some conceptual flaw in the statistics
  • it seemed that natural selection has weeded out any disease-causing mutation before it becomes common
  • from a commentary article by Jonathan Latham, on guardian.co.uk, 17 April 2011:
  • among all the genetic findings for common illnesses, such as heart disease, cancer and mental illnesses, only a handful are of genuine significance for human health
  • faulty genes rarely cause, or even mildly predispose us, to disease
  • as a consequence the science of human genetics is in deep crisis
  • since the Collins paper [Manolio et al. 20093] was published nothing has happened to change that conclusion
  • the original twin-study critics were more right than they imagined
  • the most likely explanation for why genes for common diseases have not been found is that, with few exceptions, they do not exist
  • there are now well over 2000 loci that are significantly and robustly associated with one or more complex traits
  • the vast majority of the loci identified are new
  • before 2007 their association with disease or other complex traits was not known
  • at this point in time, we can conclude that
  • many loci contribute to complex-trait variation
  • at a number of identified risk loci, there are multiple alleles associated with disease at a wide range of frequencies.
  • there is evidence for pleiotropy
  • a number of variants associated with disease or complex traits in one ethnic population are also associated the same disease or traits in other populations
  • the hypothesis76 that causal variant(s) that lead to the association between common SNPs and disease are mostly rare (say, have an allele frequency of 1% or lower) is not consistent with theoretical and empirical results
  • there is no widespread evidence for the existence of "synthetic associations"
  • a surprisingly large proportion of additive genetic variation is tagged when all SNPs are considered simultaneously
  • (1) is the GWAS approach founded on a flawed assumption that genetics plays an important role in the risk for common diseases?
  • the proportion of total variation explained by genome-wide-significant variants has reached 10%–20% for a number of diseases
  • clearly there are additional variants with such small effect sizes that they have not been detected with stringent significance
  • many of the detected loci are in biologically meaningful pathways
  • whole-genome analyses involving GWAS data have estimated that 20%–50% of phenotypic variation is captured when all SNPs are considered simultaneously for a number of complex diseases and traits
  • these estimates are based on population-wide studies and provide a lower limit of the total proportion of phenotypic variation due to genetic factors
  • these studies have provided independent evidence for the role of genetics in common diseases
  • (2) have GWASs been disappointing in not explaining more genetic variation in the population?
  • this criticism implies that the aim of GWASs is to explain all genetic variation
  • this is a misrepresentation of the objective of GWASs
  • for a number of complex traits the proportion of genetic variation uncovered by GWASs is actually substantial
  • for T2D, MS, and CD, approximately 10%, 20%, and 20%, respectively, of genetic variation in the population has been accounted for
  • apart from diseases with a known major locus (which is usually the major histocompatibility locus), the baseline of variation explained five years ago was essentially zero
  • (3) have GWASs delivered meaningful biologically relevant knowledge or results of clinical or any other utility?
  • the answer to this question is a definite "yes"
  • there have been few, if any, similar bursts of discovery in the history of medical research
  • (4) are GWAS results spurious?
  • the combination of large sample sizes and stringent significance testing has led to a large number of robust and replicable associations between complex traits and genetic variants, many of which are in meaningful biological pathways