1,721,676 research outputs found

    Using ancestry-informative markers to identify fine structure across 15 populations of European origin

    No full text
    The Wellcome Trust Case Control Consortium 3 anorexia nervosa genome-wide association scan includes 2907 cases from 15 different populations of European origin genotyped on the Illumina 670K chip. We compared methods for identifying population stratification, and suggest list of markers that may help to counter this problem. It is usual to identify population structure in such studies using only common variants with minor allele frequency (MAF) >5%; we find that this may result in highly informative SNPs being discarded, and suggest that instead all SNPs with MAF >1% may be used. We established informative axes of variation identified via principal component analysis and highlight important features of the genetic structure of diverse European-descent populations, some studied for the first time at this scale. Finally, we investigated the substructure within each of these 15 populations and identified SNPs that help capture hidden stratification. This work can provide information regarding the designing and interpretation of association results in the International Consortia

    A genome-wide association study of anorexia nervosa

    No full text
    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome-wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2907 cases with AN from 14 countries (15 sites) and 14 860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery data sets. Seventy-six (72 independent) single nucleotide polymorphisms were taken forward for in silico (two data sets) or de novo (13 data sets) replication genotyping in 2677 independent AN cases and 8629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication data sets comprised 5551 AN cases and 21 080 controls. AN subtype analyses (1606 AN restricting; 1445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01 × 10(-7)) in SOX2OT and rs17030795 (P=5.84 × 10(-6)) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76 × 10(-)(6)) between CUL3 and FAM124B and rs1886797 (P=8.05 × 10(-)(6)) near SPATA13. Comparing discovery with replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4 × 10(-6)), strongly suggesting that true findings exist but our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field.Molecular Psychiatry advance online publication, 11 February 2014; doi:10.1038/mp.2013.187

    Genetic risk factors for ischaemic stroke and its subtypes (the METASTROKE Collaboration): a meta-analysis of genome-wide association studies

    Full text link
    <p>Background - Various genome-wide association studies (GWAS) have been done in ischaemic stroke, identifying a few loci associated with the disease, but sample sizes have been 3500 cases or less. We established the METASTROKE collaboration with the aim of validating associations from previous GWAS and identifying novel genetic associations through meta-analysis of GWAS datasets for ischaemic stroke and its subtypes.</p> <p>Methods - We meta-analysed data from 15 ischaemic stroke cohorts with a total of 12 389 individuals with ischaemic stroke and 62 004 controls, all of European ancestry. For the associations reaching genome-wide significance in METASTROKE, we did a further analysis, conditioning on the lead single nucleotide polymorphism in every associated region. Replication of novel suggestive signals was done in 13 347 cases and 29 083 controls.</p> <p>Findings - We verified previous associations for cardioembolic stroke near PITX2 (p=2·8×10−16) and ZFHX3 (p=2·28×10−8), and for large-vessel stroke at a 9p21 locus (p=3·32×10−5) and HDAC9 (p=2·03×10−12). Additionally, we verified that all associations were subtype specific. Conditional analysis in the three regions for which the associations reached genome-wide significance (PITX2, ZFHX3, and HDAC9) indicated that all the signal in each region could be attributed to one risk haplotype. We also identified 12 potentially novel loci at p<5×10−6. However, we were unable to replicate any of these novel associations in the replication cohort.</p> <p>Interpretation - Our results show that, although genetic variants can be detected in patients with ischaemic stroke when compared with controls, all associations we were able to confirm are specific to a stroke subtype. This finding has two implications. First, to maximise success of genetic studies in ischaemic stroke, detailed stroke subtyping is required. Second, different genetic pathophysiological mechanisms seem to be associated with different stroke subtypes.</p&gt

    Adiposity-Related Heterogeneity in Patterns of Type 2 Diabetes Susceptibility Observed in Genome-Wide Association Data

    Full text link
    OBJECTIVE-This study examined how differences in the BMI distribution of type 2 diabetic case subjects affected genome-wide patterns of type 2 diabetes, association mid considered the implications for the etiological heterogeneity of type 2 diabetes.RESEARCH DESIGN AND METHODS-We reanalyzed data from the Wellcome Trust Case Control Consortium genome-wide association scan (1,924 case subjects, 2,938 control Subjects: 393,453 single-nucleotide polymorphisms [SNPs]) after stratifying case subjects (into "obese" and "nonobese") according to median BMI (30.2 kg/m(2)). Replication. of signals in which alternative case-ascertainment strategies generated marked effect size heterogeneity in type 2 diabetes association signal was sought in additional samples.RESULTS-In the "obese-type 2 diabetes" scan, FTO variants had the strongest type 2 diabetes effect, (rs8050136: relative risk [RR] 1.49 [95% CI 1.34-1.66], P = 1.3 X 10(-13)), with only weak evidence for TCF7L2 (rs7901695 RR 1.21 [1.09-1.35], P = 0.001). reversed in the "nonobese" scan, with FTO This situation was association undetectable (RR 1.07 [0.97-1.19], P = 0.19) and TCF7L2 predominant (RR 1.53 [1.37-1.71], P = 1.3 X 10(-14)). These patterns, confirmed by replication, generated strong combined evidence for between-stratum effect size heterogeneity (FTO: P-DIFF = 1.4 X 10(-7); TCF7L2: P-DIFF = 4.0 X 10(-6)). Other signals displaying evidence of effect size heterogeneity in the gencome-wide analyses (on chromosomes 3, 12, 15, and 18) did not, replicate. Analysis of the current list of type 2 diabetes susceptibility variants revealed nominal evidence for effect size heterogeneity for the SLC30A8 locus alone (RRobese 1.08 [1.01-1.15]; RRnonobese 1.18 [1.10-1.27]: P-DIFF = 0.04).CONCLUSIONs-This study demonstrates the impact of differences in case ascertainment on the power to detect and replicate genetic associations in genome-wide association studies. These data reinforce the notion that there is substantial etiological heterogeneity within type 2 diabetes. Diabetes 58:505-510, 2009</p

    Analysis of differences in human leukocyte antigen between the two wellcome trust case control consortium control datasets

    No full text
    © 2019, Korea Genome Organization.The Wellcome Trust Case Control Consortium (WTCCC) study was a large genome-wide association study that aimed to identify common variants associated with seven diseases. That study combined two control datasets (58C and UK Blood Services) as shared controls. Prior to using the combined controls, the WTCCC performed analyses to show that the genomic content of the control datasets was not significantly different. Recently, the analysis of human leukocyte antigen (HLA) genes has become prevalent due to the development of HLA imputation technology. In this project, we extended the between-control homogeneity analysis of the WTCCC to HLA. We imputed HLA information in the WTCCC control dataset and showed that the HLA content was not significantly different between the two control datasets, suggesting that the combined controls can be used as controls for HLA fine-map-ping analysis based on HLA imputation.Y

    A genome-wide association study provides evidence of sex-specific involvement of Chr1p35.1 (<i>ZSCAN20-TLR12P</i>) and Chr8p23.1 (<i>HMGB1P46</i>) with diabetic neuropathic pain

    Full text link
    Neuropathic pain is defined as pain arising as a direct consequence of a lesion or a disease affecting the somatosensory system and it affects around 1 in 4 diabetic patients in the UK. The purpose of this genome-wide association study (GWAS) was to identify genetic contributors to this disorder. Cases of neuropathic pain were defined as diabetic patients with a multiple prescription history of at least one of five drugs specifically indicated for the treatment of neuropathic pain. Controls were diabetic individuals who were not prescribed any of these drugs, nor amitriptyline, carbamazepine, or nortriptyline. Overall, 961 diabetic neuropathic pain cases and 3260 diabetic controls in the Genetics of Diabetes Audit and Research Tayside (GoDARTS) cohort were identified. We found a cluster in the Chr1p35.1 (ZSCAN20-TLR12P) with a lowest P value of 2.74 × 10−7 at rs71647933 in females and a cluster in the Chr8p23.1, next to HMGB1P46 with a lowest P value of 8.02 × 10−7 at rs6986153 in males. Sex-specific narrow sense heritability was higher in males (30.0%) than in females (14.7%). This GWAS on diabetic neuropathic pain provides evidence for the sex-specific involvement of Chr1p35.1 (ZSCAN20-TLR12P) and Chr8p23.1 (HMGB1P46) with the disorder, indicating the need for further research

    Observational study on variability between biobanks in the estimation of DNA concentration

    Full text link
    There is little confidence in the consistency of estimation of DNA concentrations when samples move between laboratories. Evidence on this consistency is largely anecdotal. Therefore there is a need first to measure this consistency among different laboratories and then identify and implement remedies. A pilot experiment to test logistics and provide initial data on consistency was therefore conceived

    DNA methylation meta-analysis reveals cellular alterations in psychosis and markers of treatment-resistant schizophrenia

    Full text link
    We performed a systematic analysis of blood DNA methylation profiles from 4,483 participants from seven independent cohorts identifying differentially methylated positions (DMPs) associated with psychosis, schizophrenia and treatment-resistant schizophrenia. Psychosis cases were characterized by significant differences in measures of blood cell proportions and elevated smoking exposure derived from the DNA methylation data, with the largest differences seen in treatment-resistant schizophrenia patients. We implemented a stringent pipeline to meta-analyze epigenome-wide association study (EWAS) results across datasets, identifying 95 DMPs associated with psychosis and 1,048 DMPs associated with schizophrenia, with evidence of colocalization to regions nominated by genetic association studies of disease. Many schizophrenia-associated DNA methylation differences were only present in patients with treatment-resistant schizophrenia, potentially reflecting exposure to the atypical antipsychotic clozapine. Our results highlight how DNA methylation data can be leveraged to identify physiological (e.g., differential cell counts) and environmental (e.g., smoking) factors associated with psychosis and molecular biomarkers of treatment-resistant schizophrenia

    Genome-wide association study of CNVs in 16,000 cases of eight common diseases and 3,000 shared controls

    Full text link
    Copy number variants (CNVs) account for a major proportion of human genetic polymorphism and have been predicted to have an important role in genetic susceptibility to common disease. To address this we undertook a large, direct genome-wide study of association between CNVs and eight common human diseases. Using a purpose-designed array we typed approximately 19,000 individuals into distinct copy-number classes at 3,432 polymorphic CNVs, including an estimated approximately 50% of all common CNVs larger than 500 base pairs. We identified several biological artefacts that lead to false-positive associations, including systematic CNV differences between DNAs derived from blood and cell lines. Association testing and follow-up replication analyses confirmed three loci where CNVs were associated with disease-IRGM for Crohn\u27s disease, HLA for Crohn\u27s disease, rheumatoid arthritis and type 1 diabetes, and TSPAN8 for type 2 diabetes-although in each case the locus had previously been identified in single nucleotide polymorphism (SNP)-based studies, reflecting our observation that most common CNVs that are well-typed on our array are well tagged by SNPs and so have been indirectly explored through SNP studies. We conclude that common CNVs that can be typed on existing platforms are unlikely to contribute greatly to the genetic basis of common human diseases

    Unbiased estimation of odds ratios: combining genomewide association scans with replication studies.

    Full text link
    Odds ratios or other effect sizes estimated from genome scans are upwardly biased, because only the top-ranking associations are reported, and moreover only if they reach a defined level of significance. No unbiased estimate exists based on data selected in this fashion, but replication studies are routinely performed that allow unbiased estimation of the effect sizes. Estimation based on replication data alone is inefficient in the sense that the initial scan could, in principle, contribute information on the effect size. We propose an unbiased estimator combining information from both the initial scan and the replication study, which is more efficient than that based just on the replication. Specifically, we adjust the standard combined estimate to allow for selection by rank and significance in the initial scan. Our approach explicitly allows for multiple associations arising from a scan, and is robust to mis-specification of a significance threshold. We require replication data to be available but argue that, in most applications, estimates of effect sizes are only useful when associations have been replicated. We illustrate our approach on some recently completed scans and explore its efficiency by simulation
    corecore