0 Evaluaciones

ID

45707

Descripción

Principal Investigator: John S. Witte, PhD, University of California, San Francisco, CA, USA MeSH: Prostatic Neoplasms https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs001221 A genome-wide association study (GWAS) of prostate cancer (PCa) was conducted in Kaiser Permanente (KP) Northern California health plan members (7,783 cases, 38,595 controls; 80.3% non-Hispanic white, 4.9% African-American, 7.0% East Asian, and 7.8% Latino) [PMID: 26034056]. The data for these members were drawn from three KP cohort studies: Research Program in Genes, Environment and Health (RPGEH) ProHealth, and California Men's Health Study (CMHS) (described further under Study History). Four custom arrays were designed for genotyping, one for each of the four major race-ethnicity groups in the RPGEH cohort: African Americans, East Asians, Latinos, and Non-Hispanic Whites. The number of SNPs and SNP content varied by array, with SNP content designed to maximize the genome-wide coverage of low frequency and more common variants specific to the different race-ethnicity groups, including newly identified SNPs from sequencing projects, and SNPs with established associations with disease phenotypes and risk factors [PMIDs: 21565264, 21903159]. Within the total study cohort, n=34,736 completed a consent which permitted deposition of data to NIH. Genotyping followed the same general procedure described in [PMIDs: 26092718, plus additional quality control (QC) steps for the additional men, in order to control for potential batch and kit effects, described in [PMID: 26034056. Briefly, we first repeated the filters described in [PMID: 26092718] for all four arrays (EUR, LAT, EAS, AFR). Then, on an array-wise basis, we removed SNPs with MAF0.01, with a call rate95%, or with Hardy-Weinberg Equilibrium (HWE) p-value in homogeneous groups1x10ˆ-5. Furthermore, on the EUR array, to adjust for potential kit effect, we conducted a GWAS of kit, and removed those kit associated SNPs with p1x10ˆ-6; we also re-genotyped each of the new samples (those not genotyped with the original GERA data) with some of the original GERA data, and removed SNPs with 13/1,268 (1%) mismatches. For the AFR array, to adjust potential plate batch issues, we conducted a GWAS of whether an individual was in the original GERA data vs. in the newly genotyped data and removed those batch-associated SNPs with p0.05 (we used a stronger threshold than that used for the EUR array because there were fewer individuals on the AFR array); we also re-genotyped each of the new samples with the original GERA data and removed SNPs with 2/78 (2.6%). After the QC described above, imputation was performed as described in [PMID: 26034056]. Imputation was performed on an array-wise basis, pre-phasing with SHAPE-IT v2.5 [PMID: 22138821], and imputing from the 1000 Genomes Project October 2014 release as a cosmopolitan reference panel with IMPUTE2 [PMID: 22384356]. In addition to the GWAS described above, a nested exome-wide association study (EWAS) of PCa was also conducted (7,489 cases, 7,323 controls; 78% non-Hispanic white, 9% African-American, 3% East Asian, 6% Latino, 4% Other). A custom EWAS array primarily focused on rare variants was designed for genotyping that complemented the GWAS arrays [PMID: 26034056]. The EWAS array content included missense and loss-of-function mutations, and rare exonic mutations from The Cancer Genome Atlas (TCGA) and dbGaP prostate cancer tumor exomes [PMID: 26544944; PMID: 26544944]. Much of the EWAS array design content overlapped with the probesets on the UK Biobank Affymetrix Axiom array [PMID: 30305743]. Genotyping and QC steps taken to filter out samples exhibiting low quality and variants with low call rates are described in Emami et al., 2020 [biorXiv]. The resulting EWAS array genotypes are provided here.

Link

dbGaP-study=phs001221

Palabras clave

  1. 16/5/23 16/5/23 - Chiara Middel
Titular de derechos de autor

John S. Witte, PhD, University of California, San Francisco, CA, USA

Subido en

16 de mayo de 2023

DOI

Para solicitar uno, por favor iniciar sesión.

Licencia

Creative Commons BY 4.0

Comentarios del modelo :

Puede comentar sobre el modelo de datos aquí. A través de las burbujas de diálogo en los grupos de elementos y elementos, puede agregar comentarios específicos.

Comentarios de grupo de elementos para :

Comentarios del elemento para :


    Sin comentarios

    Para descargar modelos de datos, debe haber iniciado sesión. Por favor iniciar sesión o Registrate gratis.

    dbGaP phs001221 ProHealth: Kaiser Permanente GWAS of Prostate Cancer

    Subject ID, sex, race, affection status, age, BMI, Gleason Summary Score, histologic grading and differentiation, and SEER general summary stage of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.

    1. StudyEvent: SEV1
      1. Eligibility Criteria
      2. Subject ID, subject source, source subject ID, and consent group of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
      3. Subject ID, sex, monozygous twins, family ID, mother ID, father ID, and sex of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
      4. Subject ID, sample ID, and sample use variable obtained from participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
      5. Subject ID, array, African Principal Component, East Asian Principal Component, European Principal Component, and Latin Principal Component of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
      6. Subject ID, sex, race, affection status, age, BMI, Gleason Summary Score, histologic grading and differentiation, and SEER general summary stage of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
      7. Sample ID, package, analyte type, body site where sample was collected, tumor status of sample, and Reagent Kit of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
    pht007161
    Descripción

    pht007161

    Alias
    UMLS CUI [1,1]
    C3846158
    De-Identified subject ID
    Descripción

    SUBJECT_ID

    Tipo de datos

    string

    Alias
    UMLS CUI [1,1]
    C4684638
    UMLS CUI [1,2]
    C2348585
    Gender
    Descripción

    SEX

    Tipo de datos

    text

    Alias
    UMLS CUI [1,1]
    C0079399
    Case or control status
    Descripción

    PC_CASE

    Tipo de datos

    text

    Alias
    UMLS CUI [1,1]
    C3274646
    Age at prostate cancer diagnosis (cases only)
    Descripción

    Age_at_Diagnosis

    Tipo de datos

    text

    Alias
    UMLS CUI [1,1]
    C1828181
    UMLS CUI [1,2]
    C0600139
    Age when specimen sample was collected
    Descripción

    Age_at_Specimen_Collection

    Tipo de datos

    text

    Alias
    UMLS CUI [1,1]
    C0001779
    UMLS CUI [1,2]
    C1302413
    Year of birth in (inclusive) 5-year categories
    Descripción

    Birth_Year_Cat

    Tipo de datos

    text

    Alias
    UMLS CUI [1,1]
    C2826771
    Body Mass Index of participant, based on clinical data, at time point closest to date of specimen sample submission
    Descripción

    BMI_Cat

    Tipo de datos

    text

    Alias
    UMLS CUI [1,1]
    C1305855
    UMLS CUI [1,2]
    C1302413
    Self-reported race [Asian, Black, Hispanic, other/uncertain, White
    Descripción

    RACE

    Tipo de datos

    string

    Alias
    UMLS CUI [1,1]
    C0034510
    UMLS CUI [1,2]
    C0681906
    SEER General Summary Stage for cases
    Descripción

    StageGen

    Tipo de datos

    text

    Alias
    UMLS CUI [1,1]
    C0242638
    UMLS CUI [1,2]
    C1698493
    Histologic grading and differentiation. Describes tumor resemblance to normal tissue.
    Descripción

    Grade

    Tipo de datos

    text

    Alias
    UMLS CUI [1,1]
    C0919553
    UMLS CUI [1,2]
    C2945687
    Gleason summary score at biopsy [Values: 3-9]
    Descripción

    Gleason

    Tipo de datos

    text

    Alias
    UMLS CUI [1,1]
    C0005558
    UMLS CUI [1,2]
    C0332326

    Similar models

    Subject ID, sex, race, affection status, age, BMI, Gleason Summary Score, histologic grading and differentiation, and SEER general summary stage of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.

    1. StudyEvent: SEV1
      1. Eligibility Criteria
      2. Subject ID, subject source, source subject ID, and consent group of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
      3. Subject ID, sex, monozygous twins, family ID, mother ID, father ID, and sex of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
      4. Subject ID, sample ID, and sample use variable obtained from participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
      5. Subject ID, array, African Principal Component, East Asian Principal Component, European Principal Component, and Latin Principal Component of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
      6. Subject ID, sex, race, affection status, age, BMI, Gleason Summary Score, histologic grading and differentiation, and SEER general summary stage of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
      7. Sample ID, package, analyte type, body site where sample was collected, tumor status of sample, and Reagent Kit of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
    Name
    Tipo
    Description | Question | Decode (Coded Value)
    Tipo de datos
    Alias
    Item Group
    pht007161
    C3846158 (UMLS CUI [1,1])
    SUBJECT_ID
    Item
    De-Identified subject ID
    string
    C4684638 (UMLS CUI [1,1])
    C2348585 (UMLS CUI [1,2])
    Item
    Gender
    text
    C0079399 (UMLS CUI [1,1])
    Code List
    Gender
    CL Item
    Male (1)
    C0086582 (UMLS CUI [1,1])
    Item
    Case or control status
    text
    C3274646 (UMLS CUI [1,1])
    Code List
    Case or control status
    CL Item
    control (0)
    C3274648 (UMLS CUI [1,1])
    CL Item
    case (1)
    C3274647 (UMLS CUI [1,1])
    Age_at_Diagnosis
    Item
    Age at prostate cancer diagnosis (cases only)
    text
    C1828181 (UMLS CUI [1,1])
    C0600139 (UMLS CUI [1,2])
    Age_at_Specimen_Collection
    Item
    Age when specimen sample was collected
    text
    C0001779 (UMLS CUI [1,1])
    C1302413 (UMLS CUI [1,2])
    Item
    Year of birth in (inclusive) 5-year categories
    text
    C2826771 (UMLS CUI [1,1])
    Code List
    Year of birth in (inclusive) 5-year categories
    CL Item
    <1927 (1)
    CL Item
    1927-1931 (2)
    CL Item
    1932-1936 (3)
    CL Item
    1937-1941 (4)
    CL Item
    1942-1946 (5)
    CL Item
    1947-1951 (6)
    CL Item
    1952-1956 (7)
    CL Item
    1957-1961 (8)
    CL Item
    1962-1966 (9)
    CL Item
    1967-1971 (10)
    CL Item
    1972-1976 (11)
    CL Item
    1977-1981 (12)
    CL Item
    1982-1986 (13)
    CL Item
    1987-1991 (14)
    Item
    Body Mass Index of participant, based on clinical data, at time point closest to date of specimen sample submission
    text
    C1305855 (UMLS CUI [1,1])
    C1302413 (UMLS CUI [1,2])
    Code List
    Body Mass Index of participant, based on clinical data, at time point closest to date of specimen sample submission
    CL Item
    missing (-9)
    CL Item
    <=18 (1)
    CL Item
    19-25 (2)
    CL Item
    26-29 (3)
    CL Item
    30-39 (4)
    CL Item
    40+ (5)
    RACE
    Item
    Self-reported race [Asian, Black, Hispanic, other/uncertain, White
    string
    C0034510 (UMLS CUI [1,1])
    C0681906 (UMLS CUI [1,2])
    Item
    SEER General Summary Stage for cases
    text
    C0242638 (UMLS CUI [1,1])
    C1698493 (UMLS CUI [1,2])
    Code List
    SEER General Summary Stage for cases
    CL Item
    missing (-9)
    CL Item
    In situ (0)
    CL Item
    Localized (1)
    CL Item
    Regional, direct extension only (2)
    CL Item
    Regional, regional lymph nodes only (3)
    CL Item
    Regional, direct extension and regional lymph nodes (4)
    CL Item
    Regional, NOS (5)
    CL Item
    Distant (7)
    CL Item
    Not applicable (8)
    C1272460 (UMLS CUI [1,1])
    CL Item
    Unstaged (9)
    Item
    Histologic grading and differentiation. Describes tumor resemblance to normal tissue.
    text
    C0919553 (UMLS CUI [1,1])
    C2945687 (UMLS CUI [1,2])
    Code List
    Histologic grading and differentiation. Describes tumor resemblance to normal tissue.
    CL Item
    missing (-9)
    CL Item
    Well differentiated (1)
    CL Item
    Moderately differentiated (2)
    CL Item
    Poorly differentiated (3)
    CL Item
    Undifferentiated, anaplastic (4)
    CL Item
    T-cell (5)
    CL Item
    B-cell (6)
    CL Item
    Null cell (non T or B cell) (7)
    CL Item
    NK cell (natural killer cell) (8)
    CL Item
    Grade or differentiation not determined, not stated or not applicable (9)
    C1272460 (UMLS CUI [1,1])
    Item
    Gleason summary score at biopsy [Values: 3-9]
    text
    C0005558 (UMLS CUI [1,1])
    C0332326 (UMLS CUI [1,2])
    Code List
    Gleason summary score at biopsy [Values: 3-9]
    CL Item
    Score (Score)

    Do you need help on how to use the search function? Please watch the corresponding tutorial video for more details and learn how to use the search function most efficiently.

    Watch Tutorial