ID

45707

Description

Principal Investigator: John S. Witte, PhD, University of California, San Francisco, CA, USA MeSH: Prostatic Neoplasms https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs001221 A genome-wide association study (GWAS) of prostate cancer (PCa) was conducted in Kaiser Permanente (KP) Northern California health plan members (7,783 cases, 38,595 controls; 80.3% non-Hispanic white, 4.9% African-American, 7.0% East Asian, and 7.8% Latino) [PMID: 26034056]. The data for these members were drawn from three KP cohort studies: Research Program in Genes, Environment and Health (RPGEH) ProHealth, and California Men's Health Study (CMHS) (described further under Study History). Four custom arrays were designed for genotyping, one for each of the four major race-ethnicity groups in the RPGEH cohort: African Americans, East Asians, Latinos, and Non-Hispanic Whites. The number of SNPs and SNP content varied by array, with SNP content designed to maximize the genome-wide coverage of low frequency and more common variants specific to the different race-ethnicity groups, including newly identified SNPs from sequencing projects, and SNPs with established associations with disease phenotypes and risk factors [PMIDs: 21565264, 21903159]. Within the total study cohort, n=34,736 completed a consent which permitted deposition of data to NIH. Genotyping followed the same general procedure described in [PMIDs: 26092718, plus additional quality control (QC) steps for the additional men, in order to control for potential batch and kit effects, described in [PMID: 26034056. Briefly, we first repeated the filters described in [PMID: 26092718] for all four arrays (EUR, LAT, EAS, AFR). Then, on an array-wise basis, we removed SNPs with MAF0.01, with a call rate95%, or with Hardy-Weinberg Equilibrium (HWE) p-value in homogeneous groups1x10ˆ-5. Furthermore, on the EUR array, to adjust for potential kit effect, we conducted a GWAS of kit, and removed those kit associated SNPs with p1x10ˆ-6; we also re-genotyped each of the new samples (those not genotyped with the original GERA data) with some of the original GERA data, and removed SNPs with 13/1,268 (1%) mismatches. For the AFR array, to adjust potential plate batch issues, we conducted a GWAS of whether an individual was in the original GERA data vs. in the newly genotyped data and removed those batch-associated SNPs with p0.05 (we used a stronger threshold than that used for the EUR array because there were fewer individuals on the AFR array); we also re-genotyped each of the new samples with the original GERA data and removed SNPs with 2/78 (2.6%). After the QC described above, imputation was performed as described in [PMID: 26034056]. Imputation was performed on an array-wise basis, pre-phasing with SHAPE-IT v2.5 [PMID: 22138821], and imputing from the 1000 Genomes Project October 2014 release as a cosmopolitan reference panel with IMPUTE2 [PMID: 22384356]. In addition to the GWAS described above, a nested exome-wide association study (EWAS) of PCa was also conducted (7,489 cases, 7,323 controls; 78% non-Hispanic white, 9% African-American, 3% East Asian, 6% Latino, 4% Other). A custom EWAS array primarily focused on rare variants was designed for genotyping that complemented the GWAS arrays [PMID: 26034056]. The EWAS array content included missense and loss-of-function mutations, and rare exonic mutations from The Cancer Genome Atlas (TCGA) and dbGaP prostate cancer tumor exomes [PMID: 26544944; PMID: 26544944]. Much of the EWAS array design content overlapped with the probesets on the UK Biobank Affymetrix Axiom array [PMID: 30305743]. Genotyping and QC steps taken to filter out samples exhibiting low quality and variants with low call rates are described in Emami et al., 2020 [biorXiv]. The resulting EWAS array genotypes are provided here.

Link

dbGaP-study=phs001221

Keywords

  1. 5/16/23 5/16/23 - Chiara Middel
Copyright Holder

John S. Witte, PhD, University of California, San Francisco, CA, USA

Uploaded on

May 16, 2023

DOI

To request one please log in.

License

Creative Commons BY 4.0

Model comments :

You can comment on the data model here. Via the speech bubbles at the itemgroups and items you can add comments to those specificially.

Itemgroup comments for :

Item comments for :


No comments

In order to download data models you must be logged in. Please log in or register for free.

dbGaP phs001221 ProHealth: Kaiser Permanente GWAS of Prostate Cancer

Sample ID, package, analyte type, body site where sample was collected, tumor status of sample, and Reagent Kit of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.

  1. StudyEvent: SEV1
    1. Eligibility Criteria
    2. Subject ID, subject source, source subject ID, and consent group of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
    3. Subject ID, sex, monozygous twins, family ID, mother ID, father ID, and sex of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
    4. Subject ID, sample ID, and sample use variable obtained from participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
    5. Subject ID, array, African Principal Component, East Asian Principal Component, European Principal Component, and Latin Principal Component of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
    6. Subject ID, sex, race, affection status, age, BMI, Gleason Summary Score, histologic grading and differentiation, and SEER general summary stage of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
    7. Sample ID, package, analyte type, body site where sample was collected, tumor status of sample, and Reagent Kit of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
pht007162
Description

pht007162

Alias
UMLS CUI [1,1]
C3846158
De-identified sample ID
Description

SAMPLE_ID

Data type

string

Alias
UMLS CUI [1,1]
C4684638
UMLS CUI [1,2]
C1299222
Package
Description

PACKAGE_NUMBER

Data type

text

Alias
UMLS CUI [1,1]
C2700580
Reagent Kit
Description

AFFY_KIT_TYPE

Data type

string

Alias
UMLS CUI [1,1]
C0812225
Body site where sample was collected
Description

BODY_SITE

Data type

string

Alias
UMLS CUI [1,1]
C0449705
Analyte type
Description

ANALYTE_TYPE

Data type

string

Alias
UMLS CUI [1,1]
C4744818
Tumor status of sample
Description

IS_TUMOR

Data type

text

Alias
UMLS CUI [1,1]
C0475752

Similar models

Sample ID, package, analyte type, body site where sample was collected, tumor status of sample, and Reagent Kit of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.

  1. StudyEvent: SEV1
    1. Eligibility Criteria
    2. Subject ID, subject source, source subject ID, and consent group of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
    3. Subject ID, sex, monozygous twins, family ID, mother ID, father ID, and sex of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
    4. Subject ID, sample ID, and sample use variable obtained from participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
    5. Subject ID, array, African Principal Component, East Asian Principal Component, European Principal Component, and Latin Principal Component of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
    6. Subject ID, sex, race, affection status, age, BMI, Gleason Summary Score, histologic grading and differentiation, and SEER general summary stage of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
    7. Sample ID, package, analyte type, body site where sample was collected, tumor status of sample, and Reagent Kit of participants with or without prostate cancer and involved in the "ProHealth: Kaiser Permanente Genome-wide Association Study of Prostate Cancer" project.
Name
Type
Description | Question | Decode (Coded Value)
Data type
Alias
Item Group
pht007162
C3846158 (UMLS CUI [1,1])
SAMPLE_ID
Item
De-identified sample ID
string
C4684638 (UMLS CUI [1,1])
C1299222 (UMLS CUI [1,2])
PACKAGE_NUMBER
Item
Package
text
C2700580 (UMLS CUI [1,1])
AFFY_KIT_TYPE
Item
Reagent Kit
string
C0812225 (UMLS CUI [1,1])
BODY_SITE
Item
Body site where sample was collected
string
C0449705 (UMLS CUI [1,1])
ANALYTE_TYPE
Item
Analyte type
string
C4744818 (UMLS CUI [1,1])
Item
Tumor status of sample
text
C0475752 (UMLS CUI [1,1])
Code List
Tumor status of sample
CL Item
Not a tumor (N)

Please use this form for feedback, questions and suggestions for improvements.

Fields marked with * are required.

Do you need help on how to use the search function? Please watch the corresponding tutorial video for more details and learn how to use the search function most efficiently.

Watch Tutorial