ID

45425

Descrizione

Principal Investigator: Olga Gorlova, PhD, UT MD Anderson Cancer Center, Houston, TX MeSH: Lung Neoplasms https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000634 *A genomewide study of lung cancer in never smokers* *Abstract and specific aims* In the United States, lung cancer incidence and mortality rates have been steadily declining over the past decade, following decline in the prevalence of tobacco smoking. However, lung cancer remains the leading cause of cancer death, killing more patients than breast, colon, and prostate cancers combined. Although tobacco smoke is the predominant risk factor for development of lung cancer, some patients develop the disease without a history of tobacco smoking. About 10 - 15% of all lung cancers occur in lilfetime never smokers. This figure will increase as the proportion of never smokers increases in the population. Even at present rates, lung cancer in never smokers, if considered a separate disease, is 6th to 8th top cause of cancer death. The growing number of never smokers in the USA and other countries emphasizes the importance of understanding the epidemiology and biology underlying lung cancer in this group. Genetic polymorphisms associated with the risk of lung cancer in never smokers are expected to overlap with those associated with the risk of lung cancer in ever smokers only partially. Epidemiological, molecular and clinical data suggest that molecular mechanisms of LC may differ in smokers and non-smokers, implying that lung cancer in never smokers is a different disease compared to the lung cancer in smokers. One can expect that there should be stronger genetic component in the control lung cancer in never smokers because effects of the genetic factors in never smokers are unmasked by the lack of tobacco smoke exposure. The genetic epidemiology of lung cancer in never smokers has not been well explored, largely because of difficulties in accruing the needed sample size for association studies. We propose a multicenter (total 14 sites from the US and Europe) genomewide association study of lung cancer in never smokers with the following specific aims: *Aim 1: To identify candidate SNPs influencing risk for lung cancer in never smokers using Discovery sample.* In the Discovery phase we will genotype 1256 Caucasian cases and 1365 age- and gender-matched never smoker controls using the Illumina Human660W-Quad platform. In addition, we will include in the analysis 284 cases and 175 matched controls already genotyped on the 610Quad platform. In this phase we will only include the study sites that have collected blood specimens (MDACC, Mayo Clinic, Karmanos Cancer Institute, The University of Liverpool Cancer research Centre, Institute of Cancer Research in Sutton, and Lunenfeld Research Institute in Toronto, Canada). All the samples will be sent to the independent lab for genotyping, to reduce site-specific technical artifacts. The final sample will consist of 1540 cases and 1540 controls matched by study site. *Aim 2: To perform the second phase (validation) analysis of significant SNPs identified in aim 1 using an independent set of cases and controls.* SNPs associated with risk at the significance level of 0.01 or below in the discovery set will be included in the replication phase. The proposed threshold guarantees an adequate power to retain SNPs with the typical effect size of 1.3. We plan to carry 6000-7000 SNPs for validation. The independent replication set will include 800 cases and 800 controls, mostly from sites that collected tissue (Mayo Clinic, Karmanos Cancer Institute, UT Southwestern) or buccal specimens (UCLA), but also blood samples (Imperial College London, University of Pennsylvana, German Cancer Research Center, Heidelberg, National Research Center for Environment and Health, Neuherberg, Carmel Medical Center, Haifa). We will then perform a joint analysis to test the significance of the SNPs identified in the first stage using a stringent critical p-value of 10-7. There will be 2340 cases and 2340 controls in the joint set. Based on our experience with GWAS in smokers and assuming that genetic component in lung cancer risk in never smokers can be higher than genetic component in smokers, we expect to identify about 5-10 candidate regions associated with lung cancer risk in never smokers. *Aim 3: To identify and explore pathways associated with the risk of lung cancer in never smokers.* Results of the number of studies on the molecular mechanisms and drug response suggest that lung cancer in never smokers is a different disease and different pathways will be associated with lung cancer risk in non-smokers and smokers. To identify pathways and molecular functions associated with lung cancer risk in never smokers we will apply Ingenuity and DAVID bioinformatics tools. We will use at least 300 top candidate genes identified in joint and discovery analysis. The reason why we select rather large number of candidate genes for functional annotation is two-fold: 1. Both algorithms are looking for enrichment of pathways and function by most significant genes and they produce statistically robust results only when number of genes is relatively high. 2. Despite the fact that this study will be largest possible for never smokers we still are underpowered to detect SNPs with relatively small effect size. But though those SNPs will not reach genome wide level of significance they will tend to be on the top of the list. In other words genes from the gray zone (significant on individual level and non-significant for genome wide level) are expected to be enriched by true discoveries. True discoveries are likely to be associated with limited number of pathways / functions while false positives are expected to be uniformly distributed across functions and pathways. Therefore significant clustering of the gene to a given function will suggest that that those genes are true discoveries. This is the first GWAS aiming at identifying the genetic control of susceptibility to lung cancer in Caucasian never smokers. We will combine the available resources from the multiple sites to achieve the sample size sufficient for this study. The study will identify genetic architecture of the predisposition to the lung cancer in never smokers.

collegamento

dbGaP study = phs000634

Keywords

  1. 25/11/22 25/11/22 - Simon Heim
Titolare del copyright

Olga Gorlova, PhD, UT MD Anderson Cancer Center, Houston, TX

Caricato su

25 novembre 2022

DOI

Per favore, per richiedere un accesso.

Licenza

Creative Commons BY 4.0

Commenti del modello :

Puoi commentare il modello dati qui. Tramite i fumetti nei gruppi di articoli e articoli è possibile aggiungere commenti a quelli in modo specifico.

Commenti del gruppo di articoli per :

Commenti dell'articolo per :


Non ci sono commenti

Per scaricare i modelli di dati devi essere registrato. Per favore accesso o registrati GRATIS.

dbGaP phs000634 NCI GWAS of Lung Cancer in Never Smokers

Subject ID, gender, age, case or control status, level of education, second hand smoking status, education, principal components, geographic subset of subjects, cancer family history, lung cancer family history, and sample histology of participants with or without lung cancer and involved in the "National Cancer Institute (NCI) Genome Wide Association Study (GWAS) of Lung Cancer in Never Smokers" project.

pht003831
Descrizione

pht003831

Alias
UMLS CUI [1,1]
C3846158
De-identified subject ID 1
Descrizione

dnGap_ID1

Tipo di dati

text

Alias
UMLS CUI [1,1]
C4684638
UMLS CUI [1,2]
C2348585
Gender of participant
Descrizione

Gender

Tipo di dati

text

Alias
UMLS CUI [1,1]
C0079399
Case or control status
Descrizione

Status

Tipo di dati

text

Alias
UMLS CUI [1,1]
C3274646
Subject age
Descrizione

Age

Tipo di dati

float

Unità di misura
  • years
Alias
UMLS CUI [1,1]
C0001779
years
Source repository where subjects originate
Descrizione

Site

Tipo di dati

string

Alias
UMLS CUI [1,1]
C3847505
UMLS CUI [1,2]
C0449416
UMLS CUI [1,3]
C0681850
Level of education
Descrizione

Edu

Tipo di dati

text

Alias
UMLS CUI [1,1]
C0013658
Second hand smoking exposure status
Descrizione

SHS

Tipo di dati

text

Alias
UMLS CUI [1,1]
C1545750
1st principal component
Descrizione

pc1

Tipo di dati

float

Alias
UMLS CUI [1,1]
C0205435
UMLS CUI [1,2]
C1882460
2nd principal component
Descrizione

pc2

Tipo di dati

float

Alias
UMLS CUI [1,1]
C0205436
UMLS CUI [1,2]
C1882460
3rd principal component
Descrizione

pc3

Tipo di dati

float

Alias
UMLS CUI [1,1]
C0205437
UMLS CUI [1,2]
C1882460
4th principal component
Descrizione

pc4

Tipo di dati

float

Alias
UMLS CUI [1,1]
C0205438
UMLS CUI [1,2]
C1882460
5th principal component
Descrizione

pc5

Tipo di dati

float

Alias
UMLS CUI [1,1]
C1882460
UMLS CUI [1,2]
C0205439
Geographic subset of the subject/unit for meta-analysis
Descrizione

Subset

Tipo di dati

string

Alias
UMLS CUI [1,1]
C0017446
UMLS CUI [1,2]
C0920317
Lung cancer family history
Descrizione

LC_FH

Tipo di dati

text

Alias
UMLS CUI [1,1]
C0728711

Similar models

Subject ID, gender, age, case or control status, level of education, second hand smoking status, education, principal components, geographic subset of subjects, cancer family history, lung cancer family history, and sample histology of participants with or without lung cancer and involved in the "National Cancer Institute (NCI) Genome Wide Association Study (GWAS) of Lung Cancer in Never Smokers" project.

Name
genere
Description | Question | Decode (Coded Value)
Tipo di dati
Alias
Item Group
pht003831
C3846158 (UMLS CUI [1,1])
dnGap_ID1
Item
De-identified subject ID 1
text
C4684638 (UMLS CUI [1,1])
C2348585 (UMLS CUI [1,2])
Item
Gender of participant
text
C0079399 (UMLS CUI [1,1])
Code List
Gender of participant
CL Item
Male (1)
C0086582 (UMLS CUI [1,1])
CL Item
Female (2)
C0086287 (UMLS CUI [1,1])
Item
Case or control status
text
C3274646 (UMLS CUI [1,1])
Code List
Case or control status
CL Item
Control (1)
C3274648 (UMLS CUI [1,1])
CL Item
Case (2)
C3274647 (UMLS CUI [1,1])
Age
Item
Subject age
float
C0001779 (UMLS CUI [1,1])
Site
Item
Source repository where subjects originate
string
C3847505 (UMLS CUI [1,1])
C0449416 (UMLS CUI [1,2])
C0681850 (UMLS CUI [1,3])
Item
Level of education
text
C0013658 (UMLS CUI [1,1])
Code List
Level of education
CL Item
No formal education or elementary (1)
CL Item
Up to high school or vocational school (2)
CL Item
Some college or above (3)
CL Item
missing (9)
Item
Second hand smoking exposure status
text
C1545750 (UMLS CUI [1,1])
Code List
Second hand smoking exposure status
CL Item
Not exposed (0)
CL Item
Exposed (1)
CL Item
Missing (9)
pc1
Item
1st principal component
float
C0205435 (UMLS CUI [1,1])
C1882460 (UMLS CUI [1,2])
pc2
Item
2nd principal component
float
C0205436 (UMLS CUI [1,1])
C1882460 (UMLS CUI [1,2])
pc3
Item
3rd principal component
float
C0205437 (UMLS CUI [1,1])
C1882460 (UMLS CUI [1,2])
pc4
Item
4th principal component
float
C0205438 (UMLS CUI [1,1])
C1882460 (UMLS CUI [1,2])
pc5
Item
5th principal component
float
C1882460 (UMLS CUI [1,1])
C0205439 (UMLS CUI [1,2])
Subset
Item
Geographic subset of the subject/unit for meta-analysis
string
C0017446 (UMLS CUI [1,1])
C0920317 (UMLS CUI [1,2])
Item
Lung cancer family history
text
C0728711 (UMLS CUI [1,1])
Code List
Lung cancer family history
CL Item
Lung cancer in first degree relatives not reported (0)
CL Item
Lung cancer in first degree relatives reported (1)
CL Item
Missing (9)

Do you need help on how to use the search function? Please watch the corresponding tutorial video for more details and learn how to use the search function most efficiently.

Watch Tutorial