Prediction of lung cancer risk in a Chinese population using a multifactorial genetic model
- Huan Li†1,
- Lixin Yang†2,
- Xueying Zhao1,
- Jiucun Wang1,
- Ji Qian1,
- Hongyan Chen1,
- Weiwei Fan1,
- Hongcheng Liu1,
- Li Jin1,
- Weimin Wang3, 4 and
- Daru Lu1
© Li et al.; licensee BioMed Central Ltd. 2012
Received: 21 March 2012
Accepted: 6 November 2012
Published: 10 December 2012
Lung cancer is a complex polygenic disease. Although recent genome-wide association (GWA) studies have identified multiple susceptibility loci for lung cancer, most of these variants have not been validated in a Chinese population. In this study, we investigated whether a genetic risk score combining multiple susceptibility variants could predict lung cancer risk in a Chinese population.
Five single-nucleotide polymorphisms (SNPs) identified in previous GWA or large cohort studies were genotyped in 5068 Chinese case–control subjects. The genetic risk score (GRS) based on these SNPs was estimated by two approaches: a simple risk alleles count (cGRS) and a weighted (wGRS) method. The area under the receiver operating characteristic (ROC) curve (AUC) in combination with the bootstrap resampling method was used to assess the predictive performance of the genetic risk score for lung cancer.
Four independent SNPs (rs2736100, rs402710, rs4488809 and rs4083914) were found to be associated with a risk of lung cancer. The wGRS based on these four SNPs was a better predictor than cGRS. Using a liability threshold model, we estimated that these four SNPs accounted for only 4.02% of genetic variance in lung cancer. Smoking history contributed significantly to lung cancer risk (P < 0.001) [AUC = 0.619 (0.603-0.634)], and when combined with wGRS gave an AUC value of 0.639 (0.621-0.652) after adjustment for over-fitting. This model shows promise for assessing lung cancer risk in a Chinese population.
Our results indicate that although genetic variants related to lung cancer added only moderate discriminatory accuracy, they still improved the predictive ability of the assessment model in a Chinese population.
Keywords: Chinese; Cumulative risk; Genetic risk score; Lung cancer; Risk assessment
Lung cancer is one of the leading causes of cancer death worldwide [1, 2]. Most patients are diagnosed at an advanced stage and so are not able to undergo surgical removal of tumors. As a result, the overall 5-year survival rate is low. Detection at an early stage, when treatment might be more effective, would therefore help reduce lung cancer mortality. For this reason, a well-established assessment model that could identify individuals at high risk would greatly benefit patients, clinicians and researchers.
Lung cancer is a polygenic disease, for which many genetic factors appear to play an important role in disease development [2, 3]. During the past three years, several genome-wide association (GWA) studies have identified a number of genetic susceptibility loci associated with lung cancer risk [4–9], but most of these studies were conducted in populations of European descent, and many identified risk alleles have not been adequately evaluated in Asian populations.
In addition, when examined individually, each of the genetic susceptibility loci confers only a small to moderate disease risk, and is of limited utility in risk prediction. It is possible that combining multiple disease-related loci with modest effects into a genetic risk score (GRS) may be useful to identify subgroups that are at high risk of lung cancer [10, 11]. Several lung cancer risk assessment models have been proposed, including the Bach model, Spitz model, and Liverpool Lung Project (LLP) model [12–15]. However, most predictors from these models focus on demographic and clinical factors, and, to our knowledge, no report has quantified the risk of lung cancer using a combination of newly identified risk loci in a Chinese population.
In this case–control study, we evaluate the discriminatory and predictive ability of the cumulative effect of several SNPs associated with lung cancer risk in populations of European descent, and estimate the proportion of genetic variance explained by the selected risk loci in a Chinese population.
A total of 2,283 lung cancer cases and 2,785 cancer-free controls (from Shanghai Zhongshan Hospital, Shanghai Chest Hospital, First Affiliated Hospital of Nanjing Medical University, Beijing Union Medical College Hospital, and Wuhan Union Hospital, China) who were genetically unrelated Han Chinese were enrolled in this study. Eligible patients had histopathologically confirmed lung cancer, had no previous cancer history, and were not receiving radiotherapy or chemotherapy for any other condition. Control participants were randomly selected from individuals receiving routine physical examinations in local hospitals or those who participated in a community-based screening program of non-communicable diseases. They were frequency-matched to the cases according to age, gender and residential area.
Information on smoking was collected by means of interviews. Individuals who had smoked less than one cigarette per day and for less than one year in their lifetime were defined as nonsmokers. The remaining individuals were divided into light and heavy smokers according to the threshold of 25 pack years (median pack years in the controls). All participants provided written informed consent for study participation with approval from institutional review boards of each participating institution.
Selection of genetic risk factors and genotyping
Table: Selected SNPs associated with lung cancer (columns include: first author, year (reference); reported OR (95% CI); sources listed: McKay, 2008 (three entries); You, 2009; Miki, 2010; genes named: CHRNA5, CHRNA3).
Blood samples were collected from each subject at the time of recruitment, and genomic DNA was extracted using the QIAamp DNA Maxi kit (Qiagen GmbH). All SNPs were genotyped on the Sequenom MassARRAY iPLEX platform by matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry. Primer sequences are available on request. Overall, more than 98% of genotypes were successfully determined for all the SNPs; 5% of samples were randomly selected for re-genotyping as quality control, and showed a reproducibility of 100%.
Genetic risk score computation
Two approaches were used to calculate the genetic risk score (GRS): a simple risk allele count method (count GRS, cGRS) and a weighted method based on the genotype frequencies for each SNP and effect sizes (allelic odds ratios) from our study (weighted GRS, wGRS). Based on the log-additive model, the three genotypes AA, AB, and BB (A, low-risk allele; B, high-risk allele) for an SNP had relative risks of 1, OR, and $\mathrm{OR}^2$, respectively. If the B allele had frequency p, then the average relative risk in the population was calculated as $u = (1-p)^2 + 2p(1-p)\,\mathrm{OR} + p^2\,\mathrm{OR}^2$. The adjusted risk values for the AA, AB, and BB genotypes were $1/u$, $\mathrm{OR}/u$, and $\mathrm{OR}^2/u$, respectively, so that the average adjusted risk in the population is 1. Missing genotypes were assigned a value of 1. The formula for our combined SNP weighted risk score was $\mathrm{wGRS} = \mathrm{SNP}_1 \times \mathrm{SNP}_2 \times \mathrm{SNP}_3 \times \mathrm{SNP}_4$, where $\mathrm{SNP}_1$ to $\mathrm{SNP}_4$ were the weighted risk scores for the individual SNPs.
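The two scores can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' analysis code (the study used SAS). The function names, allele frequencies and odds ratios are hypothetical. The sketch divides every genotype's relative risk, including that of BB, by u, so that the weights average to 1 in the population. It also assumes that a missing genotype contributes 0 to the cGRS, which the text does not specify.

```python
from math import prod


def population_mean_risk(p, odds_ratio):
    """Average relative risk u under the log-additive model."""
    return (1 - p) ** 2 + 2 * p * (1 - p) * odds_ratio + p ** 2 * odds_ratio ** 2


def snp_weight(risk_allele_count, p, odds_ratio):
    """Genotype relative risk (OR ** count) divided by u.

    risk_allele_count is 0 (AA), 1 (AB) or 2 (BB); None marks a missing
    genotype, which is assigned a weight of 1 as described in the text.
    """
    if risk_allele_count is None:
        return 1.0
    return odds_ratio ** risk_allele_count / population_mean_risk(p, odds_ratio)


def cgrs(genotypes):
    """Count GRS: number of risk alleles (assumption: missing counts as 0)."""
    return sum(g for g in genotypes if g is not None)


def wgrs(genotypes, freqs, odds_ratios):
    """Weighted GRS: product of the per-SNP weights."""
    return prod(snp_weight(g, p, o) for g, p, o in zip(genotypes, freqs, odds_ratios))


# Hypothetical risk allele frequencies and allelic odds ratios for four SNPs.
freqs = [0.4, 0.3, 0.3, 0.2]
odds_ratios = [1.2, 1.15, 1.21, 1.1]
person = [2, 1, None, 0]  # risk allele counts; the third genotype is missing

print("cGRS:", cgrs(person))
print("wGRS:", round(wgrs(person, freqs, odds_ratios), 4))
```

Because each weight is normalised by u, a wGRS above 1 indicates a genotype combination with higher risk than the population average under this model, and a wGRS below 1 indicates lower risk.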
Percentage of genetic variance explained
The percentage of genetic variance was estimated under a liability threshold model. Allele frequencies and effect sizes corresponding to ORs were used to calculate the contribution of each SNP as $2p(1-p)\beta^2$ (p, risk allele frequency; β, additive allelic effect).
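As a rough illustration, the per-SNP expression can be evaluated directly. The sketch below takes β as an input because the text does not spell out how β was derived from the odds ratios. Summing across SNPs assumes the SNPs are independent. The function names and numeric values are hypothetical.

```python
def snp_variance_explained(p, beta):
    """Per-SNP term 2p(1-p) * beta^2 for risk allele frequency p and allelic effect beta."""
    return 2 * p * (1 - p) * beta ** 2


def total_variance_explained(snps):
    """Sum of per-SNP terms; assumes the SNPs are independent of each other."""
    return sum(snp_variance_explained(p, beta) for p, beta in snps)


# Hypothetical (frequency, effect) pairs, not values from the study.
example_snps = [(0.5, 0.1), (0.3, 0.08)]
print(total_variance_explained(example_snps))
```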
Logistic regression was employed to test the association between genetic variants and lung cancer risk. The classification ability of the model was assessed using the area under the receiver operating characteristic (ROC) curve (AUC), also known as the concordance (c) statistic. The Hosmer-Lemeshow test was used to evaluate the calibration of risk estimated in our cohort data. Internal validation of models was carried out using a bootstrap method involving 1000 replications to adjust model parameters for potential over-fitting. A second validation was performed by randomly dividing the cohort population into two unequal groups (one with 75% of the population, and the second with the remaining 25%). The larger group (training set) was used to rebuild the same model, which was then tested on the remaining 25% of the population (test set). All analyses were conducted using Statistical Analysis System (SAS) software (version 8.2; SAS Institute, Cary, NC). All p values were two-sided, and p values < 0.05 were considered statistically significant.
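The bootstrap adjustment of the c statistic can be sketched as follows. This is a minimal illustration of one common form of optimism correction, assumed here rather than confirmed by the text: refit the model on each bootstrap sample, then subtract the average drop in AUC between that sample and the original data. To keep the example dependency-free, the logistic regression is replaced by a simple lookup-table model (case fraction per predictor category). That substitution, the function names and the toy data are all assumptions made for illustration.

```python
import random


def auc(scores, labels):
    """c statistic: probability that a case scores higher than a control (ties count 1/2)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def fit_rate_table(xs, ys):
    """Stand-in model (not logistic regression): observed case fraction per category."""
    totals, cases = {}, {}
    for x, y in zip(xs, ys):
        totals[x] = totals.get(x, 0) + 1
        cases[x] = cases.get(x, 0) + y
    overall = sum(ys) / len(ys)

    def predict(x):
        # Categories unseen in the fitting sample fall back to the overall rate.
        return cases[x] / totals[x] if x in totals else overall

    return predict


def optimism_corrected_auc(xs, ys, fit, n_boot=1000, seed=0):
    """Return (apparent AUC, bootstrap optimism-corrected AUC)."""
    rng = random.Random(seed)
    model = fit(xs, ys)
    apparent = auc([model(x) for x in xs], ys)
    n = len(ys)
    optimism_sum, used = 0.0, 0
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        xb = [xs[i] for i in idx]
        yb = [ys[i] for i in idx]
        if len(set(yb)) < 2:  # skip resamples that lack cases or controls
            continue
        boot_model = fit(xb, yb)
        auc_boot = auc([boot_model(x) for x in xb], yb)  # performance on the resample
        auc_orig = auc([boot_model(x) for x in xs], ys)  # performance on the original data
        optimism_sum += auc_boot - auc_orig
        used += 1
    optimism = optimism_sum / used if used else 0.0
    return apparent, apparent - optimism


# Toy data: predictor category 0/1/2 (for example, a smoking level) and case status.
xs = [0] * 10 + [1] * 10 + [2] * 10
ys = [0] * 7 + [1] * 3 + [0] * 5 + [1] * 5 + [0] * 3 + [1] * 7
apparent, corrected = optimism_corrected_auc(xs, ys, fit_rate_table, n_boot=200)
print("apparent AUC:", round(apparent, 3), "corrected AUC:", round(corrected, 3))
```

The `fit` argument is pluggable, so the same loop would apply to a logistic regression fitted with any statistics package.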
Association between genetic risk alleles and lung cancer
Table: Association between SNPs and lung cancer (columns: frequency of high-risk allele; observed OR (95% CI); genetic variance explained).
Genetic risk score association
Table: Distribution of cGRS, wGRS, and demographic characteristics of case patients and control subjects (columns: case no. (%); control no. (%); logistic regression OR (95% CI); ROC AUC/c statistic (95% CI) (BOC); rows include cGRS (count of risk alleles), wGRS (weighted genetic risk score), and a category coded 1 (≤60 years)).
Discrimination performance of wGRS × demographic characteristics
Table: Logistic regression model including GRS and smoking status (model performance statistics: goodness of fit; ROC AUC/c statistic (95% CI) (BOC): 0.639 (0.621-0.652) (0.637)).
The adjusted AUC for the full model described above was 0.637 (Table 4 and Figure 1). The contribution of wGRS to the model was 0.020 (assessed by the reduction in c statistic when wGRS was removed from the full model). Smoking status was the strongest predictor in the model, with a contribution of 0.088.
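The two contributions can be checked against c statistics reported elsewhere in the article. The check assumes that the differences were taken from the unadjusted full-model c statistic of 0.639, and that removing one predictor leaves the single-predictor models reported in the text (smoking alone, 0.619; wGRS alone, 0.551):

```latex
\begin{aligned}
c_{\text{full}} - c_{\text{smoking only}} &= 0.639 - 0.619 = 0.020 \quad \text{(contribution of wGRS)} \\
c_{\text{full}} - c_{\text{wGRS only}} &= 0.639 - 0.551 = 0.088 \quad \text{(contribution of smoking status)}
\end{aligned}
```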
Internal model validation
After the model was rebuilt on the training set, it displayed similar discrimination ability to the original one (c statistic, 0.641). The model was then tested on the test set, and also showed similar discrimination ability (c statistic, 0.633).
Table: Predictive performance of model (columns: training set (75%); test set (25%)).
In this study, we systematically evaluated the clinical utility of five SNPs identified in recent GWA and large cohort studies of lung cancer. Using data from a large case–control study that enrolled 5,068 participants, we found that most of the genetic variants identified previously in other populations (rs2736100, rs402710, rs4488809, and rs4083914) were also associated with risk of lung cancer in a Chinese population. In addition, we showed that a wGRS accounting for the adjusted effect size of each SNP was a better predictor than a cGRS, and had a stronger association with lung cancer risk than any single SNP alone. Although the weighted genetic risk score alone had only moderate predictive ability, it gave better discrimination between lung cancer cases and cancer-free controls (AUC of ROC curve, 0.639) when combined with smoking status in the logistic regression model.
Several lung cancer risk assessment models have previously been proposed [12–15], but most predictors focused on traditional risk factors such as family history of lung cancer, smoking status, environmental exposure, age and gender. In contrast to these, genetic scores derived from inherited genetic variations offer the advantage of stability during the lifetime of the individual.
Previous studies have indicated that inherited genetic variants might account for an important fraction of lung cancer developmental risk [18, 19]. Recent GWA studies of lung cancer in populations of European ancestry identified three lung cancer susceptibility loci: 5p15 (TERT-CLPTM1L), 15q25 (CHRNA 3–5) and 6p21 (BAT3-MSH5) [4–9]. McKay et al. reported two independent markers of lung cancer at the 5p15 region, rs2736100 (TERT) and rs402710 (CLPTM1L). Furthermore, an association between rs2736100 and lung cancer was also replicated in Asian populations [20, 21]. Of the five SNPs evaluated in this study, we observed a strong signal at rs2736100 in accordance with previous reports.
The 15q25 region, which encodes nicotinic acetylcholine receptor subunits, has been reported to be associated with lung cancer risk [6–8]. We evaluated the rs1051730 SNP from this region in the present study, but it showed no association with disease risk. It is conceivable that the rs1051730 minor allele frequency in the Chinese Han population (MAF, 0.02) is too low to confirm the effects seen in European populations. Reported risk SNPs at 6p21 (rs3117582 and rs3131379) are not polymorphic in the Chinese Han population, so were excluded from this study. The SNPs rs4488809 and rs4083914, previously identified by GWA and large cohort investigations, were also shown to be significantly associated with lung cancer risk in this study [23, 24].
Of the five SNPs evaluated in this study, the strongest signal was found for rs4488809, for which there was a 21% elevated risk of lung cancer with each risk allele. The three other SNPs (rs2736100, rs402710, and rs4083914) were also associated with a risk of lung cancer, albeit at lower levels (<18%) for each risk allele. The estimated proportion of genetic variance explained by these four SNPs was 4.02%, which includes 1.82% due to rs4488809 and 1.33% due to rs2736100. This suggests that the genetic susceptibility loci identified by GWA and large cohort studies in other populations confer only a small to moderate risk in a Chinese population when considered alone, and are of little use individually in lung cancer risk assessment.
To overcome this, a genetic risk score combining multiple loci might improve the identification of persons at high risk of developing lung cancer. Our results showed that although wGRS was highly associated with lung cancer susceptibility, a model including wGRS alone did not provide a better predictive capacity than a model including traditional factors (c statistic for wGRS alone, 0.551). Smoking history was also associated with lung cancer risk in this study, in agreement with previous reports [12, 25]. Moreover, wGRS in combination with smoking status showed a better predictive ability (c statistic, 0.639). Indeed, the c statistic decreased by 0.020 when wGRS was removed from the full model, indicating that genetic risk factors could improve the discriminatory ability of the traditional assessment model, although this effect was moderate.
This study has a number of limitations. First, the susceptibility loci identified by GWA and large cohort studies with evidence of replication are likely to be associated with lung cancer risk through strong linkage disequilibrium with causal variants, and confer only moderate effects. Many additional susceptibility loci for lung cancer remain to be discovered, and it is possible that rare variants with high penetrance would explain the remaining heritability. Next-generation sequencing technologies offer hope for future research on such variants. Recently, several newly identified SNPs were reported [28–30]. Combining these new SNPs might improve the classification of lung cancer risk. Second, because few traditional factors were available, the full predictive model established in this study provided only a moderate level of classification accuracy, with a c statistic of 0.639, which is inadequate for risk prediction. The discriminatory capability of our model might be improved by including additional factors such as history of bronchitis, emphysema or pneumonia, asbestos exposure, and family history of lung cancer. Third, our assessment model lacked external validation, even though our estimates of ROC AUC were corrected for over-fitting by bootstrap and internal validation was conducted. Finally, as this was a retrospectively designed study, the results need to be validated by a large-scale, prospective study.
We have shown that most of the genetic susceptibility loci identified by previous GWA and large cohort studies in other populations were also associated with lung cancer risk in a Chinese population. Although the weighted genetic risk score had only a moderate discriminatory accuracy, it still improved the predictive ability of the assessment model, which might help in the identification of individuals at a high risk of developing lung cancer. Future studies should focus on establishing a risk assessment model that incorporates both genetic variants and established traditional factors for lung cancer.
- 95% CI: 95% confidence interval
- AUC: Area under the curve
- BOC: Bootstrap optimism corrected
- GRS: Genetic risk score
- GWA: Genome-wide association
- ROC: Receiver operating characteristic
- SNP: Single nucleotide polymorphism.
This work was supported in part by China National High-Tech Research and Development Program Grant (2012AA02A517, 2012AA02A518), Shanghai Science and Technology Research Program (09JC1402200, 10410709100) and Scientific and technological support plans from Jiangsu province (BE2010715).
- Jemal A, Siegel R, Xu J, Ward E: Cancer statistics, 2010. CA Cancer J Clin. 2010, 60: 277-300. 10.3322/caac.20073.
- Li X, Hemminki K: Familial and second lung cancers: a nation-wide epidemiologic study from Sweden. Lung Cancer. 2003, 39: 255-263. 10.1016/S0169-5002(02)00535-4.
- Jonsson S, Thorsteinsdottir U, Gudbjartsson DF, Jonsson HH, Kristjansson K, Arnason S, Gudnason V, Isaksson HJ, Hallgrimsson J, Gulcher JR, Amundadottir LT, Kong A, Stefansson K: Familial risk of lung carcinoma in the Icelandic population. JAMA. 2004, 292: 2977-2983. 10.1001/jama.292.24.2977.
- McKay JD, Hung RJ, Gaborieau V, Boffetta P, Chabrier A, Byrnes G, Zaridze D, Mukeria A, Szeszenia-Dabrowska N, Lissowska J, Rudnai P, Fabianova E, Mates D, Bencko V, Foretova L, Janout V, McLaughlin J, Shepherd F, Montpetit A, Narod S, Krokan HE, Skorpen F, Elvestad MB, Vatten L, Njolstad I, Axelsson T, Chen C, Goodman G, Barnett M, Loomis MM, et al: Lung cancer susceptibility locus at 5p15.33. Nat Genet. 2008, 40: 1404-1406. 10.1038/ng.254.
- Rafnar T, Sulem P, Stacey SN, Geller F, Gudmundsson J, Sigurdsson A, Jakobsdottir M, Helgadottir H, Thorlacius S, Aben KK, Blondal T, Thorgeirsson TE, Thorleifsson G, Kristjansson K, Thorisdottir K, Ragnarsson R, Sigurgeirsson B, Skuladottir H, Gudbjartsson T, Isaksson HJ, Einarsson GV, Benediktsdottir KR, Agnarsson BA, Olafsson K, Salvarsdottir A, Bjarnason H, Asgeirsdottir M, Kristinsson KT, Matthiasdottir S, Sveinsdottir SG, et al: Sequence variants at the TERT-CLPTM1L locus associate with many cancer types. Nat Genet. 2009, 41: 221-227. 10.1038/ng.296.
- Amos CI, Wu X, Broderick P, Gorlov IP, Gu J, Eisen T, Dong Q, Zhang Q, Gu X, Vijayakrishnan J, Sullivan K, Matakidou A, Wang Y, Mills G, Doheny K, Tsai YY, Chen WV, Shete S, Spitz MR, Houlston RS: Genome-wide association scan of tag SNPs identifies a susceptibility locus for lung cancer at 15q25.1. Nat Genet. 2008, 40: 616-622. 10.1038/ng.109.
- Hung RJ, McKay JD, Gaborieau V, Boffetta P, Hashibe M, Zaridze D, Mukeria A, Szeszenia-Dabrowska N, Lissowska J, Rudnai P, Fabianova E, Mates D, Bencko V, Foretova L, Janout V, Chen C, Goodman G, Field JK, Liloglou T, Xinarianos G, Cassidy A, McLaughlin J, Liu G, Narod S, Krokan HE, Skorpen F, Elvestad MB, Hveem K, Vatten L, Linseisen J, et al: A susceptibility locus for lung cancer maps to nicotinic acetylcholine receptor subunit genes on 15q25. Nature. 2008, 452: 633-637. 10.1038/nature06885.
- Thorgeirsson TE, Geller F, Sulem P, Rafnar T, Wiste A, Magnusson KP, Manolescu A, Thorleifsson G, Stefansson H, Ingason A, Stacey SN, Bergthorsson JT, Thorlacius S, Gudmundsson J, Jonsson T, Jakobsdottir M, Saemundsdottir J, Olafsdottir O, Gudmundsson LJ, Bjornsdottir G, Kristjansson K, Skuladottir H, Isaksson HJ, Gudbjartsson T, Jones GT, Mueller T, Gottsater A, Flex A, Aben KK, de Vegt F, et al: A variant associated with nicotine dependence, lung cancer and peripheral arterial disease. Nature. 2008, 452: 638-642. 10.1038/nature06846.
- Wang Y, Broderick P, Webb E, Wu X, Vijayakrishnan J, Matakidou A, Qureshi M, Dong Q, Gu X, Chen WV, Spitz MR, Eisen T, Amos CI, Houlston RS: Common 5p15.33 and 6p21.33 variants influence lung cancer risk. Nat Genet. 2008, 40: 1407-1409. 10.1038/ng.273.
- Meigs JB, Shrader P, Sullivan LM, McAteer JB, Fox CS, Dupuis J, Manning AK, Florez JC, Wilson PW, D'Agostino RB, Cupples LA: Genotype score in addition to common risk factors for prediction of type 2 diabetes. N Engl J Med. 2008, 359: 2208-2219. 10.1056/NEJMoa0804742.
- Wray NR, Goddard ME, Visscher PM: Prediction of individual genetic risk to disease from genome-wide association studies. Genome Res. 2007, 17: 1520-1528. 10.1101/gr.6665407.
- Bach PB, Kattan MW, Thornquist MD, Kris MG, Tate RC, Barnett MJ, Hsieh LJ, Begg CB: Variations in lung cancer risk among smokers. J Natl Cancer Inst. 2003, 95: 470-478. 10.1093/jnci/95.6.470.
- Cassidy A, Myles JP, van Tongeren M, Page RD, Liloglou T, Duffy SW, Field JK: The LLP risk model: an individual risk prediction model for lung cancer. Br J Cancer. 2008, 98: 270-276. 10.1038/sj.bjc.6604158.
- Spitz MR, Hong WK, Amos CI, Wu X, Schabath MB, Dong Q, Shete S, Etzel CJ: A risk model for prediction of lung cancer. J Natl Cancer Inst. 2007, 99: 715-726. 10.1093/jnci/djk153.
- Tammemagi CM, Pinsky PF, Caporaso NE, Kvale PA, Hocking WG, Church TR, Riley TL, Commins J, Oken MM, Berg CD, Prorok PC: Lung cancer risk prediction: prostate, lung, colorectal and ovarian cancer screening trial models and validation. J Natl Cancer Inst. 2011, 103: 1058-1068. 10.1093/jnci/djr173.
- Medland SE, Nyholt DR, Painter JN, McEvoy BP, McRae AF, Zhu G, Gordon SD, Ferreira MA, Wright MJ, Henders AK, Campbell MJ, Duffy DL, Hansell NK, Macgregor S, Slutske WS, Heath AC, Montgomery GW, Martin NG: Common variants in the trichohyalin gene are associated with straight hair in Europeans. Am J Hum Genet. 2009, 85: 750-755. 10.1016/j.ajhg.2009.10.009.
- Zheng W, Wen W, Gao YT, Shyr Y, Zheng Y, Long J, Li G, Li C, Gu K, Cai Q, Shu XO, Lu W: Genetic and clinical predictors for breast cancer risk assessment and stratification among Chinese women. J Natl Cancer Inst. 2010, 102: 972-981. 10.1093/jnci/djq170.
- Matakidou A, Eisen T, Houlston RS: Systematic review of the relationship between family history and lung cancer risk. Br J Cancer. 2005, 93: 825-833. 10.1038/sj.bjc.6602769.
- Zhang Y, Shu XO, Gao YT, Ji BT, Yang G, Li HL, Kilfoy B, Rothman N, Zheng W, Chow WH: Family history of cancer and risk of lung cancer among nonsmoking Chinese women. Cancer Epidemiol Biomarkers Prev. 2007, 16: 2432-2435. 10.1158/1055-9965.EPI-07-0398.
- Jin G, Xu L, Shu Y, Tian T, Liang J, Xu Y, Wang F, Chen J, Dai J, Hu Z, Shen H: Common genetic variants on 5p15.33 contribute to risk of lung adenocarcinoma in a Chinese population. Carcinogenesis. 2009, 30: 987-990. 10.1093/carcin/bgp090.
- Kohno T, Kunitoh H, Shimada Y, Shiraishi K, Ishii Y, Goto K, Ohe Y, Nishiwaki Y, Kuchiba A, Yamamoto S, Hirose H, Oka A, Yanagitani N, Saito R, Inoko H, Yokota J: Individuals susceptible to lung adenocarcinoma defined by combined HLA-DQA1 and TERT genotypes. Carcinogenesis. 2010, 31: 834-841. 10.1093/carcin/bgq003.
- Wu C, Hu Z, Yu D, Huang L, Jin G, Liang J, Guo H, Tan W, Zhang M, Qian J, Lu D, Wu T, Lin D, Shen H: Genetic variants on chromosome 15q25 associated with lung cancer risk in Chinese populations. Cancer Res. 2009, 69: 5065-5072.
- Miki D, Kubo M, Takahashi A, Yoon KA, Kim J, Lee GK, Zo JI, Lee JS, Hosono N, Morizono T, Tsunoda T, Kamatani N, Chayama K, Takahashi T, Inazawa J, Nakamura Y, Daigo Y: Variation in TP63 is associated with lung adenocarcinoma susceptibility in Japanese and Korean populations. Nat Genet. 2010, 42: 893-896. 10.1038/ng.667.
- You M, Wang D, Liu P, Vikis H, James M, Lu Y, Wang Y, Wang M, Chen Q, Jia D, Liu Y, Wen W, Yang P, Sun Z, Pinney SM, Zheng W, Shu XO, Long J, Gao YT, Xiang YB, Chow WH, Rothman N, Petersen GM, de Andrade M, Wu Y, Cunningham JM, Wiest JS, Fain PR, Schwartz AG, Girard L, et al: Fine mapping of chromosome 6q23-25 region in familial lung cancer families reveals RGS17 as a likely candidate gene. Clin Cancer Res. 2009, 15: 2666-2674. 10.1158/1078-0432.CCR-08-2335.
- Colditz GA, Atwood KA, Emmons K, Monson RR, Willett WC, Trichopoulos D, Hunter DJ: Harvard report on cancer prevention volume 4: Harvard cancer risk index. risk index working group, Harvard center for cancer prevention. Cancer Causes Control. 2000, 11: 477-488. 10.1023/A:1008984432272.
- Pawitan Y, Seng KC, Magnusson PK: How many genetic variants remain to be discovered?. PLoS One. 2009, 4: e7969. 10.1371/journal.pone.0007969.
- Roberson ED, Bowcock AM: Psoriasis genetics: breaking the barrier. Trends Genet. 2010, 26: 415-423. 10.1016/j.tig.2010.06.006.
- Dong J, Zhibin H, Chen W, Guo H, Zhou B, Lv J, Daru L, Chen K, Shi Y, Chu M, Wang C, Zhang R, Dai J, et al: Association analyses identify multiple new lung cancer susceptibility loci and their interactions with smoking in the Chinese population. Nat Genet. 2012, 44: 895-899. 10.1038/ng.2351.
- Zhibin H, Chen W, Shi Y, Guo H, Zhao X, Yin Z, Yang L, Dai J, Lingmin H, Tan W, Li Z, Deng Q, Wang J, et al: A genome-wide association study identifies two new lung cancer susceptibility loci at 13q12.12 and 22q12.2 in Han Chinese. Nat Genet. 2011, 43: 792-796. 10.1038/ng.875.
- Shi J, Chatterjee N, Rotunno M, Wang Y, Pesatori AC, Consonni D, Li P, Wheeler W, Broderick P, Henrion M, Eisen T, Wang Z, Chen W, et al: Inherited variation at chromosome 12p13.33, including RAD52, influences the risk of squamous cell lung carcinoma. Cancer Discovery. 2011, 2: 131-139.
- The pre-publication history for this paper can be accessed here: http://www.biomedcentral.com/1471-2350/13/118/prepub