Info:
Assistant Professor
Department of Biostatistics
Mailman School of Public Health
Columbia University
722 West 168th Street, 6th Floor
New York, NY 10032
Email: qc2138@cumc.columbia.edu
Phone: 212-342-1245
Research Interests:
Survey sampling; missing data; Bayesian statistics; latent class analysis; population health; environmental health sciences.
Education:
2009, Ph.D. in Biostatistics, University of Michigan
2004, M.S. in Applied Statistics, Bowling Green State University
2002, B.A. in Economics, Nankai University
Professional Services:
Associate Editor for the Journal of the Royal Statistical Society: Series C.
Honors and Awards:
2016, Career Development Award, the NIEHS Center for Environmental Health in Northern Manhattan
2011, Best Contributed Paper Award, the Statistics and Data Analysis Section of the SAS Global Forum
2010, Calderone Research Prize for Junior Faculty, Columbia University
2010, Department of Biostatistics Teaching Award, Columbia University
2009, Edward C. Bryant Scholarship, American Statistical Association
2008, Student Paper Competition Award, Social Statistics Section/Section on Government Statistics/Survey Research Methods Section, Joint Statistical Meetings
2007, Otto Hutzinger Award, 27th International Symposium on Halogenated Persistent Organic Pollutants, Tokyo, Japan
Papers
‡ denotes students or mentees.
Statistical Methods Papers
- Chen, Q., Elliott, M., Haziza, D., Yang, Y., Ghosh, M., Little, R. J. A., Sedransk, J., and Thompson, M. Weights and estimation of a survey population mean: a review,
Statistical Science, submitted.
- Chen, Q., Paik, M. C., Kim, M., and Wang, C. (2016). Using link-preserving imputation for logistic partially linear models with missing covariates,
Computational Statistics and Data Analysis,
101, 174-185.
- Chen, Q., Gelman, A., Tracy, M., Norris, F., and Galea, S. (2015). Incorporating the sampling design in weighting adjustments for panel attrition,
Statistics in Medicine, 34(28), 3637-3647.
- DiMaggio, C., Chen, Q., Muennig, P. A., Li, G. (2014). Timing and effect of a safe routes to school program on child pedestrian injury risk during school travel hours: Bayesian changepoint and difference-in-differences analysis,
Injury Epidemiology, 1, 1-17.
- Chen, Q. and Wang, S. (2013). Variable selection for multiply-imputed data with application to dioxin exposure study,
Statistics in Medicine, 32, 3646-59.
- Chen, Q., Galfalvy, H., and Duan, N. (2013). Effects of disease misclassification on exposure-disease association,
American Journal of Public Health, 103, e67-e73.
- Chen, Q., Elliott, M. R., and Little, R. J. A. (2012). Bayesian inference of finite population quantiles from unequal probability samples,
Survey Methodology, 38, 203-214.
- Chen, Q., Elliott, M. R., and Little, R. J. A. (2010). Bayesian penalized spline model-based inference for finite population proportion in unequal probability sampling,
Survey Methodology, 36, 23-34.
- Chen, Q., Garabrant, D., Hedgeman, E., Little, R. J. A., Elliott, M. R., Gillespie, B., Hong, B., Lee, S., Lepkowski, J., Franzblau, A., Adriaens, P., Demond, A., and Patterson, D. (2010). Estimation of background serum 2,3,7,8-TCDD concentrations by using quantile regression in the UMDES and NHANES populations,
Epidemiology, 21, S51-S57.
- Gillespie, B. W., Chen, Q., Reichert, H., Franzblau, A., Lepkowski, J., Adriaens, P., Demond, A., Luksemburg, W., and Garabrant, D. (2010). Estimating population distributions when some data are below a limit of detection by using a reverse Kaplan-Meier estimator,
Epidemiology, 21, S64-S70.
- McGrath, R.N. and Chen, Q. (2008). Sample size determination for a relative quality improvement,
Quality Engineering, 20, 309-320.
Environmental Health Sciences Papers
- Chen, Q., Zhong, X., Acosta, L., Divjan, A., Rundle, A., Goldstein, I.F., Miller, R. L., Perzanowski, M. S. (2016). Allergic sensitization patterns identified through Latent Class Analysis among NYC asthmatic and non-asthmatic children,
Annals of Allergy, Asthma and Immunology,
116(3), 212-218.
- Just, A.C.‡, Miller, R. L., Perzanowski, M. S., Rundle, A. G., Chen, Q., Jung, K. H., Hoepner, L., Camann, D. E., Calafat, A. M., Perera, F. P., Whyatt, R. M. (2015). Vinyl flooring in the home is associated with children's airborne butylbenzyl phthalate and urinary metabolite concentrations,
Journal of Exposure Science and Environmental Epidemiology,
25(6), 574-579.
- Chen, Q., Jiang, X., Garabrant, D., Hedgeman, E., Gillespie, B., Hong, B., Lepkowski, J., Franzblau, A., and Jolliet, O. (2013). Estimation of age- and sex-specific background human serum concentrations of PCDDs, PCDFs, and PCBs in the UMDES and NHANES populations,
Chemosphere, 91, 817-823.
- Chen, Q., Just, A. C.‡, Miller, R. L., Perzanowski, M. S., Goldstein, I. F., Perera, F. P., Whyatt, R. M. (2012). Using latent class growth analysis to identify childhood wheeze phenotypes in an urban birth cohort,
Annals of Allergy, Asthma & Immunology, 108, 311-315.e1.
- Just, A. C.‡, Whyatt, R. M., Miller, R. L., Rundle, A. G., Chen, Q., Calafat, A. M., Perera, F. P., Goldstein, I. F., Perzanowski, M. S. (2012). Children's urinary phthalate metabolites and fractional exhaled nitric oxide in an urban cohort,
American Journal of Respiratory and Critical Care Medicine, 186, 830-837.
- Just, A. C.‡, Whyatt, R. M., Perzanowski, M. S., Calafat, A. M., Perera, F. P., Goldstein, I. F., Chen, Q., Rundle, A. G., and Miller, R. L. (2012). Prenatal exposure to Butylbenzyl Phthalate and early eczema in an urban cohort,
Environmental Health Perspectives, 120, 1475-1480.
- Demond, A., Franzblau, A., Garabrant, D., Jiang, X., Adriaens, P., Chen, Q., Gillespie, B., Hao, W., Hong, B., Jolliet, O., Lepkowski, J. (2012). Human exposure from dioxins in soil,
Environmental Science and Technology, 46, 1296-302.
- Franzblau, A., Hedgeman, E., Jolliet, O., Knutson, K., Towey, T., Chen, Q., Hong, B., Adriaens, P., Demond A., Garabrant D., Gillespie B., and Lepkowski J. (2010). Case report: The University of Michigan Dioxin Exposure Study: a follow-up investigation of a case with high serum concentration of 2,3,4,7,8-pentachlorodibenzofuran,
Environmental Health Perspectives, 118, 1313-1317.
- Demond, A., Towey, T., Adriaens, P., Zhong, X., Knutson, K., Chen, Q., Hong, B., Gillespie, B., Franzblau, A., Garabrant, D., Lepkowski, J., Luksemburg, W., and Maier, M. (2010). Relationship between PCDDs, PCDFs, and dioxin-like PCBs concentration in vegetation and soil on residential properties,
Environmental Toxicology and Chemistry, 29, 2660-2668.
- Franzblau, A., Towey, T., Demond, A., Adriaens, P., Chang, S-C., Luksemburg, W., Maier, M., Garabrant, D., Gillespie, B., Lepkowski, J., Chang, C-W., Chen, Q., Gwinn, D., Hong, B., and Lee, S-Y. (2009). Residences with anomalous soil concentrations of dioxin-like compounds in two communities in Michigan, USA: a case study,
Chemosphere, 74, 395-403.
- Franzblau, A., Zwica, L., Knutson, K., Chen, Q., Lee, S-Y., Hong, B., Adriaens, P., Demond, A., Garabrant, D., Gillespie, B., Lepkowski, J., Luksemburg, W., Maier, M., and Towey, T. (2009). An investigation of homes with high concentrations of PCDDs, PCDFs and/or dioxin-like PCBs in house dust,
Journal of Occupational and Environmental Hygiene, 3, 188-199.
- Hong, B., Garabrant, D., Hedgeman, E., Demond, A., Gillespie, B., Chen, Q., Chang, C., Towey, T., Knutson, K., Franzblau, A., Lepkowski, J., and Adriaens, P. (2009). Impact of WHO 2005 revised toxic equivalency factors for dioxins on the TEQs in serum, household dust and soil,
Chemosphere, 76, 727-733.
- Garabrant, D., Franzblau, A., Lepkowski, J., Gillespie, B., Adriaens, P., Demond, A., Hedgeman, E., Knutson, K., Zwica, L., Olson, K., Towey, T., Chen, Q., Hong, B., Chang, C-W., Lee, S-Y., Ward, B., LaDronka, K., Luksemburg, W., and Maier, M. (2009). The University of Michigan Dioxin Exposure Study: predictors of human serum dioxin concentrations in Midland and Saginaw, Michigan,
Environmental Health Perspectives, 117, 818-824.
- Hedgeman, E., Chen, Q., Hong, B., Chang C-W., Olson K., LaDronka, K., Ward, B., Adriaens, P., Demond, A., Gillespie, B., Lepkowski, J., Franzblau, A., and Garabrant, D. (2009). The University of Michigan Dioxin Exposure Study: population survey results and serum concentrations for polychlorinated dioxins, furans and biphenyls,
Environmental Health Perspectives, 117, 811-817.
- Garabrant, D., Franzblau, A., Lepkowski, J., Gillespie, B., Adriaens, P., Demond, A., Ward, B., LaDronka, K., Hedgeman, E., Knutson K., Zwica, L., Olson, K., Towey, T., Chen, Q., and Hong, B. (2009). The University of Michigan Dioxin Exposure Study: methods for an environmental exposure study of polychlorinated dioxins, furans and biphenyls,
Environmental Health Perspectives, 117, 803-810.
- Garabrant, D., Aylward, L., Berent, S., Chen, Q., Timchalk, C., Burns, C., Hays, S., and Albers, J. (2009). Cholinesterase inhibition in chlorpyrifos workers: characterization of biomarkers of exposure and response in relation to urinary TCPy,
Journal of Exposure Science and Environmental Epidemiology, 19, 634-642.
- Franzblau, A., Hedgeman, E., Chen, Q., Lee, S., Adriaens, P., Demond, A., Garabrant, D., Gillespie, B., Hong, B., Jolliet, O., Lepkowski, J., Luksemburg, W., Maier, M., and Wenger, Y. (2008). Case report: human exposure to dioxins from clay,
Environmental Health Perspectives, 116, 238-242.
- Demond, A., Adriaens, P., Towey, T., Chang, S., Hong, B., Chen, Q., Chang C.-W., Franzblau, A., Garabrant D., Gillespie, B., Hedgeman E., Knutson, K., Lee, C., Lepkowski, J., Olson, K., Ward, B., Zwica L., Luksemburg W., and Maier M. (2008). Statistical comparison of residential soil concentrations of PCDDs, PCDFs, and PCBs from two communities in Michigan,
Environmental Science and Technology, 42, 5441-5448.
Psychiatry Papers
- Owen, J. P., Bukshpun, P., Pojman, N., Wakahiro, M., Chen, Q., D'Angelo, D.‡, Glenn, O., Hunter, J., Berman, J., Roberts, T., Buckner, R., Nagarajan, S. S., Mukherjee, P., and Sherr, E. H.
Clinical brain imaging findings and associated outcomes in carriers of the reciprocal CNV at 16p11.2, Radiology, submitted.
- Sumner, J. A., Kubzansky, L.D., Roberts, A.L., Gilsanz, P., Chen, Q., Winning, A., Forman, J.P., Rimm, E. B., Koenen, K. C. (2016).
Posttraumatic stress disorder symptoms and risk of hypertension over 22 years in a large cohort of younger and middle-aged women, Psychological Medicine, in press.
- Steinman, K., Spence, S., Ramocki, M., Proud, M., Kessler, S., Marco, E., Green Snyder, L., D'Angelo, D.‡, Chen, Q., Chung, W. and Sherr, E. (2016).
16p11.2 deletion and duplication: characterizing neurologic phenotypes in a large clinically-ascertained cohort, American Journal of Medical Genetics, in press.
- Snyder, L. G., D'Angelo, D.‡, Chen, Q., Bernier, R., Goin-Kochel, R. P., Wallace, A. S., Gerdts, J., Kanne, S., Berry, L., Snow-Gallagher, A., Sherr, E., Roberts, T., Martin, C. L., Ledbetter, D. H., Spiro, J. E., Chung, W. K., Hanson, E. (2016). Characterizing the 16p11.2 duplication, Journal of Autism and Developmental Disorders, in press.
- Fink, D. S., Chen, Q., Liu, Y.‡, Tamburrino, M. B., Galea, S., Liberzon, I., Shirley, E., Fine, T., Cohen, G. H., Calabrese, J. R. (2016).
Incidence and risk for mood and anxiety disorders in a representative sample of Ohio Army National Guard members, Public Health Reports, 131, 614-622.
- Sumner, J. A., Kubzansky, L. D., Kabrhel, C., Roberts, A. L., Chen, Q., Winning, A., Gilsanz, P., Rimm, E. B., Glymour, M. M., Koenen, K. C. (2016). Associations of trauma exposure and posttraumatic stress symptoms with venous thromboembolism over 22 years in women,
Journal of the American Heart Association, 5(5), e003197.
- D'Angelo, D.‡, Lebon, S., Chen, Q. (co-first authors), Martin-Brevet, S., Snyder, G. L., Hippolyte, L., Hanson, E., Maillard, A. M., Faucett, W. A., Mac, A., Pain, A., Bernier, R., Chawner, S., Albert, D., Andrieux, J., Aylward, E., Baujat, G., Caldeira, I., Conus, P., Ferrari, C., Forzano, F., Gérard, M., Goin-Kochel, R. P., Grant, E., Hunter, J. V., Isidor, B., Jacquette, A., Jonch, A., Keren, B., Lacombe, D., Caignec, C. L., Martin, C. L., Männik, K., Metspalu, A., Mignot, C., Mukherjee, P., Owen, M., Passeggeri, M., Thambo, C. R., Spence, S. J., Steinman, K. L., Tjernage, J., Van Haelst, M., Yiping, S., Sherr, E. H., Ledbetter, D. H., Van Den Bree, M., Beckmann, J. S., Spiro, J. E., Reymond, A., Jacquemont, S., Chung, W. K. (2016). Defining the effect of the 16p11.2 duplication on cognition, behavior, and medical comorbidities,
JAMA Psychiatry, 73(1), 20-30.
- Sumner, J. A., Kubzansky, L. D., Elkind, M. S. V., Roberts, A. L., Agnew-Blais, J., Chen, Q., Cerda, M., Rexrode, K. M., Rich-Edwards, J.W., Spiegelman, D., Suglia, S. F., Rimm, E. B., Koenen, K. C. (2015). Trauma exposure and posttraumatic stress disorder symptoms predict onset of cardiovascular events in women,
Circulation, 132(4), 251-9.
- Hanson, E., Bernier, R., Porche, K., Jackson, F. I., Goin-Kochel, R. P., Snyder, L. G., Snow, A. V., Wallace, A. S., Campe, K. L., Zhang, Y.‡, Chen, Q., D'Angelo, D.‡, Moreno-De-Luca, A., Orr, P. T., Boomer, K.B., Evans, D. W., Kanne, S., Berry, L., Miller, F. K., Olson, J., Sherr, E., Martin, C. L., Ledbetter, D. H., Spiro, J. E., Chung, W. K. (2015). The cognitive and behavioral phenotype of the 16p11.2 deletion in a clinically ascertained population,
Biological Psychiatry, 77(9), 785-793.
- Gill, R., Chen, Q., D'Angelo, D.‡, and Chung, W. K. (2014). Eating in the absence of hunger but not loss of control behaviors are associated with 16p11.2 deletions,
Obesity, 22(12), 2625-31.
- The Simons VIP Consortium (2012). Simons Variation in Individuals Project (Simons VIP): A genetics-first approach to studying autism spectrum and related neurodevelopmental disorders,
Neuron, 73(6), 1063-1067.
Other Collaborative Papers
- Cohn, E. G., Henderson, G. E., Appelbaum, P. S., Working Group on Representation and Inclusion in Precision Medicine Studies. (2016). Distributive justice, diversity and inclusion in precision medicine: what will success look like? Genetics in Medicine, in press.
- D'Aunno, T., Pollack, H., Chen, Q., Friedmann, P. D. (2016).
Integration of addiction treatment organizations into patient-centered medical homes: results from a national survey, Medical Care, in press.
- D'Aunno, T., Friedmann, P. D., Chen, Q., Wilson, D. M. (2015). Integration of substance abuse treatment organizations into Accountable Care Organizations: results from a national survey,
Journal of Health Politics, Policy, and Law,
40(4), 795-817.
- Kruk, M. E., Hermosilla, S., Larson, E., Vail, D., Chen, Q., Mazuguni, F., Byalugaba, B., Mbaruku, G. (2015). Who is left behind on the road to universal facility delivery? A cross-sectional multilevel analysis in rural Tanzania,
Tropical Medicine and International Health, 20(8), 1057-1066.
- Aderibigbe, T., Lang, B., Rosenberg, H., Chen, Q., Li, G. (2014). Cost-effectiveness analysis of stocking dantrolene in ambulatory surgery centers for the treatment of malignant hyperthermia,
Anesthesiology, 120(6), 1333-8.
- Li, G., Brady, J. E., Chen, Q. (2013). Drug use and fatal motor vehicle crashes: A case-control study,
Accident Analysis and Prevention, 60, 205-210.
- Adusumilli, S., Hussain, H.K., Caoili, E.M., Weadock, W.J., Murray, J.P., Johnson, T.D., Chen, Q., and Desjardins, B. (2006). MRI of sonographically indeterminate adnexal masses,
American Journal of Roentgenology, 187, 732-740.
Research
Bayesian model-based survey inference
My research focuses on the development of statistical methods for the analysis of complex survey data. I am interested in developing Bayesian model-based methods that include design variables as covariates in the regression models for survey outcomes. The example below shows the estimation of the finite population cumulative distribution function (CDF) and associated quantiles under probability-proportional-to-size sampling.
A Bayesian probit penalized spline regression was used to model a smooth relationship between the CDF and the probability of selection. (a) The posterior mean and 95% credible interval of the population CDF estimate were obtained for each of 20 selected sample units. (b) A smoothed CDF was estimated by smoothing the CDF estimates in (a) using monotonic smooth cubic regression. (c) The population median was obtained by inverting the estimated smoothed CDF. See the paper [PDF].
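Step (c) can be illustrated with a minimal sketch. It assumes the smoothed CDF has already been evaluated on an increasing grid of outcome values and is nondecreasing; the function name `invert_cdf` and the linear interpolation between grid points are illustrative assumptions, not the method used in the paper.

```python
from bisect import bisect_left


def invert_cdf(grid, cdf, prob):
    """Return the value y with F(y) = prob by linear interpolation.

    grid: increasing outcome values (hypothetical evaluation points).
    cdf:  nondecreasing CDF estimates at those points.
    prob: target probability, e.g. 0.5 for the median.
    """
    if len(grid) != len(cdf) or not grid:
        raise ValueError("grid and cdf must be non-empty and the same length")
    if not cdf[0] <= prob <= cdf[-1]:
        raise ValueError("prob lies outside the range of the estimated CDF")
    # First index whose CDF value is at least prob.
    i = bisect_left(cdf, prob)
    if i == 0 or cdf[i] == prob:
        return grid[i]
    # Here cdf[i - 1] < prob < cdf[i], so the denominator is positive.
    weight = (prob - cdf[i - 1]) / (cdf[i] - cdf[i - 1])
    return grid[i - 1] + weight * (grid[i] - grid[i - 1])


# Toy values for illustration only, not data from the paper.
median = invert_cdf([0.0, 10.0, 20.0, 30.0], [0.0, 0.2, 0.6, 1.0], 0.5)
```

Applying the same inversion to the lower and upper credible bands of the CDF would give a rough interval for the quantile, although that is a simplification of a full posterior computation.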
Analysis of data with missing values
My research on missing data is broad and has been motivated by real-world problems emerging from my collaborative research. The example below shows an application of my MI-LASSO method, a variable selection method for multiply-imputed data, to the University of Michigan Dioxin Exposure Study to identify important circumstances and exposure factors that were associated with human serum dioxin concentrations in Midland, Michigan.
The MI-LASSO treated the regression coefficients of the same variable across all imputed datasets as a group and applied the group lasso penalty to yield a consistent variable selection across all imputed datasets. The graphic shows the profiles of MI-LASSO coefficients and BIC value as the shrinkage factor changes. See the paper [PDF].
Latent class analysis
I have a great interest in developing new statistical methods and in the novel application of statistical methods to population health research, especially in environmental health sciences. The example below shows the use of latent class growth analysis (LCGA) to define phenotypes of wheeze using repeated questionnaires in the Columbia Center for Children's Environmental Health birth cohort study.
The LCGA identified four wheeze phenotypes: never/infrequent (47.1%), early-transient (37.5%), early-persistent (7.6%), and late-onset (7.8%). See the paper [PDF].
National and local community-based health surveys
I have also been actively involved in the design and analysis of several national and local community-based health surveys, including the University of Michigan Dioxin Exposure Study, the National Drug Abuse Treatment System Survey, and the Ohio National Guard Study.
Software
MI-lasso for multiply-imputed data
Available as the %MI_lasso SAS macro and MI.lasso R function.
An implementation of the MI-lasso variable selection method that extends the lasso method to multiply-imputed data. The MI-lasso treats the regression coefficients of the same covariate across all imputed datasets as a group and applies the group lasso penalty to yield a consistent variable selection across all imputed datasets.
Reference:
Chen, Q. and Wang, S. (2013). Variable selection for multiply-imputed data with application to dioxin exposure study,
Statistics in Medicine, 32, 3646-59.
[PDF]
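To illustrate the grouping idea behind MI-lasso, the sketch below computes a group lasso penalty over coefficients stacked by imputed dataset and applies the standard group soft-thresholding operator to one covariate's group. It is a generic illustration of group lasso shrinkage, not the fitting algorithm of the %MI_lasso macro or the MI.lasso function; the function names and the layout `beta[d][j]` (imputed dataset d, covariate j) are assumptions made for this example.

```python
import math


def mi_group_penalty(beta, lam):
    """Group lasso penalty: lam * sum_j sqrt(sum_d beta[d][j] ** 2).

    beta[d][j] is the coefficient of covariate j in imputed dataset d
    (hypothetical layout chosen for this sketch).
    """
    n_covariates = len(beta[0])
    return lam * sum(
        math.sqrt(sum(row[j] ** 2 for row in beta)) for j in range(n_covariates)
    )


def group_soft_threshold(group, lam):
    """Shrink one covariate's coefficients across all imputed datasets.

    The whole group is set to zero when its Euclidean norm is at most lam,
    so a covariate is either kept in every imputed dataset or dropped
    from every imputed dataset.
    """
    norm = math.sqrt(sum(b * b for b in group))
    if norm <= lam:
        return [0.0] * len(group)
    scale = 1.0 - lam / norm
    return [scale * b for b in group]


# Toy coefficients for one covariate in two imputed datasets.
kept = group_soft_threshold([3.0, 4.0], 1.0)      # shrunk but retained
dropped = group_soft_threshold([0.3, 0.4], 1.0)   # removed everywhere
```

Because the threshold acts on the norm of the whole group, the selected set of covariates is the same in every imputed dataset, which is the consistency property described above.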
Logistic partially linear models with missing covariates
Available in this supplement.
We propose a new kernel-assisted estimating equation method for logistic partially linear models with missing covariates. We replace the conditional expectation in the doubly robust estimating function with an unbiased estimating function constructed using the conditional mean of the outcome given the observed data, and impute the missing covariates using the so-called link-preserving imputation models to simplify the estimation.
Reference:
Chen, Q., Paik, M. C., Kim, M., and Wang, C. (2016). Using link-preserving imputation for logistic partially linear models with missing covariates,
Computational Statistics and Data Analysis,
101, 174-185.
[PDF]
Studying missing data patterns
Available as the %missingPattern SAS macro.
The macro is designed to look at missing data in four ways: the proportion of units for each pattern of missing data, the number and percentage of missing data for each individual variable, the concordance of missingness in any pair of variables, and possible unit nonresponse. The user can customize these analyses by specifying which variables to include or exclude, and which output should be produced.
Reference:
Schwartz, T.‡, Chen, Q., and Duan, N. (2011).
Studying missing data patterns using a SAS macro,
SAS Global Forum 2011 proceedings. [PDF] (Best Contributed Paper Award, Statistics and Data Analysis Section, 2011 SAS Global Forum) ‡ Chen's student.
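The four summaries that the %missingPattern macro reports can be sketched in a few lines. This is not the macro's implementation: the function names, the use of `None` to mark a missing value, and the definition of concordance as the share of units in which two variables are both observed or both missing are assumptions made for illustration.

```python
from collections import Counter


def pattern_proportions(rows, variables):
    """Proportion of units showing each missingness pattern (True = missing)."""
    counts = Counter(tuple(row.get(v) is None for v in variables) for row in rows)
    return {pattern: count / len(rows) for pattern, count in counts.items()}


def missing_by_variable(rows, variables):
    """Number and percentage of missing values for each variable."""
    result = {}
    for v in variables:
        n_missing = sum(row.get(v) is None for row in rows)
        result[v] = (n_missing, 100.0 * n_missing / len(rows))
    return result


def concordance(rows, a, b):
    """Share of units where a and b are both observed or both missing."""
    agree = sum((row.get(a) is None) == (row.get(b) is None) for row in rows)
    return agree / len(rows)


def unit_nonresponse(rows, variables):
    """Indices of units with every listed variable missing."""
    return [i for i, row in enumerate(rows)
            if all(row.get(v) is None for v in variables)]


# Toy data for illustration only.
rows = [
    {"x": 1, "y": 2},
    {"x": None, "y": 2},
    {"x": None, "y": None},
    {"x": 1, "y": None},
]
patterns = pattern_proportions(rows, ["x", "y"])
```

Passing a subset of variable names plays the role of the macro's include and exclude options in this sketch.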
Backward selection for survey linear regression
Available as the %backward SAS macro.
A macro to do backward selection for survey regression using PROC SURVEYREG.
Reference:
Chen, Q. and Gillespie, B. (2006). A SAS macro for performing backward selection in PROC SURVEYREG,
SAS Conference Proceedings: Midwest SAS User Group. (Best Statistical Paper Award, 17th Annual Conference of the Midwest SAS Users Group)
Survey regression for multiply-imputed data
Available as the %MI_SREG SAS macro for survey weighted linear regression and the %MI_SLOGIT SAS macro for survey weighted logistic regression.
These macros are designed to use Rubin's rules to combine the regression coefficient estimates of survey linear and logistic regression models (using PROC SURVEYREG and PROC SURVEYLOGISTIC) for multiply-imputed data.
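As a minimal sketch of Rubin's rules for a single coefficient, the function below pools the point estimates and variances obtained from each imputed dataset. It is written in Python for illustration and is not the code of the %MI_SREG or %MI_SLOGIT macros; the function name and argument layout are assumptions.

```python
from statistics import mean, variance


def rubin_combine(estimates, variances):
    """Pool one coefficient across m imputed datasets using Rubin's rules.

    estimates: coefficient estimate from each imputed dataset.
    variances: squared standard error from each imputed dataset.
    Returns (pooled estimate, total variance).
    """
    m = len(estimates)
    if m < 2 or len(variances) != m:
        raise ValueError("need matching inputs from at least two imputed datasets")
    pooled = mean(estimates)          # average of the m estimates
    within = mean(variances)          # average within-imputation variance
    between = variance(estimates)     # sample variance of the m estimates
    total = within + (1.0 + 1.0 / m) * between
    return pooled, total


# Toy numbers for illustration only.
pooled, total = rubin_combine([1.0, 2.0, 3.0], [0.5, 0.5, 0.5])
```

In the survey setting, the per-dataset variances would be the design-based variance estimates reported by the survey regression procedure for each imputed dataset.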
Forward stepwise selection for multiply-imputed data
Available as the %MI_SREG_STEPWISE SAS macro for survey weighted linear regression and %MI_SLOGIT_STEPWISE SAS macro for survey weighted logistic regression.
These macros are designed to implement the stepwise selection method for multiply-imputed data using Rubin's rules for survey linear and logistic regression models. To use these two macros, the user also needs to download the %MI_SREG and %MI_SLOGIT macros.
Reference:
Chen, Q. and Wang, S. (2013). Variable selection for multiply-imputed data with application to dioxin exposure study,
Statistics in Medicine, 32, 3646-59.
[PDF]
Artwork
My recent paintings.