Department of Biostatistics
Johns Hopkins Bloomberg School of Public Health
615 N. Wolfe St.
Baltimore, MD 21205
Office: E3535
Phone: (410) 955-2468
Fax: (410) 955-0958
Email: rpeng at jhsph.edu
URL: http://www.biostat.jhsph.edu/~rpeng/
Education
- Ph.D. Statistics, University of California, Los Angeles, 2003
- M.S. Statistics, University of California, Los Angeles, 2001
- B.S. Applied Mathematics, Yale University, 1999
Experience
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health
Assistant Professor, 2005–present
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health
Postdoctoral Fellow, 2003–2005
Member of the Environmental Biostatistics and Epidemiology Group.
- Department of Statistics, UCLA
Graduate Student Researcher, 2000–2003
- Logicon INRI/Northrop Grumman
Software Engineer, UnderSea Warfare Systems and Cartography, 1998 (summer)
- KenCast, Inc.
Software Engineer, 1997, 1999 (summer)
Awards
- Faculty Innovation Fund Award, JHSPH, 2006
- UCLA Charles E. and Sue K. Young Graduate Student Award, 2003
- UCLA Dissertation Year Fellowship, 2002
- Winner, ASA Student Paper Competition in Statistical Computing and Graphics, 2002
- Teaching Assistant of the Year, UCLA Department of Statistics, 2000
Publications
Data and software for my research projects can be found at my Reproducible Research Repository. If you cannot obtain a copy of an article (for example, if you do not have a subscription), please send me an email.
- Peng RD, Eckel SP. "Distributed reproducible research using cached computations," in revision.
[PDF]
- Bell ML, Ebisu K, Peng RD, Walker J, Samet JM, Zeger SL, Dominici F. "Seasonal and regional short-term effects of fine particles on hospital admissions in 202 U.S. counties, 1999–2005," American Journal of Epidemiology, to appear.
- Samoli E, Peng RD, Ramsay T, Pipikou M, Touloumi G, Dominici F, Burnett R, Cohen A, Krewski D, Samet J, Katsouyanni K. "Acute effects of ambient particulate matter on mortality in Europe and North America: Results from the APHENA study," Environmental Health Perspectives, to appear.
- Eckel SP, Peng RD. "Interacting with local and remote data repositories using the stashR package," Computational Statistics, to appear.
[PDF | Journal PDF]
- Welty LJ, Peng RD, Zeger SL, Dominici F. "Bayesian distributed lag models: Estimating effects of particulate matter air pollution on daily mortality," Biometrics, to appear.
[PubMed | PDF]
- Katsouyanni K, Samet JM, Anderson HR, Atkinson R, Le Tertre A, Medina S, Samoli E, Touloumi G, Burnett RT, Krewski D, Ramsay T, Dominici F, Peng RD, Schwartz J, Zanobetti A. Air pollution and health: a North American and European approach (APHENA), Research Report, Health Effects Institute, Boston MA, to appear.
- Peng RD, Dominici F, Welty LJ. "A Bayesian hierarchical distributed lag model for estimating the time course of hospitalization risk associated with particulate matter air pollution," Journal of the Royal Statistical Society, Series C, to appear.
[PDF]
- Peng RD (2008). "Caching and distributing statistical analyses in R," Journal of Statistical Software, 26 (7), 1–24.
[PDF | Supplementary Materials]
- Peng RD, Chang HH, Bell ML, McDermott A, Zeger SL, Samet JM, Dominici F (2008). "Coarse particulate matter air pollution and hospital admissions for cardiovascular and respiratory diseases among Medicare patients," Journal of the American Medical Association, 299 (18), 2172–2179. (NIEHS Extramural Paper of the Month, July 2008)
[PubMed | full text | PDF | Supplementary Materials | press release]
- Peng RD (2008). "A method for visualizing multivariate time series data," Journal of Statistical Software, 25 (Code Snippet 1), 1–17. (Downloaded 1156 times since March 2008)
[PDF | Supplementary Materials | code]
- Dominici F, Peng RD, Ebisu K, Zeger SL, Samet JM, Bell ML (2007). "Does the effect of PM10 on mortality depend on PM nickel and vanadium content? A re-analysis of the NMMAPS data," Environmental Health Perspectives, 115 (12), 1701–1703. doi:10.1289/ehp.10737
[PubMed | PDF]
- Dominici F, Peng RD, Zeger SL, White RH, Samet JM (2007). "Particulate air pollution and mortality in the United States: Did the risks change from 1987 to 2000? (with discussion)," American Journal of Epidemiology, 166 (8), 880–888. doi:10.1093/aje/kwm222 (NIEHS Extramural Paper of the Month, December 2007)
[PubMed | PDF]
- Peng RD, Dominici F, Zeger SL (2006). "Reproducible epidemiologic research," American Journal of Epidemiology, 163 (9), 783–789. doi:10.1093/aje/kwj093
[PubMed | full text | PDF | press release]
- Peng RD (2006). "Interacting with data using the filehash package," R News, 6 (4), 19–24.
[PDF]
- Dominici F, Peng RD, Bell ML, Pham L, McDermott A, Zeger SL, Samet JM (2006). "Hospital admissions and fine particulate air pollution—Reply," Journal of the American Medical Association, 296, 1966–1967.
[full text | PDF | related news article]
- Zeger SL, McDermott A, Dominici F, Peng RD, Samet JM (2006). Internet-Based Health and Air Pollution Surveillance System, Communication 12, Health Effects Institute, Boston MA.
[Abstract | PDF | Website]
- Bell ML, Peng RD, Dominici F (2006). "The exposure-response curve for ozone and risk of mortality and the adequacy of current ozone regulations," Environmental Health Perspectives, 114, 532–536. doi:10.1289/ehp.8816
[PubMed | synopsis | PDF]
- Dominici F, Peng RD, Bell ML, Pham L, McDermott A, Zeger SL, Samet JM (2006). "Fine particulate air pollution and hospital admission for cardiovascular and respiratory diseases," Journal of the American Medical Association, 295 (10), 1127–1134.
[PubMed | full text | PDF | press release | Supplementary Materials]
- Zeger SL, Irizarry R, Peng RD (2006). "On time series analysis of public health and biomedical data," Annual Review of Public Health, 27, 57–79. doi:10.1146/annurev.publhealth.26.021304.144517
[PubMed | PDF]
- Peng RD, Dominici F, Louis TA (2006). "Model choice in time series studies of air pollution and mortality (with discussion)," Journal of the Royal Statistical Society, Series A, 169 (2), 179–203. doi:10.1111/j.1467-985X.2006.00410.x (#2 most cited paper in JRSS-A, 2005–2008)
[data | PDF]
- Peng RD, Dominici F, Pastor-Barriuso R, Zeger SL, Samet JM (2005). "Seasonal analyses of air pollution and mortality in 100 U.S. cities," American Journal of Epidemiology, 161 (6), 585–594. doi:10.1093/aje/kwi075
[PubMed | full text | PDF]
- Peng RD, Schoenberg FP, Woods JA (2005). "A space-time conditional intensity model for evaluating a wildfire hazard index," Journal of the American Statistical Association, 100 (469), 26–35.
[PDF]
- Peng RD, Welty LJ (2004). "The NMMAPSdata package," R News, 4 (2), 10–14.
[PDF]
- Peng RD (2003). "Multi-dimensional point process models in R," Journal of Statistical Software, 8 (16), 1–27.
[PDF]
- Schoenberg FP, Peng R, Woods J (2003). "On the distribution of wildfire sizes," Environmetrics, 14 (6), 583–592.
[PDF]
- Schoenberg FP, Peng R, Huang Z, Rundel P (2003). "Detection of nonlinearities in the dependence of burn area on fuel age and climatic variables," International Journal of Wildland Fire, 12 (1), 1–6.
[PDF]
- Peng RD, Hengartner NW (2002). "Quantitative analysis of literary styles," The American Statistician, 56 (3), 175–185.
[data | PDF]
Books
- Peng RD, Dominici F (2008). Statistical Methods
for Environmental Epidemiology in R: A Case Study in Air
Pollution and Health, Springer.
[Publisher's Website | Order from Amazon]
Software
- mvtsplot: A function for plotting multivariate time series data
- cacher: Tools for caching and distributing statistical analyses in R
- SRPM: A package development and management system for distributed reproducible research
- NMMAPSdata: Daily mortality, air pollution, and weather data from the National Morbidity, Mortality, and Air Pollution Study.
- NMMAPSlite: NMMAPS data made available via 'stashR' databases
- filehash: A file-based hash table for R
- filehashSQLite: Filehash databases using the SQLite database backend
- cacheSweave: Caching computations in Sweave
- stashR: A Set of Tools for Administering SHared Repositories
- gpclib: An R package for clipping complex polygons
- simpleboot: Simple bootstrapping routines in R
- ptproc: An R package for analyzing multi-dimensional point process models.
Workshops and Short Courses
- Integrating Computing into the Statistics Curriculum, UC Berkeley, July 2008
- NSF Computing in Statistics Workshop, MSRI, May 2007
- Summer Program in Statistics for Undergraduates, UCLA, June 2006
Short course on statistical analysis of air pollution and health data
Invited Presentations
- Interface, Durham, May 2008
Statistical methods for estimating the health risks of particulate matter components
- American Public Health Association Annual Meeting, Washington DC, November 2007
Model choice in time series studies of air pollution and health
- Joint Statistical Meetings, Salt Lake City, August 2007
Estimating the distributed lag between air pollution and hospitalization using a Bayesian hierarchical model
- Health Effects Institute Annual Meeting, Chicago, April 2007
Air Pollution and Health: A Combined European and North American Approach
- Health Effects Institute Annual Meeting, San Francisco, April 2006
- ENAR, Tampa, March 2006
Model choice in time series studies of air pollution and health
- Joint Statistical Meetings, Minneapolis, August 2005
Spatial modeling of environmental exposures, diseases, and confounders using national databases
- WNAR, Fairbanks, June 2005
Model choice in time series studies of air pollution and health
- Health Effects Institute Annual Meeting, Baltimore, April 2005
Combined acute effects of ozone on the daily number of deaths among people older than 75 years in European and North American Cities [with the APHENA group]
- Department of Health Studies, The University of Chicago, February 2005
- Department of Statistics, North Carolina State University, February 2005
- Department of Biostatistics, Harvard School of Public Health, January 2005
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, January 2005
Do current air pollution levels affect human health? Statistical and computational models for estimating air pollution health effects on a national scale.
- Environmental Biostatistics and Epidemiology Group, Johns Hopkins Bloomberg School of Public Health, July 2004
The NMMAPSdata R package and reproducible research
- Mini-workshop on Climate and Health, National Center for Atmospheric Research, May 2004
Seasonal analyses of PM10 and mortality [PDF]
- Health Effects Institute Annual Meeting, May 2004
What models is APHENA using? [with the APHENA group]
- Department of Biostatistics, University of Pittsburgh Graduate School of Public Health, February 2004
Space-time point process models for evaluating a wildfire hazard index [PDF]
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, September 2003
Residual analysis for point process models and applications to wildfire hazard assessment
- Point Processes: Theory and Applications, Banff International Research Station, June 2003
- Department of Statistics, UCLA, May 2003
- Statistical Sciences Group, Los Alamos National Laboratory, February 2003
- Geophysical Statistics Project, National Center for Atmospheric Research, February 2003
- Environmental Biostatistics Working Group, Johns Hopkins Bloomberg School of Public Health, January 2003
Multi-dimensional point process models for evaluating a wildfire hazard index
- GSO Seminar, Department of Mathematics, UCLA, November 2000
Multivariate analysis applied to differences in literary style
Contributed Talks
- Joint Statistical Meetings, Seattle, August 2006
Model choice in time series studies of air pollution and mortality
- Joint Statistical Meetings, Toronto, August 2004
Seasonal analyses of air pollution and mortality in 100 U.S. cities [PDF]
- Joint Statistical Meetings, San Francisco, August 2003
Evaluating a wildfire hazard index using point process models
- Joint Statistical Meetings, New York, August 2002
Estimating the renewal distribution of spatial-temporal coverage process
- Joint WNAR/IMS meeting, UCLA, June 2002
Estimating the fire interval distribution for Los Angeles County, California
- Hawaii International Conference on Statistics and Related Fields, June 2002
Estimating the fire interval distribution for Los Angeles County, California
- Forest Fires 2001: Operational Mechanisms, Firefighting Means and New Technologies, Athens, Greece, March 2001
Estimation of hazard using time-since-fire and spatial-temporal wildfire data
Editorial Activities
- Associate Editor, Biostatistics, 2006–present
- Associate Editor, Journal of Statistical Software, 2004–2006
- Reviewer for: Journal of the American Statistical Association, Biostatistics, Statistical Modelling, Computational Statistics and Data Analysis, Environmental Health Perspectives, Risk Analysis, Journal of Epidemiology and Community Health, American Journal of Epidemiology, Epidemiology, Springer (book proposals)
- Regular contributor to R-help and R-devel mailing lists
Teaching and Service
Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health
- Primary instructor
- Biostat 140.776 (Introduction to Statistical Computing)
- Biostat 140.778 (Advanced Statistical Computing)
- Co-instructor for 2005 Winter Institute Data Analysis Workshop I & II
- Member of Biostatistics Information Technology committee
Department of Statistics, UCLA
- Teaching Assistant Coordinator (2000–2001)
- Teaching Assistant (1999–2001)
- Organized workshop and weekly colloquia on using the R Statistical Computing Environment
- Department representative to the Math and Physical Sciences Council of the UCLA Graduate Students Association
Research Interests
- Environmental biostatistics; health effects of air pollution
- Point processes — residual analysis methodology, conditional intensity modeling, estimation via likelihood methods, assessment of multi-dimensional models, applications to wildfire prediction
- Statistical computing; software engineering; reproducible research; data structures
Computer Interests
- Operating Systems: GNU/Linux, Solaris, HP-UX, AIX, Windows, Mac OS X
- Languages and Statistical Packages: C, Perl, R, XLisp-Stat
- Other: GNU development tools (Emacs, gcc, etc.), LaTeX
Other
- Memberships: American Statistical Association, International Biometric Society (ENAR), Free Software Foundation, Electronic Frontier Foundation, R Foundation for Statistical Computing
- Citizenship: United States
- Languages: English (native), some Chinese and Spanish
- Personal Interests: Violin