Grantee Research Project Results
2006 Progress Report: Data Management, Statistics and Community Impact Modeling Core
EPA Grant Number: R832141C007Subproject: this is subproject number 007 , established and managed by the Center Director under grant R832141
(EPA does not fund or establish subprojects; EPA awards and manages the overall grant for this center).
Center: Water Environment and Reuse Foundation's National Center for Resource Recovery and Nutrient Management
Center Director: Olabode, Lola
Title: Data Management, Statistics and Community Impact Modeling Core
Investigators: Perera, Frederica P.
Institution: Columbia University in the City of New York
EPA Project Officer: Callan, Richard
Project Period: November 1, 2003 through October 31, 2008 (Extended to October 31, 2010)
Project Period Covered by this Report: November 1, 2005 through October 31,2006
RFA: Centers for Children's Environmental Health and Disease Prevention Research (2003) RFA Text | Recipients Lists
Research Category: Children's Health , Human Health
Objective:
All Columbia Center for Children’s Environmental Health (CCCEH) project data are housed in a Scientific Information Retrieval (SIR) database management system maintained by the Data Management, Statistics and Community Modeling (DSM) Core. The DSM Core is also the central resource for statistical consultation and data analysis related to the development of manuscripts. Finally, the DSM Core is a repository of community-level data and conducts multi-level research to examine the combined effects on child outcomes of individual and community-level risks.
Progress Summary:
All components of this comprehensive database, established in 1999, are now in place. In close consultation with Drs. Perera, Whyatt and Rauh and other members of the research team, Lori Hoepner, the DSM project coordinator, has greatly expanded the scope of the CEH database, which now contains a distinct record type (table) containing data for each of the scheduled interviews through the 8th year of life, including but not limited to: Prenatal Questionnaire (n=838), 48-hour Exposure (n=743), 3-Month Follow-up (n=592), 6-month Follow-up (n=588), 12-month Follow-up (n=538), 24-Month Follow-up (n=432), 36-Month Follow-up (n=379), 48-Month Follow-up (n=226), 60-Month Follow-up (n=279), 72-Month Follow-up (n=143), and 84-Month Follow-up (n=75). The 90-Month Follow-up and 96 Month Follow-up interviews are in the process of being programmed in the database and data entry began over the summer of 2006. In terms of environmental contaminants and biological specimen analysis, the following information is available in the central database: Dust/Allergen data from Monitoring (n=722), Biomarker-Mother (n=497), Biomarker-Infant (n=571), PAH & Pesticide Air Monitoring (n=654), Allergy Skin Testing (n=66), Medical Record Abstraction-Mother (n=625), Medical Record Abstraction-Infant (n=582), Plasma Pesticide Levels-Mother (n=446), Plasma Pesticide Levels-Infant (n=431), Plasma Pesticide Levels-36 Months (n=32), Plasma Pesticide Levels-60 Months (n=31). Over the past year, Ms. Hoepner has worked on developing methods to expedite uploading of the lab data that arrives in various formats files from multiple sources. A more concise reporting mechanism is also now in place to alert lab personnel of any issues that arise during the upload process, such as value range changes or missing data. Acquisition of developmental data has expanded to include assessments of the subjects as they approach school age and are available for: Denver at 6 months and Bayley at 12, 24 & 36 months (n=587), TONI-3 (n=485), HOME (n=380), CBCL 3-5 years (n=377), CBCL 6-18 years (n=65). The above numbers are underestimates--they reflect records actually computerized as of 6/7/2006 and do not include questionnaire and laboratory data that have been collected and are being edited in preparation for addition to the CEH database. Please see Appendix A for a table providing a comprehensive accounting of all data types through the 96-Month Follow-up which are available in the central database. In collaboration with Mark Becker, Director of the Geospatial Technology Section of the Center for International Earth Science Information Network (CIESIN) and Carlos Jusino of WE ACT, DSM Core members have developed novel approaches to mapping contaminants and outcomes using the geographic information system (GIS) software ESRI ArcGIS 8. Dr. Sarah McLafferty, a nationally recognized GIS expert with extensive experience in community level analysis in New York City, has also joined the DSM Core in a consulting capacity; several geographically-based analyses are being carried out under Dr. McLafferty’s supervision.
Statistics
During the previous year, the DSM Core statistician, Dr. Robin Garfinkel and Core head Dr. Howard Andrews, have provided input with respect to statistical design and have conducted data analyses for a number of initiatives that have resulted in presentations at scientific conferences and in papers published in peer-reviewed journals. Drs. Garfinkel and Andrews work closely with the DSM Core data manager, Lori Hoepner, in carrying out this work. Statistical system (SPSS or SAS) containing variables required for each initiative are generated from the database by Ms. Hoepner. Each step of the data analytic process, including extraction of data from the database, creation of derived variables, case selection, imputation and intermediate and final analyses, is carefully documented, and the syntax (programming) for each step is stored for future use. This ensures that any analysis conducted in the past can easily be replicated using current data, and that algorithms for derived variables used in a previous analysis are preserved for future use, ensuring continuity of methodology.
Meetings and Consultation
The DSM Core continues to hold a meeting every Monday at 10am. The meeting is open to any project investigator and provides consultation with respect to the development of data requests, statistical analysis, interpretation of findings, and manuscript review. The DSM Core works closely with the Principal Investigator and Executive Committee to uphold the Center’s publication policy, which includes guidelines for requesting data and formal review of methodological aspects of submitted papers.
Community Modeling
As stated earlier, during Year 4 the DSM Core undertook a joint initiative with the Center for International Earth Sciences Information Network (CIESIN) and West Harlem Environmental Action (WE ACT) to develop a comprehensive Geographic Information System for New York City, including the Columbia Center for Children’s Environmental Health’s (CCCEH) study area in Northern Manhattan and the South Bronx. The key collaborating personnel in this initiative are Dr. Mark Becker, and Carlos Jusino at WE ACT. In the current funding period, Dr. Sara McLafferty, an expert in the analysis of spatial data, has joined the GIS team. The scope of this collaboration includes the World Trade Center Pregnancy Study (funded by The New York Community Trust and the United Way of New York City; The New York Times 9/11 Neediest Fund; NIEHS grant #’s ES09089, 5P01 ES09600, 5RO1 ES08977; and EPA #R827027), and also involves the development of GIS systems for parallel studies that the Center is conducting in Poland and China. The GIS system integrates five areas so that spatial relationships between any of the domains can be juxtaposed visually and subjected to multi-level and raster density statistical analysis. The five domains are: 1) home and work addresses of study subjects (there may be multiple addresses for each subject); 2) plumes of selected toxins based on the integration of EPA and NYC air monitoring sites in NYC, and those established at the WTC in the wake of 9/11; 3) community-level birth outcome and child health data derived from the New York City Department of Health, which will include low birth weight, infant mortality and asthma hospitalization rates; 4) community-level social and demographic risk factors, including census-based indices of poverty and deteriorated housing, and crime-related indices derived from data sets available in the INFOSHARE community data system to which the DSM Core subscribes; and 5) point-sources of pollution, including bus depots and dry cleaners. The GIS team of the DSM Core played a major role in the development of the first publication to emerge from the World Trade Center Pregnancy study.
Supplemental Keywords:
RFA, Scientific Discipline, Health, INTERNATIONAL COOPERATION, ENVIRONMENTAL MANAGEMENT, HUMAN HEALTH, Health Risk Assessment, Genetics, Epidemiology, Biochemistry, Health Effects, Children's Health, Environmental Policy, Risk Assessment, asthma, prenatal exposure, environmental risks, genetic mechanisms, Human Health Risk Assessment, diesel exhaust, assessment of exposure, genetic risk factors, children's environmental health, exposure assessmentRelevant Websites:
http://www.mailman.hs.columbia.edu/ccceh/ Exit
Progress and Final Reports:
Original AbstractMain Center Abstract and Reports:
R832141 Water Environment and Reuse Foundation's National Center for Resource Recovery and Nutrient Management Subprojects under this Center: (EPA does not fund or establish subprojects; EPA awards and manages the overall grant for this center).
R832141C001 Growth and Development Research Project: Prenatal and Postnatal Urban Pollutants and Neurobehavioral Developmental Outcomes
R832141C002 Research Project on Asthma: Prenatal and Postnatal Urban Pollutants and Childhood Asthma
R832141C003 Mechanistic Research Project
R832141C004 Community-Based Intervention Project: Reduction of Exposure and Risk from Pesticides and Allergens
R832141C005 Community Translation and Application Core (COTAC)
R832141C006 Exposure Assessment Facility Core
R832141C007 Data Management, Statistics and Community Impact Modeling Core
R832141C008 Administrative Core
The perspectives, information and conclusions conveyed in research project abstracts, progress reports, final reports, journal abstracts and journal publications convey the viewpoints of the principal investigator and may not represent the views and policies of ORD and EPA. Conclusions drawn by the principal investigators have not been reviewed by the Agency.