Science Inventory

Informatics-Based Approaches for Managing and Curating Exposure Data

Citation:

Isaacs, K. AND K. Dionisio. Informatics-Based Approaches for Managing and Curating Exposure Data. CompTox Communities of Practice Webinar, Research Triangle Park, North Carolina, September 24, 2020. https://doi.org/10.23645/epacomptox.13058489

Impact/Purpose:

This slide deck is an introduction to a demo of ORD's Factotum/ChemExpoDB system for the CompTox Communities of Practice Webinar in September 2020

Description:

Exposure data, including chemical use and consumer product information, are required to inform chemical prioritization workflows and other assessments. As such the demand for transparent, curated, and high-quality exposure data is increasing. Under EPA’s ExpoCast project, new informatics-based methods and tools are being developed to facilitate collection, curation, and management of exposure-relevant information. We present here the ChemExpoDB/Factotum suite. ChemExpoDB is an integrated family of exposure databases linking data across multiple exposure domains. A web-based software application (called Factotum) has been developed to facilitate manual and automated management and annotation of data included in ChemExpoDB. Currently, ChemExpoDB includes over 500,000 primary source documents linked to >3.9 million chemical records, each containing product composition (consumer, industrial, and occupational products), functional use, or general chemical use information. Reported chemical identifiers were curated to unique chemical structures (DTXSIDs) using automated and manual techniques. The Factotum application allows for tracking of data extraction processes, documentation of randomized QA checks, and tracking of the original chemical records mapped to over 27,000 DTXSIDs. Machine learning-based natural language classifiers are being used to assign documents associated with consumer products to standard categories for linking with exposure models. In addition, elastic search algorithms have been implemented for rapidly identifying documents relevant to evaluation of individual chemicals. The ChemExpoDB/Factotum framework is being expanded to incorporate additional exposure data domains, including multimedia monitoring measurements and other exposure factor data (e.g., consumer product use patterns). Further, much of the data is now available through the EPA’s CompTox Chemicals Dashboard (https://comptox.epa.gov/dashboard). These tools can increase the volume, scope, and quality of chemical information available for use in Agency decision-making.

Record Details:

Record Type:DOCUMENT( PRESENTATION/ SLIDE)
Product Published Date:09/24/2020
Record Last Revised:10/06/2020
OMB Category:Other
Record ID: 349826