Science Inventory

Enhancing the utility of the ECOTOX Knowledgebase via ontology-based annotations.

Citation:

Fay, K., C. Elonen, D. Hoff, J. Olker, M. Skopinski, A. Pilli, AND C. LaLone. Enhancing the utility of the ECOTOX Knowledgebase via ontology-based annotations. OpenTox USA 2018, Raleigh, NC, July 11 - 12, 2018.

Impact/Purpose:

The EPA’s ECOTOXicology knowledgebase (ECOTOX) is a database which contains single chemical toxicity effects in more than 12,000 ecological species. The objective of this work is to enhance the interoperability of ECOTOX with several other EPA tools, including: the Computational Toxicology (CompTox) dashboard, Adverse Outcome Pathway Wiki (AOP wiki), and Sequence Alignment to Predict Across Species Susceptibility (SeqAPASS). The AOP discovery and development task on taxonomic applicability recognizes the need to leverage existing data resources, including the in vivo effects data contained in ECOTOX, and the potential advantages of computational approaches to predict chemical hazard and species susceptibility. In addition to mapping species and chemical identifiers contained in the ECOTOX database to validated, unique keys used in other data systems, this work aims to lay the groundwork for computational modeling of phenotype effects, species sensitivity and putative adverse outcome pathway by mapping ECOTOX codes to ontology classes.

Description:

The US Environmental Protection Agency’s Ecotoxicology (ECOTOX) Knowledgebase contains more than 30 years of reported single chemical toxicity effects data on aquatic and terrestrial organisms. Approximately 900,000 test results covering more than 11,000 chemicals and 12,000 species are available in ECOTOX. While the database is currently used by many sectors for a variety of purposes, a future goal is to allow for computational modeling of the data to identify novel adverse outcome pathways and networks, and assist in predicting chemical hazard and species sensitivity. One obstacle is that ECOTOX captures study information using author-reported descriptions, resulting in more than 4000 codes. Relationships among these codes are often not apparent in the current design (e.g., unique codes exist for both aryl hydrocarbon hydrolase and cytochrome P450 1A), and some codes are uniquely specific to the study of its derivation (e.g., 3rd generation male). To enhance the query capability of the data within and external to the ECOTOX knowledgebase, and to prepare for future computational functionality, the ECOTOX codes were mapped to existing biological ontology classes. To facilitate this mapping, a Java-based Lookup tool was developed using the ontology browser BioPortal (https://bioportal.bioontology.org/) REST API. This tool was designed to allow for batch processing and to make use of BioPortal’s Annotator and Recommender features. A training set composed of every code in the knowledgebase applicable to > 0.5% (2,368) of ECOTOX references (as of March, 2018) was used initially with the BioPortal Lookup tool. The majority (58%) of these codes were successfully mapped, with a higher rate of success for the effect measurement codes. Manual review of the mappings indicated that a proportion of the unmapped codes could be described using multiple ontology identifiers in combination, while some codes mapped using the BioPortal Lookup tool resulted in ontology classes with improper context.

Record Details:

Record Type:DOCUMENT( PRESENTATION/ POSTER)
Product Published Date:07/12/2018
Record Last Revised:07/11/2018
OMB Category:Other
Record ID: 341597