Science Inventory

Integration of Markush Structures into EPA’s DSSTox Database to Represent and Enumerate UVCB Substances

Citation:

Grulke, Chris, A. Williams, AND A. Richard. Integration of Markush Structures into EPA’s DSSTox Database to Represent and Enumerate UVCB Substances. Presented at American Chemical Society Spring meeting, New Orleans, LA, March 18 - 22, 2018. https://doi.org/10.23645/epacomptox.6986933

Impact/Purpose:

This abstract for a presentation at the 2018 ACS Spring meeting stresses the importance of dealing with UVCB chemicals in the structural domain using Markush representations in order to accurately gather data linked to associated structures/substances.

Description:

Many chemicals of interest in the domain of environmental chemistry fall in the class of substances referred to as Unknown, Variable composition, Complex reaction product or Biological origin (UVCB) substances. These chemicals generally cannot be represented with a single structure and, therefore, pose a significant challenge when trying to accurately characterize and share research results via chemical databases that rely primarily on distinct chemical structures to index and merge substance information. For this reason, it is not uncommon for public chemical databases to include questionably mapped structures associated with names or registry numbers that indicate a UVCB. To provide a more accurate documentation of the substance and clarify these data linkages, EPA’s DSSTox project has started employing Markush representations to represent UVCBs. Whereas the use of a Markush representation is more accurate than a single “representative” structure, it yields a new set of problems, including difficulty with determining uniqueness, storage format limitations, and inconsistencies in different software’s handling of the representations. It has, however, enabled the automated linking of a UVCB with the associated, well-defined chemical components which can be auto-enumerated within the software package employed. It also provides an interpretable depiction of the substance, and clarifies the linkage of research results to the appropriate chemistry. Additionally, it greatly increases the efficiency of collecting information associated with all components of a UVCB from within EPA databases, providing the ability to quickly identify and gather the compendium of data associated with either the UVCB or any of its structural components. This abstract does not necessarily represent U.S. EPA policy.

Record Details:

Record Type:DOCUMENT( PRESENTATION/ SLIDE)
Product Published Date:03/22/2018
Record Last Revised:08/23/2018
OMB Category:Other
Record ID: 342012