Science Inventory

A Systematic Evaluation of Analogs and Automated Read-across Prediction of Estrogenicity (QSAR2016)

Citation:

Pradeep, P., K. Mansouri, G. Patlewicz, AND R. Judson. A Systematic Evaluation of Analogs and Automated Read-across Prediction of Estrogenicity (QSAR2016). Presented at QSAR 2016, Miami, FL, June 13 - 17, 2016. https://doi.org/10.23645/epacomptox.5189395

Impact/Purpose:

slide presentation at QSAR2016 meeting on Systematic Evaluation of Analogs and Automated Read-across Prediction of Estrogenicity

Description:

Read-across is a data gap filling technique widely used within category and analog approaches to predict a biological property for a data-poor (target) chemical using known information from similar (source analog) chemical(s). Potential source analogs are typically identified based on structural similarity. Although much guidance has been published for read-across, practical principles for the identification and evaluation of the scientific validity of source analogs remains lacking. This case study explores how well 3 structure descriptor sets (Pubchem, Chemotyper and MoSS) are able to identify analogs for read-across and predict Estrogen Receptor (ER) binding activity for a specific class of chemicals: hindered phenols. For each target chemical, analogs were selected using each descriptor set with two cut-offs: (1) Minimum Tanimoto similarity (range 0.1 - 0.9), and (2) Closest N analogs (range 1 - 10). Each target-analog pair was then evaluated for its agreement with measured ER binding and agonism. The analogs were subsequently filtered using: (1) physchem properties (LogKow & Molecular Volume), and (2) number of literature sources as a marker for the quality of the experimental data. A majority vote prediction was made for each target phenol by reading-across from the closest N analogs. The data set comprised 462 hindered phenols and 257 non-hindered phenols. The results demonstrate that: (1) The concordance in ER activity rises with increasing similarity, (2) none of the 3 descriptor sets are clearly superior to the others for analog identification for ER read-across, (3) selecting hindered versus non-hindered phenols as analogs does not significantly improve concordance in ER activity, and (4) filtering analogs using physchem properties improves overall concordance. The read-across predictions reveal that: (1) PubChem and Chemotyper descriptors are superior to MoSS for ER activity, and (2) filtering of analogs significantly increases the prediction accuracy. As an example, the prediction accuracy using 3 closest analogs from PubChem is 70%. This increases to 74% when filtering by physchem properties, and 89% when data quality is accounted for. This case study demonstrates how biologically-relevant chemical descriptors can be used to identify valid analogs for read-across. This abstract does not necessarily reflect U.S. EPA policy.

Record Details:

Record Type:DOCUMENT( PRESENTATION/ POSTER)
Product Published Date:06/17/2016
Record Last Revised:07/19/2017
OMB Category:Other
Record ID: 336976