Science Inventory

Prediction of Skin Sensitization Potency Using Machine Learning Approaches

Citation:

Strickland, J., Q. Zang, M. Paris, D. Lehmann, S. Bell, D. Allen, A. Jacobs, W. Casey, AND N. Kleinstreuer. Prediction of Skin Sensitization Potency Using Machine Learning Approaches. Society of Toxicology, RTP, NC, March 13 - 17, 2017.

Impact/Purpose:

Replacing animal tests currently used for regulatory hazard classification of skin sensitizers is one of ICCVAM’s top priorities. Accordingly, U.S. federal agency scientists are developing and evaluating computational approaches to classify substances as sensitizers or nonsensitizers. Some regulatory agencies require that sensitizers be further classified into potency categories. We built machine learning models to classify substances using GHS categories as strong (1A) sensitizers, weak (1B) sensitizers, or nonsensitizers for both local lymph node assay (LLNA) and human outcomes.

Description:

Replacing animal tests currently used for regulatory hazard classification of skin sensitizers is one of ICCVAM’s top priorities. Accordingly, U.S. federal agency scientists are developing and evaluating computational approaches to classify substances as sensitizers or nonsensitizers. Some regulatory agencies require that sensitizers be further classified into potency categories. We built machine learning models to classify substances using GHS categories as strong (1A) sensitizers, weak (1B) sensitizers, or nonsensitizers for both local lymph node assay (LLNA) and human outcomes. The models used data from three in chemico or in vitro assays (direct peptide reactivity assay, KeratinoSens™, and human cell line activation test) and six physicochemical properties. To determine the optimal model, four machine learning approaches were used: classification and regression tree, linear discriminant analysis, logistic regression, and support vector machine (SVM). Two different strategies were used for modeling: a one-tiered strategy modeled all three potency categories in a single step, while a two-tiered strategy first delineated sensitizer vs. nonsensitizer responses and then strong and weak sensitizers. Models were developed on a training set (94 substances for LLNA; 63 for human) and evaluated using an external test set (26 substances for LLNA; 24 for human). Leave-one-out cross validation (LOOCV) was used to evaluate the models for overfitting. The two-tiered models performed better than the one-tiered models, and SVM outperformed the other machine learning approaches. The SVM models using all input variables performed better than those using subsets of the input variables. The two-tiered model using SVM and all input variables provided the best performance: LOOCV accuracy was 88% for LLNA (120 substances) and 81% for human (87 substances) outcomes. This compares to an accuracy of 69% for LLNA prediction of human potency categories (87 substances). These results suggest that computational approaches may be useful for assessing skin sensitization potency. This project was funded in whole or in part with Federal funds from the NIEHS, NIH under Contract No. HHSN273201500010C. This abstract does not represent U.S. EPA policy.

Record Details:

Record Type:DOCUMENT( PRESENTATION/ POSTER)
Product Published Date:03/14/2017
Record Last Revised:10/17/2017
OMB Category:Other
Record ID: 337901