You are here:
Implementation of a Flexible Tool for Automated Literature-Mining and Knowledgebase Development (DevToxMine)
Citation:
KNUDSEN, T. B. AND A. V. SINGH. Implementation of a Flexible Tool for Automated Literature-Mining and Knowledgebase Development (DevToxMine). Presented at 2009 Annual Teratology Society Meeting, Rio Grande, PUERTO RICO, June 27 - July 01, 2009.
Impact/Purpose:
This flexible text-mining tool (DevToxMine™), combined with ontology for embryogenesis, is being used to build a knowledgebase for EPA’s Virtual Embryo project.
Description:
Deriving novel relationships from the scientific literature is an important adjunct to datamining activities for complex datasets in genomics and high-throughput screening activities. Automated text-mining algorithms can be used to extract relevant content from the literature and build a thesaurus to convert word relations into concepts. Conceptmining has become an essential knowledge discovery tool to address causal links,
associations, relationships, and patterns among vast collections of nformation. EPA’s ToxRefDB database has been built from source data derived from 30-years worth of in vivo animal toxicity studies, mostly rat and rabbit studies. This database includes 751 prenatal developmental toxicity studies on 387 chemicals. For example, large-scale profiling of environmental chemicals for developmental effects with ToxRefDB revealed a species dimorphism of renal-ureteric defects expressed in the rat over rabbit, and a strong correlation between fetal weight reduction and defects of the axial skeleton. In this study, we applied custom text-mining tools to extract the underlying concepts from PubMed. Automated queries were built as
URLs/Downloads:
Implementation of a Flexible Tool for Automated Literature-Mining and Knowledgebase Development (DevToxMine) (PDF, NA pp, 8 KB, about PDF)Implementation of a Flexible Tool for Automated Literature-Mining and Knowledgebase Development (DevToxMine) (poster) (PDF, NA pp, 147 KB, about PDF)