Science Inventory

Finding small molecules in big data

Citation:

Schymanski, E. AND A. Williams. Finding small molecules in big data. Presented at Analytica Conference, Munich, GERMANY, April 10 - 12, 2018.

Impact/Purpose:

Presentation at the Analytica Conference, April 2018.

Description:

Metabolomics and exposomics are among the youngest and most dynamic of the omics disciplines. While the molecules involved are smaller than those studied in proteomics and the other, larger “omics”, the challenges are in many ways greater. Elements are less constrained, there are no given “puzzle pieces”, and the result is an explosion of potential chemical space. It is impossible even to enumerate all chemically possible small molecules. The challenges and complexity of identifying small molecules, even with the most advanced analytical technologies available today, are immense. Current “big data” methods for small molecules rely heavily on chemical databases, the largest of which currently contain ~100 million chemicals. Despite this large number, high resolution mass spectrometry (HR-MS) measurements contain tens of thousands of features, of which only a few percent can be annotated as “known” and confirmed as metabolites or chemicals of interest using these chemical databases. How can we find relevant small molecules in ever-increasing data loads? How can we annotate more of the unknown features in HR-MS experiments? This talk will present European, US and worldwide initiatives to help find small molecules in big data in smarter ways, from chemical databases to spectral libraries and from real-time monitoring to retrospective screening. It will touch on the challenges of standardized structure representations, data curation and deposition. Finally, it will show how interdisciplinary communication, data sharing and pushing the boundaries of current capabilities can facilitate research efforts in metabolomics, exposomics and beyond. This abstract does not necessarily represent U.S. EPA policy.

URLs/Downloads:

FINDING SMALL MOLECULES IN BIG DATA_ABSTRACT_V3.PDF   (PDF, 44.756 KB)

SMALLMOLBIGDATA_ANALYTICAMUNICH_V1.PDF   (PDF, 6599.052 KB)

Record Details:

Record Type: DOCUMENT (PRESENTATION/SLIDE)
Product Published Date: 04/12/2018
Record Last Revised: 07/09/2018
OMB Category: Other
Record ID: 341010