Abstract
The progress of high-throughput screening (HTS) techniques is changing the chemical data landscape by producing massive biological data from tested compounds. Public data repositories (e.g., PubChem) receive HTS data provided by various institutes and this data pool is being updated on a daily basis. The goal of these data sharing efforts is to let users quickly obtain the biological data of target compounds. Without a universal chemical identifier, the repositories (e.g., PubChem) provide users various methods to query and retrieve chemical properties and biological data by several different chemical identifiers (e.g., SMILES, InChIKey, and IUPAC name). The major challenge for most users, especially computational modelers, is obtaining the biological data for a large dataset of compounds (e.g., thousands of drug molecules) instead of a single compound. This chapter aims to introduce the steps to access the public data repositories for target compounds with specific emphasis on the automatic data downloading for large datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kim S, Thiessen PA, Bolton EE, Chen J, Fu G, Gindulyte A, Han L, He J, He S, Shoemaker BA, et al. (2015) PubChem Substance and Compound databases. Nucleic Acids Res 44:D1202–D1213.
Weininger D (1988) SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. J Chem Inf Comput Sci 28:31–36
Weininger D, Weininger A, Weininger JL (1989) SMILES. 2. Algorithm for generation of unique SMILES notation. J Chem Inf Comput Sci 29:97–101
Weininger D (1990) SMILES. 3. DEPICT. Graphical depiction of chemical structures. J Chem Inf Comput Sci 30:237–243
Heller S, McNaught A, Stein S, Tchekhovskoi D, Pletnev I (2013) InChI - the worldwide chemical structure identifier standard. J Cheminformatics 5:7
Kim S, Thiessen PA, Bolton EE, Bryant SH (2015) PUG-SOAP and PUG-REST: web services for programmatic access to chemical information in PubChem. Nucleic Acids Res 43:W605–W611
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media New York
About this protocol
Cite this protocol
Russo, D.P., Zhu, H. (2016). Accessing the High-Throughput Screening Data Landscape. In: Zhu, H., Xia, M. (eds) High-Throughput Screening Assays in Toxicology. Methods in Molecular Biology, vol 1473. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-6346-1_16
Download citation
DOI: https://doi.org/10.1007/978-1-4939-6346-1_16
Published:
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-6344-7
Online ISBN: 978-1-4939-6346-1
eBook Packages: Springer Protocols