Biological representation of chemicals using latent target interaction profile

Abstract Background Computational prediction of a phenotypic response upon the chemical perturbation on a biological system plays an important role in drug discovery, and many other applications. Chemical fingerprints are a widely used feature to build machine learning models. However, the fingerprints that are derived from chemical structures ignore the biological context, thus, they suffer from several problems such as the activity cliff and curse of dimensionality. Fundamentally, the chemical modulation of biological activities is a multi-scale process. It is the genome-wide chemical-target interactions that modulate chemical phenotypic responses. Thus, the genome-scale chemical-target interaction profile will more directly correlate with in vitro and in vivo activities than the chemical structure. Nevertheless, the scope of direct application of the chemical-target interaction profile is limited due to the severe incompleteness, biasness, and noisiness of bioassay data. Results To address the aforementioned problems, we developed a novel chemical representation method: Latent Target Interaction Profile (LTIP). LTIP embeds chemicals into a low dimensional continuous latent space that represents genome-scale chemical-target interactions. Subsequently LTIP can be used as a feature to build machine learning models. Using the drug sensitivity of cancer cell lines as a benchmark, we have shown that the LTIP robustly outperforms chemical fingerprints regardless of machine learning algorithms. Moreover, the LTIP is complementary with the chemical fingerprints. It is possible for us to combine LTIP with other fingerprints to further improve the performance of bioactivity prediction. Conclusions Our results demonstrate the potential of LTIP in particular and multi-scale modeling in general in predictive modeling of chemical modulation of biological activities.

Tags
Data and Resources
To access the resources you must log in

This item has no data

Identity

Description: The Identity category includes attributes that support the identification of the resource.

Field Value
PID https://www.doi.org/10.6084/m9.figshare.c.4793535.v1
PID https://www.doi.org/10.6084/m9.figshare.c.4793535
URL http://dx.doi.org/10.6084/m9.figshare.c.4793535.v1
URL http://dx.doi.org/10.6084/m9.figshare.c.4793535
Access Modality

Description: The Access Modality category includes attributes that report the modality of exploitation of the resource.

Field Value
Access Right not available
Attribution

Description: Authorships and contributors

Field Value
Author Ayed, Mohamed
Author Hansaim Lim
Author Xie, Lei
Publishing

Description: Attributes about the publishing venue (e.g. journal) and deposit location (e.g. repository)

Field Value
Collected From Datacite
Hosted By figshare
Publication Date 2019-01-01
Publisher figshare
Additional Info
Field Value
Language UNKNOWN
Resource Type Collection
keyword FOS: Chemical sciences
keyword FOS: Biological sciences
keyword FOS: Computer and information sciences
system:type other
Management Info
Field Value
Source https://science-innovation-policy.openaire.eu/search/other?orpId=dedup_wf_001::bbd884b72e9b47fbd3bb8a9b189c2af1
Author jsonws_user
Last Updated 19 December 2020, 23:14 (CET)
Created 19 December 2020, 23:14 (CET)