Does drug-target have a likeness?

H. Xu; Y. Fang; L. Yao; Y. Chen; Xin Chen

doi:10.1160/ME0425

Does drug-target have a likeness?

H. Xu, Y. Fang, L. Yao, Y. Chen, Xin Chen

Quantitative Health Sciences

Research output: Contribution to journal › Article › peer-review

16 Scopus citations

Abstract

Objective: The discovery of new targets that are sufficiently robust to yield marketable therapeutics is an enormous challenge. Conventional target identification approaches are disease-dependent, which require heavy experimental workload and comprehensive domain knowledge. In this work, we propose that a disease-independent property of proteins, "drug-target likeness", can be explored to facilitate the genomic scale target screening in the post-genomic age. Methods: A Support Vector Machine (SVM) classifier was trained to recognize target and non-target protein sequences compiled from the Therapeutic Target Database, DrugBank, and PFam. Protein sequences are encoded by their composition, transition and distribution features of residues and Gaussian kernel function was used in SVM classification. Results: SM with a fine-tuned kernel width records 66.4 ± 5.1% of sensitivity and 97.2 ± 0.6% of specificity, corresponding to an overall target prediction accuracyof 94.4 ± 0.8%. Conclusions: Though primitive, these results suggest that, similar to the "drug likeness" for small chemicals, their binding partners, drug targets, also display shared features which are reflected in their sequences and can be captured by statistical learning approaches. Further research on how to accurately and interpretably measure the likeness of protein being a drug target is promising. Inspired by the progress of "drug likeness" studies, advances in protein descriptors, statistical learning algorithms and more comprehensive and accurate gold-standard data set from disease biology research may help to further define the "drug-target likeness" property of proteins.

Original language	English (US)
Pages (from-to)	360-366
Number of pages	7
Journal	Methods of Information in Medicine
Volume	46
Issue number	3
DOIs	https://doi.org/10.1160/ME0425
State	Published - 2007

Keywords

Drug target
Human genome
Statistical learning
Support vector machine

ASJC Scopus subject areas

Health Informatics
Advanced and Specialized Nursing
Health Information Management

Access to Document

10.1160/ME0425

Cite this

@article{d50047b4e9d84735a73d6d12d2b60ba1,

title = "Does drug-target have a likeness?",

abstract = "Objective: The discovery of new targets that are sufficiently robust to yield marketable therapeutics is an enormous challenge. Conventional target identification approaches are disease-dependent, which require heavy experimental workload and comprehensive domain knowledge. In this work, we propose that a disease-independent property of proteins, {"}drug-target likeness{"}, can be explored to facilitate the genomic scale target screening in the post-genomic age. Methods: A Support Vector Machine (SVM) classifier was trained to recognize target and non-target protein sequences compiled from the Therapeutic Target Database, DrugBank, and PFam. Protein sequences are encoded by their composition, transition and distribution features of residues and Gaussian kernel function was used in SVM classification. Results: SM with a fine-tuned kernel width records 66.4 ± 5.1% of sensitivity and 97.2 ± 0.6% of specificity, corresponding to an overall target prediction accuracyof 94.4 ± 0.8%. Conclusions: Though primitive, these results suggest that, similar to the {"}drug likeness{"} for small chemicals, their binding partners, drug targets, also display shared features which are reflected in their sequences and can be captured by statistical learning approaches. Further research on how to accurately and interpretably measure the likeness of protein being a drug target is promising. Inspired by the progress of {"}drug likeness{"} studies, advances in protein descriptors, statistical learning algorithms and more comprehensive and accurate gold-standard data set from disease biology research may help to further define the {"}drug-target likeness{"} property of proteins.",

keywords = "Drug target, Human genome, Statistical learning, Support vector machine",

author = "H. Xu and Y. Fang and L. Yao and Y. Chen and Xin Chen",

year = "2007",

doi = "10.1160/ME0425",

language = "English (US)",

volume = "46",

pages = "360--366",

journal = "Methods of Information in Medicine",

issn = "0026-1270",