Computational Resource for Drug Discovery

Computational Resources for Drug Discovery (CRDD) is an important module of the in silico module of Open Source for Drug Discovery (OSDD).^[1] The CRDD web portal provides computer resources related to drug discovery on a single platform. It caters to researchers researching computer-aided drug design, provides computational resources, hosts a discussion forum, and has resources related to drug discovery, predicting inhibitors, and predicting the ADME-Tox properties of molecules. One of the major objectives of CRDD is to promote open source software in the field of cheminformatics and pharmacoinformatics.

Features[edit]

Under CRDD, numerous resources related to computer-aided drug design have been collected and compiled. These resources are organized and presented on CRDD so users may locate resources from a single source.

Target identification provides resources important for searching drug targets with information on genome annotation, proteome annotation, potential targets, and protein structure.
Virtual screening compiles resources important for virtual screening as QSAR techniques, docking QSAR, cheminformatics, and siRNA/miRNA.
Drug design provides resources important for designing drug inhibitors/molecules, such as lead optimization, pharmacoinformatic, ADMET, and clinical informatics.

Community contribution[edit]

CRDD developed a platform where the community may contribute to the process of drug discovery.

DrugPedia is a wiki created for collecting and compiling information related to computer-aided drug design. It is developed under the umbrella of the OSDD project and covers a wide range of subjects around drugs like bioinformatics, cheminformatics, clinical informatics etc.
Indipedia: A wiki for collecting and compiling drug information related to India. It is intended is to provide comprehensive information about India created for Indians by Indians. It is developed under the umbrella of the OSDD project.
The CRDD Forum was launched to discuss the challenges in developing computational resources for drug discovery.

Indigenous development: software and web services[edit]

Beside collecting and compiling resources, CRDD members develop new software and web services. All services developed are free for academic use. The following are a few major tools developed at CRDD.^{[citation needed]}

Development of databases[edit]

HMRBase: A manually curated database of hormones and their receptors. It is a compilation of sequence data after extensive manual literature search and from publicly available databases. HMRBase can be searched on the basis of a variety of data types. Owing to the high impact of endocrine research in the biomedical sciences, HMRBase could become a leading data portal for researchers.^{[citation needed]} The salient features of HMRBase are hormone-receptor pair-related information, mapping of peptide stretches on the protein sequences of hormones and receptors, Pfam domain annotations, categorical browsing options, and online data submission.^[2] This database is integrated with DrugPedia so the public can contribute.
BIAdb: A database for Benzylisoquinoline Alkaloids. The Benzylisoquinoline Alkaloid Database serves to gather information related to the BIA's. Many BIA's show therapeutic properties and can be considered as potent drug candidates. This database will also serve researchers working in the field of synthetic biology, as developing medicinally important alkaloids using synthetic process is one of the important challenges. This database is also integrated with DrugPedia so the public can contribute.^[3]
Antigen DB: This database contain more than 500 antigens collected from literature and other immunological resources. These antigens come from 44 pathogenic species. In Antigen DB, a database entry contains information regarding the sequence, structure, origin, etc. of an antigen with additional information such as B and T-cell epitopes, MHC binding, function, gene-expression and post translational modifications, when available. AntigenDB also provides links to major internal and external databases.^[4]
PolysacDB: A databse dedicated to provide comprehensive information about antigenic polysaccharides of microbial origin (bacterial and fungal), antibodies against them, proposed epitopes, structural detail, proposed functions, assay system, cross-reactivity related information and more. It is a manually curated database where most of data has been collected from PubMed and PubMed Central literature databases.^[5]
TumorHoPe: TumorHoPe is a manually curated comprehensive database of experimentally characterized tumor homing peptides. These peptides recognize tumor tissues and tumor associated micro environments, including tumor metastasis.^[6]
ccPDB: A database designed to service researchers working in the field of function or structure annotation of proteins. This database of datasets is based on Protein Data Bank (PDB).^[7]
OSDDchem: This chemical database is an open repository of information on synthesized, semi-synthesized, natural, and virtually designed molecules from the OSDD community.^[8]
CancerDR: A database of 148 anticancer drugs and their effectiveness against around 1000 cancer cell lines. CancerDR maintains comprehensive information about these drugs, their target gene/protein, and cell lines.^[9]

Software developed[edit]

MycoTB: An extended flexible system concept for building standalone Windows software. The software allows users to build their own flexible systems on their personal computers to manage and annotate whole proteomes of MycoTB.^[10]

Resources created[edit]

CRAG: Computational resources for assembling genomes (CRAG) was created to assist users in assembling of genomes from short read sequencing (SRS). CRAG pursues the following major objectives:
- Collection and compilation of computation resources
- Brief description of genome assemblers
- Maintaining SRS and related data
- Service to community to assemble their genomes
CRIP: Computational resources for predicting protein–macromolecular interactions (CRIP) was developed to provide resources related to interaction. This site maintains a large number of resources on the interaction of proteins that includes protein–protein, protein–DNA, protein–ligand, protein–RNA.
BioTherapy: Bioinformatics for Therapeutic Peptides and Proteins (BioTherapi) was developed for researchers working in the field of protein/peptide therapeutics. The platform was created to provide a single platform for this area of research. This site includes relevant information about the use of peptides/proteins in drugs and synthesis of new peptides. It also covers problems in their formulation, synthesis and delivery processes.
HIVbio: HIV Bioinformatics (HIVbio) site contains various types of information on Human Immunodeficiency Virus (HIV) life cycle and Infection.
GDPbio: GDPbio (Genome based prediction of Diseases and Personal medicines using Bioinformatics) is a project focused on providing various resources related to genome analysis, particularly for the prediction of disease susceptibility of individuals and personalized medicine development, with the aim of public health improvement.
AminoFAST: Functional Annotation Tools for Amino Acids (AminoFAST) is a server designed to serve the bioinformatics community. Its aim is to develop as many tools as possible to understand the function of amino acids in proteins based on protein structure in PDB. The broad knowledge of protein function would help in the identification of novel drug targets.

Web services for cheminformatics[edit]

CRDD developed an open source platform which allows users to predict inhibitors against novel M. Tuberculosis drug targets and other important properties of drug molecules like ADMET. Following are list of few servers.

MetaPred: A webserver for the prediction of cytochrome P450 isoforms responsible for metabolizing a drug molecule. The MetaPred server predicts metabolizing CYP isoforms of a drug molecule/substrate based on SVM models developed using CDK descriptors.^[jargon] This server is intended to help researchers working in the field of drug discovery. The effort also demonstrates that it is possible to develop free web servers in the field of cheminformatics. This may encourage other researchers to develop web servers for public use, leading to decreased cost of discovering new drug molecules.^[11]
ToxiPred: A server for prediction of aqueous toxicity of small chemical molecules in T. pyriformis.
KetoDrug: A user friendly web server for binding affinity prediction of ketoxazole derivatives and small chemical molecules against Fatty Acid Amide Hydrolase (FAAH).
KiDoQ: A web server to serve researchers working in the field of designing inhibitors against dihydrodipicolinate synthase (DHDPS), a potential drug target enzyme of a unique bacterial DAP/Lysine pathway.^[12]
GDoQ: GDoQ (Prediction of GLMU inhibitors using QSAR and AutoDock) is an open source platform for predicting inhibitors against Mycobacterium tuberculosis (M.Tb) drug target N-acetylglucosamine-1-phosphate uridyltransferase (GLMU) protein. This is a potential drug target involved in bacterial cell wall synthesis. This server uses molecular docking and QSAR strategies to predict inhibitory activity value (IC50) of chemical compounds for GLMU protein.^[13]
ROCR: The ROCR is an R package for evaluating and visualizing classifier performance. It is a flexible tool for creating ROC graphs, sensitivity/specificity curves, area under curve and precision/recall curve. The parametrization can be visualized by coloring the curve according to cutoff.
WebCDK: A web interface for the CDK library which is used for predicting descriptors of chemicals.
Pharmacokinetics: This data analysis determines the relationship between the dosing regimen and the body's exposure to the drug as measured by the drug's nonlinear concentration time curve. It includes a function to calculate area under this curve. It also includes functions for half-life estimation for a biexponential model, and a two phase linear regression.

Prediction and analysis of drug targets[edit]

RNApred: Prediction of RNA binding proteins from its amino acid sequence.^[14]
ProPrint: Prediction of interaction between proteins from their amino acid sequence.^[15]
DomPrint: A domain-domain interaction (DDI) prediction server.
MycoPrint: A web interface for exploration of the interactome of Mycobacterium tuberculosis H37Rv (Mtb) predicted by the "Domain Interaction Mapping" (DIM) method.
ATPint: A server for predicting ATP interacting residues in proteins.^[16]
FADpred: Identification of FAD interacting residues in proteins.^[17]
GTPbinder: Prediction of protein GTP interacting residues.^[18]
NADbinder: Prediction of NAD binding residues in proteins.^[19]
PreMier: Software for predicting mannose interacting residues in proteins.^[20]
DMAP: Designing of mutants of antibacterial peptides.
icaars: Prediction and classification of aminoacyl tRNA synthetases using PROSITE domains. ^[21]
CBtope: Prediction of conformational B-cell epitope in a sequence from its amino acid sequence.^[22]
DesiRM: Designing of Complementary and Mismatch siRNAs for silencing a gene.^[23]
GenomeABC: A server for benchmarking of genome assemblers.

References[edit]

^ "Computational Resources for Drug Discovery". Computational Resources for Drug Discovery homepage.
^ Rashid M, Singla D, Sharma A, Kumar M, Raghava GP (July 2009). "Hmrbase: a database of hormones and their receptors". BMC Genomics. 10: 307. doi:10.1186/1471-2164-10-307. PMC 2720991. PMID 19589147.
^ Singla D, Sharma A, Kaur J, Panwar B, Raghava GP (March 2010). "BIAdb: a curated database of benzylisoquinoline alkaloids". BMC Pharmacology. 10: 4. doi:10.1186/1471-2210-10-4. PMC 2844369. PMID 20205728.
^ Ansari HR, Flower DR, Raghava GP (January 2010). "AntigenDB: an immunoinformatics database of pathogen antigens". Nucleic Acids Research. 38 (Database issue): D847–D853. doi:10.1093/nar/gkp830. PMC 2808902. PMID 19820110.
^ Aithal A, Sharma A, Joshi S, Raghava GP, Varshney GC (2012-04-11). Kaufmann GF (ed.). "PolysacDB: a database of microbial polysaccharide antigens and their antibodies". PLOS ONE. 7 (4): e34613. Bibcode:2012PLoSO...734613A. doi:10.1371/journal.pone.0034613. PMC 3324500. PMID 22509333.
^ Kapoor P, Singh H, Gautam A, Chaudhary K, Kumar R, Raghava GP (2012-04-16). Xue B (ed.). "TumorHoPe: a database of tumor homing peptides". PLOS ONE. 7 (4): e35187. Bibcode:2012PLoSO...735187K. doi:10.1371/journal.pone.0035187. PMC 3327652. PMID 22523575.
^ Nucleic Acids Research, 2011
^ "Open Source Drug Discovery". www.osdd.net. Retrieved 2023-09-08.
^ Kumar R, Chaudhary K, Gupta S, Singh H, Kumar S, Gautam A, et al. (2013-03-13). "CancerDR: cancer drug resistance database". Scientific Reports. 3 (1): 1445. Bibcode:2013NatSR...3E1445K. doi:10.1038/srep01445. PMC 3595698. PMID 23486013.
^ Raghava, G.P.S. "MycoTB: A Software for managing Mycobacterium Tuberculosis". crdd.osdd.net. Retrieved 2024-03-04.
^ Mishra NK, Agarwal S, Raghava GP (July 2010). "Prediction of cytochrome P450 isoform responsible for metabolizing a drug molecule". BMC Pharmacology. 10: 8. doi:10.1186/1471-2210-10-8. PMC 2912882. PMID 20637097.
^ Garg A, Tewari R, Raghava GP (March 2010). "KiDoQ: using docking based energy scores to develop ligand based model for predicting antibacterials". BMC Bioinformatics. 11: 125. doi:10.1186/1471-2105-11-125. PMC 2841597. PMID 20222969.
^ Singla D, Anurag M, Dash D, Raghava GP (July 2011). "A web server for predicting inhibitors against bacterial target GlmU protein". BMC Pharmacology. 11: 5. doi:10.1186/1471-2210-11-5. PMC 3146400. PMID 21733180.
^ Kumar M, Gromiha MM, Raghava GP (2010). "SVM based prediction of RNA-binding proteins using binding residues and evolutionary information". Journal of Molecular Recognition. 24 (2): 303–313. doi:10.1002/jmr.1061. PMID 20677174. S2CID 12677753.
^ Rashid, M. and Raghava, G. P. S. (2010) A simple approach for predicting protein–protein interactions. Current Protein & Peptide Science (In Press).
^ Chauhan JS, Mishra NK, Raghava GP (December 2009). "Identification of ATP binding residues of a protein from its primary sequence". BMC Bioinformatics. 10: 434. doi:10.1186/1471-2105-10-434. PMC 2803200. PMID 20021687.
^ Mishra NK, Raghava GP (January 2010). "Prediction of FAD interacting residues in a protein from its primary sequence using evolutionary information". BMC Bioinformatics. 11 (Suppl 1): S48. doi:10.1186/1471-2105-11-S1-S48. PMC 3009520. PMID 20122222.
^ Chauhan JS, Mishra NK, Raghava GP (June 2010). "Prediction of GTP interacting residues, dipeptides and tripeptides in a protein from its evolutionary information". BMC Bioinformatics. 11: 301. doi:10.1186/1471-2105-11-301. PMC 3098072. PMID 20525281.
^ Ansari HR, Raghava GP (March 2010). "Identification of NAD interacting residues in proteins". BMC Bioinformatics. 11: 160. doi:10.1186/1471-2105-11-160. PMC 2853471. PMID 20353553.
^ Agarwal S, Mishra NK, Singh H, Raghava GP (2011). "Identification of mannose interacting residues using local composition". PLOS ONE. 6 (9): e24039. Bibcode:2011PLoSO...624039A. doi:10.1371/journal.pone.0024039. PMC 3172211. PMID 21931639.
^ Panwar B, Raghava GP (September 2010). "Prediction and classification of aminoacyl tRNA synthetases using PROSITE domains". BMC Genomics. 11: 507. doi:10.1186/1471-2164-11-507. PMC 2997003. PMID 20860794.
^ Ansari HR, Raghava GP (October 2010). "Identification of conformational B-cell Epitopes in an antigen from its primary sequence". Immunome Research. 6: 6. doi:10.1186/1745-7580-6-6. PMC 2974664. PMID 20961417.
^ Ahmed F, Raghava GP (2011). "Designing of highly effective complementary and mismatch siRNAs for silencing a gene". PLOS ONE. 6 (8): e23443. Bibcode:2011PLoSO...623443A. doi:10.1371/journal.pone.0023443. PMC 3154470. PMID 21853133.