Deep learning model using convolutional neural networks to predict drug-target binding affinity from protein sequences and compound SMILES.
DeepDTA is a deep learning model that predicts drug-target binding affinity using convolutional neural networks. It takes protein sequences and compound SMILES representations as input to estimate how strongly a drug molecule will bind to a target protein, which is essential for computational drug discovery.
Bioinformatics researchers, computational chemists, and drug discovery scientists working on predicting drug-target interactions and binding affinity.
DeepDTA provides a specialized deep learning approach for binding affinity prediction that outperforms traditional methods, using CNNs to effectively model protein sequences and compound structures for more accurate predictions.
Source code for "DeepDTA: deep drug-target binding affinity prediction"
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Uses convolutional neural networks to effectively capture local patterns in 1D protein sequences and SMILES strings, leading to accurate binding affinity predictions as validated in the Bioinformatics publication.
Simultaneously processes protein sequences and compound SMILES representations, enabling comprehensive analysis of drug-target pairs for computational drug discovery.
Supports adjustable window sizes, sequence lengths, batch sizes, and training epochs via command-line arguments, allowing customization for different research datasets.
Backed by a peer-reviewed study, providing a credible and tested approach that outperforms traditional methods for binding affinity prediction.
Relies on TensorFlow 1.x and Keras 2.x, which are deprecated and may cause compatibility issues with newer Python versions and deep learning ecosystems.
Fixed on CNN architecture without support for alternative models like transformers or RNNs, which might better capture long-range dependencies in sequences.
Requires manual data placement in specific directories and environment configuration using conda, which can be error-prone for users unfamiliar with the codebase.