DrugRepositioning

Drugsformer: Transformer for Drug Repurposing

We test two architectures for drug repurposing: RMat-RMat and RMat-SchNet. The prefix denotes the architecture of the ligand encoder and the suffix denotes the architecture of the protein encoder.
The experiments are conducted on the attached dataset of ~11.5k drugs and 7 proteins.
The aim of the project is to find the best architecture for drug repurposing and prove or disprove the following hypotheses:
- Model produces satisfying results on our dataset.
- Cross-attention outperforms a representations merge.
- Self-attention layers outperform graph layers.
- General models are better than protein-specific ones.
- Restricting the input to a pocket neighbourhood helps.
- Multiple tasks do not hurt the training.

To comply with dependencies create the following environment:

conda env create -f environment.yml

To train RMat-Rmat, RMat-SchNet or other model uncomment relevant config in train.py and run:

python train.py

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
data		data
huggingmolecules_repo		huggingmolecules_repo
img		img
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
run_conda.sh		run_conda.sh
run_training.sh		run_training.sh
slides.pdf		slides.pdf
train.py		train.py
train_regression.py		train_regression.py
train_smiles.py		train_smiles.py