- We test two architectures for drug repurposing: RMat-RMat and RMat-SchNet. The prefix denotes the architecture of the ligand encoder and the suffix denotes the architecture of the protein encoder.
- The experiments are conducted on the attached dataset of ~11.5k drugs and 7 proteins.
- The aim of the project is to find the best architecture for drug repurposing and prove or disprove the following hypotheses:
- Model produces satisfying results on our dataset.
- Cross-attention outperforms a representations merge.
- Self-attention layers outperform graph layers.
- General models are better than protein-specific ones.
- Restricting the input to a pocket neighbourhood helps.
- Multiple tasks do not hurt the training.
data:image/s3,"s3://crabby-images/f4e57/f4e57b013777d60359bc3e1c1a54cdd575671b5c" alt=""
data:image/s3,"s3://crabby-images/74c83/74c83e427d1af42c5ba23c1f32ded7ffd938411c" alt=""
data:image/s3,"s3://crabby-images/94ceb/94ceb7c68450164c812bab81c6f4e5c6afdd7d27" alt=""
data:image/s3,"s3://crabby-images/9fda9/9fda9ab4304df233ca448cd4bd1a56182adffe54" alt=""
data:image/s3,"s3://crabby-images/ec42a/ec42a05d0b65897cd99bf4a985cecd5f26d3fee2" alt=""
To comply with dependencies create the following environment:
conda env create -f environment.yml
To train RMat-Rmat, RMat-SchNet or other model uncomment relevant config in train.py and run:
python train.py