Commit 54a65ff6 authored by Gustaf Ahdritz's avatar Gustaf Ahdritz
Browse files

Add note about alignment format to README

parent 6da2cdaf
...@@ -302,6 +302,10 @@ multi-node distributed training, validation, and so on. For more information, ...@@ -302,6 +302,10 @@ multi-node distributed training, validation, and so on. For more information,
consult PyTorch Lightning documentation and the `--help` flag of the training consult PyTorch Lightning documentation and the `--help` flag of the training
script. script.
If you're using your own MSAs or MSAs from the RODA repository, make sure that
the `alignment_dir` contains one directory per chain and that each of these
contains alignments (.sto, .a3m, and .hhr) corresponding to that chain.
Note that, despite its variable name, `mmcif_dir` can also contain PDB files Note that, despite its variable name, `mmcif_dir` can also contain PDB files
or even ProteinNet .core files. To emulate the AlphaFold training procedure, or even ProteinNet .core files. To emulate the AlphaFold training procedure,
which uses a self-distillation set subject to special preprocessing steps, use which uses a self-distillation set subject to special preprocessing steps, use
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment