- 06 May, 2024 2 commits
-
-
Lukas Jarosch authored
-
Lukas Jarosch authored
The previous data_dir_to_fasta.py script is very slow and requires fully reparsing mmCIF files. This new script is much faster and uses the sequence information from the alignment data instead. Note that this will not include chains for which alignments could not be generated, but we can't use those during training anyways.
-