• Lukas Jarosch's avatar
    Add more efficient script to generate all-seqs FASTA · e2479cb5
    Lukas Jarosch authored
    The previous data_dir_to_fasta.py script is very slow and requires fully reparsing mmCIF files. This new script is much faster and uses the sequence information from the alignment data instead. Note that this will not include chains for which alignments could not be generated, but we can't use those during training anyways.
    e2479cb5
alignment_data_to_fasta.py 4.03 KB