Commit cc1d6094 authored by zhuwenwen

update to dtk23.10

parent bd6b9500
......@@ -2,7 +2,7 @@
* @Author: zhuww
* @email: zhuww@sugon.com
* @Date: 2023-03-31 17:09:07
* @LastEditTime: 2023-08-24 09:07:01
* @LastEditTime: 2023-11-2430 10:07:01
-->
# FASTFOLD
## Paper
......@@ -21,22 +21,19 @@ FastFold constructs features by searching for homologous sequences and templates, based on the protein struc
## Environment Setup
The inference Docker image can be pulled from [光源](https://www.sourcefind.cn/#/service-details):
```
docker pull image.sourcefind.cn:5000/dcu/admin/base/custom:fastfold-0.2.1-centos7.6-dtk-22.10-patch4-py38-latest
docker pull image.sourcefind.cn:5000/dcu/admin/base/custom:fastfold-0.2.0-dtk23.10-py38-latest
# Replace <Image ID> with the ID of the Docker image pulled above
# <Host Path>: path on the host
# <Container Path>: mapped path inside the container
docker run -it --name fastfold --shm-size=32G --device=/dev/kfd --device=/dev/dri/ --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --ulimit memlock=-1:-1 --ipc=host --network host --group-add video -v <Host Path>:<Container Path> <Image ID> /bin/bash
docker run -it --name fastfold --privileged --shm-size=32G --device=/dev/kfd --device=/dev/dri/ --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --ulimit memlock=-1:-1 --ipc=host --network host --group-add video -v <Host Path>:<Container Path> <Image ID> /bin/bash
```
Image version dependencies:
* DTK driver: dtk22.10
* Pytorch: 1.10
* fastfold: 0.2.1
* DTK driver: dtk23.10
* Pytorch: 1.13
* fastfold: 0.2.0
* python: python3.8
Activate the image environment:
`source /opt/dtk-22.10/env.sh`
`source /opt/openmm-dtk-22.10/env.sh`
Test directory:
`/opt/docker/tests`
......@@ -96,7 +93,7 @@ $DOWNLOAD_DIR/
python inference.py T1024.fasta data/pdb_mmcif/mmcif_files/ \
--output_dir ./ \
--gpus 2 \
--gpus 1 \
--use_precomputed_alignments alignments/ \
--param_path /data/params/params_model_1.npz \
--uniref90_database_path data/uniref90/uniref90.fasta \
......@@ -117,7 +114,7 @@ $DOWNLOAD_DIR/
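T1024.fasta is the monomer sequence to run inference on; replace `data` with the directory where the datasets were downloaded.
`--output_dir` is the output directory; `--gpus` is the number of GPUs to use; `--use_precomputed_alignments` is the directory of precomputed alignments, which lets previously searched and aligned sequences be loaded; if it is omitted, the search and alignment are run.
`--param_path` is the path to the monomer model parameters and must be consistent with `--model_name`, which defaults to model_1; `--chunk_size` is the number of chunks, set to 4, and `--inplace` is used as well to reduce GPU memory usage.
Relaxation is not performed by default; add `--relaxation` if it is needed. The output .pkl file is not saved by default; add `--save_outputs` if it is needed.
Relaxation is not performed by default; add `--relaxation` if it is needed. The output .pkl file is not saved by default; add `--save_outputs` if it is needed.
AlphaFold's data preprocessing takes a long time, so we use [ray](https://docs.ray.io/en/latest/workflows/concepts.html) to speed up the data preprocessing workflow.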
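The flags described above can also be assembled programmatically. The following is a minimal sketch and not part of the repository: `build_inference_cmd` is a hypothetical helper, its default values are illustrative assumptions, and it only emits flag names that appear in this README. The database path flags (for example `--uniref90_database_path`) are not modeled and would have to be passed through `extra_args`.

```python
import shlex
from typing import List, Optional


def build_inference_cmd(
    fasta: str,
    mmcif_dir: str,
    output_dir: str = "./",
    gpus: int = 1,
    alignments_dir: Optional[str] = "alignments/",
    param_path: Optional[str] = None,
    chunk_size: Optional[int] = 4,
    inplace: bool = True,
    relaxation: bool = False,
    save_outputs: bool = False,
    extra_args: Optional[List[str]] = None,
) -> List[str]:
    """Hypothetical helper: build the argument list for inference.py."""
    cmd = ["python", "inference.py", fasta, mmcif_dir,
           "--output_dir", output_dir, "--gpus", str(gpus)]
    # Omitting the alignments directory means search and alignment are run.
    if alignments_dir is not None:
        cmd += ["--use_precomputed_alignments", alignments_dir]
    if param_path is not None:
        cmd += ["--param_path", param_path]
    # Chunking plus in-place operation is what the README uses to save memory.
    if chunk_size is not None:
        cmd += ["--chunk_size", str(chunk_size)]
    if inplace:
        cmd.append("--inplace")
    if relaxation:
        cmd.append("--relaxation")
    if save_outputs:
        cmd.append("--save_outputs")
    # Anything not modeled here (e.g. database paths) is appended verbatim.
    if extra_args:
        cmd += list(extra_args)
    return cmd


if __name__ == "__main__":
    cmd = build_inference_cmd(
        "T1024.fasta",
        "data/pdb_mmcif/mmcif_files/",
        param_path="/data/params/params_model_1.npz",
        save_outputs=True,
    )
    # Print a shell-quoted command line for inspection.
    print(shlex.join(cmd))
```

Returning a list rather than a single string keeps the result usable with `subprocess.run` without shell quoting concerns.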
......@@ -126,7 +123,7 @@ AlphaFold's data preprocessing takes a long time, so we use [ray](ht
### Multimer
python inference.py SUGP1.fasta data/pdb_mmcif/mmcif_files/ \
--output_dir ./ \
--gpus 2 \
--gpus 1 \
--use_precomputed_alignments alignments/ \
--model_preset multimer \
--uniref90_database_path data/uniref90/uniref90.fasta \
......@@ -164,13 +161,18 @@ alignments/
{target_name}_{model_name}_relaxed.pdb
```
View the protein 3D structure: [https://www.pdbus.org/3d-view](https://www.pdbus.org/3d-view)
![img](./docs/result_pdb.png)
[View the protein 3D structure](https://www.pdbus.org/3d-view)
<div style="display: flex; justify-content: center; align-items: center;">
<img src="./docs/result_pdb.png" alt="Image">
<div style="position: absolute; top: 50%; left: 50%; transform: translate(-50%, -50%); background: rgba(0, 0, 0, 0.5); color: #fff; padding: 10px;">
Red is the true structure; blue is the predicted structure
</div>
</div>
## Accuracy
Test data: [casp14](https://www.predictioncenter.org/casp14/targetlist.cgi), [uniprot](https://www.uniprot.org/); accelerator used: 4DCU Gen1-16G
Test data: [casp14](https://www.predictioncenter.org/casp14/targetlist.cgi), [uniprot](https://www.uniprot.org/); accelerator used: 1Z100L-32G
1. Compute the lddt value
1. Compute the plddts value
python3 pkl2plddt.py
Here, data_path is the path to the pkl file generated by inference.
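As a rough illustration of this step, the sketch below averages per-residue pLDDT scores stored in an output pickle. It is not the repository's `pkl2plddt.py`: `mean_plddt` is a hypothetical name, and the code assumes the pickle holds a dict with a `plddt` entry of per-residue scores (the AlphaFold-style output layout), which may differ from what this project actually writes.

```python
import pickle
from statistics import mean


def mean_plddt(pkl_path, key="plddt"):
    """Return the mean per-residue pLDDT stored in a result pickle.

    Assumption: the pickle is a dict and `key` maps to a 1-D sequence
    (list or NumPy array) of per-residue scores.
    """
    with open(pkl_path, "rb") as fh:
        result = pickle.load(fh)
    scores = result[key]
    # NumPy arrays expose tolist(); plain Python lists pass through unchanged.
    if hasattr(scores, "tolist"):
        scores = scores.tolist()
    return mean([float(s) for s in scores])


if __name__ == "__main__":
    import sys

    # Usage: python mean_plddt_sketch.py <path to result .pkl>
    print(f"mean pLDDT: {mean_plddt(sys.argv[1]):.3f}")
```

Only load pickles produced by your own inference runs, since unpickling executes code embedded in the file.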
......@@ -179,9 +181,9 @@ alignments/
2. Other accuracy metrics: [https://zhanggroup.org/TM-score/](https://zhanggroup.org/TM-score/)
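To make the TM score column easier to interpret, here is a simplified sketch of the TM-score formula. It evaluates the score for one fixed superposition, given the distances between aligned residue pairs; the real TM-score program additionally searches for the superposition that maximizes this value, so this sketch yields a lower bound. The function names and the `ValueError` for very short targets are choices made for this illustration.

```python
from typing import Sequence


def tm_d0(target_length: int) -> float:
    """Length-dependent distance scale d0 = 1.24 * (L - 15)^(1/3) - 1.8."""
    if target_length <= 15:
        raise ValueError("d0 formula is only meaningful for L > 15")
    d0 = 1.24 * (target_length - 15) ** (1.0 / 3.0) - 1.8
    if d0 <= 0:
        # Sketch-level guard: short targets give a non-positive d0.
        raise ValueError("target too short for this simplified sketch")
    return d0


def tm_score_fixed(distances: Sequence[float], target_length: int) -> float:
    """TM-score for a fixed superposition.

    distances: distance (in Angstrom) for each aligned residue pair.
    target_length: length L of the target (reference) structure.
    Unaligned residues contribute nothing, so they lower the score.
    """
    d0 = tm_d0(target_length)
    total = sum(1.0 / (1.0 + (d / d0) ** 2) for d in distances)
    return total / target_length


if __name__ == "__main__":
    length = 172  # sequence length of T1026 in the table
    print(f"d0 for L={length}: {tm_d0(length):.3f}")
    # A perfect superposition (all distances zero) scores exactly 1.
    print(tm_score_fixed([0.0] * length, length))
```

Because d0 grows with the target length, the same per-residue deviation is penalized less on longer proteins, which is what makes TM scores comparable across the sequence lengths listed in the table.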
Accuracy data:
| Data type | Sequence type | Sequence label | Sequence length | GDT-TS | GDT-HA | LDDT | TM score | MaxSub | RMSD |
| Data type | Sequence type | Sequence label | Sequence length | GDT-TS | GDT-HA | PLDDTS | TM score | MaxSub | RMSD |
| :------: | :------: | :------: | :------: |:------: |:------: | :------: | :------: | :------: |:------: |
| fp32 | Monomer | T1026 | 172 | 0.914 | 0.765 | 79.634 | 0.941 | 0.907 | 1.289 |
| fp32 | Monomer | T1024 | 408 | 0.595 | 0.441 | 90.828 | 0.663 | 0.489 | 5.779 |
| fp32 | Monomer | T1053 | 580 | 0.937 | 0.782 | 92.284 | 0.984 | 0.929 | 1.105 |
| fp32 | Monomer | Q9NYK1 | 1046 | 0.907 | 0.744 | 86.642 | 0.962 | 0.905 | 5.757 |
......
This source diff could not be displayed because it is too large. You can view the blob instead.
>chain_sp_Q15637_SF01_HUMAN_Splicing_factor_1_OS_Homo_sapiens_OX_9606_GN_SF1_PE_1_SV_4_292_370
CKFQRPGDPQSAQDKARMDKEYLSLMAELGEAPVPASVGSTSGPATTPLASAPRPAAPANNPPPPSLMSTTQSRPPWMN
>tr|A0A151NFQ3|A0A151NFQ3_ALLMI Splicing factor 1 isoform B OS=Alligator mississippiensis GN=SF1 PE=4 SV=1
CKFTRPGDPQSAQDKARMDKEYLSLMAELGEAPVPTSVGSSSGPTTTPLSSGPRPAGPGSSQPPP-------SRPLWMN
>tr|A0A146NVK0|A0A146NVK0_FUNHE Splicing factor 1 OS=Fundulus heteroclitus PE=4 SV=1
CKYTSTFaaqratggepp--QSAQDKARMDKEYLSLMAELGEAPVPSSGGG---HSST-QSGGPRASGPNNNQPPP---PIRRSPILWTK
>tr|A0A060XMC0|A0A060XMC0_ONCMY Uncharacterized protein OS=Oncorhynchus mykiss GN=GSONMT00048929001 PE=4 SV=1
CKFTSSFaapragepp--QSAQDKARMDKEYLSLMAELGEAPVPSSGAG---HCNN-QNSGHNRSNNNN-QPPPSRP-------PWMN
>tr|F7DBZ6|F7DBZ6_XENTR Uncharacterized protein (Fragment) OS=Xenopus tropicalis GN=sf1 PE=4 SV=1
CKFTSVTvrpgEPQSAQDKARMDKEYLSLMAELGEAPVPTPMGPGSGPSHNPVQGGPRPGGMTGNAPPMKLIKVVHNRPPWMT
>tr|F6X335|F6X335_CALJA Uncharacterized protein OS=Callithrix jacchus PE=4 SV=1
CKFQRPRDHQPAQEKARMNKEYLFLMAELGEAPVPASVDSVSGPATTALASSPRPAAPASNPPPPSLMSTTQSCPPWMN
>ERR1719354_782846
CKFTSSYaprpgepp--QSAQDKARMDKEYLSLMAELGEAPVGGPSGGGGGGHGG-HNSGHHGGGRGNNQGPPSRP-------PWMN
>tr|W5M280|W5M280_LEPOC Splicing factor 1 OS=Lepisosteus oculatus PE=4 SV=1
CKFTSSFasrpgEPQSAQDKARMDKEYLSLMAELGEAPVPSSSGG---HSNAPHHGGHRGSGPGGNQPQQ-------NRPPWMN
>tr|H9GN87|H9GN87_ANOCA Splicing factor 1 OS=Anolis carolinensis GN=SF1 PE=4 SV=2
CKFAR--pgDPQSAQDKARMDKEYLSLMAELGEAPVPASVGsssG---PSNPPLQSGPRPSGPGnSQP-PP-------NRPPWMN
>tr|V9KR29|V9KR29_CALMI Splicing factor 1-like protein (Fragment) OS=Callorhinchus milii PE=2 SV=1
CKFTSPGtfnrpgDPQSAQDKARMDKEYLSLMAELGEAPVPTSSGslhT---NSAPSMQ---RSSAPGgG---qiLP-------NRPPWMN
>tr|A0A1S3N6T6|A0A1S3N6T6_SALSA splicing factor 1-like OS=Salmo salar GN=LOC106577532 PE=4 SV=1
CKFTSSFaapragePPQSAQDKARMDKEYLSLMAELGEAPVPSSGGG---HSNNQNSG-H-NRGNNn-NQPPP-------SRPPWMN
>tr|A0A087Y6W1|A0A087Y6W1_POEFO Splicing factor 1 OS=Poecilia formosa PE=4 SV=2
CKYTSTFagqratggePPQSAQDKARMDKEYLSLMAELGEAPVPSSGGG---HSSSP-AGAPRASGPNSNQPPP-------NRPPWMS
>tr|A0A1W4ZUX6|A0A1W4ZUX6_9TELE splicing factor 1 isoform X1 OS=Scleropages formosus GN=sf1 PE=4 SV=1
CKFTSSFaprpgEPQSAQDKARMDKEYLSLMAELGEAPVASSSGG---PSRSN-PSGPRGSGPSNNQPPP-------NRPPWMN
>tr|A0A0R4IBT0|A0A0R4IBT0_DANRE Splicing factor 1 OS=Danio rerio GN=sf1 PE=1 SV=1
CKFTSSFaprpgePPQSAQDKARMDKEYLSLMAELGEAPVPSSGGG---HNNAP-PSGPRPSGPNNNQPPP-------NRPPWMN
>tr|G3N6Z3|G3N6Z3_GASAC Splicing factor 1 OS=Gasterosteus aculeatus PE=4 SV=1
CKYTSSFaahratggePPQSAQDKARMDKEYLSLMAELGEAPVPTSGGG---HSSSQ-GGSQRSSGLNNNH-QS-------NRPPWMN
>tr|H3BGM9|H3BGM9_LATCH Splicing factor 1 OS=Latimeria chalumnae GN=SF1 PE=4 SV=1
CKFASSFtirpgDPQSAQDKARMDKEYLSLMAELGEAPVPAppvSSSG---PSNAP-LPsGPRPSGPSGNQQPM-------NRPPWIN
>tr|A0A2C9K2W4|A0A2C9K2W4_BIOGL Uncharacterized protein OS=Biomphalaria glabrata PE=4 SV=1
CKQKRPGdtfrmqqmqNP---ADRAKMDSEYMSLMAELGEGPPPPkseAP--------------------NQnPAANFGRPLlsnpppnpmaMNS---PWQM
>tr|A0A0B7A606|A0A0B7A606_9EUPU Uncharacterized protein OS=Arion vulgaris GN=ORF99065 PE=4 SV=1
CKQKRPGdtfrvqqmqNP---AERAKMDSEYMSLMAELGEGPPPPpksEP--------------------SNmVPTTYGRTLlsnpppnpmaMNS---PWQM
>tr|K1QZW2|K1QZW2_CRAGI Splicing factor 1 OS=Crassostrea gigas GN=CGI_10012625 PE=4 SV=1
CKSKKPGdsfknfpqngNPVSQADKAKMDSEYMSLMAELGEGPPPPktqTHP-------------------TPaVQTY-RPS--FS-------
>tr|V4B7Z6|V4B7Z6_LOTGI Uncharacterized protein OS=Lottia gigantea GN=LOTGIDRAFT_211871 PE=4 SV=1
CKQKRYNspmp-IVSQADKAKMDSEYMSLMAELGEGPPPSskpPGSN---LT----PLmNQHIAPPPNlQQMQMNRPNmnppppplmgnnNQQSVPPWQQ
>ERR550534_572001
CKTKRPGdmrderfgggggwggggfggrgggrGGHMAHEKAKMDEEYMSLMAELGQGPPPPppgHSST---P---------------------PREEngmgGSGGGMSRGG
>ERR1719239_1662057
CKQKRPGdtfqmqqmqNP---AEKAKMDSEYMSLMAELGEGPPPPpkqDM--------------------NPqVQNNLRQPLltnpppnsmgMNS---PWGM
>ERR1719483_1498850
CKSKRPWdsfrgqmqmqNP---AEKAKMDNEYMSLMAELGEGPAPAkndGPP-------------------QQqQQSHFGRPLlnnpppnpmgMNS---PWQR
>tr|R7V9H1|R7V9H1_CAPTE Uncharacterized protein OS=Capitella teleta GN=CAPTEDRAFT_219446 PE=4 SV=1
CKQKRPGeeiqaqln--QTPADRAKMDSEYMSLMAELGEGPPPKaqpQTSH---SR----PF--------------MQ----P--PPWQQ
>ERR1719150_2887651
CKQRRPGANfneewggqgGGGAGGNKIDQEYLSFMAELGDGRPGRRR-----------------------------------------
>UPI0005441643 status=active
CKMKRGGPPqnssnQGMGSGEKMDYEYMSLMAELGEGPPPPQG-----------------------------------------
>BogFormECP03_OM3_1039632.scaffolds.fasta_scaffold39806_1 # 3 # 170 # -1 # ID=39806_1;partial=10;start_type=TTG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.577
CKSKRPGMGgiEGSNNQAKIDEEYMSLMAELGEVQPQEAP-----------------------------------------
>LakMenEpi03Aug12_release.lakeMendotaPanAssembly.Ray.scaffolds.fasta_scaffold5278426_1 # 1 # 282 # -1 # ID=5278426_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.408
CRSARSGGYsggggesggGGAAAGNKIDEEYMSLMAELGEGPSPKVD-----------------------------------------
>tr|A0A1Y3EGW9|A0A1Y3EGW9_9BILA RNA polymerase Rpb3/Rpb11 dimerization domain protein OS=Trichinella nativa GN=D917_09165 PE=4 SV=1
CKNPTHGGA---PTGAALDEEYSALMAELGHETTRPTER----------------------------------------
>ERR1719187_123057
CRQKRPGNGVPgqySNTANKIDEEYMSLMAELGEGPPPPATS----------------------------------------
>tr|A0A1S3KCY7|A0A1S3KCY7_LINUN splicing factor 1-like OS=Lingula unguis GN=LOC106180632 PE=4 SV=1
CKQKKPGDSFRnltamaptPIDKAKMDSEYMSLMAELGEGPPPEQPK----------------------------------------
>ERR1719468_130774
CMGKKPGGWN-GEPKTAMDEEYMSLMAELGEGPAPPPPG----------------------------------------
>ERR1719319_1888632
CRQKRPGAGGPgqfsNNANNKIDEEYMLLMAELGEGPPPPTSG----------------------------------------
>ERR1719273_391907
CRQKRPGNGPPgqfSGGANKIDEEYMSLMAELGEGPPPPTSA----------------------------------------
>tr|A0A1S3D0V8|A0A1S3D0V8_DIACI splicing factor 1 OS=Diaphorina citri GN=LOC103508985 PE=4 SV=1
CREKRPGMGGPpantHRNRAKIDEEYMSLMAELGEGPPPDKRQ----------------------------------------
>ERR1719419_954095
CKTKRPGDMRDqrfarppgtgggfpfgpaGHEKQKMDEEYMSLMAELGQGPPPPGSN----------------------------------------
>tr|C1L4S2|C1L4S2_SCHJA Splicing factor 1 (Zinc finger protein 162) OS=Schistosoma japonicum PE=2 SV=1
CKALLGGQAYLdqlnanPSERAKMDSEYTALMAELGVGGGSQGLR----------------------------------------
>ERR1719495_79040
CTGRRPGTGFPsstgggGGGESNIDEEYMSLMAELGEGPPPPPKD----------------------------------------
>tr|A0A087UIE5|A0A087UIE5_9ARAC Uncharacterized protein (Fragment) OS=Stegodyphus mimosarum GN=X975_26308 PE=4 SV=1
CREQKNPT-GVigAPDKAKIDEEYMSLMAELGEGPPVPNKM----------------------------------------
>tr|T1JNQ7|T1JNQ7_STRMM Uncharacterized protein OS=Strigamia maritima PE=4 SV=1
CREKRPGNVFPggnGVDKSKIDEEYMSLMAELGEGPPPPNKG----------------------------------------
>tr|A0A195F8P2|A0A195F8P2_9HYME Splicing factor 1 OS=Trachymyrmex septentrionalis GN=ALC56_08599 PE=4 SV=1
CRSKRPGQGGPaaagmggmgqAGDKAKIDEEYMSLMAELGEGPPPDRSK----------------------------------------
>tr|A0A067R957|A0A067R957_ZOONE Splicing factor 1 OS=Zootermopsis nevadensis GN=L798_10693 PE=4 SV=1
CRQKRPGAGGPnaAGDKNKIDEEYLSLMAELGEGPPPNRDN----------------------------------------
>tr|A0A1B6C4G7|A0A1B6C4G7_9HEMI Uncharacterized protein OS=Clastoptera arizonana GN=g.5228 PE=4 SV=1
CRQKRPGSAGPggippvRQDKAKIDEEYMSLMAELGEGPPPPNQG----------------------------------------
>tr|A0A1B6M4Y9|A0A1B6M4Y9_9HEMI Uncharacterized protein (Fragment) OS=Graphocephala atropunctata GN=g.37969 PE=4 SV=1
CRQKRPGGDKVvpatRQEKAKIDQEYMSLMAELGEGPPPPAKT----------------------------------------
>tr|J9K3Z9|J9K3Z9_ACYPI Uncharacterized protein OS=Acyrthosiphon pisum GN=LOC100166679 PE=4 SV=2
CRMKNTGGASFpiSQDKNKIDEEYMSLMAELGEGPPPPKHD----------------------------------------
>tr|A0A132A4P5|A0A132A4P5_SARSC Splicing factor 1-like protein OS=Sarcoptes scabiei GN=QR98_0044240 PE=4 SV=1
CKEKRAESN---VNQAKIDEEYLSLMAELGEAPPIQTSN----------------------------------------
>tr|A0A1Y3AU68|A0A1Y3AU68_EURMA Splicing factor 1-like protein (Fragment) OS=Euroglyphus maynei GN=BLA29_000057 PE=4 SV=1
CKEKRVESN---VNQAKIDEEYLSLMAELGEAPPSSAVS----------------------------------------
>ERR1719322_725732
--------TSGGIDRAKMDSEYMSLMAEIGESPAPSQSSD---------------------------------------
>ERR1739838_885244
--------LGGGIDRAKMDSEYMSLMAEIGESTGPSPSSS---------------------------------------
>ERR550534_2162228
--------GGPCVDRAKMDSEYMSLMAELGEGPMPTNDDH---------------------------------------
>tr|S4RV90|S4RV90_PETMA Splicing factor 1 OS=Petromyzon marinus PE=4 SV=1
--------PQGNQERARMDEEYLSLMAELGEETPARSGGG---------------------------------------
>tr|A0A074ZF33|A0A074ZF33_9TREM Uncharacterized protein OS=Opisthorchis viverrini GN=T265_06824 PE=4 SV=1
---------TNPSERAKMDSEYSALMAELGVGA-GSMLP----------------------------------------
>tr|A0A068WMH0|A0A068WMH0_ECHGR Zinc finger protein OS=Echinococcus granulosus GN=EgrG_000665700 PE=4 SV=1
---------INPMERAKMDSEYTALMAELGVGYGGGAATL---------------------------------------
>tr|A0A0R3SKU2|A0A0R3SKU2_HYMDI Uncharacterized protein OS=Hymenolepis diminuta PE=4 SV=1
---------INPTERAKMDSEYTALMAELGVGYNTGSSGG---------------------------------------
>tr|A0A2H1CP97|A0A2H1CP97_FASHE Uncharacterized protein (Fragment) OS=Fasciola hepatica GN=D915_03272 PE=4 SV=1
---------NNPTERAKMDSEYSALMAELGVGAASQFLNS---------------------------------------
>ERR1719354_180810
--------QASGVDRAKMDSEYMSLMAELGEHPAPEKSGG---------------------------------------
>tr|W4YQE3|W4YQE3_STRPU Uncharacterized protein OS=Strongylocentrotus purpuratus PE=4 SV=1
CREKATGDrGpmsqpiVNSADKAKMDSEYLSLMAELGEGPLPGGGG----------------------------------------
>ERR1719319_956392
CKQRRPGSgQygd-eygqpPAPGSAKIDAEYMSFMAELDGGPPPPPGA----------------------------------------
>ERR1719350_1966652
CKARRAGE-w-ae--GPKTAMDEEYLSLMAELGEGPGPAAPP----------------------------------------
>ERR1719347_1272148
--GPPPPGgApga-aPRP-------SGFAPANPLGLGPPPPGGA----------------------------------------
>tr|A0A287CX92|A0A287CX92_ICTTR Uncharacterized protein OS=Ictidomys tridecemlineatus PE=4 SV=1
-CKFQRPGdPq-SAQDKARMDKEYLSLMAELGEAPVPASVG----------------------------------------
>ERR1739838_361296
CRSDAGHQqGaa-mPGVDRAKMDSEYLSLMEELGEKTVPGPNS----------------------------------------
>tr|A0A1W2VPP9|A0A1W2VPP9_CIOIN zinc finger protein ZF(CCHC)-13 OS=Ciona intestinalis GN=zf(cchc)-13 PE=4 SV=1
CRSEHSSSqLqqv-dgsGNVDRAKMDSEYQSLMAELGEGPPSSGGN----------------------------------------
>ERR1719187_777014
CRQRQPGAgP-rqAPVDRQKIDEEYMSLMAELGEGPPPPGND----------------------------------------
>ERR1719203_749434
CRQKRPGAgApgq-FSNANNKIDEEYMSLMAELGEGPPPPASS----------------------------------------
>ERR1719215_2519331
CKSRRPGAaFndn-kRPPDRNNIDAEYMSLMAELEEGPPPAPPR----------------------------------------
>ERR1719369_206618
--------gPpgq-f-SNTVNKIDEEYMSLMAELGEGPPPPTST----------------------------------------
>ERR1719186_1696843
CKARRTGDwQqqq-ggsgp-SAGGSKIDAEYMSLMAELGEGPPPPAS-----------------------------------------
>ERR1719319_1105274
CKARRAGDwPagq-ggppgAPAGGHKMDQECMSLMAELGEGPPPQQVP----------------------------------------
>tr|E0VU73|E0VU73_PEDHC Splicing factor, putative OS=Pediculus humanus subsp. corporis GN=8231206 PE=4 SV=1
CRNKRPGGvaqtt---GTESRKIDQEYMSLMAELGEVPPQGR------------------------------------------
>tr|A0A0Q9WGQ8|A0A0Q9WGQ8_DROVI Uncharacterized protein, isoform B OS=Drosophila virilis GN=Dvir\GJ27151 PE=4 SV=1
CRNKRPGSgapgm-ACEDTQAKIDEEYMSLMAELGEGPPPSA------------------------------------------
>tr|A0A1Q3F4A7|A0A1Q3F4A7_CULTA Putative splicing factor 1/branch point binding protein rrm superfamily OS=Culex tarsalis PE=4 SV=1
CRSKRPGQggppaaG--NNNQAKIDEEYMSLMAELGEGPPPET------------------------------------------
>LauGreSuBDMM15SN_2_FD.fasta_scaffold235828_1 # 2 # 472 # -1 # ID=235828_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.285
CIQTDLPPiPVVQVDKAKMDSEYMSLMAELGEGPPPEP------------------------------------------
>ERR1719494_765280
CKVLQKEgaapVQQTFAEKAKMDNEYLSLMAELGCDEPAA-------------------------------------------
>ERR550534_2347854
CKVNRSmvptpglGtgnmTGQTYTEKAKMDNEYLALMAELGGEAPPP-------------------------------------------
>ERR1719428_1238246
CKINTGsSnhggHAQTFAEKAKMDNEYLALMAELGGEAPPP-------------------------------------------
>tr|B3SA56|B3SA56_TRIAD Predicted protein OS=Trichoplax adhaerens GN=TRIADDRAFT_61140 PE=4 SV=1
CKQKSNeEggnePVEPVIDREKMDNEYLSLMAELGEGSAPE-------------------------------------------
>ERR1711860_468308
CKGRRPGStPFDKPNTQKIDREYMSLMAELGEGPQPPPPSGGGGPG----------------------------------
>ERR1719245_1941769
CKQKRPGSGfppygeAPGGGGNKIDQEYMSLMAELGEGPPPPAGTPGAGPR----------------------------------
>ERR550517_1207180
CKARRTGDWqqqgggGPSAGGHKMDQEYMSLMAELGEGPPPQQMPS---------------------------------------
>SwirhisoilCB2_FD_contig_71_1479955_length_636_multi_3_in_0_out_0_1 # 1 # 636 # -1 # ID=2655841_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.675
CTGKRPGQGfgeggfGGDGPNKNIDEEYMSLMAELGEGPPPPKQER---------------------------------------
>ERR1719193_2579107
CKQRRPGSGFppygeapgGGGN--KIDQEYMSLMAELGEGPPPPG------------------------------------------
>ERR1719237_1542174
CKQRRPGSGQygdefgqppaPGSA--KIDAEYMSFMAELDGGAPP--------------------------------------------
>ERR1719500_144809
G--VGGGAGRgarggpgGPAGGHKMDQEYMSLMAELGEGPPPQQ------------------------------------------
>ERR1719264_1910087
LRGRRS-HYRrlqgtshrglATAGGHKMDQEYMSLMAELGEGPPPQQ------------------------------------------
>ERR1719427_2218414
CKNKRPGAGFnqfGEPADRKIDQEYMSFMAELGDGPAPGP------------------------------------------
>ERR1719397_186377
CKQKRPGTGFgyqygea-PGAGRKIDQEYDAFMAQLGDKPGPGG------------------------------------------
>tr|A0A1B0G6R9|A0A1B0G6R9_GLOMM Uncharacterized protein OS=Glossina morsitans morsitans PE=4 SV=1
CRSKGPGAVgelgiveNAENSQAKIDEEYMSLMAELGEGPPPSTET----------------------------------------
>ERR1719351_472967
CKQRRPGAGfnefgqPPAPGGNKIDQEYLSFMAELGDGPPGNGPA----------------------------------------
>tr|A0A1A9UTN5|A0A1A9UTN5_GLOAU Uncharacterized protein OS=Glossina austeni PE=4 SV=1
CRSKRPDAV---EAQAKIDEEYMSLMAELGEGPPLSAQP----------------------------------------
>ERR1719273_1401788
CMGKKTGSwDQgp----KTAMDEEYMSLMAELGEGPAPTPQ-----------------------------------------
>ERR1719492_302232
CTGKRPGQGFgeggfggdG--PNKNIDEEYMSLMAELGEGPGRGGR-----------------------------------------
>tr|A0A0K2UQT3|A0A0K2UQT3_LEPSM Uncharacterized protein (Fragment) OS=Lepeophtheirus salmonis PE=4 SV=1
CTQKRPGMgGFen--ASQNKMDDEYMSLMAELGEAPPPGTG-----------------------------------------
>ERR1719295_2098979
CTGKRPGFGFggsgggggnDGEGGSNIDEEYMSLMAELGEGPAAVAA-----------------------------------------
>ERR1719245_1032388
CKQRRPGSgY-deygqppAPGS-AKIDQEYMSFMAELDG----GA------------------------------------------
>ERR1719239_1240936
CKQRRPGGgF-gqygeppAPGS-NKIDEELEEPLgstAVLVKPDALCQL-----------------------------------------
>ERR1719500_2515869
CMVGRPGHnVPpgtgpggivppdkwgAPGQVGELDKEYESLMAELSGKTPPPSS-----------------------------------------
>ERR1719209_2074478
CMVGRPGHqVPpGTGPGGIVPPDKWgppSAGAQFGGGPPRAP------------------------------------------
>ERR1719220_3032897
LMAELSGKsPPpSSGPG---DAGKWappSAGAQFGGGPPRAP------------------------------------------
>ERR1719295_1867487
CRQKRPGEvFNkvksk--ADPKVIDAEYEAFLNDMDGKAGG--------------------------------------------
>ERR1719225_314164
CKQRRPGAnFNeewggqggGGAGGNKIDHEYLSFMAELGDGPAPPAP-----------------------------------------
>ERR1719300_1200922
LAslLLLEAtrlikSiCHlwlsWVMDLLGMDQHLLDLLLLLGQEQDLLQL-----------------------------------------
>ERR1719410_18487
CKMKRPGAgFPpygeaaG-SGGTKIDQEYMSLIAELGEGPPPPGT-----------------------------------------
>ERR1711976_3245
CMGKKGGGWSEAGPKSAMDEEYMSLMAELGEGPPPPT------------------------------------------
>ERR1719167_1777327
CKQRRPGAqDWtqpnpTPASGHKMDQEYMSLMAELGEGPPPPQQ-----------------------------------------
>ERR1719167_153244
CKQKRPGTgFPpfgeAGGAGTKIDQEYMSLMAELGEGPPPPAG-----------------------------------------
>ERR1719225_2379548
CKARRTGDwQQqggggpgSAAGGHKMDQEYMSLMAELGEGPMAQQG-----------------------------------------
>ERR1719481_1576490
CKQRRPGQdW-gsssapVAVGGHKMDEEYMSLMAELGEGPPPQVA-----------------------------------------
>ERR1719334_1087590
CKQRRPGQdWGsssapVAVGGHKMDEEYMSLMAELGEGPPPPQQ-----------------------------------------
>ERR1719317_407312
CKQKRPGSgFPpygeapG-GGGSKIDAEYMSLMAELGEGPPPPGG-----------------------------------------
>ERR1719450_1782325
PAGPPGGGsGAwggnnwSAPGAKPL----MSQPVQPPWGGPPKNG-----------------------------------------
>ERR1719282_2149255
CKQRRPGAgF-dqfggqppAPGS-NKIDQEYLSFMAELGDGPPGTAP-----------------------------------------
>ERR1719186_947515
CKSRRPGSNFnppp--RQEKNIDAEYMSLMAELGEGPAPPPQ-----------------------------------------
>ERR1719312_1305817
CTGRRPGYgFGgsgggggnDSEGGSNIDEEYMSLMAELGEGPAPPPK-----------------------------------------
>ERR1719195_892826
CKQRRPGAnFNeewggqggGGAGPHPRHRAG----AGAGRGELMPAT-----------------------------------------
>ERR1719483_1553578
CKQRRPGTg--dfigsaGPPGGSKIDAEYMSLMAELGEGPAPPED-----------------------------------------
>tr|A0A2A2M0F2|A0A2A2M0F2_9BILA Uncharacterized protein OS=Diploscapter pachys GN=WR25_06106 PE=4 SV=1
CRAEPGDA---MNMAAMMDDEYSALMQELGEKPMNKPPVG---------------------------------------
>tr|G5EF97|G5EF97_CAEEL SF1 protein OS=Caenorhabditis elegans GN=sfa-1 PE=1 SV=1
CKNPKGM----YASEAGMDDEYSALMAELGETPAAGAGAG---------------------------------------
>tr|G0MRF9|G0MRF9_CAEBE CBN-SFA-1 protein OS=Caenorhabditis brenneri GN=Cbn-sfa-1 PE=4 SV=1
CKNPKGM----YASEAGMDDEYSALMAELGETPTGGISSS---------------------------------------
>tr|A0A1I7V4P6|A0A1I7V4P6_9PELO Uncharacterized protein OS=Caenorhabditis tropicalis PE=4 SV=1
CKNPKGM----YSSEAGMDDEYSALMAELGETPASGVNSL---------------------------------------
>tr|U5EVP1|U5EVP1_9DIPT Putative splicing factor 1 (Fragment) OS=Corethrella appendiculata PE=2 SV=1
CRNKRPGqggppd----MNTQAKIDEEYMSLMAELGEGPIPESNGS---------------------------------------
>ERR1740128_1312246
CKSRRPGasfnnqgG---PGGATKIDREYMSLMAELGEGPPPPPPGT---------------------------------------
>ERR1719383_334586
CKQRRPGerfnqgpP---DKNGSKIDREYMSLMAELGEGPPPPPPSG---------------------------------------
>tr|A0A1J1HKR0|A0A1J1HKR0_9DIPT CLUMA_CG002213, isoform A OS=Clunio marinus GN=putative Splicing factor 1 PE=4 SV=1
CKSKRPGmggveg----SNNQAKIDEEYLSLMAELGEVQPQEAPVQ---------------------------------------
>ERR1719510_115441
CVGKRPGTNFGtsgPGSNQNMDDEYMSLMAELGEGPAPPSKT----------------------------------------
>ERR550532_1169838
CKARRSGDWQQqqgdsgpSAGGHKMDQEYMSLMAELGEGPP-PQQM----------------------------------------
>ERR1719350_2442014
CKQRRPGDWQQqggsshSAGGHKMDQEYMSFMAELDGGAPPPPGA----------------------------------------
>ERR1719495_926933
CTGKRPGYGFGgndag-PNNANIDEEYMSLMAELGEGPAPPPKN----------------------------------------
>ERR1719361_2402365
---KAEI--HDisirkkFISTPLSVQNVSKKMSRHERGMSGANNV----------------------------------------
>ERR1719328_897334
CKRRPGAGFD-qfggqppap--GSNKIDQEYLSFMAELGDGPPGAGGG----------------------------------------
>ERR1719245_2412213
CMGQKTNKPWNqggggGGKQNAMDEEFANFMAELGEGPAPPGTG----------------------------------------
>ERR1740131_101980
CKSRRPGS-nFNppPRQEKNIDAEYMSYGGARGGASPSTPRQ----------------------------------------
>ERR1719300_575491
CTGRRPGYGFGgsgggggndGEGGSNIDEEYMSLMAELGEGPAPPPKE----------------------------------------
>ERR1719510_2788115
CVAKRPGTNFGnigPGSNQNMDEEYMPVMAELGEGSATYSKT----------------------------------------
>ERR1719500_1849896
VWVKKTGSWDQ-GPKTAMDEEYLSLMAELGEGPGPAAPP----------------------------------------
>ERR1719433_269159
MDQGKIDQEYL------------SFMAELGGDQSESWNY----------------------------------------
>ERR1719323_366708
CQQRRPGTGFGpggqH-HQQHRVLRLR-QRGSHRQGLPAAEAR----------------------------------------
>ERR1719216_154483
CKQRRPGASFGqyeeggagKMDQGKIDQEYLSFMAELGGDQSESWNY----------------------------------------
>ERR1719322_1038848
R-VTSPGTASSgd----------PAPASARAANTTRPRWTT----------------------------------------
>ERR1719242_1131658
CMGKKTGEWNSgg--SKNAMDEEYMSLWLNWVKVLLHPELL----------------------------------------
>ERR1719291_957451
CKQRRPGANFNeewggqgggGAGGNKIDQEYLSFMAELGDGPPGGGPA----------------------------------------
>ERR1719189_116843
MRSPRGDQKr--SHSHRSRSRSKDRSTQKRGDQKCSRSHR----------------------------------------
>ERR1711981_546873
CMGKKPG-GWTqesGGGKNAMDEEYMSLMAELGEGPPPGPGP----------------------------------------
>ERR1719412_206090
---RRPK--FVkvdKKRFYFQISDKMSR----NGGMSGANNT----------------------------------------
>ERR1719369_412980
YGQKKTGAWNE-GPKTAMDEEYMSLMAELGEGPAPTPPP----------------------------------------
>ERR1719433_177385
CQQRRPGTGFGpggEHNKTKMDDEYLSLMAELGEGPPPAPKT----------------------------------------
>ERR1711997_632794
------------ERVNFTEKKSIKMSRGRNDGMSGANNI----------------------------------------
>ERR1719361_2891322
---QTMS--FVllaEEP---AILLKIVWARKLESGILEAPKM----------------------------------------
>ERR1712038_671992
-----------tTSNFQLIESTKMSRNGVHGAGMSGANNS----------------------------------------
>ERR1719384_2550878
CKQRRPGANFNeewggqgggGAGGNKIDQEYLSFMAELGDGSPGGGPA----------------------------------------
>ERR1719203_818695
CKQRRPGASFNqygeggAPGSNKIDQEYLSFMAELGDGTPAPPGA----------------------------------------
>ERR1719464_2596802
CMGKKGNWNEG--PKSAMDEEYMSLMAELGEGPPPPNPK----------------------------------------
>ERR1719382_2187720
CKARRTGEWQQqqggsgpSAGGHKMDQEYMSLMAELGEGHLHSRCL----------------------------------------
>ERR1719158_1236452
CKQRRPGSGYDeyggqppAPGSNKIDQEYLSFMAELGDGPPGAGGG----------------------------------------
>ERR1719510_869690
CMGKKTGEW--nsGGSKNAMDEEYMSLRLNWEKAQLHQELH----------------------------------------
>ERR1719510_1306118
CMGKKTGEWn--sGGSKNAMDEEYMSLMAELFRQGFFSSL-----------------------------------------
>ERR1712226_449951
---ATNR--HKfycssnTISIPCSVQ-KPTIMSRHDRGMSGANNV----------------------------------------
>ERR1719391_233335
CKQRRPGHGNFdskPAGSAKIDREYMSLMAELGEGPPPPPPS----------------------------------------
>ERR1719471_386625
---GAAI--FQfeEEDVCIAEMSRRGGGGRRDDAMSGANNA----------------------------------------
>ERR1719193_2013002
CKQRRPGERFSspngvdigggrpgaggPGGGNKIDQEYMSLMAELGEGPPPPPP-----------------------------------------
>ERR1719268_302531
CKQKRPGAGFPpygeaagp--GGTKIDQEYMSLLAELGEGPPPPG------------------------------------------
>ERR1719414_2248046
---KAEK--ALkinsEESVSF---FGLKMSRG-GGGMSGANNI----------------------------------------
>tr|A0A267FAJ4|A0A267FAJ4_9PLAT Uncharacterized protein (Fragment) OS=Macrostomum lignano GN=BOX15_Mlig024025g2 PE=4 SV=1
CRAPRGGGGSDnaggpgsggggpgaQHQQKEMDSEYSALMQELGVSTASSSTA----------------------------------------
>tr|A0A182N0A2|A0A182N0A2_9DIPT Uncharacterized protein OS=Anopheles dirus PE=4 SV=1
CRSKRPGHGGPpaagggggsgGAAATKIDEEYMSLMAELGEAPPAQDN-----------------------------------------
>tr|T1PNX2|T1PNX2_MUSDO KH domain protein (Fragment) OS=Musca domestica PE=2 SV=1
CRSKRPGAGVPgeenENSQAKIDEEYLSLMAELGEGPPPSAA-----------------------------------------
>ERR1719273_2584621
CVGKRPGT---nfgtsgs---NQNMDEEYMSLMAELGEGSGTPVA-----------------------------------------
>tr|S4P9C4|S4P9C4_9NEOP Splicing factor 1 (Fragment) OS=Pararge aegeria PE=4 SV=1
CRAKRPGHTPQrgaq--qDKAKIDEEYMSLMAELGEAPPPGTG-----------------------------------------
>ERR550525_81937
CKARRTGDWQQqggsshs--aGGHKMDQEYMSFMAELGDGTPALPG-----------------------------------------
>ERR1719412_1720654
CKQKRPGSGFPpygeapgggg----NKIDQEYMSLMAELGDGTPAPPG-----------------------------------------
>ERR1719234_3012888
CKARRTGDWQQqggngggPSaGGHKMDQEYMSLMAELGEGHTTADA-----------------------------------------
>ERR1719427_2241673
CKNKRPGAGFNqfgepad----RKIDQEYMSFMAELGEGRAWRLG-----------------------------------------
>tr|A0A0L7LKA5|A0A0L7LKA5_9NEOP Splicing factor 1 OS=Operophtera brumata GN=OBRU01_03003 PE=4 SV=1
DKMGPPGMGGPhgmppnmgpppnmppphghmQPppGNSPYFVTYPHLYAFIVQ------------------------------------------------
>tr|A0A182JAJ5|A0A182JAJ5_9DIPT Uncharacterized protein OS=Anopheles atroparvus PE=4 SV=1
CRSKRPGHGGPpgsgSNVATKIDEEYMSLMAELGEAPPQHHD-----------------------------------------
>tr|A0A182FEB6|A0A182FEB6_ANOAL Uncharacterized protein OS=Anopheles albimanus PE=4 SV=1
CRMKRPGHGGSqaaaDPQATKIDEEYMSLMAELGEAPPQDTA-----------------------------------------
>ERR1719367_1736832
CQQRRPGT---gfgpggehNKtkM----DDEYLSLMAELGEGPPPSAK-----------------------------------------
>ERR1719367_2707688
CKQRRPGG---fdqfggqpPApgSNKIDQDQQRGLYLLRGR------------------------------------------------
>ERR1719495_2369038
CKQKRPGT---gfppygeAPgGGNKIDQEYMSLMAELGEGPPPSAS-----------------------------------------
>ERR1719188_598846
CQQRRPGTGFGpggeh--NKTKMDDEYLSLMAELGEHNKTKMD-----------------------------------------
>ERR1719210_582194
CKQRRPGA---sfggqygeGGgaGNNNIDEEYLSFMAELGD--GN--------------------------------------------
>ERR1719336_1523813
CKQRRCGA---sfsqygeQGgaGSNKIVQEYLSFMAELGDGAPAPPG-----------------------------------------
>ERR1719464_1483259
CKQRRPGG---gfdqygePPapGSNKIDQEYLSFMAELGGGDGAPPP-----------------------------------------
>ERR1719438_527121
TMVRPGG-GF-gqygePPapGSNKIDQEYLSFMAELGDGPPGAGG-----------------------------------------
>ERR1719420_249035
CKQRRPGANF-nefgqPPapGSAKIDQEYMSFMAELDGGAP---------------------------------------------
>ERR1719471_1910873
STSSRCW-GF-dqfggqPPapGSNKIDQEYLSFMAELGDGPPGAGG-----------------------------------------
>tr|A0A1D2NBJ4|A0A1D2NBJ4_ORCCI Splicing factor 1 OS=Orchesella cincta GN=Ocin01_04239 PE=4 SV=1
CKAKRPGEGGPpgsggggGGNKAKIDEEYLSLMAELGEAEPPKHE-----------------------------------------
>UPI0003C437F4 status=active
CKEPRRSLLNPeknfepgsssipdndlEGPSESIDEDYKRLMAELGEGX----------------------------------------------
>ERR1719188_273115
CHQRRPGTGfggGGDQPKTKMDDEYLSLMAEWGEGPPPSAKP----------------------------------------
>ERR1719457_141148
-----------GMDRAKMDSEYDSLMRELGEGSAPAANSN---NNPPQQPSGPRGPRPGMFG--PPRPGFGGPRPPWMG
>ERR1719282_1072311
------------GAPGQLDKEYESLMAELSGKSpPPSSG-----------------------------------------
>ERR1719154_131519
-----------PVDRHKMDHEYMSLMAELGEGPAPSIP-----------------------------------------
>SRR5688572_10376428
-PHPAAGDGlaaGGL-----------APRAVLGRGPVAPPPA----------------------------------------
>tr|A0A085M3C0|A0A085M3C0_9BILA Uncharacterized protein OS=Trichuris suis GN=M513_07412 PE=4 SV=1
-----------GSAGAVLDEEYSALLAELGHDTGKGIGGL---------------------------------------
>ERR1719210_1272850
CKQRRCGASfggqygeGGGAGNNKIDEEYLSFMAELGDGNPAPPGA----------------------------------------
>ERR1719233_717629
--------------TRSICHSWQSWVRDplllgvhleLlldqvALQEEISSKdQG---------------------------------------
>EndMetStandDraft_8_1072994.scaffolds.fasta_scaffold8339770_1 # 1 # 207 # 1 # ID=8339770_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.647
CKQRRPGAGfgqygeGGGAGSNKIDEEYLSFMAELGDGTPAPPGAG---------------------------------------
>tr|A0A1D1UN55|A0A1D1UN55_RAMVA Uncharacterized protein OS=Ramazzottius varieornatus GN=RvY_03221-1 PE=4 SV=1
CKTDLTQVEANALPTANMDEEYLSLMAELGHGPPAGQKArgGPSGNSATSNGSATTN------------------------
>tr|A0A1W0XBL1|A0A1W0XBL1_HYPDU Putative Branchpoint-bridging protein OS=Hypsibius dujardini GN=BV898_01121 PE=4 SV=1
CKTDMTQQSEA---vTVNMDDEYLSLMAELGQGGPAPPAKpkPTAVPTSNPQFAMPRP------------------------
>ERR1719193_320844
CKSKRPGDLR--------DERYGGG--GWGGGGGGGGGGggGFGG----------RG------------------------
>Cruoilmetagenom7_1024161.scaffolds.fasta_scaffold380985_2 # 273 # 434 # 1 # ID=380985_2;partial=01;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.506
CRADLSQVPGSGGPgapvqdKRKMDNEYLSLMEELGESTPKPTGAdiNTS-------------------------------------
>tr|F2TVE1|F2TVE1_SALR5 Splicing factor SF1 OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) GN=PTSG_00054 PE=4 SV=1
CRYKRPNASgppasSDTANQAKMDSEYLSLMAELGEVPAQP-------------------------------------------
>ERR1719326_38676
CKYKPKEgAPgpgapGDAAEQAKFDSDYMSLMAELGEVPASQ-------------------------------------------
>ERR1712226_679787
CKVKVDEEaapTQTFAEKAKMDTEYMSLMAELGVDAPPPPP-----------------------------------------
>tr|T2MCF4|T2MCF4_HYDVU Splicing factor 1 OS=Hydra vulgaris GN=SF1 PE=2 SV=1
CKIKHDVNsgpMQTFAEKAKMDTEYMSLMAELGVEGPPAKK-----------------------------------------
>ERR550534_2900640
CKVNLESEaaapVQTFSEKAKMDTEYMSLMAELGVDAPPQPP-----------------------------------------
>LAHU01.1.fsa_nt_gb|LAHU01250090.1|_1 # 1 # 549 # -1 # ID=250090_1;partial=10;start_type=ATG;rbs_motif=AAA;rbs_spacer=5bp;gc_cont=0.424
CQVKVEGNtgpIQTFSEKAKMDNEYMSLMAELGVDAPPPAK-----------------------------------------
>tr|A0A0L0FI47|A0A0L0FI47_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 GN=SARC_11070 PE=4 SV=1
CAQREIGGVPQKQSGENIDNEYLSFMAELEGGGGKAPAP----------------------------------------
>tr|A0A210QA80|A0A210QA80_MIZYE Splicing factor 1 OS=Mizuhopecten yessoensis GN=KP79_PYT05946 PE=4 SV=1
CKAKRPGDTLRITSPTpainqvdkaKMDSEYMSLMAELGEGPPPSQQK----------------------------------------
>ERR1719383_202914
CKSKRPGDMRDERFGGgggg----GWGGGRE--GGGGYR-GGR----------------------------------------
>tr|A0A0L8FNG9|A0A0L8FNG9_OCTBM Uncharacterized protein (Fragment) OS=Octopus bimaculoides GN=OCBIM_22013189mg PE=4 SV=1
CKQKRPGEPIRIQQGSqadraKMDSEYMSLMAELGEGPPPPQKT----------------------------------------
>tr|A0A0D2U4H7|A0A0D2U4H7_CAPO3 Splicing factor 1 isoform 1 OS=Capsaspora owczarzaki (strain ATCC 30864) GN=CAOG_001432 PE=4 SV=1
CMNRDGGNANLEPVSSsggsggsvqplrplpltpgqssssssyssqsnaqdrsKMDDEVNALMNSLANGDSGADGG----------------------------------------
>tr|A0A2B4SHZ5|A0A2B4SHZ5_STYPI Splicing factor 1 OS=Stylophora pistillata GN=SF1 PE=4 SV=1
CIQTDLPPIP------vvqvdkaKMDSEYMSLMAELGEGPLPEPKV----------------------------------------
>tr|A0A015KN53|A0A015KN53_9GLOM Msl5p OS=Rhizophagus irregularis DAOM 197198w GN=RirG_173550 PE=4 SV=1
CMERNNPEALQ-----qakqrdqKLDSEYLSLMAELGENVESSRSN----------------------------------------
>tr|A0A1Y1X843|A0A1Y1X843_9FUNG Uncharacterized protein (Fragment) OS=Basidiobolus meristosporus CBS 931.73 GN=K493DRAFT_241681 PE=4 SV=1
CIQRNNPEALE-----qarqrdqQLDHEYLSLMAELGEDVPEGSAA----------------------------------------
>SRR5579862_5429258
CMERNNPEALQ-----qakqrdqKLDSEYLSLMAELGEDVPPGGTA----------------------------------------
>SRR3954468_2956209
CLEKNNPEFME-----raaqrdsQLDSEYLELMQELGHNVDGPPGS----------------------------------------
>SRR6185312_10339750
CMERNNPEALQ-----qakqrdqKLDSEYLNLMAELGESVDGGRTD----------------------------------------
>tr|A0A137PBB9|A0A137PBB9_CONC2 Uncharacterized protein OS=Conidiobolus coronatus (strain ATCC 28846 / CBS 209.66 / NRRL 28638) GN=CONCODRAFT_56557 PE=4 SV=1
CMQRNNPEALE-----qakqrdiQLNSEYMNLMAALGEKVSTTTPT----------------------------------------
>tr|A0A1Y1V1L4|A0A1Y1V1L4_9FUNG Uncharacterized protein OS=Piromyces finnis GN=BCR36DRAFT_585796 PE=4 SV=1
CMQRNNQDLIA-----aaqqreqQFNNEYMSLMVELGETNANMSSS----------------------------------------
>tr|A0A1Y2CMW2|A0A1Y2CMW2_9FUNG Uncharacterized protein OS=Neocallimastix californiae GN=LY90DRAFT_384164 PE=4 SV=1
CMQRNNQDLIA-----aaqqreqQFNNEYMNLMVELGETDANSAST----------------------------------------
>tr|A0A1Y2F101|A0A1Y2F101_9FUNG Uncharacterized protein OS=Neocallimastix californiae GN=LY90DRAFT_522471 PE=4 SV=1
CMQRNNQDMIA-----aaqqreqQFNNEYMNLMVELGETNTNANSS----------------------------------------
>ERR1740124_1252185
CMVKVEHNSMP-----tqtfaekaKMDTEYMSLMAELGVDAPPAPAK----------------------------------------
>ERR1719427_579054
CRVVGDGSGAP-----qqtftekaKMDTEYMSLMAELGVDGPPPMPG----------------------------------------
>ERR1719354_1464437
CVVKVDANSGP-----aqtfaekaKMDTEYLSLMAELGVEAPPPPAQ----------------------------------------
>ERR1719410_499881
CQVKGNSSSGP-----mqtfsekaKMDNEYMSLMAELGVDAPPPAKK----------------------------------------
>ERR1719427_1418457
---------ASGVDRAKMDSEYMSLMAEIGEHPAPATGN----------------------------------------
>tr|A0A2G8L6Z3|A0A2G8L6Z3_STIJA Putative splicing factor 1 OS=Stichopus japonicus GN=BSL78_07141 PE=4 SV=1
----------GNLERAKMDSEYMSFMAELGEGPPQANRPN---------------------------------------
>ERR550519_2650572
CKQRRPGGGfgqygePPAPGSNKIDEEYLSFMAELGDGAPPPPGAG---------------------------------------
>ERR550532_173355
CKQRRPGGGfgqygePPAPGPWVTSLGTAS-REDLGAGLVSTENLQ---------------------------------------
>tr|A0A183SVJ5|A0A183SVJ5_SCHSO Uncharacterized protein OS=Schistocephalus solidus PE=4 SV=1
CKLRDPSVTL--ESLAKMDSEYSALMAELGVGLGGGSN-----------------------------------------
>tr|A0A068Y7E6|A0A068Y7E6_ECHMU Zinc finger protein OS=Echinococcus multilocularis PE=4 SV=1
CKLRDPSVTM--EHMginpmerAKMDSEYTALMAELGVGYGG-AA-----------------------------------------
>KBSSwiStaDraftv2_1062776.scaffolds.fasta_scaffold9287483_1 # 1 # 270 # 1 # ID=9287483_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.644
CKAKRPGDTY--RAMqnktaedrAKMDSEYMSLMAELGEGPPPPKV-----------------------------------------
>tr|A0A1V9XGP6|A0A1V9XGP6_9ACAR Splicing factor 1-like OS=Tropilaelaps mercedesae GN=BIW11_10209 PE=4 SV=1
CRNKGASGGLGwggqgggmggaglgggGPGNAKIDEEYMSLMAELGEGPPPEMQQS---------------------------------------
>ERR1719445_904778
CKQRRPGTGDFsngaPGEGSKIDAEYMSLMAELGEGPPPPSGGG---------------------------------------
>ERR1719341_10612
CRQRRTPGAPPCVDRQKIDEEYMSLMAELGEGPPPPQNNN---------------------------------------
>ERR1712226_1089363
CVAKRPGTNFGttgPGSNQNMDEEYMSLMAELGEGSVTAPKSD---------------------------------------
>ERR1719203_347752
CMGKKTGSWDQ-GPKTAMDEEYLSLMAELGEGPGPAAPPS---------------------------------------
>ERR1719423_465478
CRQKLPGQNQHssmpmnnapssggggggsHVDRQKIDQEYMSLMAELGEGPPPPTNSN---------------------------------------
>tr|A0A131YRI0|A0A131YRI0_RHIAP Splicing factor 1 OS=Rhipicephalus appendiculatus PE=4 SV=1
CRERGKGGSG-sfggrggfgggggdagpGGSQSKIDEEYMSLMAELGEGPPPPSKSG---------------------------------------
>tr|A0A293N6A9|A0A293N6A9_ORNER Uncharacterized protein (Fragment) OS=Ornithodoros erraticus PE=4 SV=1
CRERRSGGGAGaggfggaggggmgGGEHSKIDEEYMSLMAELGEGPPPPSKGG---------------------------------------
>tr|A0A1W4X6U8|A0A1W4X6U8_AGRPL splicing factor 1 OS=Agrilus planipennis GN=LOC108738982 PE=4 SV=1
CRQKRPGQGGPpvVGEKAKIDEEYMSLMAELGEGPPPPGTTN---------------------------------------
>tr|A0A0T6AZG8|A0A0T6AZG8_9SCAR K Homology domain containing protein OS=Oryctes borbonicus GN=AMK59_7008 PE=4 SV=1
CRQKRPGLGGPaaaaAGDKAKIDEEYMSLMAELGEGPPPPPASE---------------------------------------
>tr|D6X4F0|D6X4F0_TRICA Splicing factor 1-like Protein OS=Tribolium castaneum GN=TcasGA2_TC011081 PE=4 SV=1
CRQKRPGAGGPpvpGGEKNKIDEEYMSLMAELGEAPPPEAVAT---------------------------------------
>ERR1711899_356484
CVAKRPGTNFGnagQGSNQNMDEEYMSLMAELGEGPVPTKSD----------------------------------------
>ERR1719461_537362
CKRRRPGGGFGqygeppAPGSNKIDQEYLSFMAELGDGAPPPPGAG---------------------------------------
>ERR1719433_1260648
CTGKRPGQGFGeggfggDGPNKNIDEEYMSLMARGRDKDLEREDS----------------------------------------
>ERR1719433_1778886
CMGKKTGAWN-EGPKTAMDEEYMSLMAELGEGPGPGGPPG---------------------------------------
>ERR1719336_881526
VWAKKTGAWNEgpk----TAMDEEYMSLMAELGEGPGPGGPPG---------------------------------------
>ERR550532_2780623
CRQKVPGETLRhmqnqsAADRAKMDSEYMSLMAELGEGPPPAVEHK---------------------------------------
>ERR1719464_2169353
ME-SQTGAKIEilgkgrk--DGQGEDEPLHAYVTSRN--P----------------------------------------------
>ERR1719510_1612091
CMGKKTGGWSE-GPKTAMDEEYMSLMAELGEGPVPTPPPA---------------------------------------
>ERR1712223_845265
R--KRKELE-------ESRHGSIQRMLTINPEYKPPPDYK---------------------------------------
>ERR550534_991636
CRTPRNYGND-SSSGNKIDEEYMSLMAELGEGPSPKPESS---------------------------------------
>tr|E9GIM3|E9GIM3_DAPPU Uncharacterized protein OS=Daphnia pulex GN=DAPPUDRAFT_303941 PE=4 SV=1
CRTPRNSANA-dGAPGNKIDEEVNFAQL----------------------------------------------------
>ERR1712071_343727
CRTPRNYGGD-SMAGNKIDEEYMSLMAELGEGPAPKAETS---------------------------------------
>ERR1719461_74146
CKQRRPGAGFPpygeaaGPGGTKIDQEYMSLMAELGDGPPGAGGGA---------------------------------------
>tr|A0A1L8DHN1|A0A1L8DHN1_9DIPT Putative splicing factor 1/branch point binding protein rrm superfamily (Fragment) OS=Nyssomyia neivai PE=4 SV=1
CRSKRPGAGGPpsSNAQAKIDEEYMSLMAELGEGPPVEAGKR---------------------------------------
>ERR1719370_674928
ETKVSAQSGFPpygeapGGGGNKIDQEYMSLMAELGEGPPPPGGA----------------------------------------
>tr|W2TPZ7|W2TPZ7_NECAM Peptidyl-prolyl cis-trans isomerase, cyclophilin-type OS=Necator americanus GN=NECAME_07499 PE=4 SV=1
CKNPRPGGAEGaVAADGGMDDEYSALMEELGERPARAPDGSVRGRGA---------------------------------
>tr|A0A0M3J3T5|A0A0M3J3T5_ANISI Uncharacterized protein OS=Anisakis simplex PE=4 SV=1
LNAGflpfgeivgisipmdyetgKHRGFGf---VEFELAEDAAAAIDNMNDSEM---------------------------------------------
>ERR1719217_1204742
LEAAfrpfgdlksaeipvdyqsgKHKGFGf---VEFLDAEDAEAAIDNMHNAEL---------------------------------------------
>tr|A0A0M3JN72|A0A0M3JN72_ANISI Uncharacterized protein OS=Anisakis simplex PE=4 SV=1
LNAGflpfgeivgisipmdyetgKHRGFGc---VHSVVISFISPP---YVHRIL---------------------------------------------
>tr|A0A0D8XSX0|A0A0D8XSX0_DICVI Uncharacterized protein OS=Dictyocaulus viviparus GN=DICVIV_06294 PE=4 SV=1
CKNPRPGGTDGtSTADGGMDDEYSALMEELGERPARQADGSLRGRGG---------------------------------
>ERR1712029_21491
VKAAlipfgeisevqipldyqtdKHRGFAf---VEFELAEDAAAAIDNMNESEL---------------------------------------------
>ERR1719259_317264
LKNAfsmfgdlvdvqlpldyetgKHRGFSf---IEFELEEDALDAIDNLNESEL---------------------------------------------
>ERR1719369_1138043
LNQAfsmfgdvvdvqlpldyesgKHRGFAf---IEYELEEDAMDAIDNLNESEL---------------------------------------------
>tr|W4XHC1|W4XHC1_STRPU Uncharacterized protein OS=Strongylocentrotus purpuratus PE=4 SV=1
LHAAfipfgdimdiqipldyeteKHRGFAf---VEFEFAEDCAASIDNMNDSEL---------------------------------------------
>ERR1712131_275582
VKAAlipfgeitevqipldyqtsKHRGFAf---IEFELAEDAAAAIDNMNESEL---------------------------------------------
>tr|A0A183UQ34|A0A183UQ34_TOXCA Uncharacterized protein OS=Toxocara canis PE=4 SV=1
CKNPRPGSGSfGfGGLDGGMDDEYSALMEELGEKPPQKPFGGGGYQGN---------------------------------
>tr|A0A077WA57|A0A077WA57_9FUNG Uncharacterized protein OS=Lichtheimia ramosa GN=LRAMOSA00881 PE=4 SV=1
VHAAfipfgdivsvqlandpgshnPHKGYGf---VEFEEEEDCDAAIDNMNLAEL---------------------------------------------
>ERR1712176_1653281
----mpldqasqKHRGFGf---VEFELAGDAKAAIENMNNSEL---------------------------------------------
>tr|A0A238BVR2|A0A238BVR2_9BILA Peptidyl-prolyl cis-trans isomerase, cyclophilin-type OS=Onchocerca flexuosa GN=X798_03554 PE=4 SV=1
CKNPRPGSG-LfNVGDGGMDDEYTALMAELGEKPASRPYNAAGKPGL---------------------------------
>SRR5262249_3482360
CKNPRPGYQL--DGGAGMDDEYSALMAELGESKPNMGAAGG--------------------------------------
>tr|A0A0N4UUF2|A0A0N4UUF2_ENTVE Uncharacterized protein OS=Enterobius vermicularis PE=4 SV=1
CKNPRPGGSF--TADGGMDDEYSALMAELGERPSTASLGDKASPAN---------------------------------
>tr|F1KTK4|F1KTK4_ASCSU Splicing factor 1 OS=Ascaris suum PE=2 SV=1
CKNPRPGSGAfSlNNLDAGMDDEYSALMEELGEKPPPKPFYG-GGPGN---------------------------------
>tr|A0A0N4U4T2|A0A0N4U4T2_DRAME Uncharacterized protein OS=Dracunculus medinensis PE=4 SV=1
CKNPRPGGASfNLGADGGMDDEYSALMAELGEKPSGAVAKVPGTHA----------------------------------
>tr|A0A1I7RM36|A0A1I7RM36_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus PE=4 SV=1
CKNPNPGYQY--GEIAGMDDEYSALMAELGEKPAQPFGGLTQ-------------------------------------
>tr|A0A1I7ZDE2|A0A1I7ZDE2_9BILA Uncharacterized protein OS=Steinernema glaseri PE=4 SV=1
CKTPRT--AIiGqNGEVVGMDDEYTALMAELGEQPKDAA------------------------------------------
>tr|A0A0A9WP80|A0A0A9WP80_LYGHE Splicing factor 1 OS=Lygus hesperus GN=SF1_2 PE=4 SV=1
CRSGKATDITtTtVKDRAKIDEEYLSLMAELGEGPPPDKIAKRSSLSS---------------------------------
>tr|A0A170UYZ4|A0A170UYZ4_TRIIF Splicing factor 1 (Fragment) OS=Triatoma infestans PE=4 SV=1
CRTKRPGAATsPtanSKDKAKIDEEYMSLMAELGEGPPPDKNSAAKKPPS---------------------------------
>ERR1719510_1430973
CMGKKAGGWSe-G-PKTVMDEEYMSLMAELGEGPAPSAPPSGKEDKQ---------------------------------
>ERR1711934_721988
LKEVflpfgeindiqmpkdyeteKHRGFAf---IEFESAEDAAHAIDNMNDSEL---------------------------------------------
>ERR1712001_347213
CMGKKPGGWNs-GGSKNAMDEEYMSLMAELGEGPAPPGTSSSSSSSN---------------------------------
>tr|A0A0N5DM57|A0A0N5DM57_TRIMR Uncharacterized protein OS=Trichuris muris PE=4 SV=1
LHAAfipfgdivdvsmpldfetnKHRGFGf---IEYEMDADAASAVDNMNKGEL---------------------------------------------
>ERR1719422_3070455
LHSAfipfgeindiqmpldyeteKHRGFGf---IEYESAEDAAHAIDNMNDSEL---------------------------------------------
>ERR1719505_362685
IQASlipfgdlvdinlpldyetqKHRGFAf---VEFESAEDAAAAIDNMNEAEL---------------------------------------------
# STOCKHOLM 1.0
#=GF ID query
#=GF AU hmmsearch (HMMER 3.3.2)
#=GS 2dnx_A/41-41 DE [subseq from] mol:protein length:130 Syntaxin-12
#=GS 2dnx_A/60-107 DE [subseq from] mol:protein length:130 Syntaxin-12
#=GS 2gut_A/47-61 DE [subseq from] mol:protein length:77 ARC/MEDIATOR, Positive cofactor 2 glutamine/Q-rich-associated protein
#=GS 7jil_E/162-177 DE [subseq from] mol:protein length:183 50S ribosomal protein L5
2dnx_A/41-41 -----------------------------G-------------------------------------------------
#=GR 2dnx_A/41-41 PP .............................2.................................................
2dnx_A/60-107 --------------TNQLAKETNELLKELGSLPLPLSTSEQRQQRLQKERLMNDFSAALNNF-----------------
#=GR 2dnx_A/60-107 PP ..............56778899999*************999888777666665555554443.................
2gut_A/47-61 --------------KAKTRDEYLSLVARL--------------------------------------------------
#=GR 2gut_A/47-61 PP ..............9***********976..................................................
7jil_E/162-177 --------------TAKTDKEAKSLLAELG-------------------------------------------------
#=GR 7jil_E/162-177 PP ..............599************9.................................................
#=GC PP_cons ..............689999*******997********999888777666665555554443.................
#=GC RF xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
//
# STOCKHOLM 1.0
#=GF ID chain_sp_Q15637_SF01_HUMAN_Splicing_factor_1_OS_Homo_sapiens_OX_9606_GN_SF1_PE_1_SV_4_292_370-i1
#=GF AU jackhmmer (HMMER 3.3.2)
#=GS MGYP000669028722/41-104 DE [subseq from] PL=00 UP=0 BIOMES=0000000011000
#=GS MGYP000831674948/138-183 DE [subseq from] PL=11 UP=0 BIOMES=0000000011000
chain_sp_Q15637_SF01_HUMAN_Splicing_factor_1_OS_Homo_sapiens_OX_9606_GN_SF1_PE_1_SV_4_292_370 CKFQRPGDPQSAQDKARMDKEYLSLMAELGEAPVPASVGSTSGPATTPLASAPRPAAPANNPPPPSLMSTTQSRPPWMN
MGYP000669028722/41-104 ----RPGDPQSAQDKARMDKEYLSLMAELGEAPVPASVGSTSGPATTPLASAPRPAAPANNPPPPVSL-----------
#=GR MGYP000669028722/41-104 PP ....9***********************************************************9654...........
MGYP000831674948/138-183 CKFQRPGDPQSAQDKARMDKEYLSLMAELGEAPVPASVGSTSGPAT---------------------------------
#=GR MGYP000831674948/138-183 PP 9*******************************************98.................................
#=GC PP_cons 9***9***************************************99******************9654...........
#=GC RF xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
//
# STOCKHOLM 1.0
#=GF ID chain_sp_Q15637_SF01_HUMAN_Splicing_factor_1_OS_Homo_sapiens_OX_9606_GN_SF1_PE_1_SV_4_292_370-i1
#=GF AU jackhmmer (HMMER 3.3.2)
#=GS sp|Q15637|SF01_HUMAN/292-370 DE [subseq from] Splicing factor 1 OS=Homo sapiens OX=9606 GN=SF1 PE=1 SV=4
#=GS sp|Q64213|SF01_MOUSE/292-370 DE [subseq from] Splicing factor 1 OS=Mus musculus OX=10090 GN=Sf1 PE=1 SV=6
chain_sp_Q15637_SF01_HUMAN_Splicing_factor_1_OS_Homo_sapiens_OX_9606_GN_SF1_PE_1_SV_4_292_370 CKFQRPGDPQSAQDKARMDKEYLSLMAELGEAPVPASVGSTSGPATTPLASAPRPAAPANNPPPPSLMSTTQSRPPWMN
sp|Q15637|SF01_HUMAN/292-370 CKFQRPGDPQSAQDKARMDKEYLSLMAELGEAPVPASVGSTSGPATTPLASAPRPAAPANNPPPPSLMSTTQSRPPWMN
#=GR sp|Q15637|SF01_HUMAN/292-370 PP 9*****************************************************************************8
sp|Q64213|SF01_MOUSE/292-370 CKFQRPGDPQSAQDKARMDKEYLSLMAELGEAPVPASVGSTSGPATTPLASAPRPAAPASNPPPPSLMSTTQSRPPWMN
#=GR sp|Q64213|SF01_MOUSE/292-370 PP 9*****************************************************************************8
#=GC PP_cons 9*****************************************************************************8
#=GC RF xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
//
>chain_sp_Q8IWZ8_SUGP1_HUMAN_SURP_and_G-patch_domain-containing_protein_1_OS_Homo_sapiens_OX_9606_GN_SUGP1_PE_1-SV_2_188_242
TRKVIEKLARFVAEGGPELEKVAMEDYKDNPAFAFLHDKNSREFLYYRKKVAEIR
>tr|S4RYH3|S4RYH3_PETMA SURP and G-patch domain containing 1 OS=Petromyzon marinus GN=SUGP1 PE=4 SV=1
TRDVAEKLARFVAEGGPEMEQIAAEGNRDNPAFWLVMMYPLKREVFQMCRFFRCR
>tr|W4Z8C4|W4Z8C4_STRPU Uncharacterized protein OS=Strongylocentrotus purpuratus PE=4 SV=1
--------MAFSFKHGADKGPIKPSF-QD--KMKSLSDQ-ERMLLQKRQeierKMAQKK
>tr|H9GEC4|H9GEC4_ANOCA Uncharacterized protein OS=Anolis carolinensis PE=4 SV=2
RRDRGEKLARGVADGVPVVEAVALKNHRENQAFRFIYEPNSSGHKYHRQKLDEFR
>tr|I3JVA4|I3JVA4_ORENI SURP and G-patch domain containing 1 OS=Oreochromis niloticus GN=SUGP1 PE=4 SV=1
TQQVAEKLAKFVAEGGPEVEAIAAERNRNNPAFSFLYDEQSPAHRFYKEKVKEYR
>tr|W5K5S5|W5K5S5_ASTMX SURP and G-patch domain containing 1 OS=Astyanax mexicanus GN=SUGP1 PE=4 SV=1
TKKVADKLARFVADGGPEVEAIATKHNVDNPAFSSElllhlFPQKM--NRFYMNFIGDRT
>tr|A0A1L8HXG4|A0A1L8HXG4_XENLA Uncharacterized protein OS=Xenopus laevis GN=XELAEV_18006617mg PE=1 SV=1
AKMFAEKLARFVADGGPEVEAIALQNNRENPAFRFLYDQNSKGFKWYKLKLEEFR
>tr|A0A096NC45|A0A096NC45_PAPAN Uncharacterized protein OS=Papio anubis PE=4 SV=1
VKNLAGKLARFVVDWHPEVETTALQNNRENQAFNFLRPTAKDTS----TTDISGR
>ERR1719494_1475985
---------------------EARMKYQNDSRYRFLYESNSDVTRYYQKCVKDLK
>tr|V9KKN0|V9KKN0_CALMI SURP and G-patch domain-containing protein 1-like protein (Fragment) OS=Callorhinchus milii PE=2 SV=1
TRSVIEKLAKYVADGGPEMEEMAIQNNRDNPAFWFLYNQNSEAYRYYQDTV-DVF
>tr|S4RYH3|S4RYH3_PETMA SURP and G-patch domain containing 1 OS=Petromyzon marinus GN=SUGP1 PE=4 SV=1
IKLVYYTYVWVVCKCSTSIKCLWIVNSSVNCPYRFLYDQNSGAYKYFKHKV-KEY
>tr|A0A2D4LGK0|A0A2D4LGK0_9SAUR Uncharacterized protein OS=Micrurus spixii PE=4 SV=1
VMVTAQKLAEFVAEVGPEIEQFSIDNSADNPDLSFLQDPESSAFKFYRMKVHEL-
>tr|A0A1S3WNY1|A0A1S3WNY1_ERIEU SURP and G-patch domain-containing protein 2 OS=Erinaceus europaeus GN=SUGP2 PE=4 SV=1
CR------------VGGH--QATGRTRARTPL-------------LYRCRLDD--
>tr|Q4V7M9|Q4V7M9_XENLA MGC115540 protein OS=Xenopus laevis GN=sugp2 PE=1 SV=1
TKDTAIKLSQFVAQMGPELEEFSMENSINNPEFWFLREKNSPAYKFYQSKVEEF-
>HubBroStandDraft_4_1064222.scaffolds.fasta_scaffold5805164_1 # 3 # 230 # -1 # ID=5805164_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.605
TKRLIDYLAPLVLKDGARFERLLIDREGQNPNFSFLFNLGSSANIYYRWRVY---
>GraSoiStandDraft_5_1057265.scaffolds.fasta_scaffold4147734_1 # 2 # 244 # 1 # ID=4147734_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.650
LRYVIDTLAAFVVKDGAKFEQLVRQKETGNPAFTFLFENGSAANVYYRWKLF---
>tr|A0A2G9RG68|A0A2G9RG68_LITCT Uncharacterized protein OS=Lithobates catesbeiana GN=AB205_0058510 PE=4 SV=1
--DTAIKLAQFVVQMGPEIEEFSMQNSVNNPEFWFLREKDSPAYKYYKSKLEEF-
>tr|A0A0R3WAE0|A0A0R3WAE0_TAEAS Uncharacterized protein OS=Taenia asiatica PE=4 SV=1
QRSVIDKLAQFVARNGPDFERMTMEKQQGNPQFAFLFGGEN--SDYYKHKVEELK
>ERR1719424_449139
-------LAARAAAHGHRTRGGVCAAQP-----------------RDHRPARP--
>13_taG_2_1085334.scaffolds.fasta_scaffold135032_1 # 2 # 736 # -1 # ID=135032_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.325
ITALIDTTARYVSEDGYLFEKAVMDRERDNTDFVWLFDNKAPEHRYYRWRVFS--
>ERR1719234_870593
IRKIMDKLSKHVMQHGHPFEQAIMEREGRTGDYAFLFQHDSEDNKYYRWRVFSL-
>ERR1740130_1359979
LRQRIDLLARYIAKDGHSFEKIIMERERMNPEFAFLFDVDSDDHRYYRWRTWSL-
>ERR1719203_2031180
IQSLIDRLALFVAEEGHPFEQVIMEREASNPKYRFIFDHGSAEHLYYRWKVVSL-
>ERR1712070_935611
TRKTIDRVATFVRRHGFEFEALLMEKEHRNPAYGFLFQTESPEHMYYRWRVWSM-
>ERR1711871_379914
RRSVIDCAATYIKRFGYDFEATLQEKEQGNAEYSWLFDKQSEEHTYYR-------
>ERR1719199_534224
RRYIIDTTAVYVARDGSEFEDAVRERASNNPEFAFL-NCDGPEHVYYTWRVWSL-
>ERR1712000_565720
IDDIIIQLADFVAKEGYHFEDLVREKESRNSDFDFLFNKDSPEHLYYKWRVYSF-
>SRR5690606_13002377
-QQVIDKLADFVARNGLKFEVLTLEKQKDNPKFAFLKPE-HEHHGYYRYRIWC--
>SRR5690242_18793874
-KEVIDKFAATVVKNGPSFEEVVREKQKSNIMFDFLNEN-GQYSEYYRWKIYD--
>ERR1711937_308372
--YLIDLLAIFVVEDGPLFEKFIIGREKENKDFDFLMNRNLSEHVYYIWRLS---
>tr|M2XF33|M2XF33_GALSU U2-associated protein SR14 isoform 1 OS=Galdieria sulphuraria GN=Gasu_39870 PE=4 SV=1
-RREIDLLALHVSKEGYAFESLVIEREKKlssqgRSRFRFLFDVNeslSEESIYYRWKVY---
>tr|A0A059LNC6|A0A059LNC6_9CHLO Uncharacterized protein (Fragment) OS=Helicosporidium sp. ATCC 50920 GN=H632_c490p0 PE=4 SV=1
-CFVIDAVVAFVLQDGCVFEQLIMEREAGNPEFAFLFNTASPEHLFYRWRLF---
>ERR1719161_377680
-RSVVDILAKYVVENGQDIEIRIADREIKNPEFEFIRNRESPVYVYYCWRLY---
>ERR1719383_685072
-RYVIDTLAEMVVQDGWELEFLLMEQEKENPEYGWLTDTDSTAHAYYRWRLY---
>tr|A7APC5|A7APC5_BABBO Surp module family protein OS=Babesia bovis GN=BBOV_III008490 PE=4 SV=1
-RAVIDLTARYVAEIGADYEYLLISNEKRDGLFSFLHDRCSPEHVYYRWKVY---
>tr|L0B397|L0B397_THEEQ Uncharacterized protein OS=Theileria equi strain WA GN=BEWA_010080 PE=4 SV=1
-MQIIDMMATYVAEYGQNFEQMIMSRESPNGLFAFLFERFSSDHIYYRWRVY---
>APGre2960657468_1045069.scaffolds.fasta_scaffold784263_1 # 2 # 292 # 1 # ID=784263_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.505
-QDVIDKMADFVSRNGPSFENVIREKQRNNPKFQFLFDGG-ENYDYYKWKLSTLR
>ERR1719233_44580
VRKVIHKMAQRVLEGGMAFENFVISKERDVKIFSFLFHVDTPEHFYYRWKTI---
>SRR6476620_5363878
RLQLISTVAAFVAKDGAQLEKHLLEQFHLNQapDLDFLTYqpqahqtGIIEDHVFYKWRGY---
>ERR1711976_1132428
KNKLLNTLALFTSRYGIQFEEVVKSSERHNSFFQFVYQpfgPQNPQLTFYKWRVY---
>ERR1712166_1407372
TRRLMDQVSEFVSKFGYDFESLLMEREHSNVAYNFLYHTETAEHTDYRWRVY---
>ERR1711865_173637
--KRINSAAKLTAKHGYEFQAKLMEKEYDNDDFQFLFENGTDQYNYYCWRCW---
>tr|A0A2G5BFU3|A0A2G5BFU3_COERN Uncharacterized protein OS=Coemansia reversa (strain ATCC 12441 / NRRL 1564) GN=COEREDRAFT_96449 PE=4 SV=1
LVQLIHWTVEQVIKHGPEFECLLIARKTEDPRFRFLSDYLSPEHVYYRWRMY---
>tr|J3JTM1|J3JTM1_DENPD Uncharacterized protein OS=Dendroctonus ponderosae PE=2 SV=1
---ALGGVAAM----GACIFtnPLEVLKTRLQLQGELKAKG--QHAVHYKNVFHA--
>SRR4051812_8823418
---------------------VEVAKTRLQLQGELNRGN-----RPFRNPIQT--
>tr|A0A212FEX2|A0A212FEX2_DANPL Putative mitochondrial 2-oxoglutarate/malate carrier protein OS=Danaus plexippus plexippus GN=KGM_209633 PE=3 SV=1
---VIGGLAGA----GATIFtnPMDVVKTRLQLQGELRART--EHTARYRGIFHG--
>ERR1719273_1886236
-----------------------------VsFSPPSVCPAPSALYRNYRARVIEIK
>ERR1719203_542458
-----------------------------KTLEETDSRKNSSLYARYRERVTELK
>ERR1719320_1983718
--CISCKVVHSSQTETLVLEDIARDRNKNTPELKFLFEKSGPLYKRYRARVAELK
>ERR1719215_1724886
--ALVRAKVAAMKEGGVAVSAPSNV----ISEEEQRRRkaveEQKM-------------
>ERR1719195_1996654
--PVVEELAQMVAVSGEDLETIARERNKNTPELGFLHDRYSALYRSYRTRVVEIK
>ERR1719295_336204
--LVAEELASMVAVSGDSVEEVAKAHNVDEDQLAFLFDSTSKLYRRYRQKIRSLR
>ERR1712168_223549
----------------------PPQRGPQPpPQqqmkasrPLPGVA---SIFQEEEEVEGYLP
>ERR1711962_426282
--DLAKELASKVAEDGPHAEDEAREKHRGDSRYRFLTDPSSPVALYYQSELRERR
>ERR1719230_693565
--ALFAAKAAAVRGGAV---NVAPV----VTEEEMKRQkaieEQKMMNELYRKVIE---
>ERR1719354_1450368
--DLAKDLARRVAEEGSHVEEEARETHRGDSRYKFLTDRCSPVSSYYQAELRECK
>ERR1719414_555429
--MMNEDFARRITGGEGL---------------TFEQKkqiqEQQQMNAMVEMLNAKKK
>ERR1719470_325172
--SSLDSLALCTASSTCTLWLCCRTvhWAVRLDSDCTICSCR--VFTL-PFSVV---
>ERR1719424_1518737
QRELIDRLAAYVAKAGGAFEQAIIEREQSNPQYRFMLDQSLPEHAYYRWKVVSL-
>ERR1740130_257047
QRQLIDRLAAFVASEGCAFEQAIIEREQSNPQYRFMSDHSLSEHAYYRWKVVSL-
>ERR1740117_2645681
------------AVETP-----------STDSCLTTASPSTPT-IVGRWSLWSP-
>ERR550537_324718
-RIVIDRVAQYVARVGYGFETAIREREKGNPLFDFMTEKDTDLAKYYRW------
>WorMetDrversion2_8_1045237.scaffolds.fasta_scaffold646885_1 # 1 # 60 # -1 # ID=646885_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.383
-MSAADKVAEFVKDDGWALEQLILEREMSNPMFTFLHDTASPDHTYYRW------
>ERR1719295_1019007
QRQAIEHLAQFVKSNGHALEQVVRDRSRDDPKFSWLYDEASNGFKYYQYLL----
>ERR1719322_778975
-KKNTALFSrmlsQFNKTSQPSQNTKNL---QT--SSCFHGDESSSDEsSESETKTSSI-
>tr|D8QWQ0|D8QWQ0_SELML Uncharacterized protein OS=Selaginella moellendorffii GN=SELMODRAFT_438553 PE=4 SV=1
VRKVAEKLASFVAKSGREFEDITRQKNPTDPRFGFLFDVDCSDYKYYEHKLAEE-
>tr|R1FEX1|R1FEX1_EMIHU Uncharacterized protein OS=Emiliania huxleyi GN=EMIHUDRAFT_111405 PE=4 SV=1
---AIDKLADFKVKNGEQFEALIRNKQRDNPTFAFLFDEASPGYAYYHRKVQEY-
>tr|A0A0H5R5W0|A0A0H5R5W0_9EUKA Uncharacterized protein (Fragment) OS=Spongospora subterranea PE=4 SV=1
-RTAIERLGAYTARNGPEFETMIMQEQRFNPDFRFLFEAASPDNLYYRWIVSQ--
>tr|I1HE91|I1HE91_BRADI Uncharacterized protein OS=Brachypodium distachyon GN=BRADI_2g09840 PE=4 SV=1
IRVIIEKTATFVAKNGPEFERRIVALNRGNAMVNFLQSS-DPYHAYYQHRISELA
>tr|R0GNY0|R0GNY0_9BRAS Uncharacterized protein OS=Capsella rubella GN=CARUB_v10011640mg PE=4 SV=1
IKDIIERTALFVSKLGLVFENKVKAEKASKVNFNFLKND-DPYHAFYLHKLSEYS
>tr|V4NYN0|V4NYN0_EUTSA Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10016290mg PE=4 SV=1
IKAIIQTTAKYVSDTGSEFEKKIIAKEAENARFNFLKSS-DLYHAFYKQKLTKYR
>tr|A0A200QNR2|A0A200QNR2_9MAGN SWAP/Surp OS=Macleaya cordata GN=BVC80_1771g38 PE=4 SV=1
IRKIVDRFAKFVAKN-PEYEKIIIAQFVNNHKFNFLKGS-GPYYAYYQHRFRPVR
>ERR1719424_2463391
-------RAAHPEERGAEQQ------------VRIPQAG-RPVQPVLRGDGDRQR
>ERR1719217_1756068
AADAA-ASAK---ISGESTVKP-VK-----KAAAGVMPL-EPPKPQYI-------
>tr|A0A068VC59|A0A068VC59_COFCA Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00007059001 PE=4 SV=1
IRNIVDKTSQFVAKNGPEFEKRIMASNGDLILWKDLII-----------------
>ERR1711881_808177
IRSILEKTAQFVAKNGPEFESRIRKNEESNNKFNFLKDD-NAYNSFYRHRVAAFK
>ERR1711974_280099
IRTVVDKTAQYVAKHGKPFEDKIRTANFGNPKFAFLEEQ-DHYNAYYRQTINLLK
>ERR1719223_94782
IKEMVDRTADFVRKNGKQFEAEIIKRNTGKKKFDFLQPT-NPYHAYYQQRLQYGS
>ERR1719440_2487315
-----------------------DNEKKQNPKFAFLQAS-DPFHAYYVAKVQELG
>SRR5262249_27182004
RKTIIEKTAQFVARNGPMFEQTILEKESNNPVFAFIRPE-DPDHAYYKNRIQELL
>ERR1719331_2019977
IKAIVEKTAQFVARNGPEFEAKILANERQNAKFGFLVAS-NPFHKFYRARVNAIR
>ERR1719285_1124545
RKGTIGFVSFLQRAD-------------AEAALDALNgKFTWGTRLVLRWGN----
>ERR1712166_171130
LKQLIDLAALNTAVYGYEFETALKDKQSKNPDYGFLFDTKSDNHNYFCWRC----
>ERR1712054_257639
LRSAVDLLATYVVEYGPEFEKLIEQREKSNKIFRFITDYESPEHMYYRWRL----
>ERR1719233_103972
LKAIIDRLAVHVARDGMCFEYLFMSRERNNPNFDFLRNVKHSDHLYYRWRI----
>tr|A0A150H084|A0A150H084_GONPE Uncharacterized protein OS=Gonium pectorale GN=GPECTOR_2g1034 PE=4 SV=1
-VGVATKLADFVSKNGRQFEDMTRERNPGESPFKFLHDKASLGYQFYAAKLAELE
>tr|A0A250XIL8|A0A250XIL8_9CHLO Uncharacterized protein OS=Chlamydomonas eustigma GN=CEUSTIGMA_g10185.t1 PE=4 SV=1
-ITACDRLAEFVAKNGKSFEDMTRDRNPGDTPFKFLHDKGCSQYKYYDQKVKEFE
>ERR1719272_1388820
-VAVAPP--------AAVVAPAPVEKDAAEQAFDSLFAKAAVREKRR---FVEAP
>ERR1719399_1069341
VKDRLDRMADFVAHDGWEFEKIIADREKDNPLFAFLh--TNdveDPLKVYYRWRVFA--
>tr|A0A197KDS1|A0A197KDS1_9FUNG Uncharacterized protein OS=Mortierella elongata AG-77 GN=K457DRAFT_133270 PE=4 SV=1
-LVIIHRVVEKVLINGADFEDVLIEKEFRNPSFAFLINLKSPEHIYYRWRLY---
>tr|A0A0K3C8H0|A0A0K3C8H0_RHOTO BY PROTMAP: gi|472586638|gb|EMS24157.1| U2-associated protein SR140 [Rhodosporidium toruloides NP11] gi|647398047|emb|CDR41640.
-ESFLVTVARKVRDNGKSFEEILRDKERENPKFAFLRDDKLPSFHYFRMLVD---
>tr|A0A1B9G754|A0A1B9G754_9TREE U2-associated protein SR140 OS=Kwoniella bestiolae CBS 10118 GN=I302_04548 PE=4 SV=1
-KKFIKTVANRVKDVGRGFEDLLRKKEKENPKFAFLVNEDLPEYHFYQLSVD---
>tr|A0A1Y1UM42|A0A1Y1UM42_9TREE Uncharacterized protein OS=Kockovaella imperatae GN=BD324DRAFT_618400 PE=4 SV=1
-DKFIRAVAAKVRHHGKGFEQVLKTKEKDNEKFNFFFNSDLPDYHLYRSLLS---
>tr|F4RJC3|F4RJC3_MELLP Uncharacterized protein OS=Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) GN=MELLADRAFT_85661 PE=4 SV=1
-EKFITTVAKKVLEHGERFERTLREREKSNPKFNFLIESDAPAHHYFRMLID---
>tr|A0A0J1AWQ5|A0A0J1AWQ5_9TREE Uncharacterized protein OS=Cutaneotrichosporon oleaginosum GN=CC85DRAFT_264948 PE=4 SV=1
-LKFVEEVSKRIRDIGPGLLDTLKDREQDNPKFQFLFDQESAVYHVFQHLLD---
>tr|J6EVF1|J6EVF1_TRIAS Uncharacterized protein OS=Trichosporon asahii var. asahii (strain ATCC 90039 / CBS 2479 / JCM 2466 / KCTC 7840 / NCYC 2677 / U
-QDFLEAVANRVHDHGKGLLNTLRNNERSNPKQ-------SADYHVFNYFLD---
>ERR1719487_2547521
-RKRVEIMADHIIKNGPDFESMVKEKNMNNPQFAFLHGGHG--APYYEEIMWQK-
>ERR1740124_313856
-RARIEKLSAYVAQANVDMEATVRERQRGNPQFAFLFGGEG--ADYYTECLQNS-
>ERR1719409_2560591
-STTLR---RTFTTAG---ASTRSAKEAQNDTFKFLFEHDSPENVYYRWRVY---
>ERR1719311_655590
-ARAIRRP------RRVSFRAADHGEGGSNESFKFLFEHDSPDNVYYRWRVY---
>ERR1719409_1977738
-----RDHGGAHRTEWPRFENTVKQKNVNNPQFAFLYSGE--GGEYYAHVLAVH-
>ERR1719391_1199466
QRNIIDKLAQFVARNGPDFEMTTKNRQKGNPKRPRAEEHH-----RQTGPiCR---
>ERR1719361_48401
IREAIDMNIQCGQQLG-ELPVATKDGPPPVWiESKSDNDGK--FY-YYNAKtRE---
>ERR1719500_1233090
LWNIPVNL----------------SSQVHLWSSGLWMLEE--YYAYYDFKVR---
>tr|A0A178VHT3|A0A178VHT3_ARATH Uncharacterized protein OS=Arabidopsis thaliana GN=AXX17_At3g46540 PE=4 SV=1
------SLWKFLC----------------------WFYLSCFFmvpYSRMGFSVPFS-
>tr|I0YV60|I0YV60_COCSC Uncharacterized protein OS=Coccomyxa subellipsoidea (strain C-169) GN=COCSUDRAFT_55975 PE=4 SV=1
VRKVVEKLAEFVAKNGRNFEDVTRQRNPEDSSFRraFLYHKTSPEYLYYESRVRAL-
>tr|A0A251S961|A0A251S961_HELAN Uncharacterized protein OS=Helianthus annuus GN=HannXRQ_Chr15g0479141 PE=4 SV=1
VKKAADNLATFVAKNGRQIEHITRQKNPGDTPFKFLYDESCADYKYYEYRLSEE-
>SRR5690606_35954975
ARHTTRCVRTQTHRHGPDFEQLTRSRNEGNPKFGFLFDTAASAsaeihveRAYYLWVRDEE-
>ERR1740123_1565852
PTP--------------APT----KRTCNSRKLNPC--T-AHPIDWYQPRCSQ-R
>tr|A0A1I7RW97|A0A1I7RW97_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus PE=4 SV=1
IRAIVEKTAQFVAKNGIDFENRIKEKESGNSRFGFLAPN-DPYHAFYRAKVAEAE
>ERR1700761_57135
IREIVEKTAGYVQRNGAAFEDRIRQTQGQAPRMSFINPE-DEYHSYYQWRLKEIK
>tr|K8FD13|K8FD13_9CHLO Uncharacterized protein OS=Bathycoccus prasinos GN=Bathy15g00580 PE=4 SV=1
VRAIVDKTAQFVAKNGPEFETRILSSEKNNQKFSFLREN-SPFYSYYRGKIESNK
>ERR1719383_1143285
------------APSAPRVREragTVAPGSLSVSRPQFLIPT-NPYHTFYRKKVEDFK
>ERR1719379_1907514
IRGIVDKTATFVARNGPDFEKRILLNEASNAKFAFLQDN-NPYNAYYKQKVQEMS
>tr|A0A0A1NFN3|A0A0A1NFN3_9FUNG Uncharacterized protein OS=Rhizopus microsporus GN=RMCBS344292_09830 PE=4 SV=1
VRTMIDKAATHLATKPPELAKHILEND-KEGKFSFLKEG-DPYHAYYLFKFNEAK
>tr|A0A1C7NDE8|A0A1C7NDE8_9FUNG Splicing factor 3A subunit 1 OS=Choanephora cucurbitarum GN=SF3A1 PE=4 SV=1
IRKMIDKAAANLATKGPELENHIRETN-KDGRFNFLNPN-DPYYAYYRFKYQEVK
>tr|A0A2G5B9W4|A0A2G5B9W4_COERN Surp module (Fragment) OS=Coemansia reversa (strain ATCC 12441 / NRRL 1564) GN=COEREDRAFT_32794 PE=4 SV=1
IKTIADKTAEHVAKSGEAFQQLIRDKYQGNAKFSFIYPN-DPYFTYYEHMVEQFK
>tr|A9V640|A9V640_MONBE Predicted protein OS=Monosiga brevicollis GN=33566 PE=4 SV=1
IRNIVNKTAGFVAKNGWDFADRIRKEKAGEVKFNFLQPD-NPYHAYFVHMVEEVR
>ERR1719201_997202
----------------FQYVERIRREQGEQVKWNFLMPS-NPYHAYFLYMVEEVK
>tr|A0A167D095|A0A167D095_9ASCO Prp21p OS=Sugiyamaella lignohabitans GN=PRP21 PE=4 SV=1
VRAVVEKTVDYIARRGDAFLERIRNSN-DSARFSFVKEN-DPYYNYYNWRLEQHK
>tr|A0A0J9X9T8|A0A0J9X9T8_GEOCN Similar to Saccharomyces cerevisiae YJL203W PRP21 Subunit of the SF3a splicing factor complex, required for spliceosome assembl
VRSIVEKTVGYVVRNGPTFEERIQQKEESNSKFSFLRFE-DPYRAYYEWRLTERR
>tr|G3TTS0|G3TTS0_LOXAF Splicing factor 3a subunit 1 OS=Loxodonta africana GN=SF3A1 PE=4 SV=1
IRNILDKTASFVTRRRSEFEAR-QQKEINNPEFNFLNFY-NTYHAYHYKNSANSQ
>ERR1719376_762609
----------LGCSNwfnaftfheFIKFVViVCMIWISVRQKIKFLSNT-DPYHAYYKHKLNEFM
>tr|A0A1D2A0D8|A0A1D2A0D8_AUXPR Uncharacterized protein (Fragment) OS=Auxenochlorella protothecoides GN=g.7712 PE=4 SV=1
IRAIADKTALYVFKNGPDFERRVRANEAENTKFSFLKPG-DPYHAYYQHKLAEHA
>ERR1740117_2668353
LDLSRLQVHDC-------LPDC---TLQCGWFLGAGEPN-NPYWKYYQARLKF-G
>ERR1719223_1558363
--------------------QRILKNEANNNKFAFLKPG-DPFNPYYAEQVIANG
>GraSoiStandDraft_41_1057321.scaffolds.fasta_scaffold7758357_1 # 3 # 296 # -1 # ID=7758357_1;partial=10;start_type=GTG;rbs_motif=3Base/5BMM;rbs_spacer=13-15bp;gc_cont=0.724
---------PPPARAGAEFEARILGNESGNAKFNFLRDG-DPYHAYYRARVEALA
>ERR1740128_1177720
-----------------EFEARIHKNEQSNPKFNFLKPG-DAPAAPLSGNISAPP
>ERR1719394_957427
------------------LNRESNRMKLITPSLIFLNPG-DPYHAYYQYKVDAIR
>tr|A0A0C2MAQ1|A0A0C2MAQ1_THEKT Splicing factor 3A subunit 1 OS=Thelohanellus kitauei GN=RF11_12834 PE=4 SV=1
VRSIIEKTAEFVHKNGMEFEDKIRQREVQNLKFSFLNRN-DPYFMYYKHILYEYT
>tr|A0A087YFQ4|A0A087YFQ4_POEFO Splicing factor 3a, subunit 1 OS=Poecilia formosa PE=4 SV=2
T--rrSAEERRGGHAVNGPGLESNLRRPLLMSYA-HNLNLV-MCFKSYYSPDKTAIA
>ERR1712106_343958
--------------------HKIRQNEAANPKFNFLKDE-DPYNGYYRHKVKEFG
>ERR1711892_1033949
-----QRRVHGVARNGRSFLTQLMQKEAKNYQFDFLRPQ-HTLFQHFSKLVEQYN
>tr|A0A261Y4R6|A0A261Y4R6_9FUNG Uncharacterized protein OS=Bifiguratus adelaidae GN=BZG36_01717 PE=4 SV=1
IRAIVDKTADFVARHGTQAEDRIREQGRHNPKFAFMNPL-DPYHAYYRNMIQDIK
>tr|A0A1S8W701|A0A1S8W701_9FUNG Uncharacterized protein OS=Batrachochytrium salamandrivorans GN=BSLG_00954 PE=4 SV=1
IRNIVDRTADFVARNGPQFEERIKAKEQQNTKFSFMIPT-DPYHGYYLHRISEAR
>tr|A0A1Y2GCE2|A0A1Y2GCE2_9FUNG Pre-mRNA splicing factor PRP21 like protein-domain-containing protein OS=Lobosporangium transversale GN=BCR41DRAFT_361573 PE=4
SIVLADRTAAFVAGseLGAIVEERLRQKEKNSTKFSFLNPT-DPYHAYYAFKVKEAK
>tr|A0A059J564|A0A059J564_9EURO Uncharacterized protein OS=Trichophyton interdigitale MR816 GN=H109_05052 PE=4 SV=1
IKVIVEKTAGFVARNGHVFEDRVREKEKNNTKFCFLNPN-DAYAPFYAWRLSEIK
>tr|A0A0G2DZV5|A0A0G2DZV5_9PEZI Pre-mRNA-splicing factor OS=Diplodia seriata GN=BK809_0005368 PE=4 SV=1
LHDTIEKTAGYVSRNGKVFIERLRANHRTNPKFSFVFEE-DAFHPYFQWRIEEHR
>SRR3989338_6324060
LKTVIDKTAQYVAKNGPAFEKKIIET-NSSEKFAFLQPD-HPFHAYYRQCIDEFY
>ERR1712137_330219
IKTVIDKMVSFVATSGRQFEQKVWESESNNPKFGFIQPT-HHYHAYYLQQLKRLQ
>tr|R8BFV1|R8BFV1_TOGMI Putative pre-mrna-splicing factor sap114 protein OS=Togninia minima (strain UCR-PA7) GN=UCRPA7_6304 PE=4 SV=1
IREAIEKTAGFVARRGVSFEERIRESHGANPKFSFLMSQGDPYNAYYEWRKQEYE
>tr|S3CV22|S3CV22_OPHP1 Pre-mrna-splicing factor sap114 OS=Ophiostoma piceae (strain UAMH 11346) GN=F503_03657 PE=4 SV=1
MREVIEKTAGYAVRGGAGIEARLRENHSNNPKFSFVTNPDDAFHAFYEWRKEEYK
>tr|A0A177CGX5|A0A177CGX5_9PLEO Uncharacterized protein OS=Paraphaeosphaeria sporulosa GN=CC84DRAFT_1163122 PE=4 SV=1
VRDSIAKTADFVHRRGERDEAALvtRVRDQGKSNMAFVL-PEDTYNPYYAWYLQQLR
>SRR5271156_5509908
VLEIIEKTATYVSRNNKDFEERIRENERANPKFAFLN-PLDPYYKYYDWRLNELQ
>tr|A0A1E4TDZ0|A0A1E4TDZ0_9ASCO Uncharacterized protein OS=Tortispora caseinolytica NRRL Y-17796 GN=CANCADRAFT_98053 PE=4 SV=1
VRDVVEKTVQYVERNGAEFENRLKRTALQDGKLTFLL-EDNEYYSYYKWRVHEIR
>ERR1700735_4343456
---SLPLLPGWIDRVLTVLdLERVREKEVNNPKFSFLNPN-DAYAAFYHWRLAEVR
>tr|A0A0L0NDG4|A0A0L0NDG4_9HYPO Splicing factor 3A subunit 1 OS=Tolypocladium ophioglossoides CBS 100239 GN=TOPH_03522 PE=4 SV=1
VRNVVEKTAGYIMRNGDSMVARIREREeTSSVRFNFLDPS-DAYHLFYQWRLSEIR
>tr|B6HEJ8|B6HEJ8_PENRW Pc20g07660 protein OS=Penicillium rubens (strain ATCC 28089 / DSM 1075 / NRRL 1951 / Wisconsin 54-1255) GN=Pc20g07660 PE=4 SV=1
IRKAIETTASYVVRHKGTFEDRIRQKEKNNYKFSFLTPG-DAYEPYYQWYTAEYK
>tr|A0A2H1HFB3|A0A2H1HFB3_ZYMTR Uncharacterized protein OS=Zymoseptoria tritici ST99CH_3D1 GN=ZT3D1_G11255 PE=4 SV=1
VRSKIEKVASYVSRAGEKFEDSVRQKNAGTNQTTFLEPD-DPYFPYYKWRVGEIK
>tr|A0A0D1Z3L3|A0A0D1Z3L3_9PEZI Uncharacterized protein OS=Verruconis gallopava GN=PV09_01485 PE=4 SV=1
LRAAIEKVAGFVARNGAAFEQRMRNDTANATKLAFIFED-DPYYSYYQWRREECK
>tr|A0A074WHK1|A0A074WHK1_9PEZI Uncharacterized protein OS=Aureobasidium namibiae CBS 147.97 GN=M436DRAFT_48161 PE=4 SV=1
FRTLLEKTAGYIVRNGPAFEARIRDSAEGNPKLQFVLPD-NPYHPFYLWRLEEIK
>tr|A0A0S6XTZ3|A0A0S6XTZ3_9FUNG Uncharacterized protein OS=fungal sp. No.11243 GN=ANO11243_073340 PE=4 SV=1
IRAVIEKSADFVLRRGKEYEDRMFKMAEAEHKFQFVFPD-NIYHGFYLWRKEEIR
>JI6StandDraft_1071083.scaffolds.fasta_scaffold808813_2 # 377 # 496 # 1 # ID=808813_2;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.567
LTTIIEKTANFVARLGVDFEKRIYANEKNNPKFSFLRPT-DQFYPFYQKRIQDFR
>tr|A0A016RYQ7|A0A016RYQ7_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=Acey_s0334.g2840 PE=4 SV=1
--LVIQHVVEFTIREGPLFEAMLMTRERTNPLFRFLFELNHPTHVYYRWRLFSI-
>GraSoiStandDraft_50_1057286.scaffolds.fasta_scaffold778650_2 # 307 # 720 # 1 # ID=778650_2;partial=01;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.686
-CSLYVLINVTHSSVGPEFESRIRQNEINNPKFNFLNPG-DPYHAYFQQKVADIK
>ERR1740129_1361350
IRMLIDRTAEFVATEGWDFEKLLFDREVGNPRFAFMSfDqdnLEQPLHVYYRWRVLSY-
>ERR1719473_1386262
LRRIIDKTAELVSLEGWDFEKMILDREKDNPKFDFMKiDdPEAPLHLYYRWRCTAF-
>tr|A0A0B1PGF7|A0A0B1PGF7_9BILA RNA polymerase Rpb7 protein OS=Trichuris suis GN=D918_08715 PE=4 SV=1
------KDESVVITADDEIRVKILGIRVDaNDIFAIGSLM-DDYLGCM----CSWR
>ERR1712110_1149803
IRTIIDKTVQFVTKNGNAFEKKILSLQEGNAKFNFLLHK-NPYHLYYQAKLNEFK
>ERR1712226_566394
IRTIINKTVQFVAKNGYTFEQKILNQQEGNTKFNFLLHK-NPYYLYYRAKINEFQ
>ERR1712224_11165
IRVLIDKTAIFVKKNGSLFEKKIVEQQSHNSKFNFLQTS-DPYHSYYVAKLKELC
>tr|A0A1E3NLU4|A0A1E3NLU4_9ASCO Uncharacterized protein OS=Pichia membranifaciens NRRL Y-2026 GN=PICMEDRAFT_15100 PE=4 SV=1
IRTLIGKTAIYVDKNGKAFENKIKTKESKNPKFIFLNEK-DPYHAFYQYNLTVIK
>tr|A0A1D2VC26|A0A1D2VC26_9ASCO Uncharacterized protein (Fragment) OS=Ascoidea rubescens DSM 1968 GN=ASCRUDRAFT_26308 PE=4 SV=1
AREVVIRTAGYIHRNGSAFEARLSKDQDdrAVSKFSFLRLH-DPFNSYYRWVLSQYR
>tr|A0A1E3PGS0|A0A1E3PGS0_9ASCO Uncharacterized protein OS=Nadsonia fulvescens var. elongata DSM 6958 GN=NADFUDRAFT_52267 PE=4 SV=1
ICDDIERTAAYVSRNGSLFEERLRESKKNEKRFSFLDPE-DSYNAYYLFRYKEHK
>tr|Q6CDU9|Q6CDU9_YARLI YALI0B21032p OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=YALI0_B21032g PE=4 SV=1
IKAKIERTAEFVAKNGIAFEHRIREKEGSNALFSFLNND-DHYHLYYQWRCDEYA
>tr|A0A1E3QLA7|A0A1E3QLA7_9ASCO Uncharacterized protein OS=Babjeviella inositovora NRRL Y-12698 GN=BABINDRAFT_162686 PE=4 SV=1
IRDIIHKVSSYVSKNGDSFELKMKSQEANNPEFGFLYEK-DPYNRYYTWVLQELR
>tr|A0A1B2JBB2|A0A1B2JBB2_PICPA BA75_03224T0 OS=Komagataella pastoris GN=ATY40_BA7503224 PE=4 SV=1
VQEIINKTAGYVHRNGASFETRIKEKELHNNKFQFLIDE-DPYNAFYKWKLEQLK
>tr|K0KDM9|K0KDM9_WICCF Splicing factor 3 subunit 1 OS=Wickerhamomyces ciferrii (strain F-60-10 / ATCC 14091 / CBS 111 / JCM 3599 / NBRC 0793 / NRRL Y-
IKEIVLKTANYVYRNGKQFENKIRENESNNKNFTFVNEN-DPYNQYYQFVLENIE
>tr|W1QHG4|W1QHG4_OGAPD Pre-mRNA splicing factor OS=Ogataea parapolymorpha (strain ATCC 26012 / BCRC 20466 / JCM 22074 / NRRL Y-7560 / DL-1) GN=HPODL_0
IRAVIEKTAGFVVRNGESFEQRLLTKEAKNPKFAFLRKG-DVHHEYYQKYIQDLR
>tr|A0A0H5C985|A0A0H5C985_CYBJA Uncharacterized protein OS=Cyberlindnera jadinii GN=BN1211_5906 PE=4 SV=1
IKNIIHKTVQYIRRNGPEFEQKVRES---NSKFTFLKDT-DPYNSYYKFSLENLP
>GraSoiStandDraft_44_1057316.scaffolds.fasta_scaffold5170966_1 # 3 # 218 # -1 # ID=5170966_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.606
VKETIEKTAGYIARNGQQFANRLLKEGN--DKFSFLKPE-DPFYNYYDFCIQDYK
>tr|A0A137PEC1|A0A137PEC1_CONC2 Uncharacterized protein OS=Conidiobolus coronatus (strain ATCC 28846 / CBS 209.66 / NRRL 28638) GN=CONCODRAFT_68350 PE=4 SV=1
IRKIIDTTSSHVAKGGIQLEHMLKENNKAHSKLSFLYST-DPYHTYYQHMVSKFR
>tr|A0A1X2GJ78|A0A1X2GJ78_9FUNG Uncharacterized protein OS=Hesseltinella vesiculosa GN=DM01DRAFT_1335352 PE=4 SV=1
IRKKVDKVAELLWNKGAQLENKIRENERHNPLFSFLNPT-DPFHAYYQYKLRQAK
>SRR5258706_5503689
IKSeylfpedacvwlnhyssdIIARTASHVARSttRQQFEDRIRESRREDPKFSFLNSS-DPYNAYYKYRVERII
>SRR6266480_4713089
LSYFIQNIAGYVVRNGVPFETNIKATRGEDPRLSFLFEG-DEHNGYYQWCLEEGF
>ERR1719468_581092
LLCLIHRLIEFVVQEGPEFEAMVMVREINNPQFRFLFDNKSPAHSYYRWKLYSI-
>ERR1712127_216615
LLCLIHRMVEFVVLEGTQFEAAIMAKESQNPMFRFLFDMQSPAHGYYRWKLYSV-
>13_taG_2_1085334.scaffolds.fasta_scaffold294127_1 # 2 # 469 # 1 # ID=294127_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.427
TKATIDRLAKFIAIDGQQFEELILQREHENPQYAWLFTTKTVENDYYRWKVYSL-
>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold3962284_1 # 2 # 205 # -1 # ID=3962284_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.417
VRHAADLTAQAVIRHGPEVEEML--RSNGDPLFSFLNPpQKGPAHDYYIWRLYSF-
>tr|A0A0F7UTP5|A0A0F7UTP5_TOXGV RRM domain-containing protein OS=Toxoplasma gondii (strain ATCC 50861 / VEG) GN=BN1205_090590 PE=4 SV=1
KRVLIDLLAKYVAEEGHPFEQQIMEKSPrgsEDGKFDFLYDHDSPDNIYYRWRVFAF-
>tr|U6KVI3|U6KVI3_EIMTE RRM domain-containing protein, putative OS=Eimeria tenella GN=ETH_00017015 PE=4 SV=1
VKMVIDLLAKYVAEEGHVFEQQVMEKFPpESGRLNFIYEKDSPDNIYYRWRVYAF-
>ERR1719247_2496416
LQARIDTVAKFVALNDNVEDV-LLHREGKNKDYAFLKEYDSPAGMYYRWRTFAF-
>ERR1719409_71936
VKRIVDRLGRYVARDGYPFEQLIMEREVSQPLFKFLFEHDSDDNIYYRWRVYAF-
>ERR1712183_1039439
QRKWIDIVSSFVASVGLAFESFLRSELEegafllqNGVNIKFLDEFDTPDAVYYRWRVYAF-
>tr|A0A0N5CB43|A0A0N5CB43_STREA Uncharacterized protein OS=Strongyloides papillosus PE=4 SV=1
IRSIIDKTASFVAQHGEHFETKIKEKERENKKFKFMFSN-DPYHVYYRFQIDSFK
>tr|A0A1I8C4D4|A0A1I8C4D4_9BILA Uncharacterized protein OS=Rhabditophanes sp. KR3021 PE=4 SV=1
IRNIIDRTANFVSQHGTVFETKIKEKERDNKKFKFMFPG-DPYNVYYNMQIDAYQ
>ERR1719387_1002708
SRRTA-----RSSSRGSS-----PSRRT--TRSSSSCRG-KTlTMRITRQRSPSA-
>ERR1719387_355219
DRPVH-----REERRGVRAEDHRPAEGQ--HEVQVPVAE-RPlP-CVLQGKDRRV-
>ERR1711988_973443
VRNIIDKAAGKVNDVGPQFEHMLASRLQGNPKFNFFNPD-DHYHAYYRYKLQQL-
>ERR1719262_1022945
ITRSSPSSARTIHTMHTTYgDSARSKKEKHNPKFSFLSPN-DSYNAYYLWRLSEIK
>tr|I4YH61|I4YH61_WALMC WD40 repeat-like protein OS=Wallemia mellicola (strain ATCC MYA-4683 / CBS 633.66) GN=WALSEDRAFT_62672 PE=4 SV=1
SKNHV----FTADFspDGRMLAVGGADgritlwqSTTGQIQKDWITNN-GLY--DLRWQSpdgNRI-
>tr|A0A0F7SI04|A0A0F7SI04_PHARH Splicing factor 3a, subunit 1 OS=Phaffia rhodozyma PE=4 SV=1
IRVLIDRTSAALHRsaNPTQLESKIREGQKSDPKFSFLNSS-DPYHTYYQYSLARL-
>1_EtaG_2_1085319.scaffolds.fasta_scaffold246537_1 # 2 # 484 # -1 # ID=246537_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.424
IRAVADKTAAFVAEKGEEYERKIMARGGESNKFSFLTPT-NPYHAYYRMKVAELR
>JI10StandDraft_1071094.scaffolds.fasta_scaffold930452_1 # 1 # 378 # 1 # ID=930452_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.622
IRAVVNKTAQFVAKNGRQFESRILSSSeGSTPKFAFMAGS-SPYNPYYESWILFFQ
>ERR1719375_2481925
SRPGSWRTTPTT---------------TSSSSCRRAART-RPT------------
>ERR1719420_2723494
LRAIVDKTAAFVAKNGAEFEARIVANNPNNDKFQFLQRG-SPRRGARRRSRRTRR
>ERR1712147_529821
EDAVGADSLCAHRRAGLEFESRIVANNPNNDKFQFLQKG-SPYQAYYQYKIRVER
>ERR1719375_646352
----------------LRGEERRRVrgpdrgEQPQQRQVPVPAEG-QPVPGLLPVqnpgg-----A
>ERR1719375_389315
-----------------------RGEQPQQRQVPVPAEG-QPVPGLLPVQNPGGA
>AntAceMinimDraft_17_1070374.scaffolds.fasta_scaffold29566_2 # 1426 # 2319 # 1 # ID=29566_2;partial=01;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.450
GTGIVDSTAKYVAKXGGVFEERIKREHRDKPKFSFLLPD-DPFNAYYQHKVKEIR
>ERR1719484_246020
RPGSSTRRRKFVAKHGPEFEQRVLREQ-SNTKFAFLQAS-NPYRAYFDHRVADFK
>ERR1719247_2604304
IRGVIDTTAQFVAKHGKSFEERVQREQ-SSAKFAFLQPD-NPYRAYFDMRVSDFQ
>ERR1719258_290930
IRGVIDTTAQFVAKHGKSFEERVQR-EQSSAKFAFLQPD-NLQLRSTRCAALLKQ
>tr|C5LBL3|C5LBL3_PERM5 Pre-mRNA splicing factor SF3, putative OS=Perkinsus marinus (strain ATCC 50983 / TXsc) GN=Pmar_PMAR011885 PE=4 SV=1
IKSVIDKTAEFVAKHGDEFERRVMVQQAKQAKFAFLAPG-NPYRRYYEHRVAELR
>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9932630_3 # 1044 # 1280 # 1 # ID=9932630_3;partial=00;start_type=ATG;rbs_motif=TAA;rbs_spacer=11bp;gc_cont=0.350
MKAIIDKTAAWVARHGNSFEQKVKEKQKDASQFSFLRPG-NPYYAYYQQKVREIR
>GraSoiStandDraft_28_1057319.scaffolds.fasta_scaffold1231943_1 # 3 # 131 # -1 # ID=1231943_1;partial=10;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.674
IKNIINTTASFVARLGSNFETKIFSKEKNNPKFSFLQNT-DPYHRYYHWKIQEHR
>ERR1711976_523421
-RYLIDRLASYVFDNGPEFEYIIAEREKFNNDYDFITNVKLHEHIYYLGDCILY-
>tr|R7WAK3|R7WAK3_AEGTA U2-associated protein SR140 OS=Aegilops tauschii GN=F775_28369 PE=4 SV=1
LRHVIDTMALHVLDGGCAFEQAIMERGRGKALFNFLFDLKSKEHTYYVWRLYSF-
>ERR1719334_2839112
LMKVIDRTAVYVAQFGSTFERSVIESEGGResmslkkGDFNFLVV-PGELNLYYRWRVYSL-
>SaaInlV_125m_DNA_1040241.scaffolds.fasta_scaffold416871_1 # 2 # 310 # 1 # ID=416871_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.340
VMALVDTLARYVAEDGYAFERAVMDRERDTDDFAFLLDADCPEHQYYRWRVFSL-
>ERR1719336_902544
VQKVIDTLAWYVSKDGMVFENTAIEREREDANFSFLYQPAHPDHIYYRWRIWAF-
>ERR1719473_2091781
VAFRCDLLAHYVALHGLSFEEAVRKQERNNELFAFIDDPASPDHAYYRWRIYAY-
>JI10StandDraft_1071094.scaffolds.fasta_scaffold6498530_1 # 1 # 234 # 1 # ID=6498530_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.662
LRQMIEKTAEYVARNGSQFEEIIYNNEKDNPSFFFLKLD-NPYRAYYDKKVIEFA
>ERR1719343_1335689
KQRMIDTLAKYVSQEGHTFEQIVMDREQTN-aDFG---------------------
>tr|C1FIV2|C1FIV2_MICCC Uncharacterized protein OS=Micromonas commoda (strain RCC299 / NOUM17 / CCMP2709) GN=MICPUN_63212 PE=4 SV=1
VMYVIDRLAKYVVDNGKAFEEKVMEKERGNEEFGFLFDYKSPNHQYYKWRVFSL-
>tr|A4S4L2|A4S4L2_OSTLU Uncharacterized protein OS=Ostreococcus lucimarinus (strain CCE9901) GN=OSTLU_26450 PE=4 SV=1
LKRRIDITAAYVAEDGEVFERALKAREATNEEYRFLFDECSQAHAYYAWRVFAF-
>tr|Q4N5A1|Q4N5A1_THEPA Uncharacterized protein OS=Theileria parva GN=TP02_0389 PE=4 SV=1
IRSVIDKTAEFVAKNGEQFVSKLRLDQsnsslNDNIKFNFLEPG-NAYHLYYKLKLSEL-
>tr|A0A0M3K8M2|A0A0M3K8M2_ANISI Uncharacterized protein OS=Anisakis simplex PE=4 SV=1
---IRNAVVRVVVPTERYLndDSFYRYVRSFDLAFRFLFDNHHPAHVYYRWKLYSM-
>ERR1712071_495001
---YEHNISRFHKYLSFFFLIDIyekiDFKDFNLLFARYLKSCDR--------------
>tr|A0A0R3SSD7|A0A0R3SSD7_HYMDI Uncharacterized protein OS=Hymenolepis diminuta PE=4 SV=2
-LAIIHRLIEFVVREGPQFEAAIMNKEERNSQYKQV-------------DFKII-
>ERR1740117_1016824
IRKLIDTMADYVAKLGQIFEKQVMTKELKNPDYKFLFQVREENHRYYRWKTHSL-
>tr|A0A2C5Y2G5|A0A2C5Y2G5_9HYPO Uncharacterized protein OS=Ophiocordyceps australis GN=CDD81_732 PE=4 SV=1
MRNILERTAGYVSRNGQVFQERIQENNKHNLLFNFLSPE-DAYFPYYEWRLAEIT
>SRR2546423_1180499
-------------------------REARNYQFDFLRpQH-SLYNFFSHLVDQY--
>SRR3569833_340938
IREVIEKTAGYVVRGGSGIESRQRENQGSNPQFQLPIpNH-TFHNFFQHLVDQY--
>tr|A0A152A5F8|A0A152A5F8_9MYCE Uncharacterized protein OS=Dictyostelium lacteum GN=DLAC_01443 PE=4 SV=1
-QQIIDRISEYVARVGPRFESYILENQPHTTIFNFLKPD-QMNNDYYRWKLWTIK
>ERR1719285_1298600
LKLIIDETAMFVNNYGMNFEYLLLERNSPhnerrsknYERFRFLFAIDTEEHMYYRWKLWSL-
>ERR1719285_288658
LKLIIDETAMFVNNYGMNFEYLLLERNSPhnerrsknYERFRFLFAIDTEEHMVLLRESCH--
>ERR1719394_1330975
LQLIIDETAMFVNNYGMNFEYLLLERNSPctiagsygPSRTATKCTS--GPTPLSRC---S--
>ERR1712224_775926
LRDTIDTLADYVSEFGMALESLVIRN---rdmskFERFAFLFETDSPEHMYYRWRLWSL-
>ERR1740124_1605711
SRKVIDELCHFVHKHGLEFEWHVMTE---eqrspTNRFHFLFDLNCDEHLYYRWRVWSL-
>ERR1719204_765379
IRGMIDETASYVFMYGIEFECVLLDRSRDrrsslFEQFRFLFDIDRDEHMYYRWKLWTL-
>ERR1712038_2133997
TQRVIDRLSSFVSKLGHVFEKLVMERELKNKTFRFLYNVTSADHMYYRWKAYSL-
>ERR1719162_542392
LRNrrk--------LYRSsarlmhqRLHLQDEAQN-LQNHAKFSFLLPN-NPYRQYYEHKVKEFK
>tr|A0A0G4F3Q9|A0A0G4F3Q9_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 GN=Cvel_14969 PE=4 SV=1
IQNVIEKTASFVAKNGDAFESRIIQErQGDQQKFGFLMKN-NPYRPYYEMKLRELR
>ERR1719335_1473530
---------------------------QDAQERDY-MAT-IDWHEFVCCETIEFT
>ERR1712060_997097
-------------------QALIEEKERDKVRQ-AKKDK-KKDKLRALKNVGEFD
>ERR1719171_2980740
GL-------ELFHLVLVVCPVWIRR-QQERKLCVVLLPS-NPYRSYYEHKVKEFK
>ERR1719506_3679797
--------------RPTQLenKHLISEPIPPAKGNCGFAIK-KYTRYAVLWTMAS-M
>tr|L1IR54|L1IR54_GUITH Uncharacterized protein (Fragment) OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_41563 PE=4 SV=1
MKSIVDKTASFVAKNGPQFEQRILNNERNTTKFAFLQST-SPYYPYYQKKLAEAR
>tr|A0A0G4H1Y8|A0A0G4H1Y8_VITBC Uncharacterized protein OS=Vitrella brassicaformis (strain CCMP3155) GN=Vbra_19254 PE=4 SV=1
EREIIDKTAAFVAKNGVEFEQHLFRQhqdaaaSGQQQKFAFLFQN-NPYRAYYDHKVKELL
>tr|A0A023B6C5|A0A023B6C5_GRENI Splicing factor OS=Gregarina niphandrodes GN=GNI_081150 PE=4 SV=1
VREVIDKTAQFVAKNGVEFEKRMMQEANANERFGFLFPN-HGYRDYYDRKLQEVR
>tr|B9NGR6|B9NGR6_POPTR SWAP/surp domain-containing family protein OS=Populus trichocarpa GN=POPTR_0006s29410g PE=4 SV=1
LQKRIDKLVEYSAKNGPEFEVMIREKQQDNPAYSFLYGGE--GHAYYRYKLWL--
>SRR4051794_1511198
DRLTMDKLAEYVAKNGLKFEEMMKEKQADNPKFGFLEEGHP-YNPYYRFKVWC--
>APLak6261692095_1056202.scaffolds.fasta_scaffold14918_1 # 1 # 837 # -1 # ID=14918_1;partial=10;start_type=ATG;rbs_motif=AACAA;rbs_spacer=14bp;gc_cont=0.493
VREIIDKTAGFVVKNGRAFEDRISGSAQGNtLKFAFLTEL-HPYHAYYEQTILDL-
>ERR1719375_658062
TQKVVDRTAMIVAQEGWDFEKLLMERERDNPRFSWLKPddSEAPIHVYYRWRTFAF-
>ERR1719198_366746
LKRLIDVTAQFVAFEGWEFEHLLIEREANNERFAFLKTedVEDPLHVYYRWRTFSL-
>GraSoiStandDraft_55_1057291.scaffolds.fasta_scaffold1306120_2 # 271 # 417 # 1 # ID=1306120_2;partial=01;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.810
LKKLIDKTAEFVADEGWLRLFT---KN-----IAFX-------------------
>APCry1669190327_1035288.scaffolds.fasta_scaffold495035_1 # 1 # 123 # 1 # ID=495035_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.195
KRRTIDRLARYVSQEGFSFEKLIMQRESPEGMFDFLFYHDSPENIYYRWRTFAF-
>ERR1719495_2729017
--------------------GSSPILSLSSSSRTSLPSTSTIGGDCIRCCRERP-
>ERR1719318_2382840
--CLAPRSKLSSPRTGHSCASStgWWSLSSERVQYLkqQLX------------------
>ERR1719397_826466
--CLINRMVEFVIREGLILQPAFAST-YSHGKL-rC--------------------
>ERR1719394_1415840
--CLINRMVEFVIREGPIFEATIMNRSPPIVEVLrrLVLEKELELRI----------
>SoimicMinimDraft_9_1059737.scaffolds.fasta_scaffold1039942_1 # 2 # 235 # -1 # ID=1039942_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.650
LKQIIDKTANFVATKGMSFEQKLAEKERDNPTFAFLKPE-NAYNPYYKQKLQE--
>tr|A0A1A8LKE8|A0A1A8LKE8_9TELE U2 snRNP-associated SURP domain containing (Fragment) OS=Nothobranchius pienaari GN=U2SURP PE=4 SV=1
--------VRGRRSSlrniFPDFSDTI---------FCLGLQRI--TGLTNKH------
>tr|A0A1A8G881|A0A1A8G881_9TELE Uncharacterized protein (Fragment) OS=Nothobranchius korthausae GN=Nfu_g_1_005245 PE=4 SV=1
--QVEHYRNKLLQKE------FEKNEEKNERSK---SKDRQKDDRRNREKSKRK-
>tr|A0A293LKP6|A0A293LKP6_ORNER U2 snRNP-associated SURP motif-containing protein (Fragment) OS=Ornithodoros erraticus PE=4 SV=1
----------------------MKIHQWQHVTYLLWFENQSPAHVYYRWRLFSI-
>tr|F1QQI7|F1QQI7_DANRE Zgc:163098 OS=Danio rerio GN=zgc:163098 PE=1 SV=1
--GLIHRMIEFVVREGPMFEAMIMNREKNNPDFRTPMMKINVCKLFGQNYL----
>ERR1719354_117337
---------GSQVNSADFYFK-ILQKTRDTS-------LTFKKTPFTKTTDSSK-
>ERR1740117_103862
TRHRIEKMAEYINRNGPSFENMMQQKQQGNPAFSFLFPG-TEQYPFFKWAIHCM-
>ERR1712196_744669
TRQRIDKMAEYIVRNGTQFEAMMREKQRANPEFAFLFGG-Y-HSAYYHWVLYAM-
>tr|I2CQK6|I2CQK6_NANGC Uncharacterized protein (Fragment) OS=Nannochloropsis gaditana (strain CCMP526) GN=NGATSA_3050200 PE=2 SV=1
-------MAAYVHRNGVAFEEMMMSKERGNEKFGFLFEG-GRHRMYYRWVRHCM-
>tr|H3GC56|H3GC56_PHYRM Uncharacterized protein OS=Phytophthora ramorum PE=4 SV=1
VRNRIDRLVDFVARNGDAFEATAKQRERDNPDFAFLRLG-GPYSDYYQWKKQQ--
>SRR6266702_5772310
FTATINRTIVHMNRssDPLQFEVKLRESRRTDPAFSFLNSA-DPYHAYYRNRLDQA-
>tr|F2U0X1|F2U0X1_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) GN=PTSG_01136 PE=4 SV=1
IRRLIHRTIEFVVRHGPEFEDELAKRTNFDKDFSFLRDFSSSEHVYYRWKLYSI-
>ERR1712038_1203332
TRSLINRTIEFVIKHGAKFEKSLLEKEKNNQAFRFLSDYKSPEHIYYRWRLWSI-
>ERR1712127_962194
VKLLIHRTIEFVLKHGAKFEKSMIAKESKNPAFKFLYDFNSNEHVYYRWRIWSL-
>ERR1719453_1037672
---SGDSSWNKKLNRGVDFEALLMEREHSNPRFSFLFQLESPEHVYFRWRAWSL-
>tr|A0A067H716|A0A067H716_CITSI Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g017463mg PE=4 SV=1
---------------------MTCVTGPEIlsfpssFEFFILFALSPH---FLSFKSLI--
>ERR1740117_1242729
VKDRIDKFVNYLLRNGPGFEKMLRDKQQNNPEFAFLFGG--PEHNYYRWRLWCL-
>tr|A0A1I7SRL2|A0A1I7SRL2_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus PE=4 SV=1
LKLAIERTALYVAQNGPQSEIEARRVNYGNPGYQFLYNG--EGSTYYQFCLAK--
>tr|E3M0B9|E3M0B9_CAERE CRE-TAG-65 protein OS=Caenorhabditis remanei GN=Cre-tag-65 PE=4 SV=1
----MHESSNWRGGSGPPHGQTWRNNDIPLQ--GAHfYSN--NQK-RFHFTERH--
>tr|G5ED97|G5ED97_CAEEL SR-related CTD associated factor 6 OS=Caenorhabditis elegans GN=tag-65 PE=1 SV=1
----MNENSSWRGGSSSQHGQSWRSGDIPLPR-DFP---------------PP--
>tr|A8XCU2|A8XCU2_CAEBR Protein CBR-TAG-65 OS=Caenorhabditis briggsae GN=tag-65 PE=4 SV=1
----MFENSSWRGGNE-PPGQSWRTSDVPHP--------------------SQ--
>tr|A0A2H2I029|A0A2H2I029_CAEJA Uncharacterized protein OS=Caenorhabditis japonica PE=4 SV=1
----MNESSSWRGPPQHAESWRSMNSDIPLP--------------------PS--
>tr|A0A0L8H391|A0A0L8H391_OCTBM Uncharacterized protein (Fragment) OS=Octopus bimaculoides GN=OCBIM_22023393mg PE=4 SV=1
-KEDMEKT------LANAVVKVVIPTERytFFSLFLVVVERNTVNFLFVVFF-----
>tr|K1QT70|K1QT70_CRAGI Uncharacterized protein OS=Crassostrea gigas GN=CGI_10025796 PE=4 SV=1
-VCNQHirnngnEVAEFFYvCCYRNDEGRMICSELSEDLWLTVF-FHVTIVINVIVVLYSP-
>ERR1712194_386512
-----------------WF------ETWSKFDFHFLFESVSDNANYYRWRTYSI-
>tr|A0A137PA29|A0A137PA29_CONC2 Uncharacterized protein OS=Conidiobolus coronatus (strain ATCC 28846 / CBS 209.66 / NRRL 28638) GN=CONCODRAFT_69496 PE=4 SV=1
-LEVIHKTIERVLDLGNSFEQLIMERN-sNDPNFAFLTDIDSPNHLYYKWKLHSL-
>tr|A0A1Y1WAL9|A0A1Y1WAL9_9FUNG RNA-binding domain-containing protein OS=Linderina pennispora GN=DL89DRAFT_267412 PE=4 SV=1
-LRTIHWTIQHVLQYGTPFEVMLINR--GDPRLAFLINSRSAEGVYYRWRMYSL-
>ERR1719319_244708
TKRTIDKLAINVAKDGMVFENLVISRNRNNKKFDFLFNTKSSEHMYYRWRTISL-
>ERR1719288_173859
---------EFVAFEGWDLEKLLLERERDNSRFEFMRLekdnLEHPLHLYYRWRTFAF-
>Laugresbdmm110sd_1035091.scaffolds.fasta_scaffold747009_1 # 2 # 241 # -1 # ID=747009_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.458
QCELLDRLARYVSKDGQALETIIRDREASNPEYVCLREPNSEEGLYYRWKVFSL-
>tr|W7U2X6|W7U2X6_9STRA U2 snrnp-associated splicing factor putative OS=Nannochloropsis gaditana GN=Naga_100309g2 PE=4 SV=1
-EFLISRTARYVAKDGGPLEERLKVTESQNPRFRFLWEGDSPEALFYRWRVFSF-
>ERR1711976_375994
IKNVIDSTAHYVSKYGSEFEFYIIHQEKKKhtGLFEFLFEVESSEHFYYRWRLWSF-
>ERR1719272_1551129
VRNMVDKTAGFVARNGIEFADRIANEKRGQSKFNFLQPE-DRYHVYFQHKVKLIM
>tr|A0A1R0GZW1|A0A1R0GZW1_9FUNG Putative splicing factor 3A subunit 1 OS=Smittium mucronatum GN=AYI68_g3440 PE=4 SV=1
IKGIIDKTAEYVAQSGPILEQRVRETEKNNSKFSFLNPS-DPYFAYYQNQLSDFK
>tr|C1FFR5|C1FFR5_MICCC Uncharacterized protein OS=Micromonas commoda (strain RCC299 / NOUM17 / CCMP2709) GN=MICPUN_60603 PE=4 SV=1
-AKFIDTTAMFVAKNGPAFEAMARERQRGDPKFSFLFGG--PDEGYYRYKLGECR
>GraSoiStandDraft_34_1057297.scaffolds.fasta_scaffold2368207_1 # 3 # 323 # -1 # ID=2368207_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.657
-RKFIDTTAFFVAKNGVWFEQMAREKQKDEPKFNFLRKG--AGCLYYAWRLEKAK
>tr|A0A1L0G5Z1|A0A1L0G5Z1_9ASCO CIC11C00000001929 OS=[Candida] intermedia GN=SAMEA4029010_CIC11G00000001929 PE=4 SV=1
IKTTIEKTAQYVKKNGTAFERKLLQND-SDGKFSFLNNS-DKYNSYYKSLLEG--
>tr|A0A1E4SQG0|A0A1E4SQG0_9ASCO Uncharacterized protein OS=Candida tanzawaensis NRRL Y-17324 GN=CANTADRAFT_45761 PE=4 SV=1
ARITIDKTVGYILKNGTGFEQRLKDNN-KDHKFDFLEEG-NQFNEYFKWKIGR--
>tr|A0A1A0HIV8|A0A1A0HIV8_9ASCO Pre-mRNA splicing factor OS=Metschnikowia bicuspidata var. bicuspidata NRRL YB-4993 GN=METBIDRAFT_38261 PE=4 SV=1
MRQVIEKTVAYVRKNGASFEDKLRQND-ANGQFAFLSPE-HQYNSYYIESLKA--
>tr|A0A0V1PR98|A0A0V1PR98_9ASCO Uncharacterized protein OS=Debaryomyces fabryi GN=AC631_05481 PE=4 SV=1
VKETIDKTVGYVLKNGSSFEERLKNNEGSNEKFTFLKKN-DVYNDYYQWKLGN--
>tr|A0A2H1A464|A0A2H1A464_9ASCO Uncharacterized protein OS=[Candida] auris GN=CJI97_000771 PE=4 SV=1
IRDTIDKTVEYVLKNGKSFEERLLKNN-TDDKFSFINSD-SPYHEYYKAQLTE--
>tr|G3B904|G3B904_CANTC Uncharacterized protein (Fragment) OS=Candida tenuis (strain ATCC 10573 / BCRC 21748 / CBS 615 / JCM 9827 / NBRC 10315 / NRRL Y
VRAVIDKTAEYVLKNGDSFETRLKANADSTTKFPFMFST-DAHYPYYQWKLGR--
>tr|A5DH22|A5DH22_PICGU Uncharacterized protein OS=Meyerozyma guilliermondii (strain ATCC 6260 / CBS 566 / DSM 6381 / JCM 1539 / NBRC 10279 / NRRL Y-32
VLSTIEKTVGYIRKNGPSFEERLRNS--GNPKFSFLNPD-DAHHNEYLQRLNS--
>tr|G8Y008|G8Y008_PICSO Piso0_005813 protein OS=Pichia sorbitophila (strain ATCC MYA-4447 / BCRC 22081 / CBS 7064 / NBRC 10061 / NRRL Y-12695) GN=Piso0
LKKVIDKTVDYVIKNGSTFEERLKKNESKGSKFTFLLDD-DQYHPYYKWKLNY--
>tr|H8X5K0|H8X5K0_CANO9 Uncharacterized protein OS=Candida orthopsilosis (strain 90-125) GN=CORT_0D06200 PE=4 SV=1
VKDIIEKSVSYIQKNGKSFEERLLKNN-RNSQFNFLKPD-DEFHQYYVWALQS--
>tr|A0A0A8L1S6|A0A0A8L1S6_9SACH WGS project CCBQ000000000 data, contig MAT OS=Kluyveromyces dobzhanskii CBS 2104 GN=KLDO_g1114 PE=4 SV=1
VKKEICKAVNYVLRNGATFEEKLESEN-----IDFVQPG-GKHNDYYTYLLEH--
>GraSoiStandDraft_46_1057282.scaffolds.fasta_scaffold878252_1 # 2 # 295 # 1 # ID=878252_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.582
-RRRIHRTIEFVLREGVEFEMALMDKvDPADPDYVFLFDYTSLESAYYRWRMYSL-
>tr|M3ARN0|M3ARN0_SPHMS Uncharacterized protein OS=Sphaerulina musiva (strain SO2202) GN=SEPMUDRAFT_152422 PE=4 SV=1
VREKIERVAVYVARNGQAFEDNVRTKNAGTNTTHFLEPS-DEYNGYYKWRVAECK
>ERR1719319_2103600
-RRRVNYSRAFIVGHR---------------HFK-LIFG-FSFKVFRQ-------
>sp|Q86A14|SF3A1_DICDI Probable splicing factor 3A subunit 1 OS=Dictyostelium discoideum GN=sf3a1 PE=3 SV=1
LKTIIDKTAAYAAKLGESFENKVKQREGHNAKFNFMKEG-DQYYPYYRNKIVEN-
>tr|F0ZNQ8|F0ZNQ8_DICPU Uncharacterized protein OS=Dictyostelium purpureum GN=DICPUDRAFT_153372 PE=4 SV=1
LKNIIDKTAQYAAKLGESFETKVRNKEGHNPKFNFMKEG-DIHYSYYRNKINEN-
>tr|F4QDE1|F4QDE1_DICFS Ubiquitin domain-containing protein OS=Dictyostelium fasciculatum (strain SH3) GN=sf3a1 PE=4 SV=1
IKNIIKKTVEFYVKLGDSFIEKIREREKNNEKFNFLKLG-DQYHQYYINVRETE-
>tr|A0A151ZFZ0|A0A151ZFZ0_9MYCE Ubiquitin domain-containing protein OS=Dictyostelium lacteum GN=DLAC_05364 PE=4 SV=1
VKKRIEKTATFAAKYGADFEKKVLSNEGKNPQFSFMLEG-DPLHQYYLNLVAEY-
>tr|D3B3U3|D3B3U3_POLPP Ubiquitin domain-containing protein OS=Polysphondylium pallidum (strain ATCC 26659 / Pp 5 / PN500) GN=sf3a1 PE=4 SV=1
ISTIIEKTAEFVAKHGENFAEKVKAREKNNPKFNFLHDG-DIYNRYFLNKVEAH-
>GraSoiStandDraft_36_1057302.scaffolds.fasta_scaffold5792060_1 # 3 # 200 # 1 # ID=5792060_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.591
--------------MGDEFEKKVAG-QQAgQAKFAFLNHS-HPYRKYYELRIKELR
>UPI0007F40025 status=active
-----------------------------kNKQFQFLFPE-NHYHAYYLDMIQQMK
>tr|A0A1V9Y4N3|A0A1V9Y4N3_9STRA AP-1 complex subunit beta OS=Thraustotheca clavata GN=THRCLA_11963 PE=4 SV=1
TLRYINRFAECVVILDKTN-DAMK-LR-QDSRFAFLNDSNAPLYHYYKWKVISLR
>tr|A0A1V9Z4T0|A0A1V9Z4T0_9STRA Uncharacterized protein OS=Achlya hypogyna GN=ACHHYP_03032 PE=4 SV=1
TMRYVNRFAECVAEGGIQMEDAMR-LRQ-DPRFGFLSDPNASLYHYYKWKVLSLR
>tr|M4C1K6|M4C1K6_HYAAE Uncharacterized protein OS=Hyaloperonospora arabidopsidis (strain Emoy2) PE=4 SV=1
IRRRVDHLARYVATDGLQFENAVRMREANNKDYAFLFDPQCATASYYRWRVYSFA
>ERR1719174_2531191
FQKTIDMLAEYVSKDGQSFENLILARENLNPTFKFLFAMGTPANRYYRWRVYSLA
>tr|A0A024TM69|A0A024TM69_9STRA Uncharacterized protein OS=Aphanomyces invadans GN=H310_12027 PE=4 SV=1
QMALINYVARCVSKDGPQFEDLMR-RSQLDPKYDFLRApPTSSLNIYYKWKVYSLQ
>tr|A0A139AMI1|A0A139AMI1_GONPR Uncharacterized protein OS=Gonapodya prolifera JEL478 GN=M427DRAFT_54330 PE=4 SV=1
---RIHRMIERVVKYGEPFEREVMDREAENCgdgqgDWGFLFRHDSPEHIYYRWKLFSV-
>ETNmetMinimDraft_25_1059894.scaffolds.fasta_scaffold1494968_1 # 3 # 209 # -1 # ID=1494968_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.657
LVPIVETTANYVARMGLDFEVQIRQRQAKNKRFGFLDPA-NPYHAFYKWTLQAA-
>tr|Q7RT71|Q7RT71_PLAYO Drosophila melanogaster LD23810p OS=Plasmodium yoelii yoelii GN=PY00123 PE=4 SV=1
VKRIIDLLAKYVTEEGYAFEEIIKKNEKDNPMFNFIFNT-SDLHYYYKWRVFSF-
>tr|A0A1D3TDE0|A0A1D3TDE0_PLAMA U2 snRNP-associated SURP motif-containing protein, putative OS=Plasmodium malariae GN=PmUG01_13055600 PE=4 SV=1
LKRIIDLLAKYVTEEGYTFEETIKKNEKENPLFDFLFNT-SDLFYYYKWRVFSF-
>tr|A0A023EUW3|A0A023EUW3_AEDAL Putative serine/threonine-protein kinase fray2 (Fragment) OS=Aedes albopictus PE=2 SV=1
-RRLRLREIEVK-------------------IVQYQDEFESGARqVRVGWTVAEE-
>tr|A0A1Y1M5Y1|A0A1Y1M5Y1_PHOPY Uncharacterized protein OS=Photinus pyralis PE=4 SV=1
-DSIIDKCREVMIETINPKSAMlyynaacqyadkklqetcmqwflvnlmtfyyygslcylrtipipLMTRLVAN---PDLFVVQTEFSLYVMlkfWMYMHV-
>tr|W5JUN7|W5JUN7_ANODA Germ cell-less protein OS=Anopheles darlingi GN=AND_001206 PE=4 SV=1
-DGLIDRCAEVMVETTCADTAVlyyeaaceygvkrvkqftfgwllvnllnicrkkantlrlisveLMEQLITS---ADLYVMQTEFYLYTLlryWMTLKL-
>tr|A0A1L8DNQ1|A0A1L8DNQ1_9DIPT Putative conserved secreted protein (Fragment) OS=Nyssomyia neivai PE=4 SV=1
-HSLIEKCSEVMIESIDQNTVVayyeaacqygvknvrdaafewlqvnllnfymkhlkllnqidleLMYKLISS---PDLFVMQTEYAIYYLlknWIYFQL-
>ERR1719348_1355690
-DTLVDGHHhvtaEFEMEDDEKLDHAiiscaahldglgsvqsnIMNITIEDLEPEIEYESEEAG------------
>ERR1719378_654877
-LSLINRLIEFVIREGPIFEATIKVVIPSDRSILSLIN---------RLIEFVI-
>tr|A9UTE3|A9UTE3_MONBE Predicted protein OS=Monosiga brevicollis GN=23437 PE=4 SV=1
-RQLIHRVIEYVINHGPHFETLLIQSIERDPQLAFLTAFKSPNHVYYRWKLFSL-
>ERR1719397_1780947
-LAGLVEWIPCKCSRTAS---------HDTSKLSSKLQTGPAEHVYYRWRLYSL-
>ERR1719209_1725659
---------------------KERWSSAPPHQFKFLFENQSAEHTYYRWRLFSL-
>ERR1719507_1341828
-SSLRISLLNTST--------------TAGDSFPCCKERRRTNG-VWNRSGCSS-
>ERR1719495_2069746
-------------GSSPILSSSSSSRtsllSTSTIGGDCIRCCRERPRISGRWNRFKC-
>ERR1719206_1223601
-LCLINRMVEFVIREGPMFEATIMNRELNNHTLIGLLPRPIsPVmlpsACCCsasLpnlTKPYPL-
>tr|A0A1D1V8N6|A0A1D1V8N6_RAMVA Uncharacterized protein OS=Ramazzottius varieornatus GN=RvY_09259-1 PE=4 SV=1
LRKIIETLAQYVGKNGTQFEEMVRTREANNPQFSFLRGGD--FYKYYANCVAA--
>ERR1740139_644481
--AAVDKMAEFKVRNGAAFETLIRERQRDNPNFSFLFDTSSPALRLLLTEGA---
>ERR1719272_429242
VKNVIDSTAHYVSKYGSEFEFHIIQQERLNptGLFVFLLEVESPNHFYYRWRLWSF-
>ERR1712224_1103342
L-FIIDKMAKYVSKYGVKFEDCVIKSQKREiySKFKFLFNCDSEEYYYYRWKVWS--
>tr|V5F352|V5F352_KALBG Uncharacterized protein OS=Kalmanozyma brasiliensis (strain GHG001) GN=PSEUBRA_SCAF1g00375 PE=4 SV=1
---------------DPAFEVTIKATEQNNPKFAYLKED-DVYHAYYASRRDAVR
>ERR1712093_411067
---------------PATFEDKIRARQSSDSRFSFINDK-DAYHPYYQYRVKFYR
>tr|A0A0J0XPN5|A0A0J0XPN5_9TREE Uncharacterized protein OS=Cutaneotrichosporon oleaginosum GN=CC85DRAFT_307819 PE=4 SV=1
---------------PQVLEDKIRETQTTDPRFAFLNDE-DPYHQYYRWKLEYSR
>tr|A0A194S1G9|A0A194S1G9_RHOGW Uncharacterized protein OS=Rhodotorula graminis (strain WP1) GN=RHOBADRAFT_48643 PE=4 SV=1
---------------PALLEEKFKLREKSDSRFAFVNAD-DPYHAYYADRLAAFK
>ERR1719215_945185
-KNLIDKVAQYVSKNGSQFEQMMKEKQRGNPEYSFLFQN-GAFNKYYLSKVRSE-
>ERR1712227_320622
-KTLIDKVAQYVAKNGEGFEEMMKKKQTGNQEYDFLSPG-GQFHQYYQIKLRAE-
>ERR1711953_149143
--------------KG-----AISRDATVRPHFLEL----HILRSLYRHLFRQK-
>ERR1711907_431764
IRGTIEETASFVYSYGREFEQLLLDRTNDRrsslyDKFRFLIENYSY-------------
>ERR1719447_2647965
SRRLIDKLATYVVRYGMQFENLVLERERSRrst-RFYFLFEVDGSDHMYY--------
>tr|A0A151ZDT6|A0A151ZDT6_9MYCE SWAP/Surp domain-containing protein OS=Dictyostelium lacteum GN=DLAC_06947 PE=4 SV=1
-KYLIDSMATIVFREGFPFERTIIEKERGNPNFSFLFDTTSDEYYYYSWKVYSL-
>SRR5690606_25750428
-LDRINKIALFVAHDGYQFERIIKERERGNPKFDFLFYEDSNDHIYYMWKVYSL-
>ERR1712185_489816
-RALLDRLARYVSKDGAPLESIIAQREVENPDYVCLREPKSRDGLYYRWKVYSL-
>tr|A0A024G2U8|A0A024G2U8_9STRA Uncharacterized protein OS=Albugo candida GN=BN9_018790 PE=4 SV=1
-RERVDILARFVARNGACFETQLALREASNPDFAFLSEsmdsttAASPLYLYYRWRVYSF-
>tr|F0W005|F0W005_9STRA U2associated splicing factor putative OS=Albugo laibachii Nc14 GN=AlNc14C3G483 PE=4 SV=1
-WKRVDTLASFVAKDGATFETQLALREASNPDFAFLSEsmlpvtKASPLYLYYRWRVYSL-
>tr|J9EMJ1|J9EMJ1_9SPIT Surp module family protein OS=Oxytricha trifallax GN=OXYTRI_12106 PE=4 SV=1
IKSVIDKTANFVAKNGANFEALILKTEQNNLKFNFLRHLDDPYRPYYMQKIND--
>GraSoiStandDraft_57_1057295.scaffolds.fasta_scaffold562783_1 # 1 # 444 # 1 # ID=562783_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.732
---------KLVAKFGSNIETMMKNEDKNLPKFSFLNT-GDPYRPYYESVVNT--
>NGEPerStandDraft_5_1074534.scaffolds.fasta_scaffold464305_1 # 1 # 381 # 1 # ID=464305_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.719
LKVLIEKTAAYVARAGFVLEAEILSRQANNPQFAFLNP-DNVYNPFYKAKIAQ--
>ERR1712048_587272
-------------------------QKVGNKKFDFLKST-DPYNPYYQKMLYDK-
>ERR1719345_159053
SSRLARRPACLAA---PK---------------LVLLPR-LVLIVIF--------
>AP41_2_1055478.scaffolds.fasta_scaffold1661184_1 # 1 # 225 # -1 # ID=1661184_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.329
TYGIIDKTALLVAKGGDGYEKRIW-REQSHTRFDFLLPN-HPYRPYYEQKVEECK
>ERR1719234_1087808
--FLINRTVEFVIRGGPVFEAILINRERNNLEFNFLVNYGYYEHLYYRWKLFSL-
>ERR1719354_557275
RRQIIDCLSEHVAMHGMAFENVAISKVnsgqtDHQEDFSFLLAVDSPDHMYYRWRLWSL-
>SRR4051794_7489527
LRRLIDLLAAYAAdpERGPAFEAAMVARAqaaaaaggEQPGELAFLLHPQSQEHLYYRWRVWAL-
>SRR5947199_4859193
---RIHAVIEGVAEYGADFEALLMEREKFNEDYAFLFDSNvfpspliqvinkkLPDAQYYRWKSYSLR
>KBSSwiStaDraftv2_1062776.scaffolds.fasta_scaffold11948527_1 # 1 # 237 # -1 # ID=11948527_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.582
IREVVEKTAAFVARGGPEFETKIRASD--TtGKFNFLNAD-HPYNAYYKHKITEL-
>ERR1712238_239313
IRVVADRTAEYVAKNGRGFESRIVNSAKGkTPKFAFLQPT-SPFHAYYEERIQH--
>ERR1740124_97442
IRAVTHKTALFVSKNGRAFEARIFNSAKGkTPKFAFLHET-SPFHRYYEERILF--
>ERR1719507_3012794
LRKRIDIMAEHIARNGADFEGTVRQKNVNNPQFSFLYNGE--GAEYYAYVLQGFR
>ERR550514_2091786
LAKRIEVMAEHIAKSGAHFEEMVKARSHGNEQLGFLFDGE--GSAYYRSVLAERK
>tr|A0A1Y1YET0|A0A1Y1YET0_9FUNG Uncharacterized protein OS=Basidiobolus meristosporus CBS 931.73 GN=K493DRAFT_337058 PE=4 SV=1
MRNIIDKMADFVTRNGPSLEEKVKANRRNDPKFGFLLPR-HEYHAYYQEKIRDL-
>tr|A0A068S5Q4|A0A068S5Q4_9FUNG Uncharacterized protein OS=Lichtheimia corymbifera JMRC:FSU:9682 GN=LCOR_08622.1 PE=4 SV=1
IAGVIQKTAQSVARAGKALETRIRQGHGQNPKFGFIHPG-HQYHAYYKQQLESF-
>tr|A0A077X061|A0A077X061_9FUNG Uncharacterized protein OS=Lichtheimia ramosa GN=LRAMOSA04774 PE=4 SV=1
IANVIQRTAQSVARAGKALETRIRQGHGQNPKFGFIHPG-HEYNAYYKQQLELF-
>ERR1719235_392113
VQKTIDSLAVFVARNGPGFEAIAKERNAFDSKFNFLSGG--LGAQYYKWRLHEE-
>ERR1719206_1033864
MKTIIEKTVSFVVKNGIGFESKVRESQRSNRKFDFLNPG-HAFHNYYKQKY----
>tr|A0A084VVY9|A0A084VVY9_ANOSI AGAP003635-PA-like protein OS=Anopheles sinensis GN=ZHAS_00009830 PE=4 SV=1
IQHVIDRTAIYVAKNGYSFEEALRIK--NDPRFVFLNRA-HEYYPYYAYLVRQ--
>tr|A0A2H1W206|A0A2H1W206_SPOFR SFRICE_001484 OS=Spodoptera frugiperda GN=SFRICE_001484 PE=4 SV=1
VQIVIDKMASYVARNGDEFADIVRAK--NDPRFTFLDPE-NIYHPYYKRLMQQ--
>tr|A0A0L7LIN6|A0A0L7LIN6_9NEOP Protein suppressor of white apricot OS=Operophtera brumata GN=OBRU01_03977 PE=4 SV=1
------------------QSDIVRAK--NDPRFTFLEPS-NVYHAYYNRLMQE--
>tr|Q22RL3|Q22RL3_TETTS Surp module family protein OS=Tetrahymena thermophila (strain SB210) GN=TTHERM_00013750 PE=4 SV=2
IRTIIDKTAEFVVKHGAAVEENIIQAQVNNLSFNFLKQN-DPYRPYYDSKIAEF-
>tr|A0BZY6|A0BZY6_PARTE Uncharacterized protein OS=Paramecium tetraurelia GN=GSPATT00005955001 PE=4 SV=1
IKKYADKTAEYVAKNGATFEDLVMQKELSNPNFCFLRRD-DPYRPYYENKITEF-
>tr|A0A0V0QZQ3|A0A0V0QZQ3_PSEPJ SWAP/Surp OS=Pseudocohnilembus persalinus GN=PPERSA_11098 PE=4 SV=1
IREVADKTAKYVIDNGENFEQLVIMEEENNKQFYFLKKD-DPYRAYYEHKKREF-
>tr|Q4DJL8|Q4DJL8_TRYCC Uncharacterized protein OS=Trypanosoma cruzi (strain CL Brener) GN=Tc00.1047053510609.130 PE=4 SV=1
-TAFLDLIAFYVVQGGPTAEEEIMKREENNSHFAFLHApWNDPMQLYYRWRLYSL-
>DeetaT_10_FD_contig_31_5083659_length_237_multi_8_in_0_out_0_1 # 3 # 236 # -1 # ID=1563130_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.590
VRAIVDKTAQFVAksKDGKAREDKLMSSVATNLKFGFLRLD-DPYRAYYDFKVKKI-
>ERR1711862_135268
--QFITTIASFVSKDGSLIEQSIINRERHNPKFSFLFSDsNIEERIFYRWRVYSF-
>tr|A0A1Z5KPW7|A0A1Z5KPW7_FISSO U2-associated protein SR140 OS=Fistulifera solaris GN=FisN_UnNu111 PE=4 SV=1
--HFISRAAFVVASDEHPLEQKLRGE------------qdrSwSDAERRYYLWRVWSF-
>MKWU01.1.fsa_nt_gb|MKWU01013367.1|_5 # 4409 # 4651 # -1 # ID=13367_5;partial=00;start_type=GTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.403
--VFISKVASVVARHGSTVENNIMERVKGDTRFSFMFAkrnpefyalsaetkLrILEEQRFYRWRVYSF-
>LauGreDrversion4_1035100.scaffolds.fasta_scaffold3562174_1 # 3 # 206 # -1 # ID=3562174_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.431
--HFISIVASYVARDGAILEKRLMEKEEHTPQFAFLRPldvhrtsReALEETIFYRWRVYAY-
>ERR1712091_218887
-RRLIDATALFTAADGRAFEEFVRVQEGANPEFAFLSLADSDDGRYYRWRVYEA-
>tr|A0A1J4MUZ7|A0A1J4MUZ7_9CRYT U2-associated protein SR140 OS=Cryptosporidium andersoni GN=cand_022190 PE=4 SV=1
-KALIRLTSRFVAYFGYCFEQLLMKNELENPLFNFLFIS-SPLHHYYRWRVYSF-
>SRR5210317_1610024
-RAFISTVASFVAKDGSLLEQKLIEAESSNPDFQFLSHGDggdderMAEHIYYRWRVYSL-
>SRR6056297_2358809
-----RVCPSPARTRPTLPPQAVMSREEANPHFRFLFDPSAPEHRYYRWKVVSL-
>WorMetDrversion2_2_1049316.scaffolds.fasta_scaffold791889_1 # 1 # 255 # -1 # ID=791889_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.549
-----VLTTTPSPPHRYAFEEIILEREVHNPQYRFLYEADSCEHRYYRWKVVSL-
>ERR1719284_1134337
-DQKQE---ALTNNEAR--SLMRQDSKSRWENGWMFPRG-GRHHNYYVHKLL---
>ERR1712012_1060823
-IGILAT------------AQLLLSrvFHFFLLLMHLIRDK-DRHHNYYVHKLL---
>ERR1719187_1749260
-SRIYDQEDQERANSGRSLHDHVKSERSSspvgePRRGSFSGDF-NRDNL----------
>ERR1719282_1211899
-TCSSHSSPSYVAKNGRSFEDVVRSRPGASEKFSFLTPG-DRHHNYYVHKLL---
>ERR1712155_330494
------------LSLSLLFSSCLFFCSS--DSFFCSSSL-GSLYNYYLHKLN---
>ERR1719414_881606
-QILIDKTACWACKEIIKDGNdhaveekLGLLKDSDKETFEFLFPK-SKYHNYFKFKLA---
>tr|A0A0V0QIS1|A0A0V0QIS1_PSEPJ SWAP/Surp OS=Pseudocohnilembus persalinus GN=PPERSA_04833 PE=4 SV=1
-RRFIDRFARLVQQDGYGVEQFIQENHKQNLQYAFLFCQNCPEHIYYKWRLYSF-
>ERR1712217_957717
-CILIDICAKYVVETGPIFETIIKNREQNNKDLDFITCSISPEHFYYCWRLYSL-
>ERR1719300_26347
-RRIIDCLAEFVIQYGYEFESYVIrTNQGRNDDFNFLLKVQTQEHMYYRWRLYSL-
>ERR1711918_330905
-KERIDLTALNTAVYGYPFEAALMETKTNQSDFMFLFEKESPEHQYYRWRCWA--
>tr|A0A1D1YB41|A0A1D1YB41_9ARAE Zinc finger CCCH domain-containing protein 55 OS=Anthurium amnicola GN=Os08g0135800_2 PE=4 SV=1
--HNIEALCKFMTTVGPQFEDLARAKEIGNPGFSFLFGGEpgsaaAIGYEYFQWV-----
>tr|A0A061FYY9|A0A061FYY9_THECC Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_014295 PE=4 SV=1
--ERIEALCQCIAKNGPDYEDMVRKKESSKPEYAFLHGGElgseaAIAHDFFQWM-----
>tr|A0A1Q3C4Z7|A0A1Q3C4Z7_CEPFO Uncharacterized protein OS=Cephalotus follicularis GN=CFOL_v3_18794 PE=4 SV=1
--QKIEFFCQSIAKNGPDCEDIARRNEFGNPEYDFLFGGQpgsdaATAQEYFLWL-----
>tr|L1I4K6|L1I4K6_GUITH Uncharacterized protein (Fragment) OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_149666 PE=4 SV=1
----------YVARNGKAFEDMIREREGANPKFTFLREK-GEGCEYFRWRVYCL-
>ERR1740130_1525916
----------YVVRNGPAFEQLMKTKEGANPKFSFLFGPQgnPEQHAYYQWCLHCY-
>ERR1740123_306866
LRRVIDVLAKYVARYGAHFEEAIRKDlgkrrreiveklgvDNGDLDFGFLVeDAETPEAHYYRWRSFSY-
>ERR1719356_2338072
MRIFLDLLAKFVAKHGEDFEEAIRTDvdkrwretlrnlghEAESISFDFF-dDTDSDEARYYRWKTFTL-
>ERR1719461_1204081
-------------------EEAVRIDvsrrrldiledlglGRdsgALLDFSFLNeRALSDEALYYRWKAFTL-
>ERR1719460_833044
----VDTIAKFVSRYGDDFEKVLIADatkqkealqggEElgsRDLNLQFLLePPDAPIARYYRWRTIAY-
>ERR1719313_2091706
MaRQVIDAIAKYVARHGEHFEEALRRDvrehpERvagLNLDIAFLDsGAGTPEAQYYCWRAFAY-
>ERR1719191_1210627
TrRKIIDVLAKHVAQEGYQFEEGILRDvqklnheg--haISGEFGFVVQTSCDDGIYYRWRMLAF-
>ERR1712070_1227392
-----------------KFEEGIIQDlekrgaelkrclsqSTnSDVDFSFLKEVNTEDSIYYRWRMFAF-
>tr|A0A0L0HLP4|A0A0L0HLP4_SPIPN Uncharacterized protein OS=Spizellomyces punctatus DAOM BR117 GN=SPPG_03522 PE=4 SV=1
--SILDKMAEYVARNGQEFLEITKERQKHNPRYRFLWPE-DANYPYFQWKVQQFK
>ERR1711988_44237
IKVVIDKLAVMVRKNGIVFERKVKEEQANNPRFDFLKPW-DTYHAYYQMRVLQ--
>ERR1719162_415914
--HFISITASYVAKDPEL-ETRLKDEEKGNPTFSFLAFDsfdgeNLKEKTFYRWRVYSF-
>ERR1740124_1305236
--QFITTVASFVAKDGSLLEKKNHRARVQQPPLLL--PPppppqplhhnh----------------
>ERR1719215_94651
--HFLTTIASFVAKDGSVLEKYLIDRESGNFEYAFLVPDis----------------
>ERR1712127_938799
-----------------LLERKLIETTAFNSDFYFLAPCeddsdyeeHFTEHIFDRWRVYSF-
>SRR6056300_296558
--NFISTVAQFVAKDDIALEQALFQQHQQQslmdanNSMDFIWTPsqfrqrkqgetvsssrfdlQRQEHIYYKWRVYSF-
>ERR1711871_549735
-RKLIDLLASFVAVDGEAFECELMAKESHNEEFLFLFSTPSKLGYYYRWKTY---
>tr|A0A0J8B005|A0A0J8B005_BETVU Uncharacterized protein (Fragment) OS=Beta vulgaris subsp. vulgaris GN=BVRB_022680 PE=4 SV=1
LRPIVEKTAAYVAKHGTAFEAVILSKQQNDRRFDFLHPD-NAYHSFYKSKLVQ--
>ERR1719193_2657333
-AKRIDMMANFIVRNGSEFERMVMERNQGNPQFGFLFGAEGSGHGYFRETLKKL-
>ERR1711990_1203549
----SIKWPNMWQKNGDEFATMMMKKQKGNPEYSFLQPG-GSFNKYFDSKVKT--
>tr|A0A034VKK5|A0A034VKK5_BACDO Protein suppressor of white apricot OS=Bactrocera dorsalis GN=SUWA PE=4 SV=1
LKNIIDKTATYVVKNGRQFEETLRAK--SVDRFTFLLYD-NEFYPYYLYKVT---
>ERR1719264_964320
LKELIDDLASKVAKFGIKFEEIVKTMQDlkDKYDLSFIEEG-KPFNDYYRHKIRQ--
>ERR1719232_579204
FKDLIDNMAAKVAKYGITYEDLVRKIVQdmedtqgsQPHDISFFEEG-KPFNDYYRHEISR--
>ERR1712110_983499
TKDLIETLAKNVAKHGKVFEKKFKEKQGKNSNVSFLEEG-KEFNDYYKHRVAE--
>ERR1711894_650327
TKDIIETLAKNVAKHGKVFEKKFKEKKGKNSTVSFLDEG-NEFNDYYKHRVAV--
>ERR1712110_666619
TKNLIETLAKNVAKHGQILEKKFKEKQGNNSTVSFLNEG-NEFNDYYKHRVAV--
>tr|U6H6M4|U6H6M4_HYMMI Splicing factor arginine:serine rich 8 OS=Hymenolepis microstoma GN=HmN_000856300 PE=4 SV=1
LRSIIDKMAEYVARNGDEFQNVVKSKKQDDPRFAFLQTG-HIHHDYYIVKKK---
>tr|A0A210R217|A0A210R217_MIZYE Zinc finger CCHC domain-containing protein 8 OS=Mizuhopecten yessoensis GN=KP79_PYT15734 PE=4 SV=1
-NAIIEKTAKFIAEHGAQMEIIIKTKQSNNAQFEFMHFE-NTLNPYYKHMVKMIK
>tr|S9XQ49|S9XQ49_CAMFR Uncharacterized protein OS=Camelus ferus GN=CB1_000022002 PE=4 SV=1
-HAIIERTASFVCRQGAQFEIMLKAKQARNSQFDFLRFD-HYLNPYYKFIQKAMK
>tr|A0D7R2|A0D7R2_PARTE Uncharacterized protein OS=Paramecium tetraurelia GN=GSPATT00014046001 PE=4 SV=1
IRGIIDKLARQVVKEGAQFEQMIKQREINNSKYAFLYLQ-SEENEYYKWRVYSF-
>tr|J9JP45|J9JP45_ACYPI Uncharacterized protein OS=Acyrthosiphon pisum GN=LOC100160752 PE=4 SV=2
LQIIIDKMASYVTKNGKQFEETARKR--QDPRLKFLEPD-DSFNAYYTQKLK---
>ERR1719483_1222017
IQPIIDRMAMYVAKNGEEFEIVVKSR--NDNRFSFLNEY-HAHFPYYSVMKA---
>ERR1711934_97511
LRTVIDFTAERVAREGFGFELALKAAEKGNEAFTFLRDHESPDHAYYRYKVL---
>tr|A0A066W010|A0A066W010_9BASI Uncharacterized protein OS=Tilletiaria anomala UBC 951 GN=K437DRAFT_248024 PE=4 SV=1
IRNIIAKTAEYIARLGKQFEERMRSEDKG--------------------------
>ERR1719414_530734
---MINWC--ISLIGCPQFLGWQRAK--KDLRFQFLEKG-HEYHPYYLK------
>ERR1719195_1945948
-W-LLDRLVTYElhshekEQARLAAAGSAIT---mqarassradadidgeavsctvdgfpld-ETDLdGQRLELGDLSALIHIVEATRQ-
>ERR1719424_2386074
-W-LIDRLASYElhshdkEQERSENAVAAAKAAVeaaeadrkrkrmerkkkeedeldgeaidgidgltlt-DKDLdGEPIVTDDPSELLQLAESAAA-
>ERR1711879_565399
-Y-LVDRLACYElhshekEQARHVVSATQASA--sasrkglsrtesfdldgeaiecdidglpvd-ACDIdGEKVNFDDPAQFMAI-------
>ERR1712232_1284509
----------------------------rhsddidgeaiedmdglplt-EGDLdGEPLGEDDPGELVKLADLVSL-
>ERR1712232_300781
----------------------------gddfdgealteidglplg-DSDLdGEAIVDDDVGEILKLVELADL-
>tr|A0A0D1DWZ8|A0A0D1DWZ8_USTMA Chromosome 10, whole genome shotgun sequence OS=Ustilago maydis (strain 521 / FGSC 9021) GN=UMAG_10477 PE=4 SV=1
-RQLIETVASRIRSNGAHFEHILREREAENAQFAFLFEPDSVLHHYFRICLD---
>ERR1719421_1554010
--------------------QKVLSSdGGRTAKFNFLKPY-DPYNAYYEMKIREN-
>ERR1719238_996339
--------------------DAILTApTSPATILLldaNKRKP-HNTNGYKQPKLSLF-
>ERR1712039_838784
KRKQIDRMAKYVAQEGHHFEQVVVELESPNGNFSFLFDYN---------------
>ERR1719223_1796426
-AKFITTVAEFVSKDGSVLEQKLVDtRGHE-GQFAFLRpipggtdGAASREHVYYRWRVYAF-
>ERR1719161_777325
---------KFVAQEGYQFEEGILRdVQKLaqeghaiSGEFGFVHQSSGDDGIYYRWRMLAF-
>ERR1719192_511185
----------------------------AEDHWDAVQKK-PRLRIFTTHqQNTEV-
>ERR1719414_881606
-ANVLEKTAEFIAKNGAQMEILTRAKEAGNPKFQFLNPD-NPYHPIYKQVLEKK-
>ERR1719150_471280
-ANVLEKTTEFIAKNGAQMEILMRAKEAGNLKFQFLNPD-NPYHPIYKQVLEKK-
>ERR1719494_1800144
LQPIVDKLAHYVAKNGDEFEKIVRAK--QDPRFDFLNIW-SSHYTYYEHKKKLI-
>tr|A0A1W4XRW9|A0A1W4XRW9_AGRPL splicing factor, suppressor of white-apricot homolog OS=Agrilus planipennis GN=LOC108744321 PE=4 SV=1
VQLIIEKMASYVSKNGTEFENIVKAK--GDPRFEFLNEE-HKFHSFYQKKLEEF-
>ERR1711988_25572
TKNVIDKLAQFVLKSGRSFEDTIKQKQANNPVFSFLSGGE--HAPYYTWMKY---
>ERR1711988_535439
LVHCALRLPAVPAN-------PLNQRHI--LILKMX-------------------
>ERR1719472_160567
--DAIKKVAEFVCQHGAEYEATFRERQKDNEMFRFLWDTDSVEHALYKNTLST--
>ERR1719460_248670
----RRGRSQRSPVAGAEYEETFRERQKDNEMFRFLWDTDSVEHALYKNTLKT--
>ERR1719201_723535
---RVYKLAQFVVKNGVQFEQMMKAREANNPLFAFLFGG--QYHSYYVWVR----
>tr|C1N1T9|C1N1T9_MICPC Predicted protein OS=Micromonas pusilla (strain CCMP1545) GN=MICPUCDRAFT_51730 PE=4 SV=1
-KKFIDTTARFIAKNGPAFEAMAKERQKHDPKFDFLFGLG-DDVKYYAHKLAES-
>ERR1719173_274436
-IDLMKLIRIPKEM----YIQ-----------------Q-QPIDKYTVNLPQ---
>ERR1740136_456816
-----------------FFQFCFKVYVSSNKKFGFLFRE-NEHYPYYRKLLE---
>ERR1719187_2240098
-RNVDGQiveeddlaavpqlesQEELLLRQRVEEELKRRAAANNMVESPFT-----------HDMI----
>ERR1719203_2636544
--TGPAGviddddlaavprlesKEEERFRKRLEEEIRL-AGSGSMTTSPFT-----------GDLI----
>ERR1719204_590497
-FELLQKSatkqliSQYFNL--NRW---FNHIQTFVKEFEFLDPV-HPVFPYFQKLVN---
>ERR1719498_1751091
---TIEElddandlstipqvlsKKEHTLRNKIEEFLKQRASKSNFVESPFT-----------KDMI----
>ERR1712080_259173
----ILNldnrgeikvtia---DDIKEKQE-INLDINENNNKVKSPFT-----------GDMI----
>ERR1719461_4655
-VDLIKLTAQFVARNGRTFLNALSSQAQsaqgtLSQRYLFLNPI-DPRFPYFQKLVQ---
>tr|A0A267DIG6|A0A267DIG6_9PLAT Uncharacterized protein OS=Macrostomum lignano GN=BOX15_Mlig019567g3 PE=4 SV=1
-LSVINRMVEHVVREGPLFELLVCQRESDNPRYRFLFDYNSASTCTTAGASTqsCM-
>tr|A0A1I8F4V9|A0A1I8F4V9_9PLAT Uncharacterized protein OS=Macrostomum lignano PE=4 SV=1
-LSVINRMVEHVVREGRAIRVGSSASaSRTILAYRFLFDYNSAEHVYYRWRLYSI-
>tr|C5KMR9|C5KMR9_PERM5 Amino acid transporter, putative OS=Perkinsus marinus (strain ATCC 50983 / TXsc) GN=Pmar_PMAR029293 PE=4 SV=1
HRSLIDKIARMVADNGRNVEQVAIMrvqaRNGRKEKYNFLFDYDSPDGQYYRWRTFA--
>tr|A0A0G4H858|A0A0G4H858_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 GN=Cvel_5857 PE=4 SV=1
LRQRIETVARFCQQNGPMFESSTRSKEAHNPLFQFMFGGE--GAEYFLWCRYC--
>tr|A0A0G4GAI3|A0A0G4GAI3_VITBC Uncharacterized protein OS=Vitrella brassicaformis (strain CCMP3155) GN=Vbra_9791 PE=4 SV=1
LKNVIERIADYVTKNGPQFVDVVREKHPNDPKFAFLSGGE--GHDYFRFCLSA--
>ERR1719284_621401
-NAIIEKTSQFVALNGNIFEEKILAQNKGNTKFSFLVST-DPYHLYYKTKLTKL-
>ERR1719357_1843590
-QTIVDKMASYVARNGRSFEEVESGA--VPPSGPIVIPP-TDL-QTIVDKMA---
>ERR1719509_272253
-IKIHQILKEEVATRGRGLENTSDTG--RGQDPGIKENG-GN----VLGPGrgrG---
>ERR1719189_3156416
-QILIDKTASWICKqniqsgnknAGMDKLAVVKKM--HKDEFDFLFPE-GKYHTYFLYKIA---
>ERR1719350_2510378
-QILIDKTASWVCKenlksenkqSGEERLSVVKKL--HKDKFDFLLLE-TNIITTIYSKX----
>ERR1712241_197120
-QALIDQTASYVVKNGGEKLGILRKR--MPDKFAFLRSD-NKYNTYYQFKVA---
>ERR1719187_618897
-SSSRSKSRRHKRHRSRS--RDSRRK--RRH------------------RSrsrS---
>ERR1711962_163469
-QMVIDKMASYIMKNGPEFESMVRSR--GDERFAFLNKK-HKYNPYYRCKMR---
>tr|A0A1R2CNQ2|A0A1R2CNQ2_9CILI Uncharacterized protein OS=Stentor coeruleus GN=SteCoe_7010 PE=4 SV=1
IQALIEKTAEFVARNGSAIEAKIWETQKSNPKFSFLKHG-DIYRSFYEQKIHEY-
>ERR1719409_842248
LRRLADLTARFVAADGRTFEDFIRLRERDHGDFSQLLDVSSQAGRYYRYRAWE--
>ERR1719247_316414
-----LGQAHWCASRGREA--------PCRTPAARSRPVSSQAGRYYRYRAWE--
>ERR1719502_398719
----------------------VRVREAGHPDFAQFLDTRTPEGAYYRFRVWE--
>ERR1719387_1276303
LRSLIDNFAIFVVEAGPNFEMLISEREIDNKDYEFIRNCDLPEYLYYQWRIHM--
>ERR1711920_524116
LRSLIDKFSIFVVEYGPNFEMLISKREINNKDYEFIRNRNSPEYLYYQWRIHM--
>tr|A0A066VD45|A0A066VD45_9BASI Uncharacterized protein OS=Tilletiaria anomala UBC 951 GN=K437DRAFT_241154 PE=4 SV=1
--RFISAVAERVLQLGHKFEQMIYEKEKDNPRFDFLRNQKLLEHQYFRMLVD---
>tr|A0A177VPF7|A0A177VPF7_9BASI Uncharacterized protein OS=Tilletia controversa GN=A4X06_g3424 PE=4 SV=1
--ANVRTVAQRVQQHGPKFEELLREKEkdKDNPKFAFLWDEKSVLFQYYNMLLN---
>tr|A0A177TAC8|A0A177TAC8_9BASI Uncharacterized protein OS=Tilletia indica GN=A4X13_g7120 PE=4 SV=1
--TNVRTVAQRIQQHGPKFEQLVREKEKDNPKFAFLRDEKSVMYQYFNVLLN---
>tr|A0A0P1BMG3|A0A0P1BMG3_9BASI Predicted splicing regulator, contains RRM, SWAP and RPR domains OS=Ceraceosorus bombacis PE=4 SV=1
--ELIRDVAKRVVNHGDGYYlKAMRDNGttDVKDKFAFMWDENLPAYHYYRTLVD---
>ERR1719231_1073587
------------------FEERIKKENPDKRKFEFLHDE-SPYNPYYLNRIKT--
>ERR1719161_725264
QKRFIDLLARYVARHGAVFEDAIKRdcdqrgqeireyltkdLEGGKVGFRFLYEVDSDEAIYYRWRMFAF-
>ERR1719386_328496
QKRFIDLLARYVARHGVVFEDAIKRdcelrgpeireyltkdLEAGKVGFRFLNETESDE------------
>ERR1719316_48509
QKRFIDLLARYVARHGVVFEDAIKRdcagrgneireyltkdLEAGKVGFRFLSEIESDEARYYRWRMFAF-
>ERR1712093_68135
---------------------------DTVQDFAFLDKdADTEAAHYYRWRTFAL-
>tr|A0A1D1VL28|A0A1D1VL28_RAMVA Uncharacterized protein OS=Ramazzottius varieornatus GN=RvY_12314-1 PE=4 SV=1
-HTIIEKTAFFIADKGPQLEIVIRSKQANNESFNFLQPR-DALYPYYRFLVEAIK
>tr|A0A1W0WQP4|A0A1W0WQP4_HYPDU Uncharacterized protein OS=Hypsibius dujardini GN=BV898_08460 PE=4 SV=1
-NSIIEKTALFIAEKGRPLEGIIRAKQANNDSFSFLSPK-DALHPYYNFLVDAIK
>ERR1719285_5095
LQPIIEKTIEYVVRNGVNFEAELRKRNGSSTKFAFLLPK-NPYYPYYKRELF---
>ERR1740124_342161
TKKIIDTTVDYIVRNDIHFEAELKKRNASSnKKFGFLFRE-NEHYPYYRKLLE---
>ERR1719509_103223
VTSFETGS-----RSRASCESASRAAGS----STSSSRG-TRTTSGTSGNSP---
>ERR1719509_496265
-FNS------STSTVPSITSINTSSRQSNNPAFQFLNLD-SPLNHFYKHLVKM--
>ERR1719273_437433
-CKCSGKNDRVHCQKWSTNGNFNASQGSGKSQISIFESC-NPYHPIYKQVLEK--
>tr|A0A0D3DEI6|A0A0D3DEI6_BRAOL Uncharacterized protein (Fragment) OS=Brassica oleracea var. oleracea PE=4 SV=1
-ITLAARVAYLVSTQGVDFEMQLRDSQVNDTRFNFLRDASHVCHAYYQRSLAYL-
>tr|D7MAG1|D7MAG1_ARALL Uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_915240 PE=4 SV=1
-TSIVDTTARLVSKFGLEFEMMVKESNTDDERFNFLKSSEDPYHALYKQKLDEY-
>tr|O23467|O23467_ARATH SWAP (Suppressor-of-White-APricot)/surp domain-containing protein OS=Arabidopsis thaliana GN=dl4140w PE=4 SV=1
-TAIVETTSCLVSQFGSEFEMMVKDSNTDDARFNFLKSSEDPYNVLYKQKLDEY-
>tr|R0F5E3|R0F5E3_9BRAS Uncharacterized protein OS=Capsella rubella GN=CARUB_v10004868mg PE=4 SV=1
-KTLIERTAILISKKSSTVEERLRRCNLTNPRYKFLHR-SDPFHAFYQQKLREY-
>ERR1712106_98297
--G-MERMGLGTAKPGE-------------------VGEGDDEFDQYRKRMMLayrfrpnpmvclfFT
>tr|D7LS50|D7LS50_ARALL Uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_906172 PE=4 SV=1
IVGLIEKMAYVVALRGPEFEKEMIIVNRGDTTFSFMSSS-DPNHALYQQKLTEY-
>tr|A0A0D3BIE8|A0A0D3BIE8_BRAOL Uncharacterized protein OS=Brassica oleracea var. oleracea GN=106334682 PE=4 SV=1
ILVVIERTARAVAQLGLEHEKLILEANRCIENApykppgsktvleversGFLkSCD-HPYHQVYRKKRDAY-
>tr|A0A0D3AJB6|A0A0D3AJB6_BRAOL Uncharacterized protein OS=Brassica oleracea var. oleracea PE=4 SV=1
KTNFIEKVARLVSQKGLETERTLMALNNDNdKKFSFLQRS-DPCHAYYQRKLNYY-
>tr|A0A1J3E816|A0A1J3E816_NOCCA Putative splicing factor 3A subunit 1 OS=Noccaea caerulescens GN=GA_TR5591_c0_g1_i1_g.17843 PE=4 SV=1
ELSNVEAVARLVSEKGLEMEKTLMALNTDNdNKFRFLWRI-DPRHAFYKQKLKEF-
>tr|D7L1N2|D7L1N2_ARALL Uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_898699 PE=4 SV=1
NRIFVETIARLVSQKGLELERTLMSIDTNFngEIFRFFCNS-DPSNVFYKQTLNEY-
>tr|D7M2Y9|D7M2Y9_ARALL Uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_909018 PE=4 SV=1
IKNCLEETAYVVAKVGdaraLEFERRIFAADVENAVFNFLHPS-DPYHAYYKEKVTEY-
>tr|A0A0D3B7P5|A0A0D3B7P5_BRAOL Uncharacterized protein OS=Brassica oleracea var. oleracea PE=4 SV=1
VRCIIEKTAKFVCKHGLEIENRIIASNVKNARFKFLSSS-DPYHVFYKRKLYEY-
>tr|A0A078G4Q4|A0A078G4Q4_BRANA BnaC03g27620D protein OS=Brassica napus GN=BnaC03g27620D PE=4 SV=1
VRCIIEKTAKFIQVFKQLRSLS---CLLQTQTL----------------------
>tr|A0A1I7ZMA3|A0A1I7ZMA3_9BILA Uncharacterized protein OS=Steinernema glaseri PE=4 SV=1
VWKLIDQTVNYIAKHGPESEEETRRINYGDPRFQFLFGNE--HTEYYRFRL----
>tr|A0A2A6D389|A0A2A6D389_PRIPA Tag-65 OS=Pristionchus pacificus GN=PRIPAC_33892 PE=4 SV=1
VRGKIDKIASFVAKGGPQAEDRLRSDHAGKPQFSFLnFGGE--HSEYYRFAL----
>tr|A0A1I8BQ77|A0A1I8BQ77_MELHA Uncharacterized protein OS=Meloidogyne hapla PE=4 SV=1
LQRIIDSMAEYVAKNGPASEEVARQQNRNDPRYVFLFGGE--FSDYYKYRL----
>ERR1719295_149235
MKTIIEKTVQFVVKNGKEFESKVRESQRSNRKFDFLNTG-HAFNPYYNQQ-----
>ERR1712154_500380
LKQMIDRTAFYVAHFGKEFEDEIIKNGVFgEEENKFLvENG-SDLNKYYRWRVYSV-
>ERR1719174_940694
IRKKAETLARYVVKNGPSFETNVIEKHKGDPKACTHGALrfpfpwrE--GGEWLGP------
>ERR1739848_632275
IKKNCEALAKYVSKNGASFEKMAIARNEGNPLFAFLKGGL--GEDYYKH------
>tr|A9RJZ2|A9RJZ2_PHYPA Predicted protein OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_159541 PE=4 SV=1
--KNIDILSGFVVKNGPQFEDMARHKQADDPKFAFLFGGVpgseaAIGKAYYEWKKRS--
>ERR1719420_2763868
---------------GSKYY----EWVVGGFRRG-----LGKTAAPpahsaagakaapATP------
>ERR1719321_1838571
--ELVESIALAVRKNGPMFEQWLLKMRKASPGYGFLFEG-QPGHKYYLWA-----
>ERR1712065_10505
-KPFITTTAQYVANRGLGLERQISQRQASNPKFDFLKAS-SPYYAFYQKEIEK--
>ERR1719233_1195840
--ALIINFPHCESAKVVV---SL---PVPPSQHPPVVPAHGPQDVdavVPGAKLSVV-
>tr|A0A087SA08|A0A087SA08_AUXPR DNA excision repair protein ERCC-6-like protein OS=Auxenochlorella protothecoides GN=F751_4206 PE=4 SV=1
LRQAIDSLAFYVARSGGASEDLVRSTQVQNgAGHSFLLGG--PGSAYYAWKVHTLR
>tr|A0A1G4MGV4|A0A1G4MGV4_LACFM LAFE_0G03422g1_1 OS=Lachancea fermentati GN=LAFE_0G03422G PE=4 SV=1
LKDSVWKTAHYVARNGHQFEEKLHG-----DRFSFLQED-DEYNDYYRFLVSS--
>tr|A0A0P1KY80|A0A0P1KY80_9SACH LAQU0S27e00298g1_1 OS=Lachancea quebecensis GN=LAQU0_S27e00298g PE=4 SV=1
LKNTIEKTASFVARNGTEFEKKLDR-----VDFPFVDEQ-NAYYPYYKQFLGS--
>tr|A0A1G4K0H5|A0A1G4K0H5_9SACH LADA_0H03400g1_1 OS=Lachancea dasiensis CBS 10888 GN=LADA_0H03400G PE=4 SV=1
IKDAIEKTADYVSRNGPEFEQKLDL-----EQFPFIEKT-DPFHTYYRQLVYQ--
>tr|A0A1G4KFV7|A0A1G4KFV7_9SACH LANO_0G03268g1_1 OS=Lachancea nothofagi CBS 11611 GN=LANO_0G03268G PE=4 SV=1
WKEAIEKTANYVKRNGREFEQKLDV-----TQFPFLQEK-DQHNGYYQHLLKR--
>tr|A0A1G4JXR9|A0A1G4JXR9_9SACH LAFA_0G03378g1_1 OS=Lachancea sp. CBS 6924 GN=LAFA_0G03378G PE=4 SV=1
TRASIVKTAEYVKRNGKEFEAKLNV-----EQFPFLQES-NEYHGFYIGLLGH--
>tr|A7TGU3|A7TGU3_VANPO Uncharacterized protein OS=Vanderwaltozyma polyspora (strain ATCC 22028 / DSM 70294) GN=Kpol_2001p62 PE=4 SV=1
LIESIEKTVLYVRKNGVSFEDKLAGD----DKFSFVRPE-HKYYKFYKAKLNE--
>tr|G8ZSH0|G8ZSH0_TORDC Uncharacterized protein OS=Torulaspora delbrueckii (strain ATCC 10662 / CBS 1146 / NBRC 0425 / NCYC 2629 / NRRL Y-866) GN=TDEL0
ARETILKTAKYVAENGASFEKKLRDD----PRFSFVNPG-NSHYELYERMLVD--
>tr|A0A109UYN3|A0A109UYN3_9SACH HCL584Cp OS=Eremothecium sinecaudum GN=AW171_hschr31406 PE=4 SV=1
TLTEIAT-VLSQLQSSNTDLSELKEK----DNKSFLDSN-DNFNAFFEYSVAE--
>tr|A0A1Q3ACW3|A0A1Q3ACW3_ZYGRO Uncharacterized protein OS=Zygosaccharomyces rouxii GN=ZYGR_0AI07650 PE=4 SV=1
LKETIIKTATYAKQNGSPFVEKLKND----DRFSFTNPE-DVYYQYFQYILNH--
>tr|A0A0L0DRP6|A0A0L0DRP6_THETB Pre-mRNA-splicing factor sap114 OS=Thecamonas trahens ATCC 50062 GN=AMSG_10590 PE=4 SV=1
VKTICDRTAKFIASSGDEYEDKIRKQEAGNDKFAFLDPK-SPYHAYYAFRLEQA-
>tr|A0A2D3UUR7|A0A2D3UUR7_9PEZI Uncharacterized protein OS=Ramularia collo-cygni GN=RCC_01601 PE=4 SV=1
--RAIHTIADRLLSEpdserALELEAMLMALPEvqEDERFSFLYDSKSTAGVYYRYLLWND-
>tr|A0A179GMH7|A0A179GMH7_9HYPO RNA polymerase II, large subunit, CTD OS=Purpureocillium lilacinum GN=VFPBJ_06809 PE=4 SV=1
--RQIHSVVERIIKHGPAYETRLMARPDiqRDEKWSWLWDSKSTAGVYYRWRLWQL-
>tr|A0A177WJE4|A0A177WJE4_BATDE Uncharacterized protein OS=Batrachochytrium dendrobatidis JEL423 GN=BDEG_23630 PE=4 SV=1
--MVIHRSIERVLMFGFHFEMQLMTKTKDDAEFMFLRDTKSPEHIYYKWKLVSL-
>tr|A0A1B6H4P8|A0A1B6H4P8_9HEMI Uncharacterized protein OS=Cuerna arida GN=g.12593 PE=4 SV=1
IKEIIEKTAECVVKNGRQMEILIKTRQAQNKLFDFLTVG-TSLNPYYQYVLEAIK
>tr|A0A1B6KTN7|A0A1B6KTN7_9HEMI Uncharacterized protein OS=Graphocephala atropunctata GN=g.49693 PE=4 SV=1
VIEIIDKTADCVAKHGRQMEVLIKTRQAQNEIFNFLTVG-SDLNPYYEHVLEAIK
>tagenome__1003787_1003787.scaffolds.fasta_scaffold20766782_2 # 374 # 814 # -1 # ID=20766782_2;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.721
--KTMDVLAKFVAQCPDGFEDVVREREQMNPEFAFLRAGpYCPDAQYYRWRVYAF-
>EndMetStandDraft_9_1072997.scaffolds.fasta_scaffold925905_1 # 3 # 407 # -1 # ID=925905_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.657
--PEHHRSGGWVLPQAPRFEEAVVHRLQksrneglAGIDWSFLQEHSSPEAQYYRWRTFSF-
>ERR1719419_1222553
VRPVINRMAEYVARNGPEFEVIVKVKR--DDKFQFLESW-HPYHAYYISQ-----
>ERR1719376_965398
VRNCIDVFARYVAEHGFPLEQLVMESliERDkVDEYSWLFDPQCSDHYYYRWRVYAF-
>tr|A0A2D4GU88|A0A2D4GU88_MICCO Uncharacterized protein (Fragment) OS=Micrurus corallinus PE=4 SV=1
SSPVLGHKLR---------TVYLFCL--LLNRFEFLQPW-HQYNAYYEFK-----
>ERR1719333_1341784
-LEAINHLAEFVVNKDASMETIIKEKNQKNqggKRFFFLFKPESPEAIYYEWLKK---
>ERR1719460_3102034
LRKNIEIMARQVVQLGSSFEDMVLKGNRDKPQFAFLYGGE--GSEYYQQVLQAA-
>ERR1719326_2082432
LRKNIEIMARQVANLGASFEEMVLKNNRDKPQFAFLYGGE--GSDYYQQALREI-
>ERR1719461_1504073
LQTIIDKLRAMWPRMGEVL--------------RILSTT-RTRA-----------
>ERR1719394_1057841
LQTIIDKTASYVAKNGRSF--------------EDIVYN-KDKSRFVFLKSS---
>ERR1719410_3101195
--DNCGQDSELRGKNGRSFEEIVFSK--DKSRFGFLKPG-DKHYNYYLHKLN---
>ERR1719483_521266
---ESDSER---------------RR--RKKRKRSRSRD-RHRRkrSRSHHKKP---
>tr|A0A0A9XK95|A0A0A9XK95_LYGHE Protein suppressor of white apricot OS=Lygus hesperus GN=su(w[a])_1 PE=4 SV=1
VQKIIEVMVKFVIKNGLRFETVIMKK--GDVRFSFLNPK-NPYNAFYKQQLE---
>tr|D7LPI4|D7LPI4_ARALL Uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_347136 PE=4 SV=1
AERIVKNVARCIARKGLKYEKRMMTNV-KDPRINFLRNPEDPLHGYYKQKLSDY-
>tr|D7MAG0|D7MAG0_ARALL Uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_915239 PE=4 SV=1
LKTIIERTAEFIAKNGAEYEKEFLESHP---KLTFF-VSSDPNHAFYQDKLIEY-
>tr|D7KQU2|D7KQU2_ARALL Uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_890144 PE=4 SV=1
MRNLVEMAARFVSRHGSEYEKSIMNIKPhdERINFNFLKSSEDPLHGYYKQKLTEY-
>ERR1719453_2422175
--LLIEDIADVIARRGKEFSAWIKERRRDNPQYAFLYDG--PGHEYFEYCVRK--
>ERR550514_2192718
--EVIDKTAKFVQkaKDPAAFAKMIQAKNAGNPQFEFMNED-HAQHKFYQFLIH---
>ERR1719319_287409
-----CWCRCWCKGAGA---EVQVQRCWCR-CWCRGAHS-PRHHNYYLHKLA---
>ERR1719419_1212379
LQMvQMEACLCYRPQGSR---GS----TTSSRERERGRSG-AKGG-KEKGK-----
>ERR1719336_1872176
IKSIIEVTVSYVVRNGLSFEGELRKPQQSSRKFDFLISG-NPYYKWYKWKLA---
>ERR1719220_1060091
LRSIVLVTVDYVIRNGMNFEGELRKRQQNNRKFDFLIPG-NPYYKWYKWKLQ---
>ERR1719376_29632
IKDANVKWPAFVAKNGEAFEQAILNKKKD--QFEFLLPD-SPYRAYFDKRLREIK
>tr|C1N1E1|C1N1E1_MICPC Predicted protein OS=Micromonas pusilla (strain CCMP1545) GN=MICPUCDRAFT_51367 PE=4 SV=1
--KRVQKVAEFVVKNGKKFEDVTREKQRDNrVEFSFLWPG-GEGVDYYAWLKH---
>tagenome__1003787_1003787.scaffolds.fasta_scaffold2505706_1 # 2 # 184 # 1 # ID=2505706_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.645
--ERD------------RVDALEEGQSRGHr----QDGRG-GEGVRRGA-------
>ERR1719233_2831144
IKQIVKKLAVRVAHFGRVFEEKVKKQKNDDPSFVFLQSFN-KYHSYYLWVLD---
>ERR1711974_132915
VKQIVRKLALRVAHFGQVFEDKIRKQKKDDTKFAFLNSFN-KYHSYYLWVLA---
>tr|F4R418|F4R418_MELLP Uncharacterized protein OS=Melampsora larici-populina (strain 98AG31 / pathotype 3-4-7) GN=MELLADRAFT_114885 PE=4 SV=1
IRTIVDKTAAFIAKNPNpqLFEDKIRQREKTDSRFSFLSPD-DAYNAYYRHQIQSIK
>tr|A0A0L0VGV6|A0A0L0VGV6_9BASI Uncharacterized protein OS=Puccinia striiformis f. sp. tritici PST-78 GN=PSTG_08244 PE=4 SV=1
-SthllkSEV--DCRVHCKEPE--PSNFQRKDKTDSRFSFLDPD-DAYNPYYRHRIETIK
>ERR1719193_1712026
----HKKWLKFNHVEAPVMEILEMLKESGR----LTSI-------CTQTKLS---
>ERR1712156_1104909
----HAQR------------AGGRERPGPG----LAERS-ARVPNCGEDPX----
>SRR3989338_850217
--------RHCVRFDKGRMTVWealekldrvvdesdpdlqlpqivhaFQTAEGlrqdyPDWWGSsTIWVR-SSLYQS-----S---
>SRR4051812_29976171
--------ADYLRLDKRSLSVWeamealdaltddsdpdtdvpqiehsLQTMLAikkdgHPRWFQ---------------------
>SRR3989337_1531957
----LTTRAKARAVDGLALTVSevfelaetirdesdpdteesqwihaLQVAEGlrkkfPQKQYEWLWIT-GLIHDLGKILTC---
>tr|A0A212FMU3|A0A212FMU3_DANPL Uncharacterized protein OS=Danaus plexippus plexippus GN=KGM_206863 PE=4 SV=1
--AIIEKTAKFIAHQGTQMEILIKAKQGDNPQFQFLNKD-SSLHPYYTTLIALVK
>ERR1719295_671129
----HETWLKFNHMEGTIMEALDMLNELVD----QSDPD-LDLPNCGEDPRRASR
>tr|T2M9X4|T2M9X4_HYDVU SURP and G-patch domain-containing protein 1 (Fragment) OS=Hydra vulgaris GN=SUGP1 PE=2 SV=1
--ALAEELAQKVARDGPDAEAKAKIQYQKDPRYSFLYDPTHQVAKFFKAQVKKI-
>ERR1719494_1504859
--DVLDKVAAIVAAGGEMVEKLLKTKNEGNPQYSFLFNEGSEENKRYQMKVVMY-
>EndMetStandDraft_4_1072995.scaffolds.fasta_scaffold1684370_1 # 1 # 399 # -1 # ID=1684370_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.689
--DSIEVLAARVVRGGDQVERTAIQSNVNNPTYSFLFDVNSQGHKYYKNLVKCL-
>ERR1711962_81784
-QQIIDKTATFISKQSAQMEIVLKTKQSGNPQFAFLEFN-NYLNPYYKFILEKIK
>tr|A0A1S4EIY5|A0A1S4EIY5_DIACI splicing factor, suppressor of white-apricot homolog OS=Diaphorina citri GN=LOC103515449 PE=4 SV=1
-FAIIEKTASFINSQGAQMEILLKMKQASNPHFGFLSFD-NPLYPFYRHVLSAIQ
>ERR1712137_119510
IKNFIDKLAALVVTHGEHIETQAKQKHAENPTFSFLINENTSEYLYYQFV-----
>ERR1719382_1174404
-ASTIEKLAARTCNN-PSFESMILEKQSDNPLFAFLSGGE--GAEYYAY------
>ERR1719203_545124
-AEAIEKLAHRACNK-PDLEALVREKQQGNANFDFLNGGQ--GADYYEY------
>ERR1719473_607645
----IRLAAQCAAVNGPDFVAALARREAANPRFLFLRPQ-HSLHHFFRKLTE---
>ERR1719163_2191940
--ELVESIATAVRKNGGTFEQWLLKQRKNNPAYAFLFPGN-RGHEYFCFA-----
>tr|A0A183JXN2|A0A183JXN2_9TREM Uncharacterized protein OS=Schistosoma curassoni PE=4 SV=1
-RGIINKMAEYVVRNGLEFEELVIRQKSKDPRFGFLKSD-HPLHSYYMAKRKEL-
>SRR5689334_17267774
-----ERTASFVCRQRAEFEIMLKANQARNSQLDFPrFDH--YLNPYYKFIQKAM-
>MudIll2142460700_1097286.scaffolds.fasta_scaffold1997242_1 # 2 # 496 # -1 # ID=1997242_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.665
--KRIDITATFVTEDGKAVEHRIKARKKDDPDFAFLFDTieENEETVYYLWRVFSL-
>SRR3954467_13119551
--RAVHKTVANVVKFGPIFEKAMLASPRvqYDPNWDWFFDHTSQPNIYFRWLIYEY-
>ERR1719210_149676
--------------ESEEFRILIQRADLTSAQAASRRMLSS---ALPRPTA----
>ERR1719330_1085409
--------------EADEMRALVTVKDLPSKDSAAGI---ASRSQPSKPGF----
>ERR1712024_185743
--------------------ILIQRADLSSAQAASRRALSS---ALPRPTT----
>ERR1711974_383034
--------------ESEELKAVVIAPELPSVPPAAPSQRPR--AELPRPNL----
>ERR1719350_2212330
--------------EAEEMRALVMLKDLPALQAQVQASAAASRGTYTKVGS----
>ERR1719210_2598541
-----------------EMRALVMAVDLPSAGLQPSRTASAPSGQ----AL----
>ERR1719193_1899381
--------------EADEIRVLVAPPELPRPPPAQAARPMH--V-----------
>ERR1719161_2950423
--------------ESDEMRSIVTAKELPGEAKPVGAAPLS--A-----------
>ERR1740129_2089677
SRSWLST----SCGTVRSSRTLSSKKNAGSPQFCFLFGGEG--SEYYAALI----
>ERR1740129_2089676
LRKRIEIMAEHVVRNGAEFENTVKQKKCREPPVLFPLWWRG--LRVLRSTH----
>ERR1719161_2531743
--------------EEEEFQTLICADELPQQNNSWQQNRSSW-------------
>ERR1719265_2529358
--------------ESEEMRAIIVARQLPDLAKLGNASASAPKNAL---------
>ERR1719221_2345259
--------------EAEEIQTLITAAELPRLPSSQAVAGAR--S-----------
>ERR1711879_693505
--------------EAEEIRTIVIAVDLSVATKLGSPGASTPTK-----------
>ERR1719499_1860750
--------------ESEEFRILVTTQNLPGVAPVTGGPVPS--Q------A----
>ERR1719198_397817
--------------ESEEMRILITTENLPGVARPMQPALQQ--G------M----
>tr|A0A267F3X8|A0A267F3X8_9PLAT Uncharacterized protein OS=Macrostomum lignano GN=BOX15_Mlig000774g1 PE=4 SV=1
-KLIIDKLAAYVQRNGTDFEAVVKAR--SDSRFDFLQSS-HRYHPYYLMKCG---
>tr|A0A182BFF8|A0A182BFF8_LUPAN Uncharacterized protein OS=Lupinus angustifolius PE=4 SV=1
IKGVIEKVVEFISKNGKQFEAVLAEQDRAHGRFPFLVPS-NRYHTYYLKVLQT--
>tr|A0A061F5H0|A0A061F5H0_THECC SWAP/surp domain-containing protein, putative isoform 1 OS=Theobroma cacao GN=TCM_030834 PE=4 SV=1
LKRVVDKIVEFIQKNGRQFEAVLVEQDVRHGRFPFLLQS-NLYHPYYLKVLQK--
>tr|V4LST2|V4LST2_EUTSA Uncharacterized protein (Fragment) OS=Eutrema salsugineum GN=EUTSA_v10015403mg PE=4 SV=1
MKRVINKIVDFIQKNGIELEATLAAQDVKYGMFPFLRPA-SLYHGYYRKVLQE--
>ERR1719375_1668197
-AVERRAAAAAAARTGAVRAARAAAA-----------------------------
>ERR1711903_255173
----VR-----LAGTGYVPEGLGLI-VPPPDLRAIVDKT----------------
>ERR1719189_2316117
-ATLIEKTSEFIATQGAQMSILMRAKEAHNPKPIRFTCQ-HL-------------
>tr|H3CKX5|H3CKX5_TETNG Uncharacterized protein OS=Tetraodon nigroviridis PE=4 SV=1
--RLMDVISAqprsADRQGKADISDPLKASPQSSSPAVFLRFD-HYLNPYYKHVLRAMK
>ERR1712087_224748
--AILERTAAFVAKQGTQMEFVLKMKQEGKKEFGFLKFD-HPLHSYYKHVTRAIK
>ERR1740124_158579
--KIVEKTAGFLSTQNLQTELVIRTKQKDNAQFHFLEPD-HELNPFY--------
>tr|A0A0D3DEI6|A0A0D3DEI6_BRAOL Uncharacterized protein (Fragment) OS=Brassica oleracea var. oleracea PE=4 SV=1
TKTIVEKTASFLSRNKVDKSEI----MPSDVRYNFLRSKTDPSHAYYSYMLSKY-
>tr|O23467|O23467_ARATH SWAP (Suppressor-of-White-APricot)/surp domain-containing protein OS=Arabidopsis thaliana GN=dl4140w PE=4 SV=1
TRTLVDKAAQFVSKKGLEFETkiidS----YPTDAKFNFLRSTADPCHTYYKHKLAEY-
>tr|R0F5E3|R0F5E3_9BRAS Uncharacterized protein OS=Capsella rubella GN=CARUB_v10004868mg PE=4 SV=1
LASIIDRAAVFIFKHGVEYEAelleT----FPCHT---FL-KSSDPNHGMYQKRLMEY-
>ERR1719235_1568718
--DVVRLTAQFVARNGQAFLQGVASREYNNPQFHFLKAT-HSLNPFFSELVS---
>ERR1719305_1777623
------------LRTG-----------WRLSTWTSCG----SRRSSSPGTAR---
>SRR3990167_877412
--ELIKLTAQYIAINGDQFLSGLLKREQTNPQFEFLRSG-HPANLYFNSLVQ---
>tr|A0A0H2RJW9|A0A0H2RJW9_9HOMO Uncharacterized protein OS=Schizopora paradoxa GN=SCHPADRAFT_875731 PE=4 SV=1
-EKFIRTVALKVKENGEAFEDMLRDRERQNKKFSFLFETSSPEHKLYRAIIS---
>tr|A0A2A9P079|A0A2A9P079_9AGAR Uncharacterized protein OS=Amanita thiersii Skay4041 GN=AMATHDRAFT_73220 PE=4 SV=1
-DIFIRAVAAEVKGHGKKYENNLKERERYNPKYSFMLKKDHRRHAFYRGLIE---
>tr|A0A166ICL2|A0A166ICL2_9HOMO Uncharacterized protein OS=Peniophora sp. CONT GN=PENSPDRAFT_649036 PE=4 SV=1
-AGYIRTVASEVRGHGDMYEDSLREREKGNSRYAFMTDSRHRKHRFYKSLLD---
>tr|A0A074SEI9|A0A074SEI9_9HOMO Putative RNA recognition motif OS=Rhizoctonia solani 123E GN=V565_124550 PE=4 SV=1
-EEFVRKVADKVRRNGRAFQTLLETREKNNPSFEFLWDDKSPGHLLYKHVLD---
>tr|A0A067PT03|A0A067PT03_9HOMO Uncharacterized protein OS=Jaapia argillacea MUCL 33604 GN=JAAARDRAFT_80612 PE=4 SV=1
-DQFVRTVASEIRGHGAKYEESLREREKANPKYAFMVNKKHRKHRYFRSLVQ---
>ERR1711939_754415
-RHLLEDVARRIQQYGNRFENTLRDKERSNPQYDFLRDEALPAYHYFKMLRD---
>tr|W7JWB9|W7JWB9_PLAFO Uncharacterized protein OS=Plasmodium falciparum (isolate NF54) GN=PFNF54_05869 PE=4 SV=1
IKTVIDKTATFVKKNGKNFEQKIYrEKE---KQFGFISPS-HPYFYYYQYKLHGL-
>tr|A0A0L0D0L8|A0A0L0D0L8_PLAFA Splicing factor OS=Plasmodium falciparum RAJ116 GN=PFLG_01956 PE=4 SV=1
ITTVIDKTATFAKKNGKNFAQKINrEKE---KQFGFISPS-HPYFYYYQYKLHGL-
>tr|J4D7D6|J4D7D6_THEOR Splicing factor 3 subunit 1 OS=Theileria orientalis strain Shintoku GN=TOT_020000383 PE=4 SV=1
-REIIKNTALFVARNGHKFLVDLNKREKNNPQYDFLNPS-HYLFSFFSNI-----
>tr|U6LQB8|U6LQB8_9EIME Surp module domain-containing protein, putative OS=Eimeria brunetti GN=EBH_0086290 PE=4 SV=1
------------------AAAAAAKREQQNPQFAFLKPS-HHLFSYFASL-----
>ERR1719471_81008
--------------GGSLLycfVFTRVVQQADNPVFGFLWME-HSYNKFSRHLTT---
>ERR1719285_115611
--HVILKTARFLCTQGSQMEILLKTKQANNPAFQFLNPS-DVLHSFYRHLVYMLR
>SRR5690554_5418954
--KTIHRTIESLLTHGPSFKALLMAQPPirHDPNFSFLWDARSPAGTYYRWKLWDL-
>ERR1719233_243540
--DRINKLCEHVSQrsnNGHEFIKLVKKKEATNPEFAFLFPGQ-EGNEYYEWKKY---
>ERR1711871_1905584
--DLINHTARWVATNGPEFENKLNS--TGGPEFSFLREGSrSLPPAYYRNRLR---
>Dee2metaT_33_FD_contig_31_1079419_length_229_multi_2_in_0_out_0_1 # 2 # 229 # -1 # ID=785874_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.601
--QRIHRMIEFVIREGPMLEAMIMHKEMSNPQFRWAGVYVRTYV-----------
>tr|A0A168MNJ7|A0A168MNJ7_ABSGL Uncharacterized protein OS=Absidia glauca GN=ABSGL_04457.1 scaffold 5468 PE=4 SV=1
ILPVILKTANTVAKVGLDLEQKIKRLRRDDPTFGFLQPQ-HAYHPYYRRKLAEL-
>tr|A0A1X2IDM4|A0A1X2IDM4_9FUNG Alternative splicing regulator-domain-containing protein OS=Absidia repens GN=BCR42DRAFT_417244 PE=4 SV=1
VKETIDKMAASVAKVGLDLEQKIRHLRSQDPRFAFLQPH-HPHHSYYQKQLGLR-
>tr|M7XA59|M7XA59_ENTHI Splicing factor, putative OS=Entamoeba histolytica HM-3:IMSS GN=KM1_187530 PE=4 SV=1
--EIIKLTAQYTARNGSNFVKTLAEREQKNPTFAFLHKN-HPNYPYFAQLCES--
>SRR5690348_2565785
--NILRLTAQFVALNSRKFLTALTARESKNPQFNFLQPD-HPLFPYFTKMVEE--
>ERR1719447_2397499
--RAIEKTALFLAGQGAQMEILLKAKQAGIPVCGLAPAP-LLPPPDYPGEVWA--
>ERR1719266_1091987
--RAIEKTALFLADVEVFIFILFRVVNFVEFILLCVF------------------
>ERR1719237_570445
-----------------------FGKSFRRTLL-APLVL-PPLHPFYRHIISL--
>ERR1719331_614231
--RAIEKTALFLAGQGALRAASAGS---GAPPPPRSGA-----------------
>ERR1719228_1547995
-----------------------GSFWNSFGRTLFLSVD-SPLHPFFRHLITL--
>ERR1719266_1127129
--APEEKQEEE-ESEGRRR------------------------ERE-EDELEM--
>ERR1719320_1399933
--NNFS---SPSDNFGMIFRVYLNGNLEGFLNFNFLLCF-----QFFDLFsqsnfcsIYI--
>ERR1719481_1522442
-------------------SDESARPLGAPPVFEFLAPD-HTIYPFFRHIKNL--
>ERR1719470_780729
--KLKKPDSSKESSLGPNNAIS----------------G-L--DYG---------
>ERR1740128_386793
--KVKKTEEKD-THLPTASALTGEGS---------SGSG-DE-EKGQEEPTTT--
>tr|A0A178UM43|A0A178UM43_ARATH Uncharacterized protein OS=Arabidopsis thaliana GN=AXX17_At5g06100 PE=4 SV=1
IRSCVENTALIVSKNGLEIERKMMELSMNDARHRFVWST-DPYHAFYQLKLAEYR
>tr|D7LZR2|D7LZR2_ARALL Uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_349916 PE=4 SV=1
IRNFIQKTALFVSKNGLETARRFMELSMNDTRYRFVWST-HPYHAFYQLKLAEYC
>tr|A0A1J3FG29|A0A1J3FG29_NOCCA Putative splicing factor 3A subunit 1 OS=Noccaea caerulescens GN=GA_TR3425_c1_g1_i1_g.10787 PE=4 SV=1
ICFFIEKTALLVAQKGSEYEKSLMAEGNIHPDWSFLWSS-DPYHGYYQQKISEAR
>tr|T1FTU3|T1FTU3_HELRO Uncharacterized protein OS=Helobdella robusta GN=20212240 PE=4 SV=1
----IDQFVACVNQSGPKLEQVAIKNNKNNPMFRFLYDFGSPANKYYEEKMTPV-
>ERR1719367_2719732
----V-----FGSTDLEDHQWEVREKNMEEPELSFLQDADGDLYKQYRSKIEWL-
>ERR1719471_2220480
----------IVRINLSFACWHAIFRTFLLFPRRFLQDVEGDLYRQYRSKIEWL-
>ERR550532_2191596
-------------PKAGILRSHRSGRRQRGQNGRFLQDVEGELYRQYRSKIEWL-
>ERR1719237_156927
----A---VTFESVFFRL--LTLYSPPLSLISCRFLQDVDGDLYKQYRSKIEWL-
>ERR1712000_499885
-RSKLSSKRqQDtwr--------------ATAMSSKIAFERRK-SITrsspssartihtmhttygdsarsrkEEELLWQLAKQE
>ABMV01.1.fsa_nt_gi|175780699|gb|ABMV01286810.1|_1 # 1 # 87 # 1 # ID=286810_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.494
-SSKVSLLPaLYiwtLNIYAQLIVDRVRDKEKSNPKFSFLNPD-DAYANFYQWRLTEIK
>tr|A0A1I8BS73|A0A1I8BS73_MELHA Uncharacterized protein OS=Meloidogyne hapla PE=4 SV=1
--LLIERTALFVVEKGPQMEVVIKAKQRhKAEQFGFLdFDH--RLHPFYKYLCKQIR
>ERR1719483_695200
---------PNFRLPILTFNKHICCCEAANAKFGFLFDT-HPLHKYYLYL-----
>ERR1719361_1323735
--QVIITTARYIRKGGLDAELKLQARQIFCCCQR---------------------
>ERR1719433_557179
--INIQSFQIQIVFVQMDAELKLQARQAANPKFAFLFDT-HHLHKYYLYL-----
>ERR1719311_865951
LRRVVDAVARFVAKYGEDFERVLIADAKEqaerlsdSppVDLRFLLEsNDTPTQQYYRWRVVA--
>ERR1719171_2768445
LRRVVDAVARFVAKYGEDFERVLIADAKEqaerlsdVslPRSRSYGG-------------IA--
>tr|A0A0N5DJQ4|A0A0N5DJQ4_TRIMR Uncharacterized protein OS=Trichuris muris PE=4 SV=1
-HKIIEKTAAFIATHGIQMEIIVNIKQKGNPLFRFLDYN-HQLHVYYKHILRMI-
>ERR1719421_2720146
-------------RPAGARAGSGTARRRDNAQYSFLYRG--PGKTYFDALVAK--
>tr|D7LPI4|D7LPI4_ARALL Uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_347136 PE=4 SV=1
IRNDIEDMARYISKGGLVFE-SVMRHLvADEARYSFMASS-HPFHAFYQQKLTEYR
>tr|D7MAG0|D7MAG0_ARALL Uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_915239 PE=4 SV=1
VTNRIQGTALYVAKKGFKAGKMLMQSEANNPKYNFMRRS-DPYHAFYKQKLAEYR
>tr|D7KQU2|D7KQU2_ARALL Uncharacterized protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_890144 PE=4 SV=1
RRYLIKTVAHLISRKGLEDEREMMDSFiNKPGSFGFLKSS-HRHHAFFRKMLTECR
>CryBogDrversion2_9_1035297.scaffolds.fasta_scaffold81031_1 # 2 # 289 # 1 # ID=81031_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.451
--DIIRVTAQYTAASGRSFLDQITRREEHNPQFAFLKVT-HVLFKFFM-------
>tr|A0A178VJ76|A0A178VJ76_ARATH Uncharacterized protein OS=Arabidopsis thaliana GN=AXX17_At3g30130 PE=4 SV=1
IRNNIEMMASYISKGGLVVEKVMRYLVVSDARYNFMGIS-DPFHPFYQQRLTQY-
>tr|F4IWK8|F4IWK8_ARATH SWAP (Suppressor-of-White-APricot)/surp RNA-binding domain-containing protein OS=Arabidopsis thaliana GN=At3g27600 PE=4 SV=1
-------MASYISTGGLVVEKVMRYLVVGDARYNFMGIS-DPFHPFYQQRLTQY-
>ERR1719412_32886
--MVVDELASMVAVSGDQVEQIARARNVGEETLAFLFDLDCDLYKRYRSKVESLR
>ERR550519_2245261
--LEEGGLLgthAGV-LGGHNYVERSQS--SGLSGCADFVGEekvpdVGEVLLGedeshvvLDVGEETLQ
>ERR1719397_728372
--LEEGSLLiieVFS-YNFPEMV--------------HLLLIskivfNSQHALKed---------
>ERR1711890_204521
--WKKEVFSgpMPVFWAGTTTSRGP-----KLRPX----------------------
>ERR1711963_597826
--THVSELASMVAVSGDHVEHIARANNIGEEVLAFLFDMECPLYKKYRAKVEVLR
>tr|A0A0S6XQ11|A0A0S6XQ11_9FUNG Uncharacterized protein OS=fungal sp. No.11243 GN=ANO11243_060740 PE=4 SV=1
--KLIHDTAEQLIQYGPDFEALLMTQPEIqkDEKWAWLFDSSSQAGVYYRWVIWDH-
>SRR5438045_7177446
--KLIHKTIEGMLNNGTEFDALIMRRLX---------------------------
>tr|A0A175JVG3|A0A175JVG3_ENTHI Uncharacterized protein OS=Entamoeba histolytica GN=CL6EHI_052100 PE=4 SV=1
TKKIIDELIPYLDKYGQEFEELVIKKCSSDLHFKFLNQEETLEFHYYLWKVFEF-
>tr|A0A0A1UH17|A0A0A1UH17_ENTIV Uncharacterized protein OS=Entamoeba invadens IP1 GN=EIN_046680 PE=4 SV=1
TKYIVKVLCEYLTVQGIVFEQVVKNKCDNNNNFEFMKEN-NEENEYYKWKVFEA-
>tr|A0A1E1IRG2|A0A1E1IRG2_LEIGU Uncharacterized protein OS=Leishmania guyanensis GN=LgM4147LRVhigh.05.00160.00220 PE=4 SV=1
--TLLDLLATAVVQGGPTTEEEIVKRemGRGNLAFAFLGEkFNHPCMLYYRWRLYSL-
>SRR5271169_736643
-------NSKPCSCPGQPS-RATRNGPGcgM----------PGVWVAFTTWRLWDI-
>tr|C1GFP6|C1GFP6_PARBD Uncharacterized protein OS=Paracoccidioides brasiliensis (strain Pb18) GN=PADG_06082 PE=4 SV=1
--DVVKLTALFVAKRGKSFMTALSQREMRNFQFDFLRPQ-HSLYQFFTRLVDQ--
>tr|S9VSJ4|S9VSJ4_SCHCR Splicing factor Sap114 OS=Schizosaccharomyces cryophilus (strain OY26 / ATCC MYA-4695 / CBS 11777 / NBRC 106824 / NRRL Y48691)
--NVLRLTARYAAIRGPSFIAKLSQKEWGNTQFDFLHQN-HALYSYFMRIVQQ--
>SRR5271156_5509908
--DILRLTALFVARHGNAFQRAISERESRNYQFDFLRTN-HSLYPYFQEMVAE--
>tr|A0A1E4TDZ0|A0A1E4TDZ0_9ASCO Uncharacterized protein OS=Tortispora caseinolytica NRRL Y-17796 GN=CANCADRAFT_98053 PE=4 SV=1
--EVLRLAALYVASNGIERATnELRTVYRDNAQFAFLNPK-HSLHQYFSAMIAQ--
>tr|E5A8H5|E5A8H5_LEPMJ Uncharacterized protein OS=Leptosphaeria maculans (strain JN3 / isolate v23.1.3 / race Av1-4-5-6-7-8) GN=LEMA_P075090.1 PE=4 SV
--RLVHQTIEGVILGGVEFEAALMNDPQvqEEERFAWLYDQKHPVNRYYRWRLHQI-
>tr|A0A150GL88|A0A150GL88_GONPE Uncharacterized protein OS=Gonium pectorale GN=GPECTOR_16g670 PE=4 SV=1
--ERIQLLVKYAMSNGPSFIDMMRQKQAGDSKFAFLNGGE--GSDYFRWLLYA--
>tr|A0A1D1ZVH2|A0A1D1ZVH2_AUXPR Uncharacterized protein OS=Auxenochlorella protothecoides GN=g.54231 PE=4 SV=1
--QRIVKLVEFAVRNGPSFVDLARQKQAGNPEYGFLTGGD--GAAFFTWRLFA--
>ERR1719193_1618018
QRVLIDLVASYVARVGHPFEQNLIQDIKDsyivlpkHLSWAFLEKSQTDQGRYFRWRTYAF-
>ERR1719158_121338
QRHLIDLVASYISRCGHPFEEKLIDDVNAgililprRLSWDFLQKANSGDGKYYRWRTYAY-
>ERR1719387_647863
---------RDLEKCGTELKRCLSK----dnsVVDFNFLHETETNDAIYYRWRMYAF-
>APWor7970453378_1049310.scaffolds.fasta_scaffold18124_1 # 1 # 81 # -1 # ID=18124_1;partial=10;start_type=GTG;rbs_motif=None;rbs_spacer=None;gc_cont=0.346
LKRVIDLLANYVAQMGEPFEEAVVHRLQKsrneaiaGIDWSFLQEHASPEAQYYRWRTFSF-
>tr|C9ST89|C9ST89_VERA1 Predicted protein OS=Verticillium alfalfae (strain VaMs.102 / ATCC MYA-4576 / FGSC 10136) GN=VDBG_08114 PE=4 SV=1
--DCAHICRDDLYSVFPERASIQ-SLQAEamGDDFSWISDNCLLfslqgFGafpqlrrfdlvVNFDLALWTG-
>SRR5690606_21177905
-----------------------MAREDIqqDERWSWLWDTRSPAHNYYLWRFWEI-
>ERR1719379_1715681
-RRVIDFTAERVSRQGFAFESeLTISEGAAGSHFDFLRNRKSPEHAYYRWKVAQL-
>ERR1719313_170299
-NQRIPYSRTiLTPNGVYVFQSeLTISEGAAGSHFDFLRNRKSPEHAYYRWKVAQL-
>ERR1719446_920833
--VRIEKLAHHVAKsTGHQLEDYTRQRQNGNPDFDFLNSGEGS--EYYQY------
>ERR1740117_461964
--RRIEQLASREA-SRPGLEQFTRERQGDNPQFAFLAGGEGQ--EYYEY------
>ERR1719359_98973
--DIIKTTAQFVARNGQKFLVGLTQPPSRTLSRAWRRRR-RLRRTRR--------
>tr|C5KGL5|C5KGL5_PERM5 Spliceosome associated protein, putative OS=Perkinsus marinus (strain ATCC 50983 / TXsc) GN=Pmar_PMAR015736 PE=4 SV=1
--DIIKITAQFTARNGNRFLQGKSAFDPGA-------------------------
>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9932630_3 # 1044 # 1280 # 1 # ID=9932630_3;partial=00;start_type=ATG;rbs_motif=TAA;rbs_spacer=11bp;gc_cont=0.350
--DIIKLSAQFVARNGRAFQNGLFEREHKYSQFGFLQPY-HPLHQYFKKLVA---
>tr|A0A088AUW3|A0A088AUW3_APIME Uncharacterized protein OS=Apis mellifera PE=4 SV=1
-QIIIDKMASYVAKNGIDGGGVVAE--AGAPRGEAKEAG-RIAEEAAE-------
>tr|C5L0Y3|C5L0Y3_PERM5 Uncharacterized protein OS=Perkinsus marinus (strain ATCC 50983 / TXsc) GN=Pmar_PMAR008765 PE=4 SV=1
--MYIDKVSKYVARHGREFEKYLETLAKgGDKRLQFLlQPLDSPAGVYYRWRVCA--
>tr|A0A0M3KG91|A0A0M3KG91_ANISI Splicing factor 3A subunit 1 (inferred by orthology to a human protein) OS=Anisakis simplex PE=4 SV=1
-------------------------PRNPPPPFEFKTDPA-TinafdLFVYYFFIQF---
>GWRWMinimDraft_8_1066016.scaffolds.fasta_scaffold324368_1 # 3 # 218 # -1 # ID=324368_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.593
--ETIHLVAQYTCAYGKSFLASLTTKEYRNPVFDFLKPSN-PNFVYYTSMVD---
>tr|A0A093V9W6|A0A093V9W6_TALMA 2-succinyl-6-hydroxy-2, 4-cyclohexadiene-1-carboxylate synthase OS=Talaromyces marneffei PM1 GN=GQ26_0100140 PE=4 SV=1
--SAQSTSKSGYITFETAKKDFLMLNYAadKSAaellinadVlqlTEKLNHPAPLGEIIDIAQLWQS-
>SRR5215469_2120092
--RLIHKTVENVLTLGMDFETLLMDNTEvqQDERWAFLWNPRSVGGRWYRWRLYEL-
>tr|A0A1L7XWZ7|A0A1L7XWZ7_9HELO Uncharacterized protein OS=Phialocephala subalpina GN=PAC_19409 PE=4 SV=1
--LPLAEMKNVQLRFPNPQKDAMMLGPP---rcydpvIlkqTDRLQHQISLEELYDINVLWAG-
>tr|A0A0C3H5X2|A0A0C3H5X2_9PEZI Uncharacterized protein OS=Oidiodendron maius Zn GN=OIDMADRAFT_51536 PE=4 SV=1
------EIYPDSIILDCAMKDAVMLNFPaq---rlvdpeMclnTVQLNNPLPSDEVHDINVTWVT-
>tr|W2RP27|W2RP27_9EURO Uncharacterized protein OS=Cyphellophora europaea CBS 101466 GN=HMPREF1541_07096 PE=4 SV=1
-------TKLNPLSVPHEAKDTITF-H----teytfdpdIlkeTQRLGRDSSFEELYDINLLWPN-
>tr|W9WMC5|W9WMC5_9EURO Uncharacterized protein OS=Cladophialophora psammophila CBS 110553 GN=A1O5_07353 PE=4 SV=1
----------KVIEIPHEFRQKQAAGP----haalfdvsVfsSRRGESNTSDAEVYDINIQWPT-
>ERR1700761_376928
---------TAPLAVGND-------------knvadpaVwlASSCQAPTSVEELYDINMQWPT-
>ERR1719354_672505
--TVIEELANMVAVSGEELEDIARERNKDAPELRFLFERTGTMYKRYRARITEIK
>ERR1719150_3105981
--FI------TKDCFI--IPRFqEPESSSRANSNlyRFLYEPGSKTYRKYRQRVQELR
>ERR1712223_1974042
--KIAEELASMVACCGDQLESIARNHNFNCNNAeiKFLYDPGSKIYRKYRQRVQELR
>ERR1719394_563848
--EV---LIFL------KYVVVKHLKPVKLYYHqtRFLYDPRSKEYRKYRQRVQELR
>ERR1719367_1537762
--TVAEELASMVACSGDQLENIARNHNAGMDEKemSYVFIITLISVSDIHvnRQLHAFG
>ERR1719220_498260
--TVVEELANMVAVSGEELEDIARDRNERLRSRDSCRRRKGIytgstgpEWTKY--------
>ERR1719419_1730175
---LPCCYVCYVCL--PCCDVSLPCCYVCLPGCRFLFDEGSEANQYYRQRVRELR
>ERR1719419_706329
-----DTLARYVAEDGPHVEETARQANANNPTYWFLFDEGSEANQYYRQRVRELR
>tr|A0A132AEU1|A0A132AEU1_SARSC Suppressor-of-White-Apricot domain containing protein 2 OS=Sarcoptes scabiei GN=QR98_0080390 PE=4 SV=1
-EVIIEKLAEHVGKNGENFEQSIRAL--NDSKFDFLNKG-HKYHAYYVKR-----
>SRR5699024_1011955
-ELIITKLAQHVAKNGDEFESSIKAL--NDKKFEFLNPG-HIYHAFYVKQ-----
>ERR1719209_1900693
-QKIIQQVAALVASGGEETEEAIKKQHLDDPDYWFLFSKTNPLYQSYLDQLWSIR
>ERR1719209_827704
-----------------------------HAEL-LTFLMRRFSYRSyTSDQLWSIR
>ERR1719431_1494586
-EKIIQQVAALVASGGEETEEAIKKQHLDDPDYWFLFNKTNPWYQETTSRX----
>tr|A0A1Y1YET0|A0A1Y1YET0_9FUNG Uncharacterized protein OS=Basidiobolus meristosporus CBS 931.73 GN=K493DRAFT_337058 PE=4 SV=1
--EIIERTAKFLNTKEPKMEIIIQAKQSHNPDFCFLNKD-DPLHAFYRHVRW---
>tr|A0A077X061|A0A077X061_9FUNG Uncharacterized protein OS=Lichtheimia ramosa GN=LRAMOSA04774 PE=4 SV=1
--DTIVATAKAAAAsvNPKLFEIKTQARQGNNPLYAFLSQR-HPLYRFYTHIIW---
>sp|Q86A14|SF3A1_DICDI Probable splicing factor 3A subunit 1 OS=Dictyostelium discoideum GN=sf3a1 PE=3 SV=1
--DTIRLTAQFIAKNGDSFFMELASREVKNSQFDFLKPTN-HLYEWFRALVES--
>ERR1719450_384639
-QLVIDRLSLYVVKNGEEFEVGIKDK--KDPRFDFLNPW-NVYHPYYLNK-----
>tr|K7KKF9|K7KKF9_SOYBN Uncharacterized protein OS=Glycine max GN=GLYMA_04G160800 PE=4 SV=1
---KIEALCQLIAEKGADIEDKICQDEFQNPEYAFFIGGD-PgteaaiAHTYFLWM-----
>ERR1719391_1937109
-SAGWDKYVKVWNLTNCKLktnhIGhtgyLN--TVTMSPDGSLCASG-GKDAKAmlwdLNDGK----
>ERR550532_1493963
-LLTSItlgNPNSAPPAVyskplEEDDdd--eIRRERKKERRRSRs-WDRE-D--------------
>ERR550525_2192542
---PGHrllLLSCSLPSPwaiqtLLRLqsip-KlmrRM-MRRRAgrsrsRSR---EDRR-S--------------
>ERR1719220_3304821
---HGRhlrLPNWSPPSPwaiqtRPLLlsirKRsrmgrRRERKKRRrrsrsRSR---EDRR-S--------------
>ERR550532_28647
---PGRrlhLLSYSPPSPwatqtLLPLlftpKLprkkKRRKRSR------s-RSRE-D--------------
>ERR1719400_738581
-LLTSIplgNPNSAPPAIyskalEEDDeEESRREKKKRRRRSRs-RSRE-D--------------
>ERR1719447_1948310
-SQSLTKWPAMWRRMAARSRMLLDLHHLLKSQFPL--------------------
>ERR1719500_859723
-SLLLTRWPAMWRKMGDHLKMLCDPVLELRKSFLFLTPG-DRHHNYYVHKL----
>ERR1719189_2608489
-SLLLTRWPAMWRKMGDHLKMLCDPVLELRKSFLFX-------------------
>ERR550532_1147871
-QPVIDKMASYVAKNGRSFEDVVRSRPGASQKFSFLTPR-RQAPQLLRAQA----
>ERR1719237_596810
-DRSSNQKGRIWEYQ-------SD------ISV----FF-FRHHNYYVHKL----
>ERR1719180_72963
-QPVIDKMARYSYQSMADYSNILLTLfwlqLCCKKRALIX-------------------
>ERR1719458_1265783
-QPVIDKLAIYTTGSYDSALA--RVEPT---VFKVKKQD-EEKPS----------
>ERR1719266_725767
----------RTQAVGVGLASLTYAAMASlpYWLPALAAGG-DRHHNYYVHKL----
>ERR1719275_322675
-NNFMTR--SQLDRSSNQKGRIWEYQS--DISVF----F-FRHHNYYVHKL----
>ERR1719341_2690674
---PGRrllLLSYSPPSHwatqtLLPLlctpKLwrk-----MRRkrsrsRSR---EDRR-S--------------
>ERR1719481_1270947
-LVVPTrlgDPNSAPPAGyadpvVDLSsgDEDRKKKKRKRRRSRs-RDER-K---------S----
>ERR1719481_654233
-DPFQVs---------qGHINwgdeDRKKKKRKRR----Rs-RSRD-E---------R----
>ERR1712126_172811
-TVLHRr---------aTRTQwwtsPRGMKTGRRRRGNEGd-LGPG-T---------K----
>ERR1719350_678768
-LVVPTrlgDPNSPLPAGyssplVDVSddegRRRKRRR-------Rs-RSRD-R---------K----
>ERR1719233_359122
-LVIPKlagQ-ESPVELS----sdseEERRRRKKRRRSRSRs-RTRD-R--HR----KE----
>ERR1719233_2489118
-LVIPKlagQ-ESPVQLS----sdseEERRRRKKRRGS--Rs-RSRS-RDRHR----KQ----
>ERR1719233_225697
---------CPLIQ------KRKGGGERRGEDLAp-EAGP-GIDIG----GM----
>ERR1719320_1399933
-GRSFPIGNSWCLTILRSFEEVVRTKD--EERFSFLCNE-DKHHNYYLHKL----
>ERR1719188_42770
-LVIPKlaaSCNPVVELSsd----seeDIPKKRRR-SRSRSRs---RR-R---------D----
>ERR1719481_743039
-LVIPTlssKPENSSESDasernRDKEksrrDRDRHRRKRKRSRSRs-RSKT-R---------H----
>ERR1719445_248938
--------RAFVQ-----KVVVELSKI--DQKLGFLKPG-DKHYNYYLHKL----
>ERR1719237_1835921
-LVIPTlavPANAVVDLS----sdsEEERPKKRKRKSRSRs-RERR-R---------N----
>ERR1719341_626575
-LVIPTlavPANAVVDLSsd----seeERPKKRKRKSRSRSRe-RRR-----------------
>ERR1719500_1289897
-QGSHRAIDTSPQILHSWSPTPLRTK--DENRFRFLCPE-DNHHNYYLHKL----
>ERR1740128_74710
-LVIPTlagPLEIKSSDSdsshd----sstSSSRRKRRKHGSKHRs-RSRD-R---------K----
>ERR1740128_1042175
-VDTPPpeaKPGMPVAPGPRGPppPRSAYLRP--PFRPPpihg-LPPG-PPPH--V--------
>ERR1740128_816055
-VDTPPpeaKPGMPVAPGRPPPprPSGHLSVS--RLSTAclparpHMSTD-PLPQGLVL-------
>ERR1740128_720410
------KTASCIPSIGTSSTPSAAARTSSSTKMRPK-------------------
>ERR1712224_18802
--DLIKLTAQFVARNGRKFLTNLTSKEKNNTEVIILGLS-ALVlSPKYKTLIK---
>ERR1700687_3837003
--DIIQLTARYISVNGSEFMKTLIQKEKHNPLYEFLKPI-HPLFPYFQKLID---
>tr|A4HA73|A4HA73_LEIBR Putative RNA binding protein OS=Leishmania braziliensis GN=LBRM_19_1490 PE=4 SV=1
--EIIQYVAKYVVAscDGARYQDKVRTRTRHNPYFDFLNAK-HPYHQYYQYLLESYR
>tr|A0A1D5WM86|A0A1D5WM86_WHEAT Uncharacterized protein OS=Triticum aestivum PE=4 SV=1
-LSALIIYAFC----------------------HIIFPL----HTFLLIFFS---
>tr|A0A1E5WFM6|A0A1E5WFM6_9POAL Uncharacterized protein OS=Dichanthelium oligosanthes GN=BAE44_0002874 PE=4 SV=1
-HQIIARTALFVNEHGGQSEIVLRVKQGNNPTFGFLMPD-HNLHSYFRYLVD---
>tr|A0A1Q3CI47|A0A1Q3CI47_CEPFO Ubiquitin domain-containing protein/Surp domain-containing protein OS=Cephalotus follicularis GN=CFOL_v3_23388 PE=4 SV=1
--DIIKLTAQFVARNGGPVWITVTVPNFDEGNLrgqHLEITVQ-SLSETIGSL-----
>ERR1712226_1769281
---------RREVAPGRAAP---------PHCQFLPNPLC-YRERELRK------
>ERR1712159_63258
--DVIKVTAQFAARNGKKFVTALA-SCLCL-CLFYFLPVI-RMIRIISDQ-----
>ERR1719446_1649393
--DVIQLTAQFVARNGRARTHSSTFSNPCTTCFPTLRPSS-THTPNASRL-----
>NGEPerStandDraft_5_1074534.scaffolds.fasta_scaffold464305_1 # 1 # 381 # 1 # ID=464305_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.719
--EIIKLTALYVARNGSRFLNGITQREQRNPQFDFLKPT-HQLFTYFQKLVEA--
>APThiThiocy_cv2_1041547.scaffolds.fasta_scaffold197264_1 # 1 # 459 # -1 # ID=197264_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.754
-KDIIKATALYVARNGKLVHDVLQGRYARNSLFDFLKPH-HMYNKYFQQLV----
>tr|A0A194W9E8|A0A194W9E8_9PEZI Uncharacterized protein OS=Valsa mali GN=VM1G_08121 PE=4 SV=1
--EILRVTALFVARNGRQFMTTLAQREAGNPQFQFLIQN-HTFHNYFQHMVDQ--
>tr|A0A015LGK7|A0A015LGK7_9GLOM Uncharacterized protein OS=Rhizophagus irregularis DAOM 197198w GN=RirG_011750 PE=4 SV=1
------------ARPARGWQPPISIPHSQPYIPQLFPAHHRPIYHIITSRWPR--
>tr|A0A0S6XTZ3|A0A0S6XTZ3_9FUNG Uncharacterized protein OS=fungal sp. No.11243 GN=ANO11243_073340 PE=4 SV=1
--EIVKLTALFVAVKGSAWLTKFSQTYGLQPQFQFLRPQ-ANLHQYFTRMIDQ--
>tr|A0A1I8CJZ1|A0A1I8CJZ1_9BILA Uncharacterized protein OS=Rhabditophanes sp. KR3021 PE=4 SV=1
---QIETTAAFVSKSGPQMEIVIKVKQNDDkKNFGFLDFH-NPLNPFYKMVLKNIK
>tr|A0A0K0ELJ8|A0A0K0ELJ8_STRER Uncharacterized protein OS=Strongyloides stercoralis PE=4 SV=1
---RIEKTAAFVALKGSQMEIFIRVRQKDNaSKFNFLDFE-HPLNTFYKAIINEVK
>ERR1719220_527266
---FIEQFAQKAIR-DKTLERDKLKYEQDNPKYKFLFEQTCPAHKYYTEYSKK--
>ERR1712226_630881
---KIEQFAETVAKPgGAQIEANATDLHGDNPDWKFLWDRYCPEHKYYTEYISR--
>ERR1712048_1192047
----------TVSKPgGDQVEKNTFDLHDKNPDWKFQWDKYSPENKYYNEFLSR--
>ERR1712003_297943
---RIESFAKLAVH-DPSLESSTKNEEYDNPEFFFLTKETSPAHKYYIEYLSR--
>ERR1712087_677071
---RIENFAKLAVH-DNSLEYETKKEEFDNPEFNFLTKETSPAHKYYIEYLAR--
>tr|A0A1U8BP29|A0A1U8BP29_MESAU splicing factor 3A subunit 1 OS=Mesocricetus auratus GN=Sf3a1 PE=4 SV=1
--DVVKLTAQFVARNGRQFLTQLMQGPVSIkvqvpnmqdkteWKLngqglVFTLPltdqvsvikvkiheatgmpagK-QKLQYEYADSCSN--
>ERR1719220_3384904
--DLIKLTAQFVAKNGKSFLSELMHSQAKNYHFDFLRPQ-HSCHPFFMQMVEQ--
>tr|A0A0P5ZD63|A0A0P5ZD63_9CRUS Splicing factor, suppressor of white-apricot (Fragment) OS=Daphnia magna PE=4 SV=1
VRLLMDKTASYMSRNGRHLESAVQS--KGDPRFSFLNPE-HAFHGYYIQKL----
>ERR1712136_565131
IKPLVDKTASYISRHDRNLEAVIKT--KGDPRFSFLEED-NVFHCYYLQKW----
>tr|K8ECR7|K8ECR7_9CHLO Uncharacterized protein OS=Bathycoccus prasinos GN=Bathy03g00500 PE=4 SV=1
--HHIEKLSEYVAKNGAEFEQLTRQKS--LDMFWWLDDLNSNEYRMYKLLL----
>ERR1719234_89229
-AKVVEELAAMVAVSGEELEEAAKSNSSRAADLDFLEDKSSGLYMRYRARVAQLK
>ERR1719223_834032
--EIIRLVAQFTAVEGHTFVIGLNSRESKNPQFDFLKPT-HPQFQYFLSLVD---
>ERR1719266_555965
--DVVELTCLLNSNSFKVFGNNNILS------------A-SclIVCIILNV------
>ERR1719225_757619
--DVVKLTAQFVARNGRQFLTNLMNKEQRNYQFDFFAAS-TLIVPVLHQAFG---
>ERR1719247_479136
--RDHQTNSTIHGRVGPTVLGRPSATRTAEPAVRLPEAD-AclVFLLYATSX-----
>ERR1719258_767711
--EIIKLTAHCNSAMCVFRLTAK--T--DT-------PS-LclRTLW-SRKLLS---
>tr|D8RIW0|D8RIW0_SELML Uncharacterized protein (Fragment) OS=Selaginella moellendorffii GN=SELMODRAFT_33213 PE=4 SV=1
--DIIKLTAIFAARHGSEFLTGLASREHYNLQFSFLKPE-SSLHKIFTGLCH---
>ERR1712137_330219
--EIVKLTAQYVAKNGDYFRGRLAAKERDNPQFDFMKFG-HPAFALFNNLIE---
>ERR550532_1360988
--NVLEKTADFLAKHGTQMEIMIKIKQKDNLMFNFLNYG-DELNPYYKHVMK---
>ERR1719161_852935
-VETIVKTATFVQKaqDPNSFEDMVKKKNEGDPKFRFLSQGG-TGHNFY--------
>tr|A0A1E4TSY9|A0A1E4TSY9_PACTA Uncharacterized protein OS=Pachysolen tannophilus NRRL Y-2460 GN=PACTADRAFT_76458 PE=4 SV=1
--DVIKMTAFFIAKNGSSFIDQVLrKTSDEASQFEFLNAN-HSFRKIFDSYVLQY-
>tr|A0A1E3NLU4|A0A1E3NLU4_9ASCO Uncharacterized protein OS=Pichia membranifaciens NRRL Y-2026 GN=PICMEDRAFT_15100 PE=4 SV=1
--NVIKLVAQFAVVNGSkimnDFKENALNNSRLSSQFQFLNER-HSMNKIFQKYVDIY-
>tr|A0A1D2VC26|A0A1D2VC26_9ASCO Uncharacterized protein (Fragment) OS=Ascoidea rubescens DSM 1968 GN=ASCRUDRAFT_26308 PE=4 SV=1
--QVIKLTALFCARNGDnyinNLKTHIEnlsssdqsapehpnlvdskignlpnkpqseptkqNALYSVSQFQFLNSN-HSLNPLFTSFVNQY-
>tr|A0A1E3PGS0|A0A1E3PGS0_9ASCO Uncharacterized protein OS=Nadsonia fulvescens var. elongata DSM 6958 GN=NADFUDRAFT_52267 PE=4 SV=1
--DIVKVTALYAAVNGpD-FVTELARRQALSQQYNFLKPN-HSFYGYFQSLINQY-
>tr|Q6CDU9|Q6CDU9_YARLI YALI0B21032p OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=YALI0_B21032g PE=4 SV=1
--EVIHTTAQHTAEYGpS-FSMLLAKNEARNPQYEFLKPS-HSLHKFYQLLVEQY-
>SRR6476659_2359519
------------------FESDIYHREKTSTEFGFLHNS-HPHQAFYADLVRAY-
>tr|A0A1E3QLA7|A0A1E3QLA7_9ASCO Uncharacterized protein OS=Babjeviella inositovora NRRL Y-12698 GN=BABINDRAFT_162686 PE=4 SV=1
--SILKMTALFVAVNGEpy--ID-IIkTQRNATGQYDFLNDT-HSFHKIFRLFVQQY-
>tr|A0A1B2JBB2|A0A1B2JBB2_PICPA BA75_03224T0 OS=Komagataella pastoris GN=ATY40_BA7503224 PE=4 SV=1
--DIIKLTALYVAVDKHtgngVFRKNFEqSFGGKNAQFGFLDPT-HSLNSLFNQYVNQY-
>tr|K0KDM9|K0KDM9_WICCF Splicing factor 3 subunit 1 OS=Wickerhamomyces ciferrii (strain F-60-10 / ATCC 14091 / CBS 111 / JCM 3599 / NBRC 0793 / NRRL Y-
--EIIKLTSQFVAINGEs-YITSIRnKYKDQTAQFSFLNND-HSFHQLFLKYLKQY-
>tr|W1QHG4|W1QHG4_OGAPD Pre-mRNA splicing factor OS=Ogataea parapolymorpha (strain ATCC 26012 / BCRC 20466 / JCM 22074 / NRRL Y-7560 / DL-1) GN=HPODL_0
--DMIKLAAQYVALNGEesidVLKEHVSHDKKQTIQFEFLNVS-HSLHGLFRQYLESY-
>tr|A0A0H5C985|A0A0H5C985_CYBJA Uncharacterized protein OS=Cyberlindnera jadinii GN=BN1211_5906 PE=4 SV=1
--EIVQLTAQYVAQHGEtn-GILAIKqRYLNEPLLFAFTLPN-HKWYPLFASLVKQY-
>GraSoiStandDraft_44_1057316.scaffolds.fasta_scaffold5170966_1 # 3 # 218 # -1 # ID=5170966_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.606
--ETIKLTAMFYAVNHtSeyldKFKKHLF--KKNQSQFEFLKES-HRLNPVFNKFVQQY-
>ERR1719234_818733
----ILKTAAYVAEKGPQLEIMIKTKQSGNEDFNFLMQE-DPHNIFYTEIL----
>ERR1719234_645524
------------LTTFSTLKFFAKLKRKSWTLAYLQISN-HCTRIFCYLIL----
>ERR1719233_812219
--ELVLKTAEYIRKNGLKAEIELKVKQANNPVFSFLFEH-GTLHDYFEFL-----
>ERR1719245_1666175
--MLCLKTADYVRKHGFKAEIELKVKQANNKLFSFLFEQ-SRLHGYYEFL-----
>ERR1719400_1197623
-----GRSSCFLHSrSGHRLS--------------------SCFEPLRkLHFSS---
>tr|A0A2G8SQX0|A0A2G8SQX0_9APHY Uncharacterized protein OS=Ganoderma sinense ZZ0214-1 GN=GSI_01831 PE=4 SV=1
--TAETKVTeptFPIRGHDEEYER--tlQEREKSNPKYA--flN-KEHRRHRYYRSLLE---
>tr|A0A1J8PZ94|A0A1J8PZ94_9HOMO Uncharacterized protein OS=Rhizopogon vesiculosus GN=AZE42_01902 PE=4 SV=1
--VMVAAVRaggRVVPPGGTTQKMRLllspthlfvlwplrskamapsTKRIFVNGKRI--tlDREKHRRHAFYKGLVE---
>tr|A0A165A3U2|A0A165A3U2_9HOMO Uncharacterized protein OS=Sistotremastrum niveocremeum HHB9708 GN=SISNIDRAFT_448686 PE=4 SV=1
--QFVRTVATQVKEHGDDFEKNLLEREVNNPKYTFLRQDRSRMHRLYKSLIT---
>tr|A0A0D7BXL5|A0A0D7BXL5_9AGAR Uncharacterized protein OS=Cylindrobasidium torrendii FP15055 ss-10 GN=CYLTODRAFT_416235 PE=4 SV=1
--TFIRAVAVAVKGQGEDYEESLWKRERNNPKYAFM-RPSHNKHHIYRDLLV---
>tr|A0A1X6NDE5|A0A1X6NDE5_9APHY Uncharacterized protein OS=Postia placenta MAD-698-R-SB12 GN=POSPLADRAFT_1043887 PE=4 SV=1
--QFIRLVAAEVKGHDMEYEDSLREREKSNPIYSFLksearifllwkdgiPLSLSLRTTYvVSSLLK---
>tr|A0A1Q3EB19|A0A1Q3EB19_LENED SR140 protein OS=Lentinula edodes GN=LENED_006075 PE=4 SV=1
---IIK---FSVTGGAM-------KKICSNEK-----LPMGRRYNFYRNLVE---
>SRR5258708_5282258
--EFIQEMI--------------swfKDG-KTLAKRyAWEiv----MG----AYDNFI---
>tr|E2M1Q8|E2M1Q8_MONPE Uncharacterized protein (Fragment) OS=Moniliophthora perniciosa (strain FA553 / isolate CP02) GN=MPER_13793 PE=4 SV=1
---------------DPQVE---hqmsswqIYLLTSLNNPWATsllelvqldrgpgadmtASRRSK-NA----------
>tr|A0A066W0V0|A0A066W0V0_9HOMO Uncharacterized protein (Fragment) OS=Rhizoctonia solani AG-8 WAC10335 GN=RSAG8_05447 PE=4 SV=1
--AFMDSMI--------------aafKDG-KSIHRRyAWEiv----LA----SWDLLS---
>tr|A0A1M2VS81|A0A1M2VS81_TRAPU Serine/threonine-protein phosphatase OS=Trametes pubescens GN=TRAPUB_13052 PE=3 SV=1
--EFVKNMI--------------ewfKDG-KTIPRRyVWEiv----LG----AHSYFA---
>ERR1740127_271333
--EFCMELM--------------ewqREE-KTLAKKcAYAiv----LD----MYALLR---
>tr|C5L484|C5L484_PERM5 Serine/threonine protein phosphatase, putative (Fragment) OS=Perkinsus marinus (strain ATCC 50983 / TXsc) GN=Pmar_PMAR016475 PE
--IFIDALT--------------eflKDE-KTLAKKfAYEiv---LAA----IA-YFR---
>ERR1719387_37508
--DFVTRLR--------------dlqKNQ-KNLPKDqTRIll----RQ----VAATLK---
>ERR1719409_1552002
--EFITEML--------------erfRAQ-KLIHRKyVLQil----LR----TKELLE---
>ERR1712216_77561
--SVLDLMT--------------hfaAQAhdpsMPRLDIRyVLVil----VS----FRRVLK---
>ERR1719198_187017
--DIMSLLT--------------yfnDQRfspddHPRVYNKyVFVil----VR----YLKILK---
>ERR1719262_1655632
--VHVATDCDAVAVADLEIHEALlrleellraqQDLEHVLPMNELLrpealqhfcykfvrrraVLREVR-AVV---------
>tr|A0A165ZV96|A0A165ZV96_EXIGL Uncharacterized protein OS=Exidia glandulosa HHB12029 GN=EXIGLDRAFT_753082 PE=4 SV=1
--SLMRMVVGMIKDHGKQFEEALRERENGKPQFAFLWDENSPLYRMFRRLLE---
>tr|A0A0C3PH01|A0A0C3PH01_PHLGI Uncharacterized protein OS=Phlebiopsis gigantea 11061_1 CR5-6 GN=PHLGIDRAFT_25248 PE=4 SV=1
--QFIRTVAAEVKGHGEEYAKSLQEREVSNAKFGFL-KRGHRQYRMYTNLVK---
>tr|A0A067N148|A0A067N148_9HOMO Uncharacterized protein OS=Botryobasidium botryosum FD-172 SS1 GN=BOTBODRAFT_123385 PE=4 SV=1
--RLVRTVANRVKEHGKHFEEMLKHKEKSNPKFQFLFDDTLPAYNLFYSMVD---
>tr|A0A0C3QEI8|A0A0C3QEI8_9HOMO Uncharacterized protein (Fragment) OS=Tulasnella calospora MUT 4182 GN=M407DRAFT_27123 PE=4 SV=1
---DLDPARLRSNANGSKFGDMVRHKERDNPKFSFLYDKRMPEYHLFRSIAE---
>tr|G4T4S2|G4T4S2_SERID Uncharacterized protein OS=Serendipita indica (strain DSM 11827) GN=PIIN_00094 PE=4 SV=1
--EFIELVAAMTRAHGRAFESNLMERERDNPQYQFLHAPRSAAGKFYDELLD---
>ERR1719421_11381
--ARIDTLSRFVADFD-GLEEVILHREKDNPKFAFLRAYDSSEGIYYRWRVFSF-
>ERR1719387_1235862
--SRPGSIRSRdsSRTST-GLRKSSSTARRITRNSLFSEPSTVLKGSTIAGEFFSF-
>tr|A0A1X0NNP0|A0A1X0NNP0_9TRYP Putative splicing factor 3 subunit OS=Trypanosoma theileri GN=TM35_000332170 PE=4 SV=1
--SYMTCTAQYIAKYGDRFLKDLLGRYRNNVAFRFLNSED-VRHEVLLKLV----
>tr|S9VUH9|S9VUH9_9TRYP Uncharacterized protein OS=Strigomonas culicis GN=STCU_06012 PE=4 SV=1
--DLMALTAQYSAKYGELFLRSVEAKQKRNPNFRFLQDGD-VRHGVLQQLV----
>tr|A0A088SBG1|A0A088SBG1_9TRYP Uncharacterized protein OS=Leishmania panamensis GN=LPMP_251430 PE=4 SV=1
--DVLSTMAQYTAKYGDKFLAAVKGKQRHNPIFHFLHEDD-VRHGMFCKLV----
>tr|A0A0M9G005|A0A0M9G005_9TRYP Uncharacterized protein OS=Leptomonas pyrrhocoris GN=ABB37_05409 PE=4 SV=1
--DVLSTSAQYTAKYGDKFLAAVQAKQRHNPLMHFLQEDD-VRHSTFLKLV----
>ERR1719220_258784
--------RSSWRATAAEFLTNLMNREQRNYQFDFLRPQH-SLFQYFTRLL----
>ERR1719483_710700
--DIVKLTAQFTARNGRRFLETVMQREQRNYLFDFLRPQQ-RNYLFDFLRP----
>tr|G3TTS0|G3TTS0_LOXAF Splicing factor 3a subunit 1 OS=Loxodonta africana GN=SF3A1 PE=4 SV=1
--DMVKLMAWFVAWNGSHFLTQVMQKADADF----LHPKH-MIFMYFMKLE----
>ERR1719341_1891321
--DVVKLTAQFVAIFHQAFGAVHQGSDPSKRSAEQTX------------------
>ERR1719354_552856
--DIVRLTALFAAKNAVNRTISrsKADIEGGSAIYSYS---------AGGSFN----
>ERR1719376_688777
--DIVRLTALFAAKNGRTFINHIMNKEARSTWPSP---P---ALLGRGSRQ----
>ERR550534_956142
--AAGAAKPATLAAAAAKAEPIVLKDPP--PEYEFIADPPSisaydlDVVKLTAQFV----
>SoimicmetaTmtLAB_FD_contig_51_1071025_length_873_multi_1_in_0_out_0_1 # 95 # 871 # -1 # ID=1750330_1;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.683
VKETIKTMVSFVLKNGLPFEETVREREKhrDTIKFAFLQKT-SPYYPFYV-------
>ERR1719199_1921236
--RLIRI---HDITLRRPFLTGLQNRESRNAQFDFLKPT-HPLFTYFTVMV----
>tr|A0A1Y1Y5L6|A0A1Y1Y5L6_9FUNG Uncharacterized protein OS=Basidiobolus meristosporus CBS 931.73 GN=K493DRAFT_302795 PE=4 SV=1
--LVIDKLAEQTVKN-PALAQMVMQRQFGNPKYSFMRPD-GQYFHYFQWKIQLL-
>ERR1711991_483257
--QIIYTLVQFVKKNGSRVLDDVAAKQAENPMFNFLKAD-DPLRPYFDFLK----
>ERR1712000_754576
--NVIWTLVQFAQRHGQSVVDNARTKQADNPLFGFLRPE-HELHS----------
>tr|A0A1W4W595|A0A1W4W595_AGRPL activating transcription factor 7-interacting protein 1 isoform X1 OS=Agrilus planipennis GN=LOC108732823 PE=4 SV=1
---AMVSLGRMVAQCGPGIEDIVRQRKQQDPHLWFLFHKESAPYRQYQQLVEQF-
>tr|A0A023F538|A0A023F538_TRIIF Putative splicing factor 4 (Fragment) OS=Triatoma infestans PE=2 SV=1
---VPADNNQGITEQNDGLCKIEKdAeQPQEDSKQWF-AEREIIKTEMATTVPSFE-
>tr|J9JTJ8|J9JTJ8_ACYPI Uncharacterized protein OS=Acyrthosiphon pisum GN=LOC100162504 PE=4 SV=1
----------------DEIENII-lQeRPQDVNLLFL-RDKNSPAYTIFRQRVGYL-
>tr|A0A067QMB9|A0A067QMB9_ZOONE Uncharacterized protein (Fragment) OS=Zootermopsis nevadensis GN=L798_14997 PE=4 SV=1
---AVNHLARTVAQCGDDIEQIILTRNPDDPALWFLHDKGCAAYLQYRQLVEKI-
>ERR1719400_1774934
-------------------VGRQAaGgGetpe------LRFLQEREGELYRQYRSRVDQI-
>ERR1719273_2980398
-----------------NCKDGGTwEnWga--------PAHPDVVRSVNPNPTRVDQI-
>ERR1719412_1599224
-----------------SVVDKASvGnNtmgkSVVDGMDRGSVDSMGKNRGVVNNWVGQI-
>ERR1719295_586355
--ALMEKTAQKTvqSQNAFEFEEMIRRKQGSNPQFAFMFLG-HPWNHHYEWKK----
>ERR1719295_77799
--TLMEKTAQKTvqSQNAFEFEENKDRILNSRSCFSDILGT-IITNGRK--------
>tr|A0A2A2JBU1|A0A2A2JBU1_9BILA Uncharacterized protein OS=Diploscapter pachys GN=WR25_10800 PE=4 SV=1
-EPIVQKYAEHVAERGAATELALRNR--ADLKLDFMKPE-SPFFSYYQHRVR---
>sp|Q10580|SWAP_CAEEL Protein SWAP OS=Caenorhabditis elegans GN=swp-1 PE=2 SV=1
-EPILNSYAEHVAQRGLEAEASLAAR--EDLQLHFMEPK-SPYYSYYHHKVR---
>tr|W2TB91|W2TB91_NECAM Surp module (Fragment) OS=Necator americanus GN=NECAME_10001 PE=4 SV=1
-QPVIASYAEFVARHGAEAEMELRgkfvplgffSS--RELHFRFFMP-----FSYLRSPAK---
>tr|W6MG84|W6MG84_9ASCO Uncharacterized protein OS=Kuraishia capsulata CBS 1993 GN=KUCA_T00000737001 PE=4 SV=1
--DVIRLCAQFVAKNGPDYAAALRKHVSNSVQFEFMVKG-HSLYPVFESFVEQ--
>tr|A0A1E4SXZ9|A0A1E4SXZ9_9ASCO Uncharacterized protein (Fragment) OS=Candida arabinofermentans NRRL YB-2248 GN=CANARDRAFT_191151 PE=4 SV=1
--SMIKLAAQLIAVNGTSYAAKLSQHIksteTLKVQFAFLEPS-HSLHPIYQDYLSQ--
>ERR1719186_393343
--MLIDPYALCVARYGDECVPQITKQfglfNRKNRAFfilrvgqdSFLNPG-DHFHTYYKYRVFC--
>ERR1719186_2316088
--KAIDIFACAF---ASDLEQELKQKml------TlgppVlrTISTD-GPYNTYFKLRVFS--
>ERR1719186_63574
--KVIDIIALCSVNKHVAKEQAIKQ--------rctRltFFSPD-DPHHTYYKLKVFS--
>ERR1719186_152225
--EVIDITARLLYPRGVEAKQNW----------pdSvtFFSPD-DPHHTYYKLKVFS--
>ERR1719186_1851373
--FGLSPGITKDQRHGDEHEPGLKEAivre------NvdedSflFLNPG-DLFHTYYKHLVFR--
>ERR1719203_488720
--KNISLFARYgFAAKGLELEKALAKWnf-------frpdLrnFFSPD-DPYHTYYKLKVFS--
>ERR1719508_232059
-------ELGQAIKHNCTIERAIKLEal------KqnctRltFFSPD-DLYHTYYKLRVFS--
>ERR1719186_1629628
--KVIDRCVGLFVAKGLELRQAIKHNctmeQA--IKRaaleqnctVftFFRLD-DPYHTYFKLRVFS--
>ERR1719186_1614640
--EVIDFIAPDFVANGLELKGVLEQA-------akriwpdAytFFRND-GPYHTYYKLKVFS--
>ERR1719399_1418761
-KDIADRLAEVVVE-DPGVKDVIREREKENPLFSFLRRK-SLVQTYYRCRVWTL-
>ERR1719186_565907
-RIGIDCFANVVAietrdltfREGVEYELGFIERYEDSFLTRALSPD-DLYHTYYKLRLFS--
>ERR1719186_2494476
-RISIDCFANVVAietrdltfREGVEYEQGYRERYEDSFLTRVISPD-DLYHTYCKLRVFS--
>ERR1719203_1774542
-RMQINILARLLARAGQEFDQEIGD-------IASLSPD-GVYHTYFKHRVFS--
>ERR1719494_929001
-VPKTSE----NFRALCTGEKGFEAK--QDPRFDFLNSW-NMYSPYYRFK-----
>ERR1719166_359791
-NGHRQGLLrrhrrrrpPRQGHHGAACRRGAKDL--X--------------------------
>ERR1719432_480287
-VPKTSE----NFRALCTGDK-------GSSfhrvIPNFMCQG-GDFTAGNGTG-----
>ERR1712062_433391
-VPKTSE----NFRALCTGEKGFGFK--GSSfhrdPQLHVPGW-RLHGGQRHRG-----
>ERR1740117_2500101
--SAITNLAKFVSSQGRSFEAIVQAKSGHEAKFRFLVDKTSDAHLYYLQCL----
>tr|O23409|O23409_ARATH Splicing factor like protein OS=Arabidopsis thaliana GN=dl3830c PE=4 SV=1
--TIVDKVAFLVSKYAWEFKLLVMGSNTKDPRFEFLMAsPEDPIQVSYQRRLSR--
>tr|R0GMT9|R0GMT9_9BRAS Uncharacterized protein (Fragment) OS=Capsella rubella GN=CARUB_v10006699mg PE=4 SV=1
--TIADRVASLVAKYGWEFELMFLSISTS-----------DHGHSYYQKRLGH--
>ERR1719204_157643
-QMLIDRTASYVCRQNLEFGhqkgaeKIGVVKKLHKEKFAFLFPEN-KYNSYYLFKVA---
>ERR1719391_332696
-QMLIDRTASYVCRQTLSSVtrrgprRSGLSRNFTRRNSLSYFQKT-NT-TPTACSRW---
>tr|A0A0K2V9F0|A0A0K2V9F0_LEPSM Protein suppressor of white apricotlike [Bombyx mori] OS=Lepeophtheirus salmonis PE=4 SV=1
-MMLIDRTASYVAKNGSDTmsvVrkrspkEFAFLDGDH--SNHTYFQYK-VA-LYKE---I---
>ERR1711903_159549
--EAVEALARFVARVGPGFEDLARERGSAQgARFRFLRGG--VGAAYYKYRLV---
>ERR1712088_1093926
-----------GSVQWRSVRKHCKnhNAGMDESEMAFLYDTSSTMYKKYRQRVDSLR
>ERR1719507_1911107
--TVAEELASMVACSGDQLENIAKnhNAGMDESEMAFLYEPNGKMYREYRHKVESL-
>ERR1719412_2588182
--TVAEELASMVACSGDQLENIARnhNAGLDEEEMSYVLYYSFQFRCIFNNFSEVTW
>ERR1712038_2010435
--KNRYVFQTMPNADQ----DLSRqyLACiiTFYAVEMFLYDTAGKMYRKYRQRVESLR
>ERR1712088_1176742
--TVPNGKVKLFinviYRINKQHQ--QFlkSTVIVNNIFRFLYDTSSTMYRKYRQRVASLR
>ERR1712179_174937
--LVRLNEVRWLFITKDIVSIIPRfqepesSSRANSNLYRFLYEPGSKTYRKYRQRVQELR
>ERR1712079_619488
---------VLTVTSTEAM-----sskppgHDRFSQMSEQAAIIARKKAEIEAKIKATSEE
>ERR1740129_1362901
--LiLCAKGSSQKALVGKKMLTRAInrNVN--QDGAMRATK---LLYFNLVX------
>ERR1719203_2604591
--MMVDELASMVPEFGIPSAENAPakNVG--EEA-----------------------
>ERR1719266_2629611
--MMVDELASMVAVSGDHVEEIARvgKK-----MLTRAKNRNvnrdgamratKLLYFNlaWLWAREAVG
>ERR1739838_1256056
--TVKKaLFRSMVsq--F-SKNKNINKlktNAP------CFHDDSSSD-DESNNTKSPSMI
>ERR1719378_710609
--DIIKLTAQIVVLSLPGIVISQMIVRVsslmx---------------------------
>ERR1711892_1313498
--DIIKLTAQFVARNGQKCLMPNKDEVEklkkhSNAPADILDRA-MsRYYW----------
>ERR1719446_43443
--GHHQAHRPVRGQNGQKFLIGLTQRESRNPQFDFLKPT-HALFGYFTSLV----
>ERR1719271_1263303
-----------HGPR--GLLHE-VPHAGEGRGEKLKRYA-SHNTAFLNACM----
>ERR1719240_2589006
--RThgygchq--AHSTICGTKRTEVPHCLTQRESRNPQFDFLKPT-HALFGYFTSLV----
>ERR1719261_1423063
--AVRKcVSLSYSKRRRLAFK-----NAVGCCAKRFSFPS-HALFGYFTALV----
>ERR1719387_2317273
--D---EGPDAGGDGGRAAAAARGPVHGTAPVPRAPgHGH-HQ----AHGAV----
>ERR1719321_1853167
--KFLVGLtqrearnPQFDFLKPSHALFGYFTALVdsy---------------------------
>ERR1719159_1333342
--DIIKLTAQFVARNPQFDFLKPSHAL-fgyFTALVDS--------------------
>ERR1719193_2091695
--DVIQLTAQFVARNGQKFLSGLPKGRVGTPSSISSSPL-PHFLATSH-------
>tr|A0A023B6C5|A0A023B6C5_GRENI Splicing factor OS=Gregarina niphandrodes GN=GNI_081150 PE=4 SV=1
--EVIKLVARFAACSGQAFVSGLSQRERGNNAFDFLKAS-HPSHGYFRSLM----
>ERR1711976_714763
--DIIKHSAQFVAENGQRFLIALTEREKSNSQFEFLKPT-NALFPFFTTLI----
>ERR1719362_2828081
--DIIKLTAQFVARNGQKFLIGLTQRESRNPQFDFFKTN-TCALRLLHFV-----
>ERR1719428_1874458
----taelqpppedQYtVAHPFLAPLDMDIIKR-TPQFD------FLKPS-HALSGYFTALV----
>tr|D7G3K3|D7G3K3_ECTSI Uncharacterized protein OS=Ectocarpus siliculosus GN=Esi_0051_0111 PE=4 SV=1
--ITIRQTAEWFHAN-PDKSKVIMEKSRGNAQFSFLFDASSPGGRFYRQVLDEIK
>ERR1712137_525404
LKPLIDDLALQSIEEGDLRVQDI-IHGPHASLYSDLNDPTSDVSRYYEWKKE---
>ERR1712137_508716
VKTVIDELANFVVEKGLQYEDEIRDSKDICNKYAFLSDPDCEEFKYYEYKKE---
>ERR1712125_270872
LKTNIDKFAEIVVNNGLKYEQEALLIHKGDSMYNFLLDKDSNEYKYYEYIKE---
>tr|A0A1S4BMA9|A0A1S4BMA9_TOBAC dihydropyrimidine dehydrogenase (NADP(+)), chloroplastic-like OS=Nicotiana tabacum GN=LOC107809801 PE=4 SV=1
-KTLCSELKDFMRKHNFS--SIEDFR-GSSLEY-FTTHTDLvkRQQEAIRQRK--A-
>tr|A0A068TQR1|A0A068TQR1_COFCA Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00022823001 PE=4 SV=1
-KKLCSELKDFMKKHNFS--SIDDFK-GLSLEY-FTTHTDLvkRQQEAIRQRK--A-
>ERR1712166_359016
-RKRVDAAAKLTARHGYEFQALLMEKEYDSADYKFLFDRGTa-LYNYYCWRCWSF-
>tr|M4D015|M4D015_BRARP Uncharacterized protein OS=Brassica rapa subsp. pekinensis PE=4 SV=1
---------MMIIQ-KEEYETARPYQNSWSPfkatmRLSLLIEVAPLT---PATSTRLI-
>DeetaT_9_FD_contig_21_4465114_length_342_multi_7_in_0_out_0_1 # 3 # 341 # -1 # ID=225581_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.510
-RDLIDALAREVAREAITFEKALEIEREDNPDYSILYQTPWna-TAIYYRWKLFLI-
>tr|A0A0S7K7U3|A0A0S7K7U3_9TELE SR140 (Fragment) OS=Poeciliopsis prolifica GN=SR140 PE=4 SV=1
-----TERNLLSLIHRMi---EFVV---------REGPMF---EA-------MI--
>tr|A0A1A6HH82|A0A1A6HH82_NEOLE Uncharacterized protein (Fragment) OS=Neotoma lepida GN=A6R68_20017 PE=4 SV=1
-GSFWRPPPLNPYLHGMs---EEQEAEAFVEEPSKKGALK---EEQRDKLEEILR-
>tr|A0A1Y1XB05|A0A1Y1XB05_9FUNG Uncharacterized protein OS=Anaeromyces robustus GN=BCR32DRAFT_292320 PE=4 SV=1
TKIIIEKITNYIARNGSEFETLVRQKNIGDERFSFMQPW-NIYYGYYKYKISQC-
>tr|U9TUX8|U9TUX8_RHIID Uncharacterized protein OS=Rhizophagus irregularis (strain DAOM 181602 / DAOM 197198 / MUCL 43194) GN=GLOINDRAFT_97458 PE=4 SV=
LKVIIDKMSAYVAKNGQSLEAKVREKHIDDPRFSFLLPW-NEFHPYYKHKIQEE-
>tr|A9TSV4|A9TSV4_PHYPA Predicted protein OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_171742 PE=4 SV=1
-CKYIEGLASFVAKSGPRLEEISKEKQKENPMFGFLFGG--PGHDYYVRRLWEE-
>tr|W9S666|W9S666_9ROSA G patch domain-containing protein 1 OS=Morus notabilis GN=L484_022269 PE=4 SV=1
-RLLIDGVATLVARCGKLFEDLSREKNQSNPLFSFLRGG--NGHDYYTRKLWEA-
>tr|G7JT16|G7JT16_MEDTR SWAP (Suppressor-of-white-APricot)/surp domain protein, putative OS=Medicago truncatula GN=11441585 PE=4 SV=2
-KLLMEGVANLVAKCGKLYEDLSREKNRSNPLFNFLSGG--TGHDYYARKLWEA-
>tr|A0A0L9TL30|A0A0L9TL30_PHAAN Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan01g085300 PE=4 SV=1
----ARGQAPLVIALGPVADR--------------------GRRRSFSRRAWSR-
>ERR1711971_635923
--DIIKLVAQFTARNGKALLTGLFQREQSNPLFSFIRPN-NSLFPFFTNMVDA--
>ERR1711959_666064
--DIIRLAAQFTARNGKALLTGLFQREQSNPLFSFIRPN-NSLFPFFTNMVDV--
>ERR1719159_256109
--DVIKLTAQFTAKKGKAFLTGLFLREQMDSYHKIDAIA----------------
>tr|A0A0L1HEK4|A0A0L1HEK4_9PLEO Pre-mrna splicing factor OS=Stemphylium lycopersici GN=TW65_08957 PE=3 SV=1
--EVLKLTALYTARVGENWLKDLRNRELGNYQFDFLRPN-HSYFQFFRSLVEQYK
>ERR1719489_609998
--STIRKKSPKLMKNAILKVILI----------QTVMVV-IYI---------QV-
>ERR1719427_1491137
---------------------------SKVKMFSFFFFF-NCFltivihfqIQYKSEQSRKftegvstkgslkstV-
>ERR1719322_2374111
-----KRN----------------KLKIHNSN--FYNFN-ITSipttSTWWRRYV----
>ERR1712110_800905
--DIIEKTASFLATQNIQMEILLKTKQAKNEKFHFLNYG-DPMNDYYKILKKAI-
>ERR1712035_86362
--DIIEKTAEFLATQSIQMNCPLSVPFVFPLSVSYSRSP----------------
>tr|A0A0L0SIC6|A0A0L0SIC6_ALLMA Uncharacterized protein OS=Allomyces macrogynus ATCC 38327 GN=AMAG_07504 PE=4 SV=1
--AAMRATARYVAQHGADFEAKLRA--QHDAKLAFLNPW-HGMHAGFRALVQ---
>tr|A0A1Y2HRD6|A0A1Y2HRD6_9FUNG Uncharacterized protein OS=Catenaria anguillulae PL171 GN=BCR44DRAFT_55450 PE=4 SV=1
--EVIKATAKAVARQGDAMVQMLleRN--RGDPRFAFLSPW-HRLHTTFRAYVS---
>ERR1719376_508403
--VSIDKIACCVAKKGASFQRMVASK--GDPRFNFVLPF-DEHHQYYALKLAM--
>ERR1719259_1550730
--VAIDKVACGVARRGLSFQRTIASK--GDPRFNFVLPF-DEHHQYYALKLAV--
>ERR1712238_356430
IRVVIDQTAKYAAVkNGnrnrsrLEFEARIMKKQNntnnknknknknnpnisslnGINNIDLLTTI-SPFHEYYEGRIKY--
>tr|A0A178EM20|A0A178EM20_9PLEO Uncharacterized protein OS=Pyrenochaeta sp. DS3sAY3a GN=IQ07DRAFT_527284 PE=4 SV=1
-RETIAKTAEFIFRRGAEQLPAMQQRVLSGqspAVIRFVLED-DPYHKYFMWYLQQLK
>tr|E4ZUH9|E4ZUH9_LEPMJ Similar to pre-mRNA splicing factor OS=Leptosphaeria maculans (strain JN3 / isolate v23.1.3 / race Av1-4-5-6-7-8) GN=LEMA_P1147
-RENVAKAADFIYRRGDSHLAKMKLRVANEpkSNLTFVLED-DPYHSYFLWYLQQLK
>tr|A0A0D3C1E5|A0A0D3C1E5_BRAOL Uncharacterized protein OS=Brassica oleracea var. oleracea PE=4 SV=1
--LVLERTALLVSKKELEMERRIRNSNFRNAKFNFLNSS-DPCHPFYQQKLTEYR
>tr|M4ETP9|M4ETP9_BRARP Uncharacterized protein OS=Brassica rapa subsp. pekinensis PE=4 SV=1
------------------MERRIRNSNFRNAKFNFLNNS-DPCRAFYQQRLTEYR
>tr|A0A060D7B4|A0A060D7B4_9EUKA Splicing factor 3A subunit 1 OS=Lotharella oceanica GN=sf3a1 PE=4 SV=1
-LKIIDRTAFYVSQHGKQLETKIKYKY---EKFNFLNNN-NIFFPYYIYKLK---
>tr|A0A0H5BKU5|A0A0H5BKU5_9EUKA Splicing factor 3A subunit OS=Lotharella vacuolata GN=sf3a1 PE=4 SV=1
-FKIINKTAFYVSKHGKIFEKKIKKKY---KKFTFFDNK-NIFYPYYIYKLK---
>tr|Q3LWE6|Q3LWE6_BIGNA Putative transcription factor subunit OS=Bigelowiella natans GN=trf PE=4 SV=1
-KKIIDNTAEFIIKHGRNFMTLIKKKY---DKnkiFDFLEKN-NIFYGYFIYRLN---
>tr|D2V5Z0|D2V5Z0_NAEGR Predicted protein OS=Naegleria gruberi GN=NAEGRDRAFT_46938 PE=4 SV=1
-LDIIKLTAQYTAANGKQFMFEIASLESSNPQFDFLKPN-HRHFHFFTKLVDI--
>ERR1712054_208082
-VDIIKLSAQYTAIAGQDFLPGLLARESNNVQFSFLQPG-TPYFNYFTSMVDS--
>tr|A0A1B6L7G7|A0A1B6L7G7_9HEMI Uncharacterized protein OS=Graphocephala atropunctata GN=g.44768 PE=4 SV=1
-KKLISGMVKFAYSNNQDPATLLKEL--GDSNIDFTQPG-HMFYEYYKTLVQE--
>tr|A0A1B6D8Z9|A0A1B6D8Z9_9HEMI Uncharacterized protein OS=Clastoptera arizonana GN=g.19790 PE=4 SV=1
-KVIIDKMATYVLKNGQDFETLVKNK--GDPRFLFLNCG-HEHHKYYAQQVNQ--
>tr|J7S4J3|J7S4J3_KAZNA Uncharacterized protein OS=Kazachstania naganishii (strain ATCC MYA-139 / BCRC 22969 / CBS 8797 / CCRC 22969 / KCTC 17520 / NBR
-KANIYKTAQFVNERSQNVEEQLLKD--SSGKFSFLQPD-NEHYAFYQSLR----
>tr|H2AWJ8|H2AWJ8_KAZAF Uncharacterized protein OS=Kazachstania africana (strain ATCC 22294 / BCRC 22015 / CBS 2517 / CECT 1963 / NBRC 1671 / NRRL Y-82
-KKNIEKTVGFIKANGREFEAKLLND--PRDRFSFIRPE-NEHYEHYISLL----
>tr|A0A1X7R176|A0A1X7R176_9SACH Similar to Saccharomyces cerevisiae YJL203W PRP21 Subunit of the SF3a splicing factor complex, required for spliceosome assembl
-QANIKKTAEYVRQHGRELEDRLLRE--SEDKFSFLNNQ-DVNHSYYLSIL----
>tr|C5KHD1|C5KHD1_PERM5 Uncharacterized protein OS=Perkinsus marinus (strain ATCC 50983 / TXsc) GN=Pmar_PMAR003454 PE=4 SV=1
--QLIRNTALGVIRNGAQFAEWLQTKRRGDKQYTFLFKG--LGHDYYQWCLS---
>ERR1711939_70425
--DTIRLSAQFTATNGKAFLIGLFQREQNNSLFAFIRPS-NNLYPLFTSLVDA--
>ERR1711939_671459
--------LVMSVLITQAFLTSLFQREQNNSLFAFIRPN-NSIYKFFTSLVDA--
>tr|G8BS06|G8BS06_TETPH Uncharacterized protein OS=Tetrapisispora phaffii (strain ATCC 24235 / CBS 4417 / NBRC 1672 / NRRL Y-8282 / UCD 70-5) GN=TPHA0D
IKNHILKTVNYIKEHGKSFEDELRL----DEKFSFVNPD-NEYHKYYQCMLD---
>tr|E7NJ53|E7NJ53_YEASO Prp21p OS=Saccharomyces cerevisiae (strain FostersO) GN=FOSTERSO_2419 PE=4 SV=1
LKEDIKTTVNYIKQHGVEFENKLLE----DERFSFIKKD-DPLHEYYTKLMN---
>tr|A0A0L8RG66|A0A0L8RG66_SACEU PRP21-like protein OS=Saccharomyces eubayanus GN=DI49_2830 PE=4 SV=1
IREDIKTTATYIKQHGASFESKLLE----DERFSFIKKD-DPLHEYYIKVLN---
>tr|B8C0G9|B8C0G9_THAPS Uncharacterized protein OS=Thalassiosira pseudonana GN=THAPSDRAFT_4853 PE=4 SV=1
--TIIEHTATRTATS-NQLEVFLKVKQADNGDFAFLAPS-NELHPYYLFLKY---
>ERR1712008_89244
--TIIQHTASRIASN-NQLEVFIKVKQAANANFSFLNPS-DELHRYYLFLKG---
>ERR1719272_2271255
--QLIERTAGFVAANGEQVLGALAVKQ--GARLPFLHPA-HPLHGYFRLLMLH--
>tr|A0A0L0SIC6|A0A0L0SIC6_ALLMA Uncharacterized protein OS=Allomyces macrogynus ATCC 38327 GN=AMAG_07504 PE=4 SV=1
--EILHRTATFILSQpdVTAAENLIARKQYGQPQFAFLARA-HPHRAYFDHVLAAM-
>tr|A0A0L0SLK0|A0A0L0SLK0_ALLMA Uncharacterized protein OS=Allomyces macrogynus ATCC 38327 GN=AMAG_08403 PE=4 SV=1
--EILHRTAAFITSQpdATAAENLIARKQYGQPQFAFLARA-HPYRAYFDHVLASL-
>tr|A0A1Y2HRD6|A0A1Y2HRD6_9FUNG Uncharacterized protein OS=Catenaria anguillulae PL171 GN=BCR44DRAFT_55450 PE=4 SV=1
--EIIRRTASFIRTHsnPPLAEQVLLAKQGTNPQFAFLHAS-DPNHSYYQSLIIR--
>ERR1712126_521815
-----------VCKQRMKSgndqavEEKIDlLKNSYKERFKFLFPF-SRYHNYYRFTIA---
>ERR1719387_3456950
-QQHIMETARAVHQYGAQYEDSLRHSGGNDLRYAFLFGGD--GAKYYEWAVGGFR
>ERR1719281_1183510
-KQHIMETARAVHQYGAQYEDSLRHSGRNDLRYAFLFGGD--GAKYYEWALGGFR
>ERR1719387_2862669
-QQHIMDTARAVHQYGAQYEDSLRQSGRNDLKYAFLFGGD--GAKYYEWVVGGFR
>tr|A0A2G5BA96|A0A2G5BA96_COERN Uncharacterized protein OS=Coemansia reversa (strain ATCC 12441 / NRRL 1564) GN=COEREDRAFT_81644 PE=4 SV=1
--GIVEHTARFIADQavdrAAQMEIMIQGKQGTNPDFAFLNRN-NDLHPFYQHLLWLM-
>tr|A0A1Y1W5E9|A0A1Y1W5E9_9FUNG Uncharacterized protein OS=Linderina pennispora GN=DL89DRAFT_268532 PE=4 SV=1
--EIIERTARFISSQpadrSNQMELTIQGKQGNNSDFYFLNRD-DSLYPFYKHIRWLM-
>ERR1712070_87942
---LMKVTAMYTACSGVTFLNAIREKEDRNPDFNFLSTR-SIYSKYFTSLVDAYR
>ERR1712146_363037
---LMKVTAMFAASSGSTFLNALRRKEAKNKDYSFLLPR-SIYSNYFNSLIDAYR
>tr|A0A1X0P625|A0A1X0P625_9TRYP Putative RNA-binding protein OS=Trypanosoma theileri GN=TM35_000031350 PE=4 SV=1
--KIIQLVAKYVVFscDGARYQHKLVKKTKFNSYFAFIASPEHKYHDYYQYLIRSY-
>tr|A0A0Q3SFT5|A0A0Q3SFT5_BRADI Uncharacterized protein OS=Brachypodium distachyon GN=BRADI_1g77182 PE=4 SV=1
--HRIERVARFVARDrdGELAEALLLRllrITRNGRRWGFLAHD-HPLHPYYLQQ-----
>tr|A0A287PY44|A0A287PY44_HORVV Uncharacterized protein OS=Hordeum vulgare subsp. vulgare PE=4 SV=1
--HRIERVARFVARDrdGDLAEALLRRllrITRNGRRWGFLAND-HPLHPYYLQQ-----
>tr|A0A2A9M1U6|A0A2A9M1U6_9APIC Zinc knuckle domain-containing protein OS=Besnoitia besnoiti GN=BESB_024310 PE=4 SV=1
-VKRIHTIAEYCCRN-PEMEALVRQRDGADPRFAFINGG--EGYDYFRFAVACL-
>ERR1719195_1892437
-ADLIEKTARYIHKssDPNIFENGIHKKNKGKPDWQFLDLG-GDGHDYYRFVR----
>ERR1719362_1304271
-ADLIEKTARYIHKssDPNIFERTVQEKNKGKTEWTFLETG-GDGHDYYRFVR----
>ERR1719265_2379978
-VDQIEKCCNHIAGsqDPKVFEALITDRNKGKPGWSFLEEG-GEGFDYFEFVK----
>ERR1719230_550916
------------FRsqDAKVFERLIQDRNKGNPAWAFLEEN-GEGHEYYEFVK----
>ERR1719238_2522664
-VETIEKTAQHVFKssDAKVFERVIAEKNKGKEGWGFIVEG-GEGHEYYKFCL----
>ERR1719375_400016
-TNRIAQTAEKIVKseHGPKLEEMMISKGDPN--FFFVKAD-DANHLFYKFVK----
>ERR1719379_639727
-VERIEKVARHIHSskDPKVFERMVEERNKDNAEFSFMKEG-GVGRDYYLFVR----
>ERR1719265_2472687
-IQRIEACAQKLVKseHRDKLEKMIVQKADPQ--FSFVQSD-DSNHKFYKFVT----
>ERR1719316_801900
-AERIEKCATRIVQseHGAKLEAMMVEKAEPAGETSFVLED-DLHHKFYLFVK----
>SRR5260221_9458181
---MIQQTALSTACSRRNFLASLSTREGHNPQFEFLRPT-HSLFGYFNQLVEQY-
>ERR1719186_1147134
-RRFIETAAESVARYGDTMEQQIF---------RVMNPS-HLYYAYYKRTVFSF-
>ERR1719186_999437
-RIIVDAVADMVARFRVLKEEEMKPD------KSfgtldIRNPG-HRYHAYYKLRIFSH-
>ERR1719203_744638
-RIIVDAVADMIARFQVLKEEEMKPN------KTfgtldIRNPG-HRYHAYYKLRIFSH-
>ERR1719186_1610270
----KDQPCQLSRLLDSVANAEIKLK------RPfgkldIPNPG-NRYHAYYKLRVFSH-
>ERR1719186_2269789
-SIMDLVVDVA-------IHCIARED------QTfcnssIRNPG-HRYHAYYKLKLFSH-
>tr|L8HH25|L8HH25_ACACA Variant sh3 domain containing protein OS=Acanthamoeba castellanii str. Neff GN=ACA1_175490 PE=4 SV=1
---IIEKTAGFLFRQGPEAVNTLKKQQKDNPMFLFLVEG-HPLNSYLKFLISKL-
>SRR5690348_12776882
---IMEHTANFLHLQGEEALKALRENQGDNTNFLFLVAG-HPMHPFFQLLLKRL-
>tr|Q6CUP3|Q6CUP3_KLULA KLLA0C03322p OS=Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) GN=KLLA0_C03322
-RRSIWKTVEYVIRNGATYEE-----KLKSQDIGFTQPG-NKYNDYYAFLLEH--
>tr|W0T2H8|W0T2H8_KLUMA Pre-mRNA-splicing factor sap114 OS=Kluyveromyces marxianus DMKU3-1042 GN=KLMA_10150 PE=4 SV=1
-KLDILNTVEYVHRNGTAFEA-----KLNSDKTKFVLPG-NKYYDYYSFILER--
>tr|A0A1E5RI13|A0A1E5RI13_9ASCO Pre-mRNA-splicing factor OS=Hanseniaspora osmophila GN=AWRI3579_g1535 PE=4 SV=1
-KDSILKTALYVINNGKSFesKIIT--SEQNNKNFSFLHEG-DIYHEYYKFLIDS--
>tr|G8JML6|G8JML6_ERECY Uncharacterized protein OS=Eremothecium cymbalariae (strain CBS 270.75 / DBVPG 7215 / KCTC 17166 / NRRL Y-17582) GN=Ecym_1117 P
-KNSILQTVLKLANNQNVLsqEQ-----PTKKNDIPYANPN-DKYHDYYMYYMKR--
>tr|Q752D0|Q752D0_ASHGO AFR645Wp OS=Ashbya gossypii (strain ATCC 10895 / CBS 109.51 / FGSC 9923 / NRRL Y-1056) GN=AGOS_AFR645W PE=4 SV=2
-QTQICTNVLEGLQQTSIKqtN-----------RKTNDQNM-DAYSEYYNFLLNH--
>tr|R9XB98|R9XB98_ASHAC AaceriAFR645Wp OS=Ashbya aceri GN=AACERI_AaceriAFR645W PE=4 SV=1
-QLQICTNVLEALKQRSSLpvQ-----------PTSKEPHT-TSYKDYYNFLLQH--
>tr|Q6FUL7|Q6FUL7_CANGA Uncharacterized protein OS=Candida glabrata (strain ATCC 2001 / CBS 138 / JCM 3761 / NBRC 0622 / NRRL Y-65) GN=CAGL0F02475g PE=
-KEDINKTVEYLRQYGPEFle-K-----LRGDVRFEFIEEN-SPYHSYFISQYQD--
>ERR1740139_147120
-----------AADGGTQFEVLLKVKQKNNTSFSFLLQD-HMLHPFYRFM-----
>ERR550534_2344239
---MIEHTVFFVLKHGDQSEVRLRMDPIASKKIPFIDIN-HELNPYYQWLK----
>ERR1712071_409658
---IMKKTSKNTH-QNPQFQVLLKLKQSHNPDFGFLNHN-YRLFPLYQYM-----
>ERR1740139_5452
---VMKMTASRSV-QAPQFEVLLKVKQSNNEEFGFLSPS-HPHHQYY--S-----
>ERR1719354_55336
VRERIDKTLDAMTksPNPLEFEAHLRRKQGTNPDFQFLQLG-FPGNDYFEGKKQE--
>ERR1740124_1066979
LKARIDKTVQAMCqsPTPGEFQAHLINKQGDNPDFRFLLLG-GVGNDYFEAKKQE--
>ERR1719409_141696
-QRHINDVARHVSALSAEEEEAHKQQHMTDLKFAFLFGG--EGTEYYKWVLGGF-
>ERR1740138_1251472
-QRLINEVARQVSSLDAAAEEAYRQQHMSGLQYAFLFGG--EGSQ----------
>tr|A0A0D3DWK5|A0A0D3DWK5_BRAOL Uncharacterized protein OS=Brassica oleracea var. oleracea PE=4 SV=1
-REIVERTAALVGTHGLLMERRLLAVNVNDERYDFLRSReDHPYYDFYRRKVV---
>tr|Q54DI3|Q54DI3_DICDI Uncharacterized protein OS=Dictyostelium discoideum GN=DDB0184284 PE=4 SV=1
-EQVIEIVVEFVSKNGQTFETAIQNQ-SMSAIFPFLDPS-NQYYPIYKTKLDK--
>tr|S8F1K8|S8F1K8_TOXGM Uncharacterized protein OS=Toxoplasma gondii (strain ATCC 50611 / Me49) GN=TGME49_271290 PE=4 SV=1
---VIERTAHFVRTEGSRMEFRLKLDPVVSSQLCFLSVD-HQLNAYYTYLR----
>ERR1719506_3662512
---IVEHVAHFMREHGDILEVKLRFDASKLNAVPFVHVD-HPLHPYYSYLK----
>ERR1719498_601807
--KRIDKTIVALSQspNPKKFEFHLCNKQSDNPDFLFLKIG-QEGNKYFQFKKK---
>ERR1711970_82116
--VRIDKTIEALAQspNPVKFEVHLKEKQGNNPEFMFFKPG-KDGHSYFKARKM---
>tr|A0A1B8CUM0|A0A1B8CUM0_9PEZI Uncharacterized protein OS=Pseudogymnoascus sp. 24MN13 GN=VE04_09491 PE=4 SV=1
--RVIHKTTESMLTHDLGFRGAADEQTr----------------GSARTSMGVAM-
>SRR3569833_2099955
--RMIHKVIEGILQHGPEFEALLMSRPevQKDEKWAWIWDARSEGGVWYRYRLWEI-
# STOCKHOLM 1.0
#=GF ID query
#=GF AU hmmsearch (HMMER 3.3.2)
#=GS 1ug0_A/27-81 DE [subseq from] mol:protein length:88 splicing factor 4
#=GS 6qx9_A1/50-101 DE [subseq from] mol:protein length:647 Splicing factor 3A subunit 1,Splicing factor 3A subunit 1,Splicing factor 3A subunit 1
#=GS 6qx9_A1/168-214 DE [subseq from] mol:protein length:647 Splicing factor 3A subunit 1,Splicing factor 3A subunit 1,Splicing factor 3A subunit 1
#=GS 6qx9_A1/617-627 DE [subseq from] mol:protein length:647 Splicing factor 3A subunit 1,Splicing factor 3A subunit 1,Splicing factor 3A subunit 1
#=GS 5z56_u/50-101 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 5z56_u/168-214 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 5z56_u/763-773 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 5z57_u/50-101 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 5z57_u/168-214 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 5z57_u/763-773 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 5z58_u/50-101 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 5z58_u/168-214 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 5z58_u/763-773 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 6ah0_u/50-101 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 6ah0_u/168-214 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 6ah0_u/763-773 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 6ahd_u/50-101 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 6ahd_u/168-214 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 6ahd_u/763-773 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 6ff7_p/50-101 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 6ff7_p/168-214 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 6ff7_p/763-773 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 6y53_6/50-101 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 6y53_6/168-214 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 6y53_6/763-773 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 6y5q_6/50-101 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 6y5q_6/168-214 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 6y5q_6/763-773 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 7abg_p/50-101 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 7abg_p/168-214 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 7abg_p/763-773 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 7abi_p/50-101 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 7abi_p/168-214 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 7abi_p/763-773 DE [subseq from] mol:protein length:793 Splicing factor 3A subunit 1
#=GS 1x4o_A/18-71 DE [subseq from] mol:protein length:78 Splicing factor 4
#=GS 2dt6_A/4-55 DE [subseq from] mol:protein length:64 Splicing factor 3 subunit 1
#=GS 1x4p_A/10-58 DE [subseq from] mol:protein length:66 Putative splicing factor, arginine/serine-rich 14
#=GS 2e60_A/29-73 DE [subseq from] mol:protein length:101 Splicing factor, arginine/serine-rich 8
#=GS 2e5z_A/26-70 DE [subseq from] mol:protein length:90 Splicing factor, arginine/serine-rich 8
#=GS 2dt7_B/16-24 DE [subseq from] mol:protein length:85 Splicing factor 3 subunit 1
#=GS 2dt7_B/35-82 DE [subseq from] mol:protein length:85 Splicing factor 3 subunit 1
#=GS 5nrl_V/12-55 DE [subseq from] mol:protein length:280 Pre-mRNA-splicing factor PRP21
#=GS 5nrl_V/95-135 DE [subseq from] mol:protein length:280 Pre-mRNA-splicing factor PRP21
#=GS 5zwm_w/12-55 DE [subseq from] mol:protein length:280 Pre-mRNA-splicing factor PRP21
#=GS 5zwm_w/95-135 DE [subseq from] mol:protein length:280 Pre-mRNA-splicing factor PRP21
#=GS 5zwo_w/12-55 DE [subseq from] mol:protein length:280 Pre-mRNA-splicing factor PRP21
#=GS 5zwo_w/95-135 DE [subseq from] mol:protein length:280 Pre-mRNA-splicing factor PRP21
#=GS 6g90_V/12-55 DE [subseq from] mol:protein length:280 Pre-mRNA-splicing factor PRP21
#=GS 6g90_V/95-135 DE [subseq from] mol:protein length:280 Pre-mRNA-splicing factor PRP21
#=GS 7dco_w/12-55 DE [subseq from] mol:protein length:280 Pre-mRNA-splicing factor PRP21
#=GS 7dco_w/95-135 DE [subseq from] mol:protein length:280 Pre-mRNA-splicing factor PRP21
#=GS 7oqb_V/12-55 DE [subseq from] mol:protein length:280 Pre-mRNA-splicing factor PRP21
#=GS 7oqb_V/95-135 DE [subseq from] mol:protein length:280 Pre-mRNA-splicing factor PRP21
#=GS 7oqe_V/12-55 DE [subseq from] mol:protein length:280 Pre-mRNA-splicing factor PRP21
#=GS 7oqe_V/95-135 DE [subseq from] mol:protein length:280 Pre-mRNA-splicing factor PRP21
#=GS 6ksg_A/181-200 DE [subseq from] mol:protein length:301 Methionine aminopeptidase
#=GS 6ksg_B/181-200 DE [subseq from] mol:protein length:301 Methionine aminopeptidase
#=GS 6lh7_A/181-200 DE [subseq from] mol:protein length:301 Methionine aminopeptidase
#=GS 6lh7_B/181-200 DE [subseq from] mol:protein length:301 Methionine aminopeptidase
#=GS 6k26_A/180-199 DE [subseq from] mol:protein length:300 Methionine aminopeptidase
#=GS 6k26_B/180-199 DE [subseq from] mol:protein length:300 Methionine aminopeptidase
#=GS 4jro_A/67-85 DE [subseq from] mol:protein length:271 FabG protein
#=GS 4jro_B/67-85 DE [subseq from] mol:protein length:271 FabG protein
#=GS 4jro_C/67-85 DE [subseq from] mol:protein length:271 FabG protein
#=GS 4jro_D/67-85 DE [subseq from] mol:protein length:271 FabG protein
#=GS 4xgs_A/92-128 DE [subseq from] mol:protein length:165 Ferritin
#=GS 4xgs_B/92-128 DE [subseq from] mol:protein length:165 Ferritin
#=GS 4xgs_C/92-128 DE [subseq from] mol:protein length:165 Ferritin
#=GS 4xgs_D/92-128 DE [subseq from] mol:protein length:165 Ferritin
#=GS 4xgs_E/92-128 DE [subseq from] mol:protein length:165 Ferritin
#=GS 4xgs_F/92-128 DE [subseq from] mol:protein length:165 Ferritin
#=GS 2fl4_A/24-41 DE [subseq from] mol:protein length:149 spermine/spermidine acetyltransferase
#=GS 2fl4_A/99-125 DE [subseq from] mol:protein length:149 spermine/spermidine acetyltransferase
#=GS 3guw_A/187-200 DE [subseq from] mol:protein length:261 uncharacterized protein AF_1765
#=GS 3guw_A/230-243 DE [subseq from] mol:protein length:261 uncharacterized protein AF_1765
#=GS 3guw_B/187-200 DE [subseq from] mol:protein length:261 uncharacterized protein AF_1765
#=GS 3guw_B/230-243 DE [subseq from] mol:protein length:261 uncharacterized protein AF_1765
#=GS 3guw_C/187-200 DE [subseq from] mol:protein length:261 uncharacterized protein AF_1765
#=GS 3guw_C/230-243 DE [subseq from] mol:protein length:261 uncharacterized protein AF_1765
#=GS 3guw_D/187-200 DE [subseq from] mol:protein length:261 uncharacterized protein AF_1765
#=GS 3guw_D/230-243 DE [subseq from] mol:protein length:261 uncharacterized protein AF_1765
#=GS 6zl1_A/110-135 DE [subseq from] mol:protein length:609 Albumin
#=GS 6zl1_A/284-300 DE [subseq from] mol:protein length:609 Albumin
#=GS 6zl1_B/110-135 DE [subseq from] mol:protein length:609 Albumin
#=GS 6zl1_B/284-300 DE [subseq from] mol:protein length:609 Albumin
#=GS 3kap_A/14-33 DE [subseq from] mol:protein length:147 Flavodoxin
#=GS 3kaq_A/14-33 DE [subseq from] mol:protein length:147 Flavodoxin
#=GS 3tbk_A/245-253 DE [subseq from] mol:protein length:555 RIG-I Helicase Domain
#=GS 3tbk_A/354-377 DE [subseq from] mol:protein length:555 RIG-I Helicase Domain
#=GS 3tbk_A/410-419 DE [subseq from] mol:protein length:555 RIG-I Helicase Domain
#=GS 6qgm_a/128-165 DE [subseq from] mol:protein length:531 VirX1
#=GS 6qgm_b/128-165 DE [subseq from] mol:protein length:531 VirX1
#=GS 6qgm_c/128-165 DE [subseq from] mol:protein length:531 VirX1
#=GS 6qgm_d/128-165 DE [subseq from] mol:protein length:531 VirX1
#=GS 6qgm_e/128-165 DE [subseq from] mol:protein length:531 VirX1
#=GS 6qgm_f/128-165 DE [subseq from] mol:protein length:531 VirX1
#=GS 4jgl_A/88-120 DE [subseq from] mol:protein length:169 hypothetical protein
#=GS 3kij_A/91-101 DE [subseq from] mol:protein length:180 Probable glutathione peroxidase 8
#=GS 3kij_A/115-132 DE [subseq from] mol:protein length:180 Probable glutathione peroxidase 8
#=GS 3kij_B/91-101 DE [subseq from] mol:protein length:180 Probable glutathione peroxidase 8
#=GS 3kij_B/115-132 DE [subseq from] mol:protein length:180 Probable glutathione peroxidase 8
#=GS 3kij_C/91-101 DE [subseq from] mol:protein length:180 Probable glutathione peroxidase 8
#=GS 3kij_C/115-132 DE [subseq from] mol:protein length:180 Probable glutathione peroxidase 8
#=GS 4reu_A/95-127 DE [subseq from] mol:protein length:164 Ferritin
#=GS 4reu_B/95-127 DE [subseq from] mol:protein length:164 Ferritin
#=GS 4reu_C/95-127 DE [subseq from] mol:protein length:164 Ferritin
#=GS 4reu_D/95-127 DE [subseq from] mol:protein length:164 Ferritin
#=GS 4reu_E/95-127 DE [subseq from] mol:protein length:164 Ferritin
#=GS 4reu_F/95-127 DE [subseq from] mol:protein length:164 Ferritin
#=GS 1eum_A/96-128 DE [subseq from] mol:protein length:165 FERRITIN 1
#=GS 1eum_B/96-128 DE [subseq from] mol:protein length:165 FERRITIN 1
#=GS 1eum_C/96-128 DE [subseq from] mol:protein length:165 FERRITIN 1
#=GS 1eum_D/96-128 DE [subseq from] mol:protein length:165 FERRITIN 1
#=GS 1eum_E/96-128 DE [subseq from] mol:protein length:165 FERRITIN 1
#=GS 1eum_F/96-128 DE [subseq from] mol:protein length:165 FERRITIN 1
#=GS 4ztt_A/97-129 DE [subseq from] mol:protein length:166 Bacterial non-heme ferritin
#=GS 4ztt_B/97-129 DE [subseq from] mol:protein length:166 Bacterial non-heme ferritin
#=GS 4ztt_C/97-129 DE [subseq from] mol:protein length:166 Bacterial non-heme ferritin
#=GS 4ztt_D/97-129 DE [subseq from] mol:protein length:166 Bacterial non-heme ferritin
#=GS 4ztt_E/97-129 DE [subseq from] mol:protein length:166 Bacterial non-heme ferritin
#=GS 4ztt_F/97-129 DE [subseq from] mol:protein length:166 Bacterial non-heme ferritin
#=GS 5ztc_A/44-69 DE [subseq from] mol:protein length:204 Lmo2088 protein
#=GS 5ztc_B/44-69 DE [subseq from] mol:protein length:204 Lmo2088 protein
#=GS 4lfk_B/35-51 DE [subseq from] mol:protein length:172 Galactose-6-phosphate isomerase subunit B
#=GS 4lfk_B/150-157 DE [subseq from] mol:protein length:172 Galactose-6-phosphate isomerase subunit B
#=GS 4lfk_D/35-51 DE [subseq from] mol:protein length:172 Galactose-6-phosphate isomerase subunit B
#=GS 4lfk_D/150-157 DE [subseq from] mol:protein length:172 Galactose-6-phosphate isomerase subunit B
#=GS 4lfl_B/35-51 DE [subseq from] mol:protein length:172 Galactose-6-phosphate isomerase subunit B
#=GS 4lfl_B/150-157 DE [subseq from] mol:protein length:172 Galactose-6-phosphate isomerase subunit B
#=GS 4lfl_D/35-51 DE [subseq from] mol:protein length:172 Galactose-6-phosphate isomerase subunit B
#=GS 4lfl_D/150-157 DE [subseq from] mol:protein length:172 Galactose-6-phosphate isomerase subunit B
#=GS 4lfm_B/35-51 DE [subseq from] mol:protein length:172 Galactose-6-phosphate isomerase subunit B
#=GS 4lfm_B/150-157 DE [subseq from] mol:protein length:172 Galactose-6-phosphate isomerase subunit B
#=GS 4lfm_D/35-51 DE [subseq from] mol:protein length:172 Galactose-6-phosphate isomerase subunit B
#=GS 4lfm_D/150-157 DE [subseq from] mol:protein length:172 Galactose-6-phosphate isomerase subunit B
#=GS 4lfn_B/35-51 DE [subseq from] mol:protein length:172 Galactose-6-phosphate isomerase subunit B
#=GS 4lfn_B/150-157 DE [subseq from] mol:protein length:172 Galactose-6-phosphate isomerase subunit B
#=GS 4lfn_D/35-51 DE [subseq from] mol:protein length:172 Galactose-6-phosphate isomerase subunit B
#=GS 4lfn_D/150-157 DE [subseq from] mol:protein length:172 Galactose-6-phosphate isomerase subunit B
#=GS 5lwh_A/218-229 DE [subseq from] mol:protein length:289 Enterochelin ABC transporter substrate-binding protein
#=GS 5lwh_A/249-275 DE [subseq from] mol:protein length:289 Enterochelin ABC transporter substrate-binding protein
#=GS 5mbu_A/220-231 DE [subseq from] mol:protein length:291 Enterochelin uptake periplasmic binding protein
#=GS 5mbu_A/251-277 DE [subseq from] mol:protein length:291 Enterochelin uptake periplasmic binding protein
#=GS 5mbu_C/220-231 DE [subseq from] mol:protein length:291 Enterochelin uptake periplasmic binding protein
#=GS 5mbu_C/251-277 DE [subseq from] mol:protein length:291 Enterochelin uptake periplasmic binding protein
#=GS 5mbu_B/220-231 DE [subseq from] mol:protein length:291 Enterochelin uptake periplasmic binding protein
#=GS 5mbu_B/251-277 DE [subseq from] mol:protein length:291 Enterochelin uptake periplasmic binding protein
#=GS 5gsm_A/318-328 DE [subseq from] mol:protein length:786 Exo-beta-D-glucosaminidase
#=GS 5gsm_A/514-529 DE [subseq from] mol:protein length:786 Exo-beta-D-glucosaminidase
#=GS 5gsm_A/557-568 DE [subseq from] mol:protein length:786 Exo-beta-D-glucosaminidase
#=GS 5gsm_B/318-328 DE [subseq from] mol:protein length:786 Exo-beta-D-glucosaminidase
#=GS 5gsm_B/514-529 DE [subseq from] mol:protein length:786 Exo-beta-D-glucosaminidase
#=GS 5gsm_B/557-568 DE [subseq from] mol:protein length:786 Exo-beta-D-glucosaminidase
#=GS 3cyn_A/100-110 DE [subseq from] mol:protein length:189 Probable glutathione peroxidase 8
#=GS 3cyn_A/124-141 DE [subseq from] mol:protein length:189 Probable glutathione peroxidase 8
#=GS 3cyn_B/100-110 DE [subseq from] mol:protein length:189 Probable glutathione peroxidase 8
#=GS 3cyn_B/124-141 DE [subseq from] mol:protein length:189 Probable glutathione peroxidase 8
#=GS 3cyn_C/100-110 DE [subseq from] mol:protein length:189 Probable glutathione peroxidase 8
#=GS 3cyn_C/124-141 DE [subseq from] mol:protein length:189 Probable glutathione peroxidase 8
#=GS 5mbt_A/220-231 DE [subseq from] mol:protein length:291 Enterochelin uptake periplasmic binding protein
#=GS 5mbt_A/251-277 DE [subseq from] mol:protein length:291 Enterochelin uptake periplasmic binding protein
#=GS 5mbt_B/220-231 DE [subseq from] mol:protein length:291 Enterochelin uptake periplasmic binding protein
#=GS 5mbt_B/251-277 DE [subseq from] mol:protein length:291 Enterochelin uptake periplasmic binding protein
#=GS 5mbt_C/220-231 DE [subseq from] mol:protein length:291 Enterochelin uptake periplasmic binding protein
#=GS 5mbt_C/251-277 DE [subseq from] mol:protein length:291 Enterochelin uptake periplasmic binding protein
#=GS 5lsj_B/132-166 DE [subseq from] mol:protein length:176 Polyamine-modulated factor 1
#=GS 5lsj_E/132-166 DE [subseq from] mol:protein length:176 Polyamine-modulated factor 1
#=GS 5lsk_B/132-166 DE [subseq from] mol:protein length:176 Polyamine-modulated factor 1
1ug0_A/27-81 TRRVIEKLARFVAEGGPELEKVAMEDYK...DN.PAFTFLHDKNSREFLYYRRKVAEIR
#=GR 1ug0_A/27-81 PP 8***************************...**.***********************97
6qx9_A1/50-101 -RNIVDKTASFVARNGPEFEARIRQNEI...NN.PKFNFLNP-NDPYHAYYRHKVSEF-
#=GR 6qx9_A1/50-101 PP .89*************************...**.*******7.99************8.
6qx9_A1/168-214 -----KLTAQFVARNGRQFLTQLMQKEQ...RN.YQFDFLRPQHSL-FNYFTKLVEQ--
#=GR 6qx9_A1/168-214 PP .....568********************...**.********9996.9***999986..
6qx9_A1/617-627 ----------------------------...--.----FIKDSNSLAY--Y--------
#=GR 6qx9_A1/617-627 PP ......................................7888888875..5........
5z56_u/50-101 -RNIVDKTASFVARNGPEFEARIRQNEI...NN.PKFNFLNP-NDPYHAYYRHKVSEF-
#=GR 5z56_u/50-101 PP .89*************************...**.*******7.99************8.
5z56_u/168-214 -----KLTAQFVARNGRQFLTQLMQKEQ...RN.YQFDFLRPQHSL-FNYFTKLVEQ--
#=GR 5z56_u/168-214 PP .....568********************...**.********9996.9***999986..
5z56_u/763-773 ----------------------------...--.----FIKDSNSLAY--Y--------
#=GR 5z56_u/763-773 PP ......................................8889999876..4........
5z57_u/50-101 -RNIVDKTASFVARNGPEFEARIRQNEI...NN.PKFNFLNP-NDPYHAYYRHKVSEF-
#=GR 5z57_u/50-101 PP .89*************************...**.*******7.99************8.
5z57_u/168-214 -----KLTAQFVARNGRQFLTQLMQKEQ...RN.YQFDFLRPQHSL-FNYFTKLVEQ--
#=GR 5z57_u/168-214 PP .....568********************...**.********9996.9***999986..
5z57_u/763-773 ----------------------------...--.----FIKDSNSLAY--Y--------
#=GR 5z57_u/763-773 PP ......................................8889999876..4........
5z58_u/50-101 -RNIVDKTASFVARNGPEFEARIRQNEI...NN.PKFNFLNP-NDPYHAYYRHKVSEF-
#=GR 5z58_u/50-101 PP .89*************************...**.*******7.99************8.
5z58_u/168-214 -----KLTAQFVARNGRQFLTQLMQKEQ...RN.YQFDFLRPQHSL-FNYFTKLVEQ--
#=GR 5z58_u/168-214 PP .....568********************...**.********9996.9***999986..
5z58_u/763-773 ----------------------------...--.----FIKDSNSLAY--Y--------
#=GR 5z58_u/763-773 PP ......................................8889999876..4........
6ah0_u/50-101 -RNIVDKTASFVARNGPEFEARIRQNEI...NN.PKFNFLNP-NDPYHAYYRHKVSEF-
#=GR 6ah0_u/50-101 PP .89*************************...**.*******7.99************8.
6ah0_u/168-214 -----KLTAQFVARNGRQFLTQLMQKEQ...RN.YQFDFLRPQHSL-FNYFTKLVEQ--
#=GR 6ah0_u/168-214 PP .....568********************...**.********9996.9***999986..
6ah0_u/763-773 ----------------------------...--.----FIKDSNSLAY--Y--------
#=GR 6ah0_u/763-773 PP ......................................8889999876..4........
6ahd_u/50-101 -RNIVDKTASFVARNGPEFEARIRQNEI...NN.PKFNFLNP-NDPYHAYYRHKVSEF-
#=GR 6ahd_u/50-101 PP .89*************************...**.*******7.99************8.
6ahd_u/168-214 -----KLTAQFVARNGRQFLTQLMQKEQ...RN.YQFDFLRPQHSL-FNYFTKLVEQ--
#=GR 6ahd_u/168-214 PP .....568********************...**.********9996.9***999986..
6ahd_u/763-773 ----------------------------...--.----FIKDSNSLAY--Y--------
#=GR 6ahd_u/763-773 PP ......................................8889999876..4........
6ff7_p/50-101 -RNIVDKTASFVARNGPEFEARIRQNEI...NN.PKFNFLNP-NDPYHAYYRHKVSEF-
#=GR 6ff7_p/50-101 PP .89*************************...**.*******7.99************8.
6ff7_p/168-214 -----KLTAQFVARNGRQFLTQLMQKEQ...RN.YQFDFLRPQHSL-FNYFTKLVEQ--
#=GR 6ff7_p/168-214 PP .....568********************...**.********9996.9***999986..
6ff7_p/763-773 ----------------------------...--.----FIKDSNSLAY--Y--------
#=GR 6ff7_p/763-773 PP ......................................8889999876..4........
6y53_6/50-101 -RNIVDKTASFVARNGPEFEARIRQNEI...NN.PKFNFLNP-NDPYHAYYRHKVSEF-
#=GR 6y53_6/50-101 PP .89*************************...**.*******7.99************8.
6y53_6/168-214 -----KLTAQFVARNGRQFLTQLMQKEQ...RN.YQFDFLRPQHSL-FNYFTKLVEQ--
#=GR 6y53_6/168-214 PP .....568********************...**.********9996.9***999986..
6y53_6/763-773 ----------------------------...--.----FIKDSNSLAY--Y--------
#=GR 6y53_6/763-773 PP ......................................8889999876..4........
6y5q_6/50-101 -RNIVDKTASFVARNGPEFEARIRQNEI...NN.PKFNFLNP-NDPYHAYYRHKVSEF-
#=GR 6y5q_6/50-101 PP .89*************************...**.*******7.99************8.
6y5q_6/168-214 -----KLTAQFVARNGRQFLTQLMQKEQ...RN.YQFDFLRPQHSL-FNYFTKLVEQ--
#=GR 6y5q_6/168-214 PP .....568********************...**.********9996.9***999986..
6y5q_6/763-773 ----------------------------...--.----FIKDSNSLAY--Y--------
#=GR 6y5q_6/763-773 PP ......................................8889999876..4........
7abg_p/50-101 -RNIVDKTASFVARNGPEFEARIRQNEI...NN.PKFNFLNP-NDPYHAYYRHKVSEF-
#=GR 7abg_p/50-101 PP .89*************************...**.*******7.99************8.
7abg_p/168-214 -----KLTAQFVARNGRQFLTQLMQKEQ...RN.YQFDFLRPQHSL-FNYFTKLVEQ--
#=GR 7abg_p/168-214 PP .....568********************...**.********9996.9***999986..
7abg_p/763-773 ----------------------------...--.----FIKDSNSLAY--Y--------
#=GR 7abg_p/763-773 PP ......................................8889999876..4........
7abi_p/50-101 -RNIVDKTASFVARNGPEFEARIRQNEI...NN.PKFNFLNP-NDPYHAYYRHKVSEF-
#=GR 7abi_p/50-101 PP .89*************************...**.*******7.99************8.
7abi_p/168-214 -----KLTAQFVARNGRQFLTQLMQKEQ...RN.YQFDFLRPQHSL-FNYFTKLVEQ--
#=GR 7abi_p/168-214 PP .....568********************...**.********9996.9***999986..
7abi_p/763-773 ----------------------------...--.----FIKDSNSLAY--Y--------
#=GR 7abi_p/763-773 PP ......................................8889999876..4........
1x4o_A/18-71 -KNLAEKLARFIADGGPEVETIALQNNR...EN.QAFSFLYDPNSQGYRYYRQKLDEFR
#=GR 1x4o_A/18-71 PP .689************************...**.***********************97
2dt6_A/4-55 -RNIVDKTASFVARNGPEFEARIRQNEI...NN.PKFNFLNP-NDPYHAYYRHKVSEF-
#=GR 2dt6_A/4-55 PP .89*************************...**.*******7.99************8.
1x4p_A/10-58 ---TIDQLVKRVIEGSLSPKERTL--LK...ED.PAYWFLSDENSLEYKYYKLKLAEM-
#=GR 1x4p_A/10-58 PP ...79***********99888877..9*...**.***********************8.
2e60_A/29-73 --AIIERTASFVCRQGAQFEIMLKAKQA...RN.SQFDFLRFDHYL-NPYYK-------
#=GR 2e60_A/29-73 PP ..59************************...**.*******97765.79998.......
2e5z_A/26-70 --PVIDKLAEYVARNGLKFETSVR--AK...ND.QRFEFLQPWHQ-YNAYYEFK-----
#=GR 2e5z_A/26-70 PP ..59*****************998..89...**.********985.78999876.....
2dt7_B/16-24 ----------------------------...--.PEFEFIADP----------------
#=GR 2dt7_B/16-24 PP ..................................788888776................
2dt7_B/35-82 ----VKLTAQFVARNGRQFLTQLMQKEQ...RN.YQFDFLRPQHSL-FNYFTKLVEQ--
#=GR 2dt7_B/35-82 PP ....5568********************...**.********9996.9****99986..
5nrl_V/12-55 ----IKTTVNYIKQHGVEFENKLL----...ED.ERFSFIKK-DDPLHEYYTKLMNE--
#=GR 5nrl_V/12-55 PP ....678999***********998.......79.99****96.6788****987665..
5nrl_V/95-135 ---VIKLTARYYAKDKSIVEQMIS--KD...GE.ARLNFMNSSH-PLHKTF--------
#=GR 5nrl_V/95-135 PP ...78889**************96..55...66.7888888754.556665........
5zwm_w/12-55 ----IKTTVNYIKQHGVEFENKLL----...ED.ERFSFIKK-DDPLHEYYTKLMNE--
#=GR 5zwm_w/12-55 PP ....678999***********998.......79.99****96.6788****987665..
5zwm_w/95-135 ---VIKLTARYYAKDKSIVEQMIS--KD...GE.ARLNFMNSSH-PLHKTF--------
#=GR 5zwm_w/95-135 PP ...78889**************96..55...66.7888888754.556665........
5zwo_w/12-55 ----IKTTVNYIKQHGVEFENKLL----...ED.ERFSFIKK-DDPLHEYYTKLMNE--
#=GR 5zwo_w/12-55 PP ....678999***********998.......79.99****96.6788****987665..
5zwo_w/95-135 ---VIKLTARYYAKDKSIVEQMIS--KD...GE.ARLNFMNSSH-PLHKTF--------
#=GR 5zwo_w/95-135 PP ...78889**************96..55...66.7888888754.556665........
6g90_V/12-55 ----IKTTVNYIKQHGVEFENKLL----...ED.ERFSFIKK-DDPLHEYYTKLMNE--
#=GR 6g90_V/12-55 PP ....678999***********998.......79.99****96.6788****987665..
6g90_V/95-135 ---VIKLTARYYAKDKSIVEQMIS--KD...GE.ARLNFMNSSH-PLHKTF--------
#=GR 6g90_V/95-135 PP ...78889**************96..55...66.7888888754.556665........
7dco_w/12-55 ----IKTTVNYIKQHGVEFENKLL----...ED.ERFSFIKK-DDPLHEYYTKLMNE--
#=GR 7dco_w/12-55 PP ....678999***********998.......79.99****96.6788****987665..
7dco_w/95-135 ---VIKLTARYYAKDKSIVEQMIS--KD...GE.ARLNFMNSSH-PLHKTF--------
#=GR 7dco_w/95-135 PP ...78889**************96..55...66.7888888754.556665........
7oqb_V/12-55 ----IKTTVNYIKQHGVEFENKLL----...ED.ERFSFIKK-DDPLHEYYTKLMNE--
#=GR 7oqb_V/12-55 PP ....678999***********998.......79.99****96.6788****987665..
7oqb_V/95-135 ---VIKLTARYYAKDKSIVEQMIS--KD...GE.ARLNFMNSSH-PLHKTF--------
#=GR 7oqb_V/95-135 PP ...78889**************96..55...66.7888888754.556665........
7oqe_V/12-55 ----IKTTVNYIKQHGVEFENKLL----...ED.ERFSFIKK-DDPLHEYYTKLMNE--
#=GR 7oqe_V/12-55 PP ....678999***********998.......79.99****96.6788****987665..
7oqe_V/95-135 ---VIKLTARYYAKDKSIVEQMIS--KD...GE.ARLNFMNSSH-PLHKTF--------
#=GR 7oqe_V/95-135 PP ...78889**************96..55...66.7888888754.556665........
6ksg_A/181-200 ---------------GTTIEKHIKTNNK...NN.PRFKF--------------------
#=GR 6ksg_A/181-200 PP ...............7889*********...**.****9....................
6ksg_B/181-200 ---------------GTTIEKHIKTNNK...NN.PRFKF--------------------
#=GR 6ksg_B/181-200 PP ...............7889*********...**.****9....................
6lh7_A/181-200 ---------------GTTIEKHIKTNNK...NN.PRFKF--------------------
#=GR 6lh7_A/181-200 PP ...............7889*********...**.****9....................
6lh7_B/181-200 ---------------GTTIEKHIKTNNK...NN.PRFKF--------------------
#=GR 6lh7_B/181-200 PP ...............7889*********...**.****9....................
6k26_A/180-199 ---------------GTTIEKHIKTNNK...NN.PRFKF--------------------
#=GR 6k26_A/180-199 PP ...............7889*********...**.****9....................
6k26_B/180-199 ---------------GTTIEKHIKTNNK...NN.PRFKF--------------------
#=GR 6k26_B/180-199 PP ...............7889*********...**.****9....................
4jro_A/67-85 ----AEETAKLVAEHGVEVEAMK-----...--.-------------------------
#=GR 4jro_A/67-85 PP ....89**************985....................................
4jro_B/67-85 ----AEETAKLVAEHGVEVEAMK-----...--.-------------------------
#=GR 4jro_B/67-85 PP ....89**************985....................................
4jro_C/67-85 ----AEETAKLVAEHGVEVEAMK-----...--.-------------------------
#=GR 4jro_C/67-85 PP ....89**************985....................................
4jro_D/67-85 ----AEETAKLVAEHGVEVEAMK-----...--.-------------------------
#=GR 4jro_D/67-85 PP ....89**************985....................................
4xgs_A/92-128 ------KLEQLITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4xgs_A/92-128 PP ......8999999999***99998.789...**.******97766665...........
4xgs_B/92-128 ------KLEQLITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4xgs_B/92-128 PP ......8999999999***99998.789...**.******97766665...........
4xgs_C/92-128 ------KLEQLITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4xgs_C/92-128 PP ......8999999999***99998.789...**.******97766665...........
4xgs_D/92-128 ------KLEQLITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4xgs_D/92-128 PP ......8999999999***99998.789...**.******97766665...........
4xgs_E/92-128 ------KLEQLITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4xgs_E/92-128 PP ......8999999999***99998.789...**.******97766665...........
4xgs_F/92-128 ------KLEQLITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4xgs_F/92-128 PP ......8999999999***99998.789...**.******97766665...........
2fl4_A/24-41 ------------AEQQAFIESMA-ENLK...ES.D------------------------
#=GR 2fl4_A/24-41 PP ............55555666665.5666...65.5........................
2fl4_A/99-125 ----------------------LIEKYQ...TN.KLYLSVYDTNSSAIRLYQQ------
#=GR 2fl4_A/99-125 PP ......................579***...**.*****************97......
3guw_A/187-200 ----AEDAARIVAEHGPE----------...--.-------------------------
#=GR 3guw_A/187-200 PP ....7889*********9.........................................
3guw_A/230-243 --------------GREEMEKVARENAR...--.-------------------------
#=GR 3guw_A/230-243 PP ..............7789*******987...............................
3guw_B/187-200 ----AEDAARIVAEHGPE----------...--.-------------------------
#=GR 3guw_B/187-200 PP ....7889*********9.........................................
3guw_B/230-243 --------------GREEMEKVARENAR...--.-------------------------
#=GR 3guw_B/230-243 PP ..............7789*******987...............................
3guw_C/187-200 ----AEDAARIVAEHGPE----------...--.-------------------------
#=GR 3guw_C/187-200 PP ....7889*********9.........................................
3guw_C/230-243 --------------GREEMEKVARENAR...--.-------------------------
#=GR 3guw_C/230-243 PP ..............7789*******987...............................
3guw_D/187-200 ----AEDAARIVAEHGPE----------...--.-------------------------
#=GR 3guw_D/187-200 PP ....7889*********9.........................................
3guw_D/230-243 --------------GREEMEKVARENAR...--.-------------------------
#=GR 3guw_D/230-243 PP ..............7789*******987...............................
6zl1_A/110-135 ------EMADCCAKQEPERNECFLQHKK...DN.PN-----------------------
#=GR 6zl1_A/110-135 PP ......699*******************...**.*7.......................
6zl1_A/284-300 -------LAKYICENQDSISSKLK----...--.-------------------------
#=GR 6zl1_A/284-300 PP .......89******998877665...................................
6zl1_B/110-135 ------EMADCCAKQEPERNECFLQHKK...DN.PN-----------------------
#=GR 6zl1_B/110-135 PP ......699*******************...**.*7.......................
6zl1_B/284-300 -------LAKYICENQDSISSKLK----...--.-------------------------
#=GR 6zl1_B/284-300 PP .......89******998877665...................................
3kap_A/14-33 TESIAQKLEELVAAGGHEVT--------...--.-------------------------
#=GR 3kap_A/14-33 PP 5689**************86.......................................
3kaq_A/14-33 TESIAQKLEELVAAGGHEVT--------...--.-------------------------
#=GR 3kaq_A/14-33 PP 5689**************86.......................................
3tbk_A/245-253 -----EKLAKDVSE--------------...--.-------------------------
#=GR 3tbk_A/245-253 PP .....899998876.............................................
3tbk_A/354-377 -------------EKLEELEKVSRDPSN...EN.PKLRDLY------------------
#=GR 3tbk_A/354-377 PP .............66789**********...**.**99777..................
3tbk_A/410-419 ---------------------------E...EN.PALSFLK------------------
#=GR 3tbk_A/410-419 PP ...........................6...9*.******7..................
6qgm_a/128-165 -----ETFARFVNSNTYLAEHNRLTRNK...DN.KIPNFNFDWDTAY------------
#=GR 6qgm_a/128-165 PP .....678********************...**.8888888887654............
6qgm_b/128-165 -----ETFARFVNSNTYLAEHNRLTRNK...DN.KIPNFNFDWDTAY------------
#=GR 6qgm_b/128-165 PP .....678********************...**.8888888887654............
6qgm_c/128-165 -----ETFARFVNSNTYLAEHNRLTRNK...DN.KIPNFNFDWDTAY------------
#=GR 6qgm_c/128-165 PP .....678********************...**.8888888887654............
6qgm_d/128-165 -----ETFARFVNSNTYLAEHNRLTRNK...DN.KIPNFNFDWDTAY------------
#=GR 6qgm_d/128-165 PP .....678********************...**.8888888887654............
6qgm_e/128-165 -----ETFARFVNSNTYLAEHNRLTRNK...DN.KIPNFNFDWDTAY------------
#=GR 6qgm_e/128-165 PP .....678********************...**.8888888887654............
6qgm_f/128-165 -----ETFARFVNSNTYLAEHNRLTRNK...DN.KIPNFNFDWDTAY------------
#=GR 6qgm_f/128-165 PP .....678********************...**.8888888887654............
4jgl_A/88-120 ---------MIVPRGGDKLEITIKKSSM...KNtPSFTFIPTPDC--------------
#=GR 4jgl_A/88-120 PP .........57899*********99998...8879****876665..............
3kij_A/91-101 ----------------KEVESFARKNY-...--.-------------------------
#=GR 3kij_A/91-101 PP ................699*9999987................................
3kij_A/115-132 -------------------------GSE...GE.PAFRFLVDSSKKE------------
#=GR 3kij_A/115-132 PP .........................567...78.*******987765............
3kij_B/91-101 ----------------KEVESFARKNY-...--.-------------------------
#=GR 3kij_B/91-101 PP ................699*9999987................................
3kij_B/115-132 -------------------------GSE...GE.PAFRFLVDSSKKE------------
#=GR 3kij_B/115-132 PP .........................567...78.*******987765............
3kij_C/91-101 ----------------KEVESFARKNY-...--.-------------------------
#=GR 3kij_C/91-101 PP ................699*9999987................................
3kij_C/115-132 -------------------------GSE...GE.PAFRFLVDSSKKE------------
#=GR 3kij_C/115-132 PP .........................567...78.*******987765............
4reu_A/95-127 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4reu_A/95-127 PP ..........56666678877777.789...**.******97766665...........
4reu_B/95-127 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4reu_B/95-127 PP ..........56666678877777.789...**.******97766665...........
4reu_C/95-127 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4reu_C/95-127 PP ..........56666678877777.789...**.******97766665...........
4reu_D/95-127 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4reu_D/95-127 PP ..........56666678877777.789...**.******97766665...........
4reu_E/95-127 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4reu_E/95-127 PP ..........56666678877777.789...**.******97766665...........
4reu_F/95-127 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4reu_F/95-127 PP ..........56666678877777.789...**.******97766665...........
1eum_A/96-128 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 1eum_A/96-128 PP ..........56666678877777.789...**.******97766665...........
1eum_B/96-128 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 1eum_B/96-128 PP ..........56666678877777.789...**.******97766665...........
1eum_C/96-128 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 1eum_C/96-128 PP ..........56666678877777.789...**.******97766665...........
1eum_D/96-128 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 1eum_D/96-128 PP ..........56666678877777.789...**.******97766665...........
1eum_E/96-128 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 1eum_E/96-128 PP ..........56666678877777.789...**.******97766665...........
1eum_F/96-128 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 1eum_F/96-128 PP ..........56666678877777.789...**.******97766665...........
4ztt_A/97-129 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4ztt_A/97-129 PP ..........56666678877777.789...**.******97666655...........
4ztt_B/97-129 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4ztt_B/97-129 PP ..........56666678877777.789...**.******97666655...........
4ztt_C/97-129 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4ztt_C/97-129 PP ..........56666678877777.789...**.******97666655...........
4ztt_D/97-129 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4ztt_D/97-129 PP ..........56666678877777.789...**.******97666655...........
4ztt_E/97-129 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4ztt_E/97-129 PP ..........56666678877777.789...**.******97666655...........
4ztt_F/97-129 ----------LITQKINELAHAAM-TNQ...DY.PTFNFLQWYVSEQH-----------
#=GR 4ztt_F/97-129 PP ..........56666678877777.789...**.******97666655...........
5ztc_A/44-69 --------------------------DK...DD.LFLSIMKDAKSTEIDYYRAKLR---
#=GR 5ztc_A/44-69 PP ..........................56...77.77788999************96...
5ztc_B/44-69 --------------------------DK...DD.LFLSIMKDAKSTEIDYYRAKLR---
#=GR 5ztc_B/44-69 PP ..........................56...77.77788999************96...
4lfk_B/35-51 ----------------------------...--.------YDTHRTHYPIYGKKVAE--
#=GR 4lfk_B/35-51 PP ........................................89999***********9..
4lfk_B/150-157 -------------------------DQK...DN.PHF----------------------
#=GR 4lfk_B/150-157 PP .........................79*...**.*99......................
4lfk_D/35-51 ----------------------------...--.------YDTHRTHYPIYGKKVAE--
#=GR 4lfk_D/35-51 PP ........................................89999***********9..
4lfk_D/150-157 -------------------------DQK...DN.PHF----------------------
#=GR 4lfk_D/150-157 PP .........................79*...**.*99......................
4lfl_B/35-51 ----------------------------...--.------YDTHRTHYPIYGKKVAE--
#=GR 4lfl_B/35-51 PP ........................................89999***********9..
4lfl_B/150-157 -------------------------DQK...DN.PHF----------------------
#=GR 4lfl_B/150-157 PP .........................79*...**.*99......................
4lfl_D/35-51 ----------------------------...--.------YDTHRTHYPIYGKKVAE--
#=GR 4lfl_D/35-51 PP ........................................89999***********9..
4lfl_D/150-157 -------------------------DQK...DN.PHF----------------------
#=GR 4lfl_D/150-157 PP .........................79*...**.*99......................
4lfm_B/35-51 ----------------------------...--.------YDTHRTHYPIYGKKVAE--
#=GR 4lfm_B/35-51 PP ........................................89999***********9..
4lfm_B/150-157 -------------------------DQK...DN.PHF----------------------
#=GR 4lfm_B/150-157 PP .........................79*...**.*99......................
4lfm_D/35-51 ----------------------------...--.------YDTHRTHYPIYGKKVAE--
#=GR 4lfm_D/35-51 PP ........................................89999***********9..
4lfm_D/150-157 -------------------------DQK...DN.PHF----------------------
#=GR 4lfm_D/150-157 PP .........................79*...**.*99......................
4lfn_B/35-51 ----------------------------...--.------YDTHRTHYPIYGKKVAE--
#=GR 4lfn_B/35-51 PP ........................................89999***********9..
4lfn_B/150-157 -------------------------DQK...DN.PHF----------------------
#=GR 4lfn_B/150-157 PP .........................79*...**.*99......................
4lfn_D/35-51 ----------------------------...--.------YDTHRTHYPIYGKKVAE--
#=GR 4lfn_D/35-51 PP ........................................89999***********9..
4lfn_D/150-157 -------------------------DQK...DN.PHF----------------------
#=GR 4lfn_D/150-157 PP .........................79*...**.*99......................
5lwh_A/218-229 ----------------------------...KN.PDYIFVVDRN---------------
#=GR 5lwh_A/218-229 PP ...............................79.9999998876...............
5lwh_A/249-275 -------------------KTKAAQNKKiiyLD.PEYWFLASGNGLE------------
#=GR 5lwh_A/249-275 PP ...................55555555522248.******9999876............
5mbu_A/220-231 ----------------------------...KN.PDYIFVVDRN---------------
#=GR 5mbu_A/220-231 PP ...............................79.***9998876...............
5mbu_A/251-277 -------------------KTKAAQNKKiiyLD.PEYWFLASGNGLE------------
#=GR 5mbu_A/251-277 PP ...................55555555522248.******9999876............
5mbu_C/220-231 ----------------------------...KN.PDYIFVVDRN---------------
#=GR 5mbu_C/220-231 PP ...............................79.***9998876...............
5mbu_C/251-277 -------------------KTKAAQNKKiiyLD.PEYWFLASGNGLE------------
#=GR 5mbu_C/251-277 PP ...................55555555522248.******9999876............
5mbu_B/220-231 ----------------------------...KN.PDYIFVVDRN---------------
#=GR 5mbu_B/220-231 PP ...............................79.***9998876...............
5mbu_B/251-277 -------------------KTKAAQNKKiiyLD.PEYWFLASGNGLE------------
#=GR 5mbu_B/251-277 PP ...................55555555522248.******9999876............
5gsm_A/318-328 ----------------------------...--.-------KEDKLGHIYYK-------
#=GR 5gsm_A/318-328 PP .........................................57788999**9.......
5gsm_A/514-529 -REVQDKLVEFVARGGN-----------...--.-------------------------
#=GR 5gsm_A/514-529 PP .6788**********95..........................................
5gsm_A/557-568 ---------------------VEREKAR...RN.PRL----------------------
#=GR 5gsm_A/557-568 PP .....................5689999...9*.974......................
5gsm_B/318-328 ----------------------------...--.-------KEDKLGHIYYK-------
#=GR 5gsm_B/318-328 PP .........................................57788999**9.......
5gsm_B/514-529 -REVQDKLVEFVARGGN-----------...--.-------------------------
#=GR 5gsm_B/514-529 PP .6788**********95..........................................
5gsm_B/557-568 ---------------------VEREKAR...RN.PRL----------------------
#=GR 5gsm_B/557-568 PP .....................5689999...9*.974......................
3cyn_A/100-110 ----------------KEVESFARKNY-...--.-------------------------
#=GR 3cyn_A/100-110 PP ................699*9999987................................
3cyn_A/124-141 -------------------------GSE...GE.PAFRFLVDSSKKE------------
#=GR 3cyn_A/124-141 PP .........................567...78.*******987765............
3cyn_B/100-110 ----------------KEVESFARKNY-...--.-------------------------
#=GR 3cyn_B/100-110 PP ................699*9999987................................
3cyn_B/124-141 -------------------------GSE...GE.PAFRFLVDSSKKE------------
#=GR 3cyn_B/124-141 PP .........................567...78.*******987765............
3cyn_C/100-110 ----------------KEVESFARKNY-...--.-------------------------
#=GR 3cyn_C/100-110 PP ................699*9999987................................
3cyn_C/124-141 -------------------------GSE...GE.PAFRFLVDSSKKE------------
#=GR 3cyn_C/124-141 PP .........................567...78.*******987765............
5mbt_A/220-231 ----------------------------...KN.PDYIFVVDRN---------------
#=GR 5mbt_A/220-231 PP ...............................7*.*****98876...............
5mbt_A/251-277 -------------------KTKAAQNKKiiyLD.PEYWFLASGNGLE------------
#=GR 5mbt_A/251-277 PP ...................55555555522248.******9999876............
5mbt_B/220-231 ----------------------------...KN.PDYIFVVDRN---------------
#=GR 5mbt_B/220-231 PP ...............................7*.*****98876...............
5mbt_B/251-277 -------------------KTKAAQNKKiiyLD.PEYWFLASGNGLE------------
#=GR 5mbt_B/251-277 PP ...................55555555522248.******9999876............
5mbt_C/220-231 ----------------------------...KN.PDYIFVVDRN---------------
#=GR 5mbt_C/220-231 PP ...............................7*.*****98876...............
5mbt_C/251-277 -------------------KTKAAQNKKiiyLD.PEYWFLASGNGLE------------
#=GR 5mbt_C/251-277 PP ...................55555555522248.******9999876............
5lsj_B/132-166 ------QLADAVLAGRRQVEELQLQVQA...QQ.QAWQALHREQR--------------
#=GR 5lsj_B/132-166 PP ......79********************...**.***99999875..............
5lsj_E/132-166 ------QLADAVLAGRRQVEELQLQVQA...QQ.QAWQALHREQR--------------
#=GR 5lsj_E/132-166 PP ......79********************...**.***99999875..............
5lsk_B/132-166 ------QLADAVLAGRRQVEELQLQVQA...QQ.QAWQALHREQR--------------
#=GR 5lsk_B/132-166 PP ......79********************...**.***99999875..............
#=GC PP_cons 68898889**999999999999989889...99.*9**999887877899899999887
#=GC RF xxxxxxxxxxxxxxxxxxxxxxxxxxxx...xx.xxxxxxxxxxxxxxxxxxxxxxxxx
//
# STOCKHOLM 1.0
#=GF ID chain_sp_Q8IWZ8_SUGP1_HUMAN_SURP_and_G-patch_domain-containing_protein_1_OS_Homo_sapiens_OX_9606_GN_SUGP1_PE_1-SV_2_188_242-i1
#=GF AU jackhmmer (HMMER 3.3.2)
#=GS MGYP000082224325/20-68 DE [subseq from] PL=00 UP=0 BIOMES=0000000011000
#=GS MGYP000082224325/91-127 DE [subseq from] PL=00 UP=0 BIOMES=0000000011000
#=GS MGYP000277099866/2-44 DE [subseq from] PL=00 UP=0 BIOMES=0000000011000
#=GS MGYP000277099866/68-112 DE [subseq from] PL=00 UP=0 BIOMES=0000000011000
#=GS MGYP000491155519/1-26 DE [subseq from] PL=01 UP=0 BIOMES=0000000010100
#=GS MGYP000491155519/50-94 DE [subseq from] PL=01 UP=0 BIOMES=0000000010100
#=GS MGYP000324022611/34-67 DE [subseq from] PL=00 UP=0 BIOMES=0000000011000
#=GS MGYP001127532122/81-114 DE [subseq from] PL=01 UP=0 BIOMES=0000000011000
#=GS MGYP000641555956/33-84 DE [subseq from] PL=11 UP=0 BIOMES=0110000000000
#=GS MGYP000641555956/156-192 DE [subseq from] PL=11 UP=0 BIOMES=0110000000000
#=GS MGYP000511767851/15-64 DE [subseq from] PL=01 UP=0 BIOMES=1000000000000
#=GS MGYP000511767851/114-155 DE [subseq from] PL=01 UP=0 BIOMES=1000000000000
#=GS MGYP000125367204/57-106 DE [subseq from] PL=00 UP=0 BIOMES=0110000000000
#=GS MGYP000170712716/57-106 DE [subseq from] PL=00 UP=0 BIOMES=0110000000000
#=GS MGYP001151811203/323-343 DE [subseq from] PL=11 UP=0 BIOMES=1000000000000
#=GS MGYP001151811203/394-448 DE [subseq from] PL=11 UP=0 BIOMES=1000000000000
#=GS MGYP000251977013/12-60 DE [subseq from] PL=00 UP=0 BIOMES=0110000000000
chain_sp_Q8IWZ8_SUGP1_HUMAN_SURP_and_G-patch_domain-containing_protein_1_OS_Homo_sapiens_OX_9606_GN_SUGP1_PE_1-SV_2_188_242 TRKVIEKLARFVAEGGPE-LEKVAMEDYKDNPAFAFLHDKNSREFLYYRKKVAEIR
MGYP000082224325/20-68 ------EIGPFVAEAGPE-LEKVITEDYKDNLAFPFLHGKNSRQFLYHRKEVAEIR
#=GR MGYP000082224325/20-68 PP ......5678********.***********************************98
MGYP000082224325/91-127 --NLAEKLAEFIARGDPE-METIALQKNRENQAFSFLYEP----------------
#=GR MGYP000082224325/91-127 PP ..678*************.******************986................
MGYP000277099866/2-44 ------------AEAGPE-LEKVITEDYKDNLAFPFLHGKNSRQFLSPRKEGAELR
#=GR MGYP000277099866/2-44 PP ............89****.**********************************997
MGYP000277099866/68-112 ---LAGKLARFVVDWHPE-VETIALQNNCENQAFSFLYETNSQGYKYYR-------
#=GR MGYP000277099866/68-112 PP ...5679***********.*****************************8.......
MGYP000491155519/1-26 ------------------------------NPAFAFLQDKNSREFLYYRKKVAEIR
#=GR MGYP000491155519/1-26 PP ..............................8***********************98
MGYP000491155519/50-94 ---LAGKLARFVVDWHPE-VETIALQNNCENQAFSFLYETNSQGYKYYR-------
#=GR MGYP000491155519/50-94 PP ...5679***********.*****************************8.......
MGYP000324022611/34-67 TRKVIEKLARFVAEGGPE-LEKVAMEDYKDNPAFA---------------------
#=GR MGYP000324022611/34-67 PP 8*****************.***************7.....................
MGYP001127532122/81-114 TRKVIEKLARFVAEGGPE-LEKVAMEDYKDNPAFA---------------------
#=GR MGYP001127532122/81-114 PP 8*****************.***************7.....................
MGYP000641555956/33-84 -RAIVDKTAQFVAKNGPE-FENRILSSEKNNQKFSFLMEKDPY-HAYYRGKIESI-
#=GR MGYP000641555956/33-84 PP .789**************.*********************976.689**999876.
MGYP000641555956/156-192 ---VIKLSAQFVARNGAKFLSGLASREYQ-NPEFAFLKPAH---------------
#=GR MGYP000641555956/156-192 PP ...788889****9887637889999996.9*****97654...............
MGYP000511767851/15-64 --NLIDKTAKSVAQKGPE-LEELVKKNFADNPKFSFLDF-GDPYRPYYDQKVEE--
#=GR MGYP000511767851/15-64 PP ..689*************.*****************954.55556788888877..
MGYP000511767851/114-155 ---VMQHTAQFVAKNGQR-FLVGLTEREKHNPLFDFLKPTHSL-FPY---------
#=GR MGYP000511767851/114-155 PP ...77889*********9.998889999*********887663.545.........
MGYP000125367204/57-106 -RAIVDKTAQFVAKNGPE-FESRILSSEKNNQKFSFLRE-NSPFYSYYRGKIE---
#=GR MGYP000125367204/57-106 PP .789**************.******************86.8999*****9985...
MGYP000170712716/57-106 -RAIVDKTAQFVAKNGPE-FETRILSSEKNNQKFSFLRE-NSPFYSYYRGKIE---
#=GR MGYP000170712716/57-106 PP .789**************.******************86.8999*****9985...
MGYP001151811203/323-343 -----------------------------------YLFDETSQEFKYYQHKLAELR
#=GR MGYP001151811203/323-343 PP ...................................899****************97
MGYP001151811203/394-448 TRETAENLARFMIQLGSD-IEDFNMDSLTNNPDFWFLTKKDSPAHKFYQMKLVEVR
#=GR MGYP001151811203/394-448 PP 79999*************.**********************************997
MGYP000251977013/12-60 -KNIIDKLAQFVARNGPE-FEHMTKQKQKDNPKFSFLFG--GEYFNYYQYKVT---
#=GR MGYP000251977013/12-60 PP .579**************.******************86..567889988875...
#=GC PP_cons 88889999**********.******************9898989999*99998987
#=GC RF xxxxxxxxxxxxxxxxxx.xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
//
# STOCKHOLM 1.0
#=GF ID chain_sp_Q8IWZ8_SUGP1_HUMAN_SURP_and_G-patch_domain-containing_protein_1_OS_Homo_sapiens_OX_9606_GN_SUGP1_PE_1-SV_2_188_242-i1
#=GF AU jackhmmer (HMMER 3.3.2)
#=GS sp|Q8IWZ8|SUGP1_HUMAN/188-242 DE [subseq from] SURP and G-patch domain-containing protein 1 OS=Homo sapiens OX=9606 GN=SUGP1 PE=1 SV=2
#=GS sp|Q8IWZ8|SUGP1_HUMAN/265-317 DE [subseq from] SURP and G-patch domain-containing protein 1 OS=Homo sapiens OX=9606 GN=SUGP1 PE=1 SV=2
#=GS sp|Q8CH02|SUGP1_MOUSE/184-238 DE [subseq from] SURP and G-patch domain-containing protein 1 OS=Mus musculus OX=10090 GN=Sugp1 PE=1 SV=1
#=GS sp|Q8CH02|SUGP1_MOUSE/261-313 DE [subseq from] SURP and G-patch domain-containing protein 1 OS=Mus musculus OX=10090 GN=Sugp1 PE=1 SV=1
#=GS sp|Q68FU8|SUGP1_RAT/185-239 DE [subseq from] SURP and G-patch domain-containing protein 1 OS=Rattus norvegicus OX=10116 GN=Sugp1 PE=1 SV=1
#=GS sp|Q68FU8|SUGP1_RAT/262-314 DE [subseq from] SURP and G-patch domain-containing protein 1 OS=Rattus norvegicus OX=10116 GN=Sugp1 PE=1 SV=1
#=GS sp|Q8IX01|SUGP2_HUMAN/590-637 DE [subseq from] SURP and G-patch domain-containing protein 2 OS=Homo sapiens OX=9606 GN=SUGP2 PE=1 SV=2
#=GS sp|Q8IX01|SUGP2_HUMAN/786-837 DE [subseq from] SURP and G-patch domain-containing protein 2 OS=Homo sapiens OX=9606 GN=SUGP2 PE=1 SV=2
#=GS sp|Q8CH09|SUGP2_MOUSE/595-619 DE [subseq from] SURP and G-patch domain-containing protein 2 OS=Mus musculus OX=10090 GN=Sugp2 PE=1 SV=2
#=GS sp|Q8CH09|SUGP2_MOUSE/769-820 DE [subseq from] SURP and G-patch domain-containing protein 2 OS=Mus musculus OX=10090 GN=Sugp2 PE=1 SV=2
#=GS sp|Q8CGZ0|CHERP_MOUSE/13-61 DE [subseq from] Calcium homeostasis endoplasmic reticulum protein OS=Mus musculus OX=10090 GN=Cherp PE=1 SV=1
#=GS sp|Q8IWX8|CHERP_HUMAN/13-61 DE [subseq from] Calcium homeostasis endoplasmic reticulum protein OS=Homo sapiens OX=9606 GN=CHERP PE=1 SV=3
#=GS sp|A2VDN6|SF3A1_BOVIN/50-101 DE [subseq from] Splicing factor 3A subunit 1 OS=Bos taurus OX=9913 GN=SF3A1 PE=2 SV=1
#=GS sp|Q8K4Z5|SF3A1_MOUSE/50-101 DE [subseq from] Splicing factor 3A subunit 1 OS=Mus musculus OX=10090 GN=Sf3a1 PE=1 SV=1
#=GS sp|Q15459|SF3A1_HUMAN/50-101 DE [subseq from] Splicing factor 3A subunit 1 OS=Homo sapiens OX=9606 GN=SF3A1 PE=1 SV=1
chain_sp_Q8IWZ8_SUGP1_HUMAN_SURP_and_G-patch_domain-containing_protein_1_OS_Homo_sapiens_OX_9606_GN_SUGP1_PE_1-SV_2_188_242 TRKVIEKLARFVAEGGPELEKVAMEDYKDNPAFAFLHDKNSREFLYYRKKVAEIR
sp|Q8IWZ8|SUGP1_HUMAN/188-242 TRKVIEKLARFVAEGGPELEKVAMEDYKDNPAFAFLHDKNSREFLYYRKKVAEIR
#=GR sp|Q8IWZ8|SUGP1_HUMAN/188-242 PP 8****************************************************98
sp|Q8IWZ8|SUGP1_HUMAN/265-317 --NLAEKLARFIADGGPEVETIALQNNRENQAFSFLYEPNSQGYKYYRQKLEEFR
#=GR sp|Q8IWZ8|SUGP1_HUMAN/265-317 PP ..678*********************************************99976
sp|Q8CH02|SUGP1_MOUSE/184-238 TRRVIEKLARFVAEGGPELEKVAMEDYKDNPAFTFLHDKNSREFLYYRRKVAEIR
#=GR sp|Q8CH02|SUGP1_MOUSE/184-238 PP 89***************************************************98
sp|Q8CH02|SUGP1_MOUSE/261-313 --NLAEKLARFIADGGPEVETIALQNNRENQAFSFLYDPNSQGYRYYRQKLDEFR
#=GR sp|Q8CH02|SUGP1_MOUSE/261-313 PP ..678*********************************************99876
sp|Q68FU8|SUGP1_RAT/185-239 TRRVIEKLARFVAEGGPELEKVAMEDYKDNPAFTFLHDKNSREFLYYRKKVAEIR
#=GR sp|Q68FU8|SUGP1_RAT/185-239 PP 89***************************************************98
sp|Q68FU8|SUGP1_RAT/262-314 --NLAEKLARFIADGGPEVETIALQNNRENQAFSFLYDPNSQGYRYYKQKLEEFR
#=GR sp|Q68FU8|SUGP1_RAT/262-314 PP ..678*********************************************99976
sp|Q8IX01|SUGP2_HUMAN/590-637 ----IDQLVKRVIEGSLSPKE--RTLLKEDPAYWFLSDENSLEYKYYKLKLAEM-
#=GR sp|Q8IX01|SUGP2_HUMAN/590-637 PP ....67777778887644333..334699***********************97.
sp|Q8IX01|SUGP2_HUMAN/786-837 --ETAEKLARFVAQVGPEIEQFSIENSTDNPDLWFLHDQNSSAFKFYRKKVFEL-
#=GR sp|Q8IX01|SUGP2_HUMAN/786-837 PP ..678**********************************************986.
sp|Q8CH09|SUGP2_MOUSE/595-619 ----------------------------QDPAYWFLSDESSLEYKYYKLKLAE--
#=GR sp|Q8CH09|SUGP2_MOUSE/595-619 PP ............................57*********************98..
sp|Q8CH09|SUGP2_MOUSE/769-820 --ETAEKLARFVAQVGPEIEQFSIENSTDNPDLWFLHDQSSSAFKFYREKVLEL-
#=GR sp|Q8CH09|SUGP2_MOUSE/769-820 PP ..678**********************************************986.
sp|Q8CGZ0|CHERP_MOUSE/13-61 -RNVIDKLAQFVARNGPEFEKMTMEKQKDNPKFSFLFG--GEFYSYYKCKLA---
#=GR sp|Q8CGZ0|CHERP_MOUSE/13-61 PP .88*********************************85..445678877765...
sp|Q8IWX8|CHERP_HUMAN/13-61 -RNVIDKLAQFVARNGPEFEKMTMEKQKDNPKFSFLFG--GEFYSYYKCKLA---
#=GR sp|Q8IWX8|CHERP_HUMAN/13-61 PP .88*********************************85..445678877765...
sp|A2VDN6|SF3A1_BOVIN/50-101 -RNIVDKTASFVARNGPEFEARIRQNEINNPKFNFLNP-NDPYHAYYRHKVSEF-
#=GR sp|A2VDN6|SF3A1_BOVIN/50-101 PP .889****************9999999*********86.677889******996.
sp|Q8K4Z5|SF3A1_MOUSE/50-101 -RNIVDKTASFVARNGPEFEARIRQNEINNPKFNFLNP-NDPYHAYYRHKVSEF-
#=GR sp|Q8K4Z5|SF3A1_MOUSE/50-101 PP .889****************9999999*********86.677889******996.
sp|Q15459|SF3A1_HUMAN/50-101 -RNIVDKTASFVARNGPEFEARIRQNEINNPKFNFLNP-NDPYHAYYRHKVSEF-
#=GR sp|Q15459|SF3A1_HUMAN/50-101 PP .889****************9999999*********86.677889******996.
#=GC PP_cons 89899*************999**9999*********99*999999*****99977
#=GC RF xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
//
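For readers unfamiliar with the alignment file above, here is a minimal sketch of how its Stockholm-style lines could be read in Python. The helper name `parse_stockholm` is hypothetical and is not part of FastFold; it only assumes that sequence lines are `name aligned_sequence`, that `#=GR <name> PP <string>` lines carry per-residue posterior probabilities, and that `//` ends the alignment.

```python
def parse_stockholm(lines):
    """Collect aligned sequences and their #=GR PP annotations by sequence name."""
    seqs, pp = {}, {}
    for raw in lines:
        line = raw.strip()
        if not line or line == "//":
            continue  # blank line or end-of-alignment marker
        if line.startswith("#=GR"):
            parts = line.split()
            # Expected shape: "#=GR", name, "PP", annotation string
            if len(parts) == 4 and parts[2] == "PP":
                pp[parts[1]] = pp.get(parts[1], "") + parts[3]
            continue
        if line.startswith("#"):
            continue  # other annotations (#=GS, #=GC, ...) are skipped in this sketch
        name, aligned = line.split(None, 1)
        seqs[name] = seqs.get(name, "") + aligned.strip()
    return seqs, pp


# Two lines taken from the alignment above (the PP string is shortened here).
sample = [
    "sp|Q8CH09|SUGP2_MOUSE/595-619 ----QDPAYWFLSDESSLEYKYYKLKLAE--",
    "#=GR sp|Q8CH09|SUGP2_MOUSE/595-619 PP ....57*********************98..",
    "//",
]
seqs, pp = parse_stockholm(sample)
name = "sp|Q8CH09|SUGP2_MOUSE/595-619"
ungapped = seqs[name].replace("-", "")
print(name, len(ungapped))
```

The ungapped length matches the residue range in the sequence name (595 to 619 is 25 residues), which is a quick sanity check on a parsed record.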
docs/result_pdb.png (binary image replaced; 383 KB before, 253 KB after)
......@@ -526,7 +526,8 @@ if __name__ == "__main__":
automatically according to the model name from
./data/params""")
parser.add_argument(
-"--relaxation", action="store_false", default=False,
+"--relaxation", action="store_true", default=False,
help="Whether to relax."
)
parser.add_argument(
"--save_outputs", action="store_true", default=False,
......
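The hunk above appears to change the `--relaxation` flag from `action="store_false"` to `action="store_true"`. A minimal sketch of why that matters, using only the standard-library `argparse` module (the `build_parser` helper is hypothetical and exists only for this illustration): with `store_false` and `default=False`, the option is `False` whether or not the flag is passed, so relaxation could never be switched on.

```python
import argparse


def build_parser(action):
    """Build a one-option parser mirroring the --relaxation definition."""
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--relaxation", action=action, default=False,
        help="Whether to relax."
    )
    return parser


old = build_parser("store_false")  # definition before the change
new = build_parser("store_true")   # definition after the change

# Old definition: the flag stores False, which equals the default.
print(old.parse_args([]).relaxation)                # False
print(old.parse_args(["--relaxation"]).relaxation)  # False

# New definition: the flag turns the option on.
print(new.parse_args([]).relaxation)                # False
print(new.parse_args(["--relaxation"]).relaxation)  # True
```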
......@@ -4,8 +4,7 @@
python3 inference.py T1024.fasta /data/pdb_mmcif/mmcif_files \
--output_dir ./ \
---gpus 4 \
---use_precomputed_alignments alignments/ \
+--gpus 1 \
--param_path /data/params/params_model_1.npz \
--uniref90_database_path /data/uniref90/uniref90.fasta \
--mgnify_database_path /data/mgnify/mgy_clusters_2018_12.fa \
......
......@@ -4,7 +4,7 @@
python3 inference.py SUGP1.fasta /data/pdb_mmcif/mmcif_files \
--output_dir ./ \
---gpus 4 \
+--gpus 1 \
--use_precomputed_alignments alignments/ \
--model_preset multimer \
--uniref90_database_path /data/uniref90/uniref90.fasta \
......