Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
OpenFold
Commits
85d39c80
Commit
85d39c80
authored
Dec 30, 2021
by
Gustaf Ahdritz
Browse files
Add more sample data
parent
dfdd722c
Changes
5
Show whitespace changes
Inline
Side-by-side
Showing
5 changed files
with
2568 additions
and
0 deletions
+2568
-0
tests/test_data/alignment/bfd_uniclust_hits.a3m
tests/test_data/alignment/bfd_uniclust_hits.a3m
+2048
-0
tests/test_data/alignment/mgnify_hits.sto
tests/test_data/alignment/mgnify_hits.sto
+183
-0
tests/test_data/alignment/pdb70_hits.hhr
tests/test_data/alignment/pdb70_hits.hhr
+158
-0
tests/test_data/alignment/uniref90_hits.sto
tests/test_data/alignment/uniref90_hits.sto
+177
-0
tests/test_data/short.fasta
tests/test_data/short.fasta
+2
-0
No files found.
tests/test_data/alignment/bfd_uniclust_hits.a3m
0 → 100644
View file @
85d39c80
>query
MAAHKGAEHHHKAAEHHEQAAKHHHAAAEHHEKGEHEQAAHHADTAYAHHKHAEEHAAQAAKHDAEHHAPKPH
>tr|A0A2A6NXF8|A0A2A6NXF8_9BRAD Uncharacterized protein OS=Bradyrhizobium sp. C9 OX=142585 GN=CO675_03465 PE=4 SV=1
MSDHAGVEHHHKAAEHHEHAARHHREAARHHEAGDHHKAAHHAHSAHGHASHAQHHHTEASRHHAEHHGEH--
>tr|A0A1F2V377|A0A1F2V377_9BACT Uncharacterized protein OS=Acidobacteria bacterium RIFCSPLOWO2_12_FULL_60_22 OX=1797188 GN=A3J28_14435 PE=4 SV=1
-MPRTGAEHHEAAAQHHEQAARHHHEAAKQDHSGHHEKAGHYAHLAYAHFKHAEQHAAEAAKTHAKNHTG---
>SRR6202048_823629
MTDHAGVEHHHKAAEHHEQAAKHHREAAKHHEAGDHEKAEHPAPTAPGHASHAEEHHAEASRHHAEHHV----
>ERR1700724_1870475
MADHAGVEHHHKDAEHHEPAAKHHREAAKRHEAGDHEKAAHNAHSVQGHASHAEEHHAEATRHPAEPH-----
>ERR1700758_4094796
MRDHAGVEHHHKAAEHHEHAARHHREAAKHHEAGDHEKAAHHAHTAHGHHQHATHHGAEAAKAHTEHHG----
>ERR1700724_4573945
MADHAGVEHHHKAAEHHEHAAKHHREAAKHHEAGDHGKAGPPAHTGHGQATP---------------------
>SRR5271157_4511021
MSEHKGVEHHHKAAEHHEHAARHHREAAKHHEAGHHEKAAHHAHTAHGHASHAEHHATEAAKAHAEAHG----
>SRR5579863_5645041
EMSKQAVEQHLKSAEHHEQAARHHKEAAKHHQSGNHEKAAHHAHMAHGHHEHAQHHAAEAAKAHAQEHD----
>ERR1700733_9528035
SMAHHGAEHHHKAAEHHEQAAAHHREAAKHHESGDHQKAGHHAHIAHGHTLHAAQHAEEAGKHHADQHG----
>SRR6202030_3868138
-MSKQAAEHHHKAAEHHEHAARHHREAATHHESGNHETAAHHAHTAQGHLNHATHHASESAKQHAEHHGEK--
>ERR1700691_2094390
-MSKQTAEHHHKAAEHHEHAARHHREAAKHHETGNFETAAHHAHSAQGHLHHATHHSAEAAKAHVDHHGHK--
>ERR1700730_18364367
-MSKQAAEHHHKAAEHHEHRARHHKEDAKHHEAGKLETASRHARLAKGHHEHAIHHAAEAVKPHLEHYGKT--
>ERR1700683_378504
LMSKQAAEHHHKAAEHHDTAARHHREAAAHHEAGDHYQAAHHAHTAQGHLHHATHHSEEAAKLHVEHHGHKT-
>ERR1700735_4440382
LMSKQAAEHHHKAAEHHDHAARHHREAAAHHEADNHETAAHHAHTAQGHSHHATHHATEAAKHHVEHHGEKA-
>SRR5271166_1724810
HMSKEAAEHHHKAAEHHEHAAKHHKAAAAAHEAGNHEKAGHHAHVAEGHLNHATHHAEEASKLHATEHGHKX-
>ERR1700689_5314695
LMSKQAAEHHHKAAEHHDHAARHHRETRGHHEASEQ-------------------------------------
>SRR5215468_10051876
TMSKHLAEAHHQAAEHHEHAARHHREAAKHHEAGDHETAAHHAHTAQGHLHHATHHSTEASKQHAEHHGGTA-
>SRR6202795_3681341
PMSKKAAEHHLQAAEHHEHAARHHREAAKHHEAGDHESAAHHAHTAQGHLHHATHHSAEAAKMHVEDHGEKR-
>ERR1700681_451985
PMSKQAAEHHTKADDHHENAARHHREAARHHEADDHESAAHHAHTAQGHLHHATHHAAEAAKSHAEHHGNKT-
>SRR5580698_71909
EMSKQAAEHYHKAAEHHEKAALHHRHAAKHHEADDHKSAAHHAHTAQGHLHHAAHHATEAAKLHV--------
>SRR6516225_6087423
YMSKEAAEHHREAAQHHEQAAKHHHEAAKHHEAANHQEAAHHAHSAQGHLHHATHHAAEAAKLHAEHHGHKA-
>SRR5215469_385579
PMSKEAAEHHGKAADHHEHAAHHHREAAKHHESGNWETAAHHAHTAQGHLHHATHHASEAAKLHAEQHGSKT-
>SRR6202008_2710750
QMSKQAAEPHGKAAEHHEHAARHHREAAKPHESGNYETAAHHAPSAQGHLHHATHHAIEAAKSPLEHHGSKS-
>ERR1700685_3697209
TMSREAAEHHRLAAEHHDHAARHHREAAKHHEDGDHHSAAHHAHTAQGHTHHSSHHAAEAAKAHAAEHGHKS-
>ERR1022692_1727563
SMSENHIDHHHKAAEHHEHAAKHHHAAAEHHANEHAASEHMSAPX----------------------------
>ERR1022692_874132
SMSENHIEHHHKAAEHHEHTAKHHHAAAEHHQNGDHEKASHHAHAAHGHALHAEHHANEAAKHHANEHAAS--
>SRR5882757_333284
LMSKHAVEHHHKAAEHHEHAAKHHREAAKHHDSGDHEKAAHHAHTAHGHASHAEEHHHEASRHHAEHHGAH--
>SRR5580693_5755422
SMSENHIDHHHKAAEHHEHAAKHHRAAAEHHQNGNHEKGAHHAHAAHGHSLHADHHATEAAKHQANEHGHH--
>tr|A0A1Q3KM49|A0A1Q3KM49_9PROT Uncharacterized protein OS=Alphaproteobacteria bacterium 65-37 GN=BGN99_28215 PE=4 SV=1
MPKDKIIEHHRSAADHHEKAAQHHREAAKHHESDSHEKAAHHAHSAHGHSAHATHHAGEASKHHAEHHGGH--
>SRR5450755_4508362
NMSDHAVEHHHKAAEHHEHAAKHHREAAKHHETGDHEKEAHHAQVAHGHGLHADHHASEAAKQHANEHGDA--
>SRR5471032_1874884
GERNMSVEHPHKAAKHHEHAAKHPREATKHHEAGDHEKAAHHAHTAHGHASHAEEHHAEASRHHAVHHGAH--
>SRR5215472_4418814
RPKMTPHEHHHKAAEHHEHAARHHREAAKHYEAGNHEKAAHHAHLAHGHHLHALHHGQEAAKGHV--------
>SRR5271165_4906151
HMSKNATEHHRKAAEHHEHAAKHHHAAAEHHEAGNHEKAGHHAHVAEGHLNHATHHSEEASKHHANQHAHS--
>ERR1700680_4602609
FTINKGAEYHKKAAEHHELAAKHHREAAKHHEAGSHEKAAHHSEIAAGHGLTAVHHTEEATKHHPEEHTEK--
>ERR1019366_9606641
MATHKGTEHHKKAAEHHELAAKHHREAAKLHEAGSHEKAAHHAQIAAGHGLHAVYHTEEATKHHADEHTGK--
>SRR5271165_1617824
MATHKGAEHHKKAAEHHELAAKHHREAAKHHEAGSHEKAAHHSEIAAGHTLQAVHHTEEAVKAHLDEHGKK--
>SRR5580693_8743019
FTINKGAEYHKKAAEHHELAAKHHREAAKHHEAGSHEKAAHHSHSAHGHASQAEHHHAQASRH----------
>ERR1035438_5033680
MATHKGAEHHKKAAEHHDLAAKHHQEAAKHHEAGSHEKAAHSSEVATGHGLHAVYHTEEAIKHHADEHTGK--
>SRR5581483_8067321
MNDHEIHEHHEHAADHHEHAAKHHREAAQHHKAGDHEKAAHHSKIAHGHHLHAVEHHEHAAKKHADEHE----
>ERR1700737_2020172
IMSKQAAEHHKKAAEHFEHAARHHKEAAKHHDAGAHEKAAHHAHVAHGHHLHARHYAEEAAKSHVEHHGKKX-
>SRR5215831_18626949
DMSKEAAEHHRKAAEHLEHAAHHHKEAASHHEAGAHEKAAHHAHVAHGHHLHADHHAEEAAKTHVEHHGKK--
>ERR1022692_4502959
MATNQAAEHHHKAAEHHEHAARHHKEAAKHHEAGNHEKAAHHAHLAHGHTHHATHHAAEAAKAHVEHHGKKPX
>ERR1022692_1596713
MATNKAAEHHHKAAEHHEHAARHHKEAAKHHEAGNHEKAAHHAHLAHAHHLHVTHHSTEATKAHAQDHGSKX-
>ERR1700726_249602
RMSKQAAEHHNKAAQHHEQAAEHHREAAWYHEDGDHEAAAHHAHTAQGHLHHATHHAAEAAKLHVEHHGHKV-
>SRR5580693_6803329
QMSKQAAEHHNQAAEHHDHAARHHREAARHHEAGDHEAAAHHAHTAQGHQHHATHHATEAAKLHVEHYGQKV-
>ERR1700690_193599
QMSKEISDHHHSAAKHHESAAHHHKEAAKHHEAGNHEKAAHHAHTAHGHMTHATHHAAEAAKLHVEHHGSHK-
>ERR1700683_4574945
SMSKQAAEHHHRAAEHHEHAARHHREAAKHHEAADRLSAAHHAHTAHGHLQHATHHASEAAKSHVEHHGHKV-
>SRR3974390_98844
NMSKQAAEHHHKAAEHHEHAARHHREAAKHHEAGDHHLAAHHAHTANGHHHHAMHHSAEAAKAHAQEHGGAS-
>SRR5208282_6358198
FMSKHAVEHHHKAAEHHDHAAHHHREAARHHEAGEHHLAAHHAHLASGHHHHAMHHSAEAAKAHVEHHGESA-
>ERR1700678_2201630
QMSKKAVQHHTSAAEHHEHAARHHREASKHHEAGDHESAAHHAHTATGHLHQATQHGAEAAKAHAEEHGNKK-
>SRR6516225_12260139
TMSIQAAEHHNKAAEHHDHAARHHREAAKHYQAGDHHLAAHHAQTASGHHQHAMHHANEAAKAHA--------
>SRR5262245_37247554
ASKHNEAEHHIKAAEHHEQAARHHREAAKHHEAGAHDKSAHHAHIAYGHTTHARQHAQEAGKAHADEHGHHA-
>ERR1017187_10159119
CMSKQAIEHHRKAAEHHEHAARHHKEAAKHHEAGKHETAAHHAHLARGHHEHAMHHAAEAAKAHVEDHGGQ--
>ERR1039458_8411690
CMSKQAAEHHRKAAEHHEHAARHHKEAAKHHEAGRRGARRV--------------------------------
>SRR5580704_692809
SMSKEAVEHHKKAAEHHEHAAKHHHAAAEQHEAGNHEAAAHHAHVAHGHHSHATHHAGEASKHHAEAHSX---
>ERR1019366_8910956
CMSKQAAEHHRKAAEHHEHAARHHKEAAKHHEAGKHVTAAHHAHLARAHHDVATHHAVEAAKAHLEEHGKA--
>ERR1035438_7570652
KMSKKAAEHHRKAAEHHEHAARHHKEAAKHHDAGAHETAAHHAHTAHAHHEVATHHAVEAAKSHLEDHGKA--
>ERR1039457_5679623
-MSKKAAEHHLKAAEHHEHAARHHKEAAKHHQAGSHEKAAHHAHIARAHHEHADEHAIEAAKAHAEEHGNK--
>ERR1700704_2262512
-MSKKAAAHHKKSAEHHEHAARHHKEAAKHHDAGAHEKAAHHAHLAHGHSHEAMDEEAEAAKSHREEHGSK--
>SRR3974377_1513360
-KSHPAHAHHVKAAEHHEHAAKHHKEAAGHYAAGHHETAAHHAHSAHAHMLHATHHAGEAAKAHVAHHSQK--
>ERR1700753_3973459
-MSKTAADHHKKASEHHQHAARHHAEAAKHHEAGNHEKAAHHAHAAHGHTSHAREYGERASRAHSKEHGTK--
>SRR5262249_40564937
MTAHQGAEHHRAAAEHHAKAAHHHREAAKHHDDEDHTQAAHHAHSAHGHASHAAHHASEASKHHAEHHGDL--
>SRR5215831_3075836
MSEHQGAEHHRSAAEYHEKAAHHHREAAKYHEDGEHMQAAHHAHSAHGHSMHAAHHASEASKHHAEHHDDA--
>ERR1700681_3605376
MTAHKGADHHRSAAEHLENAAHHHREAAKHHDEGDHRQAAHHAHTAHGRATHAAHHSSEASKHHAERHGDI--
>ERR1700735_830787
LMSKHAVDHHHKAAEHHEHAARHHKEAAKHHEDGKHETAAHHAHLAHGHHEHATHHAIEAAKAHVEHHGX---
>SRR6202046_837085
LMSKHAVDHHHKAAEHHEHAARHHKEAAKHHEAGKNETAAHHSALARAHQHHASRHSEDW-------------
>SRR5450432_1682947
YMPHQAAEHHHKAAEHHEYAARHHKEAARHHEAGKHETAEHHVHLANGHQQDAIHHAAEAVKVQIERP-----
>SRR5579863_9408980
PMSHEAADHHHKAAEHHEHAARHHRDAAQRHKEGHHEGAAHHAHLAHAHHVHAVEHAEQAAKHHIEAHGS---
>SRR5579863_4526352
-MSKKAAEHHKKASEHHSQAARHHGEAAKHHEAGNHEKAAHHAHTASGHAAHARTHSEEAGKAHLEEHGKK--
>SRR5271165_3194793
-MSKKAADHHTKASEHHAEAAKHHSEAAKHHGAGHHEKAAHHAHTASGDASHARTHAEEAGKAHAEEHGKK--
>SRR5262249_11842710
-MSIKASEHHKKASEHHSRAAHHHEEAAKHHAAGHHEKAAHHAHSASGHATHARTHAEEAMKSHVEEHSKK--
>SRR5580658_11071561
-MSKKAAEHHRKAAEHHAQAAKHHDSAADSHEAGNHEKAAHHAQTARGHHKQAEEHSDEATKAHSSEHGHK--
>ERR1700761_2911494
-MSTKAAQHHKNPADHHTQAASHHTEAAKHHESGNHEKAAHHAHTASGHAHHATHHGEEAGKAHMEEHGKK--
>tr|I0IMJ9|I0IMJ9_LEPFC Uncharacterized protein OS=Leptospirillum ferrooxidans (strain C2-3) GN=LFE_0783 PE=4 SV=1
-SKMKPQEHHKEAAQHHEEAAKHHKEASKMYEAGDHKTAAHHAHSATGHASSAEEHQNEASRKHASLFGDK--
>SRR5215510_6524027
EMSKQAAEHHTKAADHHEHAARHHREAAKHHEAGNHEKAAHHAHVAHGHHLQALHHHEEAQSSISSITARS--
>SRR5580704_15507109
PMSQQSAEHHTKAAEHHEHAARHHREAAKHQTSGSHEKAGHHAHVAHGHHLHAIHHSEEAAKHHAEEHGSK--
>SRR5499433_4327306
PMSTKAAEHHEQAAAHHEHAARHHKEAAKHHKAGDHEKAAHHAHVAHGHHLQAIHHHEEATKFHLEHHGKK--
>SRR6516164_3912753
TMSKKAAEHHTKAAEHHEHAAKHHREAAKHHAAGHHEKAAHHAHVAHGHAHHASHHSTEAAKGHVEEHGHK--
>SRR5215471_1111430
PMSQKSAEHHTKAAEHHEHAARHHKEAAKHYAAGSHEKAAHHAHLARGHDLHADQHAEEAAKHHVEEHGSK--
>SRR5215467_15503710
PMSKQAAEHHTKAAEHHEHAARHHREAAQHHEDGDHETAAHHAHTAQGHLHHATHHAEEAAKQHVEHYGSK--
>SRR5580700_9320380
LPMNQPTEHHTKAAEHHEHAARHHKEAAKHQASGNHEKVAHHAHTAHGHHLQAAHHAEEAAKQHAVEHGSK--
>SRR6516225_9609087
SMPQKLKEHHTKAAEHHEHAAKHHRKAAEHHGAGKHELAAHHAHAAHGHHLHATHHASEAAKRHVELHGNK--
>SRR5580692_12958183
GMAKQIAEHHTKAAEHHEHAAKHHREAAKHHESGNAETAAHHAHLAHGHTQFANHHAGEAAKAHIADHSKT--
>SRR5580658_3666240
YMSHEAAHHHTKAAEHHEHAARHHHHAAKHHADGAHPDAAHHAPLAHGHHIHAAEHAEHAVKHHIEAHGEK--
>ERR1700679_1520425
LMSKQTAEHHTKAAEHHEHAARHHKEAAKHHEAGKVETAAHHAHLAHGHHQYASHHAGEAAKAHIEDSDKS--
>SRR5579863_67246
SMSKESAEHHSKAAEHHEHAARHHRAAAEHHEAGNHEKAGHHAHVAAGHHHQATHHAEEASKHHATAHGHH--
>SRR5262249_23440399
KCQSKQQNITLKLPNITSNAARHHKEAAKHHEAGNHEKAAHHAHVAHGHHLQAIHHHEEATKFHLEHHGKK--
>ERR1039458_5673656
DMSKAAAAHHLKAVEHHEHAARHHREAAKHHEAGNHEKAAHHAHLAHGHHLHATEYAGEAAKAH---------
>ERR1035437_7167454
DMSKQAADHHKQAAEHHEHAARHHQEAATQYEAGNHEKAAHHAHLAQGHHVHATEHAEHAAREHVEAHGAK--
>ERR1039458_9997194
SMSKEAPHHHTQAAEHHEHAAHHHHEAAKHHLEGNHEAAAHHAHLAHGHHIHAAEHAEHAAKQHIEAHGQK--
>SRR5262245_65951219
MAKHKGAEHLERAAEHHELAAHHHREAAKHYEAGNPEKAGNHEHIEHGDHLCVTYKAEGAGCTQRHDX-----
>SRR5215470_17505878
MANHKGAEHHENAAEHHQLAAQHHREAAKHYESGNHEKAGHHAHIAHGHHVHATYHAEEASKSHATEHGGQ--
>SRR6266540_5162723
MAKHKGAEHHERAAEHHQLAAHHHREAAKHYEAGKPEKAGHHAHIAHGHHLHATYHAEEAGKRHATEYGGQ--
>SRR5215475_12112351
MAKHKGAEHHKRATEHHELSARHHREAAKHYEASDPEKAGHHAHIAHGHHLHATYHAEEAGKHHATEHSSQ--
>ERR1700694_1609601
MATHKGADHHRKAAEHHEHAAKHHHEAAKHHESGNHEKAGHHAHIAHGHTQHAAHHATEAAKHHSDEHGGT--
>SRR5579862_1684974
PMSKERAEHHRKAAEHHGHAAKHHLAAAEHHEAGNHEKAGHHAHVAHGHQLHAVHHAEEAGKHHANEHTHQ--
>SRR6266511_5188420
MAKHKGAEHHERAAEPHHPSAPPPREA----------------------------------------------
>tr|A0A2W5ZIQ4|A0A2W5ZIQ4_9BACT Uncharacterized protein OS=candidate division AD3 bacterium OX=2052315 GN=DLM66_00475 PE=4 SV=1
-MSKKAAEHHGQAADHHEKAAQHHRQAKTHHEAGDHQAAAHDAHTARGHHEHAAHHASEAAKAHAEEHGHK--
>tr|A0A1B9C1C9|A0A1B9C1C9_9PROT Uncharacterized protein OS=Acidithiobacillus ferrivorans OX=160808 GN=BBC27_06515 PE=4 SV=1
-SEMKLHEHHKEAAEHHEEAAKHHKEASKLYESGDHKGAAHHAHSSAGHSDYAREHESVASKKHAAMFGDK--
>tr|A0A2H9SEK4|A0A2H9SEK4_9GAMM Uncharacterized protein OS=Legionella sp. OX=459 GN=CK424_06600 PE=4 SV=1
-DKKKLHKHHLKAAEHHKKAAEHHSEAAKHHEAGEHEKGQASAYLALAHGRHAKDESCEACSHYAGIEVER--
>tr|A0A0C1UQR9|A0A0C1UQR9_9BACT Uncharacterized protein OS=Methylacidiphilum kamchatkense Kam1 OX=1202785 GN=A946_08515 PE=4 SV=1
-MADTVAEEHEKAAMHHEHAAVHYRKAAEHHRAGEHADSGHHAHIAHGHAKHAQAHAEAAAKEEANMHDKK--
>tr|A0A2W6AI54|A0A2W6AI54_9BACT Uncharacterized protein OS=candidate division AD3 bacterium OX=2052315 GN=DLM67_06925 PE=4 SV=1
-MSKEAAQHHQQAAEHHEHAGRHHREAAKAHEAGDHAKAAHHAHTARGHHEHASHHAAEAAKSHVEHHGHK--
>ERR1700726_61598
MS-KEISEHHHSAAKHHESAAYHHKEAAKHHEAGDHEKAAHHAHTAHGHASHAEHHHVEASRHHAEHHGQH--
>SRR5271165_3086472
MAQHKGADHHKQAAAHHRHAATHHEEAAKHHEAGDHEKAAHHAHAAHGHHLNAEHHTHEAAKHHATEHGGG--
>SRR5579872_1018840
RMSKESAQHHHQAAEHHEHAARHHREAARHHEEGNHEKAAHHAHTAQGHHHQAEHHAREAAKLHTEQHGQA--
>SRR5271157_582607
VMSKKAAEHHHKAAEHHEHAARHHREAAKHHEAGKHETAAHHAHLAHAHHEHAMHHAAEAAKAHLEDHGKA--
>ERR1700682_2320681
LMSMQAADHHHKAAEHHEHAARHHKEAAKHHEAGKHETAGHHAHLAHGHHQHAMHHAAEAAKAHIEHHSKA--
>SRR5262249_24366611
SMSKNATDHHNAAAEHHEMAAEHHRKAAEHHDDGNHEKAAHHAHVAQGHLHHATHHAAEAAKSHLEDHGKH--
>SRR5215469_175376
LMSKKAAEHHHKAAEHHEHAARHHKEAAKYHEAGKHETAAHHAQLANGHQQHAMHHAGEAAKAHIEDHGRA--
>SRR5580692_10590558
FMSKEASEHHQKAAEHHEHAARHHKEASKHHDAGKHETAAHHAQLARAHQHHAAHHSEEADKAHLEDHVKS--
>ERR1035441_3275339
LMSKKAVEHh-HKAAEHHEHAARHHKEAAKHHEAGKHETAAHHAHLARGRLRRCLLLSYIQL--SLPDPD-V--
>SRR5262245_3093639
HMSKKAAEHh-KKASEHLTHAARHHVEAAKHHEAGKHETAAHHAQTATGHAVHARGHAEEAVKAHAEEHGKK--
>ERR1700752_1174679
YMSKKATEHh-RKAAEHHELTARHHREAAKHHEGGRHETAAHHAHLAHGHHTYASHHAGEASKAHVEDHGSS--
>SRR6266705_6478280
RMAKQAAEHh-HKAAEHHEHAARHHKEAAKHYEAGKHETAAHHAHLAHGHLQHATHHAGEAAKAHIQDHGNK--
>SRR5215472_9654556
LMSKKAAGHh-LKAAEHHQLAAQHHREAAKHHQAGKHETAAHHAHLARGQDEHAMHHAAEAAKAHVDDYGKA--
>SRR5215472_7924964
TMSKKAAQHh-HQAAEHHEDAARHHKEEAKHHEAGKHETAAHHAHLARGHHEHAMHHAGEAAKAHIEDHGQA--
>ERR1035441_3624924
FMSKQAAEHh-HKAAEHHEHAARHHKEAAKHHEAGKHETAAHHAHLARAHHELATHHAAEAAKVHLEQYGKG--
>ERR1700677_2623774
SMSKQAAEHh-HKAAEHHEHAARHHKEAAKHHEEGRHETAAHHAHLAHGHHQHASHHAAEAAKSHVEHHGSA--
>SRR5271169_5745082
LMSNQAAEHh-HKAAEHHEHAARHHKEAAKHHEAGKPEAAAHHAHLAHGHHQHATHHAPEAAKAHIEDHGKS--
>SRR6202049_3772221
CMSKQAAEHh-HKAAEHHEHAARHHKEAAKHHEAGNHETAA-HAHLARGHHEHAMHHAAEAAKAPRLLGRGA--
>ERR1700690_1934298
PYVKESRRGpSQSRRASRTHAARHHKEAAKHHEAGKHETAAHHAHLARGHHEHAMHDAGEAAKAHVEDHGGQ--
>SRR6201997_5942927
MSDHAGVEHHHKAAEHHEHAARHHREAAKHHEEGNHETEPHHAHTPQGPSPHATHHATEAAKPHVEHHGQK--
>ERR1700683_5385528
PMAHPIAEHHKKAAHHHEHAARHHHEAAKHHEAGDHHKAGHHAHVAHGHHHQAMHHAGEAAKAHAEAHGKX--
>ERR1039458_1052396
DMSKEAAHHHKQAAEHHEHAARHHHEAAKHHEAGNHEKAAHHAHLAHAHHVLAAEHAENAAKEHLKAHGTK--
>ERR1035441_9756897
DMSKEAAHHHKQAAEHLEHAARHHHEAAKHHEAGNHEKAAHYAHLAHGHLVHATEHAENAAKEHVKSEE----
>SRR5271157_2981033
SMSKEAAQHHKQAAEHHEHAARHHKEAAKHHEGGNHEKAAHHAHVAHGHHAHATHHATEAAKAHVEAHGAK--
>ERR1039458_10647682
DMSKEAAHHHKHAAEDRKHAARHHNAA----------------------------------------------
>SRR5271165_3465347
DMSKQAAEHHKKAAEHLEEAAKHHVEAAKHHVEGVFDKAAHHAHSAHAHHVQAVEHAENAAKEHLKAHGTK--
>SRR5215469_13833100
DMSKQAAEHHKQAAEHLEQAAKHHVEAAKRHVEGVVEKAAHEAHLAHAHHVQAI-------------------
>SRR5262249_28378874
VMSEDAAEHHRKAAEHHQHAARHHEQAAHHHEAGAHEKAAHHAHSAQGHSHHANHHAAEAAKAHTEHHGAKX-
>tr|A0A142H9K5|A0A142H9K5_9BACT Uncharacterized protein OS=Hymenobacter sp. PAMC 26554 GN=A0257_23020 PE=4 SV=1
-MSKKAVDSHKKAATHHTEAAKHHTEAAKHHEAGSHEKAAHHAHTAAAHTDHAAEHATHARKSHAEEHGTK--
>tr|A0A1F3RER5|A0A1F3RER5_9BACT Uncharacterized protein OS=Bacteroidetes bacterium RIFCSPLOWO2_12_FULL_31_6 GN=A3K10_03545 PE=4 SV=1
--MKSVIEKHKKAASHLEEAAKCHQEAAKHHEAGSHEKAHHSSVKANGHSTHASELEREIQKHHVIASK----
>SRR5216683_1839118
VMSKQAAEHHKKAA--------------EHHEAGTHEKAAHHAHVAHGHALHARHHAEEAVKSHLEHHGKKX-
>SRR5277367_3271760
MMSKKAAEHHKKASEQMTHAARHHGEAAKHHEGGLHEKAAHHAHTARAHAIHAQEHAENAVKAHADEHGKKX-
>SRR5271166_5766653
HMSKKAAGHHKKASEHLTHAARHHGEAAKHHEAGSHEKAAHHAHLARGHIIHGRGHAEEAVKAHLEEHGKKX-
>SRR5262245_78877
NMSKRAAEHHKKASEHLTHAARHHGEAAKHHDAGHHEKAAHHAHTAHGHAIHARGHAEEAVKVHVEEHGKKX-
>SRR5215468_2014457
HRSKKAADHHKKASEHLTHAARHHGEAAKHHESGNHEKAAHHAHTASGHMIHARGHAEDAVKAHAEEHGKKX-
>SRR6202158_2104302
HMSKKAAEHHKKAAEHHTHAARHHGEAAKHHEGGHHEKAAHHAHTARAHGLHATEHAEEAAKAHGTEHGS---
>SRR5215475_990513
PMSKKAAEHHKKASEHLTHAARHHGEAAKHHDTGNHEKAALHAHTARGHVVHATRHAEEAVMAHTDEHGKK--
>SRR5689334_9332785
VMSKKAAEHHRKASEHHTNAARHHGEAAKHHDVGNHEKAAHHGHTARGHAIEARTHSEDAVKAHTEEHGKKX-
>SRR4029077_25657
QMSKKAAEHHKKVQEHLTHAARHHGEAAKHHESGQHEKAAHHAHVARSHVIHARGYAEEAVKAHHEEHGNKX-
>SRR6476646_3723538
StRSGSMECLGLSDSEHLTHAARHHGEAAKHHEAGSHEKAAHHAHVARGHVIHGRGHAEEAVKAHLEEHGKKX-
>SRR6516164_3211544
NrMSKKAADHHRKAAEYHTHAARHHGEAAKHHETGQHEKAAHHAHLARAHAIHARGHSEEATKAHHEQHGDKQ-
>SRR6202048_2952714
AMSKKAAEHHKQSAEHHTHAARHHGEAAKHHEAGHHEKAAHHAHTARGHALHARHHSDQAAMVHMEEHGKNK-
>SRR6202011_3428404
AMSKKAAEHHKQSQEHHTNAARHHGEAAKHHASGQHEKAAHHAHTARGHALHARHHSDQAAMAHMEEHGKKK-
>SRR6516225_9485238
NrMSKKAADHHRKAAEYHTHAARHHGEAAKHHETGQHEKAAHHAHTARGHASHATEYAEEAAKLHAEEHGKKX-
>SRR5665213_515099
AMSKKAAEHHRKASEHAAHAARHHGEAAKHHDAGHHEKAAHHAHSATGHASHARGHADEAARAHADEHGKKX-
>SRR5215831_13043785
SMSKKAAEHHKKASDHHTHAARHHGEAAKHHETGHHEKAAHHAHTARAHAIHARGHAEQATVAHSEEHGK---
>ERR1700681_52020
KMSKKAAEHHHKASEHHTHAARHHGEAAKHHEGGHHEKAAHHAHTARAHAIHSRHHSEEAAKMHGEEHGKKX-
>SRR6478672_3437904
AMSKTAADHHRKASEHSTHAAKHHGEAAKHHDSGQHEKAAHHAHTAAGHERQSREHADEAAKAHANEHGKKX-
>SRR5207302_8234716
HMSKKAAEHHRKASEHHTHAARHHGEAAKHHDSGQHEKAAHHAHTAAGHAVHARQHADESRKAHTEEHGKKX-
>SRR6202049_3861440
PMSKKAAEHHRKASEHLTHAARHHGEAAKHDDAGHHEKAAHHVHTARGHATHARGPAEEAAKAHTEEHGKKX-
>ERR1700693_2890077
PMSKKAAEHHKKASEHLTHAARHHGEAAKHYDTGEHAMGAHHAHTARGHVVHARLHAEETVKAHVEEHGKKX-
>SRR3984893_4017493
AMSKKAAEHHKQESEHLTHAAHHHGEAAQHHEAGNHEKAAHHAHTARAHVIHGRGHAEEAVKAHADEHGKKX-
>SRR6266478_7429653
TMAE---NKPRQADLSARARKSDHGEAAKHHEAGNHEKAAHHAHTARAHIIHGRGHAEEAVKAHAEEHGKK--
>SRR5580765_1108604
SMSKKAAEHHKKAEEHHTQAAHHHGEAAKHHEGGRHEKAGHHAHTARGHSLHARDHSEEAAKAHMEEHGKKX-
>ERR1700681_4628765
HMSNKAAEHHRKALEHLTRAARHHDETAKHYDTGEHAMGGHHAHTARAHMIHARGHAEEAVKAHAEEHGTKE-
>ERR1700722_6390987
-MSKEREEHHLKAAEHHEHAAKHHRAAAEHHAAGDHETAGHHAHVAHGHHTHAEHHADEASKHSANHHAT---
>ERR1700691_1558590
-MSKERQDHHLKAAEHHEHAAKHHRAAAEHHASGNEEKAGHHAHVAHGHHAHATHHAE---------------
>SRR5580692_2709317
-MSKEREDHHLKAAEHHEHAAKHHRHAAEHHAAGDHEKAHHHAHVAHGHHIHAGHHAEEASKHTANHHSA---
>SRR5450755_2590302
-MSKEREEHHLKAAEHHEHAAKHHKMAAEHHAAGDHEKAHHHAHVAQGHKTHAEHHSDEASKHTANHVPT---
>SRR6516164_8547976
VMSKKAAEHHKKASEHHTHAARHHAEAAKHHEAGSHEKAAHHAHTARGHVAHARGYAEEAAKAHVEEHGKKX-
>SRR6476661_1594845
QMSKKAAEHHRKAAEHSSHATHHHNEAAKHHEAGNHEKAAHHAHTARGHGAHVMHHADEAAKAHIEEHGKKX-
>SRR5579883_2368435
SMSKKAAEHHGKAAEHHEQAAKHHKEAQKHHEAGNHEKAAHHAHTARGHHASAEHHGNEAAKAHADDHGKKX-
>SRR5215831_19438088
-MAKNAVEHHEKAAEHHEHAARHHREAASHHESGDHQVAAHHAHVAHAHMLHASEHASEAAKAHAEAHGGQ--
>SRR3974390_1406771
-MATPAVEHHEKAPEHHEPAARHHREAAAHHESGDHEVAAHHAHVAHAHTLHASPHAAEAAKAHADAHGGQ--
>SRR3974377_1527111
-MATHAVEHHEKAAEHHEHAARHHPQAAAHHESGAHETAAQHGPVAPATHLYPLDHAAA--------------
>SRR3974377_2609624
-MATHAVEHHEKAAEHHEHAARHHREAAAHHESGAHEVAAHHTPFAPSHT-----------------------
>SRR5262245_37694928
---HKGSSHHETAAEHHETAAHHHREAAKHYEHFDHEKAGHHAHVAHAHGLHAAHHGHEAAKHHAQSHAEH--
>ERR1700738_4504323
---HKGSSHHETAAEHHEKAAEHHRAAARHYGEDDHHKASHHAHLAHAHGLHATHHGHEAAKHHAEHHDEH--
>SRR6478672_11888828
---HKGGSHHETAAEHHETAAHHHREAAKHYEHGDHEKAGHRPRGACAWTACDPSWarGRETPRGKPR----G--
>SRR5258707_6049855
---HKGGDHHESAAEHHENAAHHHREAAKHYEAGDHEKAGHHAHVAHAHGLHASQHGEEAAKHHAEHHVED--
>SRR3984957_18403883
---HTGSEHHETAAGHHESAAHHHREAAKHYEGGEPEKAGHHAHVAHAHRLHATHHAHEAANHHAERLAGQ--
>SRR6185312_2038455
---AESHVHHAKAAEHHKKAAYHHEEASRHFRDDNPAKGAHHAQLAHGHGLHANEHANNASRRFGQDYAKD--
>SRR5215469_12611957
SMSKEAAEHHRSAAHHYEHAAQHHHEAAKHHEAGDHQAAAHHAHIAQGHQHHATHHATEAAKSHAEHHGQQ--
>ERR1700683_227600
SMSKQAAEHHHSAAEHHEHAARHHREAARHHEEGNHESAAHHAHTAQGHLHHATHHAAEAAKSHTEHHGHK--
>SRR5262249_30748479
MAQDKIVQHHHAAAEHPEHAAKHHREAAKHHEADSHEKAAHHAHSAHGHSEHAAHHAAEASKHHAEQHGDH--
>SRR5471032_1000550
MSKDKIVEHHQTAADHHEHAARHHREAAKHHEADSHEKAAHHAHTAHGHSSHATHHASEASKHHAEHHGQH--
>SRR5215475_7292062
MSKDKIVEHHHAAAEHHEYAAKHHREAAKHHESDHHEKAAHHAHSAHGHSSHAAHHA----------------
>ERR1700740_1508672
MSKDKIVEHHTAAAEHHEHAARHHREAAKHHGADSHEKAAHHAQSAHGHSAHAAHHAAEASKHHAEHHGTH--
>SRR5277367_3781890
FMSKQAAEHHHQAADHHEHAARHHKEAAQLHEAGSHELAAHHAHLAHGHHQHASHHAAEAAKAYIEHHAKA--
>SRR5580692_8293406
IMSKQAAEHHQKAAEHHEHAARHHKEAAMHHEAGKHEMAAHHAHLAQGHHAHATHHAAEAAKSHVEHHGKA--
>SRR5580698_9551526
VMSKVAAEHHHAASEHHEHAARHHKAAAKHHEDGKHELAAHHAHLAHGHHQHASHHAAEAAKAHIEHHKAA--
>ERR1019366_1648353
MPKHEGAEHHKKAAEHHEKAAQHHKEAAKHHEEGRHETAGHHAYVAHGHHLTAIQHSEEAAKYHSQQHGEKK-
>SRR5580658_4588397
MPKHEGAEHHKKAAEHHEHAARHHKEAARHHEEGSHEKGGHHAHIAHGHHLHATHHAEEAAKTHSNQHGKES-
>ERR1700683_1984599
VSKHEddkhqekaaehqekvalhhedkAAEHHEKAAEHTEKAAEHHKEAAKHHEEGHHETAGHHAHIAHGHHLNATYPSEETAKHHAQQHGEKK-
>SRR5580704_7292703
MANHTGASHHHEAADHHEHAAKHHREAAKHHEAGDHVQAGHHAHIAHGHLTHATHHAEEAGKHHATEHGKS--
>ERR1041385_1551557
-MKHKGAEHHNKAAEHHEHAARHHREAAKHHEAGSHEKGGHHAHVAHGHMVQANEHTEEAAKSHMEHHGKK--
>SRR5262249_10445052
-MAHKGAEHHTKAAEHHEHAARHHREAAKHHEAGSHEKGGHHAHMAHGHSTHAHGFADEAAKHHAMEHGGG--
>ERR1700719_3807446
TMSKQAAEHHHQAAEHHEHAARHHREAAKHHEAGDHESAAHHAHSAHGHASHAEHHHHEASRHHAEQHGQHX-
>ERR1700760_623008
TMSKQAAEHHTKAAEHHDNASKHHREAAKHHEAGNHESAAHHAHTAQGHLHQATHHAGEAAKSHADTHGN---
>ERR1022692_2998277
-MSKQAAENHLKAAEHHEHAARHHKEAAKHHQAGNHEKAAHHAHTAHGHEEHADHHAGEAAKAHAQDHGSK--
>ERR1017187_7576438
-MSKQAAEHHLKAAEHHEHAARHHKEAAKHHQAGNHEKAAHHAHTARAHHENAAHHAAEAAKAHLEHHGKA--
>SRR5262249_54984532
-MSEKAAEHHRKAAEHHEHAAKHHYEAARHHDDGAHETAAHHAHSAQGHAIHADHHSGEAAKAHTEHHGSK--
>SRR5580704_771817
MNHHEAAEHHNKAADHHEHAAAHHLKAAEHHVEENHEKAAHHAHIAHGHGLHAAHHAGEATKHHTDAHGGP--
>ERR1039458_7468520
MEHHEAAEHHRKAAEHHEHAAAHHREAAKQHEAGNHEKAAHHAYVAHGHGLHAAHHAGEATKHHSDTHGGP--
>ERR1039457_6746667
MNQKDAAEQHKKAAEHHEHAAAHHREAAEHHANGNHEKAAHHAHIAHGHGLHAAHHAGEATKHHANTHGGS--
>ERR1700722_3522043
MSDHKGADHHNQAAEHHEHAATHHRAAPRHHESGDHEKAAHHAHIAHGHGLHAAHHAGEATKYHADEHGGG--
>ERR1035438_4004146
MSTHTGAEHHEKAAEHHEHAAAHHREAAIHHESGDHEKAAHHAHIAHGHGlhaapharvasrprhhahiahghgLQAAHHAGEAAKHHADEHGGE--
>SRR3981081_3201937
PMSTKAAEHHEHAAAQHEHAARHHKEAAKHHKAGNHEKAAHHAHSARGHHEHAAHHASEAAKSHTEEHGHK--
>ERR1700720_4700009
TMSTQAAEHHEKAAEQHEHAARHHKEAAKHHKAGNHEKAAHHAHTARGHHEQATEHASAAAKSHVEHHGKK--
>SRR5450759_1153254
LMSKKAAEHHRKAAEHHEHAARQHKEAAKHHDAGAHEKAAHHAHIAHAHHLHATHFADEAAKAHAEEHGSK--
>SRR5476649_602780
LMSKEAADHHRKAAEHHEHAARHHKEAAKHHDAGAHEKAAHHAHIAHAHHLHAEQHAGDAAKAHAQAHGTK--
>SRR5260370_9889087
PVSTKAAEHHEHAAAQHEHAARHHKETPKHQKAVRHEKAAQHAHTASGHAEK---------------------
>SRR5215471_19435997
PMSTKAAEHHEHAAEQHAHAARHHKEAAKQHKAGHHEKAAHHAHTACGHHEHATHHATEAAKAHTEEHGHQ--
>tr|A0A2M6XEG2|A0A2M6XEG2_9RHIZ Uncharacterized protein OS=Methylobacterium sp. CG09_land_8_20_14_0_10_71_15 OX=1975532 GN=COT56_21735 PE=4 SV=1
--KHPGADHHHKAAEHHEHAARHHREAAKHHEGGHHEKAAHHAHSAQGHAHYATHHGSEASKHHAEHHGKG--
>tr|A0A1I4D138|A0A1I4D138_9RHIZ Uncharacterized protein OS=Methylocapsa palsarum OX=1612308 GN=SAMN05444581_1317 PE=4 SV=1
--PTKIAEHHTQAAQHHEKAAEHYKEAAKHHETGAVEKGAHHAQVSQGHAVHAEYHADEAAKAHAQHHANK--
>SRR6516162_2577000
LMSKKASEHHKKASEHHSHASRHHEEAAKHHEAGHHEKAAHHAQTAMGHAIHARTHSEEAVKAHAEEHGKK--
>SRR5262249_44780301
LMSKKAAEHHKKAAEHHSHAARHHEEAAKHHAAGHHEKAAHHAHTASGHASHARGHAEEAMKSHAEEHGQK--
>ERR1700686_4403266
PMSKKAAEHHKKAAEHHTHAARHHEEAAKHHEAGQHEKAGHHAHTARGHALHARHHSDEAAKSHMEEHGKK--
>SRR5215471_16139522
AMSKKAAEHHKKASEHHTHAARHHAEAAKHHEGGHHEKAAHHAHTARAHATHARDHSEEAVKAHAEEHGKK--
>SRR2546421_8056338
PMSKKAAEHHKKASEHHTHAARHHDEAAKHHEAGHHEKAAHHAHTARGHASHTRHHSEEAARAHAEDHGKK--
>SRR6516162_7817916
PMSKKAAEHHKKASEHHTHAARHHGEAAKHYEAGQHEKAAHHAHTARAHAIHARGHSEEAAKAHHEDHGNK--
>ERR1700732_5276201
HMYKKAAQHHKQAAEHHTHAARHHGEAAKHHEAGHHEKAAHHAHTAAGHATHSRHHSEEAAKMHTEEHGKK--
>ERR1700721_288514
PMSKKAAQHHKQAAEHHTHAARHHGEAAKHHEAGHHEKAAHHAHLVRGTVLKGRGTLKGGWRATSE-------
>SRR5579872_3850512
-MSKKAGEHHQKAAEHHEHGARHHKGAAKHHQAGSYEKAAHHAHIARAHHEHAHEHAIEAAKAHAQEHGSD--
>SRR5487761_2742555
-MSKQAAEHHLKAAEHHEQAARHHKEAAKHYQAGSYEKAAHHAHTACGHEEHAAFHSGEAAKAHAQEHGN---
>ERR1700730_7170546
-NKHAATEHHLKAAEHHEHAARHHREAGKHHEASNHEKAAHHAHTAQGHMTHAHHHAGEASKHHLAHHGDK--
>ERR1700748_2579388
-MTKEAANHHSKAAEHHENAAKHHREAGKHHEAGDHEAAAHHAHTAQGHTANASHHADEAAKLHTQHHGNK--
>SRR5580698_5335757
-MTKEAANHHNKAEEHHENAARHHREAGKHHEAGDHESAAHHAHTAQGHTQHATHHAGEAAKLHTEHHGKK--
>SRR5258705_12432272
MDATKLAEHHEKTAEHHQKAAEHHRHAAQHHQQQDHEKGAHHAHLAYGHHLHATEHAEQAAKTHAEGQT----
>ERR1035438_1862924
-MHHEAAEHHRKAAEHHEHAAAHHREAAAHYEQGNHEKAAHHAHIAHGHGLQASHHADEASKHHTSSHGGA--
>SRR5580698_7177634
-MSQERIDHHRKAAEHHEHAATHHNAAADHHEAGDHEKAGHHAHIAHGHTTHAAHHAAEASKHHANEHTGE--
>SRR5208283_4889738
-MSKEAADHHRKAAEHHEHAAKHHHAAAHEHEAGNHEKAGHHAHLAHGHHALATHHAEEASKHHVTEHGHH--
>SRR5580704_6446761
-HMSEHADHHRKAAEHHEHAAKHHRAAADHHESGDHEKAGHHAHVAHGHTVHAAHHAEEASKHHANDHGHH--
>SRR5262249_53837718
MTMHKGAGHHRSAAEHHEKAAHHHREAAKHHDEGDHHRAAHHAHAAHGHATHAAHHGGEASKHHAAEHGDP--
>SRR5262249_3839383
MKEHKGAEHHRSAAEHHEKAAHHHHEAAKHHEDGDHKSAAHHAHTAHGHATHAAHHSSEASKHHAETHGDH--
>ERR1700676_1561084
LMSQEAAEHHRKAAEHHEHAARHHEEAAKHHDAGSHEKAAHHAHTAHGHHLHATHHAGEAVKTHADEYGSK--
>SRR5271156_6624548
MADHKIHEHHEKAAEHHEHAAKHHREAAKHHKAGAHEKAAHHSKIAHGHHLHATEHHEHASKKHAGDHGDA--
>SRR5580704_6068697
MHEHEIHEHHEKAAEHHEHAAKHHREAAKHAKAGDHEKAAHPSKVAHGHSLHATEHHEHASKKHADQHSXX--
>ERR1700734_3267748
MPEHDIHEHHEKSAHHHDQAAKHHREAAKHHKAGHHEKAAHHSKVAHGHSLHATDHHHHASKKHAEHHSX---
>SRR5271170_1512638
ENGHDIHQHHEKAADHYEHAAKHHREAAKHHEAGDHEKAAHHSKVAHGHALHAEEHHGHASKMHAEQHGX---
>SRR5579863_3905028
MSGHGIHEHHEKAAEHHEHAAKHHREAAKHHQSGNPEKAAHHSKIAHGHALHATEHHAHASKMHAEHHGX---
>SRR6202050_2286552
IMDQDIHKHHEKAAHHHDDAAKHHREAAKHHKSGHHEKAAHHSKVAQGHSLHATDHHHHASKKHAEHHGX---
>SRR5208282_1032254
MNSHEIHEHHEQAAHHHEEAAKHHREAAKHHEAGHHEKAAHHSKVAHGHSLHATEHHEHASKKHAEQHSX---
>ERR1700745_4273030
LHDNEIHEHHEEAAHHHEQAAEHHREAAKHQKDGDHDKAAHHSKVAHGHHLYATEHHDEAAKLHAEAHGDD--
>SRR5271154_267655
MNSHEIHDHHEIAADHHDHAAKHHREAAKHAKAGDHEKAAHHSKVAHGHSLHATEHHDHASKKHAEQHGXX--
>ERR1700733_2112371
-LRRAAKCRLELAADHHEHAAKHHREAAKHAKSGDHEKAAHHSKVAHGHSLHATEHHEHASKKHADQHSX---
>ERR1700691_6755303
MDEHDIHEHHEKAAEHHEHAAKHHREAAKHAEAGDHEKSAHHSKGARGHSLHPNAHHNEAPKKPAVQHGX---
>ERR1035437_7181262
SMSKEAALHHTQAAEQHDLAARHHREAAKHHIAGNHEKAAHHAHLAHGHHVLATEHAENAAKEHVKAHGTK--
>ERR1017187_4718788
AMSKEAAHHHTQAAEHHENAARHHREAAKQHLAGNHEKAAHHAHLAHGHHFLATEHAENAAKEHVKAYGAK--
>ERR1035437_5215839
SMSKEAAHLHTQAAEHHDHAARHHREAAKHYLAGNHA------------------------------------
>SRR5208337_5201425
YMSHEAAEHHTKAAEHHEHAARHHHAAAKAHSEGNHEKAAHHAHLAHGHHAHAAEHAEHAAKAHIEAHGEK--
>SRR5438132_4014951
HTEHPATEHHRKAAAHHEEAAKHHRAAAQAHSQGDHEKAAHHAHLAFGHHVHAAHHMQEAAKKHTEHTSAV--
>SRR6202021_2491305
TMSKEAAHHHTQAAEHHEHAARHNHEAAKHHQDGDHEAGAHHAHLAHGHHIQATEHAEHAAKHHVEAHGEV--
>ERR1700744_918969
TMSKEAAHHHTQAAEHHEHAARHHHEASKHHEAGQHEKAAHHAHLAHAHHVHAADHAEHAAKKHIEAHGAK--
>SRR5476651_2291918
MSKDKIVDHHNAAAEHHEHAAKHHREAATHHEADNHEKAGHHAHSAHGHSSHATHHAGEASKHHAEHHGKH--
>SRR5256885_10433591
MAKDKIIEHHNAAAEHHEHAAKHHREAAKHHEADSHETAAHHAHSAHGHSAHAAHHATEASKHHAEHHGKQ--
>SRR5215470_13748103
MSKAKIVEHHTSAAEHHEQAASHHREAAKHHQADDHEKAGHHAHTAHGHATQAAHHGGEASKHHADMHGKK--
>SRR5262249_5909060
AMSKDAAEHHKHSAEHHTQAAHHHAEAAKHHESGHHEKAAHHAHSANAHALHARHHAEEAAKSHMNEHGKK--
>ERR1700674_4915123
MAKKEHKEHHEAAAEHHESAAEHHREAAKHYEVGHHEKAAPHAHLAHGHGVHATHHAQEAAKHHVEHHDDD--
>SRR5476649_712169
-SHEKKLEHHHKAAEHHDHAARHHREAAEAHHAGNHEKAAHHAHVAHAHHLHAEHHGDEAGKLHAEHHGEA--
>ERR1700677_2502920
-SHEKKIEHHRHAAAHHEHAARHHHAAAEAHTAGQHERAAHHAHIARAHHLHAEHHGDEAGKLHAEHHSHE--
>SRR6516164_10081394
VMSKKAAEHHRKAAEHHTHAAHHHGEAAKHHDSGHHEKAAHHAHTAGGHALHAREHSEEASNAHMEEHGKKX-
>SRR5215813_15420037
AMSKKSAEHHKKASEHHTHAAHHHVEAAKHYEGGDHEKAAHHAHTARGHATHAAHHSEEAVKAHAEEHGKKX-
>SRR6516162_3769719
LPSATPAEPHKNAAQHHTEAARHHGEAAKHHESGQHEKAAHHAHTAGGHATHARHHAEEASRAHVEEHG----
>SRR2546423_2679145
---HKGGSHHELAAEHHETTAHHHREAAKHYGHGDHDKAGHHAHVAHAHGLHATHHGQEAAKHHAEHHEE---
>ERR1700682_6433899
---HKGGSHHETAAEHHENAAHHYREASKHYDSGDHEKAGHHAHPAHAHRLPPTHH-----------------
>SRR5262249_7960664
---HKGGGHHEIAAEHHETAAHHHREAAQHYESGDHETDGHRAHVAHAHGLHATHHGHEAAKHHAEHHKX---
>SRR5262245_44145014
---HKSGSHHEMAAEHHETAAHHHREAAKHHETGDHEKAGHHAHMAHAHELHATHHGHEAAKHHAEHHEE---
>SRR5215469_11644734
---HKGGTHHELAAEHHETAAHHHREAAKHYESGDAEKAGHHAHVAHAHELHATHHGHEAASITPSTISK---
>SRR5215471_6019152
---AKGHDHHASAAEHHEHAAHHHREAARHYEAGDHEKAGHHAHVAHAHELHAIHHGHEAAKHHAEHHEX---
>ERR1700730_10676216
---HKGGSHHEVAAEHHENAAHHHREAAKHYDSGEHEKAGHHAHVAHAHGLHASHHAHEATKEHAEHHAG---
>SRR5271163_3974060
MSKAKIAEHHRKAAEHHEKAAAHHHKAAEHHDDEDHMMAAHHAHVAHGHHHHATHHAAEAGKLHAEHHAD---
>ERR1700691_908072
MKSHELAEHHEKAAHHHAQAAEHHRHAAQHHKGGDTHKATHHAHTAHGHHLHAAHHASEAGKLHAQHHAD---
>ERR1039458_5453327
-MPKEAADHHLKAAEHHEHAARHHKEAAKHHNAGVHEKAAHHAHTAHAHHLHATHFADEAAKASCRER-----
>SRR5882757_11516447
MTNHKGAEHHRSAADHHEKAAQHHRDAARHHDDGDHGRAAHHAHTAHGHATHATHHGSEASKHHAENHG----
>ERR1700759_3684327
MSSHKGAEHHRSAAEHHENAAHHHREAAKHHDSGDHHRAAHHAHSAHGHATHAAHHGSEASKHHAEKHA----
>SRR6267154_2535493
MTNHKGAEHHRSAADHHKKAAQHHRDAARHHDDGDHGRAAHHAHTAHGHATHATHHGSEASKDHAENHG----
>ERR1700681_2967493
---HKGANHHDVAAEHHENAAHHHREASKHYDTGEHEKAGHHAHVAHAHGLHATHHAHEAAKHHAEHHA----
>SRR4249919_3050305
LMSKKAVDHHKGASEHLTHAAKHHDEAAKHHESGNHEKAAHHAHTARGHALHARHHSDEAAKAHMEEHGKKX-
>SRR5215469_8883529
AMSKKAAEHHKQAAEHHGHAARHHGEAATHHEAGRHEQAAHHAHTARGHAAHATEHAEHAAKAHAEEHGTKX-
>SRR5215472_6198358
LMSKKAADHHKKASEHLTHAARHHTEAAKHHEAGDHEKAAHHAHTARAHAAHARDHSEEAAKVHLGEHGKKX-
>ERR1017187_7736977
AMSKKAAEHHKQSAEHHTHAARHHGEAAKHHEAGHHEKAAVCTENLNPNVLTMKSAQYDAR---IYDARSLN-
>SRR5580704_12853319
SMSKPAADHHMKAAEHHEEAAKHHRAAAEHHTAGDHQKAGHHAHVANGHHVNAVHHAEEASKHHATDHS----
>ERR1019366_5760491
--PRSGAQHHDAAAQHYEEAARHHRMAAKQYQASHHEKAAHYAQLAYAHHMYAEQHAAEAAKAHAKNHG----
>ERR1700693_4750673
--PITEEEHHEAAAQHHEQAARHHRVAAKQDHAGNHEKAAHYAHLAYAHHVQAEQHAAEAAKAHAKSHN----
>ERR1700730_12173117
MSAHKHKEHHEAAAKHHEHAAHHHQEAAKHYASGHHEKAGHHAHTAHAHGAHATHHAHAAANINVEHHGEK--
>ERR1700694_6071327
MSAHKHKEHHEAAAKHHEHAAHHHQEAAKHYASMACMRRTTRMKPRSTMsSIMARS---KSARX----------
>SRR5580693_4924512
AMRKAHHEHHANAAEHHEHAAHHHREAARHYESGEHEKAGHHAHVAHGHGVHATHHAHEAAKHHAEHHSED--
>SRR5437016_8712387
EMSKQAAEHHIKAAEHHEHAARHYKEAAKHHEAGNHEKAAHLAHVAHGHHLHATHHRSEERRVGKECRSRW--
>SRR5579883_1766477
MTKQHIAEHHRKAAERHEKAAHHHRMAAEHHDDEDHVTGAHHAHVAHGHHLHATHHATEAGKLHVEHHGHH--
>ERR1700722_7570681
-MAKQTAEHHTRAAEHHGHAQKHHQQAAKHHESGNHEKAAHHAQVAQGHQTQAMHHANEAAKSHTEHHGSKE-
>ERR1700743_30692
-MAKQTAEHHTRAAESHGHAQKHHQQAAKHHTAGNHEKAAHHAHLASSHEEDARTPSVNTRKSHKDTYGDKE-
>SRR5580700_2371651
AMSKEAAHHHSKAAEHHELAANHHREAAQHHEDGDHQAAAHHAHVAQGHQAHATHHASEAAKHHVEAHGDKX-
>SRR5579863_5227466
IMSKEAAHHHSQAAEHHEHAANHHKEAAKHHEAGDHEAAAHHAHVAQGHHAHATHHATEAAKHHVQAHGDKX-
>ERR1700689_4571874
-MAHKGAEHHHQAADHHEAAAKHHREAASHHEAGNHESATHHAHVAHGHALHATHH-----------------
>ERR1700688_3733124
-MSKEAAGHHYKAAEHHEHAAKHHRAAAEHHEAGDHQKAGHHAHVAHGHTVHAS-------------------
>SRR5277367_1853101
-STHSAHEHHAKAADHLEQAAHHHREAAAHHESGDAATAGHHAHVAAGHTAHA--------------------
>ERR1700761_4254522
SMSKQASEHHNLAAEHHEHAARHHRDAAKHHKAGDHEKAAHHAHVAHGHHLHATHHATEAAKHHVEAHGEK--
>ERR1700727_2977704
SMSKQASEHHNLAAEHHEHAARHHRDAAKHHEAGDHEKAAHHAHVEHGHASHAEHHHTEASRHHAAHHGQH--
>ERR1700731_2030917
-PDPSIHEHHEKAAHHHDQAAKHHREAAKHHKAGAHEKAAHHSKIAHGHHLHATEHHEHTSKLHAEKHGS---
>ERR1700743_1236405
-SMEEIHEHHEKAAHHHEQAAKHHREAAKHHQAGSPEKAAHHSKIAHGHASHATEHHEHASKLHAEDHGX---
>ERR1700756_1994461
-HDSDIHEHHEEAAHHHEQAAKHHREAAKHHKAGHHEKAAHHSKVAHGHHLHATEHHEEAAKLHAEAHSD---
>SRR2546423_14472982
-AEHEIHEHHEKAAHHHEQAAKHHREAAKHHKAGSHEKAAHHARIAYGHRLHAAEHQDHAAKMHAEEHSX---
>ERR1700680_2379019
-MSKKAAEHHRKASEHSTQAAKHHTEAAKHHDAGQHEKAAHHAHTAGGHERHSRTHSDEAAKAHADEHGKK--
>SRR6476659_3824902
-MSKKAAEHHRKASMHSGEAAKHHDQAAKQHEAGQHEKAAHHAHTATGHERQSRMHADEAAKAHADEHGKK--
>SRR4029079_3412719
-MSKKAAEPHTKESMHTGEDANHHDQAAKHHEAGQHEKAAHHAHTATGHERHSRMHADEAAKAHADEPAKK--
>SRR5438477_9761204
MPKHEGAEHHKKAAEHNEHAARHHKEAARHHEEGSHEKVGHHAHIAHGHHLHATHHAEEAAKTHSNQHEKEN-
>SRR5580704_1157045
MPKHDSPEheekvakhqdkladhheekateHHEKAAKHHDKAAQHHREAAKLHKEGDHETAGHHAHIAHGHHLNATHHSEEAAKSHAQQHGEK--
>SRR6266571_3990511
MAGVSSTDHHTKAAEHHEMAAKHHRAAAEAHSKGDVATAAHHAHLAHGHHSHATHHMEEAAKKHTEH------
>SRR6266567_3749516
MAGHSSVDHHTRAAEHHEMAAKHHRAAAAVHAKGGIVEAAHHAHLAQGHHAHATHHMEEAAKMHTEH------
>SRR3984957_15754445
MTEIKIHEHHEQAAQHYEHAAKYHREAAKHHKAGNHEKAAHHARIAFGHYLEAAEHQNNAARQHAKEHSX---
>ERR1700730_3219255
MKEYKIYEHHEQAAQHYDQAAKYHREAAKNHNAGNHEKAAHHARIAFGHYLEAAEHQNNAARQHAKEHSX---
>SRR5208283_1776841
LHEHDIHEHHEQAAHHHEHAAKHHREAAKHHKAGDHEKAAHHTKVAHGHHLHAVDHHEHASKMHAEEHGE---
>ERR1035437_8645898
-MSKKAAEHHKKASEHLTLAARHHGEAAKHYEAGAHEKAAHHAHIARGHAILARGNAEEAVKAHVEEQAKN--
>ERR1700693_1544462
-MSKKAAEHHKKASEHLTHAARHHGEAAKHHEAGAYEKAAHHAHAARGPGNSRSGTRX---------------
>SRR4029077_4859853
-MSKKAAEHHHQAAEHHEHAARHPRDAARHYEAGDHETAAHHAHTAQGHLHHATHHSTEAAKQHAEHHGQK--
>ERR1017187_6129136
KMSKKAAEHHRKAAEHHEHAAHHHKEAAKHHDAGAHEKAENHAHRAHAHHLHVTHHYEE--------------
>SRR5215472_2424335
AMSKKSAEHHTKAAEHLEHAAHHHKEAARHHEAGAHEKAAHHAHIAHAHHVHSHHHADEAAKSHLEDHGKL--
>SRR5450756_2276617
LMSKKAAEHHRKAAEHHEHAARHHKEAAKQHDAGAHEKAAHHAHIAHAHHGGKTTPLTYAVP-----------
>SRR5208283_368143
MAQHSGSGDHREAAEQYELAARHHREAAKAHDLGNHEKAGYHAYVAHAHHTLATQHAEEAMKHYATSHA----
>ERR1700723_380338
MS-HSGSHHHREAAEHYDQAAKHHREAAKHHDAGHHEKAGYHAYVAHAHHTFAAQHAEEAEKHYATAHA----
>ERR1700689_1737127
MAQHSGSHHHREAAEHYDQAAKHHREAAKHHDAGSHEKAGYHAYVAHAHHTFAAQHAEEAEKHYAPSHA----
>SRR5215467_2810391
-MSTKAAEQHDRAAEHHEHAARHHKEAAKHHKAGNHEKAAHHAHSARGHHEHAAQHGAEAAKAHTEEHGHQ--
>SRR5450830_554856
TMSKKASEHHRKAAEHHKLAATHHEEAAAHYDKGNHEKAAHHAHVAHGHTLHATHYAAEAAKMHVEEHGSKK-
>ERR1017187_7609860
---NKKIDHHRHAAAHHEHAARHHHAAAEAHASGLREKAGHHAHVAHAHDLHAQHHDDEAAKLHAEHHAGEP-
>ERR1700677_4341665
---QKRIEHHQHAARHHEQAATHHHAAAEAHSAGHHEKADHHAHVAHAHHLHARHHGDEAAKLHAEHNAHED-
>SRR5262249_39732114
QMSKKAAEHHKKAQEQHSHAARHHGEAAKHHEAGHHEKAAHHAHIARAHAIHARHYSEEATKAHGEEHGDK--
>SRR5215467_6832124
PMSSHAVDHHRKAAEHLEHAARHHQEAANHHEAGHHEKAAHHAHLARAHAIHARGYSEDATKAHHEDHGNK--
>SRR5215470_12036960
QMSKKAAEHHKKASEHHEHASHHDAEAAKHHESGHHEKAAHHTHTASGHAIHARHHSEEAGKAHAEDHGHK--
>ERR1700759_5669011
PMSKSAADHHKKAAEHHQHAAKHHTEAAKHHEAGHHEKAAHHAHVAHVHSSHAQEHHEHASRAHGEEHGSK--
>SRR3982074_501293
--THQGGEHHETAADHHESAAHHHREAAKHYESGDHEKAGHHAHVAHAHGLHATHHGHEAAKHHAENHKYP--
>SRR5215467_6148838
--AHKGGSHHELAAEHHETAAQHHREAAKHYEAGDHEKAGHHAHVAKAHGLHATHHGHEAAKNHAEHNESA--
>SRR5215813_14120567
--THKATSHHETAADHHEAAAHHHRAAAKHYESGDHEKAGYHAHVAHAHGLHAAHHGQEAAKHPAEHHAEH--
>SRR5262252_190131
--SHKGGDHHETAAEHHEEAARHHREAAKHYEDGDHHKAGHHAHLAHAHGLHATHHGHEAAKHHAEHHADH--
>SRR5215471_20087447
--THKGGSRHETAADHHETAAHHHREAAKHYESGDHEKAGHHAHVAHAHGLRPIMGKRPRSITPNI-------
>ERR1700688_719809
--SHAGSEHHETAADHHESAAHHHREAAKHYEGGEPEKAGHHAHVAHAHGLHATHHAHEAPKHHAEHHPEE--
>SRR5215467_15706025
----AKHEHHEKAAHHHEQAAKHHREAAKHHQAGNHEKAAHHSKIAHGHHLHAGEHHDHA-------------
>SRR5215471_17268835
----TIHEHHEKAAEHHEHAARHHREAAKHAQAGHHEKAAHHSKIAHGHSLHAAEHHQHA-------------
>SRR6202051_5226611
----TIHEHHEKSAHHHEQVAKHHREAEKHHKAGDHEKAAHHSKIAHGHHLHAVEHHDTA-------------
>SRR5580700_691679
-MSQKGVDHHLKAAELLEHAAKHQRSAAKYHGSGEFEKAAHHAMISHGHLVHAMEHVEGASKHVAENHDS---
>SRR5271154_6203042
-MSQKGVDHHLRAAELLEHAAKHHRTAAKHHETGEFEKSAHHAMVAHGHLVHAIEHVQEASKHHAFEHDT---
>ERR1700692_4913725
-MSQKGVDHHLKAAELLEHAAKHQRSAAKHHGAGAVEKAAHHAMISHGHLVQASEHIEGASKHQTESHDS---
>SRR5947209_798729
-EHLTGTERHLAAADHHERAASHHRDAAKHYAEKDFARAAHQALIAHGHMQQAVWHANEATKYHIEHHSN---
>SRR5580704_13162530
-----ASKHHHDAAEHHEKAAHHHREAAKHYEEDESETAAHHAHTAAGHGAHASHHTTEAAKLHTQHHGX---
>ERR1700743_439014
-----ASKHHHDAAEHHEKAAHHHREAARHYEEDDTEGAAHHAHSATGHGTHAHHHASEAS------------
>SRR5580658_837536
-----ASKHHHDAAEHHEKAAHHHREAAKHYEEEDADAAAHHAHTASGHGHHAHHHAAEASKAHAEHH-----
>ERR1700691_3551227
-----ASQHHHDAAEHHEKAAQHHREAAKHYEDEDHDAAAHHAHSASGHGHHANHHAAEARQPHPQHHGP---
>SRR5258706_712044
---HPSHDHHMKAAEHHEHASKHHKEAAAHHASGHSEKAAHHAHTAHAHTLHAAHHAGEAAKHHVTHKK----
>SRR5215472_8550299
---HPAQEHHTKAAEHHEHASKHHKEAATHYAAGAHEKAAHHAHSAHAHALHAAHHAGEAAKHHTSHHA----
>ERR1700689_759555
---HPAHEHHLKASEHHEHASKHHKEAAGHHAAGHHEKAAHHAHTAHAHTLHAEHHASEAAKHHVSHKK----
>SRR5271157_4256807
---HPAHEHHVKAAEHHEQAGKHHKEAAAHYASGDEAKAAQHAHTARAHTLHAEHHAGEAAKHHVSHKK----
>ERR1700730_16246211
VMAHKGAEHHKKAAEHHTHAAHHHREAAKHHEAGTSEKGAHHAHAAHGHTTHARHHADEAAKHHADEHGHS--
>ERR1700730_11984108
VMAHKGAEHHKKAAEHHAHAAHHHREAAKHHEAGTTEKGAHHAHAAHGHTLHARHHGDEDGKAL---------
>SRR5664280_462104
SMSKKAAESHKKVSEHLTHAARHHTEAAKHHETGQHEKAAHHAHIARAHATHAREHSENAAKSHLEEHGKK--
>SRR5450759_554306
SCLRKPQRRIKKASEHLTHAARHHTEAAKHHETGQHEKAAHHAHIARAHATHAREHSENAANTKSRYPQPI--
>SRR5512139_3675460
VTSKKAAESHKKASEHLSHAARHHTEAAKHHEAGQHEKAAHHAHTARAHATYAREHSENVAKAHSEGIKX---
>SRR5262249_4493708
----PASTHHHAAAEHYEKAAHHHRLAARLYEDDESGMAAHHAQSAAGYSAQAAHHSAEASKLHAHHHGEE--
>ERR1700759_2735061
----PASTHHHAAAEHHEKAAHHHRQAASHYEDNDSDTAAHHAHSATGHGAHAAHHGAEASKLHAHHHGEE--
>ERR1700733_7137713
----PASTHHHAAAEHHEKAAHHHRMAAKQYEDERAEAAAHHAHTASGHGAHAAHHSAEASKLHAHHHAEE--
>SRR5277367_858819
----PVAEHHHAAAEHHEHAARHHREAAKHYEEDDAETGAHHAHTASGHGAHAAHHAVEASKLHAHHHGSE--
>SRR5215472_5482998
TMSHATIEHHRKAAEHHEHAARHHREAAARHESGDHHTASHHALIAQGHLHHATHHTSEAAKHYANSHTEY--
>SRR5262245_47040845
PMSKKAVEHHRKAAEHSSHAEHHHNEAAKHHEAGHHETAAHHAHTARGHVVLTLHHAQEAAKAHAEEHGKK--
>ERR1700722_17094089
AMSKKAAEHHKKAAEHATHAAHHHTEAGKHNDAGHHEKPATHADPAHGDASHARHHAEEAARAHTEEHGKK--
>SRR5208282_3791820
-------DAHNKAAEHHENAAKSHRMAAEQHRKGEHEKGREHASQARAHSKTAHEHSETA-------------
>SRR6266481_543054
-HVEKGCGTPQKASEHLTHAAHHHGEAAKHHEAGHHEAAAHHAHTAHGHAIHARGDAEEAVKAHVEEHGKKX-
>SRR5258706_4967713
-HVEKGCGTPQKASEHLTHAAHHHGEAAKHHEAGLQIPVHRG------QSFRRIADSVPVIADSFRX------
>SRR4029077_733555
-LPLIWSPLHKKASEHLTHAARHHGEAAKHHEAGNHEKAAHHAHTARGHATHARGHAEEAAKAHTEEHGKK--
>SRR5471030_639260
-MSKKAAEHYKQSVEHHTHAARHHGEAAKHHEAGQHETAAHHAHTARGHATYARGHAEEAVKAHTEEHGKKX-
>tr|A0A1H5INE7|A0A1H5INE7_9RHIZ Uncharacterized protein OS=Rhizobiales bacterium GAS191 GN=SAMN05444161_5687 PE=4 SV=1
-MSKKAAEHHKKAAEHATHVARHHGEAAKHHEAGHHEKAAHHAHTAMGHAFHARGHAEEAAKAHAEEHGKK--
>SRR5260370_6339496
-HVEKRLRDTTKKLQNISRMRRItMGRLPSIMRLDTTKQRHTTLAPRHGHAIHATGHAX---------------
>tr|A0A225DK00|A0A225DK00_9BACT Uncharacterized protein OS=Fimbriiglobus ruber OX=1908690 GN=FRUB_09278 PE=4 SV=1
-MSKKAAESHKKAAESHKKAGEHHEQAAKHHEAGNHEKAAHHAHTAKGHQTHAERHTNDAAAHHAEEHGAK--
>SRR5262245_23436742
--PHSGRDHHETAAEHHENAAHHHRQAAKHYETGDPEKAGHHAHLAHGHGVHATHHAHEAAKHHAEHHGNH--
>SRR5208283_915204
--SHKGSHHHKAAAEHHSKAAHHHSKAAEHYEEGDHEKGGHHAHLAHAHGLHATNSAHEAAKHHAEHHGNE--
>ERR1022692_1187252
PMSKKAAEHHRKASEHLTHAASHHGEAAKHHDAGYHEKVAHHAHTARGHAIHARRHAEDAVMAHTEEHGKKX-
>SRR6266853_2390647
LMSKKAAGHHKKASEHLTHAAHHHGEAAKHHEAGHHETAAHHAHIASGHAIHARGYAEEAVKAHVEAYGKKX-
>SRR5262249_7896378
TMSKKAAEHHRKASEHHSHAARHHQEAAKHHDSGHHEKAAHHAHTAGGHAIHARDHAEEARKAHTEEYGKKK-
>SRR6478735_11977141
TMSKKAAEHHRKASEHLKHAARHHEEAAKHHDAGHHEKAAHHAHTARGHVIHGRGHAEEAVKGSYRGARQKI-
>SRR5215469_8564958
LMSKKAAEHHRKASEHLKHAAHHHEEAAKHHDAGHHEKAAHHAHTARGHVIHGRGHAEEDVTAHTEEHGKKS-
>SRR4029079_15311338
DDTHDRAEHHRKASEHHSHAARHHEEAAKHHDSGHHEKAAHHAHTAGGHAIHAIDH-----------------
>SRR3974390_1082698
MTDHDIHHHHHEAAKHHEAAAEHHRKAAHHAEAGDHEKASHHAHLAHGHKLHAVEHAEHAAKKHAHHHGNG--
>SRR5579862_249647
MTEHKIHHHHLEAAKHHEHAALHHRKAAEHEEAGYHELASHHAHIAHGHKLHAIEHSEHAAKKHTHRHADK--
>SRR3974390_2923829
MSEHEVHHHHREAAKHHEHAAEHPRRAATHAEAGEHEKASHHAHLAHGHKLHAIEHAEHAAKKHAQKHGHG--
>SRR5450631_4335944
MNEHDIHDHHHEAAKHHEHAAEHHRKAAAHAEAGEHEKASHHAQLAHGHKLHAIEHAEQAAKKHVHKHGNG--
>SRR5258705_12677773
----LAQGHHVKAAEHLEQASKHHNEAAGHSAAGHHETAAHHAHSAHAHMLQAAHHASEAAKAHRVHK-----
>SRR5215831_19127532
LMSKKAAGHHKKASEHLKHAALHHEEAAKHHEVGRHETAAHHAHTAMGHIIHARGHAEEAVKAHVEEHDRH--
>SRR5215813_12713863
FMSKKAAGHHKKASEHLKHAALHHEEAAKHHEVGRHETAAHHAHIAMGHIIHARGHAEEAVKAHVEEHDRH--
>SRR5262249_35423489
VVSKKAAGHHKKASEHLAHAVRHHEEAAKHHDAGHHETAAHHAHLATGHTILARGHVEEATKAHVEEHGKK--
>SRR6516225_2630054
LMSKKAAGHHKKAAEHLTHAARHHEEAAKHHDAGHHETAAHHAHLATGHAVHARGHAEEAMKAHT--------
>SRR5258706_11303519
LMSKKAAGHHKKVSEHLTNAAHHHEEAAKHHEAGRHETPAHHCSHRDGPSNSCX-------------------
>SRR6266403_6300338
IMSKKAAGHHKKVSEHLTHAAHHHEEAAKHHEAGRHATATHHAHTAMGHMIHAKGHAEEAVKAHVEEHGRS--
>ERR1700730_16364569
LMSKEAADHHRRAAEHHEHAARHHKEAATHHDAGSHEEAAHHAHTAHGHHLHASHHASEAAKAHAHEHVS---
>SRR3984893_11274889
LMSKQAAEHHRKEAENHAYADRHHKEAAKHHDAGSHEEAAHHAHSAHGQHLHATHHAGEAAKAHAHEHVS---
>SRR5260370_10324744
PMSKEATEHHRKAAEHHEHAARHHKEAAKHHDAGSHEEAAHHAHTAHGHHLPATHHAGEPAKAHPHQPST---
>SRR5579864_1471419
MSDHDIHEHHEMAAEHHENAAKHHREAAKHAKSGDHGKSAHHSHAAHGHALHAHEHHGHASKLHAEHHG----
>SRR6201999_4428379
MSTHEIHEHHDKAAEHHEHAAKHHREAAKHAKDGDHEKSAHHSKVAHGHALHAHEHHGHASKKHADHHS----
>SRR6202000_1008192
MSSHDMHEHHEKAAEHHEHAAKHHREAAKNSKAGDPEKSAHHSHAAHGHALHAHEHHGX--------------
>ERR1700733_5798165
MDSPEIHEHHEKAAEHHEHAAKHHREAAKHAKAGNHEKSAHHSKVAHGHSLHANEHHEHASKKHAEHHG----
>SRR5262245_59006430
MSTHQHKEHHESAAEHHAKAAHHHGKAAEHYEEGEHEKGGHHAHLAHAHGLHATHAANEAAKHHAENHGVH--
>SRR5262245_43150745
MTTHRHTEHHETATEHHAPAPHHHRKAAEHYEDGEHEKGGHHARLAHAHGLHATHAADEAAKHHAENHGEH--
>SRR5262249_27898195
MSTLHQKDHHEAAAEHHAKAALHHRKAAEHYEEGEHQKGGHHAHLAHAHGLHATHAANEAAKSHAEHHDEH--
>ERR1700685_3638131
SSAKSHKDHHEAAAEHHDKAAEHHRKAAEHYDSGDHEKGGHHAHLAHAHGLHATSAAHEAAKHHAEAHGDH--
>SRR5271170_782980
---SLLADHHDKAAEHHEAAADQHRQAAEHHRSAAHEKAAHHAHLAHGHHLHAAHHAEEAGKQLATAHA----
>ERR1700688_4600441
---SKLADHHDKAAEHHEAAAGHHRQAAEHHRTANHEKAGHHAHVAHGHHLHAVHHAEEAGKHHAEAHH----
>SRR5579864_6620188
HMSKKAAEHHKKAAEHLTNAAHHHKEAAKHHDAGHHEKAAHHAHTARGHAIQGRGHSEEAVKAHTEEHGKKX-
>SRR5580704_16417950
VMSKKAAEHHRKASEHLTNAARHHSEAAKHQDSGHHEKAAHHAHTASGHASQARSHADEAGRAHAEEHGQKX-
>ERR1700730_9174416
------KDEHNKAAEHHENAAKSHRAAAEHHGKNEHEKAKEHSRSPQQHSQNARQHSEQA-------------
>SRR5580692_11307601
------KDDHNKAAEHHDNAAKSHRAAAEQHGKGDHAKGKEHSATAQQHAQSAGKQSEQA-------------
>SRR5271155_1227401
------KDAHNKVAEHHENAAKSHRAAAEQHGKSDHAKGKEHSTNAQQHSQNARQHSEQA-------------
>SRR5262252_663114
---HKGADHHSAAAEHHENAARHHREAAKHYQSGDHHKAGHHAHLAHGHGVNATHHAHEAAKHHAEH------
>SRR6185295_16454184
-MSKQAADHHRKAAEHNEHAAQNHKEAAKYHEAGNHEKAAHYAHLAHAHHLHVAHHSAEASKSHLEHHGKK--
>SRR4029079_628390
-MSKQAADHHKKAAEHNEHAAQNHKEAAKYHEAGNHERAAHYAHLAHAHHLHVAHHSAEASKSHLEHHSTK--
>SRR5436305_1246676
----PATEHHTKAAEHHDRAAQQHRDAAKHYEDDKHETAAHHAHSAHGHASSAQEEATQASKKHAAHHSGQ--
>SRR5215469_17711277
LMSKKAAEHHRKASEHLKHAARHHEEAAKHHDAGHHEKAAHHAHTARGLIAHNAPKRELPIPAQTEQEPSI--
>SRR6476660_854516
-MSKTAADHHRKASEHSTHAAKHHGEAAKQHDAGQHEKAAHHAQTASGHEREARMHSGEDAKAHANEHGKK--
>ERR1700733_13046317
---KNASDHHHTAAKPHEHAAKHHKLAAEHHASGELAKAARHAHVAHGHHLSAEHHHHEAAKHFAEHNTD---
>SRR6266478_6069517
LMSKKAAGHHKKVSEHLTHAAHHHEEAAKHHEAGRHETAAHHAYLAMGHLIHARGYAEEAVKAHVDEHDRP--
>SRR6516162_5521683
GLDRGGTEHHRKASEHLKHAAHHHEEAAKHHDAGHHEKAAHHAHTARGHVIHARGHAEEAVKAHTAEHGKK--
>SRR6516162_9495542
GSTEVALNTTGRHRNTSSMPPTTTRRPPSTTMPDITKKAAHHAHTARGHVIHGRGHA--AVNAHTEEHGKK--
>SRR5580704_15213454
--NITSMRLSIIAKPPPTWRMAItRRPRILRIlpmLTTDT-PNITPAKLQKPISSFI---TST-----PPLPNK---
>ERR1700727_2850150
--AKHAADHHEHAAKHHEHAAEHHREAAAHVADGDHEAGAHHAHLAHAHHKHAEHHAGEASKAHIELHHE---
>tr|A0A1N6HF04|A0A1N6HF04_9BURK Uncharacterized protein OS=Paraburkholderia phenazinium GN=SAMN05444168_3227 PE=4 SV=1
--QHEVHHHHHEAAKHLDSAAKHHREAAKHAEAGDHEAASHHAHLAHGHGLHAGEHAEHAAKKAAHLHSG---
>ERR1700731_210757
--AKHAAEHHEHAAKHNEHAPNHHRKAAAHVADDDHESGAHHAHLAYAHHKHAEHHAGEASKAHIELHAG---
>SRR6201999_2564239
--KHPASKHHHDAADHHEKATHHHREAAKHYEDEDAETAAHHAHTASGHSHHAHHHAAEASKAHVQEHGH---
>SRR5208337_642429
--QHPAQAHHTKAAEHHEHAMKHHKEAATQYASGHPEKAAHHAHSAHAHALQATHHAGEAAKGHISHAQKK--
>SRR5271166_4746791
--QHPAHGHHTKAAEHHDQAMKHHKEAATHYAGGQHEKAAHHAHTAHAHSLQASHHANEAAKAHVSHGQKK--
>ERR1035438_9181592
-----AKEHHDKAAEYHEHAAKAHRAAAEHHGKGDHVKGKEQANAAKQHSQTANQHTDQA-------------
>SRR6185295_6369403
-----MKDAHNKAAEHHENAARSHRAAAEHHGKNDHAKGKEHSTKAQEHSQNARRHSEDA-------------
>SRR5580704_8854768
-----SRDEHNKAAEHHENAAKAHRAAAEHYGKGDHAKGKEYATSAKQQSQTANQYSDQA-------------
>SRR5580704_7478170
-----ARDEHNKAAEHHENAAKSHRAAAEHHGKGDHSKGMEHSTNAQQHSQNARQFSDDT-------------
>ERR1035438_3719677
---HKGIENHRKAAKHHEEAAKHHHDAAKHHEAGNHDKACESTVKAHGHHCLASDHMREVSKQHR--------
>SRR5579862_8090639
---QKGIENHKTAAKHHEEAAKHHLEAAKHHEAGNHDKACESTVKAHGQHCLASEAEREDVKH----------
>tr|A0A1F3K8Y4|A0A1F3K8Y4_9BACT Uncharacterized protein OS=Bacteroidetes bacterium GWF2_33_38 GN=A2W98_03950 PE=4 SV=1
------MENHKKAAKHHEEAAKHHHDAAKHHAEGNHEKASHSAVKADGHHCIASEARKEDAKHHT--------
>SRR6516164_3816328
-MSQKSAEHHTKAAEHHEHAARHHREAAKHYTAGSHEKAAYHAHVAHGDHLHAIYHAEEAAKYS---------
>SRR5580692_7223228
-MSTQGTEHHIKAAEHHEHAARHHRVAAEHYAGGNDERAAYHAQVAHGHHLHAIHHAEEAAKYT---------
>tr|A0A2H5FQX6|A0A2H5FQX6_9GAMM Uncharacterized protein OS=Legionella sainthelensi OX=28087 GN=CAB17_11925 PE=4 SV=1
----KLHQHHSSAAEHHRKAAEHHGEAAKHHQGGDHEKGNHHAHLAHGHQEHAKHHSSEAAKHTTGHERKE--
>tr|G9EPV5|G9EPV5_9GAMM Uncharacterized protein OS=Legionella drancourtii LLAP12 OX=658187 GN=LDG_7296 PE=4 SV=1
----KLKQNHTTAADHYKKAAEHHLEAAKNHEAGNHEKGNSHAYMAHGHSKQAKIHGSDACCHSAGIDTKK--
>SRR5271166_4754981
-MSQQSAEHHTKAAEHDEYAARHHREAAKHYASGNHEKAGYHAHLAHGHELHAINHAEEAAKYEIKFISEGT-
>SRR5271165_5204439
-AAC-----TSSVSPCLMFPRRNSSASgSSRYFPTA-RRIGRAPXX----------------------------
>SRR5208283_2469540
----KIAEHHAQAAQHHEKAAEHHKEASKHYEAGAVEKGAHHAQVAQGHAVQAEYHADEAAKAHAEHHGGK--
>tr|A0A257S911|A0A257S911_9PROT Uncharacterized protein OS=Acidiphilium sp. 21-60-14 OX=1970292 GN=B7Z67_11670 PE=4 SV=1
MMAHTTHEHHAHAAMHHERAAHHHHEAAKHAEAHEHEAAAHHAHLAHAHHLHATHHADEAAKQAADTHA----
>SRR5262249_1401492
---HKGGSHHEIPDENHDHHATHHRVRRQTSRSGEALRSGR--------------------------------
>SRR5260370_35260483
---HKGGSHHQTAAEHQQTAAHHHREAAKHNEAGHPGQTGPT-------------------------------
>SRR5260370_40037558
---HKAGSHHETAPEHHETAAPHHRASAQHYEA----------------------------------------
>SRR5450631_1542563
------QQHHEKAAEHHEQASKHHKEAVKHHESGDEKTAAHHAHIAHGHSAQATEQETEASKKYAEKHNPK--
>SRR5689334_7481690
XMLNEAAEHHKKAAEHHEFAARHHKEAAKYHETGFHEKAVYHARLAHEHHIHATYHASKG-------------
>SRR5690242_14891423
XMSKQIAEHHKKAAEHHESAAHHNKQAVMHHEAGSYEKAAYHARLAHEHYVRATYHASKD-------------
>ERR1019366_9440480
--PAAAAKQDDAAAQHYEEAARHHRQAAKDYQAGHFEKVSHHSHLAYAHHLHAEQRSEEAARAHLKNYFD---
>ERR1035437_3282233
--PGAAAKHHDAAARHYEEAARHHRQAAKHYQSGHHEKVSHHAHLAYAHHLHAEQHAEEAAKAHIKNHLD---
>ERR1700688_1358693
--PRTGAQHHEAAAQHHELAARHHRVAAQHDLSGHHEKAGHYAHLAYAHHLHAEQHGAEAAKTHAKHHTG---
>SRR5215813_1807967
-MSTKAAEHHEQTVPRLTLLVKSPLHR---APSGVTRDGVRAAQSVPA-------------------------
>ERR1700722_19382195
MSTLNKAEHHQAAADHHEKAAEHHREAAKHHDEGEHHLSGHHAHIAHGHGLQADHHADEATRHHVEAHSH---
>SRR5579863_25933
MSTLNKADHHHAAAEHHEKAAKHHREAAKHHEDGEHHLSGHHAHVAHGHGLQADHHAGEAAKHHVETHSH---
>SRR5450759_2404522
-MSKKAAESHKKASEHLTHAARHHAEAAKHQEAGQHEKAAHHAQNARAQATYAREHSENAAKAHFEEHGKK--
>SRR5450830_872629
-ILVERAASHKKASEHLTHAARHHAEAAKHQEAGQHEKAAHHAQNARAQATYAREHSENAAKAHFEEHGKQ--
>ERR1700721_1610003
-MSKKAAEHHKKAAEHATHAARHHTEAGKHHDAGHHEKAAHHAHTAHGPASHARHTAEDAPRPQTERTG----
>ERR1700676_2341835
-MSKKAAEHHKKAAEHATHAAPHHTEAGKPHDSGHHEKAAHHAHTAHGHASHARHHAEEAARAPPGEDAHS--
>SRR6516162_2935501
EMSKQAAEHHIKAAEHHEHAARHHKEAAKHHEAGNHEKAAHHAHVAHGHHLHAMHHHEEAMIFLGEKD-----
>SRR5215470_6709139
----TVVGFRVRDSANPNPSLRTRLLRGKPR--------------TVGLRLKSIRI---TVPREEANV-----
>SRR5580704_10693489
-----TWEHYHHAAGHHEQAAYHYKKAEKYDQAEEHEKAAHHAYLAHGHNQHAIHHDVEAARLHPEHCD----
>SRR6185437_1950867
-----TSDHHLRAAHHSEQAAKHHHEAAKHEEAGAHDLAAHHAYLAHGHGEHAAHHRVEAAKQHADHCD----
>SRR5579872_6549096
-----TWEHYHHAARHHEKAAYHYNEAAKYDEGQEHEKAAHHAYLAHGHNQHAMHHETEAAKLHAEQCA----
>SRR5579859_232825
-----TWEHYHHAARHHERAAYHYKAAAKYDQTEEHEKAAHHAYIAHGHTQQALHHDAEVAKLHAEHCD----
>ERR1700733_13494560
-----TWEHYHHAARHHEKAASRLHEAAKYDQAEEHEKAAHHAYLAHGHGQHATHHDVEAAQPHSEHCN----
>SRR6476620_2890533
-----TWEHYHYAARHHERAAYHYNEAAKFEQANEHERSAHHAYLAHGNTQHAIQHDAQAAKLHAEHCD----
>SRR5271170_5375838
-----TWEHYHHAARHHERAAYHFNEAAKYNQAEEYEKAAHHAHLAHGHNQHAVHNENEAAKLYASQCD----
>ERR1035438_5075301
-----TWKHYHHAARHHEKAAYHFNEAAKYDQAEEHEKAAHHAYLAHGLSQNPVLHDVEAAKLHAEQCN----
>ERR1035441_11100738
-----TLFPYTTLFRSHERAAYHFNEAAKYDEGEEHEKAAHHAYLAHGHNQHAIHHDVEAAKLHAEHCD----
>SRR5664279_1442597
-----TWEHYHHAAHHHERAAYHYKEAAKYDQAEEHEKAMHHAYLAHGHTQHAIQHDIEAAKSHADLCD----
>SRR5664279_4926826
-----TWEHYDLAAHHHARAAHNYQEASKYSQAEEHEKAMHHAYLAHGHSQSAIQHETEAARLHAEECE----
>ERR1035438_7334095
-----TLEHYQGAAHHHERAAYQFKEAAKYHQSEEDEKESHHAYLAHGHAQHALLHEVAAAKLHVEKCD----
>ERR1700721_2784080
---RRSAEHHTLAANHHEHATRHHHEAAKHFQNDDHAHAAHQAQIAYAHTRRAIRHSDGSCRILYGTAWA---
>SRR4051812_19989905
---RRSAEHhHTFAAHHHEQAARHHHEAAKHFQNDDHAHAAHQAQIAYAHTRRAIRHSNQAAEYYTELDDR---
>ERR1700730_1514122
---RRSAEHHSLEAHHHEQAARHHHEAAKHFQNDDHAHAAHQSQIAYAHTRHAIRHSDEATEYYTEQHGL---
>ERR1700722_13050175
---RRSAEHHTLAAHHHEHAARHHHEAVKHFQNDDHAHAAHQAPFVTATKLPNIiRNSMGGLRPTA--------
>SRR3954451_3354512
----TGTEHHDAAAVHHEQAASHHREASRHYAEKDYAHAAHQALIAHGHTQQRQPSTKSS-------------
>SRR5579872_2397560
----TGTEHHVAAAEHHEQAATHHRQAAKHYAEKDYAHAAHQALIAHGHTQQAVRHGNEATKYHLEQHGKD--
>SRR6185503_15719553
----TGAEHHTAAAKHHEQAASHHRQASRHYSEKNYIKAAHQGLIGHGHSQRAIRHGNEATKYHVEHEEKA--
>SRR5258706_11916707
----TGTEHHLAAAEHHEKAAVHHRGASECYAKQDYAQAAHRALIAHGHTQQAVRHGNEATKYHLEHDKE---
>ERR1700741_2922018
----TGTEHHDKAATHHEQAERHHREGSLHYAEGAYAHAAHQALIAHGHTQQAIRHGNEATKYHVEHHGRF--
>SRR5205807_4931412
-------DTHTKAAEHHENAAKSHRAAAEHHGKGDHAKGHEHSSTAQQHSKTAREHSETAHKKSGEHAGR---
>SRR5271167_149749
-------DTHAKAAEHHEVAAKAHRTASEHHGKGDHATGHEHSTTAHRHSETAHGHSKEAHEKSSQHAGK---
>ERR1700679_4233730
------KETHTKAAEHHENAAKSHRAAAEHHGKGEHTKGQEESTKAQAHSKTAREHSDM--------------
>SRR5450759_136227
--------KHNMAAEHHEKAAKSHRTAAEHHGKGEHEAGQRHSSEALEHSKNAHQHSQEAHNKSIEANKK---
>ERR1039458_1966516
--------KHNMAAEHHEKATKSNRTAAESAGAILGHITPRGRGAAEKKX-----------------------
>ERR1019366_4731186
--------KHNMAAEHHEKAAKSHRTAAEHHGKGEHGAGHRHFEAPLGDPAPVVAPKYVCPrgdytwyQKSAGSPK----
>ERR1035437_823894
--------KHNMAAEHHETAANSPRTAFSGYHSTCSSPKAD-----L------------CSaktfavgNX----------
>ERR1035441_4576807
--------KHNMAAEHHEKAAKSHRTAAEHHGKGEHEAGQRHSSEALARTFKECSSALTRGPX----------
>tr|A0A0M8YVQ9|A0A0M8YVQ9_9ACTN Uncharacterized protein OS=Streptomyces purpurogeneiscleroticus GN=ADL19_30475 PE=4 SV=1
--------AHTEAAEHHEKAAKSHRTAAEHHGKGDHADGHKHSTEAHGHSTTAHERSTKAHg-KSGEHHT----
>ERR1039458_5552437
MATHPAAEHHTKAAEHHKAAAAHHEQAAEHYGHGNYEKAAEHAHHAHGHHALATHHMEEAAKAHATHPDT---
>SRR6185312_11034738
----SGAEHHLAAATHHEQAAAHHRLASQHYAEKDYAHAAHQALIAHGHGQQAARHANEATKYHIEHHDAVP-
>SRR6185437_5883084
----TGTEHHVAAADHHELAARHHRNASKHYEEGDHAHAAHQALIAHGHAQLAARHANEATKYHVEHHGDAE-
>SRR5581483_9044389
----TGTEHHDAAAVHHEKAALHHREASRHYAEKDYAHAAHQALIAHGHTQQAIRHGNEATKYHVEHHGNPS-
>SRR6185437_768864
----SGTEHHEAAANHHEKAAWHHREAARNYAKKDYAHAAHQALIAHGHTQQAIRHGSEATKYHVEHHGQAL-
>SRR5579864_1751781
----TGKEHHTAAADHHEQAARHHRLASKHYEEKDYAHAAHQALIAHGHTQQAMRHGNEATKYHVEHHGNDS-
>SRR6185437_13689150
----TGTEHHMKAAEHHEQAALHHRRASRHYMEREFAYAAHQALIAHGHTQRAARHANEATKYHVEHHGKES-
>SRR5512135_2626370
----TGAEHHSLAAKHHEQAARHHHQAAKHYEEKDYAHAAHQALIAHSHTQEAIFHGTEATKYHAEHYDRAT-
>SRR5580704_12517227
-------HDHHKAAEHHDEAAKSHRKAAESHEEGDTEQASQHSQLANDHSKKAQE------------------
>SRR5271166_1276736
---HPASEHHHQAAAHHHAAAHHHHRAAHHHDLGEHEEAKEHAEAAQEHSEQAHKHTTTA-------------
>SRR5271167_711373
---HSSSEHHHQAAAHHHAAAHHHHQAAHHHDLGEHDEGKDHADAAHEHSELAQKHTTTA-------------
>ERR1700739_3733392
---HPSSQHHQTAAAHHHAAAHHHHQAAHHHEIGEHEEAQAHAVAAKDHSELAHQHTETA-------------
>SRR6516165_8907569
---HPAVEHHRMAAMHHHAAAHHHHQAAHHHAHGQHEEAKKHATSAHEHSEHGHKHSKEA-------------
>ERR1700744_635017
---HPSSQHHTTAAAHHHAAAHHHHQAAHHHEHGDHEEAQEHAAAAKEHADLAHQHTATA-------------
>ERR1700730_3219825
---HPSSQHHQTAAAHHHAAAHHHHQAAHHHELGEHEDAKEHAAAALGHSELAHKHTTTA-------------
>SRR5271167_1124854
---HPSTEHHLQAAAHHHAAAHHHHQAAHHHDIDEDEEAQEHAEAAHEHSEMAHKHTKTA-------------
>SRR5580692_6476539
---HASIDHHEQAAAHHHAAAHHHHQAAHHHAAGEHDHAKRHATAANEHSHAAHRHSNTA-------------
>SRR5215469_3288153
---HPAVEHHRQAAAHHHAAAHHHLQAAHHHSHGQHEEAKKHATTAHEHSDHGHKHTKDA-------------
>ERR1700734_4447503
---HPASEHHHQAAARHHAAAHHHHQAAHHHDLGEHKEAKEHAEAAHEYSEQGQKHTATA-------------
>tr|A0A1U7CVY5|A0A1U7CVY5_9BACT Uncharacterized protein OS=Paludisphaera borealis GN=BSF38_04616 PE=4 SV=1
---HPASEHHHQAAAHHHAAAHHHHAAAHHHDIGEHAEAKQHATAAHEHSEKAHAHTKTA-------------
>SRR5208283_1986152
---EASGQHHHQAAAHHHAAAHHHHQAAHLHDIGKHEEAKEHAEAALEHSEQAHKHTT---------------
>SRR5579864_3769991
MSTEETVEYHRKAAEHFQYAANHHMAAAAHYSDGRHEQAAREAYLAHGHYLHGSNHAAEAARLHARHFGQK--
>SRR5712692_8710527
DMSTEAVKHHRKAAEHFSYAAKHHAEAGTHYGAGRHEQAAREAYLAHGHYLTATNHAAEAARLHTRHFGQK--
>tr|A0A1Q7AG34|A0A1Q7AG34_9BACT Uncharacterized protein OS=Acidobacteria bacterium 13_2_20CM_2_66_4 GN=AUI11_11295 PE=4 SV=1
-MSTEAVDHHRKAAEHFEHAAQHHSAAASHYGAGRYDQASREAYLAHGHYLHGSNHAAEAARLHTRHFGQK--
>ERR1700693_4730633
-----TWEHYDLAARHHERAAHEFKDAAKYHETEEHEKAAHHAYLAHGHNQHTIHHGNATAKLHTAHCD----
>ERR1700751_2402501
-----AWEHYRHAARDHERAAHHFKEEAKYDEVEEHEKAAHHAYMAHGHNQHAIYH-----------------
>SRR3984957_10564985
----MAHEAHHKSAEHHEEAAKHHHLAAEHHIKGDHKKAHDHATQAHEHSVKPMTTPPPRTRR----------
>ERR1700679_2802903
----MAHEAHHKSAEHHEEAAKHHQLAAEHHIKGDHKKAHEHATKAHEHSAKAHEHSKAAHEA----------
>ERR1035437_1094595
---KTGIENHKKTAKHLEEAAKHHHDAAKHHEDGNHAKASESTIKAHGHCCCANDLQKEDSKGHA--------
>ERR1022692_4360246
---QKGIENHKKAAKHLEEAAKHHLDAAKHHEAGNHEKACASTLKAHGHTCLATEHQRENIKHHA--------
>SRR5579872_2422874
---QKGIENHKTAAKHHEEAAKHLHDAAKHHEAGNHEKASESTIKAHGHAYIAGEHQREYAKQHA--------
>SRR6478736_6934229
---HKGIKNHERAAHHHEKAAKHHHEAARHHQEGNHKKASESAIKALGHHCLASEAEREDIKHHA--------
>tr|A0A1F3BRJ9|A0A1F3BRJ9_9BACT Uncharacterized protein OS=Bacteroidetes bacterium GWA2_31_9 GN=A2033_19665 PE=4 SV=1
---KTVIEKHKKVATHLEEAAKLHHEAAKNHEEGNHDKAHSSTVKANGHTEHAKEIDKEIKKHHV--------
>ERR1700733_15661450
--SEMPKDAHNKAAEHHENAAKSHKTAAEHHGKGDHAKGREEYAKAHAHSTRTQENSQ---------------
>SRR5579871_5864872
--DHMARDAHNKAAEHHENAAKSHKTAAEHHGKGEHAKGREESARAQGHSKTAHEHSE---------------
>ERR1700728_3975191
--DDIARHTHTKAAQHHESAAKSHKKAAEHHGKGEHAKGREESAKAYGHSKTAHEHSE---------------
>SRR5580658_7378271
-AKHPSAEHHHNAASHHHAAAHHHHQAEHHHAMGEHEQAKHHAKAAKEHSELAHKHTE---------------
>ERR1700678_803200
-AKHPASEHHHTAASHHHAAAHHHHQADHHHVRGEHEQAKHHAAAAKEHSELAHKHSE---------------
>SRR5580704_6309482
-PKHPAQEHHHAAAAHHHAAAHHHHQAEHHHAVGEHAEAKQHATAAHEHSELAHKHTT---------------
>SRR5208283_5298634
-AKHPAGEHHHTAAGHHHAAAHHHHQAEHHHARGEHEEAKQHASAAQEHSEAAHKHTT---------------
>ERR1700688_3021457
-TKHPAQEQHHLAAAHHHAAAHHHHQAEHHHAVGEQAEAKQHATAALEHSELAHKHTT---------------
>ERR1700730_17718460
-ANHRSPQHHHLARAHHHAAAHHHHQAEHHHALGEHEDAKQHATAAHEHSELAQKHTT---------------
>tr|A0A1G7GDX7|A0A1G7GDX7_9SPHI Uncharacterized protein OS=Mucilaginibacter pineti GN=SAMN05216464_11097 PE=4 SV=1
-MPNTKHSHHEEAANHHEAAAKSHRNAHKEHTEGNDEKAATHAHEAEGHAEHARTNSKEAAKKHATKSATA--
>tr|A0A1Q6A0Z7|A0A1Q6A0Z7_9SPHI Uncharacterized protein OS=Mucilaginibacter polytrichastri GN=RG47T_3130 PE=4 SV=1
-MPTTKHSHHEDAAKHHDEAAKSHRAAHKEHTEGNDEKAAHHAQKAQGHHTQAGEHAKEASKKHATKHASK--
>tr|A0A1N6RL65|A0A1N6RL65_9SPHI Uncharacterized protein OS=Mucilaginibacter lappiensis GN=SAMN05421821_102120 PE=4 SV=1
----MKHSHHEEAAKHHTEAAKHHTEAHKSHAEGNDEKAAHHAQTAQGHQHKATEHATEAAKKHAEKHSSS--
>tr|A0A1F2JHJ8|A0A1F2JHJ8_9SPHI Uncharacterized protein OS=Sphingobacterium sp. HMSC13C05 GN=HMPREF3127_23090 PE=4 SV=1
-MSETKHNHHHDAAKHHDEASKHHQNAHKAHQEGNDEKAAEHAKSAAESSKKANDHAEEATKKHSHKHGMK--
>SRR5471030_1443776
XMSKKAADHHRKASEHFEQAALHHTEAATYHATNAYEKAAHHAYLAQAHQHHATHHAGEALQAHLNDHGSS--
>SRR3984893_18837529
CMSKKAADHHRKASEHHEQAAFHHAEAAKHHLTNAFEKAAHHASLAQAHQHHATHHLGEALQAHLTDHGSG--
>ERR1700704_2548263
------NDMHKKAAEHHETAAKSHRAAAEHHGKGDHAKGKEHSTNAQQESQNAHQHSEQA-------------
>SRR5450755_131087
---------------------------------RDPVHKRKPLTVDYQHSEQAD-STKSA-------------
>ERR1700739_1735412
------KDEHNKAAEHHENAAKEHRTAAEHHGKGDHGKGREHASSAKQHSQTANQH-----------------
>SRR5580692_11831826
------KDEHNKAAEHHENAAKAHRSAAEHHGKGDHASGKKHSTEARDHASKASEA-----------------
>tr|S9SB59|S9SB59_PHAFV Uncharacterized protein OS=Phaeospirillum fulvum MGU-K5 OX=1316936 GN=K678_11413 PE=4 SV=1
-ATLKANEHHAAAAAHSESAAQHHKEAAKQFDSGHHEKAAHHAQVAAGHSAHATEHATEATKKYAEQHSS---
>SRR5665811_936752
------QQHHEKAAEHHEQAAKHHKEAVKHYESGDDKTAAHHSYVAHGHSEEAREQEMEASKKYAITQG----
>tr|A0A2P8H9L3|A0A2P8H9L3_9BACT Uncharacterized protein OS=Chitinophaga niastensis OX=536980 GN=CLV51_11087 PE=4 SV=1
------HEHHEKAAFHYDLASKSHREAHKSHQEGNDEKAAHHAQAAHGHAAQAKEHEVEASKKHSEKVK----
>tr|A0A2W2A7H8|A0A2W2A7H8_9BACT Uncharacterized protein OS=Taibaiella soli OX=1649169 GN=DN068_18475 PE=4 SV=1
------QKNHEEAAKHHDEAAKHHRDAAKHASEGNYDKAAHSAQAAQGHHAKAGEQAKKAATQYAEKKG----
>SRR5690242_15247408
---HAASEHHHTAAAHHQAAAHHHLEAAHHHDIGEHDEAKVHAASAQEHCEHAEKHTKTA-------------
>SRR5579871_3518436
---HASSEHHHTAAAHHQAAAHHHLQAAHHHDHGNDEDAKKHSSAAHEHSEHGDKHTK---------------
>tr|L0DAS1|L0DAS1_SINAD Uncharacterized protein OS=Singulisphaera acidiphila (strain ATCC BAA-1392 / DSM 18658 / VKM B-2454 / MOB10) GN=Sinac_1996 PE=4
---HAACEHHHKAATHHAAAAHHHLEAAHHHNVGEHEAAKQHDEAAHEHGEHAHKAATTA-------------
>tr|A0A1N6H0I2|A0A1N6H0I2_9BACT Uncharacterized protein OS=Singulisphaera sp. GP187 GN=SAMN05444166_2625 PE=4 SV=1
---HAASEHHHMAAAHHAAAAHHHLEAAHHHDVGEHEAAKKHAETAHEHGEHAHKAAATA-------------
>ERR1700722_3501349
-------EAHTKAAEPHENAAKSHRTAAEHHGKGDHDNGREESTKAQSHAKTAREHSEAA-------------
>SRR5208283_1367323
-------YFDNVIRAHVPSAAKSHRAAAEYHGKNDHMKGNEHAMEAQKHSKVASAASNEA-------------
>SRR5580700_9650725
-------QAHTKAAEHHETAAKSHRAAASEHGRNDHMKGTEHSTEAHKHSKAGGEASDQA-------------
>SRR5258708_9298853
---HTGAGHHTLAAEHHEQAAHHHRQASKHYEKKDHANAAHESLIAHDHTRRAVHHSNEAGKYHAERHRK---
>SRR5690348_12125180
---YSGAEHHTLAAEHHEAAARHHRQAAKHYQGKDYAHAAHQSLIAHDHTRRAIHHSNEAGKYHAERHGA---
>SRR6185312_14811857
---HTGAEHHTFAAEHHERAARHHRQASKHYEEKDYAHAAHQSLIAHDHTRRAVHHSNEAGKYHAEQHGD---
>SRR5207248_1576108
---YTGAEHHTLAADHHEQAALHHRKASQHYDAKDYADAARGSLTAHGHTRRAVHHSNEAGKYHGERAEQ---
>SRR4029077_5485951
-MSKHASEHHRQASTHYHDAARHHQEAAHFSQAGNYERAAYHAGIAAEHQRQAAHHANEAAKHLP--------
>ERR1700737_4938755
-MSKNAAEHHRQASTHYHDAARQHQEAAHFHEAGNYEKAAHHAQIAADHQRQAAHHADEAAKHHA--------
>ERR1700747_3175619
-MSKQASEHHRQASTHYHDAARHHQEAAHFSEAGNYERAAYHAGFAAKHQRHAAHHAEQAAKHTP--------
>SRR5438128_9326564
-MSK-GAEHHRQASTHYHDAARHHQAAAHLSQAGNHGRAAYHAAIAAEHLRQAAHHADEAARHFP--------
>ERR1700727_996832
-------DAHSKAAEHHENAAKSHRTAAEHHGKADHAKGREKSAKAHGLSKTAHESSE---------------
>ERR1700730_4501103
---HPASEHHHAAAAHHAAAAHHHLQAAHHHDHGNHEEAKKHAASAHDHSQDADRHSKV--------------
>ERR1700730_10583744
---HASSEHHHNAASQHEAAAHHHRQAAHHHEYGNHDEAKNHATAAHDHSQDADRHSKG--------------
>SRR5580704_14572243
---HASSEHHFLAAAEHEVAAQQHRQAAHQHDRGNHAEAQKHARAAHDHSQDADRHSKT--------------
>SRR5450755_4646905
---HAASEHHHRAAAEHAAAAHHHYQAAHHHDHGNHEEAKKHAESAQGHSQDADRHSKI--------------
>tr|G8NWU3|G8NWU3_GRAMM Uncharacterized protein OS=Granulicella mallensis (strain ATCC BAA-1857 / DSM 23137 / MP5ACTX8) OX=682795 GN=AciX8_0020 PE=4 SV=1
------HEAHKKAAEHHEHAAKAHHAAAEHHESGDHKAAHEH-------SEKAHEHSTEAHKHSADAHSK---
>SRR5579864_5391397
-----TWEHYPQAARHHERAAYHYKEAGKFDEAEEHEKAAHDAYLAHGHNQHAIHHDSEAAKLHAEQCD----
>SRR5580704_9138589
-----TWEHYHHAGRHHEQAAYHYHEAAKYYQAEEFEKAAHHAYLAHGHHQHAMHHDAEAAKLHTEHSD----
>ERR1700694_2352438
----PASEHHLQAAAHNPAAAHHHLEAAHEHDYDTHEEAKKHAASALNHSQDADRHSK---------------
>ERR1700734_2270764
----PASEHHLKAAAHHAAAAHHHFEAAYEHDHGNHDEAKKHAASALDHSQDADRHSR---------------
>ERR1700687_1619548
----PASEHHLKAAAAHAAAAHHHFEAAHQHDYDNDEEAKKHAASALDHSQDADRHSK---------------
>ERR1022692_1760285
----PASEHHLKAAAHHAAAAHHHFEAAYQHDNDNHEEAQKHQASELDHSHDADRHSK---------------
>SRR5579872_7468067
----ASSEHHHNAAAQHQAAAHHHLEAAHHHDHGEHDEGKKHASSAQEHSEQADRHSK---------------
>SRR5208282_1491605
----LSSEHHHKAASQHEAAAHQHRQAAHHHENGNHEVAKKHASSACDHSQDADRYSK---------------
>tr|A0A1Y0M4X7|A0A1Y0M4X7_9FLAO Uncharacterized protein OS=Polaribacter sp. SA4-10 OX=754397 GN=BTO04_01060 PE=4 SV=1
--NINGIKSHRKTTGYLQVSAKKHLEAAMHYQEGNHEKAVQSAIVAHPNFNLVYKAQRKDMNQHA--------
>tr|A0A1F3BRJ9|A0A1F3BRJ9_9BACT Uncharacterized protein OS=Bacteroidetes bacterium GWA2_31_9 OX=1797314 GN=A2033_19665 PE=4 SV=1
--MKTVIEKHKKVATHLEEAAKLHHEAAKNHEEGNHDKAHSSTVKANGHTEHAKEIDKEIKKHHV--------
>tr|A0A2S7T1N8|A0A2S7T1N8_9BACT Uncharacterized protein OS=Chitinophagaceae bacterium RB1R16 OX=2077091 GN=CJD36_000780 PE=4 SV=1
--MKKSIENHKQAAQHHEEAAKHHKQAAKHHEEGNHDRAHTSTVIANGHAHMASEKQTDDAKHHA--------
>SRR5580704_10616937
-------QSHTKAAEHHETAAKSHRAAAEQHGKNEHGKAKEHATQAQQHSKTAREHSEQA-------------
>ERR1700752_4702105
---RQAVEHHESAAKHYQDAAYHHREAAKHYTAGDYEKAAYHAHMAHGHHLHADDHASEAAKHVLG-------
>ERR1700733_8059481
---QKAVEYHESAAKHHQDAAYHHKEAAKHYTAGDHEKAAYHAHMAHGHHLHAADHSAEAAKQMLG-------
>tr|A0A1V3PEB7|A0A1V3PEB7_9GAMM Uncharacterized protein OS=Rhodanobacter sp. C01 OX=1945856 GN=B0E50_17670 PE=4 SV=1
------HHHHHEAAKHLDEAAKHHRAAAEHAEAGNHDKASHHAHLAHGHKLHAIEHAEHAAKKHAHKHDV---
>SRR5271157_2351377
---HPAAEHHHQAAAHHAAAAHHHLEAAHHHETGEHDQAKKHAEAALRHSEHGHKHTTTA-------------
>SRR5271166_1750728
---HPATEHHHRAAAHHAAAAHHHLEAARHHEAGELDQAKKHSVAAHRHSEHGTKHTTTA-------------
>SRR6266566_1561533
IMSTQAAEQHEKAAAQYGHAARHYKEAAEHHKAGNYEKAAQHAQTARWHHEQATDHASEAAKAHAEHYGKQQ-
>SRR5947209_9682788
IMSTQAAEQHEKAAAQYGHAARDRKSTRLNSSHANISYA----------------------------------
>ERR1700730_3404010
-------DMHQKAAEHHEQAAKAHRIAAEQHGSSDHATAKQQSAQAADKSKAAHKQST---------------
>ERR1700688_2973991
-------DMHQKAAEHHDQAAKAHRTASEQHGSNDQASAKQQSAQDAEKSKAAHEQST---------------
>SRR6476646_9263370
-------DMHEKAAEHHEQTAKAHRTAAQQHGSNEHVSAKQQSAQAADKSKAAHEHST---------------
>SRR5450631_562038
-------EMHQKAAEHHEQAAKAHQNAATQHGSNDHVGGKQQSAQAAEKSKTAHEHST---------------
>SRR5580700_1624679
-------DMHQKAAEHHEQAAKAHRTAAQQHGSSDHVNAKQQSAQAVEKSKAAHEQSM---------------
>SRR5580704_15841692
-------DMHQKAAEHHEQAAKAHRAVAEQHGSNNHAAAKQQSAQAVEKSKSAHEHST---------------
>ERR1700730_10426099
-------DAHNKAAEHHEQAAKSHRVAAEHHGGGDHAAGHEHSGKAHAHSKMAHDQSG---------------
>tr|A0A2N3PRL3|A0A2N3PRL3_9PROT Uncharacterized protein OS=Telmatospirillum siberiense OX=382514 GN=CWS72_18585 PE=4 SV=1
--------SHTKAADAHEAAVKMHRSAADEHAKGDHKAGLEHAEKAVKLSKEAQERGTGA-------------
>tr|A0A1H0JSH8|A0A1H0JSH8_9RHIZ Uncharacterized protein OS=Methylobacterium phyllostachyos OX=582672 GN=SAMN05216360_12370 PE=4 SV=1
--------AHHEAAKHHEAAAKSHKTAAEHHEKGDAKTAGKHAEEAHGHSAKAHESSTKA-------------
>ERR1700685_1504215
-------DLHREAAEQHEQAARSHRTASEHNEKGDHDAAKWHAER----------------------------
>SRR6516164_11255356
---HPSSEHHHQAAAHHHAAAHHHHQAAHHHAVGQHEDAKKHATAAQEHSEMAHKHTSTA-------------
>ERR1700740_158540
---HPSQEHHHAAAAHHHAAAHHHHQAEHHHGRGEHEDAKHHAAAAHEHSEQAHKHTTSA-------------
>SRR5262249_17407650
---HPSSEHHLSAAVHHHAAAHHHHQAGHHHALGQHEEAKQHATAAHEHSEHAHKHTATA-------------
>SRR5580658_1949052
-----MHETHREAAEKHELAAHAHRTAAEHNEKGDYSKATWHSERA---------------------------
>SRR6202167_6317267
-----VHEAHGDAVERHELAAQAHRTAAEHNEKGDLSAAAWHSERA---------------------------
>SRR5215831_2780084
TSCRRKLLNtTERHQNTLKHAARHHEEAAKHHDAGHHEKAAHHAHTARGHVIHGRGHAEEAVKAHTEEHGKKX-
>SRR5262245_21882200
-SCRRKLLNTRKASEHLKHAAHHHEETAKHHDAGHHEKAAHHAHTARGHIIHGRGHAEEAVKAHAEEHGKKX-
>SRR5262249_32641853
ISCRRKLLNtTERHQNTLSTPPVTTRRPPS-TTMPDITKRRHTTLTPRGHVIHGRGHAEEAVKAHTEEHGKKX-
>SRR5271166_3867598
----KTWEHYHHAALLHEKAAYHHKEAARYDQAEEHEKAAHHAYLAHGHSQHAVHHEAEAAKLHAEQCAIL--
>SRR5271166_3724902
----KTWEHYHQAARNHEKAAYHFNEAAKYNQAEEHEKAAHHAYLAHGHSQQAAHHDVEAAKLHTEQCDRV--
>SRR5579863_300323
----KTWEHYHHAARHHEKAAYHYNEAAKYDQAEEHEKAAHHAYLAHGHSQHAAHHDVEAAKVHADQCDKA--
>SRR5580658_4821791
----KTWEHYHHAARNHEKAAYHFNEAAKFNQAQEGEKAAHHAYLAHGHSQQAIHHAAEAAKLHAEHYASQ--
>ERR1035441_6906181
----KTWEHYHHAARAYEKAAYHFNEAAKYNQAEEHERSTLFAYLAHGHSQHAVHHDVEAPKLHAEQCDSL--
>SRR5208283_2767898
----KTWEHYHHAARNHEKASYHYNEAAKYNRAEEHEKEAHHAYLAHGHGQLAVHHAAEAAKLHAEQCGSL--
>SRR5579863_4455819
----KTWEHYHHAARDHEKAAYHYNEAAKYHQAEEHEKEVHHAYLAHGLSQHAVHHEAEAAKLHTEQCDKL--
>SRR5579859_1088175
----KTWEHYHDAARHHELAAYHYKEASKYDKAEEHERAAYHAYLAHGHNQHAIHHDIEAAKADAEQCDKV--
>ERR1700734_995030
----KTWEHYHHAARNHEKAAYHFNEAAKFNQAQEHEKAAHHAYLAHGHSQQAIHHAAEAAKLHAEHYGSQ--
>SRR5271154_1375729
----KTWEHYHHAARDHEKAAYHFHEAAKYYQAEEREKAAHHAYLAHGHSQQAIHYAGEAAKLHAEQHDKL--
>SRR5271154_2378436
----KTWEHYHHAGRHHEKAAYHYHEAAKYYQAEELEKAAHHAYLAHGHHQQAIHHDAEAAKLHAERCDTP--
>ERR1051325_8213161
----KTWERYHHATRHHDRVADHDKTAAKYNPSEAHEKAAHYAYIAHGQTQHALHHDAEVAKLCAKQFDGD--
>ERR1700744_6269464
----SGPEHHLAAADHHESAAQHHRNASKHYEEGDHAHAAHQALIAHGHAQLASRHAKDATKSHVEHHSDS--
>ERR1700728_2423293
----KTWEHYHDAACNHEKAAYHFNEAAKYDQAEEHEKAAHQAYLALGHSQHAVHYAAEAAKLHAEQCAS---
>ERR1019366_10183257
----KTWEHYHHASRHHERAAYHYKEAAKYDKAEEHEKAAHHAYLAHGHSQHAIHHDAEAAKLHAEQCAS---
>SRR6476646_11755220
LMSKQAAKHHKKASEHFAKAAHHHGEAAKQHQAGNHETAAHHASIARGCDLHATEHAHAARKAYADDHG----
>SRR5664279_2450751
-MSK-TWVLaYRCAAHHLERAAYHYKEAAKYEEAGDHEKATHHAYLAHGYTQHAIHDDAEAAKLHAEHF-----
>ERR1039457_3077952
-MSK-TWELaYQCAARHHERAAYHYKEAAKYEEAGEHEKAAHHAYLAHGHTQHAIDCDAEAAKLHADHL-----
>SRR5664280_3607282
-MST-TCELaYYCAARHHECAANNYKEAAKCEAAGEHEKAAHHAYLAHGHTQHAIDCDAEAAKLHADHF-----
>SRR6266566_4045491
LMSKKAAQHHKQVAEHLKHAAFHHEEAAKHHETGRHETAAHHAHIAMGHNFSTRVTFAWRAgtsAAPYPVKN----
>SRR6266699_5933420
LMSKKAAQHHKQVAEHMKHAAFHHEEAAKHHETGRHETAAHHAHRAMGHNFSTRVTFAWRAgtsQHHTRSRI----
>SRR5262249_22277374
LMSKKAAGHHKQVAEHLKHAAFHHEESAKHHEAGRHEAAAHHAHVAMGHIIHARSHAEEAVKAHVAEHD----
>SRR5215467_11810449
LMSKKAGEHHKKASEHFTHAAHHYEEAAKHGESGNHEKAAHHAAIARGHDLHGTEHAHAARKVTAENQGK---
>SRR5262249_44821660
LMSKKAAEHHKKASEHFTHAAHHYEEAAKHGESGNHERAAHHAAIARDGIIQPTPGRASFN-LCAK-RGR---
>tr|G3IVL7|G3IVL7_METTV Uncharacterized protein OS=Methylobacter tundripaludum (strain ATCC BAA-1195 / SV96) OX=697282 GN=Mettu_0532 PE=4 SV=1
----TPQQHHQKAAEHHEQAAKHHKEAAKHYESGDDKTAAQHAHIAHGYSTQAMEQEMEASKKYAKMQ-----
>ERR1700761_5729412
------DAHHLKAADHLEEAAHHHREAAKHHAEGDVELAGHHAQVAAGHTAEADHHTVKAAKLYAKLHE----
>SRR5579862_2076159
------EDHHHQAAEHHEQAAHHHREAAKYHTEGDVELAGHHAHVATGHSAHAAHHAVESSKLHAHLHD----
>ERR1700761_5328672
---------HQKAATHHERAALHHREAAEHHAEGDIELAGHHAQVAAGHTAEAARHAAKAAKLHAKLHD----
>SRR5665647_1062279
----TPQQHHQKAAEHHELASKHHKEAAKLHESGDYEAAAHHALIAHGHTVQPQNKRRKPA------------
>SRR5450759_4733936
----TPQQHHQKAAEHHELASKHHKEAAKFHGSGDDEAAAHHALIAHGHTVHATEQEEEASKKYANR------
>ERR1039458_9226938
----TPQQHHQKAAEHHELASKHHKEAAKLHESGDDEAAAPPPLIANEHRVKATEQEEEASKKYANR------
>SRR5258706_9872423
----TGMEHHIAAAEQYERAALHHRRASQHYAELNHPQAAHQALIAHGHMQQAVRHSNEATKHYVELHSV---
>SRR4051812_20172208
----TGSEHHIAAAEEYERAARHHRCASQHYLELNHPQAAYQALIAHGHMQQAVRHSSEATKYYVELNGQ---
>SRR5690348_3201367
----TGSEHHIAAAEQYERAAERHRRASQHYVDLEHPQAAHQALIAHGHMQQAVRHSNEATKYYVEQHGA---
>ERR1700678_3139464
---DQIADHHEKAAAHHEKAAHHHRKAAEYHKSDDVDTAAQHAHSAHGHDLHAEHHAEAA-------------
>ERR1700722_11937569
---DQIADHHEKAAMHHEKAAHHHRQAAQHQKSEDIAAAAQHAHSAHGHDLHADHHAEAA-------------
>SRR5208337_523507
-SDTTLAEHHSKAAEHHGHAKHHHEEAAKAQEDDDHAKGHHHAHIAHGHHLQAEHHHEVAAKH----------
>SRR5271166_3951483
-SDTTLAEHHSKAAEHHGHAKHHHEEASKAHKAGDHAKGHHHAHVAHGHHLQAEHHQEEAAKH----------
>SRR4029453_15264427
TISTQAAAQHEQAAEQYGHAARHYQEAAEHYKRGQYAKAAHDVQTARGHHAQATAHAATAAKYHAEAYV----
>SRR6266446_8423588
TMSTQAAEQHEQAAEQYGHAARHYEEAAKHQKAGNHEKAAHHAHTARGHHKQATAHASAAVKPHA--------
>SRR4029453_3573209
TMSTQAAAQHEQAAEQYGHAARHYQEAAEHYKRGQYAKAAHEVQTARGHHAQDTDNTVTAAKYHAESYV----
>SRR4029434_6774894
TMSIQAAEQHAQAAAQYGHAARQYQEAAAHHQVGQYAKAAQHAQTARAHHAQATAHALAAARAH---------
>SRR6266851_6134461
-------KSHVAAADHYEKAAEHHRTAAEHASEGDQQAAAHHAHIAQGHALHGHEHAASAAKQHVALHA----
>SRR5580693_7599173
-------KFHVAAADHYEKAAEHHRSAADHADEGNPQAAAHHAHIAQGHALHGHEHAAEAAKKHIELHA----
>tr|A0A2U3KQE7|A0A2U3KQE7_9BACT Uncharacterized protein OS=Candidatus Sulfotelmatobacter kueseliae OX=2042962 GN=SBA1_400038 PE=4 SV=1
----KTREHYQEAARHHERAAFHYKEATRYDAAEEHEKAAHYAYLAHGHNQHAIHHDAEAAKLHAERCDS---
>SRR5579871_2888725
----SGIEHHETAAEHHEHASRHHHQASKHGEKRDHSPASHEVNLANGHAHRAVFHGDEAAKYHVEHFGRS--
>SRR5437868_383465
----SGAEHHVAAADHHEQAAQHHRLASKHCDGKDYAMAVQEAQIAHRHAQHSVFNGNEAAKHHVEHYGKS--
>ERR1700693_805720
----SGAEHHAAAADHHEQAARHHGQASMHCEG----------------------------------------
>SRR5579871_6775725
----SCAEHHAAAAGLHEEACGHLSRVAGHFQKSKIGEAAREAKLALDLAVRAAFHSNEAAKDYAK-------
>SRR5471032_131497
----SGAEHHAAAADHHEQAARHRDHAAELCVSSDDALAAREAAVAKSHARRAVFHGDEAAKHHVEHYGRS--
>SRR5580692_1574559
----RGAEHHAAAADHHDLAARHQGQATKHHDAKEYAQAAHEVQIAHGHAQRSVFHGDEAAKHHVEHLGKS--
>SRR5580658_8805264
----SGADHHTAAADHHEQAARHYGRASKHYDAKEYKQAAHEAQIAQGHAQHSVFHGDEAAKHHVEHFGKS--
>ERR1700723_2911494
----GPAEHHAAAADHHDQAARHHGLAAKHWDRNDDAL-----------------------------------
>SRR5271156_4309961
----SVAEHHAAAADHHEQAARHHGQAAKHRDDADYVLAAHEAQIAHGHAQHSIFHDNEAAKHHVEHFGKS--
>SRR5580692_11411556
----ICAEHHTAAAALHEEACTHLSCIAGHFQKSKV-------------------------------------
>ERR1700693_2802083
----DGAEHHAAAAAHHEKAARHHQEASRLCGEQKYAEAAHEAQMAHRHAHYSVF------------------
>SRR3984885_8417543
----GTAAHHAAAATHHEQAAHHHEEAARLCGEKDYARAAHEAQMAHRHAHYSIFHDDEAAMHHVEHYGKS--
>SRR5271154_5983987
----SSAAHHAAAGLHHEQAAHHHKDAARLCNEQQYARAAHEAQMAHRHAHYSVFHDDEAAMHHIEHYGKS--
>SRR5277367_1397320
----SAAQHHIAAAEHNEAAAQHHADAAQHCGRKADGSATIEAEIARGHAEYAVF------------------
>ERR1700722_13942348
--PSKTIDNHQQAAVHHTEAAKHHLEAAKFYAEGNTEKAAHSAMLAWGHHAIAGEFMNDDAKHHAQ-------
>SRR5580658_1516979
--YKQTIDRHQQAAAHHTEASKHHLDAAKFYAEGNPEKAAHSAMLAWGHHAIAGEFINDDAKHHAQ-------
>ERR1700733_10419240
--YKKTIENHQQAAAHHTEAAKHHLEAAKAYAENSPEKAAHSAMLAWGHHAIAGEFINDDAKHHAQ-------
>SRR5580658_3530262
--HKKSIDNHTQAAAHHKEAARHHLEAAKFYAEGNSEKAAHSAMLAWGHHAIAGEFINDDAKHHAQ-------
>SRR3989338_1468871
---QRGIKNHQRAAAHYEAAAKSHLEAAGHHENENHEKAAKSTVEAHGHSSLGNDAQKEDVKHHTE-------
>tr|A0A257K659|A0A257K659_9FLAO Uncharacterized protein OS=Flavobacterium sp. BFFFF2 GN=CFE24_14185 PE=4 SV=1
---QKGIDNHKKAAAHFESAAKSHLAAAKHHEDGHHEKAAKCTVDAHGHACMGKDAQTQDVKHHAS-------
>tr|A0A257LAW7|A0A257LAW7_9BACT Uncharacterized protein OS=Bacteroidetes bacterium B1(2017) GN=CFE21_08740 PE=4 SV=1
---QKGIDNHKKAASHFEAAAKSHLEAAKHHEDGHHEKAAKATVEANGHSNMAIDHQKEELKHSTK-------
>tr|A0A1F3VUM1|A0A1F3VUM1_9BACT Uncharacterized protein OS=Bacteroidetes bacterium RIFCSPHIGHO2_02_FULL_44_7 GN=A3D92_22580 PE=4 SV=1
---QKTVEGHRTAAAYYEAAAKSHLEAAAHLMNDQNDKASQSTMQAYGHSKLAIEAQKEYVKRHTL-------
>SRR5580704_9113175
-----AVEAHHKAAEHHQKAAEHHHKAAAHHEAGNHEKAHEHATKAHEHATEAHKHSSEAHEKS---------
>ERR1700722_5269228
-MSKSAAAHHGHAAYHHESATRHHRAAENAYGSGDHKKAAHEAQLAQTHALKAKHHSDLAAKEHLEHHGMD--
>SRR5665213_2018094
-VMSSSGEHHGWAAYHHESATRHHRAAENAYGSGDHKTAAHEAQCASDHASRAKHHADLAVKSHIEHHGMD--
>SRR5450432_4126119
-VMSSSGEHHGWAAYHHESATRHHRAAENAYGSGDHKTAAHEEQCASDHACRAKHHADLAVKSHIEHHGMD--
>SRR5262244_3625698
VMSDKAAGHHKKASEHLARAAYHHGKAAK---TKGYEAAMQHAQTARNHRLQAAGHAEKALKAHIDH------
>SRR5215468_6366957
VMSDKAADHHKKASEHLVRAAYHHRKAADHGETGRHETAVHHAQTARAHRLRAAGHAEKALNAHVEY------
>SRR5215475_6990276
--------------------AYHHGQAAK---TKGYEAAMQHAQTARNHRLQAAGHAEKALKAHIDH------
>SRR5215467_9446568
XMPHKAAEHHEKAAAHLERAAYHHGKAAKE--AGRYETAVDHAQMARTHRLQAAGHAEKALKAHVEY------
>SRR5262245_23793110
VVSDRAADHHKKASEHLAHAADHHKKAANHGETGRHEMAVHHAQTARAHRLQAAGHAEKALNAHIEY------
>SRR5262249_56064253
-------------MLEGVVAAYHHRKAANHGEIGGHETAVHHAQTARAHRLQAAGHAEKALNAHIEY------
>SRR5262249_33404574
XMSKRAAEHYRKASEHLTRAAQHDEKAASDHEAGRDEAAMEHAQAARTHTVRAESHAEKALRAYVEH------
>SRR5262249_28992298
----RVTDHYQKASEHLARAAEHEKKAAQDHEAGREQAALQHAQTARLHTLRAESHAEKALNAYVEH------
>SRR5271157_5172123
------KDAHNKAAEHHESAAKSHRSAADSHGKNDHAKGKEHATHAQQHAQTANEHSKTAN------------
>SRR5271170_6162693
------RESHNKAAELHESAAKSHRAAAESHGRNEHAKGKEHATQAQQHAQSAHEQSKTAN------------
>ERR1700683_1074718
------KDEHNKVAEQHEAAAKSHRAAADAHGKNDHAKGKEHSGQAQQHSQNARNQSQAAH------------
>ERR1019366_63553
-V------HQNT-TVVPRTTTR---APRDIIAPPRTRTX----------------------------------
>ERR1019366_3465073
-MSQSPAEHHGKAAYHHESATRHHRAAEKAYGSGDHKTAAHEAQCACGHSNLAKNSADAAAKSHMEHHGAQ--
>ERR1035441_4551387
-MSQSPSEHHGRAAYHHESATRHHRAAENAYGSGDHKTAAHEAQCAAGHASLAKHHSALAARSHMEHHGME--
>SRR6202011_4950182
-MKHASSEHHHSAASKHEAADYYHRQAAHNHDRGDHEEAQKHATSAHDHSQDADRHSKIAH------------
>SRR5260370_19141254
-MNHASSEHHRSAASEHEAAAYHHRQAVHHHENGNPEDAKKHATSAHDHSQDADRHSKNAH------------
>SRR6202011_5447000
-KKHVSSEHHHNAAAQHEAAAHHHRQAAHHHDHGNHEEAKKHATSAHDHSQDADRHSKTAH------------
>SRR5450830_80626
-------QLHQKVAEHHEQAAEHHQEAAKHHESGDDETAAHHAQIAHGHAVHATMH-----------------
>ERR1035437_2959451
-------DLHRHAAEHHELAAQHHRAAGMCQDCCHDDDAAHHAKETTCHAFHAAMH-----------------
>SoimicmetaTmtLMB_FD_contig_41_728288_length_224_multi_1_in_0_out_0_1 # 2 # 223 # -1 # ID=1056114_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.653
-------HLHEQAAEHHEQAAKHHKTAANCCASGDMDGADHHARSAQGHAVHAGAH-----------------
>ERR1700678_2085178
----NMKEEHNKAAEHHESAAKSHRAAADAHGKNDHTKGKEHSTQAQQHAQNAQQHSKTAN------------
>ERR1700692_1389415
----QMKEEHTKAAEHHESAAKSHRAAAEAHGRNDHPKGKEHATQAQQHAQNANENSKTAN------------
>ERR1700722_5975226
----SMKDSHNKAAEHHESAAKSHRAAADSHGKNDHAKGKEHSTQAQQHSQNARDHSKTAH------------
>ERR1700753_2365108
----KMKDDHNKAAEHHESAAKSHRAAAEAHGRDDHAKGKEHSTQAQQHAQNASEHSKTAN------------
>SRR3979409_386687
----KMKDAHDKAAEHHESAAKSHRAAAESHGKNDHTKGKEHATQAQQHAQNAGEHSKTAH------------
>SRR5579863_8143594
----QMKEEHNKAAEHHEAAAKSHRAAADSHGKNDHSKGKEHSTQAQQHSQDARNQSQSAH------------
>SRR5579863_8033294
----TMKEDHNKAAEHHESAAKSHRAAAESHGKNDHAKGKEHATQAQQHAQNAHEHSKSAN------------
>SRR6188472_3074432
VMDDQSERQYTNAADELERSVAHYREASRHSARGEHVKAAHHAHIARGHFLNAQASAHDAAKWHADHFS----
>SRR4029079_19750494
MDQDQSARQYTTAADELERAVAHYREAARHSALGEHVKAAHHAHIARAHFLNAQANAHDAAKWHADHFS----
>SRR5919205_844608
MTQDLSQDQYTMAADELERAVQHYREAARHSELGEHVKAAHHAHIARGHFLNAQSMAHDAARWHAEQFS----
>SRR5688500_4780906
MSQDRSVQEYAVAAQALARAAAHYREAARHTELGEHVKAAHHAHIARGHFLNAQDFAHEAAKLHASRFS----
>SRR5271170_5730888
-----AHEAHHAAAEHHENAAKHHRHAAEHHAAGNHKEGHEHSVQAHEHSKKAHEASTDAHKHSVAAH-----
>SRR5262245_50603022
--KHPAVEHHLQAAHHHHVAAHHHLHAAHHHAHGQHEEAKKHATTAHEHSEHGHKHSKNAHG-----------
>SRR6516162_4486343
--KHASVEHHHQAAARHHAAAHHHLQAARHHTHGQHEEAKKHATAAHEHSEHAHRHSKDAHS-----------
>SRR5271155_274146
------KEAHTKAAEHHENAAKSHRTAAEHHGKGEHTKGHEESTKAQTHSKTARDHSDMAH------------
>SRR3984957_6795854
------KEAHTKAAEHHETAAKSHRTAADHHSKGDHAKASEESTKAQSHSKTARDHSDMAH------------
>SRR6202048_1289238
------KETHTRAAEHHENAAKAHRTAAEHHGRGEHAKGKEQANAAKQHSQTANQHTEQAH------------
>SRR6478672_995202
------KETHTRAAEHHENAAKAHRTAAEHHGKGDHAKGREESTKAQGHAKTAREHSEA--------------
>SRR6516164_8106867
---HASIEHHHQAAARHHAAAHHHLQAAHHHAHGQHEEAKKHAVTALEHSEHGHKH-----------------
>SRR4051794_5723104
--------TMTLAAEHHEHAARHHREAAKFHEVKDILAAVDQAHMATDHQAHAIHYATQAAKEYLAA------
>SRR4051812_7997314
----PSRDEYTRAADELEKAVRHYREAASHSGRGEHVQAAHHAHIARGHFLNAQGMAHDAARRHADQFS----
>SRR5918997_1468141
----PSKDEYTRAADQLEKAVRHYREAASHSERAEHVQAAHHAHIARGHFLNPQGMAHDAARRHADLFS----
>SRR4051812_48614955
----PSKDECTWAQTNSRGHCA-TTGKPEPLRAGEHVEPAHHRHIARGHDLNAQGMAHDAARRHADRFS----
>tr|A0A1H7LQK3|A0A1H7LQK3_9ACTN Uncharacterized protein OS=Blastococcus sp. DSM 46786 GN=SAMN04515665_10783 PE=4 SV=1
----LSKNQYTKAADELVLAVRHYREAASHSGLGEHVQAAHHAHLARGHFLNAQAVAHDAARWHADEVS----
>ERR1039458_9372234
--PVPGATHHDAPAQHDEEAARHRQQAAELYQCGRHEKVSHHGHLAYAHHLHAKQHAEEAAKAHM--------
>ERR1017187_367733
--SAPGAKHHNAAAQHDEEAARHRQQAAKLYQRGHHEKVSHHAHLAYAHYLHAKQHAEEAAKAHM--------
>ERR1017187_6707531
--PVPGATHHDAPAQHDEEAARHRQQAAKLYQRGHHVKVSHHAHLASAP------------------------
>SRR5579864_7684482
--PVPGATHHDAPAQHDEEAVRHRQQAAELYQCGHHEKVCHHAHLAYAHIVHTKQHAEDAAKAHM--------
>SRR5271156_4620907
----MPKETHTRAAEHHENAAKAHRTAAEHHGKGEQDKGHEESTKAHEHSTEAHRSLNRC-------------
>ERR1700726_2667972
-----AKDEHNKAAEHHENAAKSHRAAAEHHGKNDHAKGKEHSANAQQHSQNAPKHSEPAH------------
>ERR1700756_5891212
----KLAEHHETAAHFHELAAEHHRQAAEHQRDEEHEKAAQHALAADGYRLHAVEHAEEASRLYAEEF-----
>SRR6202008_972907
----KLAEHHETAAHFYELAAEHHRQAAEHHRDEEHEKSAQHAFAADGYRLHADEHADEAARLFAEVF-----
>SRR5215469_981565
----KLAEPHETAAHFYELAAEHHRQAAESHRDEEHERAAQRAFAADGYRLHADEHADEAARLFAEVF-----
>SRR5271155_5034285
----MPKETHTKAAEHHENAAKAHRTAAEHHGKGEHDKGHEESTKAHEHSTQAHRHSADAHGKSGEART----
>tr|A0A257URR7|A0A257URR7_9PROT Uncharacterized protein OS=Acidiphilium sp. 37-64-53 GN=B7Z58_15790 PE=4 SV=1
----MASSNHKEAAKAHETAAKAHHTAAEHHDKGDHAAAQQHSTKAHEHSSAAHKHSTDAHQQSGKAAG----
>ERR1035441_2777089
-----AREEHNKAAEHHENAAKSHRAAAELHGKGEHTKGVEQSKTAQQHSQTAGKQSDQA-------------
>tr|A0A0D7P848|A0A0D7P848_9BRAD Uncharacterized protein OS=Bradyrhizobium sp. LTSP885 GN=UP09_16020 PE=4 SV=1
-----ANSEHNKAAELHETAAKSHRAAADQHSKGDHSKGVEQSKSAQQHSQSAGKQSDQA-------------
>ERR1039458_2112867
---QKGVDIHKQAAKHHLEASKHHLDAAKFYEVGEHEKAAVSTVKAQGSASLASDASREDAQMHSF-------
>ERR1017187_9273322
---QRGVDVHKQAAKHHLQASMHHLDAARFHEIGDHEKAAVSTVKALGSACYASQAMNEDAQIHTI-------
>ERR1035437_936469
---QPRVDIHKQAAKHHQDAAKHHQDAAKFHEQGQHDKAAASTVKAQGSATLANDASREDSRSHAI-------
>SRR5476651_1852145
---LKGIETHKQAAKHHQDAAKNHLDAAKFHEAGDHEQAAKSTVKAQGSASLANDAAREDAKSHAV-------
>ERR1022692_1737537
---LKGIENHKHAAKHHQDAAKNHLDAAKFHEAGDHEKAAASTVKAHGSASLANDISKEDAQNHAL-------
>SRR5665213_98322
---QKGIDLHNKAAKHYEAAAKYHHEAAKYHETDDHKMADESTVKANAAATLGNDAAREDAQYHAL-------
>SRR5579871_474268
--THPAAEAHHTAAASHEAAAHHHRQAAHHHETGEHETARTHANSAHSHSATAHEHTTTAH------------
>SRR4029078_9117261
-----RKDEHNKAAEHHESAAKSHRAAAEAHGKNEHAKGKEHANQDQQHEQNAHAHSQSAH------------
>ERR1700681_418265
-----MKDAHNKAAELHEAAAKSHRTAAEHHGRNDHAKGKEHATQAQQHAQNANEQSKTAN------------
>SRR5271169_2644609
-----MKDAHNKAAEHHESAAKSHRSAADSHGKNDHAKGKEHSTQARQQSQSAEEHSKSAH------------
>SRR5579871_2888725
---QGAMPDHASAAHHHAQAAYFHREALTHYRIGkDYAHAAHQALVAHGHAMQAVFHGEEARKYYSGHNGNG--
>SRR5437868_383465
-------------------------------SLNaDYAHAAHQALVAHGHALLAIDRGTEASKYYAEHDGNT--
>ERR1700733_4417216
---HRASEHHRTAARHHTQAAEYHRESSRHYEIGkDYAHAAHQALIAHGHALLGLKYGDEARAHYAGHHLSD--
>ERR1700693_805720
---QRAAAHHASAAIHHHQAARYHNEASRNYQVGkDYAHAAHQALVAHGHALQAFDHGNEASKFYAEHDGSA--
>SRR5713101_2571449
---HRAAEHHVSAAFHHKQAARHHREASRHYQVGkDYAHAAHQALVAHGHALQAIDRGTEARKYYTEHDGNA--
>SRR5476651_332982
---HGAAEHHNRAAMHHTLAARYHREASRHYQTGkDYAHAAHQALVAHGHALQAIDRGNDASKYYAGHNGNA--
>SRR5580658_6078868
---HDAAGHFTSAAFHHKQAARFHREASRHYEIGkDYAHAAHQALVAYGHGLRAIDYGSDAGTYFAEHDRKA--
>SRR5580658_8805264
---------------------------------------------AHGHGLQAIDHGNDAGTYFAEHDGKT--
>ERR1700723_2911494
---------------HHELAARYHREASRHYQIGkDYAHAAHQALVAYGHGLHAINHGNEARKYYARHDGSA--
>SRR5580692_11411556
------AEFHASAAFHHRQAAQFHREASRHYEVGkDFAHAAHQALIAHGHALQALEFELAAIVYYAGHAVRK--
>SRR5476651_2202389
---HGAAEHHNRAAMHHTQAARYHNEASRHYETGkDYAHAAHQALVAHGHALRALRYGDEARTHYAPHHLSE--
>SRR5277367_1397320
------------------------------------------AFLAMGHDLRAVAHGNEAARYHDG---VP--
>ERR1700675_2935724
-----AKEEHNKAVEHHENAAKAHRSAAEHHGKGDHAKGKEHANSAKQHSQTANQHSDQAH------------
>ERR1035441_10294419
-----AKDEHNKAAEHHENAAKAHRSAAEHHGKGDHMPRARNMRTVQSSIRRPPISIA-IR------------
>tr|A0A2E7Y947|A0A2E7Y947_9RHIZ Uncharacterized protein OS=Methylobacterium sp. OX=409 GN=CMH16_04620 PE=4 SV=1
MNSHPAHEHHMLAATHHAAAAHHHHEAAHHHAHGNAEEAKRHSTSAHEHAEHAHRHTANAHKH----------
>SRR3984957_15326747
-------QAHSKTAAHNESASKAHRAAAEHHGKNDHMKGSEHAAEAQKHSKVAGAASDEAH------------
>SRR5271168_46436
-------QAHTKAAEHHESAAKSHRAAAEFHGKNDHLKGNEHATEAQKHSKVASGATEAAH------------
>SRR5277367_474245
-------ESHEEASKHHESAAKSHKMAAEHHGRGDTASAAKHASEAHEHSSKAHQSSTK--------------
>SRR6202521_2882464
-------EAHQEAATHHENAAKSHKAAAEHHAKGDTASAAKHASEAHEHSSKAHQSSTK--------------
>tr|A0A1I4SLP7|A0A1I4SLP7_9RHIZ Uncharacterized protein OS=Methylobacterium pseudosasicola GN=SAMN05192568_104322 PE=4 SV=1
-------TAHAEAAKHHEAAAKSHKTAAEHHEKGDEATAAKHLKEAHGHSEKVHESSTK--------------
>ERR1700735_2891498
-------EAHEEAAKHHENAAKSHKTAAEHHGKGDTASAAKHSAEAHGHSTKDHERPT---------------
>tr|A0A0L6J387|A0A0L6J387_9RHIZ Uncharacterized protein OS=Methylobacterium sp. ARG-1 GN=AKJ13_24265 PE=4 SV=1
-------NAHREAAKHHEAAAKSHNTAAEHHEKGDNTTAAKHAKEAHGHSEKAHESSTT--------------
>tr|A0A177PXP4|A0A177PXP4_9PLAN Uncharacterized protein OS=Planctomycetaceae bacterium SCGC AG-212-D15 OX=1799653 GN=AYO40_06070 PE=4 SV=1
---HPCSEHHCNAASQHEAAASHHRQAAHHHNQGKHEEAKKHANSVIDRSQDADRHSKTAH------------
>tr|A0A2N8MCK1|A0A2N8MCK1_9RHIZ Uncharacterized protein OS=Beijerinckiaceae bacterium OX=1978229 GN=CR217_06575 PE=4 SV=1
---HPAGEHHHQAAAHHHAAVHHHHQAAHHHDLGEHKEAKEHATAALEHSELAHKHSTTAH------------
>tr|A0A1U7CVY5|A0A1U7CVY5_9BACT Uncharacterized protein OS=Paludisphaera borealis OX=1387353 GN=BSF38_04616 PE=4 SV=1
---HPASEHHHQAAAHHHAAAHHHHAAAHHHDIGEHAEAKQHATAAHEHSEKAHAHTKTAH------------
>SRR5436305_210371
---------HGNAAFHHEAAAHHHRQASRHHTAGDNEEADRHTRMAHTHSQTAHEHS----------------
>SRR3954471_7869634
-------------AFYHESAAHHHRQAARHHEAGDTEEAGRHAEAARSHGSTASQHS----------------
>SRR4051794_3105455
----------RRAAFYHETAAHHHRQAAKHHEGGDVEEAEQHGELAYGHSETAHghsg-KA----------------
>SRR3954465_8563158
----------HDAAHYHEAAAHHHREAARHHEGGEHERARRHATTAHEHSGQAHghsqeahqgSHG----------------
>SRR5208283_2189165
---HPASEHHLQAAAHHHAAAHHHHQAAHHHELGEHEEAQEHAKAAHEHSEQGHEHSTTA-------------
>SRR5271165_7167234
---HPASEHHLQAAAHHHAAAHHHHQAAHHHALGEHDKAKQHSTSAHEHSQHAHKHTTDA-------------
>tr|A5ER26|A5ER26_BRASB Uncharacterized protein OS=Bradyrhizobium sp. (strain BTAi1 / ATCC BAA-1182) OX=288000 GN=BBta_6724 PE=4 SV=1
-----ARDEHNKAAEHHDNAAKAHRSAAELHGKGDHAKGKEHASSAKQHSQTASQHSEQAH------------
>tr|A0A2N3PRK8|A0A2N3PRK8_9PROT Uncharacterized protein OS=Telmatospirillum siberiense OX=382514 GN=CWS72_18515 PE=4 SV=1
------AEHHRSAVSHHEAAARYHREASKHYQIGhDHAHAAHQALIALGQAWQAVDHAKTANGYYadhdidslqkymEQ-------
>tr|A0A126YQH4|A0A126YQH4_9BURK Uncharacterized protein OS=Burkholderia sp. PAMC 28687 OX=1795874 GN=AX768_20475 PE=4 SV=1
------NARETSAPSSHELAARLHIDASRHYLAGkDYSHAAHQALVAHGHALLALAQGKAVSDRYrkrasgetaTV-------
>tr|A0A1G7SI70|A0A1G7SI70_9BURK Uncharacterized protein OS=Paraburkholderia phenazinium OX=60549 GN=SAMN05216466_102505 PE=4 SV=1
------KEHLEAAASHHEQAGRFHREASRHFEEGkDFNHAAHQAVMAHGHALHAIAEANDALKHPAS-------
>tr|A0A2U0W4R2|A0A2U0W4R2_9BURK Uncharacterized protein OS=Paraburkholderia unamae OX=219649 GN=C7402_115204 PE=4 SV=1
------KMHIEAAASHHEHAAQHHREASRHFEEGrDFGHAAHQALMAHGHTLHAIDQAHEAGAHGSN-------
>tr|A0A2U0Y0F7|A0A2U0Y0F7_9BURK Uncharacterized protein OS=Paraburkholderia sp. OV555 OX=2135497 GN=C7513_102216 PE=4 SV=1
------KVHLEAAASHHEQAARFHREASQYYEAGsDQDHAAHQAVLAQGHALHAIDEANVAVKHAGA-------
>ERR1700733_4273555
------TESHTKAAEHHENAAKSHRTAAAQHSKGEHAKGQEESTKAQSHSKTARDHSDMAH------------
>ERR1035441_938148
------KDAHLKAAEHHDNAAKAHRTAAEHHGKGDHAKGMQHSKIAFDHSVKAHEASTHAHKKSSE-------
>ERR1035441_5448854
------KDAHLKAAEHHDNAAKAHRTAAEHHGKGRSEERRVGKEG----------------------------
>ERR1017187_7208358
------AVFSaaFLAGGCWALAavFAAARFAAQRFFNAATI-AALPAALSF--------------------------
>ERR1700753_2918760
-------APHKKAADHHEKAAKSHRAAAEHHDKGDKAAAGKHADEAHGHSTKAHETSAK--------------
>ERR1700761_7356260
-------ATHKEAADHHEKAAKSHRTAAEHHDKGDATAASKHAEEAHGHSTKAHESSSK--------------
>ERR1039458_3843154
-------DAHNKAAEHHENAAKAHRNAAELHGKGDHGAGKKHSATALEDSGKAHDAS----------------
>SRR5260370_22522930
-------NAHEEAASHHENAAKAHRTAAEHHGKGNHEEGRRHSSTAHEHSGKAHEAS----------------
>tr|A0A1I2DTA1|A0A1I2DTA1_9BACT Uncharacterized protein OS=Spirosoma endophyticum GN=SAMN05216167_12052 PE=4 SV=1
-----AHEHHKEAAYHFRKAAEYHENAQQLHEAGDHEKEAHEAYVAYGHHNLADQHAQAAAEHHAEKHDT---
>tr|D2QVB8|D2QVB8_SPILD Uncharacterized protein OS=Spirosoma linguale (strain ATCC 33905 / DSM 74 / LMG 10896) OX=504472 GN=Slin_6800 PE=4 SV=1
-----AHDHHKEAAYHFGEAAKHHQKAQELHQAGDHEKEAHEAYQAQGHHNLGDHHAKAAAEHHAEGHDK---
>SRR5476649_1149520
----HVAEHHEAAAELHEHATRYLLQASRHYEAGNVALSAHEAQTAHAMGLCTIDHSNEAAKHHAVR------
>SRR5450756_2141572
-----------LVSWAREMCIRDSRTAAEHHGKGDHAKGMEHSKIAFDHSVKAHEASTHAHAKSSE-------
>SRR6202046_2156493
-------HDHHKAAEHHEEAAKSHRKAADAHEKGEHADATQHSQMAHDHSTKAHEASSSA-------------
>ERR1700684_1565712
-------NDHHKAAEHHEEAAKSHRKAGDAHDKGEHADASQHSQIAHDQST----------------------
>ERR1700722_13108227
------RDSHTKAAEHHENAAKSHRTAAEHHGKGEHDKGRERPRRRRVAQRQRGSIRTPP-------------
>ERR1700722_6883511
------RDSHTKAAEHHENAAKSHRTAAEHHGKGEHAKGNEESMKAQGHSKSAREHSEMA-------------
>tr|A0A2M6VDT7|A0A2M6VDT7_9BURK Uncharacterized protein OS=Limnohabitans sp. B9-3 OX=1100707 GN=B9Z42_07035 PE=4 SV=1
----TEHQHHVQAAEHLELAAKSHKEAAKLISAGDHKAALQHVETAKTHTAHASDHVKEAQKK----------
>tr|E9I7K5|E9I7K5_DAPPU Uncharacterized protein OS=Daphnia pulex OX=6669 GN=DAPPUDRAFT_279722 PE=4 SV=1
----KPEHHHTKVAEHLEMAAKSHKEVAKHITANDHAAAQTHAKVAEEHMTKAKEHADLA-KK----------
>tr|A0A2M6VZL0|A0A2M6VZL0_9BURK Uncharacterized protein OS=Limnohabitans sp. 15K OX=1100706 GN=B9Z40_07615 PE=4 SV=1
----KPEQHHSKAAEHLELAAKAHKEVAKLISANDHTGAHAHVAVAHEHLTHAHTHADAA-KK----------
>tr|A0A1N6KRN2|A0A1N6KRN2_9BURK Uncharacterized protein OS=Paraburkholderia phenazinium GN=SAMN05444165_4433 PE=4 SV=1
-----KKEHLEAAASHHEQAGRLHREASRHFEDGkDFAHAAHQAMLAHGHTLHAIDRANEALKHHAGAPL----
>tr|A0A1Q8IYL3|A0A1Q8IYL3_9BURK Uncharacterized protein OS=Burkholderia sp. SRS-W-2-2016 GN=BTH42_10720 PE=4 SV=1
-----KKGHLESAASHHEHAARHHQEASRHFEDSrDPGHAGHQAVLAHGHTLLAIDEAQDAGAHSANAP-----
>tr|A0A244DI83|A0A244DI83_9BURK Uncharacterized protein OS=Paraburkholderia terrae GN=CA603_35275 PE=4 SV=1
-----KKEHLDAAASHHEQAARFHREASRHFEAGkDFAHAAHQAMMAHGHALHAIYQANDAGKHNSDTPL----
>SRR5579863_4152399
-----KKGHLEAAASHHEQAARHHREASRHFEDGrDLVHAAHLAMMAHGHTLHAIDQAHEAGAHSANTP-----
>SRR6201994_3502295
-----KKEHLVAAASHHEQAARYHHGASRHFEAGkDYAHTAHQAMLAHGHTLHAIDEAHDAGAHSANTSS----
>ERR1700756_1205677
-----KKGHLEAAASHHEQAARYHHEASRHFEAGkDYAHAAHQAMMAHGHALHAIDRAHDAGAHSAATPP----
>ERR1700716_2440016
-----KKEYLEAAASHHEKAARYHREASQHFEAGkDYAHAAHQSMMAHGHTLHAIDQAHNAGAHSASTPP----
>ERR1700742_968682
-----KKEHLVAAASHHEQAARFHHAASRHFEAGkDFDHTAHQAMLAHGHTLHAINEAHDAGAHNVNVPP----
>ERR1700693_4071883
-------------------------------------HAAHQAMMAHGHALHAIEHVNEALKHNAGAPL----
>ERR1700733_12934978
------------------------------------------AMLAHGHTLHAIVEAHDAGVLSASPPP----
>tr|A0A221AH39|A0A221AH39_9BURK Uncharacterized protein OS=Burkholderia sp. AD24 GN=bAD24_III09205 PE=4 SV=1
-----KKAHLEAAASHHEQAARYHHGASRHFDTAqgqdqDHAHAAHQAMMAHGHTLQAIDEAHEAGAHSTGAPP----
>tr|A0A1H4CXJ3|A0A1H4CXJ3_9BURK Uncharacterized protein OS=Paraburkholderia sartisoli GN=SAMN05192564_102553 PE=4 SV=1
-----KKGHLEAAASHHEQAARYHRAASRLFEGGhDFAHAAHEALIAHGHTLHAIDQAHDAGAHSTSAPP----
>tr|A0A1H1JIH2|A0A1H1JIH2_9BURK Uncharacterized protein OS=Paraburkholderia fungorum GN=SAMN05443245_6487 PE=4 SV=1
-----KKGHLEAAASHHEQAARYHHGASWHFEEGkDFAHAAHQAMLAHGHTLHAIDHAHDAGAHSNAPPT----
>SRR5579859_2687318
--------------------------LTVTGVQTcALPTSAHQAMMAHGHTLHAIDQAHDAGAHRANTPP----
>SRR5476651_246744
-----------SASGHHKQAAKYHREASRHYQSGkDYAHAAHQALAAHGHALQAIDHGKVAERYQAPRDP----
>SRR5579863_1914993
-----KVGHLEAAASHHEQAALFHREASQYYEAAkNYEHAAYLAVLAHGHAQHAIDEVHVAAKHASAPSS----
>ERR1700677_902425
------SMHHAAAVVHHQQAARFHREASRHYQIGkDYAHAAHQALTAHGHALRAMEHGQTASAHYVAHEH----
>ERR1700722_4097143
----------------------FHREASRHYQTGrDYAHAAHQALTAHGHALRAQEHGEAASALYAAHEG----
>ERR1700679_1675721
-----QSAHHVAAADHHQQAAQFHRAASRHYQIGkDYAHAAHQALAAHGHTLKAIDHENEASKYYAEHIG----
>ERR1700730_12142891
-----AAEYHASAAIHHELAARYHREASRHYQIGkDYAHAAHQALVAYGHALHAVDHGNHARNYGGGTSS----
>SRR5271170_1814435
-----SAEPHASAAIHHAAAARFHREASRHFQVGeDHAHAAHQALLAHGHGLRAMERGNQADAYYAT-------
>ERR1700690_2248574
------------------------------------------ALTAHGNAVYIRKHGQPANAGYAAHEG----
>APAga8741244255_1050121.scaffolds.fasta_scaffold61951_1 # 1 # 285 # -1 # ID=61951_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.709
-----KKGHLEAAASHHEQASRYHRAASRYFEAGqDYAHAAHQAMMAHGHTLHAIDQAHEAGTHSANTPP----
>UPI0006EF4159 status=active
-----KKGHIEAAASHHEQAARFHREASRSFEAGkDYDHAAHQAIMAHGHVLHAIDHAHDAGAHTTGTPP----
>ERR1035438_8884209
-----MHEAHWKAAEQHELAARAHRTAAEHNEKGDFTTAIWHSQRALEYSDHAYRLAKEAHAK----------
>SRR5271157_4414361
-----MHEDHRRAAELHELAAAAHRTAAEHNEKGDYTTAVWHSERALEYSDSAYKLAKEARTK----------
>ERR1700690_4483870
-----MHEAHRRAAEQHELAAHSHRTAAEHNEKGDYAAAFWHSQRALEYSDHAYKLAKEAHAR----------
>ERR1700690_2838516
-----MHEAHRKAAEQHELAARAHRTAAEHNEKGDCTTAVWHTQRAMKYSDHAYELSKEAHNK----------
>SRR5579863_5662397
-----MHEAHWKAAEMHQLAAQAHRTAAEHNEKGDFTTAEWHSARAREYSDHTYTLAKQCHNK----------
>SRR5580700_5209628
-----VHETHRQAAENHELAAQAHRTAAEHNEKGNYSTATWHSERALEYSDNAYKLAKEAHSK----------
>ERR1035438_1235828
-----MHDAHRRAAEQHELAALAHRTAAEHNEKGDYSAAILHSERALEYSDQAYKLAKEAHSK----------
>SRR5208282_4175553
-----MHEEHRKAAEQHALAAKAHRTAAEHNEKGDHAAAVWHSERALEYSDHAYKLAKETQNK----------
>ERR1035441_160778
-----MHEAHREAAEKHELAAQAHRTAAEHNEKGDSTAADWHSERDRKSTRLNSSHLGISYA-----------
>ERR1017187_5918702
-----MHEAHRKAAEQHELAARAHRTAAEHNEKGDRTTAELHSERALQYSDHAYALAQEAHTK----------
>ERR1035437_812044
-----MHETHRKAAEQHELAARAHRMAAEHNEKGDNVAGSWHAEQALX-------------------------
>ERR1035438_2952819
-----MHEAHRRAAEQHELAAQAHRTAAEHNEKGDLSNAVWHSQRAMEYSDHAFKLAKEADSK----------
>SRR5208283_3547207
-----MHEAHWRAAELHELAAEEHRTAAEHNEKGNFAPAIWHAERALEYADQAYKLGKEAHTR----------
>ERR1700680_3230249
-----VHDALRKAAEQHELAAQAHRTAAEHNEKGDNEAGSWHSERALECSDHGYRLAKEAHIK----------
>ERR1700680_5293557
-----VHDALRKAAEQNELAAQAHRTAAEHNEKGDNAEGSWHSERALEYSNHAFKLAQEAHNK----------
>SRR5579871_4072473
-----MHQAHRKAAEQHELAAQSHRTAAEHNEKGDFPMAVWHSERALAYSDKAYRLAQEAHNK----------
>SRR5580658_3585280
-----MHeerreaaekhDPHEKAAAQHDLAAQAHRTASEHNEKGDDGKGQWHAERALEHSTQAFRLSKEAHTK----------
>ERR1035438_1797539
-----MHESHRRAAEQHELAAQAHRTAAEHNEKGDNIAGKWHAERALEYSDHAYKLAREAHAKS---------
>ERR1017187_6667227
-----MHESHRRAAEEHELAAQAHRTAAEHNEKGDNIAGKWHAERALVYSDRAYKLANEAHNKS---------
>ERR1019366_7385905
-----MHESHRRAAEEHELAAQAHRTAAERNEKGDYVAERWHAERALEYSDHAYKLAREAHTKS---------
>SRR5271157_1405023
-----LHESHRKAAEQHELAAMAHRTAAEHNEKGDGAAGSWHAERALEYSDHAYKLAREAQTKS---------
>SRR5579862_6534521
-----MHESHRKAAEQHQLAAQAHRTAAEHNEKGDYTAAIWHSERALEYSENAYKLAKEAHNKS---------
>ERR1039458_3948685
-----MHETHRKAAEQHELAAQAHRTAAEHNEKGDYAAAIWHSERALVYSDRAYKLANEAQTSQ---------
>ERR1039458_7317855
-----MHEEHRRAAELHELAAQAHRTAAEHNEKGDRATSIWHSERALEYSDRAYKLAVEVRNKS---------
>ERR1017187_969610
-----INDAHREAAEEHERAAQAHRTAAEHNEKGDGTAGSWHAERALQYSDHAYKLAKEAHNKS---------
>ERR1017187_7516137
-----MHETHRQAAEHHELAAQAHRTAAEHNEKGDYPAAAWHSERALEYSDRAYKLAKEAHSKS---------
>SRR5665647_3655933
-----MHEAHRKAAEQHELAAQAHRTAAEHNEKGDYAAAIWHSKRALEYADRAYQLADEAHTKS---------
>SRR6202050_3988918
-----VHETHRAAAERHELAAQAHRTAAEHNEKGDLSVAAWHSERALEYSDHAYKLAKEAHNKS---------
>SRR5271165_2409151
-----MHETHRKAAEQHELAAQAHRTAAEHNEKGDCTTAEWHSKRALEYSDQDRKLAKEAHNKS---------
>SRR5580658_9165545
-----FESLHGKAAELHDLAAQAHRTAAEHNEKGDHDAENWHLERANEYSEQAFKIAQELHTKS---------
>SRR5271157_975027
-----MHEEHRRAAELHELAAQAHRTAAEHNEKGEGVAGSWHAQRALEYSDHAYKLAMEAHNKS---------
>ERR1039458_1801200
-----IRDAHTRAAEQHERAAQEHRTAAEHNEKGDGVKGSWHAERALEYSDHAYKLAMEAHNKS---------
>ERR1019366_6067402
-----MHDTHRKVAELHALAAHAHRTAAEHNERGDDAAGGWHSERALDYSDQAYKLAKEAHAKS---------
>ERR1022692_786886
-----MHNLeHRKAAEQHELAAHEHRTAAEHNERGEGVKGSWHSERAMQYSDHAYKLSKEAHNKS---------
>SRR5271166_2169624
-----MHETHRRAAEEHELAARAHRTAAEHNEKGDRTAADFHSERALEYSDHAYRLAQEAHSKS---------
>ERR1700683_4802183
-----MHELHRRAFEEHELAAQAHRTAAEHNEKGDDPTENWHTERALEYSDRAFKLAKEAHAKS---------
>SRR5580692_7835881
--NHNAAEHLRSAALHHQRAGQFHREASRHYQIGkDYAHAAHQALIARGHALQASDHEDDAGAYFSEHNGN---
>SoimicMinimDraft_4_1059732.scaffolds.fasta_scaffold1835258_1 # 3 # 221 # -1 # ID=1835258_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.680
--KHNPAGHLTSAVFHHKQAAQFHREASRHYQVGkDYAHAAHQALVAHGHGLQAIDHGNDAGAYFVEHNGK---
>GraSoiStandDraft_25_1057303.scaffolds.fasta_scaffold2786330_1 # 1 # 273 # 1 # ID=2786330_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.641
--SRVQSAHHAAATDHHQQAAQFHRAASRHFQIGkDYAHAAHQALAAHGHALRALERGQAASALYAEHEGS---
>SRR5471032_58518
------KGHLEAAASYHEQAARFHREASHHFGAGkDYDHAAHQAVMAHGYALHAIDEANNVVKHNAG-------
>SRR6478735_7802765
------KGHLESAASHHEQAARFHREAAQHYEAGkDYDHAAHQAVLAHGHALHAIDEASVAAKHAGA-------
>SRR5471032_3571840
------KGHLEAAASHHEQAARFHREASHHFGAGkDYDHAARSEEHTSELQSHH--------------------
>tr|A0A1H5NEP2|A0A1H5NEP2_9BURK Uncharacterized protein OS=Burkholderia sp. WP9 GN=SAMN02787142_6276 PE=4 SV=1
------KEHLDAAASHHEQAARFHREASQIYEAGkDYDHAAHQAVLAQGHALFAIAESNLAAKHTGA-------
>ERR1700744_2473096
----------------------------------------HQAILAQGHALHAMDESNLAAKHTGG-------
>ERR1700690_4486807
-------------------------------ENGkDYAHAAHQAVMARGYSVQAIHHGNEASKYHAG-------
>ERR1700720_1670717
-----MHEAHRRAAQQHELAARAHRTAAEHNEKGDDEEGNWHSERALEYSDQAYRLAKDAHAKS---------
>SRR5689334_4818128
-----MHEGHRKAAEQHDLAAHAHRTAAEHNEKGDSVAEQWHAERALEYSDQAYKLAKEAHAKS---------
>ERR1035438_6644050
-----QSDTHRLAAEQHELAAQAHRTAAEHNEKGDDEGGRWHAERALEYSNHAYKLAKEAHAKS---------
>ERR1700722_7188787
-----MHETHRRAAEQHELAAEAHRTAAEHNEKGENETGKWHSQRAMEYSDHAYKLAKEAHTKS---------
>ERR1700733_11670911
----TMHEAHRTAAEQHELAAHAHRTAAEHNEKGDNEGGKWHAERALEYSDQAYKLAKEAHTKSA--------
>ERR1700722_20636178
----TMHEAHRTAAEQHELAAHAHRTAAEHNEKGDNEGGKWHAERALEYSSTRIRPINSQRKRTR--------
>ERR1700720_3737208
----IMHESHRQAAEQHELAAHAHRTAAEHNERGDNPTANWHATRALAYSDQAYKLAKEAHTKSG--------
>ERR1700733_11573431
----TMHDAHRKAAEQHELAARAHRTAAEHNEKGDDEAGRWHAERALEYSDHAYKLAKEAHAKSA--------
>ERR1700691_2814054
----NMHEAHRKAAEQHELAARAHRTAAEHNEKGDNEAGIWHAERALEYSDQAYKLAQEARTKSG--------
>SRR6202161_1855922
----NMHEAHRKAAEQHELAASAHRTAAEHDEKGDDEAGRWHAERALEYSNDADKLSLEAHNKSG--------
>SRR5690242_7580166
----TMHETHRRAAEQHELAAHAHRTAAEHDEKGDTETGNWHAERALAYSDRAYKLAMEAHTKSG--------
>SRR5690349_19263105
----TMHETHRRAAEQHELAAHAHRTAAEHDEKGDTRRVtgmrsaPWHIRIVPI----GW--LWKRTPNPG--------
>SRR5712691_13313662
----SIRSLHRKAAEYHDLAAHAHRTAAEHNEKGGNEAQNWHLERALEYSNRAYKLAQEAHSKSG--------
>SRR5271165_4566811
----IMHEEHRKAAEQHERAAQAHRTAAEHNERGDGAGGRWHAERALEYSDHAYKLAKAANNKSS--------
>ERR1700679_954346
-------EGRRTARTCGTRS----SHRRRTPRKGDNEGGKWHAERALEYSDHAYQLAKE--------------
>SRR5579872_3040707
----IMQDLHRKIAELHELAAQAHRTAAEHNEKGDNESANWHSQRALDYSNRAYELAKEAHNKSA--------
>SRR5579864_4074794
----VMHDLHQKAAEYHELATQAHRTAAEHNEKGDNESANWHSKRALEYSNRAYELAKEAHNKSA--------
>SRR5271165_620434
----NMHDAHRKAAEQHELAAKAHRTAAEHNEKGDNEAGRWHARRALEFANQAYKLAQEAHNK----------
>ERR1035438_965373
----TMHEAHRKAAEQHELAARAHRTAAEHNEKGHSTAAIWHSERALEYSDHAFKLAKEAHNKSG--------
>ERR1035441_4345400
----TMHEAHRKAAEQHELAARAHRTAAEHNEKGHSRSEERRVG-----------------------------
>SRR5580658_4948604
----TMHDTHRKAAERHELAARVHRTAAEHNEKGDNEAGSWHSERALEFSDHAYKLAQEAHAKSG--------
>SRR5580704_12130880
----PMRETHRQAAERHEQAARAHRTAAEHNEKGDDEAGRWHSERALEYSDHAYKLAQEAHTKSG--------
>SRR5580700_4825899
----TMQDTHRKAAERHEQAARAHRTAAEHNEKGNDDAGRWHSERALEYSDHAYKLAQEAHTKSG--------
>ERR1035441_5708109
----TMHEEHREAAELHELAAREHRTAAEHNEKGNFTAAEYHSQRELEYSDQAYKLAKDAHTKSG--------
>SRR5579872_2570035
-----MHEAHQKAAEQHELAAKAHRTAAEHNEKGDYTAAIWHSQRALEYSEQAYKLAKEAHTK----------
>ERR1700693_2895405
-----MHdahEAHRKAAEQHEISAHAHRTAAEHNEKGDYSGAIWHSERALEYSEQAYKLSKEAHTK----------
>SRR5579864_2262205
-----MHSAHLKAAEQHDLAAHAHRTAAEHNEKGDNDAEKWHSERALEYSDQAYKLAKEAHAR----------
>SRR5579864_6142950
-----MHDARRKAAEQHELAARAHRTAAEHNEKGDPEEASWHSQRALEYSDHAYKLAKEAHAK----------
>SRR5215831_17273075
LMSNKAAEHHKKALQHLTHAARHHGKAAWHHQAGRYERAIHHAHTASGHHYQAGGHADRAVKAHVQH------
>SRR5215831_2017232
PMSKRAAEHHKKASKHLAAAACHHEKAAAAHEIGRYETETDHAYEAGRHRVYAKRHAQRAWKDHVEH------
>LakMenE01Jun11ns_1017448.scaffolds.fasta_scaffold3487450_1 # 2 # 187 # 1 # ID=3487450_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.435
ISTEPEAQS-----AELMIRTKDGRDRLWSFVSSALG-----TQSDGRRLFVCMAQDVTERKAHDEQ------
>HubBroStandDraft_5_1064220.scaffolds.fasta_scaffold1605167_1 # 1 # 417 # 1 # ID=1605167_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.650
LL---------GESQRLTVQL----------QSRQTELQQTNEELATKAKLLAEQ--NAERERKEEH------
>SRR5579872_4560724
---QHRAAQHGAAASHHRQAAQHHRAAATHYRSGkDYAHAAHQALAAHGHTLLAIDYGSHAGTYAAQHGGD---
>SRR5579863_6441461
-----------SAASHHKQAARYHREASTHYRSGkDYAHAAHQALAAHGHALLAIDHGDQAGKYYAQHGGD---
>SRR5271165_1744909
---HhDSGEPHAAAAVHHAEAARFHREASRHYQDGeDHAHAAHQALLAHGHGLRAFERGNQANAYYGTLSVE---
>ERR1035437_993647
-----THDAHLKAADQHELAAHAHRTAAEHHEKGDDVGGRWHAARALEFSDHAYKLAKEAHNKS---------
>ERR1035437_10047576
-----THDTHLNEAEQLELAAHTHRTAAEHHERGDDVGGRGHTARALEFSDHA--------------------
>ERR1017187_4798261
-----THDAHLKAAEQHELAAHAHRTAAEHHEKGDDVGRSE--------------------------------
>ERR1700730_15266082
-----MHDTHRRAAEQHELAAHAHRTAAEHNEKGDNETGNWHSQRALEHSDRAYELAKEAHAKS---------
>SRR5580658_2267416
-----VHGLHSKAAEQHELAARAHRTAAEHNEKGDHETAEFHVERAREFADRAYQLAKEAHSKS---------
>SRR6476661_2253354
------SKLHESAAEHHEHAARHHREAARLHEVKDVLAAVDQAHMATDHQVHAIHYATQAAKEYMA-------
>SRR3954469_21339197
------SKLHESAAKHHEHAARHHREAAKLHEVKDVLAAVDQAHMATDHQVHPIHYGTQAAKEYLA-------
>SRR6478736_2086344
------SRLHENAAKHHEHAVRQHREAARLHEVKDVLAAVDQAHMATDDQAHAIHYATQAAKEYLA-------
>SRR5271166_4700256
-----MHETHRKAAEQHELAAKAHRTASEHNEKGENETGNWHSERALEHSDRAYQLAKEAHNK----------
>ERR1700735_2757028
-----MHDTHRKAAEQHELAALAHRTASEHNEKGENETGNWHSKRALAYSDRAYELAKDAHNK----------
>SRR5579863_3543293
-----MHDAYRRAAEQHELAARAHRTAAEHNEKGDNETGNWHSGRALEYADRAYELAKQACNK----------
>ERR1022692_2141220
----TMHESHRRAAEEHELAAQAHRTAAEHRSEEHTSELQSP-------------------------------
>ERR1019366_59648
----AMHEGHREAAELHERAAHAHRTAAEHHEKGDSATAVWHAERALEYSDHAYK------------------
>SRR5579872_4326349
----TMHDTHRKAGEQHELAAKAHRTASEHNEKRENETGNWHAERELDYSDRAYQLAKEAHNKS---------
>ERR1035441_9666701
----TMHEAHRRAAELHELAAQAHRTAAEHNEKGDCPTSVWHSERALEYSDRAYKLAVEARNKS---------
>SRR5258705_10833459
----AMHETHRKAAEQHELAAHAHRTAAEHNERGDNDTGNWHADRALEYSDRAYKLAQEAHSKS---------
>SRR5271155_3255843
----LMHELHREAAEQHERAAQAHRTASEHNEKGDNPSGNWHAQRALQFSNRAYELAREAHNKS---------
>SRR5256885_13586733
----MKHEAHRRAAEQHELAAQAHRTAAEHNEKGDNETGSWHADRALEYSDRAYEPAKEAHSKS---------
>ERR1039457_4229862
-MTQNLEEHHSRAAQHFDSAAEHHRAAEKAYVTGDLKTSAYEAQCAMGHSVQANDHADLAAMAHLEHHGLN--
>tr|A0A226X0S3|A0A226X0S3_9BURK Uncharacterized protein OS=Caballeronia sordidicola OX=196367 GN=BSU04_21575 PE=4 SV=1
--SLHVAEHHEAAAELHEHAARYLRQATKHYEEGKVALAAHEAQAAHAIALCAIDHSNEAAKHHAIR------
>SRR5438552_9601204
------KDAHNTAAEHHEKAAKSHRTAAEHHGKSDNQAGHQHSTAALEHSTKAHEASKQAHEKSTQNKN----
>ERR1700686_3344577
----KMKDAHNKAAEHHESAAKSHRAAAESFGRNDNVKGKEHATQAQHNAQNANENSKTA-------------
>SRR5450759_2783561
------KDAHNTAAQHHENAAKSHRTAAEHHGKGDHEAGHKHSQEAYDHSTKAHEASKKAHE-----------
>ERR1035441_8005555
------KDQHNTAADHHEKAAKSHRAAAEHHGKGDHEAGHRHSGEAQEHSKNAHQHSQDAHA-----------
>SRR5450756_557274
------QDAHHKAAEHHENAAKAHRTAAEHHGKGDHEAGKKHSATALEHSGKAHEATQAAHE-----------
>ERR1700734_311097
------KSEHEEAATHHENAAKSHRSAAEHHGRGSHEEGRKHSTSAHEHSGKAHEASKKAH------------
>ERR1035437_280669
------KNEHQEAASHHENAAKSHRAAADHHGKGNHEEGKKHSAAAHEHSGKAQEASKTAH------------
>ERR1039457_5823228
----TMHEAHRKAAEKHELAAQAHRTAAEHNEKGDSTAADWHSERAMQYSDHAYKLAMEAHSK----------
>SRR5690242_15786718
----TMHETHHRAAEQHELTAHAHRTAAEHDEKGDTETGNWHAERGLAYSDRAYKLAMEAHTK----------
>SRR5580692_3629653
----KMHENareHRKAAELHQLAAQAHRTAAEHNEKGDEAAGSWHSQRALEYSDQAYKLAKKAHAK----------
>SRR5579872_2516938
----RVHESHQKAAEQHELAARAHRTAAEHNEKGDNPTGNWHSERALEYAEHAYRLAKDAHNTS---------
>SRR6185437_161601
----RVHETHQKAAEQHELAARAHRTAAEHNERGDNPTGNWHSERAFEYAEHAYRLAKDAHNRS---------
>ERR1019366_7611031
----IMQETHRQAAERHEMAARAHRTAAEHNEKGDNPSGNWHSERALEYAERAYKLAKDAHSKS---------
>ERR1700693_3320993
----NMHENHRKAAEQHELAARSHRTAAEHNEKGDFTAAVWHSERALQYSDQAYRLAKEAHNKS---------
>SRR5271163_4018991
----FMHELHREAAEQHELAARAHRTAAEHNEKGDNATGNWHSERALEYADRAYELAKKAHNKS---------
>SRR5580700_1525914
----FMHELHRQAAEQHEMASRAHRTAAEHNEKGDNETGNWHSERAMEHSENAYKLAKEAHQKS---------
>ERR1700674_4905784
----TMHELHREAAEQHKLAARAHRTAAEHNEKGDNPTGNWHATRALEYADQAYKLAKDAHNKS---------
>SRR5450755_1041639
----IMHEEHRQAAEQHELAAHAHRTAAEHHEKGDEKGGSWHSQRAMEFSERAYKLAKEAHSKS---------
>SRR3984957_13701284
-------HDHHKAAAHHDEAAKSHRKAAEAHEKGDHADASQHSQIANDHSAKAYEASQSAH------------
>ERR1700685_3318198
-------HDHHKAAAHHDEAAKSHRNAAEAHEKGDQADASQHSQLAHDHSTQAHEASQSAH------------
>ERR1700722_3266946
-------HDHHKAATHHDEAAKAHRDAAEAHEKGNQADATQHSQLANDHSAKANEASNIAH------------
>ERR1700722_20372013
-------HDHHKAAEHHDEAAKAHRSAAEAHEKGDHADASQHSQIANDHSAKANEASNVAF------------
>SRR3984885_15921837
-------HDHHKAAAHHDEAAKSHRNAADAHEKGNQADASQHSQIGNDHSAKAHEASQSAH------------
>ERR1700722_6238913
-------HDHHKEAEHHEEAAKAHRDAAEAHEKGNQADASQHSQLAYDHSTKAHEASQRAH------------
>ERR1700722_5003591
-------HDHHKAAAHHEEAAKAHRSAAEAHEKGEQADASQHSQIANDHSIKAQEASNAAH------------
>SRR5271169_5076590
-----MKDARNKAAESHEAAAKSHRAAAESHSKNDHAKGKEHSKQAQQHAQNANEHSKTANNKS---------
>SRR4051794_13470881
-----PKDAHTKAAEQHETAAKTHRAAAQQHGSNDHSKGKQQAADALQQSKAAHQHSDDAHGKS---------
>tr|A0A127EN64|A0A127EN64_9RHIZ Uncharacterized protein OS=Rhodoplanes sp. Z2-YC6860 GN=RHPLAN_12460 PE=4 SV=1
-----PKDAHIKAAEHHETGAKSHRAAAQQHGSNDHSKGKQQSSEALQHSKVAHQHSDEAHGKS---------
>SRR5271168_4727755
-----ARDAHNKAAQHHESAAKSHKTAAEHHGKGEHARGREESAKAYAHSKSAHEHSEMAH------------
>ERR1700728_2926415
-----ARDAHTKAAQHHENAAKRHKTPAEHHGEGEHARGREESAKAHSHSKTAHEHSEMAH------------
>SRR5580700_6974991
------HDSHRQAAELHELAAHAHRTAAEHNEKGDNETGNWHAERALEYSDRAYQLAKEAHAK----------
>SRR5579863_2279912
------HETHRSAAEFHELAAHAHRTAAEHNERGDNETGNWHAERALEYSNRAYELAKEAHNK----------
>SRR5580693_7061096
------HDTHQKIAELHELAAHAHRTAAEHNERGDNDTANWHAERALEYSDRAYQLAKDAHSK----------
>ERR1039458_7487882
------QSLHREAAEYHDLAVHAHRTAAEHNEKGDSEAGNWHLDRAREYSDQAFKLAQDVHCK----------
>ERR1039458_3425906
------QALHREAAEYHDLAAQAHRTAVEHNEKGDNETGNWHLDRAREYSDQAFKIAQDIQCK----------
>ERR1035441_3561326
------RSLHREAAEYHDLAAQAHRTAAEHNEKGDNETGNWHLDRPRECSYQAFKLAQDVHCK----------
>ERR1039458_6934656
------RSLHREAAEYHDLAAQPHRTAAEQIGRAHVXX-----------------------------------
>SRR5271165_3251650
----NMHEGHRLAAEQHELAARAHRTAAEHNEKGDGSAAIQHSERALEYSDRAYQLAKEAHNK----------
>SRR5579864_9452655
----KMHDAHRKAAEEHERAAHAHRTAAEHNEKGENEAGNWHSERALEYSDHAYELAKEAHSK----------
>ERR1700686_4490454
----TMHDLHRRAAEEHERAAHAHRTAAEHNEKGDDATGNWHSERALEYADRAHELAREAHTK----------
>tr|A0A1W2C3J3|A0A1W2C3J3_9BURK Uncharacterized protein OS=Polynucleobacter sp. VK13 OX=1938817 GN=SAMN06296008_11834 PE=4 SV=1
-------NYHDHAANHHEQAAKSHMEAARMRSLGNHEASANHALIAHGHALQALRYSEEAINEHAN-------
>tr|A0A1J0D7C3|A0A1J0D7C3_9BURK Uncharacterized protein OS=Polynucleobacter asymbioticus OX=576611 GN=A4F89_09430 PE=4 SV=1
-------HFHGKAANHHEQAMKSHLEAARMRELGNHEASATHALVAHAHTLKALQNSEDAINEHAN-------
>ERR1019366_9111953
-MSHHDHNRYRSAAEHHEHAANHYRRAETSGMAGDHIAAANHARTAHEHARQAAAFSGDADGGHDEHHGMK--
>ERR1035437_6137483
-MSHHDHERYRSAAEHHEHAANHYRRAETSGMAGDHIAAANHARTAHEHARQADAYAGEAAKSNDEHHGMN--
>ERR1035437_7818790
-MSHHDHNRYRSAAEHHEHAANHYRRAETSGMAGDHVAAANHARTAHEHARQAAAFSGEADESHDEHHGMN--
>SRR3979490_1267869
-------QAHTKAAEHHETAAKSHRAAAEQHGKNDHANGQEHSSQAQQHSKTAREHSETAHTKSS--------
>ERR1700752_190489
-------QAHTKAAEHHETAAKSHRAAAEQHGKNDHVKGHEHSSQVQLHSKSAREHSETAHGKSA--------
>ERR1700734_1853930
-----ARDEHNKAAEHHENAAKAHRSAAEHHGKGDHAKGKEHANVAKQHSQAANQHTEQAH------------
>SRR5882762_9456542
-----ARDEHNKAAEHHENAAKAHRSAAQHHGKGDHTKGKEHANVAKQHSQTANQHTDQAH------------
>ERR1700721_689871
---------PNGRAQ--ECA----SDAERSRGDGERDRGW----VRKSNRRAAVQHHGQ--------------
>ERR1700735_832010
-------QSHTKAADHHESAAKSHRAAAEHHGKNDHMKGNEHAAEAQKHSKVAGAASDEAHA-----------
>ERR1019366_4490127
------KDAHLKAAEHHENAAKTHRLAAEHHGKGDHAAGKKQSATALEHSGKAHEASQAAH------------
>ERR1039458_7026760
------KDAHLKAAEHHENAAKTHRLAAEHHGKGDHAAGKKQTATALEYSGKAHEASQAAH------------
>ERR1019366_5964583
------KDANLKAAEHHENAEKTNRLAAGHKEKEDHAGGKKQSETALEPSGKPQETSKAAH------------
>ERR1700729_1498231
-----MKDAHNKAAEHHESAAKSHRAAAEAHDRNDHAKGKEHSGQAQQHAQNANEQTKTAH------------
>ERR1700735_457524
-----MKDAHNTAAEHHESAAKSHRAAAAAHGSNDHAKGKEHSTQDQQHATNDEE------------------
>tr|G9ELI3|G9ELI3_9GAMM Uncharacterized protein OS=Legionella drancourtii LLAP12 OX=658187 GN=LDG_5982 PE=4 SV=1
----KLASYHADAAKHYEHAAKYHHEAQKHHLSGDHDKAALAAHKAQGHACCANGHAKKALKC----------
>ERR1019366_10459532
---QKLRDAHRKAAEQHELAAKAHRTAAEHNEKGEDEAGRWHSERALEYSDRAYKLAKEAHNKS---------
>ERR1039457_2594544
---KAMASEHGKAAEQRELAAHAHRTAADHNEKGENEAGSWHADRALEYSDHAYMLAKEAHNKS---------
>ERR1017187_2984396
-MTQNVEEHHGRAADHFDLAAEHHRAAEMASIAGDHKTAAHEAHCAHGHCVAATDHADLAAMGHVEQHDTH--
>ERR1700685_4321756
------KDAHTSAAYHHERAAKSHRAAAEQSNKGAHDACVEHAVTACGHSTKADEASKLAL------------
>SRR6185312_10729741
------RDAHTTAAYHHERAAKSHRAAAEQSNQGAHAVCAEHALTACGHSNKADEASKLAL------------
>ERR1700686_4148535
------RDAHTTAAYHHERAAKSHRAAAEQSSKGAHEACEQHAVTACAHSMKADEASKLAL------------
>SRR5579862_2273143
------KDAHTTAAYQHERAAKSHRAAAEQSNKGAHEACAQHAATACDHSTKADEASKAAL------------
>ERR1700690_2101855
------KEAHTTAAYHHERAAKSHRAAAEQSNQGAHAACEEHALTACGHSTKADEASKIAH------------
>SRR5208337_2096126
------NESHQRAAEFHELAAHAHRAAAAHHGKEDHQTGHEHSKQALEHARKAFEWSQEAHRKSAKAAG----
>ERR1700687_2569050
------NESHQRAAEFHELAAHAHRAAAAHHGKEDHLTGHELSKQALEHANKASQWSQEAHRKSAKAAG----
>ERR1700733_13128378
------RDSHQRAAEFHELAAHAHRAAAVHHGKEDHQPGHEHSKQALEHADKAFQASQEAHRKSAKSTG----
>ERR1700687_752579
------NESHQRAAEFHELAAHAHRAAAAHHGKEDHQAGHEHPKQALEYSNKASEWTQEAHRKSEKSME----
>SRR5258707_15042374
------NESHQKAAEFHELAAHAHRAAAAHHGKEDHQTGHDHSRQALEHATTAFQYSQEAHQKSEKAGI----
>SRR5713101_657853
------NDSHQRAAEFHDLAAHAHRVAAAHHGKEDHLTGHELARQAMEHSAKAHQATQEALQESAKLAK----
>SRR5713101_2834890
------NDSHQRAAEFHDLATHAHRVAAAHHGKEDHLTGHELARQAMEHSAKAHQATQEALQESAKLAK----
>SRR4029077_1283922
------NDSHQRAAEFHDLAAHAHRVAAAHHGKEDHLSGHELARKAMEHSAKAHQASEEALHQSAVFIK----
>SRR3984893_12462898
------EDSHRRAAEFHELAAHAHRVAAAHHDKEDHLTGHEHSKQAMEHSAKAHQSSQEALQKSVIFTE----
>SRR5271156_5420018
------NDSHQRAAEFHEQAAHAHRAAATSHGKGDHLSGHELSRQALENAHKAFQWSQ---------------
>SRR5678816_4305783
------QDSHRKAAEFHDMAAHAHRAAAVHHDKGDHKTGQQQSRKALEHATKAFELAQEAHRLSSAPKK----
>ERR1700675_2880414
-----MRDAHKKAAEQHELAARAHRTAAEHNEKGDNPTGKWHSERALEYADHAFELAKKAHNK----------
>SRR5450755_2771584
-----MHKLHREAAEQHELAAKEHRTAAEHNEKGDNPTGNWHTQRAVEYSNRAYELAKEAHNK----------
>SRR6266850_1864928
------NNDHNKAAELHENAAKSHRAAAEQHSKGDHAKGMEHSKSAQQHSQSANKQSDQAN------------
>ERR1700676_4116006
------KDAHNKAAEHHESAAKSHRAAAAAHGSNDHAKGKEYSTQAQQHAQNANEHSKTSQAKSAE-------
>SRR5258707_184143
------NQAHNRAAVFHENAARSHRIAAEHYANNDRAKGDEHAMQARAYSRSARDHSEQTHMK----------
>ERR1044071_6326665
------NQAHTKAAEHHETAAKAHRLAAEHHGKNDHAKGNEHSGYAQTHSKSAREHSEQAHTK----------
>SRR5213078_2364262
------NQAHTKAAEHHETAAKAHRLAAEHHVKNDHVKGNEHSAYAQTHSKSARDRSEQAHTK----------
>ERR1019366_4618907
-TKHPAIEHHHAAAAHHAAAAHHHLEAAHEHGQGKHEEAKQHSAAALEHSEQAHKHTVEAHKHS---------
>SRR5664279_4760768
-TKHPSVEQHHAAAGHHAAAAHHHLEAAHEQGQGKQEEAKQHSAAAHEHSE----------------------
>ERR1700679_12343
----IIHELHREAAEKHELAAHAHRTAAEHNEKGDQAAGDWHSQRAMEYSDHAYKLAKEAHTK----------
>ERR1019366_1160250
----ALHDAHRKAAEQHDMAAHAHRTAAEHNEKGDEDSGRWHAERALEYSDHAYKLAKEAHNK----------
>ERR1019366_1197723
----AVHEEHLRAAEQHERAAKAHRTAAEHNEKGNGAEESWHSQRALEYSDHAYRLAKEAHSK----------
>ERR1700688_5101470
----IMHDAHRKAAEQHELAARAHRTAAEHNEKGDHEGRDWHAARALEYSDNAYKLA----------------
>ERR1700676_867798
--EENVHDAHRKTAEQHELAAQAHRTAAEHNEKGENELGNWHLQRALEYSDHAYKLAQEAHSK----------
>ERR1700676_837832
--EENVHDAHRKTAEQHELAAQAHRTAAEHNEKGENELGNWHLQRALEYSDHAYKLAQESHSK----------
>SRR5580704_17448066
--EKKLHDAHRKAAEQHDLAAHAHRTAAEHNEKGENELGSWHLQRALEYSDHAYKLSQDAQTK----------
>ERR1700690_4019350
--ENAVHEEHRKAAEQHELAARAHRTAAEHNEKGENESGNWHAERALEYSDRAYTLAKEAHAK----------
>ERR1700678_3694437
--GNMMHDAHRKAAEQHELAAKAHRTAAEHNEKGENETGNWHSQRALEYSDHAYKLAKDAHTK----------
>SRR6202051_959840
--EKKLHDAHRKAAEQHYLSAHAHRTAPEHNQKGENELGNWHLQRALEYSDHAYKLAREAHSK----------
>SRR5579863_1142833
--RTTMHDFHRRAAEQHELAARAHRTAAEHNEKGENETGNWHAQRALEYSDRAYQLAQEAHTK----------
>ERR1700680_337363
--EENVHDAHREAAEQHELAAQAHRTAAEHNEKGDNAEGSWHSERALEYSNHAFKLAQEAHNK----------
>ERR1700689_1959300
--VTTMHDAHWKAAEQHELAARAHRTAAEHNEKGEDEAGRWHAERALEYSDHAYRLAKEAHTK----------
>SRR5271169_408185
--GNTMHDAHRKAAEQHELAARAHRTAAEHNEKGDNETGNWHLKRALEHSEHAYKLAKEAHDK----------
>SRR5580658_7139169
--ETPMQDAHRKAAEQHELAARAHRTAAEHNEKGDNEGGRWHAERALEYSDHAFRLAKEAHSK----------
>SRR6185369_2844766
---------------PNTMK-----K-GTTRRHAGIRNERWSSPIARI----SWPRQP---------------
>ERR1700722_12608701
--EENMYDTHRQAADQHELAAHAHRTAAEHNEKGKNELGNWHLQRALEYSDHAYKLAKEAHSK----------
>SRR3984957_3206025
---------HTPPSrRSARTCCARSSDGREHNEKGKNELGNWHLQRALEYSDHAYKLAKEAHSK----------
>GraSoiStandDraft_50_1057286.scaffolds.fasta_scaffold7233880_1 # 1 # 222 # 1 # ID=7233880_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.689
-----MSDTHRQAADQHELAAHAHRTAAEHNEKGKNDLVNWHLQRAAEYSDHAYKLAKKAHTI----------
>ERR1035438_3812566
------QALHREAAEYHDLAAQAHRTAAEHNEKGDNEAGNWHLDRARAYSDQDFKVAQDVHC-----------
>ERR1035441_5344174
------HDLHRKAAEYHELAAQAHRTAAEHNEKGDNETGNWHSKRALEYSNQAFKLAQEAHG-----------
>ERR1700675_786553
------HVLHRKAAEAHELAAKSHRTAAEHNEKGDNETGNWHSQRALDYSEHAYRLAKEAHP-----------
>ERR1700686_2254141
-----ADDSHQRAAELHEQAAHAHRAAAAHHGKEEHQTGQEHSKQAMEHSAKAYQQSLEADKQSayfATKHGKK--
>SRR3982074_2549457
-----ARDSHQRAAELHEQAAHAHRTAAAHHGKEDHQSGQEHSKQAMEHSAKAHEQSLEANKQSaffAKQHEKK--
>ERR1700734_1591532
--------DHHKAAAHHDEAAKSHRDAAVAHEEGDTERASQHSQIANDHSKKAQEASNAAHR-----------
>ERR1700722_3394704
----------QFRAALRGFESKSHRDAAAAHEEGDTEKASQHSQVANEHSKKAQEASNSAHQ-----------
tests/test_data/alignment/mgnify_hits.sto
0 → 100644
View file @
85d39c80
# STOCKHOLM 1.0
#=GF ID query-i1
#=GF AU jackhmmer (HMMER 3.3.2)
#=GS MGYP000406148242/1-68 DE [subseq from] PL=00 UP=0 BIOMES=0101000000000
#=GS MGYP000119383271/47-117 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000430010134/3-69 DE [subseq from] PL=00 UP=0 BIOMES=0000110000000
#=GS MGYP000184282189/1-71 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000372988949/3-70 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000222615028/3-68 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000384795733/25-88 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000680660046/4-73 DE [subseq from] PL=00 UP=0 BIOMES=0000110000000
#=GS MGYP000586297297/4-70 DE [subseq from] PL=00 UP=0 BIOMES=0000110000000
#=GS MGYP000526302968/5-69 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000081082088/4-68 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000172493671/1-71 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000694390052/2-70 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000246175980/4-68 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000358235060/4-70 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000635416234/5-68 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000656061151/3-65 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000718018739/4-64 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000234420019/4-70 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000689530757/1-71 DE [subseq from] PL=00 UP=0 BIOMES=0000100000000
#=GS MGYP000266820214/24-89 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000190165740/1-71 DE [subseq from] PL=00 UP=0 BIOMES=0000000000001
#=GS MGYP000589249599/4-69 DE [subseq from] PL=00 UP=0 BIOMES=0000110000000
#=GS MGYP000048618675/3-70 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000377290797/1-69 DE [subseq from] PL=00 UP=1 BIOMES=0110000000000
#=GS MGYP000697367932/3-70 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000747506700/4-68 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000255037255/6-64 DE [subseq from] PL=10 UP=0 BIOMES=0000101000000
#=GS MGYP000602985373/3-68 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000420186793/4-69 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000452617499/5-64 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000119404247/1-68 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000134149386/3-60 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000461455637/26-91 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000119389418/96-161 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000546988737/26-93 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000624371167/1-68 DE [subseq from] PL=00 UP=0 BIOMES=0101000000000
#=GS MGYP000650157322/5-70 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000246214200/7-73 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000113479303/34-96 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000187226991/3-69 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000381848663/3-69 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000066325489/28-89 DE [subseq from] PL=00 UP=0 BIOMES=0000000000001
#=GS MGYP000013251582/4-69 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000499794189/19-84 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000555816272/4-69 DE [subseq from] PL=00 UP=0 BIOMES=0000000000001
#=GS MGYP000653248377/3-70 DE [subseq from] PL=00 UP=0 BIOMES=0110000000000
#=GS MGYP000113511630/3-70 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP001057101778/4-69 DE [subseq from] PL=00 UP=0 BIOMES=1000000000000
#=GS MGYP000210824545/3-69 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000676742083/9-64 DE [subseq from] PL=10 UP=0 BIOMES=0000101000000
#=GS MGYP000545010933/4-70 DE [subseq from] PL=00 UP=0 BIOMES=0000110000000
#=GS MGYP000541064880/3-68 DE [subseq from] PL=00 UP=0 BIOMES=0000000000001
#=GS MGYP000541064880/99-161 DE [subseq from] PL=00 UP=0 BIOMES=0000000000001
#=GS MGYP000729801087/3-52 DE [subseq from] PL=10 UP=0 BIOMES=0000101000000
#=GS MGYP000715079888/40-96 DE [subseq from] PL=10 UP=0 BIOMES=0000101000000
#=GS MGYP000033872322/3-43 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000464421157/4-69 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
query MAAHKGAEHHHK-AAEHHEQAAKHHHAAAEHHEKGE-HEQAAHHADTAYAHHKHAEEHAAQAAKHD-AEHHAPKPH
MGYP000406148242/1-68 MATHKGAESHKK-AAEHHTTAAKHHTEAAKSHESGN-HEKAAHHAHTATAHGKHASDHSDDAAKTY-ASEH-----
#=GR MGYP000406148242/1-68 PP 899*********.***********************.***************************98.8877.....
MGYP000119383271/47-117 MATHKGTEHHKK-AAEHHELAAKHHREAAKLHEAGS-HEKAAHHAQIAAGHGLHAVYHTEEATKHH-ADEHTGK--
#=GR MGYP000119383271/47-117 PP 899*********.***********************.*****************************.**99866..
MGYP000430010134/3-69 ---KKAAEHHRK-AAEHHQNAAKHHNAAAESHEAGN-HEKAAHHAHTAHGHHTQAGEHGGEAAKAH-RDEHGQ---
#=GR MGYP000430010134/3-69 PP ...699******.***********************.***************************88.877765...
MGYP000184282189/1-71 MPKHEGAEHHKK-AAEHHEKAAQHHKEAAKHHEEGR-HETAGHHAYVAHGHHLTAIQHSEEAAKYH-SQQHGEK--
#=GR MGYP000184282189/1-71 PP 568*********.***********************.****************************9.9999876..
MGYP000372988949/3-70 ---KKAAEHHLK-AAEHHEHAARHHKEAAKHHQAGS-YEKAAHHAHTARAHAEHADEHAVEAAKAH-AEEHGSK--
#=GR MGYP000372988949/3-70 PP ...699******.***********************.*****************************.**99865..
MGYP000222615028/3-68 ---KKAVEHHHK-AAEHHEHAARHHKEAAKHHEAGK-HETAAHHAHLARGHHEHAMHHAAEAAKAH-VEDHG----
#=GR MGYP000222615028/3-68 PP ...6899*****.***********************.***************************99.99986....
MGYP000384795733/25-88 ----SGSQQHDA-AAQHYEEAARHHRQAAKHYQASR-HEKAAHHAQLGYAHHLYAEQHAAEAAKAH-AKNH-----
#=GR MGYP000384795733/25-88 PP ....6999****.***********************.***************************99.9998.....
MGYP000680660046/4-73 -STHKGAEHHKE-AAAHHKKAAEHHLAAAEHHEAGD-HEKAGHHAHVAHGHHLNAVHHAEEAGKHHGAEHSGP---
#=GR MGYP000680660046/4-73 PP .57*********.***********************.**************************9752788777...
MGYP000586297297/4-70 ----QAAEHHQK-AAEHHEHAARHHREAAAHHEEGN-HETAAHHAHTAQGHLHHATHHASEAAKHH-VEHHGNK--
#=GR MGYP000586297297/4-70 PP ....689*****.***********************.*****************************.****977..
MGYP000526302968/5-69 -----REEHHLK-AAEHHEHAAKHHLAAAEHHAGGD-HEKAGHHAHVAHGHSTHAEHHAEEASKHT-ANHDAA---
#=GR MGYP000526302968/5-69 PP .....469999*.***********************.*****************************.***985...
MGYP000081082088/4-68 ----QAAEHHHK-AAEHHEHAARHHKEAAKHHEAGK-HETAAHHAHLARGHHEHAMHHAAEAAKAH-IQDHG----
#=GR MGYP000081082088/4-68 PP ....689*****.***********************.**************************977.66664....
MGYP000172493671/1-71 MTKHEGAEHHKQ-AAQQHQDAARHHLEAAKHHEAGA-HEKAGHHAHIAYGHHLQATHHAEEAAKHH-AMQHGDK--
#=GR MGYP000172493671/1-71 PP 678*********.***********************.*****************************.*999876..
MGYP000694390052/2-70 --SHAAAEHHKK-AAEHHEHAARHHQEAAKHHEAGN-HEKAAHHAHVAHGHHVHAVEHAEHAAKHH-AETHGAK--
#=GR MGYP000694390052/2-70 PP ..699*******.***********************.*****************************.**99865..
MGYP000246175980/4-68 ----QAAEHHHK-AAEHHEHAARHHKEAAKHHEAGK-HETAAHHAHLARGHHVHAMHHAGEAAKAH-IEDHG----
#=GR MGYP000246175980/4-68 PP ....689*****.***********************.***************************88.88885....
MGYP000358235060/4-70 ----QAAEHHGK-AAEHHEHAARHHREAANHHEAGD-HQQAAHHAHTAQGHLHHATHHSAEAAKLH-VEHHGHK--
#=GR MGYP000358235060/4-70 PP ....689*****.***********************.*****************************.****877..
MGYP000635416234/5-68 -----VADHHHK-AAEHHERAAKHHREAATHYESDR-HETAAHHAHMAHGHHQHAVHHASEAAKAH-IEHHD----
#=GR MGYP000635416234/5-68 PP .....489****.***********************.*****************************.****6....
MGYP000656061151/3-65 ---KKAAEHHRK-AAEHHEHAARHHKEAAKHHDAGA-HEKAAHHAHTAHAHHLHATHFADEAAKAH-AD-------
#=GR MGYP000656061151/3-65 PP ...699******.***********************.**************************977.75.......
MGYP000718018739/4-64 -----GAKHHNA-AAQHYEEAARHHRKAAELYQCGH-HEKVSHHANLASGHPLHAKQHAEEAAKAL-IE-------
#=GR MGYP000718018739/4-64 PP .....99*****.***********************.**************************976.55.......
MGYP000234420019/4-70 ----AAAEHHRK-AAEHHEHAARHHEEAAEHHESGA-HETAAHHAHSAQGHTHHALYHASEAAKEH-AEHHGDK--
#=GR MGYP000234420019/4-70 PP ....479*****.***********************.*****************************.****875..
MGYP000689530757/1-71 MPTHTGAEHHRK-AAEHHQLAAKHHLEAAKLHDAGS-HEKAAHHSEIAAGHGHHAVYHTEEATKQH-ADMNAEK--
#=GR MGYP000689530757/1-71 PP 578*********.***********************.****************************9.9999877..
MGYP000266820214/24-89 ---KKAAEHHLK-AAEHHEHAARHHKEAAKHHQAGS-HEKAAHHAHTARAHEEHAEFHSAEAAKAH-GQEHG----
#=GR MGYP000266820214/24-89 PP ...699******.***********************.**************************977.77775....
MGYP000190165740/1-71 MARHEGAEHHKQ-AAEHHQHAARHHLEAAKHHEAGA-HEKAGHHAHIAQGHHLHAIHHAEEAAKHH-AAQHGDK--
#=GR MGYP000190165740/1-71 PP 799*********.***********************.*****************************.*999876..
MGYP000589249599/4-69 ----QAAEHHTK-AAEHHQHAARHHLEAAKHHEAGR-HEAAGHHAHLAHGHHQHATHHASEAAKSH-IEHHGK---
#=GR MGYP000589249599/4-69 PP ....689*****.***********************.*****************************.****75...
MGYP000048618675/3-70 ---KKASEHHRK-AAEHHKLAATHHEEAAAHYDKGN-HEKAAHHAHVAHGHTLHATHYAAEAAKMH-VEEHGSK--
#=GR MGYP000048618675/3-70 PP ...6899*****.***********************.***************************99.9999866..
MGYP000377290797/1-69 MSDHAGVEHYHK-AAEHHEHAARHHREAAKHHEEGN-HEKAAHHAHSAHGHASHAQHHHTEASRHH-AEHHG----
#=GR MGYP000377290797/1-69 PP 678*********.***********************.*****************************.****7....
MGYP000697367932/3-70 ---KKASEHHRK-AAEHHKLAATHHEEAAAHHDKGN-YEKAAHHAHVAHGHTHHATYHAAEAAKIH-AEDYGSK--
#=GR MGYP000697367932/3-70 PP ...6899*****.***********************.***************************99.9988765..
MGYP000747506700/4-68 ----QAAEHHHK-AAEHHEHAALHHKEAAKHHEAGK-HEMAAHHAHLARAHHEHAMHHAVEAVKAH-LQDHG----
#=GR MGYP000747506700/4-68 PP ....689*****.***********************.**************************977.76664....
MGYP000255037255/6-64 ---SKIAEHHTK-AAEHHETAAQHHREAAKHHEAGS-IEKAAHHAQVAYGHGAHAWNYQEEAAK------------
#=GR MGYP000255037255/6-64 PP ...5789*****.***********************.******************999999998............
MGYP000602985373/3-68 ---KKAVEHHNK-AAEHHEHAARHHKEAAKHHEAGK-HETAGHHAHLARGHQEHAMHHSAEAAKAH-IEDHS----
#=GR MGYP000602985373/3-68 PP ...6899*****.***********************.***************************99.98886....
MGYP000420186793/4-69 ----QAAEHHLK-AAEHHEHAAHHHKEAAKHHQGGS-HEKAAHHAHTARGHHEHAQHHAAEAAKAH-AQEHGN---
#=GR MGYP000420186793/4-69 PP ....689*****.***********************.***************************99.999975...
MGYP000452617499/5-64 -----AAAHHLK-AVEHHEHAARHHREAAKHHEAGN-HEKAAHHAHLAHGHHLHATEYAGEAAKAH-I--------
#=GR MGYP000452617499/5-64 PP .....678999*.***********************.**************************965.5........
MGYP000119404247/1-68 MAGHKIHEHHEK-AADHHEHAAKHHREAAKHHKAGD-HEKAAHHSKVAHGHHLHATEHHDEASKKH-AEDH-----
#=GR MGYP000119404247/1-68 PP 799*********.***********************.***************************99.9998.....
MGYP000134149386/3-60 ---KKATEHHRK-AAEHHEHAARHHKEAAKHHEAGK-HETAAHHAHLARGHQERAAQQAAEAA-------------
#=GR MGYP000134149386/3-60 PP ...6899*****.***********************.***********************998.............
MGYP000461455637/26-91 -----AAKHHDL-AAQHYEEAARHHREAAQDYQSGR-HEKASHHAHLAYAHHLHAEQHAEEAAKAH-IKNHLDD--
#=GR MGYP000461455637/26-91 PP .....589****.***********************.***************************99.9999765..
MGYP000119389418/96-161 ---KQAAEHHRK-AAEHHEHAARHHKEAAKHHEAGK-HETAAHHAHLARAHHEVATHHAVEAAKAH-LEEHG----
#=GR MGYP000119389418/96-161 PP ...5689*****.***********************.***************************88.88775....
MGYP000546988737/26-93 ---EKAAEHHEK-AAEHNERAAQHHREAAKHHEEGH-HETAGHHAQIAHGHHLNATHHSEEAAKHH-AQQHGEK--
#=GR MGYP000546988737/26-93 PP ...589******.***********************.*****************************.****876..
MGYP000624371167/1-68 MAKHPGADYHRM-AAEHHEKAALHHKKAAEYYEAGN-LKKAAIHAELAAVFHKQADEHVYNKQEEI-DVHH-----
#=GR MGYP000624371167/1-68 PP 799*********.***********************.*********************98877665.5566.....
MGYP000650157322/5-70 -----ATEHHRR-AAEHHEHSAKHHKAVADHHEAGN-HEKAGHHASVAEGHLNHASHHAEEASKHH-AADHGHK--
#=GR MGYP000650157322/5-70 PP .....579****.***********************.*****************************.9999765..
MGYP000246214200/7-73 ----KIAEHHAQ-AAQHHEKAAEHHKEAAKHYGTGA-VEKGAHHAQVAQGHAVHAEYHADEAAKAH-AEHHAGK--
#=GR MGYP000246214200/7-73 PP ....779*****.***********************.*****************************.****976..
MGYP000113479303/34-96 --NHKGIENHRK-AAKHHEEAAKHHHDAAKHHEAGN-HDKACESTVKAHGHHCLASDHMREVSKQH-A--------
#=GR MGYP000113479303/34-96 PP ..5*********.***********************.**********************9999875.5........
MGYP000187226991/3-69 ---KKAADHHKQ-AAEHHTHAAKHHTEAARHHESGN-HEKAAHHAHSSRAHASQADDHAEQAAKAH-MDEHGK---
#=GR MGYP000187226991/3-69 PP ...689******.***********************.***************************88.888865...
MGYP000381848663/3-69 ---KKAAEHHHK-ASEHHTHAARHHSEAAKHHEGGH-HEKAAHHAHTARAHALHSRHHSDEAAKMH-GEEHGK---
#=GR MGYP000381848663/3-69 PP ...699******.***********************.***************************99.999876...
MGYP000066325489/28-89 ----KTIANHKQ-AARHHMEAAKHHMEAARHHEEGN-HEKAAHSTLLAYGHHTIAGEFVSDDAKHH-AQ-------
#=GR MGYP000066325489/28-89 PP ....56678999.***********************.********************999999988.75.......
MGYP000013251582/4-69 ----EAANHHKQ-AAEHHEHAARHHHEAAKHHLAGN-HEKAAHHAHLAHGHHVHATEHAENAAKEH-VKAHGA---
#=GR MGYP000013251582/4-69 PP ....57889999.***********************.***************************99.888865...
MGYP000499794189/19-84 ---NDAAEHHRK-AAEHHEHAAAHHREAAEHHANGN-HEKAAHHAHIAHGHGLHAAHHAGEATKHH-ANTHG----
#=GR MGYP000499794189/19-84 PP ...5689*****.***********************.*****************************.*9986....
MGYP000555816272/4-69 -----EAAHHHKQAAEHHEHAARHHHEAAKHHEAGN-HEKAAHHAHLAHAHHVLAAEHAENAAKEH-LKAHGT---
#=GR MGYP000555816272/4-69 PP .....4555554399*********************.***************************99.888865...
MGYP000653248377/3-70 ---KKAAEHHKK-ASEHLTHAARHHGEAAKHHEAGS-HEKAAHHAHTARAHIIHGRGHAEEAVKAH-AEEHGKK--
#=GR MGYP000653248377/3-70 PP ...699******.***********************.*****************************.**99865..
MGYP000113511630/3-70 ---KKAAEHHRK-AAEHHKHAAGHHEEAAAHHDKGN-HEKAAHHAHVAHGHTLHAAHHAEEAAKAH-VEEHGSK--
#=GR MGYP000113511630/3-70 PP ...699******.***********************.***************************99.9999866..
MGYP001057101778/4-69 ---DKIIEHHRS-AADHHEKAAQHHREAAKHHASDS-HEKAAHHAHSAHGHSAHATHHAGEASKHH-AEHHG----
#=GR MGYP001057101778/4-69 PP ...5678*****.***********************.*****************************.****6....
MGYP000210824545/3-69 ---KKAAESHKK-ASEHLTHAARHHTEAAKHHETGQ-HEKAAHHAHIARAHATHAREHSENAAKAH-LEEHGK---
#=GR MGYP000210824545/3-69 PP ...689******.***********************.***************************99.999976...
MGYP000676742083/9-64 ------RDEHNK-AAEHHENAAKAHRSAAEHHGKGD-HAKGKQHADTAKQHSQTAHQHTDQAHS------------
#=GR MGYP000676742083/9-64 PP ......5789**.***********************.**********************99854............
MGYP000545010933/4-70 --KHPSTEHHTS-AAEEHDNASRHHRAAAKNYEEGK-HETAAHHAHSASGHSSNARDQAEEASRKH-AKQHG----
#=GR MGYP000545010933/4-70 PP ..58999*****.***********************.*************************9888.88775....
MGYP000541064880/3-68 -AEHNAAEHHGF-AAHHHQRAAQFHREASRHYEAGKDYAHAAHQALVAHGHALLAIDHGNEAGKYY-AG-------
#=GR MGYP000541064880/3-68 PP .789********.*********************963789***********************997.64.......
MGYP000541064880/99-161 ------SEHHAA-AADDHEQAAQHHAQAAKHLNEKD-YELAAHEAQLAHRHAHYSIFHDDEAAKHH-VEHYG----
#=GR MGYP000541064880/99-161 PP ......69****.***********************.**************999************.***86....
MGYP000729801087/3-52 ---KKVAEHHLK-AAEHLEHAARHHKEAAKHHEAGN-HEKAAHHAHIARAHHEHA---------------------
#=GR MGYP000729801087/3-52 PP ...5889*****.***********************.*****************7.....................
MGYP000715079888/40-96 -----SAEYHKK-AANCHYEAAKHHNIAAKHHEAGN-HKKASEYALKAYWYHCLASEAEKEDVK------------
#=GR MGYP000715079888/40-96 PP .....69*****.***********************.***************998876655555............
MGYP000033872322/3-43 ---KKAAEHHRK-AAEHHEHAARHHKEAAKHHDAGA-HEKAAHHAH------------------------------
#=GR MGYP000033872322/3-43 PP ...699******.***********************.*******96..............................
MGYP000464421157/4-69 ----EAAEHHKH-AAEHLTHAARHHSEAAKHHEAGQ-HEKAAHHAHLAHGHQEHASEHAVEAAKKH-IEAHGN---
#=GR MGYP000464421157/4-69 PP ....689*****.***********************.***************************99.999875...
#=GC PP_cons 7887889*****.***********************.**************************999.9998766..
#=GC RF xxxxxxxxxxxx.xxxxxxxxxxxxxxxxxxxxxxx.xxxxxxxxxxxxxxxxxxxxxxxxxxxxx.xxxxxxxxx
//
tests/test_data/alignment/pdb70_hits.hhr
0 → 100644
View file @
85d39c80
Query query
Match_columns 73
No_of_seqs 55 out of 57
Neff 2.88591
Searched_HMMs 80799
Date Thu Dec 30 19:40:02 2021
Command /home/ga122/openfold/lib/conda/envs/openfold_venv/bin/hhsearch -i /tmp/tmpedq9nsbw/query.a3m -o /tmp/tmpedq9nsbw/output.hhr -maxseq 1000000 -d /data/ga122/alphafold/pdb70/pdb70
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 1HF9_B ATPASE INHIBITOR (MITOC 7.5 3.8E+02 0.0047 16.2 0.0 22 7-28 10-31 (41)
2 2CRB_A nuclear receptor bindin 6.4 4.7E+02 0.0058 18.0 0.0 20 11-30 32-51 (97)
3 4ZEY_A nuclear receptor bindin 6.3 4.7E+02 0.0059 17.3 0.0 20 11-30 26-45 (84)
4 3U8V_A Metal-binding protein s 4.1 8.1E+02 0.01 17.3 0.0 32 15-46 50-81 (93)
5 1PSM_A SPAM-H1 (RESIDUES 90 - 1.9 2.1E+03 0.026 13.4 0.0 18 11-28 14-31 (38)
6 5KC1_F Autophagy-related prote 1.5 2.7E+03 0.033 16.9 0.0 17 12-28 25-41 (226)
7 5KC1_J Autophagy-related prote 1.5 2.7E+03 0.033 16.9 0.0 17 12-28 25-41 (226)
8 3ZEE_A PARTITIONING DEFECTIVE 1.1 3.8E+03 0.046 12.5 0.0 15 58-72 30-44 (84)
9 4I6P_A Partitioning defective 1.0 4.3E+03 0.054 12.4 0.0 16 57-72 32-47 (88)
10 2Q2K_A Hypothetical protein/DN 1.0 4.3E+03 0.054 13.4 0.0 17 56-72 54-70 (70)
No 1
>1HF9_B ATPASE INHIBITOR (MITOCHONDRIAL); ATPASE INHIBITOR, F1 ATPASE INHIBITOR; NMR {BOS TAURUS} SCOP: h.4.8.1
Probab=7.51 E-value=3.8e+02 Score=16.19 Aligned_cols=22 Identities=23% Similarity=0.398 Sum_probs=13.1 Template_Neff=4.500
Q query 7 AEHHHKAAEHHEQAAKHHHAAA 28 (73)
Q Consensus 7 aEhH~kAAeHHe~AA~HH~eAA 28 (73)
.++|++-++.|+.|.+-|++-.
T Consensus 10 I~~He~qIk~heeaI~RHk~~i 31 (41)
T 1HF9_B 10 ISHHAKEIERLQKEIERHKQSI 31 (41)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3456666666666666665543
No 2
>2CRB_A nuclear receptor binding factor 2; NRBF-2, MIT domain, helix bundle; NMR {Mus musculus} SCOP: a.7.16.1
Probab=6.35 E-value=4.7e+02 Score=18.01 Aligned_cols=20 Identities=20% Similarity=0.165 Sum_probs=11.0 Template_Neff=5.700
Q query 11 HKAAEHHEQAAKHHHAAAEH 30 (73)
Q Consensus 11 ~kAAeHHe~AA~HH~eAAkH 30 (73)
..|++-|.+|++...+|.+-
T Consensus 32 ~~Aie~H~kAA~~f~~A~~~ 51 (97)
T 2CRB_A 32 EEAISCHRKATTYLSEAMKL 51 (97)
T ss_dssp HHHHHHHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHHHHHHHHHHHc
Confidence 34555556666555555543
No 3
>4ZEY_A nuclear receptor binding factor 2; Structural Genomics, Joint Center for; HET: SO4, MSE; 1.5A {Homo sapiens}
Probab=6.31 E-value=4.7e+02 Score=17.29 Aligned_cols=20 Identities=25% Similarity=0.200 Sum_probs=10.8 Template_Neff=6.200
Q query 11 HKAAEHHEQAAKHHHAAAEH 30 (73)
Q Consensus 11 ~kAAeHHe~AA~HH~eAAkH 30 (73)
..|++-|.+|+....+|.+-
T Consensus 26 ~~A~e~H~~AA~~f~~A~~~ 45 (84)
T 4ZEY_A 26 EEAISCHKKAAAYLSEAMKL 45 (84)
T ss_dssp HHHHHHHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHHHHHHHHHHHh
Confidence 44555555555555555543
No 4
>3U8V_A Metal-binding protein smbP; four helical bundle, metal chaperone; 1.9A {Nitrosomonas europaea}
Probab=4.11 E-value=8.1e+02 Score=17.29 Aligned_cols=32 Identities=34% Similarity=0.433 Sum_probs=15.8 Template_Neff=4.400
Q query 15 EHHEQAAKHHHAAAEHHEKGEHEQAAHHADTA 46 (73)
Q Consensus 15 eHHe~AA~HH~eAAkHheaG~HekAahhAh~A 46 (73)
+|-..+.++-.+|.++-..|+-+.|..++-.|
T Consensus 50 ~H~~~aik~LeeAI~hgk~ghad~A~kha~~A 81 (93)
T 3U8V_A 50 THVGHGIKHLEDAIKHGEEGHVGVATKHAQEA 81 (93)
T ss_dssp CHHHHHHHHHHHHHHHHHTTCHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHcCcHHHHHHHHHHH
Confidence 34444455555555555555555544444433
No 5
>1PSM_A SPAM-H1 (RESIDUES 90 - 127; POLYMORPHIC ANTIGEN; NMR {Plasmodium falciparum} SCOP: j.18.1.1
Probab=1.89 E-value=2.1e+03 Score=13.41 Aligned_cols=18 Identities=39% Similarity=0.433 Sum_probs=8.0 Template_Neff=1.300
Q query 11 HKAAEHHEQAAKHHHAAA 28 (73)
Q Consensus 11 ~kAAeHHe~AA~HH~eAA 28 (73)
.+|++--|+|++.=.+|+
T Consensus 14 e~aa~dae~a~k~ae~a~ 31 (38)
T 1PSM_A 14 EQAAKDAENASKEAEEAA 31 (38)
T ss_dssp HSTTTTTTHHHHHTTTTT
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 444444444444444443
No 6
>5KC1_F Autophagy-related protein 38; Atg38, coiled-coil, dimerization, NRBF2, autophagy; HET: NO3, NH4, EDO, NA; 2.2A {Saccharomyces cerevisiae}
Probab=1.52 E-value=2.7e+03 Score=16.87 Aligned_cols=17 Identities=12% Similarity=0.040 Sum_probs=0.0 Template_Neff=5.100
Q query 12 KAAEHHEQAAKHHHAAA 28 (73)
Q Consensus 12 kAAeHHe~AA~HH~eAA 28 (73)
.|++-|.+|++.-.+|.
T Consensus 25 eAie~h~kAAe~l~~a~ 41 (226)
T 5KC1_F 25 NAKAKYQEAIEVLGPQN 41 (226)
T ss_dssp -----------------
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34444444444444443
No 7
>5KC1_J Autophagy-related protein 38; Atg38, coiled-coil, dimerization, NRBF2, autophagy; HET: NA, NO3, EDO, NH4; 2.2A {Saccharomyces cerevisiae}
Probab=1.52 E-value=2.7e+03 Score=16.87 Aligned_cols=17 Identities=12% Similarity=0.040 Sum_probs=0.0 Template_Neff=5.100
Q query 12 KAAEHHEQAAKHHHAAA 28 (73)
Q Consensus 12 kAAeHHe~AA~HH~eAA 28 (73)
.|++-|.+|++.-.+|.
T Consensus 25 eAie~h~kAAe~l~~a~ 41 (226)
T 5KC1_J 25 NAKAKYQEAIEVLGPQN 41 (226)
T ss_dssp -----------------
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34444444444444443
No 8
>3ZEE_A PARTITIONING DEFECTIVE 3 HOMOLOG; CELL CYCLE; 6.1A {RATTUS NORVEGICUS}
Probab=1.14 E-value=3.8e+03 Score=12.49 Aligned_cols=15 Identities=27% Similarity=0.237 Sum_probs=7.4 Template_Neff=7.600
Q query 58 AQAAKHDAEHHAPKP 72 (73)
Q Consensus 58 ~eAak~ha~~H~~kp 72 (73)
.+|.+.|....+.+|
T Consensus 30 ~~a~~Ry~~~~~~~~ 44 (84)
T 3ZEE_A 30 QQAVTRYRKAVAKDP 44 (84)
T ss_dssp HHHHHHHHHHHCSSS
T ss_pred HHHHHHHHHHcCCCc
Confidence 455555555544433
No 9
>4I6P_A Partitioning defective 3 homolog; PB1 like motif, DUF3534, Cell; 2.9A {Rattus norvegicus}
Probab=1.01 E-value=4.3e+03 Score=12.37 Aligned_cols=16 Identities=25% Similarity=0.206 Sum_probs=0.0 Template_Neff=7.500
Q query 57 AAQAAKHDAEHHAPKP 72 (73)
Q Consensus 57 a~eAak~ha~~H~~kp 72 (73)
+.+|.+.|....+.+|
T Consensus 32 ~~~a~~Ry~~~~~~~~ 47 (88)
T 4I6P_A 32 IQQAVTRYRKAVAKDP 47 (88)
T ss_dssp HHHHHHHHHHHHCCCT
T ss_pred HHHHHHHHHHHcCCCc
No 10
>2Q2K_A Hypothetical protein/DNA Complex; protein-DNA, partition, segregation, parB, DNA; HET: EPE; 3.0A {Staphylococcus aureus}
Probab=1.01 E-value=4.3e+03 Score=13.41 Aligned_cols=17 Identities=24% Similarity=0.343 Sum_probs=0.0 Template_Neff=1.100
Q query 56 HAAQAAKHDAEHHAPKP 72 (73)
Q Consensus 56 Ha~eAak~ha~~H~~kp 72 (73)
|-.||-+.|.++-|..|
T Consensus 54 hireal~ryiee~g~~p 70 (70)
T 2Q2K_A 54 HIREALRRYIEEIGENP 70 (70)
T ss_dssp HHHHHHHHHHHHCCHHC
T ss_pred HHHHHHHHHHHHHCCCC
tests/test_data/alignment/uniref90_hits.sto
0 → 100644
View file @
85d39c80
# STOCKHOLM 1.0
#=GF ID query-i1
#=GF AU jackhmmer (HMMER 3.3.2)
#=GS UniRef90_D7BIZ4/1-73 DE [subseq from] Uncharacterized protein n=1 Tax=Meiothermus silvanus (strain ATCC 700542 / DSM 9946 / VI-R2) TaxID=526227 RepID=D7BIZ4_MEISD
#=GS UniRef90_A0A345WS72/1-69 DE [subseq from] Uncharacterized protein n=1 Tax=Sphingomonas sp. FARSPH TaxID=2219696 RepID=A0A345WS72_9SPHN
#=GS UniRef90_A0A1F2V377/4-68 DE [subseq from] Uncharacterized protein n=1 Tax=Acidobacteria bacterium RIFCSPLOWO2_12_FULL_60_22 TaxID=1797188 RepID=A0A1F2V377_9BACT
#=GS UniRef90_A0A3C0R222/4-69 DE [subseq from] Alpha-carbonic anhydrase domain-containing protein n=1 Tax=Spartobacteria bacterium TaxID=2052183 RepID=A0A3C0R222_9BACT
#=GS UniRef90_A0A3G2VJ28/2-67 DE [subseq from] Uncharacterized protein n=1 Tax=Methylobacterium brachiatum TaxID=269660 RepID=A0A3G2VJ28_9RHIZ
#=GS UniRef90_A0A317IC02/3-67 DE [subseq from] Uncharacterized protein n=1 Tax=Candidatus Melainabacteria bacterium TaxID=2052166 RepID=A0A317IC02_9BACT
#=GS UniRef90_A0A4P6K0I8/4-69 DE [subseq from] Uncharacterized protein n=1 Tax=Ktedonosporobacter rubrisoli TaxID=2509675 RepID=A0A4P6K0I8_9CHLR
#=GS UniRef90_A0A142HH28/3-70 DE [subseq from] Uncharacterized protein n=1 Tax=Hymenobacter sp. PAMC 26554 TaxID=1484116 RepID=A0A142HH28_9BACT
#=GS UniRef90_A0A402A866/4-68 DE [subseq from] Uncharacterized protein n=1 Tax=Tengunoibacter tsumagoiensis TaxID=2014871 RepID=A0A402A866_9CHLR
#=GS UniRef90_UPI00131BC0F4/3-69 DE [subseq from] hypothetical protein n=1 Tax=Acidisphaera sp. S103 TaxID=1747223 RepID=UPI00131BC0F4
#=GS UniRef90_A0A5E6MFW5/5-71 DE [subseq from] Uncharacterized protein n=1 Tax=Methylacidimicrobium tartarophylax TaxID=1041768 RepID=A0A5E6MFW5_9BACT
#=GS UniRef90_A0A6M1MC51/1-69 DE [subseq from] Uncharacterized protein n=1 Tax=Methylobacterium sp. DB0501 TaxID=2709665 RepID=A0A6M1MC51_9RHIZ
#=GS UniRef90_A0A368HF25/2-66 DE [subseq from] Uncharacterized protein n=1 Tax=Acidiferrobacter thiooxydans TaxID=163359 RepID=A0A368HF25_9GAMM
#=GS UniRef90_A0A2N3PRK8/17-83 DE [subseq from] Uncharacterized protein n=1 Tax=Telmatospirillum siberiense TaxID=382514 RepID=A0A2N3PRK8_9PROT
#=GS UniRef90_A0A2N3PRK8/115-180 DE [subseq from] Uncharacterized protein n=1 Tax=Telmatospirillum siberiense TaxID=382514 RepID=A0A2N3PRK8_9PROT
#=GS UniRef90_A0A7Y3P168/15-76 DE [subseq from] Uncharacterized protein n=1 Tax=Bacteroidia bacterium TaxID=2044936 RepID=A0A7Y3P168_9BACT
#=GS UniRef90_A0A4R8DP52/4-70 DE [subseq from] Uncharacterized protein n=1 Tax=Dinghuibacter silviterrae TaxID=1539049 RepID=A0A4R8DP52_9BACT
#=GS UniRef90_A0A1I4D138/7-73 DE [subseq from] Uncharacterized protein n=1 Tax=Methylocapsa palsarum TaxID=1612308 RepID=A0A1I4D138_9RHIZ
#=GS UniRef90_UPI0011BDFA18/9-74 DE [subseq from] hypothetical protein n=1 Tax=Adhaeribacter aerolatus TaxID=670289 RepID=UPI0011BDFA18
#=GS UniRef90_A0A1Q3KM49/4-69 DE [subseq from] Uncharacterized protein n=1 Tax=Alphaproteobacteria bacterium 65-37 TaxID=1895711 RepID=A0A1Q3KM49_9PROT
#=GS UniRef90_A0A225DK00/3-70 DE [subseq from] Uncharacterized protein n=1 Tax=Fimbriiglobus ruber TaxID=1908690 RepID=A0A225DK00_9BACT
#=GS UniRef90_A0A3E1NFY1/4-67 DE [subseq from] Uncharacterized protein n=1 Tax=Deminuibacter soli TaxID=2291815 RepID=A0A3E1NFY1_9BACT
#=GS UniRef90_UPI0015707348/3-70 DE [subseq from] hypothetical protein n=1 Tax=Hymenobacter sp. 9A TaxID=2735894 RepID=UPI0015707348
#=GS UniRef90_A0A7G4RF23/9-68 DE [subseq from] Uncharacterized protein n=1 Tax=Legionella sp. PC997 TaxID=2755562 RepID=A0A7G4RF23_9GAMM
#=GS UniRef90_A0A177QKT9/4-69 DE [subseq from] Uncharacterized protein n=1 Tax=Nitrospira sp. SCGC AG-212-E16 TaxID=1799664 RepID=A0A177QKT9_9BACT
#=GS UniRef90_UPI000A0039BF/3-69 DE [subseq from] hypothetical protein n=1 Tax=Bradyrhizobium sp. NAS80.1 TaxID=1680159 RepID=UPI000A0039BF
#=GS UniRef90_A0A537SU55/5-72 DE [subseq from] Uncharacterized protein n=1 Tax=Alphaproteobacteria bacterium TaxID=1913988 RepID=A0A537SU55_9PROT
#=GS UniRef90_UPI0009DA3672/5-71 DE [subseq from] hypothetical protein n=2 Tax=Verrucomicrobia TaxID=74201 RepID=UPI0009DA3672
#=GS UniRef90_UPI000943D660/10-75 DE [subseq from] hypothetical protein n=1 Tax=Rufibacter TaxID=1379908 RepID=UPI000943D660
#=GS UniRef90_A0A2K8YE90/1-69 DE [subseq from] Uncharacterized protein n=5 Tax=Bradyrhizobium TaxID=374 RepID=A0A2K8YE90_9BRAD
#=GS UniRef90_A0A2W6AI54/4-69 DE [subseq from] Uncharacterized protein n=1 Tax=Candidatus Dormibacteraeota bacterium TaxID=2052315 RepID=A0A2W6AI54_9BACT
#=GS UniRef90_UPI001AECAC8A/84-146 DE [subseq from] hypothetical protein n=2 Tax=Beijerinckia sp. 28-YEA-48 TaxID=1882748 RepID=UPI001AECAC8A
#=GS UniRef90_A0A411HJN1/1-70 DE [subseq from] Uncharacterized protein n=1 Tax=Pseudolysobacter antarcticus TaxID=2511995 RepID=A0A411HJN1_9GAMM
#=GS UniRef90_A0A7D4C1D1/3-70 DE [subseq from] Uncharacterized protein n=1 Tax=Hymenobacter sp. BRD128 TaxID=2675878 RepID=A0A7D4C1D1_9BACT
#=GS UniRef90_A0A3S0S9L9/2-68 DE [subseq from] Uncharacterized protein n=1 Tax=Hyphomicrobium sp. TaxID=82 RepID=A0A3S0S9L9_HYPSQ
#=GS UniRef90_UPI0015F67598/3-69 DE [subseq from] hypothetical protein n=2 Tax=Rhodospirillales incertae sedis TaxID=451274 RepID=UPI0015F67598
#=GS UniRef90_A0A2W5ZIQ4/3-69 DE [subseq from] Uncharacterized protein n=1 Tax=Candidatus Dormibacteraeota bacterium TaxID=2052315 RepID=A0A2W5ZIQ4_9BACT
#=GS UniRef90_A0A5C1ACH7/3-69 DE [subseq from] Uncharacterized protein n=2 Tax=Gemmataceae TaxID=1914233 RepID=A0A5C1ACH7_9BACT
#=GS UniRef90_A0A7X8SVC5/4-59 DE [subseq from] Uncharacterized protein n=1 Tax=Rhizobium sp. P38BS-XIX TaxID=2726740 RepID=A0A7X8SVC5_9RHIZ
#=GS UniRef90_UPI001647FE78/10-75 DE [subseq from] hypothetical protein n=1 Tax=Rufibacter TaxID=1379908 RepID=UPI001647FE78
#=GS UniRef90_A0A534V5G6/4-70 DE [subseq from] Uncharacterized protein n=1 Tax=Deltaproteobacteria bacterium TaxID=2026735 RepID=A0A534V5G6_9DELT
#=GS UniRef90_A0A7D3WQ23/3-70 DE [subseq from] Uncharacterized protein n=3 Tax=Hymenobacter TaxID=89966 RepID=A0A7D3WQ23_9BACT
#=GS UniRef90_UPI00067F5429/4-57 DE [subseq from] hypothetical protein n=1 Tax=Bradyrhizobium viridifuturi TaxID=1654716 RepID=UPI00067F5429
#=GS UniRef90_G3IVL7/18-76 DE [subseq from] Uncharacterized protein n=2 Tax=Methylobacter tundripaludum TaxID=173365 RepID=G3IVL7_METTV
#=GS UniRef90_A0A431QXA7/3-69 DE [subseq from] Uncharacterized protein n=2 Tax=Bradyrhizobiaceae TaxID=41294 RepID=A0A431QXA7_9BRAD
#=GS UniRef90_A0A2V7ZTA3/4-68 DE [subseq from] Uncharacterized protein n=2 Tax=unclassified Acidobacteria TaxID=305072 RepID=A0A2V7ZTA3_9BACT
#=GS UniRef90_A0A516TLI4/49-117 DE [subseq from] Uncharacterized protein n=2 Tax=Methylacidiphilum kamchatkense TaxID=431057 RepID=A0A516TLI4_9BACT
#=GS UniRef90_UPI00155D9B40/8-70 DE [subseq from] hypothetical protein n=1 Tax=Leptospirillum ferrooxidans TaxID=180 RepID=UPI00155D9B40
#=GS UniRef90_A0A2H9SEK4/12-70 DE [subseq from] Uncharacterized protein n=1 Tax=Legionella sp. TaxID=459 RepID=A0A2H9SEK4_9GAMM
#=GS UniRef90_A0A1H1JX39/3-70 DE [subseq from] Uncharacterized protein n=3 Tax=unclassified Rhizobiales TaxID=41292 RepID=A0A1H1JX39_9RHIZ
#=GS UniRef90_UPI000975F98E/3-57 DE [subseq from] hypothetical protein n=2 Tax=Bradyrhizobium TaxID=374 RepID=UPI000975F98E
#=GS UniRef90_A0A142H998/3-70 DE [subseq from] Uncharacterized protein n=3 Tax=unclassified Hymenobacter TaxID=2615202 RepID=A0A142H998_9BACT
#=GS UniRef90_UPI00031CAACE/3-70 DE [subseq from] hypothetical protein n=2 Tax=Zavarzinella formosa TaxID=360055 RepID=UPI00031CAACE
#=GS UniRef90_S9SB59/5-73 DE [subseq from] Uncharacterized protein n=2 Tax=Magnetospirillum fulvum TaxID=1082 RepID=S9SB59_MAGFU
#=GS UniRef90_I0IMJ9/13-73 DE [subseq from] Uncharacterized protein n=1 Tax=Leptospirillum ferrooxidans (strain C2-3) TaxID=1162668 RepID=I0IMJ9_LEPFC
#=GS UniRef90_A0A2Z3R562/2-66 DE [subseq from] Uncharacterized protein n=1 Tax=Acidiferrobacter sp. SPIII_3 TaxID=1281578 RepID=A0A2Z3R562_9GAMM
query MAAHKGAEHHHKAAEHHEQAAKHHHAAAEHHEKG-EHEQAAHHADTAYAHHKHAEEHAAQAAK-HDA-EHHAPKPH
UniRef90_D7BIZ4/1-73 MAAHKGAEHHHKAAEHHEQAAKHHHAAAEHHEKG-EHEQAAHHADTAYAHHKHAEEHAAQAAK-HDA-EHHAPKPH
#=GR UniRef90_D7BIZ4/1-73 PP 89********************************.****************************.***.*******9
UniRef90_A0A345WS72/1-69 MAEHKGAEHHRTAAEHHEHAAKHHRSAAEQHEAG-NHEKAGHHAAAAGGHASHAREHGEQASR-HHA-EHHG----
#=GR UniRef90_A0A345WS72/1-69 PP 799*******************************.****************************.***.***6....
UniRef90_A0A1F2V377/4-68 ----TGAEHHEAAAQHHEQAARHHHEAAKQDHSG-HHEKAGHYAHLAYAHFKHAEQHAAEAAK-THA-KNHT----
#=GR UniRef90_A0A1F2V377/4-68 PP ....69****************************.****************************.999.9995....
UniRef90_A0A3C0R222/4-69 ----KLKEHHTKAAEHHEHAAKHHRKAAEHHVSG-KHETAAHHAHLAHGHHMHARHHATEAAK-RHV-ELHGN---
#=GR UniRef90_A0A3C0R222/4-69 PP ....6679**************************.****************************.*99.99975...
UniRef90_A0A3G2VJ28/2-67 --AHQGAEHHHKAAEHHEKAAQHHREAAKHHESG-NHEKAAHHAHTAHGHATHASHHHTEASR-HHA-EQH-----
#=GR UniRef90_A0A3G2VJ28/2-67 PP ..8*******************************.****************************.***.*99.....
UniRef90_A0A317IC02/3-67 ---KKASEHHKKAAEHHRKAADHHEQASKHHDSG-SHEKAAHHAQTATGHHLHAEHHAHEATK-CHS-DEY-----
#=GR UniRef90_A0A317IC02/3-67 PP ...6899***************************.**************************99.666.555.....
UniRef90_A0A4P6K0I8/4-69 --NHPSVEHHKKAAEHHTKAAEHHTKAAEHHTKG-EHEAAAHHAHLAHGHHAQATEHANEAAK-KHA-SHT-----
#=GR UniRef90_A0A4P6K0I8/4-69 PP ..58999***************************.****************************.999.996.....
UniRef90_A0A142HH28/3-70 ---KKAADSHKKAAEHHTEAAKHHTEAAKHHEAG-SHEKAAHHAHTAAAHKDHATEHATTARK-AHA-EEHGKK--
#=GR UniRef90_A0A142HH28/3-70 PP ...6899***************************.****************************.***.*99865..
UniRef90_A0A402A866/4-68 --GHPSIEHHRKAAEHHRKAAEHHEKAAEHHAKG-EHETAASHAHMAHGHHIQATEHLEEAAKKHTA-Q-------
#=GR UniRef90_A0A402A866/4-68 PP ..6999****************************.************************99862665.5.......
UniRef90_UPI00131BC0F4/3-69 --NHQGATHHKKAAEHHEMAAKHHAQAAHHHESG-EHEAAGHHAHAAAGHAAHAKDHAEHAAK-HHA-ETHA----
#=GR UniRef90_UPI00131BC0F4/3-69 PP ..6*******************************.****************************.***.***8....
UniRef90_A0A5E6MFW5/5-71 -----IAEHHEKAAMHHEHAATHHKKAAEHHRKG-EHVESGHHAHIAHGHAEHAEVHAKEAAK-EEA-TVHDKEP-
#=GR UniRef90_A0A5E6MFW5/5-71 PP .....59***************************.****************************.***.9997665.
UniRef90_A0A6M1MC51/1-69 MATHQGAEHHKKAAEHHEHAARHHREAAKHYEAG-SHEKAAHHAHTAHGHASHATHHHTEASR-HHA-EQHG----
#=GR UniRef90_A0A6M1MC51/1-69 PP 899*******************************.****************************.***.*996....
UniRef90_A0A368HF25/2-66 ---HEGAEHHKNAAKHHTEAAKHHTEAAKHHDAG-QHEKAAHHAHLAYAHSVHAAHYREEAAK-HYA-AHN-----
#=GR UniRef90_A0A368HF25/2-66 PP ...9******************************.****************************.***.996.....
UniRef90_A0A2N3PRK8/17-83 --EHRAAEHHRSAVSHHEAAARYHREASKHYQIGHDHAHAAHQALIALGQAWQAVDHAKTANG-YYA-DHD-----
#=GR UniRef90_A0A2N3PRK8/17-83 PP ..59****************************995699********************99999.999.885.....
UniRef90_A0A2N3PRK8/115-180 ------AEHHAVAADNHEQAAKHHRRAAQHCDEK-NYMMAACEAHLAHGHAQHSIFHGIEAAK-HHV-DHQTQNP-
#=GR UniRef90_A0A2N3PRK8/115-180 PP ......89**************************.****************************.***.**98776.
UniRef90_A0A7Y3P168/15-76 ---NKGIENHKKAAKHHEEAAKHHHEAAKHHEAG-NHDKAFESTIKAYGHHCLANEAQ----R-EDL-KHHA----
#=GR UniRef90_A0A7Y3P168/15-76 PP ...79*****************************.****************9988754....5.566.6665....
UniRef90_A0A4R8DP52/4-70 ----EHAEHHKKAASHSEKAAEHHHEAAKHYEAG-DHEAGAHHAHAAHAHHLHAEDHAKHAAK-LHA-EHHGEK--
#=GR UniRef90_A0A4R8DP52/4-70 PP ....569***************************.****************************.***.***865..
UniRef90_A0A1I4D138/7-73 ----KIAEHHTQAAQHHEKAAEHYKEAAKHHETG-AVEKGAHHAQVSQGHAVHAEYHADEAAK-AHA-QHHANK--
#=GR UniRef90_A0A1I4D138/7-73 PP ....779***************************.****************************.***.***976..
UniRef90_UPI0011BDFA18/9-74 ---KKSAEHHQIAADHLEQAAKNHRAAAEHLAAG-DHQKAAHHGYTAYGLSSHAQYHAQQAAL-HHS-HEHK----
#=GR UniRef90_UPI0011BDFA18/9-74 PP ...4789***************************.***************************9.877.5553....
UniRef90_A0A1Q3KM49/4-69 ---DKIIEHHRSAADHHEKAAQHHREAAKHHESD-SHEKAAHHAHSAHGHSAHATHHAGEASK-HHA-EHHG----
#=GR UniRef90_A0A1Q3KM49/4-69 PP ...5678***************************.****************************.***.***6....
UniRef90_A0A225DK00/3-70 ---KKAAESHKKAAESHKKAGEHHEQAAKHHEAG-NHEKAAHHAHTAKGHQTHAERHTNDAAA-HHA-EEHGAK--
#=GR UniRef90_A0A225DK00/3-70 PP ...689****************************.****************************.***.*99865..
UniRef90_A0A3E1NFY1/4-67 -------KNHEDAAKHHEEAAKHHRSAAEEAGKG-NHEKAAHHAQAAHGHTEHAKEHAREASK-KYA-QQHEEK--
#=GR UniRef90_A0A3E1NFY1/4-67 PP .......57999**********************.****************************.***.999876..
UniRef90_UPI0015707348/3-70 ---KKAVDSHKKAAAHHTEAAAHHTEAAKHQEAG-SHEKAAHHAHTAAAHTDHAAEHATQARK-SHA-EDHGTK--
#=GR UniRef90_UPI0015707348/3-70 PP ...578899*************************.****************************.***.*99865..
UniRef90_A0A7G4RF23/9-68 ----KLKQHHTLAAEHHKKASEHHNEAAKYHQSG-DHEQGHHHAHLARGHHEHAQHHSSEAAK-HS----------
#=GR UniRef90_A0A7G4RF23/9-68 PP ....56789*************************.****************************.*7..........
UniRef90_A0A177QKT9/4-69 ----QAADHHRKAAEHHEHAARDHKEAAKYYEAG-EHEKAAHYAHRAHAHHLHVAHHSAEATK-SHL-EHHDK---
#=GR UniRef90_A0A177QKT9/4-69 PP ....689***************************.****************************.***.***75...
UniRef90_UPI000A0039BF/3-69 ---KKAAEHHKQAAEHHTQAARHHGEAAKHYEGG-QHEKAAHHAHTASGHGHHANYHTEEAGK-AHM-EEHGK---
#=GR UniRef90_UPI000A0039BF/3-69 PP ...689****************************.****************************.999.99976...
UniRef90_A0A537SU55/5-72 --THKGGSHHETAADHHETAAHHHREAAKHYESG-DHEKAGHHAHVAHAHGLHAAHHGQEAAK-HHA-EQHAE---
#=GR UniRef90_A0A537SU55/5-72 PP ..7*******************************.****************************.***.***96...
UniRef90_UPI0009DA3672/5-71 -----IAEHHEQAAMHHEHAAIHHKKAAEHHRKG-EHAESGHHAHIAHGHAQHAEHHAELAAK-EEA-TMHDKEP-
#=GR UniRef90_UPI0009DA3672/5-71 PP .....59***************************.****************************.***.9997766.
UniRef90_UPI000943D660/10-75 ---KKSAENHRKAAEYFEQAAANHRAAAEHLAKG-DHEKSAHHGYTAYGLSSHGRHHAEDAAL-HHS-HEHK----
#=GR UniRef90_UPI000943D660/10-75 PP ...4789***************************.**************************99.877.5553....
UniRef90_A0A2K8YE90/1-69 MSDHAGVEHHHKAAEHHEHAAHHHREAAKHHAAG-DHEKAAHHAHSAHGHASHAEHHHTEASR-HHA-EHHG----
#=GR UniRef90_A0A2K8YE90/1-69 PP 678*******************************.****************************.***.***7....
UniRef90_A0A2W6AI54/4-69 ----EAAQHHQQAAEHHEHAGRHHREAAKAHEAG-DHAKAAHHAHTARGHHEHASHHAAEAAK-SHV-EHHGH---
#=GR UniRef90_A0A2W6AI54/4-69 PP ....689***************************.****************************.***.***86...
UniRef90_UPI001AECAC8A/84-146 ------HEHHTKAAEHHELAAKHHREAAKHHESG-EHEKAAHHSKIAHGHSLHATEHHEHASK-KHA-EHHS----
#=GR UniRef90_UPI001AECAC8A/84-146 PP ......59**************************.****************************.***.***5....
UniRef90_A0A411HJN1/1-70 MSSHTVAEHHQKAAEHHTLAAEHHHEAAKHHSDG-AHEKAAHHAHLGHSHHLHATHHSQEATK-QFGHDHHA----
#=GR UniRef90_A0A411HJN1/1-70 PP 789*******************************.**************************99.75526776....
UniRef90_A0A7D4C1D1/3-70 ---KKAAEHHKHAATHHAEAAKHHTAAATHHEAG-HHEKAAHHAHTAAAHTEHATEHTSHARK-AHA-EEHGTK--
#=GR UniRef90_A0A7D4C1D1/3-70 PP ...689****************************.****************************.***.*99865..
UniRef90_A0A3S0S9L9/2-68 --AQKPHEHHQKAAEHHEQAAQHHKEAAKQHQAG-QHEKAAHHAHLAEAHHIHAKEHHEEAAK-AHL-AMHG----
#=GR UniRef90_A0A3S0S9L9/2-68 PP ..67889***************************.***************************9.766.6665....
UniRef90_UPI0015F67598/3-69 --KDKIVEHHNAAAEHHEHAAKHHREAATHHEAD-NHEKAGHHAHSAHGHSSHAAHHAGEASK-HHA-EHHG----
#=GR UniRef90_UPI0015F67598/3-69 PP ..56889***************************.****************************.***.***7....
UniRef90_A0A2W5ZIQ4/3-69 ---KKAAEHHGQAADHHEKAAQHHRQAKTHHEAG-DHQAAAHDAHTARGHHEHAAHHASEAAK-AHA-EEHGH---
#=GR UniRef90_A0A2W5ZIQ4/3-69 PP ...699****************************.****************************.***.*9975...
UniRef90_A0A5C1ACH7/3-69 ---KKAAASHKKAAEHHKKAGEHHENAAKHHEAG-NHEKAAHHAHTAKGHQSQAEKHGDEAAA-SHA-EEHGT---
#=GR UniRef90_A0A5C1ACH7/3-69 PP ...588999*************************.*************************999.999.99976...
UniRef90_A0A7X8SVC5/4-59 -------ESHTKAAEHHENAAKSHRSAAEHHGKG-DHEKGREHSKTAHAHSQSAHEHSDAAHK-K-----------
#=GR UniRef90_A0A7X8SVC5/4-59 PP .......889************************.**********************987766.5...........
UniRef90_UPI001647FE78/10-75 ---QKSAESHRKAAQYYQQAAEQHRAAAEHLNSG-DHEKAAHHGYTAYGLSEHARHHAKEAAL-HHS-HEHK----
#=GR UniRef90_UPI001647FE78/10-75 PP ...589****************************.**************************99.877.5553....
UniRef90_A0A534V5G6/4-70 ----QAAEHHTKAAEHHEHAARHHKEAAKHHEAG-NHEKAAHHAHVAHGHHLQAIHHHEEATK-FHL-EHHGKK--
#=GR UniRef90_A0A534V5G6/4-70 PP ....689***************************.****************************.***.***865..
UniRef90_A0A7D3WQ23/3-70 ---KKAAESHKHAAQHHTEAAKHHTEAAKSHEAG-NHEKAAHHAHTAAAHTEHATEHAGHARK-SHA-EEHGKK--
#=GR UniRef90_A0A7D3WQ23/3-70 PP ...6899***************************.****************************.***.*99865..
UniRef90_UPI00067F5429/4-57 -------EEHNKAAEHHENAAKAHRSAAEHHGKG-DHAKGMEHADTARQHSQTAHQHSEQAH--------------
#=GR UniRef90_UPI00067F5429/4-57 PP .......899************************.***********************9985..............
UniRef90_G3IVL7/18-76 ------QQHHQKAAEHHEQAAKHHKEAAKHYESG-DDKTAAQHAHIAHGYSTQAMEQEMEASK-KYA---------
#=GR UniRef90_G3IVL7/18-76 PP ......589*************************.**********************999999.766.........
UniRef90_A0A431QXA7/3-69 ---KKAAEHHKQSAEHHTHAARHHGEAAKHHESG-AHEKAAHHAHTARGHALHARHHSDEAAK-LHM-EEHGK---
#=GR UniRef90_A0A431QXA7/3-69 PP ...689****************************.****************************.999.98875...
UniRef90_A0A2V7ZTA3/4-68 ----EAVDHHRKAAEHFEHAAQHHSAAASHYGAG-RYDQASREAYLAHGHYLHGSNHAAEAAR-LHT-RHFG----
#=GR UniRef90_A0A2V7ZTA3/4-68 PP ....5689**************************.***************************9.888.8865....
UniRef90_A0A516TLI4/49-117 ---DTVAEEHEKAAMHHEHAAVHYRKAAEHHRAG-EHADSGHHAHIAHGHAKHAQAHAEAAAK-EEA-NMHDKKP-
#=GR UniRef90_A0A516TLI4/49-117 PP ...56699**************************.****************************.***.***9998.
UniRef90_UPI00155D9B40/8-70 ------QEHHQKAAEHHEHAAEHHKEAAKHHASG-DHKTASHHAHIAHGHSVHAREHEEEASK-KYV-VLHG----
#=GR UniRef90_UPI00155D9B40/8-70 PP ......69**************************.**************************99.876.6665....
UniRef90_A0A2H9SEK4/12-70 ------HKHHLKAAEHHKKAAEHHSEAAKHHEAG-EHEKGQASAYLALAHGRHAKDESCEACS-HYA---------
#=GR UniRef90_A0A2H9SEK4/12-70 PP ......57999***********************.***************************9.976.........
UniRef90_A0A1H1JX39/3-70 ---KKAAEHHKKAAEHATHVARHHGEAAKHHEAG-HHEKAAHHAHTAMGHAFHARGHAEEAAK-AHA-EEHGKK--
#=GR UniRef90_A0A1H1JX39/3-70 PP ...699****************************.****************************.***.*99865..
UniRef90_UPI000975F98E/3-57 ------KEEHNKAAEHHENAAKAHRSAAEHHGKG-DHAKGMEHANTAMQHSQTAHQHSEQAH--------------
#=GR UniRef90_UPI000975F98E/3-57 PP ......589*************************.***********************9985..............
UniRef90_A0A142H998/3-70 ---KKAAESHKHAATHHAEAAKHHTEAAKHHEAG-SHEKAAHHAHTAAAHTAHATEHATHARK-AHA-EEHGTK--
#=GR UniRef90_A0A142H998/3-70 PP ...6899***9***********************.****************************.***.*99865..
UniRef90_UPI00031CAACE/3-70 ---KKAAESHKKAAESHKKAGEHHEQAAKHHEAG-HHEKAAHHAHTAKGHQTQAEKHGNDAAT-QHA-EDHGSK--
#=GR UniRef90_UPI00031CAACE/3-70 PP ...689****************************.****************************.99*.999865..
UniRef90_S9SB59/5-73 MATLKANEHHAAAAAHSESAAQHHKEAAKQFDSG-HHEKAAHHAQVAAGHSAHATEHATEATK-KYA-EQHS----
#=GR UniRef90_S9SB59/5-73 PP 6778999***************************.****************************.***.*997....
UniRef90_I0IMJ9/13-73 ----KPQEHHKEAAQHHEEAAKHHKEASKMYEAG-DHKTAAHHAHSATGHASSAEEHQNEASR-KHA---------
#=GR UniRef90_I0IMJ9/13-73 PP ....6789**************************.***********************99987.655.........
UniRef90_A0A2Z3R562/2-66 ---HEGAEHHKNAAKHHTEAAKHHTEAAKHHDAG-QHEKAAHHAHLAHAHGTHAAHHHEEAAK-YYA-AHH-----
#=GR UniRef90_A0A2Z3R562/2-66 PP ...9******************************.****************************.***.**9.....
#=GC PP_cons 7877889***************************.***************************9.999.99876679
#=GC RF xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.xxxxxxxxxxxxxxxxxxxxxxxxxxxx.xxx.xxxxxxxx
//
tests/test_data/short.fasta
0 → 100644
View file @
85d39c80
>query
MAAHKGAEHHHKAAEHHEQAAKHHHAAAEHHEKGEHEQAAHHADTAYAHHKHAEEHAAQAAKHDAEHHAPKPH
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment