Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
OpenFold
Commits
96809433
"...AutoBuildImmortalWrt.git" did not exist on "f460cfe7cbe45f534689873039e41e3c4f62db14"
Commit
96809433
authored
Dec 30, 2021
by
Gustaf Ahdritz
Browse files
Change name of test data directory
parent
fe5e581a
Changes
4
Show whitespace changes
Inline
Side-by-side
Showing
4 changed files
with
2566 additions
and
0 deletions
+2566
-0
tests/test_data/alignments/bfd_uniclust_hits.a3m
tests/test_data/alignments/bfd_uniclust_hits.a3m
+2048
-0
tests/test_data/alignments/mgnify_hits.sto
tests/test_data/alignments/mgnify_hits.sto
+183
-0
tests/test_data/alignments/pdb70_hits.hhr
tests/test_data/alignments/pdb70_hits.hhr
+158
-0
tests/test_data/alignments/uniref90_hits.sto
tests/test_data/alignments/uniref90_hits.sto
+177
-0
No files found.
tests/test_data/alignments/bfd_uniclust_hits.a3m
0 → 100644
View file @
96809433
>query
MAAHKGAEHHHKAAEHHEQAAKHHHAAAEHHEKGEHEQAAHHADTAYAHHKHAEEHAAQAAKHDAEHHAPKPH
>tr|A0A2A6NXF8|A0A2A6NXF8_9BRAD Uncharacterized protein OS=Bradyrhizobium sp. C9 OX=142585 GN=CO675_03465 PE=4 SV=1
MSDHAGVEHHHKAAEHHEHAARHHREAARHHEAGDHHKAAHHAHSAHGHASHAQHHHTEASRHHAEHHGEH--
>tr|A0A1F2V377|A0A1F2V377_9BACT Uncharacterized protein OS=Acidobacteria bacterium RIFCSPLOWO2_12_FULL_60_22 OX=1797188 GN=A3J28_14435 PE=4 SV=1
-MPRTGAEHHEAAAQHHEQAARHHHEAAKQDHSGHHEKAGHYAHLAYAHFKHAEQHAAEAAKTHAKNHTG---
>SRR6202048_823629
MTDHAGVEHHHKAAEHHEQAAKHHREAAKHHEAGDHEKAEHPAPTAPGHASHAEEHHAEASRHHAEHHV----
>ERR1700724_1870475
MADHAGVEHHHKDAEHHEPAAKHHREAAKRHEAGDHEKAAHNAHSVQGHASHAEEHHAEATRHPAEPH-----
>ERR1700758_4094796
MRDHAGVEHHHKAAEHHEHAARHHREAAKHHEAGDHEKAAHHAHTAHGHHQHATHHGAEAAKAHTEHHG----
>ERR1700724_4573945
MADHAGVEHHHKAAEHHEHAAKHHREAAKHHEAGDHGKAGPPAHTGHGQATP---------------------
>SRR5271157_4511021
MSEHKGVEHHHKAAEHHEHAARHHREAAKHHEAGHHEKAAHHAHTAHGHASHAEHHATEAAKAHAEAHG----
>SRR5579863_5645041
EMSKQAVEQHLKSAEHHEQAARHHKEAAKHHQSGNHEKAAHHAHMAHGHHEHAQHHAAEAAKAHAQEHD----
>ERR1700733_9528035
SMAHHGAEHHHKAAEHHEQAAAHHREAAKHHESGDHQKAGHHAHIAHGHTLHAAQHAEEAGKHHADQHG----
>SRR6202030_3868138
-MSKQAAEHHHKAAEHHEHAARHHREAATHHESGNHETAAHHAHTAQGHLNHATHHASESAKQHAEHHGEK--
>ERR1700691_2094390
-MSKQTAEHHHKAAEHHEHAARHHREAAKHHETGNFETAAHHAHSAQGHLHHATHHSAEAAKAHVDHHGHK--
>ERR1700730_18364367
-MSKQAAEHHHKAAEHHEHRARHHKEDAKHHEAGKLETASRHARLAKGHHEHAIHHAAEAVKPHLEHYGKT--
>ERR1700683_378504
LMSKQAAEHHHKAAEHHDTAARHHREAAAHHEAGDHYQAAHHAHTAQGHLHHATHHSEEAAKLHVEHHGHKT-
>ERR1700735_4440382
LMSKQAAEHHHKAAEHHDHAARHHREAAAHHEADNHETAAHHAHTAQGHSHHATHHATEAAKHHVEHHGEKA-
>SRR5271166_1724810
HMSKEAAEHHHKAAEHHEHAAKHHKAAAAAHEAGNHEKAGHHAHVAEGHLNHATHHAEEASKLHATEHGHKX-
>ERR1700689_5314695
LMSKQAAEHHHKAAEHHDHAARHHRETRGHHEASEQ-------------------------------------
>SRR5215468_10051876
TMSKHLAEAHHQAAEHHEHAARHHREAAKHHEAGDHETAAHHAHTAQGHLHHATHHSTEASKQHAEHHGGTA-
>SRR6202795_3681341
PMSKKAAEHHLQAAEHHEHAARHHREAAKHHEAGDHESAAHHAHTAQGHLHHATHHSAEAAKMHVEDHGEKR-
>ERR1700681_451985
PMSKQAAEHHTKADDHHENAARHHREAARHHEADDHESAAHHAHTAQGHLHHATHHAAEAAKSHAEHHGNKT-
>SRR5580698_71909
EMSKQAAEHYHKAAEHHEKAALHHRHAAKHHEADDHKSAAHHAHTAQGHLHHAAHHATEAAKLHV--------
>SRR6516225_6087423
YMSKEAAEHHREAAQHHEQAAKHHHEAAKHHEAANHQEAAHHAHSAQGHLHHATHHAAEAAKLHAEHHGHKA-
>SRR5215469_385579
PMSKEAAEHHGKAADHHEHAAHHHREAAKHHESGNWETAAHHAHTAQGHLHHATHHASEAAKLHAEQHGSKT-
>SRR6202008_2710750
QMSKQAAEPHGKAAEHHEHAARHHREAAKPHESGNYETAAHHAPSAQGHLHHATHHAIEAAKSPLEHHGSKS-
>ERR1700685_3697209
TMSREAAEHHRLAAEHHDHAARHHREAAKHHEDGDHHSAAHHAHTAQGHTHHSSHHAAEAAKAHAAEHGHKS-
>ERR1022692_1727563
SMSENHIDHHHKAAEHHEHAAKHHHAAAEHHANEHAASEHMSAPX----------------------------
>ERR1022692_874132
SMSENHIEHHHKAAEHHEHTAKHHHAAAEHHQNGDHEKASHHAHAAHGHALHAEHHANEAAKHHANEHAAS--
>SRR5882757_333284
LMSKHAVEHHHKAAEHHEHAAKHHREAAKHHDSGDHEKAAHHAHTAHGHASHAEEHHHEASRHHAEHHGAH--
>SRR5580693_5755422
SMSENHIDHHHKAAEHHEHAAKHHRAAAEHHQNGNHEKGAHHAHAAHGHSLHADHHATEAAKHQANEHGHH--
>tr|A0A1Q3KM49|A0A1Q3KM49_9PROT Uncharacterized protein OS=Alphaproteobacteria bacterium 65-37 GN=BGN99_28215 PE=4 SV=1
MPKDKIIEHHRSAADHHEKAAQHHREAAKHHESDSHEKAAHHAHSAHGHSAHATHHAGEASKHHAEHHGGH--
>SRR5450755_4508362
NMSDHAVEHHHKAAEHHEHAAKHHREAAKHHETGDHEKEAHHAQVAHGHGLHADHHASEAAKQHANEHGDA--
>SRR5471032_1874884
GERNMSVEHPHKAAKHHEHAAKHPREATKHHEAGDHEKAAHHAHTAHGHASHAEEHHAEASRHHAVHHGAH--
>SRR5215472_4418814
RPKMTPHEHHHKAAEHHEHAARHHREAAKHYEAGNHEKAAHHAHLAHGHHLHALHHGQEAAKGHV--------
>SRR5271165_4906151
HMSKNATEHHRKAAEHHEHAAKHHHAAAEHHEAGNHEKAGHHAHVAEGHLNHATHHSEEASKHHANQHAHS--
>ERR1700680_4602609
FTINKGAEYHKKAAEHHELAAKHHREAAKHHEAGSHEKAAHHSEIAAGHGLTAVHHTEEATKHHPEEHTEK--
>ERR1019366_9606641
MATHKGTEHHKKAAEHHELAAKHHREAAKLHEAGSHEKAAHHAQIAAGHGLHAVYHTEEATKHHADEHTGK--
>SRR5271165_1617824
MATHKGAEHHKKAAEHHELAAKHHREAAKHHEAGSHEKAAHHSEIAAGHTLQAVHHTEEAVKAHLDEHGKK--
>SRR5580693_8743019
FTINKGAEYHKKAAEHHELAAKHHREAAKHHEAGSHEKAAHHSHSAHGHASQAEHHHAQASRH----------
>ERR1035438_5033680
MATHKGAEHHKKAAEHHDLAAKHHQEAAKHHEAGSHEKAAHSSEVATGHGLHAVYHTEEAIKHHADEHTGK--
>SRR5581483_8067321
MNDHEIHEHHEHAADHHEHAAKHHREAAQHHKAGDHEKAAHHSKIAHGHHLHAVEHHEHAAKKHADEHE----
>ERR1700737_2020172
IMSKQAAEHHKKAAEHFEHAARHHKEAAKHHDAGAHEKAAHHAHVAHGHHLHARHYAEEAAKSHVEHHGKKX-
>SRR5215831_18626949
DMSKEAAEHHRKAAEHLEHAAHHHKEAASHHEAGAHEKAAHHAHVAHGHHLHADHHAEEAAKTHVEHHGKK--
>ERR1022692_4502959
MATNQAAEHHHKAAEHHEHAARHHKEAAKHHEAGNHEKAAHHAHLAHGHTHHATHHAAEAAKAHVEHHGKKPX
>ERR1022692_1596713
MATNKAAEHHHKAAEHHEHAARHHKEAAKHHEAGNHEKAAHHAHLAHAHHLHVTHHSTEATKAHAQDHGSKX-
>ERR1700726_249602
RMSKQAAEHHNKAAQHHEQAAEHHREAAWYHEDGDHEAAAHHAHTAQGHLHHATHHAAEAAKLHVEHHGHKV-
>SRR5580693_6803329
QMSKQAAEHHNQAAEHHDHAARHHREAARHHEAGDHEAAAHHAHTAQGHQHHATHHATEAAKLHVEHYGQKV-
>ERR1700690_193599
QMSKEISDHHHSAAKHHESAAHHHKEAAKHHEAGNHEKAAHHAHTAHGHMTHATHHAAEAAKLHVEHHGSHK-
>ERR1700683_4574945
SMSKQAAEHHHRAAEHHEHAARHHREAAKHHEAADRLSAAHHAHTAHGHLQHATHHASEAAKSHVEHHGHKV-
>SRR3974390_98844
NMSKQAAEHHHKAAEHHEHAARHHREAAKHHEAGDHHLAAHHAHTANGHHHHAMHHSAEAAKAHAQEHGGAS-
>SRR5208282_6358198
FMSKHAVEHHHKAAEHHDHAAHHHREAARHHEAGEHHLAAHHAHLASGHHHHAMHHSAEAAKAHVEHHGESA-
>ERR1700678_2201630
QMSKKAVQHHTSAAEHHEHAARHHREASKHHEAGDHESAAHHAHTATGHLHQATQHGAEAAKAHAEEHGNKK-
>SRR6516225_12260139
TMSIQAAEHHNKAAEHHDHAARHHREAAKHYQAGDHHLAAHHAQTASGHHQHAMHHANEAAKAHA--------
>SRR5262245_37247554
ASKHNEAEHHIKAAEHHEQAARHHREAAKHHEAGAHDKSAHHAHIAYGHTTHARQHAQEAGKAHADEHGHHA-
>ERR1017187_10159119
CMSKQAIEHHRKAAEHHEHAARHHKEAAKHHEAGKHETAAHHAHLARGHHEHAMHHAAEAAKAHVEDHGGQ--
>ERR1039458_8411690
CMSKQAAEHHRKAAEHHEHAARHHKEAAKHHEAGRRGARRV--------------------------------
>SRR5580704_692809
SMSKEAVEHHKKAAEHHEHAAKHHHAAAEQHEAGNHEAAAHHAHVAHGHHSHATHHAGEASKHHAEAHSX---
>ERR1019366_8910956
CMSKQAAEHHRKAAEHHEHAARHHKEAAKHHEAGKHVTAAHHAHLARAHHDVATHHAVEAAKAHLEEHGKA--
>ERR1035438_7570652
KMSKKAAEHHRKAAEHHEHAARHHKEAAKHHDAGAHETAAHHAHTAHAHHEVATHHAVEAAKSHLEDHGKA--
>ERR1039457_5679623
-MSKKAAEHHLKAAEHHEHAARHHKEAAKHHQAGSHEKAAHHAHIARAHHEHADEHAIEAAKAHAEEHGNK--
>ERR1700704_2262512
-MSKKAAAHHKKSAEHHEHAARHHKEAAKHHDAGAHEKAAHHAHLAHGHSHEAMDEEAEAAKSHREEHGSK--
>SRR3974377_1513360
-KSHPAHAHHVKAAEHHEHAAKHHKEAAGHYAAGHHETAAHHAHSAHAHMLHATHHAGEAAKAHVAHHSQK--
>ERR1700753_3973459
-MSKTAADHHKKASEHHQHAARHHAEAAKHHEAGNHEKAAHHAHAAHGHTSHAREYGERASRAHSKEHGTK--
>SRR5262249_40564937
MTAHQGAEHHRAAAEHHAKAAHHHREAAKHHDDEDHTQAAHHAHSAHGHASHAAHHASEASKHHAEHHGDL--
>SRR5215831_3075836
MSEHQGAEHHRSAAEYHEKAAHHHREAAKYHEDGEHMQAAHHAHSAHGHSMHAAHHASEASKHHAEHHDDA--
>ERR1700681_3605376
MTAHKGADHHRSAAEHLENAAHHHREAAKHHDEGDHRQAAHHAHTAHGRATHAAHHSSEASKHHAERHGDI--
>ERR1700735_830787
LMSKHAVDHHHKAAEHHEHAARHHKEAAKHHEDGKHETAAHHAHLAHGHHEHATHHAIEAAKAHVEHHGX---
>SRR6202046_837085
LMSKHAVDHHHKAAEHHEHAARHHKEAAKHHEAGKNETAAHHSALARAHQHHASRHSEDW-------------
>SRR5450432_1682947
YMPHQAAEHHHKAAEHHEYAARHHKEAARHHEAGKHETAEHHVHLANGHQQDAIHHAAEAVKVQIERP-----
>SRR5579863_9408980
PMSHEAADHHHKAAEHHEHAARHHRDAAQRHKEGHHEGAAHHAHLAHAHHVHAVEHAEQAAKHHIEAHGS---
>SRR5579863_4526352
-MSKKAAEHHKKASEHHSQAARHHGEAAKHHEAGNHEKAAHHAHTASGHAAHARTHSEEAGKAHLEEHGKK--
>SRR5271165_3194793
-MSKKAADHHTKASEHHAEAAKHHSEAAKHHGAGHHEKAAHHAHTASGDASHARTHAEEAGKAHAEEHGKK--
>SRR5262249_11842710
-MSIKASEHHKKASEHHSRAAHHHEEAAKHHAAGHHEKAAHHAHSASGHATHARTHAEEAMKSHVEEHSKK--
>SRR5580658_11071561
-MSKKAAEHHRKAAEHHAQAAKHHDSAADSHEAGNHEKAAHHAQTARGHHKQAEEHSDEATKAHSSEHGHK--
>ERR1700761_2911494
-MSTKAAQHHKNPADHHTQAASHHTEAAKHHESGNHEKAAHHAHTASGHAHHATHHGEEAGKAHMEEHGKK--
>tr|I0IMJ9|I0IMJ9_LEPFC Uncharacterized protein OS=Leptospirillum ferrooxidans (strain C2-3) GN=LFE_0783 PE=4 SV=1
-SKMKPQEHHKEAAQHHEEAAKHHKEASKMYEAGDHKTAAHHAHSATGHASSAEEHQNEASRKHASLFGDK--
>SRR5215510_6524027
EMSKQAAEHHTKAADHHEHAARHHREAAKHHEAGNHEKAAHHAHVAHGHHLQALHHHEEAQSSISSITARS--
>SRR5580704_15507109
PMSQQSAEHHTKAAEHHEHAARHHREAAKHQTSGSHEKAGHHAHVAHGHHLHAIHHSEEAAKHHAEEHGSK--
>SRR5499433_4327306
PMSTKAAEHHEQAAAHHEHAARHHKEAAKHHKAGDHEKAAHHAHVAHGHHLQAIHHHEEATKFHLEHHGKK--
>SRR6516164_3912753
TMSKKAAEHHTKAAEHHEHAAKHHREAAKHHAAGHHEKAAHHAHVAHGHAHHASHHSTEAAKGHVEEHGHK--
>SRR5215471_1111430
PMSQKSAEHHTKAAEHHEHAARHHKEAAKHYAAGSHEKAAHHAHLARGHDLHADQHAEEAAKHHVEEHGSK--
>SRR5215467_15503710
PMSKQAAEHHTKAAEHHEHAARHHREAAQHHEDGDHETAAHHAHTAQGHLHHATHHAEEAAKQHVEHYGSK--
>SRR5580700_9320380
LPMNQPTEHHTKAAEHHEHAARHHKEAAKHQASGNHEKVAHHAHTAHGHHLQAAHHAEEAAKQHAVEHGSK--
>SRR6516225_9609087
SMPQKLKEHHTKAAEHHEHAAKHHRKAAEHHGAGKHELAAHHAHAAHGHHLHATHHASEAAKRHVELHGNK--
>SRR5580692_12958183
GMAKQIAEHHTKAAEHHEHAAKHHREAAKHHESGNAETAAHHAHLAHGHTQFANHHAGEAAKAHIADHSKT--
>SRR5580658_3666240
YMSHEAAHHHTKAAEHHEHAARHHHHAAKHHADGAHPDAAHHAPLAHGHHIHAAEHAEHAVKHHIEAHGEK--
>ERR1700679_1520425
LMSKQTAEHHTKAAEHHEHAARHHKEAAKHHEAGKVETAAHHAHLAHGHHQYASHHAGEAAKAHIEDSDKS--
>SRR5579863_67246
SMSKESAEHHSKAAEHHEHAARHHRAAAEHHEAGNHEKAGHHAHVAAGHHHQATHHAEEASKHHATAHGHH--
>SRR5262249_23440399
KCQSKQQNITLKLPNITSNAARHHKEAAKHHEAGNHEKAAHHAHVAHGHHLQAIHHHEEATKFHLEHHGKK--
>ERR1039458_5673656
DMSKAAAAHHLKAVEHHEHAARHHREAAKHHEAGNHEKAAHHAHLAHGHHLHATEYAGEAAKAH---------
>ERR1035437_7167454
DMSKQAADHHKQAAEHHEHAARHHQEAATQYEAGNHEKAAHHAHLAQGHHVHATEHAEHAAREHVEAHGAK--
>ERR1039458_9997194
SMSKEAPHHHTQAAEHHEHAAHHHHEAAKHHLEGNHEAAAHHAHLAHGHHIHAAEHAEHAAKQHIEAHGQK--
>SRR5262245_65951219
MAKHKGAEHLERAAEHHELAAHHHREAAKHYEAGNPEKAGNHEHIEHGDHLCVTYKAEGAGCTQRHDX-----
>SRR5215470_17505878
MANHKGAEHHENAAEHHQLAAQHHREAAKHYESGNHEKAGHHAHIAHGHHVHATYHAEEASKSHATEHGGQ--
>SRR6266540_5162723
MAKHKGAEHHERAAEHHQLAAHHHREAAKHYEAGKPEKAGHHAHIAHGHHLHATYHAEEAGKRHATEYGGQ--
>SRR5215475_12112351
MAKHKGAEHHKRATEHHELSARHHREAAKHYEASDPEKAGHHAHIAHGHHLHATYHAEEAGKHHATEHSSQ--
>ERR1700694_1609601
MATHKGADHHRKAAEHHEHAAKHHHEAAKHHESGNHEKAGHHAHIAHGHTQHAAHHATEAAKHHSDEHGGT--
>SRR5579862_1684974
PMSKERAEHHRKAAEHHGHAAKHHLAAAEHHEAGNHEKAGHHAHVAHGHQLHAVHHAEEAGKHHANEHTHQ--
>SRR6266511_5188420
MAKHKGAEHHERAAEPHHPSAPPPREA----------------------------------------------
>tr|A0A2W5ZIQ4|A0A2W5ZIQ4_9BACT Uncharacterized protein OS=candidate division AD3 bacterium OX=2052315 GN=DLM66_00475 PE=4 SV=1
-MSKKAAEHHGQAADHHEKAAQHHRQAKTHHEAGDHQAAAHDAHTARGHHEHAAHHASEAAKAHAEEHGHK--
>tr|A0A1B9C1C9|A0A1B9C1C9_9PROT Uncharacterized protein OS=Acidithiobacillus ferrivorans OX=160808 GN=BBC27_06515 PE=4 SV=1
-SEMKLHEHHKEAAEHHEEAAKHHKEASKLYESGDHKGAAHHAHSSAGHSDYAREHESVASKKHAAMFGDK--
>tr|A0A2H9SEK4|A0A2H9SEK4_9GAMM Uncharacterized protein OS=Legionella sp. OX=459 GN=CK424_06600 PE=4 SV=1
-DKKKLHKHHLKAAEHHKKAAEHHSEAAKHHEAGEHEKGQASAYLALAHGRHAKDESCEACSHYAGIEVER--
>tr|A0A0C1UQR9|A0A0C1UQR9_9BACT Uncharacterized protein OS=Methylacidiphilum kamchatkense Kam1 OX=1202785 GN=A946_08515 PE=4 SV=1
-MADTVAEEHEKAAMHHEHAAVHYRKAAEHHRAGEHADSGHHAHIAHGHAKHAQAHAEAAAKEEANMHDKK--
>tr|A0A2W6AI54|A0A2W6AI54_9BACT Uncharacterized protein OS=candidate division AD3 bacterium OX=2052315 GN=DLM67_06925 PE=4 SV=1
-MSKEAAQHHQQAAEHHEHAGRHHREAAKAHEAGDHAKAAHHAHTARGHHEHASHHAAEAAKSHVEHHGHK--
>ERR1700726_61598
MS-KEISEHHHSAAKHHESAAYHHKEAAKHHEAGDHEKAAHHAHTAHGHASHAEHHHVEASRHHAEHHGQH--
>SRR5271165_3086472
MAQHKGADHHKQAAAHHRHAATHHEEAAKHHEAGDHEKAAHHAHAAHGHHLNAEHHTHEAAKHHATEHGGG--
>SRR5579872_1018840
RMSKESAQHHHQAAEHHEHAARHHREAARHHEEGNHEKAAHHAHTAQGHHHQAEHHAREAAKLHTEQHGQA--
>SRR5271157_582607
VMSKKAAEHHHKAAEHHEHAARHHREAAKHHEAGKHETAAHHAHLAHAHHEHAMHHAAEAAKAHLEDHGKA--
>ERR1700682_2320681
LMSMQAADHHHKAAEHHEHAARHHKEAAKHHEAGKHETAGHHAHLAHGHHQHAMHHAAEAAKAHIEHHSKA--
>SRR5262249_24366611
SMSKNATDHHNAAAEHHEMAAEHHRKAAEHHDDGNHEKAAHHAHVAQGHLHHATHHAAEAAKSHLEDHGKH--
>SRR5215469_175376
LMSKKAAEHHHKAAEHHEHAARHHKEAAKYHEAGKHETAAHHAQLANGHQQHAMHHAGEAAKAHIEDHGRA--
>SRR5580692_10590558
FMSKEASEHHQKAAEHHEHAARHHKEASKHHDAGKHETAAHHAQLARAHQHHAAHHSEEADKAHLEDHVKS--
>ERR1035441_3275339
LMSKKAVEHh-HKAAEHHEHAARHHKEAAKHHEAGKHETAAHHAHLARGRLRRCLLLSYIQL--SLPDPD-V--
>SRR5262245_3093639
HMSKKAAEHh-KKASEHLTHAARHHVEAAKHHEAGKHETAAHHAQTATGHAVHARGHAEEAVKAHAEEHGKK--
>ERR1700752_1174679
YMSKKATEHh-RKAAEHHELTARHHREAAKHHEGGRHETAAHHAHLAHGHHTYASHHAGEASKAHVEDHGSS--
>SRR6266705_6478280
RMAKQAAEHh-HKAAEHHEHAARHHKEAAKHYEAGKHETAAHHAHLAHGHLQHATHHAGEAAKAHIQDHGNK--
>SRR5215472_9654556
LMSKKAAGHh-LKAAEHHQLAAQHHREAAKHHQAGKHETAAHHAHLARGQDEHAMHHAAEAAKAHVDDYGKA--
>SRR5215472_7924964
TMSKKAAQHh-HQAAEHHEDAARHHKEEAKHHEAGKHETAAHHAHLARGHHEHAMHHAGEAAKAHIEDHGQA--
>ERR1035441_3624924
FMSKQAAEHh-HKAAEHHEHAARHHKEAAKHHEAGKHETAAHHAHLARAHHELATHHAAEAAKVHLEQYGKG--
>ERR1700677_2623774
SMSKQAAEHh-HKAAEHHEHAARHHKEAAKHHEEGRHETAAHHAHLAHGHHQHASHHAAEAAKSHVEHHGSA--
>SRR5271169_5745082
LMSNQAAEHh-HKAAEHHEHAARHHKEAAKHHEAGKPEAAAHHAHLAHGHHQHATHHAPEAAKAHIEDHGKS--
>SRR6202049_3772221
CMSKQAAEHh-HKAAEHHEHAARHHKEAAKHHEAGNHETAA-HAHLARGHHEHAMHHAAEAAKAPRLLGRGA--
>ERR1700690_1934298
PYVKESRRGpSQSRRASRTHAARHHKEAAKHHEAGKHETAAHHAHLARGHHEHAMHDAGEAAKAHVEDHGGQ--
>SRR6201997_5942927
MSDHAGVEHHHKAAEHHEHAARHHREAAKHHEEGNHETEPHHAHTPQGPSPHATHHATEAAKPHVEHHGQK--
>ERR1700683_5385528
PMAHPIAEHHKKAAHHHEHAARHHHEAAKHHEAGDHHKAGHHAHVAHGHHHQAMHHAGEAAKAHAEAHGKX--
>ERR1039458_1052396
DMSKEAAHHHKQAAEHHEHAARHHHEAAKHHEAGNHEKAAHHAHLAHAHHVLAAEHAENAAKEHLKAHGTK--
>ERR1035441_9756897
DMSKEAAHHHKQAAEHLEHAARHHHEAAKHHEAGNHEKAAHYAHLAHGHLVHATEHAENAAKEHVKSEE----
>SRR5271157_2981033
SMSKEAAQHHKQAAEHHEHAARHHKEAAKHHEGGNHEKAAHHAHVAHGHHAHATHHATEAAKAHVEAHGAK--
>ERR1039458_10647682
DMSKEAAHHHKHAAEDRKHAARHHNAA----------------------------------------------
>SRR5271165_3465347
DMSKQAAEHHKKAAEHLEEAAKHHVEAAKHHVEGVFDKAAHHAHSAHAHHVQAVEHAENAAKEHLKAHGTK--
>SRR5215469_13833100
DMSKQAAEHHKQAAEHLEQAAKHHVEAAKRHVEGVVEKAAHEAHLAHAHHVQAI-------------------
>SRR5262249_28378874
VMSEDAAEHHRKAAEHHQHAARHHEQAAHHHEAGAHEKAAHHAHSAQGHSHHANHHAAEAAKAHTEHHGAKX-
>tr|A0A142H9K5|A0A142H9K5_9BACT Uncharacterized protein OS=Hymenobacter sp. PAMC 26554 GN=A0257_23020 PE=4 SV=1
-MSKKAVDSHKKAATHHTEAAKHHTEAAKHHEAGSHEKAAHHAHTAAAHTDHAAEHATHARKSHAEEHGTK--
>tr|A0A1F3RER5|A0A1F3RER5_9BACT Uncharacterized protein OS=Bacteroidetes bacterium RIFCSPLOWO2_12_FULL_31_6 GN=A3K10_03545 PE=4 SV=1
--MKSVIEKHKKAASHLEEAAKCHQEAAKHHEAGSHEKAHHSSVKANGHSTHASELEREIQKHHVIASK----
>SRR5216683_1839118
VMSKQAAEHHKKAA--------------EHHEAGTHEKAAHHAHVAHGHALHARHHAEEAVKSHLEHHGKKX-
>SRR5277367_3271760
MMSKKAAEHHKKASEQMTHAARHHGEAAKHHEGGLHEKAAHHAHTARAHAIHAQEHAENAVKAHADEHGKKX-
>SRR5271166_5766653
HMSKKAAGHHKKASEHLTHAARHHGEAAKHHEAGSHEKAAHHAHLARGHIIHGRGHAEEAVKAHLEEHGKKX-
>SRR5262245_78877
NMSKRAAEHHKKASEHLTHAARHHGEAAKHHDAGHHEKAAHHAHTAHGHAIHARGHAEEAVKVHVEEHGKKX-
>SRR5215468_2014457
HRSKKAADHHKKASEHLTHAARHHGEAAKHHESGNHEKAAHHAHTASGHMIHARGHAEDAVKAHAEEHGKKX-
>SRR6202158_2104302
HMSKKAAEHHKKAAEHHTHAARHHGEAAKHHEGGHHEKAAHHAHTARAHGLHATEHAEEAAKAHGTEHGS---
>SRR5215475_990513
PMSKKAAEHHKKASEHLTHAARHHGEAAKHHDTGNHEKAALHAHTARGHVVHATRHAEEAVMAHTDEHGKK--
>SRR5689334_9332785
VMSKKAAEHHRKASEHHTNAARHHGEAAKHHDVGNHEKAAHHGHTARGHAIEARTHSEDAVKAHTEEHGKKX-
>SRR4029077_25657
QMSKKAAEHHKKVQEHLTHAARHHGEAAKHHESGQHEKAAHHAHVARSHVIHARGYAEEAVKAHHEEHGNKX-
>SRR6476646_3723538
StRSGSMECLGLSDSEHLTHAARHHGEAAKHHEAGSHEKAAHHAHVARGHVIHGRGHAEEAVKAHLEEHGKKX-
>SRR6516164_3211544
NrMSKKAADHHRKAAEYHTHAARHHGEAAKHHETGQHEKAAHHAHLARAHAIHARGHSEEATKAHHEQHGDKQ-
>SRR6202048_2952714
AMSKKAAEHHKQSAEHHTHAARHHGEAAKHHEAGHHEKAAHHAHTARGHALHARHHSDQAAMVHMEEHGKNK-
>SRR6202011_3428404
AMSKKAAEHHKQSQEHHTNAARHHGEAAKHHASGQHEKAAHHAHTARGHALHARHHSDQAAMAHMEEHGKKK-
>SRR6516225_9485238
NrMSKKAADHHRKAAEYHTHAARHHGEAAKHHETGQHEKAAHHAHTARGHASHATEYAEEAAKLHAEEHGKKX-
>SRR5665213_515099
AMSKKAAEHHRKASEHAAHAARHHGEAAKHHDAGHHEKAAHHAHSATGHASHARGHADEAARAHADEHGKKX-
>SRR5215831_13043785
SMSKKAAEHHKKASDHHTHAARHHGEAAKHHETGHHEKAAHHAHTARAHAIHARGHAEQATVAHSEEHGK---
>ERR1700681_52020
KMSKKAAEHHHKASEHHTHAARHHGEAAKHHEGGHHEKAAHHAHTARAHAIHSRHHSEEAAKMHGEEHGKKX-
>SRR6478672_3437904
AMSKTAADHHRKASEHSTHAAKHHGEAAKHHDSGQHEKAAHHAHTAAGHERQSREHADEAAKAHANEHGKKX-
>SRR5207302_8234716
HMSKKAAEHHRKASEHHTHAARHHGEAAKHHDSGQHEKAAHHAHTAAGHAVHARQHADESRKAHTEEHGKKX-
>SRR6202049_3861440
PMSKKAAEHHRKASEHLTHAARHHGEAAKHDDAGHHEKAAHHVHTARGHATHARGPAEEAAKAHTEEHGKKX-
>ERR1700693_2890077
PMSKKAAEHHKKASEHLTHAARHHGEAAKHYDTGEHAMGAHHAHTARGHVVHARLHAEETVKAHVEEHGKKX-
>SRR3984893_4017493
AMSKKAAEHHKQESEHLTHAAHHHGEAAQHHEAGNHEKAAHHAHTARAHVIHGRGHAEEAVKAHADEHGKKX-
>SRR6266478_7429653
TMAE---NKPRQADLSARARKSDHGEAAKHHEAGNHEKAAHHAHTARAHIIHGRGHAEEAVKAHAEEHGKK--
>SRR5580765_1108604
SMSKKAAEHHKKAEEHHTQAAHHHGEAAKHHEGGRHEKAGHHAHTARGHSLHARDHSEEAAKAHMEEHGKKX-
>ERR1700681_4628765
HMSNKAAEHHRKALEHLTRAARHHDETAKHYDTGEHAMGGHHAHTARAHMIHARGHAEEAVKAHAEEHGTKE-
>ERR1700722_6390987
-MSKEREEHHLKAAEHHEHAAKHHRAAAEHHAAGDHETAGHHAHVAHGHHTHAEHHADEASKHSANHHAT---
>ERR1700691_1558590
-MSKERQDHHLKAAEHHEHAAKHHRAAAEHHASGNEEKAGHHAHVAHGHHAHATHHAE---------------
>SRR5580692_2709317
-MSKEREDHHLKAAEHHEHAAKHHRHAAEHHAAGDHEKAHHHAHVAHGHHIHAGHHAEEASKHTANHHSA---
>SRR5450755_2590302
-MSKEREEHHLKAAEHHEHAAKHHKMAAEHHAAGDHEKAHHHAHVAQGHKTHAEHHSDEASKHTANHVPT---
>SRR6516164_8547976
VMSKKAAEHHKKASEHHTHAARHHAEAAKHHEAGSHEKAAHHAHTARGHVAHARGYAEEAAKAHVEEHGKKX-
>SRR6476661_1594845
QMSKKAAEHHRKAAEHSSHATHHHNEAAKHHEAGNHEKAAHHAHTARGHGAHVMHHADEAAKAHIEEHGKKX-
>SRR5579883_2368435
SMSKKAAEHHGKAAEHHEQAAKHHKEAQKHHEAGNHEKAAHHAHTARGHHASAEHHGNEAAKAHADDHGKKX-
>SRR5215831_19438088
-MAKNAVEHHEKAAEHHEHAARHHREAASHHESGDHQVAAHHAHVAHAHMLHASEHASEAAKAHAEAHGGQ--
>SRR3974390_1406771
-MATPAVEHHEKAPEHHEPAARHHREAAAHHESGDHEVAAHHAHVAHAHTLHASPHAAEAAKAHADAHGGQ--
>SRR3974377_1527111
-MATHAVEHHEKAAEHHEHAARHHPQAAAHHESGAHETAAQHGPVAPATHLYPLDHAAA--------------
>SRR3974377_2609624
-MATHAVEHHEKAAEHHEHAARHHREAAAHHESGAHEVAAHHTPFAPSHT-----------------------
>SRR5262245_37694928
---HKGSSHHETAAEHHETAAHHHREAAKHYEHFDHEKAGHHAHVAHAHGLHAAHHGHEAAKHHAQSHAEH--
>ERR1700738_4504323
---HKGSSHHETAAEHHEKAAEHHRAAARHYGEDDHHKASHHAHLAHAHGLHATHHGHEAAKHHAEHHDEH--
>SRR6478672_11888828
---HKGGSHHETAAEHHETAAHHHREAAKHYEHGDHEKAGHRPRGACAWTACDPSWarGRETPRGKPR----G--
>SRR5258707_6049855
---HKGGDHHESAAEHHENAAHHHREAAKHYEAGDHEKAGHHAHVAHAHGLHASQHGEEAAKHHAEHHVED--
>SRR3984957_18403883
---HTGSEHHETAAGHHESAAHHHREAAKHYEGGEPEKAGHHAHVAHAHRLHATHHAHEAANHHAERLAGQ--
>SRR6185312_2038455
---AESHVHHAKAAEHHKKAAYHHEEASRHFRDDNPAKGAHHAQLAHGHGLHANEHANNASRRFGQDYAKD--
>SRR5215469_12611957
SMSKEAAEHHRSAAHHYEHAAQHHHEAAKHHEAGDHQAAAHHAHIAQGHQHHATHHATEAAKSHAEHHGQQ--
>ERR1700683_227600
SMSKQAAEHHHSAAEHHEHAARHHREAARHHEEGNHESAAHHAHTAQGHLHHATHHAAEAAKSHTEHHGHK--
>SRR5262249_30748479
MAQDKIVQHHHAAAEHPEHAAKHHREAAKHHEADSHEKAAHHAHSAHGHSEHAAHHAAEASKHHAEQHGDH--
>SRR5471032_1000550
MSKDKIVEHHQTAADHHEHAARHHREAAKHHEADSHEKAAHHAHTAHGHSSHATHHASEASKHHAEHHGQH--
>SRR5215475_7292062
MSKDKIVEHHHAAAEHHEYAAKHHREAAKHHESDHHEKAAHHAHSAHGHSSHAAHHA----------------
>ERR1700740_1508672
MSKDKIVEHHTAAAEHHEHAARHHREAAKHHGADSHEKAAHHAQSAHGHSAHAAHHAAEASKHHAEHHGTH--
>SRR5277367_3781890
FMSKQAAEHHHQAADHHEHAARHHKEAAQLHEAGSHELAAHHAHLAHGHHQHASHHAAEAAKAYIEHHAKA--
>SRR5580692_8293406
IMSKQAAEHHQKAAEHHEHAARHHKEAAMHHEAGKHEMAAHHAHLAQGHHAHATHHAAEAAKSHVEHHGKA--
>SRR5580698_9551526
VMSKVAAEHHHAASEHHEHAARHHKAAAKHHEDGKHELAAHHAHLAHGHHQHASHHAAEAAKAHIEHHKAA--
>ERR1019366_1648353
MPKHEGAEHHKKAAEHHEKAAQHHKEAAKHHEEGRHETAGHHAYVAHGHHLTAIQHSEEAAKYHSQQHGEKK-
>SRR5580658_4588397
MPKHEGAEHHKKAAEHHEHAARHHKEAARHHEEGSHEKGGHHAHIAHGHHLHATHHAEEAAKTHSNQHGKES-
>ERR1700683_1984599
VSKHEddkhqekaaehqekvalhhedkAAEHHEKAAEHTEKAAEHHKEAAKHHEEGHHETAGHHAHIAHGHHLNATYPSEETAKHHAQQHGEKK-
>SRR5580704_7292703
MANHTGASHHHEAADHHEHAAKHHREAAKHHEAGDHVQAGHHAHIAHGHLTHATHHAEEAGKHHATEHGKS--
>ERR1041385_1551557
-MKHKGAEHHNKAAEHHEHAARHHREAAKHHEAGSHEKGGHHAHVAHGHMVQANEHTEEAAKSHMEHHGKK--
>SRR5262249_10445052
-MAHKGAEHHTKAAEHHEHAARHHREAAKHHEAGSHEKGGHHAHMAHGHSTHAHGFADEAAKHHAMEHGGG--
>ERR1700719_3807446
TMSKQAAEHHHQAAEHHEHAARHHREAAKHHEAGDHESAAHHAHSAHGHASHAEHHHHEASRHHAEQHGQHX-
>ERR1700760_623008
TMSKQAAEHHTKAAEHHDNASKHHREAAKHHEAGNHESAAHHAHTAQGHLHQATHHAGEAAKSHADTHGN---
>ERR1022692_2998277
-MSKQAAENHLKAAEHHEHAARHHKEAAKHHQAGNHEKAAHHAHTAHGHEEHADHHAGEAAKAHAQDHGSK--
>ERR1017187_7576438
-MSKQAAEHHLKAAEHHEHAARHHKEAAKHHQAGNHEKAAHHAHTARAHHENAAHHAAEAAKAHLEHHGKA--
>SRR5262249_54984532
-MSEKAAEHHRKAAEHHEHAAKHHYEAARHHDDGAHETAAHHAHSAQGHAIHADHHSGEAAKAHTEHHGSK--
>SRR5580704_771817
MNHHEAAEHHNKAADHHEHAAAHHLKAAEHHVEENHEKAAHHAHIAHGHGLHAAHHAGEATKHHTDAHGGP--
>ERR1039458_7468520
MEHHEAAEHHRKAAEHHEHAAAHHREAAKQHEAGNHEKAAHHAYVAHGHGLHAAHHAGEATKHHSDTHGGP--
>ERR1039457_6746667
MNQKDAAEQHKKAAEHHEHAAAHHREAAEHHANGNHEKAAHHAHIAHGHGLHAAHHAGEATKHHANTHGGS--
>ERR1700722_3522043
MSDHKGADHHNQAAEHHEHAATHHRAAPRHHESGDHEKAAHHAHIAHGHGLHAAHHAGEATKYHADEHGGG--
>ERR1035438_4004146
MSTHTGAEHHEKAAEHHEHAAAHHREAAIHHESGDHEKAAHHAHIAHGHGlhaapharvasrprhhahiahghgLQAAHHAGEAAKHHADEHGGE--
>SRR3981081_3201937
PMSTKAAEHHEHAAAQHEHAARHHKEAAKHHKAGNHEKAAHHAHSARGHHEHAAHHASEAAKSHTEEHGHK--
>ERR1700720_4700009
TMSTQAAEHHEKAAEQHEHAARHHKEAAKHHKAGNHEKAAHHAHTARGHHEQATEHASAAAKSHVEHHGKK--
>SRR5450759_1153254
LMSKKAAEHHRKAAEHHEHAARQHKEAAKHHDAGAHEKAAHHAHIAHAHHLHATHFADEAAKAHAEEHGSK--
>SRR5476649_602780
LMSKEAADHHRKAAEHHEHAARHHKEAAKHHDAGAHEKAAHHAHIAHAHHLHAEQHAGDAAKAHAQAHGTK--
>SRR5260370_9889087
PVSTKAAEHHEHAAAQHEHAARHHKETPKHQKAVRHEKAAQHAHTASGHAEK---------------------
>SRR5215471_19435997
PMSTKAAEHHEHAAEQHAHAARHHKEAAKQHKAGHHEKAAHHAHTACGHHEHATHHATEAAKAHTEEHGHQ--
>tr|A0A2M6XEG2|A0A2M6XEG2_9RHIZ Uncharacterized protein OS=Methylobacterium sp. CG09_land_8_20_14_0_10_71_15 OX=1975532 GN=COT56_21735 PE=4 SV=1
--KHPGADHHHKAAEHHEHAARHHREAAKHHEGGHHEKAAHHAHSAQGHAHYATHHGSEASKHHAEHHGKG--
>tr|A0A1I4D138|A0A1I4D138_9RHIZ Uncharacterized protein OS=Methylocapsa palsarum OX=1612308 GN=SAMN05444581_1317 PE=4 SV=1
--PTKIAEHHTQAAQHHEKAAEHYKEAAKHHETGAVEKGAHHAQVSQGHAVHAEYHADEAAKAHAQHHANK--
>SRR6516162_2577000
LMSKKASEHHKKASEHHSHASRHHEEAAKHHEAGHHEKAAHHAQTAMGHAIHARTHSEEAVKAHAEEHGKK--
>SRR5262249_44780301
LMSKKAAEHHKKAAEHHSHAARHHEEAAKHHAAGHHEKAAHHAHTASGHASHARGHAEEAMKSHAEEHGQK--
>ERR1700686_4403266
PMSKKAAEHHKKAAEHHTHAARHHEEAAKHHEAGQHEKAGHHAHTARGHALHARHHSDEAAKSHMEEHGKK--
>SRR5215471_16139522
AMSKKAAEHHKKASEHHTHAARHHAEAAKHHEGGHHEKAAHHAHTARAHATHARDHSEEAVKAHAEEHGKK--
>SRR2546421_8056338
PMSKKAAEHHKKASEHHTHAARHHDEAAKHHEAGHHEKAAHHAHTARGHASHTRHHSEEAARAHAEDHGKK--
>SRR6516162_7817916
PMSKKAAEHHKKASEHHTHAARHHGEAAKHYEAGQHEKAAHHAHTARAHAIHARGHSEEAAKAHHEDHGNK--
>ERR1700732_5276201
HMYKKAAQHHKQAAEHHTHAARHHGEAAKHHEAGHHEKAAHHAHTAAGHATHSRHHSEEAAKMHTEEHGKK--
>ERR1700721_288514
PMSKKAAQHHKQAAEHHTHAARHHGEAAKHHEAGHHEKAAHHAHLVRGTVLKGRGTLKGGWRATSE-------
>SRR5579872_3850512
-MSKKAGEHHQKAAEHHEHGARHHKGAAKHHQAGSYEKAAHHAHIARAHHEHAHEHAIEAAKAHAQEHGSD--
>SRR5487761_2742555
-MSKQAAEHHLKAAEHHEQAARHHKEAAKHYQAGSYEKAAHHAHTACGHEEHAAFHSGEAAKAHAQEHGN---
>ERR1700730_7170546
-NKHAATEHHLKAAEHHEHAARHHREAGKHHEASNHEKAAHHAHTAQGHMTHAHHHAGEASKHHLAHHGDK--
>ERR1700748_2579388
-MTKEAANHHSKAAEHHENAAKHHREAGKHHEAGDHEAAAHHAHTAQGHTANASHHADEAAKLHTQHHGNK--
>SRR5580698_5335757
-MTKEAANHHNKAEEHHENAARHHREAGKHHEAGDHESAAHHAHTAQGHTQHATHHAGEAAKLHTEHHGKK--
>SRR5258705_12432272
MDATKLAEHHEKTAEHHQKAAEHHRHAAQHHQQQDHEKGAHHAHLAYGHHLHATEHAEQAAKTHAEGQT----
>ERR1035438_1862924
-MHHEAAEHHRKAAEHHEHAAAHHREAAAHYEQGNHEKAAHHAHIAHGHGLQASHHADEASKHHTSSHGGA--
>SRR5580698_7177634
-MSQERIDHHRKAAEHHEHAATHHNAAADHHEAGDHEKAGHHAHIAHGHTTHAAHHAAEASKHHANEHTGE--
>SRR5208283_4889738
-MSKEAADHHRKAAEHHEHAAKHHHAAAHEHEAGNHEKAGHHAHLAHGHHALATHHAEEASKHHVTEHGHH--
>SRR5580704_6446761
-HMSEHADHHRKAAEHHEHAAKHHRAAADHHESGDHEKAGHHAHVAHGHTVHAAHHAEEASKHHANDHGHH--
>SRR5262249_53837718
MTMHKGAGHHRSAAEHHEKAAHHHREAAKHHDEGDHHRAAHHAHAAHGHATHAAHHGGEASKHHAAEHGDP--
>SRR5262249_3839383
MKEHKGAEHHRSAAEHHEKAAHHHHEAAKHHEDGDHKSAAHHAHTAHGHATHAAHHSSEASKHHAETHGDH--
>ERR1700676_1561084
LMSQEAAEHHRKAAEHHEHAARHHEEAAKHHDAGSHEKAAHHAHTAHGHHLHATHHAGEAVKTHADEYGSK--
>SRR5271156_6624548
MADHKIHEHHEKAAEHHEHAAKHHREAAKHHKAGAHEKAAHHSKIAHGHHLHATEHHEHASKKHAGDHGDA--
>SRR5580704_6068697
MHEHEIHEHHEKAAEHHEHAAKHHREAAKHAKAGDHEKAAHPSKVAHGHSLHATEHHEHASKKHADQHSXX--
>ERR1700734_3267748
MPEHDIHEHHEKSAHHHDQAAKHHREAAKHHKAGHHEKAAHHSKVAHGHSLHATDHHHHASKKHAEHHSX---
>SRR5271170_1512638
ENGHDIHQHHEKAADHYEHAAKHHREAAKHHEAGDHEKAAHHSKVAHGHALHAEEHHGHASKMHAEQHGX---
>SRR5579863_3905028
MSGHGIHEHHEKAAEHHEHAAKHHREAAKHHQSGNPEKAAHHSKIAHGHALHATEHHAHASKMHAEHHGX---
>SRR6202050_2286552
IMDQDIHKHHEKAAHHHDDAAKHHREAAKHHKSGHHEKAAHHSKVAQGHSLHATDHHHHASKKHAEHHGX---
>SRR5208282_1032254
MNSHEIHEHHEQAAHHHEEAAKHHREAAKHHEAGHHEKAAHHSKVAHGHSLHATEHHEHASKKHAEQHSX---
>ERR1700745_4273030
LHDNEIHEHHEEAAHHHEQAAEHHREAAKHQKDGDHDKAAHHSKVAHGHHLYATEHHDEAAKLHAEAHGDD--
>SRR5271154_267655
MNSHEIHDHHEIAADHHDHAAKHHREAAKHAKAGDHEKAAHHSKVAHGHSLHATEHHDHASKKHAEQHGXX--
>ERR1700733_2112371
-LRRAAKCRLELAADHHEHAAKHHREAAKHAKSGDHEKAAHHSKVAHGHSLHATEHHEHASKKHADQHSX---
>ERR1700691_6755303
MDEHDIHEHHEKAAEHHEHAAKHHREAAKHAEAGDHEKSAHHSKGARGHSLHPNAHHNEAPKKPAVQHGX---
>ERR1035437_7181262
SMSKEAALHHTQAAEQHDLAARHHREAAKHHIAGNHEKAAHHAHLAHGHHVLATEHAENAAKEHVKAHGTK--
>ERR1017187_4718788
AMSKEAAHHHTQAAEHHENAARHHREAAKQHLAGNHEKAAHHAHLAHGHHFLATEHAENAAKEHVKAYGAK--
>ERR1035437_5215839
SMSKEAAHLHTQAAEHHDHAARHHREAAKHYLAGNHA------------------------------------
>SRR5208337_5201425
YMSHEAAEHHTKAAEHHEHAARHHHAAAKAHSEGNHEKAAHHAHLAHGHHAHAAEHAEHAAKAHIEAHGEK--
>SRR5438132_4014951
HTEHPATEHHRKAAAHHEEAAKHHRAAAQAHSQGDHEKAAHHAHLAFGHHVHAAHHMQEAAKKHTEHTSAV--
>SRR6202021_2491305
TMSKEAAHHHTQAAEHHEHAARHNHEAAKHHQDGDHEAGAHHAHLAHGHHIQATEHAEHAAKHHVEAHGEV--
>ERR1700744_918969
TMSKEAAHHHTQAAEHHEHAARHHHEASKHHEAGQHEKAAHHAHLAHAHHVHAADHAEHAAKKHIEAHGAK--
>SRR5476651_2291918
MSKDKIVDHHNAAAEHHEHAAKHHREAATHHEADNHEKAGHHAHSAHGHSSHATHHAGEASKHHAEHHGKH--
>SRR5256885_10433591
MAKDKIIEHHNAAAEHHEHAAKHHREAAKHHEADSHETAAHHAHSAHGHSAHAAHHATEASKHHAEHHGKQ--
>SRR5215470_13748103
MSKAKIVEHHTSAAEHHEQAASHHREAAKHHQADDHEKAGHHAHTAHGHATQAAHHGGEASKHHADMHGKK--
>SRR5262249_5909060
AMSKDAAEHHKHSAEHHTQAAHHHAEAAKHHESGHHEKAAHHAHSANAHALHARHHAEEAAKSHMNEHGKK--
>ERR1700674_4915123
MAKKEHKEHHEAAAEHHESAAEHHREAAKHYEVGHHEKAAPHAHLAHGHGVHATHHAQEAAKHHVEHHDDD--
>SRR5476649_712169
-SHEKKLEHHHKAAEHHDHAARHHREAAEAHHAGNHEKAAHHAHVAHAHHLHAEHHGDEAGKLHAEHHGEA--
>ERR1700677_2502920
-SHEKKIEHHRHAAAHHEHAARHHHAAAEAHTAGQHERAAHHAHIARAHHLHAEHHGDEAGKLHAEHHSHE--
>SRR6516164_10081394
VMSKKAAEHHRKAAEHHTHAAHHHGEAAKHHDSGHHEKAAHHAHTAGGHALHAREHSEEASNAHMEEHGKKX-
>SRR5215813_15420037
AMSKKSAEHHKKASEHHTHAAHHHVEAAKHYEGGDHEKAAHHAHTARGHATHAAHHSEEAVKAHAEEHGKKX-
>SRR6516162_3769719
LPSATPAEPHKNAAQHHTEAARHHGEAAKHHESGQHEKAAHHAHTAGGHATHARHHAEEASRAHVEEHG----
>SRR2546423_2679145
---HKGGSHHELAAEHHETTAHHHREAAKHYGHGDHDKAGHHAHVAHAHGLHATHHGQEAAKHHAEHHEE---
>ERR1700682_6433899
---HKGGSHHETAAEHHENAAHHYREASKHYDSGDHEKAGHHAHPAHAHRLPPTHH-----------------
>SRR5262249_7960664
---HKGGGHHEIAAEHHETAAHHHREAAQHYESGDHETDGHRAHVAHAHGLHATHHGHEAAKHHAEHHKX---
>SRR5262245_44145014
---HKSGSHHEMAAEHHETAAHHHREAAKHHETGDHEKAGHHAHMAHAHELHATHHGHEAAKHHAEHHEE---
>SRR5215469_11644734
---HKGGTHHELAAEHHETAAHHHREAAKHYESGDAEKAGHHAHVAHAHELHATHHGHEAASITPSTISK---
>SRR5215471_6019152
---AKGHDHHASAAEHHEHAAHHHREAARHYEAGDHEKAGHHAHVAHAHELHAIHHGHEAAKHHAEHHEX---
>ERR1700730_10676216
---HKGGSHHEVAAEHHENAAHHHREAAKHYDSGEHEKAGHHAHVAHAHGLHASHHAHEATKEHAEHHAG---
>SRR5271163_3974060
MSKAKIAEHHRKAAEHHEKAAAHHHKAAEHHDDEDHMMAAHHAHVAHGHHHHATHHAAEAGKLHAEHHAD---
>ERR1700691_908072
MKSHELAEHHEKAAHHHAQAAEHHRHAAQHHKGGDTHKATHHAHTAHGHHLHAAHHASEAGKLHAQHHAD---
>ERR1039458_5453327
-MPKEAADHHLKAAEHHEHAARHHKEAAKHHNAGVHEKAAHHAHTAHAHHLHATHFADEAAKASCRER-----
>SRR5882757_11516447
MTNHKGAEHHRSAADHHEKAAQHHRDAARHHDDGDHGRAAHHAHTAHGHATHATHHGSEASKHHAENHG----
>ERR1700759_3684327
MSSHKGAEHHRSAAEHHENAAHHHREAAKHHDSGDHHRAAHHAHSAHGHATHAAHHGSEASKHHAEKHA----
>SRR6267154_2535493
MTNHKGAEHHRSAADHHKKAAQHHRDAARHHDDGDHGRAAHHAHTAHGHATHATHHGSEASKDHAENHG----
>ERR1700681_2967493
---HKGANHHDVAAEHHENAAHHHREASKHYDTGEHEKAGHHAHVAHAHGLHATHHAHEAAKHHAEHHA----
>SRR4249919_3050305
LMSKKAVDHHKGASEHLTHAAKHHDEAAKHHESGNHEKAAHHAHTARGHALHARHHSDEAAKAHMEEHGKKX-
>SRR5215469_8883529
AMSKKAAEHHKQAAEHHGHAARHHGEAATHHEAGRHEQAAHHAHTARGHAAHATEHAEHAAKAHAEEHGTKX-
>SRR5215472_6198358
LMSKKAADHHKKASEHLTHAARHHTEAAKHHEAGDHEKAAHHAHTARAHAAHARDHSEEAAKVHLGEHGKKX-
>ERR1017187_7736977
AMSKKAAEHHKQSAEHHTHAARHHGEAAKHHEAGHHEKAAVCTENLNPNVLTMKSAQYDAR---IYDARSLN-
>SRR5580704_12853319
SMSKPAADHHMKAAEHHEEAAKHHRAAAEHHTAGDHQKAGHHAHVANGHHVNAVHHAEEASKHHATDHS----
>ERR1019366_5760491
--PRSGAQHHDAAAQHYEEAARHHRMAAKQYQASHHEKAAHYAQLAYAHHMYAEQHAAEAAKAHAKNHG----
>ERR1700693_4750673
--PITEEEHHEAAAQHHEQAARHHRVAAKQDHAGNHEKAAHYAHLAYAHHVQAEQHAAEAAKAHAKSHN----
>ERR1700730_12173117
MSAHKHKEHHEAAAKHHEHAAHHHQEAAKHYASGHHEKAGHHAHTAHAHGAHATHHAHAAANINVEHHGEK--
>ERR1700694_6071327
MSAHKHKEHHEAAAKHHEHAAHHHQEAAKHYASMACMRRTTRMKPRSTMsSIMARS---KSARX----------
>SRR5580693_4924512
AMRKAHHEHHANAAEHHEHAAHHHREAARHYESGEHEKAGHHAHVAHGHGVHATHHAHEAAKHHAEHHSED--
>SRR5437016_8712387
EMSKQAAEHHIKAAEHHEHAARHYKEAAKHHEAGNHEKAAHLAHVAHGHHLHATHHRSEERRVGKECRSRW--
>SRR5579883_1766477
MTKQHIAEHHRKAAERHEKAAHHHRMAAEHHDDEDHVTGAHHAHVAHGHHLHATHHATEAGKLHVEHHGHH--
>ERR1700722_7570681
-MAKQTAEHHTRAAEHHGHAQKHHQQAAKHHESGNHEKAAHHAQVAQGHQTQAMHHANEAAKSHTEHHGSKE-
>ERR1700743_30692
-MAKQTAEHHTRAAESHGHAQKHHQQAAKHHTAGNHEKAAHHAHLASSHEEDARTPSVNTRKSHKDTYGDKE-
>SRR5580700_2371651
AMSKEAAHHHSKAAEHHELAANHHREAAQHHEDGDHQAAAHHAHVAQGHQAHATHHASEAAKHHVEAHGDKX-
>SRR5579863_5227466
IMSKEAAHHHSQAAEHHEHAANHHKEAAKHHEAGDHEAAAHHAHVAQGHHAHATHHATEAAKHHVQAHGDKX-
>ERR1700689_4571874
-MAHKGAEHHHQAADHHEAAAKHHREAASHHEAGNHESATHHAHVAHGHALHATHH-----------------
>ERR1700688_3733124
-MSKEAAGHHYKAAEHHEHAAKHHRAAAEHHEAGDHQKAGHHAHVAHGHTVHAS-------------------
>SRR5277367_1853101
-STHSAHEHHAKAADHLEQAAHHHREAAAHHESGDAATAGHHAHVAAGHTAHA--------------------
>ERR1700761_4254522
SMSKQASEHHNLAAEHHEHAARHHRDAAKHHKAGDHEKAAHHAHVAHGHHLHATHHATEAAKHHVEAHGEK--
>ERR1700727_2977704
SMSKQASEHHNLAAEHHEHAARHHRDAAKHHEAGDHEKAAHHAHVEHGHASHAEHHHTEASRHHAAHHGQH--
>ERR1700731_2030917
-PDPSIHEHHEKAAHHHDQAAKHHREAAKHHKAGAHEKAAHHSKIAHGHHLHATEHHEHTSKLHAEKHGS---
>ERR1700743_1236405
-SMEEIHEHHEKAAHHHEQAAKHHREAAKHHQAGSPEKAAHHSKIAHGHASHATEHHEHASKLHAEDHGX---
>ERR1700756_1994461
-HDSDIHEHHEEAAHHHEQAAKHHREAAKHHKAGHHEKAAHHSKVAHGHHLHATEHHEEAAKLHAEAHSD---
>SRR2546423_14472982
-AEHEIHEHHEKAAHHHEQAAKHHREAAKHHKAGSHEKAAHHARIAYGHRLHAAEHQDHAAKMHAEEHSX---
>ERR1700680_2379019
-MSKKAAEHHRKASEHSTQAAKHHTEAAKHHDAGQHEKAAHHAHTAGGHERHSRTHSDEAAKAHADEHGKK--
>SRR6476659_3824902
-MSKKAAEHHRKASMHSGEAAKHHDQAAKQHEAGQHEKAAHHAHTATGHERQSRMHADEAAKAHADEHGKK--
>SRR4029079_3412719
-MSKKAAEPHTKESMHTGEDANHHDQAAKHHEAGQHEKAAHHAHTATGHERHSRMHADEAAKAHADEPAKK--
>SRR5438477_9761204
MPKHEGAEHHKKAAEHNEHAARHHKEAARHHEEGSHEKVGHHAHIAHGHHLHATHHAEEAAKTHSNQHEKEN-
>SRR5580704_1157045
MPKHDSPEheekvakhqdkladhheekateHHEKAAKHHDKAAQHHREAAKLHKEGDHETAGHHAHIAHGHHLNATHHSEEAAKSHAQQHGEK--
>SRR6266571_3990511
MAGVSSTDHHTKAAEHHEMAAKHHRAAAEAHSKGDVATAAHHAHLAHGHHSHATHHMEEAAKKHTEH------
>SRR6266567_3749516
MAGHSSVDHHTRAAEHHEMAAKHHRAAAAVHAKGGIVEAAHHAHLAQGHHAHATHHMEEAAKMHTEH------
>SRR3984957_15754445
MTEIKIHEHHEQAAQHYEHAAKYHREAAKHHKAGNHEKAAHHARIAFGHYLEAAEHQNNAARQHAKEHSX---
>ERR1700730_3219255
MKEYKIYEHHEQAAQHYDQAAKYHREAAKNHNAGNHEKAAHHARIAFGHYLEAAEHQNNAARQHAKEHSX---
>SRR5208283_1776841
LHEHDIHEHHEQAAHHHEHAAKHHREAAKHHKAGDHEKAAHHTKVAHGHHLHAVDHHEHASKMHAEEHGE---
>ERR1035437_8645898
-MSKKAAEHHKKASEHLTLAARHHGEAAKHYEAGAHEKAAHHAHIARGHAILARGNAEEAVKAHVEEQAKN--
>ERR1700693_1544462
-MSKKAAEHHKKASEHLTHAARHHGEAAKHHEAGAYEKAAHHAHAARGPGNSRSGTRX---------------
>SRR4029077_4859853
-MSKKAAEHHHQAAEHHEHAARHPRDAARHYEAGDHETAAHHAHTAQGHLHHATHHSTEAAKQHAEHHGQK--
>ERR1017187_6129136
KMSKKAAEHHRKAAEHHEHAAHHHKEAAKHHDAGAHEKAENHAHRAHAHHLHVTHHYEE--------------
>SRR5215472_2424335
AMSKKSAEHHTKAAEHLEHAAHHHKEAARHHEAGAHEKAAHHAHIAHAHHVHSHHHADEAAKSHLEDHGKL--
>SRR5450756_2276617
LMSKKAAEHHRKAAEHHEHAARHHKEAAKQHDAGAHEKAAHHAHIAHAHHGGKTTPLTYAVP-----------
>SRR5208283_368143
MAQHSGSGDHREAAEQYELAARHHREAAKAHDLGNHEKAGYHAYVAHAHHTLATQHAEEAMKHYATSHA----
>ERR1700723_380338
MS-HSGSHHHREAAEHYDQAAKHHREAAKHHDAGHHEKAGYHAYVAHAHHTFAAQHAEEAEKHYATAHA----
>ERR1700689_1737127
MAQHSGSHHHREAAEHYDQAAKHHREAAKHHDAGSHEKAGYHAYVAHAHHTFAAQHAEEAEKHYAPSHA----
>SRR5215467_2810391
-MSTKAAEQHDRAAEHHEHAARHHKEAAKHHKAGNHEKAAHHAHSARGHHEHAAQHGAEAAKAHTEEHGHQ--
>SRR5450830_554856
TMSKKASEHHRKAAEHHKLAATHHEEAAAHYDKGNHEKAAHHAHVAHGHTLHATHYAAEAAKMHVEEHGSKK-
>ERR1017187_7609860
---NKKIDHHRHAAAHHEHAARHHHAAAEAHASGLREKAGHHAHVAHAHDLHAQHHDDEAAKLHAEHHAGEP-
>ERR1700677_4341665
---QKRIEHHQHAARHHEQAATHHHAAAEAHSAGHHEKADHHAHVAHAHHLHARHHGDEAAKLHAEHNAHED-
>SRR5262249_39732114
QMSKKAAEHHKKAQEQHSHAARHHGEAAKHHEAGHHEKAAHHAHIARAHAIHARHYSEEATKAHGEEHGDK--
>SRR5215467_6832124
PMSSHAVDHHRKAAEHLEHAARHHQEAANHHEAGHHEKAAHHAHLARAHAIHARGYSEDATKAHHEDHGNK--
>SRR5215470_12036960
QMSKKAAEHHKKASEHHEHASHHDAEAAKHHESGHHEKAAHHTHTASGHAIHARHHSEEAGKAHAEDHGHK--
>ERR1700759_5669011
PMSKSAADHHKKAAEHHQHAAKHHTEAAKHHEAGHHEKAAHHAHVAHVHSSHAQEHHEHASRAHGEEHGSK--
>SRR3982074_501293
--THQGGEHHETAADHHESAAHHHREAAKHYESGDHEKAGHHAHVAHAHGLHATHHGHEAAKHHAENHKYP--
>SRR5215467_6148838
--AHKGGSHHELAAEHHETAAQHHREAAKHYEAGDHEKAGHHAHVAKAHGLHATHHGHEAAKNHAEHNESA--
>SRR5215813_14120567
--THKATSHHETAADHHEAAAHHHRAAAKHYESGDHEKAGYHAHVAHAHGLHAAHHGQEAAKHPAEHHAEH--
>SRR5262252_190131
--SHKGGDHHETAAEHHEEAARHHREAAKHYEDGDHHKAGHHAHLAHAHGLHATHHGHEAAKHHAEHHADH--
>SRR5215471_20087447
--THKGGSRHETAADHHETAAHHHREAAKHYESGDHEKAGHHAHVAHAHGLRPIMGKRPRSITPNI-------
>ERR1700688_719809
--SHAGSEHHETAADHHESAAHHHREAAKHYEGGEPEKAGHHAHVAHAHGLHATHHAHEAPKHHAEHHPEE--
>SRR5215467_15706025
----AKHEHHEKAAHHHEQAAKHHREAAKHHQAGNHEKAAHHSKIAHGHHLHAGEHHDHA-------------
>SRR5215471_17268835
----TIHEHHEKAAEHHEHAARHHREAAKHAQAGHHEKAAHHSKIAHGHSLHAAEHHQHA-------------
>SRR6202051_5226611
----TIHEHHEKSAHHHEQVAKHHREAEKHHKAGDHEKAAHHSKIAHGHHLHAVEHHDTA-------------
>SRR5580700_691679
-MSQKGVDHHLKAAELLEHAAKHQRSAAKYHGSGEFEKAAHHAMISHGHLVHAMEHVEGASKHVAENHDS---
>SRR5271154_6203042
-MSQKGVDHHLRAAELLEHAAKHHRTAAKHHETGEFEKSAHHAMVAHGHLVHAIEHVQEASKHHAFEHDT---
>ERR1700692_4913725
-MSQKGVDHHLKAAELLEHAAKHQRSAAKHHGAGAVEKAAHHAMISHGHLVQASEHIEGASKHQTESHDS---
>SRR5947209_798729
-EHLTGTERHLAAADHHERAASHHRDAAKHYAEKDFARAAHQALIAHGHMQQAVWHANEATKYHIEHHSN---
>SRR5580704_13162530
-----ASKHHHDAAEHHEKAAHHHREAAKHYEEDESETAAHHAHTAAGHGAHASHHTTEAAKLHTQHHGX---
>ERR1700743_439014
-----ASKHHHDAAEHHEKAAHHHREAARHYEEDDTEGAAHHAHSATGHGTHAHHHASEAS------------
>SRR5580658_837536
-----ASKHHHDAAEHHEKAAHHHREAAKHYEEEDADAAAHHAHTASGHGHHAHHHAAEASKAHAEHH-----
>ERR1700691_3551227
-----ASQHHHDAAEHHEKAAQHHREAAKHYEDEDHDAAAHHAHSASGHGHHANHHAAEARQPHPQHHGP---
>SRR5258706_712044
---HPSHDHHMKAAEHHEHASKHHKEAAAHHASGHSEKAAHHAHTAHAHTLHAAHHAGEAAKHHVTHKK----
>SRR5215472_8550299
---HPAQEHHTKAAEHHEHASKHHKEAATHYAAGAHEKAAHHAHSAHAHALHAAHHAGEAAKHHTSHHA----
>ERR1700689_759555
---HPAHEHHLKASEHHEHASKHHKEAAGHHAAGHHEKAAHHAHTAHAHTLHAEHHASEAAKHHVSHKK----
>SRR5271157_4256807
---HPAHEHHVKAAEHHEQAGKHHKEAAAHYASGDEAKAAQHAHTARAHTLHAEHHAGEAAKHHVSHKK----
>ERR1700730_16246211
VMAHKGAEHHKKAAEHHTHAAHHHREAAKHHEAGTSEKGAHHAHAAHGHTTHARHHADEAAKHHADEHGHS--
>ERR1700730_11984108
VMAHKGAEHHKKAAEHHAHAAHHHREAAKHHEAGTTEKGAHHAHAAHGHTLHARHHGDEDGKAL---------
>SRR5664280_462104
SMSKKAAESHKKVSEHLTHAARHHTEAAKHHETGQHEKAAHHAHIARAHATHAREHSENAAKSHLEEHGKK--
>SRR5450759_554306
SCLRKPQRRIKKASEHLTHAARHHTEAAKHHETGQHEKAAHHAHIARAHATHAREHSENAANTKSRYPQPI--
>SRR5512139_3675460
VTSKKAAESHKKASEHLSHAARHHTEAAKHHEAGQHEKAAHHAHTARAHATYAREHSENVAKAHSEGIKX---
>SRR5262249_4493708
----PASTHHHAAAEHYEKAAHHHRLAARLYEDDESGMAAHHAQSAAGYSAQAAHHSAEASKLHAHHHGEE--
>ERR1700759_2735061
----PASTHHHAAAEHHEKAAHHHRQAASHYEDNDSDTAAHHAHSATGHGAHAAHHGAEASKLHAHHHGEE--
>ERR1700733_7137713
----PASTHHHAAAEHHEKAAHHHRMAAKQYEDERAEAAAHHAHTASGHGAHAAHHSAEASKLHAHHHAEE--
>SRR5277367_858819
----PVAEHHHAAAEHHEHAARHHREAAKHYEEDDAETGAHHAHTASGHGAHAAHHAVEASKLHAHHHGSE--
>SRR5215472_5482998
TMSHATIEHHRKAAEHHEHAARHHREAAARHESGDHHTASHHALIAQGHLHHATHHTSEAAKHYANSHTEY--
>SRR5262245_47040845
PMSKKAVEHHRKAAEHSSHAEHHHNEAAKHHEAGHHETAAHHAHTARGHVVLTLHHAQEAAKAHAEEHGKK--
>ERR1700722_17094089
AMSKKAAEHHKKAAEHATHAAHHHTEAGKHNDAGHHEKPATHADPAHGDASHARHHAEEAARAHTEEHGKK--
>SRR5208282_3791820
-------DAHNKAAEHHENAAKSHRMAAEQHRKGEHEKGREHASQARAHSKTAHEHSETA-------------
>SRR6266481_543054
-HVEKGCGTPQKASEHLTHAAHHHGEAAKHHEAGHHEAAAHHAHTAHGHAIHARGDAEEAVKAHVEEHGKKX-
>SRR5258706_4967713
-HVEKGCGTPQKASEHLTHAAHHHGEAAKHHEAGLQIPVHRG------QSFRRIADSVPVIADSFRX------
>SRR4029077_733555
-LPLIWSPLHKKASEHLTHAARHHGEAAKHHEAGNHEKAAHHAHTARGHATHARGHAEEAAKAHTEEHGKK--
>SRR5471030_639260
-MSKKAAEHYKQSVEHHTHAARHHGEAAKHHEAGQHETAAHHAHTARGHATYARGHAEEAVKAHTEEHGKKX-
>tr|A0A1H5INE7|A0A1H5INE7_9RHIZ Uncharacterized protein OS=Rhizobiales bacterium GAS191 GN=SAMN05444161_5687 PE=4 SV=1
-MSKKAAEHHKKAAEHATHVARHHGEAAKHHEAGHHEKAAHHAHTAMGHAFHARGHAEEAAKAHAEEHGKK--
>SRR5260370_6339496
-HVEKRLRDTTKKLQNISRMRRItMGRLPSIMRLDTTKQRHTTLAPRHGHAIHATGHAX---------------
>tr|A0A225DK00|A0A225DK00_9BACT Uncharacterized protein OS=Fimbriiglobus ruber OX=1908690 GN=FRUB_09278 PE=4 SV=1
-MSKKAAESHKKAAESHKKAGEHHEQAAKHHEAGNHEKAAHHAHTAKGHQTHAERHTNDAAAHHAEEHGAK--
>SRR5262245_23436742
--PHSGRDHHETAAEHHENAAHHHRQAAKHYETGDPEKAGHHAHLAHGHGVHATHHAHEAAKHHAEHHGNH--
>SRR5208283_915204
--SHKGSHHHKAAAEHHSKAAHHHSKAAEHYEEGDHEKGGHHAHLAHAHGLHATNSAHEAAKHHAEHHGNE--
>ERR1022692_1187252
PMSKKAAEHHRKASEHLTHAASHHGEAAKHHDAGYHEKVAHHAHTARGHAIHARRHAEDAVMAHTEEHGKKX-
>SRR6266853_2390647
LMSKKAAGHHKKASEHLTHAAHHHGEAAKHHEAGHHETAAHHAHIASGHAIHARGYAEEAVKAHVEAYGKKX-
>SRR5262249_7896378
TMSKKAAEHHRKASEHHSHAARHHQEAAKHHDSGHHEKAAHHAHTAGGHAIHARDHAEEARKAHTEEYGKKK-
>SRR6478735_11977141
TMSKKAAEHHRKASEHLKHAARHHEEAAKHHDAGHHEKAAHHAHTARGHVIHGRGHAEEAVKGSYRGARQKI-
>SRR5215469_8564958
LMSKKAAEHHRKASEHLKHAAHHHEEAAKHHDAGHHEKAAHHAHTARGHVIHGRGHAEEDVTAHTEEHGKKS-
>SRR4029079_15311338
DDTHDRAEHHRKASEHHSHAARHHEEAAKHHDSGHHEKAAHHAHTAGGHAIHAIDH-----------------
>SRR3974390_1082698
MTDHDIHHHHHEAAKHHEAAAEHHRKAAHHAEAGDHEKASHHAHLAHGHKLHAVEHAEHAAKKHAHHHGNG--
>SRR5579862_249647
MTEHKIHHHHLEAAKHHEHAALHHRKAAEHEEAGYHELASHHAHIAHGHKLHAIEHSEHAAKKHTHRHADK--
>SRR3974390_2923829
MSEHEVHHHHREAAKHHEHAAEHPRRAATHAEAGEHEKASHHAHLAHGHKLHAIEHAEHAAKKHAQKHGHG--
>SRR5450631_4335944
MNEHDIHDHHHEAAKHHEHAAEHHRKAAAHAEAGEHEKASHHAQLAHGHKLHAIEHAEQAAKKHVHKHGNG--
>SRR5258705_12677773
----LAQGHHVKAAEHLEQASKHHNEAAGHSAAGHHETAAHHAHSAHAHMLQAAHHASEAAKAHRVHK-----
>SRR5215831_19127532
LMSKKAAGHHKKASEHLKHAALHHEEAAKHHEVGRHETAAHHAHTAMGHIIHARGHAEEAVKAHVEEHDRH--
>SRR5215813_12713863
FMSKKAAGHHKKASEHLKHAALHHEEAAKHHEVGRHETAAHHAHIAMGHIIHARGHAEEAVKAHVEEHDRH--
>SRR5262249_35423489
VVSKKAAGHHKKASEHLAHAVRHHEEAAKHHDAGHHETAAHHAHLATGHTILARGHVEEATKAHVEEHGKK--
>SRR6516225_2630054
LMSKKAAGHHKKAAEHLTHAARHHEEAAKHHDAGHHETAAHHAHLATGHAVHARGHAEEAMKAHT--------
>SRR5258706_11303519
LMSKKAAGHHKKVSEHLTNAAHHHEEAAKHHEAGRHETPAHHCSHRDGPSNSCX-------------------
>SRR6266403_6300338
IMSKKAAGHHKKVSEHLTHAAHHHEEAAKHHEAGRHATATHHAHTAMGHMIHAKGHAEEAVKAHVEEHGRS--
>ERR1700730_16364569
LMSKEAADHHRRAAEHHEHAARHHKEAATHHDAGSHEEAAHHAHTAHGHHLHASHHASEAAKAHAHEHVS---
>SRR3984893_11274889
LMSKQAAEHHRKEAENHAYADRHHKEAAKHHDAGSHEEAAHHAHSAHGQHLHATHHAGEAAKAHAHEHVS---
>SRR5260370_10324744
PMSKEATEHHRKAAEHHEHAARHHKEAAKHHDAGSHEEAAHHAHTAHGHHLPATHHAGEPAKAHPHQPST---
>SRR5579864_1471419
MSDHDIHEHHEMAAEHHENAAKHHREAAKHAKSGDHGKSAHHSHAAHGHALHAHEHHGHASKLHAEHHG----
>SRR6201999_4428379
MSTHEIHEHHDKAAEHHEHAAKHHREAAKHAKDGDHEKSAHHSKVAHGHALHAHEHHGHASKKHADHHS----
>SRR6202000_1008192
MSSHDMHEHHEKAAEHHEHAAKHHREAAKNSKAGDPEKSAHHSHAAHGHALHAHEHHGX--------------
>ERR1700733_5798165
MDSPEIHEHHEKAAEHHEHAAKHHREAAKHAKAGNHEKSAHHSKVAHGHSLHANEHHEHASKKHAEHHG----
>SRR5262245_59006430
MSTHQHKEHHESAAEHHAKAAHHHGKAAEHYEEGEHEKGGHHAHLAHAHGLHATHAANEAAKHHAENHGVH--
>SRR5262245_43150745
MTTHRHTEHHETATEHHAPAPHHHRKAAEHYEDGEHEKGGHHARLAHAHGLHATHAADEAAKHHAENHGEH--
>SRR5262249_27898195
MSTLHQKDHHEAAAEHHAKAALHHRKAAEHYEEGEHQKGGHHAHLAHAHGLHATHAANEAAKSHAEHHDEH--
>ERR1700685_3638131
SSAKSHKDHHEAAAEHHDKAAEHHRKAAEHYDSGDHEKGGHHAHLAHAHGLHATSAAHEAAKHHAEAHGDH--
>SRR5271170_782980
---SLLADHHDKAAEHHEAAADQHRQAAEHHRSAAHEKAAHHAHLAHGHHLHAAHHAEEAGKQLATAHA----
>ERR1700688_4600441
---SKLADHHDKAAEHHEAAAGHHRQAAEHHRTANHEKAGHHAHVAHGHHLHAVHHAEEAGKHHAEAHH----
>SRR5579864_6620188
HMSKKAAEHHKKAAEHLTNAAHHHKEAAKHHDAGHHEKAAHHAHTARGHAIQGRGHSEEAVKAHTEEHGKKX-
>SRR5580704_16417950
VMSKKAAEHHRKASEHLTNAARHHSEAAKHQDSGHHEKAAHHAHTASGHASQARSHADEAGRAHAEEHGQKX-
>ERR1700730_9174416
------KDEHNKAAEHHENAAKSHRAAAEHHGKNEHEKAKEHSRSPQQHSQNARQHSEQA-------------
>SRR5580692_11307601
------KDDHNKAAEHHDNAAKSHRAAAEQHGKGDHAKGKEHSATAQQHAQSAGKQSEQA-------------
>SRR5271155_1227401
------KDAHNKVAEHHENAAKSHRAAAEQHGKSDHAKGKEHSTNAQQHSQNARQHSEQA-------------
>SRR5262252_663114
---HKGADHHSAAAEHHENAARHHREAAKHYQSGDHHKAGHHAHLAHGHGVNATHHAHEAAKHHAEH------
>SRR6185295_16454184
-MSKQAADHHRKAAEHNEHAAQNHKEAAKYHEAGNHEKAAHYAHLAHAHHLHVAHHSAEASKSHLEHHGKK--
>SRR4029079_628390
-MSKQAADHHKKAAEHNEHAAQNHKEAAKYHEAGNHERAAHYAHLAHAHHLHVAHHSAEASKSHLEHHSTK--
>SRR5436305_1246676
----PATEHHTKAAEHHDRAAQQHRDAAKHYEDDKHETAAHHAHSAHGHASSAQEEATQASKKHAAHHSGQ--
>SRR5215469_17711277
LMSKKAAEHHRKASEHLKHAARHHEEAAKHHDAGHHEKAAHHAHTARGLIAHNAPKRELPIPAQTEQEPSI--
>SRR6476660_854516
-MSKTAADHHRKASEHSTHAAKHHGEAAKQHDAGQHEKAAHHAQTASGHEREARMHSGEDAKAHANEHGKK--
>ERR1700733_13046317
---KNASDHHHTAAKPHEHAAKHHKLAAEHHASGELAKAARHAHVAHGHHLSAEHHHHEAAKHFAEHNTD---
>SRR6266478_6069517
LMSKKAAGHHKKVSEHLTHAAHHHEEAAKHHEAGRHETAAHHAYLAMGHLIHARGYAEEAVKAHVDEHDRP--
>SRR6516162_5521683
GLDRGGTEHHRKASEHLKHAAHHHEEAAKHHDAGHHEKAAHHAHTARGHVIHARGHAEEAVKAHTAEHGKK--
>SRR6516162_9495542
GSTEVALNTTGRHRNTSSMPPTTTRRPPSTTMPDITKKAAHHAHTARGHVIHGRGHA--AVNAHTEEHGKK--
>SRR5580704_15213454
--NITSMRLSIIAKPPPTWRMAItRRPRILRIlpmLTTDT-PNITPAKLQKPISSFI---TST-----PPLPNK---
>ERR1700727_2850150
--AKHAADHHEHAAKHHEHAAEHHREAAAHVADGDHEAGAHHAHLAHAHHKHAEHHAGEASKAHIELHHE---
>tr|A0A1N6HF04|A0A1N6HF04_9BURK Uncharacterized protein OS=Paraburkholderia phenazinium GN=SAMN05444168_3227 PE=4 SV=1
--QHEVHHHHHEAAKHLDSAAKHHREAAKHAEAGDHEAASHHAHLAHGHGLHAGEHAEHAAKKAAHLHSG---
>ERR1700731_210757
--AKHAAEHHEHAAKHNEHAPNHHRKAAAHVADDDHESGAHHAHLAYAHHKHAEHHAGEASKAHIELHAG---
>SRR6201999_2564239
--KHPASKHHHDAADHHEKATHHHREAAKHYEDEDAETAAHHAHTASGHSHHAHHHAAEASKAHVQEHGH---
>SRR5208337_642429
--QHPAQAHHTKAAEHHEHAMKHHKEAATQYASGHPEKAAHHAHSAHAHALQATHHAGEAAKGHISHAQKK--
>SRR5271166_4746791
--QHPAHGHHTKAAEHHDQAMKHHKEAATHYAGGQHEKAAHHAHTAHAHSLQASHHANEAAKAHVSHGQKK--
>ERR1035438_9181592
-----AKEHHDKAAEYHEHAAKAHRAAAEHHGKGDHVKGKEQANAAKQHSQTANQHTDQA-------------
>SRR6185295_6369403
-----MKDAHNKAAEHHENAARSHRAAAEHHGKNDHAKGKEHSTKAQEHSQNARRHSEDA-------------
>SRR5580704_8854768
-----SRDEHNKAAEHHENAAKAHRAAAEHYGKGDHAKGKEYATSAKQQSQTANQYSDQA-------------
>SRR5580704_7478170
-----ARDEHNKAAEHHENAAKSHRAAAEHHGKGDHSKGMEHSTNAQQHSQNARQFSDDT-------------
>ERR1035438_3719677
---HKGIENHRKAAKHHEEAAKHHHDAAKHHEAGNHDKACESTVKAHGHHCLASDHMREVSKQHR--------
>SRR5579862_8090639
---QKGIENHKTAAKHHEEAAKHHLEAAKHHEAGNHDKACESTVKAHGQHCLASEAEREDVKH----------
>tr|A0A1F3K8Y4|A0A1F3K8Y4_9BACT Uncharacterized protein OS=Bacteroidetes bacterium GWF2_33_38 GN=A2W98_03950 PE=4 SV=1
------MENHKKAAKHHEEAAKHHHDAAKHHAEGNHEKASHSAVKADGHHCIASEARKEDAKHHT--------
>SRR6516164_3816328
-MSQKSAEHHTKAAEHHEHAARHHREAAKHYTAGSHEKAAYHAHVAHGDHLHAIYHAEEAAKYS---------
>SRR5580692_7223228
-MSTQGTEHHIKAAEHHEHAARHHRVAAEHYAGGNDERAAYHAQVAHGHHLHAIHHAEEAAKYT---------
>tr|A0A2H5FQX6|A0A2H5FQX6_9GAMM Uncharacterized protein OS=Legionella sainthelensi OX=28087 GN=CAB17_11925 PE=4 SV=1
----KLHQHHSSAAEHHRKAAEHHGEAAKHHQGGDHEKGNHHAHLAHGHQEHAKHHSSEAAKHTTGHERKE--
>tr|G9EPV5|G9EPV5_9GAMM Uncharacterized protein OS=Legionella drancourtii LLAP12 OX=658187 GN=LDG_7296 PE=4 SV=1
----KLKQNHTTAADHYKKAAEHHLEAAKNHEAGNHEKGNSHAYMAHGHSKQAKIHGSDACCHSAGIDTKK--
>SRR5271166_4754981
-MSQQSAEHHTKAAEHDEYAARHHREAAKHYASGNHEKAGYHAHLAHGHELHAINHAEEAAKYEIKFISEGT-
>SRR5271165_5204439
-AAC-----TSSVSPCLMFPRRNSSASgSSRYFPTA-RRIGRAPXX----------------------------
>SRR5208283_2469540
----KIAEHHAQAAQHHEKAAEHHKEASKHYEAGAVEKGAHHAQVAQGHAVQAEYHADEAAKAHAEHHGGK--
>tr|A0A257S911|A0A257S911_9PROT Uncharacterized protein OS=Acidiphilium sp. 21-60-14 OX=1970292 GN=B7Z67_11670 PE=4 SV=1
MMAHTTHEHHAHAAMHHERAAHHHHEAAKHAEAHEHEAAAHHAHLAHAHHLHATHHADEAAKQAADTHA----
>SRR5262249_1401492
---HKGGSHHEIPDENHDHHATHHRVRRQTSRSGEALRSGR--------------------------------
>SRR5260370_35260483
---HKGGSHHQTAAEHQQTAAHHHREAAKHNEAGHPGQTGPT-------------------------------
>SRR5260370_40037558
---HKAGSHHETAPEHHETAAPHHRASAQHYEA----------------------------------------
>SRR5450631_1542563
------QQHHEKAAEHHEQASKHHKEAVKHHESGDEKTAAHHAHIAHGHSAQATEQETEASKKYAEKHNPK--
>SRR5689334_7481690
XMLNEAAEHHKKAAEHHEFAARHHKEAAKYHETGFHEKAVYHARLAHEHHIHATYHASKG-------------
>SRR5690242_14891423
XMSKQIAEHHKKAAEHHESAAHHNKQAVMHHEAGSYEKAAYHARLAHEHYVRATYHASKD-------------
>ERR1019366_9440480
--PAAAAKQDDAAAQHYEEAARHHRQAAKDYQAGHFEKVSHHSHLAYAHHLHAEQRSEEAARAHLKNYFD---
>ERR1035437_3282233
--PGAAAKHHDAAARHYEEAARHHRQAAKHYQSGHHEKVSHHAHLAYAHHLHAEQHAEEAAKAHIKNHLD---
>ERR1700688_1358693
--PRTGAQHHEAAAQHHELAARHHRVAAQHDLSGHHEKAGHYAHLAYAHHLHAEQHGAEAAKTHAKHHTG---
>SRR5215813_1807967
-MSTKAAEHHEQTVPRLTLLVKSPLHR---APSGVTRDGVRAAQSVPA-------------------------
>ERR1700722_19382195
MSTLNKAEHHQAAADHHEKAAEHHREAAKHHDEGEHHLSGHHAHIAHGHGLQADHHADEATRHHVEAHSH---
>SRR5579863_25933
MSTLNKADHHHAAAEHHEKAAKHHREAAKHHEDGEHHLSGHHAHVAHGHGLQADHHAGEAAKHHVETHSH---
>SRR5450759_2404522
-MSKKAAESHKKASEHLTHAARHHAEAAKHQEAGQHEKAAHHAQNARAQATYAREHSENAAKAHFEEHGKK--
>SRR5450830_872629
-ILVERAASHKKASEHLTHAARHHAEAAKHQEAGQHEKAAHHAQNARAQATYAREHSENAAKAHFEEHGKQ--
>ERR1700721_1610003
-MSKKAAEHHKKAAEHATHAARHHTEAGKHHDAGHHEKAAHHAHTAHGPASHARHTAEDAPRPQTERTG----
>ERR1700676_2341835
-MSKKAAEHHKKAAEHATHAAPHHTEAGKPHDSGHHEKAAHHAHTAHGHASHARHHAEEAARAPPGEDAHS--
>SRR6516162_2935501
EMSKQAAEHHIKAAEHHEHAARHHKEAAKHHEAGNHEKAAHHAHVAHGHHLHAMHHHEEAMIFLGEKD-----
>SRR5215470_6709139
----TVVGFRVRDSANPNPSLRTRLLRGKPR--------------TVGLRLKSIRI---TVPREEANV-----
>SRR5580704_10693489
-----TWEHYHHAAGHHEQAAYHYKKAEKYDQAEEHEKAAHHAYLAHGHNQHAIHHDVEAARLHPEHCD----
>SRR6185437_1950867
-----TSDHHLRAAHHSEQAAKHHHEAAKHEEAGAHDLAAHHAYLAHGHGEHAAHHRVEAAKQHADHCD----
>SRR5579872_6549096
-----TWEHYHHAARHHEKAAYHYNEAAKYDEGQEHEKAAHHAYLAHGHNQHAMHHETEAAKLHAEQCA----
>SRR5579859_232825
-----TWEHYHHAARHHERAAYHYKAAAKYDQTEEHEKAAHHAYIAHGHTQQALHHDAEVAKLHAEHCD----
>ERR1700733_13494560
-----TWEHYHHAARHHEKAASRLHEAAKYDQAEEHEKAAHHAYLAHGHGQHATHHDVEAAQPHSEHCN----
>SRR6476620_2890533
-----TWEHYHYAARHHERAAYHYNEAAKFEQANEHERSAHHAYLAHGNTQHAIQHDAQAAKLHAEHCD----
>SRR5271170_5375838
-----TWEHYHHAARHHERAAYHFNEAAKYNQAEEYEKAAHHAHLAHGHNQHAVHNENEAAKLYASQCD----
>ERR1035438_5075301
-----TWKHYHHAARHHEKAAYHFNEAAKYDQAEEHEKAAHHAYLAHGLSQNPVLHDVEAAKLHAEQCN----
>ERR1035441_11100738
-----TLFPYTTLFRSHERAAYHFNEAAKYDEGEEHEKAAHHAYLAHGHNQHAIHHDVEAAKLHAEHCD----
>SRR5664279_1442597
-----TWEHYHHAAHHHERAAYHYKEAAKYDQAEEHEKAMHHAYLAHGHTQHAIQHDIEAAKSHADLCD----
>SRR5664279_4926826
-----TWEHYDLAAHHHARAAHNYQEASKYSQAEEHEKAMHHAYLAHGHSQSAIQHETEAARLHAEECE----
>ERR1035438_7334095
-----TLEHYQGAAHHHERAAYQFKEAAKYHQSEEDEKESHHAYLAHGHAQHALLHEVAAAKLHVEKCD----
>ERR1700721_2784080
---RRSAEHHTLAANHHEHATRHHHEAAKHFQNDDHAHAAHQAQIAYAHTRRAIRHSDGSCRILYGTAWA---
>SRR4051812_19989905
---RRSAEHhHTFAAHHHEQAARHHHEAAKHFQNDDHAHAAHQAQIAYAHTRRAIRHSNQAAEYYTELDDR---
>ERR1700730_1514122
---RRSAEHHSLEAHHHEQAARHHHEAAKHFQNDDHAHAAHQSQIAYAHTRHAIRHSDEATEYYTEQHGL---
>ERR1700722_13050175
---RRSAEHHTLAAHHHEHAARHHHEAVKHFQNDDHAHAAHQAPFVTATKLPNIiRNSMGGLRPTA--------
>SRR3954451_3354512
----TGTEHHDAAAVHHEQAASHHREASRHYAEKDYAHAAHQALIAHGHTQQRQPSTKSS-------------
>SRR5579872_2397560
----TGTEHHVAAAEHHEQAATHHRQAAKHYAEKDYAHAAHQALIAHGHTQQAVRHGNEATKYHLEQHGKD--
>SRR6185503_15719553
----TGAEHHTAAAKHHEQAASHHRQASRHYSEKNYIKAAHQGLIGHGHSQRAIRHGNEATKYHVEHEEKA--
>SRR5258706_11916707
----TGTEHHLAAAEHHEKAAVHHRGASECYAKQDYAQAAHRALIAHGHTQQAVRHGNEATKYHLEHDKE---
>ERR1700741_2922018
----TGTEHHDKAATHHEQAERHHREGSLHYAEGAYAHAAHQALIAHGHTQQAIRHGNEATKYHVEHHGRF--
>SRR5205807_4931412
-------DTHTKAAEHHENAAKSHRAAAEHHGKGDHAKGHEHSSTAQQHSKTAREHSETAHKKSGEHAGR---
>SRR5271167_149749
-------DTHAKAAEHHEVAAKAHRTASEHHGKGDHATGHEHSTTAHRHSETAHGHSKEAHEKSSQHAGK---
>ERR1700679_4233730
------KETHTKAAEHHENAAKSHRAAAEHHGKGEHTKGQEESTKAQAHSKTAREHSDM--------------
>SRR5450759_136227
--------KHNMAAEHHEKAAKSHRTAAEHHGKGEHEAGQRHSSEALEHSKNAHQHSQEAHNKSIEANKK---
>ERR1039458_1966516
--------KHNMAAEHHEKATKSNRTAAESAGAILGHITPRGRGAAEKKX-----------------------
>ERR1019366_4731186
--------KHNMAAEHHEKAAKSHRTAAEHHGKGEHGAGHRHFEAPLGDPAPVVAPKYVCPrgdytwyQKSAGSPK----
>ERR1035437_823894
--------KHNMAAEHHETAANSPRTAFSGYHSTCSSPKAD-----L------------CSaktfavgNX----------
>ERR1035441_4576807
--------KHNMAAEHHEKAAKSHRTAAEHHGKGEHEAGQRHSSEALARTFKECSSALTRGPX----------
>tr|A0A0M8YVQ9|A0A0M8YVQ9_9ACTN Uncharacterized protein OS=Streptomyces purpurogeneiscleroticus GN=ADL19_30475 PE=4 SV=1
--------AHTEAAEHHEKAAKSHRTAAEHHGKGDHADGHKHSTEAHGHSTTAHERSTKAHg-KSGEHHT----
>ERR1039458_5552437
MATHPAAEHHTKAAEHHKAAAAHHEQAAEHYGHGNYEKAAEHAHHAHGHHALATHHMEEAAKAHATHPDT---
>SRR6185312_11034738
----SGAEHHLAAATHHEQAAAHHRLASQHYAEKDYAHAAHQALIAHGHGQQAARHANEATKYHIEHHDAVP-
>SRR6185437_5883084
----TGTEHHVAAADHHELAARHHRNASKHYEEGDHAHAAHQALIAHGHAQLAARHANEATKYHVEHHGDAE-
>SRR5581483_9044389
----TGTEHHDAAAVHHEKAALHHREASRHYAEKDYAHAAHQALIAHGHTQQAIRHGNEATKYHVEHHGNPS-
>SRR6185437_768864
----SGTEHHEAAANHHEKAAWHHREAARNYAKKDYAHAAHQALIAHGHTQQAIRHGSEATKYHVEHHGQAL-
>SRR5579864_1751781
----TGKEHHTAAADHHEQAARHHRLASKHYEEKDYAHAAHQALIAHGHTQQAMRHGNEATKYHVEHHGNDS-
>SRR6185437_13689150
----TGTEHHMKAAEHHEQAALHHRRASRHYMEREFAYAAHQALIAHGHTQRAARHANEATKYHVEHHGKES-
>SRR5512135_2626370
----TGAEHHSLAAKHHEQAARHHHQAAKHYEEKDYAHAAHQALIAHSHTQEAIFHGTEATKYHAEHYDRAT-
>SRR5580704_12517227
-------HDHHKAAEHHDEAAKSHRKAAESHEEGDTEQASQHSQLANDHSKKAQE------------------
>SRR5271166_1276736
---HPASEHHHQAAAHHHAAAHHHHRAAHHHDLGEHEEAKEHAEAAQEHSEQAHKHTTTA-------------
>SRR5271167_711373
---HSSSEHHHQAAAHHHAAAHHHHQAAHHHDLGEHDEGKDHADAAHEHSELAQKHTTTA-------------
>ERR1700739_3733392
---HPSSQHHQTAAAHHHAAAHHHHQAAHHHEIGEHEEAQAHAVAAKDHSELAHQHTETA-------------
>SRR6516165_8907569
---HPAVEHHRMAAMHHHAAAHHHHQAAHHHAHGQHEEAKKHATSAHEHSEHGHKHSKEA-------------
>ERR1700744_635017
---HPSSQHHTTAAAHHHAAAHHHHQAAHHHEHGDHEEAQEHAAAAKEHADLAHQHTATA-------------
>ERR1700730_3219825
---HPSSQHHQTAAAHHHAAAHHHHQAAHHHELGEHEDAKEHAAAALGHSELAHKHTTTA-------------
>SRR5271167_1124854
---HPSTEHHLQAAAHHHAAAHHHHQAAHHHDIDEDEEAQEHAEAAHEHSEMAHKHTKTA-------------
>SRR5580692_6476539
---HASIDHHEQAAAHHHAAAHHHHQAAHHHAAGEHDHAKRHATAANEHSHAAHRHSNTA-------------
>SRR5215469_3288153
---HPAVEHHRQAAAHHHAAAHHHLQAAHHHSHGQHEEAKKHATTAHEHSDHGHKHTKDA-------------
>ERR1700734_4447503
---HPASEHHHQAAARHHAAAHHHHQAAHHHDLGEHKEAKEHAEAAHEYSEQGQKHTATA-------------
>tr|A0A1U7CVY5|A0A1U7CVY5_9BACT Uncharacterized protein OS=Paludisphaera borealis GN=BSF38_04616 PE=4 SV=1
---HPASEHHHQAAAHHHAAAHHHHAAAHHHDIGEHAEAKQHATAAHEHSEKAHAHTKTA-------------
>SRR5208283_1986152
---EASGQHHHQAAAHHHAAAHHHHQAAHLHDIGKHEEAKEHAEAALEHSEQAHKHTT---------------
>SRR5579864_3769991
MSTEETVEYHRKAAEHFQYAANHHMAAAAHYSDGRHEQAAREAYLAHGHYLHGSNHAAEAARLHARHFGQK--
>SRR5712692_8710527
DMSTEAVKHHRKAAEHFSYAAKHHAEAGTHYGAGRHEQAAREAYLAHGHYLTATNHAAEAARLHTRHFGQK--
>tr|A0A1Q7AG34|A0A1Q7AG34_9BACT Uncharacterized protein OS=Acidobacteria bacterium 13_2_20CM_2_66_4 GN=AUI11_11295 PE=4 SV=1
-MSTEAVDHHRKAAEHFEHAAQHHSAAASHYGAGRYDQASREAYLAHGHYLHGSNHAAEAARLHTRHFGQK--
>ERR1700693_4730633
-----TWEHYDLAARHHERAAHEFKDAAKYHETEEHEKAAHHAYLAHGHNQHTIHHGNATAKLHTAHCD----
>ERR1700751_2402501
-----AWEHYRHAARDHERAAHHFKEEAKYDEVEEHEKAAHHAYMAHGHNQHAIYH-----------------
>SRR3984957_10564985
----MAHEAHHKSAEHHEEAAKHHHLAAEHHIKGDHKKAHDHATQAHEHSVKPMTTPPPRTRR----------
>ERR1700679_2802903
----MAHEAHHKSAEHHEEAAKHHQLAAEHHIKGDHKKAHEHATKAHEHSAKAHEHSKAAHEA----------
>ERR1035437_1094595
---KTGIENHKKTAKHLEEAAKHHHDAAKHHEDGNHAKASESTIKAHGHCCCANDLQKEDSKGHA--------
>ERR1022692_4360246
---QKGIENHKKAAKHLEEAAKHHLDAAKHHEAGNHEKACASTLKAHGHTCLATEHQRENIKHHA--------
>SRR5579872_2422874
---QKGIENHKTAAKHHEEAAKHLHDAAKHHEAGNHEKASESTIKAHGHAYIAGEHQREYAKQHA--------
>SRR6478736_6934229
---HKGIKNHERAAHHHEKAAKHHHEAARHHQEGNHKKASESAIKALGHHCLASEAEREDIKHHA--------
>tr|A0A1F3BRJ9|A0A1F3BRJ9_9BACT Uncharacterized protein OS=Bacteroidetes bacterium GWA2_31_9 GN=A2033_19665 PE=4 SV=1
---KTVIEKHKKVATHLEEAAKLHHEAAKNHEEGNHDKAHSSTVKANGHTEHAKEIDKEIKKHHV--------
>ERR1700733_15661450
--SEMPKDAHNKAAEHHENAAKSHKTAAEHHGKGDHAKGREEYAKAHAHSTRTQENSQ---------------
>SRR5579871_5864872
--DHMARDAHNKAAEHHENAAKSHKTAAEHHGKGEHAKGREESARAQGHSKTAHEHSE---------------
>ERR1700728_3975191
--DDIARHTHTKAAQHHESAAKSHKKAAEHHGKGEHAKGREESAKAYGHSKTAHEHSE---------------
>SRR5580658_7378271
-AKHPSAEHHHNAASHHHAAAHHHHQAEHHHAMGEHEQAKHHAKAAKEHSELAHKHTE---------------
>ERR1700678_803200
-AKHPASEHHHTAASHHHAAAHHHHQADHHHVRGEHEQAKHHAAAAKEHSELAHKHSE---------------
>SRR5580704_6309482
-PKHPAQEHHHAAAAHHHAAAHHHHQAEHHHAVGEHAEAKQHATAAHEHSELAHKHTT---------------
>SRR5208283_5298634
-AKHPAGEHHHTAAGHHHAAAHHHHQAEHHHARGEHEEAKQHASAAQEHSEAAHKHTT---------------
>ERR1700688_3021457
-TKHPAQEQHHLAAAHHHAAAHHHHQAEHHHAVGEQAEAKQHATAALEHSELAHKHTT---------------
>ERR1700730_17718460
-ANHRSPQHHHLARAHHHAAAHHHHQAEHHHALGEHEDAKQHATAAHEHSELAQKHTT---------------
>tr|A0A1G7GDX7|A0A1G7GDX7_9SPHI Uncharacterized protein OS=Mucilaginibacter pineti GN=SAMN05216464_11097 PE=4 SV=1
-MPNTKHSHHEEAANHHEAAAKSHRNAHKEHTEGNDEKAATHAHEAEGHAEHARTNSKEAAKKHATKSATA--
>tr|A0A1Q6A0Z7|A0A1Q6A0Z7_9SPHI Uncharacterized protein OS=Mucilaginibacter polytrichastri GN=RG47T_3130 PE=4 SV=1
-MPTTKHSHHEDAAKHHDEAAKSHRAAHKEHTEGNDEKAAHHAQKAQGHHTQAGEHAKEASKKHATKHASK--
>tr|A0A1N6RL65|A0A1N6RL65_9SPHI Uncharacterized protein OS=Mucilaginibacter lappiensis GN=SAMN05421821_102120 PE=4 SV=1
----MKHSHHEEAAKHHTEAAKHHTEAHKSHAEGNDEKAAHHAQTAQGHQHKATEHATEAAKKHAEKHSSS--
>tr|A0A1F2JHJ8|A0A1F2JHJ8_9SPHI Uncharacterized protein OS=Sphingobacterium sp. HMSC13C05 GN=HMPREF3127_23090 PE=4 SV=1
-MSETKHNHHHDAAKHHDEASKHHQNAHKAHQEGNDEKAAEHAKSAAESSKKANDHAEEATKKHSHKHGMK--
>SRR5471030_1443776
XMSKKAADHHRKASEHFEQAALHHTEAATYHATNAYEKAAHHAYLAQAHQHHATHHAGEALQAHLNDHGSS--
>SRR3984893_18837529
CMSKKAADHHRKASEHHEQAAFHHAEAAKHHLTNAFEKAAHHASLAQAHQHHATHHLGEALQAHLTDHGSG--
>ERR1700704_2548263
------NDMHKKAAEHHETAAKSHRAAAEHHGKGDHAKGKEHSTNAQQESQNAHQHSEQA-------------
>SRR5450755_131087
---------------------------------RDPVHKRKPLTVDYQHSEQAD-STKSA-------------
>ERR1700739_1735412
------KDEHNKAAEHHENAAKEHRTAAEHHGKGDHGKGREHASSAKQHSQTANQH-----------------
>SRR5580692_11831826
------KDEHNKAAEHHENAAKAHRSAAEHHGKGDHASGKKHSTEARDHASKASEA-----------------
>tr|S9SB59|S9SB59_PHAFV Uncharacterized protein OS=Phaeospirillum fulvum MGU-K5 OX=1316936 GN=K678_11413 PE=4 SV=1
-ATLKANEHHAAAAAHSESAAQHHKEAAKQFDSGHHEKAAHHAQVAAGHSAHATEHATEATKKYAEQHSS---
>SRR5665811_936752
------QQHHEKAAEHHEQAAKHHKEAVKHYESGDDKTAAHHSYVAHGHSEEAREQEMEASKKYAITQG----
>tr|A0A2P8H9L3|A0A2P8H9L3_9BACT Uncharacterized protein OS=Chitinophaga niastensis OX=536980 GN=CLV51_11087 PE=4 SV=1
------HEHHEKAAFHYDLASKSHREAHKSHQEGNDEKAAHHAQAAHGHAAQAKEHEVEASKKHSEKVK----
>tr|A0A2W2A7H8|A0A2W2A7H8_9BACT Uncharacterized protein OS=Taibaiella soli OX=1649169 GN=DN068_18475 PE=4 SV=1
------QKNHEEAAKHHDEAAKHHRDAAKHASEGNYDKAAHSAQAAQGHHAKAGEQAKKAATQYAEKKG----
>SRR5690242_15247408
---HAASEHHHTAAAHHQAAAHHHLEAAHHHDIGEHDEAKVHAASAQEHCEHAEKHTKTA-------------
>SRR5579871_3518436
---HASSEHHHTAAAHHQAAAHHHLQAAHHHDHGNDEDAKKHSSAAHEHSEHGDKHTK---------------
>tr|L0DAS1|L0DAS1_SINAD Uncharacterized protein OS=Singulisphaera acidiphila (strain ATCC BAA-1392 / DSM 18658 / VKM B-2454 / MOB10) GN=Sinac_1996 PE=4
---HAACEHHHKAATHHAAAAHHHLEAAHHHNVGEHEAAKQHDEAAHEHGEHAHKAATTA-------------
>tr|A0A1N6H0I2|A0A1N6H0I2_9BACT Uncharacterized protein OS=Singulisphaera sp. GP187 GN=SAMN05444166_2625 PE=4 SV=1
---HAASEHHHMAAAHHAAAAHHHLEAAHHHDVGEHEAAKKHAETAHEHGEHAHKAAATA-------------
>ERR1700722_3501349
-------EAHTKAAEPHENAAKSHRTAAEHHGKGDHDNGREESTKAQSHAKTAREHSEAA-------------
>SRR5208283_1367323
-------YFDNVIRAHVPSAAKSHRAAAEYHGKNDHMKGNEHAMEAQKHSKVASAASNEA-------------
>SRR5580700_9650725
-------QAHTKAAEHHETAAKSHRAAASEHGRNDHMKGTEHSTEAHKHSKAGGEASDQA-------------
>SRR5258708_9298853
---HTGAGHHTLAAEHHEQAAHHHRQASKHYEKKDHANAAHESLIAHDHTRRAVHHSNEAGKYHAERHRK---
>SRR5690348_12125180
---YSGAEHHTLAAEHHEAAARHHRQAAKHYQGKDYAHAAHQSLIAHDHTRRAIHHSNEAGKYHAERHGA---
>SRR6185312_14811857
---HTGAEHHTFAAEHHERAARHHRQASKHYEEKDYAHAAHQSLIAHDHTRRAVHHSNEAGKYHAEQHGD---
>SRR5207248_1576108
---YTGAEHHTLAADHHEQAALHHRKASQHYDAKDYADAARGSLTAHGHTRRAVHHSNEAGKYHGERAEQ---
>SRR4029077_5485951
-MSKHASEHHRQASTHYHDAARHHQEAAHFSQAGNYERAAYHAGIAAEHQRQAAHHANEAAKHLP--------
>ERR1700737_4938755
-MSKNAAEHHRQASTHYHDAARQHQEAAHFHEAGNYEKAAHHAQIAADHQRQAAHHADEAAKHHA--------
>ERR1700747_3175619
-MSKQASEHHRQASTHYHDAARHHQEAAHFSEAGNYERAAYHAGFAAKHQRHAAHHAEQAAKHTP--------
>SRR5438128_9326564
-MSK-GAEHHRQASTHYHDAARHHQAAAHLSQAGNHGRAAYHAAIAAEHLRQAAHHADEAARHFP--------
>ERR1700727_996832
-------DAHSKAAEHHENAAKSHRTAAEHHGKADHAKGREKSAKAHGLSKTAHESSE---------------
>ERR1700730_4501103
---HPASEHHHAAAAHHAAAAHHHLQAAHHHDHGNHEEAKKHAASAHDHSQDADRHSKV--------------
>ERR1700730_10583744
---HASSEHHHNAASQHEAAAHHHRQAAHHHEYGNHDEAKNHATAAHDHSQDADRHSKG--------------
>SRR5580704_14572243
---HASSEHHFLAAAEHEVAAQQHRQAAHQHDRGNHAEAQKHARAAHDHSQDADRHSKT--------------
>SRR5450755_4646905
---HAASEHHHRAAAEHAAAAHHHYQAAHHHDHGNHEEAKKHAESAQGHSQDADRHSKI--------------
>tr|G8NWU3|G8NWU3_GRAMM Uncharacterized protein OS=Granulicella mallensis (strain ATCC BAA-1857 / DSM 23137 / MP5ACTX8) OX=682795 GN=AciX8_0020 PE=4 SV=1
------HEAHKKAAEHHEHAAKAHHAAAEHHESGDHKAAHEH-------SEKAHEHSTEAHKHSADAHSK---
>SRR5579864_5391397
-----TWEHYPQAARHHERAAYHYKEAGKFDEAEEHEKAAHDAYLAHGHNQHAIHHDSEAAKLHAEQCD----
>SRR5580704_9138589
-----TWEHYHHAGRHHEQAAYHYHEAAKYYQAEEFEKAAHHAYLAHGHHQHAMHHDAEAAKLHTEHSD----
>ERR1700694_2352438
----PASEHHLQAAAHNPAAAHHHLEAAHEHDYDTHEEAKKHAASALNHSQDADRHSK---------------
>ERR1700734_2270764
----PASEHHLKAAAHHAAAAHHHFEAAYEHDHGNHDEAKKHAASALDHSQDADRHSR---------------
>ERR1700687_1619548
----PASEHHLKAAAAHAAAAHHHFEAAHQHDYDNDEEAKKHAASALDHSQDADRHSK---------------
>ERR1022692_1760285
----PASEHHLKAAAHHAAAAHHHFEAAYQHDNDNHEEAQKHQASELDHSHDADRHSK---------------
>SRR5579872_7468067
----ASSEHHHNAAAQHQAAAHHHLEAAHHHDHGEHDEGKKHASSAQEHSEQADRHSK---------------
>SRR5208282_1491605
----LSSEHHHKAASQHEAAAHQHRQAAHHHENGNHEVAKKHASSACDHSQDADRYSK---------------
>tr|A0A1Y0M4X7|A0A1Y0M4X7_9FLAO Uncharacterized protein OS=Polaribacter sp. SA4-10 OX=754397 GN=BTO04_01060 PE=4 SV=1
--NINGIKSHRKTTGYLQVSAKKHLEAAMHYQEGNHEKAVQSAIVAHPNFNLVYKAQRKDMNQHA--------
>tr|A0A1F3BRJ9|A0A1F3BRJ9_9BACT Uncharacterized protein OS=Bacteroidetes bacterium GWA2_31_9 OX=1797314 GN=A2033_19665 PE=4 SV=1
--MKTVIEKHKKVATHLEEAAKLHHEAAKNHEEGNHDKAHSSTVKANGHTEHAKEIDKEIKKHHV--------
>tr|A0A2S7T1N8|A0A2S7T1N8_9BACT Uncharacterized protein OS=Chitinophagaceae bacterium RB1R16 OX=2077091 GN=CJD36_000780 PE=4 SV=1
--MKKSIENHKQAAQHHEEAAKHHKQAAKHHEEGNHDRAHTSTVIANGHAHMASEKQTDDAKHHA--------
>SRR5580704_10616937
-------QSHTKAAEHHETAAKSHRAAAEQHGKNEHGKAKEHATQAQQHSKTAREHSEQA-------------
>ERR1700752_4702105
---RQAVEHHESAAKHYQDAAYHHREAAKHYTAGDYEKAAYHAHMAHGHHLHADDHASEAAKHVLG-------
>ERR1700733_8059481
---QKAVEYHESAAKHHQDAAYHHKEAAKHYTAGDHEKAAYHAHMAHGHHLHAADHSAEAAKQMLG-------
>tr|A0A1V3PEB7|A0A1V3PEB7_9GAMM Uncharacterized protein OS=Rhodanobacter sp. C01 OX=1945856 GN=B0E50_17670 PE=4 SV=1
------HHHHHEAAKHLDEAAKHHRAAAEHAEAGNHDKASHHAHLAHGHKLHAIEHAEHAAKKHAHKHDV---
>SRR5271157_2351377
---HPAAEHHHQAAAHHAAAAHHHLEAAHHHETGEHDQAKKHAEAALRHSEHGHKHTTTA-------------
>SRR5271166_1750728
---HPATEHHHRAAAHHAAAAHHHLEAARHHEAGELDQAKKHSVAAHRHSEHGTKHTTTA-------------
>SRR6266566_1561533
IMSTQAAEQHEKAAAQYGHAARHYKEAAEHHKAGNYEKAAQHAQTARWHHEQATDHASEAAKAHAEHYGKQQ-
>SRR5947209_9682788
IMSTQAAEQHEKAAAQYGHAARDRKSTRLNSSHANISYA----------------------------------
>ERR1700730_3404010
-------DMHQKAAEHHEQAAKAHRIAAEQHGSSDHATAKQQSAQAADKSKAAHKQST---------------
>ERR1700688_2973991
-------DMHQKAAEHHDQAAKAHRTASEQHGSNDQASAKQQSAQDAEKSKAAHEQST---------------
>SRR6476646_9263370
-------DMHEKAAEHHEQTAKAHRTAAQQHGSNEHVSAKQQSAQAADKSKAAHEHST---------------
>SRR5450631_562038
-------EMHQKAAEHHEQAAKAHQNAATQHGSNDHVGGKQQSAQAAEKSKTAHEHST---------------
>SRR5580700_1624679
-------DMHQKAAEHHEQAAKAHRTAAQQHGSSDHVNAKQQSAQAVEKSKAAHEQSM---------------
>SRR5580704_15841692
-------DMHQKAAEHHEQAAKAHRAVAEQHGSNNHAAAKQQSAQAVEKSKSAHEHST---------------
>ERR1700730_10426099
-------DAHNKAAEHHEQAAKSHRVAAEHHGGGDHAAGHEHSGKAHAHSKMAHDQSG---------------
>tr|A0A2N3PRL3|A0A2N3PRL3_9PROT Uncharacterized protein OS=Telmatospirillum siberiense OX=382514 GN=CWS72_18585 PE=4 SV=1
--------SHTKAADAHEAAVKMHRSAADEHAKGDHKAGLEHAEKAVKLSKEAQERGTGA-------------
>tr|A0A1H0JSH8|A0A1H0JSH8_9RHIZ Uncharacterized protein OS=Methylobacterium phyllostachyos OX=582672 GN=SAMN05216360_12370 PE=4 SV=1
--------AHHEAAKHHEAAAKSHKTAAEHHEKGDAKTAGKHAEEAHGHSAKAHESSTKA-------------
>ERR1700685_1504215
-------DLHREAAEQHEQAARSHRTASEHNEKGDHDAAKWHAER----------------------------
>SRR6516164_11255356
---HPSSEHHHQAAAHHHAAAHHHHQAAHHHAVGQHEDAKKHATAAQEHSEMAHKHTSTA-------------
>ERR1700740_158540
---HPSQEHHHAAAAHHHAAAHHHHQAEHHHGRGEHEDAKHHAAAAHEHSEQAHKHTTSA-------------
>SRR5262249_17407650
---HPSSEHHLSAAVHHHAAAHHHHQAGHHHALGQHEEAKQHATAAHEHSEHAHKHTATA-------------
>SRR5580658_1949052
-----MHETHREAAEKHELAAHAHRTAAEHNEKGDYSKATWHSERA---------------------------
>SRR6202167_6317267
-----VHEAHGDAVERHELAAQAHRTAAEHNEKGDLSAAAWHSERA---------------------------
>SRR5215831_2780084
TSCRRKLLNtTERHQNTLKHAARHHEEAAKHHDAGHHEKAAHHAHTARGHVIHGRGHAEEAVKAHTEEHGKKX-
>SRR5262245_21882200
-SCRRKLLNTRKASEHLKHAAHHHEETAKHHDAGHHEKAAHHAHTARGHIIHGRGHAEEAVKAHAEEHGKKX-
>SRR5262249_32641853
ISCRRKLLNtTERHQNTLSTPPVTTRRPPS-TTMPDITKRRHTTLTPRGHVIHGRGHAEEAVKAHTEEHGKKX-
>SRR5271166_3867598
----KTWEHYHHAALLHEKAAYHHKEAARYDQAEEHEKAAHHAYLAHGHSQHAVHHEAEAAKLHAEQCAIL--
>SRR5271166_3724902
----KTWEHYHQAARNHEKAAYHFNEAAKYNQAEEHEKAAHHAYLAHGHSQQAAHHDVEAAKLHTEQCDRV--
>SRR5579863_300323
----KTWEHYHHAARHHEKAAYHYNEAAKYDQAEEHEKAAHHAYLAHGHSQHAAHHDVEAAKVHADQCDKA--
>SRR5580658_4821791
----KTWEHYHHAARNHEKAAYHFNEAAKFNQAQEGEKAAHHAYLAHGHSQQAIHHAAEAAKLHAEHYASQ--
>ERR1035441_6906181
----KTWEHYHHAARAYEKAAYHFNEAAKYNQAEEHERSTLFAYLAHGHSQHAVHHDVEAPKLHAEQCDSL--
>SRR5208283_2767898
----KTWEHYHHAARNHEKASYHYNEAAKYNRAEEHEKEAHHAYLAHGHGQLAVHHAAEAAKLHAEQCGSL--
>SRR5579863_4455819
----KTWEHYHHAARDHEKAAYHYNEAAKYHQAEEHEKEVHHAYLAHGLSQHAVHHEAEAAKLHTEQCDKL--
>SRR5579859_1088175
----KTWEHYHDAARHHELAAYHYKEASKYDKAEEHERAAYHAYLAHGHNQHAIHHDIEAAKADAEQCDKV--
>ERR1700734_995030
----KTWEHYHHAARNHEKAAYHFNEAAKFNQAQEHEKAAHHAYLAHGHSQQAIHHAAEAAKLHAEHYGSQ--
>SRR5271154_1375729
----KTWEHYHHAARDHEKAAYHFHEAAKYYQAEEREKAAHHAYLAHGHSQQAIHYAGEAAKLHAEQHDKL--
>SRR5271154_2378436
----KTWEHYHHAGRHHEKAAYHYHEAAKYYQAEELEKAAHHAYLAHGHHQQAIHHDAEAAKLHAERCDTP--
>ERR1051325_8213161
----KTWERYHHATRHHDRVADHDKTAAKYNPSEAHEKAAHYAYIAHGQTQHALHHDAEVAKLCAKQFDGD--
>ERR1700744_6269464
----SGPEHHLAAADHHESAAQHHRNASKHYEEGDHAHAAHQALIAHGHAQLASRHAKDATKSHVEHHSDS--
>ERR1700728_2423293
----KTWEHYHDAACNHEKAAYHFNEAAKYDQAEEHEKAAHQAYLALGHSQHAVHYAAEAAKLHAEQCAS---
>ERR1019366_10183257
----KTWEHYHHASRHHERAAYHYKEAAKYDKAEEHEKAAHHAYLAHGHSQHAIHHDAEAAKLHAEQCAS---
>SRR6476646_11755220
LMSKQAAKHHKKASEHFAKAAHHHGEAAKQHQAGNHETAAHHASIARGCDLHATEHAHAARKAYADDHG----
>SRR5664279_2450751
-MSK-TWVLaYRCAAHHLERAAYHYKEAAKYEEAGDHEKATHHAYLAHGYTQHAIHDDAEAAKLHAEHF-----
>ERR1039457_3077952
-MSK-TWELaYQCAARHHERAAYHYKEAAKYEEAGEHEKAAHHAYLAHGHTQHAIDCDAEAAKLHADHL-----
>SRR5664280_3607282
-MST-TCELaYYCAARHHECAANNYKEAAKCEAAGEHEKAAHHAYLAHGHTQHAIDCDAEAAKLHADHF-----
>SRR6266566_4045491
LMSKKAAQHHKQVAEHLKHAAFHHEEAAKHHETGRHETAAHHAHIAMGHNFSTRVTFAWRAgtsAAPYPVKN----
>SRR6266699_5933420
LMSKKAAQHHKQVAEHMKHAAFHHEEAAKHHETGRHETAAHHAHRAMGHNFSTRVTFAWRAgtsQHHTRSRI----
>SRR5262249_22277374
LMSKKAAGHHKQVAEHLKHAAFHHEESAKHHEAGRHEAAAHHAHVAMGHIIHARSHAEEAVKAHVAEHD----
>SRR5215467_11810449
LMSKKAGEHHKKASEHFTHAAHHYEEAAKHGESGNHEKAAHHAAIARGHDLHGTEHAHAARKVTAENQGK---
>SRR5262249_44821660
LMSKKAAEHHKKASEHFTHAAHHYEEAAKHGESGNHERAAHHAAIARDGIIQPTPGRASFN-LCAK-RGR---
>tr|G3IVL7|G3IVL7_METTV Uncharacterized protein OS=Methylobacter tundripaludum (strain ATCC BAA-1195 / SV96) OX=697282 GN=Mettu_0532 PE=4 SV=1
----TPQQHHQKAAEHHEQAAKHHKEAAKHYESGDDKTAAQHAHIAHGYSTQAMEQEMEASKKYAKMQ-----
>ERR1700761_5729412
------DAHHLKAADHLEEAAHHHREAAKHHAEGDVELAGHHAQVAAGHTAEADHHTVKAAKLYAKLHE----
>SRR5579862_2076159
------EDHHHQAAEHHEQAAHHHREAAKYHTEGDVELAGHHAHVATGHSAHAAHHAVESSKLHAHLHD----
>ERR1700761_5328672
---------HQKAATHHERAALHHREAAEHHAEGDIELAGHHAQVAAGHTAEAARHAAKAAKLHAKLHD----
>SRR5665647_1062279
----TPQQHHQKAAEHHELASKHHKEAAKLHESGDYEAAAHHALIAHGHTVQPQNKRRKPA------------
>SRR5450759_4733936
----TPQQHHQKAAEHHELASKHHKEAAKFHGSGDDEAAAHHALIAHGHTVHATEQEEEASKKYANR------
>ERR1039458_9226938
----TPQQHHQKAAEHHELASKHHKEAAKLHESGDDEAAAPPPLIANEHRVKATEQEEEASKKYANR------
>SRR5258706_9872423
----TGMEHHIAAAEQYERAALHHRRASQHYAELNHPQAAHQALIAHGHMQQAVRHSNEATKHYVELHSV---
>SRR4051812_20172208
----TGSEHHIAAAEEYERAARHHRCASQHYLELNHPQAAYQALIAHGHMQQAVRHSSEATKYYVELNGQ---
>SRR5690348_3201367
----TGSEHHIAAAEQYERAAERHRRASQHYVDLEHPQAAHQALIAHGHMQQAVRHSNEATKYYVEQHGA---
>ERR1700678_3139464
---DQIADHHEKAAAHHEKAAHHHRKAAEYHKSDDVDTAAQHAHSAHGHDLHAEHHAEAA-------------
>ERR1700722_11937569
---DQIADHHEKAAMHHEKAAHHHRQAAQHQKSEDIAAAAQHAHSAHGHDLHADHHAEAA-------------
>SRR5208337_523507
-SDTTLAEHHSKAAEHHGHAKHHHEEAAKAQEDDDHAKGHHHAHIAHGHHLQAEHHHEVAAKH----------
>SRR5271166_3951483
-SDTTLAEHHSKAAEHHGHAKHHHEEASKAHKAGDHAKGHHHAHVAHGHHLQAEHHQEEAAKH----------
>SRR4029453_15264427
TISTQAAAQHEQAAEQYGHAARHYQEAAEHYKRGQYAKAAHDVQTARGHHAQATAHAATAAKYHAEAYV----
>SRR6266446_8423588
TMSTQAAEQHEQAAEQYGHAARHYEEAAKHQKAGNHEKAAHHAHTARGHHKQATAHASAAVKPHA--------
>SRR4029453_3573209
TMSTQAAAQHEQAAEQYGHAARHYQEAAEHYKRGQYAKAAHEVQTARGHHAQDTDNTVTAAKYHAESYV----
>SRR4029434_6774894
TMSIQAAEQHAQAAAQYGHAARQYQEAAAHHQVGQYAKAAQHAQTARAHHAQATAHALAAARAH---------
>SRR6266851_6134461
-------KSHVAAADHYEKAAEHHRTAAEHASEGDQQAAAHHAHIAQGHALHGHEHAASAAKQHVALHA----
>SRR5580693_7599173
-------KFHVAAADHYEKAAEHHRSAADHADEGNPQAAAHHAHIAQGHALHGHEHAAEAAKKHIELHA----
>tr|A0A2U3KQE7|A0A2U3KQE7_9BACT Uncharacterized protein OS=Candidatus Sulfotelmatobacter kueseliae OX=2042962 GN=SBA1_400038 PE=4 SV=1
----KTREHYQEAARHHERAAFHYKEATRYDAAEEHEKAAHYAYLAHGHNQHAIHHDAEAAKLHAERCDS---
>SRR5579871_2888725
----SGIEHHETAAEHHEHASRHHHQASKHGEKRDHSPASHEVNLANGHAHRAVFHGDEAAKYHVEHFGRS--
>SRR5437868_383465
----SGAEHHVAAADHHEQAAQHHRLASKHCDGKDYAMAVQEAQIAHRHAQHSVFNGNEAAKHHVEHYGKS--
>ERR1700693_805720
----SGAEHHAAAADHHEQAARHHGQASMHCEG----------------------------------------
>SRR5579871_6775725
----SCAEHHAAAAGLHEEACGHLSRVAGHFQKSKIGEAAREAKLALDLAVRAAFHSNEAAKDYAK-------
>SRR5471032_131497
----SGAEHHAAAADHHEQAARHRDHAAELCVSSDDALAAREAAVAKSHARRAVFHGDEAAKHHVEHYGRS--
>SRR5580692_1574559
----RGAEHHAAAADHHDLAARHQGQATKHHDAKEYAQAAHEVQIAHGHAQRSVFHGDEAAKHHVEHLGKS--
>SRR5580658_8805264
----SGADHHTAAADHHEQAARHYGRASKHYDAKEYKQAAHEAQIAQGHAQHSVFHGDEAAKHHVEHFGKS--
>ERR1700723_2911494
----GPAEHHAAAADHHDQAARHHGLAAKHWDRNDDAL-----------------------------------
>SRR5271156_4309961
----SVAEHHAAAADHHEQAARHHGQAAKHRDDADYVLAAHEAQIAHGHAQHSIFHDNEAAKHHVEHFGKS--
>SRR5580692_11411556
----ICAEHHTAAAALHEEACTHLSCIAGHFQKSKV-------------------------------------
>ERR1700693_2802083
----DGAEHHAAAAAHHEKAARHHQEASRLCGEQKYAEAAHEAQMAHRHAHYSVF------------------
>SRR3984885_8417543
----GTAAHHAAAATHHEQAAHHHEEAARLCGEKDYARAAHEAQMAHRHAHYSIFHDDEAAMHHVEHYGKS--
>SRR5271154_5983987
----SSAAHHAAAGLHHEQAAHHHKDAARLCNEQQYARAAHEAQMAHRHAHYSVFHDDEAAMHHIEHYGKS--
>SRR5277367_1397320
----SAAQHHIAAAEHNEAAAQHHADAAQHCGRKADGSATIEAEIARGHAEYAVF------------------
>ERR1700722_13942348
--PSKTIDNHQQAAVHHTEAAKHHLEAAKFYAEGNTEKAAHSAMLAWGHHAIAGEFMNDDAKHHAQ-------
>SRR5580658_1516979
--YKQTIDRHQQAAAHHTEASKHHLDAAKFYAEGNPEKAAHSAMLAWGHHAIAGEFINDDAKHHAQ-------
>ERR1700733_10419240
--YKKTIENHQQAAAHHTEAAKHHLEAAKAYAENSPEKAAHSAMLAWGHHAIAGEFINDDAKHHAQ-------
>SRR5580658_3530262
--HKKSIDNHTQAAAHHKEAARHHLEAAKFYAEGNSEKAAHSAMLAWGHHAIAGEFINDDAKHHAQ-------
>SRR3989338_1468871
---QRGIKNHQRAAAHYEAAAKSHLEAAGHHENENHEKAAKSTVEAHGHSSLGNDAQKEDVKHHTE-------
>tr|A0A257K659|A0A257K659_9FLAO Uncharacterized protein OS=Flavobacterium sp. BFFFF2 GN=CFE24_14185 PE=4 SV=1
---QKGIDNHKKAAAHFESAAKSHLAAAKHHEDGHHEKAAKCTVDAHGHACMGKDAQTQDVKHHAS-------
>tr|A0A257LAW7|A0A257LAW7_9BACT Uncharacterized protein OS=Bacteroidetes bacterium B1(2017) GN=CFE21_08740 PE=4 SV=1
---QKGIDNHKKAASHFEAAAKSHLEAAKHHEDGHHEKAAKATVEANGHSNMAIDHQKEELKHSTK-------
>tr|A0A1F3VUM1|A0A1F3VUM1_9BACT Uncharacterized protein OS=Bacteroidetes bacterium RIFCSPHIGHO2_02_FULL_44_7 GN=A3D92_22580 PE=4 SV=1
---QKTVEGHRTAAAYYEAAAKSHLEAAAHLMNDQNDKASQSTMQAYGHSKLAIEAQKEYVKRHTL-------
>SRR5580704_9113175
-----AVEAHHKAAEHHQKAAEHHHKAAAHHEAGNHEKAHEHATKAHEHATEAHKHSSEAHEKS---------
>ERR1700722_5269228
-MSKSAAAHHGHAAYHHESATRHHRAAENAYGSGDHKKAAHEAQLAQTHALKAKHHSDLAAKEHLEHHGMD--
>SRR5665213_2018094
-VMSSSGEHHGWAAYHHESATRHHRAAENAYGSGDHKTAAHEAQCASDHASRAKHHADLAVKSHIEHHGMD--
>SRR5450432_4126119
-VMSSSGEHHGWAAYHHESATRHHRAAENAYGSGDHKTAAHEEQCASDHACRAKHHADLAVKSHIEHHGMD--
>SRR5262244_3625698
VMSDKAAGHHKKASEHLARAAYHHGKAAK---TKGYEAAMQHAQTARNHRLQAAGHAEKALKAHIDH------
>SRR5215468_6366957
VMSDKAADHHKKASEHLVRAAYHHRKAADHGETGRHETAVHHAQTARAHRLRAAGHAEKALNAHVEY------
>SRR5215475_6990276
--------------------AYHHGQAAK---TKGYEAAMQHAQTARNHRLQAAGHAEKALKAHIDH------
>SRR5215467_9446568
XMPHKAAEHHEKAAAHLERAAYHHGKAAKE--AGRYETAVDHAQMARTHRLQAAGHAEKALKAHVEY------
>SRR5262245_23793110
VVSDRAADHHKKASEHLAHAADHHKKAANHGETGRHEMAVHHAQTARAHRLQAAGHAEKALNAHIEY------
>SRR5262249_56064253
-------------MLEGVVAAYHHRKAANHGEIGGHETAVHHAQTARAHRLQAAGHAEKALNAHIEY------
>SRR5262249_33404574
XMSKRAAEHYRKASEHLTRAAQHDEKAASDHEAGRDEAAMEHAQAARTHTVRAESHAEKALRAYVEH------
>SRR5262249_28992298
----RVTDHYQKASEHLARAAEHEKKAAQDHEAGREQAALQHAQTARLHTLRAESHAEKALNAYVEH------
>SRR5271157_5172123
------KDAHNKAAEHHESAAKSHRSAADSHGKNDHAKGKEHATHAQQHAQTANEHSKTAN------------
>SRR5271170_6162693
------RESHNKAAELHESAAKSHRAAAESHGRNEHAKGKEHATQAQQHAQSAHEQSKTAN------------
>ERR1700683_1074718
------KDEHNKVAEQHEAAAKSHRAAADAHGKNDHAKGKEHSGQAQQHSQNARNQSQAAH------------
>ERR1019366_63553
-V------HQNT-TVVPRTTTR---APRDIIAPPRTRTX----------------------------------
>ERR1019366_3465073
-MSQSPAEHHGKAAYHHESATRHHRAAEKAYGSGDHKTAAHEAQCACGHSNLAKNSADAAAKSHMEHHGAQ--
>ERR1035441_4551387
-MSQSPSEHHGRAAYHHESATRHHRAAENAYGSGDHKTAAHEAQCAAGHASLAKHHSALAARSHMEHHGME--
>SRR6202011_4950182
-MKHASSEHHHSAASKHEAADYYHRQAAHNHDRGDHEEAQKHATSAHDHSQDADRHSKIAH------------
>SRR5260370_19141254
-MNHASSEHHRSAASEHEAAAYHHRQAVHHHENGNPEDAKKHATSAHDHSQDADRHSKNAH------------
>SRR6202011_5447000
-KKHVSSEHHHNAAAQHEAAAHHHRQAAHHHDHGNHEEAKKHATSAHDHSQDADRHSKTAH------------
>SRR5450830_80626
-------QLHQKVAEHHEQAAEHHQEAAKHHESGDDETAAHHAQIAHGHAVHATMH-----------------
>ERR1035437_2959451
-------DLHRHAAEHHELAAQHHRAAGMCQDCCHDDDAAHHAKETTCHAFHAAMH-----------------
>SoimicmetaTmtLMB_FD_contig_41_728288_length_224_multi_1_in_0_out_0_1 # 2 # 223 # -1 # ID=1056114_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.653
-------HLHEQAAEHHEQAAKHHKTAANCCASGDMDGADHHARSAQGHAVHAGAH-----------------
>ERR1700678_2085178
----NMKEEHNKAAEHHESAAKSHRAAADAHGKNDHTKGKEHSTQAQQHAQNAQQHSKTAN------------
>ERR1700692_1389415
----QMKEEHTKAAEHHESAAKSHRAAAEAHGRNDHPKGKEHATQAQQHAQNANENSKTAN------------
>ERR1700722_5975226
----SMKDSHNKAAEHHESAAKSHRAAADSHGKNDHAKGKEHSTQAQQHSQNARDHSKTAH------------
>ERR1700753_2365108
----KMKDDHNKAAEHHESAAKSHRAAAEAHGRDDHAKGKEHSTQAQQHAQNASEHSKTAN------------
>SRR3979409_386687
----KMKDAHDKAAEHHESAAKSHRAAAESHGKNDHTKGKEHATQAQQHAQNAGEHSKTAH------------
>SRR5579863_8143594
----QMKEEHNKAAEHHEAAAKSHRAAADSHGKNDHSKGKEHSTQAQQHSQDARNQSQSAH------------
>SRR5579863_8033294
----TMKEDHNKAAEHHESAAKSHRAAAESHGKNDHAKGKEHATQAQQHAQNAHEHSKSAN------------
>SRR6188472_3074432
VMDDQSERQYTNAADELERSVAHYREASRHSARGEHVKAAHHAHIARGHFLNAQASAHDAAKWHADHFS----
>SRR4029079_19750494
MDQDQSARQYTTAADELERAVAHYREAARHSALGEHVKAAHHAHIARAHFLNAQANAHDAAKWHADHFS----
>SRR5919205_844608
MTQDLSQDQYTMAADELERAVQHYREAARHSELGEHVKAAHHAHIARGHFLNAQSMAHDAARWHAEQFS----
>SRR5688500_4780906
MSQDRSVQEYAVAAQALARAAAHYREAARHTELGEHVKAAHHAHIARGHFLNAQDFAHEAAKLHASRFS----
>SRR5271170_5730888
-----AHEAHHAAAEHHENAAKHHRHAAEHHAAGNHKEGHEHSVQAHEHSKKAHEASTDAHKHSVAAH-----
>SRR5262245_50603022
--KHPAVEHHLQAAHHHHVAAHHHLHAAHHHAHGQHEEAKKHATTAHEHSEHGHKHSKNAHG-----------
>SRR6516162_4486343
--KHASVEHHHQAAARHHAAAHHHLQAARHHTHGQHEEAKKHATAAHEHSEHAHRHSKDAHS-----------
>SRR5271155_274146
------KEAHTKAAEHHENAAKSHRTAAEHHGKGEHTKGHEESTKAQTHSKTARDHSDMAH------------
>SRR3984957_6795854
------KEAHTKAAEHHETAAKSHRTAADHHSKGDHAKASEESTKAQSHSKTARDHSDMAH------------
>SRR6202048_1289238
------KETHTRAAEHHENAAKAHRTAAEHHGRGEHAKGKEQANAAKQHSQTANQHTEQAH------------
>SRR6478672_995202
------KETHTRAAEHHENAAKAHRTAAEHHGKGDHAKGREESTKAQGHAKTAREHSEA--------------
>SRR6516164_8106867
---HASIEHHHQAAARHHAAAHHHLQAAHHHAHGQHEEAKKHAVTALEHSEHGHKH-----------------
>SRR4051794_5723104
--------TMTLAAEHHEHAARHHREAAKFHEVKDILAAVDQAHMATDHQAHAIHYATQAAKEYLAA------
>SRR4051812_7997314
----PSRDEYTRAADELEKAVRHYREAASHSGRGEHVQAAHHAHIARGHFLNAQGMAHDAARRHADQFS----
>SRR5918997_1468141
----PSKDEYTRAADQLEKAVRHYREAASHSERAEHVQAAHHAHIARGHFLNPQGMAHDAARRHADLFS----
>SRR4051812_48614955
----PSKDECTWAQTNSRGHCA-TTGKPEPLRAGEHVEPAHHRHIARGHDLNAQGMAHDAARRHADRFS----
>tr|A0A1H7LQK3|A0A1H7LQK3_9ACTN Uncharacterized protein OS=Blastococcus sp. DSM 46786 GN=SAMN04515665_10783 PE=4 SV=1
----LSKNQYTKAADELVLAVRHYREAASHSGLGEHVQAAHHAHLARGHFLNAQAVAHDAARWHADEVS----
>ERR1039458_9372234
--PVPGATHHDAPAQHDEEAARHRQQAAELYQCGRHEKVSHHGHLAYAHHLHAKQHAEEAAKAHM--------
>ERR1017187_367733
--SAPGAKHHNAAAQHDEEAARHRQQAAKLYQRGHHEKVSHHAHLAYAHYLHAKQHAEEAAKAHM--------
>ERR1017187_6707531
--PVPGATHHDAPAQHDEEAARHRQQAAKLYQRGHHVKVSHHAHLASAP------------------------
>SRR5579864_7684482
--PVPGATHHDAPAQHDEEAVRHRQQAAELYQCGHHEKVCHHAHLAYAHIVHTKQHAEDAAKAHM--------
>SRR5271156_4620907
----MPKETHTRAAEHHENAAKAHRTAAEHHGKGEQDKGHEESTKAHEHSTEAHRSLNRC-------------
>ERR1700726_2667972
-----AKDEHNKAAEHHENAAKSHRAAAEHHGKNDHAKGKEHSANAQQHSQNAPKHSEPAH------------
>ERR1700756_5891212
----KLAEHHETAAHFHELAAEHHRQAAEHQRDEEHEKAAQHALAADGYRLHAVEHAEEASRLYAEEF-----
>SRR6202008_972907
----KLAEHHETAAHFYELAAEHHRQAAEHHRDEEHEKSAQHAFAADGYRLHADEHADEAARLFAEVF-----
>SRR5215469_981565
----KLAEPHETAAHFYELAAEHHRQAAESHRDEEHERAAQRAFAADGYRLHADEHADEAARLFAEVF-----
>SRR5271155_5034285
----MPKETHTKAAEHHENAAKAHRTAAEHHGKGEHDKGHEESTKAHEHSTQAHRHSADAHGKSGEART----
>tr|A0A257URR7|A0A257URR7_9PROT Uncharacterized protein OS=Acidiphilium sp. 37-64-53 GN=B7Z58_15790 PE=4 SV=1
----MASSNHKEAAKAHETAAKAHHTAAEHHDKGDHAAAQQHSTKAHEHSSAAHKHSTDAHQQSGKAAG----
>ERR1035441_2777089
-----AREEHNKAAEHHENAAKSHRAAAELHGKGEHTKGVEQSKTAQQHSQTAGKQSDQA-------------
>tr|A0A0D7P848|A0A0D7P848_9BRAD Uncharacterized protein OS=Bradyrhizobium sp. LTSP885 GN=UP09_16020 PE=4 SV=1
-----ANSEHNKAAELHETAAKSHRAAADQHSKGDHSKGVEQSKSAQQHSQSAGKQSDQA-------------
>ERR1039458_2112867
---QKGVDIHKQAAKHHLEASKHHLDAAKFYEVGEHEKAAVSTVKAQGSASLASDASREDAQMHSF-------
>ERR1017187_9273322
---QRGVDVHKQAAKHHLQASMHHLDAARFHEIGDHEKAAVSTVKALGSACYASQAMNEDAQIHTI-------
>ERR1035437_936469
---QPRVDIHKQAAKHHQDAAKHHQDAAKFHEQGQHDKAAASTVKAQGSATLANDASREDSRSHAI-------
>SRR5476651_1852145
---LKGIETHKQAAKHHQDAAKNHLDAAKFHEAGDHEQAAKSTVKAQGSASLANDAAREDAKSHAV-------
>ERR1022692_1737537
---LKGIENHKHAAKHHQDAAKNHLDAAKFHEAGDHEKAAASTVKAHGSASLANDISKEDAQNHAL-------
>SRR5665213_98322
---QKGIDLHNKAAKHYEAAAKYHHEAAKYHETDDHKMADESTVKANAAATLGNDAAREDAQYHAL-------
>SRR5579871_474268
--THPAAEAHHTAAASHEAAAHHHRQAAHHHETGEHETARTHANSAHSHSATAHEHTTTAH------------
>SRR4029078_9117261
-----RKDEHNKAAEHHESAAKSHRAAAEAHGKNEHAKGKEHANQDQQHEQNAHAHSQSAH------------
>ERR1700681_418265
-----MKDAHNKAAELHEAAAKSHRTAAEHHGRNDHAKGKEHATQAQQHAQNANEQSKTAN------------
>SRR5271169_2644609
-----MKDAHNKAAEHHESAAKSHRSAADSHGKNDHAKGKEHSTQARQQSQSAEEHSKSAH------------
>SRR5579871_2888725
---QGAMPDHASAAHHHAQAAYFHREALTHYRIGkDYAHAAHQALVAHGHAMQAVFHGEEARKYYSGHNGNG--
>SRR5437868_383465
-------------------------------SLNaDYAHAAHQALVAHGHALLAIDRGTEASKYYAEHDGNT--
>ERR1700733_4417216
---HRASEHHRTAARHHTQAAEYHRESSRHYEIGkDYAHAAHQALIAHGHALLGLKYGDEARAHYAGHHLSD--
>ERR1700693_805720
---QRAAAHHASAAIHHHQAARYHNEASRNYQVGkDYAHAAHQALVAHGHALQAFDHGNEASKFYAEHDGSA--
>SRR5713101_2571449
---HRAAEHHVSAAFHHKQAARHHREASRHYQVGkDYAHAAHQALVAHGHALQAIDRGTEARKYYTEHDGNA--
>SRR5476651_332982
---HGAAEHHNRAAMHHTLAARYHREASRHYQTGkDYAHAAHQALVAHGHALQAIDRGNDASKYYAGHNGNA--
>SRR5580658_6078868
---HDAAGHFTSAAFHHKQAARFHREASRHYEIGkDYAHAAHQALVAYGHGLRAIDYGSDAGTYFAEHDRKA--
>SRR5580658_8805264
---------------------------------------------AHGHGLQAIDHGNDAGTYFAEHDGKT--
>ERR1700723_2911494
---------------HHELAARYHREASRHYQIGkDYAHAAHQALVAYGHGLHAINHGNEARKYYARHDGSA--
>SRR5580692_11411556
------AEFHASAAFHHRQAAQFHREASRHYEVGkDFAHAAHQALIAHGHALQALEFELAAIVYYAGHAVRK--
>SRR5476651_2202389
---HGAAEHHNRAAMHHTQAARYHNEASRHYETGkDYAHAAHQALVAHGHALRALRYGDEARTHYAPHHLSE--
>SRR5277367_1397320
------------------------------------------AFLAMGHDLRAVAHGNEAARYHDG---VP--
>ERR1700675_2935724
-----AKEEHNKAVEHHENAAKAHRSAAEHHGKGDHAKGKEHANSAKQHSQTANQHSDQAH------------
>ERR1035441_10294419
-----AKDEHNKAAEHHENAAKAHRSAAEHHGKGDHMPRARNMRTVQSSIRRPPISIA-IR------------
>tr|A0A2E7Y947|A0A2E7Y947_9RHIZ Uncharacterized protein OS=Methylobacterium sp. OX=409 GN=CMH16_04620 PE=4 SV=1
MNSHPAHEHHMLAATHHAAAAHHHHEAAHHHAHGNAEEAKRHSTSAHEHAEHAHRHTANAHKH----------
>SRR3984957_15326747
-------QAHSKTAAHNESASKAHRAAAEHHGKNDHMKGSEHAAEAQKHSKVAGAASDEAH------------
>SRR5271168_46436
-------QAHTKAAEHHESAAKSHRAAAEFHGKNDHLKGNEHATEAQKHSKVASGATEAAH------------
>SRR5277367_474245
-------ESHEEASKHHESAAKSHKMAAEHHGRGDTASAAKHASEAHEHSSKAHQSSTK--------------
>SRR6202521_2882464
-------EAHQEAATHHENAAKSHKAAAEHHAKGDTASAAKHASEAHEHSSKAHQSSTK--------------
>tr|A0A1I4SLP7|A0A1I4SLP7_9RHIZ Uncharacterized protein OS=Methylobacterium pseudosasicola GN=SAMN05192568_104322 PE=4 SV=1
-------TAHAEAAKHHEAAAKSHKTAAEHHEKGDEATAAKHLKEAHGHSEKVHESSTK--------------
>ERR1700735_2891498
-------EAHEEAAKHHENAAKSHKTAAEHHGKGDTASAAKHSAEAHGHSTKDHERPT---------------
>tr|A0A0L6J387|A0A0L6J387_9RHIZ Uncharacterized protein OS=Methylobacterium sp. ARG-1 GN=AKJ13_24265 PE=4 SV=1
-------NAHREAAKHHEAAAKSHNTAAEHHEKGDNTTAAKHAKEAHGHSEKAHESSTT--------------
>tr|A0A177PXP4|A0A177PXP4_9PLAN Uncharacterized protein OS=Planctomycetaceae bacterium SCGC AG-212-D15 OX=1799653 GN=AYO40_06070 PE=4 SV=1
---HPCSEHHCNAASQHEAAASHHRQAAHHHNQGKHEEAKKHANSVIDRSQDADRHSKTAH------------
>tr|A0A2N8MCK1|A0A2N8MCK1_9RHIZ Uncharacterized protein OS=Beijerinckiaceae bacterium OX=1978229 GN=CR217_06575 PE=4 SV=1
---HPAGEHHHQAAAHHHAAVHHHHQAAHHHDLGEHKEAKEHATAALEHSELAHKHSTTAH------------
>tr|A0A1U7CVY5|A0A1U7CVY5_9BACT Uncharacterized protein OS=Paludisphaera borealis OX=1387353 GN=BSF38_04616 PE=4 SV=1
---HPASEHHHQAAAHHHAAAHHHHAAAHHHDIGEHAEAKQHATAAHEHSEKAHAHTKTAH------------
>SRR5436305_210371
---------HGNAAFHHEAAAHHHRQASRHHTAGDNEEADRHTRMAHTHSQTAHEHS----------------
>SRR3954471_7869634
-------------AFYHESAAHHHRQAARHHEAGDTEEAGRHAEAARSHGSTASQHS----------------
>SRR4051794_3105455
----------RRAAFYHETAAHHHRQAAKHHEGGDVEEAEQHGELAYGHSETAHghsg-KA----------------
>SRR3954465_8563158
----------HDAAHYHEAAAHHHREAARHHEGGEHERARRHATTAHEHSGQAHghsqeahqgSHG----------------
>SRR5208283_2189165
---HPASEHHLQAAAHHHAAAHHHHQAAHHHELGEHEEAQEHAKAAHEHSEQGHEHSTTA-------------
>SRR5271165_7167234
---HPASEHHLQAAAHHHAAAHHHHQAAHHHALGEHDKAKQHSTSAHEHSQHAHKHTTDA-------------
>tr|A5ER26|A5ER26_BRASB Uncharacterized protein OS=Bradyrhizobium sp. (strain BTAi1 / ATCC BAA-1182) OX=288000 GN=BBta_6724 PE=4 SV=1
-----ARDEHNKAAEHHDNAAKAHRSAAELHGKGDHAKGKEHASSAKQHSQTASQHSEQAH------------
>tr|A0A2N3PRK8|A0A2N3PRK8_9PROT Uncharacterized protein OS=Telmatospirillum siberiense OX=382514 GN=CWS72_18515 PE=4 SV=1
------AEHHRSAVSHHEAAARYHREASKHYQIGhDHAHAAHQALIALGQAWQAVDHAKTANGYYadhdidslqkymEQ-------
>tr|A0A126YQH4|A0A126YQH4_9BURK Uncharacterized protein OS=Burkholderia sp. PAMC 28687 OX=1795874 GN=AX768_20475 PE=4 SV=1
------NARETSAPSSHELAARLHIDASRHYLAGkDYSHAAHQALVAHGHALLALAQGKAVSDRYrkrasgetaTV-------
>tr|A0A1G7SI70|A0A1G7SI70_9BURK Uncharacterized protein OS=Paraburkholderia phenazinium OX=60549 GN=SAMN05216466_102505 PE=4 SV=1
------KEHLEAAASHHEQAGRFHREASRHFEEGkDFNHAAHQAVMAHGHALHAIAEANDALKHPAS-------
>tr|A0A2U0W4R2|A0A2U0W4R2_9BURK Uncharacterized protein OS=Paraburkholderia unamae OX=219649 GN=C7402_115204 PE=4 SV=1
------KMHIEAAASHHEHAAQHHREASRHFEEGrDFGHAAHQALMAHGHTLHAIDQAHEAGAHGSN-------
>tr|A0A2U0Y0F7|A0A2U0Y0F7_9BURK Uncharacterized protein OS=Paraburkholderia sp. OV555 OX=2135497 GN=C7513_102216 PE=4 SV=1
------KVHLEAAASHHEQAARFHREASQYYEAGsDQDHAAHQAVLAQGHALHAIDEANVAVKHAGA-------
>ERR1700733_4273555
------TESHTKAAEHHENAAKSHRTAAAQHSKGEHAKGQEESTKAQSHSKTARDHSDMAH------------
>ERR1035441_938148
------KDAHLKAAEHHDNAAKAHRTAAEHHGKGDHAKGMQHSKIAFDHSVKAHEASTHAHKKSSE-------
>ERR1035441_5448854
------KDAHLKAAEHHDNAAKAHRTAAEHHGKGRSEERRVGKEG----------------------------
>ERR1017187_7208358
------AVFSaaFLAGGCWALAavFAAARFAAQRFFNAATI-AALPAALSF--------------------------
>ERR1700753_2918760
-------APHKKAADHHEKAAKSHRAAAEHHDKGDKAAAGKHADEAHGHSTKAHETSAK--------------
>ERR1700761_7356260
-------ATHKEAADHHEKAAKSHRTAAEHHDKGDATAASKHAEEAHGHSTKAHESSSK--------------
>ERR1039458_3843154
-------DAHNKAAEHHENAAKAHRNAAELHGKGDHGAGKKHSATALEDSGKAHDAS----------------
>SRR5260370_22522930
-------NAHEEAASHHENAAKAHRTAAEHHGKGNHEEGRRHSSTAHEHSGKAHEAS----------------
>tr|A0A1I2DTA1|A0A1I2DTA1_9BACT Uncharacterized protein OS=Spirosoma endophyticum GN=SAMN05216167_12052 PE=4 SV=1
-----AHEHHKEAAYHFRKAAEYHENAQQLHEAGDHEKEAHEAYVAYGHHNLADQHAQAAAEHHAEKHDT---
>tr|D2QVB8|D2QVB8_SPILD Uncharacterized protein OS=Spirosoma linguale (strain ATCC 33905 / DSM 74 / LMG 10896) OX=504472 GN=Slin_6800 PE=4 SV=1
-----AHDHHKEAAYHFGEAAKHHQKAQELHQAGDHEKEAHEAYQAQGHHNLGDHHAKAAAEHHAEGHDK---
>SRR5476649_1149520
----HVAEHHEAAAELHEHATRYLLQASRHYEAGNVALSAHEAQTAHAMGLCTIDHSNEAAKHHAVR------
>SRR5450756_2141572
-----------LVSWAREMCIRDSRTAAEHHGKGDHAKGMEHSKIAFDHSVKAHEASTHAHAKSSE-------
>SRR6202046_2156493
-------HDHHKAAEHHEEAAKSHRKAADAHEKGEHADATQHSQMAHDHSTKAHEASSSA-------------
>ERR1700684_1565712
-------NDHHKAAEHHEEAAKSHRKAGDAHDKGEHADASQHSQIAHDQST----------------------
>ERR1700722_13108227
------RDSHTKAAEHHENAAKSHRTAAEHHGKGEHDKGRERPRRRRVAQRQRGSIRTPP-------------
>ERR1700722_6883511
------RDSHTKAAEHHENAAKSHRTAAEHHGKGEHAKGNEESMKAQGHSKSAREHSEMA-------------
>tr|A0A2M6VDT7|A0A2M6VDT7_9BURK Uncharacterized protein OS=Limnohabitans sp. B9-3 OX=1100707 GN=B9Z42_07035 PE=4 SV=1
----TEHQHHVQAAEHLELAAKSHKEAAKLISAGDHKAALQHVETAKTHTAHASDHVKEAQKK----------
>tr|E9I7K5|E9I7K5_DAPPU Uncharacterized protein OS=Daphnia pulex OX=6669 GN=DAPPUDRAFT_279722 PE=4 SV=1
----KPEHHHTKVAEHLEMAAKSHKEVAKHITANDHAAAQTHAKVAEEHMTKAKEHADLA-KK----------
>tr|A0A2M6VZL0|A0A2M6VZL0_9BURK Uncharacterized protein OS=Limnohabitans sp. 15K OX=1100706 GN=B9Z40_07615 PE=4 SV=1
----KPEQHHSKAAEHLELAAKAHKEVAKLISANDHTGAHAHVAVAHEHLTHAHTHADAA-KK----------
>tr|A0A1N6KRN2|A0A1N6KRN2_9BURK Uncharacterized protein OS=Paraburkholderia phenazinium GN=SAMN05444165_4433 PE=4 SV=1
-----KKEHLEAAASHHEQAGRLHREASRHFEDGkDFAHAAHQAMLAHGHTLHAIDRANEALKHHAGAPL----
>tr|A0A1Q8IYL3|A0A1Q8IYL3_9BURK Uncharacterized protein OS=Burkholderia sp. SRS-W-2-2016 GN=BTH42_10720 PE=4 SV=1
-----KKGHLESAASHHEHAARHHQEASRHFEDSrDPGHAGHQAVLAHGHTLLAIDEAQDAGAHSANAP-----
>tr|A0A244DI83|A0A244DI83_9BURK Uncharacterized protein OS=Paraburkholderia terrae GN=CA603_35275 PE=4 SV=1
-----KKEHLDAAASHHEQAARFHREASRHFEAGkDFAHAAHQAMMAHGHALHAIYQANDAGKHNSDTPL----
>SRR5579863_4152399
-----KKGHLEAAASHHEQAARHHREASRHFEDGrDLVHAAHLAMMAHGHTLHAIDQAHEAGAHSANTP-----
>SRR6201994_3502295
-----KKEHLVAAASHHEQAARYHHGASRHFEAGkDYAHTAHQAMLAHGHTLHAIDEAHDAGAHSANTSS----
>ERR1700756_1205677
-----KKGHLEAAASHHEQAARYHHEASRHFEAGkDYAHAAHQAMMAHGHALHAIDRAHDAGAHSAATPP----
>ERR1700716_2440016
-----KKEYLEAAASHHEKAARYHREASQHFEAGkDYAHAAHQSMMAHGHTLHAIDQAHNAGAHSASTPP----
>ERR1700742_968682
-----KKEHLVAAASHHEQAARFHHAASRHFEAGkDFDHTAHQAMLAHGHTLHAINEAHDAGAHNVNVPP----
>ERR1700693_4071883
-------------------------------------HAAHQAMMAHGHALHAIEHVNEALKHNAGAPL----
>ERR1700733_12934978
------------------------------------------AMLAHGHTLHAIVEAHDAGVLSASPPP----
>tr|A0A221AH39|A0A221AH39_9BURK Uncharacterized protein OS=Burkholderia sp. AD24 GN=bAD24_III09205 PE=4 SV=1
-----KKAHLEAAASHHEQAARYHHGASRHFDTAqgqdqDHAHAAHQAMMAHGHTLQAIDEAHEAGAHSTGAPP----
>tr|A0A1H4CXJ3|A0A1H4CXJ3_9BURK Uncharacterized protein OS=Paraburkholderia sartisoli GN=SAMN05192564_102553 PE=4 SV=1
-----KKGHLEAAASHHEQAARYHRAASRLFEGGhDFAHAAHEALIAHGHTLHAIDQAHDAGAHSTSAPP----
>tr|A0A1H1JIH2|A0A1H1JIH2_9BURK Uncharacterized protein OS=Paraburkholderia fungorum GN=SAMN05443245_6487 PE=4 SV=1
-----KKGHLEAAASHHEQAARYHHGASWHFEEGkDFAHAAHQAMLAHGHTLHAIDHAHDAGAHSNAPPT----
>SRR5579859_2687318
--------------------------LTVTGVQTcALPTSAHQAMMAHGHTLHAIDQAHDAGAHRANTPP----
>SRR5476651_246744
-----------SASGHHKQAAKYHREASRHYQSGkDYAHAAHQALAAHGHALQAIDHGKVAERYQAPRDP----
>SRR5579863_1914993
-----KVGHLEAAASHHEQAALFHREASQYYEAAkNYEHAAYLAVLAHGHAQHAIDEVHVAAKHASAPSS----
>ERR1700677_902425
------SMHHAAAVVHHQQAARFHREASRHYQIGkDYAHAAHQALTAHGHALRAMEHGQTASAHYVAHEH----
>ERR1700722_4097143
----------------------FHREASRHYQTGrDYAHAAHQALTAHGHALRAQEHGEAASALYAAHEG----
>ERR1700679_1675721
-----QSAHHVAAADHHQQAAQFHRAASRHYQIGkDYAHAAHQALAAHGHTLKAIDHENEASKYYAEHIG----
>ERR1700730_12142891
-----AAEYHASAAIHHELAARYHREASRHYQIGkDYAHAAHQALVAYGHALHAVDHGNHARNYGGGTSS----
>SRR5271170_1814435
-----SAEPHASAAIHHAAAARFHREASRHFQVGeDHAHAAHQALLAHGHGLRAMERGNQADAYYAT-------
>ERR1700690_2248574
------------------------------------------ALTAHGNAVYIRKHGQPANAGYAAHEG----
>APAga8741244255_1050121.scaffolds.fasta_scaffold61951_1 # 1 # 285 # -1 # ID=61951_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.709
-----KKGHLEAAASHHEQASRYHRAASRYFEAGqDYAHAAHQAMMAHGHTLHAIDQAHEAGTHSANTPP----
>UPI0006EF4159 status=active
-----KKGHIEAAASHHEQAARFHREASRSFEAGkDYDHAAHQAIMAHGHVLHAIDHAHDAGAHTTGTPP----
>ERR1035438_8884209
-----MHEAHWKAAEQHELAARAHRTAAEHNEKGDFTTAIWHSQRALEYSDHAYRLAKEAHAK----------
>SRR5271157_4414361
-----MHEDHRRAAELHELAAAAHRTAAEHNEKGDYTTAVWHSERALEYSDSAYKLAKEARTK----------
>ERR1700690_4483870
-----MHEAHRRAAEQHELAAHSHRTAAEHNEKGDYAAAFWHSQRALEYSDHAYKLAKEAHAR----------
>ERR1700690_2838516
-----MHEAHRKAAEQHELAARAHRTAAEHNEKGDCTTAVWHTQRAMKYSDHAYELSKEAHNK----------
>SRR5579863_5662397
-----MHEAHWKAAEMHQLAAQAHRTAAEHNEKGDFTTAEWHSARAREYSDHTYTLAKQCHNK----------
>SRR5580700_5209628
-----VHETHRQAAENHELAAQAHRTAAEHNEKGNYSTATWHSERALEYSDNAYKLAKEAHSK----------
>ERR1035438_1235828
-----MHDAHRRAAEQHELAALAHRTAAEHNEKGDYSAAILHSERALEYSDQAYKLAKEAHSK----------
>SRR5208282_4175553
-----MHEEHRKAAEQHALAAKAHRTAAEHNEKGDHAAAVWHSERALEYSDHAYKLAKETQNK----------
>ERR1035441_160778
-----MHEAHREAAEKHELAAQAHRTAAEHNEKGDSTAADWHSERDRKSTRLNSSHLGISYA-----------
>ERR1017187_5918702
-----MHEAHRKAAEQHELAARAHRTAAEHNEKGDRTTAELHSERALQYSDHAYALAQEAHTK----------
>ERR1035437_812044
-----MHETHRKAAEQHELAARAHRMAAEHNEKGDNVAGSWHAEQALX-------------------------
>ERR1035438_2952819
-----MHEAHRRAAEQHELAAQAHRTAAEHNEKGDLSNAVWHSQRAMEYSDHAFKLAKEADSK----------
>SRR5208283_3547207
-----MHEAHWRAAELHELAAEEHRTAAEHNEKGNFAPAIWHAERALEYADQAYKLGKEAHTR----------
>ERR1700680_3230249
-----VHDALRKAAEQHELAAQAHRTAAEHNEKGDNEAGSWHSERALECSDHGYRLAKEAHIK----------
>ERR1700680_5293557
-----VHDALRKAAEQNELAAQAHRTAAEHNEKGDNAEGSWHSERALEYSNHAFKLAQEAHNK----------
>SRR5579871_4072473
-----MHQAHRKAAEQHELAAQSHRTAAEHNEKGDFPMAVWHSERALAYSDKAYRLAQEAHNK----------
>SRR5580658_3585280
-----MHeerreaaekhDPHEKAAAQHDLAAQAHRTASEHNEKGDDGKGQWHAERALEHSTQAFRLSKEAHTK----------
>ERR1035438_1797539
-----MHESHRRAAEQHELAAQAHRTAAEHNEKGDNIAGKWHAERALEYSDHAYKLAREAHAKS---------
>ERR1017187_6667227
-----MHESHRRAAEEHELAAQAHRTAAEHNEKGDNIAGKWHAERALVYSDRAYKLANEAHNKS---------
>ERR1019366_7385905
-----MHESHRRAAEEHELAAQAHRTAAERNEKGDYVAERWHAERALEYSDHAYKLAREAHTKS---------
>SRR5271157_1405023
-----LHESHRKAAEQHELAAMAHRTAAEHNEKGDGAAGSWHAERALEYSDHAYKLAREAQTKS---------
>SRR5579862_6534521
-----MHESHRKAAEQHQLAAQAHRTAAEHNEKGDYTAAIWHSERALEYSENAYKLAKEAHNKS---------
>ERR1039458_3948685
-----MHETHRKAAEQHELAAQAHRTAAEHNEKGDYAAAIWHSERALVYSDRAYKLANEAQTSQ---------
>ERR1039458_7317855
-----MHEEHRRAAELHELAAQAHRTAAEHNEKGDRATSIWHSERALEYSDRAYKLAVEVRNKS---------
>ERR1017187_969610
-----INDAHREAAEEHERAAQAHRTAAEHNEKGDGTAGSWHAERALQYSDHAYKLAKEAHNKS---------
>ERR1017187_7516137
-----MHETHRQAAEHHELAAQAHRTAAEHNEKGDYPAAAWHSERALEYSDRAYKLAKEAHSKS---------
>SRR5665647_3655933
-----MHEAHRKAAEQHELAAQAHRTAAEHNEKGDYAAAIWHSKRALEYADRAYQLADEAHTKS---------
>SRR6202050_3988918
-----VHETHRAAAERHELAAQAHRTAAEHNEKGDLSVAAWHSERALEYSDHAYKLAKEAHNKS---------
>SRR5271165_2409151
-----MHETHRKAAEQHELAAQAHRTAAEHNEKGDCTTAEWHSKRALEYSDQDRKLAKEAHNKS---------
>SRR5580658_9165545
-----FESLHGKAAELHDLAAQAHRTAAEHNEKGDHDAENWHLERANEYSEQAFKIAQELHTKS---------
>SRR5271157_975027
-----MHEEHRRAAELHELAAQAHRTAAEHNEKGEGVAGSWHAQRALEYSDHAYKLAMEAHNKS---------
>ERR1039458_1801200
-----IRDAHTRAAEQHERAAQEHRTAAEHNEKGDGVKGSWHAERALEYSDHAYKLAMEAHNKS---------
>ERR1019366_6067402
-----MHDTHRKVAELHALAAHAHRTAAEHNERGDDAAGGWHSERALDYSDQAYKLAKEAHAKS---------
>ERR1022692_786886
-----MHNLeHRKAAEQHELAAHEHRTAAEHNERGEGVKGSWHSERAMQYSDHAYKLSKEAHNKS---------
>SRR5271166_2169624
-----MHETHRRAAEEHELAARAHRTAAEHNEKGDRTAADFHSERALEYSDHAYRLAQEAHSKS---------
>ERR1700683_4802183
-----MHELHRRAFEEHELAAQAHRTAAEHNEKGDDPTENWHTERALEYSDRAFKLAKEAHAKS---------
>SRR5580692_7835881
--NHNAAEHLRSAALHHQRAGQFHREASRHYQIGkDYAHAAHQALIARGHALQASDHEDDAGAYFSEHNGN---
>SoimicMinimDraft_4_1059732.scaffolds.fasta_scaffold1835258_1 # 3 # 221 # -1 # ID=1835258_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.680
--KHNPAGHLTSAVFHHKQAAQFHREASRHYQVGkDYAHAAHQALVAHGHGLQAIDHGNDAGAYFVEHNGK---
>GraSoiStandDraft_25_1057303.scaffolds.fasta_scaffold2786330_1 # 1 # 273 # 1 # ID=2786330_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.641
--SRVQSAHHAAATDHHQQAAQFHRAASRHFQIGkDYAHAAHQALAAHGHALRALERGQAASALYAEHEGS---
>SRR5471032_58518
------KGHLEAAASYHEQAARFHREASHHFGAGkDYDHAAHQAVMAHGYALHAIDEANNVVKHNAG-------
>SRR6478735_7802765
------KGHLESAASHHEQAARFHREAAQHYEAGkDYDHAAHQAVLAHGHALHAIDEASVAAKHAGA-------
>SRR5471032_3571840
------KGHLEAAASHHEQAARFHREASHHFGAGkDYDHAARSEEHTSELQSHH--------------------
>tr|A0A1H5NEP2|A0A1H5NEP2_9BURK Uncharacterized protein OS=Burkholderia sp. WP9 GN=SAMN02787142_6276 PE=4 SV=1
------KEHLDAAASHHEQAARFHREASQIYEAGkDYDHAAHQAVLAQGHALFAIAESNLAAKHTGA-------
>ERR1700744_2473096
----------------------------------------HQAILAQGHALHAMDESNLAAKHTGG-------
>ERR1700690_4486807
-------------------------------ENGkDYAHAAHQAVMARGYSVQAIHHGNEASKYHAG-------
>ERR1700720_1670717
-----MHEAHRRAAQQHELAARAHRTAAEHNEKGDDEEGNWHSERALEYSDQAYRLAKDAHAKS---------
>SRR5689334_4818128
-----MHEGHRKAAEQHDLAAHAHRTAAEHNEKGDSVAEQWHAERALEYSDQAYKLAKEAHAKS---------
>ERR1035438_6644050
-----QSDTHRLAAEQHELAAQAHRTAAEHNEKGDDEGGRWHAERALEYSNHAYKLAKEAHAKS---------
>ERR1700722_7188787
-----MHETHRRAAEQHELAAEAHRTAAEHNEKGENETGKWHSQRAMEYSDHAYKLAKEAHTKS---------
>ERR1700733_11670911
----TMHEAHRTAAEQHELAAHAHRTAAEHNEKGDNEGGKWHAERALEYSDQAYKLAKEAHTKSA--------
>ERR1700722_20636178
----TMHEAHRTAAEQHELAAHAHRTAAEHNEKGDNEGGKWHAERALEYSSTRIRPINSQRKRTR--------
>ERR1700720_3737208
----IMHESHRQAAEQHELAAHAHRTAAEHNERGDNPTANWHATRALAYSDQAYKLAKEAHTKSG--------
>ERR1700733_11573431
----TMHDAHRKAAEQHELAARAHRTAAEHNEKGDDEAGRWHAERALEYSDHAYKLAKEAHAKSA--------
>ERR1700691_2814054
----NMHEAHRKAAEQHELAARAHRTAAEHNEKGDNEAGIWHAERALEYSDQAYKLAQEARTKSG--------
>SRR6202161_1855922
----NMHEAHRKAAEQHELAASAHRTAAEHDEKGDDEAGRWHAERALEYSNDADKLSLEAHNKSG--------
>SRR5690242_7580166
----TMHETHRRAAEQHELAAHAHRTAAEHDEKGDTETGNWHAERALAYSDRAYKLAMEAHTKSG--------
>SRR5690349_19263105
----TMHETHRRAAEQHELAAHAHRTAAEHDEKGDTRRVtgmrsaPWHIRIVPI----GW--LWKRTPNPG--------
>SRR5712691_13313662
----SIRSLHRKAAEYHDLAAHAHRTAAEHNEKGGNEAQNWHLERALEYSNRAYKLAQEAHSKSG--------
>SRR5271165_4566811
----IMHEEHRKAAEQHERAAQAHRTAAEHNERGDGAGGRWHAERALEYSDHAYKLAKAANNKSS--------
>ERR1700679_954346
-------EGRRTARTCGTRS----SHRRRTPRKGDNEGGKWHAERALEYSDHAYQLAKE--------------
>SRR5579872_3040707
----IMQDLHRKIAELHELAAQAHRTAAEHNEKGDNESANWHSQRALDYSNRAYELAKEAHNKSA--------
>SRR5579864_4074794
----VMHDLHQKAAEYHELATQAHRTAAEHNEKGDNESANWHSKRALEYSNRAYELAKEAHNKSA--------
>SRR5271165_620434
----NMHDAHRKAAEQHELAAKAHRTAAEHNEKGDNEAGRWHARRALEFANQAYKLAQEAHNK----------
>ERR1035438_965373
----TMHEAHRKAAEQHELAARAHRTAAEHNEKGHSTAAIWHSERALEYSDHAFKLAKEAHNKSG--------
>ERR1035441_4345400
----TMHEAHRKAAEQHELAARAHRTAAEHNEKGHSRSEERRVG-----------------------------
>SRR5580658_4948604
----TMHDTHRKAAERHELAARVHRTAAEHNEKGDNEAGSWHSERALEFSDHAYKLAQEAHAKSG--------
>SRR5580704_12130880
----PMRETHRQAAERHEQAARAHRTAAEHNEKGDDEAGRWHSERALEYSDHAYKLAQEAHTKSG--------
>SRR5580700_4825899
----TMQDTHRKAAERHEQAARAHRTAAEHNEKGNDDAGRWHSERALEYSDHAYKLAQEAHTKSG--------
>ERR1035441_5708109
----TMHEEHREAAELHELAAREHRTAAEHNEKGNFTAAEYHSQRELEYSDQAYKLAKDAHTKSG--------
>SRR5579872_2570035
-----MHEAHQKAAEQHELAAKAHRTAAEHNEKGDYTAAIWHSQRALEYSEQAYKLAKEAHTK----------
>ERR1700693_2895405
-----MHdahEAHRKAAEQHEISAHAHRTAAEHNEKGDYSGAIWHSERALEYSEQAYKLSKEAHTK----------
>SRR5579864_2262205
-----MHSAHLKAAEQHDLAAHAHRTAAEHNEKGDNDAEKWHSERALEYSDQAYKLAKEAHAR----------
>SRR5579864_6142950
-----MHDARRKAAEQHELAARAHRTAAEHNEKGDPEEASWHSQRALEYSDHAYKLAKEAHAK----------
>SRR5215831_17273075
LMSNKAAEHHKKALQHLTHAARHHGKAAWHHQAGRYERAIHHAHTASGHHYQAGGHADRAVKAHVQH------
>SRR5215831_2017232
PMSKRAAEHHKKASKHLAAAACHHEKAAAAHEIGRYETETDHAYEAGRHRVYAKRHAQRAWKDHVEH------
>LakMenE01Jun11ns_1017448.scaffolds.fasta_scaffold3487450_1 # 2 # 187 # 1 # ID=3487450_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.435
ISTEPEAQS-----AELMIRTKDGRDRLWSFVSSALG-----TQSDGRRLFVCMAQDVTERKAHDEQ------
>HubBroStandDraft_5_1064220.scaffolds.fasta_scaffold1605167_1 # 1 # 417 # 1 # ID=1605167_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.650
LL---------GESQRLTVQL----------QSRQTELQQTNEELATKAKLLAEQ--NAERERKEEH------
>SRR5579872_4560724
---QHRAAQHGAAASHHRQAAQHHRAAATHYRSGkDYAHAAHQALAAHGHTLLAIDYGSHAGTYAAQHGGD---
>SRR5579863_6441461
-----------SAASHHKQAARYHREASTHYRSGkDYAHAAHQALAAHGHALLAIDHGDQAGKYYAQHGGD---
>SRR5271165_1744909
---HhDSGEPHAAAAVHHAEAARFHREASRHYQDGeDHAHAAHQALLAHGHGLRAFERGNQANAYYGTLSVE---
>ERR1035437_993647
-----THDAHLKAADQHELAAHAHRTAAEHHEKGDDVGGRWHAARALEFSDHAYKLAKEAHNKS---------
>ERR1035437_10047576
-----THDTHLNEAEQLELAAHTHRTAAEHHERGDDVGGRGHTARALEFSDHA--------------------
>ERR1017187_4798261
-----THDAHLKAAEQHELAAHAHRTAAEHHEKGDDVGRSE--------------------------------
>ERR1700730_15266082
-----MHDTHRRAAEQHELAAHAHRTAAEHNEKGDNETGNWHSQRALEHSDRAYELAKEAHAKS---------
>SRR5580658_2267416
-----VHGLHSKAAEQHELAARAHRTAAEHNEKGDHETAEFHVERAREFADRAYQLAKEAHSKS---------
>SRR6476661_2253354
------SKLHESAAEHHEHAARHHREAARLHEVKDVLAAVDQAHMATDHQVHAIHYATQAAKEYMA-------
>SRR3954469_21339197
------SKLHESAAKHHEHAARHHREAAKLHEVKDVLAAVDQAHMATDHQVHPIHYGTQAAKEYLA-------
>SRR6478736_2086344
------SRLHENAAKHHEHAVRQHREAARLHEVKDVLAAVDQAHMATDDQAHAIHYATQAAKEYLA-------
>SRR5271166_4700256
-----MHETHRKAAEQHELAAKAHRTASEHNEKGENETGNWHSERALEHSDRAYQLAKEAHNK----------
>ERR1700735_2757028
-----MHDTHRKAAEQHELAALAHRTASEHNEKGENETGNWHSKRALAYSDRAYELAKDAHNK----------
>SRR5579863_3543293
-----MHDAYRRAAEQHELAARAHRTAAEHNEKGDNETGNWHSGRALEYADRAYELAKQACNK----------
>ERR1022692_2141220
----TMHESHRRAAEEHELAAQAHRTAAEHRSEEHTSELQSP-------------------------------
>ERR1019366_59648
----AMHEGHREAAELHERAAHAHRTAAEHHEKGDSATAVWHAERALEYSDHAYK------------------
>SRR5579872_4326349
----TMHDTHRKAGEQHELAAKAHRTASEHNEKRENETGNWHAERELDYSDRAYQLAKEAHNKS---------
>ERR1035441_9666701
----TMHEAHRRAAELHELAAQAHRTAAEHNEKGDCPTSVWHSERALEYSDRAYKLAVEARNKS---------
>SRR5258705_10833459
----AMHETHRKAAEQHELAAHAHRTAAEHNERGDNDTGNWHADRALEYSDRAYKLAQEAHSKS---------
>SRR5271155_3255843
----LMHELHREAAEQHERAAQAHRTASEHNEKGDNPSGNWHAQRALQFSNRAYELAREAHNKS---------
>SRR5256885_13586733
----MKHEAHRRAAEQHELAAQAHRTAAEHNEKGDNETGSWHADRALEYSDRAYEPAKEAHSKS---------
>ERR1039457_4229862
-MTQNLEEHHSRAAQHFDSAAEHHRAAEKAYVTGDLKTSAYEAQCAMGHSVQANDHADLAAMAHLEHHGLN--
>tr|A0A226X0S3|A0A226X0S3_9BURK Uncharacterized protein OS=Caballeronia sordidicola OX=196367 GN=BSU04_21575 PE=4 SV=1
--SLHVAEHHEAAAELHEHAARYLRQATKHYEEGKVALAAHEAQAAHAIALCAIDHSNEAAKHHAIR------
>SRR5438552_9601204
------KDAHNTAAEHHEKAAKSHRTAAEHHGKSDNQAGHQHSTAALEHSTKAHEASKQAHEKSTQNKN----
>ERR1700686_3344577
----KMKDAHNKAAEHHESAAKSHRAAAESFGRNDNVKGKEHATQAQHNAQNANENSKTA-------------
>SRR5450759_2783561
------KDAHNTAAQHHENAAKSHRTAAEHHGKGDHEAGHKHSQEAYDHSTKAHEASKKAHE-----------
>ERR1035441_8005555
------KDQHNTAADHHEKAAKSHRAAAEHHGKGDHEAGHRHSGEAQEHSKNAHQHSQDAHA-----------
>SRR5450756_557274
------QDAHHKAAEHHENAAKAHRTAAEHHGKGDHEAGKKHSATALEHSGKAHEATQAAHE-----------
>ERR1700734_311097
------KSEHEEAATHHENAAKSHRSAAEHHGRGSHEEGRKHSTSAHEHSGKAHEASKKAH------------
>ERR1035437_280669
------KNEHQEAASHHENAAKSHRAAADHHGKGNHEEGKKHSAAAHEHSGKAQEASKTAH------------
>ERR1039457_5823228
----TMHEAHRKAAEKHELAAQAHRTAAEHNEKGDSTAADWHSERAMQYSDHAYKLAMEAHSK----------
>SRR5690242_15786718
----TMHETHHRAAEQHELTAHAHRTAAEHDEKGDTETGNWHAERGLAYSDRAYKLAMEAHTK----------
>SRR5580692_3629653
----KMHENareHRKAAELHQLAAQAHRTAAEHNEKGDEAAGSWHSQRALEYSDQAYKLAKKAHAK----------
>SRR5579872_2516938
----RVHESHQKAAEQHELAARAHRTAAEHNEKGDNPTGNWHSERALEYAEHAYRLAKDAHNTS---------
>SRR6185437_161601
----RVHETHQKAAEQHELAARAHRTAAEHNERGDNPTGNWHSERAFEYAEHAYRLAKDAHNRS---------
>ERR1019366_7611031
----IMQETHRQAAERHEMAARAHRTAAEHNEKGDNPSGNWHSERALEYAERAYKLAKDAHSKS---------
>ERR1700693_3320993
----NMHENHRKAAEQHELAARSHRTAAEHNEKGDFTAAVWHSERALQYSDQAYRLAKEAHNKS---------
>SRR5271163_4018991
----FMHELHREAAEQHELAARAHRTAAEHNEKGDNATGNWHSERALEYADRAYELAKKAHNKS---------
>SRR5580700_1525914
----FMHELHRQAAEQHEMASRAHRTAAEHNEKGDNETGNWHSERAMEHSENAYKLAKEAHQKS---------
>ERR1700674_4905784
----TMHELHREAAEQHKLAARAHRTAAEHNEKGDNPTGNWHATRALEYADQAYKLAKDAHNKS---------
>SRR5450755_1041639
----IMHEEHRQAAEQHELAAHAHRTAAEHHEKGDEKGGSWHSQRAMEFSERAYKLAKEAHSKS---------
>SRR3984957_13701284
-------HDHHKAAAHHDEAAKSHRKAAEAHEKGDHADASQHSQIANDHSAKAYEASQSAH------------
>ERR1700685_3318198
-------HDHHKAAAHHDEAAKSHRNAAEAHEKGDQADASQHSQLAHDHSTQAHEASQSAH------------
>ERR1700722_3266946
-------HDHHKAATHHDEAAKAHRDAAEAHEKGNQADATQHSQLANDHSAKANEASNIAH------------
>ERR1700722_20372013
-------HDHHKAAEHHDEAAKAHRSAAEAHEKGDHADASQHSQIANDHSAKANEASNVAF------------
>SRR3984885_15921837
-------HDHHKAAAHHDEAAKSHRNAADAHEKGNQADASQHSQIGNDHSAKAHEASQSAH------------
>ERR1700722_6238913
-------HDHHKEAEHHEEAAKAHRDAAEAHEKGNQADASQHSQLAYDHSTKAHEASQRAH------------
>ERR1700722_5003591
-------HDHHKAAAHHEEAAKAHRSAAEAHEKGEQADASQHSQIANDHSIKAQEASNAAH------------
>SRR5271169_5076590
-----MKDARNKAAESHEAAAKSHRAAAESHSKNDHAKGKEHSKQAQQHAQNANEHSKTANNKS---------
>SRR4051794_13470881
-----PKDAHTKAAEQHETAAKTHRAAAQQHGSNDHSKGKQQAADALQQSKAAHQHSDDAHGKS---------
>tr|A0A127EN64|A0A127EN64_9RHIZ Uncharacterized protein OS=Rhodoplanes sp. Z2-YC6860 GN=RHPLAN_12460 PE=4 SV=1
-----PKDAHIKAAEHHETGAKSHRAAAQQHGSNDHSKGKQQSSEALQHSKVAHQHSDEAHGKS---------
>SRR5271168_4727755
-----ARDAHNKAAQHHESAAKSHKTAAEHHGKGEHARGREESAKAYAHSKSAHEHSEMAH------------
>ERR1700728_2926415
-----ARDAHTKAAQHHENAAKRHKTPAEHHGEGEHARGREESAKAHSHSKTAHEHSEMAH------------
>SRR5580700_6974991
------HDSHRQAAELHELAAHAHRTAAEHNEKGDNETGNWHAERALEYSDRAYQLAKEAHAK----------
>SRR5579863_2279912
------HETHRSAAEFHELAAHAHRTAAEHNERGDNETGNWHAERALEYSNRAYELAKEAHNK----------
>SRR5580693_7061096
------HDTHQKIAELHELAAHAHRTAAEHNERGDNDTANWHAERALEYSDRAYQLAKDAHSK----------
>ERR1039458_7487882
------QSLHREAAEYHDLAVHAHRTAAEHNEKGDSEAGNWHLDRAREYSDQAFKLAQDVHCK----------
>ERR1039458_3425906
------QALHREAAEYHDLAAQAHRTAVEHNEKGDNETGNWHLDRAREYSDQAFKIAQDIQCK----------
>ERR1035441_3561326
------RSLHREAAEYHDLAAQAHRTAAEHNEKGDNETGNWHLDRPRECSYQAFKLAQDVHCK----------
>ERR1039458_6934656
------RSLHREAAEYHDLAAQPHRTAAEQIGRAHVXX-----------------------------------
>SRR5271165_3251650
----NMHEGHRLAAEQHELAARAHRTAAEHNEKGDGSAAIQHSERALEYSDRAYQLAKEAHNK----------
>SRR5579864_9452655
----KMHDAHRKAAEEHERAAHAHRTAAEHNEKGENEAGNWHSERALEYSDHAYELAKEAHSK----------
>ERR1700686_4490454
----TMHDLHRRAAEEHERAAHAHRTAAEHNEKGDDATGNWHSERALEYADRAHELAREAHTK----------
>tr|A0A1W2C3J3|A0A1W2C3J3_9BURK Uncharacterized protein OS=Polynucleobacter sp. VK13 OX=1938817 GN=SAMN06296008_11834 PE=4 SV=1
-------NYHDHAANHHEQAAKSHMEAARMRSLGNHEASANHALIAHGHALQALRYSEEAINEHAN-------
>tr|A0A1J0D7C3|A0A1J0D7C3_9BURK Uncharacterized protein OS=Polynucleobacter asymbioticus OX=576611 GN=A4F89_09430 PE=4 SV=1
-------HFHGKAANHHEQAMKSHLEAARMRELGNHEASATHALVAHAHTLKALQNSEDAINEHAN-------
>ERR1019366_9111953
-MSHHDHNRYRSAAEHHEHAANHYRRAETSGMAGDHIAAANHARTAHEHARQAAAFSGDADGGHDEHHGMK--
>ERR1035437_6137483
-MSHHDHERYRSAAEHHEHAANHYRRAETSGMAGDHIAAANHARTAHEHARQADAYAGEAAKSNDEHHGMN--
>ERR1035437_7818790
-MSHHDHNRYRSAAEHHEHAANHYRRAETSGMAGDHVAAANHARTAHEHARQAAAFSGEADESHDEHHGMN--
>SRR3979490_1267869
-------QAHTKAAEHHETAAKSHRAAAEQHGKNDHANGQEHSSQAQQHSKTAREHSETAHTKSS--------
>ERR1700752_190489
-------QAHTKAAEHHETAAKSHRAAAEQHGKNDHVKGHEHSSQVQLHSKSAREHSETAHGKSA--------
>ERR1700734_1853930
-----ARDEHNKAAEHHENAAKAHRSAAEHHGKGDHAKGKEHANVAKQHSQAANQHTEQAH------------
>SRR5882762_9456542
-----ARDEHNKAAEHHENAAKAHRSAAQHHGKGDHTKGKEHANVAKQHSQTANQHTDQAH------------
>ERR1700721_689871
---------PNGRAQ--ECA----SDAERSRGDGERDRGW----VRKSNRRAAVQHHGQ--------------
>ERR1700735_832010
-------QSHTKAADHHESAAKSHRAAAEHHGKNDHMKGNEHAAEAQKHSKVAGAASDEAHA-----------
>ERR1019366_4490127
------KDAHLKAAEHHENAAKTHRLAAEHHGKGDHAAGKKQSATALEHSGKAHEASQAAH------------
>ERR1039458_7026760
------KDAHLKAAEHHENAAKTHRLAAEHHGKGDHAAGKKQTATALEYSGKAHEASQAAH------------
>ERR1019366_5964583
------KDANLKAAEHHENAEKTNRLAAGHKEKEDHAGGKKQSETALEPSGKPQETSKAAH------------
>ERR1700729_1498231
-----MKDAHNKAAEHHESAAKSHRAAAEAHDRNDHAKGKEHSGQAQQHAQNANEQTKTAH------------
>ERR1700735_457524
-----MKDAHNTAAEHHESAAKSHRAAAAAHGSNDHAKGKEHSTQDQQHATNDEE------------------
>tr|G9ELI3|G9ELI3_9GAMM Uncharacterized protein OS=Legionella drancourtii LLAP12 OX=658187 GN=LDG_5982 PE=4 SV=1
----KLASYHADAAKHYEHAAKYHHEAQKHHLSGDHDKAALAAHKAQGHACCANGHAKKALKC----------
>ERR1019366_10459532
---QKLRDAHRKAAEQHELAAKAHRTAAEHNEKGEDEAGRWHSERALEYSDRAYKLAKEAHNKS---------
>ERR1039457_2594544
---KAMASEHGKAAEQRELAAHAHRTAADHNEKGENEAGSWHADRALEYSDHAYMLAKEAHNKS---------
>ERR1017187_2984396
-MTQNVEEHHGRAADHFDLAAEHHRAAEMASIAGDHKTAAHEAHCAHGHCVAATDHADLAAMGHVEQHDTH--
>ERR1700685_4321756
------KDAHTSAAYHHERAAKSHRAAAEQSNKGAHDACVEHAVTACGHSTKADEASKLAL------------
>SRR6185312_10729741
------RDAHTTAAYHHERAAKSHRAAAEQSNQGAHAVCAEHALTACGHSNKADEASKLAL------------
>ERR1700686_4148535
------RDAHTTAAYHHERAAKSHRAAAEQSSKGAHEACEQHAVTACAHSMKADEASKLAL------------
>SRR5579862_2273143
------KDAHTTAAYQHERAAKSHRAAAEQSNKGAHEACAQHAATACDHSTKADEASKAAL------------
>ERR1700690_2101855
------KEAHTTAAYHHERAAKSHRAAAEQSNQGAHAACEEHALTACGHSTKADEASKIAH------------
>SRR5208337_2096126
------NESHQRAAEFHELAAHAHRAAAAHHGKEDHQTGHEHSKQALEHARKAFEWSQEAHRKSAKAAG----
>ERR1700687_2569050
------NESHQRAAEFHELAAHAHRAAAAHHGKEDHLTGHELSKQALEHANKASQWSQEAHRKSAKAAG----
>ERR1700733_13128378
------RDSHQRAAEFHELAAHAHRAAAVHHGKEDHQPGHEHSKQALEHADKAFQASQEAHRKSAKSTG----
>ERR1700687_752579
------NESHQRAAEFHELAAHAHRAAAAHHGKEDHQAGHEHPKQALEYSNKASEWTQEAHRKSEKSME----
>SRR5258707_15042374
------NESHQKAAEFHELAAHAHRAAAAHHGKEDHQTGHDHSRQALEHATTAFQYSQEAHQKSEKAGI----
>SRR5713101_657853
------NDSHQRAAEFHDLAAHAHRVAAAHHGKEDHLTGHELARQAMEHSAKAHQATQEALQESAKLAK----
>SRR5713101_2834890
------NDSHQRAAEFHDLATHAHRVAAAHHGKEDHLTGHELARQAMEHSAKAHQATQEALQESAKLAK----
>SRR4029077_1283922
------NDSHQRAAEFHDLAAHAHRVAAAHHGKEDHLSGHELARKAMEHSAKAHQASEEALHQSAVFIK----
>SRR3984893_12462898
------EDSHRRAAEFHELAAHAHRVAAAHHDKEDHLTGHEHSKQAMEHSAKAHQSSQEALQKSVIFTE----
>SRR5271156_5420018
------NDSHQRAAEFHEQAAHAHRAAATSHGKGDHLSGHELSRQALENAHKAFQWSQ---------------
>SRR5678816_4305783
------QDSHRKAAEFHDMAAHAHRAAAVHHDKGDHKTGQQQSRKALEHATKAFELAQEAHRLSSAPKK----
>ERR1700675_2880414
-----MRDAHKKAAEQHELAARAHRTAAEHNEKGDNPTGKWHSERALEYADHAFELAKKAHNK----------
>SRR5450755_2771584
-----MHKLHREAAEQHELAAKEHRTAAEHNEKGDNPTGNWHTQRAVEYSNRAYELAKEAHNK----------
>SRR6266850_1864928
------NNDHNKAAELHENAAKSHRAAAEQHSKGDHAKGMEHSKSAQQHSQSANKQSDQAN------------
>ERR1700676_4116006
------KDAHNKAAEHHESAAKSHRAAAAAHGSNDHAKGKEYSTQAQQHAQNANEHSKTSQAKSAE-------
>SRR5258707_184143
------NQAHNRAAVFHENAARSHRIAAEHYANNDRAKGDEHAMQARAYSRSARDHSEQTHMK----------
>ERR1044071_6326665
------NQAHTKAAEHHETAAKAHRLAAEHHGKNDHAKGNEHSGYAQTHSKSAREHSEQAHTK----------
>SRR5213078_2364262
------NQAHTKAAEHHETAAKAHRLAAEHHVKNDHVKGNEHSAYAQTHSKSARDRSEQAHTK----------
>ERR1019366_4618907
-TKHPAIEHHHAAAAHHAAAAHHHLEAAHEHGQGKHEEAKQHSAAALEHSEQAHKHTVEAHKHS---------
>SRR5664279_4760768
-TKHPSVEQHHAAAGHHAAAAHHHLEAAHEQGQGKQEEAKQHSAAAHEHSE----------------------
>ERR1700679_12343
----IIHELHREAAEKHELAAHAHRTAAEHNEKGDQAAGDWHSQRAMEYSDHAYKLAKEAHTK----------
>ERR1019366_1160250
----ALHDAHRKAAEQHDMAAHAHRTAAEHNEKGDEDSGRWHAERALEYSDHAYKLAKEAHNK----------
>ERR1019366_1197723
----AVHEEHLRAAEQHERAAKAHRTAAEHNEKGNGAEESWHSQRALEYSDHAYRLAKEAHSK----------
>ERR1700688_5101470
----IMHDAHRKAAEQHELAARAHRTAAEHNEKGDHEGRDWHAARALEYSDNAYKLA----------------
>ERR1700676_867798
--EENVHDAHRKTAEQHELAAQAHRTAAEHNEKGENELGNWHLQRALEYSDHAYKLAQEAHSK----------
>ERR1700676_837832
--EENVHDAHRKTAEQHELAAQAHRTAAEHNEKGENELGNWHLQRALEYSDHAYKLAQESHSK----------
>SRR5580704_17448066
--EKKLHDAHRKAAEQHDLAAHAHRTAAEHNEKGENELGSWHLQRALEYSDHAYKLSQDAQTK----------
>ERR1700690_4019350
--ENAVHEEHRKAAEQHELAARAHRTAAEHNEKGENESGNWHAERALEYSDRAYTLAKEAHAK----------
>ERR1700678_3694437
--GNMMHDAHRKAAEQHELAAKAHRTAAEHNEKGENETGNWHSQRALEYSDHAYKLAKDAHTK----------
>SRR6202051_959840
--EKKLHDAHRKAAEQHYLSAHAHRTAPEHNQKGENELGNWHLQRALEYSDHAYKLAREAHSK----------
>SRR5579863_1142833
--RTTMHDFHRRAAEQHELAARAHRTAAEHNEKGENETGNWHAQRALEYSDRAYQLAQEAHTK----------
>ERR1700680_337363
--EENVHDAHREAAEQHELAAQAHRTAAEHNEKGDNAEGSWHSERALEYSNHAFKLAQEAHNK----------
>ERR1700689_1959300
--VTTMHDAHWKAAEQHELAARAHRTAAEHNEKGEDEAGRWHAERALEYSDHAYRLAKEAHTK----------
>SRR5271169_408185
--GNTMHDAHRKAAEQHELAARAHRTAAEHNEKGDNETGNWHLKRALEHSEHAYKLAKEAHDK----------
>SRR5580658_7139169
--ETPMQDAHRKAAEQHELAARAHRTAAEHNEKGDNEGGRWHAERALEYSDHAFRLAKEAHSK----------
>SRR6185369_2844766
---------------PNTMK-----K-GTTRRHAGIRNERWSSPIARI----SWPRQP---------------
>ERR1700722_12608701
--EENMYDTHRQAADQHELAAHAHRTAAEHNEKGKNELGNWHLQRALEYSDHAYKLAKEAHSK----------
>SRR3984957_3206025
---------HTPPSrRSARTCCARSSDGREHNEKGKNELGNWHLQRALEYSDHAYKLAKEAHSK----------
>GraSoiStandDraft_50_1057286.scaffolds.fasta_scaffold7233880_1 # 1 # 222 # 1 # ID=7233880_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.689
-----MSDTHRQAADQHELAAHAHRTAAEHNEKGKNDLVNWHLQRAAEYSDHAYKLAKKAHTI----------
>ERR1035438_3812566
------QALHREAAEYHDLAAQAHRTAAEHNEKGDNEAGNWHLDRARAYSDQDFKVAQDVHC-----------
>ERR1035441_5344174
------HDLHRKAAEYHELAAQAHRTAAEHNEKGDNETGNWHSKRALEYSNQAFKLAQEAHG-----------
>ERR1700675_786553
------HVLHRKAAEAHELAAKSHRTAAEHNEKGDNETGNWHSQRALDYSEHAYRLAKEAHP-----------
>ERR1700686_2254141
-----ADDSHQRAAELHEQAAHAHRAAAAHHGKEEHQTGQEHSKQAMEHSAKAYQQSLEADKQSayfATKHGKK--
>SRR3982074_2549457
-----ARDSHQRAAELHEQAAHAHRTAAAHHGKEDHQSGQEHSKQAMEHSAKAHEQSLEANKQSaffAKQHEKK--
>ERR1700734_1591532
--------DHHKAAAHHDEAAKSHRDAAVAHEEGDTERASQHSQIANDHSKKAQEASNAAHR-----------
>ERR1700722_3394704
----------QFRAALRGFESKSHRDAAAAHEEGDTEKASQHSQVANEHSKKAQEASNSAHQ-----------
tests/test_data/alignments/mgnify_hits.sto
0 → 100644
View file @
96809433
# STOCKHOLM 1.0
#=GF ID query-i1
#=GF AU jackhmmer (HMMER 3.3.2)
#=GS MGYP000406148242/1-68 DE [subseq from] PL=00 UP=0 BIOMES=0101000000000
#=GS MGYP000119383271/47-117 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000430010134/3-69 DE [subseq from] PL=00 UP=0 BIOMES=0000110000000
#=GS MGYP000184282189/1-71 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000372988949/3-70 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000222615028/3-68 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000384795733/25-88 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000680660046/4-73 DE [subseq from] PL=00 UP=0 BIOMES=0000110000000
#=GS MGYP000586297297/4-70 DE [subseq from] PL=00 UP=0 BIOMES=0000110000000
#=GS MGYP000526302968/5-69 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000081082088/4-68 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000172493671/1-71 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000694390052/2-70 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000246175980/4-68 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000358235060/4-70 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000635416234/5-68 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000656061151/3-65 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000718018739/4-64 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000234420019/4-70 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000689530757/1-71 DE [subseq from] PL=00 UP=0 BIOMES=0000100000000
#=GS MGYP000266820214/24-89 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000190165740/1-71 DE [subseq from] PL=00 UP=0 BIOMES=0000000000001
#=GS MGYP000589249599/4-69 DE [subseq from] PL=00 UP=0 BIOMES=0000110000000
#=GS MGYP000048618675/3-70 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000377290797/1-69 DE [subseq from] PL=00 UP=1 BIOMES=0110000000000
#=GS MGYP000697367932/3-70 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000747506700/4-68 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000255037255/6-64 DE [subseq from] PL=10 UP=0 BIOMES=0000101000000
#=GS MGYP000602985373/3-68 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000420186793/4-69 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000452617499/5-64 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000119404247/1-68 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000134149386/3-60 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000461455637/26-91 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000119389418/96-161 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000546988737/26-93 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000624371167/1-68 DE [subseq from] PL=00 UP=0 BIOMES=0101000000000
#=GS MGYP000650157322/5-70 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000246214200/7-73 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000113479303/34-96 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000187226991/3-69 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000381848663/3-69 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000066325489/28-89 DE [subseq from] PL=00 UP=0 BIOMES=0000000000001
#=GS MGYP000013251582/4-69 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000499794189/19-84 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000555816272/4-69 DE [subseq from] PL=00 UP=0 BIOMES=0000000000001
#=GS MGYP000653248377/3-70 DE [subseq from] PL=00 UP=0 BIOMES=0110000000000
#=GS MGYP000113511630/3-70 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP001057101778/4-69 DE [subseq from] PL=00 UP=0 BIOMES=1000000000000
#=GS MGYP000210824545/3-69 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000676742083/9-64 DE [subseq from] PL=10 UP=0 BIOMES=0000101000000
#=GS MGYP000545010933/4-70 DE [subseq from] PL=00 UP=0 BIOMES=0000110000000
#=GS MGYP000541064880/3-68 DE [subseq from] PL=00 UP=0 BIOMES=0000000000001
#=GS MGYP000541064880/99-161 DE [subseq from] PL=00 UP=0 BIOMES=0000000000001
#=GS MGYP000729801087/3-52 DE [subseq from] PL=10 UP=0 BIOMES=0000101000000
#=GS MGYP000715079888/40-96 DE [subseq from] PL=10 UP=0 BIOMES=0000101000000
#=GS MGYP000033872322/3-43 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
#=GS MGYP000464421157/4-69 DE [subseq from] PL=00 UP=0 BIOMES=0000101000000
query MAAHKGAEHHHK-AAEHHEQAAKHHHAAAEHHEKGE-HEQAAHHADTAYAHHKHAEEHAAQAAKHD-AEHHAPKPH
MGYP000406148242/1-68 MATHKGAESHKK-AAEHHTTAAKHHTEAAKSHESGN-HEKAAHHAHTATAHGKHASDHSDDAAKTY-ASEH-----
#=GR MGYP000406148242/1-68 PP 899*********.***********************.***************************98.8877.....
MGYP000119383271/47-117 MATHKGTEHHKK-AAEHHELAAKHHREAAKLHEAGS-HEKAAHHAQIAAGHGLHAVYHTEEATKHH-ADEHTGK--
#=GR MGYP000119383271/47-117 PP 899*********.***********************.*****************************.**99866..
MGYP000430010134/3-69 ---KKAAEHHRK-AAEHHQNAAKHHNAAAESHEAGN-HEKAAHHAHTAHGHHTQAGEHGGEAAKAH-RDEHGQ---
#=GR MGYP000430010134/3-69 PP ...699******.***********************.***************************88.877765...
MGYP000184282189/1-71 MPKHEGAEHHKK-AAEHHEKAAQHHKEAAKHHEEGR-HETAGHHAYVAHGHHLTAIQHSEEAAKYH-SQQHGEK--
#=GR MGYP000184282189/1-71 PP 568*********.***********************.****************************9.9999876..
MGYP000372988949/3-70 ---KKAAEHHLK-AAEHHEHAARHHKEAAKHHQAGS-YEKAAHHAHTARAHAEHADEHAVEAAKAH-AEEHGSK--
#=GR MGYP000372988949/3-70 PP ...699******.***********************.*****************************.**99865..
MGYP000222615028/3-68 ---KKAVEHHHK-AAEHHEHAARHHKEAAKHHEAGK-HETAAHHAHLARGHHEHAMHHAAEAAKAH-VEDHG----
#=GR MGYP000222615028/3-68 PP ...6899*****.***********************.***************************99.99986....
MGYP000384795733/25-88 ----SGSQQHDA-AAQHYEEAARHHRQAAKHYQASR-HEKAAHHAQLGYAHHLYAEQHAAEAAKAH-AKNH-----
#=GR MGYP000384795733/25-88 PP ....6999****.***********************.***************************99.9998.....
MGYP000680660046/4-73 -STHKGAEHHKE-AAAHHKKAAEHHLAAAEHHEAGD-HEKAGHHAHVAHGHHLNAVHHAEEAGKHHGAEHSGP---
#=GR MGYP000680660046/4-73 PP .57*********.***********************.**************************9752788777...
MGYP000586297297/4-70 ----QAAEHHQK-AAEHHEHAARHHREAAAHHEEGN-HETAAHHAHTAQGHLHHATHHASEAAKHH-VEHHGNK--
#=GR MGYP000586297297/4-70 PP ....689*****.***********************.*****************************.****977..
MGYP000526302968/5-69 -----REEHHLK-AAEHHEHAAKHHLAAAEHHAGGD-HEKAGHHAHVAHGHSTHAEHHAEEASKHT-ANHDAA---
#=GR MGYP000526302968/5-69 PP .....469999*.***********************.*****************************.***985...
MGYP000081082088/4-68 ----QAAEHHHK-AAEHHEHAARHHKEAAKHHEAGK-HETAAHHAHLARGHHEHAMHHAAEAAKAH-IQDHG----
#=GR MGYP000081082088/4-68 PP ....689*****.***********************.**************************977.66664....
MGYP000172493671/1-71 MTKHEGAEHHKQ-AAQQHQDAARHHLEAAKHHEAGA-HEKAGHHAHIAYGHHLQATHHAEEAAKHH-AMQHGDK--
#=GR MGYP000172493671/1-71 PP 678*********.***********************.*****************************.*999876..
MGYP000694390052/2-70 --SHAAAEHHKK-AAEHHEHAARHHQEAAKHHEAGN-HEKAAHHAHVAHGHHVHAVEHAEHAAKHH-AETHGAK--
#=GR MGYP000694390052/2-70 PP ..699*******.***********************.*****************************.**99865..
MGYP000246175980/4-68 ----QAAEHHHK-AAEHHEHAARHHKEAAKHHEAGK-HETAAHHAHLARGHHVHAMHHAGEAAKAH-IEDHG----
#=GR MGYP000246175980/4-68 PP ....689*****.***********************.***************************88.88885....
MGYP000358235060/4-70 ----QAAEHHGK-AAEHHEHAARHHREAANHHEAGD-HQQAAHHAHTAQGHLHHATHHSAEAAKLH-VEHHGHK--
#=GR MGYP000358235060/4-70 PP ....689*****.***********************.*****************************.****877..
MGYP000635416234/5-68 -----VADHHHK-AAEHHERAAKHHREAATHYESDR-HETAAHHAHMAHGHHQHAVHHASEAAKAH-IEHHD----
#=GR MGYP000635416234/5-68 PP .....489****.***********************.*****************************.****6....
MGYP000656061151/3-65 ---KKAAEHHRK-AAEHHEHAARHHKEAAKHHDAGA-HEKAAHHAHTAHAHHLHATHFADEAAKAH-AD-------
#=GR MGYP000656061151/3-65 PP ...699******.***********************.**************************977.75.......
MGYP000718018739/4-64 -----GAKHHNA-AAQHYEEAARHHRKAAELYQCGH-HEKVSHHANLASGHPLHAKQHAEEAAKAL-IE-------
#=GR MGYP000718018739/4-64 PP .....99*****.***********************.**************************976.55.......
MGYP000234420019/4-70 ----AAAEHHRK-AAEHHEHAARHHEEAAEHHESGA-HETAAHHAHSAQGHTHHALYHASEAAKEH-AEHHGDK--
#=GR MGYP000234420019/4-70 PP ....479*****.***********************.*****************************.****875..
MGYP000689530757/1-71 MPTHTGAEHHRK-AAEHHQLAAKHHLEAAKLHDAGS-HEKAAHHSEIAAGHGHHAVYHTEEATKQH-ADMNAEK--
#=GR MGYP000689530757/1-71 PP 578*********.***********************.****************************9.9999877..
MGYP000266820214/24-89 ---KKAAEHHLK-AAEHHEHAARHHKEAAKHHQAGS-HEKAAHHAHTARAHEEHAEFHSAEAAKAH-GQEHG----
#=GR MGYP000266820214/24-89 PP ...699******.***********************.**************************977.77775....
MGYP000190165740/1-71 MARHEGAEHHKQ-AAEHHQHAARHHLEAAKHHEAGA-HEKAGHHAHIAQGHHLHAIHHAEEAAKHH-AAQHGDK--
#=GR MGYP000190165740/1-71 PP 799*********.***********************.*****************************.*999876..
MGYP000589249599/4-69 ----QAAEHHTK-AAEHHQHAARHHLEAAKHHEAGR-HEAAGHHAHLAHGHHQHATHHASEAAKSH-IEHHGK---
#=GR MGYP000589249599/4-69 PP ....689*****.***********************.*****************************.****75...
MGYP000048618675/3-70 ---KKASEHHRK-AAEHHKLAATHHEEAAAHYDKGN-HEKAAHHAHVAHGHTLHATHYAAEAAKMH-VEEHGSK--
#=GR MGYP000048618675/3-70 PP ...6899*****.***********************.***************************99.9999866..
MGYP000377290797/1-69 MSDHAGVEHYHK-AAEHHEHAARHHREAAKHHEEGN-HEKAAHHAHSAHGHASHAQHHHTEASRHH-AEHHG----
#=GR MGYP000377290797/1-69 PP 678*********.***********************.*****************************.****7....
MGYP000697367932/3-70 ---KKASEHHRK-AAEHHKLAATHHEEAAAHHDKGN-YEKAAHHAHVAHGHTHHATYHAAEAAKIH-AEDYGSK--
#=GR MGYP000697367932/3-70 PP ...6899*****.***********************.***************************99.9988765..
MGYP000747506700/4-68 ----QAAEHHHK-AAEHHEHAALHHKEAAKHHEAGK-HEMAAHHAHLARAHHEHAMHHAVEAVKAH-LQDHG----
#=GR MGYP000747506700/4-68 PP ....689*****.***********************.**************************977.76664....
MGYP000255037255/6-64 ---SKIAEHHTK-AAEHHETAAQHHREAAKHHEAGS-IEKAAHHAQVAYGHGAHAWNYQEEAAK------------
#=GR MGYP000255037255/6-64 PP ...5789*****.***********************.******************999999998............
MGYP000602985373/3-68 ---KKAVEHHNK-AAEHHEHAARHHKEAAKHHEAGK-HETAGHHAHLARGHQEHAMHHSAEAAKAH-IEDHS----
#=GR MGYP000602985373/3-68 PP ...6899*****.***********************.***************************99.98886....
MGYP000420186793/4-69 ----QAAEHHLK-AAEHHEHAAHHHKEAAKHHQGGS-HEKAAHHAHTARGHHEHAQHHAAEAAKAH-AQEHGN---
#=GR MGYP000420186793/4-69 PP ....689*****.***********************.***************************99.999975...
MGYP000452617499/5-64 -----AAAHHLK-AVEHHEHAARHHREAAKHHEAGN-HEKAAHHAHLAHGHHLHATEYAGEAAKAH-I--------
#=GR MGYP000452617499/5-64 PP .....678999*.***********************.**************************965.5........
MGYP000119404247/1-68 MAGHKIHEHHEK-AADHHEHAAKHHREAAKHHKAGD-HEKAAHHSKVAHGHHLHATEHHDEASKKH-AEDH-----
#=GR MGYP000119404247/1-68 PP 799*********.***********************.***************************99.9998.....
MGYP000134149386/3-60 ---KKATEHHRK-AAEHHEHAARHHKEAAKHHEAGK-HETAAHHAHLARGHQERAAQQAAEAA-------------
#=GR MGYP000134149386/3-60 PP ...6899*****.***********************.***********************998.............
MGYP000461455637/26-91 -----AAKHHDL-AAQHYEEAARHHREAAQDYQSGR-HEKASHHAHLAYAHHLHAEQHAEEAAKAH-IKNHLDD--
#=GR MGYP000461455637/26-91 PP .....589****.***********************.***************************99.9999765..
MGYP000119389418/96-161 ---KQAAEHHRK-AAEHHEHAARHHKEAAKHHEAGK-HETAAHHAHLARAHHEVATHHAVEAAKAH-LEEHG----
#=GR MGYP000119389418/96-161 PP ...5689*****.***********************.***************************88.88775....
MGYP000546988737/26-93 ---EKAAEHHEK-AAEHNERAAQHHREAAKHHEEGH-HETAGHHAQIAHGHHLNATHHSEEAAKHH-AQQHGEK--
#=GR MGYP000546988737/26-93 PP ...589******.***********************.*****************************.****876..
MGYP000624371167/1-68 MAKHPGADYHRM-AAEHHEKAALHHKKAAEYYEAGN-LKKAAIHAELAAVFHKQADEHVYNKQEEI-DVHH-----
#=GR MGYP000624371167/1-68 PP 799*********.***********************.*********************98877665.5566.....
MGYP000650157322/5-70 -----ATEHHRR-AAEHHEHSAKHHKAVADHHEAGN-HEKAGHHASVAEGHLNHASHHAEEASKHH-AADHGHK--
#=GR MGYP000650157322/5-70 PP .....579****.***********************.*****************************.9999765..
MGYP000246214200/7-73 ----KIAEHHAQ-AAQHHEKAAEHHKEAAKHYGTGA-VEKGAHHAQVAQGHAVHAEYHADEAAKAH-AEHHAGK--
#=GR MGYP000246214200/7-73 PP ....779*****.***********************.*****************************.****976..
MGYP000113479303/34-96 --NHKGIENHRK-AAKHHEEAAKHHHDAAKHHEAGN-HDKACESTVKAHGHHCLASDHMREVSKQH-A--------
#=GR MGYP000113479303/34-96 PP ..5*********.***********************.**********************9999875.5........
MGYP000187226991/3-69 ---KKAADHHKQ-AAEHHTHAAKHHTEAARHHESGN-HEKAAHHAHSSRAHASQADDHAEQAAKAH-MDEHGK---
#=GR MGYP000187226991/3-69 PP ...689******.***********************.***************************88.888865...
MGYP000381848663/3-69 ---KKAAEHHHK-ASEHHTHAARHHSEAAKHHEGGH-HEKAAHHAHTARAHALHSRHHSDEAAKMH-GEEHGK---
#=GR MGYP000381848663/3-69 PP ...699******.***********************.***************************99.999876...
MGYP000066325489/28-89 ----KTIANHKQ-AARHHMEAAKHHMEAARHHEEGN-HEKAAHSTLLAYGHHTIAGEFVSDDAKHH-AQ-------
#=GR MGYP000066325489/28-89 PP ....56678999.***********************.********************999999988.75.......
MGYP000013251582/4-69 ----EAANHHKQ-AAEHHEHAARHHHEAAKHHLAGN-HEKAAHHAHLAHGHHVHATEHAENAAKEH-VKAHGA---
#=GR MGYP000013251582/4-69 PP ....57889999.***********************.***************************99.888865...
MGYP000499794189/19-84 ---NDAAEHHRK-AAEHHEHAAAHHREAAEHHANGN-HEKAAHHAHIAHGHGLHAAHHAGEATKHH-ANTHG----
#=GR MGYP000499794189/19-84 PP ...5689*****.***********************.*****************************.*9986....
MGYP000555816272/4-69 -----EAAHHHKQAAEHHEHAARHHHEAAKHHEAGN-HEKAAHHAHLAHAHHVLAAEHAENAAKEH-LKAHGT---
#=GR MGYP000555816272/4-69 PP .....4555554399*********************.***************************99.888865...
MGYP000653248377/3-70 ---KKAAEHHKK-ASEHLTHAARHHGEAAKHHEAGS-HEKAAHHAHTARAHIIHGRGHAEEAVKAH-AEEHGKK--
#=GR MGYP000653248377/3-70 PP ...699******.***********************.*****************************.**99865..
MGYP000113511630/3-70 ---KKAAEHHRK-AAEHHKHAAGHHEEAAAHHDKGN-HEKAAHHAHVAHGHTLHAAHHAEEAAKAH-VEEHGSK--
#=GR MGYP000113511630/3-70 PP ...699******.***********************.***************************99.9999866..
MGYP001057101778/4-69 ---DKIIEHHRS-AADHHEKAAQHHREAAKHHASDS-HEKAAHHAHSAHGHSAHATHHAGEASKHH-AEHHG----
#=GR MGYP001057101778/4-69 PP ...5678*****.***********************.*****************************.****6....
MGYP000210824545/3-69 ---KKAAESHKK-ASEHLTHAARHHTEAAKHHETGQ-HEKAAHHAHIARAHATHAREHSENAAKAH-LEEHGK---
#=GR MGYP000210824545/3-69 PP ...689******.***********************.***************************99.999976...
MGYP000676742083/9-64 ------RDEHNK-AAEHHENAAKAHRSAAEHHGKGD-HAKGKQHADTAKQHSQTAHQHTDQAHS------------
#=GR MGYP000676742083/9-64 PP ......5789**.***********************.**********************99854............
MGYP000545010933/4-70 --KHPSTEHHTS-AAEEHDNASRHHRAAAKNYEEGK-HETAAHHAHSASGHSSNARDQAEEASRKH-AKQHG----
#=GR MGYP000545010933/4-70 PP ..58999*****.***********************.*************************9888.88775....
MGYP000541064880/3-68 -AEHNAAEHHGF-AAHHHQRAAQFHREASRHYEAGKDYAHAAHQALVAHGHALLAIDHGNEAGKYY-AG-------
#=GR MGYP000541064880/3-68 PP .789********.*********************963789***********************997.64.......
MGYP000541064880/99-161 ------SEHHAA-AADDHEQAAQHHAQAAKHLNEKD-YELAAHEAQLAHRHAHYSIFHDDEAAKHH-VEHYG----
#=GR MGYP000541064880/99-161 PP ......69****.***********************.**************999************.***86....
MGYP000729801087/3-52 ---KKVAEHHLK-AAEHLEHAARHHKEAAKHHEAGN-HEKAAHHAHIARAHHEHA---------------------
#=GR MGYP000729801087/3-52 PP ...5889*****.***********************.*****************7.....................
MGYP000715079888/40-96 -----SAEYHKK-AANCHYEAAKHHNIAAKHHEAGN-HKKASEYALKAYWYHCLASEAEKEDVK------------
#=GR MGYP000715079888/40-96 PP .....69*****.***********************.***************998876655555............
MGYP000033872322/3-43 ---KKAAEHHRK-AAEHHEHAARHHKEAAKHHDAGA-HEKAAHHAH------------------------------
#=GR MGYP000033872322/3-43 PP ...699******.***********************.*******96..............................
MGYP000464421157/4-69 ----EAAEHHKH-AAEHLTHAARHHSEAAKHHEAGQ-HEKAAHHAHLAHGHQEHASEHAVEAAKKH-IEAHGN---
#=GR MGYP000464421157/4-69 PP ....689*****.***********************.***************************99.999875...
#=GC PP_cons 7887889*****.***********************.**************************999.9998766..
#=GC RF xxxxxxxxxxxx.xxxxxxxxxxxxxxxxxxxxxxx.xxxxxxxxxxxxxxxxxxxxxxxxxxxxx.xxxxxxxxx
//
tests/test_data/alignments/pdb70_hits.hhr
0 → 100644
View file @
96809433
Query query
Match_columns 73
No_of_seqs 55 out of 57
Neff 2.88591
Searched_HMMs 80799
Date Thu Dec 30 19:40:02 2021
Command /home/ga122/openfold/lib/conda/envs/openfold_venv/bin/hhsearch -i /tmp/tmpedq9nsbw/query.a3m -o /tmp/tmpedq9nsbw/output.hhr -maxseq 1000000 -d /data/ga122/alphafold/pdb70/pdb70
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 1HF9_B ATPASE INHIBITOR (MITOC 7.5 3.8E+02 0.0047 16.2 0.0 22 7-28 10-31 (41)
2 2CRB_A nuclear receptor bindin 6.4 4.7E+02 0.0058 18.0 0.0 20 11-30 32-51 (97)
3 4ZEY_A nuclear receptor bindin 6.3 4.7E+02 0.0059 17.3 0.0 20 11-30 26-45 (84)
4 3U8V_A Metal-binding protein s 4.1 8.1E+02 0.01 17.3 0.0 32 15-46 50-81 (93)
5 1PSM_A SPAM-H1 (RESIDUES 90 - 1.9 2.1E+03 0.026 13.4 0.0 18 11-28 14-31 (38)
6 5KC1_F Autophagy-related prote 1.5 2.7E+03 0.033 16.9 0.0 17 12-28 25-41 (226)
7 5KC1_J Autophagy-related prote 1.5 2.7E+03 0.033 16.9 0.0 17 12-28 25-41 (226)
8 3ZEE_A PARTITIONING DEFECTIVE 1.1 3.8E+03 0.046 12.5 0.0 15 58-72 30-44 (84)
9 4I6P_A Partitioning defective 1.0 4.3E+03 0.054 12.4 0.0 16 57-72 32-47 (88)
10 2Q2K_A Hypothetical protein/DN 1.0 4.3E+03 0.054 13.4 0.0 17 56-72 54-70 (70)
No 1
>1HF9_B ATPASE INHIBITOR (MITOCHONDRIAL); ATPASE INHIBITOR, F1 ATPASE INHIBITOR; NMR {BOS TAURUS} SCOP: h.4.8.1
Probab=7.51 E-value=3.8e+02 Score=16.19 Aligned_cols=22 Identities=23% Similarity=0.398 Sum_probs=13.1 Template_Neff=4.500
Q query 7 AEHHHKAAEHHEQAAKHHHAAA 28 (73)
Q Consensus 7 aEhH~kAAeHHe~AA~HH~eAA 28 (73)
.++|++-++.|+.|.+-|++-.
T Consensus 10 I~~He~qIk~heeaI~RHk~~i 31 (41)
T 1HF9_B 10 ISHHAKEIERLQKEIERHKQSI 31 (41)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3456666666666666665543
No 2
>2CRB_A nuclear receptor binding factor 2; NRBF-2, MIT domain, helix bundle; NMR {Mus musculus} SCOP: a.7.16.1
Probab=6.35 E-value=4.7e+02 Score=18.01 Aligned_cols=20 Identities=20% Similarity=0.165 Sum_probs=11.0 Template_Neff=5.700
Q query 11 HKAAEHHEQAAKHHHAAAEH 30 (73)
Q Consensus 11 ~kAAeHHe~AA~HH~eAAkH 30 (73)
..|++-|.+|++...+|.+-
T Consensus 32 ~~Aie~H~kAA~~f~~A~~~ 51 (97)
T 2CRB_A 32 EEAISCHRKATTYLSEAMKL 51 (97)
T ss_dssp HHHHHHHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHHHHHHHHHHHc
Confidence 34555556666555555543
No 3
>4ZEY_A nuclear receptor binding factor 2; Structural Genomics, Joint Center for; HET: SO4, MSE; 1.5A {Homo sapiens}
Probab=6.31 E-value=4.7e+02 Score=17.29 Aligned_cols=20 Identities=25% Similarity=0.200 Sum_probs=10.8 Template_Neff=6.200
Q query 11 HKAAEHHEQAAKHHHAAAEH 30 (73)
Q Consensus 11 ~kAAeHHe~AA~HH~eAAkH 30 (73)
..|++-|.+|+....+|.+-
T Consensus 26 ~~A~e~H~~AA~~f~~A~~~ 45 (84)
T 4ZEY_A 26 EEAISCHKKAAAYLSEAMKL 45 (84)
T ss_dssp HHHHHHHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHHHHHHHHHHHh
Confidence 44555555555555555543
No 4
>3U8V_A Metal-binding protein smbP; four helical bundle, metal chaperone; 1.9A {Nitrosomonas europaea}
Probab=4.11 E-value=8.1e+02 Score=17.29 Aligned_cols=32 Identities=34% Similarity=0.433 Sum_probs=15.8 Template_Neff=4.400
Q query 15 EHHEQAAKHHHAAAEHHEKGEHEQAAHHADTA 46 (73)
Q Consensus 15 eHHe~AA~HH~eAAkHheaG~HekAahhAh~A 46 (73)
+|-..+.++-.+|.++-..|+-+.|..++-.|
T Consensus 50 ~H~~~aik~LeeAI~hgk~ghad~A~kha~~A 81 (93)
T 3U8V_A 50 THVGHGIKHLEDAIKHGEEGHVGVATKHAQEA 81 (93)
T ss_dssp CHHHHHHHHHHHHHHHHHTTCHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHcCcHHHHHHHHHHH
Confidence 34444455555555555555555544444433
No 5
>1PSM_A SPAM-H1 (RESIDUES 90 - 127; POLYMORPHIC ANTIGEN; NMR {Plasmodium falciparum} SCOP: j.18.1.1
Probab=1.89 E-value=2.1e+03 Score=13.41 Aligned_cols=18 Identities=39% Similarity=0.433 Sum_probs=8.0 Template_Neff=1.300
Q query 11 HKAAEHHEQAAKHHHAAA 28 (73)
Q Consensus 11 ~kAAeHHe~AA~HH~eAA 28 (73)
.+|++--|+|++.=.+|+
T Consensus 14 e~aa~dae~a~k~ae~a~ 31 (38)
T 1PSM_A 14 EQAAKDAENASKEAEEAA 31 (38)
T ss_dssp HSTTTTTTHHHHHTTTTT
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 444444444444444443
No 6
>5KC1_F Autophagy-related protein 38; Atg38, coiled-coil, dimerization, NRBF2, autophagy; HET: NO3, NH4, EDO, NA; 2.2A {Saccharomyces cerevisiae}
Probab=1.52 E-value=2.7e+03 Score=16.87 Aligned_cols=17 Identities=12% Similarity=0.040 Sum_probs=0.0 Template_Neff=5.100
Q query 12 KAAEHHEQAAKHHHAAA 28 (73)
Q Consensus 12 kAAeHHe~AA~HH~eAA 28 (73)
.|++-|.+|++.-.+|.
T Consensus 25 eAie~h~kAAe~l~~a~ 41 (226)
T 5KC1_F 25 NAKAKYQEAIEVLGPQN 41 (226)
T ss_dssp -----------------
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34444444444444443
No 7
>5KC1_J Autophagy-related protein 38; Atg38, coiled-coil, dimerization, NRBF2, autophagy; HET: NA, NO3, EDO, NH4; 2.2A {Saccharomyces cerevisiae}
Probab=1.52 E-value=2.7e+03 Score=16.87 Aligned_cols=17 Identities=12% Similarity=0.040 Sum_probs=0.0 Template_Neff=5.100
Q query 12 KAAEHHEQAAKHHHAAA 28 (73)
Q Consensus 12 kAAeHHe~AA~HH~eAA 28 (73)
.|++-|.+|++.-.+|.
T Consensus 25 eAie~h~kAAe~l~~a~ 41 (226)
T 5KC1_J 25 NAKAKYQEAIEVLGPQN 41 (226)
T ss_dssp -----------------
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34444444444444443
No 8
>3ZEE_A PARTITIONING DEFECTIVE 3 HOMOLOG; CELL CYCLE; 6.1A {RATTUS NORVEGICUS}
Probab=1.14 E-value=3.8e+03 Score=12.49 Aligned_cols=15 Identities=27% Similarity=0.237 Sum_probs=7.4 Template_Neff=7.600
Q query 58 AQAAKHDAEHHAPKP 72 (73)
Q Consensus 58 ~eAak~ha~~H~~kp 72 (73)
.+|.+.|....+.+|
T Consensus 30 ~~a~~Ry~~~~~~~~ 44 (84)
T 3ZEE_A 30 QQAVTRYRKAVAKDP 44 (84)
T ss_dssp HHHHHHHHHHHCSSS
T ss_pred HHHHHHHHHHcCCCc
Confidence 455555555544433
No 9
>4I6P_A Partitioning defective 3 homolog; PB1 like motif, DUF3534, Cell; 2.9A {Rattus norvegicus}
Probab=1.01 E-value=4.3e+03 Score=12.37 Aligned_cols=16 Identities=25% Similarity=0.206 Sum_probs=0.0 Template_Neff=7.500
Q query 57 AAQAAKHDAEHHAPKP 72 (73)
Q Consensus 57 a~eAak~ha~~H~~kp 72 (73)
+.+|.+.|....+.+|
T Consensus 32 ~~~a~~Ry~~~~~~~~ 47 (88)
T 4I6P_A 32 IQQAVTRYRKAVAKDP 47 (88)
T ss_dssp HHHHHHHHHHHHCCCT
T ss_pred HHHHHHHHHHHcCCCc
No 10
>2Q2K_A Hypothetical protein/DNA Complex; protein-DNA, partition, segregation, parB, DNA; HET: EPE; 3.0A {Staphylococcus aureus}
Probab=1.01 E-value=4.3e+03 Score=13.41 Aligned_cols=17 Identities=24% Similarity=0.343 Sum_probs=0.0 Template_Neff=1.100
Q query 56 HAAQAAKHDAEHHAPKP 72 (73)
Q Consensus 56 Ha~eAak~ha~~H~~kp 72 (73)
|-.||-+.|.++-|..|
T Consensus 54 hireal~ryiee~g~~p 70 (70)
T 2Q2K_A 54 HIREALRRYIEEIGENP 70 (70)
T ss_dssp HHHHHHHHHHHHCCHHC
T ss_pred HHHHHHHHHHHHHCCCC
tests/test_data/alignments/uniref90_hits.sto
0 → 100644
View file @
96809433
# STOCKHOLM 1.0
#=GF ID query-i1
#=GF AU jackhmmer (HMMER 3.3.2)
#=GS UniRef90_D7BIZ4/1-73 DE [subseq from] Uncharacterized protein n=1 Tax=Meiothermus silvanus (strain ATCC 700542 / DSM 9946 / VI-R2) TaxID=526227 RepID=D7BIZ4_MEISD
#=GS UniRef90_A0A345WS72/1-69 DE [subseq from] Uncharacterized protein n=1 Tax=Sphingomonas sp. FARSPH TaxID=2219696 RepID=A0A345WS72_9SPHN
#=GS UniRef90_A0A1F2V377/4-68 DE [subseq from] Uncharacterized protein n=1 Tax=Acidobacteria bacterium RIFCSPLOWO2_12_FULL_60_22 TaxID=1797188 RepID=A0A1F2V377_9BACT
#=GS UniRef90_A0A3C0R222/4-69 DE [subseq from] Alpha-carbonic anhydrase domain-containing protein n=1 Tax=Spartobacteria bacterium TaxID=2052183 RepID=A0A3C0R222_9BACT
#=GS UniRef90_A0A3G2VJ28/2-67 DE [subseq from] Uncharacterized protein n=1 Tax=Methylobacterium brachiatum TaxID=269660 RepID=A0A3G2VJ28_9RHIZ
#=GS UniRef90_A0A317IC02/3-67 DE [subseq from] Uncharacterized protein n=1 Tax=Candidatus Melainabacteria bacterium TaxID=2052166 RepID=A0A317IC02_9BACT
#=GS UniRef90_A0A4P6K0I8/4-69 DE [subseq from] Uncharacterized protein n=1 Tax=Ktedonosporobacter rubrisoli TaxID=2509675 RepID=A0A4P6K0I8_9CHLR
#=GS UniRef90_A0A142HH28/3-70 DE [subseq from] Uncharacterized protein n=1 Tax=Hymenobacter sp. PAMC 26554 TaxID=1484116 RepID=A0A142HH28_9BACT
#=GS UniRef90_A0A402A866/4-68 DE [subseq from] Uncharacterized protein n=1 Tax=Tengunoibacter tsumagoiensis TaxID=2014871 RepID=A0A402A866_9CHLR
#=GS UniRef90_UPI00131BC0F4/3-69 DE [subseq from] hypothetical protein n=1 Tax=Acidisphaera sp. S103 TaxID=1747223 RepID=UPI00131BC0F4
#=GS UniRef90_A0A5E6MFW5/5-71 DE [subseq from] Uncharacterized protein n=1 Tax=Methylacidimicrobium tartarophylax TaxID=1041768 RepID=A0A5E6MFW5_9BACT
#=GS UniRef90_A0A6M1MC51/1-69 DE [subseq from] Uncharacterized protein n=1 Tax=Methylobacterium sp. DB0501 TaxID=2709665 RepID=A0A6M1MC51_9RHIZ
#=GS UniRef90_A0A368HF25/2-66 DE [subseq from] Uncharacterized protein n=1 Tax=Acidiferrobacter thiooxydans TaxID=163359 RepID=A0A368HF25_9GAMM
#=GS UniRef90_A0A2N3PRK8/17-83 DE [subseq from] Uncharacterized protein n=1 Tax=Telmatospirillum siberiense TaxID=382514 RepID=A0A2N3PRK8_9PROT
#=GS UniRef90_A0A2N3PRK8/115-180 DE [subseq from] Uncharacterized protein n=1 Tax=Telmatospirillum siberiense TaxID=382514 RepID=A0A2N3PRK8_9PROT
#=GS UniRef90_A0A7Y3P168/15-76 DE [subseq from] Uncharacterized protein n=1 Tax=Bacteroidia bacterium TaxID=2044936 RepID=A0A7Y3P168_9BACT
#=GS UniRef90_A0A4R8DP52/4-70 DE [subseq from] Uncharacterized protein n=1 Tax=Dinghuibacter silviterrae TaxID=1539049 RepID=A0A4R8DP52_9BACT
#=GS UniRef90_A0A1I4D138/7-73 DE [subseq from] Uncharacterized protein n=1 Tax=Methylocapsa palsarum TaxID=1612308 RepID=A0A1I4D138_9RHIZ
#=GS UniRef90_UPI0011BDFA18/9-74 DE [subseq from] hypothetical protein n=1 Tax=Adhaeribacter aerolatus TaxID=670289 RepID=UPI0011BDFA18
#=GS UniRef90_A0A1Q3KM49/4-69 DE [subseq from] Uncharacterized protein n=1 Tax=Alphaproteobacteria bacterium 65-37 TaxID=1895711 RepID=A0A1Q3KM49_9PROT
#=GS UniRef90_A0A225DK00/3-70 DE [subseq from] Uncharacterized protein n=1 Tax=Fimbriiglobus ruber TaxID=1908690 RepID=A0A225DK00_9BACT
#=GS UniRef90_A0A3E1NFY1/4-67 DE [subseq from] Uncharacterized protein n=1 Tax=Deminuibacter soli TaxID=2291815 RepID=A0A3E1NFY1_9BACT
#=GS UniRef90_UPI0015707348/3-70 DE [subseq from] hypothetical protein n=1 Tax=Hymenobacter sp. 9A TaxID=2735894 RepID=UPI0015707348
#=GS UniRef90_A0A7G4RF23/9-68 DE [subseq from] Uncharacterized protein n=1 Tax=Legionella sp. PC997 TaxID=2755562 RepID=A0A7G4RF23_9GAMM
#=GS UniRef90_A0A177QKT9/4-69 DE [subseq from] Uncharacterized protein n=1 Tax=Nitrospira sp. SCGC AG-212-E16 TaxID=1799664 RepID=A0A177QKT9_9BACT
#=GS UniRef90_UPI000A0039BF/3-69 DE [subseq from] hypothetical protein n=1 Tax=Bradyrhizobium sp. NAS80.1 TaxID=1680159 RepID=UPI000A0039BF
#=GS UniRef90_A0A537SU55/5-72 DE [subseq from] Uncharacterized protein n=1 Tax=Alphaproteobacteria bacterium TaxID=1913988 RepID=A0A537SU55_9PROT
#=GS UniRef90_UPI0009DA3672/5-71 DE [subseq from] hypothetical protein n=2 Tax=Verrucomicrobia TaxID=74201 RepID=UPI0009DA3672
#=GS UniRef90_UPI000943D660/10-75 DE [subseq from] hypothetical protein n=1 Tax=Rufibacter TaxID=1379908 RepID=UPI000943D660
#=GS UniRef90_A0A2K8YE90/1-69 DE [subseq from] Uncharacterized protein n=5 Tax=Bradyrhizobium TaxID=374 RepID=A0A2K8YE90_9BRAD
#=GS UniRef90_A0A2W6AI54/4-69 DE [subseq from] Uncharacterized protein n=1 Tax=Candidatus Dormibacteraeota bacterium TaxID=2052315 RepID=A0A2W6AI54_9BACT
#=GS UniRef90_UPI001AECAC8A/84-146 DE [subseq from] hypothetical protein n=2 Tax=Beijerinckia sp. 28-YEA-48 TaxID=1882748 RepID=UPI001AECAC8A
#=GS UniRef90_A0A411HJN1/1-70 DE [subseq from] Uncharacterized protein n=1 Tax=Pseudolysobacter antarcticus TaxID=2511995 RepID=A0A411HJN1_9GAMM
#=GS UniRef90_A0A7D4C1D1/3-70 DE [subseq from] Uncharacterized protein n=1 Tax=Hymenobacter sp. BRD128 TaxID=2675878 RepID=A0A7D4C1D1_9BACT
#=GS UniRef90_A0A3S0S9L9/2-68 DE [subseq from] Uncharacterized protein n=1 Tax=Hyphomicrobium sp. TaxID=82 RepID=A0A3S0S9L9_HYPSQ
#=GS UniRef90_UPI0015F67598/3-69 DE [subseq from] hypothetical protein n=2 Tax=Rhodospirillales incertae sedis TaxID=451274 RepID=UPI0015F67598
#=GS UniRef90_A0A2W5ZIQ4/3-69 DE [subseq from] Uncharacterized protein n=1 Tax=Candidatus Dormibacteraeota bacterium TaxID=2052315 RepID=A0A2W5ZIQ4_9BACT
#=GS UniRef90_A0A5C1ACH7/3-69 DE [subseq from] Uncharacterized protein n=2 Tax=Gemmataceae TaxID=1914233 RepID=A0A5C1ACH7_9BACT
#=GS UniRef90_A0A7X8SVC5/4-59 DE [subseq from] Uncharacterized protein n=1 Tax=Rhizobium sp. P38BS-XIX TaxID=2726740 RepID=A0A7X8SVC5_9RHIZ
#=GS UniRef90_UPI001647FE78/10-75 DE [subseq from] hypothetical protein n=1 Tax=Rufibacter TaxID=1379908 RepID=UPI001647FE78
#=GS UniRef90_A0A534V5G6/4-70 DE [subseq from] Uncharacterized protein n=1 Tax=Deltaproteobacteria bacterium TaxID=2026735 RepID=A0A534V5G6_9DELT
#=GS UniRef90_A0A7D3WQ23/3-70 DE [subseq from] Uncharacterized protein n=3 Tax=Hymenobacter TaxID=89966 RepID=A0A7D3WQ23_9BACT
#=GS UniRef90_UPI00067F5429/4-57 DE [subseq from] hypothetical protein n=1 Tax=Bradyrhizobium viridifuturi TaxID=1654716 RepID=UPI00067F5429
#=GS UniRef90_G3IVL7/18-76 DE [subseq from] Uncharacterized protein n=2 Tax=Methylobacter tundripaludum TaxID=173365 RepID=G3IVL7_METTV
#=GS UniRef90_A0A431QXA7/3-69 DE [subseq from] Uncharacterized protein n=2 Tax=Bradyrhizobiaceae TaxID=41294 RepID=A0A431QXA7_9BRAD
#=GS UniRef90_A0A2V7ZTA3/4-68 DE [subseq from] Uncharacterized protein n=2 Tax=unclassified Acidobacteria TaxID=305072 RepID=A0A2V7ZTA3_9BACT
#=GS UniRef90_A0A516TLI4/49-117 DE [subseq from] Uncharacterized protein n=2 Tax=Methylacidiphilum kamchatkense TaxID=431057 RepID=A0A516TLI4_9BACT
#=GS UniRef90_UPI00155D9B40/8-70 DE [subseq from] hypothetical protein n=1 Tax=Leptospirillum ferrooxidans TaxID=180 RepID=UPI00155D9B40
#=GS UniRef90_A0A2H9SEK4/12-70 DE [subseq from] Uncharacterized protein n=1 Tax=Legionella sp. TaxID=459 RepID=A0A2H9SEK4_9GAMM
#=GS UniRef90_A0A1H1JX39/3-70 DE [subseq from] Uncharacterized protein n=3 Tax=unclassified Rhizobiales TaxID=41292 RepID=A0A1H1JX39_9RHIZ
#=GS UniRef90_UPI000975F98E/3-57 DE [subseq from] hypothetical protein n=2 Tax=Bradyrhizobium TaxID=374 RepID=UPI000975F98E
#=GS UniRef90_A0A142H998/3-70 DE [subseq from] Uncharacterized protein n=3 Tax=unclassified Hymenobacter TaxID=2615202 RepID=A0A142H998_9BACT
#=GS UniRef90_UPI00031CAACE/3-70 DE [subseq from] hypothetical protein n=2 Tax=Zavarzinella formosa TaxID=360055 RepID=UPI00031CAACE
#=GS UniRef90_S9SB59/5-73 DE [subseq from] Uncharacterized protein n=2 Tax=Magnetospirillum fulvum TaxID=1082 RepID=S9SB59_MAGFU
#=GS UniRef90_I0IMJ9/13-73 DE [subseq from] Uncharacterized protein n=1 Tax=Leptospirillum ferrooxidans (strain C2-3) TaxID=1162668 RepID=I0IMJ9_LEPFC
#=GS UniRef90_A0A2Z3R562/2-66 DE [subseq from] Uncharacterized protein n=1 Tax=Acidiferrobacter sp. SPIII_3 TaxID=1281578 RepID=A0A2Z3R562_9GAMM
query MAAHKGAEHHHKAAEHHEQAAKHHHAAAEHHEKG-EHEQAAHHADTAYAHHKHAEEHAAQAAK-HDA-EHHAPKPH
UniRef90_D7BIZ4/1-73 MAAHKGAEHHHKAAEHHEQAAKHHHAAAEHHEKG-EHEQAAHHADTAYAHHKHAEEHAAQAAK-HDA-EHHAPKPH
#=GR UniRef90_D7BIZ4/1-73 PP 89********************************.****************************.***.*******9
UniRef90_A0A345WS72/1-69 MAEHKGAEHHRTAAEHHEHAAKHHRSAAEQHEAG-NHEKAGHHAAAAGGHASHAREHGEQASR-HHA-EHHG----
#=GR UniRef90_A0A345WS72/1-69 PP 799*******************************.****************************.***.***6....
UniRef90_A0A1F2V377/4-68 ----TGAEHHEAAAQHHEQAARHHHEAAKQDHSG-HHEKAGHYAHLAYAHFKHAEQHAAEAAK-THA-KNHT----
#=GR UniRef90_A0A1F2V377/4-68 PP ....69****************************.****************************.999.9995....
UniRef90_A0A3C0R222/4-69 ----KLKEHHTKAAEHHEHAAKHHRKAAEHHVSG-KHETAAHHAHLAHGHHMHARHHATEAAK-RHV-ELHGN---
#=GR UniRef90_A0A3C0R222/4-69 PP ....6679**************************.****************************.*99.99975...
UniRef90_A0A3G2VJ28/2-67 --AHQGAEHHHKAAEHHEKAAQHHREAAKHHESG-NHEKAAHHAHTAHGHATHASHHHTEASR-HHA-EQH-----
#=GR UniRef90_A0A3G2VJ28/2-67 PP ..8*******************************.****************************.***.*99.....
UniRef90_A0A317IC02/3-67 ---KKASEHHKKAAEHHRKAADHHEQASKHHDSG-SHEKAAHHAQTATGHHLHAEHHAHEATK-CHS-DEY-----
#=GR UniRef90_A0A317IC02/3-67 PP ...6899***************************.**************************99.666.555.....
UniRef90_A0A4P6K0I8/4-69 --NHPSVEHHKKAAEHHTKAAEHHTKAAEHHTKG-EHEAAAHHAHLAHGHHAQATEHANEAAK-KHA-SHT-----
#=GR UniRef90_A0A4P6K0I8/4-69 PP ..58999***************************.****************************.999.996.....
UniRef90_A0A142HH28/3-70 ---KKAADSHKKAAEHHTEAAKHHTEAAKHHEAG-SHEKAAHHAHTAAAHKDHATEHATTARK-AHA-EEHGKK--
#=GR UniRef90_A0A142HH28/3-70 PP ...6899***************************.****************************.***.*99865..
UniRef90_A0A402A866/4-68 --GHPSIEHHRKAAEHHRKAAEHHEKAAEHHAKG-EHETAASHAHMAHGHHIQATEHLEEAAKKHTA-Q-------
#=GR UniRef90_A0A402A866/4-68 PP ..6999****************************.************************99862665.5.......
UniRef90_UPI00131BC0F4/3-69 --NHQGATHHKKAAEHHEMAAKHHAQAAHHHESG-EHEAAGHHAHAAAGHAAHAKDHAEHAAK-HHA-ETHA----
#=GR UniRef90_UPI00131BC0F4/3-69 PP ..6*******************************.****************************.***.***8....
UniRef90_A0A5E6MFW5/5-71 -----IAEHHEKAAMHHEHAATHHKKAAEHHRKG-EHVESGHHAHIAHGHAEHAEVHAKEAAK-EEA-TVHDKEP-
#=GR UniRef90_A0A5E6MFW5/5-71 PP .....59***************************.****************************.***.9997665.
UniRef90_A0A6M1MC51/1-69 MATHQGAEHHKKAAEHHEHAARHHREAAKHYEAG-SHEKAAHHAHTAHGHASHATHHHTEASR-HHA-EQHG----
#=GR UniRef90_A0A6M1MC51/1-69 PP 899*******************************.****************************.***.*996....
UniRef90_A0A368HF25/2-66 ---HEGAEHHKNAAKHHTEAAKHHTEAAKHHDAG-QHEKAAHHAHLAYAHSVHAAHYREEAAK-HYA-AHN-----
#=GR UniRef90_A0A368HF25/2-66 PP ...9******************************.****************************.***.996.....
UniRef90_A0A2N3PRK8/17-83 --EHRAAEHHRSAVSHHEAAARYHREASKHYQIGHDHAHAAHQALIALGQAWQAVDHAKTANG-YYA-DHD-----
#=GR UniRef90_A0A2N3PRK8/17-83 PP ..59****************************995699********************99999.999.885.....
UniRef90_A0A2N3PRK8/115-180 ------AEHHAVAADNHEQAAKHHRRAAQHCDEK-NYMMAACEAHLAHGHAQHSIFHGIEAAK-HHV-DHQTQNP-
#=GR UniRef90_A0A2N3PRK8/115-180 PP ......89**************************.****************************.***.**98776.
UniRef90_A0A7Y3P168/15-76 ---NKGIENHKKAAKHHEEAAKHHHEAAKHHEAG-NHDKAFESTIKAYGHHCLANEAQ----R-EDL-KHHA----
#=GR UniRef90_A0A7Y3P168/15-76 PP ...79*****************************.****************9988754....5.566.6665....
UniRef90_A0A4R8DP52/4-70 ----EHAEHHKKAASHSEKAAEHHHEAAKHYEAG-DHEAGAHHAHAAHAHHLHAEDHAKHAAK-LHA-EHHGEK--
#=GR UniRef90_A0A4R8DP52/4-70 PP ....569***************************.****************************.***.***865..
UniRef90_A0A1I4D138/7-73 ----KIAEHHTQAAQHHEKAAEHYKEAAKHHETG-AVEKGAHHAQVSQGHAVHAEYHADEAAK-AHA-QHHANK--
#=GR UniRef90_A0A1I4D138/7-73 PP ....779***************************.****************************.***.***976..
UniRef90_UPI0011BDFA18/9-74 ---KKSAEHHQIAADHLEQAAKNHRAAAEHLAAG-DHQKAAHHGYTAYGLSSHAQYHAQQAAL-HHS-HEHK----
#=GR UniRef90_UPI0011BDFA18/9-74 PP ...4789***************************.***************************9.877.5553....
UniRef90_A0A1Q3KM49/4-69 ---DKIIEHHRSAADHHEKAAQHHREAAKHHESD-SHEKAAHHAHSAHGHSAHATHHAGEASK-HHA-EHHG----
#=GR UniRef90_A0A1Q3KM49/4-69 PP ...5678***************************.****************************.***.***6....
UniRef90_A0A225DK00/3-70 ---KKAAESHKKAAESHKKAGEHHEQAAKHHEAG-NHEKAAHHAHTAKGHQTHAERHTNDAAA-HHA-EEHGAK--
#=GR UniRef90_A0A225DK00/3-70 PP ...689****************************.****************************.***.*99865..
UniRef90_A0A3E1NFY1/4-67 -------KNHEDAAKHHEEAAKHHRSAAEEAGKG-NHEKAAHHAQAAHGHTEHAKEHAREASK-KYA-QQHEEK--
#=GR UniRef90_A0A3E1NFY1/4-67 PP .......57999**********************.****************************.***.999876..
UniRef90_UPI0015707348/3-70 ---KKAVDSHKKAAAHHTEAAAHHTEAAKHQEAG-SHEKAAHHAHTAAAHTDHAAEHATQARK-SHA-EDHGTK--
#=GR UniRef90_UPI0015707348/3-70 PP ...578899*************************.****************************.***.*99865..
UniRef90_A0A7G4RF23/9-68 ----KLKQHHTLAAEHHKKASEHHNEAAKYHQSG-DHEQGHHHAHLARGHHEHAQHHSSEAAK-HS----------
#=GR UniRef90_A0A7G4RF23/9-68 PP ....56789*************************.****************************.*7..........
UniRef90_A0A177QKT9/4-69 ----QAADHHRKAAEHHEHAARDHKEAAKYYEAG-EHEKAAHYAHRAHAHHLHVAHHSAEATK-SHL-EHHDK---
#=GR UniRef90_A0A177QKT9/4-69 PP ....689***************************.****************************.***.***75...
UniRef90_UPI000A0039BF/3-69 ---KKAAEHHKQAAEHHTQAARHHGEAAKHYEGG-QHEKAAHHAHTASGHGHHANYHTEEAGK-AHM-EEHGK---
#=GR UniRef90_UPI000A0039BF/3-69 PP ...689****************************.****************************.999.99976...
UniRef90_A0A537SU55/5-72 --THKGGSHHETAADHHETAAHHHREAAKHYESG-DHEKAGHHAHVAHAHGLHAAHHGQEAAK-HHA-EQHAE---
#=GR UniRef90_A0A537SU55/5-72 PP ..7*******************************.****************************.***.***96...
UniRef90_UPI0009DA3672/5-71 -----IAEHHEQAAMHHEHAAIHHKKAAEHHRKG-EHAESGHHAHIAHGHAQHAEHHAELAAK-EEA-TMHDKEP-
#=GR UniRef90_UPI0009DA3672/5-71 PP .....59***************************.****************************.***.9997766.
UniRef90_UPI000943D660/10-75 ---KKSAENHRKAAEYFEQAAANHRAAAEHLAKG-DHEKSAHHGYTAYGLSSHGRHHAEDAAL-HHS-HEHK----
#=GR UniRef90_UPI000943D660/10-75 PP ...4789***************************.**************************99.877.5553....
UniRef90_A0A2K8YE90/1-69 MSDHAGVEHHHKAAEHHEHAAHHHREAAKHHAAG-DHEKAAHHAHSAHGHASHAEHHHTEASR-HHA-EHHG----
#=GR UniRef90_A0A2K8YE90/1-69 PP 678*******************************.****************************.***.***7....
UniRef90_A0A2W6AI54/4-69 ----EAAQHHQQAAEHHEHAGRHHREAAKAHEAG-DHAKAAHHAHTARGHHEHASHHAAEAAK-SHV-EHHGH---
#=GR UniRef90_A0A2W6AI54/4-69 PP ....689***************************.****************************.***.***86...
UniRef90_UPI001AECAC8A/84-146 ------HEHHTKAAEHHELAAKHHREAAKHHESG-EHEKAAHHSKIAHGHSLHATEHHEHASK-KHA-EHHS----
#=GR UniRef90_UPI001AECAC8A/84-146 PP ......59**************************.****************************.***.***5....
UniRef90_A0A411HJN1/1-70 MSSHTVAEHHQKAAEHHTLAAEHHHEAAKHHSDG-AHEKAAHHAHLGHSHHLHATHHSQEATK-QFGHDHHA----
#=GR UniRef90_A0A411HJN1/1-70 PP 789*******************************.**************************99.75526776....
UniRef90_A0A7D4C1D1/3-70 ---KKAAEHHKHAATHHAEAAKHHTAAATHHEAG-HHEKAAHHAHTAAAHTEHATEHTSHARK-AHA-EEHGTK--
#=GR UniRef90_A0A7D4C1D1/3-70 PP ...689****************************.****************************.***.*99865..
UniRef90_A0A3S0S9L9/2-68 --AQKPHEHHQKAAEHHEQAAQHHKEAAKQHQAG-QHEKAAHHAHLAEAHHIHAKEHHEEAAK-AHL-AMHG----
#=GR UniRef90_A0A3S0S9L9/2-68 PP ..67889***************************.***************************9.766.6665....
UniRef90_UPI0015F67598/3-69 --KDKIVEHHNAAAEHHEHAAKHHREAATHHEAD-NHEKAGHHAHSAHGHSSHAAHHAGEASK-HHA-EHHG----
#=GR UniRef90_UPI0015F67598/3-69 PP ..56889***************************.****************************.***.***7....
UniRef90_A0A2W5ZIQ4/3-69 ---KKAAEHHGQAADHHEKAAQHHRQAKTHHEAG-DHQAAAHDAHTARGHHEHAAHHASEAAK-AHA-EEHGH---
#=GR UniRef90_A0A2W5ZIQ4/3-69 PP ...699****************************.****************************.***.*9975...
UniRef90_A0A5C1ACH7/3-69 ---KKAAASHKKAAEHHKKAGEHHENAAKHHEAG-NHEKAAHHAHTAKGHQSQAEKHGDEAAA-SHA-EEHGT---
#=GR UniRef90_A0A5C1ACH7/3-69 PP ...588999*************************.*************************999.999.99976...
UniRef90_A0A7X8SVC5/4-59 -------ESHTKAAEHHENAAKSHRSAAEHHGKG-DHEKGREHSKTAHAHSQSAHEHSDAAHK-K-----------
#=GR UniRef90_A0A7X8SVC5/4-59 PP .......889************************.**********************987766.5...........
UniRef90_UPI001647FE78/10-75 ---QKSAESHRKAAQYYQQAAEQHRAAAEHLNSG-DHEKAAHHGYTAYGLSEHARHHAKEAAL-HHS-HEHK----
#=GR UniRef90_UPI001647FE78/10-75 PP ...589****************************.**************************99.877.5553....
UniRef90_A0A534V5G6/4-70 ----QAAEHHTKAAEHHEHAARHHKEAAKHHEAG-NHEKAAHHAHVAHGHHLQAIHHHEEATK-FHL-EHHGKK--
#=GR UniRef90_A0A534V5G6/4-70 PP ....689***************************.****************************.***.***865..
UniRef90_A0A7D3WQ23/3-70 ---KKAAESHKHAAQHHTEAAKHHTEAAKSHEAG-NHEKAAHHAHTAAAHTEHATEHAGHARK-SHA-EEHGKK--
#=GR UniRef90_A0A7D3WQ23/3-70 PP ...6899***************************.****************************.***.*99865..
UniRef90_UPI00067F5429/4-57 -------EEHNKAAEHHENAAKAHRSAAEHHGKG-DHAKGMEHADTARQHSQTAHQHSEQAH--------------
#=GR UniRef90_UPI00067F5429/4-57 PP .......899************************.***********************9985..............
UniRef90_G3IVL7/18-76 ------QQHHQKAAEHHEQAAKHHKEAAKHYESG-DDKTAAQHAHIAHGYSTQAMEQEMEASK-KYA---------
#=GR UniRef90_G3IVL7/18-76 PP ......589*************************.**********************999999.766.........
UniRef90_A0A431QXA7/3-69 ---KKAAEHHKQSAEHHTHAARHHGEAAKHHESG-AHEKAAHHAHTARGHALHARHHSDEAAK-LHM-EEHGK---
#=GR UniRef90_A0A431QXA7/3-69 PP ...689****************************.****************************.999.98875...
UniRef90_A0A2V7ZTA3/4-68 ----EAVDHHRKAAEHFEHAAQHHSAAASHYGAG-RYDQASREAYLAHGHYLHGSNHAAEAAR-LHT-RHFG----
#=GR UniRef90_A0A2V7ZTA3/4-68 PP ....5689**************************.***************************9.888.8865....
UniRef90_A0A516TLI4/49-117 ---DTVAEEHEKAAMHHEHAAVHYRKAAEHHRAG-EHADSGHHAHIAHGHAKHAQAHAEAAAK-EEA-NMHDKKP-
#=GR UniRef90_A0A516TLI4/49-117 PP ...56699**************************.****************************.***.***9998.
UniRef90_UPI00155D9B40/8-70 ------QEHHQKAAEHHEHAAEHHKEAAKHHASG-DHKTASHHAHIAHGHSVHAREHEEEASK-KYV-VLHG----
#=GR UniRef90_UPI00155D9B40/8-70 PP ......69**************************.**************************99.876.6665....
UniRef90_A0A2H9SEK4/12-70 ------HKHHLKAAEHHKKAAEHHSEAAKHHEAG-EHEKGQASAYLALAHGRHAKDESCEACS-HYA---------
#=GR UniRef90_A0A2H9SEK4/12-70 PP ......57999***********************.***************************9.976.........
UniRef90_A0A1H1JX39/3-70 ---KKAAEHHKKAAEHATHVARHHGEAAKHHEAG-HHEKAAHHAHTAMGHAFHARGHAEEAAK-AHA-EEHGKK--
#=GR UniRef90_A0A1H1JX39/3-70 PP ...699****************************.****************************.***.*99865..
UniRef90_UPI000975F98E/3-57 ------KEEHNKAAEHHENAAKAHRSAAEHHGKG-DHAKGMEHANTAMQHSQTAHQHSEQAH--------------
#=GR UniRef90_UPI000975F98E/3-57 PP ......589*************************.***********************9985..............
UniRef90_A0A142H998/3-70 ---KKAAESHKHAATHHAEAAKHHTEAAKHHEAG-SHEKAAHHAHTAAAHTAHATEHATHARK-AHA-EEHGTK--
#=GR UniRef90_A0A142H998/3-70 PP ...6899***9***********************.****************************.***.*99865..
UniRef90_UPI00031CAACE/3-70 ---KKAAESHKKAAESHKKAGEHHEQAAKHHEAG-HHEKAAHHAHTAKGHQTQAEKHGNDAAT-QHA-EDHGSK--
#=GR UniRef90_UPI00031CAACE/3-70 PP ...689****************************.****************************.99*.999865..
UniRef90_S9SB59/5-73 MATLKANEHHAAAAAHSESAAQHHKEAAKQFDSG-HHEKAAHHAQVAAGHSAHATEHATEATK-KYA-EQHS----
#=GR UniRef90_S9SB59/5-73 PP 6778999***************************.****************************.***.*997....
UniRef90_I0IMJ9/13-73 ----KPQEHHKEAAQHHEEAAKHHKEASKMYEAG-DHKTAAHHAHSATGHASSAEEHQNEASR-KHA---------
#=GR UniRef90_I0IMJ9/13-73 PP ....6789**************************.***********************99987.655.........
UniRef90_A0A2Z3R562/2-66 ---HEGAEHHKNAAKHHTEAAKHHTEAAKHHDAG-QHEKAAHHAHLAHAHGTHAAHHHEEAAK-YYA-AHH-----
#=GR UniRef90_A0A2Z3R562/2-66 PP ...9******************************.****************************.***.**9.....
#=GC PP_cons 7877889***************************.***************************9.999.99876679
#=GC RF xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.xxxxxxxxxxxxxxxxxxxxxxxxxxxx.xxx.xxxxxxxx
//
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment