BLASTP 2.2.18 [Mar-02-2008]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= PGN_2056 hypothetical protein
(351 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
6,515,104 sequences; 2,222,278,849 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|34541649|ref|NP_906128.1| lipoprotein, putative [Porphyr... 729 0.0
gi|150006854|ref|YP_001301597.1| hypothetical protein BDI_0... 225 4e-57
gi|150008797|ref|YP_001303540.1| hypothetical protein BDI_2... 211 6e-53
gi|150005132|ref|YP_001299876.1| hypothetical protein BVU_2... 201 5e-50
gi|154491021|ref|ZP_02030962.1| hypothetical protein PARMER... 196 2e-48
gi|153809259|ref|ZP_01961927.1| hypothetical protein BACCAC... 195 3e-48
gi|154494429|ref|ZP_02033749.1| hypothetical protein PARMER... 188 6e-46
gi|53712371|ref|YP_098363.1| hypothetical protein BF1079 [B... 187 7e-46
gi|160887189|ref|ZP_02068192.1| hypothetical protein BACOVA... 185 5e-45
gi|29349781|ref|NP_813284.1| hypothetical protein BT_4373 [... 184 1e-44
gi|167765102|ref|ZP_02437215.1| hypothetical protein BACSTE... 182 3e-44
gi|160892218|ref|ZP_02073221.1| hypothetical protein BACUNI... 176 2e-42
gi|167754135|ref|ZP_02426262.1| hypothetical protein ALIPUT... 126 2e-27
gi|120437616|ref|YP_863302.1| hypothetical protein GFO_3292... 112 6e-23
gi|86144110|ref|ZP_01062447.1| hypothetical protein MED217_... 111 6e-23
gi|163789256|ref|ZP_02183698.1| hypothetical protein FBALC1... 106 2e-21
gi|83856082|ref|ZP_00949611.1| hypothetical protein CA2559_... 105 6e-21
gi|163754588|ref|ZP_02161710.1| hypothetical protein KAOT1_... 103 2e-20
gi|86135130|ref|ZP_01053712.1| hypothetical protein MED152_... 100 1e-19
gi|149369365|ref|ZP_01889217.1| hypothetical protein SCB49_... 100 2e-19
gi|126662709|ref|ZP_01733708.1| hypothetical protein FBBAL3... 99 3e-19
gi|88801997|ref|ZP_01117525.1| hypothetical protein PI23P_0... 97 2e-18
gi|126648900|ref|ZP_01721383.1| hypothetical protein ALPR1_... 92 4e-17
gi|88711616|ref|ZP_01105704.1| hypothetical protein FB2170_... 91 1e-16
gi|88804536|ref|ZP_01120056.1| hypothetical protein RB2501_... 85 7e-15
gi|89889749|ref|ZP_01201260.1| conserved hypothetical prote... 85 8e-15
gi|91217050|ref|ZP_01254013.1| hypothetical protein P700755... 85 8e-15
gi|86131113|ref|ZP_01049712.1| hypothetical protein MED134_... 84 2e-14
gi|146298736|ref|YP_001193327.1| hypothetical protein Fjoh_... 80 2e-13
gi|124002673|ref|ZP_01687525.1| hypothetical protein M23134... 79 4e-13
gi|83815026|ref|YP_445790.1| hypothetical protein SRU_1671 ... 70 2e-10
>gi|34541649|ref|NP_906128.1| lipoprotein, putative [Porphyromonas gingivalis W83]
gi|34397967|gb|AAQ67027.1| lipoprotein, putative [Porphyromonas gingivalis W83]
Length = 351
Score = 729 bits (1883), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/351 (99%), Positives = 349/351 (99%)
Query: 1 MKRNTRLLFVVPAFFAVFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAI 60
MKRNTRLLFVVPAFFA+FAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAI
Sbjct: 1 MKRNTRLLFVVPAFFAIFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAI 60
Query: 61 YVAMTEVAPALNQNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQW 120
Y AMTEVAPALNQNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQW
Sbjct: 61 YEAMTEVAPALNQNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQW 120
Query: 121 ANGQLVVLINSPATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMF 180
ANGQLVVLINSPATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMF
Sbjct: 121 ANGQLVVLINSPATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMF 180
Query: 181 GYHINVPADILSHKEGKDFLWMSNNQMRRRKDFLVYTFPYREASDLGLDRMVEVRDSVLK 240
GYHINVPADILSHKEGKDFLWMSNNQMRRRKDFLVYTFPYR ASDLGLDRMVEVRDSVLK
Sbjct: 181 GYHINVPADILSHKEGKDFLWMSNNQMRRRKDFLVYTFPYRGASDLGLDRMVEVRDSVLK 240
Query: 241 ANIEGSDAGSYPVTERMLLKHRYVYPGDQKRGEVHGIWRMEGGQMMSGPFVAQAFVDGDR 300
ANIEGSDAGSYPVTERMLLKHRYVYPGDQKRGEVHGIWRMEGGQMMSGPFVAQAFVDGDR
Sbjct: 241 ANIEGSDAGSYPVTERMLLKHRYVYPGDQKRGEVHGIWRMEGGQMMSGPFVAQAFVDGDR 300
Query: 301 VVVVEGFVYHPNEKKRDLMRQMEAALYSMRPKAEKEFDPSIIRKARWTTLK 351
VVVVEGFVYHPNEKKRDLMRQMEAALYSMRPKAEKEFDPSIIRKARWTTLK
Sbjct: 301 VVVVEGFVYHPNEKKRDLMRQMEAALYSMRPKAEKEFDPSIIRKARWTTLK 351
>gi|150006854|ref|YP_001301597.1| hypothetical protein BDI_0186 [Parabacteroides distasonis ATCC
8503]
gi|149935278|gb|ABR41975.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 345
Score = 225 bits (573), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 127/333 (38%), Positives = 190/333 (57%), Gaps = 9/333 (2%)
Query: 5 TRLLFVVPAFFAVFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAM 64
TR LF+ + + AC G+ T +TG E+++VMD + P GEAI +
Sbjct: 3 TRSLFI--GMLLLLVLMGACNSGSV---MTQATGFAYEVVVVMDQKDWKGPAGEAIKSEL 57
Query: 65 TEVAPALNQNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQWANGQ 124
P L Q EP + ++ V KDFTG LRY+RN+L + I+P +TK S+ Y + WA GQ
Sbjct: 58 ASSVPGLPQAEPSLKITYVQPKDFTGLLRYVRNILVVTIDPTAYTKVSVGYENNVWARGQ 117
Query: 125 LVVLINSPATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGYHI 184
+VV + +P T+S+ Y ++++ A+ N F++ E+ + QK+ S ++ +
Sbjct: 118 VVVSLKAPNTESIVEYSKTHEKALVNFFVKVEMNRAIEQLQKEYSMVVMDNLKSDLNVML 177
Query: 185 NVPADILSHKEGKDFLWMSNNQMRRRKDFLVYTFPYREASDLGLDRMVEVRDSVLKANIE 244
N PA+ +K+ DF W SNN R D +VYTFPY + + + +V RDSVLKAN+
Sbjct: 178 NAPANFTYYKDTTDFFWSSNNANTGRMDLIVYTFPYTDPNTFTEEYLVAKRDSVLKANLP 237
Query: 245 GSDAGSYPVTE-RMLLKHRYVYPGDQKRGEVHGIWRMEGGQMMSGPFVAQAFVD--GDRV 301
GS GSY TE R +++ + + G + G+WRM+ G MM GPFV+ +D RV
Sbjct: 238 GSFPGSYMQTETRAGVEYTPITLNGKYCGVMRGLWRMQ-GDMMGGPFVSHTRLDEKNHRV 296
Query: 302 VVVEGFVYHPNEKKRDLMRQMEAALYSMRPKAE 334
VV EGFVY P KR+ MR++EAAL+++R E
Sbjct: 297 VVAEGFVYAPETDKRNFMRRIEAALFTLRLPGE 329
>gi|150008797|ref|YP_001303540.1| hypothetical protein BDI_2187 [Parabacteroides distasonis ATCC
8503]
gi|149937221|gb|ABR43918.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 334
Score = 211 bits (537), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 123/335 (36%), Positives = 188/335 (56%), Gaps = 9/335 (2%)
Query: 1 MKRNTRLLFVVPAFFAVFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAI 60
M+ T LL++ + + AC + S +TG P E+++VMD VGE +
Sbjct: 1 MRTQTVLLWISLSLMVGLS-ACDWTSSSPVASLRRATGWPFEVVVVMDRDAWNGEVGELL 59
Query: 61 YVAMTEVAPALNQNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQW 120
+ PAL Q E M V+ F+G LRY+RN+L + ++P+ +TK + D+W
Sbjct: 60 QEQLRAPIPALPQAEASMRVTYSEPSGFSGLLRYVRNILIVSVDPNRYTKAGTRESPDEW 119
Query: 121 ANGQLVVLINSPATDSLTAYVESNKSAIQNLF--IRHELFLFGQLWQKDCSADADQRVQR 178
A+GQ VV + +P T +LT ++ N+ + N F + E ++ G K S+ +++++
Sbjct: 120 ASGQNVVTLYAPTTAALTEFLLLNEGRLVNHFNRVERERWIVGL--SKSHSSWINEKLKS 177
Query: 179 MFGYHINVPADILSHKEGKDFLWMSNNQMRRRKDFLVYTFPYREASDLGLDRMVEVRDSV 238
FG + VP D+ ++KE DF W SNN R R D LVY+FPY D +V +RDSV
Sbjct: 178 RFGISLFVPEDMSAYKEETDFFWGSNNANRGRTDILVYSFPYTAQEAFTEDYLVAMRDSV 237
Query: 239 LKANIEGSDAGSYPVTE-RMLLKHRYVYPGDQKRGEVHGIWRMEGGQMMSGPFVAQAFVD 297
++ N+ GS SY TE R L + + + RG + G+WRMEG + M GPFV+ A +D
Sbjct: 238 VRRNVPGSFPNSYMKTEKRFELSYEPITYRGEYRGVLRGLWRMEGDK-MGGPFVSHALLD 296
Query: 298 GD--RVVVVEGFVYHPNEKKRDLMRQMEAALYSMR 330
RV+V E FVY P KK +L+R+MEAAL+++R
Sbjct: 297 KKKGRVIVAESFVYAPESKKANLIRRMEAALFTLR 331
>gi|150005132|ref|YP_001299876.1| hypothetical protein BVU_2599 [Bacteroides vulgatus ATCC 8482]
gi|149933556|gb|ABR40254.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 354
Score = 201 bits (512), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 112/323 (34%), Positives = 174/323 (53%), Gaps = 4/323 (1%)
Query: 22 CACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMTEVAPALNQNEPIMAVS 81
+C+ G S+G P E+++V++ P G A++ + P L Q E +S
Sbjct: 18 ASCKKGQKNLFTPTSSGRPYEVLVVVNKPVWDRPAGRALFDVLDTNVPGLPQAERSFRIS 77
Query: 82 RVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQWANGQLVVLINSPATDSLTAYV 141
V + F L+ RN++ +DI DI+T+ LK+ D +A+ Q+++ I +P D +V
Sbjct: 78 NVDPQHFDRVLKIFRNIIIVDIQ-DIYTQPKLKFSRDVYASPQMIMTIQAPNEDEFEEFV 136
Query: 142 ESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGYHINVPADILSHKEGKDFLW 201
N I + F++ E+ L ++ S +V +F I VP ++ ++KEGKDFLW
Sbjct: 137 AKNSKVIIDFFVKAEMNRQITLLKQKHSDVISTKVGSIFDCDIWVPVELANYKEGKDFLW 196
Query: 202 MSNNQMRRRKDFLVYTFPYREASDLGLDRMVEVRDSVLKANIEGSDAGSYPVTERMLLKH 261
S N+ +F++Y++PY + D + RDSV+KANI G G Y T+ M +
Sbjct: 197 ASTNRATADMNFVMYSYPYTDKDTFTKDYFIHKRDSVMKANIPGEREGMYMATDSMFVDV 256
Query: 262 RYVYPGDQKRGEVHGIWRMEGGQMMSGPFVAQAFVD--GDRVVVVEGFVYHPNEKKRDLM 319
+ + E G+W ME G MM GPFV+ A VD RVVVVE F+Y P++ KR+LM
Sbjct: 257 EDIVVKGEYAQEARGLWEME-GDMMGGPFVSHARVDRANGRVVVVEAFIYSPDKLKRNLM 315
Query: 320 RQMEAALYSMRPKAEKEFDPSII 342
RQMEA+LY++R E D +I
Sbjct: 316 RQMEASLYTLRLPNESLIDEIVI 338
>gi|154491021|ref|ZP_02030962.1| hypothetical protein PARMER_00938 [Parabacteroides merdae ATCC
43184]
gi|154088769|gb|EDN87813.1| hypothetical protein PARMER_00938 [Parabacteroides merdae ATCC
43184]
Length = 332
Score = 196 bits (498), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 112/317 (35%), Positives = 174/317 (54%), Gaps = 8/317 (2%)
Query: 23 ACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMTEVAPALNQNEPIMAVSR 82
AC G + N+TG E+++ MD +P G+AI +T P L Q EP ++
Sbjct: 8 ACSSGPVMR---NATGFAYEIVVTMDKADWDAPAGKAIKAELTSDIPGLPQAEPAFKITY 64
Query: 83 VSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQWANGQLVVLINSPATDSLTAYVE 142
V+ F G L Y+RNVL + I+ +TK SL Y ++WA GQ+V+ + +P ++ YV+
Sbjct: 65 VTPDQFNGLLTYVRNVLIVRIDKSQYTKVSLNYENNRWAKGQVVMTLTAPDDAAILEYVK 124
Query: 143 SNKSAIQNLFIRHELF-LFGQLWQKDCSADADQRVQRMFGYHINVPADILSHKEGKDFLW 201
++ I F + E GQL +K+ S V+ F ++ PA++ ++ F W
Sbjct: 125 AHPRNIVEFFTKCERNRTIGQL-EKEHSPVVMDHVKDRFNVMLSAPANMTYFRDTTGFFW 183
Query: 202 MSNNQMRRRKDFLVYTFPYREASDLGLDRMVEVRDSVLKANIEGSDAGSYPVTERMLLKH 261
SNN R D +VY FPY++A D ++ RDSV+K N+ G+ GSY T+ + +
Sbjct: 184 ASNNANTGRTDIVVYDFPYKDADTFTADYLIAKRDSVMKLNMPGTFPGSYMATDTAWVTY 243
Query: 262 RYVYPGDQKRGEVHGIWRMEGGQMMSGPFVAQAFVD--GDRVVVVEGFVYHPNEKKRDLM 319
+ G + G+W M+ G MM GPFV+ A +D +RVVVVEGFV+ P KR+ +
Sbjct: 244 NPTTVNGKYCGVLRGLWHMK-GDMMGGPFVSHARLDEKNNRVVVVEGFVFAPETDKRNFI 302
Query: 320 RQMEAALYSMRPKAEKE 336
R++E+ALY++R E E
Sbjct: 303 RRIESALYTLRLPGEFE 319
>gi|153809259|ref|ZP_01961927.1| hypothetical protein BACCAC_03571 [Bacteroides caccae ATCC 43185]
gi|149128029|gb|EDM19250.1| hypothetical protein BACCAC_03571 [Bacteroides caccae ATCC 43185]
Length = 346
Score = 195 bits (496), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 108/320 (33%), Positives = 172/320 (53%), Gaps = 10/320 (3%)
Query: 13 AFFAVFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMTEVAPALN 72
A A +C +G S+G E+++V+D G A++ + P L
Sbjct: 11 ALIAFVFASCGLKGNHT------SSGRAYELLVVVDHGVWDRAAGRALHDVLDSDMPGLP 64
Query: 73 QNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQWANGQLVVLINSP 132
Q+EP + S KD+ L+ IRN++ +DI DI+TK S KY D +AN Q+++ I +P
Sbjct: 65 QSEPSFRIMYTSPKDYDSTLKLIRNIIIVDIK-DIYTKASFKYAKDVYANPQMILTIQAP 123
Query: 133 ATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGYHINVPADILS 192
+ +VE NK I + F R E+ + ++ S Q+V +FG I +PA++ +
Sbjct: 124 NEEEFQKFVEENKQTIVDFFTRAEMNRQISMLEEKHSNFISQKVDSLFGCDIWLPAELAN 183
Query: 193 HKEGKDFLWMSNNQMRRRKDFLVYTFPYREASDLGLDRMVEVRDSVLKANIEGSDAGSYP 252
K GKDF W S N ++F++Y++PY + + V RDSV+KANI G G Y
Sbjct: 184 SKTGKDFFWASTNTGTADRNFVMYSYPYTDKDTFTKEYFVHKRDSVMKANIPGFKEGVYM 243
Query: 253 VTERMLLKHRYVYPGDQKRGEVHGIWRMEGGQMMSGPFVAQAFVD--GDRVVVVEGFVYH 310
T+ +L R + + E G+WRM+ G M GP+V+ +D R++ E FVY
Sbjct: 244 STDSLLTDVRPINVQNSYTMEARGLWRMK-GDFMGGPYVSHTRLDEKNQRIITAEIFVYS 302
Query: 311 PNEKKRDLMRQMEAALYSMR 330
P++ KR+L+RQMEA+LY+++
Sbjct: 303 PDKMKRNLVRQMEASLYTLK 322
>gi|154494429|ref|ZP_02033749.1| hypothetical protein PARMER_03784 [Parabacteroides merdae ATCC
43184]
gi|154085873|gb|EDN84918.1| hypothetical protein PARMER_03784 [Parabacteroides merdae ATCC
43184]
Length = 334
Score = 188 bits (477), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 105/296 (35%), Positives = 159/296 (53%), Gaps = 4/296 (1%)
Query: 36 STGSPGEMMLVMDDQYLGSPVGEAIYVAMTEVAPALNQNEPIMAVSRVSEKDFTGFLRYI 95
+TG E+++VMD G +Y +T P L QNEP M V+ F G L Y+
Sbjct: 36 ATGFAYEVLVVMDKDAWKGEAGRLLYDQLTAPIPGLPQNEPAMRVTYAEPFQFNGLLHYV 95
Query: 96 RNVLQIDINPDIFTKTSLKYGYDQWANGQLVVLINSPATDSLTAYVESNKSAIQNLFIRH 155
RN++ + I+ ++TK S+ D+WA GQ VV +N+P++ L Y+E +++
Sbjct: 96 RNIIHVRIDESLYTKVSVHKEKDRWATGQEVVTLNAPSSRILAEYLEKQGTSLVAWLGEK 155
Query: 156 ELFLFGQLWQKDCSADADQRVQRMFGYHINVPADILSHKEGKDFLWMSNNQMRRRKDFLV 215
E + S + +V+ F + P ++ S+K+ DF W++++ R R D +V
Sbjct: 156 ERERQADYLESSHSVWVNDKVRARFNAQLYAPEEMCSYKDTADFFWVTDHGTRGRIDMVV 215
Query: 216 YTFPYREASDLGLDRMVEVRDSVLKANIEGSDAGSYPVTE-RMLLKHRYVYPGDQKRGEV 274
Y+FPY LD +V +RDSVL +I+G+ GSY TE R + + + G V
Sbjct: 216 YSFPYVSGRTFTLDYLVAMRDSVLGEHIQGAFPGSYMTTEKRFTPSYEAISKNGEYCGVV 275
Query: 275 HGIWRMEGGQMMSGPFVAQAFVD--GDRVVVVEGFVYHPNEKKRDLMRQMEAALYS 328
G+W ME G MM GPFV+ A +D RVVV E FVY P KK L+R+ EA+LY+
Sbjct: 276 RGLWEME-GDMMGGPFVSHAHLDEKNQRVVVAETFVYAPGTKKAKLLRRGEASLYT 330
>gi|53712371|ref|YP_098363.1| hypothetical protein BF1079 [Bacteroides fragilis YCH46]
gi|60680540|ref|YP_210684.1| hypothetical protein BF0996 [Bacteroides fragilis NCTC 9343]
gi|52215236|dbj|BAD47829.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
gi|60491974|emb|CAH06735.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343]
Length = 348
Score = 187 bits (476), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 105/331 (31%), Positives = 175/331 (52%), Gaps = 6/331 (1%)
Query: 7 LLFVVPAFFAVFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMTE 66
L ++ A AV + +C+ G S+G E+++V++ P G A+Y +
Sbjct: 5 LFYLSMALVAV--VLFSCKSGKKSVFTPTSSGRAYEVLVVVEKPVWERPAGRALYNVLDT 62
Query: 67 VAPALNQNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQWANGQLV 126
P L Q+E + S KDF L+ +RN++ +DI DI+T+ KY D +A+ Q++
Sbjct: 63 DVPGLPQSERSFRIMSTSPKDFDAILKLVRNIIIVDIQ-DIYTQPKFKYAKDVYASPQMI 121
Query: 127 VLINSPATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGYHINV 186
+ I +P S +VE NK I + F R E+ +K+ + +V MF I +
Sbjct: 122 LTIQAPDEASFEKFVEENKQPIIDFFTRAEMNRQISFLEKNHNDYISTKVGSMFNSDIWL 181
Query: 187 PADILSHKEGKDFLWMSNNQMRRRKDFLVYTFPYREASDLGLDRMVEVRDSVLKANIEGS 246
PA++ S K G+DF W N ++F++Y++PY + + + RDSV+KANI G+
Sbjct: 182 PAELSSSKTGEDFFWAGTNAATGDQNFVIYSYPYTDKDTFTKEFFIHKRDSVMKANIPGA 241
Query: 247 DAGSYPVTERMLLKHRYVYPGDQKRGEVHGIWRMEGGQMMSGPFVAQAFVD--GDRVVVV 304
G Y T+ ++ R + E G+WR++ G M GPFV+ +D R++
Sbjct: 242 KEGMYMATDSSTVEVRPIDIHGDYTMEARGLWRIK-GDFMGGPFVSHTRLDKASHRIITT 300
Query: 305 EGFVYHPNEKKRDLMRQMEAALYSMRPKAEK 335
E F+Y P++ KRDLMR++EA+LY+++ EK
Sbjct: 301 EVFIYSPDKMKRDLMRRLEASLYTLQLPTEK 331
>gi|160887189|ref|ZP_02068192.1| hypothetical protein BACOVA_05205 [Bacteroides ovatus ATCC 8483]
gi|156107600|gb|EDO09345.1| hypothetical protein BACOVA_05205 [Bacteroides ovatus ATCC 8483]
Length = 346
Score = 185 bits (469), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 111/332 (33%), Positives = 176/332 (53%), Gaps = 12/332 (3%)
Query: 1 MKRNTRLLFVVPAFFAVFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAI 60
MK+ + +L + A A +C +G S+G E+++V+D G A+
Sbjct: 1 MKKYSFILCI--ALVAFVVASCGLKGNHT------SSGRAYELLVVVDHGVWDRAAGRAL 52
Query: 61 YVAMTEVAPALNQNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQW 120
+ A+ P L Q+EP + S KD+ L+ IRN++ +DI DI+TK S KY D +
Sbjct: 53 HDALDADMPGLPQSEPSFRIMYTSPKDYDSTLKLIRNIIIVDIQ-DIYTKASFKYAKDVY 111
Query: 121 ANGQLVVLINSPATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMF 180
AN Q+++ I +P + +VE NK I + F R E+ + S Q+V +F
Sbjct: 112 ANPQMILTIQAPNEEEFEKFVEENKKTIVDFFTRAEMNRQITFLEGKHSNFISQKVDSLF 171
Query: 181 GYHINVPADILSHKEGKDFLWMSNNQMRRRKDFLVYTFPYREASDLGLDRMVEVRDSVLK 240
G I V A++ + K G DF W S N ++F++Y++PY + + V RDSV+K
Sbjct: 172 GCDIWVDAELANSKTGDDFFWASTNTGSADRNFVMYSYPYTDKDTFTKEYFVHKRDSVMK 231
Query: 241 ANIEGSDAGSYPVTERMLLKHRYVYPGDQKRGEVHGIWRMEGGQMMSGPFVAQAFVD--G 298
ANI G G Y T+ +L R + + E G+WRM+ G M GP+V+ +D
Sbjct: 232 ANIPGFKEGVYMSTDSLLTDVRPINVQNSYTMEARGLWRMK-GDFMGGPYVSHTRLDEKN 290
Query: 299 DRVVVVEGFVYHPNEKKRDLMRQMEAALYSMR 330
R++ E FVY P++ KR+L+RQMEA+LY+++
Sbjct: 291 QRIITAEIFVYSPDKMKRNLVRQMEASLYTLK 322
>gi|29349781|ref|NP_813284.1| hypothetical protein BT_4373 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29341692|gb|AAO79478.1| putative lipoprotein [Bacteroides thetaiotaomicron VPI-5482]
Length = 345
Score = 184 bits (466), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 105/326 (32%), Positives = 174/326 (53%), Gaps = 10/326 (3%)
Query: 7 LLFVVPAFFAVFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMTE 66
++ A A +C +G S+G P E+++V+D G A++ +
Sbjct: 5 FFYLSMALVAFVVASCGLKGNHT------SSGRPYELLVVVDAGVWDRAAGRALHDVLDS 58
Query: 67 VAPALNQNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQWANGQLV 126
P L Q+EP + S KD+ L+ IRN++ +DI DI+TK + KY D +A+ Q++
Sbjct: 59 DMPGLPQSEPSFRIMYTSPKDYDSTLKLIRNIVIVDIK-DIYTKGTFKYAKDVYASPQMI 117
Query: 127 VLINSPATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGYHINV 186
+ I +P + +VE NK I + F R E+ L ++ + +V +FG I +
Sbjct: 118 LTIQAPNEEVFEKFVEENKQTIIDFFTRAEMNRQITLLEEKHNNFISNKVDSLFGCDIWI 177
Query: 187 PADILSHKEGKDFLWMSNNQMRRRKDFLVYTFPYREASDLGLDRMVEVRDSVLKANIEGS 246
P+++ + K G+DF W S N ++F++Y++PY + + V RDSV+KANI G
Sbjct: 178 PSELNNSKTGEDFFWASTNTGSADRNFVMYSYPYTDKDTFTKEYFVHKRDSVMKANIPGY 237
Query: 247 DAGSYPVTERMLLKHRYVYPGDQKRGEVHGIWRMEGGQMMSGPFVAQAFVD--GDRVVVV 304
G Y T+ +L R + + E G+WRM+ G M GPFV+ +D R++
Sbjct: 238 KEGVYMSTDSLLTDVRPINVHNDYTMEARGLWRMK-GDFMGGPFVSHTRLDQKNQRIITA 296
Query: 305 EGFVYHPNEKKRDLMRQMEAALYSMR 330
E FVY P++ KR+L+RQMEA+LY+++
Sbjct: 297 EIFVYSPDKMKRNLVRQMEASLYTLK 322
>gi|167765102|ref|ZP_02437215.1| hypothetical protein BACSTE_03488 [Bacteroides stercoris ATCC
43183]
gi|167696730|gb|EDS13309.1| hypothetical protein BACSTE_03488 [Bacteroides stercoris ATCC
43183]
Length = 352
Score = 182 bits (463), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 98/323 (30%), Positives = 174/323 (53%), Gaps = 4/323 (1%)
Query: 17 VFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMTEVAPALNQNEP 76
+ A+ AC G G S+G E+++V+D P G A+Y + P L Q+E
Sbjct: 13 ITALFSACGNGKKGVFTPTSSGRAYEILVVIDQGLWERPAGRALYDVLDSDVPGLPQSER 72
Query: 77 IMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQWANGQLVVLINSPATDS 136
+ S ++ L+ IRN++ +++N D++T+ KY + +A Q ++ I +P S
Sbjct: 73 SFRIMYTSPANYDSTLKLIRNIIIVEVNKDLYTQPKFKYAKNVYAAPQSILTIQAPDEAS 132
Query: 137 LTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGYHINVPADILSHKEG 196
+VE N+ I + F R E+ + + S +V+ MF + VP ++ + K+G
Sbjct: 133 FEKFVEENRQVIVDFFTRAEMNRQIAVLENKHSDYVSTKVKSMFDCDVWVPGELTATKQG 192
Query: 197 KDFLWMSNNQMRRRKDFLVYTFPYREASDLGLDRMVEVRDSVLKANIEGSDAGSYPVTER 256
++F W N ++F++Y++PY + + V RDSV+K NI G+ G Y T+
Sbjct: 193 ENFFWAGTNAATGDQNFVIYSYPYTDKDTFTKEYFVHKRDSVMKINIPGAREGMYMETDS 252
Query: 257 MLLKHRYVYPGDQKRGEVHGIWRMEGGQMMSGPFVAQAFVD--GDRVVVVEGFVYHPNEK 314
++ R + D+ E G+WR++ G M GP+V+ +D R++V E FVY P++
Sbjct: 253 LMTDVRPINVQDEYALEARGLWRVK-GDFMGGPYVSHVRLDKVNQRIIVSEIFVYSPDKM 311
Query: 315 KRDLMRQMEAALYSMR-PKAEKE 336
KR+L+RQMEA+LY+++ P +++E
Sbjct: 312 KRNLVRQMEASLYTLKLPGSKQE 334
>gi|160892218|ref|ZP_02073221.1| hypothetical protein BACUNI_04682 [Bacteroides uniformis ATCC 8492]
gi|156858696|gb|EDO52127.1| hypothetical protein BACUNI_04682 [Bacteroides uniformis ATCC 8492]
Length = 351
Score = 176 bits (446), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 95/322 (29%), Positives = 167/322 (51%), Gaps = 3/322 (0%)
Query: 17 VFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMTEVAPALNQNEP 76
V + AC GG G S+G E+++V++ P G A++ + P L Q+E
Sbjct: 13 VLTLFSACGGGKKGVFTPTSSGRAYEVLVVVEPGLWERPAGRALFDVLDSDVPGLPQSER 72
Query: 77 IMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQWANGQLVVLINSPATDS 136
+ + ++ L+ IRN++ +D+N D++T+ KY + +A Q ++ I +P S
Sbjct: 73 SFRIMYTAPSNYDATLKLIRNIIIVDVNKDLYTQPKFKYARNVYAAPQSILTIQAPDEAS 132
Query: 137 LTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGYHINVPADILSHKEG 196
+VE N+ I + F E+ + + S +V+ F + VP ++ S KEG
Sbjct: 133 FEKFVEENRQVIIDFFTHAEMNRQISVLKDKHSDYIATKVKSQFDCDVWVPGELTSTKEG 192
Query: 197 KDFLWMSNNQMRRRKDFLVYTFPYREASDLGLDRMVEVRDSVLKANIEGSDAGSYPVTER 256
++F W N ++F++Y++PY + + V RDSV+K NI G+ G Y T+
Sbjct: 193 ENFFWAGTNAATGDQNFVIYSYPYTDKDTFTKEYFVHKRDSVMKINIPGAREGMYMSTDS 252
Query: 257 MLLKHRYVYPGDQKRGEVHGIWRMEGGQMMSGPFVAQAFVD--GDRVVVVEGFVYHPNEK 314
++ R + E G+WR++ G M GPFV+ +D R++V E F+Y P++
Sbjct: 253 LMTDVRPISVQGDYALEARGLWRIK-GDFMGGPFVSHVRLDKANQRIIVSEIFIYSPDKM 311
Query: 315 KRDLMRQMEAALYSMRPKAEKE 336
KR+L+RQMEA+LY+++ K+
Sbjct: 312 KRNLVRQMEASLYTLKLPGSKQ 333
>gi|167754135|ref|ZP_02426262.1| hypothetical protein ALIPUT_02428 [Alistipes putredinis DSM 17216]
gi|167658760|gb|EDS02890.1| hypothetical protein ALIPUT_02428 [Alistipes putredinis DSM 17216]
Length = 369
Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 81/296 (27%), Positives = 145/296 (48%), Gaps = 6/296 (2%)
Query: 38 GSPGEMMLVMDDQYLGSPVGEAIYVAMTEVAPALNQNEPIMAVSRVSEKDFTGFLRYIRN 97
G+P E+++V + Q S +G+ + + P LN+ EP+ V + E F G + RN
Sbjct: 36 GAPYELIVVCNQQLWESELGDTLRSVLLAPIPYLNEREPLFNVMQFPESGFKGTVVDYRN 95
Query: 98 VLQIDINPDIFTKTSLKYGYDQWANGQLVVLINSPATDSLTAYVESNKSAIQNLFIRHEL 157
+L++ ++P + S YD A+ Q+VV + P ++LT YV +++ + F E
Sbjct: 96 ILKVLVDPSL-AAASAGVQYDVTASPQVVVTLQGPTREALTQYVAEHRAELVQTFEGAER 154
Query: 158 FLFGQLWQKDCSADADQRVQRMFGYHINVPADILSHKEGKDFLWMSNNQMRRRKDFLVYT 217
K ++ + V+ FG + +P + ++ DFLW + F++Y+
Sbjct: 155 DRSIAFANKFNQTESGKLVREKFGVEMKIPKGYILAQQSDDFLWFRYEFPAASQGFMLYS 214
Query: 218 FPYREASDLGLDRMVEVRDSVLKANIEGSDAGSYPVTERMLL-KHRYVYPGDQKRGEVHG 276
+PY L + ++ R+ A I G GSY T ++ ++R+ + E+ G
Sbjct: 215 YPYEGKQSLSEEALLAARNR-FAARIPGPSDGSYMTTSKVFTPRYRFFRLEGRAWVEMRG 273
Query: 277 IWRMEGGQMMSGPFVAQAFVD--GDRVVVVEGFVYHPNEKKRDLMRQMEAALYSMR 330
W +E G M GPFV+ VD DRV ++ +VY P KR+ MR++E L+ ++
Sbjct: 274 FWDVE-GDFMGGPFVSYTTVDTATDRVFTLDCYVYSPKLPKRNYMREVEHLLHLVK 328
>gi|120437616|ref|YP_863302.1| hypothetical protein GFO_3292 [Gramella forsetii KT0803]
gi|117579766|emb|CAL68235.1| conserved hypothetical protein [Gramella forsetii KT0803]
Length = 328
Score = 112 bits (279), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 161/333 (48%), Gaps = 17/333 (5%)
Query: 6 RLLFVVPAFFAVFAMACACQGGTAGKS---FTNSTGSPGEMMLVMDDQYLGSPVGEAIYV 62
R L ++ + F + ++C ++ K+ T+S+G+ ++ +V++++ +GEAI
Sbjct: 3 RSLILLASIFLL--ISCKNDEKSSNKNDRILTDSSGNINQLTIVIENELWEGEIGEAIRT 60
Query: 63 AMTEVAPALNQNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQWAN 122
+ L Q EP+ ++S++ + F+GF+R R L+I+ + + +++A
Sbjct: 61 NLAAPVDGLPQEEPLFSLSQIPPETFSGFVRNSRIFLKIEEGGN----KGMDLLENEFAR 116
Query: 123 GQLVVLINSPATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGY 182
Q +++ ++ + + +I + + EL + +K S D + FG
Sbjct: 117 PQTGIVVQGESSSEIIEVINKESDSIIKILKKTELTEKQRRIRK--SLKKDDELSEKFGV 174
Query: 183 HINVPADILSHKEGKDFLWMSNNQMRRRKDFLVYTFPYR--EASDLGLDRMVEVRDSVLK 240
++ P+ K+ F W+ + + LVY P E + ++++RDS+ +
Sbjct: 175 NLKFPSAYRYAKQEDHFSWIRKEIPKGSMEILVYEVPINTIENDSSVIANIIKMRDSIGE 234
Query: 241 ANIEGSDAGSYPVTERMLLKHRYVYPGDQKRG-EVHGIWRMEGGQMMSGPFVAQAFVD-- 297
A I G GSY +TE+ + + D K E G W +E M+GPFV A D
Sbjct: 235 AQIPGRLEGSYMITEKAYAPYLFETKIDGKFAYETRGTWEVE-NDFMAGPFVNYAVKDEK 293
Query: 298 GDRVVVVEGFVYHPNEKKRDLMRQMEAALYSMR 330
DR +V+EGFV+ P+ KRD + +++A L S++
Sbjct: 294 NDRFIVLEGFVFSPSRGKRDNIFELDAILQSIK 326
>gi|86144110|ref|ZP_01062447.1| hypothetical protein MED217_18366 [Flavobacterium sp. MED217]
gi|85829372|gb|EAQ47837.1| hypothetical protein MED217_18366 [Leeuwenhoekiella blandensis
MED217]
Length = 325
Score = 111 bits (278), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 84/323 (26%), Positives = 157/323 (48%), Gaps = 18/323 (5%)
Query: 15 FAVFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMTEVAPALNQN 74
F +FA AC +GK + S+G+ + +VM + PVGE+I + L Q
Sbjct: 8 FVLFAF-IACDDKPSGKILSASSGALNNLTVVMPNDMWEGPVGESIREKLAGPVNGLPQV 66
Query: 75 EPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQWANGQLVVLINSPAT 134
EP+ A++++ + F+GF+R R L+I+ S++ D++A Q +L+ P
Sbjct: 67 EPMFAINQMPSEAFSGFMRKQRTFLKIEQA----DSASIEVVNDEYARPQTGILVKGPTE 122
Query: 135 DSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGYHINVPADILSHK 194
++ ++ + + I N E ++ + S D+ + FG + P+ K
Sbjct: 123 SAIINIIQQDSAKIVNTIKETE--FTEKIRRVSLSLKEDKPLTDAFGITMKFPSAYRYAK 180
Query: 195 EGKDFLWMSNNQMRRRKDFLVYTFPY----REASDLGLDRMVEVRDSVLKANIEGSDAGS 250
+ F W+ + + VY PY R+++ +G ++++RDS+ NI S+
Sbjct: 181 DDDGFFWIRKDIPNGDMNITVYEVPYNMIDRDSNTIG--SLIKMRDSIGGDNITVSEGMR 238
Query: 251 YPVTERMLLKHRYVYPGDQKRG-EVHGIWRMEGGQMMSGPFVAQAFVD--GDRVVVVEGF 307
+ +TE + D K ++ G+W ++ G+ M+GPFV D DR +V+EGF
Sbjct: 239 F-ITEAAFAPYLSETTIDGKPAYQMKGMWEVK-GRYMAGPFVNFTVDDKENDRYLVLEGF 296
Query: 308 VYHPNEKKRDLMRQMEAALYSMR 330
V+ P+ KRD + ++E+ L S++
Sbjct: 297 VFKPSANKRDNIFELESILRSVK 319
>gi|163789256|ref|ZP_02183698.1| hypothetical protein FBALC1_00225 [Flavobacteriales bacterium
ALC-1]
gi|159875471|gb|EDP69533.1| hypothetical protein FBALC1_00225 [Flavobacteriales bacterium
ALC-1]
Length = 325
Score = 106 bits (265), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 91/333 (27%), Positives = 151/333 (45%), Gaps = 15/333 (4%)
Query: 6 RLLFVVPAFFAVFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMT 65
R L+ V F V +AC Q S+G + +V+D+ VGE I
Sbjct: 2 RALYTV--LFCVLLLACGDQKSDGKIYLPESSGKLNTISVVVDNILWEDSVGENIRNIFA 59
Query: 66 EVAPALNQNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQWANGQL 125
L NEP+ + ++ + F GF R VL+I+ T+ S K + +A Q
Sbjct: 60 TPLNGLPTNEPMFNMKQIPPQVFDGFAARNRIVLKIEKED---TEASTKIVNEVFAKPQT 116
Query: 126 VVLINSPATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGYHIN 185
+V+++ + A ++ NK+ I + F + E+ +L K S AD+ ++ G I+
Sbjct: 117 LVVVSGKTNQEIIAQLQENKAKIVDAFNKEEVKEKLRLINK--SLLADEAMENKLGITID 174
Query: 186 VPADILSHKEGKDFLWMSNN-QMRRRKDFLVYTFPYREAS--DLGLDRMVEVRDSVLKAN 242
+P K DF W+ N + D ++Y P S D + +V++R+ + K
Sbjct: 175 IPWAYKISKADDDFFWLRKNFNNSKTIDLMLYEVPIETISKGDSTIVDIVKMRNIITKTK 234
Query: 243 IEGSDAGSYPVTERMLLKHRYVYPGDQKRG-EVHGIWRMEGGQMMSGPFVAQAFVD--GD 299
+ G D Y E + D K EV G+W+ME G M GPF+ D +
Sbjct: 235 VPGED-DIYMAVEDAYAPALFNTIIDNKPAYEVRGLWKME-GYTMGGPFITYVIEDKINN 292
Query: 300 RVVVVEGFVYHPNEKKRDLMRQMEAALYSMRPK 332
R +V +G+VY P+ KRDL+ ++E+ + S+ K
Sbjct: 293 RYLVGDGYVYAPSLDKRDLVFELESMIKSITIK 325
>gi|83856082|ref|ZP_00949611.1| hypothetical protein CA2559_03305 [Croceibacter atlanticus
HTCC2559]
gi|83849882|gb|EAP87750.1| hypothetical protein CA2559_03305 [Croceibacter atlanticus
HTCC2559]
Length = 323
Score = 105 bits (261), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 83/335 (24%), Positives = 154/335 (45%), Gaps = 20/335 (5%)
Query: 1 MKRNTRLLFVVPAFFAVFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAI 60
MK+ LL V+ F +C+ T S+G+ + +++++ VG+AI
Sbjct: 1 MKKFVLLLSVLATVF------TSCEDEKV--ILTTSSGNINNLSVIVENDLWKGEVGDAI 52
Query: 61 YVAMTEVAPALNQNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQW 120
+ L Q EP+ ++++ K F+GF+ R L+++ + I +LK +Q+
Sbjct: 53 RTNLAATVEGLPQEEPLFNMNQIPPKAFSGFVTKNRTYLKVEKSDSI----ALKIVRNQF 108
Query: 121 ANGQLVVLINSPATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMF 180
A Q V+ P S+ + + I +F + E+ + K S AD +++
Sbjct: 109 ARPQTGVIAKGPDAASIAKLINERGADIVAVFKQTEIKEKQRRINK--SLKADDVIKKKL 166
Query: 181 GYHINVPADILSHKEGKDFLWMSNNQMRRRKDFLVYTFPYRE--ASDLGLDRMVEVRDSV 238
G + P KE F W+ + + ++Y P ++ A + ++++RDS+
Sbjct: 167 GVSLKFPTAYRYAKEDDKFFWLRKDLKNGGMNIMIYEIPMQKINADTNVISSIIKMRDSI 226
Query: 239 LKANIEGSDAGSYPVTERMLLKHRYVYPGDQKRG-EVHGIWRMEGGQMMSGPFVAQAFVD 297
+A+I G GSY +TE + + D + E G W ++ M+GPF+ D
Sbjct: 227 GQAHIPGPVDGSYMITEAAYAPYLFESEIDGRFAYETKGTWEVKNA-FMAGPFLNYTIED 285
Query: 298 --GDRVVVVEGFVYHPNEKKRDLMRQMEAALYSMR 330
+R +V+EGF + P+ KRD M ++EA L S +
Sbjct: 286 KANNRYIVLEGFTFAPSAPKRDHMFELEAILKSAK 320
>gi|163754588|ref|ZP_02161710.1| hypothetical protein KAOT1_16873 [Kordia algicida OT-1]
gi|161325529|gb|EDP96856.1| hypothetical protein KAOT1_16873 [Kordia algicida OT-1]
Length = 328
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 83/342 (24%), Positives = 159/342 (46%), Gaps = 29/342 (8%)
Query: 1 MKRNTRLLFVVPAFFAVFAMACACQGG--TAGKSFTNSTGSPGEMMLVMDDQYLGSPVGE 58
MK+ L+FV+ F +C + A + S G + +V+D+ +G+
Sbjct: 1 MKKLYALIFVLALF-----SSCGDKNDKDVAERYKPQSAGKVNLLAVVIDNDQWKGKIGD 55
Query: 59 AIYVAMTEVAPALNQNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYD 118
+I L EP+ ++ +++ + F + R++L I+ + + + D
Sbjct: 56 SIRSIYGAPTDGLPMEEPLFSLKQINPEVFKDPITTYRSILIIESS----DEKKFRIKGD 111
Query: 119 QWANGQLVVLINSPATDSLTAYVESNKSAIQNL----FIRHELFLFGQLWQKDCSADADQ 174
+A Q +V I+ + + E+ +I+ L + + F+ G L + D+
Sbjct: 112 VYAKPQQIVYISGSDDEIIAQLTENAAKSIKTLKDNELLSKQQFIKGAL-------NKDK 164
Query: 175 RVQRMFGYHINVPADILSHKEGKDFLWMSNNQMRRRKDFLVYTFPYRE--ASDLGLDRMV 232
++ FG +N+P+ K+ +F+W + + YT PY +D +D ++
Sbjct: 165 SLEETFGISLNIPSVYKVVKKEDNFVWFERKVSNGTMNIIAYTLPYGSIPKNDTTIDAII 224
Query: 233 EVRDSVLKANIEGSDAGS-YPVTERMLLKHRYVYPGDQKRG-EVHGIWRMEGGQMMSGPF 290
++RDS+ +A + G D + Y +TE+ + D K E G+W + G QM +GPF
Sbjct: 225 KMRDSIGEAYVPGRDPETMYMITEKAYAPFLFEATIDNKFAFETKGMWEVHGFQM-AGPF 283
Query: 291 VAQAFVDGD--RVVVVEGFVYHPNEKKRDLMRQMEAALYSMR 330
+ D D R++V+EGF + P+E KRD M ++EA L S++
Sbjct: 284 INYVVDDKDHNRLMVIEGFAFAPSEDKRDFMFEIEAILKSLK 325
>gi|86135130|ref|ZP_01053712.1| hypothetical protein MED152_10460 [Tenacibaculum sp. MED152]
gi|85821993|gb|EAQ43140.1| hypothetical protein MED152_10460 [Polaribacter dokdonensis MED152]
Length = 326
Score = 100 bits (250), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 154/329 (46%), Gaps = 20/329 (6%)
Query: 14 FFAVFAMA---CACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMTEVAPA 70
F+VF ++ +C G S G ++M+V VG+ + + E+
Sbjct: 4 IFSVFVLSILLVSCVGNDK-MILKESIGRINKVMVVAKISDWTGDVGQEVRNSFGELMVG 62
Query: 71 LNQNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQWANGQLVVLIN 130
L Q EPI++VS+V+ F+ ++ RN+L I K++ + +A Q ++ +
Sbjct: 63 LPQAEPILSVSQVAPSGFSSMMKASRNILVISEGE----KSNFSVKRNVYAKPQTIIYVQ 118
Query: 131 SPATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGYHINVPADI 190
+ +++ K I+ F+ ++ +++KD S + ++++ G + +P
Sbjct: 119 GKDDKEIIEILQNRKEEIRKTFLDSDIAFTQGIFKKDASKQKYKTLEQL-GLSMTIPNKF 177
Query: 191 LSHKEGKDFLWMSNN-----QMRRRKDFLVYTFPYREASDLGLDRMVEVRDSVLKANIEG 245
+ + +FLW+ N+ + LVY+ P + + + D +V VR+++ K I G
Sbjct: 178 NTVDDTGEFLWLRNHLTSGIAKTGSNNILVYSVPLVDKTTVA-DSIVSVRNAIGKKYIPG 236
Query: 246 SDAGS-YPVTERMLLKHRYVYPGDQKRG-EVHGIWRMEGGQMMSGPFVAQAFVD--GDRV 301
SD Y +TE + D K+ E G W ++ M+GPF+ +D +R+
Sbjct: 237 SDPEKMYMITEEAYTPFTFDATIDGKKAYETRGKWEVK-NDFMAGPFLNYTVLDEANNRL 295
Query: 302 VVVEGFVYHPNEKKRDLMRQMEAALYSMR 330
+VVEGF Y P+ KR + ++EA S++
Sbjct: 296 IVVEGFTYAPSVNKRAFVFELEAIAKSLK 324
>gi|149369365|ref|ZP_01889217.1| hypothetical protein SCB49_06057 [unidentified eubacterium SCB49]
gi|149356792|gb|EDM45347.1| hypothetical protein SCB49_06057 [unidentified eubacterium SCB49]
Length = 325
Score = 100 bits (248), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 83/323 (25%), Positives = 149/323 (46%), Gaps = 19/323 (5%)
Query: 13 AFFAVFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMTEVAPALN 72
A F + +C T K S G + +V+D++ PVGE I L
Sbjct: 10 ALFVITLSSCDNNQKTT-KILKESVGKINGLQIVIDNELWNGPVGEIIRERFAAPTDGLP 68
Query: 73 QNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQWANGQLVVLINSP 132
Q+EP+ ++ +++ + FTGF R R L + + + + + +A+ Q+ ++ +
Sbjct: 69 QDEPLFSMRQMTPEAFTGFARTSRLFLHVSLG----NEEKVGLNNEVYASPQMGAVVEAL 124
Query: 133 ATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSAD---ADQRVQRMFGYHINVPAD 189
+ L +E N AI N F E+ + Q+ + V++M G + +P+
Sbjct: 125 TEEGLVKQLEENSDAIINSFKEAEI----KERQRRTNIQLLPVPSLVEKM-GVTMRLPSA 179
Query: 190 ILSHKEGKDFLWMSNNQMRRRKDFLVYTFPYREA-SDLGLDRMVEVRDSVLKANIEGSDA 248
+E DF W+ N + +VY P D + ++++RD+ + + +
Sbjct: 180 YRFVRENDDFFWLRKNLKSGSTNIIVYQVPLNTIRKDSAIADIIKMRDTTGSSLMPVEEG 239
Query: 249 GSYPVTERMLLKHRYVYPGDQK-RGEVHGIWRMEGGQMMSGPFVAQAFVD--GDRVVVVE 305
G + +TE + + D K E GIW ++ Q M+GPFV A +D DR +++E
Sbjct: 240 GLF-ITEGAYAPYLFETELDGKFTYETKGIWEVK-DQYMAGPFVNYAVMDEENDRWLILE 297
Query: 306 GFVYHPNEKKRDLMRQMEAALYS 328
GFVY P+E+KRDL ++ + L S
Sbjct: 298 GFVYAPSEEKRDLQFELISILKS 320
>gi|126662709|ref|ZP_01733708.1| hypothetical protein FBBAL38_05120 [Flavobacteria bacterium BAL38]
gi|126626088|gb|EAZ96777.1| hypothetical protein FBBAL38_05120 [Flavobacteria bacterium BAL38]
Length = 326
Score = 99.4 bits (246), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 81/337 (24%), Positives = 151/337 (44%), Gaps = 32/337 (9%)
Query: 7 LLFVVPAFFAVFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMTE 66
LF++ F ++C A + S G + +++DD +G++I
Sbjct: 6 FLFIIALSF----ISCKESAEEAKAVLSESNGKINNVSIIIDDNLWNGEIGDSIRKKFAA 61
Query: 67 VAPALNQNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGY----DQWAN 122
L Q EP+ +++ K F GF+R RN+ I K + G+ +++A
Sbjct: 62 PVDGLPQEEPLFTLNQYPTKVFEGFVRKSRNI--------IIVKKGKEAGFASNTNKYAK 113
Query: 123 GQLVVLINSPATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGY 182
Q V I+ T+ + +E + I I+ + Q+ K S +D ++Q+MFG
Sbjct: 114 PQNVFFISGTDTEDVLTILEQKSAEIIKT-IKASEIIENQVRMKK-SLISDAQIQKMFGV 171
Query: 183 HINV----PADILSHKEGKDFLWMSNNQMRRRKDFLVYTFPYR--EASDLGLDRMVEVRD 236
+ + D++ K F+W+ L+Y P E + + +RD
Sbjct: 172 SLKIGFGYKYDMVKDK----FIWLRKEFTSGYNSVLIYEVPISTVEKDTNIIANITAMRD 227
Query: 237 SVLKANIEGSDAGSYPVTERMLLKHRY-VYPGDQKRGEVHGIWRMEGGQMMSGPFVAQAF 295
+ KANI+G+ ++ +TE + + V +K G W ++ M+GPFV A
Sbjct: 228 EIGKANIQGTLPNTWMITEAAYAPYLFDVTIAGKKTYLTKGTWELK-NDFMAGPFVNYAI 286
Query: 296 VD--GDRVVVVEGFVYHPNEKKRDLMRQMEAALYSMR 330
D +R +++EGF Y+P++ KRD + ++EA + S++
Sbjct: 287 KDTKNNRYLILEGFTYNPSKSKRDWVFELEAIIQSVK 323
>gi|88801997|ref|ZP_01117525.1| hypothetical protein PI23P_05022 [Polaribacter irgensii 23-P]
gi|88782655|gb|EAR13832.1| hypothetical protein PI23P_05022 [Polaribacter irgensii 23-P]
Length = 292
Score = 96.7 bits (239), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 82/295 (27%), Positives = 134/295 (45%), Gaps = 37/295 (12%)
Query: 56 VGEAIYVAMTEVAPALNQNEPIMAVSRVSEKDFTGFLRYIRNVLQID--------INPDI 107
+G AI A E+ L Q EP ++VS+++ F RN+L I I ++
Sbjct: 13 LGTAIRSAFGELVVGLPQPEPTLSVSQIAPNAFGSMTNISRNILIIGEGEKEDFYIKKNV 72
Query: 108 FTKTSL---KYGYDQWANGQLVVLINSPATDSLTAYVESNKSAIQNLFIRHELFLFGQLW 164
+ K YG D + ++ + N + ++AY+ S+ QN+F +L L QL
Sbjct: 73 YAKPQTIIYVYGTD---DASIIKVFNEYKNEIISAYISSDILMTQNIFKAKQLAL-SQL- 127
Query: 165 QKDCSADADQRVQRMFGYHINVPADILSHKEGKDFLWM-----SNNQMRRRKDFLVYTFP 219
+ G +P + + K+ DFLW+ S + LVY+ P
Sbjct: 128 ----------KTLEEIGVSFTIPEEFKTVKDTGDFLWLRQHLTSGIAKTGSNNILVYSIP 177
Query: 220 YREASDLGLDRMVEVRDSVLKANIEGSDAGS-YPVTERMLLKHRYVYPGDQKRG-EVHGI 277
E S + + ++ +R+++ K +I GSDA + Y TE D K+ E G
Sbjct: 178 LEEESKVS-ENIIPIRNAIGKKHIPGSDAETMYMTTEEAYTPQTLETVVDGKKAFETRGK 236
Query: 278 WRMEGGQMMSGPFVAQAFVDG--DRVVVVEGFVYHPNEKKRDLMRQMEAALYSMR 330
W ++ M+GPFV+ VD +R+VV EGF Y P+ KR + ++EA SM+
Sbjct: 237 WEVKN-DFMAGPFVSYFVVDKKHNRIVVFEGFTYAPSVNKRASLFELEAIAKSMK 290
>gi|126648900|ref|ZP_01721383.1| hypothetical protein ALPR1_17168 [Algoriphagus sp. PR1]
gi|126575042|gb|EAZ79400.1| hypothetical protein ALPR1_17168 [Algoriphagus sp. PR1]
Length = 365
Score = 92.4 bits (228), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 87/350 (24%), Positives = 144/350 (41%), Gaps = 31/350 (8%)
Query: 7 LLFVVPAF-FAVFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMT 65
L +++ AF F + +C G S + G+ GE++LV+D PVG+ +
Sbjct: 4 LKYILVAFSFGILFSSCEEGSGDYDSSKPKARGTIGEIILVIDSTKWAGPVGDELKDIFE 63
Query: 66 EVAPALNQNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDI----------FTKTSLKY 115
P + ++EP V+RV + T L+ N++ + D F+K S
Sbjct: 64 ADLPGMIRSEPRFKVNRVDPRAMTRMLKMSTNLIYVTTFDDKGGASQVINSQFSKASKDK 123
Query: 116 G-----------YDQWANGQLVVLINSPATDSLTAYVESNKSAIQNLFIRHELFLFGQLW 164
D++A GQ V+ + D L A + NK + NLF E ++
Sbjct: 124 AEADPSLFQLRMEDEYARGQEVLYLFGNTEDELVANLRKNKQRLINLFEVRERGRLEKII 183
Query: 165 QKDCSADADQRVQRMFGYHINVPADILSHKEGKDFLWM---SNNQMRRRKDFLVYTFPYR 221
+ A + G INVPA K FLW+ + R Y Y
Sbjct: 184 LARKNTAALHEAETNLGIKINVPASYQVVKSKPGFLWVRQPTPTANRADISLFFYETDYV 243
Query: 222 EASDLGLDRMVEVRDSVLKANIEG--SDAGSYPVTERML-LKHRYVYPGDQKRGEVHGIW 278
+ +V+++D + K I G D S+ V+E+ + R + D E+ G W
Sbjct: 244 AEEQTFPENIVKLKDEITKNEIFGDPEDPTSFMVSEKQIPPAFRNMVINDNFTTEIRGSW 303
Query: 279 RMEGGQMMSGPFVAQAFVDGD--RVVVVEGFVYHPNEKKRDLMRQMEAAL 326
+ M G ++ VD ++ +EGFV++PNE RD +R++E L
Sbjct: 304 KTNNIS-MGGTYLGYLMVDEKKGKLYYMEGFVFYPNEVHRDALREIETIL 352
>gi|88711616|ref|ZP_01105704.1| hypothetical protein FB2170_05860 [Flavobacteriales bacterium
HTCC2170]
gi|88710557|gb|EAR02789.1| hypothetical protein FB2170_05860 [Flavobacteriales bacterium
HTCC2170]
Length = 327
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 74/325 (22%), Positives = 151/325 (46%), Gaps = 25/325 (7%)
Query: 18 FAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMTEVAPALNQNEPI 77
++C QG + +S G+ +++V+D++ S VG+ + + L +EP+
Sbjct: 14 IVLSCKDQGKKS--YLPSSIGAYNTLIVVIDNELWKSKVGDKVREHFAASSLGLTMDEPL 71
Query: 78 MAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQWANGQLVVLINSPATDSL 137
+++++ F+G +R R+VL + D +K D ++ Q V ++ + D +
Sbjct: 72 FSITQIPPGVFSGAVRNSRSVLY--VQKDTINLAHIK--TDMYSKPQKVAVVKGNSDDEI 127
Query: 138 TAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGYHINVPADILSHKEGK 197
++ + F E+ + K + + D +Q FG ++N+P+ K +
Sbjct: 128 IENLDLKAEEMIGTFKNLEISEAQTRFLKSLNKEKD--IQEKFGINLNIPSIYKVVKSEE 185
Query: 198 DFLWMSNNQMRRRKDFLVYTFPYREAS--DLGLDRMVEVRDSVLKANIEGSDAGSYPVTE 255
+F+W+ + + + + Y P+ S + +V +RDS+ I G D +
Sbjct: 186 NFVWVERLIQKGQMNIIAYEMPWDSFSVDSTFIKDIVRMRDSIGSLYIPGPDVPG----K 241
Query: 256 RMLLKHR-----YVYPGD---QKRGEVHGIWRMEGGQMMSGPFVAQAFVD--GDRVVVVE 305
R ++ V+P + +K EV G+W ++ M+GPF+ D +R +VVE
Sbjct: 242 RTHMRSEPAFAPSVFPAEIAGRKAAEVRGLWDIKNYP-MAGPFLTYIVNDRENNRKIVVE 300
Query: 306 GFVYHPNEKKRDLMRQMEAALYSMR 330
GF + P +KRD M ++EA + ++R
Sbjct: 301 GFTFAPATEKRDYMFELEAIIKTLR 325
>gi|88804536|ref|ZP_01120056.1| hypothetical protein RB2501_06780 [Robiginitalea biformata
HTCC2501]
gi|88785415|gb|EAR16584.1| hypothetical protein RB2501_06780 [Robiginitalea biformata
HTCC2501]
Length = 337
Score = 85.1 bits (209), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 84/336 (25%), Positives = 150/336 (44%), Gaps = 24/336 (7%)
Query: 6 RLLFVVPAFFAVFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMT 65
R L ++ A AVF+ AC K S G+ + +VM + VG+ +
Sbjct: 2 RKLVILIACAAVFS---ACDDTPKKKYKPGSIGALNTVAVVMPNAMWEGAVGDKVREHFA 58
Query: 66 EVAPALNQNEPIMAVSRVSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQWANGQL 125
L +EP + + FTG R+ R+VL + + + T + D +A+ Q+
Sbjct: 59 APLIGLTWDEPRFNLEHMPPSVFTGTTRHRRSVLYVSRD----SVTGAQVQSDLYASPQM 114
Query: 126 VVLINSPATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGYHIN 185
V ++ + ++ A +E++ I + E+ + +++ S + Q FG +
Sbjct: 115 VGVVKGTSDSAIMAAIETHAGQIIDSIRTMEMEEAQERFRRSLSKET--IFQDKFGVTLR 172
Query: 186 VPADILSHKEGKDFLWMSNNQMRRRKDFLVYTFP---YREASDLGLDRMVEVRDSVLKAN 242
+P+ K+ ++F+W+ + + + Y P + S D +V +RDS+ K
Sbjct: 173 LPSIYKVGKQEENFVWIDREIQKGSMNIIAYEMPGDSFDNDSTFVRD-IVRMRDSIGKLY 231
Query: 243 IEGSDAG---SYPVTERMLLKHRYVYP---GDQKRGEVHGIWRMEGGQMMSGPFVAQAFV 296
I G D +Y TE+ V+P G + EV GIW + M+GPF+
Sbjct: 232 IPGPDVPGKVTYMGTEKAFAPS--VFPAEVGGKPAAEVRGIWEIVN-YPMAGPFLTYIIN 288
Query: 297 D--GDRVVVVEGFVYHPNEKKRDLMRQMEAALYSMR 330
D +R +V+EGF + P KRD M ++EA L ++R
Sbjct: 289 DPENNRKLVLEGFTFAPATNKRDYMFELEAILRTVR 324
>gi|89889749|ref|ZP_01201260.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
gi|89518022|gb|EAS20678.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
Length = 325
Score = 85.1 bits (209), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 79/328 (24%), Positives = 155/328 (47%), Gaps = 26/328 (7%)
Query: 15 FAVFAMACACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMTEVAPALNQN 74
+ AM A T+S G +++LV++++ + VG++I + L +
Sbjct: 6 YIFLAMITLTGCNDAPVIVTDSAGRMNDILLVIENELWENEVGDSIRQILAAPVDGLVRE 65
Query: 75 EPIMAVSRVSEKDFTGFLRYIRNVLQIDIN---PDIFTKTSLKYGYDQWANGQLVVLINS 131
EP +++++ F G L RN L+++++ P + T L +AN Q V+I
Sbjct: 66 EPQFSLNQIEPDSFAGLLMKNRNYLKVEVSNKEPGVNVTTGL------YANPQTGVIIRG 119
Query: 132 PATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGYHINVPADI- 190
+ + +N I+ +F E+ +L +K + + DQ ++ F + P
Sbjct: 120 KDAAGIIEQIITNAQQIKTIFNDGEIKEKQRLMEK-VALNIDQ-IKERFNIELIAPRSYR 177
Query: 191 LSHKEGKDFLWMSNNQMRRRKDFLVYTF---PYREASDLGLDRMVEVRDSV--LKANIEG 245
+ + KDF W+ N D ++Y + S++ + + +VRDSV LK +
Sbjct: 178 YAAPDDKDFFWLRRNIKEGTMDIMIYEVNRDKIPQDSNV-VASITKVRDSVGSLKVPV-- 234
Query: 246 SDAGSYPVTERMLLKHRYVYPGDQKRG-EVHGIWRMEGGQMMSGPFVAQAFVDGDRV--V 302
D G + +TER + D K E G+W + GQ MSGPF+ A + ++ +
Sbjct: 235 -DKGEF-ITERAFSPYVNESQVDGKFAYETKGLWEV-SGQFMSGPFLNYAIYNEEKNNWL 291
Query: 303 VVEGFVYHPNEKKRDLMRQMEAALYSMR 330
++EG+++ P+ K+R+ + ++E+ L S++
Sbjct: 292 IIEGYIFAPSAKQRNYLFELESILKSVK 319
>gi|91217050|ref|ZP_01254013.1| hypothetical protein P700755_11335 [Psychroflexus torquis ATCC
700755]
gi|91184921|gb|EAS71301.1| hypothetical protein P700755_11335 [Psychroflexus torquis ATCC
700755]
Length = 348
Score = 84.7 bits (208), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 139/318 (43%), Gaps = 23/318 (7%)
Query: 23 ACQGGTAGKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMTEVAPALNQNEPIMAVSR 82
+C S+G+ ++ +++ ++ +GE I L Q EP A+ +
Sbjct: 41 SCNDNKTSAYKAESSGNLNQLSVIITNEDWQDSIGEKIREIFAADVDGLPQQEPKYALRQ 100
Query: 83 VSEKDFTGFLRYIRNVLQIDINPDIFTKTSLKYGYDQWANGQLVVLINSPATDSLTAYVE 142
+ F+GF R R L+I+ + + T + D +A Q + I ++ +E
Sbjct: 101 IPPMAFSGFARKNRTFLKIEKSKE----TGHQILLDSFAKPQFGIAITGSNQQTIIEELE 156
Query: 143 SNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGYHINVPADILSHKEGKDFLWM 202
NK + L H+ L + + + + + G + P+ K+ +F WM
Sbjct: 157 QNKDSF--LGKLHDTELKEKRRRMSKAPKKVDVLTKSLGVSMLFPSAYRYAKQEDNFFWM 214
Query: 203 SNNQMRRRKDFLVYTFPYREASDLGLDR-------MVEVRDSVLKANIEGSDAGSYPVTE 255
+ + L Y E S ++R ++++RDSV K NI G ++ +TE
Sbjct: 215 RKDIKNGSMELLAY-----EVSKSTIERNGDIIANVIKMRDSVGK-NIPGPTEDTHMITE 268
Query: 256 RMLLKHRYVYPGDQKRG-EVHGIWRMEGGQMMSGPFVAQAFVD--GDRVVVVEGFVYHPN 312
+ + D K E+ G W ++G M+GPF+ A D DR V++EGFV+ P+
Sbjct: 269 EAYAPYLFETEIDGKFAYEIKGTWEVKGF-FMAGPFLTYAIRDDKNDRYVILEGFVFKPS 327
Query: 313 EKKRDLMRQMEAALYSMR 330
KR+ M ++EA L S +
Sbjct: 328 SSKRNQMFEIEAILRSAK 345
>gi|86131113|ref|ZP_01049712.1| hypothetical protein MED134_09336 [Cellulophaga sp. MED134]
gi|85818524|gb|EAQ39684.1| hypothetical protein MED134_09336 [Dokdonia donghaensis MED134]
Length = 323
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 78/318 (24%), Positives = 148/318 (46%), Gaps = 19/318 (5%)
Query: 23 ACQGGTAGKSFTNSTGSPGEMMLVM-DDQYLGSPVGEAIYVAMTEVAPALNQNEPIMAVS 81
+C G + T S G+ + +V+ DQ+ G+ +G+A+ + + L + EP ++
Sbjct: 17 SCSNGDGKRILTESVGNINNVKVVLTKDQWQGT-IGDALRDVLADPVYGLPREEPKFSID 75
Query: 82 RVSEKDFTGFLRYIRNVLQIDINP-DIFTKTSLKYGYDQWANGQLVVLINSPATDSLTAY 140
++ + F FL ++N + + I P D T T + D++A Q ++I + +
Sbjct: 76 QIPPEAFGDFL--LKNRIFVQIKPADSATVTIVN---DKYARPQTGIIITGKSETQIANL 130
Query: 141 VESNKSAIQNLFIRHELFLFGQLWQKDCSADADQRVQRMFGYHINVPADILSHKEGKDFL 200
+ N+ I EL + Q K + D +++ FG N P ++ D
Sbjct: 131 IIDNEEKIVTALKNQEL-VANQRRIKKALKNTDS-LRKTFGVRFNFPNAYRYAEQKPDIF 188
Query: 201 WMSNNQMRR-RKDFLVYTFPYR--EASDLGLDRMVEVRDSVLKANIEGSDAGSYPVTERM 257
W+ + R + VY P + + L + ++ +RDS+ I D G + +TER
Sbjct: 189 WIRKDLKRSGNMNITVYEVPLQAIDKDTLTIKNIIRMRDSIGGKKIPVDD-GRF-ITERA 246
Query: 258 LLKHRYVYPGDQK-RGEVHGIWRMEGGQMMSGPFVAQAFVDGD--RVVVVEGFVYHPNEK 314
+ D K E G W ++ G+ M+GPFV A D + R +++EGF++ P++
Sbjct: 247 FSPYLQETEIDGKFTYETKGTWEVD-GRYMAGPFVNYAIRDEENGRYLILEGFIFSPSQD 305
Query: 315 KRDLMRQMEAALYSMRPK 332
+RD M ++E+ L + + K
Sbjct: 306 QRDNMFELESILKTAKLK 323
>gi|146298736|ref|YP_001193327.1| hypothetical protein Fjoh_0975 [Flavobacterium johnsoniae UW101]
gi|146153154|gb|ABQ04008.1| hypothetical protein Fjoh_0975 [Flavobacterium johnsoniae UW101]
Length = 322
Score = 80.5 bits (197), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 72/306 (23%), Positives = 136/306 (44%), Gaps = 28/306 (9%)
Query: 37 TGSPGEMMLVMDDQYLGSPVGEAIYVAMTEVAPALNQNEPIMAVSRVSEKDFTGFLRYIR 96
+G + +++DDQ VG+ I L Q EP+ +++ K GF+ R
Sbjct: 29 SGKTNTISIIIDDQLWYGEVGDTIRNKFASPVLGLTQEEPLFTINQYPAKLLEGFVTDSR 88
Query: 97 NVLQIDINPDIFTKTSLKYGYDQWANGQL---VVLINSPATDSLTAYVESNKSAIQNLFI 153
++ I K + ++ + L I + D + +E N + I +
Sbjct: 89 SI--------IVVKKTANEKFEITHSKALPHNTFRIYGKSIDDIICSIELNSAQIIKMIR 140
Query: 154 RHELFLFGQLWQKDCSADA--DQRVQRMFGYHINVPA--DILSHKEGKDFLWMSNNQMRR 209
E+ + Q+D S ++ F I +P + + HK K+F+W+ + +
Sbjct: 141 DAEI----EKIQEDNSKSLLNPAVIKNKFHIDIQIPTGYEYVLHK--KNFIWLKKDIISG 194
Query: 210 RKDFLVYTFPYREASDLG--LDRMVEVRDSVLKANIEGSDAGSYPVT-ERMLLKHRYVYP 266
L+Y P G + ++++RDSV + I+G + + +T E +
Sbjct: 195 NTSLLIYQIPLHNFKTKGNVVGNIIKMRDSVGRY-IKGREPNTPMITGEAYAPYFSATHL 253
Query: 267 GDQKRGEVHGIWRMEGGQMMSGPFVAQAFVDG--DRVVVVEGFVYHPNEKKRDLMRQMEA 324
D++ E G W ++ M+GPF+ A VD +R++V+EGF Y P+ ++RDLM +EA
Sbjct: 254 DDKEAFETKGTWELKN-DFMAGPFINYAIVDPLYNRILVIEGFCYSPSNQERDLMLDLEA 312
Query: 325 ALYSMR 330
+ S++
Sbjct: 313 IIKSVK 318
>gi|124002673|ref|ZP_01687525.1| hypothetical protein M23134_04121 [Microscilla marina ATCC 23134]
gi|123991901|gb|EAY31288.1| hypothetical protein M23134_04121 [Microscilla marina ATCC 23134]
Length = 351
Score = 79.3 bits (194), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 88/348 (25%), Positives = 145/348 (41%), Gaps = 50/348 (14%)
Query: 22 CACQGGTA-----GKSFTNSTGSPGEMMLVMDDQYLGSPVGEAIYVAMTEVAPALNQNEP 76
AC GG + +S G GE+++VMD + P+G+A+ PAL Q E
Sbjct: 4 SACIGGDQTSKDNSQFVVDSKGGIGEIVVVMDPKKWDEPLGKALRNVFARYIPALPQPES 63
Query: 77 IMAVSRVSEKDFTGFLRYIRNVLQI------------------DINPDIFTKTSLKY--- 115
+ ++ F F++ N++ + + D+ K K+
Sbjct: 64 WFKLMKIPPSKFKSFMKRHHNIVILASLDNTSLEGRYLKGYFTQKSIDLIKKDPKKFMMR 123
Query: 116 GYDQWANGQLVVLINSPATDSLTAYVESNKSAIQNLFIRHELFLFGQLWQKDCSADADQ- 174
D +A GQ V+ + L + NK + N F E + KD A+Q
Sbjct: 124 RKDVFAKGQRVMYLFGNNDTQLIENMTKNKDQLLNYFHDME----QKRLTKDLYNIAEQY 179
Query: 175 ----RVQRMFGYHINVPADILSHKEGKDFLWMSN--NQMRRRKD--FLVYTFPYREASDL 226
V+ + + +PA K+ K+F+W+ + R+ D L+ + PY
Sbjct: 180 KLSNYVKDIMKVKVRIPAGFKLAKKDKNFVWLRKPGDAALRKADINLLMASRPYTSTKVF 239
Query: 227 GLDRMVEVRDSVLKANI-EGSDAGSYPVTER----MLLKHRYVYPGDQKRG-EVHGIWRM 280
++ RDS+ K + + SY VTE L+ R + D+K E G W++
Sbjct: 240 DEQNIIAWRDSLGKKYTNDPTLTESYVVTETRFEPALITKRII--KDKKFAVEYRGFWKL 297
Query: 281 EGGQMMSGPFVAQAFVD--GDRVVVVEGFVYHPNEKKRDLMRQMEAAL 326
+ G G F++ AFVD +V +EGF+Y P KK+D + Q EA L
Sbjct: 298 KNG-TRGGAFISYAFVDEAQKKVFYLEGFIYAPGMKKKDAIFQTEAIL 344
>gi|83815026|ref|YP_445790.1| hypothetical protein SRU_1671 [Salinibacter ruber DSM 13855]
gi|83756420|gb|ABC44533.1| hypothetical protein SRU_1671 [Salinibacter ruber DSM 13855]
Length = 382
Score = 70.5 bits (171), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 75/335 (22%), Positives = 136/335 (40%), Gaps = 42/335 (12%)
Query: 36 STGSPGEMMLVMDDQYLGSPVGEAIYVAMTEVAPALNQNEPIMAVSRVSEKDFTGFLRY- 94
+ G GE+ +VMD + P GEA+ +T L +E + + D + R
Sbjct: 34 AVGPIGEITVVMDSSHWTGPSGEALRANVTPWVETLPVSERSFQLRHLELTDERTYERVQ 93
Query: 95 -IRNV-----LQIDINPDIFTKTSLKYGYDQ---------------WANGQLVVLINSPA 133
++NV L N F + L +Q W Q + + + +
Sbjct: 94 DLKNVIIAAPLSDSTNETSFLRRRLSDEAEQAILDGQTAVVPKPNLWRRSQRIYYVAAAS 153
Query: 134 TDSLTAYVESNKSAIQNLFIRHELFLFGQ-LWQKDCSADADQRVQRMFGYHIN------V 186
D+L +S + I++ F L + +++ D + + + + G+ +N +
Sbjct: 154 PDALAETFQSQGTQIRDTFTEATLQRMDRNMYEDDRRSALEDTLMQRHGFAVNMQSDFQI 213
Query: 187 PADILSHKEGKDFLWMSNNQMRRRKDFLVYTFPYREASDLGLDRMVEVRDSVLKANIEGS 246
D + EG F+W+ + R++F VY + + + RDS+ + ++ G+
Sbjct: 214 AVDTTTGSEG--FVWLRRLLAQTRREFFVYYVEDVSPDQITPEWIYNTRDSLTRRHLRGN 271
Query: 247 DAGSYPVTERMLLKHRYVYPGDQKRGEVHGIWRM----EGGQMM-----SGPFVAQAFVD 297
G + R LK D+ + G+W M GG + GPFV F D
Sbjct: 272 VRGFAKIDYRRPLKTEQRSFLDRYAYDSRGLWYMVAPLGGGNELRPVGGGGPFVNYTFYD 331
Query: 298 --GDRVVVVEGFVYHPNEKKRDLMRQMEAALYSMR 330
DRV +++G ++ P+ K DL+RQME + R
Sbjct: 332 QATDRVYMLDGSIFAPDNDKIDLLRQMEVMARTFR 366
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
Posted date: May 10, 2008 4:54 AM
Number of letters in database: 884,634,002
Number of sequences in database: 2,620,852
Database: /apps/blastdb/nr.01
Posted date: May 10, 2008 4:52 AM
Number of letters in database: 976,814,986
Number of sequences in database: 2,761,530
Database: /apps/blastdb/nr.02
Posted date: May 10, 2008 4:46 AM
Number of letters in database: 360,829,861
Number of sequences in database: 1,132,722
Lambda K H
0.322 0.136 0.407
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,559,492,444
Number of Sequences: 6515104
Number of extensions: 65470330
Number of successful extensions: 130384
Number of sequences better than 1.0e-04: 31
Number of HSP's better than 0.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 19
Number of HSP's that attempted gapping in prelim test: 130296
Number of HSP's gapped (non-prelim): 31
length of query: 351
length of database: 2,222,278,849
effective HSP length: 135
effective length of query: 216
effective length of database: 1,342,739,809
effective search space: 290031798744
effective search space used: 290031798744
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 122 (51.6 bits)