BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= PI0048
(432 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|29347490|ref|NP_810993.1| conserved hypothetical protein... 564 e-159
gi|153808847|ref|ZP_01961515.1| hypothetical protein BACCAC... 547 e-154
gi|150004631|ref|YP_001299375.1| putative inner membrane pr... 545 e-153
gi|53715056|ref|YP_101048.1| putative inner membrane protei... 544 e-153
gi|156109651|gb|EDO11396.1| hypothetical protein BACOVA_033... 541 e-152
gi|156861269|gb|EDO54700.1| hypothetical protein BACUNI_016... 541 e-152
gi|60683018|ref|YP_213162.1| hypothetical protein BF3560 [B... 541 e-152
gi|154492788|ref|ZP_02032414.1| hypothetical protein PARMER... 479 e-133
gi|150007772|ref|YP_001302515.1| putative inner membrane pr... 478 e-133
gi|34540670|ref|NP_905149.1| hypothetical protein PG0909 [P... 437 e-121
gi|78355854|ref|YP_387303.1| hypothetical protein Dde_0807 ... 347 7e-94
gi|153953253|ref|YP_001394018.1| hypothetical protein CKL_0... 347 1e-93
gi|117619569|ref|YP_858502.1| hypothetical protein AHA_4077... 330 1e-88
gi|145297338|ref|YP_001140179.1| hypothetical protein ASA_0... 330 2e-88
gi|90409992|ref|ZP_01218009.1| hypothetical inner membrane ... 321 5e-86
gi|153094355|gb|EDN75210.1| hypothetical protein MHA_2323 [... 320 8e-86
gi|54301920|ref|YP_131913.1| hypothetical inner membrane pr... 317 8e-85
gi|89076151|ref|ZP_01162509.1| hypothetical protein SKA34_0... 308 5e-82
gi|89076156|ref|ZP_01162514.1| hypothetical protein SKA34_0... 305 3e-81
gi|90578283|ref|ZP_01234094.1| hypothetical protein VAS14_1... 305 3e-81
gi|146329640|ref|YP_001209036.1| hypothetical protein DNO_0... 304 9e-81
gi|34497279|ref|NP_901494.1| hypothetical protein CV_1824 [... 303 2e-80
gi|90409323|ref|ZP_01217416.1| hypothetical protein PCNPT3_... 300 1e-79
gi|156975351|ref|YP_001446258.1| hypothetical protein VIBHA... 297 9e-79
gi|149189825|ref|ZP_01868105.1| hypothetical inner membrane... 296 1e-78
gi|148323198|gb|EDK88448.1| hypothetical protein FNP_0644 [... 296 2e-78
gi|153833860|ref|ZP_01986527.1| conserved hypothetical prot... 296 2e-78
gi|118754581|ref|ZP_01602377.1| protein of unknown function... 295 3e-78
gi|116185206|ref|ZP_01475121.1| hypothetical protein VEx2w_... 295 3e-78
gi|123440826|ref|YP_001004817.1| hypothetical protein YE044... 295 5e-78
gi|28898947|ref|NP_798552.1| hypothetical protein VP2173 [V... 294 6e-78
gi|153838698|ref|ZP_01991365.1| conserved hypothetical prot... 293 2e-77
gi|75240858|ref|ZP_00724756.1| COG3681: Uncharacterized con... 291 5e-77
gi|26249696|ref|NP_755736.1| hypothetical protein c3867 [Es... 291 6e-77
gi|82545369|ref|YP_409316.1| hypothetical protein SBO_2975 ... 289 3e-76
gi|75197064|ref|ZP_00707134.1| COG3681: Uncharacterized con... 289 3e-76
gi|15803649|ref|NP_289682.1| hypothetical protein Z4462 [Es... 288 4e-76
gi|75187921|ref|ZP_00701188.1| COG3681: Uncharacterized con... 288 4e-76
gi|75239319|ref|ZP_00723290.1| COG3681: Uncharacterized con... 288 5e-76
gi|49176312|ref|YP_026202.1| conserved protein [Escherichia... 288 6e-76
gi|16766537|ref|NP_462152.1| putative inner membrane protei... 287 9e-76
gi|74313659|ref|YP_312078.1| hypothetical protein SSON_3267... 287 1e-75
gi|75211064|ref|ZP_00711177.1| COG3681: Uncharacterized con... 287 1e-75
gi|110806992|ref|YP_690512.1| hypothetical protein SFV_3151... 286 3e-75
gi|75178731|ref|ZP_00698767.1| COG3681: Uncharacterized con... 285 4e-75
gi|16762010|ref|NP_457627.1| hypothetical protein STY3418 [... 285 4e-75
gi|62181754|ref|YP_218171.1| putative inner membrane protei... 285 5e-75
gi|82778439|ref|YP_404788.1| hypothetical protein SDY_3300 ... 284 7e-75
gi|56415176|ref|YP_152251.1| hypothetical protein SPA3107 [... 283 1e-74
gi|157085887|gb|ABV15565.1| hypothetical protein CKO_04510 ... 282 3e-74
gi|53729252|ref|ZP_00133782.2| COG3681: Uncharacterized con... 281 7e-74
gi|59711248|ref|YP_204024.1| hypothetical protein VF0641 [V... 280 2e-73
gi|148976149|ref|ZP_01812892.1| hypothetical protein VSWAT3... 278 7e-73
gi|90409022|ref|ZP_01217151.1| hypothetical protein PCNPT3_... 277 1e-72
gi|19704482|ref|NP_604044.1| hypothetical protein FN1147 [F... 276 2e-72
gi|110800835|ref|YP_695254.1| hypothetical protein CPF_0803... 275 4e-72
gi|18309788|ref|NP_561722.1| hypothetical protein CPE0806 [... 274 1e-71
gi|149909553|ref|ZP_01898207.1| hypothetical protein PE36_1... 273 1e-71
gi|110801779|ref|YP_698115.1| hypothetical protein CPR_0790... 273 2e-71
gi|152972895|ref|YP_001338041.1| hypothetical protein KPN_0... 269 2e-70
gi|106894563|ref|ZP_01361681.1| Protein of unknown function... 268 4e-70
gi|146292312|ref|YP_001182736.1| protein of unknown functio... 268 5e-70
gi|120599752|ref|YP_964326.1| protein of unknown function D... 268 7e-70
gi|113971128|ref|YP_734921.1| protein of unknown function D... 267 1e-69
gi|149115725|ref|ZP_01842464.1| protein of unknown function... 266 2e-69
gi|118073122|ref|ZP_01541306.1| protein of unknown function... 266 2e-69
gi|117921411|ref|YP_870603.1| protein of unknown function D... 266 2e-69
gi|113949660|ref|ZP_01435305.1| protein of unknown function... 266 2e-69
gi|153001593|ref|YP_001367274.1| protein of unknown functio... 266 2e-69
gi|153810614|ref|ZP_01963282.1| hypothetical protein RUMOBE... 263 1e-68
gi|20806790|ref|NP_621961.1| hypothetical protein TTE0269 [... 263 2e-68
gi|28211905|ref|NP_782849.1| hypothetical protein CTC02309 ... 261 5e-68
gi|150383265|ref|ZP_01922103.1| protein of unknown function... 260 1e-67
gi|24372981|ref|NP_717023.1| hypothetical protein SO_1403 [... 260 1e-67
gi|150392345|ref|YP_001322394.1| protein of unknown functio... 259 3e-67
gi|117619552|ref|YP_856155.1| hypothetical protein AHA_1619... 254 1e-65
gi|42527647|ref|NP_972745.1| hypothetical protein TDE2144 [... 251 5e-65
gi|153854136|ref|ZP_01995444.1| hypothetical protein DORLON... 247 1e-63
gi|154502833|ref|ZP_02039893.1| hypothetical protein RUMGNA... 244 1e-62
gi|106887868|ref|ZP_01355140.1| Protein of unknown function... 243 2e-62
gi|126700852|ref|YP_001089749.1| hypothetical protein CD323... 242 3e-62
gi|154498241|ref|ZP_02036619.1| hypothetical protein BACCAP... 238 5e-61
gi|81427766|ref|YP_394765.1| hypothetical protein LSA0156 [... 237 1e-60
gi|89893855|ref|YP_517342.1| hypothetical protein DSY1109 [... 234 1e-59
gi|153941420|ref|YP_001391289.1| hypothetical protein CLI_2... 231 1e-58
gi|148379926|ref|YP_001254467.1| hypothetical protein CBO19... 229 3e-58
gi|153814330|ref|ZP_01966998.1| hypothetical protein RUMTOR... 227 1e-57
gi|153939751|ref|YP_001391333.1| hypothetical protein CLI_2... 226 2e-57
gi|51244443|ref|YP_064327.1| hypothetical protein DP0591 [D... 221 7e-56
gi|147920322|ref|YP_685905.1| hypothetical protein RCIX1285... 220 2e-55
gi|148379970|ref|YP_001254511.1| membrane protein [Clostrid... 219 2e-55
gi|77975571|ref|ZP_00831106.1| COG3681: Uncharacterized con... 217 2e-54
gi|153853582|ref|ZP_01994962.1| hypothetical protein DORLON... 215 4e-54
gi|153940079|ref|YP_001391070.1| hypothetical protein CLI_1... 209 2e-52
gi|148379774|ref|YP_001254315.1| hypothetical protein CBO18... 206 2e-51
gi|50841709|ref|YP_054936.1| conserved membrane associated ... 192 4e-47
gi|139437096|ref|ZP_01771256.1| Hypothetical protein COLAER... 187 1e-45
gi|154249957|ref|YP_001410782.1| protein of unknown functio... 185 6e-45
gi|39996627|ref|NP_952578.1| hypothetical protein GSU1527 [... 181 7e-44
gi|77979116|ref|ZP_00834537.1| COG3681: Uncharacterized con... 181 7e-44
gi|150020899|ref|YP_001306253.1| protein of unknown functio... 180 1e-43
gi|145953743|ref|ZP_01802751.1| hypothetical protein CdifQ_... 177 2e-42
gi|153855512|ref|ZP_01996631.1| hypothetical protein DORLON... 171 7e-41
gi|7465931|pir||A65100 hypothetical 19.4 kD protein in exuR... 159 4e-37
gi|83590261|ref|YP_430270.1| Protein of unknown function DU... 159 5e-37
gi|30064452|ref|NP_838623.1| hypothetical protein S3359 [Sh... 157 1e-36
gi|15669214|ref|NP_248019.1| hypothetical protein MJ1025 [M... 156 2e-36
gi|120603539|ref|YP_967939.1| protein of unknown function D... 156 3e-36
gi|46578856|ref|YP_009664.1| hypothetical protein DVU0440 [... 156 3e-36
gi|46369625|gb|AAS89662.1| putative inner membrane protein ... 153 2e-35
gi|134045156|ref|YP_001096642.1| protein of unknown functio... 141 8e-32
gi|150402637|ref|YP_001329931.1| protein of unknown functio... 140 2e-31
gi|45359031|ref|NP_988588.1| hypothetical protein MMP1468 [... 136 3e-30
gi|30064453|ref|NP_838624.1| hypothetical protein S3360 [Sh... 122 4e-26
gi|7465932|pir||B65100 hypothetical 19.4 kD protein in exuR... 104 1e-20
gi|34764933|ref|ZP_00145275.1| hypothetical protein [Fusoba... 95 8e-18
gi|145640967|ref|ZP_01796549.1| tRNA (uracil-5-)-methyltran... 81 1e-13
gi|145630605|ref|ZP_01786385.1| tRNA (uracil-5-)-methyltran... 81 1e-13
gi|145953744|ref|ZP_01802752.1| hypothetical protein CdifQ_... 80 2e-13
gi|145632224|ref|ZP_01787959.1| tRNA (uracil-5-)-methyltran... 74 2e-11
gi|68249449|ref|YP_248561.1| hypothetical protein NTHI1023 ... 73 3e-11
gi|16272795|ref|NP_439015.1| hypothetical protein HI0855 [H... 73 4e-11
gi|46133161|ref|ZP_00156710.2| COG3681: Uncharacterized con... 72 9e-11
gi|34764928|ref|ZP_00145272.1| hypothetical protein [Fusoba... 66 4e-09
>gi|29347490|ref|NP_810993.1| conserved hypothetical protein, putative inner membrane protein
[Bacteroides thetaiotaomicron VPI-5482]
gi|29339390|gb|AAO77187.1| conserved hypothetical protein, putative inner membrane protein
[Bacteroides thetaiotaomicron VPI-5482]
Length = 429
Score = 564 bits (1453), Expect = e-159, Method: Composition-based stats.
Identities = 285/426 (66%), Positives = 348/426 (81%), Gaps = 2/426 (0%)
Query: 7 REQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIP 66
R+QIIDL+ ++V+PA+GCTEP+AVALC A+A E LG KPEKI LSANILKNAMGVGIP
Sbjct: 6 RKQIIDLIKKEVIPAIGCTEPIAVALCVAKAAETLGMKPEKIEVLLSANILKNAMGVGIP 65
Query: 67 GTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLY 126
GT M+GLPIA++LGALIG+SEYQLEV++D TPE +E+GK +++E RI I LKE ITEKLY
Sbjct: 66 GTDMVGLPIAVALGALIGRSEYQLEVLRDCTPEAVEQGKLFIAEKRICISLKEDITEKLY 125
Query: 127 IEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWD 186
IE+ C AG +KATA+I+ HT FVY D KVLLDKQ A E ++ ++LNL+ V+D
Sbjct: 126 IEVICTAGSQKATAVIAGGHTTFVYIATDEKVLLDKQQTANE--EEEDASLELNLRKVYD 183
Query: 187 FATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSK 246
FA T+P++EI FIL+ R N AAE+A KGNYGH +GK + + G+S++SHILS
Sbjct: 184 FALTSPLDEIRFILDTARLNKAAAEQAFKGNYGHSLGKMLRGTYEHKVMGDSVFSHILSY 243
Query: 247 TASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIY 306
T++ACDARM GAM+PVMSNSGSGNQGI AT PVVVFA+EN T EEL+RAL LSHLT IY
Sbjct: 244 TSAACDARMAGAMIPVMSNSGSGNQGISATLPVVVFAEENGKTEEELIRALMLSHLTVIY 303
Query: 307 IKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCA 366
IKQ+LG LSALCGC+VA TGSSCGIT+LMGG+YN + +AV+NMIANLTGM+CDGAKPSCA
Sbjct: 304 IKQSLGRLSALCGCVVAATGSSCGITWLMGGNYNQVAFAVQNMIANLTGMICDGAKPSCA 363
Query: 367 LKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIM 426
LKV++GVSTA+LSAM++ME+ VT EGIID+D+D+SIRNLT +G AMNE D MVLDIM
Sbjct: 364 LKVTTGVSTAVLSAMMAMEDRCVTSVEGIIDEDVDQSIRNLTRIGSQAMNETDKMVLDIM 423
Query: 427 TSKGAC 432
T KG C
Sbjct: 424 THKGGC 429
>gi|153808847|ref|ZP_01961515.1| hypothetical protein BACCAC_03147 [Bacteroides caccae ATCC 43185]
gi|149128673|gb|EDM19891.1| hypothetical protein BACCAC_03147 [Bacteroides caccae ATCC 43185]
Length = 446
Score = 547 bits (1409), Expect = e-154, Method: Composition-based stats.
Identities = 278/424 (65%), Positives = 341/424 (80%), Gaps = 2/424 (0%)
Query: 7 REQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIP 66
R+QII+L+ ++V+PA+GCTEP+AVALC A+A E LG KPEKI LSANILKNAMGVGIP
Sbjct: 24 RQQIIELIKKEVIPAIGCTEPIAVALCVAKAAETLGVKPEKIEVLLSANILKNAMGVGIP 83
Query: 67 GTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLY 126
GTGM+GLPIA++LGALIG+S YQLEV++D TPE +E+GKQ+++E RI I LKE ITEKLY
Sbjct: 84 GTGMVGLPIAVALGALIGRSAYQLEVLRDCTPEAVEQGKQFIAEKRIRISLKEDITEKLY 143
Query: 127 IEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWD 186
IE+ C++G K A A+IS HT F+Y + VLLDKQ E + +LNL V+D
Sbjct: 144 IEVICKSGDKTAKAVISGGHTTFIYIAMNENVLLDKQQATAEEEEETSP--ELNLYKVYD 201
Query: 187 FATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSK 246
FA T P++EI FIL+ R N AAE+A KGNYGH +GK + + G+S++SHILS
Sbjct: 202 FALTAPLDEIRFILDTARLNKAAAEQAFKGNYGHSLGKMLRGTYEHKVMGDSVFSHILSY 261
Query: 247 TASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIY 306
T++ACDARM GAM+PVMSNSGSGNQGI AT PVVVFA+EN T EEL+RAL LSHLT IY
Sbjct: 262 TSAACDARMAGAMIPVMSNSGSGNQGISATLPVVVFAEENGKTEEELIRALMLSHLTVIY 321
Query: 307 IKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCA 366
IKQ+LG LSALCGC+VA TGSSCGIT+LMGG+YN + +AV+NMIANLTGM+CDGAKPSCA
Sbjct: 322 IKQSLGRLSALCGCVVAATGSSCGITWLMGGNYNQVAFAVQNMIANLTGMICDGAKPSCA 381
Query: 367 LKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIM 426
LKV++GVSTA+LSA+++ME+ VT EGIID+D+D+SIRNLT +G AMNE D MVLDIM
Sbjct: 382 LKVTTGVSTAVLSAIMAMEDRCVTSVEGIIDEDVDQSIRNLTRIGSQAMNETDKMVLDIM 441
Query: 427 TSKG 430
T KG
Sbjct: 442 THKG 445
>gi|150004631|ref|YP_001299375.1| putative inner membrane protein [Bacteroides vulgatus ATCC 8482]
gi|149933055|gb|ABR39753.1| putative inner membrane protein [Bacteroides vulgatus ATCC 8482]
Length = 431
Score = 545 bits (1405), Expect = e-153, Method: Composition-based stats.
Identities = 285/430 (66%), Positives = 345/430 (80%), Gaps = 3/430 (0%)
Query: 1 MLEKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNA 60
M+ K REQII L++R+VVPA+GCTEP+AVALC A+ATE LG++PE+I A LSANILKNA
Sbjct: 1 MIAKPEREQIIALINREVVPAIGCTEPIAVALCVAKATETLGKRPERIKALLSANILKNA 60
Query: 61 MGVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEG 120
MGVGIPGTGMIGLPIAI+LGALIGKSEYQLEV+KD TP+ + EGK+ + I I LKE
Sbjct: 61 MGVGIPGTGMIGLPIAIALGALIGKSEYQLEVLKDSTPDAVAEGKKLIDSQAISIGLKEN 120
Query: 121 ITEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLN 180
I EKLYIE+ CEA G ATAII+ HTNF+Y + +VLL+KQ + D K+ +LN
Sbjct: 121 IEEKLYIEIICEADGDTATAIIACGHTNFIYVALNNQVLLNKQTTST--CNEDAKEPELN 178
Query: 181 LKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSR-GLFGNSI 239
L+ V+DFATTTP++EI FILE KR N AAE + KGNYGH +GK + S + G++
Sbjct: 179 LRKVYDFATTTPLDEIRFILETKRLNKAAAERSFKGNYGHELGKILKSSKSEEQILGSNT 238
Query: 240 YSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTL 299
++HILS T++ACDARM GAM+PVMSNSGSGNQGI AT PVVV+A++N + EEL+RALTL
Sbjct: 239 FTHILSYTSAACDARMAGAMIPVMSNSGSGNQGITATLPVVVYAEDNHKSEEELIRALTL 298
Query: 300 SHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCD 359
SHLTAIYIKQ+LG LSALCGC+VA TGSSCGITYLMGG Y I +AV+NMIANLTGM+CD
Sbjct: 299 SHLTAIYIKQSLGRLSALCGCVVAATGSSCGITYLMGGTYEQITFAVQNMIANLTGMICD 358
Query: 360 GAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMD 419
GAKPSCALK+SSGVSTA+ SA+L+ME+ V+ EGIID D+D+SIRNLT +G MNE D
Sbjct: 359 GAKPSCALKLSSGVSTAVFSAILAMEHKCVSSVEGIIDNDVDRSIRNLTRIGSQGMNETD 418
Query: 420 IMVLDIMTSK 429
+VLDIMT K
Sbjct: 419 KLVLDIMTHK 428
>gi|53715056|ref|YP_101048.1| putative inner membrane protein [Bacteroides fragilis YCH46]
gi|52217921|dbj|BAD50514.1| putative inner membrane protein [Bacteroides fragilis YCH46]
Length = 446
Score = 544 bits (1401), Expect = e-153, Method: Composition-based stats.
Identities = 280/424 (66%), Positives = 343/424 (80%), Gaps = 2/424 (0%)
Query: 7 REQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIP 66
R+QII L+ R+V+PA+GCTEP+AVALC A+ATE LG KPEKI LSANILKNAMGVGIP
Sbjct: 24 RKQIIALIQREVIPAIGCTEPIAVALCVAKATETLGAKPEKIKVLLSANILKNAMGVGIP 83
Query: 67 GTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLY 126
GTGMIGLPIA++LGALIGKS+YQLEV+KD TPE +EEGK+ + E RI I LKE ITEKLY
Sbjct: 84 GTGMIGLPIAVALGALIGKSDYQLEVLKDSTPEAVEEGKKLIDEKRICISLKEDITEKLY 143
Query: 127 IEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWD 186
IE+TCEAGG++ATAIIS HT FVY +VLL+KQ G + + ++L L+ V+D
Sbjct: 144 IEVTCEAGGEQATAIISGGHTTFVYVAKGDEVLLNKQ--QTSGEEEEEETLELTLRKVYD 201
Query: 187 FATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSK 246
FA T P++EI FILE R N AAE++ +G+YGH +GK + + G+S++SHILS
Sbjct: 202 FALTAPLDEIRFILETARLNKKAAEQSFQGDYGHALGKMLRGTYEHKIMGDSVFSHILSY 261
Query: 247 TASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIY 306
T++ACDARM GAM+PVMSNSGSGNQGI AT PVVV+A+EN + EEL+RAL +SHLT IY
Sbjct: 262 TSAACDARMAGAMIPVMSNSGSGNQGISATLPVVVYAEENGKSEEELIRALMMSHLTVIY 321
Query: 307 IKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCA 366
IKQ+LG LSALCGC+VA TGSSCGIT+LMGG Y + +AV+NMIANLTGM+CDGAKPSCA
Sbjct: 322 IKQSLGRLSALCGCVVAATGSSCGITWLMGGSYKQVAFAVQNMIANLTGMICDGAKPSCA 381
Query: 367 LKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIM 426
LKV++GVSTA+LSA+++MEN VT EGIID+D+D+SIRNLT +G MNE D +VLDIM
Sbjct: 382 LKVTTGVSTAVLSAVMAMENRCVTSVEGIIDEDVDQSIRNLTRIGSQGMNETDRVVLDIM 441
Query: 427 TSKG 430
T KG
Sbjct: 442 THKG 445
>gi|156109651|gb|EDO11396.1| hypothetical protein BACOVA_03300 [Bacteroides ovatus ATCC 8483]
Length = 428
Score = 541 bits (1395), Expect = e-152, Method: Composition-based stats.
Identities = 276/424 (65%), Positives = 339/424 (79%), Gaps = 2/424 (0%)
Query: 7 REQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIP 66
R QII+L+ ++V+PA+GCTEP+AVALC A+A E LG KPEKI LSANILKNAMGVGIP
Sbjct: 6 RRQIIELIKKEVIPAIGCTEPIAVALCVAKAAETLGAKPEKIEVLLSANILKNAMGVGIP 65
Query: 67 GTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLY 126
GTGM+GLPIA++LGALIGKS+YQLEV++D TPE +E+GKQ+++E RI I LKE ITEKLY
Sbjct: 66 GTGMVGLPIAVALGALIGKSDYQLEVLRDCTPEAVEQGKQFIAEKRICISLKEDITEKLY 125
Query: 127 IEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWD 186
IE+ C+ K A AII+ HT F+Y + + LLDKQ E + +LNL+ V+D
Sbjct: 126 IEVICKTEDKTAKAIIAGGHTTFIYIAKNEQTLLDKQQTVSEEEEEASP--ELNLRKVYD 183
Query: 187 FATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSK 246
FA T P++EI FIL+ N AAE+A KGNYGH +GK + + G+S++SHILS
Sbjct: 184 FALTAPLDEIRFILDTAHLNKAAAEQAFKGNYGHSLGKMLRGTYEHKVMGDSVFSHILSY 243
Query: 247 TASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIY 306
T++ACDARM GAM+PVMSNSGSGNQGI AT PVVVFA+EN + EEL+RAL LSHLT IY
Sbjct: 244 TSAACDARMAGAMIPVMSNSGSGNQGISATLPVVVFAEENGKSEEELIRALMLSHLTVIY 303
Query: 307 IKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCA 366
IKQ+LG LSALCGC+VA TGSSCGIT+LMGG+YN + +AV+NMIANLTGM+CDGAKPSCA
Sbjct: 304 IKQSLGRLSALCGCVVAATGSSCGITWLMGGNYNQVAFAVQNMIANLTGMICDGAKPSCA 363
Query: 367 LKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIM 426
LKV++GVSTA+LSAM++ME+ VT EGIID+D+D+SIRNLT +G AMNE D MVLDIM
Sbjct: 364 LKVTTGVSTAVLSAMMAMEDRCVTSVEGIIDEDVDQSIRNLTRIGSQAMNETDKMVLDIM 423
Query: 427 TSKG 430
T KG
Sbjct: 424 THKG 427
>gi|156861269|gb|EDO54700.1| hypothetical protein BACUNI_01684 [Bacteroides uniformis ATCC 8492]
Length = 426
Score = 541 bits (1395), Expect = e-152, Method: Composition-based stats.
Identities = 276/432 (63%), Positives = 346/432 (80%), Gaps = 10/432 (2%)
Query: 2 LEKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAM 61
+ K R+QIIDL+ +VVPA+GCTEP+AVALC A+A E+LG+ PEKI+ LSANILKNAM
Sbjct: 1 MTKTERQQIIDLVKSEVVPAIGCTEPIAVALCVAKAAEVLGRHPEKITVLLSANILKNAM 60
Query: 62 GVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGI 121
GVGIPGTGMIGLPIA++LGALIGKSEYQLEV+KD P+ +EEG++++ E RI I LK+GI
Sbjct: 61 GVGIPGTGMIGLPIAVALGALIGKSEYQLEVLKDCNPDAVEEGRRFIDEKRIHIALKDGI 120
Query: 122 TEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPA----EEGAPTDNKDI 177
EKLY+E+ CE G +K+TAII+ HT+FVY +G+VLL+KQ A EEGA +
Sbjct: 121 KEKLYVEVCCEVGDEKSTAIIAGGHTSFVYMARNGEVLLNKQAVASTEKEEGA------L 174
Query: 178 QLNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGN 237
L+L+ V+DFA T P++EI FILE R N AAE + +G+YGH +GK + + G+
Sbjct: 175 DLSLRKVYDFALTAPLDEIRFILETARLNKAAAESSFEGDYGHGLGKILRGTYEHKVMGD 234
Query: 238 SIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRAL 297
S++SHILS T+ ACDARM GAM+PVMSNSGSGNQGI AT PV+++A+EN + EEL+RAL
Sbjct: 235 SVFSHILSYTSGACDARMAGAMIPVMSNSGSGNQGISATLPVLIYAEENGKSEEELIRAL 294
Query: 298 TLSHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMV 357
LSHLT IYIKQ+LG LSALCGC+VA TGSSCGIT+LMGG Y + YAV+NMIANLTGM+
Sbjct: 295 MLSHLTVIYIKQSLGRLSALCGCVVAATGSSCGITWLMGGTYEQVAYAVQNMIANLTGMI 354
Query: 358 CDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNE 417
CDGAKPSCALKV++GVSTA+LSA+++MEN VT EGIID+D+D+SIRNLT +G MNE
Sbjct: 355 CDGAKPSCALKVTTGVSTAVLSAIMAMENRCVTSVEGIIDEDVDQSIRNLTKIGSKGMNE 414
Query: 418 MDIMVLDIMTSK 429
D +VL+IMTSK
Sbjct: 415 TDKLVLEIMTSK 426
>gi|60683018|ref|YP_213162.1| hypothetical protein BF3560 [Bacteroides fragilis NCTC 9343]
gi|60494452|emb|CAH09248.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343]
Length = 428
Score = 541 bits (1394), Expect = e-152, Method: Composition-based stats.
Identities = 280/424 (66%), Positives = 342/424 (80%), Gaps = 2/424 (0%)
Query: 7 REQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIP 66
R+QII L+ R+V+PA+GCTEP+AVALC A+ATE LG KPEKI LSANILKNAMGVGIP
Sbjct: 6 RKQIIALIQREVIPAIGCTEPIAVALCVAKATETLGAKPEKIKVLLSANILKNAMGVGIP 65
Query: 67 GTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLY 126
GTGMIGLPIA++LGALIGKS+YQLEV+KD TPE +EEGK+ + E RI I LKE ITEKLY
Sbjct: 66 GTGMIGLPIAVALGALIGKSDYQLEVLKDSTPEAVEEGKKLIDEKRICISLKEDITEKLY 125
Query: 127 IEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWD 186
IE+TCEAGG++ATAIIS HT FVY +VLL+KQ G + ++L L+ V+D
Sbjct: 126 IEVTCEAGGEQATAIISGGHTTFVYVAKGDEVLLNKQ--QTSGEEEKEETLELTLRKVYD 183
Query: 187 FATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSK 246
FA T P++EI FILE R N AAE++ +G+YGH +GK + + G+S++SHILS
Sbjct: 184 FALTAPLDEIRFILETARLNKKAAEQSFQGDYGHALGKMLRGTYEHKIMGDSVFSHILSY 243
Query: 247 TASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIY 306
T++ACDARM GAM+PVMSNSGSGNQGI AT PVVV+A+EN + EEL+RAL +SHLT IY
Sbjct: 244 TSAACDARMAGAMIPVMSNSGSGNQGISATLPVVVYAEENGKSEEELIRALMMSHLTVIY 303
Query: 307 IKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCA 366
IKQ+LG LSALCGC+VA TGSSCGIT+LMGG Y + +AV+NMIANLTGM+CDGAKPSCA
Sbjct: 304 IKQSLGRLSALCGCVVAATGSSCGITWLMGGSYKQVAFAVQNMIANLTGMICDGAKPSCA 363
Query: 367 LKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIM 426
LKV++GVSTA+LSA+++MEN VT EGIID+D+D+SIRNLT +G MNE D +VLDIM
Sbjct: 364 LKVTTGVSTAVLSAVMAMENRCVTSVEGIIDEDVDQSIRNLTRIGSQGMNETDRVVLDIM 423
Query: 427 TSKG 430
T KG
Sbjct: 424 THKG 427
>gi|154492788|ref|ZP_02032414.1| hypothetical protein PARMER_02427 [Parabacteroides merdae ATCC
43184]
gi|154087093|gb|EDN86138.1| hypothetical protein PARMER_02427 [Parabacteroides merdae ATCC
43184]
Length = 430
Score = 479 bits (1234), Expect = e-133, Method: Composition-based stats.
Identities = 261/432 (60%), Positives = 328/432 (75%), Gaps = 8/432 (1%)
Query: 2 LEKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAM 61
++K + QII L+H++V+PA+GCTEP+AVAL A+A E+LG KPEK FLSANILKNAM
Sbjct: 1 MDKTTQTQIIKLIHQEVIPAIGCTEPVAVALAAAKAAEVLGCKPEKTEVFLSANILKNAM 60
Query: 62 GVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGI 121
GVGIPGTGM+GLPIA++LG LIGKS Y LEV++DLTPE L EGKQ + + RI I LK+ +
Sbjct: 61 GVGIPGTGMVGLPIAVALGTLIGKSAYGLEVLRDLTPEALAEGKQVIEDKRIHIALKDNV 120
Query: 122 TEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEG----APTDNKDI 177
+KLYIE+ C AG + + II HTN VY E +G VL D++ +EG A D ++
Sbjct: 121 -DKLYIEVICSAGDETSRVIICHEHTNVVYVEKNGVVLTDRR---KEGVSCDASGDEDEL 176
Query: 178 QLNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGN 237
+L+ V++FA P++EI FILE N AAE +LKGN+GH V KT+ R G+
Sbjct: 177 RLSFSTVYEFAMEMPLDEIRFILETADLNRKAAEASLKGNFGHTVSKTVSGVYGRKYMGD 236
Query: 238 SIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRAL 297
S Y+H+L+ TA+ACDARM GAM+PVMSNSGSGNQGI AT PV+ FA++ E + E+L+RAL
Sbjct: 237 SAYTHMLAMTAAACDARMDGAMIPVMSNSGSGNQGIAATLPVLSFAEDIECSEEQLIRAL 296
Query: 298 TLSHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMV 357
LSHL IYIKQ+LG LSALCGC+VA TG+SCGITYLMGGD I YA+KNMI N+TGM+
Sbjct: 297 MLSHLMVIYIKQSLGRLSALCGCVVAATGASCGITYLMGGDKVQISYAIKNMIGNITGMI 356
Query: 358 CDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNE 417
CDGAKPSCA+KVSSGVSTA+LSA+++MEN VT EGIID+++DKSI NLTS+G M
Sbjct: 357 CDGAKPSCAMKVSSGVSTAMLSALMAMENKVVTPVEGIIDENVDKSIINLTSIGSKGMEA 416
Query: 418 MDIMVLDIMTSK 429
D +VLDIMT K
Sbjct: 417 TDKLVLDIMTGK 428
>gi|150007772|ref|YP_001302515.1| putative inner membrane protein [Parabacteroides distasonis ATCC
8503]
gi|149936196|gb|ABR42893.1| putative inner membrane protein [Parabacteroides distasonis ATCC
8503]
Length = 430
Score = 478 bits (1231), Expect = e-133, Method: Composition-based stats.
Identities = 255/425 (60%), Positives = 326/425 (76%), Gaps = 2/425 (0%)
Query: 6 IREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGI 65
I+ QIIDL+HR+V+PA+GCTEP+AVAL A+A E+LG+KPEKI +LSANILKNAMGVGI
Sbjct: 5 IKTQIIDLIHREVIPAIGCTEPIAVALAAAKAAEVLGRKPEKIEVYLSANILKNAMGVGI 64
Query: 66 PGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKL 125
PGTGM+GLPIAI+LG++IGKS Y LEV+KDLT E L+EGK+ V + I I LKE + +KL
Sbjct: 65 PGTGMVGLPIAIALGSIIGKSAYGLEVLKDLTSEGLKEGKEMVCKKCIGIDLKENV-DKL 123
Query: 126 YIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPT-DNKDIQLNLKMV 184
YIE+ AG ++ II HT+ +Y E +G+VL D + G +NKD++L+ MV
Sbjct: 124 YIEIISSAGNDRSRVIICHEHTHIIYVEKNGEVLTDLRTANASGEEVCENKDLRLSFSMV 183
Query: 185 WDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHIL 244
++FA P++EI FILE N AA+ ++KGNYGH V KT+ R G+S Y+H+L
Sbjct: 184 YEFAMEMPLDEIRFILETAELNKKAAQASMKGNYGHTVSKTVSGAFGRKFMGDSAYTHML 243
Query: 245 SKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTA 304
T++ACDARM GAM+PVMSNSGSGNQGI AT PV+ FA++ + + E+L+RAL LSHL
Sbjct: 244 IMTSAACDARMDGAMIPVMSNSGSGNQGIAATLPVLSFAEDIQCSEEQLIRALMLSHLMV 303
Query: 305 IYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPS 364
IYIKQ+LG LSALCGC+VA TG+SC ITYLMGG+ I YA+KNMI N+TGM+CDGAKPS
Sbjct: 304 IYIKQSLGRLSALCGCVVAATGASCAITYLMGGNKARISYAIKNMIGNITGMICDGAKPS 363
Query: 365 CALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLD 424
CA+KVSSGVSTA+LSA+++ME+ VT EGIID+D+DKSI NLT++G M D +VLD
Sbjct: 364 CAMKVSSGVSTAMLSALMAMEDKVVTSVEGIIDEDVDKSIANLTAIGSKGMEATDRLVLD 423
Query: 425 IMTSK 429
IMT K
Sbjct: 424 IMTGK 428
>gi|34540670|ref|NP_905149.1| hypothetical protein PG0909 [Porphyromonas gingivalis W83]
gi|34396984|gb|AAQ66048.1| conserved hypothetical protein [Porphyromonas gingivalis W83]
Length = 433
Score = 437 bits (1123), Expect = e-121, Method: Composition-based stats.
Identities = 235/432 (54%), Positives = 312/432 (72%), Gaps = 4/432 (0%)
Query: 2 LEKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAM 61
++ + +++II L+ ++VVPA GCTEP+AVAL A+A L+ Q+P+ + LS NILKNAM
Sbjct: 1 MDTSTQQRIISLIKKEVVPATGCTEPVAVALAAAQAASLMEQRPDHVEVLLSPNILKNAM 60
Query: 62 GVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGI 121
GVGIPGTGMIGLPIAI+LG ++ QL+V+ + PE LEE K+ V I + +K+G
Sbjct: 61 GVGIPGTGMIGLPIAIALGIVVADPTKQLKVLDGIAPEQLEEAKKIVDGKIIQVAVKQGD 120
Query: 122 TEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEE----GAPTDNKDI 177
+KLYIE+ AG + A+ II K HTN +Y +G+V++D + A + + ++I
Sbjct: 121 IDKLYIEINMSAGSESASTIIEKIHTNIIYAAHNGQVVIDGRHDAADKSESASSESEEEI 180
Query: 178 QLNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGN 237
L+ +MV+DFA TP EIEFILEA R N +A+E ++KGNYGH VG+ + L R G+
Sbjct: 181 ALSFEMVYDFAMNTPTEEIEFILEAARLNRHASEVSMKGNYGHAVGRMIQGSLGRRYLGD 240
Query: 238 SIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRAL 297
S + +L+ T+SACDARM GA V VMSNSGSGNQGI AT PV+ FA++ + E VRAL
Sbjct: 241 SSLTRMLTYTSSACDARMDGAPVTVMSNSGSGNQGITATLPVLSFAEDEQADHERTVRAL 300
Query: 298 TLSHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMV 357
LS+L IYIKQ LG LSALCGC+VA TGSSCG+ YLMGG I +A+KNMI N+TGM+
Sbjct: 301 VLSNLMVIYIKQKLGRLSALCGCVVAATGSSCGLCYLMGGTKEQIGFAIKNMIGNITGML 360
Query: 358 CDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNE 417
CDGAKPSC++KVSSGVS+A+ SA+L+ME VT EGI+D D+D+SI NLTS+G+D MN
Sbjct: 361 CDGAKPSCSMKVSSGVSSAMFSALLAMEKKVVTSNEGIVDDDVDQSIDNLTSIGRDGMNA 420
Query: 418 MDIMVLDIMTSK 429
D +VL+IMTSK
Sbjct: 421 TDTLVLNIMTSK 432
>gi|78355854|ref|YP_387303.1| hypothetical protein Dde_0807 [Desulfovibrio desulfuricans G20]
gi|78218259|gb|ABB37608.1| conserved hypothetical protein [Desulfovibrio desulfuricans G20]
Length = 428
Score = 347 bits (891), Expect = 7e-94, Method: Composition-based stats.
Identities = 197/419 (47%), Positives = 264/419 (63%), Gaps = 2/419 (0%)
Query: 11 IDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTGM 70
IDL+H++VVPA+GCTEP+AVAL A A LG+ P++I +S N+LKN MGVG+PGTG
Sbjct: 10 IDLLHKEVVPALGCTEPVAVALAAAHAAATLGRTPDRIEVKVSGNLLKNGMGVGVPGTGT 69
Query: 71 IGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEMT 130
G+ IA ++GAL G + LEV+ LTPE E G+Q V+E R+D+ + +G LY E+T
Sbjct: 70 TGMNIAAAVGALGGDPQRGLEVLAGLTPEQAEAGRQMVAEGRVDVSVAQG-APLLYAEVT 128
Query: 131 CEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWDFATT 190
GG A +++ H+N V E DG+ + V + K L + + +FAT
Sbjct: 129 VTGGGHTARSVLVHEHSNIVRLERDGETVFSVPVQDLSQGGVEEK-WPLTMAAIHEFATQ 187
Query: 191 TPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKTASA 250
P + I FILEA R N A E L YG VG+T+D + + L + + + + TA+A
Sbjct: 188 APYDAIAFILEAARLNEAIAVEGLAREYGLKVGRTIDENIRKHLMSDDVSTLAVKLTAAA 247
Query: 251 CDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYIKQN 310
DARM G +PVMSNSGSGNQGI T PVV FAK E++ EEL RAL +SHLT+I++K
Sbjct: 248 SDARMAGVSLPVMSNSGSGNQGITCTMPVVAFAKRLESSDEELARALIMSHLTSIHMKHR 307
Query: 311 LGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCALKVS 370
LG LSALCG VA + CGI LMGG + ++NM+ N+ GM+CDGAK CA+KV+
Sbjct: 308 LGRLSALCGATVAAAAAGCGIVLLMGGGMEQVDRTIRNMVGNVAGMICDGAKTGCAMKVA 367
Query: 371 SGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMTSK 429
S VS + SAML+M+ + EGI++ DI+ I NL LG D M E D +VLDIM SK
Sbjct: 368 SAVSAGVQSAMLAMDGIGIDRREGIVEDDIELCIANLARLGSDGMQEADRVVLDIMVSK 426
>gi|153953253|ref|YP_001394018.1| hypothetical protein CKL_0616 [Clostridium kluyveri DSM 555]
gi|146346134|gb|EDK32670.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555]
Length = 418
Score = 347 bits (889), Expect = 1e-93, Method: Composition-based stats.
Identities = 196/424 (46%), Positives = 276/424 (65%), Gaps = 10/424 (2%)
Query: 10 IIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTG 69
+I L+H++VV A+GCTEP+AVAL A+ E LG+ PE I S NILKN MGVGIPGTG
Sbjct: 1 MIRLLHKEVVLALGCTEPVAVALAAAKCKETLGKIPETIEILASTNILKNGMGVGIPGTG 60
Query: 70 MIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEM 129
M+GL IA +LG G S LEV+ D+ E ++ K+ + E R+ IK K+ +EKLYIE
Sbjct: 61 MVGLHIAAALGVTGGNSHKLLEVLSDIKSEDVKAAKRMIDEKRVGIKHKDA-SEKLYIEA 119
Query: 130 TCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAP----TDNKDIQLNLKMVW 185
C+ + + IIS +HTN V E++G+ ++ E G + K+ ++ + ++
Sbjct: 120 VCKYNDEYSRVIISGSHTNIVLVESNGE-----KISEENGQDILEYKEIKNEKITVDYIY 174
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
F +EI F+L N ++EAL +YG VGK + + G +S+
Sbjct: 175 KFINEISTDEIIFLLHGALINKKLSDEALANHYGLGVGKNLYENVKYGRIEDSMEVRAKY 234
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
TA+A DARM G +PVM+NSGSGNQGI + PV+ A++ + E+L+RAL LS+L AI
Sbjct: 235 TTAAAVDARMAGCSLPVMTNSGSGNQGITVSMPVLAAAEKLKIPQEKLIRALALSNLIAI 294
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
+IK NLG LSALCGC+VA TG+ CGITY+ GG NI YA+KNMI +++GMVCDGAK C
Sbjct: 295 HIKSNLGRLSALCGCVVASTGACCGITYIFGGKLENIKYAIKNMIGDISGMVCDGAKCGC 354
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
ALKVS+GVS A+ +AML++ N +++ +GIIDKD++K+I+NL L + E D ++L+I
Sbjct: 355 ALKVSTGVSAAVQAAMLALSNVEISQNDGIIDKDVEKTIKNLCELDTKGLKEADDVILNI 414
Query: 426 MTSK 429
MT K
Sbjct: 415 MTCK 418
>gi|117619569|ref|YP_858502.1| hypothetical protein AHA_4077 [Aeromonas hydrophila subsp.
hydrophila ATCC 7966]
gi|117560976|gb|ABK37924.1| conserved protein [Aeromonas hydrophila subsp. hydrophila ATCC
7966]
Length = 437
Score = 330 bits (846), Expect = 1e-88, Method: Composition-based stats.
Identities = 176/420 (41%), Positives = 264/420 (62%), Gaps = 5/420 (1%)
Query: 11 IDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTGM 70
I L+ R+VVPA+GCTEPM+VAL A +LLGQ P ++S ++S N+ KN MGVG+PGTGM
Sbjct: 16 ITLLKREVVPALGCTEPMSVALAAANCRKLLGQVPTRVSVWVSGNLFKNGMGVGVPGTGM 75
Query: 71 IGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEMT 130
IGLP+A ++G G + LEV+K LTP +EE K + ++D+K + + LY E+
Sbjct: 76 IGLPVAAAVGITGGNPDAGLEVLKALTPAQVEEAKVLLPAIKVDVK---DVPDVLYAEVL 132
Query: 131 CEAGGKKATAIISKTHTNFVYEEADGKVLLDK-QVPAEEGAPTDNKDIQLNLKMVWDFAT 189
+ G A +I HT + E DG+VL+ + P + + + L+ + +FA
Sbjct: 133 AQVEGHSARVVICTDHTRIILMERDGEVLMAQDSAPGVQIQAAPSSKPAMTLREIVEFAL 192
Query: 190 TTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKTAS 249
P+ EI+FI EA N A+E L+G YG +GK + + R L + + + + +++
Sbjct: 193 QVPLAEIDFIREAATMNQALADEGLQG-YGLRIGKILTEQVERKLLSDDLMTLAMRLSSA 251
Query: 250 ACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYIKQ 309
A DARM GAM+P MSNSGSGNQGI AT PVV A+ + + E+L RAL +SHL AIYIK
Sbjct: 252 ASDARMDGAMLPAMSNSGSGNQGIAATMPVVAAARFLQASDEQLTRALVMSHLVAIYIKT 311
Query: 310 NLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCALKV 369
LSALC A G+ IT+L+GG + I + + NMI +++G++CDGA +C++KV
Sbjct: 312 YQNKLSALCAASTAAMGAGAAITWLLGGQFEQISHCINNMIGDVSGIICDGAGSACSMKV 371
Query: 370 SSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMTSK 429
S+ S A+ S+++++ N HV ++EGI+ D+D++I NL L K M + DI +++IM +K
Sbjct: 372 STSTSAAVKSSLMAINNLHVPQSEGIVSDDVDQTIANLGRLSKQGMLDTDIEIINIMRAK 431
>gi|145297338|ref|YP_001140179.1| hypothetical protein ASA_0240 [Aeromonas salmonicida subsp.
salmonicida A449]
gi|142850110|gb|ABO88431.1| conserved hypothetical protein [Aeromonas salmonicida subsp.
salmonicida A449]
Length = 435
Score = 330 bits (845), Expect = 2e-88, Method: Composition-based stats.
Identities = 178/420 (42%), Positives = 264/420 (62%), Gaps = 5/420 (1%)
Query: 11 IDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTGM 70
I L+ R+VVPA+GCTEPM+VAL A +LLGQ P ++S ++S N+ KN MGVG+PGTGM
Sbjct: 14 ITLLKREVVPALGCTEPMSVALAAANCRKLLGQTPTRVSVWVSGNLFKNGMGVGVPGTGM 73
Query: 71 IGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEMT 130
IGLP+A ++G G + LEV+ LTP +EE K + ++D+K + + LY E+
Sbjct: 74 IGLPVAAAVGFTGGNPDAGLEVLNTLTPAQVEEAKALLPIIKVDVK---DVPDVLYAEVL 130
Query: 131 CEAGGKKATAIISKTHTNFVYEEADGKVLLDKQ-VPAEEGAPTDNKDIQLNLKMVWDFAT 189
E G A +I HT V E DG+VL+++ P + P + + L+ + FA
Sbjct: 131 AEVEGHSARVVICTDHTRIVLMELDGEVLMEQNSAPGVQIQPAKSDKPAMTLREIVAFAL 190
Query: 190 TTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKTAS 249
P+ EI+FI A N A+E L+G YG +GK + + R L + + + + +++
Sbjct: 191 EVPLAEIDFIGAAATMNQALADEGLQG-YGLRIGKILTEQVERKLLSDDLMTLAMRLSSA 249
Query: 250 ACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYIKQ 309
A DARM GAM+P MSNSGSGNQGI AT PVV A+ + + E+L RAL +SHL AIYIK
Sbjct: 250 ASDARMDGAMLPAMSNSGSGNQGIAATMPVVAAARFLKASDEQLTRALVMSHLVAIYIKT 309
Query: 310 NLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCALKV 369
+ LSALC A GS IT+L+GG + I + + NMI +++G++CDGA +C++KV
Sbjct: 310 HQNKLSALCAASTAAMGSGAAITWLLGGQFEQISHCINNMIGDVSGIICDGAGSACSMKV 369
Query: 370 SSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMTSK 429
S+ S A+ S+++++ N HV ++EGI+ D+D++I NL L K M + DI +++IM +K
Sbjct: 370 STSTSAAVKSSLMAINNLHVPQSEGIVSDDVDETIANLGRLSKLGMLDTDIEIINIMRAK 429
>gi|90409992|ref|ZP_01218009.1| hypothetical inner membrane protein [Photobacterium profundum 3TCK]
gi|90329345|gb|EAS45602.1| hypothetical inner membrane protein [Photobacterium profundum 3TCK]
Length = 428
Score = 321 bits (823), Expect = 5e-86, Method: Composition-based stats.
Identities = 188/430 (43%), Positives = 273/430 (63%), Gaps = 4/430 (0%)
Query: 2 LEKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAM 61
++ N+ ID++ R+VVPA+GCTEP++VAL A A E L EKI+A +S N++KN M
Sbjct: 1 MKTNLWNGFIDVVKREVVPALGCTEPVSVALAAAIAVEKLNGTVEKITALVSPNLMKNGM 60
Query: 62 GVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGI 121
GVG+PGTGM+GLPIA ++GA+ G+++ QLEV+K++TPE + K + + + + + +
Sbjct: 61 GVGVPGTGMVGLPIAAAVGAIAGEAKAQLEVLKNITPEDVAHAKTLIDAGNVHVGVAD-V 119
Query: 122 TEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTD--NKDIQL 179
LY ++T +G + I+ +HT+ + E +G + PA T N
Sbjct: 120 NNILYAKVTVISGDEFIAVTIADSHTHVMAIEENGITTYIAE-PANTATTTKKVNPFEGA 178
Query: 180 NLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSI 239
L+ ++DFA P+ +I FI A N +EE L G YG +G T R + RGL +
Sbjct: 179 LLEDIYDFALNAPLEDIRFIEHASELNDALSEEGLTGKYGLQIGATFQRNVDRGLLSGGL 238
Query: 240 YSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTL 299
+ +L +TA+A DARM GAM P MSNSGSGNQGI AT PVVV A + E+ +RAL L
Sbjct: 239 LTDVLRRTAAASDARMDGAMKPAMSNSGSGNQGIAATMPVVVVADFLKVDKEKTIRALML 298
Query: 300 SHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCD 359
SHLTAIYIK + LSALCG A G+ G+T+L+GGD N I A+ +MI ++ G++CD
Sbjct: 299 SHLTAIYIKSHQNKLSALCGATTASMGAVAGMTWLLGGDLNKINNAICSMIGDIAGIICD 358
Query: 360 GAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMD 419
GAK SCA+KVSS +A+ SA+++++ HVT EGI+ + D SIRNL++L +M + D
Sbjct: 359 GAKTSCAMKVSSSAGSAVKSALMALDGIHVTGNEGIVADNADASIRNLSALANGSMTQTD 418
Query: 420 IMVLDIMTSK 429
+ +LDIM +K
Sbjct: 419 VQILDIMVNK 428
>gi|153094355|gb|EDN75210.1| hypothetical protein MHA_2323 [Mannheimia haemolytica PHL213]
Length = 433
Score = 320 bits (821), Expect = 8e-86, Method: Composition-based stats.
Identities = 173/423 (40%), Positives = 271/423 (64%), Gaps = 4/423 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+ I+ ++ +VVPA+GCTEP+++AL +A A + LG PE+I A +S N++KN MGV +PG
Sbjct: 10 QSILHAVNHEVVPALGCTEPISLALASAIARQYLGHLPERIEAKVSPNLMKNGMGVAVPG 69
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEK-LY 126
T +GLP+A ++GA+ G+ + LEV+K++TPE +++ K ++ +++ + + E TE LY
Sbjct: 70 TATVGLPMAAAIGAIGGEPDGGLEVLKNITPEQVKQAKAMLNTDKVSVSIFE--TEHILY 127
Query: 127 IEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWD 186
E T + I++ HTN +Y E +G+ +L K T+ L+ K +++
Sbjct: 128 SEATLFHQANRVCVRIAEHHTNVIYIEKNGQTILSKPCAVNNDNTTE-IFASLSAKDIFN 186
Query: 187 FATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSK 246
F+ + +I FI +A + N ++E L+ +YG +G+T+ + + GL + + + I+ +
Sbjct: 187 FSMQIELEKIRFIEQAAQLNSALSQEGLRADYGLHIGRTLQKQIGLGLISDDLLNRIVIE 246
Query: 247 TASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIY 306
T SA DARMGGA +P MSNSGSGNQGI AT P+VV A+ T E+ +RAL LSHL AIY
Sbjct: 247 TTSASDARMGGASLPAMSNSGSGNQGITATMPIVVVARHLNVTDEQRIRALFLSHLMAIY 306
Query: 307 IKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCA 366
I L LSALC A GS GI +L+ G + NI A+ +MI +++G++CDGA SCA
Sbjct: 307 IHSKLPKLSALCAVTTAAMGSCAGIAWLLTGKFENISMAISSMIGDISGIICDGASNSCA 366
Query: 367 LKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIM 426
+KVS+ V+++ S ++SM+N VT EGI++ DID+SI NL S+ +M D V++IM
Sbjct: 367 MKVSTSVTSSYKSILMSMDNSQVTGNEGIVEHDIDRSINNLCSIASRSMQYTDRQVIEIM 426
Query: 427 TSK 429
+K
Sbjct: 427 VNK 429
>gi|54301920|ref|YP_131913.1| hypothetical inner membrane protein [Photobacterium profundum SS9]
gi|46915340|emb|CAG22113.1| hypothetical inner membrane protein [Photobacterium profundum SS9]
Length = 435
Score = 317 bits (813), Expect = 8e-85, Method: Composition-based stats.
Identities = 187/439 (42%), Positives = 271/439 (61%), Gaps = 22/439 (5%)
Query: 2 LEKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAM 61
++ N+ ID++ R+VVPA+GCTEP++VAL A A E L EKI+A +S N++KN M
Sbjct: 8 MKTNLWNGFIDVVKREVVPALGCTEPVSVALAAAIAVEKLNGTVEKITALVSPNLMKNGM 67
Query: 62 GVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGI 121
GVG+PGTGM+GLPIA ++GA+ G++ QLEV+K++TPE + K + + + + + +
Sbjct: 68 GVGVPGTGMVGLPIAAAVGAIAGEANAQLEVLKNITPEDVAHAKILIDAGNVHVGVAD-V 126
Query: 122 TEKLYIEMTCEAGGKKATAIISKTHTN-----------FVYEEADGKVLLDKQVPAEEGA 170
LY ++T +G + I+ +HT+ ++ E A+ + K P E
Sbjct: 127 NNILYAKVTVTSGDEFVAVTIADSHTHVMAIEENGITTYIAEPANTATSVKKTSPFEGAL 186
Query: 171 PTDNKDIQLNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPL 230
D ++DFA P+ EI FI A N +EE L G YG +G T R +
Sbjct: 187 LED----------IYDFALNAPLEEICFIEHAAELNDALSEEGLTGKYGLQIGATFQRNV 236
Query: 231 SRGLFGNSIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTP 290
RGL + + +L +TA+A DARM GAM P MSNSGSGNQGI AT PVVV A +
Sbjct: 237 DRGLLSGGLLTDVLRRTAAASDARMDGAMKPAMSNSGSGNQGIAATMPVVVVADFLKVDK 296
Query: 291 EELVRALTLSHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMI 350
E+ +RAL LSHLTAIYIK + LSALCG A G+ G+T+L+GGD N I A+ +MI
Sbjct: 297 EKTIRALMLSHLTAIYIKSHQNKLSALCGATTASMGAVAGMTWLLGGDLNKINNAICSMI 356
Query: 351 ANLTGMVCDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSL 410
++ G++CDGAK SCA+KVSS +A+ SA+++++ +VT EGI+ + D SIRNL++L
Sbjct: 357 GDIAGIICDGAKTSCAMKVSSSAGSAVKSALMALDGIYVTGNEGIVADNADASIRNLSAL 416
Query: 411 GKDAMNEMDIMVLDIMTSK 429
+M + D+ +LDIM +K
Sbjct: 417 ANGSMTQTDVQILDIMVNK 435
>gi|89076151|ref|ZP_01162509.1| hypothetical protein SKA34_09253 [Photobacterium sp. SKA34]
gi|89048161|gb|EAR53745.1| hypothetical protein SKA34_09253 [Photobacterium sp. SKA34]
Length = 429
Score = 308 bits (788), Expect = 5e-82, Method: Composition-based stats.
Identities = 170/428 (39%), Positives = 271/428 (63%), Gaps = 11/428 (2%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+Q ++++ V PA GCTEP++ A +A A +LLGQ PE++ +S N+ KN+MGV +PG
Sbjct: 6 QQYKNIINAVVKPAFGCTEPISAAYASAVAADLLGQAPEELEVKVSDNLFKNSMGVFVPG 65
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA + GA+ G LEV+ ++ PE +E+ +Q + ++ K+ + E +Y
Sbjct: 66 TGRIGLAIASAAGAVGGNPNGGLEVLVNIKPEHVEKAQQLIDAGKVKAGRKD-VEEFIYC 124
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLD-------KQVPAEEGAPTDNKDIQLN 180
E+ +AG A IS HT V + +G + D + P + + DI +
Sbjct: 125 EVIAKAGSDVAIVEISGGHTQVVKKTLNGNTVFDATTDLSKDEKPKSTASVCEGVDI--S 182
Query: 181 LKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIY 240
++ ++DFAT EI+FILEA+ N ++E L YG VG+T+ + +++GLFGN +
Sbjct: 183 IEGIYDFATNVEFEEIKFILEARNLNSALSDEGLNHAYGLEVGRTIQKSIAKGLFGNGLI 242
Query: 241 SHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLS 300
++I+ +TA+A DARMGGA +P MSN GSGNQGI AT PVV AK ++ E+L RAL +S
Sbjct: 243 NNIVMRTAAASDARMGGATLPAMSNFGSGNQGIAATMPVVEAAKHYGSSEEQLARALIMS 302
Query: 301 HLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDG 360
HL AIY+K LS CG V + ++ G+ YL GG++ +C+A++N +++ TGM CDG
Sbjct: 303 HLGAIYMKSFYPPLSPFCGNAVTSSAAAMGMVYLSGGNFEQMCFAIQNTLSDTTGMYCDG 362
Query: 361 AKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDI 420
AK +CA+KV S ++A++S ++++ENH ++GI+ KD++K+IRN+ + + M+ D
Sbjct: 363 AKSTCAMKVKSSTNSAVMSFLMAIENHE-AHSQGIVAKDVEKTIRNVGKMVRLGMSNTDA 421
Query: 421 MVLDIMTS 428
++DIM++
Sbjct: 422 TIIDIMSA 429
>gi|89076156|ref|ZP_01162514.1| hypothetical protein SKA34_09278 [Photobacterium sp. SKA34]
gi|89048166|gb|EAR53750.1| hypothetical protein SKA34_09278 [Photobacterium sp. SKA34]
Length = 429
Score = 305 bits (782), Expect = 3e-81, Method: Composition-based stats.
Identities = 165/426 (38%), Positives = 266/426 (62%), Gaps = 7/426 (1%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+Q ++++ V PA GCTEP++ A +A ELLG+ PE++ +S N+ KN+MGV +PG
Sbjct: 6 QQYKNIINTVVKPAFGCTEPISAAYASAVVAELLGRAPEELDVKVSDNLFKNSMGVFVPG 65
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGLPIA + GA+ G LEV+ ++TPE +E+ +Q + ++ K+ + E +Y
Sbjct: 66 TGRIGLPIASAAGAIGGNPNGGLEVLVNITPEHVEKAQQLIDAGKVTAGRKD-VEEFIYC 124
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLD-KQVPAEEGAPTDN----KDIQLNLK 182
E+T +AG A IS HT V + +GKV+ D + G P + + ++LK
Sbjct: 125 EVTAKAGDDVAIVEISGGHTQVVKKTLNGKVVFDLSSDTSSAGKPKSTASVCEGVDISLK 184
Query: 183 MVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSH 242
++DFAT EI F++EA + + E L+ YG +G+T + +S GL G+S+ +
Sbjct: 185 GIYDFATQVEFEEIAFMVEAYNLTVALSNEGLENEYGLQIGRTFKKHISSGLIGDSLANR 244
Query: 243 ILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHL 302
+ ++A+A DARMGGA +P MSN GSGNQGICA+ PV+ FA E + E + RAL +SHL
Sbjct: 245 AIMRSAAASDARMGGATLPAMSNYGSGNQGICASMPVIEFASHYEASEETMTRALIMSHL 304
Query: 303 TAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAK 362
A+Y+K LS CGC V + ++ +TYL GG + C AV+N +++ TGM CDGAK
Sbjct: 305 AAVYMKSFYPPLSPFCGCAVTSSAAAMAMTYLAGGTFEQSCMAVQNTLSDTTGMFCDGAK 364
Query: 363 PSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMV 422
+CA+KV S S+A++S +++++ ++ +GI+ D++K+IRN+ + ++ M D ++
Sbjct: 365 STCAMKVKSSTSSAVMSTLMALD-YNEARCQGIVADDVEKTIRNVGKMVREGMVNTDKVI 423
Query: 423 LDIMTS 428
++IM++
Sbjct: 424 IEIMSA 429
>gi|90578283|ref|ZP_01234094.1| hypothetical protein VAS14_14569 [Vibrio angustum S14]
gi|90441369|gb|EAS66549.1| hypothetical protein VAS14_14569 [Vibrio angustum S14]
Length = 429
Score = 305 bits (782), Expect = 3e-81, Method: Composition-based stats.
Identities = 166/426 (38%), Positives = 267/426 (62%), Gaps = 7/426 (1%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+Q ++++ V PA GCTEP++ A +A ELLG+ PE++ +S N+ KN+MGV +PG
Sbjct: 6 QQYKNIINSVVKPAFGCTEPISAAHASAVVAELLGRAPEELDVKVSDNLFKNSMGVFVPG 65
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGLPIA + GA+ G LEV+ ++TPE +E +Q + ++ K+ + E +Y
Sbjct: 66 TGRIGLPIASAAGAIGGNPNGGLEVLVNITPEHVETAQQLIDAGKVTAGRKD-VEEFIYC 124
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLD---KQVPAEEGAPTDN--KDIQLNLK 182
E+T +AG AT IS HT V + +GKV+ D AE+ T + + + ++LK
Sbjct: 125 EVTAKAGDDVATVEISGGHTQVVKKTLNGKVVFDLSSDTSSAEKPKSTASVCEGVDISLK 184
Query: 183 MVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSH 242
++DFAT EI F++EA + + E L+ YG +G+T + + GL GNS+ +
Sbjct: 185 GIYDFATQVEFEEIAFMVEAYNLTVALSNEGLENEYGLQIGRTFKKHIESGLVGNSLANR 244
Query: 243 ILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHL 302
+ ++A+A DARMGGA +P MSN GSGNQGICA+ PV+ FA E + E + RAL +SHL
Sbjct: 245 AIMRSAAASDARMGGATLPAMSNYGSGNQGICASMPVIEFASHYEASEETMTRALIMSHL 304
Query: 303 TAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAK 362
A+Y+K LS CGC V + ++ +TYL GG + C A++N +++ TGM CDGAK
Sbjct: 305 AAVYMKSFYPPLSPFCGCAVTSSAAAMAMTYLAGGTFEQSCMAIQNTLSDTTGMFCDGAK 364
Query: 363 PSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMV 422
+CA+KV S S+A++S +++++ ++ +GI+ D++K+IRN+ + ++ M D ++
Sbjct: 365 STCAMKVKSSTSSAVMSTLMALD-YNEARCQGIVADDVEKTIRNVGKMVREGMVNTDKVI 423
Query: 423 LDIMTS 428
++IM++
Sbjct: 424 IEIMSA 429
>gi|146329640|ref|YP_001209036.1| hypothetical protein DNO_0106 [Dichelobacter nodosus VCS1703A]
gi|146233110|gb|ABQ14088.1| conserved hypothetical protein [Dichelobacter nodosus VCS1703A]
Length = 433
Score = 304 bits (778), Expect = 9e-81, Method: Composition-based stats.
Identities = 164/426 (38%), Positives = 264/426 (61%), Gaps = 1/426 (0%)
Query: 4 KNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGV 63
+++ ++I+ + +VVPA+GCTEP+ VAL A L PEKI+ +S N++KNAMGV
Sbjct: 6 QDLEQEILAAVRHEVVPALGCTEPICVALAAAIGRTHLSSAPEKIAVRVSGNLMKNAMGV 65
Query: 64 GIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITE 123
+PGTGM+GL IA ++GA+ G +E L+ +K ++ E + K+ ++++ + + + +
Sbjct: 66 TVPGTGMVGLSIAAAVGAIGGDAEAGLQTLKSISAEDVAAAKKMLADDAVSVSMAD-TDH 124
Query: 124 KLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKM 183
Y E+T AG + A I+ THTN V E +GK + K V A + ++ K
Sbjct: 125 IFYAEVTLTAGAEIARVCIADTHTNVVLIEKNGKTIYQKPVAAADEKNPVAVFTKITAKD 184
Query: 184 VWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHI 243
V+DFATT + +I FI EA N ++E L+ +YG +G+++ + + GL + + + I
Sbjct: 185 VFDFATTVDLEKIRFIKEAATLNSALSQEGLRTDYGLHIGRSLQKNIGNGLLSDDLLNRI 244
Query: 244 LSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLT 303
+ +T +A DARMGGA +P MSNSGSGNQGI AT PVVV A+ +++ E+ +RAL LSH
Sbjct: 245 IIETGAASDARMGGASLPAMSNSGSGNQGIAATMPVVVVARHLKSSEEKTIRALFLSHAL 304
Query: 304 AIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKP 363
AIYI L LS+LC A GS+ +L + + + +MI +++GM+CDGA
Sbjct: 305 AIYIHAKLPTLSSLCAANTAAMGSAGACAWLFSERFEAVSDTICSMIGDVSGMICDGAAN 364
Query: 364 SCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVL 423
SCA+KV S +++ S +++++N VT EGI++ D+++SI NL +L +++M +D ++
Sbjct: 365 SCAMKVISSITSGYKSMLMALDNTRVTGFEGIVEHDLERSINNLCALARESMQHVDQQII 424
Query: 424 DIMTSK 429
DIM K
Sbjct: 425 DIMVQK 430
>gi|34497279|ref|NP_901494.1| hypothetical protein CV_1824 [Chromobacterium violaceum ATCC 12472]
gi|34103135|gb|AAQ59498.1| conserved hypothetical protein [Chromobacterium violaceum ATCC
12472]
Length = 430
Score = 303 bits (775), Expect = 2e-80, Method: Composition-based stats.
Identities = 177/432 (40%), Positives = 263/432 (60%), Gaps = 6/432 (1%)
Query: 1 MLEKNIR--EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILK 58
M E+ +R + + + ++VVPA+GCTEP+++AL A A LG+ PE+I A++SAN++K
Sbjct: 1 MSEREVRLWPEFVKALKQEVVPALGCTEPISLALAAALAARELGKAPERIDAWVSANLMK 60
Query: 59 NAMGVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLK 118
N MGV +PGTG +GLPIA ++GAL G + +LEV+K+LT E + GKQ +++ ++ + +
Sbjct: 61 NGMGVTVPGTGTVGLPIAAAVGALGGDPDAKLEVLKNLTVEQVAAGKQMLADGKVKLGVA 120
Query: 119 EGITEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI- 177
+ LY E G + A I+ HTN + E +G+V L ++ A + P + D+
Sbjct: 121 -AVPNILYAEACVWHGDECARVAIADAHTNVIKIELNGEVKLKRE--AADAKPVETYDLG 177
Query: 178 QLNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGN 237
+ V+DFA P++ I FI +A N A+E + G YG +G T+ R + GL
Sbjct: 178 DATARDVYDFAMRAPLDSIAFIHDAAVLNSALADEGMSGKYGLHIGATLQRQIEAGLLSE 237
Query: 238 SIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRAL 297
+ S+IL++T +A DARMGGA +P MSNSGSGNQGI AT PVV A+ + E L+RAL
Sbjct: 238 GLLSNILTRTTAASDARMGGATLPAMSNSGSGNQGIAATMPVVAVAEHVKADRETLIRAL 297
Query: 298 TLSHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMV 357
LSHL A+YI L LSALC A G++ G+ L+ G Y + A+ +MI +L GM+
Sbjct: 298 ALSHLIAVYIHTRLPKLSALCAVTTASMGAAAGMAQLLNGGYPAVSMAISSMIGDLAGMI 357
Query: 358 CDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNE 417
CDGA SCA+KVS+ + + +++++ VT EGI+ D+D SI NL L M +
Sbjct: 358 CDGASNSCAMKVSTSAGSGYKAVLMALDGTRVTGNEGIVAHDVDVSIANLGKLATQGMAQ 417
Query: 418 MDIMVLDIMTSK 429
D +L IM K
Sbjct: 418 TDTQILQIMMDK 429
>gi|90409323|ref|ZP_01217416.1| hypothetical protein PCNPT3_10163 [Psychromonas sp. CNPT3]
gi|90309570|gb|EAS37762.1| hypothetical protein PCNPT3_10163 [Psychromonas sp. CNPT3]
Length = 424
Score = 300 bits (769), Expect = 1e-79, Method: Composition-based stats.
Identities = 162/421 (38%), Positives = 259/421 (61%), Gaps = 2/421 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+Q I++++ V PA+GCTEP+AVA +A A ++L +PEKI+ +S N+ KN+MGV +PG
Sbjct: 6 QQYINILNSVVKPALGCTEPIAVAYASAVAMQMLSAEPEKITVHVSDNLYKNSMGVFVPG 65
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA S+GA G L+V++ + +E+ + + N + + + + E +Y
Sbjct: 66 TGRIGLHIAASVGAFAGDPLADLQVLEKINTRDVEKAQALIDTNNVSVS-RIDVDEFIYC 124
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWDF 187
+ +AG A +S HT + ++ + +++ K + + + +N++ ++DF
Sbjct: 125 LVEIKAGQDIAIVEVSGAHTQVICKKLNNEIVFSKSNTDKTSTASICDGVDINIESIYDF 184
Query: 188 ATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKT 247
A P +IEFILE+ + N A+E L +YG +G+ + + + G + S IL +T
Sbjct: 185 AMHAPFADIEFILESAKLNSALAQEGLLNDYGLKIGRIIQKSIKDGFMSEGLVSDILMQT 244
Query: 248 ASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYI 307
++A DARMGGA +P MSN GSGNQGI AT PVV+ AK ++ +EL RAL LSHL+AIYI
Sbjct: 245 SAASDARMGGASLPAMSNYGSGNQGIAATLPVVLMAKHCKHGDDELARALILSHLSAIYI 304
Query: 308 KQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCAL 367
K LSA CG + ++ + YLMGGDY CYA++N++ + TGM+CDGAK +CA+
Sbjct: 305 KSYYPPLSAFCGNTATSSAAAMAMVYLMGGDYKQSCYAIQNVLGDCTGMICDGAKSTCAM 364
Query: 368 KVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
KV + S+AI S++L++ N T+ +GI+ +++ SI+NL L M D ++ IM+
Sbjct: 365 KVKTSTSSAIYSSLLAI-NSTGTKDQGIVASNVEDSIKNLGKLISQGMPNTDTQIIKIMS 423
Query: 428 S 428
+
Sbjct: 424 A 424
>gi|156975351|ref|YP_001446258.1| hypothetical protein VIBHAR_03081 [Vibrio harveyi ATCC BAA-1116]
gi|156526945|gb|ABU72031.1| hypothetical protein VIBHAR_03081 [Vibrio harveyi ATCC BAA-1116]
Length = 425
Score = 297 bits (760), Expect = 9e-79, Method: Composition-based stats.
Identities = 170/423 (40%), Positives = 259/423 (61%), Gaps = 7/423 (1%)
Query: 9 QIIDLMHRQVVPAVGCTEPMAVALCTARATELLG-QKPEKISAFLSANILKNAMGVGIPG 67
Q I ++ + V PA+GCTEP+A A A A LG Q P+ I +S N+ KN+MGV +PG
Sbjct: 7 QYIQIIKQVVKPALGCTEPIAAAYAAAVAKRELGCQTPDTIEVRVSDNLFKNSMGVYVPG 66
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA S+GAL G +LEV+ + + + +Q + E R+ + + E ++
Sbjct: 67 TGKIGLKIAASVGALAGDPNAELEVLAKINQDDVTAAQQLIDEERVSVA-RIDTQELIFC 125
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDN--KDIQLNLKMVW 185
+T AG + IS HTN + +G V D P ++ T + + + +++K ++
Sbjct: 126 SVTMSAGEDTVSVTISGGHTNIIQITRNGVVTFD--APQQQRVATGSVCEGVDISIKQIY 183
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
DFA P ++I+FIL+A N + A+E + YG +G+T+ + +GL GN + S I
Sbjct: 184 DFALQAPFDDIKFILQAAELNSSLAQEGIDRGYGLEIGRTLKGNIEQGLLGNDLMSRIQM 243
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
T++A DARMGGA +P MSN GSGNQGI AT PVV+ A+ +++ E L RAL +SHL AI
Sbjct: 244 MTSAASDARMGGATLPAMSNFGSGNQGIAATMPVVIAAEAFQSSEEHLARALIMSHLGAI 303
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YIK LSA CG V +S + YL GG + CYA++N+I++ +GMVCDGAK SC
Sbjct: 304 YIKSYYPPLSAFCGNTVTSAAASMALVYLAGGTFEQSCYAIQNVISDSSGMVCDGAKSSC 363
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KV + +TA+ S +++M NH V + +GII +++++IRN+ S+ + M D ++DI
Sbjct: 364 AMKVCTSATTAVRSYLMAMGNHSV-KNQGIIGDEVEQTIRNVGSMVRLGMPYTDKSIIDI 422
Query: 426 MTS 428
M++
Sbjct: 423 MSA 425
>gi|149189825|ref|ZP_01868105.1| hypothetical inner membrane protein [Vibrio shilonii AK1]
gi|148836311|gb|EDL53268.1| hypothetical inner membrane protein [Vibrio shilonii AK1]
Length = 428
Score = 296 bits (759), Expect = 1e-78, Method: Composition-based stats.
Identities = 176/420 (41%), Positives = 264/420 (62%), Gaps = 6/420 (1%)
Query: 13 LMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTGMIG 72
L+ ++VVPA+GCTEP++VAL +A A + L + IS +S N++KN MGVG+PGTGM+G
Sbjct: 12 LIKKEVVPALGCTEPVSVALASAIAAQKLAGDIQMISVHVSPNLMKNGMGVGVPGTGMVG 71
Query: 73 LPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEMTCE 132
L IA ++GA+ G+ E LEV+K++TPE + + K S +++ + +Y ++
Sbjct: 72 LSIAAAIGAVAGEPEAGLEVLKNITPEDVTQAKSLQSAVTVEVA---DVLNIIYAKVVVS 128
Query: 133 AGGKKATAIISKTHTNFVYEEADG-KVLLDKQVPAEEGA--PTDNKDIQLNLKMVWDFAT 189
G + I+ +HTN V E DG ++ Q E A PT++ + ++DFA
Sbjct: 129 DGEHRVAVTIADSHTNVVSIEEDGLTTYINTQEQEAETASEPTESPMKDACAQDIFDFAL 188
Query: 190 TTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKTAS 249
P+ I FI +A + N + + E L +YG VG T R + +GL S+ + +L +T++
Sbjct: 189 NAPLESISFIEQAYQLNNDLSHEGLSQDYGLQVGATFKRNVDKGLLAKSLMTDVLCRTSA 248
Query: 250 ACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYIKQ 309
A DARM GAM P MSNSGSGNQGI AT PVVV A+ E ++RAL LSHL A+YIK
Sbjct: 249 ASDARMDGAMKPAMSNSGSGNQGISATMPVVVVAEHVHADRETMIRALMLSHLMAVYIKS 308
Query: 310 NLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCALKV 369
LSALCG A G++ G+T+L+GG++ I A+ +MI ++ GM+CDGAK SCA+KV
Sbjct: 309 YQHKLSALCGATTASMGAAAGMTWLLGGNFKQISDAICSMIGDVAGMICDGAKTSCAMKV 368
Query: 370 SSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMTSK 429
SS A + +++++ VT EGI+ +++D+SIRNL+SL AM + D+ +L+IM +K
Sbjct: 369 SSSAGAAFKATLMALDGIRVTGNEGIVSENVDESIRNLSSLANGAMTQTDVQILEIMVNK 428
>gi|148323198|gb|EDK88448.1| hypothetical protein FNP_0644 [Fusobacterium nucleatum subsp.
polymorphum ATCC 10953]
Length = 427
Score = 296 bits (758), Expect = 2e-78, Method: Composition-based stats.
Identities = 168/422 (39%), Positives = 252/422 (59%), Gaps = 3/422 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
E+++ ++ ++V A GCTEP+A++ A+A +LG P K+ FLS NI+KN V IP
Sbjct: 6 EKVLKILEEEIVAAEGCTEPIALSYAAAKARRILGTVPNKVDVFLSGNIIKNVKSVTIPN 65
Query: 68 T-GMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLY 126
+ GMIG+ AI++G + G E +L VI D+T E +EE K+++ +N I + G KLY
Sbjct: 66 SEGMIGIEPAIAMGMIAGDDEKELMVISDVTHEQVEEVKKFLDKNIIQTHVYPGDI-KLY 124
Query: 127 IEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWD 186
I + G I THTN +GK+LL + + + L +K ++D
Sbjct: 125 IRLEISTGEDNVLLEIKHTHTNITRILKNGKILLSQICNDGDFNSSLTDRKVLTVKFIYD 184
Query: 187 FATTTPINEIEFILE-AKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
A I+ I+ I E YN AEE LKG YG +GK + + RG++GN I + S
Sbjct: 185 LAKIIDIDLIKPIFEKVITYNSAIAEEGLKGKYGVNIGKMILDNIERGIYGNDIRNKAAS 244
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
++ DARM G +PVM+ SGSGNQG+ A+ P++ FA E T EEL+R L +SHLT I
Sbjct: 245 YASAGSDARMSGCGLPVMTTSGSGNQGMTASLPIIKFAAEKNLTEEELIRGLFVSHLTTI 304
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
++K N+G LSA CG I A +G + +T+L GG Y +C A+ N++ NL+G++CDGAK SC
Sbjct: 305 HVKTNVGRLSAYCGAICAASGVAAALTFLHGGSYEMVCDAITNILGNLSGVICDGAKASC 364
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+K+SSG+ +A + ML++ + +GI+ DI+++IRN+ L + M D +L I
Sbjct: 365 AMKISSGIYSAFDATMLALHKDVLKSGDGIVGVDIEETIRNVGELAQCGMKGTDETILGI 424
Query: 426 MT 427
MT
Sbjct: 425 MT 426
>gi|153833860|ref|ZP_01986527.1| conserved hypothetical protein [Vibrio harveyi HY01]
gi|148869802|gb|EDL68776.1| conserved hypothetical protein [Vibrio harveyi HY01]
Length = 425
Score = 296 bits (758), Expect = 2e-78, Method: Composition-based stats.
Identities = 168/423 (39%), Positives = 260/423 (61%), Gaps = 7/423 (1%)
Query: 9 QIIDLMHRQVVPAVGCTEPMAVALCTARATELLG-QKPEKISAFLSANILKNAMGVGIPG 67
Q I ++ + V PA+GCTEP+A A A A LG Q P+ I +S N+ KN+MGV +PG
Sbjct: 7 QYIQIIKQVVKPALGCTEPIAAAYAAAVAKRELGCQTPDMIEVRVSDNLFKNSMGVYVPG 66
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA S+GAL G +LEV+ + + + +Q + E R+ + + E ++
Sbjct: 67 TGKIGLKIAASVGALAGDPNAELEVLAKINQDDVTAAQQLIDEERVSVA-RIDTQEFIFC 125
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDN--KDIQLNLKMVW 185
+T AG + IS HTN + +G+V D P ++ T + + + ++++ ++
Sbjct: 126 SVTMSAGEDTVSVTISGGHTNIIQITRNGQVTFD--APQQQRVATGSVCEGVDISIQQIY 183
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
DFA P ++I+FIL+A N + A+E + YG +G+T+ + +GL GN + S I
Sbjct: 184 DFALQAPFDDIKFILQAAELNSSLAQEGIDRGYGLEIGRTLKGNIEQGLLGNDLMSRIQM 243
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
T++A DARMGGA +P MSN GSGNQGI AT PVV+ A+ ++ E+L RAL +SHL AI
Sbjct: 244 MTSAASDARMGGATLPAMSNFGSGNQGIAATVPVVIAAEAFQSDEEQLARALIMSHLGAI 303
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YIK LSA CG V +S + YL GG + CYA++N+I++ +GMVCDGAK SC
Sbjct: 304 YIKSYYPPLSAFCGNTVTSAAASMALVYLAGGTFEQSCYAIQNVISDSSGMVCDGAKSSC 363
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KV + +TA+ S +++M NH V + +GI+ +++++IRN+ S+ + M D ++DI
Sbjct: 364 AMKVCTSSTTAVRSYLMAMGNHSV-KNQGIVGDEVEQTIRNVGSMVRLGMPYTDKSIIDI 422
Query: 426 MTS 428
M++
Sbjct: 423 MSA 425
>gi|118754581|ref|ZP_01602377.1| protein of unknown function DUF1063 [Shewanella pealeana ATCC
700345]
gi|118693922|gb|EAW00143.1| protein of unknown function DUF1063 [Shewanella pealeana ATCC
700345]
Length = 433
Score = 295 bits (756), Expect = 3e-78, Method: Composition-based stats.
Identities = 177/438 (40%), Positives = 275/438 (62%), Gaps = 15/438 (3%)
Query: 2 LEKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPE----KISAFLSANIL 57
+++++ ++ + R VVPA+GCTEP++VAL A A LG K + K+ +SAN++
Sbjct: 1 MKQHLWPLFLEAIKRDVVPALGCTEPISVALAAAIAIGELGIKNQASNVKMDVSVSANLM 60
Query: 58 KNAMGVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKL 117
KN MGVGIPGTGM+GLPIA ++GA+ G S LEV+K+LT ++ K + ++ + +
Sbjct: 61 KNGMGVGIPGTGMVGLPIAAAIGAVAGDSNAGLEVLKNLTDSDVQAAKLMLDNGQVTVGV 120
Query: 118 KEGITEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI 177
+ + LY ++T + A+ I+ +HT + E +G+ L PA E ++K
Sbjct: 121 AD-VANVLYAKVTVYYQQQTASVTIADSHTKVIAIEKNGEQCL----PAPEVESVNDKSN 175
Query: 178 QLN------LKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLS 231
+ N L+ ++DFA P+++I FI+++K N + E L G+YG +G T+ +
Sbjct: 176 KANPFTEARLQDIYDFAMHAPLDDIGFIMQSKSLNDALSIEGLSGHYGLKIGATLVKNQE 235
Query: 232 RGLFGNSIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPE 291
+GL + + +L++TA A DARM GAM+P MSNSGSGNQGI AT PVV A+ +++
Sbjct: 236 KGLLSGGLLTEVLARTAGASDARMDGAMMPAMSNSGSGNQGIAATMPVVACAEFLKSSET 295
Query: 292 ELVRALTLSHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIA 351
+ +RAL LSHLTAIYIK LSALCG A GS+ GITYL+ G+ + A+ +MI
Sbjct: 296 QTIRALMLSHLTAIYIKSYQNKLSALCGATTAAMGSAAGITYLLDGEIEQVSAAICSMIG 355
Query: 352 NLTGMVCDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLG 411
+++G++CDGAK +CA+KVSS A+ SA+++++ VT EGI+ D+D++I NL +L
Sbjct: 356 DVSGVICDGAKTACAMKVSSSAGAAVKSALMAIDGIRVTGTEGIVADDVDQTISNLATLA 415
Query: 412 KDAMNEMDIMVLDIMTSK 429
AM + D+ +L+IM K
Sbjct: 416 NGAMTQTDVQILEIMMHK 433
>gi|116185206|ref|ZP_01475121.1| hypothetical protein VEx2w_02002275 [Vibrio sp. Ex25]
gi|151939073|gb|EDN57905.1| conserved hypothetical protein [Vibrio sp. Ex25]
Length = 425
Score = 295 bits (756), Expect = 3e-78, Method: Composition-based stats.
Identities = 168/421 (39%), Positives = 255/421 (60%), Gaps = 3/421 (0%)
Query: 9 QIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKP-EKISAFLSANILKNAMGVGIPG 67
Q I ++ + V PA+GCTEP+A A A A + LG + I +S N+ KN+MGV +PG
Sbjct: 7 QYIQIIKQVVKPALGCTEPIAAAYAAAVARKELGTSDIDAIEVRVSDNLFKNSMGVFVPG 66
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA S+GAL G +LEV+ + + + +Q + E R+ + + E +Y
Sbjct: 67 TGKIGLKIAASVGALAGDPTAELEVLARINEQDVAAAQQLIDEERVTVA-RMDTQEFIYC 125
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWDF 187
+T +G + IS HTN + +G V+ D + + + +++K +++F
Sbjct: 126 SVTLTSGDDVVSVTISGGHTNIIQITRNGDVIFDAPPQQRVATASVCEGVDISIKQIYEF 185
Query: 188 ATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKT 247
AT P EI+FIL+A N A+E + YG +G+T+ + +GL GN + S I T
Sbjct: 186 ATQAPFEEIKFILQAAELNTLLAQEGIDRGYGLEIGRTLKGNIEQGLLGNDLMSRIQMMT 245
Query: 248 ASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYI 307
++A DARMGGA +P MSN GSGNQGI AT PVVV A+ +N E+L RAL +SHL AIYI
Sbjct: 246 SAASDARMGGATLPAMSNFGSGNQGIAATMPVVVAAEAFQNDEEQLARALIMSHLGAIYI 305
Query: 308 KQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCAL 367
K LSA CG V +S + YL GG + CYA++N+I++ +GMVCDGAK SCA+
Sbjct: 306 KSYYPPLSAFCGNTVTSAAASMALVYLAGGTFEQSCYAIQNVISDSSGMVCDGAKSSCAM 365
Query: 368 KVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
KV + +TA+ S +++M NH V + +GI+ ++++++IRN+ S+ + M D ++DIM+
Sbjct: 366 KVCTSSTTAVRSYLMAMGNHSV-KNQGIVGEEVEQTIRNVGSMVRFGMPYTDKSIIDIMS 424
Query: 428 S 428
+
Sbjct: 425 A 425
>gi|123440826|ref|YP_001004817.1| hypothetical protein YE0448 [Yersinia enterocolitica subsp.
enterocolitica 8081]
gi|122087787|emb|CAL10573.1| conserved hypothetical protein [Yersinia enterocolitica subsp.
enterocolitica 8081]
Length = 438
Score = 295 bits (754), Expect = 5e-78, Method: Composition-based stats.
Identities = 180/425 (42%), Positives = 262/425 (61%), Gaps = 4/425 (0%)
Query: 11 IDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTGM 70
I ++ R V PAVGCTEP+A+AL +A A L K E+I A +S N++KN MGV +PGTGM
Sbjct: 14 IHVIQRDVQPAVGCTEPIALALASAIAASYLPAKAERIEARVSPNLMKNGMGVTVPGTGM 73
Query: 71 IGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEMT 130
+GLPIA ++GAL G + LEV+K+L+ + + E K + + + ++ G E L+ E T
Sbjct: 74 VGLPIAAAVGALGGDPDGGLEVLKNLSSQQVVEAKAMLDRGDVRVDMQAG-DEILFAEAT 132
Query: 131 CEAGGKKATAIISKTHTNFVYEEADGKVL--LDKQVPAEEGAPTDNKDI-QLNLKMVWDF 187
G + A I+ HT V +G VL L + P+E+ + + Q + V+ F
Sbjct: 133 LYHGDQWACVTIAGGHTQVVRIVINGNVLFELAPESPSEQVVCHAHDCLKQATARQVYQF 192
Query: 188 ATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKT 247
AT P +I FIL+A + N ++E L GNYG +G ++ R RGL + S I+ ++
Sbjct: 193 ATQVPFEQIAFILQAAKLNGALSQEGLTGNYGLHIGASLMRQRGRGLLVKDLLSDIMIRS 252
Query: 248 ASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYI 307
A+A DARMGGA++P MSNSGSGNQGI AT PVVV A+ + EEL RAL LSHL AIYI
Sbjct: 253 AAASDARMGGALLPAMSNSGSGNQGIAATMPVVVVAEHVGASEEELARALILSHLMAIYI 312
Query: 308 KQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCAL 367
L LSALC A G++ G+ +L+ Y + A+ +MI +++G++CDGA SCA+
Sbjct: 313 HNQLPTLSALCAATTAAMGAAAGMAWLLEPRYEPVALAIGSMIGDISGIICDGAANSCAM 372
Query: 368 KVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
KVS+ VS A + ++++++ VT EGI+ D+D+SI NL +L AM + D +++IM
Sbjct: 373 KVSTSVSAAYKAVLMALDSSGVTGNEGIVADDVDQSIANLCALACGAMRQTDSQIIEIMA 432
Query: 428 SKGAC 432
K C
Sbjct: 433 HKCHC 437
>gi|28898947|ref|NP_798552.1| hypothetical protein VP2173 [Vibrio parahaemolyticus RIMD 2210633]
gi|28807166|dbj|BAC60436.1| hypothetical protein [Vibrio parahaemolyticus RIMD 2210633]
Length = 425
Score = 294 bits (753), Expect = 6e-78, Method: Composition-based stats.
Identities = 168/421 (39%), Positives = 255/421 (60%), Gaps = 3/421 (0%)
Query: 9 QIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKP-EKISAFLSANILKNAMGVGIPG 67
Q I ++ + V PA+GCTEP+A A A A + LG + I +S N+ KN+MGV +PG
Sbjct: 7 QYIQIIKQVVKPALGCTEPIAAAYAAAVARKELGTSDIDAIEVRVSDNLFKNSMGVFVPG 66
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA S+GAL G +LEV+ + + + +Q + E R+ + + E +Y
Sbjct: 67 TGKIGLKIAASVGALAGDPTAELEVLARINEQDVAAAQQLIDEERVTVA-RMDTQEFIYC 125
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWDF 187
+T +G + IS HTN + +G V+ D + + + +++K +++F
Sbjct: 126 SVTLTSGDDVVSVTISGGHTNIIQIMRNGDVIFDAPPQQRVATASVCEGVDISIKQIYEF 185
Query: 188 ATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKT 247
AT P EI+FIL+A N A+E + YG +G+T+ + +GL GN + S I T
Sbjct: 186 ATQAPFEEIKFILQAAELNTLLAQEGIDRGYGLEIGRTLKGNIEQGLLGNDLMSRIQMMT 245
Query: 248 ASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYI 307
++A DARMGGA +P MSN GSGNQGI AT PVVV A+ +N E+L RAL +SHL AIYI
Sbjct: 246 SAASDARMGGATLPAMSNFGSGNQGIAATMPVVVAAEVFQNDEEQLARALIMSHLGAIYI 305
Query: 308 KQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCAL 367
K LSA CG V +S + YL GG + CYA++N+I++ +GMVCDGAK SCA+
Sbjct: 306 KSYYPPLSAFCGNTVTSAAASMALVYLAGGTFEQSCYAIQNVISDSSGMVCDGAKSSCAM 365
Query: 368 KVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
KV + +TA+ S +++M NH V + +GI+ ++++++IRN+ S+ + M D ++DIM+
Sbjct: 366 KVCTSSTTAVRSYLMAMGNHSV-KNQGIVGEEVEQTIRNVGSMVRFGMPYTDKSIIDIMS 424
Query: 428 S 428
+
Sbjct: 425 A 425
>gi|153838698|ref|ZP_01991365.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810]
gi|149747918|gb|EDM58790.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810]
Length = 425
Score = 293 bits (749), Expect = 2e-77, Method: Composition-based stats.
Identities = 167/421 (39%), Positives = 254/421 (60%), Gaps = 3/421 (0%)
Query: 9 QIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKP-EKISAFLSANILKNAMGVGIPG 67
Q I ++ + V PA+GCTEP+A A A A + LG + I +S N+ KN+MGV +PG
Sbjct: 7 QYIQIIKQVVKPALGCTEPIAAAYAAAVARKELGTSDIDAIEVRVSDNLFKNSMGVFVPG 66
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA S+GAL G +LEV+ + + + +Q + E R+ + + E +Y
Sbjct: 67 TGKIGLKIAASVGALAGDPTAELEVLARINKQDVAAAQQLIDEERVTVA-RMDTQEFIYC 125
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWDF 187
+T +G + IS HTN + + V+ D + + + +++K +++F
Sbjct: 126 SVTLTSGDDVVSVTISGGHTNIIQITRNDDVIFDAPPQQRVATASVCEGVDISIKQIYEF 185
Query: 188 ATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKT 247
AT P EI+FIL+A N A+E + YG +G+T+ + +GL GN + S I T
Sbjct: 186 ATQAPFEEIKFILQAAELNTLLAQEGIDRGYGLEIGRTLKGNIEQGLLGNDLMSRIQMMT 245
Query: 248 ASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYI 307
++A DARMGGA +P MSN GSGNQGI AT PVVV A+ +N E+L RAL +SHL AIYI
Sbjct: 246 SAASDARMGGATLPAMSNFGSGNQGIAATMPVVVAAEVFQNDEEQLARALIMSHLGAIYI 305
Query: 308 KQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCAL 367
K LSA CG V +S + YL GG + CYA++N+I++ +GMVCDGAK SCA+
Sbjct: 306 KSYYPPLSAFCGNTVTSAAASMALVYLAGGTFEQSCYAIQNVISDSSGMVCDGAKSSCAM 365
Query: 368 KVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
KV + +TA+ S +++M NH V + +GI+ ++++++IRN+ S+ + M D ++DIM+
Sbjct: 366 KVCTSSTTAVRSYLMAMGNHSV-KNQGIVGEEVEQTIRNVGSMVRFGMPYTDKSIIDIMS 424
Query: 428 S 428
+
Sbjct: 425 A 425
>gi|75240858|ref|ZP_00724756.1| COG3681: Uncharacterized conserved protein [Escherichia coli F11]
gi|91212536|ref|YP_542522.1| hypothetical protein UTI89_C3545 [Escherichia coli UTI89]
gi|110643356|ref|YP_671086.1| hypothetical protein YhaM [Escherichia coli 536]
gi|117625413|ref|YP_858736.1| hypothetical protein APECO1_3314 [Escherichia coli APEC O1]
gi|91074110|gb|ABE08991.1| conserved hypothetical protein [Escherichia coli UTI89]
gi|110344948|gb|ABG71185.1| hypothetical protein YhaM [Escherichia coli 536]
gi|115514537|gb|ABJ02612.1| conserved hypothetical protein [Escherichia coli APEC O1]
Length = 436
Score = 291 bits (745), Expect = 5e-77, Method: Composition-based stats.
Identities = 169/425 (39%), Positives = 255/425 (60%), Gaps = 2/425 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
++ I + +V PA+GCTEP+++AL A A L E++ A++S N++KN +GV +PG
Sbjct: 11 QRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPG 70
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TGM+GLPIA +LGAL G + LEV+KD T + + + K ++ ++ +K++E E L+
Sbjct: 71 TGMVGLPIAAALGALGGNANAGLEVLKDATAQAISDAKALLAAGKVSVKIQEPCDEILFS 130
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLNLKMVW 185
G K A I HTN V+ E V+ +Q EG + + L +
Sbjct: 131 RAKVWNGEKWACVTIVGGHTNIVHIETHNGVVFTQQACVTEGEQESPLTVLSRTTLAEIL 190
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
F P I FIL++ + N ++E L GN+G +G T+++ +RGL + S I+
Sbjct: 191 KFVNEVPFAAIRFILDSAKLNCALSQEGLSGNWGLHIGATLEKQCARGLLAKDLSSSIVI 250
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+T++A DARMGGA +P MSNSGSGNQGI AT PVVV A+ E L RAL LSHL+AI
Sbjct: 251 RTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAI 310
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YI L LSALC A G++ G+ +L+ G Y I A+ +MI +++GM+CDGA SC
Sbjct: 311 YIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSC 370
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A + ++++++ VT EGI+ D+++SI NL +L +M + D +++I
Sbjct: 371 AMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEI 430
Query: 426 MTSKG 430
M SK
Sbjct: 431 MASKA 435
>gi|26249696|ref|NP_755736.1| hypothetical protein c3867 [Escherichia coli CFT073]
gi|26110124|gb|AAN82310.1|AE016767_70 Conserved hypothetical protein [Escherichia coli CFT073]
Length = 436
Score = 291 bits (745), Expect = 6e-77, Method: Composition-based stats.
Identities = 169/425 (39%), Positives = 255/425 (60%), Gaps = 2/425 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
++ I + +V PA+GCTEP+++AL A A L E++ A++S N++KN +GV +PG
Sbjct: 11 QRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPG 70
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TGM+GLPIA +LGAL G + LEV+KD T + + + K ++ ++ +K++E E L+
Sbjct: 71 TGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCDEILFS 130
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLNLKMVW 185
G K A I HTN V+ E V+ +Q EG + + L +
Sbjct: 131 RAKVWNGEKWACVTIVGGHTNIVHIETHNGVVFTQQACVTEGEQESPLTVLSRTTLAEIL 190
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
F P I FIL++ + N ++E L GN+G +G T+++ +RGL + S I+
Sbjct: 191 KFVNEVPFAAIRFILDSAKLNCALSQEGLSGNWGLHIGATLEKQCARGLLAKDLSSSIVI 250
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+T++A DARMGGA +P MSNSGSGNQGI AT PVVV A+ E L RAL LSHL+AI
Sbjct: 251 RTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAI 310
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YI L LSALC A G++ G+ +L+ G Y I A+ +MI +++GM+CDGA SC
Sbjct: 311 YIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSC 370
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A + ++++++ VT EGI+ D+++SI NL +L +M + D +++I
Sbjct: 371 AMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEI 430
Query: 426 MTSKG 430
M SK
Sbjct: 431 MASKA 435
>gi|82545369|ref|YP_409316.1| hypothetical protein SBO_2975 [Shigella boydii Sb227]
gi|81246780|gb|ABB67488.1| Uncharacterized conserved protein [Shigella boydii Sb227]
Length = 436
Score = 289 bits (739), Expect = 3e-76, Method: Composition-based stats.
Identities = 168/425 (39%), Positives = 253/425 (59%), Gaps = 2/425 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
++ I + +V PA+GCTEP+++AL A A L E++ A++S N++KN +GV +PG
Sbjct: 11 QRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPG 70
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TGM+GLPIA +LGAL G + LEV+KD T + + + K ++ ++ +K++E E L+
Sbjct: 71 TGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFS 130
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLNLKMVW 185
G K A I HTN V+ E V+ +Q EG + + L +
Sbjct: 131 RAKVWNGEKWACVTIVGGHTNIVHIETHNSVVFTQQACVAEGEQESPLTVLSRTTLAEIL 190
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
F P I FIL++ + N ++E L G +G +G T+++ RGL + S I+
Sbjct: 191 KFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVI 250
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+T++A DARMGGA +P MSNSGSGNQGI AT PVVV A+ E L RAL LSHL+AI
Sbjct: 251 RTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAI 310
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YI L LSALC A G++ G+ +L+ G Y I A+ +MI +++GM+CDGA SC
Sbjct: 311 YIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSC 370
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A + ++++++ VT EGI+ D+++SI NL +L +M + D +++I
Sbjct: 371 AMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEI 430
Query: 426 MTSKG 430
M SK
Sbjct: 431 MASKA 435
>gi|75197064|ref|ZP_00707134.1| COG3681: Uncharacterized conserved protein [Escherichia coli HS]
gi|75513325|ref|ZP_00735749.1| COG3681: Uncharacterized conserved protein [Escherichia coli 53638]
gi|83585752|ref|ZP_00924393.1| COG3681: Uncharacterized conserved protein [Escherichia coli 101-1]
gi|124525979|ref|ZP_01697984.1| protein of unknown function DUF1063 [Escherichia coli B]
gi|124502083|gb|EAY49543.1| protein of unknown function DUF1063 [Escherichia coli B]
gi|157068272|gb|ABV07527.1| conserved hypothetical protein [Escherichia coli HS]
Length = 436
Score = 289 bits (739), Expect = 3e-76, Method: Composition-based stats.
Identities = 168/425 (39%), Positives = 254/425 (59%), Gaps = 2/425 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
++ I + +V PA+GCTEP+++AL A A L E++ A++S N++KN +GV +PG
Sbjct: 11 QRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPG 70
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TGM+GLPIA +LGAL G + LEV+KD T + + + K ++ ++ +K++E E L+
Sbjct: 71 TGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFS 130
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLNLKMVW 185
G K A I HTN V+ E V+ +Q EG + + L +
Sbjct: 131 RAKVWNGEKWACVTIVGGHTNIVHIETHDGVVFTQQACVAEGEQESPLTVLSRTTLAEIL 190
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
F P I FIL++ + N ++E L G +G +G T+++ +RGL + S I+
Sbjct: 191 KFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCARGLLAKDLSSSIVI 250
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+T++A DARMGGA +P MSNSGSGNQGI AT PVVV A+ E L RAL LSHL+AI
Sbjct: 251 RTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAI 310
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YI L LSALC A G++ G+ +L+ G Y I A+ +MI +++GM+CDGA SC
Sbjct: 311 YIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSC 370
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A + ++++++ VT EGI+ D+++SI NL +L +M + D +++I
Sbjct: 371 AMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEI 430
Query: 426 MTSKG 430
M SK
Sbjct: 431 MASKA 435
>gi|15803649|ref|NP_289682.1| hypothetical protein Z4462 [Escherichia coli O157:H7 EDL933]
gi|15833244|ref|NP_312017.1| hypothetical protein ECs3990 [Escherichia coli O157:H7 str. Sakai]
gi|12517701|gb|AAG58241.1|AE005540_6 orf; Unknown function [Escherichia coli O157:H7 EDL933]
gi|13363463|dbj|BAB37413.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
Length = 436
Score = 288 bits (738), Expect = 4e-76, Method: Composition-based stats.
Identities = 168/425 (39%), Positives = 253/425 (59%), Gaps = 2/425 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
++ I + +V PA+GCTEP+++AL A A L E++ A++S N++KN +GV +PG
Sbjct: 11 QRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPG 70
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TGM+GLPIA +LGAL G + LEV+KD T + + + K ++ ++ +K++E E L+
Sbjct: 71 TGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFS 130
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLNLKMVW 185
G K A I HTN V+ E V+ +Q EG + + L +
Sbjct: 131 RAKVWNGEKWACVTIVGGHTNIVHIETHDGVVFTQQACVAEGEQESPLSVLSRTTLAEIL 190
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
F P I FIL++ + N ++E L G +G +G T+++ RGL + S I+
Sbjct: 191 KFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVI 250
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+T++A DARMGGA +P MSNSGSGNQGI AT PVVV A+ E L RAL LSHL+AI
Sbjct: 251 RTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAI 310
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YI L LSALC A G++ G+ +L+ G Y I A+ +MI +++GM+CDGA SC
Sbjct: 311 YIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSC 370
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A + ++++++ VT EGI+ D+++SI NL +L +M + D +++I
Sbjct: 371 AMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEI 430
Query: 426 MTSKG 430
M SK
Sbjct: 431 MASKA 435
>gi|75187921|ref|ZP_00701188.1| COG3681: Uncharacterized conserved protein [Escherichia coli
E24377A]
gi|75229972|ref|ZP_00716485.1| COG3681: Uncharacterized conserved protein [Escherichia coli B7A]
gi|75260043|ref|ZP_00731321.1| COG3681: Uncharacterized conserved protein [Escherichia coli E22]
gi|157080979|gb|ABV20687.1| conserved hypothetical protein [Escherichia coli E24377A]
Length = 436
Score = 288 bits (738), Expect = 4e-76, Method: Composition-based stats.
Identities = 168/425 (39%), Positives = 253/425 (59%), Gaps = 2/425 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
++ I + +V PA+GCTEP+++AL A A L E++ A++S N++KN +GV +PG
Sbjct: 11 QRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPG 70
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TGM+GLPIA +LGAL G + LEV+KD T + + + K ++ ++ +K++E E L+
Sbjct: 71 TGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFS 130
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLNLKMVW 185
G K A I HTN V+ E V+ +Q EG + + L +
Sbjct: 131 RAKVWNGEKWACVTIVGGHTNIVHIETHNGVVFTQQACVAEGEQESPLTVLSRTTLAEIL 190
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
F P I FIL++ + N ++E L G +G +G T+++ RGL + S I+
Sbjct: 191 KFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVI 250
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+T++A DARMGGA +P MSNSGSGNQGI AT PVVV A+ E L RAL LSHL+AI
Sbjct: 251 RTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAI 310
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YI L LSALC A G++ G+ +L+ G Y I A+ +MI +++GM+CDGA SC
Sbjct: 311 YIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSC 370
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A + ++++++ VT EGI+ D+++SI NL +L +M + D +++I
Sbjct: 371 AMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEI 430
Query: 426 MTSKG 430
M SK
Sbjct: 431 MASKA 435
>gi|75239319|ref|ZP_00723290.1| COG3681: Uncharacterized conserved protein [Escherichia coli
E110019]
Length = 436
Score = 288 bits (737), Expect = 5e-76, Method: Composition-based stats.
Identities = 168/425 (39%), Positives = 253/425 (59%), Gaps = 2/425 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
++ I + +V PA+GCTEP+++AL A A L E++ A++S N++KN +GV +PG
Sbjct: 11 QRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPG 70
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TGM+GLPIA +LGAL G + LEV+KD T + + + K ++ ++ +K++E E L+
Sbjct: 71 TGMVGLPIAAALGALGGNANAGLEVLKDSTAQAIADAKALLAAGKVSVKIQEPCNEILFS 130
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLNLKMVW 185
G K A I HTN V+ E V+ +Q EG + + L +
Sbjct: 131 RAKVWHGEKWACVTIVGGHTNIVHIETHNGVVFTQQACVAEGEQESPLTVLSRTTLAEIL 190
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
F P I FIL++ + N ++E L G +G +G T+++ RGL + S I+
Sbjct: 191 KFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVI 250
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+T++A DARMGGA +P MSNSGSGNQGI AT PVVV A+ E L RAL LSHL+AI
Sbjct: 251 RTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAI 310
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YI L LSALC A G++ G+ +L+ G Y I A+ +MI +++GM+CDGA SC
Sbjct: 311 YIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSC 370
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A + ++++++ VT EGI+ D+++SI NL +L +M + D +++I
Sbjct: 371 AMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEI 430
Query: 426 MTSKG 430
M SK
Sbjct: 431 MASKA 435
>gi|49176312|ref|YP_026202.1| conserved protein [Escherichia coli K12]
gi|89109877|ref|AP_003657.1| hypothetical protein [Escherichia coli W3110]
gi|54042334|sp|P42626|YHAM_ECOLI Uncharacterized protein yhaM
gi|48994923|gb|AAT48167.1| conserved protein [Escherichia coli K12]
gi|85675908|dbj|BAE77158.1| conserved hypothetical protein [Escherichia coli W3110]
Length = 436
Score = 288 bits (736), Expect = 6e-76, Method: Composition-based stats.
Identities = 168/425 (39%), Positives = 253/425 (59%), Gaps = 2/425 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
++ I + +V PA+GCTEP+++AL A A L E++ A++S N++KN +GV +PG
Sbjct: 11 QRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPG 70
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TGM+GLPIA +LGAL G + LEV+KD T + + + K ++ ++ +K++E E L+
Sbjct: 71 TGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCDEILFS 130
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLNLKMVW 185
G K A I HTN V+ E V+ +Q EG + + L +
Sbjct: 131 RAKVWNGEKWACVTIVGGHTNIVHIETHDGVVFTQQACVAEGEQESPLTVLSRTTLAEIL 190
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
F P I FIL++ + N ++E L G +G +G T+++ RGL + S I+
Sbjct: 191 KFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVI 250
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+T++A DARMGGA +P MSNSGSGNQGI AT PVVV A+ E L RAL LSHL+AI
Sbjct: 251 RTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAI 310
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YI L LSALC A G++ G+ +L+ G Y I A+ +MI +++GM+CDGA SC
Sbjct: 311 YIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSC 370
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A + ++++++ VT EGI+ D+++SI NL +L +M + D +++I
Sbjct: 371 AMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEI 430
Query: 426 MTSKG 430
M SK
Sbjct: 431 MASKA 435
>gi|16766537|ref|NP_462152.1| putative inner membrane protein [Salmonella typhimurium LT2]
gi|16421796|gb|AAL22111.1| putative inner membrane protein [Salmonella typhimurium LT2]
Length = 436
Score = 287 bits (735), Expect = 9e-76, Method: Composition-based stats.
Identities = 172/435 (39%), Positives = 259/435 (59%), Gaps = 5/435 (1%)
Query: 1 MLEKNIR---EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANIL 57
M E I + I + +V PA+GCTEP+++AL A A L E+I A++S N++
Sbjct: 1 MFESKINPLWQSFILAVQEEVKPALGCTEPISLALAAAAAAAELNGTVERIDAWVSPNLM 60
Query: 58 KNAMGVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKL 117
KN MGV +PGTGM+GLPIA +LGAL G ++ LEV+KD + + + + K ++ + + L
Sbjct: 61 KNGMGVTVPGTGMVGLPIAAALGALGGDAKAGLEVLKDASAKAVADAKAMLAAGHVAVML 120
Query: 118 KEGITEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI 177
+E + L+ +G A I HTN V E D V+ + A+E T +
Sbjct: 121 QEPCNDILFSRAKVYSGDSWACVTIVGDHTNIVRIETDKGVVFTQADNAQEEEKTSPLGV 180
Query: 178 --QLNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLF 235
+L+ + F P + I FIL+A R N ++E L+G++G +G T+ + RGL
Sbjct: 181 LSHTSLEEILAFVNAVPFDAIRFILDAARLNGALSQEGLRGSWGLHIGSTLAKQCDRGLL 240
Query: 236 GNSIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVR 295
+ + IL +T++A DARMGGA +P MSNSGSGNQGI AT PV+V A+ E L R
Sbjct: 241 AKDLSTAILIRTSAASDARMGGATLPAMSNSGSGNQGITATVPVMVVAEHVGADDERLAR 300
Query: 296 ALTLSHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTG 355
AL LSHL+AIYI L LSALC A G++ G+ +L+ G Y+ I A+ +MI +++G
Sbjct: 301 ALMLSHLSAIYIHHQLPRLSALCAATTAAMGAAAGMAWLIDGRYDTIAMAISSMIGDVSG 360
Query: 356 MVCDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAM 415
M+CDGA SCA+KVS+ S A + ++++++ VT EGI+ ++++SI NL SL +M
Sbjct: 361 MICDGASNSCAMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHNVEQSISNLCSLACRSM 420
Query: 416 NEMDIMVLDIMTSKG 430
+ D +++IM SK
Sbjct: 421 QQTDKQIIEIMASKA 435
>gi|74313659|ref|YP_312078.1| hypothetical protein SSON_3267 [Shigella sonnei Ss046]
gi|73857136|gb|AAZ89843.1| Uncharacterized conserved protein [Shigella sonnei Ss046]
Length = 436
Score = 287 bits (734), Expect = 1e-75, Method: Composition-based stats.
Identities = 168/425 (39%), Positives = 252/425 (59%), Gaps = 2/425 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
++ I + +V PA+GCTEP+++AL A A L E++ A++S N++KN +GV +PG
Sbjct: 11 QRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPG 70
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TGM+GLPIA +LGAL G + LEV+KD T + + + K ++ ++ +K++E E L+
Sbjct: 71 TGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFS 130
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLNLKMVW 185
G K A I HTN V E V+ +Q EG + + L +
Sbjct: 131 RAKVWNGEKWACVTIVGGHTNIVRIETHNGVVFTQQACVAEGEQESPLTVLSRTTLAEIL 190
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
F P I FIL++ + N ++E L G +G +G T+++ RGL + S I+
Sbjct: 191 KFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVI 250
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+T++A DARMGGA +P MSNSGSGNQGI AT PVVV A+ E L RAL LSHL+AI
Sbjct: 251 RTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAI 310
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YI L LSALC A G++ G+ +L+ G Y I A+ +MI +++GM+CDGA SC
Sbjct: 311 YIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSC 370
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A + ++++++ VT EGI+ D+++SI NL +L +M + D +++I
Sbjct: 371 AMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEI 430
Query: 426 MTSKG 430
M SK
Sbjct: 431 MASKA 435
>gi|75211064|ref|ZP_00711177.1| COG3681: Uncharacterized conserved protein [Escherichia coli B171]
Length = 436
Score = 287 bits (734), Expect = 1e-75, Method: Composition-based stats.
Identities = 167/425 (39%), Positives = 252/425 (59%), Gaps = 2/425 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
++ I + +V PA+GCTEP+++AL A A L E++ A++S N++KN +GV +PG
Sbjct: 11 QRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPG 70
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TGM+GLPIA +LGAL G + LEV+KD T + + + K ++ ++ +K++E E L+
Sbjct: 71 TGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFS 130
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLNLKMVW 185
G K I HTN V+ E V+ +Q EG + + L +
Sbjct: 131 RAKVWNGEKWVCVTIVGGHTNIVHIETHNGVVFTQQACVAEGEQESPLTVLSRTTLAEIL 190
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
F P I FIL++ + N ++E L G +G +G T+++ RGL + S I+
Sbjct: 191 KFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVI 250
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+T++A DARMGGA +P MSNSGSGNQGI AT PVVV A+ E L RAL LSHL+AI
Sbjct: 251 RTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAI 310
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YI L LSALC A G++ G+ +L+ G Y I A+ +MI +++GM+CDGA SC
Sbjct: 311 YIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSC 370
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A + ++++++ VT EGI+ D+++SI NL +L +M + D +++I
Sbjct: 371 AMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEI 430
Query: 426 MTSKG 430
M SK
Sbjct: 431 MASKA 435
>gi|110806992|ref|YP_690512.1| hypothetical protein SFV_3151 [Shigella flexneri 5 str. 8401]
gi|110616540|gb|ABF05207.1| Uncharacterized conserved protein [Shigella flexneri 5 str. 8401]
Length = 436
Score = 286 bits (731), Expect = 3e-75, Method: Composition-based stats.
Identities = 167/425 (39%), Positives = 251/425 (59%), Gaps = 2/425 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
++ I + +V PA+GCTEP+++AL A A L E++ A++S N++KN +GV +PG
Sbjct: 11 QRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPG 70
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TGM+GLPIA +LGAL G + LEV+KD T + + + K ++ ++ +K++E E L+
Sbjct: 71 TGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFS 130
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLNLKMVW 185
G K A I HTN V+ E V+ Q EG + + L +
Sbjct: 131 RAKVWNGEKWACVTIVGGHTNIVHIETHNGVVFTHQACVAEGEQESPLTVLSRTTLAEIL 190
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
F P I FIL++ + N ++E L G +G +G T+++ RGL + S I+
Sbjct: 191 KFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVI 250
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+T++A DARMGGA +P MSNSGSGNQGI A PVVV A+ E L RAL LSHL+AI
Sbjct: 251 RTSAASDARMGGATLPAMSNSGSGNQGITAIMPVVVVAEHFGADDERLARALMLSHLSAI 310
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YI L LSALC A G++ G+ +L+ G Y I A+ +MI +++GM+CDGA SC
Sbjct: 311 YIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSC 370
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A + ++++++ VT EGI+ D+++SI NL +L +M + D +++I
Sbjct: 371 AMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEI 430
Query: 426 MTSKG 430
M SK
Sbjct: 431 MASKA 435
>gi|75178731|ref|ZP_00698767.1| COG3681: Uncharacterized conserved protein [Shigella boydii BS512]
Length = 436
Score = 285 bits (729), Expect = 4e-75, Method: Composition-based stats.
Identities = 167/425 (39%), Positives = 252/425 (59%), Gaps = 2/425 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
++ I + +V PA+GCTEP+++AL A A L E++ A++S N++KN +GV +PG
Sbjct: 11 QRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPG 70
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TGM+GLPIA +LGAL G + LEV+KD T + + + K ++ ++ +K++E E L+
Sbjct: 71 TGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFS 130
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLNLKMVW 185
G K A I HTN V+ E V+ +Q EG + + L +
Sbjct: 131 RAKVWNGEKWACVTIVGGHTNIVHIETHNSVVFTQQACVAEGEQESPLTVLSRTTLAEIL 190
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
F P I FIL++ + N ++E L G +G +G T+++ RGL + S I+
Sbjct: 191 KFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVI 250
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+T++A DARMGGA +P MSNSGSGNQGI AT VVV A+ E L RAL LSHL+AI
Sbjct: 251 RTSAASDARMGGATLPAMSNSGSGNQGITATMSVVVVAEHFGADDERLARALMLSHLSAI 310
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YI L LSALC A G++ G+ +L+ G Y I A+ +MI +++GM+CDGA SC
Sbjct: 311 YIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSC 370
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A + ++++++ VT EGI+ D+++SI NL +L +M + D +++I
Sbjct: 371 AMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEI 430
Query: 426 MTSKG 430
M SK
Sbjct: 431 MASKA 435
>gi|16762010|ref|NP_457627.1| hypothetical protein STY3418 [Salmonella enterica subsp. enterica
serovar Typhi str. CT18]
gi|29143497|ref|NP_806839.1| hypothetical protein t3158 [Salmonella enterica subsp. enterica
serovar Typhi Ty2]
gi|25512334|pir||AD0896 conserved hypothetical protein STY3418 [imported] - Salmonella
enterica subsp. enterica serovar Typhi (strain CT18)
gi|16504313|emb|CAD07762.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhi]
gi|29139131|gb|AAO70699.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Typhi Ty2]
Length = 436
Score = 285 bits (729), Expect = 4e-75, Method: Composition-based stats.
Identities = 172/435 (39%), Positives = 259/435 (59%), Gaps = 5/435 (1%)
Query: 1 MLEKNIR---EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANIL 57
M E I + I + +V PA+GCTEP+++AL A A L E+I A++S N++
Sbjct: 1 MFESKINPLWQSFILAVQEEVKPALGCTEPISLALAAAAAAAELDGTVERIDAWVSPNLM 60
Query: 58 KNAMGVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKL 117
KN MGV +PGTGM+GLPIA +LGAL G ++ LEV+KD + + + + K ++ + + L
Sbjct: 61 KNGMGVTVPGTGMVGLPIAAALGALGGDAKAGLEVLKDASAKAVADAKAMLAAGHVAVML 120
Query: 118 KEGITEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI 177
+E + L+ +G A I HTN V E D V+ + A+E T +
Sbjct: 121 QEPCNDILFSRAKVYSGDSWACVTIVGDHTNIVRIETDKGVVFTQADNAQEEEKTSPLGV 180
Query: 178 --QLNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLF 235
+L+ + F P + I FIL+A R N ++E L+G++G +G T+ + RGL
Sbjct: 181 LSHTSLEEILAFVNAVPFDAIRFILDAARLNGALSQEGLRGSWGLHIGSTLAKQCDRGLL 240
Query: 236 GNSIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVR 295
+ + IL +T++A DARMGGA +P MSNSGSGNQGI AT PV+V A+ E L R
Sbjct: 241 AKDLSTAILIRTSAASDARMGGATLPAMSNSGSGNQGITATVPVMVVAEHVGADDECLAR 300
Query: 296 ALTLSHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTG 355
AL LSHL+AIYI L LSALC A G++ G+ +L+ G Y+ I A+ +MI +++G
Sbjct: 301 ALMLSHLSAIYIHHQLPRLSALCAATTAAMGAAAGMAWLIDGRYDTIAMAISSMIGDVSG 360
Query: 356 MVCDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAM 415
M+CDGA SCA+KVS+ S A + ++++++ VT EGI+ ++++SI NL SL +M
Sbjct: 361 MICDGASNSCAMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHNVEQSISNLCSLACRSM 420
Query: 416 NEMDIMVLDIMTSKG 430
+ D +++IM SK
Sbjct: 421 QQTDKQIIEIMASKA 435
>gi|62181754|ref|YP_218171.1| putative inner membrane protein [Salmonella enterica subsp.
enterica serovar Choleraesuis str. SC-B67]
gi|62129387|gb|AAX67090.1| putative inner membrane protein [Salmonella enterica subsp.
enterica serovar Choleraesuis str. SC-B67]
Length = 436
Score = 285 bits (728), Expect = 5e-75, Method: Composition-based stats.
Identities = 171/435 (39%), Positives = 258/435 (59%), Gaps = 5/435 (1%)
Query: 1 MLEKNIR---EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANIL 57
M E I + I + +V PA+GCTEP+++AL A A L E+I A++S N++
Sbjct: 1 MFESKINPLWQSFILAVQEEVKPALGCTEPISLALAAAAAAAELDGTVERIDAWVSPNLM 60
Query: 58 KNAMGVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKL 117
KN MGV +PGTGM+GLPIA +LG L G ++ LEV+KD + + + + K ++ + + L
Sbjct: 61 KNGMGVTVPGTGMVGLPIAAALGVLGGDAKAGLEVLKDASAKAVADAKAMLAAGHVAVML 120
Query: 118 KEGITEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI 177
+E + L+ +G A I HTN V E D V+ + A+E T +
Sbjct: 121 QEPCNDILFSRAKVYSGDSWACVTIVGDHTNIVRIETDKGVVFTQADNAQEEEKTSPLGV 180
Query: 178 --QLNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLF 235
+L+ + F P + I FIL+A R N ++E L+G++G +G T+ + RGL
Sbjct: 181 LSHTSLEEILAFVNAVPFDAIRFILDAARLNGALSQEGLRGSWGLHIGSTLAKQCDRGLL 240
Query: 236 GNSIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVR 295
+ + IL +T++A DARMGGA +P MSNSGSGNQGI AT PV+V A+ E L R
Sbjct: 241 AKDLSTAILIRTSAASDARMGGATLPAMSNSGSGNQGITATVPVMVVAEHVGADDERLAR 300
Query: 296 ALTLSHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTG 355
AL LSHL+AIYI L LSALC A G++ G+ +L+ G Y+ I A+ +MI +++G
Sbjct: 301 ALMLSHLSAIYIHHQLPRLSALCAATTAAMGAAAGMAWLIDGRYDTIAMAISSMIGDVSG 360
Query: 356 MVCDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAM 415
M+CDGA SCA+KVS+ S A + ++++++ VT EGI+ ++++SI NL SL +M
Sbjct: 361 MICDGASNSCAMKVSTSASAAWKALLMALDDTAVTGNEGIVAHNVEQSIANLCSLACRSM 420
Query: 416 NEMDIMVLDIMTSKG 430
+ D +++IM SK
Sbjct: 421 QQTDKQIIEIMASKA 435
>gi|82778439|ref|YP_404788.1| hypothetical protein SDY_3300 [Shigella dysenteriae Sd197]
gi|81242587|gb|ABB63297.1| Uncharacterized conserved protein [Shigella dysenteriae Sd197]
Length = 436
Score = 284 bits (727), Expect = 7e-75, Method: Composition-based stats.
Identities = 166/425 (39%), Positives = 252/425 (59%), Gaps = 2/425 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
++ I + +V PA+GCTEP+++AL + A L E++ A++S N++KN +GV +PG
Sbjct: 11 QRYILAVQEEVKPALGCTEPISLALAASVAAAELEGPVERVEAWVSPNLMKNGLGVTVPG 70
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TGM+GLPIA +LGAL G + LEV+KD T + + + K ++ ++ +K++E E L+
Sbjct: 71 TGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFS 130
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLNLKMVW 185
G K A I HTN V+ E V+ +Q EG + + L +
Sbjct: 131 RAKVWNGEKWACVTIVGGHTNIVHIETHDGVVFTQQACVAEGEQESPLSVLSRTTLAEIL 190
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
F I FIL++ + N ++E L GN+G +G T+++ RGL + S I+
Sbjct: 191 KFVNEVQFAAIRFILDSAKLNCALSQEGLSGNWGLHIGATLEKQCERGLLAKDLSSSIVI 250
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+T++A DARMGGA +P MSNSGSGNQGI AT PVVV A+ E L RAL L HL+AI
Sbjct: 251 RTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLLHLSAI 310
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YI L LSALC A G++ G+ +L+ G Y I A+ +MI +++GM+CDGA SC
Sbjct: 311 YIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSC 370
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A + ++++++ VT EGI+ D+++SI NL +L +M + D +++I
Sbjct: 371 AMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEI 430
Query: 426 MTSKG 430
M SK
Sbjct: 431 MASKA 435
>gi|56415176|ref|YP_152251.1| hypothetical protein SPA3107 [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
gi|56129433|gb|AAV78939.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
serovar Paratyphi A str. ATCC 9150]
Length = 436
Score = 283 bits (725), Expect = 1e-74, Method: Composition-based stats.
Identities = 172/435 (39%), Positives = 258/435 (59%), Gaps = 5/435 (1%)
Query: 1 MLEKNIR---EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANIL 57
M E I + I + +V PA+GCTEP+++AL A A L E+I A++S N++
Sbjct: 1 MFESKINPLWQSFILAVQEEVKPALGCTEPISLALAAAAAAAELDGTVERIDAWVSPNLM 60
Query: 58 KNAMGVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKL 117
KN MGV +PGTGM+GLPIA +LGAL G ++ LEV+KD + + + + K ++ + + L
Sbjct: 61 KNGMGVTVPGTGMVGLPIAAALGALGGDAKAGLEVLKDASAKAVADAKAMLAAGHVAVML 120
Query: 118 KEGITEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI 177
+E + L+ +G A I HTN V E D V+ + A+E T +
Sbjct: 121 QEPCNDILFSRAKVYSGDSWACVTIVGDHTNIVRIETDKGVVFTQADNAQEEEKTSPLGV 180
Query: 178 --QLNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLF 235
+L+ + F P + I FIL+A R N ++E L+G++G +G T+ + RGL
Sbjct: 181 LSHTSLEEILAFVNAVPFDAIRFILDAARLNGALSQEGLRGSWGLHIGSTLVKQCDRGLL 240
Query: 236 GNSIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVR 295
+ + IL +T++A DARMGGA +P MSNSGSGNQGI AT PV+V A+ E L R
Sbjct: 241 AKDLSTAILIRTSAASDARMGGATLPAMSNSGSGNQGITATVPVMVVAEHVGADDECLAR 300
Query: 296 ALTLSHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTG 355
AL LSHL+AIYI L LSALC A G++ G+ +L+ G Y+ I A+ +MI +++G
Sbjct: 301 ALMLSHLSAIYIHHQLPRLSALCAATTAAMGAAAGMAWLIDGRYDTIAMAISSMIGDVSG 360
Query: 356 MVCDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAM 415
M+CDGA SCA+KVS S A + ++++++ VT EGI+ ++++SI NL SL +M
Sbjct: 361 MICDGASNSCAMKVSISASAAWKAVLMALDDTAVTGNEGIVAHNVEQSISNLCSLACRSM 420
Query: 416 NEMDIMVLDIMTSKG 430
+ D +++IM SK
Sbjct: 421 QQTDKQIIEIMASKA 435
>gi|157085887|gb|ABV15565.1| hypothetical protein CKO_04510 [Citrobacter koseri ATCC BAA-895]
Length = 436
Score = 282 bits (721), Expect = 3e-74, Method: Composition-based stats.
Identities = 168/430 (39%), Positives = 257/430 (59%), Gaps = 2/430 (0%)
Query: 3 EKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMG 62
E + ++ I + +V PA+GCTEP+++AL A A L + E++ A++S N++KN +G
Sbjct: 6 ENPLWQRFILAVQEEVKPALGCTEPVSLALAAAVAAAELDGEVERVDAWVSPNLMKNGLG 65
Query: 63 VGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGIT 122
V +PGTGM+GLPIA +LGAL G + LEV+K+ + + + K ++ ++ + L+E
Sbjct: 66 VTVPGTGMVGLPIAAALGALGGDASAGLEVLKNASSGAIADAKAMLAAGKVSVMLQEPCD 125
Query: 123 EKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLN 180
+ L+ +G A I HTN V E V+ + + A + + +
Sbjct: 126 DILFSRAKVYSGDAWACVTIVGGHTNIVRIETHTGVIFTQTESVQGEAQESPLSVLSKTS 185
Query: 181 LKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIY 240
L+ + F P I FILEA R N ++E L+G +G +G T+ + +RGL + +
Sbjct: 186 LEEILAFVNAVPFVSIRFILEAARLNGALSQEGLRGTWGLHIGATLQKQCARGLLADDLS 245
Query: 241 SHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLS 300
+ IL +T++A DARMGGA +P MSNSGSGNQGI AT PV+V A+ E L RAL LS
Sbjct: 246 TAILIRTSAASDARMGGATLPAMSNSGSGNQGITATVPVMVVAEHVGADDERLARALMLS 305
Query: 301 HLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDG 360
HL+AIYI L LSALC A G++ G+ +LM G YN I A+ +MI +++GM+CDG
Sbjct: 306 HLSAIYIHHQLPRLSALCAATTAAMGAAAGMAWLMDGRYNTIAMAISSMIGDVSGMICDG 365
Query: 361 AKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDI 420
A SCA+KVS+ S A + ++++++ VT EGI+ ++++SI NL +L AM + D
Sbjct: 366 ASNSCAMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHNVEQSISNLCALACHAMQQTDR 425
Query: 421 MVLDIMTSKG 430
V++IM SK
Sbjct: 426 QVIEIMASKA 435
>gi|53729252|ref|ZP_00133782.2| COG3681: Uncharacterized conserved protein [Actinobacillus
pleuropneumoniae serovar 1 str. 4074]
gi|126209069|ref|YP_001054294.1| hypothetical protein APL_1605 [Actinobacillus pleuropneumoniae L20]
gi|126097861|gb|ABN74689.1| hypothetical protein APL_1605 [Actinobacillus pleuropneumoniae L20]
Length = 432
Score = 281 bits (718), Expect = 7e-74, Method: Composition-based stats.
Identities = 171/429 (39%), Positives = 267/429 (62%), Gaps = 4/429 (0%)
Query: 2 LEKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAM 61
+ + + II + ++VVPA+GCTEP+++AL A A + LG P++I A +S N++KN M
Sbjct: 3 FKSELEQAIIATVQQEVVPALGCTEPVSLALAAAVARQYLGALPDRIEAKVSPNLMKNGM 62
Query: 62 GVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGI 121
GV +PGTG +GL +A ++GA+ G LEV+K +T E + KQ +++++I++ + +
Sbjct: 63 GVTVPGTGTVGLTMAAAIGAIGGDPNGGLEVLKHITNEQVALAKQMINDHKIEVSISD-- 120
Query: 122 TEK-LYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLN 180
TE LY E T ++ I+ HTN +Y E +G++L K E +N LN
Sbjct: 121 TEHILYSEATLFNTDQQVKVRIAAHHTNVIYIEKNGELLFSKPCVVES-ENAENVFANLN 179
Query: 181 LKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIY 240
K ++DF+ + +I FI +A N ++E L +YG +G+T+ + + +GL + +
Sbjct: 180 AKDIYDFSLNVELEKIRFIQQAAILNSALSQEGLNQDYGLHIGRTLQKQIGKGLISDDLL 239
Query: 241 SHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLS 300
+ I+ +T +A DARMGGA +P MSNSGSGNQGI AT PVVV A+ E+L+RAL LS
Sbjct: 240 NRIVIETTAASDARMGGANLPAMSNSGSGNQGITATMPVVVVARHVAAGEEQLIRALFLS 299
Query: 301 HLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDG 360
HL AIYI L LSALC A GS G+ +L+ G + I A+ +MI +++G++CDG
Sbjct: 300 HLMAIYIHSKLPKLSALCAVTTAAMGSCAGVAWLLTGKFEAISMAISSMIGDISGIICDG 359
Query: 361 AKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDI 420
A SCA+KVS+ VS++ S ++++++ VT EGI++ ID+SI NL ++ +M D
Sbjct: 360 AANSCAMKVSTSVSSSYKSILMALDDTQVTGNEGIVEHQIDRSINNLCAIASRSMQYTDR 419
Query: 421 MVLDIMTSK 429
V++IM SK
Sbjct: 420 QVIEIMVSK 428
>gi|59711248|ref|YP_204024.1| hypothetical protein VF0641 [Vibrio fischeri ES114]
gi|59479349|gb|AAW85136.1| hypothetical protein VF0641 [Vibrio fischeri ES114]
Length = 425
Score = 280 bits (715), Expect = 2e-73, Method: Composition-based stats.
Identities = 171/427 (40%), Positives = 256/427 (59%), Gaps = 7/427 (1%)
Query: 5 NIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVG 64
+I +Q ID++ V PA+GCTEP+ A + AT++LG KPE I F+S N+ KN+MGV
Sbjct: 3 SIWKQYIDILQGVVKPALGCTEPICAAYAASVATQMLGSKPETIDVFVSDNLYKNSMGVF 62
Query: 65 IPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEK 124
+P TG +GL IA + GA+ G + LEV+ +T E ++E ++ + + ++ +E E
Sbjct: 63 VPRTGRVGLAIAAATGAIGGNPDAGLEVLAKITEEEVDEAQKLIDNGCVVVQ-RETTDEF 121
Query: 125 LYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVL--LDKQVPAEEGAPT-DNKDIQLNL 181
+Y + + A IS HT + + D V+ LD +P A D DI ++
Sbjct: 122 IYCRVIAKNAVHNAEVTISGGHTLIIEKRLDDNVIFTLDSSLPKTSTASICDGVDITIS- 180
Query: 182 KMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYS 241
++DFAT ++I+FILEAK N+ A+E L YG VG+T + + +GL S+ S
Sbjct: 181 -SIYDFATQAEFDDIKFILEAKELNIALAQEGLNNPYGLEVGRTYQKNIEKGLLAKSLDS 239
Query: 242 HILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSH 301
IL T++A DARMGGA +P MSN GSGNQGI AT PVV A E+L RA +SH
Sbjct: 240 DILIYTSAASDARMGGATLPAMSNYGSGNQGIAATIPVVKMADFFNADDEKLARAFIMSH 299
Query: 302 LTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGA 361
L AIYIK + LSA CG V +S + YL GG + C A++N I++ +GM+CDGA
Sbjct: 300 LGAIYIKSHYPPLSAFCGNAVTSAAASMAMVYLAGGTFEQSCSAIQNTISDTSGMICDGA 359
Query: 362 KPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIM 421
K +CA+KV S +A+ SA+L++ +H VT+ +G+I D++K+I+N+ + M +D
Sbjct: 360 KSTCAMKVGSSAQSAMKSALLALNDHCVTK-QGVIADDVEKTIKNIGRMITTGMPNIDHE 418
Query: 422 VLDIMTS 428
+++IM S
Sbjct: 419 IIEIMAS 425
>gi|148976149|ref|ZP_01812892.1| hypothetical protein VSWAT3_07586 [Vibrionales bacterium SWAT-3]
gi|145964544|gb|EDK29798.1| hypothetical protein VSWAT3_07586 [Vibrionales bacterium SWAT-3]
Length = 424
Score = 278 bits (710), Expect = 7e-73, Method: Composition-based stats.
Identities = 163/428 (38%), Positives = 256/428 (59%), Gaps = 16/428 (3%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+Q D+++ V PA+GCTEP++ A + A ++L P+++ ++S N+ KN+MGV + G
Sbjct: 6 QQYKDILNTVVKPALGCTEPISAAYACSVAAKMLVGAPDRVRVYVSDNLYKNSMGVFVLG 65
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGLPIA ++GA+ G LEV+ +++ + +E+ + + ++ K+ + E +Y
Sbjct: 66 TGKIGLPIAAAVGAIGGDPHAGLEVLANISEKDVEKAQALIDAGQVSAHRKD-VDEFIYC 124
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLL----DKQVPAEEGAPTDNKDIQLNLKM 183
E+ G+ A IS HT + + +G VL D+ +P G+ D I +++
Sbjct: 125 YAEVESEGQIAAVEISGGHTQIIKKTLNGTVLFSQDCDQTIPT--GSICDG--INIDIGS 180
Query: 184 VWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHI 243
+++FAT+ ++I FILEA N +EE L YG VG+TM + + GL + + I
Sbjct: 181 IYNFATSADFDDIAFILEASALNGKLSEEGLAHAYGLEVGRTMVQGIQLGLMAEDLMNKI 240
Query: 244 LSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLT 303
+ +TA+A DARMGGA +P MSN GSGNQGI AT PV V A+ + T E+L RAL +SHL
Sbjct: 241 VMQTAAASDARMGGATLPAMSNFGSGNQGIAATIPVAVIAEHFKATQEQLARALIMSHLG 300
Query: 304 AIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKP 363
AIYIK + LSA CG V ++ + YL G + CYA++N++++ +GMVCDGAK
Sbjct: 301 AIYIKSHYPPLSAFCGNTVTSASAAMAMVYLTKGSFEQGCYAIQNVLSDCSGMVCDGAKS 360
Query: 364 SCALKVSSGVSTAI---LSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDI 420
+CA+KV + S A+ L AM E H+ +GI+ D++ SIRN+ L M+ D
Sbjct: 361 TCAMKVKTSTSAAVNGFLMAMRCNEAHN----QGIVADDVETSIRNIGKLVTAGMSVTDS 416
Query: 421 MVLDIMTS 428
++DIM++
Sbjct: 417 TIIDIMSA 424
>gi|90409022|ref|ZP_01217151.1| hypothetical protein PCNPT3_06728 [Psychromonas sp. CNPT3]
gi|90309880|gb|EAS38036.1| hypothetical protein PCNPT3_06728 [Psychromonas sp. CNPT3]
Length = 424
Score = 277 bits (709), Expect = 1e-72, Method: Composition-based stats.
Identities = 160/422 (37%), Positives = 260/422 (61%), Gaps = 3/422 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+Q I++++ V PA+GCTEP++VA A AT++L + PEKI+ +S N+ KN+MGV +PG
Sbjct: 6 QQYINILNSVVKPALGCTEPISVAYACAVATKILNKTPEKITVKVSDNLYKNSMGVFVPG 65
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA ++GA+ G LEV+K +T + +++ ++ + ++ ++ + + E +Y
Sbjct: 66 TGKIGLHIAAAVGAIAGDPLADLEVLKKITMQDVDKAQRLIDAKKVTVE-RIDVNEFIYC 124
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWDF 187
+ EA G A+ IS HT+ ++ + ++ + A + I +N++ ++DF
Sbjct: 125 YVELEADGNVASVEISGAHTHVTSKKFNDDIIFSAPNNSSSKASIFDS-IDINIEKIYDF 183
Query: 188 ATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKT 247
+ P ++I+FILEA + N A E L YG +G + + + G + ++IL +T
Sbjct: 184 SMHAPFSDIKFILEAAKLNSALANEGLNKQYGLNLGGIIKKSMQDGFISEGLINNILMET 243
Query: 248 ASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYI 307
+A DARMGGA +P MSN GSGNQGI AT PVV+ AK E+L RAL LSHL AIYI
Sbjct: 244 TAASDARMGGATLPAMSNYGSGNQGIAATLPVVIMAKHCNVDEEKLARALILSHLGAIYI 303
Query: 308 KQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCAL 367
K + LSA CG + +S + YLMGG + YA++N++ + TGM+CDGAK +CA+
Sbjct: 304 KSHYPPLSAFCGNSATSSAASMAMVYLMGGSFEQCSYAIQNVLGDCTGMICDGAKSTCAM 363
Query: 368 KVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
KV + S+AI A+L++ + + +GI+ ++++++I NL L + M D +++DIM+
Sbjct: 364 KVKTSTSSAIYGALLAINSTEAND-QGIVGQNVEQTIVNLGKLISNGMQSTDTVIIDIMS 422
Query: 428 SK 429
K
Sbjct: 423 GK 424
>gi|19704482|ref|NP_604044.1| hypothetical protein FN1147 [Fusobacterium nucleatum subsp.
nucleatum ATCC 25586]
gi|19714754|gb|AAL95343.1| Hypothetical protein [Fusobacterium nucleatum subsp. nucleatum ATCC
25586]
Length = 411
Score = 276 bits (705), Expect = 2e-72, Method: Composition-based stats.
Identities = 158/409 (38%), Positives = 241/409 (58%), Gaps = 3/409 (0%)
Query: 21 AVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGT-GMIGLPIAISL 79
A GCTEP+A++ A+A +LG P K+ FLS NI+KN V IP + GMIG+ AI++
Sbjct: 3 AEGCTEPIALSYAAAKARRILGTVPNKVDVFLSGNIIKNVKSVTIPNSEGMIGIEPAIAM 62
Query: 80 GALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEMTCEAGGKKAT 139
G + G + +L VI D T E ++E + ++ + I + G KLYI + G
Sbjct: 63 GLIAGDDKKELMVISDTTHEQVQEVRDFLDKKLIKTHVYPGDI-KLYIRLEISNGENNVL 121
Query: 140 AIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWDFATTTPINEIEFI 199
I THTN + KVLL + + + L++K ++D A T I+ I+ I
Sbjct: 122 LEIKHTHTNITRILKNDKVLLSQICNDGDFNSSLTDRKVLSVKYIYDLAKTIDIDLIKPI 181
Query: 200 LE-AKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKTASACDARMGGA 258
+ RYN A+E LKG YG +GK + + +G++GN + + S ++ DARM G
Sbjct: 182 FQKVIRYNSAIADEGLKGKYGVNIGKMILDNIEKGIYGNDVRNKAASYASAGSDARMSGC 241
Query: 259 MVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYIKQNLGALSALC 318
+PVM+ SGSGNQG+ A+ PV+ FA E + EEL+R L +SHL I++K N+G LSA C
Sbjct: 242 ALPVMTTSGSGNQGMTASLPVIKFAAEKNLSEEELIRGLFVSHLITIHVKTNVGRLSAYC 301
Query: 319 GCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCALKVSSGVSTAIL 378
G I A +G + +T+L GG + +C A+ N++ NL+G++CDGAK SCA+K+SSG+ +A
Sbjct: 302 GAICAASGVAAALTFLHGGSFEMVCDAITNILGNLSGVICDGAKASCAMKISSGIYSAFD 361
Query: 379 SAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
+ ML++ + +GI+ DI+++IRN+ L + M D +L IMT
Sbjct: 362 ATMLALNKDVLKSGDGIVGVDIEETIRNVGELAQCGMKGTDETILGIMT 410
>gi|110800835|ref|YP_695254.1| hypothetical protein CPF_0803 [Clostridium perfringens ATCC 13124]
gi|110675482|gb|ABG84469.1| conserved hypothetical protein [Clostridium perfringens ATCC 13124]
Length = 427
Score = 275 bits (703), Expect = 4e-72, Method: Composition-based stats.
Identities = 170/425 (40%), Positives = 266/425 (62%), Gaps = 4/425 (0%)
Query: 6 IREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGI 65
+RE + + ++VVP+ GCTEP+A+A + A E L + ++++ +LS N++KNA+GVGI
Sbjct: 1 MRELYLKTLKKEVVPSEGCTEPIAIAYAASIAAEHLKGEIKEVNIYLSKNVIKNALGVGI 60
Query: 66 PGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKL 125
PGTG +G+ IA +LG I KS +L ++ + T + L++ K+ V +N I+IK K + L
Sbjct: 61 PGTGGVGIEIAAALGISIQKSYKKLTILSNFTEDELKKAKEIVDKNIINIKQKN-TNKAL 119
Query: 126 YIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVW 185
YIE+ + KA II THTN E D ++++D E D K + ++
Sbjct: 120 YIEVELLSETSKAKVIIEDTHTNVTLIECDDEIIMDNNSEVSEDLEEDYK--LFKIADIY 177
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
+FA + I+FILE+ + N +EE LKG+YG VG + + + LF N + I++
Sbjct: 178 NFAKEVDFDHIKFILESAKMNEKVSEEGLKGDYGLQVGSKIIQKGNFNLFSNDASNKIIA 237
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+A+A DARM G +P+M+ +GSGNQGI + PV A+ + + EEL RAL LS+L I
Sbjct: 238 ASAAASDARMDGCAMPIMTTAGSGNQGIACSIPVAQTARLLDKSEEELARALVLSNLVTI 297
Query: 306 YIKQNLGALSALC-GCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPS 364
IK+++G LS LC I TG+SCGITYL+GGD NI Y + NMI++L+GM+CDGAK +
Sbjct: 298 RIKKHMGRLSPLCGAGIAGATGASCGITYLLGGDLENINYCINNMISDLSGMICDGAKET 357
Query: 365 CALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLD 424
CALK+++G + AI A L++ T +GI+ KD++++I ++ +L ++ +D +L+
Sbjct: 358 CALKIATGTNAAIQCANLAINGISATANDGIVAKDVEETIESIETLIQNGFKNVDDTILN 417
Query: 425 IMTSK 429
IM K
Sbjct: 418 IMLEK 422
>gi|18309788|ref|NP_561722.1| hypothetical protein CPE0806 [Clostridium perfringens str. 13]
gi|18144466|dbj|BAB80512.1| conserved hypothetical protein [Clostridium perfringens str. 13]
Length = 427
Score = 274 bits (700), Expect = 1e-71, Method: Composition-based stats.
Identities = 169/425 (39%), Positives = 267/425 (62%), Gaps = 4/425 (0%)
Query: 6 IREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGI 65
+RE + + ++VVP+ GCTEP+A+A + A E L + ++++ +LS N++KNA+GVGI
Sbjct: 1 MRELYLKTLKKEVVPSEGCTEPIAIAYAASIAAEHLKGEIKEVNIYLSKNVIKNALGVGI 60
Query: 66 PGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKL 125
PGTG +G+ IA +LG I KS +L ++ + T + L++ K+ V EN I+IK K + L
Sbjct: 61 PGTGGVGIEIAAALGISIQKSYKKLTILSNFTEDELKKAKEIVDENIINIKQKN-TNKAL 119
Query: 126 YIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVW 185
YIE+ + K+ II THTN E D ++++D E D K + ++
Sbjct: 120 YIEVELLSETSKSKVIIEDTHTNVTLIECDDEIIMDNNSEVSEDLEEDYK--LFKIADIY 177
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
+FA ++I+FILE+ + N +EE LKG+YG VG + + + LF N + I++
Sbjct: 178 NFAKEADFDDIKFILESAKMNEKVSEEGLKGDYGLQVGSKIIQKGNFNLFSNDASNKIIA 237
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+A+A DARM G +P+M+ +GSGNQGI + PV A+ + + EEL RAL LS+L I
Sbjct: 238 ASAAASDARMDGCAMPIMTTAGSGNQGIACSIPVAQTARLLDKSEEELARALVLSNLVTI 297
Query: 306 YIKQNLGALSALC-GCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPS 364
IK+++G LS LC I TG+SCGITYL+GG+ NI Y + NMI++L+GM+CDGAK +
Sbjct: 298 RIKKHMGRLSPLCGAGIAGATGASCGITYLLGGNLENINYCINNMISDLSGMICDGAKET 357
Query: 365 CALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLD 424
CALK+++G + AI A L++ T +GI+ KD++++I ++ +L ++ +D +L+
Sbjct: 358 CALKIATGTNAAIQCANLAINGISATANDGIVAKDVEETIESIETLIQNGFKNVDDTILN 417
Query: 425 IMTSK 429
IM K
Sbjct: 418 IMLEK 422
>gi|149909553|ref|ZP_01898207.1| hypothetical protein PE36_17145 [Moritella sp. PE36]
gi|149807458|gb|EDM67409.1| hypothetical protein PE36_17145 [Moritella sp. PE36]
Length = 424
Score = 273 bits (699), Expect = 1e-71, Method: Composition-based stats.
Identities = 159/421 (37%), Positives = 250/421 (59%), Gaps = 2/421 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+Q L++ V PA+GCTEP++ A +A A +L PE+I+ +S N+ KN+MGV +PG
Sbjct: 6 QQYKQLLNTFVKPALGCTEPISAAYASAVAASMLPATPEQITVHVSNNLYKNSMGVFVPG 65
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA ++GA+ G + LEV+ +TP + + + + ++ +K + E +Y
Sbjct: 66 TGKIGLAIAAAVGAIAGDPDAGLEVLAQITPPQVTQAQALIDAGKVIVKRTDS-KEFIYC 124
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWDF 187
+ + A IS HT +GK++ + D+ + ++ ++D+
Sbjct: 125 HIEAKCRDYVAVVEISGGHTKITKTTLNGKLVFSNNSCTAQSTSNIGDDLDITIQGIYDY 184
Query: 188 ATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKT 247
A +I+FIL+A + N A+E L YG VG+T+++ + G + + I+ +T
Sbjct: 185 AMGADFADIDFILDAAKLNSALADEGLNNAYGLQVGRTIEQNIQSGFMSADLNNTIIMRT 244
Query: 248 ASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYI 307
A+A DARMGGA +P MSN GSGNQGI AT PV + AK + + E+L RAL LSHL AIYI
Sbjct: 245 AAASDARMGGATLPAMSNYGSGNQGIAATVPVEIIAKHYQASDEQLARALILSHLGAIYI 304
Query: 308 KQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCAL 367
KQ+ LSA CG V ++ + YL GG+Y CYA++N++++ +GMVCDGAK +CA+
Sbjct: 305 KQHYPPLSAFCGNTVTSASAAMAMVYLAGGNYVQSCYAIQNVLSDCSGMVCDGAKSTCAM 364
Query: 368 KVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
KV + S+A+ + ML+M N + +GII DI+ SIRN+ L M D+ +++IM+
Sbjct: 365 KVKTSTSSAVTAFMLAMNNTQAFD-QGIIAHDIEASIRNIGMLVTKGMVNTDVTIINIMS 423
Query: 428 S 428
+
Sbjct: 424 A 424
>gi|110801779|ref|YP_698115.1| hypothetical protein CPR_0790 [Clostridium perfringens SM101]
gi|110682280|gb|ABG85650.1| conserved hypothetical protein [Clostridium perfringens SM101]
Length = 427
Score = 273 bits (698), Expect = 2e-71, Method: Composition-based stats.
Identities = 168/425 (39%), Positives = 266/425 (62%), Gaps = 4/425 (0%)
Query: 6 IREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGI 65
+RE + + ++VVP+ GCTEP+A+A + A E L + ++++ +LS N++KNA+GVGI
Sbjct: 1 MRELYLKTLKKEVVPSEGCTEPIAIAYAASIAAEYLKGEIKEVNIYLSKNVIKNALGVGI 60
Query: 66 PGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKL 125
PGTG +G+ IA +LG I KS +L ++ + T + L++ K+ V +N I+IK K + L
Sbjct: 61 PGTGGVGIEIAAALGISIQKSYKKLTILSNFTEDELKKAKEIVDKNIINIKQKN-TNKAL 119
Query: 126 YIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVW 185
YIE+ + KA II THTN E D ++++D E D + ++
Sbjct: 120 YIEVELLSETSKAKVIIEDTHTNVTLIECDDEIIMDNNSEVSEDLEEDYN--LFKIADIY 177
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
+FA ++I+FILE+ + N +EE LKG+YG VG + + + LF N + I++
Sbjct: 178 NFAKEADFDDIKFILESAKMNEKVSEEGLKGDYGLQVGSKIIQKGNFNLFSNDASNKIIA 237
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+A+A DARM G +P+M+ +GSGNQGI + PV ++ + + EEL RAL LS+L I
Sbjct: 238 ASAAASDARMDGCAMPIMTTAGSGNQGIACSIPVAQTSRLLDKSEEELARALVLSNLVTI 297
Query: 306 YIKQNLGALSALC-GCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPS 364
IK+++G LS LC I TG+SCGITYL+GGD NI Y + NMI++L+GM+CDGAK +
Sbjct: 298 RIKKHMGRLSPLCGAGIAGATGASCGITYLLGGDLENINYCINNMISDLSGMICDGAKET 357
Query: 365 CALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLD 424
CALK+++G + AI A L++ T +GI+ KD++++I ++ +L ++ +D +L+
Sbjct: 358 CALKIATGTNAAIQCANLAINGISATANDGIVAKDVEETIESIETLIQNGFKNVDDTILN 417
Query: 425 IMTSK 429
IM K
Sbjct: 418 IMLEK 422
>gi|152972895|ref|YP_001338041.1| hypothetical protein KPN_04419 [Klebsiella pneumoniae subsp.
pneumoniae MGH 78578]
gi|150957744|gb|ABR79774.1| hypothetical protein KPN_04419 [Klebsiella pneumoniae subsp.
pneumoniae MGH 78578]
Length = 425
Score = 269 bits (688), Expect = 2e-70, Method: Composition-based stats.
Identities = 167/423 (39%), Positives = 245/423 (57%), Gaps = 11/423 (2%)
Query: 11 IDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTGM 70
+ + ++V PA+GCTEP+A++ A A + L Q KIS F+SAN+ KNAMGV IPGT +
Sbjct: 10 VKWLKQEVAPALGCTEPVAISFAAAYAAQYLDQPCTKISGFISANLYKNAMGVTIPGTTV 69
Query: 71 IGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEMT 130
G+P+A ++GA G + L+ ++D+TP+ +E ++ ++ N +DI ++E + +++++T
Sbjct: 70 CGVPLAAAIGAFGGDPQKGLKTLEDITPQHVEMAQKLIANNAVDIAVEE-TPDFIHLDLT 128
Query: 131 CEAGGKKATAIISKTHTNFV--YEEADGKVLLDKQ--VPAEEGAPTDNKDIQLNLKMVWD 186
AG ++ THTN V Y + L +KQ E PT +L+ +D
Sbjct: 129 LSAGDNCCRVVVKGTHTNVVELYINGQPQPLSEKQNTRTQRETLPT------FSLQQAYD 182
Query: 187 FATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSK 246
F N+I FIL+A R N A E YG + T + GL N + S ++
Sbjct: 183 FINRVDFNDIRFILDAARLNSALAAEGKTKKYGLNINGTFSDAVKNGLMSNDLLSKVIIN 242
Query: 247 TASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIY 306
T +A DARMGGA V MSN GSGNQGI AT PVVV A+ E L RAL+LSHLTAI
Sbjct: 243 TVAASDARMGGAPVVAMSNFGSGNQGITATMPVVVVAEHLGVDEETLARALSLSHLTAIS 302
Query: 307 IKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCA 366
I LSALC A G++ G+ +L D N I A+ NMI+++TGM+CDGA SCA
Sbjct: 303 IHSRYTRLSALCAASTAAMGAAAGMAWLFTRDINTINTAIINMISDITGMICDGASNSCA 362
Query: 367 LKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIM 426
+KVSS VS+A + +++M+N +GI+ D++++I NL L M D ++ IM
Sbjct: 363 MKVSSVVSSAFKAVLMAMQNSCAGANDGIVCADVEQTINNLCRLVIKPMTLTDKEIISIM 422
Query: 427 TSK 429
+K
Sbjct: 423 VAK 425
>gi|106894563|ref|ZP_01361681.1| Protein of unknown function DUF1063 [Clostridium sp. OhILAs]
gi|106774162|gb|EAT30728.1| Protein of unknown function DUF1063 [Clostridium sp. OhILAs]
Length = 430
Score = 268 bits (686), Expect = 4e-70, Method: Composition-based stats.
Identities = 164/425 (38%), Positives = 243/425 (57%), Gaps = 3/425 (0%)
Query: 6 IREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKP-EKISAFLSANILKNAMGVG 64
++ II + +VVPA+GCTEP+AVAL A+A ELLG K +S NI KN + VG
Sbjct: 3 LKNLIIKTLKEEVVPAMGCTEPVAVALGCAKAKELLGDMDITKAEILVSPNIYKNGLSVG 62
Query: 65 IPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEK 124
IP T +GL IA +LG + GKSE L+V+ + E + + + E ++ I +K I EK
Sbjct: 63 IPNTNEVGLFIAGALGIVAGKSEKDLQVLSGIVEEDVVIAHELLKEEKVTIDIKPTI-EK 121
Query: 125 LYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMV 184
+Y+E+ A +TAII H FVY G +LL+ A + N ++ ++ +
Sbjct: 122 IYVEVNLYAEEGSSTAIIQGRHNEFVYLAQSGNILLNGLQEATSSTKSTNPLFEMKIRDI 181
Query: 185 WDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHIL 244
+ ++EI F+LE N A E LK G VG+T+ + +G+ + + + +
Sbjct: 182 IKEISELGMDEIGFMLEGLEMNEKIAMEGLKNTSGISVGRTIYENIQKGILADDLMNTAM 241
Query: 245 SKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTA 304
TA+ DARM G +PVMS+SGSGN G+ A P++ + K+ L +AL +SH+T
Sbjct: 242 MLTAAGSDARMSGIRMPVMSSSGSGNNGLTAILPILAYHKKFPVEDRPLAQALAISHMTN 301
Query: 305 IYIKQNLGALSALCGC-IVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKP 363
YIK +G LSALCGC + A TG+S I +LMG D I +KNMI NL+GM+CDGAK
Sbjct: 302 SYIKHYIGRLSALCGCGVAAGTGASISIAWLMGADAEKIDGTIKNMIGNLSGMICDGAKV 361
Query: 364 SCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVL 423
CALK+++ S AI SA+L++ H + GII + +I+NL L ++ M D +L
Sbjct: 362 GCALKLATSASAAIQSALLALNGHVIPSKNGIIGDTAEDTIKNLGILSEEGMYFADHTIL 421
Query: 424 DIMTS 428
+M +
Sbjct: 422 KVMKA 426
>gi|146292312|ref|YP_001182736.1| protein of unknown function DUF1063 [Shewanella putrefaciens CN-32]
gi|145564002|gb|ABP74937.1| protein of unknown function DUF1063 [Shewanella putrefaciens CN-32]
Length = 424
Score = 268 bits (685), Expect = 5e-70, Method: Composition-based stats.
Identities = 170/423 (40%), Positives = 264/423 (62%), Gaps = 6/423 (1%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+Q I ++++ V PA+GCTEP+A A +A A LLG PE IS +S N+ KN+MGV +PG
Sbjct: 6 QQYIQILNQVVKPALGCTEPIAAAYASAVARTLLGIVPEAISVQVSDNLYKNSMGVYVPG 65
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA + GA+ G +E LEV+ +TPE + + + + ++ ++ E E +Y
Sbjct: 66 TGKIGLAIAAAAGAIAGNAEAGLEVLAAITPEQVAQAQDLIDAGKVKVERTE-TEEFIYC 124
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLL--DKQVPAEEGAPTDNKDIQLNLKMVW 185
+T +AG ++A I HT + +G+ + D A G+ D DI ++K ++
Sbjct: 125 CVTLKAGEQEALVKICGGHTLIAEKRLNGEPVFTADNAQSAATGSICDGIDI--SIKSIY 182
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
FA P ++I+FIL+A N ++E + YG VG+TM ++ G+ G + + I+
Sbjct: 183 QFAQEVPFDQIKFILKASELNGKLSDEGMAKPYGLEVGRTMKSGIAAGIIGEDLLNKIVM 242
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
TA+A DARMGGA +P MSN GSGNQGI AT PVV+ A+ T E+L RAL +SHL AI
Sbjct: 243 LTAAASDARMGGANLPAMSNLGSGNQGIAATIPVVLTAQCYNVTEEKLARALIMSHLGAI 302
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YIK + LSA CG V +S + Y+ GG + C+A++N+I++ +GMVCDGAK SC
Sbjct: 303 YIKSHYPPLSAFCGNTVTSAAASMAMVYIAGGSFEQSCFAIQNVISDSSGMVCDGAKASC 362
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A+ S ++++ + +V+ +GII D++K+I+N+ + + M+ DI ++DI
Sbjct: 363 AMKVSTSSSAAVRSFLMALNSQNVS-GQGIIATDVEKTIKNIGKMILNGMSSTDITIIDI 421
Query: 426 MTS 428
M++
Sbjct: 422 MST 424
>gi|120599752|ref|YP_964326.1| protein of unknown function DUF1063 [Shewanella sp. W3-18-1]
gi|124545038|ref|ZP_01704265.1| protein of unknown function DUF1063 [Shewanella putrefaciens 200]
gi|120559845|gb|ABM25772.1| protein of unknown function DUF1063 [Shewanella sp. W3-18-1]
gi|124511225|gb|EAY55290.1| protein of unknown function DUF1063 [Shewanella putrefaciens 200]
Length = 424
Score = 268 bits (684), Expect = 7e-70, Method: Composition-based stats.
Identities = 169/423 (39%), Positives = 264/423 (62%), Gaps = 6/423 (1%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+Q I ++++ V PA+GCTEP+A A +A A LLG PE IS +S N+ KN+MGV +PG
Sbjct: 6 QQYIQILNQVVKPALGCTEPIAAAYASAVARTLLGIVPEAISVQVSDNLYKNSMGVYVPG 65
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA + GA+ G +E LEV+ +TPE + + + + ++ ++ E E +Y
Sbjct: 66 TGKIGLAIAAAAGAIAGNAEAGLEVLAAITPEQVAQAQDLIDAGKVKVERTE-TEEFIYC 124
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLL--DKQVPAEEGAPTDNKDIQLNLKMVW 185
+T +AG ++A I HT + +G+ + D A G+ D DI ++K ++
Sbjct: 125 CVTLKAGEQEALVKICGGHTLIAEKRLNGEPVFTADNAQSAATGSICDGIDI--SIKSIY 182
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
FA P ++I+FIL+A N ++E + YG VG+TM ++ G+ G + + I+
Sbjct: 183 QFAQEVPFDQIKFILKASELNGKLSDEGMAKPYGLEVGRTMKSGIAAGIIGEDLLNKIVM 242
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
TA+A DARMGGA +P MSN GSGNQGI AT PVV+ A+ T E+L RAL +SHL AI
Sbjct: 243 LTAAASDARMGGANLPAMSNLGSGNQGIAATIPVVLTAQCYNVTEEKLARALIMSHLGAI 302
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YIK + LSA CG V +S + Y+ GG + C+A++N+I++ +GMVCDGAK SC
Sbjct: 303 YIKSHYPPLSAFCGNTVTSAAASMAMVYIAGGSFEQSCFAIQNVISDSSGMVCDGAKASC 362
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A+ S ++++ + +V+ +GII D++K+I+N+ + + M+ D+ ++DI
Sbjct: 363 AMKVSTSSSAAVRSFLMALNSQNVS-GQGIIATDVEKTIKNIGKMILNGMSSTDVTIIDI 421
Query: 426 MTS 428
M++
Sbjct: 422 MST 424
>gi|113971128|ref|YP_734921.1| protein of unknown function DUF1063 [Shewanella sp. MR-4]
gi|114048367|ref|YP_738917.1| protein of unknown function DUF1063 [Shewanella sp. MR-7]
gi|113885812|gb|ABI39864.1| protein of unknown function DUF1063 [Shewanella sp. MR-4]
gi|113889809|gb|ABI43860.1| protein of unknown function DUF1063 [Shewanella sp. MR-7]
Length = 424
Score = 267 bits (682), Expect = 1e-69, Method: Composition-based stats.
Identities = 164/420 (39%), Positives = 261/420 (62%), Gaps = 2/420 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+Q I ++++ V PA+GCTEP+A A A A LL +PE I+ +S N+ KN+MGV +PG
Sbjct: 6 QQYIQIINQVVKPALGCTEPIAAAYAAAVARTLLPVEPESIAVQVSDNLYKNSMGVYVPG 65
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA + GAL G +E LEV+ ++TPE + + + + ++ ++ E E +Y
Sbjct: 66 TGKIGLAIAAAAGALAGDAEAGLEVLANVTPEQVTKAQTLIDAGKVKVERTE-TDEFIYC 124
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWDF 187
++ AG ++A I HT + +G+++ + + I +N++ ++ F
Sbjct: 125 CVSLTAGEQEAMVKICGGHTLIAEKRLNGELVFTADNAQAKATGSICDGIDINIESIYRF 184
Query: 188 ATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKT 247
A P EI+FIL+A N ++E + YG VG+TM ++ G+ G + + I+ T
Sbjct: 185 AQEVPFEEIQFILKASELNSKLSDEGMSKPYGLEVGRTMKNGIAAGIIGEDLLNKIVMLT 244
Query: 248 ASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYI 307
A+A DARMGGA +P MSN GSGNQGI AT PVV+ A+ + + E+L RAL +SHL AIYI
Sbjct: 245 AAASDARMGGANLPAMSNLGSGNQGIAATIPVVITAQCYKVSEEKLARALIMSHLGAIYI 304
Query: 308 KQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCAL 367
K + LSA CG V +S + YL GG + CYA++N+I++ +GMVCDGAK SCA+
Sbjct: 305 KSHYPPLSAFCGNTVTSAAASMAMVYLAGGSFEQSCYAIQNVISDSSGMVCDGAKASCAM 364
Query: 368 KVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
KVS+ S A+ S ++++ + +V+ +GII KD++K+I+N+ + + M+ D+ +++IM+
Sbjct: 365 KVSTSSSAAVRSFLMALSSQNVS-GQGIIAKDVEKTIKNIGKMVLNGMSSTDVTIINIMS 423
>gi|149115725|ref|ZP_01842464.1| protein of unknown function DUF1063 [Shewanella baltica OS223]
gi|146864399|gb|EDK49816.1| protein of unknown function DUF1063 [Shewanella baltica OS223]
Length = 424
Score = 266 bits (681), Expect = 2e-69, Method: Composition-based stats.
Identities = 160/421 (38%), Positives = 258/421 (61%), Gaps = 2/421 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+Q I+++ + V PA+GCTEP+A A A A LLG +P+ I+ +S N+ KN+MGV +PG
Sbjct: 6 QQYINIIKQVVKPALGCTEPIAAAYAAAVARALLGVEPDSIAVQVSDNLYKNSMGVFVPG 65
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA + GA+ G + LEV+ +TPE + + + ++ ++ E E +Y
Sbjct: 66 TGKIGLAIAAAAGAIAGNPDAGLEVLAVITPEQVARAQALIDAGKVTVERTE-TAEFIYC 124
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWDF 187
+ + G ++A I HT + +G+ + + + + + + ++ ++ F
Sbjct: 125 CVIAKKGDREALVKICGGHTLIAEKRLNGESVFSVDSTQAKATGSICEGVDITIESIYRF 184
Query: 188 ATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKT 247
A P EI+FILEA N ++E + YG VG+TM ++ G+ G + + I+ T
Sbjct: 185 AQEVPFEEIKFILEASELNGKLSDEGMANPYGLEVGRTMKSGIAAGIIGEDLLNKIVMLT 244
Query: 248 ASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYI 307
A+A DARMGGA +P MSN GSGNQGI AT PVV+ A+ + + E+L RAL +SHL AIYI
Sbjct: 245 AAASDARMGGANLPAMSNLGSGNQGIAATIPVVLTAQCYKVSEEQLARALIMSHLGAIYI 304
Query: 308 KQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCAL 367
K + LSA CG V +S + YL GG + C+A++N+I++ +GMVCDGAK SCA+
Sbjct: 305 KSHYPPLSAFCGNTVTSAAASMAMVYLAGGSFEQSCFAIQNVISDSSGMVCDGAKASCAM 364
Query: 368 KVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
KVS+ S A+ S ++++ +H+V+ +GII D++K+I+N+ + + M+ D+ ++DIM+
Sbjct: 365 KVSTSSSAAVRSFLMALSSHNVS-GQGIIATDVEKTIKNIGKMILNGMSSTDVTIIDIMS 423
Query: 428 S 428
+
Sbjct: 424 A 424
>gi|118073122|ref|ZP_01541306.1| protein of unknown function DUF1063 [Shewanella woodyi ATCC 51908]
gi|118022422|gb|EAV36242.1| protein of unknown function DUF1063 [Shewanella woodyi ATCC 51908]
Length = 425
Score = 266 bits (681), Expect = 2e-69, Method: Composition-based stats.
Identities = 162/428 (37%), Positives = 266/428 (62%), Gaps = 7/428 (1%)
Query: 4 KNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLG-QKPEKISAFLSANILKNAMG 62
K + ++ I+++++ V PA+GCTEP+A A A A + LG P+ + F+S N+ KN+MG
Sbjct: 2 KPVWQKYIEIINQVVKPALGCTEPIAAAYGAAVAVQELGCSVPDSLEVFVSDNLYKNSMG 61
Query: 63 VGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGIT 122
V +PGTG IGL IA + GA+ G ++ LEV+ +TPE + + ++ + ++ +K +
Sbjct: 62 VFVPGTGKIGLAIAAASGAIGGNADAGLEVLAAITPEQVLQAQEMIDAGKVSVK-RTVTD 120
Query: 123 EKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKV--LLDKQVPAEEGAPTDNKDIQLN 180
E +Y + + G+++ I HT V ++ +GK+ + D+ G+ + DI N
Sbjct: 121 EFIYCCVIAKFEGRESLVKICGGHTLIVEKQLNGKINFIADESSATSTGSICEGVDI--N 178
Query: 181 LKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIY 240
+ +++FAT + +I FIL+A N ++E + +YG VG+TM + + G+ N +
Sbjct: 179 IASIYEFATQVELEKIAFILQAADLNTKLSDEGMTNSYGLEVGRTMKKSIDEGILSNDLL 238
Query: 241 SHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLS 300
+ I+ +TA+A DARMGGA +P MSN GSGNQGI AT PVV+ AK +N+ E+L RAL LS
Sbjct: 239 NKIVMETAAASDARMGGATLPAMSNLGSGNQGIAATIPVVIAAKHYKNSDEQLARALILS 298
Query: 301 HLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDG 360
HL AIYIK + LSA CG V ++ + YL GG + C+A++N+I++ +GMVCDG
Sbjct: 299 HLGAIYIKSHYPPLSAFCGNTVTSAAAAMAMVYLAGGSFEQSCFAIQNVISDSSGMVCDG 358
Query: 361 AKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDI 420
AK SCA+KVS+ A+ + +++M + V+ +GII KD++++I N+ + M D
Sbjct: 359 AKASCAMKVSTSSCAAVRAFLMAMNSRSVS-GQGIIAKDVEQTIINIGQMISHGMPSTDT 417
Query: 421 MVLDIMTS 428
+++IM++
Sbjct: 418 TIINIMSA 425
>gi|117921411|ref|YP_870603.1| protein of unknown function DUF1063 [Shewanella sp. ANA-3]
gi|117613743|gb|ABK49197.1| protein of unknown function DUF1063 [Shewanella sp. ANA-3]
Length = 424
Score = 266 bits (680), Expect = 2e-69, Method: Composition-based stats.
Identities = 162/424 (38%), Positives = 262/424 (61%), Gaps = 2/424 (0%)
Query: 4 KNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGV 63
K + +Q I ++++ V PA+GCTEP+A A A A LL PE I+ +S N+ KN+MGV
Sbjct: 2 KPLWQQYIQIINQVVKPALGCTEPIAAAYAAAVARTLLPVAPESIAVQVSDNLYKNSMGV 61
Query: 64 GIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITE 123
+PGTG IGL IA + GAL G ++ LEV+ ++TPE + + + + ++ ++ E E
Sbjct: 62 YVPGTGKIGLAIAAAAGALAGNADAGLEVLANVTPEQVAQAQTLIDAGKVKVERTE-TDE 120
Query: 124 KLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKM 183
+Y ++ AG ++A I HT + +G+++ + + + +N++
Sbjct: 121 FIYCCVSLTAGEQEAMVKICGGHTLIAEKRLNGELVFTADSAQAKATGSICDGVDINIES 180
Query: 184 VWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHI 243
++ FA P EI+FIL+A N ++E + YG VG+TM ++ G+ G + + I
Sbjct: 181 IYRFAQEVPFEEIQFILKASELNSKLSDEGMSKPYGLEVGRTMKNGIAAGIIGEDLLNKI 240
Query: 244 LSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLT 303
+ TA+A DARMGGA +P MSN GSGNQGI AT PVV+ A+ + + E+L RAL +SHL
Sbjct: 241 VMLTAAASDARMGGANLPAMSNLGSGNQGIAATIPVVITAQCYKVSEEKLARALIMSHLG 300
Query: 304 AIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKP 363
AIYIK + LSA CG V +S + YL GG + C+A++N+I++ +GMVCDGAK
Sbjct: 301 AIYIKSHYPPLSAFCGNTVTSAAASMAMVYLAGGSFEQSCFAIQNVISDSSGMVCDGAKA 360
Query: 364 SCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVL 423
SCA+KVS+ S A+ S ++++ + +V+ +GII KD++K+I+N+ + + M+ D+ ++
Sbjct: 361 SCAMKVSTSSSAAVRSFLMALNSQNVS-GQGIIAKDVEKTIKNIGKMVLNGMSSTDVTII 419
Query: 424 DIMT 427
+IM+
Sbjct: 420 NIMS 423
>gi|113949660|ref|ZP_01435305.1| protein of unknown function DUF1063 [Shewanella baltica OS195]
gi|126175271|ref|YP_001051420.1| protein of unknown function DUF1063 [Shewanella baltica OS155]
gi|113907949|gb|EAU26609.1| protein of unknown function DUF1063 [Shewanella baltica OS195]
gi|125998476|gb|ABN62551.1| protein of unknown function DUF1063 [Shewanella baltica OS155]
Length = 424
Score = 266 bits (680), Expect = 2e-69, Method: Composition-based stats.
Identities = 160/421 (38%), Positives = 259/421 (61%), Gaps = 2/421 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+Q I+++ + V PA+GCTEP+A A A A LLG +P+ I+ +S N+ KN+MGV +PG
Sbjct: 6 QQYINIIKQVVKPALGCTEPIAAAYAAAVARALLGVEPDSIAVQVSDNLYKNSMGVFVPG 65
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA + GA+ G + LEV+ +TPE + + + + ++ ++ E E +Y
Sbjct: 66 TGKIGLAIAAAAGAIAGNPDAGLEVLAVITPEQVAKAQALIDAGKVTVERTE-TAEFIYC 124
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWDF 187
+ + G ++A I HT + +G+ + + + + + + ++ ++ F
Sbjct: 125 CVIAKKGDREALVKICGGHTLIAEKRLNGESVFSVDSTQAKATGSICEGVDITIESIYRF 184
Query: 188 ATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKT 247
A P EI+FILEA N ++E + YG VG+TM ++ G+ G + + I+ T
Sbjct: 185 AQEVPFEEIKFILEASELNGKLSDEGMANPYGLEVGRTMKSGIAAGIIGEDLLNKIVMLT 244
Query: 248 ASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYI 307
A+A DARMGGA +P MSN GSGNQGI AT PVV+ A+ + + E+L RAL +SHL AIYI
Sbjct: 245 AAASDARMGGANLPAMSNLGSGNQGIAATIPVVLTAQCYKVSEEQLARALIMSHLGAIYI 304
Query: 308 KQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCAL 367
K + LSA CG V +S + YL GG + C+A++N+I++ +GMVCDGAK SCA+
Sbjct: 305 KSHYPPLSAFCGNTVTSAAASMAMVYLAGGSFEQSCFAIQNVISDSSGMVCDGAKASCAM 364
Query: 368 KVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
KVS+ S A+ S ++++ +H+V+ +GII D++K+I+N+ + + M+ D+ ++DIM+
Sbjct: 365 KVSTSSSAAVRSFLMALSSHNVS-GQGIIATDVEKTIKNIGKMILNGMSSTDVTIIDIMS 423
Query: 428 S 428
+
Sbjct: 424 A 424
>gi|153001593|ref|YP_001367274.1| protein of unknown function DUF1063 [Shewanella baltica OS185]
gi|151366211|gb|ABS09211.1| protein of unknown function DUF1063 [Shewanella baltica OS185]
Length = 424
Score = 266 bits (679), Expect = 2e-69, Method: Composition-based stats.
Identities = 160/421 (38%), Positives = 258/421 (61%), Gaps = 2/421 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+Q I ++ + V PA+GCTEP+A A A A LLG +P+ I+ +S N+ KN+MGV +PG
Sbjct: 6 QQYIHIIKQVVKPALGCTEPIAAAYAAAVARALLGVEPDSIAVQVSDNLYKNSMGVFVPG 65
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA + GA+ G + LEV+ +TPE + + + + ++ ++ E E +Y
Sbjct: 66 TGKIGLAIAAAAGAIAGNPDAGLEVLAVITPEQVAKAQALIDAGKVTVERTE-TAEFIYC 124
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWDF 187
+ + G ++A I HT + +G+ + + + + + + ++ ++ F
Sbjct: 125 CVIAKKGDREALVKICGGHTLIAEKRLNGESVFSVDSTQAKATGSICEGVDITIESIYRF 184
Query: 188 ATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKT 247
A P EI+FILEA N ++E + YG VG+TM ++ G+ G + + I+ T
Sbjct: 185 AQEVPFEEIKFILEASELNGKLSDEGMANPYGLEVGRTMKSGIAAGIIGEDLLNKIVMLT 244
Query: 248 ASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYI 307
A+A DARMGGA +P MSN GSGNQGI AT PVV+ A+ + + E+L RAL +SHL AIYI
Sbjct: 245 AAASDARMGGANLPAMSNLGSGNQGIAATIPVVLTAQCYKVSEEQLARALIMSHLGAIYI 304
Query: 308 KQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCAL 367
K + LSA CG V +S + YL GG + C+A++N+I++ +GMVCDGAK SCA+
Sbjct: 305 KSHYPPLSAFCGNTVTSAAASMAMVYLAGGSFEQSCFAIQNVISDSSGMVCDGAKASCAM 364
Query: 368 KVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
KVS+ S A+ S ++++ +H+V+ +GII D++K+I+N+ + + M+ D+ ++DIM+
Sbjct: 365 KVSTSSSAAVRSFLMALSSHNVS-GQGIIATDVEKTIKNIGKMILNGMSSTDVTIIDIMS 423
Query: 428 S 428
+
Sbjct: 424 A 424
>gi|153810614|ref|ZP_01963282.1| hypothetical protein RUMOBE_00995 [Ruminococcus obeum ATCC 29174]
gi|149833793|gb|EDM88874.1| hypothetical protein RUMOBE_00995 [Ruminococcus obeum ATCC 29174]
Length = 429
Score = 263 bits (673), Expect = 1e-68, Method: Composition-based stats.
Identities = 155/422 (36%), Positives = 255/422 (60%), Gaps = 10/422 (2%)
Query: 11 IDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTG- 69
++++ +++VPA+GCTEP+A+A A+A ++LG+ P+ ++ LS NI+KN GV +P +G
Sbjct: 10 LNILKQELVPALGCTEPIAIAYAAAKAHQVLGEFPDSVNMSLSGNIIKNVKGVTVPNSGG 69
Query: 70 MIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEM 129
+ G+ +A LG + G ++ LEV+ ++T + + ++ V ++ L EG+ + LYI
Sbjct: 70 LKGIDVAAILGIVGGNADKALEVLSEITEDDIARTRELVKQHVCSCSLVEGV-DNLYITA 128
Query: 130 TCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWDFAT 189
G A+ I HTN E DG+V+LD AE D +L +K + DFA
Sbjct: 129 KVIKGDHFASVTIEHQHTNITRIEKDGEVILDNPYQAECKTVIDKS--KLTVKDILDFAD 186
Query: 190 TTPINEIEFILEAK-RYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKTA 248
I +++ I++ + + N A+E L NYG +GKT+ ++G S+ + ++ A
Sbjct: 187 QVRIEDVQPIIDRQIKLNSAIAQEGLDNNYGAQIGKTLMH-----VWGKSVTTRACARAA 241
Query: 249 SACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYIK 308
+ DARMGG +PV+ NSGSGNQG+ + PV+VFA E E + E+L R+L +S+L AI+ K
Sbjct: 242 AGSDARMGGCSMPVVINSGSGNQGMTVSLPVIVFADEWEVSHEKLCRSLVVSNLIAIHQK 301
Query: 309 QNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCALK 368
+G+LSA CG + A G+ GITY+ GG Y + + N + N+ G+VCDGAKPSCA K
Sbjct: 302 YYIGSLSAYCGAVSAACGAGAGITYMYGGTYQQVSLTIINTLGNVGGIVCDGAKPSCAAK 361
Query: 369 VSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMTS 428
++S V A+++ LS++N EGII DI+++I+++ +G+ M D +L++M
Sbjct: 362 IASSVDAALMAFQLSIQNKSFLPGEGIIKDDIEETIKSMGYIGRVGMRSTDTEILNVMID 421
Query: 429 KG 430
+
Sbjct: 422 RA 423
>gi|20806790|ref|NP_621961.1| hypothetical protein TTE0269 [Thermoanaerobacter tengcongensis MB4]
gi|20515253|gb|AAM23565.1| conserved hypothetical protein [Thermoanaerobacter tengcongensis
MB4]
Length = 423
Score = 263 bits (671), Expect = 2e-68, Method: Composition-based stats.
Identities = 159/422 (37%), Positives = 236/422 (55%), Gaps = 9/422 (2%)
Query: 10 IIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTG 69
+ D++ V PA+GCTEP AVA ++A E+LG++P ++ + +ILKN M V IPGT
Sbjct: 8 LTDILKANVAPALGCTEPGAVAYAVSKAREILGEEPREVYVAVDRDILKNGMFVSIPGTK 67
Query: 70 MIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEM 129
GL A +L + GKSEY+LE +++ T E +++ + V+ + I L E E LYI+
Sbjct: 68 EKGLVFAAALALVCGKSEYKLEALREATEEDIKKAHKIVNRKAVKIVL-EKDAEGLYIKA 126
Query: 130 TCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI-QLNLKMVWDFA 188
+ +AT I+ H N VYEE DG VL K+ +E I + ++ D+
Sbjct: 127 SVVGDKHRATVIVKDAHDNIVYEERDGVVLKAKEENLKEDKSWLKAKIKEFTIEDFLDYC 186
Query: 189 TTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKTA 248
+ EIEFI E N A L G +GK + R + S + TA
Sbjct: 187 DSVDFKEIEFIGEGIEMNKKIAYAGLNEEVGVGIGKMLKRQI------RDEESLAKALTA 240
Query: 249 SACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYIK 308
+A +ARM G +PVMS++GSGN G+ A P+ + +E E+++RA+TLSHL Y+K
Sbjct: 241 AASEARMSGYPLPVMSSAGSGNHGLVAILPIAIIGEERGYDREKIIRAITLSHLLTAYVK 300
Query: 309 QNLGALSALCGC-IVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCAL 367
+G LS +CGC + A G S G+TYL+GG I AV NM+A L+GM+CDGAK CA
Sbjct: 301 AYIGVLSPICGCGVAAGVGMSAGLTYLLGGSRKQIKGAVSNMLAGLSGMICDGAKIGCAY 360
Query: 368 KVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
K+S V+ A+ ++ +MEN + GI+ ++SI+NL + + M D ++LDIM
Sbjct: 361 KLSISVTAALEASKFAMENIFIPSDNGILGNTAEESIKNLGRISVEGMKNADDVILDIML 420
Query: 428 SK 429
K
Sbjct: 421 KK 422
>gi|28211905|ref|NP_782849.1| hypothetical protein CTC02309 [Clostridium tetani E88]
gi|28204348|gb|AAO36786.1| conserved protein [Clostridium tetani E88]
Length = 449
Score = 261 bits (668), Expect = 5e-68, Method: Composition-based stats.
Identities = 157/422 (37%), Positives = 246/422 (58%), Gaps = 9/422 (2%)
Query: 10 IIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTG 69
+I+++ QVVPA+GCTEP+AVA ++A E+LG++ E + + +I KN VGIPGT
Sbjct: 33 LIEILKNQVVPALGCTEPIAVAYGVSKAKEILGEEVEHMEVNVDRSIFKNGKEVGIPGTD 92
Query: 70 MIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEM 129
G+ IA +L ++GKSEY L+V+KDLT E + + + + N + + LKEG+ + LYIE+
Sbjct: 93 KKGILIAAALSIIVGKSEYSLQVLKDLTNEDIPKALELIQNNVVKLNLKEGV-KGLYIEI 151
Query: 130 TCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQ-LNLKMVWDFA 188
K+ +I H N V E +G + +K + +I+ +K + DF
Sbjct: 152 IASGNKNKSRVVIKNNHLNIVLLEKNGMCIEEKSEEKSSVSVNLRDEIRNFTIKDLKDFV 211
Query: 189 TTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKTA 248
I +I FI E N A+E L G +G + N++ + + T+
Sbjct: 212 DNIDIEDIYFINEGIAMNKRIAKEGLDNKLGLGLGDLLKSE------ENNVIEYAKAVTS 265
Query: 249 SACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYIK 308
AC+ARM G +PVMS++GSGN G+ A P+ ++ E E+++R++ LSHL IYIK
Sbjct: 266 GACEARMSGYPLPVMSSAGSGNHGLVAILPIASIGEKLEKNEEKVIRSVALSHLVTIYIK 325
Query: 309 QNLGALSALCGC-IVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCAL 367
GALS +CGC + A G+S G+ YLM G I A+KNMIA ++GM+CDGAK CA
Sbjct: 326 SYTGALSPVCGCGVAAGVGASAGLCYLMDGTLEQIYGAIKNMIAGISGMICDGAKLGCAY 385
Query: 368 KVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
K+ VS++I +A ++++N V +GI+D+ +KSI+NL + D M+ D ++L++M
Sbjct: 386 KLCISVSSSIDAARMALKNIFVPSNDGILDETAEKSIQNLGKVSTDGMSCTDEVILEVML 445
Query: 428 SK 429
+
Sbjct: 446 DR 447
>gi|150383265|ref|ZP_01922103.1| protein of unknown function DUF1063 [Shewanella sediminis HAW-EB3]
gi|150251777|gb|EDM89527.1| protein of unknown function DUF1063 [Shewanella sediminis HAW-EB3]
Length = 424
Score = 260 bits (665), Expect = 1e-67, Method: Composition-based stats.
Identities = 155/421 (36%), Positives = 246/421 (58%), Gaps = 2/421 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+Q +++ V PA+GCTEP+ A +A A +L +PE +S +S N+ KN+MGV +PG
Sbjct: 6 QQYQAILNAVVKPALGCTEPICAAYASAIAASMLTSEPESVSVHVSDNLYKNSMGVFVPG 65
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA ++GA+ G +E LEV+ + + +E + + + + + E +Y
Sbjct: 66 TGKIGLAIAAAVGAIGGNAEAGLEVLATIEADQVERAQAMIDAGNVSVS-RTQTDEFIYC 124
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWDF 187
+ +G T IS HT V + +G+++ K+V A + +++ ++D+
Sbjct: 125 YVKAHSGDDVVTVEISGGHTQVVEKTLNGEIVFAKEVSATTSTAGICDGVDISIAGIYDY 184
Query: 188 ATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKT 247
AT +I FIL+A N + E L YG VG+T+D+ + G + ++IL T
Sbjct: 185 ATQVSFEDIRFILDASELNTKLSAEGLANEYGLQVGRTIDKSIKDGFMSSDFANNILMHT 244
Query: 248 ASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYI 307
A+A DARMGGA +P MSN GSGNQGI AT PVV+ A + E L RAL +SHL AIYI
Sbjct: 245 AAASDARMGGASLPAMSNYGSGNQGIAATLPVVMTATHYQANDELLARALIMSHLGAIYI 304
Query: 308 KQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCAL 367
K + LSA CG V ++ + YL GG Y C+A++N++++ +GMVCDGAK +CA+
Sbjct: 305 KSHYPPLSAFCGNTVTSAAAAMAMVYLAGGSYQQSCFAIQNVMSDCSGMVCDGAKSTCAM 364
Query: 368 KVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
KV + +A+ + ML++++ +A+GI+ D++ +IRN+ L M D ++DIM+
Sbjct: 365 KVKTSTGSAVNAFMLAIQS-TAAQAQGIVADDVEHTIRNIGQLVTLGMGNTDTTIIDIMS 423
Query: 428 S 428
+
Sbjct: 424 A 424
>gi|24372981|ref|NP_717023.1| hypothetical protein SO_1403 [Shewanella oneidensis MR-1]
gi|24347131|gb|AAN54468.1|AE015583_13 conserved hypothetical protein [Shewanella oneidensis MR-1]
Length = 424
Score = 260 bits (664), Expect = 1e-67, Method: Composition-based stats.
Identities = 163/422 (38%), Positives = 259/422 (61%), Gaps = 6/422 (1%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+Q I ++++ V PA+GCTEP+A A A A LL P+ I+ +S N+ KN+MGV +PG
Sbjct: 6 QQYIQIINQVVKPALGCTEPIAAAYAAAVARTLLNDDPDSIAVQVSDNLYKNSMGVYVPG 65
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TG IGL IA + GAL G ++ LEV+ +TPE + + + + ++ ++ E E +Y
Sbjct: 66 TGKIGLAIAAAAGALAGNADAGLEVLASVTPEQVAQAQALIDAAKVKVERTE-TDEFIYC 124
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLL--DKQVPAEEGAPTDNKDIQLNLKMVW 185
+T +G ++A I HT + +G+++ D G+ D DI N++ ++
Sbjct: 125 CVTLTSGEQEAMVKICGGHTLIAEKRLNGELVFTADNAQAKATGSICDGVDI--NIESIY 182
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
FA P EI+FIL+A N ++E + YG VG+TM ++ G+ G + + I+
Sbjct: 183 RFAEEVPFEEIQFILKASELNSKLSDEGMSKPYGLEVGRTMKNGIAAGIIGEDLLNKIVM 242
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
TA+A DARMGGA +P MSN GSGNQGI AT PVV+ A+ + + E+L RAL +SHL AI
Sbjct: 243 LTAAASDARMGGANLPAMSNLGSGNQGIAATIPVVITAQCYKVSEEKLARALIMSHLGAI 302
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YIK + LSA CG V +S + YL GG + C+A++N+I++ +GMVCDGAK SC
Sbjct: 303 YIKSHYPPLSAFCGNTVTSAAASMAMVYLAGGSFEQSCFAIQNVISDSSGMVCDGAKASC 362
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A+ S ++++ + +V+ +GII ++K+I+N+ + + M+ D+ +++I
Sbjct: 363 AMKVSTSSSAAVRSFLMALSSQNVS-GQGIIANHVEKTIKNIGKMVLNGMSSTDVTIINI 421
Query: 426 MT 427
M+
Sbjct: 422 MS 423
>gi|150392345|ref|YP_001322394.1| protein of unknown function DUF1063 [Alkaliphilus metalliredigens
QYMF]
gi|149952207|gb|ABR50735.1| protein of unknown function DUF1063 [Alkaliphilus metalliredigens
QYMF]
Length = 447
Score = 259 bits (662), Expect = 3e-67, Method: Composition-based stats.
Identities = 155/425 (36%), Positives = 239/425 (56%), Gaps = 5/425 (1%)
Query: 6 IREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLG---QKPEKISAFLSANILKNAMG 62
++ I++ + +VVPA+GCTEP+AVAL A+A E+ G + + + +S N+ KN +
Sbjct: 1 MKRLILETLKAEVVPAIGCTEPIAVALACAKAREIAGVSIDEVDHVDVIVSPNVYKNGLA 60
Query: 63 VGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGIT 122
VG+P T IGL IA +LG GK L+V++ + + + + I + +K+
Sbjct: 61 VGVPHTEHIGLAIAAALGLTGGKCHQGLQVLEGMKKTEQDIAVSLMDQGLISLDIKD-TN 119
Query: 123 EKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLK 182
EK+YIE+ G KA II + H FVY E G VLLD + A N + +K
Sbjct: 120 EKVYIEVILSIQGWKAKVIIKERHNQFVYLEKQGHVLLDSKTVPGVIASHQNPLYHMEIK 179
Query: 183 MVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSH 242
+ P E+ F+++ N A L+ G VG T + +G+ + I +
Sbjct: 180 EIIAIIEQIPHEELAFMMDGVEMNKKMAMTGLQPGVGMGVGYTYYDNMKKGILSDDIMNQ 239
Query: 243 ILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHL 302
+ TA+A DARM G+++PVMS++GSGN GI A P+V + + + E++ +AL +SHL
Sbjct: 240 AMMLTAAASDARMSGSILPVMSSNGSGNNGITAILPIVAYGMKFQVEDEKMAKALAISHL 299
Query: 303 TAIYIKQNLGALSALCGC-IVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGA 361
YIK +G LSALCGC + A TG+S I +LMG I +KNM+AN++GM+CDGA
Sbjct: 300 MNSYIKHYIGRLSALCGCGVAAGTGASVAIAWLMGAKEQQIDGVIKNMLANVSGMICDGA 359
Query: 362 KPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIM 421
K CALK+++ AI SA+L+M++H V GI+ + + +I NL L ++ M D
Sbjct: 360 KVGCALKLATSAQVAIQSALLAMDHHIVPTGNGIVAETAEGTIENLRILSEEGMQLTDHA 419
Query: 422 VLDIM 426
+L +M
Sbjct: 420 ILSVM 424
>gi|117619552|ref|YP_856155.1| hypothetical protein AHA_1619 [Aeromonas hydrophila subsp.
hydrophila ATCC 7966]
gi|117560959|gb|ABK37907.1| conserved hypothetical protein [Aeromonas hydrophila subsp.
hydrophila ATCC 7966]
Length = 429
Score = 254 bits (648), Expect = 1e-65, Method: Composition-based stats.
Identities = 159/432 (36%), Positives = 252/432 (58%), Gaps = 11/432 (2%)
Query: 4 KNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGV 63
K +Q + ++ + V PA+GCTEP+A A A A LG KP ++ +S N+ KN+MGV
Sbjct: 2 KQAWQQYLQIIQQVVKPALGCTEPIAAAYAAAVAARQLGCKPVRLEVVVSDNLYKNSMGV 61
Query: 64 GIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITE 123
+PGTG IGL IA + GA+ G +E LEV+ +TP + E ++ + ++ + + E
Sbjct: 62 YVPGTGKIGLAIAAAAGAIGGNAEAGLEVLAAITPAQVAEAQELIDAGQVQVS-RTSAPE 120
Query: 124 KLY-----IEMTCEAGGKKATAIISKTHTNFVYEEADGKVLL--DKQVPAEEGAPTDNKD 176
+Y + + E A + HT V + + +V ++ GA D D
Sbjct: 121 FIYCRVRLLGLDAEGTEHSAEVTLCGGHTRIVEQRCNDEVTFTAEQGQGGATGAICDGVD 180
Query: 177 IQLNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFG 236
I ++ + +FAT +I FIL+A N ++E + YG +G+TM + + GL G
Sbjct: 181 I--SIAAIHEFATQVEFEQIRFILQASELNGKLSDEGMNNPYGLEIGRTMQQNIQAGLIG 238
Query: 237 NSIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRA 296
+ + I+ +TA+A DARMGGA +P MSN GSGNQGI AT PVVV A+ + E+L RA
Sbjct: 239 EDVMNRIVMRTAAASDARMGGASLPAMSNFGSGNQGIAATIPVVVIAERFGASEEQLARA 298
Query: 297 LTLSHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGM 356
L +SHL AIYIK + LSA CG V ++ + YL GG + C+A++N++++ GM
Sbjct: 299 LIMSHLGAIYIKSHYPPLSAFCGNTVTSAAAAMAMVYLAGGSFEQSCHAIQNVLSDSAGM 358
Query: 357 VCDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMN 416
VCDGAK SCA+KVS+ A+ ++++ +H V+ +GI+ +++++IRN+ + KD M+
Sbjct: 359 VCDGAKASCAMKVSTSSGAAVRGFLMALNSHGVS-GQGIVAGNVEQTIRNVGQMVKDGMS 417
Query: 417 EMDIMVLDIMTS 428
D ++DIM++
Sbjct: 418 ATDSTIIDIMSA 429
>gi|42527647|ref|NP_972745.1| hypothetical protein TDE2144 [Treponema denticola ATCC 35405]
gi|41818475|gb|AAS12664.1| conserved hypothetical protein [Treponema denticola ATCC 35405]
Length = 431
Score = 251 bits (642), Expect = 5e-65, Method: Composition-based stats.
Identities = 147/427 (34%), Positives = 249/427 (58%), Gaps = 3/427 (0%)
Query: 2 LEKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAM 61
L+K+ E+ + ++ ++VPA+GCTEP+A+A A +++G P++I S NI+KNA
Sbjct: 3 LDKSKHEKYVQILREELVPALGCTEPIAIAYTAANLRKIMGGIPDEILIESSGNIIKNAK 62
Query: 62 GVGIPGTG-MIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEG 120
V +P TG M G+ + +G + G ++ LEV+ D+T E ++ +Y++++ +KL +
Sbjct: 63 SVIVPNTGGMKGMEASALIGLIGGNADKGLEVLADVTEEHVKLAHEYLAKSCTKLKLMD- 121
Query: 121 ITEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLN 180
L+I +T + G A + HTN V + + +++ +K E A LN
Sbjct: 122 TPASLHIRITGKLNGDTGVAELIHQHTNIVLLKKNDEIIFEKPFSLESAAGALTDRTCLN 181
Query: 181 LKMVWDFATTTPINEIE-FILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSI 239
+K + DFA T P++E+ I+ YNM +E+ LK +YG GK + + + S+
Sbjct: 182 VKDILDFADTVPVDEVSPIIMRQVEYNMRVSEDGLKTSYGIETGKNILKYNQKKGDDFSV 241
Query: 240 YSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTL 299
+ A+A DARM G PV++NSGSGNQG+ + PVVV+A+EN+ + E+L+R L +
Sbjct: 242 KVQAEGEVAAASDARMCGCSYPVITNSGSGNQGLAVSVPVVVYARENKISEEKLIRCLIV 301
Query: 300 SHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCD 359
S+L AI+ K +G LSA CG + A + ITY+ GG Y +C + N + ++G++CD
Sbjct: 302 SNLLAIHQKTGIGRLSAYCGAVTAGAACAAAITYMKGGSYEQVCGTIVNTLGTVSGILCD 361
Query: 360 GAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMD 419
GAK SCA K++S + +A+ S L+M+ + +GI+ DI+K+I + + M++ D
Sbjct: 362 GAKQSCAAKIASALDSALFSHELAMDGNFFAGGDGIVKDDIEKTIAGIGVVAAQGMHKTD 421
Query: 420 IMVLDIM 426
+VL +M
Sbjct: 422 EVVLQVM 428
>gi|153854136|ref|ZP_01995444.1| hypothetical protein DORLON_01435 [Dorea longicatena DSM 13814]
gi|149753185|gb|EDM63116.1| hypothetical protein DORLON_01435 [Dorea longicatena DSM 13814]
Length = 429
Score = 247 bits (631), Expect = 1e-63, Method: Composition-based stats.
Identities = 148/428 (34%), Positives = 237/428 (55%), Gaps = 8/428 (1%)
Query: 7 REQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIP 66
+E++I L+ VVPA+GCTEP+ VALC A A ++ K I ++A I KN M GIP
Sbjct: 4 KEEMITLLKNDVVPALGCTEPVCVALCAANAGKMTENKIRSIEVEVNAGIYKNGMSAGIP 63
Query: 67 GTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLY 126
G +GLP A +LGA + E LE+++D+TPE LE+ K+ + +K+KE + LY
Sbjct: 64 GCDYVGLPYAAALGAYLKNPEKGLELLEDITPEILEQMKELCGMAAVSVKIKEQ-EQGLY 122
Query: 127 IEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI----QLNLK 182
++ + T++I THTN VY E +GK++ +K E G +DN I Q+ +
Sbjct: 123 VKCKIKTEADMITSVIRGTHTNLVYLEKNGKIIYEKN--QENGQASDNALIETLKQMTIA 180
Query: 183 MVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSH 242
+ A T E+ F+++ N A + G + T+ + N + +
Sbjct: 181 QIRQVADTASEEELHFLMDGVEMNERLAAYSEDKKVGVGIADTLRSEKGSEVLKNDLLTR 240
Query: 243 ILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHL 302
I+ K +SA ++R+ G +P MS+SG+G +G+ PV A + E+ VRAL ++HL
Sbjct: 241 IMLKVSSAAESRLDGCPLPTMSSSGAGTKGLVVILPVSEAADALGVSMEKKVRALAIAHL 300
Query: 303 TAIYIKQNLGALSALCGCIVA-CTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGA 361
YI +G LS +C C++A T +S GI YL+GG + YAV+NM +TGM+CDG
Sbjct: 301 VNRYINAYIGKLSPMCSCVMASSTAASVGIAYLLGGSDEQLGYAVRNMSGTVTGMICDGG 360
Query: 362 KPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIM 421
K CA+KV++G S A+L A+ ++ + + ++GI + + IR++ +G M + D
Sbjct: 361 KVGCAMKVATGSSAALLCALTAVHDAPLRVSDGICAETPEDCIRHMAQIGNQGMAQTDKE 420
Query: 422 VLDIMTSK 429
++ IM K
Sbjct: 421 IIHIMEQK 428
>gi|154502833|ref|ZP_02039893.1| hypothetical protein RUMGNA_00647 [Ruminococcus gnavus ATCC 29149]
gi|153796716|gb|EDN79136.1| hypothetical protein RUMGNA_00647 [Ruminococcus gnavus ATCC 29149]
Length = 431
Score = 244 bits (622), Expect = 1e-62, Method: Composition-based stats.
Identities = 157/432 (36%), Positives = 252/432 (58%), Gaps = 13/432 (3%)
Query: 2 LEKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAM 61
+++ I + + ++ +++VPA+GCTEP+A+A +A+A E+LG+ PE ++ + S NI+KN
Sbjct: 1 MDRTIYDNYVKILRKELVPALGCTEPIALAYASAKAREVLGEFPEHMTVWCSGNIIKNVK 60
Query: 62 GVGIPGTG-MIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEG 120
GV +P +G M G+ A LG G LEV++ +T ++ K+ + E+ D LKEG
Sbjct: 61 GVKVPNSGGMKGVEAAAVLGLAGGDPSQALEVLEAVTQTDIKRTKELLRESFCDCCLKEG 120
Query: 121 ITEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLL--DKQVPAEEGAPTDNKDIQ 178
+ LYIE+ G +AT II + HTN E +GK++ K+V EE +
Sbjct: 121 VA-NLYIEVQVVNGENEATVIIEQEHTNITRIEKNGKIVYAHKKEVSGEE---IEVDKSL 176
Query: 179 LNLKMVWDFATTTPINEIEFILEAK-RYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGN 237
LNL + FA +NE+ +L + RYN A+E L+ +G VG+ + FG
Sbjct: 177 LNLADILVFAQEVDLNEVRDVLARQIRYNSRIAKEGLEHEWGAQVGRVIAEE-----FGT 231
Query: 238 SIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRAL 297
++ ++ A+ DARM G +PV+ NSGSGNQG+ + PV+ + KE + + EE+ RAL
Sbjct: 232 TVQWKAVASAAAGSDARMSGCSLPVIINSGSGNQGMTCSLPVIEYGKELKKSEEEIYRAL 291
Query: 298 TLSHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMV 357
+S+L A+ K+ +G+LSA CG + A G+ GITYL GG+ I V N IA+ G+V
Sbjct: 292 CVSNLVALNQKRYIGSLSAYCGAVCAAAGAGAGITYLCGGNLEQIQNTVVNTIADAGGIV 351
Query: 358 CDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNE 417
CDGAKPSCA K+++ + AILS ++M EG++ + +I+ + +G+ M +
Sbjct: 352 CDGAKPSCAAKIATSLQAAILSHKMAMRGLVFGSGEGLVMDCPEDTIKAVGYVGRAGMKQ 411
Query: 418 MDIMVLDIMTSK 429
D+ +L++M K
Sbjct: 412 TDVEILNLMIGK 423
>gi|106887868|ref|ZP_01355140.1| Protein of unknown function DUF1063 [Clostridium phytofermentans
ISDg]
gi|106764620|gb|EAT21427.1| Protein of unknown function DUF1063 [Clostridium phytofermentans
ISDg]
Length = 417
Score = 243 bits (620), Expect = 2e-62, Method: Composition-based stats.
Identities = 155/421 (36%), Positives = 249/421 (59%), Gaps = 10/421 (2%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+++ID++ ++VPA+GCTEP+A+AL +A+A E+LG+ P++++ S+NI+KNA V +P
Sbjct: 3 KKMIDILKAELVPALGCTEPIAIALASAKAREVLGEMPDELTVECSSNIIKNAKSVVVPM 62
Query: 68 T-GMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLY 126
T + G+ A +G + G + +LEV+ +T E LEE K+ ++ K + TEKL+
Sbjct: 63 TKNLKGIEAAAIVGLIGGDANKKLEVLTTVTEEDLEETKRLLATGFCQTKFLQ-TTEKLH 121
Query: 127 IEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWD 186
I + + G + + KTHT E DG V ++++ + + D L++ + D
Sbjct: 122 IIVRMKKGENSSLVELVKTHTGIARIEKDGVVTFEEEIEDCDDSTVDYS--VLSVASILD 179
Query: 187 FATTTPINEIEFILEAK-RYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILS 245
FA I E+ ILE + YN A+E L+ YG VG T+ ++G+ + +
Sbjct: 180 FANQVDIEEVRPILERQIEYNTKIAKEGLRNTYGVNVGSTL-----LDVYGDDVKIRAKA 234
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
A+ DARM G +PV+ NSGSGNQG+ + PV+ F K + E+++RAL +S+L AI
Sbjct: 235 MPAAGSDARMNGCELPVIINSGSGNQGMTVSLPVIEFGKMLDVEDEKILRALIISNLIAI 294
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
Y K +G LSA CG + A G+ GITYL GG+ I + N +AN++G+VCDGAK SC
Sbjct: 295 YQKSEIGRLSAYCGAVSAAAGAGAGITYLYGGNEEQINQTIINTLANVSGIVCDGAKSSC 354
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A K++S V AI++ M+S++ +GI+ I K+I + +L K+ M E D +++ I
Sbjct: 355 AAKIASAVDAAIVATMISLKGKGFLSGDGIVKDTIQKTIDGVVTLAKEGMQETDEVIVKI 414
Query: 426 M 426
M
Sbjct: 415 M 415
>gi|126700852|ref|YP_001089749.1| hypothetical protein CD3232 [Clostridium difficile 630]
gi|115252289|emb|CAJ70130.1| conserved hypothetical protein [Clostridium difficile 630]
Length = 427
Score = 242 bits (618), Expect = 3e-62, Method: Composition-based stats.
Identities = 167/425 (39%), Positives = 249/425 (58%), Gaps = 14/425 (3%)
Query: 11 IDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTGM 70
++++ +V PAVGCTEP+A+AL A+A ELLG++ + +S +I KN M VGIPGT
Sbjct: 1 MEMLKAEVKPAVGCTEPVALALACAKAKELLGEEIVENRMLVSPSIYKNGMCVGIPGTER 60
Query: 71 IGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEMT 130
+GL IA +LG + G SE L V++ LT E ++ + Y+ + I + EK++IE+
Sbjct: 61 LGLKIAAALGIVGGHSENGLSVLETLTKEEVKIAEDYMDNTPLSITPAD-TREKVFIEVV 119
Query: 131 CEAGGKKATAIISKTHTNFVYEEADGKVLLDKQ--VPAEEGAP------TDNKDIQLNLK 182
+ A I H NF + E DG+VLLD + V A A D IQ +K
Sbjct: 120 LKGKNHIAKVRIRTKHDNFTFLEKDGEVLLDNEPKVSASNDAAEKAESLMDTVTIQELIK 179
Query: 183 MVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSH 242
V + +IEF+L+ + N AE LK G VG + + + GL GN + ++
Sbjct: 180 NVEEI----DFKDIEFLLDGVKMNEEMAEYGLKQKTGIGVGYGIKKSIEEGLLGNDVINY 235
Query: 243 ILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHL 302
+ TA A DARM G +PVMS++GSGN G+ A P+V + K+ + E L +AL +SHL
Sbjct: 236 AMMLTAGASDARMAGVKMPVMSSNGSGNHGLTAILPIVAYNKKFPQSDERLAKALAISHL 295
Query: 303 TAIYIKQNLGALSALCGC-IVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGA 361
YIK G LSA+CGC + A TG++ GI++LM G I A++NMIA+L+GM+CDGA
Sbjct: 296 VTGYIKNYTGRLSAVCGCGVAASTGATAGISWLMNGTEKQIEGAIENMIADLSGMICDGA 355
Query: 362 KPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIM 421
K CALK+SS S AI SA+++ ++ V GI+ +++SI+NL + M+ D +
Sbjct: 356 KAGCALKLSSAASAAIQSAIIAKQDCFVPPLNGIVGSSVEQSIQNLGRVSDKGMSITDEI 415
Query: 422 VLDIM 426
+L++M
Sbjct: 416 ILNVM 420
>gi|154498241|ref|ZP_02036619.1| hypothetical protein BACCAP_02229 [Bacteroides capillosus ATCC
29799]
gi|150272788|gb|EDM99956.1| hypothetical protein BACCAP_02229 [Bacteroides capillosus ATCC
29799]
Length = 430
Score = 238 bits (608), Expect = 5e-61, Method: Composition-based stats.
Identities = 156/429 (36%), Positives = 238/429 (55%), Gaps = 4/429 (0%)
Query: 2 LEKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAM 61
+++ + + D M + A GCTEP+A+A A A E+I SANI+KNA
Sbjct: 1 MDQQTMQVLRDAMAAGMKVATGCTEPVAIAFAGATARAQTSGAIERIMLRASANIIKNAF 60
Query: 62 GVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGI 121
VGIPGT G A+++GA+ G +LE+++ L P + + Q V + ++++ E
Sbjct: 61 VVGIPGTEFTGPKYAVAIGAVCGDPSRELELLEGLDPAGVAQAAQLVQDGKVELDRAE-T 119
Query: 122 TEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNL 181
KLYIE+ + +HT+ +G V+ AE T+ + L
Sbjct: 120 PAKLYIEVELHTAADTVVVRVVGSHTHVDSIVKNGVVVYQNSCAAEGADHTEG--LNCRL 177
Query: 182 KMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYS 241
V++F TT P+ E + +A N A+E L +YG VGKT+ + G N +
Sbjct: 178 ADVYEFCTTAPLEEFSLVEQAITLNARIAQEGLTNDYGLQVGKTIREDVRAGALANDTTN 237
Query: 242 HILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVV-VFAKENENTPEELVRALTLS 300
+ ++ +A DARM G +PVMSNSGSGNQGI AT PVV V+ + + E+L+RA LS
Sbjct: 238 YAMTLAGAAADARMAGVDLPVMSNSGSGNQGIAATMPVVAVWERVDGKDREKLIRACALS 297
Query: 301 HLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDG 360
+L IYIK G LSALCG +VA TG + + +L GG I A+ N++ N+ GM+CDG
Sbjct: 298 NLITIYIKSRFGVLSALCGAVVASTGVASAVVWLRGGGLGEIACAISNVLGNVAGMLCDG 357
Query: 361 AKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDI 420
AK SCALK+S+ + A+L+A L+M + V EGI+++ + +I N LG + +E+D
Sbjct: 358 AKASCALKISTCTNAAMLAATLAMRHLRVASNEGIVEQRPEYTIDNFALLGNEGSDELDR 417
Query: 421 MVLDIMTSK 429
++LD++ K
Sbjct: 418 LILDMIIHK 426
>gi|81427766|ref|YP_394765.1| hypothetical protein LSA0156 [Lactobacillus sakei subsp. sakei 23K]
gi|78609407|emb|CAI54453.1| Hypothetical protein [Lactobacillus sakei subsp. sakei 23K]
Length = 431
Score = 237 bits (604), Expect = 1e-60, Method: Composition-based stats.
Identities = 156/428 (36%), Positives = 241/428 (56%), Gaps = 8/428 (1%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARA-TELLGQKPEKISAFLSANILKNAMGVGIP 66
+ I + VVPA GCTEP+AVA A +L ++ + I +S N++KNA+ V +P
Sbjct: 6 KHFIQALKNGVVPATGCTEPIAVAFGAATCMAQLTNREIQAIEVHVSPNVMKNALAVMVP 65
Query: 67 GTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLY 126
GTG GL +A + GA+ G + L VI+ L L ++ K + + LY
Sbjct: 66 GTGEPGLLVAAAAGAIAGDATVGLSVIEGLRATDLPTILDLAHSGKVTAKTAL-VPDDLY 124
Query: 127 IEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQ-LNLKMVW 185
+E+T + T I+ +HTN + D ++L+D PA + + +Q +N K VW
Sbjct: 125 VEVTIKDLENTVTVAIAGSHTNIFSLKKDDQILIDHPRPAAHASSESKQFLQTMNFKAVW 184
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFG----NSIYS 241
DFA P+ + F+ EA N+ A++ +YG +G++++ R FG N + +
Sbjct: 185 DFAMNEPLEHLRFMKEAHTLNLALAQDGFTHDYGLQLGQSINGA-KRNHFGSGTENDLGN 243
Query: 242 HILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSH 301
+++ A+A DARMGGA +P MSNSGSGNQGI AT PV V A + T E+L+RA LSH
Sbjct: 244 RMIAYAAAASDARMGGAQLPAMSNSGSGNQGITATIPVSVAADAVQATEEQLIRAQALSH 303
Query: 302 LTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGA 361
LTA+YI L LSA C A G++ G+ YL G Y + C+A++NM + GM+CDGA
Sbjct: 304 LTALYIHSFLPVLSAFCATDSAAMGAAAGVVYLYDGTYEDACHAIQNMAGDAAGMICDGA 363
Query: 362 KPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIM 421
+CA+KV++ VS+ L+++ + + GI+ ID++I + LG + + E D +
Sbjct: 364 GCACAMKVATAVSSMYRGVNLALQGIVIPNSNGIVCPTIDETIHGIGRLGTEGLRETDPV 423
Query: 422 VLDIMTSK 429
+LDIM +K
Sbjct: 424 ILDIMLNK 431
>gi|89893855|ref|YP_517342.1| hypothetical protein DSY1109 [Desulfitobacterium hafniense Y51]
gi|109646581|ref|ZP_01370485.1| protein of unknown function DUF1063 [Desulfitobacterium hafniense
DCB-2]
gi|89333303|dbj|BAE82898.1| hypothetical protein [Desulfitobacterium hafniense Y51]
gi|109641827|gb|EAT51381.1| protein of unknown function DUF1063 [Desulfitobacterium hafniense
DCB-2]
Length = 430
Score = 234 bits (596), Expect = 1e-59, Method: Composition-based stats.
Identities = 150/423 (35%), Positives = 247/423 (58%), Gaps = 13/423 (3%)
Query: 11 IDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGT-G 69
++++ ++V A+GCTEP+A+A A+A E+LG P K+ +S+NI+KN V +P T G
Sbjct: 14 LNILKEELVAAMGCTEPIAIAYGAAKAREVLGAVPHKVVLEVSSNIIKNVKSVVVPNTDG 73
Query: 70 MIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEG-ITEKLYIE 128
+ G+ A + G + G+S+ LEVI ++ ++ K Y+SE I++KL + I + I
Sbjct: 74 LKGIEAATAAGIIAGRSDKILEVIAEVCQAEKQQIKTYLSETDIEVKLADSQIIFDIMIT 133
Query: 129 MTCEAGGKKATAIISKTHTNFVYEEADGKVLLDK-QVPAEEGAPTDNKDIQLNLKMVWDF 187
M + K I+ HT+ V+ E +G+++ + A + TD K L++ + +F
Sbjct: 134 MFHQDSYVKLR--IADYHTHIVHIEKNGEIIFGTGDLDAGISSLTDRK--LLSVSKIIEF 189
Query: 188 ATTTPINEIEFILEAK-RYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSK 246
A + I +++ +L+ + YN A ++GNYG VG+ + + +GN + +
Sbjct: 190 ADSVRIEDVKELLDKQIEYNSAIARAGMEGNYGANVGRVLLKT-----YGNDVKIRAKAM 244
Query: 247 TASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIY 306
A+ DARM G +PV+ NSGSGNQG+ A+ PV+ +A E ++ E+L RAL +S+L ++
Sbjct: 245 AAAGSDARMSGCELPVIINSGSGNQGMTASLPVIEYAAELQSGEEKLYRALVVSNLITLH 304
Query: 307 IKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCA 366
+K +G LSA CG I A GS GI YL GG Y+ I + + N A ++G+VCDGAKPSCA
Sbjct: 305 LKTGIGRLSAFCGVICAGCGSGAGIAYLHGGGYDEIAHTIVNAAAIVSGIVCDGAKPSCA 364
Query: 367 LKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIM 426
K+++ V IL ++ E +GI+ K I+ +I N+ LGK M E D +++IM
Sbjct: 365 GKIAAAVDAGILGYLMYKEGQQFRGGDGIVAKGIENTIANIGYLGKVGMKETDKEIINIM 424
Query: 427 TSK 429
++
Sbjct: 425 LNQ 427
>gi|153941420|ref|YP_001391289.1| hypothetical protein CLI_2031 [Clostridium botulinum F str.
Langeland]
gi|152937316|gb|ABS42814.1| conserved hypothetical protein [Clostridium botulinum F str.
Langeland]
Length = 430
Score = 231 bits (588), Expect = 1e-58, Method: Composition-based stats.
Identities = 140/424 (33%), Positives = 233/424 (54%), Gaps = 2/424 (0%)
Query: 7 REQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIP 66
+E ++ L+ ++VVPA+GCTEP+ VAL TA A +G + I ++ I KN M VGIP
Sbjct: 4 KENLLTLLKQEVVPALGCTEPVCVALATADAYHAIGGRIVSIKIEVNPGIYKNGMSVGIP 63
Query: 67 GTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLY 126
G +GL A SLGA+IG E +LE+++D+T E ++ + V +++ + +K +LY
Sbjct: 64 GFDRVGLKYAASLGAVIGNPEKKLELLEDITAEVSQKAIKIVENSQVVVVIKHE-EAQLY 122
Query: 127 IEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWD 186
+ + I TH+N ++ + + +LL K+ A+ + + + +
Sbjct: 123 VRAEIITTAGMGISEIRGTHSNIIFTKRNNDMLLQKEYSADSDDSLHQQLKLMGIAEIRK 182
Query: 187 FATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSK 246
E+ F+L+ N A+ L+ + G + + ++ + G++++S + +
Sbjct: 183 LIDECKEEELSFLLDGVDMNERLADYGLEHSLGIGIASALQEKMTTDIMGDNLFSRTMLR 242
Query: 247 TASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIY 306
AS+ + RM G VMS++GSGN GI A PV A+ ++ E+LV+AL SH +Y
Sbjct: 243 VASSAEGRMSGCPYAVMSSAGSGNHGITAILPVTEMARYLNSSREQLVKALAFSHTLNVY 302
Query: 307 IKQNLGALSALCGC-IVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
IK G LSA CGC + A T +S + +LMGG+ + I A+ NM NLTGM+CDG K C
Sbjct: 303 IKLFTGKLSATCGCGVSAATAASAAMVWLMGGNDHQIANAIINMSGNLTGMICDGGKIGC 362
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
ALK+++ + A++ A L+M + + ++GI D ++ IRN+ + M E D +L I
Sbjct: 363 ALKLATATNAALMCAYLAMSDVALQPSDGICDVTAEQVIRNMGQVSNPGMVETDQTILSI 422
Query: 426 MTSK 429
M K
Sbjct: 423 MIEK 426
>gi|148379926|ref|YP_001254467.1| hypothetical protein CBO1964 [Clostridium botulinum A str. ATCC
3502]
gi|153933718|ref|YP_001384223.1| hypothetical protein CLB_1903 [Clostridium botulinum A str. ATCC
19397]
gi|153935143|ref|YP_001387764.1| hypothetical protein CLC_1909 [Clostridium botulinum A str. Hall]
gi|148289410|emb|CAL83506.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC
3502]
gi|152929762|gb|ABS35262.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC
19397]
gi|152931057|gb|ABS36556.1| conserved hypothetical protein [Clostridium botulinum A str. Hall]
Length = 430
Score = 229 bits (584), Expect = 3e-58, Method: Composition-based stats.
Identities = 139/424 (32%), Positives = 232/424 (54%), Gaps = 2/424 (0%)
Query: 7 REQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIP 66
+E ++ L+ ++VVPA+GCTEP+ VAL TA A +G + I ++ I KN M VGIP
Sbjct: 4 KENLLALLKQEVVPALGCTEPVCVALATADAYHAIGGRIVSIKIEVNPGIYKNGMSVGIP 63
Query: 67 GTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLY 126
G +GL A SLGA+IG E +LE+++D+T E ++ + V +++ + +K +LY
Sbjct: 64 GFDRVGLKYAASLGAVIGNPEKKLELLEDITAEVSQKAIKIVENSQVVVVIKHE-EAQLY 122
Query: 127 IEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWD 186
+ + I TH+N ++ + + +LL K+ + + + + +
Sbjct: 123 VRAEIITTAGMGISEIRGTHSNIIFTKRNNDMLLQKEYSVDSDDSLHQQLKLMGIAEIRK 182
Query: 187 FATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSK 246
E+ F+L+ N A+ L+ + G + + ++ + G++++S + +
Sbjct: 183 LIDECKEEELSFLLDGVDMNERLADYGLEHSLGIGIASALQEKMTTDIMGDNLFSRTMLR 242
Query: 247 TASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIY 306
AS+ + RM G VMS++GSGN GI A PV A+ ++ E+LV+AL SH +Y
Sbjct: 243 VASSAEGRMSGCPYAVMSSAGSGNHGITAILPVTEMARYLNSSREQLVKALAFSHTLNVY 302
Query: 307 IKQNLGALSALCGC-IVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
IK G LSA CGC + A T +S + +LMGG+ + I A+ NM NLTGM+CDG K C
Sbjct: 303 IKLFTGKLSATCGCGVSAATAASAAMVWLMGGNEHQIANAIINMSGNLTGMICDGGKIGC 362
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
ALK+++ + A++ A L+M + + ++GI D ++ IRN+ + M E D +L I
Sbjct: 363 ALKLATATNAALMCAYLAMSDVALQPSDGICDVTAEQVIRNMGQVSNPGMVETDQTILSI 422
Query: 426 MTSK 429
M K
Sbjct: 423 MIEK 426
>gi|153814330|ref|ZP_01966998.1| hypothetical protein RUMTOR_00540 [Ruminococcus torques ATCC 27756]
gi|145848726|gb|EDK25644.1| hypothetical protein RUMTOR_00540 [Ruminococcus torques ATCC 27756]
Length = 436
Score = 227 bits (578), Expect = 1e-57, Method: Composition-based stats.
Identities = 157/435 (36%), Positives = 248/435 (57%), Gaps = 14/435 (3%)
Query: 2 LEKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAM 61
+ + I + + ++ ++VPA+GCTEP+A+A A+A E+LG+ P+ I+ S NI+KN
Sbjct: 1 MNRKIYNEYVTILESELVPALGCTEPIALAYAAAKAKEVLGKMPDHITMRCSGNIIKNVK 60
Query: 62 GVGIPGTG-MIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEG 120
GV +P +G M G+ A LG G LEV++ +T ++E ++ + D LK+
Sbjct: 61 GVKVPNSGGMKGVEAAAVLGITGGDPSQALEVLEHVTDREIDEAEKLLKAGFCDCVLKDD 120
Query: 121 ITEKLYIEM--TCEAGGK-KATAIISKTHTNFVYEEADGKVLLDKQVP--AEEGAPTDNK 175
+ LYIE C+ K +A +I HTN + E DG+VL K+ +E T +K
Sbjct: 121 VA-NLYIEAYAVCKKTEKSEALVVIEDEHTNITHIEKDGQVLFHKEKKEYCQEREKTPDK 179
Query: 176 DIQLNLKMVWDFATTTPINEIEFILEAK-RYNMNAAEEALKGNYGHCVGKTMDRPLSRGL 234
+ LNL+ + FA I ++E +L + +YN AEE L+ +G VG+ +
Sbjct: 180 SL-LNLEDIITFANEVQITDVEKVLGRQIKYNTRIAEEGLRNPWGAQVGRVVLEE----- 233
Query: 235 FGNSIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELV 294
FG + ++K ++ DARM G +PV+ NSGSGNQG+ + PV+ F KE + + EE+
Sbjct: 234 FGEDVKWRAVAKASAGSDARMSGCALPVIINSGSGNQGMTCSLPVIEFGKELKKSKEEIY 293
Query: 295 RALTLSHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLT 354
RAL +S+L A+ K+ +G+LSA CG + A G+ GITYL GG I V N IA+
Sbjct: 294 RALCVSNLVALNQKKYIGSLSAYCGAVCAAAGAGAGITYLCGGTLEQIENTVVNTIADAG 353
Query: 355 GMVCDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDA 414
G+VCDGAKPSCA K+S+ + AILS ++M EG++ + +I+ + +G+
Sbjct: 354 GIVCDGAKPSCAAKISTALQAAILSHKMAMRGLTFARGEGLVMDCPEDTIKAVGYVGRAG 413
Query: 415 MNEMDIMVLDIMTSK 429
M + D+ +L++M K
Sbjct: 414 MKQTDVEILNLMIGK 428
>gi|153939751|ref|YP_001391333.1| hypothetical protein CLI_2075 [Clostridium botulinum F str.
Langeland]
gi|152935647|gb|ABS41145.1| conserved hypothetical protein [Clostridium botulinum F str.
Langeland]
Length = 434
Score = 226 bits (576), Expect = 2e-57, Method: Composition-based stats.
Identities = 151/428 (35%), Positives = 238/428 (55%), Gaps = 10/428 (2%)
Query: 3 EKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMG 62
++ I E++++L+ + PA+GCTEP+AVA A + + + KI +S NILKN
Sbjct: 6 KEEISERLLELIKDETKPAIGCTEPVAVAFTVATGKKYMAGEVLKIDLKVSKNILKNGKS 65
Query: 63 VGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGIT 122
V IP T + GL IA +LG + G E L V K++ + L++ ++ + + + E T
Sbjct: 66 VTIPNTEVCGLDIAGALGGICGDPEEGLFVFKNVNKDYLDKAREMIKNKVVTLNPIEN-T 124
Query: 123 EKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI----Q 178
+ +++E T + + AI+ HTN +GK+ +K E+ DNKD +
Sbjct: 125 DPVFVEATLKGEKDEVIAILEGGHTNIERIIVNGKIAFEKDNKNEK----DNKDCDFMKE 180
Query: 179 LNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNS 238
L+LK + + I ++ FI++ N AA+E LK G +G ++ + G G
Sbjct: 181 LSLKDIREITEDISIEKLGFIMDGIEMNKEAAKEGLKRQKGLTLGSSLLKLQQEGKLGKD 240
Query: 239 IYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALT 298
+ TA+ D RMGG M P+M++ GSGNQG+C P+ V A++ + E+L RA+
Sbjct: 241 SATIARILTAAGSDLRMGGGMCPIMTSGGSGNQGLCVILPITVVAEDIKAPKEKLQRAVF 300
Query: 299 LSHLTAIYIKQNLGALSALCGC-IVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMV 357
H ++K+ G LSA+CGC I A G++ GI +L+GG I A+ NM+ANLTGMV
Sbjct: 301 FGHAVNNFVKKYTGKLSAICGCAIAAGIGATAGIAWLLGGKDKEINGAILNMLANLTGMV 360
Query: 358 CDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNE 417
CDGAK SCA+K+S+ S A++SA L++ + V GII ++ +I NL L KD +
Sbjct: 361 CDGAKGSCAIKLSTSASEAVISAYLALNDIIVPNNTGIIGNTVEDTINNLGMLCKDGFYK 420
Query: 418 MDIMVLDI 425
D ++L I
Sbjct: 421 ADDVMLSI 428
>gi|51244443|ref|YP_064327.1| hypothetical protein DP0591 [Desulfotalea psychrophila LSv54]
gi|50875480|emb|CAG35320.1| conserved hypothetical protein [Desulfotalea psychrophila LSv54]
Length = 451
Score = 221 bits (563), Expect = 7e-56, Method: Composition-based stats.
Identities = 150/436 (34%), Positives = 243/436 (55%), Gaps = 12/436 (2%)
Query: 4 KNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQ-KPEKISAFLSANILKNAMG 62
+ ++ + +++ +V A+GCTEP+A+AL A A LL K + I + NI KN +
Sbjct: 14 RKMKFSVKNILEMEVCLALGCTEPVAIALGAAAAATLLPGLKFDHIHLIIDPNIYKNGLA 73
Query: 63 VGIPGTG-MIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGI 121
V IPG+G + GL A +LGA G + +LEV+ L+PE + +++E R+ + L+E
Sbjct: 74 VVIPGSGGLTGLDTASALGAFGGDAYGKLEVLSSLSPEMVARASAFLAEGRVKVDLRE-- 131
Query: 122 TEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQL-- 179
LY++ T GG A ++I+ HTN V DG+ + D ++ A E T NK +L
Sbjct: 132 ESGLYVKTTISGGGHVAESLITDVHTNIVSLMLDGEEVADPRLVATEAMSTGNKLAELEE 191
Query: 180 -----NLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGL 234
L+ + D +++F+ E ++N+ AE LK G +GK +DR L + L
Sbjct: 192 WLRSLTLEDILDLTNELDEADLDFLEEGVQHNLRLAEYGLKHGSGLGIGKDIDRLLKQKL 251
Query: 235 FGNSIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELV 294
+ + T++A DARM G +P MS+ GSGN G+ A P+ E E ++
Sbjct: 252 LVKDMTTSARMLTSAAADARMDGVNLPAMSSGGSGNHGLTAILPIWAIKDFIETDRESVL 311
Query: 295 RALTLSHLTAIYIKQNLGALSALCGC-IVACTGSSCGITYLMGGDYNNICYAVKNMIANL 353
RA+ LSH+ YIK + G LSA+CGC + A G++ GITYL+GGD + A+KN++ +L
Sbjct: 312 RAIGLSHIITAYIKAHTGRLSAVCGCSVAAGAGATAGITYLVGGDLQQVEGAIKNILEDL 371
Query: 354 TGMVCDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKD 413
G++CDGAK CA+K+++ A+ +A+ S++ V + +GII ++++N+ L
Sbjct: 372 AGVICDGAKAGCAIKLNTAAGAAVQAALFSLQGVSVKDTDGIIGDSTRQTVQNIGDLSNY 431
Query: 414 AMNEMDIMVLDIMTSK 429
M D +L IM +K
Sbjct: 432 GMVATDKTILKIMRAK 447
>gi|147920322|ref|YP_685905.1| hypothetical protein RCIX1285 [uncultured methanogenic archaeon
RC-I]
gi|110621301|emb|CAJ36579.1| conserved hypothetical protein [uncultured methanogenic archaeon
RC-I]
Length = 425
Score = 220 bits (560), Expect = 2e-55, Method: Composition-based stats.
Identities = 150/419 (35%), Positives = 226/419 (53%), Gaps = 13/419 (3%)
Query: 14 MHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTGMIGL 73
+ ++ GCT+P AV L A LG PEKI +S NI KN + VG+PGTGM GL
Sbjct: 13 LESEIERTTGCTDPGAVCLAVRAAAIELGVDPEKIVVTVSPNIYKNGINVGVPGTGMRGL 72
Query: 74 PIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEMTCEA 133
IA LGA+I + L ++ + P +E Q V+ R+ I E T LYI+ +
Sbjct: 73 HIAAGLGAVIKSTSSGLALLDAVDPADVERAVQLVNNGRVTITHAE-TTSALYIKAEVFS 131
Query: 134 GGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQL---NLKMVWDFATT 190
G A A+I +T+ V DGK +V + G P K L L+ ++ T
Sbjct: 132 GAHYAHAVIRDDYTSIVEVGLDGK-----KVSSPRGMPVKTKHESLKGYTLEELFTSIDT 186
Query: 191 TPINEIEFILEAKRYNMNAAEEALKGNYGHC-VGKTMDRPLSRGLFGNSIYSHILSKTAS 249
+ E+ F+ +A N AAE L+ G C +GK + L+ + + + TA+
Sbjct: 187 MTVEELTFLRDAAEVNRKAAEAGLES--GPCPLGKALYSGLAGAGVRHMAAARAQALTAA 244
Query: 250 ACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYIKQ 309
AC+ARM G VP+++ +GSGN GI + ++ A+ + E+L++AL +S + IK+
Sbjct: 245 ACEARMSGMQVPIIAIAGSGNHGIASFLGILAVAETLASPEEKLLKALAISSTVTVAIKE 304
Query: 310 NLGALSALCGCIVAC-TGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCALK 368
+ LSA CGC VA TG + G YL+GG Y I +A++++I L GMVCDGAK SCA K
Sbjct: 305 HSTKLSAFCGCAVAASTGVAAGTVYLLGGSYEEITHAMQSVIGTLAGMVCDGAKESCAFK 364
Query: 369 VSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
+SS V+ AI LS+E+ ++ E GI+ + I+K+ NL L M D ++L +++
Sbjct: 365 LSSSVALAIQFGHLSLEDAYIKEGMGIVSQSIEKTFENLGRLNNPGMVTADKLMLQMIS 423
>gi|148379970|ref|YP_001254511.1| membrane protein [Clostridium botulinum A str. ATCC 3502]
gi|153931516|ref|YP_001384269.1| hypothetical protein CLB_1949 [Clostridium botulinum A str. ATCC
19397]
gi|153935029|ref|YP_001387808.1| hypothetical protein CLC_1955 [Clostridium botulinum A str. Hall]
gi|148289454|emb|CAL83551.1| putative membrane protein [Clostridium botulinum A str. ATCC 3502]
gi|152927560|gb|ABS33060.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC
19397]
gi|152930943|gb|ABS36442.1| conserved hypothetical protein [Clostridium botulinum A str. Hall]
Length = 434
Score = 219 bits (559), Expect = 2e-55, Method: Composition-based stats.
Identities = 147/424 (34%), Positives = 234/424 (55%), Gaps = 2/424 (0%)
Query: 3 EKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMG 62
++ I E++++L+ + PA+GCTEP+AVA A + + + KI +S NILKN
Sbjct: 6 KEEISERLLELIKDETKPAIGCTEPVAVAFTVATGKKYMAGEVLKIDLKVSKNILKNGKS 65
Query: 63 VGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGIT 122
V IP T + GL IA +LG + G E L V +++ E L++ K+ + + + E T
Sbjct: 66 VTIPNTEVCGLDIAGALGEICGDPEEGLFVFRNVNNEYLDKAKEMIKNKVVTLNPIEN-T 124
Query: 123 EKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLK 182
+ +++E T + + AI+ HTN +GK+ +K ++ + +L+LK
Sbjct: 125 DPVFVEATLKGEQDEVIAILKGGHTNIEKVIVNGKIAFEKDNKNKKDNKDCDFIKELSLK 184
Query: 183 MVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSH 242
+ I +++FI++ N AA+E LK G +G ++ + G G +
Sbjct: 185 DIRQITEDISIEKLDFIMDGIEMNKEAAKEGLKRQKGLTLGSSLLKLQEEGKIGKDSATI 244
Query: 243 ILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHL 302
TA+ D RMGG M P+M++ GSGNQG+C P+ V A++ + E L RA+ H
Sbjct: 245 ARILTAAGSDLRMGGGMCPIMTSGGSGNQGLCVILPINVVAEDIKAPKERLQRAVFFGHA 304
Query: 303 TAIYIKQNLGALSALCGC-IVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGA 361
++K+ G LSA+CGC I A G++ GI +L+GG I A+ NM+ANLTGMVCDGA
Sbjct: 305 VNNFVKKYTGKLSAICGCAIAAGIGATAGIAWLLGGKDKEIEGAILNMLANLTGMVCDGA 364
Query: 362 KPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIM 421
K SCA+K+S+ S A++SA L++ + V GII ++ +I NL L KD + D +
Sbjct: 365 KGSCAIKLSTSASEAVISAYLALNDIIVPNNTGIIGNTVEDTINNLGMLCKDGFYKADDV 424
Query: 422 VLDI 425
+L I
Sbjct: 425 MLSI 428
>gi|77975571|ref|ZP_00831106.1| COG3681: Uncharacterized conserved protein [Yersinia frederiksenii
ATCC 33641]
Length = 426
Score = 217 bits (552), Expect = 2e-54, Method: Composition-based stats.
Identities = 154/431 (35%), Positives = 242/431 (56%), Gaps = 8/431 (1%)
Query: 2 LEKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAM 61
+ K +Q++D + +V ++GCTEP+A+A A A + L KI+ +S N+ KNAM
Sbjct: 1 MPKITHQQLLDWLKTEVKVSLGCTEPIAIAYAAAVAAKYLNGPTLKITGNISENLYKNAM 60
Query: 62 GVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGI 121
GV IPGT G+ +A ++GA+ G ++ +LEV+K++T E + + Q ++ ++ E I
Sbjct: 61 GVTIPGTSYSGVTLAAAIGAIGGNADAELEVLKNITVEQISQAYQLNESGQVCLEAVESI 120
Query: 122 TEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQL-- 179
+ +YI++T + + II HTN D + KQ E+ + K ++L
Sbjct: 121 -DFIYIDITLYSLCDQCRVIIQGGHTNI----TDVYINNIKQAITEDLMTGNEKGMKLPE 175
Query: 180 -NLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNS 238
+L ++F T +I F+L++ N ++E + NYG V + + G
Sbjct: 176 FSLNDAFEFITHVAAKDIIFMLKSAEINSALSDEGQRKNYGLNVNGALSQARKNGFISQD 235
Query: 239 IYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALT 298
+ S +L T +A DARMGGA +P MSN GSGNQGI T PVV AK E E L RAL
Sbjct: 236 LLSQMLINTTAASDARMGGAPLPAMSNYGSGNQGITVTMPVVTLAKHLEVNDETLARALA 295
Query: 299 LSHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVC 358
L+HL AI I LSALC A G++ G+++L+ +Y I YA+ NMI++++G++C
Sbjct: 296 LAHLAAISIHMRYTRLSALCAASTAAMGAAAGMSWLLTKNYQTISYAISNMISDISGIIC 355
Query: 359 DGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEM 418
DGA SCA+KVS+ ++A S +++ +N E +GI+ D++ SI NL L M +
Sbjct: 356 DGASNSCAMKVSTATASAFKSVLMAQQNSGAGERDGIVSCDVEGSINNLCQLVLSPMRQT 415
Query: 419 DIMVLDIMTSK 429
D ++ IMT K
Sbjct: 416 DKEIISIMTRK 426
>gi|153853582|ref|ZP_01994962.1| hypothetical protein DORLON_00952 [Dorea longicatena DSM 13814]
gi|149753737|gb|EDM63668.1| hypothetical protein DORLON_00952 [Dorea longicatena DSM 13814]
Length = 425
Score = 215 bits (548), Expect = 4e-54, Method: Composition-based stats.
Identities = 151/421 (35%), Positives = 243/421 (57%), Gaps = 11/421 (2%)
Query: 11 IDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTGM 70
+ +++ +++PA+GCTEP+A+A A+A E+LG P+++ S +I+KN V +P T
Sbjct: 12 VQILNEELIPAMGCTEPIALAYAAAKAREVLGCLPDRVCIGASGSIIKNVKSVIVPNTNH 71
Query: 71 I-GLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIK-LKEGITEKLYIE 128
+ G+P A + G + G +E +LEVI ++T E +E +++ I ++ + G + +E
Sbjct: 72 LKGIPAAAAAGIVAGNAEKELEVISEVTEEETKEIAEFLEHTEITVEHINNGCVFDIIVE 131
Query: 129 MTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWDFA 188
+ G KA I+ HTN V E +G++LL+ V E ++ + L++K +WDFA
Sbjct: 132 VF--HGEDKAKVRIANYHTNIVRIEKNGEILLNVPVAGESEEGLTDRSL-LDMKSIWDFA 188
Query: 189 TTTPINEI-EFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKT 247
T I +I E I YN A+E LKGNYG +G + +GN + + ++
Sbjct: 189 NTVDIEDIREVIGRQIEYNSAIADEGLKGNYGANIGSVL-----LDTYGNDVRTRAKARA 243
Query: 248 ASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYI 307
A+ DARM G +PV+ N+GSGNQG+ + PV+ +A E ++ E+ RAL LS+L AI+
Sbjct: 244 AAGSDARMNGCELPVVINAGSGNQGMTCSLPVLEYADELQSGEEKTYRALVLSNLVAIHQ 303
Query: 308 KQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCAL 367
K +G LSA CG + A + GI YL GG Y + + V N +A ++GMVCDGAK SCA
Sbjct: 304 KTGIGRLSAYCGAVSAGAAAGAGIAYLCGGGYEEVIHTVVNALAIVSGMVCDGAKASCAA 363
Query: 368 KVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
K++S V IL + + +GI+ K ++ +I+N+ LGKD M E + ++ +M
Sbjct: 364 KIASSVDAGILGYNMYLRGQQFYAGDGIVTKGVEATIQNIGRLGKDGMKETNEEIIKMMI 423
Query: 428 S 428
S
Sbjct: 424 S 424
>gi|153940079|ref|YP_001391070.1| hypothetical protein CLI_1810 [Clostridium botulinum F str.
Langeland]
gi|152935975|gb|ABS41473.1| conserved hypothetical protein [Clostridium botulinum F str.
Langeland]
Length = 426
Score = 209 bits (533), Expect = 2e-52, Method: Composition-based stats.
Identities = 143/432 (33%), Positives = 232/432 (53%), Gaps = 18/432 (4%)
Query: 2 LEKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAM 61
+++NI II+++ ++++P +G TEP ++AL +A+A E++G + + I + KNA
Sbjct: 1 MDQNI---IINILRKEMMPGLGVTEPASIALSSAKAYEVIGGEIKNIKIIADPGLFKNAF 57
Query: 62 GVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGI 121
IPGT +G +A LGA+ G + LE ++ + E + + K + + I+IK +
Sbjct: 58 SCAIPGTKEVGNEMAALLGAICGDASLGLECLRKIKKEDVSKAKTMLDKIHIEIKSQ--- 114
Query: 122 TEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVL------LDKQVPAEEGAPTDNK 175
TE LY+E II H N V E + K+L L+K + A K
Sbjct: 115 TEGLYVESIVTTNNGIGRTIIRYKHDNIVLVEKNNKILYQKENTLNKSNNFSQEAIDSKK 174
Query: 176 DIQLNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLF 235
++ L + +F +IEF+LE+ + N +E+ L+G ++ S
Sbjct: 175 ITEMKLDEIVEFVNNVNYEKIEFLLESIKMNKKLSEKGLEGLGIGLGKLILE---SCNEN 231
Query: 236 GNSIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVR 295
+Y+ L T SA DAR+ GA VP M+ +GSGN GI AT P++ ++ E L R
Sbjct: 232 NYELYAEAL--TCSAIDARVSGAAVPAMTVTGSGNHGIIATLPLLAIKEKKNLNNEMLAR 289
Query: 296 ALTLSHLTAIYIKQNLGALSALCGCIVAC-TGSSCGITYLMGGDYNNICYAVKNMIANLT 354
++ LS++ IYIK+ G LSA CGC VA TG S GI YL+GG I +KNM +N+T
Sbjct: 290 SIALSYIINIYIKEFSGKLSAFCGCAVAAGTGVSAGICYLLGGRLKEIENTIKNMASNIT 349
Query: 355 GMVCDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDA 414
GM+C G +C+LK ++GV A LSA ++++N + GI+ I+ +++N+ +
Sbjct: 350 GMICTGGNLACSLKANTGVKAAFLSAKMALKNIVIPNKCGIVSNSIEDTMKNIGRIAYPG 409
Query: 415 MNEMDIMVLDIM 426
M + D +L+IM
Sbjct: 410 MMQTDKEILNIM 421
>gi|148379774|ref|YP_001254315.1| hypothetical protein CBO1815 [Clostridium botulinum A str. ATCC
3502]
gi|153933435|ref|YP_001384072.1| hypothetical protein CLB_1750 [Clostridium botulinum A str. ATCC
19397]
gi|153934676|ref|YP_001387612.1| hypothetical protein CLC_1757 [Clostridium botulinum A str. Hall]
gi|148289258|emb|CAL83354.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC
3502]
gi|152929479|gb|ABS34979.1| conserved hypothetical protein [Clostridium botulinum A str. ATCC
19397]
gi|152930590|gb|ABS36089.1| conserved hypothetical protein [Clostridium botulinum A str. Hall]
Length = 426
Score = 206 bits (525), Expect = 2e-51, Method: Composition-based stats.
Identities = 141/424 (33%), Positives = 226/424 (53%), Gaps = 15/424 (3%)
Query: 10 IIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTG 69
II+++ ++++P +G TEP ++AL +A+A E++G + + I + KNA IPGT
Sbjct: 6 IINILRKEMMPGLGVTEPASIALSSAKAYEVIGGEIKNIKIIADPGLFKNAFSCAIPGTK 65
Query: 70 MIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEM 129
+G +A LG + G + LE ++ + E + + K + ++IDI++K TE LY+E
Sbjct: 66 EVGNEMAALLGTICGDASLGLECLRKIKKEDVSKAKTML--DKIDIEIKSQ-TEGLYVES 122
Query: 130 TCEAGGKKATAIISKTHTNFVYEEADGKVL------LDKQVPAEEGAPTDNKDIQLNLKM 183
II H N V E + K+L L+K + A K ++ L
Sbjct: 123 IVTTNNGIGRTIIRYKHDNIVLVEKNNKILYQKENNLNKSNNFSQEAIDSKKITEMKLDE 182
Query: 184 VWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHI 243
+ +F +IEF+LE+ + N +E+ L+G ++ S +Y+
Sbjct: 183 IVEFVNNVNYEKIEFLLESIKMNKKLSEKGLEGLGIGLGKLILE---SCNENNYELYAEA 239
Query: 244 LSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLT 303
L T SA DAR+ GA VP M+ +GSGN GI T P++ ++ E L R++ LS++
Sbjct: 240 L--TCSAIDARVSGAPVPAMTVTGSGNHGIITTLPLLAIKEKKNLNNEVLARSIALSYII 297
Query: 304 AIYIKQNLGALSALCGCIVAC-TGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAK 362
IYIK+ G LSA CGC VA TG S GI YL+GG I +KNM +N+TGM+C G
Sbjct: 298 NIYIKEFSGKLSAFCGCAVAAGTGVSAGICYLLGGSLKEIENTIKNMASNITGMICTGGN 357
Query: 363 PSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMV 422
+C+LK ++GV A LSA +++ N + GI+ I+ +++N+ + M E D +
Sbjct: 358 LACSLKANTGVKAAFLSAKMALNNIVIPNKCGIVSNSIEDTMKNIGRIAYPGMMETDKEI 417
Query: 423 LDIM 426
L+IM
Sbjct: 418 LNIM 421
>gi|50841709|ref|YP_054936.1| conserved membrane associated protein [Propionibacterium acnes
KPA171202]
gi|50839311|gb|AAT81978.1| conserved membrane associated protein [Propionibacterium acnes
KPA171202]
Length = 316
Score = 192 bits (488), Expect = 4e-47, Method: Composition-based stats.
Identities = 122/310 (39%), Positives = 170/310 (54%), Gaps = 1/310 (0%)
Query: 121 ITEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQ-L 179
+ + LY E G A A I+ HTN D +VL K+ PA P +Q L
Sbjct: 4 VDDDLYAETVLHLDGHHARACIAGDHTNVFLVSRDDEVLESKERPAAGHVPPTAALLQGL 63
Query: 180 NLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSI 239
+L ++DFATT I IEFI EA+R N + G+YG G + + RG+ + +
Sbjct: 64 SLAKIYDFATTVDIERIEFITEAERLNSALVDAGRDGSYGIGEGAAILGSIDRGMASDDL 123
Query: 240 YSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTL 299
+ + + TA+A DARMGGA +P M+NSGSGNQGI AT PV V A E VRAL L
Sbjct: 124 CTRMTAYTAAASDARMGGAPLPAMTNSGSGNQGIVATVPVTVAADYAGVDHERRVRALAL 183
Query: 300 SHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCD 359
SH A+Y L LSA C A G++ GI ++ G Y+ + AV +M ++ GMVCD
Sbjct: 184 SHAVALYAHAGLPVLSAFCAATTAAMGAAAGICLVLDGSYSAVERAVASMTGDVVGMVCD 243
Query: 360 GAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMD 419
GA SCALKVS+ + A +A+LS+ V G++ D+D +IR + LG + M + D
Sbjct: 244 GAGCSCALKVSASANAAGRAALLSLAGRRVPGTNGLVHDDVDAAIRGIGRLGTEGMKQTD 303
Query: 420 IMVLDIMTSK 429
+L +M +K
Sbjct: 304 PEILSLMMAK 313
>gi|139437096|ref|ZP_01771256.1| Hypothetical protein COLAER_00234 [Collinsella aerofaciens ATCC
25986]
gi|133776743|gb|EBA40563.1| Hypothetical protein COLAER_00234 [Collinsella aerofaciens ATCC
25986]
Length = 439
Score = 187 bits (475), Expect = 1e-45, Method: Composition-based stats.
Identities = 151/432 (34%), Positives = 234/432 (54%), Gaps = 23/432 (5%)
Query: 12 DLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTG-M 70
D+++R++V A+GCTEP+AVA A A++ LG +P+ + S NI+KN V +P +G M
Sbjct: 13 DVLNRELVCALGCTEPIAVAYAAALASQTLGYEPDHMDVACSGNIIKNVKSVTVPNSGGM 72
Query: 71 IGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSE-NRIDIKLKEGITEKLYIEM 129
G+ A LGA+ G ++ LEV++ + E + +++ + D+ L EG+ LYI++
Sbjct: 73 HGIEAAAVLGAVGGDAKSALEVLESVNDEDRARVAELLADADYCDVSLVEGVP-NLYIKV 131
Query: 130 TCEAGGKKATAIISKTHTNFVYEEADGKVLLDK----------QVPAEEGAPTDNKDIQL 179
T AGG A I+ HTN DG + K +V AE A DN D +
Sbjct: 132 TATAGGHTAVVEITDHHTNVTCHTLDGIPVCGKSAGECAACAERVVAERAA--DNADTPM 189
Query: 180 NLKMVWDFATTTPINEIEFILEAKRYNMNAA--EEALKGNYGHCVGKTMDRPLSRGLFGN 237
+++ + DF I + +E ++ +N A E L +G VG+T+ G +
Sbjct: 190 SIETIIDFIEDGDIEDARAAVE-RQIELNGAISAEGLAHAWGAEVGRTL-----LGARAD 243
Query: 238 SIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRAL 297
+ ++ A+ DARM G +PV GSGNQGI PV+ +A+ E LVRA+
Sbjct: 244 DVACRARARAAAGSDARMNGCALPVAIVCGSGNQGITCALPVMEYAEYLRCDHERLVRAV 303
Query: 298 TLSHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMV 357
LS L A++IK +GALSA CG I A G+ IT+L GG I V N + N+ G+V
Sbjct: 304 MLSDLIAVHIKSYIGALSAFCGAICAACGAGAAITWLCGGTREQIGATVSNTLGNVGGIV 363
Query: 358 CDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNE 417
CDGAK SCA K+S+ V AIL ++M+ EG+I ++++I ++ +G+ M +
Sbjct: 364 CDGAKASCAAKISAAVDAAILGHDMAMQGRGFRAGEGLIQDTVEQTIASMGYVGRVGMKD 423
Query: 418 MDIMVLDIMTSK 429
D+ +L+IM K
Sbjct: 424 TDVEILNIMIGK 435
>gi|154249957|ref|YP_001410782.1| protein of unknown function DUF1063 [Fervidobacterium nodosum
Rt17-B1]
gi|154153893|gb|ABS61125.1| protein of unknown function DUF1063 [Fervidobacterium nodosum
Rt17-B1]
Length = 411
Score = 185 bits (469), Expect = 6e-45, Method: Composition-based stats.
Identities = 132/427 (30%), Positives = 216/427 (50%), Gaps = 29/427 (6%)
Query: 3 EKNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMG 62
E + R I ++ V + GCTEP+AV L A L + I + N KN +
Sbjct: 8 ENSKRSIIREIFFDNVKLSYGCTEPVAVGLSVAVGKGYLRGVLKSIDVIMDRNTYKNGLE 67
Query: 63 VGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGIT 122
VGIPGT + G +AI+L L+GK EY L+V KD+ L K Y +++I + + +
Sbjct: 68 VGIPGTHLHGFDLAIALAYLVGKPEYGLQVFKDVNSHVLS--KAYELKDKIRVSYEN--S 123
Query: 123 EKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLN 180
L+I+ EA + I + +H N DG + + Q KD+ ++
Sbjct: 124 YNLHIKTKLEADNEVLIEI-TDSHDNISKIVVDGNEIRNTQTSVNF-----KKDLVKSIS 177
Query: 181 LKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIY 240
L ++++ ++ + + E +YN+NAA E +K G FG ++
Sbjct: 178 LNDIFEYIENPDLDVVNVVKEGIKYNVNAAREGIK---------------KEGNFGYALE 222
Query: 241 SHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLS 300
I + A+ D RM G ++P+M+ +GSGNQGI + P ++ +ENE E++ +A+ LS
Sbjct: 223 EGIPAYVAAGVDERMNGELIPIMTVAGSGNQGIASIVPPTLYGRENEMPEEKIEKAVLLS 282
Query: 301 HLTAIYIKQNLGALSALCGCIVACTGSSCGI-TYLMGGDYNNICYAVKNMIANLTGMVCD 359
L YIK G L+ +CG + S TYL GG+ I A+ N++A L GM CD
Sbjct: 283 ILVTTYIKAFTGVLTPVCGAGSIASAGSSAAITYLAGGNAEQIKNAINNVLATLFGMTCD 342
Query: 360 GAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMD 419
GAK CALK S G A+ ++ L++++ ++ G KD++++IR + L K ++ + D
Sbjct: 343 GAKRGCALKASIGTQMALNASKLALKDTNIPCGNGFAAKDVEETIRRIELLTK-SLRQFD 401
Query: 420 IMVLDIM 426
V+D +
Sbjct: 402 QDVIDFI 408
>gi|39996627|ref|NP_952578.1| hypothetical protein GSU1527 [Geobacter sulfurreducens PCA]
gi|39983508|gb|AAR34901.1| conserved hypothetical protein [Geobacter sulfurreducens PCA]
Length = 429
Score = 181 bits (460), Expect = 7e-44, Method: Composition-based stats.
Identities = 143/428 (33%), Positives = 221/428 (51%), Gaps = 19/428 (4%)
Query: 12 DLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTG-M 70
D++ +V PA+GCTEP+AVA + A E LG + E ++A + + KN V +P TG +
Sbjct: 6 DVIKSEVFPALGCTEPIAVAYAASLAAERLGAEVETVTASVDPGVFKNGFAVTVPKTGGL 65
Query: 71 IGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEMT 130
G IA +LGALI + E ++E++ L + + VS R + L + T+ LYI++
Sbjct: 66 KGNVIAAALGALIARPELKMEILSGADERLLAQAELLVSSGRATVALVKERTD-LYIDVV 124
Query: 131 CEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLNLKMVWDFA 188
GG+ A A++ HTN V E DG++LL+ P + + Q+ +
Sbjct: 125 VTGGGRTARAVLEGGHTNIVRLECDGRILLNADEPVSAVDSHAYRAVLRQMTFSEMIGLL 184
Query: 189 TTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKTA 248
++ ++ N+ AEE G VG ++ + +G + S TA
Sbjct: 185 DDLDQGDLVYLKRGVEMNLRIAEE---GKQLTKVGHYVEELVRKGFLLADVVSSSKILTA 241
Query: 249 SACDARMGGAMVPVMSNSGSGNQGICAT----NPVVVFAKENENTPEE-LVRALTLSHLT 303
SA DARM G PVMS+ GSGNQGI A N + F + PEE ++R++ LSHL
Sbjct: 242 SASDARMAGLPYPVMSSGGSGNQGIVAILVPYNVGMFF-----HVPEETILRSIALSHLV 296
Query: 304 AIYIKQNLGALSALCGCIVACTGSSCGITYLM--GGDYNNICYAVKNMIANLTGMVCDGA 361
YIK + G L+ +CGC +A + G D + I AV +I+++ GM+CDGA
Sbjct: 297 NAYIKCHTGDLAPICGCAIAAGVGAAVAIVYQQAGPDMHKIDLAVNTIISDIGGMLCDGA 356
Query: 362 KPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIM 421
K CALKV S AI +A +++ H ++E EG + K +++I NL+ + M +D
Sbjct: 357 KGGCALKVVSSTDAAIRAAYMALNGHGISEEEGFVGKSAEETIHNLSRIADKGMALVDDT 416
Query: 422 VLDIMTSK 429
+L IM K
Sbjct: 417 MLCIMLQK 424
>gi|77979116|ref|ZP_00834537.1| COG3681: Uncharacterized conserved protein [Yersinia intermedia
ATCC 29909]
Length = 366
Score = 181 bits (460), Expect = 7e-44, Method: Composition-based stats.
Identities = 127/371 (34%), Positives = 201/371 (54%), Gaps = 7/371 (1%)
Query: 61 MGVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEG 120
MGV IPGT G+ +A ++GA+ G ++ LEV+K +T + E + + + +
Sbjct: 1 MGVTIPGTSYCGVTLAAAIGAIGGNADADLEVLKGITATQITEAYIFNQSGNVCLNAVDA 60
Query: 121 ITEKLYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--Q 178
T+ ++I++T + + +I +HTN V E + KQ+ + T++ + +
Sbjct: 61 -TDFIFIDITLYSLSDQCRVVIQGSHTN-VTEVYINNI---KQILVRDLITTNDSGVLPE 115
Query: 179 LNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNS 238
+L + F T +I F+L++ N ++E + NYG V + + G
Sbjct: 116 FSLNDAFKFVTDVAAKDIMFMLKSAEINTALSDEGQRKNYGLNVSGALSQARKNGFISTD 175
Query: 239 IYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALT 298
+ S +L T +A DARMGGA +P MSN GSGNQGI T PVV A+ E L RAL
Sbjct: 176 LLSQMLINTTAASDARMGGAPLPAMSNYGSGNQGITVTMPVVTLARHLNTNDETLARALA 235
Query: 299 LSHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVC 358
L+HL AI I LSALC A G++ G+++L+ D+ I YAV NMI++++G++C
Sbjct: 236 LAHLAAISIHMRYTRLSALCAASTAAMGAAAGMSWLLTQDFQTISYAVSNMISDISGIIC 295
Query: 359 DGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEM 418
DGA SCA+KVS+ + A S +++ +N E +GI+ D++ SI NL L M +
Sbjct: 296 DGASNSCAMKVSTATACAFKSVLMAQQNSVAGERDGIVSCDVEGSINNLCKLVLSPMRQT 355
Query: 419 DIMVLDIMTSK 429
D ++ IMT K
Sbjct: 356 DKEIISIMTRK 366
>gi|150020899|ref|YP_001306253.1| protein of unknown function DUF1063 [Thermosipho melanesiensis
BI429]
gi|149793420|gb|ABR30868.1| protein of unknown function DUF1063 [Thermosipho melanesiensis
BI429]
Length = 397
Score = 180 bits (457), Expect = 1e-43, Method: Composition-based stats.
Identities = 129/414 (31%), Positives = 209/414 (50%), Gaps = 26/414 (6%)
Query: 12 DLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGTGMI 71
D+ QV PA GCTEP+A+AL TA + + I+ L N KN + V IPGT
Sbjct: 4 DIFFEQVKPAYGCTEPIAIALSTAVGKKYSKGNVKSINITLDKNTYKNGLVVNIPGTNTF 63
Query: 72 GLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEMTC 131
GL +A +LG L G LEV+KD+ + L++ + +N ++I L E + L++ T
Sbjct: 64 GLELAAALGYLCGDGNKGLEVLKDINEDCLKKAMKM--KNMVNISLNE--EQHLFVNTTI 119
Query: 132 EAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWDFATTT 191
A +I H N + KV+ +++ G + K +L + +
Sbjct: 120 IAE-NTVQIVIEGKHDNIAKIVVNDKVIKNEEF--HPGMTSIEKIKNYSLDKIIKYVEHP 176
Query: 192 PINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKTASAC 251
+ ++++ +A N+ AE + ++G F N+ + + ++
Sbjct: 177 DEDVLKYVEKAIDMNLKIAEYGIN---------------TKGNFSNAAINEYVKYVSAGV 221
Query: 252 DARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYIKQNL 311
DARM G ++PVM+ +GSGNQG+ P+ +F + E E+L++A LS L IYIK
Sbjct: 222 DARMSGVLMPVMTVAGSGNQGLACILPIAIFKGKVER--EKLLKATLLSILVTIYIKAYT 279
Query: 312 GALSALCGC-IVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCALKVS 370
G L+ +CG ++ G++ G+TYL GG+ I A+ + + L G+ CDGAK CALK
Sbjct: 280 GLLTPICGAGSISSAGAAAGLTYLKGGNNTQIKNAINDTLGTLFGLTCDGAKRGCALKAI 339
Query: 371 SGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLD 424
+G TA+ + L++ N V GI+ KD++++IR + L N D VLD
Sbjct: 340 TGTFTALQVSSLALNNIDVPCGNGIVAKDVEETIRRVEKLTNSVKN-FDKDVLD 392
>gi|145953743|ref|ZP_01802751.1| hypothetical protein CdifQ_04003752 [Clostridium difficile
QCD-32g58]
Length = 299
Score = 177 bits (448), Expect = 2e-42, Method: Composition-based stats.
Identities = 117/294 (39%), Positives = 172/294 (58%), Gaps = 13/294 (4%)
Query: 142 ISKTHTNFVYEEADGKVLLDKQ--VPAEEGAP------TDNKDIQLNLKMVWDFATTTPI 193
I H NF + E DG+VLLD + V A A D IQ +K V +
Sbjct: 3 IRTKHDNFTFLEKDGEVLLDNEPKVSASNDAAEKAESLMDTVTIQELIKNVEEI----DF 58
Query: 194 NEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKTASACDA 253
+IEF+L+ + N AE LK G VG + + + GL GN + ++ + TA A DA
Sbjct: 59 KDIEFLLDGVKMNEEMAEYGLKQKTGIGVGYGIKKSIEEGLLGNDVINYAMMLTAGASDA 118
Query: 254 RMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAIYIKQNLGA 313
RM G +PVMS++GSGN G+ A P+V + K+ + E+L +AL +SHL YIK G
Sbjct: 119 RMAGVKMPVMSSNGSGNHGLTAILPIVAYNKKFPQSDEKLAKALAISHLVTGYIKNYTGR 178
Query: 314 LSALCGC-IVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCALKVSSG 372
LSA+CGC + A TG++ GI++LM G I A++NMIA+L+GM+CDGAK CALK+SS
Sbjct: 179 LSAVCGCGVAASTGATAGISWLMNGTEKQIEGAIENMIADLSGMICDGAKAGCALKLSSA 238
Query: 373 VSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIM 426
S AI SA+++ ++ V GI+ +++SI+NL + M+ D ++L++M
Sbjct: 239 ASAAIQSAIIAKQDCFVPPLNGIVGSSVEQSIQNLGRVSDKGMSITDEIILNVM 292
>gi|153855512|ref|ZP_01996631.1| hypothetical protein DORLON_02645 [Dorea longicatena DSM 13814]
gi|149752034|gb|EDM61965.1| hypothetical protein DORLON_02645 [Dorea longicatena DSM 13814]
Length = 429
Score = 171 bits (434), Expect = 7e-41, Method: Composition-based stats.
Identities = 132/434 (30%), Positives = 213/434 (49%), Gaps = 19/434 (4%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
+++ +L+ + PA+G TEP A+A A A + K + L++ + KNA GIP
Sbjct: 2 QKLTELIREDMKPALGVTEPGAIAFAVASAKKYTEGKIVNVHVALNSGMYKNAFTCGIPN 61
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
+ G A +LGA+ +E LE + D+TPE E+ Q VS+ +++ + + + ++ I
Sbjct: 62 SERFGNYYAAALGAVAADAEKGLESLADITPEDDEKAAQMVSDGLVEVVM-DHVGSEITI 120
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQ--VPAEEGAPTDNKDI--QLNLKM 183
+ + +I HTN V E DG V+ + + + EE + + +L
Sbjct: 121 DAVVTTENNRCKVMIRDAHTNIVKIEKDGIVIFETEDKMYREESRKKQARPVIHGYSLAE 180
Query: 184 VWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHI 243
+ + T P+ EI FI +A N+ +E L + K + R GN I S
Sbjct: 181 ILRYVNTVPLEEIAFIEKAYTMNLELLKEGLASDRA-VFAKKLYRE-----NGNRIISRD 234
Query: 244 LSKTAS-----ACDARMGGAMVPVMSNSGSGNQGICATNPVVVF--AKENENTPEELVRA 296
KTA A +AR+ G P MS +GSG GI AT P+ + A E++ E L RA
Sbjct: 235 ALKTAQLLCNGAIEARVLGLSRPAMSITGSGAHGIIATMPLYAYRHANEDKTDDETLWRA 294
Query: 297 LTLSHLTAIYIKQNLGALSALCGC-IVACTGSSCGITYLMGGDYNNICYAVKNMIANLTG 355
LS+L +YIK+ G LSA CGC I A TG +CG+ YL G D ++NM + LTG
Sbjct: 295 TALSYLITMYIKEYSGRLSAFCGCGIAAGTGMACGLAYLQGADGEKHTKVIQNMASGLTG 354
Query: 356 MVCDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAM 415
M+CDG C +K V A + ++M++ + GI + + +++ + + M
Sbjct: 355 MICDGGNHGCTMKGIVAVDAAFRATDMAMDDICIENIHGINGRTPEATMKYMGMIASPGM 414
Query: 416 NEMDIMVLDIMTSK 429
+ +++IM SK
Sbjct: 415 TGTEKTIVEIMESK 428
>gi|7465931|pir||A65100 hypothetical 19.4 kD protein in exuR-tdcC intergenic region -
Escherichia coli (strain K-12)
gi|606049|gb|AAA57912.1| ORF_f188 [Escherichia coli]
Length = 188
Score = 159 bits (401), Expect = 4e-37, Method: Composition-based stats.
Identities = 88/185 (47%), Positives = 124/185 (67%)
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+T++A DARMGGA +P MSNSGSGNQGI AT PVVV A+ E L RAL LSHL+AI
Sbjct: 3 RTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAI 62
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YI L LSALC A G++ G+ +L+ G Y I A+ +MI +++GM+CDGA SC
Sbjct: 63 YIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSC 122
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A + ++++++ VT EGI+ D+++SI NL +L +M + D +++I
Sbjct: 123 AMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEI 182
Query: 426 MTSKG 430
M SK
Sbjct: 183 MASKA 187
>gi|83590261|ref|YP_430270.1| Protein of unknown function DUF1063 [Moorella thermoacetica ATCC
39073]
gi|83573175|gb|ABC19727.1| Protein of unknown function DUF1063 [Moorella thermoacetica ATCC
39073]
Length = 425
Score = 159 bits (401), Expect = 5e-37, Method: Composition-based stats.
Identities = 115/420 (27%), Positives = 203/420 (48%), Gaps = 20/420 (4%)
Query: 7 REQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIP 66
++ +I+L+H++ A+GCTEP+ VAL A+ ++LG P + +S+ + KNA VG+P
Sbjct: 6 QQTLINLLHQEADVAIGCTEPVMVALAAAKTRDMLGTLPRLVDISVSSAVWKNARRVGLP 65
Query: 67 GTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLY 126
GTG + L+ E ++ LTP +E+ K V E + E LY
Sbjct: 66 GTGE-KGLAMAAAMGLLAPVEAGQRLLAALTPVQVEQAKILVREGVV-KVGVVAAKEGLY 123
Query: 127 IEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMVWD 186
+ +A ++ +H NF DG++ G +N +++L + D
Sbjct: 124 ARAVARSNQHEAIVELNGSHKNFSALWLDGRM---------AGGAGENLNLKLEALLAQD 174
Query: 187 FAT------TTPINEIEFILEAKRYNMNAAEEALKGNYGHCVG-KTMDRPLSRGLFGNSI 239
+ + + E+ F+ + + A E +G + R G G S+
Sbjct: 175 YQSLLKQVLSLSPEELYFLYQGAEDILTFAREIHQGGRNPLSAMASFFRRTESG--GESL 232
Query: 240 YSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTL 299
I + T A RM GA PV++ +GSGNQGI A +++ +E PE + RAL +
Sbjct: 233 EVLIRNLTGIAVAERMAGATYPVLTCAGSGNQGILAAVSLLLAGQELRAGPESVTRALAI 292
Query: 300 SHLTAIYIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCD 359
+H T +Y+K G LS LCG + G + I +L+ G I A++ ++ NL ++CD
Sbjct: 293 AHFTNMYLKAYTGKLSPLCGAVTGGAGVAAAICWLLEGSCQQIINAMQIVLGNLCCVICD 352
Query: 360 GAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMD 419
GAK SCALK+S+ A+ + ++ + ++ GI+ K ++ ++ + + + + E+D
Sbjct: 353 GAKESCALKISTAAVEAVRAGYMACQGINLEAGTGIVGKKLEDTMELVRKVYQGGLGEID 412
>gi|30064452|ref|NP_838623.1| hypothetical protein S3359 [Shigella flexneri 2a str. 2457T]
gi|56480263|ref|NP_708914.2| hypothetical protein SF3150 [Shigella flexneri 2a str. 301]
gi|30042711|gb|AAP18434.1| hypothetical protein S3359 [Shigella flexneri 2a str. 2457T]
gi|56383814|gb|AAN44621.2| orf, conserved hypothetical protein [Shigella flexneri 2a str. 301]
Length = 188
Score = 157 bits (397), Expect = 1e-36, Method: Composition-based stats.
Identities = 87/185 (47%), Positives = 123/185 (66%)
Query: 246 KTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTAI 305
+T++A DARMGGA +P MSNSGSGNQGI A PVVV A+ E L RAL LSHL+AI
Sbjct: 3 RTSAASDARMGGATLPAMSNSGSGNQGITAIMPVVVVAEHFGADDERLARALMLSHLSAI 62
Query: 306 YIKQNLGALSALCGCIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSC 365
YI L LSALC A G++ G+ +L+ G Y I A+ +MI +++GM+CDGA SC
Sbjct: 63 YIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSC 122
Query: 366 ALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDI 425
A+KVS+ S A + ++++++ VT EGI+ D+++SI NL +L +M + D +++I
Sbjct: 123 AMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEI 182
Query: 426 MTSKG 430
M SK
Sbjct: 183 MASKA 187
>gi|15669214|ref|NP_248019.1| hypothetical protein MJ1025 [Methanocaldococcus jannaschii DSM
2661]
gi|3024973|sp|Q58431|Y1025_METJA Uncharacterized protein MJ1025
gi|1591681|gb|AAB99029.1| conserved hypothetical protein [Methanocaldococcus jannaschii DSM
2661]
Length = 388
Score = 156 bits (395), Expect = 2e-36, Method: Composition-based stats.
Identities = 131/426 (30%), Positives = 202/426 (47%), Gaps = 45/426 (10%)
Query: 5 NIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVG 64
N E I +++ +VV A+GCTE + A+A ++I L KNA VG
Sbjct: 6 NKNELITEILKNEVVKALGCTEVGLIGYTVAKAKPEDLYSIKEIKLILDKGTFKNAFSVG 65
Query: 65 IPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEK 124
+P T G+ A+ +G L+G+ E +LEV KD+ + D KL+E I K
Sbjct: 66 VPNTNKFGILPAV-VGGLLGREENKLEVFKDI---------------KYDEKLEEFIENK 109
Query: 125 LYIEMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLNLKMV 184
L IE+ K +K + GK L D N L LK
Sbjct: 110 LKIEVIDSDVYCKVIIKANKVYEAETKGSHSGKSLSDD---------LKNAYKSLTLKDF 160
Query: 185 WDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIYSHIL 244
D+ P I+ I E N N + + ++ ++D + I +H+L
Sbjct: 161 IDYIEDIPEEVIKIIKETIETNKNLSTPEVPEDF-----ISLD-------LKDEILNHML 208
Query: 245 SKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLSHLTA 304
KT SA RM G P M+ +GSGN G+ AT P++ + + + E+L +++TLS LT
Sbjct: 209 KKTVSAVYNRMIGINKPAMAIAGSGNMGLTATLPIIAYDEIKGHDEEKLTKSITLSALTT 268
Query: 305 IYIKQNLGALSALCGCI-VACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKP 363
IY + +SA+CGC+ G+ G++Y + G ++ I ++K+ ANL G+VCDG K
Sbjct: 269 IYSAYHSSYISAMCGCVNRGGIGAVSGLSYYIFG-FDRIEESIKSFTANLPGIVCDGGKI 327
Query: 364 SCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVL 423
CALK++SGV LS V GI+ KD + I N+ +GK AM +D ++
Sbjct: 328 GCALKIASGVFAIYLSLF-----SKVPYTNGIVGKDFKECIENIGKIGK-AMKPVDDEII 381
Query: 424 DIMTSK 429
+I+ +K
Sbjct: 382 EILKNK 387
>gi|120603539|ref|YP_967939.1| protein of unknown function DUF1063 [Desulfovibrio vulgaris subsp.
vulgaris DP4]
gi|120563768|gb|ABM29512.1| protein of unknown function DUF1063 [Desulfovibrio vulgaris subsp.
vulgaris DP4]
Length = 443
Score = 156 bits (394), Expect = 3e-36, Method: Composition-based stats.
Identities = 129/434 (29%), Positives = 214/434 (49%), Gaps = 29/434 (6%)
Query: 17 QVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGT-GMIGLPI 75
+V PA+GCTEP AVA + A +P ++ LS ++ KN VGIPGT G+ G +
Sbjct: 10 EVKPALGCTEPGAVAYAASIAARHCPGEPLSVALSLSLSMFKNGRDVGIPGTGGLRGNRL 69
Query: 76 AISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEMTCEAGG 135
A LG L G ++ L ++ + +E + + + ++ +G+ +Y +T G
Sbjct: 70 AAVLGVLAGDADKGLMALEHIDMAVVERAQTLLDAGMVTEEVVDGVP-GVYAAVTLRCAG 128
Query: 136 KKATAIISKTHTNFVYEEADGKVLLD---KQVPAEEGA------------------PTDN 174
+ T ++ H DG+V+ ++ P +G P
Sbjct: 129 HEVTVTVAGRHDRVASIVVDGEVVGGEGMERAPEADGTLHGGASCEPSASFTEPPLPAYL 188
Query: 175 KDI-QLNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRG 233
+++ + + +WD A + +L NM A L+ +G VG T L+
Sbjct: 189 EELRECDFAQLWDMAAGIDATLEQELLRGAAMNMAVARMGLESGWGLGVGHT----LAAH 244
Query: 234 LFGNSIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEEL 293
+++ I +A D RM GA PVMS++GSGN GI AT PV V A+ +P
Sbjct: 245 AEAADLHARIRFMAGAAADVRMAGAPQPVMSSAGSGNHGITATVPVAVAAEGLGVSPRVQ 304
Query: 294 VRALTLSHLTAIYIKQNLGALSALCGC-IVACTGSSCGITYLMGGDYNNICYAVKNMIAN 352
AL LSHL Y+K + G L+ +CGC + A G++ GI ++GG+ AV +++A+
Sbjct: 305 AEALALSHLVTGYLKAHTGRLTPICGCSVAAGAGAAAGIVKVLGGNAVQAERAVASLMAS 364
Query: 353 LTGMVCDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGK 412
L GM+CDGAK SC LKV++ A +A+L M++ V EG+++ DI + R L L +
Sbjct: 365 LMGMLCDGAKGSCGLKVATAAGEAYAAALLGMDDRGVQRPEGVVNPDIATTARALARLSR 424
Query: 413 DAMNEMDIMVLDIM 426
+ D ++++++
Sbjct: 425 EGFAAADAVMVELL 438
>gi|46578856|ref|YP_009664.1| hypothetical protein DVU0440 [Desulfovibrio vulgaris subsp.
vulgaris str. Hildenborough]
gi|46448268|gb|AAS94923.1| conserved hypothetical protein [Desulfovibrio vulgaris subsp.
vulgaris str. Hildenborough]
Length = 443
Score = 156 bits (394), Expect = 3e-36, Method: Composition-based stats.
Identities = 129/434 (29%), Positives = 214/434 (49%), Gaps = 29/434 (6%)
Query: 17 QVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPGT-GMIGLPI 75
+V PA+GCTEP AVA + A +P ++ LS ++ KN VGIPGT G+ G +
Sbjct: 10 EVKPALGCTEPGAVAYAASIAARHCPGEPLSVALSLSLSMFKNGRDVGIPGTGGLRGNRL 69
Query: 76 AISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYIEMTCEAGG 135
A LG L G ++ L ++ + +E + + + ++ +G+ +Y +T G
Sbjct: 70 AAVLGVLAGDADKGLMALEHIDMAVVERAQTLLDAGMVTEEVVDGVP-GVYAAVTLRCAG 128
Query: 136 KKATAIISKTHTNFVYEEADGKVLLD---KQVPAEEGA------------------PTDN 174
+ T ++ H DG+V+ ++ P +G P
Sbjct: 129 HEVTVTVAGRHDRVASIVVDGEVVGGEGMERAPEADGTLHGGASCEPSASFTKPPLPAYL 188
Query: 175 KDI-QLNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRG 233
+++ + + +WD A + +L NM A L+ +G VG T L+
Sbjct: 189 EELRECDFAQLWDMAAGIDATLEQELLRGAAMNMAVARMGLESGWGLGVGHT----LAAH 244
Query: 234 LFGNSIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEEL 293
+++ I +A D RM GA PVMS++GSGN GI AT PV V A+ +P
Sbjct: 245 AEAADLHARIRFMAGAAADVRMAGAPQPVMSSAGSGNHGITATVPVAVAAEGLGVSPRVQ 304
Query: 294 VRALTLSHLTAIYIKQNLGALSALCGC-IVACTGSSCGITYLMGGDYNNICYAVKNMIAN 352
AL LSHL Y+K + G L+ +CGC + A G++ GI ++GG+ AV +++A+
Sbjct: 305 AEALALSHLVTGYLKAHTGRLTPICGCSVAAGAGAAAGIVKVLGGNAVQAERAVASLMAS 364
Query: 353 LTGMVCDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGK 412
L GM+CDGAK SC LKV++ A +A+L M++ V EG+++ DI + R L L +
Sbjct: 365 LMGMLCDGAKGSCGLKVATAAGEAYAAALLGMDDRGVQRPEGVVNPDIATTARALARLSR 424
Query: 413 DAMNEMDIMVLDIM 426
+ D ++++++
Sbjct: 425 EGFAAADAVMVELL 438
>gi|46369625|gb|AAS89662.1| putative inner membrane protein [Yersinia ruckeri]
Length = 175
Score = 153 bits (387), Expect = 2e-35, Method: Composition-based stats.
Identities = 84/174 (48%), Positives = 118/174 (67%)
Query: 212 EALKGNYGHCVGKTMDRPLSRGLFGNSIYSHILSKTASACDARMGGAMVPVMSNSGSGNQ 271
E LKG YG +G T++R RGL + S I+ +TASA DARMGGA +P MSNSGSGNQ
Sbjct: 2 EGLKGQYGLHIGATLERQRLRGLLARDLLSDIMIRTASASDARMGGATLPAMSNSGSGNQ 61
Query: 272 GICATNPVVVFAKENENTPEELVRALTLSHLTAIYIKQNLGALSALCGCIVACTGSSCGI 331
GI AT PVVV A+ ++ E+L RAL LSHL AIYI L ALSALC A G++ G+
Sbjct: 62 GIAATMPVVVVAEYLGSSEEQLARALMLSHLMAIYIHSQLPALSALCAATTASMGAAAGM 121
Query: 332 TYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCALKVSSGVSTAILSAMLSME 385
+L+ Y+ + A+ +MI +++G++CDGA SCA+KVS+ + A + +++++
Sbjct: 122 AWLIEPGYDTVAMAISSMIGDISGIICDGAANSCAMKVSTSATAAYKAVLMALD 175
>gi|134045156|ref|YP_001096642.1| protein of unknown function DUF1063 [Methanococcus maripaludis C5]
gi|132662781|gb|ABO34427.1| protein of unknown function DUF1063 [Methanococcus maripaludis C5]
Length = 397
Score = 141 bits (356), Expect = 8e-32, Method: Composition-based stats.
Identities = 123/436 (28%), Positives = 212/436 (48%), Gaps = 49/436 (11%)
Query: 2 LEKNIREQIIDLMHRQVVPAVGCTE----PMAVALCTARATELLGQKPEKISAFLSANIL 57
++ + R I ++ +V A+GCTE AV+LC + EKI L+
Sbjct: 1 MDDSKRILITKILKNEVTEALGCTEVGLIGYAVSLCNISDPFSI----EKIELTLNNGSF 56
Query: 58 KNAMGVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLT-PETLEEGKQYVSENRIDIK 116
KN VG+P TG GL A+ +G +G S+ +L + D+T + LE+ ++ +++IK
Sbjct: 57 KNVYAVGVPNTGKYGLLPAV-VGGFLGNSKNKLLIFNDITYSQELED----FTKEKLEIK 111
Query: 117 LKEGITEKLYIEMTC-EAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNK 175
+ G LY + + GK ++I H N V E + +++ E + K
Sbjct: 112 VING---PLYCSVKIKDNSGKIHESLIKDNHLNVVIPE-----IKKEKINMEINSSEKEK 163
Query: 176 DIQLNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLF 235
L L ++ P I+ + + N N +KG++ + +
Sbjct: 164 YKNLELLDFLNYLDEIPEEIIKLVEKTIYTNKNL----IKGDFLN--------------Y 205
Query: 236 GNSIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVR 295
G I S++++KT SAC+ RM G + MS + SGN GI AT P++ + EN E+L++
Sbjct: 206 GTDILSNMVNKTTSACNIRMTGENMTAMSVAKSGNMGIMATLPIISYDFSTENNSEKLIK 265
Query: 296 ALTLSHLTAIYIKQNLGALSALCGCIV-ACTGSSCGIT-YLMGGDYNNICYAVKNMIANL 353
++ LS L IY N LS++CGC+ G+ G++ Y G + + + ANL
Sbjct: 266 SVLLSMLVTIYSTYNSSYLSSMCGCVSKGGMGAVIGLSHYKNGKNLKKFDSSARTFTANL 325
Query: 354 TGMVCDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKD 413
G++CDG K CALK++SG A S + ++ GI+ K+ + + N++ + K
Sbjct: 326 PGIICDGGKVGCALKLASGCFAAYSSLFVD-----ISYENGIVGKNFKECVENISKISK- 379
Query: 414 AMNEMDIMVLDIMTSK 429
AM ++D +++IM+ K
Sbjct: 380 AMGDLDCDIVEIMSKK 395
>gi|150402637|ref|YP_001329931.1| protein of unknown function DUF1063 [Methanococcus maripaludis C7]
gi|150033667|gb|ABR65780.1| protein of unknown function DUF1063 [Methanococcus maripaludis C7]
Length = 397
Score = 140 bits (352), Expect = 2e-31, Method: Composition-based stats.
Identities = 121/436 (27%), Positives = 214/436 (49%), Gaps = 49/436 (11%)
Query: 2 LEKNIREQIIDLMHRQVVPAVGCTE----PMAVALCTARATELLGQKPEKISAFLSANIL 57
++ + R I ++ +V A+GCTE A++LC + EKI L+
Sbjct: 1 MDDSKRILITKVLKNEVTEALGCTEVGLIGYAISLCNISYPFSI----EKIEVTLNNGSF 56
Query: 58 KNAMGVGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLT-PETLEEGKQYVSENRIDIK 116
KNA VG+P T G+ A+ +G L+G S+ +L + D+ + LE+ + R+++K
Sbjct: 57 KNAYAVGVPNTKKYGILPAV-VGGLLGNSKNKLLIFNDIKYDQKLED----FIKKRLEVK 111
Query: 117 LKEGITEKLYIEMTC-EAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNK 175
+ +G LY + + GK ++I H N V + + + + + + + K
Sbjct: 112 VLDG---PLYCGVKIKDTSGKFFESLIKDNHLNVVIPKIEKEKI---SLEITDFEKEEYK 165
Query: 176 DIQLNLKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLF 235
++L DF + ++ E +N E+ + N G ++ +
Sbjct: 166 SLELT-----DF--------LNYLDEIPEEIINLVEKTIYTNKNLIKGDFLN-------Y 205
Query: 236 GNSIYSHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVR 295
GN I S++++KT SAC+ RM G +P MS + SGN G+ AT P++ + EN E+L +
Sbjct: 206 GNDILSNMVNKTTSACNTRMTGENMPAMSVAKSGNMGLMATLPIISYDNLTENNFEKLKK 265
Query: 296 ALTLSHLTAIYIKQNLGALSALCGCIV-ACTGSSCGITYLMGG-DYNNICYAVKNMIANL 353
+L L+ L IY N LS++CGC+ G+ G+ Y G + + A + ANL
Sbjct: 266 SLLLAMLVTIYSTYNSSYLSSMCGCVSKGGMGAVIGLCYYKNGKNLKKLNSAARAFTANL 325
Query: 354 TGMVCDGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKD 413
G++CDG K CALK++SG A S + ++ GI+ K+ + ++N++ + K
Sbjct: 326 PGIICDGGKVGCALKLASGCFAAYSSLYV-----EISHENGIVGKNFKECVQNISKISK- 379
Query: 414 AMNEMDIMVLDIMTSK 429
AM ++D ++ IM+ K
Sbjct: 380 AMGDLDCDIVKIMSKK 395
>gi|45359031|ref|NP_988588.1| hypothetical protein MMP1468 [Methanococcus maripaludis S2]
gi|45047906|emb|CAF31024.1| conserved hypothetical protein [Methanococcus maripaludis S2]
Length = 396
Score = 136 bits (342), Expect = 3e-30, Method: Composition-based stats.
Identities = 124/431 (28%), Positives = 209/431 (48%), Gaps = 50/431 (11%)
Query: 7 REQIIDLMHRQVVPAVGCTE----PMAVALCTARATELLGQKPEKISAFLSANILKNAMG 62
R I ++ +V A+GCTE AV+LC + EKI L+ KNA
Sbjct: 6 RILITKILKNEVTEALGCTEVGLIGYAVSLCNISDPFSI----EKIELTLNNGSFKNAYA 61
Query: 63 VGIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLT-PETLEEGKQYVSENRIDIKLKEGI 121
VG+P T G+ A+ +G L+G + +L V + + LE+ ++ E R+ I++ I
Sbjct: 62 VGVPNTKKYGILPAV-VGGLLGDHKNKLLVFNGIKYSQKLED---FIKE-RLKIRV---I 113
Query: 122 TEKLYIEMTC-EAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDIQLN 180
LY + + G ++I H N V + + K++ ++ E K+ N
Sbjct: 114 NSPLYCGVKIKDNSGNTFESLIKDNHLNVVIPKINNKLI--SEINGSE------KEEYKN 165
Query: 181 LKMVWDFATTTPINEIEFILEAKRYNMNAAEEALKGNYGHCVGKTMDRPLSRGLFGNSIY 240
L+++ DF +E+I E + E+ + N G ++ FGN
Sbjct: 166 LELL-DF--------LEYIDEIPEEIIQLVEKTIYTNNNLIKGDFLN-------FGNDCL 209
Query: 241 SHILSKTASACDARMGGAMVPVMSNSGSGNQGICATNPVVVFAKENENTPEELVRALTLS 300
S++++KT SAC+ RM G +P MS + SGN GI AT P++ + NE E+L++++ LS
Sbjct: 210 SNMVNKTTSACNTRMIGENMPAMSVAKSGNMGIMATLPIIAYDYSNEQNQEKLIKSILLS 269
Query: 301 HLTAIYIKQNLGALSALCGCIV-ACTGSSCGITYLMGG-DYNNICYAVKNMIANLTGMVC 358
L IY LS++CGC+ G+ G+ Y G + + A + ANL G++C
Sbjct: 270 VLVTIYATYKSSYLSSMCGCVSKGGMGAVIGLCYYKNGKNIKKLDSAARTFTANLPGIIC 329
Query: 359 DGAKPSCALKVSSGVSTAILSAMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEM 418
DG K CALK++SG A S + ++ GI+ K+ + + N++ + K M ++
Sbjct: 330 DGGKVGCALKLASGCFAAYSSLFVD-----ISYENGIVGKNFKECVENISEISK-IMGDL 383
Query: 419 DIMVLDIMTSK 429
D ++ IM+ K
Sbjct: 384 DSDIVKIMSKK 394
>gi|30064453|ref|NP_838624.1| hypothetical protein S3360 [Shigella flexneri 2a str. 2457T]
gi|30042712|gb|AAP18435.1| hypothetical protein S3360 [Shigella flexneri 2a str. 2457T]
Length = 269
Score = 122 bits (307), Expect = 4e-26, Method: Composition-based stats.
Identities = 73/214 (34%), Positives = 115/214 (53%), Gaps = 2/214 (0%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
++ I + +V PA+GCTEP+++AL A A L E++ A++S N++KN +GV +PG
Sbjct: 11 QRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPG 70
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TGM+GLPIA +LGAL G + LEV+KD T + + + K ++ ++ +K++E E L+
Sbjct: 71 TGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFS 130
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEGAPTDNKDI--QLNLKMVW 185
G K A I HTN V+ E V+ Q EG + + L +
Sbjct: 131 RAKVWNGEKWACVTIVGGHTNIVHIETHNGVVFTHQACVAEGEQESPLTVLSRTTLAEIL 190
Query: 186 DFATTTPINEIEFILEAKRYNMNAAEEALKGNYG 219
F P I FIL++ + N ++E L G +G
Sbjct: 191 KFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWG 224
>gi|7465932|pir||B65100 hypothetical 19.4 kD protein in exuR-tdcC intergenic region -
Escherichia coli (strain K-12)
gi|606050|gb|AAA57913.1| ORF_f187 [Escherichia coli]
Length = 187
Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats.
Identities = 61/162 (37%), Positives = 95/162 (58%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
++ I + +V PA+GCTEP+++AL A A L E++ A++S N++KN +GV +PG
Sbjct: 11 QRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPG 70
Query: 68 TGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITEKLYI 127
TGM+GLPIA +LGAL G + LEV+KD T + + + K ++ ++ +K++E E L+
Sbjct: 71 TGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCDEILFS 130
Query: 128 EMTCEAGGKKATAIISKTHTNFVYEEADGKVLLDKQVPAEEG 169
G K A I HTN V+ E V+ +Q EG
Sbjct: 131 RAKVWNGEKWACVTIVGGHTNIVHIETHDGVVFTQQACVAEG 172
>gi|34764933|ref|ZP_00145275.1| hypothetical protein [Fusobacterium nucleatum subsp. vincentii ATCC
49256]
gi|27885726|gb|EAA23126.1| hypothetical protein [Fusobacterium nucleatum subsp. vincentii ATCC
49256]
Length = 111
Score = 95.1 bits (235), Expect = 8e-18, Method: Composition-based stats.
Identities = 43/108 (39%), Positives = 71/108 (65%)
Query: 320 CIVACTGSSCGITYLMGGDYNNICYAVKNMIANLTGMVCDGAKPSCALKVSSGVSTAILS 379
C + C+G + +T+L GG Y +C A+ N++ NL+G++CDGAK SCA+K+SSG+ +A +
Sbjct: 3 CYMCCSGVAAALTFLHGGSYEMVCDAITNILGNLSGVICDGAKASCAMKISSGIYSAFDA 62
Query: 380 AMLSMENHHVTEAEGIIDKDIDKSIRNLTSLGKDAMNEMDIMVLDIMT 427
ML++ + +GI+ DI+++IRN+ L + M D +L IMT
Sbjct: 63 TMLALHKDVLKSGDGIVGVDIEETIRNVGELAQSGMKGTDETILGIMT 110
>gi|145640967|ref|ZP_01796549.1| tRNA (uracil-5-)-methyltransferase [Haemophilus influenzae R3021]
gi|145274481|gb|EDK14345.1| tRNA (uracil-5-)-methyltransferase [Haemophilus influenzae
22.4-21]
Length = 116
Score = 81.3 bits (199), Expect = 1e-13, Method: Composition-based stats.
Identities = 39/91 (42%), Positives = 62/91 (68%)
Query: 6 IREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGI 65
I + ++ ++ VVPA+GCTEP+++AL +A A + LG+ PE+I +S N++KN +GV +
Sbjct: 9 IEKPLLHIVKHDVVPALGCTEPISLALESATAAKYLGKTPERIEVKVSPNLMKNGLGVAV 68
Query: 66 PGTGMIGLPIAISLGALIGKSEYQLEVIKDL 96
PGTGM+GLPIA ++ L +L +I L
Sbjct: 69 PGTGMVGLPIAAAMKVLSNTILIELLIISAL 99
>gi|145630605|ref|ZP_01786385.1| tRNA (uracil-5-)-methyltransferase [Haemophilus influenzae
22.4-21]
gi|144983995|gb|EDJ91437.1| tRNA (uracil-5-)-methyltransferase [Haemophilus influenzae R3021]
Length = 122
Score = 81.3 bits (199), Expect = 1e-13, Method: Composition-based stats.
Identities = 39/91 (42%), Positives = 62/91 (68%)
Query: 6 IREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGI 65
I + ++ ++ VVPA+GCTEP+++AL +A A + LG+ PE+I +S N++KN +GV +
Sbjct: 9 IEKPLLHIVKHDVVPALGCTEPISLALESATAAKYLGKTPERIEVKVSPNLMKNGLGVAV 68
Query: 66 PGTGMIGLPIAISLGALIGKSEYQLEVIKDL 96
PGTGM+GLPIA ++ L +L +I L
Sbjct: 69 PGTGMVGLPIAAAMKVLSNTILIELLIISAL 99
>gi|145953744|ref|ZP_01802752.1| hypothetical protein CdifQ_04003753 [Clostridium difficile
QCD-32g58]
Length = 136
Score = 80.1 bits (196), Expect = 2e-13, Method: Composition-based stats.
Identities = 51/126 (40%), Positives = 83/126 (65%), Gaps = 1/126 (0%)
Query: 4 KNIREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGV 63
++IR+++++++ +V PAVGCTEP+A+AL A+A ELLG++ + +S +I KN M V
Sbjct: 2 RDIRKELVEMLKAEVKPAVGCTEPVALALACAKAKELLGEEIVENRMLVSPSIYKNGMCV 61
Query: 64 GIPGTGMIGLPIAISLGALIGKSEYQLEVIKDLTPETLEEGKQYVSENRIDIKLKEGITE 123
GIPGT +GL IA +LG + G SE L V++ LT E ++ + Y+ + I + E
Sbjct: 62 GIPGTERLGLKIAAALGIVGGHSENGLSVLETLTKEEVKIAEDYMDNTPLSITPAD-TRE 120
Query: 124 KLYIEM 129
K++IE+
Sbjct: 121 KVFIEV 126
>gi|145632224|ref|ZP_01787959.1| tRNA (uracil-5-)-methyltransferase [Haemophilus influenzae 3655]
gi|145634722|ref|ZP_01790430.1| hypothetical protein CGSHiAA_03938 [Haemophilus influenzae
PittAA]
gi|144987131|gb|EDJ93661.1| tRNA (uracil-5-)-methyltransferase [Haemophilus influenzae 3655]
gi|145267888|gb|EDK07884.1| hypothetical protein CGSHiAA_03938 [Haemophilus influenzae
PittAA]
Length = 116
Score = 73.9 bits (180), Expect = 2e-11, Method: Composition-based stats.
Identities = 39/91 (42%), Positives = 62/91 (68%)
Query: 6 IREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGI 65
I + ++ ++ VVPA+GCTEP+++AL +A A + LG+ PE+I +S N++KN +GV +
Sbjct: 9 IEKPLLHIVKHDVVPALGCTEPISLALASATAAKYLGKTPERIEVKVSPNLMKNGLGVAV 68
Query: 66 PGTGMIGLPIAISLGALIGKSEYQLEVIKDL 96
PGTGM+GLPIA ++ L +L +I L
Sbjct: 69 PGTGMVGLPIAAAMKVLSNTILIELLIISAL 99
>gi|68249449|ref|YP_248561.1| hypothetical protein NTHI1023 [Haemophilus influenzae 86-028NP]
gi|68057648|gb|AAX87901.1| conserved hypothetical protein [Haemophilus influenzae 86-028NP]
Length = 115
Score = 73.2 bits (178), Expect = 3e-11, Method: Composition-based stats.
Identities = 39/91 (42%), Positives = 62/91 (68%)
Query: 6 IREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGI 65
I + ++ ++ VVPA+GCTEP+++AL +A A + LG+ PE+I +S N++KN +GV +
Sbjct: 9 IEKPLLHIVKHDVVPALGCTEPISLALASATAAKYLGKTPERIEVKVSPNLMKNGLGVAV 68
Query: 66 PGTGMIGLPIAISLGALIGKSEYQLEVIKDL 96
PGTGM+GLPIA ++ L +L +I L
Sbjct: 69 PGTGMVGLPIAAAMKVLSNTILIELLIISAL 99
>gi|16272795|ref|NP_439015.1| hypothetical protein HI0855 [Haemophilus influenzae Rd KW20]
gi|1176162|sp|P44904|Y855_HAEIN Uncharacterized protein HI0855
gi|1573870|gb|AAC22514.1| conserved hypothetical protein [Haemophilus influenzae Rd KW20]
Length = 115
Score = 73.2 bits (178), Expect = 4e-11, Method: Composition-based stats.
Identities = 36/77 (46%), Positives = 58/77 (75%)
Query: 6 IREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGI 65
I + ++ ++ V+PA+GCTEP+++AL +A A + LG+ PE+I A +S N++KN +GV +
Sbjct: 9 IEKPLLHIVKHDVMPALGCTEPISLALASATAAKYLGKTPERIEAKVSPNLMKNGLGVAV 68
Query: 66 PGTGMIGLPIAISLGAL 82
PGTGM+GLPIA ++ L
Sbjct: 69 PGTGMVGLPIAAAMKVL 85
>gi|46133161|ref|ZP_00156710.2| COG3681: Uncharacterized conserved protein [Haemophilus
influenzae R2866]
gi|145628243|ref|ZP_01784044.1| hypothetical protein CGSHi22121_04360 [Haemophilus influenzae
22.1-21]
gi|145638341|ref|ZP_01793951.1| hypothetical protein CGSHiII_06204 [Haemophilus influenzae
PittII]
gi|144980018|gb|EDJ89677.1| hypothetical protein CGSHi22121_04360 [Haemophilus influenzae
22.1-21]
gi|145272670|gb|EDK12577.1| hypothetical protein CGSHiII_06204 [Haemophilus influenzae
PittII]
Length = 114
Score = 71.6 bits (174), Expect = 9e-11, Method: Composition-based stats.
Identities = 35/74 (47%), Positives = 56/74 (75%)
Query: 6 IREQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGI 65
I + ++ ++ VVP +GCTEP+++AL +A A + LG+ PE+I A +S N++KN +GV +
Sbjct: 9 IEKPLLHIVKHDVVPTLGCTEPISLALASATAAKYLGKTPERIEAKVSPNLMKNGLGVAV 68
Query: 66 PGTGMIGLPIAISL 79
PGTGM+GLPIA ++
Sbjct: 69 PGTGMVGLPIAAAM 82
>gi|34764928|ref|ZP_00145272.1| hypothetical protein [Fusobacterium nucleatum subsp. vincentii
ATCC 49256]
gi|27885733|gb|EAA23131.1| hypothetical protein [Fusobacterium nucleatum subsp. vincentii
ATCC 49256]
Length = 84
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 32/74 (43%), Positives = 50/74 (67%), Gaps = 1/74 (1%)
Query: 8 EQIIDLMHRQVVPAVGCTEPMAVALCTARATELLGQKPEKISAFLSANILKNAMGVGIPG 67
E+++ ++ ++V A GCTEP+A++ A+A +LG P K+ FLS NI+KN V IP
Sbjct: 6 EKVLKILEEEIVAAEGCTEPIALSYAAAKARRILGAIPNKVDVFLSGNIIKNVKSVTIPN 65
Query: 68 T-GMIGLPIAISLG 80
+ GMIG+ AI++G
Sbjct: 66 SEGMIGIEPAIAMG 79
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.316 0.132 0.378
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,540,533,706
Number of Sequences: 5470121
Number of extensions: 64116198
Number of successful extensions: 168547
Number of sequences better than 1.0e-05: 124
Number of HSP's better than 0.0 without gapping: 124
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 168171
Number of HSP's gapped (non-prelim): 126
length of query: 432
length of database: 1,894,087,724
effective HSP length: 136
effective length of query: 296
effective length of database: 1,150,151,268
effective search space: 340444775328
effective search space used: 340444775328
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 131 (55.1 bits)