BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= FNP_2031
(395 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|148324547|gb|EDK89797.1| hypothetical protein FNP_2031 [... 780 0.0
gi|34762954|ref|ZP_00143933.1| hypothetical protein [Fusoba... 763 0.0
gi|83648213|ref|YP_436648.1| hypothetical protein HCH_05563... 323 9e-87
gi|148658114|ref|YP_001278319.1| VWA containing CoxE family... 317 6e-85
gi|121606630|ref|YP_983959.1| VWA containing CoxE family pr... 304 7e-81
gi|156453078|ref|ZP_02059444.1| VWA containing CoxE family ... 302 2e-80
gi|153899801|ref|ZP_02020364.1| VWA containing CoxE family ... 301 7e-80
gi|118729369|ref|ZP_01577886.1| VWA containing CoxE-like [D... 300 9e-80
gi|54026625|ref|YP_120867.1| hypothetical protein nfa46520 ... 300 1e-79
gi|111220033|ref|YP_710827.1| hypothetical protein FRAAL054... 298 3e-79
gi|13476047|ref|NP_107617.1| hypothetical protein mlr7258 [... 297 6e-79
gi|118053246|ref|ZP_01521791.1| VWA containing CoxE-like [C... 296 2e-78
gi|72161865|ref|YP_289522.1| von Willebrand factor, type A ... 293 1e-77
gi|111018905|ref|YP_701877.1| hypothetical protein RHA1_ro0... 289 2e-76
gi|29349318|ref|NP_812821.1| hypothetical protein BT_3910 [... 287 1e-75
gi|152966010|ref|YP_001361794.1| VWA containing CoxE family... 286 2e-75
gi|21223183|ref|NP_628962.1| hypothetical protein SCO4805 [... 285 5e-75
gi|119963870|ref|YP_947772.1| VWA domain containing CoxE-li... 283 1e-74
gi|146337943|ref|YP_001202991.1| conserved hypothetical pro... 279 2e-73
gi|117168622|gb|ABK32286.1| Jer6 [Polyangium cellulosum] 277 1e-72
gi|29829998|ref|NP_824632.1| hypothetical protein SAV3455 [... 277 1e-72
gi|117168589|gb|ABK32254.1| Amb6 [Polyangium cellulosum] 276 1e-72
gi|68235386|ref|ZP_00574391.1| VWA containing CoxE-like [Fr... 274 7e-72
gi|115380185|ref|ZP_01467212.1| von Willebrand factor, type... 259 2e-67
gi|150017599|ref|YP_001309853.1| VWA containing CoxE family... 219 2e-55
gi|24213397|ref|NP_710878.1| hypothetical protein LA0697 [L... 208 5e-52
gi|45658734|ref|YP_002820.1| hypothetical protein LIC12904 ... 207 8e-52
gi|26248499|ref|NP_754539.1| Hypothetical protein yehP [Esc... 207 1e-51
gi|75196166|ref|ZP_00706236.1| hypothetical protein EcolH_0... 205 5e-51
gi|15802605|ref|NP_288632.1| hypothetical protein Z3294 [Es... 204 6e-51
gi|78062407|ref|YP_372315.1| VWA containing CoxE-like prote... 204 6e-51
gi|75209876|ref|ZP_00710068.1| hypothetical protein EcolB_0... 204 1e-50
gi|75236308|ref|ZP_00720417.1| hypothetical protein EcolE1_... 204 1e-50
gi|75242937|ref|ZP_00726643.1| COG2304: Uncharacterized pro... 203 2e-50
gi|30063554|ref|NP_837725.1| hypothetical protein S2315 [Sh... 203 2e-50
gi|83646059|ref|YP_434494.1| VWA_CoxE family protein [Hahel... 202 3e-50
gi|91211406|ref|YP_541392.1| hypothetical protein UTI89_C23... 202 3e-50
gi|117624324|ref|YP_853237.1| hypothetical protein APECO1_4... 202 3e-50
gi|83588103|ref|ZP_00926728.1| COG2425: Uncharacterized pro... 202 4e-50
gi|16130059|ref|NP_416625.1| conserved protein [Escherichia... 201 5e-50
gi|75514135|ref|ZP_00736465.1| hypothetical protein Ecol5_0... 201 5e-50
gi|82543488|ref|YP_407435.1| hypothetical protein SBO_0944 ... 201 8e-50
gi|75231539|ref|ZP_00717908.1| hypothetical protein EcolB7_... 200 1e-49
gi|75186911|ref|ZP_00700178.1| hypothetical protein EcolE_0... 197 8e-49
gi|145595401|ref|YP_001159698.1| VWA containing CoxE family... 187 1e-45
gi|11875069|dbj|BAB19548.1| hypothetical protein [Escherich... 179 4e-43
gi|23011912|ref|ZP_00052135.1| hypothetical protein Magn030... 177 2e-42
gi|113940946|ref|ZP_01426763.1| VWA containing CoxE-like [H... 143 2e-32
gi|149173116|ref|ZP_01851747.1| hypothetical protein PM8797... 140 2e-31
gi|32473270|ref|NP_866264.1| hypothetical protein RB4715 [R... 136 3e-30
gi|21219695|ref|NP_625474.1| hypothetical protein SCO1184 [... 136 3e-30
gi|21224983|ref|NP_630762.1| hypothetical protein SCO6688 [... 130 2e-28
gi|29827527|ref|NP_822161.1| hypothetical protein SAV986 [S... 119 3e-25
gi|119881509|ref|ZP_01647803.1| VWA containing CoxE-like [S... 115 5e-24
gi|145594552|ref|YP_001158849.1| VWA containing CoxE family... 111 8e-23
gi|124526698|ref|ZP_01698607.1| VWA containing CoxE family ... 107 1e-21
gi|82777399|ref|YP_403748.1| hypothetical protein SDY_2171 ... 105 7e-21
gi|124526697|ref|ZP_01698606.1| conserved hypothetical prot... 93 3e-17
gi|82777400|ref|YP_403749.1| hypothetical protein SDY_2172 ... 90 2e-16
>gi|148324547|gb|EDK89797.1| hypothetical protein FNP_2031 [Fusobacterium nucleatum subsp.
polymorphum ATCC 10953]
Length = 395
Score = 780 bits (2015), Expect = 0.0, Method: Composition-based stats.
Identities = 395/395 (100%), Positives = 395/395 (100%)
Query: 1 MDIKEDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVG 60
MDIKEDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVG
Sbjct: 1 MDIKEDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVG 60
Query: 61 AGAGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINL 120
AGAGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINL
Sbjct: 61 AGAGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINL 120
Query: 121 ASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSL 180
ASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSL
Sbjct: 121 ASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSL 180
Query: 181 DFKTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSV 240
DFKTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSV
Sbjct: 181 DFKTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSV 240
Query: 241 MACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIEN 300
MACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIEN
Sbjct: 241 MACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIEN 300
Query: 301 PKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASM 360
PKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASM
Sbjct: 301 PKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASM 360
Query: 361 GIPCFACNPEKLPLLLERVLKNLDLSSFQQEFKKK 395
GIPCFACNPEKLPLLLERVLKNLDLSSFQQEFKKK
Sbjct: 361 GIPCFACNPEKLPLLLERVLKNLDLSSFQQEFKKK 395
>gi|34762954|ref|ZP_00143933.1| hypothetical protein [Fusobacterium nucleatum subsp. vincentii ATCC
49256]
gi|27887377|gb|EAA24468.1| hypothetical protein [Fusobacterium nucleatum subsp. vincentii ATCC
49256]
Length = 396
Score = 763 bits (1971), Expect = 0.0, Method: Composition-based stats.
Identities = 385/395 (97%), Positives = 393/395 (99%)
Query: 1 MDIKEDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVG 60
M+IKEDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGE G
Sbjct: 1 MNIKEDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGESG 60
Query: 61 AGAGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINL 120
AGAGKGPSN QIS+WLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINL
Sbjct: 61 AGAGKGPSNLQISKWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINL 120
Query: 121 ASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSL 180
ASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSL
Sbjct: 121 ASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSL 180
Query: 181 DFKTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSV 240
DFKTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFT+ILDIDQSGSMGESVIYSSV
Sbjct: 181 DFKTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTIILDIDQSGSMGESVIYSSV 240
Query: 241 MACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIEN 300
MACILAS+ASLKTR+VAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIEN
Sbjct: 241 MACILASMASLKTRIVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIEN 300
Query: 301 PKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASM 360
PKKTIFFLISDLMEGGNRGGMLRNLEDMKEAG+TVVCLLAISGDGQPYYDAQMAGKIA+M
Sbjct: 301 PKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGITVVCLLAISGDGQPYYDAQMAGKIAAM 360
Query: 361 GIPCFACNPEKLPLLLERVLKNLDLSSFQQEFKKK 395
GIPCFACNPEKLPLLLERVLKNLDLSSFQ+EFKKK
Sbjct: 361 GIPCFACNPEKLPLLLERVLKNLDLSSFQEEFKKK 395
>gi|83648213|ref|YP_436648.1| hypothetical protein HCH_05563 [Hahella chejuensis KCTC 2396]
gi|83636256|gb|ABC32223.1| conserved hypothetical protein [Hahella chejuensis KCTC 2396]
Length = 383
Score = 323 bits (829), Expect = 9e-87, Method: Composition-based stats.
Identities = 159/378 (42%), Positives = 249/378 (65%), Gaps = 18/378 (4%)
Query: 3 IKEDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAG 62
+ E +RWRL+LG D +++ +S S ED +D+ L+A+Y G+ G
Sbjct: 4 LSERERRWRLLLGGDGQDN---------ASMSAEDMQLDQILNALYGSDGE-------RG 47
Query: 63 AGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLAS 122
A S P+++RWLGDIR F +V+++Q DA +R L++++ EPE+LE V+PDI+L +
Sbjct: 48 ADLSSSAPKVARWLGDIRERFPSSVVRVMQKDAFERLNLERMLLEPEMLEAVQPDIHLVA 107
Query: 123 TIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDF 182
+M L IP++SKE+ R ++KIVEE+ + L+S ++A+R +LN+ P + +D+
Sbjct: 108 NLMSLGHLIPERSKETARQVVRKIVEELMRKLQSSTEQAIRGSLNRAMRKQRPRHADIDW 167
Query: 183 KTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMA 242
TI+ +K+Y E + I+PE + R P++ V+L +DQSGSM SVIY+S+ A
Sbjct: 168 ARTIRANLKHYQPEYRTIVPERLIGYGR--KQPSALKEVMLCVDQSGSMASSVIYASIFA 225
Query: 243 CILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPK 302
++ASI +LKT++V FDT +VDLTEK DPVD+L+G QLGGGTDINK++ YC + I P+
Sbjct: 226 AVMASIPALKTQLVVFDTAVVDLTEKLQDPVDVLFGVQLGGGTDINKAVTYCQQQITKPQ 285
Query: 303 KTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGI 362
T F LI+DL EGG++ +++ + ++K AGV V+ LLA++ DG PY+D ++A + AS+ +
Sbjct: 286 DTSFILITDLYEGGDQKQLVQRVAELKNAGVNVITLLALNDDGAPYFDKRLAQQFASLDV 345
Query: 363 PCFACNPEKLPLLLERVL 380
P FAC P++ P L+ L
Sbjct: 346 PTFACTPDQFPDLMAVAL 363
>gi|148658114|ref|YP_001278319.1| VWA containing CoxE family protein [Roseiflexus sp. RS-1]
gi|148570224|gb|ABQ92369.1| VWA containing CoxE family protein [Roseiflexus sp. RS-1]
Length = 401
Score = 317 bits (813), Expect = 6e-85, Method: Composition-based stats.
Identities = 155/382 (40%), Positives = 239/382 (62%), Gaps = 14/382 (3%)
Query: 4 KEDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGA 63
+E ++RWRLILG + D S +E D MD+AL A+Y+ E
Sbjct: 13 QERLRRWRLILGGEA-------DGTGFSLSNETDLGMDQALAALYS------RAEQKGRG 59
Query: 64 GKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLAST 123
G S P ++RWLGDIR F +V I+Q DA++R L+Q++ +PE+LE V PDI+L ST
Sbjct: 60 GLSASAPHVARWLGDIRTYFPAPVVHIMQKDALERLNLRQMLLQPEMLEAVTPDIHLVST 119
Query: 124 IMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFK 183
I+ L + IP++++ + R ++ +VE++ K LE +++A+ AL++ H P + +D+
Sbjct: 120 ILSLSKVIPEQTRHTARQLVRALVEQLEKKLEGPLRQAISGALHRAAHHRRPRYADMDWN 179
Query: 184 TTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMAC 243
TI+ +K+Y K IIPE +ER + + +IL +DQSGSM SV+Y+S+ A
Sbjct: 180 RTIRANLKHYQPNYKTIIPERRIGYERRKSASRLR-DIILCVDQSGSMASSVVYASIYAA 238
Query: 244 ILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKK 303
++AS+ ++KT +V FDT ++DLT + DPVD+L+G QLGGGTDI +++ YC I P
Sbjct: 239 VMASLPAIKTSLVLFDTAVIDLTPELHDPVDVLFGAQLGGGTDIGQALTYCQSLIHVPND 298
Query: 304 TIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIP 363
TI LISDL EGG+ +LR+ + AGV V+ LLA+S +G+P + Q+A ++ ++GIP
Sbjct: 299 TILILISDLYEGGHPDRLLRHAASLVNAGVQVITLLALSDEGRPSFHHQIAARLVALGIP 358
Query: 364 CFACNPEKLPLLLERVLKNLDL 385
CFAC P++ P L+ +K DL
Sbjct: 359 CFACTPDQFPGLMAAAIKREDL 380
>gi|121606630|ref|YP_983959.1| VWA containing CoxE family protein [Polaromonas naphthalenivorans
CJ2]
gi|120595599|gb|ABM39038.1| VWA containing CoxE family protein [Polaromonas naphthalenivorans
CJ2]
Length = 403
Score = 304 bits (778), Expect = 7e-81, Method: Composition-based stats.
Identities = 151/383 (39%), Positives = 241/383 (62%), Gaps = 11/383 (2%)
Query: 7 IKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGA-GAGK 65
++RWRL+LG + E + MD+AL A+Y+ GK G + G+
Sbjct: 19 LRRWRLVLGGEAESSCGKLSGAPAE--------MDQALSALYDADGKNGLGRSSSRQGGR 70
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
G S P ++RWLGDIR F +V+++Q DA++R L+ ++ +PE+LE V+PD++L ++++
Sbjct: 71 GGSAPSVARWLGDIRKYFPSSVVQVMQHDALERLNLRDMLLQPEMLESVQPDVHLVASLI 130
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
L IP +KE+ R ++K+VE + K LE ++ AV AL++ Q + P S +D+ T
Sbjct: 131 SLSRVIPATTKETARMVVRKVVEALLKKLEEPMRSAVSGALDRSQRNRRPRHSEIDWNRT 190
Query: 186 IQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACIL 245
I+ +K++ + + I+PE + R + +P + V+L IDQSGSM SV+YSS+ ++
Sbjct: 191 IRANLKHWQPDYRTIVPERLIGYGRKARSPQRE--VVLCIDQSGSMAASVVYSSIFGAVM 248
Query: 246 ASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTI 305
AS+ ++ T++V FDT IVDLTE+ DDPV+LL+G QLGGGTDIN ++ YC I P+ TI
Sbjct: 249 ASLPAVATKLVVFDTAIVDLTEQLDDPVELLFGVQLGGGTDINGAVGYCQSVIREPRNTI 308
Query: 306 FFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCF 365
LISDL EGG +LR ++ +GV + LLA+S +G P YD +A K+A++G+P F
Sbjct: 309 LVLISDLYEGGVEANLLRRAAELVASGVQFITLLALSDEGAPSYDRALAAKLAALGVPSF 368
Query: 366 ACNPEKLPLLLERVLKNLDLSSF 388
AC P+ P L+ ++ D++++
Sbjct: 369 ACTPDAFPGLMAAAIRKEDINAW 391
>gi|156453078|ref|ZP_02059444.1| VWA containing CoxE family protein [Methylobacterium
chloromethanicum CM4]
gi|156187956|gb|EDO20159.1| VWA containing CoxE family protein [Methylobacterium
chloromethanicum CM4]
Length = 387
Score = 302 bits (774), Expect = 2e-80, Method: Composition-based stats.
Identities = 151/382 (39%), Positives = 238/382 (62%), Gaps = 18/382 (4%)
Query: 5 EDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAG 64
E ++RWRL+LG ED + SE D +DRA+ A+Y+ K G
Sbjct: 7 ERLRRWRLVLGGGAAEDTGC-------TLSERDQRLDRAMGALYDSDRK---------GG 50
Query: 65 KGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTI 124
G S P + RWLGDIR+ F +V+++Q DA +R LK+++ EPE+LE V+PD++L ST+
Sbjct: 51 LGASAPSVPRWLGDIRDYFPASVVQVMQRDAFERLDLKRMLTEPEMLEAVQPDVSLVSTL 110
Query: 125 MLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKT 184
+ ++ + ++KE+ R ++K+V+ + K LE +++AV A+++ + P + +D+
Sbjct: 111 ISMRGLLGGRTKETARLVVRKVVDALMKRLEEPLRQAVTGAIDRASINRRPRHAEIDWNR 170
Query: 185 TIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACI 244
TI+ +++Y E + IIPE R S S VIL IDQSGSM SV+YSS+ +
Sbjct: 171 TIRANLRHYQAEYRTIIPETRLGHGRKSRG--SLKDVILCIDQSGSMANSVVYSSIFGAV 228
Query: 245 LASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKT 304
+AS+ ++ TR+V FDTE+VDL++ DDPV++L+ QLGGGT+IN+++ YC I P+ T
Sbjct: 229 MASLPAVSTRLVVFDTEVVDLSDAMDDPVEVLFSVQLGGGTNINRAVGYCASRITRPEDT 288
Query: 305 IFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPC 364
+ LISDL EGG G+L + + AGV VV LLA+S +G P YD +A ++A++G+P
Sbjct: 289 VLVLISDLYEGGVEAGLLAQAQRLVAAGVQVVALLALSDEGAPAYDRGLAARLAALGVPA 348
Query: 365 FACNPEKLPLLLERVLKNLDLS 386
FAC P+ P ++ ++ DL+
Sbjct: 349 FACTPDLFPDMMAAAIRKQDLN 370
>gi|153899801|ref|ZP_02020364.1| VWA containing CoxE family protein [Methylobacterium extorquens
PA1]
gi|151589257|gb|EDN52668.1| VWA containing CoxE family protein [Methylobacterium extorquens
PA1]
Length = 387
Score = 301 bits (770), Expect = 7e-80, Method: Composition-based stats.
Identities = 150/382 (39%), Positives = 237/382 (62%), Gaps = 18/382 (4%)
Query: 5 EDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAG 64
E ++RWRL+LG E + SE D +DRA+ A+Y+ K G
Sbjct: 7 ERLRRWRLVLGGGAAEGTGC-------TLSERDQRLDRAMGALYDSDRK---------GG 50
Query: 65 KGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTI 124
G S P + RWLGDIR+ F +V+++Q DA +R LK+++ EPE+LE V+PD++L ST+
Sbjct: 51 LGASAPSVPRWLGDIRDYFPASVVQVMQRDAFERLDLKRMLTEPEMLEAVQPDVSLVSTL 110
Query: 125 MLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKT 184
+ ++ + ++KE+ R ++K+V+ + K LE +++AV A+++ + P + +D+
Sbjct: 111 ISMRGLLGGRTKETARLVVRKVVDALMKRLEEPLRQAVTGAIDRASINRRPRHAEIDWNR 170
Query: 185 TIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACI 244
TI+ +++Y E + IIPE R S S VIL IDQSGSM SV+YSS+ +
Sbjct: 171 TIRANLRHYQAEYRTIIPETRLGHGRKSRG--SLKDVILCIDQSGSMANSVVYSSIFGAV 228
Query: 245 LASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKT 304
+AS+ ++ TR+V FDTE+VDL++ +DPV++L+ QLGGGTDIN+++ YC I P+ T
Sbjct: 229 MASLPAVSTRLVVFDTEVVDLSDAMNDPVEVLFSVQLGGGTDINRAVGYCASRITRPEDT 288
Query: 305 IFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPC 364
+ LISDL EGG G+L + + AGV VV LLA+S +G P YD +A ++A++G+P
Sbjct: 289 VLVLISDLYEGGVEAGLLAQAQRLVAAGVQVVALLALSDEGAPAYDRGLAARLAALGVPA 348
Query: 365 FACNPEKLPLLLERVLKNLDLS 386
FAC P+ P ++ ++ DL+
Sbjct: 349 FACTPDLFPDMMAAAIRKQDLN 370
>gi|118729369|ref|ZP_01577886.1| VWA containing CoxE-like [Delftia acidovorans SPH-1]
gi|118670814|gb|EAV77407.1| VWA containing CoxE-like [Delftia acidovorans SPH-1]
Length = 410
Score = 300 bits (769), Expect = 9e-80, Method: Composition-based stats.
Identities = 158/389 (40%), Positives = 252/389 (64%), Gaps = 11/389 (2%)
Query: 7 IKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGKG 66
++RWRL+LG+ +E + + SS E MD+AL A+Y K G + G+G
Sbjct: 28 LQRWRLVLGQPSEASCGGLGPKG-SSIDE----MDKALAALYEEDNK--DGGLSRRGGRG 80
Query: 67 PSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIML 126
S+P ++RWLGDIR F ++V+++Q DAM+R L++L+ +PE+LE V+PD+++ + ++
Sbjct: 81 NSSPSVARWLGDIRKYFPSQVVQVMQRDAMERLNLRELMLQPEMLEHVQPDVHMVADLIS 140
Query: 127 LKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTTI 186
L IPQ +KE+ R ++K+V+++ + LE ++ AV AL++ Q + P S +D+ TI
Sbjct: 141 LGSVIPQNTKETARIVVRKVVDDLMRRLEEPMRSAVSGALDRSQRNRRPRHSEIDWNRTI 200
Query: 187 QRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACILA 246
+ ++++ + + I+PE + R P + VIL IDQSGSM SV+YSS+ ++A
Sbjct: 201 RANLRHWQPDYRTIVPETLVGYGRKVRRPQRE--VILCIDQSGSMANSVVYSSIFGAVMA 258
Query: 247 SIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTIF 306
S+ ++ TR+V FDT +VDLT+K DPVD+L+G QLGGGTDIN+++ YC I P+ I
Sbjct: 259 SLPAVATRLVVFDTAVVDLTDKLSDPVDVLFGVQLGGGTDINRAVGYCQGLISEPRNAIV 318
Query: 307 FLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCFA 366
LISDL EGG G+LR ++ E+GV + LLA+S +G P YDAQ+A K+A++G+P FA
Sbjct: 319 VLISDLYEGGVESGLLRRASELVESGVQFITLLALSDEGAPAYDAQLAAKLAALGVPSFA 378
Query: 367 CNPEKLPLLLERVLKNLDLSSFQ--QEFK 393
C P+ P L+ ++ D++++ Q FK
Sbjct: 379 CTPDAFPQLMAAAIRRDDVAAWAAGQGFK 407
>gi|54026625|ref|YP_120867.1| hypothetical protein nfa46520 [Nocardia farcinica IFM 10152]
gi|54018133|dbj|BAD59503.1| hypothetical protein [Nocardia farcinica IFM 10152]
Length = 395
Score = 300 bits (768), Expect = 1e-79, Method: Composition-based stats.
Identities = 165/384 (42%), Positives = 237/384 (61%), Gaps = 17/384 (4%)
Query: 8 KRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAG---AG 64
+RWRL+LG E + + S +D +DRAL A+YN TG E GAG G
Sbjct: 13 RRWRLVLGAAAEPELGGLGSA-------DDVAVDRALGALYN-TGN----EQGAGPRAGG 60
Query: 65 KGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTI 124
G S P+++RWLGDIR F +V+++Q DA+DR L +L+ EPE+L VEPD++L T+
Sbjct: 61 LGGSAPRVARWLGDIRTYFPSTVVEVLQRDAIDRLHLTELLLEPELLAAVEPDVHLVGTL 120
Query: 125 MLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKT 184
+ L +P+ +K + RA ++++V I + + + + AV ALN+ P +D+
Sbjct: 121 LSLNRVMPETTKATARAVVEQVVRRIGERIATHTRTAVGGALNRAARVARPKLRDIDWDR 180
Query: 185 TIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACI 244
TI+R + +Y E + ++PE + R S + V+L +DQSGSM SV+Y+SV +
Sbjct: 181 TIRRNLAHYLPEHQTVVPERLVGYGRNSQ--AVRREVVLAVDQSGSMAASVVYASVFGAV 238
Query: 245 LASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKT 304
LAS+ SL+T +V FDTE+VDLT+ DPVD+L+G QLGGGTDIN++I YC I P T
Sbjct: 239 LASLRSLRTSLVVFDTEVVDLTDLLTDPVDVLFGTQLGGGTDINRAIAYCQSLITRPADT 298
Query: 305 IFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPC 364
+F LISDL EGG R MLR + +++AGV VV LLA+S DG P +D A +A +GIP
Sbjct: 299 LFVLISDLYEGGIRAEMLRRVNALRDAGVQVVVLLALSDDGAPSFDHDNAAALAGLGIPA 358
Query: 365 FACNPEKLPLLLERVLKNLDLSSF 388
FAC P+K P LL L D+ ++
Sbjct: 359 FACTPDKFPDLLAVALDRGDVHAW 382
>gi|111220033|ref|YP_710827.1| hypothetical protein FRAAL0543 [Frankia alni ACN14a]
gi|111147565|emb|CAJ59218.1| conserved hypothetical protein [Frankia alni ACN14a]
Length = 418
Score = 298 bits (764), Expect = 3e-79, Method: Composition-based stats.
Identities = 151/379 (39%), Positives = 230/379 (60%), Gaps = 7/379 (1%)
Query: 4 KEDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNP-TGKFMSGEVGAG 62
+E ++RWRL+LG + + S+ D +D AL A+Y+
Sbjct: 30 EERLRRWRLVLGAPA-----APAFGPARALSKRDADIDAALGALYDADGESGGGRGRERS 84
Query: 63 AGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLAS 122
AG G S P ++RWLGDIR+ F +V+++Q DA+ R GL QL+ EPE+L EPD++L
Sbjct: 85 AGLGGSAPAVTRWLGDIRDYFPSSVVQVLQHDAVQRLGLAQLLMEPELLAAAEPDVHLVG 144
Query: 123 TIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDF 182
T++ L+ +P++++E+ R ++ +VE+I + L ++ AV ALN+ + + P +D+
Sbjct: 145 TLLSLRSALPERTRETARRVVRMVVEDIERRLAQSLRSAVFGALNRGERARTPRLPDVDW 204
Query: 183 KTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMA 242
TI ++NY E + +I + ++RA + VIL IDQSGSM SV+Y+ V+
Sbjct: 205 NRTILANLRNYQPEQRTVIVDRLVGYQRARRVAALR-DVILLIDQSGSMASSVVYAGVLG 263
Query: 243 CILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPK 302
LASI SL TR+V FDT +VDLT+ DDPVD+L+G QLGGGTDI++++ Y + P
Sbjct: 264 ASLASIRSLSTRLVVFDTSVVDLTDSLDDPVDVLFGVQLGGGTDIDRAVGYGASLVRRPA 323
Query: 303 KTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGI 362
T+ LISDL+EG ++ ++ L + +AGVTV+ LLA+S DG P YD Q+A A +G
Sbjct: 324 DTVLILISDLIEGADQSSLIARLRALVDAGVTVIVLLALSDDGAPAYDHQLAATCAQLGA 383
Query: 363 PCFACNPEKLPLLLERVLK 381
P FAC P++ P LL L+
Sbjct: 384 PAFACTPDRFPELLATALR 402
>gi|13476047|ref|NP_107617.1| hypothetical protein mlr7258 [Mesorhizobium loti MAFF303099]
gi|14026807|dbj|BAB53403.1| mlr7258 [Mesorhizobium loti MAFF303099]
Length = 415
Score = 297 bits (761), Expect = 6e-79, Method: Composition-based stats.
Identities = 150/382 (39%), Positives = 233/382 (60%), Gaps = 10/382 (2%)
Query: 8 KRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIY-NPTGKFMSGEVGAGAGKG 66
+RWRL +G D + + S+ D + ALDA+Y + G + G G
Sbjct: 23 RRWRLAIGADDQSS---------PALSDTDKRLSAALDALYGDGAGDTTADPRKRRGGLG 73
Query: 67 PSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIML 126
S P++++W+GDIR+ F ++V+I+Q DA +R LKQ++ EPE L+ +E D+NL + ++
Sbjct: 74 RSAPRVAQWMGDIRSFFPAQVVQIVQKDAFERLNLKQMLMEPEFLKAIEADVNLVADLIS 133
Query: 127 LKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTTI 186
L+ +P K+K+ R I IV ++ + LE A+R AL++ Q + P +D+ TI
Sbjct: 134 LRSVMPAKTKDIARTIIADIVAKLMQRLEQKTAEAIRGALDRSQRTNRPRQRDIDWPRTI 193
Query: 187 QRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACILA 246
+++Y E K I+PE F R V+L +DQSGSM SVIY+S+ A ++A
Sbjct: 194 SANLRHYQAEHKTIVPERLVGFMRKQRRLVDLDEVVLCVDQSGSMASSVIYASIFAAVMA 253
Query: 247 SIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTIF 306
S+ ++T++V FDT IVDLTE+ DPV++L+G QLGGGTDIN+++ YC IE P K+
Sbjct: 254 SLPVVRTKLVCFDTAIVDLTEELSDPVEVLFGVQLGGGTDINQAVAYCADRIERPTKSHM 313
Query: 307 FLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCFA 366
LI+DL EGGN +L+ L + +GV VV LLA++ G+P YD +MAG +A++GIP F
Sbjct: 314 VLITDLYEGGNGQELLQRLASLVRSGVNVVVLLALTDQGRPGYDPKMAGSVAALGIPVFT 373
Query: 367 CNPEKLPLLLERVLKNLDLSSF 388
C P+ P ++ L+ D+S++
Sbjct: 374 CTPDLFPDMMAAALRREDVSAW 395
>gi|118053246|ref|ZP_01521791.1| VWA containing CoxE-like [Comamonas testosteroni KF-1]
gi|117999552|gb|EAV13711.1| VWA containing CoxE-like [Comamonas testosteroni KF-1]
Length = 401
Score = 296 bits (758), Expect = 2e-78, Method: Composition-based stats.
Identities = 159/389 (40%), Positives = 250/389 (64%), Gaps = 16/389 (4%)
Query: 7 IKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGKG 66
++RWR++LG + ++ +E MD+AL A+Y K S + G+G
Sbjct: 24 LQRWRMVLGSPADASCG-----GVTGRLQE---MDQALAALYEEDSKLASRK----GGRG 71
Query: 67 PSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIML 126
S+P +SRWLGDIR F ++V+++Q DAM+R L++L+ +PE+LE V+PD++L + ++
Sbjct: 72 NSSPSVSRWLGDIRKYFPSQVVQVMQRDAMERLNLRELLLQPEMLENVQPDVHLVADLIS 131
Query: 127 LKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTTI 186
L IPQ +K + R ++K+V+E+ K LE ++ AV AL++ Q + P + +D+ TI
Sbjct: 132 LGSVIPQNTKATARLVVRKVVDELMKKLEEPMRSAVAGALDRSQRNRRPRHAEIDWNRTI 191
Query: 187 QRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACILA 246
+ ++++ E K I+PE + R + P + VIL IDQSGSM SV+YSS+ ++A
Sbjct: 192 RANLRHWQPEYKTIVPETLIGYGRKARRPQRE--VILCIDQSGSMANSVVYSSIFGAVMA 249
Query: 247 SIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTIF 306
S+ ++ T++V FDT +VDLTEK DDPVD+L+G QLGGGTDIN ++ YC I P+ +I
Sbjct: 250 SLPAVATKLVVFDTAVVDLTEKLDDPVDVLFGVQLGGGTDINGAVGYCQGLISEPRNSIL 309
Query: 307 FLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCFA 366
LISDL EGG G+LR ++ EAGV + LLA+S +G P YDA++A K+A++G+P FA
Sbjct: 310 VLISDLYEGGVESGLLRRANELVEAGVQFITLLALSDEGAPAYDAELAAKLAALGVPSFA 369
Query: 367 CNPEKLPLLLERVLKNLDLSSF--QQEFK 393
C P+ P L+ ++ D++++ Q FK
Sbjct: 370 CTPDAFPQLMAAAIRRDDVAAWAATQGFK 398
>gi|72161865|ref|YP_289522.1| von Willebrand factor, type A [Thermobifida fusca YX]
gi|71915597|gb|AAZ55499.1| von Willebrand factor, type A [Thermobifida fusca YX]
Length = 410
Score = 293 bits (751), Expect = 1e-77, Method: Composition-based stats.
Identities = 154/396 (38%), Positives = 243/396 (61%), Gaps = 9/396 (2%)
Query: 3 IKEDIKRWRLILGKDTEEDFSSMDSEAISS----FSEEDWLMDRALDAIYN--PTGKFMS 56
+ E +RWRL+LG+ E ++ + + +D +D AL A+YN TG +
Sbjct: 11 LPERARRWRLVLGEAAENACAAATGATTPATGTVLNRDDARIDAALAALYNYSDTGGRLR 70
Query: 57 GEVGAGAGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEP 116
G A S P ++RWLGDIR F +V++IQ DA+ R L L+ EPE+++ VEP
Sbjct: 71 GPGKRSASLDSSAPTVARWLGDIRTYFPSSVVRVIQHDALTRLNLTTLLLEPEMMDAVEP 130
Query: 117 DINLASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPS 176
D++L T++ LK+ +P+ ++E+ R ++++V ++ + L + V+ AL++ + P
Sbjct: 131 DVHLVGTLLALKDVMPESARETARTVVRRVVNDLERKLAHHTRSTVQGALDRSARTTRPR 190
Query: 177 -ASSLDFKTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESV 235
S +D+ TI+R +++Y E + I+P+ + R S + VIL +DQSGSM SV
Sbjct: 191 RVSDIDWDRTIRRNLQHYLPEHRTIVPQTLVGYARRSRG--VQRDVILAVDQSGSMASSV 248
Query: 236 IYSSVMACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCM 295
+Y+SV + +LAS+ +L+T +V FDT +VDLT++ DPVD+L+G QLGGGTDIN++I YC
Sbjct: 249 VYASVFSAVLASLRTLRTSLVVFDTAVVDLTDQLSDPVDVLFGTQLGGGTDINRAIAYCQ 308
Query: 296 KYIENPKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAG 355
I P +IF LISDL EGG R MLR + +M AGV V+ LLA+S DG P Y A
Sbjct: 309 GLITRPANSIFVLISDLYEGGIREEMLRRVAEMTAAGVQVIVLLALSDDGAPAYHHDNAA 368
Query: 356 KIASMGIPCFACNPEKLPLLLERVLKNLDLSSFQQE 391
+A++G+P FAC P+K P L+ ++ L+++ ++
Sbjct: 369 ALAALGVPAFACTPDKFPDLMAAAIQGQSLTAWVEQ 404
>gi|111018905|ref|YP_701877.1| hypothetical protein RHA1_ro01908 [Rhodococcus sp. RHA1]
gi|110818435|gb|ABG93719.1| conserved hypothetical protein [Rhodococcus sp. RHA1]
Length = 353
Score = 289 bits (740), Expect = 2e-76, Method: Composition-based stats.
Identities = 153/356 (42%), Positives = 220/356 (61%), Gaps = 3/356 (0%)
Query: 40 MDRALDAIYNPTGKFMSGEVGAGAGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRC 99
MD AL A+Y+ T S GAG G S P+++RWLGDIR F +V+++Q DA+DR
Sbjct: 1 MDGALAALYD-TSSEGSKSRRRGAGLGGSAPKVARWLGDIRTYFPSSVVQVMQKDAIDRL 59
Query: 100 GLKQLIFEPEILEQVEPDINLASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIK 159
GL QL+ EPE+L+ VEPD++L T++ L +P+ SK + R ++K+V E+ + + +
Sbjct: 60 GLTQLLLEPELLDAVEPDVHLVGTLLSLNRVMPETSKATARMVVEKVVREVEERIAQKTR 119
Query: 160 RAVRAALNKRQHSPIPSASSLDFKTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKF 219
AV ALN+ P +D+ TI+ + +Y E K ++PE + R S
Sbjct: 120 TAVTGALNRSARITNPKYRDIDWNRTIRANLAHYLPEYKTVVPERLLGYGRRSQ--AVHR 177
Query: 220 TVILDIDQSGSMGESVIYSSVMACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGF 279
V+L IDQSGSM SV+Y+SV +LAS+ +LKT ++ FDT +VDLT+K DPVD+L+G
Sbjct: 178 DVVLAIDQSGSMASSVVYASVFGAVLASMRALKTSLIVFDTAVVDLTDKLSDPVDVLFGT 237
Query: 280 QLGGGTDINKSIKYCMKYIENPKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLL 339
QLGGGTDIN++I Y I+ P +++F LISDL EGG R MLR + MK GV VV LL
Sbjct: 238 QLGGGTDINRAIAYSQSLIDRPTESLFVLISDLYEGGIRAEMLRRMSAMKNVGVQVVVLL 297
Query: 340 AISGDGQPYYDAQMAGKIASMGIPCFACNPEKLPLLLERVLKNLDLSSFQQEFKKK 395
A+S DG P +D A + ++GIP FAC P++ P LL L+ D+ + + +
Sbjct: 298 ALSDDGAPSFDHDNAAALGALGIPAFACTPDRFPELLALALERGDIGRWADSLQTE 353
>gi|29349318|ref|NP_812821.1| hypothetical protein BT_3910 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29341226|gb|AAO79015.1| VWA containing CoxE family protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 376
Score = 287 bits (734), Expect = 1e-75, Method: Composition-based stats.
Identities = 158/382 (41%), Positives = 234/382 (61%), Gaps = 19/382 (4%)
Query: 4 KEDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGA 63
+E +KRWRLILG D E D + + + + E+ +D +L+A+Y+ +
Sbjct: 3 EELLKRWRLILGGD-EADGTGV------TLNLEEQRIDHSLEAVYDSDRR---------G 46
Query: 64 GKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLAST 123
G G S P++SRWLGDIR F + +V++IQ DA+ R L L+ E E+LE V PD++L +T
Sbjct: 47 GLGSSAPKVSRWLGDIREFFPQTVVQVIQRDAIKRLNLTSLLTEKEMLETVVPDVHLVAT 106
Query: 124 IMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFK 183
+M L IP+K+KE R ++K+VEE+ + L + ++AV ALN+ P + +D+K
Sbjct: 107 LMSLSRVIPEKNKEMARQVVRKVVEELLRKLSAPTQQAVTGALNRSSRRRNPRYNEIDWK 166
Query: 184 TTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMAC 243
TTI + +KNY + K IIPE + R + +IL +DQSGSMG SVIYS +
Sbjct: 167 TTITKNLKNYQPDYKTIIPEIRIGYGRKRK---AMKDIILCLDQSGSMGTSVIYSGIFGS 223
Query: 244 ILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKK 303
+LASI ++ TR+V FDT +VDLT+ DPVDLL+G QLGGGTDI +++ YC I P+
Sbjct: 224 VLASIPAVSTRMVVFDTAVVDLTDDLQDPVDLLFGVQLGGGTDIARALTYCQGVITRPQD 283
Query: 304 TIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIP 363
T+ L++DL EGG+ M + + +GV ++ L A++ DG P YD A +AS+G+P
Sbjct: 284 TVMVLVTDLYEGGDSREMRKKFVSLVNSGVQLIVLPALNDDGAPSYDKGHAEFLASIGVP 343
Query: 364 CFACNPEKLPLLLERVLKNLDL 385
FAC P+K P L+ L D+
Sbjct: 344 TFACTPDKFPDLMAAALSKQDI 365
>gi|152966010|ref|YP_001361794.1| VWA containing CoxE family protein [Kineococcus radiotolerans
SRS30216]
gi|151360527|gb|ABS03530.1| VWA containing CoxE family protein [Kineococcus radiotolerans
SRS30216]
Length = 421
Score = 286 bits (731), Expect = 2e-75, Method: Composition-based stats.
Identities = 138/328 (42%), Positives = 208/328 (63%), Gaps = 2/328 (0%)
Query: 63 AGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLAS 122
AG G S P+++RWLGDIR F +V+++Q DA++R L +L+ EPE+L V+PD+NL
Sbjct: 85 AGLGGSAPRVARWLGDIRTYFPSSVVQVMQRDAVERLDLTRLLLEPELLGAVQPDVNLVG 144
Query: 123 TIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDF 182
T++ L +P+++KE+ R + ++V +I + AV AL++ + P +D+
Sbjct: 145 TLLSLSRVLPERTKETARQVVAEVVRQIEARVADRTTSAVTGALDRAARTHRPRLPDVDW 204
Query: 183 KTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMA 242
TI+ + +Y E + ++PE R + V+L+IDQSGSM ESV+Y+SV
Sbjct: 205 NATIRANLTHYLPEHRTVVPERLVGHGRRQQVVAKE--VVLEIDQSGSMAESVVYASVFG 262
Query: 243 CILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPK 302
+LA + +LKT ++AFDTE+VDLT+ DPVD+L+G QLGGGTDIN++I Y + I P
Sbjct: 263 AVLAKMRTLKTTLIAFDTEVVDLTDSLTDPVDVLFGVQLGGGTDINRAIAYGQERITRPA 322
Query: 303 KTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGI 362
T+FFLISDL EGG R M++ + MK AGV V+ LLA+S G P +D + A +A++GI
Sbjct: 323 DTLFFLISDLFEGGVRDEMVKRMAAMKSAGVQVIVLLALSDSGAPAFDRENAAALAALGI 382
Query: 363 PCFACNPEKLPLLLERVLKNLDLSSFQQ 390
P FAC P+ P LL + D+ ++ Q
Sbjct: 383 PAFACTPDAFPDLLALAMTGGDVGAWAQ 410
>gi|21223183|ref|NP_628962.1| hypothetical protein SCO4805 [Streptomyces coelicolor A3(2)]
gi|8218206|emb|CAB92668.1| hypothetical protein SCD63A.16 [Streptomyces coelicolor A3(2)]
Length = 384
Score = 285 bits (728), Expect = 5e-75, Method: Composition-based stats.
Identities = 151/370 (40%), Positives = 228/370 (61%), Gaps = 10/370 (2%)
Query: 4 KEDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGA 63
+E ++RWRL+LG D + + D MD AL A+Y G +G A A
Sbjct: 7 RERLRRWRLVLGGDPADGTGHV-------LCGRDAAMDGALTALYGRGGAPRAGRDRA-A 58
Query: 64 GKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLAST 123
G G S P ++RWLGDIR F +V+++Q DA+DR GL L+ EPE+L+ VE D++L T
Sbjct: 59 GLGASAPAVARWLGDIRTYFPSPVVQVMQRDAIDRLGLATLLLEPEMLQAVEADVHLVGT 118
Query: 124 IMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFK 183
++ L E +P ++E+ RA ++K+VE++ K L + + + AL++ + P +D+
Sbjct: 119 LLSLNEAMPDTTRETARAVVRKVVEDLEKRLVTRTRATLSGALDRSARTARPRPHDIDWN 178
Query: 184 TTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMAC 243
TI +K+Y E + ++PE + RAS + + ++L +DQSGSM S++Y+SV
Sbjct: 179 RTIGANLKHYLPEYRTVVPERLVGYGRASR--SVRKDIVLCVDQSGSMAASLVYASVFGA 236
Query: 244 ILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKK 303
+LAS+ S+ TR+V FDT + DLT++ DDPVD+L+G +LGGGTDIN+++ YC I P
Sbjct: 237 VLASMRSIDTRLVVFDTAVADLTDQLDDPVDVLFGTRLGGGTDINRALAYCQSRITRPAD 296
Query: 304 TIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIP 363
T+ LISDL EGG R ML+ + M+ AGV V LLA+S +G P YD + A +A++G P
Sbjct: 297 TVVVLISDLYEGGIREEMLKRVAAMRVAGVRFVTLLALSDEGTPAYDREHAAALAALGAP 356
Query: 364 CFACNPEKLP 373
FAC P+ P
Sbjct: 357 AFACTPDLFP 366
>gi|119963870|ref|YP_947772.1| VWA domain containing CoxE-like protein family [Arthrobacter
aurescens TC1]
gi|119950729|gb|ABM09640.1| VWA domain containing CoxE-like protein family [Arthrobacter
aurescens TC1]
Length = 407
Score = 283 bits (725), Expect = 1e-74, Method: Composition-based stats.
Identities = 155/387 (40%), Positives = 239/387 (61%), Gaps = 10/387 (2%)
Query: 2 DIKEDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGA 61
D ++ + RWRL+LG + ++ + + S++D D+AL+ +Y K
Sbjct: 19 DNRDRLSRWRLVLGGHDADGITTAEDMPVQ-LSDDDVRRDQALEELYGDGSK-------Q 70
Query: 62 GAGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLA 121
G G S+P+++RWLGDIR F +V+++Q DAMDR GL+QL+ EPE+L V+PDI L
Sbjct: 71 RGGLGSSSPRVARWLGDIRGYFPSSVVQVMQADAMDRLGLRQLLLEPEMLRTVQPDIGLV 130
Query: 122 STIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLD 181
ST++ L IP+ S+E+ R+ I+++ +E+ + L + +AV ALN+ + P +D
Sbjct: 131 STLVGLGRVIPEASRETARSVIRQVTKELEERLRARTIQAVSGALNRSARTRRPRHRDID 190
Query: 182 FKTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVM 241
+ TI +K+Y E + ++PE + R S+ + +IL IDQSGSM ESV+YSSV
Sbjct: 191 WNRTIAANLKHYQPEYRTVVPERLHGHARRSSEIQRE--IILCIDQSGSMAESVVYSSVF 248
Query: 242 ACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENP 301
+L+S+ S+ T++V FDTE+VDLT+ DDPVD+L+G QLGGGTDIN+++ YC I P
Sbjct: 249 GAVLSSLRSVSTKLVVFDTEVVDLTDDLDDPVDVLFGVQLGGGTDINRALAYCQDQITKP 308
Query: 302 KKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMG 361
+TI LISDL EGG MLR + G T++ LLA+S G P +D+ A +A +G
Sbjct: 309 TETILVLISDLYEGGIAEEMLRRAASIVGGGTTMITLLALSDSGHPSFDSSHAAALAGIG 368
Query: 362 IPCFACNPEKLPLLLERVLKNLDLSSF 388
+P FAC P+ P ++ ++ D+S +
Sbjct: 369 VPAFACTPDLFPDMMAAAIERRDVSEW 395
>gi|146337943|ref|YP_001202991.1| conserved hypothetical protein; putative von Willebrand factor type
A (VWA) domain [Bradyrhizobium sp. ORS278]
gi|146190749|emb|CAL74754.1| conserved hypothetical protein; putative von Willebrand factor type
A (VWA) domain [Bradyrhizobium sp. ORS278]
Length = 397
Score = 279 bits (714), Expect = 2e-73, Method: Composition-based stats.
Identities = 149/384 (38%), Positives = 236/384 (61%), Gaps = 13/384 (3%)
Query: 5 EDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAG 64
E +RWRL+LG D + + S+ D +D AL +Y+ G G
Sbjct: 8 ERNRRWRLVLGGDDQ-----------AGLSDRDRRLDAALAGLYDAGSG-GKRGGGRRGG 55
Query: 65 KGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTI 124
G S P+++ WLGDIR F +V++IQ DA +R GLK+++ +PE L +E D++L + +
Sbjct: 56 LGGSAPRVASWLGDIREFFPAPVVQVIQKDAFERLGLKEMLLQPEFLAALEADVHLVADL 115
Query: 125 MLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKT 184
M L+ +P+K+K + R + K+V+E+ + L++ +R A+++R+ + P A +D+
Sbjct: 116 MALRSVMPEKTKVTAREVVAKVVKELMEKLDARTTETIRGAVDRRRRTRRPRAGDIDWPR 175
Query: 185 TIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACI 244
TI + ++++ E + I+PE RA+ T+ VIL +DQSGSMG SV+YSS+ A +
Sbjct: 176 TIGKNLRHWQAEHRTIVPETLVGHARAARQ-TNLEEVILCVDQSGSMGTSVVYSSIFAAV 234
Query: 245 LASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKT 304
LASI ++ R+V FDT I+DLTE+ DPV++L+ QLGGGTDIN+++ YC + + P +T
Sbjct: 235 LASIPAIAMRLVVFDTNIIDLTEELADPVEVLFSVQLGGGTDINQALAYCEQLVREPTRT 294
Query: 305 IFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPC 364
LISDL+EGG ML + + +GV ++ LLA++ DG+P YDA+ A +AS+G P
Sbjct: 295 HMVLISDLIEGGIAEQMLARAKALVSSGVNLIVLLALNDDGRPAYDARHAAILASLGCPV 354
Query: 365 FACNPEKLPLLLERVLKNLDLSSF 388
FAC P + P L+ LK D+ S+
Sbjct: 355 FACTPHQFPELMATALKRQDIWSW 378
>gi|117168622|gb|ABK32286.1| Jer6 [Polyangium cellulosum]
Length = 416
Score = 277 bits (708), Expect = 1e-72, Method: Composition-based stats.
Identities = 148/390 (37%), Positives = 242/390 (62%), Gaps = 20/390 (5%)
Query: 4 KEDIKRWRLILGKDTEEDFSSMD-------SEAISSFSEEDWLMDRALDAIYNPTGKFMS 56
++ + RWRL LG + E + + A+ + +D+AL IY+
Sbjct: 30 RDALLRWRLALGPEAERVDPRLSLGGLGGAAPALDVDARRLGDLDKALSFIYDERA---- 85
Query: 57 GEVGAGAGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEP 116
G G S P + WL +R F E+V ++Q DA++R GL QL+FEPE L +E
Sbjct: 86 ------GGLGGSRPYVPEWLSAVREFFSHEVVALVQKDAIERKGLTQLLFEPETLPFLEK 139
Query: 117 DINLASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPS 176
++ L +T+M K IP ++++ R ++++VEE+ + LE++++ AV AL + SP+
Sbjct: 140 NVELVATLMSAKGLIPDAARDTARQIVREVVEEVRRALEAEVRTAVLGALRRNTTSPLRV 199
Query: 177 ASSLDFKTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVI 236
+LD+K TI++ +K ++ E ++++P+ YF+ A+ ++ V + +DQSGSMGESV+
Sbjct: 200 LRNLDWKRTIRKNLKGWDAERRRLVPDKLYFW--ANQTRRHEWDVAILVDQSGSMGESVV 257
Query: 237 YSSVMACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCM- 295
YSS+MA I AS+ L+TR++ FDTE+VD+T DPVD+L+ QLGGGTDIN+++ Y
Sbjct: 258 YSSIMAAIFASLDVLRTRLLFFDTEVVDVTPMLVDPVDVLFTAQLGGGTDINRAVAYAQA 317
Query: 296 KYIENPKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAG 355
+IE P+KT+ LI+DL EGGN ++ + + ++ V +CLLA+S G+P YD +MA
Sbjct: 318 NFIERPEKTLLILITDLFEGGNAEELVARMRQLADSKVKSICLLALSDGGKPSYDHEMAQ 377
Query: 356 KIASMGIPCFACNPEKLPLLLERVLKNLDL 385
K+A++G PCF C P+ L ++ER+++ DL
Sbjct: 378 KLAALGTPCFGCTPKLLVKVVERLMRGQDL 407
>gi|29829998|ref|NP_824632.1| hypothetical protein SAV3455 [Streptomyces avermitilis MA-4680]
gi|29607108|dbj|BAC71167.1| hypothetical protein [Streptomyces avermitilis MA-4680]
Length = 349
Score = 277 bits (708), Expect = 1e-72, Method: Composition-based stats.
Identities = 149/334 (44%), Positives = 212/334 (63%), Gaps = 3/334 (0%)
Query: 40 MDRALDAIYNPTGKFMSGEVGAGAGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRC 99
MD AL A+Y K +G AG G S P ++RWLGDIR F +V+++Q DA+DR
Sbjct: 1 MDGALTALYGKGDKPQTGR-DRSAGLGASAPSVARWLGDIRTYFPSSVVQVMQRDAIDRL 59
Query: 100 GLKQLIFEPEILEQVEPDINLASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIK 159
GL L+ EPE+LE VE D++L T++ L + +P +KE+ RA ++K+VE++ K L + +
Sbjct: 60 GLSTLLLEPEMLEAVEADVHLVGTLLSLNKAMPDTTKETARAVVRKVVEDLEKRLATRTR 119
Query: 160 RAVRAALNKRQHSPIPSASSLDFKTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKF 219
+ AL++ P +D+ TI +K+Y E + I+PE + RAS + K
Sbjct: 120 ATLTGALDRSARITRPRHHDIDWNRTIAANLKHYLPEYRTIVPERLIGYGRASQ--SVKK 177
Query: 220 TVILDIDQSGSMGESVIYSSVMACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGF 279
V+L IDQSGSM SV+Y+SV +LAS+ S+ TR+V FDT +VDLT++ DDPVD+L+G
Sbjct: 178 EVVLCIDQSGSMAASVVYASVFGAVLASMRSIATRLVVFDTAVVDLTDQLDDPVDVLFGT 237
Query: 280 QLGGGTDINKSIKYCMKYIENPKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLL 339
QLGGGTDIN+++ YC I P T+ LISDL EGG R ML+ + MK +GV V LL
Sbjct: 238 QLGGGTDINRALAYCQSQITRPADTVVVLISDLYEGGIRDEMLKRVAAMKASGVQFVTLL 297
Query: 340 AISGDGQPYYDAQMAGKIASMGIPCFACNPEKLP 373
A+S +G P YD + A +A++G P FAC P+ P
Sbjct: 298 ALSDEGAPAYDREHAAALAALGAPAFACTPDLFP 331
>gi|117168589|gb|ABK32254.1| Amb6 [Polyangium cellulosum]
Length = 477
Score = 276 bits (707), Expect = 1e-72, Method: Composition-based stats.
Identities = 150/390 (38%), Positives = 242/390 (62%), Gaps = 20/390 (5%)
Query: 4 KEDIKRWRLILGKDTEE-----DFSSMDSEAISSFSEEDWL--MDRALDAIYNPTGKFMS 56
++ + RWRL LG + E + A + + L +D+AL IY+ +
Sbjct: 91 RDALLRWRLALGPEAERVDPRLSLGGLGGAAPALDVDPRRLGDLDKALSFIYDERAGNLG 150
Query: 57 GEVGAGAGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEP 116
G S P + WL +R F E+V ++Q DA++R GL QL+FEPE L +E
Sbjct: 151 G----------SRPYVPEWLSAVREFFSHEVVALVQKDAIERKGLTQLLFEPETLPFLEK 200
Query: 117 DINLASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPS 176
++ L +T+M K IP ++E+ R ++++VEE+ + LES+++ AV AL + SP+
Sbjct: 201 NVELVATLMSAKGLIPDAARETARQIVREVVEEVRRALESEVRTAVLGALRRNTTSPLRV 260
Query: 177 ASSLDFKTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVI 236
+LD+K TI++ +K ++ E ++++P+ YF+ A+ ++ V + +DQSGSMGESV+
Sbjct: 261 LRNLDWKRTIRKNLKGWDAERRRLVPDKLYFW--ANQTRRHEWDVAILVDQSGSMGESVV 318
Query: 237 YSSVMACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCM- 295
YSS+MA I AS+ L+TR++ FDTE+VD+T DPVD+L+ QLGGGTDIN+++ Y
Sbjct: 319 YSSIMAAIFASLDVLRTRLLFFDTEVVDVTPMLVDPVDVLFTAQLGGGTDINRAVAYAQA 378
Query: 296 KYIENPKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAG 355
+IE P+KT+ LI+DL EGGN ++ + + ++ V +CLLA+S G+P YD +MA
Sbjct: 379 NFIERPEKTLLILITDLFEGGNAEELVARMRQLADSKVKSICLLALSDGGKPSYDHEMAQ 438
Query: 356 KIASMGIPCFACNPEKLPLLLERVLKNLDL 385
K+A++G PCF C P+ L ++ER+++ DL
Sbjct: 439 KLAALGTPCFGCTPKLLVKVVERLMRGQDL 468
>gi|68235386|ref|ZP_00574391.1| VWA containing CoxE-like [Frankia sp. EAN1pec]
gi|68196996|gb|EAN11363.1| VWA containing CoxE-like [Frankia sp. EAN1pec]
Length = 433
Score = 274 bits (701), Expect = 7e-72, Method: Composition-based stats.
Identities = 149/384 (38%), Positives = 231/384 (60%), Gaps = 7/384 (1%)
Query: 2 DIKEDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGA 61
D E ++RWR++LG +D S S D +D AL A+Y+ +
Sbjct: 44 DDTERLRRWRMVLGAPAA---PVLDGRI--SLSGPDGEIDAALGALYDADAAGEGRRRRS 98
Query: 62 GAGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLA 121
G S P ++RWLGDIR F +V+++Q DA+ R GL QL+ EPE+L EPD++L
Sbjct: 99 GGLGA-SAPSVARWLGDIRTYFPTSVVRVLQRDAVARLGLGQLLLEPELLAAAEPDVHLV 157
Query: 122 STIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLD 181
T++ L+ +P++++E+ RA ++++VE+I + LE ++ AV A+++ + P +D
Sbjct: 158 GTLLSLRSALPERTRETARAVVRRVVEDIERRLEQSLRSAVLGAVDRTSRTRTPRLPDVD 217
Query: 182 FKTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVM 241
+ TI ++NY E +I + RA + VIL IDQSGSM SV+Y+ V+
Sbjct: 218 WDRTILANLRNYQPEQHTVIVDRLIGHTRARRTAALR-DVILLIDQSGSMASSVVYAGVL 276
Query: 242 ACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENP 301
LA++ ++ TR++ FDT +VDLT++ DDPVDLL+G +LGGGTDI++++ Y + P
Sbjct: 277 GASLATLRAVSTRLIVFDTSVVDLTDQLDDPVDLLFGVRLGGGTDIDRAVGYGASLVTRP 336
Query: 302 KKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMG 361
T+F LISDL+EGG++ ++ L + EAGV VV LLA+S DG P YD ++A A +G
Sbjct: 337 TDTVFVLISDLIEGGDQSSLVARLRALVEAGVCVVVLLALSDDGTPAYDHRLAATCAELG 396
Query: 362 IPCFACNPEKLPLLLERVLKNLDL 385
P FAC P++ P LL L+ D+
Sbjct: 397 APAFACTPDRFPELLATALRRDDV 420
>gi|115380185|ref|ZP_01467212.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1]
gi|115362801|gb|EAU62009.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1]
Length = 302
Score = 259 bits (662), Expect = 2e-67, Method: Composition-based stats.
Identities = 126/300 (42%), Positives = 197/300 (65%), Gaps = 5/300 (1%)
Query: 96 MDRCGLKQLIFEPEILEQVEPDINLASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLE 155
M R GL ++ +PE+L VEPD++L +T++ L++ IPQK+KE+ R ++K+VE++ + L
Sbjct: 1 MTRLGLTDMLLQPELLAAVEPDVSLVATLLSLRKVIPQKTKETARQVVRKVVEDLERRLR 60
Query: 156 SDIKRAVRAALNKRQHSPIPSASSLDFKTTIQRGIKNYNKELKKIIPEHYYFFERASTNP 215
+ +RAVR AL++ + P A+ +D+ T++ + NY E + ++ E R +
Sbjct: 61 APTERAVRGALSRSSRTRKPRAAEIDWNRTLRANLGNYLPERQSVVVEKLVGHGRKRS-- 118
Query: 216 TSKFTVILDIDQSGSMGESVIYSSVMACILASIASLKTRVVAFDTEIVDLTEKSDDPVDL 275
S V+L IDQSGSM SV+YSS+ +LAS+ ++ TR+V FDT +VDL+E+ DPVDL
Sbjct: 119 -SLRDVVLCIDQSGSMAASVVYSSIFGAVLASLRAVSTRMVLFDTSVVDLSEQLSDPVDL 177
Query: 276 LYGFQLGGGTDINKSIKYCMKYIENPKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTV 335
L+G QLGGGTDI++++ YC + I P +TI LI+DL EGGN ML+ + ++GVTV
Sbjct: 178 LFGTQLGGGTDIDQALAYCQQLITRPAQTILVLITDLYEGGNAPRMLQRAASLVQSGVTV 237
Query: 336 VCLLAISGDGQPYYDAQMAGKIASMGIPCFACNPEKLPLLLERVLKNLDLSSF--QQEFK 393
VCLLA+S G P +D A + A++GIP F+C P+ P L+ ++ DL ++ +QE +
Sbjct: 238 VCLLALSDQGAPSHDGHHAAQFAALGIPTFSCTPDLFPELMAAAIQRQDLRAWAARQELQ 297
>gi|150017599|ref|YP_001309853.1| VWA containing CoxE family protein [Clostridium beijerinckii NCIMB
8052]
gi|149904064|gb|ABR34897.1| VWA containing CoxE family protein [Clostridium beijerinckii NCIMB
8052]
Length = 376
Score = 219 bits (559), Expect = 2e-55, Method: Composition-based stats.
Identities = 128/382 (33%), Positives = 219/382 (57%), Gaps = 22/382 (5%)
Query: 7 IKRWRLILGKDTEEDFS-SMDSEAISSFSEE-DWLMDRALDA-----IYNPTGKFMSGEV 59
+ RWRL+LGK E+ S DS S S+ ++L DR D + +P G
Sbjct: 8 LNRWRLVLGKFAEDRIGFSEDSSNYSELSDLLEFLYDRDYDEERGIRVDDPRG------- 60
Query: 60 GAGAGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDIN 119
G+G S + W+ +R+LF KE V+I++ A+++ +K+L+ + ++LE +EP+
Sbjct: 61 ----GRGSSKFTVPSWITKVRSLFPKETVEILEKHALEKYNMKELLTDKKVLEAMEPNAE 116
Query: 120 LASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASS 179
L I+ +K + + ++ R +KK+V+EI K LE+++K + +++ + S + SA +
Sbjct: 117 LLKNILQMKHLMKGEVLDTARKIVKKVVDEITKSLENEVKLTIMGKVDRNKRSAVKSARN 176
Query: 180 LDFKTTIQRGIKNYNKELKKIIPEHYYFFERAST-NPTSKFTVILDIDQSGSMGESVIYS 238
+DFK TI+ +KNY+KE ++II + YF+ R NP + +++ +D+SGSM SVI+S
Sbjct: 177 IDFKRTIRANLKNYDKEEERIIVDKVYFYGRVRKYNP---WNIVVAVDESGSMLSSVIHS 233
Query: 239 SVMACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYI 298
++MA I + + LKT + FDTEIVDLT DD V L QLGGGT+I K++ Y K +
Sbjct: 234 AIMAGIFSKLPMLKTSLFIFDTEIVDLTSYVDDAVQTLMSVQLGGGTNIGKALSYAEKLV 293
Query: 299 ENPKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIA 358
E+P +T+ +++DL +G N M + + E G ++ L A+ +YD + A K+
Sbjct: 294 ESPLRTMVVMVTDLYDGYNYNIMYARAKAIIETGAKLIILTALDDKANGFYDKKAAAKMT 353
Query: 359 SMGIPCFACNPEKLPLLLERVL 380
++G A P L + +++
Sbjct: 354 ALGADVAAMMPGGLAKWIAKII 375
>gi|24213397|ref|NP_710878.1| hypothetical protein LA0697 [Leptospira interrogans serovar Lai
str. 56601]
gi|24194155|gb|AAN47896.1|AE011256_3 conserved hypothetical protein [Leptospira interrogans serovar Lai
str. 56601]
Length = 374
Score = 208 bits (529), Expect = 5e-52, Method: Composition-based stats.
Identities = 119/365 (32%), Positives = 203/365 (55%), Gaps = 10/365 (2%)
Query: 9 RWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYN-PTGKFMSGEVGAGAGKGP 67
RW+LILG +E+ F + +FSEE M+ A++ +Y+ G+ + G G
Sbjct: 10 RWKLILGNGSEQSFGN------ETFSEEQQRMNIAMEYLYDREYGEDRNIRTG---GLSE 60
Query: 68 SNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIMLL 127
SN + W+ +I LF K+ ++ I+ DA++R + +++ PE+L++ P+ L ++
Sbjct: 61 SNLTVPLWINEIHELFPKKTIERIEKDALERYQIMEMVTNPELLKRASPNTTLLKAVLHT 120
Query: 128 KEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTTIQ 187
+ + + R ++K+++E+ K LE+ I + + N+ S + D K TI+
Sbjct: 121 QHLMNPQVLSLARELVRKVIDELMKKLETTILTSFQGIKNRNLRSSFKIYKNFDIKNTIR 180
Query: 188 RGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACILAS 247
+K+Y+ + +K++ + F R + ++ +I+ +DQSGSM +SVI+S+V A I
Sbjct: 181 SNLKHYDLKSQKLVLQKPLFHSRTHRSMAERWHLIILVDQSGSMLDSVIHSAVTASIFWG 240
Query: 248 IASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTIFF 307
I S+KT ++ FDTEIVD+T+ DPV+ L QLGGGTDI ++ Y +ENP++TI
Sbjct: 241 IKSIKTSLILFDTEIVDVTDHCSDPVETLMKVQLGGGTDIGSALLYAEGKVENPRRTIII 300
Query: 308 LISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCFAC 367
LISD EG ++ N + E+GV V+ L A+ P YD +MA K+ +G A
Sbjct: 301 LISDFCEGAPPFKLISNTHHLVESGVKVLGLAALDETANPSYDKEMAEKLVKVGAEIAAM 360
Query: 368 NPEKL 372
P +L
Sbjct: 361 TPGEL 365
>gi|45658734|ref|YP_002820.1| hypothetical protein LIC12904 [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|45601978|gb|AAS71457.1| conserved hypothetical protein [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
Length = 374
Score = 207 bits (528), Expect = 8e-52, Method: Composition-based stats.
Identities = 118/365 (32%), Positives = 203/365 (55%), Gaps = 10/365 (2%)
Query: 9 RWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYN-PTGKFMSGEVGAGAGKGP 67
RW+LILG +E+ F + +FSEE M+ A++ +Y+ G+ + G G
Sbjct: 10 RWKLILGNGSEQSFGN------ETFSEEQQRMNIAMEYLYDREYGEDRNIRTG---GLSE 60
Query: 68 SNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIMLL 127
SN + W+ +I LF K+ ++ I+ DA++R + +++ PE+L++ P+ L ++
Sbjct: 61 SNLTVPLWINEIHELFPKKTIERIEKDALERYQIMEMVTNPELLKRASPNTTLLKAVLHT 120
Query: 128 KEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTTIQ 187
+ + + R ++K+++E+ K LE+ I + + N+ S + D K TI+
Sbjct: 121 QHLMNPQVLSLARELVRKVIDELMKKLETTILTSFQGIKNRNLRSSFKIYKNFDIKNTIR 180
Query: 188 RGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACILAS 247
+K+Y+ + +K++ + F R + ++ +I+ +DQSGSM +SVI+S+V A I
Sbjct: 181 SNLKHYDLKSQKLVLQKPLFHSRTHRSMAERWHLIILVDQSGSMLDSVIHSAVTASIFWG 240
Query: 248 IASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTIFF 307
I S+KT ++ FDTE+VD+T+ DPV+ L QLGGGTDI ++ Y +ENP++TI
Sbjct: 241 IKSIKTSLILFDTEVVDVTDHCSDPVETLMKVQLGGGTDIGSALLYAEGKVENPRRTIII 300
Query: 308 LISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCFAC 367
LISD EG ++ N + E+GV V+ L A+ P YD +MA K+ +G A
Sbjct: 301 LISDFCEGAPPFKLISNTHHLVESGVKVLGLAALDETANPSYDKEMAEKLVKVGAEIAAM 360
Query: 368 NPEKL 372
P +L
Sbjct: 361 TPGEL 365
>gi|26248499|ref|NP_754539.1| Hypothetical protein yehP [Escherichia coli CFT073]
gi|26108904|gb|AAN81107.1|AE016763_66 Hypothetical protein yehP [Escherichia coli CFT073]
Length = 399
Score = 207 bits (526), Expect = 1e-51, Method: Composition-based stats.
Identities = 116/377 (30%), Positives = 209/377 (55%), Gaps = 10/377 (2%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE G G
Sbjct: 33 ELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGR------DPE-RLQRGERSGGLGG 85
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W+ I LF +++++ +++DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 86 --SNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 143
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R ++++VEEI L ++++A ++R+ S IP A + DFK+T
Sbjct: 144 HTKHLMNPEVLAAARQIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKST 203
Query: 186 IQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACIL 245
++ +++++ + K+ E F R + ++ ++L +DQSGSM +SVI+S+VMA L
Sbjct: 204 LRANLQHWHPQHGKLYIESPRFNSRIKRQ-SEQWQLVLLVDQSGSMVDSVIHSAVMAACL 262
Query: 246 ASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTI 305
+ ++T +VAFDT +VDLT DPV+LL QLGGGT+I +++Y + IE P K++
Sbjct: 263 WQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSV 322
Query: 306 FFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCF 365
L+SD EGG+ + ++ ++G+ V+ L A+ P YD MA + ++G
Sbjct: 323 IILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTTTPCYDRDMAQALVNVGAQIA 382
Query: 366 ACNPEKLPLLLERVLKN 382
A P +L L L++
Sbjct: 383 AMTPGELASWLAENLQS 399
>gi|75196166|ref|ZP_00706236.1| hypothetical protein EcolH_01001974 [Escherichia coli HS]
gi|157067283|gb|ABV06538.1| von Willebrand factor type A domain protein [Escherichia coli HS]
Length = 378
Score = 205 bits (521), Expect = 5e-51, Method: Composition-based stats.
Identities = 115/377 (30%), Positives = 208/377 (55%), Gaps = 10/377 (2%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE G G
Sbjct: 12 ELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGR------DPE-RLQRGERSGGLGG 64
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W+ I LF +++++ +++DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 65 --SNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 122
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R ++++VEEI L ++++A ++R+ S IP A + DFK+T
Sbjct: 123 HTKHLMNPEVLATARRIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSSIPLARNFDFKST 182
Query: 186 IQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACIL 245
++ +++++ + K+ E F R + ++ ++L +DQSGSM +SVI+S+VMA L
Sbjct: 183 LRANLQHWHPQHGKLYIESPRFNSRIKRQ-SEQWQLVLLVDQSGSMVDSVIHSAVMAACL 241
Query: 246 ASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTI 305
+ ++T +VAFDT +VDLT DPV+LL QLGGGT+I +++Y + IE P K++
Sbjct: 242 WQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSV 301
Query: 306 FFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCF 365
L+SD EGG+ + ++ ++G+ V+ L A+ P YD A + ++G
Sbjct: 302 IILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDHDTAQALVNVGAQIA 361
Query: 366 ACNPEKLPLLLERVLKN 382
A P +L L L++
Sbjct: 362 AMTPGELASWLAENLQS 378
>gi|15802605|ref|NP_288632.1| hypothetical protein Z3294 [Escherichia coli O157:H7 EDL933]
gi|15832184|ref|NP_310957.1| hypothetical protein ECs2930 [Escherichia coli O157:H7 str. Sakai]
gi|12516345|gb|AAG57187.1|AE005439_6 orf, hypothetical protein [Escherichia coli O157:H7 EDL933]
gi|13362399|dbj|BAB36353.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
Length = 378
Score = 204 bits (520), Expect = 6e-51, Method: Composition-based stats.
Identities = 115/377 (30%), Positives = 208/377 (55%), Gaps = 10/377 (2%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE G G
Sbjct: 12 ELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGR------DPE-RLQRGERSGGLGG 64
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W+ I LF +++++ +++DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 65 --SNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 122
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R ++++VEEI L ++++A ++R+ S IP A DFK+T
Sbjct: 123 HTKHLMNPEVLAAARRIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSSIPLARDFDFKST 182
Query: 186 IQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACIL 245
++ +++++ + K+ E F R + + ++ ++L +D SGSM +SVI+S+VMA L
Sbjct: 183 LRANLQHWHPQHGKLYIESPRFNSRIKRH-SEQWQLVLLVDLSGSMVDSVIHSAVMAACL 241
Query: 246 ASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTI 305
+ ++T +VAFDT +VDLT DPV+LL QLGGGT+I +++Y + IE P K++
Sbjct: 242 WQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSV 301
Query: 306 FFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCF 365
L+SD EGG+ + ++ ++G+ V+ L A+ P YD MA + ++G
Sbjct: 302 IILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDMAQALVNVGAQIA 361
Query: 366 ACNPEKLPLLLERVLKN 382
A P +L L L++
Sbjct: 362 AMTPGELATWLAENLQS 378
>gi|78062407|ref|YP_372315.1| VWA containing CoxE-like protein [Burkholderia sp. 383]
gi|77970292|gb|ABB11671.1| VWA containing CoxE-like protein [Burkholderia sp. 383]
Length = 381
Score = 204 bits (520), Expect = 6e-51, Method: Composition-based stats.
Identities = 117/372 (31%), Positives = 209/372 (56%), Gaps = 10/372 (2%)
Query: 1 MDIKEDIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVG 60
++I ++RWRL+LG+ E + ++A ++ + +WL R D + GE
Sbjct: 10 LNIPGPLERWRLLLGEPAEAACGTPGADAQAADAALEWLYGRDDD-------RAKRGE-- 60
Query: 61 AGAGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINL 120
GAG GPS W+ I LF KE++ ++ DA++R G+ +++ E+LE++EP +L
Sbjct: 61 RGAGLGPSALSTPDWINTIHTLFPKEVIDRLERDAVERFGIDEVVTNLEVLERIEPSESL 120
Query: 121 ASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSL 180
++ K + + + R + ++V I + L +++++A ++R+ S + A +
Sbjct: 121 LRAVLHTKHLMNPEVLAAARRLVAEVVRRIMERLATEVRQAFSGTRDRRRRSRMKIARNF 180
Query: 181 DFKTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSV 240
D+ T+ +++++ E +K+ + F R + + ++L +DQSGSM SVI+S+V
Sbjct: 181 DYTRTLAANLRHWHPERRKLYLDTPVFNSR-TRRQAEPWDIVLLVDQSGSMVNSVIHSAV 239
Query: 241 MACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIEN 300
MA L + ++TR+VAFDT +VDLT DPV+LL QLGGGTDI K++ Y + N
Sbjct: 240 MAACLWQLPGMRTRLVAFDTSVVDLTADVSDPVELLMKVQLGGGTDIAKAVAYAQSCVAN 299
Query: 301 PKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASM 360
P +T+ L+SD EGG+ ++R ++ + E+G V+ L A+ +P YD +MA ++ +
Sbjct: 300 PARTVVVLVSDFYEGGSGYELVRRVKALAESGARVLGLAALDSAAEPAYDREMAARLVNA 359
Query: 361 GIPCFACNPEKL 372
G A P +L
Sbjct: 360 GAQIGAMTPGQL 371
>gi|75209876|ref|ZP_00710068.1| hypothetical protein EcolB_01002943 [Escherichia coli B171]
gi|75259282|ref|ZP_00730630.1| hypothetical protein EcolE2_01001173 [Escherichia coli E22]
Length = 378
Score = 204 bits (518), Expect = 1e-50, Method: Composition-based stats.
Identities = 115/377 (30%), Positives = 208/377 (55%), Gaps = 10/377 (2%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE G G
Sbjct: 12 ELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGR------DPE-RLQRGERSGGLGG 64
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W+ I LF +++++ +++DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 65 --SNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 122
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R ++++VEEI L ++++A ++R+ S IP A + DFK+T
Sbjct: 123 HTKHLMNPEVLAAARQIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSLIPLARNFDFKST 182
Query: 186 IQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACIL 245
++ +++++ + K+ E F R + ++ ++L +DQSGSM +SVI+S+VMA L
Sbjct: 183 LRANLQHWHPQHGKLYIESPRFNSRIKRQ-SEQWQLVLLVDQSGSMVDSVIHSAVMAACL 241
Query: 246 ASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTI 305
+ ++T +VAFDT +VDLT DPV+LL QLGGGT+I +++Y + IE P K++
Sbjct: 242 WQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSV 301
Query: 306 FFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCF 365
L+SD EGG+ + ++ ++G+ V+ L A+ P YD A + ++G
Sbjct: 302 IILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDTAQALVNVGAQIA 361
Query: 366 ACNPEKLPLLLERVLKN 382
A P +L L L++
Sbjct: 362 AMTPGELASWLAENLQS 378
>gi|75236308|ref|ZP_00720417.1| hypothetical protein EcolE1_01002318 [Escherichia coli E110019]
Length = 378
Score = 204 bits (518), Expect = 1e-50, Method: Composition-based stats.
Identities = 115/377 (30%), Positives = 208/377 (55%), Gaps = 10/377 (2%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE G G
Sbjct: 12 ELQRWRLILGEAAETALCGLDDNARQIDHALEWLYGR------DPE-RLQRGERSGGLGG 64
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W+ I LF +++++ +++DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 65 --SNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 122
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R ++++VEEI L ++++A ++R+ S IP A + DFK+T
Sbjct: 123 HTKHLMNPEVLAAARQIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKST 182
Query: 186 IQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACIL 245
++ +++++ + K+ E F R + ++ ++L +DQSGSM +SVI+S+VMA L
Sbjct: 183 LRANLQHWHPQHGKLYIESPRFNSRIKRQ-SEQWQLVLLVDQSGSMVDSVIHSAVMAACL 241
Query: 246 ASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTI 305
+ ++T +VAFDT +VDLT DPV+LL QLGGGT+I +++Y + IE P K++
Sbjct: 242 WQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSV 301
Query: 306 FFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCF 365
L+SD EGG+ + ++ ++G+ V+ L A+ P YD A + ++G
Sbjct: 302 IILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDHDTAQALVNVGAQIA 361
Query: 366 ACNPEKLPLLLERVLKN 382
A P +L L L++
Sbjct: 362 AMTPGELASWLAENLQS 378
>gi|75242937|ref|ZP_00726643.1| COG2304: Uncharacterized protein containing a von Willebrand factor
type A (vWA) domain [Escherichia coli F11]
gi|110642328|ref|YP_670058.1| hypothetical protein YehP [Escherichia coli 536]
gi|110343920|gb|ABG70157.1| hypothetical protein YehP [Escherichia coli 536]
Length = 378
Score = 203 bits (516), Expect = 2e-50, Method: Composition-based stats.
Identities = 115/377 (30%), Positives = 207/377 (54%), Gaps = 10/377 (2%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE G G
Sbjct: 12 ELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGR------DPE-RLQRGERSGGLGD 64
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W+ I LF + +++ +++DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 65 --SNLTTPEWINSIHTLFPQRVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 122
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R + ++VEEI L ++++A ++R+ S IP A + DFK+T
Sbjct: 123 HTKHLMNPEVLAAARRIVHQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKST 182
Query: 186 IQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACIL 245
++ +++++ + K+ E F R + + ++ ++L +DQSGSM +SVI+S+VMA L
Sbjct: 183 LRANLQHWHPQHGKLYIESPRFNSRIKRH-SEQWQLVLLVDQSGSMVDSVIHSAVMAACL 241
Query: 246 ASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTI 305
+ ++T +VAFDT +VDLT DPV+LL QLGGGT+I +++Y + IE P K++
Sbjct: 242 WQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSV 301
Query: 306 FFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCF 365
L+SD EGG+ + ++ ++G+ V+ L A+ P YD A + ++G
Sbjct: 302 IILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDHDTAQALVNVGAQIA 361
Query: 366 ACNPEKLPLLLERVLKN 382
A P +L L L++
Sbjct: 362 AMTPGELASWLAENLQS 378
>gi|30063554|ref|NP_837725.1| hypothetical protein S2315 [Shigella flexneri 2a str. 2457T]
gi|56480041|ref|NP_708009.2| hypothetical protein SF2190 [Shigella flexneri 2a str. 301]
gi|30041807|gb|AAP17534.1| hypothetical protein S2315 [Shigella flexneri 2a str. 2457T]
gi|56383592|gb|AAN43716.2| orf, conserved hypothetical protein [Shigella flexneri 2a str. 301]
Length = 378
Score = 203 bits (516), Expect = 2e-50, Method: Composition-based stats.
Identities = 115/377 (30%), Positives = 209/377 (55%), Gaps = 10/377 (2%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE +G G
Sbjct: 12 ELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGR------DPE-RLQRGERSSGLGG 64
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W+ I LF +++++ +++DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 65 --SNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 122
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R ++++VEEI L ++++A ++R+ S IP A + DFK+T
Sbjct: 123 HTKHLMNPEVLAAARRIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKST 182
Query: 186 IQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACIL 245
++ +++++ + K+ E F R + ++ ++L +DQSGSM +SVI+S+VMA L
Sbjct: 183 LRANLQHWHPQHGKLYIEFPRFNSRIKRQ-SEQWQLVLLVDQSGSMVDSVIHSAVMAACL 241
Query: 246 ASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTI 305
+ ++T +VAFDT +VDLT DPV+LL QLGGGT+I +++Y + IE P K++
Sbjct: 242 WQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAMEYGRQLIEQPAKSV 301
Query: 306 FFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCF 365
L+SD EGG+ + ++ ++G+ V+ L A+ P YD A + ++G
Sbjct: 302 IILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDTAQALVNVGAQIA 361
Query: 366 ACNPEKLPLLLERVLKN 382
A P +L L L++
Sbjct: 362 AMTPGELASWLAENLQS 378
>gi|83646059|ref|YP_434494.1| VWA_CoxE family protein [Hahella chejuensis KCTC 2396]
gi|83634102|gb|ABC30069.1| VWA_CoxE family protein [Hahella chejuensis KCTC 2396]
Length = 371
Score = 202 bits (515), Expect = 3e-50, Method: Composition-based stats.
Identities = 119/374 (31%), Positives = 207/374 (55%), Gaps = 11/374 (2%)
Query: 9 RWRLILGKDTEEDFSSMDSEAIS-SFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGKGP 67
RWRLILG+ DSE ++ S + + D+ L+ +Y + + + G
Sbjct: 8 RWRLILGE--------TDSEHLNPSMTPQQRQQDQLLEYLYGQEYRRDNRNIRGGT-LDE 58
Query: 68 SNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIMLL 127
S+ I W+ I LF KE ++ ++ DA++R +++++ PE+L++ +P + L I+
Sbjct: 59 SSLTIPEWINGIHELFPKETIERLEKDALERYQIQEMVTNPELLKRAQPSLTLLKAILHT 118
Query: 128 KEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTTIQ 187
K + Q+ RA K+++ E+ + L ++ +++ + S + A + D + TI+
Sbjct: 119 KHLMNQEVLALARAMAKRVISELLEKLARPMRHPFLGRIHRLKRSHLKIAKNFDARETIR 178
Query: 188 RGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACILAS 247
R +K+Y++E +++ E YF R + K+ +I+ +DQSGSM +SVIYS+V A I
Sbjct: 179 RNLKHYDRERGRLVIETPYFHSRIRRQ-SDKWRLIILVDQSGSMMDSVIYSAVTASIFWG 237
Query: 248 IASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTIFF 307
I +L T +V FDT IVDLT+ DPV+ L Q+GGGTDI +++Y + P KT+
Sbjct: 238 IQALDTHLVVFDTNIVDLTDHCQDPVETLMKVQMGGGTDIGHAMQYGASLVSQPTKTLLV 297
Query: 308 LISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCFAC 367
LISD EGG+ +L +D+ E+GV V+ L A+ P YD ++A ++A++G
Sbjct: 298 LISDFCEGGDPRRLLSVTQDLVESGVQVLGLAALDERANPVYDQRIAQQMANIGAKVGCM 357
Query: 368 NPEKLPLLLERVLK 381
P +L + V++
Sbjct: 358 TPGELANWVSEVIE 371
>gi|91211406|ref|YP_541392.1| hypothetical protein UTI89_C2393 [Escherichia coli UTI89]
gi|91072980|gb|ABE07861.1| conserved hypothetical protein [Escherichia coli UTI89]
Length = 399
Score = 202 bits (514), Expect = 3e-50, Method: Composition-based stats.
Identities = 114/377 (30%), Positives = 209/377 (55%), Gaps = 10/377 (2%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE G G+
Sbjct: 33 ELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGR------DPE-RLQRGERYGGLGR 85
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W+ I LF +++++ +++DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 86 --SNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 143
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R + ++VEEI L ++++A ++R+ S I A + DFK+T
Sbjct: 144 HTKHLMNPEVLAAARRIVHQVVEEIMARLAKEVRQAFSGVRDRRRRSFISLARNFDFKST 203
Query: 186 IQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACIL 245
++ +++++ + K+ E F R + + ++ ++L +DQSGSM +SVI+S+VMA L
Sbjct: 204 LRANLQHWHPQHGKLYIESPRFNSRIKRH-SEQWQLVLLVDQSGSMVDSVIHSAVMAACL 262
Query: 246 ASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTI 305
+ ++T +VAFDT +VDLT DPV+LL QLGGGT+I +++Y + IE P K++
Sbjct: 263 WQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSV 322
Query: 306 FFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCF 365
L+SD EGG+ + ++ ++G+ V+ L A+ + P YD A + ++G
Sbjct: 323 IILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSNATPCYDHDTAQALVNVGAQIA 382
Query: 366 ACNPEKLPLLLERVLKN 382
A P +L L L++
Sbjct: 383 AMTPGELASWLAENLQS 399
>gi|117624324|ref|YP_853237.1| hypothetical protein APECO1_4428 [Escherichia coli APEC O1]
gi|115513448|gb|ABJ01523.1| conserved hypothetical protein [Escherichia coli APEC O1]
Length = 448
Score = 202 bits (514), Expect = 3e-50, Method: Composition-based stats.
Identities = 114/377 (30%), Positives = 209/377 (55%), Gaps = 10/377 (2%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE G G+
Sbjct: 82 ELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGR------DPE-RLQRGERYGGLGR 134
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W+ I LF +++++ +++DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 135 --SNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 192
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R + ++VEEI L ++++A ++R+ S I A + DFK+T
Sbjct: 193 HTKHLMNPEVLAAARRIVHQVVEEIMARLAKEVRQAFSGVRDRRRRSFISLARNFDFKST 252
Query: 186 IQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACIL 245
++ +++++ + K+ E F R + + ++ ++L +DQSGSM +SVI+S+VMA L
Sbjct: 253 LRANLQHWHPQHGKLYIESPRFNSRIKRH-SEQWQLVLLVDQSGSMVDSVIHSAVMAACL 311
Query: 246 ASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTI 305
+ ++T +VAFDT +VDLT DPV+LL QLGGGT+I +++Y + IE P K++
Sbjct: 312 WQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSV 371
Query: 306 FFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCF 365
L+SD EGG+ + ++ ++G+ V+ L A+ + P YD A + ++G
Sbjct: 372 IILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSNATPCYDHDTAQALVNVGAQIA 431
Query: 366 ACNPEKLPLLLERVLKN 382
A P +L L L++
Sbjct: 432 AMTPGELASWLAENLQS 448
>gi|83588103|ref|ZP_00926728.1| COG2425: Uncharacterized protein containing a von Willebrand factor
type A (vWA) domain [Escherichia coli 101-1]
Length = 378
Score = 202 bits (513), Expect = 4e-50, Method: Composition-based stats.
Identities = 114/377 (30%), Positives = 208/377 (55%), Gaps = 10/377 (2%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE G G
Sbjct: 12 ELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGR------DPE-RLQRGERSGGLGG 64
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W+ I LF +++++ +++DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 65 --SNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 122
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R ++++VEEI L ++++A ++R+ S IP A + DFK+T
Sbjct: 123 HTKHLMNPEVLAAARQIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKST 182
Query: 186 IQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACIL 245
++ +++++ + K+ E F R + ++ ++L +DQSGSM +SVI+S+V+A L
Sbjct: 183 LRANLQHWHPQHGKLYIESPRFNSRIKRQ-SEQWQLVLLVDQSGSMVDSVIHSAVIAACL 241
Query: 246 ASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTI 305
+ ++T +VAFDT +VDLT DPV+LL QLGGGT+I +++Y + IE P K++
Sbjct: 242 WQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSV 301
Query: 306 FFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCF 365
L+SD EGG+ + ++ ++G+ V+ L A+ P YD A + ++G
Sbjct: 302 IILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDTAQALVNVGAQIA 361
Query: 366 ACNPEKLPLLLERVLKN 382
A P +L L L++
Sbjct: 362 AMTPGELASWLAENLQS 378
>gi|16130059|ref|NP_416625.1| conserved protein [Escherichia coli K12]
gi|89108937|ref|AP_002717.1| hypothetical protein [Escherichia coli W3110]
gi|465583|sp|P33352|YEHP_ECOLI Uncharacterized protein yehP
gi|405852|gb|AAA60484.1| yehP [Escherichia coli]
gi|1788440|gb|AAC75182.1| conserved protein [Escherichia coli K12]
gi|85675234|dbj|BAE76597.1| conserved hypothetical protein [Escherichia coli W3110]
gi|744221|prf||2014253Q yehP gene
Length = 378
Score = 201 bits (512), Expect = 5e-50, Method: Composition-based stats.
Identities = 115/377 (30%), Positives = 207/377 (54%), Gaps = 10/377 (2%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE G G
Sbjct: 12 ELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGR------DPE-RLQRGERSGGLGG 64
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W+ I LF +++++ +++DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 65 --SNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 122
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R + ++VEEI L ++++A ++R+ S IP A + DFK+T
Sbjct: 123 HTKHLMNPEVLAAARRIVCQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKST 182
Query: 186 IQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACIL 245
++ +++++ + K+ E F R + ++ ++L +DQSGSM +SVI+S+VMA L
Sbjct: 183 LRANLQHWHPQHGKLYIESPRFNSRIKRQ-SEQWQLVLLVDQSGSMVDSVIHSAVMAACL 241
Query: 246 ASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTI 305
+ ++T +VAFDT +VDLT DPV+LL QLGGGT+I +++Y + IE P K++
Sbjct: 242 WQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSV 301
Query: 306 FFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCF 365
L+SD EGG+ + ++ ++G+ V+ L A+ P YD A + ++G
Sbjct: 302 IILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDTAQALVNVGAQIA 361
Query: 366 ACNPEKLPLLLERVLKN 382
A P +L L L++
Sbjct: 362 AMTPGELASWLAENLQS 378
>gi|75514135|ref|ZP_00736465.1| hypothetical protein Ecol5_01001959 [Escherichia coli 53638]
Length = 378
Score = 201 bits (512), Expect = 5e-50, Method: Composition-based stats.
Identities = 115/377 (30%), Positives = 207/377 (54%), Gaps = 10/377 (2%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE G G
Sbjct: 12 ELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGR------DPE-RLQRGERSGGLGG 64
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W+ I LF +++++ +++DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 65 --SNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 122
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R + ++VEEI L ++++A ++R+ S IP A + DFK+T
Sbjct: 123 HTKHLMNPEVLAAARRIVCQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKST 182
Query: 186 IQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACIL 245
++ +++++ + K+ E F R + ++ ++L +DQSGSM +SVI+S+VMA L
Sbjct: 183 LRANLQHWHPQHGKLYIESPRFNSRIKRQ-SEQWQLVLLVDQSGSMVDSVIHSAVMAACL 241
Query: 246 ASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTI 305
+ ++T +VAFDT +VDLT DPV+LL QLGGGT+I +++Y + IE P K++
Sbjct: 242 WQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSV 301
Query: 306 FFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCF 365
L+SD EGG+ + ++ ++G+ V+ L A+ P YD A + ++G
Sbjct: 302 IILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDTARALVNVGAQIA 361
Query: 366 ACNPEKLPLLLERVLKN 382
A P +L L L++
Sbjct: 362 AMTPGELASWLAENLQS 378
>gi|82543488|ref|YP_407435.1| hypothetical protein SBO_0944 [Shigella boydii Sb227]
gi|81244899|gb|ABB65607.1| conserved hypothetical protein [Shigella boydii Sb227]
Length = 378
Score = 201 bits (510), Expect = 8e-50, Method: Composition-based stats.
Identities = 115/377 (30%), Positives = 206/377 (54%), Gaps = 10/377 (2%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE G G
Sbjct: 12 ELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGR------DPE-RLQRGERSGGLGG 64
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W I LF +++++ +++DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 65 --SNLTTPEWFNSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 122
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R ++++VEEI L ++++A ++R+ S IP A + DFK+T
Sbjct: 123 HTKHLMNPEVLAAARRIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKST 182
Query: 186 IQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACIL 245
++ +++++ + K+ E F R + ++ ++L +DQSGSM +SVI S+VMA L
Sbjct: 183 LRANLQHWHPQHGKLYIESPRFNSRIKRQ-SEQWQLVLLVDQSGSMVDSVIPSAVMAACL 241
Query: 246 ASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTI 305
+ ++T +VAFDT +VDLT DPV+LL QLGGGT+I +++Y + IE P K++
Sbjct: 242 WQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSV 301
Query: 306 FFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCF 365
L+SD EGG+ + ++ ++G+ V+ L A+ P YD A + ++G
Sbjct: 302 IILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDTAQALVNVGAQIA 361
Query: 366 ACNPEKLPLLLERVLKN 382
A P +L L L++
Sbjct: 362 AMTPGELASWLAENLQS 378
>gi|75231539|ref|ZP_00717908.1| hypothetical protein EcolB7_01000454 [Escherichia coli B7A]
Length = 378
Score = 200 bits (509), Expect = 1e-49, Method: Composition-based stats.
Identities = 114/377 (30%), Positives = 207/377 (54%), Gaps = 10/377 (2%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE G G
Sbjct: 12 ELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGR------DPE-RLQRGERSGGLGG 64
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W+ I LF +++++ +++DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 65 --SNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 122
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R ++++VEEI L ++++A ++R+ S IP A + FK+T
Sbjct: 123 HTKHLMNPEVLAAARQIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFVFKST 182
Query: 186 IQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACIL 245
++ +++++ + K+ E F R + ++ ++L +DQSGSM +SVI+S+VMA L
Sbjct: 183 LRANLQHWHPQHGKLYIESPRFNSRIKRQ-SEQWQLVLLVDQSGSMVDSVIHSAVMAACL 241
Query: 246 ASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTI 305
+ ++T +VAFDT +VDLT DPV+LL QLGGGT+I +++Y + IE P K++
Sbjct: 242 WQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSV 301
Query: 306 FFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCF 365
L+SD EGG+ + ++ ++G+ V+ L A+ P YD A + ++G
Sbjct: 302 IILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDHDTAQALVNVGAQIA 361
Query: 366 ACNPEKLPLLLERVLKN 382
A P +L L L++
Sbjct: 362 AMTPGELASWLAENLQS 378
>gi|75186911|ref|ZP_00700178.1| hypothetical protein EcolE_01000137 [Escherichia coli E24377A]
gi|157078705|gb|ABV18413.1| von Willebrand factor type A domain protein [Escherichia coli
E24377A]
Length = 378
Score = 197 bits (502), Expect = 8e-49, Method: Composition-based stats.
Identities = 113/377 (29%), Positives = 206/377 (54%), Gaps = 10/377 (2%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE G G
Sbjct: 12 ELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGR------DPE-RLQRGERSGGLGG 64
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W+ I LF +++++ +++DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 65 --SNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 122
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R ++++VEEI L ++++A ++R+ S IP A + DFK+T
Sbjct: 123 HTKHLMNPEVLAAARQIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKST 182
Query: 186 IQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACIL 245
++ +++++ + K+ E F R + ++ ++L +DQSGSM +SVI+S+VMA L
Sbjct: 183 LRANLQHWHPQHGKLYIESPRFNSRIKRQ-SEQWQLVLLVDQSGSMVDSVIHSAVMAACL 241
Query: 246 ASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTI 305
+ ++T +VAFDT + DLT DPV+LL QLGGGT+I +++Y + IE P K++
Sbjct: 242 WQLPGIRTHLVAFDTSVDDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSV 301
Query: 306 FFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCF 365
L+SD EGG+ + ++ ++ + V+ L A+ P YD A + ++G
Sbjct: 302 IILVSDFYEGGSSSLLTHQVKKCVQSCIKVLGLAALDSTATPCYDHDTAQALVNVGAQIA 361
Query: 366 ACNPEKLPLLLERVLKN 382
A P +L L L++
Sbjct: 362 AMTPGELASWLAENLQS 378
>gi|145595401|ref|YP_001159698.1| VWA containing CoxE family protein [Salinispora tropica CNB-440]
gi|145304738|gb|ABP55320.1| VWA containing CoxE family protein [Salinispora tropica CNB-440]
Length = 377
Score = 187 bits (475), Expect = 1e-45, Method: Composition-based stats.
Identities = 115/369 (31%), Positives = 201/369 (54%), Gaps = 8/369 (2%)
Query: 6 DIKRWRLILGK--DTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGA 63
+++RWRL+LG+ D + +E + + +WL R + + + G
Sbjct: 5 ELERWRLVLGEPGDAALGRRPLAAETAARDAALEWLYGRDEEL----GRRGVRRAGGRYG 60
Query: 64 GKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLAST 123
G GP+ WL DI LF ++ ++ +Q DA++R + ++ +P +LE+VEP+ ++
Sbjct: 61 GDGPATLTTVDWLDDISRLFPRDTIERLQRDAVERYEIHDIVTDPAVLERVEPNQSMLRA 120
Query: 124 IMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFK 183
++ K + + R ++ ++ ++ L ++++ A A R+ S A + D +
Sbjct: 121 VLRTKHLMNPQVLRLARRIVEAVIRQLMDKLATEVRVAFTGA-RARRPSRFRQARNFDVR 179
Query: 184 TTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMAC 243
TI+ + +Y E +++ E YFF R + ++ VIL +DQSGSM +SVI+S+V A
Sbjct: 180 RTIKDNLGHYRPEDQRLFIETPYFFSRIRQH-IDQWQVILLVDQSGSMTDSVIHSAVTAA 238
Query: 244 ILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKK 303
L + ++T +VAFDT+IVDLT DDPV+LL QLGGGT+I +++ Y + IE P++
Sbjct: 239 CLWGLPGVRTHLVAFDTDIVDLTSDVDDPVELLMKVQLGGGTNIGRAVDYAAQLIEQPRR 298
Query: 304 TIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIP 363
+I LI+D EGG+ ++R + + E G V+ L A+ P YD A ++A +G
Sbjct: 299 SIVALITDFYEGGSEERLVRTVRGLVEQGTKVLGLAALDEQANPVYDRVAAQRLADVGAS 358
Query: 364 CFACNPEKL 372
A P +L
Sbjct: 359 VGAMTPGEL 367
>gi|11875069|dbj|BAB19548.1| hypothetical protein [Escherichia coli O157:H7]
Length = 306
Score = 179 bits (453), Expect = 4e-43, Method: Composition-based stats.
Identities = 94/307 (30%), Positives = 179/307 (58%), Gaps = 1/307 (0%)
Query: 76 LGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIMLLKEQIPQKS 135
+ I LF +++++ +++DA+ R G++ ++ ++LE+++P +L ++ K + +
Sbjct: 1 INSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVLHTKHLMNPEV 60
Query: 136 KESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTTIQRGIKNYNK 195
+ R ++++VEEI L ++++A ++R+ S IP A DFK+T++ +++++
Sbjct: 61 LAAARRIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSSIPLARDFDFKSTLRANLQHWHP 120
Query: 196 ELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACILASIASLKTRV 255
+ K+ E F R + + ++ ++L +D SGSM +SVI+S+VMA L + ++T +
Sbjct: 121 QHGKLYIESPRFNSRIKRH-SEQWQLVLLVDLSGSMVDSVIHSAVMAACLWQLPGIRTHL 179
Query: 256 VAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTIFFLISDLMEG 315
VAFDT +VDLT DPV+LL QLGGGT+I +++Y + IE P K++ L+SD EG
Sbjct: 180 VAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSVIILVSDFYEG 239
Query: 316 GNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCFACNPEKLPLL 375
G+ + ++ ++G+ V+ L A+ P YD MA + ++G A P +L
Sbjct: 240 GSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDMAQALVNVGAQIAAMTPGELATW 299
Query: 376 LERVLKN 382
L L++
Sbjct: 300 LAENLQS 306
>gi|23011912|ref|ZP_00052135.1| hypothetical protein Magn03006485 [Magnetospirillum magnetotacticum
MS-1]
Length = 193
Score = 177 bits (448), Expect = 2e-42, Method: Composition-based stats.
Identities = 80/166 (48%), Positives = 116/166 (69%)
Query: 221 VILDIDQSGSMGESVIYSSVMACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQ 280
VIL IDQSGSM SV+YSS+ ++AS+ ++ TR+V FDTE+VDL+++ DDPV++L+ Q
Sbjct: 10 VILCIDQSGSMANSVVYSSIFGAVMASLPAVSTRLVVFDTEVVDLSDEMDDPVEVLFSVQ 69
Query: 281 LGGGTDINKSIKYCMKYIENPKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLA 340
LGGGTDIN+++ YC I P+ T+ LISDL EGG G+L + + AGV VV LLA
Sbjct: 70 LGGGTDINRAVGYCASRITRPEDTVLVLISDLYEGGVEAGLLAQAQRLVGAGVQVVALLA 129
Query: 341 ISGDGQPYYDAQMAGKIASMGIPCFACNPEKLPLLLERVLKNLDLS 386
+S +G P YD +A ++A +G+P FAC P+ P ++ ++ DL+
Sbjct: 130 LSDEGAPAYDRGLAARLAGLGVPAFACTPDLFPDMMAAAIRKQDLT 175
>gi|113940946|ref|ZP_01426763.1| VWA containing CoxE-like [Herpetosiphon aurantiacus ATCC 23779]
gi|113897413|gb|EAU16458.1| VWA containing CoxE-like [Herpetosiphon aurantiacus ATCC 23779]
Length = 460
Score = 143 bits (361), Expect = 2e-32, Method: Composition-based stats.
Identities = 86/295 (29%), Positives = 161/295 (54%), Gaps = 6/295 (2%)
Query: 81 NLFDKELVKIIQ---TDAMDRCGLKQLIFEPEILEQVEPDINLASTIMLLKEQIPQKSKE 137
NL ++EL ++IQ D + R L++++ + + Q+ P + + ++ K + +
Sbjct: 159 NLSEEELRQVIQGLEKDLIKRMALREVLQDNRLAAQLTPSMAVVEQLLRDKSHLSGNALI 218
Query: 138 SVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTTIQRGIKNYNKEL 197
+ + IK+ V+E+ +L + +AV A ++ R P +LD K TI R + N+N
Sbjct: 219 NAKRLIKQYVDELADVLRLQVMQAVSAKID-RSVPPKRVFRNLDLKRTIWRNLTNWNSNE 277
Query: 198 KKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVMACILASIASLKTRVVA 257
++ + Y+ R + + +I+ +DQSGSM ++++ +++A I A + + ++A
Sbjct: 278 GRLYVDRLYY--RQTAKKRTPMRMIVVVDQSGSMVDAMVQCTILASIFAGLPHVDMHLIA 335
Query: 258 FDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTIFFLISDLMEGGN 317
FDT ++DLT DP ++L QLGGGT IN+++ + + I+ P+KT LI+D EGG+
Sbjct: 336 FDTRMLDLTPWVHDPFEVLLRTQLGGGTSINEALLFASEKIQEPRKTAVVLITDFYEGGS 395
Query: 318 RGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCFACNPEKL 372
+L ++ M E+GV + + A++ G + K+ MG P FA +P KL
Sbjct: 396 DQVLLDTIKAMIESGVHFIPVGAVTSSGYFSVNDWFRTKLKEMGRPIFAGSPRKL 450
>gi|149173116|ref|ZP_01851747.1| hypothetical protein PM8797T_28039 [Planctomyces maris DSM 8797]
gi|148847922|gb|EDL62254.1| hypothetical protein PM8797T_28039 [Planctomyces maris DSM 8797]
Length = 1197
Score = 140 bits (352), Expect = 2e-31, Method: Composition-based stats.
Identities = 97/372 (26%), Positives = 177/372 (47%), Gaps = 28/372 (7%)
Query: 9 RWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPT---GKFMSGEVGAG-AG 64
RWRLILG + S+ S+ ++ LD +Y + G+ + G++ + G
Sbjct: 833 RWRLILGV---KGCSTPKSQQVAG----------TLDQLYGGSEREGRGLQGDLASDRGG 879
Query: 65 KGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTI 124
+ P + W+ D+ LF K++ + + +A + + E V P + L +
Sbjct: 880 TEAAAPSVREWISDVERLFGKDVCEEVLGEAA--VNGRAAVLEHLNHATVRPSVELLEQV 937
Query: 125 MLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRA---ALNKRQHSPIPSASSLD 181
+ L+ + ++ +R + I E + K L + ++ A+ A R+ SP LD
Sbjct: 938 LSLRGALSERELGLLRKLARNITERMAKQLANRLRPALHGLSIARPTRRRSP-----RLD 992
Query: 182 FKTTIQRGIKN-YNKELKKIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSV 240
F T+ + Y K +I R + +I +D SGSM SVIYSS+
Sbjct: 993 FARTLNSNLHTAYRKSDGRISIAPTRLVYRLPAKRQMDWHLIFVVDVSGSMEASVIYSSM 1052
Query: 241 MACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIEN 300
MA I +++ ++ + AF T+++D T + +DP+ LL Q+GGGT I ++ + I N
Sbjct: 1053 MAAIFSALPAIDVKFFAFSTQVIDFTGRVEDPLSLLMEIQIGGGTHIGLGLRAARESITN 1112
Query: 301 PKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASM 360
P +T+ L++D EG + +L + + +G ++ L A++ + +P Y A A +
Sbjct: 1113 PSRTLVVLVTDFEEGVSVPELLSEVVMLSSSGAKLIGLAALNDEAKPRYHAGTAAAVVQA 1172
Query: 361 GIPCFACNPEKL 372
G+P A +PE+L
Sbjct: 1173 GMPVAAVSPERL 1184
>gi|32473270|ref|NP_866264.1| hypothetical protein RB4715 [Rhodopirellula baltica SH 1]
gi|32397949|emb|CAD73950.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
Length = 1291
Score = 136 bits (342), Expect = 3e-30, Method: Composition-based stats.
Identities = 100/371 (26%), Positives = 174/371 (46%), Gaps = 23/371 (6%)
Query: 9 RWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGKGPS 68
RWRLI G E S A+ S D L R G + G G G
Sbjct: 912 RWRLIFGLPPE----SGTPLAMRCASSLDQLYGRGHGE--GSRGGLANAPSGMGGGTEAP 965
Query: 69 NPQISRWLGDIRNLFDKELVK-IIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIMLL 127
P ++W D+ LF +L + ++ T A + + +P+ V P + L ++ L
Sbjct: 966 EPTTAQWAEDLEALFGSDLCQEVLGTAAGNGRSTAIELLDPD---TVTPSLELLQQVLSL 1022
Query: 128 KEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPS---ASSLDFKT 184
+P+ ++R +++ E+ L S++ ++ A+N SP P+ A L+
Sbjct: 1023 AGAMPESKVATLRRLARRLTEQ----LASELAVRLQPAMNGLS-SPRPTRRRARKLNLPR 1077
Query: 185 TIQRGIKNYNKELK---KIIPEHYYFFERASTNPTSKFTVILDIDQSGSMGESVIYSSVM 241
T++ + N ++ I+ E F + T ++D+ S SM SVIYS+++
Sbjct: 1078 TLRDNLANCHRRADGRATIVAEKLMFHSPSKRQMDWHVTFVVDV--SASMSASVIYSALV 1135
Query: 242 ACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKSIKYCMKYIENP 301
A + ++ +L R +AF TE++D +E+ DP+ LL Q+GGGTDI ++ + P
Sbjct: 1136 AAVFDALPALSVRFLAFSTEVLDFSEQVADPLSLLLEVQVGGGTDIGLGLRAARAGVTVP 1195
Query: 302 KKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYDAQMAGKIASMG 361
++I L+SD EG + G M+ + ++ +AGV + L ++ G + A +A G
Sbjct: 1196 SRSIVILVSDFEEGVSVGRMIAEVRELVDAGVKCLGLASLDDSGVARFHQGYAAMMAGAG 1255
Query: 362 IPCFACNPEKL 372
+P A +PEKL
Sbjct: 1256 MPVAAVSPEKL 1266
>gi|21219695|ref|NP_625474.1| hypothetical protein SCO1184 [Streptomyces coelicolor A3(2)]
gi|6468436|emb|CAB61596.1| conserved hypothetical protein SCG11A.15 [Streptomyces coelicolor
A3(2)]
Length = 1320
Score = 136 bits (342), Expect = 3e-30, Method: Composition-based stats.
Identities = 97/382 (25%), Positives = 184/382 (48%), Gaps = 44/382 (11%)
Query: 9 RWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGE-------VGA 61
RWRL+LG+ T+ S ALD +Y S G+
Sbjct: 956 RWRLVLGRRTDR------------LSAAAAAPATALDELYGSGRGEGSRGDLTRPGRGGS 1003
Query: 62 GAGKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPD---- 117
G G+ PS P + W ++ LF + + + A P++L +++PD
Sbjct: 1004 GGGREPSYPGVREWSEELAALFGPGIREEVLAAAAASG-------RPDVLAELDPDSVRP 1056
Query: 118 -INLASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPS 176
++L T++ +P+ ++R ++++VE + + L + ++ A+ + P PS
Sbjct: 1057 SVDLLRTVLRHAGGLPEARLAALRPLVRRLVEALTRELATRLRPALHGTV-----VPRPS 1111
Query: 177 ---ASSLDFKTTIQRGIKNYNKE---LKKIIPEHYYFFERASTNPTSKFTVILDIDQSGS 230
LD T++ + + ++ +++PEH F RA + + ++ D+ SGS
Sbjct: 1112 RRPGGGLDLPRTLRANLASAHRGPDGTVRVLPEHPVFRTRARRSADWRLVLVTDV--SGS 1169
Query: 231 MGESVIYSSVMACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKS 290
M S +++++ A +LA + +L T +AF TE++DLT+ DDP+ LL +GGGT I
Sbjct: 1170 MEASTVWAALTASVLAGVPTLSTHFLAFSTEVIDLTDHVDDPLSLLLEVSVGGGTHIAAG 1229
Query: 291 IKYCMKYIENPKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYD 350
+++ + + P +T+ ++SD EG GG+L + + AG V+ ++ G+P Y
Sbjct: 1230 LRHARELVTVPSRTLVVVVSDFEEGYPLGGLLAEVRALVGAGCHVLGCASLDDAGRPRYS 1289
Query: 351 AQMAGKIASMGIPCFACNPEKL 372
+AG++ + G+P A +P +L
Sbjct: 1290 TGVAGRLVAAGMPVAALSPLEL 1311
>gi|21224983|ref|NP_630762.1| hypothetical protein SCO6688 [Streptomyces coelicolor A3(2)]
gi|5457289|emb|CAB46976.1| conserved hypothetical protein [Streptomyces coelicolor A3(2)]
Length = 1171
Score = 130 bits (326), Expect = 2e-28, Method: Composition-based stats.
Identities = 102/400 (25%), Positives = 178/400 (44%), Gaps = 62/400 (15%)
Query: 9 RWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGE---------- 58
RWRL+LG+DT +++ A RALD +++ G+ E
Sbjct: 785 RWRLLLGRDTAGLPAALRPYA------------RALDELFDREGEEADEESRETNEGGDG 832
Query: 59 ---------VGAGA------GKGPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRC---G 100
G+ G S P + W D+R LF E I+ + ++R G
Sbjct: 833 GEPEPGAGGTSEGSDDDRTGGAARSFPSVRHWAEDLRTLFGAE----IRQEVLERAVADG 888
Query: 101 LKQLI--FEPEILEQVEPDINLASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDI 158
+I +P V P + L S ++ L +P++ S+R +K++VEE+ K L + +
Sbjct: 889 RTDVIALLDPA---SVRPSVELLSAVLTLARGMPEQRVASLRPLVKRLVEELTKELATRL 945
Query: 159 KRAVRAALNKRQHSPIPS---ASSLDFKTTIQRGIKNYNKELK---KIIPEHYYFFERAS 212
+ + +P P+ LD T++ + + + +++PE F R
Sbjct: 946 RPTLTGLT-----TPRPTRRPGGPLDLPRTLRANLAHIRRREDGRVEVVPERPVF--RTR 998
Query: 213 TNPTSKFTVILDIDQSGSMGESVIYSSVMACILASIASLKTRVVAFDTEIVDLTEKSDDP 272
T + + +IL +D S SM SV++S++ A IL +L T + F T++ DLT DP
Sbjct: 999 TARRNDWRLILVVDVSASMETSVVWSALTAAILGGAPTLSTHFLTFSTQVADLTGLVADP 1058
Query: 273 VDLLYGFQLGGGTDINKSIKYCMKYIENPKKTIFFLISDLMEGGNRGGMLRNLEDMKEAG 332
+ LL ++GGGT I + + + P +T+ ++SD EG G+L + + AG
Sbjct: 1059 LSLLLEVKVGGGTHIAAGLAHARSLVTVPDRTLVVVVSDFEEGAAVEGLLAEVGALVSAG 1118
Query: 333 VTVVCLLAISGDGQPYYDAQMAGKIASMGIPCFACNPEKL 372
V ++ A + G P Y + ++ + G+P A P L
Sbjct: 1119 VRLLGCAAPADGGTPRYSVPVTRRLVAAGMPVAALGPLSL 1158
>gi|29827527|ref|NP_822161.1| hypothetical protein SAV986 [Streptomyces avermitilis MA-4680]
gi|29604627|dbj|BAC68696.1| hypothetical protein [Streptomyces avermitilis MA-4680]
Length = 478
Score = 119 bits (299), Expect = 3e-25, Method: Composition-based stats.
Identities = 68/282 (24%), Positives = 145/282 (51%), Gaps = 3/282 (1%)
Query: 91 IQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIMLLKEQIPQKSKESVRAFIKKIVEEI 150
I+ D + R L++++ +P + Q+ P ++L ++ K + + + +A I++ V+E+
Sbjct: 191 IEADLVKRMHLREVLADPTLAAQLTPSMSLIEQLLRDKNNLSGVALANAKALIRRFVDEV 250
Query: 151 NKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTTIQRGIKNYNKELKKIIPEHYYFFER 210
++L + + +A AL+ R P + +LD TI + + N++ E +++ + Y+ R
Sbjct: 251 AEVLRTQVAQATAGALD-RSVPPKRTFRNLDLDRTIWKNLTNWSPEEERLYVDRLYY--R 307
Query: 211 ASTNPTSKFTVILDIDQSGSMGESVIYSSVMACILASIASLKTRVVAFDTEIVDLTEKSD 270
+ T+ +I+ +DQSGSM +S++ +++A I A + + ++A+DT +DLT
Sbjct: 308 HTARKTTPQRLIVVVDQSGSMVDSMVNCTILASIFAGLPKVDVHLIAYDTRAIDLTPWVS 367
Query: 271 DPVDLLYGFQLGGGTDINKSIKYCMKYIENPKKTIFFLISDLMEGGNRGGMLRNLEDMKE 330
DP ++L LGGG D ++ I P+ T+ ISD E + +E +
Sbjct: 368 DPFEMLLRTNLGGGNDGPVAMAMARPKITEPRSTVMVWISDFYEFDRSQPLFEGIEAVHR 427
Query: 331 AGVTVVCLLAISGDGQPYYDAQMAGKIASMGIPCFACNPEKL 372
+GV + + +++ G+ + + ++G P + + KL
Sbjct: 428 SGVKFIPVGSVTSSGRQEVNPWFRERFKALGTPVVSGHIRKL 469
>gi|119881509|ref|ZP_01647803.1| VWA containing CoxE-like [Salinispora arenicola CNS205]
gi|119825598|gb|EAX28152.1| VWA containing CoxE-like [Salinispora arenicola CNS205]
Length = 504
Score = 115 bits (288), Expect = 5e-24, Method: Composition-based stats.
Identities = 77/338 (22%), Positives = 162/338 (47%), Gaps = 23/338 (6%)
Query: 62 GAGKGP-SNPQISRWLGDIRNLFDKEL------------------VKIIQTDAMDRCGLK 102
G GP S Q++RW D F++ L + ++ D + R L+
Sbjct: 170 AGGTGPVSASQLARWQSDA-GWFEQALGAEPGELRRQGATGLGGALAALEGDLVRRMHLR 228
Query: 103 QLIFEPEILEQVEPDINLASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAV 162
+++ +P + ++ P ++L ++ K + + + +A I++ V+E+ ++L + +++
Sbjct: 229 EVLADPALASRLTPSMSLIEQLLRDKANLSGVALANAKALIRRFVDEVAEVLRTQVEQTS 288
Query: 163 RAALNKRQHSPIPSASSLDFKTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVI 222
++ R P +LD TI + + N++ E +++ + Y+ R + T+ +I
Sbjct: 289 VGTID-RSVPPKRVFRNLDLDRTIWQNLTNWSPEDQRLYVDRLYY--RRTARRTTPARLI 345
Query: 223 LDIDQSGSMGESVIYSSVMACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLG 282
+ +DQSGSM +S++ +++A I A + + ++A+DT +DLT DP ++L +LG
Sbjct: 346 VVVDQSGSMVDSMVNCTILASIFAGLPKVDVHLIAYDTRALDLTPWVRDPFEVLLRTKLG 405
Query: 283 GGTDINKSIKYCMKYIENPKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAIS 342
GG D ++ I P+ T+ ISD E +L +E + +GV + + +++
Sbjct: 406 GGNDGPVAMAMARPKIAEPRNTVMVWISDFYEFDRSQPLLDGIEAVHRSGVRFIPVGSVN 465
Query: 343 GDGQPYYDAQMAGKIASMGIPCFACNPEKLPLLLERVL 380
GQ + + +G P + + KL L+ L
Sbjct: 466 SSGQQSVNPWFRQRFKDLGTPVISGHIRKLVFELKSFL 503
>gi|145594552|ref|YP_001158849.1| VWA containing CoxE family protein [Salinispora tropica CNB-440]
gi|145303889|gb|ABP54471.1| VWA containing CoxE family protein [Salinispora tropica CNB-440]
Length = 447
Score = 111 bits (278), Expect = 8e-23, Method: Composition-based stats.
Identities = 76/338 (22%), Positives = 163/338 (48%), Gaps = 23/338 (6%)
Query: 62 GAGKGP-SNPQISRWLGDIRNLFDKEL------------------VKIIQTDAMDRCGLK 102
AG GP S +++RW D F++ L + ++ D + R L+
Sbjct: 113 AAGTGPVSASELARWQSDA-GWFEQALGAEPGELRRQGGTGLGGALAALEGDLVRRMHLR 171
Query: 103 QLIFEPEILEQVEPDINLASTIMLLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAV 162
+++ +P + ++ P ++L ++ K + + + +A I++ V+E+ ++L + +++
Sbjct: 172 EVLADPALASRLTPSMSLIEQLLRDKANLSGVALANAKALIRRFVDEVAEVLRTQVEQTS 231
Query: 163 RAALNKRQHSPIPSASSLDFKTTIQRGIKNYNKELKKIIPEHYYFFERASTNPTSKFTVI 222
++ R P +LD TI + + N++ E +++ + Y+ R + T+ +I
Sbjct: 232 VGTID-RSVPPKRVFRNLDLDRTIWQNLTNWSPEDQRLYVDRLYY--RRTARRTTPARLI 288
Query: 223 LDIDQSGSMGESVIYSSVMACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLG 282
+ +DQSGSM +S++ +++A I A + + ++A+DT+ +DLT DP ++L +LG
Sbjct: 289 VVVDQSGSMVDSMVNCTILASIFAGLPKVDVHLIAYDTQALDLTPWVRDPFEVLLRTKLG 348
Query: 283 GGTDINKSIKYCMKYIENPKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAIS 342
GG D ++ I P+ T+ ISD E + +E + +GV + + +++
Sbjct: 349 GGNDGPVAMAMARPKIAEPRNTVMVWISDFYEFDRSQPLFDGIEAVHRSGVRFIPVGSVN 408
Query: 343 GDGQPYYDAQMAGKIASMGIPCFACNPEKLPLLLERVL 380
GQ + + +G P + + KL L+ L
Sbjct: 409 SSGQQSVNPWFRQRFKDLGTPVISGHIRKLVFELKSFL 446
>gi|124526698|ref|ZP_01698607.1| VWA containing CoxE family protein [Escherichia coli B]
gi|124501754|gb|EAY49216.1| VWA containing CoxE family protein [Escherichia coli B]
Length = 152
Score = 107 bits (268), Expect = 1e-21, Method: Composition-based stats.
Identities = 56/152 (36%), Positives = 88/152 (57%)
Query: 231 MGESVIYSSVMACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKS 290
M +SVI+S+VMA L + ++T +VAFDT +VDLT DPV+LL QLGGGT+I +
Sbjct: 1 MVDSVIHSAVMAACLWQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASA 60
Query: 291 IKYCMKYIENPKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYD 350
++Y + IE P K++ L+SD EGG+ + ++ ++G+ V+ L A+ P YD
Sbjct: 61 VEYGRQLIEQPAKSVIILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYD 120
Query: 351 AQMAGKIASMGIPCFACNPEKLPLLLERVLKN 382
A + ++G A P +L L L++
Sbjct: 121 RDTAQALVNVGAQIAAMTPGELASWLAENLQS 152
>gi|82777399|ref|YP_403748.1| hypothetical protein SDY_2171 [Shigella dysenteriae Sd197]
gi|81241547|gb|ABB62257.1| hypothetical protein SDY_2171 [Shigella dysenteriae Sd197]
Length = 152
Score = 105 bits (261), Expect = 7e-21, Method: Composition-based stats.
Identities = 55/152 (36%), Positives = 87/152 (57%)
Query: 231 MGESVIYSSVMACILASIASLKTRVVAFDTEIVDLTEKSDDPVDLLYGFQLGGGTDINKS 290
M +SVI+S+VMA L + ++T +VAF T +VDLT DPV+LL QLGGGT+I +
Sbjct: 1 MVDSVIHSAVMAACLWQLPGIRTHLVAFGTSVVDLTADVADPVELLMKVQLGGGTNIASA 60
Query: 291 IKYCMKYIENPKKTIFFLISDLMEGGNRGGMLRNLEDMKEAGVTVVCLLAISGDGQPYYD 350
++Y + IE P K++ L+SD EGG+ + ++ ++G+ V+ L A+ P YD
Sbjct: 61 VEYGRQLIEQPAKSVIILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYD 120
Query: 351 AQMAGKIASMGIPCFACNPEKLPLLLERVLKN 382
A + ++G A P +L L L++
Sbjct: 121 RDTAQALVNVGAQIAAMTPGELASWLAENLQS 152
>gi|124526697|ref|ZP_01698606.1| conserved hypothetical protein [Escherichia coli B]
gi|124501753|gb|EAY49215.1| conserved hypothetical protein [Escherichia coli B]
Length = 214
Score = 92.8 bits (229), Expect = 3e-17, Method: Composition-based stats.
Identities = 53/205 (25%), Positives = 108/205 (52%), Gaps = 9/205 (4%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE G G
Sbjct: 12 ELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGR------DPE-RLQRGERSGGLGG 64
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W+ I LF +++++ +++DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 65 --SNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 122
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R ++++VEEI L ++++A ++R+ S IP A + DFK+T
Sbjct: 123 HTKHLMNPEVLATARRIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKST 182
Query: 186 IQRGIKNYNKELKKIIPEHYYFFER 210
++ +++++ + K+ E F R
Sbjct: 183 LRANLQHWHPQHGKLYIESPRFNSR 207
>gi|82777400|ref|YP_403749.1| hypothetical protein SDY_2172 [Shigella dysenteriae Sd197]
gi|81241548|gb|ABB62258.1| conserved hypothetical protein [Shigella dysenteriae Sd197]
Length = 230
Score = 90.1 bits (222), Expect = 2e-16, Method: Composition-based stats.
Identities = 51/198 (25%), Positives = 104/198 (52%), Gaps = 9/198 (4%)
Query: 6 DIKRWRLILGKDTEEDFSSMDSEAISSFSEEDWLMDRALDAIYNPTGKFMSGEVGAGAGK 65
+++RWRLILG+ E +D A +WL R +P + GE G G
Sbjct: 12 ELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGR------DPE-RLQRGERSGGLGG 64
Query: 66 GPSNPQISRWLGDIRNLFDKELVKIIQTDAMDRCGLKQLIFEPEILEQVEPDINLASTIM 125
SN W+ I LF +++++ ++ DA+ R G++ ++ ++LE+++P +L ++
Sbjct: 65 --SNLTTPEWINSIHTLFPQQVIERLEIDAVLRYGIEDVVTNLDVLERMQPSESLLRAVL 122
Query: 126 LLKEQIPQKSKESVRAFIKKIVEEINKLLESDIKRAVRAALNKRQHSPIPSASSLDFKTT 185
K + + + R ++++VEEI L ++++A ++R S IP A + DFK+T
Sbjct: 123 HTKHLMNPEVLAAARRIVRQVVEEIMARLAKEVRQAFSGVRDRRCRSFIPLARNFDFKST 182
Query: 186 IQRGIKNYNKELKKIIPE 203
++ +++++ + K+ E
Sbjct: 183 LRANLQHWHPQHGKLYIE 200
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.317 0.136 0.384
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,423,762,253
Number of Sequences: 5470121
Number of extensions: 60403597
Number of successful extensions: 172284
Number of sequences better than 1.0e-05: 59
Number of HSP's better than 0.0 without gapping: 59
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 172130
Number of HSP's gapped (non-prelim): 59
length of query: 395
length of database: 1,894,087,724
effective HSP length: 135
effective length of query: 260
effective length of database: 1,155,621,389
effective search space: 300461561140
effective search space used: 300461561140
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 131 (55.1 bits)