BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= PI0345
(415 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|153806474|ref|ZP_01959142.1| hypothetical protein BACCAC... 443 e-122
gi|124005742|ref|ZP_01690581.1| hypothetical protein M23134... 113 2e-23
gi|124002986|ref|ZP_01687837.1| hypothetical protein M23134... 111 9e-23
gi|124004358|ref|ZP_01689204.1| hypothetical protein M23134... 110 1e-22
gi|124008725|ref|ZP_01693415.1| hypothetical protein M23134... 110 2e-22
gi|124009886|ref|ZP_01694553.1| hypothetical protein M23134... 108 5e-22
gi|116749310|ref|YP_845997.1| protein of unknown function D... 85 9e-15
gi|114707485|ref|ZP_01440381.1| hypothetical protein FP2506... 77 2e-12
gi|91789628|ref|YP_550580.1| protein of unknown function DU... 73 4e-11
gi|149201895|ref|ZP_01878869.1| hypothetical protein RTM103... 69 5e-10
gi|87120002|ref|ZP_01075898.1| hypothetical protein MED121_... 69 7e-10
gi|85858389|ref|YP_460591.1| mu-like prophage Flumu protein... 67 2e-09
gi|24374221|ref|NP_718264.1| hypothetical protein SO_2680 [... 66 4e-09
gi|121587537|ref|ZP_01677304.1| conserved hypothetical prot... 66 4e-09
gi|85703723|ref|ZP_01034827.1| hypothetical protein ROS217_... 66 4e-09
gi|48696939|ref|YP_024702.1| gp29 [Burkholderia phage BcepM... 65 6e-09
gi|116221405|ref|ZP_01486813.1| hypothetical protein VchoV5... 65 6e-09
gi|147673735|ref|YP_001216624.1| hypothetical protein VC039... 65 1e-08
gi|134288841|ref|YP_001111203.1| gp3 [Burkholderia phage ph... 64 2e-08
gi|154248422|ref|YP_001419380.1| protein of unknown functio... 63 4e-08
gi|34497594|ref|NP_901809.1| hypothetical protein CV_2139 [... 62 5e-08
gi|46579533|ref|YP_010341.1| hypothetical protein DVU1120 [... 62 8e-08
gi|16760410|ref|NP_456027.1| hypothetical protein STY1616 [... 60 2e-07
gi|75512142|ref|ZP_00734714.1| COG4383: Mu-like prophage pr... 60 3e-07
gi|83310863|ref|YP_421127.1| Mu-like prophage FluMu protein... 60 3e-07
gi|75255742|ref|ZP_00727503.1| COG4383: Mu-like prophage pr... 60 3e-07
gi|75228563|ref|ZP_00715197.1| COG4383: Mu-like prophage pr... 60 4e-07
gi|146277346|ref|YP_001167505.1| protein of unknown functio... 59 6e-07
gi|60099637|gb|AAX12941.1| hypothetical protein [Escherichi... 59 6e-07
gi|54654574|gb|AAV37152.1| hypothetical protein [Haemophilu... 56 4e-06
gi|37527315|ref|NP_930659.1| hypothetical protein plu3441 [... 56 4e-06
gi|32029859|ref|ZP_00132813.1| COG4383: Mu-like prophage pr... 55 7e-06
>gi|153806474|ref|ZP_01959142.1| hypothetical protein BACCAC_00738 [Bacteroides caccae ATCC 43185]
gi|149131151|gb|EDM22357.1| hypothetical protein BACCAC_00738 [Bacteroides caccae ATCC 43185]
Length = 414
Score = 443 bits (1139), Expect = e-122, Method: Composition-based stats.
Identities = 203/388 (52%), Positives = 284/388 (73%), Gaps = 5/388 (1%)
Query: 1 MELQRTTDSLTRKDIGDWRIAWQMAINVDNPNRQRLYDIYRDVEVDLHLSGCIQQREGFV 60
+EL + +LT+KDI WR AWQ AIN++NP+R L IY D +DLHL GCI QR+G V
Sbjct: 32 VELALQSQALTKKDIKSWRTAWQQAINIENPHRGPLLTIYTDAMIDLHLGGCIGQRKGMV 91
Query: 61 LSRSFKLVNEKGDEDTDATKYFNTAWFKRLMKFALDANYWGHSLIELGELTTDANNRLCY 120
+ +SFKLV+ KG+E+ DAT+ WFK M + LD+ Y+GHSLI+LG++ + N ++ Y
Sbjct: 92 MKKSFKLVDIKGNEEPDATQLMEAVWFKDFMDYVLDSRYYGHSLIQLGDVV-ERNGKIQY 150
Query: 121 DGVTLIPRKHVIPEYGRLVANLGDDWKSGKDYRRTPFTEWLIEVGQPDGLGLYLKAATQT 180
D V L+PR HV+PE+G ++ D+W+ G DYR +W +E G+ D LGL LK QT
Sbjct: 151 DRVLLVPRHHVVPEHGVILKEASDEWQKGIDYRTGAMADWCVEAGKWDDLGLLLKLVPQT 210
Query: 181 IPKKNTLAFWDTFAEIFGMPMRIARTTTRDKKEMAEMEKMMSDMGTEGWAIFQQGTEIEV 240
I KKN LAFWD F E+FGMP+R+ +TTTRD KE +++E M+++MG W +F +GTEI++
Sbjct: 211 ISKKNMLAFWDQFGELFGMPIRVGKTTTRDAKERSKIENMLNEMGAAAWGLFPEGTEIDI 270
Query: 241 VESKKGDAFNVYDRRIDRANSELSKLIIGQTMTIEDGSSLSQSETHLEVFQNLVEADCDT 300
ES +GDAFNVYD+R++RANSE+SK ++ QTMTI+ GSSLSQS+THL+VFQN+V+ D D
Sbjct: 271 KESSRGDAFNVYDKRVERANSEISKGVLNQTMTIDSGSSLSQSQTHLQVFQNVVDRDADF 330
Query: 301 IRDVVNNQLIPRMIQHGFPLQGIHFDWDYSEDYTPEQQVAYEQLVLNNYEVDPSYFEEKY 360
+RD +N +L+PRM++HGFP++G+ FDWD S DYTPEQQVAYE ++ + +EV+P YF EKY
Sbjct: 331 LRDTINGELLPRMLKHGFPVKGLRFDWDESIDYTPEQQVAYETMIADRFEVEPEYFIEKY 390
Query: 361 NMPVGERRNVGSSTGENGKNKNLKKFFD 388
N+P+ ++ ++ G+ K FFD
Sbjct: 391 NIPIKGKK----ASPVPGQMPQAKPFFD 414
>gi|124005742|ref|ZP_01690581.1| hypothetical protein M23134_03968 [Microscilla marina ATCC 23134]
gi|123988810|gb|EAY28416.1| hypothetical protein M23134_03968 [Microscilla marina ATCC 23134]
Length = 616
Score = 113 bits (282), Expect = 2e-23, Method: Composition-based stats.
Identities = 98/392 (25%), Positives = 177/392 (45%), Gaps = 24/392 (6%)
Query: 12 RKDIGDWRIAWQMAINVDNPNRQRLYDIYRDVEVDLHLSGCIQQREGFVLSRSFKLVNEK 71
R I DW AW++A + P L ++Y V +D + I R+ VL F +++ +
Sbjct: 28 RVTIADWMQAWRIAESPVMPRTYPLMEVYDMVCIDAEVITAINNRKNKVLGEPFLVLDSE 87
Query: 72 GDEDTDATKYFNTAWFKRLMKFALDANYWGHSLIELGELTTDANNRLCYDGVTLIPRKHV 131
G+ ATK +WF + LDA ++G+SLIE+G+ D + + + + R+++
Sbjct: 88 GNYQEVATKKLRGSWFDDFINAVLDAEFYGYSLIEIGQTLNDDDYSI--KEINTVERRNI 145
Query: 132 IPEYGRLVANLGDDWKSGKDYRRTPFTEWLIEV-GQPDGLGLYLKAATQTIPKKNTLAFW 190
IP ++ ++ G D+ W I + LGL L AA I ++ L
Sbjct: 146 IPGRKLVLPYPFSAYQEGIDFSADYLKRWYIFCHSKRFPLGLLLYAAPYAILSRDALVKH 205
Query: 191 DTFAEIFGMPMRIARTTTRDKKEMAEMEKMMSDMGTEGWAIFQQGTEIEVV-ESKKGDAF 249
F +G P +I T ++ +E E+ + ++ E +F ++EV S G
Sbjct: 206 AEFNRDYGKPYKIVH-TDKEGEERREVLDALLNVEEELNGVFDIEEKVEVAFPSGGGGTV 264
Query: 250 NVYDRRIDRANSELSKLIIGQTMTIEDGSSLSQSETHLEVF-----QNLVEADCDTIRDV 304
+ +D+ ++A+ +L K+I G D +Q+ + + + + E D + V
Sbjct: 265 DTFDKMDEKASKKLLKIIQGHVKLSNDTDGGAQTYMNKDSIAKTPSEEIREYDMARVESV 324
Query: 305 VNNQLIPRMIQHGFPLQGIHFDW--DYSEDYTPEQQVAYE---QLVLNNYEVDPSYFE-- 357
VN +L PR++ +PL G F + Y + +++ E +L LN++EV + FE
Sbjct: 325 VNEELFPRLLDFNYPLAGHSFIFLDTYLRELQRQKKTISEKALELALNHFEVSGTEFEKA 384
Query: 358 ------EKYNMPVGER-RNVGSSTGENGKNKN 382
+K +PV + + V EN N N
Sbjct: 385 TGIKVKKKATLPVDQNVKAVVKKNAENDTNTN 416
>gi|124002986|ref|ZP_01687837.1| hypothetical protein M23134_07451 [Microscilla marina ATCC 23134]
gi|123991636|gb|EAY31044.1| hypothetical protein M23134_07451 [Microscilla marina ATCC 23134]
Length = 616
Score = 111 bits (277), Expect = 9e-23, Method: Composition-based stats.
Identities = 96/392 (24%), Positives = 177/392 (45%), Gaps = 24/392 (6%)
Query: 12 RKDIGDWRIAWQMAINVDNPNRQRLYDIYRDVEVDLHLSGCIQQREGFVLSRSFKLVNEK 71
R I DW AW++A + P L ++Y V +D + I R+ VL F +++ +
Sbjct: 28 RVTIADWIQAWRIAESPVMPRTYPLMEVYDMVCIDAEVITAINNRKNKVLGEPFLVLDSE 87
Query: 72 GDEDTDATKYFNTAWFKRLMKFALDANYWGHSLIELGELTTDANNRLCYDGVTLIPRKHV 131
+ ATK +WF + LDA ++G+SLIE+G+ D + + + + R+++
Sbjct: 88 SNYQEAATKKLRGSWFDDFINAVLDAEFYGYSLIEIGQTLNDGDYSI--KEINTVERRNI 145
Query: 132 IPEYGRLVANLGDDWKSGKDYRRTPFTEWLIEV-GQPDGLGLYLKAATQTIPKKNTLAFW 190
IP G ++ ++ G D+ W I + LGL L AA I ++ L
Sbjct: 146 IPGRGLVLPYPFSAYQEGIDFSAEYLKRWYIFCHSKRFPLGLLLYAAPYAILSRDALVKH 205
Query: 191 DTFAEIFGMPMRIARTTTRDKKEMAEMEKMMSDMGTEGWAIFQQGTEIEVV-ESKKGDAF 249
F +G P +I T ++ +E E+ + ++ E +F ++E+ S G
Sbjct: 206 AEFNRDYGKPYKIVH-TDKEGEERREVLDALLNVEEELNGVFDIEEKVEIAFPSGGGGTV 264
Query: 250 NVYDRRIDRANSELSKLIIGQTMTIEDGSSLSQSETHLEVF-----QNLVEADCDTIRDV 304
+ +D+ ++A+ +L K+I G D +Q+ + + + + E D + V
Sbjct: 265 DTFDKMDEKASKKLLKIIQGHVKLSNDTDGGAQTYMNKDSIAKTPSEEIREYDMARVESV 324
Query: 305 VNNQLIPRMIQHGFPLQGIHFDW--DYSEDYTPEQQVAYE---QLVLNNYEVDPSYFE-- 357
VN +L PR++ +PL G F + Y + +++ E +L L+++EV + FE
Sbjct: 325 VNEELFPRLLDFNYPLAGHSFIFLDTYLRELQRQKKSISEKALELALSHFEVSTAEFEKA 384
Query: 358 ------EKYNMPVGER-RNVGSSTGENGKNKN 382
+K +PV + + V EN N N
Sbjct: 385 TGIKVKKKATLPVDQNMKAVVKKNAENDTNTN 416
>gi|124004358|ref|ZP_01689204.1| hypothetical protein M23134_05800 [Microscilla marina ATCC 23134]
gi|123990428|gb|EAY29927.1| hypothetical protein M23134_05800 [Microscilla marina ATCC 23134]
Length = 619
Score = 110 bits (276), Expect = 1e-22, Method: Composition-based stats.
Identities = 92/368 (25%), Positives = 167/368 (45%), Gaps = 16/368 (4%)
Query: 12 RKDIGDWRIAWQMAINVDNPNRQRLYDIYRDVEVDLHLSGCIQQREGFVLSRSFKLVNEK 71
R I DW AW++A + P L +IY V +D + I R+ VL F +++
Sbjct: 22 RVTIADWMQAWRIAESPVMPRTYPLMEIYDMVCIDAEVITAINNRKNKVLGEPFLVLDSD 81
Query: 72 GDEDTDATKYFNTAWFKRLMKFALDANYWGHSLIELGELTTDANNRLCYDGVTLIPRKHV 131
G+ ATK +WF + LDA ++G+SLIE+G+ D ++ + + R+++
Sbjct: 82 GNYQEAATKKLRGSWFDDFINAVLDAEFFGYSLIEIGQ---DLHDNYSIKEINTVERRNI 138
Query: 132 IPEYGRLVANLGDDWKSGKDYRRTPFTEWLIEV-GQPDGLGLYLKAATQTIPKKNTLAFW 190
IP ++ + G D+ W I + LGL L AA I ++ L
Sbjct: 139 IPGRKLVLPYPFSAYGEGIDFSADYLKRWYIFCHSKRFPLGLLLYAAPYAILSRDALIKH 198
Query: 191 DTFAEIFGMPMRIARTTTRDKKEMAEMEKMMSDMGTEGWAIFQQGTEIEVV-ESKKGDAF 249
F +G P +I T ++ +E E+ + ++ E +F ++EV S G
Sbjct: 199 ADFNRDYGKPYKIVH-TDKEGEERREVLDALLNVEEELNGVFDIEEKVEVAFPSGGGGTI 257
Query: 250 NVYDRRIDRANSELSKLIIGQTMTIEDGSSLSQSETHLEVF-----QNLVEADCDTIRDV 304
+ +D+ ++A+ +L K+I G D +Q+ + + + + E D + V
Sbjct: 258 DTFDKMDEKASKKLLKIIQGHVKLSNDTEGGAQTYMNKDSIAKTPSEEIREYDMARVESV 317
Query: 305 VNNQLIPRMIQHGFPLQG---IHFDWDYSEDYTPEQQVAYE--QLVLNNYEVDPSYFEEK 359
VN +L PR++ +PL G I D E ++ ++ L LN++EV P+ F++
Sbjct: 318 VNEELFPRLLDFNYPLAGHSFIFLDTHLRELQRQKKSISETALTLALNHFEVSPAEFKKA 377
Query: 360 YNMPVGER 367
+ V ++
Sbjct: 378 TGIKVKKK 385
>gi|124008725|ref|ZP_01693415.1| hypothetical protein M23134_07325 [Microscilla marina ATCC 23134]
gi|123985801|gb|EAY25674.1| hypothetical protein M23134_07325 [Microscilla marina ATCC 23134]
Length = 620
Score = 110 bits (275), Expect = 2e-22, Method: Composition-based stats.
Identities = 96/392 (24%), Positives = 172/392 (43%), Gaps = 24/392 (6%)
Query: 12 RKDIGDWRIAWQMAINVDNPNRQRLYDIYRDVEVDLHLSGCIQQREGFVLSRSFKLVNEK 71
R I DW AW++A + P L +IY V +D + I R+ VL F +++
Sbjct: 31 RVTIADWMQAWRIAESPVMPRTYPLMEIYDMVCIDAEVITAINNRKNKVLGEPFLVLDPD 90
Query: 72 GDEDTDATKYFNTAWFKRLMKFALDANYWGHSLIELGELTTDANNRLCYDGVTLIPRKHV 131
G+ ATK +WF + LDA ++G+SLIE+G+ + + + + + R+++
Sbjct: 91 GNYQEAATKKLRGSWFDDFINAVLDAEFFGYSLIEIGQALNEDDYSI--KEINTVERRNI 148
Query: 132 IPEYGRLVANLGDDWKSGKDYRRTPFTEWLIEV-GQPDGLGLYLKAATQTIPKKNTLAFW 190
IP ++ + G D+ W I + LGL L AA I ++ L
Sbjct: 149 IPSRKLVLPYPFSAYGEGIDFSADYLRRWYIFCHSKRFPLGLLLYAAPYAILSRDALIKH 208
Query: 191 DTFAEIFGMPMRIARTTTRDKKEMAEMEKMMSDMGTEGWAIFQQGTEIEVV-ESKKGDAF 249
F +G P +I T ++ +E E+ + + E +F ++E+ S G
Sbjct: 209 ADFNRDYGKPYKIVH-TDKEGEERREVLDALLSVEEELNGVFDIEEKVEIAFPSGGGGTI 267
Query: 250 NVYDRRIDRANSELSKLIIGQTMTIEDGSSLSQSETHLEVF-----QNLVEADCDTIRDV 304
+ +D+ ++A+ +L K+I G D +Q+ + + + + E D + V
Sbjct: 268 DTFDKMDEKASKKLLKIIQGHVKLSNDTEGGAQTYMNKDSIAKTPSEEIREYDMARVEAV 327
Query: 305 VNNQLIPRMIQHGFPLQGIHF---DWDYSEDYTPEQQVAYE--QLVLNNYEVDPSYFEEK 359
VN +L PR++ +PL G F D E ++ ++ L LN++EV P+ FE+
Sbjct: 328 VNEELFPRLLDFNYPLAGHSFVFLDTHLRELQRQKKSISETALTLALNHFEVSPAEFEKA 387
Query: 360 YNMPVGER---------RNVGSSTGENGKNKN 382
+ V ++ + V EN N N
Sbjct: 388 TGIKVKKKVTLTADQNVKEVAKKGAENDTNTN 419
>gi|124009886|ref|ZP_01694553.1| hypothetical protein M23134_06475 [Microscilla marina ATCC 23134]
gi|123984122|gb|EAY24488.1| hypothetical protein M23134_06475 [Microscilla marina ATCC 23134]
Length = 617
Score = 108 bits (271), Expect = 5e-22, Method: Composition-based stats.
Identities = 96/392 (24%), Positives = 175/392 (44%), Gaps = 25/392 (6%)
Query: 12 RKDIGDWRIAWQMAINVDNPNRQRLYDIYRDVEVDLHLSGCIQQREGFVLSRSFKLVNEK 71
R I DW AW++A + P L ++Y V +D + I R+ VL F +++
Sbjct: 28 RVTIADWMQAWRIAESPVMPRTYPLMEVYDMVCIDAEVITAINNRKNKVLGEPFLVLDPD 87
Query: 72 GDEDTDATKYFNTAWFKRLMKFALDANYWGHSLIELGELTTDANNRLCYDGVTLIPRKHV 131
G+ ATK +WF + LDA ++G+SLIE+G+ D ++ + + R+++
Sbjct: 88 GNYQEVATKKLRGSWFDDFINAVLDAEFFGYSLIEIGQ---DLHDNYSIKEINTVERRNI 144
Query: 132 IPEYGRLVANLGDDWKSGKDYRRTPFTEWLIEV-GQPDGLGLYLKAATQTIPKKNTLAFW 190
IP ++ ++ G D+ W I + LGL L AA I ++ L
Sbjct: 145 IPSRKLVLPYPFSAYQEGIDFSADYLKRWYIFCHSKRFPLGLLLYAAPYAILSRDALVKH 204
Query: 191 DTFAEIFGMPMRIARTTTRDKKEMAEMEKMMSDMGTEGWAIFQQGTEIEVV-ESKKGDAF 249
F +G P +I T ++ +E E+ + ++ E +F ++EV S G
Sbjct: 205 AEFNRDYGKPYKIVH-TDKEGEERREVLDALLNVEEELNGVFDIEEKVEVAFPSGGGGTV 263
Query: 250 NVYDRRIDRANSELSKLIIGQTMTIEDGSSLSQSETHLEVF-----QNLVEADCDTIRDV 304
+ +D+ ++A+ +L K+I G D +Q+ + + + + E D + V
Sbjct: 264 DTFDKMDEKASKKLLKIIQGHVKLSNDTDGGAQTYMNKDSIAKTPSEEIREYDMARVESV 323
Query: 305 VNNQLIPRMIQHGFPLQGIHFDW--DYSEDYTPEQQVAYE---QLVLNNYEVDPSYFE-- 357
VN +L PR++ +P G F + Y + +++ E +L L+++EV + FE
Sbjct: 324 VNEELFPRLLDFNYPFTGHSFIFLDTYLRELQRQKKSISEKALELALSHFEVSTAEFEKA 383
Query: 358 ------EKYNMPVGER-RNVGSSTGENGKNKN 382
+K +PV + + V EN N N
Sbjct: 384 TGIKVKKKATLPVDQNMKAVVKKNAENDTNTN 415
>gi|116749310|ref|YP_845997.1| protein of unknown function DUF935 [Syntrophobacter fumaroxidans
MPOB]
gi|116698374|gb|ABK17562.1| protein of unknown function DUF935 [Syntrophobacter fumaroxidans
MPOB]
Length = 582
Score = 85.1 bits (209), Expect = 9e-15, Method: Composition-based stats.
Identities = 55/210 (26%), Positives = 100/210 (47%), Gaps = 23/210 (10%)
Query: 171 GLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRIAR-TTTRDKKEMAEMEKMMSDMGTEGW 229
GL A + K ++ W FAEI+GMP+RI R + + + + + +GT+
Sbjct: 206 GLMRVCAWMYLFKNYSIKDWVAFAEIYGMPLRIGRYEPGASRMDREALVQAVRSLGTDAA 265
Query: 230 AIFQQGTEIEVVESKKGDAFNVYDRRIDRANSELSKLIIGQTMTIEDGS-----SLSQSE 284
+ + TEIE +E+++ + NVY+ +++ SK ++GQT+T E GS S +
Sbjct: 266 GVISKSTEIEFIETQRSSSQNVYESLARFCDAQTSKAVLGQTLTSEAGSPGGTGSYALGR 325
Query: 285 THLEVFQNLVEADCDTIRDVVNNQLIPRMIQHGFPLQGIHFDWD-----YSEDYTPEQQV 339
H EV +LV ADC + + QL+ PL G ++ WD + + P + +
Sbjct: 326 VHGEVRHDLVAADCRALGKTITQQLLG-------PLVGFNYGWDAPVPRFGFLFEPPEDL 378
Query: 340 -----AYEQLVLNNYEVDPSYFEEKYNMPV 364
Y L +++ + +++ +PV
Sbjct: 379 RAAAETYRILAEMGFDLSQEHLSQRFKVPV 408
>gi|114707485|ref|ZP_01440381.1| hypothetical protein FP2506_11467 [Fulvimarina pelagi HTCC2506]
gi|114537044|gb|EAU40172.1| hypothetical protein FP2506_11467 [Fulvimarina pelagi HTCC2506]
Length = 541
Score = 77.4 bits (189), Expect = 2e-12, Method: Composition-based stats.
Identities = 46/158 (29%), Positives = 83/158 (52%), Gaps = 7/158 (4%)
Query: 171 GLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRI----ARTTTRDKKEMAEMEKMMSDMGT 226
G+ AA + + +L W FAEI+G+P+R+ A + +DK+ + + + D+
Sbjct: 208 GMARPAAWAFLIQSYSLKDWAAFAEIYGVPLRVGKYHANASDKDKRSLL---RAVRDISN 264
Query: 227 EGWAIFQQGTEIEVVESKKGDAFNVYDRRIDRANSELSKLIIGQTMTIEDGSSLSQSETH 286
+ AI G E E + + +V+ +D + ++SKLI+GQTMT +DGSS +Q+ H
Sbjct: 265 DAAAIIPMGMEFEFAKVEGQHGESVFGGLLDYCDKQVSKLILGQTMTSDDGSSQAQANVH 324
Query: 287 LEVFQNLVEADCDTIRDVVNNQLIPRMIQHGFPLQGIH 324
+V ++++AD +N LI + F Q ++
Sbjct: 325 NDVRLDILKADAKQQAQTINRDLIRPFVIMNFGEQAVY 362
>gi|91789628|ref|YP_550580.1| protein of unknown function DUF935 [Polaromonas sp. JS666]
gi|91698853|gb|ABE45682.1| protein of unknown function DUF935 [Polaromonas sp. JS666]
Length = 537
Score = 72.8 bits (177), Expect = 4e-11, Method: Composition-based stats.
Identities = 79/346 (22%), Positives = 144/346 (41%), Gaps = 32/346 (9%)
Query: 37 YDIYRDVEVDLHLSGCIQQREGFVLSRSFKLVNEKGDEDTDATKY------FNTAWFKRL 90
Y IY D+E D H + +R+ V SR + ++ + D F F +
Sbjct: 60 YRIYDDLERDAHCGAVLDKRKMAVTSRPW-VIKPASEAPLDVAAADLVRLAFGKLRFNGV 118
Query: 91 MKFALDANYWGHSLIELGELTTDANNRLCYDGVTLIPRKHVIPEYGRLVANLGDDWK--- 147
K LDA G S+ E+ D ++P K + + R NL +
Sbjct: 119 CKRMLDATLKGFSVGEVMWAVVDG---------MVLPVKVINRDQRRFTFNLQGQLRLKT 169
Query: 148 --SGKDYRRTPFTEWLIEV-----GQPDGLGLYLKAATQTIPKKNTLAFWDTFAEIFGMP 200
+ D P ++++ G P GLGL K KK + FW TFA+ FG P
Sbjct: 170 KENMMDGIAVPERKFIVHTYGGKDGTPYGLGLGSKLFWPVFFKKQGIGFWLTFADKFGSP 229
Query: 201 MRIARTTTRD-KKEMAEMEKMMSDMGTEGWAIFQQGTEIEVVESKKGDAFNVYDRRIDRA 259
+ + + E ++ + M + I G E+E++E+ + + + Y++ I
Sbjct: 230 TALGKYPNGTLQPEQNKLLSALRAMSQDAGVIIPAGMEVELIEASRSGSVDTYEKLIRYM 289
Query: 260 NSELSKLIIGQTMTIEDGSS---LSQSETHLEVFQNLVEADCDTIRDVVNNQLIPRMIQH 316
+ ++SK ++G+TM+ S+ +Q+ H +V L + D D + + + + LI ++ +
Sbjct: 290 DEQISKAVLGETMSTTAASTGLGSNQAGVHNDVRIELAQDDNDDLCETLGSTLIAWLVFY 349
Query: 317 GFPLQGI-HFDWDYSEDYTPEQQVAYEQLVLN-NYEVDPSYFEEKY 360
P G+ + ++ E Q+ ++ V YE Y E Y
Sbjct: 350 NMPGAGLPTLEREFEEPEDLAQRATRDKTVGELGYEPTEEYILETY 395
>gi|149201895|ref|ZP_01878869.1| hypothetical protein RTM1035_05125 [Roseovarius sp. TM1035]
gi|149144943|gb|EDM32972.1| hypothetical protein RTM1035_05125 [Roseovarius sp. TM1035]
Length = 516
Score = 69.3 bits (168), Expect = 5e-10, Method: Composition-based stats.
Identities = 54/192 (28%), Positives = 89/192 (46%), Gaps = 10/192 (5%)
Query: 156 PFTEWLIEVG-----QPDGLGLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRIARTTTRD 210
P W+ G +P GLGL K+N L W + FGMP + +
Sbjct: 175 PEKFWVFSTGADHDDEPYGLGLAHWLYWPVWFKRNGLKLWLIALDKFGMPTARGKYPAQA 234
Query: 211 KKEMAE--MEKMMSDMGTEGWAIFQQGTEIEVVESKKGDAFNVYDRRIDRANSELSKLII 268
KE + +E +M+ + +E I G EI+++ + G + Y + D ++ +SK+++
Sbjct: 235 TKEEQKTLLEAVMA-IRSEAGIIIPDGMEIDLLAAPSGASSLDYQKLHDTMDAAISKIVL 293
Query: 269 GQTMTIEDGSSLSQSETHLEVFQNLVEADCDTIRDVVNNQLIPRMIQHGFPLQGIHFDWD 328
QTMT ++GSS SQ+E H +V + ++D D + N + R+ + FP W
Sbjct: 294 SQTMTTDNGSSRSQAEVHDDVGDAVKKSDADLVCQSFNEGPVARLCEFNFPGVAPPRVWR 353
Query: 329 YSEDYTPEQQVA 340
ED PE A
Sbjct: 354 KMED--PEDTTA 363
>gi|87120002|ref|ZP_01075898.1| hypothetical protein MED121_02140 [Marinomonas sp. MED121]
gi|86164704|gb|EAQ65973.1| hypothetical protein MED121_02140 [Marinomonas sp. MED121]
Length = 520
Score = 68.6 bits (166), Expect = 7e-10, Method: Composition-based stats.
Identities = 43/144 (29%), Positives = 78/144 (54%), Gaps = 6/144 (4%)
Query: 171 GLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRIAR-TTTRDKKEMAEMEKMMSDMGTEGW 229
GL A + K TL W +FAE+FG+P+R+ + K ++A + + + ++
Sbjct: 207 GLARLVAVMYMLKSYTLRDWWSFAEVFGIPIRVGKYGPNATKDDIATLVNAIGKIASDAG 266
Query: 230 AIFQQGTEIEVVES---KKGDAFNVYDRRIDRANSELSKLIIGQTMTIEDGSSLSQSETH 286
A ++E++ES K GD +++ + ++SK ++GQTMT ++GSS SQ+E H
Sbjct: 267 AAIPDSMKLELIESGASKGGDT--LFENMARWCDEQISKAVLGQTMTSDNGSSRSQAEVH 324
Query: 287 LEVFQNLVEADCDTIRDVVNNQLI 310
V ++ + D + +N+QLI
Sbjct: 325 DGVRTDIAKWDARQLEATINDQLI 348
>gi|85858389|ref|YP_460591.1| mu-like prophage Flumu protein gp29 [Syntrophus aciditrophicus SB]
gi|85721480|gb|ABC76423.1| mu-like prophage Flumu protein gp29 [Syntrophus aciditrophicus SB]
Length = 530
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 47/151 (31%), Positives = 76/151 (50%), Gaps = 14/151 (9%)
Query: 183 KKNTLAFWDTFAEIFGMPMRIAR---TTTRDKKEMAEMEKMMSDMGTEGWAIFQQGTEIE 239
K L W FAE+FGMP+R+ + RD+K+ + + +G++ I + TEI+
Sbjct: 224 KNYVLKDWVAFAEVFGMPLRLGKYDPLAGRDEKD--ALLAAVRSLGSDAAGIISKNTEID 281
Query: 240 VVESKKGD-AFNVYDRRIDRANSELSKLIIGQTMTIEDG-SSLSQSETHLEVFQNLVEAD 297
+ES K A N+YD + + ++SK I+GQT T E L + +V +LV +D
Sbjct: 282 FIESAKATGADNLYDALANFCDKQMSKAILGQTATTEGTPGRLGNDDVQDKVRHDLVVSD 341
Query: 298 CDTIRDVVNNQLIPRMIQHGFPLQGIHFDWD 328
+ + + Q+I PL G +F WD
Sbjct: 342 AEALCRTIRFQIIR-------PLVGWNFGWD 365
>gi|24374221|ref|NP_718264.1| hypothetical protein SO_2680 [Shewanella oneidensis MR-1]
gi|24348737|gb|AAN55708.1|AE015707_20 conserved hypothetical protein [Shewanella oneidensis MR-1]
Length = 521
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 38/142 (26%), Positives = 77/142 (54%), Gaps = 2/142 (1%)
Query: 171 GLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRIAR-TTTRDKKEMAEMEKMMSDMGTEGW 229
GL A + K T+ W FAE+FG+P+R+ + + +++ + + + ++
Sbjct: 208 GLARLVAVMYMLKSFTVRDWWAFAEVFGVPVRVGKYGANASESDISTLINAIGRIASDAG 267
Query: 230 AIFQQGTEIEVVESKKGDAFNVYDRRIDR-ANSELSKLIIGQTMTIEDGSSLSQSETHLE 288
A+ + ++E++E+ KG+ + + R + ++SK ++GQTMT ++GSS SQ+ H E
Sbjct: 268 AVIPESMKLELIETAKGNGGDTLFENMARWCDEQISKAVLGQTMTADNGSSQSQATVHNE 327
Query: 289 VFQNLVEADCDTIRDVVNNQLI 310
V ++ + D + +N LI
Sbjct: 328 VRIDIAKWDARQLEASINEYLI 349
>gi|121587537|ref|ZP_01677304.1| conserved hypothetical protein [Vibrio cholerae 2740-80]
gi|121548268|gb|EAX58336.1| conserved hypothetical protein [Vibrio cholerae 2740-80]
Length = 519
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 37/142 (26%), Positives = 77/142 (54%), Gaps = 2/142 (1%)
Query: 171 GLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRIAR-TTTRDKKEMAEMEKMMSDMGTEGW 229
GL A + K T+ W FAE+FG+P+R+ + + +++ + + + ++
Sbjct: 208 GLARLVAVMYMLKSFTVRDWWAFAEVFGIPVRVGKYGANASEGDISTLINAIGRIASDAG 267
Query: 230 AIFQQGTEIEVVESKKGDAFNVYDRRIDR-ANSELSKLIIGQTMTIEDGSSLSQSETHLE 288
A+ + +I+++E+ KG+ + + R + ++SK ++GQTMT ++GSS SQ+ H E
Sbjct: 268 AVIPESMKIDLIETAKGNGGDTLFENMARWCDEQISKAVLGQTMTADNGSSQSQANIHNE 327
Query: 289 VFQNLVEADCDTIRDVVNNQLI 310
V ++ + D + +N L+
Sbjct: 328 VRIDIAKWDARQLESCINEYLV 349
>gi|85703723|ref|ZP_01034827.1| hypothetical protein ROS217_23317 [Roseovarius sp. 217]
gi|85672651|gb|EAQ27508.1| hypothetical protein ROS217_23317 [Roseovarius sp. 217]
Length = 497
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 79/335 (23%), Positives = 135/335 (40%), Gaps = 56/335 (16%)
Query: 37 YDIYRDVEVDLHLSGCIQQREGFVLSRSFKLVNEKGDEDTDATKYFNTAW---------F 87
+DIY+ + D + + QR V SR +++V EDT A K W F
Sbjct: 54 FDIYKPILTDPQVKSVMTQRISAVTSREWEVVP---GEDTAAGKR-AADWLREELAALKF 109
Query: 88 KRLMK----------------FALDANYWGHSLIELGELTTDANNRLCYDGVTLIPRKHV 131
RL + F D WG I + + + C G+ L+ ++
Sbjct: 110 DRLTEKMLWGLFYGYSVAEQMFRRDGQLWGWEEIRVRDRVRFRFDEEC--GLRLLTMSNM 167
Query: 132 IPEYGRLVANLGDDWKSGKDYRRTPFTEWLIEVG-----QPDGLGLYLKAATQTIPKKNT 186
+ +G++ F W+ G +P GLGL K+N
Sbjct: 168 L---------------TGEEMPPEKF--WVFSTGADHDDEPYGLGLAHWLYWPVWFKRNG 210
Query: 187 LAFWDTFAEIFGMPMRIARTTTR-DKKEMAEMEKMMSDMGTEGWAIFQQGTEIEVVESKK 245
L W + FGMP + ++ +E + + + + +E I G +IE++ +
Sbjct: 211 LKLWLIALDKFGMPTARGKYPSQATAEEQKTLLEAVLAIRSEAGIIIPDGMDIELLSAPA 270
Query: 246 GDAFNVYDRRIDRANSELSKLIIGQTMTIEDGSSLSQSETHLEVFQNLVEADCDTIRDVV 305
G + Y + D ++ +SK+++ QTMT ++GSS SQ+E H +V + ++D D +
Sbjct: 271 GASSLDYQKLHDTMDAAISKIVLSQTMTTDNGSSRSQAEVHDDVGDAVKKSDADLVCQSF 330
Query: 306 NNQLIPRMIQHGFPLQGIHFDWDYSEDYTPEQQVA 340
N + R+ + FP W ED PE A
Sbjct: 331 NEGPVARLCEFNFPGVAPPRVWRKMED--PEDTTA 363
>gi|48696939|ref|YP_024702.1| gp29 [Burkholderia phage BcepMu]
gi|44829502|gb|AAS47869.1| gp29 [Burkholderia cenocepacia phage BcepMu]
Length = 491
Score = 65.5 bits (158), Expect = 6e-09, Method: Composition-based stats.
Identities = 86/351 (24%), Positives = 152/351 (43%), Gaps = 41/351 (11%)
Query: 39 IYRDVEVDLHLSGCIQQREGFVLSRSFKLVNEKGDEDTDATK----YFNTAWFKRLMKFA 94
+YR++ D H+ GC+++R+ V + + L ++G + K F R++
Sbjct: 58 VYRELRADAHVGGCVRRRKAAVKALEWGL--DRGKAKSRVAKSIADVFADLDLSRIVTEM 115
Query: 95 LDANYWGHSLIEL--GELTTDANNRLCYDGVTLIPRKHVIPEYGRLVANLGDDWKSGKDY 152
LDA +G+ +E+ G++ N + D V V +L D W G++
Sbjct: 116 LDAVLYGYQPMEITWGKV---GNYIVPIDVVGKPADWFVYDPENQLRFRSKDHWMQGEEL 172
Query: 153 RRTPF------TEWLIEVGQPDGLGLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRIA-- 204
F +L G PD L + T KK L FW F E +G PM +
Sbjct: 173 PARKFLVPRQEATYLNPYGFPD-LSMCFWPTTF---KKGGLKFWVQFTEKYGSPMLVGKH 228
Query: 205 -RTTTRDKKEMAEMEKMMSDMGTEGWAIFQQGTEIEVVES--KKGDAFNVYDRRIDRANS 261
R+ + +K + + + DM + A+ + IE+ E+ K G A +VY+R +
Sbjct: 229 PRSASDGEKNL--LLDCLEDMVQDAVAVVPDDSSIEIKEAAGKTGSA-DVYERLLHFCRG 285
Query: 262 ELSKLIIGQTMTIEDGSSLSQSETHLEVFQNLVEADCDTIRDVVNNQLIPRMIQHGF--- 318
E+S ++GQ T E S+ + ++ LEV ++ + D + + + N LI + F
Sbjct: 286 EVSIALLGQNQTTEATSTRASAQAGLEVTDDIRDGDKAVVSEAM-NMLIRWICDLNFDGA 344
Query: 319 --PLQGIHFDWDYSEDYTPEQQVAYEQ-LVLNNYEVDPSYFEEKYNMPVGE 366
P+ FD + ++ E Q +Q L P+YF+ YN+ G+
Sbjct: 345 DRPV----FDM-WEQEQVDEIQAGRDQKLTQAGARFTPAYFKRAYNLQDGD 390
>gi|116221405|ref|ZP_01486813.1| hypothetical protein VchoV5_02000539 [Vibrio cholerae V51]
gi|125621731|gb|EAZ50059.1| conserved hypothetical protein [Vibrio cholerae V51]
Length = 519
Score = 65.5 bits (158), Expect = 6e-09, Method: Composition-based stats.
Identities = 37/142 (26%), Positives = 77/142 (54%), Gaps = 2/142 (1%)
Query: 171 GLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRIAR-TTTRDKKEMAEMEKMMSDMGTEGW 229
GL A + K T+ W FAE+FG+P+R+ + + +++ + + + ++
Sbjct: 208 GLARLVAVMYMLKSFTVRDWWAFAEVFGIPVRVGKYGANASEGDISTLINAIGRIASDAG 267
Query: 230 AIFQQGTEIEVVESKKGDAFNVYDRRIDR-ANSELSKLIIGQTMTIEDGSSLSQSETHLE 288
A+ + +I+++E+ KG+ + + R + ++SK ++GQTMT ++GSS SQ+ H E
Sbjct: 268 AVIPESMKIDLLETAKGNGGDTLFENMARWCDEQISKAVLGQTMTADNGSSQSQANIHNE 327
Query: 289 VFQNLVEADCDTIRDVVNNQLI 310
V ++ + D + +N L+
Sbjct: 328 VRIDIAKWDARQLESCINEYLV 349
>gi|147673735|ref|YP_001216624.1| hypothetical protein VC0395_A0674 [Vibrio cholerae O395]
gi|147674836|ref|YP_001216713.1| hypothetical protein VC0395_A0763 [Vibrio cholerae O395]
gi|146315618|gb|ABQ20157.1| conserved hypothetical protein [Vibrio cholerae O395]
gi|146316719|gb|ABQ21258.1| conserved hypothetical protein [Vibrio cholerae O395]
Length = 519
Score = 64.7 bits (156), Expect = 1e-08, Method: Composition-based stats.
Identities = 37/142 (26%), Positives = 77/142 (54%), Gaps = 2/142 (1%)
Query: 171 GLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRIAR-TTTRDKKEMAEMEKMMSDMGTEGW 229
GL A + K T+ W FAE+FG+P+R+ + + +++ + + + ++
Sbjct: 208 GLARLVAVMYMLKSFTVRDWWAFAEVFGIPVRVGKYGANASEGDISTLINAIGRIASDAG 267
Query: 230 AIFQQGTEIEVVESKKGDAFNVYDRRIDR-ANSELSKLIIGQTMTIEDGSSLSQSETHLE 288
A+ + +I+++E+ KG+ + + R + ++SK ++GQTMT ++GSS SQ+ H E
Sbjct: 268 AVIPESMKIDLLETAKGNGGDTLFENMARWCDEQISKAVLGQTMTADNGSSQSQANVHNE 327
Query: 289 VFQNLVEADCDTIRDVVNNQLI 310
V ++ + D + +N L+
Sbjct: 328 VRIDIAKWDARQLESCINEFLV 349
>gi|134288841|ref|YP_001111203.1| gp3 [Burkholderia phage phiE255]
gi|134132177|gb|ABO60698.1| gp3 [Burkholderia phage phiE255]
Length = 491
Score = 63.9 bits (154), Expect = 2e-08, Method: Composition-based stats.
Identities = 84/348 (24%), Positives = 145/348 (41%), Gaps = 35/348 (10%)
Query: 39 IYRDVEVDLHLSGCIQQREGFVLSRSFKLVNEKGDEDTDATK----YFNTAWFKRLMKFA 94
+YR++ D H+ GC+++R+ V + + L ++G + K F R+
Sbjct: 58 VYRELRADAHVGGCVRRRKAAVKALEWGL--DRGKAKSRVAKSIADVFADLDLSRIATEM 115
Query: 95 LDANYWGHSLIEL--GELTTDANNRLCYDGVTLIPRKHVIPEYGRLVANLGDDWKSGKDY 152
LDA +G+ +E+ G++ N + D V V +L + W G++
Sbjct: 116 LDAVLYGYQPMEITWGKV---GNYIVPIDVVGKPADWFVYDPENQLRFRSKEHWVQGEEL 172
Query: 153 RRTPF------TEWLIEVGQPDGLGLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRIAR- 205
F +L G PD L + T KK L FW F E +G PM + +
Sbjct: 173 PARKFLVPRQEATYLNPYGFPD-LSMCFWPTTF---KKGGLKFWVQFTEKYGSPMLVGKH 228
Query: 206 TTTRDKKEMAEMEKMMSDMGTEGWAIFQQGTEIEVVES--KKGDAFNVYDRRIDRANSEL 263
+ E + + DM + A+ + IE+ E+ K G A +VY+R + E+
Sbjct: 229 PRSASDAETNLLLDRLEDMVQDAVAVIPDDSSIEIKEAAGKSGSA-DVYERLLHFCRGEV 287
Query: 264 SKLIIGQTMTIEDGSSLSQSETHLEVFQNLVEADCDTIRDVVNNQLIPRMIQHGF----- 318
S ++GQ T E S+ + ++ LEV ++ + D + + + N LI + F
Sbjct: 288 SIALLGQNQTTEATSTRASAQAGLEVTDDIRDGDKAIVVEAM-NMLIRWICDLNFDGAAR 346
Query: 319 PLQGIHFDWDYSEDYTPEQQVAYEQLVLNNYEVDPSYFEEKYNMPVGE 366
P+ FD E Q E+L P+YF+ YN+ G+
Sbjct: 347 PV----FDMWEQEQVDEIQAGRDEKLTRAGARFTPAYFKRAYNLQDGD 390
>gi|154248422|ref|YP_001419380.1| protein of unknown function DUF935 [Xanthobacter autotrophicus Py2]
gi|154162507|gb|ABS69723.1| protein of unknown function DUF935 [Xanthobacter autotrophicus Py2]
Length = 544
Score = 63.2 bits (152), Expect = 4e-08, Method: Composition-based stats.
Identities = 46/154 (29%), Positives = 77/154 (50%), Gaps = 3/154 (1%)
Query: 171 GLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRIARTTTRDKKEMAE-MEKMMSDMGTEGW 229
GL A+ + K TL W F + +GMP+R+ R E E + ++++ +
Sbjct: 206 GLARVASWAWMYKAFTLRDWALFCQNYGMPIRLGRYGAGALHEDKEVLWSAVANIAGDCA 265
Query: 230 AIFQQGTEIEVVESKKG-DAFNVYDRRIDRANSELSKLIIGQTMTIE-DGSSLSQSETHL 287
AI + IE VE K D +Y+ R ++++SKL++GQT T + + S + +TH
Sbjct: 266 AIIPESMRIEFVELKNAADGSKLYEARASWMDAQISKLVLGQTATTDANPGSHAAGQTHR 325
Query: 288 EVFQNLVEADCDTIRDVVNNQLIPRMIQHGFPLQ 321
V +++ AD I +N QLI +M+ F Q
Sbjct: 326 LVQEDMERADAKLIAATLNRQLIHQMVAFNFGPQ 359
>gi|34497594|ref|NP_901809.1| hypothetical protein CV_2139 [Chromobacterium violaceum ATCC 12472]
gi|34103450|gb|AAQ59812.1| conserved hypothetical protein [Chromobacterium violaceum ATCC
12472]
Length = 488
Score = 62.4 bits (150), Expect = 5e-08, Method: Composition-based stats.
Identities = 77/353 (21%), Positives = 145/353 (41%), Gaps = 45/353 (12%)
Query: 39 IYRDVEVDLHLSGCIQQREGFVLSRSFKLVNEKGDEDT--DATKYFNTAWFKRLMKFALD 96
+YR++ D H+ GCI++R+ V + + L ++ FN R++ LD
Sbjct: 59 VYRELRADAHVGGCIRRRKAAVKALEWGLDRDQAKSRVAKSVASIFNDLNLSRIIGEMLD 118
Query: 97 ANYWGHSLIEL--GELTTDANNRLCYDGVTLIPRKHVIPEYGRLVANLGDDWK-----SG 149
A +G+ +E+ G++ G L+P V + + G+ + +
Sbjct: 119 AMLYGYQPMEIMWGKV-----------GSYLVPVDIVGKPADWFIYDEGNQLRLRTKQAP 167
Query: 150 KDYRRTPFTEWLI-----EVGQPDGLGLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRIA 204
P ++L+ P G T KK L FW F E +G P +
Sbjct: 168 LKGEELPARKFLVPRQDASYDNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPWLVG 227
Query: 205 R-----TTTRDKKEMAEMEKMMSDMGTEGWAIFQQGTEIEVVESKKG-DAFNVYDRRIDR 258
+ +T + +E M+ D A+ + +E+ E+ G + +VY+R +
Sbjct: 228 KHPRSASTQETDLLLDSLEAMVQD----AVAVVPDDSSVEIKEAANGANNADVYERLLHF 283
Query: 259 ANSELSKLIIGQTMTIEDGSSLSQSETHLEVFQNLVEADCDTIRDVVNNQLIPRMIQHGF 318
SE+S ++GQ T E ++ + ++ LEV +++ + D + I + + NQL+ + + F
Sbjct: 284 CRSEVSIALLGQNQTTEASANRASAQAGLEVTRDIRDGDKEVIEEAL-NQLVRWVCELNF 342
Query: 319 -----PLQGIHFDWDYSEDYTPEQQVAYEQLVLNNYEVDPSYFEEKYNMPVGE 366
P F+ E Q E+L P+YF+ YN+ G+
Sbjct: 343 NGGDRP----RFEMWEQEQVDEVQAGRDEKLTRAGATFTPAYFKRAYNLQDGD 391
>gi|46579533|ref|YP_010341.1| hypothetical protein DVU1120 [Desulfovibrio vulgaris subsp.
vulgaris str. Hildenborough]
gi|46448948|gb|AAS95600.1| conserved hypothetical protein [Desulfovibrio vulgaris subsp.
vulgaris str. Hildenborough]
Length = 530
Score = 61.6 bits (148), Expect = 8e-08, Method: Composition-based stats.
Identities = 42/148 (28%), Positives = 79/148 (53%), Gaps = 13/148 (8%)
Query: 167 PDGLGLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRI----ARTTTRDKKEMA-EMEKMM 221
P GL L + KK + F+ TF E +GMP + AR + +K+ MA ++ +M+
Sbjct: 204 PYGLRLLSRCLWPVAFKKGGVTFYTTFLEKYGMPWAVGVAPARASRDEKRAMAADLARMV 263
Query: 222 SDMGTEGWAIFQQGTEIEVVESKKGDAFNVYDRRIDRANSELSKLIIGQTMTIE---DGS 278
D A+ G ++ + E+ G ++++ + R ++ +SK+++GQT+T+E G
Sbjct: 264 QD----AVAVIPHGADVRI-ETPTGQVGDLHEAFLRRWDASISKVLMGQTLTVELDGKGG 318
Query: 279 SLSQSETHLEVFQNLVEADCDTIRDVVN 306
S + S+TH +V ++ EAD + +N
Sbjct: 319 SRAASDTHKDVADDIAEADRRLVEAALN 346
>gi|16760410|ref|NP_456027.1| hypothetical protein STY1616 [Salmonella enterica subsp. enterica
serovar Typhi str. CT18]
gi|29141826|ref|NP_805168.1| hypothetical protein t1372 [Salmonella enterica subsp. enterica
serovar Typhi Ty2]
gi|25333419|pir||AE0686 hypothetical protein STY1616 [imported] - Salmonella enterica
subsp. enterica serovar Typhi (strain CT18)
gi|16502705|emb|CAD01862.1| hypothetical protein [Salmonella enterica subsp. enterica serovar
Typhi]
gi|29137454|gb|AAO69017.1| hypothetical protein t1372 [Salmonella enterica subsp. enterica
serovar Typhi Ty2]
Length = 498
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 58/204 (28%), Positives = 94/204 (46%), Gaps = 24/204 (11%)
Query: 183 KKNTLAFWDTFAEIFGMPMRIARTTTRDKKEMAEMEKM---MSDMGTEGWAIFQQGTEIE 239
KK + FW FAE FG P I + R + AE+EK+ M M + A + IE
Sbjct: 205 KKGGMKFWLRFAEKFGSPWVIGKHP-RGAND-AEIEKLLDSMEQMVEDAVAAIPDDSSIE 262
Query: 240 VVESK-KGDAFNVYDRRIDRANSELSKLIIGQTMTIEDGSSLSQSETHLEVFQNLVEADC 298
+ + K D+ V+ I + SE+S ++GQ T E S+ + ++ LEV ++ +AD
Sbjct: 263 LKAADGKADSSEVFRELITLSRSEISIALLGQNQTTEANSNKASAQAGLEVTADIRDADA 322
Query: 299 DTIRDVVNNQLIPRMIQHGF-----PLQGIHFDWDYSEDYTPEQQVAYEQLVLNNYEVDP 353
D I+ V NQ+I ++ F P+ + W+ E + E+L + P
Sbjct: 323 DIIQAAV-NQVIRTVVTLNFGDVPCPVWAM---WE-QETIDDTRATRDEKLTRAGLRLTP 377
Query: 354 SYFEEKYNM--------PVGERRN 369
YF+ +Y + P ER+N
Sbjct: 378 QYFKREYQLQDGDIDETPPSERQN 401
>gi|75512142|ref|ZP_00734714.1| COG4383: Mu-like prophage protein gp29 [Escherichia coli 53638]
Length = 498
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 55/193 (28%), Positives = 91/193 (47%), Gaps = 16/193 (8%)
Query: 183 KKNTLAFWDTFAEIFGMPMRIARTTTRDKKEMAEMEKM---MSDMGTEGWAIFQQGTEIE 239
KK + FW FAE FG P I + R + AE+EK+ M M + A + IE
Sbjct: 205 KKGGMKFWLRFAEKFGSPWVIGKHP-RGAND-AEIEKLLDSMEQMVEDAVAAIPDDSSIE 262
Query: 240 VVESK-KGDAFNVYDRRIDRANSELSKLIIGQTMTIEDGSSLSQSETHLEVFQNLVEADC 298
+ + K D+ V+ I + SE+S ++GQ T E S+ + ++ LEV ++ +AD
Sbjct: 263 LKAADGKADSSEVFRELITLSRSEISIALLGQNQTTEANSNKASAQAGLEVTADIRDADA 322
Query: 299 DTIRDVVNNQLIPRMIQHGF-----PLQGIHFDWDYSEDYTPEQQVAYEQLVLNNYEVDP 353
D I+ V NQ+I ++ F P+ + W+ E + E+L + P
Sbjct: 323 DIIQAAV-NQVIRTVVTLNFGDVPCPVWAM---WE-QEAIDDTRATRDEKLTRAGLRLTP 377
Query: 354 SYFEEKYNMPVGE 366
YF+ +Y + G+
Sbjct: 378 QYFKREYQLQDGD 390
>gi|83310863|ref|YP_421127.1| Mu-like prophage FluMu protein gp29 [Magnetospirillum magneticum
AMB-1]
gi|82945704|dbj|BAE50568.1| Mu-like prophage FluMu protein gp29 [Magnetospirillum magneticum
AMB-1]
Length = 534
Score = 59.7 bits (143), Expect = 3e-07, Method: Composition-based stats.
Identities = 41/162 (25%), Positives = 76/162 (46%), Gaps = 7/162 (4%)
Query: 165 GQPDGLGLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRIART-TTRDKKEMAEMEKMMSD 223
G P GL A + K + W F E+FG P+R+ + + + + A + + + +
Sbjct: 206 GLPIRGGLARAMAWLYLFKNFDIKAWVEFLEVFGQPIRLGKYHSGASEPDKATLLRAVRN 265
Query: 224 MGTEGWAIFQQGTEIEVVESKKGDAFNVYDRRIDRANSELSKLIIGQTMTIEDGSSLSQS 283
+ + AI +E++E+K V + A+ ++SK ++GQT T + G + +
Sbjct: 266 LHKDAAAIIPDSMMVELIEAKISGTITVQKEFAEWADKQVSKAVLGQTGTTDVGQHVGTA 325
Query: 284 ETHLEVFQNLVEADCDTIRDVVNNQLIPRMI------QHGFP 319
+ H +V ++ D D I D +N QL+ I QH +P
Sbjct: 326 KAHDKVKDDIERDDADQIGDTINAQLVRPFIDLNQGPQHRYP 367
>gi|75255742|ref|ZP_00727503.1| COG4383: Mu-like prophage protein gp29 [Escherichia coli E22]
Length = 497
Score = 59.7 bits (143), Expect = 3e-07, Method: Composition-based stats.
Identities = 55/193 (28%), Positives = 90/193 (46%), Gaps = 16/193 (8%)
Query: 183 KKNTLAFWDTFAEIFGMPMRIARTTTRDKKEMAEMEKM---MSDMGTEGWAIFQQGTEIE 239
KK + FW FAE FG P I + R + AE+EK+ M M + A + IE
Sbjct: 205 KKGGMKFWLRFAEKFGSPWVIGKHP-RGAND-AEIEKLLDSMEQMVEDAVAAIPDDSSIE 262
Query: 240 VVESK-KGDAFNVYDRRIDRANSELSKLIIGQTMTIEDGSSLSQSETHLEVFQNLVEADC 298
+ + K D+ V+ I + SE+S ++GQ T E S+ + ++ LEV ++ +AD
Sbjct: 263 LKAADGKADSSEVFRELITLSRSEISIALLGQNQTTEANSNKASAQAGLEVTADIRDADA 322
Query: 299 DTIRDVVNNQLIPRMIQHGF-----PLQGIHFDWDYSEDYTPEQQVAYEQLVLNNYEVDP 353
D I+ V NQ+I ++ F P+ + W+ E + E+L + P
Sbjct: 323 DIIQAAV-NQVIRTVVTLNFGDVPCPVWAM---WE-QETIDDTRATRDEKLTRAGLRLTP 377
Query: 354 SYFEEKYNMPVGE 366
YF +Y + G+
Sbjct: 378 QYFMREYQLQEGD 390
>gi|75228563|ref|ZP_00715197.1| COG4383: Mu-like prophage protein gp29 [Escherichia coli B7A]
Length = 497
Score = 59.7 bits (143), Expect = 4e-07, Method: Composition-based stats.
Identities = 55/193 (28%), Positives = 90/193 (46%), Gaps = 16/193 (8%)
Query: 183 KKNTLAFWDTFAEIFGMPMRIARTTTRDKKEMAEMEKM---MSDMGTEGWAIFQQGTEIE 239
KK + FW FAE FG P I + R + AE+EK+ M M + A + IE
Sbjct: 205 KKGGMKFWLRFAEKFGSPWVIGKHP-RGAND-AEIEKLLDSMEQMVEDAVAAIPDDSSIE 262
Query: 240 VVESK-KGDAFNVYDRRIDRANSELSKLIIGQTMTIEDGSSLSQSETHLEVFQNLVEADC 298
+ + K D+ V+ I + SE+S ++GQ T E S+ + ++ LEV ++ +AD
Sbjct: 263 LKAADGKADSSEVFRELITLSRSEISIALLGQNQTTEANSNKASAQAGLEVTADIRDADA 322
Query: 299 DTIRDVVNNQLIPRMIQHGF-----PLQGIHFDWDYSEDYTPEQQVAYEQLVLNNYEVDP 353
D I+ V NQ+I ++ F P+ + W+ E + E+L + P
Sbjct: 323 DIIQAAV-NQVIRTVVTLNFGDVPCPVWAM---WE-QEAIDDTRATRDEKLTRAGLRLTP 377
Query: 354 SYFEEKYNMPVGE 366
YF +Y + G+
Sbjct: 378 QYFMREYQLQEGD 390
>gi|146277346|ref|YP_001167505.1| protein of unknown function DUF935 [Rhodobacter sphaeroides ATCC
17025]
gi|146278138|ref|YP_001168297.1| protein of unknown function DUF935 [Rhodobacter sphaeroides ATCC
17025]
gi|145555587|gb|ABP70200.1| protein of unknown function DUF935 [Rhodobacter sphaeroides ATCC
17025]
gi|145556379|gb|ABP70992.1| protein of unknown function DUF935 [Rhodobacter sphaeroides ATCC
17025]
Length = 543
Score = 58.9 bits (141), Expect = 6e-07, Method: Composition-based stats.
Identities = 46/171 (26%), Positives = 83/171 (48%), Gaps = 11/171 (6%)
Query: 149 GKDYRRTPFTEWLIEVGQPDGL----GLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRIA 204
G++ TPF EV G+ GL AA + K T W F + +G P+R+
Sbjct: 180 GEELELTPFKFIFAEVKAKSGIALRSGLARAAAWAWMFKAFTQRDWAIFTQTYGQPLRLG 239
Query: 205 R---TTTRDKKEMAEMEKMMSDMGTEGWAIFQQGTEIEVVESKK-GDAFNVYDRRIDRAN 260
R + D K A + + ++++ + AI + I+ VE+K G ++Y +R D +
Sbjct: 240 RYGPGASEDDK--ATLFRAVANIAGDCAAIIPESMAIDFVETKSVGATADLYKQRADWLD 297
Query: 261 SELSKLIIGQTMTIEDGS-SLSQSETHLEVFQNLVEADCDTIRDVVNNQLI 310
++SK ++GQT T + + L + H +V +++ AD + ++N LI
Sbjct: 298 QQISKAVLGQTATTDAVTGGLGSGKEHRQVQEDIERADAKALSGILNRDLI 348
>gi|60099637|gb|AAX12941.1| hypothetical protein [Escherichia blattae]
Length = 523
Score = 58.9 bits (141), Expect = 6e-07, Method: Composition-based stats.
Identities = 39/162 (24%), Positives = 80/162 (49%), Gaps = 2/162 (1%)
Query: 162 IEVGQPDGLGLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRIARTTTRDKKEMAE-MEKM 220
++ G P GL A + K T+ W FAE FG+P+ I + +E + +
Sbjct: 207 LKSGLPLRNGLARLVAVMYMLKSFTVRDWWAFAEKFGIPIVIGKYGANASQEQIQTLIDA 266
Query: 221 MSDMGTEGWAIFQQGTEIEVVES-KKGDAFNVYDRRIDRANSELSKLIIGQTMTIEDGSS 279
++ + ++ Q ++E+ E+ + + ++ + +++ SK ++GQTMT +DGSS
Sbjct: 267 IASIASDAGCAIPQSMQLEMQETASRNNGGALFKEMAEWCDAQTSKAVLGQTMTTDDGSS 326
Query: 280 LSQSETHLEVFQNLVEADCDTIRDVVNNQLIPRMIQHGFPLQ 321
+Q++ H V ++ + D + + +N L+ IQ + Q
Sbjct: 327 RAQADVHDRVRMDIAKWDARQLENTLNEFLVRPFIQFNYGPQ 368
>gi|54654574|gb|AAV37152.1| hypothetical protein [Haemophilus influenzae biotype aegyptius]
Length = 471
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 70/321 (21%), Positives = 138/321 (42%), Gaps = 28/321 (8%)
Query: 40 YRDVEVDLHLSGCIQQREGFVLSRSFKLVNEKGDEDTDA--TKYFNTAWFKRLMKFALDA 97
YR+V D H++GC+++R+ + ++ + G+E TD T F +++ LDA
Sbjct: 44 YREVLSDSHVAGCVRRRKAAIKGLEWR-ITPTGNEKTDEILTALFERLPLPQIITEMLDA 102
Query: 98 NYWGHSLIEL-----------GELTTDANNRLCYDGVTLIPRKHVIPEYGRLVANLGDDW 146
+G+ ++E+ EL +D + + G+LV
Sbjct: 103 ALFGYQVLEVMWENQNGLWLPAELIGKPQEWFVFDEQNQLMLRDKENRNGKLVPEKKFLL 162
Query: 147 KSGKDYRRTPFTEWLIEVGQPDGLGLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRIART 206
+ + P+ G+PD L L AAT K+ FW F E +G P + +
Sbjct: 163 TTQEANYTNPY-------GKPD-LALCFWAATF---KRGGFKFWLEFTEKYGSPWLVGKH 211
Query: 207 TTRDKK-EMAEMEKMMSDMGTEGWAIFQQGTEIEVVES-KKGDAFNVYDRRIDRANSELS 264
+ + E+ ++ + M A + I ++ES KG + V+D + SE++
Sbjct: 212 PRQAQTHEIEDLLDSLEQMIGTAVAAIPDDSAISMLESASKGGSSQVFDEFLKYCKSEIA 271
Query: 265 KLIIGQTMTIEDGSSLSQSETHLEVFQNLVEADCDTIRDVVNNQLIPRMIQHGFPLQGIH 324
++GQ T E S+ + + LE+ + + D T+ + NQL+ + + F ++ +
Sbjct: 272 IALLGQNQTTEQESNRASATAGLELTKEIRNEDI-TLVESGFNQLLNWICELNFNVETLP 330
Query: 325 FDWDYSEDYTPEQQVAYEQLV 345
Y ++ + QV +Q++
Sbjct: 331 KFELYEQESIDKVQVERDQIL 351
>gi|37527315|ref|NP_930659.1| hypothetical protein plu3441 [Photorhabdus luminescens subsp.
laumondii TTO1]
gi|36786749|emb|CAE15815.1| unnamed protein product [Photorhabdus luminescens subsp. laumondii
TTO1]
Length = 497
Score = 56.2 bits (134), Expect = 4e-06, Method: Composition-based stats.
Identities = 37/129 (28%), Positives = 63/129 (48%), Gaps = 2/129 (1%)
Query: 183 KKNTLAFWDTFAEIFGMPMRIAR-TTTRDKKEMAEMEKMMSDMGTEGWAIFQQGTEIEVV 241
KK + FW FAE +G P I + + E+ + M M + A + +E+
Sbjct: 204 KKGGMKFWVRFAEKYGSPWVIGKHPRGTPQSEIDLLLDSMEAMIDDAVAAIPDDSSVEIK 263
Query: 242 ESK-KGDAFNVYDRRIDRANSELSKLIIGQTMTIEDGSSLSQSETHLEVFQNLVEADCDT 300
E+ K D+ ++Y I A SE+S ++GQ T E S+ + ++ LEV ++ +AD D
Sbjct: 264 EAAGKSDSSDIYQNLITVARSEISIALLGQNQTTEANSNKASAQAGLEVTDDIRDADADV 323
Query: 301 IRDVVNNQL 309
I +N +
Sbjct: 324 ITSAINQTI 332
>gi|32029859|ref|ZP_00132813.1| COG4383: Mu-like prophage protein gp29 [Haemophilus somnus 2336]
Length = 552
Score = 55.5 bits (132), Expect = 7e-06, Method: Composition-based stats.
Identities = 54/218 (24%), Positives = 100/218 (45%), Gaps = 32/218 (14%)
Query: 170 LGLYLKAATQTIPKKNTLAFWDTFAEIFGMPMRIAR----TTTRDKKEMAEMEKMMSDMG 225
LGLY A + K + + F E++GMP+RI + T +K+ + + ++ +G
Sbjct: 208 LGLYRTLAWHYMFKHYAVHDFAEFLELYGMPIRIGKYGAGATNEEKRTLL---RALAQIG 264
Query: 226 TEGWAIFQQGTEIEVVESKKGDAF-NVYDRRIDRANSELSKLIIGQTMTI-EDGSSLSQS 283
I + EIE+ + G + N + + I+ +E+++LI+GQT+T DG S + +
Sbjct: 265 HNAAGIMPESMEIELHNAANGSSVNNPFLQMIEWCRTEIARLILGQTLTSGADGKSSTNA 324
Query: 284 --ETHLEVFQNLVEADCDTIRDVVNNQLIPRMIQHGFPLQGIHFDWDYSEDYTPEQQVAY 341
+ H EV ++L+ AD I + Q+I P ++ D E P
Sbjct: 325 LGKVHNEVRRDLLIADAKQIAQTITKQII-------LPYLQLNVDPSIDESRCPR----- 372
Query: 342 EQLVLNNYEVDPSYFE--EKYNMPVGERRNVGSSTGEN 377
+E D + +E EK+ + + N+G + E+
Sbjct: 373 -------FEFDTAEYEDLEKFAKALPDLVNIGVAVPES 403
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.318 0.136 0.408
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,591,404,631
Number of Sequences: 5470121
Number of extensions: 69550411
Number of successful extensions: 162032
Number of sequences better than 1.0e-05: 42
Number of HSP's better than 0.0 without gapping: 9
Number of HSP's successfully gapped in prelim test: 33
Number of HSP's that attempted gapping in prelim test: 161950
Number of HSP's gapped (non-prelim): 53
length of query: 415
length of database: 1,894,087,724
effective HSP length: 135
effective length of query: 280
effective length of database: 1,155,621,389
effective search space: 323573988920
effective search space used: 323573988920
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 131 (55.1 bits)