BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= PI0509
(518 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|56696575|ref|YP_166932.1| hypothetical protein SPO1693 [... 247 2e-63
gi|120597810|ref|YP_962384.1| hypothetical protein Sputw318... 215 6e-54
gi|99078553|ref|YP_611811.1| hypothetical protein TM1040_35... 213 4e-53
gi|91790149|ref|YP_551101.1| hypothetical protein Bpro_4311... 206 2e-51
gi|67920891|ref|ZP_00514410.1| similar to Uncharacterized c... 179 4e-43
gi|118443910|ref|YP_877792.1| hypothetical protein NT01CX_1... 116 3e-24
gi|91772241|ref|YP_564933.1| hypothetical protein Mbur_0172... 100 2e-19
gi|83643137|ref|YP_431572.1| hypothetical protein HCH_00232... 98 2e-18
gi|121593250|ref|YP_985146.1| hypothetical protein Ajs_0828... 91 2e-16
gi|83944454|ref|ZP_00956907.1| hypothetical protein EE36_14... 91 3e-16
gi|88813619|ref|ZP_01128850.1| hypothetical protein NB231_1... 88 2e-15
gi|153872276|ref|ZP_02001215.1| conserved hypothetical prot... 83 5e-14
gi|84619226|emb|CAJ43152.1| hypothetical protein [Bacteriop... 80 3e-13
gi|67920890|ref|ZP_00514409.1| conserved hypothetical prote... 80 4e-13
gi|148263102|ref|YP_001229808.1| hypothetical protein Gura_... 79 1e-12
gi|150398874|ref|YP_001322641.1| hypothetical protein Mevan... 79 1e-12
gi|86750096|ref|YP_486592.1| hypothetical protein RPB_2979 ... 74 2e-11
gi|76261174|ref|ZP_00768793.1| conserved hypothetical prote... 74 3e-11
gi|88804596|ref|ZP_01120116.1| hypothetical protein RB2501_... 72 8e-11
gi|39995276|ref|NP_951227.1| hypothetical protein GSU0165 [... 72 9e-11
gi|53688205|ref|ZP_00109506.2| COG4938: Uncharacterized con... 71 1e-10
gi|83816599|ref|YP_445232.1| hypothetical protein SRU_1103 ... 69 5e-10
gi|118068861|ref|ZP_01537111.1| conserved hypothetical prot... 69 5e-10
gi|126463976|ref|YP_001045089.1| hypothetical protein Rsph1... 69 1e-09
gi|119489545|ref|ZP_01622306.1| hypothetical protein L8106_... 67 2e-09
gi|87122784|ref|ZP_01078657.1| hypothetical protein MED121_... 67 2e-09
gi|149203433|ref|ZP_01880403.1| hypothetical protein RTM103... 66 4e-09
gi|15600970|ref|NP_232600.1| hypothetical protein VCA0200 [... 65 2e-08
gi|75908284|ref|YP_322580.1| hypothetical protein Ava_2063 ... 63 4e-08
gi|154494816|ref|ZP_02033821.1| hypothetical protein PARMER... 62 9e-08
gi|114331937|ref|YP_748159.1| hypothetical protein Neut_196... 62 1e-07
gi|16799908|ref|NP_470176.1| hypothetical protein lin0834 [... 61 2e-07
gi|154247664|ref|YP_001418622.1| hypothetical protein Xaut_... 61 2e-07
gi|156974069|ref|YP_001444976.1| hypothetical protein VIBHA... 60 2e-07
gi|30249834|ref|NP_841904.1| hypothetical protein NE1882 [N... 60 3e-07
gi|116620799|ref|YP_822955.1| hypothetical protein Acid_168... 58 1e-06
gi|53713058|ref|YP_099050.1| hypothetical protein BF1769 [B... 58 1e-06
gi|113474043|ref|YP_720104.1| hypothetical protein Tery_013... 58 2e-06
gi|120554381|ref|YP_958732.1| hypothetical protein Maqu_146... 57 2e-06
gi|90579612|ref|ZP_01235421.1| hypothetical protein VAS14_0... 56 6e-06
>gi|56696575|ref|YP_166932.1| hypothetical protein SPO1693 [Silicibacter pomeroyi DSS-3]
gi|56678312|gb|AAV94978.1| conserved hypothetical protein [Silicibacter pomeroyi DSS-3]
Length = 531
Score = 247 bits (630), Expect = 2e-63, Method: Composition-based stats.
Identities = 172/535 (32%), Positives = 276/535 (51%), Gaps = 37/535 (6%)
Query: 7 IKNLRSIKDSGTISLRPIMILLGANSSGKSTFLRSFPLFTQSVDKKLRGPIAWFDTSYVD 66
++NLRS+ + I LRPI IL+G NS+GKST+LRSF L QS++ K PI W+ YVD
Sbjct: 7 VENLRSLTRTPEIELRPITILIGRNSAGKSTYLRSFALMRQSLEVKSSAPILWYG-DYVD 65
Query: 67 YGDFKTAKNRFAEDEEGITFCYRITNTAFSTRRV-----HFLQNIERVDDNE--FKDAEL 119
+GDF A + +D + ITF +R+ + TR V ++ + R + ++ L
Sbjct: 66 FGDFDAAVSNGDKDSD-ITFNFRVED--LDTRDVEDEIYYYNEYAYRSSRRKISYQAVSL 122
Query: 120 SFTLQGDNNGTYINYIHIKLEHVDYIFSMNDRKDDVKCI---INGQIL-DIQEKFRFNFN 175
S +L T I +++ + KD +C INGQ L D+ F
Sbjct: 123 SISLGEQLRRTVRRRIALEIPEDRISAVASFEKDGRECSDFQINGQRLSDLIPNVAVYFR 182
Query: 176 TAFAI-LPILVTNAEGDHT-------AVSAIGTRLISIFKKHCNKKLQNTQRLIN---IL 224
+ P+LV D A S IG+ + ++ +K++ I IL
Sbjct: 183 ASDIFDRPVLVRQVRKDGKSLRRVVGAASEIGSAISQRLRQQVDKRISEENLAIEARRIL 242
Query: 225 NCKSIEKEDFLKHLKYGDDIKSFQKSIRNWTINTSEFNLIYNYFLLLKSNSIIE--RLNN 282
N ++ L+ L+ +SF+K W + + L L ++++I R+++
Sbjct: 243 NHPRLDPTS-LQQLRLRAGNRSFRKLYDRWLKDEASTTLAEIDHLCALNHALITVGRVSS 301
Query: 283 ELANFYHTCDYIAPMRAEAGRFYRIQGLQVQSVDPSGSNLLEFIASLTHREKISYDDFIK 342
L F+ Y+ P RA + R+YR Q L+V + P G NL F+ASL+ E+ + D+++
Sbjct: 302 VLKGFFRNTTYLGPARARSERYYRSQELEVSEISPDGQNLPMFLASLSTTEQGRFSDWVE 361
Query: 343 DILGVTVDVPMESGMKSLRIKNSNGHFNIADVGYGYSQVLPVITKLWHTAYMTSNQKDYF 402
+ G V V + G S++++ N+AD GYG SQ+LPV+ ++W + N
Sbjct: 362 EAFGFGVLVERKEGHVSIKLRQRTISVNVADTGYGVSQILPVLAQIWWATQIPRN----- 416
Query: 403 FRYKFRESVKTTILMEQPELHLHPAMQAKIADTLIKTIDETREADQCAILVVETHSQALI 462
+ + R ++ I MEQPELHLHPA QAK+AD I+ + + A+L++ETHS+ALI
Sbjct: 417 YTRQMRHGLR-PITMEQPELHLHPAHQAKLADVFQSAIERSNASGPEAVLLIETHSEALI 475
Query: 463 NRIGRRIREGKLGVDDVNILLF--QKDDNMQNSIIKQISYTNEGQLRDWPYGFFD 515
NR+G I GK+ +DV +++F + + N+ I ++ + G LRDWPYGFF+
Sbjct: 476 NRLGELIEYGKIAANDVQVVIFSARGEGEELNTEISTSTFDDNGALRDWPYGFFN 530
>gi|120597810|ref|YP_962384.1| hypothetical protein Sputw3181_0980 [Shewanella sp. W3-18-1]
gi|120557903|gb|ABM23830.1| conserved hypothetical protein [Shewanella sp. W3-18-1]
Length = 516
Score = 215 bits (547), Expect = 6e-54, Method: Composition-based stats.
Identities = 154/536 (28%), Positives = 271/536 (50%), Gaps = 43/536 (8%)
Query: 2 ITEICIKNLRSIKDSGTISLRPIMILLGANSSGKSTFLRSFPLFTQSVDKKLRGPIAWFD 61
+ +I ++ L+S++DSG L+ I IL+G NSSGKSTFLR FPL QSV+KK RGPI W +
Sbjct: 1 MQKIKVEGLKSLRDSGDFELKKINILVGNNSSGKSTFLRIFPLLRQSVEKKTRGPILW-N 59
Query: 62 TSYVDYGDFKTAK-----NRFAEDEEGITFCYRITNTAFSTRRVHFLQNIERVDDNEFKD 116
Y+D+G F TA N ++E+ + F ++++ ++ ++ H + E +
Sbjct: 60 GLYIDFGSFDTAIHKKFFNGEVKNEDELFFTFQLS---YNRKKYH-----HSFINEEPVE 111
Query: 117 AELSFTLQGDNNG--TYINYIHIKLEHVDYIFSMNDRKDDVKCIINGQILDI---QEKFR 171
+ N+G Y +K+ + F ++ D V I+ L + +
Sbjct: 112 LICKIGMMRSNSGLSCYTASYQVKMYNHTIFFKFDE--DGVITDISSDRLSWPLKKHDLK 169
Query: 172 FNFNTAFAILPILVTNAEGDHTA-----VSAIGTRLISIFKKHCNKKLQNT-QRLINILN 225
+ +LPIL++ + G A +A+ +++ +FK + N Q +
Sbjct: 170 YQITDTDNLLPILISPSSGLVDAKGKQISTALFNQVVQLFKNYSGSSSSNKLQAMATRFL 229
Query: 226 CKSIEKEDFLKHLKYGDDIKSFQKSIRNWTINTSEFNLIYN----YFLLLKSNSIIERLN 281
S+ + L L+ + ++K +W +N+ +F I Y+L+ S +I + +
Sbjct: 230 RNSVNDNNKLALLRKMNSTAKWEKETLSWDVNSKKFRFISGLADLYYLIENSANINDSMT 289
Query: 282 NELANFYHTCDYIAPMRAEAGRFYRIQGLQVQSVDPSGSNLLEFIASLTHREKISYDDFI 341
NEL YIAP+R R+YR Q L + +D G N+ FI+++T + + + +
Sbjct: 290 NELLG----VRYIAPLRTSTARYYRHQDLSIDELDHRGDNIGMFISNITGKWREELNKWT 345
Query: 342 KDILGVTVDVPMESGMKSLRIKNSNGHF--NIADVGYGYSQVLPVITKLWHTAYMTSNQK 399
K G ++V + S S+ ++ ++ NI+D+G+G+SQVLP+I +LW + N K
Sbjct: 346 KSEFGFVINVDLNSNHISINLQYADSDLSDNISDMGFGFSQVLPIIIQLWAVSSGYENSK 405
Query: 400 DYFFRYKFRESVKTTILMEQPELHLHPAMQAKIADTLIKTIDETREADQCAILVVETHSQ 459
+ R+ +EQPELHLHP MQA ++ I I+ L++ETHS+
Sbjct: 406 ------RSRKINSYIFAIEQPELHLHPKMQASLSIVFINAIELANNNGIELKLIIETHSE 459
Query: 460 ALINRIGRRIREGKLGVDDVNILLFQKDDNMQNSIIKQISYTNEGQLRDWPYGFFD 515
++I+++G I +GK+ D + + LF++D + + I ++ G L +WP GFFD
Sbjct: 460 SIISKMGNLISKGKIANDMIGVSLFEQDRESRETKIISANFDQNGFLNNWPLGFFD 515
>gi|99078553|ref|YP_611811.1| hypothetical protein TM1040_3580 [Silicibacter sp. TM1040]
gi|99035691|gb|ABF62549.1| hypothetical protein TM1040_3580 [Silicibacter sp. TM1040]
Length = 510
Score = 213 bits (541), Expect = 4e-53, Method: Composition-based stats.
Identities = 151/514 (29%), Positives = 265/514 (51%), Gaps = 30/514 (5%)
Query: 26 ILLGANSSGKSTFLRSFPLFTQSVDKKLRGPIAWFDTSYVDYGDFKTAKNRFAEDEEGIT 85
+L+G NS+GKSTFLRS PL QS++ + PI W+ +VD+GDF+TA + A+D +
Sbjct: 1 MLVGRNSAGKSTFLRSLPLIRQSLETRSSAPILWYG-DFVDFGDFQTAVSSEAKDGYAV- 58
Query: 86 FCYRITNTAFSTRRV--HFLQNIERVDDNEFK--DAELSFTLQGDNNGTYINYIHIKLEH 141
F +++ + R H + R K +A + + + ++ T + I +++
Sbjct: 59 FSFKVRDLEKRHRSTVSHHTNYLLRYRTQTVKIDEAIVKYYVGAESGKTALKKISLEIPS 118
Query: 142 VDYIFSMNDRK---DDVKCIINGQILD-IQEKFRF-NFNTAFAILPILVTNAEGDHTAV- 195
D I ++ R +NG+ L I + F N + P L++ + ++ V
Sbjct: 119 EDLICDISYRGRLGTSGSLTVNGEELPYILQNFEIVNSSNDLFSAPSLISKTKDANSNVR 178
Query: 196 -----SAIGTRLISIFKKHCNKKLQNT----QRLINILNCKSIEKEDFLKHLKYGDDIKS 246
I + + F + ++ ++ + NIL+ + E+ K L + S
Sbjct: 179 RRHSYQDIAVKELERFLRREIPRISSSDTFRNEVANILS-GTPHLEEKWKRLSETANTAS 237
Query: 247 FQKSIRNWTINTS--EFNLIYNYFLLLKSNSIIERLNNELANFYHTCDYIAPMRAEAGRF 304
F+K N TS + I ++ +++ +N++L F+ Y+ P RA RF
Sbjct: 238 FRKYYENIQKGTSGKSKDSILTIQRTFRALQVLDVINDDLQYFFSGVSYLGPARAAGERF 297
Query: 305 YRIQGLQVQSVDPSGSNLLEFIASLTHREKISYDDFIKDILGVTVDVPMESGMKSLRIKN 364
YR Q L+V + P+GSN F+ SL+ +K S+ D+++ I G V++ G S+ +K
Sbjct: 298 YRKQELEVSEILPNGSNFPMFLDSLSTGQKRSFSDWVESIFGYGVELKSHEGHISIHLKA 357
Query: 365 SNGHFNIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHL 424
N+ D GYG SQ+LPV+ +W ++ T+ + + Y R++++ TI +EQPELHL
Sbjct: 358 GEKSVNVTDTGYGVSQILPVLATVWWSSRRTTER--WGIPYGRRQAIR-TIAIEQPELHL 414
Query: 425 HPAMQAKIADTLIKTIDETREADQCAI--LVVETHSQALINRIGRRIREGKLGVDDVNIL 482
HPA QAK+AD +K I + I +++ETHS+ALINR+G I G + DDV ++
Sbjct: 415 HPAHQAKLADVFVKGIKLGQSETNINIVNILIETHSEALINRLGELIELGSISSDDVQVI 474
Query: 483 LFQKDDNMQNSIIKQIS-YTNEGQLRDWPYGFFD 515
+F +D++ + ++ + + G L +WP+GFF+
Sbjct: 475 VFAAEDDINSPTRMSVANFDSNGALENWPFGFFN 508
>gi|91790149|ref|YP_551101.1| hypothetical protein Bpro_4311 [Polaromonas sp. JS666]
gi|91699374|gb|ABE46203.1| conserved hypothetical protein [Polaromonas sp. JS666]
Length = 514
Score = 206 bits (525), Expect = 2e-51, Method: Composition-based stats.
Identities = 158/544 (29%), Positives = 265/544 (48%), Gaps = 65/544 (11%)
Query: 2 ITEICIKNLRSIKDSGTISLRPIMILLGANSSGKSTFLRSFPLFTQSVDKKLRGPIAWFD 61
++ + +KNLRS++ + + PI +LLG NS GKSTF R FPLF QS ++K + P+ WF
Sbjct: 4 LSHLGVKNLRSLRCVPAVRIAPITVLLGRNSVGKSTFARIFPLFRQSSERKKKSPVLWFG 63
Query: 62 TSYVDYGDFKTAKNRFAEDEEGITFCYRIT--------NTAFSTRRVHFLQNIERVDDNE 113
D DF + + E E+ I F + I + RR L
Sbjct: 64 ----DLVDFGSLERSVTEGEKEIGFTFHIDVERPTKKGTVPWHQRRGKTL---------- 109
Query: 114 FKDAELSFTLQGDNNGT-YINYIHIKLEHVDYIFSMNDRKD---DVKCIINGQILDIQEK 169
AE+S TL D T + + +KL + S++ K+ + +NG+ D K
Sbjct: 110 ---AEVSITLVEDTQQTGHAKNVTVKLLGGE--ISIDPSKEVSMNTPFFVNGKRYDPITK 164
Query: 170 FRFNFNTAFAILPILV-----TNAEGDHTAVSA------IGTRLISIFKKHCNKKLQNTQ 218
ILP++ + +G + V + + T++I H N +
Sbjct: 165 DHGTVTFQGDILPLIYFYLYQKDPKGTASWVQSANHWEDLPTQVIR-RHVHSNTSAVTCR 223
Query: 219 RLINILNCKSIEK-EDFLKHLKYGDDIKSFQKSIRNWTINTSEFNLIYNYFLLLKSNSII 277
++ N + ++E+ E L + ++ +S +I +++ L+ ++ I
Sbjct: 224 KIANRVPVGTVEEIEKALVEIPGPPSWRAVAQSGNTKSIAAQ----LHHALLVSHTDEIF 279
Query: 278 ERLNNELANFYHTCDYIAPMRAEAGRFYRIQGLQVQSVDPSGSNLLEFIASLTHREKISY 337
+++ L +++ Y+ P+RA A R+YR L V ++ GSNL F+ SL+ + ++
Sbjct: 280 RQVDAALRSYFQGVRYLKPLRATAERYYRKLDLSVSEIEADGSNLPMFLDSLSEADLKNF 339
Query: 338 DDFIKDILGVTVDVPMESG---MKSLRIKNSNGHFNIADVGYGYSQVLPVITKLWHTAYM 394
+++ L V P G M + KN FNIAD+G+G SQVLP+ +LW T
Sbjct: 340 RSWVRIFLDVDA-FPKRQGDQVMVMAQGKNDPKPFNIADMGFGISQVLPIAAQLWATTRP 398
Query: 395 TSNQKDYFFRYKFRESVKTTILMEQPELHLHPAMQAKIADTLIKTIDETR-EADQCAILV 453
S + + + +++EQPELHLHP QA++AD TI T + + ++
Sbjct: 399 GSWTQ-----------LTSMVVLEQPELHLHPDSQARLADVFAGTIGATNPRSPRTPSIL 447
Query: 454 VETHSQALINRIGRRIREGKLGVDDVNILLFQKDDNMQNSIIKQI-SYTNEGQLRDWPYG 512
VETHSQ L+NR+G+ + G+L +DV+I+LF+ S +I S++ EG L++WPYG
Sbjct: 448 VETHSQHLVNRLGQLVESGRLAAEDVSIILFESSGGPSPSTKVRISSFSAEGVLQNWPYG 507
Query: 513 FFDP 516
FF+P
Sbjct: 508 FFEP 511
>gi|67920891|ref|ZP_00514410.1| similar to Uncharacterized conserved protein [Crocosphaera watsonii
WH 8501]
gi|67857008|gb|EAM52248.1| similar to Uncharacterized conserved protein [Crocosphaera watsonii
WH 8501]
Length = 406
Score = 179 bits (454), Expect = 4e-43, Method: Composition-based stats.
Identities = 121/389 (31%), Positives = 194/389 (49%), Gaps = 31/389 (7%)
Query: 134 YIHIKLEHVDYIFSMNDRKDDVKCIINGQILDIQEKFRFNFNTAFAILPILVTNAEGDHT 193
YI L H + + + ++ + I+N +L+ + RF F F+ L + N T
Sbjct: 38 YILFDLGHKFMLPQVREFSNNQEKILNLGVLNRVNRTRFIFQQLFSKLFQITKNKAHKFT 97
Query: 194 AVSAIGTRLISIFKKHCNKKLQNTQRLINILNCKSIEKEDFLKHLKYGDDIKSFQKSIRN 253
+ + I + S LQ+ I+ F K+++KS+ +
Sbjct: 98 SETTIFKMISSFVIGDSATMLQD------------IKNNPF--------STKTWKKSVNS 137
Query: 254 WTINTSEFNLIYNYFLLLKSNSIIERLNNELANFYHTCDYIAPMRAEAGRFYRIQGLQVQ 313
WTI ++EF + + ++E ++ LANF Y+ P+RA A R+YR Q L V
Sbjct: 138 WTIESNEFRQFRDLIIANSVPWLLEECDDLLANFATKISYMGPVRATAERYYRTQDLAVD 197
Query: 314 SVDPSGSNLLEFIASLTHREKISYDDFIKDILGVTVDVPMESGMKSLRI---KNSNGHFN 370
VD G+NL F+ +LTH E + + + G + G+ SL I K+S FN
Sbjct: 198 EVDYQGTNLPMFLRNLTHGELKEFRTWTLNNFGFEPLITTSQGLISLEIRPSKDSKESFN 257
Query: 371 IADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYK---FRESVKTTILMEQPELHLHPA 427
+AD G+G+SQ+LP+IT+LW + T R + FR + +EQPELHLHP
Sbjct: 258 VADTGFGFSQILPIITQLWFLSEFTKTSS----RIRPPGFRYNSPVIFPIEQPELHLHPR 313
Query: 428 MQAKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKLGVDDVNILLFQKD 487
+Q +A+ +K+I+ +E LV+ETHS+ LIN++G + E + +D+NI+LF K
Sbjct: 314 LQGVLAEVFVKSIEAAKERGIDLRLVIETHSEVLINKLGYLVAEKHILPEDINIVLFYKS 373
Query: 488 DNMQNSIIKQISYTNEGQL-RDWPYGFFD 515
D + ++ + +EG L DWP GFFD
Sbjct: 374 DQSKEIKVRTSQFDSEGYLSEDWPLGFFD 402
>gi|118443910|ref|YP_877792.1| hypothetical protein NT01CX_1713 [Clostridium novyi NT]
gi|118134366|gb|ABK61410.1| conserved hypothetical protein [Clostridium novyi NT]
Length = 473
Score = 116 bits (291), Expect = 3e-24, Method: Composition-based stats.
Identities = 136/521 (26%), Positives = 238/521 (45%), Gaps = 88/521 (16%)
Query: 7 IKNLRSIKDSGTISLRPIMILLGANSSGKSTFLRSFPLFTQSVDKKLRGPIAWFDTSYVD 66
++N++ DSG+ ++PI +++G NSSGKS+ +R FPL + ++ P F Y+D
Sbjct: 6 LRNIKGFPDSGSFEIKPITVIIGRNSSGKSSLIR-FPLVLKQTVQESVSPFLSFG-RYID 63
Query: 67 YGDFKTAKNRFAEDEEGITFCYRIT-NTAFSTRRVHFLQNIERVDDNEFKDAELSFTLQG 125
YG+F E + F + N +F D E + + L+
Sbjct: 64 YGNF-----------EDVVFNHDTNLNISF---------------DIEINSEKFLYLLKR 97
Query: 126 DNNGTYINYIHIKLEHVDYIFSMNDRKDDVKCIINGQILDIQEKFRFNFNTAFAILPILV 185
TYI Y KL+ YI ++ K II+ ++ ++ + IL+
Sbjct: 98 ----TYIPYKISKLKEAFYI-------ENKKLIIHVEVGKPKKVSKSKEGLKVKKFQILL 146
Query: 186 TNAEGDHTAVSAIGTRLISIFKKHCNKKLQNTQRLINILNCK----SIEKEDFLKHLKYG 241
GD +S C++ +T +N+++ K S++ + F+ ++ Y
Sbjct: 147 ----GDDNIIS-------------CDE--NDTLYDVNVMDRKFGKYSLQFDSFIPNINYM 187
Query: 242 DDIKSFQKSIRNWTINTSEFNLIYNYFLLLKSNSIIERLNNELANFYHTCDYIAPMRAEA 301
D + S+ N SE Y L+ +I + L+ YI P R
Sbjct: 188 D----LEPSVNN---TESE----YKEELIYMVRAITRAVQMYLSTLCDNISYIGPFRKIP 236
Query: 302 GRFYRIQGLQVQSVDPSGSNLLEFIASLTHREKISY----DDFIKDILGVTVDVP-MESG 356
R YR + +++V G E +AS R + ++++ LG+ + V ++
Sbjct: 237 ERSYRYKEANLENVGFDGQYTAEILASYYRRSDTKFFENINEWLNKHLGMKLTVEDLKGD 296
Query: 357 MKSLRIKN--SNGHFNIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTT 414
+ + IK+ +N + NI DVG+G SQVLP+I +++ + + + +F K
Sbjct: 297 LYKILIKDLETNVNNNIIDVGFGLSQVLPIIVQVFMNKHYDTKTIRQNYTRRFLN--KKL 354
Query: 415 ILMEQPELHLHPAMQAKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKL 474
++EQPELHLHP+ QA +AD I E + D+ ++ETHS+ LI R+ RRI E K+
Sbjct: 355 NIIEQPELHLHPSAQANLADLFI----EGHKIDKKNYFLIETHSEHLILRLRRRILEDKI 410
Query: 475 GVDDVNILLFQKDDNMQNSIIKQISYTNEGQLRDWPYGFFD 515
+DV + +K N +S ++++ G + +WP GFFD
Sbjct: 411 NPNDVGLYFIEKRSN-NDSCVQKLHLDEFGNISNWPEGFFD 450
>gi|91772241|ref|YP_564933.1| hypothetical protein Mbur_0172 [Methanococcoides burtonii DSM 6242]
gi|91711256|gb|ABE51183.1| conserved hypothetical protein [Methanococcoides burtonii DSM 6242]
Length = 527
Score = 100 bits (249), Expect = 2e-19, Method: Composition-based stats.
Identities = 129/568 (22%), Positives = 237/568 (41%), Gaps = 116/568 (20%)
Query: 1 MITEICIKNLRSIKDSGTISLRPIMILLGANSSGKSTFLRSFPLFTQSVDKKLRGPIAWF 60
MI ++ ++N + +KDS I L+P+ +L G+NSSGKS+ ++S + Q+ + + R
Sbjct: 1 MIKQLHLENFKGLKDSKKIDLKPLTLLCGSNSSGKSSIIQSLLMLKQTFESQSRYRKVVL 60
Query: 61 DTSYVDYGDF-KTAKNRFAEDEEGITFCYRITNTAFS--TRRVHFLQ---------NIER 108
+ + G F N+ +E + F + A + ++ F+ +
Sbjct: 61 NGQFTHLGSFINIIHNKTLSNEVMLKFQISPSTPAINGGNKKYSFIDYHLNDFNFGSSSE 120
Query: 109 VDDNEFKDAELSFTLQGDNNGT--YINYIHIKLEHVDYIFSMNDRKDDVKCII-----NG 161
+ N+ D EL + D+N Y+N ++ + D + +D I N
Sbjct: 121 NNSNKLLDIELGLKVLKDSNAMEPYVNTFNVLAKSNDLQENHSDGTSISLSRISTDKYNL 180
Query: 162 QILDIQEKFRFNF-----NTAFAILPILVTNAEGDHTAVSAIGTRLISIFKKHCNKKLQN 216
+ +I+ + NF +I+ + H + S IG KKH +
Sbjct: 181 KWKNIKSGYFLNFVEDPVPIPKSIISYVCPKCGISHRSNSGIG-------KKHYSYH--- 230
Query: 217 TQRLINILNCKSIEKEDFLKHLKYGDDIKSFQKSIRNWTINTSEFNLIYNYFLLLKSNSI 276
F KH G ++K+ ++ I ++++ Y ++ NS+
Sbjct: 231 -----------------FYKH--NGAEVKASFSNLMP-IIKQQDYSVKALYPIVHSINSL 270
Query: 277 IERLNNELANFYHTCDYIAPMRAEAGRFYRIQGLQVQSVDPSGSNLLEFIASLTHREKIS 336
E ++ E +N Y YI P+R E R Y I + + G N FI S+ +KIS
Sbjct: 271 KELIHTEFSNLY----YIGPLREEPARRY-IHEAEFIDIGIRGENA-PFILSIEGNKKIS 324
Query: 337 Y---------------DDFIKDI-------LGVTVDVPMESGMKSLR-----IKNSNGHF 369
DD ++D +G++ + +E+ +R I++ +
Sbjct: 325 PYRFYNEPLNEWQSIDDDNLEDAVNRWLRYMGIS-EYELEAKDDIIRLNMSSIQDKDVKV 383
Query: 370 NIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAMQ 429
+ADVG+G SQ+LP++ + R T+++EQPE+HLHP +Q
Sbjct: 384 TLADVGFGVSQILPILVE------------------GLRIGDGQTLILEQPEIHLHPKLQ 425
Query: 430 AKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKLG--VDDVNILLFQKD 487
++AD + + + +++ETHS ++NRI RRI E + N+L F+
Sbjct: 426 MQLADFFVSMVLSGKR------VLIETHSDHIVNRISRRILESDNNELLSKTNLLFFENI 479
Query: 488 DNMQNSIIKQISYTNEGQLRDWPYGFFD 515
D + + + + +WP GFFD
Sbjct: 480 DGY--PALNPVEIDSNRGIVNWPKGFFD 505
>gi|83643137|ref|YP_431572.1| hypothetical protein HCH_00232 [Hahella chejuensis KCTC 2396]
gi|83631180|gb|ABC27147.1| uncharacterized conserved protein [Hahella chejuensis KCTC 2396]
Length = 463
Score = 97.8 bits (242), Expect = 2e-18, Method: Composition-based stats.
Identities = 67/238 (28%), Positives = 111/238 (46%), Gaps = 30/238 (12%)
Query: 282 NELANFYHTCDYIAPMRAEAGRFYRIQGLQVQSVDPSGSNLLEFIASLTHREKISYDDFI 341
+E+ + + Y+ P+R + R+YRI Q+V P G N K D ++
Sbjct: 233 SEIRSLFQNLSYVGPLRKKIERYYRIASETPQTVGPQGENAPNLFRRNYENLKHDVDRWV 292
Query: 342 KDI-LGVTVDV-PMESGMKSLRIKNSNGHFNIADVGYGYSQVLPVITKLWHTAYMTSNQK 399
K G ++ + + L + + + N+ADVG+G SQVLP+I +
Sbjct: 293 KHFEFGDSLHFDDVNDDIFQLFLGSGDQRVNVADVGFGASQVLPLIIQ------------ 340
Query: 400 DYFFRYKFRESVKTTILMEQPELHLHPAMQAKIADTLIKTIDETREADQCAILVVETHSQ 459
+ L EQPE+HL+P +Q +AD + A+ ++VETHS+
Sbjct: 341 ------ALAAPPDSLTLAEQPEIHLNPRLQCALADLFVHM------ANNGHRVLVETHSE 388
Query: 460 ALINRIGRRIREGKLGVDDVNILLFQKDDNMQNSIIKQISYTNEGQL--RDWPYGFFD 515
LI R+ + + G + DDV + ++D + S+IKQ++ N G + WP GFFD
Sbjct: 389 HLIMRLRKFVANGDIKKDDVALYFLERDGD--RSVIKQVNIENNGHINTESWPKGFFD 444
>gi|121593250|ref|YP_985146.1| hypothetical protein Ajs_0828 [Acidovorax sp. JS42]
gi|120605330|gb|ABM41070.1| conserved hypothetical protein [Acidovorax sp. JS42]
Length = 465
Score = 90.9 bits (224), Expect = 2e-16, Method: Composition-based stats.
Identities = 74/268 (27%), Positives = 122/268 (45%), Gaps = 58/268 (21%)
Query: 271 LKSNSIIERLNNELANFYHTCDYIAPMRAEAGRFYRIQGLQVQSVDPSGSNLLEFIAS-- 328
L + S++ER++ Y+ P+R+ R Y+ G SV G + I +
Sbjct: 204 LATESMLERIS-----------YLGPLRSHPQRIYQWSGDTPASVGQMGEYTIAAILAAQ 252
Query: 329 ---------LTHREKISYDDFI----KDILGVTVDV---PMESGMKS----LRIKNSNGH 368
+ H K + +FI KD LGV D P+ +G K ++
Sbjct: 253 GEGRRLNRQVGHHTK-GFAEFIAAWLKD-LGVIHDFAVRPVAAGRKEYEVLVKTHAKAPE 310
Query: 369 FNIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAM 428
I DVG+G SQVLP + + ++ +TI MEQPE+HLHP +
Sbjct: 311 VKITDVGFGISQVLPALVQAFYCP------------------PHSTIWMEQPEIHLHPQV 352
Query: 429 QAKIADTLIKTIDETREADQC--AILVVETHSQALINRIGRRIREGKLGVDDVNILLFQK 486
QA++AD I + RE + L+VE+HS+ +NR+ RR+ EG + +DV + ++
Sbjct: 353 QAELADVFISAT-QAREGGRPRDVQLIVESHSEHFLNRLQRRVAEGVVKPEDVAVYFCRR 411
Query: 487 DDNMQNSIIKQISYTNEGQLRDWPYGFF 514
D+ ++ + G++ +WP FF
Sbjct: 412 ADSATE--LEPLKLNMFGEIENWPPNFF 437
>gi|83944454|ref|ZP_00956907.1| hypothetical protein EE36_14013 [Sulfitobacter sp. EE-36]
gi|83844656|gb|EAP82540.1| hypothetical protein EE36_14013 [Sulfitobacter sp. EE-36]
Length = 102
Score = 90.5 bits (223), Expect = 3e-16, Method: Composition-based stats.
Identities = 43/101 (42%), Positives = 63/101 (62%), Gaps = 2/101 (1%)
Query: 417 MEQPELHLHPAMQAKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKLGV 476
MEQPELHLHPA QAK+AD I A+L++ETHS++L+NR+G I +G +
Sbjct: 1 MEQPELHLHPAHQAKLADVFRSAILRNESNLTNAVLLIETHSESLVNRLGELIEDGSIDP 60
Query: 477 DDVNILLFQKDDNMQN--SIIKQISYTNEGQLRDWPYGFFD 515
+DV +++F Q+ + I ++ +G L DWPYGFF+
Sbjct: 61 NDVQVVIFSASGGEQDLKTEISTSTFDADGVLLDWPYGFFN 101
>gi|88813619|ref|ZP_01128850.1| hypothetical protein NB231_12926 [Nitrococcus mobilis Nb-231]
gi|88789124|gb|EAR20260.1| hypothetical protein NB231_12926 [Nitrococcus mobilis Nb-231]
Length = 464
Score = 87.8 bits (216), Expect = 2e-15, Method: Composition-based stats.
Identities = 78/301 (25%), Positives = 129/301 (42%), Gaps = 53/301 (17%)
Query: 242 DDIKSFQKSIRNWTINTSEFNLIYNYFLLLK---SNSIIE---RLNNELANFYHTCDYIA 295
D ++ K R W I E ++ L + ++++ E RL LA+ Y+
Sbjct: 162 DPMRLVMKQGRKWPIEAPEKFYRFSGVTLARYQNADALSEFPLRLEQLLADL----TYLG 217
Query: 296 PMRAEAGRFYRIQGLQVQSVDPSGSNLLEFIASLTH----------REKISYDDFI---- 341
P+R R Y G V V G + + + TH + ++D F+
Sbjct: 218 PLREPPQRTYSWAGDTVPEVGARGELAIPALLAATHDGVKLNRGPGKRYEAFDAFVARWL 277
Query: 342 KDILGVT---VDVPMESGMKS----LRIKNSNGHFNIADVGYGYSQVLPVITKLWHTAYM 394
+D LGV V P+ G K +R + S+ + DVG+G SQVLP + +
Sbjct: 278 RD-LGVIDSFVVRPIAEGRKEYEVLVRTRRSSAEVKLTDVGFGVSQVLPALVQ------- 329
Query: 395 TSNQKDYFFRYKFRESVKTTILMEQPELHLHPAMQAKIADTLIKTIDETREADQCAI-LV 453
F +T+ MEQPE+HLHP QA++AD I + ++ + L+
Sbjct: 330 -----------AFYAQPNSTVWMEQPEIHLHPQAQAELADAFISAVQASQNGTPRNVQLI 378
Query: 454 VETHSQALINRIGRRIREGKLGVDDVNILLFQKDDNMQNSIIKQISYTNEGQLRDWPYGF 513
VE+HS+ + R+ RR+ E + D+V ++ + QI+ G + +WP F
Sbjct: 379 VESHSEHFLQRLQRRVAEQAISPDEVAAYFVRQGGVGVKLVPLQINPF--GDIENWPDNF 436
Query: 514 F 514
F
Sbjct: 437 F 437
>gi|153872276|ref|ZP_02001215.1| conserved hypothetical protein [Beggiatoa sp. PS]
gi|152071259|gb|EDN68786.1| conserved hypothetical protein [Beggiatoa sp. PS]
Length = 547
Score = 82.8 bits (203), Expect = 5e-14, Method: Composition-based stats.
Identities = 72/264 (27%), Positives = 110/264 (41%), Gaps = 53/264 (20%)
Query: 284 LANFYHTCDYIAPMRAEAGRFYRIQGLQVQSVDPSGSNLLEFIASLTHREKISY------ 337
L F+ T Y+ P+R RFY I + ++ G +AS + Y
Sbjct: 289 LRQFWLTFRYLGPLREAPQRFYLINEIGKMNIGVKGEYTAIVLASEQEQSIPPYYKCIYE 348
Query: 338 DDFIKDILGVTVDVPMESG------------MKSLRIKNSNGHFNI---------ADVGY 376
IK+ T D ME+ M + IK + + DVG+
Sbjct: 349 GKQIKEYEYRTSDQMMEALNVWLSFMKLPQLMPTPAIKRTTNQIKLDAFGIEVCLPDVGF 408
Query: 377 GYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAMQAKIADTL 436
G SQ+LPV+ + R TI++EQPE+HLHP++Q+K+AD L
Sbjct: 409 GVSQILPVLVEC------------------LRTKPGETIILEQPEIHLHPSLQSKLADFL 450
Query: 437 IKTIDETREADQCAILVVETHSQALINRIGRRI--REGKLGVDDVNILLFQKDDNMQNSI 494
I ++ ++VETHS+ LI R+ R+ E +N + D+ Q SI
Sbjct: 451 ICMAKAGKK------IIVETHSEHLIKRLYLRVAQEESDETRRLLNTIFVSFDEQQQTSI 504
Query: 495 IKQISYTNEGQLRDWPYGFFDPLD 518
+ I G++ +WP FFD D
Sbjct: 505 TQAIMINEYGEIENWPVHFFDEDD 528
>gi|84619226|emb|CAJ43152.1| hypothetical protein [Bacteriophage P2-EC5]
Length = 555
Score = 80.5 bits (197), Expect = 3e-13, Method: Composition-based stats.
Identities = 66/232 (28%), Positives = 116/232 (50%), Gaps = 37/232 (15%)
Query: 292 DYIAPMRAEAGRFY----RIQGLQVQSVDPSGSNLLEFIASLTHREKISYD-DFIKDILG 346
++++P+RA R+Y + Q+ S D GS L E + T KI+ + + +
Sbjct: 329 NHVSPLRAFPQRYYLLDKTVNHAQLNSAD--GSELAEILKKNT---KITENINLLLAEFN 383
Query: 347 VTVD-VPMESGMKSLRIKNSNGHFNIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRY 405
+ VD V + + + I + + DVG+G S+VLPV+ + AY++
Sbjct: 384 LAVDIVRVNDIIHKIVINQEAVNLELTDVGFGISRVLPVLVQ----AYLSPK-------- 431
Query: 406 KFRESVKTTILMEQPELHLHPAMQAKIADTLIKTIDETREADQCAILVVETHSQALINRI 465
+ ++EQPE+HLHP MQA + D LIK + ++ I ++ETHS AL+ RI
Sbjct: 432 ------GSITIIEQPEIHLHPKMQAWLTDALIKISMQEKK-----IFIIETHSDALVRRI 480
Query: 466 GRRI--REGKLGVDDVNILLFQKDDNMQNSIIKQISYTNEGQLRDWPYGFFD 515
RI +L D V I ++D+ ++++ +++ ++G + WP F D
Sbjct: 481 RLRIVDESSELTEDQVAIYHLERDEEGTSTLLNRVAVNSDGDI-TWPNEFMD 531
>gi|67920890|ref|ZP_00514409.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
gi|67857007|gb|EAM52247.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
Length = 96
Score = 80.1 bits (196), Expect = 4e-13, Method: Composition-based stats.
Identities = 38/86 (44%), Positives = 58/86 (67%), Gaps = 1/86 (1%)
Query: 5 ICIKNLRSIKDSGTISLRPIMILLGANSSGKSTFLRSFPLFTQSVDKKLRGPIAWFDTSY 64
I ++ LR + D+G I ++PI IL+G NSSGKSTFLR FPL QSV+ + GP+ W +
Sbjct: 4 IRLQRLRCLSDTGEIKIKPITILVGENSSGKSTFLRFFPLLKQSVESRTIGPVLW-NGRL 62
Query: 65 VDYGDFKTAKNRFAEDEEGITFCYRI 90
VD+G++K A + +E++ + F + I
Sbjct: 63 VDFGNYKDAHQKHSEEDICLEFKFEI 88
>gi|148263102|ref|YP_001229808.1| hypothetical protein Gura_1029 [Geobacter uraniumreducens Rf4]
gi|146396602|gb|ABQ25235.1| conserved hypothetical protein [Geobacter uraniumreducens Rf4]
Length = 565
Score = 78.6 bits (192), Expect = 1e-12, Method: Composition-based stats.
Identities = 54/163 (33%), Positives = 82/163 (50%), Gaps = 35/163 (21%)
Query: 363 KNSNGHFNIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPEL 422
+ SN DVG G SQVLPV+ AY + ++ + MEQPE+
Sbjct: 418 QRSNTVVTHRDVGIGISQVLPVLV----MAYGSQHK---------------LVAMEQPEI 458
Query: 423 HLHPAMQAKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKLGVD----- 477
HLHPA+QA++ D IK+ +Q ++ETHS+ LI R RR+RE D
Sbjct: 459 HLHPALQAELGDVFIKSA----LGEQKNTFILETHSEHLILRFMRRLRETYQKKDNGLPP 514
Query: 478 ----DVNILLFQKDDNMQNSIIKQISYTNEGQL-RDWPYGFFD 515
D+++L + D +SI++++ G+L + WP GFF+
Sbjct: 515 ITPMDISVLYVEPDG--PHSIVREMPLNELGELVKAWPGGFFE 555
>gi|150398874|ref|YP_001322641.1| hypothetical protein Mevan_0116 [Methanococcus vannielii SB]
gi|150011577|gb|ABR54029.1| hypothetical protein Mevan_0116 [Methanococcus vannielii SB]
Length = 617
Score = 78.6 bits (192), Expect = 1e-12, Method: Composition-based stats.
Identities = 98/401 (24%), Positives = 179/401 (44%), Gaps = 64/401 (15%)
Query: 132 INYIHIKLEHVDYIFSMNDRKDDVKCII---NGQILDIQEKFR----------FNFNTAF 178
I I LE++D IFS+ND + K + N I+ EK + N T +
Sbjct: 237 IKNIKKALENIDLIFSLNDEMNVSKFELRSNNNLIIRYSEKTKSGELLGNSCNLNDLTFY 296
Query: 179 AILPILVTNAEGDHTAVSAIGTRL--ISIFKKHCNKKLQNTQ-RLINILNCKSIEK-EDF 234
LP + ++ + I ++ I F + K + +++I C+++E +D
Sbjct: 297 GFLPAVEKSSSYELLKNFEIPNKMEKIHSFYEELIKHIFGVDLSVLSIFACRNLESFKD- 355
Query: 235 LKHLKYGDDIKSFQKS------IRNWTINTSEFNLIYNYFLLLKSNSIIERLNNELANFY 288
KYGD I+ F ++ N IN +++I I++L NE N
Sbjct: 356 ----KYGDFIQIFNENPDILAQFLNQGINLGIYHII------------IDKLKNEFDNI- 398
Query: 289 HTCDYIAPMRAEAGRFYRIQGLQVQSVDPSGSNLLEFIASLTHREKI-SYDDFIKDILGV 347
+YI P R R Y SV P G N + L + D +++ LG
Sbjct: 399 -NVNYIGPFRKVPERIYPEIDYVPDSVGPLGENAAYILDKLNGSDDFHKIDVWLRTNLGC 457
Query: 348 TVDVPMESGMKSLRIKNSNGHF---NIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFR 404
+ + + + IK++N ++ D+GYG SQ+LP++T + + + K+
Sbjct: 458 NLIIEDYNDSGNFSIKSANTTRQADDLVDMGYGISQILPILTMI-----LVNFPKN---- 508
Query: 405 YKFRESVKTTILMEQPELHLHPAMQAKIADTLIKTIDETREADQCAILVVETHSQALINR 464
++S + L+EQPELHLHPA+ +AD +D+T ++ + +++ETHS+ L+ R
Sbjct: 509 ---KDSGENMYLIEQPELHLHPAVHGNLADLF---VDKTLNDNKSSKMIIETHSEYLLKR 562
Query: 465 IGRRIRE--GKLGVDDVNILLFQKDDNMQNSIIKQISYTNE 503
+ + + + +D+++ KD S +K++ + E
Sbjct: 563 LQELVLDPNNPITEEDISVYYVNKDKEWV-SDVKKVDFCKE 602
>gi|86750096|ref|YP_486592.1| hypothetical protein RPB_2979 [Rhodopseudomonas palustris HaA2]
gi|86573124|gb|ABD07681.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2]
Length = 464
Score = 74.3 bits (181), Expect = 2e-11, Method: Composition-based stats.
Identities = 65/272 (23%), Positives = 116/272 (42%), Gaps = 51/272 (18%)
Query: 269 LLLKSNSIIERLNNELANFYHTCDYIAPMRAEAGRFYRIQGLQVQSVDPSGSNLLEFIAS 328
L +++ + NE + ++ P+R R Y G V +G +E I +
Sbjct: 190 LYFQNSQFLSEFENEYVKQMDSLIHLGPLRDYPKRQYIWAGSSPIDVGRTGERTIEAILA 249
Query: 329 LTHREKIS-------YDDFIKDILGVTVDVPMESGMKSLRIKNSNGHFN----------- 370
T R ++ F + I ++ + K + I + G +
Sbjct: 250 ATSRNEVRNLRAKARLRPFQEMIAWWLREMGLIHSFKLVEIGSGAGLYRTVIKRDPESPE 309
Query: 371 --IADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAM 428
I DVG+G SQ+LP + L++ +TI++EQPE+HLHPA+
Sbjct: 310 TLITDVGFGISQILPALVLLYYAPE------------------GSTIVLEQPEIHLHPAI 351
Query: 429 QAKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREG------KLGVDDVNIL 482
Q+ +AD +I I +TR +++E+HS+ L+ R+ RR+ EG ++ D+ +
Sbjct: 352 QSSLADLIITAI-KTRNIQ----VILESHSEHLLMRLLRRVAEGNKSPYPEIEPQDMKVY 406
Query: 483 LFQKDDNMQNSIIKQISYTNEGQLRDWPYGFF 514
Q D S +++ G + +WP FF
Sbjct: 407 FCQARDG--ESRAEELKVNLFGSIENWPTDFF 436
>gi|76261174|ref|ZP_00768793.1| conserved hypothetical protein [Chloroflexus aurantiacus J-10-fl]
gi|76163917|gb|EAO58078.1| conserved hypothetical protein [Chloroflexus aurantiacus J-10-fl]
Length = 466
Score = 73.6 bits (179), Expect = 3e-11, Method: Composition-based stats.
Identities = 46/144 (31%), Positives = 72/144 (50%), Gaps = 24/144 (16%)
Query: 371 IADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAMQA 430
+ DVG+G SQ+LPVIT ++ + +++E PE+HLHP QA
Sbjct: 325 MTDVGFGISQMLPVITLCYYVPK------------------GSIVILEHPEIHLHPRAQA 366
Query: 431 KIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKLGVDDVNILLFQKDDNM 490
+AD LI + +TR ++VE+HS+ L+ R+ RRI E L D + + D
Sbjct: 367 DLADVLIDAV-KTRGVQ----IIVESHSEHLLRRLQRRIAEDVLPSDQAALYVCTTDAE- 420
Query: 491 QNSIIKQISYTNEGQLRDWPYGFF 514
S I+++ G + +WP FF
Sbjct: 421 GVSHIQRLHLDPYGNITNWPKDFF 444
>gi|88804596|ref|ZP_01120116.1| hypothetical protein RB2501_07080 [Robiginitalea biformata
HTCC2501]
gi|88785475|gb|EAR16644.1| hypothetical protein RB2501_07080 [Robiginitalea biformata
HTCC2501]
Length = 630
Score = 72.0 bits (175), Expect = 8e-11, Method: Composition-based stats.
Identities = 68/242 (28%), Positives = 111/242 (45%), Gaps = 30/242 (12%)
Query: 283 ELANFYHTCDYIAPMRAEAGRFYRIQGLQVQSVDPSGSNLLEFIASLTHREKI--SYDDF 340
+ + + DYI+ R R +G + + E I + RE+ Y+
Sbjct: 392 KFKSLFKNIDYISANRGTQKRVLLNEG---------ETQMDEIIRDYSSRERTFKDYESK 442
Query: 341 IKDILGV--TVDVPMESGMKSLRIKNSNGHFNIADVGYGYSQVLPVITKLWHT--AYMTS 396
+ G+ ++V S+ N++D+GYGYSQ++P+I ++ + + T
Sbjct: 443 ALQLFGIPGEIEVKRFENFASVVSLKGKKEINLSDLGYGYSQIVPIILRISNAIGSLATF 502
Query: 397 NQKDYFFRYKFRESVKTTILMEQPELHLHPAMQAKIADTLIKTIDETREADQCAILVVET 456
+K Y TI++E+PE +LHP +Q+KIAD L TI T Q V+ET
Sbjct: 503 QKKPLINEY--------TIIIEEPEANLHPNLQSKIADFLHLTI-TTFPGLQ---FVIET 550
Query: 457 HSQALINRIGRRIREGKLGVDDVNILLFQKDD--NMQNSIIKQISYTNEGQLRD-WPYGF 513
HS+ LI ++ + L V I F D+ N + +K+IS T G L D + GF
Sbjct: 551 HSEYLIRKLQYLVANDSLDEKSVAIYYFNSDEYVNEKEPKVKEISITANGNLTDTFGPGF 610
Query: 514 FD 515
+D
Sbjct: 611 YD 612
>gi|39995276|ref|NP_951227.1| hypothetical protein GSU0165 [Geobacter sulfurreducens PCA]
gi|39982038|gb|AAR33500.1| conserved hypothetical protein [Geobacter sulfurreducens PCA]
Length = 451
Score = 72.0 bits (175), Expect = 9e-11, Method: Composition-based stats.
Identities = 47/144 (32%), Positives = 70/144 (48%), Gaps = 25/144 (17%)
Query: 370 NIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAMQ 429
NIADVG+G SQ LPV+ L H A + T+ +EQPE+HLHP Q
Sbjct: 306 NIADVGFGVSQTLPVVVAL-HVA-----------------APGQTVYLEQPEIHLHPRAQ 347
Query: 430 AKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKLGVDDVNILLFQKDDN 489
A +A+ + ++ + +VVETHS + + + E KL D V++ F +DD+
Sbjct: 348 AVMAEVITDAVNRGVK------VVVETHSSIFLLTMQSLVAEEKLPSDVVSLHWFSRDDD 401
Query: 490 MQNSIIKQISYTNEGQLRDWPYGF 513
+I+ S + G WP F
Sbjct: 402 GL-TIVTSASLDSSGSFGQWPEDF 424
>gi|53688205|ref|ZP_00109506.2| COG4938: Uncharacterized conserved protein [Nostoc punctiforme PCC
73102]
Length = 617
Score = 71.2 bits (173), Expect = 1e-10, Method: Composition-based stats.
Identities = 69/257 (26%), Positives = 106/257 (41%), Gaps = 52/257 (20%)
Query: 287 FYHTCDYIAPMRAEAGRFYRIQGLQ------------VQSVDPSGSNLLEFIAS----LT 330
F Y+ P+R E Y + G +D + + +E+I S
Sbjct: 361 FTRFVKYLGPLRDEPKPVYSLTGTTDSKDIGFRGEHTAAVLDVNRNTKVEYIPSNQLKFG 420
Query: 331 HREKISYDDFIK-------DILGVTVDV-PMESGMKSLRIK-NSNGHFNIAD---VGYGY 378
E I+ DF+ + +GV DV ++ G +K + G + D VG G
Sbjct: 421 SNETITKADFLPIAVLDWLNYMGVASDVQTLDKGKLGHELKVATTGSLLLHDLTHVGVGV 480
Query: 379 SQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAMQAKIADTLIK 438
SQVLP++ + +T++ EQPELHLHP +Q ++AD I
Sbjct: 481 SQVLPILVQ------------------SLLAEKGSTLIFEQPELHLHPRVQTRLADFFIS 522
Query: 439 TIDETREADQCAILVVETHSQALINRIGRRIREGKLGVDDVNILLFQKDDNMQNSIIKQI 498
T QC +VETHS+ LINR+ + + N+L++ + S + +
Sbjct: 523 M---TMLKKQC---IVETHSEYLINRLRYQAAASEGEEISKNVLIYFVEKQSGESKYRPV 576
Query: 499 SYTNEGQLRDWPYGFFD 515
G L DWP GFFD
Sbjct: 577 KLNKFGVLEDWPKGFFD 593
>gi|83816599|ref|YP_445232.1| hypothetical protein SRU_1103 [Salinibacter ruber DSM 13855]
gi|83757993|gb|ABC46106.1| conserved hypothetical protein [Salinibacter ruber DSM 13855]
Length = 216
Score = 69.3 bits (168), Expect = 5e-10, Method: Composition-based stats.
Identities = 51/180 (28%), Positives = 90/180 (50%), Gaps = 32/180 (17%)
Query: 343 DILGVTVDVPMESGMKSL---RIKNSNGHFNIADVGYGYSQVLPVITKLWHTAYMTSNQK 399
D+ + D+ +ES L ++ + G +AD+G G +Q+LP+I KL T S
Sbjct: 46 DVFDLGSDLDVESVAPYLYAASVRRNGGRRYLADLGSGTAQLLPLILKL--TVGHPS--- 100
Query: 400 DYFFRYKFRESVKTTILMEQPELHLHPAMQAKIADTLIKTIDETREADQCAILVVETHSQ 459
+ +L+E+PE +LHP +Q+++AD L++ I + ++VETHS+
Sbjct: 101 -------------SVLLLEEPEANLHPNLQSRLADLLVELIGSGHQ------VLVETHSE 141
Query: 460 ALINRIGRRIREGKLGVDDVNILLFQKDDNMQNSI--IKQISYTNEGQLRDWPY--GFFD 515
L+ R+ + +G+ D ++L D+ S ++ I GQL + P+ GFFD
Sbjct: 142 YLVRRLQYLVAKGRCDPDSASVLYVDATDSTDTSTPDVRSIFIDEHGQLSE-PFGGGFFD 200
>gi|118068861|ref|ZP_01537111.1| conserved hypothetical protein [Serratia proteamaculans 568]
gi|118017032|gb|EAV30954.1| conserved hypothetical protein [Serratia proteamaculans 568]
Length = 189
Score = 69.3 bits (168), Expect = 5e-10, Method: Composition-based stats.
Identities = 47/158 (29%), Positives = 76/158 (48%), Gaps = 25/158 (15%)
Query: 359 SLRIKNSNGH-FNIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILM 417
S +IKN + ++ADVG+G SQ LPV+ L + + + + + +
Sbjct: 35 SRKIKNGDDDCVSVADVGFGVSQTLPVLVAL-----LVARKGQHVY-------------I 76
Query: 418 EQPELHLHPAMQAKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKLGVD 477
EQPELHLHP Q K+A+ + +D + +++ETHS LI I +I ++
Sbjct: 77 EQPELHLHPKAQFKLAEIMKSALDRGVK------IIIETHSSLLIRGIQTKIARKEIDST 130
Query: 478 DVNILLFQKDDNMQNSIIKQISYTNEGQLRDWPYGFFD 515
V++ F +D+ +II + G DWP F D
Sbjct: 131 QVSLNWFTQDEETGETIINTKAPDELGAFGDWPSDFDD 168
>gi|126463976|ref|YP_001045089.1| hypothetical protein Rsph17029_3220 [Rhodobacter sphaeroides ATCC
17029]
gi|126105787|gb|ABN78317.1| conserved hypothetical protein [Rhodobacter sphaeroides ATCC 17029]
Length = 429
Score = 68.6 bits (166), Expect = 1e-09, Method: Composition-based stats.
Identities = 43/145 (29%), Positives = 77/145 (53%), Gaps = 25/145 (17%)
Query: 370 NIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAMQ 429
N+ DVG G SQ+LPV+ YF + R SV +++E+PE+HLHP Q
Sbjct: 287 NLRDVGIGVSQILPVLVV------------SYFAK---RGSV---VILEEPEIHLHPLAQ 328
Query: 430 AKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKLGVDDVNILLFQKDDN 489
+ +A+ ++ + + R+ +VETHS+ L R+ I + ++DV++ +++D
Sbjct: 329 STLAELFVE-VGKERDLQ----FIVETHSEHLFRRMQTLIAKNPRNLEDVSMYFIEREDG 383
Query: 490 MQNSIIKQISYTNEGQLRDWPYGFF 514
+ ++ + G+L +WP GFF
Sbjct: 384 --KAKLRTLEVDEVGRLLNWPDGFF 406
>gi|119489545|ref|ZP_01622306.1| hypothetical protein L8106_28226 [Lyngbya sp. PCC 8106]
gi|119454624|gb|EAW35771.1| hypothetical protein L8106_28226 [Lyngbya sp. PCC 8106]
Length = 374
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 44/150 (29%), Positives = 72/150 (48%), Gaps = 28/150 (18%)
Query: 369 FNIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAM 428
+ +VG+G + LP++ A +TS T I++E PE HLHP
Sbjct: 231 YRATNVGFGVTYTLPILV-----AILTSKPG-------------TLIVIENPEAHLHPKG 272
Query: 429 QAKIADTLIKTIDETREADQCAI-LVVETHSQALINRIGRRIREGKLGVDDVNILLFQKD 487
QAKI + A C + +++ETHS ++N I + +GKL +DV + FQ+
Sbjct: 273 QAKIGELFAL-------AASCGVQVIIETHSDHVLNGIRLAVHDGKLSENDVQLHYFQRI 325
Query: 488 DNMQNSIIKQIS--YTNEGQLRDWPYGFFD 515
+ ++ + +S G++ WP GFFD
Sbjct: 326 EKEGQAMSEVVSPRIDRNGRIDRWPEGFFD 355
>gi|87122784|ref|ZP_01078657.1| hypothetical protein MED121_24199 [Marinomonas sp. MED121]
gi|86161949|gb|EAQ63241.1| hypothetical protein MED121_24199 [Marinomonas sp. MED121]
Length = 369
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 47/160 (29%), Positives = 77/160 (48%), Gaps = 30/160 (18%)
Query: 363 KNSNGHFNIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPEL 422
K + F +VG+G S VLPV+T + +++ + D +++E PE
Sbjct: 229 KETTEEFKPQNVGFGLSYVLPVVTSI-----LSAAKGDL-------------LIIENPES 270
Query: 423 HLHPAMQAKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKLGVDDVNIL 482
HLHPA Q+ + A+ L+VE+HS +N + +++ + DDV +
Sbjct: 271 HLHPAGQSLMGKLC------AIAANNGVQLIVESHSDHFLNGMRVAVKQKVIAADDVKVF 324
Query: 483 LFQKDDNMQNSI-IKQISYTN---EGQLRDWPYGFFDPLD 518
Q+D + NSI ++ Y N EG++ WP GFFD D
Sbjct: 325 FLQRD--VHNSIHSSEVMYPNIDDEGRIDCWPEGFFDQWD 362
>gi|149203433|ref|ZP_01880403.1| hypothetical protein RTM1035_02410 [Roseovarius sp. TM1035]
gi|149143266|gb|EDM31305.1| hypothetical protein RTM1035_02410 [Roseovarius sp. TM1035]
Length = 628
Score = 66.2 bits (160), Expect = 4e-09, Method: Composition-based stats.
Identities = 46/154 (29%), Positives = 73/154 (47%), Gaps = 29/154 (18%)
Query: 372 ADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAMQAK 431
+DVG G SQ++PV+ M + + +EQPELH+HPA+Q
Sbjct: 487 SDVGVGISQMVPVVV----ACLMEPD---------------GLVAIEQPELHIHPAIQVG 527
Query: 432 IADTLIKTIDETREADQCA-ILVVETHSQALINRIGRRIRE--------GKLGVDDVNIL 482
+ D I + A L+VETHS+ ++ R+ RRIRE G LG+ ++
Sbjct: 528 LGDLFISATSTYQPATGPERSLLVETHSEHIMLRLLRRIREQSENELPPGILGLTPNDLS 587
Query: 483 LFQKDDNMQNSIIKQISYTNEGQLRD-WPYGFFD 515
+ + Q + K++ + +G D WP GFF+
Sbjct: 588 VVYVEATEQGPVFKKLRVSEDGDFIDSWPKGFFE 621
>gi|15600970|ref|NP_232600.1| hypothetical protein VCA0200 [Vibrio cholerae O1 biovar eltor str.
N16961]
gi|121586742|ref|ZP_01676525.1| hypothetical protein VC274080_A0226 [Vibrio cholerae 2740-80]
gi|121727730|ref|ZP_01680818.1| hypothetical protein VCV52_A0213 [Vibrio cholerae V52]
gi|147672080|ref|YP_001215863.1| hypothetical protein VC0395_1027 [Vibrio cholerae O395]
gi|153222335|ref|ZP_01953085.1| hypothetical protein A53_A0270 [Vibrio cholerae MAK 757]
gi|153816781|ref|ZP_01969448.1| hypothetical protein A5C_A0295 [Vibrio cholerae NCTC 8457]
gi|9657593|gb|AAF96113.1| hypothetical protein VC_A0200 [Vibrio cholerae O1 biovar eltor str.
N16961]
gi|121549039|gb|EAX59076.1| hypothetical protein VC274080_A0226 [Vibrio cholerae 2740-80]
gi|121629947|gb|EAX62357.1| hypothetical protein VCV52_A0213 [Vibrio cholerae V52]
gi|124118972|gb|EAY37715.1| hypothetical protein A53_A0270 [Vibrio cholerae MAK 757]
gi|125618516|gb|EAZ47054.1| hypothetical protein VCMO10_A0504 [Vibrio cholerae MO10]
gi|126512584|gb|EAZ75178.1| hypothetical protein A5C_A0295 [Vibrio cholerae NCTC 8457]
gi|146314463|gb|ABQ19003.1| hypothetical protein VC0395_1027 [Vibrio cholerae O395]
Length = 374
Score = 64.7 bits (156), Expect = 2e-08, Method: Composition-based stats.
Identities = 45/148 (30%), Positives = 70/148 (47%), Gaps = 27/148 (18%)
Query: 369 FNIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAM 428
+ +VG+G S VLP++ L T +++E PE HLHP
Sbjct: 238 YRAVNVGFGLSYVLPIVLALLVT------------------KPGGLVIIENPEAHLHPRG 279
Query: 429 QAKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKLGVDDVNILLFQK-D 487
Q+ + + A+ +VVETHS +IN I R GK V D N+L +Q +
Sbjct: 280 QSYLGRLI------ALAAEAGVQVVVETHSDHIINGIRLMPRLGK--VQDKNLLFYQVIN 331
Query: 488 DNMQNSIIKQISYTNEGQLRDWPYGFFD 515
+++ + +K+I +GQ +WP GFFD
Sbjct: 332 EDVGTTTVKKIIVNAQGQFSEWPDGFFD 359
>gi|75908284|ref|YP_322580.1| hypothetical protein Ava_2063 [Anabaena variabilis ATCC 29413]
gi|75702009|gb|ABA21685.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 459
Score = 63.2 bits (152), Expect = 4e-08, Method: Composition-based stats.
Identities = 56/207 (27%), Positives = 87/207 (42%), Gaps = 41/207 (19%)
Query: 325 FIASLTHREKISYDDFIKDILGVTVDVPMESGMKS---------LRI----KNSNGH--- 368
++AS+ HR + D ++++ V + + S +++ LR+ +NS+
Sbjct: 254 YVASIIHRWQRPRDKRLQELNNALVSLGLTSKVQAKQLDDTQVELRVARLTRNSDNEKDM 313
Query: 369 FNIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAM 428
+IADVG+G SQ LPV+ L + +EQPE+HLHP
Sbjct: 314 VSIADVGFGVSQTLPVLVAL------------------LVAEPGQLVYIEQPEIHLHPRA 355
Query: 429 QAKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKLGVDDVNILLFQKDD 488
Q +A+ L A + +VVETHS LI + EG L D V + F + +
Sbjct: 356 QVAMAEIL------ANAAKRGVKIVVETHSDRLILATQSLVAEGNLSPDLVKLHWFTRQE 409
Query: 489 NMQNSIIKQISYTNEGQLRDWPYGFFD 515
+ I G DWP F D
Sbjct: 410 DGITK-INSADLDETGAFGDWPEDFGD 435
>gi|154494816|ref|ZP_02033821.1| hypothetical protein PARMER_03858 [Parabacteroides merdae ATCC
43184]
gi|154085366|gb|EDN84411.1| hypothetical protein PARMER_03858 [Parabacteroides merdae ATCC
43184]
Length = 340
Score = 62.0 bits (149), Expect = 9e-08, Method: Composition-based stats.
Identities = 40/110 (36%), Positives = 60/110 (54%), Gaps = 11/110 (10%)
Query: 407 FRESVKTTILMEQPELHLHPAMQAKIADTLIKTIDETREADQCAILVVETHSQALINRIG 466
R IL+E PE+HLHPA QAK+ + L + R Q +++ETH + LI ++G
Sbjct: 231 LRAKKNDVILIENPEIHLHPASQAKVGEFLAFIV---RAGIQ---VIIETHCEHLIYKVG 284
Query: 467 RRIREGKLGVDDVNILLFQKDDNMQNSIIKQISYTNEGQL-RDWPYGFFD 515
I + + DDV IL + +QN I + + +G+ RD+P GFFD
Sbjct: 285 YEIYKKRFLKDDVTILY---KEGIQNPFI-VLGFKEDGKFTRDFPEGFFD 330
>gi|114331937|ref|YP_748159.1| hypothetical protein Neut_1965 [Nitrosomonas eutropha C91]
gi|114308951|gb|ABI60194.1| conserved hypothetical protein [Nitrosomonas eutropha C91]
Length = 427
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 38/145 (26%), Positives = 72/145 (49%), Gaps = 25/145 (17%)
Query: 370 NIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAMQ 429
N+ DVG G +LPV+T YF +T+++E+PE+HLHP Q
Sbjct: 286 NLRDVGIGIPLILPVLTA------------GYF------APPGSTVILEEPEVHLHPLAQ 327
Query: 430 AKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKLGVDDVNILLFQKDDN 489
+ +A+ ++ + +R +VETHS+ L R+ I + + + +L ++ +
Sbjct: 328 SVLAEFFVR-LSRSRHIQ----FIVETHSEHLFRRMQTLIARKETSAEQIAMLFVERVAD 382
Query: 490 MQNSIIKQISYTNEGQLRDWPYGFF 514
N+I++++ G++ +WP FF
Sbjct: 383 --NAILRKLKVDEFGRVNNWPQHFF 405
>gi|16799908|ref|NP_470176.1| hypothetical protein lin0834 [Listeria innocua Clip11262]
gi|16413285|emb|CAC96066.1| lin0834 [Listeria innocua]
Length = 369
Score = 61.2 bits (147), Expect = 2e-07, Method: Composition-based stats.
Identities = 48/175 (27%), Positives = 84/175 (48%), Gaps = 31/175 (17%)
Query: 346 GVTVDVPMESGMK--SLRIKNSN---GHFNIADVGYGYSQVLPVITKLWHTAYMTSNQKD 400
GV +E+ ++ L +K SN G+ ++++VG+G S VLP++ +TS +
Sbjct: 204 GVEFSSTLENALRLVKLEVKQSNQDMGYVSLSNVGFGLSYVLPILLTC-----LTSKED- 257
Query: 401 YFFRYKFRESVKTTILMEQPELHLHPAMQAKIADTLIKTIDETREADQCAILVVETHSQA 460
+ I++E PE HLHP Q+ + L + R+ +VVETHS+
Sbjct: 258 ------------SLIIIENPEAHLHPKGQSLLGQFLAQVSSSKRQ------VVVETHSEH 299
Query: 461 LINRIGRRIREGKLGVDDVNILLFQKDDNMQNSIIKQISYTNEGQLRDWPYGFFD 515
+IN + + + +++ I F + N +K+IS +L +WP FFD
Sbjct: 300 IINGMRLFCLKNSISPNELTINFFSIKSSKTN--VKKISLNERMELLEWPDDFFD 352
>gi|154247664|ref|YP_001418622.1| hypothetical protein Xaut_3739 [Xanthobacter autotrophicus Py2]
gi|154161749|gb|ABS68965.1| conserved hypothetical protein [Xanthobacter autotrophicus Py2]
Length = 428
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 41/145 (28%), Positives = 72/145 (49%), Gaps = 25/145 (17%)
Query: 370 NIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAMQ 429
N+ DVG G SQVLPV+ +Y +TI++E+PE+HLHP Q
Sbjct: 286 NLRDVGIGISQVLPVLV----VSYFAPR--------------GSTIILEEPEIHLHPLAQ 327
Query: 430 AKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKLGVDDVNILLFQKDDN 489
+ +A+ +D +RE + +VETHS+ L R+ I + D + ++
Sbjct: 328 SALANLF---VDVSRE--RGIQFLVETHSEHLFRRMQTLIAKQDASSTDSAMYFVRRKG- 381
Query: 490 MQNSIIKQISYTNEGQLRDWPYGFF 514
+++ + + + G++++WP GFF
Sbjct: 382 -KSAELCTLEIDDFGRVKNWPSGFF 405
>gi|156974069|ref|YP_001444976.1| hypothetical protein VIBHAR_01780 [Vibrio harveyi ATCC BAA-1116]
gi|156525663|gb|ABU70749.1| hypothetical protein VIBHAR_01780 [Vibrio harveyi ATCC BAA-1116]
Length = 378
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 52/181 (28%), Positives = 78/181 (43%), Gaps = 34/181 (18%)
Query: 335 ISYDDFIKDILGVTVDVPMESGMKSLRIKNSNGHFNIADVGYGYSQVLPVITKLWHTAYM 394
IS F+ + V+V ++G +SL+ +N VG+G S VL ++T L +T
Sbjct: 214 ISVSSFLYEEASTAVNVFGQNGNESLKPQN---------VGFGVSYVLSIVTALLYT--- 261
Query: 395 TSNQKDYFFRYKFRESVKTTILMEQPELHLHPAMQAKIADTLIKTIDETREADQCAILVV 454
+++E PE HLHP Q+ + + T A ++V
Sbjct: 262 ---------------KKGGVVIIENPEAHLHPRGQSYLGRLIALT------AKSGVQVIV 300
Query: 455 ETHSQALINRIGRRIREGKLGVDDVNILLFQKDDNMQNSIIKQISYTNEGQLRDWPYGFF 514
ETHS L+N I R I D L+ + S + IS EG+L +WP GFF
Sbjct: 301 ETHSDHLLNGI-RVIGRIDDSFDAELFTLYYISQGQKQSNVDIISLNKEGKLSEWPSGFF 359
Query: 515 D 515
D
Sbjct: 360 D 360
>gi|30249834|ref|NP_841904.1| hypothetical protein NE1882 [Nitrosomonas europaea ATCC 19718]
gi|30180871|emb|CAD85793.1| conserved hypothetical protein [Nitrosomonas europaea ATCC 19718]
Length = 429
Score = 60.5 bits (145), Expect = 3e-07, Method: Composition-based stats.
Identities = 39/145 (26%), Positives = 68/145 (46%), Gaps = 25/145 (17%)
Query: 370 NIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAMQ 429
N+ DVG G + VLPV+ YF +T+++E+PE+HLHP Q
Sbjct: 286 NLRDVGMGVALVLPVLVA------------GYF------APAGSTVILEEPEVHLHPLAQ 327
Query: 430 AKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKLGVDDVNILLFQKDDN 489
A +A+ + + +R +VETHS+ L R+ I V+ + +L +
Sbjct: 328 AVLAEFFV-ALSRSRRIQ----FIVETHSEHLFRRMQTLIARKNTSVEQIALLFVESAAG 382
Query: 490 MQNSIIKQISYTNEGQLRDWPYGFF 514
N+ ++++ G++ +WP FF
Sbjct: 383 --NAALRRLEVDEFGRVSNWPQYFF 405
>gi|116620799|ref|YP_822955.1| hypothetical protein Acid_1680 [Solibacter usitatus Ellin6076]
gi|116223961|gb|ABJ82670.1| conserved hypothetical protein [Solibacter usitatus Ellin6076]
Length = 381
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 41/174 (23%), Positives = 79/174 (45%), Gaps = 24/174 (13%)
Query: 345 LGVTVDVPMESGMKSLRIKNSNGHFNIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFR 404
LG T +V ++ R + ++ + +VG+G + LP++ A + + +
Sbjct: 224 LGGTDEVLLQFRYDGRRGEPTSNFYRPTNVGFGLTYSLPILV-----ACLAAPRG----- 273
Query: 405 YKFRESVKTTILMEQPELHLHPAMQAKIADTLIKTIDETREADQCAILVVETHSQALINR 464
+ +L+E PE HLHP QA + + + R A ++VETHS L+N
Sbjct: 274 --------SLLLLENPEAHLHPQGQAALGELV------ARCASDGVQIIVETHSDHLLNG 319
Query: 465 IGRRIREGKLGVDDVNILLFQKDDNMQNSIIKQISYTNEGQLRDWPYGFFDPLD 518
+ ++ + V + F + ++ ++ + + G+L +WP GFFD D
Sbjct: 320 VRLAVKRRIIDNGTVALHFFTRSIETGDASVQSPAVLDSGRLSNWPAGFFDQWD 373
>gi|53713058|ref|YP_099050.1| hypothetical protein BF1769 [Bacteroides fragilis YCH46]
gi|52215923|dbj|BAD48516.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
Length = 377
Score = 58.2 bits (139), Expect = 1e-06, Method: Composition-based stats.
Identities = 37/104 (35%), Positives = 56/104 (53%), Gaps = 7/104 (6%)
Query: 415 ILMEQPELHLHPAMQAKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKL 474
+L+E PE HLHP QAK+A+ LI + +T +V+ETHS + N I R I+ +
Sbjct: 272 LLIENPEAHLHPCGQAKLAE-LIALVSQTG-----VQVVIETHSDHIFNGIRRAIKNNMV 325
Query: 475 GVDDVNILLFQKDDNMQNSIIKQISYTNEGQLRDWPYGFFDPLD 518
VD V I F K N ++ +I G+++++ G FD D
Sbjct: 326 DVDHVAIHYF-KHGNENHTEAVEIGIEKNGRVKNYEEGLFDQFD 368
>gi|113474043|ref|YP_720104.1| hypothetical protein Tery_0136 [Trichodesmium erythraeum IMS101]
gi|110165091|gb|ABG49631.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 224
Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats.
Identities = 44/153 (28%), Positives = 65/153 (42%), Gaps = 28/153 (18%)
Query: 369 FNIADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAM 428
+ +VG+G S LP+I + +T I++E PE HLHP
Sbjct: 86 YRSTNVGFGISYTLPIIVAI------------------LSSPPETLIIIENPEAHLHPKG 127
Query: 429 QAKIADTLIKTIDETREADQCAI-LVVETHSQALINRIGRRIREGKLGVDDVNILLFQKD 487
QAK+ + L A C + +VVETHS ++N I + K+ V + F
Sbjct: 128 QAKMGELLAL-------AASCGMQIVVETHSDHVLNGIRLIVYREKVDPKKVQLHYFDSQ 180
Query: 488 DNMQNSIIKQIS--YTNEGQLRDWPYGFFDPLD 518
+ I + IS G++ WP GFFD D
Sbjct: 181 KRGREFITEVISPQIDRNGRIDKWPEGFFDEWD 213
>gi|120554381|ref|YP_958732.1| hypothetical protein Maqu_1460 [Marinobacter aquaeolei VT8]
gi|120324230|gb|ABM18545.1| conserved hypothetical protein [Marinobacter aquaeolei VT8]
Length = 391
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 36/105 (34%), Positives = 52/105 (49%), Gaps = 12/105 (11%)
Query: 415 ILMEQPELHLHPAMQAKIADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKL 474
++++ PE HLHP Q+ + L K A ++VETHS ++N ++ +
Sbjct: 282 LIIDSPEAHLHPLGQSNMGKFLAKI------AASGVQVLVETHSDHVLNGARLAVKSRVI 335
Query: 475 GVDDVNILLF----QKDDNMQNSIIKQISYTNEGQLRDWPYGFFD 515
DV IL F QKD + ++ I I EG L DWP GFFD
Sbjct: 336 DSRDVGILFFNRPPQKDGDPEHIIAPSID--KEGNLSDWPLGFFD 378
>gi|90579612|ref|ZP_01235421.1| hypothetical protein VAS14_01581 [Vibrio angustum S14]
gi|90439186|gb|EAS64368.1| hypothetical protein VAS14_01581 [Vibrio angustum S14]
Length = 579
Score = 56.2 bits (134), Expect = 6e-06, Method: Composition-based stats.
Identities = 53/158 (33%), Positives = 75/158 (47%), Gaps = 39/158 (24%)
Query: 372 ADVGYGYSQVLPVITKLWHTAYMTSNQKDYFFRYKFRESVKTTILMEQPELHLHPAMQAK 431
AD+G G SQV P+I AY K I +EQPELH+HP Q
Sbjct: 446 ADIGAGVSQVFPLIVA---AAYAN----------------KGLICVEQPELHIHPKAQVG 486
Query: 432 IADTLIKTIDETREADQCAILVVETHSQALINRIGRRIREGKLGV----------DDVNI 481
+ D L+ +DE + +VETHS+ LI R+ RR+RE G +D++I
Sbjct: 487 LGD-LLTQVDEKNQ------FLVETHSEHLILRLLRRVRETTEGTLAKHLTPVKKEDISI 539
Query: 482 LLFQKDDNMQNSIIKQISYTNEGQLR-DWPYGFFDPLD 518
+ + ++ I + T++G L DWP GFFD D
Sbjct: 540 IYLETAES--GVIAHRTLLTDDGDLETDWPNGFFDERD 575
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.322 0.138 0.404
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,827,341,725
Number of Sequences: 5470121
Number of extensions: 76571668
Number of successful extensions: 218612
Number of sequences better than 1.0e-05: 42
Number of HSP's better than 0.0 without gapping: 7
Number of HSP's successfully gapped in prelim test: 35
Number of HSP's that attempted gapping in prelim test: 218480
Number of HSP's gapped (non-prelim): 78
length of query: 518
length of database: 1,894,087,724
effective HSP length: 137
effective length of query: 381
effective length of database: 1,144,681,147
effective search space: 436123517007
effective search space used: 436123517007
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 132 (55.5 bits)