BLASTP 2.2.18 [Mar-02-2008]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= PGN_0341 hypothetical protein
(506 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
6,515,104 sequences; 2,222,278,849 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|34540087|ref|NP_904566.1| hypothetical protein PG0236 [P... 1026 0.0
gi|154490837|ref|ZP_02030778.1| hypothetical protein PARMER... 210 3e-52
gi|150008703|ref|YP_001303446.1| hypothetical protein BDI_2... 206 3e-51
gi|110640071|ref|YP_680281.1| hypothetical protein CHU_3705... 156 4e-36
gi|163755490|ref|ZP_02162609.1| hypothetical protein KAOT1_... 139 7e-31
gi|86141553|ref|ZP_01060099.1| hypothetical protein MED217_... 133 3e-29
gi|91215782|ref|ZP_01252752.1| hypothetical protein P700755... 133 3e-29
gi|88713632|ref|ZP_01107714.1| hypothetical protein FB2170_... 131 1e-28
gi|88803708|ref|ZP_01119232.1| hypothetical protein PI23P_0... 131 2e-28
gi|83857281|ref|ZP_00950809.1| hypothetical protein CA2559_... 129 7e-28
gi|124003621|ref|ZP_01688470.1| hypothetical protein M23134... 129 7e-28
gi|86131952|ref|ZP_01050549.1| hypothetical protein MED134_... 129 8e-28
gi|89890252|ref|ZP_01201762.1| conserved hypothetical prote... 127 2e-27
gi|86135566|ref|ZP_01054147.1| hypothetical protein MED152_... 127 2e-27
gi|126663955|ref|ZP_01734949.1| hypothetical protein FBBAL3... 123 3e-26
gi|149372201|ref|ZP_01891471.1| hypothetical protein SCB49_... 112 9e-23
gi|146302447|ref|YP_001197038.1| hypothetical protein Fjoh_... 107 2e-21
gi|150024754|ref|YP_001295580.1| hypothetical protein FP066... 96 5e-18
>gi|34540087|ref|NP_904566.1| hypothetical protein PG0236 [Porphyromonas gingivalis W83]
gi|34396398|gb|AAQ65465.1| hypothetical protein PG_0236 [Porphyromonas gingivalis W83]
Length = 506
Score = 1026 bits (2653), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 502/506 (99%), Positives = 504/506 (99%)
Query: 1 MKEISRLTAVILLAVGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTR 60
MKEISRLTAVILLAVGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTR
Sbjct: 1 MKEISRLTAVILLAVGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTR 60
Query: 61 TFMVYNRLNRSLRLSEVELVGGKNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEATFPE 120
TFMVYNRLNRSLRLSEVELVGGKNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEATFPE
Sbjct: 61 TFMVYNRLNRSLRLSEVELVGGKNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEATFPE 120
Query: 121 GESDDPVEVKDSLRFLINGRTDYVLLQGFRQNVDEVTALVIDRDTIFGAQRPTLLLDSLV 180
GESDDPVEVKDSLRFLINGRTDYVLLQGFRQNVDEVTALVIDRDTIFGAQRPTLLLDSLV
Sbjct: 121 GESDDPVEVKDSLRFLINGRTDYVLLQGFRQNVDEVTALVIDRDTIFGAQRPTLLLDSLV 180
Query: 181 VQQGATLTLPAGCRLLMANKAHIKVRGRLMAEGNPAKRVMIENLRHDLLVQDVPYTLVPG 240
VQQGATLTLPAGCRLLMANKAHIKVRGRLMAEGNPAKRVMIENLRHDLLVQDVPYTLVPG
Sbjct: 181 VQQGATLTLPAGCRLLMANKAHIKVRGRLMAEGNPAKRVMIENLRHDLLVQDVPYTLVPG 240
Query: 241 QWGGILFSEESRGNELRYTTIRNGRWGIIAEGGKDVTTPKLLLDGCMVTNTKGSGLAASG 300
QWGGILFSEESRGNELRYTTIRNGRWGIIAEGGKDVTTPKLLL+GCMVTNTKGSGLAASG
Sbjct: 241 QWGGILFSEESRGNELRYTTIRNGRWGIIAEGGKDVTTPKLLLEGCMVTNTKGSGLAASG 300
Query: 301 GYIRILNSEISNTLGYTVALFGSVCELTQSTVCNFYRWDNRQGEALRYVTAFAPDVAGGS 360
GYIRILNSEISNTLGYTVALFGSVCELTQSTVCNFYRWDNRQGEALRYVTAFAPDVAGGS
Sbjct: 301 GYIRILNSEISNTLGYTVALFGSVCELTQSTVCNFYRWDNRQGEALRYVTAFAPDVAGGS 360
Query: 361 YTPSSDSRLILSNSIVDGSRSVVKQGDKESGGEISLSDGSPSDNEAPVLARLTIRNSYVR 420
YTPSSDSRLILSNSIVDGSRSVVKQGDKESGGEISLSDGSPSDNEA VLARLTIRNSYVR
Sbjct: 361 YTPSSDSRLILSNSIVDGSRSVVKQGDKESGGEISLSDGSPSDNEASVLARLTIRNSYVR 420
Query: 421 ARSSILNVGYNVMEADKNNPTDSIYYSVGYDLIKKKYNFRYDYHPLPNAPFVGKADPAII 480
ARSSILNVGYNVMEADKNNP DSIYYSVGYDLIKKKYNFRYDYHPLPNAPFVGKADPAII
Sbjct: 421 ARSSILNVGYNVMEADKNNPADSIYYSVGYDLIKKKYNFRYDYHPLPNAPFVGKADPAII 480
Query: 481 ALFPNDLTGEPRRTATVGAFEVKPRP 506
ALFP+DLTGEPRRTATVGAFEVKPRP
Sbjct: 481 ALFPHDLTGEPRRTATVGAFEVKPRP 506
>gi|154490837|ref|ZP_02030778.1| hypothetical protein PARMER_00754 [Parabacteroides merdae ATCC
43184]
gi|154088585|gb|EDN87629.1| hypothetical protein PARMER_00754 [Parabacteroides merdae ATCC
43184]
Length = 501
Score = 210 bits (534), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 159/515 (30%), Positives = 241/515 (46%), Gaps = 40/515 (7%)
Query: 1 MKEISRLTAVILLAVGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTR 60
+ E+ +L A + + VL SC + Y+ +P R FS DT+ FDT+F+ + S+T+
Sbjct: 12 INEMKKLAAPLFILFFIVLNLLSCDGLDENYSNNPNHRLSFSVDTLSFDTVFTTIGSATK 71
Query: 61 TFMVYNRLNRSLRLSEVELVGGKNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEATFPE 120
FM+YNR ++ L +SE+ L G+ G+R+NVDG G F ++ I DS+++FVE T
Sbjct: 72 EFMIYNRNDQPLLISEIMLASGEETGFRINVDGRKGDHFQNVRIQADDSLYVFVEVTVNP 131
Query: 121 GESDDPVEVKDSLRFLINGRTDYVLLQGFRQNVDEV-TALVIDRDTIFGAQRPTLLLDSL 179
S+ P+ V DS+ F NG V L+ + QNV+ L++ +DT F A+RP L+ DSL
Sbjct: 132 NASNQPLLVDDSIIFTTNGVKQSVRLEAYGQNVNLYKNGLILTQDTHFTAERPYLIYDSL 191
Query: 180 VVQQGATLTLPAGCRLLMANKAHIKVRGRLMAEGNPAKRVMIENLRHDLLVQDV-PYTLV 238
V+ TL + G M + A + G L+A+G ++ R D ++ D+ PY
Sbjct: 192 VISPNITLNIDPGATFYMHDTAKVITYGTLLAKGTRENPIVFRGDRLDFILNDILPYDRT 251
Query: 239 PGQWGGILFSEESRGNELRYTTIRNGRWGIIAEGGKDVTTPKLLLDGCMVTNTKGSGLAA 298
P QWGGI F ES N + +RNG+ G+ E + KL L +TN +A
Sbjct: 252 PSQWGGIFFRPESYHNIMDNVIVRNGKTGLTFEKSSPDES-KLKLSNSQITNMGEHLFSA 310
Query: 299 SGGYIRILNSEISNTLGYTVALFGSVCELTQSTVCNFYRWDNRQGEALRYVTAFAPDVAG 358
I + N+E++N G V L G T+ NF + R L +A
Sbjct: 311 RNCKIEVTNTELTNAGGGVVVLIGGNYCFVHCTLANFMTLEKRTTPCLT--------MAN 362
Query: 359 GSYTPSSDSRLILSNSIVDGSRSVVKQGDKESGGEISLS-DGSPSDNEAPVLARLTIRNS 417
+ + + N I+DGS K+ K GE++LS DG+ S
Sbjct: 363 NANEEAYPLKATFDNCIIDGSFDAGKEAYK---GELNLSVDGAAS-------------FE 406
Query: 418 YVRARSSILNVGYN---VMEADKNNPTDSIYYSVGYDLIKKKYNFRYDYHPLPNAPF-VG 473
Y+ I G N E N + S G +K +R+D+ P VG
Sbjct: 407 YLFNHCVIKTNGSNNDHFKEVLFTNTSPSYRLKGG-----EKNKYRFDFRPDSLTTLGVG 461
Query: 474 KADPAIIALFPNDLTGEPRRT---ATVGAFEVKPR 505
KAD + +P D G R T +GA+E P+
Sbjct: 462 KADIFVTQQYPVDRYGINRLTNDGPDIGAYEFVPK 496
>gi|150008703|ref|YP_001303446.1| hypothetical protein BDI_2093 [Parabacteroides distasonis ATCC
8503]
gi|149937127|gb|ABR43824.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 495
Score = 206 bits (524), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 153/517 (29%), Positives = 240/517 (46%), Gaps = 52/517 (10%)
Query: 4 ISRLTAVILLAVGFVLANGSCSDDFD-KYAESPEDRAEFSADTIKFDTLFSRVSSSTRTF 62
+ RLT + + +L D D Y+ +P R FS DT+ FDT+FS + S+TR F
Sbjct: 1 MKRLTITLFILATLLLNMLPACDGLDDHYSTNPTYRLSFSTDTLAFDTIFSTIGSTTRQF 60
Query: 63 MVYNRLNRSLRLSEVELVGGKNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEATFPEGE 122
M+YN+ + L + + L G+ G+R+NVDG G+ F+++ IL DSM++FVE T
Sbjct: 61 MIYNKNSEPLSIESIMLASGEATGFRMNVDGRKGSSFNNVGILANDSMYVFVEVTVDPNG 120
Query: 123 SDDPVEVKDSLRFLINGRTDYVLLQGFRQNVDEVT-ALVIDRDTIFGAQRPTLLLDSLVV 181
+ P+ ++DS+ F +NG VLL+ + Q+V+ + I +D+I A RP L+ DSLV+
Sbjct: 121 GNQPLLIQDSVLFTVNGIRQSVLLEAYGQDVNLYKGGVTITKDSILTANRPYLIYDSLVI 180
Query: 182 QQGATLTLPAGCRLLMANKAHIKVRGRLMAEGNPAKRVMIENLRHDLLVQDV-PYTLVPG 240
+G +L + G M +KA + V G + A G + + R D ++ D+ PY PG
Sbjct: 181 AKGVSLNIEKGATFYMHDKASLIVHGSMNALGTLDEPITFRGDRLDYILNDILPYDRTPG 240
Query: 241 QWGGILFSEESRGNELRYTTIRNGRWGIIAEGGKDVTT---PKLLLDGCMVTNTKGSGLA 297
QWGGI F +S GN +RNG G+ E ++T PK+ ++ +TN
Sbjct: 241 QWGGITFKADSYGNVWDNVIVRNGTSGVYCE----LSTPDRPKIKINNSQITNMGSDLFF 296
Query: 298 ASGGYIRILNSEISNTLGYTVALFGSVCELTQSTVCNFYRWDNRQ---------GEALRY 348
A + N+E SN G + L G T+ N+ R+ + L
Sbjct: 297 AINCDVIATNTEFSNAGGSVLTLVGGKYYFAHCTMANYMSLTKREMASETVPLDSKCLYL 356
Query: 349 VTAFAPDVAGGSYTPSSDSRLILSNSIVDGSRSVVKQGDKESGGEISLSDGSPSDNEAPV 408
+ D G P ++ N +DGS V + D + + N +
Sbjct: 357 LNNVTVDGNG----PYPITQAYFDNCTIDGSYDVELKADGSTDFDYRF-------NHCAL 405
Query: 409 LARLTIRNSYVRARSSILNVGYNVMEADKNNPTDSIYYSVGYDLIKKKYNFRYDYHPLPN 468
A+ + + + +L + K P+ Y VG K Y+FR D +
Sbjct: 406 KAKESSSDHF----KEVLFI--------KKTPS---YRKVGGKQNKYTYDFRPDS---VS 447
Query: 469 APFVGKADPAIIALFPNDLTGEPRRTA----TVGAFE 501
VGKADP I +P D G R T+ T+GA+E
Sbjct: 448 TTGVGKADPEITKNYPIDRYGVNRLTSSNGPTIGAYE 484
>gi|110640071|ref|YP_680281.1| hypothetical protein CHU_3705 [Cytophaga hutchinsonii ATCC 33406]
gi|110282752|gb|ABG60938.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
Length = 442
Score = 156 bits (395), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 91/266 (34%), Positives = 136/266 (51%), Gaps = 9/266 (3%)
Query: 3 EISRLTAVILLAVGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTRTF 62
+++R+ V +L + F+ SC + + P DR EFS DT+ FDTLFS V S T+
Sbjct: 2 KLNRIPVVFILVLQFL----SCMRKDEVLSTDPSDRLEFSEDTVFFDTLFSSVGSITKRL 57
Query: 63 MVYNRLNRSLRLSEVELVGGKNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEATFPEGE 122
+VYN +R+S V L GG N Y V V+G G SD+ I DS+++ V+
Sbjct: 58 LVYNPGKNKIRISSVTLAGGSNSAYSVIVNGLNGPSISDIDIRGNDSIYVLVKVLIDPSN 117
Query: 123 SDDPVEVKDSLRFLINGRTDYVLLQGFRQNVDEVTALVIDRDTIFGAQRPTLLLDSLVVQ 182
D P V DS+ F N VLL + Q+ I D ++ +P +L D+L V
Sbjct: 118 QDLPFLVSDSILFYTNTNRQKVLLSAYGQDAHFFGKESIACDAVWVNDKPYVLYDTLQVN 177
Query: 183 QGATLTLPAGCRLLMANKAHIKVRGRLMAEGNPAKRVMIENLRHDLLVQDVPYTLVPGQW 242
G TLT+ GC++ A + VRG L+A+G+ ++ ++ + + QD GQW
Sbjct: 178 TGCTLTIQPGCKIYFHTAAALNVRGTLIAQGSSSEPLLFASDKLSKTAQD-----DRGQW 232
Query: 243 GGILFSEESRGNELRYTTIRNGRWGI 268
GG++F S GN + + TIRN I
Sbjct: 233 GGLIFQSSSSGNVISWATIRNSSTAI 258
>gi|163755490|ref|ZP_02162609.1| hypothetical protein KAOT1_04997 [Kordia algicida OT-1]
gi|161324403|gb|EDP95733.1| hypothetical protein KAOT1_04997 [Kordia algicida OT-1]
Length = 521
Score = 139 bits (349), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 144/538 (26%), Positives = 230/538 (42%), Gaps = 72/538 (13%)
Query: 7 LTAVILLAVGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTRTFMVYN 66
L ++LA+ + + SC DF S + +FS DT+ DT+F+ + SST VYN
Sbjct: 5 LYTFLILAITILWS--SCRKDFVTVPSSGQ--LQFSKDTVYLDTVFTNIGSSTYNLKVYN 60
Query: 67 RLNRSLRLSEVELVGGKNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEAT--------- 117
R + +R+ ++L G++ YR+NVDG G F D+ +L DSMFIFVE T
Sbjct: 61 RSDDDIRIPTIQLGEGESSNYRLNVDGIAGKVFQDVELLANDSMFIFVETTLDINDFAGG 120
Query: 118 ----------FPEGESDDPVE----VKDSLRFLINGRTD----YVLLQGFRQNVDEVTA- 158
F G + VE V+D++ FL + D LL G +E
Sbjct: 121 NQFLYTDAIEFDTGNNQQKVELVTLVQDAV-FLYPQQFDDGTVETLLLGVDDEGNETRIE 179
Query: 159 --LVIDRDTIFGAQRPTLLLDSLVVQQGATLTLPAGCR--------LLMANKAHIKVRGR 208
+ D + + ++P ++ V Q LT+ AG R L++ N ++V G
Sbjct: 180 GFFLEDSELTWTNEKPYVVYGYAAVGQNDVLTVEAGARVHFHADSGLIIGNMGSLQVNGV 239
Query: 209 LMAEGNPAKRVMIENLRHDLLVQDVPYTLVPGQWGGILFSEESRGNELRYTTIRNGRWGI 268
L + V+ E R + + VPGQWG I ++ S N + Y TI+N GI
Sbjct: 240 LSTTEDLEGEVIFEGDR-----LEPGFANVPGQWGTIWLTDGSTNNNINYATIKNASVGI 294
Query: 269 IAEGGKDVTTPKLLLDGCMVTNTKGSGLAASGGYIRILNSEISNTLGYTVAL-FGSVCEL 327
+ E T P L L+ + N+ GL A G+I NS I + ++ L G
Sbjct: 295 LTENSDGTTNPTLTLNNSKIYNSATIGLLARTGHIEANNSVIGSAGQASLYLNLGGEYTF 354
Query: 328 TQSTVCNFYRWDNRQGEALRYVTAFAPDVAGGSYTPSSDSRLILSNSIVDGSRSVVKQGD 387
T N+ W N A + +V G+ + + SN I+DGS S+
Sbjct: 355 RHCTFANY--WTNSFRVAPTVLIDNFLEVGDGTVFTADLEQADFSNCIIDGSNSI----- 407
Query: 388 KESGGEISLSDGSPSDNEAPVLARLTIRNSYVR---ARSSILNVGYNVMEADKNNPT-DS 443
E+ L +N++ VL RN ++ + LN + + A N+ D
Sbjct: 408 -----ELIL---QKVENDSSVLLNYNFRNCLIKFDDFNNRFLN---DELYAFNNSAIFDE 456
Query: 444 IYYSVGYDLIKKKYNFRYDYHPLPNAPFVGKADPAIIALFPNDLTGEPRRT-ATVGAF 500
I + + I K ++ ++ +G+ + + I +DL G PR + VGA+
Sbjct: 457 IELVINSNTIDYKDTANNEFFIGADSAAIGEGNTSNIPAVQSDLLGNPRTNPSDVGAY 514
>gi|86141553|ref|ZP_01060099.1| hypothetical protein MED217_06027 [Flavobacterium sp. MED217]
gi|85832112|gb|EAQ50567.1| hypothetical protein MED217_06027 [Leeuwenhoekiella blandensis
MED217]
Length = 506
Score = 133 bits (335), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 111/374 (29%), Positives = 174/374 (46%), Gaps = 57/374 (15%)
Query: 4 ISRLTAVILLAVGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTRTFM 63
I L +++LL+ SC +DF+ S EFS DT+ DT+FS + SST +
Sbjct: 2 IPALLSIVLLS--------SCREDFEPRLSS--GNLEFSRDTVYLDTVFSTIGSSTYSLK 51
Query: 64 VYNRLNRSLRLSEVELVGGKNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEATFPEGES 123
V+NR N + + V L G+N YR+NVDG G F ++ I KDS+FIF+EAT +
Sbjct: 52 VFNRSNDLISIPRVGLAQGENSKYRLNVDGLAGKTFENVEIRAKDSIFIFIEATIDLDQD 111
Query: 124 DDPVEVKDSLRFLINGRTDYVLL----------------QGFRQNVD----------EVT 157
+ + D L+F +G T V L G ++ ++ V
Sbjct: 112 KNALLYTDQLQFESSGNTQEVALVTLVQDAIFLFPSRNASGIKETLEIGTDASGNPISVE 171
Query: 158 ALVIDRDTI-FGAQRPTLLLDSLVVQQGATLTLPAGCRLLMANKAHIKV--RGRLMAEGN 214
+++ + + F ++P ++ V G T+ + AG RL + I V GRL+A G
Sbjct: 172 GFLLEEEQLRFTNEKPYVIYGYAGVPSGKTMQIEAGARLHFHENSGIIVANEGRLLAHGL 231
Query: 215 PAKRVMIENLRHDLLVQ----DVPYTLVPGQWGGILFSEESRGNELRYTTIRNGRWGIIA 270
++ + L +++ + + Y VPGQW I F+ S G+ L++TTI+N + GI+
Sbjct: 232 LSEDQQV--LEREIIFEGDRLEPKYGNVPGQWNTIWFTPGSTGS-LQHTTIKNSQIGILT 288
Query: 271 EGGKDVTTPKLLLDGCMVTNTKGSGL-----AASGGYIRILNSEISNTLGYTVALFGSVC 325
+G T L L + N+ GL A +G + + NS ISN G
Sbjct: 289 DGHSGNTDRNLELKNVQIYNSSAVGLYAVNAAVAGENLLVGNSGISNLWLRQ----GGNY 344
Query: 326 ELTQSTVCNFYRWD 339
T ST N+ WD
Sbjct: 345 SFTHSTFANY--WD 356
>gi|91215782|ref|ZP_01252752.1| hypothetical protein P700755_02657 [Psychroflexus torquis ATCC
700755]
gi|91186248|gb|EAS72621.1| hypothetical protein P700755_02657 [Psychroflexus torquis ATCC
700755]
Length = 516
Score = 133 bits (334), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 180/382 (47%), Gaps = 52/382 (13%)
Query: 1 MKEISRLTAVILLAVGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTR 60
MK+I IL+A+G +L+ SC DDF+ E EFS DT+ DT+F+ + SST
Sbjct: 1 MKQI----CFILVALG-LLSMTSCRDDFE--TEPSFGSLEFSRDTVYLDTVFTDIGSSTY 53
Query: 61 TFMVYNRLNRSLRLSEVELVGGKNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEAT--- 117
+F VYNR ++ + + V+L G++ +R+NVDG G S++ +L KDS+F+FVE T
Sbjct: 54 SFKVYNRSSQDISIPNVQLNQGESSNFRLNVDGVAGKSISNINVLAKDSIFVFVEVTADI 113
Query: 118 -----------------FPEGESDDPVE----VKDSLRFLINGRTDYVLLQGFRQNVDE- 155
F G + VE V+D++ FL R + + +DE
Sbjct: 114 QDLSQNELSFLYSDQINFDSGVNLQQVELVTLVQDAI-FLFPERFEDGSSETLTLGIDEE 172
Query: 156 -----VTALVIDRDTI-FGAQRPTLLLDSLVVQQGATLTLPAGCR--------LLMANKA 201
+ ++D + + F ++P ++ V G TLT+ AG R ++ AN A
Sbjct: 173 GEAIQIDGFILDAEHLTFTKEKPYVIYGFAGVPSGETLTIEAGARIHFHTNSGIIAANNA 232
Query: 202 HIKVRGRLMAEGNPAKRVMIENLRHDLLVQDVPYTLVPGQWGGILFSEESRGNELRYTTI 261
++ G L ++ + +I D L + Y+ +PGQW I +E S + + TI
Sbjct: 233 SVQAIGGLSSDPELLENEII--FEGDRL--EPEYSNIPGQWASIWLTEGSTNHNFEHVTI 288
Query: 262 RNGRWGIIAEGGKDVTTPKLLLDGCMVTNTKGSGLAASGGYIRILNSEISNTLGYTVAL- 320
+N GI+ + P L + + N+ GL A G+I N I+N+ ++ L
Sbjct: 289 KNSTVGILMDSNDGSAQPTLTIKNTQIYNSANVGLLARTGFIDAENLVINNSGQASLNLS 348
Query: 321 FGSVCELTQSTVCNFYRWDNRQ 342
G ST N+++ RQ
Sbjct: 349 LGGRYNFRHSTFANYWQNSFRQ 370
>gi|88713632|ref|ZP_01107714.1| hypothetical protein FB2170_13201 [Flavobacteriales bacterium
HTCC2170]
gi|88708142|gb|EAR00380.1| hypothetical protein FB2170_13201 [Flavobacteriales bacterium
HTCC2170]
Length = 509
Score = 131 bits (330), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 116/425 (27%), Positives = 193/425 (45%), Gaps = 68/425 (16%)
Query: 1 MKEISRLTAVILLAVGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTR 60
M+ +V+LL + + SC DF+ A EFS DT+ DT+F+ + SST
Sbjct: 1 MQRYLYFISVLLLVILW----SSCRKDFEYAANVG--NLEFSKDTVFLDTIFTNIGSSTY 54
Query: 61 TFMVYNRLNRSLRLSEVELVGGKNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEATFPE 120
T VYN + + + L G+N YR+NVDG G +F ++ IL +DS+FIF+E T+ +
Sbjct: 55 TLKVYNPNRDDIEIPTIGLEQGQNSKYRLNVDGVAGKEFRNIPILAQDSLFIFIETTYDQ 114
Query: 121 GESDDP----------------------VEVKDSL----RFLINGRTDYVLLQGFRQNVD 154
S D VKD++ R +G + +LL G N +
Sbjct: 115 SASIDNKFLYTDVIIFDSGINQQEVPLITLVKDAVFLYPRTQADGTKETILL-GLDVNGE 173
Query: 155 EVTA--LVIDRDTI-FGAQRPTLLLDSLVVQQGATLTLPAGCRL--------LMANKAHI 203
E+ A +D + F ++P ++ V++G T+ + AG RL +++ A I
Sbjct: 174 EIHAEGFYLDETELEFNNEKPYVIYGFAAVKEGETINIAAGSRLHFHKNSGIYVSDGASI 233
Query: 204 KVRGRLMAEGNPAKRVMIEN---LRHDLLVQDVPYTLVPGQWGGILFSEESRGNELRYTT 260
+ G L + R ++EN D L + ++ +PGQWG + + S N + Y T
Sbjct: 234 SINGSLSND-----RDILENEVIFEGDRL--EPEFSNIPGQWGLLWIASGSNSNNIEYLT 286
Query: 261 IRNGRWGIIAEGGKDVTTPKLLLDGCMVTNTKGSGLAASGGYI----RILNSEISNTLGY 316
++N G+ EG + + +P L + + N+ + L A +I IL +N+L
Sbjct: 287 LKNATTGVFVEGDEQLLSPTLNIRNTQIYNSANTNLWAKNAFIVAENVILGGAGNNSLHC 346
Query: 317 TVALFGSVCELTQSTVCNFYRWDNRQGEALRYVTAFAPDVAGGSYTPSSDSRLILSNSIV 376
+ G T ST+ N++ R G ALR + P+ G ++ N IV
Sbjct: 347 NL---GGNYTFTHSTIANYWIHGFRTGTALR-IDNHDPNEVG------ELTKANFYNCIV 396
Query: 377 DGSRS 381
DG+ +
Sbjct: 397 DGNNT 401
>gi|88803708|ref|ZP_01119232.1| hypothetical protein PI23P_00410 [Polaribacter irgensii 23-P]
gi|88780441|gb|EAR11622.1| hypothetical protein PI23P_00410 [Polaribacter irgensii 23-P]
Length = 503
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 113/405 (27%), Positives = 185/405 (45%), Gaps = 43/405 (10%)
Query: 10 VILLAVGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTRTFMVYNRLN 69
V ++ +++ SC DF S EFS DT+ DT+F + S+T + VYNR N
Sbjct: 5 VFFISCVVLISASSCRKDFSTVPNSGN--LEFSKDTVFLDTIFKNIGSATYSLKVYNRGN 62
Query: 70 RSLRLSEVELVGGKNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEAT------------ 117
++ + ++L GK+ YR++VDG G +F+D+ IL KDS++IF+E T
Sbjct: 63 NAITIPRIQLKKGKSSLYRLSVDGIPGKEFNDIDILAKDSIYIFIETTVNNTKKRALLYT 122
Query: 118 ----FPEGESDDPVE----VKDSLRFLING------RTDYVLLQGFRQNVDEVTALVIDR 163
F GE ++ +KD+ F+ G R D + L G Q + D
Sbjct: 123 DKILFDNGEHQQNIDLITLIKDA-NFIYPGKDAVSMRIDSLSLDG--QATTIKGRFLKDS 179
Query: 164 DTIFGAQRPTLLLDSLVVQQGATLTLPAGCRLLMANKAHIKV--RGRLMAEGNPAKRVMI 221
+ IF ++PT++ + TLT+ G R+ + + + V +G L G PA++V+
Sbjct: 180 ELIFTKEKPTVIYGYAAIPANKTLTITEGARIYFHDNSGLIVDKKGSLKVNGTPAEKVIF 239
Query: 222 ENLRHDLLVQDVPYTLVPGQWGGILFSEESRGNELRYTTIRNGRWGIIAEGGKDVTTPKL 281
E R +Q+ + PGQWG I S+ N + + I+NG GI+ + T P L
Sbjct: 240 EGDR----LQN-RFRKTPGQWGTIWMRAGSKDNTINHAQIKNGIIGILIDSISASTAPTL 294
Query: 282 LLDGCMVTNTKGSGLAASGGYIRILNSEISNTLGYTVAL-FGSVCELTQSTVCNFYRWDN 340
+ + N G+ A I N I ++A+ G T ST N++
Sbjct: 295 NIQNTEIYNHFNFGILARETSIDGHNVVIGAAGQASLAVTMGGTYNFTHSTFANYWNNGI 354
Query: 341 RQGEALRYVTAFA--PDVAGGSYTPSSDSRLI-LSNSIVDGSRSV 382
RQ A+ + FA + G T + D + +N I DG+ ++
Sbjct: 355 RQLPAV-LINNFAVYNNEGGQEITETRDLKAANFTNCIFDGNNNI 398
>gi|83857281|ref|ZP_00950809.1| hypothetical protein CA2559_10793 [Croceibacter atlanticus
HTCC2559]
gi|83848648|gb|EAP86517.1| hypothetical protein CA2559_10793 [Croceibacter atlanticus
HTCC2559]
Length = 515
Score = 129 bits (323), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 111/396 (28%), Positives = 174/396 (43%), Gaps = 64/396 (16%)
Query: 12 LLAVGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTRTFMVYNRLNRS 71
LL ++ SC +DF+ E + EFS DT+ DT+F+ + SST VYNR ++
Sbjct: 7 LLLFCCIIFWSSCRNDFE--FEPSTGQLEFSKDTVYLDTVFTNIGSSTYNLKVYNRSDQD 64
Query: 72 LRLSEVELVGGKNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEAT-------------- 117
+R+ V+L G++ YR+NVDG G F D+ IL DS+FIFVE T
Sbjct: 65 IRIPNVQLSQGESSSYRLNVDGVAGKVFRDVEILANDSIFIFVETTADIQELSQNALEFL 124
Query: 118 ------FPEGESDDPVE----VKDSLRFLINGRT-----------------DYVLLQGFR 150
F GE+ VE +KD++ FL R D +L++GF
Sbjct: 125 YEDAILFDGGENTQRVELVTLIKDAV-FLFPERDAMTGEVETLQLGTDPNGDPILIEGF- 182
Query: 151 QNVDEVTALVIDRDTIFGAQRPTLLLDSLVVQQGATLTLPAGCRLLMANKAHIKVRGRLM 210
+ D + F ++P ++ V TLT+ AG R+L + I V +
Sbjct: 183 --------FLEDNELTFTNEKPYVIYGFAAVGSNKTLTVNAGARVLFHANSGIIVADQGS 234
Query: 211 AEGNPAKRVMIENLRHDLLVQ----DVPYTLVPGQWGGILFSEESRGNELRYTTIRNGRW 266
+ N E L ++++ + + Y+ +PGQW I + S + YTTI+NG
Sbjct: 235 MQVNGELSTDPELLENEVIFESDRLETAYSNIPGQWSTIWLTAGSTNHNFNYTTIKNGTV 294
Query: 267 GIIA---EGGKDVTTPKLLLDGCMVTNTKGSGLAASGGYIRILNSEISNTLGYTVAL-FG 322
G++ +GG+D P L + + N+ GL + G I N I+ + L G
Sbjct: 295 GLLMDSNDGGED---PTLTIRNSQIYNSSNIGLLSRTGSILGENLVIAEAGQSAMVLELG 351
Query: 323 SVCELTQSTVCNFYRWDNRQGEALRYVTAFAPDVAG 358
E +T N++ RQ A+ F +A
Sbjct: 352 GSYEFNHATFANYWSRSFRQTPAVVISNTFGETLAA 387
>gi|124003621|ref|ZP_01688470.1| hypothetical protein M23134_03280 [Microscilla marina ATCC 23134]
gi|123991190|gb|EAY30642.1| hypothetical protein M23134_03280 [Microscilla marina ATCC 23134]
Length = 450
Score = 129 bits (323), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 88/297 (29%), Positives = 143/297 (48%), Gaps = 8/297 (2%)
Query: 41 FSADTIKFDTLFSRVSSSTRTFMVYNRLNRSLRLSEVELVGGKNRGYRVNVDGHVGTKFS 100
FS I FDTLF+ S T+ VYN ++R++ +E+ G N Y V++ G GT F+
Sbjct: 36 FSDKAIVFDTLFTNTRSITQRLRVYNPDKNAIRINRIEIGGKANSPYSVSIKGEKGTVFT 95
Query: 101 DLTILPKDSMFIFVEATFPEGESDDPVEVKDSLRFLINGRTDYVLLQGFRQNVDEVTALV 160
D+ +L KDS+ + VE P V DSL F N + V L + +N +
Sbjct: 96 DVELLGKDSLLVLVEINVPANSQTGVVIAFDSLLFTTNQQQQQVKLIAWSENAHVLQNYT 155
Query: 161 IDRDTIFGAQRPTLLLDSLVVQQGATLTLPAGCRLLMANK-AHIKVRGRLMAEGNPAKRV 219
I + + +P ++ DS+ V GA LT+ R+ +K A I+V+G L+ +G+ V
Sbjct: 156 ITGNETWDKTKPYIIQDSVTVAAGAILTIGDSTRVYGLDKNAFIRVKGNLIVKGDTGSIV 215
Query: 220 MIENLRHDLLVQDVPYTLVPGQWGGILFSEESRGNELRYTTIRNGRWGIIAEGGKDVTTP 279
+R +++ + GQW GI+ E G ++ Y I+N GI G T P
Sbjct: 216 TFTGIR-----REIEFEEQLGQWRGIVM-EAGAGVDISYALIKNAYTGIAITGNDADTAP 269
Query: 280 KLLLDGCMVTNTKGSGLAASGGYIRILNSEISNTLGYTVALF-GSVCELTQSTVCNF 335
L++ + N G+ A+ + + NS I+N +G T+ + G L T+ N+
Sbjct: 270 DLIVKNTTIKNMFEHGINANNADVLLENSLITNCIGNTLGVTNGGSYTLKHCTLANY 326
>gi|86131952|ref|ZP_01050549.1| hypothetical protein MED134_03095 [Cellulophaga sp. MED134]
gi|85817774|gb|EAQ38948.1| hypothetical protein MED134_03095 [Dokdonia donghaensis MED134]
Length = 517
Score = 129 bits (323), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 104/377 (27%), Positives = 168/377 (44%), Gaps = 47/377 (12%)
Query: 11 ILLAVGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTRTFMVYNRLNR 70
L+A+ ++ SC +DF+ + EFS DTI DT+F+ + SST VYNR +
Sbjct: 6 FLIALAGIITVSSCRNDFETTPSTG--NLEFSRDTIFLDTVFTNIGSSTYNLKVYNRSDE 63
Query: 71 SLRLSEVELVGGKNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEATFPEGESDDPVE-- 128
++ + V L G++ YR+NVDG G +F ++ IL DS+F+FVE T E + E
Sbjct: 64 AISIPSVRLGRGQDSDYRLNVDGVAGKEFEEVEILANDSIFVFVETTLDINEVSNASEEF 123
Query: 129 -VKDSLRFLINGRTDYVLLQGFRQNV-------DEVTALV-----------------IDR 163
D++ F G T V L ++ D T +V +D
Sbjct: 124 LYTDAIEFNSVGATQNVELVTLVKDAIFLFPERDGATGMVETLTVNNVETTLEGRYLMDD 183
Query: 164 DTIFGAQRPTLLLDSLVV-----QQGATLTLPAGCR--------LLMANKAHIKVRGRLM 210
+ F +P ++ + V TLT+ AG R LL+A+ A + + G L
Sbjct: 184 ELTFTNDKPYVIYGYMAVGDPDGSTAKTLTIEAGARVHFHADSGLLIADNATLNINGALS 243
Query: 211 AEGNPAKRVMIENLRHDLLVQDVPYTLVPGQWGGILFSEESRGNELRYTTIRNGRWGIIA 270
A+ + +I D L + ++ VPGQWG IL ++ S N + Y TI+N GI+
Sbjct: 244 ADPIALENEVI--FESDRL--EPAFSNVPGQWGTILLTDGSVNNTINYATIKNATVGILN 299
Query: 271 EGGKDVTTPKLLLDGCMVTNTKGSGLAASGGYIRILNSEISNTLGYTVAL-FGSVCELTQ 329
E + + L L + N GL + N+ I+N +++ + G + T
Sbjct: 300 EASPNAASASLTLTNSQIYNASAVGLLNRFTSVEASNTVINNCGQFSMLVQLGGIYNFTH 359
Query: 330 STVCNFYRWDNRQGEAL 346
T N++ R A+
Sbjct: 360 CTFTNYWTQSFRDTPAV 376
>gi|89890252|ref|ZP_01201762.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
gi|89517167|gb|EAS19824.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
Length = 512
Score = 127 bits (319), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 116/391 (29%), Positives = 175/391 (44%), Gaps = 45/391 (11%)
Query: 1 MKEISRLTAVILLAVGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTR 60
MK+ + +LL + F SC DDF ++ S D FS DT+ DT+F+ + SSTR
Sbjct: 1 MKKYFIIIPSLLLCIIF----ASCRDDF-AFSNSTGDLG-FSQDTVFLDTVFTNIGSSTR 54
Query: 61 TFMVYNRLNRSLRLSEVELVGGKNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEATFP- 119
TF VYN + + + V L G++ YR+ VDG G F ++ +L KDS+F+FVE T
Sbjct: 55 TFKVYNNSSDDIVIPRVALAQGESSKYRLAVDGIPGRIFENVELLAKDSLFVFVETTIDI 114
Query: 120 -EGESDDPVEVKDSLRF----------LINGRTDYVLLQGFR--QNVDEVTAL------- 159
+ S D D++ F L+ D + L R Q ++E L
Sbjct: 115 NDFSSGDEFLYTDTIEFDSGPNQQKVELVTLVQDAIFLFPERDAQGIEETLLLGTTDDGE 174
Query: 160 --------VIDRDTIFGAQRPTLLLDSLVVQQGATLTLPAGCRLLMANKAHIKV--RGRL 209
+ D + A +P ++ V TLT+ AG RL N + I V G L
Sbjct: 175 DIRISGFFLDDTELTLTAAKPYVIYGYAGVPPNKTLTIDAGARLHFHNDSGIIVANEGSL 234
Query: 210 MAEGNPAKRVMIEN---LRHDLLVQDVPYTLVPGQWGGILFSEESRGNELRYTTIRNGRW 266
G P+ +EN D L + YT + GQWG I + S+ N + TI+N
Sbjct: 235 QVNGLPSSTEELENEVIFEGDRL--EPTYTDIAGQWGAIWLTAGSKNNIINNATIKNASV 292
Query: 267 GIIAEGGKDVTT-PKLLLDGCMVTNTKGSGLAASGGYIRILNSEISNTLGYTVAL-FGSV 324
GII + +T L L + N+ SGL A+G I N I+N+ ++ L G
Sbjct: 293 GIIMDSVNTTSTGATLKLSNTQIYNSSNSGLIATGANIEAENLIINNSGQSSLVLRLGGE 352
Query: 325 CELTQSTVCNFYRWDNRQGEALRYVTAFAPD 355
T+ N++ RQ L +++ P+
Sbjct: 353 YTFNNCTIANYWNNSFRQDPTL-FISNIIPN 382
>gi|86135566|ref|ZP_01054147.1| hypothetical protein MED152_12699 [Tenacibaculum sp. MED152]
gi|85819739|gb|EAQ40896.1| hypothetical protein MED152_12699 [Polaribacter dokdonensis MED152]
Length = 500
Score = 127 bits (318), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 122/433 (28%), Positives = 181/433 (41%), Gaps = 62/433 (14%)
Query: 6 RLTAVILLAVGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTRTFMVY 65
R +L+ VG + SC DF EFS DT+ DT+F+ + S+T VY
Sbjct: 2 RYLVALLICVGLITM-SSCRKDFSTIPNFGS--LEFSKDTVFLDTVFTNIGSATYNLKVY 58
Query: 66 NRLNRSLRLSEVELVGGKNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEATFPEGESDD 125
NR ++ + + L G + YR+NVDG G +F+++ IL +DS+F+FVE T +
Sbjct: 59 NRGGNAITIPRIALENGTSSNYRLNVDGIPGKEFNNIDILAEDSIFVFVETTIDANSITN 118
Query: 126 PVEVKDSLRFLINGRTDYVLLQ--GFRQNVDEVTALVIDRDTIFGAQRP-TLLLDSLV-- 180
P+ TD +L +Q+VD VT LV D + IF + P T+ +DSL
Sbjct: 119 PL------------YTDRILFDTGANQQDVDLVT-LVQDANFIFPGKDPITMKIDSLTLD 165
Query: 181 -------------------------------VQQGATLTLPAGCRL-LMANKAHI-KVRG 207
V TLT+ AG ++ AN I
Sbjct: 166 GQPTTIKGRFLEDSELTINNTKPTVIYGYAAVAANKTLTINAGSQIYFHANSGLIIDKEA 225
Query: 208 RLMAEGNPAKRVMIENLRHDLLVQDVPYTLVPGQWGGILFSEESRGNELRYTTIRNGRWG 267
L G ++V + R + ++ +PGQWG I S+ NE+++ IRNG G
Sbjct: 226 SLRVNGTLDEKVTFQGDR-----LENSFSKIPGQWGTIWMRAGSKNNEIKHAQIRNGVIG 280
Query: 268 IIAEGGKDVTTPKLLLDGCMVTNTKGSGLAASGGYIRILNSEI-SNTLGYTVALFGSVCE 326
I+ + TTP L L+ + N G+ A I N + S A G
Sbjct: 281 ILIDSIGSNTTPTLKLENSEIYNNSNFGILARETNIEAYNVVVGSAGQASLAATVGGTYN 340
Query: 327 LTQSTVCNFYRWDNRQGEALRYVTAFA-PDVAGGSYTPSSD-SRLILSNSIVDGSRSVVK 384
T ST NF+ RQ A+ F D G T + D + +N I DG+ ++
Sbjct: 341 FTHSTFANFWNNGIRQLPAVLVNNFFVFIDANGNEVTATRDLNAANFTNCIFDGNNNIEF 400
Query: 385 QGDKESGGEISLS 397
DK G + S
Sbjct: 401 LLDKVDGSTFNYS 413
>gi|126663955|ref|ZP_01734949.1| hypothetical protein FBBAL38_12470 [Flavobacteria bacterium BAL38]
gi|126623904|gb|EAZ94598.1| hypothetical protein FBBAL38_12470 [Flavobacteria bacterium BAL38]
Length = 511
Score = 123 bits (309), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 100/360 (27%), Positives = 159/360 (44%), Gaps = 46/360 (12%)
Query: 15 VGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTRTFMVYNRLNRSLRL 74
+G ++ SC +D D ES FS DT+ DT+F+ + SST T VYN+ ++++ +
Sbjct: 10 IGIAVSITSCRNDLD--FESNTGSLRFSKDTVYLDTVFTNIGSSTYTLKVYNKSDKNIAI 67
Query: 75 SEVELVGGKNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEAT----------------- 117
+ L G N YR+ VDG G +F ++ +L KDSM+IF+ T
Sbjct: 68 PTLRLANGVNSKYRLMVDGMAGQEFENVEMLAKDSMYIFISVTADVADANPTDFLYTDQI 127
Query: 118 -FPEGESDDPVE----VKDSLRFLINGRTDYVLLQGFRQNVDEVTALVIDR-DTIFGAQ- 170
F +G + VE ++D++ FL R + DE+ +D D G +
Sbjct: 128 LFGDGMNQQKVELVTLIQDAV-FLYPQRFGDGTTETLPIGDDEIYGFFLDEADATNGNEL 186
Query: 171 -----RPTLLLDSLVVQQGATLTLPAGCR--------LLMANKAHIKVRGRLMAEGNPAK 217
+P ++ V TL + AG R L++AN A + + G
Sbjct: 187 HWTNTKPYVIYGYAAVPSNKTLVVDAGARVHFHAESGLIVANNASLHIEGDNSTTDALEN 246
Query: 218 RVMIENLRHDLLVQDVPYTLVPGQWGGILFSEESRGNELRYTTIRNGRWGIIAEGGKDVT 277
V+ E R + D VPGQWG I ++ S N+++ TI+N G + G +
Sbjct: 247 EVIFEGDRLEPDFSD-----VPGQWGTIWLTQGSTDNQIKNLTIKNATVGFLVSGNDGTS 301
Query: 278 TPKLLLDGCMVTNTKGSGLAASGGYIRILNSEISNTLGYTVA-LFGSVCELTQSTVCNFY 336
TP L LD + N+ G+ A G I N I+N + + +G E T T N++
Sbjct: 302 TPTLNLDNTQIYNSANVGILARTGNIAGRNVVINNCGQASFSGSYGGSYEFTHCTFANYW 361
>gi|149372201|ref|ZP_01891471.1| hypothetical protein SCB49_00735 [unidentified eubacterium SCB49]
gi|149354968|gb|EDM43530.1| hypothetical protein SCB49_00735 [unidentified eubacterium SCB49]
Length = 465
Score = 112 bits (279), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 101/348 (29%), Positives = 150/348 (43%), Gaps = 39/348 (11%)
Query: 23 SCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTRTFMVYNRLNRSLRLSEVELVGG 82
SC +DF+ AE EFS DT+ DT+F+ + SST VYN N + + V L G
Sbjct: 18 SCRNDFE--AEPNTGNLEFSRDTVYLDTIFTNIGSSTYNLKVYNNSNEDINIPTVGLAKG 75
Query: 83 KNRGYRVNVDGHVGTKFSDLTILPKDSMFIFVEAT--------------------FPEGE 122
N YR+NVDG G +F D+ +L KDS+FIFVE T F G
Sbjct: 76 DNSLYRLNVDGLAGKQFEDIRVLAKDSIFIFVETTADIQTLPGSDPNFLYTDQIVFDSGA 135
Query: 123 SDDPVE----VKDSLRFLINGRTDYVLLQGFRQNVDEVTALV----IDRDTI-FGAQRPT 173
++ VE ++D++ FL R D + E L+ +D D + F + P
Sbjct: 136 NEQEVELVTLIQDAV-FLFPERFDDGTTETLNLGTTEEPLLIDGYFLDDDELTFTNELPY 194
Query: 174 LLLDSLVVQQGATLTLPAGCRLLMANKAHIKVRGRLMAEGNPAKRVMIENLRHDLLVQ-- 231
++ V G TL + G R+ + I V + N A V E + ++ Q
Sbjct: 195 VIYGFAAVAPGKTLNILPGARVHFHADSGILVANTGSIKANGAPSVDAELQENQIIFQGD 254
Query: 232 --DVPYTLVPGQWGGILFSEESRGNELRYTTIRNGRWGIIAEGGKDVTTPKLLLDGCMVT 289
+ + VPGQW I ++ S NE Y TI+N G++ + T L L +
Sbjct: 255 RLEPSFENVPGQWFSIWMTQGSTNNEFSYCTIKNSIVGLLMDSNDGDET--LTLKNVELY 312
Query: 290 NTKGSGLAASGGYIRILNSEISNTLGYTVAL-FGSVCELTQSTVCNFY 336
N +GL A + N I+N ++ L G ST N++
Sbjct: 313 NHSNNGLLARTSNVYGENVIINNCGLSSLQLSLGGNYTFNHSTFSNYW 360
>gi|146302447|ref|YP_001197038.1| hypothetical protein Fjoh_4720 [Flavobacterium johnsoniae UW101]
gi|146156865|gb|ABQ07719.1| hypothetical protein Fjoh_4720 [Flavobacterium johnsoniae UW101]
Length = 503
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 137/298 (45%), Gaps = 50/298 (16%)
Query: 12 LLAVGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTRTFMVYNRLNRS 71
+L +G +L SC DFD A S + +FS DT+ DT+F + SST VYNR
Sbjct: 7 ILVLGLILVFSSCRTDFDTVASSGD--LKFSRDTVYLDTVFKNIGSSTYQLKVYNRSKDD 64
Query: 72 LRLSEVELVGGKNRGYRVNVDGHVGTK---FSDLTILPKDSMFIFVEATFPEGESDDPVE 128
+ + ++L G + YR+ VDG G F D+T+L KDS++IF+E T + +P +
Sbjct: 65 ISIPIIQLKKGLSSKYRMTVDGMSGNNGKIFKDVTLLAKDSLYIFIETT-ADITDANPTD 123
Query: 129 V--KDSLRF----------LINGRTDYVLLQGFRQNVD-----------EVTALVIDR-- 163
D ++F L+ D V L +QN D +V ++
Sbjct: 124 FLYTDEIQFDSGANLQEVALVTLIQDAVFLFP-KQNADGTKEKIQIEGKDVDGFYLNEND 182
Query: 164 -----DTIFGAQRPTLLLDSLVVQQGATLTLPAGCR--------LLMANKAHIKVRGRLM 210
+ IF Q+P ++ V + T+T AG R L +++KA +++ G+
Sbjct: 183 PENGNELIFTNQKPYVIYGYAGVPENKTVTFEAGARVHFHANSGLYVSDKASLEINGKTS 242
Query: 211 AEGNPAKRVMIENLRHDLLVQDVPYTLVPGQWGGILFSEESRGNELRYTTIRNGRWGI 268
V+ E R + L Y+ +PGQW ++FS S + + + T++N G+
Sbjct: 243 TTEKLENEVIFEGDRLESL-----YSAIPGQWNSVIFSNGSTNHSINHLTLKNAVIGL 295
>gi|150024754|ref|YP_001295580.1| hypothetical protein FP0660 [Flavobacterium psychrophilum JIP02/86]
gi|149771295|emb|CAL42764.1| Protein of unknown function [Flavobacterium psychrophilum JIP02/86]
Length = 513
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 97/382 (25%), Positives = 157/382 (41%), Gaps = 48/382 (12%)
Query: 11 ILLAVGFVLANGSCSDDFDKYAESPEDRAEFSADTIKFDTLFSRVSSSTRTFMVYNRLNR 70
IL + V+ SC DF+ S + EFS T+ DT+F+ + SST VYNR N
Sbjct: 5 ILFILALVICITSCRKDFETVPSS--GKLEFSKTTVYLDTVFANIGSSTYMLKVYNRSNN 62
Query: 71 SLRLSEVELVGGKNRGYRVNVDGHVGTK---FSDLTILPKDSMFIFVEAT---------- 117
+ + + L G YR+ VDG GT F ++ +L DS+F+F+E T
Sbjct: 63 DITIPSIALGKGNASKYRLMVDGMRGTNGKIFPNVQLLAHDSLFVFIETTVDIAAANPAD 122
Query: 118 --------FPEGESDDPVE----VKDSLRFLINGRTDYVLLQGFRQNVDEVTA--LVIDR 163
F G V ++D++ FL RT LQ + + T +
Sbjct: 123 MLYTDEILFDAGALQQKVNLVTLIQDAV-FLFPDRTINEQLQLDNNDPNSYTYGFELTGT 181
Query: 164 DTIFGAQRPTLLLDSLVVQQGATLTLPAGCRLLMANKAHIKVR--GRLMAEGNPA----- 216
F +P ++ V G L + G R+ +K+ I V+ G + G P+
Sbjct: 182 KLHFTKAKPYVIYGYAGVDSGKKLIIDKGARIHFHDKSGIFVKRGGSIQVNGLPSPTATP 241
Query: 217 --KRVMIENLRHDLLVQDVPYTLVPGQWGGILFSEESRGNELRYTTIRNGRWGIIAEGGK 274
V+ E R + + PGQWG I +E S N + TI+N GI + +
Sbjct: 242 LVNEVIFEGDR-----LEPSFAETPGQWGIIYLAEGSTNNVFNHVTIKNSTVGIFID-RQ 295
Query: 275 DVTTPKLLLDGCMVTNTKGSGLAASGGYIRILNSEISNTLGYTVA-LFGSVCELTQSTVC 333
D T + + + N+ G+ A G I N I+N ++A + G T
Sbjct: 296 DATAVQ--ISNTQIYNSSNYGIIARKGKINGDNIVINNAGKSSLACILGGTYNFVHCTFA 353
Query: 334 NFYRWDNRQGEALRYVTAFAPD 355
++++ +RQ + A+ +
Sbjct: 354 DYWQGTSRQSPTVYLDNAYTEN 375
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
Posted date: May 10, 2008 4:54 AM
Number of letters in database: 884,634,002
Number of sequences in database: 2,620,852
Database: /apps/blastdb/nr.01
Posted date: May 10, 2008 4:52 AM
Number of letters in database: 976,814,986
Number of sequences in database: 2,761,530
Database: /apps/blastdb/nr.02
Posted date: May 10, 2008 4:46 AM
Number of letters in database: 360,829,861
Number of sequences in database: 1,132,722
Lambda K H
0.318 0.136 0.394
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,208,765,097
Number of Sequences: 6515104
Number of extensions: 96727664
Number of successful extensions: 213416
Number of sequences better than 1.0e-04: 18
Number of HSP's better than 0.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 213361
Number of HSP's gapped (non-prelim): 24
length of query: 506
length of database: 2,222,278,849
effective HSP length: 138
effective length of query: 368
effective length of database: 1,323,194,497
effective search space: 486935574896
effective search space used: 486935574896
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 124 (52.4 bits)