BLASTP 2.2.18 [Mar-02-2008]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= PGN_0645 hypothetical protein
(346 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
6,515,104 sequences; 2,222,278,849 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|34540408|ref|NP_904887.1| hypothetical protein PG0602 [P... 708 0.0
gi|150010304|ref|YP_001305047.1| hypothetical protein BDI_3... 262 3e-68
gi|154491785|ref|ZP_02031411.1| hypothetical protein PARMER... 256 1e-66
gi|91214810|ref|ZP_01251783.1| hypothetical protein P700755... 151 6e-35
gi|89890785|ref|ZP_01202294.1| conserved hypothetical prote... 148 5e-34
gi|124004960|ref|ZP_01689803.1| conserved hypothetical prot... 144 1e-32
gi|163788769|ref|ZP_02183214.1| hypothetical protein FBALC1... 142 3e-32
gi|83857192|ref|ZP_00950720.1| hypothetical protein CA2559_... 139 3e-31
gi|86131914|ref|ZP_01050511.1| hypothetical protein MED134_... 138 5e-31
gi|86142696|ref|ZP_01061135.1| hypothetical protein MED217_... 136 3e-30
gi|86133039|ref|ZP_01051621.1| hypothetical protein MED152_... 135 6e-30
gi|146300505|ref|YP_001195096.1| hypothetical protein Fjoh_... 133 2e-29
gi|150025761|ref|YP_001296587.1| hypothetical protein FP171... 132 3e-29
gi|163756085|ref|ZP_02163201.1| hypothetical protein KAOT1_... 128 5e-28
gi|126663822|ref|ZP_01734817.1| hypothetical protein FBBAL3... 124 1e-26
gi|149372805|ref|ZP_01891826.1| hypothetical protein SCB49_... 123 2e-26
gi|110639365|ref|YP_679574.1| hypothetical protein CHU_2991... 91 2e-16
gi|167729551|emb|CAO80463.1| hypothetical protein; putative... 65 8e-09
>gi|34540408|ref|NP_904887.1| hypothetical protein PG0602 [Porphyromonas gingivalis W83]
gi|34396721|gb|AAQ65786.1| hypothetical protein PG_0602 [Porphyromonas gingivalis W83]
Length = 346
Score = 708 bits (1828), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 344/346 (99%), Positives = 345/346 (99%)
Query: 1 MMEKCIFAHYPHNLVFMIRKHFGIILGFLSLVFSAGAQQEKQVFHFLNLPATAQALAAGG 60
MMEKCIFAHYPHNLVFMIRKHFGIILGFLSLVFSAGAQQEKQVFHFLNLPATAQALAAGG
Sbjct: 1 MMEKCIFAHYPHNLVFMIRKHFGIILGFLSLVFSAGAQQEKQVFHFLNLPATAQALAAGG 60
Query: 61 KAITIVDDNPGLAFENPALLGYESGGRAFLSYLYYMSGSHMGNACYASSVGERGMWGVGM 120
KAITIVDDNPGLAFENPALLGYESGGRAFLSYLYYMSGSHMGNACYASSVGERGMWGVGM
Sbjct: 61 KAITIVDDNPGLAFENPALLGYESGGRAFLSYLYYMSGSHMGNACYASSVGERGMWGVGM 120
Query: 121 RFLNYGSMQGYDQNAIATGSFSASDIAVQGFYSHELSNHFRGGVSLKALYSSIETYSSFG 180
RFLNYGSMQGYDQNAIATGSFSASDIAVQGFYSHELSNHFRGGVSLKALYSSIETYSSFG
Sbjct: 121 RFLNYGSMQGYDQNAIATGSFSASDIAVQGFYSHELSNHFRGGVSLKALYSSIETYSSFG 180
Query: 181 LGVDVGISYYDDDKGYSASALFKNVGAQLKGYNEEREPFDWDFQLGFSRSFINAPFRLHI 240
LGVDVGISYYDDDKGYSASALFKNVGAQLKGYNEEREP DWDFQLGFSRSFINAPFRLHI
Sbjct: 181 LGVDVGISYYDDDKGYSASALFKNVGAQLKGYNEEREPLDWDFQLGFSRSFINAPFRLHI 240
Query: 241 TLFNLNPHYFKRLVPRDLSKMQKFLRHFSIGAEFTPSEKFWVGLGYTPQIAQDFEVEGGN 300
TLFNLNPHYFKRLVPRDLSKMQKFLRHFSIGAEFTPSE+FWVGLGYTPQIAQDFEVEGGN
Sbjct: 241 TLFNLNPHYFKRLVPRDLSKMQKFLRHFSIGAEFTPSERFWVGLGYTPQIAQDFEVEGGN 300
Query: 301 KWGGLSAGVGFTSGVVRVGVSAATYHPAALSFMCSVGIRLDDKSIF 346
KWGGLSAGVGFTSGVVRVGVSAATYHPAALSFMCSVGIRLDDKSIF
Sbjct: 301 KWGGLSAGVGFTSGVVRVGVSAATYHPAALSFMCSVGIRLDDKSIF 346
>gi|150010304|ref|YP_001305047.1| hypothetical protein BDI_3738 [Parabacteroides distasonis ATCC
8503]
gi|149938728|gb|ABR45425.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 336
Score = 262 bits (669), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 139/332 (41%), Positives = 199/332 (59%), Gaps = 8/332 (2%)
Query: 17 MIRKHFGIILGFLSLVFSAGAQQEKQVFHFLNLPATAQALAAGGKAITIVDDNPGLAFEN 76
M K F +IL ++L S AQ + + FL P + +A A GG + +V+ +P L F N
Sbjct: 1 MRNKLFILILFVVTL--SVSAQNGSEAYTFLRFPTSTRANALGGHTVALVERDPSLIFHN 58
Query: 77 PALLGYESGGRAFLSYLYYMSGSHMGNACYASSVGERGMWGVGMRFLNYGSMQGYDQNAI 136
PALLG E G L+Y+ Y+S ++G+A + + GE+G WGVG F++YG ++ + +
Sbjct: 59 PALLGAEMDGMINLNYMNYISDINVGSALFTKAHGEKGAWGVGATFISYGDIKEVLPDNV 118
Query: 137 ATG-SFSASDIAVQGFYSHELSNHFRGGVSLKALYSSIETYSSFGLGVDVGISYYDDDKG 195
TG S SA DI+V GFYS +L+ +RGG+SLK LYS + Y+S GL VD G+SYY+ DKG
Sbjct: 119 VTGASLSAKDISVNGFYSRDLNERWRGGLSLKFLYSGLADYTSIGLCVDAGLSYYNSDKG 178
Query: 196 YSASALFKNVGAQLKGYNEEREPFDWDFQLGFSRSFINAPFRLHITLFNLNPHYFKRLVP 255
+S KN+GAQLK Y +ER+ WD Q+G ++ +AP R +T LN F +
Sbjct: 179 FSFGFALKNIGAQLKAYEDERQKMPWDIQMGITQKMAHAPIRFSLTAQYLNRWKFDYIDN 238
Query: 256 RDL-----SKMQKFLRHFSIGAEFTPSEKFWVGLGYTPQIAQDFEVEGGNKWGGLSAGVG 310
D S ++ +HF IG +F PSE FWVG+G+ P++ D +++GG + G SAG G
Sbjct: 239 TDKEYDGDSFVKTLAKHFIIGVDFIPSENFWVGVGFNPKVNMDMKLKGGGSFSGFSAGAG 298
Query: 311 FTSGVVRVGVSAATYHPAALSFMCSVGIRLDD 342
+ VG S A YHP+A+S M SV L D
Sbjct: 299 VRIKMFDVGFSLAKYHPSAMSMMISVSTTLAD 330
>gi|154491785|ref|ZP_02031411.1| hypothetical protein PARMER_01401 [Parabacteroides merdae ATCC
43184]
gi|154088026|gb|EDN87071.1| hypothetical protein PARMER_01401 [Parabacteroides merdae ATCC
43184]
Length = 330
Score = 256 bits (655), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 129/311 (41%), Positives = 187/311 (60%), Gaps = 5/311 (1%)
Query: 37 AQQEKQVFHFLNLPATAQALAAGGKAITIVDDNPGLAFENPALLGYESGGRAFLSYLYYM 96
AQ + FL P++A+A A GG +++V+ +P L F NPALLG E L+YL Y+
Sbjct: 19 AQTGDTGYTFLRYPSSARANAMGGNTMSLVERDPSLIFHNPALLGAEMDQMVNLNYLNYI 78
Query: 97 SGSHMGNACYASSVGERGMWGVGMRFLNYGSMQGYDQNAIATGSFSASDIAVQGFYSHEL 156
S ++G+A + + E+G WGVG F + G ++G + + TG F+A DI+V GF+S++L
Sbjct: 79 SDINVGSALFTKAYKEKGAWGVGASFFSQGKIRGMSEEGLPTGDFTAKDISVNGFFSYDL 138
Query: 157 SNHFRGGVSLKALYSSIETYSSFGLGVDVGISYYDDDKGYSASALFKNVGAQLKGYNEER 216
S +RGG SLK LYS I Y+S G+ VD G+SYYD +KG+S FKN+GAQLK Y +ER
Sbjct: 139 SERWRGGASLKFLYSGIGDYTSIGMVVDAGLSYYDSEKGFSFGFAFKNIGAQLKAYEDER 198
Query: 217 EPFDWDFQLGFSRSFINAPFRLHITLFNLNPHYFKRLVPRDLSKM-----QKFLRHFSIG 271
+ WD QLG ++ +AP RL +T L + + D + F++H IG
Sbjct: 199 QKMPWDIQLGITKQMAHAPIRLSLTAQYLTKWKVEYVDDYDREYTGDNFFKSFVKHLVIG 258
Query: 272 AEFTPSEKFWVGLGYTPQIAQDFEVEGGNKWGGLSAGVGFTSGVVRVGVSAATYHPAALS 331
++ PS+ FW+G+GY P+ A D +++G N G S G G + VGVS A YHP+ALS
Sbjct: 259 VDYIPSDNFWLGIGYNPKTALDMKLQGSNALAGFSGGAGVRIKMFDVGVSVAKYHPSALS 318
Query: 332 FMCSVGIRLDD 342
M S+ + D
Sbjct: 319 MMLSISTTISD 329
>gi|91214810|ref|ZP_01251783.1| hypothetical protein P700755_18134 [Psychroflexus torquis ATCC
700755]
gi|91187237|gb|EAS73607.1| hypothetical protein P700755_18134 [Psychroflexus torquis ATCC
700755]
Length = 340
Score = 151 bits (382), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 103/327 (31%), Positives = 160/327 (48%), Gaps = 16/327 (4%)
Query: 30 SLVFSAGAQQEKQ-VFHFLNLPATAQALAAGGKAITIVDDNPGLAFENPALLGYESGGRA 88
S+ +S AQ Q + FLNLP + + A GGK IT +P A NPAL+ ++ +
Sbjct: 13 SVTWSTKAQIGGQNTYQFLNLPVSPKVSALGGKNITSYSFDPSDAMANPALINFDMHNQM 72
Query: 89 FLSYLYYMSGSHMGNACYASSVGERG-MWGVGMRFLNYGSMQGYDQNAIATGSFSASDIA 147
++Y+ Y + + G A YA +G R + G+ F++YG +G+D+ +T F S+ A
Sbjct: 73 SVNYMNYFADVNYGTASYAYDIGRRTQIIQAGVTFIDYGRFEGFDEAGNSTSKFGGSEAA 132
Query: 148 VQGFYSHEL-SNHFRGGVSLKALYSSIETYSSFGLGVDVGISYYDDDKGYSASALFKNVG 206
+ YS + + + GV+LK + SS+E YSSF DVG+SY + S + +N G
Sbjct: 133 LSVGYSRRIGRSDYYVGVNLKLISSSLEQYSSFAGAADVGVSYIYPEWDLIISGVVRNFG 192
Query: 207 AQLKGYNEEREPFDWDFQLGFSRSFINAPFRLHITLFNLNPHYFK-RLVPRDLSKM---- 261
Q K +N+ RE ++ LG S+ AP R H+TL NL R RD+ +
Sbjct: 193 TQFKAFNDVRESMPFEVVLGISQKLKKAPIRWHVTLENLQQWNLSFRNTARDIEDLTGNV 252
Query: 262 --------QKFLRHFSIGAEFTPSEKFWVGLGYTPQIAQDFEVEGGNKWGGLSAGVGFTS 313
LRH +GAE P F + LGY + ++ ++ + GL+ GV
Sbjct: 253 TTDDPSFINNALRHTILGAELFPDGGFSIQLGYNFRRGEELRIQDQRAFSGLTGGVSIKL 312
Query: 314 GVVRVGVSAATYHPAALSFMCSVGIRL 340
+R S A ++ AA S + I L
Sbjct: 313 NKLRFSYSYARFNRAASSSFFGLNINL 339
>gi|89890785|ref|ZP_01202294.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
gi|89516930|gb|EAS19588.1| conserved hypothetical protein [Flavobacteria bacterium BBFL7]
Length = 343
Score = 148 bits (374), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 92/315 (29%), Positives = 151/315 (47%), Gaps = 15/315 (4%)
Query: 41 KQVFHFLNLPATAQALAAGGKAITIVDDNPGLAFENPALLGYESGGRAFLSYLYYMSGSH 100
+ + FLNLPA + A GG+ +T VD +P NPA + + + ++Y Y+ +
Sbjct: 28 RTTYQFLNLPAGTKQAALGGRVLTGVDYDPTSGIFNPATINPKMDNQLQVNYANYLGDVN 87
Query: 101 MGNACYASSVGER-GMWGVGMRFLNYGSMQGYDQNAIATGSFSASDIAVQGFYSHELS-N 158
G A YA + + VG+ +L+YG+ GY++ +ATG F +++A+ Y++ + +
Sbjct: 88 YGTAAYAYTWDRHVQTFHVGVTYLDYGTFDGYNEQGVATGEFGGNEVAISAGYAYNIPFS 147
Query: 159 HFRGGVSLKALYSSIETYSSFGLGVDVGISYYDDDKGYSASALFKNVGAQLKGYNEEREP 218
G ++K + S +E Y+S G +D G YY++DK + + +N+G Q YNE E
Sbjct: 148 DIYLGANVKVISSKLERYTSLGGAIDFGALYYNEDKDIRVALVVRNIGTQFTPYNEVYEK 207
Query: 219 FDWDFQLGFSRSFINAPFRLHITLFNLNPHYFK-------------RLVPRDLSKMQKFL 265
+ LGFS + N P RLH+TL NL ++ L
Sbjct: 208 LPLEVALGFSNNMRNLPLRLHVTLENLQQWNIAFSNDANAQTDLDGNVIEDKPGFFNNAL 267
Query: 266 RHFSIGAEFTPSEKFWVGLGYTPQIAQDFEVEGGNKWGGLSAGVGFTSGVVRVGVSAATY 325
RH G E P + F V LGY + AQ+ ++ + G+SAG +R + A Y
Sbjct: 268 RHTVFGVEIFPEQAFQVRLGYNFRRAQELSIQDTRAFSGVSAGFSLKINKMRFSYTHARY 327
Query: 326 HPAALSFMCSVGIRL 340
A+ + + V I L
Sbjct: 328 TLASHTSLFGVNINL 342
>gi|124004960|ref|ZP_01689803.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
gi|123989638|gb|EAY29184.1| conserved hypothetical protein [Microscilla marina ATCC 23134]
Length = 352
Score = 144 bits (363), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 100/324 (30%), Positives = 157/324 (48%), Gaps = 22/324 (6%)
Query: 24 IILGFLSLVFSAGAQQ--EKQVFHFLNLPATAQALAAGGKAITIVDDNPGLAFENPALLG 81
I + LSLVF + Q ++ F FL +P A+ G +++ +P + ++NPAL+
Sbjct: 5 IAVVLLSLVFLSAQAQIGGRRNFEFLQVPGNARLAGVGRVNVSLHQADPNVLWQNPALID 64
Query: 82 YESGGRAFLSYLYYMSGSHMGNACYASSVGERGMWGVGMRFLNYGSMQGYDQNAIATGSF 141
S + +Y Y + + Y V + G +G G++++ YGS + N G+F
Sbjct: 65 SSSSRKLGFNYTPYFADIKNTHLSYVHHVPKVGTFGAGLQYMAYGSFEQTSPNGQVVGTF 124
Query: 142 SASDIAVQGFYSHELSNHFRGGVSLKALYSSIETYSSFGLGVDVGISYYDDDKGYSASAL 201
A+D A Y+H++ +FR G SLK + SSIETY+S + D+G ++ ++ +
Sbjct: 125 VANDFAFNVGYAHQVK-YFRMGASLKFIGSSIETYNSTAIAADIGGAFIHPKYDWTIGLV 183
Query: 202 FKNVGAQLKGYNE---EREPFDWDFQLGFSRSFINAPFRLHITLFNLNPHYFKRL----- 253
FKN+G L YNE R PF + QLG S + PFR +TL ++ L
Sbjct: 184 FKNIGVVLSNYNEFTQSRLPF--EVQLGTSFKPTHMPFRFSLTLQHMQQFDITYLDPLQD 241
Query: 254 ---------VPRDLSKMQKFLRHFSIGAEFTPSEKFWVGLGYTPQIAQDFEVEGGNKWGG 304
VP++ S LRHF IG EF P + F V LGY I ++ G +++ G
Sbjct: 242 VTFDASGNTVPKEKSFGDNLLRHFVIGGEFLPEKVFSVRLGYNHLINRELRETGASRFSG 301
Query: 305 LSAGVGFTSGVVRVGVSAATYHPA 328
S G + + S A+YH A
Sbjct: 302 FSYGFRVKIKALELAFSRASYHAA 325
>gi|163788769|ref|ZP_02183214.1| hypothetical protein FBALC1_11047 [Flavobacteriales bacterium
ALC-1]
gi|159876006|gb|EDP70065.1| hypothetical protein FBALC1_11047 [Flavobacteriales bacterium
ALC-1]
Length = 333
Score = 142 bits (359), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 96/320 (30%), Positives = 157/320 (49%), Gaps = 19/320 (5%)
Query: 41 KQVFHFLNLPATAQALAAGGKAITIVDDNPGLAFENPALLGYESGGRAFLSYLYYMSGSH 100
+ + FLNL ++ + A GGK IT D + NPA + YE G + ++ Y+ G
Sbjct: 14 ESTYQFLNLISSPRQAALGGKIITNFDHDVTEGLYNPASINYEMGNQLAVNVSNYLGGIT 73
Query: 101 MGNACYASSVGER-GMWGVGMRFLNYGSMQGYDQNAIATGSFSASDIAVQGFYSHELS-N 158
G+A YA + + G+ ++NYGS GYD N +TG+F+ ++ AV Y++ +
Sbjct: 74 YGSAAYAYTWDRHVQTFHFGVTYINYGSFDGYDVNGNSTGTFNGNEAAVSFGYNYNIPFT 133
Query: 159 HFRGGVSLKALYSSIETYSSFGLGVDVGISYYDDDKGYSASALFKNVGAQLKGYNEEREP 218
F G + K + S++E Y+S G +D+G Y ++D + A+ +N+G Q Y + EP
Sbjct: 134 DFYIGANAKIITSALEQYNSIGGALDIGAMYINEDLDFHAALTVRNIGTQFTTYAGQNEP 193
Query: 219 FDWDFQLGFSRSFINAPFRLHITLFNL--------NPHYFKRLVPRDLSKMQ-------K 263
+ G S++ N P R H+TL NL NP + + D ++ Q K
Sbjct: 194 LPLEVNFGMSQTLENVPIRWHLTLDNLQKWPIGVSNPA--RAITDLDGNQTQEKVSFFNK 251
Query: 264 FLRHFSIGAEFTPSEKFWVGLGYTPQIAQDFEVEGGNKWGGLSAGVGFTSGVVRVGVSAA 323
LRH +GAE P F + GY + A++ +E + GLS G+G +R + A
Sbjct: 252 GLRHLILGAELWPKRGFNLRFGYNFRRAEELRIEDQRNFSGLSFGIGIKLNKMRFSYTHA 311
Query: 324 TYHPAALSFMCSVGIRLDDK 343
Y A+ + + I LD +
Sbjct: 312 RYTSASNTSFFGLQIDLDGR 331
>gi|83857192|ref|ZP_00950720.1| hypothetical protein CA2559_10348 [Croceibacter atlanticus
HTCC2559]
gi|83848559|gb|EAP86428.1| hypothetical protein CA2559_10348 [Croceibacter atlanticus
HTCC2559]
Length = 344
Score = 139 bits (351), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 96/328 (29%), Positives = 156/328 (47%), Gaps = 15/328 (4%)
Query: 28 FLSLVFSAGAQQEKQVFHFLNLPATAQALAAGGKAITIVDDNPGLAFENPALLGYESGGR 87
FL++ F G + + FLNL + + A GGK ITI D +P NPA + + +
Sbjct: 16 FLNINFIYGQIGGQSTYQFLNLINSPRQAALGGKNITIYDQDPTSGLYNPANINFRMDNQ 75
Query: 88 AFLSYLYYMSGSHMGNACYASSVGERG-MWGVGMRFLNYGSMQGYDQNAIATGSFSASDI 146
++Y+ Y++ + G A YA R + G+ ++NYGS GYD+N AT +FS +
Sbjct: 76 LSVNYVNYIADVNYGTASYAYLYDRRTQVIHAGITYINYGSFDGYDENGNATNTFSGGEA 135
Query: 147 AVQGFYSHELS-NHFRGGVSLKALYSSIETYSSFGLGVDVGISYYDDDKGYSASALFKNV 205
A+ Y++ + + F G + K + S +E YSS G +D+GI+Y+ +D +A+ +N+
Sbjct: 136 ALSLGYAYNIPYSDFYIGANAKFISSKLEQYSSLGGALDLGITYFYEDWDLVIAAVARNI 195
Query: 206 GAQLKGYNEEREPFDWDFQLGFSRSFINAPFRLHITLFNLNPHYFK-RLVPRDLSKM--- 261
G Q Y++ E ++ G S+ P + H+T NL R RD +
Sbjct: 196 GTQFTAYDDTYESIPFEVNFGISQQLRKVPLQWHLTFENLQQWQIAFRNTNRDEEDLSGN 255
Query: 262 ---------QKFLRHFSIGAEFTPSEKFWVGLGYTPQIAQDFEVEGGNKWGGLSAGVGFT 312
LRH +G E P F + LGY+ + ++ + + GLSAG G
Sbjct: 256 VIEDDPGFFNNVLRHTILGVELFPKGGFNIRLGYSFRRGEELRIVDQRSFAGLSAGFGVK 315
Query: 313 SGVVRVGVSAATYHPAALSFMCSVGIRL 340
VR + + Y+ AA S + I L
Sbjct: 316 FNRVRFNYAYSRYNSAASSSFFGLNIDL 343
>gi|86131914|ref|ZP_01050511.1| hypothetical protein MED134_02905 [Cellulophaga sp. MED134]
gi|85817736|gb|EAQ38910.1| hypothetical protein MED134_02905 [Dokdonia donghaensis MED134]
Length = 339
Score = 138 bits (348), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 98/339 (28%), Positives = 162/339 (47%), Gaps = 16/339 (4%)
Query: 17 MIRKHFGIILGFLSLVFSAGAQQEKQVFHFLNLPATAQALAAGGKAITIVDDNPGLAFEN 76
M+RK I+L FL+ V + G + + FLNL ++ + A GGK IT D +P A N
Sbjct: 1 MMRKVCYILL-FLTAVSAYGQLGGRATYQFLNLMSSPRQAALGGKIITNYDYDPDSALYN 59
Query: 77 PALLGYESGGRAFLSYLYYMSGSHMGNACYASSVGERG-MWGVGMRFLNYGSMQGYDQNA 135
PA + Y + ++Y+ Y++ + G A YA R + G+ ++NYGS G D+N
Sbjct: 60 PANINYRMDNQLSVNYVNYLADINYGTASYAYLWDRRTQVLHAGITYVNYGSFDGRDENG 119
Query: 136 IATGSFSASDIAVQ-GFYSHELSNHFRGGVSLKALYSSIETYSSFGLGVDVGISYYDDDK 194
ATG F S+ A+ G+ ++ F G ++K + S++ Y+S G +D+G++Y +D
Sbjct: 120 NATGEFGGSEAALSLGYATNIPYTDFYVGANVKLITSTLAEYTSAGGAIDLGLTYNYEDW 179
Query: 195 GYSASALFKNVGAQLKGYNEEREPFDWDFQLGFSRSFINAPFRLHITLFNL--------- 245
+A+ + +N+G Q Y + E + G S+ N P R H+TL NL
Sbjct: 180 DLNAAVVVRNIGTQFTPYVDTIEKLPLEIDAGISQIVPNVPIRWHLTLENLQLWNIAFEN 239
Query: 246 ----NPHYFKRLVPRDLSKMQKFLRHFSIGAEFTPSEKFWVGLGYTPQIAQDFEVEGGNK 301
+ + +RH +G E P F + LGY + +++ +
Sbjct: 240 EARGTTDLDGNTTAEKIGILDNVIRHAIVGVELFPRGGFNLRLGYNFRRSEELRIINQRS 299
Query: 302 WGGLSAGVGFTSGVVRVGVSAATYHPAALSFMCSVGIRL 340
+ G+SAG G +R + A Y+ AA S +GI L
Sbjct: 300 FAGISAGFGIKINKMRFNYAYARYNSAAASSFFGIGIDL 338
>gi|86142696|ref|ZP_01061135.1| hypothetical protein MED217_07271 [Flavobacterium sp. MED217]
gi|85830728|gb|EAQ49186.1| hypothetical protein MED217_07271 [Leeuwenhoekiella blandensis
MED217]
Length = 339
Score = 136 bits (342), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 96/340 (28%), Positives = 161/340 (47%), Gaps = 16/340 (4%)
Query: 17 MIRKHFGIILGFLSLVFSAGAQQEKQVFHFLNLPATAQALAAGGKAITIVDDNPGLAFEN 76
M++K I+ F S F A + + FLNL ++ + A GGK +T +P N
Sbjct: 1 MVKKLLLILALFTSFNFWAQLGG-RATYQFLNLVSSPKQAALGGKLLTDYSYDPTSGLFN 59
Query: 77 PALLGYESGGRAFLSYLYYMSGSHMGNACYASSVGER-GMWGVGMRFLNYGSMQGYDQNA 135
PA + E + L+Y+ Y++ + G YA R G++ G+ ++NYG+ +GYD+
Sbjct: 60 PATINPEMHNQLSLNYVNYLADVNYGTVGYAYEYDRRSGVFHAGVTYVNYGTFEGYDERG 119
Query: 136 IATGSFSASDIAVQGFYSHEL-SNHFRGGVSLKALYSSIETYSSFGLGVDVGISYYDDDK 194
AT FS ++A Y+ + F G ++K + S +E Y+S G +D+G Y +D+
Sbjct: 120 NATADFSGGEVAFSTGYAFNIPRTDFFVGANVKLISSKLEQYTSLGGALDLGFIYVNDEL 179
Query: 195 GYSASALFKNVGAQLKGYNEEREPFDWDFQLGFSRSFINAPFRLHITLFNL--------N 246
+ + + +N+G Q Y+ E + LG S+ + P R H TL NL N
Sbjct: 180 ELNIAGVVRNIGTQFTAYDVTYERLPLEIDLGISQKLEHVPLRWHFTLENLQNWNLAFAN 239
Query: 247 PHYFK-----RLVPRDLSKMQKFLRHFSIGAEFTPSEKFWVGLGYTPQIAQDFEVEGGNK 301
P P +++ + LRH AE P + F + LGY+ + A++ +
Sbjct: 240 PARATSDLNGTTTPENVNFFDEALRHMIFAAELFPDKGFNIRLGYSVRRAEELRIVDQRS 299
Query: 302 WGGLSAGVGFTSGVVRVGVSAATYHPAALSFMCSVGIRLD 341
+ GLSAG +R+ S A Y+ AA S + + L+
Sbjct: 300 FAGLSAGFSVKFNKLRLSYSYARYNSAASSGFFGLNVDLN 339
>gi|86133039|ref|ZP_01051621.1| hypothetical protein MED152_00005 [Tenacibaculum sp. MED152]
gi|85819902|gb|EAQ41049.1| hypothetical protein MED152_00005 [Polaribacter dokdonensis MED152]
Length = 340
Score = 135 bits (339), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 98/317 (30%), Positives = 160/317 (50%), Gaps = 18/317 (5%)
Query: 43 VFHFLNLPATAQALAAGGKAITIVDDNPGLAFENPALLGYESGGRAFLSYLYYMSGSHMG 102
V+ FLNL ++A+ +A GG+ + + +D A NPA++ E G+ L+Y Y+S ++G
Sbjct: 26 VYQFLNLSSSARQIALGGEVLNLYND-VNQASWNPAVINDEMDGKLALNYSSYLSDINIG 84
Query: 103 NACYASSVGER-GMWGVGMRFLNYGSMQGYDQNAIATGSFSASDIAVQGFYSHEL--SNH 159
+ YA + R G + +L+YGS G D+ TG F A+DI+V Y+ L +N
Sbjct: 85 SISYARLISRRFGTIHGSINYLDYGSFIGADEEGNETGEFGANDISVSLGYALNLPWTNL 144
Query: 160 FRGGVSLKALYSSIETYSSFGLGVDVGISYYDDDKGYSASALFKNVGAQLKGYNEEREPF 219
F G +LK + S+I+++SS G+ DVG+ YY K Y+ + + +N G Q+K +N RE
Sbjct: 145 FFGA-NLKFINSNIDSFSSVGIAGDVGVFYYSPYKNYTFTLVARNFGTQIKTFNGTREKL 203
Query: 220 DWDFQLGFSRSFINAPFRLHITLFNL--------NPHYFK-----RLVPRDLSKMQKFLR 266
+ LG S P + ++ + NL NP ++S + LR
Sbjct: 204 PFKVALGASYKLNYVPLKWYLAIDNLQKWDISVPNPSEQSTDLEGNTTNEEISFLNNALR 263
Query: 267 HFSIGAEFTPSEKFWVGLGYTPQIAQDFEVEGGNKWGGLSAGVGFTSGVVRVGVSAATYH 326
HF IGAE P + GY + A + +++ +GG+S G G + + + YH
Sbjct: 264 HFVIGAELFPESAINIRAGYNFRRAAELKLQEVRTFGGVSFGFGIKMNKFKFNYAYSKYH 323
Query: 327 PAALSFMCSVGIRLDDK 343
A+ S S+ I LD +
Sbjct: 324 SASNSSTFSLQIDLDQR 340
>gi|146300505|ref|YP_001195096.1| hypothetical protein Fjoh_2755 [Flavobacterium johnsoniae UW101]
gi|146154923|gb|ABQ05777.1| hypothetical protein Fjoh_2755 [Flavobacterium johnsoniae UW101]
Length = 339
Score = 133 bits (334), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 95/325 (29%), Positives = 151/325 (46%), Gaps = 15/325 (4%)
Query: 20 KHFGIILGFLSLVFSAGAQQEKQVFHFLNLPATAQALAAGGKAITIVDDNPGLAFENPAL 79
KH+ + L L S G + + FLNL + A GGK ITI D++ A NPA
Sbjct: 3 KHYVLFLMILICSVSFGQIGGRYTYQFLNLTTNPRQAALGGKTITIYDEDVNQAMSNPAA 62
Query: 80 LGYESGGRAFLSYLYYMSGSHMGNACYASSVGER-GMWGVGMRFLNYGSMQGYDQNAIAT 138
L + L+Y Y + G YA + + G+ ++NYGS +GYD+N T
Sbjct: 63 LNADMDNHLALNYGNYYGEASYGTGSYAYTYDRHLQTFYAGVNYINYGSFEGYDENGQRT 122
Query: 139 GSFSASDIAVQGFYSHELS-NHFRGGVSLKALYSSIETYSSFGLGVDVGISYYDDDKGYS 197
F+ S+ A+ Y++ + GV+ K + S++E+Y+S G +D+G Y D+ +
Sbjct: 123 SDFTGSEGALSVGYAYNVPFTDLHIGVNGKLITSTLESYNSIGGALDLGFLYIDERNDIN 182
Query: 198 ASALFKNVGAQLKGYNEEREPFDWDFQLGFSRSFINAPFRLHITLFNL--------NPHY 249
+ +F+N+G Q K Y+ +E ++ G S+ + P R H+TL NL NP
Sbjct: 183 YALVFRNIGTQFKTYSGIKENLPFEITAGISQELEHIPLRWHLTLENLQQWDIAFSNPVR 242
Query: 250 FKRLV-----PRDLSKMQKFLRHFSIGAEFTPSEKFWVGLGYTPQIAQDFEVEGGNKWGG 304
+ + +S + LRH G E P F V LGY + ++ VE + G
Sbjct: 243 GETNIDGTTNAEKVSFVNNALRHVVFGVELFPQRAFNVRLGYNFRRGEELRVEEQRNFSG 302
Query: 305 LSAGVGFTSGVVRVGVSAATYHPAA 329
+S G G ++ S + Y AA
Sbjct: 303 VSLGFGLRMNKLKFNYSYSRYTLAA 327
>gi|150025761|ref|YP_001296587.1| hypothetical protein FP1713 [Flavobacterium psychrophilum JIP02/86]
gi|149772302|emb|CAL43780.1| Protein of unknown function [Flavobacterium psychrophilum JIP02/86]
Length = 343
Score = 132 bits (333), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 99/341 (29%), Positives = 162/341 (47%), Gaps = 20/341 (5%)
Query: 17 MIRKHFGIILGFLSLVFSAGAQQEKQ-VFHFLNLPATAQALAAGGKAITIVDDNPGLAFE 75
M +KH +IL ++ S +Q Q V+ FLNL ++ + A GGK IT D +
Sbjct: 1 MQKKH--LILLLFTIYTSTYSQIGGQGVYQFLNLISSPRQAALGGKIITNYDYDVNQPLF 58
Query: 76 NPALLGYESGGRAFLSYLYYMSGSHMGNACYASSVGERGMWGV--GMRFLNYGSMQGYDQ 133
NPA + E GR L+Y Y G A +A + +R + + + ++NYG GYD+
Sbjct: 59 NPASINTEMDGRLALNYGNYFGDVTYGTAAFAYTY-DRHLETLHGAITYINYGKFDGYDE 117
Query: 134 NAIATGSFSASDIAVQGFYSHELS-NHFRGGVSLKALYSSIETYSSFGLGVDVGISYYDD 192
ATG F+ ++IA+ Y++ + G + K + S++E+Y+SFG+ D+ Y DD
Sbjct: 118 FKAATGQFTGNEIALSLGYAYNIPWTKIYLGANAKLISSTLESYNSFGVAADLAAMYKDD 177
Query: 193 DKGYSASALFKNVGAQLKGYNEEREPFDWDFQLGFSRSFINAPFRLHITLFNL------- 245
+ + + +N G Q+K YN+ E ++ G S+ N P R H+TL NL
Sbjct: 178 KNDINYALVVRNFGTQIKSYNDTNEKLPFEVIAGISQELENVPIRWHLTLENLQHWNVSF 237
Query: 246 ------NPHYFKRLVPRDLSKMQKFLRHFSIGAEFTPSEKFWVGLGYTPQIAQDFEVEGG 299
P + +S + +RH +GAE P + F + L Y + A + ++
Sbjct: 238 ANPARSQPTIDGEPIEEKVSFLGNTMRHVIVGAEIFPKKTFTLRLSYNFRRAAELKILEQ 297
Query: 300 NKWGGLSAGVGFTSGVVRVGVSAATYHPAALSFMCSVGIRL 340
+ G+SAG G R S + Y AA + + + I L
Sbjct: 298 RTFSGISAGFGIRFRKFRFDYSYSRYTLAANTSLFGLTINL 338
>gi|163756085|ref|ZP_02163201.1| hypothetical protein KAOT1_09476 [Kordia algicida OT-1]
gi|161323959|gb|EDP95292.1| hypothetical protein KAOT1_09476 [Kordia algicida OT-1]
Length = 339
Score = 128 bits (322), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 84/315 (26%), Positives = 154/315 (48%), Gaps = 15/315 (4%)
Query: 41 KQVFHFLNLPATAQALAAGGKAITIVDDNPGLAFENPALLGYESGGRAFLSYLYYMSGSH 100
+ + FLNL ++ + A GGK IT D + A NPA + Y L+Y+ Y+ +
Sbjct: 24 RYTYQFLNLISSPRQAALGGKVITNYDKDVNQALFNPASINYTMDNHLSLNYVNYLGDVN 83
Query: 101 MGNACYASSVGER-GMWGVGMRFLNYGSMQGYDQNAIATGSFSASDIAVQGFYSHELS-N 158
G+A YA + R + G+ +++YG GYD+ TGSF+ ++IA+ Y++ +
Sbjct: 84 YGSAAYAYTWDRRVQTFHAGVTYVSYGQFDGYDEQGNETGSFTGNEIALSMGYAYNIPWT 143
Query: 159 HFRGGVSLKALYSSIETYSSFGLGVDVGISYYDDDKGYSASALFKNVGAQLKGYNEEREP 218
G + K + S +E Y+SFG +D+GI Y +++ + + +N+G Q Y +E
Sbjct: 144 DIHIGANAKFINSKLEQYNSFGGALDLGIMYINEELELNVALAARNIGIQFTTYAGVQEQ 203
Query: 219 FDWDFQLGFSRSFINAPFRLHITLFNL--------NPHYFKR-----LVPRDLSKMQKFL 265
++ G S+ + P R H+T NL NP+ ++ + ++ ++ +
Sbjct: 204 LPFELIFGISQKLEHVPIRWHLTFENLQQWNIAFANPNRAEQSIEGGVTEEKVTFLKNLI 263
Query: 266 RHFSIGAEFTPSEKFWVGLGYTPQIAQDFEVEGGNKWGGLSAGVGFTSGVVRVGVSAATY 325
RH +G E P + F + LGY + ++ ++ + G++AG G ++ + A Y
Sbjct: 264 RHTIVGMELFPDKGFNIRLGYNFRRGEELKIVDQRNFSGITAGFGLRINKLKFDYTYARY 323
Query: 326 HPAALSFMCSVGIRL 340
AA + M V + L
Sbjct: 324 SIAANTSMFGVTLNL 338
>gi|126663822|ref|ZP_01734817.1| hypothetical protein FBBAL38_10082 [Flavobacteria bacterium BAL38]
gi|126624086|gb|EAZ94779.1| hypothetical protein FBBAL38_10082 [Flavobacteria bacterium BAL38]
Length = 338
Score = 124 bits (310), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 91/315 (28%), Positives = 147/315 (46%), Gaps = 15/315 (4%)
Query: 41 KQVFHFLNLPATAQALAAGGKAITIVDDNPGLAFENPALLGYESGGRAFLSYLYYMSGSH 100
K V+ FLNL + + A GGK +T+VD + AF NPA + + R +Y Y
Sbjct: 24 KSVYQFLNLAQSPRQAALGGKTVTVVDYDVNQAFYNPATINIKMHNRLSANYGSYYGEVS 83
Query: 101 MGNACYASSVGER-GMWGVGMRFLNYGSMQGYDQNAIATGSFSASDIAVQGFYSHELS-N 158
G A YA + + G+ ++NYG+ +G D+ T F+ S+ A+ Y++ L
Sbjct: 84 YGTAAYAYTYDRHLQTFHAGISYVNYGTFEGRDEFGNLTSDFTGSEAALSLGYAYNLPWT 143
Query: 159 HFRGGVSLKALYSSIETYSSFGLGVDVGISYYDDDKGYSASALFKNVGAQLKGYNEEREP 218
G + K + S++E+Y+S+G VD+G Y D D + +N+G Q+K Y + E
Sbjct: 144 DMYVGANAKLISSTLESYNSWGAAVDLGFLYVDYDNDINYGLTVRNLGFQIKPYEDTNEK 203
Query: 219 FDWDFQLGFSRSFINAPFRLHITLFNL--------NPHYFKRLV-----PRDLSKMQKFL 265
G S+ N P R H+T NL NP+ + + +S L
Sbjct: 204 LPLAIDAGISQLMENVPIRWHMTFENLQQWNIAFSNPNRAEGSLDGGSEEEKVSFFNNAL 263
Query: 266 RHFSIGAEFTPSEKFWVGLGYTPQIAQDFEVEGGNKWGGLSAGVGFTSGVVRVGVSAATY 325
RH +GAE P + F + LGY + +++ + + G+S G G G V+ S + Y
Sbjct: 264 RHLILGAELFPEKGFNIRLGYNFRRSEELRILEQRNFSGISVGFGIRFGKVKFDYSYSRY 323
Query: 326 HPAALSFMCSVGIRL 340
AA + + + I L
Sbjct: 324 TVAANTSLFGLMIDL 338
>gi|149372805|ref|ZP_01891826.1| hypothetical protein SCB49_12619 [unidentified eubacterium SCB49]
gi|149354502|gb|EDM43067.1| hypothetical protein SCB49_12619 [unidentified eubacterium SCB49]
Length = 340
Score = 123 bits (309), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 93/317 (29%), Positives = 153/317 (48%), Gaps = 15/317 (4%)
Query: 41 KQVFHFLNLPATAQALAAGGKAITIVDDNPGLAFENPALLGYESGGRAFLSYLYYMSGSH 100
+ + FLNL + A GGK +T D +P NPA + E + L+Y Y+ +
Sbjct: 24 QSTYQFLNLVNNPRQAALGGKIVTNYDYDPTQGLFNPASINPEMDNQLSLNYTNYIGDVN 83
Query: 101 MGNACYASSVGERG-MWGVGMRFLNYGSMQGYDQNAIATGSFSASDIAVQGFYSHELS-N 158
G A YA R + G+ ++NYG GYD+N T SF+ S++A+ ++ ++
Sbjct: 84 YGTATYAYLWDRRTQVLHTGITYVNYGQFDGYDENGEPTASFTGSEVALSFGHARNIAFT 143
Query: 159 HFRGGVSLKALYSSIETYSSFGLGVDVGISYYDDDKGYSASALFKNVGAQLKGYNEEREP 218
F GV+ K + SS+E+YSS G+ VD+G+ Y +D + + +N+G+Q+ Y+ EP
Sbjct: 144 DFHIGVNAKLISSSLESYSSLGIAVDIGVMYVYEDWDLHITGVARNIGSQITPYDTIYEP 203
Query: 219 FDWDFQLGFSRSFINAPFRLHITLFNLNPHYFK-RLVPRDLSKMQ--------KFL---- 265
D G S++ N P R H T+ NL RD++ ++ F+
Sbjct: 204 LPLDIIFGISQTLENIPIRWHFTMDNLQQWKLGFENTNRDVTDLEGGTTSEQINFIDHAF 263
Query: 266 RHFSIGAEFTPSEKFWVGLGYTPQIAQDFEVEGGNKWGGLSAGVGFTSGVVRVGVSAATY 325
RH +G E P F + LGY + ++ + + G+SAG +R+ S A +
Sbjct: 264 RHMILGLELFPESGFNLRLGYNLRRGEELRILEQRSFAGISAGFSIKLNKLRLSYSYAKF 323
Query: 326 HPAALSFMCSVGIRLDD 342
+ AA S + I L+D
Sbjct: 324 NGAASSGYLGLNIDLND 340
>gi|110639365|ref|YP_679574.1| hypothetical protein CHU_2991 [Cytophaga hutchinsonii ATCC 33406]
gi|110282046|gb|ABG60232.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
Length = 346
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/343 (24%), Positives = 150/343 (43%), Gaps = 19/343 (5%)
Query: 17 MIRKHFGIILGFLSLVFSAGAQQEKQVFHFLNLPATAQALAAGGKAITIVDDNPGLAFEN 76
++R F IL L V S + + FL +P A+ A GG ++I D + + + N
Sbjct: 2 ILRLFFISILAVLP-VLSHAQLGGRTGYSFLEVPVAARQAAVGGYNVSIRDKDVAMGYSN 60
Query: 77 PALLGYESGGRAFLSYLYYMSGSHMGNACYASSVGE-RGMWGVGMRFLNYGSMQGYDQNA 135
PALL +Y + + A + + +G+ G+ + NYGS+ D N
Sbjct: 61 PALLNDTISKTVSFTYQPFYADIKKSTLFGAYGLKDNKGVISGGVNYFNYGSINQTDANG 120
Query: 136 IATGS-FSASDIAVQGFYSHELSNHFRGGVSLKALYSSIETYSSFGLGVDVGISYYDDDK 194
TG+ F + A+ Y+ + F G + K ++SSIETY++ + VD G+ Y +
Sbjct: 121 NETGAVFHPVEYALVLGYARGMG-PFSMGANAKYVHSSIETYNAAAVMVDFGVLYKHPKQ 179
Query: 195 GYSASALFKNVGAQLKGYNE-EREPFDWDFQLGFSRSFINAPFRLHITL----------- 242
+ +FKNVG + Y + + +D QLG S + P RL +T
Sbjct: 180 QLTVGLVFKNVGFNVNNYVKGQAVNLPFDIQLGASYKPDHMPVRLSLTTHHLYKFDVVYN 239
Query: 243 ---FNLNPHYFKRLVPRDLSKMQKFLRHFSIGAEFTPSEKFWVGLGYTPQIAQDFEVEGG 299
N+ + + P + + K +RHF IG EF ++ F GY + ++ ++
Sbjct: 240 DPAINVKVNLDGTVTPIETTFGDKLMRHFVIGGEFLFTKNFHARFGYNFLMRRELRLQDR 299
Query: 300 NKWGGLSAGVGFTSGVVRVGVSAATYHPAALSFMCSVGIRLDD 342
+ G++ G +G + A Y + ++GI +++
Sbjct: 300 SATAGMTWGFMLRVKKFDIGYTRAYYSIKGGTSYFTLGININE 342
>gi|167729551|emb|CAO80463.1| hypothetical protein; putative signal peptide [Candidatus
Cloacamonas acidaminovorans]
Length = 300
Score = 65.1 bits (157), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 46/188 (24%), Positives = 83/188 (44%), Gaps = 3/188 (1%)
Query: 44 FHFLNLPATAQALAAGGKAITIVDDNPGLAFENPALLGYESGGRAFLSYLYYMSGSHMGN 103
+ FLN+P +L+ G+ + +D NPG PA +++ +++ +
Sbjct: 30 YKFLNVPYGPVSLSLAGRGVFSID-NPGSFLLQPAASCMNDQHLLGITHNMWLADTQANM 88
Query: 104 ACYASSVGERGMWGVGMRFLNYGSMQGYDQNAIATGSFSASDIAVQGFYSHELSNHFRGG 163
Y S + +G+ MR L+YG ++ D G++ DI V Y+H ++ G
Sbjct: 89 IAY-SFAQRKSHFGIAMRNLDYGEIENRDDMGFLIGNYHPLDIDVTANYAHRVTPSIYAG 147
Query: 164 VSLKALYSSIETYSSFGLGVDVGISYYDDDKGYSASALFKNVGAQLKGYNEEREPFDWDF 223
+L LY + T SS L D+G + K + + +N+G +EER F
Sbjct: 148 ANLGILYQKLNTASSLALHTDLGFCWLPPVKDAKITLVGRNLGIA-NHTDEERVKLPVCF 206
Query: 224 QLGFSRSF 231
+L ++ F
Sbjct: 207 ELDINKGF 214
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
Posted date: May 10, 2008 4:54 AM
Number of letters in database: 884,634,002
Number of sequences in database: 2,620,852
Database: /apps/blastdb/nr.01
Posted date: May 10, 2008 4:52 AM
Number of letters in database: 976,814,986
Number of sequences in database: 2,761,530
Database: /apps/blastdb/nr.02
Posted date: May 10, 2008 4:46 AM
Number of letters in database: 360,829,861
Number of sequences in database: 1,132,722
Lambda K H
0.323 0.139 0.427
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,572,676,788
Number of Sequences: 6515104
Number of extensions: 69705580
Number of successful extensions: 148395
Number of sequences better than 1.0e-04: 19
Number of HSP's better than 0.0 without gapping: 15
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 148335
Number of HSP's gapped (non-prelim): 19
length of query: 346
length of database: 2,222,278,849
effective HSP length: 134
effective length of query: 212
effective length of database: 1,349,254,913
effective search space: 286042041556
effective search space used: 286042041556
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 122 (51.6 bits)