BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= ANA_1219
(467 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|124521111|ref|ZP_01696123.1| polysaccharide biosynthesis... 230 1e-58
gi|56684466|gb|AAW22435.1| Wzx [Lactobacillus rhamnosus] 216 2e-54
gi|56684526|gb|AAW22489.1| Wzx [Lactobacillus rhamnosus] 216 3e-54
gi|56684506|gb|AAW22471.1| Wzx [Lactobacillus rhamnosus] 216 3e-54
gi|56684486|gb|AAW22453.1| Wzx [Lactobacillus rhamnosus] 216 3e-54
gi|118587269|ref|ZP_01544696.1| membrane protein [Oenococcu... 209 3e-52
gi|23464837|ref|NP_695440.1| hypothetical transmembrane pro... 196 3e-48
gi|15902361|ref|NP_357911.1| The type 2 capsule locus of St... 193 2e-47
gi|22316056|gb|AAL32506.1| wzx [Streptococcus thermophilus] 189 4e-46
gi|55823026|ref|YP_141467.1| exopolysaccharide biosynthesis... 188 5e-46
gi|90655826|gb|ABD96532.1| EpsM [Streptococcus thermophilus] 188 6e-46
gi|119026404|ref|YP_910249.1| hypothetical transmembrane pr... 174 9e-42
gi|68643792|emb|CAI33988.1| flippase Wzx [Streptococcus pne... 173 2e-41
gi|68644308|emb|CAI34412.1| flippase Wzx [Streptococcus pne... 172 3e-41
gi|3320394|gb|AAC38752.1| repeat unit transporter [Streptoc... 163 2e-38
gi|68643625|emb|CAI33843.1| flippase Wzx [Streptococcus pne... 163 2e-38
gi|149018047|ref|ZP_01834506.1| hypothetical protein CGSSp2... 162 3e-38
gi|68643597|emb|CAI33820.1| flippase Wzx [Streptococcus pne... 162 4e-38
gi|116618520|ref|YP_818891.1| Polysaccharide transport memb... 134 1e-29
gi|150007213|ref|YP_001301956.1| exopolysaccharide biosynth... 127 2e-27
gi|156869588|gb|EDO62960.1| hypothetical protein CLOLEP_002... 103 3e-20
gi|154148138|ref|YP_001407016.1| polysaccharide biosynthesi... 102 7e-20
gi|156868909|gb|EDO62281.1| hypothetical protein CLOLEP_011... 97 2e-18
gi|20093247|ref|NP_619322.1| polysaccharide biosynthesis pr... 85 1e-14
gi|149182088|ref|ZP_01860572.1| polysaccharide biosynthesis... 81 2e-13
gi|119478059|ref|ZP_01618138.1| Membrane protein involved i... 78 1e-12
gi|118580872|ref|YP_902122.1| polysaccharide biosynthesis p... 75 9e-12
gi|56421698|ref|YP_149016.1| polysaccharide biosynthesis [G... 72 6e-11
gi|21227237|ref|NP_633159.1| Transporter [Methanosarcina ma... 72 8e-11
gi|156740780|ref|YP_001430909.1| polysaccharide biosynthesi... 72 1e-10
gi|28378721|ref|NP_785613.1| repeat unit transporter [Lacto... 70 3e-10
gi|153941483|ref|YP_001392372.1| putative polysaccharide tr... 70 4e-10
gi|154249271|ref|YP_001410096.1| polysaccharide biosynthesi... 69 5e-10
gi|28211855|ref|NP_782799.1| transporter [Clostridium tetan... 69 6e-10
gi|148381045|ref|YP_001255586.1| O antigen repeat unit flip... 69 6e-10
gi|89891385|ref|ZP_01202891.1| capsular polysaccharide repe... 69 6e-10
gi|53711455|ref|YP_097447.1| polysaccharide biosynthesis pr... 69 8e-10
gi|73668656|ref|YP_304671.1| transporter [Methanosarcina ba... 68 1e-09
gi|88804486|ref|ZP_01120006.1| polysaccharide biosynthesis ... 67 2e-09
gi|40388615|gb|AAR85520.1| Wzx [Thermoanaerobacterium therm... 67 3e-09
gi|153158522|ref|ZP_01375303.2| polysaccharide biosynthesis... 67 3e-09
gi|77165939|ref|YP_344464.1| Polysaccharide biosynthesis pr... 66 6e-09
gi|153807205|ref|ZP_01959873.1| hypothetical protein BACCAC... 64 1e-08
gi|93005451|ref|YP_579888.1| polysaccharide biosynthesis pr... 64 3e-08
gi|156111618|gb|EDO13363.1| hypothetical protein BACOVA_008... 63 3e-08
gi|15643386|ref|NP_228430.1| lipopolysaccharide biosynthesi... 63 5e-08
gi|29348696|ref|NP_812199.1| polysaccharide biosynthesis pr... 62 6e-08
gi|110598884|ref|ZP_01387135.1| Polysaccharide biosynthesis... 62 6e-08
gi|148976924|ref|ZP_01813579.1| polysaccharide biosynthesis... 62 7e-08
gi|83645504|ref|YP_433939.1| Membrane protein involved in t... 62 8e-08
gi|149375176|ref|ZP_01892948.1| Membrane protein involved i... 62 8e-08
gi|156861962|gb|EDO55393.1| hypothetical protein BACUNI_010... 61 2e-07
gi|37725468|gb|AAO60454.1| Wzx [Streptococcus pneumoniae] 61 2e-07
gi|119356242|ref|YP_910886.1| polysaccharide biosynthesis p... 60 2e-07
gi|37725462|gb|AAO60450.1| Wzx [Streptococcus pneumoniae] >... 60 3e-07
gi|78188447|ref|YP_378785.1| polysaccharide biosynthesis pr... 60 3e-07
gi|110798949|ref|YP_694929.1| putative polysaccharide trans... 60 5e-07
gi|37725480|gb|AAO60462.1| Wzx [Streptococcus pneumoniae] 59 6e-07
gi|145621782|ref|ZP_01777748.1| polysaccharide biosynthesis... 59 6e-07
gi|68551915|ref|ZP_00591308.1| Polysaccharide biosynthesis ... 59 8e-07
gi|91794015|ref|YP_563666.1| hypothetical protein Sden_2664... 58 2e-06
gi|89092774|ref|ZP_01165726.1| hypothetical protein MED92_0... 57 2e-06
gi|67938692|ref|ZP_00531213.1| Polysaccharide biosynthesis ... 57 3e-06
gi|145621520|ref|ZP_01777489.1| polysaccharide biosynthesis... 57 3e-06
gi|145621524|ref|ZP_01777493.1| polysaccharide biosynthesis... 56 5e-06
gi|67918833|ref|ZP_00512425.1| Polysaccharide biosynthesis ... 56 6e-06
gi|114566262|ref|YP_753416.1| hypothetical protein Swol_072... 56 6e-06
>gi|124521111|ref|ZP_01696123.1| polysaccharide biosynthesis protein [Bacillus coagulans 36D1]
gi|124497090|gb|EAY44660.1| polysaccharide biosynthesis protein [Bacillus coagulans 36D1]
Length = 467
Score = 230 bits (587), Expect = 1e-58, Method: Composition-based stats.
Identities = 122/414 (29%), Positives = 244/414 (58%), Gaps = 1/414 (0%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
L+ N+++FA+G L K + L+P+YT L+ ++G +LL S + +LP+ SL + +++
Sbjct: 8 LINNSIIFAIGNLGNKLIVFFLVPIYTYYLSKNDFGLVDLLTSTLNFILPIFSLSIFDSV 67
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAFFVLFCSVCVFKA 123
RF +D + K+E+ SLVV+ G +C+ + + S + + +F + + +F
Sbjct: 68 LRFCMDRNYDKEEVLTNSLVVICCGFLCSFIIYPVFSMVLPFDGLMGYFYIILLLQLFYT 127
Query: 124 T-TQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLVAF 182
T Q R +G V+ + G++N++ +++S + LV GI+GYL S+ + ++ V F
Sbjct: 128 TFNQFIRAIGLVKLYSFVGILNSIILLISNLIFLVNLSLGIKGYLLSFIVSNVLCITVIF 187
Query: 183 LGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGLFT 242
+ + + ++ + + L + +L+YS PL+PN L WW++ +S RY+V + GL+A GL+
Sbjct: 188 ICAKISKYISLSKMNLKLTKELLLYSTPLIPNALMWWIMGLSDRYIVTFYLGLSANGLYA 247
Query: 243 AASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAGLVIALNR 302
A+K+PS++N++ S+F QAWQ S E +S +R F+ +V + YS+ + + L++ +
Sbjct: 248 VANKIPSILNVLNSIFFQAWQMSAIEEANSKERSKFYSNVFKYYSIFLIISTSLLLVFLK 307
Query: 303 PISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVI 362
+ ++++ + F W+Y+PLL++ F + F GT Y A +R + ++++GA VN+I
Sbjct: 308 LLLKLLVASNFYTSWKYIPLLLIGVVFSCYSSFLGTNYIASKKTRGVFKTSILGATVNII 367
Query: 363 LGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLALLS 416
+P++G GA L+ +++ ++ VVR D R+ +D+ ++ ++ +L+
Sbjct: 368 CNFIFIPWIGINGASLSTMLSFFIIWVVRIIDTRQFVDIKLNYRKMISSFVILA 421
>gi|56684466|gb|AAW22435.1| Wzx [Lactobacillus rhamnosus]
Length = 463
Score = 216 bits (550), Expect = 2e-54, Method: Composition-based stats.
Identities = 135/445 (30%), Positives = 242/445 (54%), Gaps = 15/445 (3%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
LLGN+ +FA+G L K ++ +++PL+T L+ ++GT +L + + ++ P+++L + +A+
Sbjct: 9 LLGNSAIFAVGNLGSKLITFLMVPLFTNYLSTEQFGTVDLATTTVNMLSPIVALSIADAV 68
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAA--AFFVLFCSVCVF 121
+RF +DD+ +F L + + + H A + +++ ++ +
Sbjct: 69 FRFLMDDESDDQAIFTTGLTF----TITVSLVLLFLYPVVRFFHIANGGYILVYLTLVIL 124
Query: 122 KATTQ-LARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLV 180
+A Q R + +V+ F G+ + L M V+ Y L+ G+ GY + L +
Sbjct: 125 QALLQNFIRAIEYVKLFAFNGIFSTLIMAVTGYYLIAVLKQGVTGYFCALIFSALSSICL 184
Query: 181 AFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGL 240
F GS +R + +FD ++LR +L YS+PL+PN W+ + + R+ ++ GL+A GL
Sbjct: 185 TFFGSRGWRFFSVKKFDTSILRSLLKYSIPLIPNAFMWFFTNDASRFFIVAIVGLSANGL 244
Query: 241 FTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRG-YSLATLSAAGLVIA 299
+ A+K+P++IN++ +VF QAWQ S E R +FF +L +L+ +S +G++
Sbjct: 245 YAVANKIPTIINVLYNVFTQAWQISAVEEYQENPRSSFFSEILNANIALSMISLSGILFI 304
Query: 300 LNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMV 359
L +P+ RV + +F E W+ VPLL++AA F + F GT Y A +R +M ST+ G +
Sbjct: 305 L-KPLMRVFVAPDFYESWKLVPLLLIAAVFANFSSFIGTLYLATKRTRAIMSSTVFGMIS 363
Query: 360 NVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLALLSVMA 419
NV+ L+P G GAGL + + LV +R +D++R I L + ++L LLS++
Sbjct: 364 NVLFNSLLIPSFGVQGAGLGAMLGFLLVAAIRYKDIQRYISLKANFNQL-----LLSLIG 418
Query: 420 VCTSFDGGSWLN-GAVWVCLILLAT 443
+ D +L G V L+ L T
Sbjct: 419 IVLMIDVNYFLQWGVTSVALLSLIT 443
>gi|56684526|gb|AAW22489.1| Wzx [Lactobacillus rhamnosus]
Length = 463
Score = 216 bits (550), Expect = 3e-54, Method: Composition-based stats.
Identities = 135/445 (30%), Positives = 242/445 (54%), Gaps = 15/445 (3%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
LLGN+ +FA+G L K ++ +++PL+ L+ ++GT +L + + ++ P+++L + +A+
Sbjct: 9 LLGNSAIFAVGNLGSKLITFLMVPLFANYLSTEQFGTVDLATTTVNMLSPIVALSIADAV 68
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAA--AFFVLFCSVCVF 121
+RFS+DD+ +F L + + + H A + +++ ++ +
Sbjct: 69 FRFSMDDESDDQAIFTTGLTF----TITVSLVLLFLYPVVRFFHIANGGYILVYLTLVIL 124
Query: 122 KATTQ-LARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLV 180
+A Q R + +V+ F G+ + L M V+ Y L+ G+ GY + L +
Sbjct: 125 QALLQNFIRAIEYVKLFAFNGIFSTLIMAVTGYYLIAVLKQGVTGYFCALIFSALSSICL 184
Query: 181 AFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGL 240
F GS +R + +FD ++LR +L YS+PL+PN W+ + + R+ ++ GL+A GL
Sbjct: 185 TFFGSRGWRFFSVKKFDTSILRSLLKYSIPLMPNAFMWFFTNDASRFFIVAIVGLSANGL 244
Query: 241 FTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRG-YSLATLSAAGLVIA 299
+ A+K+P++IN++ +VF QAWQ S E R +FF +L +L+ +S +G++
Sbjct: 245 YAVANKIPTIINVLYNVFTQAWQISAVEEYQENPRSSFFSEILNANIALSMISLSGILFI 304
Query: 300 LNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMV 359
L +P+ RV + +F E W+ VPLL++AA F + F GT Y A +R +M ST+ G +
Sbjct: 305 L-KPLMRVFVAPDFYESWKLVPLLLIAAVFANFSSFIGTLYLATKRTRAIMSSTVFGMIS 363
Query: 360 NVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLALLSVMA 419
NV+ L+P G GAGL + + LV +R +D++R I L + ++L LLS++
Sbjct: 364 NVLFNSLLIPSFGVQGAGLGAMLGFLLVAAIRYKDIQRYISLKANFNQL-----LLSLIG 418
Query: 420 VCTSFDGGSWLN-GAVWVCLILLAT 443
+ D +L G V L+ L T
Sbjct: 419 IVLMIDVNYFLQWGVTSVALLSLIT 443
>gi|56684506|gb|AAW22471.1| Wzx [Lactobacillus rhamnosus]
Length = 430
Score = 216 bits (550), Expect = 3e-54, Method: Composition-based stats.
Identities = 127/419 (30%), Positives = 231/419 (55%), Gaps = 9/419 (2%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
LLGN+ +FA+G L K ++ +++PL+T L+ ++GT +L + + ++ P+++L + +A+
Sbjct: 9 LLGNSAIFAVGNLGSKLITFLMVPLFTNYLSTEQFGTVDLATTTVNMLSPIVALSIADAV 68
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAA--AFFVLFCSVCVF 121
+RF +DD+ +F L + + + H A + +++ ++ +
Sbjct: 69 FRFLMDDESDDQAIFTTGLTF----TITVSLVLLFLYPVVRFFHIANGGYILVYLTLVIL 124
Query: 122 KATTQ-LARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLV 180
+A Q R + +V+ F G+ + L M V+ Y L+ G+ GY + L +
Sbjct: 125 QALLQNFIRAIEYVKLFAFNGIFSTLIMAVTGYYLIAVLKQGVTGYFCALIFSALSSICL 184
Query: 181 AFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGL 240
F GS +R + +FD ++LR +L YS+PL+PN W+ + + R+ ++ GL+A GL
Sbjct: 185 TFFGSRGWRFFSVKKFDTSILRSLLKYSIPLIPNAFMWFFTNDASRFFIVAIVGLSANGL 244
Query: 241 FTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRG-YSLATLSAAGLVIA 299
+ A+K+P++IN++ +VF QAWQ S E R +FF +L +L+ +S +G++
Sbjct: 245 YAVANKIPTIINVLYNVFTQAWQISAVEEYQENPRSSFFSEILNANIALSMISLSGILFI 304
Query: 300 LNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMV 359
L +P+ RV + +F E W+ VPLL++AA F + F GT Y A +R +M ST+ G +
Sbjct: 305 L-KPLMRVFVAPDFYESWKLVPLLLIAAVFANFSSFIGTLYLATKRTRAIMSSTVFGMIS 363
Query: 360 NVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLALLSVM 418
NV+ L+P G GAGL + + LV +R +D++R I L + ++L L + +M
Sbjct: 364 NVLFNSLLIPSFGVQGAGLGAMLGFLLVAAIRYKDIQRYISLKANFNQLLLSLIGIVLM 422
>gi|56684486|gb|AAW22453.1| Wzx [Lactobacillus rhamnosus]
Length = 463
Score = 216 bits (549), Expect = 3e-54, Method: Composition-based stats.
Identities = 127/419 (30%), Positives = 231/419 (55%), Gaps = 9/419 (2%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
LLGN+ +FA+G L K ++ +++PL+T L+ ++GT +L + + ++ P+++L + +A+
Sbjct: 9 LLGNSAIFAVGNLGSKLITFLMVPLFTNYLSTEQFGTVDLATTTVNMLSPIVALSIADAV 68
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAA--AFFVLFCSVCVF 121
+RF +DD+ +F L + + + H A + +++ ++ +
Sbjct: 69 FRFLMDDESDDQAIFTTGLTF----TITVSLVLLFLYPVVRFFHIANGGYILVYLTLVIL 124
Query: 122 KATTQ-LARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLV 180
+A Q R + +V+ F G+ + L M V+ Y L+ G+ GY + L +
Sbjct: 125 QALLQNFIRAIEYVKLFAFNGIFSTLIMAVTGYYLIAVLKQGVTGYFCALIFSALSSICL 184
Query: 181 AFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGL 240
F GS +R + +FD ++LR +L YS+PL+PN W+ + + R+ ++ GL+A GL
Sbjct: 185 TFFGSRGWRFFSVKKFDTSILRSLLKYSIPLIPNAFMWFFTNDASRFFIVAIVGLSANGL 244
Query: 241 FTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRG-YSLATLSAAGLVIA 299
+ A+K+P++IN++ +VF QAWQ S E R +FF +L +L+ +S +G++
Sbjct: 245 YAVANKIPTIINVLYNVFTQAWQISAVEEYQENPRSSFFSEILNANIALSMISLSGILFI 304
Query: 300 LNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMV 359
L +P+ RV + +F E W+ VPLL++AA F + F GT Y A +R +M ST+ G +
Sbjct: 305 L-KPLMRVFVAPDFYESWKLVPLLLIAAVFANFSSFIGTLYLATKRTRAIMSSTVFGMIS 363
Query: 360 NVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLALLSVM 418
NV+ L+P G GAGL + + LV +R +D++R I L + ++L L + +M
Sbjct: 364 NVLFNSLLIPSFGVQGAGLGAMLGFLLVAAIRYKDIQRYISLKANFNQLLLSLIGIVLM 422
>gi|118587269|ref|ZP_01544696.1| membrane protein [Oenococcus oeni ATCC BAA-1163]
gi|118432258|gb|EAV38997.1| membrane protein [Oenococcus oeni ATCC BAA-1163]
Length = 466
Score = 209 bits (532), Expect = 3e-52, Method: Composition-based stats.
Identities = 125/415 (30%), Positives = 223/415 (53%), Gaps = 1/415 (0%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
L+ N+LVFA+G L K +S +L+PLYT LT +YGT ++L + + + LP+ SL + +A+
Sbjct: 8 LMSNSLVFAIGNLGSKLISFLLVPLYTYVLTTSQYGTVDVLTTTVSVFLPVSSLSIFDAV 67
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAFFVLFCSVCVFKA 123
+RF +D + K +F L+V V +A + + + A F ++ +F
Sbjct: 68 FRFIMDKNENKKSVFTNGLLVTLYSTVLMIIAYPI-LLFFKVPLALPFLIILFLNILFSI 126
Query: 124 TTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLVAFL 183
RG+G V+ F L G+++A+ + +S L LV GI GYL+S L + L
Sbjct: 127 FQNFTRGIGFVKIFALAGIVSAVTLGLSNILFLVIFKLGISGYLFSIIFSLLCSIIFISL 186
Query: 184 GSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGLFTA 243
+ +R + +++ L YS+PL+PN LSWWL + + R+ +L+ G++ GLF
Sbjct: 187 STRIWRFIDLKYASFKEIKKFLKYSIPLIPNSLSWWLTNDASRFFILFFVGVSGNGLFAV 246
Query: 244 ASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAGLVIALNRP 303
++K+P+++++ S+F QAWQ S E D D F+ V + + + +P
Sbjct: 247 SNKIPTILSVFFSIFAQAWQISAVSEFDKDDASEFYSKVFNLLISFSFVLIAIFVLFVKP 306
Query: 304 ISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVIL 363
++ + ++F + W YVP+L+LAATF + F GT Y A + + +T+ G ++NV+
Sbjct: 307 FMQIYVSSKFFQAWEYVPVLLLAATFSNFSAFLGTTYLAAKRTSGIFSTTVFGMVINVMA 366
Query: 364 GVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLALLSVM 418
L+P +G GAG+ G++ + V ++R +R + + +D ++ L+ M
Sbjct: 367 CWLLIPVIGVHGAGIGGSLGFLTVTILRYFQTKRFLIITVDWNKSISSFVLVLFM 421
>gi|23464837|ref|NP_695440.1| hypothetical transmembrane protein possibly involved in
polysaccharide biosynthesis [Bifidobacterium longum
NCC2705]
gi|23325421|gb|AAN24076.1| hypothetical transmembrane protein possibly involved in
polysaccharide biosynthesis [Bifidobacterium longum
NCC2705]
Length = 479
Score = 196 bits (497), Expect = 3e-48, Method: Composition-based stats.
Identities = 138/441 (31%), Positives = 231/441 (52%), Gaps = 14/441 (3%)
Query: 2 RLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVE 61
R L+ NT++FA+ +A K ++ L+PLYT ++AGEYG ++ + I + PL++ + E
Sbjct: 5 RRLILNTVLFAINAVATKLITFFLVPLYTYYMSAGEYGLTDMSLTVINLATPLVTFSIAE 64
Query: 62 ALYRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACA---LGSALWNMEHAAAFFVLFCSV 118
A RF + D +D+ A S+++ VV V LG+ E+ A F + + +
Sbjct: 65 AAVRFIVGDSDRQDDYVAISILITLFSVVLVTVLSPILDLGAFGGLGEYKAWFILAYATS 124
Query: 119 CVFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIG----- 173
++ARG+G ++ + I+++ + + + + GI GY S + G
Sbjct: 125 AFMNLCGEVARGIGEIKLIPICAGISSITTFILALVFIGQLKMGITGYFISVSAGPLLAV 184
Query: 174 --YLVGGLV--AFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVV 229
Y+V G + AFL + + R ++R M++Y+LPL+PN L WWL + R +
Sbjct: 185 IIYMVAGGIGKAFLSGMKRMRIVAVRDTWNIVRPMIIYALPLIPNNLFWWLSTGINRLFI 244
Query: 230 LWGSGLAAAGLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLA 289
G+AA+G+F AASK+P L+N VFQQAWQ S +E+ G+FF V A
Sbjct: 245 TGMLGIAASGMFAAASKIPGLLNTAYMVFQQAWQLSAYQEVKDKKIGSFFSPVFCVLQ-A 303
Query: 290 TLSAAGLVIALNRP-ISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRM 348
L+ V++ P ++ MLQ E + W + +L++A F V + F+GT Y A M++
Sbjct: 304 VLTVLCTVLSFFAPLVATFMLQGETYKAWPMISILLIANLFSVFSSFYGTVYSATMHTSF 363
Query: 349 LMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRL 408
+M +T+ GA+ V+ L+P MG GA +A A+ A+V +R D +R I + L
Sbjct: 364 VMKTTVFGAVACVVFTPMLLPIMGVTGACVASALGQAMVFAMRVIDAKRFIQFEVGWKYL 423
Query: 409 TYQLALLSVMAVCTSFDGGSW 429
L LL ++ T+++ G W
Sbjct: 424 APTLVLLITQSIFTAWEIGGW 444
>gi|15902361|ref|NP_357911.1| The type 2 capsule locus of Streptococcus pneumoniae [Streptococcus
pneumoniae R6]
gi|116516835|ref|YP_815840.1| hypothetical protein SPD_0325 [Streptococcus pneumoniae D39]
gi|4200431|gb|AAD10179.1| Cps2J [Streptococcus pneumoniae D39]
gi|15457872|gb|AAK99121.1| The type 2 capsule locus of Streptococcus pneumoniae [Streptococcus
pneumoniae R6]
gi|68642304|emb|CAI32729.1| flippase Wzx [Streptococcus pneumoniae]
gi|116077411|gb|ABJ55131.1| membrane protein, putative [Streptococcus pneumoniae D39]
Length = 469
Score = 193 bits (490), Expect = 2e-47, Method: Composition-based stats.
Identities = 115/421 (27%), Positives = 232/421 (55%), Gaps = 4/421 (0%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
LL N + L K + +++PLYT+ L+ +YGT +L N+ I +++P++S+ + E +
Sbjct: 8 LLKNIGLLTLSNFGSKILVFLMVPLYTSVLSTSDYGTYDLFNTTISLLIPIISINISEGV 67
Query: 64 YRFSIDDDVPKDELFA--GSLVVLGGGVVCTGVACALGSALWNM--EHAAAFFVLFCSVC 119
RF++D+ +++ ++++ G VV G+ ++ + E++ F +L+ S
Sbjct: 68 LRFALDEKNDSSIVYSIGWNIIIKGFLVVVLGIIFNNIFNIFPLLKENSITFLLLYLSTI 127
Query: 120 VFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGL 179
V++ + RG+ V + ++N ++++ L L+ G+ GY WS +G ++ L
Sbjct: 128 VYQFLSSFIRGIDKVSILSIAAILNTISILGFNILFLIIIPLGLVGYFWSNILGLVLPSL 187
Query: 180 VAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAG 239
++Y + ++ L +R++ YS+PL+ N L WW+ + RYVV+ G+A G
Sbjct: 188 YLIYKISQYNIKYTSLQNKKLQQRLVSYSIPLILNSLGWWINNAIDRYVVIAFCGVAVNG 247
Query: 240 LFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAGLVIA 299
+++ K+PS++NI A++F QAW S+ + D FF V Y++ + +GL+I+
Sbjct: 248 IYSVGYKIPSILNIFANIFNQAWILSSVKSYRDEDSEYFFSQVYNKYNMIMVLISGLLIS 307
Query: 300 LNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMV 359
++ +++ + EF + W++VP L++A FG ++ F G + A+ +S++ ST++GA+V
Sbjct: 308 CSKILAKFLYMNEFYDAWKFVPFLLIANVFGAISGFAGGIFSAVKDSKIYSQSTLVGAIV 367
Query: 360 NVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLALLSVMA 419
N+I V + G GA +A ++Y +V ++R +R+ I L + R + LL +
Sbjct: 368 NIIFTFVFVYYYGAIGAAIATMISYFVVWIIRVHTMRKYIKLKIFIRRDVFSYVLLIFQS 427
Query: 420 V 420
+
Sbjct: 428 I 428
>gi|22316056|gb|AAL32506.1| wzx [Streptococcus thermophilus]
Length = 473
Score = 189 bits (480), Expect = 4e-46, Method: Composition-based stats.
Identities = 133/423 (31%), Positives = 226/423 (53%), Gaps = 8/423 (1%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
LL N+LVF +G L K + +L+PLYT A+T EYG A+L + ++LPL+++ V +A
Sbjct: 7 LLSNSLVFTIGNLGSKLLVFLLVPLYTYAMTPQEYGMADLYQTTANLLLPLITMNVFDAT 66
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAFFVLFCSVCVFKA 123
RF+++ + K+ + SLVV V T + + AL N+ + + L + +F+
Sbjct: 67 LRFAMEKSMTKESVLTNSLVVWCFSAVFTCLGACIIYAL-NLSNKW-YLALLLTFNLFQG 124
Query: 124 ----TTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGL 179
+Q ARG+G + F G+I L LV GI GYL S + VG +
Sbjct: 125 GQSILSQYARGIGKSKIFAAGGVILTFLTGALNILFLVYLPLGITGYLMSLVLAN-VGTI 183
Query: 180 VAFLGSAEYRLLAPFRF-DRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAA 238
+ F G+ F+ D+ L+ +ML Y+LPL+P+ + WWL++ S RY VL+ G A
Sbjct: 184 LFFAGTLSIWKEISFKIIDKKLIWQMLYYALPLIPSSILWWLLNASSRYFVLFFLGAGAN 243
Query: 239 GLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAGLVI 298
GL A+K+PS+I+I ++F QAWQ S E DS + ++ V + L +
Sbjct: 244 GLLAVATKIPSIISIFNTIFTQAWQISAIEEYDSHQKSKYYSDVFHYLATFLLLGTSAFM 303
Query: 299 ALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAM 358
+ +PI ++ +++A W+YVP ML+ F + FFGT Y A ++ + ++++ G +
Sbjct: 304 IVLKPIVEKVVSSDYASSWQYVPFFMLSMLFSSFSDFFGTNYIAAKQTKGVFMTSIYGTI 363
Query: 359 VNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLALLSVM 418
V V+L V L+P +G GAGL+ + + ++R +D ++ + + + L L ++
Sbjct: 364 VCVLLQVVLLPIIGLDGAGLSAMLGFLTTFLLRVKDTQKFVVIQIKWRILISNLLIVLAQ 423
Query: 419 AVC 421
+C
Sbjct: 424 ILC 426
>gi|55823026|ref|YP_141467.1| exopolysaccharide biosynthesis protein [Streptococcus thermophilus
CNRZ1066]
gi|1276886|gb|AAC44020.1| EpsM
gi|22218126|gb|AAM94578.1| wzx [Streptococcus thermophilus]
gi|33313730|gb|AAQ04258.1| EpsM [Streptococcus thermophilus]
gi|55739011|gb|AAV62652.1| exopolysaccharide biosynthesis protein [Streptococcus thermophilus
CNRZ1066]
gi|1588817|prf||2209356P epsM gene
Length = 473
Score = 188 bits (478), Expect = 5e-46, Method: Composition-based stats.
Identities = 130/399 (32%), Positives = 217/399 (54%), Gaps = 8/399 (2%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
LL N+LVF +G L K + +L+PLYT A+T EYG A+L + ++LPL+++ V +A
Sbjct: 7 LLSNSLVFTIGNLGSKLLVFLLVPLYTYAMTPQEYGMADLYQTTANLLLPLITMNVFDAT 66
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAFFVLFCSVCVFKA 123
RF+++ + K+ + SLVV V T + + AL N+ + + L + +F+
Sbjct: 67 LRFAMEKSMTKESVLTNSLVVWCFSAVFTCLGACIIYAL-NLSNKW-YLALLLTFNLFQG 124
Query: 124 ----TTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGL 179
+Q ARG+G + F G+I L LV GI GYL S + VG +
Sbjct: 125 GQSILSQYARGIGKSKIFAAGGVILTFLTGALNILFLVYLPLGITGYLMSLVLAN-VGTI 183
Query: 180 VAFLGSAEYRLLAPFRF-DRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAA 238
+ F G+ F+ D+ L+ +ML Y+LPL+P+ + WWL++ S RY VL+ G A
Sbjct: 184 LFFAGTLSIWKEISFKIIDKKLIWQMLYYALPLIPSSILWWLLNASSRYFVLFFLGAGAN 243
Query: 239 GLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAGLVI 298
GL A+K+PS+I+I ++F QAWQ S E DS + ++ V + L +
Sbjct: 244 GLLAVATKIPSIISIFNTIFTQAWQISAIEEYDSHQKSKYYSDVFHYLATFLLLGTSAFM 303
Query: 299 ALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAM 358
+ +PI ++ +++A W+YVP ML+ F + FFGT Y A ++ + ++++ G +
Sbjct: 304 IVLKPIVEKVVSSDYASSWQYVPFFMLSMLFSSFSDFFGTNYIAAKQTKGVFMTSIYGTI 363
Query: 359 VNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRR 397
V V+L V L+P +G GAGL+ + + ++R +D ++
Sbjct: 364 VCVLLQVVLLPIIGLDGAGLSAMLGFLTTFLLRVKDTQK 402
>gi|90655826|gb|ABD96532.1| EpsM [Streptococcus thermophilus]
Length = 473
Score = 188 bits (478), Expect = 6e-46, Method: Composition-based stats.
Identities = 130/399 (32%), Positives = 217/399 (54%), Gaps = 8/399 (2%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
LL N+LVF +G L K + +L+PLYT A+T EYG A+L + ++LPL+++ V +A
Sbjct: 7 LLSNSLVFTIGNLGSKLLVFLLVPLYTYAMTPQEYGMADLYQTTANLLLPLITMNVFDAT 66
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAFFVLFCSVCVFKA 123
RF+++ + K+ + SLVV V T + + AL N+ + + L + +F+
Sbjct: 67 LRFAMEKSMTKESVLTNSLVVWCFSAVFTCLGACIIYAL-NLSNKW-YLALLLTFNLFQG 124
Query: 124 ----TTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGL 179
+Q ARG+G + F G+I L LV GI GYL S + VG +
Sbjct: 125 GQSILSQYARGIGKSKIFAAGGVILTFLTGALNILFLVYLPLGITGYLMSLVLAN-VGTI 183
Query: 180 VAFLGSAEYRLLAPFRF-DRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAA 238
+ F G+ F+ D+ L+ +ML Y+LPL+P+ + WWL++ S RY VL+ G A
Sbjct: 184 LFFAGTLSIWKEISFKIIDKKLIWQMLYYALPLIPSSILWWLLNASSRYFVLFFLGAGAN 243
Query: 239 GLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAGLVI 298
GL A+K+PS+I+I ++F QAWQ S E DS + ++ V + L +
Sbjct: 244 GLLAVATKIPSIISIFNTIFPQAWQISAIEEYDSHQKSKYYSDVFHYLATFLLLGTSAFM 303
Query: 299 ALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAM 358
+ +PI ++ +++A W+YVP ML+ F + FFGT Y A ++ + ++++ G +
Sbjct: 304 IVLKPIVEKVVSSDYASSWQYVPFFMLSMLFSSFSDFFGTNYIAAKQTKGVFMTSIYGTI 363
Query: 359 VNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRR 397
V V+L V L+P +G GAGL+ + + ++R +D ++
Sbjct: 364 VCVLLQVVLLPIIGLDGAGLSAMLGFLTTFLLRVKDTQK 402
>gi|119026404|ref|YP_910249.1| hypothetical transmembrane protein possibly involved in
polysaccharide biosynthesis [Bifidobacterium
adolescentis ATCC 15703]
gi|118765988|dbj|BAF40167.1| hypothetical transmembrane protein possibly involved in
polysaccharide biosynthesis [Bifidobacterium
adolescentis ATCC 15703]
Length = 471
Score = 174 bits (442), Expect = 9e-42, Method: Composition-based stats.
Identities = 125/448 (27%), Positives = 229/448 (51%), Gaps = 14/448 (3%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
LL N +F L +A K ++ +LMPLYT L+ EYG ++ + + P+L+L + E +
Sbjct: 7 LLINVGIFGLSAVATKLMAFILMPLYTLYLSTEEYGIMDMATIMVTTLFPVLTLLISEGM 66
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNM-------EHAAAFFVLFC 116
RF++DD +++V+ + + V A+ ++++ + F + +
Sbjct: 67 LRFTLDDKSKAAFYITETMLVM----LASCVLLAIILPVFDLPIFGGLGRYKIWFLLSYA 122
Query: 117 SVCVFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLV 176
++C +AR + + +++AL M V Y+L+ H G+ GY +SY +G
Sbjct: 123 ALCFPSVMGTVARAMDQTKLIAYASMLSALIMGVLAYVLIAGMHLGLMGYFYSYIVGNGS 182
Query: 177 GGLVAFLGSAEYRLL--APFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSG 234
LV +Y+ + +R + +L +++ YSLPL PN L + + R+++ G
Sbjct: 183 AILVYLFAGKQYQFIDFTVWRNNASLRKQLWRYSLPLAPNSLCNQIQTTVSRFIITGVLG 242
Query: 235 LAAAGLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAA 294
++A+GL+ AASK+P+L+N++ + QQAWQ S +E S F+ + R Y +
Sbjct: 243 ISASGLYAAASKIPNLLNVLQQIVQQAWQLSAFQEFKSSGLKHFYDVIWRVYHALMSIGS 302
Query: 295 GLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTM 354
LVIAL+ I++V++Q +F W + +L+LA G + F GT YQA M ++ L+++T+
Sbjct: 303 ALVIALSPFIAKVLMQRQFYSVWPLISILVLAFYLGAINNFLGTIYQAYMCTKPLLIATI 362
Query: 355 MGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLAL 414
+G + + V G A L V + V+R D+RR + + M + L
Sbjct: 363 VGTVSCLAFTAMFVHSWGILAAALGVLVGSLTIFVIRVIDIRRLMRVDMRPFPTAITMGL 422
Query: 415 LSVMAVCTSFDGGSWLNGAVWVCLILLA 442
L+ +V T G +L ++ +CL+ +A
Sbjct: 423 LAAQSVVTLSQCGHYLFLSL-ICLLSIA 449
>gi|68643792|emb|CAI33988.1| flippase Wzx [Streptococcus pneumoniae]
gi|89994612|emb|CAJ84825.1| flippase Wzx [Streptococcus pneumoniae]
Length = 471
Score = 173 bits (438), Expect = 2e-41, Method: Composition-based stats.
Identities = 102/422 (24%), Positives = 211/422 (50%), Gaps = 2/422 (0%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
L+ NT++F +G L K + +L+PLYT LTA ++G E+L +A+ +++P+ S+ + + L
Sbjct: 8 LVNNTIIFTIGSLGSKFIQFLLVPLYTYTLTAAQFGITEILLTAVNLLIPVFSISIADGL 67
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAFFVLFCSVCVFKA 123
RF +D + ++ + + ++ G + + ++ + S + +F++ ++ +++
Sbjct: 68 LRFGLDKTLRRENVLKSAFIISILGTILSIISIPIFSLYPTLSEWMVYFIIILNLRMYRD 127
Query: 124 TTQLARGL-GHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLVAF 182
+ + G F +I + +++ + LV GI GY ++Y + +
Sbjct: 128 VFAIQLKVEGKNTLFACDSMIYTFVLSLASIVFLVPFSLGISGYFFAYIVSNGISIFFIL 187
Query: 183 LGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGLFT 242
++ RF++ L+ ++L YS P++ N ++WW+ + S R+++ W A GL+
Sbjct: 188 FFGGVWKSFTSGRFEKQLMIQLLKYSAPMILNGIAWWITNASDRFMLQWFMDDRAVGLYG 247
Query: 243 AASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAGLVIALNR 302
+K+P LI VF QAW S E + + F+ V Y A + + + L +
Sbjct: 248 VVAKLPLLIGTFTGVFNQAWIISAVEEFEEENEEWFYQKVFHQYYAALFLSVSVFLLLLQ 307
Query: 303 PISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVI 362
P +V + F E W+Y P L+L++ + F FY A + ++ +T+ GA N++
Sbjct: 308 PFMKVYVSPSFYEAWQYAPFLLLSSVVSGIAAFMTGFYVAQKKNLNIIYTTIAGAFANIL 367
Query: 363 LGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLALLSVMAVCT 422
L +P +G GA +A +++ ++ + R +D+ P+D+ Y L LL + +
Sbjct: 368 LNAMFIPMLGVLGASIATFLSWFVIAIYRMKDVENFACFPLDKKVFWY-LFLLCIQTITM 426
Query: 423 SF 424
+F
Sbjct: 427 TF 428
>gi|68644308|emb|CAI34412.1| flippase Wzx [Streptococcus pneumoniae]
Length = 471
Score = 172 bits (437), Expect = 3e-41, Method: Composition-based stats.
Identities = 102/422 (24%), Positives = 211/422 (50%), Gaps = 2/422 (0%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
L+ NT++F +G L K + +L+PLYT LTA ++G E+L +A+ +++P+ S+ + + L
Sbjct: 8 LVHNTIIFTIGSLGSKFIQFLLVPLYTYTLTASQFGITEILLTAVNLLIPVFSISIADGL 67
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAFFVLFCSVCVFKA 123
RF +D + ++ + + ++ G + + ++ + S + +F++ ++ +++
Sbjct: 68 LRFGLDKTLRRENVLKSAFIISILGTILSIISIPIFSLYPTLSEWMVYFIIILNLRMYRD 127
Query: 124 TTQLARGL-GHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLVAF 182
+ + G F +I + +++ + LV GI GY ++Y + +
Sbjct: 128 VFAIQLKVEGKNTLFACDSMIYTFVLSLASIVFLVPFSLGISGYFFAYIVSNGISIFFIL 187
Query: 183 LGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGLFT 242
++ RF++ L+ ++L YS P++ N ++WW+ + S R+++ W A GL+
Sbjct: 188 FFGGVWKSFTSGRFEKQLMIQLLKYSAPMILNGIAWWITNASDRFMLQWFMDDRAVGLYG 247
Query: 243 AASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAGLVIALNR 302
+K+P LI VF QAW S E + + F+ V Y A + + + L +
Sbjct: 248 VVAKLPLLIGTFTGVFNQAWIISAVEEFEEENEEWFYQKVFHQYYAALFLSVSVFLLLLQ 307
Query: 303 PISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVI 362
P +V + F E W+Y P L+L++ + F FY A + ++ +T+ GA N++
Sbjct: 308 PFMKVYVSPSFYEAWQYAPFLLLSSVVSGIAAFMTGFYVAQKKNLNIIYTTIAGAFANIL 367
Query: 363 LGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLALLSVMAVCT 422
L +P +G GA +A +++ ++ + R +D+ P+D+ Y L LL + +
Sbjct: 368 LNAMFIPMLGVLGASIATFLSWFVIAIYRMKDVENFACFPLDKKVFWY-LFLLCIQTITM 426
Query: 423 SF 424
+F
Sbjct: 427 TF 428
>gi|3320394|gb|AAC38752.1| repeat unit transporter [Streptococcus pneumoniae]
Length = 461
Score = 163 bits (413), Expect = 2e-38, Method: Composition-based stats.
Identities = 115/426 (26%), Positives = 215/426 (50%), Gaps = 16/426 (3%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
L NT +FAL + K + +L+P+YT LT EYG +L+ + I++ +P+L+L + EA+
Sbjct: 7 LAKNTGIFALANFSSKILIFLLVPIYTRVLTTTEYGFYDLVYTTIQLFVPILTLNISEAV 66
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNM--------EHAAAFFVLF 115
RF + D V K +F S+ VL + +A AL + N+ +++ FV+F
Sbjct: 67 MRFLMKDGVSKKSVF--SIAVLD--IFIGSIAFALLLLVNNLFSLSDLISQYSIYIFVIF 122
Query: 116 CSVCVFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYL 175
+ Q ++G+ + + G+I+ M+ +LLV G+ G+ + GY+
Sbjct: 123 VFYTLNNFLIQFSKGIDKIGVTAISGVISTAVMLAMNVILLVLFDWGLLGFFIANVCGYV 182
Query: 176 VGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGL 235
+ + + L + D+ L M+ Y+LPLV N+LSWW+ + S RY+V G+
Sbjct: 183 IP-CIYIVSKLRLWELFEIKIDKKLQWEMVYYALPLVLNILSWWVNNTSNRYIVTAIVGI 241
Query: 236 AAAGLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAG 295
A+ + + A K+P +++ ++++F Q+WQ S + + F ++L Y+ L A
Sbjct: 242 QASAIISVAYKIPQILSTISAIFIQSWQISAIKIQEDKSDTTFVSNMLLYYNALLLIIAS 301
Query: 296 LVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMM 355
+I +PIS ++ F W VP L++++ F ++ G A M++ + S ++
Sbjct: 302 GIILFVKPISNILFGISFYSAWELVPFLIISSLFNAISGCIGAIMGAKMDTHNIAKSALV 361
Query: 356 GAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLALL 415
G + N+IL + L MGP G ++ +A L+ +R ++ ++ + R Y +L
Sbjct: 362 GMIANIILNIVLTFLMGPQGITISTLIASFLIFYMRKDSVK---EINSETYRAIYLSWIL 418
Query: 416 SVMAVC 421
V+ C
Sbjct: 419 LVVKAC 424
>gi|68643625|emb|CAI33843.1| flippase Wzx [Streptococcus pneumoniae]
Length = 462
Score = 163 bits (412), Expect = 2e-38, Method: Composition-based stats.
Identities = 106/392 (27%), Positives = 205/392 (52%), Gaps = 5/392 (1%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
L NT FAL + K + +L+P+YT LT EYG +L+ + I++++P+L+L + EA+
Sbjct: 7 LAKNTGTFALANFSSKILIFLLVPIYTKVLTTTEYGFYDLVYTTIQLLVPILTLNISEAV 66
Query: 64 YRFSIDDDVPKDELFAGSL--VVLGGGVVCTGVACALGSALWNM--EHAAAFFVLFCSVC 119
RF + +DV K +F+ ++ + LG + C + +L + +++ +F
Sbjct: 67 MRFLMKEDVSKKSVFSIAILDIFLGSIIFCLLLLVNQIFSLSELISQYSIYIMAIFAFYT 126
Query: 120 VFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGL 179
+ Q ++G+ + + G+I+A M+ LLLV + G+ G+ + GY++ +
Sbjct: 127 LNNFLIQYSKGIDKIGVTAISGVISAAVMLSMNILLLVVLNWGLLGFFIANICGYVIPCV 186
Query: 180 VAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAG 239
+ + L + DR+L M+ Y+LPL+ N LSWW+ + S RY++ G+ A+
Sbjct: 187 YIIVKLKLWDLFE-LKIDRSLQWEMIYYTLPLILNTLSWWVNNTSDRYIITVIIGIQASA 245
Query: 240 LFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAGLVIA 299
+ + A K+P + + ++++F Q+WQ S + + + F +L Y+ L A +I
Sbjct: 246 IISVAYKIPQIFSTISAIFIQSWQISAIKIQEEKEGNTFISKMLLYYNALLLIIASGIIL 305
Query: 300 LNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMV 359
+PIS ++ A F W VP L++++ F ++ + G A M+++ + S ++G +
Sbjct: 306 FVKPISNILFGASFYSAWTLVPFLIISSLFNAISGYIGAIMGAKMDTKNIAKSALVGMIA 365
Query: 360 NVILGVALVPFMGPWGAGLAGAVAYALVLVVR 391
NV L + L MG G ++ +A L+ +R
Sbjct: 366 NVFLNIVLTFLMGLQGITISTMIASFLIFYMR 397
>gi|149018047|ref|ZP_01834506.1| hypothetical protein CGSSp23BS72_11930 [Streptococcus pneumoniae
SP23-BS72]
gi|3818491|gb|AAC69533.1| Cps23fJ [Streptococcus pneumoniae]
gi|68643653|emb|CAI33865.1| flippase Wzx [Streptococcus pneumoniae]
gi|147931611|gb|EDK82589.1| hypothetical protein CGSSp23BS72_11930 [Streptococcus pneumoniae
SP23-BS72]
Length = 461
Score = 162 bits (411), Expect = 3e-38, Method: Composition-based stats.
Identities = 115/426 (26%), Positives = 215/426 (50%), Gaps = 16/426 (3%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
L NT +FAL + K + +L+P+YT LT EYG +L+ + I++ +P+L+L + EA+
Sbjct: 7 LAKNTGIFALANFSSKILIFLLVPIYTRVLTTTEYGFYDLVYTTIQLFVPILTLNISEAV 66
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNM--------EHAAAFFVLF 115
RF + D V K +F S+ VL + +A AL + N+ +++ FV+F
Sbjct: 67 MRFLMKDGVSKKSVF--SIAVLD--IFIGSIAFALLLLVNNLFSLSDLISQYSIYIFVIF 122
Query: 116 CSVCVFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYL 175
+ Q ++G+ + + G+I+ M+ +LLV G+ G+ + GY+
Sbjct: 123 VFYTLNNFLIQFSKGIDKIGVTAISGVISTAVMLAMNVILLVVFDWGLLGFFIANVCGYV 182
Query: 176 VGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGL 235
+ + + L + D+ L M+ Y+LPLV N+LSWW+ + S RY+V G+
Sbjct: 183 IP-CIYIVSRLRLWELFEIKIDKKLQWEMVYYALPLVLNILSWWVNNTSDRYIVTAIVGI 241
Query: 236 AAAGLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAG 295
A+ + + A K+P +++ ++++F Q+WQ S + + F ++L Y+ L A
Sbjct: 242 QASAIISVAYKIPQILSTISAIFIQSWQISAIKIQEDKSDTTFVSNMLLYYNALLLIIAS 301
Query: 296 LVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMM 355
+I +PIS ++ F W VP L++++ F ++ G A M++ + S ++
Sbjct: 302 GIILFVKPISNILFGISFYSAWELVPFLIISSLFNAISGCIGAIMGAKMDTHNIAKSALV 361
Query: 356 GAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLALL 415
G + N+IL + L MGP G ++ +A L+ +R ++ ++ + R Y +L
Sbjct: 362 GMIANIILNIVLTFLMGPQGITISTLIASFLIFYMRKDSVK---EINSETYRAIYLSWIL 418
Query: 416 SVMAVC 421
V+ C
Sbjct: 419 LVVEAC 424
>gi|68643597|emb|CAI33820.1| flippase Wzx [Streptococcus pneumoniae]
Length = 461
Score = 162 bits (410), Expect = 4e-38, Method: Composition-based stats.
Identities = 115/426 (26%), Positives = 215/426 (50%), Gaps = 16/426 (3%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
L NT +FAL + K + +L+P+YT LT EYG +L+ + I++ +P+L+L + EA+
Sbjct: 7 LAKNTGIFALANFSSKILIFLLVPIYTRVLTTTEYGFYDLVYTTIQLFVPILTLNISEAV 66
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNM--------EHAAAFFVLF 115
RF + D V K +F S+ VL + +A AL + N+ +++ FV+F
Sbjct: 67 MRFLMKDGVSKKSVF--SIAVLD--IFIGSIAFALLLLVNNLFSLSDLISQYSIYIFVIF 122
Query: 116 CSVCVFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYL 175
+ Q ++G+ + + G+I+ M+ +LLV G+ G+ + GY+
Sbjct: 123 VFYTLNNFLIQFSKGIDKIGVTAISGVISTAVMLAMNVILLVVFDWGLLGFFIANVCGYV 182
Query: 176 VGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGL 235
+ + + L + D+ L M+ Y+LPLV N+LSWW+ + S RY+V G+
Sbjct: 183 IP-CIYIVSRLRLWELFEIKIDKKLQWEMVYYALPLVLNILSWWVNNTSDRYIVTAIVGI 241
Query: 236 AAAGLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAG 295
A+ + + A K+P +++ ++++F Q+WQ S + + F ++L Y+ L A
Sbjct: 242 QASAIISVAYKIPQILSTISAIFIQSWQISAIKIQEDKSGTTFVSNMLLYYNALLLIIAS 301
Query: 296 LVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMM 355
+I +PIS ++ F W VP L++++ F ++ G A M++ + S ++
Sbjct: 302 GIILFVKPISNILFGISFYSAWELVPFLIISSLFNAISGCIGAIMGAKMDTHNIAKSALV 361
Query: 356 GAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLALL 415
G + N+IL + L MGP G ++ +A L+ +R ++ ++ + R Y +L
Sbjct: 362 GMIANIILNIVLTFLMGPQGITISTLIASFLIFYMRKDSVK---EINSETYRAIYLSWIL 418
Query: 416 SVMAVC 421
V+ C
Sbjct: 419 LVVEAC 424
>gi|116618520|ref|YP_818891.1| Polysaccharide transport membrane protein [Leuconostoc
mesenteroides subsp. mesenteroides ATCC 8293]
gi|116097367|gb|ABJ62518.1| Polysaccharide transport membrane protein [Leuconostoc
mesenteroides subsp. mesenteroides ATCC 8293]
Length = 469
Score = 134 bits (337), Expect = 1e-29, Method: Composition-based stats.
Identities = 98/390 (25%), Positives = 194/390 (49%), Gaps = 7/390 (1%)
Query: 7 NTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYRF 66
+T++ + L+ K + VL+P+YT ALT ++G +L+ S +++PL+SL +A++R+
Sbjct: 13 DTILITIASLSSKFIGFVLLPVYTHALTPTQFGLGDLIFSISSLLVPLVSLSSFDAIFRY 72
Query: 67 SIDDD--VPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAFFVLFCSVCVFKAT 124
+ DD + K L + + G V + + L N+ + + + +F +
Sbjct: 73 MLKDDNVIDKKSLVSSVTFISIVGSTLLFVIVSFIN-LNNINGLKIWLCISVVMGIFNSL 131
Query: 125 TQ-LARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLVAFL 183
Q +G + G++ ++ +S + L + GY + IG F
Sbjct: 132 LQAYTKGTRQNKYLASSGILGSIVTAISAVVTLKLFDLSLNGYFLTLCIGTFCSNFYLFF 191
Query: 184 GSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGLFTA 243
+ + L+ + L++ LVYS+PL+PN +SWW+ S R + G G+F
Sbjct: 192 RTGITKELSYKYVNVTTLKKTLVYSMPLIPNAISWWITSDISRLFIYSLIGSFGNGMFAV 251
Query: 244 ASKMPSLINIVASVFQQAWQYSTAREIDSPDRGA--FFGSVLRGYSLATLSAAGLVIALN 301
ASK+PSL+N+ +F QAWQ + +ID + G S+ R T AGL++ L
Sbjct: 252 ASKIPSLLNMFFGIFNQAWQITAITKIDGDETGTEYLLSSIYRTMQF-TFVMAGLLVILI 310
Query: 302 RPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNV 361
I +++ + W+ VP L+ A ++ GT++ +++++++T++G ++N+
Sbjct: 311 PEIYQIIAPMTYFSSWKIVPPLLFGAIMSMVAGQMGTYFLTKEKTKIILITTIIGMILNM 370
Query: 362 ILGVALVPFMGPWGAGLAGAVAYALVLVVR 391
+LG L +G +G+ + +++ +V V+R
Sbjct: 371 VLGYLLTLSLGLFGSAITSGISFLIVSVLR 400
>gi|150007213|ref|YP_001301956.1| exopolysaccharide biosynthesis protein [Parabacteroides distasonis
ATCC 8503]
gi|149935637|gb|ABR42334.1| exopolysaccharide biosynthesis protein [Parabacteroides distasonis
ATCC 8503]
Length = 479
Score = 127 bits (319), Expect = 2e-27, Method: Composition-based stats.
Identities = 98/394 (24%), Positives = 194/394 (49%), Gaps = 12/394 (3%)
Query: 7 NTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYRF 66
N+L+ +G + K ++++++P YT L+ YG +L+++ + +++ +++L V +A++RF
Sbjct: 18 NSLLVFVGSIGSKLLAILMLPFYTAWLSVESYGDVDLVSTYVSLIMGIVTLCVTDAIFRF 77
Query: 67 SIDDDVPKDE-LFAGSL------VVLGGGVVCTGVACALGSALWNMEHAAAFFVLFCSVC 119
+ LF+ L +VL GG+ + A G W + A A ++ +
Sbjct: 78 PQGKGREEQRILFSSGLAVVSLSLVLFGGLYYGLLKWATG-VRWGVFDAYAGYIYTLLIF 136
Query: 120 VFKAT--TQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVG 177
F Q +R + + FV+ G+I + +S+ LL+ + G++GYL S IG+
Sbjct: 137 SFLQLYLQQFSRSIDRMSVFVMSGVILTFVIALSSLLLVPK--WGVDGYLISQLIGFAAS 194
Query: 178 GLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAA 237
R L+ + ++ ML YS+P++PN WW++ S R +
Sbjct: 195 ISYTSFSMRIDRYLSYSSISVSAMKEMLSYSIPMIPNATLWWILGTSNRLFLEHYHSTDL 254
Query: 238 AGLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAGLV 297
G+F +++ PS+I +V + F +WQ S E + P F+ ++R + L L+
Sbjct: 255 VGIFAVSNRFPSIITMVFNTFFLSWQISVLEEFEKPGFRIFYNRIMRLCFVLLLLVEFLL 314
Query: 298 IALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGA 357
+ + R + + WRY+P L LAA F + GT + A N+R + +++ G
Sbjct: 315 SLSSYWLIRFFADEAYLDAWRYIPFLGLAAVFSSFSTLMGTMFMATKNTRYFLTTSLWGG 374
Query: 358 MVNVILGVALVPFMGPWGAGLAGAVAYALVLVVR 391
++ + + + L+P G GA ++ +A+ ++L +R
Sbjct: 375 LLCLGMNLLLIPLYGIMGATVSLLIAHFIILYLR 408
>gi|156869588|gb|EDO62960.1| hypothetical protein CLOLEP_00255 [Clostridium leptum DSM 753]
Length = 463
Score = 103 bits (257), Expect = 3e-20, Method: Composition-based stats.
Identities = 94/419 (22%), Positives = 182/419 (43%), Gaps = 10/419 (2%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
L V+ +G + K + +L+P+YT YG +L + + ++ LL + V L
Sbjct: 3 FLKKMAVYFIGTIFNKIIVFLLLPIYTANFDPASYGDNDLSMTTVTMLASLLFMEVWTPL 62
Query: 64 YRFSIDDDV--PKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAFFVLF-CSVCV 120
RFS D+ K ++F ++V L G +A + A W + + + S+ +
Sbjct: 63 LRFSYDEHTLEGKQKIFT-NVVSLSGACFPLYIAGCVLIAFWQELPNPGWMIAYGLSLLI 121
Query: 121 FKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLV 180
AR +G + F++ G+++++ + + +L+ G L + + + L
Sbjct: 122 LHIAQFEARAIGDSKDFMISGMVSSVFQFILSAVLIYGFRVGAVAILIAPAVSNV---LA 178
Query: 181 AFLGSAEYRLLAPFRF---DRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAA 237
A A +R L R +A+LR + YS PL N +++W ++ RY W A
Sbjct: 179 AMYIEARHRFLRKVRLKDVSKAMLRDLTKYSFPLAINAVAFWGMTNINRYFAKWYLSEDA 238
Query: 238 AGLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAGLV 297
G A+K +LI + V+ AWQ S S R ++ +L Y V
Sbjct: 239 NGYIALANKFAALIVTLIKVYALAWQESAYEHSGSDQRSRYYSKMLVVYMDFMAFGTAAV 298
Query: 298 IALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGA 357
I + + + +P+ L+A M+ F+G + A + +L+ ST++GA
Sbjct: 299 IVGTNVLFPFFIDDAYTLTQLILPIYYLSAFANAMSNFYGHIFNAEKKTNILLYSTLLGA 358
Query: 358 MVNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLALLS 416
VN+ L A + G + +A + Y +++R +++ + + D RL + +++
Sbjct: 359 AVNIGLLYATIQIWGLYAVPIALTLGYLANVLMRILSIQKTVKVSFDLKRLALDIVVMA 417
>gi|154148138|ref|YP_001407016.1| polysaccharide biosynthesis protein [Campylobacter hominis ATCC
BAA-381]
gi|153804147|gb|ABS51154.1| polysaccharide biosynthesis protein [Campylobacter hominis ATCC
BAA-381]
Length = 471
Score = 102 bits (253), Expect = 7e-20, Method: Composition-based stats.
Identities = 92/443 (20%), Positives = 197/443 (44%), Gaps = 15/443 (3%)
Query: 10 VFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYRFSID 69
++ LG + K ++ ++P+YT L E+G +L + ++ + L + + +F +D
Sbjct: 13 IYFLGSILSKLIAFFMLPIYTRYLNPAEFGEYDLSLAYMQFFISFFFLDIYIGIMKFLLD 72
Query: 70 DDVP-----KDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAFFVLFCSVCVFKAT 124
++ K+++ + + + L +++++ + L +
Sbjct: 73 PNLKEIENYKEKVLFSGFFIFFISTLVYFLFFYLFMQIYDIKFKSILLCLGLFSNLQGVY 132
Query: 125 TQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLVA-FL 183
+ R G FV+ G++N++ +++++LL + G E +G L G LVA F+
Sbjct: 133 GYICRAYGKNSVFVISGVVNSILYSITSFVLLYKFDFGYEALF----LGALFGNLVAIFM 188
Query: 184 GSAEYRLLAPFRF---DRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGL 240
+++ +F D +R+L YSLPL N LS+W +V GR V+ A G
Sbjct: 189 MECRVKVIKKIKFALFDFTFFKRLLYYSLPLCLNSLSYWFAAVYGRSVIANKLSYADNGY 248
Query: 241 FTAASKMPSLINIVASVFQQAWQ-YSTAREIDSPDRGAFFGSVLRGYSLATLSAAGLVIA 299
+T A K +I++VA F+ WQ S ++++ + F+ Y L + +++
Sbjct: 249 YTIALKFAMIISLVAMCFKLGWQEVSFSKDLKKQNNSIFYSRACNEYLKFMLLSVAVLLP 308
Query: 300 LNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMV 359
+ + I + + F E Y+PL +L A + F + + ++ L ++T GA +
Sbjct: 309 VIKIIFPFFINSAFNEALIYIPLTLLGAIVFGFSEFLMSIINTINKNKYLFIATACGATL 368
Query: 360 NVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLALLSVMA 419
N+IL + + + + Y+LV + + + L + + L +L + +
Sbjct: 369 NIILLYLFIYKFKVFAVAFSYIICYSLVCFIMVLVINKTFKLNIKMLNIMLYLFILLIQS 428
Query: 420 -VCTSFDGGSWLNGAVWVCLILL 441
+ +G + + + VCL+ L
Sbjct: 429 LIMIKCNGYANIISFIVVCLLFL 451
>gi|156868909|gb|EDO62281.1| hypothetical protein CLOLEP_01142 [Clostridium leptum DSM 753]
Length = 468
Score = 97.1 bits (240), Expect = 2e-18, Method: Composition-based stats.
Identities = 83/355 (23%), Positives = 164/355 (46%), Gaps = 2/355 (0%)
Query: 9 LVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYRFSI 68
+V+ +G + K +S L+PLYTT ++ EYG + ++I++ +L + + + RF
Sbjct: 1 MVYFMGNVLTKIISFFLLPLYTTRISTDEYGYFNTSTNYLDILIAILCMEIWSVIMRFMF 60
Query: 69 DDD--VPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAFFVLFCSVCVFKATTQ 126
D + K++ LV+ ++ A+ + ++++ F++ + T
Sbjct: 61 DYEGRRGKEKAITNGLVIFCVSLLGYCAVFAVLALALDLQYLLLIFIMGLCSMIQAIYTY 120
Query: 127 LARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLVAFLGSA 186
RGLG+ F + G++ +L ++ L++ ++ + GY+V L+
Sbjct: 121 TTRGLGYNAVFAVSGIVGSLVNSLTNVALILGFSMTVKSLYLAAIAGYVVQILMLERKVH 180
Query: 187 EYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGLFTAASK 246
R + P LL+RML +SLPL N + ++ +V GL+ G+F+AA K
Sbjct: 181 LLRAIHPKLVRPRLLKRMLRFSLPLAANAVCACFLTNYNSIMVTNLLGLSENGVFSAAGK 240
Query: 247 MPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAGLVIALNRPISR 306
+ +V+S F AWQ + D+GAF+ + Y L++ + +
Sbjct: 241 FAVALTLVSSCFSMAWQELFFSKGGEEDKGAFYTRAMSYYYRFLTLTLALLLPVVNVVFS 300
Query: 307 VMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNV 361
+ E+A + VPL +++ + + F G + + + ++MVST+ A VNV
Sbjct: 301 FFIGPEYAGAFPLVPLYLISTGASIYSGFLGYIFSSDKRTNVVMVSTLAAAAVNV 355
>gi|20093247|ref|NP_619322.1| polysaccharide biosynthesis protein [Methanosarcina acetivorans
C2A]
gi|19918600|gb|AAM07802.1| polysaccharide biosynthesis protein [Methanosarcina acetivorans
C2A]
Length = 490
Score = 84.7 bits (208), Expect = 1e-14, Method: Composition-based stats.
Identities = 89/388 (22%), Positives = 171/388 (44%), Gaps = 21/388 (5%)
Query: 22 SLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYRFSIDDDVPK---DELF 78
+ L+P+ T L A +YG +N + +V PL +G+ + RF + K + ++
Sbjct: 25 TFFLLPVITKTLGAYDYGIWAQINITVSLVSPLALMGLSMSFIRFLSSETEKKKIREVVY 84
Query: 79 AGSLVVLGGGVVCTGVACALGSALWNM---EHAAAFFVLFCSVCVFKATTQ-----LARG 130
+ V G + + + L + +A +FV S+ +F + R
Sbjct: 85 SILFFVTVSGFLASSLLYVFAEPLATFGFQDPSATYFVQAGSLLIFLNVIESISLFYFRV 144
Query: 131 LGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLVAFLGSAEYRL 190
+++F + L ++ T + L + G L T +V GL+ +
Sbjct: 145 FRQIKKFSYFTLFETFGKLLFTLIFLKMGY----GLLGVITATLMVQGLIFLIAFVTIVS 200
Query: 191 LAPFRFDR-ALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGLFTAASKMPS 249
F + +R L +SLPL PN+L W+ S RY+V + GL + G+++AA + +
Sbjct: 201 QIGFVIPQFTCIREHLQFSLPLTPNVLIRWVTDSSDRYMVTYFLGLGSVGVYSAACSIGN 260
Query: 250 LINIVASVFQQAWQYSTAREID---SPDRGAFFGSVLRGYSLATLSAAGLVIALNRPISR 306
LI + S Q ++ D + + + LR + + + A + AL++P+
Sbjct: 261 LIQLFVSPLQLILFPELSKLFDENKTDEVRIYMSHSLRYFLIIAIPAVFGLSALSKPLLG 320
Query: 307 VMLQAEFAEGWRYVPLLMLAATF-GVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVILGV 365
V+ +F GW +P++ A G+ IF T + + +R ++ A+ NV++ +
Sbjct: 321 VLTTQDFVSGWFVIPIIAFAGLLVGIFQIFVNTIF-LIKKTRPATYINILAAVSNVLINL 379
Query: 366 ALVPFMGPWGAGLAGAVAYALVLVVRAR 393
L+P +G GA L+ V+Y L+ + R
Sbjct: 380 ILIPSVGIAGAALSTLVSYFLMAALCMR 407
>gi|149182088|ref|ZP_01860572.1| polysaccharide biosynthesis [Bacillus sp. SG-1]
gi|148850190|gb|EDL64356.1| polysaccharide biosynthesis [Bacillus sp. SG-1]
Length = 462
Score = 80.9 bits (198), Expect = 2e-13, Method: Composition-based stats.
Identities = 95/391 (24%), Positives = 175/391 (44%), Gaps = 26/391 (6%)
Query: 16 LAVKAVSLVLMPLYTTALT-AGEYGTAELLNSAIEIVLPLLSLGVVEAL--YRFSIDDDV 72
+ K ++ +++P+YT L+ +YG + ++ ++ L+ G AL Y F D
Sbjct: 3 VGTKIIAFIMLPIYTRFLSDPSQYGVLDYIDRITSMLTFLVIFGTDSALAYYYFEAKDQK 62
Query: 73 PKD------ELFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAFFVLFCSVCVFKATTQ 126
+D +F +V++ G VV G L L +E ++L ++ T
Sbjct: 63 LRDYYVRTVMMFRLVIVMILGLVVFAG-GDWLSELL--LEETGKSYLLLIAIGTLFFDTI 119
Query: 127 LARGLGHVR------RFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLV 180
L L +R R V + ++ L + V +YL+L+ + G L + + VG +
Sbjct: 120 LTLILTVLRYDFFTKRVVFFTVLKMLLIAVVSYLMLITIWPTVTGILAARILS--VGFVA 177
Query: 181 AFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGL 240
L + + P+ FD+ LL +L Y+ PLVP L++W++ + + + G+
Sbjct: 178 LLLFKYAVKYVRPY-FDKKLLVEVLKYAAPLVPASLAFWVIVNANTMFLKEYTSFEEVGI 236
Query: 241 FTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAGLVIAL 300
+ A K +LI +V S Q AW+ + D + F + L + +++A
Sbjct: 237 YGTAIKFAALITLVTSGVQMAWRPFSMSLKDKENNARIFSKIYMAILLIG-TFGIMIVAT 295
Query: 301 NRPISRVMLQAEFAEGWRYVPLLMLAA--TFGVMTIFFGTFYQALMNSRMLMVSTMMGAM 358
P +L + + E ++YV L+ F M I G F+ ++ + + M A+
Sbjct: 296 IMPWVIKVLDSNYWEAYKYVALISTTTFLNFYYMIISIGIFFTK--KTKYISYAFGMAAI 353
Query: 359 VNVILGVALVPFMGPWGAGLAGAVAYALVLV 389
VNVIL +AL+P WGA A ++Y + +V
Sbjct: 354 VNVILNIALIPKFSIWGAVAAYLLSYIVAIV 384
>gi|119478059|ref|ZP_01618138.1| Membrane protein involved in the export of O-antigen and teichoic
acid [marine gamma proteobacterium HTCC2143]
gi|119448765|gb|EAW30008.1| Membrane protein involved in the export of O-antigen and teichoic
acid [marine gamma proteobacterium HTCC2143]
Length = 472
Score = 78.2 bits (191), Expect = 1e-12, Method: Composition-based stats.
Identities = 93/414 (22%), Positives = 179/414 (43%), Gaps = 30/414 (7%)
Query: 16 LAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYRFSIDDDVPKD 75
+A A S++++P+YT L+ YGTAELLN +++ + LL + +++F D + K+
Sbjct: 1 MARTATSIIMLPIYTRYLSPSVYGTAELLNIILDLTVLLLGARITTGMFKFYADAENQKE 60
Query: 76 E-------LFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAF-FVLFCSVCVFKATTQL 127
+ L+ G L + ++ G + + AL + A +++F F ++
Sbjct: 61 KHTVLSTCLWLGLLFNIVAMMILWGASPLIALALGDPSITEALRWIMF--TLAFATIGEI 118
Query: 128 ARGLGHVR------RFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLVA 181
G+G+ R R++L L+ + + ++ +V G+ G ++S IG G A
Sbjct: 119 --GMGYFRVNDQAGRYLLVSLLRLASQIAASLYFIVYKEAGLWGVIYSALIG---AGFQA 173
Query: 182 FLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGLF 241
L RF + + ++++S+P++ + ++ + ++ RY + ++ G++
Sbjct: 174 LLLLTVIISTIGLRFSTTIAKNLIIFSMPIIISSIAMYYMTFGDRYFLQIFHDTSSVGIY 233
Query: 242 TAASKMP-SLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATL----SAAGL 296
K L+ +V F W SP FF + +S+AT A G+
Sbjct: 234 ALGYKFGFMLLALVWGPFMSYWGAKQFDHARSPGGAEFFS---QAFSIATYILWAGATGM 290
Query: 297 VIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMG 356
+I + P + M A + VP ++LA F F N+R L T +
Sbjct: 291 LIFV-APAIQFMADASYHSAAHLVPPIVLAYVFHGWAQFHQFGILDSKNTRFLNTYTWIS 349
Query: 357 AMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTY 410
A V I + L+P G GA +A V L V+ R ++ D ++ +
Sbjct: 350 AAVMSIFYILLIPPYGGMGAAIATLVGMVLRFVLIYRKSQQYFKFITDWKKIFF 403
>gi|118580872|ref|YP_902122.1| polysaccharide biosynthesis protein [Pelobacter propionicus DSM
2379]
gi|118503582|gb|ABL00065.1| polysaccharide biosynthesis protein [Pelobacter propionicus DSM
2379]
Length = 490
Score = 75.1 bits (183), Expect = 9e-12, Method: Composition-based stats.
Identities = 98/400 (24%), Positives = 174/400 (43%), Gaps = 34/400 (8%)
Query: 7 NTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYRF 66
++L++ LG A+ L ++PL+T L +YG +L+S++E++ L +LG A+ RF
Sbjct: 13 HSLIYMLGWGIAAAIRLGMLPLFTRCLGQQDYGIISILDSSVELLRILCALGFSSAIVRF 72
Query: 67 SIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAFFVLFCSVCVFKATTQ 126
D D F + V+ G G ++ + F+L V V
Sbjct: 73 YNDS---TDVCFRRT-VMATGACFSLLATVFFGLGIFPFSKQISEFILGNGVNVLYFKLA 128
Query: 127 LARGLGHVRRFVL------------YGLINALAMVVSTYL---LLVRAHTGIEGYLWSYT 171
A L ++ R V Y +IN + +V + L L+V G+ G L
Sbjct: 129 FATILLNIIRSVADSYLTVNKYSVQYIVINTVQVVFQSSLNLYLVVFRDLGVTGML---- 184
Query: 172 IGYLVGGLVAFLGSAEYRLLAPFR----FDRALLRRMLVYSLPLVPNLLSWWLVSVSGRY 227
+G L LVAF+ F+ D ++ ML +SLPLVP +L+ + R+
Sbjct: 185 VGNL---LVAFIFDVGLYAFVAFKNGLVIDFKVIIPMLKFSLPLVPTVLAAAAMHNLDRF 241
Query: 228 VVLWGSGLAAAGLFTAASKMPSLINIV-ASVFQQAWQYSTAREIDS-PDRGAFFGSVLRG 285
+ + + + GL++ A + P ++N V S F + W+ S+ EI PD +G +
Sbjct: 242 FIKFFASMEDVGLYSLAYQFPFMLNTVFMSSFVRIWESSSIYEIAKYPDASYQYGKICT- 300
Query: 286 YSLATLSAAGLVIA-LNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALM 344
Y + L+ A L++A ++ + ++ + Y+P++ L + F
Sbjct: 301 YFMTILAYALLILAVMSDLVMKIFAAPSYFTAHEYIPIISLGVWGYALHTFVRVGVNLSK 360
Query: 345 NSRMLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAY 384
+ + ++ M+ + NV L LVP G +GA A Y
Sbjct: 361 KTHLFTINYMLTLIYNVGLNCLLVPKWGAFGAAWATVGTY 400
>gi|56421698|ref|YP_149016.1| polysaccharide biosynthesis [Geobacillus kaustophilus HTA426]
gi|56381540|dbj|BAD77448.1| polysaccharide biosynthesis [Geobacillus kaustophilus HTA426]
Length = 482
Score = 72.4 bits (176), Expect = 6e-11, Method: Composition-based stats.
Identities = 110/482 (22%), Positives = 203/482 (42%), Gaps = 43/482 (8%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
L ++L++A + K ++ +++P+YT+ L+ +YG L++ ++ L+ G AL
Sbjct: 8 LGADSLLYAFMNVGTKLIAFLMLPIYTSYLSKAQYGAVYLIDQWTSMLTFLVIFGTDSAL 67
Query: 64 YRFSID-DDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAFF-------VLF 115
+ D DD K L+ +++ VV + + W A A F +L+
Sbjct: 68 SFYYYDTDDKEKRLLYVRNVMYFRLFVVAILFLAVVLAGPWI---AGALFQEPRYVDLLY 124
Query: 116 CSVC------VFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWS 169
S+ +F T + R ++ V++ L+ L + V +Y L EG L
Sbjct: 125 ISIATLLLDTIFVMATTVLRFEFQTKKVVIWTLVKMLLVAVLSYAALRWFAATPEGLL-- 182
Query: 170 YTIGYLVGGLVAFLGSAEYR---LLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGR 226
IG LV + FL ++ RFD +L+ +L Y+ PLVP L++W+++
Sbjct: 183 --IGRLVSSALVFLLMLHLTVKYMVWRVRFD--VLKELLAYAAPLVPTSLAFWVIANVST 238
Query: 227 YVVLWGSGLAAAGLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGY 286
+ + + L G+F A ++ ++I ++ S Q AW+ + D P+ A F V
Sbjct: 239 FFIQRFASLEEVGVFGVALRLATVITLITSGVQMAWRPYSMSMKDRPESRALFAKVY--M 296
Query: 287 SLATLSAAG-LVIALNRP-ISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALM 344
+L + A G L+IA P + + E+ + Y+P L + T
Sbjct: 297 ALLLIGAFGLLLIATASPWLVETFFKPEYRDAAFYIPFLSAVTFLNFYYLIVSTGLFLTK 356
Query: 345 NSRMLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMD 404
+ + + A++++ L LVP WGA A + Y + + R ++ +P+
Sbjct: 357 ETGYISRVFTLAALLHLALNAVLVPLWLSWGAVAASLITYIVAVAFIFRKSQQVYPVPVS 416
Query: 405 RSRLTYQLALLSVMAVCTSFDGGSWLNGAVWVCLILLATSDMSVLGGGARAVAAAFAGRL 464
++ L G+ L V V + + +D VL G A FA R+
Sbjct: 417 WKKMALVLV-------------GTLLALIVIVYVQQASLADGWVLAGWGLFAATLFASRV 463
Query: 465 GR 466
R
Sbjct: 464 DR 465
>gi|21227237|ref|NP_633159.1| Transporter [Methanosarcina mazei Go1]
gi|20905581|gb|AAM30831.1| Transporter [Methanosarcina mazei Go1]
Length = 492
Score = 72.0 bits (175), Expect = 8e-11, Method: Composition-based stats.
Identities = 84/384 (21%), Positives = 161/384 (41%), Gaps = 23/384 (5%)
Query: 22 SLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYRF---SIDDDVPKDELF 78
S L+P+ T L A +YG +N + ++ L +G+ + RF D ++ ++
Sbjct: 25 SFFLLPIITKTLGAYDYGLWAQINITVSLISSLALMGLSMSFVRFLSSETDKKKIREAVY 84
Query: 79 AGSLVVLGGGVVCTGVACALGSALWNM---EHAAAFFVLFCSVCVFKATTQ-----LARG 130
+ V G + + V L + AA +FV S+ + + R
Sbjct: 85 SILFFVTVSGFLASFVLYTFAEPLATFGFKDPAATYFVQAGSLLILVNVIESISLFYFRI 144
Query: 131 LGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLVAFLGSAEYRL 190
+ ++ + ++ + L + G L T +V GL+ +
Sbjct: 145 FRQIEKYSYFTFFETFGKLLFILIFLKMGY----GLLGVITATLIVQGLIFLITFIIIIS 200
Query: 191 LAPFRFDR-ALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGLFTAASKMPS 249
F R ++ L +SLPL PN L W+ S RY+V + GL + G+++AA + +
Sbjct: 201 QIGFVIPRFTYIKEYLEFSLPLTPNALIRWVTESSDRYMVTYFLGLGSVGIYSAACSIGN 260
Query: 250 LINIVASVFQQAWQYSTAREIDS---PDRGAFFGSVLRGYSLATLSAAGLVIALNRPISR 306
LI + + Q ++ D + + LR + + ++ A + AL +P+
Sbjct: 261 LIQLFVNPLQLILFPELSKLFDQNRMDEVRIYMSHSLRYFLIISIPAVFGLSALAKPLLA 320
Query: 307 VMLQAEFAEGWRYVPLLMLAATF-GVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVILGV 365
V+ +F GW +P++ A G+ IF T Y + +R ++ A+ NV++ +
Sbjct: 321 VLTTPDFISGWFVIPIIAFAGLMAGIFQIFINTMY-LIKETRPATYINIIAAVSNVLINL 379
Query: 366 ALVPF--MGPWGAGLAGAVAYALV 387
L+P +G GA + V+Y L+
Sbjct: 380 ILIPIPSIGILGAAFSTLVSYFLM 403
>gi|156740780|ref|YP_001430909.1| polysaccharide biosynthesis protein [Roseiflexus castenholzii DSM
13941]
gi|156232108|gb|ABU56891.1| polysaccharide biosynthesis protein [Roseiflexus castenholzii DSM
13941]
Length = 488
Score = 71.6 bits (174), Expect = 1e-10, Method: Composition-based stats.
Identities = 92/402 (22%), Positives = 174/402 (43%), Gaps = 20/402 (4%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYG---TAELLNSAIEIVLPLLSLGVV 60
L+ ++++ +G + +KA + L+P+YT L +YG L+ + I + L +
Sbjct: 8 LILTSVIYGIGDILLKAFNFFLLPIYTRLLNPSDYGILAVTGFLSFIMSIFMSLSLHSFL 67
Query: 61 EALYRFSIDDDVPKDELFAGSLVVLG---GGVVCTG---VACALGSALWNMEHAAAFFVL 114
LY FS + + E GSL +L GGV+ VA S + + L
Sbjct: 68 TPLY-FSTGNHQERRENI-GSLFLLSILVGGVIALSIDQVARPFFSLVIRDVPFDPYIRL 125
Query: 115 FCSVCVFKATTQLARGLGHVRR----FVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSY 170
+++ L VR +VL L ++L V + +V GIEG L +
Sbjct: 126 TIWTSYLSTWSRIPLLLLRVRERSFAYVLITLSSSLLQTVLSIWFVVYLKRGIEGLLIAN 185
Query: 171 TIGYLVGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVL 230
I V ++ + + +L+ + + L++SLPL+P+ LS W++ +S R ++
Sbjct: 186 LIATSVTSVICLISVSSNMILS---LKVKIWKHALIFSLPLIPHELSGWILELSDRAILQ 242
Query: 231 WGSGLAAAGLFTAASKMPSLINIVASVFQQAW-QYSTAREIDSPDRGAFFGSVLRGYSLA 289
W L G+++ A SLI ++ AW + + +R + S + Y
Sbjct: 243 WYVSLDQVGIYSLAYSYGSLITLIGYAMNMAWVPFLHKTDSIEGERASERFSYMATYFTV 302
Query: 290 TLSAAGLVIAL-NRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRM 348
TL GL + L +P+ ++ A F + PL++ A + F F +
Sbjct: 303 TLCFFGLFLGLAAKPVISLITTALFHDAGGIAPLIVAALLLSNIYYFPVNFIFLRRKTTK 362
Query: 349 LMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVV 390
+ +T++ + N+IL + L+P G A ++Y ++ ++
Sbjct: 363 VAFATVVSSFTNIILNLWLIPVHGVIAAAWTTFISYGVMTLL 404
>gi|28378721|ref|NP_785613.1| repeat unit transporter [Lactobacillus plantarum WCFS1]
gi|28271558|emb|CAD64463.1| repeat unit transporter [Lactobacillus plantarum WCFS1]
Length = 483
Score = 70.1 bits (170), Expect = 3e-10, Method: Composition-based stats.
Identities = 102/439 (23%), Positives = 184/439 (41%), Gaps = 32/439 (7%)
Query: 8 TLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYRFS 67
+L F + K++S++ P++T ++ EYG +L S + IV ++SL + +Y
Sbjct: 19 SLWFLVCAFFEKSISIIATPIFTRIMSTSEYGQFNVLYSWLTIVTIIVSLNLCYGVYTQG 78
Query: 68 IDDDVPKDELFAGSLVVLGGGVVCTGVACALG-SALWN------MEHAAAFFVLFCSVCV 120
+ ++ +L L +V LG WN A ++ + V
Sbjct: 79 LIKFSHDRRRYSAALQGLTVVLVLAWTLVYLGFRDFWNSVFSLTTTQMLAMLLMVWTSSV 138
Query: 121 FKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLV 180
F R + V+ L+ A+A + LL+V A+ + T +L LV
Sbjct: 139 FNFWAGERRVALQYKSLVMVTLLVAIAKPLVGVLLVVYANDKV-------TARFLGLALV 191
Query: 181 AFLGSAEYRLLAPFR----FDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLA 236
+G + +R F R R L++++PL P+ LS ++S + R ++
Sbjct: 192 ELVGYTGLFFVQMYRGKTFFSRHFWRHALMFNIPLAPHYLSQIVLSSADRIMIANMVNTK 251
Query: 237 AAGLFTAA---SKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSA 293
+AG+++ A S++ L N+ S W Y ++ R A V G +
Sbjct: 252 SAGIYSLAYSLSQIMILFNVALSQTLSPWIYQKIKD----RRVADIAGVAYGALVLIAVV 307
Query: 294 AGLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVST 353
L+IA I + +A+ VP + ++ F F F + ++M+++
Sbjct: 308 NLLLIAFAPEIVSIFAPRAYAKATWIVPPIAMSVFFIFAFDLFAKFEFYYERTSLIMIAS 367
Query: 354 MMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVR----ARDLRRRID-LPMDRSRL 408
++GA++NV L L+P +G AG + Y + V R R ++D LP DR L
Sbjct: 368 VIGAVLNVALNYWLIPILGYEVAGYTTLICYVMYAAVHYWFMNRICREKLDTLPYDRKIL 427
Query: 409 TYQLALLSVMAVCTSFDGG 427
+ L+ +AV F GG
Sbjct: 428 G--MITLTFLAVGFLFLGG 444
>gi|153941483|ref|YP_001392372.1| putative polysaccharide transporter [Clostridium botulinum F str.
Langeland]
gi|152937379|gb|ABS42877.1| putative polysaccharide transporter [Clostridium botulinum F str.
Langeland]
Length = 474
Score = 69.7 bits (169), Expect = 4e-10, Method: Composition-based stats.
Identities = 87/390 (22%), Positives = 167/390 (42%), Gaps = 13/390 (3%)
Query: 11 FALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYR-FSID 69
F++G + +S +P+ T + E+G A + A+ I + LG+ ++ R F+ +
Sbjct: 15 FSMGPILSAIISFFTVPITTYFVVPTEFGKASMYTMALSISSMFIYLGMDQSFTREFNAE 74
Query: 70 DDVPKDELFAGSLVV-----LGGGVVCTGVACALGSALWNMEHAAAFFVLFCSV---CVF 121
+D K LF SL+V L GV + L + + +L S+ +
Sbjct: 75 ED--KKSLFWNSLIVPLIFSLILGVFYIILYKPLSMLMIDTVDRYIVVILALSLPFAVID 132
Query: 122 KATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLVA 181
+ L R R + L+ +I+ + VV L+ +G + + + +V
Sbjct: 133 RFNLLLIRMEEKARLYSLFNVISKVINVVILVPYLLYIDKSFKGIINAGFVSLFFMCIVE 192
Query: 182 FLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGLF 241
+ EY F+ ++AL+ +M + LPL+P + W ++ + + S GL+
Sbjct: 193 CFFTREY-WFTKFKLNKALINKMFKFGLPLIPAYVIVWFLNSMDKLAMRQWSTFEEIGLY 251
Query: 242 TAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAGLVIALN 301
+AA K+ ++++IV S F W + R + + F V + ++ + I L
Sbjct: 252 SAAFKIVAVVDIVKSAFCTFWTPTAFRWYEEKVKEENFMKV-SNMLMCFMNFMFVGIVLF 310
Query: 302 RPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNV 361
+ + +L A +A VP L+L ++ + ++ +++ A+VN
Sbjct: 311 KDLIIKLLDASYASSAIIVPFLLLLPIMYTVSESTCLGISFSRKTSYNIIVSLIAAVVNY 370
Query: 362 ILGVALVPFMGPWGAGLAGAVAYALVLVVR 391
IL LVP G GA +A +AY + VR
Sbjct: 371 ILNYLLVPKYGALGASIATGIAYTVFFWVR 400
>gi|154249271|ref|YP_001410096.1| polysaccharide biosynthesis protein [Fervidobacterium nodosum
Rt17-B1]
gi|154153207|gb|ABS60439.1| polysaccharide biosynthesis protein [Fervidobacterium nodosum
Rt17-B1]
Length = 479
Score = 69.3 bits (168), Expect = 5e-10, Method: Composition-based stats.
Identities = 90/440 (20%), Positives = 181/440 (41%), Gaps = 13/440 (2%)
Query: 2 RLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVE 61
R LL + F+LG +S + P+ T + E+G A + ++A I+L + LG
Sbjct: 6 RNLLKKYISFSLGTWFKALISFFITPITTWLINPEEFGKAAMFSTAYSIILLVSILGTPS 65
Query: 62 ALYRF-SIDDDVPKDELFAGSLV---VLGG--GVVCTGVACALGSALWNMEHAAAFFVLF 115
AL RF + K L SL+ +L +V ++ + L + A +L
Sbjct: 66 ALLRFFPQKSEEEKTILLWSSLITPIILSALISIVVFIFKSSINAFLVGTSVSNAHVILI 125
Query: 116 CSVC--VFKA-TTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTI 172
++ +F+ L R G F + +++ + L + L++
Sbjct: 126 ATLITGIFQTFNMNLVRTKGRGILFSAIDITQSISQIGFILLYALLVKRDFYALLYAQLF 185
Query: 173 GYLVGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWG 232
++ + +Y P + D+ L+ ++ Y PLV + L WW+++ R V+
Sbjct: 186 SNILALGIGMYFERDYWF--PIKIDKKLVFEVVQYGYPLVFSGLFWWILNWIDRVVLRLY 243
Query: 233 SGLAAAGLFTAASKMPSLINIVASVFQQAW-QYSTAREIDSPDRGAFFGSVLRGYSLATL 291
+ GL++AA K+ S +N++ S F W ++ + +P+ F L + T
Sbjct: 244 VDFSEIGLYSAAFKIISAMNLLTSGFSTLWYPFAYEQYEKNPENKIIFKRTLDYVAFLTF 303
Query: 292 SAAGLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMV 351
S AG V+ + + ++L + P L+L M I + +V
Sbjct: 304 S-AGFVLLSFKDVIFLLLAKTYRPAAAISPFLILNPVMITMAIVVARGIDFSKKTYWFIV 362
Query: 352 STMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQ 411
S A+ N+I L+P G GA ++ +++ +V + + ++ +P + ++ Y
Sbjct: 363 SNSTAAIFNLIGNFLLIPIFGAKGAAVSTGLSFIIVFAIESSVSKKLYPVPYNLRKVYYL 422
Query: 412 LALLSVMAVCTSFDGGSWLN 431
+ + A +F +L+
Sbjct: 423 VIIFIFSASFHTFSQNLFLS 442
>gi|28211855|ref|NP_782799.1| transporter [Clostridium tetani E88]
gi|28204297|gb|AAO36736.1| transporter [Clostridium tetani E88]
Length = 488
Score = 68.9 bits (167), Expect = 6e-10, Method: Composition-based stats.
Identities = 86/401 (21%), Positives = 167/401 (41%), Gaps = 31/401 (7%)
Query: 9 LVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYR-FS 67
+ F+LG + +S + +P+ T + E+G A + A+ I + LG+ +A R ++
Sbjct: 30 IAFSLGPIISACISFITVPITTHFVVPEEFGKAAMYTMALSISSLFIFLGMDQAFTREYN 89
Query: 68 IDDDVPKDELFAGSLVV---LGGGVVCTGVACALGSALWNMEHAAAFFVLFCSVC----- 119
DD K LF SL+V + V ++W E ++ +V
Sbjct: 90 TQDD--KKSLFWNSLIVPLIFSFFIGAIYVMFYKTISIWMFESLEKHVMIMLAVSLPFSI 147
Query: 120 VFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGL 179
+ + L R R + + +++ LA V+ ++ +G + + +V +
Sbjct: 148 IDRFNLLLIRMQERARIYSILNVVSKLANVIVLVPYVLYIDKSFKGIINANFFSLIVMCI 207
Query: 180 VAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAG 239
+ ++ L+ F+ ++ L+ +ML Y PLVP + WL++ + + + S L G
Sbjct: 208 IECFFVKDF-WLSKFKVNKDLIHKMLCYGFPLVPATIVSWLLNSMDKMAMRYWSTLHEIG 266
Query: 240 LFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATLSAAGLVIA 299
L+ AA K+ +++ IV F W + R + + S+ + + + I
Sbjct: 267 LYEAAFKIVAVVAIVQQAFCTFWTPTAFRWYEENVENEKYISI-SNMLMCFMVFIFIFIV 325
Query: 300 LNRPISRVMLQAEFAEGWRYVPLLML---------AATFGVMTIFFGTFYQALMNSRMLM 350
L R + +L +A VP L+ A T G+ + T+Y L+
Sbjct: 326 LFRDLIIKILSPNYANASAIVPFLLFYPIMYTVSEATTLGI-SFSKKTYYNILV------ 378
Query: 351 VSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVR 391
+++ +N IL VP G GA +A ++Y + VR
Sbjct: 379 --SIIATGINYILNYMFVPKYGAIGASVATGISYMVFFWVR 417
>gi|148381045|ref|YP_001255586.1| O antigen repeat unit flippase [Clostridium botulinum A str. ATCC
3502]
gi|153933011|ref|YP_001385416.1| putative polysaccharide transporter [Clostridium botulinum A str.
ATCC 19397]
gi|153935999|ref|YP_001388823.1| putative polysaccharide transporter [Clostridium botulinum A str.
Hall]
gi|148290529|emb|CAL84657.1| O antigen repeat unit flippase [Clostridium botulinum A str. ATCC
3502]
gi|152929055|gb|ABS34555.1| putative polysaccharide transporter [Clostridium botulinum A str.
ATCC 19397]
gi|152931913|gb|ABS37412.1| putative polysaccharide transporter [Clostridium botulinum A str.
Hall]
Length = 474
Score = 68.9 bits (167), Expect = 6e-10, Method: Composition-based stats.
Identities = 87/405 (21%), Positives = 172/405 (42%), Gaps = 43/405 (10%)
Query: 11 FALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYR-FSID 69
F++G + +S +P+ T + E+G A + A+ I + LG+ ++ R F+ +
Sbjct: 15 FSMGPILSAIISFFTVPITTYFVVPTEFGKASMYTMALSISSMFIFLGMDQSFTREFNSE 74
Query: 70 DDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAFFVLFCSVCVFKATT---- 125
+D K LF SL+V ++++ A + + + + V T
Sbjct: 75 ED--KKSLFWNSLIV---------------PLIFSLILGAFYIIYYKPLSVLMIDTVDRY 117
Query: 126 -----QLARGLGHVRRF-----------VLYGLINALAMVVSTYLL---LVRAHTGIEGY 166
L+ + RF LY ++N ++ V++ +L L+ +G
Sbjct: 118 IVVILALSLPFAVIDRFNLLLIRMEEKARLYSVLNVISKVINVVVLVPYLLYIDKSFKGI 177
Query: 167 LWSYTIGYLVGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGR 226
+ + + + +V L +Y +A F+ ++ L+ +M Y +PL+P + W ++ +
Sbjct: 178 INAGFVSLVFMCIVECLFIGDY-WIAKFKINKTLINKMFRYGIPLIPASVIVWFLNSMDK 236
Query: 227 YVVLWGSGLAAAGLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRGY 286
+ S GL++AA K+ ++++IV S F W + R + G F V
Sbjct: 237 IAMRQWSSFEEIGLYSAAFKIVTVVSIVQSAFCTFWTPTAFRWYEEKVEGEKFMKV-SNM 295
Query: 287 SLATLSAAGLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNS 346
+ ++ + I L R + +L A +A VP L+L ++ +
Sbjct: 296 LMCFMNFMFVGIVLFRDLIIKLLDASYASSAIIVPFLLLLPIMYTVSESTCLGISFSRKT 355
Query: 347 RMLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVR 391
++ +++ A+VN IL LVP G GA +A +AY + VR
Sbjct: 356 SYNIIVSLIAAVVNYILNYLLVPKYGALGASIATGIAYTVFFWVR 400
>gi|89891385|ref|ZP_01202891.1| capsular polysaccharide repeat unit transporter, polysacc_synt
family [Flavobacteria bacterium BBFL7]
gi|89516416|gb|EAS19077.1| capsular polysaccharide repeat unit transporter, polysacc_synt
family [Flavobacteria bacterium BBFL7]
Length = 424
Score = 68.9 bits (167), Expect = 6e-10, Method: Composition-based stats.
Identities = 93/405 (22%), Positives = 176/405 (43%), Gaps = 18/405 (4%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
L N ++ L A+ +L+P+ T+ L+ EYG ++ N+ I +P++ L +V +
Sbjct: 13 LFQNLGMYTLVNFINSAIPFLLLPILTSNLSEAEYGIVDIFNNLSFIFIPIIGLNIVSCV 72
Query: 64 YRFSIDDDVPKDELFA--GSLVVLGGGVVCTGVACALGSA---LWNMEHAAAFFVLFCSV 118
RF DD++ ++ + + +++ G ++ VA + S + N E + +L
Sbjct: 73 LRFYYDDNIDFNKFLSTVTTTLLICGTIIILAVAAFIYSTDVHIINEEIPNSVLLLALLF 132
Query: 119 CVFKATTQLARGLGH-VRRFVLYGLI----NALAMVVSTYLLLVRAHTGIEGYLWSYTIG 173
++ G+ R + YGL+ L + +S Y ++V G EG ++ +G
Sbjct: 133 AFSSQIIEIILGIFRATERPLNYGLVRIVKTGLDLSLSIYCVVV-LKLGWEGRIYP-AVG 190
Query: 174 YLVGGLVAFLGSAEYRLLAPFRF--DRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLW 231
VG L+A L R + LR YS PL+ + L+ +++S S R+++L
Sbjct: 191 --VGLLIALLILLYLFYAYNVRLVIKKDYLRIAFKYSTPLIFHSLAGYIISFSDRFIILE 248
Query: 232 GSGLAAAGLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFG-SVLRGYSLAT 290
GL+ A ++ +++ V + F QAW + D S Y
Sbjct: 249 YMSADDVGLYAVAYQVGMIMSFVNNSFNQAWTPYVFSILKGGDLSKIKQLSKFNYYYFLL 308
Query: 291 LSAAGLVIALNRP-ISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRML 349
+ L I L P I + +F ++ V ++L F M + ++ L
Sbjct: 309 MIVMALGIFLMVPLIYEWFIDGKFEVDYQIVFWVLLGYAFNGMYKILVNYLFYYKKTQKL 368
Query: 350 MVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARD 394
T+ A++N+ L + LVP MG GA ++ +A+ + + A D
Sbjct: 369 SYITIFSAVLNIGLCLYLVPRMGILGAAISTTIAFLSMFIFVAID 413
>gi|53711455|ref|YP_097447.1| polysaccharide biosynthesis protein [Bacteroides fragilis YCH46]
gi|60679725|ref|YP_209869.1| possible polysaccharide biosynthesis transmembrane protein
[Bacteroides fragilis NCTC 9343]
gi|52214320|dbj|BAD46913.1| polysaccharide biosynthesis protein [Bacteroides fragilis YCH46]
gi|60491159|emb|CAH05907.1| possible polysaccharide biosynthesis transmembrane protein
[Bacteroides fragilis NCTC 9343]
Length = 510
Score = 68.6 bits (166), Expect = 8e-10, Method: Composition-based stats.
Identities = 93/423 (21%), Positives = 183/423 (43%), Gaps = 39/423 (9%)
Query: 1 MRLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTA--GEYGTAELLNSAIEIVLPLLSLG 58
++ L+ +T ++ L + + ++ +L+PLYT L A G YG + + +++ LL+ G
Sbjct: 16 LKSLVKDTALYGLSSMVGRFLNYLLVPLYTAVLPAASGGYGVVTNVYAWAGLIMVLLTFG 75
Query: 59 VVEALYRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACA--LGSALWNMEHAAAFFVLFC 116
+ +RF+ + +++A SL+ +GG + + C L +E+ +
Sbjct: 76 METGFFRFANKSEEDPVKVYANSLISVGGISLIFAILCLTFLQPVSHLLEYGDHPDFIGM 135
Query: 117 SVCVFKATTQLARGLGHVR------RFVLYGLINALAMVVSTYLLLVRA---HTGIEGYL 167
+ V L ++R +FV ++ +A +V L+ H ++
Sbjct: 136 MIIVMALDAFLCIPFAYLRFKKRPIKFVAIKFVSIIANIVLNLFFLLLCPWLHEHFPAWV 195
Query: 168 -WSYTIGYLVG----------GLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLL 216
W Y YLVG L F E R A +R D+ LL+RML+YS P+ L
Sbjct: 196 DWFYNPTYLVGYIFVSNLITTCLQLFCLIPELRGFA-YRVDKQLLKRMLIYSFPI----L 250
Query: 217 SWWLVSVSGR------YVVLWG---SGLAAAGLFTAASKMPSLINIVASVFQQAWQYSTA 267
+ LV + + Y L+ GL G++ AA+K+ ++ + F+ A++
Sbjct: 251 IFGLVGILNQTVDKIIYPFLFADRQEGLVQLGIYGAATKIAMVMAMFTQAFRYAYEPFVF 310
Query: 268 REIDSPDRGAFFGSVLRGYSLATLSAAGLVIALNRPISRVMLQAEFAEGWRYVPLLMLAA 327
+ D + ++ Y L A LV+ + R M+ ++ G V +++ A
Sbjct: 311 GKQKEGDNRRMYAQAMK-YFLIFAMFAFLVVMFYLDLLRYMVAPDYWAGLSVVAIVIGAE 369
Query: 328 TFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALV 387
F + +Y+ + +R +++G ++ V + V LVP G + A YA++
Sbjct: 370 IFKGIYFNLSFWYKLIDETRWGAYFSIVGCVIIVGMNVMLVPTYGFVASAWASVAGYAVI 429
Query: 388 LVV 390
++
Sbjct: 430 TIL 432
>gi|73668656|ref|YP_304671.1| transporter [Methanosarcina barkeri str. Fusaro]
gi|72395818|gb|AAZ70091.1| transporter [Methanosarcina barkeri str. Fusaro]
Length = 488
Score = 68.2 bits (165), Expect = 1e-09, Method: Composition-based stats.
Identities = 83/385 (21%), Positives = 158/385 (41%), Gaps = 21/385 (5%)
Query: 22 SLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYRFSIDDDVPK---DELF 78
+ L+P+ T L +YG +N+ + ++ PL +G+ RF + PK + ++
Sbjct: 24 TFFLLPIITKTLGTYDYGLWAQINTTVSLISPLALMGLSMGFVRFLSSETEPKIIREAVY 83
Query: 79 AGSLVVLGGGVVCTGVACALGSALWNM---EHAAAFFVLFCSVCVFKATTQ-----LARG 130
+ V G++ + + L + A +F+ S+ + + R
Sbjct: 84 SILFFVTISGLLASFLLYTFAEPLATFGFKDPHATYFIQAGSLLILLTVIESISLFYFRI 143
Query: 131 LGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVGGLVAFLGSAEYRL 190
++ F L + LL + G L LV G + +
Sbjct: 144 FRQIQTFSYLTLFETFGKLFFILFLLKMGY----GLLGVIAATLLVQGSIFLISLLMIIS 199
Query: 191 LAPFRFDR-ALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGLFTAASKMPS 249
F R ++ L +SLPL PN L W+ S RY+V + GL + G+++AA + S
Sbjct: 200 QIGFVIPRFTYIKEYLQFSLPLTPNSLVRWITESSDRYMVTYFLGLRSVGVYSAACSIGS 259
Query: 250 LINIVASVFQQAWQYSTAREIDSPDRGAF---FGSVLRGYSLATLSAAGLVIALNRPISR 306
LI + S Q ++ D LR + L ++ A + AL +P+
Sbjct: 260 LIQLFVSSLQLILLPELSKLFDENKMDEVRICMSHSLRYFLLFSIPAVFGLSALAKPLLG 319
Query: 307 VMLQAEFAEGWRYVPLLMLAATF-GVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVILGV 365
++ +F GW +P++ + G+ IF T + ++ ++ A+ NV++ +
Sbjct: 320 ILTTDDFLSGWLVIPIIAFSGLLAGIFQIFVNTML-LIKQTKTATYINIVAAVSNVLINL 378
Query: 366 ALVPFMGPWGAGLAGAVAYALVLVV 390
L+P +G GA L+ +Y L+ V+
Sbjct: 379 LLIPSIGIVGASLSTLFSYFLMAVL 403
>gi|88804486|ref|ZP_01120006.1| polysaccharide biosynthesis protein [Robiginitalea biformata
HTCC2501]
gi|88785365|gb|EAR16534.1| polysaccharide biosynthesis protein [Robiginitalea biformata
HTCC2501]
Length = 488
Score = 67.4 bits (163), Expect = 2e-09, Method: Composition-based stats.
Identities = 97/447 (21%), Positives = 190/447 (42%), Gaps = 42/447 (9%)
Query: 2 RLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVE 61
R L T ++ L + + +S +L+PLYT + G YG L+ + I +L+ G+
Sbjct: 5 RKLFRQTFIYGLATVLPRMLSFLLVPLYTEVMPPGGYGEITLIFTFFAIFNVILAYGMET 64
Query: 62 ALYRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALG----SALWNMEHAAAFFVLFCS 117
A +RF +D K+ + + SL+ +GG + L + L N+E +F F
Sbjct: 65 AFFRFYSKED-DKERVLSTSLLSIGGSTLLFFTLGVLMREPLAGLMNIE--VNYFWYFII 121
Query: 118 VCVFKATTQLA----RGLGHVRRFVLYGLINALAMVVSTYLLLVR----AHTGIEG---- 165
+ A + R + + + N +V L+ A + EG
Sbjct: 122 ILSLDALVIVPFARLRAQESPGIYAMVKIANVTTNIVLNLFFLLALPRLAESETEGIWSA 181
Query: 166 -YLWSYTIGYLV------GGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSW 218
Y+ + I Y++ GL + S YR + FDRAL RRML Y LP++ +++
Sbjct: 182 LYVPDFQINYILIANTVASGLTLLILSGNYRHMR-LAFDRALWRRMLAYGLPVMVAGMAF 240
Query: 219 WLVSVSGRYV------VLWGSGLAAAGLFTAASKMPSLINIVASVFQ---QAWQYSTARE 269
+ V +Y+ + A G + A K+ + + A+ F+ + + +S A
Sbjct: 241 TINEVFDKYLLSELLPLTPEEAKAEVGKYAACYKLAMFMTLFATAFRLGIEPFFFSHATS 300
Query: 270 IDSPDRGAFFGSVLRGYSLATLSAAGLVIALNRPISRVML--QAEFAEGWRYVPLLMLAA 327
D+P R + + Y + S L + + + +V+ A + E VP+++L +
Sbjct: 301 -DNPKRAY---AQITNYFVVLGSVILLAVVVFADVLKVLFVRDAAYWEAMEVVPIVLLGS 356
Query: 328 TFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALV 387
+ +Y+ +R V +++GA++ + + V +P G + A AY +
Sbjct: 357 FCLGIYHNLSVWYKVTDRTRFGAVISLVGAVLTIAINVVFIPEYGYTASAWATLAAYGTM 416
Query: 388 LVVRARDLRRRIDLPMDRSRLTYQLAL 414
+V+ +R +P + ++ + L +
Sbjct: 417 MVLSYLLGKRYYPIPYNMRKIAFYLGI 443
>gi|40388615|gb|AAR85520.1| Wzx [Thermoanaerobacterium thermosaccharolyticum]
Length = 491
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 82/409 (20%), Positives = 179/409 (43%), Gaps = 31/409 (7%)
Query: 2 RLLLGNTLVFALGGLAVKAVSLVLMPLYTTAL-TAGEYGTAELLNSAIEIVLPLLSLGVV 60
+L + N V+ LG + K + L ++P+ T + YG ++ N I + +G+
Sbjct: 5 KLFIENFFVYGLGSVISKIIPLFMLPIVTRLMPNTFYYGLNDISNIVISFGSAIAIMGMY 64
Query: 61 EALYR--FSIDDDVPKDELFAG--------SLVVLGGGVVCTGVACALGSALW---NMEH 107
+A++R F DD K E+ + S++V ++ L + + N+
Sbjct: 65 DAMFRLFFDKDDLEFKKEVCSSAFYFVTLTSIIVFAILIIFKSFFSKLFFSSYKYSNLLV 124
Query: 108 AAAFFVLF-CSVCVFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGY 166
+AF +L S + A T++ + RR L + N LA ++S + + G+ Y
Sbjct: 125 ISAFTILIGTSNSIISAPTRMQ----NKRRIFL--VTNTLAPIISYCISVPLLIKGM--Y 176
Query: 167 LWSYTIGYLVGGLVA-----FLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLV 221
+ + I L+ L FL S+ + + + ++ L+ ML LPL+PN L +W+
Sbjct: 177 IVALPIAGLISSLSMLIIFYFLNSSWFSVK---KINKKLITDMLKIGLPLLPNFLIYWIF 233
Query: 222 SVSGRYVVLWGSGLAAAGLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGS 281
+ R ++ G G++ +++ S+ + + F WQY + S D+
Sbjct: 234 NSCDRLIIAKFLGNGEVGIYGIGARVASISQFIYTAFAGGWQYFAFSTMKSKDQVELTSR 293
Query: 282 VLRGYSLATLSAAGLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQ 341
+ + + ++ + +++ I ++ + ++ +G P L L+ ++ G +
Sbjct: 294 IFEYLGIISFTSTIFLTSVSDVIFKLFFKGDYVKGAIVFPYLFLSPLLLMLYQTIGNQFL 353
Query: 342 ALMNSRMLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVV 390
+ + + +GA+ N+++ LVP +G GA + + Y + + +
Sbjct: 354 VIKKTGPSAIFLSLGAITNILINYYLVPIIGIEGAAIGTLMGYIVSVFI 402
>gi|153158522|ref|ZP_01375303.2| polysaccharide biosynthesis protein [Campylobacter concisus 13826]
gi|112801992|gb|EAT99336.1| cytosol aminopeptidase [Campylobacter concisus 13826]
Length = 464
Score = 66.6 bits (161), Expect = 3e-09, Method: Composition-based stats.
Identities = 61/255 (23%), Positives = 110/255 (43%), Gaps = 5/255 (1%)
Query: 189 RLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLAAAGLFTAASKMP 248
R L FR + +LR++L + +P +P+ +++W+ S S R ++ S L G++T A +
Sbjct: 196 RNLLSFRINIGILRKLLSFGIPFLPSSIAYWIYSSSDRIMLERMSALEDLGVYTVAVSLS 255
Query: 249 SLINIVASVFQQAWQYSTAR--EIDSPDRGAFFGSVLRGYSLATLSAAGLVIALNRPISR 306
S++ IV + QAW + E D F+ L L +
Sbjct: 256 SVMAIVCNSIAQAWSPHAIQIYEEDQYAAKKFYVKFLNLLLFLFLFVMFFACIFGKDFIM 315
Query: 307 VMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVILGVA 366
+ ++ + + +LM + F V T + + + A VNV L
Sbjct: 316 IAFPYDYEKAFYPFLILMTSFGFQVTTQVTAIGISLSKKTMYFLYVSFFVAFVNVFLNFV 375
Query: 367 LVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSRLTYQLALLSVMAVCTSFDG 426
L+P+ G +GA A A++Y L+ ++ A +R L D + +L ++ C SF
Sbjct: 376 LIPYYGVYGAAFATAISYLLLTLIYAMISQRLFFLNYDYG-IILTFCILLLIVFCISF-- 432
Query: 427 GSWLNGAVWVCLILL 441
S+L V C I++
Sbjct: 433 LSFLYKIVVFCFIVI 447
>gi|77165939|ref|YP_344464.1| Polysaccharide biosynthesis protein [Nitrosococcus oceani ATCC
19707]
gi|76884253|gb|ABA58934.1| Polysaccharide biosynthesis protein [Nitrosococcus oceani ATCC
19707]
Length = 477
Score = 65.9 bits (159), Expect = 6e-09, Method: Composition-based stats.
Identities = 98/418 (23%), Positives = 178/418 (42%), Gaps = 27/418 (6%)
Query: 3 LLLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAI----EIVLPLLSLG 58
+LL ++ ++ L ++ + +YT L EYG L+ +A+ ++ L+LG
Sbjct: 1 MLLRHSALYTLARGLPGLINFAALAVYTRLLAPDEYGRYALVIAAVGLANAVLFQWLNLG 60
Query: 59 VVEALYRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALW------NMEHAAAFF 112
++ L R+ L AG L+V+ ++ +G LW M
Sbjct: 61 LLRYLARYRDCKPRFLSTLAAGYLMVVL-------LSAMVGLVLWGIWPEPEMRSLIGLG 113
Query: 113 VLFC-SVCVFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYT 171
+LF + +F A Q+ +R+ L + AL + +L G G LW T
Sbjct: 114 ILFLWAQALFDAHLQMTASQLTPQRYGLLAITKALTSLTLGSILAWWGF-GATGVLWGLT 172
Query: 172 IGYLVGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLW 231
G ++ + E+R L+P+ D+ L+ +++ Y LPL ++VS S R ++ W
Sbjct: 173 GGLILA--IVIWAREEWRHLSPYYVDKELMGQLISYGLPLTATFALTFVVSSSDRLLLGW 230
Query: 232 GSGLAAAGLFTAASKMP-SLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLR---GYS 287
G +AGL+ +P ++ ++ V A ++ GA + + G
Sbjct: 231 LQGSHSAGLYAVGYDLPHQILGVLMMVVHLAAYPLAVHALEQEGWGAAQVQLKKNAIGLL 290
Query: 288 LATLSAAGLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATF--GVMTIFFGTFYQALMN 345
L A +I L I++V+L EF + + + A+F G+ +F +Q
Sbjct: 291 CIALPATVGLILLAPNIAQVVLGIEFRKAAVALSFWIAMASFLAGIKAYYFDLAFQLGQR 350
Query: 346 SRMLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPM 403
+ + ++ A +N+IL + L+P +G GA A AY LV+ RR LP+
Sbjct: 351 TLAQVWIALVAATINLILNLWLIPKLGIMGAAYGTACAYLSALVLSVILGRRYFKLPI 408
>gi|153807205|ref|ZP_01959873.1| hypothetical protein BACCAC_01483 [Bacteroides caccae ATCC 43185]
gi|149130325|gb|EDM21535.1| hypothetical protein BACCAC_01483 [Bacteroides caccae ATCC 43185]
Length = 498
Score = 64.3 bits (155), Expect = 1e-08, Method: Composition-based stats.
Identities = 95/453 (20%), Positives = 197/453 (43%), Gaps = 39/453 (8%)
Query: 1 MRLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTA--GEYGTAELLNSAIEIVLPLLSLG 58
++ L +T ++ L + + ++ +L+PLYT L A G YG + + ++L LL+ G
Sbjct: 4 LKSLAKDTAIYGLSSIVGRFLNYMLVPLYTAVLPASSGGYGVVSNVYAFTALMLVLLTFG 63
Query: 59 VVEALYRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALG----SALWNMEHAAAFFVL 114
+ +RF+ +++A SL+ +GG + + C L + L + + F +
Sbjct: 64 METGFFRFANKSGEDPMKVYANSLLSVGGVSLIFVLLCLLFLQPIANLLDYGNHPEFIAM 123
Query: 115 FCSVCVFKATTQLARG-LGHVRRFVLYGLINALAMV----VSTYLLLVRAHTGIE---GY 166
V + + L + +R + + I L+++ ++ + LL+ +
Sbjct: 124 MAMVVALDSFQCIPFAYLRYKKRPIKFAAIKLLSIIGGIGLNLFFLLLCPWLNVHFPATV 183
Query: 167 LWSYTIGYLVG----------GLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLL 216
W Y YLVG + F E R A ++ D+ALL+RM+VYS P++ L
Sbjct: 184 SWFYDPDYLVGYIFISNLIISAVQMFFFIPELRGFA-YKLDKALLKRMVVYSFPVLILGL 242
Query: 217 SWWLVSVSGRYVVLW-----GSGLAAAGLFTAASKMPSLINIVASVFQQAWQYSTAREID 271
L + + + GL G++ A SK I +V ++F QA++Y+ +
Sbjct: 243 VGILNQTVDKMIYPFLFEDRQEGLVQLGIYAATSK----IAMVMAMFTQAFRYAYEPFVF 298
Query: 272 SPDRGA----FFGSVLRGYSLATLSAAGLVIALNRPISRVMLQAEFAEGWRYVPLLMLAA 327
DR + + ++ + + +L A L + + R ++ + EG V ++MLA
Sbjct: 299 GKDREGDNRKMYAAAMKYFLIFSL-LAFLAVMFYLDLLRYLVARGYWEGLGVVAVVMLAE 357
Query: 328 TFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALV 387
+ +Y+ +R +++G + V++ + VP G + A Y ++
Sbjct: 358 ICKGIYFNLSFWYKLTDETRWGAYFSLIGCAIIVVMNIVFVPVYGYIASAWASVAGYGVI 417
Query: 388 LVVRARDLRRRIDLPMDRSRLTYQLALLSVMAV 420
+++ +++ + D L + L +V+ V
Sbjct: 418 MLLSYWMGQKKYPIQYDLKSLGLYILLAAVLYV 450
>gi|93005451|ref|YP_579888.1| polysaccharide biosynthesis protein [Psychrobacter cryohalolentis
K5]
gi|92393129|gb|ABE74404.1| polysaccharide biosynthesis protein [Psychrobacter cryohalolentis
K5]
Length = 475
Score = 63.5 bits (153), Expect = 3e-08, Method: Composition-based stats.
Identities = 101/433 (23%), Positives = 185/433 (42%), Gaps = 21/433 (4%)
Query: 1 MRLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVV 60
++ L +++ + L L + +S L+PLYT LT +YG+ +L + I+ ++L V
Sbjct: 2 LKRLFKDSVTYTLPSLISRGMSFFLIPLYTKVLTTADYGSFDLFLIFVNIINLTIALEVS 61
Query: 61 EALYR-FSIDDDVPKDELFAGSLVVLGGGVVCTGVACAL---------GSALWNMEHAAA 110
+ L R F+ +D+ L+A S V L N+E+A
Sbjct: 62 QGLARYFTSEDNTNSKVLYASSAFWFTFAVYTIFSVLLLIFHKKLSFMVMGQLNVENAFI 121
Query: 111 FFVLFC-SVCVFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWS 169
+L+ S +F R + + + +I ++A + L G+EG L+
Sbjct: 122 LGILYIWSNGLFYLIQNQFRWELRSKEYAVISIIMSVATAFVSVLAAYFLDYGLEGLLFG 181
Query: 170 YTIGYLVGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVV 229
IG L+GG+ G R FRF L+ MLV+S+PLV + ++ WL R ++
Sbjct: 182 LLIGNLIGGI---FGLWLLRNTFRFRFSLIKLKEMLVFSIPLVFSGVAVWLSLYIDRIMI 238
Query: 230 LWGSGLAAAGLFTAASKMPSLINIVASVFQQAWQ---YSTAREIDSPDRGAFFGSVLRGY 286
++ GL+ ++ S+ + FQ A Y E +P + + R +
Sbjct: 239 KHLMTISDVGLYGLGYRVSSIAGLAMVGFQGALTPLIYKHYHEKTTPPQ---LARIFRYF 295
Query: 287 SLATLSAAGLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNS 346
L L+ I +++ EF + + L+ A M IF A S
Sbjct: 296 VTLILLIFILISFFAENILHLIVTEEFYYSYSVIIFLVPAIILSNMYIFAPGMAIAKKTS 355
Query: 347 RMLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRS 406
++V+ ++GA +N++L L+P G GA +A ++Y +V + ++ +P D
Sbjct: 356 YFVIVN-IIGATLNIVLNYVLIPIFGIQGAAVATLISYFIVFTIYMILSQKIYYVPHDFK 414
Query: 407 RLTYQLALLSVMA 419
+ L+S++A
Sbjct: 415 PIIASTLLVSLLA 427
>gi|156111618|gb|EDO13363.1| hypothetical protein BACOVA_00897 [Bacteroides ovatus ATCC 8483]
Length = 498
Score = 63.2 bits (152), Expect = 3e-08, Method: Composition-based stats.
Identities = 92/422 (21%), Positives = 182/422 (43%), Gaps = 37/422 (8%)
Query: 1 MRLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTA--GEYGTAELLNSAIEIVLPLLSLG 58
++ L +T ++ L + + ++ +L+PLYT L A G YG + + ++L LL+ G
Sbjct: 4 LKSLAKDTAIYGLSSIVGRFLNYMLVPLYTAVLPASTGGYGVVSNVYAFTALMLVLLTFG 63
Query: 59 VVEALYRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALG----SALWNMEHAAAFFVL 114
+ +RF+ +++A SL+ +GG + C L S L + F +
Sbjct: 64 METGFFRFANKSGEDPMKVYANSLLSVGGVSLIFVFLCLLFLQPISNLLDYGDHPEFIAM 123
Query: 115 FCSVCVFKATTQLARG-LGHVRRFVLYGLINALAMV----VSTYLLLVRAHTGIE---GY 166
V + + L + +R + + I L++V ++ + LLV +
Sbjct: 124 MAVVVALDSFQCIPFAYLRYKKRPIKFAAIKLLSIVGGIGLNLFFLLVCPWLNVHCPSTI 183
Query: 167 LWSYTIGYLVGGLVA---FLGSAEYRLLAP------FRFDRALLRRMLVYSLPLVPNLLS 217
W Y YLVG + + + P ++ DR LL+RM+VYS P++ L
Sbjct: 184 SWFYDPDYLVGYIFISNLIISVVQMFFFIPELTGFAYKLDRVLLKRMVVYSFPVLILGLV 243
Query: 218 WWLVSVSGRYVVLW-----GSGLAAAGLFTAASKMPSLINIVASVFQQAWQYSTAREIDS 272
L + + + GL G++ A SK I +V ++F QA++Y+ +
Sbjct: 244 GILNQTVDKMIYPFLFEDRQEGLVQLGIYAATSK----IAMVMAMFTQAFRYAYEPFVFG 299
Query: 273 PDRGA----FFGSVLRGYSLATLSAAGLVIALNRPISRVMLQAEFAEGWRYVPLLMLAAT 328
DR + + ++ + + +L A L + + R ++ + EG V ++MLA
Sbjct: 300 KDREGDNRKMYAAAMKYFLIFSL-LAFLAVMFYLDLLRYLVARGYWEGLGVVAIVMLAEI 358
Query: 329 FGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVL 388
+ +Y+ + +++G ++ V+L + VP G + A YA++L
Sbjct: 359 CKGIYFNLSFWYKLTDKTYWGAYFSVIGCVIIVVLNILFVPVYGYLASAWASVAGYAVIL 418
Query: 389 VV 390
++
Sbjct: 419 LL 420
>gi|15643386|ref|NP_228430.1| lipopolysaccharide biosynthesis protein [Thermotoga maritima MSB8]
gi|4981141|gb|AAD35705.1|AE001736_3 lipopolysaccharide biosynthesis protein [Thermotoga maritima MSB8]
Length = 479
Score = 62.8 bits (151), Expect = 5e-08, Method: Composition-based stats.
Identities = 89/437 (20%), Positives = 179/437 (40%), Gaps = 21/437 (4%)
Query: 2 RLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVE 61
R LL + F+LG +S P+ T + E+G A + ++ I+L + LG
Sbjct: 6 RDLLKKYISFSLGTWFRALISFFTTPITTWMINPEEFGKATMFSTVYSILLLVALLGTPN 65
Query: 62 ALYRFSIDDDVPKDELFAGSLVVLGGGVVCTGVA------CALGSALWNMEHAAAFFVLF 115
+ RF + + S V+ + ++ + L + A +L
Sbjct: 66 SFLRFFPQKSEEEKPVLLWSSVMPPVLLSILLSLVVFIFRSSINTFLVGTPDSKAHVILI 125
Query: 116 CSVC--VFKA-TTQLARGLGHVRRFVLYGLINALAMV--VSTYLLLVRA--HTGIEGYLW 168
++ +F+ L R G F +I +L+ + + Y LLV +T + L+
Sbjct: 126 ATLITGIFQTFNLNLVRSKGRAILFSAIQVIQSLSQMGFIVLYALLVSRDFYTLLYAQLF 185
Query: 169 SYTIGYLVGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYV 228
S + VG L E P + D+ L+ ++ Y P V + L WL++ R+V
Sbjct: 186 SNVVALAVGMLF------ERSYWFPVKIDKKLVFEVIKYGYPFVFSGLLLWLLNWIDRFV 239
Query: 229 VLWGSGLAAAGLFTAASKMPSLINIVASVFQQAW-QYSTAREIDSPDRGAFFGSVLRGYS 287
+ + + GL++AA K+ S ++++ + F W ++ + +P+ F L +
Sbjct: 240 LRLYTSFSDIGLYSAAFKIVSAMSLLTTGFSTLWYPFAYEQYEKNPEDKMIFKRALDYMA 299
Query: 288 LATLSAAGLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSR 347
SA L+++ + + ++L + P L+L M I +
Sbjct: 300 FLVFSAGFLLLSF-KDVIFLLLARSYRPAAAISPFLILNPVMITMAIVVARGIDFSKKTY 358
Query: 348 MLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSR 407
+VS A+ N+ LVP +G GA ++ +++ +V + + ++ +P D +
Sbjct: 359 WFIVSDGAAALFNLAGNFLLVPTLGAKGAAVSTGLSFIIVFAIESSVSKKLYPVPYDLKK 418
Query: 408 LTYQLALLSVMAVCTSF 424
+ ++L AV +F
Sbjct: 419 VYCLVSLFVFSAVLHTF 435
>gi|29348696|ref|NP_812199.1| polysaccharide biosynthesis protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340601|gb|AAO78393.1| polysaccharide biosynthesis protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 482
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 82/410 (20%), Positives = 188/410 (45%), Gaps = 29/410 (7%)
Query: 1 MRLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTA--GEYGTAELLNSAIEIVLPLLSLG 58
++ L +T ++ + K ++ L+P+YT +L A G YG + + + ++L +L+ G
Sbjct: 4 LKSLAKDTAIYGASSIIGKFLNYFLVPIYTFSLPAASGGYGVITKMYAIVALLLVVLTFG 63
Query: 59 VVEALYRFSIDDDVPKDELFAGSLVVLGG-----GVVCTGVACALGSALWNMEHAAAFFV 113
+ +RF+ D ++++ SL+++GG ++C + + EH +
Sbjct: 64 METGFFRFANKGDDDPKKVYSVSLLMVGGVSLLFLLLCLVFLNPIAGLMGYSEHPWYLGM 123
Query: 114 LFCSVCVFKATTQLARGLGHVRRFVLYGLINALAMVVST-----YLLLVRAHTGIEGYLW 168
+ + + L + +R + + + L + VS Y ++++ +L
Sbjct: 124 MLIVLAMDAIQAIPFAYLRYKKRPIKFAGLKLLFIFVSILLNILYFVVMKGDDVGVAFLI 183
Query: 169 SYTIGYLVGGLVAFLG-SAEYRLLAPFRF--DRALLRRMLVYSLPLVPNLLSWWLVSVSG 225
+ L+ V LG E+R FR+ DR L++RML YS P++ ++ + V
Sbjct: 184 N-----LICSFVVMLGLIPEFR---EFRYCPDRLLMKRMLYYSFPILILGVAGIVNQVGD 235
Query: 226 RYVV--LWGSGLAA---AGLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFG 280
+ + ++ + A G+++AA+K+ ++ ++ F+ A++ + D +
Sbjct: 236 KIIFPFVYPDKVEADVQLGIYSAATKIAMIMAMITQAFRFAYEPFVFGKSKDKDNKRIYA 295
Query: 281 SVLRGYSLATLSAAGLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFY 340
++ + + TL A L + I R ++ ++ EG R VP++M+A F + +Y
Sbjct: 296 QAMKFFIIFTL-LAFLAVMFYLDILRYIIAEDYWEGLRVVPIVMIAEMFMGIYFNLSFWY 354
Query: 341 QALMNSRMLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVV 390
+ ++R +++ + V++ + L+P G AG YA+ +++
Sbjct: 355 KLTDDTRWGAYFSIIACLTVVLMNIFLIPVYGYVACAWAGFTGYAIAMLL 404
>gi|110598884|ref|ZP_01387135.1| Polysaccharide biosynthesis protein [Chlorobium ferrooxidans DSM
13031]
gi|110339497|gb|EAT58021.1| Polysaccharide biosynthesis protein [Chlorobium ferrooxidans DSM
13031]
Length = 487
Score = 62.4 bits (150), Expect = 6e-08, Method: Composition-based stats.
Identities = 97/428 (22%), Positives = 181/428 (42%), Gaps = 44/428 (10%)
Query: 1 MRLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVV 60
++LL +T+++ + + ++++ +L+PLY LT + G ++ + I + L S G+
Sbjct: 5 LKLLFKDTVIYGVSSILARSLNYLLVPLYANKLTTFDNGIQTIVYANIALANVLFSYGLE 64
Query: 61 EALYRFSIDD-DVPKDE--LFAGSLVVLGGGVVCTGVACALGSALWNME-------HAAA 110
+ + + D + +DE LF+ + + L T AL AL+ E A+
Sbjct: 65 TSYLKVASDSTNRERDETRLFSTAFICL----FLTSTLFALLIALFAPEISRLIGLSASD 120
Query: 111 FFVLFCSVCVFKATTQLARGLGHVR------RFVLYGLINALAMVVSTYLLLVRAHTGIE 164
F + + + T L +R +F + + +A+V+S LL+++ + G+
Sbjct: 121 FPFIRYAALILWLDTLLVIPFAELRLKRKALQFAMARVGGVIAVVISALLLILQFNAGLH 180
Query: 165 GYLWSYTIGYLVGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVS 224
G + G LV + ++RL F LR ML LP VP ++ L+ +
Sbjct: 181 GVFIAEAFGSLVSLIFVVPVFQQFRLF----FSTLQLREMLRIGLPYVPTGIAGLLIHLI 236
Query: 225 GRYVV----------LWGSGLAAA---GLFTAASKMPSLINIVASVFQQAWQYSTAREID 271
R ++ ++G G A+ G++ + ++ + VF+ AWQ +
Sbjct: 237 DRNILIRISPAEIERIYGKGFEASDIVGIYGRIAAFGVVLQLFIQVFRFAWQPFFLQHSK 296
Query: 272 SPDRGAFFGSVLRGYSLAT----LSAAGLVIALNR---PISRVMLQAEFAEGWRYVPLLM 324
PD F +L +L T L+A+ V L R S +L + G +P +
Sbjct: 297 DPDAKQLFRHILSISTLFTMFLALAASLFVPDLVRYHYGGSFYLLPPRYWIGLSVLPWIF 356
Query: 325 LAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAY 384
L+ F +++ N+R L V T GA V + L+P G GA A
Sbjct: 357 LSYVFDMVSTNLSAGLLITGNTRYLPVVTFSGAAVTALCCWWLIPLGGMDGAAYAIVAGT 416
Query: 385 ALVLVVRA 392
++ +V A
Sbjct: 417 FMMCLVMA 424
>gi|148976924|ref|ZP_01813579.1| polysaccharide biosynthesis [Vibrionales bacterium SWAT-3]
gi|145963798|gb|EDK29058.1| polysaccharide biosynthesis [Vibrionales bacterium SWAT-3]
Length = 442
Score = 62.4 bits (150), Expect = 7e-08, Method: Composition-based stats.
Identities = 64/288 (22%), Positives = 133/288 (46%), Gaps = 20/288 (6%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
L+ NT+ + LG +A + V+L+L+P+YT + YG+ EL+ + I+ ++ SL + A
Sbjct: 6 LIKNTITYGLGVVASRGVALILLPIYTAYFSTERYGSIELITTLIQFLIVFGSLQIETAY 65
Query: 64 YRFSIDDDVPKDELFAGSLV---VLGGGVVCTGVACALGSALW---NMEHAAAFFVLFCS 117
R+ ++ + + + LV L ++ A + + + + E + ++LF
Sbjct: 66 QRYYYSEESNNNLMVSLCLVSIFSLCSSLLAMFFAEEISTYITGGIDKEVIKSIYLLFGL 125
Query: 118 VCVFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLVG 177
+ + T + L + R + + ++ + ++VS + H I ++ +G L+G
Sbjct: 126 IMLTNVNTIIQIELRNSDRLIAFNVLTFIQVIVSAIYTIGALHY-ISVSIYHVFLGQLLG 184
Query: 178 GLVAFLGSAEY---RLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSG 234
+V FL S + +L++ F+ + + Y+LP VP + + S R +VL
Sbjct: 185 LIVYFLFSLRFFYSKLVSQLSFNVSTAWEIFQYALPQVPARILSFANIYSNRVLVLIFLS 244
Query: 235 LAAAGLFTAASKMPSLINIVASVFQQAW----------QYSTAREIDS 272
G+ + A K+ S+ ++ F +W Q T REI++
Sbjct: 245 STDVGILSVAIKIASISMLIHQAFVMSWGPYVFKKDFTQKQTLREIEN 292
>gi|83645504|ref|YP_433939.1| Membrane protein involved in the export of O-antigen and teichoic
acid [Hahella chejuensis KCTC 2396]
gi|83633547|gb|ABC29514.1| Membrane protein involved in the export of O-antigen and teichoic
acid [Hahella chejuensis KCTC 2396]
Length = 485
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 84/431 (19%), Positives = 179/431 (41%), Gaps = 17/431 (3%)
Query: 1 MRLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVV 60
++ L ++L++ + +A S++L+PLYT L+ YG ELL+ ++ V
Sbjct: 6 VKKLFSDSLIYGVAIIARSLSSIILLPLYTRYLSTEGYGVIELLSMMVDFTAIFFGARVG 65
Query: 61 EALYRFS--IDDDVPKDELFAGSLVVLG-----GGVVCTGVACALGSALWNMEHAAAFFV 113
++ R+ +++ K E+F+ S +L G ++ + ++G L+ + A +
Sbjct: 66 QSFLRYYGLANNEKEKSEVFSTSFSLLSFTHLVGALLLILFSSSIGMLLFGGQEYKAVLI 125
Query: 114 LFCSVCVFKATTQLA----RGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWS 169
+F +F +++ R G + + L + + LV G+ G +
Sbjct: 126 VFSLNLLFGGMSEIPLAWLRANGKAASVLFFSLTKLVLQISLCVGFLVYLEWGVMG---A 182
Query: 170 YTIGYLVGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVV 229
L G +A L + + ++++ +S P++ +S + ++ R+ +
Sbjct: 183 ALASVLAQGTIAILLMIYCVKSNKIILSKEIAKKLVGFSWPIILAAVSMFFITYGDRFFI 242
Query: 230 LWGSGLAAAGLFTAASKMPSLINIVA-SVFQQAWQYSTAREIDSPDRGAFFGSVLRGYS- 287
L+ G++ A K ++ + FQ W+ R + F ++ S
Sbjct: 243 RTYLTLSDVGIYALAYKFGFILYAIGWQPFQSMWEAERYRIYREESQHYLFPAIFSFMSV 302
Query: 288 LATLSAAGLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSR 347
L A GL + + + ++M EF YVP+++L F T + N++
Sbjct: 303 LLVFMAFGLSVWIGYFL-QIMSDKEFWPAAEYVPIIILGYVFLSWTGYCNLGIFTSGNTK 361
Query: 348 MLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSR 407
+ VS+M A+ ++ +P G GA + A+A+ + V R + R D+ ++
Sbjct: 362 VFGVSSMFTALFCLVAYWFSIPQYGLMGAAVVTALAFFVRFFVIHRLAKARFDMKLNWYP 421
Query: 408 LTYQLALLSVM 418
L L S +
Sbjct: 422 AISSLCLASAL 432
>gi|149375176|ref|ZP_01892948.1| Membrane protein involved in the export of O-antigen and teichoic
acid [Marinobacter algicola DG893]
gi|149360540|gb|EDM48992.1| Membrane protein involved in the export of O-antigen and teichoic
acid [Marinobacter algicola DG893]
Length = 487
Score = 62.0 bits (149), Expect = 8e-08, Method: Composition-based stats.
Identities = 87/398 (21%), Positives = 177/398 (44%), Gaps = 27/398 (6%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
++ ++ ++A+G ++ + V +++P+YT LT +YG LL + + +L + +A+
Sbjct: 7 MVKHSTIYAVGNISRQLVGFIMLPIYTHYLTPADYGVIGLLVFLVSLFEVVLGGHMFQAV 66
Query: 64 YRFSIDDDVP--KDELFAGSLVVLG-----GGVVCTGVACALGSALWNMEHAAAFFVLFC 116
+F ++ K+ + + +L+V ++ + L ++ E + + ++F
Sbjct: 67 PKFYHQEETKLLKNSVVSTALLVTSFFSGMASILMASFSGPLAEVVFGKEEYSIYIIIFS 126
Query: 117 SVCVFKATTQLARGLGHVR------RFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSY 170
++ V A Q GL ++R + + +I + L +V G+ G S
Sbjct: 127 ALIVTHALEQY--GLTYLRIVKKPWTYFNFNMIKLGLQLGLNILTIVVLDWGLMGLALSS 184
Query: 171 TIGYLVGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVL 230
I ++ L A L Y+ F +A+ +L +S PL + L + S RY +
Sbjct: 185 LISSVIIAL-ALLIYTVYK--TGFLVRKAIAIYILKFSWPLWISGLIGLYIGSSNRYFIR 241
Query: 231 WGSGLAAAGLFTAASKMPSLINIVASV-FQQAWQ---YSTAREIDS-PDRGAFFGSVLRG 285
S L GLF A+K S+++++ + F Q WQ + A+ + PD F +
Sbjct: 242 IFSSLDEVGLFELAAKFGSIVSVLIWLPFSQYWQTERFEIAKLKNPYPDYSMAFRMIT-- 299
Query: 286 YSLATLSAAGLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMN 345
+L ++ G+ I + I +M F ++ +P L++A F +TIF +
Sbjct: 300 -ALLVIAGVGVSI-FSGVIINIMSDELFHTAYKAIPYLVIAGIFQCLTIFNNFSFMLTDR 357
Query: 346 SRMLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVA 383
+ + + + A+ I L+PF G GA +A A+
Sbjct: 358 TFEITKNNAITAIAISIFYFILIPFYGFVGASIAFALG 395
>gi|156861962|gb|EDO55393.1| hypothetical protein BACUNI_01065 [Bacteroides uniformis ATCC 8492]
Length = 488
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 82/407 (20%), Positives = 180/407 (44%), Gaps = 23/407 (5%)
Query: 1 MRLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTA--GEYGTAELLNSAIEIVLPLLSLG 58
++ L T ++ L + + ++ +L+P+YT A++A G YG + + + ++L LL+ G
Sbjct: 12 LKSLAKETAIYGLSSIVGRFLNYLLVPVYTMAMSAQSGGYGVVTNVYAWVALMLVLLTCG 71
Query: 59 VVEALYRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNM--------EHAAA 110
+ +RF+ + +++ +L+ + G + ALG N +H
Sbjct: 72 METGFFRFANKGEDDPMRVYSTTLLSVSVGAL---TFLALGLLFLNPIAGWLEYGDHPWY 128
Query: 111 FFVLFCSVCVFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEG--YLW 168
++ V + + L + +R + + + L + ++ L L+ + G++G +
Sbjct: 129 VGMMMIVVAMDAIQSIPFAYLRYKKRPIKFAALKLLFIFLNIALNLLY-YVGMKGDDVGY 187
Query: 169 SYTIGYLVGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYV 228
++ + + E R A + DR LL+RML Y LPL+ L+ L V+ + +
Sbjct: 188 AFLFNLICTSTIMLCMIPELRGFA-YVLDRKLLKRMLAYCLPLLVLGLAGILNQVADKII 246
Query: 229 VLW-----GSGLAAAGLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVL 283
+ G++ AASK+ ++ ++ F+ A++ + D + +
Sbjct: 247 FPFVYPDEAEASVQLGIYGAASKIAMVMAMLTQAFRYAYEPFVFGKSRDKDNKQMYAQAM 306
Query: 284 RGYSLATLSAAGLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQAL 343
+ + + TL A L + I R ++ ++ G R VP++M A F + +Y+ +
Sbjct: 307 KFFIIFTL-LAFLAVMFYLDILRHIIGRDYWPGLRVVPIVMAAEIFMGIYFNLSFWYKLI 365
Query: 344 MNSRMLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVV 390
+R +++G V V++ V L+P AG Y + +++
Sbjct: 366 DETRWGAYFSLIGCTVLVVMNVLLIPRYSYMACAWAGFCGYGIAMLL 412
>gi|37725468|gb|AAO60454.1| Wzx [Streptococcus pneumoniae]
Length = 167
Score = 60.8 bits (146), Expect = 2e-07, Method: Composition-based stats.
Identities = 47/162 (29%), Positives = 84/162 (51%), Gaps = 12/162 (7%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
L NT +FAL + K + +L+P+YT LT EYG +L+ + I++ +P+L+L + EA+
Sbjct: 7 LAKNTGIFALANFSSKILIFLLVPIYTRVLTTTEYGFYDLVYTTIQLFVPILTLNISEAV 66
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNM--------EHAAAFFVLF 115
RF + D V K +F S+ VL + +A AL + N+ +++ FV+F
Sbjct: 67 MRFLMKDGVSKKSVF--SIAVL--DIFIGSIAFALLLLVNNLFSLSDLISQYSIYIFVIF 122
Query: 116 CSVCVFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLV 157
+ Q ++G+ + + G+I+ M+ +LLV
Sbjct: 123 VFYTLNNFLIQFSKGIDKIGVTAISGIISTAXMLAMNVILLV 164
>gi|119356242|ref|YP_910886.1| polysaccharide biosynthesis protein [Chlorobium phaeobacteroides
DSM 266]
gi|119353591|gb|ABL64462.1| polysaccharide biosynthesis protein [Chlorobium phaeobacteroides
DSM 266]
Length = 504
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 92/413 (22%), Positives = 172/413 (41%), Gaps = 40/413 (9%)
Query: 1 MRLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVV 60
++LL +T+++ + ++++ +L+PLY L+ + G L+ + I + L S G+
Sbjct: 5 LKLLARDTVIYGASTILSRSLNYLLVPLYANKLSTFDNGVQTLVYANIALANVLFSYGL- 63
Query: 61 EALYRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALG--------SALWNMEHAAAFF 112
E Y S D + + + G ++ T +L S L M A F
Sbjct: 64 ETAYLKSASDSLREGKDGKGFFSTAFFSLLVTATLFSLIIVFFSADISVLLGMTPAEYAF 123
Query: 113 VLFCSVCVFKATTQLARGLGHVR------RFVLYGLINALAMVVSTYLLLVRAHTGIEGY 166
V + ++ ++ T L +R +F + + +A+V++ +L++ H G++G
Sbjct: 124 VRYAAIILWIDTI-LVIPFAELRLKRKAVQFAIARVTGVVAVVIAAMILVIPFHAGLQGA 182
Query: 167 LWSYTIGYLVGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGR 226
+ IG V GL+ F + L PF F R ++ LP VP ++ L+ + R
Sbjct: 183 FIANIIGSAVSGLMVF---PVFMQLRPF-FSFHQFRELIQIGLPYVPAGIAGLLIHLIDR 238
Query: 227 YVVL----------WGSGLAAA---GLFTAASKMPSLINIVASVFQQAWQYSTAREIDSP 273
+++ +G G A+ G++ ++ + VF+ AWQ + P
Sbjct: 239 NILIRISSSDIERIYGEGYVASDILGIYGRVVAFGIILQLFIQVFRFAWQPFFLQHASDP 298
Query: 274 DRGAFFGSVLRGYSLATLSAAGLVIALNRPISRV-------MLQAEFAEGWRYVPLLMLA 326
D F VL +L T+ A + R +L + G +P + L+
Sbjct: 299 DAKRLFRYVLSISTLFTMVLALASTFFVPDLVRYHYADRFYLLPPRYWIGLSVLPWIFLS 358
Query: 327 ATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVILGVALVPFMGPWGAGLA 379
F +++ N+R L + T GA+V ++ ALVP G GA A
Sbjct: 359 YVFDMISTNLTAGLLITGNTRSLPMVTFSGALVTSLVCWALVPVNGMEGAAYA 411
>gi|37725462|gb|AAO60450.1| Wzx [Streptococcus pneumoniae]
gi|37725465|gb|AAO60452.1| Wzx [Streptococcus pneumoniae]
gi|37725471|gb|AAO60456.1| Wzx [Streptococcus pneumoniae]
gi|37725474|gb|AAO60458.1| Wzx [Streptococcus pneumoniae]
gi|37725477|gb|AAO60460.1| Wzx [Streptococcus pneumoniae]
Length = 167
Score = 60.5 bits (145), Expect = 3e-07, Method: Composition-based stats.
Identities = 47/162 (29%), Positives = 84/162 (51%), Gaps = 12/162 (7%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
L NT +FAL + K + +L+P+YT LT EYG +L+ + I++ +P+L+L + EA+
Sbjct: 7 LAKNTGIFALANFSSKILIFLLVPIYTRVLTTTEYGFYDLVYTTIQLFVPILTLNISEAV 66
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNM--------EHAAAFFVLF 115
RF + D V K +F S+ VL + +A AL + N+ +++ FV+F
Sbjct: 67 MRFLMKDGVSKKSVF--SIAVL--DIFIGSIAFALLLLVNNLFSLSDLISQYSIYIFVIF 122
Query: 116 CSVCVFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLV 157
+ Q ++G+ + + G+I+ M+ +LLV
Sbjct: 123 VFYTLNNFLIQFSKGIDKIGVTAISGVISTAVMLAMNVILLV 164
>gi|78188447|ref|YP_378785.1| polysaccharide biosynthesis protein [Chlorobium chlorochromatii
CaD3]
gi|78170646|gb|ABB27742.1| polysaccharide biosynthesis protein [Chlorobium chlorochromatii
CaD3]
Length = 511
Score = 60.1 bits (144), Expect = 3e-07, Method: Composition-based stats.
Identities = 100/425 (23%), Positives = 179/425 (42%), Gaps = 42/425 (9%)
Query: 1 MRLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVV 60
++LL +T+++ + ++++ VL+P+Y L+ E G ++ + I + L S G+
Sbjct: 7 LKLLFKDTVIYGASTILARSLNYVLVPVYANTLSTFENGIQTIIYANIALANVLFSYGLE 66
Query: 61 EALYRFSIDDDVPKDE----LFAGSLVVLGGGVVCTGVACALGS----ALWNMEHAAAFF 112
+ + + D E LF+ +++ L + L + AL ++ AA F
Sbjct: 67 TSYLKVAADTHREGSEGEKPLFSTAVLTLLATSTLFALLIVLLAPWIGALVGLDSGAAPF 126
Query: 113 VLFCSVCVFKATTQLARGLGHVR------RFVLYGLINALAMVVSTYLLLVRAHTGIEGY 166
V + ++ ++ T L +R F L+ +A+V+ L +V G+ G
Sbjct: 127 VRYAALILW-LDTMLVIPFAELRLRRKALHFATARLLGVVAVVLCALLFIVVMKVGLSGV 185
Query: 167 LWSYTIGYLVGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGR 226
+ G +V LV +R F F LR ML LP VP ++ L+ + R
Sbjct: 186 FLAEAAGSVVSLLVVLPLFRGFR----FGFSGQQLREMLRIGLPYVPTGIAGLLIHLIDR 241
Query: 227 YVV----------LWGSGLAAA---GLFTAASKMPSLINIVASVFQQAWQYSTAREIDSP 273
++ L+G+G + G++ + ++ +V VF+ AWQ + P
Sbjct: 242 NILIRIAPSDIERLYGAGYQPSDIVGIYGRIAAFGVVLQLVIQVFRFAWQPFFLQHGKEP 301
Query: 274 DRGAFFGSVLRGYSLAT----LSAAGLVIALNR---PISRVMLQAEFAEGWRYVPLLMLA 326
D F VL +L T LSA V L R + +L + G +P + L+
Sbjct: 302 DAQQLFHHVLSISTLLTMVLALSATFFVPDLVRYHYGGAFYLLPPPYWIGLSILPAIFLS 361
Query: 327 ATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVILGVALVPFMGPWGAG---LAGAVA 383
F +++ N+R L V T GA+V + LVP G GA +AG V
Sbjct: 362 YVFDMVSTNLSAGLLLTGNTRYLPVVTFAGAVVTALSCWWLVPLYGMDGAAYAIVAGTVV 421
Query: 384 YALVL 388
++V+
Sbjct: 422 MSVVM 426
>gi|110798949|ref|YP_694929.1| putative polysaccharide transporter protein [Clostridium
perfringens ATCC 13124]
gi|110673596|gb|ABG82583.1| putative polysaccharide transporter protein [Clostridium
perfringens ATCC 13124]
Length = 488
Score = 59.7 bits (143), Expect = 5e-07, Method: Composition-based stats.
Identities = 85/401 (21%), Positives = 183/401 (45%), Gaps = 38/401 (9%)
Query: 9 LVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYRF-S 67
++A G + K+++ + +PLYT L YG L++ ++ + LG+ RF
Sbjct: 10 FIYAFGQILSKSINFIFIPLYTNKLGTYGYGQLALIDMLFSLISVFIILGINSGYIRFYK 69
Query: 68 IDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALW-----NMEHAAAFFVLFCSVCVFK 122
+++ K L +L + + + S + ++E+ F L C
Sbjct: 70 TYNELEKKRLMNTTLTFSIIFSIVFIIVNMIISQFYINKILSLENLNLIFFLLLIRC--- 126
Query: 123 ATTQ------LARGLGHVRRFVL----YGLINALAMVVSTYLLLVRAHTGIEGYLWSYTI 172
AT Q L + + + V+ + LI L ++V Y + + + GI G Y I
Sbjct: 127 ATEQIIYLMILEYSMNYEAKIVVKLECFKLILNLTLIV--YFVAI-VNQGILGMYKGYVI 183
Query: 173 GYLVGGLVAFLGSAEYRLLAPFRF--DRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVL 230
+ ++AFL ++ + F+ D +L+ ML YS+ L+P+ +S +++++ RY++
Sbjct: 184 SNCI--ILAFL---IFKNINKFKLEIDFKMLKNMLKYSIQLIPSGISAIVLNLADRYILE 238
Query: 231 WGSGLAAAGLFTAASKMPSLINIV-----ASVFQQAWQYSTAREIDSPDRGAFFGSVLRG 285
+ SGL+ G+++ K +LI+ + VF +++ R+ D+ ++
Sbjct: 239 FFSGLSITGIYSLGYKFGTLIDPLFILPFKKVF-TPFKFEIYRDNDANEK---LNEWYYK 294
Query: 286 YSLATLSAAGLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMN 345
Y++ ++ ++ L + I + EF ++ +PL++++ + F+
Sbjct: 295 YNIIGIAVVFMISILGKLIIIITSPVEFINAYKIIPLILISYFIYGKSEFYSLGIYISNK 354
Query: 346 SRMLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYAL 386
+R ++ ++N+IL L+PF G +GA +A ++Y L
Sbjct: 355 TRFDCYIMILSGIINIILNFILIPFAGMYGATIATIISYYL 395
>gi|37725480|gb|AAO60462.1| Wzx [Streptococcus pneumoniae]
Length = 167
Score = 59.3 bits (142), Expect = 6e-07, Method: Composition-based stats.
Identities = 46/161 (28%), Positives = 83/161 (51%), Gaps = 12/161 (7%)
Query: 4 LLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEAL 63
L NT +FAL + K + +L+P+YT LT EYG +L+ + I++ +P+L+L + EA+
Sbjct: 7 LAKNTGIFALANFSSKILIFLLVPIYTRVLTTTEYGFYDLVYTTIQLFVPILTLNISEAV 66
Query: 64 YRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNM--------EHAAAFFVLF 115
RF + D V K +F S+ VL + +A AL + N+ +++ FV+F
Sbjct: 67 MRFLMKDGVSKKSVF--SIAVL--DIFIGSIAFALLLLVNNLFSLSDLISQYSIYIFVIF 122
Query: 116 CSVCVFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLL 156
+ Q ++G+ + + G+I+ M+ +LL
Sbjct: 123 VFYTLNNFLIQFSKGIDKIGVTAISGVISTAVMLAMNVILL 163
>gi|145621782|ref|ZP_01777748.1| polysaccharide biosynthesis protein [Petrotoga mobilis SJ95]
gi|144947806|gb|EDJ82834.1| polysaccharide biosynthesis protein [Petrotoga mobilis SJ95]
Length = 483
Score = 59.3 bits (142), Expect = 6e-07, Method: Composition-based stats.
Identities = 90/399 (22%), Positives = 167/399 (41%), Gaps = 27/399 (6%)
Query: 11 FALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYR--FSI 68
F++G +S + P+ T + E+G A + A ++L + LG ++ R +
Sbjct: 14 FSMGQWIAALISFITTPITTWLIIPEEFGKASMFTLAFNLLLNVALLGADQSFVRMFYER 73
Query: 69 DDDVPKDELFAGSLVVLGGGVVCTGVACALGSAL-----WNMEHAAAFFVLFCSVCV--F 121
+D +D L+ L L GVV V L + H F+L ++ +
Sbjct: 74 SEDKRRDLLWDSLLPSLSIGVVVFVVIGIFWKELSFILFGDYNHFLPIFLLGVTILIGIL 133
Query: 122 KATTQLA-----RGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLV 176
+ + LA RG+ V+ G+ NA+ ++ + + I G +S+ +V
Sbjct: 134 ERFSTLAVRMKKRGIAFSTLRVVNGVTNAVFTILYALFVSRSFYAVIIGLFFSH----IV 189
Query: 177 GGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLA 236
L+A E F+ D ++ ++ Y LP VP L WL R + S
Sbjct: 190 TALLAIFFEREL-WFGKFKVDFKSIKAIVRYGLPFVPTFLITWLFQSIDRLALRNYSDFT 248
Query: 237 AAGLFTAASKMPSLINIVASVFQQAWQYST--AREIDSPDRGAFFGSVLRGYSLATLSAA 294
GL++AA K+ S++N++ + F W + + E + +G F + + A +
Sbjct: 249 EIGLYSAAFKVVSVMNLIQTGFTMFWTPVSYESYEKEPESKGIFEKTSV--IIAAAMFVF 306
Query: 295 GLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVM--TIFFGTFYQALMNSRMLMVS 352
GL+I + + + ++L++ + + L+L + T G ++ ML+ S
Sbjct: 307 GLLIVVFKDVIFLLLESSYRQAAGISSFLILMPIMYTVSETTVVGINFKKKTYWHMLIAS 366
Query: 353 TMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVR 391
G VNV + LVP G GA + V+Y + +R
Sbjct: 367 VAAG--VNVFGNLMLVPVYGAKGAAFSTGVSYIVFFCMR 403
>gi|68551915|ref|ZP_00591308.1| Polysaccharide biosynthesis protein [Prosthecochloris aestuarii DSM
271]
gi|68241038|gb|EAN23306.1| Polysaccharide biosynthesis protein [Prosthecochloris aestuarii DSM
271]
Length = 497
Score = 58.5 bits (140), Expect = 8e-07, Method: Composition-based stats.
Identities = 90/416 (21%), Positives = 171/416 (41%), Gaps = 46/416 (11%)
Query: 1 MRLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVV 60
+RLL +T+++ + + ++ VL+PLY LT E G L+ + I + L + G +
Sbjct: 5 LRLLAKDTVIYGTSTILARGLNYVLVPLYANLLTTFENGIQALIYANIALANVLFAYG-M 63
Query: 61 EALYRFSIDDDVPKD-----------------ELFAGSLVVLGGGVVCTGVACALGSALW 103
E Y S + + ++ + +VL + + G + +
Sbjct: 64 ETSYLKSASEALRQEGGSSRYFSTAFLTLLLSSTLFAAAIVLFAPWIAVAIGLGPGESEF 123
Query: 104 NMEHAAAFFVLFCSVCVFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGI 163
+ +AA L + V A +L R H RF + + +V+ + L+V TG+
Sbjct: 124 -IRYAAVILWLDALLVVPFADLRLKR---HAIRFAAARVTGVITVVICAWGLIVGFETGL 179
Query: 164 EGYLWSYTIGYLVGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSV 223
+G + G LV L +++R F ++L+ ML LP VP ++ L+ +
Sbjct: 180 KGAFLANIAGSLVSFLFVLPVFSQFRAF----FSASMLKEMLRIGLPYVPTGIAGLLIHL 235
Query: 224 SGRYVV----------LWGSGLAAA---GLFTAASKMPSLINIVASVFQQAWQYSTAREI 270
R ++ L+G+G + G++ + L+ + VF+ AWQ +
Sbjct: 236 IDRNILIRVPQETLEALYGNGYTPSDIVGIYGRVAAFGILLQLFIQVFRFAWQPFFLQHA 295
Query: 271 DSPDRGAFFGSVLRGYSLATLSAAGL-------VIALNRPISRVMLQAEFAEGWRYVPLL 323
D P+ F VL +L T+ A ++ + S +L + G +P +
Sbjct: 296 DDPEAKTLFRHVLSLSTLFTMLVALCGTFFVPDIVRYHYGGSFYLLPPVYWMGLSILPWI 355
Query: 324 MLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVILGVALVPFMGPWGAGLA 379
+ F +++ ++R L + T GA + ++LVP MG GA +A
Sbjct: 356 FCSYIFDMISTNLTAGILITGSTRYLPLVTFAGAGTTTAVCLSLVPSMGMDGAAVA 411
>gi|91794015|ref|YP_563666.1| hypothetical protein Sden_2664 [Shewanella denitrificans OS217]
gi|91716017|gb|ABE55943.1| hypothetical protein Sden_2664 [Shewanella denitrificans OS217]
Length = 485
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 86/401 (21%), Positives = 168/401 (41%), Gaps = 25/401 (6%)
Query: 7 NTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYRF 66
+++++ LG + ++S + +P++ + + +YGT L+ I + LGV AL R
Sbjct: 9 SSILYTLGSIFSASISFIFLPVFMSEFSVSQYGTYSLILICSSIGAAVFYLGVTSALNRS 68
Query: 67 SIDDDVPKDEL--FAGSLVVLGGGVVCTGVACALGSALWNMEHAAAFFVLFCSVCVFKAT 124
D +D + F +LV+L G G+ LG + F + + VF A
Sbjct: 69 YFDYPEHQDRMNCFYTTLVLLAFG---GGLQIILGYIFADFISILIFDNVTWAQGVFYAL 125
Query: 125 TQLARG------LGHVR------RFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTI 172
+ G L + R F+ + +++ +V Y ++ G+ G L +
Sbjct: 126 ITASFGFINFAFLTYFRLKNEPYLFLFFSILSVFGNIVGIYYFVIFEGMGVIGALIGPLV 185
Query: 173 GYLVGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWG 232
++ +V FL YRL +F R L Y V + L + + ++ +
Sbjct: 186 SQVIIFIVFFL--FHYRLFFQSKFMRKEAVLQLNYGFYTVLSSLGGLTILWADQFFINKY 243
Query: 233 SGLAAAGLFTAASKMPSLINIVASV-FQQAWQYSTAREIDSPDRGAFFGSVLRGYSLATL 291
+ G+++ + K+ +I ++ + F Q + + +S Y L L
Sbjct: 244 LSINDVGVYSLSVKLAGIITVIFTTPFIQVFNPIVMEQRESNGVKVLISKTYESYVLIGL 303
Query: 292 SAAGLV-IALNRPISRVMLQAEFAEGWRYV-PLLMLAATFGVMTIF-FGTFYQALMNSRM 348
+ L+ A + A +AE Y+ PL++ +G++ + G ++ + +
Sbjct: 304 FLSILISFAAEEFVYLFGKHAGYAESIIYIFPLMLSVCIYGLVNVVSIGLSFKRKLGRQT 363
Query: 349 LMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLV 389
L+ ++VN+IL V L+P G WGA L+ V Y L+ V
Sbjct: 364 LVYFAF--SLVNIILNVFLIPAFGIWGAILSTFVTYFLITV 402
>gi|89092774|ref|ZP_01165726.1| hypothetical protein MED92_09949 [Oceanospirillum sp. MED92]
gi|89082799|gb|EAR62019.1| hypothetical protein MED92_09949 [Oceanospirillum sp. MED92]
Length = 481
Score = 57.4 bits (137), Expect = 2e-06, Method: Composition-based stats.
Identities = 98/422 (23%), Positives = 190/422 (45%), Gaps = 32/422 (7%)
Query: 7 NTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYRF 66
++L++A KA++L+++P+ T LT +YG ++L + +++ ++ +G+ E L+RF
Sbjct: 14 HSLIYASALAFSKALALIMVPVATNFLTPEDYGRLDVLQTLADLLSIVIGMGMAETLFRF 73
Query: 67 --SIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSAL------W---NMEHAAAFFVLF 115
S DDD DE S + G + + ALG L W N+ A +L
Sbjct: 74 AGSADDD---DERRKASANIFGLAICLGLFSLALGQFLAGHISQWLPGNIAELEARLILA 130
Query: 116 CSVCVFKATTQLA--RGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIG 173
V LA R G + G +A+ V+ + L+ G+ G L + I
Sbjct: 131 SLSMVGTILVPLAWLRMQGSAWSY-FAGTAGRVALQVAIAVPLLFMGFGVTGVLSATLIS 189
Query: 174 YLVGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGS 233
++ L A+L ++++ F+ A + VY PL+ +S +++ R+++
Sbjct: 190 AII--LCAWLVRSQWKDTG-ISFEFARFKAYSVYGGPLIFVGISGFVLGSFDRWILADTV 246
Query: 234 GLAAAGLFTAASKMPSLINIVASVFQQAW---QYSTAREIDSPDRGAFFGSVLRGYSLAT 290
G A + A+K + ++ F W ++S +E + +R A ++ G ++A
Sbjct: 247 GTAEMAQYALAAKFGLITAVLIQPFDLWWHPRRFSCVKEQNGKERCAQIATI--GVAIAI 304
Query: 291 LSAAGLVIALNRP-ISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMNSRML 349
LSA L+IA P + R+M + + Y+P L A G + A+ +
Sbjct: 305 LSA--LIIAATGPTLVRLMTPEAYHQSIAYIPWLAALAALHNANSNLG--FGAMSAHTTV 360
Query: 350 MVSTMMGAMVNVILG--VALVPFMGPWGAGLAGAVAYALVLVVRARDLRRRIDLPMDRSR 407
+ + G + L + L+P WGA +A ++A ++ ++ R ++ ++LP R
Sbjct: 361 KPAIIDGIAAGIALAGYLLLIPIYHAWGAIIATSLALSIRMLATYRVSQKALNLPYRIRR 420
Query: 408 LT 409
L+
Sbjct: 421 LS 422
>gi|67938692|ref|ZP_00531213.1| Polysaccharide biosynthesis protein [Chlorobium phaeobacteroides
BS1]
gi|67915050|gb|EAM64377.1| Polysaccharide biosynthesis protein [Chlorobium phaeobacteroides
BS1]
Length = 501
Score = 57.0 bits (136), Expect = 3e-06, Method: Composition-based stats.
Identities = 93/431 (21%), Positives = 176/431 (40%), Gaps = 55/431 (12%)
Query: 1 MRLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVV 60
++LL +T+++ + + ++ VL+PLY LT + G L+ + I + L + G+
Sbjct: 5 LKLLAKDTVIYGSSTILARGLNYVLVPLYANLLTTFDNGVHALIYANIALANVLFAYGME 64
Query: 61 EALYRFSIDDDVPKDE------------LFAGSLVVLGGGVVCTGVACALGSALWN---M 105
+ + + D+ + L ++ G+A +G + +
Sbjct: 65 TSYLKVASDNTRSGGDSARCFSTAFISLLLTSTVFTAAILFFAPGIAELIGLSENQKDFI 124
Query: 106 EHAAAFFVLFCSVCVFKATTQLARGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEG 165
+AA L + + A +L R RF + ++ + +V+S + L+V+ TG+ G
Sbjct: 125 RYAAVILWLDALLVIPFADLRLKR---KAIRFAIARILGVVTIVISAFTLIVQFKTGLHG 181
Query: 166 YLWSYTIGYLVGGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSG 225
+ G LV L +++ F LR ML LP VP ++ L+ +
Sbjct: 182 AFLANIAGSLVSLLFVLPVFGQFQRF----FSSDTLREMLRIGLPYVPTGIAGLLIHLID 237
Query: 226 RYVV----------LWGSGLAAA---GLFTAASKMPSLINIVASVFQQAWQYSTAREIDS 272
R ++ ++G+G + G++ + LI + VF+ AWQ + D
Sbjct: 238 RNILIRMRPEDIENIYGAGYVQSDIVGIYGRVAAFGILIQLFIQVFRFAWQPFFLQHADD 297
Query: 273 PDRGAFFGSVLRGYSLATLSAAGLVIALNRPI------------SRVMLQAEFAEGWRYV 320
P+ F VL S++T+ A +VIAL +L + G +
Sbjct: 298 PEAKKLFRHVL---SISTVFA--MVIALVSTFYVPDLIRYHYFERLYILPPAYWVGLSIL 352
Query: 321 PLLMLAATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVILGVALVPFMGPWGAG--- 377
P + + F +++ ++R L V T GA V + + LVP +G GA
Sbjct: 353 PWIFFSYIFDMISTNLTAGILITGSTRYLPVVTFAGAGVTTVTCLLLVPSLGMEGAAISI 412
Query: 378 LAGAVAYALVL 388
LAG V ++ +
Sbjct: 413 LAGTVVMSICM 423
>gi|145621520|ref|ZP_01777489.1| polysaccharide biosynthesis protein [Petrotoga mobilis SJ95]
gi|144948157|gb|EDJ83184.1| polysaccharide biosynthesis protein [Petrotoga mobilis SJ95]
Length = 483
Score = 57.0 bits (136), Expect = 3e-06, Method: Composition-based stats.
Identities = 87/399 (21%), Positives = 168/399 (42%), Gaps = 27/399 (6%)
Query: 11 FALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYR--FSI 68
F++G +S + P+ T + E+G A + A ++L + LG ++ R +
Sbjct: 14 FSMGQWIAALISFITTPITTWLIIPEEFGKASMFTLAFNLLLNVALLGADQSFVRMFYER 73
Query: 69 DDDVPKDELFAGSLVVLGGGVVCTGVACALGSAL-----WNMEHAAAFFVLFCSVCV--F 121
+D +D L+ L L GVV V L + H F+L ++ +
Sbjct: 74 SEDKRRDLLWDSLLPSLSIGVVVFVVIGIFWKELSFILFGDYNHFLPIFLLGVTILIGIL 133
Query: 122 KATTQLA-----RGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLV 176
+ + LA RG+ V+ G+ NA+ ++ + + I G +S+ +V
Sbjct: 134 ERFSTLAVRMKKRGIAFSTLRVVNGVTNAVFTILYALFVSRSFYAVIIGLFFSH----IV 189
Query: 177 GGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLA 236
L+A E F+ D ++ ++ Y LP VP L WL R + S
Sbjct: 190 TALLAIFFEREL-WFGKFKVDFKSIKAIVRYGLPFVPTFLITWLFQSIDRLSLRNYSDFT 248
Query: 237 AAGLFTAASKMPSLINIVASVFQQAWQYST--AREIDSPDRGAFFGSVLRGYSLATLSAA 294
GL++AA K+ S+++++ + F W + + E + +G F + + + A +
Sbjct: 249 EIGLYSAAFKVVSVMSLIQAGFTTFWTPVSYESYEKEPESKGIFEKTSV--FIAAAMFVF 306
Query: 295 GLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVM--TIFFGTFYQALMNSRMLMVS 352
GL++ + + + ++L++ + + L+L + T G ++ ML+ +
Sbjct: 307 GLLLVVFKDVIFLLLESSYRQAAGISSFLILMPIMYTVSETTVVGINFKKKTYWHMLIAT 366
Query: 353 TMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVR 391
G VNV + LVP G GA + V+Y + +R
Sbjct: 367 VAAG--VNVFGNLMLVPVYGAKGAAFSTGVSYIVFFCMR 403
>gi|145621524|ref|ZP_01777493.1| polysaccharide biosynthesis protein [Petrotoga mobilis SJ95]
gi|144948161|gb|EDJ83188.1| polysaccharide biosynthesis protein [Petrotoga mobilis SJ95]
Length = 483
Score = 56.2 bits (134), Expect = 5e-06, Method: Composition-based stats.
Identities = 87/399 (21%), Positives = 168/399 (42%), Gaps = 27/399 (6%)
Query: 11 FALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVVEALYR--FSI 68
F++G +S + P+ T + E+G A + A ++L + LG ++ R +
Sbjct: 14 FSMGQWIAALISFITTPITTWLIIPEEFGKASMFTLAFNLLLNVALLGADQSFVRMFYER 73
Query: 69 DDDVPKDELFAGSLVVLGGGVVCTGVACALGSAL-----WNMEHAAAFFVLFCSVCV--F 121
+D +D L+ L L GVV V L + H F+L ++ +
Sbjct: 74 SEDKRRDLLWDSLLPSLSIGVVVFVVIGIFWKELSFILFGDYNHFLPIFLLGVTILIGIL 133
Query: 122 KATTQLA-----RGLGHVRRFVLYGLINALAMVVSTYLLLVRAHTGIEGYLWSYTIGYLV 176
+ + LA RG+ V+ G+ NA+ ++ + + I G +S+ +V
Sbjct: 134 ERFSTLAVRMKKRGIAFSTLRVVNGVTNAVFTILYALFVSRSFYAVIIGLFFSH----IV 189
Query: 177 GGLVAFLGSAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSGRYVVLWGSGLA 236
L+A E F+ D ++ ++ Y LP VP L WL R + S
Sbjct: 190 TALLAIFFEREL-WFGKFKVDFKSIKAIVRYGLPFVPTFLITWLFQSIDRLSLRNYSDFT 248
Query: 237 AAGLFTAASKMPSLINIVASVFQQAWQYST--AREIDSPDRGAFFGSVLRGYSLATLSAA 294
GL++AA K+ S+++++ + F W + + E + +G F + + + A +
Sbjct: 249 EIGLYSAAFKVVSVMSLIQAGFTTFWTPVSYESYEKEPESKGIFEKTSV--FIAAAMFVF 306
Query: 295 GLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVM--TIFFGTFYQALMNSRMLMVS 352
GL++ + + + ++L++ + + L+L + T G ++ ML+ +
Sbjct: 307 GLLLVVFKDVIFLLLESSYRQAAGISSFLILMPIMYTVSETTVVGINFKKKTYWHMLIAT 366
Query: 353 TMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVVR 391
G VNV + LVP G GA + V+Y + +R
Sbjct: 367 VAAG--VNVFGNLMLVPVYGAKGAAFSTGVSYIVFFCMR 403
>gi|67918833|ref|ZP_00512425.1| Polysaccharide biosynthesis protein [Chlorobium limicola DSM 245]
gi|67783503|gb|EAM42890.1| Polysaccharide biosynthesis protein [Chlorobium limicola DSM 245]
Length = 504
Score = 55.8 bits (133), Expect = 6e-06, Method: Composition-based stats.
Identities = 103/426 (24%), Positives = 182/426 (42%), Gaps = 45/426 (10%)
Query: 1 MRLLLGNTLVFALGGLAVKAVSLVLMPLYTTALTAGEYGTAELLNSAIEIVLPLLSLGVV 60
++LL +T+++ + ++++ +L+PLY LT + G ++ + I + L S G+
Sbjct: 5 LKLLAKDTVIYGASTIVARSLNYLLVPLYANKLTTFDNGIQTVVYANIALANVLFSYGL- 63
Query: 61 EALYRFSIDDDVPKDELFAGSLVVLGGGVVCTGVACALGSALWNMEHAAAF--------F 112
E Y S D + G ++ T AL L++ + AA F
Sbjct: 64 ETSYLKSASDTLHHSGDARGYFSTAFFSLLVTSTLFALIMVLFSADIAAMIGLGPEEGEF 123
Query: 113 VLFCSVCVFKATTQLARGLGHVR------RFVLYGLINALAMVVSTYLLLVRAHTGIEGY 166
+ + ++ ++ T L +R RF + +I +A+V+S +LL+ G+ G
Sbjct: 124 INYAAIILW-IDTLLVIPFAELRLKRKALRFAVARIIGVVAVVLSALVLLIVFDAGLPGA 182
Query: 167 LWSYTIGYLVGGLVAFLGSAEYRLLAP-FRFDRALLRRMLVYSLPLVPNLLSWWLVSVSG 225
++ IG LV L A L +R L P F FD LR ML LP VP ++ L+ +
Sbjct: 183 FFANIIGSLVS-LAAVL--PLFRELKPVFSFD--YLREMLRIGLPYVPTGIAGLLIHLID 237
Query: 226 RYVVL----------WGSGLAAA---GLFTAASKMPSLINIVASVFQQAWQYSTAREIDS 272
R +++ +G G A+ G++ + ++ +V VF+ AWQ + +
Sbjct: 238 RNLLIRISSSDIRRIYGDGYMASDIVGIYGRVAAFGIILQLVIQVFRFAWQPFFLQHAND 297
Query: 273 PDRGAFFGSVLRGYSLATLSAAGLVIALNRPISRV-------MLQAEFAEGWRYVPLLML 325
PD F VL + T+ A + R +L ++ G +P + L
Sbjct: 298 PDAKRLFRYVLSISTFITMFGALAATFFVPDLVRYHYGGNFYILPPKYWIGLSVLPWIFL 357
Query: 326 AATFGVMTIFFGTFYQALMNSRMLMVSTMMGAMVNVILGVALVPFMGPWGAG---LAGAV 382
+ F +++ + N+R L V T GA V + L+P G GA +AG V
Sbjct: 358 SYVFDMISTNLSSGMLVTGNTRYLPVVTFAGAAVTGVACWLLIPLSGMDGAAYAIVAGTV 417
Query: 383 AYALVL 388
LV+
Sbjct: 418 VMCLVM 423
>gi|114566262|ref|YP_753416.1| hypothetical protein Swol_0722 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
gi|114337197|gb|ABI68045.1| hypothetical protein Swol_0722 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
Length = 430
Score = 55.8 bits (133), Expect = 6e-06, Method: Composition-based stats.
Identities = 59/285 (20%), Positives = 121/285 (42%), Gaps = 8/285 (2%)
Query: 110 AFFVLFCSVCVFKATTQLARGLGHVRRFVLYGLINALAMVVS---TYLLLVRAHTGIEGY 166
AF + +V + Q L +R+ ++ L+N +++S L ++ G++GY
Sbjct: 61 AFRIAIITVPLTAIYNQFIIVLRLMRKPWVFLLLNVSFVIISFILVILFIIYLEIGLQGY 120
Query: 167 LWSYTIGYLVGGLVAFLG-SAEYRLLAPFRFDRALLRRMLVYSLPLVPNLLSWWLVSVSG 225
+ LV ++ F+ + YR+L F R L++ML YSLP +P++++ W ++
Sbjct: 121 FLANLCTTLVCVIICFIYLKSFYRIL----FSRKHLKKMLKYSLPQLPSVVANWCMTSFN 176
Query: 226 RYVVLWGSGLAAAGLFTAASKMPSLINIVASVFQQAWQYSTAREIDSPDRGAFFGSVLRG 285
R++++ GLF SK+ S+ ++ SVF+ AW + + +
Sbjct: 177 RFIMVGYLSRMDIGLFAIGSKVASIAMVIVSVFRMAWDPLAISIYKTDNAKEIYVKAFEA 236
Query: 286 YSLATLSAAGLVIALNRPISRVMLQAEFAEGWRYVPLLMLAATFGVMTIFFGTFYQALMN 345
Y + L+ ++ I ++ E+ +L+L T G
Sbjct: 237 YGTVVVLGVALIGFFSKEILTILTTPEYYGASLLTSVLVLQFLAQGFTNIVGIGISIEKK 296
Query: 346 SRMLMVSTMMGAMVNVILGVALVPFMGPWGAGLAGAVAYALVLVV 390
+ +L + ++ ++ +I L+P G GA A + V+
Sbjct: 297 THILSYAMILALLILLICSYILIPIFGAMGAAWANVMGAWTAFVI 341
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.328 0.140 0.426
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,579,558,950
Number of Sequences: 5470121
Number of extensions: 64401394
Number of successful extensions: 254114
Number of sequences better than 1.0e-05: 142
Number of HSP's better than 0.0 without gapping: 25
Number of HSP's successfully gapped in prelim test: 117
Number of HSP's that attempted gapping in prelim test: 253825
Number of HSP's gapped (non-prelim): 168
length of query: 467
length of database: 1,894,087,724
effective HSP length: 136
effective length of query: 331
effective length of database: 1,150,151,268
effective search space: 380700069708
effective search space used: 380700069708
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 132 (55.5 bits)