BLASTP 2.2.18 [Mar-02-2008]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= SPy_0797 hypothetical protein
(428 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
6,515,104 sequences; 2,222,278,849 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_269013.1| hypothetical protein SPy_0797 [Streptococc... 856 0.0
ref|YP_602193.1| Transcriptional activator amrA [Streptococ... 854 0.0
ref|NP_607014.1| hypothetical protein spyM18_0859 [Streptoc... 853 0.0
ref|YP_001128737.1| putative exopolysaccharide biosynthesis... 852 0.0
ref|YP_598276.1| Transcriptional activator amrA [Streptococ... 851 0.0
ref|YP_596399.1| transcriptional activator [Streptococcus p... 849 0.0
ref|ZP_00366456.1| COG2244: Membrane protein involved in th... 793 0.0
ref|YP_141830.1| polysaccharide/teichoic acid transporter, ... 346 2e-93
ref|YP_820801.1| Membrane protein involved in the export of... 345 4e-93
ref|ZP_00875229.1| conserved hypothetical protein [Streptoc... 335 4e-90
ref|YP_001198649.1| polysaccharide/teichoic acid transporte... 332 5e-89
ref|YP_001035463.1| Polysaccharide/teichoic acid transporte... 320 2e-85
ref|YP_139899.1| polysaccharide biosynthesis protein, putat... 287 1e-75
ref|YP_001450304.1| hypothetical protein SGO_1015 [Streptoc... 268 8e-70
ref|YP_001200855.1| polysaccharide/teichoic acid transporte... 245 5e-63
gb|AAN64553.1| hypothetical protein [Streptococcus gordonii] 238 7e-61
ref|ZP_02432184.1| hypothetical protein CLOSCI_02429 [Clost... 222 5e-56
ref|ZP_02442856.1| hypothetical protein ANACOL_02154 [Anaer... 162 3e-38
ref|ZP_02080928.1| hypothetical protein CLOLEP_02391 [Clost... 153 2e-35
ref|ZP_02045178.1| hypothetical protein ACTODO_02068 [Actin... 152 4e-35
ref|ZP_01772528.1| Hypothetical protein COLAER_01534 [Colli... 126 3e-27
ref|ZP_02087505.1| hypothetical protein CLOBOL_05049 [Clost... 124 1e-26
ref|ZP_02544449.1| Transcriptional activator amrA [candidat... 114 2e-23
ref|ZP_02912393.1| polysaccharide biosynthesis protein [Geo... 88 1e-15
ref|YP_826100.1| polysaccharide biosynthesis protein [Solib... 82 1e-13
ref|ZP_00110716.1| COG2244: Membrane protein involved in th... 74 2e-11
ref|YP_589916.1| polysaccharide biosynthesis protein [Acido... 63 3e-08
ref|YP_001430646.1| polysaccharide biosynthesis protein [Ro... 63 3e-08
ref|YP_001715530.1| putative polysaccharide biosynthesis pr... 60 2e-07
ref|YP_001277945.1| polysaccharide biosynthesis protein [Ro... 60 3e-07
ref|ZP_02579279.1| polysaccharide biosynthesis protein [Bac... 59 6e-07
ref|YP_663564.1| hypothetical protein Patl_4010 [Pseudoalte... 50 3e-04
sp|P39855|CAPF_STAAU Capsular polysaccharide biosynthesis p... 49 6e-04
ref|ZP_01905856.1| polysaccharide biosynthesis protein [Ple... 46 0.004
ref|XP_976745.1| Leishmanolysin family protein [Tetrahymena... 45 0.016
ref|YP_001815039.1| polysaccharide biosynthesis protein [Ex... 44 0.029
ref|YP_001517033.1| polysaccharide biosynthesis protein, pu... 41 0.15
ref|YP_213068.1| putative LPS biosynthesis related polysacc... 40 0.32
ref|YP_001097823.1| polysaccharide biosynthesis protein [Me... 40 0.44
ref|YP_001274132.1| polysaccharide biosynthesis protein, Mv... 40 0.50
ref|YP_001274133.1| polysaccharide biosynthesis protein, Mv... 39 0.74
ref|ZP_01612341.1| AmrA [Alteromonadales bacterium TW-7] >g... 39 0.87
ref|YP_345310.1| Capsular polysaccharide biosynthesis prote... 38 1.4
ref|ZP_02134108.1| MATE efflux family protein [Desulfatibac... 35 8.3
ref|XP_001613551.1| hypothetical protein PVX_081380 [Plasmo... 35 9.6
>ref|NP_269013.1| hypothetical protein SPy_0797 [Streptococcus pyogenes M1 GAS]
ref|NP_664335.1| hypothetical protein SpyM3_0531 [Streptococcus pyogenes MGAS315]
ref|NP_802585.1| hypothetical protein SPs1323 [Streptococcus pyogenes SSI-1]
ref|YP_280060.1| transcriptional activator [Streptococcus pyogenes MGAS6180]
ref|YP_281975.1| transcriptional activator [Streptococcus pyogenes MGAS5005]
gb|AAK33734.1| conserved hypothetical protein [Streptococcus pyogenes M1 GAS]
gb|AAM79138.1| conserved hypothetical protein [Streptococcus pyogenes MGAS315]
dbj|BAC64418.1| conserved hypothetical protein [Streptococcus pyogenes SSI-1]
gb|AAX71705.1| transcriptional activator [Streptococcus pyogenes MGAS6180]
gb|AAZ51230.1| transcriptional activator [Streptococcus pyogenes MGAS5005]
Length = 428
Score = 856 bits (2212), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/428 (100%), Positives = 428/428 (100%)
Query: 1 MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGL 60
MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGL
Sbjct: 1 MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGL 60
Query: 61 FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS 120
FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS
Sbjct: 61 FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS 120
Query: 121 TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVF 180
TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVF
Sbjct: 121 TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVF 180
Query: 181 IMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM 240
IMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM
Sbjct: 181 IMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM 240
Query: 241 TTLGEVALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAY 300
TTLGEVALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAY
Sbjct: 241 TTLGEVALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAY 300
Query: 301 LGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK 360
LGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK
Sbjct: 301 LGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK 360
Query: 361 QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG 420
QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG
Sbjct: 361 QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG 420
Query: 421 RVNATIYD 428
RVNATIYD
Sbjct: 421 RVNATIYD 428
>ref|YP_602193.1| Transcriptional activator amrA [Streptococcus pyogenes MGAS10750]
gb|ABF37649.1| Transcriptional activator amrA [Streptococcus pyogenes MGAS10750]
Length = 428
Score = 854 bits (2206), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/428 (99%), Positives = 427/428 (99%)
Query: 1 MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGL 60
MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTS+DSDIYAFAYSFANMMVVVGL
Sbjct: 1 MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSSDSDIYAFAYSFANMMVVVGL 60
Query: 61 FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS 120
FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS
Sbjct: 61 FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS 120
Query: 121 TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVF 180
TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVA CIVSLVF
Sbjct: 121 TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAACIVSLVF 180
Query: 181 IMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM 240
IMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM
Sbjct: 181 IMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM 240
Query: 241 TTLGEVALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAY 300
TTLGEVALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAY
Sbjct: 241 TTLGEVALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAY 300
Query: 301 LGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK 360
LGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK
Sbjct: 301 LGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK 360
Query: 361 QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG 420
QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG
Sbjct: 361 QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG 420
Query: 421 RVNATIYD 428
RVNATIYD
Sbjct: 421 RVNATIYD 428
>ref|NP_607014.1| hypothetical protein spyM18_0859 [Streptococcus pyogenes MGAS8232]
ref|YP_059947.1| AmrA [Streptococcus pyogenes MGAS10394]
gb|AAL97513.1| conserved hypothetical protein [Streptococcus pyogenes MGAS8232]
gb|AAT86764.1| AmrA [Streptococcus pyogenes MGAS10394]
Length = 428
Score = 853 bits (2205), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/428 (99%), Positives = 427/428 (99%)
Query: 1 MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGL 60
MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGL
Sbjct: 1 MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGL 60
Query: 61 FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS 120
FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS
Sbjct: 61 FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS 120
Query: 121 TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVF 180
TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVF
Sbjct: 121 TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVF 180
Query: 181 IMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM 240
IMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM
Sbjct: 181 IMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM 240
Query: 241 TTLGEVALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAY 300
TTLGEVALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAY
Sbjct: 241 TTLGEVALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAY 300
Query: 301 LGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK 360
LGVFSLI LVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK
Sbjct: 301 LGVFSLIVLVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK 360
Query: 361 QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG 420
QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG
Sbjct: 361 QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG 420
Query: 421 RVNATIYD 428
RVNATIYD
Sbjct: 421 RVNATIYD 428
>ref|YP_001128737.1| putative exopolysaccharide biosynthesis protein [Streptococcus
pyogenes str. Manfredo]
emb|CAM30520.1| putative exopolysaccharide biosynthesis protein [Streptococcus
pyogenes str. Manfredo]
Length = 428
Score = 852 bits (2201), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 426/428 (99%), Positives = 427/428 (99%)
Query: 1 MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGL 60
MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGL
Sbjct: 1 MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGL 60
Query: 61 FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS 120
FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS
Sbjct: 61 FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS 120
Query: 121 TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVF 180
TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILY KNLTLALVAVCIVSLVF
Sbjct: 121 TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYYKNLTLALVAVCIVSLVF 180
Query: 181 IMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM 240
IMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM
Sbjct: 181 IMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM 240
Query: 241 TTLGEVALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAY 300
TTLGEVALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAY
Sbjct: 241 TTLGEVALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAY 300
Query: 301 LGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK 360
LGVFSLIALVGSGLFGIPFLS+LYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK
Sbjct: 301 LGVFSLIALVGSGLFGIPFLSMLYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK 360
Query: 361 QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG 420
QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG
Sbjct: 361 QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG 420
Query: 421 RVNATIYD 428
RVNATIYD
Sbjct: 421 RVNATIYD 428
>ref|YP_598276.1| Transcriptional activator amrA [Streptococcus pyogenes MGAS10270]
gb|ABF33732.1| Transcriptional activator amrA [Streptococcus pyogenes MGAS10270]
Length = 428
Score = 851 bits (2199), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/428 (99%), Positives = 428/428 (100%)
Query: 1 MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGL 60
MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGL
Sbjct: 1 MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGL 60
Query: 61 FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS 120
FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS
Sbjct: 61 FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS 120
Query: 121 TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVF 180
TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVF
Sbjct: 121 TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVF 180
Query: 181 IMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM 240
IMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM
Sbjct: 181 IMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM 240
Query: 241 TTLGEVALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAY 300
TTLGE+ALGSQTIFNILFMPAFVMNLLILFFRP+ITQMAIALIRGQIKEFNKIQVQLFAY
Sbjct: 241 TTLGEIALGSQTIFNILFMPAFVMNLLILFFRPYITQMAIALIRGQIKEFNKIQVQLFAY 300
Query: 301 LGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK 360
LGVFSLIALVGSGLFGIPFLS+LYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK
Sbjct: 301 LGVFSLIALVGSGLFGIPFLSMLYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK 360
Query: 361 QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG 420
QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG
Sbjct: 361 QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG 420
Query: 421 RVNATIYD 428
RVNATIYD
Sbjct: 421 RVNATIYD 428
>ref|YP_596399.1| transcriptional activator [Streptococcus pyogenes MGAS9429]
ref|YP_600273.1| Transcriptional activator amrA [Streptococcus pyogenes MGAS2096]
gb|ABF31855.1| transcriptional activator [Streptococcus pyogenes MGAS9429]
gb|ABF35729.1| Transcriptional activator amrA [Streptococcus pyogenes MGAS2096]
Length = 428
Score = 849 bits (2194), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 425/428 (99%), Positives = 428/428 (100%)
Query: 1 MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGL 60
MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGL
Sbjct: 1 MINPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGL 60
Query: 61 FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS 120
FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS
Sbjct: 61 FQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRS 120
Query: 121 TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVF 180
TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVF
Sbjct: 121 TDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVF 180
Query: 181 IMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM 240
IMYYDIG+SKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM
Sbjct: 181 IMYYDIGYSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELM 240
Query: 241 TTLGEVALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAY 300
TTLGEVALGSQTIFNILFMPAFVMNLLILFFRP+ITQMAIALIRGQIKEFNKIQVQLFAY
Sbjct: 241 TTLGEVALGSQTIFNILFMPAFVMNLLILFFRPYITQMAIALIRGQIKEFNKIQVQLFAY 300
Query: 301 LGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK 360
LGVFSLIALVGSGLFGIPFLS+LYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK
Sbjct: 301 LGVFSLIALVGSGLFGIPFLSMLYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK 360
Query: 361 QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG 420
QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG
Sbjct: 361 QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKG 420
Query: 421 RVNATIYD 428
RVNATIYD
Sbjct: 421 RVNATIYD 428
>ref|ZP_00366456.1| COG2244: Membrane protein involved in the export of O-antigen and
teichoic acid [Streptococcus pyogenes M49 591]
Length = 398
Score = 793 bits (2047), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/398 (99%), Positives = 398/398 (100%)
Query: 31 MVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQATDINEKYSFSQYLVARLMTCLL 90
MVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQATDINEKYSFSQYLVARLMTCLL
Sbjct: 1 MVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQATDINEKYSFSQYLVARLMTCLL 60
Query: 91 MLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLYQGMFQQHERLDIAGKSLAYRNT 150
MLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLYQGMFQQHERLDIAGKSLAYRNT
Sbjct: 61 MLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLYQGMFQQHERLDIAGKSLAYRNT 120
Query: 151 LIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYDIGHSKKFQKLMFSELLSNISFQNSL 210
LIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYDIGHSKKFQKLMFSELLSNISFQNSL
Sbjct: 121 LIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYDIGHSKKFQKLMFSELLSNISFQNSL 180
Query: 211 KLLKESFPLFLNGFLIIYIYTQPKYAIELMTTLGEVALGSQTIFNILFMPAFVMNLLILF 270
KLLKESFPLFLNGFLIIYIYTQPKYAIELMTTLGE+ALGSQTIFNILFMPAFVMNLLILF
Sbjct: 181 KLLKESFPLFLNGFLIIYIYTQPKYAIELMTTLGEIALGSQTIFNILFMPAFVMNLLILF 240
Query: 271 FRPHITQMAIALIRGQIKEFNKIQVQLFAYLGVFSLIALVGSGLFGIPFLSILYGTNLTD 330
FRP+ITQMAIALIRGQIKEFNKIQVQLFAYLGVFSLIALVGSGLFGIPFLS+LYGTNLTD
Sbjct: 241 FRPYITQMAIALIRGQIKEFNKIQVQLFAYLGVFSLIALVGSGLFGIPFLSMLYGTNLTD 300
Query: 331 YWVDFMLIMLGGSIGSFATVIDNILTAMRKQQLLLIPYTGGFLISLLITNLFVMKYHILG 390
YWVDFMLIMLGGSIGSFATVIDNILTAMRKQQLLLIPYTGGFLISLLITNLFVMKYHILG
Sbjct: 301 YWVDFMLIMLGGSIGSFATVIDNILTAMRKQQLLLIPYTGGFLISLLITNLFVMKYHILG 360
Query: 391 AALSFLITMLVWLGLSIMIYLFIMNRFKKGRVNATIYD 428
AALSFLITMLVWLGLSIMIYLFIMNRFKKGRVNATIYD
Sbjct: 361 AALSFLITMLVWLGLSIMIYLFIMNRFKKGRVNATIYD 398
>ref|YP_141830.1| polysaccharide/teichoic acid transporter, putative [Streptococcus
thermophilus CNRZ1066]
gb|AAV63015.1| polysaccharide/teichoic acid transporter, putative [Streptococcus
thermophilus CNRZ1066]
Length = 419
Score = 346 bits (887), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 188/419 (44%), Positives = 268/419 (63%), Gaps = 10/419 (2%)
Query: 9 QTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQA 68
Q +F WN+LGS+S+A +SVILL +VTR L SA +D Y+FAY+ AN+ V+V FQVR++QA
Sbjct: 7 QKVFFWNILGSMSSAAVSVILLFIVTRSLNSASADTYSFAYAIANLFVIVASFQVRDFQA 66
Query: 69 TDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLY 128
TDI EKYSF Y V R+++ + M+ + V YL I+F V F+R ++A SD++
Sbjct: 67 TDIKEKYSFDTYFVTRIISNVAMVLLLVTYLIFNTNTHSNLGIIFWVSFFRVSEALSDVF 126
Query: 129 QGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYDIGH 188
QG+FQQ ERLDIAGKSL RNT+ +V+ ++ SKNL ++++ I S VFI +D H
Sbjct: 127 QGLFQQKERLDIAGKSLFLRNTISTIVFALTLVISKNLLWSVISQTISSFVFIALFDYPH 186
Query: 189 SKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELMTTLGEVAL 248
SK F +L L ++ N + +LK+ PLF+N FL++ IY QPKYA+ + G +
Sbjct: 187 SKFFHRLN----LKSVKPSNIINVLKDCLPLFINAFLLVSIYNQPKYALNDIFNQGLIGN 242
Query: 249 GSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAYL-GVFSLI 307
G Q F+ILF P F MNL+I+F RP ITQ+A+ L +I F + LF L G LI
Sbjct: 243 GVQRDFSILFTPIFTMNLMIVFLRPMITQLAVFLEEKKISHFVTYKNNLFKILFGTCILI 302
Query: 308 ALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRKQQLLLIP 367
L+G IP L I+YGTNL Y F++++LGG +F+T+ DNILT RKQ L+I
Sbjct: 303 FLIG-AFIAIPALDIVYGTNLKQYQTSFVVLLLGGIASTFSTICDNILTIFRKQHFLVIS 361
Query: 368 YTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNR---FKKGRVN 423
+T G+++S+L V K+ I GA+LSFL M+ WL S++IY F+ N F++ ++
Sbjct: 362 FTVGYIVSILTAKPLVSKFEIFGASLSFLCAMIAWLLASLVIY-FVTNPYTIFRRKKIK 419
>ref|YP_820801.1| Membrane protein involved in the export of polysaccharides
[Streptococcus thermophilus LMD-9]
gb|ABJ66605.1| Membrane protein involved in the export of polysaccharides
[Streptococcus thermophilus LMD-9]
Length = 419
Score = 345 bits (885), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 188/419 (44%), Positives = 269/419 (64%), Gaps = 10/419 (2%)
Query: 9 QTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQA 68
Q +F WN+LGS+S+A +SVILL +VTR L SA +D Y+FAY+ AN+ V+V FQVR++QA
Sbjct: 7 QKVFFWNILGSMSSAAVSVILLFIVTRALNSASADTYSFAYAIANLFVIVASFQVRDFQA 66
Query: 69 TDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLY 128
TDI EKYSF Y V R+++ + M+ + V+YL I+F V F+R ++A SD++
Sbjct: 67 TDIREKYSFDTYFVTRIISNVAMVLLLVMYLIFNTNTHSNLGIIFWVSFFRVSEALSDVF 126
Query: 129 QGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYDIGH 188
QG+FQQ ERLDIAGKSL RNT+ +V+ ++ SKNL ++++ I S VFI +D H
Sbjct: 127 QGLFQQKERLDIAGKSLFLRNTISTIVFALTLVISKNLLWSVISQTISSFVFIALFDYPH 186
Query: 189 SKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELMTTLGEVAL 248
SK F +L L ++ N + +LK+ PLF+N FL++ IY QPKYA+ + G +
Sbjct: 187 SKFFHRLN----LKSVKPSNIINVLKDCLPLFINAFLLVSIYNQPKYALNDIFNQGLIGN 242
Query: 249 GSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAYL-GVFSLI 307
G Q F+ILF P F MNL+I+F RP ITQ+A+ L +I F + LF L G LI
Sbjct: 243 GVQRDFSILFTPIFAMNLMIVFLRPMITQLAVFLEEKKISHFVTYKNNLFKILFGTCILI 302
Query: 308 ALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRKQQLLLIP 367
L+G+ IP L I+YGTNL Y F++++LGG +F+TV DNILT RKQ L+I
Sbjct: 303 FLIGA-FIAIPALDIVYGTNLKQYQTSFVVLLLGGIASTFSTVCDNILTIFRKQHFLVIS 361
Query: 368 YTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNR---FKKGRVN 423
+ G+++S+L V K+ I GA+LSFL M+ WL S++IY F+ N F++ ++
Sbjct: 362 FIVGYIVSILTAKPLVSKFEIFGASLSFLCAMIAWLLASLVIY-FVTNPYIIFRRKKIK 419
>ref|ZP_00875229.1| conserved hypothetical protein [Streptococcus suis 89/1591]
gb|EAP40613.1| conserved hypothetical protein [Streptococcus suis 89/1591]
Length = 419
Score = 335 bits (859), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 182/403 (45%), Positives = 278/403 (68%), Gaps = 4/403 (0%)
Query: 9 QTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQA 68
+TIF+WN+LGS+S+A IS+ LL++VTRLLT ++DI++FAY+ AN+ V++ FQVR+YQA
Sbjct: 9 KTIFIWNLLGSISSAAISIFLLLLVTRLLTELEADIFSFAYTVANLFVIIASFQVRDYQA 68
Query: 69 TDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLY 128
TD+++K+SFSQYL RL+T +ML + + Y+ L+K + KS +FL+C YR +DA SD++
Sbjct: 69 TDVSKKFSFSQYLATRLITITIMLLLALSYIFLSKYEFQKSACIFLICLYRGSDALSDVF 128
Query: 129 QGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYDIGH 188
QG+FQQ+ RLDIAGKSL RN+++ + + + + NL L+L+ + I S +F+ ++D+ +
Sbjct: 129 QGLFQQNARLDIAGKSLFLRNSIVILTFGFGLFITNNLLLSLIYLVISSYLFVFFFDVTN 188
Query: 189 SKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELMTTLGEVAL 248
+F +++ E I+ + +L E PLF+N FL++ IY QPKYA+ G +
Sbjct: 189 LFQFTRIIKEE----INLKAIKNILLECLPLFINAFLLVTIYNQPKYALNTFFERGVIGT 244
Query: 249 GSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAYLGVFSLIA 308
G Q FNILFMP F MN+L++ FRP ITQ+AI G +F + Q ++ + +++
Sbjct: 245 GVQRDFNILFMPVFSMNILLILFRPMITQLAIYRRAGDYNQFKQYQKRIVKMVVGLAVLV 304
Query: 309 LVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRKQQLLLIPY 368
LVG + GIP L+ILYGTNL YW+ F++ MLGG +FAT+ DN+LT +RKQ+ L+I +
Sbjct: 305 LVGGIVLGIPALNILYGTNLNKYWLSFIITMLGGIASTFATICDNMLTVLRKQKYLVISF 364
Query: 369 TGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYL 411
L+S+LI+N V Y ILGAA++F+ +M W +S +IYL
Sbjct: 365 AISCLLSILISNPLVEYYGILGAAIAFVSSMWTWFLISFVIYL 407
>ref|YP_001198649.1| polysaccharide/teichoic acid transporter, putative [Streptococcus
suis 05ZYH33]
gb|ABP90250.1| polysaccharide/teichoic acid transporter, putative [Streptococcus
suis 05ZYH33]
Length = 419
Score = 332 bits (850), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 180/403 (44%), Positives = 278/403 (68%), Gaps = 4/403 (0%)
Query: 9 QTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQA 68
+ IF+WN+LGS+S+A IS+ LL++VTRLLT ++DI++FAY+ AN+ V++ FQVR+YQA
Sbjct: 9 KIIFIWNLLGSVSSAAISIFLLLLVTRLLTELEADIFSFAYAVANLFVIIASFQVRDYQA 68
Query: 69 TDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLY 128
TD+++K+SFSQYL RL+T +ML + + Y+ L++ + KS +FL+C YR +DA SD++
Sbjct: 69 TDVSKKFSFSQYLATRLITITIMLLLALSYIFLSQYEFQKSACIFLICLYRGSDALSDVF 128
Query: 129 QGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYDIGH 188
QG+FQQ+ RLDIAGKSL RN+++ + + + + NL L+L+ + I S +F+ ++D+ +
Sbjct: 129 QGLFQQNARLDIAGKSLFLRNSIVILTFGFGLFITNNLLLSLIYLVISSYLFVFFFDVTN 188
Query: 189 SKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELMTTLGEVAL 248
+F +++ E I+ + +L E PLF+N FL++ IY QPKYA+ G +
Sbjct: 189 LFQFTRIIKEE----INLKAIKNILLECLPLFINAFLLVTIYNQPKYALNTFFERGVIGT 244
Query: 249 GSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAYLGVFSLIA 308
G Q FNILFMP F MN+L++ FRP ITQ+AI G +F + Q ++ + +++
Sbjct: 245 GVQRDFNILFMPVFSMNILLILFRPMITQLAIYRRAGDYNQFKQYQKRIVKMVVGLAVLV 304
Query: 309 LVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRKQQLLLIPY 368
LVG + GIP L+ILYGTNL YW+ F++ MLGG +FAT+ DN+LT +RKQ+ L+I +
Sbjct: 305 LVGGIVLGIPALNILYGTNLNKYWLSFIITMLGGIASTFATICDNMLTVLRKQKYLVISF 364
Query: 369 TGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYL 411
L+S+LI+N V Y ILGAA++F+ +M W +S++IYL
Sbjct: 365 AISCLLSILISNPLVEYYGILGAAIAFVSSMWTWFLISLVIYL 407
>ref|YP_001035463.1| Polysaccharide/teichoic acid transporter, putative [Streptococcus
sanguinis SK36]
gb|ABN44913.1| Polysaccharide/teichoic acid transporter, putative [Streptococcus
sanguinis SK36]
Length = 424
Score = 320 bits (819), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 177/402 (44%), Positives = 266/402 (66%), Gaps = 4/402 (0%)
Query: 9 QTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQA 68
Q IF WN+LGSLS+A +SVILL +VTR L+S +D+Y+F+Y+ AN++V+V FQVR++QA
Sbjct: 14 QKIFFWNILGSLSSAAVSVILLFIVTRTLSSESADLYSFSYAIANLLVIVAGFQVRDFQA 73
Query: 69 TDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLY 128
TDI EKYSF YL R +T +LM+ I + YL L + I+F V F+R ++A SD++
Sbjct: 74 TDIKEKYSFDAYLTTRFLTNILMILILLGYLILNSSTHENFWIIFWVSFFRVSEALSDVF 133
Query: 129 QGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYDIGH 188
QG+FQQ ERLDIAG+SL +RN + + + ++L+SKNL L++V + S + ++++D
Sbjct: 134 QGLFQQKERLDIAGQSLFFRNMISTITFAVLLLFSKNLLLSIVFQTLTSFIVVLFFDFPK 193
Query: 189 SKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELMTTLGEVAL 248
SK F ++ +S + ++ +LK+ PLF+N FL++ IY QPKYA+ + G +
Sbjct: 194 SKLFHRIN----ISTVKLKDIYSILKDCLPLFINAFLLVSIYNQPKYALNDIFNRGLIEA 249
Query: 249 GSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAYLGVFSLIA 308
G Q F+ILF P F MNL+I+F RP +TQ+AI +I F + LF L S++
Sbjct: 250 GVQRDFSILFTPIFAMNLMIVFLRPMVTQLAIFKEENKISHFITYKNNLFKILWGTSVLI 309
Query: 309 LVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRKQQLLLIPY 368
+G + IP L+I+YGT L Y + F++++LGG +F+TV DNILT RK L+I +
Sbjct: 310 CLGGTIVAIPILNIIYGTRLDQYQISFVILLLGGIASTFSTVCDNILTVFRKHHYLVISF 369
Query: 369 TGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIY 410
G+L+S+L V +Y I GA+LSFLI+M+ WL +S++IY
Sbjct: 370 LAGYLVSILTAEPLVSQYGIFGASLSFLISMIAWLSVSLIIY 411
>ref|YP_139899.1| polysaccharide biosynthesis protein, putative transporter
[Streptococcus thermophilus LMG 18311]
gb|AAV61084.1| polysaccharide biosynthesis protein, putative transporter
[Streptococcus thermophilus LMG 18311]
Length = 424
Score = 287 bits (734), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 162/426 (38%), Positives = 264/426 (61%), Gaps = 12/426 (2%)
Query: 5 SKQNQ-----TIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVG 59
SK+NQ TI+ WN+LG+L+ + +SV+ L++VTRL ++ +D ++ +S + VV+G
Sbjct: 2 SKKNQLPDAKTIYFWNLLGNLAASGVSVLYLLIVTRLTATSVADQFSLVWSIGTLWVVIG 61
Query: 60 LFQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIV---FLVC 116
LFQVRNY TD+ +K+SF Y AR++T L M+ + YL + + Y S+++ FL+
Sbjct: 62 LFQVRNYHGTDVRQKHSFRAYFQARILTILAMIVTLLPYLKIIGGNRYPSSVILMAFLMI 121
Query: 117 FYRSTDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIV 176
YR+ DA SDL+QG+FQQ ER+DIAGK++ YR + +V + SK+L +L+A+ I
Sbjct: 122 LYRAWDAVSDLFQGLFQQRERMDIAGKTMFYRYSTSAVVLFLSLFVSKSLITSLLALTIW 181
Query: 177 SLVFIMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYA 236
+ +FI+ Y+ F+ + + + SL +LKE FPLFLNGF+++Y+ +PK
Sbjct: 182 NGLFILLYEFRFVHHFESINWRGVFDLRKIYESLDILKECFPLFLNGFILLYVLNEPKLI 241
Query: 237 IELMTTLGEVALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQ 296
IE + + G Q FNILFMP F M+L+IL RP ITQ+A + + + + I +
Sbjct: 242 IERGLSEEVLQTGMQRDFNILFMPVFFMSLIILMVRPLITQLAFLYVDKEYDKLDSIIKK 301
Query: 297 LFAYLGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILT 356
L Y+ L+ + + L G+ L +++G +L Y + F +++L G + + A + +NILT
Sbjct: 302 LLLYIIGGGLLVVCLAYLLGVQVLGLVFGLDLASYQLPFTILILAGVLYAVAIIFENILT 361
Query: 357 AMRKQQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWL-GLSIMIYLFIMN 415
MRKQ LL+ Y +++LLIT +FV + +LGA+L+FL+ M+V++ G+SI ++
Sbjct: 362 IMRKQHLLIAIYVAMLVVTLLITKMFVYSWGMLGASLAFLVVMIVYVFGISI---IYFRE 418
Query: 416 RFKKGR 421
R K+ R
Sbjct: 419 RIKERR 424
>ref|YP_001450304.1| hypothetical protein SGO_1015 [Streptococcus gordonii str. Challis
substr. CH1]
gb|ABV10853.1| membrane protein, putative [Streptococcus gordonii str. Challis
substr. CH1]
Length = 418
Score = 268 bits (684), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 157/402 (39%), Positives = 247/402 (61%), Gaps = 13/402 (3%)
Query: 9 QTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQA 68
+ I+ WN+LG+L A S I +M+V+RL ++ +D+++ AY A ++VV+GLFQVR YQ
Sbjct: 4 KEIYFWNLLGNLMFAGSSAIFMMIVSRLSSAKMADVFSLAYGIAGILVVLGLFQVRTYQG 63
Query: 69 TDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTK---TDSYKSTIVFLVCFYRSTDAFS 125
TD+ K+SF+ Y++AR+ + LML YL L +D+ K +V L +R +A S
Sbjct: 64 TDVTFKHSFTSYMIARVFSITLMLITLFPYLFLVHFDFSDTSKLAVVVLYVLFRMCEAIS 123
Query: 126 DLYQGMFQQHERLDIAGKSLAYR-NTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYY 184
DL+QG+FQQHERLDIAGKS+ R IF++ ++I+ K+L ++L+ + + + VF+ Y
Sbjct: 124 DLFQGLFQQHERLDIAGKSMTIRYGASIFILLISLIVL-KSLEVSLLILFLFNFVFVWIY 182
Query: 185 DIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELMTTLG 244
D S F K+ F + + F+++ +LK PLF++GFL+ YI+ +PK AI+ LG
Sbjct: 183 DFPKSLSFDKVSFDKSTIRLQFKDAFLILKGCIPLFISGFLLAYIFNEPKIAIDKAIQLG 242
Query: 245 EVALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAYL--- 301
++A G Q +NILFMP F M+L IL RP T +AI R + +F++ Q+ YL
Sbjct: 243 KLAEGLQRNYNILFMPVFFMSLFILILRPLTTSLAIQWQRKEFAKFDRTVKQIGIYLLGG 302
Query: 302 -GVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK 360
+ +++A L G P LSI++G +L + +++ G + S V+ +ILT R
Sbjct: 303 GAILTMLAF----LIGTPVLSIVFGVDLAGDALTLTILVFSGILYSVGIVLGDILTIFRM 358
Query: 361 QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVW 402
Q+ L++ Y F++S+ ITN FVM +LGAA SFL ML++
Sbjct: 359 QRKLIVVYLLMFVVSIAITNPFVMSKGLLGAAYSFLFVMLIY 400
>ref|YP_001200855.1| polysaccharide/teichoic acid transporter, putative [Streptococcus
suis 98HAH33]
gb|ABP92456.1| polysaccharide/teichoic acid transporter, putative [Streptococcus
suis 98HAH33]
Length = 324
Score = 245 bits (625), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 132/300 (44%), Positives = 206/300 (68%), Gaps = 7/300 (2%)
Query: 9 QTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQA 68
+ IF+WN+LGS+S+A IS+ LL++VTRLLT ++DI++FAY+ AN+ V++ FQVR+YQA
Sbjct: 9 KIIFIWNLLGSVSSAAISIFLLLLVTRLLTELEADIFSFAYAVANLFVIIASFQVRDYQA 68
Query: 69 TDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLY 128
TD+++K+SFSQYL RL+T +ML + + Y+ L++ + KS +FL+C YR +DA SD++
Sbjct: 69 TDVSKKFSFSQYLATRLITITIMLLLALSYIFLSQYEFQKSACIFLICLYRGSDALSDVF 128
Query: 129 QGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYDIGH 188
QG+FQQ+ RLDIAGKSL RN+++ + + + + NL L+L+ + I S +F+ ++D+ +
Sbjct: 129 QGLFQQNARLDIAGKSLFLRNSIVILTFGFGLFITNNLLLSLIYLVISSYLFVFFFDVTN 188
Query: 189 SKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELMTTLGEVAL 248
+F +++ E I+ + +L E PLF+N FL++ IY QPKYA+ G +
Sbjct: 189 LFQFTRIIKEE----INLKAIKNILLECLPLFINAFLLVTIYNQPKYALNTFFERGVIGT 244
Query: 249 GSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRG---QIKEFNKIQVQLFAYLGVFS 305
G Q FNILFMP F MN+L++ FRP ITQ+AI G Q K++ K V++ + +G S
Sbjct: 245 GVQRDFNILFMPVFSMNILLILFRPMITQLAIYRRAGDYNQFKQYQKRIVKMVSRIGCVS 304
>gb|AAN64553.1| hypothetical protein [Streptococcus gordonii]
Length = 383
Score = 238 bits (607), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 149/386 (38%), Positives = 231/386 (59%), Gaps = 9/386 (2%)
Query: 42 SDIYAFAYSFANMMVVVGLFQVRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTL 101
+D+++ AY A ++VV+GLFQVR YQ TD+ K+SF+ Y +AR+ + LML YL L
Sbjct: 2 ADVFSLAYGIAGILVVLGLFQVRTYQGTDVTFKHSFTSYTIARVFSITLMLITLFPYLFL 61
Query: 102 TK---TDSYKSTIVFLVCFYRSTDAFSDLYQGMFQQHERLDIAGKSLAYR-NTLIFMVYT 157
+D+ K +V L +R +A SDL+QG+FQQHERLDIAGKS+ R IF++
Sbjct: 62 VHFDFSDTSKLAVVVLYVLFRMCEAISDLFQGLFQQHERLDIAGKSMTIRYGASIFILLI 121
Query: 158 AIILYSKNLTLALVAVCIVSLVFIMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESF 217
+++ K+L ++L+ + + + VF+ YD S F K+ F + + +++ +LK
Sbjct: 122 PLVVL-KSLEVSLLILFLFNFVFVWIYDFPKSLSFDKVSFDKSTIRLQVKDAFLILKGCI 180
Query: 218 PLFLNGFLIIYIYTQPKYAIELMTTLGEVALGSQTIFNILFMPAFVMNLLILFFRPHITQ 277
PLF++GFL+ YI+ +PK AI+ LG++A G Q +NILFMP F M+L IL RP T
Sbjct: 181 PLFISGFLLAYIFNEPKIAIDKAIQLGKLAEGLQRNYNILFMPVFFMSLFILILRPLTTS 240
Query: 278 MAIALIRGQIKEFNKIQVQLFAYLGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFML 337
+AI R + +F++ Q+ YL I + + L G P LSI++G +L + +
Sbjct: 241 LAIQWQRKEYAKFDRTVKQIGIYLLGGGAILTMLAFLIGTPVLSIVFGVDLAGDALTLTI 300
Query: 338 IMLGGSIGSFATVIDNILTAMRKQQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLI 397
++ G + S V+ +ILT R Q+ L++ Y F++S+ ITN FVM +LGA+ SFL
Sbjct: 301 LVFSGILYSVGIVLGDILTIFRMQRKLILVYLLMFIVSIAITNPFVMSKGLLGASYSFLF 360
Query: 398 TMLVWLGLSIMIYL-FIMNRFKKGRV 422
ML++ SI YL +I R KG++
Sbjct: 361 VMLIY---SIGSYLTYIKVRKWKGKL 383
>ref|ZP_02432184.1| hypothetical protein CLOSCI_02429 [Clostridium scindens ATCC 35704]
gb|EDS06480.1| hypothetical protein CLOSCI_02429 [Clostridium scindens ATCC 35704]
Length = 417
Score = 222 bits (565), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 137/391 (35%), Positives = 229/391 (58%), Gaps = 13/391 (3%)
Query: 13 LWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQATDIN 72
LWNM S+ + S + L VVT + + + I++ A + +V +G + +RNYQATD+
Sbjct: 19 LWNMFSSILGSAQSAVFLFVVTHVCDTVYAGIFSIATTLGYQIVTIGNYGMRNYQATDVR 78
Query: 73 EKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLYQGMF 132
+KY F +YL++R +T L ML + Y+ + + + K+ I+F+ +++ D D++ G F
Sbjct: 79 QKYKFREYLLSRYITSLFMLLFLISYIVIKEYNIEKAMIIFVFGIFKAIDVIEDVFHGEF 138
Query: 133 QQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIMY--YDIGHSK 190
Q++ RLDI ++ R + F V+ I++ ++NL +A + + S+ +Y Y++ K
Sbjct: 139 QRYNRLDIGAICMSIRYVISFAVFAIILVITENLLVACIWETLTSICVFIYLTYEVVKPK 198
Query: 191 KFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELMTTLGEVALGS 250
K K+ + N Q L LL E FPLFL GFL +YI PKYAI+ ++ S
Sbjct: 199 KVLKIN----IKNALIQAKL-LLNECFPLFLGGFLYLYICNAPKYAID-----NCLSQES 248
Query: 251 QTIFNILFMPAFVMNLLILF-FRPHITQMAIALIRGQIKEFNKIQVQLFAYLGVFSLIAL 309
Q F ILFMP F++NLL F +RP +T+MAI + + KEF I + ++ + +++ +
Sbjct: 249 QAYFAILFMPVFIVNLLSGFIYRPLLTRMAICWVEKRNKEFVHIISRQVVFIILATVLGI 308
Query: 310 VGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRKQQLLLIPYT 369
VG+ G+ LS +YG N+ Y +++MLGG + N+LT MR Q+++LI Y
Sbjct: 309 VGAYFIGVYLLSWIYGVNINVYRDALVILMLGGGFAAITGYFMNVLTIMRLQKVMLIGYC 368
Query: 370 GGFLISLLITNLFVMKYHILGAALSFLITML 400
+IS++I+NLFV+ + ILGA++ ++I ML
Sbjct: 369 IVAVISIIISNLFVVNWGILGASMLYVILML 399
>ref|ZP_02442856.1| hypothetical protein ANACOL_02154 [Anaerotruncus colihominis DSM
17241]
gb|EDS10973.1| hypothetical protein ANACOL_02154 [Anaerotruncus colihominis DSM
17241]
Length = 406
Score = 162 bits (411), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 119/392 (30%), Positives = 207/392 (52%), Gaps = 12/392 (3%)
Query: 8 NQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQ 67
N+ +WN LGSL S I+L +V+R+ T + + A++ A ++ +VGLF V ++Q
Sbjct: 2 NKKGVIWNSLGSLMYGANSFIMLALVSRVGTVEQAGYFGIAFTTAQILYIVGLFGVPHFQ 61
Query: 68 ATDINEKYSFSQYLVARLMTCLLMLAITVIYL-TLTKTDSYKSTIVFLVCFYRSTDAFSD 126
TD EKY FS Y+ AR +CLLM I + T + S VFL + +
Sbjct: 62 MTDYGEKYRFSDYIHARRFSCLLMACGCAIAIWGFHFTGAKASYTVFLTALML-LNVIGE 120
Query: 127 LYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYDI 186
LYQ +F Q RLD++G +L YR +++ I+ ++++ +AL + +L +YY +
Sbjct: 121 LYQSLFFQKNRLDLSGSALFYRTLWPLLLFCIILWVTRSIIVALSVQILANLFLTIYYAV 180
Query: 187 GHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELMTTLGEV 246
+ +F + S QN LL E FPLF++ L+ + KY IELM + ++
Sbjct: 181 WVAPRFISAQPCDRASG-QVQN---LLMECFPLFVSLLLMNIVINASKYGIELM--MNDL 234
Query: 247 ALGSQTIFNILFMPAFVMNLLILF-FRPHITQMAIALIRGQIKEFNKIQVQLFAYLGVFS 305
A Q +N++FMPA V+NL F F+P + + + L QI+ F + ++ + + +
Sbjct: 235 A---QGYYNMIFMPAQVINLCSQFLFKPLLNRYSKLLSERQIRTFGILLLRQIVLIALLT 291
Query: 306 LIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRKQQLLL 365
+ G+ GIP LS+LY +++ + +L++LGG I + + I +R+Q+ ++
Sbjct: 292 CVCCAGAYAMGIPVLSLLYQKDISALRIHLILVVLGGGIFAVCQLYYYIFVILRRQKWIM 351
Query: 366 IPYTGGFLISLLITNLFVMKYHILGAALSFLI 397
Y ++++ T + V ++GA LSF+I
Sbjct: 352 KIYLCITVVAVPTTAVCVHSAGLMGAVLSFVI 383
>ref|ZP_02080928.1| hypothetical protein CLOLEP_02391 [Clostridium leptum DSM 753]
gb|EDO60789.1| hypothetical protein CLOLEP_02391 [Clostridium leptum DSM 753]
Length = 397
Score = 153 bits (387), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 131/394 (33%), Positives = 221/394 (56%), Gaps = 29/394 (7%)
Query: 16 MLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQATDINEKY 75
ML S T V++LMV++R+ D+ I+ AY+ N+M+ +G + +R +QA+D+ EKY
Sbjct: 1 MLNSFQT----VVILMVISRIDPVNDAGIFVIAYAIGNLMLTIGRYGIRQFQASDVVEKY 56
Query: 76 SFSQYLVARLMTCLLMLAITVIYLTLT----KTDSYKSTIVFLVCFYRSTDAFSDLYQGM 131
S+ +Y +R++T LMLAI++ Y+ + D KS +V L+C + DAF D++ GM
Sbjct: 57 SYREYYYSRILTTGLMLAISLAYIGFEYGTGQYDGNKSIVVLLICLVKWIDAFEDVFHGM 116
Query: 132 FQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYDIGHSKK 191
QQH+RLDI GK L R L ++Y + +K+L L + +VS + + G + K
Sbjct: 117 LQQHDRLDIGGKILTVRLFLYTVLYMVLYAVTKDLILTSLISLLVSFALFVLLN-GMALK 175
Query: 192 FQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELMTTLGEVALGSQ 251
+ EL SF+ +L E FPLF + FLI+YI PKYAI+ + + + Q
Sbjct: 176 SFPVEKGEL----SFRKIGAMLWECFPLFGSTFLIMYIGNAPKYAIDSVLSNQD-----Q 226
Query: 252 TIFNILFMPAFVMNLLILF-FRPHITQMAIALIRGQIKEFNKIQVQ-LFAYLGVFSLIAL 309
FN +FMP FV++LL F ++P + ++A+ + + F K+ + + A LG+ + +AL
Sbjct: 227 ASFNYVFMPVFVISLLSNFIYQPVLNKLAVIWNQRETSRFWKLIAKYIAAILGLTAAVAL 286
Query: 310 VGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRKQQLLLIPYT 369
G GIP L+++YG +L Y + ++++GG + + I+T +R Q+ L+
Sbjct: 287 -GGYFLGIPVLNLIYGVHLEGYLLHLEILLVGGGLLALINFFTMIITIVRFQKYLI---- 341
Query: 370 GGFLIS----LLITNLFVMKYHILGAALSFLITM 399
GG++I LL + V Y +LG + + ++M
Sbjct: 342 GGYIIVSLAFLLFGSKCVQAYGVLGISCFYTLSM 375
>ref|ZP_02045178.1| hypothetical protein ACTODO_02068 [Actinomyces odontolyticus ATCC
17982]
gb|EDN81590.1| hypothetical protein ACTODO_02068 [Actinomyces odontolyticus ATCC
17982]
Length = 451
Score = 152 bits (385), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 113/398 (28%), Positives = 205/398 (51%), Gaps = 22/398 (5%)
Query: 12 FLWNMLGSLSTAVISVILLMVVTRL-----LTSADSDIYAFAYSFANMMVVVGLFQVRNY 66
+LWN SL +++ VI+ + + R A ++ A + VGL++VR +
Sbjct: 39 YLWNTAASLMSSLAVVIMGVAIMRSGATDSFARAQYGLFTLALAIGQQYQTVGLYEVRTF 98
Query: 67 QATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKT-DSYKS-TIVFLVCFYRSTDAF 124
TD+ ++ F YL RL+TCL+M+++ +++ + T D Y + T++ + R DAF
Sbjct: 99 HVTDVRRRFDFGTYLSTRLLTCLVMVSLILLHSWNSSTKDPYPAFTVIAAMALLRIFDAF 158
Query: 125 SDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSK--NLTLALVAVCIVSLVFIM 182
D+Y FQ+ RLDIAGK+ R ++T L+S T L+A I++ V
Sbjct: 159 EDVYYSEFQRSGRLDIAGKACFAR------IFTTTFLWSGLYWFTQDLLASTIITFVVTC 212
Query: 183 YYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELMTT 242
+ + +FS LL + + + +L E PLF+ FL Y+ P++AI +
Sbjct: 213 VVLVVAYGLPARGVFS-LLPSFNLRGITGILWECLPLFIAAFLNQYLANAPRFAIH--AS 269
Query: 243 LGEVALGSQTIFNILFMPAFVMNLLILF-FRPHITQMAIALIRGQIKEFNKIQVQLFAYL 301
LG+ LG +F I++MPA +N+L LF FRP +T+MA+ G+ EF I +
Sbjct: 270 LGDEELG---VFAIIYMPAVAINMLSLFVFRPLLTRMALRWAEGKRVEFLSIVRKGLVTT 326
Query: 302 GVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRKQ 361
++ + G P L++++GT+++ Y + M+++L G++ + ++ L MR+
Sbjct: 327 AAAFVVVAAVTYFIGAPLLTLVFGTDVSGYVPELMVLVLAGALNAAGVILYYALATMRRL 386
Query: 362 QLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITM 399
+ +L+ Y + LI + + ++GA+L++ TM
Sbjct: 387 RAVLVAYAAAGATAYLIAPMLTKSHAMMGASLAYAATM 424
>ref|ZP_01772528.1| Hypothetical protein COLAER_01534 [Collinsella aerofaciens ATCC
25986]
gb|EBA39243.1| Hypothetical protein COLAER_01534 [Collinsella aerofaciens ATCC
25986]
Length = 440
Score = 126 bits (317), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 109/415 (26%), Positives = 205/415 (49%), Gaps = 18/415 (4%)
Query: 12 FLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQATDI 71
+LWN LG+ + +L +V T+L+ + ++ ++ A+ ++++ + VRN+Q +DI
Sbjct: 34 YLWNTLGTAVWGMAFPLLTIVSTQLVGAEEAGKFSIAFVTGTLIMIACNYGVRNFQVSDI 93
Query: 72 NEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLYQGM 131
+EK SF+ Y + R + L LA ++Y + D+ +TI V Y+ D +D+Y+G
Sbjct: 94 DEKTSFASYQLNRWLCGALALACGLVYSSARGYDAQMATIGLGVYLYKVIDGIADVYEGR 153
Query: 132 FQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLT---LALVAVCIVSLVFIMYYDIGH 188
QQ ++L +AG S R+ + +V++ + ++++ +A+ I SLV +
Sbjct: 154 LQQADKLYLAGMSQTLRSAGVIVVFSVALFLTRSMPIAAMAMGIAAIASLVLV------- 206
Query: 189 SKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELMTTLGEVAL 248
L+ +E +S + L + PLF FL I + PK+ +E G +A
Sbjct: 207 -TAPLALLETEKSRRVSLREVGHLFIQCAPLFGALFLFNLIESMPKFVME-----GTLAY 260
Query: 249 GSQTIFNILFMPAFVMNLLILF-FRPHITQM-AIALIRGQIKEFNKIQVQLFAYLGVFSL 306
Q FN LF PA + L I F ++P + ++ +I + + F+ I V + A + V +
Sbjct: 261 KYQLYFNALFFPAQAILLGIGFIYKPQLLRLSSIWANPRKRRRFDLIIVAVMALIVVITG 320
Query: 307 IALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRKQQLLLI 366
GIP +S +YG N Y +L+++ G + + I I+T +R +
Sbjct: 321 ACAAFMAGPGIPIMSFMYGLNFERYRTLALLMVVAGGVTAAIDFIYAIITVLRHAGDVTK 380
Query: 367 PYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKGR 421
Y F +S+++ + + ++GA +S+L M + L L I+ Y I R ++ R
Sbjct: 381 IYLICFAVSVVVPVILIKLLDLMGAVVSYLAIMALLLVLLIIEYAHIRQRIERDR 435
>ref|ZP_02087505.1| hypothetical protein CLOBOL_05049 [Clostridium bolteae ATCC
BAA-613]
gb|EDP14507.1| hypothetical protein CLOBOL_05049 [Clostridium bolteae ATCC
BAA-613]
Length = 440
Score = 124 bits (311), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 106/411 (25%), Positives = 199/411 (48%), Gaps = 30/411 (7%)
Query: 5 SKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYS-FANMMVVVGLFQV 63
+QN +WN+ GS A+ S++L +V R++ I++F +S M +V F +
Sbjct: 16 GRQN---VVWNIAGSFVYALASMVLSFLVIRVVGDGQGGIFSFGFSTLGQQMFIVAYFGI 72
Query: 64 RNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLT----LTKTDSYKSTIVFLVCFYR 119
R +Q TD +YSF YL R +TC++ LA ++LT + + + K I+ L+ Y+
Sbjct: 73 RPFQITDGTGEYSFGDYLEHRNITCIMALAAGAVFLTFMHGVGRYPADKCMILILLVIYK 132
Query: 120 STDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLV 179
D ++D+Y+ FQ+ L + GKS +R V+ + ++L + +A
Sbjct: 133 VIDGYADVYESEFQRQGSLYLTGKSNFFRTLFSVSVFLVTLAAFEHLLFSCLAAVAAQAA 192
Query: 180 FIMYY--DIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAI 237
I + D+ H+ + N + + +L K + LF++ FL Y+++ KYAI
Sbjct: 193 GIALFNLDVIHA-------LPSVDWNKGERKTGRLFKSTLFLFISAFLDFYVFSAAKYAI 245
Query: 238 ELMTTLGEVALGSQTIFNILFMPAFVMNLLILF-FRPHITQMAIALIRGQIKEFNKIQVQ 296
+ + + A G FN++FMP V+ ++ F RP +T++ F K ++
Sbjct: 246 D--ARMNDAASG---YFNLIFMPTSVIYMVANFVIRPFLTRLTDLWTGRDYDCFKKELMR 300
Query: 297 LFAYLGVFSLIALVGSGLFGIPFLSIL-------YGTNLTDYWVDFMLIMLGGSIGSFAT 349
+ A + +++A+ + + G LS++ Y L Y+ F++I+LGG + A
Sbjct: 301 IGAIILGLTVLAVGATAVLGKWVLSVMEMILGSGYEGRLVSYYGAFIIIVLGGGFYALAN 360
Query: 350 VIDNILTAMRKQQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITML 400
++ L MR+Q+ + Y + + + V K+ I GAA +L+ M+
Sbjct: 361 LMYYALVIMRRQKAIFTVYLAAAAAAAVSSGFLVSKFGINGAAGCYLLLMI 411
>ref|ZP_02544449.1| Transcriptional activator amrA [candidate division TM7 single-cell
isolate TM7c]
Length = 305
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 70/289 (24%), Positives = 155/289 (53%), Gaps = 7/289 (2%)
Query: 6 KQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRN 65
+ + ++WN LGSL + IS +LL+V+TRL DS +++FA S + + + L+ R
Sbjct: 2 QSQKKDYIWNSLGSLLQSAISPVLLIVITRLNGIEDSGLFSFALSLSVVFWAISLWGGRT 61
Query: 66 YQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFS 125
YQA+D+ ++ Y+ R + L++ V++ L ++ K+ ++ ++ ++ ++ +
Sbjct: 62 YQASDVKREFCSGGYVAVRFVASLIVAVSAVVFCVLNGYNATKTGLIMILVAFKILESIA 121
Query: 126 DLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYD 185
D G+ Q H +L IAG SL + L F + + + +KN+ +A+ +V+ + I+ YD
Sbjct: 122 DSLYGVLQIHRKLYIAGISLTMKAMLGFTTFIVVDIVTKNVMYGTLAILLVNALIILLYD 181
Query: 186 IGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELMTTLGE 245
+ ++ + + ++ ++ ++K + +F+ FL ++ P+Y ++ + +
Sbjct: 182 VLWVRRVEAIAMNKKFIKEYAGQAVVIMKRTSAVFVVMFLTMFSLNIPRYFLD-KSHPDQ 240
Query: 246 VALGSQTIFNILFMPAFVMNLLILF-FRPHITQMAIALIRGQIKEFNKI 293
+ F I+ MP ++ L I F +P++ ++ L +G++KEF +I
Sbjct: 241 IGY-----FGIMAMPITLLGLFISFIIQPNVVNLSELLAKGKLKEFARI 284
>ref|ZP_02912393.1| polysaccharide biosynthesis protein [Geobacillus sp. WCH70]
gb|EDT36498.1| polysaccharide biosynthesis protein [Geobacillus sp. WCH70]
Length = 411
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 96/402 (23%), Positives = 195/402 (48%), Gaps = 30/402 (7%)
Query: 6 KQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRN 65
K+N F WN G+L A +L ++ +L ++ + +++ Q+ +
Sbjct: 11 KKN---FYWNFTGNLIYAFAQWAILSLLAKLGNPQMVGQFSLGLAITAPIILFTNLQLNS 67
Query: 66 YQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFS 125
Q TD KY F +YL R++T + + IT+ + L + S ++ LV F + +++S
Sbjct: 68 IQVTDTQHKYKFGEYLGLRIVTNFIAILITIFVILLGNYNPLTSLVIILVAFSKVIESWS 127
Query: 126 DLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYD 185
D+ G QQ ER+D+ S + L+ ++ + ++ + N+ ++ +C+ ++ +YD
Sbjct: 128 DVVFGYLQQRERMDLTAISRILKAVLMLLLISILLFITHNVIWMVIGLCLSYMLVFFFYD 187
Query: 186 IGHSKKF--QKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIY--IYTQ-PKYAIELM 240
+ KKF KL+F +++N + ++K + PL G +++ +YT P+ IE
Sbjct: 188 LKVLKKFITPKLIF-------NYKNYVDIVKLALPL---GIVLMLGSLYTNIPRIIIE-- 235
Query: 241 TTLGEVALG--SQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLF 298
LGE LG + + I+ F+ + + ++AI +F K+ ++L
Sbjct: 236 KYLGEEQLGYFAAIAYLIVAGNTFIGAI----GQAAAPRLAILFSEKNFIQFKKLLIKLV 291
Query: 299 AYLGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAM 358
+ + + ++ + LFG LSILY + Y V F L+++ G+ +T + LTA
Sbjct: 292 SIGFITGIFGVIITLLFGELILSILYKPSYAKYNVLFTLLLISGTFSFSSTFLGIGLTAT 351
Query: 359 RKQQLLLIPYTGG--FLISLLITNLFVMKYHILGAALSFLIT 398
+ ++ PY G ++S + + + + K ++GA S +I+
Sbjct: 352 KTFKIQ--PYLGAIWIVVSFISSIILIPKIGLIGAGYSVIIS 391
>ref|YP_826100.1| polysaccharide biosynthesis protein [Solibacter usitatus Ellin6076]
gb|ABJ85815.1| polysaccharide biosynthesis protein [Solibacter usitatus Ellin6076]
Length = 446
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/360 (21%), Positives = 167/360 (46%), Gaps = 12/360 (3%)
Query: 3 NPSKQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQ 62
P ++ F W + G++ A +++ + +L +S ++ + A+ +++
Sbjct: 19 TPGLSLRSNFAWVLTGNVVYAACQWGMIVALAKLGSSFMIGQFSLGVAIASPVLMFTNLH 78
Query: 63 VRNYQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTD 122
++ QATD SF++YL R + L LA+ + ++ V + +
Sbjct: 79 LKAVQATDALRMTSFAEYLRLRGVMTLCGLAVIAGIACFGNYEPQTRLVILTVALAKGIE 138
Query: 123 AFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIM 182
SD++ G+FQ ++RLD G+S+ R L + + ++++ V + +V L ++
Sbjct: 139 TLSDIHYGLFQLNDRLDQVGRSMMLRGALSVAALSTGLYVTRSVLGGCVGLALVWLGALL 198
Query: 183 YYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELMTT 242
++D+ ++F + SE + S L+ + PL ++ + P+Y I
Sbjct: 199 FFDVPRGRRFA--VASE--GERGLRRSWSLMWMALPLGISTTMAALNLNMPRYFIH--AR 252
Query: 243 LGEVALGSQTIFNILFMPAFVMNLLILFFRPH--ITQMAIALIRGQIKEFNKIQVQLFAY 300
LGE LG I++ + M +L+ H I +M+ G++ EF + ++L A
Sbjct: 253 LGERQLG---IYSAMAYATVAM-ILVADSLGHCAIPRMSRLYTAGRLTEFRSLLLKLLAA 308
Query: 301 LGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK 360
G L LV + + G+ L++LYG ++ F++++L +I A++ + +T+ R+
Sbjct: 309 GGTLGLAGLVVAQVMGVRLLTLLYGHEYAAHYRVFLVLILATAIYCVASMFTSAITSARR 368
>ref|ZP_00110716.1| COG2244: Membrane protein involved in the export of O-antigen and
teichoic acid [Nostoc punctiforme PCC 73102]
ref|YP_001865946.1| polysaccharide biosynthesis protein [Nostoc punctiforme PCC 73102]
gb|ACC81003.1| polysaccharide biosynthesis protein [Nostoc punctiforme PCC 73102]
Length = 412
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 64/241 (26%), Positives = 113/241 (46%), Gaps = 10/241 (4%)
Query: 12 FLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQATDI 71
F W G+L A +L+++ +L + + + +++ Q+R QATD
Sbjct: 13 FSWTFAGNLVYAACQWGMLVILAKLGSPEMVGQFTLGLAITAPIIMFTNLQLRIVQATDA 72
Query: 72 NEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLYQGM 131
++YSFS YL RL++ L LAI + L S ++FL+ ++ ++ SD++ G+
Sbjct: 73 RKQYSFSDYLGLRLISTALALAIVTVISLLGGFRWETSLVIFLMGLAKAFESISDIFHGL 132
Query: 132 FQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYDIGHSKK 191
QQ+ER+D SL + L ++ + S ++ +V + V ++ +DI
Sbjct: 133 IQQYERMDRIATSLMIKGPLSLLLLGIGVYMSGHILWGVVGLVFAWAVVLVAWDI--RSG 190
Query: 192 FQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQ---PKYAIELMTTLGEVAL 248
L S+L + +KL+ PL GF+++ I P+Y IE LGE L
Sbjct: 191 ILILYRSQLQPRWHRKTLVKLVWLCLPL---GFVMMLISLNTNIPRYFIE--RYLGEREL 245
Query: 249 G 249
G
Sbjct: 246 G 246
>ref|YP_589916.1| polysaccharide biosynthesis protein [Acidobacteria bacterium
Ellin345]
gb|ABF39842.1| polysaccharide biosynthesis protein [Acidobacteria bacterium
Ellin345]
Length = 460
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 58/241 (24%), Positives = 115/241 (47%), Gaps = 27/241 (11%)
Query: 13 LWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFA-------YSFANMMVVVGLFQVRN 65
+W + G+ A +L+ + +L +AF Y F NM Q+R+
Sbjct: 44 IWTLCGNFIYAFSQWAMLVCIAKLGDPTMVGQFAFGLAVSAPIYMFTNM-------QLRS 96
Query: 66 YQATDINEKYSFSQYLVARLMTCLL-MLAITVIYLTLTKTDSYKST--IVFLVCFYRSTD 122
QATD +Y FS+Y R++ + +LA+ V+ ++ S ++T +VF V + +
Sbjct: 97 VQATDAKSEYRFSEYFGLRMLASVAGLLAVCVVS---ARSSSMRTTALVVFGVGLAKFME 153
Query: 123 AFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIM 182
+ SD+ G+ Q+HER+D S++ + ++ Y+ NL A++A+ + ++
Sbjct: 154 SVSDVIYGLCQKHERMDSIAISMSIKGLGSVAALVGVLRYTHNLVYAVLAMAGWWALLLL 213
Query: 183 YYDIGHSKKFQKLMFSELLSNI-SFQN----SLKLLKESFPLFLNGFLIIYIYTQPKYAI 237
+ D+ + KF ++ ++ + I SF+ SL +L + P+ + L P+Y I
Sbjct: 214 FVDLRWAHKFAQIDPADQGTIIPSFERKILFSLGVL--ALPMGIQTMLASLTTNIPRYVI 271
Query: 238 E 238
+
Sbjct: 272 Q 272
>ref|YP_001430646.1| polysaccharide biosynthesis protein [Roseiflexus castenholzii DSM
13941]
gb|ABU56628.1| polysaccharide biosynthesis protein [Roseiflexus castenholzii DSM
13941]
Length = 449
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 85/399 (21%), Positives = 170/399 (42%), Gaps = 19/399 (4%)
Query: 12 FLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQATDI 71
F W +G++ A +LM + +L + ++A + + ++ +R QATD
Sbjct: 19 FSWTFVGNVVYAACQWGMLMALAKLGSPEMVGVFALGLAITAPVFMLSNLHLRTIQATDA 78
Query: 72 NEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLYQGM 131
++Y F YL RL T L L + S +F V ++ ++ SD++ G+
Sbjct: 79 RQQYQFRDYLGVRLATTTLALLAIAGIVLAIGYPWQTSLTIFAVGCAKACESISDIFYGL 138
Query: 132 FQQHERLDIAGKSLAYRNTL-IFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYDIGH-- 188
Q+ ER+D + + + +F + T + L S ++ +V + + YDI
Sbjct: 139 LQRLERMDRIAWGMILKGAMSLFALATGVYL-SGSVLWGVVGMAGAWATVLAVYDIPQGL 197
Query: 189 -SKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQ-PKYAIELMTTLGEV 246
+ + ++ + L + PL + G +++ + T P+Y +E M LG
Sbjct: 198 SAVRGASIVHPSAAPRQRMAVARNLAWMALPLGV-GIMLVSLSTAIPRYFVERM--LGAE 254
Query: 247 ALGS-QTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAYLGVFS 305
+LG I +I V+ L P + Q G ++ F + L A
Sbjct: 255 SLGIFAAIASIQVAGTTVIGALAAAANPRLAQH---YADGNVRAFRALLRNLVAIAVALG 311
Query: 306 LIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK---QQ 362
I ++ + L G L+++Y Y F+L+M+ S+G + + +TA+R+ Q
Sbjct: 312 GIGVLIAWLIGGWILTMIYRPEYGAYNHIFVLVMIAASVGYIGWFVGDAMTAVRRLRAQA 371
Query: 363 LLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLV 401
L + T + ++ + + + GAAL+ ++T ++
Sbjct: 372 FLFLAMT---VATIGACAWLIPSFGLTGAALATMVTSVI 407
>ref|YP_001715530.1| putative polysaccharide biosynthesis protein [Acinetobacter
baumannii]
emb|CAM88573.1| putative polysaccharide biosynthesis protein [Acinetobacter
baumannii]
Length = 400
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 98/415 (23%), Positives = 187/415 (45%), Gaps = 45/415 (10%)
Query: 14 WNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQATDIN- 72
W + G+ A ++L+ + R+ T + Y+ A + + + Q+R D+N
Sbjct: 10 WLIGGNFVFAFSQWVILIFLARMTTQENLGQYSLALAIVTPVFAIFNLQLRPLYILDLNG 69
Query: 73 -EKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLYQGM 131
+KY +S + RL+T ++ L + + + + +V LV F + +A+SD+
Sbjct: 70 EQKYRYSNFYYLRLITSIVALFVCLFVCLFSNVSFF---VVVLVAFLKFFEAYSDIIYAY 126
Query: 132 FQQHERLDIAGKSLAYRNTLIFMVYTAII-LYSKNLTLALVAVCIVSLVFIMYYD---IG 187
+ H++ + KSL + +F V +I LY N +AL+ ++ L + D I
Sbjct: 127 YNAHDQTKLISKSLFIKG--VFSVGGMLIGLYFFNFYIALILFLLIYLFVWLGLDNSYIV 184
Query: 188 HSKKFQKLMFSELLSNISFQNSLKL----LKESFP-LFLNGFLIIYIYTQPKYAIELMTT 242
+ + +K+ + N++ + L L+ + P LFL ++
Sbjct: 185 RTNELKKIRIDYSIMNLAIPMGISLGIVTLQSNIPRLFLGHYI----------------- 227
Query: 243 LGEVALGSQTIFN-ILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAYL 301
G A+G T+ + + + + +N + + P +T+ EF KI
Sbjct: 228 -GVKAVGVFTVLSYFIIVGSIFINSICQYLSPRLTRAW----NSNKNEFRKILFLAITIA 282
Query: 302 GVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTA---M 358
G +IA++ S FG FL+++YG +Y +F LIM+ G I TV+ LTA +
Sbjct: 283 GSLGVIAIIVSYFFGEFFLNLIYGEIYKEYTYEFNLIMVAGFILYVCTVLGYTLTAIGFI 342
Query: 359 RKQQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSIMIYLFI 413
+KQ L ++ + SLLI+ L + +Y +LG + + + + LSI + LF+
Sbjct: 343 KKQAFL---FSIVLIFSLLISYLCIPEYGVLGGIYTLIGSYSIQCILSIFVILFL 394
>ref|YP_001277945.1| polysaccharide biosynthesis protein [Roseiflexus sp. RS-1]
gb|ABQ91995.1| polysaccharide biosynthesis protein [Roseiflexus sp. RS-1]
Length = 440
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 82/397 (20%), Positives = 171/397 (43%), Gaps = 15/397 (3%)
Query: 12 FLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQATDI 71
F W +G++ A +LM + +L + A ++A + + + +R QATD
Sbjct: 18 FSWTFVGNVVYAACQWGMLMALAKLGSPAMVGMFALGLAITAPVFMFANLHLRTIQATDA 77
Query: 72 NEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLYQGM 131
++Y F YL RL T L L + + L + ++ V ++ ++ SD++ G+
Sbjct: 78 RQQYQFRDYLSVRLATTALALLVVAAIVVLVGYAWETAAVILAVGCAKACESISDIFYGL 137
Query: 132 FQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYDIGHSKK 191
Q+ ER+D + + + + + + + +V + V V + +YD+
Sbjct: 138 LQRLERMDRIAAGMMLKGIVSLVALATGVSLTGSAFWGVVGLTAVWAVVLAFYDV--PNG 195
Query: 192 FQKLMFSELLSNISFQN-----SLKLLKESFPLFLNGFLIIYIYTQ-PKYAIELMTTLGE 245
Q L ++ + S + +L + PL + G +++ + T P+Y +E + LGE
Sbjct: 196 LQALRQTQPIRERSSPPQRVAVATRLAWMALPLGV-GIMLVSLSTAIPRYFVERI--LGE 252
Query: 246 VALGS-QTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFAYLGVF 304
+LG I I V+ L P + Q G+I F + +L A
Sbjct: 253 ESLGIFAAIAYIQVAGTTVVGALAAAANPRLAQH---YAFGEIHAFRGLLFKLAAIALAL 309
Query: 305 SLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRKQQLL 364
++ + L G L+++Y Y F+L+M+ IG + + +TA+R+ +
Sbjct: 310 GGAGVLIAWLIGGWVLTLIYRPEYGAYNHVFVLVMVAAGIGYVGWFVGDAMTAVRRLRAQ 369
Query: 365 LIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLV 401
+ + G ++ + + + GAA++ ++T ++
Sbjct: 370 ALLFLGMTAATIGACAWLIPLFGLTGAAVATMVTSVI 406
>ref|ZP_02579279.1| polysaccharide biosynthesis protein [Bacillus cereus B4264]
Length = 416
Score = 58.9 bits (141), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 96/418 (22%), Positives = 192/418 (45%), Gaps = 24/418 (5%)
Query: 6 KQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRN 65
K+N F W +G++ A +LM+ T+L + +++ + + + Q++
Sbjct: 16 KKN---FSWTFIGNIIYAACQWAILMLFTKLGSVKMVGVFSLGLAITAPVYMFLNLQLQG 72
Query: 66 YQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFS 125
ATD YSF++Y RLMT + + + +++L ++ D +VFL+ + D+FS
Sbjct: 73 ILATDKKNNYSFNEYFSLRLMTSNIGMLLIMVFLLISNYDLVTKWVVFLIALAKYFDSFS 132
Query: 126 DLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIMYYD 185
++ G+ Q E + SL + L + + ++ ++++ + S V + YD
Sbjct: 133 EIIFGLLQNKELMKRISISLILKGLLSVSSMFLSLYITGDILISMICYAVSSCVIFILYD 192
Query: 186 IGHSKKFQKLMFSELLSNISF--QNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELMTTL 243
+ K F++ + +SF L S P+ L LI P+Y IE L
Sbjct: 193 MKSMKMFKQCI------KVSFVFSKLKSLFLLSLPMGLVMLLISLNTNIPRYFIE--DYL 244
Query: 244 GEVALGSQTIFNILFMPAFVM----NLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFA 299
G +LG F+ L A+VM ++ + ++++A + IK F + +L
Sbjct: 245 GAESLG---YFSAL---AYVMVAGNTVISALGQACVSRLAQYYVEMDIKSFRVLLFKLIG 298
Query: 300 YLGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMR 359
+ +I +V G G L+I+Y +Y F+LIM+ IG ++ + +TA R
Sbjct: 299 IGILIGIIGVVVIGFIGEEILTIIYSDAYKEYNHIFILIMISAGIGYISSFMGYGMTAAR 358
Query: 360 KQQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWL-GLSIMIYLFIMNR 416
++ + + ++++L++ V +Y + GA ++ + +V L G S++I + R
Sbjct: 359 CYKVQPVIFGIVSIVTILLSYWCVPQYGLTGAGITLIFASIVQLIGSSLVIVYLLKKR 416
>ref|YP_663564.1| hypothetical protein Patl_4010 [Pseudoalteromonas atlantica T6c]
gb|ABG42510.1| hypothetical protein Patl_4010 [Pseudoalteromonas atlantica T6c]
Length = 420
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 81/390 (20%), Positives = 174/390 (44%), Gaps = 23/390 (5%)
Query: 15 NMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQATDINEK 74
+L +LS ++ + +LL+V+ ++ +A + A S + + ++ ++R D++ +
Sbjct: 12 TLLANLSFSLTNWLLLVVIAKVYDAAFLGQFVLALSIVSPVFLLSSLKLRTLIVVDVDNE 71
Query: 75 YSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLYQGMFQQ 134
YS QYL RL+ + L ++++ L ++ D+ ++ L+ Y+ D++ +L+ + +
Sbjct: 72 YSLEQYLGTRLLLSSIGLTLSIL-LGISFFDNVPFLLMLLIGIYKWCDSWCELFYAYYHR 130
Query: 135 HERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVF-----IMYYDIGHS 189
R D A S R+ L V I L S + L + +++F +++ ++
Sbjct: 131 TGRFDTATFSQCGRSVLSISVVIIIALLSDSAILMVSGWVATTVIFSVVDTVVFTNLRRR 190
Query: 190 KKFQKLMFSELLS-NISFQNSLKLLKESFPLFLN---GFLIIYIYTQPKYAIELMTTLGE 245
+ + LLS + + L++LK+ + L L+ G L +YI P + IE + + E
Sbjct: 191 NETVSTNWISLLSVQYAMRTPLRILKKYYTLSLSLVVGALFVYI---PNFVIEKYSGV-E 246
Query: 246 VALGSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLF---AYLG 302
A + L +++ + P + +A +G K + ++ A +G
Sbjct: 247 AAGEFAAVSYFLIAGGLLIHSVSQASSPRLAALA---KQGNGKGLINLTFKMCLIGAAIG 303
Query: 303 VFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRKQQ 362
V +I + +G F FL + Y + + +++ +I I + A+++
Sbjct: 304 VSGVIVALVAGEF---FLELFYNPEIALLSEELTWVLVAAAIRYIYIFIGTAMNALKQFH 360
Query: 363 LLLIPYTGGFLISLLITNLFVMKYHILGAA 392
Y G L L+ +V +Y LGAA
Sbjct: 361 TQTYIYAVGTLCVLVACLFWVPEYGSLGAA 390
>sp|P39855|CAPF_STAAU Capsular polysaccharide biosynthesis protein capF
gb|AAA64645.1| type 1 capsule synthesis gene; CapF [Staphylococcus aureus]
Length = 396
Score = 49.3 bits (116), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 101/414 (24%), Positives = 183/414 (44%), Gaps = 47/414 (11%)
Query: 6 KQNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRN 65
K +F+ N+L +L +I L+V+ RL T D Y +A + + ++R+
Sbjct: 3 KNFNYMFVANILSALCKFLI----LLVIVRLGTPEDVGRYNYALVITAPIFLFISLKIRS 58
Query: 66 YQATDINEKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFS 125
T N+KYS ++Y+ A L ++ L I++ + T + +V + +
Sbjct: 59 VIVT--NDKYSPNEYISAILSLNIITLIFVAIFVYVLGNGDL--TTILIVSLIKLFENIK 114
Query: 126 DLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLAL---VAVCIVSLVFIM 182
++ G++Q++E L + G S+ N L +++ I +S NL +AL V CI S I
Sbjct: 115 EVPYGIYQKNESLKLLGISMGIYNILSLILFYIIYSFSHNLNMALLFLVISCIFSFAII- 173
Query: 183 YYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESF----PLFLNGFLIIYIYTQPKYAIE 238
D + K+ + + + N++ KE F PL + L P+ +E
Sbjct: 174 --DRWYLSKYYNI-------KLHYNNNIAKFKEIFILTIPLAFSSALGSLNTGIPRIVLE 224
Query: 239 LMTTLGEVALGS-QTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIK-EFNKIQVQ 296
G+ LG TI +L + N + F P + + L + + K EF K+ +
Sbjct: 225 --NLFGKYTLGIFSTIAYVLVIGGLFANSISQVFLPKLRK----LYKDEKKIEFEKLTRK 278
Query: 297 LFAYLGVF-SLIALVGSGLFGIPFLSILYG-----TNLTDYWVDF-MLIMLGGSIGSFAT 349
+ ++G+F + +++ S G LS+L+G N+ + F +L +L G
Sbjct: 279 M-VFIGIFIGMCSVILSLFLGEALLSLLFGKEYGENNIILIILSFGLLFILSGIFLGTTI 337
Query: 350 VIDNILTAMRKQQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWL 403
+ K L+L+ F I L+ + L + KY +LGAAL+ I+ V L
Sbjct: 338 IATGKYNVNYKISLILL-----FCI-LIFSFLLIPKYSLLGAALTITISQFVAL 385
>ref|ZP_01905856.1| polysaccharide biosynthesis protein [Plesiocystis pacifica SIR-1]
gb|EDM81091.1| polysaccharide biosynthesis protein [Plesiocystis pacifica SIR-1]
Length = 399
Score = 46.2 bits (108), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/122 (25%), Positives = 53/122 (43%)
Query: 30 LMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQATDINEKYSFSQYLVARLMTCL 89
LMVV +L + YA + A ++V+ +R D+ ++ F YL R++
Sbjct: 10 LMVVAKLGSPEALGRYALGLAVATPIIVLANLHLRPIYVVDVRSRWRFGDYLRLRMLLIP 69
Query: 90 LMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLYQGMFQQHERLDIAGKSLAYRN 149
LA T + + +V LV R++ + +D+ Q+ E +D G S A R
Sbjct: 70 GALAATAGVCLVRGWPALTIGVVLLVALIRASGSATDILYARAQRAEAMDPIGISRAVRG 129
Query: 150 TL 151
L
Sbjct: 130 VL 131
>ref|XP_976745.1| Leishmanolysin family protein [Tetrahymena thermophila SB210]
gb|EAR86150.1| Leishmanolysin family protein [Tetrahymena thermophila SB210]
Length = 1297
Score = 44.7 bits (104), Expect = 0.016, Method: Composition-based stats.
Identities = 45/194 (23%), Positives = 84/194 (43%), Gaps = 38/194 (19%)
Query: 7 QNQTIFLWNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNY 66
QN + LW I IL+++ +L Y F + + QV
Sbjct: 1054 QNSQVILW----------IQAILVVLFYKLE-------YKKCLKFQSSQNPQNITQVAQT 1096
Query: 67 QATDINE----------KYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVC 116
QAT+IN+ KY R + +++L ++ + + ++Y I+F+ C
Sbjct: 1097 QATEINQENKQLFYFIDKYVKQDNFATRNIIQIIVLKQMIVAIAIISENAYVQIILFIFC 1156
Query: 117 FYRSTDAFSDLYQGMFQ--QHERLDIAGKSLAYRNTLIFMVYTAIILYSK----NLTLAL 170
++ AF LY G+F+ + + +I ++ NT + +VY I+Y N +L+L
Sbjct: 1157 YF----AFC-LYTGIFRPLKQFKQNIILLTIYLINTTLSVVYAVSIIYQNDPQINKSLSL 1211
Query: 171 VAVCIVSLVFIMYY 184
+ IV V++M +
Sbjct: 1212 TMLAIVISVYVMLF 1225
>ref|YP_001815039.1| polysaccharide biosynthesis protein [Exiguobacterium sibiricum
255-15]
gb|ACB62022.1| polysaccharide biosynthesis protein [Exiguobacterium sibiricum
255-15]
Length = 420
Score = 43.5 bits (101), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 64/299 (21%), Positives = 135/299 (45%), Gaps = 14/299 (4%)
Query: 114 LVCFYRSTDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKN----LTLA 169
LV + ++F+DL G Q H S +R +L+ + A+IL++ + LA
Sbjct: 111 LVYLNKYMESFADLAYGFLQGHMAFKEVALSKIFR-SLVNVSGAALILFTTHSIHGFVLA 169
Query: 170 LVAVCIVSLVFIMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYI 229
LVA +L+ ++ YD+ ++ ++ +S +Q +L ++ PL LI
Sbjct: 170 LVAG---NLIMLILYDLPTVRRVGHGFDNQQMSE-RYQTGRQLFFKAVPLGFVALLIALN 225
Query: 230 YTQPKYAIELMTTLGEVALGS-QTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIK 288
P+ + +G LG +I +L + + ++ L+ P+ + + R +
Sbjct: 226 ANIPRLFVG--HAIGTEELGYYASIAYLLVLGSLFIHSLVAVLLPNFSSDSGE--RQSLP 281
Query: 289 EFNKIQVQLFAYLGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFA 348
E K+ + ++ ++GS FG L+I Y + Y F+L+M+ +
Sbjct: 282 ELRKLTRSMLLMTNAVGILLIIGSIFFGKWGLTIFYNASFVQYHTIFVLMMVASLFFYNS 341
Query: 349 TVIDNILTAMRKQQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVWLGLSI 407
TVI +LT ++ ++ I G +++++ ++ + Y + GA ++ + + +GL I
Sbjct: 342 TVIQALLTGFQQFRVQTIAIFGSVIVNIVACSILIPMYGLYGATTAYGLCAVTQIGLLI 400
>ref|YP_001517033.1| polysaccharide biosynthesis protein, putative [Acaryochloris marina
MBIC11017]
gb|ABW27717.1| polysaccharide biosynthesis protein, putative [Acaryochloris marina
MBIC11017]
Length = 472
Score = 41.2 bits (95), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 61/282 (21%), Positives = 128/282 (45%), Gaps = 27/282 (9%)
Query: 129 QGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNL---TLALVAVCIVSLVFIMYYD 185
+G+++ + + LA ++ ++ + +++ SK+L T A V I+ L +++
Sbjct: 136 RGVYRGNSQYKSETILLASERIILGIIASLVLVISKDLFFVTAAFAGVRIIDLFVVIFI- 194
Query: 186 IGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNGFLIIYIYTQPKYAIELMTTLGE 245
S++F LLS I+ Q +K+++P L G L + Y + ++T +
Sbjct: 195 --LSRQFT------LLSAINSQTVWNAVKKAYPFALTGILWVVYYQIDLLMLNTLSTSEQ 246
Query: 246 VAL--GSQTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQLFA---Y 300
V S IF I + L + F T++A R + K+ +LF
Sbjct: 247 VGYYSASYRIFEIF------LTLPRIIFLVSFTKLA----RHNFNDKPKLSTELFNSVIL 296
Query: 301 LGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDFMLIMLGGSIGSFATVIDNILTAMRK 360
L VF L ++ +GL +P ++I+YG++ ++++ I F T+I A ++
Sbjct: 297 LIVFVLPFIIAAGLLSVPLVNIIYGSSFYAAIPSLVILLPSLGIKMFGTLIQYFFEATKR 356
Query: 361 QQLLLIPYTGGFLISLLITNLFVMKYHILGAALSFLITMLVW 402
+++L + ++ + + + LGAAL+ LI+ ++
Sbjct: 357 EKILPPLLLTTVIFNISANAILIPLWGALGAALATLISEFIF 398
>ref|YP_213068.1| putative LPS biosynthesis related polysaccharide transporter
[Bacteroides fragilis NCTC 9343]
emb|CAH09154.1| putative LPS biosynthesis related polysaccharide transporter
[Bacteroides fragilis NCTC 9343]
Length = 449
Score = 40.0 bits (92), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 63/280 (22%), Positives = 126/280 (45%), Gaps = 34/280 (12%)
Query: 1 MINPSKQNQTIFL---WNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVV 57
++N S + +FL W +LG ++ + ++ + ++V R L +D + + S+ ++ ++
Sbjct: 10 LLNLSDTKKRVFLNVFWAILGKVANMLGALFVGILVARYLGPSDYGLMNYVISYVSIFLI 69
Query: 58 VGLFQVRNYQATDINEKYSFSQYLVARLMT---CLLMLAITVIYLTLT--KTDSYKSTI- 111
+ F + + + + + Q ++ C +LA +I ++L KTD + STI
Sbjct: 70 ISSFGLDDIEIREFSRNPQKYQTIIGTAFCIRFCFALLAYILIGISLLIYKTDLFTSTII 129
Query: 112 ------VFLVCFYRSTDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIIL--YS 163
VF CF + F+ + Q + KS +R +I + IIL Y
Sbjct: 130 LIYAVTVFSNCFNVIRNYFTSILQN--------EYIVKSELFR--IIIGAFLKIILLWYK 179
Query: 164 KNLTLALVAVCIVSLVFIMYYDIGHSKKFQKLMFSELLSNISFQNSLKLLKESFPLFLNG 223
L ++A +++ Y + + +K K+ + I+F L+KESFPL L+G
Sbjct: 180 APLEYFIIATMFDTVLVSGGYFLSYYRKIGKVSYWNFNKKIAFF----LIKESFPLVLSG 235
Query: 224 FLIIYIYTQPKYAIELM---TTLGEVALGSQTIFNILFMP 260
++ + I+ M ++G A + + ILF+P
Sbjct: 236 AAVVIYQRIDQVMIKNMIDNESVGYFATAGRFLDIILFLP 275
>ref|YP_001097823.1| polysaccharide biosynthesis protein [Methanococcus maripaludis C5]
gb|ABO35609.1| polysaccharide biosynthesis protein [Methanococcus maripaludis C5]
Length = 477
Score = 39.7 bits (91), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 64/272 (23%), Positives = 129/272 (47%), Gaps = 34/272 (12%)
Query: 67 QATDINEKY---SFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDA 123
+ T + EKY S + L+ + T LL+L +T I + K +Y +++++ Y A
Sbjct: 72 RDTSLTEKYLGNSIAIKLILSIFTFLLILGMTNI-MGYPKETTY---VIYILFIYTIFSA 127
Query: 124 FSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCIVSLVFIMY 183
+++++ ++Q HE++ G + LIF+ T + +Y + S +F+
Sbjct: 128 YNNIFYSIYQAHEKMAYFGVGGLINSFLIFLS-TMVGVYCNAPMYYFAYAYLFSNIFVFI 186
Query: 184 YDIGHSK-KFQKLMFSELLSNISFQNSLKLLKESFPLFLNG-FLIIYIYTQP---KYAIE 238
Y++ + F K+ F ++ SF LK ++P L+G F+ IY + Y I+
Sbjct: 187 YNMVITNLNFTKINF---FADFSFWKD--FLKNAWPFALSGIFVTIYFWMDSIMISYFID 241
Query: 239 LMTTLGEVALGSQTIFNILFMPA-FVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQL 297
+++G + + ++ +LF+P+ + + + + + A+ LI G+ ++
Sbjct: 242 -ESSVGLYSAAYRLVYVLLFIPSVYFSTMYPILSKKYRDSGAVKLIYGR-------SLKY 293
Query: 298 FAYLGVFSLIALVGS--GLFGIPFLSILYGTN 327
FA LGVF +GS LF +S++YG
Sbjct: 294 FAILGVF-----MGSLTTLFSENIISLIYGNE 320
>ref|YP_001274132.1| polysaccharide biosynthesis protein, MviN-like family
[Methanobrevibacter smithii ATCC 35061]
gb|ABQ87764.1| polysaccharide biosynthesis protein, MviN-like family
[Methanobrevibacter smithii ATCC 35061]
Length = 476
Score = 39.7 bits (91), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 45/179 (25%), Positives = 84/179 (46%), Gaps = 16/179 (8%)
Query: 14 WNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQATDINE 73
W M+ + T+V + + ++ R L +D I A SF+ +++VV V Y I+
Sbjct: 13 WLMISQIITSVCAFVWTILTARYLGVSDYGILGTATSFSVIVIVVADLGVTTYITRSISV 72
Query: 74 KYSF-SQY----LVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLY 128
Y+ ++Y L +L+ +L LA+ + L D++ I FL +F +L
Sbjct: 73 DYNVEAEYLGNALSLKLILSVLYLAVVIFISYLLGWDNFTILITFLFAIESLIKSFYNLL 132
Query: 129 QGMFQQHERLDIAGKSLAYRNTL---IFMVYTAIILYSK----NLTLALVAVCIVSLVF 180
FQ HE++ K A NTL + +V+ +I ++ +T A +A ++ L++
Sbjct: 133 FASFQAHEQM----KYQAITNTLLNVLTLVFIVMICFTDFGLLGITFAYIAANLIGLIY 187
>ref|YP_001274133.1| polysaccharide biosynthesis protein, MviN-like family
[Methanobrevibacter smithii ATCC 35061]
gb|ABQ87765.1| polysaccharide biosynthesis protein, MviN-like family
[Methanobrevibacter smithii ATCC 35061]
Length = 475
Score = 38.9 bits (89), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 47/186 (25%), Positives = 76/186 (40%), Gaps = 11/186 (5%)
Query: 14 WNMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQ----AT 69
W M + T+V + + ++ R L +D + A SFA + V V Y +T
Sbjct: 13 WLMFSQIITSVCAFVWTILTARYLGVSDYGLLGTATSFATIFGVCADLGVTTYIVRSIST 72
Query: 70 DIN-EKYSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLY 128
D + EK + +L+ + LA+ + L + D+Y I FL +F
Sbjct: 73 DFDSEKKYLGNAIGIKLILAVFYLAVVSLALFILGWDNYTVVICFLFAVENVIKSFQTAM 132
Query: 129 QGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNL---TLALVAVCIVSLVFIMYYD 185
FQ HE + + N L F+ A+ + L TLA +A ++ LV Y
Sbjct: 133 YSSFQAHEMMKYQAITNTLLNVLTFIFIVAVTFTNYGLWGITLAYIAANLIGLV---YAT 189
Query: 186 IGHSKK 191
+ SKK
Sbjct: 190 LALSKK 195
>ref|ZP_01612341.1| AmrA [Alteromonadales bacterium TW-7]
gb|EAW28535.1| AmrA [Alteromonadales bacterium TW-7]
Length = 405
Score = 38.5 bits (88), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 81/361 (22%), Positives = 163/361 (45%), Gaps = 54/361 (14%)
Query: 15 NMLGSLSTAVISVILLMVVTRLLTSADSDIYAFAYSFANMMVVVGLFQVRNYQATDINEK 74
N+ G+ A+ +IL + ++R IY+++ + A +M +V +RN ATD++ K
Sbjct: 8 NLFGNAFFALSQLILFVYISRQYGVESLGIYSYSLAIATIMYMVSNVGLRNVLATDLSSK 67
Query: 75 YSFSQYLVARLMTCLLMLAITVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLYQGMFQQ 134
S Y+ RL ++ L + V+ ++ + I+ L+ Y ++ D+Y G +
Sbjct: 68 SEDSTYIKLRLSLSIITLFVGVVVFMISINTNLLFCILILLIKY--IESIQDIYVGFLHR 125
Query: 135 HERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTLALVAVCI---VSLVFIMYYDIGHSKK 191
+ ++ +F+ ++L ++ L+ + + + L F++Y
Sbjct: 126 KQLFAKIRTVNFFKGAFVFLSIIGVMLMELSINSLLLILVLFNSIILCFLIYN------- 178
Query: 192 FQKLMFSELLSNISFQNS-----LKLLKE----SFPLFLNGFLIIYIYTQPKYAIELMTT 242
SNI F NS ++ K+ SFP + G L P+ I+ +
Sbjct: 179 ----------SNIDFGNSSNELIVRKYKDLILFSFPFAVMGVLATVNLNGPRIFIK--SN 226
Query: 243 LGEVALGS-QTIFNILFMPAFVMNLLILFFRPHITQMAIALIRGQIKEFNKIQVQ----- 296
LG AL ++ I+F+ + V+ PH+T + ++G + ++ K+ ++
Sbjct: 227 LGLEALAKYSAMYQIVFLGSVVVLAYGQAVLPHLTNL---FMKGMVDKWLKLIIKSMLAI 283
Query: 297 LFAYLGVFSLIALVGSGLFGIPFLSILYGTNLTDYWVDF-MLIMLGGSIGSFATVIDNIL 355
LF + VF LVG +G+ +S ++GT +T + ++ M I+LG FA+ NI+
Sbjct: 284 LFCTMAVF----LVGYH-YGVAIMSYVFGT-ITFHRLEISMFILLG-----FASYFLNIM 332
Query: 356 T 356
+
Sbjct: 333 S 333
>ref|YP_345310.1| Capsular polysaccharide biosynthesis protein, putative [Rhodobacter
sphaeroides 2.4.1]
gb|ABA81569.1| Capsular polysaccharide biosynthesis protein, putative [Rhodobacter
sphaeroides 2.4.1]
Length = 399
Score = 38.1 bits (87), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 62/138 (44%), Gaps = 5/138 (3%)
Query: 51 FANMMVVVGLFQVRNYQATDINEKYSFSQYLVARLMTCL--LMLAITVIYLTLTKTDSYK 108
FA + ++ GL +R A K S S L+ R T L +LA + YL +++ +
Sbjct: 41 FAPLCLLTGL-NLRVAMAVSDPPKISPSTALLLRSTTTLACFILAGAITYLVSASSETGR 99
Query: 109 STIVFLVCFYRSTDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFMVYTAIILYSKNLTL 168
S I+ L RS D SD+ G FQQ + G+S R + +L
Sbjct: 100 SAILLLA--LRSFDQISDVSVGYFQQRNQRSNVGRSFLVRGLGNLAPFLVAFELGFSLDG 157
Query: 169 ALVAVCIVSLVFIMYYDI 186
ALV + +++ + Y+DI
Sbjct: 158 ALVISLLSTVLVVAYFDI 175
>ref|ZP_02134108.1| MATE efflux family protein [Desulfatibacillum alkenivorans AK-01]
gb|EDQ24324.1| MATE efflux family protein [Desulfatibacillum alkenivorans AK-01]
Length = 454
Score = 35.4 bits (80), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 60/115 (52%), Gaps = 12/115 (10%)
Query: 312 SGLFGI-PFLSILYGTN--LTDYWVDFMLIMLGGSIGSFATVIDNILTAMRKQQLLLIPY 368
+GLF I P L I ++ L W F +IML G + +FA +N + A ++ ++
Sbjct: 114 AGLFAITPLLRIFGASDTLLPLTWEYFRIIMLAGPLMTFAMTANNAVRAEGAAKMAMLTM 173
Query: 369 TGGFLISLLITNLF--VMKYHILGAALSFLITMLVWLGLSIMIYLFIMNRFKKGR 421
G +++ ++ +F V+K I GAA + + +M + ++M+ L+ FK GR
Sbjct: 174 MSGAVLNTILDPIFIYVLKMGIRGAAWATVASMFL---STVMLLLY----FKSGR 221
>ref|XP_001613551.1| hypothetical protein PVX_081380 [Plasmodium vivax SaI-1]
gb|EDL43824.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 226
Score = 35.4 bits (80), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 68/156 (43%), Gaps = 18/156 (11%)
Query: 95 TVIYLTLTKTDSYKSTIVFLVCFYRSTDAFSDLYQGMFQQHERLDIAGKSLAYRNTLIFM 154
VIY + + D+ FY D+F + +++ +LDI AY NT +
Sbjct: 58 NVIYKQIKERDT-------PFKFYSIGDSFYGRILSISKKNIKLDILCDRKAYLNTSEYF 110
Query: 155 VYTAIILYSKNLTLALVAVCIVSLVFIMYYDIGHSK---KFQKLMFSELLSNISFQNSLK 211
+ Y AL+ V V V I Y H K + QK + E+LS SFQN
Sbjct: 111 RLPNLHKY----VFALLKVHNVIRVKIKYIQRVHQKIAVQIQKYSYEEILS--SFQNGQS 164
Query: 212 LLKESFPLFLNGFLIIYIYTQPKYAIELMTTLGEVA 247
L+ L+G++++Y+ P+ +L+ GE A
Sbjct: 165 LMNAKILDVLDGYVLLYL--APQIHAKLLLREGERA 198
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
Posted date: May 10, 2008 4:54 AM
Number of letters in database: 2,222,278,849
Number of sequences in database: 6,515,104
Lambda K H
0.332 0.144 0.414
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 6515104
Number of Hits to DB: 1,544,733,530
Number of extensions: 59694997
Number of successful extensions: 245059
Number of sequences better than 10.0: 379
Number of HSP's gapped: 247607
Number of HSP's successfully gapped: 379
Length of query: 428
Length of database: 2,222,278,849
Length adjustment: 137
Effective length of query: 291
Effective length of database: 1,329,709,601
Effective search space: 386945493891
Effective search space used: 386945493891
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (22.0 bits)
S2: 80 (35.4 bits)