BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SGO_1501
(252 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|157075538|gb|ABV10221.1| membrane protein, putative [Str... 441 e-122
gi|125718483|ref|YP_001035616.1| hypothetical protein SSA_1... 293 9e-78
gi|154498320|ref|ZP_02036698.1| hypothetical protein BACCAP... 222 1e-56
gi|81096298|ref|ZP_00874645.1| conserved hypothetical prote... 198 3e-49
gi|29376219|ref|NP_815373.1| hypothetical protein EF1665 [E... 184 4e-45
gi|154509386|ref|ZP_02045028.1| hypothetical protein ACTODO... 181 4e-44
gi|24380217|ref|NP_722172.1| hypothetical protein SMU.1856c... 158 2e-37
gi|153940512|ref|YP_001391097.1| TraX family protein [Clost... 108 2e-22
gi|148379802|ref|YP_001254343.1| F pilin acetylation protei... 103 7e-21
gi|148380772|ref|YP_001255313.1| membrane protein [Clostrid... 100 1e-19
gi|153940880|ref|YP_001392090.1| hypothetical protein CLI_2... 98 6e-19
gi|15673353|ref|NP_267527.1| hypothetical protein L9467 [La... 91 5e-17
gi|116512221|ref|YP_809437.1| hypothetical protein LACR_150... 84 6e-15
gi|125623913|ref|YP_001032396.1| hypothetical protein llmg_... 84 7e-15
gi|106885341|ref|ZP_01352703.1| conserved hypothetical prot... 82 2e-14
gi|126699278|ref|YP_001088175.1| hypothetical protein CD167... 76 2e-12
gi|81428906|ref|YP_395906.1| Hypothetical membrane protein ... 66 2e-09
gi|29377245|ref|NP_816399.1| hypothetical protein EF2771 [E... 60 9e-08
gi|116627726|ref|YP_820345.1| hypothetical protein STER_092... 57 1e-06
gi|55822860|ref|YP_141301.1| hypothetical protein str0900 [... 57 1e-06
gi|156864725|gb|EDO58156.1| hypothetical protein CLOL250_01... 56 2e-06
gi|55820940|ref|YP_139382.1| Conserved hypothetical, predic... 55 3e-06
gi|156864935|gb|EDO58366.1| hypothetical protein CLOL250_00... 55 3e-06
gi|150005888|ref|YP_001300632.1| hypothetical protein BVU_3... 54 9e-06
>gi|157075538|gb|ABV10221.1| membrane protein, putative [Streptococcus gordonii str. Challis
substr. CH1]
Length = 252
Score = 441 bits (1133), Expect = e-122, Method: Composition-based stats.
Identities = 252/252 (100%), Positives = 252/252 (100%)
Query: 1 MKKGNGLNGFQIKLIMALLMVLDHVDKIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEGFI 60
MKKGNGLNGFQIKLIMALLMVLDHVDKIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEGFI
Sbjct: 1 MKKGNGLNGFQIKLIMALLMVLDHVDKIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEGFI 60
Query: 61 HTRNRITYNLRLFFWAGLMFFGNVILNFLLQSKEIHNSNNIFLTLACGVLSLGIFFGFSK 120
HTRNRITYNLRLFFWAGLMFFGNVILNFLLQSKEIHNSNNIFLTLACGVLSLGIFFGFSK
Sbjct: 61 HTRNRITYNLRLFFWAGLMFFGNVILNFLLQSKEIHNSNNIFLTLACGVLSLGIFFGFSK 120
Query: 121 EKLELNGRDKWIRYGLGAVVFLVAAFLTEGGMVMIPFMLITYGYRKNEALRNLLYLALAV 180
EKLELNGRDKWIRYGLGAVVFLVAAFLTEGGMVMIPFMLITYGYRKNEALRNLLYLALAV
Sbjct: 121 EKLELNGRDKWIRYGLGAVVFLVAAFLTEGGMVMIPFMLITYGYRKNEALRNLLYLALAV 180
Query: 181 LLFVMSIEVYPSLSETLSMMLYNSDWLFITVLPFLYIYNGERGYTSKWSKYFFYIFYPAH 240
LLFVMSIEVYPSLSETLSMMLYNSDWLFITVLPFLYIYNGERGYTSKWSKYFFYIFYPAH
Sbjct: 181 LLFVMSIEVYPSLSETLSMMLYNSDWLFITVLPFLYIYNGERGYTSKWSKYFFYIFYPAH 240
Query: 241 LWLIALIAFWVK 252
LWLIALIAFWVK
Sbjct: 241 LWLIALIAFWVK 252
>gi|125718483|ref|YP_001035616.1| hypothetical protein SSA_1682 [Streptococcus sanguinis SK36]
gi|125498400|gb|ABN45066.1| Conserved hypothetical protein [Streptococcus sanguinis SK36]
Length = 251
Score = 293 bits (749), Expect = 9e-78, Method: Composition-based stats.
Identities = 158/250 (63%), Positives = 192/250 (76%)
Query: 3 KGNGLNGFQIKLIMALLMVLDHVDKIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEGFIHT 62
K G+N FQ+KL MA LMV DH+ +IPGL+P GWDG+ HA+TRCVGV F+F AVEGF+HT
Sbjct: 2 KSKGINAFQLKLFMAFLMVFDHISQIPGLVPDGWDGVLHALTRCVGVAFAFMAVEGFLHT 61
Query: 63 RNRITYNLRLFFWAGLMFFGNVILNFLLQSKEIHNSNNIFLTLACGVLSLGIFFGFSKEK 122
RNR+ YN+RLFFWA LM GN IL L Q K I+ ++NIFLTLACGVL L +FFGFS+
Sbjct: 62 RNRLAYNMRLFFWAALMQTGNCILTLLFQEKGIYLTHNIFLTLACGVLMLSLFFGFSENG 121
Query: 123 LELNGRDKWIRYGLGAVVFLVAAFLTEGGMVMIPFMLITYGYRKNEALRNLLYLALAVLL 182
R + +R G +V LV +EGGM ++PFML+TY +R RNL Y+ A +L
Sbjct: 122 GAAKDRKRGLRIAAGVLVLLVGLLFSEGGMALLPFMLLTYLFRNQVFFRNLSYVVWAGIL 181
Query: 183 FVMSIEVYPSLSETLSMMLYNSDWLFITVLPFLYIYNGERGYTSKWSKYFFYIFYPAHLW 242
F MSI++YP+L +TLSM+LYNSDWLFI+VLP L+ YNGERG +SKWSKYFFYIFYPAHLW
Sbjct: 182 FAMSIQIYPTLQDTLSMLLYNSDWLFISVLPLLHFYNGERGSSSKWSKYFFYIFYPAHLW 241
Query: 243 LIALIAFWVK 252
LIALIAFWVK
Sbjct: 242 LIALIAFWVK 251
>gi|154498320|ref|ZP_02036698.1| hypothetical protein BACCAP_02309 [Bacteroides capillosus ATCC
29799]
gi|150272631|gb|EDM99809.1| hypothetical protein BACCAP_02309 [Bacteroides capillosus ATCC
29799]
Length = 248
Score = 222 bits (566), Expect = 1e-56, Method: Composition-based stats.
Identities = 123/235 (52%), Positives = 162/235 (68%), Gaps = 3/235 (1%)
Query: 8 NGFQIKLIMALLMVLDHVDKIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEGFIHTRNRIT 67
N F++KL MALLMVLDH+D IPGLLP +FHAVTRCVGVWF++ AVEGF+HTR+R+
Sbjct: 7 NAFELKLFMALLMVLDHLDHIPGLLPPELSALFHAVTRCVGVWFAYLAVEGFLHTRSRLR 66
Query: 68 YNLRLFFWAGLMFFGNVILNFLLQSKEIHNSNNIFLTLACGVLSLGIFFGFSKEKLELNG 127
YNLRLF WA +M GN + N L K + SNNIF TLA GVL L + G +
Sbjct: 67 YNLRLFGWAAVMALGNQLYNLLAAGKGLTLSNNIFFTLALGVLMLNLLAGAQPDAPLWQ- 125
Query: 128 RDKWIRYGLGAVVFLVAAFLTEGGMVMIPFMLITYGYRKNEALRNLLYLALAVLLFVMSI 187
+ +R G V +EGG V++PF+LITY +R +RN+LYL ++LLF +
Sbjct: 126 --RALRIVGGCAVLFAGLICSEGGPVILPFILITYYFRGRPLVRNILYLGFSLLLFFSAF 183
Query: 188 EVYPSLSETLSMMLYNSDWLFITVLPFLYIYNGERGYTSKWSKYFFYIFYPAHLW 242
+YP+L ETL M+ +NSD+LFI+VLPF+ +YNG+RG + ++KYFFY+FYP HLW
Sbjct: 184 HLYPTLRETLLMLGFNSDFLFISVLPFIALYNGQRGPNTPFAKYFFYVFYPLHLW 238
>gi|81096298|ref|ZP_00874645.1| conserved hypothetical protein [Streptococcus suis 89/1591]
gi|80977642|gb|EAP41178.1| conserved hypothetical protein [Streptococcus suis 89/1591]
Length = 244
Score = 198 bits (503), Expect = 3e-49, Method: Composition-based stats.
Identities = 119/244 (48%), Positives = 159/244 (65%), Gaps = 4/244 (1%)
Query: 8 NGFQIKLIMALLMVLDHVDKIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEGFIHTRNRIT 67
N +Q+KL MA LMVLDH+ IP +P +FH TRCV V+F++ VEGF+HTRN
Sbjct: 5 NAYQLKLAMAGLMVLDHLKIIPDFIPDQLALVFHLATRCVAVFFAYMLVEGFLHTRNVKA 64
Query: 68 YNLRLFFWAGLMFFGNVILNFLLQSKEIHNSNNIFLTLACGVLSLGIFFGFSKEKLELNG 127
Y LRL+ AGLM GN +LN L QSK I+ SNNIFLTLA G+ L FS E
Sbjct: 65 YLLRLYGAAGLMATGNTLLNSLYQSKGIYISNNIFLTLAVGLTLLICLQNFS----EHTR 120
Query: 128 RDKWIRYGLGAVVFLVAAFLTEGGMVMIPFMLITYGYRKNEALRNLLYLALAVLLFVMSI 187
+ I+ L ++ ++ A TEGG V++PF+LITY R++ R + Y LA++L MS
Sbjct: 121 AKQIIQVLLAGLLTIIGALFTEGGTVVLPFILITYLGRESATKRTIAYGFLALVLLAMSY 180
Query: 188 EVYPSLSETLSMMLYNSDWLFITVLPFLYIYNGERGYTSKWSKYFFYIFYPAHLWLIALI 247
+ Y L T+ MMLYN+DWLFI VLP L +YNG+ G + +S+YFFYIFYP HLW++A I
Sbjct: 181 QPYERLEMTIQMMLYNADWLFILVLPILSLYNGQAGPRTVFSRYFFYIFYPLHLWILATI 240
Query: 248 AFWV 251
+++
Sbjct: 241 GYFL 244
>gi|29376219|ref|NP_815373.1| hypothetical protein EF1665 [Enterococcus faecalis V583]
gi|29343682|gb|AAO81443.1| conserved hypothetical protein [Enterococcus faecalis V583]
Length = 240
Score = 184 bits (467), Expect = 4e-45, Method: Composition-based stats.
Identities = 119/246 (48%), Positives = 154/246 (62%), Gaps = 8/246 (3%)
Query: 7 LNGFQIKLIMALLMVLDHVDKIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEGFIHTRNRI 66
+N ++KL+M LMVLDH+ +P W IFH +TRCVGV+F + AVEGF +TRN
Sbjct: 1 MNANRLKLLMMGLMVLDHISY---FVPPEWALIFHVITRCVGVFFGYMAVEGFNYTRNVY 57
Query: 67 TYNLRLFFWAGLMFFGNVILNFLLQSKEIHNSNNIFLTLACGVLSLGIFFGFSKEKLELN 126
YN RL+ WA +MF GN +LN L+ + + NNIF TLA GV L + +K LE+
Sbjct: 58 RYNGRLYIWAAIMFVGNTLLNHLVNNPAVAVHNNIFFTLALGVSMLIV----TKAMLEMP 113
Query: 127 GRDKWIRYGLGAVVFL-VAAFLTEGGMVMIPFMLITYGYRKNEALRNLLYLALAVLLFVM 185
I + + L + A EGG+VM+PFMLITY RK LRN LY ALA+ V
Sbjct: 114 KISLKIVLLISILAILGIGAMFAEGGIVMLPFMLITYLARKRLVLRNCLYGALALFFLVT 173
Query: 186 SIEVYPSLSETLSMMLYNSDWLFITVLPFLYIYNGERGYTSKWSKYFFYIFYPAHLWLIA 245
S + T+ M+ YNSD++FITVLPF+ +YNGERG + + KY FY FYP HLWLIA
Sbjct: 174 SFQWLGDWPTTIEMLAYNSDFMFITVLPFISLYNGERGSKAPFFKYLFYGFYPLHLWLIA 233
Query: 246 LIAFWV 251
LIA +V
Sbjct: 234 LIANYV 239
>gi|154509386|ref|ZP_02045028.1| hypothetical protein ACTODO_01917 [Actinomyces odontolyticus ATCC
17982]
gi|153799020|gb|EDN81440.1| hypothetical protein ACTODO_01917 [Actinomyces odontolyticus ATCC
17982]
Length = 262
Score = 181 bits (459), Expect = 4e-44, Method: Composition-based stats.
Identities = 110/245 (44%), Positives = 155/245 (63%), Gaps = 6/245 (2%)
Query: 6 GLNGFQIKLIMALLMVLDHVDKIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEGFIHTRNR 65
G+N + IKL MA LMVLDH+ +PGL+P W IFH VTRCV VWF++ AVEG ++T +
Sbjct: 19 GVNQYWIKLAMAALMVLDHLPHVPGLVPVMWTDIFHVVTRCVAVWFAYGAVEGVLYTSSM 78
Query: 66 ITYNLRLFFWAGLMFFGNVILNFLLQSKEIH-NSNNIFLTLACGVLSLGIFFGFSKEKLE 124
Y RL+ + +M GN L LL ++++H NNIFLTLA G L + +S K
Sbjct: 79 RKYLARLWGASLIMAAGNYALGLLLATRDVHMYDNNIFLTLAVGTSLLALVKRWSGTKWH 138
Query: 125 LNGRDKWIRYGLGAVVFLVAAFLTEGGMVMIPFMLITYGYRKNEALRNLLYLALAVLLFV 184
KW G+ + + A EGG+ ++PFM+ITY R+L YLALA +F
Sbjct: 139 -----KWAIGGVSVLSLVCAVLPIEGGLPLLPFMVITYALYSRVVWRDLAYLALAAAMFA 193
Query: 185 MSIEVYPSLSETLSMMLYNSDWLFITVLPFLYIYNGERGYTSKWSKYFFYIFYPAHLWLI 244
+ + Y + T+SM+ NSD++ I V+P L++YNGE G +++SKYFFY+FYPAHLWL+
Sbjct: 194 LVWQPYDTWQATVSMLAQNSDFMLILVIPILHLYNGEHGPHTRFSKYFFYVFYPAHLWLL 253
Query: 245 ALIAF 249
AL+A+
Sbjct: 254 ALVAY 258
>gi|24380217|ref|NP_722172.1| hypothetical protein SMU.1856c [Streptococcus mutans UA159]
gi|24378224|gb|AAN59478.1|AE015012_7 conserved hypothetical protein [Streptococcus mutans UA159]
Length = 239
Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats.
Identities = 105/246 (42%), Positives = 158/246 (64%), Gaps = 15/246 (6%)
Query: 7 LNGFQIKLIMALLMVLDHVDKIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEGFIHTRNRI 66
+N FQ KL +A M+LDH+D + W H +TR V V F++ AVEGF +T++
Sbjct: 1 MNRFQFKLFLATFMMLDHIDFLVSEEMGIW---LHILTRFVAVGFAYLAVEGFFYTKDIT 57
Query: 67 TYNLRLFFWAGLMFFGNVILNFLLQSKEIHNSNNIFLTLACGVLSLGIFFGFSKEKLELN 126
Y +RL+ AGLMF GN ++NF+L ++ +NIFLTLA GV L ++ +K+E
Sbjct: 58 KYLMRLYIAAGLMFLGNNLINFMLHKPQVAAYHNIFLTLALGVTMLWLY-----QKIE-- 110
Query: 127 GRDKWIRYGLGAVVFLVAAFLTEGGMVMIPFMLITYGYRKNEALRNLLYLALAVLLF--V 184
K + + V L+ TEGG V++PFMLITY N RNL YLAL+VLLF +
Sbjct: 111 --HKVTKIIVTVSVLLLGFIFTEGGDVVLPFMLITYLNFSNSLRRNLWYLALSVLLFFTI 168
Query: 185 MSIEVYPSLSETLS-MMLYNSDWLFITVLPFLYIYNGERGYTSKWSKYFFYIFYPAHLWL 243
+ I + S E+L+ ++L N D+ F+T++P L++YNG++G +K+S+YFFY+FYPAHLW+
Sbjct: 169 VGIPMPSSSQESLTNLILLNPDFCFVTIIPLLFLYNGQQGLKNKFSQYFFYVFYPAHLWI 228
Query: 244 IALIAF 249
+A++ +
Sbjct: 229 LAIMHY 234
>gi|153940512|ref|YP_001391097.1| TraX family protein [Clostridium botulinum F str. Langeland]
gi|152936408|gb|ABS41906.1| TraX family protein [Clostridium botulinum F str. Langeland]
Length = 233
Score = 108 bits (271), Expect = 2e-22, Method: Composition-based stats.
Identities = 88/249 (35%), Positives = 137/249 (55%), Gaps = 21/249 (8%)
Query: 7 LNGFQIKLIMALLMVLDHVDKIPGLLPA--GWDGIFHAVTRCVGVWFSFAAVEGFIHTRN 64
L+GF++K+I +LMVLDH+ K P GW G R V F F EGF HTR+
Sbjct: 3 LDGFKLKIIAMILMVLDHLPKAFNNTPIWFGWLG------RLVAPIFFFFVAEGFFHTRS 56
Query: 65 RITYNLRLFFWAGLMFFGNVILNFLLQSKEIHNSNNIFLTLACGVLSLGIFFGFSKEKLE 124
+ Y +RLF W +MF G+ ILN+ L KE NNIFL+L VL + I ++
Sbjct: 57 KSKYLIRLFGWGAIMFAGSSILNYALPGKEPLQ-NNIFLSLGLSVLLMCI--------ID 107
Query: 125 LNGRDKWIRYGLG-AVVFLVAAFLTEGGMVMIPFMLITYGYRKNEALRNLLYLALAVLLF 183
+ K ++G+ A++ + + TE + L+ Y +R+++ ++ Y+ +++ F
Sbjct: 108 YTRKSKNYKFGIPLAIIVSIVSLFTEASFDGVLMTLVLYFFREDKIKLSIGYILISLFEF 167
Query: 184 VMSIEVYPSLSETLSMMLYNSDWLFITVLPFLYIYNGERGYTSKWSKYFFYIFYPAHLWL 243
+M V L++ + N WL I LP + +YNGERG +K+ KY FY FYP HLW+
Sbjct: 168 IM---VSGEGLTYLNLFVLNYQWLMIFALPLILMYNGERGLNNKFIKYMFYAFYPIHLWI 224
Query: 244 IALIAFWVK 252
I +I+ ++K
Sbjct: 225 ITIISHFLK 233
>gi|148379802|ref|YP_001254343.1| F pilin acetylation protein [Clostridium botulinum A str. ATCC
3502]
gi|153932311|ref|YP_001384099.1| TraX family protein [Clostridium botulinum A str. ATCC 19397]
gi|153937061|ref|YP_001387639.1| TraX family protein [Clostridium botulinum A str. Hall]
gi|148289286|emb|CAL83382.1| putative F pilin acetylation protein [Clostridium botulinum A str.
ATCC 3502]
gi|152928355|gb|ABS33855.1| TraX family protein [Clostridium botulinum A str. ATCC 19397]
gi|152932975|gb|ABS38474.1| TraX family protein [Clostridium botulinum A str. Hall]
Length = 233
Score = 103 bits (258), Expect = 7e-21, Method: Composition-based stats.
Identities = 87/249 (34%), Positives = 135/249 (54%), Gaps = 21/249 (8%)
Query: 7 LNGFQIKLIMALLMVLDHVDKIPGLLPA--GWDGIFHAVTRCVGVWFSFAAVEGFIHTRN 64
L+ F++K+I +LMVLDH+ K P GW G R V F F EGF HT++
Sbjct: 3 LDSFKLKIIAMILMVLDHLPKAFNNTPIWFGWLG------RLVAPIFFFFVAEGFFHTKS 56
Query: 65 RITYNLRLFFWAGLMFFGNVILNFLLQSKEIHNSNNIFLTLACGVLSLGIFFGFSKEKLE 124
+ Y +RLF W +MF G+ ILN+ L KE NNIFL+L VL + I ++
Sbjct: 57 KSKYLIRLFGWGAIMFLGSSILNYALPGKEPLQ-NNIFLSLGLSVLLMCI--------ID 107
Query: 125 LNGRDKWIRYGLG-AVVFLVAAFLTEGGMVMIPFMLITYGYRKNEALRNLLYLALAVLLF 183
+ K + G+ A+V + A TE + L+ Y +R+++ ++ Y+ +++ F
Sbjct: 108 YTRKSKNYKSGIPLAIVVDILALFTEASFDGVLMTLVFYFFREDKIKLSIGYILISLFEF 167
Query: 184 VMSIEVYPSLSETLSMMLYNSDWLFITVLPFLYIYNGERGYTSKWSKYFFYIFYPAHLWL 243
+M V L++ + N WL I LP + +YNG+RG +K+ KY FY FYP HLW+
Sbjct: 168 IM---VSGGGLTYLNLFVLNYQWLMIFALPIILMYNGKRGLNNKFIKYMFYAFYPVHLWI 224
Query: 244 IALIAFWVK 252
I +I+ ++K
Sbjct: 225 ITVISHFLK 233
>gi|148380772|ref|YP_001255313.1| membrane protein [Clostridium botulinum A str. ATCC 3502]
gi|153934241|ref|YP_001385056.1| hypothetical protein CLB_2755 [Clostridium botulinum A str. ATCC
19397]
gi|153936155|ref|YP_001388526.1| hypothetical protein CLC_2688 [Clostridium botulinum A str. Hall]
gi|148290256|emb|CAL84375.1| putative membrane protein [Clostridium botulinum A str. ATCC 3502]
gi|152930285|gb|ABS35785.1| putative membrane protein [Clostridium botulinum A str. ATCC 19397]
gi|152932069|gb|ABS37568.1| putative membrane protein [Clostridium botulinum A str. Hall]
Length = 266
Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats.
Identities = 87/276 (31%), Positives = 134/276 (48%), Gaps = 41/276 (14%)
Query: 3 KGNGLNGFQIKLIMALLMVLDHVDKIPGL---LPAGWDGIFHAVTRCVGVWFSFAAVEGF 59
K GL GFQ+KLI LM+ DH+ ++ G +P F+ V R V F F V+GF
Sbjct: 2 KEKGLTGFQLKLIGLFLMIFDHIHEMFGFTNNIPVA----FNWVGRIVAPIFIFMTVQGF 57
Query: 60 IHTRNRITYNLRLFFWAGLMFFGNVIL-NFLLQSKEIHNSNNIFLTLACGVLSLGIFFGF 118
IHTRNR Y +RL+ + LM GN I+ + ++ ++ NNIF TL V+ L I
Sbjct: 58 IHTRNRKKYAIRLYIGSVLMNLGNFIIPKYFQRTDDLALFNNIFTTLFMIVIYLSIIEYL 117
Query: 119 SKEKLELNGRDKWIRYGLG-----------------------AVVFLVAAFLTEGGMVMI 155
K E N I G+G AV + EGG V I
Sbjct: 118 GKSIKEKNTLG--IVKGIGLFILPIAIGFIIIMNITAPGMMYAVFIIPTPLFVEGGPVFI 175
Query: 156 PFMLITYGYRKNEALRNLLYLALAVLLFVMSIEVYPSLSETLSMMLYNSDWLFITVLPFL 215
+I Y +R+ + + ++Y L++++ +M ++ ++ N W+ I P
Sbjct: 176 LLGIIMYLFREKKKMLVIVYSILSIIIMLMGGDI-----TIQGLLFKNYQWMMIFAAPLF 230
Query: 216 YIYNGERGYTSKWSKYFFYIFYPAHLWLIALIAFWV 251
Y+YNG++G K KY FYIFYPAH+++ +I+ ++
Sbjct: 231 YLYNGKKG---KGVKYLFYIFYPAHIYIFYIISVYM 263
>gi|153940880|ref|YP_001392090.1| hypothetical protein CLI_2862 [Clostridium botulinum F str.
Langeland]
gi|152936776|gb|ABS42274.1| putative membrane protein [Clostridium botulinum F str. Langeland]
Length = 272
Score = 97.8 bits (242), Expect = 6e-19, Method: Composition-based stats.
Identities = 88/280 (31%), Positives = 138/280 (49%), Gaps = 43/280 (15%)
Query: 3 KGNGLNGFQIKLIMALLMVLDHVDKIPGL---LPAGWDGIFHAVTRCVGVWFSFAAVEGF 59
K GL GFQ+KLI LM+ DH+ ++ G +P F+ V R V F F V+GF
Sbjct: 2 KEKGLTGFQLKLIGLFLMIFDHIHEMFGFTNNIPVA----FNWVGRIVAPIFIFMTVQGF 57
Query: 60 IHTRNRITYNLRLFFWAGLMFFGN-VILNFLLQSKEIHNSNNIFLTLACGVLSLGI--FF 116
IHTRNR Y RL+ + LM FGN +I + ++ + NNIF TL V+ L I +
Sbjct: 58 IHTRNRKKYATRLYIGSVLMNFGNYLIPEYFQRTDNLGLMNNIFTTLFMIVIYLSIIEYL 117
Query: 117 GFSKEKLELNGRDKWI----------------------RYGL---GAVVFLVAAFLTEGG 151
G S +K G K I RY + A+ + L EGG
Sbjct: 118 GKSIKKKNTLGIVKGIGLFILPIAIGFIMLMILTAPIMRYAMFMRYAMFIIPTPLLVEGG 177
Query: 152 MVMIPFMLITYGYRKNEALRNLLYLALAVLLFVMSIEVYPSLSETLSMMLYNSDWLFITV 211
+ I +I Y +R+ + + ++Y L++++ +M ++ ++ N W+ I
Sbjct: 178 PIFILLGIIMYLFREKKKMLVIVYSILSIIIMLMGGDI-----TIQGLLFKNYQWMMIFA 232
Query: 212 LPFLYIYNGERGYTSKWSKYFFYIFYPAHLWLIALIAFWV 251
P Y+YNG++G K KY FY+FYPAH+++ +I+ ++
Sbjct: 233 APLFYLYNGKKG---KGVKYLFYVFYPAHIYIFYIISVYM 269
>gi|15673353|ref|NP_267527.1| hypothetical protein L9467 [Lactococcus lactis subsp. lactis
Il1403]
gi|12724356|gb|AAK05469.1|AE006369_5 HYPOTHETICAL PROTEIN [Lactococcus lactis subsp. lactis Il1403]
Length = 252
Score = 91.3 bits (225), Expect = 5e-17, Method: Composition-based stats.
Identities = 83/261 (31%), Positives = 142/261 (54%), Gaps = 27/261 (10%)
Query: 1 MKKGNGLNGFQIKLIMALLMVLDHV--DKIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEG 58
MK+ GL+G+Q+K+I + M+LDH+ + + GL I +R V F F +EG
Sbjct: 1 MKQRFGLSGYQLKIIAIIFMLLDHIYLEVLLGLPGIPDFSILDMASRFVSPLFFFLMIEG 60
Query: 59 FIHTRNRITYNLRLFFWAGLMFFGNVILNFLLQSK----EIHNSNNIFLTLACGVLSLGI 114
F +TR+R Y RL +M GN+++++L+ I N N IFL+LACG ++ +
Sbjct: 61 FFYTRSRKKYLTRLLVAGAVMALGNLVIHYLMNVSISFFTILNPN-IFLSLACGFGAVWL 119
Query: 115 FFGFSKEKLELNGRDKWIRYGLGAVVFLVA-AFLTEGGMVMIPFMLITYGYRKNEALRNL 173
++K L + ++F+ A + TE +V + + Y RK+ +
Sbjct: 120 LDTIIEKKKILL---------IFPLIFVSALSIFTEASLVALILPYLMYASRKS-GKDWI 169
Query: 174 LYLALAVLLFVMSIEVYP-----SLSETLSMMLYNSDWLFITVLPFLYIYNGER-GYTSK 227
LY+ +L + ++ + SL +++S+ N ++L ITVLPF+Y+YNG++ G +S
Sbjct: 170 LYIGTLLLSILFLLQAFSFDTSMSLWQSISL---NPEFLIITVLPFIYLYNGKKGGRSST 226
Query: 228 WSKYFFYIFYPAHLWLIALIA 248
+ KYFFY FYP H+W++ +I
Sbjct: 227 FEKYFFYGFYPIHIWILFIIG 247
>gi|116512221|ref|YP_809437.1| hypothetical protein LACR_1507 [Lactococcus lactis subsp. cremoris
SK11]
gi|116107875|gb|ABJ73015.1| hypothetical protein LACR_1507 [Lactococcus lactis subsp. cremoris
SK11]
Length = 251
Score = 84.3 bits (207), Expect = 6e-15, Method: Composition-based stats.
Identities = 82/255 (32%), Positives = 127/255 (49%), Gaps = 16/255 (6%)
Query: 1 MKKGNGLNGFQIKLIMALLMVLDHV--DKIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEG 58
MK+ GL+G+Q+K+I M+LDH+ + + GL I +R V F F +EG
Sbjct: 1 MKQRYGLSGYQLKIIAIAFMLLDHIYTEVLVGLPGIPDFSILDMASRFVSPLFFFLMIEG 60
Query: 59 FIHTRNRITYNLRLFFWAGLMFFGNVILNFLLQSKEIHNS---NNIFLTLACGVLSLGIF 115
+ +TR+R Y RL +M GN+I +F++ + + NIFL+LA G GI
Sbjct: 61 YFYTRSRQKYLSRLLTTGIVMAIGNLITHFIMNAPITFYTILNPNIFLSLAAG---FGIV 117
Query: 116 FGFSKEKLELNGRDKWIRYGLGAVVFLVAAFLTEGGMVMIPFMLITYGYRKNEALRNLLY 175
+ L+ K ++ V TE + + F + Y RK +LY
Sbjct: 118 W-----LLDTIIEKKKCLLIFPLILVSVLTLFTEASIFALVFPYLMYISRKT-GKSWILY 171
Query: 176 LALAVLLFVMSIEVYPSLSETLSMML-YNSDWLFITVLPFLYIYNGERGYTSK-WSKYFF 233
L +L + + S TL L +N ++L TVLPF+Y+YNG++G TS + KYFF
Sbjct: 172 LGTLLLSALFLSQSLSDASMTLWQKLSFNPEFLVFTVLPFIYLYNGKKGGTSSAFEKYFF 231
Query: 234 YIFYPAHLWLIALIA 248
Y FYP H+W + ++
Sbjct: 232 YGFYPIHIWFLFILG 246
>gi|125623913|ref|YP_001032396.1| hypothetical protein llmg_1082 [Lactococcus lactis subsp. cremoris
MG1363]
gi|124492721|emb|CAL97676.1| putative membrane protein [Lactococcus lactis subsp. cremoris
MG1363]
Length = 251
Score = 84.0 bits (206), Expect = 7e-15, Method: Composition-based stats.
Identities = 82/255 (32%), Positives = 127/255 (49%), Gaps = 16/255 (6%)
Query: 1 MKKGNGLNGFQIKLIMALLMVLDHV--DKIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEG 58
MK+ GL+G+Q+K+I M+LDH+ + + GL I +R V F F +EG
Sbjct: 1 MKQRFGLSGYQLKIIAIAFMLLDHIYTEVLVGLPGIPDFSILDMASRFVSPLFFFLMIEG 60
Query: 59 FIHTRNRITYNLRLFFWAGLMFFGNVILNFLLQSKEIHNS---NNIFLTLACGVLSLGIF 115
+ +TR+R Y RL +M GN+I +F++ + + NIFL+LA G GI
Sbjct: 61 YFYTRSRQKYLSRLLTTGIVMAIGNLITHFIMNAPITFYTILNPNIFLSLAAG---FGIV 117
Query: 116 FGFSKEKLELNGRDKWIRYGLGAVVFLVAAFLTEGGMVMIPFMLITYGYRKNEALRNLLY 175
+ L+ K ++ V TE + + F + Y RK +LY
Sbjct: 118 W-----LLDTIIEKKKCLLIFPLILVSVLTLFTEASIFALVFPYLMYISRKT-GKSWILY 171
Query: 176 LALAVLLFVMSIEVYPSLSETLSMML-YNSDWLFITVLPFLYIYNGERGYTSK-WSKYFF 233
L +L + + S TL L +N ++L TVLPF+Y+YNG++G TS + KYFF
Sbjct: 172 LGTLLLSALFLSQSLSDASMTLWQKLSFNPEFLVFTVLPFIYLYNGKKGGTSSAFEKYFF 231
Query: 234 YIFYPAHLWLIALIA 248
Y FYP H+W + ++
Sbjct: 232 YGFYPIHIWFLFILG 246
>gi|106885341|ref|ZP_01352703.1| conserved hypothetical protein [Clostridium phytofermentans ISDg]
gi|106767214|gb|EAT23931.1| conserved hypothetical protein [Clostridium phytofermentans ISDg]
Length = 263
Score = 82.4 bits (202), Expect = 2e-14, Method: Composition-based stats.
Identities = 83/277 (29%), Positives = 135/277 (48%), Gaps = 46/277 (16%)
Query: 3 KGNGLNGFQIKLIMALLMVLDHVDKI---PGLLPAGWDGIFHAVTRCVGVWFSFAAVEGF 59
K GL GFQ+K+I + MV DH+ + G++P F+ V R V F F VEG+
Sbjct: 2 KEKGLTGFQLKIIGLIFMVFDHIHEFFGFTGVIPVA----FNRVGRIVAPIFIFMTVEGY 57
Query: 60 IHTRNRITYNLRLFFWAGLMFFGN-VILNFLLQSKEIHNSNNIFLTLACGVLSLGIFFGF 118
HTRN+ Y LRL+ + LM GN I N+ ++ NNIF+TL + L I F
Sbjct: 58 THTRNKKKYMLRLYIGSLLMNIGNYFIPNYFQRTDSFAIMNNIFVTLFMITVYLCI-IDF 116
Query: 119 SKEKLELNGRDKWIRYGLGAVVFLVAAFLTEGGMV----------MIPFMLIT------- 161
K+ ++ + ++ +G V+FL+ L+ MV +IP L
Sbjct: 117 IKKGIK---EKRIMKIFVGGVLFLIPITLSILFMVNMENLFYLIFIIPTPLFVEGGPIFI 173
Query: 162 ------YGYRKNEALRNLLYLALAV-LLFVMSIEVYPSLSETLSMMLYNSDWLFITVLPF 214
Y R N+ + Y+A+ V ++F + ++ + N W+ + P
Sbjct: 174 GIGIIMYLLRGNKKKLLIAYIAICVAIIFTGDLSIH-------GLFFNNYQWMMVFAAPL 226
Query: 215 LYIYNGERGYTSKWSKYFFYIFYPAHLWLIALIAFWV 251
LY+YNG++G K KY FY+FYPAH+++ +++ ++
Sbjct: 227 LYLYNGKKG---KGMKYLFYVFYPAHIYVFYILSCYL 260
>gi|126699278|ref|YP_001088175.1| hypothetical protein CD1673 [Clostridium difficile 630]
gi|115250715|emb|CAJ68539.1| putative membrane protein [Clostridium difficile 630]
Length = 274
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 75/276 (27%), Positives = 122/276 (44%), Gaps = 42/276 (15%)
Query: 6 GLNGFQIKLIMALLMVLDHVDKIPGLLPAGWDG--IFHAVTRCVGVWFSFAAVEGFIHTR 63
G++GF IK++ + M DH I +P FH V R F F AVEGF HT
Sbjct: 7 GISGFTIKILALIFMTFDH---IAAFMPQTMQIPIWFHWVGRISAPLFIFMAVEGFYHTS 63
Query: 64 NRITYNLRLFFWAGLMFFGNVILNFLLQSKE----IHNSNNIFLTLACGVLSLGIFFGFS 119
NR Y RL+ W+ +M GN I+N + E I+N + +A + ++ F
Sbjct: 64 NRKKYISRLYIWSVIMAIGNQIINNVFSHPEGAIIINNIFSTLFLIAIYLQAIEFIKKFR 123
Query: 120 KEK------------------------LELNGRDKWIRYGLGAVVFLVAAFLTEGGMVMI 155
KEK L ++ I + + + FL EGG + I
Sbjct: 124 KEKEIKYFIIGLLMIIIPIILGIFTVALLFKVTNRVIALFM---ILVPVPFLVEGGPIWI 180
Query: 156 PFMLITYGYRKNEALRNLLYLALAVLLFVMSIEVYPSLSETLSMMLYNSDWLFITVLPFL 215
+I Y R + ++ Y+ + + +F SL ++ L N W+ I LP +
Sbjct: 181 ILGIIFYLCRGKKFSLSICYVLMCIFIFTTMSNGDYSLKNSI---LQNYQWMMIASLPLM 237
Query: 216 YIYNGERGYTSKWSKYFFYIFYPAHLWLIALIAFWV 251
+YN E+G K KY FY++YP H++++ ++ ++
Sbjct: 238 LLYNEEKG---KSMKYLFYLYYPIHVYILYILGIYL 270
>gi|81428906|ref|YP_395906.1| Hypothetical membrane protein [Lactobacillus sakei subsp. sakei
23K]
gi|15212473|gb|AAK92008.1|AF400065_6 hypothetical membrane protein LaaN [Lactobacillus sakei]
gi|78610548|emb|CAI55599.1| Hypothetical membrane protein [Lactobacillus sakei subsp. sakei
23K]
Length = 268
Score = 65.9 bits (159), Expect = 2e-09, Method: Composition-based stats.
Identities = 75/269 (27%), Positives = 118/269 (43%), Gaps = 40/269 (14%)
Query: 7 LNGFQIKLIMALLMVLDHVDKIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEGFIHTRNRI 66
LN F IK++ LLM +DH+ ++ AG R V F F AVEGF HTRN+
Sbjct: 9 LNNFDIKILGILLMFVDHIHQM--FAGAGVPDWVDWFGRPVATIFFFMAVEGFTHTRNQK 66
Query: 67 TYNLRLFFWAGLMFFGNVILNFLLQSKEIHNSNNIFLTLACGVLSLGIFFGFSKEKLELN 126
Y +L +M FG+ ++ +I SNNIF L VL++ + + N
Sbjct: 67 RYLTQLLIGFWVMNFGDRLVQQFFTVGDIALSNNIFTDLFIAVLAMYGIQEVTAGRRAHN 126
Query: 127 GRDKWIRYGLGAVVFLV--------------AAFLTEGGMVMIPFMLIT----------- 161
I G+ A++ + A L ++IP + +
Sbjct: 127 ANQMVI--GILAIIVPIMMSVLVILLLVNPKTAMLAANIGMVIPTITLAENSIFLYIGVF 184
Query: 162 -YGYRKNEALRNLLYLALAVLLFVMSIEVYPSLSETLSMMLY-NSDWLFITVLPFLYIYN 219
Y +R N L+ L + AV I + T + +L N+ W+ I + + +YN
Sbjct: 185 FYLFRNNRLLQCLTIIVFAV------INAGANSGFTFTGLLTTNTQWMMIFAIIPILLYN 238
Query: 220 GERGYTSKWSKYFFYIFYPAHLWLIALIA 248
G++G + K FFY FYP H+WL+ ++A
Sbjct: 239 GQKGRSMK---SFFYFFYPIHIWLLYILA 264
>gi|29377245|ref|NP_816399.1| hypothetical protein EF2771 [Enterococcus faecalis V583]
gi|29344711|gb|AAO82469.1| conserved hypothetical protein [Enterococcus faecalis V583]
Length = 261
Score = 60.5 bits (145), Expect = 9e-08, Method: Composition-based stats.
Identities = 77/276 (27%), Positives = 118/276 (42%), Gaps = 44/276 (15%)
Query: 4 GNGLNGFQIKLIMALLMVLDHVDKIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEGFIHTR 63
G L GF +K+I + MV DH+ + L G G F + R F F + EGFIHT
Sbjct: 2 GKTLTGFHLKVIGVISMVFDHLLQFFSFL--GVPGWFGWIGRIAAPIFLFESSEGFIHTS 59
Query: 64 NRITYNLRLFFWAGLMFFGNVILNFLLQSKEIHNSNNIFLTLACGVLSLGIFFGFSKEKL 123
NR Y RL +M N LN + + NNIF TL G + + F ++++
Sbjct: 60 NRRKYMFRLLLGFWIMGILNGFLNAYFSTGGLI-INNIFGTLFLGTVYMQSMDYFKQKQI 118
Query: 124 ELNGRDKWIRYGLGAVVFLVAAFLTEGGMVMIP----------------FMLITYGYRKN 167
G G + F+V ++ +V+ F LI
Sbjct: 119 -----------GKGLLWFIVPLLISALPLVVFSSPDILSNPAILIGFQIFNLIVPSLMMT 167
Query: 168 EALRNLLYLALAVLLF-------VMSIEVYPSLSET----LSMMLYNSDWLFITVLPFLY 216
E + LA+A LF + +I V +S + N W+ I +
Sbjct: 168 EGGFLFVLLAVAFYLFHGKKWLQISAIGVVALISAASYNFQELFGVNHQWMMILAAIPIV 227
Query: 217 IYNGERGYTSKWSKYFFYIFYPAHLWLIALIAFWVK 252
+YNGE+G + + FFYIFYPAH+ + A+I+F+++
Sbjct: 228 LYNGEKG---RGMRNFFYIFYPAHIAIFAIISFFMQ 260
>gi|116627726|ref|YP_820345.1| hypothetical protein STER_0924 [Streptococcus thermophilus LMD-9]
gi|116101003|gb|ABJ66149.1| Uncharacterized conserved protein [Streptococcus thermophilus
LMD-9]
Length = 274
Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats.
Identities = 75/272 (27%), Positives = 125/272 (45%), Gaps = 46/272 (16%)
Query: 7 LNGFQIKLIMALLMVLDHVDKI---PGLLPAGWDGIFHAVTRCVGVWFSFAAVEGFIHTR 63
L+GFQ+K I + MV DH+ G +P W F + R F F +EGFIHT
Sbjct: 6 LSGFQLKYIALITMVFDHIHYFFDYTGKIPI-W---FAMIGRLAAPLFLFCVIEGFIHTH 61
Query: 64 NRITYNLRLFFWA---GLMFFGNV-ILNFLLQSKEIHNSNNIFLTLACGVLSL-GIFFGF 118
NR Y L+++ A GL+ FG L+ L++ N + + A +++L GI +
Sbjct: 62 NRKKYFLKIYSLAIVMGLIQFGFYNFLHPLVRPDGFFPQNMMLSSFAILLVALQGI--AW 119
Query: 119 SKEKLELNGRDK-------------------------WIRYGLGAVVFLVAAFLTEGGMV 153
+EK L G ++ L V V F+ +GG
Sbjct: 120 IQEKKYLKGIPTLLFPLLLPWLMVPFYLLSVNKPMFGFLLNLLNFTVLPVHTFINDGGTW 179
Query: 154 MIPFMLITYGYRKNEALRNLLYLALAVLLFVMSIEVY-PSLSETLSMMLYNSDWLFITVL 212
++ + Y +N L +++++++ +M+I + PS + ++ +W+ I
Sbjct: 180 LLLTGIAMYLCHRNLKKEVLAFMSVSLVWLLMAIVLSRPSFHD---LIFKYFEWMEIFSA 236
Query: 213 PFLYIYNGERGYTSKWSKYFFYIFYPAHLWLI 244
P + YNG+RG K SKY FY+FYP H++L+
Sbjct: 237 PLMLSYNGQRG---KGSKYLFYVFYPTHIYLL 265
>gi|55822860|ref|YP_141301.1| hypothetical protein str0900 [Streptococcus thermophilus CNRZ1066]
gi|55738845|gb|AAV62486.1| hypothetical protein str0900 [Streptococcus thermophilus CNRZ1066]
Length = 274
Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats.
Identities = 75/272 (27%), Positives = 125/272 (45%), Gaps = 46/272 (16%)
Query: 7 LNGFQIKLIMALLMVLDHVDKI---PGLLPAGWDGIFHAVTRCVGVWFSFAAVEGFIHTR 63
L+GFQ+K I + MV DH+ G +P W F + R F F +EGFIHT
Sbjct: 6 LSGFQLKYIALITMVFDHIHYFFDYTGKIPI-W---FAMIGRLAAPLFLFCVIEGFIHTH 61
Query: 64 NRITYNLRLFFWA---GLMFFGNV-ILNFLLQSKEIHNSNNIFLTLACGVLSL-GIFFGF 118
NR Y L+++ A GL+ FG L+ L++ N + + A +++L GI +
Sbjct: 62 NRKKYFLKIYSLAIVMGLIQFGFYNFLHPLVRPDGFFPQNMMLSSFAILLVALQGI--AW 119
Query: 119 SKEKLELNGRDK-------------------------WIRYGLGAVVFLVAAFLTEGGMV 153
+EK L G ++ L V V F+++GG
Sbjct: 120 IQEKKYLKGIPTLLFPLLLPWLMMPFYLLSVNKPMFGFLLNLLNFTVLPVHTFISDGGTW 179
Query: 154 MIPFMLITYGYRKNEALRNLLYLALAVLLFVMSIEVY-PSLSETLSMMLYNSDWLFITVL 212
++ + Y +N L +++++++ +M I + PS + ++ +W+ I
Sbjct: 180 LLLTGIAMYLCHRNLKKEVLAFMSVSLVWLLMGIVLSRPSFHD---LIFKYFEWMEIFSA 236
Query: 213 PFLYIYNGERGYTSKWSKYFFYIFYPAHLWLI 244
P + YNG+RG K SKY FY+FYP H++L+
Sbjct: 237 PLMLSYNGQRG---KGSKYLFYVFYPTHIYLL 265
>gi|156864725|gb|EDO58156.1| hypothetical protein CLOL250_01294 [Clostridium sp. L2-50]
Length = 248
Score = 56.2 bits (134), Expect = 2e-06, Method: Composition-based stats.
Identities = 73/259 (28%), Positives = 114/259 (44%), Gaps = 38/259 (14%)
Query: 6 GLNGFQIKLIMALLMVLDHVDKIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEGFIHTRNR 65
G+ Q+KL+ + M+ DHV + L+ W I AV R F+F VEG+ HT +
Sbjct: 8 GITSNQLKLLACIFMLCDHVGFV--LMNNNW--IMRAVGRLAFPIFAFLLVEGYRHTSDI 63
Query: 66 ITYNLRLFFWAGLMFFGNVILNFLLQSKEIH-NSNNIFLTLACG--VLSLGIFFGFSK-- 120
Y +RLF +A V + + NIF TLA G VL LG +++
Sbjct: 64 RKYFIRLFLFA---LISEVPFDLASTGQVFDLQKQNIFFTLAAGLIVLYLGKVAKWNQMG 120
Query: 121 -----EKLELNGRDKWIRYGLGAVVFLVAAFL-TEGGMVMIPFM---LITYGYRKNEALR 171
+ + YG+ ++ +V + T+ G F L+T Y + LR
Sbjct: 121 AVIGIVVIMVVAEALHFDYGIAGILLIVLMYYSTQDGTAEHAFRNAGLVTGHYPFSRKLR 180
Query: 172 NLLYLALAVLLFVMSIEVYPSLSETLSMMLYNSDWLF--ITVLPFLYIYNGERGYTSKWS 229
+ A+ +S L + Y L+ + +LP + +YNGE G +K
Sbjct: 181 QNMGFAI--------------VSAVLYFLFYGIRQLYAVLAILP-ISLYNGEYGRKNKVL 225
Query: 230 KYFFYIFYPAHLWLIALIA 248
KY FY+FYPAHL ++ L+
Sbjct: 226 KYAFYVFYPAHLLILYLLG 244
>gi|55820940|ref|YP_139382.1| Conserved hypothetical, predicted membrane protein (TMS8)
[Streptococcus thermophilus LMG 18311]
gi|55736925|gb|AAV60567.1| Conserved hypothetical, predicted membrane protein (TMS8)
[Streptococcus thermophilus LMG 18311]
Length = 274
Score = 55.5 bits (132), Expect = 3e-06, Method: Composition-based stats.
Identities = 74/272 (27%), Positives = 125/272 (45%), Gaps = 46/272 (16%)
Query: 7 LNGFQIKLIMALLMVLDHVD---KIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEGFIHTR 63
L+GFQ+K I + MV DH+ G +P W F + R F F +EGFIH
Sbjct: 6 LSGFQLKYIALITMVFDHIHYFFNYTGKIPI-W---FAMIGRLAAPLFLFCVIEGFIHAH 61
Query: 64 NRITYNLRLFFWA---GLMFFGNV-ILNFLLQSKEIHNSNNIFLTLACGVLSL-GIFFGF 118
NR Y L+++ A GL+ FG L+ L++ N + + A +++L GI +
Sbjct: 62 NRKKYFLKIYSLAIVMGLIQFGFYNFLHPLVRPDGFFPQNMMLSSFAILLVALRGI--AW 119
Query: 119 SKEKLELNGRDK-------------------------WIRYGLGAVVFLVAAFLTEGGMV 153
+EK L G ++ L V V F+++GG
Sbjct: 120 IQEKKYLKGIPTLLFPLLLPWLMMPFYLLSVNKPMFGFLLNLLNFTVLPVHTFISDGGTW 179
Query: 154 MIPFMLITYGYRKNEALRNLLYLALAVLLFVMSIEVY-PSLSETLSMMLYNSDWLFITVL 212
++ + Y +N L +++++++ +M+I + PS + ++ +W+ I
Sbjct: 180 LLLTGIAMYLCHRNLKKEVLAFMSVSLVWLLMAIVLSRPSFHD---LIFKYFEWMEIFSA 236
Query: 213 PFLYIYNGERGYTSKWSKYFFYIFYPAHLWLI 244
P + YNG+RG K SKY FY+FYP H++L+
Sbjct: 237 PLMLSYNGQRG---KGSKYLFYVFYPTHIYLL 265
>gi|156864935|gb|EDO58366.1| hypothetical protein CLOL250_00910 [Clostridium sp. L2-50]
Length = 247
Score = 55.1 bits (131), Expect = 3e-06, Method: Composition-based stats.
Identities = 65/244 (26%), Positives = 108/244 (44%), Gaps = 17/244 (6%)
Query: 1 MKKGNGLNGFQIKLIMALLMVLDHVDKIPGLLPAGWDGIFHAVTRCVGVWFSFAAVEGFI 60
++K GLN +KLI + MV+ H + +AV R F VEG+
Sbjct: 13 LQKNTGLNANTLKLIAVIAMVIQHATIVFYPTENALHWTLYAVGRITAPVMCFMIVEGYY 72
Query: 61 HTRNRITYNLRLFFWAGLMFFGNVILNFLLQSKEIHNSNNIFLTLACGVLSLGIFFGFSK 120
HT N Y +RLF A L+ L F E +I L G+L+L IF +K
Sbjct: 73 HTSNIKKYLMRLFV-ATLISHIPHALAFGYSPYEFWKVTSIMWCLFLGLLALVIF---NK 128
Query: 121 EKLELNGRDKWIRYGLGAVVFLVAAFLTEGGMVMIPFMLITYGYRKNEALRNLLYLALAV 180
+++ L R +G+G L +F +++ +++ +R + + Y A +
Sbjct: 129 KEIPLILR----LFGVGICCIL--SFPGNLNCIIVLWIMAFGVFRDSNRKK---YTAFFI 179
Query: 181 LLFVMSIEVYPSLSETLSMMLYNSDWLFITVLPFLYIYNGERGYTSKWSKYFFYIFYPAH 240
+ E + S + + S + +P L +YNG+ G SKW +Y +Y+FYP H
Sbjct: 180 VTLCYLAEFWVISSHGIPWIRLVS----LLAIPLLLMYNGKLGKKSKWIQYGYYLFYPLH 235
Query: 241 LWLI 244
L ++
Sbjct: 236 LLVL 239
>gi|150005888|ref|YP_001300632.1| hypothetical protein BVU_3384 [Bacteroides vulgatus ATCC 8482]
gi|149934312|gb|ABR41010.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 231
Score = 53.9 bits (128), Expect = 9e-06, Method: Composition-based stats.
Identities = 74/254 (29%), Positives = 111/254 (43%), Gaps = 51/254 (20%)
Query: 7 LNGFQIKLIMALLMVLDHVDKIPGLLPAGWDGIFHAVTRCVGVW----FSFAAVEGFIHT 62
L+G +K+I L MV DH L+ G + + V RC G F+F EGF HT
Sbjct: 13 LSGSALKIIAVLSMVADHGAYY--LMEHG--TLLYEVMRCFGRIAFPVFAFLIAEGFRHT 68
Query: 63 RNRITYNLRLFFWAGLMFFGNVILNFLLQSKEIHNSNNIFLTLACGVLSLGIFFGFSKEK 122
RNR+ Y L+L +A + +LN ++N+ TLA GV++L F K+
Sbjct: 69 RNRMKYFLQLLGFAVVSEVPWYLLN------GADGTHNVLFTLALGVMALASFEALKKDG 122
Query: 123 LELNGRDKWIRYGLGAVVFLVAAFLTEGGMVMIPFMLITYGYRKNEALRNLLYLALAVLL 182
+ G V+ +A F T G+ + R +L + + L
Sbjct: 123 IL-----------CGTVILSIAGFATWSGV--------------DYEWRGILMMVVFYLF 157
Query: 183 FVMSIEVYPS------LSETLSMMLYNSDWLFITVLPFLYI--YNGERGYT-SKWSKYFF 233
+S +PS L MM Y + +L +L I Y+G RG+ K +KY F
Sbjct: 158 GNVSNLSFPSGRKAQLFCAFLLMMHYG---IVGALLAYLVIACYDGTRGFIHGKVAKYGF 214
Query: 234 YIFYPAHLWLIALI 247
Y FYP HL+ I ++
Sbjct: 215 YAFYPVHLFFILVM 228
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.332 0.147 0.474
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 940,908,650
Number of Sequences: 5470121
Number of extensions: 37886965
Number of successful extensions: 122334
Number of sequences better than 1.0e-05: 36
Number of HSP's better than 0.0 without gapping: 9
Number of HSP's successfully gapped in prelim test: 27
Number of HSP's that attempted gapping in prelim test: 122235
Number of HSP's gapped (non-prelim): 52
length of query: 252
length of database: 1,894,087,724
effective HSP length: 130
effective length of query: 122
effective length of database: 1,182,971,994
effective search space: 144322583268
effective search space used: 144322583268
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 128 (53.9 bits)