BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= PI1683
(337 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|156859421|gb|EDO52852.1| hypothetical protein BACUNI_036... 328 2e-88
gi|156107230|gb|EDO08975.1| hypothetical protein BACOVA_048... 326 1e-87
gi|60680266|ref|YP_210410.1| putative transmembrane protein... 323 6e-87
gi|53712072|ref|YP_098064.1| putative dolichol-P-glucose sy... 323 1e-86
gi|150004477|ref|YP_001299221.1| putative dolichol-P-glucos... 308 3e-82
gi|29349452|ref|NP_812955.1| putative dolichol-P-glucose sy... 300 1e-79
gi|153807427|ref|ZP_01960095.1| hypothetical protein BACCAC... 299 2e-79
gi|150009181|ref|YP_001303924.1| hypothetical protein BDI_2... 206 1e-51
gi|154494070|ref|ZP_02033390.1| hypothetical protein PARMER... 196 2e-48
gi|34539999|ref|NP_904478.1| hypothetical protein PG0136 [P... 164 8e-39
gi|110640198|ref|YP_680408.1| integral membrane protein [Cy... 147 1e-33
gi|89889628|ref|ZP_01201139.1| conserved hypothetical trans... 145 4e-33
gi|150024218|ref|YP_001295044.1| hypothetical protein FP010... 133 2e-29
gi|91217257|ref|ZP_01254218.1| putative dolichol-P-glucose ... 127 7e-28
gi|83855709|ref|ZP_00949238.1| putative transmembrane prote... 125 3e-27
gi|88804772|ref|ZP_01120292.1| putative dolichol-P-glucose ... 123 2e-26
gi|124009557|ref|ZP_01694231.1| membrane protein, putative ... 123 2e-26
gi|88714129|ref|ZP_01108206.1| putative dolichol-P-glucose ... 120 1e-25
gi|86143758|ref|ZP_01062134.1| hypothetical protein MED217_... 116 2e-24
gi|126646247|ref|ZP_01718764.1| putative transmembrane prot... 115 4e-24
gi|149370962|ref|ZP_01890557.1| integral membrane protein [... 114 1e-23
gi|120437861|ref|YP_863547.1| conserved hypothetical protei... 110 9e-23
gi|88801688|ref|ZP_01117216.1| hypothetical protein PI23P_0... 110 1e-22
gi|86134686|ref|ZP_01053268.1| hypothetical protein MED152_... 109 2e-22
gi|86130864|ref|ZP_01049463.1| putative transmembrane prote... 105 4e-21
gi|124007614|ref|ZP_01692318.1| membrane protein, putative ... 103 2e-20
gi|126662931|ref|ZP_01733929.1| integral membrane protein [... 94 1e-17
gi|83815240|ref|YP_445691.1| hypothetical protein SRU_1570 ... 94 1e-17
gi|148264726|ref|YP_001231432.1| conserved hypothetical pro... 74 2e-11
gi|154151310|ref|YP_001404928.1| glycosyl transferase, fami... 67 2e-09
gi|145619426|ref|ZP_01775476.1| conserved hypothetical prot... 66 3e-09
gi|110601058|ref|ZP_01389261.1| conserved hypothetical prot... 63 2e-08
gi|118745410|ref|ZP_01593385.1| conserved hypothetical prot... 60 2e-07
gi|55378611|ref|YP_136461.1| dolichol-P-glucose synthetase ... 55 5e-06
gi|124486309|ref|YP_001030925.1| hypothetical protein Mlab_... 55 5e-06
gi|78222601|ref|YP_384348.1| hypothetical protein Gmet_1389... 55 6e-06
gi|148654275|ref|YP_001274480.1| conserved hypothetical pro... 55 7e-06
>gi|156859421|gb|EDO52852.1| hypothetical protein BACUNI_03647 [Bacteroides uniformis ATCC 8492]
Length = 328
Score = 328 bits (842), Expect = 2e-88, Method: Composition-based stats.
Identities = 171/323 (52%), Positives = 234/323 (72%)
Query: 5 KLLYNIVKIVLPLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRG 64
KLL +KI+LP++LGG ILYW+YR FDF + +V++++ +W WMLLS+ FG+++ RG
Sbjct: 3 KLLKKTLKIILPILLGGFILYWVYRDFDFSKAEEVLLHQTNWWWMLLSLFFGVMSHVLRG 62
Query: 65 WRWKQVLEPMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTV 124
WRWKQ LEP+ + S CV++IF+SYA +LV+PRVGE +RCGVL +YD VSF K+LGTV
Sbjct: 63 WRWKQTLEPLDAHPKTSDCVDAIFVSYATNLVLPRVGEVSRCGVLAKYDNVSFAKSLGTV 122
Query: 125 VTERAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVV 184
VTER VDT+ +LLIT + L+Q+ VF FF TGT + ++ + ++ + V + V+
Sbjct: 123 VTERLVDTLCILLITGITFLAQMPVFFRFFEETGTKIPSLVHLVTSPWFYVALFSIIGVL 182
Query: 185 ILLYIIRRHIPIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFC 244
+LLY + R + ++KVK + IW+GV+S+K+VKN+PLFV Y++ IW YF HFY+TF+C
Sbjct: 183 VLLYFLLRMLSFFEKVKGVVLNIWEGVMSLKNVKNVPLFVLYTLLIWLCYFYHFYITFYC 242
Query: 245 FQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLIVH 304
FQ T L LV FV G+ AVIVPTPNGAGPWHFAV TM++LYGV A F LIVH
Sbjct: 243 FQFTEHLSFLAGLVMFVGGTFAVIVPTPNGAGPWHFAVITMMMLYGVNATDAGIFALIVH 302
Query: 305 TLQTLLVILLGIYAWIALAFTHK 327
+QTLLVI+LGIY + LA ++
Sbjct: 303 GIQTLLVIVLGIYGSLHLALANR 325
>gi|156107230|gb|EDO08975.1| hypothetical protein BACOVA_04833 [Bacteroides ovatus ATCC 8483]
Length = 328
Score = 326 bits (836), Expect = 1e-87, Method: Composition-based stats.
Identities = 166/312 (53%), Positives = 231/312 (74%)
Query: 16 PLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWKQVLEPMG 75
P++LGG IL+W+YR FDF + V+++ +W WML S+ FG+ AQ FRGWRW+Q LEP+
Sbjct: 14 PVVLGGFILFWVYRDFDFTKAGDVLLHGTNWWWMLFSLVFGVFAQVFRGWRWRQTLEPLD 73
Query: 76 EQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTVVTERAVDTVIV 135
+ S CVN+IF+SYA SLV+PR+GE +RCGVL +YD VSF K+LGTVVTER VDT+ +
Sbjct: 74 AFPKKSDCVNAIFISYAASLVVPRIGEVSRCGVLAKYDNVSFAKSLGTVVTERLVDTLTI 133
Query: 136 LLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVILLYIIRRHIP 195
LLIT + +L Q+ +F F +TGT + ++ + ++ + + C + V ILLY +R+ +
Sbjct: 134 LLITGITVLLQLPIFVTFLQQTGTKIPSLVHLLTSVWFYIILFCFIGVAILLYYLRKTLF 193
Query: 196 IYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFCFQETSSLGVTC 255
Y++VK + IW+G++S+K V+NIPLF+FY++AIW YFLHFY TF+CF T+ LG+
Sbjct: 194 FYERVKGFVLNIWEGIMSLKGVRNIPLFIFYTLAIWACYFLHFYFTFYCFAFTAHLGILA 253
Query: 256 ALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLIVHTLQTLLVILLG 315
ALV FV G+ AVIVPTPNGAGPWHFA+ +M++LYGV A F LIVH +QT LV+LLG
Sbjct: 254 ALVMFVGGTFAVIVPTPNGAGPWHFAIISMMMLYGVNVTDAGIFALIVHGIQTFLVVLLG 313
Query: 316 IYAWIALAFTHK 327
IY + AL+FT++
Sbjct: 314 IYGFAALSFTNR 325
>gi|60680266|ref|YP_210410.1| putative transmembrane protein [Bacteroides fragilis NCTC 9343]
gi|60491700|emb|CAH06453.1| putative transmembrane protein [Bacteroides fragilis NCTC 9343]
Length = 326
Score = 323 bits (829), Expect = 6e-87, Method: Composition-based stats.
Identities = 170/313 (54%), Positives = 228/313 (72%)
Query: 16 PLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWKQVLEPMG 75
PL+LGG ILYW+YR FDFV+ +V+ + +W WM S+ FGI AQ FRGWRW+Q LEP+G
Sbjct: 14 PLVLGGFILYWVYRDFDFVKATEVLQHGTNWWWMAFSLLFGIFAQVFRGWRWRQTLEPLG 73
Query: 76 EQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTVVTERAVDTVIV 135
R CV++IF+SYA SLV+PRVGE +RCGVL +YD VSF K+LGTVVTER VDTV +
Sbjct: 74 AFPRRRDCVDAIFISYAASLVVPRVGEVSRCGVLAKYDNVSFAKSLGTVVTERLVDTVTI 133
Query: 136 LLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVILLYIIRRHIP 195
LLIT V +L Q+ VF F +TGT + + + ++ + + C + V++LLY + R +
Sbjct: 134 LLITGVTVLLQMPVFVTFLEQTGTKIPSFMHLLTSVWFYIILFCTIGVIVLLYYLIRTLS 193
Query: 196 IYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFCFQETSSLGVTC 255
++KVK + + +G++S+++VKN+PLF+ YS IW SYFLHFY TF+CF T+ LG+
Sbjct: 194 FFEKVKGVVLNVCEGIMSLRNVKNLPLFLLYSFLIWLSYFLHFYFTFYCFAFTAHLGLLA 253
Query: 256 ALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLIVHTLQTLLVILLG 315
ALV FV G+ AVIVPTPNGAGPWHFAV TM++LYGV A F LIVH +QTLLVILLG
Sbjct: 254 ALVMFVGGTFAVIVPTPNGAGPWHFAVITMMMLYGVNATDAGIFALIVHGIQTLLVILLG 313
Query: 316 IYAWIALAFTHKE 328
+Y + ++F H++
Sbjct: 314 VYGLVTISFLHRK 326
>gi|53712072|ref|YP_098064.1| putative dolichol-P-glucose synthetase [Bacteroides fragilis YCH46]
gi|52214937|dbj|BAD47530.1| putative dolichol-P-glucose synthetase [Bacteroides fragilis YCH46]
Length = 333
Score = 323 bits (828), Expect = 1e-86, Method: Composition-based stats.
Identities = 169/313 (53%), Positives = 228/313 (72%)
Query: 16 PLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWKQVLEPMG 75
PL+LGG ILYW+YR FDFV+ +V+ + +W WM S+ FGI AQ FRGWRW+Q LEP+G
Sbjct: 21 PLVLGGFILYWVYRDFDFVKATEVLQHGTNWWWMAFSLLFGIFAQVFRGWRWRQTLEPLG 80
Query: 76 EQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTVVTERAVDTVIV 135
R CV++IF+SYA SLV+PRVGE +RCGVL +YD VSF K+LGTVVTER VDTV +
Sbjct: 81 AFPRRRDCVDAIFISYAASLVVPRVGEVSRCGVLAKYDNVSFAKSLGTVVTERLVDTVTI 140
Query: 136 LLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVILLYIIRRHIP 195
LLIT V +L Q+ VF F +TGT + + + ++ + + C + V++LLY + R +
Sbjct: 141 LLITGVTVLLQMPVFVTFLEQTGTKIPSFMHLLTSVWFYIILFCTIGVIVLLYYLIRTLS 200
Query: 196 IYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFCFQETSSLGVTC 255
++KVK + + +G++S+++VKN+PLF+ YS IW SYFLHFY TF+CF T+ LG+
Sbjct: 201 FFEKVKGVVLNVCEGIMSLRNVKNLPLFLLYSFLIWLSYFLHFYFTFYCFAFTAHLGLLA 260
Query: 256 ALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLIVHTLQTLLVILLG 315
ALV FV G+ AVIVPTPNGAGPWHFAV TM++LYGV A F LIVH +QTLLVILLG
Sbjct: 261 ALVMFVGGTFAVIVPTPNGAGPWHFAVITMMMLYGVNATDAGIFALIVHGIQTLLVILLG 320
Query: 316 IYAWIALAFTHKE 328
+Y + ++F +++
Sbjct: 321 VYGLVTISFLYRK 333
>gi|150004477|ref|YP_001299221.1| putative dolichol-P-glucose synthetase [Bacteroides vulgatus ATCC
8482]
gi|149932901|gb|ABR39599.1| putative dolichol-P-glucose synthetase [Bacteroides vulgatus ATCC
8482]
Length = 334
Score = 308 bits (790), Expect = 3e-82, Method: Composition-based stats.
Identities = 167/324 (51%), Positives = 228/324 (70%)
Query: 4 KKLLYNIVKIVLPLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFR 63
KK++ ++I PL LG IL WMY GF+F ++ +V+ M++ WML+S+ FG+ + FR
Sbjct: 7 KKIINKGLQIAFPLFLGAAILIWMYHGFNFSRVWEVLDGGMNYGWMLVSLVFGVFSHIFR 66
Query: 64 GWRWKQVLEPMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGT 123
GWRWK L P+GE + S CV +IF+SYA +LV+PRVGE +RCGVL +YD SF K+LGT
Sbjct: 67 GWRWKLTLAPLGEHPKTSDCVYAIFVSYAANLVVPRVGEISRCGVLAKYDGTSFSKSLGT 126
Query: 124 VVTERAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAV 183
VVTER +DT+ V LIT V ++ Q +VF FF TGT + F++ + +T +C LAV
Sbjct: 127 VVTERLIDTLCVSLITGVTLIMQARVFDTFFKETGTDTTVLAQVFTSGHFYITIVCVLAV 186
Query: 184 VILLYIIRRHIPIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFF 243
++L + + R++ ++ KVK L +W GV+S++ VK +PLF+ Y+V IW YFL FY++FF
Sbjct: 187 LVLAFFLIRNVTVFAKVKGILHNVWVGVLSLRHVKRMPLFILYTVGIWTCYFLQFYVSFF 246
Query: 244 CFQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLIV 303
CF + +LGV LV F VGSIAV+VPTPNGAGPWHFAV TM++LYGV A F L+V
Sbjct: 247 CFDFSDNLGVMAGLVMFAVGSIAVVVPTPNGAGPWHFAVITMMMLYGVGKEDAGIFALLV 306
Query: 304 HTLQTLLVILLGIYAWIALAFTHK 327
H +QT L+ILLGIY AL FT+K
Sbjct: 307 HGIQTFLLILLGIYGLAALPFTNK 330
>gi|29349452|ref|NP_812955.1| putative dolichol-P-glucose synthetase [Bacteroides
thetaiotaomicron VPI-5482]
gi|29341361|gb|AAO79149.1| putative dolichol-P-glucose synthetase [Bacteroides
thetaiotaomicron VPI-5482]
Length = 328
Score = 300 bits (767), Expect = 1e-79, Method: Composition-based stats.
Identities = 167/313 (53%), Positives = 227/313 (72%)
Query: 16 PLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWKQVLEPMG 75
P++LGG ILYW+YR FDF ++ +V+ + +W WML S+ FG+LAQ FRGWRW+Q LEP+
Sbjct: 14 PIVLGGFILYWVYRDFDFSRVGEVLRHGTNWWWMLFSLLFGVLAQVFRGWRWRQTLEPLD 73
Query: 76 EQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTVVTERAVDTVIV 135
R S CVN+IF+SYA SLV+PR+GE +RCGVL +YD VSF K+LGTVVTER VDT+ +
Sbjct: 74 AFPRRSDCVNAIFISYAASLVVPRIGEVSRCGVLAKYDNVSFAKSLGTVVTERLVDTLTI 133
Query: 136 LLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVILLYIIRRHIP 195
LIT + +L Q+ VF F TGT + + ++ + + C + VV+LLY +R+ +
Sbjct: 134 FLITGITVLLQMPVFVTFLENTGTKIPSFAYLLTSVWFYIVLFCFIGVVVLLYYLRKTLF 193
Query: 196 IYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFCFQETSSLGVTC 255
Y++VK + IW+G++S+K V+NIPLF+FY++AIW YF HFY TF+CF T+ LG+
Sbjct: 194 FYERVKGFVLNIWEGIMSLKGVRNIPLFIFYTLAIWACYFFHFYFTFYCFAFTAHLGILA 253
Query: 256 ALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLIVHTLQTLLVILLG 315
ALV FV G+ AVIVPTPNGAGPWHFA+ M++LYGV A F LIVH +QT LV+LLG
Sbjct: 254 ALVMFVGGTFAVIVPTPNGAGPWHFAIMEMMMLYGVNVTDAGIFALIVHGIQTFLVVLLG 313
Query: 316 IYAWIALAFTHKE 328
+Y AL FT+++
Sbjct: 314 VYGLAALPFTNRQ 326
>gi|153807427|ref|ZP_01960095.1| hypothetical protein BACCAC_01707 [Bacteroides caccae ATCC 43185]
gi|149129789|gb|EDM21001.1| hypothetical protein BACCAC_01707 [Bacteroides caccae ATCC 43185]
Length = 328
Score = 299 bits (766), Expect = 2e-79, Method: Composition-based stats.
Identities = 165/313 (52%), Positives = 231/313 (73%)
Query: 16 PLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWKQVLEPMG 75
P++LGG IL+W+Y FDF ++ +V+++ +W WM S+ FG+LAQ FRGWRWKQ+LEP+
Sbjct: 14 PIVLGGFILFWVYHNFDFTKVGEVLLHGTNWWWMFFSLLFGVLAQVFRGWRWKQMLEPLE 73
Query: 76 EQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTVVTERAVDTVIV 135
+ S CVN+IF+SYA SL++PRVGE +RCGVL +YD VSF K+LGTVVTER VDT+ +
Sbjct: 74 AFPKRSDCVNAIFISYAASLIVPRVGEVSRCGVLAKYDNVSFAKSLGTVVTERLVDTLTI 133
Query: 136 LLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVILLYIIRRHIP 195
LLIT V +L Q+ +F F +TGT + ++ + ++ + + C + V +LLY +R+ +
Sbjct: 134 LLITGVTVLLQLPIFVTFLQQTGTKIPSLLHLLTSVWFYIVLFCFIGVGMLLYYLRKTLF 193
Query: 196 IYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFCFQETSSLGVTC 255
Y++VK + IW+GV+S+K V+NIPLF FY++AIWG YF HFY TF+CF T+ L +
Sbjct: 194 FYERVKGFVLNIWEGVMSLKGVRNIPLFSFYTLAIWGCYFFHFYFTFYCFAFTAHLSILA 253
Query: 256 ALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLIVHTLQTLLVILLG 315
ALV FV G+ AVIVPTPNGAGPWHFAV +M++LYGV + A F LIVH +QT LV+LLG
Sbjct: 254 ALVMFVGGTFAVIVPTPNGAGPWHFAVISMMMLYGVNETDAGIFALIVHGIQTFLVVLLG 313
Query: 316 IYAWIALAFTHKE 328
+Y AL+ T+++
Sbjct: 314 VYGLAALSLTNRQ 326
>gi|150009181|ref|YP_001303924.1| hypothetical protein BDI_2583 [Parabacteroides distasonis ATCC
8503]
gi|149937605|gb|ABR44302.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 337
Score = 206 bits (525), Expect = 1e-51, Method: Composition-based stats.
Identities = 119/328 (36%), Positives = 195/328 (59%)
Query: 1 MGTKKLLYNIVKIVLPLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQ 60
M K +L +KI+LPL G +L+++YR D ++ +VV + + +L+S+ FG+ A
Sbjct: 1 MDFKSILRTFLKIILPLAFGCLLLWFLYRKMDITEIWRVVKEGVRYDIILVSLLFGLFAN 60
Query: 61 TFRGWRWKQVLEPMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKA 120
RG RW ++ +GE+ + S + ++ +YAV+LV+PRVGE RCG++ +YDK+SF K
Sbjct: 61 IVRGLRWGLLISSLGERFKMSNVIYAVLGNYAVNLVLPRVGEVWRCGMITKYDKISFTKL 120
Query: 121 LGTVVTERAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICA 180
LGT++ +R DT++V LIT + + I FT+FF + ++ + F++ V I
Sbjct: 121 LGTLLIDRVSDTIMVGLITLMIFIFNISFFTSFFAKNPALLEGFQSMFNSIWIYVAVIIF 180
Query: 181 LAVVILLYIIRRHIPIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYL 240
+AV+ ++ + + +K K L IW G+ S+ +++ FV ++ IWG YF +FY+
Sbjct: 181 VAVIWFVFTYMSNFTLVQKAKGMLKNIWTGMKSIWYMEHKMRFVIETLLIWGGYFCYFYI 240
Query: 241 TFFCFQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFV 300
TF+ F T LG+ L+ F + SI V VP G GPWHF V L+ +GV + A F
Sbjct: 241 TFYAFDFTKDLGIVVGLITFTMSSIGVAVPVQGGIGPWHFMVIATLVCFGVNENDAAAFA 300
Query: 301 LIVHTLQTLLVILLGIYAWIALAFTHKE 328
L+VHT+QT+ L G++ +AL T++E
Sbjct: 301 LVVHTVQTVWTGLCGLFGVVALPLTNRE 328
>gi|154494070|ref|ZP_02033390.1| hypothetical protein PARMER_03415 [Parabacteroides merdae ATCC
43184]
gi|154086330|gb|EDN85375.1| hypothetical protein PARMER_03415 [Parabacteroides merdae ATCC
43184]
Length = 338
Score = 196 bits (498), Expect = 2e-48, Method: Composition-based stats.
Identities = 116/327 (35%), Positives = 189/327 (57%)
Query: 1 MGTKKLLYNIVKIVLPLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQ 60
M K +L +KI+LPL G +L+++Y D ++ V+ + + +L S+ FG+ A
Sbjct: 1 MDFKAILRTFLKIILPLAFGCLLLWYLYSKMDIGEIWNVIRKGVRYEIILFSLLFGLGAN 60
Query: 61 TFRGWRWKQVLEPMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKA 120
RG RW ++ +G++ + + ++ +YAV+LV+PRVGE RCG++ +YDK+ F +
Sbjct: 61 IVRGLRWGLLIRSLGDKVKTCNVIYAVLGNYAVNLVLPRVGEVWRCGMITKYDKIPFTRL 120
Query: 121 LGTVVTERAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICA 180
LGT++ +R DT++V LIT I+ F +FF + +D + F++ V G+
Sbjct: 121 LGTLLIDRVSDTIMVGLITMSIIIFNFDFFHSFFAKNPALLDGFQSMFNSIWIYVAGVIF 180
Query: 181 LAVVILLYIIRRHIPIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYL 240
+A + ++ + + KK K L +W G+ S+ +K LFV ++ IW YFL+FY+
Sbjct: 181 IAGIWFIFTYMSNFTLVKKAKSMLQNVWDGMKSIWLMKRKGLFVIQTLLIWTGYFLYFYI 240
Query: 241 TFFCFQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFV 300
TF+ F T LGVT L+ F + SIAV VP G GPWHF V L+ +GV + A F
Sbjct: 241 TFYAFDFTRDLGVTVGLIAFTMSSIAVAVPVQGGIGPWHFMVIATLMCFGVKETDAAAFA 300
Query: 301 LIVHTLQTLLVILLGIYAWIALAFTHK 327
L+VHT+QT + + G++ +AL F +K
Sbjct: 301 LVVHTVQTAWLGITGLFGVVALPFVNK 327
>gi|34539999|ref|NP_904478.1| hypothetical protein PG0136 [Porphyromonas gingivalis W83]
gi|34396310|gb|AAQ65377.1| hypothetical protein PG_0136 [Porphyromonas gingivalis W83]
Length = 349
Score = 164 bits (415), Expect = 8e-39, Method: Composition-based stats.
Identities = 113/330 (34%), Positives = 179/330 (54%), Gaps = 1/330 (0%)
Query: 9 NIVKIVLPLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWK 68
++ K+++PL +GG +L+ +YR DF + K+V + +++ + S+ FG+ A RG RW+
Sbjct: 20 SVAKVIIPLAIGGLLLWLVYRKMDFSAIGKIVRDGVNYYIIAFSLLFGLAANCIRGLRWQ 79
Query: 69 QVLEPMGE-QTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTVVTE 127
++EP+ R + + +Y V++ +PR GEF RC +RY+K+ FP+ LGT+ +
Sbjct: 80 LLIEPLASPHPRKINAILTTLGNYTVNMALPRAGEFWRCAEESRYEKIPFPQLLGTLFMD 139
Query: 128 RAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVILL 187
R +D V+V LIT ++ F+ FF R F FS+ V + + LL
Sbjct: 140 RIMDLVMVGLITLSIMMGFQGFFSAFFARNPQLTQGFFTIFSSIWLYVVVVGIGLLFFLL 199
Query: 188 YIIRRHIPIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFCFQE 247
Y H+ +KV + I +G+ S+ +++ LF+ YS+ +W YF +FY TFF F
Sbjct: 200 YKYLSHVGPIRKVAALIGRILEGLRSIWHMEHKWLFILYSILLWVGYFFYFYTTFFAFDF 259
Query: 248 TSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLIVHTLQ 307
T SLG+ L+ F + SIAV VP G GPWHF V L+ +GV A F L+VHT Q
Sbjct: 260 TRSLGMGVGLISFAMSSIAVAVPVQGGVGPWHFMVIATLVAFGVTKEDAGAFALVVHTTQ 319
Query: 308 TLLVILLGIYAWIALAFTHKEALKTSNVDN 337
T+ G A L F +K+ + +N
Sbjct: 320 TVWTTAAGFVAIGLLPFVNKKYDRIKQSNN 349
>gi|110640198|ref|YP_680408.1| integral membrane protein [Cytophaga hutchinsonii ATCC 33406]
gi|110282879|gb|ABG61065.1| integral membrane protein [Cytophaga hutchinsonii ATCC 33406]
Length = 342
Score = 147 bits (370), Expect = 1e-33, Method: Composition-based stats.
Identities = 98/322 (30%), Positives = 168/322 (52%), Gaps = 17/322 (5%)
Query: 5 KLLYNIVKIVLPLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRG 64
K L + K VL LGG + +++ + D ++ + N ++ W+ +S+ ++A R
Sbjct: 3 KTLKDATKYVLMFGLGGLLFWYVIKDQDPNKIAEDFRNA-NYFWIAMSILASLVAYWSRS 61
Query: 65 WRWKQVLEPMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTV 124
RW +LEP+ + ++ Y ++++PR GE ARC +LN+ K F G+V
Sbjct: 62 VRWNLLLEPLDIKPPVYKTFLALMSGYFANVLLPRAGEVARCLMLNKMSKAPFNATFGSV 121
Query: 125 VTERAVDTVIVLLITAVAILSQI--------KVFTNFFFRTGTSVDNIFNKFSATGWLVT 176
V ER D + +L + V + + ++F F +S+ ++ +L
Sbjct: 122 VAERVFDLIALLTLIGVTFVIEFDRISTFLSEIFIEKFQNLFSSLQQMYIYLVVFAFLGI 181
Query: 177 GICALAVVILLYIIRRHI---PIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGS 233
G+CA+ L+ +R I Y+KV L+G+W+GVIS++ ++ LF+F+++ IW
Sbjct: 182 GLCAM-----LWFLRHEIRKNSTYQKVSVFLSGVWEGVISIRKLEKKWLFLFHTLLIWFC 236
Query: 234 YFLHFYLTFFCFQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVD 293
Y+L YL FF F+ T+ LGV LV VVG I + P G G +H V + L++YGV
Sbjct: 237 YYLMTYLVFFAFEPTAHLGVNAGLVLLVVGGIGMSAPVQGGIGIFHILVSSALVIYGVTK 296
Query: 294 VRALYFVLIVHTLQTLLVILLG 315
+ + ++HT QTL VI++G
Sbjct: 297 EDGISYAFLLHTSQTLTVIIIG 318
>gi|89889628|ref|ZP_01201139.1| conserved hypothetical transmembrane protein [Flavobacteria
bacterium BBFL7]
gi|89517901|gb|EAS20557.1| conserved hypothetical transmembrane protein [Flavobacteria
bacterium BBFL7]
Length = 327
Score = 145 bits (366), Expect = 4e-33, Method: Composition-based stats.
Identities = 103/330 (31%), Positives = 184/330 (55%), Gaps = 6/330 (1%)
Query: 4 KKLLYNIVKIVLPLILGGGILYWMYRGFDFVQMRKV--VMNEMDWTWMLLSMPFGILAQT 61
KK +KI+LPL LG ++Y Y F+ Q+ ++ + + D++W++L + +
Sbjct: 2 KKASLKTLKILLPLALGVFLIYISYAQFNENQINEIKSYLIDADYSWIILGITLAFFSHL 61
Query: 62 FRGWRWKQVLEPMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKAL 121
R WRW +L+ +G Q V +I YA++L+IPR GE AR V+NR D V KA+
Sbjct: 62 SRAWRWNYMLKAIGHQPAFLTNVMAIGTGYAMNLIIPRSGEVARAVVVNRIDNVPVDKAI 121
Query: 122 GTVVTERAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICAL 181
GT++ ER +D +I+L+ITA A+L V +FF ++ F K ++ ++ GI AL
Sbjct: 122 GTIIAERVLDFIILLIITATALLVSGTVIIDFF---KEHLNIAFAKADSSTIIIYGIIAL 178
Query: 182 AVVILLYIIRRHIPIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLT 241
+++ ++ R+I ++K+K+ ++G+ G ++ +K L++ +++ IW Y L FY++
Sbjct: 179 IFTVIVVVLFRYIKPFQKLKNFISGLKDGFNTIWTMKQKWLYLAHTLFIWSLYLLMFYVS 238
Query: 242 FFCFQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVL 301
F T S+ ++ L FV GS AV T G G + + + +L+L+GV +
Sbjct: 239 IFALPGTRSIPLSGILSAFVAGSFAVAF-TNGGFGAYPYFIAQVLLLFGVAETLGTSLGW 297
Query: 302 IVHTLQTLLVILLGIYAWIALAFTHKEALK 331
I+ QT LV++ G+ ++I L+ + + K
Sbjct: 298 ILWISQTALVLVYGLISFIMLSIKNSVSAK 327
>gi|150024218|ref|YP_001295044.1| hypothetical protein FP0106 [Flavobacterium psychrophilum JIP02/86]
gi|149770759|emb|CAL42224.1| Protein of unknown function [Flavobacterium psychrophilum JIP02/86]
Length = 319
Score = 133 bits (334), Expect = 2e-29, Method: Composition-based stats.
Identities = 101/327 (30%), Positives = 174/327 (53%), Gaps = 11/327 (3%)
Query: 4 KKLLYNIVKIVLPLILGGGILYWMYRGFDFVQMRKV--VMNEMDWTWMLLSMPFGILAQT 61
KK + + I LPLILG ++Y+ Y F Q+ ++ ++ ++ LS+ +
Sbjct: 2 KKKIGKWLSIGLPLILGIYLIYYKYNEFTTEQIHEIKGYFKNANYFYIYLSLVIALFGFI 61
Query: 62 FRGWRWKQVLEPMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKAL 121
R +RWK ++ +G QT + ++ + Y ++L IPR GEF+R VL Y+ + F KA
Sbjct: 62 SRAYRWKYAIQHLGYQTHFYNNLMAVCVGYFMNLTIPRSGEFSRALVLKNYENMPFDKAF 121
Query: 122 GTVVTERAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICAL 181
GT+V ER VDT+I LL ++L Q V N+ T V+N+ L +
Sbjct: 122 GTIVAERVVDTLIFLLFVFASLLFQFNVLKNYVL-TKIPVENLI-------ILASIGFVG 173
Query: 182 AVVILLYIIRRHIPIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLT 241
V++LL I + I +K+ L+G+ +G+ S+ ++ F+F+S IW +Y L FY+T
Sbjct: 174 FVLLLLLWIYSNWKIVSAIKEKLSGLIEGMSSILKMEQKWKFLFHSFFIWFTYILMFYVT 233
Query: 242 FFCFQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVL 301
F +ETS++ ++ FV G++AV T +G G + + + +LYGV D F
Sbjct: 234 IFALKETSNISFGAVIIAFVFGTLAVGF-TNSGFGVYPLLIAEIFMLYGVPDTAGTAFGW 292
Query: 302 IVHTLQTLLVILLGIYAWIALAFTHKE 328
+ QT+L+I+LG +++ L +++
Sbjct: 293 LTWASQTILMIVLGGLSFLFLPILNRK 319
>gi|91217257|ref|ZP_01254218.1| putative dolichol-P-glucose synthetase [Psychroflexus torquis ATCC
700755]
gi|91184600|gb|EAS70982.1| putative dolichol-P-glucose synthetase [Psychroflexus torquis ATCC
700755]
Length = 320
Score = 127 bits (320), Expect = 7e-28, Method: Composition-based stats.
Identities = 101/331 (30%), Positives = 166/331 (50%), Gaps = 20/331 (6%)
Query: 4 KKLLYNIVKIVLPLILGGGILYWMYRGFDFVQMRKVVMN--EMDWTWMLLSMPFGILAQT 61
KK + V+ + P++LG ++++ F ++ ++ N +D TW+ LS GI +
Sbjct: 2 KKAIKKWVRTLGPMVLGLFLIWFSLSKFSSEELGQLWFNIRNVDITWVALSFVMGIASHL 61
Query: 62 FRGWRWKQVLEPMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKAL 121
R +RW+ +LEP+ + C S+ L Y +L IPR GE R L Y+ V F K
Sbjct: 62 SRAYRWRFLLEPVQIFPKFYNCFLSLMLGYLANLGIPRSGEVLRGATLASYEDVKFEKTF 121
Query: 122 GTVVTERAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICAL 181
GT++TER VD +++L I +A LSQ FF D N F GI +
Sbjct: 122 GTIITERLVDLIMLLSIVGIASLSQSDKIFGFF------KDQNINPFFG------GIIIV 169
Query: 182 AVVILLYIIRRHIP-----IYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFL 236
+++++L I R I + +KVK + +++G+ SV +K F+ ++ IWG Y
Sbjct: 170 SMILMLVIGFRIIKNSGNTLIQKVKSFIEELFKGMKSVFSMKKKKQFLLHTFFIWGLYLG 229
Query: 237 HFYLTFFCFQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRA 296
FY+ F F +T LG+ LV F+ G+ A+ + T G G + A+ +GV +
Sbjct: 230 MFYVLKFAFPDTQDLGLQATLVAFIFGAFAMSI-TNGGIGLYPIAIGIAFNTFGVSNATG 288
Query: 297 LYFVLIVHTLQTLLVILLGIYAWIALAFTHK 327
F ++ QT LVI+LG ++I L ++
Sbjct: 289 EAFGWVMWGTQTALVIVLGGLSFIILPLLNR 319
>gi|83855709|ref|ZP_00949238.1| putative transmembrane protein [Croceibacter atlanticus HTCC2559]
gi|83849509|gb|EAP87377.1| putative transmembrane protein [Croceibacter atlanticus HTCC2559]
Length = 298
Score = 125 bits (315), Expect = 3e-27, Method: Composition-based stats.
Identities = 92/286 (32%), Positives = 149/286 (52%), Gaps = 8/286 (2%)
Query: 43 EMDWTWMLLSMPFGILAQTFRGWRWKQVLEPMGEQTRNSVCVNSIFLSYAVSLVIPRVGE 102
E D W+ +S+ G+L+ R +RWK +LEP+G + + ++ ++Y +L IPR GE
Sbjct: 19 EADKFWIAVSLILGLLSHASRAYRWKFLLEPLGYKPKFYNSFMAVMVAYLANLGIPRSGE 78
Query: 103 FARCGVLNRYDKVSFPKALGTVVTERAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVD 162
R ++ Y+KV F KA GT+V+ER D +++LL+ A AIL Q N+F D
Sbjct: 79 VLRGATISTYEKVPFEKAFGTIVSERIADLIMLLLVVATAILLQTDTLLNYF------ND 132
Query: 163 NIFNKFSATGWLVTGICALAVVILLYIIRRHIPIYKKVKDALAGIWQGVISVKDVKNIPL 222
N A G +V + + V + + H + K++ G+ G+ S+ +KN
Sbjct: 133 QDINPLMAVGIIVVLVVGVLVGVRILKSSTH-KFFVKLRTFAEGLLDGMKSILHMKNKWA 191
Query: 223 FVFYSVAIWGSYFLHFYLTFFCFQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAV 282
F+F++V IW SY L F + +C T +L + L F VGS A I T G G + FA+
Sbjct: 192 FIFHTVFIWLSYVLMFGVIKYCIPGTENLSIAGILAAFAVGSFA-ISATNGGIGVYPFAI 250
Query: 283 KTMLILYGVVDVRALYFVLIVHTLQTLLVILLGIYAWIALAFTHKE 328
+LI + + F I+ QTLL I+LG +++ L +++
Sbjct: 251 GAILIYFNIDKQDGEAFGWILWGSQTLLNIVLGGLSFLLLPILNRK 296
>gi|88804772|ref|ZP_01120292.1| putative dolichol-P-glucose synthetase [Robiginitalea biformata
HTCC2501]
gi|88785651|gb|EAR16820.1| putative dolichol-P-glucose synthetase [Robiginitalea biformata
HTCC2501]
Length = 321
Score = 123 bits (309), Expect = 2e-26, Method: Composition-based stats.
Identities = 95/314 (30%), Positives = 163/314 (51%), Gaps = 10/314 (3%)
Query: 11 VKIVLPLILGGGILYWMYRGFDFVQMRKVV--MNEMDWTWMLLSMPFGILAQTFRGWRWK 68
+K++LP+ LG ++++ Y R++V + E D W+ +S+ GIL+ R RW
Sbjct: 9 LKVILPIALGVFLIWYSYNMTSPADRREIVRYIREADLFWVGMSILIGILSHISRAIRWN 68
Query: 69 QVLEPMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTVVTER 128
+LEP+G R + I ++Y +L IPR GEF R L Y+ V F K GT+VTER
Sbjct: 69 YLLEPIGYSPRLRNNILIILMAYLANLGIPRSGEFLRATALATYEGVPFQKGFGTIVTER 128
Query: 129 AVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVILLY 188
+D +++LLI +A+ SQ + + G + +A+ L+ G V +
Sbjct: 129 VIDLLMLLLIILLALASQTDIIMGYLSENGLGL-------TASALLLGGGIVGLFVFRAF 181
Query: 189 IIRRHIPIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFCFQET 248
+ + +K++ + G+ +GV+S+ +K F+ +++ IW +Y F++ + ET
Sbjct: 182 LRKSKWAFAQKMRGFVRGLLEGVMSIFRMKRKWAFLGHTLFIWAAYIGMFWIIKYTVPET 241
Query: 249 SSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLIVHTLQT 308
SL LV F+ G+ A + TP G G + AV L+L+G+ A F I+ QT
Sbjct: 242 LSLNFWELLVAFIAGAFA-MTATPGGLGLYPIAVAQALMLFGISANSADAFGWIIWIAQT 300
Query: 309 LLVILLGIYAWIAL 322
L+V+L G +++ L
Sbjct: 301 LMVVLFGAISFLLL 314
>gi|124009557|ref|ZP_01694231.1| membrane protein, putative [Microscilla marina ATCC 23134]
gi|123984796|gb|EAY24771.1| membrane protein, putative [Microscilla marina ATCC 23134]
Length = 352
Score = 123 bits (308), Expect = 2e-26, Method: Composition-based stats.
Identities = 87/331 (26%), Positives = 164/331 (49%), Gaps = 17/331 (5%)
Query: 10 IVKIVLPLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWKQ 69
+++ V+ L + +L ++++ F+F Q+ + D+ W++LS + R RW
Sbjct: 8 VLQYVISLAIATALLLYLFQDFNF-QLIAEAFKKADYKWVVLSAVLTFTSHLMRAHRWNI 66
Query: 70 VLEPMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTVVTERA 129
+L+P+G ++ Y ++LV+PR GE AR G+L + +++ KA GT+V ER
Sbjct: 67 ILKPLGYTPSLYTSFLAVMSGYFINLVVPRGGEVARSGLLQKMERIPATKAFGTIVLERI 126
Query: 130 VDTVI----VLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVI 185
+D +I ++++ AV + ++ T+ F + + + +KF + L V+
Sbjct: 127 IDVLILGTLIVMLLAVEVDQITQIITSTF---SSKLSGLQSKFYLI-VIAGVGLLLLTVL 182
Query: 186 LLYIIRRHIP---IYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTF 242
+ + ++ I +Y K D + + +GV+S+ VKN F F+++ IW Y+L ++ F
Sbjct: 183 IFFRFKQKIQQSLLYAKGMDIVKKVLEGVVSILKVKNQGAFWFHTIGIWAMYYLMAFVLF 242
Query: 243 FCFQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILY----GVVDVRALY 298
CF TS L VVG I + P G G +H + +L+ Y L
Sbjct: 243 QCFPITSDLSPWVGFTVLVVGGIGMAAPVQGGIGAYHILITPVLMYYLASKNPQKDEVLS 302
Query: 299 FVLIVHTLQTLLVILL-GIYAWIALAFTHKE 328
V +H QTL+ I++ G+ W++ A T ++
Sbjct: 303 IVTFMHAAQTLVTIMIGGVALWLSTAMTKRQ 333
>gi|88714129|ref|ZP_01108206.1| putative dolichol-P-glucose synthetase [Flavobacteriales bacterium
HTCC2170]
gi|88707513|gb|EAQ99756.1| putative dolichol-P-glucose synthetase [Flavobacteriales bacterium
HTCC2170]
Length = 321
Score = 120 bits (300), Expect = 1e-25, Method: Composition-based stats.
Identities = 96/302 (31%), Positives = 153/302 (50%), Gaps = 10/302 (3%)
Query: 16 PLILGGGILYWMYRGFDFVQMRKVV--MNEMDWTWMLLSMPFGILAQTFRGWRWKQVLEP 73
P+ LG +++++Y Q ++++ + + D W+ LS+ GIL+ R RW +LEP
Sbjct: 14 PIALGVFLVWYLYNSTTPEQRQEILGYITQADPLWISLSIAIGILSHISRAIRWNYLLEP 73
Query: 74 MGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTVVTERAVDTV 133
+G + S V I +Y +L IPR GEF R L Y+ V F K GT+VTER VD +
Sbjct: 74 LGYSPKLSNNVLIILTAYLANLGIPRSGEFLRATALATYEDVPFEKGFGTIVTERVVDVI 133
Query: 134 IVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVILLYIIRRH 193
++L+I VA+L Q ++ F +G + F L+ I L + I L I +
Sbjct: 134 MLLIIIFVALLLQTELIIGFIKSSGIGL------FGGIIVLIISILGLFLTIRL-IKKSS 186
Query: 194 IPIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFCFQETSSLGV 253
K+K L + +G++S+ +K F+F++ IW Y F++ + ET+ L +
Sbjct: 187 SKFALKLKGFLKNLLEGILSIFKMKRKWAFIFHTFLIWACYIAMFWVIKYTVLETADLSL 246
Query: 254 TCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLIVHTLQTLLVIL 313
LV FV G+IA + T G + V L ++GV V F I+ QTL+VI+
Sbjct: 247 GALLVAFVGGAIA-MTTTNGGFIAYPAFVSKSLEIFGVSIVSGNAFGWIMWIAQTLMVIV 305
Query: 314 LG 315
G
Sbjct: 306 FG 307
>gi|86143758|ref|ZP_01062134.1| hypothetical protein MED217_00655 [Flavobacterium sp. MED217]
gi|85829801|gb|EAQ48263.1| hypothetical protein MED217_00655 [Leeuwenhoekiella blandensis
MED217]
Length = 322
Score = 116 bits (291), Expect = 2e-24, Method: Composition-based stats.
Identities = 91/329 (27%), Positives = 164/329 (49%), Gaps = 12/329 (3%)
Query: 3 TKKLLYNIVKIVLPLILGGGILYWMYRGFDFVQMRKVVMN--EMDWTWMLLSMPFGILAQ 60
+KK +KI +PL +G ++Y+ + + N + ++ S+ FG L+
Sbjct: 2 SKKSFIKFLKIAIPLGIGILVIYYSLSAATPEERATLWKNIKGANPVYIAASLVFGTLSH 61
Query: 61 TFRGWRWKQVLEPMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKA 120
R +RW+ +L+PMG + S ++ +Y +L IPR GEF R +L Y++V F KA
Sbjct: 62 LSRAYRWQYLLQPMGYHPKLSNRFMAVMAAYLANLGIPRSGEFLRGALLTTYEEVPFEKA 121
Query: 121 LGTVVTERAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICA 180
GT+++ER D +++LL+ AI Q + + + N +L+ +
Sbjct: 122 FGTIISERIADFIMLLLVVGFAITLQTDMLLTYL------KEQNINPLYTVAFLIFAVG- 174
Query: 181 LAVVILLYIIRR-HIPIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFY 239
+VI II+R I K+K+ + G+ +G+ S+ +++N F+ +++ IW Y L F+
Sbjct: 175 -GIVIGFKIIQRAQAGILVKLKNFMNGLIEGMQSILNMRNKWAFIGHTLFIWVMYVLMFW 233
Query: 240 LTFFCFQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYF 299
+ F E S L FV+GS A+ V T G G + ++ + + +G F
Sbjct: 234 VIKFTIPEISYASTAVILAAFVIGSFAISV-TNGGIGVYPISIGALFVFFGYSKEGGEAF 292
Query: 300 VLIVHTLQTLLVILLGIYAWIALAFTHKE 328
IV QTLLV++LG +++ L +++
Sbjct: 293 GWIVWGSQTLLVLVLGALSFLFLPILNRK 321
>gi|126646247|ref|ZP_01718764.1| putative transmembrane protein [Algoriphagus sp. PR1]
gi|126577879|gb|EAZ82099.1| putative transmembrane protein [Algoriphagus sp. PR1]
Length = 334
Score = 115 bits (288), Expect = 4e-24, Method: Composition-based stats.
Identities = 86/315 (27%), Positives = 158/315 (50%), Gaps = 5/315 (1%)
Query: 11 VKIVLPLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWKQV 70
+++ + L + I +++Y+ + + + +W W+ S+ RGWRW +
Sbjct: 10 IQVAISLGIAVWIFWFLYKDIAIESLIQQIKTS-NWLWIGASIFISFFGYYLRGWRWTLL 68
Query: 71 L-EPMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTVVTERA 129
+ E G++ + ++ + Y V+L+IPR GE A+CGVL R + VS LGTV+ ER+
Sbjct: 69 IHEVEGKKVTPNRGYHATMVGYLVNLLIPRAGEVAKCGVLTRTNGVSLGHLLGTVILERS 128
Query: 130 VDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVILLYI 189
VD + ++ +A + Q ++F + ++D++ + +V G + ++IL I
Sbjct: 129 VDLLCLIATIFLAFILQNELFIELAGQL-VNIDSLLASLQSNLPIVFGGLLVLILILRLI 187
Query: 190 IRRHIP--IYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFCFQE 247
+R I K++ I G+ + + N F S+ +W YF+ YL
Sbjct: 188 FKRFSDHGIINKIQHFFREIGGGLKRIGQMDNPWGFWIGSIVLWIIYFMTMYLVSLGISS 247
Query: 248 TSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLIVHTLQ 307
T++L L+ V+GSI ++ P G G +H V +LI +GV +V F I+H Q
Sbjct: 248 TANLSSGEVLLVMVMGSIGMVAPVQGGIGTFHALVAFILIQFGVSEVDGKIFAAIIHGTQ 307
Query: 308 TLLVILLGIYAWIAL 322
+LVI+LG+ +W+ +
Sbjct: 308 LILVIVLGLISWLTM 322
>gi|149370962|ref|ZP_01890557.1| integral membrane protein [unidentified eubacterium SCB49]
gi|149355748|gb|EDM44306.1| integral membrane protein [unidentified eubacterium SCB49]
Length = 323
Score = 114 bits (284), Expect = 1e-23, Method: Composition-based stats.
Identities = 92/325 (28%), Positives = 164/325 (50%), Gaps = 11/325 (3%)
Query: 5 KLLYNIVKIVLPLILGGGILYWMYRGFDFVQMR--KVVMNEMDWTWMLLSMPFGILAQTF 62
K I+KI +PL LG ++++++ F Q+ K ++ ++++++ I +
Sbjct: 3 KTFSQILKIAIPLGLGIFLIWYIFSDFTDTQVEDLKSYFRNANYGYVIIAVSLSIFSHIS 62
Query: 63 RGWRWKQVLEPMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALG 122
R +RW +LEP+G + + + SI ++Y ++L IP+ GE R +L+RY+ V F K G
Sbjct: 63 RAYRWLFMLEPLGYKPKLTNNFMSIAVAYLMNLGIPKSGEITRGVLLSRYEGVPFDKGFG 122
Query: 123 TVVTERAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALA 182
T+++ER VD + +L T +AI + + F T + I K G + + +
Sbjct: 123 TIISERVVDLIFLLAFTLLAIFLKYDI----LFAYVTELIPI-KKLLILGVVGVVLLIVG 177
Query: 183 VVILLYIIRRHIPIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTF 242
V+L Y+ K+K ++G+ GV+S+ +K FVF++ IWG Y L FY
Sbjct: 178 YVLLRYL---KFGFISKIKTFISGLKDGVLSIWTMKKKGAFVFHTFLIWGLYILSFYSAT 234
Query: 243 FCFQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLI 302
++TS + + ++ FVVGS T +G G + AV +L L+ + F I
Sbjct: 235 LALEQTSDIAFSTLIITFVVGSFTFAF-TNSGFGTYPAAVAGVLSLFAINFTVGTAFGWI 293
Query: 303 VHTLQTLLVILLGIYAWIALAFTHK 327
V ++L+GI + +AL +K
Sbjct: 294 VWASNIAAILLIGITSLVALPIYNK 318
>gi|120437861|ref|YP_863547.1| conserved hypothetical protein, membrane [Gramella forsetii KT0803]
gi|117580011|emb|CAL68480.1| conserved hypothetical protein, membrane [Gramella forsetii KT0803]
Length = 321
Score = 110 bits (276), Expect = 9e-23, Method: Composition-based stats.
Identities = 85/320 (26%), Positives = 153/320 (47%), Gaps = 10/320 (3%)
Query: 5 KLLYNIVKIVLPLILGGGILYWMYRGFDFVQMRKVVMNEMDWT--WMLLSMPFGILAQTF 62
K +KI +PL LG ++++ + + + + +D W+ +S G +
Sbjct: 3 KKFIKFLKISIPLFLGIFLIWYSLKSSTQEERENLWQSIVDANKFWIFVSFLLGATSHFS 62
Query: 63 RGWRWKQVLEPMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALG 122
R +RWK +LEPMG ++ + ++ ++Y + IPR GE R L+ Y+ V F K G
Sbjct: 63 RAYRWKFMLEPMGYKSSQANRFMAVMVAYLANFGIPRSGEVLRAVTLSTYENVPFEKGFG 122
Query: 123 TVVTERAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALA 182
T+++ER D +I++LI V ++ Q + N T + G+ +
Sbjct: 123 TIISERVADLLILMLIIGVVVILQTDDLLIYLNDQNIKPLN-------TLLIFLGLVGVI 175
Query: 183 VVILLYIIRRHIPIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTF 242
++ + + R + ++K+K G+ +G+ S+ +K F+F+++ IW Y + F++
Sbjct: 176 IIGINIVKRSNWLPFQKIKKLAKGLLEGMKSILSMKQKWAFIFHTIFIWSMYLIMFFIIK 235
Query: 243 FCFQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLI 302
ET+ + FVVGS AV T G G + AV +L +GV + A F I
Sbjct: 236 LSIPETAGTSAGIIMAAFVVGSFAV-SATNGGIGVYPLAVGGVLTFFGVQEHAAEAFGWI 294
Query: 303 VHTLQTLLVILLGIYAWIAL 322
QT +V+L G ++I L
Sbjct: 295 SWATQTFVVLLFGGLSFILL 314
>gi|88801688|ref|ZP_01117216.1| hypothetical protein PI23P_03477 [Polaribacter irgensii 23-P]
gi|88782346|gb|EAR13523.1| hypothetical protein PI23P_03477 [Polaribacter irgensii 23-P]
Length = 323
Score = 110 bits (276), Expect = 1e-22, Method: Composition-based stats.
Identities = 91/303 (30%), Positives = 150/303 (49%), Gaps = 11/303 (3%)
Query: 10 IVKIVLPLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWKQ 69
I+KIVLPL LGG L W ++ E +++++ L + FGIL+ R +RWK
Sbjct: 15 ILKIVLPLTLGG-FLVWYSLSDISLKTLGTFFKEANYSFIFLGLFFGILSHLSRAYRWKF 73
Query: 70 VLEPMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTVVTERA 129
+L P+G + + + + ++ + Y V+L IPR GE +R VL Y+++ F K GT+V ER
Sbjct: 74 MLAPLGFKPKFTNSILAVLVGYLVNLAIPRAGEVSRALVLTNYEEIPFEKGFGTIVAERI 133
Query: 130 VDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVILLYI 189
D +++L + I +F F F G + N + +V GI +I
Sbjct: 134 ADLIMMLCVVI------ITLFVQFDFIYGLLIRNFNPIKISVSLVVLGIGFFTS--YAFI 185
Query: 190 IRRHIPIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFCFQETS 249
+ K+K +AG+ +GV S+ +KN F+F++V IW Y F+ T E
Sbjct: 186 KKAKSGFLLKIKTFVAGLIEGVTSIFKMKNKWAFIFHTVFIWIMYVAMFWATIPAI-EGL 244
Query: 250 SLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLIVHTLQTL 309
+ + L+ F+ G + I T G G + AV + L L+ + A F I+ T QT
Sbjct: 245 HVPLGGILIGFIAGGFS-IAATNGGIGLYPIAVASALALFDIPTETATAFGWIMWTAQTA 303
Query: 310 LVI 312
+++
Sbjct: 304 MIV 306
>gi|86134686|ref|ZP_01053268.1| hypothetical protein MED152_08240 [Tenacibaculum sp. MED152]
gi|85821549|gb|EAQ42696.1| hypothetical protein MED152_08240 [Polaribacter dokdonensis MED152]
Length = 299
Score = 109 bits (273), Expect = 2e-22, Method: Composition-based stats.
Identities = 87/312 (27%), Positives = 148/312 (47%), Gaps = 18/312 (5%)
Query: 21 GGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWKQVLEPMGEQTRN 80
GGIL W + E +++W+ L + FGIL+ R +RWK +LEP+G + +
Sbjct: 2 GGILVWYSISKISFDVLLAYFKEANYSWIFLGLFFGILSHLSRAYRWKFMLEPLGYKPKF 61
Query: 81 SVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTVVTERAVDTVIVLLITA 140
+ V ++ + Y V+L +PR GE +R L+ Y+ V F K GT+V ER D +++LLI A
Sbjct: 62 TNSVLAVLIGYLVNLALPRAGEVSRAYALSNYENVPFEKGFGTIVAERIADLIMMLLIVA 121
Query: 141 VAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVILL--YIIRRHIPIYK 198
+ + Q N + F+ T ++ + + ++ +
Sbjct: 122 LTLFVQFDFIYNL----------LTENFNPTKIIIGLAILIIAFYIFSSFVKKAKSGFLL 171
Query: 199 KVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFCFQ--ETSSLGVTCA 256
K+K + G+ +G S+ +KN F+F++V IW Y F+ T E G+
Sbjct: 172 KIKTFVTGLIEGATSIFKMKNKWAFIFHTVFIWVMYVAMFWATIPAIDGLEVPFGGI--- 228
Query: 257 LVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLIVHTLQTLLVILLGI 316
L+ F+ G + I T G G + AV L L+ + A F I+ T QT ++I+ G
Sbjct: 229 LIAFIAGGFS-IAATNGGIGLYPIAVAGALALFDIPTEPATAFGWIMWTAQTAMIIVFGG 287
Query: 317 YAWIALAFTHKE 328
A++ L +K+
Sbjct: 288 LAFVLLPIVNKK 299
>gi|86130864|ref|ZP_01049463.1| putative transmembrane protein [Cellulophaga sp. MED134]
gi|85818275|gb|EAQ39435.1| putative transmembrane protein [Dokdonia donghaensis MED134]
Length = 318
Score = 105 bits (262), Expect = 4e-21, Method: Composition-based stats.
Identities = 97/328 (29%), Positives = 157/328 (47%), Gaps = 17/328 (5%)
Query: 1 MGTKKLLYNIVKIVLPLILGGGILYWMYRGFDFVQMRKVV---MNEMDWTWMLLSMPFGI 57
M TKK +KI+LPL+ G L W V R + + +D W+++S+ FG+
Sbjct: 1 MFTKKHTSLALKIILPLV--GVFLVWYQLNKMTVTERSDMWEAIKTVDPIWIIISLLFGL 58
Query: 58 LAQTFRGWRWKQVLEPMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSF 117
+ R +RWK +LEP+G + R S +I + Y + I R GE R L+R D + F
Sbjct: 59 FSHFSRAYRWKFLLEPLGYKPRFSSSFMAILIGYFANTFIIRSGEVLRGVSLSRTDNIPF 118
Query: 118 PKALGTVVTERAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTG 177
KA GT+V ER D +++LL+ A AI Q ++F + I L+ G
Sbjct: 119 EKAFGTIVAERIADLIVLLLVMATAITLQSTALVSYFRDNANPIGFII-------LLIVG 171
Query: 178 ICALAVVILLYIIRRHIPIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLH 237
I + + L H I KK++D G+ G+ S+ +K+ F+F+++ IW Y
Sbjct: 172 IATGIIGLRLLKTSSHSFI-KKIRDFGLGLLDGIKSILKMKHNGAFIFHTLFIWFMYIGM 230
Query: 238 FYLTFFCFQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRAL 297
F++ F +S + L FV+G+ A + T G G + A+ + +G +
Sbjct: 231 FWVMKFTVPGIASAPLGVILASFVIGAFA-MSATNAGMGVYPIAMGAIFSFFGYEG--GI 287
Query: 298 YFVLIVHTLQTLL-VILLGIYAWIALAF 324
F ++ QT+ V++ GI A I L F
Sbjct: 288 IFGWLLWGTQTIFNVVVGGICALIVLIF 315
>gi|124007614|ref|ZP_01692318.1| membrane protein, putative [Microscilla marina ATCC 23134]
gi|123986912|gb|EAY26677.1| membrane protein, putative [Microscilla marina ATCC 23134]
Length = 333
Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats.
Identities = 90/339 (26%), Positives = 164/339 (48%), Gaps = 25/339 (7%)
Query: 4 KKLLYNIVKIVLPLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFR 63
KK L+NI L+LG +L+ ++ +F + + + +++++W + + +L Q R
Sbjct: 2 KKTLFNI----FFLVLGVALLWLAFKEQNFTLIAQE-LRKVNYSWSIPVLGVSLLGQASR 56
Query: 64 GWRWKQVLEPMG-EQTRNSVCVN---SIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPK 119
RWK +L+P+ +Q + VN ++ Y V+L +PR+GE +RC + R + + F
Sbjct: 57 AMRWKLLLDPLKIDQNKPVSVVNVWWALMFGYFVNLGVPRLGEVSRCVAVRRSENIGFEA 116
Query: 120 ALGTVVTERAVDTVIVLLITAVAILSQIKVFTNFFF--------RTGTSVDNIFNKFSAT 171
+GTVV ERA+D + L+ Q + +F ++ N+ +A
Sbjct: 117 TVGTVVAERAIDVFCLFLLVVFTFFFQFDIAADFLRQYIFAPLEQSLRQKQNVLYILAAA 176
Query: 172 GWLVTGICALAVVILLYIIRRHIPIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIW 231
G + L +L I RR + K++ + + +G++S++ ++ F+ +++ IW
Sbjct: 177 GGCFLLLLGLLWRVL--IKRRW---FLKIRKFMVEVLKGLMSIRYMRRPWAFLGHTLFIW 231
Query: 232 GSYFLHFYLTFFCFQETSSLGVTCALVCFVVGSIAVIVPTPNGA-GPWHFAVKTMLIL-- 288
YFL Y FF T SLG+ L V+GSI VP G G +HF V L L
Sbjct: 232 FCYFLMTYCWFFSMPATQSLGIQGGLFLLVLGSIGKSVPIQGGGMGAYHFLVTKGLSLSP 291
Query: 289 YGVVDVRALYFVLIVHTLQTLLVILLGIYAWIALAFTHK 327
+ + AL ++H Q L +L+G A + + + ++
Sbjct: 292 FLIAATPALTLATVIHLTQVLFSLLVGAIAAVVVLYQNR 330
>gi|126662931|ref|ZP_01733929.1| integral membrane protein [Flavobacteria bacterium BAL38]
gi|126624589|gb|EAZ95279.1| integral membrane protein [Flavobacteria bacterium BAL38]
Length = 235
Score = 94.0 bits (232), Expect = 1e-17, Method: Composition-based stats.
Identities = 76/228 (33%), Positives = 122/228 (53%), Gaps = 9/228 (3%)
Query: 86 SIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTVVTERAVDTVIVLLITAVAILS 145
++ +SY ++L IPR GE +R +L +Y+KV F K GT+V ER VD +I LL +A +
Sbjct: 2 TVCVSYFINLTIPRSGEISRAALLKKYEKVPFDKGFGTIVAERIVDLLIFLLFVIIAFVL 61
Query: 146 QIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVILLYIIRRHIPIYKKVKDALA 205
Q F V+ I +L+ G + V+ + I I KK+K L+
Sbjct: 62 QFDKLYKFLIDK-LPVEKII-------YLLIGGFLVFVIFIFVWIYAEWKIIKKLKQKLS 113
Query: 206 GIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFCFQETSSLGVTCALVCFVVGSI 265
G+ +G+ S+ +K+ ++F+S IW +Y L FY+T + ETS++G ++ F+ GS+
Sbjct: 114 GLIEGMTSILKMKDKWNYIFHSFFIWFTYLLMFYVTIYALPETSNIGFDVVIMGFIFGSL 173
Query: 266 AVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLIVHTLQTLLVIL 313
AV T G G + A+ + LYG+ + + F +V T QTLL I
Sbjct: 174 AVGF-TNGGIGAYPLAIALIYSLYGIPNDVGVAFGWLVWTSQTLLTIF 220
>gi|83815240|ref|YP_445691.1| hypothetical protein SRU_1570 [Salinibacter ruber DSM 13855]
gi|83756634|gb|ABC44747.1| conserved hypothetical protein [Salinibacter ruber DSM 13855]
Length = 396
Score = 93.6 bits (231), Expect = 1e-17, Method: Composition-based stats.
Identities = 84/336 (25%), Positives = 140/336 (41%), Gaps = 46/336 (13%)
Query: 17 LILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWK---QVLEP 73
+L GG+L G D + E D+ W+L + + + FR WRW+ + L P
Sbjct: 13 FVLAGGLLALAVYGVDMGNIW-TAFREADYRWLLPLVVLVLGSNLFRAWRWQILVEALPP 71
Query: 74 MGEQTRNSV-------CVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTVVT 126
E+ ++ +S+ + Y V+ V PR+GE AR L+ F GTVV+
Sbjct: 72 PEERADHTARPRMLEASFSSVMIGYMVNYVAPRMGEVARTANLSARTPYRFSSIFGTVVS 131
Query: 127 ERAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICAL----- 181
ER DT ++ A+LS + + N +D + +F A W +L
Sbjct: 132 ERIFDTAVL----GAALLSAVGLLFN-------RLDVLREQFVAPAWARLQSISLDWLAG 180
Query: 182 -----------AVVILLYIIRRHIPIYKKV-----KDALAGIWQGVISVKDVKNIPLFVF 225
A+ ++++R +V K A +G+ ++ V
Sbjct: 181 GTLGLLLLLVLAIGGARWLLQREDSWLGRVWRTTLKPAAMSFQKGMATLVASPRRGAIVL 240
Query: 226 YSVAIWGSYFLHFYLTF--FCFQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVK 283
+V +W Y L YL F E +G+ A +G+I ++VP+P G G +H+ +
Sbjct: 241 STVGMWAGYLLMAYLPFRMLHLAEPYGIGILDAWALMAIGAIGILVPSPGGIGSYHYITE 300
Query: 284 TMLI-LYGVVDVRALYFVLIVHTLQTLLVILLGIYA 318
L+ LYGV AL + + H Q + L G+ A
Sbjct: 301 QALVHLYGVPSAEALTYAFLTHGAQLVFYTLAGLVA 336
>gi|148264726|ref|YP_001231432.1| conserved hypothetical protein 374 [Geobacter uraniumreducens Rf4]
gi|146398226|gb|ABQ26859.1| conserved hypothetical protein 374 [Geobacter uraniumreducens Rf4]
Length = 342
Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats.
Identities = 89/326 (27%), Positives = 150/326 (46%), Gaps = 42/326 (12%)
Query: 17 LILGGGI----LYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWKQVLE 72
L+LG GI L+ ++R DF ++ + +EMD+ ++L ++ ++ FR RWK +L
Sbjct: 10 LLLGIGISLFFLFLLFRKIDFNKL-VIAFSEMDYRYLLPAVVVTFVSYYFRAVRWKYLLL 68
Query: 73 PMGEQTRNSVCVNSIFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTVVTERAVDT 132
P+ + + ++ ++I A +L+ R+GEF R VL + +K+ T+V +R D
Sbjct: 69 PIKKTSMANLFPSTIIGYMANNLLPARLGEFVRAYVLGQKEKIETSAVFATLVVDRLFDG 128
Query: 133 VIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVILLYIIRR 192
VLLI V T F R ++N+ + G++ I AL V L+ + +R
Sbjct: 129 FTVLLILLV---------TFFTVRLPAGMENVQHGLVVGGYVTLAIYALVVAFLVVLKKR 179
Query: 193 -----HI----------PIYKKVKDALAGIWQGV-ISVKDVKNIPLF----VFYSVAIWG 232
H+ I +KV L G+ +S K V+ LF + ++ AIW
Sbjct: 180 TSWTIHLVGSLLKPFPARISEKVIPLLGSFISGLRLSSKPVELFALFGTSLIIWATAIWP 239
Query: 233 SYFLHFYLTFFCFQETSSLGVTCALVCFVVGSIAVIVP-TPNGAGPWHFAVKTMLILYGV 291
+H L F L +T ++ V AV+VP +P G +H A L+ + +
Sbjct: 240 ---VHMLLRSFGI----VLPITASMFIMVFLVFAVMVPASPGYVGTYHAACVYGLMAFAI 292
Query: 292 VDVRALYFVLIVHTLQTLLVILLGIY 317
+AL L++H L VI G Y
Sbjct: 293 QKEQALSVALVIHGLSFFPVICAGFY 318
>gi|154151310|ref|YP_001404928.1| glycosyl transferase, family 2 [Candidatus Methanoregula boonei
6A8]
gi|153999862|gb|ABS56285.1| glycosyl transferase, family 2 [Candidatus Methanoregula boonei
6A8]
Length = 586
Score = 67.0 bits (162), Expect = 2e-09, Method: Composition-based stats.
Identities = 71/329 (21%), Positives = 137/329 (41%), Gaps = 38/329 (11%)
Query: 13 IVLPLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWKQVLE 72
I+L L+ G IL +++ F + ++ + W W+ S + + R WRW +L
Sbjct: 263 IILALVFGLLILAFLFTLTGFSTIFSILAT-VSWPWIAASCIAILFSFVVRTWRWSVLLR 321
Query: 73 PMGEQTRNSVCVNSIFLSYAVSLVIP-RVGEFARCGVLNRYDKVSFPKALGTVVTERAVD 131
G + + S+ ++ ++P R+G+ AR L F L T+V ER +D
Sbjct: 322 SAGYVYPRDILFKCLMFSWFLNYILPARLGDIARAAALKTTSDAPFGMTLSTIVIERILD 381
Query: 132 TVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVILLYIIR 191
+ + L VA + K + + G+ V I A V++LL I +
Sbjct: 382 MITLALFLGVASMFVYKA-SFVYIEAGSFV----------------IIAAMVLVLLMIYK 424
Query: 192 RHIPIYKKVKDALAGIWQGVISVK--------DVKNIPLFVFYSVAIWGSYFLHFYLTFF 243
I + + + I Q ++ +K + + + L S+ +W L FF
Sbjct: 425 YEETIIRLFERRIPSIRQSLVLLKKGLDEIATNPEAMVLCFILSIPVW---LLEVSSIFF 481
Query: 244 C-----FQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALY 298
+ + T +V F+V ++ + TP G G ++ +L+L+ V +
Sbjct: 482 AARSVGYNLSFVYAATAGVVAFIVQALPL---TPAGLGVQEASITGVLMLFSVPSALGMS 538
Query: 299 FVLIVHTLQTLLVILLGIYAWIALAFTHK 327
L+ H + L++ ++G+ A I +AF +
Sbjct: 539 IALVDHFARGLVIYIVGLIATIHIAFASR 567
>gi|145619426|ref|ZP_01775476.1| conserved hypothetical protein 374 [Geobacter bemidjiensis Bem]
gi|144944338|gb|EDJ79413.1| conserved hypothetical protein 374 [Geobacter bemidjiensis Bem]
Length = 368
Score = 66.2 bits (160), Expect = 3e-09, Method: Composition-based stats.
Identities = 83/324 (25%), Positives = 138/324 (42%), Gaps = 32/324 (9%)
Query: 12 KIVLPLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWKQVL 71
K+++ L + L ++R DF +M MD+ ++ ++ ++ FR RWK +L
Sbjct: 37 KLLVGLAISALCLLLLFRKIDFGKMAAAFAG-MDYRYLAPAIILTFVSYYFRALRWKFLL 95
Query: 72 EPMGEQTRNSVCVNSIFLSYAVSLVIP-RVGEFARCGVLNRYDKVSFPKALGTVVTERAV 130
EP+ ++TR S S + Y + ++P R+GE R VL R + + ++V +R
Sbjct: 96 EPI-KKTRLSNLFPSTLIGYMANNLLPARLGELVRAYVLGRKEGIDTSAVFASLVVDRLC 154
Query: 131 DTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVILLYII 190
D VLL+ V T F R + + + TG VT I L V++ L ++
Sbjct: 155 DGFTVLLVL---------VATFFTIRLPAGKEGM-QQGLVTGGYVTFILYLGVLLFLALL 204
Query: 191 RRHI------------PIYKKVKDALAGIWQGVIS----VKDVKNIPLFVFYSVAIWGSY 234
R+ P K + A + IS + S IWG+
Sbjct: 205 RKRTDWTLALVTRLLAPFSAKASEKAASLLGSFISGVRMPAGTAGMAATAATSFLIWGAA 264
Query: 235 FLHFYLTFFCFQETSSLGVTCALVCFVVGSIAVIVP-TPNGAGPWHFAVKTMLILYGVVD 293
L F L + ++ F+V AV+VP +P G +H A T L + +V
Sbjct: 265 IWPIDLLLRAFGVELPLTASMFIMVFLV--FAVMVPASPGFVGTYHLACVTALSAFDIVG 322
Query: 294 VRALYFVLIVHTLQTLLVILLGIY 317
RAL +++H L L VI G++
Sbjct: 323 ERALSIAIVIHALGFLPVIPAGLF 346
>gi|110601058|ref|ZP_01389261.1| conserved hypothetical protein [Geobacter sp. FRC-32]
gi|110548235|gb|EAT61458.1| conserved hypothetical protein [Geobacter sp. FRC-32]
Length = 339
Score = 63.2 bits (152), Expect = 2e-08, Method: Composition-based stats.
Identities = 67/307 (21%), Positives = 131/307 (42%), Gaps = 28/307 (9%)
Query: 27 MYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWKQVLEPMGEQTRNSVCVNS 86
++R DF ++ + MD+ ++L + ++ R RWK +L P+ ++ +
Sbjct: 24 LFRKIDFDKVLEA-FRAMDYRYLLPVVFLTFVSYYLRAVRWKYLLLPIKRTVMANLFPAT 82
Query: 87 IFLSYAVSLVIPRVGEFARCGVLNRYDKVSFPKALGTVVTERAVDTVIVLLITAVAILSQ 146
I A +++ R+GEF R VL + +++ T+V +R D VLLI +
Sbjct: 83 IIGYMANNILPARLGEFVRAYVLGQKEEIETSAVFATLVVDRLFDGFTVLLILLI----- 137
Query: 147 IKVFTNFFFRTGTSVDNIFNKFSATGWLVTGIC-------------ALAVVILLYIIRRH 193
T F + ++N+ + G++ G+ + + ++ ++ R
Sbjct: 138 ----TFFTVKLPAGMENVQDGLKMGGYVTLGVYLVVLAFLFLLKKRTMRTIHVVGVLLRP 193
Query: 194 IP--IYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFCFQETSSL 251
P + +KV L G+ I + +F S A+W + +L F T L
Sbjct: 194 FPAKVSEKVIPLLGSFISGIRLSSRPSEILILLFTSFAVWATCIWPLHLILQSFNIT--L 251
Query: 252 GVTCALVCFVVGSIAVIVP-TPNGAGPWHFAVKTMLILYGVVDVRALYFVLIVHTLQTLL 310
+T +++ V AV+VP +P G +H A L+ + + ++L L++H
Sbjct: 252 PITASMLIMVFLVFAVMVPASPGYVGTYHAACVYGLMAFNISKEQSLSVALVMHGTGFFP 311
Query: 311 VILLGIY 317
VI+ G Y
Sbjct: 312 VIVAGFY 318
>gi|118745410|ref|ZP_01593385.1| conserved hypothetical protein 374 [Geobacter lovleyi SZ]
gi|118681696|gb|EAV88119.1| conserved hypothetical protein 374 [Geobacter lovleyi SZ]
Length = 339
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 75/314 (23%), Positives = 138/314 (43%), Gaps = 16/314 (5%)
Query: 29 RGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWKQVLEPMGEQTRNSVCVNSIF 88
R DF + + + +D +++ ++ F L+ R RW+ +L R S S+
Sbjct: 24 RKIDFHSLAEA-LRRLDLRYLVAAILFTFLSYWLRAVRWRYLL-IHERPVRLSSLYPSVI 81
Query: 89 LSYAVSLVIP-RVGEFARCGVLNRYDKVSFPKALGTVVTERAVDTVIVLLITAVAILS-Q 146
+ Y + + P R+GEF R VL +++ P ++V +R D V+++ A +L+ Q
Sbjct: 82 IGYMANNLFPARLGEFIRAWVLAEREQMQAPSVFASLVIDRLFDGFSVMVMLAGVLLTLQ 141
Query: 147 IKVF---TNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVILLYIIRRHIP--IYKKVK 201
+ + R G IF ++ + +A + LL + + P + +K
Sbjct: 142 LPPGMEQSAAVLRAGGVTTLIFYSVVIASLILLKVRPVATLALLGKLLKPFPAAVAEKCI 201
Query: 202 DALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFCFQETSSLGVTCALVCFV 261
+ G+ + ++ + S+ IW S L YL F L +T + V
Sbjct: 202 PLVGSFLGGLHVSRRSADLMAVLVSSLLIWLSATLPIYLVLVGF--GIHLPLTASFFIMV 259
Query: 262 VGSIAVIVPT-PNGAGPWHFAVKTMLILYGVVDVRALYFVLIVHTLQTLLVILLGIY-AW 319
+ AV+VP P G +H A T L +G+ D ++ L++H + VIL G+Y W
Sbjct: 260 LLVFAVMVPAAPGYIGTYHLACYTGLAAFGLPDTESVSIALVIHGVGFFPVILAGLYHVW 319
Query: 320 ---IALAFTHKEAL 330
++LA K+A+
Sbjct: 320 SQGVSLASMRKQAV 333
>gi|55378611|ref|YP_136461.1| dolichol-P-glucose synthetase [Haloarcula marismortui ATCC 43049]
gi|55231336|gb|AAV46755.1| dolichol-P-glucose synthetase [Haloarcula marismortui ATCC 43049]
Length = 605
Score = 55.5 bits (132), Expect = 5e-06, Method: Composition-based stats.
Identities = 70/303 (23%), Positives = 126/303 (41%), Gaps = 29/303 (9%)
Query: 51 LSMPFGILAQTFRGWRWKQVLEPMGEQTRNSVCVNSIFLSYAVSLVIP-RVGEFARCGVL 109
LS F ++ RG R++ +L MG + R +IF+S +LV P R G+ R V+
Sbjct: 286 LSAVFYAVSWPLRGIRYRDILVSMGYRERWDFLTGAIFISQTGNLVFPARAGDAVRAYVI 345
Query: 110 NRYDKVSFPKALGTVVTERAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFS 169
+ +P ++ ER D + + L+ V ++ + T + D + +
Sbjct: 346 KARRSIPYPSGFASLAVERVFDLLTITLLAGVVMIGLAVTGSAEQLLTALTGDAVGGDAA 405
Query: 170 ATGWLV------TGICALAVVILLYIIRR------HIPIYKKVKDALAGIWQGVIS--VK 215
++G G+ A+ V+ + R I + D+ A G+I V
Sbjct: 406 SSGRTAVAVAGGVGLAAIGAVVAIVASARSDRNLVRAGIGRLSSDSYADYVAGIIEGFVG 465
Query: 216 DVKNIPL-------FVFYSVAIWGSYFLHFYLTF--FCFQETSSLGVTCALVCFVVGSIA 266
DV+ + S+ IW + + F F + T SL V VG++A
Sbjct: 466 DVQTVAADGSAFTRVGIGSLLIWTFDVITALIVFAAFGYSLTPSL-VAVGFFAVSVGNLA 524
Query: 267 VIVP-TPNGAGPWHFAVKTMLILYGVVDVRALYFVLIV-HTLQTLLVILLGIYA--WIAL 322
++P TP G G + A ++ V V A V IV H ++ ++ I+ G+ + W+ +
Sbjct: 525 KVLPLTPGGVGLYEGAFTVIVASLTPVGVAAAIGVAIVDHAVKNIVTIIGGVASMGWLNV 584
Query: 323 AFT 325
+ T
Sbjct: 585 SLT 587
>gi|124486309|ref|YP_001030925.1| hypothetical protein Mlab_1494 [Methanocorpusculum labreanum Z]
gi|124363850|gb|ABN07658.1| conserved hypothetical protein 374 [Methanocorpusculum labreanum Z]
Length = 333
Score = 55.5 bits (132), Expect = 5e-06, Method: Composition-based stats.
Identities = 64/313 (20%), Positives = 138/313 (44%), Gaps = 18/313 (5%)
Query: 10 IVKIVLPLILGGGIL-YWMYRGFDFVQMR-KVVMNEMDWTWMLLSMPFGILAQTFRGWRW 67
+ +V+P ++ +L Y + R ++ +Q ++ + TW++ ++ +L RG+R+
Sbjct: 5 VSAVVIPTVIAAALLGYMLMRVWNELQGNLDSILESLVPTWLIAAIGICVLGWFLRGFRY 64
Query: 68 KQVLEPMGEQTRNSVCVNSIFLSYAVSLVIP-RVGEFARCGVLNRYDKVSFPKALGTVVT 126
K +++ +G + I++S +L+IP R+G+F R +L + + + +++
Sbjct: 65 KYIVKKLGTEIGIIFSTACIYVSQTANLIIPARLGDFVRMFILKHEKGMPYTNSFTSLIV 124
Query: 127 ERAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVIL 186
ER D ++VL + + L F + D + + L+ GI + +++L
Sbjct: 125 ERVYD-ILVLAVLGLCSLP--------FLISLVPEDYGWFVWLIIFVLIAGIVGIIILLL 175
Query: 187 LYIIRRHIPIYKKVKDALAGIWQGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFCFQ 246
+ I K+ + A Q + + L SV IW + YL C
Sbjct: 176 AKWMHAENKILNKILEVFAQFRQ---VSSTISALGLLSGTSVIIWMMDVITCYL--ICMM 230
Query: 247 ETSSLGVTCALVCFVVGSIAVIVP-TPNGAGPWHFAVKTMLILYGVVDVRALYFVLIVHT 305
+ L+ ++G++ VP TP G G + A+ + + GV A +I H
Sbjct: 231 LAVDIPFMLVLLAIIIGNLIKAVPITPGGIGTYEAALAIVFEIGGVASFTAFLIAVIDHL 290
Query: 306 LQTLLVILLGIYA 318
++ L+ ++ G+ +
Sbjct: 291 VKNLVTLVGGMLS 303
>gi|78222601|ref|YP_384348.1| hypothetical protein Gmet_1389 [Geobacter metallireducens GS-15]
gi|78193856|gb|ABB31623.1| conserved hypothetical protein [Geobacter metallireducens GS-15]
Length = 336
Score = 55.1 bits (131), Expect = 6e-06, Method: Composition-based stats.
Identities = 70/288 (24%), Positives = 120/288 (41%), Gaps = 33/288 (11%)
Query: 48 WMLLSMPFGILAQTFRGWRWKQVLEPMGEQTRNSVCVNSIFLSYAVSLVIP-RVGEFARC 106
W++ ++ F L+ R RWK +L P+ +QT V++ + Y + ++P R+GE R
Sbjct: 44 WLVPAVVFTFLSYLMRAVRWKYLLSPL-KQTSFPNLVSATLIGYMANNLLPARLGELVRA 102
Query: 107 GVLNRYDKVSFPKALGTVVTERAVD--TVIVLLITAVAILSQIKVFTNFFFRTGTSVDNI 164
VL + + + T+V +R D TV++LL+TAV F R +
Sbjct: 103 YVLGEQEGIGTGAVVATLVIDRLADGFTVLLLLVTAV-----------FTLRLPPGSEAA 151
Query: 165 FNKFSATGWLVTGIC-------------ALAVVILLYIIRRHIP--IYKKVKDALAGIWQ 209
A G++ G+ + + LL + R P I +KV L
Sbjct: 152 QQGLVAGGYITLGLYLAVVVFLVLLRRNTMGTIRLLERLLRPFPGKIAEKVIPFLGAFID 211
Query: 210 GVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTFFCFQETSSLGVTCALVCFVVGSIAVIV 269
G + V S+ IWG + F + V+ ++ +V AV+V
Sbjct: 212 GARITPHWRERLALVASSLVIWGFAVFPIHCVLKAFGLNFPIPVSLFIMVLLV--FAVMV 269
Query: 270 P-TPNGAGPWHFAVKTMLILYGVVDVRALYFVLIVHTLQTLLVILLGI 316
P +P G +H A L+ GV AL +++H + VI++G+
Sbjct: 270 PASPGFVGTYHAACVYGLLALGVPRGMALSVAIVIHGINFFPVIVVGL 317
>gi|148654275|ref|YP_001274480.1| conserved hypothetical protein 374 [Roseiflexus sp. RS-1]
gi|148566385|gb|ABQ88530.1| conserved hypothetical protein 374 [Roseiflexus sp. RS-1]
Length = 350
Score = 55.1 bits (131), Expect = 7e-06, Method: Composition-based stats.
Identities = 71/319 (22%), Positives = 134/319 (42%), Gaps = 20/319 (6%)
Query: 12 KIVLPLILGGGILYWMYRGFDFVQMRKVVMNEMDWTWMLLSMPFGILAQTFRGWRWKQVL 71
K+ L +++ L+ RG DF Q VM + ++ W+ + LA R WRW +L
Sbjct: 27 KLWLGIVVSAVCLWIALRGLDF-QTFWQVMRQANYWWLAPGVGVYFLAVWARTWRWHYML 85
Query: 72 EPMGEQTRNSVCVNSIF----LSYAVSLVIP-RVGEFARCGVLNRYDKVSFPKALGTVVT 126
+ + V+ +F + Y + V P R GE R VL R ++V +L TVV
Sbjct: 86 RHLA-----PIPVSRLFPIVVIGYMGNNVYPARAGEVLRSYVLRRKERVPISASLATVVL 140
Query: 127 ERAVDTVIVLLITAVAILSQIKVFTNFFFRTGTSVDNIFNKFSATGWLVTGICALAVVIL 186
ER D +++LL V + + T + +F A ++
Sbjct: 141 ERLFDGLVMLLFVFVTL--PFAPLPPVYSSLVTVLSVVFLAALAAFLVIAARPERMSRTY 198
Query: 187 LYIIRRHIPIYKKVKDALAGIW----QGVISVKDVKNIPLFVFYSVAIWGSYFLHFYLTF 242
+I+ R +P+ + + + G++ +G+ S++ +++ L S IW + ++
Sbjct: 199 AWIVERFVPV--RFRSLVHGLFDRFVEGLQSLRSPRDLALIFVSSTLIWLTETGKYWFVM 256
Query: 243 FCFQETSSLGVTCALVCFVVGSIAVIVPTPNGAGPWHFAVKTMLILYGVVDVRALYFVLI 302
F +S + L+ VV + TP G +H +L+ + V A + ++
Sbjct: 257 HAFPFETSF-LVLMLMTAVVNLFTTLPSTPGYIGTFHVPGIAVLMAFNVDQAIATSYTVV 315
Query: 303 VHTLQTLLVILLGIYAWIA 321
+H L + LG + I+
Sbjct: 316 LHVALWLPITALGAWYMIS 334
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.331 0.143 0.455
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,190,407,570
Number of Sequences: 5470121
Number of extensions: 47575013
Number of successful extensions: 159913
Number of sequences better than 1.0e-05: 54
Number of HSP's better than 0.0 without gapping: 27
Number of HSP's successfully gapped in prelim test: 27
Number of HSP's that attempted gapping in prelim test: 159810
Number of HSP's gapped (non-prelim): 57
length of query: 337
length of database: 1,894,087,724
effective HSP length: 133
effective length of query: 204
effective length of database: 1,166,561,631
effective search space: 237978572724
effective search space used: 237978572724
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 130 (54.7 bits)