BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= PG0303
(384 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|34540163|ref|NP_904642.1| hypothetical protein PG0327 [P... 715 0.0
gi|117618250|ref|YP_858282.1| transporter gate domain prote... 371 e-101
gi|145297539|ref|YP_001140380.1| hypothetical protein ASA_0... 364 4e-99
gi|148379054|ref|YP_001253595.1| membrane protein [Clostrid... 354 4e-96
gi|89211306|ref|ZP_01189677.1| conserved hypothetical prote... 353 1e-95
gi|145955939|ref|ZP_01804939.1| hypothetical protein CdifQ_... 343 1e-92
gi|126698100|ref|YP_001086997.1| hypothetical protein CD051... 341 4e-92
gi|126699377|ref|YP_001088274.1| hypothetical protein CD176... 341 6e-92
gi|153939600|ref|YP_001390422.1| hypothetical protein CLI_1... 336 2e-90
gi|153853939|ref|ZP_01995272.1| hypothetical protein DORLON... 326 2e-87
gi|150021212|ref|YP_001306566.1| hypothetical protein Tmel_... 319 2e-85
gi|154248879|ref|YP_001409704.1| membrane protein [Fervidob... 305 4e-81
gi|42527827|ref|NP_972925.1| hypothetical protein TDE2325 [... 295 4e-78
gi|153952962|ref|YP_001393727.1| hypothetical protein CKL_0... 165 4e-39
gi|153952961|ref|YP_001393726.1| hypothetical protein CKL_0... 121 9e-26
>gi|34540163|ref|NP_904642.1| hypothetical protein PG0327 [Porphyromonas gingivalis W83]
gi|34396475|gb|AAQ65541.1| hypothetical protein PG_0327 [Porphyromonas gingivalis W83]
Length = 384
Score = 715 bits (1845), Expect = 0.0, Method: Composition-based stats.
Identities = 384/384 (100%), Positives = 384/384 (100%)
Query: 1 METRSQHKKLTFRRHLFEALGFIILFFAIFGYVGHVMGLPNMLNTIMQTSYHLLLETVFY 60
METRSQHKKLTFRRHLFEALGFIILFFAIFGYVGHVMGLPNMLNTIMQTSYHLLLETVFY
Sbjct: 1 METRSQHKKLTFRRHLFEALGFIILFFAIFGYVGHVMGLPNMLNTIMQTSYHLLLETVFY 60
Query: 61 IMGITVLSGALGSLLVEFHVVDMLERILRPLMKPLYNLPGVSALGGILTFLSDNPAIISL 120
IMGITVLSGALGSLLVEFHVVDMLERILRPLMKPLYNLPGVSALGGILTFLSDNPAIISL
Sbjct: 61 IMGITVLSGALGSLLVEFHVVDMLERILRPLMKPLYNLPGVSALGGILTFLSDNPAIISL 120
Query: 121 AQDRKFCSYFKKYQLVSLTNFGTAFGMGMVVIIFMAGQGYGSGAIIGLFGAIIGSIVSTR 180
AQDRKFCSYFKKYQLVSLTNFGTAFGMGMVVIIFMAGQGYGSGAIIGLFGAIIGSIVSTR
Sbjct: 121 AQDRKFCSYFKKYQLVSLTNFGTAFGMGMVVIIFMAGQGYGSGAIIGLFGAIIGSIVSTR 180
Query: 181 LMQRSTLRRYPEMDSPVNVEEMQHEETKEHKENLFLRMLNAVLDGGKTGVDLGIAIIPGV 240
LMQRSTLRRYPEMDSPVNVEEMQHEETKEHKENLFLRMLNAVLDGGKTGVDLGIAIIPGV
Sbjct: 181 LMQRSTLRRYPEMDSPVNVEEMQHEETKEHKENLFLRMLNAVLDGGKTGVDLGIAIIPGV 240
Query: 241 LIISTAVMMLTNGPAGPDGTFAGKAYEGVGLLPALADKVDFLFRWLFGFTDSRLIAFPIT 300
LIISTAVMMLTNGPAGPDGTFAGKAYEGVGLLPALADKVDFLFRWLFGFTDSRLIAFPIT
Sbjct: 241 LIISTAVMMLTNGPAGPDGTFAGKAYEGVGLLPALADKVDFLFRWLFGFTDSRLIAFPIT 300
Query: 301 ALGAVGAALGLVPNFSAQGILDGNAVAVFTAMGMCWSGFLSTHTAMLDSLKYRRLITQAI 360
ALGAVGAALGLVPNFSAQGILDGNAVAVFTAMGMCWSGFLSTHTAMLDSLKYRRLITQAI
Sbjct: 301 ALGAVGAALGLVPNFSAQGILDGNAVAVFTAMGMCWSGFLSTHTAMLDSLKYRRLITQAI 360
Query: 361 GAHAIGGLVAGIFAHWLFVLISLI 384
GAHAIGGLVAGIFAHWLFVLISLI
Sbjct: 361 GAHAIGGLVAGIFAHWLFVLISLI 384
>gi|117618250|ref|YP_858282.1| transporter gate domain protein [Aeromonas hydrophila subsp.
hydrophila ATCC 7966]
gi|117559657|gb|ABK36605.1| transporter gate domain protein [Aeromonas hydrophila subsp.
hydrophila ATCC 7966]
Length = 479
Score = 371 bits (953), Expect = e-101, Method: Composition-based stats.
Identities = 211/385 (54%), Positives = 277/385 (71%), Gaps = 13/385 (3%)
Query: 4 RSQHKKLTFRRHLFEALGFIILFFAIFGYVGHVMGLPNMLNTIMQTSYHLLLETVFYIMG 63
+ H+ R E + +++ ++FGY+G MGL ++NT+ T++ LLL TV +IMG
Sbjct: 96 KDSHQGAASLRRWLEPVLVLVILGSLFGYLGSQMGLSALVNTLFNTAHQLLLNTVLFIMG 155
Query: 64 ITVLSGALGSLLVEFHVVDMLERILRPLMKPLYNLPGVSALGGILTFLSDNPAIISLAQD 123
ITVLSGAL LL EF V+ +LE +L PLMKP++ LPG +AL ++TF SDNPA+ISLA+D
Sbjct: 156 ITVLSGALSQLLSEFGVIRLLEVLLAPLMKPVFRLPGRTALAALMTFFSDNPAVISLAKD 215
Query: 124 RKFCSYFKKYQLVSLTNFGTAFGMGMVVIIFMA------GQGYGSGAIIGLFGAIIGSIV 177
+F F +QLVSLTNFGTAFGMG++V+ FMA G+ S A+IGL GA+IGS+V
Sbjct: 216 SRFRKGFTPWQLVSLTNFGTAFGMGLIVVTFMATLQLPGGESTASAALIGLLGALIGSVV 275
Query: 178 STRLMQRSTLRRYPEMDSPVNVEEMQ---HEETKEHKENLFLRMLNAVLDGGKTGVDLGI 234
STRLMQR + R ++P+ E + +LR+LNA+LDGGK+GV+LG+
Sbjct: 276 STRLMQR--MIRPLVGETPITDEVASVGAKPGLSQEAAPTWLRVLNALLDGGKSGVELGM 333
Query: 235 AIIPGVLIISTAVMMLTNGPAGPDGTFAGKAYEGVGLLPALADKVDFLFRWLFGFTDSRL 294
A+IPGVLIIST VM+LT GP G G ++G+A++GV LLP LA KV +LF LFGFT + L
Sbjct: 334 AVIPGVLIISTFVMLLTFGP-GEQG-YSGEAFQGVALLPVLAAKVGWLFELLFGFTHAEL 391
Query: 295 IAFPITALGAVGAALGLVPNFSAQGILDGNAVAVFTAMGMCWSGFLSTHTAMLDSLKYRR 354
+AFP+T+LGAVGAA+ LVP F +G + GN VAVFTAMGMCWSGFLSTHTAMLD+L YR
Sbjct: 392 VAFPVTSLGAVGAAMSLVPPFIGKGWITGNEVAVFTAMGMCWSGFLSTHTAMLDALGYRH 451
Query: 355 LITQAIGAHAIGGLVAGIFAHWLFV 379
L ++AI AH IGGL AG+ AH L++
Sbjct: 452 LTSRAIVAHTIGGLCAGVAAHQLYL 476
>gi|145297539|ref|YP_001140380.1| hypothetical protein ASA_0460 [Aeromonas salmonicida subsp.
salmonicida A449]
gi|142850311|gb|ABO88632.1| conserved hypothetical protein [Aeromonas salmonicida subsp.
salmonicida A449]
Length = 390
Score = 364 bits (935), Expect = 4e-99, Method: Composition-based stats.
Identities = 213/393 (54%), Positives = 277/393 (70%), Gaps = 25/393 (6%)
Query: 4 RSQHKKLTFRRHLFEALGFIILFFAIFGYVGHVMGLPNMLNTIMQTSYHLLLETVFYIMG 63
+ H T R E + + + ++FGY+G +MGL ++NT+ T++ LLL TV +IMG
Sbjct: 7 KDSHVVQTSLRRWLEPVLVLAILGSLFGYLGSLMGLSALVNTLFNTAHQLLLNTVLFIMG 66
Query: 64 ITVLSGALGSLLVEFHVVDMLERILRPLMKPLYNLPGVSALGGILTFLSDNPAIISLAQD 123
ITVLSGAL LL EF V+ +LE +L PLMKP++ LPG +AL ++TF SDNPA+ISLA+D
Sbjct: 67 ITVLSGALSQLLSEFGVIRLLEVLLAPLMKPIFRLPGRTALAALMTFFSDNPAVISLAKD 126
Query: 124 RKFCSYFKKYQLVSLTNFGTAFGMGMVVIIFMA------GQGYGSGAIIGLFGAIIGSIV 177
+F F +QLVSLTNFGTAFGMG++V+ FMA G+ S A+IGL GA+IGS+V
Sbjct: 127 SRFRKGFTPWQLVSLTNFGTAFGMGLIVVTFMATLQLPSGESTASAALIGLLGALIGSVV 186
Query: 178 STRLMQRSTLRRYPEMDSPVNVEEMQHEET---------KEHKENLFLRMLNAVLDGGKT 228
STRLMQR M P+ E + +E + +LR+LNA+LDGGK+
Sbjct: 187 STRLMQR--------MIRPLVGETLIDDEVVSVGAKPGLSQEAAPTWLRVLNALLDGGKS 238
Query: 229 GVDLGIAIIPGVLIISTAVMMLTNGPAGPDGTFAGKAYEGVGLLPALADKVDFLFRWLFG 288
GV+LG+A+IPGVLIIST VM+LT GP D + G+A++GV LLP LA KV +LF LFG
Sbjct: 239 GVELGMAVIPGVLIISTFVMLLTFGPG--DKGYTGEAFQGVALLPVLAAKVGWLFEMLFG 296
Query: 289 FTDSRLIAFPITALGAVGAALGLVPNFSAQGILDGNAVAVFTAMGMCWSGFLSTHTAMLD 348
FT + L+AFP+T+LGAVGAA+ LVP F +G + GN VAVFTAMGMCWSGFLSTHTAMLD
Sbjct: 297 FTHAELVAFPVTSLGAVGAAMSLVPPFIGKGWITGNEVAVFTAMGMCWSGFLSTHTAMLD 356
Query: 349 SLKYRRLITQAIGAHAIGGLVAGIFAHWLFVLI 381
+L YR L ++AI AH IGGL AG+ AH L++L+
Sbjct: 357 ALGYRHLTSRAIVAHTIGGLCAGVAAHQLYLLL 389
>gi|148379054|ref|YP_001253595.1| membrane protein [Clostridium botulinum A str. ATCC 3502]
gi|153932591|ref|YP_001383437.1| hypothetical protein CLB_1105 [Clostridium botulinum A str. ATCC
19397]
gi|153936777|ref|YP_001386984.1| hypothetical protein CLC_1117 [Clostridium botulinum A str. Hall]
gi|148288538|emb|CAL82618.1| putative membrane protein [Clostridium botulinum A str. ATCC 3502]
gi|152928635|gb|ABS34135.1| putative membrane protein [Clostridium botulinum A str. ATCC 19397]
gi|152932691|gb|ABS38190.1| putative membrane protein [Clostridium botulinum A str. Hall]
Length = 404
Score = 354 bits (909), Expect = 4e-96, Method: Composition-based stats.
Identities = 200/382 (52%), Positives = 265/382 (69%), Gaps = 25/382 (6%)
Query: 21 GFIIL--FFAIFGYVGHVMGLPNMLNTIMQTSYHLLLETVFYIMGITVLSGALGSLLVEF 78
GFI L F F +G MG NM NT+M+T Y LL+ETVFYIM I VL+GA+ LL EF
Sbjct: 30 GFICLALFLGFFIILGTKMGTVNMFNTLMKTGYQLLMETVFYIMAIAVLAGAISGLLSEF 89
Query: 79 HVVDMLERILRPLMKPLYNLPGVSALGGILTFLSDNPAIISLAQDRKFCSYFKKYQLVSL 138
V+ ++ + L PLMKPLY LPG SALG + T+LSDNPAIISLA+D+ F YFKK+Q+ +L
Sbjct: 90 GVISLINKGLSPLMKPLYKLPGASALGVVTTYLSDNPAIISLAKDKGFLKYFKKFQVPAL 149
Query: 139 TNFGTAFGMGMVVIIFM------AGQGYGSGAIIGLFGAIIGSIVSTRLMQRSTLRRYPE 192
TN GT+FGMG+++ FM G+ + AIIG GA+IGSIVS RLM R T + Y E
Sbjct: 150 TNLGTSFGMGLILTTFMIAQKSPTGENFVQAAIIGDIGAVIGSIVSVRLMIRHTRKYYGE 209
Query: 193 MDSPVNVEEMQHEETKEHKENLFL----------RMLNAVLDGGKTGVDLGIAIIPGVLI 242
+ +EM +E + ++L R+L ++L+GGK+GV+LG+AIIPGV+I
Sbjct: 210 -----HADEMVYESDDDGYDSLKYREVREGGVGARLLESLLEGGKSGVELGLAIIPGVVI 264
Query: 243 ISTAVMMLTNGPAGPDGTFAGKAYEGVGLLPALADKVDFLFRWLFGFTDSRLIAFPITAL 302
I T V+MLTNGP+ P G + G AYEG+ P + DK+ F+ LFGF IAFPIT+L
Sbjct: 265 ICTLVLMLTNGPS-PKG-YTGAAYEGIAFFPWVGDKLSFILNPLFGFKHPEAIAFPITSL 322
Query: 303 GAVGAALGLVPNFSAQGILDGNAVAVFTAMGMCWSGFLSTHTAMLDSLKYRRLITQAIGA 362
GAVGAA+ LVP F ++ ++ N +AVFTAMGMCWSG+LSTH AM+DSL R+L +AI +
Sbjct: 323 GAVGAAMSLVPQFLSKNLIGANEIAVFTAMGMCWSGYLSTHIAMMDSLNCRKLTGKAILS 382
Query: 363 HAIGGLVAGIFAHWLFVLISLI 384
H +GGLVAG+ AH +++L++LI
Sbjct: 383 HTVGGLVAGVSAHIIYMLVTLI 404
>gi|89211306|ref|ZP_01189677.1| conserved hypothetical protein [Halothermothrix orenii H 168]
gi|89159094|gb|EAR78771.1| conserved hypothetical protein [Halothermothrix orenii H 168]
Length = 395
Score = 353 bits (905), Expect = 1e-95, Method: Composition-based stats.
Identities = 189/395 (47%), Positives = 266/395 (67%), Gaps = 11/395 (2%)
Query: 1 METRSQHKKLTFRRHLFEALGFIILFFAIFGYVGHVMGLPNMLNTIMQTSYHLLLETVFY 60
M ++ K+ F + + + F+ + FGY G MG+ NM +T+M T+Y LL++TVFY
Sbjct: 1 MNHKTHSGKVKFLKRYLDTILFVGIIGLFFGYTGTKMGIANMFSTLMNTAYKLLMDTVFY 60
Query: 61 IMGITVLSGALGSLLVEFHVVDMLERILRPLMKPLYNLPGVSALGGILTFLSDNPAIISL 120
IM I VL+GA G + EF +V +L +I PLM+PLYNLPGVS LG + T+LSDNPAIISL
Sbjct: 61 IMAIAVLAGAFGKMATEFGLVRILNKIFAPLMRPLYNLPGVSFLGIVTTYLSDNPAIISL 120
Query: 121 AQDRKFCSYFKKYQLVSLTNFGTAFGMGMVVIIFMAGQGYGSGAIIGLFGAIIGSIVSTR 180
++R + YF + ++ L N GTAFGMG+++ FM G GY A+IG GA+IGSI+S R
Sbjct: 121 TKERGYLKYFHRVEVPCLCNLGTAFGMGLILTTFMTGLGYFKEALIGNLGAVIGSIISVR 180
Query: 181 LMQRSTLRRYPEMDSPVNVEEMQHEE-----------TKEHKENLFLRMLNAVLDGGKTG 229
+M+ + + D ++++ + E T E + R L+++L+GGK G
Sbjct: 181 IMKYRVKKYFNIEDYSTDIDQKKIENIEERDFLNLKVTPEGTNSFMERFLSSMLEGGKLG 240
Query: 230 VDLGIAIIPGVLIISTAVMMLTNGPAGPDGTFAGKAYEGVGLLPALADKVDFLFRWLFGF 289
VD+G++IIPGV+II T +M+LT GPA P + G+A+EGV LLP + + + + LFGF
Sbjct: 241 VDIGLSIIPGVVIICTVIMLLTFGPADPSTGYQGQAFEGVELLPRIGKFLSPIIKPLFGF 300
Query: 290 TDSRLIAFPITALGAVGAALGLVPNFSAQGILDGNAVAVFTAMGMCWSGFLSTHTAMLDS 349
IAFPITALGAVGAAL LVP F A+ I+ GN +AVFTAMGMCWSGFLSTH AM+D+
Sbjct: 301 KSPEAIAFPITALGAVGAALSLVPKFLAENIIGGNEIAVFTAMGMCWSGFLSTHVAMMDA 360
Query: 350 LKYRRLITQAIGAHAIGGLVAGIFAHWLFVLISLI 384
L +R+LI++AI +H +GGL AGI A++++ +LI
Sbjct: 361 LNHRKLISKAITSHFLGGLAAGIAANFIYKFFTLI 395
>gi|145955939|ref|ZP_01804939.1| hypothetical protein CdifQ_04000540 [Clostridium difficile
QCD-32g58]
Length = 392
Score = 343 bits (880), Expect = 1e-92, Method: Composition-based stats.
Identities = 195/376 (51%), Positives = 259/376 (68%), Gaps = 12/376 (3%)
Query: 18 EALGFIILFFAIFGYVGHVMGLPNMLNTIMQTSYHLLLETVFYIMGITVLSGALGSLLVE 77
E FI+L FGYVG +MG M IM T++ LLL+TVF IM + VL+GAL +LL E
Sbjct: 16 ETFVFIVLLAVGFGYVGSIMGAGMMFKVIMSTAHALLLDTVFLIMAMAVLAGALSALLSE 75
Query: 78 FHVVDMLERILRPLMKPLYNLPGVSALGGILTFLSDNPAIISLAQDRKFCSYFKKYQLVS 137
F V+ ++ +I + LM+P++ LPG S G + T+LSDNPAII A+D+ F YFKKYQ+ +
Sbjct: 76 FGVISLVNKIFKGLMRPIWGLPGASIAGVVATYLSDNPAIIPFAKDKTFTQYFKKYQVPA 135
Query: 138 LTNFGTAFGMGMVVIIFMAGQG--YGSGAIIGLFGAIIGSIVSTRLMQRSTLRRY---PE 192
L N GTAFGMG++V FM QG Y AIIG GAIIGSI+S R+M T + Y P+
Sbjct: 136 LCNLGTAFGMGLIVTTFMIAQGKEYVLPAIIGNVGAIIGSIISVRIMLTFTKKYYNYDPK 195
Query: 193 MDSP--VNVEEMQHEETKEHKE-NLFLRMLNAVLDGGKTGVDLGIAIIPGVLIISTAVMM 249
D+ +N + + EE +E ++ N+F R L+A+L+GGK GV++G+AIIPGVL++ T VM+
Sbjct: 196 NDTEKQINDKGAKLEEFREIRDGNVFQRTLDAILEGGKLGVEMGMAIIPGVLVVCTLVML 255
Query: 250 LTNGPAGPDGT----FAGKAYEGVGLLPALADKVDFLFRWLFGFTDSRLIAFPITALGAV 305
LT GP+ T + G AYEG+ LLPA+ DK+ F+ LFGFT IAFP+TALGAV
Sbjct: 256 LTFGPSTDPATGQAVYTGAAYEGIKLLPAIGDKISFIIEPLFGFTSPEAIAFPVTALGAV 315
Query: 306 GAALGLVPNFSAQGILDGNAVAVFTAMGMCWSGFLSTHTAMLDSLKYRRLITQAIGAHAI 365
GAA+ LVP F G + N +AVFTAMGMCWSG+LSTH M+D+L R L +AI +H I
Sbjct: 316 GAAISLVPEFIKSGAITPNDIAVFTAMGMCWSGYLSTHIGMMDALDARPLAGKAILSHTI 375
Query: 366 GGLVAGIFAHWLFVLI 381
GGL AGI AH++F+L+
Sbjct: 376 GGLCAGICAHFIFMLV 391
>gi|126698100|ref|YP_001086997.1| hypothetical protein CD0519 [Clostridium difficile 630]
gi|115249537|emb|CAJ67353.1| putative membrane protein [Clostridium difficile 630]
Length = 392
Score = 341 bits (875), Expect = 4e-92, Method: Composition-based stats.
Identities = 194/376 (51%), Positives = 258/376 (68%), Gaps = 12/376 (3%)
Query: 18 EALGFIILFFAIFGYVGHVMGLPNMLNTIMQTSYHLLLETVFYIMGITVLSGALGSLLVE 77
E FI+L FGYVG +MG M IM T++ LLL+TVF IM + VL+GAL +LL E
Sbjct: 16 ETFVFIVLLAVGFGYVGSIMGAGMMFKVIMSTAHALLLDTVFLIMAMAVLAGALSALLSE 75
Query: 78 FHVVDMLERILRPLMKPLYNLPGVSALGGILTFLSDNPAIISLAQDRKFCSYFKKYQLVS 137
F V+ ++ +I + LM+P++ LPG S G + T+LSDNPAII A+D+ F YFKKYQ+ +
Sbjct: 76 FGVISLVNKIFKGLMRPIWGLPGASIAGVVATYLSDNPAIIPFAKDKTFTQYFKKYQVPA 135
Query: 138 LTNFGTAFGMGMVVIIFMAGQG--YGSGAIIGLFGAIIGSIVSTRLMQRSTLRRY---PE 192
L N GTAFGMG++V FM QG Y AIIG AIIGSI+S R+M T + Y P+
Sbjct: 136 LCNLGTAFGMGLIVTTFMIAQGKEYVLPAIIGNVAAIIGSIISVRIMLTFTKKYYNYDPK 195
Query: 193 MDSP--VNVEEMQHEETKEHKE-NLFLRMLNAVLDGGKTGVDLGIAIIPGVLIISTAVMM 249
D+ +N + + EE +E ++ N+F R L+A+L+GGK GV++G+AIIPGVL++ T VM+
Sbjct: 196 NDTEKQINDKGAKLEEFREIRDGNVFQRTLDAILEGGKLGVEMGMAIIPGVLVVCTLVML 255
Query: 250 LTNGPAGPDGT----FAGKAYEGVGLLPALADKVDFLFRWLFGFTDSRLIAFPITALGAV 305
LT GP+ T + G AYEG+ LLPA+ DK+ F+ LFGFT IAFP+TALGAV
Sbjct: 256 LTFGPSTDPATGQAVYTGAAYEGIKLLPAIGDKISFIIEPLFGFTSPEAIAFPVTALGAV 315
Query: 306 GAALGLVPNFSAQGILDGNAVAVFTAMGMCWSGFLSTHTAMLDSLKYRRLITQAIGAHAI 365
GAA+ LVP F G + N +AVFTAMGMCWSG+LSTH M+D+L R L +AI +H I
Sbjct: 316 GAAISLVPEFIKSGAITPNDIAVFTAMGMCWSGYLSTHIGMMDALDARPLAGKAILSHTI 375
Query: 366 GGLVAGIFAHWLFVLI 381
GGL AGI AH++F+L+
Sbjct: 376 GGLCAGICAHFIFMLV 391
>gi|126699377|ref|YP_001088274.1| hypothetical protein CD1768 [Clostridium difficile 630]
gi|145954649|ref|ZP_01803654.1| hypothetical protein CdifQ_04002045 [Clostridium difficile
QCD-32g58]
gi|115250814|emb|CAJ68638.1| putative membrane protein [Clostridium difficile 630]
Length = 389
Score = 341 bits (874), Expect = 6e-92, Method: Composition-based stats.
Identities = 198/387 (51%), Positives = 256/387 (66%), Gaps = 13/387 (3%)
Query: 8 KKLTFRRHL----FEALGFIILFFAIFGYVGHVMGLPNMLNTIMQTSYHLLLETVFYIMG 63
+KLT RR + E F+IL FGYVG +MG M IM T++ LLLETVF IM
Sbjct: 2 EKLTDRRQVKAVSMETFIFLILLVVGFGYVGSIMGAGMMFKVIMSTAHALLLETVFLIMA 61
Query: 64 ITVLSGALGSLLVEFHVVDMLERILRPLMKPLYNLPGVSALGGILTFLSDNPAIISLAQD 123
+ VL+GAL +LL EF V+ ++ +I MKPLY LPG S G I T+LSDNPAII A+D
Sbjct: 62 MAVLAGALSALLSEFGVIALINKIFAVFMKPLYGLPGASIAGAITTYLSDNPAIIPFAKD 121
Query: 124 RKFCSYFKKYQLVSLTNFGTAFGMGMVVIIFMAGQG--YGSGAIIGLFGAIIGSIVSTRL 181
+ F YFK+YQ+ +L N GTAFGMG+++ FM QG Y A+IG GAIIGS++S R+
Sbjct: 122 KTFTQYFKQYQVPALCNLGTAFGMGLILTTFMISQGTEYVLPALIGNLGAIIGSVISVRI 181
Query: 182 MQRSTLRRY---PEMDSPVNVEEMQHEETKEHKENLFLRMLNAVLDGGKTGVDLGIAIIP 238
M T + Y PE D E + E + + N+F R L+A+L+GGK GVD+G+AIIP
Sbjct: 182 MLTFTKKFYKYNPEEDKATGTLEKKDEFREIREGNVFQRALDAILEGGKMGVDMGMAIIP 241
Query: 239 GVLIISTAVMMLTNGPAGPDGT----FAGKAYEGVGLLPALADKVDFLFRWLFGFTDSRL 294
GVL++ T VM+LT GP+ T + G AYEG+ LLP + DK+ F+ LFGFT
Sbjct: 242 GVLVVCTLVMLLTFGPSTDPVTGQEVYTGAAYEGIKLLPVIGDKLGFILEPLFGFTSPEA 301
Query: 295 IAFPITALGAVGAALGLVPNFSAQGILDGNAVAVFTAMGMCWSGFLSTHTAMLDSLKYRR 354
IAFPIT+LGAVGAA+ LVP F G + N +AVFTAMGMCWSG+LSTH M+D+L R+
Sbjct: 302 IAFPITSLGAVGAAMSLVPEFIKSGAITPNDIAVFTAMGMCWSGYLSTHIGMMDALNARQ 361
Query: 355 LITQAIGAHAIGGLVAGIFAHWLFVLI 381
L +AI +H IGGL AG AH++F L+
Sbjct: 362 LAGKAILSHTIGGLCAGAAAHFIFTLV 388
>gi|153939600|ref|YP_001390422.1| hypothetical protein CLI_1156 [Clostridium botulinum F str.
Langeland]
gi|152935496|gb|ABS40994.1| putative membrane protein [Clostridium botulinum F str. Langeland]
Length = 404
Score = 336 bits (861), Expect = 2e-90, Method: Composition-based stats.
Identities = 202/382 (52%), Positives = 266/382 (69%), Gaps = 25/382 (6%)
Query: 21 GFIIL--FFAIFGYVGHVMGLPNMLNTIMQTSYHLLLETVFYIMGITVLSGALGSLLVEF 78
GFI L F F +G MG NM NT+M+T Y LL+ETVFYIM I VL+GA+ LL EF
Sbjct: 30 GFICLALFLGFFIILGTKMGTVNMFNTLMKTGYQLLMETVFYIMAIAVLAGAISGLLSEF 89
Query: 79 HVVDMLERILRPLMKPLYNLPGVSALGGILTFLSDNPAIISLAQDRKFCSYFKKYQLVSL 138
V+ ++ + L PLMKPLY LPG +ALG + T+LSDNPAIISLA+D+ F YFKKYQ+ +L
Sbjct: 90 GVISLINKGLSPLMKPLYKLPGAAALGVVTTYLSDNPAIISLAKDKGFLKYFKKYQVPAL 149
Query: 139 TNFGTAFGMGMVVIIFMAGQGYGSG------AIIGLFGAIIGSIVSTRLMQRSTLRRYPE 192
TN GT+FGMG+++ FM Q +G AIIG GAIIGSIVS RLM R T + Y E
Sbjct: 150 TNLGTSFGMGLILTTFMIAQKSPTGENFVQAAIIGDIGAIIGSIVSVRLMVRHTRKYYGE 209
Query: 193 MDSPVNVEEMQHEETKEHKENLFL----------RMLNAVLDGGKTGVDLGIAIIPGVLI 242
+ +EM +E + ++L R+L ++L+GGK+GV+LG+AIIPGV+I
Sbjct: 210 -----HADEMVYESDDDGYDSLKYREVREGGIGARLLESLLEGGKSGVELGLAIIPGVVI 264
Query: 243 ISTAVMMLTNGPAGPDGTFAGKAYEGVGLLPALADKVDFLFRWLFGFTDSRLIAFPITAL 302
I T V+MLTNGP+ P+G + G AYEG+ P + DK+ F+ LFGF IAFPIT+L
Sbjct: 265 ICTLVLMLTNGPS-PNG-YTGAAYEGIAFFPWVGDKLSFILNPLFGFKHPEAIAFPITSL 322
Query: 303 GAVGAALGLVPNFSAQGILDGNAVAVFTAMGMCWSGFLSTHTAMLDSLKYRRLITQAIGA 362
GAVGAA+ LVP F ++ ++ N +AVFTAMGMCWSG+LSTH AM+DSL R+L +AI +
Sbjct: 323 GAVGAAMSLVPQFLSKNLIGANEIAVFTAMGMCWSGYLSTHIAMMDSLNCRKLTGKAILS 382
Query: 363 HAIGGLVAGIFAHWLFVLISLI 384
H +GGLVAG+ AH +++L++LI
Sbjct: 383 HTVGGLVAGVSAHIIYMLVTLI 404
>gi|153853939|ref|ZP_01995272.1| hypothetical protein DORLON_01263 [Dorea longicatena DSM 13814]
gi|149753321|gb|EDM63252.1| hypothetical protein DORLON_01263 [Dorea longicatena DSM 13814]
Length = 389
Score = 326 bits (835), Expect = 2e-87, Method: Composition-based stats.
Identities = 200/377 (53%), Positives = 260/377 (68%), Gaps = 11/377 (2%)
Query: 17 FEALGFIILFFAIFGYVGHVMGLPNMLNTIMQTSYHLLLETVFYIMGITVLSGALGSLLV 76
E F+ +F IFG +G MG NMLNT+M T Y LLLETVFYIM I VL+GA+ L
Sbjct: 12 LEGFVFLAIFLGIFGGMGMKMGGVNMLNTLMNTGYQLLLETVFYIMAIAVLAGAISGLFS 71
Query: 77 EFHVVDMLERILRPLMKPLYNLPGVSALGGILTFLSDNPAIISLAQDRKFCSYFKKYQLV 136
EF V+ M+ ++L PLMKPLYNLPG +ALG I T+LSDNPAI+ LA+D+ F YFKK+QL
Sbjct: 72 EFGVISMVNKLLSPLMKPLYNLPGAAALGVITTYLSDNPAILGLAEDKNFRKYFKKFQLP 131
Query: 137 SLTNFGTAFGMGMVVIIFMAGQGY-----GSGAIIGLFGAIIGSIVSTRLMQRSTLRRYP 191
+LTN GT+FGMG++V FM G G+ ++G F AIIGSI+S R+M T + Y
Sbjct: 132 ALTNLGTSFGMGLIVSTFMIGLKLKGGHAGTAVLVGNFSAIIGSIISVRIMLHFTKKEYG 191
Query: 192 EMDSPVNVEEMQHEE----TKEHKE-NLFLRMLNAVLDGGKTGVDLGIAIIPGVLIISTA 246
+ V EE + + T+E ++ + R + A+L+GGK GVD+G+AIIPGV+ I T
Sbjct: 192 TEEYCVQFEEHEDMDAIMNTREIRDGGIGGRAIEALLEGGKNGVDVGLAIIPGVITICTL 251
Query: 247 VMMLTNGPAGPDGTFAGKAYEGVGLLPALADKVDFLFRWLFGFTDSRLIAFPITALGAVG 306
VMMLTNG A DGT+ G AYEG+ LP L K++F+ LFGFTD+ I+ PITALGA G
Sbjct: 252 VMMLTNG-ASADGTYTGAAYEGIAFLPWLGKKLEFILSPLFGFTDASGISVPITALGAAG 310
Query: 307 AALGLVPNFSAQGILDGNAVAVFTAMGMCWSGFLSTHTAMLDSLKYRRLITQAIGAHAIG 366
AA+GLVP+ + G + N +AVFTAM MCWSG+LSTH AM+ SLK +L +AI +H IG
Sbjct: 311 AAIGLVPHMAEAGTVLANDIAVFTAMCMCWSGYLSTHVAMMSSLKVNKLTGKAILSHTIG 370
Query: 367 GLVAGIFAHWLFVLISL 383
GL AG+ A+W+F L+ L
Sbjct: 371 GLCAGVAANWIFKLVML 387
>gi|150021212|ref|YP_001306566.1| hypothetical protein Tmel_1332 [Thermosipho melanesiensis BI429]
gi|149793733|gb|ABR31181.1| hypothetical protein Tmel_1332 [Thermosipho melanesiensis BI429]
Length = 374
Score = 319 bits (818), Expect = 2e-85, Method: Composition-based stats.
Identities = 180/374 (48%), Positives = 248/374 (66%), Gaps = 8/374 (2%)
Query: 13 RRHLFEALGFIILFFAIFGYVGHVMGLPNMLNTIMQTSYHLLLETVFYIMGITVLSGALG 72
+R EAL F++ F IF V MG+ N T+M T++ LL+ TVF+IM I VL+G
Sbjct: 2 KRIRIEALMFLVFLFLIFWGVDSYMGVSNFFKTLMLTAHDLLINTVFFIMAIAVLTGGFS 61
Query: 73 SLLVEFHVVDMLERILRPLMKPLYNLPGVSALGGILTFLSDNPAIISLAQDRKFCSYFKK 132
+LL EF V D+++ +L+PL+KPLYNLPGV+A+G + T+ SDNPAII+LA+D++F F +
Sbjct: 62 ALLFEFGVADLIDILLKPLIKPLYNLPGVAAMGILSTYFSDNPAIIALAKDKRFMKNFDR 121
Query: 133 YQLVSLTNFGTAFGMGMVVIIFMAGQGYG------SGAIIGLFGAIIGSIVSTRLMQRST 186
+Q L N GT+FGMG++V FM QG +IG GA++GSI S R+ T
Sbjct: 122 WQQPLLCNLGTSFGMGLIVSTFMIAQGTRMSVNLFPAVLIGNIGALVGSIASVRIFSVYT 181
Query: 187 LRRYPEMDSPVNVEEMQHEETKEHKENLFLRMLNAVLDGGKTGVDLGIAIIPGVLIISTA 246
+ + + E + ET+ + R L A+LDGGK+GV++G+ IIPGVLIIST
Sbjct: 182 KNKLGTFSNDEFIPEKYYTETR--PGSFMERFLEALLDGGKSGVEIGLGIIPGVLIISTF 239
Query: 247 VMMLTNGPAGPDGTFAGKAYEGVGLLPALADKVDFLFRWLFGFTDSRLIAFPITALGAVG 306
VMM+T GP + + G AYEG+ LLP + +K+ + RWLFGF++ LIAFP+T+LG+ G
Sbjct: 240 VMMITFGPKDQNVGYQGLAYEGIPLLPYIGEKIFVVLRWLFGFSNPELIAFPLTSLGSTG 299
Query: 307 AALGLVPNFSAQGILDGNAVAVFTAMGMCWSGFLSTHTAMLDSLKYRRLITQAIGAHAIG 366
AAL LVP F I+ N +AVFTAMGM WSG+LSTH AM+D L YR L +AI +H IG
Sbjct: 300 AALALVPKFIDMKIVTPNDIAVFTAMGMTWSGYLSTHIAMMDELGYRYLTGKAIFSHTIG 359
Query: 367 GLVAGIFAHWLFVL 380
G+VAGI AH +++
Sbjct: 360 GIVAGITAHLIYMF 373
>gi|154248879|ref|YP_001409704.1| membrane protein [Fervidobacterium nodosum Rt17-B1]
gi|154152815|gb|ABS60047.1| membrane protein [Fervidobacterium nodosum Rt17-B1]
Length = 380
Score = 305 bits (781), Expect = 4e-81, Method: Composition-based stats.
Identities = 176/350 (50%), Positives = 230/350 (65%), Gaps = 8/350 (2%)
Query: 37 MGLPNMLNTIMQTSYHLLLETVFYIMGITVLSGALGSLLVEFHVVDMLERILRPLMKPLY 96
MG N T+M T++ LLL TVF+IM + VL+GA +LL EF V+ + +L LMKPLY
Sbjct: 29 MGSANFFKTLMSTAHDLLLNTVFFIMAVAVLTGAFAALLNEFGVIYWIHLLLDKLMKPLY 88
Query: 97 NLPGVSALGGILTFLSDNPAIISLAQDRKFCSYFKKYQLVSLTNFGTAFGMGMVVIIFMA 156
NLPG++A+G + T+ SDNPAII+LA+D+ F S+F+K+Q L N GTAFGMGM+V F
Sbjct: 89 NLPGIAAMGILSTYFSDNPAIIALAKDKSFISHFEKWQEPLLCNLGTAFGMGMIVSTFFI 148
Query: 157 GQGYGSGA------IIGLFGAIIGSIVSTRLMQRSTLRRYPEMDSPVNVEEMQHEETKEH 210
QG +G I+G IIGSIVS RL T ++ + V ++H E +E
Sbjct: 149 AQGGRAGVNLVPAVIVGNIATIIGSIVSVRLFSIWTKKKLGSVHKEVENVNLKHREIREG 208
Query: 211 KENLFLRMLNAVLDGGKTGVDLGIAIIPGVLIISTAVMMLTNGPAGPDGTFAGKAYEGVG 270
N F R L A+LDGGKTGVD+GI IIPGVLIIST VMMLT GP + G AYEG+
Sbjct: 209 --NAFERFLEAMLDGGKTGVDIGIGIIPGVLIISTLVMMLTFGPKDNSVGYQGLAYEGIA 266
Query: 271 LLPALADKVDFLFRWLFGFTDSRLIAFPITALGAVGAALGLVPNFSAQGILDGNAVAVFT 330
L + + + + LFGF +LIAFPIT LG+ GAAL LVP F G++ + VAVFT
Sbjct: 267 LFDKIGKYLFWPLKILFGFDSPQLIAFPITCLGSTGAALALVPRFMEHGLIKPSDVAVFT 326
Query: 331 AMGMCWSGFLSTHTAMLDSLKYRRLITQAIGAHAIGGLVAGIFAHWLFVL 380
A+GM WSG+LSTH M+D+L YR L ++AI +H IGG+VAG A+ ++ L
Sbjct: 327 AIGMTWSGYLSTHVGMMDALGYRYLTSKAILSHTIGGIVAGFSANLMYNL 376
>gi|42527827|ref|NP_972925.1| hypothetical protein TDE2325 [Treponema denticola ATCC 35405]
gi|41818655|gb|AAS12844.1| conserved hypothetical protein [Treponema denticola ATCC 35405]
Length = 381
Score = 295 bits (754), Expect = 4e-78, Method: Composition-based stats.
Identities = 171/354 (48%), Positives = 233/354 (65%), Gaps = 7/354 (1%)
Query: 33 VGHVMGLPNMLNTIMQTSYHLLLETVFYIMGITVLSGALGSLLVEFHVVDMLERILRPLM 92
VG VMG NM+ T+M+T + LL+ Y+M + VL+GA+ L EF + ++ +IL LM
Sbjct: 29 VGSVMGGVNMIKTMMETGFDLLINICLYLMAVAVLAGAVSGLFSEFGTIALINKILSKLM 88
Query: 93 KPLYNLPGVSALGGILTFLSDNPAIISLAQDRKFCSYFKKYQLVSLTNFGTAFGMGMVVI 152
KPLY+LPG S+LG + FLSDNPAI++LA D F YFK+YQL +LTN GTAFGMG++
Sbjct: 89 KPLYDLPGASSLGILNCFLSDNPAILTLADDDNFRRYFKQYQLPALTNLGTAFGMGLITT 148
Query: 153 IFMAGQGYGS---GAIIGLFGAIIGSIVSTRLMQRSTLRRY--PEMDSPVNVEEMQHEET 207
M G S A+IG GAI GSIVS RLM + +RY S V+ + +
Sbjct: 149 TAMMGLNIKSAVPAALIGNVGAIAGSIVSVRLMIYFSKKRYGTEAFVSTKRVDPIPEKMR 208
Query: 208 KEHKENLFLRMLNAVLDGGKTGVDLGIAIIPGVLIISTAVMMLTNGPAGPDGTFAGKAYE 267
+ R + A+LDGGK+GV +G+AIIPGV+II T V+MLTNGP+ DGT+ G A E
Sbjct: 209 PVREGGAGGRFIQAMLDGGKSGVSMGLAIIPGVVIICTIVIMLTNGPSA-DGTYTGGARE 267
Query: 268 GVGLLPALADKVDFLFRWLFGFTDSRLIAFPITALGAVGAALGLVPNFSAQGILDGNAVA 327
G+ +LP + K+ F+ ++GF+ I+ PITALG+ GAALG+V S G ++ N +A
Sbjct: 268 GIAVLPWIGQKLSFILNPMYGFSSPEAISVPITALGSTGAALGIVKEMSFAGKINANDIA 327
Query: 328 VFTAMGMCWSGFLSTHTAMLDSLKYRRLITQAIGAHAIGGLVAGIFAHWL-FVL 380
VFTA+ MCWSG++STH AM+D+L + + +AI +H IGGL AGI AH + FVL
Sbjct: 328 VFTAICMCWSGYISTHIAMMDALDTKEMTGKAILSHTIGGLFAGIVAHLIAFVL 381
>gi|153952962|ref|YP_001393727.1| hypothetical protein CKL_0325 [Clostridium kluyveri DSM 555]
gi|146345843|gb|EDK32379.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555]
Length = 208
Score = 165 bits (418), Expect = 4e-39, Method: Composition-based stats.
Identities = 85/176 (48%), Positives = 122/176 (69%), Gaps = 2/176 (1%)
Query: 208 KEHKENLFLRMLNAVLDGGKTGVDLGIAIIPGVLIISTAVMMLTNGPAGPDG--TFAGKA 265
K + N F R LNA DGGKTGV LG++IIPG+LI +T VM+LTNGP+ DG + G A
Sbjct: 32 KVREGNAFQRGLNATFDGGKTGVQLGLSIIPGILIFTTLVMILTNGPSIVDGQAVYQGVA 91
Query: 266 YEGVGLLPALADKVDFLFRWLFGFTDSRLIAFPITALGAVGAALGLVPNFSAQGILDGNA 325
YEG GLL + DK+ F+ LFGF +S ++ P+T+LGA GA++ + G+L+G+
Sbjct: 92 YEGTGLLKDIGDKLSFILTPLFGFANSEVLGLPLTSLGACGASIAGAKQLAESGLLNGHD 151
Query: 326 VAVFTAMGMCWSGFLSTHTAMLDSLKYRRLITQAIGAHAIGGLVAGIFAHWLFVLI 381
+AV+ A+ CW+GFLS+H ++ DS+K R + T A+ H IGGLVAG+ A++ ++LI
Sbjct: 152 MAVYFAIAYCWAGFLSSHASIADSMKTREITTYAMLTHFIGGLVAGVIANYAYILI 207
>gi|153952961|ref|YP_001393726.1| hypothetical protein CKL_0324 [Clostridium kluyveri DSM 555]
gi|146345842|gb|EDK32378.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555]
Length = 163
Score = 121 bits (303), Expect = 9e-26, Method: Composition-based stats.
Identities = 62/145 (42%), Positives = 94/145 (64%), Gaps = 5/145 (3%)
Query: 1 METRSQHKKLTFRRHLFEALGFIILFFAIFGYVGHVMGLPNMLNTIMQTSYHLLLETVFY 60
ME K++T +L +GFI+L I GY+ VMG+ + IM+T++ LL+ T FY
Sbjct: 9 MEENRYRKEITKGTYL--CIGFIVL---ILGYLSIVMGVGKTFSVIMKTAHDLLINTAFY 63
Query: 61 IMGITVLSGALGSLLVEFHVVDMLERILRPLMKPLYNLPGVSALGGILTFLSDNPAIISL 120
IM + VL+GA+ S+ EF V +L +++ P+MKPL+ LPG ++LG I + SDNP+I+
Sbjct: 64 IMSVAVLAGAVSSVFSEFGVTALLNKLISPIMKPLFKLPGAASLGAITCYFSDNPSIVIN 123
Query: 121 AQDRKFCSYFKKYQLVSLTNFGTAF 145
++D + YFKKYQ ++ NFGT F
Sbjct: 124 SKDPGYAKYFKKYQWTTMINFGTTF 148
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.328 0.143 0.426
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,377,412,053
Number of Sequences: 5470121
Number of extensions: 59253207
Number of successful extensions: 213337
Number of sequences better than 1.0e-05: 18
Number of HSP's better than 0.0 without gapping: 15
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 213279
Number of HSP's gapped (non-prelim): 18
length of query: 384
length of database: 1,894,087,724
effective HSP length: 135
effective length of query: 249
effective length of database: 1,155,621,389
effective search space: 287749725861
effective search space used: 287749725861
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 130 (54.7 bits)