BLASTP 2.2.18 [Mar-02-2008]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= PGN_0141 hypothetical protein
(394 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
6,515,104 sequences; 2,222,278,849 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|34541671|ref|NP_906150.1| hypothetical protein PG2089 [P... 809 0.0
gi|29346891|ref|NP_810394.1| hypothetical protein BT_1481 [... 590 e-167
gi|53712870|ref|YP_098862.1| hypothetical protein BF1578 [B... 588 e-166
gi|160886206|ref|ZP_02067209.1| hypothetical protein BACOVA... 585 e-165
gi|160889013|ref|ZP_02070016.1| hypothetical protein BACUNI... 585 e-165
gi|153808093|ref|ZP_01960761.1| hypothetical protein BACCAC... 585 e-165
gi|167763398|ref|ZP_02435525.1| hypothetical protein BACSTE... 573 e-162
gi|150005452|ref|YP_001300196.1| hypothetical protein BVU_2... 528 e-148
gi|167761680|ref|ZP_02433807.1| hypothetical protein BACSTE... 471 e-131
gi|94967517|ref|YP_589565.1| hypothetical protein Acid345_0... 75 1e-11
gi|150401425|ref|YP_001325191.1| hypothetical protein Maeo_... 69 5e-10
gi|167749568|ref|ZP_02421695.1| hypothetical protein EUBSIR... 68 1e-09
gi|169187793|ref|ZP_02847951.1| putative DNA helicase [Paen... 61 1e-07
gi|89901811|ref|YP_524282.1| DNA helicase related protein [... 60 2e-07
>gi|34541671|ref|NP_906150.1| hypothetical protein PG2089 [Porphyromonas gingivalis W83]
gi|34397989|gb|AAQ67049.1| hypothetical protein PG_2089 [Porphyromonas gingivalis W83]
Length = 394
Score = 809 bits (2090), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 394/394 (100%), Positives = 394/394 (100%)
Query: 1 MAYLGTKAKWLLLLSSIVLLAIVGVWIFQYSGYGERFEIVDELGGNIFPSAILSVATTDA 60
MAYLGTKAKWLLLLSSIVLLAIVGVWIFQYSGYGERFEIVDELGGNIFPSAILSVATTDA
Sbjct: 1 MAYLGTKAKWLLLLSSIVLLAIVGVWIFQYSGYGERFEIVDELGGNIFPSAILSVATTDA 60
Query: 61 EVIKPVEGLYVGNPKSVISIRLKTRRANSRVRVEVAETPFFSQSVSEFVLDKSGSTYTIY 120
EVIKPVEGLYVGNPKSVISIRLKTRRANSRVRVEVAETPFFSQSVSEFVLDKSGSTYTIY
Sbjct: 61 EVIKPVEGLYVGNPKSVISIRLKTRRANSRVRVEVAETPFFSQSVSEFVLDKSGSTYTIY 120
Query: 121 PDIIWKYEALRNNNQAGPVSVAIKVEINGANWGQQVRTFSVRSLNECLLGYMSGGRVFCE 180
PDIIWKYEALRNNNQAGPVSVAIKVEINGANWGQQVRTFSVRSLNECLLGYMSGGRVFCE
Sbjct: 121 PDIIWKYEALRNNNQAGPVSVAIKVEINGANWGQQVRTFSVRSLNECLLGYMSGGRVFCE 180
Query: 181 TGIFFAAYVNEENPKIDKLLREALNTRIVNRFSGYQSAQNPVDRQVYALWNILQKRQFRY 240
TGIFFAAYVNEENPKIDKLLREALNTRIVNRFSGYQSAQNPVDRQVYALWNILQKRQFRY
Sbjct: 181 TGIFFAAYVNEENPKIDKLLREALNTRIVNRFSGYQSAQNPVDRQVYALWNILQKRQFRY 240
Query: 241 SSVANSSLSSNVVLSQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRMPGHMF 300
SSVANSSLSSNVVLSQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRMPGHMF
Sbjct: 241 SSVANSSLSSNVVLSQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRMPGHMF 300
Query: 301 VGYYTDISKKEKTFLETTMIGDVDLDDFFPDEQLDSTMIGKSQNQMSKLTFEKSKEYANK 360
VGYYTDISKKEKTFLETTMIGDVDLDDFFPDEQLDSTMIGKSQNQMSKLTFEKSKEYANK
Sbjct: 301 VGYYTDISKKEKTFLETTMIGDVDLDDFFPDEQLDSTMIGKSQNQMSKLTFEKSKEYANK 360
Query: 361 KYQENEADIHSGKSGYMFLEISKDVRRRIQSIGK 394
KYQENEADIHSGKSGYMFLEISKDVRRRIQSIGK
Sbjct: 361 KYQENEADIHSGKSGYMFLEISKDVRRRIQSIGK 394
>gi|29346891|ref|NP_810394.1| hypothetical protein BT_1481 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29338789|gb|AAO76588.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 397
Score = 590 bits (1521), Expect = e-167, Method: Compositional matrix adjust.
Identities = 294/379 (77%), Positives = 335/379 (88%), Gaps = 1/379 (0%)
Query: 17 IVLLAIVGVWIFQYSGYGERFEIVDELGGNIFPSAILSVATTDAEVIKPVEGLYVGNPKS 76
++LL I G+ +F+Y+ + FEIVD+LGGNIFPSAILSVATTDA+VI P + +GNPKS
Sbjct: 19 VLLLLIGGISVFKYTSFNSGFEIVDDLGGNIFPSAILSVATTDAQVITPSDSTCLGNPKS 78
Query: 77 VISIRLKTRRANSRVRVEVAETPFFSQSVSEFVLDKSGSTYTIYPDIIWKYEALRNNNQA 136
I+IR+K+R A SRVR+EVAETPFFS+SVSEFVL+K + YTIYPDIIW YEAL+NN QA
Sbjct: 79 CIAIRVKSRTAYSRVRIEVAETPFFSRSVSEFVLNKPRTEYTIYPDIIWNYEALKNNAQA 138
Query: 137 GPVSVAIKVEINGANWGQQVRTFSVRSLNECLLGYMSGGRVFCETGIFFAAYVNEENPKI 196
PVSVA+KVE+NG + GQ+VRTFSVRS+NECLLGY++ G F +T IFFAAYVNEENP I
Sbjct: 139 EPVSVAVKVEMNGKDLGQRVRTFSVRSVNECLLGYVANGTKFYDTSIFFAAYVNEENPMI 198
Query: 197 DKLLREALNTRIVNRFSGYQS-AQNPVDRQVYALWNILQKRQFRYSSVANSSLSSNVVLS 255
D+LLREALNTRIVNRF GYQS A+ VD+QVYALWNILQKR+FRYSSV+N+SLSSNVV S
Sbjct: 199 DQLLREALNTRIVNRFLGYQSTAKGAVDKQVYALWNILQKRKFRYSSVSNTSLSSNVVFS 258
Query: 256 QRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRMPGHMFVGYYTDISKKEKTFL 315
QRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVR PGHMFVGYYTD S K+ FL
Sbjct: 259 QRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRTPGHMFVGYYTDNSHKDMNFL 318
Query: 316 ETTMIGDVDLDDFFPDEQLDSTMIGKSQNQMSKLTFEKSKEYANKKYQENEADIHSGKSG 375
ETTMIGDVDLDDFFPDEQLDSTM+GKSQN+MS LTFEKSK+YANKKY++NEA IHSGK
Sbjct: 319 ETTMIGDVDLDDFFPDEQLDSTMVGKSQNEMSLLTFEKSKQYANKKYKDNEAGIHSGKLN 378
Query: 376 YMFLEISKDVRRRIQSIGK 394
YMFLEISK+VRR+IQ IGK
Sbjct: 379 YMFLEISKEVRRKIQPIGK 397
>gi|53712870|ref|YP_098862.1| hypothetical protein BF1578 [Bacteroides fragilis YCH46]
gi|60681088|ref|YP_211232.1| hypothetical protein BF1592 [Bacteroides fragilis NCTC 9343]
gi|52215735|dbj|BAD48328.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
gi|60492522|emb|CAH07293.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343]
Length = 397
Score = 588 bits (1517), Expect = e-166, Method: Compositional matrix adjust.
Identities = 294/385 (76%), Positives = 337/385 (87%), Gaps = 2/385 (0%)
Query: 12 LLLSSIVLLAI-VGVWIFQYSGYGERFEIVDELGGNIFPSAILSVATTDAEVIKPVEGLY 70
+L + + L I +GV +F++S FEI DELGGNIFPSAILSVATTDA+VI+P + +Y
Sbjct: 13 VLWTVVALFFIAIGVSVFKFSSATSGFEITDELGGNIFPSAILSVATTDAQVIQPADSIY 72
Query: 71 VGNPKSVISIRLKTRRANSRVRVEVAETPFFSQSVSEFVLDKSGSTYTIYPDIIWKYEAL 130
+GNPKS I+I++K+R A SRVR++VAETPFFS+SVSEFVL+K + YTIYPDIIW YEAL
Sbjct: 73 LGNPKSCIAIKVKSRSAYSRVRIDVAETPFFSRSVSEFVLNKPRTEYTIYPDIIWNYEAL 132
Query: 131 RNNNQAGPVSVAIKVEINGANWGQQVRTFSVRSLNECLLGYMSGGRVFCETGIFFAAYVN 190
+NNNQA P+SVA+ VE+NG + GQ+VRTFSVRS+NECLLGY++ G F +TGIFFAAYVN
Sbjct: 133 KNNNQAEPISVAVTVEMNGEDLGQRVRTFSVRSVNECLLGYVTNGTKFHDTGIFFAAYVN 192
Query: 191 EENPKIDKLLREALNTRIVNRFSGYQS-AQNPVDRQVYALWNILQKRQFRYSSVANSSLS 249
EENP ID+LLREALNTRIVNRF GYQ+ A VD+QVYALWN+LQKR+FRYSSV+N+SLS
Sbjct: 193 EENPMIDQLLREALNTRIVNRFLGYQNPAPGAVDKQVYALWNVLQKRKFRYSSVSNTSLS 252
Query: 250 SNVVLSQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRMPGHMFVGYYTDISK 309
SNVV SQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVR PGHMFVGYYTD S
Sbjct: 253 SNVVYSQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRTPGHMFVGYYTDNSH 312
Query: 310 KEKTFLETTMIGDVDLDDFFPDEQLDSTMIGKSQNQMSKLTFEKSKEYANKKYQENEADI 369
K+ FLETTMIGDVDLDDFFPDEQLDSTM+GKSQNQMS LTFEKSK+YANKKY+ENE I
Sbjct: 313 KDMNFLETTMIGDVDLDDFFPDEQLDSTMVGKSQNQMSLLTFEKSKQYANKKYKENEKGI 372
Query: 370 HSGKSGYMFLEISKDVRRRIQSIGK 394
HSGK YMFLEISKDVRR+IQ IGK
Sbjct: 373 HSGKLNYMFLEISKDVRRKIQPIGK 397
>gi|160886206|ref|ZP_02067209.1| hypothetical protein BACOVA_04213 [Bacteroides ovatus ATCC 8483]
gi|156108091|gb|EDO09836.1| hypothetical protein BACOVA_04213 [Bacteroides ovatus ATCC 8483]
Length = 397
Score = 585 bits (1509), Expect = e-165, Method: Compositional matrix adjust.
Identities = 294/385 (76%), Positives = 337/385 (87%), Gaps = 3/385 (0%)
Query: 13 LLSSIVLLAIV--GVWIFQYSGYGERFEIVDELGGNIFPSAILSVATTDAEVIKPVEGLY 70
++ S+V+L I+ G+ +F+Y+ + FEIVD+LGGNIFPSAILSVATTDA+VI P + Y
Sbjct: 13 IIGSVVVLLILLGGMSVFKYTSFNSGFEIVDDLGGNIFPSAILSVATTDAQVIVPSDSNY 72
Query: 71 VGNPKSVISIRLKTRRANSRVRVEVAETPFFSQSVSEFVLDKSGSTYTIYPDIIWKYEAL 130
+GNPKS I++R+K++ A SRVR+EVAETPFFS+SVSEFVL+K + YTIYPDIIW YEAL
Sbjct: 73 LGNPKSCIAVRVKSKTAYSRVRIEVAETPFFSRSVSEFVLNKPRTEYTIYPDIIWNYEAL 132
Query: 131 RNNNQAGPVSVAIKVEINGANWGQQVRTFSVRSLNECLLGYMSGGRVFCETGIFFAAYVN 190
+N QA PVSVAI VE+NG + GQ+VRTFSVRS+NECLLGY+S G F +T IFFAAYVN
Sbjct: 133 KNEVQAEPVSVAITVEMNGKDLGQRVRTFSVRSINECLLGYVSNGTKFHDTSIFFAAYVN 192
Query: 191 EENPKIDKLLREALNTRIVNRFSGYQS-AQNPVDRQVYALWNILQKRQFRYSSVANSSLS 249
EENP ID+LLREALNTRIVNRF GYQS A+ VD+QVYALWNILQKR+FRYSSV+N+SLS
Sbjct: 193 EENPMIDQLLREALNTRIVNRFLGYQSKAKGAVDKQVYALWNILQKRKFRYSSVSNTSLS 252
Query: 250 SNVVLSQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRMPGHMFVGYYTDISK 309
SNVV SQRVRTFDDALESSQINCVDGSVLFASLLRAINI+PILVR PGHMFVGYYTD S
Sbjct: 253 SNVVFSQRVRTFDDALESSQINCVDGSVLFASLLRAINIDPILVRTPGHMFVGYYTDNSH 312
Query: 310 KEKTFLETTMIGDVDLDDFFPDEQLDSTMIGKSQNQMSKLTFEKSKEYANKKYQENEADI 369
+K FLETTMIGDVDLDDFFPDEQLDSTM+GKSQN+MS LTFEKSK+YANKKY+ENE I
Sbjct: 313 TDKNFLETTMIGDVDLDDFFPDEQLDSTMVGKSQNEMSLLTFEKSKQYANKKYKENEEGI 372
Query: 370 HSGKSGYMFLEISKDVRRRIQSIGK 394
HSGK YMFLEISKDVRR+IQ IGK
Sbjct: 373 HSGKLNYMFLEISKDVRRKIQPIGK 397
>gi|160889013|ref|ZP_02070016.1| hypothetical protein BACUNI_01433 [Bacteroides uniformis ATCC 8492]
gi|156861480|gb|EDO54911.1| hypothetical protein BACUNI_01433 [Bacteroides uniformis ATCC 8492]
Length = 398
Score = 585 bits (1508), Expect = e-165, Method: Compositional matrix adjust.
Identities = 290/389 (74%), Positives = 335/389 (86%), Gaps = 1/389 (0%)
Query: 6 TKAKWLLLLSSIVLLAIVGVWIFQYSGYGERFEIVDELGGNIFPSAILSVATTDAEVIKP 65
+K +++ +++L+ I VW+ + + FEI DELGGNIFPS+ILSVATTDA+VI P
Sbjct: 11 SKRHLSVIMGTLLLVIIGSVWL-NTATFRSGFEITDELGGNIFPSSILSVATTDAQVIVP 69
Query: 66 VEGLYVGNPKSVISIRLKTRRANSRVRVEVAETPFFSQSVSEFVLDKSGSTYTIYPDIIW 125
V+ +YVGNPKS I++R+++R A SRVRVEVAETPFFS+SVSEFVL K + Y IYPDIIW
Sbjct: 70 VDSMYVGNPKSCIAVRVRSRNAYSRVRVEVAETPFFSRSVSEFVLAKPRTEYIIYPDIIW 129
Query: 126 KYEALRNNNQAGPVSVAIKVEINGANWGQQVRTFSVRSLNECLLGYMSGGRVFCETGIFF 185
YEAL+NNNQA P+SVA+ VE+NG + GQ+VRTFSVRS+NECLLGY++ G F +TGIFF
Sbjct: 130 NYEALKNNNQAEPISVAVMVEMNGKDLGQRVRTFSVRSINECLLGYVTNGTKFHDTGIFF 189
Query: 186 AAYVNEENPKIDKLLREALNTRIVNRFSGYQSAQNPVDRQVYALWNILQKRQFRYSSVAN 245
AAYVNEENP IDKLLREAL+TRIVNRF GYQ VDRQVYALWN+LQKR+FRYSSV+N
Sbjct: 190 AAYVNEENPMIDKLLREALDTRIVNRFLGYQGGAEVVDRQVYALWNVLQKRKFRYSSVSN 249
Query: 246 SSLSSNVVLSQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRMPGHMFVGYYT 305
+SLSSNVV SQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVR PGHMFVGYYT
Sbjct: 250 TSLSSNVVFSQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRTPGHMFVGYYT 309
Query: 306 DISKKEKTFLETTMIGDVDLDDFFPDEQLDSTMIGKSQNQMSKLTFEKSKEYANKKYQEN 365
D S K+ FLETTMIGDVDLDDFFPDEQLDSTM+GKSQNQMS++TF+KSKEYANKKY +N
Sbjct: 310 DSSHKDMNFLETTMIGDVDLDDFFPDEQLDSTMVGKSQNQMSRITFDKSKEYANKKYAQN 369
Query: 366 EADIHSGKSGYMFLEISKDVRRRIQSIGK 394
+ IHSGK YMFLEISK+VRRRIQ IGK
Sbjct: 370 KEGIHSGKLNYMFLEISKEVRRRIQPIGK 398
>gi|153808093|ref|ZP_01960761.1| hypothetical protein BACCAC_02379 [Bacteroides caccae ATCC 43185]
gi|149128996|gb|EDM20212.1| hypothetical protein BACCAC_02379 [Bacteroides caccae ATCC 43185]
Length = 397
Score = 585 bits (1508), Expect = e-165, Method: Compositional matrix adjust.
Identities = 289/379 (76%), Positives = 335/379 (88%), Gaps = 1/379 (0%)
Query: 17 IVLLAIVGVWIFQYSGYGERFEIVDELGGNIFPSAILSVATTDAEVIKPVEGLYVGNPKS 76
++ L I G+ +F+Y+ + FEIVD+LGGNIFPSAILSVATTDA+VI P + +Y+GNPKS
Sbjct: 19 VLFLLIGGISVFKYTSFNSGFEIVDDLGGNIFPSAILSVATTDAQVITPSDSMYLGNPKS 78
Query: 77 VISIRLKTRRANSRVRVEVAETPFFSQSVSEFVLDKSGSTYTIYPDIIWKYEALRNNNQA 136
I++R+K+R A SRVR+EVAETPFFS+SVSEFVL++ + YTIYPDIIW YEAL+NN QA
Sbjct: 79 CIAVRVKSRTAYSRVRIEVAETPFFSRSVSEFVLNRPRTEYTIYPDIIWNYEALKNNVQA 138
Query: 137 GPVSVAIKVEINGANWGQQVRTFSVRSLNECLLGYMSGGRVFCETGIFFAAYVNEENPKI 196
PVSVA+KVE+NG + GQ+VRTFSVRS+NECLLGY++ G F +T IFFAAYVNEENP I
Sbjct: 139 EPVSVAVKVEMNGDDLGQRVRTFSVRSINECLLGYVANGTKFYDTSIFFAAYVNEENPMI 198
Query: 197 DKLLREALNTRIVNRFSGYQS-AQNPVDRQVYALWNILQKRQFRYSSVANSSLSSNVVLS 255
D+LLREALNTRIVNRF GYQS A+ VD+QVYALWNILQKR+FRYSSV+N+SLSSNVV S
Sbjct: 199 DQLLREALNTRIVNRFLGYQSKAKGVVDKQVYALWNILQKRKFRYSSVSNTSLSSNVVFS 258
Query: 256 QRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRMPGHMFVGYYTDISKKEKTFL 315
QRVRTFDDAL+SSQINCVDGSVLFASLLR+INI+PILVR PGHMFVGYYTD S K+ FL
Sbjct: 259 QRVRTFDDALDSSQINCVDGSVLFASLLRSINIDPILVRTPGHMFVGYYTDNSHKDMNFL 318
Query: 316 ETTMIGDVDLDDFFPDEQLDSTMIGKSQNQMSKLTFEKSKEYANKKYQENEADIHSGKSG 375
ETTMIGDVDLDDFFPDEQLDSTM+GKSQN+MS LTFEKSK+YANKKY+ENE IHSGK
Sbjct: 319 ETTMIGDVDLDDFFPDEQLDSTMVGKSQNEMSLLTFEKSKQYANKKYKENEEGIHSGKLN 378
Query: 376 YMFLEISKDVRRRIQSIGK 394
YMFLEISK+VRR+IQ IGK
Sbjct: 379 YMFLEISKEVRRKIQPIGK 397
>gi|167763398|ref|ZP_02435525.1| hypothetical protein BACSTE_01772 [Bacteroides stercoris ATCC
43183]
gi|167698692|gb|EDS15271.1| hypothetical protein BACSTE_01772 [Bacteroides stercoris ATCC
43183]
Length = 400
Score = 573 bits (1477), Expect = e-162, Method: Compositional matrix adjust.
Identities = 281/389 (72%), Positives = 333/389 (85%), Gaps = 1/389 (0%)
Query: 7 KAKWLL-LLSSIVLLAIVGVWIFQYSGYGERFEIVDELGGNIFPSAILSVATTDAEVIKP 65
++KW + L+ +++ + I+G + + FEI DELGGNIFPS+ILSVATTDA+VI P
Sbjct: 12 QSKWRMPLIVAVLAVVIIGSIWLNVAPFRSGFEITDELGGNIFPSSILSVATTDAQVIVP 71
Query: 66 VEGLYVGNPKSVISIRLKTRRANSRVRVEVAETPFFSQSVSEFVLDKSGSTYTIYPDIIW 125
+ ++VGNPKS I++++++ +A SRVR+EVAETPFFS+SVSEFVL + + YTIYPDIIW
Sbjct: 72 ADSMFVGNPKSCIAVKVRSAKAYSRVRIEVAETPFFSRSVSEFVLARPRTEYTIYPDIIW 131
Query: 126 KYEALRNNNQAGPVSVAIKVEINGANWGQQVRTFSVRSLNECLLGYMSGGRVFCETGIFF 185
YEAL+NNNQA P+SVA+ VE+N GQ+VRTFSVRS+NECLLGY++GG F +TGIFF
Sbjct: 132 NYEALKNNNQAEPISVAVTVEMNRKELGQRVRTFSVRSINECLLGYVTGGTRFHDTGIFF 191
Query: 186 AAYVNEENPKIDKLLREALNTRIVNRFSGYQSAQNPVDRQVYALWNILQKRQFRYSSVAN 245
AAYVNEENP ID LLREALNTRIVNRF GYQ + VD QVYALWN+LQKR FRYSSV+N
Sbjct: 192 AAYVNEENPMIDHLLREALNTRIVNRFLGYQGSPEAVDNQVYALWNVLQKRNFRYSSVSN 251
Query: 246 SSLSSNVVLSQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRMPGHMFVGYYT 305
+SLSSNVV SQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVR PGHMFVGYYT
Sbjct: 252 TSLSSNVVFSQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRTPGHMFVGYYT 311
Query: 306 DISKKEKTFLETTMIGDVDLDDFFPDEQLDSTMIGKSQNQMSKLTFEKSKEYANKKYQEN 365
D + K+ FLETTMIGDVDLDDFFPDE+LDSTM+GK+QNQMS++TF+KSKEYA++KY+EN
Sbjct: 312 DANHKDMKFLETTMIGDVDLDDFFPDERLDSTMVGKTQNQMSRITFDKSKEYASRKYKEN 371
Query: 366 EADIHSGKSGYMFLEISKDVRRRIQSIGK 394
E IHSG+ YMFLEISKDVRRRIQ IGK
Sbjct: 372 EKGIHSGRLNYMFLEISKDVRRRIQPIGK 400
>gi|150005452|ref|YP_001300196.1| hypothetical protein BVU_2935 [Bacteroides vulgatus ATCC 8482]
gi|149933876|gb|ABR40574.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 396
Score = 528 bits (1359), Expect = e-148, Method: Compositional matrix adjust.
Identities = 259/391 (66%), Positives = 315/391 (80%), Gaps = 3/391 (0%)
Query: 7 KAKWLLLLSSIVLLAIVGVWIFQYSGYGERFEIVDELGGNIFPSAILSVATTDAEVIKPV 66
K +W L+ S +L+ V+I++Y + FEI D+LGGNIFP+ ILS ATTDA +I P
Sbjct: 6 KWQWSTLILSAILVMAFSVFIYRYEPFKSGFEITDQLGGNIFPATILSTATTDASLIVPA 65
Query: 67 EGLYVGNPKSVISIRLKTRRANSRVRVEVAETPFFSQSVSEFVLDKSGSTYTIYPDIIWK 126
+ Y+GNPKS I+IRLK ANS++R+EVAETPFFSQSVSEF+L K+G Y ++PDIIW
Sbjct: 66 DSDYIGNPKSCIAIRLKNSYANSKLRIEVAETPFFSQSVSEFILPKAGKEYLVFPDIIWN 125
Query: 127 YEALRNNNQAGPVSVAIKVEINGANWGQQVRTFSVRSLNECLLGYMSGGRVFCETGIFFA 186
Y+ALR+NNQA PVS+++K E+N Q+++T S+RS+NEC LGY+ F +TG FFA
Sbjct: 126 YQALRDNNQAVPVSISVKAELNKKELPQRLKTISMRSINECPLGYVDDKMKFHDTGEFFA 185
Query: 187 AYVNEENPKIDKLLREALNTRIVNRFSGYQSAQNP---VDRQVYALWNILQKRQFRYSSV 243
AYVNEE+P+IDKLLREAL+TRIVNRF GYQ + VD+QVYALWN+LQKR F+YSS
Sbjct: 186 AYVNEEHPQIDKLLREALDTRIVNRFLGYQGNAHQSENVDKQVYALWNVLQKRNFKYSST 245
Query: 244 ANSSLSSNVVLSQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRMPGHMFVGY 303
N+SLSSNVV +QRVRT DDALESSQINCVDGSVL ASL++AINI PILVR+PGHMFVGY
Sbjct: 246 TNTSLSSNVVYTQRVRTLDDALESSQINCVDGSVLLASLMKAININPILVRIPGHMFVGY 305
Query: 304 YTDISKKEKTFLETTMIGDVDLDDFFPDEQLDSTMIGKSQNQMSKLTFEKSKEYANKKYQ 363
YTD S K FLETTMIGDV+LDDFFPDE+LDSTM+GKSQNQMSKLT+EKSKEYA +KY+
Sbjct: 306 YTDKSHKNMNFLETTMIGDVNLDDFFPDEKLDSTMVGKSQNQMSKLTYEKSKEYATRKYR 365
Query: 364 ENEADIHSGKSGYMFLEISKDVRRRIQSIGK 394
EN++ IHSGK YMFLEI K R ++Q IGK
Sbjct: 366 ENDSLIHSGKVNYMFLEIDKKTRAQVQPIGK 396
>gi|167761680|ref|ZP_02433807.1| hypothetical protein BACSTE_00014 [Bacteroides stercoris ATCC
43183]
gi|167700453|gb|EDS17032.1| hypothetical protein BACSTE_00014 [Bacteroides stercoris ATCC
43183]
Length = 304
Score = 471 bits (1211), Expect = e-131, Method: Compositional matrix adjust.
Identities = 233/304 (76%), Positives = 265/304 (87%)
Query: 91 VRVEVAETPFFSQSVSEFVLDKSGSTYTIYPDIIWKYEALRNNNQAGPVSVAIKVEINGA 150
+R+EVAETPFFS+SVSE VL ++ + YTIYPDII Y AL+NN QA P++V + VE+N
Sbjct: 1 MRIEVAETPFFSRSVSESVLARARTEYTIYPDIIRHYVALKNNIQAEPITVDVTVEMNRK 60
Query: 151 NWGQQVRTFSVRSLNECLLGYMSGGRVFCETGIFFAAYVNEENPKIDKLLREALNTRIVN 210
GQ+VRT SVRS+NECLLGY++GG F +TGIFFAAYVNEENP ID LLREALNTRIVN
Sbjct: 61 ELGQRVRTSSVRSINECLLGYVTGGTRFHDTGIFFAAYVNEENPMIDHLLREALNTRIVN 120
Query: 211 RFSGYQSAQNPVDRQVYALWNILQKRQFRYSSVANSSLSSNVVLSQRVRTFDDALESSQI 270
RF GYQ + VD QVYALWN+LQKR FRYSSV+N+SLSSNVV SQRVRTFDDALESSQI
Sbjct: 121 RFLGYQGSPEAVDNQVYALWNVLQKRNFRYSSVSNTSLSSNVVFSQRVRTFDDALESSQI 180
Query: 271 NCVDGSVLFASLLRAINIEPILVRMPGHMFVGYYTDISKKEKTFLETTMIGDVDLDDFFP 330
NCVDGSVLFASLLRAINIEPILVR PGHMFVGYYTD + K+ FLETTMIGDVDLDDFFP
Sbjct: 181 NCVDGSVLFASLLRAINIEPILVRTPGHMFVGYYTDANHKDMKFLETTMIGDVDLDDFFP 240
Query: 331 DEQLDSTMIGKSQNQMSKLTFEKSKEYANKKYQENEADIHSGKSGYMFLEISKDVRRRIQ 390
DE+LDSTM+GK+QNQMS++TF+KSKEYA++KY+ENE IHSG+ YMFLEISKDVRRRIQ
Sbjct: 241 DERLDSTMVGKTQNQMSRITFDKSKEYASRKYKENEKGIHSGRLNYMFLEISKDVRRRIQ 300
Query: 391 SIGK 394
IGK
Sbjct: 301 PIGK 304
>gi|94967517|ref|YP_589565.1| hypothetical protein Acid345_0486 [Acidobacteria bacterium
Ellin345]
gi|94549567|gb|ABF39491.1| hypothetical protein Acid345_0486 [Acidobacteria bacterium
Ellin345]
Length = 380
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/219 (22%), Positives = 99/219 (45%), Gaps = 17/219 (7%)
Query: 116 TYTIYPDIIWKYEALRNNNQAGPVSVAIKV-EINGANWGQQVRTFSVRSLNECLLGYMSG 174
TY P + ++ NN + + + V ++ G Q +RS + G
Sbjct: 112 TYVFAPTFLPRF---YNNQEIAAATTVVNVSDMGGQTLYLQTVPVKIRSAEDMFWGSK-- 166
Query: 175 GRVFCETGIFFAAYVNEENPKIDKLLREALNTRIVNRFSGYQSAQNPVDRQ------VYA 228
+ F A+++ + +++++LR+A R GY+ ++ V ++ A
Sbjct: 167 ----FQFAPFIASWITPHDERVEEILRKAKEFMPGRRLPGYEPEKDAVGQEQMTYSEARA 222
Query: 229 LWNILQKRQFRYSSVANSSLSSNVVLSQRVRTFDDALESSQINCVDGSVLFASLLRAINI 288
++ LQ Y ++ +L + S+RVR + +L NC+DG V++ASL + +
Sbjct: 223 IYRALQDAGVSYVK-SSMTLGGHQDASERVRMPESSLRDVSANCIDGVVMYASLFENLGM 281
Query: 289 EPILVRMPGHMFVGYYTDISKKEKTFLETTMIGDVDLDD 327
EP+++ +PGH +VG + ++ET + G +D
Sbjct: 282 EPVVLLLPGHAYVGVRVSPKSDKYLYIETAITGRASFED 320
>gi|150401425|ref|YP_001325191.1| hypothetical protein Maeo_1001 [Methanococcus aeolicus Nankai-3]
gi|150014128|gb|ABR56579.1| hypothetical protein Maeo_1001 [Methanococcus aeolicus Nankai-3]
Length = 479
Score = 69.3 bits (168), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 78/152 (51%), Gaps = 11/152 (7%)
Query: 184 FFAAYVNEENPKIDKLLREALNTRIVNRFSGYQSAQNPV-------DRQVYALWNILQKR 236
+ ++ ++ I +LL A +GYQ + + + D QV A+++ L+
Sbjct: 290 YVTVFITPKDDAIMELLGIAKEYHPERSLAGYQYSGDDLEGWREYTDLQVKAIYDALK-- 347
Query: 237 QFRYS-SVANSSLSSNVVLSQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRM 295
+ Y S N+ + Q+V+ + L +S NC+DG+VLFAS + A+ + P ++ +
Sbjct: 348 -YGYGVSYVNTPTAYGKDTVQKVKLPKETLATSSGNCIDGAVLFASAIEALGMHPYIIVI 406
Query: 296 PGHMFVGYYTDISKKEKTFLETTMIGDVDLDD 327
PGH FV + D S LETTM+G+ D +D
Sbjct: 407 PGHAFVAWDVDGSGNYIEALETTMVGNYDFED 438
>gi|167749568|ref|ZP_02421695.1| hypothetical protein EUBSIR_00526 [Eubacterium siraeum DSM 15702]
gi|167657492|gb|EDS01622.1| hypothetical protein EUBSIR_00526 [Eubacterium siraeum DSM 15702]
Length = 1921
Score = 67.8 bits (164), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 67/237 (28%), Positives = 110/237 (46%), Gaps = 25/237 (10%)
Query: 89 SRVRVEVAETPFFSQSVS-EFVLDKSGSTYTIYP-DIIWKYEALRNNNQAGPVSVAIKVE 146
S VR+ +A P F+++ + + SG + I P II E L + +A +V+ +
Sbjct: 41 SDVRLTIAFDPAFAKAFTYDISCVPSGKSVEISPLRIILSTEMLFSLTEAVSGTVSFSLS 100
Query: 147 INGANWGQQVRTFSVRSLNECLLGYMSGGRVFCETGIFFAAYVNEENPKIDKLLREA--- 203
+ Q+ + + NE G F AA+V +P+I K++ EA
Sbjct: 101 KDEKELYQKDVPVQLLAFNEW------SGLAFDPE--LIAAFVTPNHPEIAKVISEASVY 152
Query: 204 -LNTRIVNRFSGYQSAQNP--VDRQVYALWNILQKRQFRYSSVANSSLSSNVVLSQRVRT 260
FS YQ QNP V +Q+ AL+ L R+ Y N +S + Q++R
Sbjct: 153 LKKWADTTAFSAYQR-QNPNFVKQQMAALYAALCARRIAY----NMPPASFEYIGQKIRL 207
Query: 261 FDDALESSQINCVDGSVLFASLLRAINIEPILVRMPGHMFVGYYTDISKKEKTFLET 317
+ LE Q C+D +VL+AS L A+ + PI++ + GH F G + + E+TF +
Sbjct: 208 ANTVLEQKQGTCLDLAVLYASCLEAVGLNPIIIFIEGHAFCGCHLE----EETFADC 260
>gi|169187793|ref|ZP_02847951.1| putative DNA helicase [Paenibacillus sp. JDR-2]
gi|169005209|gb|EDS52062.1| putative DNA helicase [Paenibacillus sp. JDR-2]
Length = 1963
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 69/137 (50%), Gaps = 13/137 (9%)
Query: 186 AAYVNEENPKIDKLLREALNTRI----VNRFSGYQSAQ-NPVDRQVYALWNILQKRQFRY 240
AA+V + I +++REA + + F+ YQS N V Q A++ LQ R+ Y
Sbjct: 133 AAFVMPNHHHIAQVVREAADILAKWTGSSSFTAYQSKDPNKVRTQAAAIYAALQNRKIAY 192
Query: 241 SSVANSSLSSNVVLSQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRMPGHMF 300
N + S V+ QRVR + NC+D S+L+ + L A+ + P+LV GH F
Sbjct: 193 ----NVAPPSFEVIGQRVRLPETIFTHRMGNCLDLSLLYTACLEAVGLHPLLVFTKGHAF 248
Query: 301 VGYYTDISKKEKTFLET 317
G + + E+TF E+
Sbjct: 249 SGLWLE----EETFAES 261
>gi|89901811|ref|YP_524282.1| DNA helicase related protein [Rhodoferax ferrireducens T118]
gi|89346548|gb|ABD70751.1| DNA helicase related protein [Rhodoferax ferrireducens T118]
Length = 2222
Score = 60.5 bits (145), Expect = 2e-07, Method: Composition-based stats.
Identities = 49/191 (25%), Positives = 92/191 (48%), Gaps = 21/191 (10%)
Query: 186 AAYVNEENPKIDKLLR-EALNTRIVNR---FSGYQSAQNPVDRQVYALWNILQKRQFRYS 241
AA+V +P +D+LL+ AL+ + ++ GY A+W + +R+ YS
Sbjct: 157 AAFVQPNDPAVDRLLKGAALSLQAADKSGSIDGYTHGPKRAWELTSAIWAAVLQRKLNYS 216
Query: 242 SVANSSLSSNVVLSQRVRTFDDALESSQINCVDGSVLFASLLRAINIEPILVRMPGHMFV 301
+ +S Q+VR+ ++S C D ++LFA+ L N+ P+LV GH FV
Sbjct: 217 -LPPASFEHT---GQKVRSPSQIMDSGIATCFDSTLLFAACLEQANLNPLLVFTKGHSFV 272
Query: 302 GYYTDISKKEKTFLETTMIGDVD-LDDFFPDEQL---DSTMIGKSQNQMSKLTFEKSKEY 357
G++ + E+ T ++ D+ L ++L ++T+ + Q + F ++ +
Sbjct: 273 GFWL---RNEE--FSTAVVDDITALRKRLKLQELVVFETTLATQGQ----PVNFSQAIDN 323
Query: 358 ANKKYQENEAD 368
AN++ E E D
Sbjct: 324 ANRQLAEEEED 334
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
Posted date: May 10, 2008 4:54 AM
Number of letters in database: 884,634,002
Number of sequences in database: 2,620,852
Database: /apps/blastdb/nr.01
Posted date: May 10, 2008 4:52 AM
Number of letters in database: 976,814,986
Number of sequences in database: 2,761,530
Database: /apps/blastdb/nr.02
Posted date: May 10, 2008 4:46 AM
Number of letters in database: 360,829,861
Number of sequences in database: 1,132,722
Lambda K H
0.319 0.134 0.383
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,578,739,012
Number of Sequences: 6515104
Number of extensions: 61415916
Number of successful extensions: 163099
Number of sequences better than 1.0e-04: 14
Number of HSP's better than 0.0 without gapping: 11
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 163080
Number of HSP's gapped (non-prelim): 14
length of query: 394
length of database: 2,222,278,849
effective HSP length: 136
effective length of query: 258
effective length of database: 1,336,224,705
effective search space: 344745973890
effective search space used: 344745973890
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 123 (52.0 bits)