BLASTP 2.2.18 [Mar-02-2008]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= PGN_0048 hypothetical protein
(138 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
6,515,104 sequences; 2,222,278,849 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|160889326|ref|ZP_02070329.1| hypothetical protein BACUNI... 169 8e-41
gi|46242770|gb|AAS83475.1| Ctn002 [Bacteroides fragilis] 162 5e-39
gi|160891043|ref|ZP_02072046.1| hypothetical protein BACUNI... 152 7e-36
gi|53711397|ref|YP_097389.1| hypothetical protein BF0106 [B... 152 9e-36
gi|160884348|ref|ZP_02065351.1| hypothetical protein BACOVA... 152 1e-35
gi|150005903|ref|YP_001300647.1| hypothetical protein BVU_3... 146 3e-34
gi|150005375|ref|YP_001300119.1| hypothetical protein BVU_2... 106 4e-22
gi|53713760|ref|YP_099752.1| hypothetical protein BF2469 [B... 101 1e-20
gi|146300808|ref|YP_001195399.1| hypothetical protein Fjoh_... 91 3e-17
gi|146302120|ref|YP_001196711.1| hypothetical protein Fjoh_... 75 1e-12
gi|146301322|ref|YP_001195913.1| hypothetical protein Fjoh_... 74 4e-12
>gi|160889326|ref|ZP_02070329.1| hypothetical protein BACUNI_01749 [Bacteroides uniformis ATCC 8492]
gi|156861333|gb|EDO54764.1| hypothetical protein BACUNI_01749 [Bacteroides uniformis ATCC 8492]
Length = 139
Score = 169 bits (427), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 82/138 (59%), Positives = 101/138 (73%), Gaps = 1/138 (0%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAESFRKENKNIDECITYILTEVQRMGCAGLSDEEV 60
MKGT+HFK IQ YLE RA D LFA+++R KNID+C+TYIL VQ+ GC G +D E+
Sbjct: 1 MKGTDHFKRTIQMYLEQRAEEDTLFAKNYRNPAKNIDDCVTYILNYVQKSGCNGFTDGEI 60
Query: 61 YSLAVHYYDEDNIEVGKYINCQVVVNHTIELTEEEKAEARKKAIERYQAEEYRKLTAKKP 120
Y AVHYYDE+ IEVGK I CQV VNH +ELT EEKAEAR++A+ RYQ EE RKL +
Sbjct: 61 YGQAVHYYDENEIEVGKPIQCQVAVNHIVELTAEEKAEARQQAVRRYQDEELRKLQNRTK 120
Query: 121 KLEKQVEQQIAQPSLFEF 138
+ + E Q+ QPSLF+F
Sbjct: 121 PTKAKTETQV-QPSLFDF 137
>gi|46242770|gb|AAS83475.1| Ctn002 [Bacteroides fragilis]
Length = 142
Score = 162 bits (411), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 81/140 (57%), Positives = 101/140 (72%), Gaps = 2/140 (1%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAESFRKENKNIDECITYILTEVQRMGCAGLSDEEV 60
MK T+HFK IQ YLE RA+ D LFA+++R KNID+C+TYIL VQ+ GC G +D E+
Sbjct: 1 MKTTDHFKRTIQMYLEQRAAEDALFAKNYRNPAKNIDDCVTYILNYVQKSGCNGFTDGEI 60
Query: 61 YSLAVHYYDEDNIEVGKYINCQVVVNHTIELTEEEKAEARKKAIERYQAEEYRKLTAK-K 119
Y AVHYYDE+ IEVGK I CQV VNH +ELT EEKAEAR+ AI +YQ EE R+L + K
Sbjct: 61 YGQAVHYYDENEIEVGKPIQCQVAVNHVVELTAEEKAEARQNAIRQYQDEEVRRLQNRNK 120
Query: 120 PKLE-KQVEQQIAQPSLFEF 138
P+ K Q++ QPSLF+
Sbjct: 121 PRTATKATVQEVQQPSLFDL 140
>gi|160891043|ref|ZP_02072046.1| hypothetical protein BACUNI_03490 [Bacteroides uniformis ATCC 8492]
gi|156859264|gb|EDO52695.1| hypothetical protein BACUNI_03490 [Bacteroides uniformis ATCC 8492]
Length = 141
Score = 152 bits (384), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 77/141 (54%), Positives = 101/141 (71%), Gaps = 6/141 (4%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAESFRKENKNIDECITYILTEVQRMGCAGLSDEEV 60
M TE+FK IQ YLE RA DELFA + +KNID+C+TYIL VQ+ GC G D+E+
Sbjct: 1 MNTTEYFKRTIQAYLEERAMEDELFAAKYDNPDKNIDDCVTYILNWVQKSGCNGFCDDEI 60
Query: 61 YSLAVHYYDEDNIEVGKYINCQVVVNHTIELTEEEKAEARKKAIERYQAEEYRKL----T 116
Y A+HYY+E +IEVGK +NCQV VNH IELTEEEKA+AR++AI +YQ E+ K+ T
Sbjct: 61 YGQAIHYYEEKDIEVGKPLNCQVSVNHHIELTEEEKAQARQEAIRQYQQEQMNKMRNRDT 120
Query: 117 AKKPKLEKQVEQQIAQPSLFE 137
AK+ ++ E ++ QPSLF+
Sbjct: 121 AKRTS--QRTETEVHQPSLFD 139
>gi|53711397|ref|YP_097389.1| hypothetical protein BF0106 [Bacteroides fragilis YCH46]
gi|154490906|ref|ZP_02030847.1| hypothetical protein PARMER_00823 [Parabacteroides merdae ATCC
43184]
gi|167762710|ref|ZP_02434837.1| hypothetical protein BACSTE_01068 [Bacteroides stercoris ATCC
43183]
gi|52214262|dbj|BAD46855.1| hypothetical protein [Bacteroides fragilis YCH46]
gi|154088654|gb|EDN87698.1| hypothetical protein PARMER_00823 [Parabacteroides merdae ATCC
43184]
gi|167699050|gb|EDS15629.1| hypothetical protein BACSTE_01068 [Bacteroides stercoris ATCC
43183]
Length = 138
Score = 152 bits (383), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 74/139 (53%), Positives = 99/139 (71%), Gaps = 4/139 (2%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAESFRKENKNIDECITYILTEVQRMGCAGLSDEEV 60
MKGT+HFK I YLE RA D LFA+ +R KN+DEC+T+IL VQ+ GC G +D E+
Sbjct: 1 MKGTDHFKRTIYMYLEQRAEEDALFAKKYRNPAKNMDECVTHILNYVQKSGCNGFTDGEI 60
Query: 61 YSLAVHYYDEDNIEVGKYINCQVVVNHTIELTEEEKAEARKKAIERYQAEEYRKLTAK-K 119
+ A+HYY+E+ IEVGK ++CQVVVNH ++LT EEKAEAR+ A+ +YQ EE RKL + +
Sbjct: 61 FGQAIHYYEENEIEVGKPMDCQVVVNHVVKLTAEEKAEARQNAVRKYQEEELRKLQNRHR 120
Query: 120 PKLEKQVEQQIAQPSLFEF 138
P K+ + QPSLF+
Sbjct: 121 PSARKENQ---PQPSLFDL 136
>gi|160884348|ref|ZP_02065351.1| hypothetical protein BACOVA_02326 [Bacteroides ovatus ATCC 8483]
gi|156110087|gb|EDO11832.1| hypothetical protein BACOVA_02326 [Bacteroides ovatus ATCC 8483]
Length = 140
Score = 152 bits (383), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 75/140 (53%), Positives = 98/140 (70%), Gaps = 2/140 (1%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAESFRKENKNIDECITYILTEVQRMGCAGLSDEEV 60
MKGTE FK++I+NYL+ RA DELF + + ID+ +TYIL EV++ GC G SD EV
Sbjct: 1 MKGTEQFKEIIKNYLDNRAKEDELFRAKYETTERTIDDVVTYILNEVKQSGCCGFSDMEV 60
Query: 61 YSLAVHYYDEDNIEVGKYINCQVVVNHTIELTEEEKAEARKKAIERYQAEEYRKLTAK-- 118
+S+AVH DE +E+GK +NC VVVN I+LTEEEKAE + A++RYQ EE RKL +
Sbjct: 61 FSMAVHAIDELTLEIGKPVNCDVVVNRHIDLTEEEKAEQKALALKRYQEEELRKLQVRHS 120
Query: 119 KPKLEKQVEQQIAQPSLFEF 138
KP+ K E + QPSLF+F
Sbjct: 121 KPRTSKPQETKQPQPSLFDF 140
>gi|150005903|ref|YP_001300647.1| hypothetical protein BVU_3399 [Bacteroides vulgatus ATCC 8482]
gi|149934327|gb|ABR41025.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 143
Score = 146 bits (369), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 96/139 (69%), Gaps = 2/139 (1%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAESFRKENKNIDECITYILTEVQRMGCAGLSDEEV 60
MKGT+HFK++I+NYL+ RA DELF + ID+ + YI VQ+ GC G SD+EV
Sbjct: 1 MKGTDHFKELIKNYLDNRAKEDELFRAKYETTTHTIDDVVNYIFHAVQQSGCCGFSDDEV 60
Query: 61 YSLAVHYYDEDNIEVGKYINCQVVVNHTIELTEEEKAEARKKAIERYQAEEYRKLTAK-- 118
YS+AVH DE ++++GK +NC VVVNH IELTEEEKAE R A++RYQ EE RKL +
Sbjct: 61 YSMAVHAIDEPDLKIGKPMNCNVVVNHHIELTEEEKAEQRAIALKRYQEEEMRKLQQRNS 120
Query: 119 KPKLEKQVEQQIAQPSLFE 137
+PK K + I + SLF+
Sbjct: 121 RPKAAKPQSKPIQELSLFQ 139
>gi|150005375|ref|YP_001300119.1| hypothetical protein BVU_2851 [Bacteroides vulgatus ATCC 8482]
gi|149933799|gb|ABR40497.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 139
Score = 106 bits (265), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 61/141 (43%), Positives = 87/141 (61%), Gaps = 7/141 (4%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAESFRKENKNIDECITYILTEVQRMGCA-GLSDEE 59
M E FK I+ YL+ RA D LFA + E K+IDEC +YI+ E ++ G A +SDEE
Sbjct: 1 MASNESFKQAIKAYLDKRAEEDSLFAPKYANEKKSIDECCSYIMGEARKRGNAVAISDEE 60
Query: 60 VYSLAVHYYDEDNIEVGKYINC-QVVVNHT---IELTEEEKAEARKKAIERYQAEEYRKL 115
VY +AVHYYDED+I++ + + V+ + +ELTEE+K AR KAI R E+Y+ L
Sbjct: 61 VYGMAVHYYDEDDIKINRLPAGEKTSVSSSAKPVELTEEDKKAARDKAIARLAEEQYQTL 120
Query: 116 TAKKPKLEKQVEQQIAQPSLF 136
+K + K+ + + Q SLF
Sbjct: 121 --RKKNVRKKADDNVQQMSLF 139
>gi|53713760|ref|YP_099752.1| hypothetical protein BF2469 [Bacteroides fragilis YCH46]
gi|52216625|dbj|BAD49218.1| hypothetical protein [Bacteroides fragilis YCH46]
Length = 139
Score = 101 bits (252), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/135 (45%), Positives = 88/135 (65%), Gaps = 6/135 (4%)
Query: 7 FKDVIQNYLETRASYDELFAESFRKENKNIDECITYILTEVQRMGCA-GLSDEEVYSLAV 65
F+ I++YL+ RA DELFA+++ KENK+IDEC +YIL E ++ G A + D EV+ +AV
Sbjct: 6 FQAAIKSYLDERAKADELFAKAYNKENKSIDECCSYILGEAKKRGNAVAIFDAEVFGMAV 65
Query: 66 HYYDEDNIEVGKY-INCQVVVNH---TIELTEEEKAEARKKAIERYQAEEYRKLTAKKPK 121
HYYDEDNI+V K N V+ + LTEE+K +AR+ A+ R + E+Y L KKP
Sbjct: 66 HYYDEDNIKVEKIPANTGSSVSGLSASTVLTEEDKEKAREAALRRLEEEQY-ALLKKKPT 124
Query: 122 LEKQVEQQIAQPSLF 136
K+ ++ Q SLF
Sbjct: 125 RAKKEIIEVQQMSLF 139
>gi|146300808|ref|YP_001195399.1| hypothetical protein Fjoh_3062 [Flavobacterium johnsoniae UW101]
gi|146155226|gb|ABQ06080.1| hypothetical protein Fjoh_3062 [Flavobacterium johnsoniae UW101]
Length = 127
Score = 90.5 bits (223), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/90 (43%), Positives = 58/90 (64%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAESFRKENKNIDECITYILTEVQRMGCAGLSDEEV 60
MK +E FK I+NYL T A D FA+S KE KNI+ C +YI EV++ G ++E+
Sbjct: 1 MKPSEDFKTAIENYLNTTAQGDSAFAQSLAKETKNIESCCSYIFGEVKKTGLCAFDNQEI 60
Query: 61 YSLAVHYYDEDNIEVGKYINCQVVVNHTIE 90
+ +A+ YY++D+I INC+VVVN ++
Sbjct: 61 FDMALKYYNDDSIGAPAPINCKVVVNQPVK 90
>gi|146302120|ref|YP_001196711.1| hypothetical protein Fjoh_4385 [Flavobacterium johnsoniae UW101]
gi|146156538|gb|ABQ07392.1| hypothetical protein Fjoh_4385 [Flavobacterium johnsoniae UW101]
Length = 126
Score = 75.1 bits (183), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 31/86 (36%), Positives = 53/86 (61%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAESFRKENKNIDECITYILTEVQRMGCAGLSDEEV 60
MK ++ FK I++YL +A D LF+ ++KE+KN++ C YI EV++ G ++E+
Sbjct: 1 MKASDKFKTAIESYLSEKAQNDALFSADYKKESKNLESCFHYIFGEVKKTGECAFDNQEI 60
Query: 61 YSLAVHYYDEDNIEVGKYINCQVVVN 86
+ +AV YY +D I + C+V V+
Sbjct: 61 FDMAVKYYTDDTIGTPAPVQCRVAVS 86
>gi|146301322|ref|YP_001195913.1| hypothetical protein Fjoh_3580 [Flavobacterium johnsoniae UW101]
gi|146155740|gb|ABQ06594.1| hypothetical protein Fjoh_3580 [Flavobacterium johnsoniae UW101]
Length = 136
Score = 73.6 bits (179), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 52/90 (57%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAESFRKENKNIDECITYILTEVQRMGCAGLSDEEV 60
MKG+E FK I+ +L+ ++ D FA K +KN++ C+ YI++EV++ G D E+
Sbjct: 1 MKGSEKFKTRIEGFLKGKSLSDAAFAPMLEKASKNVENCLKYIISEVKKTGECAFDDSEI 60
Query: 61 YSLAVHYYDEDNIEVGKYINCQVVVNHTIE 90
+ +AV YY +D I I C+V N E
Sbjct: 61 FDMAVTYYTDDTIGNLPDIRCRVTTNQPKE 90
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
Posted date: May 10, 2008 4:54 AM
Number of letters in database: 884,634,002
Number of sequences in database: 2,620,852
Database: /apps/blastdb/nr.01
Posted date: May 10, 2008 4:52 AM
Number of letters in database: 976,814,986
Number of sequences in database: 2,761,530
Database: /apps/blastdb/nr.02
Posted date: May 10, 2008 4:46 AM
Number of letters in database: 360,829,861
Number of sequences in database: 1,132,722
Lambda K H
0.314 0.131 0.359
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 553,493,523
Number of Sequences: 6515104
Number of extensions: 21117159
Number of successful extensions: 89581
Number of sequences better than 1.0e-04: 11
Number of HSP's better than 0.0 without gapping: 10
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 89565
Number of HSP's gapped (non-prelim): 11
length of query: 138
length of database: 2,222,278,849
effective HSP length: 103
effective length of query: 35
effective length of database: 1,551,223,137
effective search space: 54292809795
effective search space used: 54292809795
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 116 (49.3 bits)