BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= PI0277
(138 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|156861333|gb|EDO54764.1| hypothetical protein BACUNI_017... 149 4e-35
gi|46242770|gb|AAS83475.1| Ctn002 [Bacteroides fragilis] 145 6e-34
gi|53711397|ref|YP_097389.1| hypothetical protein BF0106 [B... 134 1e-30
gi|150005903|ref|YP_001300647.1| hypothetical protein BVU_3... 130 3e-29
gi|156110087|gb|EDO11832.1| hypothetical protein BACOVA_023... 130 3e-29
gi|156859264|gb|EDO52695.1| hypothetical protein BACUNI_034... 129 4e-29
gi|150005375|ref|YP_001300119.1| hypothetical protein BVU_2... 87 2e-16
gi|53713760|ref|YP_099752.1| hypothetical protein BF2469 [B... 86 5e-16
gi|146300808|ref|YP_001195399.1| hypothetical protein Fjoh_... 84 3e-15
gi|146302120|ref|YP_001196711.1| hypothetical protein Fjoh_... 71 2e-11
gi|146301322|ref|YP_001195913.1| hypothetical protein Fjoh_... 69 6e-11
>gi|156861333|gb|EDO54764.1| hypothetical protein BACUNI_01749 [Bacteroides uniformis ATCC 8492]
Length = 139
Score = 149 bits (377), Expect = 4e-35, Method: Composition-based stats.
Identities = 84/138 (60%), Positives = 101/138 (73%), Gaps = 1/138 (0%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAENFRKENKNIDECITYILTEVQKMGCAGLSDEEV 60
MKGT+HFK IQ YLE RA D LFA+N+R KNID+C+TYIL VQK GC G +D E+
Sbjct: 1 MKGTDHFKRTIQMYLEQRAEEDTLFAKNYRNPAKNIDDCVTYILNYVQKSGCNGFTDGEI 60
Query: 61 YSLAVHYYDEDNIEVGKSINCQVVVNHTIELTEEEKAEARKKAIERYQAEEYRKLTAKKP 120
Y AVHYYDE+ IEVGK I CQV VNH +ELT EEKAEAR++A+ RYQ EE RKL +
Sbjct: 61 YGQAVHYYDENEIEVGKPIQCQVAVNHIVELTAEEKAEARQQAVRRYQDEELRKLQNRTK 120
Query: 121 KAEKQVEQQIAQPSLFEF 138
+ + E Q+ QPSLF+F
Sbjct: 121 PTKAKTETQV-QPSLFDF 137
>gi|46242770|gb|AAS83475.1| Ctn002 [Bacteroides fragilis]
Length = 142
Score = 145 bits (367), Expect = 6e-34, Method: Composition-based stats.
Identities = 84/140 (60%), Positives = 102/140 (72%), Gaps = 2/140 (1%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAENFRKENKNIDECITYILTEVQKMGCAGLSDEEV 60
MK T+HFK IQ YLE RA+ D LFA+N+R KNID+C+TYIL VQK GC G +D E+
Sbjct: 1 MKTTDHFKRTIQMYLEQRAAEDALFAKNYRNPAKNIDDCVTYILNYVQKSGCNGFTDGEI 60
Query: 61 YSLAVHYYDEDNIEVGKSINCQVVVNHTIELTEEEKAEARKKAIERYQAEEYRKLTAK-K 119
Y AVHYYDE+ IEVGK I CQV VNH +ELT EEKAEAR+ AI +YQ EE R+L + K
Sbjct: 61 YGQAVHYYDENEIEVGKPIQCQVAVNHVVELTAEEKAEARQNAIRQYQDEEVRRLQNRNK 120
Query: 120 PK-AEKQVEQQIAQPSLFEF 138
P+ A K Q++ QPSLF+
Sbjct: 121 PRTATKATVQEVQQPSLFDL 140
>gi|53711397|ref|YP_097389.1| hypothetical protein BF0106 [Bacteroides fragilis YCH46]
gi|154490906|ref|ZP_02030847.1| hypothetical protein PARMER_00823 [Parabacteroides merdae ATCC
43184]
gi|52214262|dbj|BAD46855.1| hypothetical protein [Bacteroides fragilis YCH46]
gi|154088654|gb|EDN87698.1| hypothetical protein PARMER_00823 [Parabacteroides merdae ATCC
43184]
Length = 138
Score = 134 bits (337), Expect = 1e-30, Method: Composition-based stats.
Identities = 76/139 (54%), Positives = 100/139 (71%), Gaps = 4/139 (2%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAENFRKENKNIDECITYILTEVQKMGCAGLSDEEV 60
MKGT+HFK I YLE RA D LFA+ +R KN+DEC+T+IL VQK GC G +D E+
Sbjct: 1 MKGTDHFKRTIYMYLEQRAEEDALFAKKYRNPAKNMDECVTHILNYVQKSGCNGFTDGEI 60
Query: 61 YSLAVHYYDEDNIEVGKSINCQVVVNHTIELTEEEKAEARKKAIERYQAEEYRKLTAK-K 119
+ A+HYY+E+ IEVGK ++CQVVVNH ++LT EEKAEAR+ A+ +YQ EE RKL + +
Sbjct: 61 FGQAIHYYEENEIEVGKPMDCQVVVNHVVKLTAEEKAEARQNAVRKYQEEELRKLQNRHR 120
Query: 120 PKAEKQVEQQIAQPSLFEF 138
P A K+ + QPSLF+
Sbjct: 121 PSARKENQ---PQPSLFDL 136
>gi|150005903|ref|YP_001300647.1| hypothetical protein BVU_3399 [Bacteroides vulgatus ATCC 8482]
gi|149934327|gb|ABR41025.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 143
Score = 130 bits (327), Expect = 3e-29, Method: Composition-based stats.
Identities = 74/139 (53%), Positives = 97/139 (69%), Gaps = 2/139 (1%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAENFRKENKNIDECITYILTEVQKMGCAGLSDEEV 60
MKGT+HFK++I+NYL+ RA DELF + ID+ + YI VQ+ GC G SD+EV
Sbjct: 1 MKGTDHFKELIKNYLDNRAKEDELFRAKYETTTHTIDDVVNYIFHAVQQSGCCGFSDDEV 60
Query: 61 YSLAVHYYDEDNIEVGKSINCQVVVNHTIELTEEEKAEARKKAIERYQAEEYRKLTAK-- 118
YS+AVH DE ++++GK +NC VVVNH IELTEEEKAE R A++RYQ EE RKL +
Sbjct: 61 YSMAVHAIDEPDLKIGKPMNCNVVVNHHIELTEEEKAEQRAIALKRYQEEEMRKLQQRNS 120
Query: 119 KPKAEKQVEQQIAQPSLFE 137
+PKA K + I + SLF+
Sbjct: 121 RPKAAKPQSKPIQELSLFQ 139
>gi|156110087|gb|EDO11832.1| hypothetical protein BACOVA_02326 [Bacteroides ovatus ATCC 8483]
Length = 140
Score = 130 bits (326), Expect = 3e-29, Method: Composition-based stats.
Identities = 75/140 (53%), Positives = 98/140 (70%), Gaps = 2/140 (1%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAENFRKENKNIDECITYILTEVQKMGCAGLSDEEV 60
MKGTE FK++I+NYL+ RA DELF + + ID+ +TYIL EV++ GC G SD EV
Sbjct: 1 MKGTEQFKEIIKNYLDNRAKEDELFRAKYETTERTIDDVVTYILNEVKQSGCCGFSDMEV 60
Query: 61 YSLAVHYYDEDNIEVGKSINCQVVVNHTIELTEEEKAEARKKAIERYQAEEYRKLTAK-- 118
+S+AVH DE +E+GK +NC VVVN I+LTEEEKAE + A++RYQ EE RKL +
Sbjct: 61 FSMAVHAIDELTLEIGKPVNCDVVVNRHIDLTEEEKAEQKALALKRYQEEELRKLQVRHS 120
Query: 119 KPKAEKQVEQQIAQPSLFEF 138
KP+ K E + QPSLF+F
Sbjct: 121 KPRTSKPQETKQPQPSLFDF 140
>gi|156859264|gb|EDO52695.1| hypothetical protein BACUNI_03490 [Bacteroides uniformis ATCC 8492]
Length = 141
Score = 129 bits (325), Expect = 4e-29, Method: Composition-based stats.
Identities = 75/139 (53%), Positives = 99/139 (71%), Gaps = 2/139 (1%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAENFRKENKNIDECITYILTEVQKMGCAGLSDEEV 60
M TE+FK IQ YLE RA DELFA + +KNID+C+TYIL VQK GC G D+E+
Sbjct: 1 MNTTEYFKRTIQAYLEERAMEDELFAAKYDNPDKNIDDCVTYILNWVQKSGCNGFCDDEI 60
Query: 61 YSLAVHYYDEDNIEVGKSINCQVVVNHTIELTEEEKAEARKKAIERYQAEEYRKLTAKK- 119
Y A+HYY+E +IEVGK +NCQV VNH IELTEEEKA+AR++AI +YQ E+ K+ +
Sbjct: 61 YGQAIHYYEEKDIEVGKPLNCQVSVNHHIELTEEEKAQARQEAIRQYQQEQMNKMRNRDT 120
Query: 120 -PKAEKQVEQQIAQPSLFE 137
+ ++ E ++ QPSLF+
Sbjct: 121 AKRTSQRTETEVHQPSLFD 139
>gi|150005375|ref|YP_001300119.1| hypothetical protein BVU_2851 [Bacteroides vulgatus ATCC 8482]
gi|149933799|gb|ABR40497.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 139
Score = 87.4 bits (215), Expect = 2e-16, Method: Composition-based stats.
Identities = 61/141 (43%), Positives = 84/141 (59%), Gaps = 7/141 (4%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAENFRKENKNIDECITYILTEVQKMGCA-GLSDEE 59
M E FK I+ YL+ RA D LFA + E K+IDEC +YI+ E +K G A +SDEE
Sbjct: 1 MASNESFKQAIKAYLDKRAEEDSLFAPKYANEKKSIDECCSYIMGEARKRGNAVAISDEE 60
Query: 60 VYSLAVHYYDEDNIEVGKSINCQVVVNHT----IELTEEEKAEARKKAIERYQAEEYRKL 115
VY +AVHYYDED+I++ + + + +ELTEE+K AR KAI R E+Y+ L
Sbjct: 61 VYGMAVHYYDEDDIKINRLPAGEKTSVSSSAKPVELTEEDKKAARDKAIARLAEEQYQTL 120
Query: 116 TAKKPKAEKQVEQQIAQPSLF 136
+K K+ + + Q SLF
Sbjct: 121 --RKKNVRKKADDNVQQMSLF 139
>gi|53713760|ref|YP_099752.1| hypothetical protein BF2469 [Bacteroides fragilis YCH46]
gi|52216625|dbj|BAD49218.1| hypothetical protein [Bacteroides fragilis YCH46]
Length = 139
Score = 86.3 bits (212), Expect = 5e-16, Method: Composition-based stats.
Identities = 60/135 (44%), Positives = 90/135 (66%), Gaps = 6/135 (4%)
Query: 7 FKDVIQNYLETRASYDELFAENFRKENKNIDECITYILTEVQKMGCA-GLSDEEVYSLAV 65
F+ I++YL+ RA DELFA+ + KENK+IDEC +YIL E +K G A + D EV+ +AV
Sbjct: 6 FQAAIKSYLDERAKADELFAKAYNKENKSIDECCSYILGEAKKRGNAVAIFDAEVFGMAV 65
Query: 66 HYYDEDNIEVGK----SINCQVVVNHTIELTEEEKAEARKKAIERYQAEEYRKLTAKKPK 121
HYYDEDNI+V K + + ++ + LTEE+K +AR+ A+ R + E+Y L K +
Sbjct: 66 HYYDEDNIKVEKIPANTGSSVSGLSASTVLTEEDKEKAREAALRRLEEEQYALLKKKPTR 125
Query: 122 AEKQVEQQIAQPSLF 136
A+K++ ++ Q SLF
Sbjct: 126 AKKEI-IEVQQMSLF 139
>gi|146300808|ref|YP_001195399.1| hypothetical protein Fjoh_3062 [Flavobacterium johnsoniae UW101]
gi|146155226|gb|ABQ06080.1| hypothetical protein Fjoh_3062 [Flavobacterium johnsoniae UW101]
Length = 127
Score = 84.0 bits (206), Expect = 3e-15, Method: Composition-based stats.
Identities = 39/90 (43%), Positives = 58/90 (64%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAENFRKENKNIDECITYILTEVQKMGCAGLSDEEV 60
MK +E FK I+NYL T A D FA++ KE KNI+ C +YI EV+K G ++E+
Sbjct: 1 MKPSEDFKTAIENYLNTTAQGDSAFAQSLAKETKNIESCCSYIFGEVKKTGLCAFDNQEI 60
Query: 61 YSLAVHYYDEDNIEVGKSINCQVVVNHTIE 90
+ +A+ YY++D+I INC+VVVN ++
Sbjct: 61 FDMALKYYNDDSIGAPAPINCKVVVNQPVK 90
>gi|146302120|ref|YP_001196711.1| hypothetical protein Fjoh_4385 [Flavobacterium johnsoniae UW101]
gi|146156538|gb|ABQ07392.1| hypothetical protein Fjoh_4385 [Flavobacterium johnsoniae UW101]
Length = 126
Score = 70.9 bits (172), Expect = 2e-11, Method: Composition-based stats.
Identities = 32/86 (37%), Positives = 54/86 (62%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAENFRKENKNIDECITYILTEVQKMGCAGLSDEEV 60
MK ++ FK I++YL +A D LF+ +++KE+KN++ C YI EV+K G ++E+
Sbjct: 1 MKASDKFKTAIESYLSEKAQNDALFSADYKKESKNLESCFHYIFGEVKKTGECAFDNQEI 60
Query: 61 YSLAVHYYDEDNIEVGKSINCQVVVN 86
+ +AV YY +D I + C+V V+
Sbjct: 61 FDMAVKYYTDDTIGTPAPVQCRVAVS 86
>gi|146301322|ref|YP_001195913.1| hypothetical protein Fjoh_3580 [Flavobacterium johnsoniae UW101]
gi|146155740|gb|ABQ06594.1| hypothetical protein Fjoh_3580 [Flavobacterium johnsoniae UW101]
Length = 136
Score = 69.3 bits (168), Expect = 6e-11, Method: Composition-based stats.
Identities = 34/90 (37%), Positives = 52/90 (57%)
Query: 1 MKGTEHFKDVIQNYLETRASYDELFAENFRKENKNIDECITYILTEVQKMGCAGLSDEEV 60
MKG+E FK I+ +L+ ++ D FA K +KN++ C+ YI++EV+K G D E+
Sbjct: 1 MKGSEKFKTRIEGFLKGKSLSDAAFAPMLEKASKNVENCLKYIISEVKKTGECAFDDSEI 60
Query: 61 YSLAVHYYDEDNIEVGKSINCQVVVNHTIE 90
+ +AV YY +D I I C+V N E
Sbjct: 61 FDMAVTYYTDDTIGNLPDIRCRVTTNQPKE 90
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.312 0.129 0.353
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 472,878,462
Number of Sequences: 5470121
Number of extensions: 17961102
Number of successful extensions: 83126
Number of sequences better than 1.0e-05: 11
Number of HSP's better than 0.0 without gapping: 11
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 83108
Number of HSP's gapped (non-prelim): 11
length of query: 138
length of database: 1,894,087,724
effective HSP length: 103
effective length of query: 35
effective length of database: 1,330,665,261
effective search space: 46573284135
effective search space used: 46573284135
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 124 (52.4 bits)
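
The length and score figures in the statistics block above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the BLAST output. It assumes that the first Lambda/K/H row holds the ungapped parameters and the second row the gapped ones (the usual order in legacy BLAST reports), and that bit scores follow the standard Karlin-Altschul conversion S' = (Lambda * S - ln K) / ln 2. The variable and function names are illustrative only.

```python
import math

# Values copied from the statistics block of this report.
query_len = 138              # "length of query"
db_len = 1_894_087_724       # "length of database"
num_seqs = 5_470_121         # "Number of Sequences"
hsp_len = 103                # "effective HSP length"

# The effective lengths subtract the HSP length correction from the
# query and from every database sequence.
eff_query = query_len - hsp_len
eff_db = db_len - num_seqs * hsp_len
search_space = eff_query * eff_db

def bit_score(raw, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)

# Assumption: first Lambda/K row is ungapped, second row is gapped.
ungapped = (0.312, 0.129)
gapped = (0.267, 0.0410)

print(eff_query)      # 35, matches "effective length of query"
print(eff_db)         # 1330665261, matches "effective length of database"
print(search_space)   # 46573284135, matches "effective search space"
print(round(bit_score(42, *ungapped), 1))   # 21.9, matches S1
print(round(bit_score(124, *gapped), 1))    # 52.4, matches S2
```

The per-alignment bit scores reported with "Method: Composition-based stats." may not reproduce exactly from these database-wide parameters, since that method rescales the scoring for each query-subject pair.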