BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= FN0031
(271 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
gi|19703383|ref|NP_602945.1| hypothetical protein FN0031 [F...    474   e-132
gi|34763874|ref|ZP_00144779.1| hypothetical protein [Fusoba...    359   8e-98
gi|148322643|gb|EDK87893.1| hypothetical protein FNP_0074 [...    190   8e-47
gi|156935437|ref|YP_001439353.1| hypothetical protein ESA_0...     59   4e-07
gi|19705191|ref|NP_602686.1| hypothetical protein FN1886 [F...     57   1e-06
>gi|19703383|ref|NP_602945.1| hypothetical protein FN0031 [Fusobacterium nucleatum subsp.
nucleatum ATCC 25586]
gi|19713449|gb|AAL94244.1| unknown [Fusobacterium nucleatum subsp. nucleatum ATCC 25586]
Length = 271
Score = 474 bits (1220), Expect = e-132, Method: Composition-based stats.
Identities = 270/271 (99%), Positives = 271/271 (100%)
Query: 1 VFFISNSLFAATNDKHIFYLDNPTTKNIEITLDNKVYKLKPKTYEVLNLKMGQHIAELSD 60
+FFISNSLFAATNDKHIFYLDNPTTKNIEITLDNKVYKLKPKTYEVLNLKMGQHIAELSD
Sbjct: 1 MFFISNSLFAATNDKHIFYLDNPTTKNIEITLDNKVYKLKPKTYEVLNLKMGQHIAELSD 60
Query: 61 GTKVYFKIFANSKGGIINPSGATYTINYFRYQSPRICVDWSEPEDTVLLTFDDFIIDKNY 120
GTKVYFKIFANSKGGIINPSGATYTINYFRYQSPRICVDWSEPEDTVLLTFDDFIIDKNY
Sbjct: 61 GTKVYFKIFANSKGGIINPSGATYTINYFRYQSPRICVDWSEPEDTVLLTFDDFIIDKNY 120
Query: 121 IAWEYDIFEEVTRESMPKKLSPEVDIYVFTKIYSPSEFKDIDYDIEKPKANLPKIDSDYN 180
IAWEYDIFEEVTRESMPKKLSPEVDIYVFTKIYSPSEFKDIDYDIEKPKANLPKIDSDYN
Sbjct: 121 IAWEYDIFEEVTRESMPKKLSPEVDIYVFTKIYSPSEFKDIDYDIEKPKANLPKIDSDYN 180
Query: 181 IPNNEDKTFQNYIKQIINLDKTYKDTNDAKKQKKILKEYDKIAKIIWSEYPKYNIAQGSY 240
IPNNEDKTFQNYIKQIINLDKTYKDTNDAKKQKKILKEYDKIAKIIWSEYPKYNIAQGSY
Sbjct: 181 IPNNEDKTFQNYIKQIINLDKTYKDTNDAKKQKKILKEYDKIAKIIWSEYPKYNIAQGSY 240
Query: 241 DNVDLKALNLKSLDRGVIITKIEDDLNDRKR 271
DNVDLKALNLKSLDRGVIITKIEDDLNDRKR
Sbjct: 241 DNVDLKALNLKSLDRGVIITKIEDDLNDRKR 271
>gi|34763874|ref|ZP_00144779.1| hypothetical protein [Fusobacterium nucleatum subsp. vincentii ATCC
49256]
gi|27886358|gb|EAA23628.1| hypothetical protein [Fusobacterium nucleatum subsp. vincentii ATCC
49256]
Length = 272
Score = 359 bits (922), Expect = 8e-98, Method: Composition-based stats.
Identities = 202/255 (79%), Positives = 229/255 (89%), Gaps = 4/255 (1%)
Query: 11 ATNDKHIFYLDNPTTKNIEITLDNKVYKLKPKTYEVLNLKMGQHIAELSDGTKVYFKIFA 70
+TN ++IFYLDNP+ K+IEI LD+K+YKLKPKTYE+LNLKMG+HIAELS+G KVYFKIFA
Sbjct: 21 STNKEYIFYLDNPSNKDIEIILDSKIYKLKPKTYEILNLKMGEHIAELSNGAKVYFKIFA 80
Query: 71 NSKGGIINPSGATYTI-NYFRYQSPRICVDWSEPEDTVLLTFDDFIIDKNYIAWEYDIFE 129
NSKGGIINP+GATYTI N RYQSPR+CVDW+EP++T L T DDFIIDKNYI WEYDIFE
Sbjct: 81 NSKGGIINPTGATYTIDNSIRYQSPRVCVDWAEPKNTTLSTIDDFIIDKNYIDWEYDIFE 140
Query: 130 EVTRESMPKKLSPEVDIYVFTKIYSPSEFKDIDYDIEKPKANLPKIDSDYNIPNNEDKTF 189
E T ESMPKKL P VDIYVFTKIYSPSEFKD++YDIEKPKANLPKI+SDYNIPNN+D+TF
Sbjct: 141 EATNESMPKKLPPNVDIYVFTKIYSPSEFKDVNYDIEKPKANLPKIESDYNIPNNKDETF 200
Query: 190 QNYIKQIINLDKTYKDTNDAKKQKKILKEYDKIAKIIWSEYPKYNIAQGSYDNVDLKALN 249
QNYIKQII LDK YKDTND KKQ KILKEYDKIAKI+W KY+ +G+YD V+LK+LN
Sbjct: 201 QNYIKQIIELDKAYKDTNDVKKQNKILKEYDKIAKILWL---KYSFGEGTYDKVNLKSLN 257
Query: 250 LKSLDRGVIITKIED 264
LKSLDRGVIITKIE+
Sbjct: 258 LKSLDRGVIITKIEE 272
>gi|148322643|gb|EDK87893.1| hypothetical protein FNP_0074 [Fusobacterium nucleatum subsp.
polymorphum ATCC 10953]
Length = 272
Score = 190 bits (482), Expect = 8e-47, Method: Composition-based stats.
Identities = 128/271 (47%), Positives = 182/271 (67%), Gaps = 37/271 (13%)
Query: 11 ATNDKHIFYLDNPTTKNIEITLDNKVYKLKPKTYEVLNLKMGQHIAELSDGTKVYFKIFA 70
+ +++HIFYLDNP+ ++IEI LD++ YKLKP+T+E+L LK G+HI ELSD K+YFKIF
Sbjct: 21 SNDNEHIFYLDNPSNEDIEIILDSQTYKLKPRTFEILKLKTGEHIVELSDKRKIYFKIFE 80
Query: 71 NSKGGIINPSGATYTINYF-RYQSPRICVDWSEPEDTVL----LTF-------DDFIIDK 118
NSKGGIINP+ ++Y NYF Y +P I V+W+EPE + L+F +DFIIDK
Sbjct: 81 NSKGGIINPTTSSYVFNYFIPYSAPNIFVEWAEPEINTISIDNLSFTGTYKVVNDFIIDK 140
Query: 119 NYIAWEYDIFEEVTRESMPKKLSPEVDIYVFTKIYSPSEF---KDIDY-DIEKPKANLPK 174
NYI W++DIFEE+ + K+SPE++I +F+KI++P EF ++++Y DI K K NL +
Sbjct: 141 NYIDWDFDIFEEINFD----KISPEIEIKLFSKIFNPYEFLKQENLEYKDINKTKLNL-E 195
Query: 175 IDSDYNIPNNEDKTFQNYIKQIINLDKTYKDTNDAKKQKKILKEYDKIAKIIWSEY--PK 232
I DY+IPN EDK + Y+++II LDK YK ND +QKK+L++Y+ + E P
Sbjct: 196 IKEDYSIPNFEDKKVKKYVEEIIELDKVYKVENDENEQKKLLEKYNTLVDKYIDEVGAPV 255
Query: 233 YNIAQGSYDNVDLKALNLKSLDRGVIITKIE 263
Y + + LD+GVIITK+E
Sbjct: 256 Y--------------IKILYLDKGVIITKVE 272
>gi|156935437|ref|YP_001439353.1| hypothetical protein ESA_03296 [Enterobacter sakazakii ATCC
BAA-894]
gi|156533691|gb|ABU78517.1| hypothetical protein ESA_03296 [Enterobacter sakazakii ATCC
BAA-894]
Length = 299
Score = 58.5 bits (140), Expect = 4e-07, Method: Composition-based stats.
Identities = 64/241 (26%), Positives = 98/241 (40%), Gaps = 35/241 (14%)
Query: 14 DKHIFYLDNPTTKNIEITLDNKVYKLKPKTYEVLNLKMGQHIAELSDGTKVYFKIFA--- 70
D F +DNPT K + I++D++ L + + + L GQH L +G KV F +F+
Sbjct: 34 DSKEFLIDNPTGKPLAISVDDQKITLAAEQSQTIKLDAGQHTLTLENGDKVKFSVFSAMP 93
Query: 71 -NSKGGIINPSGATYTINYFRYQSPRICVDWSEPEDTVLLTFD-------------DFII 116
+ G+INP+ Y +Y + + SE D LT D I
Sbjct: 94 RSGVSGLINPTRTRYIYVIQKYLAEGVTPS-SENGDVHTLTIDGQTVTGPFEDMGSGLFI 152
Query: 117 DKNYIAWEYDIFEEVTRESMPKKLSPEVDIYVFTKIYSPSEFKDIDYDIEKPKA----NL 172
D WE + E P+ +S TK++ EFKD + P N+
Sbjct: 153 DNFTKEWELN-----PTEPFPESMSSTSADNYKTKLFRLEEFKDYYNNQFSPSVEYTENM 207
Query: 173 PKIDSDYNIPNNEDKTFQNYIKQIINLDKTYKDTND------AKKQKKILKEYDKIAKII 226
+S Y P + ++Q NL++ K ND A QK +LK +DK +
Sbjct: 208 RITESRYQPPEITAQFTSPELQQ--NLNEATKIYNDFIHAESAGDQKDLLKAFDKQNREK 265
Query: 227 W 227
W
Sbjct: 266 W 266
>gi|19705191|ref|NP_602686.1| hypothetical protein FN1886 [Fusobacterium nucleatum subsp.
nucleatum ATCC 25586]
gi|19713134|gb|AAL93985.1| unknown [Fusobacterium nucleatum subsp. nucleatum ATCC 25586]
Length = 91
Score = 57.4 bits (137), Expect = 1e-06, Method: Composition-based stats.
Identities = 29/74 (39%), Positives = 47/74 (63%), Gaps = 1/74 (1%)
Query: 15 KHIFYLDNPTTKNIEITLDNKVYKLKPKTYEVLNLKMGQHIAELSDGTKVYFKIFANSKG 74
K F LDNP+ + I +T+D K + L KT+E + L +G+H E +D KV F ++++SKG
Sbjct: 14 KSTFTLDNPSDEKITVTIDGKEHSLDAKTHEKVELTVGEHTVE-NDKYKVAFTVYSDSKG 72
Query: 75 GIINPSGATYTINY 88
GII ++T ++
Sbjct: 73 GIIELHPKSWTQDW 86
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.316 0.137 0.396
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,086,858,942
Number of Sequences: 5470121
Number of extensions: 50567065
Number of successful extensions: 152005
Number of sequences better than 1.0e-05: 5
Number of HSP's better than 0.0 without gapping: 3
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 151994
Number of HSP's gapped (non-prelim): 5
length of query: 271
length of database: 1,894,087,724
effective HSP length: 131
effective length of query: 140
effective length of database: 1,177,501,873
effective search space: 164850262220
effective search space used: 164850262220
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 128 (53.9 bits)