BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SSA_0685
(106 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|125717539|ref|YP_001034672.1| hypothetical protein SSA_0... 141 1e-32
gi|149006725|ref|ZP_01830411.1| thiol peroxidase [Streptoco... 77 3e-13
gi|148984093|ref|ZP_01817388.1| thiol peroxidase [Streptoco... 72 1e-11
gi|149011481|ref|ZP_01832728.1| integral membrane protein [... 69 7e-11
gi|148993772|ref|ZP_01823199.1| integral membrane protein [... 69 9e-11
gi|28377503|ref|NP_784395.1| prophage Lp1 protein 6 [Lactob... 66 8e-10
gi|28377745|ref|NP_784637.1| integral membrane protein [Lac... 65 1e-09
gi|149021250|ref|ZP_01835496.1| integral membrane protein [... 64 2e-09
gi|125717324|ref|YP_001034457.1| hypothetical protein SSA_0... 59 7e-08
gi|685120|gb|AAB31986.1| insulin activator factor; INSAF [H... 59 8e-08
>gi|125717539|ref|YP_001034672.1| hypothetical protein SSA_0685 [Streptococcus sanguinis SK36]
gi|125497456|gb|ABN44122.1| Hypothetical protein SSA_0685 [Streptococcus sanguinis SK36]
Length = 106
Score = 141 bits (356), Expect = 1e-32, Method: Composition-based stats.
Identities = 106/106 (100%), Positives = 106/106 (100%)
Query: 1 MKTKTLALVNGLVGLIGGIILLLWPFFIWVIYFMLGVFDILKISILALGIAGIVYYKDDN 60
MKTKTLALVNGLVGLIGGIILLLWPFFIWVIYFMLGVFDILKISILALGIAGIVYYKDDN
Sbjct: 1 MKTKTLALVNGLVGLIGGIILLLWPFFIWVIYFMLGVFDILKISILALGIAGIVYYKDDN 60
Query: 61 RVGPAGSILMIVGGIFTFNDFLGWIGAILSIIGGSLYLASLKRFQA 106
RVGPAGSILMIVGGIFTFNDFLGWIGAILSIIGGSLYLASLKRFQA
Sbjct: 61 RVGPAGSILMIVGGIFTFNDFLGWIGAILSIIGGSLYLASLKRFQA 106
>gi|149006725|ref|ZP_01830411.1| thiol peroxidase [Streptococcus pneumoniae SP18-BS74]
gi|147761640|gb|EDK68604.1| thiol peroxidase [Streptococcus pneumoniae SP18-BS74]
Length = 126
Score = 77.4 bits (189), Expect = 3e-13, Method: Composition-based stats.
Identities = 59/124 (47%), Positives = 77/124 (62%), Gaps = 18/124 (14%)
Query: 1 MKTKTLALVNGLVGLIGGIILLLWPFFIWVIYFMLG------------------VFDILK 42
MK KTLAL++G+VGL+GGI+LL+ PF + + + LK
Sbjct: 1 MKLKTLALISGIVGLVGGILLLIGPFVLLGTTVNTAATTLNGGATAGAFSGVALLLNALK 60
Query: 43 ISILALGIAGIVYYKDDNRVGPAGSILMIVGGIFTFNDFLGWIGAILSIIGGSLYLASLK 102
I L LGI IVYYK D RVG A S+LMIV G + FLGW+G IL+IIGGSL+LA+LK
Sbjct: 61 IENLVLGIIAIVYYKGDKRVGAAPSVLMIVSGGVSLIPFLGWVGGILAIIGGSLFLATLK 120
Query: 103 RFQA 106
+F++
Sbjct: 121 KFKS 124
>gi|148984093|ref|ZP_01817388.1| thiol peroxidase [Streptococcus pneumoniae SP3-BS71]
gi|147923382|gb|EDK74495.1| thiol peroxidase [Streptococcus pneumoniae SP3-BS71]
Length = 126
Score = 72.0 bits (175), Expect = 1e-11, Method: Composition-based stats.
Identities = 57/124 (45%), Positives = 77/124 (62%), Gaps = 18/124 (14%)
Query: 1 MKTKTLALVNGLVGLIGGIILLLWPFFIW------------------VIYFMLGVFDILK 42
MK KTLAL++G+VGL+GGI+LL+ F + + + + LK
Sbjct: 1 MKLKTLALISGIVGLVGGILLLIGSFVLLGTAVNTAATTLNGGATAGAFSGVALLLNALK 60
Query: 43 ISILALGIAGIVYYKDDNRVGPAGSILMIVGGIFTFNDFLGWIGAILSIIGGSLYLASLK 102
I+ L LGI IVYYK D RVG A +LMIV G + FLGW+G IL+IIGGSL+LA+LK
Sbjct: 61 IANLVLGIIAIVYYKGDKRVGAAPYVLMIVSGGVSLIPFLGWVGGILAIIGGSLFLATLK 120
Query: 103 RFQA 106
+F++
Sbjct: 121 KFKS 124
>gi|149011481|ref|ZP_01832728.1| integral membrane protein [Streptococcus pneumoniae SP19-BS75]
gi|147764471|gb|EDK71402.1| integral membrane protein [Streptococcus pneumoniae SP19-BS75]
Length = 120
Score = 69.3 bits (168), Expect = 7e-11, Method: Composition-based stats.
Identities = 52/115 (45%), Positives = 71/115 (61%), Gaps = 18/115 (15%)
Query: 10 NGLVGLIGGIILLLWPFFIW------------------VIYFMLGVFDILKISILALGIA 51
+G+VGL+GGI+LL+ PF + + + + LKI+ L LGI
Sbjct: 4 SGIVGLVGGILLLICPFVLLGTAVNTAATTLNGGATAGAFSGVALLLNALKIANLVLGII 63
Query: 52 GIVYYKDDNRVGPAGSILMIVGGIFTFNDFLGWIGAILSIIGGSLYLASLKRFQA 106
IVYYK D RVG A S+LMIV G + FLGW+G IL+IIGGSL+LA+LK+F++
Sbjct: 64 AIVYYKGDKRVGAAPSVLMIVSGGVSLIPFLGWVGGILAIIGGSLFLATLKKFKS 118
>gi|148993772|ref|ZP_01823199.1| integral membrane protein [Streptococcus pneumoniae SP9-BS68]
gi|147927728|gb|EDK78752.1| integral membrane protein [Streptococcus pneumoniae SP9-BS68]
Length = 120
Score = 68.9 bits (167), Expect = 9e-11, Method: Composition-based stats.
Identities = 51/115 (44%), Positives = 70/115 (60%), Gaps = 18/115 (15%)
Query: 10 NGLVGLIGGIILLLWPFFIW------------------VIYFMLGVFDILKISILALGIA 51
+G+VGL+GGI+LL+ PF + + + + LKI+ L LGI
Sbjct: 4 SGIVGLVGGILLLICPFVLLGTAVNTAATTLNGGATAGAFSGVALLLNTLKIANLVLGII 63
Query: 52 GIVYYKDDNRVGPAGSILMIVGGIFTFNDFLGWIGAILSIIGGSLYLASLKRFQA 106
IVYYK D VG A S+LMIV G + FLGW+G IL+IIGGSL+LA+LK+F++
Sbjct: 64 AIVYYKGDKHVGAAPSVLMIVSGEVSLIPFLGWVGGILAIIGGSLFLATLKKFKS 118
>gi|28377503|ref|NP_784395.1| prophage Lp1 protein 6 [Lactobacillus plantarum WCFS1]
gi|28270335|emb|CAD63236.1| prophage Lp1 protein 6 [Lactobacillus plantarum WCFS1]
Length = 122
Score = 65.9 bits (159), Expect = 8e-10, Method: Composition-based stats.
Identities = 60/117 (51%), Positives = 76/117 (64%), Gaps = 13/117 (11%)
Query: 1 MKTKTLALVNGLVGLIGGIILLLWPFFI-------------WVIYFMLGVFDILKISILA 47
MKTK LAL NG+VGL+GGIILL +F+ + + +ILKI+ILA
Sbjct: 1 MKTKNLALTNGIVGLVGGIILLFGGWFVAGGALSDAATGSATSTSGTVALLNILKIAILA 60
Query: 48 LGIAGIVYYKDDNRVGPAGSILMIVGGIFTFNDFLGWIGAILSIIGGSLYLASLKRF 104
LGI ++YYK D+RV A +L+IVGG FLGWIG I++IIGGSLYLA+LK F
Sbjct: 61 LGIIALIYYKGDSRVNTAPGVLLIVGGAIALIPFLGWIGGIIAIIGGSLYLAALKNF 117
>gi|28377745|ref|NP_784637.1| integral membrane protein [Lactobacillus plantarum WCFS1]
gi|28270578|emb|CAD63482.1| integral membrane protein [Lactobacillus plantarum WCFS1]
Length = 122
Score = 65.1 bits (157), Expect = 1e-09, Method: Composition-based stats.
Identities = 63/117 (53%), Positives = 74/117 (63%), Gaps = 13/117 (11%)
Query: 1 MKTKTLALVNGLVGLIGGIILLLWPFFIW-------------VIYFMLGVFDILKISILA 47
MK+K LAL NG+VGL+GGIILL +FI +ILKI+ILA
Sbjct: 1 MKSKNLALTNGIVGLVGGIILLFGGWFIAGGIVSDAATGSVSNTSGAAAFLNILKIAILA 60
Query: 48 LGIAGIVYYKDDNRVGPAGSILMIVGGIFTFNDFLGWIGAILSIIGGSLYLASLKRF 104
LGI ++YYK D RVG A +L+IVGG FLGWIG IL+IIGGSLYLASLK F
Sbjct: 61 LGIIALIYYKGDKRVGTASGVLLIVGGAIALIPFLGWIGGILAIIGGSLYLASLKHF 117
>gi|149021250|ref|ZP_01835496.1| integral membrane protein [Streptococcus pneumoniae SP23-BS72]
gi|147930351|gb|EDK81335.1| integral membrane protein [Streptococcus pneumoniae SP23-BS72]
Length = 111
Score = 64.3 bits (155), Expect = 2e-09, Method: Composition-based stats.
Identities = 41/70 (58%), Positives = 53/70 (75%)
Query: 37 VFDILKISILALGIAGIVYYKDDNRVGPAGSILMIVGGIFTFNDFLGWIGAILSIIGGSL 96
+ + LKI+ L LGI IVYYK D RVG A S+LMIV G + FLGW+G IL+IIGGSL
Sbjct: 40 LLNALKIANLVLGIIAIVYYKGDKRVGAAPSVLMIVSGGVSLIPFLGWVGGILAIIGGSL 99
Query: 97 YLASLKRFQA 106
+LA+LK+F++
Sbjct: 100 FLATLKKFKS 109
>gi|125717324|ref|YP_001034457.1| hypothetical protein SSA_0459 [Streptococcus sanguinis SK36]
gi|125497241|gb|ABN43907.1| Hypothetical protein SSA_0459 [Streptococcus sanguinis SK36]
Length = 111
Score = 59.3 bits (142), Expect = 7e-08, Method: Composition-based stats.
Identities = 52/111 (46%), Positives = 67/111 (60%), Gaps = 7/111 (6%)
Query: 1 MKTKTLALVNGLVGLIGGIILLLWPFFIWVIYFMLG------VFDILKISILALGIAGIV 54
MKTKTLAL++GLVGLIGG ILLL F +LG + ++K++ L LGI +
Sbjct: 1 MKTKTLALISGLVGLIGGTILLLSNIFFVPFSALLGDSGLNLLGAVIKLAALGLGITSLA 60
Query: 55 YYKDDNRVGPAGSILMIVGGI-FTFNDFLGWIGAILSIIGGSLYLASLKRF 104
YYK R A +L+IVG F+GWI IL IIGG+L+LAS +F
Sbjct: 61 YYKGQMRFSVAAGVLLIVGSSNIAPIPFIGWIAGILLIIGGALFLASSSKF 111
>gi|685120|gb|AAB31986.1| insulin activator factor; INSAF [Homo sapiens]
Length = 744
Score = 58.9 bits (141), Expect = 8e-08, Method: Composition-based stats.
Identities = 35/90 (38%), Positives = 52/90 (57%), Gaps = 8/90 (8%)
Query: 1 MKTKTLALVNGLVGLIGGIILLLWPFFI------WVIYFML--GVFDILKISILALGIAG 52
M TK LA+ NG+VGL+GGI LLL + + W F + +F +LKI+ L LGI G
Sbjct: 1 MNTKKLAVTNGVVGLVGGIALLLGGWIVLYSGNRWNRTFAIVNVLFLLLKIATLVLGIVG 60
Query: 53 IVYYKDDNRVGPAGSILMIVGGIFTFNDFL 82
+V + + + A S+L++VGG F+
Sbjct: 61 LVQFSKEAVITKAPSVLLVVGGALAIIPFM 90
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.333 0.154 0.478
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 397,526,713
Number of Sequences: 5470121
Number of extensions: 15414000
Number of successful extensions: 65560
Number of sequences better than 1.0e-05: 11
Number of HSP's better than 0.0 without gapping: 10
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 65539
Number of HSP's gapped (non-prelim): 12
length of query: 106
length of database: 1,894,087,724
effective HSP length: 74
effective length of query: 32
effective length of database: 1,489,298,770
effective search space: 47657560640
effective search space used: 47657560640
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 124 (52.4 bits)