BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= SSA_0947
(197 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|125717786|ref|YP_001034919.1| hypothetical protein SSA_0... 359 5e-98
gi|125717787|ref|YP_001034920.1| hypothetical protein SSA_0... 280 4e-74
gi|157075074|gb|ABV09757.1| hypothetical protein SGO_1063 [... 202 8e-51
gi|125717788|ref|YP_001034921.1| hypothetical protein SSA_0... 202 1e-50
gi|157075688|gb|ABV10371.1| hypothetical protein SGO_1062 [... 159 6e-38
gi|157076257|gb|ABV10940.1| hypothetical protein SGO_1065 [... 155 2e-36
gi|157076021|gb|ABV10704.1| hypothetical protein SGO_1061 [... 147 4e-34
gi|157075310|gb|ABV09993.1| hypothetical protein SGO_1064 [... 133 5e-30
gi|125718556|ref|YP_001035689.1| hypothetical protein SSA_1... 125 2e-27
gi|125718557|ref|YP_001035690.1| hypothetical protein SSA_1... 124 4e-27
>gi|125717786|ref|YP_001034919.1| hypothetical protein SSA_0947 [Streptococcus sanguinis SK36]
gi|125497703|gb|ABN44369.1| Hypothetical protein SSA_0947 [Streptococcus sanguinis SK36]
Length = 197
Score = 359 bits (921), Expect = 5e-98, Method: Composition-based stats.
Identities = 197/197 (100%), Positives = 197/197 (100%)
Query: 1 MKKKLFTTLALILVLVSGGIYIYNKLTKPNFSPKTTKLYQRGFRFLEEQIGTYIKEKYSG 60
MKKKLFTTLALILVLVSGGIYIYNKLTKPNFSPKTTKLYQRGFRFLEEQIGTYIKEKYSG
Sbjct: 1 MKKKLFTTLALILVLVSGGIYIYNKLTKPNFSPKTTKLYQRGFRFLEEQIGTYIKEKYSG 60
Query: 61 IEKIEFSPIYVTGDEDSSMLNAYVRPTIYDKYGNKATLGTQIKKYVPNSFGIEADLVLDF 120
IEKIEFSPIYVTGDEDSSMLNAYVRPTIYDKYGNKATLGTQIKKYVPNSFGIEADLVLDF
Sbjct: 61 IEKIEFSPIYVTGDEDSSMLNAYVRPTIYDKYGNKATLGTQIKKYVPNSFGIEADLVLDF 120
Query: 121 DWSGNEVIELLDSEDNSIDVSNAKELPEEAKLTDAKSIDINIQMLVEDGQLKDVVKDEKG 180
DWSGNEVIELLDSEDNSIDVSNAKELPEEAKLTDAKSIDINIQMLVEDGQLKDVVKDEKG
Sbjct: 121 DWSGNEVIELLDSEDNSIDVSNAKELPEEAKLTDAKSIDINIQMLVEDGQLKDVVKDEKG 180
Query: 181 SPEAQIIYNVKLSKEEG 197
SPEAQIIYNVKLSKEEG
Sbjct: 181 SPEAQIIYNVKLSKEEG 197
>gi|125717787|ref|YP_001034920.1| hypothetical protein SSA_0948 [Streptococcus sanguinis SK36]
gi|125497704|gb|ABN44370.1| Hypothetical protein SSA_0948 [Streptococcus sanguinis SK36]
Length = 197
Score = 280 bits (715), Expect = 4e-74, Method: Composition-based stats.
Identities = 143/196 (72%), Positives = 164/196 (83%)
Query: 1 MKKKLFTTLALILVLVSGGIYIYNKLTKPNFSPKTTKLYQRGFRFLEEQIGTYIKEKYSG 60
MKKK+ L ++ +++SGGIYIYNKLTKPN PKTTKLYQRGFR LEEQ GTY KE Y G
Sbjct: 1 MKKKIIIVLTILFIVISGGIYIYNKLTKPNLGPKTTKLYQRGFRLLEEQYGTYFKEHYKG 60
Query: 61 IEKIEFSPIYVTGDEDSSMLNAYVRPTIYDKYGNKATLGTQIKKYVPNSFGIEADLVLDF 120
IEKI+FSPIY+ GD SMLNAYVRPTIYDKYGNKATLGT I+ Y PNS+G+ + LDF
Sbjct: 61 IEKIKFSPIYIEGDNGGSMLNAYVRPTIYDKYGNKATLGTTIENYTPNSYGLVTHIFLDF 120
Query: 121 DWSGNEVIELLDSEDNSIDVSNAKELPEEAKLTDAKSIDINIQMLVEDGQLKDVVKDEKG 180
D +GN+VIEL+DS N IDVSNAK LP+EAKLT AK D+NI++LVEDGQLKDVVKDEKG
Sbjct: 121 DGAGNDVIELMDSHGNDIDVSNAKHLPDEAKLTRAKGTDLNIELLVEDGQLKDVVKDEKG 180
Query: 181 SPEAQIIYNVKLSKEE 196
+PEA+IIYNVKLSK E
Sbjct: 181 TPEAEIIYNVKLSKGE 196
>gi|157075074|gb|ABV09757.1| hypothetical protein SGO_1063 [Streptococcus gordonii str. Challis
substr. CH1]
Length = 202
Score = 202 bits (515), Expect = 8e-51, Method: Composition-based stats.
Identities = 111/194 (57%), Positives = 135/194 (69%)
Query: 1 MKKKLFTTLALILVLVSGGIYIYNKLTKPNFSPKTTKLYQRGFRFLEEQIGTYIKEKYSG 60
MKKK T LALILVLVSGGI++YNKLTKPNF PKTT+LYQ GF+ LEEQI TYIKE Y G
Sbjct: 1 MKKKFLTILALILVLVSGGIFVYNKLTKPNFGPKTTRLYQHGFQLLEEQIATYIKEHYKG 60
Query: 61 IEKIEFSPIYVTGDEDSSMLNAYVRPTIYDKYGNKATLGTQIKKYVPNSFGIEADLVLDF 120
I+KIEFSPIYVTGD+ SSMLNA + P IYD +GNKA G K + ++G L L F
Sbjct: 61 IKKIEFSPIYVTGDDGSSMLNAEIVPIIYDYHGNKAKFGGLYKNFQHPAYGTIGYLRLSF 120
Query: 121 DWSGNEVIELLDSEDNSIDVSNAKELPEEAKLTDAKSIDINIQMLVEDGQLKDVVKDEKG 180
D+SG IEL DV+ + LPEE K K ID N + L+++ +LK V K + G
Sbjct: 121 DYSGKPYIELSTDTGEFKDVTYGQSLPEEIKGKKIKDIDFNFETLIKEERLKGVEKSDVG 180
Query: 181 SPEAQIIYNVKLSK 194
SP A++IYN++L K
Sbjct: 181 SPNAEVIYNLELKK 194
>gi|125717788|ref|YP_001034921.1| hypothetical protein SSA_0949 [Streptococcus sanguinis SK36]
gi|125497705|gb|ABN44371.1| Hypothetical protein SSA_0949 [Streptococcus sanguinis SK36]
Length = 202
Score = 202 bits (513), Expect = 1e-50, Method: Composition-based stats.
Identities = 106/194 (54%), Positives = 137/194 (70%)
Query: 1 MKKKLFTTLALILVLVSGGIYIYNKLTKPNFSPKTTKLYQRGFRFLEEQIGTYIKEKYSG 60
MKKK+ L ++ ++ SGGIY+YNKLTKPNF KTTKLYQ GFR LEEQIGTYIKE YSG
Sbjct: 1 MKKKIIIGLTILFIVFSGGIYMYNKLTKPNFGSKTTKLYQHGFRLLEEQIGTYIKENYSG 60
Query: 61 IEKIEFSPIYVTGDEDSSMLNAYVRPTIYDKYGNKATLGTQIKKYVPNSFGIEADLVLDF 120
IEKIEFSPIY+TGD+ SSMLNA V P +YD YGNKA G K + ++G L + F
Sbjct: 61 IEKIEFSPIYITGDDGSSMLNAEVVPIVYDSYGNKAKFGGLYKNFQQPAYGTIGYLRVSF 120
Query: 121 DWSGNEVIELLDSEDNSIDVSNAKELPEEAKLTDAKSIDINIQMLVEDGQLKDVVKDEKG 180
D+SG IEL +V+ + LP+E KL + K +D N + L+ +G+LK + K +KG
Sbjct: 121 DYSGKSYIELSTDSGEFKEVTYGQSLPKEIKLREMKDVDFNFETLIREGKLKGIEKSDKG 180
Query: 181 SPEAQIIYNVKLSK 194
SP+A+I+YN++L K
Sbjct: 181 SPDAEIVYNLQLKK 194
>gi|157075688|gb|ABV10371.1| hypothetical protein SGO_1062 [Streptococcus gordonii str. Challis
substr. CH1]
Length = 206
Score = 159 bits (403), Expect = 6e-38, Method: Composition-based stats.
Identities = 97/198 (48%), Positives = 124/198 (62%), Gaps = 4/198 (2%)
Query: 1 MKKKLFTTL-ALILVLVSGGIYIYNKLTKPNFSPKTTKLYQRGFRFLEEQIGTYIKEKYS 59
MKKK+ ALI++ +G +Y+ + K K+ + GFR LEEQIGTYIKE YS
Sbjct: 1 MKKKILLIFSALIILFSTGFVYMTQFHSSTGLDRKEEKIIKHGFRLLEEQIGTYIKENYS 60
Query: 60 GIEKIEFSPIYVTGDEDSSMLNAYVRPTIYDKYGNKATLGTQIKKYVPNSFGIEADLVLD 119
GI KIEFSPI++ G +D+ A V P IYDK GN+A +G +I K ++G DL LD
Sbjct: 61 GISKIEFSPIFIQGGKDNPPFTADVLPVIYDKEGNRAVMGKRIGKKGYPTYGTTGDLTLD 120
Query: 120 FDWSGNEVIELLDSE--DNSIDVSNAKELPEEAKLTDA-KSIDINIQMLVEDGQLKDVVK 176
FD GNE I L S D IDVS+A LPEEAKLT K D NI LV DGQLK+V +
Sbjct: 121 FDQMGNENIFLKGSSILDEKIDVSSANHLPEEAKLTPPIKGTDDNIDALVNDGQLKNVNR 180
Query: 177 DEKGSPEAQIIYNVKLSK 194
E GSP A++ YN+++ +
Sbjct: 181 SENGSPNAKMDYNLEIKR 198
>gi|157076257|gb|ABV10940.1| hypothetical protein SGO_1065 [Streptococcus gordonii str. Challis
substr. CH1]
Length = 187
Score = 155 bits (391), Expect = 2e-36, Method: Composition-based stats.
Identities = 84/161 (52%), Positives = 111/161 (68%), Gaps = 1/161 (0%)
Query: 34 KTTKLYQRGFRFLEEQIGTYIKEKYSGIEKIEFSPIYVTGDEDSSMLNAYVRPTIYDKYG 93
K KLY+RGFR LEEQ+ TYIKE YSG+ KIEFSPI+V G + +M +A + P IYD +G
Sbjct: 21 KGVKLYKRGFRLLEEQLATYIKEHYSGVSKIEFSPIFVQGGDGQTMFDANIVPVIYDNHG 80
Query: 94 NKATLGTQIKKYVPNSFGIEADLVLDFDWSGNEVIELLDSEDNSIDVSNAKELPEEAKLT 153
NKA LG ++ K+ S+G+ DL LDF+ EVIE +D + +D++N K LP +AKLT
Sbjct: 81 NKAYLGRKVGKHGYASYGLLGDLRLDFNGFDEEVIE-IDVDGKFLDITNYKSLPPKAKLT 139
Query: 154 DAKSIDINIQMLVEDGQLKDVVKDEKGSPEAQIIYNVKLSK 194
S+D NI LV G LKDVVK EKGS +A++ YN ++ K
Sbjct: 140 INPSMDENIVALVNAGHLKDVVKSEKGSLKAEVAYNTEIRK 180
>gi|157076021|gb|ABV10704.1| hypothetical protein SGO_1061 [Streptococcus gordonii str. Challis
substr. CH1]
Length = 206
Score = 147 bits (370), Expect = 4e-34, Method: Composition-based stats.
Identities = 86/181 (47%), Positives = 119/181 (65%), Gaps = 4/181 (2%)
Query: 20 IYIYNKLTKPNFSPKTTKLYQRGFRFLEEQIGTYIKEKYSGIEKIEFSPIYVTGDEDSSM 79
+Y+ + K K+ + GFR LEEQ+GTYIKE YSGI KIEFSPI++ G +D+
Sbjct: 21 VYMTQFHSSTGLDRKEEKIIKHGFRLLEEQLGTYIKENYSGISKIEFSPIFIQGGKDNPP 80
Query: 80 LNAYVRPTIYDKYGNKATLGTQIKKYVPNSFGIEADLVLDFDWSGNEVIELLDSEDN--- 136
A V P IYD+ GN+A LG QI K S+G+ DL LDFD + NE+IEL DS
Sbjct: 81 FTADVLPVIYDEEGNRAVLGGQIGKTGYPSYGLLTDLRLDFDHNDNEIIELEDSAKGLGI 140
Query: 137 SIDVSNAKELPEEAKL-TDAKSIDINIQMLVEDGQLKDVVKDEKGSPEAQIIYNVKLSKE 195
+DVSNAK LPE+AK+ + D NI LV+ +LK+V K+++GSP+A++IYN+++ +
Sbjct: 141 EVDVSNAKTLPEKAKIKAELDGTDENIAALVDTKKLKNVRKNKEGSPKAELIYNLEIKRG 200
Query: 196 E 196
E
Sbjct: 201 E 201
>gi|157075310|gb|ABV09993.1| hypothetical protein SGO_1064 [Streptococcus gordonii str. Challis
substr. CH1]
Length = 189
Score = 133 bits (335), Expect = 5e-30, Method: Composition-based stats.
Identities = 74/161 (45%), Positives = 105/161 (65%)
Query: 34 KTTKLYQRGFRFLEEQIGTYIKEKYSGIEKIEFSPIYVTGDEDSSMLNAYVRPTIYDKYG 93
K KLY+RGFR LEEQ+ TYIKE YSG+ KIEFSPI+V G + +M +A + ++YDK
Sbjct: 21 KGVKLYKRGFRLLEEQLATYIKEHYSGVSKIEFSPIFVQGGDGQTMFSANIVFSVYDKNQ 80
Query: 94 NKATLGTQIKKYVPNSFGIEADLVLDFDWSGNEVIELLDSEDNSIDVSNAKELPEEAKLT 153
NKA G I ++ D+ +DF+ E+IEL S++ IDVS+ + LP EAKLT
Sbjct: 81 NKAYFGGTIGDITYPTYNNVWDVRMDFNGWDEEIIELRTSDEKLIDVSDFQHLPPEAKLT 140
Query: 154 DAKSIDINIQMLVEDGQLKDVVKDEKGSPEAQIIYNVKLSK 194
+ +D NI+ LV+D Q++ V KD KGS + ++YN ++ K
Sbjct: 141 VSNKVDENIEALVQDKQIEGVRKDSKGSADLDVVYNDEIKK 181
>gi|125718556|ref|YP_001035689.1| hypothetical protein SSA_1757 [Streptococcus sanguinis SK36]
gi|125498473|gb|ABN45139.1| Hypothetical protein SSA_1757 [Streptococcus sanguinis SK36]
Length = 202
Score = 125 bits (314), Expect = 2e-27, Method: Composition-based stats.
Identities = 74/198 (37%), Positives = 115/198 (58%), Gaps = 11/198 (5%)
Query: 3 KKLFTTLALILVLVSGGIYIYNKLTKPNFSPKTTKLYQRGFRFLEEQIGTYIKEKYSGIE 62
K + +L VLV GG + +++ K LY+ GF+ EEQI TY+KE YSG+
Sbjct: 6 KSILLSLLCFFVLVIGGKFYMDRM-------KVDNLYRHGFQLYEEQIATYLKEHYSGVS 58
Query: 63 KIEFSPIYVTGDEDSSMLNAYVRPTIYDKYGNKATLGTQ--IKKYVPNSFGIEADLVLDF 120
KIEFSPI+++G S +NA + P +YD YGNK L + VP+ +G A L L F
Sbjct: 59 KIEFSPIFISGGGGESFVNARIVPVVYDSYGNKVYLRNDGVLDMAVPD-YGTLAGLDLSF 117
Query: 121 DWS-GNEVIELLDSEDNSIDVSNAKELPEEAKLTDAKSIDINIQMLVEDGQLKDVVKDEK 179
+ + G+E++ L ++E S+ + LPE+ KL + D + +G LK V K+ +
Sbjct: 118 NVNDGSEIVYLRNNERESVSSEVYQHLPEQLKLQKEEFTDEVMTAFSREGHLKGVEKNSQ 177
Query: 180 GSPEAQIIYNVKLSKEEG 197
GSP+A+IIYN+++ + +G
Sbjct: 178 GSPQAEIIYNLEIRRVQG 195
>gi|125718557|ref|YP_001035690.1| hypothetical protein SSA_1758 [Streptococcus sanguinis SK36]
gi|125498474|gb|ABN45140.1| Hypothetical protein SSA_1758 [Streptococcus sanguinis SK36]
Length = 202
Score = 124 bits (310), Expect = 4e-27, Method: Composition-based stats.
Identities = 76/198 (38%), Positives = 113/198 (57%), Gaps = 17/198 (8%)
Query: 3 KKLFTTLALILVLVSGGIYIYNKLTKPNFSPKTTKLYQRGFRFLEEQIGTYIKEKYSGIE 62
K + +L + VLV GG + +++ K LY+ GF+ EEQI TY+KE YSGI
Sbjct: 6 KSILLSLFCVFVLVIGGKFYIDRM-------KVDNLYRHGFQLYEEQIATYLKEHYSGIS 58
Query: 63 KIEFSPIYVTGDEDSSMLNAYVRPTIYDKYGNK------ATLGTQIKKYVPNSFGIEADL 116
KIEFSPI+ +G +A + P +YD +GNK T+ T + YV S GIE D
Sbjct: 59 KIEFSPIFKSGGRGEGFAHARIIPVVYDTHGNKVYLRNDGTIDTLVPNYVLTS-GIELDF 117
Query: 117 VLDFDWSGNEVIELLDSEDNSIDVSNAKELPEEAKLTDAKSIDINIQMLVEDGQLKDVVK 176
++ G+E+I L + ++ SI+V + LPE KL KS D I + G LK V K
Sbjct: 118 NVN---DGSEIIYLYNEKNESIEVGQYQHLPEHLKLKTYKSTDAIITAYSQKGTLKGVEK 174
Query: 177 DEKGSPEAQIIYNVKLSK 194
+ +GSP+A+I+YN+++ +
Sbjct: 175 NSQGSPKAEIVYNLEIRR 192
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.313 0.135 0.369
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 719,781,007
Number of Sequences: 5470121
Number of extensions: 30850760
Number of successful extensions: 68888
Number of sequences better than 1.0e-05: 11
Number of HSP's better than 0.0 without gapping: 10
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 68869
Number of HSP's gapped (non-prelim): 11
length of query: 197
length of database: 1,894,087,724
effective HSP length: 126
effective length of query: 71
effective length of database: 1,204,852,478
effective search space: 85544525938
effective search space used: 85544525938
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 126 (53.1 bits)