BLASTX+BEAUTY Search Results

WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.

BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.

BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1.

Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84.



RepeatMasker repeats found in sequence:

No Repeats Found.

Reference:  Gish, Warren (1994-1997).  unpublished.
Gish, Warren and David J. States (1993).  Identification of protein coding
regions by database similarity search.  Nat. Genet. 3:266-72.

Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.

Query= 'E11A07_A07_02.ab1' (456 letters)

  Translating both strands of query sequence in all 6 reading frames

Database: nr 625,274 sequences; 197,782,623 total letters.



     Observed Numbers of Database Sequences Satisfying
    Various EXPECTation Thresholds (E parameter values)

        Histogram units:      = 3 Sequences     : less than 3 sequences

 EXPECTation Threshold
 (E parameter)
    |
    V   Observed Counts-->
  10000 627 118 |=======================================
   6310 509  89 |=============================
   3980 420 137 |=============================================
   2510 283  95 |===============================
   1580 188  43 |==============
   1000 145  43 |==============
    631 102  36 |============
    398  66  29 |=========
    251  37   7 |==
    158  30  11 |===
    100  19   6 |==
   63.1  13   4 |=
   39.8   9   3 |=
   25.1   6   0 |
   15.8   6   3 |=
 >>>>>>>>>>>>>>>>>>>>>  Expect = 10.0, Observed = 3  <<<<<<<<<<<<<<<<<
   10.0   3   0 |
   6.31   3   1 |:
   3.98   2   0 |
   2.51   2   1 |:
   1.58   1   0 |
   1.00   1   0 |
   0.63   1   0 |
   0.40   1   0 |
   0.25   1   0 |
   0.16   1   0 |
   0.10   1   0 |
  0.063   1   0 |
  0.040   1   0 |
  0.025   1   0 |
  0.016   1   0 |
  0.010   1   0 |
 0.0063   1   1 |:
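
The histogram rows above appear to follow a simple rule: the first count is a running total of database sequences at or below each E threshold, the second count is the number falling in that bin, and the bar draws one "=" per 3 sequences (":" for a nonzero count below 3). A minimal Python sketch under that assumption; the function name histogram_bar is hypothetical and is not part of WU-BLAST:

```python
def histogram_bar(count, unit=3):
    """Render one histogram bar: one '=' per `unit` sequences,
    ':' for a nonzero count smaller than `unit`, empty for zero."""
    if count <= 0:
        return ""
    if count < unit:
        return ":"
    return "=" * (count // unit)


# Per-bin counts copied from a few rows of the histogram above.
rows = [(10000, 118), (6310, 89), (251, 7), (10.0, 0), (6.31, 1)]
for threshold, count in rows:
    print(f"{threshold:>7} {count:>4} |{histogram_bar(count)}")
```

The integer division reflects that partial units are not drawn: a bin of 7 sequences shows two "=" characters, as in the 251 row.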


                                                                     Smallest
                                                                       Sum
                                                     Reading  High  Probability
Sequences producing High-scoring Segment Pairs:        Frame Score  P(N)      N
gi|320932|pir||A44971 hypothetical protein 1 - Plasmod... -3    68  0.0052    2
gi|862343|gb|AAA68426.1| (L10908) Gcap1 gene product [... -3    65  0.83      1
gi|8778738|gb|AAF79746.1|AC009317_5 (AC009317) T30E16.... -3    61  0.994     1



>gi|320932|pir||A44971  hypothetical protein 1 - Plasmodium brasilianum
            Length = 96

Frame -2 hits (HSPs):     ______                                          
Frame -3 hits (HSPs):                _________________________            
                        __________________________________________________
Database sequence:     |         |          |         |          |        | 96
                       0        20         40        60         80

  Minus Strand HSPs:

 Score = 68 (23.9 bits), Expect = 0.0053, Sum P(2) = 0.0052
 Identities = 14/33 (42%), Positives = 17/33 (51%), Frame = -3

Query:   235 HTYISYIYIHLFHLIHVYISYIYNHTEHIK*KH 137
             H+Y SY   HL+H  H Y SY   H+ H    H
Sbjct:    32 HSYHSYHSYHLYHSYHSYHSYHSYHSYHSHHSH 64

 Score = 67 (23.6 bits), Expect = 0.019, Sum P(2) = 0.019
 Identities = 13/28 (46%), Positives = 16/28 (57%), Frame = -3

Query:   235 HTYISYIYIHLFHLIHVYISYIYNHTEH 152
             H+Y SY   H +HL H Y SY   H+ H
Sbjct:    29 HSYHSYHSYHSYHLYHSYHSYHSYHSYH 56

 Score = 65 (22.9 bits), Expect = 0.061, Sum P(2) = 0.059
 Identities = 13/33 (39%), Positives = 16/33 (48%), Frame = -3

Query:   235 HTYISYIYIHLFHLIHVYISYIYNHTEHIK*KH 137
             H+Y SY   H +H  H Y SY   H+ H    H
Sbjct:    35 HSYHSYHLYHSYHSYHSYHSYHSYHSHHSHHSH 67

 Score = 62 (21.8 bits), Expect = 0.33, Sum P(2) = 0.28
 Identities = 12/28 (42%), Positives = 15/28 (53%), Frame = -3

Query:   235 HTYISYIYIHLFHLIHVYISYIYNHTEH 152
             H Y SY   H +H  H+Y SY   H+ H
Sbjct:    26 HPYHSYHSYHSYHSYHLYHSYHSYHSYH 53

 Score = 61 (21.5 bits), Expect = 0.60, Sum P(2) = 0.45
 Identities = 12/33 (36%), Positives = 16/33 (48%), Frame = -3

Query:   235 HTYISYIYIHLFHLIHVYISYIYNHTEHIK*KH 137
             H+Y  Y   H +H  H Y SY  +H+ H    H
Sbjct:    38 HSYHLYHSYHSYHSYHSYHSYHSHHSHHSHHSH 70

 Score = 60 (21.1 bits), Expect = 0.89, Sum P(2) = 0.59
 Identities = 12/33 (36%), Positives = 16/33 (48%), Frame = -3

Query:   235 HTYISYIYIHLFHLIHVYISYIYNHTEHIK*KH 137
             H Y SY   H +H  H Y S+  +H+ H    H
Sbjct:    41 HLYHSYHSYHSYHSYHSYHSHHSHHSHHSHHSH 73

 Score = 42 (14.8 bits), Expect = 0.0053, Sum P(2) = 0.0052
 Identities = 9/12 (75%), Positives = 9/12 (75%), Frame = -2

Query:   371 LTHFSQIFKFIF 336
             L HFS IF FIF
Sbjct:     5 LAHFSCIFIFIF 16


>gi|862343|gb|AAA68426.1|  (L10908) Gcap1 gene product [Mus musculus]
            >gi|1092097|prf||2022314A granule cell marker protein [Mus
            musculus] >gi|1092098|prf||2022315A granule cell marker protein
            [Mus musculus]
            Length = 85

Frame  3 hits (HSPs):    _____________________________________            
Frame -3 hits (HSPs):                               _______________       
                        __________________________________________________
Database sequence:     |           |          |           |           |   | 85
                       0          20         40          60          80

  Plus Strand HSPs:

 Score = 62 (21.8 bits), Expect = 3.9, P = 0.98
 Identities = 16/63 (25%), Positives = 34/63 (53%), Frame = +3

Query:   180 MYTCIKWKRCMYIY---EMYVCICFHL-IFMXYVLTL*FPLFLQSH*YG-VYVF*FVKYK 344
             +Y C+    C+Y+Y    +Y+CI  +L I++   + L    ++ +H +   Y++ ++ Y 
Sbjct:     3 VYMCLCVCLCVYVYACVSLYICISIYLSIYLSISIYLSIYTYIHTHTHTHTYIYIYI-YI 61

Query:   345 LKYL 356
               YL
Sbjct:    62 YIYL 65

  Minus Strand HSPs:

 Score = 65 (22.9 bits), Expect = 1.8, P = 0.83
 Identities = 12/24 (50%), Positives = 17/24 (70%), Frame = -3

Query:   241 HMHTYIS---YIYIHLFHLIHVYI 179
             H HTYI    YIYI+L+  +H+Y+
Sbjct:    50 HTHTYIYIYIYIYIYLYLCMHIYV 73


>gi|8778738|gb|AAF79746.1|AC009317_5  (AC009317) T30E16.7 [Arabidopsis thaliana]
            Length = 35

Frame -3 hits (HSPs):                 __________________________________  
                        __________________________________________________
Database sequence:     |                           |                      | 35
                       0                          20

  Minus Strand HSPs:

 Score = 61 (21.5 bits), Expect = 5.2, P = 0.99
 Identities = 14/25 (56%), Positives = 19/25 (76%), Frame = -3

Query:   223 SYIYIHLFHL-IHVYISYIYNHTEHI 149
             SY Y+++  L I++YI YIYNH  HI
Sbjct:    11 SYAYVYICKLYIYIYI-YIYNHI-HI 34
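
The Expect and P values reported for the single-HSP alignments above are consistent with the usual Poisson relationship P = 1 - exp(-E), the probability of at least one chance hit when E hits are expected. A short Python sketch of that conversion, offered as a reading aid rather than as the program's actual code path; the function name p_from_expect is hypothetical:

```python
import math


def p_from_expect(expect):
    """Probability of at least one chance hit, assuming the number of
    chance hits is Poisson-distributed with mean `expect`."""
    # -expm1(-x) equals 1 - exp(-x) but stays accurate for small x.
    return -math.expm1(-expect)


# Expect values taken from the Gcap1 and T30E16.7 alignments above.
for expect in (3.9, 1.8, 5.2):
    print(f"Expect = {expect}: P = {p_from_expect(expect):.2f}")
```

For small E the two quantities nearly coincide, which is why the strongest hit shows Expect = 0.0053 next to Sum P(2) = 0.0052.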


Parameters:
  filter=none
  matrix=BLOSUM62
  V=50
  B=50
  E=10
  gi
  H=1
  sort_by_pvalue
  echofilter

  ctxfactor=5.97

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   Std.    0   BLOSUM62                                 0.318   0.135   0.401  
   +3      0   BLOSUM62        0.318   0.135   0.401    0.371   0.165   0.633  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +2      0   BLOSUM62        0.318   0.135   0.401    0.382   0.175   0.649  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +1      0   BLOSUM62        0.318   0.135   0.401    0.374   0.175   0.630  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -1      0   BLOSUM62        0.318   0.135   0.401    0.371   0.173   0.598  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -2      0   BLOSUM62        0.318   0.135   0.401    0.375   0.170   0.587  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -3      0   BLOSUM62        0.318   0.135   0.401    0.365   0.168   0.600  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
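
The bit scores printed with each HSP in this report match Lambda * S / ln 2 when Lambda is the "As Used" value 0.244 listed on the Q=9,R=2 rows above. The Python sketch below reproduces that arithmetic. It is an observation about the numbers in this report, not a statement of how WU-BLAST computes them internally; the names GAPPED_LAMBDA and bits_from_score are hypothetical:

```python
import math

# "As Used" Lambda from the Q=9,R=2 rows of the table above (assumed to be
# the value behind the reported bit scores).
GAPPED_LAMBDA = 0.244


def bits_from_score(score, lam=GAPPED_LAMBDA):
    """Convert a raw alignment score to bits as Lambda * S / ln 2."""
    return lam * score / math.log(2)


# Raw scores taken from HSPs in this report.
for score in (68, 65, 42):
    print(f"Score = {score} ({bits_from_score(score):.1f} bits)")
```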

  Query
  Frame  MatID  Length  Eff.Length     E    S W   T  X   E2     S2
   +3      0      151       150       10.  74 3  12 22  0.092   34
                                                    30  0.12    36
   +2      0      151       150       10.  74 3  12 22  0.092   34
                                                    30  0.12    36
   +1      0      152       151       10.  74 3  12 22  0.093   34
                                                    30  0.12    36
   -1      0      152       151       10.  74 3  12 22  0.093   34
                                                    30  0.12    36
   -2      0      151       150       10.  74 3  12 22  0.092   34
                                                    30  0.12    36
   -3      0      151       150       10.  74 3  12 22  0.092   34
                                                    30  0.12    36
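
The Length column above can be checked against the 456-letter query: each reading frame skips 0, 1, or 2 leading bases and then holds as many whole codons as fit. A minimal Python sketch of that count, assuming this is how the frame lengths arise; the function name frame_lengths is hypothetical, and the sketch does not attempt to reproduce the Eff.Length column:

```python
def frame_lengths(query_length):
    """Number of whole codons in each of the six reading frames
    of a nucleotide query of the given length."""
    lengths = {}
    for offset in range(3):
        codons = (query_length - offset) // 3
        # The same offset applies on the plus and the minus strand.
        lengths[f"+{offset + 1}"] = codons
        lengths[f"-{offset + 1}"] = codons
    return lengths


print(frame_lengths(456))
```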


Statistics:

  Database:  /usr/local/dot5/sl_home/beauty/seqdb/blast/nr
    Title:  nr
    Release date:  unknown
    Posted date:  4:06 PM CST Feb 28, 2001
    Format:  BLAST
  # of letters in database:  197,782,623
  # of sequences in database:  625,274
  # of database sequences satisfying E:  3
  No. of states in DFA:  586 (58 KB)
  Total size of DFA:  173 KB (192 KB)
  Time to generate neighborhood:  0.01u 0.00s 0.01t  Elapsed: 00:00:00
  No. of threads or processors used:  6
  Search cpu time:  124.59u 1.11s 125.70t  Elapsed: 00:00:31
  Total cpu time:  124.61u 1.13s 125.74t  Elapsed: 00:00:31
  Start:  Wed Jan 23 15:28:20 2002   End:  Wed Jan 23 15:28:51 2002

Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000