Please help us to improve our services and obtain funding for the
BCM Search Launcher
-- take a minute to complete our User Survey


BLASTX+BEAUTY Search Results

WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.

BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.

BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract

Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract



RepeatMasker repeats found in sequence:

No Repeats Found.

Reference:  Gish, Warren (1994-1997).  unpublished.
Gish, Warren and David J. States (1993).  Identification of protein coding
regions by database similarity search.  Nat. Genet. 3:266-72.

Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.

Query= A05E08_CONSENSUS (281 letters)

  Translating both strands of query sequence in all 6 reading frames

Database: nr 625,274 sequences; 197,782,623 total letters.



     Observed Numbers of Database Sequences Satisfying
    Various EXPECTation Thresholds (E parameter values)

        Histogram units:      = 6 Sequences     : less than 6 sequences

 EXPECTation Threshold
 (E parameter)
    |
    V   Observed Counts-->
  10000 1298 311 |===================================================
   6310  987 227 |=====================================
   3980  760 246 |=========================================
   2510  514 117 |===================
   1580  397 139 |=======================
   1000  258  67 |===========
    631  191  71 |===========
    398  120  45 |=======
    251   75  28 |====
    158   47   9 |=
    100   38  13 |==
   63.1   25   9 |=
   39.8   16   7 |=
   25.1    9   4 |:
   15.8    5   1 |:
 >>>>>>>>>>>>>>>>>>>>>  Expect = 10.0, Observed = 4  <<<<<<<<<<<<<<<<<
   10.0    4   2 |:
   6.31    2   0 |
   3.98    2   0 |
   2.51    2   1 |:
   1.58    1   0 |
   1.00    1   0 |
   0.63    1   1 |:


                                                                     Smallest
                                                                       Sum
                                                     Reading  High  Probability
Sequences producing High-scoring Segment Pairs:        Frame Score  P(N)      N
gi|7021729|gb|AAF35410.1|(AC024081) hypothetical prot... -1    69  0.41      1
gi|414449|gb|AAC43196.1|(U01721) unknown [Mycoplasma ... -1    64  0.89      1
gi|10639654|emb|CAC11626.1|(AL445064) cellulose synth... +3    67  0.9998    1
gi|102037|pir||B28118ubiquinol--cytochrome-c reductas... +1    34  0.99990   2



Locally-aligned regions (HSPs) with respect to query sequence:

Locus_ID                Frame 3 Hits
gi|10639654            |         ___________                              
                        __________________________________________________
Query sequence:        |          |         |          |          |       | 94
                       0         20        40         60         80


Locus_ID                Frame 2 Hits
gi|102037              |                                     _________    
                        __________________________________________________
Query sequence:        |          |         |          |          |       | 94
                       0         20        40         60         80


Locus_ID                Frame 1 Hits
gi|102037              |            ________                              
                        __________________________________________________
Query sequence:        |          |         |          |          |       | 94
                       0         20        40         60         80


Locus_ID                Frame -1 Hits
gi|7021729             |                                                  
gi|414449              |             ________________________             
Prosite Hits:                          _____                              
                        __________________________________________________
Query sequence:        |          |         |          |          |       | 94
                       0         20        40         60         80
__________________
Prosite hits:
   TYR_PHOSPHO_SITE     Tyrosine kinase phosphorylation site.    31..37
__________________

Use the and icons to retrieve links to Entrez:

E = Retrieve Entrez links (e.g., Medline abstracts, FASTA-formatted sequence reports).
R = Retrieve links to Related sequences (neighbors).
Use the icon (if present) to retrieve links to the Sequence Retrieval System (SRS).
Use the icon (if present) to retrieve links to the Ligand Enzyme and Chemical Compound Database .
Use the icon (if present) to retrieve links to the Protein Data Bank database.


to_Entrezto_Relatedto_Related >gi|7021729|gb|AAF35410.1|  (AC024081) hypothetical protein [Arabidopsis
            thaliana]
            Length = 143

Frame -1 hits (HSPs):                          ________________________   
                        __________________________________________________
Database sequence:     |                 |                |               | 143
                       0                50              100

  Minus Strand HSPs:

 Score = 69 (24.3 bits), Expect = 0.54, P = 0.41
 Identities = 21/68 (30%), Positives = 34/68 (50%), Frame = -1

Query:   254 RVFRDKSVGLLENHLKTVHHFKFRDLQYPLAGQIYMTFLEQPHFQALIQYRVKKREKKKE 75
             R  RD+ +G  E  +K   +    D+  P    I  TF E+   Q+++   ++K EK K 
Sbjct:    68 RYGRDEFIGQDEEQIKATIYEILNDVGVPPYLGIMKTFAEE--IQSILNSPLEKLEKIKV 125

Query:    74 KIKGIDHM 51
             KI+ + HM
Sbjct:   126 KIEVVAHM 133


to_Entrezto_Relatedto_Related >gi|414449|gb|AAC43196.1|  (U01721) unknown [Mycoplasma genitalium]
            Length = 99

Frame -1 hits (HSPs):                          _______________________    
                        __________________________________________________
Database sequence:     |         |         |         |         |          | 99
                       0        20        40        60        80

  Minus Strand HSPs:

 Score = 64 (22.5 bits), Expect = 2.2, P = 0.89
 Identities = 17/45 (37%), Positives = 22/45 (48%), Frame = -1

Query:   209 KTVHHFKFRDLQYP-LAGQIYMTFLEQPHFQALIQYRVKKREKKK 78
             + V HFK      P L G+ Y  FLE P+F  L   + K R+  K
Sbjct:    47 EVVKHFKISKQDEPSLIGRFYQDFLEDPNFVYLGDRKWKLRDFMK 91


to_Entrezto_Relatedto_Related >gi|10639654|emb|CAC11626.1|  (AL445064) cellulose synthase, subunit A related
            protein [Thermoplasma acidophilum]
            Length = 561

Frame  3 hits (HSPs):                                       __            
                        __________________________________________________
Database sequence:     |             |            |             |         | 561
                       0           150          300           450

  Plus Strand HSPs:

 Score = 67 (23.6 bits), Expect = 8.4, P = 1.0
 Identities = 13/21 (61%), Positives = 17/21 (80%), Frame = +3

Query:    54 MINPLNFFFFLFP-FLYSVLY 113
             MINP+ F FF FP F+YS+L+
Sbjct:   406 MINPVFFVFFYFPYFIYSMLF 426


to_Entrezto_Relatedto_Related >gi|102037|pir||B28118  ubiquinol--cytochrome-c reductase (EC 1.10.2.2)
            cytochrome b - Crithidia fasciculata mitochondrion (fragment)
            Length = 48

Frame  2 hits (HSPs):                                    _______________  
Frame  1 hits (HSPs):         ______________                              
                        __________________________________________________
Database sequence:     |                   |                    |         | 48
                       0                  20                   40

  Plus Strand HSPs:

 Score = 34 (12.0 bits), Expect = 9.3, Sum P(2) = 1.0
 Identities = 7/14 (50%), Positives = 8/14 (57%), Frame = +1

Query:    73 FSFFFSLFFTLYCI 114
             F  FF LF  L C+
Sbjct:     7 FLLFFLLFRNLCCL 20

 Score = 33 (11.6 bits), Expect = 9.3, Sum P(2) = 1.0
 Identities = 5/14 (35%), Positives = 8/14 (57%), Frame = +2

Query:   215 GFLINLHFCL*ILC 256
             GF +    C+ I+C
Sbjct:    33 GFSLGFFICIQIIC 46


Parameters:
  filter=none
  matrix=BLOSUM62
  V=50
  B=50
  E=10
  gi
  H=1
  sort_by_pvalue
  echofilter

  ctxfactor=5.97

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   Std.    0   BLOSUM62                                 0.318   0.135   0.401  
   +3      0   BLOSUM62        0.318   0.135   0.401    0.362   0.164   0.611  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +2      0   BLOSUM62        0.318   0.135   0.401    0.384   0.181   0.689  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +1      0   BLOSUM62        0.318   0.135   0.401    0.350   0.155   0.604  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -1      0   BLOSUM62        0.318   0.135   0.401    0.344   0.150   0.523  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -2      0   BLOSUM62        0.318   0.135   0.401    0.369   0.168   0.619  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -3      0   BLOSUM62        0.318   0.135   0.401    0.338   0.146   0.462  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E    S W   T  X   E2     S2
   +3      0       93        92       10.  65 3  12 22  0.095   32
                                                    28  0.12    33
   +2      0       93        93       10.  65 3  12 22  0.096   32
                                                    28  0.12    33
   +1      0       93        92       10.  65 3  12 22  0.095   32
                                                    28  0.12    33
   -1      0       93        92       10.  65 3  12 22  0.095   32
                                                    28  0.12    33
   -2      0       93        93       10.  65 3  12 22  0.096   32
                                                    28  0.12    33
   -3      0       93        93       10.  65 3  12 22  0.096   32
                                                    28  0.12    33


Statistics:

  Database:  /usr/local/dot5/sl_home/beauty/seqdb/blast/nr
    Title:  nr
    Release date:  unknown
    Posted date:  4:06 PM CST Feb 28, 2001
    Format:  BLAST
  # of letters in database:  197,782,623
  # of sequences in database:  625,274
  # of database sequences satisfying E:  4
  No. of states in DFA:  584 (58 KB)
  Total size of DFA:  139 KB (192 KB)
  Time to generate neighborhood:  0.01u 0.00s 0.01t  Elapsed: 00:00:00
  No. of threads or processors used:  6
  Search cpu time:  90.19u 1.14s 91.33t  Elapsed: 00:00:16
  Total cpu time:  90.22u 1.15s 91.37t  Elapsed: 00:00:16
  Start:  Mon Oct  1 19:50:37 2001   End:  Mon Oct  1 19:50:53 2001

Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000