Please help us to improve our services and obtain funding for the
BCM Search Launcher
-- take a minute to complete our User Survey


BLASTX+BEAUTY Search Results

WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.

BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.

BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract

Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract



RepeatMasker repeats found in sequence:

No Repeats Found.

Reference:  Gish, Warren (1994-1997).  unpublished.
Gish, Warren and David J. States (1993).  Identification of protein coding
regions by database similarity search.  Nat. Genet. 3:266-72.

Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.

Query= A07C02_CONSENSUS (104 letters)

  Translating both strands of query sequence in all 6 reading frames

Database: nr 625,274 sequences; 197,782,623 total letters.



     Observed Numbers of Database Sequences Satisfying
    Various EXPECTation Thresholds (E parameter values)

        Histogram units:      = 31 Sequences     : less than 31 sequences

 EXPECTation Threshold
 (E parameter)
    |
    V   Observed Counts-->
  10000 9342 1670 |=====================================================
   6310 7672 1837 |===========================================================
   3980 5835 1746 |========================================================
   2510 4089 1386 |============================================
   1580 2703  864 |===========================
   1000 1839  653 |=====================
    631 1186  488 |===============
    398  698  304 |=========
    251  394  153 |====
    158  241  101 |===
    100  140   56 |=
   63.1   84   33 |=
   39.8   51   26 |:
   25.1   25   13 |:
   15.8   12    7 |:
 >>>>>>>>>>>>>>>>>>>>>  Expect = 10.0, Observed = 5  <<<<<<<<<<<<<<<<<
   10.0    5    2 |:
   6.31    3    0 |
   3.98    3    2 |:
   2.51    1    1 |:


                                                                     Smallest
                                                                       Sum
                                                     Reading  High  Probability
Sequences producing High-scoring Segment Pairs:        Frame Score  P(N)      N
gi|1076557|pir||S54157   extensin-like protein - cowp... +3    56  0.91      2
gi|7487811|pir||T00609   hypothetical protein T8K22.1... +3    53  0.93      2
gi|7300325|gb|AAF55485.1|(AE003720) CG12349 gene prod... +3    60  0.96      1
gi|7291971|gb|AAF47387.1|(AE003468) CG13885 gene prod... +3    57  0.999     1
gi|7446850|pir||A69981   conserved hypothetical prote... -3    38  0.9994    2



Locally-aligned regions (HSPs) with respect to query sequence:

Locus_ID                Frame 3 Hits
gi|1076557             |                  ___________________________     
gi|7487811             |                    ________________________      
gi|7300325             |     ________________________________________     
gi|7291971             |                  ___________________             
                        __________________________________________________
Query sequence:        |                           |                      | 35
                       0                          20


Locus_ID                Frame 2 Hits
gi|1076557             |_________________                                 
gi|7487811             |    __________________                            
                        __________________________________________________
Query sequence:        |                           |                      | 35
                       0                          20


Locus_ID                Frame -3 Hits
gi|7446850             |     ______________________________               
                        __________________________________________________
Query sequence:        |                           |                      | 35
                       0                          20

Use the and icons to retrieve links to Entrez:

E = Retrieve Entrez links (e.g., Medline abstracts, FASTA-formatted sequence reports).
R = Retrieve links to Related sequences (neighbors).
Use the icon (if present) to retrieve links to the Sequence Retrieval System (SRS).
Use the icon (if present) to retrieve links to the Ligand Enzyme and Chemical Compound Database .
Use the icon (if present) to retrieve links to the Protein Data Bank database.


to_Entrezto_Relatedto_Related >gi|1076557|pir||S54157  extensin-like protein - cowpea (fragment)
            Length = 279

Frame  3 hits (HSPs):                                         ____  ____  
Frame  2 hits (HSPs):                            ___                      
                        __________________________________________________
Database sequence:     |        |        |        |        |        |     | 279
                       0       50      100      150      200      250

  Plus Strand HSPs:

 Score = 56 (19.7 bits), Expect = 2.4, Sum P(2) = 0.91
 Identities = 11/18 (61%), Positives = 12/18 (66%), Frame = +3

Query:    42 IHSHHHHL*HQH--MFTT 89
             I  HHHHL H H  MFT+
Sbjct:   215 ISPHHHHLLHHHLLMFTS 232

 Score = 54 (19.0 bits), Expect = 3.9, Sum P(2) = 0.98
 Identities = 10/19 (52%), Positives = 12/19 (63%), Frame = +3

Query:    42 IHSHHHHL*HQHMF-TTIP 95
             I  HHHHL H H+  +T P
Sbjct:   247 ISRHHHHLLHHHLLMSTSP 265

 Score = 29 (10.2 bits), Expect = 2.4, Sum P(2) = 0.91
 Identities = 6/11 (54%), Positives = 8/11 (72%), Frame = +2

Query:     2 HEILSTSNPFL 34
             H  +STS+P L
Sbjct:   146 HLPMSTSHPLL 156


to_Entrezto_Relatedto_Related >gi|7487811|pir||T00609  hypothetical protein T8K22.16 - Arabidopsis thaliana
            >gi|3184285|gb|AAC18932.1| (AC004136) hypothetical protein
            [Arabidopsis thaliana]
            Length = 310

Frame  3 hits (HSPs):            ____                                     
Frame  2 hits (HSPs):         ___                                         
                        __________________________________________________
Database sequence:     |       |       |        |       |       |       | | 310
                       0      50     100      150     200     250     300

  Plus Strand HSPs:

 Score = 53 (18.7 bits), Expect = 2.6, Sum P(2) = 0.93
 Identities = 9/16 (56%), Positives = 11/16 (68%), Frame = +3

Query:    45 HSHHHHL*HQHMFTTI 92
             HSHHHH+ +  M T I
Sbjct:    62 HSHHHHVGYNIMVTNI 77

 Score = 33 (11.6 bits), Expect = 2.6, Sum P(2) = 0.93
 Identities = 5/12 (41%), Positives = 11/12 (91%), Frame = +2

Query:    11 LSTSNPFLLTNS 46
             ++TSNP L++++
Sbjct:    41 ITTSNPLLVSSN 52


to_Entrezto_Relatedto_Related >gi|7300325|gb|AAF55485.1|  (AE003720) CG12349 gene product [Drosophila
            melanogaster]
            Length = 102

Frame  3 hits (HSPs):                                      ______________ 
                        __________________________________________________
Database sequence:     |         |         |        |         |         | | 102
                       0        20        40       60        80       100

  Plus Strand HSPs:

 Score = 60 (21.1 bits), Expect = 3.3, P = 0.96
 Identities = 12/27 (44%), Positives = 12/27 (44%), Frame = +3

Query:    15 QPQTPSY*LIHSHHHHL*HQHMFTTIP 95
             Q Q     L H HHHH  HQ  F   P
Sbjct:    73 QQQAQQQHLSHHHHHHHHHQQQFLMTP 99


to_Entrezto_Relatedto_Related >gi|7291971|gb|AAF47387.1|  (AE003468) CG13885 gene product [Drosophila
            melanogaster]
            Length = 71

Frame  3 hits (HSPs):                   _________                         
                        __________________________________________________
Database sequence:     |             |             |             |        | 71
                       0            20            40            60

  Plus Strand HSPs:

 Score = 57 (20.1 bits), Expect = 6.8, P = 1.0
 Identities = 9/12 (75%), Positives = 9/12 (75%), Frame = +3

Query:    42 IHSHHHHL*HQH 77
             IHSHHHH  H H
Sbjct:    25 IHSHHHHHSHSH 36


to_Entrezto_Relatedto_Related >gi|7446850|pir||A69981  conserved hypothetical protein yrvI - Bacillus subtilis
            >gi|2635223|emb|CAB14718.1| (Z99118) similar to hypothetical
            proteins [Bacillus subtilis]
            Length = 119

Frame -3 hits (HSPs):    _____                            ______          
                        __________________________________________________
Database sequence:     |                    |                    |        | 119
                       0                   50                  100

  Minus Strand HSPs:

 Score = 38 (13.4 bits), Expect = 7.5, Sum P(2) = 1.0
 Identities = 6/11 (54%), Positives = 10/11 (90%), Frame = -3

Query:    75 VGVTNDDDENE 43
             VG+T+DD E++
Sbjct:     4 VGITHDDTEDD 14

 Score = 33 (11.6 bits), Expect = 7.5, Sum P(2) = 1.0
 Identities = 6/12 (50%), Positives = 9/12 (75%), Frame = -3

Query:    48 NELVSRKGFEVE 13
             N+L+  KG +VE
Sbjct:    83 NDLLREKGIKVE 94


Parameters:
  filter=none
  matrix=BLOSUM62
  V=50
  B=50
  E=10
  gi
  H=1
  sort_by_pvalue
  echofilter

  ctxfactor=5.96

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   Std.    0   BLOSUM62                                 0.318   0.135   0.401  
   +3      0   BLOSUM62        0.318   0.135   0.401    0.384   0.169   0.737  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +2      0   BLOSUM62        0.316   0.129   0.371    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +1      0   BLOSUM62        0.318   0.135   0.401    0.343   0.153   0.494  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -1      0   BLOSUM62        0.318   0.135   0.401    0.352   0.150   0.501  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -2      0   BLOSUM62        0.318   0.135   0.401    0.437   0.208   1.23   
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -3      0   BLOSUM62        0.318   0.135   0.401    0.352   0.163   0.526  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E    S W   T  X   E2     S2
   +3      0       34        34       10.  58 3  12 22  0.089   27
                                                    24  0.011   27
   +2      0       34        34       10.  58 3  12 22  0.11    26
                                                    24  0.014   26
   +1      0       34        34       10.  58 3  12 22  0.089   27
                                                    24  0.011   27
   -1      0       34        34       10.  58 3  12 22  0.089   27
                                                    24  0.011   27
   -2      0       34        34       10.  58 3  12 22  0.089   27
                                                    24  0.011   27
   -3      0       34        34       10.  58 3  12 22  0.089   27
                                                    24  0.011   27


Statistics:

  Database:  /usr/local/dot5/sl_home/beauty/seqdb/blast/nr
    Title:  nr
    Release date:  unknown
    Posted date:  4:06 PM CST Feb 28, 2001
    Format:  BLAST
  # of letters in database:  197,782,623
  # of sequences in database:  625,274
  # of database sequences satisfying E:  5
  No. of states in DFA:  524 (52 KB)
  Total size of DFA:  82 KB (128 KB)
  Time to generate neighborhood:  0.01u 0.00s 0.01t  Elapsed: 00:00:00
  No. of threads or processors used:  6
  Search cpu time:  38.33u 1.25s 39.58t  Elapsed: 00:00:07
  Total cpu time:  38.34u 1.27s 39.61t  Elapsed: 00:00:07
  Start:  Mon Oct  1 20:40:08 2001   End:  Mon Oct  1 20:40:15 2001

Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000