BLASTX+BEAUTY Search Results

WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.

BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.

BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract

Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract



RepeatMasker repeats found in sequence:

No Repeats Found.

Reference:  Gish, Warren (1994-1997).  unpublished.
Gish, Warren and David J. States (1993).  Identification of protein coding
regions by database similarity search.  Nat. Genet. 3:266-72.

Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.

Query= 'E04C06_C06_06.ab1' (421 letters)

  Translating both strands of query sequence in all 6 reading frames

Database: nr 625,274 sequences; 197,782,623 total letters.



     Observed Numbers of Database Sequences Satisfying
    Various EXPECTation Thresholds (E parameter values)

        Histogram units:      = 4 Sequences     : less than 4 sequences

 EXPECTation Threshold
 (E parameter)
    |
    V   Observed Counts-->
  10000 760 195 |================================================
   6310 565 114 |============================
   3980 451 108 |===========================
   2510 343  97 |========================
   1580 246  80 |====================
   1000 166  53 |=============
    631 113  31 |=======
    398  82  15 |===
    251  67  40 |==========
    158  27  11 |==
    100  16   2 |:
   63.1  14   1 |:
   39.8  13   5 |=
   25.1   8   0 |
   15.8   8   2 |:
 >>>>>>>>>>>>>>>>>>>>>  Expect = 10.0, Observed = 6  <<<<<<<<<<<<<<<<<
   10.0   6   0 |
   6.31   6   1 |:
   3.98   5   0 |
   2.51   5   0 |
   1.58   5   0 |
   1.00   5   0 |
   0.63   5   0 |
   0.40   5   0 |
   0.25   5   1 |:
   0.16   4   0 |
   0.10   4   0 |
  0.063   4   0 |
  0.040   4   1 |:
  0.025   3   0 |
  0.016   3   0 |
  0.010   3   0 |
 0.0063   3   0 |
 0.0040   3   0 |
 0.0025   3   0 |
 0.0016   3   1 |:


                                                                     Smallest
                                                                       Sum
                                                     Reading  High  Probability
Sequences producing High-scoring Segment Pairs:        Frame Score  P(N)      N
gi|9758068|dbj|BAB08647.1|(AB009048) frnE protein-lik... +1   274  6.9e-23   1
gi|10174520|dbj|BAB05621.1|(AP001513) BH1902~unknown ... +1   104  7.1e-05   1
gi|3978168|gb|AAD03806.1|(AF047427) unknown [Mannheim... +1    93  0.0010    1
gi|7471903|pir||E75491frnE protein - Deinococcus radi... +1    94  0.036     1
gi|3170570|gb|AAC18100.1|(AF058302) FrnE [Streptomyce... +1    88  0.18      1
gi|1353122|sp|P34655|YOT0_CAEELHYPOTHETICAL 8.7 KD PR... -2    61  0.99      1

Use the and icons to retrieve links to Entrez:

E = Retrieve Entrez links (e.g., Medline abstracts, FASTA-formatted sequence reports).
R = Retrieve links to Related sequences (neighbors).
Use the icon (if present) to retrieve links to the Sequence Retrieval System (SRS).
Use the icon (if present) to retrieve links to the Ligand Enzyme and Chemical Compound Database .
Use the icon (if present) to retrieve links to the Protein Data Bank database.


to_Entrezto_Relatedto_Related >gi|9758068|dbj|BAB08647.1|  (AB009048) frnE protein-like [Arabidopsis thaliana]
            Length = 217

Frame  1 hits (HSPs):                                  ___________________
                        __________________________________________________
Database sequence:     |           |          |           |          |    | 217
                       0          50        100         150        200

  Plus Strand HSPs:

 Score = 274 (96.5 bits), Expect = 6.9e-23, P = 6.9e-23
 Identities = 53/78 (67%), Positives = 62/78 (79%), Frame = +1

Query:    13 GKYIGDHKFLLESAAKVGIEGAEEFLKDPNNGLKEVEEELKTYSGNISGVPYYVINGNHK 192
             GK+IGD +FL+E+A KVGIEGAEEFL DPNNG+ EV+EEL  YS NI+GVP Y ING  K
Sbjct:   139 GKFIGDREFLVETANKVGIEGAEEFLSDPNNGVTEVKEELAKYSKNITGVPNYTINGKVK 198

Query:   193 LSGGQPPEVFLRAFQVAT 246
             LSG QPPE F  AF+ A+
Sbjct:   199 LSGAQPPETFQSAFKAAS 216


to_Entrezto_Relatedto_Related >gi|10174520|dbj|BAB05621.1|  (AP001513) BH1902~unknown conserved protein in
            others [Bacillus halodurans]
            Length = 95

Frame  1 hits (HSPs):               ____________________                  
                        __________________________________________________
Database sequence:     |          |         |          |         |        | 95
                       0         20        40         60        80

  Plus Strand HSPs:

 Score = 104 (36.6 bits), Expect = 7.1e-05, P = 7.1e-05
 Identities = 20/38 (52%), Positives = 26/38 (68%), Frame = +1

Query:   124 EELKTYSGNISGVPYYVINGNHKLSGGQPPEVFLRAFQ 237
             EE K  S  + GVPY+VIN  + LSG QP +VF+RA +
Sbjct:    24 EEAKAQSLQVRGVPYFVINDKYALSGAQPTDVFVRALK 61


to_Entrezto_Relatedto_Related >gi|3978168|gb|AAD03806.1|  (AF047427) unknown [Mannheimia haemolytica]
            Length = 94

Frame  1 hits (HSPs):              _____________________________________  
                        __________________________________________________
Database sequence:     |          |         |          |          |       | 94
                       0         20        40         60         80

  Plus Strand HSPs:

 Score = 93 (32.7 bits), Expect = 0.0010, P = 0.0010
 Identities = 24/70 (34%), Positives = 37/70 (52%), Frame = +1

Query:    40 LLESAAKVGIEGAE--EFLKDPNNGLKEVEEELKTYSGNISGVPYYVINGNHKLSGGQPP 213
             L+  A  +G+E  E  + L   + G +  E+E   +   I  VP++VIN    +SG QPP
Sbjct:    22 LISLALDIGLERDEIAQLLTGDDFGHEVREDERVAHKYGIHSVPFFVINEKLGVSGAQPP 81

Query:   214 EVFLRAFQVA 243
             E+ L A + A
Sbjct:    82 EILLDAIKQA 91


to_Entrezto_Relatedto_Related >gi|7471903|pir||E75491  frnE protein - Deinococcus radiodurans (strain R1)
            >gi|6458362|gb|AAF10238.1|AE001923_5 (AE001923) frnE protein
            [Deinococcus radiodurans]
            Length = 252

Frame  1 hits (HSPs):                              ________________       
                        __________________________________________________
Database sequence:     |         |         |         |         |         || 252
                       0        50       100       150       200       250

  Plus Strand HSPs:

 Score = 94 (33.1 bits), Expect = 0.037, P = 0.036
 Identities = 24/75 (32%), Positives = 40/75 (53%), Frame = +1

Query:    13 GKYIGDHKFLLESAAKVGIE-GAEEFLKDPNNGLKEVE-EELKTYSGNISGVPYYVINGN 186
             G+ + D   L + AA+VG++ GA     +     + V  +E +     I+GVP++V+ G 
Sbjct:   140 GQNVNDLDTLQKLAAEVGLDAGAARAALEAGTYAQAVRYDEAQAQQLGITGVPFFVLGGK 199

Query:   187 HKLSGGQPPEVFLRA 231
             + +SG Q PE  L A
Sbjct:   200 YGVSGAQAPETLLGA 214


to_Entrezto_Relatedto_Related >gi|3170570|gb|AAC18100.1|  (AF058302) FrnE [Streptomyces roseofulvus]
            Length = 216

Frame  1 hits (HSPs):                                 _________________   
                        __________________________________________________
Database sequence:     |           |          |           |           |   | 216
                       0          50        100         150         200

  Plus Strand HSPs:

 Score = 88 (31.0 bits), Expect = 0.20, P = 0.18
 Identities = 24/71 (33%), Positives = 35/71 (49%), Frame = +1

Query:    13 GKYIGDHKFLLESAAKVGIEGA---EEFLKDPNNGLKEVEEELKTYSGNISGVPYYVING 183
             G  +GDH  LL  A + G++ A   E    D +      +E+     G + GVP +VI G
Sbjct:   132 GVSVGDHPTLLALAEEAGLDAAAAAEVLAGDAHAEDVRADEDRAARLG-VGGVPAFVIGG 190

Query:   184 NHKLSGGQPPEV 219
                +SG QP E+
Sbjct:   191 RWSVSGAQPAEL 202


to_Entrezto_Relatedto_Related >gi|1353122|sp|P34655|YOT0_CAEEL  HYPOTHETICAL 8.7 KD PROTEIN ZK632.10 IN
            CHROMOSOME III >gi|3881704|emb|CAA80190.1| (Z22181) contains
            similarity to Pfam domain: PF01679 (Uncharacterized protein
            family), Score=99.2, E-value=2.7e-26, N=1 [Caenorhabditis elegans]
            Length = 80

Frame -2 hits (HSPs):                   _______________________           
Annotated Domains:      _________________________________________________ 
                        __________________________________________________
Database sequence:     |           |            |           |             | 80
                       0          20           40          60
__________________

Annotated Domains:
   BLOCKS               BL01309: Uncharacterized protein family  30..50
   Entrez               Transmembrane region: POTENTIAL.         4..24
   Entrez               Transmembrane region: POTENTIAL.         32..52
   PFAM                 UPF0057: Uncharacterized protein family  2..52
   PRODOM               PD004756:                                4..50
   PRODOM               PD051035: YOT0_CAEEL                     52..79
__________________


  Minus Strand HSPs:

 Score = 61 (21.5 bits), Expect = 4.4, P = 0.99
 Identities = 12/39 (30%), Positives = 26/39 (66%), Frame = -2

Query:   132 ELLLNLLQTIVGVLKKLFCSFYTNLCSRFQKKLMVTNVF 16
             +LL+N+L T +G++  +  ++Y  LC   ++K +V N++
Sbjct:    27 DLLINILLTCLGIIPGIIHAWYIILC---KEKTVVQNIY 62


Parameters:
  filter=none
  matrix=BLOSUM62
  V=50
  B=50
  E=10
  gi
  H=1
  sort_by_pvalue
  echofilter

  ctxfactor=5.99

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   Std.    0   BLOSUM62                                 0.318   0.135   0.401  
   +3      0   BLOSUM62        0.318   0.135   0.401    0.353   0.156   0.616  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +2      0   BLOSUM62        0.318   0.135   0.401    0.364   0.165   0.556  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +1      0   BLOSUM62        0.318   0.135   0.401    0.331   0.149   0.471  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -1      0   BLOSUM62        0.318   0.135   0.401    0.354   0.160   0.562  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -2      0   BLOSUM62        0.318   0.135   0.401    0.360   0.164   0.568  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -3      0   BLOSUM62        0.318   0.135   0.401    0.355   0.160   0.566  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E    S W   T  X   E2     S2
   +3      0      139       139       10.  73 3  12 22  0.12    33
                                                    30  0.10    36
   +2      0      140       140       10.  73 3  12 22  0.12    33
                                                    30  0.11    36
   +1      0      140       140       10.  73 3  12 22  0.12    33
                                                    30  0.11    36
   -1      0      140       140       10.  73 3  12 22  0.12    33
                                                    30  0.11    36
   -2      0      140       140       10.  73 3  12 22  0.12    33
                                                    30  0.11    36
   -3      0      139       139       10.  73 3  12 22  0.12    33
                                                    30  0.10    36


Statistics:

  Database:  /usr/local/dot5/sl_home/beauty/seqdb/blast/nr
    Title:  nr
    Release date:  unknown
    Posted date:  4:06 PM CST Feb 28, 2001
    Format:  BLAST
  # of letters in database:  197,782,623
  # of sequences in database:  625,274
  # of database sequences satisfying E:  6
  No. of states in DFA:  591 (58 KB)
  Total size of DFA:  176 KB (192 KB)
  Time to generate neighborhood:  0.01u 0.00s 0.01t  Elapsed: 00:00:00
  No. of threads or processors used:  6
  Search cpu time:  137.39u 0.85s 138.24t  Elapsed: 00:00:36
  Total cpu time:  137.41u 0.87s 138.28t  Elapsed: 00:00:36
  Start:  Fri Jan 18 12:02:07 2002   End:  Fri Jan 18 12:02:43 2002

Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000