BLASTX+BEAUTY Search Results

WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.

BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.

BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract

Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract



RepeatMasker repeats found in sequence:

No Repeats Found.

Reference:  Gish, Warren (1994-1997).  unpublished.
Gish, Warren and David J. States (1993).  Identification of protein coding
regions by database similarity search.  Nat. Genet. 3:266-72.

Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.

Query= 'D16H02_P14_15.ab1' (361 letters)

  Translating both strands of query sequence in all 6 reading frames

Database: nr 625,274 sequences; 197,782,623 total letters.



     Observed Numbers of Database Sequences Satisfying
    Various EXPECTation Thresholds (E parameter values)

        Histogram units:      = 5 Sequences     : less than 5 sequences

 EXPECTation Threshold
 (E parameter)
    |
    V   Observed Counts-->
  10000 946 273 |======================================================
   6310 673 174 |==================================
   3980 499 115 |=======================
   2510 384 105 |=====================
   1580 279 121 |========================
   1000 158  61 |============
    631  97  34 |======
    398  63  25 |=====
    251  38  21 |====
    158  17   7 |=
    100  10   2 |:
   63.1   8   1 |:
   39.8   7   3 |:
   25.1   4   0 |
   15.8   4   1 |:
 >>>>>>>>>>>>>>>>>>>>>  Expect = 10.0, Observed = 3  <<<<<<<<<<<<<<<<<
   10.0   3   0 |
   6.31   3   0 |
   3.98   3   1 |:
   2.51   2   0 |
   1.58   2   1 |:


                                                                     Smallest
                                                                       Sum
                                                     Reading  High  Probability
Sequences producing High-scoring Segment Pairs:        Frame Score  P(N)      N
gi|9758595|dbj|BAB09228.1|(AB009050) gene_id:MDF20.5~... +3   199  1.1e-14   1
gi|9798570|emb|CAC03525.1|(AJ276410) BlpN protein [St... +3    63  0.79      1
gi|109294|pir||S17872translation initiation factor eI... +3    61  0.93      1

Use the and icons to retrieve links to Entrez:

E = Retrieve Entrez links (e.g., Medline abstracts, FASTA-formatted sequence reports).
R = Retrieve links to Related sequences (neighbors).
Use the icon (if present) to retrieve links to the Sequence Retrieval System (SRS).
Use the icon (if present) to retrieve links to the Ligand Enzyme and Chemical Compound Database .
Use the icon (if present) to retrieve links to the Protein Data Bank database.


to_Entrezto_Relatedto_Related >gi|9758595|dbj|BAB09228.1|  (AB009050) gene_id:MDF20.5~unknown protein
            [Arabidopsis thaliana]
            Length = 329

Frame  3 hits (HSPs):                                          ___________
                        __________________________________________________
Database sequence:     |                      |                      |    | 329
                       0                    150                    300

  Plus Strand HSPs:

 Score = 199 (70.1 bits), Expect = 1.1e-14, P = 1.1e-14
 Identities = 38/66 (57%), Positives = 47/66 (71%), Frame = +3

Query:     9 TNQVGVLGQKVFIAXVRCTSSLIFASIGAGIGATLIRPSLGQWVGCALGDLGGPIIVAFC 188
             + Q   LG KV    VRC +SL+FA+IGAGI + LIRPS GQW+GCALGDL GP++V+ C
Sbjct:   261 SQQAKDLGNKVVGITVRCGASLVFAAIGAGICSCLIRPSTGQWIGCALGDLAGPMVVSVC 320

Query:   189 ADKLFQ 206
               K  Q
Sbjct:   321 LQKTLQ 326


to_Entrezto_Relatedto_Related >gi|9798570|emb|CAC03525.1|  (AJ276410) BlpN protein [Streptococcus pneumoniae]
            >gi|11611211|emb|CAC18598.1| (AJ278419) putative bacteriocin
            [Streptococcus pneumoniae]
            Length = 67

Frame  3 hits (HSPs):                    _____________________________    
                        __________________________________________________
Database sequence:     |              |              |              |     | 67
                       0             20             40             60

  Plus Strand HSPs:

 Score = 63 (22.2 bits), Expect = 1.6, P = 0.79
 Identities = 12/39 (30%), Positives = 20/39 (51%), Frame = +3

Query:    75 IFASIGAGIGATLIRPSLGQWVGCALGDLGGPIIVAFCA 191
             + A++G   G       LG W G A+G +GG ++  + A
Sbjct:    25 VVAALGCAAGGVKYGRLLGPW-GAAIGGIGGAVVCGYLA 62


to_Entrezto_Relatedto_Related >gi|109294|pir||S17872  translation initiation factor eIF-2 gamma chain - rabbit
            Length = 78

Frame  3 hits (HSPs):                 ______________________________      
                        __________________________________________________
Database sequence:     |            |            |           |            | 78
                       0           20           40          60

  Plus Strand HSPs:

 Score = 61 (21.5 bits), Expect = 2.6, P = 0.93
 Identities = 16/49 (32%), Positives = 24/49 (48%), Frame = +3

Query:    18 VGVLGQKVFIAXVRCTSSLIFASIGAGIGATLIRPSLGQWVGCALGDLG 164
             +G L    +++ V+     I  + G   G TL +P L  WVG  LG +G
Sbjct:    23 IGSLSTGGWVSAVKADLGKIVLTAGGEAGVTLGQPXL--WVGQVLGAVG 69


Parameters:
  filter=none
  matrix=BLOSUM62
  V=50
  B=50
  E=10
  gi
  H=1
  sort_by_pvalue
  echofilter

  ctxfactor=5.95

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   Std.    0   BLOSUM62                                 0.318   0.135   0.401  
   +3      0   BLOSUM62        0.318   0.135   0.401    0.355   0.161   0.550  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +2      0   BLOSUM62        0.318   0.135   0.401    0.357   0.157   0.685  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +1      0   BLOSUM62        0.318   0.135   0.401    0.353   0.162   0.530  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -1      0   BLOSUM62        0.318   0.135   0.401    0.343   0.148   0.519  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -2      0   BLOSUM62        0.318   0.135   0.401    0.373   0.167   0.662  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -3      0   BLOSUM62        0.318   0.135   0.401    0.337   0.141   0.434  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E    S W   T  X   E2     S2
   +3      0      119       117       10.  71 3  12 22  0.094   33
                                                    29  0.10    35
   +2      0      120       118       10.  71 3  12 22  0.095   33
                                                    29  0.10    35
   +1      0      120       119       10.  71 3  12 22  0.096   33
                                                    29  0.11    35
   -1      0      120       118       10.  71 3  12 22  0.095   33
                                                    29  0.10    35
   -2      0      120       119       10.  71 3  12 22  0.096   33
                                                    29  0.11    35
   -3      0      119       117       10.  71 3  12 22  0.094   33
                                                    29  0.10    35


Statistics:

  Database:  /usr/local/dot5/sl_home/beauty/seqdb/blast/nr
    Title:  nr
    Release date:  unknown
    Posted date:  4:06 PM CST Feb 28, 2001
    Format:  BLAST
  # of letters in database:  197,782,623
  # of sequences in database:  625,274
  # of database sequences satisfying E:  3
  No. of states in DFA:  594 (59 KB)
  Total size of DFA:  161 KB (192 KB)
  Time to generate neighborhood:  0.01u 0.00s 0.01t  Elapsed: 00:00:00
  No. of threads or processors used:  6
  Search cpu time:  114.09u 1.05s 115.14t  Elapsed: 00:00:25
  Total cpu time:  114.12u 1.05s 115.17t  Elapsed: 00:00:25
  Start:  Thu Jan 17 16:16:22 2002   End:  Thu Jan 17 16:16:47 2002

Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000