BLASTX+BEAUTY Search Results

WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.

BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.

BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract

Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract



RepeatMasker repeats found in sequence:

No Repeats Found.

Reference:  Gish, Warren (1994-1997).  unpublished.
Gish, Warren and David J. States (1993).  Identification of protein coding
regions by database similarity search.  Nat. Genet. 3:266-72.

Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.

Query= B10F06.seq(1>489) (459 letters)

  Translating both strands of query sequence in all 6 reading frames

Database: nr 625,274 sequences; 197,782,623 total letters.



     Observed Numbers of Database Sequences Satisfying
    Various EXPECTation Thresholds (E parameter values)

        Histogram units:      = 6 Sequences     : less than 6 sequences

 EXPECTation Threshold
 (E parameter)
    |
    V   Observed Counts-->
  10000 1914 326 |======================================================
   6310 1588 237 |=======================================
   3980 1351 305 |==================================================
   2510 1046 287 |===============================================
   1580  759 115 |===================
   1000  644 118 |===================
    631  526 135 |======================
    398  391  97 |================
    251  294  39 |======
    158  255  69 |===========
    100  186  35 |=====
   63.1  151  97 |================
   39.8   54  31 |=====
   25.1   23   6 |=
   15.8   17   9 |=
 >>>>>>>>>>>>>>>>>>>>>  Expect = 10.0, Observed = 8  <<<<<<<<<<<<<<<<<
   10.0    8   1 |:
   6.31    7   1 |:
   3.98    6   0 |
   2.51    6   2 |:
   1.58    4   0 |
   1.00    4   1 |:
   0.63    3   0 |
   0.40    3   0 |
   0.25    3   2 |:


                                                                     Smallest
                                                                       Sum
                                                     Reading  High  Probability
Sequences producing High-scoring Segment Pairs:        Frame Score  P(N)      N
gi|12322207|gb|AAG51143.1|AC069273_14(AC069273) unkno... +3   341  5.5e-30   1
gi|282138|pir||B42645ribosomal protein L29 - Chlamydi... -3    74  0.15      1
gi|6920076|sp|P28538|RL29_CHLTR50S RIBOSOMAL PROTEIN ... -3    74  0.15      1
gi|11362655|pir||G81664ribosomal protein L29 TC0807 [... -3    68  0.56      1
gi|2245471|gb|AAB62522.1|(AF000523) Tat protein [Huma... +2    64  0.91      1
gi|7297461|gb|AAF52718.1|(AE003623) CG15868 gene prod... +3    64  0.91      1
gi|1749861|gb|AAB39117.1|(U80467) tat protein [Human ... +2    61  0.995     1
gi|102568|pir||S08572chymotrypsin/elastase inhibitor ... -1    59  0.9999    1

Use the and icons to retrieve links to Entrez:

E = Retrieve Entrez links (e.g., Medline abstracts, FASTA-formatted sequence reports).
R = Retrieve links to Related sequences (neighbors).
Use the icon (if present) to retrieve links to the Sequence Retrieval System (SRS).
Use the icon (if present) to retrieve links to the Ligand Enzyme and Chemical Compound Database .
Use the icon (if present) to retrieve links to the Protein Data Bank database.


to_Entrezto_Relatedto_Related >gi|12322207|gb|AAG51143.1|AC069273_14  (AC069273) unknown protein [Arabidopsis
            thaliana]
            Length = 97

Frame  3 hits (HSPs):   _________________________________________________ 
                        __________________________________________________
Database sequence:     |         |          |         |         |         | 97
                       0        20         40        60        80

  Plus Strand HSPs:

 Score = 341 (120.0 bits), Expect = 5.5e-30, P = 5.5e-30
 Identities = 64/96 (66%), Positives = 81/96 (84%), Frame = +3

Query:    57 MAFNNALRSAAKLVASSESSFSNSVSRGFHSTGMKRM-GGGHGHDEPYYLHAKHMYNLDK 233
             MA + ++RS +K++ASSE+S S SV+R FHSTG+K+M GGGHG  + YYLHAKHMYNLD+
Sbjct:     1 MALSTSIRSVSKIIASSEASVSRSVTRSFHSTGVKKMSGGGHGGYDEYYLHAKHMYNLDR 60

Query:   234 MKHQGLKMSLAVFIAFSIGVAVPVYAVIFQQKKTAS 341
             MK+Q LKMSL VF AFSIGV VP++AV+FQQ+KT S
Sbjct:    61 MKYQALKMSLGVFTAFSIGVGVPIFAVVFQQRKTQS 96


to_Entrezto_Relatedto_Related >gi|282138|pir||B42645  ribosomal protein L29 - Chlamydia trachomatis
            >gi|144619|gb|AAA23170.1| (M80325) ribosomal protein CtrL29e
            [Chlamydia trachomatis]
            Length = 72

Frame -3 hits (HSPs):     _________________________________               
Annotated Domains:                                    ___________         
                        __________________________________________________
Database sequence:     |             |             |            |         | 72
                       0            20            40           60
__________________

Annotated Domains:
   PROSITE              RIBOSOMAL_L29: Ribosomal protein L29 sig 45..59
__________________


  Minus Strand HSPs:

 Score = 74 (26.0 bits), Expect = 0.17, P = 0.15
 Identities = 19/47 (40%), Positives = 27/47 (57%), Frame = -3

Query:   142 KPLLTELEKEDSEEATSFAADLSALLKAIGFESDALENEMVSLHAYS 2
             K LL EL ++ SEE   F  D    L A+  E+ AL+N++V  H +S
Sbjct:     5 KNLLAELREKSSEELDEFIRDNKKALFALRAEA-ALQNKVVKTHQFS 50


to_Entrezto_Relatedto_Related >gi|6920076|sp|P28538|RL29_CHLTR  50S RIBOSOMAL PROTEIN L29
            >gi|3328957|gb|AAC68121.1| (AE001323) L29 Ribosomal Protein
            [Chlamydia trachomatis]
            Length = 72

Frame -3 hits (HSPs):     _________________________________               
Annotated Domains:      _________________________________________________ 
                        __________________________________________________
Database sequence:     |             |             |            |         | 72
                       0            20            40           60
__________________

Annotated Domains:
   BLOCKS               BL00579A: Ribosomal protein L29 proteins 10..19
   BLOCKS               BL00579B: Ribosomal protein L29 proteins 37..66
   DOMO                 DM01647: RIBOSOMALPROTEINL29             1..68
   PFAM                 Ribosomal_L29: Ribosomal L29 protein     8..71
   PRODOM               PD187745: O84525(1) RL29(1) Q9Z7R5(1)    1..71
   PROSITE              RIBOSOMAL_L29: Ribosomal protein L29 sig 45..59
__________________


  Minus Strand HSPs:

 Score = 74 (26.0 bits), Expect = 0.17, P = 0.15
 Identities = 19/47 (40%), Positives = 27/47 (57%), Frame = -3

Query:   142 KPLLTELEKEDSEEATSFAADLSALLKAIGFESDALENEMVSLHAYS 2
             K LL EL ++ SEE   F  D    L A+  E+ AL+N++V  H +S
Sbjct:     5 KNLLAELREKSSEELDEFIRDNKKALFALRAEA-ALQNKVVKTHQFS 50


to_Entrezto_Relatedto_Related >gi|11362655|pir||G81664  ribosomal protein L29 TC0807 [imported] - Chlamydia
            muridarum (strain Nigg) >gi|7190835|gb|AAF39610.1| (AE002347)
            ribosomal protein L29 [Chlamydia muridarum]
            Length = 72

Frame -3 hits (HSPs):     _________________________________               
                        __________________________________________________
Database sequence:     |             |             |            |         | 72
                       0            20            40           60

  Minus Strand HSPs:

 Score = 68 (23.9 bits), Expect = 0.83, P = 0.56
 Identities = 18/47 (38%), Positives = 25/47 (53%), Frame = -3

Query:   142 KPLLTELEKEDSEEATSFAADLSALLKAIGFESDALENEMVSLHAYS 2
             K LL EL ++ SEE   F  D    L  +  E+ AL+N+ V  H +S
Sbjct:     5 KNLLAELREKSSEELDEFIRDNKKALFTLRAEA-ALQNKAVKTHQFS 50


to_Entrezto_Relatedto_Related >gi|2245471|gb|AAB62522.1|  (AF000523) Tat protein [Human immunodeficiency virus
            type 1]
            Length = 71

Frame  2 hits (HSPs):            ___________________                      
                        __________________________________________________
Database sequence:     |             |             |             |        | 71
                       0            20            40            60

  Plus Strand HSPs:

 Score = 64 (22.5 bits), Expect = 2.4, P = 0.91
 Identities = 10/27 (37%), Positives = 12/27 (44%), Frame = +2

Query:   242 PGVENVPCCVHCFQHWCCSSCVCCHFP 322
             PG +  P C  C+   CC  C  C  P
Sbjct:    14 PGSQPKPACTTCYCKKCCLHCQVCFIP 40


to_Entrezto_Relatedto_Related >gi|7297461|gb|AAF52718.1|  (AE003623) CG15868 gene product [Drosophila
            melanogaster]
            Length = 67

Frame  3 hits (HSPs):                   __________________________        
                        __________________________________________________
Database sequence:     |              |              |              |     | 67
                       0             20             40             60

  Plus Strand HSPs:

 Score = 64 (22.5 bits), Expect = 2.4, P = 0.91
 Identities = 12/35 (34%), Positives = 19/35 (54%), Frame = +3

Query:   144 HSTGMKRMGGGHGHDE--PYYLHAKHMYNLDKMKH 242
             HS+  K  G GHGH E  P+++H+     +   +H
Sbjct:    23 HSSSGKHQGSGHGHREQQPHHVHSLATSTMASSRH 57


to_Entrezto_Relatedto_Related >gi|1749861|gb|AAB39117.1|  (U80467) tat protein [Human immunodeficiency virus
            type 1]
            Length = 72

Frame  2 hits (HSPs):            _________________                        
                        __________________________________________________
Database sequence:     |             |             |            |         | 72
                       0            20            40           60

  Plus Strand HSPs:

 Score = 61 (21.5 bits), Expect = 5.3, P = 0.99
 Identities = 9/24 (37%), Positives = 11/24 (45%), Frame = +2

Query:   242 PGVENVPCCVHCFQHWCCSSCVCC 313
             PG +    C  C+   CC  C CC
Sbjct:    14 PGSQPKTACTSCYCKRCCFHCQCC 37


to_Entrezto_Relatedto_Related >gi|102568|pir||S08572  chymotrypsin/elastase inhibitor - common roundworm
            Length = 63

Frame -1 hits (HSPs):                      ____________________________   
                        __________________________________________________
Database sequence:     |               |              |               |   | 63
                       0              20             40              60

  Minus Strand HSPs:

 Score = 59 (20.8 bits), Expect = 9.1, P = 1.0
 Identities = 14/35 (40%), Positives = 17/35 (48%), Frame = -1

Query:   252 STPGASSC--PSCTCALHEDSRAHHDHVHLPSSSC 154
             +TP A  C  PSC C+     R  HD   +P S C
Sbjct:    25 NTPCALMCRPPSCECSPGRGMRRTHDGKCVPVSEC 59


Parameters:
  filter=none
  matrix=BLOSUM62
  V=50
  B=50
  E=10
  gi
  H=1
  sort_by_pvalue
  echofilter

  ctxfactor=5.99

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   Std.    0   BLOSUM62                                 0.318   0.135   0.401  
   +3      0   BLOSUM62        0.318   0.135   0.401    0.340   0.143   0.459  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +2      0   BLOSUM62        0.318   0.135   0.401    0.342   0.150   0.555  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +1      0   BLOSUM62        0.318   0.135   0.401    0.353   0.154   0.575  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -1      0   BLOSUM62        0.318   0.135   0.401    0.348   0.147   0.509  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -2      0   BLOSUM62        0.318   0.135   0.401    0.349   0.151   0.514  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -3      0   BLOSUM62        0.318   0.135   0.401    0.350   0.150   0.506  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E    S W   T  X   E2     S2
   +3      0      152       152       10.  74 3  12 22  0.094   34
                                                    30  0.12    36
   +2      0      152       151       10.  74 3  12 22  0.093   34
                                                    30  0.12    36
   +1      0      153       152       10.  74 3  12 22  0.094   34
                                                    30  0.12    36
   -1      0      153       152       10.  74 3  12 22  0.094   34
                                                    30  0.12    36
   -2      0      152       152       10.  74 3  12 22  0.094   34
                                                    30  0.12    36
   -3      0      152       152       10.  74 3  12 22  0.094   34
                                                    30  0.12    36


Statistics:

  Database:  /usr/local/dot5/sl_home/beauty/seqdb/blast/nr
    Title:  nr
    Release date:  unknown
    Posted date:  4:06 PM CST Feb 28, 2001
    Format:  BLAST
  # of letters in database:  197,782,623
  # of sequences in database:  625,274
  # of database sequences satisfying E:  8
  No. of states in DFA:  593 (58 KB)
  Total size of DFA:  192 KB (192 KB)
  Time to generate neighborhood:  0.01u 0.00s 0.01t  Elapsed: 00:00:00
  No. of threads or processors used:  6
  Search cpu time:  154.30u 1.11s 155.41t  Elapsed: 00:00:36
  Total cpu time:  154.33u 1.13s 155.46t  Elapsed: 00:00:36
  Start:  Wed Feb  6 11:47:13 2002   End:  Wed Feb  6 11:47:49 2002

Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000