BLASTX+BEAUTY Search Results

WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.

BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.

BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract

Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract

RepeatMasker Server unavailable.

Reference:  Gish, Warren (1994-1997).  unpublished.
Gish, Warren and David J. States (1993).  Identification of protein coding
regions by database similarity search.  Nat. Genet. 3:266-72.

Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.

Query= B05H01.seq(1>482) (446 letters)

  Translating both strands of query sequence in all 6 reading frames

Database: nr 625,274 sequences; 197,782,623 total letters.



     Observed Numbers of Database Sequences Satisfying
    Various EXPECTation Thresholds (E parameter values)

        Histogram units:      = 4 Sequences     : less than 4 sequences

 EXPECTation Threshold
 (E parameter)
    |
    V   Observed Counts-->
  10000 1103 239 |===========================================================
   6310  864 137 |==================================
   3980  727 194 |================================================
   2510  533 110 |===========================
   1580  423 118 |=============================
   1000  305  73 |==================
    631  232  60 |===============
    398  172  28 |=======
    251  144  35 |========
    158  109  40 |==========
    100   69  18 |====
   63.1   51  10 |==
   39.8   41   8 |==
   25.1   33  21 |=====
   15.8   12   5 |=
 >>>>>>>>>>>>>>>>>>>>>  Expect = 10.0, Observed = 7  <<<<<<<<<<<<<<<<<
   10.0    7   4 |=
   6.31    3   1 |:
   3.98    2   0 |
   2.51    2   2 |:


                                                                     Smallest
                                                                       Sum
                                                     Reading  High  Probability
Sequences producing High-scoring Segment Pairs:        Frame Score  P(N)      N
gi|139588|sp|P28087|VSH_MUMPRSMALL HYDROPHOBIC PROTEI... +2    64  0.89      1
gi|102069|pir||B35349H+-transporting ATP synthase (EC... +3    64  0.89      1
gi|6425501|emb|CAB60603.1|(AL132865) contains similar... +3    80  0.997     1
gi|1840457|gb|AAC56591.1|(U50293) SH protein [Mumps v... +2    60  0.998     1
gi|808679|gb|AAA66559.1|(M19004) unknown protein [Sai... +1    59  0.9998    1
gi|1840429|gb|AAC56577.1|(U50181) SH protein [Mumps v... +2    59  0.9998    1
gi|1840439|gb|AAC56582.1|(U50284) SH protein [Mumps v... +2    59  0.9998    1

Use the and icons to retrieve links to Entrez:

E = Retrieve Entrez links (e.g., Medline abstracts, FASTA-formatted sequence reports).
R = Retrieve links to Related sequences (neighbors).
Use the icon (if present) to retrieve links to the Sequence Retrieval System (SRS).
Use the icon (if present) to retrieve links to the Ligand Enzyme and Chemical Compound Database .
Use the icon (if present) to retrieve links to the Protein Data Bank database.


to_Entrezto_Relatedto_Related >gi|139588|sp|P28087|VSH_MUMPR  SMALL HYDROPHOBIC PROTEIN >gi|94037|pir||S24827
            small hydrophobic protein - mumps virus >gi|60595|emb|CAA45241.1|
            (X63708) SH gene [Mumps virus]
            Length = 57

Frame  2 hits (HSPs):               __________________________            
Annotated Domains:      __________________________________________________
                        __________________________________________________
Database sequence:     |                |                 |               | 57
                       0               20                40
__________________

Annotated Domains:
   Entrez               Transmembrane region: POTENTIAL.         8..32
   PFAM                 SH: Viral small hydrophobic protein      1..57
   PRODOM               PD001504: VSH(13)                        27..56
__________________


  Plus Strand HSPs:

 Score = 64 (22.5 bits), Expect = 2.2, P = 0.89
 Identities = 13/30 (43%), Positives = 20/30 (66%), Frame = +2

Query:   161 LLLLFRGLCLVKIVIVLLFYQTVVRHCVLH 250
             L+LL+R + L   +I+ + Y+T VRH  LH
Sbjct:    15 LILLYRIITLYVWIILTITYKTSVRHAALH 44


to_Entrezto_Relatedto_Relatedto_ec >gi|102069|pir||B35349  H+-transporting ATP synthase (EC 3.6.1.34) protein 6 -
            Sauroleishmania tarentolae mitochondrion (SGC6) (fragment)
            Length = 67

Frame  3 hits (HSPs):   _________________________________                 
Annotated Domains:                                     ____________       
                        __________________________________________________
Database sequence:     |              |              |              |     | 67
                       0             20             40             60
__________________

Annotated Domains:
   PROSITE              CRYSTALLIN_BETAGAMMA: Crystallins beta a 43..58
__________________


  Plus Strand HSPs:

 Score = 64 (22.5 bits), Expect = 2.2, P = 0.89
 Identities = 19/45 (42%), Positives = 25/45 (55%), Frame = +3

Query:   153 MFDYCYCLEDSV*LK*LLCFCF------IRQL*GTVFYMCLQLSACI 275
             MF +  C  D V ++ LLCFC+      I  L   VFY+C +L  CI
Sbjct:     1 MFVFFVC--DLVIMRILLCFCYSVWSRIIFVLPVNVFYICTELMFCI 45


to_Entrezto_Relatedto_Related >gi|6425501|emb|CAB60603.1|  (AL132865) contains similarity to Pfam domain:
            PF00001 (7 transmembrane receptor (rhodopsin family)), Score=10.9,
            E-value=0.0031, N=1 [Caenorhabditis elegans]
            Length = 310

Frame  3 hits (HSPs):            ______________                           
                        __________________________________________________
Database sequence:     |       |       |        |       |       |       | | 310
                       0      50     100      150     200     250     300

  Plus Strand HSPs:

 Score = 80 (28.2 bits), Expect = 5.7, P = 1.0
 Identities = 26/89 (29%), Positives = 42/89 (47%), Frame = +3

Query:   123 CSLQIHKEV*MFDYCYCLEDSV*LK*LLCFCFIRQL*GTVFYMCLQLSACIVIFKVRFIK 302
             C+  IH  V MF   Y L  +         CF+    G +F + +Q S C++I   R   
Sbjct:    59 CTYLIHLRVLMFLDMYMLTSTQ--------CFLLSAYG-LFALNMQSSLCLIIGLDRLYN 109

Query:   303 LTSQINYSLSPSSGGIDLFLYLIEVFIIV 389
             +T  + YS  P+S  I LFL+ +   +++
Sbjct:   110 VTYPLRYSQLPNSFYIALFLFCVGFSVLI 138


to_Entrezto_Relatedto_Related >gi|1840457|gb|AAC56591.1|  (U50293) SH protein [Mumps virus]
            Length = 57

Frame  2 hits (HSPs):               __________________________            
                        __________________________________________________
Database sequence:     |                |                 |               | 57
                       0               20                40

  Plus Strand HSPs:

 Score = 60 (21.1 bits), Expect = 6.4, P = 1.0
 Identities = 13/30 (43%), Positives = 19/30 (63%), Frame = +2

Query:   161 LLLLFRGLCLVKIVIVLLFYQTVVRHCVLH 250
             L LL+  + L   +I+ + Y+T VRH VLH
Sbjct:    15 LTLLYLIITLYIWIILTITYETTVRHAVLH 44


to_Entrezto_Relatedto_Related >gi|808679|gb|AAA66559.1|  (M19004) unknown protein [Saimiriine herpesvirus 2]
            Length = 53

Frame  1 hits (HSPs):   ___________________________                       
                        __________________________________________________
Database sequence:     |                 |                  |             | 53
                       0                20                 40

  Plus Strand HSPs:

 Score = 59 (20.8 bits), Expect = 8.4, P = 1.0
 Identities = 12/29 (41%), Positives = 15/29 (51%), Frame = +1

Query:   199 SYC-AFVLSDSCKALCSTCVCSCLHALLF 282
             S C A   SD C+    TC  SC+H L +
Sbjct:     1 SICRAAAASDLCELCQGTCPASCIHTLFY 29


to_Entrezto_Relatedto_Related >gi|1840429|gb|AAC56577.1|  (U50181) SH protein [Mumps virus]
            Length = 57

Frame  2 hits (HSPs):               __________________________            
                        __________________________________________________
Database sequence:     |                |                 |               | 57
                       0               20                40

  Plus Strand HSPs:

 Score = 59 (20.8 bits), Expect = 8.4, P = 1.0
 Identities = 12/30 (40%), Positives = 19/30 (63%), Frame = +2

Query:   161 LLLLFRGLCLVKIVIVLLFYQTVVRHCVLH 250
             L+LL+  + L   +I+ + Y+T VRH  LH
Sbjct:    15 LILLYLIITLYVWIILTITYKTAVRHAALH 44


to_Entrezto_Relatedto_Related >gi|1840439|gb|AAC56582.1|  (U50284) SH protein [Mumps virus]
            >gi|1840447|gb|AAC56586.1| (U50288) SH protein [Mumps virus]
            >gi|1840449|gb|AAC56587.1| (U50289) SH protein [Mumps virus]
            >gi|1840453|gb|AAC56589.1| (U50291) SH protein [Mumps virus]
            >gi|9581809|emb|CAC00540.1| (AJ272363) small hydrophobic protein
            [Mumps virus]
            Length = 57

Frame  2 hits (HSPs):               __________________________            
                        __________________________________________________
Database sequence:     |                |                 |               | 57
                       0               20                40

  Plus Strand HSPs:

 Score = 59 (20.8 bits), Expect = 8.4, P = 1.0
 Identities = 12/30 (40%), Positives = 19/30 (63%), Frame = +2

Query:   161 LLLLFRGLCLVKIVIVLLFYQTVVRHCVLH 250
             L+LL+  + L   +I+ + Y+T VRH  LH
Sbjct:    15 LILLYLIITLYVWIILTITYKTAVRHAALH 44


Parameters:
  filter=none
  matrix=BLOSUM62
  V=50
  B=50
  E=10
  gi
  H=1
  sort_by_pvalue
  echofilter

  ctxfactor=5.97

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   Std.    0   BLOSUM62                                 0.318   0.135   0.401  
   +3      0   BLOSUM62        0.318   0.135   0.401    0.354   0.161   0.561  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +2      0   BLOSUM62        0.318   0.135   0.401    0.358   0.163   0.609  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +1      0   BLOSUM62        0.318   0.135   0.401    0.356   0.154   0.543  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -1      0   BLOSUM62        0.318   0.135   0.401    0.352   0.158   0.527  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -2      0   BLOSUM62        0.318   0.135   0.401    0.348   0.152   0.535  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -3      0   BLOSUM62        0.318   0.135   0.401    0.375   0.172   0.697  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E    S W   T  X   E2     S2
   +3      0      148       147       10.  74 3  12 22  0.12    33
                                                    30  0.11    36
   +2      0      148       147       10.  74 3  12 22  0.12    33
                                                    30  0.11    36
   +1      0      148       147       10.  74 3  12 22  0.12    33
                                                    30  0.11    36
   -1      0      148       148       10.  74 3  12 22  0.091   34
                                                    30  0.11    36
   -2      0      148       147       10.  74 3  12 22  0.12    33
                                                    30  0.11    36
   -3      0      148       147       10.  74 3  12 22  0.12    33
                                                    30  0.11    36


Statistics:

  Database:  /usr/local/dot5/sl_home/beauty/seqdb/blast/nr
    Title:  nr
    Release date:  unknown
    Posted date:  4:06 PM CST Feb 28, 2001
    Format:  BLAST
  # of letters in database:  197,782,623
  # of sequences in database:  625,274
  # of database sequences satisfying E:  7
  No. of states in DFA:  591 (58 KB)
  Total size of DFA:  186 KB (192 KB)
  Time to generate neighborhood:  0.01u 0.00s 0.01t  Elapsed: 00:00:00
  No. of threads or processors used:  6
  Search cpu time:  147.96u 1.05s 149.01t  Elapsed: 00:00:26
  Total cpu time:  147.98u 1.07s 149.05t  Elapsed: 00:00:26
  Start:  Sat Feb  2 04:53:56 2002   End:  Sat Feb  2 04:54:22 2002

Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000