Please help us to improve our services and obtain funding for the
BCM Search Launcher
-- take a minute to complete our User Survey


BLASTX+BEAUTY Search Results

WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.

BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.

BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract

Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract



RepeatMasker repeats found in sequence:

No Repeats Found.

Reference:  Gish, Warren (1994-1997).  unpublished.
Gish, Warren and David J. States (1993).  Identification of protein coding
regions by database similarity search.  Nat. Genet. 3:266-72.

Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.

Query= A01i19 (930 letters)

  Translating both strands of query sequence in all 6 reading frames

Database: nr 505,245 sequences; 158,518,215 total letters.



     Observed Numbers of Database Sequences Satisfying
    Various EXPECTation Thresholds (E parameter values)

        Histogram units:      = 2 Sequences     : less than 2 sequences

 EXPECTation Threshold
 (E parameter)
    |
    V   Observed Counts-->
  10000 457 107 |=====================================================
   6310 350  55 |===========================
   3980 295  44 |======================
   2510 251  53 |==========================
   1580 198  59 |=============================
   1000 139  31 |===============
    631 108  28 |==============
    398  80  17 |========
    251  63   7 |===
    158  56  18 |=========
    100  38  16 |========
   63.1  22   1 |:
   39.8  21   1 |:
   25.1  20   1 |:
   15.8  19   0 |
 >>>>>>>>>>>>>>>>>>>>>  Expect = 10.0, Observed = 19  <<<<<<<<<<<<<<<<<
   10.0  19   0 |
   6.31  19   0 |
   3.98  19   2 |=
   2.51  17   2 |=
   1.58  15   0 |
   1.00  15   0 |
   0.63  15   0 |
   0.40  15   0 |
   0.25  15   0 |
   0.16  15   0 |
   0.10  15   0 |
  0.063  15   0 |
  0.040  15   1 |:


                                                                     Smallest
                                                                       Sum
                                                     Reading  High  Probability
Sequences producing High-scoring Segment Pairs:        Frame Score  P(N)      N
gi|6671963|gb|AAF23222.1|AC013454_9(AC013454) unknown... +3   742  1.4e-72   1
gi|6714407|gb|AAF26096.1|AC012393_22(AC012393) unknow... +3   316  3.4e-46   2
gi|7485938|pir||T01201hypothetical protein F21E10.13 ... +3   404  9.3e-37   1
gi|7110635ref|NP_036396.1| chromosome 22 open reading... +3   335  1.9e-29   1
gi|6572254|emb|CAB63066.1|(AL020993) dJ5O6.2 (novel p... +3   335  1.9e-29   1
gi|7503052|pir||T21697hypothetical protein F40E10.6 -... +3   340  2.3e-29   1
gi|7292104|gb|AAF47516.1|(AE003472) CG12004 gene prod... +3   331  5.1e-29   1
gi|6572253|emb|CAB63065.1|(AL020993) dJ5O6.2 (novel p... +3   317  1.5e-27   1
gi|7297586|gb|AAF52840.1|(AE003626) CG5850 gene produ... +3   275  2.6e-22   1
gi|7023136|dbj|BAA91851.1|(AK001708) unnamed protein ... +3   242  4.5e-19   1
gi|1351659|sp|Q09906|YAJ6_SCHPOHYPOTHETICAL 49.3 KD P... +3   217  2.4e-16   1
gi|6322904ref|NP_012977.1| Ykr051wp >gi|549619|sp|P36... +3   168  3.6e-09   1
gi|7487004|pir||T00451hypothetical protein T14N5.8 - ... +3   153  2.1e-07   2
gi|7485716|pir||T05165hypothetical protein F18E5.190 ... +3   140  2.9e-06   1
gi|7496248|pir||T15541hypothetical protein C18A3.4 - ... +3   106  0.031     1
gi|7485951|pir||T05664hypothetical protein F22I13.130... +3    89  0.83      2
gi|7020081|dbj|BAA90988.1|(AK000169) unnamed protein ... +3    89  0.89      1
gi|7707666|dbj|BAA95343.1|(AB027560) ATPase subunit 6... +3    83  0.95      1
gi|625591|pir||D61399hypothetical early protein E8 - ... +3    66  0.98      1



Locally-aligned regions (HSPs) with respect to query sequence:

Locus_ID                Frame 3 Hits
gi|6671963             |             _____________________________________
gi|6714407             |             _____________       _________________
gi|7485938             |             _____________________________________
gi|7110635             |             _____________________________________
gi|6572254             |             _____________________________________
gi|7503052             |               ___________________________________
gi|7292104             |              ___________________________________ 
gi|6572253             |                 _________________________________
gi|7297586             |              ____________________________________
gi|7023136             |              ____________________________________
gi|1351659             |               __________________________________ 
gi|6322904             |             _____________________________________
gi|7487004             |               _____________              ________
gi|7485716             |              ___________________________________ 
gi|7496248             |              ________________________            
gi|7485951             |               ___________________________________
gi|7020081             |                               ___________________
gi|7707666             |              ________________________            
gi|625591              |                     _______                      
                        __________________________________________________
Query sequence:        |       |       |        |       |       |       | | 310
                       0      50     100      150     200     250     300

Use the and icons to retrieve links to Entrez:

E = Retrieve Entrez links (e.g., Medline abstracts, FASTA-formatted sequence reports).
R = Retrieve links to Related sequences (neighbors).
Use the icon (if present) to retrieve links to the Sequence Retrieval System (SRS).
Use the icon (if present) to retrieve links to the Ligand Enzyme and Chemical Compound Database .
Use the icon (if present) to retrieve links to the Protein Data Bank database.


to_Entrezto_Relatedto_Related >gi|6671963|gb|AAF23222.1|AC013454_9  (AC013454) unknown protein [Arabidopsis
            thaliana]
            Length = 422

Frame  3 hits (HSPs):   ___________________________                       
                        __________________________________________________
Database sequence:     |                 |                 |              | 422
                       0               150               300

  Plus Strand HSPs:

 Score = 742 (261.2 bits), Expect = 1.4e-72, P = 1.4e-72
 Identities = 146/222 (65%), Positives = 166/222 (74%), Frame = +3

Query:   249 MVPIFLYIVAFICTCGAIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLV 428
             ++PI+L I+AF+CT GAIAL+L HIYKHLLNYTEP YQR+IVRIVFMVPVYALMSFL+LV
Sbjct:     4 LLPIYLIILAFLCTVGAIALALFHIYKHLLNYTEPIYQRYIVRIVFMVPVYALMSFLALV 63

Query:   429 LPQGSIYFNSIREIYEAWVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLXNCCLPPL 608
             LP+ SIYFNSIRE+YEAWVIYNFLSLCL WVGGPGS ++SLTGR LKPSW L  CC+PPL
Sbjct:    64 LPKSSIYFNSIREVYEAWVIYNFLSLCLAWVGGPGSVVISLTGRSLKPSWHLMTCCIPPL 123

Query:   609 ALDGGFIR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSLYFFLTQ 788
              LDG FIR CKQGCLQFV  +  LV VTL  YAKGKYKDGNF      +    +Y     
Sbjct:   124 PLDGRFIRRCKQGCLQFVILKPILVAVTLVLYAKGKYKDGNFSPDQSYLYLTIIYTISYT 183

Query:   789 WXXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVL 914
                 AL+ F +ACK +    +  PKF   K VVF TY  GVL
Sbjct:   184 VALYALVLFYVACKDLLQPFNPVPKFVIIKSVVFLTYWQGVL 225


to_Entrezto_Relatedto_Related >gi|6714407|gb|AAF26096.1|AC012393_22  (AC012393) unknown protein [Arabidopsis
            thaliana]
            Length = 372

Frame  3 hits (HSPs):   ________________________                          
                        __________________________________________________
Database sequence:     |                    |                   |         | 372
                       0                  150                 300

  Plus Strand HSPs:

 Score = 316 (111.2 bits), Expect = 3.4e-46, Sum P(2) = 3.4e-46
 Identities = 59/74 (79%), Positives = 69/74 (93%), Frame = +3

Query:   249 MVPIFLYIVAFICTCGAIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLV 428
             ++PI+L I+AF+CT GAIAL+L HIYKHLLNYTEP YQR+IVRIVFMVPVYALMSFL+LV
Sbjct:     4 LLPIYLIILAFLCTVGAIALALFHIYKHLLNYTEPIYQRYIVRIVFMVPVYALMSFLALV 63

Query:   429 LPQGSIYFNSIREI 470
             LP+ SIYFNSIRE+
Sbjct:    64 LPKSSIYFNSIREV 77

 Score = 194 (68.3 bits), Expect = 3.4e-46, Sum P(2) = 3.4e-46
 Identities = 47/97 (48%), Positives = 54/97 (55%), Frame = +3

Query:   624 FIR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSLYFFLTQWXXKA 803
             FIR CKQGCLQFV  +  LV VTL  YAKGKYKDGNF      +    +Y         A
Sbjct:    79 FIRRCKQGCLQFVILKPILVAVTLVLYAKGKYKDGNFSPDQSYLYLTIIYTISYTVALYA 138

Query:   804 LLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVL 914
             L+ F +ACK +    +  PKF   K VVF TY  GVL
Sbjct:   139 LVLFYVACKDLLQPFNPVPKFVIIKSVVFLTYWQGVL 175


to_Entrezto_Relatedto_Related >gi|7485938|pir||T01201  hypothetical protein F21E10.13 - Arabidopsis thaliana
            >gi|3047085|gb|AAC13598.1| (AF058914) F21E10.13 gene product
            [Arabidopsis thaliana]
            Length = 396

Frame  3 hits (HSPs):   _________________________                         
                        __________________________________________________
Database sequence:     |                  |                  |            | 396
                       0                150                300

  Plus Strand HSPs:

 Score = 404 (142.2 bits), Expect = 9.3e-37, P = 9.3e-37
 Identities = 83/145 (57%), Positives = 93/145 (64%), Frame = +3

Query:   480 WVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLXNCCLPPLALDGGFIR*CKQGCLQF 659
             WVIYNFLSLCL WVGGPGS +LSL+GR LKPSW L  CC PPL LDG FIR CKQGCLQF
Sbjct:    55 WVIYNFLSLCLAWVGGPGSVVLSLSGRSLKPSWSLMTCCFPPLTLDGRFIRRCKQGCLQF 114

Query:   660 VNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSLYFFLTQWXXKALLXFXLACKXIC 839
             V  +  LV VTL  YAKGKYKDGNF      +    +Y         AL+ F +AC+ + 
Sbjct:   115 VILKPILVAVTLVLYAKGKYKDGNFNPDQAYLYLTIIYTISYTVALYALVLFYMACRDLL 174

Query:   840 FNNSSXPKFXXXKIVVFPTYGXGVL 914
                +  PKF   K VVF TY  GVL
Sbjct:   175 QPFNPVPKFVIIKSVVFLTYWQGVL 199

 Score = 232 (81.7 bits), Expect = 3.8e-18, P = 3.8e-18
 Identities = 47/84 (55%), Positives = 63/84 (75%), Frame = +3

Query:   249 MVPIFLYIVAFICTCGAIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPV-YALMSFLSL 425
             ++P +L IVAF+CT GAIAL++ HIY+HLLNYTEPTYQR+IVRI+FMVPV + + +FLSL
Sbjct:     4 LIPFYLNIVAFLCTVGAIALAIFHIYRHLLNYTEPTYQRYIVRIIFMVPVTWVIYNFLSL 63

Query:   426 VLP----QGSIYFN-SIREIYEAW 482
              L      GS+  + S R +  +W
Sbjct:    64 CLAWVGGPGSVVLSLSGRSLKPSW 87


to_Entrezto_Related >gi|7110635  ref|NP_036396.1| chromosome 22 open reading frame 5
            >gi|5596705|emb|CAB51403.1| (AL096879) hypothetical protein [Homo
            sapiens]
            Length = 373

Frame  3 hits (HSPs):   ________________________________                  
                        __________________________________________________
Database sequence:     |                   |                    |         | 373
                       0                 150                  300

  Plus Strand HSPs:

 Score = 335 (117.9 bits), Expect = 1.9e-29, P = 1.9e-29
 Identities = 83/231 (35%), Positives = 120/231 (51%), Frame = +3

Query:   255 PIFLYIVAFICTCG-----AIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFL 419
             P+FL   A     G     A+ ++   IY HL  Y+ P  QR+IVRI+F+VP+YA  S+L
Sbjct:     4 PVFLMTTAAQAISGFFVWTALLITCHQIYMHLRCYSCPNEQRYIVRILFIVPIYAFDSWL 63

Query:   420 SLVL---PQGSIYFNSIREIYEAWVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLXN 590
             SL+     Q  +YF ++R+ YEA VIYNFLSLC E++GG  S +  + G+ ++ S     
Sbjct:    64 SLLFFTNDQYYVYFGTVRDCYEALVIYNFLSLCYEYLGGESSIMSEIRGKPIESSCMYGT 123

Query:   591 CCLPPLALDGGFIR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSL 770
             CCL       GF+R CKQ  LQF   +  + V T+   A GKY+DG+F      +    +
Sbjct:   124 CCLWGKTYSIGFLRFCKQATLQFCVVKPLMAVSTVVLQAFGKYRDGDFDVTSGYLYVTII 183

Query:   771 YFFLTQWXXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVLFPL 923
             Y         AL  F  A + +    S   KF   K V+F ++  G+L  +
Sbjct:   184 YNISVSLALYALFLFYFATRELLSPYSPVLKFFMVKSVIFLSFWQGMLLAI 234


to_Entrezto_Relatedto_Related >gi|6572254|emb|CAB63066.1|  (AL020993) dJ5O6.2 (novel protein similar to C.
            elegans F40E10.6 (isoform 1)) [Homo sapiens]
            Length = 293

Frame  3 hits (HSPs):   ________________________________________          
                        __________________________________________________
Database sequence:     |        |       |        |       |        |       | 293
                       0       50     100      150     200      250

  Plus Strand HSPs:

 Score = 335 (117.9 bits), Expect = 1.9e-29, P = 1.9e-29
 Identities = 83/231 (35%), Positives = 120/231 (51%), Frame = +3

Query:   255 PIFLYIVAFICTCG-----AIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFL 419
             P+FL   A     G     A+ ++   IY HL  Y+ P  QR+IVRI+F+VP+YA  S+L
Sbjct:     4 PVFLMTTAAQAISGFFVWTALLITCHQIYMHLRCYSCPNEQRYIVRILFIVPIYAFDSWL 63

Query:   420 SLVL---PQGSIYFNSIREIYEAWVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLXN 590
             SL+     Q  +YF ++R+ YEA VIYNFLSLC E++GG  S +  + G+ ++ S     
Sbjct:    64 SLLFFTNDQYYVYFGTVRDCYEALVIYNFLSLCYEYLGGESSIMSEIRGKPIESSCMYGT 123

Query:   591 CCLPPLALDGGFIR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSL 770
             CCL       GF+R CKQ  LQF   +  + V T+   A GKY+DG+F      +    +
Sbjct:   124 CCLWGKTYSIGFLRFCKQATLQFCVVKPLMAVSTVVLQAFGKYRDGDFDVTSGYLYVTII 183

Query:   771 YFFLTQWXXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVLFPL 923
             Y         AL  F  A + +    S   KF   K V+F ++  G+L  +
Sbjct:   184 YNISVSLALYALFLFYFATRELLSPYSPVLKFFMVKSVIFLSFWQGMLLAI 234


to_Entrezto_Relatedto_Related >gi|7503052|pir||T21697  hypothetical protein F40E10.6 - Caenorhabditis elegans
            >gi|3876602|emb|CAA93657.1| (Z69790) cDNA EST yk376g11.5 comes from
            this gene~cDNA EST yk442f1.5 comes from this gene~cDNA EST
            yk455h10.5 comes from this gene~cDNA EST yk457h6.5 comes from this
            gene~cDNA EST yk464d8.5 comes from this gene [Caenorhabditis
            elegans] >gi|3877015|emb|CAA93669.1| (Z69792) cDNA EST yk376g11.5
            comes from this gene~cDNA EST yk442f1.5 comes from this gene~cDNA
            EST yk455h10.5 comes from this gene~cDNA EST yk457h6.5 comes from
            this gene~cDNA EST yk464d8.5 comes from this gene [Caenorhabditis
            elegans]
            Length = 595

Frame  3 hits (HSPs):                     ___________________             
                        __________________________________________________
Database sequence:     |            |            |           |            | 595
                       0          150          300         450

  Plus Strand HSPs:

 Score = 340 (119.7 bits), Expect = 2.3e-29, P = 2.3e-29
 Identities = 82/211 (38%), Positives = 120/211 (56%), Frame = +3

Query:   300 IALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQGS--IYFNSIREIY 473
             I LS L IY+HL  Y+ P  QR+IVRI+F+VP+YA  S+LSL+    +  IYFNSIR+ Y
Sbjct:   226 IQLSKLQIYQHLRFYSCPAEQRWIVRILFIVPIYAFDSWLSLIFFSDNVYIYFNSIRDCY 285

Query:   474 EAWVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLX-NCCLPPLALDGGFIR*CKQGC 650
             EA+VIY+FLSLC E++GG  + +  + G+ ++P+ +L   CCL        F+R CKQ  
Sbjct:   286 EAFVIYSFLSLCYEYLGGESNIMAEIRGKPIRPTNYLTCTCCLAGKQYTIEFLRFCKQAT 345

Query:   651 LQFVNFETHLVVVTLYFYAKGKYKDGNFF--QXIIIVS*QSLYFFLTQWXXKALLXFXLA 824
             LQF   +  + V+TL   A GKY+DGN+   Q  I ++   +Y          +  F  A
Sbjct:   346 LQFCFIKPIMAVITLMLTAIGKYEDGNWSLDQGYIYIT--LVYNVSISLALYGMFLFYAA 403

Query:   825 CKXICFNNSSXPKFXXXKIVVFPTYGXGVLFPL 923
              + +        KF   K V+F ++  G L  +
Sbjct:   404 TRDLLSPYRPVLKFLTVKSVIFLSFWQGFLIAI 436


to_Entrezto_Relatedto_Related >gi|7292104|gb|AAF47516.1|  (AE003472) CG12004 gene product [Drosophila
            melanogaster]
            Length = 348

Frame  3 hits (HSPs):      ________________________________               
                        __________________________________________________
Database sequence:     |                     |                    |       | 348
                       0                   150                  300

  Plus Strand HSPs:

 Score = 331 (116.5 bits), Expect = 5.1e-29, P = 5.1e-29
 Identities = 75/217 (34%), Positives = 115/217 (52%), Frame = +3

Query:   270 IVAFICTCGAIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQGS-- 443
             ++A +C   A+ ++   IY+HL  YT P  QR+IVRI+F+VP+YA  S++SL+       
Sbjct:    24 VLAGVCVWAALFITCQQIYQHLRWYTNPQEQRWIVRILFIVPIYATYSWISLLFFNSDNV 83

Query:   444 -IYFNSIREIYEAWVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLXNCCLPPLALDG 620
              IYF ++R+ YEA+VIYNFLSLC E++GG G+ +  + G+ +K S     CCL       
Sbjct:    84 YIYFFTVRDCYEAFVIYNFLSLCYEYLGGEGNIMSEIRGKPIKTSCLYGTCCLKGKTYTI 143

Query:   621 GFIR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSLYFFLTQWXXK 800
             GF+R CKQ  LQF   +  +  + ++  A G Y DG++      +    +Y         
Sbjct:   144 GFLRFCKQATLQFCLVKPLVAFIIIFLQAFGHYHDGDWSADGGYIYITIIYNISVSLALY 203

Query:   801 ALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGV 911
              L  F  A + +        KF   K V+F ++  GV
Sbjct:   204 GLYLFYFATRDLLTPFEPVLKFCTIKSVIFLSFWQGV 240


to_Entrezto_Relatedto_Related >gi|6572253|emb|CAB63065.1|  (AL020993) dJ5O6.2 (novel protein similar to C.
            elegans F40E10.6 (isoform 2)) [Homo sapiens]
            Length = 261

Frame  3 hits (HSPs):   _______________________________________           
                        __________________________________________________
Database sequence:     |         |        |         |         |        |  | 261
                       0        50      100       150       200      250

  Plus Strand HSPs:

 Score = 317 (111.6 bits), Expect = 1.5e-27, P = 1.5e-27
 Identities = 75/201 (37%), Positives = 108/201 (53%), Frame = +3

Query:   330 HLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVL---PQGSIYFNSIREIYEAWVIYNFL 500
             HL  Y+ P  QR+IVRI+F+VP+YA  S+LSL+     Q  +YF ++R+ YEA VIYNFL
Sbjct:     2 HLRCYSCPNEQRYIVRILFIVPIYAFDSWLSLLFFTNDQYYVYFGTVRDCYEALVIYNFL 61

Query:   501 SLCLEWVGGPGSXILSLTGRVLKPSWFLXNCCLPPLALDGGFIR*CKQGCLQFVNFETHL 680
             SLC E++GG  S +  + G+ ++ S     CCL       GF+R CKQ  LQF   +  +
Sbjct:    62 SLCYEYLGGESSIMSEIRGKPIESSCMYGTCCLWGKTYSIGFLRFCKQATLQFCVVKPLM 121

Query:   681 VVVTLYFYAKGKYKDGNFFQXIIIVS*QSLYFFLTQWXXKALLXFXLACKXICFNNSSXP 860
              V T+   A GKY+DG+F      +    +Y         AL  F  A + +    S   
Sbjct:   122 AVSTVVLQAFGKYRDGDFDVTSGYLYVTIIYNISVSLALYALFLFYFATRELLSPYSPVL 181

Query:   861 KFXXXKIVVFPTYGXGVLFPL 923
             KF   K V+F ++  G+L  +
Sbjct:   182 KFFMVKSVIFLSFWQGMLLAI 202


to_Entrezto_Relatedto_Related >gi|7297586|gb|AAF52840.1|  (AE003626) CG5850 gene product [Drosophila
            melanogaster]
            Length = 569

Frame  3 hits (HSPs):       ____________________                          
                        __________________________________________________
Database sequence:     |             |            |            |          | 569
                       0           150          300          450

  Plus Strand HSPs:

 Score = 275 (96.8 bits), Expect = 2.6e-22, P = 2.6e-22
 Identities = 69/217 (31%), Positives = 117/217 (53%), Frame = +3

Query:   264 LYIVAFICTCGAIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQGS 443
             L ++  +    A+ +S+ HI +H++++T+P  Q+ I+RI++MVP+YAL +++ L  P+ S
Sbjct:    51 LILIGGLFVLSAVPVSIWHIIQHVIHFTKPILQKHIIRILWMVPIYALNAWIGLFFPKHS 110

Query:   444 IYFNSIREIYEAWVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLXNCCLPPLALDGG 623
             IY +S+RE YEA+VIYNF+   L ++        ++  +   P +F   CC+ P  +   
Sbjct:   111 IYVDSLRECYEAYVIYNFMVYLLNYLNLGMDLEATMEYKPQVPHFFPL-CCMRPWVMGRE 169

Query:   624 FIR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNF-----FQXIIIVS*QSLYFFLTQ 788
             FI  CK G LQ+         +++     G Y +G F     F  I++V+  ++  F+  
Sbjct:   170 FIHNCKHGILQYTVVRPITTFISVICELCGVYGEGEFAGNVAFPYIVVVN--NISQFVAM 227

Query:   789 WXXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVL 914
             +    L+ F  A K         PKF   K VVF ++  GVL
Sbjct:   228 Y---CLVLFYRANKEDLKPMKPIPKFLCIKAVVFFSFFQGVL 266


to_Entrezto_Relatedto_Related >gi|7023136|dbj|BAA91851.1|  (AK001708) unnamed protein product [Homo sapiens]
            Length = 438

Frame  3 hits (HSPs):        __________________________                   
                        __________________________________________________
Database sequence:     |                 |                |               | 438
                       0               150              300

  Plus Strand HSPs:

 Score = 242 (85.2 bits), Expect = 4.5e-19, P = 4.5e-19
 Identities = 67/219 (30%), Positives = 110/219 (50%), Frame = +3

Query:   267 YIVAFICTCGAIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQGSI 446
             + +A I     I +SL  I +HL++YT+P  Q+ I+RI++MVP+Y+L S+++L  P  +I
Sbjct:    49 WFIAGIFLLLTIPISLWVILQHLVHYTQPELQKPIIRILWMVPIYSLDSWIALKYPGIAI 108

Query:   447 YFNSIREIYEAWVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLXNCCLPPLALDGGF 626
             Y ++ RE YEA+VIYNF+     ++      ++ +     +   F   CC PP A+    
Sbjct:   109 YVDTCRECYEAYVIYNFMGFLTNYLTNRYPNLVLILEAKDQQKHFPPLCCCPPWAMGEVL 168

Query:   627 IR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNF-----FQXIIIVS*QSLYFFLTQW 791
             +  CK G LQ+        +V L     G Y +GNF     +  ++I++  S  F +   
Sbjct:   169 LFRCKLGVLQYTVVRPFTTIVALICELLGIYDEGNFSFSNAWTYLVIINNMSQLFAMY-- 226

Query:   792 XXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVLFPL 923
                 LL F    K          KF   ++VVF ++   V+  L
Sbjct:   227 ---CLLLFYKVLKEELSPIQPVGKFLCVRLVVFVSFWQAVVIAL 267


to_Entrezto_Relatedto_Related >gi|1351659|sp|Q09906|YAJ6_SCHPO  HYPOTHETICAL 49.3 KD PROTEIN C30D11.06C IN
            CHROMOSOME I >gi|2130405|pir||S62564 hypothetical protein
            SPAC30D11.06c - fission yeast (Schizosaccharomyces pombe)
            >gi|7491068|pir||T38593 hypothetical protein SPAC30D11.06c -
            fission yeast (Schizosaccharomyces pombe)
            >gi|1065893|emb|CAA91892.1| (Z67961) hypothetical protein
            [Schizosaccharomyces pombe]
            Length = 426

Frame  3 hits (HSPs):    _________________________                        
Annotated Domains:      __________________________________________________
                        __________________________________________________
Database sequence:     |                 |                 |              | 426
                       0               150               300
__________________

Annotated Domains:
   DOMO                 DM06417:                                 1..340
   DOMO                 DM08523:                                 341..381
   Entrez               Transmembrane region: POTENTIAL.         39..59
   Entrez               Transmembrane region: POTENTIAL.         73..93
   Entrez               Transmembrane region: POTENTIAL.         133..153
   Entrez               Transmembrane region: POTENTIAL.         172..192
   Entrez               Transmembrane region: POTENTIAL.         223..243
   PRODOM               PD014035:                                4..272
   PRODOM               PD128189: YAJ6_SCHPO                     274..425
__________________


  Plus Strand HSPs:

 Score = 217 (76.4 bits), Expect = 2.4e-16, P = 2.4e-16
 Identities = 62/203 (30%), Positives = 101/203 (49%), Frame = +3

Query:   297 AIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQ-GSIYFNSIREIY 473
             A+ LS + I  HL NY +P  QR +VRI+ M+ +Y+ +SFLS+   + GSI F   REIY
Sbjct:    16 ALVLSCISIITHLKNYKKPVLQRSVVRILMMIVIYSSVSFLSVYNEKIGSI-FEPFREIY 74

Query:   474 EAWVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLXNCCLPPLALDGGF-IR*CKQGC 650
             EA+ +Y F  L ++++GG  + ++SL G + +P  +  N     + L   +     K+G 
Sbjct:    75 EAFALYCFFCLLIDYLGGERAAVISLHGHLPRPRLWPLNYLQDDIDLSDPYTFLSIKRGI 134

Query:   651 LQFVNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSLYFFLTQWXXKALLXFXLACK 830
             LQ+   +  LV+  L     G Y   +  Q +   +   L+  L       L  + L   
Sbjct:   135 LQYTWLKPFLVIAVLLTKVTGVYDRED--QPVYASA--DLWIGLVYNISITLSLYSLTTF 190

Query:   831 XICFNNSSXP-----KFXXXKIVVFPTY 899
              +C +    P     KF   K ++F +Y
Sbjct:   191 WVCLHEELAPFRPFPKFLSVKAIIFASY 218


to_Entrezto_Related >gi|6322904  ref|NP_012977.1| Ykr051wp >gi|549619|sp|P36142|YK31_YEAST
            HYPOTHETICAL 48.8 KD PROTEIN IN TRK2-MRS4 INTERGENIC REGION
            >gi|539262|pir||S38125 hypothetical protein YKR051w - yeast
            (Saccharomyces cerevisiae) >gi|486505|emb|CAA82129.1| (Z28276) ORF
            YKR051w [Saccharomyces cerevisiae]
            Length = 418

Frame  3 hits (HSPs):   ___________________________                       
                        __________________________________________________
Database sequence:     |                 |                 |              | 418
                       0               150               300

  Plus Strand HSPs:

 Score = 168 (59.1 bits), Expect = 3.6e-09, P = 3.6e-09
 Identities = 62/223 (27%), Positives = 100/223 (44%), Frame = +3

Query:   246 KMVPIFLYIVAFICTCGAIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSL 425
             K++  +LY      +  A  +S   I +HLLNY +P  QR  +RI+ +VP++++     +
Sbjct:     4 KLLCWWLYWPCVYSSIIATIISFYTITRHLLNYRKPYEQRLSIRILLLVPIFSVSCASGI 63

Query:   426 VLPQGS-IYFNSIREIYEAWVIYNFLSLCLEWVGGPGSXI--LSLTGRVLK-PSWFLXNC 593
             + P+ +  Y + IRE YEA+VIY F +     +GG  + I  LSL     + P   +   
Sbjct:    64 IKPEAAQFYVDPIREFYEAFVIYTFFTFLTLLLGGERNIITVLSLNHAPTRHPIPLIGKI 123

Query:   594 CLPPLALDGGF-IR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSL 770
             C  P+ L   F     K+G LQ+V F+      TL   A    K    F+  + V     
Sbjct:   124 C-KPIDLSDPFDFLFVKKGILQYVWFKPFYCFGTLICSAWKLPK----FEIFLNV----F 174

Query:   771 YFFLTQWXXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVL 914
             Y     W   +L  F               KF   K+++F +Y   ++
Sbjct:   175 YNISVTWSLYSLALFWKCLYPELTPYKPWLKFLCVKLIIFASYWQSII 222


to_Entrezto_Relatedto_Related >gi|7487004|pir||T00451  hypothetical protein T14N5.8 - Arabidopsis thaliana
            >gi|3540198|gb|AAC34348.1| (AC004260) Unknown protein [Arabidopsis
            thaliana]
            Length = 500

Frame  3 hits (HSPs):        ________           _____                     
                        __________________________________________________
Database sequence:     |              |              |              |     | 500
                       0            150            300            450

  Plus Strand HSPs:

 Score = 153 (53.9 bits), Expect = 2.1e-07, Sum P(2) = 2.1e-07
 Identities = 32/76 (42%), Positives = 49/76 (64%), Frame = +3

Query:   297 AIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQGSIYFNSIREIYE 476
             AI L +  I++HL +Y +P  Q+F++ ++ MVPVYA+ SFLSLV  + +     IR+ YE
Sbjct:    53 AILLPMYLIFEHLASYNQPEEQKFLIGLILMVPVYAVESFLSLVNSEAAFNCEVIRDCYE 112

Query:   477 AWVIYNF---LSLCLE 515
             A+ +Y F   L  CL+
Sbjct:   113 AFALYCFERYLIACLD 128

 Score = 40 (14.1 bits), Expect = 2.1e-07, Sum P(2) = 2.1e-07
 Identities = 11/42 (26%), Positives = 16/42 (38%), Frame = +3

Query:   789 WXXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVL 914
             W    L+ F    K          KF   K +VF T+  G++
Sbjct:   249 WALYCLVQFYNVIKDKLAPIKPLAKFLTFKSIVFLTWWQGII 290


to_Entrezto_Relatedto_Related >gi|7485716|pir||T05165  hypothetical protein F18E5.190 - Arabidopsis thaliana
            >gi|3080401|emb|CAA18721.1| (AL022603) putative protein
            [Arabidopsis thaliana] >gi|4455265|emb|CAB36801.1| (AL035527)
            putative protein [Arabidopsis thaliana] >gi|7268954|emb|CAB81264.1|
            (AL161555) putative protein [Arabidopsis thaliana]
            Length = 294

Frame  3 hits (HSPs):    ______________________________________           
                        __________________________________________________
Database sequence:     |        |       |        |       |        |       | 294
                       0       50     100      150     200      250

  Plus Strand HSPs:

 Score = 140 (49.3 bits), Expect = 2.9e-06, P = 2.9e-06
 Identities = 59/218 (27%), Positives = 102/218 (46%), Frame = +3

Query:   273 VAFICTCGAIALSLLH-----IYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQ 437
             + F C+  ++ L+L H     + +HL ++  P  Q+ I+ IV M P+YA++SF+ L+  +
Sbjct:    12 ITFYCSAFSVLLTL-HFTIQLVSQHLFHWKNPKEQKAILIIVLMAPIYAVVSFIGLLEVK 70

Query:   438 GS----IYFNSIREIYEAWVIYNFLSLCLEWVG-GPGSXIL--SLTGRVLKPSWFLXNCC 596
             GS    ++  SI+E YEA VI  FL+L   ++       IL   + GR +  S F     
Sbjct:    71 GSETFFLFLESIKECYEALVIAKFLALMYSYLNISMSKNILPDGIKGREIHHS-FPMTLF 129

Query:   597 LPPLA-LDGGFIR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSLY 773
              P +  LD   ++  K    QFV        + +     G Y     +   IIV+     
Sbjct:   130 QPHVVRLDRHTLKLLKYWTWQFVVIRPVCSTLMIALQLIGFYPSWLSWTFTIIVN----- 184

Query:   774 FFLTQWXXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGV 911
             F ++      ++ + +  K +  +N    KF   K +VF  +  G+
Sbjct:   185 FSVSLALYSLVIFYHVFAKELAPHNPLA-KFLCIKGIVFFVFWQGI 229


to_Entrezto_Relatedto_Related >gi|7496248|pir||T15541  hypothetical protein C18A3.4 - Caenorhabditis elegans
            >gi|861347|gb|AAA68368.1| (U28944) C18A3.4 gene product
            [Caenorhabditis elegans]
            Length = 342

Frame  3 hits (HSPs):         _______________________                     
                        __________________________________________________
Database sequence:     |                     |                     |      | 342
                       0                   150                   300

  Plus Strand HSPs:

 Score = 106 (37.3 bits), Expect = 0.032, P = 0.031
 Identities = 36/150 (24%), Positives = 70/150 (46%), Frame = +3

Query:   273 VAFICTCGAIALSLLH-IYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQGSIY 449
             VA   T G + L++LH IY H    T  + +  IV +    P+ +L++ +++ +P+    
Sbjct:    48 VATAVTVGTVCLAVLHLIYIHFY-ITHSSRRLHIVLLACTAPLVSLLALVAMYMPRVWFL 106

Query:   450 FNSIREIYEAWVIYNFLSLCLEWVGGPGSXILSLTGR-----VLKPSWFLXNCCLPPLAL 614
              + +  +Y ++ ++  + L L    G  + +  +  R     +  P +     CLP + L
Sbjct:   107 SHLLSFLYFSFALWVIICLLLHIFDGHHALVTKMMQRLQYVEIATPPFCCLFPCLPKVRL 166

Query:   615 DGGFIR*CKQGCLQ--FVNFETHLVVVTLYF 701
             +G  IR C+   +Q   V     LV + +YF
Sbjct:   167 EGKKIRWCELMVMQAPIVRLFATLVSLVIYF 197


to_Entrezto_Relatedto_Related >gi|7485951|pir||T05664  hypothetical protein F22I13.130 - Arabidopsis thaliana
            >gi|4539344|emb|CAB37492.1| (AL035539) putative protein
            [Arabidopsis thaliana] >gi|7270820|emb|CAB80501.1| (AL161593)
            putative protein [Arabidopsis thaliana]
            Length = 466

Frame  3 hits (HSPs):      _______________________                        
                        __________________________________________________
Database sequence:     |               |                |               | | 466
                       0             150              300             450

  Plus Strand HSPs:

 Score = 89 (31.3 bits), Expect = 1.8, Sum P(2) = 0.83
 Identities = 50/199 (25%), Positives = 74/199 (37%), Frame = +3

Query:   378 IVF-MVPVYALMSFLSLVLPQGSIYFNSIREIYEAWVIYNFLSLCLEWVGGPGSXI--LS 548
             +VF  +  Y    F SLV P  S+    +R+ YE++ +Y F    +  +GG    I  + 
Sbjct:    38 LVFDHLSTYKNPEFASLVKPSISVDCGILRDCYESFAMYCFGRYLVACIGGEERTIEFME 97

Query:   549 LTGRV-LKPSW-------------FLXNCCLPPLALDGGFIR*CKQGCLQFVNFETHLVV 686
               GR   K                F  N  L P  L   F +  K G +Q++  ++   +
Sbjct:    98 RQGRKSFKTPLLDHKDEKGIIKHPFPMNLFLKPWRLSPWFYQVVKFGIVQYMIIKSLTAL 157

Query:   687 VTLYFYAKGKYKDGNFFQXIIIVS*QSLYFFLTQWXXKALLXFXLACKXICFNNSSXPKF 866
               L   A G Y +G F           +  F   W    L+ F  A K    +     KF
Sbjct:   158 TALILEAFGVYCEGEFKWGCGYPYLAVVLNFSQSWALYCLVQFYGATKDELAHIQPLAKF 217

Query:   867 XXXKIVVFPTYGXGVLFPL 923
                K +VF T+  GV   L
Sbjct:   218 LTFKSIVFLTWWQGVAIAL 236

 Score = 43 (15.1 bits), Expect = 1.8, Sum P(2) = 0.83
 Identities = 7/24 (29%), Positives = 13/24 (54%), Frame = +3

Query:   300 IALSLLHIYKHLLNYTEPTYQRFI 371
             ++LSL  ++ HL  Y  P +   +
Sbjct:    32 LSLSLFLVFDHLSTYKNPEFASLV 55


to_Entrezto_Relatedto_Related >gi|7020081|dbj|BAA90988.1|  (AK000169) unnamed protein product [Homo sapiens]
            Length = 313

Frame  3 hits (HSPs):       ___________________                           
                        __________________________________________________
Database sequence:     |       |       |       |       |       |       |  | 313
                       0      50     100     150     200     250     300

  Plus Strand HSPs:

 Score = 89 (31.3 bits), Expect = 2.2, P = 0.89
 Identities = 32/111 (28%), Positives = 47/111 (42%), Frame = +3

Query:   591 CCLPPLALDGGFIR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNF-----FQXIIIV 755
             CC PP A+    +  CK G LQ+        +V L     G Y +GNF     +  ++I+
Sbjct:    32 CCCPPWAMGEVLLFRCKLGVLQYTVVRPFTTIVALICELLGIYDEGNFSFSNAWTYLVII 91

Query:   756 S*QSLYFFLTQWXXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVLFPL 923
             +  S  F +       LL F    K          KF   K+VVF ++   V+  L
Sbjct:    92 NNMSQLFAMY-----CLLPFYKVLKEELSPIQPVGKFLCVKLVVFVSFWQAVVIAL 142


to_Entrezto_Relatedto_Related >gi|7707666|dbj|BAA95343.1|  (AB027560) ATPase subunit 6 [Echinococcus vogeli]
            Length = 170

Frame  3 hits (HSPs):      _________________________________________      
                        __________________________________________________
Database sequence:     |              |              |             |      | 170
                       0             50            100           150

  Plus Strand HSPs:

 Score = 83 (29.2 bits), Expect = 3.0, P = 0.95
 Identities = 42/147 (28%), Positives = 69/147 (46%), Frame = +3

Query:   264 LYIVAFICTCGAIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQGS 443
             +Y++ F C      + L+ +    L Y  P    F +  VF+V V  +M F+SL L +  
Sbjct:    14 IYVLVFNCVSYYYFVLLVFVLMWFLIYRLPYCYSFYLFSVFLVGVVFVM-FVSLFLCR-- 70

Query:   444 IYFNSIREIYEAWV-IYNFLSLC-LEWVGGPGSXILSLTGRVLKPSWFLXNCCLPPLALD 617
               F+S+   + ++V +   L +C L  V    S I+     +L+P   +   CL  +AL 
Sbjct:    71 -VFSSVSSFFASFVPLGTPLYICFLVCVAETISYIIRPVVLILRPFINISLGCLGAVAL- 128

Query:   618 GGFIR*CKQGCLQFVNFETHLVVVTLYFY 704
                      G L FV++   LV+V L+FY
Sbjct:   129 ---------GNLCFVSWWWSLVLVGLFFY 148


to_Entrezto_Relatedto_Related >gi|625591|pir||D61399  hypothetical early protein E8 - bovine papillomavirus
            type 3
            Length = 75

Frame  3 hits (HSPs):               _________________________             
Annotated Domains:                            ____________________________
                        __________________________________________________
Database sequence:     |            |             |            |          | 75
                       0           20            40           60
__________________

Annotated Domains:
   DOMO                 DM04650:                                 34..75
__________________


  Plus Strand HSPs:

 Score = 66 (23.2 bits), Expect = 3.8, P = 0.98
 Identities = 19/37 (51%), Positives = 22/37 (59%), Frame = +3

Query:   411 SFLSLVLPQGSIYFNSIREIYEA---WVIYNFLSLCL 512
             S LSL  P GSI   S+  IY     WV ++FLSLCL
Sbjct:    20 SSLSLHGPLGSICIMSLTLIYWLLLLWVSFHFLSLCL 56


Parameters:
  filter=none
  matrix=BLOSUM62
  V=50
  B=50
  E=10
  gi
  H=1
  sort_by_pvalue
  echofilter

  ctxfactor=5.97

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   Std.    0   BLOSUM62                                 0.318   0.135   0.401  
   +3      0   BLOSUM62        0.318   0.135   0.401    0.340   0.153   0.510  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +2      0   BLOSUM62        0.318   0.135   0.401    0.347   0.152   0.573  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +1      0   BLOSUM62        0.318   0.135   0.401    0.346   0.152   0.527  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -1      0   BLOSUM62        0.318   0.135   0.401    0.352   0.154   0.565  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -2      0   BLOSUM62        0.318   0.135   0.401    0.361   0.165   0.583  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -3      0   BLOSUM62        0.318   0.135   0.401    0.353   0.157   0.550  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E    S W   T  X   E2     S2
   +3      0      309       295       10.  78 3  12 22  0.11    36
                                                    33  0.10    40
   +2      0      309       294       10.  78 3  12 22  0.10    36
                                                    33  0.10    40
   +1      0      310       296       10.  78 3  12 22  0.11    36
                                                    33  0.10    40
   -1      0      310       297       10.  78 3  12 22  0.11    36
                                                    33  0.10    40
   -2      0      309       296       10.  78 3  12 22  0.11    36
                                                    33  0.10    40
   -3      0      309       294       10.  78 3  12 22  0.10    36
                                                    33  0.10    40


Statistics:

  Database:  /usr/local/dot5/sl_home/beauty/seqdb/blast/nr
    Title:  nr
    Release date:  unknown
    Posted date:  8:50 PM CDT May 27, 2000
    Format:  BLAST
  # of letters in database:  158,518,215
  # of sequences in database:  505,245
  # of database sequences satisfying E:  19
  No. of states in DFA:  600 (59 KB)
  Total size of DFA:  295 KB (320 KB)
  Time to generate neighborhood:  0.02u 0.01s 0.03t  Elapsed: 00:00:00
  No. of threads or processors used:  4
  Search cpu time:  384.93u 1.46s 386.39t  Elapsed: 00:02:48
  Total cpu time:  385.02u 1.49s 386.51t  Elapsed: 00:02:48
  Start:  Mon Oct 16 19:22:23 2000   End:  Mon Oct 16 19:25:11 2000

Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000