Please help us to improve our services and obtain funding for the
BCM Search Launcher
-- take a minute to complete our User Survey


BLASTX+BEAUTY Search Results

WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.

BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.

BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract

Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract



RepeatMasker repeats found in sequence:

No Repeats Found.

Reference:  Gish, Warren (1994-1997).  unpublished.
Gish, Warren and David J. States (1993).  Identification of protein coding
regions by database similarity search.  Nat. Genet. 3:266-72.

Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.

Query= SSH8H02.SEQ(1>446) (417 letters)

  Translating both strands of query sequence in all 6 reading frames

Database: nr 505,245 sequences; 158,518,215 total letters.



     Observed Numbers of Database Sequences Satisfying
    Various EXPECTation Thresholds (E parameter values)

        Histogram units:      = 8 Sequences     : less than 8 sequences

 EXPECTation Threshold
 (E parameter)
    |
    V   Observed Counts-->
  10000 2264 452 |========================================================
   6310 1812 243 |==============================
   3980 1569 209 |==========================
   2510 1360 247 |==============================
   1580 1113 164 |====================
   1000  949 119 |==============
    631  830  59 |=======
    398  771  57 |=======
    251  714  35 |====
    158  679  31 |===
    100  648  16 |==
   63.1  632  12 |=
   39.8  620   6 |:
   25.1  614   8 |=
   15.8  606   4 |:
 >>>>>>>>>>>>>>>>>>>>>  Expect = 10.0, Observed = 602  <<<<<<<<<<<<<<<<<
   10.0  602   5 |:
   6.31  597   1 |:
   3.98  596   3 |:
   2.51  593   2 |:
   1.58  591   9 |=
   1.00  582   4 |:
   0.63  578   3 |:
   0.40  575   5 |:
   0.25  570   3 |:
   0.16  567   2 |:
   0.10  565   6 |:
  0.063  559   8 |=
  0.040  551   9 |=
  0.025  542   1 |:
  0.016  541   2 |:
  0.010  539   0 |
 0.0063  539   3 |:
 0.0040  536   3 |:
 0.0025  533   0 |
 0.0016  533   2 |:


                                                                     Smallest
                                                                       Sum
                                                     Reading  High  Probability
Sequences producing High-scoring Segment Pairs:        Frame Score  P(N)      N
gi|7435795|pir||T12040cysteine proteinase (EC 3.4.22.... -3   503  3.0e-47   1
gi|7242888|dbj|BAA92495.1|(AB038598) cysteine proteas... -3   492  4.4e-46   1
gi|7381221|gb|AAF61441.1|AF138265_1(AF138265) papain-... -3   487  1.5e-45   1
gi|7211741|gb|AAF40414.1|AF216783_1(AF216783) papain-... -3   486  1.9e-45   1
gi|7381219|gb|AAF61440.1|AF138264_1(AF138264) papain-... -3   486  1.9e-45   1
gi|5051468|emb|CAB44983.1|(AJ242994) putative preproc... -3   483  4.0e-45   1
gi|4757570|gb|AAD29084.1|AF082181_1(AF082181) cystein... -3   480  8.3e-45   1
gi|7211745|gb|AAF40416.1|AF216785_1(AF216785) papain-... -3   478  1.3e-44   1
gi|542004|pir||S42882cysteine proteinase (EC 3.4.22.-... -3   476  2.2e-44   1
gi|100203|pir||S24988cysteine proteinase (EC 3.4.22.-... -3   474  3.6e-44   1
gi|1401242|gb|AAB67878.1|(U59465) pre-pro-cysteine pr... -3   474  3.6e-44   1
gi|1172872|sp|P43296|RD19_ARATHCYSTEINE PROTEINASE RD... -3   472  5.8e-44   1
gi|1168251|sp|P43295|A494_ARATHPROBABLE CYSTEINE PROT... -3   469  1.2e-43   1
gi|4567274|gb|AAD23687.1|AC006841_20(AC006841) putati... -3   469  1.2e-43   1
gi|7435816|pir||T08844cysteine proteinase (EC 3.4.22.... -3   466  2.5e-43   1
gi|7435793|pir||T09528probable cysteine proteinase (E... -3   466  2.5e-43   1
gi|118150|sp|P25804|CYSP_PEACYSTEINE PROTEINASE 15A P... -3   463  5.2e-43   1
gi|7435803|pir||D71428cysteine proteinase (EC 3.4.22.... -3   463  5.2e-43   1
gi|419782|pir||S30150cysteine proteinase (EC 3.4.22.-... -3   461  8.5e-43   1
gi|7211743|gb|AAF40415.1|AF216784_1(AF216784) papain-... -3   460  1.1e-42   1
gi|419781|pir||S30149cysteine proteinase (EC 3.4.22.-... -3   457  2.3e-42   1
gi|1706260|sp|Q10716|CYS1_MAIZECYSTEINE PROTEINASE 1 ... -3   418  3.1e-38   1
gi|7435815|pir||T08845cysteine proteinase (EC 3.4.22.... -3   404  9.3e-37   1
gi|5679322|gb|AAD46920.1|AF167986_1(AF167986) putativ... -3   322  1.3e-31   2
gi|7435792|pir||T12042cysteine proteinase (EC 3.4.22.... -3   318  6.3e-30   2
gi|1362047|pir||S55923cysteine proteinase (EC 3.4.22.... -3   334  2.4e-29   1
gi|7435814|pir||T10949cysteine proteinase (EC 3.4.22.... -3   327  1.3e-28   1
gi|7435812|pir||T06726cysteine proteinase (EC 3.4.22.... -3   323  3.6e-28   1
gi|1353726|gb|AAB01769.1|(U42758) cysteine proteinase... -3   300  9.8e-26   1
gi|118117|sp|P04988|CYS1_DICDICYSTEINE PROTEINASE 1 P... -3   290  1.1e-24   1
gi|118124|sp|P25250|CYS2_HORVUCYSTEINE PROTEINASE EP-... -3   253  9.5e-24   2
gi|118120|sp|P25249|CYS1_HORVUCYSTEINE PROTEINASE EP-... -3   251  1.5e-23   2
gi|537437|gb|AAC35211.1|(U12637) cysteine proteinase ... -3   266  3.9e-22   1
gi|1834307|dbj|BAA09820.1|(D63670) cysteine proteinas... -3   265  5.0e-22   1
gi|1085124|pir||JX0366cysteine endopeptidase (EC 3.4.... -3   262  1.0e-21   1
gi|7435804|pir||T03694cysteine proteinase (EC 3.4.22.... -3   260  1.7e-21   1
gi|4426617|gb|AAD20453.1|(AF099203) cysteine endopept... -3   259  2.2e-21   1
gi|5761329|dbj|BAA83473.1|(AB004819) cysteine endopep... -3   258  2.8e-21   1
gi|415567|gb|AAB28289.1|cysteine proteinase=39 kda ac... -3   257  3.5e-21   1
gi|2118132|pir||JC4848cysteine proteinase (EC 3.4.22.... -3   260  5.1e-21   1
gi|2160175|gb|AAB60738.1|(AC000132) Strong similarity... -3   258  5.5e-21   1
gi|5777611|emb|CAB53397.1|(AJ245868) cysteine proteas... -3   255  5.7e-21   1
gi|1079183|pir||A53810cathepsin L (EC 3.4.22.15) prec... -3   254  7.3e-21   1
gi|399190|sp|Q02765|CATS_RATCATHEPSIN S PRECURSOR >gi... -3   253  9.4e-21   1
gi|1706263|sp|P54640|CYS5_DICDICYSTEINE PROTEINASE 5 ... -3   252  1.2e-20   1
gi|600111|emb|CAA84378.1|(Z34895) cysteine proteinase... -3   251  1.5e-20   1
gi|1076552|pir||S49166cysteine proteinase precursor -... -3   251  1.5e-20   1
gi|7381610|gb|AAF61565.1|AF227957_1(AF227957) catheps... -3   251  1.5e-20   1
gi|1185457|gb|AAA87848.1|(U38475) cathepsin L [Schist... -3   250  1.9e-20   1
gi|7435775|pir||JC5443cathepsin L-like cysteine prote... -3   250  1.9e-20   1



Locally-aligned regions (HSPs) with respect to query sequence:

Locus_ID                Frame -1 Hits
gi|5679322             |                                      _________   
gi|7435792             |                                           ____   
                        __________________________________________________
Query sequence:        |                 |                 |              | 139
                       0                50               100


Locus_ID                Frame -3 Hits
gi|7435795             |                                                  
gi|7242888             |                                                  
gi|7381221             |                                                  
gi|7211741             |                                                  
gi|7381219             |                                                  
gi|5051468             |                                                  
gi|4757570             |                                                  
gi|7211745             |                                                  
gi|542004              |                                                  
gi|100203              |                                                  
gi|1401242             |                                                  
gi|1172872             |                                                  
gi|1168251             |                                                  
gi|4567274             |                                                  
gi|7435816             |                                                  
gi|7435793             |                                                  
gi|118150              |                                                  
gi|7435803             |                                                  
gi|419782              |                                                  
gi|7211743             |                                                  
gi|419781              |                                                  
gi|1706260             |                                                  
gi|7435815             |                                                  
gi|5679322             |                                                  
gi|7435792             |                                                  
gi|1362047             |                                                  
gi|7435814             |                                                  
gi|7435812             |                                                  
gi|1353726             |                                                  
gi|118117              |                                                  
gi|118124              |______                                            
gi|118120              |______                                            
gi|537437              |                                                  
gi|1834307             |                                                  
gi|1085124             |                                                  
gi|7435804             |                                                  
gi|4426617             |                                                  
gi|5761329             |                                                  
gi|415567              |                                                  
gi|2118132             |                                                  
gi|2160175             |                                                  
gi|5777611             |_                                                 
gi|1079183             |                                                  
gi|399190              |                                                  
gi|1706263             |                                                  
gi|600111              |                                                  
gi|1076552             |                                                  
gi|7381610             |                                                  
gi|1185457             |                                                  
gi|7435775             |                                                  
                        __________________________________________________
Query sequence:        |                 |                 |              | 139
                       0                50               100

Use the and icons to retrieve links to Entrez:

E = Retrieve Entrez links (e.g., Medline abstracts, FASTA-formatted sequence reports).
R = Retrieve links to Related sequences (neighbors).
Use the icon (if present) to retrieve links to the Sequence Retrieval System (SRS).
Use the icon (if present) to retrieve links to the Ligand Enzyme and Chemical Compound Database .
Use the icon (if present) to retrieve links to the Protein Data Bank database.

WARNING:  Descriptions of 552 database sequences were not reported due to the
          limiting value of parameter V = 50.



to_Entrezto_Relatedto_Related >gi|7435795|pir||T12040  cysteine proteinase (EC 3.4.22.-) 2 precursor - kidney
            bean >gi|2511691|emb|CAB17075.1| (Z99953) cysteine proteinase
            precursor [Phaseolus vulgaris]
            Length = 365

Frame -3 hits (HSPs):             ____________________                    
                        __________________________________________________
Database sequence:     |                    |                   |         | 365
                       0                  150                 300

  Minus Strand HSPs:

 Score = 503 (177.1 bits), Expect = 3.0e-47, P = 3.0e-47
 Identities = 99/137 (72%), Positives = 107/137 (78%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKAPILPTNDLPTDF 236
             RA+ H +LDP          +DLTP   F   FLGLKPLRLP+ AQKAPILPTN+LP DF
Sbjct:    81 RARLHAQLDP-SAVHGVTKFSDLTPAE-FHRKFLGLKPLRLPAHAQKAPILPTNNLPKDF 138

Query:   235 DWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEER 56
             DWR+ GAVT VK+QGSCGSCWSFS  GALEGAHFL TGELVSLSEQQLVDCD  CDPEE 
Sbjct:   139 DWRDKGAVTNVKDQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEY 198

Query:    55 GACDSGCNGGLMTIAFE 5
             G+CDSGCNGGLM  AFE
Sbjct:   199 GSCDSGCNGGLMNNAFE 215


to_Entrezto_Relatedto_Related >gi|7242888|dbj|BAA92495.1|  (AB038598) cysteine protease [Vigna mungo]
            Length = 364

Frame -3 hits (HSPs):             ____________________                    
                        __________________________________________________
Database sequence:     |                    |                    |        | 364
                       0                  150                  300

  Minus Strand HSPs:

 Score = 492 (173.2 bits), Expect = 4.4e-46, P = 4.4e-46
 Identities = 97/137 (70%), Positives = 106/137 (77%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKAPILPTNDLPTDF 236
             RA+ H +LDP          +DLT    F   FLGLKPL LP++AQKAPILPTN+LP DF
Sbjct:    80 RARLHAQLDP-SAVHGVTKFSDLT-AAEFQRQFLGLKPLGLPANAQKAPILPTNNLPKDF 137

Query:   235 DWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEER 56
             DWR+ GAVT VK+QG+CGSCWSFS  GALEGAHFL TGELVSLSEQQLVDCD  CDPEE 
Sbjct:   138 DWRDKGAVTNVKDQGACGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEY 197

Query:    55 GACDSGCNGGLMTIAFE 5
             GACDSGCNGGLM  AFE
Sbjct:   198 GACDSGCNGGLMNNAFE 214


to_Entrezto_Relatedto_Related >gi|7381221|gb|AAF61441.1|AF138265_1  (AF138265) papain-like cysteine proteinase
            isoform II [Ipomoea batatas]
            Length = 366

Frame -3 hits (HSPs):             ____________________                    
                        __________________________________________________
Database sequence:     |                    |                   |         | 366
                       0                  150                 300

  Minus Strand HSPs:

 Score = 487 (171.4 bits), Expect = 1.5e-45, P = 1.5e-45
 Identities = 95/137 (69%), Positives = 109/137 (79%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGL-KPLRLPSDAQKAPILPTNDLPTD 239
             RAK HQ+LDP          +DLTP   F   FLGL + L+ P+DA+ APILPT++LP+D
Sbjct:    79 RAKQHQELDP-AAVHGVTQFSDLTPTE-FRRKFLGLNRRLKFPADAKTAPILPTDELPSD 136

Query:   238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59
             FDWR+HGAVT VKNQG+CGSCWSFS  GALEGA+FL TG+LVSLSEQQLVDCD ECDPEE
Sbjct:   137 FDWRDHGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 196

Query:    58 RGACDSGCNGGLMTIAFE 5
              G+CDSGCNGGLM  AFE
Sbjct:   197 AGSCDSGCNGGLMNSAFE 214


to_Entrezto_Relatedto_Related >gi|7211741|gb|AAF40414.1|AF216783_1  (AF216783) papain-like cysteine proteinase
            isoform I [Ipomoea batatas]
            Length = 368

Frame -3 hits (HSPs):             ____________________                    
                        __________________________________________________
Database sequence:     |                    |                   |         | 368
                       0                  150                 300

  Minus Strand HSPs:

 Score = 486 (171.1 bits), Expect = 1.9e-45, P = 1.9e-45
 Identities = 95/137 (69%), Positives = 109/137 (79%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGL-KPLRLPSDAQKAPILPTNDLPTD 239
             RAK HQ+LDP          +DLTP   F   FLGL + L+ P+DA+ APILPT++LP+D
Sbjct:    81 RAKRHQELDP-AAVHGVTQFSDLTPTE-FRRKFLGLNRRLKFPADAKTAPILPTDELPSD 138

Query:   238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59
             FDWR+HGAVT VKNQG+CGSCWSFS  GALEGA+FL TG+LVSLSEQQLVDCD ECDPEE
Sbjct:   139 FDWRDHGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 198

Query:    58 RGACDSGCNGGLMTIAFE 5
              G+CDSGCNGGLM  AFE
Sbjct:   199 AGSCDSGCNGGLMNSAFE 216


to_Entrezto_Relatedto_Related >gi|7381219|gb|AAF61440.1|AF138264_1  (AF138264) papain-like cysteine proteinase
            isoform I [Ipomoea batatas]
            Length = 368

Frame -3 hits (HSPs):             ____________________                    
                        __________________________________________________
Database sequence:     |                    |                   |         | 368
                       0                  150                 300

  Minus Strand HSPs:

 Score = 486 (171.1 bits), Expect = 1.9e-45, P = 1.9e-45
 Identities = 95/137 (69%), Positives = 109/137 (79%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGL-KPLRLPSDAQKAPILPTNDLPTD 239
             RAK HQ+LDP          +DLTP   F   FLGL + L+ P+DA+ APILPT++LP+D
Sbjct:    81 RAKRHQELDP-AAVHGVTQFSDLTPTE-FRRKFLGLNRRLKFPADAKTAPILPTDELPSD 138

Query:   238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59
             FDWR+HGAVT VKNQG+CGSCWSFS  GALEGA+FL TG+LVSLSEQQLVDCD ECDPEE
Sbjct:   139 FDWRDHGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 198

Query:    58 RGACDSGCNGGLMTIAFE 5
              G+CDSGCNGGLM  AFE
Sbjct:   199 AGSCDSGCNGGLMNSAFE 216


to_Entrezto_Relatedto_Related >gi|5051468|emb|CAB44983.1|  (AJ242994) putative preprocysteine proteinase
            [Nicotiana tabacum]
            Length = 363

Frame -3 hits (HSPs):             ___________________                     
                        __________________________________________________
Database sequence:     |                    |                    |        | 363
                       0                  150                  300

  Minus Strand HSPs:

 Score = 483 (170.0 bits), Expect = 4.0e-45, P = 4.0e-45
 Identities = 94/137 (68%), Positives = 106/137 (77%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKAPILPTNDLPTDF 236
             RA+ HQ LDP         S DLTP   F  ++LGL   +   +A+KAPILPT+DLP DF
Sbjct:    77 RARRHQLLDPSAEHGITKFS-DLTPSE-FRRTYLGLHKPKPKLNAEKAPILPTSDLPADF 134

Query:   235 DWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEER 56
             DWR+HGAVTGVKNQGSCGSCWSFS  GA+EGAHFL TGELVSLSEQQLVDCD ECDPE++
Sbjct:   135 DWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQ 194

Query:    55 GACDSGCNGGLMTIAFE 5
              ACD+GC GGLMT AFE
Sbjct:   195 DACDAGCGGGLMTTAFE 211


to_Entrezto_Relatedto_Related >gi|4757570|gb|AAD29084.1|AF082181_1  (AF082181) cysteine proteinase precursor
            [Solanum melongena]
            Length = 363

Frame -3 hits (HSPs):             ___________________                     
                        __________________________________________________
Database sequence:     |                    |                    |        | 363
                       0                  150                  300

  Minus Strand HSPs:

 Score = 480 (169.0 bits), Expect = 8.3e-45, P = 8.3e-45
 Identities = 96/137 (70%), Positives = 104/137 (75%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKAPILPTNDLPTDF 236
             RA+ HQ LDP         S DLTP   F  ++LGL   R   +AQKAPILPT+DLP DF
Sbjct:    77 RARRHQLLDPTAEHGITQFS-DLTPSE-FRRTYLGLHKPRPKLNAQKAPILPTSDLPEDF 134

Query:   235 DWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEER 56
             DWRE GAVTGVKNQGSCGSCWSFS  GA+EGAHFL TGELVSLSEQQLVDCD ECD EE+
Sbjct:   135 DWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEEK 194

Query:    55 GACDSGCNGGLMTIAFE 5
               CD+GCNGGLMT AFE
Sbjct:   195 SECDAGCNGGLMTTAFE 211


to_Entrezto_Relatedto_Related >gi|7211745|gb|AAF40416.1|AF216785_1  (AF216785) papain-like cysteine proteinase
            isoform III [Ipomoea batatas] >gi|7381223|gb|AAF61442.1|AF138266_1
            (AF138266) papain-like cysteine proteinase isoform III [Ipomoea
            batatas]
            Length = 366

Frame -3 hits (HSPs):             ____________________                    
                        __________________________________________________
Database sequence:     |                    |                   |         | 366
                       0                  150                 300

  Minus Strand HSPs:

 Score = 478 (168.3 bits), Expect = 1.3e-44, P = 1.3e-44
 Identities = 94/137 (68%), Positives = 108/137 (78%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGL-KPLRLPSDAQKAPILPTNDLPTD 239
             RAK HQ+LDP          +DLTP   F   FLGL + L+ P+DA+ APILPT++LP+D
Sbjct:    79 RAKRHQQLDP-AAVHGVTQFSDLTPTE-FRRKFLGLNRRLKFPADAKTAPILPTDELPSD 136

Query:   238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59
             FDWR+ GAVT VKNQG+CGSCWSFS  GALEGA+FL TG+LVSLSEQQLVDCD ECDPEE
Sbjct:   137 FDWRDRGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 196

Query:    58 RGACDSGCNGGLMTIAFE 5
              G+CDSGCNGGLM  AFE
Sbjct:   197 AGSCDSGCNGGLMNSAFE 214


to_Entrezto_Relatedto_Related >gi|542004|pir||S42882  cysteine proteinase (EC 3.4.22.-) precursor - spring
            vetch >gi|2129905|pir||S51818 cysteine proteinase precursor -
            spring vetch >gi|457756|emb|CAA82995.1| (Z30338) cysteine
            proteinase [Vicia sativa]
            Length = 358

Frame -3 hits (HSPs):             ___________________                     
Annotated Domains:                          __                  _______   
                        __________________________________________________
Database sequence:     |                    |                    |        | 358
                       0                  150                  300
__________________

Annotated Domains:
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 316..335
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 145..156
   PROSITE              THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 292..302
__________________


  Minus Strand HSPs:

 Score = 476 (167.6 bits), Expect = 2.2e-44, P = 2.2e-44
 Identities = 97/137 (70%), Positives = 105/137 (76%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGL-KPLRLPSDAQKAPILPTNDLPTD 239
             +AK HQKLDP         S DLT    F   FLGL K LRLP+ AQKAPILPT +LP D
Sbjct:    73 KAKLHQKLDPTAEHGITKFS-DLT-ASEFRRQFLGLNKRLRLPAHAQKAPILPTTNLPED 130

Query:   238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59
             FDWRE GAVT VK+QGSCGSCW+FS  GALEGAH+L TG+LVSLSEQQLVDCD  CDPEE
Sbjct:   131 FDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEE 190

Query:    58 RGACDSGCNGGLMTIAFE 5
              G+CDSGCNGGLM  AFE
Sbjct:   191 AGSCDSGCNGGLMNNAFE 208


to_Entrezto_Relatedto_Relatedto_ec >gi|100203|pir||S24988  cysteine proteinase (EC 3.4.22.-) precursor - tomato
            (fragment) >gi|19195|emb|CAA78403.1| (Z14028) pre-pro-cysteine
            proteinase [Lycopersicon esculentum]
            Length = 361

Frame -3 hits (HSPs):             ___________________                     
Annotated Domains:                          __                     ____   
                        __________________________________________________
Database sequence:     |                    |                    |        | 361
                       0                  150                  300
__________________

Annotated Domains:
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 316..335
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 146..157
__________________


  Minus Strand HSPs:

 Score = 474 (166.9 bits), Expect = 3.6e-44, P = 3.6e-44
 Identities = 95/137 (69%), Positives = 103/137 (75%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKAPILPTNDLPTDF 236
             RAK HQ LDP         S DLTP   F  ++LGL   R   +A+KAPILPT DLP+DF
Sbjct:    75 RAKRHQLLDPSAEHGITQFS-DLTPSE-FRRTYLGLNKPRPNLNAEKAPILPTKDLPSDF 132

Query:   235 DWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEER 56
             DWRE GAVT VKNQGSCGSCWSFS  GA+EGAHFL TGELVSLSEQQLVDCD ECDP E+
Sbjct:   133 DWREKGAVTDVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPVEK 192

Query:    55 GACDSGCNGGLMTIAFE 5
               CD+GCNGGLMT AFE
Sbjct:   193 NDCDAGCNGGLMTTAFE 209


to_Entrezto_Relatedto_Related >gi|1401242|gb|AAB67878.1|  (U59465) pre-pro-cysteine proteinase [Vicia faba]
            Length = 363

Frame -3 hits (HSPs):             ____________________                    
                        __________________________________________________
Database sequence:     |                    |                    |        | 363
                       0                  150                  300

  Minus Strand HSPs:

 Score = 474 (166.9 bits), Expect = 3.6e-44, P = 3.6e-44
 Identities = 96/137 (70%), Positives = 105/137 (76%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKP-LRLPSDAQKAPILPTNDLPTD 239
             +AK HQKLDP         S DLT    F   FLGLK  LRLP+ AQKAPILPT +LP D
Sbjct:    78 KAKLHQKLDPTAEHGITKFS-DLT-ASEFRRQFLGLKKRLRLPAHAQKAPILPTTNLPED 135

Query:   238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59
             FDWRE GAVT VK+QGSCGSCW+FS  GALEGAH+L TG+LVSLSEQQLVDCD  CDPE+
Sbjct:   136 FDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQ 195

Query:    58 RGACDSGCNGGLMTIAFE 5
              G+CDSGCNGGLM  AFE
Sbjct:   196 AGSCDSGCNGGLMNNAFE 213


to_Entrezto_Relatedto_Relatedto_ec >gi|1172872|sp|P43296|RD19_ARATH  CYSTEINE PROTEINASE RD19A PRECURSOR
            >gi|541856|pir||JN0718 cysteine proteinase (EC 3.4.22.-) RD19A
            precursor, drought-inducible - Arabidopsis thaliana
            >gi|435618|dbj|BAA02373.1| (D13042) thiol protease [Arabidopsis
            thaliana] >gi|4539328|emb|CAB38829.1| (AL035679) drought-inducible
            cysteine proteinase RD19A precursor [Arabidopsis thaliana]
            >gi|7270892|emb|CAB80572.1| (AL161594) drought-inducible cysteine
            proteinase RD19A precursor [Arabidopsis thaliana]
            Length = 368

Frame -3 hits (HSPs):             ____________________                    
Annotated Domains:      _________________________________________________ 
                        __________________________________________________
Database sequence:     |                    |                   |         | 368
                       0                  150                 300
__________________

Annotated Domains:
   BLOCKS               BL00139A: Eukaryotic thiol (cysteine) pr 153..162
   BLOCKS               BL00139B: Eukaryotic thiol (cysteine) pr 203..211
   BLOCKS               BL00139C: Eukaryotic thiol (cysteine) pr 301..310
   BLOCKS               BL00139D: Eukaryotic thiol (cysteine) pr 324..340
   DOMO                 DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 40..360
   Entrez               active site: BY SIMILARITY.              159
   Entrez               active site: BY SIMILARITY.              302
   Entrez               active site: BY SIMILARITY.              329
   Entrez               glycosylation site: POTENTIAL.           253
   PFAM                 Peptidase_C1: Papain family cysteine pro 135..360
   PRINTS               PAPAIN1: Papain cysteine protease family 153..168
   PRINTS               PAPAIN2: Papain cysteine protease family 302..312
   PRINTS               PAPAIN3: Papain cysteine protease family 324..330
   PRODOM               PD078495: RD19_ARATH                     1..48
   PRODOM               PD000247: CYSP(11) CATL(7) CYS2(5)       50..118
   PRODOM               PD000158: CYSP(14) CATL(9) CYS1(8)       135..356
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 324..343
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 153..164
   PROSITE              THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 300..310
__________________


  Minus Strand HSPs:

 Score = 472 (166.2 bits), Expect = 5.8e-44, P = 5.8e-44
 Identities = 94/137 (68%), Positives = 104/137 (75%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKP-LRLPSDAQKAPILPTNDLPTD 239
             RA+ HQKLDP          +DLT    F    LG++   +LP DA KAPILPT +LP D
Sbjct:    81 RARRHQKLDP-SATHGVTQFSDLTRSE-FRKKHLGVRSGFKLPKDANKAPILPTENLPED 138

Query:   238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59
             FDWR+HGAVT VKNQGSCGSCWSFSA GALEGA+FL TG+LVSLSEQQLVDCD ECDPEE
Sbjct:   139 FDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 198

Query:    58 RGACDSGCNGGLMTIAFE 5
               +CDSGCNGGLM  AFE
Sbjct:   199 ADSCDSGCNGGLMNSAFE 216


to_Entrezto_Relatedto_Relatedto_ec >gi|1168251|sp|P43295|A494_ARATH  PROBABLE CYSTEINE PROTEINASE A494 PRECURSOR
            >gi|1076384|pir||S46535 cysteine proteinase (EC 3.4.22.-) (clone
            A1494) - Arabidopsis thaliana (fragment) >gi|516865|emb|CAA52403.1|
            (X74359) putative thiol protease [Arabidopsis thaliana]
            Length = 313

Frame -3 hits (HSPs):       _______________________                       
Annotated Domains:                   _____________________________________
                        __________________________________________________
Database sequence:     |       |       |       |       |       |       |  | 313
                       0      50     100     150     200     250     300
__________________

Annotated Domains:
   Entrez               active site: BY SIMILARITY.              108
   Entrez               active site: BY SIMILARITY.              251
   Entrez               active site: BY SIMILARITY.              278
   PFAM                 Peptidase_C1: Papain family cysteine pro 84..309
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 273..292
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 102..113
   PROSITE              THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 249..259
__________________


  Minus Strand HSPs:

 Score = 469 (165.1 bits), Expect = 1.2e-43, P = 1.2e-43
 Identities = 94/137 (68%), Positives = 103/137 (75%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKP-LRLPSDAQKAPILPTNDLPTD 239
             RA  HQK+DP  R      S DLT    F    LG+K   +LP DA +APILPT +LP +
Sbjct:    30 RAMRHQKMDPSARHGVTQFS-DLTRSE-FRRKHLGVKGGFKLPKDANQAPILPTQNLPEE 87

Query:   238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59
             FDWR+ GAVT VKNQGSCGSCWSFS  GALEGAHFL TG+LVSLSEQQLVDCD ECDPEE
Sbjct:    88 FDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEE 147

Query:    58 RGACDSGCNGGLMTIAFE 5
              G+CDSGCNGGLM  AFE
Sbjct:   148 EGSCDSGCNGGLMNSAFE 165


to_Entrezto_Relatedto_Related >gi|4567274|gb|AAD23687.1|AC006841_20  (AC006841) putative cysteine proteinase
            [Arabidopsis thaliana]
            Length = 361

Frame -3 hits (HSPs):             ____________________                    
                        __________________________________________________
Database sequence:     |                    |                    |        | 361
                       0                  150                  300

  Minus Strand HSPs:

 Score = 469 (165.1 bits), Expect = 1.2e-43, P = 1.2e-43
 Identities = 94/137 (68%), Positives = 103/137 (75%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKP-LRLPSDAQKAPILPTNDLPTD 239
             RA  HQK+DP  R      S DLT    F    LG+K   +LP DA +APILPT +LP +
Sbjct:    78 RAMRHQKMDPSARHGVTQFS-DLTRSE-FRRKHLGVKGGFKLPKDANQAPILPTQNLPEE 135

Query:   238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59
             FDWR+ GAVT VKNQGSCGSCWSFS  GALEGAHFL TG+LVSLSEQQLVDCD ECDPEE
Sbjct:   136 FDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEE 195

Query:    58 RGACDSGCNGGLMTIAFE 5
              G+CDSGCNGGLM  AFE
Sbjct:   196 EGSCDSGCNGGLMNSAFE 213


to_Entrezto_Relatedto_Related >gi|7435816|pir||T08844  cysteine proteinase (EC 3.4.22.-) isoform B - soybean
            (fragment) >gi|1619903|gb|AAB16996.1| (U71379) thiol protease
            isoform B [Glycine max]
            Length = 319

Frame -3 hits (HSPs):           ___________________                       
                        __________________________________________________
Database sequence:     |       |       |       |       |       |      |   | 319
                       0      50     100     150     200     250    300

  Minus Strand HSPs:

 Score = 466 (164.0 bits), Expect = 2.5e-43, P = 2.5e-43
 Identities = 88/117 (75%), Positives = 95/117 (81%), Frame = -3

Query:   355 TDLTPGGVFAASFLGLKPLRLPSDAQKAPILPTNDLPTDFDWREHGAVTGVKNQGSCGSC 176
             +DLTP   F   FLGLK +R P+ AQKAPILPT DLP DFDWR+ GAVT VK+QG CGSC
Sbjct:    54 SDLTPAE-FRRQFLGLKAVRFPAHAQKAPILPTKDLPKDFDWRDKGAVTNVKDQGGCGSC 112

Query:   175 WSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5
             WSFS  GALEGA++L TGELVSLSEQQLVDCD  CDPEE GACDSGCNGGLM  AFE
Sbjct:   113 WSFSTTGALEGAYYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFE 169


to_Entrezto_Relatedto_Related >gi|7435793|pir||T09528  probable cysteine proteinase (EC 3.4.22.-) precursor -
            chickpea >gi|3377952|emb|CAA08906.1| (AJ009878) cysteine proteinase
            [Cicer arietinum]
            Length = 362

Frame -3 hits (HSPs):             ____________________                    
                        __________________________________________________
Database sequence:     |                    |                    |        | 362
                       0                  150                  300

  Minus Strand HSPs:

 Score = 466 (164.0 bits), Expect = 2.5e-43, P = 2.5e-43
 Identities = 95/137 (69%), Positives = 105/137 (76%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKP-LRLPSDAQKAPILPTNDLPTD 239
             +AK HQKLDP         S DLT    F   FLGLK  LRLP+ AQKAPILPTN+LP D
Sbjct:    77 KAKLHQKLDPSAEHGVTKFS-DLT-ASEFRRQFLGLKKRLRLPAHAQKAPILPTNNLPED 134

Query:   238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59
             FDWRE GAVT VK+QGSCGSCW+FS  GALEGA++L TG+LVSLSEQQLVDCD  CDP+E
Sbjct:   135 FDWREKGAVTPVKDQGSCGSCWAFSTTGALEGANYLATGKLVSLSEQQLVDCDHVCDPDE 194

Query:    58 RGACDSGCNGGLMTIAFE 5
               +CDSGCNGGLM  AFE
Sbjct:   195 YNSCDSGCNGGLMNNAFE 212


to_Entrezto_Relatedto_Relatedto_ec >gi|118150|sp|P25804|CYSP_PEA  CYSTEINE PROTEINASE 15A PRECURSOR
            (TURGOR-RESPONSIVE PROTEIN 15A) >gi|100050|pir||S11862 cysteine
            proteinase (EC 3.4.22.-) - garden pea >gi|20679|emb|CAA38242.1|
            (X54358) 363 aa peptide [Pisum sativum]
            Length = 363

Frame -3 hits (HSPs):             ____________________                    
Annotated Domains:      __________________________________________________
                        __________________________________________________
Database sequence:     |                    |                    |        | 363
                       0                  150                  300
__________________

Annotated Domains:
   DOMO                 DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 37..357
   Entrez               active site: BY SIMILARITY.              156
   Entrez               active site: BY SIMILARITY.              299
   Entrez               active site: BY SIMILARITY.              326
   Entrez               glycosylation site: POTENTIAL.           249
   PFAM                 Peptidase_C1: Papain family cysteine pro 132..357
   PRINTS               PAPAIN1: Papain cysteine protease family 150..165
   PRINTS               PAPAIN2: Papain cysteine protease family 299..309
   PRINTS               PAPAIN3: Papain cysteine protease family 321..327
   PRODOM               PD031710: CYSP(1) Q41671(1) O81930(1)    1..45
   PRODOM               PD000247: CYSP(11) CATL(7) CYS2(5)       47..115
   PRODOM               PD000158: CYSP(14) CATL(9) CYS1(8)       132..353
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 321..340
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 150..161
   PROSITE              THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 297..307
__________________


  Minus Strand HSPs:

 Score = 463 (163.0 bits), Expect = 5.2e-43, P = 5.2e-43
 Identities = 94/137 (68%), Positives = 103/137 (75%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKP-LRLPSDAQKAPILPTNDLPTD 239
             +AK HQ  DP         S DLT    F   FLGLK  LRLP+ AQKAPILPT +LP D
Sbjct:    78 KAKLHQNRDPTAEHGITKFS-DLT-ASEFRRQFLGLKKRLRLPAHAQKAPILPTTNLPED 135

Query:   238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59
             FDWRE GAVT VK+QGSCGSCW+FS  GALEGAH+L TG+LVSLSEQQLVDCD  CDPE+
Sbjct:   136 FDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQ 195

Query:    58 RGACDSGCNGGLMTIAFE 5
              G+CDSGCNGGLM  AFE
Sbjct:   196 AGSCDSGCNGGLMNNAFE 213


to_Entrezto_Relatedto_Related >gi|7435803|pir||D71428  cysteine proteinase (EC 3.4.22.-) - Arabidopsis
            thaliana >gi|2244977|emb|CAB10398.1| (Z97340) cysteine proteinase
            like protein [Arabidopsis thaliana] >gi|7268368|emb|CAB78661.1|
            (AL161543) cysteine proteinase like protein [Arabidopsis thaliana]
            Length = 373

Frame -3 hits (HSPs):              ___________________                    
                        __________________________________________________
Database sequence:     |                   |                    |         | 373
                       0                 150                  300

  Minus Strand HSPs:

 Score = 463 (163.0 bits), Expect = 5.2e-43, P = 5.2e-43
 Identities = 94/137 (68%), Positives = 104/137 (75%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKP--LRLPSDAQKAPILPTNDLPT 242
             RA+ +Q LDP          +DLTP   F   FLGLK    RLP+D Q APILPT+DLPT
Sbjct:    85 RARRNQLLDP-SAVHGVTQFSDLTPKE-FRRKFLGLKRRGFRLPTDTQTAPILPTSDLPT 142

Query:   241 DFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPE 62
             +FDWRE GAVT VKNQG CGSCWSFSA+GALEGAHFL T ELVSLSEQQLVDCD ECDP 
Sbjct:   143 EFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCDHECDPA 202

Query:    61 ERGACDSGCNGGLMTIAFE 5
             +  +CDSGC+GGLM  AFE
Sbjct:   203 QANSCDSGCSGGLMNNAFE 221


to_Entrezto_Relatedto_Related >gi|419782|pir||S30150  cysteine proteinase (EC 3.4.22.-) precursor (clone
            CYP-8) - common tobacco >gi|19851|emb|CAA78365.1| (Z13964) tobacco
            pre-pro-cysteine proteinase [Nicotiana tabacum]
            Length = 365

Frame -3 hits (HSPs):             ____________________                    
Annotated Domains:                          __                     ____   
                        __________________________________________________
Database sequence:     |                    |                   |         | 365
                       0                  150                 300
__________________

Annotated Domains:
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 320..339
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 150..161
__________________


  Minus Strand HSPs:

 Score = 461 (162.3 bits), Expect = 8.5e-43, P = 8.5e-43
 Identities = 90/137 (65%), Positives = 105/137 (76%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKAPILPTNDLPTDF 236
             RA+ +Q LDP         S DLTP   F  ++LGL   +   +A+KAPILPT+DLP D+
Sbjct:    79 RARLNQLLDPSAEHGITKFS-DLTPSE-FRRTYLGLHKPKPKVNAEKAPILPTSDLPADY 136

Query:   235 DWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEER 56
             DWR+HGAVTGVKNQGSCGSCWSFS  GA+EGAHFL TGELVSLSEQQLVDCD ECD E++
Sbjct:   137 DWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDSEQQ 196

Query:    55 GACDSGCNGGLMTIAFE 5
              +CD+GC GGLMT AFE
Sbjct:   197 DSCDAGCGGGLMTTAFE 213


to_Entrezto_Relatedto_Related >gi|7211743|gb|AAF40415.1|AF216784_1  (AF216784) papain-like cysteine proteinase
            isoform II [Ipomoea batatas]
            Length = 368

Frame -3 hits (HSPs):             ____________________                    
                        __________________________________________________
Database sequence:     |                    |                   |         | 368
                       0                  150                 300

  Minus Strand HSPs:

 Score = 460 (161.9 bits), Expect = 1.1e-42, P = 1.1e-42
 Identities = 91/137 (66%), Positives = 105/137 (76%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGL-KPLRLPSDAQKAPILPTNDLPTD 239
             RAK HQ+LDP          +D TP   F   FLGL + L+ P+DA+ APILPT++LP+D
Sbjct:    81 RAKRHQELDP-AAVHGVTQFSDSTPTE-FRRKFLGLNRRLKFPADAKTAPILPTDELPSD 138

Query:   238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59
             FDWR+ GAVT VKNQG+CG CWSFS  GALEGA+FL TG+LVSLSEQQLVDCD ECDPEE
Sbjct:   139 FDWRDRGAVTPVKNQGTCGLCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 198

Query:    58 RGACDSGCNGGLMTIAFE 5
              G+CD GCNGGLM  AFE
Sbjct:   199 AGSCDFGCNGGLMNSAFE 216


to_Entrezto_Relatedto_Related >gi|419781|pir||S30149  cysteine proteinase (EC 3.4.22.-) precursor (clone
            CYP-7) - common tobacco >gi|19849|emb|CAA78361.1| (Z13959) tobacco
            pre-pro-cysteine proteinase [Nicotiana tabacum]
            Length = 363

Frame -3 hits (HSPs):             ___________________                     
Annotated Domains:          _____________________________________________ 
                        __________________________________________________
Database sequence:     |                    |                    |        | 363
                       0                  150                  300
__________________

Annotated Domains:
   DOMO                 DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 36..354
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 318..337
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 148..159
__________________


  Minus Strand HSPs:

 Score = 457 (160.9 bits), Expect = 2.3e-42, P = 2.3e-42
 Identities = 90/137 (65%), Positives = 103/137 (75%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKAPILPTNDLPTDF 236
             RA+ +Q LDP         S DLTP   F  ++LGL   +   +A+KAPILPT+DLP DF
Sbjct:    77 RARLNQLLDPSAEHGITKFS-DLTPSE-FRRTYLGLHKPKPKLNAEKAPILPTSDLPADF 134

Query:   235 DWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEER 56
             DWR+HGAVTGVKNQGSCGSCWSFS  GA+EGAHFL TGELVSLSEQQLVDCD ECDPE++
Sbjct:   135 DWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQ 194

Query:    55 GACDSGCNGGLMTIAFE 5
              ACD+GC GG    AFE
Sbjct:   195 DACDAGCGGGHYATAFE 211


to_Entrezto_Relatedto_Relatedto_ec >gi|1706260|sp|Q10716|CYS1_MAIZE  CYSTEINE PROTEINASE 1 PRECURSOR
            >gi|2118131|pir||S59597 cysteine proteinase (EC 3.4.22.-) 1
            precursor - maize >gi|643597|dbj|BAA08244.1| (D45402) cysteine
            proteinase [Zea mays]
            Length = 371

Frame -3 hits (HSPs):             ____________________                    
Annotated Domains:        _______________________________________________ 
                        __________________________________________________
Database sequence:     |                    |                   |         | 371
                       0                  150                 300
__________________

Annotated Domains:
   BLOCKS               BL00139A: Eukaryotic thiol (cysteine) pr 155..164
   BLOCKS               BL00139B: Eukaryotic thiol (cysteine) pr 205..213
   BLOCKS               BL00139C: Eukaryotic thiol (cysteine) pr 302..311
   BLOCKS               BL00139D: Eukaryotic thiol (cysteine) pr 325..341
   DOMO                 DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 37..364
   Entrez               active site: BY SIMILARITY.              161
   Entrez               active site: BY SIMILARITY.              303
   Entrez               active site: BY SIMILARITY.              330
   Entrez               glycosylation site: POTENTIAL.           254
   PFAM                 Peptidase_C1: Papain family cysteine pro 137..364
   PRINTS               PAPAIN1: Papain cysteine protease family 155..170
   PRINTS               PAPAIN2: Papain cysteine protease family 303..313
   PRINTS               PAPAIN3: Papain cysteine protease family 325..331
   PRODOM               PD078471: CYS1_MAIZE                     22..44
   PRODOM               PD000247: CYSP(11) CATL(7) CYS2(5)       46..112
   PRODOM               PD171174: CYS1_MAIZE                     114..135
   PRODOM               PD000158: CYSP(14) CATL(9) CYS1(8)       137..360
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 325..344
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 155..166
   PROSITE              THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 301..311
__________________


  Minus Strand HSPs:

 Score = 418 (147.1 bits), Expect = 3.1e-38, P = 3.1e-38
 Identities = 87/140 (62%), Positives = 98/140 (70%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLR------LPSDAQKAPILPTN 254
             RA+ HQ LDP         S DLTP   F  ++LGL+  R      L   A +AP+LPT+
Sbjct:    78 RARRHQLLDPSAEHGVTKFS-DLTPAE-FRRTYLGLRKSRRALLRELGESAHEAPVLPTD 135

Query:   253 DLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLE 74
              LP DFDWR+HGAV  VKNQGSCGSCWSFSA GALEGAH+L TG+L  LSEQQ VDCD E
Sbjct:   136 GLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHE 195

Query:    73 CDPEERGACDSGCNGGLMTIAF 8
             CD  E  +CDSGCNGGLMT AF
Sbjct:   196 CDSSEPDSCDSGCNGGLMTTAF 217


to_Entrezto_Relatedto_Related >gi|7435815|pir||T08845  cysteine proteinase (EC 3.4.22.-) isoform A - soybean
            (fragment) >gi|1619905|gb|AAB16997.1| (U71380) thiol protease
            isoform A [Glycine max]
            Length = 318

Frame -3 hits (HSPs):              ________________                       
                        __________________________________________________
Database sequence:     |       |       |       |       |       |       |  | 318
                       0      50     100     150     200     250     300

  Minus Strand HSPs:

 Score = 404 (142.2 bits), Expect = 9.3e-37, P = 9.3e-37
 Identities = 75/99 (75%), Positives = 81/99 (81%), Frame = -3

Query:   301 LRLPSDAQKAPILPTNDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTG 122
             +R P+ AQKAPILPT DLP DFDWR+ GAVT VK+ G CGSCWSFS  GALE + +L TG
Sbjct:    71 VRFPAHAQKAPILPTKDLPKDFDWRDKGAVTNVKDLGGCGSCWSFSTTGALEVSFYLATG 130

Query:   121 ELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5
             ELVSLSEQQLVDCD  CDPEE GACDSGCNGGLM  AFE
Sbjct:   131 ELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFE 169


to_Entrezto_Relatedto_Related >gi|5679322|gb|AAD46920.1|AF167986_1  (AF167986) putative cysteine proteinase
            GmPM33 [Glycine max]
            Length = 363

Frame -1 hits (HSPs):               _____                                 
Frame -3 hits (HSPs):                  _____________                      
                        __________________________________________________
Database sequence:     |                    |                    |        | 363
                       0                  150                  300

  Minus Strand HSPs:

 Score = 322 (113.3 bits), Expect = 1.3e-31, Sum P(2) = 1.3e-31
 Identities = 58/89 (65%), Positives = 72/89 (80%), Frame = -3

Query:   274 APILPTNDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQ 95
             AP L  + LP +FDWRE GAVT VK QG CGSCW+FS  G++EGA+FL TG+LVSLS+QQ
Sbjct:   115 APPLEVDGLPENFDWREKGAVTEVKLQGRCGSCWAFSTTGSIEGANFLATGKLVSLSDQQ 174

Query:    94 LVDCDLECDPEERGACDSGCNGGLMTIAF 8
             L+DCD +CD  E+ +CD+GCNGGLMT A+
Sbjct:   175 LLDCDNKCDITEKTSCDNGCNGGLMTNAY 203

 Score = 49 (17.2 bits), Expect = 1.3e-31, Sum P(2) = 1.3e-31
 Identities = 13/26 (50%), Positives = 17/26 (65%), Frame = -1

Query:   387 PSAVHGVPQV-LPISLPAE--FSPPV 319
             P+AVHGV Q  LP+S  A    +PP+
Sbjct:    93 PTAVHGVTQFSLPVSNNAAGGIAPPL 118


to_Entrezto_Relatedto_Related >gi|7435792|pir||T12042  cysteine proteinase (EC 3.4.22.-) 4 precursor - kidney
            bean >gi|2511695|emb|CAB17077.1| (Z99955) cysteine proteinase
            precursor [Phaseolus vulgaris]
            Length = 377

Frame -1 hits (HSPs):               __                                    
Frame -3 hits (HSPs):                    ____________                     
                        __________________________________________________
Database sequence:     |                   |                   |          | 377
                       0                 150                 300

  Minus Strand HSPs:

 Score = 318 (111.9 bits), Expect = 6.3e-30, Sum P(2) = 6.3e-30
 Identities = 57/90 (63%), Positives = 70/90 (77%), Frame = -3

Query:   274 APILPTNDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQ 95
             AP L  + LP DFDWRE GAVT VK QG CGSCW+FS  G++EGA+F+ TG+L++LSEQQ
Sbjct:   130 APPLKVDGLPEDFDWREKGAVTEVKMQGKCGSCWAFSTTGSIEGANFIATGKLLNLSEQQ 189

Query:    94 LVDCDLECDPEERGACDSGCNGGLMTIAFE 5
             LVDCD +CD  E   CD+GC GGLMT A++
Sbjct:   190 LVDCDSQCDITESTTCDNGCMGGLMTNAYK 219

 Score = 37 (13.0 bits), Expect = 6.3e-30, Sum P(2) = 6.3e-30
 Identities = 6/9 (66%), Positives = 8/9 (88%), Frame = -1

Query:   387 PSAVHGVPQ 361
             P+A+HGV Q
Sbjct:    92 PTAIHGVTQ 100


to_Entrezto_Relatedto_Relatedto_ec >gi|1362047|pir||S55923  cysteine proteinase (EC 3.4.22.-) precursor - soybean
            >gi|479060|emb|CAA83673.1| (Z32795) cysteine proteinase [Glycine
            max] >gi|1096153|prf||2111244A Cys protease [Glycine max]
            Length = 380

Frame -3 hits (HSPs):             ___________________                     
Annotated Domains:      ___________________ ___                 __ ___    
                        __________________________________________________
Database sequence:     |                   |                   |          | 380
                       0                 150                 300
__________________

Annotated Domains:
   Entrez               domain: signal sequence                  1..29
   Entrez               domain: propeptide                       30..139
   Entrez               active site: Cys, His, Asn               164
   Entrez               active site: Cys, His, Asn               307
   Entrez               active site: Cys, His, Asn               334
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 329..348
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 158..169
   PROSITE              THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 305..315
__________________


  Minus Strand HSPs:

 Score = 334 (117.6 bits), Expect = 2.4e-29, P = 2.4e-29
 Identities = 73/137 (53%), Positives = 89/137 (64%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKA----PILPTNDL 248
             RA  HQ LDP          +DLT    F   + G+     PS    A    P L  + L
Sbjct:    84 RAAEHQALDP-TAVHGVTQFSDLTEDE-FEKLYTGVNG-GFPSSNNAAGGIAPPLEVDGL 140

Query:   247 PTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECD 68
             P +FDWRE GAVT VK QG CGSCW+FS  G++EGA+FL TG+LVSLSEQQL+DCD +CD
Sbjct:   141 PENFDWREKGAVTEVKLQGRCGSCWAFSTTGSIEGANFLATGKLVSLSEQQLLDCDNKCD 200

Query:    67 PEERGACDSGCNGGLMTIAF 8
               E+ +CD+GCNGGLMT A+
Sbjct:   201 ITEKTSCDNGCNGGLMTNAY 220


to_Entrezto_Relatedto_Related >gi|7435814|pir||T10949  cysteine proteinase (EC 3.4.22.-) precursor - spring
            vetch >gi|2414683|emb|CAB16316.1| (Z99172) cysteine proteinase
            precursor [Vicia sativa]
            Length = 379

Frame -3 hits (HSPs):              __________________                     
                        __________________________________________________
Database sequence:     |                   |                   |          | 379
                       0                 150                 300

  Minus Strand HSPs:

 Score = 327 (115.1 bits), Expect = 1.3e-28, P = 1.3e-28
 Identities = 70/137 (51%), Positives = 89/137 (64%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQK--APILPTNDLPT 242
             +A  HQ LDP          +DL+    F   + G K     S+A    AP L     P 
Sbjct:    85 KAAEHQALDP-TAIHGVTQFSDLSEEE-FERFYTGFKGGFPSSNAAGGVAPPLDVKGFPE 142

Query:   241 DFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPE 62
             +FDWRE GAVTG+K QG CGSCW+F+  G++EGA+FL TG+LVSLSEQQLVDCD +CD  
Sbjct:   143 NFDWREKGAVTGIKTQGKCGSCWAFTTTGSIEGANFLATGKLVSLSEQQLVDCDNKCDIT 202

Query:    61 ERGACDSGCNGGLMTIAFE 5
             +  +CD+GCNGGLMT A++
Sbjct:   203 KT-SCDNGCNGGLMTTAYD 220


to_Entrezto_Relatedto_Related >gi|7435812|pir||T06726  cysteine proteinase (EC 3.4.22.-) F28P10.80 -
            Arabidopsis thaliana >gi|4678299|emb|CAB41090.1| (AL049655)
            cysteine proteinase precursor-like protein [Arabidopsis thaliana]
            Length = 363

Frame -3 hits (HSPs):              ___________________                    
                        __________________________________________________
Database sequence:     |                    |                    |        | 363
                       0                  150                  300

  Minus Strand HSPs:

 Score = 323 (113.7 bits), Expect = 3.6e-28, P = 3.6e-28
 Identities = 70/137 (51%), Positives = 90/137 (65%), Frame = -3

Query:   415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPL---RLPSDAQKAPILPTNDLP 245
             +A  HQ +DP          +DLT    F   + G+  +   R  +   +AP++  + LP
Sbjct:    81 KAAEHQMMDP-SAVHGVTQFSDLTEEE-FKRMYTGVADVGGSRGGTVGAEAPMVEVDGLP 138

Query:   244 TDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDP 65
              DFDWRE G VT VKNQG+CGSCW+FS  GA EGAHF+ TG+L+SLSEQQLVDCD + D 
Sbjct:   139 EDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCD-QADK 197

Query:    64 EERGACDSGCNGGLMTIAFE 5
             +   ACD+GC GGLMT A+E
Sbjct:   198 K---ACDNGCGGGLMTNAYE 214


to_Entrezto_Relatedto_Related >gi|1353726|gb|AAB01769.1|  (U42758) cysteine proteinase homolog [Naegleria
            fowleri]
            Length = 347

Frame -3 hits (HSPs):              ___________________                    
                        __________________________________________________
Database sequence:     |                     |                     |      | 347
                       0                   150                   300

  Minus Strand HSPs:

 Score = 300 (105.6 bits), Expect = 9.8e-26, P = 9.8e-26
 Identities = 68/124 (54%), Positives = 84/124 (67%), Frame = -3

Query:   355 TDLTPGGVFAASFLGLKPLRLPSDAQK---AP---ILPTNDL---PTDFDWREHGAVTGV 203
             +DLTP   F   FL +K    P +A+K   AP   +L   ++   PT FDWR+HGAVT V
Sbjct:    81 SDLTPEE-FKRMFL-MKTYT-PEEAKKILAAPQHAVLSEKEVQTAPTSFDWRQHGAVTRV 137

Query:   202 KNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDP-EERGACDSGCNGG 26
             KNQG+CGSCW+FS  G +EG   +  G+LVSLSEQQLVDCD  C   + + ACDSGCNGG
Sbjct:   138 KNQGACGSCWTFSTTGNVEGQWAIKKGKLVSLSEQQLVDCDHNCVTYQNQQACDSGCNGG 197

Query:    25 LMTIAFE 5
             LM  AF+
Sbjct:   198 LMWSAFQ 204


to_Entrezto_Relatedto_Relatedto_ec >gi|118117|sp|P04988|CYS1_DICDI  CYSTEINE PROTEINASE 1 PRECURSOR
            >gi|67647|pir||KHDO cysteine proteinase 1 (EC 3.4.22.-) precursor -
            slime mold (Dictyostelium discoideum) >gi|1617037|emb|CAA26255.1|
            (X02407) cysteine proteinase I precursor [Dictyostelium discoideum]
            Length = 343

Frame -3 hits (HSPs):                   _____________                     
Annotated Domains:      __________________________________________________
                        __________________________________________________
Database sequence:     |                     |                     |      | 343
                       0                   150                   300
__________________

Annotated Domains:
   BLOCKS               BL00139A: Eukaryotic thiol (cysteine) pr 136..145
   BLOCKS               BL00139B: Eukaryotic thiol (cysteine) pr 187..195
   BLOCKS               BL00139C: Eukaryotic thiol (cysteine) pr 285..294
   BLOCKS               BL00139D: Eukaryotic thiol (cysteine) pr 306..322
   DOMO                 DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 19..342
   Entrez               active site: BY SIMILARITY.              142
   Entrez               active site: BY SIMILARITY.              286
   Entrez               active site: BY SIMILARITY.              311
   PFAM                 Peptidase_C1: Papain family cysteine pro 118..342
   PRINTS               PAPAIN1: Papain cysteine protease family 136..151
   PRINTS               PAPAIN2: Papain cysteine protease family 286..296
   PRINTS               PAPAIN3: Papain cysteine protease family 306..312
   PRODOM               PD171194: CYS1_DICDI                     1..25
   PRODOM               PD000247: CYSP(11) CATL(7) CYS2(5)       27..101
   PRODOM               PD000158: CYSP(14) CATL(9) CYS1(8)       118..338
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 306..325
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 136..147
   PROSITE              THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 284..294
__________________


  Minus Strand HSPs:

 Score = 290 (102.1 bits), Expect = 1.1e-24, P = 1.1e-24
 Identities = 54/84 (64%), Positives = 60/84 (71%), Frame = -3

Query:   256 NDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDL 77
             N +PT FDWR  GAVT VKNQG CGSCWSFS  G +EG HF+   +LVSLSEQ LVDCD 
Sbjct:   116 NSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 175

Query:    76 EC-DPEERGACDSGCNGGLMTIAF 8
             EC + E   ACD GCNGGL   A+
Sbjct:   176 ECMEYEGEEACDEGCNGGLQPNAY 199


to_Entrezto_Relatedto_Relatedto_ec >gi|118124|sp|P25250|CYS2_HORVU  CYSTEINE PROTEINASE EP-B 4 PRECURSOR
            >gi|82386|pir||JQ1110 cysteine proteinase (EC 3.4.22.-) EP-B 4
            precursor - barley >gi|1146118|gb|AAA85036.1| (U19384) cysteine
            proteinase EPB2 precursor [Hordeum vulgare]
            Length = 373

Frame -3 hits (HSPs):             __________________                ___   
Annotated Domains:         _____________ _______________________________  
                        __________________________________________________
Database sequence:     |                   |                    |         | 373
                       0                 150                  300
__________________

Annotated Domains:
   BLOCKS               BL00139A: Eukaryotic thiol (cysteine) pr 152..161
   BLOCKS               BL00139B: Eukaryotic thiol (cysteine) pr 194..202
   BLOCKS               BL00139C: Eukaryotic thiol (cysteine) pr 296..305
   BLOCKS               BL00139D: Eukaryotic thiol (cysteine) pr 313..329
   Entrez               active site: BY SIMILARITY.              158
   Entrez               active site: BY SIMILARITY.              297
   Entrez               active site: BY SIMILARITY.              318
   Entrez               glycosylation site: POTENTIAL.           130
   PFAM                 Peptidase_C1: Papain family cysteine pro 134..353
   PRINTS               PAPAIN1: Papain cysteine protease family 152..167
   PRINTS               PAPAIN2: Papain cysteine protease family 297..307
   PRINTS               PAPAIN3: Papain cysteine protease family 313..319
   PRODOM               PD154723: CYS1(1) CYS2(1)                24..43
   PRODOM               PD000247: CYSP(11) CATL(7) CYS2(5)       45..116
   PRODOM               PD000158: CYSP(14) CATL(9) CYS1(8)       134..352
   PROSITE              ALDEHYDE_DEHYDR_GLU: Aldehyde dehydrogen 272..279
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 313..332
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 152..163
   PROSITE              THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 295..305
__________________


  Minus Strand HSPs:

 Score = 253 (89.1 bits), Expect = 9.5e-24, Sum P(2) = 9.5e-24
 Identities = 63/134 (47%), Positives = 77/134 (57%), Frame = -3

Query:   406 SHQKLD--PFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKAP-----ILPTNDL 248
             SH K    P+R      G  D      F A+F+G      PS     P      L  +DL
Sbjct:    78 SHNKRGDHPYRLHLNRFGDMDQAE---FRATFVGDLRRDTPSKPPSVPGFMYAALNVSDL 134

Query:   247 PTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECD 68
             P   DWR+ GAVTGVK+QG CGSCW+FS V ++EG + + TG LVSLSEQ+L+DCD    
Sbjct:   135 PPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT--- 191

Query:    67 PEERGACDSGCNGGLMTIAFE 5
                  A + GC GGLM  AFE
Sbjct:   192 -----ADNDGCQGGLMDNAFE 207

 Score = 43 (15.1 bits), Expect = 9.5e-24, Sum P(2) = 9.5e-24
 Identities = 9/14 (64%), Positives = 10/14 (71%), Frame = -3

Query:    46 DSGCNGGLMTIAFE 5
             DSG +GGL  IA E
Sbjct:   335 DSGASGGLCGIAME 348


to_Entrezto_Relatedto_Relatedto_ec >gi|118120|sp|P25249|CYS1_HORVU  CYSTEINE PROTEINASE EP-B 1 PRECURSOR
            >gi|82385|pir||JQ1111 cysteine proteinase (EC 3.4.22.-) EP-B 1
            precursor - barley >gi|1146116|gb|AAA85035.1| (U19359) cysteine
            proteinase EPB1 precursor [Hordeum vulgare]
            Length = 371

Frame -3 hits (HSPs):             __________________                 __   
Annotated Domains:         _____________________________________________  
                        __________________________________________________
Database sequence:     |                    |                   |         | 371
                       0                  150                 300
__________________

Annotated Domains:
   BLOCKS               BL00139A: Eukaryotic thiol (cysteine) pr 152..161
   BLOCKS               BL00139B: Eukaryotic thiol (cysteine) pr 194..202
   BLOCKS               BL00139C: Eukaryotic thiol (cysteine) pr 296..305
   BLOCKS               BL00139D: Eukaryotic thiol (cysteine) pr 313..329
   DOMO                 DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 36..353
   Entrez               active site: BY SIMILARITY.              158
   Entrez               active site: BY SIMILARITY.              297
   Entrez               active site: BY SIMILARITY.              318
   Entrez               glycosylation site: POTENTIAL.           130
   PFAM                 Peptidase_C1: Papain family cysteine pro 134..353
   PRINTS               PAPAIN1: Papain cysteine protease family 152..167
   PRINTS               PAPAIN2: Papain cysteine protease family 297..307
   PRINTS               PAPAIN3: Papain cysteine protease family 313..319
   PRODOM               PD154723: CYS1(1) CYS2(1)                24..43
   PRODOM               PD000247: CYSP(11) CATL(7) CYS2(5)       45..116
   PRODOM               PD000158: CYSP(14) CATL(9) CYS1(8)       134..352
   PROSITE              ALDEHYDE_DEHYDR_GLU: Aldehyde dehydrogen 272..279
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 313..332
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 152..163
   PROSITE              THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 295..305
__________________


  Minus Strand HSPs:

 Score = 251 (88.4 bits), Expect = 1.5e-23, Sum P(2) = 1.5e-23
 Identities = 64/134 (47%), Positives = 80/134 (59%), Frame = -3

Query:   406 SHQKLD--PFRRPRRPPGSTDLTPGGVFAASFLG-LK---PLRLPS-DAQKAPILPTNDL 248
             SH K    P+R      G  D      F A+F+G L+   P + PS        L  +DL
Sbjct:    78 SHNKRGDHPYRLHLNRFGDMDQAE---FRATFVGDLRRDTPAKPPSVPGFMYAALNVSDL 134

Query:   247 PTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECD 68
             P   DWR+ GAVTGVK+QG CGSCW+FS V ++EG + + TG LVSLSEQ+L+DCD    
Sbjct:   135 PPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT--- 191

Query:    67 PEERGACDSGCNGGLMTIAFE 5
                  A + GC GGLM  AFE
Sbjct:   192 -----ADNDGCQGGLMDNAFE 207

 Score = 43 (15.1 bits), Expect = 1.5e-23, Sum P(2) = 1.5e-23
 Identities = 9/14 (64%), Positives = 10/14 (71%), Frame = -3

Query:    46 DSGCNGGLMTIAFE 5
             DSG +GGL  IA E
Sbjct:   335 DSGASGGLCGIAME 348


to_Entrezto_Relatedto_Related >gi|537437|gb|AAC35211.1|  (U12637) cysteine proteinase [Hemerocallis hybrid
            cultivar]
            Length = 359

Frame -3 hits (HSPs):                  _____________                      
                        __________________________________________________
Database sequence:     |                    |                    |        | 359
                       0                  150                  300

  Minus Strand HSPs:

 Score = 266 (93.6 bits), Expect = 3.9e-22, P = 3.9e-22
 Identities = 57/99 (57%), Positives = 67/99 (67%), Frame = -3

Query:   301 LRLPSDAQKAPILPTNDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTG 122
             LR   DA +      +DLPT  DWRE GAVTGVK+QG CGSCW+FS V A+EG + + T 
Sbjct:   112 LRGVKDAGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGSCWAFSTVVAVEGINQIKTN 171

Query:   121 ELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5
             ELVSLSEQQLVDCD +         +SGCNGGLM  AF+
Sbjct:   172 ELVSLSEQQLVDCDTK---------NSGCNGGLMDYAFD 201


to_Entrezto_Relatedto_Related >gi|1834307|dbj|BAA09820.1|  (D63670) cysteine proteinase [Spirometra
            erinaceieuropaei] >gi|1834309|dbj|BAA09821.1| (D63671) cysteine
            proteinase [Spirometra erinaceieuropaei]
            Length = 336

Frame -3 hits (HSPs):               _________________                     
                        __________________________________________________
Database sequence:     |                      |                     |     | 336
                       0                    150                   300

  Minus Strand HSPs:

 Score = 265 (93.3 bits), Expect = 5.0e-22, P = 5.0e-22
 Identities = 61/117 (52%), Positives = 76/117 (64%), Frame = -3

Query:   355 TDLTPGGVFAASFLGLKPLRLPSDAQKAPI-LPTND-LPTDFDWREHGAVTGVKNQGSCG 182
             +DLTPG  FA  +L L+ + L    +K  + +P  + LP   +WRE GAVT VKNQG CG
Sbjct:    85 SDLTPGE-FAERYLCLRGIVLTKLRRKEAVSVPLKENLPDSVNWRERGAVTSVKNQGQCG 143

Query:   181 SCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5
             SCWSFSA GA+EGA  + TG L SLSEQQL+DC  +         + GCNGGLM  AF+
Sbjct:   144 SCWSFSANGAIEGAIQIKTGALRSLSEQQLMDCSWDYG-------NQGCNGGLMPQAFQ 195


to_Entrezto_Relatedto_Relatedto_ec >gi|1085124|pir||JX0366  cysteine endopeptidase (EC 3.4.22.-) precursor -
            silkworm >gi|957281|gb|AAB33990.1| (S77508) cysteine proteinase,
            BCP {EC 3.4.22.-} [Bombyx mori=silkmoths, pupae, eggs, Peptide, 344
            aa]
            Length = 344

Frame -3 hits (HSPs):                    _____________                    
Annotated Domains:      __________________________________________________
                        __________________________________________________
Database sequence:     |                     |                     |      | 344
                       0                   150                   300
__________________

Annotated Domains:
   DOMO                 DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 124..343
   Entrez               domain: signal sequence                  1..16
   Entrez               domain: propeptide                       17..120
   Entrez               binding site: carbohydrate (Asn) (covale 98
   Entrez               active site: Cys, His, Asn               151
   Entrez               active site: Cys, His, Asn               290
   Entrez               active site: Cys, His, Asn               311
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 306..325
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 145..156
   PROSITE              THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 288..298
__________________


  Minus Strand HSPs:

 Score = 262 (92.2 bits), Expect = 1.0e-21, P = 1.0e-21
 Identities = 53/88 (60%), Positives = 62/88 (70%), Frame = -3

Query:   268 ILPTN-DLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQL 92
             I P N  LP   DWR+HGAVT +K+QG CGSCWSFS  GALEG HF  +G LVSLSEQ L
Sbjct:   120 ISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNL 179

Query:    91 VDCDLECDPEERGACDSGCNGGLMTIAFE 5
             +DC      E+ G  ++GCNGGLM  AF+
Sbjct:   180 IDCS-----EQYG--NNGCNGGLMDNAFK 201


to_Entrezto_Relatedto_Related >gi|7435804|pir||T03694  cysteine proteinase (EC 3.4.22.-) - rice
            >gi|1514953|dbj|BAA11170.1| (D76415) cysteine proteinase [Oryza
            sativa]
            Length = 368

Frame -3 hits (HSPs):             __________________                      
                        __________________________________________________
Database sequence:     |                    |                   |         | 368
                       0                  150                 300

  Minus Strand HSPs:

 Score = 260 (91.5 bits), Expect = 1.7e-21, P = 1.7e-21
 Identities = 64/126 (50%), Positives = 75/126 (59%), Frame = -3

Query:   373 RRPPGSTDLTPGG-----VFAASFLGLKPLRLPSDAQKAPILP------TNDLPTDFDWR 227
             +R PG   L   G      F A+F G     L  D   AP LP        DLP   DWR
Sbjct:    81 KRAPGYAPLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWR 140

Query:   226 EHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGAC 47
               GAVTGVK+QG CGSCW+FS V ++EG + + TG LVSLSEQ+L+DCD         A 
Sbjct:   141 RKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDT--------AD 192

Query:    46 DSGCNGGLMTIAFE 5
             +SGC GGLM  AFE
Sbjct:   193 NSGCQGGLMENAFE 206


to_Entrezto_Relatedto_Related >gi|4426617|gb|AAD20453.1|  (AF099203) cysteine endopeptidase precursor [Oryza
            sativa]
            Length = 368

Frame -3 hits (HSPs):             __________________                      
                        __________________________________________________
Database sequence:     |                    |                   |         | 368
                       0                  150                 300

  Minus Strand HSPs:

 Score = 259 (91.2 bits), Expect = 2.2e-21, P = 2.2e-21
 Identities = 64/126 (50%), Positives = 75/126 (59%), Frame = -3

Query:   373 RRPPGSTDLTPGG-----VFAASFLGLKPLRLPSDAQKAPILP------TNDLPTDFDWR 227
             +R PG   L   G      F A+F G     L  D   AP LP        DLP   DWR
Sbjct:    81 KRAPGYPPLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWR 140

Query:   226 EHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGAC 47
               GAVTGVK+QG CGSCW+FS V ++EG + + TG LVSLSEQ+L+DCD         A 
Sbjct:   141 RKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDT--------AD 192

Query:    46 DSGCNGGLMTIAFE 5
             +SGC GGLM  AFE
Sbjct:   193 NSGCQGGLMENAFE 206


to_Entrezto_Relatedto_Related >gi|5761329|dbj|BAA83473.1|  (AB004819) cysteine endopeptidase [Oryza sativa]
            Length = 371

Frame -3 hits (HSPs):                _______________                      
                        __________________________________________________
Database sequence:     |                    |                   |         | 371
                       0                  150                 300

  Minus Strand HSPs:

 Score = 258 (90.8 bits), Expect = 2.8e-21, P = 2.8e-21
 Identities = 59/109 (54%), Positives = 69/109 (63%), Frame = -3

Query:   331 FAASFLGLKPLRLPSDAQKAPILP------TNDLPTDFDWREHGAVTGVKNQGSCGSCWS 170
             F A+F G     L  D   AP LP        DLP   DWR  GAVTGVK+QG CGSCW+
Sbjct:   102 FRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCGSCWA 161

Query:   169 FSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5
             FS V ++EG + + TG LVSLSEQ+L+DCD         A +SGC GGLM  AFE
Sbjct:   162 FSTVVSVEGINAIRTGRLVSLSEQELIDCDT--------ADNSGCQGGLMENAFE 208


to_Entrezto_Related >gi|415567|gb|AAB28289.1|  cysteine proteinase=39 kda activated form [Bombyx
            mori=silkworms, eggs, Peptide, 176 aa]
            Length = 176

Frame -3 hits (HSPs):   ______________________                            
Annotated Domains:           ____                                    ____ 
                        __________________________________________________
Database sequence:     |             |              |             |       | 176
                       0            50            100           150
__________________

Annotated Domains:
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 19..30
   PROSITE              THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 162..172
__________________


  Minus Strand HSPs:

 Score = 257 (90.5 bits), Expect = 3.5e-21, P = 3.5e-21
 Identities = 50/82 (60%), Positives = 59/82 (71%), Frame = -3

Query:   250 LPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLEC 71
             LP   DWR+HGAVT +K+QG CGSCWSFS  GALEG HF  +G LVSLSEQ L+DC    
Sbjct:     1 LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS--- 57

Query:    70 DPEERGACDSGCNGGLMTIAFE 5
               E+ G  ++GCNGGLM  AF+
Sbjct:    58 --EQYG--NNGCNGGLMDNAFK 75


to_Entrezto_Relatedto_Relatedto_ec >gi|2118132|pir||JC4848  cysteine proteinase (EC 3.4.22.-) - Douglas fir
            >gi|1208549|gb|AAC49455.1| (U41902) Pseudotzain [Pseudotsuga
            menziesii]
            Length = 454

Frame -3 hits (HSPs):              ____________                           
Annotated Domains:                      __             _____        __    
                        __________________________________________________
Database sequence:     |                |               |                || 454
                       0              150             300              450
__________________

Annotated Domains:
   PROSITE              PA2_HIS: Phospholipase A2 histidine acti 406..413
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 307..326
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 150..161
   PROSITE              THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 290..300
__________________


  Minus Strand HSPs:

 Score = 260 (91.5 bits), Expect = 5.1e-21, P = 5.1e-21
 Identities = 58/109 (53%), Positives = 70/109 (64%), Frame = -3

Query:   331 FAASFLGLK---PLRLP-SDAQKAPILPTNDLPTDFDWREHGAVTGVKNQGSCGSCWSFS 164
             F A++LG K     RL  S + +       DLP   DWRE GAVT VKNQGSCGSCW+FS
Sbjct:   101 FKAAYLGTKLDAKKRLSRSPSPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGSCWAFS 160

Query:   163 AVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5
              V A+EG + + TG L SLSEQ+LVDCD         + + GCNGGLM  AF+
Sbjct:   161 TVAAVEGINQIVTGNLTSLSEQELVDCDT--------SYNQGCNGGLMDYAFQ 205


to_Entrezto_Relatedto_Related >gi|2160175|gb|AAB60738.1|  (AC000132) Strong similarity to Dianthus cysteine
            proteinase (gb|U17135). [Arabidopsis thaliana]
            Length = 416

Frame -3 hits (HSPs):             _____________                           
                        __________________________________________________
Database sequence:     |                 |                 |              | 416
                       0               150               300

  Minus Strand HSPs:

 Score = 258 (90.8 bits), Expect = 5.5e-21, P = 5.5e-21
 Identities = 58/109 (53%), Positives = 74/109 (67%), Frame = -3

Query:   331 FAASFLGLKPLRLPSD--AQKAPILPTN-DLPTDFDWREHGAVTGVKNQGSCGSCWSFSA 161
             F AS LGL  +  PS   A K   L  +  +P   DWR+ GAVT VK+QGSCG+CWSFSA
Sbjct:    87 FKASRLGLS-VSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSA 145

Query:   160 VGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5
              GA+EG + + TG+L+SLSEQ+L+DCD         + ++GCNGGLM  AFE
Sbjct:   146 TGAMEGINQIVTGDLISLSEQELIDCDK--------SYNAGCNGGLMDYAFE 189


to_Entrezto_Relatedto_Related >gi|5777611|emb|CAB53397.1|  (AJ245868) cysteine protease [Medicago sativa]
            Length = 209

Frame -3 hits (HSPs):   _______________                                   
                        __________________________________________________
Database sequence:     |           |           |           |           |  | 209
                       0          50         100         150         200

  Minus Strand HSPs:

 Score = 255 (89.8 bits), Expect = 5.7e-21, P = 5.7e-21
 Identities = 47/61 (77%), Positives = 52/61 (85%), Frame = -3

Query:   187 CGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAF 8
             CGS W+FS  GALEGA++L TG+LVSLSEQQLVDCD  CDPEER +CDSGCNGGLM  AF
Sbjct:     1 CGSGWAFSTTGALEGANYLATGKLVSLSEQQLVDCDHVCDPEERNSCDSGCNGGLMNNAF 60

Query:     7 E 5
             E
Sbjct:    61 E 61


to_Entrezto_Relatedto_Relatedto_ec >gi|1079183|pir||A53810  cathepsin L (EC 3.4.22.15) precursor - flesh fly
            (Sarcophaga peregrina) >gi|505140|dbj|BAA03970.1| (D16533)
            cathepsin L precursor [Sarcophaga peregrina]
            Length = 339

Frame -3 hits (HSPs):                    ____________                     
Annotated Domains:        ________________________________________________
                        __________________________________________________
Database sequence:     |                     |                      |     | 339
                       0                   150                    300
__________________

Annotated Domains:
   DOMO                 DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 18..338
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 301..320
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 140..151
   PROSITE              THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 283..293
__________________


  Minus Strand HSPs:

 Score = 254 (89.4 bits), Expect = 7.3e-21, P = 7.3e-21
 Identities = 50/81 (61%), Positives = 57/81 (70%), Frame = -3

Query:   250 LPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLEC 71
             +P   DWREHGAVTGVK+QG CGSCW+FS+ GALEG HF   G LVSLSEQ LVDC  + 
Sbjct:   122 VPKSVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKY 181

Query:    70 DPEERGACDSGCNGGLMTIAF 8
                     ++GCNGGLM  AF
Sbjct:   182 G-------NNGCNGGLMDNAF 195


to_Entrezto_Relatedto_Relatedto_ec >gi|399190|sp|Q02765|CATS_RAT  CATHEPSIN S PRECURSOR >gi|348347|pir||A45087
            cathepsin S (EC 3.4.22.27) - rat >gi|203650|gb|AAA40994.1| (L03201)
            cathepsin S precursor [Rattus norvegicus]
            Length = 330

Frame -3 hits (HSPs):              __________________                     
Annotated Domains:      __________________________________________________
                        __________________________________________________
Database sequence:     |                      |                      |    | 330
                       0                    150                    300
__________________

Annotated Domains:
   DOMO                 DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 16..329
   Entrez               active site: BY SIMILARITY.              137
   Entrez               active site: BY SIMILARITY.              277
   Entrez               active site: BY SIMILARITY.              297
   Entrez               glycosylation site: POTENTIAL.           100
   Entrez               glycosylation site: POTENTIAL.           110
   PFAM                 Peptidase_C1: Papain family cysteine pro 113..329
   PRINTS               PAPAIN1: Papain cysteine protease family 131..146
   PRINTS               PAPAIN2: Papain cysteine protease family 277..287
   PRINTS               PAPAIN3: Papain cysteine protease family 292..298
   PRODOM               PD171175: CATS_RAT                       1..24
   PRODOM               PD000247: CYSP(11) CATL(7) CYS2(5)       26..97
   PRODOM               PD000158: CYSP(14) CATL(9) CYS1(8)       113..327
   PROSITE              LEUCINE_ZIPPER: Leucine zipper pattern.  280..301
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 292..311
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 131..142
   PROSITE              THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 275..285
__________________


  Minus Strand HSPs:

 Score = 253 (89.1 bits), Expect = 9.4e-21, P = 9.4e-21
 Identities = 60/116 (51%), Positives = 73/116 (62%), Frame = -3

Query:   352 DLTPGGVFAASFLGLKPLRLPSDAQKAPILPTND---LPTDFDWREHGAVTGVKNQGSCG 182
             D+TP  V    ++G   LR+P    ++  L ++    LP   DWRE G VT VK QGSCG
Sbjct:    80 DMTPEEVIG--YMG--SLRIPRPWNRSGTLKSSSNQTLPDSVDWREKGCVTNVKYQGSCG 135

Query:   181 SCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5
             SCW+FSA GALEG   L TG+LVSLS Q LVDC  E   E+ G  + GC GG MT AF+
Sbjct:   136 SCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTE---EKYG--NKGCGGGFMTEAFQ 189


to_Entrezto_Relatedto_Relatedto_ec >gi|1706263|sp|P54640|CYS5_DICDI  CYSTEINE PROTEINASE 5 PRECURSOR
            >gi|1222694|gb|AAA92018.1| (L36205) CP5 [Dictyostelium discoideum]
            Length = 344

Frame -3 hits (HSPs):                  ____________                       
Annotated Domains:      __________________________________________________
                        __________________________________________________
Database sequence:     |                     |                     |      | 344
                       0                   150                   300
__________________

Annotated Domains:
   BLOCKS               BL00139A: Eukaryotic thiol (cysteine) pr 130..139
   BLOCKS               BL00139B: Eukaryotic thiol (cysteine) pr 171..179
   BLOCKS               BL00139C: Eukaryotic thiol (cysteine) pr 271..280
   BLOCKS               BL00139D: Eukaryotic thiol (cysteine) pr 306..322
   DOMO                 DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 20..343
   Entrez               Domain: SER-RICH.                        196..340
   Entrez               active site: BY SIMILARITY.              136
   Entrez               active site: BY SIMILARITY.              272
   Entrez               active site: BY SIMILARITY.              311
   Entrez               glycosylation site: POTENTIAL.           110
   Entrez               glycosylation site: POTENTIAL.           297
   PFAM                 Peptidase_C1: Papain family cysteine pro 112..343
   PRINTS               PAPAIN1: Papain cysteine protease family 130..145
   PRINTS               PAPAIN2: Papain cysteine protease family 272..282
   PRINTS               PAPAIN3: Papain cysteine protease family 306..312
   PRODOM               PD151262: CYS4(1) CYS5(1) Q94503(1)      1..28
   PRODOM               PD000247: CYSP(11) CATL(7) CYS2(5)       30..113
   PRODOM               PD000158: CYSP(14) CATL(9) CYS1(8)       115..342
   PROSITE              THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 306..325
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 130..141
   PROSITE              THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 270..280
__________________


  Minus Strand HSPs:

 Score = 252 (88.7 bits), Expect = 1.2e-20, P = 1.2e-20
 Identities = 51/85 (60%), Positives = 55/85 (64%), Frame = -3

Query:   259 TNDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCD 80
             TN      DWR  GAVT VKNQG CG CWSFS  G+ EGAHF   GELVSLSEQ L+DC 
Sbjct:   109 TNSSAASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCS 168

Query:    79 LECDPEERGACDSGCNGGLMTIAFE 5
              E         +SGC+GGLMT AFE
Sbjct:   169 TE---------NSGCDGGLMTYAFE 184


to_Entrezto_Relatedto_Related >gi|600111|emb|CAA84378.1|  (Z34895) cysteine proteinase [Vicia sativa]
            Length = 359

Frame -3 hits (HSPs):                    ___________                      
                        __________________________________________________
Database sequence:     |                    |                    |        | 359
                       0                  150                  300

  Minus Strand HSPs:

 Score = 251 (88.4 bits), Expect = 1.5e-20, P = 1.5e-20
 Identities = 50/83 (60%), Positives = 60/83 (72%), Frame = -3

Query:   253 DLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLE 74
             D+P+  DWR  GAVTGVK+QG CGSCW+FS + A+EG + + T +LVSLSEQQLVDCD E
Sbjct:   127 DVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSLSEQQLVDCDTE 186

Query:    73 CDPEERGACDSGCNGGLMTIAFE 5
                 E    + GCNGGLM  AFE
Sbjct:   187 ----E----NEGCNGGLMEYAFE 201


to_Entrezto_Relatedto_Related >gi|1076552|pir||S49166  cysteine proteinase precursor - spring vetch
            Length = 357

Frame -3 hits (HSPs):                    ____________                     
Annotated Domains:          ____________________________________________  
                        __________________________________________________
Database sequence:     |                    |                    |        | 357
                       0                  150                  300
__________________

Annotated Domains:
   DOMO                 DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 30..340
   PROSITE              THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 146..157
   PROSITE              THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 284..294
__________________


  Minus Strand HSPs:

 Score = 251 (88.4 bits), Expect = 1.5e-20, P = 1.5e-20
 Identities = 50/83 (60%), Positives = 60/83 (72%), Frame = -3

Query:   253 DLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLE 74
             D+P+  DWR  GAVTGVK+QG CGSCW+FS + A+EG + + T +LVSLSEQQLVDCD E
Sbjct:   127 DVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSLSEQQLVDCDTE 186

Query:    73 CDPEERGACDSGCNGGLMTIAFE 5
                 E    + GCNGGLM  AFE
Sbjct:   187 ----E----NEGCNGGLMEYAFE 201


to_Entrezto_Relatedto_Related >gi|7381610|gb|AAF61565.1|AF227957_1  (AF227957) cathepsin L-like proteinase
            precursor [Boophilus microplus]
            Length = 332

Frame -3 hits (HSPs):               _________________                     
                        __________________________________________________
Database sequence:     |                      |                      |    | 332
                       0                    150                    300

  Minus Strand HSPs:

 Score = 251 (88.4 bits), Expect = 1.5e-20, P = 1.5e-20
 Identities = 58/109 (53%), Positives = 67/109 (61%), Frame = -3

Query:   331 FAASFLGLKPLRLPSDAQKAPILPTND--LPTDFDWREHGAVTGVKNQGSCGSCWSFSAV 158
             FA  F G    R    +   P    ND  LP   DWR+ GAVT VK+QG CGSCW+FSA 
Sbjct:    87 FARIFNGHHGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSAT 146

Query:   157 GALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5
             G+LEG HFL  GELVSLSEQ LVDC      +  G  ++GC GGLM  AF+
Sbjct:   147 GSLEGQHFLKNGELVSLSEQNLVDCS-----QSFG--NNGCEGGLMEDAFK 190


to_Entrezto_Relatedto_Related >gi|1185457|gb|AAA87848.1|  (U38475) cathepsin L [Schistosoma japonicum]
            Length = 224

Frame -3 hits (HSPs):   ___________________                               
                        __________________________________________________
Database sequence:     |          |           |          |          |     | 224
                       0         50         100        150        200

  Minus Strand HSPs:

 Score = 250 (88.0 bits), Expect = 1.9e-20, P = 1.9e-20
 Identities = 50/89 (56%), Positives = 59/89 (66%), Frame = -3

Query:   271 PILPTNDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQL 92
             P     D+P +FDWRE GAVT VKNQG CGSCW+FS  G +E   F  TG+L+SLSEQQL
Sbjct:     3 PRREVGDIPNNFDWREKGAVTEVKNQGMCGSCWAFSTTGNIESQWFRKTGKLLSLSEQQL 62

Query:    91 VDCDLECDPEERGACDSGCNGGLMTIAFE 5
             VDCD         + D GCNGGL + A+E
Sbjct:    63 VDCD---------SLDDGCNGGLPSNAYE 82


to_Entrezto_Relatedto_Related >gi|7435775|pir||JC5443  cathepsin L-like cysteine proteinase (EC 3.4.-.-) c1 -
            Maize weevil >gi|2804262|dbj|BAA24442.1| (D82884) cysteine
            proteinase [Sitophilus zeamais]
            Length = 338

Frame -3 hits (HSPs):                   _____________                     
                        __________________________________________________
Database sequence:     |                      |                     |     | 338
                       0                    150                   300

  Minus Strand HSPs:

 Score = 250 (88.0 bits), Expect = 1.9e-20, P = 1.9e-20
 Identities = 54/87 (62%), Positives = 60/87 (68%), Frame = -3

Query:   268 ILPTN-DLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQL 92
             I P N  LP   DWR+ GAVT VK+QG CGSCWSFSA G+LEG HF  TG+LVSLSEQ L
Sbjct:   114 ISPANVKLPDTVDWRDKGAVTEVKDQGHCGSCWSFSATGSLEGQHFRKTGKLVSLSEQNL 173

Query:    91 VDCDLECDPEERGACDSGCNGGLMTIAF 8
             VDC         G  ++GCNGGLM  AF
Sbjct:   174 VDCS-----GRYG--NNGCNGGLMDNAF 194


WARNING:  HSPs involving 552 database sequences were not reported due to the
          limiting value of parameter B = 50.


Parameters:
  filter=none
  matrix=BLOSUM62
  V=50
  B=50
  E=10
  gi
  H=1
  sort_by_pvalue
  echofilter

  ctxfactor=5.98

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   Std.    0   BLOSUM62                                 0.318   0.135   0.401  
   +3      0   BLOSUM62        0.318   0.135   0.401    0.331   0.140   0.459  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +2      0   BLOSUM62        0.318   0.135   0.401    0.327   0.139   0.419  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +1      0   BLOSUM62        0.318   0.135   0.401    0.339   0.148   0.513  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -1      0   BLOSUM62        0.318   0.135   0.401    0.342   0.148   0.476  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -2      0   BLOSUM62        0.318   0.135   0.401    0.357   0.157   0.611  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -3      0   BLOSUM62        0.318   0.135   0.401    0.318   0.138   0.440  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E    S W   T  X   E2     S2
   +3      0      138       137       10.  72 3  12 22  0.11    33
                                                    30  0.10    36
   +2      0      138       137       10.  72 3  12 22  0.11    33
                                                    30  0.10    36
   +1      0      139       138       10.  72 3  12 22  0.12    33
                                                    30  0.10    36
   -1      0      139       138       10.  72 3  12 22  0.12    33
                                                    30  0.10    36
   -2      0      138       138       10.  72 3  12 22  0.12    33
                                                    30  0.10    36
   -3      0      138       137       10.  72 3  12 22  0.11    33
                                                    30  0.10    36


Statistics:

  Database:  /usr/local/dot5/sl_home/beauty/seqdb/blast/nr
    Title:  nr
    Release date:  unknown
    Posted date:  8:50 PM CDT May 27, 2000
    Format:  BLAST
  # of letters in database:  158,518,215
  # of sequences in database:  505,245
  # of database sequences satisfying E:  602
  No. of states in DFA:  589 (58 KB)
  Total size of DFA:  176 KB (192 KB)
  Time to generate neighborhood:  0.01u 0.01s 0.02t  Elapsed: 00:00:00
  No. of threads or processors used:  4
  Search cpu time:  214.56u 1.43s 215.99t  Elapsed: 00:02:10
  Total cpu time:  214.67u 1.50s 216.17t  Elapsed: 00:02:11
  Start:  Thu Feb 15 02:48:14 2001   End:  Thu Feb 15 02:50:25 2001

WARNINGS ISSUED:  2

Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000