WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= SSH8H02.SEQ(1>446) (417 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 505,245 sequences; 158,518,215 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 8 Sequences : less than 8 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 2264 452 |======================================================== 6310 1812 243 |============================== 3980 1569 209 |========================== 2510 1360 247 |============================== 1580 1113 164 |==================== 1000 949 119 |============== 631 830 59 |======= 398 771 57 |======= 251 714 35 |==== 158 679 31 |=== 100 648 16 |== 63.1 632 12 |= 39.8 620 6 |: 25.1 614 8 |= 15.8 606 4 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 602 <<<<<<<<<<<<<<<<< 10.0 602 5 |: 6.31 597 1 |: 3.98 596 3 |: 2.51 593 2 |: 1.58 591 9 |= 1.00 582 4 |: 0.63 578 3 |: 0.40 575 5 |: 0.25 570 3 |: 0.16 567 2 |: 0.10 565 6 |: 0.063 559 8 |= 0.040 551 9 |= 0.025 542 1 |: 0.016 541 2 |: 0.010 539 0 | 0.0063 539 3 |: 0.0040 536 3 |: 0.0025 533 0 | 0.0016 533 2 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|7435795|pir||T12040cysteine proteinase (EC 3.4.22.... -3 503 3.0e-47 1 gi|7242888|dbj|BAA92495.1|(AB038598) cysteine proteas... -3 492 4.4e-46 1 gi|7381221|gb|AAF61441.1|AF138265_1(AF138265) papain-... -3 487 1.5e-45 1 gi|7211741|gb|AAF40414.1|AF216783_1(AF216783) papain-... -3 486 1.9e-45 1 gi|7381219|gb|AAF61440.1|AF138264_1(AF138264) papain-... -3 486 1.9e-45 1 gi|5051468|emb|CAB44983.1|(AJ242994) putative preproc... -3 483 4.0e-45 1 gi|4757570|gb|AAD29084.1|AF082181_1(AF082181) cystein... -3 480 8.3e-45 1 gi|7211745|gb|AAF40416.1|AF216785_1(AF216785) papain-... -3 478 1.3e-44 1 gi|542004|pir||S42882cysteine proteinase (EC 3.4.22.-... -3 476 2.2e-44 1 gi|100203|pir||S24988cysteine proteinase (EC 3.4.22.-... -3 474 3.6e-44 1 gi|1401242|gb|AAB67878.1|(U59465) pre-pro-cysteine pr... -3 474 3.6e-44 1 gi|1172872|sp|P43296|RD19_ARATHCYSTEINE PROTEINASE RD... -3 472 5.8e-44 1 gi|1168251|sp|P43295|A494_ARATHPROBABLE CYSTEINE PROT... -3 469 1.2e-43 1 gi|4567274|gb|AAD23687.1|AC006841_20(AC006841) putati... -3 469 1.2e-43 1 gi|7435816|pir||T08844cysteine proteinase (EC 3.4.22.... -3 466 2.5e-43 1 gi|7435793|pir||T09528probable cysteine proteinase (E... -3 466 2.5e-43 1 gi|118150|sp|P25804|CYSP_PEACYSTEINE PROTEINASE 15A P... -3 463 5.2e-43 1 gi|7435803|pir||D71428cysteine proteinase (EC 3.4.22.... -3 463 5.2e-43 1 gi|419782|pir||S30150cysteine proteinase (EC 3.4.22.-... -3 461 8.5e-43 1 gi|7211743|gb|AAF40415.1|AF216784_1(AF216784) papain-... -3 460 1.1e-42 1 gi|419781|pir||S30149cysteine proteinase (EC 3.4.22.-... -3 457 2.3e-42 1 gi|1706260|sp|Q10716|CYS1_MAIZECYSTEINE PROTEINASE 1 ... -3 418 3.1e-38 1 gi|7435815|pir||T08845cysteine proteinase (EC 3.4.22.... -3 404 9.3e-37 1 gi|5679322|gb|AAD46920.1|AF167986_1(AF167986) putativ... -3 322 1.3e-31 2 gi|7435792|pir||T12042cysteine proteinase (EC 3.4.22.... -3 318 6.3e-30 2 gi|1362047|pir||S55923cysteine proteinase (EC 3.4.22.... -3 334 2.4e-29 1 gi|7435814|pir||T10949cysteine proteinase (EC 3.4.22.... -3 327 1.3e-28 1 gi|7435812|pir||T06726cysteine proteinase (EC 3.4.22.... -3 323 3.6e-28 1 gi|1353726|gb|AAB01769.1|(U42758) cysteine proteinase... -3 300 9.8e-26 1 gi|118117|sp|P04988|CYS1_DICDICYSTEINE PROTEINASE 1 P... -3 290 1.1e-24 1 gi|118124|sp|P25250|CYS2_HORVUCYSTEINE PROTEINASE EP-... -3 253 9.5e-24 2 gi|118120|sp|P25249|CYS1_HORVUCYSTEINE PROTEINASE EP-... -3 251 1.5e-23 2 gi|537437|gb|AAC35211.1|(U12637) cysteine proteinase ... -3 266 3.9e-22 1 gi|1834307|dbj|BAA09820.1|(D63670) cysteine proteinas... -3 265 5.0e-22 1 gi|1085124|pir||JX0366cysteine endopeptidase (EC 3.4.... -3 262 1.0e-21 1 gi|7435804|pir||T03694cysteine proteinase (EC 3.4.22.... -3 260 1.7e-21 1 gi|4426617|gb|AAD20453.1|(AF099203) cysteine endopept... -3 259 2.2e-21 1 gi|5761329|dbj|BAA83473.1|(AB004819) cysteine endopep... -3 258 2.8e-21 1 gi|415567|gb|AAB28289.1|cysteine proteinase=39 kda ac... -3 257 3.5e-21 1 gi|2118132|pir||JC4848cysteine proteinase (EC 3.4.22.... -3 260 5.1e-21 1 gi|2160175|gb|AAB60738.1|(AC000132) Strong similarity... -3 258 5.5e-21 1 gi|5777611|emb|CAB53397.1|(AJ245868) cysteine proteas... -3 255 5.7e-21 1 gi|1079183|pir||A53810cathepsin L (EC 3.4.22.15) prec... -3 254 7.3e-21 1 gi|399190|sp|Q02765|CATS_RATCATHEPSIN S PRECURSOR >gi... -3 253 9.4e-21 1 gi|1706263|sp|P54640|CYS5_DICDICYSTEINE PROTEINASE 5 ... -3 252 1.2e-20 1 gi|600111|emb|CAA84378.1|(Z34895) cysteine proteinase... -3 251 1.5e-20 1 gi|1076552|pir||S49166cysteine proteinase precursor -... -3 251 1.5e-20 1 gi|7381610|gb|AAF61565.1|AF227957_1(AF227957) catheps... -3 251 1.5e-20 1 gi|1185457|gb|AAA87848.1|(U38475) cathepsin L [Schist... -3 250 1.9e-20 1 gi|7435775|pir||JC5443cathepsin L-like cysteine prote... -3 250 1.9e-20 1 Locally-aligned regions (HSPs) with respect to query sequence: Locus_ID Frame -1 Hits gi|5679322 | _________ gi|7435792 | ____ __________________________________________________ Query sequence: | | | | 139 0 50 100 Locus_ID Frame -3 Hits gi|7435795 | gi|7242888 | gi|7381221 | gi|7211741 | gi|7381219 | gi|5051468 | gi|4757570 | gi|7211745 | gi|542004 | gi|100203 | gi|1401242 | gi|1172872 | gi|1168251 | gi|4567274 | gi|7435816 | gi|7435793 | gi|118150 | gi|7435803 | gi|419782 | gi|7211743 | gi|419781 | gi|1706260 | gi|7435815 | gi|5679322 | gi|7435792 | gi|1362047 | gi|7435814 | gi|7435812 | gi|1353726 | gi|118117 | gi|118124 |______ gi|118120 |______ gi|537437 | gi|1834307 | gi|1085124 | gi|7435804 | gi|4426617 | gi|5761329 | gi|415567 | gi|2118132 | gi|2160175 | gi|5777611 |_ gi|1079183 | gi|399190 | gi|1706263 | gi|600111 | gi|1076552 | gi|7381610 | gi|1185457 | gi|7435775 | __________________________________________________ Query sequence: | | | | 139 0 50 100
Use the and icons to retrieve links to Entrez:
WARNING: Descriptions of 552 database sequences were not reported due to the limiting value of parameter V = 50. >gi|7435795|pir||T12040 cysteine proteinase (EC 3.4.22.-) 2 precursor - kidney bean >gi|2511691|emb|CAB17075.1| (Z99953) cysteine proteinase precursor [Phaseolus vulgaris] Length = 365 Frame -3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 365 0 150 300 Minus Strand HSPs: Score = 503 (177.1 bits), Expect = 3.0e-47, P = 3.0e-47 Identities = 99/137 (72%), Positives = 107/137 (78%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKAPILPTNDLPTDF 236 RA+ H +LDP +DLTP F FLGLKPLRLP+ AQKAPILPTN+LP DF Sbjct: 81 RARLHAQLDP-SAVHGVTKFSDLTPAE-FHRKFLGLKPLRLPAHAQKAPILPTNNLPKDF 138 Query: 235 DWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEER 56 DWR+ GAVT VK+QGSCGSCWSFS GALEGAHFL TGELVSLSEQQLVDCD CDPEE Sbjct: 139 DWRDKGAVTNVKDQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEY 198 Query: 55 GACDSGCNGGLMTIAFE 5 G+CDSGCNGGLM AFE Sbjct: 199 GSCDSGCNGGLMNNAFE 215 >gi|7242888|dbj|BAA92495.1| (AB038598) cysteine protease [Vigna mungo] Length = 364 Frame -3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 364 0 150 300 Minus Strand HSPs: Score = 492 (173.2 bits), Expect = 4.4e-46, P = 4.4e-46 Identities = 97/137 (70%), Positives = 106/137 (77%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKAPILPTNDLPTDF 236 RA+ H +LDP +DLT F FLGLKPL LP++AQKAPILPTN+LP DF Sbjct: 80 RARLHAQLDP-SAVHGVTKFSDLT-AAEFQRQFLGLKPLGLPANAQKAPILPTNNLPKDF 137 Query: 235 DWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEER 56 DWR+ GAVT VK+QG+CGSCWSFS GALEGAHFL TGELVSLSEQQLVDCD CDPEE Sbjct: 138 DWRDKGAVTNVKDQGACGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEY 197 Query: 55 GACDSGCNGGLMTIAFE 5 GACDSGCNGGLM AFE Sbjct: 198 GACDSGCNGGLMNNAFE 214 >gi|7381221|gb|AAF61441.1|AF138265_1 (AF138265) papain-like cysteine proteinase isoform II [Ipomoea batatas] Length = 366 Frame -3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 366 0 150 300 Minus Strand HSPs: Score = 487 (171.4 bits), Expect = 1.5e-45, P = 1.5e-45 Identities = 95/137 (69%), Positives = 109/137 (79%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGL-KPLRLPSDAQKAPILPTNDLPTD 239 RAK HQ+LDP +DLTP F FLGL + L+ P+DA+ APILPT++LP+D Sbjct: 79 RAKQHQELDP-AAVHGVTQFSDLTPTE-FRRKFLGLNRRLKFPADAKTAPILPTDELPSD 136 Query: 238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59 FDWR+HGAVT VKNQG+CGSCWSFS GALEGA+FL TG+LVSLSEQQLVDCD ECDPEE Sbjct: 137 FDWRDHGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 196 Query: 58 RGACDSGCNGGLMTIAFE 5 G+CDSGCNGGLM AFE Sbjct: 197 AGSCDSGCNGGLMNSAFE 214 >gi|7211741|gb|AAF40414.1|AF216783_1 (AF216783) papain-like cysteine proteinase isoform I [Ipomoea batatas] Length = 368 Frame -3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 368 0 150 300 Minus Strand HSPs: Score = 486 (171.1 bits), Expect = 1.9e-45, P = 1.9e-45 Identities = 95/137 (69%), Positives = 109/137 (79%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGL-KPLRLPSDAQKAPILPTNDLPTD 239 RAK HQ+LDP +DLTP F FLGL + L+ P+DA+ APILPT++LP+D Sbjct: 81 RAKRHQELDP-AAVHGVTQFSDLTPTE-FRRKFLGLNRRLKFPADAKTAPILPTDELPSD 138 Query: 238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59 FDWR+HGAVT VKNQG+CGSCWSFS GALEGA+FL TG+LVSLSEQQLVDCD ECDPEE Sbjct: 139 FDWRDHGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 198 Query: 58 RGACDSGCNGGLMTIAFE 5 G+CDSGCNGGLM AFE Sbjct: 199 AGSCDSGCNGGLMNSAFE 216 >gi|7381219|gb|AAF61440.1|AF138264_1 (AF138264) papain-like cysteine proteinase isoform I [Ipomoea batatas] Length = 368 Frame -3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 368 0 150 300 Minus Strand HSPs: Score = 486 (171.1 bits), Expect = 1.9e-45, P = 1.9e-45 Identities = 95/137 (69%), Positives = 109/137 (79%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGL-KPLRLPSDAQKAPILPTNDLPTD 239 RAK HQ+LDP +DLTP F FLGL + L+ P+DA+ APILPT++LP+D Sbjct: 81 RAKRHQELDP-AAVHGVTQFSDLTPTE-FRRKFLGLNRRLKFPADAKTAPILPTDELPSD 138 Query: 238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59 FDWR+HGAVT VKNQG+CGSCWSFS GALEGA+FL TG+LVSLSEQQLVDCD ECDPEE Sbjct: 139 FDWRDHGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 198 Query: 58 RGACDSGCNGGLMTIAFE 5 G+CDSGCNGGLM AFE Sbjct: 199 AGSCDSGCNGGLMNSAFE 216 >gi|5051468|emb|CAB44983.1| (AJ242994) putative preprocysteine proteinase [Nicotiana tabacum] Length = 363 Frame -3 hits (HSPs): ___________________ __________________________________________________ Database sequence: | | | | 363 0 150 300 Minus Strand HSPs: Score = 483 (170.0 bits), Expect = 4.0e-45, P = 4.0e-45 Identities = 94/137 (68%), Positives = 106/137 (77%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKAPILPTNDLPTDF 236 RA+ HQ LDP S DLTP F ++LGL + +A+KAPILPT+DLP DF Sbjct: 77 RARRHQLLDPSAEHGITKFS-DLTPSE-FRRTYLGLHKPKPKLNAEKAPILPTSDLPADF 134 Query: 235 DWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEER 56 DWR+HGAVTGVKNQGSCGSCWSFS GA+EGAHFL TGELVSLSEQQLVDCD ECDPE++ Sbjct: 135 DWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQ 194 Query: 55 GACDSGCNGGLMTIAFE 5 ACD+GC GGLMT AFE Sbjct: 195 DACDAGCGGGLMTTAFE 211 >gi|4757570|gb|AAD29084.1|AF082181_1 (AF082181) cysteine proteinase precursor [Solanum melongena] Length = 363 Frame -3 hits (HSPs): ___________________ __________________________________________________ Database sequence: | | | | 363 0 150 300 Minus Strand HSPs: Score = 480 (169.0 bits), Expect = 8.3e-45, P = 8.3e-45 Identities = 96/137 (70%), Positives = 104/137 (75%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKAPILPTNDLPTDF 236 RA+ HQ LDP S DLTP F ++LGL R +AQKAPILPT+DLP DF Sbjct: 77 RARRHQLLDPTAEHGITQFS-DLTPSE-FRRTYLGLHKPRPKLNAQKAPILPTSDLPEDF 134 Query: 235 DWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEER 56 DWRE GAVTGVKNQGSCGSCWSFS GA+EGAHFL TGELVSLSEQQLVDCD ECD EE+ Sbjct: 135 DWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEEK 194 Query: 55 GACDSGCNGGLMTIAFE 5 CD+GCNGGLMT AFE Sbjct: 195 SECDAGCNGGLMTTAFE 211 >gi|7211745|gb|AAF40416.1|AF216785_1 (AF216785) papain-like cysteine proteinase isoform III [Ipomoea batatas] >gi|7381223|gb|AAF61442.1|AF138266_1 (AF138266) papain-like cysteine proteinase isoform III [Ipomoea batatas] Length = 366 Frame -3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 366 0 150 300 Minus Strand HSPs: Score = 478 (168.3 bits), Expect = 1.3e-44, P = 1.3e-44 Identities = 94/137 (68%), Positives = 108/137 (78%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGL-KPLRLPSDAQKAPILPTNDLPTD 239 RAK HQ+LDP +DLTP F FLGL + L+ P+DA+ APILPT++LP+D Sbjct: 79 RAKRHQQLDP-AAVHGVTQFSDLTPTE-FRRKFLGLNRRLKFPADAKTAPILPTDELPSD 136 Query: 238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59 FDWR+ GAVT VKNQG+CGSCWSFS GALEGA+FL TG+LVSLSEQQLVDCD ECDPEE Sbjct: 137 FDWRDRGAVTPVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 196 Query: 58 RGACDSGCNGGLMTIAFE 5 G+CDSGCNGGLM AFE Sbjct: 197 AGSCDSGCNGGLMNSAFE 214 >gi|542004|pir||S42882 cysteine proteinase (EC 3.4.22.-) precursor - spring vetch >gi|2129905|pir||S51818 cysteine proteinase precursor - spring vetch >gi|457756|emb|CAA82995.1| (Z30338) cysteine proteinase [Vicia sativa] Length = 358 Frame -3 hits (HSPs): ___________________ Annotated Domains: __ _______ __________________________________________________ Database sequence: | | | | 358 0 150 300 __________________ Annotated Domains: PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 316..335 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 145..156 PROSITE THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 292..302 __________________ Minus Strand HSPs: Score = 476 (167.6 bits), Expect = 2.2e-44, P = 2.2e-44 Identities = 97/137 (70%), Positives = 105/137 (76%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGL-KPLRLPSDAQKAPILPTNDLPTD 239 +AK HQKLDP S DLT F FLGL K LRLP+ AQKAPILPT +LP D Sbjct: 73 KAKLHQKLDPTAEHGITKFS-DLT-ASEFRRQFLGLNKRLRLPAHAQKAPILPTTNLPED 130 Query: 238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59 FDWRE GAVT VK+QGSCGSCW+FS GALEGAH+L TG+LVSLSEQQLVDCD CDPEE Sbjct: 131 FDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEE 190 Query: 58 RGACDSGCNGGLMTIAFE 5 G+CDSGCNGGLM AFE Sbjct: 191 AGSCDSGCNGGLMNNAFE 208 >gi|100203|pir||S24988 cysteine proteinase (EC 3.4.22.-) precursor - tomato (fragment) >gi|19195|emb|CAA78403.1| (Z14028) pre-pro-cysteine proteinase [Lycopersicon esculentum] Length = 361 Frame -3 hits (HSPs): ___________________ Annotated Domains: __ ____ __________________________________________________ Database sequence: | | | | 361 0 150 300 __________________ Annotated Domains: PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 316..335 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 146..157 __________________ Minus Strand HSPs: Score = 474 (166.9 bits), Expect = 3.6e-44, P = 3.6e-44 Identities = 95/137 (69%), Positives = 103/137 (75%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKAPILPTNDLPTDF 236 RAK HQ LDP S DLTP F ++LGL R +A+KAPILPT DLP+DF Sbjct: 75 RAKRHQLLDPSAEHGITQFS-DLTPSE-FRRTYLGLNKPRPNLNAEKAPILPTKDLPSDF 132 Query: 235 DWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEER 56 DWRE GAVT VKNQGSCGSCWSFS GA+EGAHFL TGELVSLSEQQLVDCD ECDP E+ Sbjct: 133 DWREKGAVTDVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPVEK 192 Query: 55 GACDSGCNGGLMTIAFE 5 CD+GCNGGLMT AFE Sbjct: 193 NDCDAGCNGGLMTTAFE 209 >gi|1401242|gb|AAB67878.1| (U59465) pre-pro-cysteine proteinase [Vicia faba] Length = 363 Frame -3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 363 0 150 300 Minus Strand HSPs: Score = 474 (166.9 bits), Expect = 3.6e-44, P = 3.6e-44 Identities = 96/137 (70%), Positives = 105/137 (76%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKP-LRLPSDAQKAPILPTNDLPTD 239 +AK HQKLDP S DLT F FLGLK LRLP+ AQKAPILPT +LP D Sbjct: 78 KAKLHQKLDPTAEHGITKFS-DLT-ASEFRRQFLGLKKRLRLPAHAQKAPILPTTNLPED 135 Query: 238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59 FDWRE GAVT VK+QGSCGSCW+FS GALEGAH+L TG+LVSLSEQQLVDCD CDPE+ Sbjct: 136 FDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQ 195 Query: 58 RGACDSGCNGGLMTIAFE 5 G+CDSGCNGGLM AFE Sbjct: 196 AGSCDSGCNGGLMNNAFE 213 >gi|1172872|sp|P43296|RD19_ARATH CYSTEINE PROTEINASE RD19A PRECURSOR >gi|541856|pir||JN0718 cysteine proteinase (EC 3.4.22.-) RD19A precursor, drought-inducible - Arabidopsis thaliana >gi|435618|dbj|BAA02373.1| (D13042) thiol protease [Arabidopsis thaliana] >gi|4539328|emb|CAB38829.1| (AL035679) drought-inducible cysteine proteinase RD19A precursor [Arabidopsis thaliana] >gi|7270892|emb|CAB80572.1| (AL161594) drought-inducible cysteine proteinase RD19A precursor [Arabidopsis thaliana] Length = 368 Frame -3 hits (HSPs): ____________________ Annotated Domains: _________________________________________________ __________________________________________________ Database sequence: | | | | 368 0 150 300 __________________ Annotated Domains: BLOCKS BL00139A: Eukaryotic thiol (cysteine) pr 153..162 BLOCKS BL00139B: Eukaryotic thiol (cysteine) pr 203..211 BLOCKS BL00139C: Eukaryotic thiol (cysteine) pr 301..310 BLOCKS BL00139D: Eukaryotic thiol (cysteine) pr 324..340 DOMO DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 40..360 Entrez active site: BY SIMILARITY. 159 Entrez active site: BY SIMILARITY. 302 Entrez active site: BY SIMILARITY. 329 Entrez glycosylation site: POTENTIAL. 253 PFAM Peptidase_C1: Papain family cysteine pro 135..360 PRINTS PAPAIN1: Papain cysteine protease family 153..168 PRINTS PAPAIN2: Papain cysteine protease family 302..312 PRINTS PAPAIN3: Papain cysteine protease family 324..330 PRODOM PD078495: RD19_ARATH 1..48 PRODOM PD000247: CYSP(11) CATL(7) CYS2(5) 50..118 PRODOM PD000158: CYSP(14) CATL(9) CYS1(8) 135..356 PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 324..343 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 153..164 PROSITE THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 300..310 __________________ Minus Strand HSPs: Score = 472 (166.2 bits), Expect = 5.8e-44, P = 5.8e-44 Identities = 94/137 (68%), Positives = 104/137 (75%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKP-LRLPSDAQKAPILPTNDLPTD 239 RA+ HQKLDP +DLT F LG++ +LP DA KAPILPT +LP D Sbjct: 81 RARRHQKLDP-SATHGVTQFSDLTRSE-FRKKHLGVRSGFKLPKDANKAPILPTENLPED 138 Query: 238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59 FDWR+HGAVT VKNQGSCGSCWSFSA GALEGA+FL TG+LVSLSEQQLVDCD ECDPEE Sbjct: 139 FDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 198 Query: 58 RGACDSGCNGGLMTIAFE 5 +CDSGCNGGLM AFE Sbjct: 199 ADSCDSGCNGGLMNSAFE 216 >gi|1168251|sp|P43295|A494_ARATH PROBABLE CYSTEINE PROTEINASE A494 PRECURSOR >gi|1076384|pir||S46535 cysteine proteinase (EC 3.4.22.-) (clone A1494) - Arabidopsis thaliana (fragment) >gi|516865|emb|CAA52403.1| (X74359) putative thiol protease [Arabidopsis thaliana] Length = 313 Frame -3 hits (HSPs): _______________________ Annotated Domains: _____________________________________ __________________________________________________ Database sequence: | | | | | | | | 313 0 50 100 150 200 250 300 __________________ Annotated Domains: Entrez active site: BY SIMILARITY. 108 Entrez active site: BY SIMILARITY. 251 Entrez active site: BY SIMILARITY. 278 PFAM Peptidase_C1: Papain family cysteine pro 84..309 PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 273..292 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 102..113 PROSITE THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 249..259 __________________ Minus Strand HSPs: Score = 469 (165.1 bits), Expect = 1.2e-43, P = 1.2e-43 Identities = 94/137 (68%), Positives = 103/137 (75%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKP-LRLPSDAQKAPILPTNDLPTD 239 RA HQK+DP R S DLT F LG+K +LP DA +APILPT +LP + Sbjct: 30 RAMRHQKMDPSARHGVTQFS-DLTRSE-FRRKHLGVKGGFKLPKDANQAPILPTQNLPEE 87 Query: 238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59 FDWR+ GAVT VKNQGSCGSCWSFS GALEGAHFL TG+LVSLSEQQLVDCD ECDPEE Sbjct: 88 FDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEE 147 Query: 58 RGACDSGCNGGLMTIAFE 5 G+CDSGCNGGLM AFE Sbjct: 148 EGSCDSGCNGGLMNSAFE 165 >gi|4567274|gb|AAD23687.1|AC006841_20 (AC006841) putative cysteine proteinase [Arabidopsis thaliana] Length = 361 Frame -3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 361 0 150 300 Minus Strand HSPs: Score = 469 (165.1 bits), Expect = 1.2e-43, P = 1.2e-43 Identities = 94/137 (68%), Positives = 103/137 (75%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKP-LRLPSDAQKAPILPTNDLPTD 239 RA HQK+DP R S DLT F LG+K +LP DA +APILPT +LP + Sbjct: 78 RAMRHQKMDPSARHGVTQFS-DLTRSE-FRRKHLGVKGGFKLPKDANQAPILPTQNLPEE 135 Query: 238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59 FDWR+ GAVT VKNQGSCGSCWSFS GALEGAHFL TG+LVSLSEQQLVDCD ECDPEE Sbjct: 136 FDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEE 195 Query: 58 RGACDSGCNGGLMTIAFE 5 G+CDSGCNGGLM AFE Sbjct: 196 EGSCDSGCNGGLMNSAFE 213 >gi|7435816|pir||T08844 cysteine proteinase (EC 3.4.22.-) isoform B - soybean (fragment) >gi|1619903|gb|AAB16996.1| (U71379) thiol protease isoform B [Glycine max] Length = 319 Frame -3 hits (HSPs): ___________________ __________________________________________________ Database sequence: | | | | | | | | 319 0 50 100 150 200 250 300 Minus Strand HSPs: Score = 466 (164.0 bits), Expect = 2.5e-43, P = 2.5e-43 Identities = 88/117 (75%), Positives = 95/117 (81%), Frame = -3 Query: 355 TDLTPGGVFAASFLGLKPLRLPSDAQKAPILPTNDLPTDFDWREHGAVTGVKNQGSCGSC 176 +DLTP F FLGLK +R P+ AQKAPILPT DLP DFDWR+ GAVT VK+QG CGSC Sbjct: 54 SDLTPAE-FRRQFLGLKAVRFPAHAQKAPILPTKDLPKDFDWRDKGAVTNVKDQGGCGSC 112 Query: 175 WSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5 WSFS GALEGA++L TGELVSLSEQQLVDCD CDPEE GACDSGCNGGLM AFE Sbjct: 113 WSFSTTGALEGAYYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFE 169 >gi|7435793|pir||T09528 probable cysteine proteinase (EC 3.4.22.-) precursor - chickpea >gi|3377952|emb|CAA08906.1| (AJ009878) cysteine proteinase [Cicer arietinum] Length = 362 Frame -3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 362 0 150 300 Minus Strand HSPs: Score = 466 (164.0 bits), Expect = 2.5e-43, P = 2.5e-43 Identities = 95/137 (69%), Positives = 105/137 (76%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKP-LRLPSDAQKAPILPTNDLPTD 239 +AK HQKLDP S DLT F FLGLK LRLP+ AQKAPILPTN+LP D Sbjct: 77 KAKLHQKLDPSAEHGVTKFS-DLT-ASEFRRQFLGLKKRLRLPAHAQKAPILPTNNLPED 134 Query: 238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59 FDWRE GAVT VK+QGSCGSCW+FS GALEGA++L TG+LVSLSEQQLVDCD CDP+E Sbjct: 135 FDWREKGAVTPVKDQGSCGSCWAFSTTGALEGANYLATGKLVSLSEQQLVDCDHVCDPDE 194 Query: 58 RGACDSGCNGGLMTIAFE 5 +CDSGCNGGLM AFE Sbjct: 195 YNSCDSGCNGGLMNNAFE 212 >gi|118150|sp|P25804|CYSP_PEA CYSTEINE PROTEINASE 15A PRECURSOR (TURGOR-RESPONSIVE PROTEIN 15A) >gi|100050|pir||S11862 cysteine proteinase (EC 3.4.22.-) - garden pea >gi|20679|emb|CAA38242.1| (X54358) 363 aa peptide [Pisum sativum] Length = 363 Frame -3 hits (HSPs): ____________________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | 363 0 150 300 __________________ Annotated Domains: DOMO DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 37..357 Entrez active site: BY SIMILARITY. 156 Entrez active site: BY SIMILARITY. 299 Entrez active site: BY SIMILARITY. 326 Entrez glycosylation site: POTENTIAL. 249 PFAM Peptidase_C1: Papain family cysteine pro 132..357 PRINTS PAPAIN1: Papain cysteine protease family 150..165 PRINTS PAPAIN2: Papain cysteine protease family 299..309 PRINTS PAPAIN3: Papain cysteine protease family 321..327 PRODOM PD031710: CYSP(1) Q41671(1) O81930(1) 1..45 PRODOM PD000247: CYSP(11) CATL(7) CYS2(5) 47..115 PRODOM PD000158: CYSP(14) CATL(9) CYS1(8) 132..353 PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 321..340 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 150..161 PROSITE THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 297..307 __________________ Minus Strand HSPs: Score = 463 (163.0 bits), Expect = 5.2e-43, P = 5.2e-43 Identities = 94/137 (68%), Positives = 103/137 (75%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKP-LRLPSDAQKAPILPTNDLPTD 239 +AK HQ DP S DLT F FLGLK LRLP+ AQKAPILPT +LP D Sbjct: 78 KAKLHQNRDPTAEHGITKFS-DLT-ASEFRRQFLGLKKRLRLPAHAQKAPILPTTNLPED 135 Query: 238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59 FDWRE GAVT VK+QGSCGSCW+FS GALEGAH+L TG+LVSLSEQQLVDCD CDPE+ Sbjct: 136 FDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQ 195 Query: 58 RGACDSGCNGGLMTIAFE 5 G+CDSGCNGGLM AFE Sbjct: 196 AGSCDSGCNGGLMNNAFE 213 >gi|7435803|pir||D71428 cysteine proteinase (EC 3.4.22.-) - Arabidopsis thaliana >gi|2244977|emb|CAB10398.1| (Z97340) cysteine proteinase like protein [Arabidopsis thaliana] >gi|7268368|emb|CAB78661.1| (AL161543) cysteine proteinase like protein [Arabidopsis thaliana] Length = 373 Frame -3 hits (HSPs): ___________________ __________________________________________________ Database sequence: | | | | 373 0 150 300 Minus Strand HSPs: Score = 463 (163.0 bits), Expect = 5.2e-43, P = 5.2e-43 Identities = 94/137 (68%), Positives = 104/137 (75%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKP--LRLPSDAQKAPILPTNDLPT 242 RA+ +Q LDP +DLTP F FLGLK RLP+D Q APILPT+DLPT Sbjct: 85 RARRNQLLDP-SAVHGVTQFSDLTPKE-FRRKFLGLKRRGFRLPTDTQTAPILPTSDLPT 142 Query: 241 DFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPE 62 +FDWRE GAVT VKNQG CGSCWSFSA+GALEGAHFL T ELVSLSEQQLVDCD ECDP Sbjct: 143 EFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCDHECDPA 202 Query: 61 ERGACDSGCNGGLMTIAFE 5 + +CDSGC+GGLM AFE Sbjct: 203 QANSCDSGCSGGLMNNAFE 221 >gi|419782|pir||S30150 cysteine proteinase (EC 3.4.22.-) precursor (clone CYP-8) - common tobacco >gi|19851|emb|CAA78365.1| (Z13964) tobacco pre-pro-cysteine proteinase [Nicotiana tabacum] Length = 365 Frame -3 hits (HSPs): ____________________ Annotated Domains: __ ____ __________________________________________________ Database sequence: | | | | 365 0 150 300 __________________ Annotated Domains: PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 320..339 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 150..161 __________________ Minus Strand HSPs: Score = 461 (162.3 bits), Expect = 8.5e-43, P = 8.5e-43 Identities = 90/137 (65%), Positives = 105/137 (76%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKAPILPTNDLPTDF 236 RA+ +Q LDP S DLTP F ++LGL + +A+KAPILPT+DLP D+ Sbjct: 79 RARLNQLLDPSAEHGITKFS-DLTPSE-FRRTYLGLHKPKPKVNAEKAPILPTSDLPADY 136 Query: 235 DWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEER 56 DWR+HGAVTGVKNQGSCGSCWSFS GA+EGAHFL TGELVSLSEQQLVDCD ECD E++ Sbjct: 137 DWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDSEQQ 196 Query: 55 GACDSGCNGGLMTIAFE 5 +CD+GC GGLMT AFE Sbjct: 197 DSCDAGCGGGLMTTAFE 213 >gi|7211743|gb|AAF40415.1|AF216784_1 (AF216784) papain-like cysteine proteinase isoform II [Ipomoea batatas] Length = 368 Frame -3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 368 0 150 300 Minus Strand HSPs: Score = 460 (161.9 bits), Expect = 1.1e-42, P = 1.1e-42 Identities = 91/137 (66%), Positives = 105/137 (76%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGL-KPLRLPSDAQKAPILPTNDLPTD 239 RAK HQ+LDP +D TP F FLGL + L+ P+DA+ APILPT++LP+D Sbjct: 81 RAKRHQELDP-AAVHGVTQFSDSTPTE-FRRKFLGLNRRLKFPADAKTAPILPTDELPSD 138 Query: 238 FDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEE 59 FDWR+ GAVT VKNQG+CG CWSFS GALEGA+FL TG+LVSLSEQQLVDCD ECDPEE Sbjct: 139 FDWRDRGAVTPVKNQGTCGLCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 198 Query: 58 RGACDSGCNGGLMTIAFE 5 G+CD GCNGGLM AFE Sbjct: 199 AGSCDFGCNGGLMNSAFE 216 >gi|419781|pir||S30149 cysteine proteinase (EC 3.4.22.-) precursor (clone CYP-7) - common tobacco >gi|19849|emb|CAA78361.1| (Z13959) tobacco pre-pro-cysteine proteinase [Nicotiana tabacum] Length = 363 Frame -3 hits (HSPs): ___________________ Annotated Domains: _____________________________________________ __________________________________________________ Database sequence: | | | | 363 0 150 300 __________________ Annotated Domains: DOMO DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 36..354 PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 318..337 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 148..159 __________________ Minus Strand HSPs: Score = 457 (160.9 bits), Expect = 2.3e-42, P = 2.3e-42 Identities = 90/137 (65%), Positives = 103/137 (75%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKAPILPTNDLPTDF 236 RA+ +Q LDP S DLTP F ++LGL + +A+KAPILPT+DLP DF Sbjct: 77 RARLNQLLDPSAEHGITKFS-DLTPSE-FRRTYLGLHKPKPKLNAEKAPILPTSDLPADF 134 Query: 235 DWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEER 56 DWR+HGAVTGVKNQGSCGSCWSFS GA+EGAHFL TGELVSLSEQQLVDCD ECDPE++ Sbjct: 135 DWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQ 194 Query: 55 GACDSGCNGGLMTIAFE 5 ACD+GC GG AFE Sbjct: 195 DACDAGCGGGHYATAFE 211 >gi|1706260|sp|Q10716|CYS1_MAIZE CYSTEINE PROTEINASE 1 PRECURSOR >gi|2118131|pir||S59597 cysteine proteinase (EC 3.4.22.-) 1 precursor - maize >gi|643597|dbj|BAA08244.1| (D45402) cysteine proteinase [Zea mays] Length = 371 Frame -3 hits (HSPs): ____________________ Annotated Domains: _______________________________________________ __________________________________________________ Database sequence: | | | | 371 0 150 300 __________________ Annotated Domains: BLOCKS BL00139A: Eukaryotic thiol (cysteine) pr 155..164 BLOCKS BL00139B: Eukaryotic thiol (cysteine) pr 205..213 BLOCKS BL00139C: Eukaryotic thiol (cysteine) pr 302..311 BLOCKS BL00139D: Eukaryotic thiol (cysteine) pr 325..341 DOMO DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 37..364 Entrez active site: BY SIMILARITY. 161 Entrez active site: BY SIMILARITY. 303 Entrez active site: BY SIMILARITY. 330 Entrez glycosylation site: POTENTIAL. 254 PFAM Peptidase_C1: Papain family cysteine pro 137..364 PRINTS PAPAIN1: Papain cysteine protease family 155..170 PRINTS PAPAIN2: Papain cysteine protease family 303..313 PRINTS PAPAIN3: Papain cysteine protease family 325..331 PRODOM PD078471: CYS1_MAIZE 22..44 PRODOM PD000247: CYSP(11) CATL(7) CYS2(5) 46..112 PRODOM PD171174: CYS1_MAIZE 114..135 PRODOM PD000158: CYSP(14) CATL(9) CYS1(8) 137..360 PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 325..344 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 155..166 PROSITE THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 301..311 __________________ Minus Strand HSPs: Score = 418 (147.1 bits), Expect = 3.1e-38, P = 3.1e-38 Identities = 87/140 (62%), Positives = 98/140 (70%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLR------LPSDAQKAPILPTN 254 RA+ HQ LDP S DLTP F ++LGL+ R L A +AP+LPT+ Sbjct: 78 RARRHQLLDPSAEHGVTKFS-DLTPAE-FRRTYLGLRKSRRALLRELGESAHEAPVLPTD 135 Query: 253 DLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLE 74 LP DFDWR+HGAV VKNQGSCGSCWSFSA GALEGAH+L TG+L LSEQQ VDCD E Sbjct: 136 GLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHE 195 Query: 73 CDPEERGACDSGCNGGLMTIAF 8 CD E +CDSGCNGGLMT AF Sbjct: 196 CDSSEPDSCDSGCNGGLMTTAF 217 >gi|7435815|pir||T08845 cysteine proteinase (EC 3.4.22.-) isoform A - soybean (fragment) >gi|1619905|gb|AAB16997.1| (U71380) thiol protease isoform A [Glycine max] Length = 318 Frame -3 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | | | | 318 0 50 100 150 200 250 300 Minus Strand HSPs: Score = 404 (142.2 bits), Expect = 9.3e-37, P = 9.3e-37 Identities = 75/99 (75%), Positives = 81/99 (81%), Frame = -3 Query: 301 LRLPSDAQKAPILPTNDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTG 122 +R P+ AQKAPILPT DLP DFDWR+ GAVT VK+ G CGSCWSFS GALE + +L TG Sbjct: 71 VRFPAHAQKAPILPTKDLPKDFDWRDKGAVTNVKDLGGCGSCWSFSTTGALEVSFYLATG 130 Query: 121 ELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5 ELVSLSEQQLVDCD CDPEE GACDSGCNGGLM AFE Sbjct: 131 ELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFE 169 >gi|5679322|gb|AAD46920.1|AF167986_1 (AF167986) putative cysteine proteinase GmPM33 [Glycine max] Length = 363 Frame -1 hits (HSPs): _____ Frame -3 hits (HSPs): _____________ __________________________________________________ Database sequence: | | | | 363 0 150 300 Minus Strand HSPs: Score = 322 (113.3 bits), Expect = 1.3e-31, Sum P(2) = 1.3e-31 Identities = 58/89 (65%), Positives = 72/89 (80%), Frame = -3 Query: 274 APILPTNDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQ 95 AP L + LP +FDWRE GAVT VK QG CGSCW+FS G++EGA+FL TG+LVSLS+QQ Sbjct: 115 APPLEVDGLPENFDWREKGAVTEVKLQGRCGSCWAFSTTGSIEGANFLATGKLVSLSDQQ 174 Query: 94 LVDCDLECDPEERGACDSGCNGGLMTIAF 8 L+DCD +CD E+ +CD+GCNGGLMT A+ Sbjct: 175 LLDCDNKCDITEKTSCDNGCNGGLMTNAY 203 Score = 49 (17.2 bits), Expect = 1.3e-31, Sum P(2) = 1.3e-31 Identities = 13/26 (50%), Positives = 17/26 (65%), Frame = -1 Query: 387 PSAVHGVPQV-LPISLPAE--FSPPV 319 P+AVHGV Q LP+S A +PP+ Sbjct: 93 PTAVHGVTQFSLPVSNNAAGGIAPPL 118 >gi|7435792|pir||T12042 cysteine proteinase (EC 3.4.22.-) 4 precursor - kidney bean >gi|2511695|emb|CAB17077.1| (Z99955) cysteine proteinase precursor [Phaseolus vulgaris] Length = 377 Frame -1 hits (HSPs): __ Frame -3 hits (HSPs): ____________ __________________________________________________ Database sequence: | | | | 377 0 150 300 Minus Strand HSPs: Score = 318 (111.9 bits), Expect = 6.3e-30, Sum P(2) = 6.3e-30 Identities = 57/90 (63%), Positives = 70/90 (77%), Frame = -3 Query: 274 APILPTNDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQ 95 AP L + LP DFDWRE GAVT VK QG CGSCW+FS G++EGA+F+ TG+L++LSEQQ Sbjct: 130 APPLKVDGLPEDFDWREKGAVTEVKMQGKCGSCWAFSTTGSIEGANFIATGKLLNLSEQQ 189 Query: 94 LVDCDLECDPEERGACDSGCNGGLMTIAFE 5 LVDCD +CD E CD+GC GGLMT A++ Sbjct: 190 LVDCDSQCDITESTTCDNGCMGGLMTNAYK 219 Score = 37 (13.0 bits), Expect = 6.3e-30, Sum P(2) = 6.3e-30 Identities = 6/9 (66%), Positives = 8/9 (88%), Frame = -1 Query: 387 PSAVHGVPQ 361 P+A+HGV Q Sbjct: 92 PTAIHGVTQ 100 >gi|1362047|pir||S55923 cysteine proteinase (EC 3.4.22.-) precursor - soybean >gi|479060|emb|CAA83673.1| (Z32795) cysteine proteinase [Glycine max] >gi|1096153|prf||2111244A Cys protease [Glycine max] Length = 380 Frame -3 hits (HSPs): ___________________ Annotated Domains: ___________________ ___ __ ___ __________________________________________________ Database sequence: | | | | 380 0 150 300 __________________ Annotated Domains: Entrez domain: signal sequence 1..29 Entrez domain: propeptide 30..139 Entrez active site: Cys, His, Asn 164 Entrez active site: Cys, His, Asn 307 Entrez active site: Cys, His, Asn 334 PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 329..348 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 158..169 PROSITE THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 305..315 __________________ Minus Strand HSPs: Score = 334 (117.6 bits), Expect = 2.4e-29, P = 2.4e-29 Identities = 73/137 (53%), Positives = 89/137 (64%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKA----PILPTNDL 248 RA HQ LDP +DLT F + G+ PS A P L + L Sbjct: 84 RAAEHQALDP-TAVHGVTQFSDLTEDE-FEKLYTGVNG-GFPSSNNAAGGIAPPLEVDGL 140 Query: 247 PTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECD 68 P +FDWRE GAVT VK QG CGSCW+FS G++EGA+FL TG+LVSLSEQQL+DCD +CD Sbjct: 141 PENFDWREKGAVTEVKLQGRCGSCWAFSTTGSIEGANFLATGKLVSLSEQQLLDCDNKCD 200 Query: 67 PEERGACDSGCNGGLMTIAF 8 E+ +CD+GCNGGLMT A+ Sbjct: 201 ITEKTSCDNGCNGGLMTNAY 220 >gi|7435814|pir||T10949 cysteine proteinase (EC 3.4.22.-) precursor - spring vetch >gi|2414683|emb|CAB16316.1| (Z99172) cysteine proteinase precursor [Vicia sativa] Length = 379 Frame -3 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | | 379 0 150 300 Minus Strand HSPs: Score = 327 (115.1 bits), Expect = 1.3e-28, P = 1.3e-28 Identities = 70/137 (51%), Positives = 89/137 (64%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQK--APILPTNDLPT 242 +A HQ LDP +DL+ F + G K S+A AP L P Sbjct: 85 KAAEHQALDP-TAIHGVTQFSDLSEEE-FERFYTGFKGGFPSSNAAGGVAPPLDVKGFPE 142 Query: 241 DFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPE 62 +FDWRE GAVTG+K QG CGSCW+F+ G++EGA+FL TG+LVSLSEQQLVDCD +CD Sbjct: 143 NFDWREKGAVTGIKTQGKCGSCWAFTTTGSIEGANFLATGKLVSLSEQQLVDCDNKCDIT 202 Query: 61 ERGACDSGCNGGLMTIAFE 5 + +CD+GCNGGLMT A++ Sbjct: 203 KT-SCDNGCNGGLMTTAYD 220 >gi|7435812|pir||T06726 cysteine proteinase (EC 3.4.22.-) F28P10.80 - Arabidopsis thaliana >gi|4678299|emb|CAB41090.1| (AL049655) cysteine proteinase precursor-like protein [Arabidopsis thaliana] Length = 363 Frame -3 hits (HSPs): ___________________ __________________________________________________ Database sequence: | | | | 363 0 150 300 Minus Strand HSPs: Score = 323 (113.7 bits), Expect = 3.6e-28, P = 3.6e-28 Identities = 70/137 (51%), Positives = 90/137 (65%), Frame = -3 Query: 415 RAKSHQKLDPFRRPRRPPGSTDLTPGGVFAASFLGLKPL---RLPSDAQKAPILPTNDLP 245 +A HQ +DP +DLT F + G+ + R + +AP++ + LP Sbjct: 81 KAAEHQMMDP-SAVHGVTQFSDLTEEE-FKRMYTGVADVGGSRGGTVGAEAPMVEVDGLP 138 Query: 244 TDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDP 65 DFDWRE G VT VKNQG+CGSCW+FS GA EGAHF+ TG+L+SLSEQQLVDCD + D Sbjct: 139 EDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCD-QADK 197 Query: 64 EERGACDSGCNGGLMTIAFE 5 + ACD+GC GGLMT A+E Sbjct: 198 K---ACDNGCGGGLMTNAYE 214 >gi|1353726|gb|AAB01769.1| (U42758) cysteine proteinase homolog [Naegleria fowleri] Length = 347 Frame -3 hits (HSPs): ___________________ __________________________________________________ Database sequence: | | | | 347 0 150 300 Minus Strand HSPs: Score = 300 (105.6 bits), Expect = 9.8e-26, P = 9.8e-26 Identities = 68/124 (54%), Positives = 84/124 (67%), Frame = -3 Query: 355 TDLTPGGVFAASFLGLKPLRLPSDAQK---AP---ILPTNDL---PTDFDWREHGAVTGV 203 +DLTP F FL +K P +A+K AP +L ++ PT FDWR+HGAVT V Sbjct: 81 SDLTPEE-FKRMFL-MKTYT-PEEAKKILAAPQHAVLSEKEVQTAPTSFDWRQHGAVTRV 137 Query: 202 KNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDP-EERGACDSGCNGG 26 KNQG+CGSCW+FS G +EG + G+LVSLSEQQLVDCD C + + ACDSGCNGG Sbjct: 138 KNQGACGSCWTFSTTGNVEGQWAIKKGKLVSLSEQQLVDCDHNCVTYQNQQACDSGCNGG 197 Query: 25 LMTIAFE 5 LM AF+ Sbjct: 198 LMWSAFQ 204 >gi|118117|sp|P04988|CYS1_DICDI CYSTEINE PROTEINASE 1 PRECURSOR >gi|67647|pir||KHDO cysteine proteinase 1 (EC 3.4.22.-) precursor - slime mold (Dictyostelium discoideum) >gi|1617037|emb|CAA26255.1| (X02407) cysteine proteinase I precursor [Dictyostelium discoideum] Length = 343 Frame -3 hits (HSPs): _____________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | 343 0 150 300 __________________ Annotated Domains: BLOCKS BL00139A: Eukaryotic thiol (cysteine) pr 136..145 BLOCKS BL00139B: Eukaryotic thiol (cysteine) pr 187..195 BLOCKS BL00139C: Eukaryotic thiol (cysteine) pr 285..294 BLOCKS BL00139D: Eukaryotic thiol (cysteine) pr 306..322 DOMO DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 19..342 Entrez active site: BY SIMILARITY. 142 Entrez active site: BY SIMILARITY. 286 Entrez active site: BY SIMILARITY. 311 PFAM Peptidase_C1: Papain family cysteine pro 118..342 PRINTS PAPAIN1: Papain cysteine protease family 136..151 PRINTS PAPAIN2: Papain cysteine protease family 286..296 PRINTS PAPAIN3: Papain cysteine protease family 306..312 PRODOM PD171194: CYS1_DICDI 1..25 PRODOM PD000247: CYSP(11) CATL(7) CYS2(5) 27..101 PRODOM PD000158: CYSP(14) CATL(9) CYS1(8) 118..338 PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 306..325 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 136..147 PROSITE THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 284..294 __________________ Minus Strand HSPs: Score = 290 (102.1 bits), Expect = 1.1e-24, P = 1.1e-24 Identities = 54/84 (64%), Positives = 60/84 (71%), Frame = -3 Query: 256 NDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDL 77 N +PT FDWR GAVT VKNQG CGSCWSFS G +EG HF+ +LVSLSEQ LVDCD Sbjct: 116 NSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 175 Query: 76 EC-DPEERGACDSGCNGGLMTIAF 8 EC + E ACD GCNGGL A+ Sbjct: 176 ECMEYEGEEACDEGCNGGLQPNAY 199 >gi|118124|sp|P25250|CYS2_HORVU CYSTEINE PROTEINASE EP-B 4 PRECURSOR >gi|82386|pir||JQ1110 cysteine proteinase (EC 3.4.22.-) EP-B 4 precursor - barley >gi|1146118|gb|AAA85036.1| (U19384) cysteine proteinase EPB2 precursor [Hordeum vulgare] Length = 373 Frame -3 hits (HSPs): __________________ ___ Annotated Domains: _____________ _______________________________ __________________________________________________ Database sequence: | | | | 373 0 150 300 __________________ Annotated Domains: BLOCKS BL00139A: Eukaryotic thiol (cysteine) pr 152..161 BLOCKS BL00139B: Eukaryotic thiol (cysteine) pr 194..202 BLOCKS BL00139C: Eukaryotic thiol (cysteine) pr 296..305 BLOCKS BL00139D: Eukaryotic thiol (cysteine) pr 313..329 Entrez active site: BY SIMILARITY. 158 Entrez active site: BY SIMILARITY. 297 Entrez active site: BY SIMILARITY. 318 Entrez glycosylation site: POTENTIAL. 130 PFAM Peptidase_C1: Papain family cysteine pro 134..353 PRINTS PAPAIN1: Papain cysteine protease family 152..167 PRINTS PAPAIN2: Papain cysteine protease family 297..307 PRINTS PAPAIN3: Papain cysteine protease family 313..319 PRODOM PD154723: CYS1(1) CYS2(1) 24..43 PRODOM PD000247: CYSP(11) CATL(7) CYS2(5) 45..116 PRODOM PD000158: CYSP(14) CATL(9) CYS1(8) 134..352 PROSITE ALDEHYDE_DEHYDR_GLU: Aldehyde dehydrogen 272..279 PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 313..332 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 152..163 PROSITE THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 295..305 __________________ Minus Strand HSPs: Score = 253 (89.1 bits), Expect = 9.5e-24, Sum P(2) = 9.5e-24 Identities = 63/134 (47%), Positives = 77/134 (57%), Frame = -3 Query: 406 SHQKLD--PFRRPRRPPGSTDLTPGGVFAASFLGLKPLRLPSDAQKAP-----ILPTNDL 248 SH K P+R G D F A+F+G PS P L +DL Sbjct: 78 SHNKRGDHPYRLHLNRFGDMDQAE---FRATFVGDLRRDTPSKPPSVPGFMYAALNVSDL 134 Query: 247 PTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECD 68 P DWR+ GAVTGVK+QG CGSCW+FS V ++EG + + TG LVSLSEQ+L+DCD Sbjct: 135 PPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT--- 191 Query: 67 PEERGACDSGCNGGLMTIAFE 5 A + GC GGLM AFE Sbjct: 192 -----ADNDGCQGGLMDNAFE 207 Score = 43 (15.1 bits), Expect = 9.5e-24, Sum P(2) = 9.5e-24 Identities = 9/14 (64%), Positives = 10/14 (71%), Frame = -3 Query: 46 DSGCNGGLMTIAFE 5 DSG +GGL IA E Sbjct: 335 DSGASGGLCGIAME 348 >gi|118120|sp|P25249|CYS1_HORVU CYSTEINE PROTEINASE EP-B 1 PRECURSOR >gi|82385|pir||JQ1111 cysteine proteinase (EC 3.4.22.-) EP-B 1 precursor - barley >gi|1146116|gb|AAA85035.1| (U19359) cysteine proteinase EPB1 precursor [Hordeum vulgare] Length = 371 Frame -3 hits (HSPs): __________________ __ Annotated Domains: _____________________________________________ __________________________________________________ Database sequence: | | | | 371 0 150 300 __________________ Annotated Domains: BLOCKS BL00139A: Eukaryotic thiol (cysteine) pr 152..161 BLOCKS BL00139B: Eukaryotic thiol (cysteine) pr 194..202 BLOCKS BL00139C: Eukaryotic thiol (cysteine) pr 296..305 BLOCKS BL00139D: Eukaryotic thiol (cysteine) pr 313..329 DOMO DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 36..353 Entrez active site: BY SIMILARITY. 158 Entrez active site: BY SIMILARITY. 297 Entrez active site: BY SIMILARITY. 318 Entrez glycosylation site: POTENTIAL. 130 PFAM Peptidase_C1: Papain family cysteine pro 134..353 PRINTS PAPAIN1: Papain cysteine protease family 152..167 PRINTS PAPAIN2: Papain cysteine protease family 297..307 PRINTS PAPAIN3: Papain cysteine protease family 313..319 PRODOM PD154723: CYS1(1) CYS2(1) 24..43 PRODOM PD000247: CYSP(11) CATL(7) CYS2(5) 45..116 PRODOM PD000158: CYSP(14) CATL(9) CYS1(8) 134..352 PROSITE ALDEHYDE_DEHYDR_GLU: Aldehyde dehydrogen 272..279 PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 313..332 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 152..163 PROSITE THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 295..305 __________________ Minus Strand HSPs: Score = 251 (88.4 bits), Expect = 1.5e-23, Sum P(2) = 1.5e-23 Identities = 64/134 (47%), Positives = 80/134 (59%), Frame = -3 Query: 406 SHQKLD--PFRRPRRPPGSTDLTPGGVFAASFLG-LK---PLRLPS-DAQKAPILPTNDL 248 SH K P+R G D F A+F+G L+ P + PS L +DL Sbjct: 78 SHNKRGDHPYRLHLNRFGDMDQAE---FRATFVGDLRRDTPAKPPSVPGFMYAALNVSDL 134 Query: 247 PTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECD 68 P DWR+ GAVTGVK+QG CGSCW+FS V ++EG + + TG LVSLSEQ+L+DCD Sbjct: 135 PPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT--- 191 Query: 67 PEERGACDSGCNGGLMTIAFE 5 A + GC GGLM AFE Sbjct: 192 -----ADNDGCQGGLMDNAFE 207 Score = 43 (15.1 bits), Expect = 1.5e-23, Sum P(2) = 1.5e-23 Identities = 9/14 (64%), Positives = 10/14 (71%), Frame = -3 Query: 46 DSGCNGGLMTIAFE 5 DSG +GGL IA E Sbjct: 335 DSGASGGLCGIAME 348 >gi|537437|gb|AAC35211.1| (U12637) cysteine proteinase [Hemerocallis hybrid cultivar] Length = 359 Frame -3 hits (HSPs): _____________ __________________________________________________ Database sequence: | | | | 359 0 150 300 Minus Strand HSPs: Score = 266 (93.6 bits), Expect = 3.9e-22, P = 3.9e-22 Identities = 57/99 (57%), Positives = 67/99 (67%), Frame = -3 Query: 301 LRLPSDAQKAPILPTNDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTG 122 LR DA + +DLPT DWRE GAVTGVK+QG CGSCW+FS V A+EG + + T Sbjct: 112 LRGVKDAGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGSCWAFSTVVAVEGINQIKTN 171 Query: 121 ELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5 ELVSLSEQQLVDCD + +SGCNGGLM AF+ Sbjct: 172 ELVSLSEQQLVDCDTK---------NSGCNGGLMDYAFD 201 >gi|1834307|dbj|BAA09820.1| (D63670) cysteine proteinase [Spirometra erinaceieuropaei] >gi|1834309|dbj|BAA09821.1| (D63671) cysteine proteinase [Spirometra erinaceieuropaei] Length = 336 Frame -3 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | 336 0 150 300 Minus Strand HSPs: Score = 265 (93.3 bits), Expect = 5.0e-22, P = 5.0e-22 Identities = 61/117 (52%), Positives = 76/117 (64%), Frame = -3 Query: 355 TDLTPGGVFAASFLGLKPLRLPSDAQKAPI-LPTND-LPTDFDWREHGAVTGVKNQGSCG 182 +DLTPG FA +L L+ + L +K + +P + LP +WRE GAVT VKNQG CG Sbjct: 85 SDLTPGE-FAERYLCLRGIVLTKLRRKEAVSVPLKENLPDSVNWRERGAVTSVKNQGQCG 143 Query: 181 SCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5 SCWSFSA GA+EGA + TG L SLSEQQL+DC + + GCNGGLM AF+ Sbjct: 144 SCWSFSANGAIEGAIQIKTGALRSLSEQQLMDCSWDYG-------NQGCNGGLMPQAFQ 195 >gi|1085124|pir||JX0366 cysteine endopeptidase (EC 3.4.22.-) precursor - silkworm >gi|957281|gb|AAB33990.1| (S77508) cysteine proteinase, BCP {EC 3.4.22.-} [Bombyx mori=silkmoths, pupae, eggs, Peptide, 344 aa] Length = 344 Frame -3 hits (HSPs): _____________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | 344 0 150 300 __________________ Annotated Domains: DOMO DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 124..343 Entrez domain: signal sequence 1..16 Entrez domain: propeptide 17..120 Entrez binding site: carbohydrate (Asn) (covale 98 Entrez active site: Cys, His, Asn 151 Entrez active site: Cys, His, Asn 290 Entrez active site: Cys, His, Asn 311 PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 306..325 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 145..156 PROSITE THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 288..298 __________________ Minus Strand HSPs: Score = 262 (92.2 bits), Expect = 1.0e-21, P = 1.0e-21 Identities = 53/88 (60%), Positives = 62/88 (70%), Frame = -3 Query: 268 ILPTN-DLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQL 92 I P N LP DWR+HGAVT +K+QG CGSCWSFS GALEG HF +G LVSLSEQ L Sbjct: 120 ISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNL 179 Query: 91 VDCDLECDPEERGACDSGCNGGLMTIAFE 5 +DC E+ G ++GCNGGLM AF+ Sbjct: 180 IDCS-----EQYG--NNGCNGGLMDNAFK 201 >gi|7435804|pir||T03694 cysteine proteinase (EC 3.4.22.-) - rice >gi|1514953|dbj|BAA11170.1| (D76415) cysteine proteinase [Oryza sativa] Length = 368 Frame -3 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | | 368 0 150 300 Minus Strand HSPs: Score = 260 (91.5 bits), Expect = 1.7e-21, P = 1.7e-21 Identities = 64/126 (50%), Positives = 75/126 (59%), Frame = -3 Query: 373 RRPPGSTDLTPGG-----VFAASFLGLKPLRLPSDAQKAPILP------TNDLPTDFDWR 227 +R PG L G F A+F G L D AP LP DLP DWR Sbjct: 81 KRAPGYAPLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWR 140 Query: 226 EHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGAC 47 GAVTGVK+QG CGSCW+FS V ++EG + + TG LVSLSEQ+L+DCD A Sbjct: 141 RKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDT--------AD 192 Query: 46 DSGCNGGLMTIAFE 5 +SGC GGLM AFE Sbjct: 193 NSGCQGGLMENAFE 206 >gi|4426617|gb|AAD20453.1| (AF099203) cysteine endopeptidase precursor [Oryza sativa] Length = 368 Frame -3 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | | 368 0 150 300 Minus Strand HSPs: Score = 259 (91.2 bits), Expect = 2.2e-21, P = 2.2e-21 Identities = 64/126 (50%), Positives = 75/126 (59%), Frame = -3 Query: 373 RRPPGSTDLTPGG-----VFAASFLGLKPLRLPSDAQKAPILP------TNDLPTDFDWR 227 +R PG L G F A+F G L D AP LP DLP DWR Sbjct: 81 KRAPGYPPLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWR 140 Query: 226 EHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGAC 47 GAVTGVK+QG CGSCW+FS V ++EG + + TG LVSLSEQ+L+DCD A Sbjct: 141 RKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDT--------AD 192 Query: 46 DSGCNGGLMTIAFE 5 +SGC GGLM AFE Sbjct: 193 NSGCQGGLMENAFE 206 >gi|5761329|dbj|BAA83473.1| (AB004819) cysteine endopeptidase [Oryza sativa] Length = 371 Frame -3 hits (HSPs): _______________ __________________________________________________ Database sequence: | | | | 371 0 150 300 Minus Strand HSPs: Score = 258 (90.8 bits), Expect = 2.8e-21, P = 2.8e-21 Identities = 59/109 (54%), Positives = 69/109 (63%), Frame = -3 Query: 331 FAASFLGLKPLRLPSDAQKAPILP------TNDLPTDFDWREHGAVTGVKNQGSCGSCWS 170 F A+F G L D AP LP DLP DWR GAVTGVK+QG CGSCW+ Sbjct: 102 FRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCGSCWA 161 Query: 169 FSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5 FS V ++EG + + TG LVSLSEQ+L+DCD A +SGC GGLM AFE Sbjct: 162 FSTVVSVEGINAIRTGRLVSLSEQELIDCDT--------ADNSGCQGGLMENAFE 208 >gi|415567|gb|AAB28289.1| cysteine proteinase=39 kda activated form [Bombyx mori=silkworms, eggs, Peptide, 176 aa] Length = 176 Frame -3 hits (HSPs): ______________________ Annotated Domains: ____ ____ __________________________________________________ Database sequence: | | | | | 176 0 50 100 150 __________________ Annotated Domains: PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 19..30 PROSITE THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 162..172 __________________ Minus Strand HSPs: Score = 257 (90.5 bits), Expect = 3.5e-21, P = 3.5e-21 Identities = 50/82 (60%), Positives = 59/82 (71%), Frame = -3 Query: 250 LPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLEC 71 LP DWR+HGAVT +K+QG CGSCWSFS GALEG HF +G LVSLSEQ L+DC Sbjct: 1 LPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS--- 57 Query: 70 DPEERGACDSGCNGGLMTIAFE 5 E+ G ++GCNGGLM AF+ Sbjct: 58 --EQYG--NNGCNGGLMDNAFK 75 >gi|2118132|pir||JC4848 cysteine proteinase (EC 3.4.22.-) - Douglas fir >gi|1208549|gb|AAC49455.1| (U41902) Pseudotzain [Pseudotsuga menziesii] Length = 454 Frame -3 hits (HSPs): ____________ Annotated Domains: __ _____ __ __________________________________________________ Database sequence: | | | || 454 0 150 300 450 __________________ Annotated Domains: PROSITE PA2_HIS: Phospholipase A2 histidine acti 406..413 PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 307..326 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 150..161 PROSITE THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 290..300 __________________ Minus Strand HSPs: Score = 260 (91.5 bits), Expect = 5.1e-21, P = 5.1e-21 Identities = 58/109 (53%), Positives = 70/109 (64%), Frame = -3 Query: 331 FAASFLGLK---PLRLP-SDAQKAPILPTNDLPTDFDWREHGAVTGVKNQGSCGSCWSFS 164 F A++LG K RL S + + DLP DWRE GAVT VKNQGSCGSCW+FS Sbjct: 101 FKAAYLGTKLDAKKRLSRSPSPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGSCWAFS 160 Query: 163 AVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5 V A+EG + + TG L SLSEQ+LVDCD + + GCNGGLM AF+ Sbjct: 161 TVAAVEGINQIVTGNLTSLSEQELVDCDT--------SYNQGCNGGLMDYAFQ 205 >gi|2160175|gb|AAB60738.1| (AC000132) Strong similarity to Dianthus cysteine proteinase (gb|U17135). [Arabidopsis thaliana] Length = 416 Frame -3 hits (HSPs): _____________ __________________________________________________ Database sequence: | | | | 416 0 150 300 Minus Strand HSPs: Score = 258 (90.8 bits), Expect = 5.5e-21, P = 5.5e-21 Identities = 58/109 (53%), Positives = 74/109 (67%), Frame = -3 Query: 331 FAASFLGLKPLRLPSD--AQKAPILPTN-DLPTDFDWREHGAVTGVKNQGSCGSCWSFSA 161 F AS LGL + PS A K L + +P DWR+ GAVT VK+QGSCG+CWSFSA Sbjct: 87 FKASRLGLS-VSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSA 145 Query: 160 VGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5 GA+EG + + TG+L+SLSEQ+L+DCD + ++GCNGGLM AFE Sbjct: 146 TGAMEGINQIVTGDLISLSEQELIDCDK--------SYNAGCNGGLMDYAFE 189 >gi|5777611|emb|CAB53397.1| (AJ245868) cysteine protease [Medicago sativa] Length = 209 Frame -3 hits (HSPs): _______________ __________________________________________________ Database sequence: | | | | | | 209 0 50 100 150 200 Minus Strand HSPs: Score = 255 (89.8 bits), Expect = 5.7e-21, P = 5.7e-21 Identities = 47/61 (77%), Positives = 52/61 (85%), Frame = -3 Query: 187 CGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAF 8 CGS W+FS GALEGA++L TG+LVSLSEQQLVDCD CDPEER +CDSGCNGGLM AF Sbjct: 1 CGSGWAFSTTGALEGANYLATGKLVSLSEQQLVDCDHVCDPEERNSCDSGCNGGLMNNAF 60 Query: 7 E 5 E Sbjct: 61 E 61 >gi|1079183|pir||A53810 cathepsin L (EC 3.4.22.15) precursor - flesh fly (Sarcophaga peregrina) >gi|505140|dbj|BAA03970.1| (D16533) cathepsin L precursor [Sarcophaga peregrina] Length = 339 Frame -3 hits (HSPs): ____________ Annotated Domains: ________________________________________________ __________________________________________________ Database sequence: | | | | 339 0 150 300 __________________ Annotated Domains: DOMO DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 18..338 PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 301..320 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 140..151 PROSITE THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 283..293 __________________ Minus Strand HSPs: Score = 254 (89.4 bits), Expect = 7.3e-21, P = 7.3e-21 Identities = 50/81 (61%), Positives = 57/81 (70%), Frame = -3 Query: 250 LPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLEC 71 +P DWREHGAVTGVK+QG CGSCW+FS+ GALEG HF G LVSLSEQ LVDC + Sbjct: 122 VPKSVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKY 181 Query: 70 DPEERGACDSGCNGGLMTIAF 8 ++GCNGGLM AF Sbjct: 182 G-------NNGCNGGLMDNAF 195 >gi|399190|sp|Q02765|CATS_RAT CATHEPSIN S PRECURSOR >gi|348347|pir||A45087 cathepsin S (EC 3.4.22.27) - rat >gi|203650|gb|AAA40994.1| (L03201) cathepsin S precursor [Rattus norvegicus] Length = 330 Frame -3 hits (HSPs): __________________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | 330 0 150 300 __________________ Annotated Domains: DOMO DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 16..329 Entrez active site: BY SIMILARITY. 137 Entrez active site: BY SIMILARITY. 277 Entrez active site: BY SIMILARITY. 297 Entrez glycosylation site: POTENTIAL. 100 Entrez glycosylation site: POTENTIAL. 110 PFAM Peptidase_C1: Papain family cysteine pro 113..329 PRINTS PAPAIN1: Papain cysteine protease family 131..146 PRINTS PAPAIN2: Papain cysteine protease family 277..287 PRINTS PAPAIN3: Papain cysteine protease family 292..298 PRODOM PD171175: CATS_RAT 1..24 PRODOM PD000247: CYSP(11) CATL(7) CYS2(5) 26..97 PRODOM PD000158: CYSP(14) CATL(9) CYS1(8) 113..327 PROSITE LEUCINE_ZIPPER: Leucine zipper pattern. 280..301 PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 292..311 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 131..142 PROSITE THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 275..285 __________________ Minus Strand HSPs: Score = 253 (89.1 bits), Expect = 9.4e-21, P = 9.4e-21 Identities = 60/116 (51%), Positives = 73/116 (62%), Frame = -3 Query: 352 DLTPGGVFAASFLGLKPLRLPSDAQKAPILPTND---LPTDFDWREHGAVTGVKNQGSCG 182 D+TP V ++G LR+P ++ L ++ LP DWRE G VT VK QGSCG Sbjct: 80 DMTPEEVIG--YMG--SLRIPRPWNRSGTLKSSSNQTLPDSVDWREKGCVTNVKYQGSCG 135 Query: 181 SCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5 SCW+FSA GALEG L TG+LVSLS Q LVDC E E+ G + GC GG MT AF+ Sbjct: 136 SCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTE---EKYG--NKGCGGGFMTEAFQ 189 >gi|1706263|sp|P54640|CYS5_DICDI CYSTEINE PROTEINASE 5 PRECURSOR >gi|1222694|gb|AAA92018.1| (L36205) CP5 [Dictyostelium discoideum] Length = 344 Frame -3 hits (HSPs): ____________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | 344 0 150 300 __________________ Annotated Domains: BLOCKS BL00139A: Eukaryotic thiol (cysteine) pr 130..139 BLOCKS BL00139B: Eukaryotic thiol (cysteine) pr 171..179 BLOCKS BL00139C: Eukaryotic thiol (cysteine) pr 271..280 BLOCKS BL00139D: Eukaryotic thiol (cysteine) pr 306..322 DOMO DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 20..343 Entrez Domain: SER-RICH. 196..340 Entrez active site: BY SIMILARITY. 136 Entrez active site: BY SIMILARITY. 272 Entrez active site: BY SIMILARITY. 311 Entrez glycosylation site: POTENTIAL. 110 Entrez glycosylation site: POTENTIAL. 297 PFAM Peptidase_C1: Papain family cysteine pro 112..343 PRINTS PAPAIN1: Papain cysteine protease family 130..145 PRINTS PAPAIN2: Papain cysteine protease family 272..282 PRINTS PAPAIN3: Papain cysteine protease family 306..312 PRODOM PD151262: CYS4(1) CYS5(1) Q94503(1) 1..28 PRODOM PD000247: CYSP(11) CATL(7) CYS2(5) 30..113 PRODOM PD000158: CYSP(14) CATL(9) CYS1(8) 115..342 PROSITE THIOL_PROTEASE_ASN: Eukaryotic thiol (cy 306..325 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 130..141 PROSITE THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 270..280 __________________ Minus Strand HSPs: Score = 252 (88.7 bits), Expect = 1.2e-20, P = 1.2e-20 Identities = 51/85 (60%), Positives = 55/85 (64%), Frame = -3 Query: 259 TNDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCD 80 TN DWR GAVT VKNQG CG CWSFS G+ EGAHF GELVSLSEQ L+DC Sbjct: 109 TNSSAASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCS 168 Query: 79 LECDPEERGACDSGCNGGLMTIAFE 5 E +SGC+GGLMT AFE Sbjct: 169 TE---------NSGCDGGLMTYAFE 184 >gi|600111|emb|CAA84378.1| (Z34895) cysteine proteinase [Vicia sativa] Length = 359 Frame -3 hits (HSPs): ___________ __________________________________________________ Database sequence: | | | | 359 0 150 300 Minus Strand HSPs: Score = 251 (88.4 bits), Expect = 1.5e-20, P = 1.5e-20 Identities = 50/83 (60%), Positives = 60/83 (72%), Frame = -3 Query: 253 DLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLE 74 D+P+ DWR GAVTGVK+QG CGSCW+FS + A+EG + + T +LVSLSEQQLVDCD E Sbjct: 127 DVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSLSEQQLVDCDTE 186 Query: 73 CDPEERGACDSGCNGGLMTIAFE 5 E + GCNGGLM AFE Sbjct: 187 ----E----NEGCNGGLMEYAFE 201 >gi|1076552|pir||S49166 cysteine proteinase precursor - spring vetch Length = 357 Frame -3 hits (HSPs): ____________ Annotated Domains: ____________________________________________ __________________________________________________ Database sequence: | | | | 357 0 150 300 __________________ Annotated Domains: DOMO DM00081: EUKARYOTICTHIOL(CYSTEINE)PROTEA 30..340 PROSITE THIOL_PROTEASE_CYS: Eukaryotic thiol (cy 146..157 PROSITE THIOL_PROTEASE_HIS: Eukaryotic thiol (cy 284..294 __________________ Minus Strand HSPs: Score = 251 (88.4 bits), Expect = 1.5e-20, P = 1.5e-20 Identities = 50/83 (60%), Positives = 60/83 (72%), Frame = -3 Query: 253 DLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQLVDCDLE 74 D+P+ DWR GAVTGVK+QG CGSCW+FS + A+EG + + T +LVSLSEQQLVDCD E Sbjct: 127 DVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSLSEQQLVDCDTE 186 Query: 73 CDPEERGACDSGCNGGLMTIAFE 5 E + GCNGGLM AFE Sbjct: 187 ----E----NEGCNGGLMEYAFE 201 >gi|7381610|gb|AAF61565.1|AF227957_1 (AF227957) cathepsin L-like proteinase precursor [Boophilus microplus] Length = 332 Frame -3 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | 332 0 150 300 Minus Strand HSPs: Score = 251 (88.4 bits), Expect = 1.5e-20, P = 1.5e-20 Identities = 58/109 (53%), Positives = 67/109 (61%), Frame = -3 Query: 331 FAASFLGLKPLRLPSDAQKAPILPTND--LPTDFDWREHGAVTGVKNQGSCGSCWSFSAV 158 FA F G R + P ND LP DWR+ GAVT VK+QG CGSCW+FSA Sbjct: 87 FARIFNGHHGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSAT 146 Query: 157 GALEGAHFLYTGELVSLSEQQLVDCDLECDPEERGACDSGCNGGLMTIAFE 5 G+LEG HFL GELVSLSEQ LVDC + G ++GC GGLM AF+ Sbjct: 147 GSLEGQHFLKNGELVSLSEQNLVDCS-----QSFG--NNGCEGGLMEDAFK 190 >gi|1185457|gb|AAA87848.1| (U38475) cathepsin L [Schistosoma japonicum] Length = 224 Frame -3 hits (HSPs): ___________________ __________________________________________________ Database sequence: | | | | | | 224 0 50 100 150 200 Minus Strand HSPs: Score = 250 (88.0 bits), Expect = 1.9e-20, P = 1.9e-20 Identities = 50/89 (56%), Positives = 59/89 (66%), Frame = -3 Query: 271 PILPTNDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQL 92 P D+P +FDWRE GAVT VKNQG CGSCW+FS G +E F TG+L+SLSEQQL Sbjct: 3 PRREVGDIPNNFDWREKGAVTEVKNQGMCGSCWAFSTTGNIESQWFRKTGKLLSLSEQQL 62 Query: 91 VDCDLECDPEERGACDSGCNGGLMTIAFE 5 VDCD + D GCNGGL + A+E Sbjct: 63 VDCD---------SLDDGCNGGLPSNAYE 82 >gi|7435775|pir||JC5443 cathepsin L-like cysteine proteinase (EC 3.4.-.-) c1 - Maize weevil >gi|2804262|dbj|BAA24442.1| (D82884) cysteine proteinase [Sitophilus zeamais] Length = 338 Frame -3 hits (HSPs): _____________ __________________________________________________ Database sequence: | | | | 338 0 150 300 Minus Strand HSPs: Score = 250 (88.0 bits), Expect = 1.9e-20, P = 1.9e-20 Identities = 54/87 (62%), Positives = 60/87 (68%), Frame = -3 Query: 268 ILPTN-DLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLYTGELVSLSEQQL 92 I P N LP DWR+ GAVT VK+QG CGSCWSFSA G+LEG HF TG+LVSLSEQ L Sbjct: 114 ISPANVKLPDTVDWRDKGAVTEVKDQGHCGSCWSFSATGSLEGQHFRKTGKLVSLSEQNL 173 Query: 91 VDCDLECDPEERGACDSGCNGGLMTIAF 8 VDC G ++GCNGGLM AF Sbjct: 174 VDCS-----GRYG--NNGCNGGLMDNAF 194 WARNING: HSPs involving 552 database sequences were not reported due to the limiting value of parameter B = 50. Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.98 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.331 0.140 0.459 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.327 0.139 0.419 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.339 0.148 0.513 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.342 0.148 0.476 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.357 0.157 0.611 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.318 0.138 0.440 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 138 137 10. 72 3 12 22 0.11 33 30 0.10 36 +2 0 138 137 10. 72 3 12 22 0.11 33 30 0.10 36 +1 0 139 138 10. 72 3 12 22 0.12 33 30 0.10 36 -1 0 139 138 10. 72 3 12 22 0.12 33 30 0.10 36 -2 0 138 138 10. 72 3 12 22 0.12 33 30 0.10 36 -3 0 138 137 10. 72 3 12 22 0.11 33 30 0.10 36 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 8:50 PM CDT May 27, 2000 Format: BLAST # of letters in database: 158,518,215 # of sequences in database: 505,245 # of database sequences satisfying E: 602 No. of states in DFA: 589 (58 KB) Total size of DFA: 176 KB (192 KB) Time to generate neighborhood: 0.01u 0.01s 0.02t Elapsed: 00:00:00 No. of threads or processors used: 4 Search cpu time: 214.56u 1.43s 215.99t Elapsed: 00:02:10 Total cpu time: 214.67u 1.50s 216.17t Elapsed: 00:02:11 Start: Thu Feb 15 02:48:14 2001 End: Thu Feb 15 02:50:25 2001 WARNINGS ISSUED: 2
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000