WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= 'D08E07_J19_10.ab1' (678 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 3 Sequences : less than 3 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 779 155 |=================================================== 6310 624 95 |=============================== 3980 529 127 |========================================== 2510 402 96 |================================ 1580 306 80 |========================== 1000 226 42 |============== 631 184 38 |============ 398 146 25 |======== 251 121 8 |== 158 113 9 |=== 100 104 10 |=== 63.1 94 7 |== 39.8 87 4 |= 25.1 83 4 |= 15.8 79 12 |==== >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 67 <<<<<<<<<<<<<<<<< 10.0 67 5 |= 6.31 62 3 |= 3.98 59 5 |= 2.51 54 3 |= 1.58 51 2 |: 1.00 49 8 |== 0.63 41 8 |== 0.40 33 1 |: 0.25 32 1 |: 0.16 31 3 |= 0.10 28 1 |: 0.063 27 0 | 0.040 27 2 |: 0.025 25 1 |: 0.016 24 3 |= 0.010 21 0 | 0.0063 21 0 | 0.0040 21 3 |= Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|6728988|gb|AAF26986.1|AC018363_31(AC018363) putati... +3 542 2.7e-51 1 gi|9759039|dbj|BAB09366.1|(AB026661) aspartyl proteas... +3 541 3.5e-51 1 gi|4646203|gb|AAD26876.1|AC007230_10(AC007230) Belong... +3 431 1.6e-39 1 gi|6850312|gb|AAF29389.1|AC009999_9(AC009999) Contain... +3 278 2.6e-23 1 gi|11993877|gb|AAG42922.1|AF329505_1(AF329505) putati... +3 280 5.3e-23 1 gi|10177232|dbj|BAB10606.1|(AB005243) protease-like p... +3 276 2.1e-22 1 gi|4415912|gb|AAD20143.1|(AC006282) putative protease... +3 265 2.0e-21 1 gi|6579210|gb|AAF18253.1|AC011438_15(AC011438) T23G18... +3 175 5.7e-21 2 gi|11357751|pir||T45858hypothetical protein F3A4.130 ... +3 184 2.7e-12 1 gi|9757837|dbj|BAB08274.1|(AB008267) contains similar... +3 168 1.3e-10 1 gi|11358319|pir||T47313hypothetical protein T32A11.12... +3 145 3.7e-07 1 gi|11994412|dbj|BAB02414.1|(AB024033) chloroplast nuc... +3 141 2.0e-06 1 gi|7489412|pir||T06213probable aspartic proteinase (E... +3 128 6.8e-05 1 gi|7489434|pir||T04372protein EEA1 - barley >gi|25704... +3 128 6.8e-05 1 gi|4803959|gb|AAD29831.1|AC006202_9(AC006202) putativ... +3 126 0.00029 1 gi|11994761|dbj|BAB03090.1|(AP001313) chloroplast nuc... +3 123 0.00032 1 gi|4063754|gb|AAC98462.1|(AC005851) putative chloropl... +3 121 0.00042 1 gi|4063755|gb|AAC98463.1|(AC005851) putative chloropl... +3 120 0.00057 1 gi|11357516|pir||T47790hypothetical protein F17J16.13... +3 116 0.0027 1 gi|12082174|dbj|BAB20797.1|(AB045379) pepsinogen C [X... +3 114 0.0028 1 gi|4510415|gb|AAD21501.1|(AC006929) putative chloropl... +3 114 0.0029 1 gi|104296|pir||A39314gastricsin (EC 3.4.23.3) precurs... +3 109 0.011 1 gi|12321511|gb|AAG50814.1|AC079281_16(AC079281) hypot... +3 110 0.012 1 gi|10334495|emb|CAC10209.1|(AJ299060) putative extrac... +3 108 0.013 1 gi|12322538|gb|AAG51267.1|AC027135_8(AC027135) chloro... +3 108 0.018 1 gi|416749|sp|P32951|CAR1_CANPACANDIDAPEPSIN 1 PRECURS... +3 106 0.026 1 gi|477734|pir||B47701aspartic proteinase ACPR (EC 3.4... +3 106 0.026 1 gi|4512658|gb|AAD21712.1|(AC006931) hypothetical prot... +3 102 0.093 1 gi|2160166|gb|AAB60729.1|(AC000132) F21M12.13 gene pr... +3 101 0.11 1 gi|7715602|gb|AAF68120.1|AC010793_15(AC010793) F20B17... +3 100 0.13 1 gi|9711879|dbj|BAB07973.1|(AP002524) hypothetical pro... +3 100 0.14 1 gi|5042418|gb|AAD38257.1|AC006193_13(AC006193) Hypoth... +3 99 0.17 1 gi|477042|pir||A47701aspartic proteinase ACPL (EC 3.4... +3 97 0.24 1 gi|12323376|gb|AAG51657.1|AC010704_1(AC010704) nucell... +3 96 0.33 1 gi|7485278|pir||T08860hypothetical protein A_TM017A05... +3 96 0.37 1 gi|12328547|dbj|BAB21205.1|(AP002913) nucleoid DNA-bi... +3 96 0.40 1 gi|7486515|pir||T08979hypothetical protein F6G3.60 - ... +3 95 0.40 1 gi|7486516|pir||T08980hypothetical protein F6G3.70 - ... +3 95 0.41 1 gi|11514162|pdb|1FKN|AChain A, Structure Of Beta-Secr... +3 94 0.45 1 gi|6598808|gb|AAB80784.2|(AF024504) putative chloropl... +3 95 0.45 1 gi|6330045|dbj|BAA86463.1|(AB032975) KIAA1149 protein... +3 94 0.46 1 gi|9506421ref|NP_062077.1| beta-site APP cleaving enz... +3 95 0.48 1 gi|6857759ref|NP_035922.2| beta-site APP cleaving enz... +3 95 0.48 1 gi|6685249|sp|P56818|BACE_MOUSEBETA-SECRETASE PRECURS... +3 95 0.48 1 gi|12736860ref|XP_006430.2| beta-site APP-cleaving en... +3 94 0.49 1 gi|2664292|emb|CAA75754.1|(Y15744) cellular aspartic ... +3 93 0.55 1 gi|7770346|gb|AAF69716.1|AC016041_21(AC016041) F27J15... +3 95 0.55 1 gi|6470293|gb|AAF13715.1|AF200193_1(AF200193) memapsi... +3 94 0.56 1 gi|6912266ref|NP_036236.1| beta-site APP-cleaving enz... +3 94 0.57 1 gi|9665144|gb|AAF97328.1|AC023628_9(AC023628) Unknown... +3 92 0.75 1
Use the and icons to retrieve links to Entrez:
WARNING: Descriptions of 17 database sequences were not reported due to the limiting value of parameter V = 50. >gi|6728988|gb|AAF26986.1|AC018363_31 (AC018363) putative aspartyl protease [Arabidopsis thaliana] Length = 488 Frame 3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | | 488 0 150 300 450 Plus Strand HSPs: Score = 542 (190.8 bits), Expect = 2.7e-51, P = 2.7e-51 Identities = 102/195 (52%), Positives = 136/195 (69%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL 185 V L +IEV +L+L S+ FDS + KG +IDSGTTL YLP VY+ L+ ++LA P L L Sbjct: 286 VNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTL 345 Query: 186 YLVEQQFRCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQT 365 + V++ F CF YT +DR FP V F S+SL VYP +YLFQ ++ WC GWQ QT Sbjct: 346 HTVQESFTCFHYTDKLDR-FPTVTFQFDKSVSLAVYPREYLFQVREDTWCFGWQNGGLQT 404 Query: 366 KNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNCSSSIKVKDEATGIVHTVVAHNISSA 545 K G +T+LGD+ LSNKLV+YD+E VIGWT++NCS I+VKDE +G ++TV AHN+S + Sbjct: 405 KGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCSGGIQVKDEESGAIYTVGAHNLSWS 464 Query: 546 STLFIGRILTFFLLL 590 S+L I ++LT LL Sbjct: 465 SSLAITKLLTLVSLL 479 >gi|9759039|dbj|BAB09366.1| (AB026661) aspartyl protease-like [Arabidopsis thaliana] Length = 478 Frame 3 hits (HSPs): _____________________ __________________________________________________ Database sequence: | | | | | 478 0 150 300 450 Plus Strand HSPs: Score = 541 (190.4 bits), Expect = 3.5e-51, P = 3.5e-51 Identities = 106/195 (54%), Positives = 144/195 (73%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKG-TVIDSGTTLAYLPAIVYDELIQKVLARQPGLK 182 V+LK ++VD D + LP + S NG G T+IDSGTTLAYLP +Y+ LI+K+ A+Q +K Sbjct: 276 VILKGMDVDGDPIDLPPSLA-STNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQ-VK 333 Query: 183 LYLVEQQFRCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQ 362 L++V++ F CF +T N D+ FPVV LHF+DSL L+VYPHDYLF ++ ++C GWQ Sbjct: 334 LHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMT 393 Query: 363 TKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNCSSSIKVKDEATGIVHTVVAHN-IS 539 T++G D+ LLGDLVLSNKLV+YDLE VIGW D+NCSSSIKVKD +G + + A N IS Sbjct: 394 TQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKVKD-GSGAAYQLGAENLIS 452 Query: 540 SASTLFIGRILTFFLLL 590 +AS++ G ++T +L Sbjct: 453 AASSVMNGTLVTLLSIL 469 >gi|4646203|gb|AAD26876.1|AC007230_10 (AC007230) Belongs to PF|00026 Eukaryotic aspartyl protease family. [Arabidopsis thaliana] Length = 449 Frame 3 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | | 449 0 150 300 Plus Strand HSPs: Score = 431 (151.7 bits), Expect = 1.6e-39, P = 1.6e-39 Identities = 82/154 (53%), Positives = 109/154 (70%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL 185 V+L ++VD L LP I NG GT++DSGTTLAY P ++YD LI+ +LARQP +KL Sbjct: 276 VMLMGMDVDGTSLDLPRSIVR--NG-GTIVDSGTTLAYFPKVLYDSLIETILARQP-VKL 331 Query: 186 YLVEQQFRCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQT 365 ++VE+ F+CF ++ NVD FP V F+DS+ LTVYPHDYLF ++ ++C GWQ T Sbjct: 332 HIVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTT 391 Query: 366 KNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYN 467 ++ LLGDLVLSNKLV+YDL+ VIGW D+N Sbjct: 392 DERSEVILLGDLVLSNKLVVYDLDNEVIGWADHN 425 >gi|6850312|gb|AAF29389.1|AC009999_9 (AC009999) Contains similarity to nucellin from Hordeum vulgare gb|U87148. ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come from this gene. [Arabidopsis thaliana] Length = 388 Frame 3 hits (HSPs): ______________ __________________________________________________ Database sequence: | | | | 388 0 150 300 Plus Strand HSPs: Score = 278 (97.9 bits), Expect = 2.6e-23, P = 2.6e-23 Identities = 49/102 (48%), Positives = 75/102 (73%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL 185 V + +++V + L +P+D+F + KG +IDSGTTLAYLP I+Y+ L++K +P LK+ Sbjct: 285 VNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLAYLPEIIYEPLVKK----EPALKV 340 Query: 186 YLVEQQFRCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLF 311 ++V++ ++CF Y+G VD GFP V HF++S+ L VYPHDYLF Sbjct: 341 HIVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDYLF 382 >gi|11993877|gb|AAG42922.1|AF329505_1 (AF329505) putative protease protein [Arabidopsis thaliana] Length = 492 Frame 3 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | | 492 0 150 300 450 Plus Strand HSPs: Score = 280 (98.6 bits), Expect = 5.3e-23, P = 5.3e-23 Identities = 68/167 (40%), Positives = 92/167 (55%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL 185 V L+SI V+ IL + +F G GT+ID+GTTLAYLP Y IQ V Sbjct: 288 VNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVANAVSQYGR 347 Query: 186 YLVEQQFRCFLYT-GNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKDG---IWCIGWQRS 353 + + ++CF T G+VD FP V L F S+ + P YL F IWCIG+QR Sbjct: 348 PITYESYQCFEITAGDVDV-FPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRM 406 Query: 354 VAQTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNCSSSIKVKDEATG 506 + + +T+LGDLVL +K+V+YDL IGW +Y+CS + V G Sbjct: 407 -----SHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCSLEVNVSASRGG 452 >gi|10177232|dbj|BAB10606.1| (AB005243) protease-like protein [Arabidopsis thaliana] Length = 539 Frame 3 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | 539 0 150 300 450 Plus Strand HSPs: Score = 276 (97.2 bits), Expect = 2.1e-22, P = 2.1e-22 Identities = 62/161 (38%), Positives = 95/161 (59%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL 185 V L SI V+ L + +F + NG+GT+ID+GTTLAYL Y ++ + Sbjct: 287 VNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVR 346 Query: 186 YLVEQQFRCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKD----GIWCIGWQRS 353 +V + +C++ T +V FP V L+F S+ + P DYL Q + +WCIG+QR Sbjct: 347 PVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQR- 405 Query: 354 VAQTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNCSSSIKV 488 +N + +T+LGDLVL +K+ +YDL IGW +Y+CS+S+ V Sbjct: 406 ---IQN-QGITILGDLVLKDKIFVYDLVGQRIGWANYDCSTSVNV 446 >gi|4415912|gb|AAD20143.1| (AC006282) putative protease [Arabidopsis thaliana] Length = 469 Frame 3 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | | | 469 0 150 300 450 Plus Strand HSPs: Score = 265 (93.3 bits), Expect = 2.0e-21, P = 2.0e-21 Identities = 57/159 (35%), Positives = 93/159 (58%), Frame = +3 Query: 12 LKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKLYL 191 L SI V+ +L L + +F++ N +GT++D+GTTL YL YD + + L + Sbjct: 307 LLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPI 366 Query: 192 VEQQFRCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFK--DG--IWCIGWQRSVA 359 + +C+L + ++ FP V L+F S+ + P DYLF + DG +WCIG+Q++ Sbjct: 367 ISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAP- 425 Query: 360 QTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNCSSSIKV 488 ++ T+LGDLVL +K+ +YDL IGW Y+C + +V Sbjct: 426 -----EEQTILGDLVLKDKVFVYDLARQRIGWASYDCKCNHRV 463 >gi|6579210|gb|AAF18253.1|AC011438_15 (AC011438) T23G18.7 [Arabidopsis thaliana] Length = 566 Frame 3 hits (HSPs): ______ _________ __________________________________________________ Database sequence: | | | | | 566 0 150 300 450 Plus Strand HSPs: Score = 175 (61.6 bits), Expect = 5.7e-21, Sum P(2) = 5.7e-21 Identities = 41/94 (43%), Positives = 57/94 (60%), Frame = +3 Query: 198 QQFRCFLYT-GNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKDG---IWCIGWQRSVAQT 365 + ++CF T G+VD FP V L F S+ + P YL F IWCIG+QR Sbjct: 446 ESYQCFEITAGDVDV-FPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRM---- 500 Query: 366 KNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNCSSS 479 + + +T+LGDLVL +K+V+YDL IGW +Y+C S Sbjct: 501 -SHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCEFS 537 Score = 114 (40.1 bits), Expect = 5.7e-21, Sum P(2) = 5.7e-21 Identities = 25/51 (49%), Positives = 31/51 (60%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKV 158 V L+SI V+ IL + +F G GT+ID+GTTLAYLP Y IQ V Sbjct: 315 VNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAV 365 >gi|11357751|pir||T45858 hypothetical protein F3A4.130 - Arabidopsis thaliana >gi|6522926|emb|CAB62113.1| (AL132978) putative protein [Arabidopsis thaliana] Length = 632 Frame 3 hits (HSPs): _____________ __________________________________________________ Database sequence: | | | | | | 632 0 150 300 450 600 Plus Strand HSPs: Score = 184 (64.8 bits), Expect = 2.7e-12, P = 2.7e-12 Identities = 57/157 (36%), Positives = 81/157 (51%), Frame = +3 Query: 12 LKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLK-LY 188 L I V L L S +FD +G V+DSGTT AYLP + + V+ LK + Sbjct: 281 LTGIRVAGKQLSLHSRVFDGEHG--AVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQID 338 Query: 189 LVEQQFR--CFL-----YTGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKD--GIWCIG 341 + F+ CF Y + + FP V++ FK S + P +Y+F+ G +C+G Sbjct: 339 GPDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLG 398 Query: 342 WQRSVAQTKNGKD-MTLLGDLVLSNKLVIYDLEXMVIGWTDYNCS 473 NGKD TLLG +V+ N LV+YD E +G+ NCS Sbjct: 399 ------VFPNGKDHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCS 437 >gi|9757837|dbj|BAB08274.1| (AB008267) contains similarity to chloroplast nucleoid DNA-binding protein~gene_id:MMG4.12 [Arabidopsis thaliana] Length = 586 Frame 3 hits (HSPs): ______________ __________________________________________________ Database sequence: | | | | | 586 0 150 300 450 Plus Strand HSPs: Score = 168 (59.1 bits), Expect = 1.3e-10, P = 1.3e-10 Identities = 53/155 (34%), Positives = 80/155 (51%), Frame = +3 Query: 12 LKSIEVDTDILQLPSDIFDSVNGK-GTVIDSGTTLAYLPAIVYDELIQKVLARQPGLK-L 185 LK + V L+L +F NGK GTV+DSGTT AY P + + V+ P LK + Sbjct: 264 LKQMHVAGKSLKLNPKVF---NGKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRI 320 Query: 186 YLVEQQFR--CFLYTGN----VDRGFPVVKLHFKDSLSLTVYPHDYLFQFKD--GIWCIG 341 + + + CF G + FP + + F + L + P +YLF+ G +C+G Sbjct: 321 HGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLG 380 Query: 342 WQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNCS 473 + ++ TLLG +V+ N LV YD E +G+ NCS Sbjct: 381 ----IFPDRDST--TLLGGIVVRNTLVTYDRENDKLGFLKTNCS 418 >gi|11358319|pir||T47313 hypothetical protein T32A11.120 - Arabidopsis thaliana >gi|7413629|emb|CAB85978.1| (AL138653) putative protein [Arabidopsis thaliana] Length = 356 Frame 3 hits (HSPs): _______________ __________________________________________________ Database sequence: | | | | 356 0 150 300 Plus Strand HSPs: Score = 145 (51.0 bits), Expect = 3.7e-07, P = 3.7e-07 Identities = 39/103 (37%), Positives = 50/103 (48%), Frame = +3 Query: 42 LQLPSD--IFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKLYLVEQQFRCF 215 L+LP D +F G GT+IDSGTTL + P YD LIQ +L + + F+CF Sbjct: 240 LRLPIDPSVFSVAKGYGTIIDSGTTLVHFPGEAYDPLIQAILNVVSQYGRPIPYESFQCF 299 Query: 216 LYTGNVDRG------FPVVKLHFKDSLSLTVYPHDYLFQ-FKD 323 T + FP V L F S+ + P YLFQ F D Sbjct: 300 NITSGISSHLVIADMFPEVHLGFAGGASMVIKPEAYLFQKFLD 342 >gi|11994412|dbj|BAB02414.1| (AB024033) chloroplast nucleoid DNA binding protein-like [Arabidopsis thaliana] Length = 461 Frame 3 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | | 461 0 150 300 450 Plus Strand HSPs: Score = 141 (49.6 bits), Expect = 2.0e-06, P = 2.0e-06 Identities = 39/151 (25%), Positives = 73/151 (48%), Frame = +3 Query: 21 IEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQ---KVLARQPGLKLYL 191 I + D+L +PS ++D+ +G GT++DSGT+L L Y +++ + L +K Sbjct: 313 ISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEG 372 Query: 192 VEQQFRCFLYTG--NVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQT 365 V ++ CF +T NV + P + H K + YL G+ C+G+ + Sbjct: 373 VPIEY-CFSFTSGFNVSK-LPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPA 430 Query: 366 KNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNCS 473 N ++G+++ N L +DL + + C+ Sbjct: 431 TN-----VIGNIMQQNYLWEFDLMASTLSFAPSACT 461 >gi|7489412|pir||T06213 probable aspartic proteinase (EC 3.4.23.-) - barley >gi|2290202|gb|AAB96882.1| (U87148) nucellin [Hordeum vulgare] >gi|2290204|gb|AAB96883.1| (U87149) nucellin [Hordeum vulgare] Length = 410 Frame 3 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | | 410 0 150 300 Plus Strand HSPs: Score = 128 (45.1 bits), Expect = 6.8e-05, P = 6.8e-05 Identities = 39/139 (28%), Positives = 64/139 (46%), Frame = +3 Query: 90 VIDSGTTLAYLPAIVYDELIQKV--------LARQPGLKLYLVEQQFRCFLYTGNVDRGF 245 V DSG+T ++PA +Y+E++ KV L G L L + + F +V F Sbjct: 258 VFDSGSTYTHVPAQIYNEIVSKVRGTLSESSLEEVKGRALPLCWKGKKPFGSVNDVKNQF 317 Query: 246 PVVKL---HFKDSLSLTVYPHDYLFQFKDGIWCIG-WQRSVAQTKNGKDMTLLGDLVLSN 413 + L H + + +L + P +YLF +DG C+ S+ + L+G + + + Sbjct: 318 KALSLKITHARGTNNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFILIGAVTMQD 377 Query: 414 KLVIYDLEXMVIGWTDYNC 470 VIYD E +GW C Sbjct: 378 LFVIYDNEKKQLGWVRAQC 396 >gi|7489434|pir||T04372 protein EEA1 - barley >gi|2570402|gb|AAB97155.1| (AF017430) EEA1 [Hordeum vulgare] Length = 410 Frame 3 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | | 410 0 150 300 Plus Strand HSPs: Score = 128 (45.1 bits), Expect = 6.8e-05, P = 6.8e-05 Identities = 39/139 (28%), Positives = 64/139 (46%), Frame = +3 Query: 90 VIDSGTTLAYLPAIVYDELIQKV--------LARQPGLKLYLVEQQFRCFLYTGNVDRGF 245 V DSG+T ++PA +Y+E++ KV L G L L + + F +V F Sbjct: 258 VFDSGSTYTHVPAQIYNEIVSKVRGTFSESSLEEVKGRALPLCWKGKKPFGSVNDVKNQF 317 Query: 246 PVVKL---HFKDSLSLTVYPHDYLFQFKDGIWCIG-WQRSVAQTKNGKDMTLLGDLVLSN 413 + L H + + +L + P +YLF +DG C+ S+ + L+G + + + Sbjct: 318 KALSLKITHARGTNNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFILIGAVTMQD 377 Query: 414 KLVIYDLEXMVIGWTDYNC 470 VIYD E +GW C Sbjct: 378 LFVIYDNEKKQLGWVRAQC 396 >gi|4803959|gb|AAD29831.1|AC006202_9 (AC006202) putative chloroplast nucleoid DNA-binding protein [Arabidopsis thaliana] Length = 756 Frame 3 hits (HSPs): ___________ ___________ __________________________________________________ Database sequence: | | | | | || 756 0 150 300 450 600 750 Plus Strand HSPs: Score = 126 (44.4 bits), Expect = 0.00029, P = 0.00029 Identities = 44/155 (28%), Positives = 77/155 (49%), Frame = +3 Query: 12 LKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQ-PGLKLY 188 L ++ V+ +++ F + +G IDSGTTL Y P + Y L+++ + + +K+ Sbjct: 605 LDAVSVEDNLIATLGTPFHAEDGN-IFIDSGTTLTYFP-MSYCNLVREAVEQVVTAVKVP 662 Query: 189 -LVEQQFRCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHD-YLFQFKDGIWCIGWQRSVAQ 362 + C+ Y+ +D FPV+ +HF L + ++ YL GI+C+ A Sbjct: 663 DMGSDNLLCY-YSDTIDI-FPVITMHFSGGADLVLDKYNMYLETITGGIFCL------AI 714 Query: 363 TKNGKDM-TLLGDLVLSNKLVIYDLEXMVIGWTDYNCSS 476 N M + G+ +N LV YD VI ++ NCS+ Sbjct: 715 GCNDPSMPAVFGNRAQNNFLVGYDPSSNVISFSPTNCSA 753 Score = 94 (33.1 bits), Expect = 1.4, P = 0.76 Identities = 35/151 (23%), Positives = 75/151 (49%), Frame = +3 Query: 12 LKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKLYL 191 L ++ V+ + ++ F + +G VIDSG+T+ Y P + Y L++K + Q + + Sbjct: 266 LDAVSVEDNRIETLGTPFHAEDGN-IVIDSGSTVTYFP-VSYCNLVRKAV-EQVVTAVRV 322 Query: 192 VE---QQFRCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHD-YLFQFKDGIWCIGWQRSVA 359 + C+ ++ +D FPV+ +HF L + ++ Y+ G++C+ + Sbjct: 323 PDPSGNDMLCY-FSETIDI-FPVITMHFSGGADLVLDKYNMYMESNSGGLFCLA---IIC 377 Query: 360 QTKNGKDMTLLGDLVLSNKLVIYDLEXMVI-GWTDY 464 + + + G+ +N LV YD +++ G + Y Sbjct: 378 NSPTQE--AIFGNRAQNNFLVGYDSSSLLLQGASPY 411 >gi|11994761|dbj|BAB03090.1| (AP001313) chloroplast nucleoid DNA binding protein-like; nucellin-like protein [Arabidopsis thaliana] Length = 452 Frame 3 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | || 452 0 150 300 450 Plus Strand HSPs: Score = 123 (43.3 bits), Expect = 0.00032, P = 0.00032 Identities = 51/157 (32%), Positives = 81/157 (51%), Frame = +3 Query: 6 VVLKSIEVDTDILQL-PS--DIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPG 176 V LKS+ V+ L++ PS +I DS NG GTV+DSGTTLA+L Y +I V R Sbjct: 294 VKLKSVFVNGAKLRIDPSIWEIDDSGNG-GTVVDSGTTLAFLAEPAYRSVIAAVRRR--- 349 Query: 177 LKLYLVEQQFRCFLYTGNV------DRGFPVVKLHFKDSLSLTVYPHDYLFQFKDGIWCI 338 +KL + + F NV ++ P +K F P +Y + ++ I C+ Sbjct: 350 VKLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCL 409 Query: 339 GWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNCS 473 Q SV K G +++G+L+ L +D + +G++ C+ Sbjct: 410 AIQ-SV-DPKVG--FSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450 >gi|4063754|gb|AAC98462.1| (AC005851) putative chloroplast nucleoid DNA-binding protein [Arabidopsis thaliana] Length = 389 Frame 3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 389 0 150 300 Plus Strand HSPs: Score = 121 (42.6 bits), Expect = 0.00042, P = 0.00042 Identities = 38/155 (24%), Positives = 72/155 (46%), Frame = +3 Query: 12 LKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKLYL 191 L ++ V ++ F ++ G VIDSG+TL Y P Y L++K + Q + Sbjct: 241 LDAVSVGNTRIETVGTPFHALKGN-IVIDSGSTLTYFPES-YCNLVRKAV-EQVVTAVRF 297 Query: 192 VEQQFRCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHD-YLFQFKDGIWCIGWQRSVAQTK 368 C+ Y+ +D FPV+ +HF L + ++ Y+ G++C+ + + Sbjct: 298 PRSDILCY-YSKTIDI-FPVITMHFSGGADLVLDKYNMYVASNTGGVFCLA---IICNSP 352 Query: 369 NGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNCSS 476 + + G+ +N LV YD +++ + NCS+ Sbjct: 353 I--EEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCSA 386 >gi|4063755|gb|AAC98463.1| (AC005851) putative chloroplast nucleoid DNA-binding protein [Arabidopsis thaliana] Length = 392 Frame 3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 392 0 150 300 Plus Strand HSPs: Score = 120 (42.2 bits), Expect = 0.00057, P = 0.00057 Identities = 38/155 (24%), Positives = 74/155 (47%), Frame = +3 Query: 12 LKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQ-PGLKLY 188 L ++ V ++ F ++ G +IDSGTTL Y P + Y L+++ + ++ Sbjct: 241 LDAVSVGDTHVETMGTTFHALEGN-IIIDSGTTLTYFP-VSYCNLVREAVDHYVTAVRTA 298 Query: 189 -LVEQQFRCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHD-YLFQFKDGIWCIGWQRSVAQ 362 C+ YT +D FPV+ +HF L + ++ Y+ G +C+ ++ Sbjct: 299 DPTGNDMLCY-YTDTIDI-FPVITMHFSGGADLVLDKYNMYIETITRGTFCL----AIIC 352 Query: 363 TKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNCSS 476 +D + G+ +N LV YD +++ ++ NCS+ Sbjct: 353 NNPPQD-AIFGNRAQNNFLVGYDSSSLLVSFSPTNCSA 389 >gi|11357516|pir||T47790 hypothetical protein F17J16.130 - Arabidopsis thaliana >gi|7529751|emb|CAB86936.1| (AL163527) putative protein [Arabidopsis thaliana] Length = 535 Frame 3 hits (HSPs): _______________ __________________________________________________ Database sequence: | | | | | 535 0 150 300 450 Plus Strand HSPs: Score = 116 (40.8 bits), Expect = 0.0027, P = 0.0027 Identities = 41/156 (26%), Positives = 80/156 (51%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFD-SVNGKG-TVIDSGTTLAYLPAIVYDELIQKVLARQPGL 179 V +KSI V ++L +P + ++ S +G G T+IDSGTTL+Y Y+ + K+ + G Sbjct: 379 VQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGK 438 Query: 180 KLYLVEQQFR----CFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQF-KDGIWCIGW 344 Y V + F CF +G + P + + F D ++ +P + F + + + C+ Sbjct: 439 --YPVYRDFPILDPCFNVSGIHNVQLPELGIAFADG-AVWNFPTENSFIWLNEDLVCLAM 495 Query: 345 QRSVAQTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNCS 473 + K+ +++G+ N ++YD + +G+ C+ Sbjct: 496 ---LGTPKSA--FSIIGNYQQQNFHILYDTKRSRLGYAPTKCA 533 >gi|12082174|dbj|BAB20797.1| (AB045379) pepsinogen C [Xenopus laevis] Length = 383 Frame 3 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | 383 0 150 300 Plus Strand HSPs: Score = 114 (40.1 bits), Expect = 0.0028, P = 0.0028 Identities = 34/126 (26%), Positives = 58/126 (46%), Frame = +3 Query: 78 GKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKLYLVEQQFRCFLYTGNVDRGFPVVK 257 G ++D+GT+L P V+ LIQ + A+Q Y+V C N+ + P + Sbjct: 264 GCQAIVDTGTSLLTAPQSVFSSLIQSIGAQQDQNGQYVVS----C----SNI-QNLPTIS 314 Query: 258 LHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLE 437 +S + P Y+ Q G IG + ++NG+ + +LGD+ L +YDL Sbjct: 315 FTIS-GVSFPLPPSAYVLQQSSGYCTIGIMPTYLPSQNGQPLWILGDVFLREYYSVYDLG 373 Query: 438 XMVIGW 455 +G+ Sbjct: 374 NNQVGF 379 >gi|4510415|gb|AAD21501.1| (AC006929) putative chloroplast nucleoid DNA-binding protein [Arabidopsis thaliana] Length = 396 Frame 3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 396 0 150 300 Plus Strand HSPs: Score = 114 (40.1 bits), Expect = 0.0029, P = 0.0029 Identities = 35/155 (22%), Positives = 69/155 (44%), Frame = +3 Query: 12 LKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKLYL 191 L ++ V ++ F ++ G VIDSGTTL Y P + Y L+++ + Sbjct: 245 LDAVSVGNTRIETMGTTFHALEGN-IVIDSGTTLTYFP-VSYCNLVRQAVEHVVTAVRAA 302 Query: 192 VEQQFRCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHD-YLFQFKDGIWCIGWQRSVAQTK 368 Y + FPV+ +HF + L + ++ Y+ G++C+ + + Sbjct: 303 DPTGNDMLCYNSDTIDIFPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLA---IICNSP 359 Query: 369 NGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNCSS 476 + + G+ +N LV YD +++ ++ NCS+ Sbjct: 360 TQE--AIFGNRAQNNFLVGYDSSSLLVSFSPTNCSA 393 >gi|104296|pir||A39314 gastricsin (EC 3.4.23.3) precursor - bullfrog >gi|213688|gb|AAA49530.1| (M73750) pepsinogen [Rana catesbeiana] Length = 384 Frame 3 hits (HSPs): __________________ Annotated Domains: _________________________________________________ __________________________________________________ Database sequence: | | | | 384 0 150 300 __________________ Annotated Domains: DOMO DM00126: EUKARYOTICANDVIRALASPARTYLPROTE 16..382 PROSITE ASP_PROTEASE: Eukaryotic and viral aspar 268..279 PROSITE ASP_PROTEASE: Eukaryotic and viral aspar 82..93 __________________ Plus Strand HSPs: Score = 109 (38.4 bits), Expect = 0.011, P = 0.011 Identities = 37/129 (28%), Positives = 62/129 (48%), Frame = +3 Query: 69 SVNGKGT---------VIDSGTTLAYLPAIVYDELIQKVLARQPGLKLYLVEQQFRCFLY 221 SVNG+ T ++D+GT+L P V+ L+Q + A+Q Y V C Sbjct: 253 SVNGQATGWCSQGCQGIVDTGTSLLTAPQSVFSSLMQSIGAQQDQNGQYAVS----C--- 305 Query: 222 TGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDL 401 N+ + P + +S + P Y+ Q G IG + ++NG+ + +LGD+ Sbjct: 306 -SNI-QSLPTISFTIS-GVSFPLPPSAYVLQQNSGYCTIGIMPTYLPSQNGQPLWILGDV 362 Query: 402 VLSNKLVIYDLEXMVIGW 455 L +YDL +G+ Sbjct: 363 FLRQYYSVYDLGNNQVGF 380 >gi|12321511|gb|AAG50814.1|AC079281_16 (AC079281) hypothetical protein [Arabidopsis thaliana] Length = 483 Frame 3 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | 483 0 150 300 450 Plus Strand HSPs: Score = 110 (38.7 bits), Expect = 0.012, P = 0.012 Identities = 37/153 (24%), Positives = 64/153 (41%), Frame = +3 Query: 12 LKSIEVDTDILQLPSDIF--DSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL 185 L I V ++LQ+P F D G +IDSGT + L +Y+ L + L+ Sbjct: 333 LTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEK 392 Query: 186 YLVEQQF-RCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKD-GIWCIGWQRSVA 359 F C+ + P V HF L + +Y+ G +C+ + A Sbjct: 393 AAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAF----A 448 Query: 360 QTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNC 470 T + + ++G++ V +DL +IG++ C Sbjct: 449 PTASS--LAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483 >gi|10334495|emb|CAC10209.1| (AJ299060) putative extracellular dermal glycoprotein [Cicer arietinum] Length = 369 Frame 3 hits (HSPs): ______________________ __________________________________________________ Database sequence: | | | | 369 0 150 300 Plus Strand HSPs: Score = 108 (38.0 bits), Expect = 0.013, P = 0.013 Identities = 34/155 (21%), Positives = 71/155 (45%), Frame = +3 Query: 12 LKSIEVDTDILQLPSDIF--DSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL 185 +K+I++D ++ L ++ D+ GT I + + L VY I+ L + KL Sbjct: 204 VKAIKIDGKVVNLKPSLWSIDNKGNGGTKISTMSPFTELQRSVYKPFIRDFLKKASDRKL 263 Query: 186 YLVEQ--QFR-CFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSV 356 VE F CF T N++ P + L + + ++Y ++ + K + C+G+ Sbjct: 264 KKVESVAPFEACFEST-NIENSLPRIDLVLQGGVQWSIYGNNLMVNVKKNVACLGFVDGG 322 Query: 357 AQTKNG--KDMTLLGDLVLSNKLVIYDLEXMVIGWT 458 + + K ++G L + L+++DL + ++ Sbjct: 323 TEPRMSFAKASIVIGGHQLEDNLLVFDLNSSKLSFS 358 >gi|12322538|gb|AAG51267.1|AC027135_8 (AC027135) chloroplast nucleoid DNA binding protein, putative [Arabidopsis thaliana] Length = 445 Frame 3 hits (HSPs): ______________ __________________________________________________ Database sequence: | | | | 445 0 150 300 Plus Strand HSPs: Score = 108 (38.0 bits), Expect = 0.018, P = 0.018 Identities = 35/131 (26%), Positives = 63/131 (48%), Frame = +3 Query: 90 VIDSGTTLAYLPAIVYDELIQKVLARQPGLKLYLVEQQF--RCFLYTGNVDRGFPVVKLH 263 +IDSGTTL L + YD+ V G K Q CF +G+ + G P + +H Sbjct: 322 IIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFK-SGDKEIGLPAITMH 380 Query: 264 FKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLEXM 443 F ++ + + P + + + C+ S+ T ++ + G++V + LV YDLE Sbjct: 381 FTNA-DVKLSPINAFVKLNEDTVCL----SMIPTT---EVAIYGNMVQMDFLVGYDLETK 432 Query: 444 VIGWTDYNCSSSI 482 + + +CS ++ Sbjct: 433 TVSFQRMDCSGNL 445 >gi|416749|sp|P32951|CAR1_CANPA CANDIDAPEPSIN 1 PRECURSOR (ASPARTATE PROTEASE 1) (ACP 1) >gi|578135|emb|CAA77977.1| (Z11919) pro-acid protease [Candida parapsilosis] Length = 402 Frame 3 hits (HSPs): ________________ Annotated Domains: _________________________________________________ __________________________________________________ Database sequence: | | | | 402 0 150 300 __________________ Annotated Domains: BLOCKS BL00141A: Eukaryotic and viral aspartyl 89..104 BLOCKS BL00141B: Eukaryotic and viral aspartyl 177..188 BLOCKS BL00141C: Eukaryotic and viral aspartyl 237..246 BLOCKS BL00141D: Eukaryotic and viral aspartyl 279..288 BLOCKS BL00141E: Eukaryotic and viral aspartyl 365..388 DOMO DM00126: EUKARYOTICANDVIRALASPARTYLPROTE 45..390 Entrez active site: BY SIMILARITY. 94 Entrez active site: BY SIMILARITY. 282 Entrez glycosylation site: POTENTIAL. 52 PFAM asp: Eukaryotic aspartyl protease 68..391 PRINTS PEPSIN1: Pepsin family motif I - 1 82..102 PRINTS PEPSIN2: Pepsin family motif II - 1 232..245 PRINTS PEPSIN3: Pepsin family motif III - 1 279..290 PRINTS PEPSIN4: Pepsin family motif IV - 1 364..379 PRODOM PD187239: CAR1(1) CAR2(1) CAR8(1) 1..51 PRODOM PD000286: CARP(9) PEPA(8) CATD(5) 69..115 PRODOM PD000182: CARP(10) PEPA(8) CATD(5) 130..254 PRODOM PD006087: CAR1(2) CARP(2) 256..305 PRODOM PD000310: CARP(9) PEPA(8) CATD(5) 316..389 PROSITE ASP_PROTEASE: Eukaryotic and viral aspar 279..290 PROSITE ASP_PROTEASE: Eukaryotic and viral aspar 91..102 __________________ Plus Strand HSPs: Score = 106 (37.3 bits), Expect = 0.026, P = 0.026 Identities = 43/135 (31%), Positives = 63/135 (46%), Frame = +3 Query: 78 GKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKLYLVEQQFRCFLYTGNVDRGFPVVK 257 G G ++DSGTTL Y P+ D Q LA + G +L V + + N D V Sbjct: 276 GDGALLDSGTTLTYFPS---DFAAQ--LADKAGARLVQVARDQYLYFIDCNTDTSGTTV- 329 Query: 258 LHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKL-VIYDL 434 +F + +TV +Y++Q DG G Q S D T+LGD L + ++Y+L Sbjct: 330 FNFGNGAKITVPNTEYVYQNGDGTCLWGIQPS--------DDTILGDNFLRHAYYLLYNL 381 Query: 435 EX--MVIGWTDYNCSSSI 482 + + I Y SSI Sbjct: 382 DANTISIAQVKYTTDSSI 399 >gi|477734|pir||B47701 aspartic proteinase ACPR (EC 3.4.23.-) precursor - yeast (Candida parapsilosis) Length = 402 Frame 3 hits (HSPs): ________________ Annotated Domains: __ __ __________________________________________________ Database sequence: | | | | 402 0 150 300 __________________ Annotated Domains: PROSITE ASP_PROTEASE: Eukaryotic and viral aspar 279..290 PROSITE ASP_PROTEASE: Eukaryotic and viral aspar 91..102 __________________ Plus Strand HSPs: Score = 106 (37.3 bits), Expect = 0.026, P = 0.026 Identities = 43/135 (31%), Positives = 63/135 (46%), Frame = +3 Query: 78 GKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKLYLVEQQFRCFLYTGNVDRGFPVVK 257 G G ++DSGTTL Y P+ D Q LA + G +L V + + N D V Sbjct: 276 GDGALLDSGTTLTYFPS---DFAAQ--LADKAGARLVQVARDQYLYFIDCNTDTSGTTV- 329 Query: 258 LHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKL-VIYDL 434 +F + +TV +Y++Q DG G Q S D T+LGD L + ++Y+L Sbjct: 330 FNFGNGAKITVPNTEYVYQNGDGTCLWGIQPS--------DDTILGDNFLRHAYYLLYNL 381 Query: 435 EX--MVIGWTDYNCSSSI 482 + + I Y SSI Sbjct: 382 DANTISIAQVKYTTDSSI 399 >gi|4512658|gb|AAD21712.1| (AC006931) hypothetical protein [Arabidopsis thaliana] Length = 481 Frame 3 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | | 481 0 150 300 450 Plus Strand HSPs: Score = 102 (35.9 bits), Expect = 0.097, P = 0.093 Identities = 41/157 (26%), Positives = 81/157 (51%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFD-SVNGKG-TVIDSGTTLAYL--PA--IVYDELIQKVLAR 167 + +KSI V L +P + ++ S +G G T+IDSGTTL+Y PA I+ ++ +K+ Sbjct: 323 IQIKSILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKEN 382 Query: 168 QPGLKLYLVEQQFRCFLYTGNVDRGF--PVVKLHFKDSLSLTVYPHDYLFQF-KDGIWCI 338 P + + V CF +G + P + + F D ++ +P + F + + + C+ Sbjct: 383 YPIFRDFPVLDP--CFNVSGIEENNIHLPELGIAFVDG-TVWNFPAENSFIWLSEDLVCL 439 Query: 339 GWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNCS 473 ++ T +++G+ N ++YD + +G+T C+ Sbjct: 440 ----AILGTPKST-FSIIGNYQQQNFHILYDTKRSRLGFTPTKCA 479 >gi|2160166|gb|AAB60729.1| (AC000132) F21M12.13 gene product [Arabidopsis thaliana] Length = 449 Frame 3 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | 449 0 150 300 Plus Strand HSPs: Score = 101 (35.6 bits), Expect = 0.12, P = 0.11 Identities = 40/156 (25%), Positives = 70/156 (44%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL 185 V + S++V D + L FD+ +G GT+IDSGT + VY E I+ +Q + Sbjct: 301 VSVGSVQVPVDPVYLT---FDANSGAGTIIDSGTVITRFAQPVY-EAIRDEFRKQVNVSS 356 Query: 186 YLVEQQF-RCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKDG-IWCIGWQRSVA 359 + F CF + + + P + LH SL L + + L G + C+ + Sbjct: 357 FSTLGAFDTCF--SADNENVAPKITLHMT-SLDLKLPMENTLIHSSAGTLTCLS-MAGIR 412 Query: 360 QTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNCS 473 Q N + ++ +L N +++D+ IG C+ Sbjct: 413 QNANAV-LNVIANLQQQNLRILFDVPNSRIGIAPEPCN 449 >gi|7715602|gb|AAF68120.1|AC010793_15 (AC010793) F20B17.14 [Arabidopsis thaliana] >gi|12324588|gb|AAG52249.1|AC011717_17 (AC011717) putative aspartyl protease; 105611-106921 [Arabidopsis thaliana] Length = 436 Frame 3 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | 436 0 150 300 Plus Strand HSPs: Score = 100 (35.2 bits), Expect = 0.14, P = 0.13 Identities = 34/131 (25%), Positives = 59/131 (45%), Frame = +3 Query: 78 GKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL---YLVEQQFRCFLYTGNVDRGFP 248 G+G +IDSGT + LP +Y + + L + G Y + CF T D P Sbjct: 305 GRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDT--CFNLTSYEDISIP 362 Query: 249 VVKLHFKDSLSLTVYPHDYLFQFKD--GIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKLV 422 ++K+ F+ + L V + K + C+ ++A ++ ++G+ N+ V Sbjct: 363 IIKMIFQGNAELEVDVTGVFYFVKPDASLVCL----ALASLSYENEVGIIGNYQQKNQRV 418 Query: 423 IYDLEXMVIGWTDYNC 470 IYD +G NC Sbjct: 419 IYDTTQERLGIVGENC 434 >gi|9711879|dbj|BAB07973.1| (AP002524) hypothetical protein~similar to Arabidopsis thaliana chromosome 1, F13O11.13 [Oryza sativa] Length = 451 Frame 3 hits (HSPs): _______________ __________________________________________________ Database sequence: | | | || 451 0 150 300 450 Plus Strand HSPs: Score = 100 (35.2 bits), Expect = 0.15, P = 0.14 Identities = 35/130 (26%), Positives = 67/130 (51%), Frame = +3 Query: 90 VIDSGTTLAYL-PAI---VYDELIQKVL---ARQPGLKLYLVEQQFRCFLYTGN-VDRG- 242 ++DSGTTL +L P++ + DEL +++ + P L L C+ G V+ G Sbjct: 321 IVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQL------CYNVAGREVEAGE 374 Query: 243 -FPVVKLHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKL 419 P + L F ++ + P + ++G C+ ++ T + +++LG+L N Sbjct: 375 SIPDLTLEFGGGAAVALKPENAFVAVQEGTLCL----AIVATTEQQPVSILGNLAQQNIH 430 Query: 420 VIYDLEXMVIGWTDYNCSSS 479 V YDL+ + + +C+ S Sbjct: 431 VGYDLDAGTVTFAGADCAGS 450 >gi|5042418|gb|AAD38257.1|AC006193_13 (AC006193) Hypothetical Protein [Arabidopsis thaliana] Length = 431 Frame 3 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | 431 0 150 300 Plus Strand HSPs: Score = 99 (34.8 bits), Expect = 0.18, P = 0.17 Identities = 41/154 (26%), Positives = 68/154 (44%), Frame = +3 Query: 12 LKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKLYL 191 L++I V + +Q S IF + G VIDSGTTL LP+ Y EL + V+A +K Sbjct: 286 LEAISVGSKKIQFTSTIFGTGEGN-IVIDSGTTLTLLPSNFYYEL-ESVVAST--IKAER 341 Query: 192 VEQQ--FRCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQT 365 V+ Y + P + +HFK + + + + + C + + Sbjct: 342 VQDPDGILSLCYRDSSSFKVPDITVHFKGG-DVKLGNLNTFVAVSEDVSCFAFAAN---- 396 Query: 366 KNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNCS 473 + +T+ G+L N LV YD + + +CS Sbjct: 397 ---EQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDCS 429 >gi|477042|pir||A47701 aspartic proteinase ACPL (EC 3.4.23.-) precursor - yeast (Candida parapsilosis) Length = 395 Frame 3 hits (HSPs): __________________ Annotated Domains: __ ___ __________________________________________________ Database sequence: | | | | 395 0 150 300 __________________ Annotated Domains: PROSITE ASP_PROTEASE: Eukaryotic and viral aspar 269..280 PROSITE ASP_PROTEASE: Eukaryotic and viral aspar 90..101 __________________ Plus Strand HSPs: Score = 97 (34.1 bits), Expect = 0.27, P = 0.24 Identities = 42/139 (30%), Positives = 71/139 (51%), Frame = +3 Query: 66 DSVN-GKGTVIDSGTTLAYLPAIVYDELIQKVLAR-QP-GL--KLYLVEQQFRCFLYTGN 230 D+ N G G V+DSGTT++YLP + ++L KV A +P GL +LY ++ C N Sbjct: 261 DNSNAGFGVVVDSGTTISYLPDSIVNDLANKVGAYLEPVGLGNELYFID----C---NAN 313 Query: 231 VDRGFPVVKLHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLS 410 +G F + +TV +++ Q C+ W + +N +LGD L Sbjct: 314 -PQGS--ASFTFDNGAKITVPLSEFVLQSTANA-CV-WGLQSSDRQNVPP--ILGDNFLR 366 Query: 411 NKLVIYDL--EXMVIGWTDYNCSSSI 482 + V+++L E + + Y +SS+ Sbjct: 367 HAYVVFNLDKETVSLAQVKYTSASSV 392 >gi|12323376|gb|AAG51657.1|AC010704_1 (AC010704) nucellin-like protein; 27671-25467 [Arabidopsis thaliana] Length = 427 Frame 3 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | | 427 0 150 300 Plus Strand HSPs: Score = 96 (33.8 bits), Expect = 0.40, P = 0.33 Identities = 40/149 (26%), Positives = 60/149 (40%), Frame = +3 Query: 54 SDIFDSVNGKGTVIDSGTTLAYLPAIVYD---ELIQKVLARQP------GLKLYLVEQQF 206 +D V G V DSG++ Y A Y +LI+K L +P L + + Sbjct: 266 NDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGK 325 Query: 207 RCFLYTGNVDRGFPVVKLHF---KDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGK 377 + V + F + L F K+ V P YL + G C+G G Sbjct: 326 KPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEGY 385 Query: 378 DMTLLGDLVLSNKLVIYDLEXMVIGWTDYNC 470 + ++GD+ +VIYD E IGW +C Sbjct: 386 N--IIGDISFQGIMVIYDNEKQRIGWISSDC 414 >gi|7485278|pir||T08860 hypothetical protein A_TM017A05.8 - Arabidopsis thaliana Length = 472 Frame 3 hits (HSPs): _______________ __________________________________________________ Database sequence: | | | | | 472 0 150 300 450 Plus Strand HSPs: Score = 96 (33.8 bits), Expect = 0.46, P = 0.37 Identities = 35/133 (26%), Positives = 61/133 (45%), Frame = +3 Query: 90 VIDSGTTLAYLPAIVYDELIQKV--LARQPGLKLYLVEQQFR-CF-----LYTG----NV 233 V DSGT+ YL Y + + LA + E F C+ LY+G N Sbjct: 276 VFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPNK 335 Query: 234 DR-GFPVVKLHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLS 410 D +P V L K S VY + K ++C+ + +D++++G ++ Sbjct: 336 DSFQYPAVNLTMKGGSSYPVYHPLVVIPMKVNVYCLAIMKI-------EDISIIGQNFMT 388 Query: 411 NKLVIYDLEXMVIGWTDYNC 470 V++D E +++GW + +C Sbjct: 389 GYRVVFDREKLILGWKESDC 408 >gi|12328547|dbj|BAB21205.1| (AP002913) nucleoid DNA-binding protein cnd41-like protein [Oryza sativa] Length = 504 Frame 3 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | 504 0 150 300 450 Plus Strand HSPs: Score = 96 (33.8 bits), Expect = 0.50, P = 0.40 Identities = 39/155 (25%), Positives = 66/155 (42%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIF--DSVNGKGTVIDSGTTLAYLPAIVY----DELIQ--KVL 161 V L + V IL +P F DS G ++DSGT + L + Y D ++ + L Sbjct: 352 VGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSL 411 Query: 162 ARQPGLKLYLVEQQFRCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKD-GIWCI 338 R G+ L+ C+ + P V L F L + +YL G +C+ Sbjct: 412 PRTSGVSLFDT-----CYDLSDRTSVEVPAVSLRFAGGGELRLPAKNYLIPVDGAGTYCL 466 Query: 339 GWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNC 470 + A T ++++G++ V +D +G+T C Sbjct: 467 AF----APTNAA--VSIIGNVQQQGTRVSFDTAKSTVGFTTNKC 504 >gi|7486515|pir||T08979 hypothetical protein F6G3.60 - Arabidopsis thaliana >gi|4938479|emb|CAB43838.1| (AL078464) putative protein [Arabidopsis thaliana] >gi|7269903|emb|CAB80996.1| (AL161576) putative protein [Arabidopsis thaliana] Length = 424 Frame 3 hits (HSPs): ___________________ __________________________________________________ Database sequence: | | | | 424 0 150 300 Plus Strand HSPs: Score = 95 (33.4 bits), Expect = 0.52, P = 0.40 Identities = 40/154 (25%), Positives = 66/154 (42%), Frame = +3 Query: 12 LKSIEVDTDILQLPSDIFDSVNGKG-TVIDSGTTLAYLPAIVYDELIQKV-LARQPGLKL 185 L++I +L + F +G TVID+G + L Y+ L +++ L+ Sbjct: 266 LQAISFGEKLLDIEPGTFQRYRSQGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRR 325 Query: 186 YLVEQQFRCFLYTGNVDR---GFPVVKLHFKDSLSLTVYPHDYLFQFKDG-IWCIGWQRS 353 Q+ Y GN+ GFPVV HF L + + G +C+ Sbjct: 326 VKDWDQYTTPCYEGNLKLDLYGFPVVTFHFAGGAELALDVESLFVSSESGDSFCL----- 380 Query: 354 VAQTKNG-KDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNC 470 A T N DM+++G + N V Y+L M + + +C Sbjct: 381 -AMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419 >gi|7486516|pir||T08980 hypothetical protein F6G3.70 - Arabidopsis thaliana >gi|4938480|emb|CAB43839.1| (AL078464) putative protein [Arabidopsis thaliana] >gi|7269904|emb|CAB80997.1| (AL161576) putative protein [Arabidopsis thaliana] Length = 427 Frame 3 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | | 427 0 150 300 Plus Strand HSPs: Score = 95 (33.4 bits), Expect = 0.52, P = 0.41 Identities = 37/153 (24%), Positives = 68/153 (44%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKG---TVIDSGTTLAYLPAIVYDEL---IQKVLAR 167 V +++I VD IL + +F+ + G T+ID+G +L L Y L I+ + Sbjct: 275 VTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEG 334 Query: 168 QPGLKLYLVEQQFRCFLYTGNVDR-----GFPVVKLHFKDSLSLTVYPHDYLFQFKDGIW 332 + + + Y GN +R GFP+V HF + L++ + ++ Sbjct: 335 RFTAADVSQDDMIKMECYNGNFERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSPNVF 394 Query: 333 CIGWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGW 455 C+ A T ++ +G + + YDLE M + + Sbjct: 395 CL------AVTPG--NLNSIGATAQQSYNIGYDLEAMEVSF 427 >gi|11514162|pdb|1FKN|A Chain A, Structure Of Beta-Secretase Complexed With Inhibitor >gi|11514163|pdb|1FKN|B Chain B, Structure Of Beta-Secretase Complexed With Inhibitor Length = 391 Frame 3 hits (HSPs): _____________________ __________________________________________________ Database sequence: | | | | 391 0 150 300 Plus Strand HSPs: Score = 94 (33.1 bits), Expect = 0.60, P = 0.45 Identities = 36/159 (22%), Positives = 75/159 (47%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL 185 V++ +E++ L++ + N +++DSGTT LP V++ ++ + A K Sbjct: 207 VIIVRVEINGQDLKMDCKEY---NYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKF 263 Query: 186 ---YLVEQQFRCFLYTGNVDRG-FPVVKLHF-----KDSLSLTVYPHDYLFQFKDGIWCI 338 + + +Q C+ G FPV+ L+ S +T+ P YL +D Sbjct: 264 PDGFWLGEQLVCW-QAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQ 322 Query: 339 G--WQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNC 470 ++ +++Q+ G T++G +++ V++D IG+ C Sbjct: 323 DDCYKFAISQSSTG---TVMGAVIMEGFYVVFDRARKRIGFAVSAC 365 >gi|6598808|gb|AAB80784.2| (AF024504) putative chloroplast nucleoid DNA-binding protein [Arabidopsis thaliana] Length = 473 Frame 3 hits (HSPs): _______________ __________________________________________________ Database sequence: | | | | | 473 0 150 300 450 Plus Strand HSPs: Score = 95 (33.4 bits), Expect = 0.61, P = 0.45 Identities = 36/134 (26%), Positives = 62/134 (46%), Frame = +3 Query: 90 VIDSGTTLAYLPAIVYDELIQKV--LARQPGLKLYLVEQQFR-CF-----LYTG----NV 233 V DSGT+ YL Y + + LA + E F C+ LY+G N Sbjct: 276 VFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPNK 335 Query: 234 DR-GFPVVKLHFKDSLSLTVYPHDYLFQFKD-GIWCIGWQRSVAQTKNGKDMTLLGDLVL 407 D +P V L K S VY + KD ++C+ + +D++++G + Sbjct: 336 DSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAIMKI-------EDISIIGQNFM 388 Query: 408 SNKLVIYDLEXMVIGWTDYNC 470 + V++D E +++GW + +C Sbjct: 389 TGYRVVFDREKLILGWKESDC 409 >gi|6330045|dbj|BAA86463.1| (AB032975) KIAA1149 protein [Homo sapiens] Length = 396 Frame 3 hits (HSPs): _____________________ __________________________________________________ Database sequence: | | | | 396 0 150 300 Plus Strand HSPs: Score = 94 (33.1 bits), Expect = 0.61, P = 0.46 Identities = 36/159 (22%), Positives = 75/159 (47%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL 185 V++ +E++ L++ + N +++DSGTT LP V++ ++ + A K Sbjct: 157 VIIVRVEINGQDLKMDCKEY---NYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKF 213 Query: 186 ---YLVEQQFRCFLYTGNVDRG-FPVVKLHF-----KDSLSLTVYPHDYLFQFKDGIWCI 338 + + +Q C+ G FPV+ L+ S +T+ P YL +D Sbjct: 214 PDGFWLGEQLVCW-QAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQ 272 Query: 339 G--WQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNC 470 ++ +++Q+ G T++G +++ V++D IG+ C Sbjct: 273 DDCYKFAISQSSTG---TVMGAVIMEGFYVVFDRARKRIGFAVSAC 315 >gi|9506421 ref|NP_062077.1| beta-site APP cleaving enzyme [Rattus norvegicus] >gi|6685233|sp|P56819|BACE_RAT BETA-SECRETASE PRECURSOR (BETA-SITE APP CLEAVING ENZYME) (BETA-SITE AMYLOID PRECURSOR PROTEIN CLEAVING ENZYME) (ASPARTYL PROTEASE 2) (ASP 2) (ASP2) (MEMBRANE-ASSOCIATED ASPARTIC PROTEASE 2) (MEMAPSIN-2) >gi|6118543|gb|AAF04144.1|AF190727_1 (AF190727) beta-site APP cleaving enzyme [Rattus norvegicus] Length = 501 Frame 3 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | 501 0 150 300 450 Plus Strand HSPs: Score = 95 (33.4 bits), Expect = 0.65, P = 0.48 Identities = 37/159 (23%), Positives = 75/159 (47%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL 185 V++ +E++ L++ + N +++DSGTT LP V++ ++ + A K Sbjct: 262 VIIVRVEINGQDLKMDCKEY---NYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKF 318 Query: 186 ---YLVEQQFRCFLYTGNVDRG-FPVVKLHF-----KDSLSLTVYPHDYLFQFKDGIWCI 338 + + +Q C+ G FPV+ L+ S +T+ P YL +D Sbjct: 319 PDGFWLGEQLVCW-QAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQ 377 Query: 339 G--WQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNC 470 ++ +V+Q+ G T++G +++ V++D IG+ C Sbjct: 378 DDCYKFAVSQSSTG---TVMGAVIMEGFYVVFDRARKRIGFAVSAC 420 >gi|6857759 ref|NP_035922.2| beta-site APP cleaving enzyme; APP beta-secretase [Mus musculus] >gi|6561820|gb|AAF17082.1| (AF200346) aspartyl protease 2 [Mus musculus] >gi|6760477|gb|AAF04143.2|AF190726_1 (AF190726) beta-site APP cleaving enzyme [Mus musculus] Length = 501 Frame 3 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | 501 0 150 300 450 Plus Strand HSPs: Score = 95 (33.4 bits), Expect = 0.65, P = 0.48 Identities = 37/159 (23%), Positives = 75/159 (47%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL 185 V++ +E++ L++ + N +++DSGTT LP V++ ++ + A K Sbjct: 262 VIIVRVEINGQDLKMDCKEY---NYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKF 318 Query: 186 ---YLVEQQFRCFLYTGNVDRG-FPVVKLHF-----KDSLSLTVYPHDYLFQFKDGIWCI 338 + + +Q C+ G FPV+ L+ S +T+ P YL +D Sbjct: 319 PDGFWLGEQLVCW-QAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQ 377 Query: 339 G--WQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNC 470 ++ +V+Q+ G T++G +++ V++D IG+ C Sbjct: 378 DDCYKFAVSQSSTG---TVMGAVIMEGFYVVFDRARKRIGFAVSAC 420 >gi|6685249|sp|P56818|BACE_MOUSE BETA-SECRETASE PRECURSOR (BETA-SITE APP CLEAVING ENZYME) (BETA-SITE AMYLOID PRECURSOR PROTEIN CLEAVING ENZYME) (ASPARTYL PROTEASE 2) (ASP 2) (ASP2) (MEMBRANE-ASSOCIATED ASPARTIC PROTEASE 2) (MEMAPSIN-2) Length = 501 Frame 3 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | 501 0 150 300 450 Plus Strand HSPs: Score = 95 (33.4 bits), Expect = 0.65, P = 0.48 Identities = 37/159 (23%), Positives = 75/159 (47%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL 185 V++ +E++ L++ + N +++DSGTT LP V++ ++ + A K Sbjct: 262 VIIVRVEINGQDLKMDCKEY---NYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKF 318 Query: 186 ---YLVEQQFRCFLYTGNVDRG-FPVVKLHF-----KDSLSLTVYPHDYLFQFKDGIWCI 338 + + +Q C+ G FPV+ L+ S +T+ P YL +D Sbjct: 319 PDGFWLGEQLVCW-QAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQ 377 Query: 339 G--WQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNC 470 ++ +V+Q+ G T++G +++ V++D IG+ C Sbjct: 378 DDCYKFAVSQSSTG---TVMGAVIMEGFYVVFDRARKRIGFAVSAC 420 >gi|12736860 ref|XP_006430.2| beta-site APP-cleaving enzyme [Homo sapiens] Length = 423 Frame 3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 423 0 150 300 Plus Strand HSPs: Score = 94 (33.1 bits), Expect = 0.67, P = 0.49 Identities = 36/159 (22%), Positives = 75/159 (47%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL 185 V++ +E++ L++ + N +++DSGTT LP V++ ++ + A K Sbjct: 184 VIIVRVEINGQDLKMDCKEY---NYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKF 240 Query: 186 ---YLVEQQFRCFLYTGNVDRG-FPVVKLHF-----KDSLSLTVYPHDYLFQFKDGIWCI 338 + + +Q C+ G FPV+ L+ S +T+ P YL +D Sbjct: 241 PDGFWLGEQLVCW-QAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQ 299 Query: 339 G--WQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNC 470 ++ +++Q+ G T++G +++ V++D IG+ C Sbjct: 300 DDCYKFAISQSSTG---TVMGAVIMEGFYVVFDRARKRIGFAVSAC 342 >gi|2664292|emb|CAA75754.1| (Y15744) cellular aspartic protease [Aspergillus fumigatus] >gi|4200293|emb|CAA10674.1| (AJ132504) aspartic protease [Aspergillus fumigatus] Length = 398 Frame 3 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | 398 0 150 300 Plus Strand HSPs: Score = 93 (32.7 bits), Expect = 0.79, P = 0.55 Identities = 37/143 (25%), Positives = 65/143 (45%), Frame = +3 Query: 24 EVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKLYLVEQQ 203 EVD D + L ++ + N G ++D+GT+L LP+ + D L +++ A++ Y +E Sbjct: 264 EVDFDAIALGDNVAELEN-TGIILDTGTSLIALPSTLADLLNKEIGAKKGFTGQYSIECD 322 Query: 204 FRCFLYTGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDM 383 R L D F + +F T+ P+DY + + CI + + + Sbjct: 323 KRDSL----PDLTFTLAGHNF------TIGPYDYTLEVQGS--CISSFMGMDFPEPVGPL 370 Query: 384 TLLGDLVLSNKLVIYDLEXMVIG 452 +LGD L +YDL +G Sbjct: 371 AILGDAFLRKWYSVYDLGNNAVG 393 >gi|7770346|gb|AAF69716.1|AC016041_21 (AC016041) F27J15.15 [Arabidopsis thaliana] Length = 583 Frame 3 hits (HSPs): ______________ __________________________________________________ Database sequence: | | | | | 583 0 150 300 450 Plus Strand HSPs: Score = 95 (33.4 bits), Expect = 0.80, P = 0.55 Identities = 40/150 (26%), Positives = 64/150 (42%), Frame = +3 Query: 66 DSVNGK-GTVI-DSGTTLAYLPAIVYDELIQKVLARQPGLKLYLVEQQ------FRC--- 212 D NG+ G V+ D+G++ Y P Y +L+ L GL+L + +R Sbjct: 419 DGENGRVGKVLFDTGSSYTYFPNQAYSQLVTS-LQEVSGLELTRDDSDETLPICWRAKTN 477 Query: 213 --FLYTGNVDRGFPVVKLHFKD-----SLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKN 371 F +V + F + L S L + P DYL G C+G + + Sbjct: 478 FPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDG-SSVHD 536 Query: 372 GKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNC 470 G + +LGD+ + L++YD IGW +C Sbjct: 537 GSTI-ILGDISMRGHLIVYDNVKRRIGWMKSDC 568 >gi|6470293|gb|AAF13715.1|AF200193_1 (AF200193) memapsin 2 [Homo sapiens] Length = 488 Frame 3 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | | 488 0 150 300 450 Plus Strand HSPs: Score = 94 (33.1 bits), Expect = 0.82, P = 0.56 Identities = 36/159 (22%), Positives = 75/159 (47%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL 185 V++ +E++ L++ + N +++DSGTT LP V++ ++ + A K Sbjct: 249 VIIVRVEINGQDLKMDCKEY---NYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKF 305 Query: 186 ---YLVEQQFRCFLYTGNVDRG-FPVVKLHF-----KDSLSLTVYPHDYLFQFKDGIWCI 338 + + +Q C+ G FPV+ L+ S +T+ P YL +D Sbjct: 306 PDGFWLGEQLVCW-QAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQ 364 Query: 339 G--WQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNC 470 ++ +++Q+ G T++G +++ V++D IG+ C Sbjct: 365 DDCYKFAISQSSTG---TVMGAVIMEGFYVVFDRARKRIGFAVSAC 407 >gi|6912266 ref|NP_036236.1| beta-site APP-cleaving enzyme [Homo sapiens] >gi|6685248|sp|P56817|BACE_HUMAN BETA-SECRETASE PRECURSOR (BETA-SITE APP CLEAVING ENZYME) (BETA-SITE AMYLOID PRECURSOR PROTEIN CLEAVING ENZYME) (ASPARTYL PROTEASE 2) (ASP 2) (ASP2) (MEMBRANE-ASSOCIATED ASPARTIC PROTEASE 2) (MEMAPSIN-2) >gi|7435835|pir||A59090 aspartic proteinase (EC 3.4.23.-) BACE precursor - human >gi|6118539|gb|AAF04142.1|AF190725_1 (AF190725) beta-site APP cleaving enzyme [Homo sapiens] >gi|6561814|gb|AAF17079.1| (AF200343) aspartyl protease 2 [Homo sapiens] >gi|6601445|gb|AAF18982.1|AF201468_1 (AF201468) APP beta-secretase [Homo sapiens] >gi|6715310|gb|AAF26367.1|AF204943_1 (AF204943) transmembrane aspartic proteinase Asp 2 [Homo sapiens] Length = 501 Frame 3 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | 501 0 150 300 450 Plus Strand HSPs: Score = 94 (33.1 bits), Expect = 0.85, P = 0.57 Identities = 36/159 (22%), Positives = 75/159 (47%), Frame = +3 Query: 6 VVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSGTTLAYLPAIVYDELIQKVLARQPGLKL 185 V++ +E++ L++ + N +++DSGTT LP V++ ++ + A K Sbjct: 262 VIIVRVEINGQDLKMDCKEY---NYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKF 318 Query: 186 ---YLVEQQFRCFLYTGNVDRG-FPVVKLHF-----KDSLSLTVYPHDYLFQFKDGIWCI 338 + + +Q C+ G FPV+ L+ S +T+ P YL +D Sbjct: 319 PDGFWLGEQLVCW-QAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQ 377 Query: 339 G--WQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLEXMVIGWTDYNC 470 ++ +++Q+ G T++G +++ V++D IG+ C Sbjct: 378 DDCYKFAISQSSTG---TVMGAVIMEGFYVVFDRARKRIGFAVSAC 420 >gi|9665144|gb|AAF97328.1|AC023628_9 (AC023628) Unknown protein [Arabidopsis thaliana] Length = 485 Frame 3 hits (HSPs): ______________ __________________________________________________ Database sequence: | | | | | 485 0 150 300 450 Plus Strand HSPs: Score = 92 (32.4 bits), Expect = 1.4, P = 0.75 Identities = 37/136 (27%), Positives = 65/136 (47%), Frame = +3 Query: 66 DSVNGKGTVIDSGTTLAYL--PA-IVYDELIQ---KVLARQPGLKLYLVEQQFRCFLYTG 227 D + G +IDSGT++ L PA I + + K L R P L+ CF + Sbjct: 355 DQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDT-----CFDLSN 409 Query: 228 NVDRGFPVVKLHFKDSLSLTVYPHDYLFQFK-DGIWCIGWQRSVAQTKNGKDMTLLGDLV 404 + P V LHF+ + +++ +YL +G +C + A T G ++++G++ Sbjct: 410 MNEVKVPTVVLHFRGA-DVSLPATNYLIPVDTNGKFCFAF----AGTMGG--LSIIGNIQ 462 Query: 405 LSNKLVIYDLEXMVIGWTDYNCS 473 V+YDL +G+ C+ Sbjct: 463 QQGFRVVYDLASSRVGFAPGGCA 485 WARNING: HSPs involving 17 database sequences were not reported due to the limiting value of parameter B = 50. Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.95 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.336 0.150 0.470 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.362 0.161 0.676 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.373 0.171 0.683 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.345 0.152 0.514 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.346 0.152 0.500 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.344 0.147 0.478 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 225 221 10. 77 3 12 22 0.11 35 32 0.12 38 +2 0 225 221 10. 77 3 12 22 0.11 35 32 0.12 38 +1 0 226 224 10. 77 3 12 22 0.11 35 32 0.12 38 -1 0 226 222 10. 77 3 12 22 0.11 35 32 0.12 38 -2 0 225 222 10. 77 3 12 22 0.11 35 32 0.12 38 -3 0 225 222 10. 77 3 12 22 0.11 35 32 0.12 38 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 67 No. of states in DFA: 597 (59 KB) Total size of DFA: 251 KB (256 KB) Time to generate neighborhood: 0.02u 0.01s 0.03t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 226.94u 0.93s 227.87t Elapsed: 00:01:42 Total cpu time: 227.01u 0.97s 227.98t Elapsed: 00:01:42 Start: Wed Jan 16 21:21:47 2002 End: Wed Jan 16 21:23:29 2002 WARNINGS ISSUED: 2
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000