WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= A01g05 (914 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 505,245 sequences; 158,518,215 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 8 Sequences : less than 8 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 3091 438 |====================================================== 6310 2653 411 |=================================================== 3980 2242 355 |============================================ 2510 1887 396 |================================================= 1580 1491 278 |================================== 1000 1213 226 |============================ 631 987 222 |=========================== 398 765 141 |================= 251 624 134 |================ 158 490 122 |=============== 100 368 68 |======== 63.1 300 60 |======= 39.8 240 51 |====== 25.1 189 35 |==== 15.8 154 30 |=== >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 124 <<<<<<<<<<<<<<<<< 10.0 124 29 |=== 6.31 95 25 |=== 3.98 70 12 |= 2.51 58 7 |: 1.58 51 7 |: 1.00 44 12 |= 0.63 32 2 |: 0.40 30 2 |: 0.25 28 4 |: 0.16 24 1 |: 0.10 23 1 |: 0.063 22 2 |: 0.040 20 3 |: 0.025 17 1 |: 0.016 16 0 | 0.010 16 1 |: 0.0063 15 0 | 0.0040 15 1 |: 0.0025 14 0 | 0.0016 14 2 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|1172874|sp|Q08298|RD22_ARATHDEHYDRATION-RESPONSIVE... +1 244 9.7e-39 2 gi|7488713|pir||T08896Sali3-2 protein, aluminium-indu... +2 115 4.8e-09 3 gi|5257263|dbj|BAA81762.1|(AP000364) EST AU029260(E30... +1 122 1.4e-08 3 gi|5430766|gb|AAD43166.1|AC007504_21(AC007504) Putati... +1 140 2.1e-06 1 gi|7488443|pir||T07844BURP domain-containing protein ... +1 140 2.2e-06 1 gi|100102|pir||S14068seed protein precursor - tick be... +2 109 2.3e-06 2 gi|119096|sp|P21746|EA87_VICFAEMBRYONIC ABUNDANT PROT... +2 115 3.8e-05 2 gi|119095|sp|P21745|EA30_VICFAEMBRYONIC ABUNDANT PROT... +2 108 4.0e-05 2 gi|119097|sp|P21747|EA92_VICFAEMBRYONIC ABUNDANT PROT... +2 109 4.9e-05 2 gi|7488154|pir||T02289probable polygalacturonase (EC ... +1 130 0.00014 1 gi|7489055|pir||T07426probable polygalacturonase (EC ... +1 127 0.00031 1 gi|7489056|pir||T07587probable polygalacturonase (EC ... +1 127 0.00031 1 gi|135067|sp|P09059|SVF3_VICFAUNKNOWN SEED PROTEIN 30... +2 102 0.0014 2 gi|320585|pir||JQ1670polygalacturonase (EC 3.2.1.15) ... +1 121 0.0014 1 gi|7488920|pir||T14306glycine-rich protein - carrot (... +1 93 0.0038 1 gi|100864|pir||S08315cell wall protein - maize (fragm... -3 90 0.0081 1 gi|485515|pir||S33622ADR6 protein - soybean >gi|29644... +2 67 0.023 3 gi|1762584|gb|AAB39546.1|(U63373) polygalacturonase i... +1 109 0.031 1 gi|7488153|pir||T01485probable polygalacturonase (EC ... +1 109 0.031 1 gi|1914851|gb|AAC53191.1|(U92454) WW domain binding p... -3 84 0.036 1 gi|4056458|gb|AAC98031.1|(AC005990) Identical to gb|A... +1 108 0.040 1 gi|1175115|sp|P41479|Y091_NPVACHYPOTHETICAL 24.1 KD P... -3 101 0.042 1 gi|168457|gb|AAA33456.1|(M36913) cell wall protein (p... -3 81 0.075 1 gi|462345|sp|P35310|HSP1_PONPYSPERM PROTAMINE P1 (CYS... +1 79 0.12 1 gi|7299574|gb|AAF54759.1|(AE003695) CG14742 gene prod... +2 77 0.19 1 gi|7305409ref|NP_038665.1| protamine 1 >gi|123692|sp|... +1 77 0.20 1 gi|123695|sp|P10118|HSP1_RATSPERM PROTAMINE P1 (CYSTE... +1 77 0.20 1 gi|123696|sp|P24714|HSP1_SAGIMSPERM PROTAMINE P1 (CYS... +1 77 0.20 1 gi|100216|pir||S14975extensin class II (clone uJ-2) -... -3 76 0.24 1 gi|1762428|gb|AAB39538.1|(U59467) aromatic rich glyco... +1 100 0.27 1 gi|82698|pir||JQ0985hydroxyproline-rich glycoprotein ... -3 94 0.44 1 gi|7008009|dbj|BAA90877.1|(AB031227) PsAD1 [Pisum sat... +2 73 0.44 1 gi|7290578|gb|AAF46028.1|(AE003434) CG12732 gene prod... -3 86 0.55 1 gi|283045|pir||S28264hydroxyproline-rich glycoprotein... -3 92 0.58 1 gi|7292503|gb|AAF47906.1|(AE003481) CG15023 gene prod... -3 87 0.61 1 gi|3025832|gb|AAC12760.1|(AF055985) pyrrolidone-rich ... -3 71 0.62 1 gi|7296368|gb|AAF51657.1|(AE003592) CG11458 gene prod... +2 71 0.62 1 gi|123699|sp|P15342|HSP2_HORSESPERM HISTONE P2A (ST2A... +1 71 0.63 1 gi|123697|sp|P04102|HSP1_SHEEPSPERM PROTAMINE P1 (CYS... +1 71 0.63 1 gi|123694|sp|P10119|HSP1_RABITSPERM PROTAMINE P1 (CYS... +1 71 0.63 1 gi|123704|sp|P15343|HSP3_HORSESPERM HISTONE P2B (ST2B... +1 71 0.63 1 gi|108389|pir||S21673protamine 2 - pig +1 71 0.63 1 gi|462336|sp|P35302|HSP1_ALOSESPERM PROTAMINE P1 >gi|... +1 71 0.63 1 gi|5306259|gb|AAD41991.1|AC006233_18(AC006233) hypoth... -3 83 0.63 1 gi|228938|prf||1814452CHyp-rich glycoprotein [Zea dip... -3 92 0.66 1 gi|283032|pir||S22456hydroxyproline-rich glycoprotein... -3 92 0.67 1 gi|5902409|gb|AAD55511.1|AC008148_21(AC008148) Unknow... -3 83 0.69 1 gi|1076719|pir||S53051glycine rich protein - barley (... +2 70 0.71 1 gi|462341|sp|P35306|HSP1_HYLLASPERM PROTAMINE P1 (CYS... +1 70 0.72 1 gi|462337|sp|P35304|HSP1_CAVPOSPERM PROTAMINE P1 (CYS... +1 70 0.72 1 Locally-aligned regions (HSPs) with respect to query sequence: Locus_ID Frame 3 Hits gi|5257263 | ____ __________________________________________________ Query sequence: | | | | | | || 305 0 50 100 150 200 250 300 Locus_ID Frame 2 Hits gi|1172874 | ____________________ gi|7488713 | _________ gi|5257263 | _____________ gi|100102 | __________ gi|119096 | _________ gi|119095 | _________ gi|119097 | _________ gi|135067 | _________ gi|485515 | ________ gi|7299574 | ________ gi|7008009 | ________ gi|7296368 | _______ gi|1076719 | _______ Prosite Hits: ______ __ __________________________________________________ Query sequence: | | | | | | || 305 0 50 100 150 200 250 300 __________________ Prosite hits: LEUCINE_ZIPPER Leucine zipper pattern. 7..28 LEUCINE_ZIPPER Leucine zipper pattern. 14..35 TYR_PHOSPHO_SITE Tyrosine kinase phosphorylation site. 63..70 __________________ Locus_ID Frame 1 Hits gi|1172874 | ______________________ gi|7488713 | ______ _________ gi|5257263 | _________________ gi|5430766 | ___________________ gi|7488443 | ___________________ gi|100102 | __________ gi|119096 | __________ gi|119095 | __________ gi|119097 | __________ gi|7488154 | ___________________ gi|7489055 | _________________ gi|7489056 | _________________ gi|135067 | __________ gi|320585 | _________________ gi|7488920 | _________ gi|485515 | _____ _________ gi|1762584 | _________________ gi|7488153 | _________________ gi|4056458 | _________________ gi|462345 | ________ gi|7305409 | ________ gi|123695 | ________ gi|123696 | ________ gi|1762428 | _________________ gi|123699 | ________ gi|123697 | _______ gi|123694 | ________ gi|123704 | ________ gi|108389 | ______ gi|462336 | _______ gi|462341 | _______ gi|462337 | ________ Prosite Hits: ___ __________________________________________________ Query sequence: | | | | | | || 305 0 50 100 150 200 250 300 __________________ Prosite hits: PROKAR_LIPOPROTEIN Prokaryotic membrane lipoprotein lipid a 28..38 __________________ Locus_ID Frame -3 Hits gi|100864 | _______ gi|1914851 | ________ gi|1175115 | _______ gi|168457 | _______ gi|100216 | ________ gi|82698 | _______ gi|7290578 | ______ gi|283045 | _______ gi|7292503 | _______ gi|3025832 | _______ gi|5306259 | _______ gi|228938 | _______ gi|283032 | _______ gi|5902409 | _______ __________________________________________________ Query sequence: | | | | | | || 305 0 50 100 150 200 250 300
Use the and icons to retrieve links to Entrez:
WARNING: Descriptions of 74 database sequences were not reported due to the limiting value of parameter V = 50. >gi|1172874|sp|Q08298|RD22_ARATH DEHYDRATION-RESPONSIVE PROTEIN RD22 PRECURSOR >gi|479589|pir||S34823 dehydration-induced protein RD22 - Arabidopsis thaliana >gi|391608|dbj|BAA01546.1| (D10703) rd22 [Arabidopsis thaliana] >gi|447134|prf||1913421A rd22 gene [Arabidopsis thaliana] Length = 392 Frame 2 hits (HSPs): ___________________ Frame 1 hits (HSPs): _________________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | 392 0 150 300 __________________ Annotated Domains: DOMO DM02982: 118..391 Entrez Domain: 5 X APPROXIMATE REPEATS. 57..164 Entrez Repetitive region: 1. 57..75 Entrez Repetitive region: 2. 78..97 Entrez Repetitive region: 3. 100..120 Entrez Repetitive region: 4. 125..142 Entrez Repetitive region: 5. 145..164 PRODOM PD122867: RD22_ARATH 1..167 PRODOM PD004669: 169..390 __________________ Plus Strand HSPs: Score = 244 (85.9 bits), Expect = 9.7e-39, Sum P(2) = 9.7e-39 Identities = 58/129 (44%), Positives = 72/129 (55%), Frame = +1 Query: 421 RETQLHDDPNVALFFLEKDLHPGTKLNLHFTTSSNI--QATFLPRQVADSIPFSSNKVEV 594 +ETQLHDDPN ALFFLEKDL G ++N+ F + FLPR A+++PF S K Sbjct: 163 KETQLHDDPNAALFFLEKDLVRGKEMNVRFNAEDGYGGKTAFLPRGEAETVPFGSEKFSE 222 Query: 595 VFNKFSVKPGSEQAXSXK--ILSXNFKKVASXXKEKXCAPSXESMNDFXTSXLGKNXKXF 768 +FSV+ GSE+A K I +KV+ +EK CA S ESM DF S LGK Sbjct: 223 TLKRFSVEAGSEEAEMMKKTIEECEARKVSG--EEKYCATSLESMVDFSVSKLGKYHVRA 280 Query: 769 PXIXEXKKRIP 801 KK P Sbjct: 281 VSTEVAKKNAP 291 Score = 200 (70.4 bits), Expect = 9.7e-39, Sum P(2) = 9.7e-39 Identities = 58/142 (40%), Positives = 72/142 (50%), Frame = +2 Query: 86 LPIFTLL-NLALVAIHAALPPEVYWKSVLPTTPMPKAITDILYPDWVEXKSTSVNVGGKG 262 LP+ LL + +VAI A L PE YW + LP TP+P ++ ++L D+ + KST+V VG G Sbjct: 5 LPLICLLGSFMVVAIAADLTPERYWSTALPNTPIPNSLHNLLTFDFTDEKSTNVQVGKGG 64 Query: 263 VNV--HAGKGGGGTNVNVG-----------------------GKGSGGGVNVHAGHKGKA 367 VNV H GK G GT VNVG GKG GGGV VH G GK Sbjct: 65 VNVNTHKGKTGSGTAVNVGKGGVRVDTGKGKPGGGTHVSVGSGKGHGGGVAVHTGKPGKR 124 Query: 368 XACFCWLKVSIQLHLRFNGRLNY 436 K + +H R GR Y Sbjct: 125 TDVGVG-KGGVTVHTRHKGRPIY 146 Score = 143 (50.3 bits), Expect = 9.2e-33, Sum P(2) = 9.2e-33 Identities = 36/69 (52%), Positives = 41/69 (59%), Frame = +2 Query: 236 TSVNVGGKGVNVHAGKG--GGGTNVNVG-GKGSGGGVNVHAGHKGKAXACFCWLKVSIQL 406 T+VNVG GV V GKG GGGT+V+VG GKG GGGV VH G GK K + + Sbjct: 78 TAVNVGKGGVRVDTGKGKPGGGTHVSVGSGKGHGGGVAVHTGKPGKRTDVGVG-KGGVTV 136 Query: 407 HLRFNGRLNY 436 H R GR Y Sbjct: 137 HTRHKGRPIY 146 Score = 87 (30.6 bits), Expect = 6.8e-27, Sum P(2) = 6.8e-27 Identities = 25/44 (56%), Positives = 28/44 (63%), Frame = +2 Query: 236 TSVNVG-GKG----VNVHAGKGGGGTNVNVGGKGSGGGVNVHAGHKGK 364 T V+VG GKG V VH GK G T+V VG GGV VH HKG+ Sbjct: 100 THVSVGSGKGHGGGVAVHTGKPGKRTDVGVGK----GGVTVHTRHKGR 143 Score = 43 (15.1 bits), Expect = 2.7e-22, Sum P(2) = 2.7e-22 Identities = 12/27 (44%), Positives = 12/27 (44%), Frame = +2 Query: 230 KSTSVNVGGKGVNVHAGKGGGGTNVNV 310 K T V VG GV VH G V V Sbjct: 123 KRTDVGVGKGGVTVHTRHKGRPIYVGV 149 >gi|7488713|pir||T08896 Sali3-2 protein, aluminium-induced - soybean >gi|2317900|gb|AAB66369.1| (U89693) Sali3-2 [Glycine max] Length = 276 Frame 2 hits (HSPs): __________ Frame 1 hits (HSPs): ______ _________ __________________________________________________ Database sequence: | | | | | | | 276 0 50 100 150 200 250 Plus Strand HSPs: Score = 115 (40.5 bits), Expect = 4.8e-09, Sum P(3) = 4.8e-09 Identities = 26/53 (49%), Positives = 36/53 (67%), Frame = +2 Query: 71 MEYRLLPI-FTLL-NLALVA---IHAALPPEVYWKSVLPTTPMPKAITDILYP 214 ME+R I FT+L +LAL +HA+LP E YW++V P TP+P A+ D+L P Sbjct: 1 MEFRCSVISFTILFSLALAGESHVHASLPEEDYWEAVWPNTPIPTALRDVLKP 53 Score = 61 (21.5 bits), Expect = 4.8e-09, Sum P(3) = 4.8e-09 Identities = 12/32 (37%), Positives = 17/32 (53%), Frame = +1 Query: 418 QRETQLHDDPNVALFFLEKDLHPGTKLNLHFT 513 Q Q+ D FF ++DLHPG + + FT Sbjct: 62 QLPKQIDDTQYPKTFFYKEDLHPGKTMKVQFT 93 Score = 60 (21.1 bits), Expect = 4.8e-09, Sum P(3) = 4.8e-09 Identities = 17/48 (35%), Positives = 25/48 (52%), Frame = +1 Query: 613 VKPGSEQAXSXKILSXNFKKVASXXKEKXCAPSXESMNDFXTSXLGKN 756 +K S++ S + + KK A +EK CA S ++ F S LGKN Sbjct: 110 IKDTSKEGYSFEEIC--IKKEAFEGEEKFCAKSLGTVIGFAISKLGKN 155 >gi|5257263|dbj|BAA81762.1| (AP000364) EST AU029260(E30024) corresponds to a region of the predicted gene.; Similar to Arabidopsis thaliana rd22 gene. (D10703) [Oryza sativa] Length = 413 Frame 3 hits (HSPs): ___ Frame 2 hits (HSPs): ____ ________ Frame 1 hits (HSPs): _____________ __________________________________________________ Database sequence: | | | | 413 0 150 300 Plus Strand HSPs: Score = 122 (42.9 bits), Expect = 1.4e-08, Sum P(3) = 1.4e-08 Identities = 29/98 (29%), Positives = 49/98 (50%), Frame = +1 Query: 457 LFFLEKDLHPGTKLNLHFTTSSNIQATFLPRQVADSIPFSSNKVEVVFNKFSVKPGSEQA 636 +FF E+ + G +L +F ++ FLPR+VADSIPF++ + V F V P + +A Sbjct: 190 VFFHEEAVRVGERLPFYFPAATTSALGFLPRRVADSIPFTAAALPAVLALFGVAPDTAEA 249 Query: 637 XSXKILSXNFKKVASXXKEKXCAPSXESMNDFXTSXLG 750 + + + K CA S E++ + + LG Sbjct: 250 AGMRETLRTCEWPTLAGESKFCATSLEALVEGAMAALG 287 Score = 74 (26.0 bits), Expect = 1.4e-08, Sum P(3) = 1.4e-08 Identities = 12/24 (50%), Positives = 18/24 (75%), Frame = +2 Query: 146 EVYWKSVLPTTPMPKAITDILYPD 217 EV+W++VLP +P+P A +L PD Sbjct: 30 EVFWRAVLPESPLPDAFLRLLRPD 53 Score = 43 (15.1 bits), Expect = 1.4e-08, Sum P(3) = 1.4e-08 Identities = 14/54 (25%), Positives = 26/54 (48%), Frame = +2 Query: 209 YP-DWVEXKSTSVNVGGKGVNVHAGKGGG----GTNVNVGGKGSGGGVNVHAGHK 358 +P D+ + + + G+++ AG G G + + G+G GGG AG + Sbjct: 72 FPFDYTDYRGSDSPTTASGLDL-AGDFGEPAPFGYDYSAQGEGGGGGAAAAAGEQ 125 Score = 39 (13.7 bits), Expect = 3.6e-08, Sum P(3) = 3.6e-08 Identities = 6/15 (40%), Positives = 10/15 (66%), Frame = +3 Query: 384 GSKSPFNYIYASTGD 428 G +PF Y Y++ G+ Sbjct: 98 GEPAPFGYDYSAQGE 112 >gi|5430766|gb|AAD43166.1|AC007504_21 (AC007504) Putative BURP domain containing protein [Arabidopsis thaliana] Length = 280 Frame 1 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | | | | 280 0 50 100 150 200 250 Plus Strand HSPs: Score = 140 (49.3 bits), Expect = 2.1e-06, P = 2.1e-06 Identities = 35/107 (32%), Positives = 56/107 (52%), Frame = +1 Query: 439 DDPNVALFFLEKDLHPGTKLNLHFTTSSNIQAT--FLPRQVADSIPFSSNKVEVVFNKFS 612 DDP++ ++F DL GTKL ++F +++Q L RQ AD IPF+ +K++ + + FS Sbjct: 51 DDPSLYMYFTLNDLKLGTKLLIYFY-KNDLQKLPPLLTRQQADLIPFTKSKLDFLLDHFS 109 Query: 613 VKPGSEQAXSXKILSXNFKKVASXXKEKXCAPSXESMNDFXTSXLGKN 756 + S Q + K + A + K C S ES+ D +G N Sbjct: 110 ITKDSPQGKAIKETLGHCDAKAIEGEHKFCGTSLESLIDLVKKTMGYN 157 >gi|7488443|pir||T07844 BURP domain-containing protein - rape >gi|3098571|gb|AAC15700.1| (AF049028) BURP domain containing protein [Brassica napus] Length = 282 Frame 1 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | | | | 282 0 50 100 150 200 250 Plus Strand HSPs: Score = 140 (49.3 bits), Expect = 2.2e-06, P = 2.2e-06 Identities = 36/107 (33%), Positives = 54/107 (50%), Frame = +1 Query: 439 DDPNVALFFLEKDLHPGTKLNLHFTTSSNIQAT-FLPRQVADSIPFSSNKVEVVFNKFSV 615 +DP + +FF DL GTKL ++F + + L RQ AD IPFS + ++ + N FS+ Sbjct: 53 EDPTMYMFFKISDLKLGTKLPIYFNKNDLRKVPPLLTRQEADLIPFSESNLDFLLNHFSI 112 Query: 616 KPGSEQAXSXK--ILSXNFKKVASXXKEKXCAPSXESMNDFXTSXLGKN 756 S Q + K + +FK + + K C S ESM D + N Sbjct: 113 SKDSPQGKAMKETLKRCDFKAIEG--EYKFCGTSLESMLDLAKKTIASN 159 >gi|100102|pir||S14068 seed protein precursor - tick bean >gi|22043|emb|CAA39696.1| (X56240) unknown seed protein [Vicia faba] Length = 268 Frame 2 hits (HSPs): __________ Frame 1 hits (HSPs): ___________ Annotated Domains: ____ __________________________________________________ Database sequence: | | | | | | | 268 0 50 100 150 200 250 __________________ Annotated Domains: Entrez domain: signal sequence 1..22 __________________ Plus Strand HSPs: Score = 109 (38.4 bits), Expect = 2.3e-06, Sum P(2) = 2.3e-06 Identities = 21/51 (41%), Positives = 30/51 (58%), Frame = +2 Query: 71 MEYRLLPIFTLLNLALVAIHAALPPEVYWKSVLPTTPMPKAITDILYPDWV 223 ME+ L + +L LA V I A E YW+S+ P TP+PK +D+L P + Sbjct: 1 MEFAHLTVLSLFCLAFVGITATSSGEDYWQSIWPNTPLPKTFSDLLIPSGI 51 Score = 70 (24.6 bits), Expect = 2.3e-06, Sum P(2) = 2.3e-06 Identities = 25/56 (44%), Positives = 29/56 (51%), Frame = +1 Query: 457 LFFLEKDLHPGTKLNLHFTTS--SNIQATFLPRQ-VADSIPFSSNKVEVVFNKFSVKP 621 LFF E DLHPG NL T S S I+ RQ V DSI + +NK + F P Sbjct: 68 LFF-EHDLHPGKNFNLGHTNSVGSIIRPFTKSRQGVTDSI-WLANKEKQSLEDFCYSP 123 >gi|119096|sp|P21746|EA87_VICFA EMBRYONIC ABUNDANT PROTEIN USP87 PRECURSOR >gi|82001|pir||S04135 embryonic abundant protein precursor (clone pUSP87) - tick bean >gi|22049|emb|CAA31603.1| (X13211) USP precursor [Vicia faba] Length = 268 Frame 2 hits (HSPs): _________ Frame 1 hits (HSPs): ___________ Annotated Domains: _________________________________________________ __________________________________________________ Database sequence: | | | | | | | 268 0 50 100 150 200 250 __________________ Annotated Domains: Entrez Domain: 3 X 6 AA APPROXIMATE REPEATS. 50..106 Entrez Repetitive region: 1-1. 50..55 Entrez Repetitive region: 1-2. 83..88 Entrez Repetitive region: 1-3. 101..106 Entrez Domain: 2 X APPROXIMATE REPEATS. 166..222 Entrez Repetitive region: 2-1. 166..183 Entrez Repetitive region: 2-2. 202..222 Entrez glycosylation site: POTENTIAL. 259 PRODOM PD009870: 1..79 PRODOM PD013884: 86..126 PRODOM PD004669: 128..256 __________________ Plus Strand HSPs: Score = 115 (40.5 bits), Expect = 3.8e-05, Sum P(2) = 3.8e-05 Identities = 22/48 (45%), Positives = 30/48 (62%), Frame = +2 Query: 71 MEYRLLPIFTLLNLALVAIHAALPPEVYWKSVLPTTPMPKAITDILYP 214 ME+ L + +L LA V I A P E YW+S+ P TP+PK +D+L P Sbjct: 1 MEFAHLTVLSLFCLAFVGITATSPREDYWQSIWPNTPLPKTFSDMLIP 48 Score = 51 (18.0 bits), Expect = 3.8e-05, Sum P(2) = 3.8e-05 Identities = 23/56 (41%), Positives = 27/56 (48%), Frame = +1 Query: 457 LFFLEKDLHPGTKLNLHFTTS--SNIQATFLPRQ-VADSIPFSSNKVEVVFNKFSVKP 621 LFF E DLHP L T S S I+ RQ V DSI + +NK + F P Sbjct: 68 LFF-EHDLHPRKNFILGNTNSVGSIIRPFTKSRQGVTDSI-WLANKEKQSLEDFCYSP 123 >gi|119095|sp|P21745|EA30_VICFA EMBRYONIC ABUNDANT PROTEIN VF30.1 PRECURSOR >gi|82003|pir||S05471 embryonic abundant protein precursor (clone USP Vf30.1) - tick bean Length = 268 Frame 2 hits (HSPs): _________ Frame 1 hits (HSPs): ___________ Annotated Domains: _________________________________________________ __________________________________________________ Database sequence: | | | | | | | 268 0 50 100 150 200 250 __________________ Annotated Domains: DOMO DM02982: 17..258 Entrez Domain: 3 X 6 AA APPROXIMATE REPEATS. 50..106 Entrez Repetitive region: 1-1. 50..55 Entrez Repetitive region: 1-2. 83..88 Entrez Repetitive region: 1-3. 101..106 Entrez Domain: 2 X APPROXIMATE REPEATS. 166..222 Entrez Repetitive region: 2-1. 166..183 Entrez Repetitive region: 2-2. 202..222 Entrez glycosylation site: POTENTIAL. 259 PRODOM PD009870: 1..79 PRODOM PD013884: 86..126 PRODOM PD004669: 128..256 __________________ Plus Strand HSPs: Score = 108 (38.0 bits), Expect = 4.0e-05, Sum P(2) = 4.0e-05 Identities = 21/48 (43%), Positives = 29/48 (60%), Frame = +2 Query: 71 MEYRLLPIFTLLNLALVAIHAALPPEVYWKSVLPTTPMPKAITDILYP 214 ME+ L + +L LA V I A E YW+S+ P TP+PK +D+L P Sbjct: 1 MEFAHLTVLSLFCLAFVGITATSSGEDYWQSIWPNTPLPKTFSDLLIP 48 Score = 59 (20.8 bits), Expect = 4.0e-05, Sum P(2) = 4.0e-05 Identities = 24/56 (42%), Positives = 28/56 (50%), Frame = +1 Query: 457 LFFLEKDLHPGTKLNLHFTTS--SNIQATFLPRQ-VADSIPFSSNKVEVVFNKFSVKP 621 LFF E DLHPG L T S S I+ RQ V DSI + +NK + F P Sbjct: 68 LFF-EHDLHPGKNFILGNTNSVGSIIRPFTKSRQGVTDSI-WLANKEKQSLEDFCYSP 123 >gi|119097|sp|P21747|EA92_VICFA EMBRYONIC ABUNDANT PROTEIN USP92 PRECURSOR >gi|82002|pir||S04136 embryonic abundant protein precursor (clone pUSP92) - tick bean >gi|22051|emb|CAA31602.1| (X13210) USP precursor [Vicia faba] Length = 268 Frame 2 hits (HSPs): _________ Frame 1 hits (HSPs): ___________ Annotated Domains: _________________________________________________ __________________________________________________ Database sequence: | | | | | | | 268 0 50 100 150 200 250 __________________ Annotated Domains: Entrez Domain: 3 X 6 AA APPROXIMATE REPEATS. 50..106 Entrez Repetitive region: 1-1. 50..55 Entrez Repetitive region: 1-2. 83..88 Entrez Repetitive region: 1-3. 101..106 Entrez Domain: 2 X APPROXIMATE REPEATS. 166..222 Entrez Repetitive region: 2-1. 166..183 Entrez Repetitive region: 2-2. 202..222 Entrez glycosylation site: POTENTIAL. 259 PRODOM PD009870: 1..79 PRODOM PD013884: 86..126 PRODOM PD004669: 128..256 __________________ Plus Strand HSPs: Score = 109 (38.4 bits), Expect = 4.9e-05, Sum P(2) = 4.9e-05 Identities = 21/48 (43%), Positives = 29/48 (60%), Frame = +2 Query: 71 MEYRLLPIFTLLNLALVAIHAALPPEVYWKSVLPTTPMPKAITDILYP 214 ME+ L + +L LA V I A E YW+S+ P TP+PK +D+L P Sbjct: 1 MEFAHLTVLSLFCLAFVGITATSSEEDYWQSIWPNTPLPKTFSDLLIP 48 Score = 57 (20.1 bits), Expect = 4.9e-05, Sum P(2) = 4.9e-05 Identities = 24/56 (42%), Positives = 28/56 (50%), Frame = +1 Query: 457 LFFLEKDLHPGTKLNLHFTTS--SNIQATFLPRQ-VADSIPFSSNKVEVVFNKFSVKP 621 LFF E DLHP L T S S I+ RQ V DSI + +NK + F F P Sbjct: 68 LFF-EHDLHPRKNFILGNTNSVGSIIRPFTKSRQGVTDSI-WLANKEKQSFEDFCYSP 123 >gi|7488154|pir||T02289 probable polygalacturonase (EC 3.2.1.15) 1 beta chain T13D8.26 - Arabidopsis thaliana >gi|3249081|gb|AAC24065.1| (AC004473) Strong similarity to AR0GP2 gene gb|1762634 from Lycopersicon esculentum. [Arabidopsis thaliana] Length = 624 Frame 1 hits (HSPs): __________ __________________________________________________ Database sequence: | | | | | | 624 0 150 300 450 600 Plus Strand HSPs: Score = 130 (45.8 bits), Expect = 0.00014, P = 0.00014 Identities = 33/115 (28%), Positives = 51/115 (44%), Frame = +1 Query: 460 FFLEKDLHPGTKLNLHFTTSSNIQATFLPRQVADSIPFSSNKVEVVFNKFSVKPGSEQAX 639 FF E L GT + + + TFLPR + ++PFSS+ + ++ F S A Sbjct: 409 FFREAMLKEGTLMQMPDIKDKMPKRTFLPRNIVKNLPFSSSTIGEIWRVFGAGENSSMAG 468 Query: 640 SXKILSXNFKKVASXXKEKXCAPSXESMNDFXTSXLGKNX--KXFPXIXEXKKRI 798 ++ AS + K C S E M DF TS LG+ + + KK++ Sbjct: 469 IISSAVSECERPASHGETKRCVGSAEDMIDFATSVLGRGVVVRTTENVVGSKKKV 523 >gi|7489055|pir||T07426 probable polygalacturonase (EC 3.2.1.15) 1 - tomato >gi|1762634|gb|AAB39556.1| (U64789) AROGP2 [Lycopersicon esculentum] Length = 629 Frame 1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | | 629 0 150 300 450 600 Plus Strand HSPs: Score = 127 (44.7 bits), Expect = 0.00031, P = 0.00031 Identities = 32/99 (32%), Positives = 48/99 (48%), Frame = +1 Query: 460 FFLEKDLHPGTKLNLHFTTSSNIQATFLPRQVADSIPFSSNKVEVVFNKFSVKPGSEQAX 639 FF EK L GT + + + +FLPR +A +PFS++K++ + F S+ A Sbjct: 414 FFREKMLKSGTIMPMPDIKDKMPKRSFLPRAIAAKLPFSTSKIDELKKIFHAANDSQVAK 473 Query: 640 SXKILSXNFKKVASXXKEKXCAPSXESMNDFXTSXLGKN 756 ++ S + K C S E M DF TS LG+N Sbjct: 474 MIGDALSECERAPSPGETKQCVNSAEDMIDFATSVLGRN 512 >gi|7489056|pir||T07587 probable polygalacturonase (EC 3.2.1.15) 1 - tomato >gi|1762636|gb|AAB39557.1| (U64790) AROGP3 [Lycopersicon esculentum] Length = 632 Frame 1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | | 632 0 150 300 450 600 Plus Strand HSPs: Score = 127 (44.7 bits), Expect = 0.00031, P = 0.00031 Identities = 32/99 (32%), Positives = 48/99 (48%), Frame = +1 Query: 460 FFLEKDLHPGTKLNLHFTTSSNIQATFLPRQVADSIPFSSNKVEVVFNKFSVKPGSEQAX 639 FF EK L GT + + + +FLPR +A +PFS++K++ + F S+ A Sbjct: 417 FFREKMLKSGTIMPMPDIKDKMPKRSFLPRAIAAKLPFSTSKIDELKKIFHAANDSQVAK 476 Query: 640 SXKILSXNFKKVASXXKEKXCAPSXESMNDFXTSXLGKN 756 ++ S + K C S E M DF TS LG+N Sbjct: 477 MIGDALSECERAPSPGETKQCVNSAEDMIDFATSVLGRN 515 >gi|135067|sp|P09059|SVF3_VICFA UNKNOWN SEED PROTEIN 30.1 PRECURSOR (VF30.1) >gi|82000|pir||S03328 embryonic abundant protein precursor (clone pUSP14) - tick bean >gi|22046|emb|CAA31626.1| (X13242) seed protein [Vicia faba] Length = 268 Frame 2 hits (HSPs): _________ Frame 1 hits (HSPs): ___________ Annotated Domains: ________________________________________________ __________________________________________________ Database sequence: | | | | | | | 268 0 50 100 150 200 250 __________________ Annotated Domains: PRODOM PD009870: 1..79 PRODOM PD013884: 86..126 PRODOM PD004669: 128..256 __________________ Plus Strand HSPs: Score = 102 (35.9 bits), Expect = 0.0014, Sum P(2) = 0.0014 Identities = 20/48 (41%), Positives = 28/48 (58%), Frame = +2 Query: 71 MEYRLLPIFTLLNLALVAIHAALPPEVYWKSVLPTTPMPKAITDILYP 214 ME+ L + +L LA V I A E YW+S+ P TP+PK +D+ P Sbjct: 1 MEFAHLTVLSLFCLAFVGITATSSGEDYWQSIWPNTPLPKTFSDLSIP 48 Score = 51 (18.0 bits), Expect = 0.0014, Sum P(2) = 0.0014 Identities = 23/56 (41%), Positives = 27/56 (48%), Frame = +1 Query: 457 LFFLEKDLHPGTKLNLHFTTS--SNIQATFLPRQ-VADSIPFSSNKVEVVFNKFSVKP 621 LFF E DLHP L T S S I+ RQ V DSI + +NK + F P Sbjct: 68 LFF-EHDLHPRKNFILGNTNSVGSIIRPFTKSRQGVTDSI-WLANKEKQSLEDFCYSP 123 >gi|320585|pir||JQ1670 polygalacturonase (EC 3.2.1.15) 1 beta chain precursor - tomato >gi|170480|gb|AAA34181.1| (M98466) polygalacturonase isoenzyme 1 beta subunit [Lycopersicon esculentum] >gi|1762586|gb|AAB39547.1| (U63374) polygalacturonase isoenzyme 1 beta subunit [Lycopersicon esculentum] Length = 630 Frame 1 hits (HSPs): _________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | | | 630 0 150 300 450 600 __________________ Annotated Domains: DOMO DM02622: 135..173 DOMO DM02622: 175..215 DOMO DM02622: 217..257 DOMO DM02622: 259..306 DOMO DM02622: 308..353 DOMO DM02982: 355..628 Entrez domain: signal sequence 1..30 Entrez domain: amino-terminal propeptide 31..108 Entrez domain: carboxyl-terminal propeptide 398..630 Entrez binding site: carbohydrate (Asn) (covale 124 Entrez binding site: carbohydrate (Asn) (covale 142 Entrez binding site: carbohydrate (Asn) (covale 256 Entrez binding site: carbohydrate (Asn) (covale 334 Entrez binding site: carbohydrate (Asn) (covale 369 Entrez binding site: carbohydrate (Asn) (covale 387 __________________ Plus Strand HSPs: Score = 121 (42.6 bits), Expect = 0.0014, P = 0.0014 Identities = 31/99 (31%), Positives = 46/99 (46%), Frame = +1 Query: 460 FFLEKDLHPGTKLNLHFTTSSNIQATFLPRQVADSIPFSSNKVEVVFNKFSVKPGSEQAX 639 FF EK L GT + + + +FLPR +A +PFS++K+ + F S+ Sbjct: 415 FFREKMLKSGTIMPMPDIKDKMPKRSFLPRVIASKLPFSTSKIAELKKIFHAGDESQVEK 474 Query: 640 SXKILSXNFKKVASXXKEKXCAPSXESMNDFXTSXLGKN 756 ++ S + K C S E M DF TS LG+N Sbjct: 475 MIGDALSECERAPSAGETKRCVNSAEDMIDFATSVLGRN 513 >gi|7488920|pir||T14306 glycine-rich protein - carrot (fragment) >gi|1276971|gb|AAB01097.1| (U47097) glycine-rich protein [Daucus carota] Length = 111 Frame 1 hits (HSPs): ________________________ __________________________________________________ Database sequence: | | | | 111 0 50 100 Plus Strand HSPs: Score = 93 (32.7 bits), Expect = 0.0038, P = 0.0038 Identities = 23/52 (44%), Positives = 28/52 (53%), Frame = +1 Query: 211 PRLGGKXKYLSECWRQGRKRA-CRKRRR--WHQCQR-WWKRIRRRRERA-CR 351 PR G + CW R R+ CR RRR W + +R WW+R RRR R CR Sbjct: 36 PRRGRWCWWRCRCWIWSRLRSWCRSRRRIRWRRRRRRWWRRRRRRSSRRWCR 87 >gi|100864|pir||S08315 cell wall protein - maize (fragment) >gi|22269|emb|CAA31860.1| (X13506) cell wall protein (108 AA) [Zea mays] >gi|168459|gb|AAA33457.1| (M36914) cell wall protein (put.); putative [Zea mays] Length = 108 Frame -3 hits (HSPs): ______________________________________ __________________________________________________ Database sequence: | | | | | | | 108 0 20 40 60 80 100 Minus Strand HSPs: Score = 90 (31.7 bits), Expect = 0.0081, P = 0.0081 Identities = 17/34 (50%), Positives = 18/34 (52%), Frame = -3 Query: 351 PACTFTPPPDPFPPTLTLVPPPPFPACTFTPLPP 250 P T P P P PPT T P PP T+TP PP Sbjct: 63 PPATKPPTPKPTPPTYTPTPKPPAKPPTYTPTPP 96 Score = 77 (27.1 bits), Expect = 0.64, P = 0.47 Identities = 16/31 (51%), Positives = 17/31 (54%), Frame = -3 Query: 333 PPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 P P P PPT T P PP P P PPT+T Sbjct: 16 PTPKPTPPTYTPSPKPPTPK----PTPPTYT 42 Score = 77 (27.1 bits), Expect = 0.64, P = 0.47 Identities = 16/31 (51%), Positives = 17/31 (54%), Frame = -3 Query: 333 PPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 P P P PPT T P PP P P PPT+T Sbjct: 32 PTPKPTPPTYTPSPKPPTPK----PTPPTYT 58 Score = 72 (25.3 bits), Expect = 7.6, P = 1.0 Identities = 16/32 (50%), Positives = 17/32 (53%), Frame = -3 Query: 333 PPPDPFPPTLTLVPPPPFPAC-TFTPLPPTFT 241 P P P PPT T P PP T P PPT+T Sbjct: 48 PTPKPTPPTYTPSPKPPATKPPTPKPTPPTYT 79 >gi|485515|pir||S33622 ADR6 protein - soybean >gi|296445|emb|CAA49340.1| (X69639) auxin down regulated [Glycine max] >gi|2304955|gb|AAB65592.1| (U64866) similar to ADR6 encoded by GenBank Accession Number X69639; aluminum induced [Glycine max] Length = 272 Frame 2 hits (HSPs): _______ Frame 1 hits (HSPs): ______ _________ Annotated Domains: _____________________________________________ __________________________________________________ Database sequence: | | | | | | | 272 0 50 100 150 200 250 __________________ Annotated Domains: DOMO DM02982: 18..258 __________________ Plus Strand HSPs: Score = 67 (23.6 bits), Expect = 0.023, Sum P(3) = 0.023 Identities = 17/41 (41%), Positives = 25/41 (60%), Frame = +2 Query: 92 IFTLLNLALVAIHAALPPEVYWKSVLPTTPMPKAITDILYP 214 +FTL LA + HA E +W +V P TP+P ++ D+L P Sbjct: 13 LFTL-GLARES-HAR--DEDFWHAVWPNTPIPSSLRDLLKP 49 Score = 59 (20.8 bits), Expect = 0.023, Sum P(3) = 0.023 Identities = 17/48 (35%), Positives = 25/48 (52%), Frame = +1 Query: 613 VKPGSEQAXSXKILSXNFKKVASXXKEKXCAPSXESMNDFXTSXLGKN 756 +K +++ S + L KK A +EK CA S ++ F S LGKN Sbjct: 106 IKDTTKEGYSFEELC--IKKEAIEGEEKFCAKSLGTVIGFAISKLGKN 151 Score = 53 (18.7 bits), Expect = 0.023, Sum P(3) = 0.023 Identities = 9/28 (32%), Positives = 16/28 (57%), Frame = +1 Query: 430 QLHDDPNVALFFLEKDLHPGTKLNLHFT 513 Q+ + FF ++DLHPG + + F+ Sbjct: 62 QIEETQYPKTFFYKEDLHPGKTMKVQFS 89 >gi|1762584|gb|AAB39546.1| (U63373) polygalacturonase isoenzyme 1 beta subunit homolog [Arabidopsis thaliana] Length = 626 Frame 1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | | 626 0 150 300 450 600 Plus Strand HSPs: Score = 109 (38.4 bits), Expect = 0.032, P = 0.031 Identities = 28/99 (28%), Positives = 43/99 (43%), Frame = +1 Query: 460 FFLEKDLHPGTKLNLHFTTSSNIQATFLPRQVADSIPFSSNKVEVVFNKFSVKPGSEQAX 639 FF E L GT + + + +FLPR + +PFS++K+ + F S Sbjct: 411 FFRESSLKEGTVIPMPDIKDKMPKRSFLPRSIITKLPFSTSKLGEIKRIFHAVENSTMGG 470 Query: 640 SXKILSXNFKKVASXXKEKXCAPSXESMNDFXTSXLGKN 756 ++ S + K C S E M DF TS LG++ Sbjct: 471 IITDAVTECERPPSVGETKRCVGSAEDMIDFATSVLGRS 509 >gi|7488153|pir||T01485 probable polygalacturonase (EC 3.2.1.15) 1 beta chain F17O7.9 - Arabidopsis thaliana >gi|3176680|gb|AAC18803.1| (AC003671) Identical to polygalacuronase isoenzyme 1 beta subunit homolog mRNA gb|U63373. EST gb|AA404878 comes from this gene. [Arabidopsis thaliana] Length = 626 Frame 1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | | 626 0 150 300 450 600 Plus Strand HSPs: Score = 109 (38.4 bits), Expect = 0.032, P = 0.031 Identities = 28/99 (28%), Positives = 43/99 (43%), Frame = +1 Query: 460 FFLEKDLHPGTKLNLHFTTSSNIQATFLPRQVADSIPFSSNKVEVVFNKFSVKPGSEQAX 639 FF E L GT + + + +FLPR + +PFS++K+ + F S Sbjct: 411 FFRESSLKEGTVIPMPDIKDKMPKRSFLPRSIITKLPFSTSKLGEIKRIFHAVENSTMGG 470 Query: 640 SXKILSXNFKKVASXXKEKXCAPSXESMNDFXTSXLGKN 756 ++ S + K C S E M DF TS LG++ Sbjct: 471 IITDAVTECERPPSVGETKRCVGSAEDMIDFATSVLGRS 509 >gi|1914851|gb|AAC53191.1| (U92454) WW domain binding protein 5; WBP5 [Mus musculus] Length = 61 Frame -3 hits (HSPs): __________________________ __________________________________________________ Database sequence: | | | | | 61 0 20 40 60 Minus Strand HSPs: Score = 84 (29.6 bits), Expect = 0.037, P = 0.036 Identities = 19/41 (46%), Positives = 22/41 (53%), Frame = -3 Query: 333 PPPDPFPPTLTLVPPPPFPACTFTPLPPTFTEVLXFSTQSG 211 PPP P PP PPPP P PLPP F E++ F +G Sbjct: 6 PPPPPPPPP----PPPPPP-----PLPPRFLEIIAFPYSAG 37 >gi|4056458|gb|AAC98031.1| (AC005990) Identical to gb|ATU59467 aromatic rich glycoprotein which is strongly similar to gb|U63373 polygalacturonase isozyme 1 from Arabidopsis thaliana. EST gb|AA395212 comes from this gene Length = 622 Frame 1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | | 622 0 150 300 450 600 Plus Strand HSPs: Score = 108 (38.0 bits), Expect = 0.041, P = 0.040 Identities = 27/99 (27%), Positives = 43/99 (43%), Frame = +1 Query: 460 FFLEKDLHPGTKLNLHFTTSSNIQATFLPRQVADSIPFSSNKVEVVFNKFSVKPGSEQAX 639 FF E L GT + + + +FLPR + +PFS++K+ + F S Sbjct: 407 FFRESMLKEGTLIWMPDIKDKMPKRSFLPRSIVSKLPFSTSKIAEIKRVFHANDNSTMEG 466 Query: 640 SXKILSXNFKKVASXXKEKXCAPSXESMNDFXTSXLGKN 756 ++ + + K C S E M DF TS LG++ Sbjct: 467 IITDAVRECERPPTVSETKRCVGSAEDMIDFATSVLGRS 505 >gi|1175115|sp|P41479|Y091_NPVAC HYPOTHETICAL 24.1 KD PROTEIN IN LEF4-P33 INTERGENIC REGION >gi|7460452|pir||D72861 AcOrf-91 protein - Autographa californica nuclear polyhedrosis virus >gi|559160|gb|AAA66721.1| (L22858) AcOrf-91 peptide [Autographa californica nucleopolyhedrovirus] Length = 224 Frame -3 hits (HSPs): ___________________ Annotated Domains: ____________________________________________ __________________________________________________ Database sequence: | | | | | | 224 0 50 100 150 200 __________________ Annotated Domains: DOMO DM00215: PROLINE-RICHPROTEIN3 30..111 PRODOM PD000540: H1(18) O76786(11) TONB(10) 35..137 PRODOM PD028890: Y091(2) O92451(1) 143..223 __________________ Minus Strand HSPs: Score = 101 (35.6 bits), Expect = 0.043, P = 0.042 Identities = 22/37 (59%), Positives = 23/37 (62%), Frame = -3 Query: 360 PL*PACTFTPPPDPFPPTLTLVPPP-PFPACTFTPLPP 250 P+ P T TPPP P PPT T PPP P P T TP PP Sbjct: 71 PIPPTPTPTPPPTPIPPTPTPTPPPSPIPP-TPTPSPP 107 Score = 100 (35.2 bits), Expect = 0.056, P = 0.055 Identities = 22/37 (59%), Positives = 23/37 (62%), Frame = -3 Query: 360 PL*PACTFTPPPDPFPPTLTLVPPP-PFPACTFTPLPP 250 P+ P T TPPP P PPT T PPP P P T TP PP Sbjct: 84 PIPPTPTPTPPPSPIPPTPTPSPPPSPIPP-TPTPSPP 120 Score = 96 (33.8 bits), Expect = 0.16, P = 0.15 Identities = 21/37 (56%), Positives = 23/37 (62%), Frame = -3 Query: 360 PL*PACTFTPPPDPFPPTLTLVPPP-PFPACTFTPLPP 250 P+ P T +PPP P PPT T PPP P P T TP PP Sbjct: 97 PIPPTPTPSPPPSPIPPTPTPSPPPSPIPP-TPTPSPP 133 Score = 96 (33.8 bits), Expect = 0.16, P = 0.15 Identities = 22/46 (47%), Positives = 26/46 (56%), Frame = -3 Query: 360 PL*PACTFTPPPDPFPPTLTLVPPP------PFPACTFTPLPPTFT 241 P+ P T TPPP P PPT T PPP P P+ +P+PPT T Sbjct: 84 PIPPTPTPTPPPSPIPPTPTPSPPPSPIPPTPTPSPPPSPIPPTPT 129 Score = 92 (32.4 bits), Expect = 0.48, P = 0.38 Identities = 19/29 (65%), Positives = 19/29 (65%), Frame = -3 Query: 336 TPPPDPFPPTLTLVPPP-PFPACTFTPLPP 250 TPPP P PPT T PPP P P T TP PP Sbjct: 66 TPPPTPIPPTPTPTPPPTPIPP-TPTPTPP 94 Score = 86 (30.3 bits), Expect = 2.4, P = 0.91 Identities = 20/37 (54%), Positives = 22/37 (59%), Frame = -3 Query: 351 PACTFTP-PPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 P T TP PP P PP T +PP P P TP+PPT T Sbjct: 54 PPPTPTPSPPSPTPPP-TPIPPTPTPTPPPTPIPPTPT 90 >gi|168457|gb|AAA33456.1| (M36913) cell wall protein (put.); putative [Zea mays] Length = 109 Frame -3 hits (HSPs): _______________________________________ __________________________________________________ Database sequence: | | | | | | | 109 0 20 40 60 80 100 Minus Strand HSPs: Score = 81 (28.5 bits), Expect = 0.078, P = 0.075 Identities = 17/35 (48%), Positives = 18/35 (51%), Frame = -3 Query: 351 PACTFTPPPDPFPPTLTLVPPPPFPAC-TFTPLPP 250 P T P P P PPT T P PP T+TP PP Sbjct: 63 PPATKPPTPKPTPPTYTPTPKPPATKPPTYTPTPP 97 Score = 77 (27.1 bits), Expect = 0.86, P = 0.58 Identities = 16/31 (51%), Positives = 17/31 (54%), Frame = -3 Query: 333 PPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 P P P PPT T P PP P P PPT+T Sbjct: 16 PTPKPTPPTYTPSPKPPTPK----PTPPTYT 42 Score = 77 (27.1 bits), Expect = 0.86, P = 0.58 Identities = 16/31 (51%), Positives = 17/31 (54%), Frame = -3 Query: 333 PPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 P P P PPT T P PP P P PPT+T Sbjct: 32 PTPKPTPPTYTPSPKPPTPK----PTPPTYT 58 Score = 72 (25.3 bits), Expect = 8.3, P = 1.0 Identities = 16/32 (50%), Positives = 17/32 (53%), Frame = -3 Query: 333 PPPDPFPPTLTLVPPPPFPAC-TFTPLPPTFT 241 P P P PPT T P PP T P PPT+T Sbjct: 48 PTPKPTPPTYTPSPKPPATKPPTPKPTPPTYT 79 >gi|462345|sp|P35310|HSP1_PONPY SPERM PROTAMINE P1 (CYSTEINE-RICH PROTAMINE) >gi|294041|gb|AAA36946.1| (L14589) protamine [Pongo pygmaeus] >gi|6979740|gb|AAF34623.1|AF215710_1 (AF215710) protamine 1 [Pongo pygmaeus] Length = 51 Frame 1 hits (HSPs): __________________________________________ Annotated Domains: ________________________________________________ __________________________________________________ Database sequence: | | | | 51 0 20 40 __________________ Annotated Domains: BLOCKS BL00048: Protamine P1 proteins. 1..27 PFAM protamine_P1: Protamine P1 1..49 PROSITE PROTAMINE_P1: Protamine P1 signature. 2..13 __________________ Plus Strand HSPs: Score = 79 (27.8 bits), Expect = 0.13, P = 0.12 Identities = 16/42 (38%), Positives = 23/42 (54%), Frame = +1 Query: 247 CWRQGRKRACRKRRRWHQCQRWWKRIRRRRERACRSQGESXC 372 C Q + R CR+R+R H+ +R + RRR R CR + C Sbjct: 7 CRSQSQSRCCRRRQRCHRRRRRCCQTRRRAMRCCRRRYRLRC 48 >gi|7299574|gb|AAF54759.1| (AE003695) CG14742 gene product [Drosophila melanogaster] Length = 74 Frame 2 hits (HSPs): __________________________________________ __________________________________________________ Database sequence: | | | | | 74 0 20 40 60 Plus Strand HSPs: Score = 77 (27.1 bits), Expect = 0.21, P = 0.19 Identities = 18/40 (45%), Positives = 20/40 (50%), Frame = +2 Query: 245 NVGGKGVNVHAGKGGGGTNVNVGGKGSGGGVNVHAGHKGK 364 N GG G G GGGG GG G GGG +G+K K Sbjct: 4 NKGGGGGGGGGGGGGGGGGNKKGGGGGGGGGGGGSGNKNK 43 Score = 63 (22.2 bits), Expect = 7.2, P = 1.0 Identities = 20/45 (44%), Positives = 21/45 (46%), Frame = +2 Query: 251 GG--KGVNVHAGKGGGGT-NVNVGGKGSGGGVN-VHAGHKGKAXA 373 GG KG G GGGG+ N N GG G GG G KA A Sbjct: 21 GGNKKGGGGGGGGGGGGSGNKNKGGDGGGGDKGGAKKGQDQKAAA 65 >gi|7305409 ref|NP_038665.1| protamine 1 >gi|123692|sp|P02319|HSP1_MOUSE SPERM PROTAMINE P1 (CYSTEINE-RICH PROTAMINE) >gi|2144756|pir||HSMSS1 protamine - mouse >gi|53789|emb|CAA30472.1| (X07625) protamine 1 [Mus musculus] >gi|53791|emb|CAA32169.1| (X14003) protamine 1 (AA 1-51) [Mus musculus] >gi|200489|gb|AAA39980.1| (K02926) protamine 1 [Mus musculus] >gi|200503|gb|AAA39985.1| (M27500) protamine [Mus musculus] >gi|1360005|emb|CAA87410.1| (Z47352) protamine 1 [Mus musculus] Length = 51 Frame 1 hits (HSPs): _____________________________________ __________________________________________________ Database sequence: | | | | 51 0 20 40 Plus Strand HSPs: Score = 77 (27.1 bits), Expect = 0.22, P = 0.20 Identities = 16/37 (43%), Positives = 21/37 (56%), Frame = +1 Query: 262 RKRACRKRRRWHQCQRWWKRIRRRRERACRSQGESXC 372 R R CR + R +C+R +R RRRR R CR + C Sbjct: 3 RYRCCRSKSR-SRCRRRRRRCRRRRRRCCRRRRRRCC 38 Score = 75 (26.4 bits), Expect = 0.36, P = 0.30 Identities = 17/35 (48%), Positives = 22/35 (62%), Frame = +1 Query: 247 CWRQGRKRACRKRRRWHQCQRWWKRI-RRRRERACR 351 C R + CR+RRR +C+R +R RRRR R CR Sbjct: 6 CCRSKSRSRCRRRRR--RCRRRRRRCCRRRRRRCCR 39 >gi|123695|sp|P10118|HSP1_RAT SPERM PROTAMINE P1 (CYSTEINE-RICH PROTAMINE) >gi|92658|pir||S03997 protamine 1 - rat >gi|1359528|emb|CAA87061.1| (Z46939) protamine 1 [Rattus norvegicus] Length = 51 Frame 1 hits (HSPs): _____________________________________ Annotated Domains: ________________________________________________ __________________________________________________ Database sequence: | | | | 51 0 20 40 __________________ Annotated Domains: PFAM protamine_P1: Protamine P1 1..49 PRODOM PD001830: HSP1(29) VE2(21) GAG(12) 7..45 PROSITE PROTAMINE_P1: Protamine P1 signature. 2..13 __________________ Plus Strand HSPs: Score = 77 (27.1 bits), Expect = 0.22, P = 0.20 Identities = 16/37 (43%), Positives = 21/37 (56%), Frame = +1 Query: 262 RKRACRKRRRWHQCQRWWKRIRRRRERACRSQGESXC 372 R R CR + R +C+R +R RRRR R CR + C Sbjct: 3 RYRCCRSKSR-SRCRRRRRRCRRRRRRCCRRRRRRCC 38 Score = 75 (26.4 bits), Expect = 0.36, P = 0.30 Identities = 17/35 (48%), Positives = 22/35 (62%), Frame = +1 Query: 247 CWRQGRKRACRKRRRWHQCQRWWKRI-RRRRERACR 351 C R + CR+RRR +C+R +R RRRR R CR Sbjct: 6 CCRSKSRSRCRRRRR--RCRRRRRRCCRRRRRRCCR 39 >gi|123696|sp|P24714|HSP1_SAGIM SPERM PROTAMINE P1 (CYSTEINE-RICH PROTAMINE) >gi|348569|pir||S22582 protamine 1 - Saguinus imperator >gi|4494091|emb|CAA43853.1| (X61678) protamine 1 [Saguinus imperator] Length = 50 Frame 1 hits (HSPs): ______________________________________________ Annotated Domains: ________________________________________________ __________________________________________________ Database sequence: | | | | 50 0 20 40 __________________ Annotated Domains: BLOCKS BL00048: Protamine P1 proteins. 1..27 PFAM protamine_P1: Protamine P1 1..48 PROSITE PROTAMINE_P1: Protamine P1 signature. 2..13 __________________ Plus Strand HSPs: Score = 77 (27.1 bits), Expect = 0.22, P = 0.20 Identities = 17/37 (45%), Positives = 21/37 (56%), Frame = +1 Query: 262 RKRACRKRRRWHQCQRWWKRIRRRRERACRSQGESXC 372 R R CR + R +C R +R RRRR R CR + S C Sbjct: 3 RYRCCRSQSR-SRCYRQRRRGRRRRRRTCRRRRASRC 38 Score = 67 (23.6 bits), Expect = 2.7, P = 0.93 Identities = 17/42 (40%), Positives = 22/42 (52%), Frame = +1 Query: 247 CWRQGRKRACRKRRRWHQCQRWWKRIRRRRERACRSQGESXC 372 C Q R R R+RRR + +R R RRR R CR + + C Sbjct: 7 CRSQSRSRCYRQRRRGRRRRRRTCR-RRRASRCCRRRYKLTC 47 Score = 64 (22.5 bits), Expect = 5.7, P = 1.0 Identities = 16/37 (43%), Positives = 23/37 (62%), Frame = +1 Query: 241 SECWRQGRKRACRKRRRWHQCQRWWKRIRRRRERACR 351 S C+RQ R+R R+RRR + +R + RRR + CR Sbjct: 13 SRCYRQ-RRRGRRRRRRTCRRRRASRCCRRRYKLTCR 48 >gi|100216|pir||S14975 extensin class II (clone uJ-2) - tomato >gi|1345538|emb|CAA39216.1| (X55686) extensin (class II) [Lycopersicon esculentum] Length = 82 Frame -3 hits (HSPs): __________________________________ Annotated Domains: ____________________________________________ __________________________________________________ Database sequence: | | | | | | 82 0 20 40 60 80 __________________ Annotated Domains: DOMO DM05291: 12..82 __________________ Minus Strand HSPs: Score = 76 (26.8 bits), Expect = 0.28, P = 0.24 Identities = 16/29 (55%), Positives = 16/29 (55%), Frame = -3 Query: 333 PPPDPFPPTLTLV-PPPPFPACTFTPLPP 250 PPP P PP T PPPP P PLPP Sbjct: 37 PPPSPSPPPPTYSSPPPPPPFYENIPLPP 65 Score = 74 (26.0 bits), Expect = 0.46, P = 0.37 Identities = 18/38 (47%), Positives = 23/38 (60%), Frame = -3 Query: 351 PACTFT--PPPDPFPPTLTLV---PPPPFPACTFTPLPPTFT 241 P T++ PPP P PP T PPPP P+ P PPT++ Sbjct: 12 PPYTYSSPPPPSPSPPPPTYYYSSPPPPSPS----PPPPTYS 49 Score = 70 (24.6 bits), Expect = 1.3, P = 0.71 Identities = 20/48 (41%), Positives = 25/48 (52%), Frame = -3 Query: 351 PACTFT--PPPDPFPPTLTLV---PPPPFPAC---TFT--PLPPTFTE 238 P T++ PPP P PP T PPPP P+ T++ P PP F E Sbjct: 12 PPYTYSSPPPPSPSPPPPTYYYSSPPPPSPSPPPPTYSSPPPPPPFYE 59 >gi|1762428|gb|AAB39538.1| (U59467) aromatic rich glycoprotein JP630 [Arabidopsis thaliana] Length = 622 Frame 1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | | 622 0 150 300 450 600 Plus Strand HSPs: Score = 100 (35.2 bits), Expect = 0.32, P = 0.27 Identities = 26/99 (26%), Positives = 42/99 (42%), Frame = +1 Query: 460 FFLEKDLHPGTKLNLHFTTSSNIQATFLPRQVADSIPFSSNKVEVVFNKFSVKPGSEQAX 639 FF E L GT + + + +FLP + +PFS++K+ + F S Sbjct: 407 FFRESMLKEGTLIWMPDIKDKMPKRSFLPCSIVSKLPFSTSKIAEIKRVFHANDNSTMEG 466 Query: 640 SXKILSXNFKKVASXXKEKXCAPSXESMNDFXTSXLGKN 756 ++ + + K C S E M DF TS LG++ Sbjct: 467 IITDAVRECERPPTVSETKRCVGSAEDMIDFATSVLGRS 505 >gi|82698|pir||JQ0985 hydroxyproline-rich glycoprotein precursor - maize >gi|257041|gb|AAB23539.1| (S45164) hydroxyproline-rich glycoprotein, HRGP [maize, Peptide, 328 aa] [Zea mays] >gi|4007865|emb|CAA10387.1| (AJ131535) Hydroxyproline-rich Glycoprotein (HRGP) [Zea mays] Length = 328 Frame -3 hits (HSPs): ___________ ______ Annotated Domains: ____ __________________________________________________ Database sequence: | | | | 328 0 150 300 __________________ Annotated Domains: Entrez domain: signal sequence 1..26 __________________ Minus Strand HSPs: Score = 94 (33.1 bits), Expect = 0.59, P = 0.44 Identities = 21/41 (51%), Positives = 22/41 (53%), Frame = -3 Query: 360 PL*PACTFTP-PPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 P P T TP PP P PPT T P P P TP PPT+T Sbjct: 90 PTPPTYTPTPTPPKPTPPTYTPAPTPHKPTPKPTPTPPTYT 130 Score = 88 (31.0 bits), Expect = 2.8, P = 0.94 Identities = 17/31 (54%), Positives = 19/31 (61%), Frame = -3 Query: 342 TFTPPPDPFPPTLTLVPPPPFPAC-TFTPLP 253 T+TP P P PPT T P PP P T+TP P Sbjct: 83 TYTPSPKPTPPTYTPTPTPPKPTPPTYTPAP 113 Score = 84 (29.6 bits), Expect = 7.9, P = 1.0 Identities = 17/32 (53%), Positives = 18/32 (56%), Frame = -3 Query: 336 TPPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 TP P P PPT T P PP P P PPT+T Sbjct: 119 TPKPTPTPPTYTPTPKPPTPK----PTPPTYT 146 Score = 84 (29.6 bits), Expect = 7.9, P = 1.0 Identities = 18/37 (48%), Positives = 19/37 (51%), Frame = -3 Query: 351 PACTFTPPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 P T P P P PPT T P PP P P PPT+T Sbjct: 183 PPATKPPTPKPTPPTYTPSPKPPTPK----PTPPTYT 215 >gi|7008009|dbj|BAA90877.1| (AB031227) PsAD1 [Pisum sativum] Length = 87 Frame 2 hits (HSPs): __________________________ __________________________________________________ Database sequence: | | | | | | 87 0 20 40 60 80 Plus Strand HSPs: Score = 73 (25.7 bits), Expect = 0.59, P = 0.44 Identities = 18/40 (45%), Positives = 20/40 (50%), Frame = +2 Query: 248 VGGKGVNVHAGKGGGGTNVNVGGKGSGGGVNVHAGHKGKA 367 +GGKG + KGGGG G KG GGG G G A Sbjct: 1 MGGKGGSGGGAKGGGGGG---GAKGGGGGGGAKGGSGGGA 37 Score = 65 (22.9 bits), Expect = 4.4, P = 0.99 Identities = 15/30 (50%), Positives = 15/30 (50%), Frame = +2 Query: 251 GGKGVNVHAGKGGGGTNVNVGG--KGSGGG 334 GG G G GGGG GG KG GGG Sbjct: 14 GGGGGGAKGGGGGGGAKGGSGGGAKGGGGG 43 Score = 64 (22.5 bits), Expect = 5.6, P = 1.0 Identities = 17/41 (41%), Positives = 17/41 (41%), Frame = +2 Query: 251 GGKGVNVHAGKGGGGTNVNVGGKG----SGGGVNVHAGHKG 361 GG G G GGGG GG G SGGG G G Sbjct: 5 GGSGGGAKGGGGGGGAKGGGGGGGAKGGSGGGAKGGGGGSG 45 >gi|7290578|gb|AAF46028.1| (AE003434) CG12732 gene product [Drosophila melanogaster] Length = 153 Frame -3 hits (HSPs): ___________ __________________________________________________ Database sequence: | | | | | 153 0 50 100 150 Minus Strand HSPs: Score = 86 (30.3 bits), Expect = 0.80, P = 0.55 Identities = 17/33 (51%), Positives = 20/33 (60%), Frame = -3 Query: 345 CTFTPPPDPFPPTLTLVPP-PPFPACT-FTPLPP 250 C TPPP P PP + ++PP PP P PLPP Sbjct: 20 CHATPPPPPMPP-MPVIPPLPPMPVIPPIPPLPP 52 >gi|283045|pir||S28264 hydroxyproline-rich glycoprotein - maize >gi|22333|emb|CAA44844.1| (X63134) hydroxyproline-rich glycoprotein [Zea mays] >gi|228936|prf||1814452A Hyp-rich glycoprotein [Zea mays] Length = 303 Frame -3 hits (HSPs): _________ ______ _______ __________________________________________________ Database sequence: | | | | | | || 303 0 50 100 150 200 250 300 Minus Strand HSPs: Score = 92 (32.4 bits), Expect = 0.87, P = 0.58 Identities = 21/42 (50%), Positives = 22/42 (52%), Frame = -3 Query: 360 PL*PACTFTPPPDP--FPPTLTLVPPPPFPACTFTPLPPTFT 241 P P T TPPP P PPT T P P P TP PPT+T Sbjct: 94 PTPPTYTPTPPPTPKPTPPTYTPAPTPHKPTPKPTPTPPTYT 135 Score = 87 (30.6 bits), Expect = 3.2, P = 0.96 Identities = 18/32 (56%), Positives = 20/32 (62%), Frame = -3 Query: 342 TFTPPPDPFPPTLTLVPPP-PFPAC-TFTPLP 253 T+TP P P PPT T PPP P P T+TP P Sbjct: 87 TYTPSPKPTPPTYTPTPPPTPKPTPPTYTPAP 118 Score = 84 (29.6 bits), Expect = 7.0, P = 1.0 Identities = 18/37 (48%), Positives = 19/37 (51%), Frame = -3 Query: 351 PACTFTPPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 P T P P P PPT T P PP P P PPT+T Sbjct: 156 PPATKPPTPKPTPPTYTPSPKPPTPK----PTPPTYT 188 Score = 83 (29.2 bits), Expect = 9.0, P = 1.0 Identities = 18/37 (48%), Positives = 19/37 (51%), Frame = -3 Query: 351 PACTFTPPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 P T P P P PPT T P PP P P PPT+T Sbjct: 225 PPATKPPTPKPTPPTYTPSPKPPTPK----PSPPTYT 257 >gi|7292503|gb|AAF47906.1| (AE003481) CG15023 gene product [Drosophila melanogaster] Length = 172 Frame -3 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | | 172 0 50 100 150 Minus Strand HSPs: Score = 87 (30.6 bits), Expect = 0.93, P = 0.61 Identities = 17/37 (45%), Positives = 19/37 (51%), Frame = -3 Query: 351 PACTFTPPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 P T+ PPP P T T P P PA T+ P PP T Sbjct: 68 PVPTYLPPPPPTTTTTTTTTPAPTPAPTYLPPPPPTT 104 Score = 82 (28.9 bits), Expect = 3.6, P = 0.97 Identities = 17/34 (50%), Positives = 19/34 (55%), Frame = -3 Query: 351 PACTFTPPPDPFPPTLTLVPPPPFPACTFTPLPP 250 PA T+ PPP P T T P P PA T+ P PP Sbjct: 92 PAPTYLPPPPPTTTTTTTTPAPT-PAPTYLPPPP 124 >gi|3025832|gb|AAC12760.1| (AF055985) pyrrolidone-rich antigen [Onchocerca volvulus] Length = 93 Frame -3 hits (HSPs): _____________________ __________________________________________________ Database sequence: | | | | | | 93 0 20 40 60 80 Minus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.97, P = 0.62 Identities = 17/39 (43%), Positives = 19/39 (48%), Frame = -3 Query: 333 PPPDPF-----PPTL-TLVPPPPFPACTFTPLPPTFTEV 235 PPP P PPT T PPPP P F +PP F + Sbjct: 11 PPPPPITPGIRPPTTPTPTPPPPPPPRGFPRIPPPFPPI 49 >gi|7296368|gb|AAF51657.1| (AE003592) CG11458 gene product [Drosophila melanogaster] Length = 98 Frame 2 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | | | 98 0 20 40 60 80 Plus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.97, P = 0.62 Identities = 16/34 (47%), Positives = 17/34 (50%), Frame = +2 Query: 251 GGKGVNVHAGKGGGGTNVNVGGKGSGGGVNVHAG 352 GG G H G GGG + GG G GGG H G Sbjct: 63 GGGGGGKHGGGNGGGGK-HGGGGGGGGGGGKHGG 95 >gi|123699|sp|P15342|HSP2_HORSE SPERM HISTONE P2A (ST2A) >gi|108203|pir||S10754 protamine St2a - horse Length = 62 Frame 1 hits (HSPs): ___________________________________________ Annotated Domains: ___________________________________________ __________________________________________________ Database sequence: | | | | | 62 0 20 40 60 __________________ Annotated Domains: PRODOM PD001830: HSP1(29) VE2(21) GAG(12) 9..61 __________________ Plus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.99, P = 0.63 Identities = 14/34 (41%), Positives = 21/34 (61%), Frame = +1 Query: 250 WRQGRKRACRKRRRWHQCQRWWKRIRRRRERACR 351 +R+ R+R C RR + +R ++ RRRR R CR Sbjct: 8 YRRYRRRCCSPRRLYRLRRRRYRSSRRRRRRPCR 41 Score = 71 (25.0 bits), Expect = 0.99, P = 0.63 Identities = 18/42 (42%), Positives = 25/42 (59%), Frame = +1 Query: 226 KXKYLSECWRQGRKRACRKRRRWHQCQRWWKRIRRRRERACR 351 + +Y S R+ R+R CR+RR C +R+RRRR R CR Sbjct: 26 RRRYRSS--RRRRRRPCRRRRHRRVC----RRVRRRR-RCCR 60 >gi|123697|sp|P04102|HSP1_SHEEP SPERM PROTAMINE P1 (CYSTEINE-RICH PROTAMINE) >gi|70787|pir||HSSH sperm histone - sheep Length = 50 Frame 1 hits (HSPs): ___________________________________ Annotated Domains: _________________________________________________ __________________________________________________ Database sequence: | | | | 50 0 20 40 __________________ Annotated Domains: BLOCKS BL00048: Protamine P1 proteins. 1..27 PFAM protamine_P1: Protamine P1 1..49 PROSITE PROTAMINE_P1: Protamine P1 signature. 1..12 __________________ Plus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.99, P = 0.63 Identities = 17/34 (50%), Positives = 21/34 (61%), Frame = +1 Query: 247 CWRQGRKRACRKRRRWHQCQRWWKRI-RRRRERAC 348 C R R CR+RRR +C+R +R RRRR R C Sbjct: 6 CLTHSRSR-CRRRRR-RRCRRRRRRFGRRRRRRVC 38 Score = 67 (23.6 bits), Expect = 2.7, P = 0.93 Identities = 17/35 (48%), Positives = 21/35 (60%), Frame = +1 Query: 247 CWRQGRKRACRKRRRWHQCQRWWKRIRRRRER--ACR 351 C R R CR+RRR +C+R +R RRR R CR Sbjct: 6 CLTHSRSR-CRRRRR-RRCRRRRRRFGRRRRRRVCCR 40 >gi|123694|sp|P10119|HSP1_RABIT SPERM PROTAMINE P1 (CYSTEINE-RICH PROTAMINE) >gi|90132|pir||S02007 protamine I - rabbit Length = 49 Frame 1 hits (HSPs): ____________________________________ Annotated Domains: _________________________________________________ __________________________________________________ Database sequence: | | | | 49 0 20 40 __________________ Annotated Domains: BLOCKS BL00048: Protamine P1 proteins. 1..27 PFAM protamine_P1: Protamine P1 1..48 PROSITE PROTAMINE_P1: Protamine P1 signature. 1..12 __________________ Plus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.99, P = 0.63 Identities = 15/37 (40%), Positives = 21/37 (56%), Frame = +1 Query: 262 RKRACRKRRRWHQCQRWWKRIRRRRERACRSQGESXC 372 R R CR + R +C+R +R RRRR R C+ + C Sbjct: 2 RYRCCRSQSR-SRCRRRRRRCRRRRRRCCQRRRVRKC 37 Score = 65 (22.9 bits), Expect = 4.4, P = 0.99 Identities = 17/34 (50%), Positives = 22/34 (64%), Frame = +1 Query: 247 CWRQGRKRACRKRRRWHQCQRWWKRI-RRRRERAC 348 C Q R R CR+RRR +C+R +R +RRR R C Sbjct: 6 CRSQSRSR-CRRRRR--RCRRRRRRCCQRRRVRKC 37 >gi|123704|sp|P15343|HSP3_HORSE SPERM HISTONE P2B (ST2B) >gi|108204|pir||S10755 protamine St2b - horse Length = 58 Frame 1 hits (HSPs): __________________________________________ Annotated Domains: ___________________________________________ __________________________________________________ Database sequence: | | | | 58 0 20 40 __________________ Annotated Domains: PRODOM PD001830: HSP1(29) VE2(21) GAG(12) 3..52 __________________ Plus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.99, P = 0.63 Identities = 18/42 (42%), Positives = 25/42 (59%), Frame = +1 Query: 226 KXKYLSECWRQGRKRACRKRRRWHQCQRWWKRIRRRRERACR 351 + +Y S R+ R+R CR+RR C +R+RRRR R CR Sbjct: 22 RRRYRSS--RRRRRRPCRRRRHRRVC----RRVRRRR-RCCR 56 Score = 65 (22.9 bits), Expect = 4.4, P = 0.99 Identities = 13/30 (43%), Positives = 18/30 (60%), Frame = +1 Query: 262 RKRACRKRRRWHQCQRWWKRIRRRRERACR 351 R+R C RR + +R ++ RRRR R CR Sbjct: 8 RRRRCSPRRLYRLRRRRYRSSRRRRRRPCR 37 >gi|108389|pir||S21673 protamine 2 - pig Length = 91 Frame 1 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | | | 91 0 20 40 60 80 Plus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.99, P = 0.63 Identities = 18/31 (58%), Positives = 22/31 (70%), Frame = +1 Query: 262 RKRACRKRRRWHQCQ-RWWKR-IRR-RRERACR 351 R+R+CR RRR C+ RW +R RR RR R CR Sbjct: 58 RRRSCRSRRR--ACRHRWHRRGCRRIRRRRRCR 88 >gi|462336|sp|P35302|HSP1_ALOSE SPERM PROTAMINE P1 >gi|289128|gb|AAA51404.1| (L14592) protamine [Alouatta seniculus] Length = 52 Frame 1 hits (HSPs): _____________________________________ Annotated Domains: ________________________________________________ __________________________________________________ Database sequence: | | | | 52 0 20 40 __________________ Annotated Domains: BLOCKS BL00048: Protamine P1 proteins. 1..27 PFAM protamine_P1: Protamine P1 1..50 PROSITE PROTAMINE_P1: Protamine P1 signature. 2..13 __________________ Plus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.99, P = 0.63 Identities = 17/38 (44%), Positives = 19/38 (50%), Frame = +1 Query: 262 RKRACRKRRRWH-QCQRWWKRIRRRRERACRSQGESXC 372 R R CR R +C R R RRRR R+CR S C Sbjct: 3 RYRCCRSRSLSRSRCYRQRPRCRRRRRRSCRRPRASRC 40 >gi|5306259|gb|AAD41991.1|AC006233_18 (AC006233) hypothetical protein [Arabidopsis thaliana] Length = 134 Frame -3 hits (HSPs): ______________ __________________________________________________ Database sequence: | | | | 134 0 50 100 Minus Strand HSPs: Score = 83 (29.2 bits), Expect = 0.99, P = 0.63 Identities = 21/38 (55%), Positives = 24/38 (63%), Frame = -3 Query: 363 FPL*PACTFTPPPDPFPPTLTLVP--PPPFPACTFTPLPP 250 FPL P +PPP P PP+ + P PPPFPA F P PP Sbjct: 34 FPLLP---LSPPPSP-PPSPSSPPRLPPPFPAL-FPPEPP 68 >gi|228938|prf||1814452C Hyp-rich glycoprotein [Zea diploperennis] Length = 349 Frame -3 hits (HSPs): ________ _______________ __________________________________________________ Database sequence: | | | | 349 0 150 300 Minus Strand HSPs: Score = 92 (32.4 bits), Expect = 1.1, P = 0.66 Identities = 18/34 (52%), Positives = 20/34 (58%), Frame = -3 Query: 342 TFTPPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 T+TP P P PPT T P PP P P PPT+T Sbjct: 83 TYTPSPKPTPPTYTPTPTPPTPK----PTPPTYT 112 Score = 90 (31.7 bits), Expect = 1.8, P = 0.84 Identities = 18/34 (52%), Positives = 20/34 (58%), Frame = -3 Query: 342 TFTPPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 T+TP P P PPT T P PP P P PPT+T Sbjct: 184 TYTPSPKPTPPTYTPSPKPPTPK----PTPPTYT 213 Score = 88 (31.0 bits), Expect = 3.1, P = 0.95 Identities = 20/36 (55%), Positives = 21/36 (58%), Frame = -3 Query: 342 TFTPP-PDPFPPTLTLVPPP-PFPACTFTPLPPTFT 241 T TPP P P PPT T P P P P TP PPT+T Sbjct: 98 TPTPPTPKPTPPTYTPAPTPKPTPTPKPTPTPPTYT 133 Score = 84 (29.6 bits), Expect = 8.6, P = 1.0 Identities = 18/37 (48%), Positives = 19/37 (51%), Frame = -3 Query: 351 PACTFTPPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 P T P P P PPT T P PP P P PPT+T Sbjct: 154 PPATKPPTPKPTPPTYTPSPKPPTPK----PTPPTYT 186 Score = 84 (29.6 bits), Expect = 8.6, P = 1.0 Identities = 18/37 (48%), Positives = 19/37 (51%), Frame = -3 Query: 351 PACTFTPPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 P T P P P PPT T P PP P P PPT+T Sbjct: 218 PPATKPPTPKPTPPTYTPSPKPPTPK----PTPPTYT 250 >gi|283032|pir||S22456 hydroxyproline-rich glycoprotein - perennial teosinte >gi|22092|emb|CAA45514.1| (X64173) hydroxyproline-rich glycoprotein [Zea diploperennis] Length = 350 Frame -3 hits (HSPs): _________ ______________ __________________________________________________ Database sequence: | | | | 350 0 150 300 Minus Strand HSPs: Score = 92 (32.4 bits), Expect = 1.1, P = 0.67 Identities = 18/34 (52%), Positives = 20/34 (58%), Frame = -3 Query: 342 TFTPPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 T+TP P P PPT T P PP P P PPT+T Sbjct: 83 TYTPSPKPTPPTYTPTPTPPTPK----PTPPTYT 112 Score = 90 (31.7 bits), Expect = 1.8, P = 0.84 Identities = 18/34 (52%), Positives = 20/34 (58%), Frame = -3 Query: 342 TFTPPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 T+TP P P PPT T P PP P P PPT+T Sbjct: 185 TYTPSPKPTPPTYTPSPKPPTPK----PTPPTYT 214 Score = 86 (30.3 bits), Expect = 5.2, P = 0.99 Identities = 20/37 (54%), Positives = 21/37 (56%), Frame = -3 Query: 342 TFTPP-PDPFPPTLTLVPPP--PFPACTFTPLPPTFT 241 T TPP P P PPT T P P P P TP PPT+T Sbjct: 98 TPTPPTPKPTPPTYTPAPTPHKPTPTPKPTPTPPTYT 134 Score = 84 (29.6 bits), Expect = 8.7, P = 1.0 Identities = 18/37 (48%), Positives = 19/37 (51%), Frame = -3 Query: 351 PACTFTPPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 P T P P P PPT T P PP P P PPT+T Sbjct: 155 PPATKPPTPKPTPPTYTPSPKPPTPK----PTPPTYT 187 Score = 84 (29.6 bits), Expect = 8.7, P = 1.0 Identities = 18/37 (48%), Positives = 19/37 (51%), Frame = -3 Query: 351 PACTFTPPPDPFPPTLTLVPPPPFPACTFTPLPPTFT 241 P T P P P PPT T P PP P P PPT+T Sbjct: 219 PPATKPPTPKPTPPTYTPSPKPPTPK----PTPPTYT 251 >gi|5902409|gb|AAD55511.1|AC008148_21 (AC008148) Unknown protein [Arabidopsis thaliana] Length = 138 Frame -3 hits (HSPs): ___________ __________________________________________________ Database sequence: | | | | 138 0 50 100 Minus Strand HSPs: Score = 83 (29.2 bits), Expect = 1.2, P = 0.69 Identities = 17/35 (48%), Positives = 19/35 (54%), Frame = -3 Query: 351 PACTFTPPPDPFPPTLTLVPPPPFPACTFTPLPPT 247 P PPP P PP+ PPPP AC PLPP+ Sbjct: 48 PCLQNIPPPSPPPPS----PPPPSQACPPPPLPPS 78 >gi|1076719|pir||S53051 glycine rich protein - barley (fragment) >gi|728596|emb|CAA88559.1| (Z48625) glycine rich protein [Hordeum vulgare] Length = 64 Frame 2 hits (HSPs): _________________________ __________________________________________________ Database sequence: | | | | | 64 0 20 40 60 Plus Strand HSPs: Score = 70 (24.6 bits), Expect = 1.3, P = 0.71 Identities = 18/38 (47%), Positives = 21/38 (55%), Frame = +2 Query: 272 HAGKGGGGTNVNVGGKGSGGGVNVHAGHKGKAXACFCW 385 H GKGGGG + GG G GGG + GH G + C W Sbjct: 3 HGGKGGGGYPGH-GG-GGGGG---YPGHGGGSSGCH-W 34 >gi|462341|sp|P35306|HSP1_HYLLA SPERM PROTAMINE P1 (CYSTEINE-RICH PROTAMINE) >gi|290888|gb|AAA51527.1| (L14588) protamine [Hylobates lar] Length = 51 Frame 1 hits (HSPs): ________________________________________________ Annotated Domains: ________________________________________________ __________________________________________________ Database sequence: | | | | 51 0 20 40 __________________ Annotated Domains: BLOCKS BL00048: Protamine P1 proteins. 1..27 PFAM protamine_P1: Protamine P1 1..49 PROSITE PROTAMINE_P1: Protamine P1 signature. 2..13 __________________ Plus Strand HSPs: Score = 70 (24.6 bits), Expect = 1.3, P = 0.72 Identities = 17/38 (44%), Positives = 24/38 (63%), Frame = +1 Query: 241 SECWRQG------RKRACRKRRRWHQCQRWWKRIRRRR 336 S C+R+G R+R+C+ RRR +C R R+RRRR Sbjct: 13 SRCYRRGQRSRRRRRRSCQTRRRAMRCCRPRYRLRRRR 50 Score = 68 (23.9 bits), Expect = 2.1, P = 0.88 Identities = 14/32 (43%), Positives = 21/32 (65%), Frame = +1 Query: 262 RKRACRKRRRWHQCQRWWKRIRRRRERACRSQ 357 R R CR + R +C R +R RRRR R+C+++ Sbjct: 3 RYRCCRSQSR-SRCYRRGQRSRRRRRRSCQTR 33 >gi|462337|sp|P35304|HSP1_CAVPO SPERM PROTAMINE P1 (CYSTEINE-RICH PROTAMINE) >gi|348278|pir||S29972 protamine 1 - guinea pig >gi|7439893|pir||S29973 protamine 1 - guinea pig >gi|49562|emb|CAA77644.1| (Z11545) protamine 1 [Cavia porcellus] >gi|49564|emb|CAA77643.1| (Z11544) protamine 1 [Cavia porcellus] >gi|607016|gb|AAA58349.1| (M83896) protamine 1 [Cavia porcellus] Length = 48 Frame 1 hits (HSPs): _________________________________________ Annotated Domains: ________________________________________________ __________________________________________________ Database sequence: | | | | 48 0 20 40 __________________ Annotated Domains: BLOCKS BL00048: Protamine P1 proteins. 1..27 PFAM protamine_P1: Protamine P1 1..46 PROSITE PROTAMINE_P1: Protamine P1 signature. 2..13 __________________ Plus Strand HSPs: Score = 70 (24.6 bits), Expect = 1.3, P = 0.72 Identities = 16/42 (38%), Positives = 21/42 (50%), Frame = +1 Query: 247 CWRQGRKRACRKRRRWHQCQRWWKRIRRRRERACRSQGESXC 372 C R + CR+RRR +R +R RRR R CR + C Sbjct: 6 CCRSPSRSRCRRRRRRFYRRR--RRCHRRRRRCCRRRYTRRC 45 WARNING: HSPs involving 74 database sequences were not reported due to the limiting value of parameter B = 50. Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.93 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.348 0.152 0.529 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.329 0.145 0.486 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.326 0.136 0.436 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.350 0.156 0.544 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.353 0.153 0.606 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.340 0.154 0.518 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 304 277 10. 78 3 12 22 0.098 36 33 0.12 39 +2 0 304 277 10. 78 3 12 22 0.098 36 33 0.12 39 +1 0 304 280 10. 78 3 12 22 0.099 36 33 0.12 39 -1 0 304 275 10. 78 3 12 22 0.097 36 33 0.12 39 -2 0 304 275 10. 78 3 12 22 0.097 36 33 0.12 39 -3 0 304 277 10. 78 3 12 22 0.098 36 33 0.12 39 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 8:50 PM CDT May 27, 2000 Format: BLAST # of letters in database: 158,518,215 # of sequences in database: 505,245 # of database sequences satisfying E: 124 No. of states in DFA: 600 (59 KB) Total size of DFA: 286 KB (320 KB) Time to generate neighborhood: 0.04u 0.00s 0.04t Elapsed: 00:00:00 No. of threads or processors used: 4 Search cpu time: 383.02u 2.15s 385.17t Elapsed: 00:03:25 Total cpu time: 383.15u 2.23s 385.38t Elapsed: 00:03:26 Start: Mon Oct 16 17:18:40 2000 End: Mon Oct 16 17:22:06 2000 WARNINGS ISSUED: 2
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000