WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker Server unavailable.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= B04B06.seq(1>329); (295 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 24 Sequences : less than 24 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 8411 1418 |=========================================================== 6310 6993 1041 |=========================================== 3980 5952 959 |======================================= 2510 4993 864 |==================================== 1580 4129 683 |============================ 1000 3446 647 |========================== 631 2799 489 |==================== 398 2310 389 |================ 251 1921 320 |============= 158 1601 257 |========== 100 1344 231 |========= 63.1 1113 190 |======= 39.8 923 126 |===== 25.1 797 113 |==== 15.8 684 104 |==== >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 580 <<<<<<<<<<<<<<<<< 10.0 580 67 |== 6.31 513 78 |=== 3.98 435 102 |==== 2.51 333 66 |== 1.58 267 69 |== 1.00 198 39 |= 0.63 159 49 |== 0.40 110 26 |= 0.25 84 18 |: 0.16 66 14 |: 0.10 52 9 |: 0.063 43 4 |: 0.040 39 10 |: 0.025 29 5 |: 0.016 24 4 |: 0.010 20 4 |: 0.0063 16 2 |: 0.0040 14 4 |: 0.0025 10 0 | 0.0016 10 5 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|9279651|dbj|BAB01151.1|(AP000373) flavonol 3-O-glu... +1 153 3.5e-09 1 gi|7294399|gb|AAF49745.1|(AE003535) CG13482 gene prod... +3 104 7.1e-05 1 gi|11071236|emb|CAC14290.1|(AJ297853) bicoid protein ... +3 104 0.00069 1 gi|11071238|emb|CAC14291.1|(AJ297854) bicoid protein ... +3 104 0.00069 1 gi|11078691|emb|CAC14293.1|(AJ297850) bicoid protein ... +3 104 0.00069 1 gi|544411|sp|Q06885|GP10_DICDIGLYCOPROTEIN GP100 PREC... +2 92 0.0011 2 gi|11258169|pir||T45603glucosyltransferase-like prote... +1 87 0.0011 2 gi|7512984|pir||T00065hypothetical protein KIAA0442 -... +3 84 0.0013 2 gi|8885608|dbj|BAA97538.1|(AB028606) UDP-glucose:anth... +1 84 0.0014 2 gi|8885563|dbj|BAA97493.1|(AB025604) UDP-glycose:flav... +1 82 0.0015 2 gi|3582342|gb|AAC35239.1|(AC005496) putative flavonol... +1 98 0.0031 1 gi|7300213|gb|AAF55377.1|(AE003716) CG5225 gene produ... +3 70 0.0031 2 gi|7267545|emb|CAB78027.1|(AL161513) arabinogalactan-... +2 88 0.0035 1 gi|7433902|pir||T01732UTP-glucose glucosyltransferase... +1 97 0.0039 1 gi|7267604|emb|CAB80916.1|(AL161491) putative flavono... +1 97 0.0041 1 gi|12718478|emb|CAC28807.1|(AL513466) hypothetical pr... +3 77 0.0054 2 gi|7940276|gb|AAF70835.1|AC003113_2(AC003113) F24O1.6... +2 50 0.0075 2 gi|9630030ref|NP_046248.1| unknown [Orgyia pseudotsug... +2 91 0.0078 1 gi|2134202|pir||I51171transcription factor RcC/EPB-1 ... +3 91 0.0085 1 gi|688080|gb|AAB31576.1|lectin=chitin-binding protein... +2 59 0.0096 2 gi|11061709|emb|CAC14289.1|(AJ297856) bicoid protein ... +3 88 0.011 1 gi|7518953|pir||A71099hypothetical protein PH1053 - P... +2 83 0.012 1 gi|1708075|sp|P54583|GUN1_ACICEENDOGLUCANASE E1 PRECU... +2 81 0.013 2 gi|2232354|gb|AAB62270.1|(AF006081) UDPG glucosyltran... +1 92 0.013 1 gi|100212|pir||S14976extensin class II (clones u1/u2)... +2 81 0.019 1 gi|806720|gb|AAA66362.1|(U13066) arabinogalactan-prot... +2 81 0.019 1 gi|11359668|pir||T48707related to regulatory protein ... +3 94 0.020 1 gi|3582343|gb|AAC35240.1|(AC005496) putative flavonol... +1 90 0.022 1 gi|6103625|gb|AAF03693.1|(AF172095) unknown [Picea ru... +2 82 0.024 1 gi|1079316|pir||A56554transcription factor C/EBP - Af... +3 87 0.025 1 gi|322757|pir||PQ0476pistil extensin-like protein (cl... +2 81 0.025 1 gi|7298314|gb|AAF53543.1|(AE003651) CG13260 gene prod... +3 91 0.026 1 gi|19923|emb|CAA78394.1|(Z14016) pistil extensin like... +2 81 0.026 1 gi|4314356|gb|AAD15567.1|(AC006340) putative anthocya... +1 75 0.028 2 gi|10567858|gb|AAG18592.1|AC067971_33(AC067971) Conta... +1 89 0.029 1 gi|9665137|gb|AAF97321.1|AC023628_2(AC023628) Similar... +1 89 0.029 1 gi|8778962|gb|AAD49768.2|AC007932_16(AC007932) F11A17... +2 77 0.033 2 gi|5815436|gb|AAD52672.1|AF178772_1(AF178772) 98kDa H... +2 89 0.035 1 gi|544262|sp|Q03211|EXLP_TOBACPISTIL-SPECIFIC EXTENSI... +2 73 0.036 2 gi|7299505|gb|AAF54693.1|(AE003692) CG6923 gene produ... +2 78 0.048 2 gi|11358911|pir||T48374UDPG glucosyltransferase-like ... +1 86 0.058 1 gi|7508913|pir||T33997hypothetical protein W03G1.5 - ... +3 86 0.059 1 gi|2133677|pir||S62335I71-7 protein - fruit fly (Dros... +2 70 0.060 2 gi|478673|pir||S23737proline-rich protein precursor -... +3 83 0.064 1 gi|1911629|gb|AAB50771.1|(S83377) huntingtin protein ... +2 46 0.075 2 gi|7292503|gb|AAF47906.1|(AE003481) CG15023 gene prod... +2 78 0.077 1 gi|7294245|gb|AAF49596.1|(AE003530) Eig71Ee gene prod... +2 70 0.080 2 gi|12408113|gb|AAG53694.1|AF327708_1(AF327708) retino... +2 75 0.080 1 gi|228938|prf||1814452CHyp-rich glycoprotein [Zea dip... +2 83 0.081 1 gi|283032|pir||S22456hydroxyproline-rich glycoprotein... +2 83 0.081 1
Use the and icons to retrieve links to Entrez:
WARNING: Descriptions of 530 database sequences were not reported due to the limiting value of parameter V = 50. >gi|9279651|dbj|BAB01151.1| (AP000373) flavonol 3-O-glucosyltransferase-like protein [Arabidopsis thaliana] Length = 462 Frame 1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | 462 0 150 300 450 Plus Strand HSPs: Score = 153 (53.9 bits), Expect = 3.5e-09, P = 3.5e-09 Identities = 39/79 (49%), Positives = 49/79 (62%), Frame = +1 Query: 10 GEDTVVLYPAPGIGHIVSMVELAK-LLQLHAH-SITILLTTGLLDHPSIDTYIHRISISH 183 GE+ +VLYPAP IGH+VSMVEL K +L + SI I+L S TYI +S S Sbjct: 2 GEEAIVLYPAPPIGHLVSMVELGKTILSKNPSLSIHIILVPPPYQPESTATYISSVSSSF 61 Query: 184 PSIFFHRLPH-TXLSTTTT 237 PSI FH LP T S+++T Sbjct: 62 PSITFHHLPAVTPYSSSST 80 >gi|7294399|gb|AAF49745.1| (AE003535) CG13482 gene product [Drosophila melanogaster] Length = 102 Frame 3 hits (HSPs): _______________________________________ __________________________________________________ Database sequence: | | | | | | | 102 0 20 40 60 80 100 Plus Strand HSPs: Score = 104 (36.6 bits), Expect = 7.1e-05, P = 7.1e-05 Identities = 24/56 (42%), Positives = 28/56 (50%), Frame = +3 Query: 108 HNSSHHGSP*PPLYRHLHTPNLH--LSPFNLLPPPPPHXTLHHHHREHGRQSLQLHQP 275 H+ HHG PP++ H P+ H P PPPPPH HHHH HG H P Sbjct: 45 HHHHHHG---PPMHHHGPPPHHHHHYGP----PPPPPHYDHHHHH--HGSHFDHHHGP 93 Score = 85 (29.9 bits), Expect = 0.0073, P = 0.0073 Identities = 22/52 (42%), Positives = 26/52 (50%), Frame = +3 Query: 108 HNSSHHGSP*PPLYRHLHTPNLH--LSPFNLLPPPPPHXTLHHHHR----------EHG 248 H+ HHG PP++ H P+ H P PPPPPH HHHH HG Sbjct: 45 HHHHHHG---PPMHHHGPPPHHHHHYGP----PPPPPHYDHHHHHHGSHFDHHHGPHHG 96 Score = 73 (25.7 bits), Expect = 0.14, P = 0.13 Identities = 17/39 (43%), Positives = 20/39 (51%), Frame = +3 Query: 120 HHGSP*PPLYRHLHTPNLHLSPFNLLPPPPPHXTLHHHH 236 HH P PP++ H H H P + PPPH HHHH Sbjct: 36 HHYHPPPPVHHHHH----HGPPMHH-HGPPPH---HHHH 66 Score = 73 (25.7 bits), Expect = 0.14, P = 0.13 Identities = 22/58 (37%), Positives = 26/58 (44%), Frame = +3 Query: 108 HNSSHHGSP*PPLYRHLHTPNLHLSPFNLLPPPPPHXTLHHHHREHGRQSLQLHQPKP 281 H+ HH P PP++ H H PPPP H HHHH HG + H P P Sbjct: 24 HHHHHHHPP-PPVH-HYH------------PPPPVH---HHHH--HG-PPMHHHGPPP 61 Score = 67 (23.6 bits), Expect = 0.78, P = 0.54 Identities = 18/43 (41%), Positives = 22/43 (51%), Frame = +3 Query: 108 HNSSHHGSP*PPL-YRHLHTPNLHLSPFNLLPPPPPHXTLHHHH 236 H+ H+G P PP Y H H + H S F+ PH HHHH Sbjct: 62 HHHHHYGPPPPPPHYDHHH--HHHGSHFD--HHHGPHHGHHHHH 101 Score = 67 (23.6 bits), Expect = 0.78, P = 0.54 Identities = 21/57 (36%), Positives = 25/57 (43%), Frame = +3 Query: 108 HNSSHHGSP*PPLYRHLHTPNLH--LSPFNLLPPPPPHXTLHHHHR-------------- 239 H+ HHG PP++ H P+ H P PPPPPH HHHH Sbjct: 45 HHHHHHG---PPMHHHGPPPHHHHHYGP----PPPPPHYDHHHHHHGSHFDHHHGPHHGH 97 Query: 240 --EH 245 H Sbjct: 98 HHHH 101 >gi|11071236|emb|CAC14290.1| (AJ297853) bicoid protein [Musca domestica] Length = 468 Frame 3 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | 468 0 150 300 450 Plus Strand HSPs: Score = 104 (36.6 bits), Expect = 0.00069, P = 0.00069 Identities = 23/59 (38%), Positives = 31/59 (52%), Frame = +3 Query: 105 NHNSSHHGSP*PPLYRHLHTP---NLHLSPFNLLPPPPPHXTLHHHHREHGRQSLQLHQP 275 NH+S+HH PP + L P + + PPPPP +L HHH +H +Q H P Sbjct: 233 NHHSTHHHHHQPPHHATLTHPYGCSAGATGGQYYPPPPPPSSLQHHHSQHQQQ---YHSP 289 Query: 276 KP 281 P Sbjct: 290 HP 291 >gi|11071238|emb|CAC14291.1| (AJ297854) bicoid protein [Musca domestica] Length = 468 Frame 3 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | 468 0 150 300 450 Plus Strand HSPs: Score = 104 (36.6 bits), Expect = 0.00069, P = 0.00069 Identities = 23/59 (38%), Positives = 31/59 (52%), Frame = +3 Query: 105 NHNSSHHGSP*PPLYRHLHTP---NLHLSPFNLLPPPPPHXTLHHHHREHGRQSLQLHQP 275 NH+S+HH PP + L P + + PPPPP +L HHH +H +Q H P Sbjct: 233 NHHSTHHHHHQPPHHATLTHPYGCSAGATGGQYYPPPPPPSSLQHHHSQHQQQ---YHSP 289 Query: 276 KP 281 P Sbjct: 290 HP 291 >gi|11078691|emb|CAC14293.1| (AJ297850) bicoid protein [Musca domestica] Length = 468 Frame 3 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | 468 0 150 300 450 Plus Strand HSPs: Score = 104 (36.6 bits), Expect = 0.00069, P = 0.00069 Identities = 23/59 (38%), Positives = 31/59 (52%), Frame = +3 Query: 105 NHNSSHHGSP*PPLYRHLHTP---NLHLSPFNLLPPPPPHXTLHHHHREHGRQSLQLHQP 275 NH+S+HH PP + L P + + PPPPP +L HHH +H +Q H P Sbjct: 233 NHHSTHHHHHQPPHHATLTHPYGCSAGATGGQYYPPPPPPSSLQHHHSQHQQQ---YHSP 289 Query: 276 KP 281 P Sbjct: 290 HP 291 >gi|544411|sp|Q06885|GP10_DICDI GLYCOPROTEIN GP100 PRECURSOR (P29F8) >gi|167797|gb|AAC37369.1| (L04286) glycoprotein gp100 [Dictyostelium discoideum] Length = 544 Frame 2 hits (HSPs): ___________ __ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | | 544 0 150 300 450 __________________ Annotated Domains: DOMO DM00698: 77..188 Entrez Domain: EXTRACELLULAR (POTENTIAL). 20..489 Entrez Transmembrane region: POTENTIAL. 490..510 Entrez Domain: CYTOPLASMIC (POTENTIAL). 511..544 Entrez Domain: THR/PRO-RICH. 117..208 Entrez glycosylation site: POTENTIAL. 80 Entrez glycosylation site: POTENTIAL. 224 Entrez glycosylation site: POTENTIAL. 308 Entrez glycosylation site: POTENTIAL. 332 Entrez glycosylation site: POTENTIAL. 366 Entrez glycosylation site: POTENTIAL. 380 Entrez glycosylation site: POTENTIAL. 410 Entrez glycosylation site: POTENTIAL. 422 Entrez glycosylation site: POTENTIAL. 478 PRODOM PD074171: GP10_DICDI 1..115 PRODOM PD000540: H1(18) O76786(11) TONB(10) 123..209 PRODOM PD132319: GP10_DICDI 211..543 __________________ Plus Strand HSPs: Score = 92 (32.4 bits), Expect = 0.017, P = 0.016 Identities = 25/65 (38%), Positives = 35/65 (53%), Frame = +2 Query: 119 SPRVSLTTPLSTPTYTESPSLTLQS----SSTASPTHXSPP---PPP*AWPPKPSTSSTK 277 S S TP STPT T +P+ T Q+ ++T PT S P P P P +PS+S +K Sbjct: 157 SKPTSTPTPTSTPTPTSTPTPTSQTIPPPTTTPKPTKSSKPTKTPVPTPTPTRPSSSVSK 216 Query: 278 AFNFI 292 ++ I Sbjct: 217 GYDII 221 Score = 90 (31.7 bits), Expect = 0.0011, Sum P(2) = 0.0011 Identities = 23/51 (45%), Positives = 30/51 (58%), Frame = +2 Query: 119 SPRV--SLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKPSTSS 271 SP V T+P S PT T +P+ T +ST +PT + PPP PKP+ SS Sbjct: 145 SPTVPPQTTSPTSKPTSTPTPTSTPTPTSTPTPTSQTIPPP--TTTPKPTKSS 195 Score = 69 (24.3 bits), Expect = 9.8, P = 1.0 Identities = 22/51 (43%), Positives = 26/51 (50%), Frame = +2 Query: 131 SLTTPLSTPTYTESPSLTLQS--SSTASPTHXSPPPPP*AWPP--KPSTSST 274 S TP TPT T P+ T S S T PT SP PP P KP+++ T Sbjct: 114 SKQTPTPTPTPTSKPTSTPTSTPSQTIPPT-VSPTVPPQTTSPTSKPTSTPT 164 Score = 34 (12.0 bits), Expect = 0.0011, Sum P(2) = 0.0011 Identities = 6/13 (46%), Positives = 9/13 (69%), Frame = +2 Query: 257 PSTSSTKAFNFIN 295 P T++T +FIN Sbjct: 406 PKTTNTTMLSFIN 418 >gi|11258169|pir||T45603 glucosyltransferase-like protein - Arabidopsis thaliana >gi|6523069|emb|CAB62336.1| (AL133314) glucosyltransferase-like protein [Arabidopsis thaliana] Length = 453 Frame 1 hits (HSPs): _______ ___ __________________________________________________ Database sequence: | | | || 453 0 150 300 450 Plus Strand HSPs: Score = 87 (30.6 bits), Expect = 0.0011, Sum P(2) = 0.0011 Identities = 22/49 (44%), Positives = 28/49 (57%), Frame = +1 Query: 19 TVVLYPAPGIGHIVSMVELAKLLQLHAHSITILLTTGLLDHPSIDTYIH 165 +VVL P P GHI M++LAK L L SIT++ T PS D + H Sbjct: 9 SVVLVPFPAQGHISPMMQLAKTLHLKGFSITVVQTKFNYFSPS-DDFTH 56 Score = 35 (12.3 bits), Expect = 0.0011, Sum P(2) = 0.0011 Identities = 6/19 (31%), Positives = 12/19 (63%), Frame = +1 Query: 202 RLPHTXLSTTTTVSMAAKA 258 +LP+ STT+ + A ++ Sbjct: 127 KLPNIIFSTTSATAFACRS 145 >gi|7512984|pir||T00065 hypothetical protein KIAA0442 - human (fragment) >gi|2662165|dbj|BAA23714.1| (AB007902) HH0712 cDNA clone for KIAA0442 has a 574-bp insertion at position 1474 of the sequence of KIAA0442. [Homo sapiens] Length = 1172 Frame 3 hits (HSPs): ____ __ Frame 2 hits (HSPs): __ __________________________________________________ Database sequence: | | | | | | | | | 1172 0 150 300 450 600 750 900 1050 Plus Strand HSPs: Score = 84 (29.6 bits), Expect = 0.0013, Sum P(2) = 0.0013 Identities = 26/66 (39%), Positives = 32/66 (48%), Frame = +3 Query: 15 RHSSVVSSPRHRPHCIHG*AC*ASPITCSF-NHNSSHHGSP*--PPLYRHLHTPNLHLSP 185 R S+ + PH H + ASP S NH+ H +P PP H H PN+ P Sbjct: 321 RSSTPAKTQPAPPHISHHPS--ASPFPLSLPNHSPLHSFTPTLQPPA--HSHHPNMFAPP 376 Query: 186 FNLLPPPPP 212 LPPPPP Sbjct: 377 -TALPPPPP 384 Score = 47 (16.5 bits), Expect = 0.0013, Sum P(2) = 0.0013 Identities = 9/28 (32%), Positives = 11/28 (39%), Frame = +3 Query: 189 NLLPPPPPHXTLHHHHREHGRQSLQLHQ 272 +L PPP H H +H HQ Sbjct: 426 SLGPPPYLRTEFHQHQHQHQHTHQHTHQ 453 Score = 40 (14.1 bits), Expect = 0.0065, Sum P(2) = 0.0065 Identities = 11/32 (34%), Positives = 12/32 (37%), Frame = +3 Query: 195 LPPPPPHXT---LHHHHREHGRQSLQLHQPKP 281 L PPP T H H +H Q H P Sbjct: 427 LGPPPYLRTEFHQHQHQHQHTHQHTHQHTFTP 458 Score = 37 (13.0 bits), Expect = 0.013, Sum P(2) = 0.013 Identities = 8/13 (61%), Positives = 8/13 (61%), Frame = +2 Query: 203 ASPTH-XSPPPPP 238 A TH S PPPP Sbjct: 909 APQTHRASEPPPP 921 >gi|8885608|dbj|BAA97538.1| (AB028606) UDP-glucose:anthocysnin 5-O-glucosyltransferase-like [Arabidopsis thaliana] Length = 449 Frame 3 hits (HSPs): ______ Frame 1 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | 449 0 150 300 Plus Strand HSPs: Score = 84 (29.6 bits), Expect = 0.0014, Sum P(2) = 0.0014 Identities = 20/44 (45%), Positives = 25/44 (56%), Frame = +1 Query: 22 VVLYPAPGIGHIVSMVELAKLLQLHAHSITILLTTGLLDHPSID 153 VVL P P GHI M++LAK L SIT++ T +PS D Sbjct: 11 VVLVPVPAQGHITPMIQLAKALHSKGFSITVVQTKFNYLNPSND 54 Score = 37 (13.0 bits), Expect = 0.0014, Sum P(2) = 0.0014 Identities = 12/48 (25%), Positives = 22/48 (45%), Frame = +3 Query: 144 LYRHLHTPNLHLSPFNLLPPPPPHXTLHHHHREHGRQSLQ-LHQPKPST 287 L + L P + P +++ PP L E ++ L++ KPS+ Sbjct: 224 LQQELEIPVYSIGPLHMVVSAPPTSLL-----EENESCIEWLNKQKPSS 267 >gi|8885563|dbj|BAA97493.1| (AB025604) UDP-glycose:flavonoid glycosyltransferase-like [Arabidopsis thaliana] Length = 449 Frame 1 hits (HSPs): _____ _____ __________________________________________________ Database sequence: | | | | 449 0 150 300 Plus Strand HSPs: Score = 82 (28.9 bits), Expect = 0.0015, Sum P(2) = 0.0015 Identities = 16/37 (43%), Positives = 22/37 (59%), Frame = +1 Query: 13 EDTVVLYPAPGIGHIVSMVELAKLLQLHAHSITILLT 123 E +VL P P GH+ M++L K L SIT++LT Sbjct: 8 ETRIVLVPVPAQGHVTPMMQLGKALHSKGFSITVVLT 44 Score = 39 (13.7 bits), Expect = 0.0015, Sum P(2) = 0.0015 Identities = 9/36 (25%), Positives = 18/36 (50%), Frame = +1 Query: 151 DTYIHRISISHPSIFFHRLPHTXLSTTTTVSMAAKA 258 D Y++ SH ++ +LP STT+ + ++ Sbjct: 114 DEYMY---FSHAAVKEFQLPSVVFSTTSATAFVCRS 146 >gi|3582342|gb|AAC35239.1| (AC005496) putative flavonol 3-O-glucosyltransferase [Arabidopsis thaliana] Length = 467 Frame 1 hits (HSPs): ___________ __________________________________________________ Database sequence: | | | | | 467 0 150 300 450 Plus Strand HSPs: Score = 98 (34.5 bits), Expect = 0.0031, P = 0.0031 Identities = 29/92 (31%), Positives = 47/92 (51%), Frame = +1 Query: 22 VVLYPAPGIGHIVSMVELAKLL--QLHAHSITILLTTGLLDHPSIDTYIHRISISHPSIF 195 ++ P P +GH+V +E A+ L Q ITILL L +DTY+ I+ S P + Sbjct: 6 LIFIPTPTVGHLVPFLEFARRLIEQDDRIRITILLMK-LQGQSHLDTYVKSIASSQPFVR 64 Query: 196 FHRLPHTXLSTT--TTVSMAAKAFNFINQSLQL 288 F +P T +T S+ A ++ I +++ L Sbjct: 65 FIDVPELEEKPTLGSTQSVEAYVYDVIERNIPL 97 >gi|7300213|gb|AAF55377.1| (AE003716) CG5225 gene product [Drosophila melanogaster] Length = 594 Frame 3 hits (HSPs): ____ Frame 2 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | | 594 0 150 300 450 Plus Strand HSPs: Score = 70 (24.6 bits), Expect = 0.0031, Sum P(2) = 0.0031 Identities = 17/44 (38%), Positives = 19/44 (43%), Frame = +3 Query: 105 NHNSSHHGSP*PPLYRHLHTPNLHLSPFNLLPPPPPHXTLHHHH 236 +H+ HH P PP P P PPPPPH H HH Sbjct: 143 DHHDHHHHHPAPP-------PPPPPPPPPPPPPPPPHSHPHSHH 179 Score = 51 (18.0 bits), Expect = 0.0031, Sum P(2) = 0.0031 Identities = 9/13 (69%), Positives = 9/13 (69%), Frame = +2 Query: 224 PPPPP*AWPPKPS 262 PPPPP PP PS Sbjct: 240 PPPPPPPAPPPPS 252 Score = 39 (13.7 bits), Expect = 0.051, Sum P(2) = 0.050 Identities = 7/12 (58%), Positives = 7/12 (58%), Frame = +2 Query: 224 PPPPP*AWPPKP 259 PP PP PP P Sbjct: 217 PPGPPGTGPPGP 228 >gi|7267545|emb|CAB78027.1| (AL161513) arabinogalactan-protein homolog [Arabidopsis thaliana] >gi|10880497|gb|AAG24278.1|AF195891_1 (AF195891) arabinogalactan protein [Arabidopsis thaliana] Length = 127 Frame 2 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 127 0 50 100 Plus Strand HSPs: Score = 88 (31.0 bits), Expect = 0.0035, P = 0.0035 Identities = 23/51 (45%), Positives = 28/51 (54%), Frame = +2 Query: 122 PRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKPSTSST 274 PR + TP TPT T +PS T +++ SP SP P A PP P TS T Sbjct: 40 PRTAAPTPSITPTPTPTPSAT-PTAAPVSPPAGSPLPSS-ASPPAPPTSLT 88 Score = 71 (25.0 bits), Expect = 0.22, P = 0.20 Identities = 23/53 (43%), Positives = 28/53 (52%), Frame = +2 Query: 122 PRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKPSTSSTKA 280 PR + TP TPT T +PS T TA+P SPP A P PS++S A Sbjct: 40 PRTAAPTPSITPTPTPTPSAT----PTAAPV--SPP----AGSPLPSSASPPA 82 Score = 70 (24.6 bits), Expect = 0.28, P = 0.25 Identities = 18/50 (36%), Positives = 25/50 (50%), Frame = +2 Query: 122 PRVSLTTPLSTPTYTESPSLTLQSSSTASP------THXSPPPPP*AWPP 253 PR + TP TPT T +PS T ++ + P + SPP PP + P Sbjct: 40 PRTAAPTPSITPTPTPTPSATPTAAPVSPPAGSPLPSSASPPAPPTSLTP 89 >gi|7433902|pir||T01732 UTP-glucose glucosyltransferase homolog A_IG002N01.15 - Arabidopsis thaliana >gi|2191136|gb|AAB61023.1| (AF007269) Similar to UTP-Glucose Glucosyltransferase; coded for by A. thaliana cDNA T46230; coded for by A. thaliana cDNA H76538; coded for by A. thaliana cDNA H76290 [Arabidopsis thaliana] Length = 462 Frame 1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | 462 0 150 300 450 Plus Strand HSPs: Score = 97 (34.1 bits), Expect = 0.0039, P = 0.0039 Identities = 24/74 (32%), Positives = 40/74 (54%), Frame = +1 Query: 22 VVLYPAPGIGHIVSMVELAKLLQLHAHSITILLTTGLLDHPSID--TYIHRISISHPSIF 195 V + P+PG+GH++ +VE AK L +H H +T+ PS T + + S S+F Sbjct: 9 VAIIPSPGMGHLIPLVEFAKRL-VHLHGLTVTFVIAGEGPPSKAQRTVLDSLPSSISSVF 67 Query: 196 FHRLPHTXLSTTTTV 240 + T LS++T + Sbjct: 68 LPPVDLTDLSSSTRI 82 >gi|7267604|emb|CAB80916.1| (AL161491) putative flavonol glucosyltransferase [Arabidopsis thaliana] Length = 480 Frame 1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | 480 0 150 300 450 Plus Strand HSPs: Score = 97 (34.1 bits), Expect = 0.0041, P = 0.0041 Identities = 24/74 (32%), Positives = 40/74 (54%), Frame = +1 Query: 22 VVLYPAPGIGHIVSMVELAKLLQLHAHSITILLTTGLLDHPSID--TYIHRISISHPSIF 195 V + P+PG+GH++ +VE AK L +H H +T+ PS T + + S S+F Sbjct: 9 VAIIPSPGMGHLIPLVEFAKRL-VHLHGLTVTFVIAGEGPPSKAQRTVLDSLPSSISSVF 67 Query: 196 FHRLPHTXLSTTTTV 240 + T LS++T + Sbjct: 68 LPPVDLTDLSSSTRI 82 >gi|12718478|emb|CAC28807.1| (AL513466) hypothetical protein [Neurospora crassa] Length = 697 Frame 3 hits (HSPs): ____ ___ __________________________________________________ Database sequence: | | | | | | 697 0 150 300 450 600 Plus Strand HSPs: Score = 77 (27.1 bits), Expect = 0.0055, Sum P(2) = 0.0054 Identities = 14/27 (51%), Positives = 16/27 (59%), Frame = +3 Query: 198 PPPPPHXTLHHHHREHGRQSLQLHQPK 278 PPPPPH HH +H Q Q HQP+ Sbjct: 596 PPPPPHSNQPPHHAQH--QQHQQHQPQ 620 Score = 43 (15.1 bits), Expect = 0.0055, Sum P(2) = 0.0054 Identities = 15/41 (36%), Positives = 19/41 (46%), Frame = +3 Query: 108 HNS---SHHGSP*PPLYRHLHTPNL----HLSPFNLLPPPPP 212 HN + H P PP+ + P L H P +PPPPP Sbjct: 488 HNQPVGNFHPVP-PPIGFPVGAPPLPGQPHQFPSFPVPPPPP 528 Score = 43 (15.1 bits), Expect = 0.0055, Sum P(2) = 0.0054 Identities = 16/42 (38%), Positives = 20/42 (47%), Frame = +3 Query: 108 HNS---SHHGSP*PPLYRHLHTPNL----HLSP-FNLLPPPPP 212 HN + H P PP+ + P L H P F + PPPPP Sbjct: 488 HNQPVGNFHPVP-PPIGFPVGAPPLPGQPHQFPSFPVPPPPPP 529 >gi|7940276|gb|AAF70835.1|AC003113_2 (AC003113) F24O1.6 [Arabidopsis thaliana] Length = 70 Frame 3 hits (HSPs): _______________________ Frame 2 hits (HSPs): _________________________ __________________________________________________ Database sequence: | | | | | 70 0 20 40 60 Plus Strand HSPs: Score = 50 (17.6 bits), Expect = 0.0075, Sum P(2) = 0.0075 Identities = 9/17 (52%), Positives = 9/17 (52%), Frame = +2 Query: 209 PTHXSPPPPP*AWPPKP 259 P PPPPP PP P Sbjct: 24 PEPRPPPPPPGPQPPPP 40 Score = 49 (17.2 bits), Expect = 0.0075, Sum P(2) = 0.0075 Identities = 12/27 (44%), Positives = 13/27 (48%), Frame = +3 Query: 132 P*PPLYRHLHTPNLHLSPFNLLPPPPP 212 P PP +RH P H P PPP P Sbjct: 2 PYPPPHRH--PPPFHHPPPPRPPPPEP 26 Score = 49 (17.2 bits), Expect = 0.024, Sum P(2) = 0.023 Identities = 9/17 (52%), Positives = 10/17 (58%), Frame = +2 Query: 224 PPPPP*AWPPKPSTSST 274 PPPPP PP P +T Sbjct: 39 PPPPPRPDPPPPLPGAT 55 Score = 49 (17.2 bits), Expect = 0.024, Sum P(2) = 0.023 Identities = 13/31 (41%), Positives = 14/31 (45%), Frame = +3 Query: 132 P*PPLYRHL----HTPNLHLSPFNLLPPPPP 212 P PP +RH H P P PPPPP Sbjct: 2 PYPPPHRHPPPFHHPPPPRPPPPEPRPPPPP 32 Score = 47 (16.5 bits), Expect = 0.038, Sum P(2) = 0.037 Identities = 9/17 (52%), Positives = 10/17 (58%), Frame = +2 Query: 209 PTHXSPPPPP*AWPPKP 259 P PPPPP PP+P Sbjct: 32 PPGPQPPPPP---PPRP 45 Score = 41 (14.4 bits), Expect = 0.15, Sum P(2) = 0.14 Identities = 13/32 (40%), Positives = 14/32 (43%), Frame = +3 Query: 132 P*PPLYRH---LH--TPNLHLSPFNLLPPPPP 212 P PP +RH H P P PPPPP Sbjct: 2 PYPPPHRHPPPFHHPPPPRPPPPEPRPPPPPP 33 Score = 36 (12.7 bits), Expect = 0.49, Sum P(2) = 0.38 Identities = 7/15 (46%), Positives = 8/15 (53%), Frame = +3 Query: 195 LPPPPPHX---TLHH 230 +P PPPH HH Sbjct: 1 MPYPPPHRHPPPFHH 15 Score = 35 (12.3 bits), Expect = 0.61, Sum P(2) = 0.46 Identities = 9/17 (52%), Positives = 9/17 (52%), Frame = +2 Query: 209 PTHXSPPPP-P*A-WPP 253 P PPPP P A W P Sbjct: 42 PPRPDPPPPLPGATWFP 58 >gi|9630030 ref|NP_046248.1| unknown [Orgyia pseudotsugata nuclear polyhedrosis virus] >gi|2493240|sp|O10341|Y091_NPVOP HYPOTHETICAL 29.3 KD PROTEIN (ORF92) >gi|7515481|pir||T10361 hypothetical protein 92 - Orgyia pseudotsugata nuclear polyhedrosis virus >gi|1911338|gb|AAC59091.1| (U75930) unknown [Orgyia pseudotsugata nuclear polyhedrosis virus] Length = 279 Frame 2 hits (HSPs): __________________________________ __________________________________________________ Database sequence: | | | | | | | 279 0 50 100 150 200 250 Plus Strand HSPs: Score = 91 (32.0 bits), Expect = 0.0079, P = 0.0078 Identities = 23/49 (46%), Positives = 26/49 (53%), Frame = +2 Query: 119 SPRVSLT-TPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKPS 262 SP S T TP TP+ T +PS T + T SPT PPP PP PS Sbjct: 142 SPTPSPTPTPSPTPSPTPTPSPTPSPTPTPSPTPSPTPPPSPTPPPSPS 190 Score = 84 (29.6 bits), Expect = 0.046, P = 0.045 Identities = 26/53 (49%), Positives = 29/53 (54%), Frame = +2 Query: 119 SPRVSLT-TPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWP-PKPSTSST 274 SP S T TP TP+ T +PS T S T SPT SP P P P P P+ S T Sbjct: 112 SPTPSPTPTPSPTPSPTPTPSPTPTPSPTPSPTP-SPTPTPSPTPSPTPTPSPT 164 Score = 82 (28.9 bits), Expect = 0.076, P = 0.073 Identities = 24/52 (46%), Positives = 29/52 (55%), Frame = +2 Query: 119 SPRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWP-PKPSTSST 274 SP +S TP TPT + +PS T + T SPT SP P P P P P+ S T Sbjct: 84 SPTLS-PTPSPTPTPSPTPSPTPSPTPTPSPTP-SPTPTPSPTPSPTPTPSPT 134 Score = 82 (28.9 bits), Expect = 0.076, P = 0.073 Identities = 25/53 (47%), Positives = 29/53 (54%), Frame = +2 Query: 119 SPRVSLT-TPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWP-PKPSTSST 274 SP S T TP TPT + +PS T + T SPT SP P P P P P+ S T Sbjct: 122 SPTPSPTPTPSPTPTPSPTPSPTPSPTPTPSPTP-SPTPTPSPTPSPTPTPSPT 174 Score = 81 (28.5 bits), Expect = 0.098, P = 0.094 Identities = 25/53 (47%), Positives = 28/53 (52%), Frame = +2 Query: 119 SPRVSLTTPLSTPTYTESPSLTLQSSSTASPTHX-SPPPPP*AWP-PKPSTSST 274 SP S TP TPT + +PS T S T SPT SP P P P P PS + T Sbjct: 98 SPTPS-PTPSPTPTPSPTPSPTPTPSPTPSPTPTPSPTPTPSPTPSPTPSPTPT 150 Score = 81 (28.5 bits), Expect = 0.098, P = 0.094 Identities = 25/70 (35%), Positives = 32/70 (45%), Frame = +2 Query: 65 WLSLLSFSNYMLIQSQFFSPRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*A 244 WL L S + + SP + TP TPT T SP+ T + T +PT P P A Sbjct: 19 WL-LTSLKQNIETKPPLPSPTPT-PTPSPTPTPTPSPTPTPTPTPTPTPTPTPSPTPTPA 76 Query: 245 WPPKPSTSST 274 P P+ S T Sbjct: 77 LSPTPTPSPT 86 Score = 80 (28.2 bits), Expect = 0.13, P = 0.12 Identities = 23/52 (44%), Positives = 29/52 (55%), Frame = +2 Query: 119 SPRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWP-PKPSTSST 274 +P +S TP +PT + +PS T S T SPT SP P P P P P+ S T Sbjct: 74 TPALS-PTPTPSPTLSPTPSPTPTPSPTPSPTP-SPTPTPSPTPSPTPTPSPT 124 Score = 80 (28.2 bits), Expect = 0.13, P = 0.12 Identities = 25/57 (43%), Positives = 30/57 (52%), Frame = +2 Query: 119 SPRVSLT---TPLSTPTYTESPSLTLQSSSTASPTHX-SPPPPP*AWP-PKPSTSST 274 SP +S T TP +PT + +PS T S T SPT SP P P P P P+ S T Sbjct: 84 SPTLSPTPSPTPTPSPTPSPTPSPTPTPSPTPSPTPTPSPTPSPTPTPSPTPTPSPT 140 Score = 79 (27.8 bits), Expect = 0.16, P = 0.15 Identities = 24/53 (45%), Positives = 29/53 (54%), Frame = +2 Query: 119 SPRVSLT-TPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWP-PKPSTSST 274 SP S T TP TP+ T +PS T + T SPT +P P P P P P+ S T Sbjct: 102 SPTPSPTPTPSPTPSPTPTPSPTPSPTPTPSPTP-TPSPTPSPTPSPTPTPSPT 154 Score = 78 (27.5 bits), Expect = 0.21, P = 0.19 Identities = 26/57 (45%), Positives = 30/57 (52%), Frame = +2 Query: 119 SPRVSLT-TPLSTPTYTESPSLT--LQSSSTASPTHXSPPPPP*AWP---PKPSTSST 274 SP + T TP TPT T SP+ T L + T SPT SP P P P P P+ S T Sbjct: 52 SPTPTPTPTPTPTPTPTPSPTPTPALSPTPTPSPT-LSPTPSPTPTPSPTPSPTPSPT 108 Score = 76 (26.8 bits), Expect = 0.34, P = 0.29 Identities = 24/57 (42%), Positives = 29/57 (50%), Frame = +2 Query: 119 SPRVSLT-TPLSTPTYTESPSLTLQSSSTASPTHX---SPPPPP*AWP-PKPSTSST 274 SP S T TP TPT + +PS T + T SPT +P P P P P P+ S T Sbjct: 122 SPTPSPTPTPSPTPTPSPTPSPTPSPTPTPSPTPSPTPTPSPTPSPTPTPSPTPSPT 178 Score = 75 (26.4 bits), Expect = 0.44, P = 0.36 Identities = 26/59 (44%), Positives = 30/59 (50%), Frame = +2 Query: 119 SPRVSLT-TPLSTPTYTESPSLT--LQSSSTASPTHX---SPPPPP*AWP-PKPSTSST 274 SP + T TP TPT T SP+ T L + T SPT SP P P P P PS + T Sbjct: 52 SPTPTPTPTPTPTPTPTPSPTPTPALSPTPTPSPTLSPTPSPTPTPSPTPSPTPSPTPT 110 Score = 73 (25.7 bits), Expect = 0.74, P = 0.52 Identities = 25/78 (32%), Positives = 39/78 (50%), Frame = +2 Query: 53 TLYPWLSLLSFSNYMLIQ-SQFFSPRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPP 229 TL ++ ++ + ++L Q + L +P TPT T SP+ T S T +PT +P Sbjct: 6 TLLNYIIIILLTLWLLTSLKQNIETKPPLPSP--TPTPTPSPTPTPTPSPTPTPTP-TPT 62 Query: 230 PPP*AWPPKPSTSSTKAFN 286 P P P PS + T A + Sbjct: 63 PTP---TPTPSPTPTPALS 78 Score = 70 (24.6 bits), Expect = 1.6, P = 0.79 Identities = 24/52 (46%), Positives = 29/52 (55%), Frame = +2 Query: 119 SPRVSLT-TPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWP-PKPSTSST 274 +P S T TP +PT T SP+L S T SPT +P P P P P P+ S T Sbjct: 66 TPTPSPTPTPALSPTPTPSPTL----SPTPSPTP-TPSPTPSPTPSPTPTPSPT 114 Score = 68 (23.9 bits), Expect = 7.7, P = 1.0 Identities = 19/53 (35%), Positives = 25/53 (47%), Frame = +2 Query: 119 SPRVSLT-TPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKPSTSST 274 SP + T +P TPT T +P+ T S T +P P P P PS + T Sbjct: 44 SPTPTPTPSPTPTPTPTPTPTPTPTPSPTPTPALSPTPTPSPTLSPTPSPTPT 96 >gi|2134202|pir||I51171 transcription factor RcC/EPB-1 - bullfrog >gi|478889|gb|AAA52223.1| (U08604) transcription factor RcC/EPB-1 [Rana catesbeiana] Length = 292 Frame 3 hits (HSPs): ______________ Annotated Domains: ______ __________________________________________________ Database sequence: | | | | | | | 292 0 50 100 150 200 250 __________________ Annotated Domains: PROSITE LEUCINE_ZIPPER: Leucine zipper pattern. 251..272 PROSITE LEUCINE_ZIPPER: Leucine zipper pattern. 258..279 __________________ Plus Strand HSPs: Score = 91 (32.0 bits), Expect = 0.0085, P = 0.0085 Identities = 22/66 (33%), Positives = 28/66 (42%), Frame = +3 Query: 102 FNHNSSHHGSP*PPLYRHLHTPNLHLSPFNLLPPPPPHXTLHH--HHREHGRQSLQLHQP 275 + H S+ H S H +HL P + PPP P + HH HH H LQ Sbjct: 129 YPHPSNQHPSHLQYQVAHCAQTTMHLQPGHPTPPPTPVPSPHHLPHHHHHHHHQLQASSS 188 Query: 276 KPSTSS 293 K +SS Sbjct: 189 KAMSSS 194 Score = 80 (28.2 bits), Expect = 0.14, P = 0.13 Identities = 23/72 (31%), Positives = 29/72 (40%), Frame = +3 Query: 102 FNHNSSHHGSP*PPLYRHLHTPNLHLSPFNLLPPPPP-----HXTLHHHHREHGRQ---S 257 + H S+ H S H +HL P + PPP P H HHHH H Q S Sbjct: 129 YPHPSNQHPSHLQYQVAHCAQTTMHLQPGHPTPPPTPVPSPHHLPHHHHHHHHQLQASSS 188 Query: 258 LQLHQPKPSTSS 293 + S+SS Sbjct: 189 KAMSSSSSSSSS 200 >gi|688080|gb|AAB31576.1| lectin=chitin-binding protein [Solanum tuberosum=potatoes, tubers, Peptide Partial, 27 aa] Length = 27 Frame 3 hits (HSPs): _________ Frame 2 hits (HSPs): ____________________________________ __________________________________________________ Database sequence: | | | 27 0 20 Plus Strand HSPs: Score = 59 (20.8 bits), Expect = 0.0096, Sum P(2) = 0.0096 Identities = 11/18 (61%), Positives = 11/18 (61%), Frame = +2 Query: 209 PTHXSPPPPP*AWPPKPS 262 P H SPPPP PP PS Sbjct: 8 PPHPSPPPPSPPSPPPPS 25 Score = 52 (18.3 bits), Expect = 0.049, Sum P(2) = 0.048 Identities = 11/19 (57%), Positives = 11/19 (57%), Frame = +2 Query: 209 PTHXSPPPP--P*AWPPKP 259 P H SPPPP P PP P Sbjct: 8 PPHPSPPPPSPPSPPPPSP 26 Score = 52 (18.3 bits), Expect = 0.049, Sum P(2) = 0.048 Identities = 11/21 (52%), Positives = 11/21 (52%), Frame = +2 Query: 209 PTHXSPPPPP*AWPPKPSTSS 271 P H SPPPP PP P S Sbjct: 8 PPHPSPPPPS---PPSPPPPS 25 Score = 35 (12.3 bits), Expect = 0.0096, Sum P(2) = 0.0096 Identities = 5/5 (100%), Positives = 5/5 (100%), Frame = +3 Query: 198 PPPPP 212 PPPPP Sbjct: 4 PPPPP 8 >gi|11061709|emb|CAC14289.1| (AJ297856) bicoid protein [Lucilia sericata] Length = 227 Frame 3 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | | 227 0 50 100 150 200 Plus Strand HSPs: Score = 88 (31.0 bits), Expect = 0.011, P = 0.011 Identities = 24/67 (35%), Positives = 34/67 (50%), Frame = +3 Query: 99 SFN--HNSSHHGSP*PPLYRHLHTPNLHLS---PF-----NLLPPPPPHXTLHHHHREHG 248 +FN + ++HH S PP + H + H S P+ PPPPP +L HH +H Sbjct: 154 AFNPYYYNNHHASH-PPHHHHQAHHHTHASLTHPYAAAGTQYYPPPPPPGSLQHHQHQHQ 212 Query: 249 RQSLQLHQPKP 281 +Q H P P Sbjct: 213 QQ---YHAPHP 220 >gi|7518953|pir||A71099 hypothetical protein PH1053 - Pyrococcus horikoshii >gi|3257468|dbj|BAA30151.1| (AP000004) 139aa long hypothetical protein [Pyrococcus horikoshii] Length = 139 Frame 2 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | 139 0 50 100 Plus Strand HSPs: Score = 83 (29.2 bits), Expect = 0.012, P = 0.012 Identities = 20/47 (42%), Positives = 29/47 (61%), Frame = +2 Query: 146 LSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKPSTSSTKAFN 286 LSTPT + SP ++ S + SP+ SPPPPP P P T+++ F+ Sbjct: 47 LSTPTISTSP-ISTASFACFSPSFISPPPPPPRNAP-PRTAASAIFS 91 >gi|1708075|sp|P54583|GUN1_ACICE ENDOGLUCANASE E1 PRECURSOR (ENDO-1,4-BETA-GLUCANASE E1) (CELLULASE E1) (ENDOCELLULASE E1) >gi|988300|gb|AAA75477.1| (U33212) E I beta-1,4-endoglucanase precursor [Acidothermus cellulolyticus] Length = 562 Frame 3 hits (HSPs): __ Frame 2 hits (HSPs): _______ Annotated Domains: _________________________________________________ __________________________________________________ Database sequence: | | | | | 562 0 150 300 450 __________________ Annotated Domains: BLOCKS BL00659A: Glycosyl hydrolases family 5 p 202..206 BLOCKS BL00659B: Glycosyl hydrolases family 5 p 271..287 DOMO DM01559: GLYCOSYLHYDROLASESFAMILY5 13..393 DOMO DM00357: CELLULOSE-BINDINGDOMAIN,BACTERI 461..558 Entrez Domain: CATALYTIC. 42..400 Entrez Domain: PRO/SER/THR-RICH (LINKER). 401..461 Entrez Domain: CELLULOSE-BINDING (BY SIMILARITY 462..562 Entrez active site: PROTON DONOR. 203 Entrez active site: NUCLEOPHILE. 323 PFAM cellulase: Cellulase (glycosyl hydrolase 51..371 PFAM CBD_2: Cellulose binding domain 465..559 PRODOM PD000557: GUNA(7) GUN1(6) GUN(6) 36..324 PRODOM PD000651: VL2(74) O18758(34) MUC2(10) 326..460 PRODOM PD196400: GUN1_ACICE 462..494 PRODOM PD001333: GUNA(3) CHIT(2) GUN1(2) 496..555 PROSITE GLYCOSYL_HYDROL_F5: Glycosyl hydrolases 196..205 __________________ Plus Strand HSPs: Score = 81 (28.5 bits), Expect = 0.013, Sum P(2) = 0.013 Identities = 20/40 (50%), Positives = 24/40 (60%), Frame = +2 Query: 119 SPRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP 238 SP S TP TPT T SP+ TL ++T +PT SP P P Sbjct: 420 SPSAS-RTPTPTPTPTASPTPTLTPTATPTPT-ASPTPSP 457 Score = 69 (24.3 bits), Expect = 0.23, Sum P(2) = 0.21 Identities = 20/58 (34%), Positives = 31/58 (53%), Frame = +2 Query: 101 IQSQFFSPRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKPSTSST 274 I+S F P + +P S P+ + SPS + S++ +PT P P P A P P+ + T Sbjct: 391 IKSSIFDPVGASASPSSQPSPSVSPSPSPSPSASRTPT---PTPTPTA-SPTPTLTPT 444 Score = 67 (23.6 bits), Expect = 5.1, Sum P(2) = 0.99 Identities = 21/64 (32%), Positives = 31/64 (48%), Frame = +2 Query: 101 IQSQFFSPRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPP---PPP*AWP---PKPS 262 I+S F P + +P S P+ + SPS + S++ +PT P P P P P P+ Sbjct: 391 IKSSIFDPVGASASPSSQPSPSVSPSPSPSPSASRTPTPTPTPTASPTPTLTPTATPTPT 450 Query: 263 TSST 274 S T Sbjct: 451 ASPT 454 Score = 33 (11.6 bits), Expect = 0.013, Sum P(2) = 0.013 Identities = 5/6 (83%), Positives = 5/6 (83%), Frame = +3 Query: 42 RHRPHC 59 RHRP C Sbjct: 156 RHRPDC 161 >gi|2232354|gb|AAB62270.1| (AF006081) UDPG glucosyltransferase [Solanum berthaultii] Length = 465 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | 465 0 150 300 450 Plus Strand HSPs: Score = 92 (32.4 bits), Expect = 0.013, P = 0.013 Identities = 22/64 (34%), Positives = 38/64 (59%), Frame = +1 Query: 22 VVLYPAPGIGHIVSMVELAKLLQLHAHSITILLT---TGLLD-----HPSIDTYIHRISI 177 ++++P P GHI+++++L L LH ITIL+T +LD +PS++T + Sbjct: 10 ILIFPFPAQGHILALLDLTHQLLLHGFKITILVTPKNVPILDPLISTNPSVETLVFPFP- 68 Query: 178 SHPSI 192 HPS+ Sbjct: 69 GHPSL 73 >gi|100212|pir||S14976 extensin class II (clones u1/u2) - tomato >gi|1345539|emb|CAA39217.1| (X55687) extensin (class II) [Lycopersicon esculentum] Length = 75 Frame 2 hits (HSPs): _____________________________________ Annotated Domains: ___________________________________________ __________________________________________________ Database sequence: | | | | | 75 0 20 40 60 __________________ Annotated Domains: DOMO DM01369: PROLINE-RICHPROTEIN 1..65 __________________ Plus Strand HSPs: Score = 81 (28.5 bits), Expect = 0.019, P = 0.019 Identities = 18/47 (38%), Positives = 23/47 (48%), Frame = +2 Query: 122 PRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*A-WPPKPS 262 P+ P TP+Y E P + P + PPPPP + W PKPS Sbjct: 16 PQPQSPPPPPTPSY-EHPKTPSHPTPPTPPCNEPPPPPPNSHWEPKPS 62 Score = 60 (21.1 bits), Expect = 3.3, P = 0.96 Identities = 16/49 (32%), Positives = 22/49 (44%), Frame = +2 Query: 143 PLSTPTY------TESPSLTLQSSSTASPTHXSPPPPP*AWPPKPSTSS 271 P TP+Y + P T +P+H +PP PP PP P +S Sbjct: 7 PPPTPSYEHPQPQSPPPPPTPSYEHPKTPSHPTPPTPPCNEPPPPPPNS 55 >gi|806720|gb|AAA66362.1| (U13066) arabinogalactan-protein precursor [Nicotiana alata] Length = 132 Frame 2 hits (HSPs): _____________________ __________________________________________________ Database sequence: | | | | 132 0 50 100 Plus Strand HSPs: Score = 81 (28.5 bits), Expect = 0.019, P = 0.019 Identities = 25/55 (45%), Positives = 30/55 (54%), Frame = +2 Query: 119 SPRVSLTTPL-STPTYTESP-SLTLQSSSTASPTHXSPPPPP*AWPPKPSTSSTKAF 283 SP+ S P+ S PT +P S QS STA+ SP P A PP P T+ T AF Sbjct: 31 SPKASPVAPVASPPTAVVTPVSAPSQSPSTAA----SPSESPLASPPAPPTADTPAF 83 >gi|11359668|pir||T48707 related to regulatory protein wetA [imported] - Neurospora crassa >gi|11544665|emb|CAB88506.2| (AL353817) related to regulatory protein wetA [Neurospora crassa] Length = 963 Frame 3 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | | | | | 963 0 150 300 450 600 750 900 Plus Strand HSPs: Score = 94 (33.1 bits), Expect = 0.020, P = 0.020 Identities = 29/62 (46%), Positives = 33/62 (53%), Frame = +3 Query: 108 HNSSHHG----SP*PPLYRHLHTPNLHLSP--FNLLPPPPPHXTLHHHHREHGRQSLQLH 269 H+ H G P PPL H H H P +L PPPPPH HHHH +S LH Sbjct: 757 HHPHHRGVMGYPPMPPLPAH-HL-QAHHGPGASSLSPPPPPH---HHHHPS---KSFSLH 808 Query: 270 QPKPSTSS 293 P S+SS Sbjct: 809 -PSSSSSS 815 >gi|3582343|gb|AAC35240.1| (AC005496) putative flavonol 3-O-glucosyltransferase [Arabidopsis thaliana] Length = 467 Frame 1 hits (HSPs): ___________ __________________________________________________ Database sequence: | | | | | 467 0 150 300 450 Plus Strand HSPs: Score = 90 (31.7 bits), Expect = 0.022, P = 0.022 Identities = 29/93 (31%), Positives = 45/93 (48%), Frame = +1 Query: 22 VVLYPAPGIGHIVSMVELAKLL--QLHAHSITILLTTGLLDHPSIDTYIHRISISHPSIF 195 ++ P P +GH+V +E A+ L Q IT LL +D+Y+ IS S P + Sbjct: 6 LIFIPTPTVGHLVPFLEFARRLIEQDDRIRITFLLMKQQ-GQSHLDSYVKTISSSLPFVR 64 Query: 196 FHRLPHTXLSTTT-TVSMAAKAFNFINQSLQLHQ 294 F +P T T S+ A ++FI ++ L Q Sbjct: 65 FIDVPELEEKPTLGTQSVEAYVYDFIETNVPLVQ 98 >gi|6103625|gb|AAF03693.1| (AF172095) unknown [Picea rubens] Length = 165 Frame 2 hits (HSPs): ________________________ __________________________________________________ Database sequence: | | | | | 165 0 50 100 150 Plus Strand HSPs: Score = 82 (28.9 bits), Expect = 0.024, P = 0.024 Identities = 26/68 (38%), Positives = 35/68 (51%), Frame = +2 Query: 74 LLSFSNYMLIQSQFFSPRVSLTTPLSTPTYTESPSLTLQS--SSTASPTHXSPPP----P 235 L F + Q+ SP S TTP PT T +P T + ++TA+P +PPP P Sbjct: 14 LAGFLVSSMAQAPSASPTKSPTTP--APTTTAAPPTTTAAPPTTTATPPVSTPPPVASPP 71 Query: 236 P*AWPPKPST 265 P A PP +T Sbjct: 72 PVATPPPVAT 81 Score = 75 (26.4 bits), Expect = 0.16, P = 0.15 Identities = 27/74 (36%), Positives = 32/74 (43%), Frame = +2 Query: 74 LLSFSNYMLIQSQFFSPRVSLTTPLST-----PTYTESPSLTLQSS--STASPTHXSPP- 229 L F + Q+ SP S TTP T PT T +P T + ST P PP Sbjct: 14 LAGFLVSSMAQAPSASPTKSPTTPAPTTTAAPPTTTAAPPTTTATPPVSTPPPVASPPPV 73 Query: 230 --PPP*AWPPKPST 265 PPP A PP +T Sbjct: 74 ATPPPVATPPPVAT 87 >gi|1079316|pir||A56554 transcription factor C/EBP - African clawed frog >gi|255567|gb|AAB23276.1| (S44193) CCAAT/enhancer core binding protein, C/EBP [Xenopus laevis, Peptide, 305 aa] Length = 305 Frame 3 hits (HSPs): _____________ Annotated Domains: ________________________________________________ __________________________________________________ Database sequence: | | | | | | || 305 0 50 100 150 200 250 300 __________________ Annotated Domains: DOMO DM05062: 8..154 DOMO DM00107: BZIPTRANSCRIPTIONFACTORSBASICDO 156..298 PROSITE LEUCINE_ZIPPER: Leucine zipper pattern. 264..285 PROSITE LEUCINE_ZIPPER: Leucine zipper pattern. 271..292 __________________ Plus Strand HSPs: Score = 87 (30.6 bits), Expect = 0.025, P = 0.025 Identities = 25/77 (32%), Positives = 32/77 (41%), Frame = +3 Query: 81 ASPITCSFNHNS-SHHGSP*PPLYRHLHTPNLHLSPFNLLPPPPP-----HXTLHHHHRE 242 AS + + H++ S H S H +HL + PPP P H HHHH Sbjct: 136 ASSLAALYPHHAASQHSSHLQYQVAHCAQTTMHLQSGHPTPPPTPVPSPHHHPAHHHHHH 195 Query: 243 HGRQSLQLHQPKPSTSS 293 SL+ P STSS Sbjct: 196 LQTSSLKGISPSSSTSS 212 >gi|322757|pir||PQ0476 pistil extensin-like protein (clone pMG08) - common tobacco (fragment) Length = 154 Frame 2 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | 154 0 50 100 150 Plus Strand HSPs: Score = 81 (28.5 bits), Expect = 0.026, P = 0.025 Identities = 22/47 (46%), Positives = 26/47 (55%), Frame = +2 Query: 113 FF--SPRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWPP 253 FF SP+ S ++P TP SPS Q S+ P SPPPPP PP Sbjct: 50 FFGKSPKKSPSSP--TPVNKPSPSPPPQVKSSLPPPAKSPPPPPAKSPP 96 >gi|7298314|gb|AAF53543.1| (AE003651) CG13260 gene product [Drosophila melanogaster] Length = 640 Frame 3 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | | 640 0 150 300 450 600 Plus Strand HSPs: Score = 91 (32.0 bits), Expect = 0.026, P = 0.026 Identities = 24/82 (29%), Positives = 35/82 (42%), Frame = +3 Query: 45 HRPHC-IHG*AC*ASPITCSFNHNSSHHGSP*PPLYRHLHTPNLHLSPFNLLPPPPPHXT 221 +R +C HG A P + H+ H P P + H H + H + Sbjct: 466 YRSYCPAHGPPAVAVP-NGAIGHHPHIHAHP-PHTHSHAHPTHSHHAQQQQQQQQQQQQQ 523 Query: 222 LHHHHREHGRQSLQLHQPKPSTS 290 LHHHH++HG+ L P P T+ Sbjct: 524 LHHHHQQHGQHQQHL-TPSPVTN 545 >gi|19923|emb|CAA78394.1| (Z14016) pistil extensin like protein, partial CDS [Nicotiana tabacum] Length = 155 Frame 2 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | 155 0 50 100 150 Plus Strand HSPs: Score = 81 (28.5 bits), Expect = 0.026, P = 0.026 Identities = 22/47 (46%), Positives = 26/47 (55%), Frame = +2 Query: 113 FF--SPRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWPP 253 FF SP+ S ++P TP SPS Q S+ P SPPPPP PP Sbjct: 50 FFGKSPKKSPSSP--TPVNKPSPSPPPQVKSSLPPPAKSPPPPPAKSPP 96 >gi|4314356|gb|AAD15567.1| (AC006340) putative anthocyanidin-3-glucoside rhamnosyltransferase [Arabidopsis thaliana] Length = 470 Frame 2 hits (HSPs): ___ Frame 1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | 470 0 150 300 450 Plus Strand HSPs: Score = 75 (26.4 bits), Expect = 0.029, Sum P(2) = 0.028 Identities = 21/75 (28%), Positives = 41/75 (54%), Frame = +1 Query: 22 VVLYPAPGIGHIVSMVELAKLLQLHAHSITILLTTGLLDH--P----SIDTYIHRISISH 183 VV++P GH+V +EL+KL+ H ++ + T +D P ++ + I+ + +S Sbjct: 16 VVMFPWLAFGHMVPYLELSKLIAQKGHKVSFISTPRNIDRLLPWLPENLSSVINFVKLSL 75 Query: 184 PSIFFHRLPHTXLSTT 231 P + ++LP +TT Sbjct: 76 P-VGDNKLPEDGEATT 90 Score = 34 (12.0 bits), Expect = 0.029, Sum P(2) = 0.028 Identities = 7/15 (46%), Positives = 7/15 (46%), Frame = +2 Query: 215 HXSPPPPP*AWPPKP 259 H P P PPKP Sbjct: 244 HRKPVIPVGVLPPKP 258 >gi|10567858|gb|AAG18592.1|AC067971_33 (AC067971) Contains similarity to an unknown flavonol 3-o-glucosyltransferase At2g29750 gi|3582329 from Arabidopsis thaliana BAC T27A16 gb|AC005496. It contains a UDP-glucoronosyl and UDP-glucosyl transferases domain PF|00201. ESTs gb|AI993795, gb|N9> Length = 479 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | 479 0 150 300 450 Plus Strand HSPs: Score = 89 (31.3 bits), Expect = 0.030, P = 0.029 Identities = 26/68 (38%), Positives = 37/68 (54%), Frame = +1 Query: 13 EDTVVLYPAPGIGHIVSMVELAK-LLQL-HA-HSITIL-LTTGLLDHPSIDTYIHRISIS 180 E ++ P P GHI+ +E AK L+ L H H+ITIL L++ H S+ + + S Sbjct: 4 ETELIFIPVPSTGHILVHIEFAKRLINLDHRIHTITILNLSSPSSPHASV--FARSLIAS 61 Query: 181 HPSIFFHRLP 210 P I H LP Sbjct: 62 QPKIRLHDLP 71 >gi|9665137|gb|AAF97321.1|AC023628_2 (AC023628) Similar to UTP-glucose glucosyltransferases [Arabidopsis thaliana] Length = 481 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | 481 0 150 300 450 Plus Strand HSPs: Score = 89 (31.3 bits), Expect = 0.030, P = 0.029 Identities = 23/68 (33%), Positives = 38/68 (55%), Frame = +1 Query: 22 VVLYPAPGIGHIVSMVELAK-LLQLHAHSITILLTTGLLDHPSIDTYIHRISISHPSIFF 198 V + P+PGIGH++ +VELAK LL H ++T ++ + + ++ + S S+F Sbjct: 9 VAIIPSPGIGHLIPLVELAKRLLDNHGFTVTFIIPGDSPPSKAQRSVLNSLPSSIASVF- 67 Query: 199 HRLPHTXLS 225 LP LS Sbjct: 68 --LPPADLS 74 >gi|8778962|gb|AAD49768.2|AC007932_16 (AC007932) F11A17.16 [Arabidopsis thaliana] Length = 558 Frame 3 hits (HSPs): __ Frame 2 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | | 558 0 150 300 450 Plus Strand HSPs: Score = 77 (27.1 bits), Expect = 0.034, Sum P(2) = 0.033 Identities = 24/55 (43%), Positives = 31/55 (56%), Frame = +2 Query: 119 SP-RVSLTTPLSTPTYTESPSLTL-----QSSSTASPTHXSPPPPP*AWPPKPSTSSTKA 280 SP R+ T PL P + SP+ +L SS A PT PPPPP PP+P + +A Sbjct: 225 SPSRLPPTPPL--PKFLVSPASSLGKRDENSSPFAPPTPPPPPPPP---PPRPLAKAARA 279 Score = 33 (11.6 bits), Expect = 0.034, Sum P(2) = 0.033 Identities = 6/9 (66%), Positives = 7/9 (77%), Frame = +3 Query: 18 HSSVVSSPR 44 H SV+S PR Sbjct: 20 HYSVISKPR 28 >gi|5815436|gb|AAD52672.1|AF178772_1 (AF178772) 98kDa HDM allergen [Dermatophagoides farinae] Length = 555 Frame 2 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | 555 0 150 300 450 Plus Strand HSPs: Score = 89 (31.3 bits), Expect = 0.036, P = 0.035 Identities = 20/51 (39%), Positives = 27/51 (52%), Frame = +2 Query: 119 SPRVSLTTPL-STPTYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKPSTS 268 SP TTP +TPT T SP+ + S +PT +P P P P P+T+ Sbjct: 437 SPTTPTTTPSPTTPTTTPSPTTPTTTPSPTTPTPTTPTPAPTTSTPSPTTT 487 Score = 75 (26.4 bits), Expect = 1.1, P = 0.68 Identities = 20/54 (37%), Positives = 28/54 (51%), Frame = +2 Query: 119 SPRVSLTTPL-STPTYTESPSLTLQSSSTASPTHXSPP--PPP*AWPPKPSTSS 271 +P + TTP +TPT T SP+ + S +PT P P P P P+TS+ Sbjct: 428 TPTTTPTTPSPTTPTTTPSPTTPTTTPSPTTPTTTPSPTTPTPTTPTPAPTTST 481 Score = 74 (26.0 bits), Expect = 1.5, P = 0.77 Identities = 18/49 (36%), Positives = 25/49 (51%), Frame = +2 Query: 119 SPRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKPST 265 +P + TTP +TPT T SP+ + S +PT P P P P+T Sbjct: 421 TPTTTPTTPTTTPT-TPSPTTPTTTPSPTTPTTTPSPTTP-TTTPSPTT 467 >gi|544262|sp|Q03211|EXLP_TOBAC PISTIL-SPECIFIC EXTENSIN-LIKE PROTEIN PRECURSOR (PELP) >gi|322761|pir||JQ1696 pistil extensin-like protein precursor (clone pMG15) - common tobacco >gi|19929|emb|CAA78397.1| (Z14019) pistil extensin like protein [Nicotiana tabacum] Length = 426 Frame 3 hits (HSPs): __ Frame 2 hits (HSPs): ______ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | 426 0 150 300 __________________ Annotated Domains: DOMO DM05812: 1..91 DOMO DM00698: 130..231 DOMO DM04077: 233..387 Entrez Domain: 4 X 5 AA REPEATS OF S-P(4). 69..182 Entrez Repetitive region: 1. 69..73 Entrez Repetitive region: 2. 76..80 Entrez Repetitive region: 3. 83..87 Entrez Repetitive region: 4. 178..182 Entrez glycosylation site: POTENTIAL. 310 PFAM Pollen_Ole_e_I: Pollen proteins Ole e I 291..425 PRINTS PSTLEXTENSIN1: Pistil-specific extensin 36..47 PRINTS PSTLEXTENSIN2: Pistil-specific extensin 237..260 PRINTS PSTLEXTENSIN3: Pistil-specific extensin 271..289 PRINTS PSTLEXTENSIN4: Pistil-specific extensin 300..324 PRINTS PSTLEXTENSIN5: Pistil-specific extensin 325..346 PRINTS PSTLEXTENSIN6: Pistil-specific extensin 352..369 PRINTS PSTLEXTENSIN7: Pistil-specific extensin 378..399 PRINTS PSTLEXTENSIN8: Pistil-specific extensin 406..426 PRODOM PD027171: EXLP(1) Q40385(1) 1..46 PRODOM PD026483: EXLP_TOBAC 96..137 PRODOM PD000540: H1(18) O76786(11) TONB(10) 148..279 PRODOM PD007363: 287..419 __________________ Plus Strand HSPs: Score = 73 (25.7 bits), Expect = 0.036, Sum P(2) = 0.036 Identities = 18/48 (37%), Positives = 22/48 (45%), Frame = +2 Query: 122 PRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKPST 265 P + +P PT+ P L S SP+ PPPP PP PST Sbjct: 46 PLPDIPSPFDGPTFVLPPPSPLPSPPPPSPS----PPPPSPSPPPPST 89 Score = 34 (12.0 bits), Expect = 0.036, Sum P(2) = 0.036 Identities = 6/10 (60%), Positives = 9/10 (90%), Frame = +3 Query: 264 LHQPKPSTSS 293 ++QPKPS+ S Sbjct: 134 VNQPKPSSPS 143 >gi|7299505|gb|AAF54693.1| (AE003692) CG6923 gene product [Drosophila melanogaster] Length = 1256 Frame 2 hits (HSPs): __ __ __________________________________________________ Database sequence: | | | | 1256 0 500 1000 Plus Strand HSPs: Score = 78 (27.5 bits), Expect = 0.049, Sum P(2) = 0.048 Identities = 17/37 (45%), Positives = 21/37 (56%), Frame = +2 Query: 179 LTLQSSSTASPTHXSPPPPP*AWPP--KPSTSSTKAF 283 L + SS P H +PPPPP PP P SST+A+ Sbjct: 716 LPINESSMWMPGHPAPPPPPPPMPPILLPDNSSTEAY 752 Score = 38 (13.4 bits), Expect = 0.049, Sum P(2) = 0.048 Identities = 11/25 (44%), Positives = 14/25 (56%), Frame = +2 Query: 92 YMLIQSQFFSPRVSLTTPLSTPTYT 166 YML+ + F S S + S PTYT Sbjct: 50 YMLM-TPFLSSSQSHSHTRSPPTYT 73 >gi|11358911|pir||T48374 UDPG glucosyltransferase-like protein - Arabidopsis thaliana >gi|7378633|emb|CAB83309.1| (AL162751) UDPG glucosyltransferase-like protein [Arabidopsis thaliana] Length = 465 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | 465 0 150 300 450 Plus Strand HSPs: Score = 86 (30.3 bits), Expect = 0.060, P = 0.058 Identities = 20/68 (29%), Positives = 38/68 (55%), Frame = +1 Query: 22 VVLYPAPGIGHIVSMVELAKLLQLHAHSITILLTTGLLDHPSIDTYIHRISISHPSIFFH 201 +V++P P GH++ +++L L L ++++++T G L + S H S++ S+ F Sbjct: 20 IVVFPFPAQGHLLPLLDLTHQLCLRGFNVSVIVTPGNLTYLSPLLSAHPSSVT--SVVFP 77 Query: 202 RLPHTXLS 225 PH LS Sbjct: 78 FPPHPSLS 85 >gi|7508913|pir||T33997 hypothetical protein W03G1.5 - Caenorhabditis elegans >gi|4262637|gb|AAD14753.1| (AF125964) contains similarity to collagens [Caenorhabditis elegans] Length = 471 Frame 3 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | 471 0 150 300 450 Plus Strand HSPs: Score = 86 (30.3 bits), Expect = 0.061, P = 0.059 Identities = 23/54 (42%), Positives = 28/54 (51%), Frame = +3 Query: 105 NHNSSHHGS-P*PPLYRHLHTPNLHLSPF-NLLPPPPPHXTLH------HHHREHGRQS 257 +H+ HHG P PP + H H P PF PPPPP H HHH +G +S Sbjct: 415 HHHHHHHGCRPFPPHHGHHHFP-----PFWPPCPPPPPFWPPHRRGGHCHHHHHNGHRS 468 >gi|2133677|pir||S62335 I71-7 protein - fruit fly (Drosophila melanogaster) >gi|775239|gb|AAA65115.1| (U24246) I71-7 [Drosophila melanogaster] >gi|940000|gb|AAA74179.1| (U23836) I71-7 gene product [Drosophila melanogaster] >gi|1587054|prf||2205331D L71-7 gene [Drosophila melanogaster] Length = 393 Frame 3 hits (HSPs): ____ Frame 2 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | 393 0 150 300 Plus Strand HSPs: Score = 70 (24.6 bits), Expect = 0.061, Sum P(2) = 0.060 Identities = 22/56 (39%), Positives = 29/56 (51%), Frame = +2 Query: 122 PRVSLTTPLSTP--TYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKP-STSSTKAFN 286 P +TT S P T TES + ++S+ST T PP PP P ST +T+ N Sbjct: 310 PVTDITTQKSNPPCTCTESTTQKIKSTSTTQGTE--PPSTQKTLPPNPPSTKNTEPPN 365 Score = 34 (12.0 bits), Expect = 0.061, Sum P(2) = 0.060 Identities = 9/20 (45%), Positives = 10/20 (50%), Frame = +3 Query: 6 TR-RRHSSVVSSPRHRPHCI 62 TR RRH + PR CI Sbjct: 79 TRERRHHTKTRKPRKPVPCI 98 >gi|478673|pir||S23737 proline-rich protein precursor - kidney bean >gi|21046|emb|CAA42942.1| (X60391) proline-rich protein [Phaseolus vulgaris] Length = 297 Frame 3 hits (HSPs): _________ Annotated Domains: ______________________________________ __________________________________________________ Database sequence: | | | | | | | 297 0 50 100 150 200 250 __________________ Annotated Domains: DOMO DM04077: 54..270 __________________ Plus Strand HSPs: Score = 83 (29.2 bits), Expect = 0.066, P = 0.064 Identities = 21/56 (37%), Positives = 23/56 (41%), Frame = +3 Query: 108 HNSSHHGSP*PPLYRHLHTPNLHLSPFNLLPPPPPHXTLHHHHREHGRQSLQLHQP 275 H HH P P LH P+ P PP P H HHHH H S +H P Sbjct: 44 HPHHHHHHPPAPAPAPLHPPSPPSHPH--YPPAPAHPPTHHHHH-H--PSAPVHPP 94 >gi|1911629|gb|AAB50771.1| (S83377) huntingtin protein {poly Glu and poly Pro region} [gorilla, white blood cells, Peptide Partial, 33 aa] [Gorilla gorilla] Length = 33 Frame 3 hits (HSPs): ___________ Frame 2 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | 33 0 20 Plus Strand HSPs: Score = 46 (16.2 bits), Expect = 0.078, Sum P(2) = 0.075 Identities = 8/13 (61%), Positives = 9/13 (69%), Frame = +2 Query: 224 PPPPP*AWPPKPS 262 PPPPP PP P+ Sbjct: 20 PPPPPPPPPPGPA 32 Score = 39 (13.7 bits), Expect = 0.078, Sum P(2) = 0.075 Identities = 6/7 (85%), Positives = 6/7 (85%), Frame = +3 Query: 195 LPPPPPH 215 LP PPPH Sbjct: 2 LPQPPPH 8 >gi|7292503|gb|AAF47906.1| (AE003481) CG15023 gene product [Drosophila melanogaster] Length = 172 Frame 2 hits (HSPs): _______________________ __________________________________________________ Database sequence: | | | | | 172 0 50 100 150 Plus Strand HSPs: Score = 78 (27.5 bits), Expect = 0.080, P = 0.077 Identities = 21/49 (42%), Positives = 29/49 (59%), Frame = +2 Query: 137 TTPLSTP--TYTESPS-LTLQSSSTASPTHXS---PPPPP*AWPPKPSTSST 274 TTP TP TY P T +++T +PT PPPPP PP+ +T++T Sbjct: 86 TTPAPTPAPTYLPPPPPTTTTTTTTPAPTPAPTYLPPPPP---PPRTTTTTT 134 Score = 73 (25.7 bits), Expect = 0.30, P = 0.26 Identities = 17/51 (33%), Positives = 25/51 (49%), Frame = +2 Query: 122 PRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKPSTSST 274 P + P PTY P T +++T +P +P P P PP P T++T Sbjct: 59 PPPTYLPPKPVPTYLPPPPPTTTTTTTTTP---APTPAPTYLPPPPPTTTT 106 >gi|7294245|gb|AAF49596.1| (AE003530) Eig71Ee gene product [Drosophila melanogaster] Length = 445 Frame 3 hits (HSPs): ___ Frame 2 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | 445 0 150 300 Plus Strand HSPs: Score = 70 (24.6 bits), Expect = 0.083, Sum P(2) = 0.080 Identities = 22/56 (39%), Positives = 29/56 (51%), Frame = +2 Query: 122 PRVSLTTPLSTP--TYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKP-STSSTKAFN 286 P +TT S P T TES + ++S+ST T PP PP P ST +T+ N Sbjct: 242 PVTDITTQKSNPPCTCTESTTQKIKSTSTTQGTE--PPSTQKTLPPNPPSTKNTEPPN 297 Score = 34 (12.0 bits), Expect = 0.083, Sum P(2) = 0.080 Identities = 9/20 (45%), Positives = 10/20 (50%), Frame = +3 Query: 6 TR-RRHSSVVSSPRHRPHCI 62 TR RRH + PR CI Sbjct: 79 TRERRHHTKTRKPRKPVPCI 98 >gi|12408113|gb|AAG53694.1|AF327708_1 (AF327708) retinoic acid receptor-gamma [Salmo salar] Length = 86 Frame 2 hits (HSPs): ___________________________________ __________________________________________________ Database sequence: | | | | | | 86 0 20 40 60 80 Plus Strand HSPs: Score = 75 (26.4 bits), Expect = 0.084, P = 0.080 Identities = 24/66 (36%), Positives = 34/66 (51%), Frame = +2 Query: 56 LYPWLSLLSFSNYMLIQSQFFSPRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPP 235 LY SL + +Y L Q++ S +++L T LS T+S S S P+ SPPPP Sbjct: 3 LYHKGSLTTLFDYKLKQNKI-SKQINLFTLLSVAVETQSTS-----SEEMVPSSPSPPPP 56 Query: 236 P*AWPP 253 P + P Sbjct: 57 PRIYKP 62 >gi|228938|prf||1814452C Hyp-rich glycoprotein [Zea diploperennis] Length = 349 Frame 2 hits (HSPs): _____________ ________ __________ __________________________________________________ Database sequence: | | | | 349 0 150 300 Plus Strand HSPs: Score = 83 (29.2 bits), Expect = 0.084, P = 0.081 Identities = 21/54 (38%), Positives = 27/54 (50%), Frame = +2 Query: 116 FSPRVSLTTPLSTP-TYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKPSTSSTK 277 ++P TP TP TYT SP T + T SP +P P P + P P +TK Sbjct: 169 YTPSPKPPTPKPTPPTYTPSPKPT-PPTYTPSPKPPTPKPTPPTYTPSPKPPATK 222 Score = 74 (26.0 bits), Expect = 0.80, P = 0.55 Identities = 20/56 (35%), Positives = 26/56 (46%), Frame = +2 Query: 119 SPRVSLTTPLSTPTYTESPSLTLQSSS----TASPTHXSPPPPP*AWPPKPSTSSTK 277 SP+ T P + PTYT SP + T SP +P P P + P P +TK Sbjct: 252 SPKPPATKP-TPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATK 307 Score = 73 (25.7 bits), Expect = 1.0, P = 0.64 Identities = 19/51 (37%), Positives = 25/51 (49%), Frame = +2 Query: 122 PRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKPSTSST 274 P+ TP PTYT SP T + T +PT +P P P + P P+ T Sbjct: 74 PKEHKPTP---PTYTPSPKPT-PPTYTPTPTPPTPKPTPPTYTPAPTPKPT 120 Score = 71 (25.0 bits), Expect = 1.7, P = 0.81 Identities = 18/48 (37%), Positives = 22/48 (45%), Frame = +2 Query: 116 FSPRVSLTTPLSTP-TYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKP 259 ++P TP TP TYT SP + PT+ P PP PP P Sbjct: 265 YTPSPKPPTPKPTPPTYTPSPKPP--TPKPTPPTYTPSPKPPATKPPTP 311 Score = 69 (24.3 bits), Expect = 5.5, P = 1.0 Identities = 17/47 (36%), Positives = 22/47 (46%), Frame = +2 Query: 140 TPLSTPTYTESPSLT-LQSSSTASPTHXSPPPPP*AWPPKPSTSSTK 277 TP TP T +P T + T SP +P P P + P P +TK Sbjct: 112 TPAPTPKPTPTPKPTPTPPTYTPSPKPPTPKPTPPTYAPSPKPPATK 158 >gi|283032|pir||S22456 hydroxyproline-rich glycoprotein - perennial teosinte >gi|22092|emb|CAA45514.1| (X64173) hydroxyproline-rich glycoprotein [Zea diploperennis] Length = 350 Frame 2 hits (HSPs): _______ ________ _________ __________________________________________________ Database sequence: | | | | 350 0 150 300 Plus Strand HSPs: Score = 83 (29.2 bits), Expect = 0.085, P = 0.081 Identities = 21/54 (38%), Positives = 27/54 (50%), Frame = +2 Query: 116 FSPRVSLTTPLSTP-TYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKPSTSSTK 277 ++P TP TP TYT SP T + T SP +P P P + P P +TK Sbjct: 170 YTPSPKPPTPKPTPPTYTPSPKPT-PPTYTPSPKPPTPKPTPPTYTPSPKPPATK 223 Score = 74 (26.0 bits), Expect = 0.80, P = 0.55 Identities = 20/56 (35%), Positives = 26/56 (46%), Frame = +2 Query: 119 SPRVSLTTPLSTPTYTESPSLTLQSSS----TASPTHXSPPPPP*AWPPKPSTSSTK 277 SP+ T P + PTYT SP + T SP +P P P + P P +TK Sbjct: 253 SPKPPATKP-TPPTYTPSPKPPTPKPTPPTYTPSPKPPTPKPTPPTYTPSPKPPATK 308 Score = 71 (25.0 bits), Expect = 1.7, P = 0.82 Identities = 18/48 (37%), Positives = 22/48 (45%), Frame = +2 Query: 116 FSPRVSLTTPLSTP-TYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKP 259 ++P TP TP TYT SP + PT+ P PP PP P Sbjct: 266 YTPSPKPPTPKPTPPTYTPSPKPP--TPKPTPPTYTPSPKPPATKPPTP 312 Score = 70 (24.6 bits), Expect = 2.2, P = 0.89 Identities = 18/47 (38%), Positives = 24/47 (51%), Frame = +2 Query: 122 PRVSLTTPLSTPTYTESPSLTLQSSSTASPTHXSPPPPP*AWPPKPS 262 P+ TP PTYT SP T + T +PT +P P P + P P+ Sbjct: 74 PKEHKPTP---PTYTPSPKPT-PPTYTPTPTPPTPKPTPPTYTPAPT 116 WARNING: HSPs involving 530 database sequences were not reported due to the limiting value of parameter B = 50. Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.94 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.335 0.144 0.529 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.329 0.135 0.455 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.324 0.136 0.395 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.356 0.166 0.749 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.334 0.150 0.509 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.355 0.151 0.578 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 97 95 10. 66 3 12 22 0.099 32 28 0.12 33 +2 0 98 96 10. 66 3 12 22 0.10 32 28 0.095 34 +1 0 98 96 10. 66 3 12 22 0.10 32 28 0.095 34 -1 0 98 97 10. 66 3 12 22 0.10 32 28 0.097 34 -2 0 98 96 10. 66 3 12 22 0.10 32 28 0.095 34 -3 0 97 96 10. 66 3 12 22 0.10 32 28 0.095 34 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 580 No. of states in DFA: 584 (58 KB) Total size of DFA: 158 KB (192 KB) Time to generate neighborhood: 0.00u 0.01s 0.01t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 138.99u 1.07s 140.06t Elapsed: 00:00:25 Total cpu time: 139.04u 1.11s 140.15t Elapsed: 00:00:26 Start: Fri Feb 1 18:52:56 2002 End: Fri Feb 1 18:53:22 2002 WARNINGS ISSUED: 2
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000