WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1.
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84.
RepeatMasker repeats found in sequence:
No Repeats Found.

Reference: Gish, Warren (1994-1997). unpublished.
Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.

Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
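Each hit in this report carries both an Expect value and a P value. The sketch below assumes the usual Poisson relation between the two, P = 1 - exp(-E); the report does not state this, but it matches the pairs it prints (for example Expect = 0.073 with P = 0.070). The function names are illustrative only.

```python
import math


def p_from_expect(expect: float) -> float:
    """P value under the assumed Poisson relation P = 1 - exp(-E)."""
    # expm1 keeps precision when E is tiny (e.g. 5.3e-25).
    return -math.expm1(-expect)


def expect_from_p(p_value: float) -> float:
    """Inverse of the assumed relation: E = -ln(1 - P)."""
    return -math.log1p(-p_value)


if __name__ == "__main__":
    for e in (5.3e-25, 0.073, 0.22, 1.0):
        print(f"Expect = {e:<8g} P = {p_from_expect(e):.2g}")
```

For very small values the two numbers coincide, which is why the strongest hits show identical Expect and P; they diverge only as Expect approaches 1.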
Query= A01G20_CONSENSUS (515 letters)
Translating both strands of query sequence in all 6 reading frames

Database: nr
625,274 sequences; 197,782,623 total letters.

Observed Numbers of Database Sequences Satisfying
Various EXPECTation Thresholds (E parameter values)

Histogram units: = 14 Sequences : less than 14 sequences

EXPECTation Threshold
(E parameter)
   |
   V   Observed Counts-->
  10000  4263  817 |==========================================================
   6310  3446  621 |============================================
   3980  2825  588 |==========================================
   2510  2237  524 |=====================================
   1580  1713  436 |===============================
   1000  1277  314 |======================
    631   963  218 |===============
    398   745  161 |===========
    251   584  145 |==========
    158   439  146 |==========
    100   293   72 |=====
   63.1   221   66 |====
   39.8   155   27 |=
   25.1   128   43 |===
   15.8    85   20 |=
 >>>>>>>>>>>>>>>>>>>>>  Expect = 10.0, Observed = 65  <<<<<<<<<<<<<<<<<
   10.0    65   14 |=
   6.31    51   13 |:
   3.98    38    6 |:
   2.51    32    3 |:
   1.58    29    4 |:
   1.00    25    3 |:
   0.63    22    2 |:
   0.40    20    1 |:
   0.25    19    1 |:
   0.16    18    0 |
   0.10    18    1 |:
  0.063    17    0 |
  0.040    17    1 |:

                                                        Reading  High  Smallest Sum Probability
Sequences producing High-scoring Segment Pairs:           Frame Score  P(N)     N

gi|9294018|dbj|BAB01921.1| (AP001307) gb|AAC14054.1~ge...    +2   294  5.3e-25  1
gi|9294017|dbj|BAB01920.1| (AP001307) gb|AAC14054.1~ge...    +2   282  9.8e-24  1
gi|12323168|gb|AAG51564.1|AC027034_10 (AC027034) hypot...    +2   275  5.4e-23  1
gi|9802742|gb|AAF99811.1|AC034257_3 (AC034257) Unknown...    +2   273  8.8e-23  1
gi|7485990|pir||T00733 hypothetical protein F22O13.28 ...    +2   228  5.2e-18  1
gi|11281603|pir||T47728 hypothetical protein F18O21.70...    +2   210  4.2e-16  1
gi|7487157|pir||T02509 hypothetical protein T19C21.15 ...    +2   208  6.8e-16  1
gi|4586054|gb|AAD25672.1|AC007020_14 (AC007020) unknow...    +2   207  8.7e-16  1
gi|10176750|dbj|BAB09981.1| (AB010692) emb|CAB87410.1~...    +2   207  8.7e-16  1
gi|7487097|pir||T09929 hypothetical protein T16L4.170 ...    +2   192  3.4e-14  1
gi|1903367|gb|AAB70450.1| (AC000104) ESTs gb|N65789,gb...    +2   190  5.5e-14  1
gi|11281601|pir||T48185 hypothetical protein F7A7.160 ...    +2   187  1.1e-13  1
gi|11281602|pir||T48482 hypothetical protein T28J14.50...    +2   185  1.9e-13  1
gi|9758750|dbj|BAB09114.1| (AB023029) gb|AAC83072.1~ge...    +2   170  7.3e-12  1
gi|8778486|gb|AAF79494.1|AC002328_2 (AC002328) F20N2.7...    +2   162  5.1e-11  1
gi|12597896|gb|AAG60204.1|AC084763_24 (AC084763) hypot...    +2   115  0.00011  1
gi|7504006|pir||T16439 hypothetical protein F53A9.8 - ...    -2    82  0.026    1
gi|8163873|gb|AAF73890.1|AF223972_1 (AF223972) non-cla...    -2    91  0.070    1
gi|3064158|gb|AAC14225.1| (AF036414) mucin-like protei...    -3    74  0.20     1
gi|10175381|dbj|BAB06479.1| (AP001516) BH2760~unknown ...    -2    93  0.29     1
gi|12721246|gb|AAK03010.1| (AE006132) FimA [Pasteurell...    -2    92  0.35     1
gi|6634477|emb|CAB64450.1| (AJ251974) hydrophilic acyl...    -2    86  0.42     1
gi|1082603|pir||S53365 mucin 5AC (clone CEL2) - human ...    -3    69  0.57     1
gi|2352447|gb|AAC72445.1| (AF004379) orf81 [Streptococ...    +3    69  0.57     1
gi|419748|pir||S31097 cold acclimation protein - spina...    -2    91  0.62     1
gi|7295863|gb|AAF51163.1| (AE003581) CG17264 gene prod...    -3    91  0.74     1
gi|9634286 ref|NP_037825.1| ORF65 p6.9 DNA binding pro...    -2    67  0.76     1
gi|7484702|pir||T10738 hypothetical protein FbLate-2 -...    -2    87  0.78     1
gi|6324615 ref|NP_014685.1| Yor042wp [Saccharomyces ce...    -2    88  0.79     1
gi|7487086|pir||T04991 hypothetical protein T16L1.230 ...    -2    84  0.83     1
gi|6177896|dbj|BAA86073.1| (AB020482) heme-copper oxid...    +2    72  0.85     1
gi|1361323|pir||D53203 hypothetical protein 4 - Desulf...    -1    65  0.91     1
gi|1175147|sp|P44526|ZNUA_HAEIN HIGH-AFFINITY ZINC UPT...    -2    85  0.93     1
gi|903866|gb|AAA74210.1| (L43851) surface antigen [Pne...    -2    84  0.94     1
gi|6651243|gb|AAF22235.1|AF152557_1 (AF152557) histidi...    -2    84  0.94     1
gi|12239366|gb|AAG49447.1|AF141344_1 (AF141344) LYST-i...    +3    75  0.95     1
gi|70792|pir||GACH protamine - chicken >gi|229596|prf|...    +1    64  0.95     1
gi|7509339|pir||T26452 hypothetical protein Y113G7C.1 ...    -2    91  0.96     1
gi|11357492|pir||T48398 hypothetical protein F17C15.13...    -2    63  0.98     1
gi|2134214|pir||C58213 protamine II - American alligator     +1    63  0.98     1
gi|4504487 ref|NP_002143.1| histidine-rich calcium-bin...    -2    84  0.99     2
gi|786117|gb|AAA98076.1| (L41834) nuclear protein [Ens...    -2    85  0.990    1
gi|9845306 ref|NP_064120.1| pr4.1 [rat cytomegalovirus...    -3    81  0.992    1
gi|462338|sp|P35305|HSP1_DIDMA SPERM PROTAMINE P1 >gi|...    +1    62  0.994    1
gi|10801034|emb|CAC12965.1| (AJ250175) putative metall...    +3    62  0.994    1
gi|730133|sp|P40139|NG2_DROME NEW-GLUE PROTEIN 2 PRECU...    -3    72  0.995    1
gi|3242350|emb|CAA19671.1| (AL024484) EG:96G10.4 [Dros...    -3    72  0.995    1
gi|6018876|emb|CAB58070.1| (AL121805) BACN4L24.a [Dros...    -3    72  0.995    1
gi|11424081 ref|XP_008915.1| histidine-rich calcium-bi...    -2    84  0.995    2
gi|85349|pir||PS0274 homeotic protein box6 - sea urchi...    -2    55  0.997    2

Locally-aligned regions (HSPs) with respect to query sequence:
[Graphical map of HSP positions along the translated query, scale 0 to 172 residues. Loci shown per reading frame:]

Frame 3 Hits: gi|2352447, gi|12239366, gi|10801034
Frame 2 Hits: gi|9294018, gi|9294017, gi|12323168, gi|9802742, gi|7485990, gi|11281603, gi|7487157, gi|4586054, gi|10176750, gi|7487097, gi|1903367, gi|11281601, gi|11281602, gi|9758750, gi|8778486, gi|12597896, gi|6177896
Frame 1 Hits: gi|70792, gi|2134214, gi|462338
Frame -1 Hits: gi|1361323, gi|4504487, gi|11424081, gi|85349
Frame -2 Hits: gi|7504006, gi|8163873, gi|10175381, gi|12721246, gi|6634477, gi|419748, gi|9634286, gi|7484702, gi|6324615, gi|7487086, gi|1175147, gi|903866, gi|6651243, gi|7509339, gi|11357492, gi|4504487, gi|786117, gi|11424081, gi|85349
Frame -3 Hits: gi|3064158, gi|1082603, gi|7295863, gi|9845306, gi|730133, gi|3242350, gi|6018876
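The search translates the 515-letter query in all six reading frames and groups hits by frame (+1 to +3, -1 to -3). As a minimal sketch of what that means, the code below translates a DNA string in six frames with the standard genetic code and maps a codon index back to a 1-based query coordinate. It assumes that frame -1 starts at the last base of the query, which is consistent with the coordinates printed in this report; the function names are hypothetical and this is not the program's own code.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate complete codons; codons with ambiguous bases become 'X'."""
    dna = dna.upper()
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))


def six_frames(dna: str) -> dict:
    """Return {frame: protein} for frames +1, +2, +3, -1, -2, -3."""
    rc = reverse_complement(dna)
    frames = {}
    for offset in range(3):
        frames[offset + 1] = translate(dna[offset:])
        frames[-(offset + 1)] = translate(rc[offset:])
    return frames


def codon_start(frame: int, index: int, query_len: int) -> int:
    """1-based query coordinate of the first base of codon `index` (0-based)."""
    offset = abs(frame) - 1
    if frame > 0:
        return 1 + offset + 3 * index
    return query_len - offset - 3 * index


if __name__ == "__main__":
    for frame, protein in sorted(six_frames("ATGGCCTAA").items()):
        print(f"{frame:+d} {protein}")
```

With a query length of 515, `codon_start(2, 0, 515)` gives 2 and `codon_start(-2, 99, 515)` gives 217, matching the query coordinates at which the frame +2 and frame -2 alignments in this report begin.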
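Each HSP in the alignment records that follow is introduced by a "Score = ..., Expect = ..., P = ..." line and an "Identities = ..., Positives = ..., Frame = ..." line. The sketch below shows one way to pull those statistics out of the text with a regular expression. It is an illustration only: the class and function names are hypothetical, and the pattern handles only the single-HSP form seen in this section, not other variants a report may contain.

```python
import re
from typing import List, NamedTuple


class HspStats(NamedTuple):
    score: int
    bits: float
    expect: float
    p_value: float
    identities: int
    positives: int
    align_len: int
    frame: int


# \s+ between the two halves tolerates either a space or a line break.
HSP_RE = re.compile(
    r"Score = (\d+) \(([\d.]+) bits\), Expect = ([\d.eE+-]+), P = ([\d.eE+-]+)\s+"
    r"Identities = (\d+)/(\d+) \(\d+%\), Positives = (\d+)/\d+ \(\d+%\), "
    r"Frame = ([+-]\d)"
)


def parse_hsps(text: str) -> List[HspStats]:
    """Collect the statistics of every HSP header found in `text`."""
    hsps = []
    for m in HSP_RE.finditer(text):
        score, bits, expect, p_value, ident, length, pos, frame = m.groups()
        hsps.append(HspStats(int(score), float(bits), float(expect),
                             float(p_value), int(ident), int(pos),
                             int(length), int(frame)))
    return hsps


if __name__ == "__main__":
    sample = ("Score = 294 (103.5 bits), Expect = 5.3e-25, P = 5.3e-25 "
              "Identities = 59/91 (64%), Positives = 71/91 (78%), Frame = +2")
    for hsp in parse_hsps(sample):
        print(hsp, f"identity = {100 * hsp.identities / hsp.align_len:.1f}%")
```

Keeping the raw counts (59 of 91) rather than the printed percentage lets a caller recompute identity at whatever precision it needs.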
WARNING: Descriptions of 15 database sequences were not reported due to the limiting value of parameter V = 50. >gi|9294018|dbj|BAB01921.1| (AP001307) gb|AAC14054.1~gene_id:MMM17.14~similar to unknown protein [Arabidopsis thaliana] Length = 188 Frame 2 hits (HSPs): _________________________ __________________________________________________ Database sequence: | | | | | 188 0 50 100 150 Plus Strand HSPs: Score = 294 (103.5 bits), Expect = 5.3e-25, P = 5.3e-25 Identities = 59/91 (64%), Positives = 71/91 (78%), Frame = +2 Query: 2 WHPISLIVFVVLMAAWLFLYFLRDEPLIIFGRLISDRVVLIVMAVLTVVLLLLTGAIGNI 181 WHP SLIVF VL+ W+FLYFLRDEP+ +F I DR VLIV++VLTVVLLLLT A NI Sbjct: 86 WHPTSLIVFTVLVVVWIFLYFLRDEPIKLFRFQIDDRTVLIVLSVLTVVLLLLTNATFNI 145 Query: 182 LVALLIGAVLVVAHAALRKTDDLFFDEXEAT 274 + AL+ GAVLV+ H+ +RKT+DLF DE AT Sbjct: 146 VGALVTGAVLVLIHSVVRKTEDLFLDEEAAT 176 >gi|9294017|dbj|BAB01920.1| (AP001307) gb|AAC14054.1~gene_id:MMM17.13~similar to unknown protein [Arabidopsis thaliana] Length = 188 Frame 2 hits (HSPs): _________________________ __________________________________________________ Database sequence: | | | | | 188 0 50 100 150 Plus Strand HSPs: Score = 282 (99.3 bits), Expect = 9.8e-24, P = 9.8e-24 Identities = 57/91 (62%), Positives = 68/91 (74%), Frame = +2 Query: 2 WHPISLIVFVVLMAAWLFLYFLRDEPLIIFGRLISDRVVLIVMAVLTVVLLLLTGAIGNI 181 WHP SLIVF L+ W+FLYFLRD PL +F I DR VLI ++V+T+VLLLLT A NI Sbjct: 86 WHPTSLIVFTGLVFLWIFLYFLRDVPLKVFRFQIDDRAVLIGLSVITIVLLLLTNATFNI 145 Query: 182 LVALLIGAVLVVAHAALRKTDDLFFDEXEAT 274 + AL+ GAVLV+ HA +RKTDDLF DE AT Sbjct: 146 VAALMAGAVLVLIHAVIRKTDDLFLDEEAAT 176 >gi|12323168|gb|AAG51564.1|AC027034_10 (AC027034) hypothetical protein; 89971-89402 [Arabidopsis thaliana] Length = 189 Frame 2 hits (HSPs): _________________________ __________________________________________________ Database sequence: | | | | | 189 0 50 100 150 Plus Strand HSPs: Score = 275 (96.8 bits), Expect = 5.4e-23, P = 5.4e-23 Identities = 55/90 (61%), Positives = 70/90 (77%), Frame = +2 
Query: 2 WHPISLIVFVVLMAAWLFLYFLRDEPLIIFGRLISDRVVLIVMAVLTVVLLLLTGAIGNI 181 +HP SLIV +L+ W+FLYFLRDEPL++FG I DR VLI ++VLTVV+LLLT A NI Sbjct: 86 YHPTSLIVLSILVVFWIFLYFLRDEPLVVFGYQIDDRTVLIGLSVLTVVMLLLTHATSNI 145 Query: 182 LVALLIGAVLVVAHAALRKTDDLFFDEXEA 271 L +LL AVLV+ HAA+R++D+LF DE A Sbjct: 146 LGSLLTAAVLVLIHAAVRRSDNLFLDEEAA 175 >gi|9802742|gb|AAF99811.1|AC034257_3 (AC034257) Unknown protein [Arabidopsis thaliana] Length = 180 Frame 2 hits (HSPs): ___________________________ __________________________________________________ Database sequence: | | | | | 180 0 50 100 150 Plus Strand HSPs: Score = 273 (96.1 bits), Expect = 8.8e-23, P = 8.8e-23 Identities = 52/95 (54%), Positives = 73/95 (76%), Frame = +2 Query: 2 WHPISLIVFVVLMAAWLFLYFLRDEPLIIFGRLISDRVVLIVMAVLTVVLLLLTGAIGNI 181 W+P SL+V + L+ AWLFLYFLRDEPL +F R I R+VLI+M+V+T+ +L LT A NI Sbjct: 86 WNPFSLLVLLALLGAWLFLYFLRDEPLTVFDREIDHRIVLIIMSVITLSILFLTDAKLNI 145 Query: 182 LVALLIGAVLVVAHAALRKTDDLFFDEXEATRLSP 286 VA++ GA+ V++HAA+RKT+DLF + E + L+P Sbjct: 146 AVAIVAGALAVLSHAAVRKTEDLFQTDEETSLLNP 180 >gi|7485990|pir||T00733 hypothetical protein F22O13.28 - Arabidopsis thaliana >gi|9802565|gb|AAF99767.1|AC003981_17 (AC003981) F22O13.26 [Arabidopsis thaliana] Length = 209 Frame 2 hits (HSPs): __________________________ __________________________________________________ Database sequence: | | | | | | 209 0 50 100 150 200 Plus Strand HSPs: Score = 228 (80.3 bits), Expect = 5.2e-18, P = 5.2e-18 Identities = 45/105 (42%), Positives = 72/105 (68%), Frame = +2 Query: 2 WHPISLIVFVVLMAAWLFLYFLRD--EPLIIFGRLISDRVVLIVMAVLTVVLLLLTGAIG 175 +HP+S+I F+V+ W+ LYF RD + ++I G+ + D++VL++++++TV+ L+ T Sbjct: 97 YHPMSMIAFIVVFIGWILLYFSRDANDSIVISGKEVDDKIVLVLLSLVTVLALVYTDVGE 156 Query: 176 NILVALLIGAVLVVAHAALRKTDDLFFDEXEATRLSPPGAPLSGSPN 316 N+LV+L+IG ++V AH A R TDDLF DE A R G +GS N Sbjct: 157 NVLVSLIIGLLIVGAHGAFRNTDDLFLDEESARR---GGLVSAGSGN 200 >gi|11281603|pir||T47728 hypothetical protein F18O21.70 - Arabidopsis thaliana >gi|7572909|emb|CAB87410.1| 
(AL163763) putative protein [Arabidopsis thaliana] Length = 209 Frame 2 hits (HSPs): ______________________ __________________________________________________ Database sequence: | | | | | | 209 0 50 100 150 200 Plus Strand HSPs: Score = 210 (73.9 bits), Expect = 4.2e-16, P = 4.2e-16 Identities = 40/90 (44%), Positives = 61/90 (67%), Frame = +2 Query: 5 HPISLIVFVVLMAAWLFLYFLR--DEPLIIFGRLISDRVVLIVMAVLTVVLLLLTGAIGN 178 HP+SL+V + L+ W+FLY R D+PL++FGR SDR L+ + + T+V++ +T ++G+ Sbjct: 95 HPLSLLVLIGLLGGWMFLYLFRPSDQPLVVFGRTFSDRETLLALVLSTIVVVFMT-SVGS 153 Query: 179 ILV-ALLIGAVLVVAHAALRKTDDLFFDEXE 268 +L AL+IG +V H A DDLF DE E Sbjct: 154 LLTSALMIGVAIVCVHGAFVVPDDLFLDEQE 184 >gi|7487157|pir||T02509 hypothetical protein T19C21.15 - Arabidopsis thaliana >gi|3395436|gb|AAC28768.1| (AC004683) unknown protein [Arabidopsis thaliana] Length = 220 Frame 2 hits (HSPs): __________________________ __________________________________________________ Database sequence: | | | | | | 220 0 50 100 150 200 Plus Strand HSPs: Score = 208 (73.2 bits), Expect = 6.8e-16, P = 6.8e-16 Identities = 48/112 (42%), Positives = 72/112 (64%), Frame = +2 Query: 5 HPISLIVFVVLMAAWLFLYFLR--DEPLIIFGRLISDRVVLIVMAVLTVVLLLLTGAIGN 178 HP SL+ + L+A+WLFLY R D+P+++FGR SDR L + + ++ ++ LT +G+ Sbjct: 105 HPFSLVFLLCLLASWLFLYLFRPTDQPIVLFGRTFSDRETLGCLILFSIFVIFLTD-VGS 163 Query: 179 ILV-ALLIGAVLVVAHAALRKTDDLFFDEXE--ATR-LS-PPGAPLSGSPNLL 322 +LV A++IG L+ AH A R +DLF DE E AT LS GA S +P ++ Sbjct: 164 VLVSAMMIGVALICAHGAFRAPEDLFLDEQEPAATGFLSFLGGAASSAAPAVI 216 >gi|4586054|gb|AAD25672.1|AC007020_14 (AC007020) unknown protein [Arabidopsis thaliana] Length = 213 Frame 2 hits (HSPs): ______________________ __________________________________________________ Database sequence: | | | | | | 213 0 50 100 150 200 Plus Strand HSPs: Score = 207 (72.9 bits), Expect = 8.7e-16, P = 8.7e-16 Identities = 41/90 (45%), Positives = 61/90 (67%), Frame = +2 Query: 5 HPISLIVFVVLMAAWLFLYFLR--DEPLIIFGRLISDRVVLIVMAVLTVVLLLLTGAIGN 178 HP SL+V + L+ +W+FLY R 
D+PL++FGR SDR L+ + + T+V++ +T ++G+ Sbjct: 97 HPFSLLVLLSLLGSWMFLYLFRSSDQPLVLFGRSFSDRETLLGLVLTTIVVVFMT-SVGS 155 Query: 179 ILV-ALLIGAVLVVAHAALRKTDDLFFDEXE 268 +L AL IG +V H A R DDLF DE E Sbjct: 156 LLTSALTIGIAIVCLHGAFRVPDDLFLDEQE 186 >gi|10176750|dbj|BAB09981.1| (AB010692) emb|CAB87410.1~gene_id:K18I23.19~similar to unknown protein [Arabidopsis thaliana] Length = 217 Frame 2 hits (HSPs): _____________________ __________________________________________________ Database sequence: | | | | | | 217 0 50 100 150 200 Plus Strand HSPs: Score = 207 (72.9 bits), Expect = 8.7e-16, P = 8.7e-16 Identities = 42/90 (46%), Positives = 61/90 (67%), Frame = +2 Query: 5 HPISLIVFVVLMAAWLFLYFLR--DEPLIIFGRLISDRVVLIVMAVLTVVLLLLTGAIGN 178 HP SL+V + L AW+FLY R D+PL++ GR SDR L V+ +LT+V++ LT ++G+ Sbjct: 98 HPFSLLVLLCLFCAWIFLYLFRPSDQPLVVLGRTFSDRETLGVLVILTIVVVFLT-SVGS 156 Query: 179 ILV-ALLIGAVLVVAHAALRKTDDLFFDEXE 268 +L AL+IG +V H A R +DLF D+ E Sbjct: 157 LLTSALMIGFGIVCLHGAFRVPEDLFLDDQE 187 >gi|7487097|pir||T09929 hypothetical protein T16L4.170 - Arabidopsis thaliana >gi|5123560|emb|CAB45326.1| (AL079344) putative protein [Arabidopsis thaliana] >gi|7269865|emb|CAB79724.1| (AL161575) putative protein [Arabidopsis thaliana] Length = 272 Frame 2 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | | | | 272 0 50 100 150 200 250 Plus Strand HSPs: Score = 192 (67.6 bits), Expect = 3.4e-14, P = 3.4e-14 Identities = 37/106 (34%), Positives = 63/106 (59%), Frame = +2 Query: 2 WHPISLIVFVVLMAAWLFLYFLRDEPLIIFGRLISDRVVLIVMAVLTVVLLLLTGAIGNI 181 W P+ L VFV+L+ AWL++Y +EP +IFG +I D +++V+ VLT+ + LLT I Sbjct: 166 WQPVHLSVFVILIVAWLYVYSRDNEPWVIFGSVIDDSTLVLVLLVLTIGIFLLTDVSRGI 225 Query: 182 LVALLIGAVLVVAHAALRKTDDLFF---DEXEATRLSPPGAPLSGS 310 ++ +L G +V+ H R+ ++ F D+ E ++ + LS S Sbjct: 226 VIGVLAGLPVVLVHGMCRRNTEMLFVLEDDEEKVAMNTSSSSLSSS 271 >gi|1903367|gb|AAB70450.1| (AC000104) ESTs gb|N65789,gb|T04628 come from this gene. 
[Arabidopsis thaliana] Length = 182 Frame 2 hits (HSPs): ___________________________ __________________________________________________ Database sequence: | | | | | 182 0 50 100 150 Plus Strand HSPs: Score = 190 (66.9 bits), Expect = 5.5e-14, P = 5.5e-14 Identities = 42/101 (41%), Positives = 61/101 (60%), Frame = +2 Query: 8 PISLIVFVVLMAAWLFLYFLRDEPLIIFGRLISDRVVLIVMAVLTVVLLLLTGAIGNILV 187 PI+++ F+ + AW FLYF R+EPL IFG I D +V +++ L++ L+ TG L Sbjct: 72 PIAILAFIAVGLAWFFLYFAREEPLTIFGFTIDDGIVAVLLIGLSIGSLVTTGVWLRALT 131 Query: 188 ALLIGAVLVVAHAALRKTDDLFFDEXEATRLSPPGAPLSGS 310 + G ++++ HAALR TDDL D+ E SP G LS S Sbjct: 132 TVGFGVLVLILHAALRGTDDLVSDDLE----SPYGPMLSTS 168 >gi|11281601|pir||T48185 hypothetical protein F7A7.160 - Arabidopsis thaliana >gi|7327823|emb|CAB82280.1| (AL161946) putative protein [Arabidopsis thaliana] Length = 223 Frame 2 hits (HSPs): _____________________ __________________________________________________ Database sequence: | | | | | | 223 0 50 100 150 200 Plus Strand HSPs: Score = 187 (65.8 bits), Expect = 1.1e-13, P = 1.1e-13 Identities = 39/90 (43%), Positives = 57/90 (63%), Frame = +2 Query: 5 HPISLIVFVVLMAAWLFLYFLR--DEPLIIFGRLISDRVVLIVMAVLTVVLLLLTGAIGN 178 HP SLI+ + L A+WLFLY R D PLI+FGR S+ L + + T+ ++ T ++G+ Sbjct: 105 HPFSLILLLCLAASWLFLYLFRPSDRPLILFGRSFSEYETLGGLILSTIAVIFFT-SVGS 163 Query: 179 ILV-ALLIGAVLVVAHAALRKTDDLFFDEXE 268 +L+ AL+IG + H A R DDLF DE + Sbjct: 164 VLISALMIGIATICVHGAFRAPDDLFLDEQD 194 >gi|11281602|pir||T48482 hypothetical protein T28J14.50 - Arabidopsis thaliana >gi|7546689|emb|CAB87267.1| (AL163652) putative protein [Arabidopsis thaliana] >gi|9759567|dbj|BAB11169.1| (AB010697) emb|CAB87267.1~gene_id:MOJ9.28~similar to unknown protein [Arabidopsis thaliana] Length = 216 Frame 2 hits (HSPs): ______________________ __________________________________________________ Database sequence: | | | | | | 216 0 50 100 150 200 Plus Strand HSPs: Score = 185 (65.1 bits), Expect = 1.9e-13, P = 1.9e-13 Identities = 39/91 (42%), Positives = 
60/91 (65%), Frame = +2 Query: 5 HPISLIVFVVLMAAWLFLYFLR--DEPLIIFGRLISDRVVLIVMAVLTVVLLLLTGAIGN 178 HP +L + L A+WLFLYF R D+PL+I GR SD L ++ + TVV++ +T ++G+ Sbjct: 95 HPFALFLLASLAASWLFLYFFRPADQPLVIGGRTFSDLETLGILCLSTVVVMFMT-SVGS 153 Query: 179 ILVALL-IGAVLVVAHAALRKTDDLFFDEXEA 271 +L++ L +G + V H A R +DLF +E EA Sbjct: 154 LLMSTLAVGIMGVAIHGAFRAPEDLFLEEQEA 185 >gi|9758750|dbj|BAB09114.1| (AB023029) gb|AAC83072.1~gene_id:K24C1.4~similar to unknown protein [Arabidopsis thaliana] Length = 186 Frame 2 hits (HSPs): ________________________ __________________________________________________ Database sequence: | | | | | 186 0 50 100 150 Plus Strand HSPs: Score = 170 (59.8 bits), Expect = 7.3e-12, P = 7.3e-12 Identities = 30/88 (34%), Positives = 55/88 (62%), Frame = +2 Query: 8 PISLIVFVVLMAAWLFLYFLRDEPLIIFGRLISDRVVLIVMAVLTVVLLLLTGAIGNILV 187 P++LIV ++A WL +F R++PLI++ + DR VL+ + + +V + T + N+ V Sbjct: 88 PVALIVVGAIIALWLIFHFFREDPLILWSFQVGDRTVLLFLVLASVWAIWFTNSAVNLAV 147 Query: 188 ALLIGAVLVVAHAALRKTDDLFFDEXEA 271 + +G +L + HA R +D+LF +E +A Sbjct: 148 GVSVGLLLCIIHAVFRNSDELFLEEDDA 175 >gi|8778486|gb|AAF79494.1|AC002328_2 (AC002328) F20N2.7 [Arabidopsis thaliana] Length = 187 Frame 2 hits (HSPs): ________________________ __________________________________________________ Database sequence: | | | | | 187 0 50 100 150 Plus Strand HSPs: Score = 162 (57.0 bits), Expect = 5.1e-11, P = 5.1e-11 Identities = 29/87 (33%), Positives = 54/87 (62%), Frame = +2 Query: 8 PISLIVFVVLMAAWLFLYFLRDEPLIIFGRLISDRVVLIVMAVLTVVLLLLTGAIGNILV 187 P++L+ +A WL LYF RD PL+++GR ISDRV++ + + ++ L ++ +++ Sbjct: 90 PMALVTVASFVAMWLLLYFYRDHPLVLYGRHISDRVIVFGLILGSLWALWFINSLQCLIL 149 Query: 188 ALLIGAVLVVAHAALRKTDDLFFDEXE 268 ++ +L + HA +R +DDLF E + Sbjct: 150 GVVTSVLLCLVHAIIRNSDDLFVQEKD 176 >gi|12597896|gb|AAG60204.1|AC084763_24 (AC084763) hypothetical protein [Oryza sativa] Length = 212 Frame 2 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | | | 212 0 50 100 
150 200 Plus Strand HSPs: Score = 115 (40.5 bits), Expect = 0.00011, P = 0.00011 Identities = 31/84 (36%), Positives = 46/84 (54%), Frame = +2 Query: 5 HPISLIVFVVLMAAWLFLY--FLRDEP-LIIFGRLISDRVVLIVMAVLTVVLLLLTGAIG 175 H S++ F++ + L LY LR P + RL+ R+V +V L + L GAI Sbjct: 83 HRASML-FLMAASKGLLLYGGLLRVFPNSALLRRLLDRRLVALVFVALVLADLAAAGAIA 141 Query: 176 NILVALLIGAVLVVAHAALRKTDDL 250 N+L AL +G ++V HA+ R DDL Sbjct: 142 NLLAALAVGVPVIVLHASFRVRDDL 166 >gi|7504006|pir||T16439 hypothetical protein F53A9.8 - Caenorhabditis elegans >gi|746559|gb|AAC46563.1| (U23523) histidine-rich; contained in tandem repeat [Caenorhabditis elegans] Length = 87 Frame -2 hits (HSPs): ________________________________________________ __________________________________________________ Database sequence: | | | | | | 87 0 20 40 60 80 Minus Strand HSPs: Score = 82 (28.9 bits), Expect = 0.027, P = 0.026 Identities = 24/71 (33%), Positives = 32/71 (45%), Frame = -2 Query: 217 HHQHRSDEQRHEDVAD-GA-GEKQQHDGEH---RHDDKHDAVAD*SSEDY*RFVAEEVEE 53 HH H DE HED + GA GE H G+H +H H A + + E E+ Sbjct: 13 HHDHH-DEHHHEDHHEHGADGEHVHHAGDHCDTQHGGNHQAGEHCAKTQHQD--GEHHEQ 69 Query: 52 EPRRHEHHEHDQRD 11 H+ H HD + Sbjct: 70 HHDAHDSHAHDSHE 83 Score = 68 (23.9 bits), Expect = 1.1, P = 0.66 Identities = 11/29 (37%), Positives = 16/29 (55%), Frame = -2 Query: 187 HEDVADGAGEKQQHDGEHRHDDKHDAVAD 101 H++ G G+ H EH H+D H+ AD Sbjct: 3 HQEHGHGDGDHHDHHDEHHHEDHHEHGAD 31 Score = 68 (23.9 bits), Expect = 1.1, P = 0.66 Identities = 24/61 (39%), Positives = 30/61 (49%), Frame = -2 Query: 217 HHQHRSDEQR------HEDVADG----AGE---KQQH-DGEH--RHDDKHDAVAD*SSED 86 HH+H +D + H D G AGE K QH DGEH +H D HD+ A S E Sbjct: 25 HHEHGADGEHVHHAGDHCDTQHGGNHQAGEHCAKTQHQDGEHHEQHHDAHDSHAHDSHEG 84 Query: 85 Y 83 + Sbjct: 85 H 85 >gi|8163873|gb|AAF73890.1|AF223972_1 (AF223972) non-classical Kazal type inhibitor bdellin-KL [Hirudo nipponia] Length = 155 Frame -2 hits (HSPs): _______________________________ __________________________________________________ Database sequence: | 
| | | | 155 0 50 100 150 Minus Strand HSPs: Score = 91 (32.0 bits), Expect = 0.073, P = 0.070 Identities = 17/43 (39%), Positives = 24/43 (55%), Frame = -2 Query: 232 QRRVRHHQHRSD---EQRHEDVADGAGEKQQHDGEHRHDDKHD 113 +R+ HH+ D E+ H+D D E+ HD EH+ DD HD Sbjct: 99 ERKDEHHEEGHDDHHEEGHDDHHDDEHEEDHHDDEHKEDDHHD 141 Score = 88 (31.0 bits), Expect = 0.19, P = 0.17 Identities = 22/69 (31%), Positives = 33/69 (47%), Frame = -2 Query: 232 QRRVRHHQHRSD---EQRHEDVADGAGEKQQHDGEHRHDDKHDAVAD*SSEDY*RFVAEE 62 +R+ HH+ D E+ H+D D E+ HD EH+ DD HD +E Sbjct: 99 ERKDEHHEEGHDDHHEEGHDDHHDDEHEEDHHDDEHKEDDHHD---------------DE 143 Query: 61 VEEEPRRHEHHE 26 +E+ EHH+ Sbjct: 144 HKEDDHHEEHHD 155 Score = 88 (31.0 bits), Expect = 0.19, P = 0.17 Identities = 25/82 (30%), Positives = 40/82 (48%), Frame = -2 Query: 241 GFPQRRV-RHHQ---HRSD-EQRHEDVADGAGEKQQHDGEHRHDDKHDAVAD*SSED-Y* 80 GF + HH+ H+ + + H+D D G ++ H+GE R D+ H+ D E+ + Sbjct: 60 GFVEHHEDEHHEGEEHKEEGHEGHDDHHDD-GHEEHHEGEERKDEHHEEGHDDHHEEGHD 118 Query: 79 RFVAEEVEEEPRRHEHHEHDQRD 11 +E EE+ EH E D D Sbjct: 119 DHHDDEHEEDHHDDEHKEDDHHD 141 >gi|3064158|gb|AAC14225.1| (AF036414) mucin-like protein [Trypanosoma cruzi] Length = 95 Frame -3 hits (HSPs): _____________________________________ __________________________________________________ Database sequence: | | | | | | 95 0 20 40 60 80 Minus Strand HSPs: Score = 74 (26.0 bits), Expect = 0.22, P = 0.20 Identities = 24/76 (31%), Positives = 34/76 (44%), Frame = -3 Query: 297 GAPGGDNRVASXSSKNKSSVFLNAACATTNTAPMSSATRMLPMAPVRSSSTTVSTAMTIS 118 G G N+ A + +N TT AP ++ T P AP STT + A T + Sbjct: 3 GPQGSKNKSADQTYQNGEDPAATTTTTTTTMAPTTTTTA--PQAP----STTTTEAPTTT 56 Query: 117 TTRSLINLPKIINGSS 70 +TR+ L +I G S Sbjct: 57 STRAPSRLRRIDGGLS 72 >gi|10175381|dbj|BAB06479.1| (AP001516) BH2760~unknown conserved protein [Bacillus halodurans] Length = 380 Frame -2 hits (HSPs): __________ __________________________________________________ Database sequence: | | | | 380 0 150 300 Minus Strand HSPs: Score = 93 (32.7 bits), Expect = 
0.34, P = 0.29 Identities = 23/70 (32%), Positives = 30/70 (42%), Frame = -2 Query: 214 HQHRSDEQRHE-DVADGAGEKQQHDGEHRHDDKHDAVAD*SSEDY*RFVAEEVEEEPRRH 38 H H E HE D D + E+ H GE HD H+ +D+ E EE H Sbjct: 147 HDHSHGEYGHEEDDHDHSHEEHGH-GEDDHDHSHEEHGH-EEDDHDHSHEEHGHEEDDHH 204 Query: 37 EHHEHDQRDWM 5 HH+ D W+ Sbjct: 205 HHHDEDPHVWL 215 >gi|12721246|gb|AAK03010.1| (AE006132) FimA [Pasteurella multocida] Length = 366 Frame -2 hits (HSPs): ______________ __________________________________________________ Database sequence: | | | | 366 0 150 300 Minus Strand HSPs: Score = 92 (32.4 bits), Expect = 0.44, P = 0.35 Identities = 21/71 (29%), Positives = 32/71 (45%), Frame = -2 Query: 214 HQHRSDEQRHEDVADGAGEKQQHDGEHRHDDKHDAVAD*SSEDY*RFVAEEVEEEPRRHE 35 H+H D + D K +HD +H HD KHD D + + + + +H+ Sbjct: 129 HKHDHDHKHDHDHKHDHDHKHEHDHKHDHDHKHDH--DHKHDHAHKHEHDHKHDHEHKHD 186 Query: 34 H---HEHDQR-DW 8 H HEHD +W Sbjct: 187 HAHGHEHDHSTNW 199 Score = 89 (31.3 bits), Expect = 1.0, P = 0.63 Identities = 24/78 (30%), Positives = 36/78 (46%), Frame = -2 Query: 250 QIIGFPQRRVRHHQHRSDE-QRHE-DVADGAGEKQQHDGEHRHDDKHDAVAD*SSEDY*R 77 +I G + H+H D +H+ D K HD +H HD KHD D+ Sbjct: 109 EIGGLLEGEAHDHKHEHDHTHKHDHDHKHDHDHKHDHDHKHEHDHKHDHDHK-HDHDHKH 167 Query: 76 FVAEEVEEEPRRHEH-HEHD 20 A + E + +H+H H+HD Sbjct: 168 DHAHKHEHD-HKHDHEHKHD 186 >gi|6634477|emb|CAB64450.1| (AJ251974) hydrophilic acylated surface protein B [Leishmania mexicana] Length = 176 Frame -2 hits (HSPs): __________________________________ __________________________________________________ Database sequence: | | | | | 176 0 50 100 150 Minus Strand HSPs: Score = 86 (30.3 bits), Expect = 0.54, P = 0.42 Identities = 30/116 (25%), Positives = 50/116 (43%), Frame = -2 Query: 331 KQKEKIGGSRKRSTGRRQSSGLXFVEEQIIGFPQRRVRHHQHRSDEQ--RHED----VAD 170 K+ EK + +TG + G ++ G ++ H ++D +H+D D Sbjct: 11 KEPEKRADNIDTTTGSNKKDGGHDHHQRTDGDGEKN-DHDGEKADGDAGKHDDDHHQKTD 69 Query: 169 GAGEKQQHDGEH------RHDDKHDAVAD*SSEDY*RFVAEEVEEEPRRHEHHEHDQR 14 G GEK HDGE +HDD H D E E+ + + +H+ 
+H Q+ Sbjct: 70 GDGEKNDHDGEKADGDAGKHDDDHHQKTDGDGEKN-DHDGEKADGDAGKHDDDDHHQK 126 >gi|1082603|pir||S53365 mucin 5AC (clone CEL2) - human (fragment) >gi|563373|emb|CAA84030.1| (Z34276) mucin [Homo sapiens] Length = 94 Frame -3 hits (HSPs): _______________________________________________ __________________________________________________ Database sequence: | | | | | | 94 0 20 40 60 80 Minus Strand HSPs: Score = 69 (24.3 bits), Expect = 0.83, P = 0.57 Identities = 26/88 (29%), Positives = 42/88 (47%), Frame = -3 Query: 273 VASXSSKNKSSVFLNAACATTNTAPMSSATRMLPMAPVRSSSTTVST-AMTISTTRSLIN 97 V + S+ + S+ + TT + +++T +P S+STT +T A T STT Sbjct: 7 VPTASTTSASTTSTTSGPGTTPSPVPTTSTISVPTTSTTSASTTSTTSASTTSTTSGPGT 66 Query: 96 LPKII-NGSSRRK*RKSHAAMSTTNTIS 16 P + S+ S + STT+TIS Sbjct: 67 TPSPVPTTSTTSAPTTSTTSASTTSTIS 94 >gi|2352447|gb|AAC72445.1| (AF004379) orf81 [Streptococcus thermophilus bacteriophage Sfi21] Length = 81 Frame 3 hits (HSPs): ______________________________________________ __________________________________________________ Database sequence: | | | | | | 81 0 20 40 60 80 Plus Strand HSPs: Score = 69 (24.3 bits), Expect = 0.83, P = 0.57 Identities = 22/74 (29%), Positives = 37/74 (50%), Frame = +3 Query: 300 FLDPPIFSFCFFL*ISYLVNLVNNFRFGLYA--------FALFRRG-NFHLESVCEERKM 452 FL P +F F L + ++ + NNF ++ F ++R H+ + C RK+ Sbjct: 6 FLQP-LFLFWSCLFVIFIFDKRNNFNAIIFIDPIVFSNLFKMYRNSVALHIYNCCRFRKI 64 Query: 453 FGEVKFLFLRFVNLI 497 F + +F+RFV LI Sbjct: 65 FDVITNIFIRFVFLI 79 >gi|419748|pir||S31097 cold acclimation protein - spinach >gi|2673888|gb|AAB88628.1| (M96259) 85 kDa cold acclimation protein [Spinacia oleracea] Length = 535 Frame -2 hits (HSPs): ____________ Annotated Domains: ___________ __________ _________________________ __________________________________________________ Database sequence: | | | | | 535 0 150 300 450 __________________ Annotated Domains: DOMO DM02152: DEHYDRINS 1..85 DOMO DM00110: DEHYDRINS 87..113 DOMO DM00110: DEHYDRINS 134..157 DOMO DM02972: 159..203 DOMO 
DM00110: DEHYDRINS 205..233 DOMO DM00110: DEHYDRINS 268..295 DOMO DM02972: 297..348 DOMO DM00110: DEHYDRINS 350..377 DOMO DM02972: 379..401 DOMO DM00110: DEHYDRINS 403..428 DOMO DM02972: 430..487 DOMO DM00110: DEHYDRINS 489..516 PROSITE DEHYDRIN_2: Dehydrins signature 2. 105..112 PROSITE DEHYDRIN_2: Dehydrins signature 2. 151..158 PROSITE DEHYDRIN_2: Dehydrins signature 2. 182..189 PROSITE DEHYDRIN_2: Dehydrins signature 2. 225..232 PROSITE DEHYDRIN_2: Dehydrins signature 2. 287..294 PROSITE DEHYDRIN_2: Dehydrins signature 2. 333..340 PROSITE DEHYDRIN_2: Dehydrins signature 2. 369..376 PROSITE DEHYDRIN_2: Dehydrins signature 2. 420..427 PROSITE DEHYDRIN_2: Dehydrins signature 2. 508..515 __________________ Minus Strand HSPs: Score = 91 (32.0 bits), Expect = 0.97, P = 0.62 Identities = 29/117 (24%), Positives = 59/117 (50%), Frame = -2 Query: 352 K*EIHRKKQKEKIGGSRKRSTGRRQSSGLXFVEEQIIGFPQRRVR-HHQHRSDEQRHEDV 176 K + H+ +++EK GG+ + + G +Q+ P HH H+ DE+ + V Sbjct: 164 KQDYHQHQEEEKKGGALDKIKDKLPGQGNAGHTQQLYPAPDHNYNTHHVHQ-DEENKDSV 222 Query: 175 ADGAGEKQ--QHDGE----HRH--DDKHDAVAD*SSEDY*RFVAEEVEEEPRRHEHHEHD 20 D +K QH+ + H H ++K D+V D +D ++ + E++ + HH+ + Sbjct: 223 LDKIKDKLPGQHEDKKNDYHHHQEEEKKDSVLD-KIKDK---MSGQHEDKKNDYHHHQEE 278 Query: 19 QR 14 ++ Sbjct: 279 EK 280 Score = 89 (31.3 bits), Expect = 1.7, P = 0.82 Identities = 23/116 (19%), Positives = 53/116 (45%), Frame = -2 Query: 352 K*EIHRKKQKEKIGGSRKRSTGRRQSSGLXFVEEQIIGFPQRRVR-HHQHRSDEQRHEDV 176 K + H+ +++EK GG+ + + G +Q+ P HH H+ DE+ + V Sbjct: 164 KQDYHQHQEEEKKGGALDKIKDKLPGQGNAGHTQQLYPAPDHNYNTHHVHQ-DEENKDSV 222 Query: 175 ADGAGEKQQHDGEHRHDDKHDAVAD*SSEDY*RFVAEEV--EEEPRRHEHHEHDQRD 11 D +K E + +D H + + + +++ + E +++++H H + + Sbjct: 223 LDKIKDKLPGQHEDKKNDYHHHQEEEKKDSVLDKIKDKMSGQHEDKKNDYHHHQEEE 279 >gi|7295863|gb|AAF51163.1| (AE003581) CG17264 gene product [Drosophila melanogaster] Length = 704 Frame -3 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | | 704 0 150 300 450 600 
Minus Strand HSPs: Score = 91 (32.0 bits), Expect = 1.4, P = 0.74 Identities = 30/83 (36%), Positives = 43/83 (51%), Frame = -3 Query: 270 ASXSSKNKSSVFLNAACATTNTAPMSSATRMLPMAPVRSSSTTVSTAMTISTTRSLINLP 91 A + ++ L A TT TAP SS T M +A + +S+TT T MT+ + S + Sbjct: 445 AQSDAAGATATTLGNAAVTT-TAPTSS-TLMQMLANMANSTTT--TLMTMMSGNSTLAAS 500 Query: 90 KIINGSSRRK*RKSHAAMSTTNT 22 + +GS+ S AAMSTT T Sbjct: 501 SVSSGSTSSSSSSSTAAMSTTTT 523 >gi|9634286 ref|NP_037825.1| ORF65 p6.9 DNA binding protein [Spodoptera exigua nucleopolyhedrovirus] >gi|6960525|gb|AAF33595.1|AF169823_65 (AF169823) ORF65 p6.9 DNA binding protein [Spodoptera exigua nucleopolyhedrovirus] Length = 75 Frame -2 hits (HSPs): ________________________________________ __________________________________________________ Database sequence: | | | | | 75 0 20 40 60 Minus Strand HSPs: Score = 67 (23.6 bits), Expect = 1.4, P = 0.76 Identities = 19/63 (30%), Positives = 33/63 (52%), Frame = -2 Query: 334 KKQKEKIGGSRKRSTG-RRQSSGLXFVEEQIIGFPQRRVRHHQHRSDEQRHEDVADGAGE 158 + + GG R+RS+G RR+SS + G+ +R R + RS +R + G G Sbjct: 15 RSTRRSSGGYRRRSSGYRRRSSSNRRTYRRSSGYHRRPGRPRKRRSSRRR----SSGGGY 70 Query: 157 KQQH 146 +++H Sbjct: 71 RRRH 74 >gi|7484702|pir||T10738 hypothetical protein FbLate-2 - sea-island cotton >gi|1143224|gb|AAA84881.1| (U34401) FbLate-2 gene product [Gossypium barbadense] Length = 333 Frame -2 hits (HSPs): ___________________________________ __________________________________________________ Database sequence: | | | | 333 0 150 300 Minus Strand HSPs: Score = 87 (30.6 bits), Expect = 1.5, P = 0.78 Identities = 28/114 (24%), Positives = 53/114 (46%), Frame = -2 Query: 367 FTKLTK*EIHRKKQKEKIGGSRKRSTGRRQSSGLXFVEEQIIGFPQRRVRHHQHRSDEQR 188 + K K EIH K++K+K + +S +++ FP+ + +H E Sbjct: 53 YPKHEKPEIH-KEEKQKPCKQHEEYHESHKSKEHEEYQKEKPEFPKLE-KPKEHEKHEVE 110 Query: 187 HEDVADGAGEKQQHDGEHRHDDKHDAVAD*SSEDY*RFVAEEVE-EEPRRHEHHE 26 + + + E Q EH+H++ H++ E+Y + E + E+P+ HE HE Sbjct: 111 YPKILEYK-ENQDEGKEHKHEEYHESRESKEHEEYEKEKPEFPKLEKPKEHEKHE 164 
Score = 85 (29.9 bits), Expect = 2.6, P = 0.93 Identities = 28/108 (25%), Positives = 49/108 (45%), Frame = -2 Query: 346 EIHRKKQKEKIGGSRKRSTGRR--QSSGLXFVEEQIIGFPQRRVRHHQHRSDEQRHEDVA 173 EI K+K+ G K + +S E++ FP+ + +H E + + Sbjct: 169 EIPEYKEKQDEGKEHKHEECHKSHESKEHEEYEKEKPNFPKGE-KPKEHEKHEVEYPKIP 227 Query: 172 DGAGEKQQHDGEHRHDDKHDAVAD*SSEDY*RFVAEEVE-EEPRRHEHHE 26 + EKQ EH+HD+ H++ E+Y + + E+P+ HE HE Sbjct: 228 EYK-EKQDEGKEHKHDECHESHELKEHEEYEKEKPNFPKGEKPKEHEKHE 276 >gi|6324615 ref|NP_014685.1| Yor042wp [Saccharomyces cerevisiae] >gi|2132035|pir||S66916 hypothetical protein YOR042w - yeast (Saccharomyces cerevisiae) >gi|1420167|emb|CAA99232.1| (Z74949) ORF YOR042w [Saccharomyces cerevisiae] Length = 411 Frame -2 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | 411 0 150 300 Minus Strand HSPs: Score = 88 (31.0 bits), Expect = 1.5, P = 0.79 Identities = 33/137 (24%), Positives = 58/137 (42%), Frame = -2 Query: 412 PLRKRAKAYKPNLKLFTKLTK*EIHRKKQKEKIGGSRKRSTGRRQSSGLXFVEEQIIGFP 233 P+RK +A P + T+L + E+ ++ E+ S R R +++ + + P Sbjct: 153 PVRKNPEA--PARRRQTQLEQDELLARQLDEQFNSSHSRRRNRDRATRSMHEQRRRRHNP 210 Query: 232 QRRVRHHQHRSDEQRHED-VADGAGEKQQHDGEHRHDDKHDAVAD*SSEDY*RFVAEEVE 56 R +HH+ +E V E G D + V++ S+ Y R A E Sbjct: 211 NEREQHHEDSEEEDSWSQFVEKDLPELTDRAGRSLQDTANK-VSNWISDAYRRNFASGNE 269 Query: 55 EEPRRHEHHEHDQRDWMP 2 + +H H DQ++W P Sbjct: 270 QNDNQHGHQ--DQQEWEP 285 >gi|7487086|pir||T04991 hypothetical protein T16L1.230 - Arabidopsis thaliana >gi|3549676|emb|CAA20587.1| (AL031394) putative protein [Arabidopsis thaliana] >gi|7270323|emb|CAB80091.1| (AL161584) putative protein [Arabidopsis thaliana] Length = 227 Frame -2 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | | | | 227 0 50 100 150 200 Minus Strand HSPs: Score = 84 (29.6 bits), Expect = 1.8, P = 0.83 Identities = 24/78 (30%), Positives = 43/78 (55%), Frame = -2 Query: 217 
HHQHRSDEQRHEDVADGAGEKQQ----HDGEHRHD---DKHDAVAD*SSEDY*RF---VA 68 + +H +++ E+++ G EK++ +G H + D+ + VA+ ED + VA Sbjct: 84 NEKHVEEDEDEEEISHGGEEKEKKSKVENGNHEEEVEKDEEEEVAEDDEEDKNKQGEEVA 143 Query: 67 EEVEEEPRRHEHHEHDQRD 11 EE EEE +HE E D++D Sbjct: 144 EEDEEE-NKHEEDEIDEQD 161 >gi|6177896|dbj|BAA86073.1| (AB020482) heme-copper oxidase subunit IV [Aeropyrum pernix] Length = 103 Frame 2 hits (HSPs): ____________________________ __________________________________________________ Database sequence: | | | | | | | 103 0 20 40 60 80 100 Plus Strand HSPs: Score = 72 (25.3 bits), Expect = 1.9, P = 0.85 Identities = 21/55 (38%), Positives = 33/55 (60%), Frame = +2 Query: 5 HPISLIVFVVLMAAWLF-LYF--LRDEPLIIFGRLISDRVVLIVMAVLTVVLLLLT 163 +P ++ V L + L L+F LRDEP+II G +S VLI + +++ V +LT Sbjct: 42 NPFVFVLAVALFQSSLIALFFQHLRDEPIIIRGITVSG-AVLIAILIISAVTSVLT 96 >gi|1361323|pir||D53203 hypothetical protein 4 - Desulfovibrio vulgaris (strain Miyazaki) >gi|476040|dbj|BAA04828.1| (D21804) unnamed protein product [Desulfovibrio vulgaris] Length = 68 Frame -1 hits (HSPs): _______________________________ __________________________________________________ Database sequence: | | | | | 68 0 20 40 60 Minus Strand HSPs: Score = 65 (22.9 bits), Expect = 2.4, P = 0.91 Identities = 19/42 (45%), Positives = 20/42 (47%), Frame = -1 Query: 269 PLXRRRTNHRFS------STPRA-PPPTPLR*AAPRGCCRWR 165 P RRR + R S PRA PPP R A R CRWR Sbjct: 2 PARRRRASSRSSHPCRAVGPPRAAPPPVRSRWPARRSGCRWR 43 >gi|1175147|sp|P44526|ZNUA_HAEIN HIGH-AFFINITY ZINC UPTAKE SYSTEM PROTEIN ZNUA >gi|1073823|pir||D64049 adhesin homolog HI0119 - Haemophilus influenzae (strain Rd KW20) >gi|1573074|gb|AAC21794.1| (U32698) conserved hypothetical protein [Haemophilus influenzae Rd] Length = 337 Frame -2 hits (HSPs): _________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | 337 0 150 300 __________________ Annotated Domains: DOMO DM06600: 1..56 DOMO DM08399: 58..272 
DOMO DM06599: 274..336 Entrez Domain: HIS-RICH. 115..163 PFAM Lipoprotein_4: Adhesion lipoprotein 3..336 PRODOM PD002751: ADHS(4) ZNUA(2) 2..115 PRODOM PD002761: ADHS(4) ZNUA(2) 171..324 __________________ Minus Strand HSPs: Score = 85 (29.9 bits), Expect = 2.6, P = 0.93 Identities = 23/69 (33%), Positives = 32/69 (46%), Frame = -2 Query: 217 HHQHRSDEQRHEDVADGAGEKQQHDGEHRHDDKHDAVAD*SSEDY*RFVAEEVEEEPRRH 38 HH+H HED G+ HD +H+H+ KHD D D+ + E H Sbjct: 115 HHEHF-----HED-----GD---HDHDHKHEHKHDHKHD-HDHDH-----DHKHEHKHDH 155 Query: 37 EHHEHDQRD 11 EHH+HD + Sbjct: 156 EHHDHDHHE 164 >gi|903866|gb|AAA74210.1| (L43851) surface antigen [Pneumocystis carinii] Length = 288 Frame -2 hits (HSPs): __________ __________________________________________________ Database sequence: | | | | | | | 288 0 50 100 150 200 250 Minus Strand HSPs: Score = 84 (29.6 bits), Expect = 2.8, P = 0.94 Identities = 20/54 (37%), Positives = 25/54 (46%), Frame = -2 Query: 172 DGAGEKQ-QHDGEHRHDDKHDAVAD*SSEDY*RFVAEEVEEEPRRHEHHEHDQRD 11 DG G+ +HD EH HD H+ + ED + E HEHH HD D Sbjct: 200 DGDGDDDDEHDHEHGHDHDHEDGQEHEDEDG----HDHGHEHEDGHEHHHHDHDD 250 >gi|6651243|gb|AAF22235.1|AF152557_1 (AF152557) histidine and aspartic acid rich protein [Pneumocystis carinii f. sp. 
carinii] Length = 296 Frame -2 hits (HSPs): __________ __________________________________________________ Database sequence: | | | | | | | 296 0 50 100 150 200 250 Minus Strand HSPs: Score = 84 (29.6 bits), Expect = 2.9, P = 0.94 Identities = 20/54 (37%), Positives = 25/54 (46%), Frame = -2 Query: 172 DGAGEKQ-QHDGEHRHDDKHDAVAD*SSEDY*RFVAEEVEEEPRRHEHHEHDQRD 11 DG G+ +HD EH HD H+ + ED + E HEHH HD D Sbjct: 208 DGDGDDDDEHDHEHGHDHDHEDGQEHEDEDG----HDHGHEHEDGHEHHHHDHDD 258 >gi|12239366|gb|AAG49447.1|AF141344_1 (AF141344) LYST-interacting protein LIP8 [Homo sapiens] Length = 119 Frame 3 hits (HSPs): ___________________________ __________________________________________________ Database sequence: | | | | 119 0 50 100 Plus Strand HSPs: Score = 75 (26.4 bits), Expect = 2.9, P = 0.95 Identities = 26/64 (40%), Positives = 30/64 (46%), Frame = +3 Query: 123 SSWRCSPSCCCFSPAPSATSSWRCSSERCW-WWRTRR*GKPMICSSTXKRPLDCLLPV-- 293 S W S C + TSSWR S W WW+ R P C T RP C P+ Sbjct: 6 SDWPGSKREC--ANCRVGTSSWRSSG---WSWWKDCR---PC-CRPTGMRPTSCSAPLSR 56 Query: 294 -----LLFLDPP 314 LL +DPP Sbjct: 57 RPTLKLLLMDPP 68 >gi|70792|pir||GACH protamine - chicken >gi|229596|prf||764186A protamine [Gallus gallus] Length = 65 Frame 1 hits (HSPs): ______________________________ __________________________________________________ Database sequence: | | | | | 65 0 20 40 60 Plus Strand HSPs: Score = 64 (22.5 bits), Expect = 3.1, P = 0.95 Identities = 15/38 (39%), Positives = 20/38 (52%), Frame = +1 Query: 67 PRRTVNNLRKINQRPRRAYRHGGAHRRAAASHRRHRQH 180 PRR + R+ RR+ R GG RR S RR R++ Sbjct: 28 PRRRRSRRRRRYGSARRSRRSGGVRRRRYGSRRRRRRY 65 >gi|7509339|pir||T26452 hypothetical protein Y113G7C.1 - Caenorhabditis elegans >gi|3979926|emb|CAA22059.1| (AL033509) predicted using Genefinder~contains similarity to Pfam domain: PF00102 (Protein-tyrosine phosphatase), Score=110.1, E-value=1.3e-29, N=1; PF02206 (Domain of unknown function), Score=70.8, E-value=9.2e-18, N=1~cDNA EST yk18b9.3 comes from thi> Length = 1494 Frame -2 hits 
(HSPs): _____ __________________________________________________ Database sequence: | | | | 1494 0 500 1000 Minus Strand HSPs: Score = 91 (32.0 bits), Expect = 3.2, P = 0.96 Identities = 32/111 (28%), Positives = 52/111 (46%), Frame = -2 Query: 346 EIHRKKQKEKIGGSRKRSTGRRQSSGLXFVEEQIIGFPQRRVRHHQHRSDEQRHEDVADG 167 +IH K ++E+ +KR T Q +E +R++ + + DE+R + Sbjct: 1012 KIHEKDREEE----KKRKTQDFQDQQDADIERAEADEQRRKMNVERAKKDEERAKKDKRD 1067 Query: 166 AGEKQQHDGEHRHDDKHDAV-AD*SSEDY*RFVAEEVEEEPR---RHEHHEHDQR 14 A E+++ D E DK DA A E + AE E E + RHE E D++ Sbjct: 1068 AEERRKKDEERAKKDKEDAKKAKEDQEKIDKLAAEVKEREEKSLARHEKEEEDEK 1122 >gi|11357492|pir||T48398 hypothetical protein F17C15.130 - Arabidopsis thaliana >gi|7340656|emb|CAB82936.1| (AL162506) putative protein [Arabidopsis thaliana] Length = 81 Frame -2 hits (HSPs): __________________________________________ __________________________________________________ Database sequence: | | | | | | 81 0 20 40 60 80 Minus Strand HSPs: Score = 63 (22.2 bits), Expect = 4.0, P = 0.98 Identities = 13/66 (19%), Positives = 30/66 (45%), Frame = -2 Query: 211 QHRSDEQRHEDVADGAGEKQQHDGEHRHDDKHDAVAD*SSEDY*RFVAEEVEEEPRRHEH 32 + +E+ E+ + E+++ + E +++ + + E+ EE EEE E Sbjct: 8 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEED 67 Query: 31 HEHDQR 14 E ++R Sbjct: 68 REREER 73 Score = 62 (21.8 bits), Expect = 5.1, P = 0.99 Identities = 13/63 (20%), Positives = 29/63 (46%), Frame = -2 Query: 199 DEQRHEDVADGAGEKQQHDGEHRHDDKHDAVAD*SSEDY*RFVAEEVEEEPRRHEHHEHD 20 +E+ E+ + E+++ + E +++ + + E+ EE EEE E E + Sbjct: 7 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE 66 Query: 19 QRD 11 R+ Sbjct: 67 DRE 69 Score = 61 (21.5 bits), Expect = 6.7, P = 1.0 Identities = 13/62 (20%), Positives = 28/62 (45%), Frame = -2 Query: 196 EQRHEDVADGAGEKQQHDGEHRHDDKHDAVAD*SSEDY*RFVAEEVEEEPRRHEHHEHDQ 17 E+ E+ + E+++ + E +++ + + E+ EE EEE E E ++ Sbjct: 6 EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE 65 Query: 16 RD 11 D Sbjct: 66 ED 67 
>gi|2134214|pir||C58213 protamine II - American alligator Length = 56 Frame 1 hits (HSPs): ________________________________________ Annotated Domains: __________ __________________________________________________ Database sequence: | | | | 56 0 20 40 __________________ Annotated Domains: PROSITE PROTAMINE_P1: Protamine P1 signature. 1..12 __________________ Plus Strand HSPs: Score = 63 (22.2 bits), Expect = 4.0, P = 0.98 Identities = 15/44 (34%), Positives = 21/44 (47%), Frame = +1 Query: 70 RRTVNNLRKINQRPRRAYRHGGAHRRAAASHRRHRQHPRGAAHR 201 R + R ++R RR +R G RR HR+H RG + R Sbjct: 4 RHNRSRSRSRHRRRRRGHRGGRYRRRRRRGRYGHRRHHRGHSRR 47 >gi|4504487 ref|NP_002143.1| histidine-rich calcium-binding protein precursor [Homo sapiens] >gi|134873|sp|P23327|SRCH_HUMAN SARCOPLASMIC RETICULUM HISTIDINE-RICH CALCIUM-BINDING PROTEIN PRECURSOR >gi|1082444|pir||A54660 histidine rich calcium binding protein - human >gi|183919|gb|AAA88071.1| (M60052) histidine-rich calcium binding protein [Homo sapiens] Length = 699 Frame -1 hits (HSPs): ___ Frame -2 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | | 699 0 150 300 450 600 Minus Strand HSPs: Score = 84 (29.6 bits), Expect = 4.5, Sum P(2) = 0.99 Identities = 27/80 (33%), Positives = 37/80 (46%), Frame = -2 Query: 256 EEQIIGFPQRRVRHHQHRSDEQRHEDVADGA---GEKQQHDGEHRHDDKHDAVAD*SSED 86 EE + + RH H S+E EDV+DG G +H G DD D D +D Sbjct: 203 EEASTEYGHQAHRHRGHGSEED--EDVSDGHHHHGPSHRHQGHEEDDDDDDDDDDDDDDD 260 Query: 85 Y*RFVAEEVEEEPRRHEHH--EHDQ 17 V+ E + RH+ H E D+ Sbjct: 261 D---VSIEYRHQAHRHQGHGIEEDE 282 Score = 40 (14.1 bits), Expect = 4.5, Sum P(2) = 0.99 Identities = 10/27 (37%), Positives = 16/27 (59%), Frame = -1 Query: 350 IRNSQKEAEGED-WGIQKEEHREETIE 273 +R+ + +GED G ++EE EE E Sbjct: 178 LRHGHRGHDGEDDEGEEEEEEEEEEEE 204 >gi|786117|gb|AAA98076.1| (L41834) nuclear protein [Ensis minor] Length = 505 Frame -2 hits (HSPs): ______________ __________________________________________________ Database sequence: 
| | | | | 505 0 150 300 450 Minus Strand HSPs: Score = 85 (29.9 bits), Expect = 4.6, P = 0.99 Identities = 30/134 (22%), Positives = 62/134 (46%), Frame = -2 Query: 514 EKNSDIIKLTKRKKRNFTSPNIFLSSQTDSK*KFPLRKRAKAYKPNL-KLFTKLTK*EIH 338 +K S K + KKR+ + ++ S+ + +KR+K+ K + K +K K Sbjct: 194 KKRSHSRKRSASKKRSHSRKRSASKKRSKSRKRSASKKRSKSRKRSASKKRSKSRKRSAS 253 Query: 337 RKKQKE-KIGGSRKRSTGRRQSSGLXFVEEQIIGFPQRRVRHHQHRSDEQRHEDVADGAG 161 +K+ K K S+KRS R++S+ + + ++R + + + ++R + A Sbjct: 254 KKRSKSRKRSASKKRSKSRKRSASKKRSKSRKRSASKKRSKSRKRSASKKRSKSRKRSAS 313 Query: 160 EKQQHDGEHRHDDK 119 +K+ H + R K Sbjct: 314 KKRSHSRKRRPSKK 327 >gi|9845306 ref|NP_064120.1| pr4.1 [rat cytomegalovirus Maastricht] >gi|9800241|gb|AAF99115.1|AF232689_6 (AF232689) pr4.1 [rat cytomegalovirus Maastricht] Length = 247 Frame -3 hits (HSPs): ________________________ __________________________________________________ Database sequence: | | | | | | 247 0 50 100 150 200 Minus Strand HSPs: Score = 81 (28.5 bits), Expect = 4.8, P = 0.99 Identities = 33/117 (28%), Positives = 58/117 (49%), Frame = -3 Query: 432 QTPNENFPSEKGQKHINRT*NYLLN*PNKKFTERSRRRRLGDPERG-APGGD------NR 274 QTP+E+ P+ + + + + + + SRR R P+ +P G +R Sbjct: 18 QTPDESSPAAPSESRASSDSSLVDGGARGGYGQSSRRNR---PQGSTSPSGSTSRPRRSR 74 Query: 273 VASXSSKNKSSVFLNAACATTNTAPMSSATRMLPMAPVRSSSTTVSTAMTISTTRSLINL 94 +S +S + SS ++A + + AP SS+ R P +P S +T ++ T STT S + Sbjct: 75 SSSSASSSASSSASSSASSFPSDAP-SSSRRSPPTSPATSGPSTSLSSDTTSTTVSRSSS 133 Query: 93 P 91 P Sbjct: 134 P 134 >gi|462338|sp|P35305|HSP1_DIDMA SPERM PROTAMINE P1 >gi|1071970|pir||S34045 protamine - North American opossum >gi|294036|gb|AAA02812.1| (L17007) protamine 1 [Didelphis marsupialis] >gi|407063|emb|CAA52193.1| (X74044) protamine P1 [Didelphis marsupialis] >gi|598253|gb|AAA74612.1| (L35448) protamine P1 [Monodelphis domestica] >gi|1582108|prf||2117429A protamine P1 [Didelphis marsupialis] >gi|1582109|prf||2117429B protamine P1 [Monodelphis domestica] Length = 58 Frame 1 hits (HSPs): 
_________________________________ Annotated Domains: ________________________________________________ __________________________________________________ Database sequence: | | | | 58 0 20 40 __________________ Annotated Domains: BLOCKS BL00048: Protamine P1 proteins. 1..27 DOMO DM01168: PROTAMINEP1 1..56 PFAM protamine_P1: Protamine P1 1..55 PRODOM PD001830: HSP1(29) VE2(21) GAG(12) 2..56 PROSITE PROTAMINE_P1: Protamine P1 signature. 2..12 __________________ Plus Strand HSPs: Score = 62 (21.8 bits), Expect = 5.1, P = 0.99 Identities = 17/39 (43%), Positives = 23/39 (58%), Frame = +1 Query: 70 RRTVNNLRKINQRPRRAYRHG-GAHRRAAASHRRHRQHPR 186 RR+ + R+ +R RR R G G HRR+ HRR R+ R Sbjct: 21 RRSRSRRRRSRRRRRRRGRRGRGYHRRSP--HRRRRRRRR 58 >gi|10801034|emb|CAC12965.1| (AJ250175) putative metallothionein [Yarrowia lipolytica] Length = 54 Frame 3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 54 0 20 40 Plus Strand HSPs: Score = 62 (21.8 bits), Expect = 5.1, P = 0.99 Identities = 11/22 (50%), Positives = 12/22 (54%), Frame = +3 Query: 144 SCCCFSPAPSATSSWRCSSERC 209 SCCC PA T+S CS C Sbjct: 28 SCCCSKPAEKPTNSCTCSKCAC 49 >gi|730133|sp|P40139|NG2_DROME NEW-GLUE PROTEIN 2 PRECURSOR (NG-2) >gi|479421|pir||S33822 salivary glue protein ng-2 - fruit fly (Drosophila melanogaster) >gi|296041|emb|CAA43951.1| (X61945) ng-2 [Drosophila melanogaster] Length = 112 Frame -3 hits (HSPs): _______________________________________ Annotated Domains: ______________________________ __________________________________________________ Database sequence: | | | | 112 0 50 100 __________________ Annotated Domains: Entrez Domain: 4 X 8 AA TANDEM REPEATS OF T-S-A 31..62 Entrez Repetitive region: 1. 31..38 Entrez Repetitive region: 2. 39..46 Entrez Repetitive region: 3. 47..54 Entrez Repetitive region: 4. 
55..62 PRODOM PD044799: NG2_DROME 1..22 PRODOM PD000651: VL2(74) O18758(34) MUC2(10) 25..68 __________________ Minus Strand HSPs: Score = 72 (25.3 bits), Expect = 5.3, P = 1.0 Identities = 20/85 (23%), Positives = 40/85 (47%), Frame = -3 Query: 267 SXSSKNKSSVFLNAACATTNTAPMSSATRMLPMAPVRSSSTTVSTAMTISTTRSLINLPK 88 + +S + S+ +A ATT T+ ++ T +S ++ S T++ + + PK Sbjct: 27 TTTSTSASATTTTSASATTTTSASATTTTSASATTTTASPSSSSKKKTVTHYKRKVKRPK 86 Query: 87 IINGSSRRK*RKSHAAMSTTNTISE 13 + +RR+ +S S+ N SE Sbjct: 87 KVRKITRRRGLRSRNGRSSRNRRSE 111 >gi|3242350|emb|CAA19671.1| (AL024484) EG:96G10.4 [Drosophila melanogaster] Length = 112 Frame -3 hits (HSPs): _______________________________________ __________________________________________________ Database sequence: | | | | 112 0 50 100 Minus Strand HSPs: Score = 72 (25.3 bits), Expect = 5.3, P = 1.0 Identities = 20/85 (23%), Positives = 40/85 (47%), Frame = -3 Query: 267 SXSSKNKSSVFLNAACATTNTAPMSSATRMLPMAPVRSSSTTVSTAMTISTTRSLINLPK 88 + +S + S+ +A ATT T+ ++ T +S ++ S T++ + + PK Sbjct: 27 TTTSTSASATTTTSASATTTTSASATTTTSASATTTTASPSSSSKKKTVTHYKRKVKRPK 86 Query: 87 IINGSSRRK*RKSHAAMSTTNTISE 13 + +RR+ +S S+ N SE Sbjct: 87 KVRKITRRRGLRSRNGRSSRNRRSE 111 >gi|6018876|emb|CAB58070.1| (AL121805) BACN4L24.a [Drosophila melanogaster] >gi|7290396|gb|AAF45854.1| (AE003426) ng2 gene product [Drosophila melanogaster] Length = 112 Frame -3 hits (HSPs): _______________________________________ __________________________________________________ Database sequence: | | | | 112 0 50 100 Minus Strand HSPs: Score = 72 (25.3 bits), Expect = 5.3, P = 1.0 Identities = 20/85 (23%), Positives = 40/85 (47%), Frame = -3 Query: 267 SXSSKNKSSVFLNAACATTNTAPMSSATRMLPMAPVRSSSTTVSTAMTISTTRSLINLPK 88 + +S + S+ +A ATT T+ ++ T +S ++ S T++ + + PK Sbjct: 27 TTTSTSASATTTTSASATTTTSASATTTTSASATTTTASPSSSSKKKTVTHYKRKVKRPK 86 Query: 87 IINGSSRRK*RKSHAAMSTTNTISE 13 + +RR+ +S S+ N SE Sbjct: 87 KVRKITRRRGLRSRNGRSSRNRRSE 111 >gi|11424081 ref|XP_008915.1| histidine-rich calcium-binding protein 
precursor [Homo sapiens] Length = 755 Frame -1 hits (HSPs): ___ Frame -2 hits (HSPs): ______ __________________________________________________ Database sequence: | | | | | || 755 0 150 300 450 600 750 Minus Strand HSPs: Score = 84 (29.6 bits), Expect = 5.4, Sum P(2) = 1.0 Identities = 27/80 (33%), Positives = 37/80 (46%), Frame = -2 Query: 256 EEQIIGFPQRRVRHHQHRSDEQRHEDVADGA---GEKQQHDGEHRHDDKHDAVAD*SSED 86 EE + + RH H S+E EDV+DG G +H G DD D D +D Sbjct: 259 EEASTEYGHQAHRHRGHGSEED--EDVSDGHHHHGPSHRHQGHEEDDDDDDDDDDDDDDD 316 Query: 85 Y*RFVAEEVEEEPRRHEHH--EHDQ 17 V+ E + RH+ H E D+ Sbjct: 317 D---VSIEYRHQAHRHQGHGIEEDE 338 Score = 40 (14.1 bits), Expect = 5.4, Sum P(2) = 1.0 Identities = 10/27 (37%), Positives = 16/27 (59%), Frame = -1 Query: 350 IRNSQKEAEGED-WGIQKEEHREETIE 273 +R+ + +GED G ++EE EE E Sbjct: 234 LRHGHRGHDGEDDEGEEEEEEEEEEEE 260 >gi|85349|pir||PS0274 homeotic protein box6 - sea urchin (Parechinus angulosus) (fragments) Length = 88 Frame -1 hits (HSPs): ______ Frame -2 hits (HSPs): ___________________________ Annotated Domains: _________________________________ __________________________________________________ Database sequence: | | | | | | 88 0 20 40 60 80 __________________ Annotated Domains: Entrez domain: homeobox homology #label HOX 9..65 PROSITE HOMEOBOX_1: 'Homeobox' domain signature. 41..64 __________________ Minus Strand HSPs: Score = 55 (19.4 bits), Expect = 5.8, Sum P(2) = 1.0 Identities = 12/46 (26%), Positives = 25/46 (54%), Frame = -2 Query: 250 QIIGFPQRRVR-HHQHRSDEQRHEDVADGAGEKQQHDGEHRHDDKH 116 Q + +R+++ Q+R + + E V DG G+++ + + DD H Sbjct: 43 QAVCLSERQIKIWFQNRRMKWKKERVRDGNGDEEDDEAKEGDDDVH 88 Score = 41 (14.4 bits), Expect = 5.8, Sum P(2) = 1.0 Identities = 6/11 (54%), Positives = 10/11 (90%), Frame = -1 Query: 476 EKEFHFTKHLS 444 EKEFH+ ++L+ Sbjct: 24 EKEFHYNRYLT 34 WARNING: HSPs involving 15 database sequences were not reported due to the limiting value of parameter B = 50. 
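Each HSP above is reported with both an Expect value and a P value. The two figures appear to be linked by the usual Poisson relation P = 1 - exp(-E), the probability of seeing at least one chance hit when E hits are expected. The sketch below is an illustration of that relation, not code taken from WU-BLAST; the function name `p_from_expect` is hypothetical. The report prints rounded values, so a P recomputed from a rounded Expect can differ in the last digit (for example Expect = 1.4 is printed with P = 0.74).

```python
import math


def p_from_expect(expect: float) -> float:
    """Probability of at least one chance hit, given the expected count."""
    # -expm1(-x) equals 1 - exp(-x) and stays accurate for small x.
    return -math.expm1(-expect)


# Expect values copied from HSPs in the report above.
for expect in (1.5, 2.6, 4.0, 5.1):
    print(f"Expect = {expect:<4} P = {p_from_expect(expect):.2f}")
```

For these four Expect values the recomputed P agrees with the printed P of 0.78, 0.93, 0.98 and 0.99.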
Parameters:
  filter=none
  matrix=BLOSUM62
  V=50
  B=50
  E=10
  gi
  H=1
  sort_by_pvalue
  echofilter

  ctxfactor=6.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   Std.    0   BLOSUM62                                 0.318   0.135   0.401
   +3      0   BLOSUM62        0.318   0.135   0.401    0.346   0.147   0.563
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +2      0   BLOSUM62        0.318   0.135   0.401    0.351   0.161   0.537
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +1      0   BLOSUM62        0.318   0.135   0.401    0.342   0.150   0.503
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -1      0   BLOSUM62        0.318   0.135   0.401    0.358   0.160   0.581
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -2      0   BLOSUM62        0.318   0.135   0.401    0.327   0.140   0.424
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -3      0   BLOSUM62        0.318   0.135   0.401    0.332   0.137   0.402
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E    S  W   T   X   E2     S2
   +3      0      171       170       10.  75  3  12  22  0.11    34
                                                       31  0.11    37
   +2      0      171       170       10.  75  3  12  22  0.11    34
                                                       31  0.11    37
   +1      0      171       170       10.  75  3  12  22  0.11    34
                                                       31  0.11    37
   -1      0      171       170       10.  75  3  12  22  0.11    34
                                                       31  0.11    37
   -2      0      171       170       10.  75  3  12  22  0.11    34
                                                       31  0.11    37
   -3      0      171       170       10.  75  3  12  22  0.11    34
                                                       31  0.11    37

Statistics:

  Database:  /usr/local/dot5/sl_home/beauty/seqdb/blast/nr
    Title:  nr
    Release date:  unknown
    Posted date:  4:06 PM CST Feb 28, 2001
    Format:  BLAST
  # of letters in database:  197,782,623
  # of sequences in database:  625,274
  # of database sequences satisfying E:  65
  No. of states in DFA:  587 (58 KB)
  Total size of DFA:  189 KB (192 KB)
  Time to generate neighborhood:  0.01u 0.00s 0.01t  Elapsed: 00:00:00
  No. of threads or processors used:  6
  Search cpu time:  191.94u 1.00s 192.94t  Elapsed: 00:00:33
  Total cpu time:  192.01u 1.04s 193.05t  Elapsed: 00:00:33
  Start:  Mon Oct 1 17:26:41 2001   End:  Mon Oct 1 17:27:14 2001

WARNINGS ISSUED:  2
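The HSP headers in this report give every raw score together with a normalized score in bits, for example "Score = 91 (32.0 bits)". The report does not say which Lambda is used for the conversion. The sketch below assumes the standard formula bits = Lambda * S / ln 2 and assumes the gapped Lambda of 0.244 from the "Q=9,R=2" rows of the parameter table; under those assumptions it reproduces the printed bit scores for the cases checked. The names `GAPPED_LAMBDA` and `bit_score` are hypothetical, chosen for illustration.

```python
import math

# Assumption: the gapped Lambda ("Q=9,R=2" rows) is the one used for bit scores.
GAPPED_LAMBDA = 0.244


def bit_score(raw_score: int, lam: float = GAPPED_LAMBDA) -> float:
    """Convert a raw alignment score to bits: lam * S / ln(2)."""
    return lam * raw_score / math.log(2)


# (raw score, bits as printed in the report) pairs copied from HSP headers.
reported = [(91, 32.0), (84, 29.6), (72, 25.3), (40, 14.1)]
for raw, printed_bits in reported:
    print(f"Score = {raw:3d}  computed = {bit_score(raw):.1f} bits  "
          f"printed = {printed_bits} bits")
```

Using the ungapped Lambda of 0.318 instead would give about 41.7 bits for a raw score of 91, which does not match the printed 32.0 bits; that is the reason for the gapped-Lambda assumption here.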
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000