WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker Server unavailable.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= B03A12.seq(194>845); (634 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 3 Sequences : less than 3 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 687 148 |================================================= 6310 539 115 |====================================== 3980 424 121 |======================================== 2510 303 79 |========================== 1580 224 79 |========================== 1000 145 47 |=============== 631 98 30 |========== 398 68 20 |====== 251 48 16 |===== 158 32 8 |== 100 24 4 |= 63.1 20 5 |= 39.8 15 2 |: 25.1 13 0 | 15.8 13 1 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 12 <<<<<<<<<<<<<<<<< 10.0 12 1 |: 6.31 11 2 |: 3.98 9 0 | 2.51 9 0 | 1.58 9 0 | 1.00 9 0 | 0.63 9 0 | 0.40 9 0 | 0.25 9 0 | 0.16 9 0 | 0.10 9 0 | 0.063 9 0 | 0.040 9 1 |: 0.025 8 0 | 0.016 8 2 |: 0.010 6 0 | 0.0063 6 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|11357570|pir||T51370hypothetical protein F1N13_10 ... -2 642 7.0e-62 1 gi|7488646|pir||T07142beta-glucan-elicitor receptor -... -2 625 4.4e-60 1 gi|5019452|emb|CAA71307.1|(Y10257) beta-glucan bindin... -2 625 4.4e-60 1 gi|6625560|gb|AAF19265.1|AF088188_1(AF088188) beta-gl... -2 601 1.5e-57 1 gi|6714292|gb|AAF25988.1|AC013354_7(AC013354) F15H18.... -2 275 1.2e-40 2 gi|3850142|emb|CAA21943.1|(AL033391) conserved hypoth... -2 112 0.0059 1 gi|6562336|emb|CAB62580.1|(AJ251465) endo-1,3-beta-gl... -2 112 0.010 1 gi|6562334|emb|CAB62579.1|(AJ251464) endo-1,3-beta-gl... -2 113 0.013 1 gi|10937881|gb|AAF13033.2|AF121133_1(AF121133) beta(1... -2 108 0.030 1 gi|2723863|dbj|BAA24094.1|(D63890) membrane protein [... +3 62 0.997 2 gi|2723867|dbj|BAA24096.1|(D63892) membrane protein [... +3 62 0.997 2 gi|9631499ref|NP_048149.1| ORF MSV078 hypothetical pr... +1 61 0.99995 1
Use the and icons to retrieve links to Entrez:
>gi|11357570|pir||T51370 hypothetical protein F1N13_10 - Arabidopsis thaliana >gi|9755633|emb|CAC01786.1| (AL391145) putative protein [Arabidopsis thaliana] Length = 745 Frame -2 hits (HSPs): ____________ __________________________________________________ Database sequence: | | | | | | 745 0 150 300 450 600 Minus Strand HSPs: Score = 642 (226.0 bits), Expect = 7.0e-62, P = 7.0e-62 Identities = 115/170 (67%), Positives = 139/170 (81%), Frame = -2 Query: 633 LEIHAAQMWWHVKEGDNQYDEEFGKENKVVGVLWANKRDSGLWFAPPEWRECRLGIQLLP 454 LEIHAA+MWW VKE D Y ++F EN+VVGVLW+ KRDSGLWFAP EW+ECRLGIQLLP Sbjct: 576 LEIHAAKMWWQVKEDDAIYPQDFTSENRVVGVLWSTKRDSGLWFAPKEWKECRLGIQLLP 635 Query: 453 ILPISEVLFSNVDFVKELVEWTLPALNREGVGEGWKGFVYALQGIYDDEGAMQKIRSLNG 274 +LP+SEVLFS+V FVK+LV WT+PAL R+ VGEGWKGFVYAL+ +YD +GAM+KI+ LNG Sbjct: 636 LLPVSEVLFSDVTFVKQLVNWTMPALARDSVGEGWKGFVYALESMYDKDGAMEKIKGLNG 695 Query: 273 FDDGNSLTNLLWWIHSRS--DDEEF---------GHG---KHCWFGHYCH 166 FDDGNSL+NLLWW+HSR+ DD+++ GHG K+C FGHYCH Sbjct: 696 FDDGNSLSNLLWWVHSRNNDDDDDYEEDDEGGYGGHGGGGKYCSFGHYCH 745 >gi|7488646|pir||T07142 beta-glucan-elicitor receptor - soybean >gi|1752734|dbj|BAA11407.1| (D78510) beta-glucan-elicitor receptor [Glycine max] Length = 668 Frame -2 hits (HSPs): ___________ __________________________________________________ Database sequence: | | | | | | 668 0 150 300 450 600 Minus Strand HSPs: Score = 625 (220.0 bits), Expect = 4.4e-60, P = 4.4e-60 Identities = 109/141 (77%), Positives = 127/141 (90%), Frame = -2 Query: 633 LEIHAAQMWWHVKEGDNQYDEEFGKENKVVGVLWANKRDSGLWFAPPEWRECRLGIQLLP 454 LEI +MWWHVKEG Y++EF +EN+V+GVLW+NKRD+GLWFAP EW+ECRLGIQLLP Sbjct: 528 LEIEGTKMWWHVKEGGTLYEKEFTQENRVMGVLWSNKRDTGLWFAPAEWKECRLGIQLLP 587 Query: 453 ILPISEVLFSNVDFVKELVEWTLPALNREG-VGEGWKGFVYALQGIYDDEGAMQKIRSLN 277 + PISE +FSNVDFVKELVEWTLPAL+REG VGEGWKGFVYAL+G+YD+E A+QKIR+L Sbjct: 588 LAPISEAIFSNVDFVKELVEWTLPALDREGGVGEGWKGFVYALEGVYDNESALQKIRNLK 647 Query: 276 GFDDGNSLTNLLWWIHSRSDD 214 GFD GNSLTNLLWWIHSRSD+ Sbjct: 648 GFDGGNSLTNLLWWIHSRSDE 668 >gi|5019452|emb|CAA71307.1| (Y10257) beta-glucan binding protein [Glycine max] Length = 668 Frame -2 hits (HSPs): ___________ __________________________________________________ Database sequence: | | | | | | 668 0 150 300 450 600 Minus Strand HSPs: Score = 625 (220.0 bits), Expect = 4.4e-60, P = 4.4e-60 Identities = 109/141 (77%), Positives = 127/141 (90%), Frame = -2 Query: 633 LEIHAAQMWWHVKEGDNQYDEEFGKENKVVGVLWANKRDSGLWFAPPEWRECRLGIQLLP 454 LEI +MWWHVKEG Y++EF +EN+V+GVLW+NKRD+GLWFAP EW+ECRLGIQLLP Sbjct: 528 LEIEGTKMWWHVKEGGTLYEKEFTQENRVMGVLWSNKRDTGLWFAPAEWKECRLGIQLLP 587 Query: 453 ILPISEVLFSNVDFVKELVEWTLPALNREG-VGEGWKGFVYALQGIYDDEGAMQKIRSLN 277 + PISE +FSNVDFVKELVEWTLPAL+REG VGEGWKGFVYAL+G+YD+E A+QKIR+L Sbjct: 588 LAPISEAIFSNVDFVKELVEWTLPALDREGGVGEGWKGFVYALEGVYDNESALQKIRNLK 647 Query: 276 GFDDGNSLTNLLWWIHSRSDD 214 GFD GNSLTNLLWWIHSRSD+ Sbjct: 648 GFDGGNSLTNLLWWIHSRSDE 668 >gi|6625560|gb|AAF19265.1|AF088188_1 (AF088188) beta-glucan binding protein [Phaseolus vulgaris] Length = 662 Frame -2 hits (HSPs): ___________ __________________________________________________ Database sequence: | | | | | | 662 0 150 300 450 600 Minus Strand HSPs: Score = 601 (211.6 bits), Expect = 1.5e-57, P = 1.5e-57 Identities = 102/136 (75%), Positives = 123/136 (90%), Frame = -2 Query: 633 LEIHAAQMWWHVKEGDNQYDEEFGKENKVVGVLWANKRDSGLWFAPPEWRECRLGIQLLP 454 LEI +MWWHV+EG + Y+EEF +EN+++GVLW+NKRDSGLWFAP EW+ECRLGIQLLP Sbjct: 523 LEILGTKMWWHVEEGGSLYEEEFTRENRIMGVLWSNKRDSGLWFAPAEWKECRLGIQLLP 582 Query: 453 ILPISEVLFSNVDFVKELVEWTLPALNREGVGEGWKGFVYALQGIYDDEGAMQKIRSLNG 274 + PISE +FSN ++VK+LVEWTLPALNR+GVGEGWKGFVYAL+GIYD+E A+QKIR+L G Sbjct: 583 LAPISEAIFSNAEYVKQLVEWTLPALNRDGVGEGWKGFVYALEGIYDNESALQKIRNLAG 642 Query: 273 FDDGNSLTNLLWWIHS 226 FD GNSL+NLLWWIHS Sbjct: 643 FDGGNSLSNLLWWIHS 658 >gi|6714292|gb|AAF25988.1|AC013354_7 (AC013354) F15H18.17 [Arabidopsis thaliana] Length = 630 Frame -2 hits (HSPs): __________ __________________________________________________ Database sequence: | | | | | | 630 0 150 300 450 600 Minus Strand HSPs: Score = 275 (96.8 bits), Expect = 1.2e-40, Sum P(2) = 1.2e-40 Identities = 46/64 (71%), Positives = 57/64 (89%), Frame = -2 Query: 414 FVKELVEWTLPALNREGVGEGWKGFVYALQGIYDDEGAMQKIRSLNGFDDGNSLTNLLWW 235 FVK+LV WTLPAL R GVGEGWKGF+YAL+ +YD +GA++KI+ LN +DDGNSL+NLLWW Sbjct: 566 FVKQLVNWTLPALQRNGVGEGWKGFLYALESLYDKDGAIKKIKRLNMYDDGNSLSNLLWW 625 Query: 234 IHSR 223 +HSR Sbjct: 626 VHSR 629 Score = 200 (70.4 bits), Expect = 1.2e-40, Sum P(2) = 1.2e-40 Identities = 33/51 (64%), Positives = 41/51 (80%), Frame = -2 Query: 633 LEIHAAQMWWHVKEGDNQYDEEFGKENKVVGVLWANKRDSGLWFAPPEWRE 481 LEIHAA+MWW VK+G+ Y ++F EN+VVGVLW+ KRDS LWF P EW+E Sbjct: 514 LEIHAAKMWWQVKKGEALYPKDFTAENRVVGVLWSTKRDSSLWFGPKEWKE 564 >gi|3850142|emb|CAA21943.1| (AL033391) conserved hypothetical protein [Candida albicans] Length = 481 Frame -2 hits (HSPs): __________ __________________________________________________ Database sequence: | | | | | 481 0 150 300 450 Minus Strand HSPs: Score = 112 (39.4 bits), Expect = 0.0059, P = 0.0059 Identities = 33/85 (38%), Positives = 48/85 (56%), Frame = -2 Query: 555 NKVVGVLWANKRDSGLWFAP----PEWRECRLGIQLLPILPISEVLFSNVDFVKELVEWT 388 NKV G+L+ NK D +F EW GI +LPI P+S F + FVKE EW Sbjct: 366 NKVSGILFENKIDYATYFGRGTIGDEWIH---GIHMLPITPVSSY-FRSSKFVKE--EWD 419 Query: 387 --LPALNREGVGEGWKGFVYALQGIYDDEGA 301 L A+ + + +GWKG + + ++D + A Sbjct: 420 EKLGAIVDQ-IPDGWKGILMLNKALFDPKSA 449 >gi|6562336|emb|CAB62580.1| (AJ251465) endo-1,3-beta-glucanase [Candida albicans] Length = 734 Frame -2 hits (HSPs): ______ __________________________________________________ Database sequence: | | | | | | 734 0 150 300 450 600 Minus Strand HSPs: Score = 112 (39.4 bits), Expect = 0.010, P = 0.010 Identities = 33/85 (38%), Positives = 48/85 (56%), Frame = -2 Query: 555 NKVVGVLWANKRDSGLWFAP----PEWRECRLGIQLLPILPISEVLFSNVDFVKELVEWT 388 NKV G+L+ NK D +F EW GI +LPI P+S F + FVKE EW Sbjct: 619 NKVSGILFENKIDYATYFGRGTIGDEWIH---GIHMLPITPVSSY-FRSSKFVKE--EWD 672 Query: 387 --LPALNREGVGEGWKGFVYALQGIYDDEGA 301 L A+ + + +GWKG + + ++D + A Sbjct: 673 EKLGAIVDQ-IPDGWKGILMLNKALFDPKSA 702 >gi|6562334|emb|CAB62579.1| (AJ251464) endo-1,3-beta-glucanase [Candida albicans] >gi|6562338|emb|CAB62581.1| (AJ251504) endo-1,3-beta-glucanase [Candida albicans] Length = 1145 Frame -2 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | | | | | | 1145 0 150 300 450 600 750 900 1050 Minus Strand HSPs: Score = 113 (39.8 bits), Expect = 0.013, P = 0.013 Identities = 36/116 (31%), Positives = 56/116 (48%), Frame = -2 Query: 612 MWWHVKEGDNQYDEEFGKENKVVGVLWANKRDSGLWFAPP-EWRECRLGIQLLPILPISE 436 M+++ K DN + + NKV G+ + NK D +F P + E GI +LPI P S Sbjct: 1010 MYFYYKS-DNTVEPKQILPNKVSGIFFENKVDYTTFFGTPADHPEYVHGIHMLPITPSSS 1068 Query: 435 VLFSNVDFVKELVEWTLPALNR-EGVGEGWKGFVYALQGIYDDEGAMQKIRSLNGFDD 265 L +V+E EW + V GW G + Q ++D + + + S N +DD Sbjct: 1069 -LVRKTSYVQE--EWKDQIAGFIDNVDSGWTGILRLNQALFDPKSSYEFFAS-NNWDD 1122 >gi|10937881|gb|AAF13033.2|AF121133_1 (AF121133) beta(1-3)endoglucanase [Aspergillus fumigatus] Length = 727 Frame -2 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | | 727 0 150 300 450 600 Minus Strand HSPs: Score = 108 (38.0 bits), Expect = 0.030, P = 0.030 Identities = 36/81 (44%), Positives = 43/81 (53%), Frame = -2 Query: 555 NKVVGVLWANKRDSGLWFAPPEWRECRLGIQLLPILPISEVLFSNVDFVKELVEW-TLPA 379 NKV G+L+ NK D +F E GI +LPILP S S FVKE EW + A Sbjct: 609 NKVTGILFENKVDHTTYFGNN--LEYIQGIHMLPILPCSAFTRSK-QFVKE--EWDAMFA 663 Query: 378 LN----REGVGEGWKGFVYALQGIYD 313 N E V GWKG +YA + D Sbjct: 664 SNGPDPAENVVGGWKGVLYANLALVD 689 >gi|2723863|dbj|BAA24094.1| (D63890) membrane protein [human herpesvirus 5] Length = 267 Frame 3 hits (HSPs): __________ Frame 1 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | | | 267 0 50 100 150 200 250 Plus Strand HSPs: Score = 62 (21.8 bits), Expect = 5.8, Sum P(2) = 1.0 Identities = 17/47 (36%), Positives = 22/47 (46%), Frame = +3 Query: 291 SSALHLHRHKYPAEHRQTPSILLQHLPYSKQAMSTPQAPS-QNQHWKT 431 +S H HRH P E + + H+ A TP PS Q+QH T Sbjct: 176 TSTTH-HRHSNPKESTTPKTHVEHHVGLGATAAETPLQPSPQHQHVAT 222 Score = 58 (20.4 bits), Expect = 5.8, Sum P(2) = 1.0 Identities = 12/33 (36%), Positives = 20/33 (60%), Frame = +1 Query: 37 TTHILLNKNCKKQTSDIYRNL--FLYSARNYHN 129 T H LL N ++ +D YR + F++ + N+HN Sbjct: 91 TNHSLLLINVTEEYTDYYRTMTTFVHRSHNWHN 123 >gi|2723867|dbj|BAA24096.1| (D63892) membrane protein [human herpesvirus 5] >gi|2723875|dbj|BAA24100.1| (D63896) membrane protein [human herpesvirus 5] Length = 267 Frame 3 hits (HSPs): __________ Frame 1 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | | | 267 0 50 100 150 200 250 Plus Strand HSPs: Score = 62 (21.8 bits), Expect = 5.8, Sum P(2) = 1.0 Identities = 17/47 (36%), Positives = 22/47 (46%), Frame = +3 Query: 291 SSALHLHRHKYPAEHRQTPSILLQHLPYSKQAMSTPQAPS-QNQHWKT 431 +S H HRH P E + + H+ A TP PS Q+QH T Sbjct: 176 TSTTH-HRHSNPKESTTPKTHVEHHVGLGATAAETPLQPSPQHQHVAT 222 Score = 58 (20.4 bits), Expect = 5.8, Sum P(2) = 1.0 Identities = 12/33 (36%), Positives = 20/33 (60%), Frame = +1 Query: 37 TTHILLNKNCKKQTSDIYRNL--FLYSARNYHN 129 T H LL N ++ +D YR + F++ + N+HN Sbjct: 91 TNHSLLLINVTEEYTDYYRTMTTFVHRSHNWHN 123 >gi|9631499 ref|NP_048149.1| ORF MSV078 hypothetical protein [Melanoplus sanguinipes entomopoxvirus] >gi|11362310|pir||T28239 ORF MSV078 hypothetical protein - Melanoplus sanguinipes entomopoxvirus >gi|4049852|gb|AAC97812.1| (AF063866) ORF MSV078 hypothetical protein [Melanoplus sanguinipes entomopoxvirus] Length = 67 Frame 1 hits (HSPs): ___________________________ __________________________________________________ Database sequence: | | | | | 67 0 20 40 60 Plus Strand HSPs: Score = 61 (21.5 bits), Expect = 9.9, P = 1.0 Identities = 13/38 (34%), Positives = 24/38 (63%), Frame = +1 Query: 16 FFFFFKSTTHILLNKNCKKQTSDIYRNLFLYSARNYHN 129 FF F ++ ++ N C K TS I++ F+++A NY++ Sbjct: 32 FFIFNVNSIFLVFNIACMKFTSHIFK--FIFNAINYNS 67 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.99 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.346 0.148 0.507 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.354 0.155 0.544 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.342 0.149 0.473 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.368 0.164 0.710 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.334 0.149 0.523 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.361 0.162 0.562 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 210 210 10. 77 3 12 22 0.099 35 31 0.11 38 +2 0 211 211 10. 77 3 12 22 0.10 35 31 0.11 38 +1 0 211 211 10. 77 3 12 22 0.10 35 31 0.11 38 -1 0 211 211 10. 77 3 12 22 0.10 35 31 0.11 38 -2 0 211 211 10. 77 3 12 22 0.10 35 31 0.11 38 -3 0 210 210 10. 77 3 12 22 0.099 35 31 0.11 38 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 12 No. of states in DFA: 593 (58 KB) Total size of DFA: 248 KB (256 KB) Time to generate neighborhood: 0.02u 0.00s 0.02t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 214.30u 1.17s 215.47t Elapsed: 00:00:50 Total cpu time: 214.34u 1.18s 215.52t Elapsed: 00:00:50 Start: Wed Jan 30 17:35:19 2002 End: Wed Jan 30 17:36:09 2002
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000