WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1.
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84.
RepeatMasker repeats found in sequence: No Repeats Found.

Reference: Gish, Warren (1994-1997). unpublished.
Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.

Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= 'E11A07_A07_02.ab1' (456 letters)
Translating both strands of query sequence in all 6 reading frames

Database: nr
          625,274 sequences; 197,782,623 total letters.

     Observed Numbers of Database Sequences Satisfying
    Various EXPECTation Thresholds (E parameter values)

        Histogram units:      = 3 Sequences     : less than 3 sequences

 EXPECTation Threshold
 (E parameter)
    |
    V   Observed Counts-->
  10000 627 118 |=======================================
   6310 509  89 |=============================
   3980 420 137 |=============================================
   2510 283  95 |===============================
   1580 188  43 |==============
   1000 145  43 |==============
    631 102  36 |============
    398  66  29 |=========
    251  37   7 |==
    158  30  11 |===
    100  19   6 |==
   63.1  13   4 |=
   39.8   9   3 |=
   25.1   6   0 |
   15.8   6   3 |=
 >>>>>>>>>>>>>>>>>>>>>  Expect = 10.0, Observed = 3  <<<<<<<<<<<<<<<<<
   10.0   3   0 |
   6.31   3   1 |:
   3.98   2   0 |
   2.51   2   1 |:
   1.58   1   0 |
   1.00   1   0 |
   0.63   1   0 |
   0.40   1   0 |
   0.25   1   0 |
   0.16   1   0 |
   0.10   1   0 |
  0.063   1   0 |
  0.040   1   0 |
  0.025   1   0 |
  0.016   1   0 |
  0.010   1   0 |
 0.0063   1   1 |:

                                                                       Smallest
                                                                         Sum
                                                       Reading  High  Probability
Sequences producing High-scoring Segment Pairs:          Frame Score  P(N)      N

gi|320932|pir||A44971 hypothetical protein 1 - Plasmod...   -3    68  0.0052    2
gi|862343|gb|AAA68426.1| (L10908) Gcap1 gene product [...   -3    65  0.83      1
gi|8778738|gb|AAF79746.1|AC009317_5 (AC009317) T30E16....   -3    61  0.994     1
>gi|320932|pir||A44971 hypothetical protein 1 - Plasmodium brasilianum
            Length = 96

[HSP location map: Frame -2 and Frame -3 hits along the database sequence, scale 0 to 96]

  Minus Strand HSPs:

 Score = 68 (23.9 bits), Expect = 0.0053, Sum P(2) = 0.0052
 Identities = 14/33 (42%), Positives = 17/33 (51%), Frame = -3

Query:   235 HTYISYIYIHLFHLIHVYISYIYNHTEHIK*KH 137
             H+Y SY   HL+H  H Y SY   H+ H    H
Sbjct:    32 HSYHSYHSYHLYHSYHSYHSYHSYHSYHSHHSH 64

 Score = 67 (23.6 bits), Expect = 0.019, Sum P(2) = 0.019
 Identities = 13/28 (46%), Positives = 16/28 (57%), Frame = -3

Query:   235 HTYISYIYIHLFHLIHVYISYIYNHTEH 152
             H+Y SY   H +HL H Y SY   H+ H
Sbjct:    29 HSYHSYHSYHSYHLYHSYHSYHSYHSYH 56

 Score = 65 (22.9 bits), Expect = 0.061, Sum P(2) = 0.059
 Identities = 13/33 (39%), Positives = 16/33 (48%), Frame = -3

Query:   235 HTYISYIYIHLFHLIHVYISYIYNHTEHIK*KH 137
             H+Y SY   H +H  H Y SY   H+ H    H
Sbjct:    35 HSYHSYHLYHSYHSYHSYHSYHSYHSHHSHHSH 67

 Score = 62 (21.8 bits), Expect = 0.33, Sum P(2) = 0.28
 Identities = 12/28 (42%), Positives = 15/28 (53%), Frame = -3

Query:   235 HTYISYIYIHLFHLIHVYISYIYNHTEH 152
             H Y SY   H +H  H+Y SY   H+ H
Sbjct:    26 HPYHSYHSYHSYHSYHLYHSYHSYHSYH 53

 Score = 61 (21.5 bits), Expect = 0.60, Sum P(2) = 0.45
 Identities = 12/33 (36%), Positives = 16/33 (48%), Frame = -3

Query:   235 HTYISYIYIHLFHLIHVYISYIYNHTEHIK*KH 137
             H+Y  Y   H +H  H Y SY  +H+ H    H
Sbjct:    38 HSYHLYHSYHSYHSYHSYHSYHSHHSHHSHHSH 70

 Score = 60 (21.1 bits), Expect = 0.89, Sum P(2) = 0.59
 Identities = 12/33 (36%), Positives = 16/33 (48%), Frame = -3

Query:   235 HTYISYIYIHLFHLIHVYISYIYNHTEHIK*KH 137
             H Y SY   H +H  H Y S+  +H+ H    H
Sbjct:    41 HLYHSYHSYHSYHSYHSYHSHHSHHSHHSHHSH 73

 Score = 42 (14.8 bits), Expect = 0.0053, Sum P(2) = 0.0052
 Identities = 9/12 (75%), Positives = 9/12 (75%), Frame = -2

Query:   371 LTHFSQIFKFIF 336
             L HFS IF FIF
Sbjct:     5 LAHFSCIFIFIF 16

>gi|862343|gb|AAA68426.1| (L10908) Gcap1 gene product [Mus musculus]
>gi|1092097|prf||2022314A granule cell marker protein [Mus musculus]
>gi|1092098|prf||2022315A granule cell marker protein [Mus musculus]
            Length = 85

[HSP location map: Frame 3 and Frame -3 hits along the database sequence, scale 0 to 85]

  Plus Strand HSPs:

 Score = 62 (21.8 bits), Expect = 3.9, P = 0.98
 Identities = 16/63 (25%), Positives = 34/63 (53%), Frame = +3

Query:   180 MYTCIKWKRCMYIY---EMYVCICFHL-IFMXYVLTL*FPLFLQSH*YG-VYVF*FVKYK 344
             +Y C+    C+Y+Y    +Y+CI  +L I++   + L    ++ +H +   Y++ ++ Y
Sbjct:     3 VYMCLCVCLCVYVYACVSLYICISIYLSIYLSISIYLSIYTYIHTHTHTHTYIYIYI-YI 61

Query:   345 LKYL 356
               YL
Sbjct:    62 YIYL 65

  Minus Strand HSPs:

 Score = 65 (22.9 bits), Expect = 1.8, P = 0.83
 Identities = 12/24 (50%), Positives = 17/24 (70%), Frame = -3

Query:   241 HMHTYIS---YIYIHLFHLIHVYI 179
             H HTYI    YIYI+L+  +H+Y+
Sbjct:    50 HTHTYIYIYIYIYIYLYLCMHIYV 73

>gi|8778738|gb|AAF79746.1|AC009317_5 (AC009317) T30E16.7 [Arabidopsis thaliana]
            Length = 35

[HSP location map: Frame -3 hits along the database sequence, scale 0 to 35]

  Minus Strand HSPs:

 Score = 61 (21.5 bits), Expect = 5.2, P = 0.99
 Identities = 14/25 (56%), Positives = 19/25 (76%), Frame = -3

Query:   223 SYIYIHLFHL-IHVYISYIYNHTEHI 149
             SY Y+++  L I++YI YIYNH  HI
Sbjct:    11 SYAYVYICKLYIYIYI-YIYNHI-HI 34

Parameters:
  filter=none
  matrix=BLOSUM62
  V=50
  B=50
  E=10
  gi
  H=1
  sort_by_pvalue
  echofilter

  ctxfactor=5.97

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   Std.    0   BLOSUM62                                 0.318   0.135   0.401
   +3      0   BLOSUM62        0.318   0.135   0.401    0.371   0.165   0.633
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +2      0   BLOSUM62        0.318   0.135   0.401    0.382   0.175   0.649
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +1      0   BLOSUM62        0.318   0.135   0.401    0.374   0.175   0.630
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -1      0   BLOSUM62        0.318   0.135   0.401    0.371   0.173   0.598
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -2      0   BLOSUM62        0.318   0.135   0.401    0.375   0.170   0.587
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -3      0   BLOSUM62        0.318   0.135   0.401    0.365   0.168   0.600
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S  W   T   X   E2     S2
   +3      0      151       150       10.   74  3  12  22  0.092   34
                                                       30  0.12    36
   +2      0      151       150       10.   74  3  12  22  0.092   34
                                                       30  0.12    36
   +1      0      152       151       10.   74  3  12  22  0.093   34
                                                       30  0.12    36
   -1      0      152       151       10.   74  3  12  22  0.093   34
                                                       30  0.12    36
   -2      0      151       150       10.   74  3  12  22  0.092   34
                                                       30  0.12    36
   -3      0      151       150       10.   74  3  12  22  0.092   34
                                                       30  0.12    36

Statistics:

  Database:  /usr/local/dot5/sl_home/beauty/seqdb/blast/nr
    Title:  nr
    Release date:  unknown
    Posted date:  4:06 PM CST Feb 28, 2001
    Format:  BLAST
  # of letters in database:  197,782,623
  # of sequences in database:  625,274
  # of database sequences satisfying E:  3
  No. of states in DFA:  586 (58 KB)
  Total size of DFA:  173 KB (192 KB)
  Time to generate neighborhood:  0.01u 0.00s 0.01t  Elapsed: 00:00:00
  No. of threads or processors used:  6
  Search cpu time:  124.59u 1.11s 125.70t  Elapsed: 00:00:31
  Total cpu time:  124.61u 1.13s 125.74t  Elapsed: 00:00:31
  Start:  Wed Jan 23 15:28:20 2002   End:  Wed Jan 23 15:28:51 2002
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000
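
Note on reading the scores: the bit scores and P values in this report appear consistent with two simple relations, sketched below in Python. This is an inference from the printed numbers, not a statement of how the search program computes them. The sketch assumes bits = Lambda * S / ln 2 using the Lambda = 0.244 value from the "Q=9,R=2" rows of the parameter table, and P = 1 - exp(-E) for the single-HSP hits. The function names are hypothetical and chosen only for illustration.

```python
import math

# Assumption: Lambda taken from the "Q=9,R=2" rows of the parameter table.
GAPPED_LAMBDA = 0.244


def bits_from_raw(raw_score, lam=GAPPED_LAMBDA):
    """Convert a raw score S to bits as lam * S / ln 2 (inferred relation)."""
    return lam * raw_score / math.log(2)


def p_from_expect(expect):
    """Poisson probability of at least one chance hit: P = 1 - exp(-E)."""
    # expm1 keeps precision when E is very small.
    return -math.expm1(-expect)


if __name__ == "__main__":
    # Raw scores listed in the alignments of this report.
    for raw in (68, 67, 65, 62, 61, 60, 42):
        print(f"Score = {raw} ({bits_from_raw(raw):.1f} bits)")
    # Expect values of the three single-HSP alignments.
    for expect in (3.9, 1.8, 5.2):
        print(f"Expect = {expect}, P = {p_from_expect(expect):.2f}")
```

Run as a script, this reproduces the bit values shown next to each raw score (for example 68 gives 23.9 bits) and the P values 0.98, 0.83 and 0.99 reported for Expect = 3.9, 1.8 and 5.2. The Sum P(2) values for the first hit combine two HSPs and are not modelled here.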