WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= 'D05F06.seq' (409 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 3 Sequences : less than 3 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 715 166 |======================================================= 6310 549 113 |===================================== 3980 436 101 |================================= 2510 335 128 |========================================== 1580 207 90 |============================== 1000 117 31 |========== 631 86 34 |=========== 398 52 30 |========== 251 22 11 |=== 158 11 4 |= 100 7 0 | 63.1 7 3 |= 39.8 4 1 |: 25.1 3 1 |: 15.8 2 0 | >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 2 <<<<<<<<<<<<<<<<< 10.0 2 0 | 6.31 2 0 | 3.98 2 0 | 2.51 2 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|9758595|dbj|BAB09228.1|(AB009050) gene_id:MDF20.5~... +1 203 3.7e-15 1 gi|9798570|emb|CAC03525.1|(AJ276410) BlpN protein [St... +1 63 0.91 1
Use the and icons to retrieve links to Entrez:
>gi|9758595|dbj|BAB09228.1| (AB009050) gene_id:MDF20.5~unknown protein [Arabidopsis thaliana] Length = 329 Frame 1 hits (HSPs): __________ __________________________________________________ Database sequence: | | | | 329 0 150 300 Plus Strand HSPs: Score = 203 (71.5 bits), Expect = 3.7e-15, P = 3.7e-15 Identities = 38/60 (63%), Positives = 46/60 (76%), Frame = +1 Query: 16 LGQKVFIATVRCTSSLIFASIGAGIGATLIRPSLGQWVGCALGDLGGPIIVAFCADKLFQ 195 LG KV TVRC +SL+FA+IGAGI + LIRPS GQW+GCALGDL GP++V+ C K Q Sbjct: 267 LGNKVVGITVRCGASLVFAAIGAGICSCLIRPSTGQWIGCALGDLAGPMVVSVCLQKTLQ 326 >gi|9798570|emb|CAC03525.1| (AJ276410) BlpN protein [Streptococcus pneumoniae] >gi|11611211|emb|CAC18598.1| (AJ278419) putative bacteriocin [Streptococcus pneumoniae] Length = 67 Frame 1 hits (HSPs): _____________________________ __________________________________________________ Database sequence: | | | | | 67 0 20 40 60 Plus Strand HSPs: Score = 63 (22.2 bits), Expect = 2.4, P = 0.91 Identities = 12/39 (30%), Positives = 20/39 (51%), Frame = +1 Query: 64 IFASIGAGIGATLIRPSLGQWVGCALGDLGGPIIVAFCA 180 + A++G G LG W G A+G +GG ++ + A Sbjct: 25 VVAALGCAAGGVKYGRLLGPW-GAAIGGIGGAVVCGYLA 62 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.99 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.355 0.158 0.654 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.363 0.168 0.574 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.356 0.162 0.575 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.373 0.168 0.638 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.338 0.143 0.461 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.340 0.147 0.500 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 135 135 10. 73 3 12 22 0.11 33 30 0.099 36 +2 0 136 136 10. 73 3 12 22 0.11 33 30 0.10 36 +1 0 136 136 10. 73 3 12 22 0.11 33 30 0.10 36 -1 0 136 136 10. 73 3 12 22 0.11 33 30 0.10 36 -2 0 136 136 10. 73 3 12 22 0.11 33 30 0.10 36 -3 0 135 135 10. 73 3 12 22 0.11 33 30 0.099 36 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 2 No. of states in DFA: 594 (59 KB) Total size of DFA: 173 KB (192 KB) Time to generate neighborhood: 0.01u 0.00s 0.01t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 130.83u 1.17s 132.00t Elapsed: 00:00:27 Total cpu time: 130.86u 1.19s 132.05t Elapsed: 00:00:27 Start: Wed Jan 16 15:51:27 2002 End: Wed Jan 16 15:51:54 2002
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000