WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= 'D16H02_P14_15.ab1' (361 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 5 Sequences : less than 5 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 946 273 |====================================================== 6310 673 174 |================================== 3980 499 115 |======================= 2510 384 105 |===================== 1580 279 121 |======================== 1000 158 61 |============ 631 97 34 |====== 398 63 25 |===== 251 38 21 |==== 158 17 7 |= 100 10 2 |: 63.1 8 1 |: 39.8 7 3 |: 25.1 4 0 | 15.8 4 1 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 3 <<<<<<<<<<<<<<<<< 10.0 3 0 | 6.31 3 0 | 3.98 3 1 |: 2.51 2 0 | 1.58 2 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|9758595|dbj|BAB09228.1|(AB009050) gene_id:MDF20.5~... +3 199 1.1e-14 1 gi|9798570|emb|CAC03525.1|(AJ276410) BlpN protein [St... +3 63 0.79 1 gi|109294|pir||S17872translation initiation factor eI... +3 61 0.93 1
Use the and icons to retrieve links to Entrez:
>gi|9758595|dbj|BAB09228.1| (AB009050) gene_id:MDF20.5~unknown protein [Arabidopsis thaliana] Length = 329 Frame 3 hits (HSPs): ___________ __________________________________________________ Database sequence: | | | | 329 0 150 300 Plus Strand HSPs: Score = 199 (70.1 bits), Expect = 1.1e-14, P = 1.1e-14 Identities = 38/66 (57%), Positives = 47/66 (71%), Frame = +3 Query: 9 TNQVGVLGQKVFIAXVRCTSSLIFASIGAGIGATLIRPSLGQWVGCALGDLGGPIIVAFC 188 + Q LG KV VRC +SL+FA+IGAGI + LIRPS GQW+GCALGDL GP++V+ C Sbjct: 261 SQQAKDLGNKVVGITVRCGASLVFAAIGAGICSCLIRPSTGQWIGCALGDLAGPMVVSVC 320 Query: 189 ADKLFQ 206 K Q Sbjct: 321 LQKTLQ 326 >gi|9798570|emb|CAC03525.1| (AJ276410) BlpN protein [Streptococcus pneumoniae] >gi|11611211|emb|CAC18598.1| (AJ278419) putative bacteriocin [Streptococcus pneumoniae] Length = 67 Frame 3 hits (HSPs): _____________________________ __________________________________________________ Database sequence: | | | | | 67 0 20 40 60 Plus Strand HSPs: Score = 63 (22.2 bits), Expect = 1.6, P = 0.79 Identities = 12/39 (30%), Positives = 20/39 (51%), Frame = +3 Query: 75 IFASIGAGIGATLIRPSLGQWVGCALGDLGGPIIVAFCA 191 + A++G G LG W G A+G +GG ++ + A Sbjct: 25 VVAALGCAAGGVKYGRLLGPW-GAAIGGIGGAVVCGYLA 62 >gi|109294|pir||S17872 translation initiation factor eIF-2 gamma chain - rabbit Length = 78 Frame 3 hits (HSPs): ______________________________ __________________________________________________ Database sequence: | | | | | 78 0 20 40 60 Plus Strand HSPs: Score = 61 (21.5 bits), Expect = 2.6, P = 0.93 Identities = 16/49 (32%), Positives = 24/49 (48%), Frame = +3 Query: 18 VGVLGQKVFIAXVRCTSSLIFASIGAGIGATLIRPSLGQWVGCALGDLG 164 +G L +++ V+ I + G G TL +P L WVG LG +G Sbjct: 23 IGSLSTGGWVSAVKADLGKIVLTAGGEAGVTLGQPXL--WVGQVLGAVG 69 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.95 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.355 0.161 0.550 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.357 0.157 0.685 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.353 0.162 0.530 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.343 0.148 0.519 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.373 0.167 0.662 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.337 0.141 0.434 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 119 117 10. 71 3 12 22 0.094 33 29 0.10 35 +2 0 120 118 10. 71 3 12 22 0.095 33 29 0.10 35 +1 0 120 119 10. 71 3 12 22 0.096 33 29 0.11 35 -1 0 120 118 10. 71 3 12 22 0.095 33 29 0.10 35 -2 0 120 119 10. 71 3 12 22 0.096 33 29 0.11 35 -3 0 119 117 10. 71 3 12 22 0.094 33 29 0.10 35 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 3 No. of states in DFA: 594 (59 KB) Total size of DFA: 161 KB (192 KB) Time to generate neighborhood: 0.01u 0.00s 0.01t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 114.09u 1.05s 115.14t Elapsed: 00:00:25 Total cpu time: 114.12u 1.05s 115.17t Elapsed: 00:00:25 Start: Thu Jan 17 16:16:22 2002 End: Thu Jan 17 16:16:47 2002
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000