WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= A07C02_CONSENSUS (104 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 31 Sequences : less than 31 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 9342 1670 |===================================================== 6310 7672 1837 |=========================================================== 3980 5835 1746 |======================================================== 2510 4089 1386 |============================================ 1580 2703 864 |=========================== 1000 1839 653 |===================== 631 1186 488 |=============== 398 698 304 |========= 251 394 153 |==== 158 241 101 |=== 100 140 56 |= 63.1 84 33 |= 39.8 51 26 |: 25.1 25 13 |: 15.8 12 7 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 5 <<<<<<<<<<<<<<<<< 10.0 5 2 |: 6.31 3 0 | 3.98 3 2 |: 2.51 1 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|1076557|pir||S54157 extensin-like protein - cowp... +3 56 0.91 2 gi|7487811|pir||T00609 hypothetical protein T8K22.1... +3 53 0.93 2 gi|7300325|gb|AAF55485.1|(AE003720) CG12349 gene prod... +3 60 0.96 1 gi|7291971|gb|AAF47387.1|(AE003468) CG13885 gene prod... +3 57 0.999 1 gi|7446850|pir||A69981 conserved hypothetical prote... -3 38 0.9994 2 Locally-aligned regions (HSPs) with respect to query sequence: Locus_ID Frame 3 Hits gi|1076557 | ___________________________ gi|7487811 | ________________________ gi|7300325 | ________________________________________ gi|7291971 | ___________________ __________________________________________________ Query sequence: | | | 35 0 20 Locus_ID Frame 2 Hits gi|1076557 |_________________ gi|7487811 | __________________ __________________________________________________ Query sequence: | | | 35 0 20 Locus_ID Frame -3 Hits gi|7446850 | ______________________________ __________________________________________________ Query sequence: | | | 35 0 20
Use the and icons to retrieve links to Entrez:
>gi|1076557|pir||S54157 extensin-like protein - cowpea (fragment) Length = 279 Frame 3 hits (HSPs): ____ ____ Frame 2 hits (HSPs): ___ __________________________________________________ Database sequence: | | | | | | | 279 0 50 100 150 200 250 Plus Strand HSPs: Score = 56 (19.7 bits), Expect = 2.4, Sum P(2) = 0.91 Identities = 11/18 (61%), Positives = 12/18 (66%), Frame = +3 Query: 42 IHSHHHHL*HQH--MFTT 89 I HHHHL H H MFT+ Sbjct: 215 ISPHHHHLLHHHLLMFTS 232 Score = 54 (19.0 bits), Expect = 3.9, Sum P(2) = 0.98 Identities = 10/19 (52%), Positives = 12/19 (63%), Frame = +3 Query: 42 IHSHHHHL*HQHMF-TTIP 95 I HHHHL H H+ +T P Sbjct: 247 ISRHHHHLLHHHLLMSTSP 265 Score = 29 (10.2 bits), Expect = 2.4, Sum P(2) = 0.91 Identities = 6/11 (54%), Positives = 8/11 (72%), Frame = +2 Query: 2 HEILSTSNPFL 34 H +STS+P L Sbjct: 146 HLPMSTSHPLL 156 >gi|7487811|pir||T00609 hypothetical protein T8K22.16 - Arabidopsis thaliana >gi|3184285|gb|AAC18932.1| (AC004136) hypothetical protein [Arabidopsis thaliana] Length = 310 Frame 3 hits (HSPs): ____ Frame 2 hits (HSPs): ___ __________________________________________________ Database sequence: | | | | | | | | 310 0 50 100 150 200 250 300 Plus Strand HSPs: Score = 53 (18.7 bits), Expect = 2.6, Sum P(2) = 0.93 Identities = 9/16 (56%), Positives = 11/16 (68%), Frame = +3 Query: 45 HSHHHHL*HQHMFTTI 92 HSHHHH+ + M T I Sbjct: 62 HSHHHHVGYNIMVTNI 77 Score = 33 (11.6 bits), Expect = 2.6, Sum P(2) = 0.93 Identities = 5/12 (41%), Positives = 11/12 (91%), Frame = +2 Query: 11 LSTSNPFLLTNS 46 ++TSNP L++++ Sbjct: 41 ITTSNPLLVSSN 52 >gi|7300325|gb|AAF55485.1| (AE003720) CG12349 gene product [Drosophila melanogaster] Length = 102 Frame 3 hits (HSPs): ______________ __________________________________________________ Database sequence: | | | | | | | 102 0 20 40 60 80 100 Plus Strand HSPs: Score = 60 (21.1 bits), Expect = 3.3, P = 0.96 Identities = 12/27 (44%), Positives = 12/27 (44%), Frame = +3 Query: 15 QPQTPSY*LIHSHHHHL*HQHMFTTIP 95 Q Q L H HHHH HQ F P Sbjct: 73 QQQAQQQHLSHHHHHHHHHQQQFLMTP 99 >gi|7291971|gb|AAF47387.1| (AE003468) CG13885 gene product [Drosophila melanogaster] Length = 71 Frame 3 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | 71 0 20 40 60 Plus Strand HSPs: Score = 57 (20.1 bits), Expect = 6.8, P = 1.0 Identities = 9/12 (75%), Positives = 9/12 (75%), Frame = +3 Query: 42 IHSHHHHL*HQH 77 IHSHHHH H H Sbjct: 25 IHSHHHHHSHSH 36 >gi|7446850|pir||A69981 conserved hypothetical protein yrvI - Bacillus subtilis >gi|2635223|emb|CAB14718.1| (Z99118) similar to hypothetical proteins [Bacillus subtilis] Length = 119 Frame -3 hits (HSPs): _____ ______ __________________________________________________ Database sequence: | | | | 119 0 50 100 Minus Strand HSPs: Score = 38 (13.4 bits), Expect = 7.5, Sum P(2) = 1.0 Identities = 6/11 (54%), Positives = 10/11 (90%), Frame = -3 Query: 75 VGVTNDDDENE 43 VG+T+DD E++ Sbjct: 4 VGITHDDTEDD 14 Score = 33 (11.6 bits), Expect = 7.5, Sum P(2) = 1.0 Identities = 6/12 (50%), Positives = 9/12 (75%), Frame = -3 Query: 48 NELVSRKGFEVE 13 N+L+ KG +VE Sbjct: 83 NDLLREKGIKVE 94 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.96 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.384 0.169 0.737 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.316 0.129 0.371 same same same Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.343 0.153 0.494 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.352 0.150 0.501 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.437 0.208 1.23 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.352 0.163 0.526 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 34 34 10. 58 3 12 22 0.089 27 24 0.011 27 +2 0 34 34 10. 58 3 12 22 0.11 26 24 0.014 26 +1 0 34 34 10. 58 3 12 22 0.089 27 24 0.011 27 -1 0 34 34 10. 58 3 12 22 0.089 27 24 0.011 27 -2 0 34 34 10. 58 3 12 22 0.089 27 24 0.011 27 -3 0 34 34 10. 58 3 12 22 0.089 27 24 0.011 27 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 5 No. of states in DFA: 524 (52 KB) Total size of DFA: 82 KB (128 KB) Time to generate neighborhood: 0.01u 0.00s 0.01t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 38.33u 1.25s 39.58t Elapsed: 00:00:07 Total cpu time: 38.34u 1.27s 39.61t Elapsed: 00:00:07 Start: Mon Oct 1 20:40:08 2001 End: Mon Oct 1 20:40:15 2001
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000