WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= A05E08_CONSENSUS (281 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 6 Sequences : less than 6 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 1298 311 |=================================================== 6310 987 227 |===================================== 3980 760 246 |========================================= 2510 514 117 |=================== 1580 397 139 |======================= 1000 258 67 |=========== 631 191 71 |=========== 398 120 45 |======= 251 75 28 |==== 158 47 9 |= 100 38 13 |== 63.1 25 9 |= 39.8 16 7 |= 25.1 9 4 |: 15.8 5 1 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 4 <<<<<<<<<<<<<<<<< 10.0 4 2 |: 6.31 2 0 | 3.98 2 0 | 2.51 2 1 |: 1.58 1 0 | 1.00 1 0 | 0.63 1 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|7021729|gb|AAF35410.1|(AC024081) hypothetical prot... -1 69 0.41 1 gi|414449|gb|AAC43196.1|(U01721) unknown [Mycoplasma ... -1 64 0.89 1 gi|10639654|emb|CAC11626.1|(AL445064) cellulose synth... +3 67 0.9998 1 gi|102037|pir||B28118ubiquinol--cytochrome-c reductas... +1 34 0.99990 2 Locally-aligned regions (HSPs) with respect to query sequence: Locus_ID Frame 3 Hits gi|10639654 | ___________ __________________________________________________ Query sequence: | | | | | | 94 0 20 40 60 80 Locus_ID Frame 2 Hits gi|102037 | _________ __________________________________________________ Query sequence: | | | | | | 94 0 20 40 60 80 Locus_ID Frame 1 Hits gi|102037 | ________ __________________________________________________ Query sequence: | | | | | | 94 0 20 40 60 80 Locus_ID Frame -1 Hits gi|7021729 | gi|414449 | ________________________ Prosite Hits: _____ __________________________________________________ Query sequence: | | | | | | 94 0 20 40 60 80 __________________ Prosite hits: TYR_PHOSPHO_SITE Tyrosine kinase phosphorylation site. 31..37 __________________
Use the and icons to retrieve links to Entrez:
>gi|7021729|gb|AAF35410.1| (AC024081) hypothetical protein [Arabidopsis thaliana] Length = 143 Frame -1 hits (HSPs): ________________________ __________________________________________________ Database sequence: | | | | 143 0 50 100 Minus Strand HSPs: Score = 69 (24.3 bits), Expect = 0.54, P = 0.41 Identities = 21/68 (30%), Positives = 34/68 (50%), Frame = -1 Query: 254 RVFRDKSVGLLENHLKTVHHFKFRDLQYPLAGQIYMTFLEQPHFQALIQYRVKKREKKKE 75 R RD+ +G E +K + D+ P I TF E+ Q+++ ++K EK K Sbjct: 68 RYGRDEFIGQDEEQIKATIYEILNDVGVPPYLGIMKTFAEE--IQSILNSPLEKLEKIKV 125 Query: 74 KIKGIDHM 51 KI+ + HM Sbjct: 126 KIEVVAHM 133 >gi|414449|gb|AAC43196.1| (U01721) unknown [Mycoplasma genitalium] Length = 99 Frame -1 hits (HSPs): _______________________ __________________________________________________ Database sequence: | | | | | | 99 0 20 40 60 80 Minus Strand HSPs: Score = 64 (22.5 bits), Expect = 2.2, P = 0.89 Identities = 17/45 (37%), Positives = 22/45 (48%), Frame = -1 Query: 209 KTVHHFKFRDLQYP-LAGQIYMTFLEQPHFQALIQYRVKKREKKK 78 + V HFK P L G+ Y FLE P+F L + K R+ K Sbjct: 47 EVVKHFKISKQDEPSLIGRFYQDFLEDPNFVYLGDRKWKLRDFMK 91 >gi|10639654|emb|CAC11626.1| (AL445064) cellulose synthase, subunit A related protein [Thermoplasma acidophilum] Length = 561 Frame 3 hits (HSPs): __ __________________________________________________ Database sequence: | | | | | 561 0 150 300 450 Plus Strand HSPs: Score = 67 (23.6 bits), Expect = 8.4, P = 1.0 Identities = 13/21 (61%), Positives = 17/21 (80%), Frame = +3 Query: 54 MINPLNFFFFLFP-FLYSVLY 113 MINP+ F FF FP F+YS+L+ Sbjct: 406 MINPVFFVFFYFPYFIYSMLF 426 >gi|102037|pir||B28118 ubiquinol--cytochrome-c reductase (EC 1.10.2.2) cytochrome b - Crithidia fasciculata mitochondrion (fragment) Length = 48 Frame 2 hits (HSPs): _______________ Frame 1 hits (HSPs): ______________ __________________________________________________ Database sequence: | | | | 48 0 20 40 Plus Strand HSPs: Score = 34 (12.0 bits), Expect = 9.3, Sum P(2) = 1.0 Identities = 7/14 (50%), Positives = 8/14 (57%), Frame = +1 Query: 73 FSFFFSLFFTLYCI 114 F FF LF L C+ Sbjct: 7 FLLFFLLFRNLCCL 20 Score = 33 (11.6 bits), Expect = 9.3, Sum P(2) = 1.0 Identities = 5/14 (35%), Positives = 8/14 (57%), Frame = +2 Query: 215 GFLINLHFCL*ILC 256 GF + C+ I+C Sbjct: 33 GFSLGFFICIQIIC 46 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.97 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.362 0.164 0.611 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.384 0.181 0.689 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.350 0.155 0.604 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.344 0.150 0.523 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.369 0.168 0.619 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.338 0.146 0.462 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 93 92 10. 65 3 12 22 0.095 32 28 0.12 33 +2 0 93 93 10. 65 3 12 22 0.096 32 28 0.12 33 +1 0 93 92 10. 65 3 12 22 0.095 32 28 0.12 33 -1 0 93 92 10. 65 3 12 22 0.095 32 28 0.12 33 -2 0 93 93 10. 65 3 12 22 0.096 32 28 0.12 33 -3 0 93 93 10. 65 3 12 22 0.096 32 28 0.12 33 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 4 No. of states in DFA: 584 (58 KB) Total size of DFA: 139 KB (192 KB) Time to generate neighborhood: 0.01u 0.00s 0.01t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 90.19u 1.14s 91.33t Elapsed: 00:00:16 Total cpu time: 90.22u 1.15s 91.37t Elapsed: 00:00:16 Start: Mon Oct 1 19:50:37 2001 End: Mon Oct 1 19:50:53 2001
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000