WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= 'E04C06_C06_06.ab1' (421 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 4 Sequences : less than 4 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 760 195 |================================================ 6310 565 114 |============================ 3980 451 108 |=========================== 2510 343 97 |======================== 1580 246 80 |==================== 1000 166 53 |============= 631 113 31 |======= 398 82 15 |=== 251 67 40 |========== 158 27 11 |== 100 16 2 |: 63.1 14 1 |: 39.8 13 5 |= 25.1 8 0 | 15.8 8 2 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 6 <<<<<<<<<<<<<<<<< 10.0 6 0 | 6.31 6 1 |: 3.98 5 0 | 2.51 5 0 | 1.58 5 0 | 1.00 5 0 | 0.63 5 0 | 0.40 5 0 | 0.25 5 1 |: 0.16 4 0 | 0.10 4 0 | 0.063 4 0 | 0.040 4 1 |: 0.025 3 0 | 0.016 3 0 | 0.010 3 0 | 0.0063 3 0 | 0.0040 3 0 | 0.0025 3 0 | 0.0016 3 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|9758068|dbj|BAB08647.1|(AB009048) frnE protein-lik... +1 274 6.9e-23 1 gi|10174520|dbj|BAB05621.1|(AP001513) BH1902~unknown ... +1 104 7.1e-05 1 gi|3978168|gb|AAD03806.1|(AF047427) unknown [Mannheim... +1 93 0.0010 1 gi|7471903|pir||E75491frnE protein - Deinococcus radi... +1 94 0.036 1 gi|3170570|gb|AAC18100.1|(AF058302) FrnE [Streptomyce... +1 88 0.18 1 gi|1353122|sp|P34655|YOT0_CAEELHYPOTHETICAL 8.7 KD PR... -2 61 0.99 1
Use the and icons to retrieve links to Entrez:
>gi|9758068|dbj|BAB08647.1| (AB009048) frnE protein-like [Arabidopsis thaliana] Length = 217 Frame 1 hits (HSPs): ___________________ __________________________________________________ Database sequence: | | | | | | 217 0 50 100 150 200 Plus Strand HSPs: Score = 274 (96.5 bits), Expect = 6.9e-23, P = 6.9e-23 Identities = 53/78 (67%), Positives = 62/78 (79%), Frame = +1 Query: 13 GKYIGDHKFLLESAAKVGIEGAEEFLKDPNNGLKEVEEELKTYSGNISGVPYYVINGNHK 192 GK+IGD +FL+E+A KVGIEGAEEFL DPNNG+ EV+EEL YS NI+GVP Y ING K Sbjct: 139 GKFIGDREFLVETANKVGIEGAEEFLSDPNNGVTEVKEELAKYSKNITGVPNYTINGKVK 198 Query: 193 LSGGQPPEVFLRAFQVAT 246 LSG QPPE F AF+ A+ Sbjct: 199 LSGAQPPETFQSAFKAAS 216 >gi|10174520|dbj|BAB05621.1| (AP001513) BH1902~unknown conserved protein in others [Bacillus halodurans] Length = 95 Frame 1 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | | | 95 0 20 40 60 80 Plus Strand HSPs: Score = 104 (36.6 bits), Expect = 7.1e-05, P = 7.1e-05 Identities = 20/38 (52%), Positives = 26/38 (68%), Frame = +1 Query: 124 EELKTYSGNISGVPYYVINGNHKLSGGQPPEVFLRAFQ 237 EE K S + GVPY+VIN + LSG QP +VF+RA + Sbjct: 24 EEAKAQSLQVRGVPYFVINDKYALSGAQPTDVFVRALK 61 >gi|3978168|gb|AAD03806.1| (AF047427) unknown [Mannheimia haemolytica] Length = 94 Frame 1 hits (HSPs): _____________________________________ __________________________________________________ Database sequence: | | | | | | 94 0 20 40 60 80 Plus Strand HSPs: Score = 93 (32.7 bits), Expect = 0.0010, P = 0.0010 Identities = 24/70 (34%), Positives = 37/70 (52%), Frame = +1 Query: 40 LLESAAKVGIEGAE--EFLKDPNNGLKEVEEELKTYSGNISGVPYYVINGNHKLSGGQPP 213 L+ A +G+E E + L + G + E+E + I VP++VIN +SG QPP Sbjct: 22 LISLALDIGLERDEIAQLLTGDDFGHEVREDERVAHKYGIHSVPFFVINEKLGVSGAQPP 81 Query: 214 EVFLRAFQVA 243 E+ L A + A Sbjct: 82 EILLDAIKQA 91 >gi|7471903|pir||E75491 frnE protein - Deinococcus radiodurans (strain R1) >gi|6458362|gb|AAF10238.1|AE001923_5 (AE001923) frnE protein [Deinococcus radiodurans] Length = 252 Frame 1 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | || 252 0 50 100 150 200 250 Plus Strand HSPs: Score = 94 (33.1 bits), Expect = 0.037, P = 0.036 Identities = 24/75 (32%), Positives = 40/75 (53%), Frame = +1 Query: 13 GKYIGDHKFLLESAAKVGIE-GAEEFLKDPNNGLKEVE-EELKTYSGNISGVPYYVINGN 186 G+ + D L + AA+VG++ GA + + V +E + I+GVP++V+ G Sbjct: 140 GQNVNDLDTLQKLAAEVGLDAGAARAALEAGTYAQAVRYDEAQAQQLGITGVPFFVLGGK 199 Query: 187 HKLSGGQPPEVFLRA 231 + +SG Q PE L A Sbjct: 200 YGVSGAQAPETLLGA 214 >gi|3170570|gb|AAC18100.1| (AF058302) FrnE [Streptomyces roseofulvus] Length = 216 Frame 1 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | | | 216 0 50 100 150 200 Plus Strand HSPs: Score = 88 (31.0 bits), Expect = 0.20, P = 0.18 Identities = 24/71 (33%), Positives = 35/71 (49%), Frame = +1 Query: 13 GKYIGDHKFLLESAAKVGIEGA---EEFLKDPNNGLKEVEEELKTYSGNISGVPYYVING 183 G +GDH LL A + G++ A E D + +E+ G + GVP +VI G Sbjct: 132 GVSVGDHPTLLALAEEAGLDAAAAAEVLAGDAHAEDVRADEDRAARLG-VGGVPAFVIGG 190 Query: 184 NHKLSGGQPPEV 219 +SG QP E+ Sbjct: 191 RWSVSGAQPAEL 202 >gi|1353122|sp|P34655|YOT0_CAEEL HYPOTHETICAL 8.7 KD PROTEIN ZK632.10 IN CHROMOSOME III >gi|3881704|emb|CAA80190.1| (Z22181) contains similarity to Pfam domain: PF01679 (Uncharacterized protein family), Score=99.2, E-value=2.7e-26, N=1 [Caenorhabditis elegans] Length = 80 Frame -2 hits (HSPs): _______________________ Annotated Domains: _________________________________________________ __________________________________________________ Database sequence: | | | | | 80 0 20 40 60 __________________ Annotated Domains: BLOCKS BL01309: Uncharacterized protein family 30..50 Entrez Transmembrane region: POTENTIAL. 4..24 Entrez Transmembrane region: POTENTIAL. 32..52 PFAM UPF0057: Uncharacterized protein family 2..52 PRODOM PD004756: 4..50 PRODOM PD051035: YOT0_CAEEL 52..79 __________________ Minus Strand HSPs: Score = 61 (21.5 bits), Expect = 4.4, P = 0.99 Identities = 12/39 (30%), Positives = 26/39 (66%), Frame = -2 Query: 132 ELLLNLLQTIVGVLKKLFCSFYTNLCSRFQKKLMVTNVF 16 +LL+N+L T +G++ + ++Y LC ++K +V N++ Sbjct: 27 DLLINILLTCLGIIPGIIHAWYIILC---KEKTVVQNIY 62 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.99 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.353 0.156 0.616 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.364 0.165 0.556 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.331 0.149 0.471 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.354 0.160 0.562 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.360 0.164 0.568 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.355 0.160 0.566 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 139 139 10. 73 3 12 22 0.12 33 30 0.10 36 +2 0 140 140 10. 73 3 12 22 0.12 33 30 0.11 36 +1 0 140 140 10. 73 3 12 22 0.12 33 30 0.11 36 -1 0 140 140 10. 73 3 12 22 0.12 33 30 0.11 36 -2 0 140 140 10. 73 3 12 22 0.12 33 30 0.11 36 -3 0 139 139 10. 73 3 12 22 0.12 33 30 0.10 36 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 6 No. of states in DFA: 591 (58 KB) Total size of DFA: 176 KB (192 KB) Time to generate neighborhood: 0.01u 0.00s 0.01t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 137.39u 0.85s 138.24t Elapsed: 00:00:36 Total cpu time: 137.41u 0.87s 138.28t Elapsed: 00:00:36 Start: Fri Jan 18 12:02:07 2002 End: Fri Jan 18 12:02:43 2002
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000