WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= 'D13B05_C05_03.ab1' (333 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 5 Sequences : less than 5 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 1120 260 |==================================================== 6310 860 143 |============================ 3980 717 168 |================================= 2510 549 242 |================================================ 1580 307 91 |================== 1000 216 99 |=================== 631 117 46 |========= 398 71 21 |==== 251 50 16 |=== 158 34 15 |=== 100 19 5 |= 63.1 14 1 |: 39.8 13 3 |: 25.1 10 2 |: 15.8 8 1 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 7 <<<<<<<<<<<<<<<<< 10.0 7 1 |: 6.31 6 0 | 3.98 6 1 |: 2.51 5 0 | 1.58 5 0 | 1.00 5 0 | 0.63 5 0 | 0.40 5 1 |: 0.25 4 0 | 0.16 4 0 | 0.10 4 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|7485600|pir||T04014hypothetical protein F17A8.20 -... +1 136 1.5e-07 1 gi|5091625|gb|AAD39613.1|AC007454_12(AC007454) Simila... +1 121 9.6e-06 1 gi|7488933|pir||T14319protein AX110P - carrot >gi|285... +1 105 0.00041 1 gi|12322603|gb|AAG51297.1|AC026480_4(AC026480) oxidor... +1 84 0.068 1 gi|2062334|gb|AAB53349.1|(U57401) unknown [Choristone... +2 50 0.25 2 gi|619390|gb|AAB32148.1|B cell antigen CD20-specific ... -1 60 0.96 1 gi|7516956|pir||F72518hypothetical protein APE2123 - ... +2 68 0.9997 1
Use the and icons to retrieve links to Entrez:
>gi|7485600|pir||T04014 hypothetical protein F17A8.20 - Arabidopsis thaliana >gi|4538897|emb|CAB39634.1| (AL049482) AX110P-like protein [Arabidopsis thaliana] >gi|7267662|emb|CAB78090.1| (AL161515) AX110P-like protein [Arabidopsis thaliana] Length = 362 Frame 1 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | 362 0 150 300 Plus Strand HSPs: Score = 136 (47.9 bits), Expect = 1.5e-07, P = 1.5e-07 Identities = 25/43 (58%), Positives = 33/43 (76%), Frame = +1 Query: 76 ESSGVQFMDSTIWVHNPRTATMAHFLNDAQRFGILKSAFSIHS 204 E++GVQ MD T+WVHNPRTA + FL+D++RFG LK+ S S Sbjct: 119 EANGVQIMDGTMWVHNPRTALLKEFLSDSERFGQLKTVQSCFS 161 >gi|5091625|gb|AAD39613.1|AC007454_12 (AC007454) Similar to gb|D14605 AX110P embryogenesis-associated protein from Daucus carota and is a member of the PF|01408 Oxidoreductase family. ESTs gb|Z35057, gb|T20683 and gb|Z48399 come from this gene. [Arabidopsis thaliana] Length = 450 Frame 1 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | 450 0 150 300 Plus Strand HSPs: Score = 121 (42.6 bits), Expect = 9.6e-06, P = 9.6e-06 Identities = 22/43 (51%), Positives = 30/43 (69%), Frame = +1 Query: 76 ESSGVQFMDSTIWVHNPRTATMAHFLNDAQRFGILKSAFSIHS 204 E +GVQFMD T W+H+PRT + F+ND + FG +KS +S S Sbjct: 120 EVNGVQFMDGTQWMHSPRTDKIKEFVNDLESFGQIKSVYSCFS 162 >gi|7488933|pir||T14319 protein AX110P - carrot >gi|285739|dbj|BAA03455.1| (D14605) AX110P [Daucus carota] >gi|740202|prf||2004427A embryogenesis-associated protein [Daucus carota] Length = 390 Frame 1 hits (HSPs): ______ __________________________________________________ Database sequence: | | | | 390 0 150 300 Plus Strand HSPs: Score = 105 (37.0 bits), Expect = 0.00041, P = 0.00041 Identities = 19/40 (47%), Positives = 29/40 (72%), Frame = +1 Query: 76 ESSGVQFMDSTIWVHNPRTATMAHFLNDAQRFGILKSAFS 195 + +GVQ+MD T+ H+PR+A M +L+DA+ FG L+S S Sbjct: 122 DDNGVQYMDGTMLQHHPRSAKMREYLDDAEHFGQLRSIIS 161 >gi|12322603|gb|AAG51297.1|AC026480_4 (AC026480) oxidoreductase, putative [Arabidopsis thaliana] Length = 364 Frame 1 hits (HSPs): ______ __________________________________________________ Database sequence: | | | | 364 0 150 300 Plus Strand HSPs: Score = 84 (29.6 bits), Expect = 0.070, P = 0.068 Identities = 16/40 (40%), Positives = 25/40 (62%), Frame = +1 Query: 76 ESSGVQFMDSTIWVHNPRTATMAHFLNDAQRFGILKSAFS 195 E +GVQFMD TIW+H+ RT + + D+ G ++ +S Sbjct: 120 EYNGVQFMDGTIWLHHQRTVKIRDTMFDSGLLGDVRHMYS 159 >gi|2062334|gb|AAB53349.1| (U57401) unknown [Choristoneura fumiferana nucleopolyhedrovirus] Length = 70 Frame 2 hits (HSPs): ____________________ Frame 1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | 70 0 20 40 60 Plus Strand HSPs: Score = 50 (17.6 bits), Expect = 0.29, Sum P(2) = 0.25 Identities = 13/29 (44%), Positives = 17/29 (58%), Frame = +2 Query: 107 PSGSITQGLLPWPTSSTMRNVLVSSNRHS 193 P+ I +G PWP S T R+ V + RHS Sbjct: 2 PARHIARGYTPWP-SRTTRDRQVRT-RHS 28 Score = 37 (13.0 bits), Expect = 0.29, Sum P(2) = 0.25 Identities = 5/13 (38%), Positives = 11/13 (84%), Frame = +1 Query: 238 QKWLQTLGSVSCH 276 ++++QT+ V+CH Sbjct: 57 RRFVQTMRGVACH 69 >gi|619390|gb|AAB32148.1| B cell antigen CD20-specific IgG light chain variable region {C-terminal} [mice, hybridoma cell line Mem97, Peptide Partial, 55 aa] Length = 55 Frame -1 hits (HSPs): ____________________________ __________________________________________________ Database sequence: | | | | 55 0 20 40 Minus Strand HSPs: Score = 60 (21.1 bits), Expect = 3.3, P = 0.96 Identities = 14/33 (42%), Positives = 16/33 (48%), Frame = -1 Query: 102 IHELHPTGFGNYFAHLTFFWYTQQFFVPGRKAE 4 I+ L P FGNY+ FW T F G K E Sbjct: 21 INRLQPEDFGNYYCQ--HFWSTPWTFGGGTKLE 51 >gi|7516956|pir||F72518 hypothetical protein APE2123 - Aeropyrum pernix (strain K1) >gi|5105822|dbj|BAA81134.1| (AP000063) 128aa long hypothetical protein [Aeropyrum pernix] Length = 128 Frame 2 hits (HSPs): _______________ __________________________________________________ Database sequence: | | | | 128 0 50 100 Plus Strand HSPs: Score = 68 (23.9 bits), Expect = 8.3, P = 1.0 Identities = 16/37 (43%), Positives = 24/37 (64%), Frame = +2 Query: 71 FPN-PVGCSS-WIAPSGSITQGLLPWPTSSTMRNVLVS 178 FP+ PVG S+ W P ++ GL+PW S T+ +L+S Sbjct: 33 FPSSPVGTSTTWNPPMAALA-GLVPWAESGTIITLLLS 69 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.96 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.332 0.145 0.499 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.329 0.138 0.465 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.338 0.146 0.478 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.334 0.144 0.472 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.368 0.165 0.639 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.358 0.163 0.664 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 110 110 10. 69 3 12 22 0.12 32 29 0.12 34 +2 0 110 110 10. 69 3 12 22 0.12 32 29 0.12 34 +1 0 111 111 10. 70 3 12 22 0.12 32 29 0.12 34 -1 0 111 111 10. 70 3 12 22 0.12 32 29 0.12 34 -2 0 110 110 10. 69 3 12 22 0.12 32 29 0.12 34 -3 0 110 110 10. 69 3 12 22 0.12 32 29 0.12 34 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 7 No. of states in DFA: 588 (58 KB) Total size of DFA: 167 KB (192 KB) Time to generate neighborhood: 0.01u 0.00s 0.01t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 121.55u 1.25s 122.80t Elapsed: 00:00:22 Total cpu time: 121.58u 1.26s 122.84t Elapsed: 00:00:22 Start: Thu Jan 17 10:31:48 2002 End: Thu Jan 17 10:32:10 2002
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000