WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= 'D13E05_I05_09.ab1' (292 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 10 Sequences : less than 10 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 2416 588 |========================================================== 6310 1828 523 |==================================================== 3980 1305 488 |================================================ 2510 817 285 |============================ 1580 532 174 |================= 1000 358 117 |=========== 631 241 68 |====== 398 173 31 |=== 251 142 32 |=== 158 110 36 |=== 100 74 32 |=== 63.1 42 12 |= 39.8 30 4 |: 25.1 26 8 |: 15.8 18 2 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 16 <<<<<<<<<<<<<<<<< 10.0 16 1 |: 6.31 15 1 |: 3.98 14 0 | 2.51 14 0 | 1.58 14 0 | 1.00 14 1 |: 0.63 13 0 | 0.40 13 0 | 0.25 13 1 |: 0.16 12 1 |: 0.10 11 0 | 0.063 11 0 | 0.040 11 0 | 0.025 11 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|31291|emb|CAA36016.1|(X51728) fumarylacetoacetase ... +1 113 4.6e-05 1 gi|3157928|gb|AAC17611.1|(AC002131) Similar to fumary... +1 114 4.7e-05 1 gi|4557587ref|NP_000128.1| fumarylacetoacetase [Homo ... +1 113 6.3e-05 1 gi|12739036ref|XP_007704.2| fumarylacetoacetase [Homo... +1 113 6.7e-05 1 gi|8569274|pdb|1QCO|AChain A, Crystal Structure Of Fu... +1 110 0.00014 1 gi|6753814ref|NP_034306.1| fumarylacetoacetate hydrol... +1 109 0.00017 1 gi|8393349ref|NP_058877.1| fumarylacetoacetate hydrol... +1 109 0.00017 1 gi|544273|sp|P35505|FAAA_MOUSEFUMARYLACETOACETASE (FU... +1 109 0.00017 1 gi|253320|gb|AAB22822.1|fumarylacetoacetate hydrolase... +1 109 0.00017 1 gi|8569272|pdb|1QCN|AChain A, Crystal Structure Of Fu... +1 109 0.00017 1 gi|7292429|gb|AAF47833.1|(AE003480) CG14993 gene prod... +1 89 0.019 1 gi|7505698|pir||T25813hypothetical protein K10C2.4 - ... +1 83 0.10 1 gi|12313291|emb|CAC24418.1|(AL512978) Hypothetical [S... +3 82 0.17 1 gi|11348567|pir||E83394fumarylacetoacetase PA2008 [im... +1 76 0.48 1 gi|110562|pir||S25462Ig kappa chain V region - mouse ... +3 62 0.98 1 gi|197274|gb|AAA38993.1|(M31268) IgK chain [Mus muscu... +3 64 0.9995 1
Use the and icons to retrieve links to Entrez:
>gi|31291|emb|CAA36016.1| (X51728) fumarylacetoacetase (AA 1-349) [Homo sapiens] Length = 349 Frame 1 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | 349 0 150 300 Plus Strand HSPs: Score = 113 (39.8 bits), Expect = 4.6e-05, P = 4.6e-05 Identities = 17/26 (65%), Positives = 24/26 (92%), Frame = +1 Query: 1 VILTGYCKGNGYSVGFGTCSGKIVPA 78 VI+TGYC+G+GY +GFG C+GK++PA Sbjct: 320 VIITGYCQGDGYRIGFGQCAGKVLPA 345 >gi|3157928|gb|AAC17611.1| (AC002131) Similar to fumarylacetoacetate hydrolase, gb|L41670 from Emericella nidulans. [Arabidopsis thaliana] Length = 408 Frame 1 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | 408 0 150 300 Plus Strand HSPs: Score = 114 (40.1 bits), Expect = 4.7e-05, P = 4.7e-05 Identities = 19/28 (67%), Positives = 24/28 (85%), Frame = +1 Query: 1 VILTGYCKGNGYSVGFGTCSGKIVPAAP 84 V +G CKG+GY+VGFGTC+GKIVP+ P Sbjct: 381 VTFSGVCKGDGYNVGFGTCTGKIVPSPP 408 >gi|4557587 ref|NP_000128.1| fumarylacetoacetase [Homo sapiens] >gi|119778|sp|P16930|FAAA_HUMAN FUMARYLACETOACETASE (FUMARYLACETOACETATE HYDROLASE) (BETA-DIKETONASE) (FAA) >gi|106043|pir||A37926 fumarylacetoacetase (EC 3.7.1.2) - human >gi|182393|gb|AAA52422.1| (M55150) fumarylacetoacetate hydrolase [Homo sapiens] Length = 419 Frame 1 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | 419 0 150 300 Plus Strand HSPs: Score = 113 (39.8 bits), Expect = 6.3e-05, P = 6.3e-05 Identities = 17/26 (65%), Positives = 24/26 (92%), Frame = +1 Query: 1 VILTGYCKGNGYSVGFGTCSGKIVPA 78 VI+TGYC+G+GY +GFG C+GK++PA Sbjct: 390 VIITGYCQGDGYRIGFGQCAGKVLPA 415 >gi|12739036 ref|XP_007704.2| fumarylacetoacetase [Homo sapiens] Length = 437 Frame 1 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | 437 0 150 300 Plus Strand HSPs: Score = 113 (39.8 bits), Expect = 6.7e-05, P = 6.7e-05 Identities = 17/26 (65%), Positives = 24/26 (92%), Frame = +1 Query: 1 VILTGYCKGNGYSVGFGTCSGKIVPA 78 VI+TGYC+G+GY +GFG C+GK++PA Sbjct: 408 VIITGYCQGDGYRIGFGQCAGKVLPA 433 >gi|8569274|pdb|1QCO|A Chain A, Crystal Structure Of Fumarylacetoacetate Hydrolase Complexed With Fumarate And Acetoacetate >gi|8569275|pdb|1QCO|B Chain B, Crystal Structure Of Fumarylacetoacetate Hydrolase Complexed With Fumarate And Acetoacetate Length = 423 Frame 1 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | 423 0 150 300 Plus Strand HSPs: Score = 110 (38.7 bits), Expect = 0.00014, P = 0.00014 Identities = 19/31 (61%), Positives = 27/31 (87%), Frame = +1 Query: 1 VILTGYCKGNGYSVGFGTCSGKIVPA-AP*G 90 VI+TG+C+G+GY VGFG C+GK++PA +P G Sbjct: 392 VIITGHCQGDGYRVGFGQCAGKVLPALSPAG 422 >gi|6753814 ref|NP_034306.1| fumarylacetoacetate hydrolase [Mus musculus] >gi|1083328|pir||A56825 fumarylacetoacetase (EC 3.7.1.2) - mouse >gi|50973|emb|CAA77819.1| (Z11774) fumarylacetoacetase [Mus musculus] Length = 419 Frame 1 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | 419 0 150 300 Plus Strand HSPs: Score = 109 (38.4 bits), Expect = 0.00017, P = 0.00017 Identities = 17/26 (65%), Positives = 24/26 (92%), Frame = +1 Query: 1 VILTGYCKGNGYSVGFGTCSGKIVPA 78 VI+TG+C+G+GY VGFG C+GK++PA Sbjct: 390 VIITGHCQGDGYRVGFGQCAGKVLPA 415 >gi|8393349 ref|NP_058877.1| fumarylacetoacetate hydrolase [Rattus norvegicus] >gi|119779|sp|P25093|FAAA_RAT FUMARYLACETOACETASE (FUMARYLACETOACETATE HYDROLASE) (BETA-DIKETONASE) (FAA) >gi|92242|pir||JH0467 fumarylacetoacetase (EC 3.7.1.2) - rat >gi|204090|gb|AAA41142.1| (M77694) fumarylacetoacetate hydrolase [Rattus norvegicus] Length = 419 Frame 1 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | 419 0 150 300 Plus Strand HSPs: Score = 109 (38.4 bits), Expect = 0.00017, P = 0.00017 Identities = 17/26 (65%), Positives = 24/26 (92%), Frame = +1 Query: 1 VILTGYCKGNGYSVGFGTCSGKIVPA 78 VI+TG+C+G+GY VGFG C+GK++PA Sbjct: 390 VIITGHCQGDGYRVGFGQCAGKVLPA 415 >gi|544273|sp|P35505|FAAA_MOUSE FUMARYLACETOACETASE (FUMARYLACETOACETATE HYDROLASE) (BETA-DIKETONASE) (FAA) >gi|284750|pir||A40219 fumarylacetoacetate hydrolase - mouse >gi|8569283|pdb|1QQJ|A Chain A, Crystal Structure Of Mouse Fumarylacetoacetate Hydrolase Refined At 1.55 Angstrom Resolution >gi|8569284|pdb|1QQJ|B Chain B, Crystal Structure Of Mouse Fumarylacetoacetate Hydrolase Refined At 1.55 Angstrom Resolution >gi|193222|gb|AAA37591.1| (M84145) fumarylacetoacetate hydrolase [Mus musculus] Length = 419 Frame 1 hits (HSPs): ____ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | 419 0 150 300 __________________ Annotated Domains: PFAM FAA_hydrolase: Fumarylacetoacetate (FAA) 145..359 PRODOM PD009659: FAAA(3) Q94272(1) O65374(1) 1..162 PRODOM PD002459: FAAA(3) HPCE(2) Q46978(2) 164..263 PRODOM PD009658: FAAA(3) Q94272(1) O65374(1) 266..414 PROSITE BZIP_BASIC: bZIP transcription factors b 82..97 __________________ Plus Strand HSPs: Score = 109 (38.4 bits), Expect = 0.00017, P = 0.00017 Identities = 17/26 (65%), Positives = 24/26 (92%), Frame = +1 Query: 1 VILTGYCKGNGYSVGFGTCSGKIVPA 78 VI+TG+C+G+GY VGFG C+GK++PA Sbjct: 390 VIITGHCQGDGYRVGFGQCAGKVLPA 415 >gi|253320|gb|AAB22822.1| fumarylacetoacetate hydrolase, FAH [mice, Peptide, 419 aa] Length = 419 Frame 1 hits (HSPs): ____ Annotated Domains: ___ __________________________________________________ Database sequence: | | | | 419 0 150 300 __________________ Annotated Domains: PROSITE BZIP_BASIC: bZIP transcription factors b 82..97 __________________ Plus Strand HSPs: Score = 109 (38.4 bits), Expect = 0.00017, P = 0.00017 Identities = 17/26 (65%), Positives = 24/26 (92%), Frame = +1 Query: 1 VILTGYCKGNGYSVGFGTCSGKIVPA 78 VI+TG+C+G+GY VGFG C+GK++PA Sbjct: 390 VIITGHCQGDGYRVGFGQCAGKVLPA 415 >gi|8569272|pdb|1QCN|A Chain A, Crystal Structure Of Fumarylacetoacetate Hydrolase >gi|8569273|pdb|1QCN|B Chain B, Crystal Structure Of Fumarylacetoacetate Hydrolase Length = 421 Frame 1 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | 421 0 150 300 Plus Strand HSPs: Score = 109 (38.4 bits), Expect = 0.00017, P = 0.00017 Identities = 17/26 (65%), Positives = 24/26 (92%), Frame = +1 Query: 1 VILTGYCKGNGYSVGFGTCSGKIVPA 78 VI+TG+C+G+GY VGFG C+GK++PA Sbjct: 392 VIITGHCQGDGYRVGFGQCAGKVLPA 417 >gi|7292429|gb|AAF47833.1| (AE003480) CG14993 gene product [Drosophila melanogaster] Length = 349 Frame 1 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | 349 0 150 300 Plus Strand HSPs: Score = 89 (31.3 bits), Expect = 0.019, P = 0.019 Identities = 14/28 (50%), Positives = 21/28 (75%), Frame = +1 Query: 1 VILTGYCKGNGYSVGFGTCSGKIVPAAP 84 VI+ G+C+ NG +GFG C G+++PA P Sbjct: 319 VIIRGHCEKNGLRIGFGECVGQVLPAHP 346 >gi|7505698|pir||T25813 hypothetical protein K10C2.4 - Caenorhabditis elegans >gi|1515342|gb|AAB06900.1| (U39852) coded for by C. elegans cDNA yk83g11.5; coded for by C. elegans cDNA yk59b11.5; coded for by C. elegans cDNA yk101c4.5; coded for by C. elegans cDNA yk108e2.5; coded for by C. elegans cDNA cm15b10; coded for by C. elegans cDNA yk62c7.5; coded > Length = 418 Frame 1 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | 418 0 150 300 Plus Strand HSPs: Score = 83 (29.2 bits), Expect = 0.11, P = 0.10 Identities = 14/26 (53%), Positives = 19/26 (73%), Frame = +1 Query: 1 VILTGYCKGNGYSVGFGTCSGKIVPA 78 V L+G C+ NG +GFG C GK++PA Sbjct: 391 VNLSGVCEKNGVRIGFGECRGKVLPA 416 >gi|12313291|emb|CAC24418.1| (AL512978) Hypothetical [Sulfolobus solfataricus] Length = 512 Frame 3 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | 512 0 150 300 450 Plus Strand HSPs: Score = 82 (28.9 bits), Expect = 0.18, P = 0.17 Identities = 22/69 (31%), Positives = 33/69 (47%), Frame = +3 Query: 18 LQGKWLLCWVWYLLRQDCTRSSMRLVSFHSLFQLWWEELYNQEPACLIYSAHNMTLIKLV 197 L G LC + Y + T S+ + S QLW + L N P + YS ++ + Sbjct: 307 LSGYVPLCMIVYTY--NLTNSNSGFYTVGSYSQLWIKPLDNNSPVYITYSGYSYS----- 359 Query: 198 FYEYYTFSF 224 +YEYY F+F Sbjct: 360 YYEYYNFTF 368 >gi|11348567|pir||E83394 fumarylacetoacetase PA2008 [imported] - Pseudomonas aeruginosa (strain PAO1) >gi|9948011|gb|AAG05396.1|AE004627_4 (AE004627) fumarylacetoacetase [Pseudomonas aeruginosa] Length = 432 Frame 1 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | 432 0 150 300 Plus Strand HSPs: Score = 76 (26.8 bits), Expect = 0.65, P = 0.48 Identities = 15/29 (51%), Positives = 19/29 (65%), Frame = +1 Query: 1 VILTGYCKGNGY-SVGFGTCSGKIVPAAP 84 VIL CK G+ S+GFG C GK++ A P Sbjct: 400 VILRARCKREGHVSIGFGECRGKVLAALP 428 >gi|110562|pir||S25462 Ig kappa chain V region - mouse >gi|938263|emb|CAA47881.1| (X67623) IgG light chain V region [Mus musculus] Length = 91 Frame 3 hits (HSPs): _________________________ __________________________________________________ Database sequence: | | | | | | 91 0 20 40 60 80 Plus Strand HSPs: Score = 62 (21.8 bits), Expect = 4.0, P = 0.98 Identities = 16/44 (36%), Positives = 24/44 (54%), Frame = +3 Query: 57 LRQDCT---RSSMRLVSFHSLFQLWWEELYNQEPACLIYSAHNM 179 LRQ T R+S + S+ + F W+++ Q P LIY A N+ Sbjct: 15 LRQRATISCRASESVDSYGNSFMHWYQQKPGQPPKLLIYHASNL 58 >gi|197274|gb|AAA38993.1| (M31268) IgK chain [Mus musculus] Length = 110 Frame 3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | | | | 110 0 20 40 60 80 100 Plus Strand HSPs: Score = 64 (22.5 bits), Expect = 7.7, P = 1.0 Identities = 16/44 (36%), Positives = 24/44 (54%), Frame = +3 Query: 57 LRQDCT---RSSMRLVSFHSLFQLWWEELYNQEPACLIYSAHNM 179 LRQ T R+S + S+ + F W+++ Q P LIY A N+ Sbjct: 15 LRQRATISCRASESVDSYGNSFMYWYQQKPGQPPKLLIYRASNL 58 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.98 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.341 0.148 0.561 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.353 0.159 0.536 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.366 0.169 0.638 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.362 0.158 0.610 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.334 0.142 0.474 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.357 0.164 0.538 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 96 96 10. 66 3 12 22 0.10 32 28 0.095 34 +2 0 97 97 10. 66 3 12 22 0.10 32 28 0.097 34 +1 0 97 97 10. 66 3 12 22 0.10 32 28 0.097 34 -1 0 97 97 10. 66 3 12 22 0.10 32 28 0.097 34 -2 0 97 97 10. 66 3 12 22 0.10 32 28 0.097 34 -3 0 96 96 10. 66 3 12 22 0.10 32 28 0.095 34 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 16 No. of states in DFA: 590 (58 KB) Total size of DFA: 154 KB (192 KB) Time to generate neighborhood: 0.01u 0.00s 0.01t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 103.43u 1.16s 104.59t Elapsed: 00:00:18 Total cpu time: 103.46u 1.18s 104.64t Elapsed: 00:00:18 Start: Thu Jan 17 11:03:15 2002 End: Thu Jan 17 11:03:33 2002
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000