WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= B13C01.seq(1>571) (540 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 4 Sequences : less than 4 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 944 239 |=========================================================== 6310 705 137 |================================== 3980 568 185 |============================================== 2510 383 133 |================================= 1580 250 74 |================== 1000 176 84 |===================== 631 92 33 |======== 398 59 20 |===== 251 39 21 |===== 158 18 1 |: 100 17 3 |: 63.1 14 2 |: 39.8 12 1 |: 25.1 11 0 | 15.8 11 2 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 9 <<<<<<<<<<<<<<<<< 10.0 9 0 | 6.31 9 0 | 3.98 9 0 | 2.51 9 0 | 1.58 9 0 | 1.00 9 0 | 0.63 9 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|9759413|dbj|BAB09868.1|(AB008268) beta-ureidopropi... +3 589 2.9e-56 1 gi|7298937|gb|AAF54141.1|(AE003676) CG3027 gene produ... +3 344 2.6e-30 1 gi|3108075|gb|AAC15764.1|(AF060797) putative beta-ure... +3 324 3.5e-28 1 gi|11417872ref|XP_009883.1| beta-ureidopropionase [Ho... +3 310 1.1e-26 1 gi|7706509ref|NP_057411.1| beta-ureidopropionase [Hom... +3 308 1.7e-26 1 gi|6288790|gb|AAF06739.1|(AF169560) beta-ureidopropio... +3 308 1.7e-26 1 gi|416730|sp|Q03248|BUP_RATBETA-UREIDOPROPIONASE (BET... +3 305 3.6e-26 1 gi|7499051|pir||T16068hypothetical protein F13H8.7 - ... +3 265 6.2e-22 1 gi|7304000|gb|AAF59043.1|(AE003835) CG13755 gene prod... -1 44 0.36 2
Use the and icons to retrieve links to Entrez:
>gi|9759413|dbj|BAB09868.1| (AB008268) beta-ureidopropionase [Arabidopsis thaliana] Length = 405 Frame 3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 405 0 150 300 Plus Strand HSPs: Score = 589 (207.3 bits), Expect = 2.9e-56, P = 2.9e-56 Identities = 110/158 (69%), Positives = 130/158 (82%), Frame = +3 Query: 45 QNGEEKEETTASFCGYDSLHRLLKDNLNPHHFQEVSRLLTGLNCGKALEAVSLPESATSL 224 +NGE E S CGYDSLH+LL NL P +QEV+RLL G NCG++LE + LPESA +L Sbjct: 4 ENGETSAE--GSICGYDSLHQLLSANLKPELYQEVNRLLLGRNCGRSLEQIVLPESAKAL 61 Query: 225 SAEHGFDIQAFSFCADKELLREPRIVRVGLIQNSIVLPTTAHFADQKKAIFEKLKPIIEA 404 S++H FD+QA SF ADKE +R PR+VRVGLIQNSI LPTTA F+DQ + IF+KLKPII+A Sbjct: 62 SSKHDFDLQAASFSADKEQMRNPRVVRVGLIQNSIALPTTAPFSDQTRGIFDKLKPIIDA 121 Query: 405 AGSSGVNVLCLQEAWMMPFAFCTREKRWCEFAEPVDGE 518 AG +GVN+LCLQEAW MPFAFCTRE+RWCEFAEPVDGE Sbjct: 122 AGVAGVNILCLQEAWTMPFAFCTRERRWCEFAEPVDGE 159 Score = 561 (197.5 bits), Expect = 2.7e-53, P = 2.7e-53 Identities = 109/165 (66%), Positives = 129/165 (78%), Frame = +3 Query: 45 QNGEEKEETTASFCGYDSLHRLLKDNLNPHHFQEVSRLLTGLNCGKALEAVSLPESATSL 224 +NGE E S CGYDSLH+LL NL P +QEV+RLL G NCG++LE + LPESA +L Sbjct: 4 ENGETSAE--GSICGYDSLHQLLSANLKPELYQEVNRLLLGRNCGRSLEQIVLPESAKAL 61 Query: 225 SAEHGFDIQAFSFCADKELLREPRIVRVGLIQNSIVLPTTAHFADQKKAIFEKLKPIIEA 404 S++H FD+QA SF ADKE +R PR+VRVGLIQNSI LPTTA F+DQ + IF+KLKPII+A Sbjct: 62 SSKHDFDLQAASFSADKEQMRNPRVVRVGLIQNSIALPTTAPFSDQTRGIFDKLKPIIDA 121 Query: 405 AGSSGVNVLCLQEAWMMPFAFCTREKRWCEFAEPVDGEFAEPVDG 539 AG +GVN+LCLQEAW MPFAFCTRE+RWCEFAEPVDG Sbjct: 122 AGVAGVNILCLQEAWTMPFAFCTRERRWCEFAEPVDG-------- 158 >gi|7298937|gb|AAF54141.1| (AE003676) CG3027 gene product [Drosophila melanogaster] Length = 406 Frame 3 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | | 406 0 150 300 Plus Strand HSPs: Score = 344 (121.1 bits), Expect = 2.6e-30, P = 2.6e-30 Identities = 70/139 (50%), Positives = 97/139 (69%), Frame = +3 Query: 96 SLHRLLKDNLNPHHFQEVSRLLTGLNCGKALEAVSLPESATSLSAEHGFDIQAFSFCADK 275 +L+ L+ +L P +EV R+L G+ + LE LP SA ++ ++GFDI+ + F A + Sbjct: 8 NLNDCLEKHLPPDELKEVKRILYGVEEDQTLE---LPTSAKDIAEQNGFDIKGYRFTARE 64 Query: 276 ELLREPRIVRVGLIQNSIVLPTTAHFADQKKAIFEKLKPIIEAAGSSGVNVLCLQEAWMM 455 E R+ RIVRVG IQNSIV+PTTA Q++AI+ K+K +I+AA +G N++C QEAW M Sbjct: 65 EQTRKRRIVRVGAIQNSIVIPTTAPIEKQREAIWNKVKTMIKAAAEAGCNIVCTQEAWTM 124 Query: 456 PFAFCTREK-RWCEFAEPVD 512 PFAFCTREK WCEFAE + Sbjct: 125 PFAFCTREKFPWCEFAEEAE 144 >gi|3108075|gb|AAC15764.1| (AF060797) putative beta-ureidopropionase [Manduca sexta] Length = 185 Frame 3 hits (HSPs): ______________________________________ __________________________________________________ Database sequence: | | | | | 185 0 50 100 150 Plus Strand HSPs: Score = 324 (114.1 bits), Expect = 3.5e-28, P = 3.5e-28 Identities = 67/139 (48%), Positives = 93/139 (66%), Frame = +3 Query: 96 SLHRLLKDNLNPHHFQEVSRLLTGLNCGKALEAVSLPESATSLSAEHGFDIQAFSFCADK 275 SL ++++NL+ E +R+ G LE V L +S+ + + E F++ A++F A K Sbjct: 6 SLEAIIENNLSGRDLDEFNRIYYGRK--NHLE-VKLKDSSLAAAKEADFEVAAYAFPAKK 62 Query: 276 ELLREPRIVRVGLIQNSIVLPTTAHFADQKKAIFEKLKPIIEAAGSSGVNVLCLQEAWMM 455 E R PRIV+VG+IQ+SI PT +QKKAIF+K+K II+ AG GVN++C QE W M Sbjct: 63 EQTRPPRIVKVGVIQHSIGAPTDRPVNEQKKAIFDKVKKIIDVAGQEGVNIICFQELWNM 122 Query: 456 PFAFCTREKR-WCEFAEPVD 512 PFAFCTREK+ WCEFAE + Sbjct: 123 PFAFCTREKQPWCEFAESAE 142 >gi|11417872 ref|XP_009883.1| beta-ureidopropionase [Homo sapiens] Length = 404 Frame 3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | 404 0 150 300 Plus Strand HSPs: Score = 310 (109.1 bits), Expect = 1.1e-26, P = 1.1e-26 Identities = 70/152 (46%), Positives = 91/152 (59%), Frame = +3 Query: 60 KEETTASFCG--YDSLHRLLKDNLNPHHFQEVSRLLTGLNCGKALEAVSLPESATSLSAE 233 K T + G + SL L+ +L QEV R+L G K L + LP A ++ Sbjct: 14 KHWRTVAMAGAEWKSLEECLEKHLPLPDLQEVKRVLYG----KELRKLDLPREAFEAASR 69 Query: 234 HGFDIQAFSFCADKELLREPRIVRVGLIQNSIVLPTTAHFADQKKAIFEKLKPIIEAAGS 413 F++Q ++F A +E LR PRIV VGL+QN I LP A A+Q A+ ++K I+E A Sbjct: 70 EDFELQGYAFEAAEEQLRRPRIVHVGLVQNRIPLPANAPVAEQVSALHRRIKAIVEVAAM 129 Query: 414 SGVNVLCLQEAWMMPFAFCTREKR-WCEFAEPV-DG 515 GVN++C QEAW MPFAFCTREK W EFAE DG Sbjct: 130 CGVNIICFQEAWTMPFAFCTREKLPWTEFAESAEDG 165 >gi|7706509 ref|NP_057411.1| beta-ureidopropionase [Homo sapiens] >gi|6288771|gb|AAF06735.1|AF163312_1 (AF163312) beta-ureidopropionase [Homo sapiens] >gi|6635205|dbj|BAA88634.1| (AB013885) beta-ureidopropionase [Homo sapiens] Length = 384 Frame 3 hits (HSPs): ___________________ __________________________________________________ Database sequence: | | | | 384 0 150 300 Plus Strand HSPs: Score = 308 (108.4 bits), Expect = 1.7e-26, P = 1.7e-26 Identities = 67/142 (47%), Positives = 87/142 (61%), Frame = +3 Query: 90 YDSLHRLLKDNLNPHHFQEVSRLLTGLNCGKALEAVSLPESATSLSAEHGFDIQAFSFCA 269 + SL L+ +L QEV R+L G K L + LP A ++ F++Q ++F A Sbjct: 6 WKSLEECLEKHLPLPDLQEVKRVLYG----KELRKLDLPREAFEAASREDFELQGYAFEA 61 Query: 270 DKELLREPRIVRVGLIQNSIVLPTTAHFADQKKAIFEKLKPIIEAAGSSGVNVLCLQEAW 449 +E LR PRIV VGL+QN I LP A A+Q A+ ++K I+E A GVN++C QEAW Sbjct: 62 AEEQLRRPRIVHVGLVQNRIPLPANAPVAEQVSALHRRIKAIVEVAAMCGVNIICFQEAW 121 Query: 450 MMPFAFCTREKR-WCEFAEPV-DG 515 MPFAFCTREK W EFAE DG Sbjct: 122 TMPFAFCTREKLPWTEFAESAEDG 145 >gi|6288790|gb|AAF06739.1| (AF169560) beta-ureidopropionase [Homo sapiens] Length = 387 Frame 3 hits (HSPs): ___________________ __________________________________________________ Database sequence: | | | | 387 0 150 300 Plus Strand HSPs: Score = 308 (108.4 bits), Expect = 1.7e-26, P = 1.7e-26 Identities = 67/142 (47%), Positives = 87/142 (61%), Frame = +3 Query: 90 YDSLHRLLKDNLNPHHFQEVSRLLTGLNCGKALEAVSLPESATSLSAEHGFDIQAFSFCA 269 + SL L+ +L QEV R+L G K L + LP A ++ F++Q ++F A Sbjct: 6 WKSLEECLEKHLPLPDLQEVKRVLYG----KELRKLDLPREAFEAASREDFELQGYAFEA 61 Query: 270 DKELLREPRIVRVGLIQNSIVLPTTAHFADQKKAIFEKLKPIIEAAGSSGVNVLCLQEAW 449 +E LR PRIV VGL+QN I LP A A+Q A+ ++K I+E A GVN++C QEAW Sbjct: 62 AEEQLRRPRIVHVGLVQNRIPLPANAPVAEQVSALHRRIKAIVEVAAMCGVNIICFQEAW 121 Query: 450 MMPFAFCTREKR-WCEFAEPV-DG 515 MPFAFCTREK W EFAE DG Sbjct: 122 TMPFAFCTREKLPWTEFAESAEDG 145 >gi|416730|sp|Q03248|BUP_RAT BETA-UREIDOPROPIONASE (BETA-ALANINE SYNTHASE) (N-CARBAMOYL-BETA-ALANINE AMIDOHYDROLASE) >gi|285064|pir||S27881 beta-alanine synthase - rat >gi|203106|gb|AAA40804.1| (M97662) beta-alanine synthase [Rattus norvegicus] Length = 393 Frame 3 hits (HSPs): ___________________ __________________________________________________ Database sequence: | | | | 393 0 150 300 Plus Strand HSPs: Score = 305 (107.4 bits), Expect = 3.6e-26, P = 3.6e-26 Identities = 65/142 (45%), Positives = 90/142 (63%), Frame = +3 Query: 90 YDSLHRLLKDNLNPHHFQEVSRLLTGLNCGKALEAVSLPESATSLSAEHGFDIQAFSFCA 269 + SL + L+ +L P +V R+L G K + LP A ++E F+++ ++F A Sbjct: 6 WQSLEQCLEKHLPPDDLSQVKRILYG----KQTRNLDLPRKALEAASERNFELKGYAFGA 61 Query: 270 DKELLREPRIVRVGLIQNSIVLPTTAHFADQKKAIFEKLKPIIEAAGSSGVNVLCLQEAW 449 KE R P+IVRVGL+QN I LPT+A A+Q A+ ++++ I E A GVN++C QEAW Sbjct: 62 AKEQQRCPQIVRVGLVQNRIPLPTSAPVAEQVSALHKRIEEIAEVAAMCGVNIICFQEAW 121 Query: 450 MMPFAFCTREKR-WCEFAEPV-DG 515 MPFAFCTREK W EFAE DG Sbjct: 122 NMPFAFCTREKLPWTEFAESAEDG 145 >gi|7499051|pir||T16068 hypothetical protein F13H8.7 - Caenorhabditis elegans >gi|722377|gb|AAC46683.1| (U23139) highly similar to beta-ureidopropionase (SP:BUP_RAT) [Caenorhabditis elegans] Length = 387 Frame 3 hits (HSPs): ___________________ __________________________________________________ Database sequence: | | | | 387 0 150 300 Plus Strand HSPs: Score = 265 (93.3 bits), Expect = 6.2e-22, P = 6.2e-22 Identities = 62/140 (44%), Positives = 83/140 (59%), Frame = +3 Query: 90 YDSLHRLLKDNLNPHHFQEVSRLLTGLNCGKALEAVSLPESATSLSAEHGFDIQAFSFCA 269 +D + L + L+ EV R+L G +ALE S+ E L+ + F + + A Sbjct: 8 FDGVETALAEKLDGVSLDEVERILYGRPY-RALEISSIAEK---LAQDGDFQLSGYIVDA 63 Query: 270 DKELLREPRIVRVGLIQNSIVLPTTAHFADQKKAIFEKLKPIIEAAGSSGVNVLCLQEAW 449 KE R PR+VRV IQN I PTT +Q+ AI +++ +IEAA S+G NV+ LQEAW Sbjct: 64 QKEQTRAPRLVRVAAIQNKIHRPTTDSVVEQRDAIHQRVGAMIEAAASAGANVIGLQEAW 123 Query: 450 MMPFAFCTREKR-WCEFAEPV 509 MPFAFCTRE+ W EFAE V Sbjct: 124 TMPFAFCTRERLPWTEFAESV 144 >gi|7304000|gb|AAF59043.1| (AE003835) CG13755 gene product [Drosophila melanogaster] Length = 60 Frame -1 hits (HSPs): ________________ _____________________ __________________________________________________ Database sequence: | | | | 60 0 20 40 Minus Strand HSPs: Score = 44 (15.5 bits), Expect = 0.44, Sum P(2) = 0.36 Identities = 10/16 (62%), Positives = 10/16 (62%), Frame = -1 Query: 537 HQQVQQIPHQQVQQIH 490 HQQ QQ QQ QQ H Sbjct: 13 HQQHQQQQQQQQQQQH 28 Score = 44 (15.5 bits), Expect = 0.44, Sum P(2) = 0.36 Identities = 10/25 (40%), Positives = 14/25 (56%), Frame = -1 Query: 270 QHKRKRPGCQNH-VPQRGKLQILGE 199 QH KR C+ H + L++LGE Sbjct: 33 QHATKRRRCRQHWIKHLQLLRLLGE 57 Score = 40 (14.1 bits), Expect = 1.1, Sum P(2) = 0.67 Identities = 10/14 (71%), Positives = 10/14 (71%), Frame = -1 Query: 537 HQQVQQIPHQQVQQ 496 HQQ QQ HQQ QQ Sbjct: 10 HQQHQQ--HQQQQQ 21 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.98 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.325 0.139 0.432 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.361 0.160 0.684 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.347 0.155 0.482 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.344 0.151 0.496 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.339 0.147 0.498 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.331 0.136 0.425 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 179 178 10. 76 3 12 22 0.11 34 31 0.11 37 +2 0 179 178 10. 76 3 12 22 0.11 34 31 0.11 37 +1 0 180 179 10. 76 3 12 22 0.11 34 31 0.12 37 -1 0 180 179 10. 76 3 12 22 0.11 34 31 0.12 37 -2 0 179 179 10. 76 3 12 22 0.11 34 31 0.12 37 -3 0 179 178 10. 76 3 12 22 0.11 34 31 0.11 37 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 9 No. of states in DFA: 592 (58 KB) Total size of DFA: 204 KB (256 KB) Time to generate neighborhood: 0.01u 0.00s 0.01t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 184.01u 0.93s 184.94t Elapsed: 00:00:38 Total cpu time: 184.05u 0.95s 185.00t Elapsed: 00:00:38 Start: Wed Feb 6 15:43:52 2002 End: Wed Feb 6 15:44:30 2002
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000