WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= A10A10MR_CONSENSUS (484 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 5 Sequences : less than 5 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 1409 281 |======================================================== 6310 1128 222 |============================================ 3980 906 235 |=============================================== 2510 671 196 |======================================= 1580 475 159 |=============================== 1000 316 88 |================= 631 228 63 |============ 398 165 49 |========= 251 116 41 |======== 158 75 21 |==== 100 54 15 |=== 63.1 39 7 |= 39.8 32 5 |= 25.1 27 7 |= 15.8 20 5 |= >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 15 <<<<<<<<<<<<<<<<< 10.0 15 2 |: 6.31 13 3 |: 3.98 10 1 |: 2.51 9 0 | 1.58 9 1 |: 1.00 8 1 |: 0.63 7 2 |: 0.40 5 1 |: 0.25 4 0 | 0.16 4 0 | 0.10 4 0 | 0.063 4 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|12321169|gb|AAG50671.1|AC079829_4(AC079829) unknow... +1 340 3.1e-29 1 gi|12320748|gb|AAG50526.1|AC084221_8(AC084221) unknow... +1 221 1.4e-26 2 gi|10177004|dbj|BAB10254.1|(AB020744) gene_id:K9E15.1... +1 106 9.4e-05 2 gi|4200286|emb|CAA68149.1|(X99836) rap55 [Pleurodeles... +1 86 0.052 2 gi|999021|gb|AAB34153.1|nuclear antigen EBNA1 [Epstei... +1 71 0.33 1 gi|330370|gb|AAA45883.1|(M13180) nuclear antigen (EBN... +1 70 0.41 1 gi|628184|pir||S42440nuclear antigen EBNA1 - human he... +1 70 0.41 1 gi|475488|gb|AAA17816.1|(U08173) phytochrome [Lycoper... +2 78 0.49 1 gi|999017|gb|AAB34150.1|nuclear antigen EBNA1 [Epstei... +1 66 0.78 1 gi|11419686ref|XP_010095.1| hypothetical protein FLJ2... +1 81 0.97 1 gi|808666|gb|AAA66540.1|(M12553) unknown protein [Hum... +1 62 0.99 1 gi|94231|pir||S23822hypothetical protein 2 - feline i... -2 61 0.996 1 gi|123999|sp||IAMY_COILA_4[Segment 4 of 6] ALPHA-AMYL... +1 61 0.996 1 gi|2323295|gb|AAB66534.1|(AF010418) ORF-2 [Hepatitis ... +1 38 0.9998 2 gi|2323299|gb|AAB66537.1|(AF010419) ORF-2 [Hepatitis ... +1 38 0.9998 2 Locally-aligned regions (HSPs) with respect to query sequence: Locus_ID Frame 2 Hits gi|475488 | ___________________ __________________________________________________ Query sequence: | | | | | 162 0 50 100 150 Locus_ID Frame 1 Hits gi|12321169 |____________________________________________ gi|12320748 |____________ ______________________________ gi|10177004 |_________________ ________________________ gi|4200286 |_________ __________________________ gi|999021 | ____________ gi|330370 | ____________ gi|628184 | ____________ gi|999017 | ____________ gi|11419686 | ____________________ gi|808666 | ____________ gi|123999 | __________________ gi|2323295 | _____ ____ gi|2323299 | _____ ____ __________________________________________________ Query sequence: | | | | | 162 0 50 100 150 Locus_ID Frame -2 Hits gi|94231 | __________ __________________________________________________ Query sequence: | | | | | 162 0 50 100 150
Use the and icons to retrieve links to Entrez:
>gi|12321169|gb|AAG50671.1|AC079829_4 (AC079829) unknown protein [Arabidopsis thaliana] Length = 611 Frame 1 hits (HSPs): ____________ __________________________________________________ Database sequence: | | | | || 611 0 150 300 450 600 Plus Strand HSPs: Score = 340 (119.7 bits), Expect = 3.1e-29, P = 3.1e-29 Identities = 74/143 (51%), Positives = 95/143 (66%), Frame = +1 Query: 1 VEVVQVSSTSSPEPSVPVSAETQPPILPLPVTSRPSYRPGVAPIQIHHGYNYRGRGRGRG 180 VEVVQVSS++ E SVPV++E QPPILPLP ++RP+ +P H+GY RGRGRGRG Sbjct: 392 VEVVQVSSSAGLEQSVPVTSEAQPPILPLPSSARPTQKPNGHSFPNHNGYRGRGRGRGRG 451 Query: 181 TGGLHPVTKFTEDFDFTAMNXKFKKDEVWGSSW*KLSLIQKKKDGERKMPLMKDYQDEEX 360 G H V KFTEDFDFTAMN KF KDEVWG K + + +D + P + + + + Sbjct: 452 AGRSHQVMKFTEDFDFTAMNEKFNKDEVWGHLG-KSTTLDGDEDDDS--PTVDEAELPK- 507 Query: 361 *RCFXL*S*PIYNKDDFFESLSS 429 + + P+YNKDDFF+SLSS Sbjct: 508 -----IEAKPVYNKDDFFDSLSS 525 >gi|12320748|gb|AAG50526.1|AC084221_8 (AC084221) unknown protein [Arabidopsis thaliana] Length = 643 Frame 1 hits (HSPs): ____ ____ ________ ___ __________________________________________________ Database sequence: | | | | | | 643 0 150 300 450 600 Plus Strand HSPs: Score = 221 (77.8 bits), Expect = 1.4e-26, Sum P(2) = 1.4e-26 Identities = 49/97 (50%), Positives = 62/97 (63%), Frame = +1 Query: 139 HHGYNYRGRGRGRGTGGLHPVTKFTEDFDFTAMNXKFKKDEVWGSSW*KLSLIQKKKDGE 318 H+GY RGRGRGRG G H V KFTEDFDFTAMN KF KDEVWG K + + +D + Sbjct: 470 HNGYRGRGRGRGRGAGRSHQVMKFTEDFDFTAMNEKFNKDEVWGHLG-KSTTLDGDEDDD 528 Query: 319 RKMPLMKDYQDEEX*RCFXL*S*PIYNKDDFFESLSS 429 P + + + + + + P+YNKDDFF+SLSS Sbjct: 529 S--PTVDEAELPK------IEAKPVYNKDDFFDSLSS 557 Score = 122 (42.9 bits), Expect = 1.4e-26, Sum P(2) = 1.4e-26 Identities = 24/38 (63%), Positives = 32/38 (84%), Frame = +1 Query: 1 VEVVQVSSTSSPEPSVPVSAETQPPILPLPVTSRPSYR 114 VEVVQVSS++ E SVPV++E QPPILPLP ++RP+ + Sbjct: 392 VEVVQVSSSAGLEQSVPVTSEAQPPILPLPSSARPTQK 429 Score = 53 (18.7 bits), Expect = 8.3e-09, Sum P(2) = 8.3e-09 Identities = 9/12 (75%), Positives = 10/12 (83%), Frame = +1 Query: 145 GYNYRGRGRGRG 180 GY Y GRG+GRG Sbjct: 626 GYGYGGRGQGRG 637 Score = 49 (17.2 bits), Expect = 6.0e-19, Sum P(2) = 6.0e-19 Identities = 11/33 (33%), Positives = 19/33 (57%), Frame = +1 Query: 16 VSSTSSPEPSVPVSAETQPPILP--LPVTSRPS 108 + ST PS +++E PP+L P+T+ P+ Sbjct: 267 LQSTLQSAPSPSLASEMAPPLLSNKAPITAPPT 299 Score = 41 (14.4 bits), Expect = 1.4e-07, Sum P(2) = 1.4e-07 Identities = 7/11 (63%), Positives = 9/11 (81%), Frame = +1 Query: 142 HGYNYRGRGRG 174 +GY RG+GRG Sbjct: 627 YGYGGRGQGRG 637 Score = 39 (13.7 bits), Expect = 2.3e-07, Sum P(2) = 2.3e-07 Identities = 10/17 (58%), Positives = 10/17 (58%), Frame = +1 Query: 145 GYNYRGRGR--GRGTGG 189 GY RG G GRG GG Sbjct: 608 GYGGRGYGGYGGRGGGG 624 >gi|10177004|dbj|BAB10254.1| (AB020744) gene_id:K9E15.11~unknown protein [Arabidopsis thaliana] Length = 571 Frame 1 hits (HSPs): _______________ __________________________________________________ Database sequence: | | | | | 571 0 150 300 450 Plus Strand HSPs: Score = 106 (37.3 bits), Expect = 9.4e-05, Sum P(2) = 9.4e-05 Identities = 29/74 (39%), Positives = 40/74 (54%), Frame = +1 Query: 205 KFTEDFDFTAMNXKFKKDEVWGSSW*KLSLIQKKKDGERKMPLMKDYQDEEX*RCFXL*S 384 ++TE+FDF AMN KFKK E+WG L ++ +R DY +E Sbjct: 426 EYTEEFDFEAMNEKFKKSELWGY----LG-----RNNQRNQ---NDYGEETAIEPNAEGK 473 Query: 385 *PIYNKDDFFESLS 426 P YNKDDFF+++S Sbjct: 474 -PAYNKDDFFDTIS 486 Score = 49 (17.2 bits), Expect = 9.4e-05, Sum P(2) = 9.4e-05 Identities = 11/30 (36%), Positives = 16/30 (53%), Frame = +1 Query: 28 SSPEPSVPVSAETQPPILPLPVTSRPSYRP 117 S+ PS + P+LPLPV++ S P Sbjct: 392 SANVPSQSFAPRNHAPLLPLPVSAHQSRIP 421 Score = 45 (15.8 bits), Expect = 0.00024, Sum P(2) = 0.00024 Identities = 15/59 (25%), Positives = 26/59 (44%), Frame = +1 Query: 4 EVVQVSSTSSPEPSVP----VSAETQPPILPLPVTSRPSYRPGVAP-IQIHHGYNYRGR 165 +VV ++ P S+P A P++P P++ P + P +Q YRG+ Sbjct: 324 KVVYDPQSNHPHRSIPHELPAVASNSAPVIPGPLSKSPESFFDMDPSLQSRQQMVYRGQ 382 >gi|4200286|emb|CAA68149.1| (X99836) rap55 [Pleurodeles waltl] Length = 467 Frame 1 hits (HSPs): ___ __________ __________________________________________________ Database sequence: | | | | | 467 0 150 300 450 Plus Strand HSPs: Score = 86 (30.3 bits), Expect = 0.053, Sum P(2) = 0.052 Identities = 32/86 (37%), Positives = 44/86 (51%), Frame = +1 Query: 115 PGVAPIQIHHGYNYRGRGRGR-GTGGLHPVTKFTEDFDFTAMNXKFKKDEVWGSSW*KLS 291 PG P + G +RG GRGR G P+ KF +DFDF + N +F K+E+ KL Sbjct: 267 PGAPPARRGRG-GHRG-GRGRFGIRRDGPM-KFEKDFDFESANAQFTKEEIDREFHNKLK 323 Query: 292 L-------IQKKKDGERKMPLMKDYQDEE 357 L ++K +GE K D Q+ E Sbjct: 324 LKDDKPEKVEKPVNGEDKGDSGIDTQNSE 352 Score = 50 (17.6 bits), Expect = 0.053, Sum P(2) = 0.052 Identities = 11/27 (40%), Positives = 13/27 (48%), Frame = +1 Query: 10 VQVSSTSSPEPSVPVSAETQPPILPLP 90 VQ + S P PV + PP PLP Sbjct: 199 VQTTPASHLPPPGPVGRRSPPPARPLP 225 >gi|999021|gb|AAB34153.1| nuclear antigen EBNA1 [Epstein-Barr virus EBV, C15 isolate, Peptide Partial, 90 aa, segment 2 of 2] Length = 90 Frame 1 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | | | 90 0 20 40 60 80 Plus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.39, P = 0.33 Identities = 19/35 (54%), Positives = 20/35 (57%), Frame = +1 Query: 94 TSRPSYRPGVAPIQIHHGYNY-RGRGRGRGTGGLHP 198 TS P G P Q G N+ RGRGRGRG GG P Sbjct: 20 TSGPDGSSGSGP-QRRGGDNHGRGRGRGRGRGGGRP 54 >gi|330370|gb|AAA45883.1| (M13180) nuclear antigen (EBNA 1) [Human herpesvirus 4] Length = 58 Frame 1 hits (HSPs): ______________________________ __________________________________________________ Database sequence: | | | | 58 0 20 40 Plus Strand HSPs: Score = 70 (24.6 bits), Expect = 0.52, P = 0.41 Identities = 19/35 (54%), Positives = 20/35 (57%), Frame = +1 Query: 94 TSRPSYRPGVAPIQIHHGYNY-RGRGRGRGTGGLHP 198 TS P G P Q G N+ RGRGRGRG GG P Sbjct: 20 TSGPEGSGGSGP-QRRGGDNHGRGRGRGRGRGGGRP 54 >gi|628184|pir||S42440 nuclear antigen EBNA1 - human herpesvirus 4 >gi|555157|gb|AAA45889.1| (M13941) nuclear antigen 1 [Human herpesvirus 4] Length = 66 Frame 1 hits (HSPs): ___________________________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | | 66 0 20 40 60 __________________ Annotated Domains: DOMO DM06186: EPSTEIN-BARRVIRUSNUCLEARANTIGEN 1..66 __________________ Plus Strand HSPs: Score = 70 (24.6 bits), Expect = 0.52, P = 0.41 Identities = 19/35 (54%), Positives = 20/35 (57%), Frame = +1 Query: 94 TSRPSYRPGVAPIQIHHGYNY-RGRGRGRGTGGLHP 198 TS P G P Q G N+ RGRGRGRG GG P Sbjct: 20 TSGPEGSGGSGP-QRRGGDNHGRGRGRGRGRGGGRP 54 >gi|475488|gb|AAA17816.1| (U08173) phytochrome [Lycopersicon esculentum] Length = 117 Frame 2 hits (HSPs): _______________________ __________________________________________________ Database sequence: | | | | 117 0 50 100 Plus Strand HSPs: Score = 78 (27.5 bits), Expect = 0.68, P = 0.49 Identities = 19/59 (32%), Positives = 31/59 (52%), Frame = +2 Query: 44 LCQFLLKLNHQYCHCQ*LHGPVTDPV*LLSKFIMAIIIEDVEEEGELGVCTQSQNSLRI 220 LC L+ H YCH Q + + ++ +MA+++ D +EEGE +QSQ R+ Sbjct: 41 LCGSTLRAPH-YCHLQYMENMNS-----IASLVMAVVVNDGDEEGESSDSSQSQKRKRL 93 >gi|999017|gb|AAB34150.1| nuclear antigen EBNA1 [Epstein-Barr virus EBV, NPC isolate, Peptide Partial, 90 aa, segment 2 of 2] Length = 90 Frame 1 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | | | 90 0 20 40 60 80 Plus Strand HSPs: Score = 66 (23.2 bits), Expect = 1.5, P = 0.78 Identities = 18/35 (51%), Positives = 20/35 (57%), Frame = +1 Query: 94 TSRPSYRPGVAPIQIHHGYNY-RGRGRGRGTGGLHP 198 +S P G P Q G N+ RGRGRGRG GG P Sbjct: 20 SSGPEGSGGSGP-QRRGGDNHGRGRGRGRGRGGGRP 54 >gi|11419686 ref|XP_010095.1| hypothetical protein FLJ20506 [Homo sapiens] Length = 238 Frame 1 hits (HSPs): ______________ __________________________________________________ Database sequence: | | | | | | 238 0 50 100 150 200 Plus Strand HSPs: Score = 81 (28.5 bits), Expect = 3.4, P = 0.97 Identities = 20/62 (32%), Positives = 33/62 (53%), Frame = +1 Query: 13 QVSSTSSPEPSVPVSAETQPPILP-LPVTSRPSYRPGVAPIQIHH-GYNYRGRGRGRGTG 186 Q+ T++ P P +A P+ P LP + +Y G PI+ HH +++ G+ + Sbjct: 42 QLPPTAALAPGAPRAARGSVPLQPPLPPAALGAYSGGAGPIRHHHPAHHFHHHGQAQP-- 99 Query: 187 GLHP 198 GLHP Sbjct: 100 GLHP 103 >gi|808666|gb|AAA66540.1| (M12553) unknown protein [Human herpesvirus 4] Length = 82 Frame 1 hits (HSPs): ______________________ __________________________________________________ Database sequence: | | | | | | 82 0 20 40 60 80 Plus Strand HSPs: Score = 62 (21.8 bits), Expect = 4.3, P = 0.99 Identities = 18/35 (51%), Positives = 19/35 (54%), Frame = +1 Query: 94 TSRPSYRPGVAPIQIHHGYNY-RGRGRGRGTGGLHP 198 TS P P Q G N+ RGRGRGRG GG P Sbjct: 20 TSGPEGSGRSGP-QRRGGDNHGRGRGRGRGRGGGRP 54 >gi|94231|pir||S23822 hypothetical protein 2 - feline immunodeficiency virus >gi|59289|emb|CAA40320.1| (X57002) ORF2 [Feline immunodeficiency virus] Length = 78 Frame -2 hits (HSPs): __________________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | | 78 0 20 40 60 __________________ Annotated Domains: DOMO DM04868: 1..78 __________________ Minus Strand HSPs: Score = 61 (21.5 bits), Expect = 5.6, P = 1.0 Identities = 12/28 (42%), Positives = 18/28 (64%), Frame = -2 Query: 414 KEVILVVNRLTSKXETSSXFFILVIFHQ 331 +E+I++ NR+T K E S I V+ HQ Sbjct: 2 EEIIVLFNRVTEKLEKDSAIRIFVLAHQ 29 >gi|123999|sp||IAMY_COILA_4 [Segment 4 of 6] ALPHA-AMYLASE INHIBITOR/ENDOCHITINASE Length = 76 Frame 1 hits (HSPs): ____________________________________ __________________________________________________ Database sequence: | | | | | 76 0 20 40 60 Plus Strand HSPs: Score = 61 (21.5 bits), Expect = 5.6, P = 1.0 Identities = 18/55 (32%), Positives = 22/55 (40%), Frame = +1 Query: 55 SAETQPPILPLPVTSRPSYRPGVAPIQIHHGYNYRGRGRGRGTGGLHPVTKFTED 219 +A P P + Y G PIQI YNY GR G GL + +D Sbjct: 2 NAYCDPSKTQKPCAAGKKYY-GRGPIQISXNYNYGPAGRAIGMDGLGNPDRVAQD 55 >gi|2323295|gb|AAB66534.1| (AF010418) ORF-2 [Hepatitis E virus] Length = 39 Frame 1 hits (HSPs): _____________________________ __________________________________________________ Database sequence: | | | 39 0 20 Plus Strand HSPs: Score = 38 (13.4 bits), Expect = 8.6, Sum P(2) = 1.0 Identities = 7/14 (50%), Positives = 10/14 (71%), Frame = +1 Query: 73 PILPLPVTSRPSYR 114 P+LP P + +PS R Sbjct: 15 PMLPAPPSGQPSGR 28 Score = 36 (12.7 bits), Expect = 8.6, Sum P(2) = 1.0 Identities = 7/10 (70%), Positives = 8/10 (80%), Frame = +1 Query: 160 GRGRGRGTGG 189 GR RGR +GG Sbjct: 27 GRRRGRRSGG 36 >gi|2323299|gb|AAB66537.1| (AF010419) ORF-2 [Hepatitis E virus] >gi|2323303|gb|AAB66540.1| (AF010420) ORF-2 [Hepatitis E virus] Length = 39 Frame 1 hits (HSPs): _____________________________ __________________________________________________ Database sequence: | | | 39 0 20 Plus Strand HSPs: Score = 38 (13.4 bits), Expect = 8.6, Sum P(2) = 1.0 Identities = 7/14 (50%), Positives = 10/14 (71%), Frame = +1 Query: 73 PILPLPVTSRPSYR 114 P+LP P + +PS R Sbjct: 15 PMLPAPPSGQPSGR 28 Score = 36 (12.7 bits), Expect = 8.6, Sum P(2) = 1.0 Identities = 7/10 (70%), Positives = 8/10 (80%), Frame = +1 Query: 160 GRGRGRGTGG 189 GR RGR +GG Sbjct: 27 GRRRGRRSGG 36 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.96 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.361 0.158 0.584 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.351 0.155 0.535 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.330 0.145 0.466 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.352 0.156 0.536 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.348 0.153 0.574 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.354 0.159 0.569 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 160 156 10. 74 3 12 22 0.097 34 30 0.12 36 +2 0 161 157 10. 74 3 12 22 0.098 34 30 0.12 36 +1 0 161 156 10. 74 3 12 22 0.097 34 30 0.12 36 -1 0 161 156 10. 74 3 12 22 0.097 34 30 0.12 36 -2 0 161 156 10. 74 3 12 22 0.097 34 30 0.12 36 -3 0 160 154 10. 74 3 12 22 0.095 34 30 0.12 36 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 15 No. of states in DFA: 590 (58 KB) Total size of DFA: 191 KB (192 KB) Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 155.82u 1.17s 156.99t Elapsed: 00:00:26 Total cpu time: 155.84u 1.20s 157.04t Elapsed: 00:00:26 Start: Mon Oct 1 22:22:33 2001 End: Mon Oct 1 22:22:59 2001
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000