WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= 'D10E03_J03_10.ab1' (656 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 4 Sequences : less than 4 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 1063 224 |======================================================== 6310 839 155 |====================================== 3980 684 198 |================================================= 2510 486 155 |====================================== 1580 331 84 |===================== 1000 247 92 |======================= 631 155 42 |========== 398 113 35 |======== 251 78 25 |====== 158 53 16 |==== 100 37 2 |: 63.1 35 9 |== 39.8 26 3 |: 25.1 23 8 |== 15.8 15 6 |= >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 9 <<<<<<<<<<<<<<<<< 10.0 9 0 | 6.31 9 1 |: 3.98 8 1 |: 2.51 7 0 | 1.58 7 1 |: 1.00 6 0 | 0.63 6 1 |: 0.40 5 0 | 0.25 5 0 | 0.16 5 0 | 0.10 5 0 | 0.063 5 0 | 0.040 5 0 | 0.025 5 0 | 0.016 5 0 | 0.010 5 0 | 0.0063 5 0 | 0.0040 5 0 | 0.0025 5 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|7484675|pir||T09117multisubunit regulator protein ... +2 786 3.8e-77 1 gi|1169013|sp|P43255|COP9_ARATHCOP9 PROTEIN (FUSCA PR... +2 760 2.2e-74 1 gi|5729779ref|NP_006701.1| COP9 homolog [Homo sapiens... +2 284 6.0e-24 1 gi|12729210ref|XP_010948.1| similar to COP9 homolog (... +2 269 2.3e-22 1 gi|7297391|gb|AAF52650.1|(AE003621) CG13383 gene prod... +2 97 0.0020 1 gi|3914462|sp|O42897|PSD3_SCHPOPROBABLE 26S PROTEASOM... +2 96 0.33 1 gi|87791|pir||A24629Ig gamma-3 chain C region - human... -1 69 0.74 1 gi|6473221|dbj|BAA87112.1|(AB027808) Hypothetical nuc... -1 65 0.98 1 gi|102986|pir||S05589Balbiani ring protein 1-beta (cl... -1 64 0.992 1
Use the and icons to retrieve links to Entrez:
>gi|7484675|pir||T09117 multisubunit regulator protein COP9 - spinach >gi|1256771|gb|AAA96516.1| (U51270) COP9 [Spinacia oleracea] Length = 204 Frame 2 hits (HSPs): __________________________________________________ __________________________________________________ Database sequence: | | | | | | 204 0 50 100 150 200 Plus Strand HSPs: Score = 786 (276.7 bits), Expect = 3.8e-77, P = 3.8e-77 Identities = 144/204 (70%), Positives = 176/204 (86%), Frame = +2 Query: 17 IEELGV-MDFSSVRAALDSKSYDKVADVCDNLMLQVAAEGIAYQDDWPYTIHLLSHIYVH 193 +EE+ MDFS + A+ S+SYDK+AD+CDNLMLQV+AEGI +Q+DWPY IHLL HIY Sbjct: 1 MEEVATTMDFSPLTDAIASESYDKIADICDNLMLQVSAEGIVFQNDWPYAIHLLGHIYAG 60 Query: 194 DINSARFLWKSIPSSIKESQPEVTAVWKIGQKLWLRDYAGVHEAIRGFDWTQELQTLVAA 373 DINSARFLWKSIP +IKESQPE+ AVW IGQKLW RDYAGV+EA+R F+W+ ++ +AA Sbjct: 61 DINSARFLWKSIPIAIKESQPEIIAVWGIGQKLWTRDYAGVYEAVRSFNWSPQIHPFIAA 120 Query: 374 FSELYTKEMFQLLLSAYSTISIKDTALFLGMNEDDATNYVLQQGWTVDPASQMLIVKKQP 553 FS+ TK+MFQLL++AYSTIS++DTALFLGM+ED+AT YVLQ+GW +D A++ML VKKQ Sbjct: 121 FSDNNTKKMFQLLVAAYSTISVQDTALFLGMSEDEATTYVLQEGWILDSAARMLTVKKQT 180 Query: 554 VVTEQKLDPSKLQRLTEYVFHLEH 625 +VTEQKLDPSKLQRLTEYVFHLEH Sbjct: 181 IVTEQKLDPSKLQRLTEYVFHLEH 204 >gi|1169013|sp|P43255|COP9_ARATH COP9 PROTEIN (FUSCA PROTEIN FUS7) >gi|625971|pir||A54842 COP9 protein - Arabidopsis thaliana >gi|530870|gb|AAA32773.1| (L32874) CSN8 [Arabidopsis thaliana] >gi|2244767|emb|CAB10190.1| (Z97335) COP9 protein [Arabidopsis thaliana] >gi|7268116|emb|CAB78453.1| (AL161538) COP9 protein [Arabidopsis thaliana] Length = 197 Frame 2 hits (HSPs): __________________________________________________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | | 197 0 50 100 150 __________________ Annotated Domains: PRODOM PD022805: COP9(1) Q41369(1) Q99627(1) 1..196 __________________ Plus Strand HSPs: Score = 760 (267.5 bits), Expect = 2.2e-74, P = 2.2e-74 Identities = 142/197 (72%), Positives = 163/197 (82%), Frame = +2 Query: 35 MDFSSVRAALDSKSYDKVADVCDNLMLQVAAEGIAYQDDWPYTIHLLSHIYVHDINSARF 214 MD S V+ AL +KS+DK+AD+CD LMLQVA+EGI Y DDWPY IHLL + YV D +SARF Sbjct: 1 MDLSPVKEALAAKSFDKIADICDTLMLQVASEGIEYHDDWPYAIHLLGYFYVDDCDSARF 60 Query: 215 LWKSIPSSIKESQPEVTAVWKIGQKLWLRDYAGVHEAIRGFDWTQELQTLVAAFSELYTK 394 LWK IP++IKE +PEV A W IGQKLW DYAGV+EAIRG+DW+QE + +VAAFS+LYTK Sbjct: 61 LWKRIPTAIKERKPEVVAAWGIGQKLWTHDYAGVYEAIRGYDWSQEAKDMVAAFSDLYTK 120 Query: 395 EMFQLLLSAYSTISIKDTALFLGMNEDDATNYVLQQGWTVDPASQMLIVKKQPVVTEQKL 574 MFQLLLSAYSTI+I D ALFLGM EDDAT YV++ GWTVD ASQM VKKQ V EQK+ Sbjct: 121 RMFQLLLSAYSTITIHDLALFLGMTEDDATTYVVENGWTVDAASQMASVKKQAVKREQKV 180 Query: 575 DPSKLQRLTEYVFHLEH 625 D SKLQRLTEYVFHLEH Sbjct: 181 DSSKLQRLTEYVFHLEH 197 >gi|5729779 ref|NP_006701.1| COP9 homolog [Homo sapiens] >gi|1730284|gb|AAB38529.1| (U51205) COP9 signalosome subunit 1 CSN1 [Homo sapiens] Length = 209 Frame 2 hits (HSPs): __________________________________________ __________________________________________________ Database sequence: | | | | | | 209 0 50 100 150 200 Plus Strand HSPs: Score = 284 (100.0 bits), Expect = 6.0e-24, P = 6.0e-24 Identities = 59/177 (33%), Positives = 102/177 (57%), Frame = +2 Query: 74 SYDKVADVCDNLMLQVAAEGIAYQDDWPYTIHLLSHIYVH-DINSARFLWKSIPSSIKES 250 S+ K+ D C+N L+ A GIA P LL+ +H D+N+AR+LWK IP +IK + Sbjct: 12 SFKKLLDQCENQELE-APGGIATP---PVYGQLLALYLLHNDMNNARYLWKRIPPAIKSA 67 Query: 251 QPEVTAVWKIGQKLWLRDYAGVHEAIRGFDWTQELQTLVAAFSELYTKEMFQLLLSAYST 430 E+ +W +GQ++W RD+ G++ I W++ +Q ++ A + + F L+ AY++ Sbjct: 68 NSELGGIWSVGQRIWQRDFPGIYTTINAHQWSETVQPIMEALRDATRRRAFALVSQAYTS 127 Query: 431 ISIKDTALFLGMNEDDATNYVLQQGWTVDPASQMLIVKKQPVVTEQKLDPSKLQRLTE 604 I D A F+G+ ++A +L+QGW D ++M++ +K PV + +K L+E Sbjct: 128 IIADDFAAFVGLPVEEAVKGILEQGWQADSTTRMVLPRK-PVAGALDVSFNKFIPLSE 184 >gi|12729210 ref|XP_010948.1| similar to COP9 homolog (H. sapiens) [Homo sapiens] Length = 209 Frame 2 hits (HSPs): __________________________________________ __________________________________________________ Database sequence: | | | | | | 209 0 50 100 150 200 Plus Strand HSPs: Score = 269 (94.7 bits), Expect = 2.3e-22, P = 2.3e-22 Identities = 55/177 (31%), Positives = 101/177 (57%), Frame = +2 Query: 74 SYDKVADVCDNLMLQVAAEGIAYQDDWPYTIHLLSHIYVHDINSARFLWKSIPSSIKESQ 253 S+ K+ D C+N L+ A GIA + + L + +D+N+AR+LWK IP + K + Sbjct: 12 SFKKLLDQCENQELE-APGGIATPPVYGQRLALC--LLRNDMNNARYLWKRIPPATKSAN 68 Query: 254 PEVTAVWKIGQKLWLRDYAGVHEAIRGFDWTQELQTLVAAFSELYTKEMFQLLLSAYSTI 433 E+ +W +G+++W RD+ G++ I W++ +Q ++ A + + F L+ AY++I Sbjct: 69 SELGGIWSVGRRVWQRDFPGIYTTINAHQWSETVQPIMEALRDATRRRTFALVSQAYTSI 128 Query: 434 SIKDTALFLGMNEDDATNYVLQQGWTVDPASQMLIVKKQPVVTEQKLDPSKLQRLTE 604 D A F+G+ ++A +L+QGW VD ++M++ +K PV + +K L+E Sbjct: 129 ITDDFAAFVGLPIEEAVKGLLEQGWQVDSTTRMVLPRK-PVAGALDVSFNKFIPLSE 184 >gi|7297391|gb|AAF52650.1| (AE003621) CG13383 gene product [Drosophila melanogaster] Length = 134 Frame 2 hits (HSPs): __________________________________________________ __________________________________________________ Database sequence: | | | | 134 0 50 100 Plus Strand HSPs: Score = 97 (34.1 bits), Expect = 0.0020, P = 0.0020 Identities = 34/133 (25%), Positives = 72/133 (54%), Frame = +2 Query: 227 IPSSIKESQPEVTAVWKIGQKLWLRDYAGVHEAIRGFDWTQELQTLVAAFSELYTKEMFQ 406 +P+++++ + E+ + + L +YA + I+ ++W++ +++ V +E+F+ Sbjct: 3 VPANLRDDK-ELIQLNLLNIALQNNNYADFFKHIK-YEWSERVKSPVEDLLNKQREELFK 60 Query: 407 LLLSAYSTISIKDTALFLG-MNEDDATNYVLQQGWTVDPASQMLIVKKQPVVTEQ---KL 574 L+ SAY +I + L L M+ED+ + WT + +I+K P V E + Sbjct: 61 LMGSAYMSI-YQHNLLELSLMSEDELKHACAALNWTEELDGDRVILK--PKVQEAPPARG 117 Query: 575 DPSKLQRLTEYVFHLEH 625 + +L +LTE+V LE+ Sbjct: 118 NDDQLLKLTEFVTFLEN 134 >gi|3914462|sp|O42897|PSD3_SCHPO PROBABLE 26S PROTEASOME REGULATORY SUBUNIT S3 >gi|7492878|pir||T39299 probable proteosome subunit - fission yeast (Schizosaccharomyces pombe) (fragment) >gi|2959362|emb|CAA17916.1| (AL022117) 26s proteasome regulatory subunit [Schizosaccharomyces pombe] Length = 436 Frame 2 hits (HSPs): ______________ Annotated Domains: ____ __________ __________________________________________________ Database sequence: | | | | 436 0 150 300 __________________ Annotated Domains: PFAM PCI: PCI domain 292..373 PROSITE LEUCINE_ZIPPER: Leucine zipper pattern. 164..185 __________________ Plus Strand HSPs: Score = 96 (33.8 bits), Expect = 0.40, P = 0.33 Identities = 29/116 (25%), Positives = 56/116 (48%), Frame = +2 Query: 158 YTIHLLSHIYVHDINSAR-FLWKSIPSSIKESQPEVTAVWKIGQKLWLRDYAGVHEAIRG 334 Y +H++ + + +I R F KS+ ++ AV +IG D +EA Sbjct: 231 YKLHIVVQLLMGEIPERRIFRQKSLEKTLVPYLRISQAV-RIGDLCAFTDALSKYEAEFR 289 Query: 335 FDWTQELQTLVAAFSELYTKEMFQLLLSAYSTISIKDTALFLGMNEDDATNYVLQQG 505 FD L TL+ K +++ +YS IS++D + LG++ +++ Y++ +G Sbjct: 290 FDG---LYTLICRLRHTVIKTGLRMISLSYSRISLRDVCIKLGLDSEESAEYIVAKG 343 >gi|87791|pir||A24629 Ig gamma-3 chain C region - human (fragment) >gi|184745|gb|AAC82525.1| (J00220) immunoglobulin gamma-3 heavy chain constant region [Homo sapiens] >gi|683576|emb|CAA28307.1| (X04646) immunoglobulin gamma heavy chain [Homo sapiens] Length = 90 Frame -1 hits (HSPs): _______________________________ __________________________________________________ Database sequence: | | | | | | 90 0 20 40 60 80 Minus Strand HSPs: Score = 69 (24.3 bits), Expect = 1.4, P = 0.74 Identities = 20/57 (35%), Positives = 24/57 (42%), Frame = -1 Query: 188 HKCERGGELCRANRPDKQCPQLQPEASSCRTHPQPCRTTSNPTQ-PSPKRNPSLPIPQ 18 H C R E + P CP+ PE SC T P PC P +P P P P+ Sbjct: 22 HTCPRCPEPKSCDTPPP-CPRC-PEPKSCDT-PPPCPRCPEPKSCDTPPPCPRCPAPE 76 >gi|6473221|dbj|BAA87112.1| (AB027808) Hypothetical nuclear protein [Schizosaccharomyces pombe] Length = 71 Frame -1 hits (HSPs): _______________________________ __________________________________________________ Database sequence: | | | | | 71 0 20 40 60 Minus Strand HSPs: Score = 65 (22.9 bits), Expect = 3.8, P = 0.98 Identities = 14/45 (31%), Positives = 24/45 (53%), Frame = -1 Query: 176 RGGELCRANRPDKQCPQLQPEASSCRTHPQPCRTTSNPTQPSPKR 42 R ++CR + +C QP S+C +H PC T+ P + + +R Sbjct: 9 RACDMCRKRKI--RCDGKQPACSNCVSHGIPCVFTARPKRRTGQR 51 >gi|102986|pir||S05589 Balbiani ring protein 1-beta (clone 14-2) - midge (Chironomus pallidivittatus) (fragment) >gi|683541|emb|CAA31444.1| (X13030) giant secretory protein [Chironomus pallidivittatus] Length = 75 Frame -1 hits (HSPs): _________________________________________ __________________________________________________ Database sequence: | | | | | 75 0 20 40 60 Minus Strand HSPs: Score = 64 (22.5 bits), Expect = 4.9, P = 0.99 Identities = 18/61 (29%), Positives = 28/61 (45%), Frame = -1 Query: 197 CREHKCERGGELCR---ANRPDKQCPQLQPEASSCRTHPQPCRTTSNPTQPSPKR-NPSL 30 C + G+ CR AN+P K P+ + + S +P ++ P +PS R PS Sbjct: 14 CAKKNGRFNGKNCRCTSANKPSKSGPKPERPSKSGPKPERPSKSGPKPERPSKSRPKPSR 73 Query: 29 P 27 P Sbjct: 74 P 74 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=6.00 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.355 0.153 0.527 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.328 0.139 0.430 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.354 0.158 0.599 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.333 0.139 0.443 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.347 0.153 0.490 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.369 0.158 0.558 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 218 218 10. 77 3 12 22 0.10 35 32 0.12 38 +2 0 218 218 10. 77 3 12 22 0.10 35 32 0.12 38 +1 0 218 218 10. 77 3 12 22 0.10 35 32 0.12 38 -1 0 218 218 10. 77 3 12 22 0.10 35 32 0.12 38 -2 0 218 218 10. 77 3 12 22 0.10 35 32 0.12 38 -3 0 218 218 10. 77 3 12 22 0.10 35 32 0.12 38 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 9 No. of states in DFA: 595 (59 KB) Total size of DFA: 230 KB (256 KB) Time to generate neighborhood: 0.02u 0.00s 0.02t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 262.32u 1.04s 263.36t Elapsed: 00:00:55 Total cpu time: 262.36u 1.06s 263.42t Elapsed: 00:00:55 Start: Thu Jan 17 01:38:14 2002 End: Thu Jan 17 01:39:09 2002
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000