WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= B12D05.seq(1>455) (426 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 4 Sequences : less than 4 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 1005 242 |============================================================ 6310 763 232 |========================================================== 3980 531 127 |=============================== 2510 404 121 |============================== 1580 283 69 |================= 1000 214 67 |================ 631 147 34 |======== 398 113 27 |====== 251 86 17 |==== 158 69 17 |==== 100 52 11 |== 63.1 41 18 |==== 39.8 23 4 |= 25.1 19 5 |= 15.8 14 3 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 11 <<<<<<<<<<<<<<<<< 10.0 11 1 |: 6.31 10 0 | 3.98 10 3 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|7487070|pir||T04747hypothetical protein T16H5.20 -... +3 495 2.6e-46 1 gi|9758986|dbj|BAB09496.1|(AB019224) regulatory prote... +3 478 1.7e-44 1 gi|1773295|gb|AAC49611.1|(U76707) regulatory protein ... +3 285 2.9e-23 1 gi|9988453|dbj|BAB12719.1|(AP002746) putative regulat... +3 260 1.4e-20 1 gi|7487977|pir||T04267NPR1 protein homolog F20B18.230... +3 244 8.4e-19 1 gi|11357616|pir||T47773hypothetical protein F24I3.210... +3 191 2.7e-13 1 gi|3894187|gb|AAC78536.1|(AC005662) hypothetical prot... +3 154 3.0e-09 1 gi|4507183ref|NP_003554.1| speckle-type POZ protein [... +3 82 0.97 1 gi|9964427ref|NP_064895.1| AMV113 [Amsacta moorei ent... +1 62 0.97 1 gi|11359861|pir||JC7326bood POZ containing protein - ... +3 83 0.97 1 gi|6679309ref|NP_032861.1| per-hexamer repeat gene 4 ... +2 65 0.9999 1
Use the
and
icons to retrieve links to Entrez:
>gi|7487070|pir||T04747 hypothetical protein T16H5.20 - Arabidopsis thaliana >gi|3250675|emb|CAA19683.1| (AL024486) putative protein [Arabidopsis thaliana] >gi|7268762|emb|CAB78968.1| (AL161551) putative protein [Arabidopsis thaliana] Length = 601 Frame 3 hits (HSPs): ____________ __________________________________________________ Database sequence: | | | | || 601 0 150 300 450 600 Plus Strand HSPs: Score = 495 (174.2 bits), Expect = 2.6e-46, P = 2.6e-46 Identities = 94/132 (71%), Positives = 114/132 (86%), Frame = +3 Query: 6 PFSVHRCILASRSKFFHELFKREKGSSEKEGKLKYNMNDLLPYGKVGYEAFLIFLGYVYT 185 P SVHRC+LA+RSKFF +LFK++K SSEK K KY M DLLPYG VG EAFL FL Y+YT Sbjct: 66 PVSVHRCVLAARSKFFLDLFKKDKDSSEK--KPKYQMKDLLPYGNVGREAFLHFLSYIYT 123 Query: 186 GKLKPSPMEVSTCVDNVCAHDACRPAINFAVELMYASSIFQIPELVSLFQRRLLNFIGKA 365 G+LKP P+EVSTCVD+VCAHD+C+PAI+FAVELMYAS +FQIP+LVS FQR+L N++ K+ Sbjct: 124 GRLKPFPIEVSTCVDSVCAHDSCKPAIDFAVELMYASFVFQIPDLVSSFQRKLRNYVEKS 183 Query: 366 LVEDVIPILTVA 401 LVE+V+PIL VA Sbjct: 184 LVENVLPILLVA 195
>gi|9758986|dbj|BAB09496.1| (AB019224) regulatory protein NPR1-like; transcription factor inhibitor I kappa B-like [Arabidopsis thaliana] Length = 593 Frame 3 hits (HSPs): _____________ __________________________________________________ Database sequence: | | | | | 593 0 150 300 450 Plus Strand HSPs: Score = 478 (168.3 bits), Expect = 1.7e-44, P = 1.7e-44 Identities = 90/140 (64%), Positives = 112/140 (80%), Frame = +3 Query: 6 PFSVHRCILASRSKFFHELFKREKGSSEKEGKLKYNMNDLLPYGKVGYEAFLIFLGYVYT 185 P VHRCILA+RSKFF +LFK+EK S+ E K KY + ++LPYG V +EAFL FL Y+YT Sbjct: 70 PVGVHRCILAARSKFFQDLFKKEKKISKTE-KPKYQLREMLPYGAVAHEAFLYFLSYIYT 128 Query: 186 GKLKPSPMEVSTCVDNVCAHDACRPAINFAVELMYASSIFQIPELVSLFQRRLLNFIGKA 365 G+LKP P+EVSTCVD VC+HD CRPAI+F V+LMYASS+ Q+PELVS FQRRL NF+ K Sbjct: 129 GRLKPFPLEVSTCVDPVCSHDCCRPAIDFVVQLMYASSVLQVPELVSSFQRRLCNFVEKT 188 Query: 366 LVEDVIPILTVAMSSQSSLL 425 LVE+V+PIL VA + + + L Sbjct: 189 LVENVLPILMVAFNCKLTQL 208
>gi|1773295|gb|AAC49611.1| (U76707) regulatory protein NPR1 [Arabidopsis thaliana] >gi|1916912|gb|AAB58262.1| (U87794) transcription factor inhibitor I kappa B homolog [Arabidopsis thaliana] >gi|12323466|gb|AAG51705.1|AC066689_4 (AC066689) transcription factor inhibitor I kappa B, putative; 88267-90345 [Arabidopsis thaliana] Length = 593 Frame 3 hits (HSPs): ____________ __________________________________________________ Database sequence: | | | | | 593 0 150 300 450 Plus Strand HSPs: Score = 285 (100.3 bits), Expect = 2.9e-23, P = 2.9e-23 Identities = 53/132 (40%), Positives = 87/132 (65%), Frame = +3 Query: 12 SVHRCILASRSKFFHELF---KREKGSSEKEGKLKYNMNDLLPYGKVGYEAFLIFLGYVY 182 S HRC+L++RS FF K+EK S+ +K + ++ +VG+++ + L YVY Sbjct: 78 SFHRCVLSARSSFFKSALAAAKKEKDSNNTAA-VKLELKEIAKDYEVGFDSVVTVLAYVY 136 Query: 183 TGKLKPSPMEVSTCVDNVCAHDACRPAINFAVELMYASSIFQIPELVSLFQRRLLNFIGK 362 + +++P P VS C D C H ACRPA++F +E++Y + IF+IPEL++L+QR LL+ + K Sbjct: 137 SSRVRPPPKGVSECADENCCHVACRPAVDFMLEVLYLAFIFKIPELITLYQRHLLDVVDK 196 Query: 363 ALVEDVIPILTVA 401 ++ED + IL +A Sbjct: 197 VVIEDTLVILKLA 209
>gi|9988453|dbj|BAB12719.1| (AP002746) putative regulatory protein NPR1 [Oryza sativa] >gi|10934082|dbj|BAB16860.1| (AP002537) Arabidopsis thaliana regulatory protein NPR1 like protein [Oryza sativa] Length = 582 Frame 3 hits (HSPs): ____________ __________________________________________________ Database sequence: | | | | | 582 0 150 300 450 Plus Strand HSPs: Score = 260 (91.5 bits), Expect = 1.4e-20, P = 1.4e-20 Identities = 54/137 (39%), Positives = 84/137 (61%), Frame = +3 Query: 15 VHRCILASRSKFFHELFKREK----GSSEKEGKLKYNMNDLLPYG----KVGYEAFLIFL 170 VHRC+L++RS F +F R G ++G + + +LL G +VGYEA + L Sbjct: 73 VHRCVLSARSPFLRGVFARRAAAAAGGGGEDGGERLELRELLGGGGEEVEVGYEALRLVL 132 Query: 171 GYVYTGKLKPSPMEVSTCVDNVCAHDACRPAINFAVELMYASSIFQIPELVSLFQRRLLN 350 Y+Y+G++ P CVD CAH C PA+ F ++++A+S FQ+ EL +LFQRRLL+ Sbjct: 133 DYLYSGRVGDLPKAACLCVDEDCAHVGCHPAVAFMAQVLFAASTFQVAELTNLFQRRLLD 192 Query: 351 FIGKALVEDVIPILTVA 401 + K V++++ IL+VA Sbjct: 193 VLDKVEVDNLLLILSVA 209
>gi|7487977|pir||T04267 NPR1 protein homolog F20B18.230 - Arabidopsis thaliana >gi|4538941|emb|CAB39677.1| (AL049483) NPR1 like protein [Arabidopsis thaliana] >gi|7269463|emb|CAB79467.1| (AL161564) NPR1 like protein [Arabidopsis thaliana] Length = 600 Frame 3 hits (HSPs): ___________ __________________________________________________ Database sequence: | | | | | 600 0 150 300 450 Plus Strand HSPs: Score = 244 (85.9 bits), Expect = 8.4e-19, P = 8.4e-19 Identities = 51/126 (40%), Positives = 80/126 (63%), Frame = +3 Query: 12 SVHRCILASRSKFFHELF---KREKGSSEKEGKLKYNMNDLLPYGKVGYEAFLIFLGYVY 182 S HRCIL++R F K +K S+ + +LK D Y +VG+++ + L YVY Sbjct: 80 SFHRCILSARIPVFKSALATVKEQKSSTTVKLQLKEIARD---Y-EVGFDSVVAVLAYVY 135 Query: 183 TGKLKPSPMEVSTCVDNVCAHDACRPAINFAVELMYASSIFQIPELVSLFQRRLLNFIGK 362 +G+++ P S CVD+ C H ACR ++F VE++Y S +FQI ELV+L++R+ L + K Sbjct: 136 SGRVRSPPKGASACVDDDCCHVACRSKVDFMVEVLYLSFVFQIQELVTLYERQFLEIVDK 195 Query: 363 ALVEDVIPI 389 +VED++ I Sbjct: 196 VVVEDILVI 204
>gi|11357616|pir||T47773 hypothetical protein F24I3.210 - Arabidopsis thaliana >gi|6911883|emb|CAB72183.1| (AL138655) putative protein [Arabidopsis thaliana] Length = 467 Frame 3 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | 467 0 150 300 450 Plus Strand HSPs: Score = 191 (67.2 bits), Expect = 2.7e-13, P = 2.7e-13 Identities = 46/142 (32%), Positives = 75/142 (52%), Frame = +3 Query: 18 HRCILASRSKFFHELFKR--------EKGSSEKEGKLKYNMNDLLPYGKVGYEAFLIFLG 173 HRCILA+RS FF + F E + G + ++P VGYE FL+ L Sbjct: 41 HRCILAARSLFFRKFFCESDPSQPGAEPANQTGSGARAAAVGGVIPVNSVGYEVFLLLLQ 100 Query: 174 YVYTGKLK--PSPMEV-STCVDNVCAHDACRPAINFAVELMYASSIFQIPELVSLFQRRL 344 ++Y+G++ P E S C D C H C A++ +++++ A+ F + +L L Q+ L Sbjct: 101 FLYSGQVSIVPHKHEPRSNCGDRGCWHTHCTAAVDLSLDILAAARYFGVEQLALLTQKHL 160 Query: 345 LNFIGKALVEDVIPILTVAMSSQ 413 + + KA +EDV+ +L +A Q Sbjct: 161 TSMVEKASIEDVMKVL-IASRKQ 182
>gi|3894187|gb|AAC78536.1| (AC005662) hypothetical protein [Arabidopsis thaliana] Length = 491 Frame 3 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | 491 0 150 300 450 Plus Strand HSPs: Score = 154 (54.2 bits), Expect = 3.0e-09, P = 3.0e-09 Identities = 32/99 (32%), Positives = 57/99 (57%), Frame = +3 Query: 123 LLPYGKVGYEAFLIFLGYVYTGKLKPSPMEVS---TCVDNVCAHDACRPAINFAVELMYA 293 ++P VGYE FL+ L ++Y+G++ P + C + C H C A++ A++ + A Sbjct: 89 IIPVNSVGYEVFLLLLQFLYSGQVSIVPQKHEPRPNCGERGCWHTHCSAAVDLALDTLAA 148 Query: 294 SSIFQIPELVSLFQRRLLNFIGKALVEDVIPILTVAMSSQ 413 S F + +L L Q++L + + KA +EDV+ +L +A Q Sbjct: 149 SRYFGVEQLALLTQKQLASMVEKASIEDVMKVL-IASRKQ 187 Score = 84 (29.6 bits), Expect = 2.8, P = 0.94 Identities = 24/79 (30%), Positives = 39/79 (49%), Frame = +3 Query: 18 HRCILASRSKFFHELF------KREKG-SSEKEGKLKYNMN-------DLLPYGKVGYEA 155 HRCILA+RS FF + F + G + G + + ++P VGYE Sbjct: 40 HRCILAARSLFFRKFFCGTDSPQPVTGIDPTQHGSVPASPTRGSTAPAGIIPVNSVGYEV 99 Query: 156 FLIFLGYVYTGKLKPSPME 212 FL+ L ++Y+G++ P + Sbjct: 100 FLLLLQFLYSGQVSIVPQK 118
>gi|4507183 ref|NP_003554.1| speckle-type POZ protein [Homo sapiens] >gi|11434423 ref|XP_008436.1| speckle-type POZ protein [Homo sapiens] >gi|8134708|sp|O43791|SPOP_HUMAN SPECKLE-TYPE POZ PROTEIN >gi|2695708|emb|CAA04199.1| (AJ000644) SPOP [Homo sapiens] >gi|12654851|gb|AAH01269.1|AAH01269 (BC001269) speckle-type POZ protein [Homo sapiens] Length = 374 Frame 3 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | 374 0 150 300 Plus Strand HSPs: Score = 82 (28.9 bits), Expect = 3.4, P = 0.97 Identities = 21/61 (34%), Positives = 32/61 (52%), Frame = +3 Query: 9 FSVHRCILASRSKFFHELFKREKGSSEKEGKLKYNMNDLLPYGKVGYEAFLIFLGYVYTG 188 F H+ ILA+RS F +F+ E S+K + +ND+ P E F + ++YTG Sbjct: 211 FQAHKAILAARSPVFSAMFEHEMEESKKN---RVEINDVEP------EVFKEMMCFIYTG 261 Query: 189 K 191 K Sbjct: 262 K 262
>gi|9964427 ref|NP_064895.1| AMV113 [Amsacta moorei entomopoxvirus] >gi|9944636|gb|AAG02819.1|AF250284_113 (AF250284) AMV113 [Amsacta moorei entomopoxvirus] Length = 67 Frame 1 hits (HSPs): ________________________________________ __________________________________________________ Database sequence: | | | | | 67 0 20 40 60 Plus Strand HSPs: Score = 62 (21.8 bits), Expect = 3.4, P = 0.97 Identities = 18/53 (33%), Positives = 27/53 (50%), Frame = +1 Query: 127 CLMARLDMKPSSYSLAMYILVNSSPLQWRCLLVLTM--FVLMMRVDLPLTLLL 279 C A L S+ ++I +NSS L CL+V++ +L+ LTLLL Sbjct: 3 CFSANLSNFIFCSSILLFICINSSILSSFCLIVISFSSIILVYLFSTSLTLLL 55
>gi|11359861|pir||JC7326 bood POZ containing protein - human Length = 478 Frame 3 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | 478 0 150 300 450 Plus Strand HSPs: Score = 83 (29.2 bits), Expect = 3.5, P = 0.97 Identities = 25/73 (34%), Positives = 35/73 (47%), Frame = +3 Query: 6 PFSVHRCILASRSKFFHELFKRE-KGSSEKEGKLKYNMNDLLPYGKVGYEAFLIFLGYVY 182 PF VHRC+L +RS +F + + KG S +L + + AF L Y+Y Sbjct: 125 PFRVHRCVLGARSAYFANMLDTKWKGKSVV----------VLRHPLINPVAFGALLQYLY 174 Query: 183 TGKLKPSPMEVSTC 224 TG+L VS C Sbjct: 175 TGRLDIGVEHVSDC 188
>gi|6679309 ref|NP_032861.1| per-hexamer repeat gene 4 [Mus musculus] >gi|141401|sp|P15974|PHX4_MOUSE PUTATIVE PER-HEXAMER REPEAT PROTEIN 4 >gi|90666|pir||S02186 hypothetical protein SP5 - mouse >gi|4803702|emb|CAB42649.1| (X12806) ORF (AA 1 - 95) [Mus musculus] Length = 95 Frame 2 hits (HSPs): ______________________________ __________________________________________________ Database sequence: | | | | | | 95 0 20 40 60 80 Plus Strand HSPs: Score = 65 (22.9 bits), Expect = 8.9, P = 1.0 Identities = 20/56 (35%), Positives = 30/56 (53%), Frame = +2 Query: 215 VYLC*QCLCS*CV*TC---H*LCC*VNVCLF-HFSNTRVGIT--FPET-ST*LYRKG 364 +Y+C CLC C C H LC V+VC + H T + + P++ ST L++ G Sbjct: 4 IYVCGVCLCV-CFSVCMCVHVLCVYVHVCTYAHIWTTALHLRCLLPKSFSTLLFKAG 59 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.98 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.324 0.140 0.408 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.373 0.171 0.773 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.346 0.150 0.541 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.360 0.160 0.609 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.352 0.158 0.542 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.336 0.144 0.457 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 141 140 10. 73 3 12 22 0.12 33 30 0.11 36 +2 0 141 140 10. 73 3 12 22 0.12 33 30 0.11 36 +1 0 142 141 10. 73 3 12 22 0.12 33 30 0.11 36 -1 0 142 141 10. 73 3 12 22 0.12 33 30 0.11 36 -2 0 141 141 10. 73 3 12 22 0.12 33 30 0.11 36 -3 0 141 140 10. 73 3 12 22 0.12 33 30 0.11 36 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 11 No. of states in DFA: 596 (59 KB) Total size of DFA: 186 KB (192 KB) Time to generate neighborhood: 0.01u 0.00s 0.01t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 143.98u 1.10s 145.08t Elapsed: 00:00:26 Total cpu time: 144.01u 1.12s 145.13t Elapsed: 00:00:27 Start: Wed Feb 6 14:17:39 2002 End: Wed Feb 6 14:18:06 2002
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000