WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker Server unavailable.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= B04G02.seq(1>488); (450 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 5 Sequences : less than 5 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 1154 269 |===================================================== 6310 885 155 |=============================== 3980 730 233 |============================================== 2510 497 136 |=========================== 1580 361 114 |====================== 1000 247 92 |================== 631 155 53 |========== 398 102 37 |======= 251 65 10 |== 158 55 15 |=== 100 40 5 |= 63.1 35 5 |= 39.8 30 2 |: 25.1 28 1 |: 15.8 27 2 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 25 <<<<<<<<<<<<<<<<< 10.0 25 3 |: 6.31 22 0 | 3.98 22 0 | 2.51 22 0 | 1.58 22 2 |: 1.00 20 0 | 0.63 20 0 | 0.40 20 0 | 0.25 20 0 | 0.16 20 0 | 0.10 20 0 | 0.063 20 0 | 0.040 20 0 | 0.025 20 0 | 0.016 20 0 | 0.010 20 0 | 0.0063 20 0 | 0.0040 20 0 | 0.0025 20 0 | 0.0016 20 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|9758410|dbj|BAB08952.1|(AB006700) lysine decarboxy... +2 361 4.2e-32 1 gi|11357791|pir||T45885hypothetical protein F4P12.150... +2 345 2.1e-30 1 gi|10140743|gb|AAG13575.1|AC037425_6(AC037425) unknow... +2 325 2.7e-28 1 gi|11358493|pir||T48554lysine decarboxylase-like prot... +2 284 6.0e-24 1 gi|11358492|pir||T48348lysine decarboxylase-like prot... +2 267 3.8e-22 1 gi|4371280|gb|AAD18138.1|(AC006260) hypothetical prot... +2 261 1.6e-21 1 gi|4510370|gb|AAD21458.1|(AC007017) unknown protein [... +2 254 9.1e-21 1 gi|9757778|dbj|BAB08387.1|(AB005240) lysine decarboxy... +2 253 1.2e-20 1 gi|12231051|sp|P48636|YDC3_PSEAEHYPOTHETICAL PROTEIN ... +2 207 8.7e-16 1 gi|7486909|pir||T04966hypothetical protein T12J5.60 -... +2 202 2.9e-15 1 gi|77664|pir||PQ0114hypothetical protein (azu region)... +2 197 1.0e-14 1 gi|7451099|pir||D70033conserved hypothetical protein ... +2 167 1.5e-11 1 gi|10175706|dbj|BAB06803.1|(AP001517) BH3084~unknown ... +2 165 2.4e-11 1 gi|11280345|pir||T45176conserved hypothetical protein... +2 144 4.1e-09 1 gi|10175367|dbj|BAB06465.1|(AP001516) lysine decarbox... +2 144 4.1e-09 1 gi|1169649|sp|P46378|FAS6_RHOFAHYPOTHETICAL 21.1 KD P... +2 132 7.7e-08 1 gi|4337446|gb|AAD18125.1|(U89166) ECORLD_ORF1; simila... +2 122 8.8e-07 1 gi|6322406ref|NP_012480.1| Yjl055wp [Saccharomyces ce... +2 124 1.2e-06 1 gi|10954698ref|NP_066633.1| similar to orf6 gene(unkn... +2 106 0.00013 1 gi|7451100|pir||C70609hypothetical protein Rv1205 - M... +2 103 0.0010 1 gi|7462102|pir||A72302conserved hypothetical protein ... +2 81 0.74 1 gi|12697620|dbj|BAB21615.1|(AB037974) cytochrome oxid... +2 41 0.75 2 gi|1196510|gb|AAA88231.1|(M15467) unknown protein [My... -1 76 0.998 1 gi|7479785|pir||T35807hypothetical protein SC8D9.03 S... +2 76 0.999 1 gi|7451672|pir||H70312hypothetical protein aq_134 - A... +2 74 0.9995 1
Use the and icons to retrieve links to Entrez:
>gi|9758410|dbj|BAB08952.1| (AB006700) lysine decarboxylase-like protein [Arabidopsis thaliana] Length = 217 Frame 2 hits (HSPs): ______________________ __________________________________________________ Database sequence: | | | | | | 217 0 50 100 150 200 Plus Strand HSPs: Score = 361 (127.1 bits), Expect = 4.2e-32, P = 4.2e-32 Identities = 68/93 (73%), Positives = 80/93 (86%), Frame = +2 Query: 170 MMKSSSRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVV 349 M ++ SRF+RICVFCG+S GK PSYQ AAIQL +LVER IDLVYGGGS+GLMGL+SQ V Sbjct: 1 MEETKSRFKRICVFCGSSSGKKPSYQEAAIQLGNELVERRIDLVYGGGSVGLMGLVSQAV 60 Query: 350 FDGGRHVLGVIPTTLMPREITGESVGEVESVGE 448 GGRHVLGVIP TLMPREITGE++GEV++V + Sbjct: 61 HHGGRHVLGVIPKTLMPREITGETIGEVKAVAD 93 >gi|11357791|pir||T45885 hypothetical protein F4P12.150 - Arabidopsis thaliana >gi|6729496|emb|CAB67652.1| (AL132966) putative protein [Arabidopsis thaliana] Length = 215 Frame 2 hits (HSPs): _______________________ __________________________________________________ Database sequence: | | | | | | 215 0 50 100 150 200 Plus Strand HSPs: Score = 345 (121.4 bits), Expect = 2.1e-30, P = 2.1e-30 Identities = 68/103 (66%), Positives = 81/103 (78%), Frame = +2 Query: 140 MEIEEQTMKMMMKSSSRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSI 319 ME+ +TM+ S+F RICVFCG+S GK SYQ AA+ L +LV RNIDLVYGGGSI Sbjct: 1 MEVNNETMQ-----KSKFGRICVFCGSSQGKKSSYQDAAVDLGNELVLRNIDLVYGGGSI 55 Query: 320 GLMGLISQVVFDGGRHVLGVIPTTLMPREITGESVGEVESVGE 448 GLMGL+SQ V DGGRHV+GVIP TLMPRE+TGE+VGEV +V + Sbjct: 56 GLMGLVSQAVHDGGRHVIGVIPKTLMPRELTGETVGEVRAVAD 98 >gi|10140743|gb|AAG13575.1|AC037425_6 (AC037425) unknown protein [Oryza sativa] Length = 204 Frame 2 hits (HSPs): _______________________ __________________________________________________ Database sequence: | | | | | | 204 0 50 100 150 200 Plus Strand HSPs: Score = 325 (114.4 bits), Expect = 2.7e-28, P = 2.7e-28 Identities = 63/88 (71%), Positives = 73/88 (82%), Frame = +2 Query: 185 SRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGR 364 SRF+RICVFCG+S GK SY AAI+L +LV R+IDLVYGGGSIGLMGL+SQ VFDGGR Sbjct: 4 SRFKRICVFCGSSQGKKRSYHDAAIELGNELVARSIDLVYGGGSIGLMGLVSQAVFDGGR 63 Query: 365 HVLGVIPTTLMPREITGESVGEVESVGE 448 HV+GVIP TLM EI+GE+VGEV V + Sbjct: 64 HVIGVIPKTLMTPEISGETVGEVRPVAD 91 >gi|11358493|pir||T48554 lysine decarboxylase-like protein - Arabidopsis thaliana >gi|7573362|emb|CAB87668.1| (AL163812) lysine decarboxylase-like protein [Arabidopsis thaliana] Length = 215 Frame 2 hits (HSPs): _____________________ __________________________________________________ Database sequence: | | | | | | 215 0 50 100 150 200 Plus Strand HSPs: Score = 284 (100.0 bits), Expect = 6.0e-24, P = 6.0e-24 Identities = 54/88 (61%), Positives = 69/88 (78%), Frame = +2 Query: 185 SRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGR 364 SRFR+ICVFCG+ G + AAI+L +LV+R IDLVYGGGS+GLMGLIS+ V++GG Sbjct: 7 SRFRKICVFCGSHSGHREVFSDAAIELGNELVKRKIDLVYGGGSVGLMGLISRRVYEGGL 66 Query: 365 HVLGVIPTTLMPREITGESVGEVESVGE 448 HVLG+IP LMP EI+GE+VG+V V + Sbjct: 67 HVLGIIPKALMPIEISGETVGDVRVVAD 94 >gi|11358492|pir||T48348 lysine decarboxylase-like protein - Arabidopsis thaliana >gi|7413604|emb|CAB86094.1| (AL163002) lysine decarboxylase-like protein [Arabidopsis thaliana] Length = 229 Frame 2 hits (HSPs): _________________________ __________________________________________________ Database sequence: | | | | | | 229 0 50 100 150 200 Plus Strand HSPs: Score = 267 (94.0 bits), Expect = 3.8e-22, P = 3.8e-22 Identities = 61/114 (53%), Positives = 73/114 (64%), Frame = +2 Query: 140 MEIEEQTMKMMMKSSSRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSI 319 ME EE +M K SSRF+ ICVFCG+S G SYQ AAI LAK+LV R IDLVYGGGSI Sbjct: 1 MENEEGKREMTKKQSSRFKSICVFCGSSNGNKASYQDAAIDLAKELVMRKIDLVYGGGSI 60 Query: 320 GLMGLISQVVFDGGRH-----------VLGVIPTTLMPREITGESVGEVESVGE 448 GLMGL+SQ V DGGRH + + ++TGE+VGEV+ V + Sbjct: 61 GLMGLVSQAVHDGGRHNNNNNGNDDALFCHSVNVSQTNSKLTGETVGEVKEVAD 114 >gi|4371280|gb|AAD18138.1| (AC006260) hypothetical protein [Arabidopsis thaliana] Length = 178 Frame 2 hits (HSPs): ________________________ __________________________________________________ Database sequence: | | | | | 178 0 50 100 150 Plus Strand HSPs: Score = 261 (91.9 bits), Expect = 1.6e-21, P = 1.6e-21 Identities = 58/103 (56%), Positives = 71/103 (68%), Frame = +2 Query: 140 MEIEEQTMKMMMKSSSRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSI 319 MEI+ ++M+ S+FRRICVFCG+S GK SYQ AA+ L +LV RNIDLVYGGGSI Sbjct: 1 MEIKGESMQ-----KSKFRRICVFCGSSQGKKSSYQDAAVDLGNELVSRNIDLVYGGGSI 55 Query: 320 GLMGLISQVVFDGGRHVLGVIPTTLMPREITGESVGEVESVGE 448 GLMGL+SQ V DGGRH +TGE+VGEV +V + Sbjct: 56 GLMGLVSQAVHDGGRH-------------LTGETVGEVRAVAD 85 >gi|4510370|gb|AAD21458.1| (AC007017) unknown protein [Arabidopsis thaliana] Length = 181 Frame 2 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | | 181 0 50 100 150 Plus Strand HSPs: Score = 254 (89.4 bits), Expect = 9.1e-21, P = 9.1e-21 Identities = 50/70 (71%), Positives = 57/70 (81%), Frame = +2 Query: 170 MMKSSSRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVV 349 M ++ SRFRRICVFCG+S G +Y AA+QLA QLVERNIDLVYGGGS+GLMGLISQ V Sbjct: 1 MEETKSRFRRICVFCGSSSGNKTTYHDAALQLAHQLVERNIDLVYGGGSVGLMGLISQAV 60 Query: 350 FDGGRHVLGV 379 DGGR V+ V Sbjct: 61 HDGGREVITV 70 >gi|9757778|dbj|BAB08387.1| (AB005240) lysine decarboxylase-like protein [Arabidopsis thaliana] Length = 220 Frame 2 hits (HSPs): ________________________ __________________________________________________ Database sequence: | | | | | | 220 0 50 100 150 200 Plus Strand HSPs: Score = 253 (89.1 bits), Expect = 1.2e-20, P = 1.2e-20 Identities = 57/105 (54%), Positives = 68/105 (64%), Frame = +2 Query: 167 MMMKSSSRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQV 346 M K SSRF+ ICVFCG+S G SYQ AAI LAK+LV R IDLVYGGGSIGLMGL+SQ Sbjct: 1 MTKKQSSRFKSICVFCGSSNGNKASYQDAAIDLAKELVMRKIDLVYGGGSIGLMGLVSQA 60 Query: 347 VFDGGRH-----------VLGVIPTTLMPREITGESVGEVESVGE 448 V DGGRH + + ++TGE+VGEV+ V + Sbjct: 61 VHDGGRHNNNNNGNDDALFCHSVNVSQTNSKLTGETVGEVKEVAD 105 >gi|12231051|sp|P48636|YDC3_PSEAE HYPOTHETICAL PROTEIN PA4923 >gi|11348257|pir||A83031 conserved hypothetical protein PA4923 [imported] - Pseudomonas aeruginosa (strain PAO1) >gi|9951201|gb|AAG08308.1|AE004905_6 (AE004905) conserved hypothetical protein [Pseudomonas aeruginosa] Length = 195 Frame 2 hits (HSPs): ______________________ __________________________________________________ Database sequence: | | | | | 195 0 50 100 150 Plus Strand HSPs: Score = 207 (72.9 bits), Expect = 8.7e-16, P = 8.7e-16 Identities = 38/83 (45%), Positives = 53/83 (63%), Frame = +2 Query: 194 RRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVL 373 R +CVFCG SPG +P YQ AA+ L + L ER + LVYGGG++GLMG ++ G V+ Sbjct: 4 RSVCVFCGASPGASPVYQEAAVALGRHLAERGLTLVYGGGAVGLMGTVADAALAAGGEVI 63 Query: 374 GVIPTTLMPREITGESVGEVESV 442 G+IP +L EI + + +E V Sbjct: 64 GIIPQSLQEAEIGHKGLTRLEVV 86 >gi|7486909|pir||T04966 hypothetical protein T12J5.60 - Arabidopsis thaliana >gi|4455345|emb|CAB36726.1| (AL035522) putative protein [Arabidopsis thaliana] >gi|7270471|emb|CAB80236.1| (AL161587) putative protein [Arabidopsis thaliana] Length = 268 Frame 2 hits (HSPs): _____________________ __________________________________________________ Database sequence: | | | | | | | 268 0 50 100 150 200 250 Plus Strand HSPs: Score = 202 (71.1 bits), Expect = 2.9e-15, P = 2.9e-15 Identities = 49/104 (47%), Positives = 62/104 (59%), Frame = +2 Query: 185 SRFRRICVFCGTSPGKNPSYQLAAIQLAKQLVE----------------RNIDLVYGGGS 316 SRF+R+CVFCG+S GK Y AA LA++LV R ++LVYGGGS Sbjct: 6 SRFKRVCVFCGSSSGKRECYSDAATDLAQELVRLCLNLNESLENLKWVTRRLNLVYGGGS 65 Query: 317 IGLMGLISQVVFDGGRHVLGVIPTTLMPREITGESVGEVESVGE 448 IGLMGL+SQ V + G HVLG + ITGE+ GEV +V + Sbjct: 66 IGLMGLVSQAVHEAGGHVLGYAQIYDLFTLITGETYGEVIAVAD 109 >gi|77664|pir||PQ0114 hypothetical protein (azu region) - Pseudomonas aeruginosa (fragment) >gi|7251673|gb|AAA25729.2| (M30389) ORF1 [Pseudomonas aeruginosa] Length = 71 Frame 2 hits (HSPs): _______________________________________________ __________________________________________________ Database sequence: | | | | | 71 0 20 40 60 Plus Strand HSPs: Score = 197 (69.3 bits), Expect = 1.0e-14, P = 1.0e-14 Identities = 34/67 (50%), Positives = 46/67 (68%), Frame = +2 Query: 194 RRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVL 373 R +CVFCG SPG +P YQ AA+ L + L ER + LVYGGG++GLMG ++ G V+ Sbjct: 4 RSVCVFCGASPGASPVYQEAAVALGRHLAERGLTLVYGGGAVGLMGTVADAALAAGSEVI 63 Query: 374 GVIPTTL 394 G+IP +L Sbjct: 64 GIIPQSL 70 >gi|7451099|pir||D70033 conserved hypothetical protein yvdD - Bacillus subtilis >gi|1945663|emb|CAB08033.1| (Z94043) hypothetical protein [Bacillus subtilis] >gi|2635977|emb|CAB15469.1| (Z99121) similar to hypothetical proteins [Bacillus subtilis] Length = 191 Frame 2 hits (HSPs): ______________________ __________________________________________________ Database sequence: | | | | | 191 0 50 100 150 Plus Strand HSPs: Score = 167 (58.8 bits), Expect = 1.5e-11, P = 1.5e-11 Identities = 31/83 (37%), Positives = 51/83 (61%), Frame = +2 Query: 194 RRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVL 373 + ICVF G++PG N +Y+ A +L + E+ I LVYGG +GLMG I+ + + G + Sbjct: 2 KTICVFAGSNPGGNEAYKRKAAELGVYMAEQGIGLVYGGSRVGLMGTIADAIMENGGTAI 61 Query: 374 GVIPTTLMPREITGESVGEVESV 442 GV+P+ L E+ +++ E+ V Sbjct: 62 GVMPSGLFSGEVVHQNLTELIEV 84 >gi|10175706|dbj|BAB06803.1| (AP001517) BH3084~unknown conserved protein [Bacillus halodurans] Length = 187 Frame 2 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | | 187 0 50 100 150 Plus Strand HSPs: Score = 165 (58.1 bits), Expect = 2.4e-11, P = 2.4e-11 Identities = 33/72 (45%), Positives = 46/72 (63%), Frame = +2 Query: 197 RICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVLG 376 +I VFCG+S G + Y+ A QL K+L R I LVYGG S+G+MG ++ V + G V+G Sbjct: 2 KIAVFCGSSNGASDVYKEGARQLGKELARRGITLVYGGASVGIMGAVADSVLEAGGEVIG 61 Query: 377 VIPTTLMPREIT 412 V+P L EI+ Sbjct: 62 VMPRFLEEPEIS 73 >gi|11280345|pir||T45176 conserved hypothetical protein ctf [imported] - Mycobacterium leprae >gi|699154|gb|AAA62920.1| (U15180) ctf [Mycobacterium leprae] Length = 187 Frame 2 hits (HSPs): _____________________ __________________________________________________ Database sequence: | | | | | 187 0 50 100 150 Plus Strand HSPs: Score = 144 (50.7 bits), Expect = 4.1e-09, P = 4.1e-09 Identities = 31/78 (39%), Positives = 43/78 (55%), Frame = +2 Query: 200 ICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVLGV 379 ICVFC P +LAA +L + + ER LV+GGG + MG ++ + G ++GV Sbjct: 13 ICVFCAAGPMHPELLELAA-ELGEAIAERGWTLVWGGGRVSAMGAVASAAWTRGGRIVGV 71 Query: 380 IPTTLMPREITGESVGEV 433 IP L REI VGE+ Sbjct: 72 IPEMLQRREIADTYVGEL 89 >gi|10175367|dbj|BAB06465.1| (AP001516) lysine decarboxylase [Bacillus halodurans] Length = 190 Frame 2 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | | 190 0 50 100 150 Plus Strand HSPs: Score = 144 (50.7 bits), Expect = 4.1e-09, P = 4.1e-09 Identities = 26/63 (41%), Positives = 43/63 (68%), Frame = +2 Query: 197 RICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVLG 376 +IC+F G+S G++P Y L +Q+ ++ +++YGGG+ GLMG+++Q V D G V G Sbjct: 2 KICLFSGSSLGQHPIYAEQVRALGEQIGKQGWEVIYGGGNAGLMGVLAQSVLDNGGRVTG 61 Query: 377 VIP 385 +IP Sbjct: 62 IIP 64 >gi|1169649|sp|P46378|FAS6_RHOFA HYPOTHETICAL 21.1 KD PROTEIN IN FASCIATION LOCUS (ORF6) >gi|1076047|pir||F55578 hypothetical protein 2 (ipt 3' region) - Rhodococcus fascians plasmid pFiD188 >gi|455006|emb|CAA82746.1| (Z29635) orf6 [Rhodococcus fascians] Length = 198 Frame 2 hits (HSPs): _________________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | | 198 0 50 100 150 __________________ Annotated Domains: DOMO DM06442: 1..197 PRODOM PD005712: YJF5(2) 20..160 PRODOM PD090879: FAS6_RHOFA 162..197 __________________ Plus Strand HSPs: Score = 132 (46.5 bits), Expect = 7.7e-08, P = 7.7e-08 Identities = 24/64 (37%), Positives = 35/64 (54%), Frame = +2 Query: 194 RRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVL 373 + + VFCG PG+ Y A + + + + LVYGG +GLMG ++ D G V+ Sbjct: 20 KSVTVFCGAMPGRGTKYGQLAEGMGRAIARSKLRLVYGGARVGLMGTLANAALDSGGTVV 79 Query: 374 GVIP 385 GVIP Sbjct: 80 GVIP 83 >gi|4337446|gb|AAD18125.1| (U89166) ECORLD_ORF1; similar to the Pseudomonas aeruginosa ORF upstream of the Azu gene and to the Rhodococcus fascians fas operon ORF6 protein, encoded by GenBank Accession Numbers M30388 and Z29635, respectively [Eikenella corrodens] Length = 183 Frame 2 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | 183 0 50 100 150 Plus Strand HSPs: Score = 122 (42.9 bits), Expect = 8.8e-07, P = 8.8e-07 Identities = 26/57 (45%), Positives = 33/57 (57%), Frame = +2 Query: 239 SYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVLGVIPTTLMPREI 409 SY AA +L + + ER LVYGGG IGLMG ++ G V G+IPT L E+ Sbjct: 1 SYSQAARELGRAIAERGSRLVYGGGGIGLMGEVASAALAAGGKVTGIIPTFLRHEEM 57 >gi|6322406 ref|NP_012480.1| Yjl055wp [Saccharomyces cerevisiae] >gi|1352984|sp|P47044|YJF5_YEAST HYPOTHETICAL 26.9 KD PROTEIN IN BTN1-PEP8 INTERGENIC REGION >gi|1077814|pir||S56827 conserved hypothetical protein YJL055w - yeast (Saccharomyces cerevisiae) >gi|1008195|emb|CAA89346.1| (Z49330) ORF YJL055w [Saccharomyces cerevisiae] Length = 245 Frame 2 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | | | | 245 0 50 100 150 200 Plus Strand HSPs: Score = 124 (43.7 bits), Expect = 1.2e-06, P = 1.2e-06 Identities = 30/82 (36%), Positives = 44/82 (53%), Frame = +2 Query: 194 RRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVF--DGGRH 367 + +CV+CG+S G Y +A +L + LVYGGG+ GLMG I++ D Sbjct: 19 KSVCVYCGSSFGAKALYSESAEELGALFHKLGWKLVYGGGTTGLMGKIARSTMGPDLSGQ 78 Query: 368 VLGVIPTTLMPREITGESVGEV 433 V G+IP L+ +E T E +V Sbjct: 79 VHGIIPNALVSKERTDEDKEDV 100 >gi|10954698 ref|NP_066633.1| similar to orf6 gene(unknown,in P450 operon) in Rhodococcus fascians [Agrobacterium rhizogenes] >gi|8918698|dbj|BAA97763.1| (AB039932) similar to orf6 gene in Rhodococcus fascians [Agrobacterium rhizogenes] >gi|10567362|dbj|BAB16171.1| (AP002086) similar to orf6 gene(unknown,in P450 operon) in Rhodococcus fascians [Agrobacterium rhizogenes] Length = 169 Frame 2 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | | 169 0 50 100 150 Plus Strand HSPs: Score = 106 (37.3 bits), Expect = 0.00013, P = 0.00013 Identities = 23/56 (41%), Positives = 30/56 (53%), Frame = +2 Query: 275 LVERNIDLVYGGGSIGLMGLISQVVFDGGRHVLGVIPTTLMPREITGESVGEVESV 442 + I LVYGG SIGLMG I+ G V+GVIP L +EI + ++ V Sbjct: 1 MARSGIGLVYGGASIGLMGAIADAARSDGGEVIGVIPRALAEKEIAHTDLADLRVV 56 >gi|7451100|pir||C70609 hypothetical protein Rv1205 - Mycobacterium tuberculosis (strain H37RV) >gi|1929079|emb|CAB07828.1| (Z93777) hypothetical protein Rv1205 [Mycobacterium tuberculosis] Length = 187 Frame 2 hits (HSPs): _____________________ __________________________________________________ Database sequence: | | | | | 187 0 50 100 150 Plus Strand HSPs: Score = 103 (36.3 bits), Expect = 0.0010, P = 0.0010 Identities = 24/78 (30%), Positives = 38/78 (48%), Frame = +2 Query: 200 ICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVLGV 379 + V+C SP +LAA ++ + R LV+GGG + MG ++ G +GV Sbjct: 13 VAVYCAASPTHAELLELAA-EVGAAIAGRGWTLVWGGGHVSAMGAVASAARACGGWTVGV 71 Query: 380 IPTTLMPREITGESVGEV 433 IP L+ RE+ E+ Sbjct: 72 IPKMLVYRELADHDADEL 89 >gi|7462102|pir||A72302 conserved hypothetical protein - Thermotoga maritima (strain MSB8) >gi|4981597|gb|AAD36132.1|AE001765_11 (AE001765) conserved hypothetical protein [Thermotoga maritima] Length = 171 Frame 2 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | | 171 0 50 100 150 Plus Strand HSPs: Score = 81 (28.5 bits), Expect = 1.4, P = 0.74 Identities = 22/66 (33%), Positives = 39/66 (59%), Frame = +2 Query: 194 RRICVFCGTSP-GKNPSYQLAAI--QLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGR 364 +++ V + P K+P +L I +L + L ++ LV+ GG G+M L+SQ V + G Sbjct: 2 KKVVVVGYSGPVNKSPVSELRDICLELGRTLAKKGY-LVFNGGRDGVMELVSQGVREAGG 60 Query: 365 HVLGVIP 385 V+G++P Sbjct: 61 TVVGILP 67 >gi|12697620|dbj|BAB21615.1| (AB037974) cytochrome oxidase subunit I [Thalassiosira nordenskioeldii] Length = 57 Frame 2 hits (HSPs): ___________ ________________________ __________________________________________________ Database sequence: | | | | 57 0 20 40 Plus Strand HSPs: Score = 41 (14.4 bits), Expect = 1.4, Sum P(2) = 0.75 Identities = 7/13 (53%), Positives = 9/13 (69%), Frame = +2 Query: 47 FFPHPSLYVHIHP 85 FF HP +Y+ I P Sbjct: 1 FFGHPEVYILILP 13 Score = 40 (14.1 bits), Expect = 1.4, Sum P(2) = 0.75 Identities = 9/27 (33%), Positives = 16/27 (59%), Frame = +2 Query: 257 IQLAKQLVERNIDLVYGGGSIGLMGLI 337 + AK+ + + +VY SIG++G I Sbjct: 23 VSTAKKPIFGYLGMVYAMFSIGVLGFI 49 >gi|1196510|gb|AAA88231.1| (M15467) unknown protein [Mycobacterium tuberculosis] Length = 175 Frame -1 hits (HSPs): ___________________ __________________________________________________ Database sequence: | | | | | 175 0 50 100 150 Minus Strand HSPs: Score = 76 (26.8 bits), Expect = 6.3, P = 1.0 Identities = 22/64 (34%), Positives = 32/64 (50%), Frame = -1 Query: 447 SPTLSTSPTLSPVISLG-IRVVGITPN-TWRPPSNTTCEIKPINPMLPPP*TKSMFLSTS 274 +P S T S S+ +R G+ P T R PS T + ++P P T S+FL+TS Sbjct: 40 APLSRVSVTFSTAFSMPRLRPSGLAPAATLRRPSRTNAWASTVAVVVPSPATSSVFLATS 99 Query: 273 CLAS 262 +S Sbjct: 100 LTSS 103 >gi|7479785|pir||T35807 hypothetical protein SC8D9.03 SC8D9.03 - Streptomyces coelicolor >gi|4467242|emb|CAB37567.1| (AL035569) SC8D9.03, unknown, len: 182aa; similar to many of undefined function eg. TR:Q49952 (EMBL:U15180) hypothetical protein from Mycobacterium leprae (187 aa) fasta scores; opt: 331, z-score: 394.0, E(): 1.2e-14, (36.1% identity in 166 aa overlap) Length = 182 Frame 2 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | | 182 0 50 100 150 Plus Strand HSPs: Score = 76 (26.8 bits), Expect = 6.8, P = 1.0 Identities = 20/60 (33%), Positives = 32/60 (53%), Frame = +2 Query: 200 ICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVLGV 379 ICVF + + Y A + A+ L + LV+GG +GLM +++ V + G +LGV Sbjct: 6 ICVFLSAAD-LDEHYTRPAKEFAELLGKGGHTLVWGGSDVGLMKVVADGVQESGGKLLGV 64 >gi|7451672|pir||H70312 hypothetical protein aq_134 - Aquifex aeolicus >gi|2982880|gb|AAC06500.1| (AE000675) putative protein [Aquifex aeolicus] Length = 151 Frame 2 hits (HSPs): _____________________ __________________________________________________ Database sequence: | | | || 151 0 50 100 150 Plus Strand HSPs: Score = 74 (26.0 bits), Expect = 7.6, P = 1.0 Identities = 19/65 (29%), Positives = 37/65 (56%), Frame = +2 Query: 194 RRICVFCGTSPGKNPSYQLAAIQLAKQLVERNIDLVYGGGSIGLMGLISQVVFDGGRHVL 373 R++ V G+S Y+ A +L K+L +RN+ +V GG + G+M + + + G + Sbjct: 2 RQVSVI-GSSKASEEEYEFA-YRLGKELAKRNLVVVCGGRT-GVMEAVCKGAKEEGGLTI 58 Query: 374 GVIPT 388 G++P+ Sbjct: 59 GIMPS 63 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.98 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.358 0.155 0.599 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.335 0.148 0.458 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.340 0.148 0.550 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.341 0.150 0.525 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.347 0.151 0.548 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.351 0.154 0.571 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 149 148 10. 74 3 12 22 0.091 34 30 0.11 36 +2 0 149 148 10. 74 3 12 22 0.091 34 30 0.11 36 +1 0 150 149 10. 74 3 12 22 0.092 34 30 0.12 36 -1 0 150 149 10. 74 3 12 22 0.092 34 30 0.12 36 -2 0 149 149 10. 74 3 12 22 0.092 34 30 0.12 36 -3 0 149 148 10. 74 3 12 22 0.091 34 30 0.11 36 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 25 No. of states in DFA: 591 (58 KB) Total size of DFA: 203 KB (256 KB) Time to generate neighborhood: 0.02u 0.00s 0.02t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 164.72u 1.14s 165.86t Elapsed: 00:00:28 Total cpu time: 164.75u 1.20s 165.95t Elapsed: 00:00:28 Start: Fri Feb 1 22:11:31 2002 End: Fri Feb 1 22:11:59 2002
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000