WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= 'D17H01_O13_15.ab1' (585 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 5 Sequences : less than 5 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 1059 250 |================================================== 6310 809 188 |===================================== 3980 621 130 |========================== 2510 491 156 |=============================== 1580 335 105 |===================== 1000 230 63 |============ 631 167 27 |===== 398 140 48 |========= 251 92 28 |===== 158 64 11 |== 100 53 10 |== 63.1 43 8 |= 39.8 35 6 |= 25.1 29 6 |= 15.8 23 1 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 22 <<<<<<<<<<<<<<<<< 10.0 22 1 |: 6.31 21 1 |: 3.98 20 0 | 2.51 20 2 |: 1.58 18 1 |: 1.00 17 2 |: 0.63 15 0 | 0.40 15 3 |: 0.25 12 1 |: 0.16 11 1 |: 0.10 10 0 | 0.063 10 2 |: 0.040 8 1 |: 0.025 7 0 | 0.016 7 1 |: 0.010 6 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|11283525|pir||T47912hypothetical protein T20K12.10... +3 333 3.9e-29 1 gi|1903364|gb|AAB70447.1|(AC000104) EST gb|T45093 com... +3 177 1.3e-12 1 gi|4210351|emb|CAA21139.1|(AL031775) dJ30M3.1 (novel ... +3 111 1.7e-05 1 gi|8923812ref|NP_060943.1| uncharacterized hypothalam... +3 111 1.7e-05 1 gi|465883|sp|P34419|YLZ6_CAEELHYPOTHETICAL 20.1 KD PR... +3 115 0.00016 1 gi|7484139|pir||S74052hypothetical protein c0116 - Su... +3 96 0.0085 1 gi|7496519|pir||T15630hypothetical protein C25H3.3 - ... +3 105 0.011 1 gi|7630243|dbj|BAA94776.1|(AP001859) hypothetical pro... +3 97 0.025 1 gi|7630244|dbj|BAA94777.1|(AP001859) Similar to Arabi... +3 96 0.040 1 gi|7491588|pir||T40205hypothetical protein SPBC31F10.... +3 94 0.052 1 gi|11350388|pir||E82995hypothetical protein PA5202 [i... +3 87 0.11 1 gi|11499845ref|NP_071089.1| conserved hypothetical pr... +3 89 0.18 1 gi|5733877|gb|AAD49765.1|AC007932_13(AC007932) F11A17... +3 88 0.25 1 gi|7292257|gb|AAF47666.1|(AE003475) CG16985 gene prod... +3 87 0.27 1 gi|11350055|pir||A83149hypothetical protein PA3971 [i... +3 86 0.29 1 gi|7473619|pir||E75289probable phenylacetic acid degr... +3 84 0.51 1 gi|7292258|gb|AAF47667.1|(AE003475) CG16986 gene prod... +3 83 0.58 1 gi|141254|sp|P20378|YPHR_HALHAHYPOTHETICAL 15.6 KD PR... +3 82 0.77 1 gi|7471259|pir||E75467ComA-related protein - Deinococ... +3 77 0.86 1 gi|3980377|gb|AAC95180.1|(AC004561) unknown protein [... +3 81 0.90 1 gi|11350298|pir||B83042hypothetical protein PA4830 [i... +3 79 0.997 1 gi|10177184|dbj|BAB10318.1|(AB017061) gb|AAD49765.1~g... +3 77 0.9992 1
Use the and icons to retrieve links to Entrez:
>gi|11283525|pir||T47912 hypothetical protein T20K12.100 - Arabidopsis thaliana >gi|6850887|emb|CAB71050.1| (AL137898) putative protein [Arabidopsis thaliana] Length = 188 Frame 3 hits (HSPs): ___________________________________________ __________________________________________________ Database sequence: | | | | | 188 0 50 100 150 Plus Strand HSPs: Score = 333 (117.2 bits), Expect = 3.9e-29, P = 3.9e-29 Identities = 68/161 (42%), Positives = 107/161 (66%), Frame = +3 Query: 6 ISKEVDPSHASETLRIVNAMGAATPIPANCNARGFYDAFLRSF---IKVDHIQRGRISCT 176 +SK +DP++ L + + A +P +CN +D+F F + I RGR+SC+ Sbjct: 29 VSKVIDPNYV---LMVADFFKAISP-DESCNDFTSFDSFSVLFQNNTRALSIARGRVSCS 84 Query: 177 VVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEISISYLSATPANEEV 356 V P I N + LHGG+V S+ E ++ AC +TVV++DK LF+GE+S+SYLS+ P + E+ Sbjct: 85 VTVTPGISNFFKGLHGGAVASIAERVAMACVKTVVSEDKHLFIGELSMSYLSSAPISSEL 144 Query: 357 LANASVVKTGRNLTVVAVEFKLKKAGNLLYITHSTFYNMPVASL 488 L +VV+TGRNL+VV VEFK+K+ + Y++ +TFY+ P++ L Sbjct: 145 LVEGTVVRTGRNLSVVTVEFKIKETMKVTYLSRATFYHSPISKL 188 >gi|1903364|gb|AAB70447.1| (AC000104) EST gb|T45093 comes from this gene. [Arabidopsis thaliana] Length = 155 Frame 3 hits (HSPs): _____________________________________________ __________________________________________________ Database sequence: | | | | | 155 0 50 100 150 Plus Strand HSPs: Score = 177 (62.3 bits), Expect = 1.3e-12, P = 1.3e-12 Identities = 44/140 (31%), Positives = 76/140 (54%), Frame = +3 Query: 69 AATPIPANCNARGFYDAFLRSFIKVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVE 248 A P+ A R F + F+ + +KVD I+ GRI C++ P + N LHGG+ +LV+ Sbjct: 18 AKEPMVAKLPHR-FLERFVTNGLKVDLIEPGRIVCSMKIPPHLLNAGKFLHGGATATLVD 76 Query: 249 ILSNACARTVVAKDKELFLGEISISYLSATPANEEVLANASVVKTGRNLTVVAVEFKLKK 428 ++ +A T A + + EI++SYL A +EE+ + ++ G+ + VV+VE + K Sbjct: 77 LIGSAVIYTAGASHSGVSV-EINVSYLDAAFLDEEIEIESKALRVGKAVAVVSVELRKKT 135 Query: 429 AGNLLYITHSTFYNMPVASL 488 G ++ T Y P ++L Sbjct: 136 TGKIIAQGRHTKYFAPRSNL 155 >gi|4210351|emb|CAA21139.1| (AL031775) dJ30M3.1 (novel protein similar to (predicted) plant, worm, yeast and archaea bacterial proteins) [Homo sapiens] Length = 113 Frame 3 hits (HSPs): __________________________________________ __________________________________________________ Database sequence: | | | | 113 0 50 100 Plus Strand HSPs: Score = 111 (39.1 bits), Expect = 1.7e-05, P = 1.7e-05 Identities = 26/95 (27%), Positives = 47/95 (49%), Frame = +3 Query: 159 GRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEISISYLSAT 338 G++ C + + N GTLHGG +LV+ +S A + +++I+Y+S Sbjct: 9 GKVICEMKVEEEHTNAIGTLHGGLTATLVDNISTM-ALLCTERGAPGVSVDMNITYMSPA 67 Query: 339 PANEEVLANASVVKTGRNLTVVAVEFKLKKAGNLL 443 E+++ A V+K G+ L +V+ K G L+ Sbjct: 68 KLGEDIVITAHVLKQGKTLAFTSVDLTNKATGKLI 102 >gi|8923812 ref|NP_060943.1| uncharacterized hypothalamus protein HT012 [Homo sapiens] >gi|11418468 ref|XP_004262.1| uncharacterized hypothalamus protein HT012 [Homo sapiens] >gi|7020647|dbj|BAA91215.1| (AK000508) unnamed protein product [Homo sapiens] >gi|7677052|gb|AAF67006.1|AF155649_1 (AF155649) hypothetical 15 kDa protein [Homo sapiens] >gi|7689023|gb|AAF67651.1|AF220186_1 (AF220186) uncharacterized hypothalamus protein HT012 [Homo sapiens] >gi|12654153|gb|AAH00894.1|AAH00894 (BC000894) uncharacterized hypothalamus protein HT012 [Homo sapiens] Length = 140 Frame 3 hits (HSPs): __________________________________ __________________________________________________ Database sequence: | | | | 140 0 50 100 Plus Strand HSPs: Score = 111 (39.1 bits), Expect = 1.7e-05, P = 1.7e-05 Identities = 26/95 (27%), Positives = 47/95 (49%), Frame = +3 Query: 159 GRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEISISYLSAT 338 G++ C + + N GTLHGG +LV+ +S A + +++I+Y+S Sbjct: 36 GKVICEMKVEEEHTNAIGTLHGGLTATLVDNISTM-ALLCTERGAPGVSVDMNITYMSPA 94 Query: 339 PANEEVLANASVVKTGRNLTVVAVEFKLKKAGNLL 443 E+++ A V+K G+ L +V+ K G L+ Sbjct: 95 KLGEDIVITAHVLKQGKTLAFTSVDLTNKATGKLI 129 >gi|465883|sp|P34419|YLZ6_CAEEL HYPOTHETICAL 20.1 KD PROTEIN F42H10.6 IN CHROMOSOME III >gi|1078863|pir||S44652 f42h10.6 protein - Caenorhabditis elegans >gi|289680|gb|AAA28024.1| (L08403) putative [Caenorhabditis elegans] Length = 184 Frame 3 hits (HSPs): _____________________ Annotated Domains: __________________________________________ __________________________________________________ Database sequence: | | | | | 184 0 50 100 150 __________________ Annotated Domains: PRODOM PD006741: Q18187(2) 14..165 __________________ Plus Strand HSPs: Score = 115 (40.5 bits), Expect = 0.00016, P = 0.00016 Identities = 26/77 (33%), Positives = 41/77 (53%), Frame = +3 Query: 210 GTLHGGSVGSLVEILSNACARTVVAKDKELFLGEISISYLSATPANEEVLANASVVKTGR 389 GTLHGG +L ++++ A A V KDK + E+++SYL + + A V+K GR Sbjct: 84 GTLHGGQTATLTDVIT-ARAVGVTVKDKGMASVELAVSYLLPVKVGDVLEITAHVLKVGR 142 Query: 390 NLTVVAVEFKLKKAGNL 440 + EF+ K G + Sbjct: 143 TMAFTDCEFRRKSDGKM 159 >gi|7484139|pir||S74052 hypothetical protein c0116 - Sulfolobus solfataricus >gi|1707746|emb|CAA69466.1| (Y08256) orf c01016 [Sulfolobus solfataricus] >gi|12313068|emb|CAC23784.1| (AL512975) ORF-c01_016 [Sulfolobus solfataricus] Length = 140 Frame 3 hits (HSPs): __________________________________ __________________________________________________ Database sequence: | | | | 140 0 50 100 Plus Strand HSPs: Score = 96 (33.8 bits), Expect = 0.0085, P = 0.0085 Identities = 28/95 (29%), Positives = 46/95 (48%), Frame = +3 Query: 135 IKVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVV-AKDKELFLGE 311 +KV ++++GR + K R G LHGG + S ++I A TV A D+ E Sbjct: 24 VKVINLEKGRAVVEIPYKEEFTRRGGVLHGGIIMSAIDITGGLAALTVNDAMDQ--VTQE 81 Query: 312 ISISYLSATPANEEVLANASVVKTGRNLTVVAVEFK 419 + I++L + V++ G + VV +EFK Sbjct: 82 LKINFLEPMYKGPFTI-EGKVLRKGSTVIVVEIEFK 116 >gi|7496519|pir||T15630 hypothetical protein C25H3.3 - Caenorhabditis elegans >gi|868253|gb|AAA68782.1| (U29535) C25H3.3 gene product [Caenorhabditis elegans] Length = 273 Frame 3 hits (HSPs): __________________ _____________________ __________________________________________________ Database sequence: | | | | | | | 273 0 50 100 150 200 250 Plus Strand HSPs: Score = 105 (37.0 bits), Expect = 0.011, P = 0.011 Identities = 28/114 (24%), Positives = 56/114 (49%), Frame = +3 Query: 111 YDAFLRSFIKVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKD 290 Y A R+ ++ H + G + + N++ TLHGG +L++ + ++ K+ Sbjct: 154 YAAGARN-VRAVHAEEGNLRVEFEVEKDQTNQFETLHGGCTAALIDCFTTGAL--LLTKE 210 Query: 291 KELFLG-EISISYLSATPANEEVLANASVVKTGRNLTVVAVEFKLKKAGNLLYIT 452 + ++ I+YL+A E ++ N++V+K GR+L E +K N + T Sbjct: 211 ARPGVSVDLHITYLTAANIGETLVLNSTVIKQGRSLGFTKAEL-YRKRDNAMIAT 264 Score = 96 (33.8 bits), Expect = 0.13, P = 0.13 Identities = 21/97 (21%), Positives = 45/97 (46%), Frame = +3 Query: 135 IKVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEI 314 ++ H + G + + N + TLHGG +L++I + A + + ++ Sbjct: 31 VRAVHAEEGNLRVEFEVEKDQSNHFNTLHGGCTSTLIDIFTTG-ALLLTKPARPGVSVDL 89 Query: 315 SISYLSATPANEEVLANASVVKTGRNLTVVAVEFKLK 425 ++YL+A E ++ +++V+K G+ L E K Sbjct: 90 HVTYLTAAKIGETLVLDSTVIKQGKTLAFTKAELYRK 126 >gi|7630243|dbj|BAA94776.1| (AP001859) hypothetical protein [Oryza sativa] Length = 167 Frame 3 hits (HSPs): ___________________________________ __________________________________________________ Database sequence: | | | | | 167 0 50 100 150 Plus Strand HSPs: Score = 97 (34.1 bits), Expect = 0.026, P = 0.025 Identities = 28/116 (24%), Positives = 55/116 (47%), Frame = +3 Query: 102 RGFYDAFLRSFIKVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVV 281 R ++A + +V + GR C++ + + G H G++ + + + CA ++ Sbjct: 35 RRAFNALPLAGARVSLAEAGRAVCSLRVTAELTDAEGNWHPGAIAAAAD---DVCAAAIM 91 Query: 282 AKDKELFLG-EISISYLSATPANEEVLANASVVKTGRNLTVVAVEFKLKKAGNLLYI 449 + + + + ISY S +EEV + VV+ +T V VE + K +G L+ I Sbjct: 92 SVEGIIKVSVHYDISYFSPAKLHEEVELDGRVVEQKGKMTAVTVEIRKKDSGELVAI 148 >gi|7630244|dbj|BAA94777.1| (AP001859) Similar to Arabidopsis thaliana chromosome 1 BAC F19P19; unknown protein (AC000104) [Oryza sativa] Length = 174 Frame 3 hits (HSPs): ___________________________________________ __________________________________________________ Database sequence: | | | | | 174 0 50 100 150 Plus Strand HSPs: Score = 96 (33.8 bits), Expect = 0.041, P = 0.040 Identities = 39/147 (26%), Positives = 61/147 (41%), Frame = +3 Query: 24 PSHASETLRIVNAMGAATPIPANCNAR----GFYDAF---LRSFIKVDHIQRGRISCTVV 182 P+ A+ LR+ P + AR G DAF + +V + GR+ C+ Sbjct: 6 PAAAAAALRLAAVARRWLENPRDSLARSREEGCGDAFNTVVMPGFRVSLAEPGRLVCSFC 65 Query: 183 AKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDK-ELFLGEISISYLSATPANEEVL 359 + + G H G++ + V+ N CA V D F ++S+ S EEV Sbjct: 66 VPAAVADADGRWHAGAMAAAVD---NLCAAVVYTADGVHRFTISQAMSFFSPAAHGEEVE 122 Query: 360 ANASVVKTGRNLTVVAVEFKLKKAGNLLYI 449 + V LT VE + K +G L+ I Sbjct: 123 MDGRVAHRKGKLTAAVVEVRRKASGELVAI 152 >gi|7491588|pir||T40205 hypothetical protein SPBC31F10.02 - fission yeast (Schizosaccharomyces pombe) >gi|2226413|emb|CAB10079.1| (Z97204) hypothetical protein [Schizosaccharomyces pombe] Length = 161 Frame 3 hits (HSPs): __________________________________ __________________________________________________ Database sequence: | | | | | 161 0 50 100 150 Plus Strand HSPs: Score = 94 (33.1 bits), Expect = 0.053, P = 0.052 Identities = 29/107 (27%), Positives = 53/107 (49%), Frame = +3 Query: 96 NARGFYDAFLRSFIKVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACART 275 N GF DA + S I++ G + C++ + NR G LHGG + +L ++ + Sbjct: 23 NTNGF-DAHVVSDIQIISAVPGFVECSLKLQKHHLNRMGNLHGGCIAALTDL-----GGS 76 Query: 276 VVAKDKELFLGEISI----SYL-SATPANEEVLANASVVKTGRNLTVVAVEF 416 + + LF+ +SI ++L S +L +A + G N+ +V+F Sbjct: 77 LALASRGLFISGVSIDMNQTFLQSGGTLGSSILLHAKCDRLGSNIAFTSVDF 128 >gi|11350388|pir||E82995 hypothetical protein PA5202 [imported] - Pseudomonas aeruginosa (strain PAO1) >gi|9951508|gb|AAG08587.1|AE004933_3 (AE004933) hypothetical protein [Pseudomonas aeruginosa] Length = 129 Frame 3 hits (HSPs): _____________________________ __________________________________________________ Database sequence: | | | | 129 0 50 100 Plus Strand HSPs: Score = 87 (30.6 bits), Expect = 0.11, P = 0.11 Identities = 23/73 (31%), Positives = 37/73 (50%), Frame = +3 Query: 201 NRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEISISYLSATPANEEVLANASVVK 380 NR G +HGG++ SL+++ + D++ E I+Y+ A A+ EV A V+ Sbjct: 42 NRGGVMHGGALFSLMDVTMGLACSSSHGFDRQSVTLECKINYIRAV-ADGEVRCVARVLH 100 Query: 381 TGRNLTVVAVEFK 419 GR VV E + Sbjct: 101 AGRRSLVVEAEVR 113 >gi|11499845 ref|NP_071089.1| conserved hypothetical protein [Archaeoglobus fulgidus] >gi|3334444|sp|O28020|YM64_ARCFU HYPOTHETICAL PROTEIN AF2264 >gi|7430238|pir||H69532 conserved hypothetical protein AF2264 - Archaeoglobus fulgidus >gi|2648253|gb|AAB88986.1| (AE000948) conserved hypothetical protein [Archaeoglobus fulgidus] Length = 154 Frame 3 hits (HSPs): ____________________________________ __________________________________________________ Database sequence: | | | | | 154 0 50 100 150 Plus Strand HSPs: Score = 89 (31.3 bits), Expect = 0.20, P = 0.18 Identities = 28/112 (25%), Positives = 47/112 (41%), Frame = +3 Query: 138 KVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEIS 317 ++ ++ G +V K N HGG + SL ++ A A + K E+S Sbjct: 39 RILEMKEGYAKVEMVVKKEHLNAANVCHGGIIFSLADL---AFALASNSHGKLALAIEVS 95 Query: 318 ISYLSATPANEEVLANASVVKTGRNLTVVAVEFKLKKAGNLLYITHSTFYNM 473 I+Y+ A E+++A A V G +E K A L+ + T Y + Sbjct: 96 ITYMKAAYEGEKLVAEAKEVNLGNKTATYLMEVK-NSANKLIALAKGTVYRV 146 >gi|5733877|gb|AAD49765.1|AC007932_13 (AC007932) F11A17.13 [Arabidopsis thaliana] Length = 156 Frame 3 hits (HSPs): ______________________________________ __________________________________________________ Database sequence: | | | | | 156 0 50 100 150 Plus Strand HSPs: Score = 88 (31.0 bits), Expect = 0.28, P = 0.25 Identities = 32/117 (27%), Positives = 56/117 (47%), Frame = +3 Query: 144 DHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLG-EISI 320 D + RI+ + P C + LHGG + E L++ A +A + G ++SI Sbjct: 23 DELSPTRITGRLPVSPVCCQPFKVLHGGVSALIAESLASMGAH--MASGFKRVAGIQLSI 80 Query: 321 SYLSATPANEEVLANASVVKTGRNLTVVAVE-FKL--KKAGNLLYITHST---FYNMPV 479 ++L + + V A A+ V TG+ + V V+ +K K N + I+ S N+P+ Sbjct: 81 NHLKSADLGDLVFAEATPVSTGKTIQVWEVKLWKTTQKDKANKILISSSRVTLICNLPI 139 >gi|7292257|gb|AAF47666.1| (AE003475) CG16985 gene product [Drosophila melanogaster] Length = 149 Frame 3 hits (HSPs): ______________________________________ __________________________________________________ Database sequence: | | | | 149 0 50 100 Plus Strand HSPs: Score = 87 (30.6 bits), Expect = 0.32, P = 0.27 Identities = 30/115 (26%), Positives = 56/115 (48%), Frame = +3 Query: 99 ARGFYDAFLRSFIKVDHIQRGR-ISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACART 275 + GF D L+ IK+ GR I VA + NR GTLHGG ++V+ N Sbjct: 21 SNGF-DRVLK-MIKITGGGDGRAIGEFTVANEHL-NRQGTLHGGLTATIVD---NCTTYA 74 Query: 276 VVAKDKELFL-GEISISYLSATPANEEVLANASVVKTGRNLTVVAVEFKLKKAGNLL 443 +++K + +++SY++A E + + + V+ G+ + + + K G ++ Sbjct: 75 LMSKGSHPGVTANLNVSYIAAAKPGELIEIDCNTVRAGKKMAYLDCILRRKSDGKII 131 >gi|11350055|pir||A83149 hypothetical protein PA3971 [imported] - Pseudomonas aeruginosa (strain PAO1) >gi|9950161|gb|AAG07358.1|AE004815_2 (AE004815) hypothetical protein [Pseudomonas aeruginosa] Length = 143 Frame 3 hits (HSPs): __________________________________ __________________________________________________ Database sequence: | | | | 143 0 50 100 Plus Strand HSPs: Score = 86 (30.3 bits), Expect = 0.35, P = 0.29 Identities = 28/100 (28%), Positives = 48/100 (48%), Frame = +3 Query: 189 PPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEISISYLSATPAN-EEVLAN 365 P + G +H G +L + + A A T+V + + + E ++ L PA + +L Sbjct: 44 PRHAQQNGFIHAGVQATLADHAAGAAAATLVEEGQTVLTLEFKLNLLR--PALCQRLLCR 101 Query: 366 ASVVKTGRNLTVVAVEFKLKKAGNLLYITHSTFYNMPVASL 488 A V+K GR +TVV E ++ G + +T M V +L Sbjct: 102 AEVLKAGRQVTVVEAEVFAERDGRRHLFSKATV-TMAVVAL 141 >gi|7473619|pir||E75289 probable phenylacetic acid degradation protein PaaI - Deinococcus radiodurans (strain R1) >gi|6460126|gb|AAF11862.1|AE002063_5 (AE002063) phenylacetic acid degradation protein PaaI, putative [Deinococcus radiodurans] Length = 146 Frame 3 hits (HSPs): _________________________________________ __________________________________________________ Database sequence: | | | | 146 0 50 100 Plus Strand HSPs: Score = 84 (29.6 bits), Expect = 0.72, P = 0.51 Identities = 35/130 (26%), Positives = 58/130 (44%), Frame = +3 Query: 54 VNAMGAATPIPANCNARGFYDAFLRSFIKVDHIQRGRISCTVVAKPPICNRYGTLHGGSV 233 V+A A TP P A + + + + R++ TV N +GT HGG + Sbjct: 13 VSAPPARTPYP---EAMSYAEVLGMTILDASP-DLTRVALTVTEAG--LNMHGTAHGGLI 66 Query: 234 GSLVE----ILSNACARTVVAKDKELFLGEISISYLSATPANEEVLANASVVKTGRNLTV 401 SL + ++SN A+ V A E +S+ A E ++A A+ + GR L Sbjct: 67 FSLADEAFAVISNLDAQAVAA--------ETHMSFFRAAREGERLVAVATPERVGRTLAT 118 Query: 402 VAVEFKLKKAGNLL 443 +E + + G +L Sbjct: 119 YRIEVRRGEEGEVL 132 >gi|7292258|gb|AAF47667.1| (AE003475) CG16986 gene product [Drosophila melanogaster] Length = 143 Frame 3 hits (HSPs): __________________________________ __________________________________________________ Database sequence: | | | | 143 0 50 100 Plus Strand HSPs: Score = 83 (29.2 bits), Expect = 0.87, P = 0.58 Identities = 27/97 (27%), Positives = 48/97 (49%), Frame = +3 Query: 138 KVDHIQRGRISCTVVAK--PPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLG- 308 KV + G +CT K N Y LHGG + +LV++++ T K G Sbjct: 30 KVKIVDGGDGACTAELKVDQDHVNLYKFLHGGYIMTLVDLIT-----TYALMSKPCHPGV 84 Query: 309 --EISISYLSATPANEEVLANASVVKTGRNLTVVAVEFKLKK 428 ++S+++L+ ++V+ A++ K G+ L + K KK Sbjct: 85 SVDLSVNFLNGAKLGDDVVIQANLSKVGKYLAFIDCTLKHKK 126 >gi|141254|sp|P20378|YPHR_HALHA HYPOTHETICAL 15.6 KD PROTEIN IN PHR 5'REGION >gi|99200|pir||A32580 hypothetical 15K protein (deoxyribodipyrimidine photo-lyase 5' region) - Halobacterium salinarum >gi|148792|gb|AAA72748.1| (M24544) ORF151 [Halobacterium halobium] >gi|10580850|gb|AAG19672.1| (AE005055) Vng1336c [Halobacterium sp. NRC-1] Length = 151 Frame 3 hits (HSPs): _______________________ Annotated Domains: ________________________________________________ __________________________________________________ Database sequence: | | | || 151 0 50 100 150 __________________ Annotated Domains: PRODOM PD175497: YPHR_HALHA 1..23 PRODOM PD006741: Q18187(2) 25..144 __________________ Plus Strand HSPs: Score = 82 (28.9 bits), Expect = 1.5, P = 0.77 Identities = 19/67 (28%), Positives = 37/67 (55%), Frame = +3 Query: 210 GTLHGGSVGSLVEILSNACARTVVAKDKELFLG--EISISYLSATPANEEVLANASVVKT 383 G +HGG +L++ R+ + K + ++++SYL PA +++A+ASVV+ Sbjct: 56 GDVHGGIAATLIDTAGGLAVRSALPKPVAANVATIDLNVSYLR--PARGDLIADASVVRV 113 Query: 384 GRNLTVVAV 410 G + V + Sbjct: 114 GSTVGVAEI 122 >gi|7471259|pir||E75467 ComA-related protein - Deinococcus radiodurans (strain R1) >gi|6458566|gb|AAF10426.1|AE001939_3 (AE001939) ComA-related protein [Deinococcus radiodurans] Length = 119 Frame 3 hits (HSPs): ______________________________________________ __________________________________________________ Database sequence: | | | | 119 0 50 100 Plus Strand HSPs: Score = 77 (27.1 bits), Expect = 1.9, P = 0.86 Identities = 27/109 (24%), Positives = 46/109 (42%), Frame = +3 Query: 135 IKVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLG-E 311 I+ HI+RG + + +P + G LH SV +L + R ++ + F E Sbjct: 4 IRFTHIERGLLRSELTVRPELFAPNGYLHAASVVALADTTCGYGTRVLLPDEATGFTTIE 63 Query: 312 ISISYLSATPANEEVLANASVVKTGRNLTVVAVEFKLKKAGNLLYITHST 461 + ++L T V A V GR V E + + GN++ + T Sbjct: 64 LKSNHLG-TSRQGVVTCEARAVHAGRTTQVWDAEVR-NEQGNVMALFRCT 111 >gi|3980377|gb|AAC95180.1| (AC004561) unknown protein [Arabidopsis thaliana] Length = 157 Frame 3 hits (HSPs): _______________________________________ __________________________________________________ Database sequence: | | | | | 157 0 50 100 150 Plus Strand HSPs: Score = 81 (28.5 bits), Expect = 2.3, P = 0.90 Identities = 31/121 (25%), Positives = 61/121 (50%), Frame = +3 Query: 102 RGFYDAFLRSFIKVDHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVV 281 + FY+ F I+V+ ++ G ISC+ + +R L G++ +LV+ + A V Sbjct: 31 KSFYENFSLRGIRVNRVEPGFISCSFKVPLRLTDRDKNLANGAIANLVDEVGGAL---VH 87 Query: 282 AKDKELFLG-EISISYLSATPANEEVLANASVV--KTGRNLTVVAVEFKLKKAGNLLYI- 449 + + + ++SI++LS EE+ + ++ + G T+V V K+ G ++ Sbjct: 88 GEGLPMSVSVDMSIAFLSKAKLGEELEITSRLLGERGGYKGTIVVVRNKM--TGEIIAEG 145 Query: 450 THSTF 464 HS F Sbjct: 146 RHSMF 150 >gi|11350298|pir||B83042 hypothetical protein PA4830 [imported] - Pseudomonas aeruginosa (strain PAO1) >gi|9951099|gb|AAG08215.1|AE004896_5 (AE004896) hypothetical protein [Pseudomonas aeruginosa] Length = 179 Frame 3 hits (HSPs): _________________________ __________________________________________________ Database sequence: | | | | | 179 0 50 100 150 Plus Strand HSPs: Score = 79 (27.8 bits), Expect = 5.8, P = 1.0 Identities = 29/87 (33%), Positives = 40/87 (45%), Frame = +3 Query: 201 NRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEISISYLSA-TPANEEVLANASVV 377 N G++HGG SL++ T + + ++ ISYL A T V A VV Sbjct: 85 NPLGSVHGGYAASLLDSCMGCAIHTRLQAGQGYTTTDLRISYLRALTDKVGPVRAEGRVV 144 Query: 378 KTGRNLTVVAVEFKLKKAGNLLYITHST 461 GR+ T VA E +L + LY ST Sbjct: 145 HLGRS-TAVA-EGRLYDVDDRLYAVGST 170 >gi|10177184|dbj|BAB10318.1| (AB017061) gb|AAD49765.1~gene_id:K19E20.6~similar to unknown protein [Arabidopsis thaliana] Length = 157 Frame 3 hits (HSPs): _______________________________ __________________________________________________ Database sequence: | | | | | 157 0 50 100 150 Plus Strand HSPs: Score = 77 (27.1 bits), Expect = 7.2, P = 1.0 Identities = 26/95 (27%), Positives = 42/95 (44%), Frame = +3 Query: 144 DHIQRGRISCTVVAKPPICNRYGTLHGGSVGSLVEILSNACARTVVAKDKELFLGEISIS 323 D + R+S + C + LHGG + E L++ A + + K + +SI Sbjct: 22 DELSATRVSGHLTLTEKCCQPFKVLHGGVSALIAEALASLGAG-IASGFKRVAGIHLSIH 80 Query: 324 YLSATPANEEVLANASVVKTGRNLTVVAVE-FKLKK 428 +L E V A + V G+N+ V V +K KK Sbjct: 81 HLRPAALGEIVFAESFPVSVGKNIQVWEVRLWKAKK 116 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.98 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.325 0.139 0.405 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.359 0.159 0.702 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.342 0.149 0.509 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.358 0.157 0.654 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.328 0.140 0.431 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.348 0.149 0.474 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 194 194 10. 76 3 12 22 0.091 35 31 0.10 38 +2 0 194 194 10. 76 3 12 22 0.091 35 31 0.10 38 +1 0 195 195 10. 76 3 12 22 0.091 35 31 0.10 38 -1 0 195 195 10. 76 3 12 22 0.091 35 31 0.10 38 -2 0 194 194 10. 76 3 12 22 0.091 35 31 0.10 38 -3 0 194 194 10. 76 3 12 22 0.091 35 31 0.10 38 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 22 No. of states in DFA: 594 (59 KB) Total size of DFA: 236 KB (256 KB) Time to generate neighborhood: 0.02u 0.00s 0.02t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 208.11u 1.22s 209.33t Elapsed: 00:00:36 Total cpu time: 208.18u 1.22s 209.40t Elapsed: 00:00:36 Start: Thu Jan 17 17:27:24 2002 End: Thu Jan 17 17:28:00 2002
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000