WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1.
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84.
RepeatMasker repeats found in sequence: No Repeats Found.

Reference: Gish, Warren (1994-1997). unpublished.
Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.

Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= A09B02_CONSENSUS (630 letters)
Translating both strands of query sequence in all 6 reading frames

Database: nr
          625,274 sequences; 197,782,623 total letters.

Observed Numbers of Database Sequences Satisfying
Various EXPECTation Thresholds (E parameter values)

Histogram units: = 3 Sequences    : less than 3 sequences

 EXPECTation Threshold
 (E parameter)
    |
    V   Observed Counts-->
  10000  732 172 |=========================================================
   6310  560 116 |======================================
   3980  444 106 |===================================
   2510  338  96 |================================
   1580  242  77 |=========================
   1000  165  65 |=====================
    631  100  29 |=========
    398   71  17 |=====
    251   54  19 |======
    158   35   7 |==
    100   28   5 |=
   63.1   23   3 |=
   39.8   20   1 |:
   25.1   19   2 |:
   15.8   17   1 |:
 >>>>>>>>>>>>>>>>>>>>>  Expect = 10.0, Observed = 16  <<<<<<<<<<<<<<<<<
   10.0   16   1 |:
   6.31   15   1 |:
   3.98   14   0 |
   2.51   14   0 |
   1.58   14   3 |=

                                                                     Smallest
                                                                     Sum
                                                      Reading  High  Probability
Sequences producing High-scoring Segment Pairs:         Frame Score  P(N)      N

gi|7488713|pir||T08896 Sali3-2 protein, aluminium-indu...  +2   499  9.9e-47   1
gi|485515|pir||S33622 ADR6 protein - soybean >gi|29644...  +2   457  2.8e-42   1
gi|2129886|pir||S70755 hypothetical protein - garden p...  +2   170  2.3e-16   2
gi|119097|sp|P21747|EA92_VICFA EMBRYONIC ABUNDANT PROT...  +2   160  3.7e-15   2
gi|100102|pir||S14068 seed protein precursor - tick be...  +2   152  3.5e-14   2
gi|119095|sp|P21745|EA30_VICFA EMBRYONIC ABUNDANT PROT...  +2   152  3.5e-14   2
gi|135067|sp|P09059|SVF3_VICFA UNKNOWN SEED PROTEIN 30...  +2   152  1.5e-13   2
gi|119096|sp|P21746|EA87_VICFA EMBRYONIC ABUNDANT PROT...  +2   152  1.5e-13   2
gi|7488827|pir||T06815 probable embryonic abundant pro...  +2   133  5.9e-10   2
gi|1172874|sp|Q08298|RD22_ARATH DEHYDRATION-RESPONSIVE...  +2   125  7.6e-05   1
gi|7106540|dbj|BAA92225.1| (AB038692) similar to the B...  +2   101  0.00027   1
gi|451592|gb|AAA18047.1| (U04982) vpx [Simian immunode...  +1    68  0.77      1
gi|451604|gb|AAA18057.1| (U04984) vpx [Simian immunode...  +1    68  0.77      1
gi|7462014|pir||T11554 vpx protein - simian immunodefi...  +1    68  0.77      1
gi|9719737|gb|AAF97839.1|AC034107_22 (AC034107) ESTs g...  +2    63  0.995     1
gi|451598|gb|AAA18052.1| (U04983) vpx [Simian immunode...  +1    65  0.998     1

Locally-aligned regions (HSPs) with respect to query sequence (query scale 0 to 210):

Frame 3 Hits: gi|2129886, gi|119097, gi|100102, gi|119095, gi|135067, gi|119096, gi|7488827
Frame 2 Hits: gi|7488713, gi|485515, gi|2129886, gi|119097, gi|100102, gi|119095, gi|135067, gi|119096, gi|7488827, gi|1172874, gi|7106540, gi|9719737
Frame 1 Hits: gi|451592, gi|451604, gi|7462014, gi|451598
>gi|7488713|pir||T08896 Sali3-2 protein, aluminium-induced - soybean
 >gi|2317900|gb|AAB66369.1| (U89693) Sali3-2 [Glycine max]
        Length = 276

  Plus Strand HSPs:

 Score = 499 (175.7 bits), Expect = 9.9e-47, P = 9.9e-47
 Identities = 110/176 (62%), Positives = 129/176 (73%), Frame = +2

Query:     2 HEQPYGVYTWLTDIKDTSKEGYSFEEICIKKEAFEGEEKFCAKSLGTVIGFAISKLGKNI 181
             + QPYGVYTWLTDIKDTSKEGYSFEEICIKKEAFEGEEKFCAKSLGTVIGFAISKLGKNI
Sbjct:    97 YAQPYGVYTWLTDIKDTSKEGYSFEEICIKKEAFEGEEKFCAKSLGTVIGFAISKLGKNI 156

Query:   182 QVLSSSFVNKQEQYTVEGVAESWRQSSDVVHGLNFRNCSYFTAINPLKQQLS-MVPLVAG 358
             QVLSSSFVNKQEQYTVEGV ++ + + HGLNFR ++ + +++ + +VPLVAG
Sbjct:   157 QVLSSSFVNKQEQYTVEGV-QNLGDKAVMCHGLNFRTAVFYC--HKVRETTAFVVPLVAG 213

Query:   359 DGTKTPALAVCPSXYFWNESSHAS*LLGFDP*TX-LLPFPWNQAFXGSPFSXDLPTXY 529
             DGTKT ALAVC S L+G DP T + F ++A P + + T Y
Sbjct:   214 DGTKTQALAVCHSDTSGMNHHILHELMGVDPGTNPVCHFLGSKAILWVP-NISMDTAY 270

>gi|485515|pir||S33622 ADR6 protein - soybean
 >gi|296445|emb|CAA49340.1| (X69639) auxin down regulated [Glycine max]
 >gi|2304955|gb|AAB65592.1| (U64866) similar to ADR6 encoded by GenBank Accession Number X69639; aluminum induced [Glycine max]
        Length = 272

  Annotated Domains:
    DOMO DM02982: 18..258

  Plus Strand HSPs:

 Score = 457 (160.9 bits), Expect = 2.8e-42, P = 2.8e-42
 Identities = 96/151 (63%), Positives = 115/151 (76%), Frame = +2

Query:     5 EQPYGVYTWLTDIKDTSKEGYSFEEICIKKEAFEGEEKFCAKSLGTVIGFAISKLGKNIQ 184
             +QP+GV TWL +IKDT+KEGYSFEE+CIKKEA EGEEKFCAKSLGTVIGFAISKLGKNIQ
Sbjct:    94 QQPWGVGTWLKEIKDTTKEGYSFEELCIKKEAIEGEEKFCAKSLGTVIGFAISKLGKNIQ 153

Query:   185 VLSSSFVNKQEQYTVEGVAESWRQSSDVVHGLNFRNCSYFTAINPLKQQLS-MVPLVAGD 361
             VLSSSFVNKQ+QYTVEGV ++ + + H LNFR ++ + +++ + MVPLVAGD
Sbjct:   154 VLSSSFVNKQDQYTVEGV-QNLGDKAVMCHRLNFRTAVFYC--HEVRETTAFMVPLVAGD 210

Query:   362 GTKTPALAVCPSXYFWNESSHAS*LLGFDP*T 457
             GTKT ALA+C S L+G DP T
Sbjct:   211 GTKTQALAICHSNTSGMNHQMLHQLMGVDPGT 242

>gi|2129886|pir||S70755 hypothetical protein - garden pea
 >gi|20914|emb|CAA38756.1| (X55012) unknown seed protein [Pisum sativum]
        Length = 221

  Plus Strand HSPs:

 Score = 170 (59.8 bits), Expect = 2.3e-16, Sum P(2) = 2.3e-16
 Identities = 45/113 (39%), Positives = 59/113 (52%), Frame = +2

Query:    53 SKEGYSFEEICIKKEAFEGEEKFCAKSLGTVIGFAISKLGKN-IQVLSSSFVNKQEQYTV 229
             +K+ S E+ C A E K C SL ++I IS G I+ +SS+F Q+QY V
Sbjct:    76 NKDEQSLEDFCYSPTAI-AEHKHCVSSLNSMIDEVISHFGTTKIKAISSNFAQNQDQYDV 134

Query:   230 EGVAESWRQSSDVVHGLNFRNCSYFTAINPLKQQLSMVPLVAGDGTKTPALAVC 391
             E V + +++ + H LNF N F K MV LVA DGTKT AL VC
Sbjct:   135 EEVKKV-SENAVMCHRLNFENV-VFNCHQVSKTTAYMVSLVASDGTKTNALTVC 186

 Score = 56 (19.7 bits), Expect = 2.3e-16, Sum P(2) = 2.3e-16
 Identities = 10/29 (34%), Positives = 18/29 (62%), Frame = +3

Query:   402 TSGMNPPMLHDSWDLIP-ELXCCHSLGTR 485
             T GMNP +L+++ + P + CH +G +
Sbjct:   190 TRGMNPELLYEALQVTPGTVPVCHFIGNK 218

>gi|119097|sp|P21747|EA92_VICFA EMBRYONIC ABUNDANT PROTEIN USP92 PRECURSOR
 >gi|82002|pir||S04136 embryonic abundant protein precursor (clone pUSP92) - tick bean
 >gi|22051|emb|CAA31602.1| (X13210) USP precursor [Vicia faba]
        Length = 268

  Annotated Domains:
    Entrez Domain: 3 X 6 AA APPROXIMATE REPEATS. 50..106
    Entrez Repetitive region: 1-1. 50..55
    Entrez Repetitive region: 1-2. 83..88
    Entrez Repetitive region: 1-3. 101..106
    Entrez Domain: 2 X APPROXIMATE REPEATS. 166..222
    Entrez Repetitive region: 2-1. 166..183
    Entrez Repetitive region: 2-2. 202..222
    Entrez glycosylation site: POTENTIAL. 259
    PRODOM PD009870: 1..79
    PRODOM PD013884: 86..126
    PRODOM PD004669: 128..256

  Plus Strand HSPs:

 Score = 160 (56.3 bits), Expect = 3.7e-15, Sum P(2) = 3.7e-15
 Identities = 44/113 (38%), Positives = 58/113 (51%), Frame = +2

Query:    53 SKEGYSFEEICIKKEAFEGEEKFCAKSLGTVIGFAISKLGKN-IQVLSSSFVNKQEQYTV 229
             +KE SFE+ C A E K C SL ++I IS G I+ +SS+F Q+QY V
Sbjct:   110 NKEKQSFEDFCYSPTAI-AEHKHCVSSLKSMIDQVISHFGSTKIKAISSNFAPYQDQYVV 168

Query:   230 EGVAESWRQSSDVVHGLNFRNCSYFTAINPLKQQLSMVPLVAGDGTKTPALAVC 391
             E V + ++ + H LNF F + +V LVA DGTKT AL VC
Sbjct:   169 EDVKKVG-DNAVMCHRLNFEKV-VFNCHQVRETTAYVVSLVASDGTKTKALTVC 220

 Score = 58 (20.4 bits), Expect = 3.7e-15, Sum P(2) = 3.7e-15
 Identities = 10/29 (34%), Positives = 19/29 (65%), Frame = +3

Query:   402 TSGMNPPMLHDSWDLIP-ELXCCHSLGTR 485
             T GMNP +L+++ ++ P + CH +G +
Sbjct:   224 TRGMNPELLYEALEVTPGTVPVCHFIGNK 252

>gi|100102|pir||S14068 seed protein precursor - tick bean
 >gi|22043|emb|CAA39696.1| (X56240) unknown seed protein [Vicia faba]
        Length = 268

  Annotated Domains:
    Entrez domain: signal sequence 1..22

  Plus Strand HSPs:

 Score = 152 (53.5 bits), Expect = 3.5e-14, Sum P(2) = 3.5e-14
 Identities = 43/113 (38%), Positives = 56/113 (49%), Frame = +2

Query:    53 SKEGYSFEEICIKKEAFEGEEKFCAKSLGTVIGFAISKLGKN-IQVLSSSFVNKQEQYTV 229
             +KE S E+ C A E K C SL ++I IS G I+ +SS+F Q+QY V
Sbjct:   110 NKEKQSLEDFCYSPTAI-AEHKHCVSSLKSMIDQVISHFGSTKIKAISSNFAPYQDQYVV 168

Query:   230 EGVAESWRQSSDVVHGLNFRNCSYFTAINPLKQQLSMVPLVAGDGTKTPALAVC 391
             E V + ++ + H LNF F +V LVA DGTKT AL VC
Sbjct:   169 EDVKKVG-DNAVMCHRLNFEKV-VFNCHQVRDTTAYVVSLVASDGTKTKALTVC 220

 Score = 58 (20.4 bits), Expect = 3.5e-14, Sum P(2) = 3.5e-14
 Identities = 10/29 (34%), Positives = 19/29 (65%), Frame = +3

Query:   402 TSGMNPPMLHDSWDLIP-ELXCCHSLGTR 485
             T GMNP +L+++ ++ P + CH +G +
Sbjct:   224 TRGMNPELLYEALEVTPGTVPVCHFIGNK 252

>gi|119095|sp|P21745|EA30_VICFA EMBRYONIC ABUNDANT PROTEIN VF30.1 PRECURSOR
 >gi|82003|pir||S05471 embryonic abundant protein precursor (clone USP Vf30.1) - tick bean
        Length = 268

  Annotated Domains:
    DOMO DM02982: 17..258
    Entrez Domain: 3 X 6 AA APPROXIMATE REPEATS. 50..106
    Entrez Repetitive region: 1-1. 50..55
    Entrez Repetitive region: 1-2. 83..88
    Entrez Repetitive region: 1-3. 101..106
    Entrez Domain: 2 X APPROXIMATE REPEATS. 166..222
    Entrez Repetitive region: 2-1. 166..183
    Entrez Repetitive region: 2-2. 202..222
    Entrez glycosylation site: POTENTIAL. 259
    PRODOM PD009870: 1..79
    PRODOM PD013884: 86..126
    PRODOM PD004669: 128..256

  Plus Strand HSPs:

 Score = 152 (53.5 bits), Expect = 3.5e-14, Sum P(2) = 3.5e-14
 Identities = 43/113 (38%), Positives = 56/113 (49%), Frame = +2

Query:    53 SKEGYSFEEICIKKEAFEGEEKFCAKSLGTVIGFAISKLGKN-IQVLSSSFVNKQEQYTV 229
             +KE S E+ C A E K C SL ++I IS G I+ +SS+F Q+QY V
Sbjct:   110 NKEKQSLEDFCYSPTAI-AEHKHCVSSLKSMIDQVISHFGSTKIKAISSNFAPYQDQYVV 168

Query:   230 EGVAESWRQSSDVVHGLNFRNCSYFTAINPLKQQLSMVPLVAGDGTKTPALAVC 391
             E V + ++ + H LNF F +V LVA DGTKT AL VC
Sbjct:   169 EDVKKVG-DNAVMCHRLNFEKV-VFNCHQVRDTTAYVVSLVASDGTKTKALTVC 220

 Score = 58 (20.4 bits), Expect = 3.5e-14, Sum P(2) = 3.5e-14
 Identities = 10/29 (34%), Positives = 19/29 (65%), Frame = +3

Query:   402 TSGMNPPMLHDSWDLIP-ELXCCHSLGTR 485
             T GMNP +L+++ ++ P + CH +G +
Sbjct:   224 TRGMNPELLYEALEVTPGTVPVCHFIGNK 252

>gi|135067|sp|P09059|SVF3_VICFA UNKNOWN SEED PROTEIN 30.1 PRECURSOR (VF30.1)
 >gi|82000|pir||S03328 embryonic abundant protein precursor (clone pUSP14) - tick bean
 >gi|22046|emb|CAA31626.1| (X13242) seed protein [Vicia faba]
        Length = 268

  Annotated Domains:
    PRODOM PD009870: 1..79
    PRODOM PD013884: 86..126
    PRODOM PD004669: 128..256

  Plus Strand HSPs:

 Score = 152 (53.5 bits), Expect = 1.5e-13, Sum P(2) = 1.5e-13
 Identities = 43/113 (38%), Positives = 56/113 (49%), Frame = +2

Query:    53 SKEGYSFEEICIKKEAFEGEEKFCAKSLGTVIGFAISKLGKN-IQVLSSSFVNKQEQYTV 229
             +KE S E+ C A E K C SL ++I IS G I+ +SS+F Q+QY V
Sbjct:   110 NKEKQSLEDFCYSPTAI-AEHKHCVSSLKSMIDQVISHFGSTKIKAISSNFAPYQDQYVV 168

Query:   230 EGVAESWRQSSDVVHGLNFRNCSYFTAINPLKQQLSMVPLVAGDGTKTPALAVC 391
             E V + ++ + H LNF F +V LVA DGTKT AL VC
Sbjct:   169 EDVKKVG-DNAVMCHRLNFEKV-VFNCHQVRDTTAYVVSLVASDGTKTKALTVC 220

 Score = 52 (18.3 bits), Expect = 1.5e-13, Sum P(2) = 1.5e-13
 Identities = 9/29 (31%), Positives = 19/29 (65%), Frame = +3

Query:   402 TSGMNPPMLHDSWDL-IPELXCCHSLGTR 485
             T GMNP +L+++ ++ + + CH +G +
Sbjct:   224 TRGMNPELLYEALEVTLGTVPVCHFIGNK 252

>gi|119096|sp|P21746|EA87_VICFA EMBRYONIC ABUNDANT PROTEIN USP87 PRECURSOR
 >gi|82001|pir||S04135 embryonic abundant protein precursor (clone pUSP87) - tick bean
 >gi|22049|emb|CAA31603.1| (X13211) USP precursor [Vicia faba]
        Length = 268

  Annotated Domains:
    Entrez Domain: 3 X 6 AA APPROXIMATE REPEATS. 50..106
    Entrez Repetitive region: 1-1. 50..55
    Entrez Repetitive region: 1-2. 83..88
    Entrez Repetitive region: 1-3. 101..106
    Entrez Domain: 2 X APPROXIMATE REPEATS. 166..222
    Entrez Repetitive region: 2-1. 166..183
    Entrez Repetitive region: 2-2. 202..222
    Entrez glycosylation site: POTENTIAL. 259
    PRODOM PD009870: 1..79
    PRODOM PD013884: 86..126
    PRODOM PD004669: 128..256

  Plus Strand HSPs:

 Score = 152 (53.5 bits), Expect = 1.5e-13, Sum P(2) = 1.5e-13
 Identities = 43/113 (38%), Positives = 56/113 (49%), Frame = +2

Query:    53 SKEGYSFEEICIKKEAFEGEEKFCAKSLGTVIGFAISKLGKN-IQVLSSSFVNKQEQYTV 229
             +KE S E+ C A E K C SL ++I IS G I+ +SS+F Q+QY V
Sbjct:   110 NKEKQSLEDFCYSPTAI-AEHKHCVSSLKSMIDQVISHFGSTKIKAISSNFAPYQDQYVV 168

Query:   230 EGVAESWRQSSDVVHGLNFRNCSYFTAINPLKQQLSMVPLVAGDGTKTPALAVC 391
             E V + ++ + H LNF F +V LVA DGTKT AL VC
Sbjct:   169 EDVKKVG-DNAVMCHRLNFEKV-VFNCHQVRDTTAYVVSLVASDGTKTKALTVC 220

 Score = 52 (18.3 bits), Expect = 1.5e-13, Sum P(2) = 1.5e-13
 Identities = 9/29 (31%), Positives = 19/29 (65%), Frame = +3

Query:   402 TSGMNPPMLHDSWDL-IPELXCCHSLGTR 485
             T GMNP +L+++ ++ + + CH +G +
Sbjct:   224 TRGMNPELLYEALEVTLGTVPVCHFIGNK 252

>gi|7488827|pir||T06815 probable embryonic abundant protein - garden pea (fragment)
 >gi|20912|emb|CAA38755.1| (X55011) internal part of pea Unknown Seed Protein (USP) [Pisum sativum]
        Length = 221

  Plus Strand HSPs:

 Score = 133 (46.8 bits), Expect = 5.9e-10, Sum P(2) = 5.9e-10
 Identities = 40/113 (35%), Positives = 55/113 (48%), Frame = +2

Query:    53 SKEGYSFEEICIKKEAFEGEEKFCAKSLGTVIGFAISKLGKN-IQVLSSSFVNKQEQYTV 229
             +K+ S E+ C A E K C SL ++I IS G+ I+ +SS+ Q+QY
Sbjct:    76 NKDEQSLEDFCYSPTAI-AEHKHCVSSLNSMIDEVISHFGQTKIKAISSNSAQNQDQYAW 134

Query:   230 EGVAESWRQSSDVVHGLNFRNCSYFTAINPLKQQLSMVPLVAGDGTKTPALAVC 391
             E V + +++ + H LN N F K MV LVA DG KT L VC
Sbjct:   135 EEV-KIVTENAVMCHRLNSDNV-VFNCHQVSKTTAYMVSLVASDGYKTNDLTVC 186

 Score = 56 (19.7 bits), Expect = 5.9e-10, Sum P(2) = 5.9e-10
 Identities = 10/29 (34%), Positives = 18/29 (62%), Frame = +3

Query:   402 TSGMNPPMLHDSWDLIP-ELXCCHSLGTR 485
             T GMNP +L+++ + P + CH +G +
Sbjct:   190 TRGMNPELLYEALQVTPGTVPVCHFIGNK 218

>gi|1172874|sp|Q08298|RD22_ARATH DEHYDRATION-RESPONSIVE PROTEIN RD22 PRECURSOR
 >gi|479589|pir||S34823 dehydration-induced protein RD22 - Arabidopsis thaliana
 >gi|391608|dbj|BAA01546.1| (D10703) rd22 [Arabidopsis thaliana]
 >gi|447134|prf||1913421A rd22 gene [Arabidopsis thaliana]
        Length = 392

  Annotated Domains:
    DOMO DM02982: 118..391
    Entrez Domain: 5 X APPROXIMATE REPEATS. 57..164
    Entrez Repetitive region: 1. 57..75
    Entrez Repetitive region: 2. 78..97
    Entrez Repetitive region: 3. 100..120
    Entrez Repetitive region: 4. 125..142
    Entrez Repetitive region: 5. 145..164
    PRODOM PD122867: RD22_ARATH 1..167
    PRODOM PD004669: 169..390

  Plus Strand HSPs:

 Score = 125 (44.0 bits), Expect = 7.6e-05, P = 7.6e-05
 Identities = 48/162 (29%), Positives = 77/162 (47%), Frame = +2

Query:    11 PYGVYTWLTDIKDTSKEGYSFEEICIKK-----EA--FEGEEKFCAKSLGTVIGFAISKL 169
             P+G + +K S E S E +KK EA GEEK+CA SL +++ F++SKL
Sbjct:   214 PFGSEKFSETLKRFSVEAGSEEAEMMKKTIEECEARKVSGEEKYCATSLESMVDFSVSKL 273

Query:   170 GK-NIQVLSSSFVNKQ---EQYTVE--GVAESWRQSSDVVHGLNFRNCSYFTAINPLKQQ 331
             GK +++ +S+ K ++Y + GV + S V H + + F +
Sbjct:   274 GKYHVRAVSTEVAKKNAPMQKYKIAAAGVKKLSDDKSVVCHKQKYP-FAVFYCHKAMMTT 332

Query:   332 LSMVPLVAGDGTKTPALAVC-PSXYFWNESSHAS*LLGFDP*T 457
             + VPL +G + A+AVC + WN + A +L P T
Sbjct:   333 VYAVPLEGENGMRAKAVAVCHKNTSAWNPNHLAFKVLKVKPGT 375

>gi|7106540|dbj|BAA92225.1| (AB038692) similar to the BURP domain [Vigna unguiculata]
        Length = 132

  Plus Strand HSPs:

 Score = 101 (35.6 bits), Expect = 0.00027, P = 0.00027
 Identities = 35/114 (30%), Positives = 59/114 (51%), Frame = +2

Query:   125 AKSLGTVIGFAISKLGKNIQVLSSSFVNKQ---EQYTVE-GVAESWRQSSDVVHGLNFRN 292
             A SL +++ F+ SKLGKN+ VLS+ V+++ +QYT+ GV + ++ V H ++
Sbjct:     2 ATSLESMVDFSTSKLGKNVAVLSTE-VDQETGLQQYTIAPGVKKVSGDNAVVCHKQSYPY 60

Query:   293 CSYFTAINPLKQQLSMVPLVAGDGTKTPALAVCPSXYF-WNESSHAS*LLGFDP*T 457
             ++ + VPL +G + A+AVC + WN A +L P T
Sbjct:    61 AVFYCHKTETTRTYP-VPLEGANGIRVKAVAVCHTDTSQWNPKHLAFEVLKVKPGT 115

>gi|451592|gb|AAA18047.1| (U04982) vpx [Simian immunodeficiency virus]
 >gi|451644|gb|AAA18090.1| (U04991) vpx [Simian immunodeficiency virus]
        Length = 91

  Plus Strand HSPs:

 Score = 68 (23.9 bits), Expect = 1.5, P = 0.77
 Identities = 17/49 (34%), Positives = 27/49 (55%), Frame = +1

Query:   112 REVLCKILGNSNWFCHFKAGKEHSST-FKFLC--Q*ARAIHCGRSCRIL 249
             RE++ ++ S + H + G S T +++LC Q A +HC R CR L
Sbjct:    21 RELIFQVWRRSWEYWHDEMGMSESYTKYRYLCLIQKALFVHCKRGCRCL 69

>gi|451604|gb|AAA18057.1| (U04984) vpx [Simian immunodeficiency virus]
        Length = 91

  Plus Strand HSPs:

 Score = 68 (23.9 bits), Expect = 1.5, P = 0.77
 Identities = 17/49 (34%), Positives = 27/49 (55%), Frame = +1

Query:   112 REVLCKILGNSNWFCHFKAGKEHSST-FKFLC--Q*ARAIHCGRSCRIL 249
             RE++ ++ S + H + G S T +++LC Q A +HC R CR L
Sbjct:    21 RELIFQVWQRSWEYWHDEMGMSESYTKYRYLCLIQKALFVHCKRGCRCL 69

>gi|7462014|pir||T11554 vpx protein - simian immunodeficiency virus SIVsm (strain 62) (fragment)
 >gi|451610|gb|AAA18062.1| (U04985) vpx [Simian immunodeficiency virus]
        Length = 91

  Plus Strand HSPs:

 Score = 68 (23.9 bits), Expect = 1.5, P = 0.77
 Identities = 17/49 (34%), Positives = 27/49 (55%), Frame = +1

Query:   112 REVLCKILGNSNWFCHFKAGKEHSST-FKFLC--Q*ARAIHCGRSCRIL 249
             RE++ ++ S + H + G S T +++LC Q A +HC R CR L
Sbjct:    21 RELIFQVWRRSWEYWHDEMGMSESYTKYRYLCLIQKALFVHCKRGCRCL 69

>gi|9719737|gb|AAF97839.1|AC034107_22 (AC034107) ESTs gb|T22732, gb|AA585816 come from this gene. [Arabidopsis thaliana]
        Length = 85

  Plus Strand HSPs:

 Score = 63 (22.2 bits), Expect = 5.2, P = 0.99
 Identities = 22/75 (29%), Positives = 33/75 (44%), Frame = +2

Query:    80 ICIKKEAFEGEEKFCAKSLGTVIGFAISKLGKNIQVLSSSFVNKQEQYTVEGVAES-WRQ 256
             +C + E E + + L T +G AIS G V+S+S + G A S WR
Sbjct:     8 VCCGSDVLEVESCYDSLRLWTPVGVAISSRGS--AVVSASATTAEVDREKVGSANSPWRL 65

Query:   257 SSDVVHGLNFRNCSYF 304
             + D+ + R S F
Sbjct:    66 TVDLPRFVGCRGFSVF 81

>gi|451598|gb|AAA18052.1| (U04983) vpx [Simian immunodeficiency virus]
 >gi|451616|gb|AAA18067.1| (U04986) vpx [Simian immunodeficiency virus]
 >gi|451622|gb|AAA18072.1| (U04987) vpx [Simian immunodeficiency virus]
 >gi|451628|gb|AAA18077.1| (U04988) vpx [Simian immunodeficiency virus]
 >gi|451639|gb|AAA18086.1| (U04990) vpx [Simian immunodeficiency virus]
        Length = 91

  Plus Strand HSPs:

 Score = 65 (22.9 bits), Expect = 6.4, P = 1.0
 Identities = 16/49 (32%), Positives = 27/49 (55%), Frame = +1

Query:   112 REVLCKILGNSNWFCHFKAGKEHSST-FKFLC--Q*ARAIHCGRSCRIL 249
             RE++ ++ S + H + G S T +++LC Q A +HC + CR L
Sbjct:    21 RELIFQVWRRSWEYWHDEMGMSESYTKYRYLCLIQKALFVHCKKGCRCL 69

Parameters:
  filter=none
  matrix=BLOSUM62
  V=50
  B=50
  E=10
  gi
  H=1
  sort_by_pvalue
  echofilter

  ctxfactor=5.96

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   Std.    0   BLOSUM62                                 0.318   0.135   0.401
   +3      0   BLOSUM62        0.318   0.135   0.401    0.338   0.149   0.529
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +2      0   BLOSUM62        0.318   0.135   0.401    0.325   0.140   0.453
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +1      0   BLOSUM62        0.318   0.135   0.401    0.354   0.159   0.641
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -1      0   BLOSUM62        0.318   0.135   0.401    0.325   0.137   0.443
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -2      0   BLOSUM62        0.318   0.135   0.401    0.344   0.151   0.548
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -3      0   BLOSUM62        0.318   0.135   0.401    0.343   0.153   0.519
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E    S  W   T   X   E2     S2
   +3      0      209       198       10.  76  3  12  22  0.093   35
                                                      31  0.10    38
   +2      0      209       197       10.  76  3  12  22  0.092   35
                                                      31  0.10    38
   +1      0      210       199       10.  77  3  12  22  0.093   35
                                                      31  0.10    38
   -1      0      210       198       10.  76  3  12  22  0.093   35
                                                      31  0.10    38
   -2      0      209       198       10.  76  3  12  22  0.093   35
                                                      31  0.10    38
   -3      0      209       197       10.  76  3  12  22  0.092   35
                                                      31  0.10    38

Statistics:

  Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr
    Title: nr
    Release date: unknown
    Posted date: 4:06 PM CST Feb 28, 2001
    Format: BLAST
  # of letters in database: 197,782,623
  # of sequences in database: 625,274
  # of database sequences satisfying E: 16
  No. of states in DFA: 592 (58 KB)
  Total size of DFA: 245 KB (256 KB)
  Time to generate neighborhood: 0.02u 0.00s 0.02t  Elapsed: 00:00:00
  No. of threads or processors used: 6
  Search cpu time: 212.91u 0.99s 213.90t  Elapsed: 00:00:36
  Total cpu time: 212.97u 0.99s 213.96t  Elapsed: 00:00:36
  Start: Mon Oct 1 21:13:49 2001   End: Mon Oct 1 21:14:25 2001
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000
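
Note on reading the Expect, P and bit-score columns: the sketch below is an illustration, not part of the WU-BLAST or BEAUTY output. It assumes the usual Poisson relationship P = 1 - exp(-E) between an Expect value and a P-value, and it assumes that the bit scores printed in this report equal raw score x lambda / ln 2 with the gapped lambda of 0.244 from the "Q=9,R=2" rows of the parameters table. The second assumption is an inference from the numbers in this report (for example 499 -> 175.7 bits), not documented behaviour. The function names `p_from_expect` and `bits_from_raw_score` are hypothetical.

```python
import math

# Assumption: gapped lambda from the "Q=9,R=2" rows of the parameters table.
GAPPED_LAMBDA = 0.244


def p_from_expect(expect: float) -> float:
    """Poisson relationship between an E-value and a P-value: P = 1 - exp(-E)."""
    # expm1 keeps precision for very small E, where P is approximately E.
    return -math.expm1(-expect)


def bits_from_raw_score(score: int, lam: float = GAPPED_LAMBDA) -> float:
    """Convert a raw score to bits as score * lambda / ln 2 (inferred, see note)."""
    return score * lam / math.log(2)


if __name__ == "__main__":
    # Expect values taken from the weakest hits in the report.
    for expect in (1.5, 5.2, 6.4):
        print(f"E = {expect:<4} -> P ~ {p_from_expect(expect):.3f}")
    # Raw scores taken from the hit list.
    for score in (499, 457, 170, 68):
        print(f"S = {score:<4} -> {bits_from_raw_score(score):.1f} bits")
```

For strong hits the two statistics are numerically the same (Expect = 9.9e-47, P = 9.9e-47); they only diverge as Expect approaches 1, which is why the weakest hits show Expect = 5.2 next to P = 0.99. The report prints P to two significant digits, so small differences from the computed value are expected.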