WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= SSH1F07.SEQ(1>232) (208 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 505,245 sequences; 158,518,215 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 21 Sequences : less than 21 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 6923 1263 |============================================================ 6310 5660 995 |=============================================== 3980 4665 767 |==================================== 2510 3898 715 |================================== 1580 3183 632 |============================== 1000 2551 517 |======================== 631 2034 397 |================== 398 1637 392 |================== 251 1245 358 |================= 158 887 257 |============ 100 630 150 |======= 63.1 480 159 |======= 39.8 321 116 |===== 25.1 205 68 |=== 15.8 137 39 |= >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 98 <<<<<<<<<<<<<<<<< 10.0 98 26 |= 6.31 72 22 |= 3.98 50 15 |: 2.51 35 12 |: 1.58 23 2 |: 1.00 21 4 |: 0.63 17 4 |: 0.40 13 2 |: 0.25 11 3 |: 0.16 8 1 |: 0.10 7 0 | 0.063 7 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|7488979|pir||T07612cellulase (EC 3.2.1.4) Cel3, me... -1 235 6.7e-18 1 gi|5689613|emb|CAB51903.1|(AJ242807) cellulase [Brass... -1 225 8.2e-17 1 gi|2129559|pir||S71215cellulase homolog OR16pep precu... -1 217 6.0e-16 1 gi|7484858|pir||T09889cellulase homolog T22A6.90 - Ar... -1 200 4.0e-14 1 gi|7527353|dbj|BAA94257.1|(AB040769) endo-1,4-beta-gl... -1 199 5.2e-14 1 gi|2190558|gb|AAB60922.1|(AC001229) F5I14.14 [Arabido... -1 169 8.8e-11 1 gi|7023440|dbj|BAA91964.1|(AK001891) unnamed protein ... -1 52 0.048 3 gi|102426|pir||C41132collagen-related protein 3 precu... -1 57 0.14 2 gi|2897589|emb|CAA71195.1|(Y10108) cytochrome b558/56... +3 50 0.17 3 gi|3445206|gb|AAC32436.1|(AC004786) unknown protein [... -2 58 0.21 2 gi|6671965|gb|AAF23224.1|AC013454_11(AC013454) unknow... -1 48 0.22 2 gi|7485344|pir||T04826hypothetical protein F10M23.370... -2 58 0.24 2 gi|575478|gb|AAA67703.1|(L37443) protease [human herp... -1 55 0.28 2 gi|1869848|emb|CAB06750.1|(Z86099) protease [human he... -1 55 0.34 2 gi|1224097|gb|AAA92139.1|(U49329) UL26 protease [huma... -1 55 0.34 2 gi|7485358|pir||T05400hypothetical protein F10M6.80 -... -1 80 0.36 1 gi|1076291|pir||S51171amino acid transport protein AA... -1 55 0.44 2 gi|7020284|dbj|BAA91064.1|(AK000296) unnamed protein ... -1 52 0.53 2 gi|425682|gb|AAB28462.1|extensin=nodule-specific prol... -1 65 0.54 1 gi|2078307|gb|AAB54090.1|(U67264) AcMNPV ORF8/ORF1629... -1 50 0.57 2 gi|1020287|gb|AAA79496.1|(U31793) putative [Human pap... -1 64 0.63 1 gi|5668787|gb|AAD46013.1|AC007894_11(AC007894) F21H2.... -1 51 0.64 2 gi|2134384|pir||I50620procKr2 - chicken (fragment) >g... -3 64 0.77 2 gi|1491688|emb|CAA63877.1|(X94164) putative [Human pa... -1 62 0.80 1 gi|6978799ref|NP_036683.1| early growth response 1 >g... +3 52 0.81 2 gi|113926|sp|P05140|ANP_HEMAMANTIFREEZE PROTEIN PRECU... -1 65 0.82 1 gi|7508781|pir||T26064hypothetical protein W01G7.1 - ... -1 73 0.83 1 gi|198605|gb|AAA39382.1|(M19643) Krox-24 protein [Mus... +3 51 0.85 2 gi|7484731|pir||T09873probable cellulase (EC 3.2.1.4)... -1 69 0.87 1 gi|3514051|gb|AAC34103.1|(AF082809) glycoprotein gJ [... -1 61 0.87 1 gi|7022861|dbj|BAA91748.1|(AK001542) unnamed protein ... -1 60 0.89 2 gi|90459|pir||JS0304developmental control protein Kro... +3 51 0.90 2 gi|6681285ref|NP_031939.1| early growth response 1 >g... +3 51 0.90 2 gi|213874|gb|AAA49617.1|(J02593) antifreeze polypepti... -1 65 0.92 1 gi|5453736ref|NP_005351.2| v-maf musculoaponeurotic f... +3 51 0.92 2 gi|7940276|gb|AAF70835.1|AC003113_2(AC003113) F24O1.6... -1 60 0.93 1 gi|6677613ref|NP_033581.1| zinc finger protein 40 >gi... -3 60 0.94 2 gi|102425|pir||B41132collagen-related protein 2 - Hyd... -1 62 0.94 1 gi|1020247|gb|AAA79461.1|(U31788) putative [Human pap... -1 62 0.94 1 gi|1020271|gb|AAA79482.1|(U31791) putative [Human pap... -1 62 0.94 1 gi|7523490|dbj|BAA94218.1|(AP001633) hypothetical pro... -1 53 0.95 2 gi|2760483|emb|CAA60014.1|(X86019) SH3-domain interac... +3 52 0.96 2 gi|4507911ref|NP_003378.1| Wiskott-Aldrich syndrome p... +3 52 0.96 2 gi|1586823|prf||2204390Asynaptojanin [Rattus norvegicus] -1 63 0.96 2 gi|7514092|pir||S68448synaptojanin, 170K - rat -1 63 0.96 2 gi|284517|pir||D42825Kruppel-type zinc finger protein... -2 59 0.97 1 gi|1363496|pir||S57663cellulase (EC 3.2.1.4) 3D precu... -1 69 0.97 1 gi|1655543|emb|CAA65826.1|(X97188) cellulase [Capsicu... -1 69 0.97 1 gi|116223|sp|P17110|CH36_CERCACHORION PROTEIN S36 >gi... +3 49 0.98 2 gi|5901928ref|NP_008938.1| pre-mRNA cleavage factor I... -1 56 0.98 2 Locally-aligned regions (HSPs) with respect to query sequence: Locus_ID Frame 3 Hits gi|2897589 | _______________________ gi|6978799 | ____________ ________________ gi|198605 | ____________ ________________ gi|90459 | ____________ ________________ gi|6681285 | ____________ ________________ gi|5453736 | ______________ ___________________ gi|2760483 | ____________ __________________ gi|4507911 | ____________ __________________ gi|116223 | _____________ ____________________ __________________________________________________ Query sequence: | | | | | 70 0 20 40 60 Locus_ID Frame 2 Hits gi|2897589 | ________ _____________ __________________________________________________ Query sequence: | | | | | 70 0 20 40 60 Locus_ID Frame -1 Hits gi|7488979 | _________________________________________ gi|5689613 | _________________________________________ gi|2129559 | _______________________________________ gi|7484858 | _________________________________________ gi|7527353 | _________________________________________ gi|2190558 | _______________________________________ gi|7023440 | ____________ gi|102426 | ___________________________________ gi|3445206 | _________ gi|6671965 | ________________________________ gi|7485344 | ____________________ gi|575478 | ________________________ gi|1869848 | ________________________ gi|1224097 | ________________________ gi|7485358 | ___________________ gi|1076291 |_______________ ______________________ gi|7020284 | ____________ gi|425682 | ____________ gi|2078307 | _____________________________________ gi|1020287 | ________________ gi|5668787 | _________ ______________ gi|2134384 | _________ gi|1491688 | ________________ gi|113926 | ____________________________________ gi|7508781 | ______________________ gi|7484731 | _____________________________________ gi|3514051 | __________________________ gi|7022861 | ______________________ gi|213874 | ____________________________________ gi|7940276 | ____________ gi|102425 | _________________ gi|1020247 | __________________ gi|1020271 | __________________ gi|7523490 | __________________________________ gi|1586823 | _______________________________________ gi|7514092 | _______________________________________ gi|1363496 | ____________________________ gi|1655543 | ____________________________ gi|5901928 | ______________________________ __________________________________________________ Query sequence: | | | | | 70 0 20 40 60 Locus_ID Frame -2 Hits gi|7023440 | _______ _________ gi|3445206 | ______________________ gi|7485344 | ______________________ gi|7020284 | _______ gi|7022861 | _______ gi|6677613 | _________ gi|284517 | ______________________ __________________________________________________ Query sequence: | | | | | 70 0 20 40 60 Locus_ID Frame -3 Hits gi|2134384 | __________________________________ gi|6677613 | _______________________________ __________________________________________________ Query sequence: | | | | | 70 0 20 40 60
Use the and icons to retrieve links to Entrez:
WARNING: Descriptions of 48 database sequences were not reported due to the limiting value of parameter V = 50. >gi|7488979|pir||T07612 cellulase (EC 3.2.1.4) Cel3, membrane-anchored - tomato >gi|2065531|gb|AAC49704.1| (U78526) endo-1,4-beta-glucanase [Lycopersicon esculentum] Length = 617 Frame -1 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | | | 617 0 150 300 450 600 Minus Strand HSPs: Score = 235 (82.7 bits), Expect = 6.7e-18, P = 6.7e-18 Identities = 42/56 (75%), Positives = 48/56 (85%), Frame = -1 Query: 208 TEYSHTEPTLAGNAGLVDAVVALSGDKGTSIDKNTILCAVPPMFHTPPPPPAPWKP 41 + Y++TEPTLAGNAGLV A+VALSGD+ IDKNT+ AVPPMF TPPPPPAPWKP Sbjct: 562 SNYNYTEPTLAGNAGLVAALVALSGDRDVGIDKNTLFSAVPPMFPTPPPPPAPWKP 617 >gi|5689613|emb|CAB51903.1| (AJ242807) cellulase [Brassica napus] Length = 621 Frame -1 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | | | 621 0 150 300 450 600 Minus Strand HSPs: Score = 225 (79.2 bits), Expect = 8.2e-17, P = 8.2e-17 Identities = 43/58 (74%), Positives = 49/58 (84%), Frame = -1 Query: 208 TEYSHTEPTLAGNAGLVDAVVALSGDKGTS--IDKNTILCAVPPMFHTPPPPPAPWKP 41 T Y++TEPTLAGNAGLV A+VALSG++ S IDKNTI AVPP+F TPPPPPAPWKP Sbjct: 564 TNYNYTEPTLAGNAGLVAALVALSGEEEASGTIDKNTIFSAVPPLFPTPPPPPAPWKP 621 >gi|2129559|pir||S71215 cellulase homolog OR16pep precursor - Arabidopsis thaliana >gi|1022807|gb|AAB60304.1| (U37702) cellulase [Arabidopsis thaliana] >gi|3493633|gb|AAC33467.1| (AF074092) cellulase [Arabidopsis thaliana] >gi|3598956|gb|AAC35344.1| (AF074375) cellulase [Arabidopsis thaliana] >gi|3978258|gb|AAC83240.1| (AF073875) endo-1,4-beta-D-glucanase KORRIGAN [Arabidopsis thaliana] Length = 621 Frame -1 hits (HSPs): _____ Annotated Domains: __ ___ __________________________________________________ Database sequence: | | | | | | 621 0 150 300 450 600 __________________ Annotated Domains: PROSITE GLYCOSYL_HYDROL_F9_1: Glycosyl hydrolase 499..515 PROSITE GLYCOSYL_HYDROL_F9_2: Glycosyl hydrolase 559..577 __________________ Minus Strand HSPs: Score = 217 (76.4 bits), Expect = 6.0e-16, P = 6.0e-16 Identities = 41/56 (73%), Positives = 48/56 (85%), Frame = -1 Query: 202 YSHTEPTLAGNAGLVDAVVALSGDKGTS--IDKNTILCAVPPMFHTPPPPPAPWKP 41 Y++TEPTLAGNAGLV A+VALSG++ + IDKNTI AVPP+F TPPPPPAPWKP Sbjct: 566 YNYTEPTLAGNAGLVAALVALSGEEEATGKIDKNTIFSAVPPLFPTPPPPPAPWKP 621 >gi|7484858|pir||T09889 cellulase homolog T22A6.90 - Arabidopsis thaliana >gi|5051768|emb|CAB45061.1| (AL078637) endo-1, 4-beta-glucanase like protein [Arabidopsis thaliana] >gi|7269276|emb|CAB79336.1| (AL161561) endo-1, 4-beta-glucanase like protein [Arabidopsis thaliana] Length = 620 Frame -1 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | | | 620 0 150 300 450 600 Minus Strand HSPs: Score = 200 (70.4 bits), Expect = 4.0e-14, P = 4.0e-14 Identities = 38/57 (66%), Positives = 44/57 (77%), Frame = -1 Query: 208 TEYSHTEPTLAGNAGLVDAVVALSGDKGTS-IDKNTILCAVPPMFHTPPPPPAPWKP 41 T Y++TEPTLAGNAGLV A+VALSG+K IDKNT+ AVPP+ PPPPAPW P Sbjct: 564 TNYNYTEPTLAGNAGLVAALVALSGEKAVGGIDKNTMFSAVPPLVMATPPPPAPWTP 620 >gi|7527353|dbj|BAA94257.1| (AB040769) endo-1,4-beta-glucanase Cel1 [Hordeum vulgare] Length = 621 Frame -1 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | | | 621 0 150 300 450 600 Minus Strand HSPs: Score = 199 (70.1 bits), Expect = 5.2e-14, P = 5.2e-14 Identities = 39/58 (67%), Positives = 46/58 (79%), Frame = -1 Query: 208 TEYSHTEPTLAGNAGLVDAVVALSG-DKGT-SIDKNTILCAVPPMFHTPPPPPAPWKP 41 T Y++TEPTLA NAGLV A+++L+ D G SIDKNTI AVPPMF TPPPPP+ WKP Sbjct: 564 TNYNYTEPTLAANAGLVAALISLADIDTGRYSIDKNTIFSAVPPMFPTPPPPPSAWKP 621 >gi|2190558|gb|AAB60922.1| (AC001229) F5I14.14 [Arabidopsis thaliana] Length = 623 Frame -1 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | | | 623 0 150 300 450 600 Minus Strand HSPs: Score = 169 (59.5 bits), Expect = 8.8e-11, P = 8.8e-11 Identities = 30/54 (55%), Positives = 40/54 (74%), Frame = -1 Query: 202 YSHTEPTLAGNAGLVDAVVALSGDKGTSIDKNTILCAVPPMFHTPPPPPAPWKP 41 Y+ +EPTL+GNAGLV A+V+L+ G IDKNT+ +VPP++ PPPP WKP Sbjct: 570 YNASEPTLSGNAGLVAALVSLTSSGGQQIDKNTMFNSVPPLYSPTPPPPKAWKP 623 >gi|7023440|dbj|BAA91964.1| (AK001891) unnamed protein product [Homo sapiens] Length = 360 Frame -1 hits (HSPs): ___ Frame -2 hits (HSPs): __ __ __________________________________________________ Database sequence: | | | | 360 0 150 300 Minus Strand HSPs: Score = 52 (18.3 bits), Expect = 0.050, Sum P(3) = 0.048 Identities = 9/15 (60%), Positives = 9/15 (60%), Frame = -1 Query: 85 PMFHTPPPPPAPWKP 41 P H PPPPP P P Sbjct: 180 PASHFPPPPPPPPLP 194 Score = 34 (12.0 bits), Expect = 0.050, Sum P(3) = 0.048 Identities = 6/9 (66%), Positives = 7/9 (77%), Frame = -2 Query: 117 LTKIPFCVL 91 LT +P CVL Sbjct: 155 LTNVPACVL 163 Score = 31 (10.9 bits), Expect = 0.050, Sum P(3) = 0.048 Identities = 6/12 (50%), Positives = 9/12 (75%), Frame = -2 Query: 207 QNTVTQSQHLLE 172 QN +TQS +L+ Sbjct: 88 QNCITQSLEVLK 99 >gi|102426|pir||C41132 collagen-related protein 3 precursor - Hydra magnipapillata >gi|102430|pir||S21931 mini-collagen - Hydra sp >gi|9451|emb|CAA43381.1| (X61047) mini-collagen [Hydra sp.] Length = 186 Frame -1 hits (HSPs): ______ ________ _______ Annotated Domains: ______________________________ __________________________________________________ Database sequence: | | | | | 186 0 50 100 150 __________________ Annotated Domains: DOMO DM00042: FIBRILLARCOLLAGENCARBOXYL-TERMI 55..161 __________________ Minus Strand HSPs: Score = 57 (20.1 bits), Expect = 0.15, Sum P(2) = 0.14 Identities = 12/26 (46%), Positives = 13/26 (50%), Frame = -1 Query: 97 CAVPPMFHTPPPPPAPWKP*AHRAIG 20 C PP PPPPP P P A +G Sbjct: 72 CMPPPPPPPPPPPPYPGPPGAPGPMG 97 Score = 53 (18.7 bits), Expect = 0.41, Sum P(2) = 0.34 Identities = 9/16 (56%), Positives = 9/16 (56%), Frame = -1 Query: 88 PPMFHTPPPPPAPWKP 41 PP PPPPP P P Sbjct: 70 PPCMPPPPPPPPPPPP 85 Score = 52 (18.3 bits), Expect = 0.53, Sum P(2) = 0.41 Identities = 9/20 (45%), Positives = 10/20 (50%), Frame = -1 Query: 100 LCAVPPMFHTPPPPPAPWKP 41 +C P PPPPP P P Sbjct: 153 ICPAPAPACVPPPPPPPPPP 172 Score = 51 (18.0 bits), Expect = 0.68, Sum P(2) = 0.49 Identities = 9/15 (60%), Positives = 9/15 (60%), Frame = -1 Query: 97 CAVPPMFHTPPPPPA 53 C PP PPPPPA Sbjct: 161 CVPPPPPPPPPPPPA 175 Score = 34 (12.0 bits), Expect = 0.15, Sum P(2) = 0.14 Identities = 8/20 (40%), Positives = 12/20 (60%), Frame = -1 Query: 163 LVDAVVALSGDKGTSIDKNT 104 LV AVVA + K ++K + Sbjct: 8 LVFAVVAYASAKSVDLEKRS 27 >gi|2897589|emb|CAA71195.1| (Y10108) cytochrome b558/566, subunit A [Sulfolobus acidocaldarius] Length = 462 Frame 3 hits (HSPs): _____ Frame 2 hits (HSPs): __ __ __________________________________________________ Database sequence: | | | | | 462 0 150 300 450 Plus Strand HSPs: Score = 50 (17.6 bits), Expect = 0.18, Sum P(3) = 0.17 Identities = 12/34 (35%), Positives = 20/34 (58%), Frame = +3 Query: 108 FLSIDVPLSPDNATTASTRPA--FPASVGSV*LY 203 FLS+++ +P +TT ST P +++ V LY Sbjct: 407 FLSLELVTTPPTSTTTSTSPVTTISSAIPPVTLY 440 Score = 33 (11.6 bits), Expect = 0.18, Sum P(3) = 0.17 Identities = 5/9 (55%), Positives = 8/9 (88%), Frame = +2 Query: 17 ITNGTMSLW 43 ++NGT+ LW Sbjct: 181 MSNGTILLW 189 Score = 32 (11.3 bits), Expect = 0.18, Sum P(3) = 0.17 Identities = 7/18 (38%), Positives = 8/18 (44%), Frame = +2 Query: 62 W-WRCMKHWGNSTQNGIF 112 W W W NST + F Sbjct: 269 WMWVSGATWNNSTYDPAF 286 >gi|3445206|gb|AAC32436.1| (AC004786) unknown protein [Arabidopsis thaliana] Length = 279 Frame -1 hits (HSPs): __ Frame -2 hits (HSPs): ______ __________________________________________________ Database sequence: | | | | | | | 279 0 50 100 150 200 250 Minus Strand HSPs: Score = 58 (20.4 bits), Expect = 0.24, Sum P(2) = 0.21 Identities = 14/31 (45%), Positives = 16/31 (51%), Frame = -2 Query: 102 FCVLFPQCFIHRHHLQRHGNHKLIVPLVISL 10 FCVLF Q F H HG + PLV+ L Sbjct: 188 FCVLFCQLF----HAWAHGTKSKLPPLVVGL 214 Score = 36 (12.7 bits), Expect = 0.24, Sum P(2) = 0.21 Identities = 7/11 (63%), Positives = 9/11 (81%), Frame = -1 Query: 190 EPTLAGNAGLV 158 EP+LAG AG + Sbjct: 96 EPSLAGFAGYI 106 >gi|6671965|gb|AAF23224.1|AC013454_11 (AC013454) unknown protein [Arabidopsis thaliana] >gi|6714405|gb|AAF26094.1|AC012393_20 (AC012393) unknown protein [Arabidopsis thaliana] Length = 126 Frame -1 hits (HSPs): ______________ _______ __________________________________________________ Database sequence: | | | | 126 0 50 100 Minus Strand HSPs: Score = 48 (16.9 bits), Expect = 0.25, Sum P(2) = 0.22 Identities = 9/16 (56%), Positives = 9/16 (56%), Frame = -1 Query: 88 PPMFHTPPPPPAPWKP 41 PP P PPPAP P Sbjct: 82 PPEPEKPKPPPAPEPP 97 Score = 45 (15.8 bits), Expect = 0.25, Sum P(2) = 0.22 Identities = 11/32 (34%), Positives = 19/32 (59%), Frame = -1 Query: 172 NAGLVDAVVALSGDKGTSIDK-NTILCAVPPM 80 N+ +++AV + G S+D+ N+IL V M Sbjct: 18 NSAIMEAVTEIEGVNHISLDEGNSILTVVGTM 49 >gi|7485344|pir||T04826 hypothetical protein F10M23.370 - Arabidopsis thaliana >gi|4455226|emb|CAB36549.1| (AL035440) putative protein [Arabidopsis thaliana] >gi|7269556|emb|CAB79558.1| (AL161566) putative protein [Arabidopsis thaliana] Length = 323 Frame -1 hits (HSPs): _____ Frame -2 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | | | | | 323 0 50 100 150 200 250 300 Minus Strand HSPs: Score = 58 (20.4 bits), Expect = 0.27, Sum P(2) = 0.24 Identities = 13/31 (41%), Positives = 17/31 (54%), Frame = -2 Query: 102 FCVLFPQCFIHRHHLQRHGNHKLIVPLVISL 10 FC+LF Q F H HG + PLV++L Sbjct: 220 FCILFSQQF----HAWAHGTKSKLPPLVVAL 246 Score = 37 (13.0 bits), Expect = 0.27, Sum P(2) = 0.24 Identities = 12/26 (46%), Positives = 15/26 (57%), Frame = -1 Query: 190 EPTLAGNAGLVDAVVALSGDKGTSID 113 EP LAG AG + A + SG +ID Sbjct: 128 EPALAGYAGYILADLG-SGVYHWAID 152 >gi|575478|gb|AAA67703.1| (L37443) protease [human herpesvirus 2] >gi|1095743|prf||2109370A protease [Human herpesvirus 2] Length = 636 Frame -1 hits (HSPs): __ __ __________________________________________________ Database sequence: | | | | | | 636 0 150 300 450 600 Minus Strand HSPs: Score = 55 (19.4 bits), Expect = 0.33, Sum P(2) = 0.28 Identities = 9/16 (56%), Positives = 9/16 (56%), Frame = -1 Query: 88 PPMFHTPPPPPAPWKP 41 P H PPPPP P P Sbjct: 575 PAHAHPPPPPPGPTPP 590 Score = 46 (16.2 bits), Expect = 0.33, Sum P(2) = 0.28 Identities = 9/20 (45%), Positives = 14/20 (70%), Frame = -1 Query: 139 SGDKGT-SIDKNTILCAVPP 83 SGD G ++D +T+ A+PP Sbjct: 32 SGDSGELALDPDTVRAALPP 51 >gi|1869848|emb|CAB06750.1| (Z86099) protease [human herpesvirus 2] Length = 637 Frame -1 hits (HSPs): __ __ __________________________________________________ Database sequence: | | | | | | 637 0 150 300 450 600 Minus Strand HSPs: Score = 55 (19.4 bits), Expect = 0.42, Sum P(2) = 0.34 Identities = 9/16 (56%), Positives = 9/16 (56%), Frame = -1 Query: 88 PPMFHTPPPPPAPWKP 41 P H PPPPP P P Sbjct: 576 PAHAHPPPPPPGPTPP 591 Score = 45 (15.8 bits), Expect = 0.42, Sum P(2) = 0.34 Identities = 9/20 (45%), Positives = 14/20 (70%), Frame = -1 Query: 139 SGDKGT-SIDKNTILCAVPP 83 SGD G ++D +T+ A+PP Sbjct: 32 SGDPGELALDPDTVRAALPP 51 >gi|1224097|gb|AAA92139.1| (U49329) UL26 protease [human herpesvirus 2] Length = 638 Frame -1 hits (HSPs): __ __ __________________________________________________ Database sequence: | | | | | | 638 0 150 300 450 600 Minus Strand HSPs: Score = 55 (19.4 bits), Expect = 0.42, Sum P(2) = 0.34 Identities = 9/16 (56%), Positives = 9/16 (56%), Frame = -1 Query: 88 PPMFHTPPPPPAPWKP 41 P H PPPPP P P Sbjct: 577 PAHAHPPPPPPGPTPP 592 Score = 45 (15.8 bits), Expect = 0.42, Sum P(2) = 0.34 Identities = 9/20 (45%), Positives = 14/20 (70%), Frame = -1 Query: 139 SGDKGT-SIDKNTILCAVPP 83 SGD G ++D +T+ A+PP Sbjct: 32 SGDPGELALDPDTVRAALPP 51 >gi|7485358|pir||T05400 hypothetical protein F10M6.80 - Arabidopsis thaliana >gi|2864615|emb|CAA16962.1| (AL021811) putative protein [Arabidopsis thaliana] >gi|7270132|emb|CAB79946.1| (AL161580) putative protein [Arabidopsis thaliana] Length = 842 Frame -1 hits (HSPs): ___ __________________________________________________ Database sequence: | | | | | | | 842 0 150 300 450 600 750 Minus Strand HSPs: Score = 80 (28.2 bits), Expect = 0.44, P = 0.36 Identities = 14/23 (60%), Positives = 15/23 (65%), Frame = -1 Query: 118 IDKNTILCAVPPMFHTPPPPPAP 50 +D N I PP HTPPPPPAP Sbjct: 589 VDMNEIKALPPPENHTPPPPPAP 611 Score = 80 (28.2 bits), Expect = 0.44, P = 0.36 Identities = 16/27 (59%), Positives = 17/27 (62%), Frame = -1 Query: 118 IDKNTILCAVPPMFHTPPPPPAPW-KP 41 +D N I PP HTPPPPPAP KP Sbjct: 589 VDMNEIKALPPPENHTPPPPPAPEPKP 615 Score = 71 (25.0 bits), Expect = 4.0, P = 0.98 Identities = 15/30 (50%), Positives = 17/30 (56%), Frame = -1 Query: 118 IDKNTILCAVPPMFHTPPPPPAP----WKP 41 +D N I PP HTPPPPPAP +P Sbjct: 589 VDMNEIKALPPPENHTPPPPPAPEPKPQQP 618 >gi|1076291|pir||S51171 amino acid transport protein AAT1 - Arabidopsis thaliana >gi|2911069|emb|CAA17531.1| (AL021960) amino acid transport protein AAT1 [Arabidopsis thaliana] >gi|7268909|emb|CAB79112.1| (AL161554) amino acid transport protein AAT1 [Arabidopsis thaliana] Length = 533 Frame -1 hits (HSPs): ____ ___ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | | 533 0 150 300 450 __________________ Annotated Domains: DOMO DM01125: 4..324 DOMO DM05033: 326..529 __________________ Minus Strand HSPs: Score = 55 (19.4 bits), Expect = 0.57, Sum P(2) = 0.44 Identities = 11/29 (37%), Positives = 18/29 (62%), Frame = -1 Query: 205 EYSHTEPTLAGNAGLVDAVVALSGDKGTS 119 +YSH +P G ++ V+A+ G KG+S Sbjct: 138 DYSHLDPIAVGVCAII-CVLAVVGTKGSS 165 Score = 42 (14.8 bits), Expect = 0.57, Sum P(2) = 0.44 Identities = 9/20 (45%), Positives = 10/20 (50%), Frame = -1 Query: 61 PPAPWKP*AHRAIGYFFIAS 2 P PW P A AI F + S Sbjct: 455 PLVPWLPSASIAINIFLLGS 474 >gi|7020284|dbj|BAA91064.1| (AK000296) unnamed protein product [Homo sapiens] Length = 210 Frame -1 hits (HSPs): _____ Frame -2 hits (HSPs): ___ __________________________________________________ Database sequence: | | | | | | 210 0 50 100 150 200 Minus Strand HSPs: Score = 52 (18.3 bits), Expect = 0.76, Sum P(2) = 0.53 Identities = 9/15 (60%), Positives = 9/15 (60%), Frame = -1 Query: 85 PMFHTPPPPPAPWKP 41 P H PPPPP P P Sbjct: 30 PASHFPPPPPPPPLP 44 Score = 34 (12.0 bits), Expect = 0.76, Sum P(2) = 0.53 Identities = 6/9 (66%), Positives = 7/9 (77%), Frame = -2 Query: 117 LTKIPFCVL 91 LT +P CVL Sbjct: 5 LTNVPACVL 13 >gi|425682|gb|AAB28462.1| extensin=nodule-specific proline-rich protein {clone VfNDS-E} [Vicia faba=broadbeans, root nodules, Peptide Partial, 116 aa] Length = 116 Frame -1 hits (HSPs): _______ _______ __________________________________________________ Database sequence: | | | | 116 0 50 100 Minus Strand HSPs: Score = 65 (22.9 bits), Expect = 0.78, P = 0.54 Identities = 9/14 (64%), Positives = 12/14 (85%), Frame = -1 Query: 85 PMFHTPPPPPAPWK 44 P++H+PPPPP P K Sbjct: 52 PVYHSPPPPPTPHK 65 Score = 64 (22.5 bits), Expect = 0.99, P = 0.63 Identities = 9/14 (64%), Positives = 11/14 (78%), Frame = -1 Query: 85 PMFHTPPPPPAPWK 44 P +H+PPPPP P K Sbjct: 18 PFYHSPPPPPTPHK 31 Score = 62 (21.8 bits), Expect = 1.6, P = 0.80 Identities = 10/16 (62%), Positives = 12/16 (75%), Frame = -1 Query: 85 PMFHTPPPPPAPWK-P 41 P +H+PPPPP P K P Sbjct: 18 PFYHSPPPPPTPHKKP 33 >gi|2078307|gb|AAB54090.1| (U67264) AcMNPV ORF8/ORF1629 homolog [Helicoverpa zea nuclear polyhedrosis virus] Length = 412 Frame -1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | 412 0 150 300 Minus Strand HSPs: Score = 50 (17.6 bits), Expect = 0.84, Sum P(2) = 0.57 Identities = 11/47 (23%), Positives = 22/47 (46%), Frame = -1 Query: 190 EPTLAGNAGLVDAVVA--LSGDKGTSIDKNTILCAVPPMFHTPPPPP 56 +PT + ++ ++ + D +S + + +PP PPPPP Sbjct: 102 KPTTVSTSDIMQKTISPTIIIDSESSSPQTLMPPPIPPPIPPPPPPP 148 Score = 43 (15.1 bits), Expect = 0.84, Sum P(2) = 0.57 Identities = 7/10 (70%), Positives = 7/10 (70%), Frame = -1 Query: 70 PPPPPAPWKP 41 PPPPP P P Sbjct: 157 PPPPPPPPPP 166 >gi|1020287|gb|AAA79496.1| (U31793) putative [Human papillomavirus type 61] Length = 105 Frame -1 hits (HSPs): __________ __________________________________________________ Database sequence: | | | | | | | 105 0 20 40 60 80 100 Minus Strand HSPs: Score = 64 (22.5 bits), Expect = 0.99, P = 0.63 Identities = 11/22 (50%), Positives = 11/22 (50%), Frame = -1 Query: 97 CAVPPMFHTPPPPPAPWKP*AH 32 C P H PPPPP W P H Sbjct: 27 CGTTP--HRPPPPPRAWAPPRH 46 >gi|5668787|gb|AAD46013.1|AC007894_11 (AC007894) F21H2.12 [Arabidopsis thaliana] Length = 451 Frame -1 hits (HSPs): ___ __ __________________________________________________ Database sequence: | | | || 451 0 150 300 450 Minus Strand HSPs: Score = 51 (18.0 bits), Expect = 1.0, Sum P(2) = 0.64 Identities = 8/11 (72%), Positives = 9/11 (81%), Frame = -1 Query: 82 MFHTPPPPPAP 50 +F PPPPPAP Sbjct: 436 VFGVPPPPPAP 446 Score = 42 (14.8 bits), Expect = 1.0, Sum P(2) = 0.64 Identities = 11/20 (55%), Positives = 13/20 (65%), Frame = -1 Query: 184 TLAGNAGLVD-AVVALSGDK 128 T AG VD A+ AL+GDK Sbjct: 123 TFAGTFVSVDEAIAALAGDK 142 >gi|2134384|pir||I50620 procKr2 - chicken (fragment) >gi|577019|emb|CAA40140.1| (X56805) procKr2 [Gallus gallus] Length = 1173 Frame -1 hits (HSPs): _ _ Frame -3 hits (HSPs): ___ Annotated Domains: ________ ____________ __________________________________________________ Database sequence: | | | | | | | | | 1173 0 150 300 450 600 750 900 1050 __________________ Annotated Domains: PROSITE ATP_GTP_A: ATP/GTP-binding site motif A 453..460 PROSITE CYTOCHROME_C: Cytochrome c family heme-b 269..274 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 125..145 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 13..33 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 153..173 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 241..261 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 269..289 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 314..334 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 342..362 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 370..390 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 398..418 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 41..61 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 426..446 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 454..474 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 482..502 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 69..89 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 97..117 __________________ Minus Strand HSPs: Score = 64 (22.5 bits), Expect = 1.5, Sum P(2) = 0.77 Identities = 13/45 (28%), Positives = 25/45 (55%), Frame = -3 Query: 191 RANTCWKCRSS*RSSGIVRRQRHIY*QKYHFVCCSPNVSYTATTS 57 R C C + ++S +RR RHI+ + +VC + ++T +T+ Sbjct: 65 RPYKCTSCPKAFKNSSSLRRHRHIHTGERPYVCSACGKAFTQSTN 109 Score = 36 (12.7 bits), Expect = 1.5, Sum P(2) = 0.77 Identities = 7/11 (63%), Positives = 7/11 (63%), Frame = -1 Query: 73 TPPPPPAPWKP 41 TP PPPA P Sbjct: 293 TPHPPPATPTP 303 Score = 31 (10.9 bits), Expect = 4.6, Sum P(2) = 0.99 Identities = 6/10 (60%), Positives = 7/10 (70%), Frame = -1 Query: 73 TPPPPPAPWK 44 +P P PAP K Sbjct: 1094 SPVPTPAPPK 1103 >gi|1491688|emb|CAA63877.1| (X94164) putative [Human papillomavirus type 72] Length = 106 Frame -1 hits (HSPs): __________ __________________________________________________ Database sequence: | | | | | | | 106 0 20 40 60 80 100 Minus Strand HSPs: Score = 62 (21.8 bits), Expect = 1.6, P = 0.80 Identities = 11/22 (50%), Positives = 11/22 (50%), Frame = -1 Query: 97 CAVPPMFHTPPPPPAPWKP*AH 32 C PP PPPPP W P H Sbjct: 26 CQTPP---NPPPPPRAWAPPRH 44 >gi|6978799 ref|NP_036683.1| early growth response 1 >gi|119244|sp|P08154|EGR1_RAT EARLY GROWTH RESPONSE PROTEIN 1 (EGR-1) (NERVE GROWTH FACTOR-INDUCED PROTEIN A) (NGFI-A) >gi|112027|pir||A32225 nerve growth factor-induced A protein - rat >gi|205694|gb|AAA61927.1| (M18416) nerve growth factor-induced protein [Rattus norvegicus] >gi|623563|gb|AAA60740.1| (J04154) nerve growth factor [Rattus norvegicus] Length = 508 Frame 3 hits (HSPs): __ ___ __________________________________________________ Database sequence: | | | | | 508 0 150 300 450 Plus Strand HSPs: Score = 52 (18.3 bits), Expect = 1.7, Sum P(2) = 0.81 Identities = 13/21 (61%), Positives = 14/21 (66%), Frame = +3 Query: 132 SPDNATT-ASTRPAFPASVGS 191 SP ATT AS PAFPA V + Sbjct: 457 SPSVATTYASVPPAFPAQVST 477 Score = 40 (14.1 bits), Expect = 1.7, Sum P(2) = 0.81 Identities = 9/15 (60%), Positives = 10/15 (66%), Frame = +3 Query: 51 GAGGGGGV*NIGGTA 95 G GGGGG N G +A Sbjct: 44 GGGGGGGS-NSGSSA 57 >gi|113926|sp|P05140|ANP_HEMAM ANTIFREEZE PROTEIN PRECURSOR (AFP) >gi|1072445|pir||A34313 antifreeze protein II precursor - sea raven Length = 163 Frame -1 hits (HSPs): ________________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | | 163 0 50 100 150 __________________ Annotated Domains: DOMO DM00035: C-TYPELECTIN 34..160 Entrez Domain: C-TYPE LECTIN (LONG FORM). 39..163 PFAM lectin_c: Lectin C-type domain 67..159 PRINTS ANTIFREEZEII1: Antifreeze motif I - 2 40..52 PRINTS ANTIFREEZEII2: Antifreeze motif II - 2 52..69 PRINTS ANTIFREEZEII3: Antifreeze motif III - 2 70..87 PRINTS ANTIFREEZEII4: Antifreeze motif IV - 2 97..113 PRINTS ANTIFREEZEII5: Antifreeze motif V - 2 119..130 PRINTS ANTIFREEZEII6: Antifreeze motif VI - 2 131..142 PRINTS ANTIFREEZEII7: Antifreeze motif VII - 2 146..159 PRODOM PD060982: ANP_HEMAM 1..77 PRODOM PD000254: PA2R(10) MANR(8) LEM1(8) 79..137 PRODOM PD197100: ANP_HEMAM 139..162 __________________ Minus Strand HSPs: Score = 65 (22.9 bits), Expect = 1.7, P = 0.82 Identities = 19/49 (38%), Positives = 23/49 (46%), Frame = -1 Query: 163 LVDAVVALSGDKGTSIDKNTILCAVPPMFHTPPPPPAPWKP*AHRAIGY 17 LV A++AL+ I K T A P PP PA W+P R I Y Sbjct: 7 LVCAMMALTQANDDKILKGTATEAGPVSQRAPPNCPAGWQPLGDRCIYY 55 >gi|7508781|pir||T26064 hypothetical protein W01G7.1 - Caenorhabditis elegans >gi|3924900|emb|CAB03453.1| (Z81135) predicted using Genefinder~cDNA EST EMBL:D68390 comes from this gene~cDNA EST yk442e12.5 comes from this gene~cDNA EST yk466e9.5 comes from this gene [Caenorhabditis elegans] Length = 627 Frame -1 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | | | 627 0 150 300 450 600 Minus Strand HSPs: Score = 73 (25.7 bits), Expect = 1.8, P = 0.83 Identities = 15/30 (50%), Positives = 17/30 (56%), Frame = -1 Query: 130 KGTSIDKNTILCAVPPMFHTPPPPPAPWKP 41 K S D +LC PP+ PPPPP P KP Sbjct: 498 KNVSADAIRVLCNRPPLPPLPPPPPPP-KP 526 Score = 70 (24.6 bits), Expect = 3.7, P = 0.97 Identities = 15/31 (48%), Positives = 17/31 (54%), Frame = -1 Query: 130 KGTSIDKNTILCAVPPMFHTPPPPPAPW-KP 41 K S D +LC PP+ PPPPP P KP Sbjct: 498 KNVSADAIRVLCNRPPLPPLPPPPPPPKPKP 528 >gi|198605|gb|AAA39382.1| (M19643) Krox-24 protein [Mus musculus] Length = 484 Frame 3 hits (HSPs): ___ ___ __________________________________________________ Database sequence: | | | | | 484 0 150 300 450 Plus Strand HSPs: Score = 51 (18.0 bits), Expect = 1.9, Sum P(2) = 0.85 Identities = 13/21 (61%), Positives = 13/21 (61%), Frame = +3 Query: 132 SPDNATT-ASTRPAFPASVGS 191 SP ATT AS PAFP V S Sbjct: 433 SPSVATTFASVPPAFPTQVSS 453 Score = 40 (14.1 bits), Expect = 1.9, Sum P(2) = 0.85 Identities = 9/15 (60%), Positives = 10/15 (66%), Frame = +3 Query: 51 GAGGGGGV*NIGGTA 95 G GGGGG N G +A Sbjct: 20 GGGGGGGT-NSGTSA 33 Score = 33 (11.6 bits), Expect = 9.4, Sum P(2) = 1.0 Identities = 8/13 (61%), Positives = 9/13 (69%), Frame = +3 Query: 54 AGGGGGV*NIGGT 92 +GGGGG GGT Sbjct: 19 SGGGGG----GGT 27 >gi|7484731|pir||T09873 probable cellulase (EC 3.2.1.4) - upland cotton (fragment) >gi|2244740|dbj|BAA21111.1| (D88417) endo-1,4-beta-glucanase [Gossypium hirsutum] Length = 324 Frame -1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | | | | 324 0 50 100 150 200 250 300 Minus Strand HSPs: Score = 69 (24.3 bits), Expect = 2.0, P = 0.87 Identities = 17/52 (32%), Positives = 23/52 (44%), Frame = -1 Query: 202 YSHTEPTLAGNAGLVDAVVALSGDKGTSIDKNTILCAVP-PMFHTPPPPPAP 50 Y TEP NA L+ + L+G G ++ VP P+ P P P P Sbjct: 168 YEQTEPATYNNAPLLGILARLAGGHGGYNQLLPVVAPVPNPVIAKPKPAPKP 219 >gi|3514051|gb|AAC34103.1| (AF082809) glycoprotein gJ [Baboon herpesvirus 2] Length = 116 Frame -1 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | 116 0 50 100 Minus Strand HSPs: Score = 61 (21.5 bits), Expect = 2.1, P = 0.87 Identities = 13/35 (37%), Positives = 18/35 (51%), Frame = -1 Query: 154 AVVALSGDKGTSIDKNTILCAVPPMFHTPPPPPAP 50 + A SGD TSI+ + + A P H+ P P P Sbjct: 37 SATASSGDNATSINAGSTITAAAPPGHSTPWPSLP 71 >gi|7022861|dbj|BAA91748.1| (AK001542) unnamed protein product [Homo sapiens] Length = 659 Frame -1 hits (HSPs): ___ Frame -2 hits (HSPs): __ __________________________________________________ Database sequence: | | | | | | 659 0 150 300 450 600 Minus Strand HSPs: Score = 60 (21.1 bits), Expect = 2.2, Sum P(2) = 0.89 Identities = 11/30 (36%), Positives = 15/30 (50%), Frame = -1 Query: 130 KGTSIDKNTILCAVPPMFHTPPPPPAPWKP 41 KG + +L +P + PPPPP W P Sbjct: 337 KGKDVSYCPVLAPLPLLLPPPPPPPM-WNP 365 Score = 33 (11.6 bits), Expect = 2.2, Sum P(2) = 0.89 Identities = 5/10 (50%), Positives = 7/10 (70%), Frame = -2 Query: 48 GNHKLIVPLV 19 GNH + P+V Sbjct: 375 GNHGFVAPVV 384 >gi|90459|pir||JS0304 developmental control protein Krox-24 - mouse >gi|1325982|gb|AAB00468.1| (M28845) zinc finger protein [Mus musculus] Length = 533 Frame 3 hits (HSPs): __ __ Annotated Domains: _________ __________________________________________________ Database sequence: | | | | | 533 0 150 300 450 __________________ Annotated Domains: Entrez region: zinc fingers 331..416 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 338..360 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 368..388 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 396..416 __________________ Plus Strand HSPs: Score = 51 (18.0 bits), Expect = 2.3, Sum P(2) = 0.90 Identities = 13/21 (61%), Positives = 13/21 (61%), Frame = +3 Query: 132 SPDNATT-ASTRPAFPASVGS 191 SP ATT AS PAFP V S Sbjct: 482 SPSVATTFASVPPAFPTQVSS 502 Score = 40 (14.1 bits), Expect = 2.3, Sum P(2) = 0.90 Identities = 9/15 (60%), Positives = 10/15 (66%), Frame = +3 Query: 51 GAGGGGGV*NIGGTA 95 G GGGGG N G +A Sbjct: 69 GGGGGGGT-NSGTSA 82 >gi|6681285 ref|NP_031939.1| early growth response 1 >gi|119243|sp|P08046|EGR1_MOUSE EARLY GROWTH RESPONSE PROTEIN 1 (EGR-1) (KROX-24 PROTEIN) (ZIF268) >gi|201934|gb|AAA40416.1| (M22326) growth factor-induced protein [Mus musculus] >gi|309213|gb|AAA37544.1| (M20157) Egr-1 protein [Mus musculus] Length = 533 Frame 3 hits (HSPs): __ __ __________________________________________________ Database sequence: | | | | | 533 0 150 300 450 Plus Strand HSPs: Score = 51 (18.0 bits), Expect = 2.3, Sum P(2) = 0.90 Identities = 13/21 (61%), Positives = 13/21 (61%), Frame = +3 Query: 132 SPDNATT-ASTRPAFPASVGS 191 SP ATT AS PAFP V S Sbjct: 482 SPSVATTFASVPPAFPTQVSS 502 Score = 40 (14.1 bits), Expect = 2.3, Sum P(2) = 0.90 Identities = 9/15 (60%), Positives = 10/15 (66%), Frame = +3 Query: 51 GAGGGGGV*NIGGTA 95 G GGGGG N G +A Sbjct: 69 GGGGGGGS-NSGSSA 82 >gi|213874|gb|AAA49617.1| (J02593) antifreeze polypeptide (AFP) precursor [Hemitripterus americanus] Length = 195 Frame -1 hits (HSPs): ______________ __________________________________________________ Database sequence: | | | | | 195 0 50 100 150 Minus Strand HSPs: Score = 65 (22.9 bits), Expect = 2.5, P = 0.92 Identities = 19/49 (38%), Positives = 23/49 (46%), Frame = -1 Query: 163 LVDAVVALSGDKGTSIDKNTILCAVPPMFHTPPPPPAPWKP*AHRAIGY 17 LV A++AL+ I K T A P PP PA W+P R I Y Sbjct: 39 LVCAMMALTQANDDKILKGTATEAGPVSQRAPPNCPAGWQPLGDRCIYY 87 >gi|5453736 ref|NP_005351.2| v-maf musculoaponeurotic fibrosarcoma (avian) oncogene homolog >gi|3335150|gb|AAC27038.1| (AF055377) long form transcription factor C-MAF [Homo sapiens] Length = 403 Frame 3 hits (HSPs): ___ ____ __________________________________________________ Database sequence: | | | | 403 0 150 300 Plus Strand HSPs: Score = 51 (18.0 bits), Expect = 2.5, Sum P(2) = 0.92 Identities = 10/19 (52%), Positives = 10/19 (52%), Frame = +3 Query: 42 GFHGAGGGGGV*NIGGTAH 98 G G GGGGG GG H Sbjct: 229 GGGGGGGGGGAAGAGGALH 247 Score = 37 (13.0 bits), Expect = 2.5, Sum P(2) = 0.92 Identities = 12/25 (48%), Positives = 13/25 (52%), Frame = +3 Query: 114 SIDVPLSPDNATTASTRPAFPASVG 188 S D P SP+ T TR P SVG Sbjct: 362 SSDNPSSPEFFITEPTRKLEP-SVG 385 >gi|7940276|gb|AAF70835.1|AC003113_2 (AC003113) F24O1.6 [Arabidopsis thaliana] Length = 70 Frame -1 hits (HSPs): ____________ __________________________________________________ Database sequence: | | | | | 70 0 20 40 60 Minus Strand HSPs: Score = 60 (21.1 bits), Expect = 2.6, P = 0.93 Identities = 11/17 (64%), Positives = 12/17 (70%), Frame = -1 Query: 88 PPMFHTPPPP-PAPWKP 41 PP FH PPPP P P +P Sbjct: 10 PPPFHHPPPPRPPPPEP 26 >gi|6677613 ref|NP_033581.1| zinc finger protein 40 >gi|477332|pir||A48830 probable transcription regulator NT fin12 - mouse >gi|286105|dbj|BAA01482.1| (D10632) zinc finger protein [Mus musculus] Length = 728 Frame -2 hits (HSPs): _ Frame -3 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | | | 728 0 150 300 450 600 Minus Strand HSPs: Score = 60 (21.1 bits), Expect = 2.8, Sum P(2) = 0.94 Identities = 17/42 (40%), Positives = 23/42 (54%), Frame = -3 Query: 206 RIQSHRANTCWKCRSS*RSSGIVRRQRHIY*QKYHFVCCSPN 81 R + HR N C K SS SSG+ R QR I+ + ++C N Sbjct: 291 RRKLHRCNECGKSLSS--SSGLQRHQR-IHRGEKAYICAECN 329 Score = 33 (11.6 bits), Expect = 2.8, Sum P(2) = 0.94 Identities = 5/12 (41%), Positives = 9/12 (75%), Frame = -2 Query: 84 QCFIHRHHLQRH 49 +CFI + +L+ H Sbjct: 526 KCFIQKANLRTH 537 >gi|102425|pir||B41132 collagen-related protein 2 - Hydra magnipapillata (fragment) >gi|102429|pir||S21930 mini-collagen - Hydra sp >gi|9449|emb|CAA43380.1| (X61046) mini-collagen [Hydra sp.] Length = 142 Frame -1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | 142 0 50 100 Minus Strand HSPs: Score = 62 (21.8 bits), Expect = 2.8, P = 0.94 Identities = 12/21 (57%), Positives = 12/21 (57%), Frame = -1 Query: 103 ILCAVPPMFHTPPPPPAPWKP 41 I CA PP PPPPP P P Sbjct: 42 ICCAPPPPPPPPPPPPPPPPP 62 Score = 61 (21.5 bits), Expect = 3.7, P = 0.97 Identities = 11/22 (50%), Positives = 12/22 (54%), Frame = -1 Query: 106 TILCAVPPMFHTPPPPPAPWKP 41 T +C PP PPPPP P P Sbjct: 40 TPICCAPPPPPPPPPPPPPPPP 61 >gi|1020247|gb|AAA79461.1| (U31788) putative [Human papillomavirus type 44] Length = 144 Frame -1 hits (HSPs): ___________ __________________________________________________ Database sequence: | | | | 144 0 50 100 Minus Strand HSPs: Score = 62 (21.8 bits), Expect = 2.9, P = 0.94 Identities = 11/21 (52%), Positives = 13/21 (61%), Frame = -1 Query: 103 ILCAVPPMF---HTPPPPPAP 50 +LC P+ HTPPPPP P Sbjct: 49 VLCKTYPLLGLLHTPPPPPPP 69 Score = 61 (21.5 bits), Expect = 3.8, P = 0.98 Identities = 13/29 (44%), Positives = 16/29 (55%), Frame = -1 Query: 103 ILCAVPPMF---HTPPPPPAP--WKP*AH 32 +LC P+ HTPPPPP P +P H Sbjct: 49 VLCKTYPLLGLLHTPPPPPPPPLHRPHPH 77 >gi|1020271|gb|AAA79482.1| (U31791) putative [Human papillomavirus type 55] Length = 144 Frame -1 hits (HSPs): ___________ __________________________________________________ Database sequence: | | | | 144 0 50 100 Minus Strand HSPs: Score = 62 (21.8 bits), Expect = 2.9, P = 0.94 Identities = 11/21 (52%), Positives = 13/21 (61%), Frame = -1 Query: 103 ILCAVPPMF---HTPPPPPAP 50 +LC P+ HTPPPPP P Sbjct: 49 VLCKTYPLLGLLHTPPPPPPP 69 Score = 61 (21.5 bits), Expect = 3.8, P = 0.98 Identities = 13/29 (44%), Positives = 16/29 (55%), Frame = -1 Query: 103 ILCAVPPMF---HTPPPPPAP--WKP*AH 32 +LC P+ HTPPPPP P +P H Sbjct: 49 VLCKTYPLLGLLHTPPPPPPPPLHRPHLH 77 >gi|7523490|dbj|BAA94218.1| (AP001633) hypothetical protein [Oryza sativa] Length = 436 Frame -1 hits (HSPs): _______ __ __________________________________________________ Database sequence: | | | | 436 0 150 300 Minus Strand HSPs: Score = 53 (18.7 bits), Expect = 3.0, Sum P(2) = 0.95 Identities = 16/45 (35%), Positives = 23/45 (51%), Frame = -1 Query: 172 NAGLVDAVVALSGDKGTSIDKNTILCAVP------PMFHTPPPPPA 53 +A + D V A +GD+ T +DK L + P P + PPP A Sbjct: 53 DAAMGDEVAA-AGDEATMLDKGDALHSSPRLRLLAPRVSSTPPPSA 97 Score = 35 (12.3 bits), Expect = 3.0, Sum P(2) = 0.95 Identities = 5/8 (62%), Positives = 7/8 (87%), Frame = -1 Query: 55 APWKP*AH 32 APW+P +H Sbjct: 288 APWEPTSH 295 >gi|2760483|emb|CAA60014.1| (X86019) SH3-domain interacting protein [Homo sapiens] Length = 494 Frame 3 hits (HSPs): ___ ___ ___ __________________________________________________ Database sequence: | | | | | 494 0 150 300 450 Plus Strand HSPs: Score = 52 (18.3 bits), Expect = 3.1, Sum P(2) = 0.96 Identities = 10/16 (62%), Positives = 11/16 (68%), Frame = +3 Query: 42 GFHGAGGGGGV*NIGG 89 GF G GGGGG + GG Sbjct: 79 GFGGGGGGGGGGSFGG 94 Score = 37 (13.0 bits), Expect = 3.1, Sum P(2) = 0.96 Identities = 8/23 (34%), Positives = 11/23 (47%), Frame = +3 Query: 123 VPLSPDNATTASTRPAFPASVGS 191 VP P + T P FP + G+ Sbjct: 206 VPGGPRQPSPGPTPPPFPGNRGT 228 Score = 32 (11.3 bits), Expect = 9.8, Sum P(2) = 1.0 Identities = 6/19 (31%), Positives = 10/19 (52%), Frame = +3 Query: 126 PLSPDNATTASTRPAFPAS 182 PL PD + + P P++ Sbjct: 418 PLPPDRPSAGAPPPPPPST 436 >gi|4507911 ref|NP_003378.1| Wiskott-Aldrich syndrome protein interacting protein >gi|2906006|gb|AAC03767.1| (AF031588) WASP interacting protein [Homo sapiens] Length = 503 Frame 3 hits (HSPs): ___ ___ __________________________________________________ Database sequence: | | | | | 503 0 150 300 450 Plus Strand HSPs: Score = 52 (18.3 bits), Expect = 3.2, Sum P(2) = 0.96 Identities = 10/16 (62%), Positives = 11/16 (68%), Frame = +3 Query: 42 GFHGAGGGGGV*NIGG 89 GF G GGGGG + GG Sbjct: 79 GFGGGGGGGGGGSFGG 94 Score = 37 (13.0 bits), Expect = 3.2, Sum P(2) = 0.96 Identities = 8/23 (34%), Positives = 11/23 (47%), Frame = +3 Query: 123 VPLSPDNATTASTRPAFPASVGS 191 VP P + T P FP + G+ Sbjct: 206 VPGGPRQPSPGPTPPPFPGNRGT 228 >gi|1586823|prf||2204390A synaptojanin [Rattus norvegicus] Length = 1558 Frame -1 hits (HSPs): __ __ __ __________________________________________________ Database sequence: | | | | | 1558 0 500 1000 1500 Minus Strand HSPs: Score = 63 (22.2 bits), Expect = 3.3, Sum P(2) = 0.96 Identities = 14/35 (40%), Positives = 15/35 (42%), Frame = -1 Query: 145 ALSGDKGTSIDKNTILCAVPPMFHTPPPPPAPWKP 41 A S D S N C +P PPPPP P P Sbjct: 1498 AASFDDDWSKGTNVSFCVLPARRPPPPPPPVPLLP 1532 Score = 36 (12.7 bits), Expect = 3.3, Sum P(2) = 0.96 Identities = 6/17 (35%), Positives = 9/17 (52%), Frame = -1 Query: 202 YSHTEPTLAGNAGLVDA 152 Y PT+ AG++ A Sbjct: 1172 YGAARPTIPARAGVISA 1188 Score = 32 (11.3 bits), Expect = 8.2, Sum P(2) = 1.0 Identities = 9/23 (39%), Positives = 10/23 (43%), Frame = -1 Query: 199 SHTEPTLAGNAGLVDAVVALSGD 131 S T TL G V A + GD Sbjct: 992 SSTSSTLLGEDAEVSADFDMEGD 1014 >gi|7514092|pir||S68448 synaptojanin, 170K - rat Length = 1575 Frame -1 hits (HSPs): __ __ __ __________________________________________________ Database sequence: | | | | | 1575 0 500 1000 1500 Minus Strand HSPs: Score = 63 (22.2 bits), Expect = 3.3, Sum P(2) = 0.96 Identities = 14/35 (40%), Positives = 15/35 (42%), Frame = -1 Query: 145 ALSGDKGTSIDKNTILCAVPPMFHTPPPPPAPWKP 41 A S D S N C +P PPPPP P P Sbjct: 1515 AASFDDDWSKGTNVSFCVLPARRPPPPPPPVPLLP 1549 Score = 36 (12.7 bits), Expect = 3.3, Sum P(2) = 0.96 Identities = 6/17 (35%), Positives = 9/17 (52%), Frame = -1 Query: 202 YSHTEPTLAGNAGLVDA 152 Y PT+ AG++ A Sbjct: 1188 YGAARPTIPARAGVISA 1204 Score = 32 (11.3 bits), Expect = 8.4, Sum P(2) = 1.0 Identities = 9/23 (39%), Positives = 10/23 (43%), Frame = -1 Query: 199 SHTEPTLAGNAGLVDAVVALSGD 131 S T TL G V A + GD Sbjct: 992 SSTSSTLLGEDAEVSADFDMEGD 1014 >gi|284517|pir||D42825 Kruppel-type zinc finger protein ZNF70 - human (fragment) Length = 91 Frame -2 hits (HSPs): _________________ Annotated Domains: ____________ ____________ ____________ __________________________________________________ Database sequence: | | | | | | 91 0 20 40 60 80 __________________ Annotated Domains: PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 10..30 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 38..58 PROSITE ZINC_FINGER_C2H2: Zinc finger, C2H2 type 66..86 __________________ Minus Strand HSPs: Score = 59 (20.8 bits), Expect = 3.4, P = 0.97 Identities = 14/30 (46%), Positives = 15/30 (50%), Frame = -2 Query: 138 QATKAHLLTKIPFCVLFPQCFIHRHHLQRH 49 Q K H L K C L + F HR HL RH Sbjct: 25 QHRKIHTLKKPHECDLCGKAFCHRSHLIRH 54 >gi|1363496|pir||S57663 cellulase (EC 3.2.1.4) 3D precursor - pepper >gi|1247397|emb|CAA60737.1| (X87323) Beta-1,4-endoglycanohydrolase [Capsicum annuum] Length = 506 Frame -1 hits (HSPs): _____ Annotated Domains: _______________________________________________ __________________________________________________ Database sequence: | | | | | 506 0 150 300 450 __________________ Annotated Domains: DOMO DM00748: GLYCOSYLHYDROLASESFAMILY9 13..478 PROSITE GLYCOSYL_HYDROL_F9_1: Glycosyl hydrolase 386..402 PROSITE GLYCOSYL_HYDROL_F9_2: Glycosyl hydrolase 450..468 __________________ Minus Strand HSPs: Score = 69 (24.3 bits), Expect = 3.6, P = 0.97 Identities = 17/41 (41%), Positives = 22/41 (53%), Frame = -1 Query: 208 TEYSHTEPTLAGNAGLVDAVVALSGDKGTSIDKN---TILC 95 ++YSH+EPT NA V +V AL G I+ ILC Sbjct: 455 SDYSHSEPTTYMNAAFVGSVAALIGQNRRQINSQFNEPILC 495 >gi|1655543|emb|CAA65826.1| (X97188) cellulase [Capsicum annuum] Length = 506 Frame -1 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | | 506 0 150 300 450 Minus Strand HSPs: Score = 69 (24.3 bits), Expect = 3.6, P = 0.97 Identities = 17/41 (41%), Positives = 22/41 (53%), Frame = -1 Query: 208 TEYSHTEPTLAGNAGLVDAVVALSGDKGTSIDKN---TILC 95 ++YSH+EPT NA V +V AL G I+ ILC Sbjct: 455 SDYSHSEPTTYMNAAFVGSVAALIGQNRRQINSQFNEPILC 495 >gi|116223|sp|P17110|CH36_CERCA CHORION PROTEIN S36 >gi|103020|pir||S09208 chorion protein s36 - Mediterranean fruit fly >gi|295730|emb|CAA35723.1| (X51342) chorion protein s36 [Ceratitis capitata] Length = 320 Frame 3 hits (HSPs): ___ _____ Annotated Domains: _______________________________________ ____ _ __________________________________________________ Database sequence: | | | | | | | | 320 0 50 100 150 200 250 300 __________________ Annotated Domains: DOMO DM07741: 1..194 DOMO DM08239: 196..245 Entrez Repetitive region: 1. 178..181 Entrez Repetitive region: 2. 258..261 Entrez Repetitive region: 3. 266..269 Entrez Repetitive region: 4. 274..277 Entrez Repetitive region: 5. 290 PRODOM PD022729: CH36(3) 63..245 __________________ Plus Strand HSPs: Score = 49 (17.2 bits), Expect = 3.7, Sum P(2) = 0.98 Identities = 9/17 (52%), Positives = 11/17 (64%), Frame = +3 Query: 21 PMAR*AYGFHGAGGGGG 71 P+A +YG GGGGG Sbjct: 14 PLATASYGSSSGGGGGG 30 Score = 35 (12.3 bits), Expect = 3.7, Sum P(2) = 0.98 Identities = 9/27 (33%), Positives = 14/27 (51%), Frame = +3 Query: 105 VFLSIDVPLSPDNATTASTRPAFPASV 185 V+ S+ VP +N +T+P P V Sbjct: 132 VYRSLLVPSGQNNHQVIATQPLPPIIV 158 >gi|5901928 ref|NP_008938.1| pre-mRNA cleavage factor Im (68kD) >gi|1362819|pir||S57447 HPBRII-7 protein - human >gi|871299|emb|CAA47752.1| (X67337) Human pre-mRNA cleavage factor I 68 kDa subunit [Homo sapiens] >gi|871301|emb|CAA47751.1| (X67336) HPBRII-7 [Homo sapiens] Length = 551 Frame -1 hits (HSPs): ____ __ __________________________________________________ Database sequence: | | | | | 551 0 150 300 450 Minus Strand HSPs: Score = 56 (19.7 bits), Expect = 3.9, Sum P(2) = 0.98 Identities = 9/16 (56%), Positives = 10/16 (62%), Frame = -1 Query: 88 PPMFHTPPPPPAPWKP 41 PP PPPPP P+ P Sbjct: 310 PPPQQGPPPPPGPFPP 325 Score = 33 (11.6 bits), Expect = 3.9, Sum P(2) = 0.98 Identities = 7/27 (25%), Positives = 14/27 (51%), Frame = -1 Query: 163 LVDAVVALSGDKGTSIDKNTILCAVPP 83 L D V++ S + G + + + +PP Sbjct: 32 LYDDVISPSANNGDAPEDRDYMDTLPP 58 WARNING: HSPs involving 48 database sequences were not reported due to the limiting value of parameter B = 50. Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.96 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.349 0.158 0.527 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.343 0.145 0.670 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.348 0.150 0.548 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.325 0.141 0.469 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.353 0.153 0.563 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.341 0.136 0.493 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 68 68 10. 57 3 12 22 0.12 30 27 0.10 31 +2 0 69 69 10. 57 3 12 22 0.12 30 27 0.11 31 +1 0 69 68 10. 57 3 12 22 0.12 30 27 0.10 31 -1 0 69 69 10. 57 3 12 22 0.12 30 27 0.11 31 -2 0 69 69 10. 57 3 12 22 0.12 30 27 0.11 31 -3 0 68 68 10. 57 3 12 22 0.12 30 27 0.10 31 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 8:50 PM CDT May 27, 2000 Format: BLAST # of letters in database: 158,518,215 # of sequences in database: 505,245 # of database sequences satisfying E: 98 No. of states in DFA: 584 (58 KB) Total size of DFA: 138 KB (192 KB) Time to generate neighborhood: 0.01u 0.00s 0.01t Elapsed: 00:00:00 No. of threads or processors used: 4 Search cpu time: 114.82u 1.02s 115.84t Elapsed: 00:01:18 Total cpu time: 114.87u 1.12s 115.99t Elapsed: 00:01:19 Start: Wed Feb 14 11:09:30 2001 End: Wed Feb 14 11:10:49 2001 WARNINGS ISSUED: 2
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000