WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= SSH4B03.SEQ(1>638) (603 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 505,245 sequences; 158,518,215 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 4 Sequences : less than 4 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 692 191 |=============================================== 6310 501 95 |======================= 3980 406 84 |===================== 2510 322 72 |================== 1580 250 77 |=================== 1000 173 55 |============= 631 118 39 |========= 398 79 20 |===== 251 59 14 |=== 158 45 11 |== 100 34 4 |= 63.1 30 4 |= 39.8 26 1 |: 25.1 25 1 |: 15.8 24 0 | >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 24 <<<<<<<<<<<<<<<<< 10.0 24 0 | 6.31 24 0 | 3.98 24 0 | 2.51 24 1 |: 1.58 23 0 | 1.00 23 1 |: 0.63 22 0 | 0.40 22 0 | 0.25 22 0 | 0.16 22 1 |: 0.10 21 0 | 0.063 21 0 | 0.040 21 2 |: 0.025 19 1 |: 0.016 18 0 | 0.010 18 0 | 0.0063 18 0 | 0.0040 18 4 |= 0.0025 14 0 | 0.0016 14 3 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|7487587|pir||T08918hypothetical protein T32A16.160... +1 218 5.7e-16 1 gi|2262116|gb|AAB63624.1|(AC002343) cellulose synthas... +1 218 6.5e-16 1 gi|7487589|pir||T08920hypothetical protein T32A16.180... +1 198 8.7e-14 1 gi|2262114|gb|AAB63622.1|(AC002343) cellulose synthas... +1 198 9.5e-14 1 gi|7487588|pir||T08919hypothetical protein T32A16.170... +1 191 4.5e-13 1 gi|2262115|gb|AAB63623.1|(AC002343) cellulose synthas... +1 191 4.9e-13 1 gi|7484865|pir||T02552cellulose synthase homolog T26B... +1 142 5.9e-07 1 gi|7484862|pir||T02553cellulose synthase homolog T26B... +1 137 4.4e-06 1 gi|7484715|pir||T10800cellulose synthase (EC 2.4.1.-)... +1 127 8.8e-05 1 gi|7484863|pir||T02560cellulose synthase homolog T26B... +1 126 0.00014 1 gi|4432865|gb|AAD20713.1|(AC006300) putative cellulos... +1 124 0.00037 1 gi|2827143|gb|AAC39336.1|(AF027174) cellulose synthas... +1 120 0.0012 1 gi|7484864|pir||T02561cellulose synthase homolog T26B... +1 118 0.0014 1 gi|7484861|pir||T05351cellulose synthase (EC 2.4.1.-)... +1 119 0.0016 1 gi|7484859|pir||T08583cellulose synthase (EC 2.4.1.-)... +1 117 0.0028 1 gi|4886756|gb|AAD32031.1|AF088917_1(AF088917) cellulo... +1 116 0.0034 1 gi|5230423|gb|AAD40885.1|AF091713_1(AF091713) cellulo... +1 116 0.0034 1 gi|6446577|gb|AAD39534.2|(AF150630) cellulose synthas... +1 116 0.0035 1 gi|4417271|gb|AAD20396.1|(AC007019) putative cellulos... +1 110 0.019 1 gi|7484714|pir||T10797cellulose synthase (EC 2.4.1.-)... +1 107 0.037 1 gi|4115905|gb|AAD03417.1|(AF072131) secondary xylem c... +1 107 0.037 1 gi|3135611|gb|AAC29067.1|(AF062485) cellulose synthas... +1 103 0.12 1 gi|320932|pir||A44971hypothetical protein 1 - Plasmod... +2 70 0.51 1 gi|2924781|gb|AAC04910.1|(AC002334) putative cellulos... +1 93 0.81 1 Locally-aligned regions (HSPs) with respect to query sequence: Locus_ID Frame 2 Hits gi|320932 | ___________ Prosite Hits: ___ __________________________________________________ Query sequence: | | | | || 201 0 50 100 150 200 __________________ Prosite hits: TYR_PHOSPHO_SITE Tyrosine kinase phosphorylation site. 99..106 __________________ Locus_ID Frame 1 Hits gi|7487587 |_______________________________________ gi|2262116 |_______________________________________ gi|7487589 |______________________________ gi|2262114 |______________________________ gi|7487588 |____________________________ gi|2262115 |____________________________ gi|7484865 |_____________________________ gi|7484862 |____________________________ gi|7484715 |____________________________ gi|7484863 | _________________________ gi|4432865 |_____________________________ gi|2827143 |_____________________________ gi|7484864 | _________________________ gi|7484861 |_______________________________________ gi|7484859 |_______________________________________ gi|4886756 |________________________________ gi|5230423 |________________________________ gi|6446577 |_____________________________ gi|4417271 |_______________________________________ gi|7484714 |____________________________ gi|4115905 |____________________________ gi|3135611 |_______________________________________ gi|2924781 |_______________________________________ __________________________________________________ Query sequence: | | | | || 201 0 50 100 150 200
Use the and icons to retrieve links to Entrez:
>gi|7487587|pir||T08918 hypothetical protein T32A16.160 - Arabidopsis thaliana >gi|4972103|emb|CAB43899.1| (AL078468) cellulose synthase catalytic subunit-like protein [Arabidopsis thaliana] >gi|7269248|emb|CAB81317.1| (AL161560) cellulose synthase catalytic subunit-like protein [Arabidopsis thaliana] Length = 689 Frame 1 hits (HSPs): ____________ __________________________________________________ Database sequence: | | | | | | 689 0 150 300 450 600 Plus Strand HSPs: Score = 218 (76.7 bits), Expect = 5.7e-16, P = 5.7e-16 Identities = 53/153 (34%), Positives = 79/153 (51%), Frame = +1 Query: 7 MGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCEY 186 +GYC Y WA SLP + Y F+ + LL +FP+ S W + FL YG L ++ Sbjct: 474 VGYCQYACWAFWSLPLIVYGFLPQLALLYQSSVFPKSSDPWFWLYIVLFLGAYGQDLLDF 533 Query: 187 LICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVVTKDVQREVL 366 ++ G T GWWN QR+ I +S+LFGFI+ K L LS F +T K + Q + Sbjct: 534 VLEGGTYGGWWNDQRMWSIRGFSSHLFGFIEFTLKTLNLSTHGFNVTSKANDDEEQSKRY 593 Query: 367 SKKFI*IWKLFQSCLTHI*LQ-LALLNLFGLLLG 465 K+ I++ S + L +A++NL + G Sbjct: 594 EKE---IFEFGPSSSMFLPLTTVAIVNLLAFVWG 624 >gi|2262116|gb|AAB63624.1| (AC002343) cellulose synthase isolog [Arabidopsis thaliana] Length = 747 Frame 1 hits (HSPs): ___________ __________________________________________________ Database sequence: | | | | | | 747 0 150 300 450 600 Plus Strand HSPs: Score = 218 (76.7 bits), Expect = 6.5e-16, P = 6.5e-16 Identities = 53/153 (34%), Positives = 79/153 (51%), Frame = +1 Query: 7 MGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCEY 186 +GYC Y WA SLP + Y F+ + LL +FP+ S W + FL YG L ++ Sbjct: 532 VGYCQYACWAFWSLPLIVYGFLPQLALLYQSSVFPKSSDPWFWLYIVLFLGAYGQDLLDF 591 Query: 187 LICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVVTKDVQREVL 366 ++ G T GWWN QR+ I +S+LFGFI+ K L LS F +T K + Q + Sbjct: 592 VLEGGTYGGWWNDQRMWSIRGFSSHLFGFIEFTLKTLNLSTHGFNVTSKANDDEEQSKRY 651 Query: 367 SKKFI*IWKLFQSCLTHI*LQ-LALLNLFGLLLG 465 K+ I++ S + L +A++NL + G Sbjct: 652 EKE---IFEFGPSSSMFLPLTTVAIVNLLAFVWG 682 >gi|7487589|pir||T08920 hypothetical protein T32A16.180 - Arabidopsis thaliana >gi|4972105|emb|CAB43901.1| (AL078468) putative protein [Arabidopsis thaliana] >gi|7269250|emb|CAB81319.1| (AL161560) putative protein [Arabidopsis thaliana] Length = 727 Frame 1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | | 727 0 150 300 450 600 Plus Strand HSPs: Score = 198 (69.7 bits), Expect = 8.7e-14, P = 8.7e-14 Identities = 41/118 (34%), Positives = 60/118 (50%), Frame = +1 Query: 7 MGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCEY 186 +GYCN S+P Y + + L+ G+ +FP+ S W + F Y L ++ Sbjct: 484 LGYCNSPFKPFWSIPLTVYGLLPQLALISGVSVFPKASDPWFWLYIILFFGAYAQDLSDF 543 Query: 187 LICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVVTKDVQRE 360 L+ G T WWN QR+ I +S+ FGFI+ + K L LS KF +T K D QR+ Sbjct: 544 LLEGGTYRKWWNDQRMLMIKGLSSFFFGFIEFILKTLNLSTPKFNVTSKANDDDEQRK 601 >gi|2262114|gb|AAB63622.1| (AC002343) cellulose synthase isolog [Arabidopsis thaliana] Length = 770 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | | | 770 0 150 300 450 600 750 Plus Strand HSPs: Score = 198 (69.7 bits), Expect = 9.5e-14, P = 9.5e-14 Identities = 41/118 (34%), Positives = 60/118 (50%), Frame = +1 Query: 7 MGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCEY 186 +GYCN S+P Y + + L+ G+ +FP+ S W + F Y L ++ Sbjct: 527 LGYCNSPFKPFWSIPLTVYGLLPQLALISGVSVFPKASDPWFWLYIILFFGAYAQDLSDF 586 Query: 187 LICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVVTKDVQRE 360 L+ G T WWN QR+ I +S+ FGFI+ + K L LS KF +T K D QR+ Sbjct: 587 LLEGGTYRKWWNDQRMLMIKGLSSFFFGFIEFILKTLNLSTPKFNVTSKANDDDEQRK 644 >gi|7487588|pir||T08919 hypothetical protein T32A16.170 - Arabidopsis thaliana >gi|4972104|emb|CAB43900.1| (AL078468) putative protein [Arabidopsis thaliana] >gi|7269249|emb|CAB81318.1| (AL161560) putative protein [Arabidopsis thaliana] Length = 686 Frame 1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | | 686 0 150 300 450 600 Plus Strand HSPs: Score = 191 (67.2 bits), Expect = 4.5e-13, P = 4.5e-13 Identities = 35/109 (32%), Positives = 56/109 (51%), Frame = +1 Query: 7 MGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCEY 186 +GYC+Y W +P + Y + + L+ G+ +FP+ S W + FL Y L ++ Sbjct: 471 LGYCHYAFWPFWCIPLVVYGILPQVALIHGVSVFPKASDPWFWLYIILFLGGYAQDLSDF 530 Query: 187 LICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDK 333 L+ G T WWN QR+ + +S+ FGF + K L LS + +T K Sbjct: 531 LLEGGTYRKWWNDQRMWMVRGLSSFFFGFTEFTLKTLNLSTQGYNVTSK 579 >gi|2262115|gb|AAB63623.1| (AC002343) cellulose synthase isolog [Arabidopsis thaliana] Length = 730 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | | 730 0 150 300 450 600 Plus Strand HSPs: Score = 191 (67.2 bits), Expect = 4.9e-13, P = 4.9e-13 Identities = 35/109 (32%), Positives = 56/109 (51%), Frame = +1 Query: 7 MGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCEY 186 +GYC+Y W +P + Y + + L+ G+ +FP+ S W + FL Y L ++ Sbjct: 515 LGYCHYAFWPFWCIPLVVYGILPQVALIHGVSVFPKASDPWFWLYIILFLGGYAQDLSDF 574 Query: 187 LICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDK 333 L+ G T WWN QR+ + +S+ FGF + K L LS + +T K Sbjct: 575 LLEGGTYRKWWNDQRMWMVRGLSSFFFGFTEFTLKTLNLSTQGYNVTSK 623 >gi|7484865|pir||T02552 cellulose synthase homolog T26B15.9 - Arabidopsis thaliana >gi|3298541|gb|AAC25935.1| (AC004681) putative cellulose synthase [Arabidopsis thaliana] Length = 712 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | | 712 0 150 300 450 600 Plus Strand HSPs: Score = 142 (50.0 bits), Expect = 5.9e-07, P = 5.9e-07 Identities = 35/113 (30%), Positives = 52/113 (46%), Frame = +1 Query: 7 MGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCEY 186 + Y W S+P L Y + CLL LFP+ + ++ Y SL E+ Sbjct: 489 LAYLYIFTWGLRSIPELIYCLLPAYCLLHNAALFPKGVYLGIVVTLVGMHCLY--SLWEF 546 Query: 187 LICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVVTK 345 + G + W+ Q I T S+LF D + K LG+S+T FI+T K + K Sbjct: 547 MSLGFSVQSWFASQSFWRIKTTCSWLFSIPDIILKLLGISKTVFIVTKKTMPK 599 >gi|7484862|pir||T02553 cellulose synthase homolog T26B15.10 - Arabidopsis thaliana >gi|3298542|gb|AAC25936.1| (AC004681) putative cellulose synthase [Arabidopsis thaliana] Length = 755 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | || 755 0 150 300 450 600 750 Plus Strand HSPs: Score = 137 (48.2 bits), Expect = 4.4e-06, P = 4.4e-06 Identities = 33/111 (29%), Positives = 52/111 (46%), Frame = +1 Query: 7 MGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCEY 186 + Y W S+P L Y + CLL LFP+ + ++ Y +L E+ Sbjct: 532 LAYLYVFSWGLRSIPELFYCLLPAYCLLHNSALFPKGVYLGIIITLVGIHCLY--TLWEF 589 Query: 187 LICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVV 339 + G + W+ Q I T S+LF +D + K LG+S+T FI+T K + Sbjct: 590 MNLGFSIQSWYVTQSFGRIKTTCSWLFSVLDVILKLLGISKTVFIVTKKTM 640 >gi|7484715|pir||T10800 cellulose synthase (EC 2.4.1.-) catalytic chain celA2 - upland cotton (fragment) >gi|1706958|gb|AAB37767.1| (U58284) cellulose synthase [Gossypium hirsutum] Length = 685 Frame 1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | | 685 0 150 300 450 600 Plus Strand HSPs: Score = 127 (44.7 bits), Expect = 8.8e-05, P = 8.8e-05 Identities = 31/110 (28%), Positives = 52/110 (47%), Frame = +1 Query: 4 QMGYCNYLLWAPMSLPTLCYVFVSPICLLRG---IPLFPQLSSIWVLPFAYAFLATYGFS 174 ++ Y N +++ S+P L Y + +CLL G IP L+S+W L + +AT Sbjct: 456 RLAYINTIVYPFTSIPLLAYCTIPAVCLLTGKFIIPTLSNLTSVWFLALFLSIIAT---G 512 Query: 175 LCEYLICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDK 333 + E G + WW ++ I +++LF + K L T F +T K Sbjct: 513 VLELRWSGVSIQDWWRNEQFWVIGGVSAHLFAVFQGLLKVLAGVDTNFTVTAK 565 >gi|7484863|pir||T02560 cellulose synthase homolog T26B15.17 - Arabidopsis thaliana >gi|3298549|gb|AAC25943.1| (AC004681) putative cellulose synthase [Arabidopsis thaliana] Length = 748 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | | 748 0 150 300 450 600 Plus Strand HSPs: Score = 126 (44.4 bits), Expect = 0.00014, P = 0.00014 Identities = 33/99 (33%), Positives = 47/99 (47%), Frame = +1 Query: 43 SLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCEYLICGSTANGWWN 222 S+P L Y + CLL LFP+ + + Y +L E++ G + W Sbjct: 535 SIPELIYCLLPAYCLLHNSTLFPKGLYLGITVTLVGIHCLY--TLWEFMSLGYSVQSWLV 592 Query: 223 LQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVV 339 Q + I T+S+LF D K LG+S+T FIIT K V Sbjct: 593 SQSVWRIVATSSWLFSIFDITLKLLGISETVFIITKKTV 631 >gi|4432865|gb|AAD20713.1| (AC006300) putative cellulose synthase catalytic subunit [Arabidopsis thaliana] Length = 1065 Frame 1 hits (HSPs): ______ __________________________________________________ Database sequence: | | | | | | | || 1065 0 150 300 450 600 750 900 1050 Plus Strand HSPs: Score = 124 (43.7 bits), Expect = 0.00037, P = 0.00037 Identities = 27/115 (23%), Positives = 51/115 (44%), Frame = +1 Query: 4 QMGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCE 183 ++ Y N +++ S+P L Y + CL+ + P++S++ L F F + Y ++ E Sbjct: 837 RIAYINTIVYPITSIPLLAYCMLPAFCLITNTFIIPEISNLASLCFMLLFASIYASAILE 896 Query: 184 YLICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVVTKD 348 WW ++ I T+++LF + K T F +T K +D Sbjct: 897 LKWSDVALEDWWRNEQFWVIGGTSAHLFAVFQGLLKVFAGIDTNFTVTSKASDED 951 >gi|2827143|gb|AAC39336.1| (AF027174) cellulose synthase catalytic subunit [Arabidopsis thaliana] Length = 1065 Frame 1 hits (HSPs): ______ __________________________________________________ Database sequence: | | | | | | | || 1065 0 150 300 450 600 750 900 1050 Plus Strand HSPs: Score = 120 (42.2 bits), Expect = 0.0012, P = 0.0012 Identities = 29/115 (25%), Positives = 51/115 (44%), Frame = +1 Query: 4 QMGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCE 183 + Y N ++ S+P L Y + +CL + PQ+S+I + F FL+ + + E Sbjct: 835 RFAYVNTTIYPITSIPLLMYCTLLAVCLFTNQFIIPQISNIASIWFLSLFLSIFATGILE 894 Query: 184 YLICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVVTKD 348 G + WW ++ I +++LF + K L T F +T K +D Sbjct: 895 MRWSGVGIDEWWRNEQFWVIGGVSAHLFAVFQGILKVLAGIDTNFTVTSKASDED 949 >gi|7484864|pir||T02561 cellulose synthase homolog T26B15.18 - Arabidopsis thaliana >gi|3298550|gb|AAC25944.1| (AC004681) putative cellulose synthase [Arabidopsis thaliana] Length = 757 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | || 757 0 150 300 450 600 750 Plus Strand HSPs: Score = 118 (41.5 bits), Expect = 0.0014, P = 0.0014 Identities = 29/99 (29%), Positives = 47/99 (47%), Frame = +1 Query: 43 SLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCEYLICGSTANGWWN 222 S+P L Y + CLL LFP+ + + Y +L E++ G + W+ Sbjct: 543 SIPELIYCLLPAYCLLHNSALFPKGLCLGITMLLAGMHCLY--TLWEFMCLGHSIQSWYV 600 Query: 223 LQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVV 339 Q I T+S+LF D + K LGLS+ F+++ K + Sbjct: 601 SQSFWRIVATSSWLFSIFDIILKLLGLSKNVFLVSKKTM 639 >gi|7484861|pir||T05351 cellulose synthase (EC 2.4.1.-) catalytic chain RSW1 - Arabidopsis thaliana >gi|2827139|gb|AAC39334.1| (AF027172) cellulose synthase catalytic subunit [Arabidopsis thaliana] >gi|4049343|emb|CAA22568.1| (AL034567) cellulose synthase catalytic subunit (RSW1) [Arabidopsis thaliana] >gi|7270145|emb|CAB79958.1| (AL161581) cellulose synthase catalytic subunit (RSW1) [Arabidopsis thaliana] Length = 1081 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | | | | | 1081 0 150 300 450 600 750 900 1050 Plus Strand HSPs: Score = 119 (41.9 bits), Expect = 0.0016, P = 0.0016 Identities = 34/154 (22%), Positives = 67/154 (43%), Frame = +1 Query: 4 QMGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCE 183 ++ Y N +++ S+P + Y + CL+ + P++S+ + F F++ + E Sbjct: 850 RIAYINTIVYPITSIPLIAYCILPAFCLITDRFIIPEISNYASIWFILLFISIAVTGILE 909 Query: 184 YLICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVVTKDVQREV 363 G + WW ++ I T+++LF + K L T F +T K +D Sbjct: 910 LRWSGVSIEDWWRNEQFWVIGGTSAHLFAVFQGLLKVLAGIDTNFTVTSKATDEDGD--- 966 Query: 364 LSKKFI*IWKLFQSCLTHI*LQLALLNLFGLLLG 465 ++ +I W T + L+NL G++ G Sbjct: 967 FAELYIFKWTALLIPPTTV----LLVNLIGIVAG 996 >gi|7484859|pir||T08583 cellulose synthase (EC 2.4.1.-) catalytic chain - Arabidopsis thaliana >gi|2827141|gb|AAC39335.1| (AF027173) cellulose synthase catalytic subunit [Arabidopsis thaliana] >gi|4914447|emb|CAB43650.1| (AL050351) cellulose synthase catalytic subunit (Ath-A) [Arabidopsis thaliana] >gi|7270919|emb|CAB80598.1| (AL161595) cellulose synthase catalytic subunit (Ath-A) [Arabidopsis thaliana] Length = 1084 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | | | | | 1084 0 150 300 450 600 750 900 1050 Plus Strand HSPs: Score = 117 (41.2 bits), Expect = 0.0028, P = 0.0028 Identities = 36/154 (23%), Positives = 66/154 (42%), Frame = +1 Query: 4 QMGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCE 183 + Y N +++ SLP + Y + +CLL G + P++S+ + F F++ + E Sbjct: 854 RFSYINSVVYPWTSLPLIVYCSLPAVCLLTGKFIVPEISNYAGILFMLMFISIAVTGILE 913 Query: 184 YLICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVVTKDVQREV 363 G + WW ++ I +S+LF + K L T F +T K Sbjct: 914 MQWGGVGIDDWWRNEQFWVIGGASSHLFALFQGLLKVLAGVNTNFTVTSKAADDGA---- 969 Query: 364 LSKKFI*IWKLFQSCLTHI*LQLALLNLFGLLLG 465 S+ +I W T L ++N+ G+++G Sbjct: 970 FSELYIFKWTTLLIPPT----TLLIINIIGVIVG 999 >gi|4886756|gb|AAD32031.1|AF088917_1 (AF088917) cellulose synthase catalytic subunit [Arabidopsis thaliana] Length = 1026 Frame 1 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | | | | 1026 0 150 300 450 600 750 900 Plus Strand HSPs: Score = 116 (40.8 bits), Expect = 0.0034, P = 0.0034 Identities = 31/125 (24%), Positives = 54/125 (43%), Frame = +1 Query: 4 QMGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCE 183 + Y N ++ S+P L Y + ICLL + P +S+ L F F++ + E Sbjct: 797 RFAYANTTIYPFTSIPLLAYCILPAICLLTDKFIMPPISTFASLFFISLFMSIIVTGILE 856 Query: 184 YLICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVVTKDVQREV 363 G + WW ++ I +++LF + + K L T F +T K D E+ Sbjct: 857 LRWSGVSIEEWWRNEQFWVIGGISAHLFAVVQGLLKILAGIDTNFTVTSKATDDDDFGEL 916 Query: 364 LSKKF 378 + K+ Sbjct: 917 YAFKW 921 >gi|5230423|gb|AAD40885.1|AF091713_1 (AF091713) cellulose synthase catalytic subunit [Arabidopsis thaliana] Length = 1026 Frame 1 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | | | | 1026 0 150 300 450 600 750 900 Plus Strand HSPs: Score = 116 (40.8 bits), Expect = 0.0034, P = 0.0034 Identities = 31/125 (24%), Positives = 54/125 (43%), Frame = +1 Query: 4 QMGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCE 183 + Y N ++ S+P L Y + ICLL + P +S+ L F F++ + E Sbjct: 797 RFAYANTTIYPFTSIPLLAYCILPAICLLTDKFIMPPISTFASLFFISLFMSIIVTGILE 856 Query: 184 YLICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVVTKDVQREV 363 G + WW ++ I +++LF + + K L T F +T K D E+ Sbjct: 857 LRWSGVSIEEWWRNEQFWVIGGISAHLFAVVQGLLKILAGIDTNFTVTSKATDDDDFGEL 916 Query: 364 LSKKF 378 + K+ Sbjct: 917 YAFKW 921 >gi|6446577|gb|AAD39534.2| (AF150630) cellulose synthase catalytic subunit [Gossypium hirsutum] Length = 1067 Frame 1 hits (HSPs): ______ __________________________________________________ Database sequence: | | | | | | | || 1067 0 150 300 450 600 750 900 1050 Plus Strand HSPs: Score = 116 (40.8 bits), Expect = 0.0035, P = 0.0035 Identities = 27/115 (23%), Positives = 52/115 (45%), Frame = +1 Query: 4 QMGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCE 183 + Y N ++ ++P L Y + +CLL + PQ+S++ + F FL+ + + + Sbjct: 837 RFAYVNTTIYPVTAIPLLMYCTLPAVCLLTNKFIIPQISNLASIWFISLFLSIFATGILK 896 Query: 184 YLICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVVTKD 348 G + WW ++ I +++LF + K L T F +T K +D Sbjct: 897 MKWNGVGIDQWWRNEQFWVIGGVSAHLFAVFQGLLKVLAGIDTNFTVTSKASDED 951 >gi|4417271|gb|AAD20396.1| (AC007019) putative cellulose synthase catalytic subunit [Arabidopsis thaliana] Length = 1088 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | | | | | 1088 0 150 300 450 600 750 900 1050 Plus Strand HSPs: Score = 110 (38.7 bits), Expect = 0.019, P = 0.019 Identities = 38/154 (24%), Positives = 67/154 (43%), Frame = +1 Query: 4 QMGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCE 183 + Y N +++ SLP L Y + ICLL G + P++S+ + F F++ + E Sbjct: 858 RFSYINSVVYPWTSLPLLVYCSLPAICLLTGKFIVPEISNYAGILFLLMFMSIAVTGILE 917 Query: 184 YLICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVVTKDVQREV 363 + WW ++ I +S+LF + K L T F +T K D + Sbjct: 918 MQWGKIGIDDWWRNEQFWVIGGVSSHLFALFQGLLKVLAGVSTNFTVTSKAAD-DGE--- 973 Query: 364 LSKKFI*IWKLFQSCLTHI*LQLALLNLFGLLLG 465 S+ +I W T L ++N+ G+++G Sbjct: 974 FSELYIFKWTSLLIPPT----TLLIINIVGVIVG 1003 >gi|7484714|pir||T10797 cellulose synthase (EC 2.4.1.-) catalytic chain celA1 - upland cotton >gi|1706956|gb|AAB37766.1| (U58283) cellulose synthase [Gossypium hirsutum] Length = 974 Frame 1 hits (HSPs): ______ __________________________________________________ Database sequence: | | | | | | | | 974 0 150 300 450 600 750 900 Plus Strand HSPs: Score = 107 (37.7 bits), Expect = 0.037, P = 0.037 Identities = 29/110 (26%), Positives = 50/110 (45%), Frame = +1 Query: 4 QMGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCE 183 ++ Y N +++ SLP + Y + ICLL G + P LS++ + F FL+ ++ E Sbjct: 743 RLAYINTIVYPFTSLPLIAYCSLPAICLLTGKFIIPTLSNLASVLFLGLFLSIIVTAVLE 802 Query: 184 YLICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDK 333 G + W ++ I +++LF K L T F +T K Sbjct: 803 LRWSGVSIEDLWRNEQFWVIGGVSAHLFAVFQGFLKMLAGIDTNFTVTAK 852 >gi|4115905|gb|AAD03417.1| (AF072131) secondary xylem cellulose synthase [Populus tremuloides] Length = 978 Frame 1 hits (HSPs): ______ __________________________________________________ Database sequence: | | | | | | | | 978 0 150 300 450 600 750 900 Plus Strand HSPs: Score = 107 (37.7 bits), Expect = 0.037, P = 0.037 Identities = 27/110 (24%), Positives = 50/110 (45%), Frame = +1 Query: 4 QMGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGFSLCE 183 ++ Y N +++ SLP + Y + +CLL G + P LS++ + F F++ ++ E Sbjct: 748 RLAYINTIVYPFTSLPLIAYCTIPAVCLLTGKFIIPTLSNLASMLFLGLFISIIVTAVLE 807 Query: 184 YLICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDK 333 G + W ++ I +++LF K L T F +T K Sbjct: 808 LRWSGVSIEDLWRNEQFWVIGGVSAHLFAVFQGFLKMLAGIDTNFTVTAK 857 >gi|3135611|gb|AAC29067.1| (AF062485) cellulose synthase [Arabidopsis thaliana] Length = 1081 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | | | | | 1081 0 150 300 450 600 750 900 1050 Plus Strand HSPs: Score = 103 (36.3 bits), Expect = 0.12, P = 0.12 Identities = 38/154 (24%), Positives = 71/154 (46%), Frame = +1 Query: 4 QMGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAF--LATYGFSL 177 ++ Y N +++ SLP + Y + ICLL G + P++S+ + F F +A G Sbjct: 852 RLSYINSVVYPWTSLPLIVYCSLPAICLLTGKFIVPEISNYASILFMALFSSIAITGILE 911 Query: 178 CEYLICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVVTKDVQR 357 ++ G + WW ++ I +++LF + K L T F +T K D + Sbjct: 912 MQWGKVG--IDDWWRNEQFWVIGGVSAHLFALFQGLLKVLAGVDTNFTVTSKAAD-DGE- 967 Query: 358 EVLSKKFI*IWKLFQSCLTHI*LQLALLNLFGLLLG 465 S ++ W S L + L ++N+ G+++G Sbjct: 968 --FSDLYLFKWT---SLLIPP-MTLLIINVIGVIVG 997 >gi|320932|pir||A44971 hypothetical protein 1 - Plasmodium brasilianum Length = 96 Frame 2 hits (HSPs): ______________________ __________________________________________________ Database sequence: | | | | | | 96 0 20 40 60 80 Plus Strand HSPs: Score = 70 (24.6 bits), Expect = 0.72, P = 0.51 Identities = 14/41 (34%), Positives = 23/41 (56%), Frame = +2 Query: 47 CLLSVMSLFLLFAY-FVAFHCSHSFQAYGSFHL-HMHFSLH 163 C+ + F L ++ F +H HS+ +Y S+HL H + S H Sbjct: 10 CIFIFIFTFTLISFAFHPYHSYHSYHSYHSYHLYHSYHSYH 50 >gi|2924781|gb|AAC04910.1| (AC002334) putative cellulose synthase [Arabidopsis thaliana] Length = 1036 Frame 1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | | | | | 1036 0 150 300 450 600 750 900 Plus Strand HSPs: Score = 93 (32.7 bits), Expect = 1.7, P = 0.81 Identities = 36/154 (23%), Positives = 66/154 (42%), Frame = +1 Query: 4 QMGYCNYLLWAPMSLPTLCYVFVSPICLLRGIPLFPQLSSIWVLPFAYAFLATYGF-SLC 180 ++ Y N ++ S+ + Y F+ +CL G Q I L + T SL Sbjct: 809 RVAYLNVGIYPFTSIFLVVYCFLPALCLFSG-KFIVQSLDIHFLSYLLCITVTLTLISLL 867 Query: 181 EYLICGSTANGWWNLQRIKFIHRTTSYLFGFIDTMKKQLGLSQTKFIITDKVVTKDVQRE 360 E G WW ++ I T+++L + + K + + F +T K +D + + Sbjct: 868 EVKWSGIGLEEWWRNEQFWLIGGTSAHLAAVVQGLLKVIAGIEISFTLTSKASGED-EDD 926 Query: 361 VLSKKFI*IWK-LFQSCLTHI*LQLALLNLFGLLLG 465 + + +I W LF LT I ++NL +++G Sbjct: 927 IFADLYIVKWTGLFIMPLTII-----IVNLVAIVIG 957 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=6.00 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.354 0.159 0.613 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.347 0.151 0.544 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.345 0.154 0.543 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.344 0.151 0.524 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.344 0.148 0.514 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.363 0.161 0.645 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 200 200 10. 76 3 12 22 0.094 35 31 0.10 38 +2 0 200 199 10. 76 3 12 22 0.093 35 31 0.10 38 +1 0 201 200 10. 76 3 12 22 0.094 35 31 0.10 38 -1 0 201 200 10. 76 3 12 22 0.094 35 31 0.10 38 -2 0 200 200 10. 76 3 12 22 0.094 35 31 0.10 38 -3 0 200 200 10. 76 3 12 22 0.094 35 31 0.10 38 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 8:50 PM CDT May 27, 2000 Format: BLAST # of letters in database: 158,518,215 # of sequences in database: 505,245 # of database sequences satisfying E: 24 No. of states in DFA: 596 (59 KB) Total size of DFA: 260 KB (320 KB) Time to generate neighborhood: 0.03u 0.00s 0.03t Elapsed: 00:00:00 No. of threads or processors used: 4 Search cpu time: 273.95u 1.23s 275.18t Elapsed: 00:01:39 Total cpu time: 274.04u 1.26s 275.30t Elapsed: 00:01:39 Start: Wed Feb 14 17:37:38 2001 End: Wed Feb 14 17:39:17 2001
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000