WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= A01A12_CONSENSUS (586 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 2 Sequences : less than 2 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 573 94 |=============================================== 6310 479 79 |======================================= 3980 400 113 |======================================================== 2510 287 85 |========================================== 1580 202 55 |=========================== 1000 147 38 |=================== 631 109 36 |================== 398 73 16 |======== 251 57 20 |========== 158 37 10 |===== 100 27 7 |=== 63.1 20 8 |==== 39.8 12 4 |== 25.1 8 1 |: 15.8 7 1 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 6 <<<<<<<<<<<<<<<<< 10.0 6 1 |: 6.31 5 1 |: 3.98 4 0 | 2.51 4 3 |= Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|8843774|dbj|BAA97322.1|(AB020754) gene_id:MYN8.6~s... +1 512 5.4e-54 2 gi|7469320|pir||S75332fibrillin - Synechocystis sp. (... +1 84 0.82 1 gi|5020103|gb|AAD38023.1|AF148219_1(AF148219) fibrill... +1 83 0.87 1 gi|7489044|pir||T07125plastid-lipid-associated protei... +1 80 0.87 1 gi|6018307|gb|AAF01813.1|AF187532_9(AF187532) nogalon... +1 76 0.991 1 gi|4139097|gb|AAD03693.1|(AF084554) fibrillin [Brassi... +1 80 0.9992 1 Locally-aligned regions (HSPs) with respect to query sequence: Locus_ID Frame 3 Hits gi|8843774 | ______________ __________________________________________________ Query sequence: | | | | | 196 0 50 100 150 Locus_ID Frame 1 Hits gi|8843774 |_________________________________ gi|7469320 | _________ gi|5020103 | ______________ gi|7489044 | ___________________ gi|6018307 | _________________ gi|4139097 | _________________ __________________________________________________ Query sequence: | | | | | 196 0 50 100 150
Use the and icons to retrieve links to Entrez:
>gi|8843774|dbj|BAA97322.1| (AB020754) gene_id:MYN8.6~similar to unknown protein~sp|P29618 [Arabidopsis thaliana] Length = 670 Frame 3 hits (HSPs): _____ Frame 1 hits (HSPs): ___________ __________________________________________________ Database sequence: | | | | | | 670 0 150 300 450 600 Plus Strand HSPs: Score = 512 (180.2 bits), Expect = 5.4e-54, Sum P(2) = 5.4e-54 Identities = 92/131 (70%), Positives = 109/131 (83%), Frame = +1 Query: 1 LRHPFLCGPRWRVVPSMDIIRWGLGSTAMRIREKYIYRQPQQSRLAHFIDLMXMLNPHPQ 180 L+HPFLCGPRWRV PSMDIIRWGLGSTA++I E+YIYR PQ+ RLAHFI LM MLNP+P+ Sbjct: 394 LKHPFLCGPRWRVAPSMDIIRWGLGSTAVKISEEYIYRMPQRQRLAHFIGLMEMLNPYPK 453 Query: 181 PRNWLELLPGKWRLLYSTGKHVGLTLRQPPLRVLVSDVHLTVTRESKLKDN--LSFGSDI 354 P WLELLPG+WRLLYSTGKH+GLTLRQP R L+ +VHLT+TR S+ +N LSF SDI Sbjct: 454 PNCWLELLPGRWRLLYSTGKHIGLTLRQPSTRALIGNVHLTITRASESINNTSLSFTSDI 513 Query: 355 GFSVMIGQDWP 387 F+ + +DWP Sbjct: 514 RFTAITSKDWP 524 Score = 73 (25.7 bits), Expect = 5.4e-54, Sum P(2) = 5.4e-54 Identities = 21/54 (38%), Positives = 25/54 (46%), Frame = +3 Query: 429 SLFTLXSGXRLYLKQDXTTX-KFFFGPSTMFEALLXNLRAXNGKRY-PFXGFPS 584 S F L +G RLYLK++ KF G E L L K+ PF FPS Sbjct: 538 SQFRLIAGKRLYLKEEKKNIGKFSMGEPDAEEGLAEKLETEKWKKVVPFKEFPS 591 >gi|7469320|pir||S75332 fibrillin - Synechocystis sp. (strain PCC 6803) >gi|1652323|dbj|BAA17246.1| (D90904) fibrillin [Synechocystis sp.] Length = 202 Frame 1 hits (HSPs): _________ __________________________________________________ Database sequence: | | | | || 202 0 50 100 150 200 Plus Strand HSPs: Score = 84 (29.6 bits), Expect = 1.7, P = 0.82 Identities = 17/36 (47%), Positives = 23/36 (63%), Frame = +1 Query: 166 NPHPQPRNWLELLPGKWRLLYSTGKHVGLTLRQPPL 273 NPHP+P LL G WRLLY++ + + L L + PL Sbjct: 47 NPHPKPLQEKNLLDGNWRLLYTSSQSI-LGLNRLPL 81 >gi|5020103|gb|AAD38023.1|AF148219_1 (AF148219) fibrillin [Nostoc sp. PCC 8009] Length = 194 Frame 1 hits (HSPs): ______________ __________________________________________________ Database sequence: | | | | | 194 0 50 100 150 Plus Strand HSPs: Score = 83 (29.2 bits), Expect = 2.1, P = 0.87 Identities = 23/52 (44%), Positives = 29/52 (55%), Frame = +1 Query: 115 QPQQSRLAHFIDLMXMLNPHPQPRNWLELLPGKWRLLYSTGKHVGLTLRQPP 270 Q +Q+ LA L LNP P+P LL G WRLLY+T K + L L + P Sbjct: 23 QQKQAILAAIARLED-LNPTPRPVEATNLLDGNWRLLYTTSKAL-LNLDRVP 72 >gi|7489044|pir||T07125 plastid-lipid-associated protein PAP - tomato (fragment) >gi|2632090|emb|CAA75658.1| (Y15490) Plastid-lipid-Associated Protein [Lycopersicon esculentum] Length = 146 Frame 1 hits (HSPs): ____________________________ __________________________________________________ Database sequence: | | | | 146 0 50 100 Plus Strand HSPs: Score = 80 (28.2 bits), Expect = 2.1, P = 0.87 Identities = 25/78 (32%), Positives = 40/78 (51%), Frame = +1 Query: 136 AHFIDLMXML---NPHPQPRNWLELLPGKWRLLYSTGKHVGLTL-RQPPLRVLVSDVHLT 303 A ++L+ L NP+P P L LL GKW L Y++ + +L R L V V ++ T Sbjct: 62 AEIVELITQLESKNPNPAPTEALTLLNGKWILAYTSFSGLFPSLSRGNLLLVRVEEISQT 121 Query: 304 VTRES-KLKDNLSFGSDI 354 + ES ++D+ F + Sbjct: 122 IDSESFTVQDSAVFAGPL 139 >gi|6018307|gb|AAF01813.1|AF187532_9 (AF187532) nogalonic acid methyl ester cyclase [Streptomyces nogalater] Length = 134 Frame 1 hits (HSPs): __________________________ __________________________________________________ Database sequence: | | | | 134 0 50 100 Plus Strand HSPs: Score = 76 (26.8 bits), Expect = 4.7, P = 0.99 Identities = 20/69 (28%), Positives = 29/69 (42%), Frame = +1 Query: 208 GKWRLLYST--GKHVGLTLRQPPL--RVLVSDVHLTVTRESKLKDNLSFGSDIGFSVMIG 375 G W Y G+HVG + PP R VHL + K++D+ + G +G Sbjct: 64 GPWVKAYLVLYGRHVGRLVGMPPTDRRFSGEQVHLMRIVDGKIRDHRDWPDFQGTLRQLG 123 Query: 376 QDWPP*QSW 402 WP + W Sbjct: 124 DPWPDDEGW 132 >gi|4139097|gb|AAD03693.1| (AF084554) fibrillin [Brassica napus] Length = 237 Frame 1 hits (HSPs): ______________ __________________________________________________ Database sequence: | | | | | | 237 0 50 100 150 200 Plus Strand HSPs: Score = 80 (28.2 bits), Expect = 7.1, P = 1.0 Identities = 22/66 (33%), Positives = 36/66 (54%), Frame = +1 Query: 124 QSRLAHFIDLMXMLNPHPQPRNWLELLPGKWRLLYST--GKHVGLTLRQPPLRVLVSDVH 297 ++ ++ I + NP+P P L LL GKW L+Y++ G L+ R PL V V ++ Sbjct: 35 RAEISELITQLESKNPNPAPNEALFLLNGKWILVYTSFVGLFPLLSRRISPL-VKVDEIS 93 Query: 298 LTVTRES 318 T+ +S Sbjct: 94 QTIDSDS 100 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.95 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.336 0.144 0.494 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.353 0.156 0.580 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.338 0.151 0.532 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.333 0.145 0.453 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.347 0.151 0.570 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.351 0.158 0.586 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 194 183 10. 76 3 12 22 0.12 34 31 0.12 37 +2 0 195 184 10. 76 3 12 22 0.12 34 31 0.12 37 +1 0 195 184 10. 76 3 12 22 0.12 34 31 0.12 37 -1 0 195 183 10. 76 3 12 22 0.12 34 31 0.12 37 -2 0 195 185 10. 76 3 12 22 0.12 34 31 0.12 37 -3 0 194 182 10. 76 3 12 22 0.12 34 31 0.12 37 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 6 No. of states in DFA: 600 (59 KB) Total size of DFA: 225 KB (256 KB) Time to generate neighborhood: 0.01u 0.01s 0.02t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 187.35u 1.07s 188.42t Elapsed: 00:00:35 Total cpu time: 187.38u 1.10s 188.48t Elapsed: 00:00:35 Start: Mon Oct 1 16:06:57 2001 End: Mon Oct 1 16:07:32 2001
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000