WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
processing output: cycle 1 cycle 2 cycle 3 cycle 4Repeat sequence:
SW perc perc perc query position in query matching repeat position in repeat score div. del. ins. sequence begin end (left) repeat class/family begin end (left) ID 21 0.0 0.0 0.0 'E09H03_O15_16.ab1' 15 35 (482) + AT_rich Low_complexity 1 21 (0) 237 28.1 0.0 0.0 'E09H03_O15_16.ab1' 190 271 (246) + (ATG)n Simple_repeat 3 84 (0)Alignments:
21 0.00 0.00 0.00 'E09H03_O15_16.ab1' 15 35 (482) AT_rich#Low_complexity 2 22 (278) 5 'E09H03_O15_16. 15 ATTAATTTTTTAATTAATTTT 35 vv vvvvv v v vv vv AT_rich#Low_com 2 AAAATAAAATAAATAATATAA 22 Transitions / transversions = 0.00 (0 / 13) Gap_init rate = 0.00 (0 / 21), avg. gap size = 0.00 (0 / 0) 237 28.05 0.00 0.00 'E09H03_O15_16.ab1' 190 271 (246) C (CAT)n#Simple_repeat (96) 84 3 5 'E09H03_O15_16. 190 ATGATGAAGATGATGCCAGTGGAGAGGAGGACCACCAAGAGGACGAGGAT 239 v viii iv v v iv iv v v i v C (CAT)n#Simple_r 84 ATGATGATGATGATGATGATGATGATGATGATGATGATGATGATGATGAT 35 'E09H03_O15_16. 240 GATGAGGAGGAAGAGGACGATGAAGATGATGA 271 v v v v i v C (CAT)n#Simple_r 34 GATGATGATGATGATGATGATGATGATGATGA 3 Transitions / transversions = 0.53 (8 / 15) Gap_init rate = 0.00 (0 / 82), avg. gap size = 0.00 (0 / 0)Masked Sequence:
>'E09H03_O15_16.ab1' TTACGGCCGGGGGGNNNNNNNNNNNNNNNNNNNNNGACTCCTCTCTTTGA AATGTAAAGGAGAAAGCTCCATATGAAGCCAAGGCTGCAAAAAGGAAAGC TGAGTATGAAAAACTTATCAAAGCTTATGAGAAAAAGCAGGCAAGCTCTG CAGATGATGATGAATCAGACAAGTCCAAATCTGAAGTGANNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNCTGAGATTGATGTCCAAAACCTTGGTTAT GATATTTTGTTTATGAGGGAGGCTATTATATATTCTGACTGTGCATACTA GGCTACTTTTTATGTTTGTGCTGGTAGAAAAGAATGTCTTCTTTGTTGGT ACGATATATTGCCAACTACTTGAGCTTCCCTCACTTAGCTATCGTACCTG AATCACTTTTTGTGCATTGTAGCCTTAAATGGGTTGCTTTAAGTAATATA ATTTCTTTCTGACCAAASummary:
================================================== file name: /repeatmasker/tmp/RM2seq sequences: 1 total length: 517 bp GC level: 38.68 % bases masked: 103 bp ( 19.92 %) ================================================== number of length percentage elements* occupied of sequence -------------------------------------------------- SINEs: 0 0 bp 0.00 % ALUs 0 0 bp 0.00 % MIRs 0 0 bp 0.00 % LINEs: 0 0 bp 0.00 % LINE1 0 0 bp 0.00 % LINE2 0 0 bp 0.00 % L3/CR1 0 0 bp 0.00 % LTR elements: 0 0 bp 0.00 % MaLRs 0 0 bp 0.00 % ERVL 0 0 bp 0.00 % ERV_classI 0 0 bp 0.00 % ERV_classII 0 0 bp 0.00 % DNA elements: 0 0 bp 0.00 % MER1_type 0 0 bp 0.00 % MER2_type 0 0 bp 0.00 % Unclassified: 0 0 bp 0.00 % Total interspersed repeats: 0 bp 0.00 % Small RNA: 0 0 bp 0.00 % Satellites: 0 0 bp 0.00 % Simple repeats: 1 82 bp 15.86 % Low complexity: 1 21 bp 4.06 % ================================================== * most repeats fragmented by insertions or deletions have been counted as one element The sequence(s) were assumed to be of primate origin. RepeatMasker version 07/16/2000 default ProcessRepeats version 07/16/2000 Repbase version 03/31/2000