BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= STr0962
(220 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
gi|55822916|ref|YP_141357.1| hypothetical protein str0962 [... 410 e-113
gi|55820997|ref|YP_139439.1| hypothetical protein stu0962 [... 406 e-112
gi|116627768|ref|YP_820387.1| CRISPR-system related protein... 403 e-111
gi|125718069|ref|YP_001035202.1| hypothetical protein SSA_1... 239 9e-62
gi|15609958|ref|NP_217337.1| hypothetical protein Rv2821c [... 201 2e-50
gi|114567269|ref|YP_754423.1| hypothetical protein Swol_175... 192 2e-47
gi|121533438|ref|ZP_01665266.1| CRISPR-associated RAMP prot... 166 6e-40
gi|57865882|ref|YP_190002.1| CRISPR-associated TM1792 famil... 161 3e-38
gi|20090782|ref|NP_616857.1| hypothetical protein MA1933 [M... 153 7e-36
gi|15644553|ref|NP_229606.1| hypothetical protein TM1809 [T... 149 2e-34
gi|91201510|emb|CAJ74570.1| conserved hypothetical protein ... 140 5e-32
gi|14590102|ref|NP_142166.1| hypothetical protein PH0165 [P... 136 8e-31
gi|67939843|ref|ZP_00532328.1| Protein of unknown function ... 135 1e-30
gi|52425704|ref|YP_088841.1| hypothetical protein MS1649 [M... 135 2e-30
gi|154250068|ref|YP_001410893.1| CRISPR-associated RAMP pro... 134 4e-30
gi|150401504|ref|YP_001325270.1| CRISPR-associated RAMP pro... 130 4e-29
gi|134299486|ref|YP_001112982.1| CRISPR-associated RAMP pro... 130 5e-29
gi|15679091|ref|NP_276208.1| hypothetical protein MTH1080 [... 129 1e-28
gi|78043659|ref|YP_360956.1| CRISPR-associated RAMP protein... 128 2e-28
gi|113939521|ref|ZP_01425374.1| protein of unknown function... 127 3e-28
gi|68548728|ref|ZP_00588197.1| Protein of unknown function ... 125 2e-27
gi|118047322|ref|ZP_01515960.1| CRISPR-associated RAMP Csm3... 124 4e-27
gi|13540941|ref|NP_110629.1| hypothetical protein TVN0110 [... 124 6e-27
gi|76258119|ref|ZP_00765776.1| Protein of unknown function ... 123 6e-27
gi|15669865|ref|NP_248679.1| hypothetical protein MJ1669 [M... 119 1e-25
gi|150021546|ref|YP_001306900.1| CRISPR-associated RAMP pro... 115 2e-24
gi|124008646|ref|ZP_01693337.1| crispr-associated ramp prot... 115 2e-24
gi|48477124|ref|YP_022830.1| hypothetical protein PTO0052 [... 114 3e-24
gi|156741956|ref|YP_001432085.1| CRISPR-associated RAMP pro... 114 4e-24
gi|114332179|ref|YP_748401.1| CRISPR-associated RAMP protei... 113 6e-24
gi|149195213|ref|ZP_01872303.1| hypothetical protein CMTB2_... 113 8e-24
gi|30248152|ref|NP_840222.1| hypothetical protein NE0121 [N... 110 4e-23
gi|146295151|ref|YP_001178922.1| CRISPR-associated RAMP pro... 108 2e-22
gi|116748786|ref|YP_845473.1| CRISPR-associated RAMP protei... 103 7e-21
gi|119357840|ref|YP_912484.1| CRISPR-associated RAMP protei... 103 1e-20
gi|121540755|ref|ZP_01672514.1| conserved hypothetical prot... 100 9e-20
gi|46255175|ref|YP_006087.1| hypothetical protein TT_P0104 ... 88 4e-16
gi|55978332|ref|YP_145388.1| hypothetical protein TTHB149 [... 87 5e-16
gi|116620021|ref|YP_822177.1| CRISPR-associated RAMP protei... 80 7e-14
gi|119873035|ref|YP_931042.1| CRISPR-associated RAMP protei... 75 4e-12
gi|20094752|ref|NP_614599.1| Predicted component of a therm... 73 1e-11
gi|18311720|ref|NP_558387.1| hypothetical protein PAE0114 [... 72 2e-11
gi|119720269|ref|YP_920764.1| CRISPR-associated RAMP protei... 67 5e-10
gi|73668880|ref|YP_304895.1| hypothetical protein Mbar_A135... 55 2e-06
gi|126465220|ref|YP_001040329.1| CRISPR-associated RAMP pro... 55 3e-06
gi|21229459|ref|NP_635381.1| hypothetical protein MM_3357 [... 55 4e-06
>gi|55822916|ref|YP_141357.1| hypothetical protein str0962 [Streptococcus thermophilus CNRZ1066]
gi|55738901|gb|AAV62542.1| conserved hypothetical protein [Streptococcus thermophilus
CNRZ1066]
Length = 220
Score = 410 bits (1055), Expect = e-113, Method: Composition-based stats.
Identities = 220/220 (100%), Positives = 220/220 (100%)
Query: 1 MTFAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLL 60
MTFAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLL
Sbjct: 1 MTFAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLL 60
Query: 61 AKVYNEKVAEKPSDDSDILSRLFGNSKDKRFKMGRLIFRDAFLSNADELDSLGVRSYTEV 120
AKVYNEKVAEKPSDDSDILSRLFGNSKDKRFKMGRLIFRDAFLSNADELDSLGVRSYTEV
Sbjct: 61 AKVYNEKVAEKPSDDSDILSRLFGNSKDKRFKMGRLIFRDAFLSNADELDSLGVRSYTEV 120
Query: 121 KFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIRDGLKLLELD 180
KFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIRDGLKLLELD
Sbjct: 121 KFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIRDGLKLLELD 180
Query: 181 YLGGSGSRGYGKVAFENLKATTVFGNYDVKTLNELLTAEV 220
YLGGSGSRGYGKVAFENLKATTVFGNYDVKTLNELLTAEV
Sbjct: 181 YLGGSGSRGYGKVAFENLKATTVFGNYDVKTLNELLTAEV 220
>gi|55820997|ref|YP_139439.1| hypothetical protein stu0962 [Streptococcus thermophilus LMG 18311]
gi|55736982|gb|AAV60624.1| conserved hypothetical protein [Streptococcus thermophilus LMG
18311]
Length = 220
Score = 406 bits (1043), Expect = e-112, Method: Composition-based stats.
Identities = 218/220 (99%), Positives = 218/220 (99%)
Query: 1 MTFAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLL 60
MTFAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLL
Sbjct: 1 MTFAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLL 60
Query: 61 AKVYNEKVAEKPSDDSDILSRLFGNSKDKRFKMGRLIFRDAFLSNADELDSLGVRSYTEV 120
AKVYNEKVAEKPSDDSDILSRLFGNSKDKRFKMGRLIFRDAFLSNADELDSLGVRSYTEV
Sbjct: 61 AKVYNEKVAEKPSDDSDILSRLFGNSKDKRFKMGRLIFRDAFLSNADELDSLGVRSYTEV 120
Query: 121 KFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIRDGLKLLELD 180
KFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEED KVIRDGLKLLELD
Sbjct: 121 KFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDFKVIRDGLKLLELD 180
Query: 181 YLGGSGSRGYGKVAFENLKATTVFGNYDVKTLNELLTAEV 220
YLGGSGSRGYGKVAFE LKATTVFGNYDVKTLNELLTAEV
Sbjct: 181 YLGGSGSRGYGKVAFEKLKATTVFGNYDVKTLNELLTAEV 220
>gi|116627768|ref|YP_820387.1| CRISPR-system related protein, RAMP superfamily [Streptococcus
thermophilus LMD-9]
gi|116101045|gb|ABJ66191.1| CRISPR-system related protein, RAMP superfamily [Streptococcus
thermophilus LMD-9]
Length = 220
Score = 403 bits (1036), Expect = e-111, Method: Composition-based stats.
Identities = 216/220 (98%), Positives = 217/220 (98%)
Query: 1 MTFAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLL 60
MTFAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITN+PIIPGSSLKGKMRTLL
Sbjct: 1 MTFAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNIPIIPGSSLKGKMRTLL 60
Query: 61 AKVYNEKVAEKPSDDSDILSRLFGNSKDKRFKMGRLIFRDAFLSNADELDSLGVRSYTEV 120
AKVYNEKVAEKPSDDSDILSRLFGNSKDKRFKMGRLIFRDAFLSNADELDSLGVRSYTEV
Sbjct: 61 AKVYNEKVAEKPSDDSDILSRLFGNSKDKRFKMGRLIFRDAFLSNADELDSLGVRSYTEV 120
Query: 121 KFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIRDGLKLLELD 180
KFENTIDRITAEANPRQIERAIR STFDFELIYEITDENENQVEED KVIRDGLKLLELD
Sbjct: 121 KFENTIDRITAEANPRQIERAIRNSTFDFELIYEITDENENQVEEDFKVIRDGLKLLELD 180
Query: 181 YLGGSGSRGYGKVAFENLKATTVFGNYDVKTLNELLTAEV 220
YLGGSGSRGYGKVAFE LKATTVFGNYDVKTLNELLTAEV
Sbjct: 181 YLGGSGSRGYGKVAFEKLKATTVFGNYDVKTLNELLTAEV 220
>gi|125718069|ref|YP_001035202.1| hypothetical protein SSA_1249 [Streptococcus sanguinis SK36]
gi|125497986|gb|ABN44652.1| Conserved hypothetical protein [Streptococcus sanguinis SK36]
Length = 236
Score = 239 bits (610), Expect = 9e-62, Method: Composition-based stats.
Identities = 127/209 (60%), Positives = 164/209 (78%), Gaps = 4/209 (1%)
Query: 1 MTFAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLL 60
MT++K+K S I L++GLHIG S+AFAAIGA ++P+IKDP+TNLP IPGS+LKGKMR+LL
Sbjct: 1 MTYSKVKISGIIELKSGLHIGTSNAFAAIGATNNPIIKDPLTNLPYIPGSTLKGKMRSLL 60
Query: 61 AKV-YNEKVAEKPSDDSDILSRLFGNSKDKRFKMGRLIFRDAFLSNADELDSLGVRSYTE 119
+ YN EK S DS +SRLFGNS+ +K+GRL+FRDAFL+N +EL V +YTE
Sbjct: 61 YRTDYNSNTTEKLSKDSLEISRLFGNSET--YKIGRLVFRDAFLNNKEELGKR-VSTYTE 117
Query: 120 VKFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIRDGLKLLEL 179
VKFENTI+RITAEA PRQ+ERAIR S F FE+IY I ++ +VE+D ++++ G +LLE
Sbjct: 118 VKFENTINRITAEATPRQVERAIRESEFAFEIIYSIQEKELTEVEKDMEILKAGFELLEW 177
Query: 180 DYLGGSGSRGYGKVAFENLKATTVFGNYD 208
DY+GGSGSRGYGKV F+N + TVFG +D
Sbjct: 178 DYIGGSGSRGYGKVCFKNFEVKTVFGEFD 206
>gi|15609958|ref|NP_217337.1| hypothetical protein Rv2821c [Mycobacterium tuberculosis H37Rv]
gi|15842362|ref|NP_337399.1| CRISPR-associated TM1792 family protein [Mycobacterium tuberculosis
CDC1551]
gi|31793997|ref|NP_856490.1| hypothetical protein Mb2845c [Mycobacterium bovis AF2122/97]
gi|81253177|ref|ZP_00877738.1| COG1337: Uncharacterized protein predicted to be involved in DNA
repair (RAMP superfamily) [Mycobacterium tuberculosis C]
gi|121638700|ref|YP_978924.1| hypothetical protein BCG_2840c [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|148662663|ref|YP_001284186.1| hypothetical protein MRA_2845 [Mycobacterium tuberculosis H37Ra]
gi|148824010|ref|YP_001288764.1| hypothetical protein TBFG_12835 [Mycobacterium tuberculosis F11]
gi|1648899|emb|CAB03665.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium tuberculosis H37Rv]
gi|13882660|gb|AAK47213.1| CRISPR-associated protein, TM1792 family [Mycobacterium
tuberculosis CDC1551]
gi|31619591|emb|CAD95030.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium bovis AF2122/97]
gi|121494348|emb|CAL72828.1| Conserved hypothetical protein [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|124601975|gb|EAY60985.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|134150983|gb|EBA43028.1| conserved hypothetical protein [Mycobacterium tuberculosis str.
Haarlem]
gi|148506815|gb|ABQ74624.1| conserved hypothetical protein [Mycobacterium tuberculosis H37Ra]
gi|148722537|gb|ABR07162.1| conserved hypothetical protein [Mycobacterium tuberculosis F11]
Length = 236
Score = 201 bits (511), Expect = 2e-50, Method: Composition-based stats.
Identities = 109/228 (47%), Positives = 156/228 (68%), Gaps = 16/228 (7%)
Query: 2 TFAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLA 61
++AKI+ + + + TGL IG D F+AIGA+D PV++DP++ LP+IPG+SLKGK+RTLL+
Sbjct: 4 SYAKIEITGTLTVLTGLQIGAGDGFSAIGAVDKPVVRDPLSRLPMIPGTSLKGKVRTLLS 63
Query: 62 KVY---NEKVAEKPSDDSDILSRLFGNSKDKRFKMGRLIFRDAFLSNADELDSLGVRSYT 118
+ Y E KP++D + RLFG++++ + GRL+FRD L+N D+L++ G ++ T
Sbjct: 64 RQYGADTETFYRKPNEDHAHIRRLFGDTEE--YMTGRLVFRDTKLTNKDDLEARGAKTLT 121
Query: 119 EVKFENTIDRITAEANPRQIERAIRTSTFDFELIYEIT-----DENE------NQVEEDS 167
EVKFEN I+R+TA+AN RQ+ER I S F F L+YE++ +E + +++ ED
Sbjct: 122 EVKFENAINRVTAKANLRQMERVIPGSEFAFSLVYEVSFGTPGEEQKASLPSSDEIIEDF 181
Query: 168 KVIRDGLKLLELDYLGGSGSRGYGKVAFENLKATTVFGNYDVKTLNEL 215
I GLKLLELDYLGGSG+RGYG+V F NLKA G D L +L
Sbjct: 182 NAIARGLKLLELDYLGGSGTRGYGQVKFSNLKARAAVGALDGSLLEKL 229
>gi|114567269|ref|YP_754423.1| hypothetical protein Swol_1754 [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
gi|114338204|gb|ABI69052.1| conserved hypothetical protein [Syntrophomonas wolfei subsp. wolfei
str. Goettingen]
Length = 233
Score = 192 bits (487), Expect = 2e-47, Method: Composition-based stats.
Identities = 106/202 (52%), Positives = 138/202 (68%), Gaps = 9/202 (4%)
Query: 3 FAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAK 62
+ KI ++ + TG+HIGGS AF+AIGA+DSPVI+D T P++PGSSLKGKMRTLLAK
Sbjct: 4 YGKILIKCKMTVLTGMHIGGSSAFSAIGAVDSPVIRDSFTGEPMLPGSSLKGKMRTLLAK 63
Query: 63 -VYNEKVAEKPSDDSDILSRLFGNSKDKRF----KMGRLIFRDAFLSNADELDSLGVRSY 117
+ N + ++P+ D + + RLFG + D R K RL F DAFL NAD L +
Sbjct: 64 AIKNHYITQEPAKDPEEIKRLFGTAGDNRKQEWPKAARLQFYDAFLVNADTLKNRS--GM 121
Query: 118 TEVKFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIRDGLKLL 177
TEVKFENTI+R+TA ANPRQIER +R S F L+Y++ E+ + ++ D I GLKLL
Sbjct: 122 TEVKFENTINRLTAIANPRQIERVVRGSEFAINLVYDM--EDTDSLKSDFTNIARGLKLL 179
Query: 178 ELDYLGGSGSRGYGKVAFENLK 199
+DYLGG GSRGYGKV F + +
Sbjct: 180 SMDYLGGHGSRGYGKVGFTDFE 201
>gi|121533438|ref|ZP_01665266.1| CRISPR-associated RAMP protein, Csm3 family [Thermosinus
carboxydivorans Nor1]
gi|121307997|gb|EAX48911.1| CRISPR-associated RAMP protein, Csm3 family [Thermosinus
carboxydivorans Nor1]
Length = 238
Score = 166 bits (421), Expect = 6e-40, Method: Composition-based stats.
Identities = 103/205 (50%), Positives = 133/205 (64%), Gaps = 8/205 (3%)
Query: 3 FAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAK 62
F KI A++ L TGLHIG S A+IG +D VI+DP+T P+IPGSSLKGK+RTLLAK
Sbjct: 10 FGKINLQAKLSLVTGLHIGASKDNASIGDVDCIVIRDPLTRRPMIPGSSLKGKIRTLLAK 69
Query: 63 VYNEK-VAEKPSDDSDILSRLFGNSKDKRFKMGRLIFRDAFLSN--ADELDSLGVRSY-T 118
+E V +P D ++RLFG+SK K RL F D F+++ A+++ + Y T
Sbjct: 70 ALSENPVLGEPDSDPIEVTRLFGSSKP--VKHARLQFYDVFMTDDSANKISRMDTDLYLT 127
Query: 119 EVKFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIRDGLKLLE 178
EVKFEN IDR+TA ANPRQ+ER + F F L+Y I E ++ D K + DGLKLLE
Sbjct: 128 EVKFENAIDRLTAVANPRQVERVPAGAEFAFRLVYNI--EALDEAAADLKALADGLKLLE 185
Query: 179 LDYLGGSGSRGYGKVAFENLKATTV 203
LDYLGG GSRGYG+V F + +
Sbjct: 186 LDYLGGHGSRGYGRVKFADFHINVI 210
>gi|57865882|ref|YP_190002.1| CRISPR-associated TM1792 family protein [Staphylococcus epidermidis
RP62A]
gi|57636540|gb|AAW53328.1| CRISPR-associated protein, TM1792 family [Staphylococcus
epidermidis RP62A]
Length = 214
Score = 161 bits (407), Expect = 3e-38, Method: Composition-based stats.
Identities = 106/212 (50%), Positives = 136/212 (64%), Gaps = 4/212 (1%)
Query: 3 FAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAK 62
++KIK S I + TGLHIGG + IGAIDSPV++D T LPIIPGSS+KGKMR LLAK
Sbjct: 2 YSKIKISGTIEVVTGLHIGGGGESSMIGAIDSPVVRDLQTKLPIIPGSSIKGKMRNLLAK 61
Query: 63 VYNEKVA-EKPSDDSDILSRLFGNSKDKRFKMGRLIFRDAFLS-NADELDSLGVRSYTEV 120
+ K+ E + D + + RLFG+S+ + RL DAF S E + +YTE
Sbjct: 62 HFGLKMKQESHNQDDERVLRLFGSSEKGNIQRARLQISDAFFSEKTKEHFAQNDIAYTET 121
Query: 121 KFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIRDGLKLLELD 180
KFENTI+R+TA ANPRQIER R S FDF IY + + E+QVE+D + I + LLE D
Sbjct: 122 KFENTINRLTAVANPRQIERVTRGSEFDFVFIYNV--DEESQVEDDFENIEKAIHLLEND 179
Query: 181 YLGGSGSRGYGKVAFENLKATTVFGNYDVKTL 212
YLGG G+RG G++ F++ TV G YD L
Sbjct: 180 YLGGGGTRGNGRIQFKDTNIETVVGEYDSTNL 211
>gi|20090782|ref|NP_616857.1| hypothetical protein MA1933 [Methanosarcina acetivorans C2A]
gi|19915844|gb|AAM05337.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
Length = 258
Score = 153 bits (386), Expect = 7e-36, Method: Composition-based stats.
Identities = 100/214 (46%), Positives = 132/214 (61%), Gaps = 18/214 (8%)
Query: 4 AKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAKV 63
KI + ++++ TG+HIG S IG IDSPVI+DP+T+ P IPGSSLKGK+R+L K
Sbjct: 12 GKILITGEMKVVTGMHIGASKETVKIGGIDSPVIRDPMTDFPYIPGSSLKGKLRSLSEKS 71
Query: 64 YNEK---VAEKP-------SDDSDILSRLFGNSK-----DKRFKMGRLIFRDAFLSNADE 108
++ ++ P +D+ + RLFG+SK +++ RLI RD LSN E
Sbjct: 72 LEKELFAISHNPDINIHVCTDEHCEICRLFGSSKKDKNEEQKHIPSRLIVRDMHLSNDKE 131
Query: 109 LDSLGV-RSYTEVKFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDS 167
L + YTE KFEN+IDRI++ ANPRQIER + F FEL+Y+ +E E EED
Sbjct: 132 LFDIDTGLPYTEWKFENSIDRISSAANPRQIERIPAGAKFKFELVYDAEEETEL--EEDI 189
Query: 168 KVIRDGLKLLELDYLGGSGSRGYGKVAFENLKAT 201
I+ LKLLE D LGG GSRGYGKV FE + T
Sbjct: 190 TRIQMALKLLEQDALGGHGSRGYGKVKFERVTYT 223
>gi|15644553|ref|NP_229606.1| hypothetical protein TM1809 [Thermotoga maritima MSB8]
gi|148270225|ref|YP_001244685.1| CRISPR-associated RAMP protein, Csm3 family [Thermotoga petrophila
RKU-1]
gi|4982390|gb|AAD36872.1|AE001818_7 conserved hypothetical protein [Thermotoga maritima MSB8]
gi|147735769|gb|ABQ47109.1| CRISPR-associated RAMP protein, Csm3 family [Thermotoga petrophila
RKU-1]
Length = 247
Score = 149 bits (375), Expect = 2e-34, Method: Composition-based stats.
Identities = 91/203 (44%), Positives = 126/203 (62%), Gaps = 12/203 (5%)
Query: 3 FAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAK 62
K +I LETGL IGG + IG ID+PVI++P+T P IPGSS+KGKMR+L+ +
Sbjct: 6 LGKYIIKGKIILETGLRIGGQELGVNIGGIDNPVIRNPLTGEPYIPGSSVKGKMRSLMER 65
Query: 63 VYN-----EKVAEKPSDDSDI-LSRLFGN-SKDKRFKMGRLIFRDAFLSNADELDSLGVR 115
+ N KV ++ + + R+FG+ SK+ RL+ RDAFL+ + L +
Sbjct: 66 LLNLDISGNKVRRHECEERECKVCRVFGSTSKEGNNIPSRLLVRDAFLTEDSKTKLLSME 125
Query: 116 S---YTEVKFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIRD 172
+ YTE K EN +DR+T +A+PR ER + F+FE+IY T ENE ++ED + I
Sbjct: 126 TDLPYTEWKTENALDRVTCKADPRSFERIPAGAEFEFEIIY--TAENEKHIKEDLENIAT 183
Query: 173 GLKLLELDYLGGSGSRGYGKVAF 195
L+LLE DYLGG+GSRGYGKV F
Sbjct: 184 ALELLEDDYLGGNGSRGYGKVKF 206
>gi|91201510|emb|CAJ74570.1| conserved hypothetical protein [Candidatus Kuenenia
stuttgartiensis]
Length = 265
Score = 140 bits (354), Expect = 5e-32, Method: Composition-based stats.
Identities = 86/208 (41%), Positives = 124/208 (59%), Gaps = 17/208 (8%)
Query: 3 FAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAK 62
KI +I+ ETGLHIGGS IG ID+PV++DP+T P IPGSSLKGK+R+L +
Sbjct: 10 LGKIFILGKIKCETGLHIGGSKEKMDIGGIDAPVMRDPLTREPYIPGSSLKGKLRSLFER 69
Query: 63 VYNEKV---------AEKPSDDSDILSRLFGNS---KDKRFKMGRLIFRDAFL--SNADE 108
+ N++ + +D + RLFG++ D+ RL RD L + ++
Sbjct: 70 MENKQFNRSGGGDVWRHECTDSQCYVCRLFGSTGSNADENLP-SRLSVRDCLLEEESREK 128
Query: 109 LDSLGV-RSYTEVKFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDS 167
L + YTE KFEN++DR+TA ANPRQ+ER + F+FE++Y + N + ++D
Sbjct: 129 LKKIDTGLQYTEWKFENSLDRVTAAANPRQLERIPAGAKFEFEIVYNVEASN-GEAKKDL 187
Query: 168 KVIRDGLKLLELDYLGGSGSRGYGKVAF 195
+ + + LL+ DYLGG GSRGYGKV F
Sbjct: 188 SNLLELISLLQDDYLGGHGSRGYGKVGF 215
>gi|14590102|ref|NP_142166.1| hypothetical protein PH0165 [Pyrococcus horikoshii OT3]
gi|3256551|dbj|BAA29234.1| 291aa long hypothetical protein [Pyrococcus horikoshii OT3]
Length = 291
Score = 136 bits (343), Expect = 8e-31, Method: Composition-based stats.
Identities = 90/235 (38%), Positives = 118/235 (50%), Gaps = 44/235 (18%)
Query: 3 FAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAK 62
+ KI S +I TGLHIG + IG ID+PVIKDP T LP IPGSSLKG++R+L
Sbjct: 6 YGKIIISGEIEAVTGLHIGSQREVSEIGGIDNPVIKDPHTGLPYIPGSSLKGRLRSLFEI 65
Query: 63 VYNEKVAEKPSDDSDI------------------------------------------LS 80
N ++ E S S + +
Sbjct: 66 YVNTRLDELKSKYSSLSNYSKGSCRDVGKENCGKFFNKKLNNVWIHVCSTYEMARNCPVC 125
Query: 81 RLFGNSKDKRFKMGRLIFRDAFLSNADELDSLGVRSYTEVKFENTIDRITAEANPRQIER 140
RL+G+S + RLI RDAFL+ + + TE K E IDR+T++ANPR ER
Sbjct: 126 RLYGSSGKESNFPSRLIVRDAFLTEEWKKKWENGEAITEAKIEVGIDRVTSQANPRTTER 185
Query: 141 AIRTSTFDFELIYEITDENENQVEEDSKVIRDGLKLLELDYLGGSGSRGYGKVAF 195
+ + FDFE+IY I D E ++D + + + LLE YLGGSGSRGYGKV F
Sbjct: 186 VVAGTRFDFEIIYTIEDLKE--WKDDLRNLLTSMLLLEDSYLGGSGSRGYGKVRF 238
>gi|67939843|ref|ZP_00532328.1| Protein of unknown function DUF324 [Chlorobium phaeobacteroides
BS1]
gi|67913933|gb|EAM63296.1| Protein of unknown function DUF324 [Chlorobium phaeobacteroides
BS1]
Length = 244
Score = 135 bits (341), Expect = 1e-30, Method: Composition-based stats.
Identities = 91/212 (42%), Positives = 125/212 (58%), Gaps = 23/212 (10%)
Query: 4 AKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAKV 63
K++ S +I+ ETGLHIGGS IG ID VIK P +P IPGSSLKGK+R++LA+
Sbjct: 6 GKVRLSGRIKAETGLHIGGSKTALDIGGIDLNVIKTP-GGVPFIPGSSLKGKLRSILARE 64
Query: 64 YNEKVAEKPS----------DDSDILSRLFGNSKDKRFKM-GRLIFRDAFLSNA---DEL 109
+ K +D I+ LFGN+ DK+ R+I RDAFL D +
Sbjct: 65 HGSMAISKKEKGVRDGDTTDEDIPIIIELFGNAGDKKDACPSRIIVRDAFLDKEFFNDPV 124
Query: 110 DSLGVRS-----YTEVKFENTIDRITAEA-NPRQIERAIRTSTFDFELIYEITDENENQV 163
+ G+ S YTE K+ENTI R T A NPR +ER + FDFE+IY + ++++ ++
Sbjct: 125 KNEGIFSELEMDYTESKWENTILRKTGTAINPRPVERVPVGTKFDFEIIYNVFNDDKKKL 184
Query: 164 EEDSKVIRDGLKLLELDYLGGSGSRGYGKVAF 195
+ I L++LE DYLGG GSRGYGK++F
Sbjct: 185 HLNE--IIKALRILEDDYLGGLGSRGYGKISF 214
>gi|52425704|ref|YP_088841.1| hypothetical protein MS1649 [Mannheimia succiniciproducens MBEL55E]
gi|52307756|gb|AAU38256.1| unknown [Mannheimia succiniciproducens MBEL55E]
Length = 231
Score = 135 bits (340), Expect = 2e-30, Method: Composition-based stats.
Identities = 97/214 (45%), Positives = 127/214 (59%), Gaps = 25/214 (11%)
Query: 6 IKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAKVYN 65
I+ A++ L+TGLHIG D+ IG ID+ VIK IT P IPGSSLKGK+RTLL + Y+
Sbjct: 7 IEIKAKLVLKTGLHIGAGDSEMHIGGIDNSVIKHSITQSPYIPGSSLKGKIRTLL-EWYS 65
Query: 66 EKVAEKPSDDSDILS-----------RLFG------NSKD--KRFKMGRLIFRDAFLS-N 105
+V +P +++ S RLFG N+K+ + K RL F D L+ +
Sbjct: 66 GEVKSEPLSINNVASANNSENVKNILRLFGFAGHSENNKELCQELKSSRLAFWDCALNED 125
Query: 106 ADELDSLGVRSYTEVKFENTIDRITAEA-NPRQIERAIRTSTFDFELIYEITDENENQVE 164
+++ + TE K ENTIDRITA A NPRQ ER + FDF+L + E E
Sbjct: 126 WEKMIREDNQLLTEAKSENTIDRITATAGNPRQTERVPAGAEFDFKLALR---QFEGDSE 182
Query: 165 EDSKVIRDGLKLLELDYLGGSGSRGYGKVAFENL 198
E K++ GL+LLELD LGGSGSRGYGKV F+ L
Sbjct: 183 ELVKLVLKGLRLLELDSLGGSGSRGYGKVEFQGL 216
>gi|154250068|ref|YP_001410893.1| CRISPR-associated RAMP protein, Csm3 family [Fervidobacterium
nodosum Rt17-B1]
gi|154154004|gb|ABS61236.1| CRISPR-associated RAMP protein, Csm3 family [Fervidobacterium
nodosum Rt17-B1]
Length = 242
Score = 134 bits (337), Expect = 4e-30, Method: Composition-based stats.
Identities = 91/202 (45%), Positives = 125/202 (61%), Gaps = 18/202 (8%)
Query: 10 AQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAKVYNEKVA 69
A IRL TGLHIG S IG +D+PVIKDP P IPGS+LKGK+R +L++ +N KV
Sbjct: 14 ADIRLITGLHIGTSKDDLEIGGLDNPVIKDP-EGKPYIPGSTLKGKLR-VLSEFFNGKVD 71
Query: 70 E--KP---SDDSDILSRLFGN----SKDKRFKMGRLIFRDAFL--SNADELDSLGVRSYT 118
E KP D+ ++ LFG ++ K + RLI RDA++ + +L+ +T
Sbjct: 72 ESGKPHACDDEKCVVCGLFGTGILRTESKTLYLRRLIVRDAYIDPESLKDLEEYLETKWT 131
Query: 119 EVKFENTIDRITAEANPRQIERAIRTSTFDFELIYEITD-ENENQVEEDSKVIRDGLKLL 177
EVK EN I+R+T+ ANPR ER + F E + I + +NE+ + E K+ +KLL
Sbjct: 132 EVKHENMINRLTSRANPRPQERVPAGAVFKAEFVVNIFEGDNEDYLNELLKI----MKLL 187
Query: 178 ELDYLGGSGSRGYGKVAFENLK 199
E DY+GGSGSRGYGK+ FEN+K
Sbjct: 188 EDDYIGGSGSRGYGKIKFENIK 209
>gi|150401504|ref|YP_001325270.1| CRISPR-associated RAMP protein, Csm3 family [Methanococcus aeolicus
Nankai-3]
gi|150014207|gb|ABR56658.1| CRISPR-associated RAMP protein, Csm3 family [Methanococcus aeolicus
Nankai-3]
Length = 242
Score = 130 bits (328), Expect = 4e-29, Method: Composition-based stats.
Identities = 91/204 (44%), Positives = 117/204 (57%), Gaps = 15/204 (7%)
Query: 4 AKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLL--- 60
K+ +I ETGLHIGG IG D+PVI+D + IPGSSLKGK+R+LL
Sbjct: 9 GKLILKGKIITETGLHIGGIAETLKIGGSDNPVIRDKNGRV-FIPGSSLKGKIRSLLEVK 67
Query: 61 -AKVYNEKVAEKPSDDSDI-LSRLFGNSKDKRFKM-GRLIFRDAFLSNADELDSLGVRSY 117
K K +P + D + LFG K + K R I RDA+L+ V Y
Sbjct: 68 DGKYKLNKGEAQPCNCGDCPICMLFGPHKSENIKEPARAIVRDAYLNKEKP-----VHEY 122
Query: 118 TEVKFENTIDRITAEA-NPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIRDGLKL 176
E+K EN IDR+ A +PR IER + S FD+E+++ I DEN ++ E K +G+KL
Sbjct: 123 LEIKPENVIDRVKGTAQHPRFIERVVAGSEFDYEVVFNIYDENRDK--ELIKKFIEGIKL 180
Query: 177 LELDYLGGSGSRGYGKVAFENLKA 200
LE DYLGGSGSRGYGK+ FE L A
Sbjct: 181 LEDDYLGGSGSRGYGKIKFEKLTA 204
>gi|134299486|ref|YP_001112982.1| CRISPR-associated RAMP protein, Csm3 family [Desulfotomaculum
reducens MI-1]
gi|134052186|gb|ABO50157.1| CRISPR-associated RAMP protein, Csm3 family [Desulfotomaculum
reducens MI-1]
Length = 215
Score = 130 bits (328), Expect = 5e-29, Method: Composition-based stats.
Identities = 84/200 (42%), Positives = 117/200 (58%), Gaps = 15/200 (7%)
Query: 10 AQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAKVYNEKVA 69
+I TG+ IGGS IG ID+PVIK P+ N P IPGSSLKGKMR+ + K+ KV
Sbjct: 11 GKIECITGMRIGGSAEAIEIGGIDNPVIKHPVNNEPYIPGSSLKGKMRSQMEKI-EGKVN 69
Query: 70 EKP---SDDSDILSRLFGNSKDKRFKMG--RLIFRDAFLSNADELDSLGV----RSYTEV 120
EKP +D ++ R+FG + +G R++ RDA LS+ + + + +SY E+
Sbjct: 70 EKPCGCADKGCMVCRVFGPHNRPKHDLGPTRILVRDAMLSDESRQEMIRIIQEGKSYIEI 129
Query: 121 KFENTIDRITAEAN-PRQIERAIRTSTFDFELIYEITDENENQVEEDS-KVIRDGLKLLE 178
K EN I R T AN PR ER + FDFE++ ++ D + +E+D ++ LK +E
Sbjct: 130 KTENIIVRTTGVANHPRTQERVPAGAKFDFEIVVQVFDID---LEKDVIDFVKKALKSVE 186
Query: 179 LDYLGGSGSRGYGKVAFENL 198
YLG SGSRGYG+V F L
Sbjct: 187 NSYLGSSGSRGYGQVHFTKL 206
>gi|15679091|ref|NP_276208.1| hypothetical protein MTH1080 [Methanothermobacter
thermautotrophicus str. Delta H]
gi|41688748|sp|O27152|Y1080_METTH Uncharacterized protein MTH_1080
gi|2622180|gb|AAB85569.1| conserved protein [Methanothermobacter thermautotrophicus str.
Delta H]
Length = 245
Score = 129 bits (324), Expect = 1e-28, Method: Composition-based stats.
Identities = 83/208 (39%), Positives = 119/208 (57%), Gaps = 21/208 (10%)
Query: 9 SAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAKVYNEKV 68
+ +I TGLHIG S IG D+P+I+DP+T LP IPGSS+KGKMR+LL +
Sbjct: 10 TGEILCRTGLHIGVSKDSIEIGGSDNPIIRDPVTRLPYIPGSSIKGKMRSLLELELDRVS 69
Query: 69 AEKPSDDSDI-LSRLFGNSKDKRFKMG-------------RLIFRDAFLSN---ADELDS 111
P + R+FG++ D G R+I RDAF ++ + +S
Sbjct: 70 NGGPCKCGKCEICRVFGSAADSSSSSGPTRTDSSSSSGPTRIIVRDAFPTDETVEEWKES 129
Query: 112 LGVRSYTEVKFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIR 171
V E+K+EN ++RIT+ ANPR ER R S F FE+I D + + + +++
Sbjct: 130 SEVVEGAELKYENNLNRITSMANPRNQERVPRGSKFGFEIIVSEYDGDSDNL----RIVL 185
Query: 172 DGLKLLELDYLGGSGSRGYGKVAFENLK 199
+GL+LLE YLGGSG+RGYGK+ F+N+K
Sbjct: 186 EGLRLLEDSYLGGSGTRGYGKIEFKNIK 213
>gi|78043659|ref|YP_360956.1| CRISPR-associated RAMP protein, Csm3 family [Carboxydothermus
hydrogenoformans Z-2901]
gi|77995774|gb|ABB14673.1| CRISPR-associated RAMP protein, Csm3 family [Carboxydothermus
hydrogenoformans Z-2901]
Length = 266
Score = 128 bits (322), Expect = 2e-28, Method: Composition-based stats.
Identities = 94/227 (41%), Positives = 125/227 (55%), Gaps = 36/227 (15%)
Query: 3 FAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLL-- 60
KI +I+ TGLHIGG+ IG +D+ VIKD P IPGSSLKGK+R+LL
Sbjct: 7 LGKIVIKGKIKALTGLHIGGAQGNTEIGGVDNSVIKDE-EGKPYIPGSSLKGKLRSLLEN 65
Query: 61 -------AKVYNEKVAEKP------SDDSDILSRLFGNSKDK-------------RFKMG 94
K+ +K +P ++ + +FG + K
Sbjct: 66 HEGYLSATKLVLQKKGAEPIRIHICNEPECPVCIIFGRNHGKYTLADNQTELVISNATPT 125
Query: 95 RLIFRDAFL---SNADELDSLGVRSYTEVKFENTIDRITAEANPRQIERAIRTSTFDFEL 151
RL FRDA L S D +L + +TEVKFEN+IDRIT+ ANPRQ ER R + F FEL
Sbjct: 126 RLYFRDACLDEESIKDIKPNLDLE-WTEVKFENSIDRITSAANPRQTERVPRGAEFCFEL 184
Query: 152 IYEITDENENQVEEDSKVIRDGLKLLELDYLGGSGSRGYGKVAFENL 198
+Y + E + ++ SKV+ +KLLE DYLGGSGSRGYGK+ F++L
Sbjct: 185 VYNVLREEDKELF--SKVL-TAMKLLEDDYLGGSGSRGYGKIMFKDL 228
>gi|113939521|ref|ZP_01425374.1| protein of unknown function DUF324 [Herpetosiphon aurantiacus ATCC
23779]
gi|113898819|gb|EAU17828.1| protein of unknown function DUF324 [Herpetosiphon aurantiacus ATCC
23779]
Length = 255
Score = 127 bits (320), Expect = 3e-28, Method: Composition-based stats.
Identities = 86/213 (40%), Positives = 124/213 (58%), Gaps = 22/213 (10%)
Query: 4 AKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAKV 63
+I + +I TGLHIGG+ AIG +D+PVI++P + P +PGSSL+GKMR+ L K+
Sbjct: 12 GRIFVNFEIHALTGLHIGGAAGTLAIGNVDNPVIRNPFNSEPYVPGSSLRGKMRSQLEKL 71
Query: 64 Y----NEKVAEKPS----------DDSDILSRLFG-NSKDKRFKMGRLIFRDAFLSNADE 108
Y N + S D+S +L +FG + D + RLI RDA LS
Sbjct: 72 YGLAQNTSIGRDVSIHSAKTQAEYDNSPVL-HIFGIPASDFLTEPIRLIVRDAALSEQTR 130
Query: 109 LDSLGVRS---YTEVKFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEE 165
R+ YTEVK+E IDR+T+ A PRQ ER + FD L + + ++ + ++
Sbjct: 131 AAFRDARTDLPYTEVKWEAAIDRVTSAATPRQQERVPAGAIFDGALTFTLYNDQDTKLF- 189
Query: 166 DSKVIRDGLKLLELDYLGGSGSRGYGKVAFENL 198
+ VIR GL+L+E DYLGG G+RG G+VAF+N+
Sbjct: 190 -NTVIR-GLELVEEDYLGGQGARGSGQVAFKNI 220
>gi|68548728|ref|ZP_00588197.1| Protein of unknown function DUF324 [Pelodictyon phaeoclathratiforme
BU-1]
gi|68244286|gb|EAN26478.1| Protein of unknown function DUF324 [Pelodictyon phaeoclathratiforme
BU-1]
Length = 285
Score = 125 bits (313), Expect = 2e-27, Method: Composition-based stats.
Identities = 87/235 (37%), Positives = 120/235 (51%), Gaps = 42/235 (17%)
Query: 3 FAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAK 62
F I I +TG+HIG S IG ID+PVIK PIT P IPGSSLKGKMR+L+ K
Sbjct: 6 FGHIVLKGTIIAKTGMHIGASADTVEIGGIDTPVIKHPITFEPYIPGSSLKGKMRSLMEK 65
Query: 63 V--------YNEKVAEK----------------------PSDDSDILSRLFGNSKDKRFK 92
+ YN V + P + + RLFG++ + K
Sbjct: 66 IEVSKGVITYNRLVVKNNNIWQHVCSDSEHEMLISNHKTPGATNCDVCRLFGSTGENGNK 125
Query: 93 M--GRLIFRDAFLSNADELDSLGVRSYTEVKFENTIDRITAEANPRQIERAIRTSTFDFE 150
R++ RDA LSN ++L L E K EN +DR+ A A PR IER + F+FE
Sbjct: 126 NHPARILVRDAKLSNPNDL-KLDALLIMEAKMENVLDRVHAAATPRTIERVPVGAKFNFE 184
Query: 151 LIYEITDE---------NENQVEEDSKVIRDGLKLLELDYLGGSGSRGYGKVAFE 196
++Y + E ++ +V+ D K I L+ +E + LGG+ SRGYG+V+FE
Sbjct: 185 IVYRVEGEGCVTGIKNSDKTKVQTDVKNILLLLRAVEREGLGGNTSRGYGQVSFE 239
>gi|118047322|ref|ZP_01515960.1| CRISPR-associated RAMP Csm3 [Chloroflexus aggregans DSM 9485]
gi|117996132|gb|EAV10335.1| CRISPR-associated RAMP Csm3 [Chloroflexus aggregans DSM 9485]
Length = 305
Score = 124 bits (311), Expect = 4e-27, Method: Composition-based stats.
Identities = 86/215 (40%), Positives = 118/215 (54%), Gaps = 25/215 (11%)
Query: 4 AKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAKV 63
+I F A I L TGLHIGG+ IG +D PVI++P TN P IPGSSLKGK+R+LL KV
Sbjct: 44 GRIIFRANIELLTGLHIGGAAGGLEIGGLDKPVIRNPRTNQPYIPGSSLKGKLRSLLEKV 103
Query: 64 Y--------NEKV-AEKPSDDSDI-----LSRLFGN-SKDKRFKMG---RLIFRDAFLSN 105
Y NE V P+++ + ++ +FG ++ + RL RD L+
Sbjct: 104 YGAPQTFRVNEGVFVHVPTNEEEYTRYRPIAGVFGTLPSHQKLTISVPTRLAMRDVPLTE 163
Query: 106 ADELDSLGVRS---YTEVKFENTIDRITAEANPRQIERAIRTSTFD-FELIYEITDENEN 161
E + +R+ YTEVK+E IDR+T+ A PRQIER + F E +Y + N
Sbjct: 164 ESEQELRQLRTDLPYTEVKWEAAIDRVTSAAAPRQIERVPAGAVFGPAEAVYSVYVGNGE 223
Query: 162 QVEEDSKV---IRDGLKLLELDYLGGSGSRGYGKV 193
QV E + + + LE DYLGG GSRG G++
Sbjct: 224 QVSEVVALFGTLLEAFTYLEDDYLGGMGSRGNGQI 258
>gi|13540941|ref|NP_110629.1| hypothetical protein TVN0110 [Thermoplasma volcanium GSS1]
gi|14324323|dbj|BAB59251.1| hypothetical protein [Thermoplasma volcanium GSS1]
Length = 229
Score = 124 bits (310), Expect = 6e-27, Method: Composition-based stats.
Identities = 87/198 (43%), Positives = 119/198 (60%), Gaps = 17/198 (8%)
Query: 6 IKFSAQIRLETGLHIGGSDAFAAIGAIDSPVI--KDPITN-----LPIIPGSSLKGKMRT 58
I ++ +I + TGLHIGGS+ IG DSPVI K I N LP IPGSS+KGK+R+
Sbjct: 10 IIYNMEIEVLTGLHIGGSNEELKIGGTDSPVITTKYLINNVEPCDLPYIPGSSIKGKIRS 69
Query: 59 LLAKVYNEKVAEKPSDDSDILSRLFG---NSKDKRFKMGRLIFRDAFLSNADELDSLGVR 115
LL E V K + DI+S++FG N+ + ++ RLI RDAFL + + R
Sbjct: 70 LL-----ENVDYKGKNGDDIVSKMFGYYPNAGEGIKRLTRLIIRDAFLDDGHIKSAEDAR 124
Query: 116 SYTEVKFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIRDGLK 175
+ E+K EN ID I ++A PR IER R + F ++I I E +N+ EE K ++ G+
Sbjct: 125 NVIEIKSENKIDPIESKATPRFIERVRRGTKFKGKIILSIY-EGDNE-EEMIKCLKTGIS 182
Query: 176 LLELDYLGGSGSRGYGKV 193
LLE YLGG+G+RGYG V
Sbjct: 183 LLEDSYLGGNGTRGYGSV 200
>gi|76258119|ref|ZP_00765776.1| Protein of unknown function DUF324 [Chloroflexus aurantiacus
J-10-fl]
gi|76167205|gb|EAO61328.1| Protein of unknown function DUF324 [Chloroflexus aurantiacus
J-10-fl]
Length = 272
Score = 123 bits (309), Expect = 6e-27, Method: Composition-based stats.
Identities = 90/223 (40%), Positives = 122/223 (54%), Gaps = 31/223 (13%)
Query: 4 AKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAKV 63
+I A I L TGLHIGG+ IG +D PVI++PITN P IPGSSLKGK+R+L+ KV
Sbjct: 10 GRIVLRASIELLTGLHIGGAAGGLEIGGLDKPVIRNPITNQPYIPGSSLKGKLRSLMEKV 69
Query: 64 Y--------NEKV-AEKPSDDSDI-----LSRLFG---NSKDKRFKM-GRLIFRDAFLSN 105
Y NE V P++ + ++ +FG N K + RLI RD L+
Sbjct: 70 YGAPQTFRINEGVFIHVPTNPEEYQRYYQIAGVFGTLPNHKQLTIDVPTRLIVRDVPLTR 129
Query: 106 ADELDSLGVRS---YTEVKFENTIDRITAEANPRQIERAIRTSTFD-FELIYEITDENEN 161
E + +R+ YTE+K+E IDR+T+ A PRQIER + F E++Y + N
Sbjct: 130 ESEQELRRLRTDLPYTEIKWEAAIDRVTSAAAPRQIERVPAGAVFGPAEVVYSLYVGNGE 189
Query: 162 QVEEDSKVIR------DGLKLLELDYLGGSGSRGYGKVAFENL 198
Q+ S VIR + LE DYLGG GSRG G+V ++
Sbjct: 190 QL---SDVIRLFGSVYEAFTYLEDDYLGGMGSRGNGQVRLRDI 229
>gi|15669865|ref|NP_248679.1| hypothetical protein MJ1669 [Methanocaldococcus jannaschii DSM
2661]
gi|41688764|sp|Q59063|Y1669_METJA Uncharacterized protein MJ1669
gi|1500572|gb|AAB99689.1| conserved hypothetical protein [Methanocaldococcus jannaschii DSM
2661]
Length = 248
Score = 119 bits (299), Expect = 1e-25, Method: Composition-based stats.
Identities = 83/207 (40%), Positives = 117/207 (56%), Gaps = 19/207 (9%)
Query: 4 AKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAKV 63
K+ I LETG+HIGG+ IG D+PVI+D + +IPGSSLKGK+R LL +
Sbjct: 8 GKVILEGIIELETGMHIGGTKETLKIGGTDNPVIRDAFGRI-LIPGSSLKGKIRALLER- 65
Query: 64 YNEKVAEK------PSDDSDI-LSRLFGNSKDKRFKMG-RLIFRDAFLSNADELDSLGVR 115
+ K E P D + + ++FG K K R+I RDA+L +
Sbjct: 66 KDGKYKEDGRGNYLPHDCGECEICKIFGPHDSKNIKEPVRVIVRDAYLQPEENKKDY--- 122
Query: 116 SYTEVKFENTIDRI---TAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIRD 172
Y E+K ENTIDR+ T + R +ER + S F FE+++ I E++ ++ K +
Sbjct: 123 DYLEIKVENTIDRLKGTTIKGGIRNMERVVAGSKFKFEVVFNIYKESDKEL---IKKFIE 179
Query: 173 GLKLLELDYLGGSGSRGYGKVAFENLK 199
G+KLLE DYLGGSGSRGYGK+ F ++K
Sbjct: 180 GMKLLEDDYLGGSGSRGYGKIKFRDIK 206
>gi|150021546|ref|YP_001306900.1| CRISPR-associated RAMP protein, Csm3 family [Thermosipho
melanesiensis BI429]
gi|149794067|gb|ABR31515.1| CRISPR-associated RAMP protein, Csm3 family [Thermosipho
melanesiensis BI429]
Length = 269
Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats.
Identities = 93/229 (40%), Positives = 119/229 (51%), Gaps = 39/229 (17%)
Query: 4 AKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLL--- 60
KI A I + TGL IG ++ IG ID+PVIKD + P IPGSSLKGK+RTL+
Sbjct: 8 GKIIVKADIHVVTGLQIGKENSME-IGGIDNPVIKDSLGK-PYIPGSSLKGKLRTLMEYF 65
Query: 61 -AKVYNEKVAEKPSDDSDI-----------LSRLFGNSKDK-----------RFKM---G 94
K+ N+ + ++ I + LFG + K FK
Sbjct: 66 HGKIKNDVLVVAKGGENQIRIHQCDEKDCPVCNLFGRNHGKHRYASNPDVEVEFKNIMPT 125
Query: 95 RLIFRDAFL---SNADELDSLGVRSYTEVKFENTIDRITAEANPRQIERAIRTSTFDFEL 151
RL RDA L S +E+ +TEVK ENT+DRIT+ ANPR ER + F E
Sbjct: 126 RLFVRDALLDENSITEEMKKNLDNEFTEVKPENTLDRITSAANPRFNERVPAGAIFKSEF 185
Query: 152 IYEI-TDENENQVEEDSKVIRDGLKLLELDYLGGSGSRGYGKVAFENLK 199
+ + D+NE + E + L LLE DYLGGSGSRGYGKV FEN+K
Sbjct: 186 VINVYDDDNEKYLRE----LLTALSLLEDDYLGGSGSRGYGKVRFENIK 230
>gi|124008646|ref|ZP_01693337.1| crispr-associated ramp protein, Csm3 family [Microscilla marina
ATCC 23134]
gi|123985890|gb|EAY25754.1| crispr-associated ramp protein, Csm3 family [Microscilla marina
ATCC 23134]
Length = 309
Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats.
Identities = 94/246 (38%), Positives = 123/246 (50%), Gaps = 61/246 (24%)
Query: 4 AKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAKV 63
A + +I TGLHIGGS IG +DSPV+++P T P IPGSS+KGK+R LL
Sbjct: 21 ANLVLRGKIECVTGLHIGGSKEKLEIGGVDSPVLRNPQTRYPYIPGSSIKGKLRYLLE-- 78
Query: 64 YNEKVAEKP--------SDDSDILSRLFG---------------NSKDKRFKMG------ 94
Y+ KP S DI+ R+FG + DK++K
Sbjct: 79 YSTGAVTKPVKNKFGDVSVAKDIV-RIFGIGADDKEVEVDNDKDSESDKKYKASVQYLKE 137
Query: 95 ----RLIFRDAFLSNADE-----LDSLGVRSYTEVKFENTIDRITAEANPRQIERAIRTS 145
RLI RD +A + LDS + YTE K ENTIDR+T+ ANPR IER + S
Sbjct: 138 TGPTRLIVRDCNPDDATQEMWKNLDSELL--YTEYKPENTIDRLTSAANPRFIERVVAGS 195
Query: 146 TFDFELIY-----EITDENENQV-------------EEDSKVIRDGLKLLELDYLGGSGS 187
FDFE+I E+ DE N V ++D + L+LLE + LG SGS
Sbjct: 196 YFDFEVILGVFYNELMDEQGNNVTTEDYRTEQVNASKKDVANLMQALRLLENNTLGKSGS 255
Query: 188 RGYGKV 193
RGYG++
Sbjct: 256 RGYGQI 261
>gi|48477124|ref|YP_022830.1| hypothetical protein PTO0052 [Picrophilus torridus DSM 9790]
gi|48429772|gb|AAT42637.1| conserved hypothetical protein [Picrophilus torridus DSM 9790]
Length = 250
Score = 114 bits (286), Expect = 3e-24, Method: Composition-based stats.
Identities = 77/199 (38%), Positives = 119/199 (59%), Gaps = 23/199 (11%)
Query: 6 IKFSAQIRLETGLHIGGSDAFAAIGAIDSPVI------KDPITNLPIIPGSSLKGKMRTL 59
I + +I+++TGLHIGG++ IG ID+ VI + + +LP IPGSS+KGK+R++
Sbjct: 39 IIYDLEIKVKTGLHIGGTNEEIKIGGIDNQVITTSYEYNNKMYDLPYIPGSSIKGKLRSI 98
Query: 60 LAKVYNEKVAEKPSDDSDILSRLFG---NSKDKRFKMGRLIFRDAFLSNADELDSLGVRS 116
L+ Y K +I+ ++FG N KD + RLI RD +L+ DE+ +
Sbjct: 99 LSAFYQNK---------EIIEKVFGRGNNEKDNIDRRTRLIVRDFYLTE-DEIKNYVDND 148
Query: 117 Y--TEVKFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIRDGL 174
+ TE+K EN ID +T++A PR IER + F+ + I I ++++ E + +I DGL
Sbjct: 149 FKLTEIKGENIIDPLTSKATPRFIERIKPGTVFEGKFILSIYEDDDE--EAMTGLIMDGL 206
Query: 175 KLLELDYLGGSGSRGYGKV 193
+L+ YLGG+GSRGYG V
Sbjct: 207 ELIRDSYLGGNGSRGYGSV 225
>gi|156741956|ref|YP_001432085.1| CRISPR-associated RAMP protein, Csm3 family [Roseiflexus
castenholzii DSM 13941]
gi|156233284|gb|ABU58067.1| CRISPR-associated RAMP protein, Csm3 family [Roseiflexus
castenholzii DSM 13941]
Length = 278
Score = 114 bits (285), Expect = 4e-24, Method: Composition-based stats.
Identities = 87/226 (38%), Positives = 114/226 (50%), Gaps = 31/226 (13%)
Query: 4 AKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAKV 63
+I A I L TGLHIGG+ A IG +D PVI++P TN P IPGSSLKGK+R+L+ K
Sbjct: 11 GRIFLRADIELLTGLHIGGAAAGLEIGGLDKPVIRNPRTNEPYIPGSSLKGKLRSLMEKA 70
Query: 64 Y--------NEKV-AEKPSDDSD------------ILSRLFGNSKDKRFKM---GRLIFR 99
+ NE V P + +L R N F + RL R
Sbjct: 71 HGAPQTFRVNEGVFVHAPESIAQYRQYQMIGGVFGVLPRWGPNGNRNAFTVPAPTRLAVR 130
Query: 100 DAFL--SNADELDSLGVR-SYTEVKFENTIDRITAEANPRQIERAIRTSTFD-FELIYEI 155
D L + +EL L +TE+K+E IDR+T+ A PRQIER + F EL+Y +
Sbjct: 131 DVPLRAESREELKRLRTDLPFTEIKWEAAIDRVTSAAAPRQIERVPAGAVFGPAELVYSV 190
Query: 156 ---TDENENQVEEDSKVIRDGLKLLELDYLGGSGSRGYGKVAFENL 198
+E V + + L LE DYLGG GSRG G+V +NL
Sbjct: 191 YVGKEETPTGVAALFDTVIEALLHLEDDYLGGMGSRGNGQVRLKNL 236
>gi|114332179|ref|YP_748401.1| CRISPR-associated RAMP protein, Csm3 family protein [Nitrosomonas
eutropha C91]
gi|114309193|gb|ABI60436.1| CRISPR-associated RAMP protein, Csm3 family protein [Nitrosomonas
eutropha C91]
Length = 238
Score = 113 bits (283), Expect = 6e-24, Method: Composition-based stats.
Identities = 89/216 (41%), Positives = 119/216 (55%), Gaps = 28/216 (12%)
Query: 7 KFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLL------ 60
K I L++GLHIG D+ IG DSPV+KDP+T P IPGSSLKGK+R+LL
Sbjct: 8 KIIGTIVLQSGLHIGAGDSEMRIGGTDSPVVKDPLTGQPYIPGSSLKGKIRSLLEWRHGL 67
Query: 61 -----AKVYNEKVAEKPSDDS--DILSRLFGNSKDK-------RFKMGRLIFRDAFLSNA 106
Y+ K + ++S + +LFG + DK RL F D LSN
Sbjct: 68 VLAAGGAPYSFKQLAQDENNSAGRAVIKLFGGAPDKAEDQLVTSIGPTRLAFWDCPLSND 127
Query: 107 DELDSLGVRSY--TEVKFENTIDRITAEA-NPRQIERAIRTSTFDFELIYEITDENENQV 163
+++ + R+ TEVK EN+I+RI A +PR IER I + FDF L ++ ++
Sbjct: 128 WKIEEVDSRNLLITEVKSENSINRIAGTAESPRFIERVIAGACFDFTLTLKVLAGDDLL- 186
Query: 164 EEDSKVIRDGLKLLELDYLGGSGSRGYGKVAFENLK 199
+ GL+LLELD LGGSGSRGYGK+ F LK
Sbjct: 187 ----DTVLLGLRLLELDSLGGSGSRGYGKIKFAALK 218
>gi|149195213|ref|ZP_01872303.1| hypothetical protein CMTB2_05852 [Caminibacter mediatlanticus TB-2]
gi|149134646|gb|EDM23132.1| hypothetical protein CMTB2_05852 [Caminibacter mediatlanticus TB-2]
Length = 235
Score = 113 bits (282), Expect = 8e-24, Method: Composition-based stats.
Identities = 88/219 (40%), Positives = 125/219 (57%), Gaps = 25/219 (11%)
Query: 16 TGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLL----AKVYNEKVAEK 71
TGLHIGGS IG ID+PVIK+P+TN P IPGSSLKGK+R+L+ KV + K +
Sbjct: 17 TGLHIGGSSDTIKIGGIDNPVIKNPLTNEPYIPGSSLKGKIRSLIEWSSGKVNDGKPLDY 76
Query: 72 PSDDSDILS---RLFGNS---KDKRF--KMG--RLIFRDAFLSNADELDSLGVRSYTEVK 121
+ +I+ +LFGN KD++ ++G R+ F D L N D+L L + TE K
Sbjct: 77 TKQEEEIVKKIVKLFGNGATIKDEKIAKEIGPTRVSFSDCSLLNKDKL--LEKNALTEDK 134
Query: 122 FENTIDRITAE---ANPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIRDGLKLLE 178
E TIDR+ PR +ER + FDF + + E + +E ++ G+KLLE
Sbjct: 135 VEVTIDRVKGTVGGGGPRHMERVPAGAEFDFS-VSLLEFEGDEDLE---AILAFGMKLLE 190
Query: 179 LDYLGGSGSRGYGKVAFENLKA--TTVFGNYDVKTLNEL 215
+ LGG+GSRGYGK+ F ++K + N +K+ EL
Sbjct: 191 MTNLGGNGSRGYGKIEFVDIKGLDSKFLDNGKLKSYEEL 229
>gi|30248152|ref|NP_840222.1| hypothetical protein NE0121 [Nitrosomonas europaea ATCC 19718]
gi|30180037|emb|CAD84032.1| conserved hypothetical protein [Nitrosomonas europaea ATCC 19718]
Length = 238
Score = 110 bits (276), Expect = 4e-23, Method: Composition-based stats.
Identities = 88/217 (40%), Positives = 119/217 (54%), Gaps = 30/217 (13%)
Query: 7 KFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAKVYNE 66
K + + L++GLHIG D+ IG DSPV+KDP+T+ P IPGSSLKGK+R+LL +
Sbjct: 8 KITGTLILKSGLHIGAGDSEMRIGGTDSPVVKDPLTDQPYIPGSSLKGKIRSLLEWRHGL 67
Query: 67 KVA--------------EKPSDDSDILSRLFGNSKD-------KRFKMGRLIFRDAFLSN 105
VA E S D++ +LFG + D K RL F D L+
Sbjct: 68 VVATGGAPYSFKHLAQDENNSAGRDVI-KLFGGAPDKAEDQLVKNIGPTRLAFWDCPLNG 126
Query: 106 ADELDSLGVRSY--TEVKFENTIDRITAEA-NPRQIERAIRTSTFDFELIYEITDENENQ 162
+ ++ R TEVK EN+I+RI A +PR IER I + FDF L ++ + ++
Sbjct: 127 DWKKEAADSRHLLTTEVKSENSINRIAGTAEHPRFIERVIAGARFDFTLTLKVLEGDDLL 186
Query: 163 VEEDSKVIRDGLKLLELDYLGGSGSRGYGKVAFENLK 199
+ GL+LLELD LGGSGSRGYGK+ F LK
Sbjct: 187 -----NTVLLGLRLLELDSLGGSGSRGYGKIKFAELK 218
>gi|146295151|ref|YP_001178922.1| CRISPR-associated RAMP protein, Csm3 family [Caldicellulosiruptor
saccharolyticus DSM 8903]
gi|145408727|gb|ABP65731.1| CRISPR-associated RAMP protein, Csm3 family [Caldicellulosiruptor
saccharolyticus DSM 8903]
Length = 263
Score = 108 bits (271), Expect = 2e-22, Method: Composition-based stats.
Identities = 93/225 (41%), Positives = 120/225 (53%), Gaps = 39/225 (17%)
Query: 11 QIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLL----AKVYNE 66
+IR TGLHIG + IG ID+ V+KD P IPGSSLKGKMR L+ KV ++
Sbjct: 14 KIRAVTGLHIGEGNNSIEIGGIDNAVVKDA-EGKPYIPGSSLKGKMRALMEFAEGKVKDD 72
Query: 67 ------KVAEKPS------DDSDI-LSRLFGNSKDKRFKMG-------------RLIFRD 100
K +KP DD D + LFG + K RLI RD
Sbjct: 73 LMVVAVKKGDKPEICLHMCDDKDCPVCGLFGRNHGLHDKKSGGKIDLTDAVIPTRLIVRD 132
Query: 101 AFL---SNADELDSLGVRSYTEVKFENTIDRITAEANPRQIERAIRTSTFDFELI---YE 154
A L S DE+ +TEVKFEN IDRIT++A+PRQ ER + F E + YE
Sbjct: 133 AKLIESSITDEMKENLDLEWTEVKFENNIDRITSKAHPRQSERVPAGAEFSAEFVVNRYE 192
Query: 155 ITDENENQVEEDSKVIRDGLKLLELDYLGGSGSRGYGKVAFENLK 199
+ D +++ + SK I+ +KLLE DYLGG GSRG GKV F +++
Sbjct: 193 V-DGSDDGEKYLSKFIK-AMKLLEDDYLGGQGSRGNGKVKFVDIE 235
>gi|116748786|ref|YP_845473.1| CRISPR-associated RAMP protein, Csm3 family [Syntrophobacter
fumaroxidans MPOB]
gi|116697850|gb|ABK17038.1| CRISPR-associated RAMP protein, Csm3 family [Syntrophobacter
fumaroxidans MPOB]
Length = 241
Score = 103 bits (257), Expect = 7e-21, Method: Composition-based stats.
Identities = 88/212 (41%), Positives = 120/212 (56%), Gaps = 23/212 (10%)
Query: 8 FSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAKVYNE- 66
+ IR+ TGLHIG IG +DSPV+K+P T+ P IPGSSLKGK+R L+ N
Sbjct: 9 LTGHIRVLTGLHIGAGKDAIEIGGVDSPVVKNPYTDEPYIPGSSLKGKLRCLMEWATNRV 68
Query: 67 ----KVAEKPSDD------SDILSRLFGNSKDKRFKMG--RLIFRDAFLSNADELDSLGV 114
K E + D + R+FG + K++ G RL+ RD L N DSL
Sbjct: 69 EESGKTWEGGGEKDPVRLAQDPVLRIFGTTS-KQWVAGPTRLVVRDCSL-NEQWRDSLIG 126
Query: 115 RS--YTEVKFENTIDRITAEANP--RQIERAIRTSTFDFELIYEITDENENQVEEDSKVI 170
R +TE KFEN IDRI +A R+ ER + FD ++Y + D + Q E+D ++
Sbjct: 127 RGLPFTEEKFENNIDRIQGKAGVGIRKTERVPAGAVFDLMMVYRVFDTGD-QGEQDKRLF 185
Query: 171 RDGL---KLLELDYLGGSGSRGYGKVAFENLK 199
++ L +LLE D LGGSGSRGYG++ FENL+
Sbjct: 186 KEFLRVMRLLEHDALGGSGSRGYGRIKFENLR 217
>gi|119357840|ref|YP_912484.1| CRISPR-associated RAMP protein, Csm3 family [Chlorobium
phaeobacteroides DSM 266]
gi|119355189|gb|ABL66060.1| CRISPR-associated RAMP protein, Csm3 family [Chlorobium
phaeobacteroides DSM 266]
Length = 285
Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats.
Identities = 86/242 (35%), Positives = 115/242 (47%), Gaps = 53/242 (21%)
Query: 6 IKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAK--- 62
IK + I TGLHIGG+ G ID+PVIK+P+TN P IPGSS +G+MR+LL K
Sbjct: 14 IKITGIIEALTGLHIGGTADSIDKGGIDNPVIKNPVTNEPYIPGSSFRGRMRSLLEKKTA 73
Query: 63 ----------------------VYNEKVAEKPSD--DSDILSRLFGNSKDKRFKMGRLIF 98
YN A+ D +S++ R+FGNS LI
Sbjct: 74 EYLSPMTGNKEIWMEIYKAEDEKYNNNKAQSTIDAMNSEV-CRVFGNSASYESVPSVLIV 132
Query: 99 RDAFLSNADELDSL----GVRSYTEVKFENTIDRITAEANPRQIERAIRTSTFDFELIYE 154
RDA + + G TE K E +DRITA A PR IER + F FE++Y+
Sbjct: 133 RDALYTEKTRESYMQGGKGGLPITEAKMEIAVDRITAHALPRTIERVPAGAKFAFEIVYK 192
Query: 155 IT--------DE------------NENQVEEDSKVIRDGLKLL-ELDYLGGSGSRGYGKV 193
I DE ++ +E+D + I LK + E D LGG+ SRG+G+V
Sbjct: 193 IQSTSFICVIDEKGKPKEVKSYMADDKVIEKDIENILWALKQIEEHDGLGGNTSRGHGQV 252
Query: 194 AF 195
F
Sbjct: 253 KF 254
>gi|121540755|ref|ZP_01672514.1| conserved hypothetical protein [Candidatus Desulfococcus oleovorans
Hxd3]
gi|121519260|gb|EAX56109.1| conserved hypothetical protein [Candidatus Desulfococcus oleovorans
Hxd3]
Length = 242
Score = 100 bits (248), Expect = 9e-20, Method: Composition-based stats.
Identities = 89/210 (42%), Positives = 119/210 (56%), Gaps = 21/210 (10%)
Query: 9 SAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAKVYN--- 65
+ QI + TGLHIG IG +D+PV+K+P T P IPGSSLKGK+R L+ V
Sbjct: 10 TGQIEILTGLHIGAGKDAVEIGGVDNPVVKNPYTGEPYIPGSSLKGKLRCLMEWVTGCVE 69
Query: 66 ------EKVAEKPSDD--SDILSRLFGNSKDKRFKMG--RLIFRDAFLSN--ADELDSLG 113
E E D +D + R+FG + K + G RLI RDA L+ + + G
Sbjct: 70 DSGKTWEGGGETGPDKLAADPILRIFGTTS-KNWDAGPTRLIVRDACLNKEWKERIIDRG 128
Query: 114 VRSYTEVKFENTIDRITAEANP--RQIERAIRTSTFDFELIYEITDEN-ENQVEEDS-KV 169
+ TE KFEN IDRI +A P R+ ER S FDFE++Y I D N E + + D+
Sbjct: 129 L-PLTEEKFENNIDRIQGKAGPGIRKTERVPAGSLFDFEMVYRIFDTNDEGRTDVDNLDN 187
Query: 170 IRDGLKLLELDYLGGSGSRGYGKVAFENLK 199
+ ++LLE D LGGSGSRGYG++ F+ LK
Sbjct: 188 LFAIMRLLEQDALGGSGSRGYGRIRFDKLK 217
>gi|46255175|ref|YP_006087.1| hypothetical protein TT_P0104 [Thermus thermophilus HB27]
gi|46198024|gb|AAS82434.1| hypothetical conserved protein [Thermus thermophilus HB27]
Length = 241
Score = 88.2 bits (217), Expect = 4e-16, Method: Composition-based stats.
Identities = 79/212 (37%), Positives = 115/212 (54%), Gaps = 25/212 (11%)
Query: 6 IKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLA---- 61
I+ + + +TGL IG S AIG +D+PV+++P+T+ P IPGSSLKGK+R LL
Sbjct: 7 IRIRSVLLAKTGLRIGMSRDQMAIGDLDNPVVRNPLTDEPYIPGSSLKGKLRYLLEWSLG 66
Query: 62 -----KVYNEKVAEKPSDDSDILSRLFG---NSKDKRFKMG------RLIFRDAFLSN-- 105
K ++ V P D D ++R+FG + ++ + RL+ RDA+L+
Sbjct: 67 GDYILKAKDKHVYASP-DPKDPVARIFGLAPENDERSLAVARERGPTRLLVRDAYLTEDA 125
Query: 106 ADELDSLGVRS--YTEVKFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQV 163
+ L+ R YTE+K E I R+ ANPR ER + F E+ Y + D+ +
Sbjct: 126 KEALERTSARGGLYTEIKQEVFIPRLGGNANPRTTERVPAGARFRVEMTYRVLDDLDE-- 183
Query: 164 EEDSKVIRDGLKLLELDYLGGSGSRGYGKVAF 195
E K + L+LLELD LGG SRGYG+V F
Sbjct: 184 EYFGKYLLRALELLELDGLGGHISRGYGQVYF 215
>gi|55978332|ref|YP_145388.1| hypothetical protein TTHB149 [Thermus thermophilus HB8]
gi|55773505|dbj|BAD71945.1| conserved hypothetical protein [Thermus thermophilus HB8]
Length = 241
Score = 87.4 bits (215), Expect = 5e-16, Method: Composition-based stats.
Identities = 79/212 (37%), Positives = 114/212 (53%), Gaps = 25/212 (11%)
Query: 6 IKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLA---- 61
I+ + + +TGL IG S AIG +D+PV+++P+T+ P IPGSSLKGK+R LL
Sbjct: 7 IRIRSVLLAKTGLRIGMSRDQMAIGDLDNPVVRNPLTDEPYIPGSSLKGKLRYLLEWSLG 66
Query: 62 -----KVYNEKVAEKPSDDSDILSRLFG---NSKDKRFKMG------RLIFRDAFLSN-- 105
K +V P D D ++R+FG + ++ + RL+ RDA+L+
Sbjct: 67 GDYILKAKERQVYASP-DPKDPVARIFGLAPENDERSLAVARERGPTRLLVRDAYLTEDA 125
Query: 106 ADELDSLGVRS--YTEVKFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQV 163
+ L+ R YTE+K E I R+ ANPR ER + F E+ Y + D+ +
Sbjct: 126 KEALERTSARGGLYTEIKQEVFIPRLGGNANPRTTERVPAGARFRVEMTYRVLDDLDE-- 183
Query: 164 EEDSKVIRDGLKLLELDYLGGSGSRGYGKVAF 195
E K + L+LLELD LGG SRGYG+V F
Sbjct: 184 EYFGKYLLRALELLELDGLGGHISRGYGQVYF 215
>gi|116620021|ref|YP_822177.1| CRISPR-associated RAMP protein, Csm3 family [Solibacter usitatus
Ellin6076]
gi|116223183|gb|ABJ81892.1| CRISPR-associated RAMP protein, Csm3 family [Solibacter usitatus
Ellin6076]
Length = 278
Score = 80.5 bits (197), Expect = 7e-14, Method: Composition-based stats.
Identities = 80/242 (33%), Positives = 117/242 (48%), Gaps = 49/242 (20%)
Query: 4 AKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLLAKV 63
K+ ++ ETGLH+G IG D+PV+KD P +PGSSL+GK+R+LL +
Sbjct: 14 GKLILEGEMLCETGLHVGAGKGSLEIGGSDNPVVKDAFGR-PYVPGSSLRGKIRSLLEQS 72
Query: 64 YNEKV-------------------AEKPSDDSDIL--------SRLFGNSKD-KRFKMGR 95
V +++P D+ +L R+ G S D + R
Sbjct: 73 SGLAVPGELVYLSRRKGQEVRIHQSDRPDDEICLLFGRNPGRMERVQGESLDTSQATPAR 132
Query: 96 LIFRDAFLSNADELDSLGVR-------SYTEVKFENTIDRITAEANPRQIERAIRTSTFD 148
L DA L ++DS+ TEVK EN IDRIT++ANPR +ER + F
Sbjct: 133 LAVFDAPL----DVDSITAAMRENLDDELTEVKSENAIDRITSQANPRTLERVPMGARFK 188
Query: 149 FELIYEITDENENQVEEDSKV---IRDGLKLLELDYLGGSGSRGYGKVAFENLKATTVFG 205
+ ++ +ED+ + + +GL+LLE D LGG GSRG G+V+F NL+
Sbjct: 189 TRFVMDVL------CDEDAPLFMRVLEGLRLLEDDALGGGGSRGSGRVSFSNLRLAWRSK 242
Query: 206 NY 207
NY
Sbjct: 243 NY 244
>gi|119873035|ref|YP_931042.1| CRISPR-associated RAMP protein, Csm3 family [Pyrobaculum islandicum
DSM 4184]
gi|119674443|gb|ABL88699.1| CRISPR-associated RAMP protein, Csm3 family [Pyrobaculum islandicum
DSM 4184]
Length = 329
Score = 74.7 bits (182), Expect = 4e-12, Method: Composition-based stats.
Identities = 64/222 (28%), Positives = 98/222 (44%), Gaps = 60/222 (27%)
Query: 44 LPIIPGSSLKGKMRTLLA------------------------------------------ 61
+P IPGSSLKG+MR+LL
Sbjct: 71 VPYIPGSSLKGRMRSLLELAHGLPLYTTDNKIWQHVRSLSAMQLEDFLDDLENRCLIDEL 130
Query: 62 --------KVYNEKVAEKPSDDSDILSRLFGNSKDKRFKMGRLIFRDAFLSNADELDSLG 113
K +EK+ E +D + + F N+ K + RL+F D F + ++ LG
Sbjct: 131 FGWAATSYKQVDEKIKEVKKED---IRKKFENAW-KNAGITRLLFDD-FFPTCETINKLG 185
Query: 114 VR-----SYTEVKFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDSK 168
R + E K EN IDR+TA A+PR + R F + + D + + V++ +
Sbjct: 186 GRFVGISDFLEDKSENRIDRVTATADPRNVVRVKPGVVFYGYIRMLLFDNDRDMVKKYLQ 245
Query: 169 VIRDGLKLLELDYLGGSGSRGYGKVAFENLKATTVFGNYDVK 210
++ G++L+E YLG SGSRGYG+VAF+N + +VK
Sbjct: 246 TLQAGIELIENTYLGASGSRGYGRVAFKNFNVVILKSECEVK 287
>gi|20094752|ref|NP_614599.1| Predicted component of a thermophile-specific DNA repair system,
contains a RAMP domain [Methanopyrus kandleri AV19]
gi|19887946|gb|AAM02529.1| Predicted component of a thermophile-specific DNA repair system,
contains a RAMP domain [Methanopyrus kandleri AV19]
Length = 351
Score = 72.8 bits (177), Expect = 1e-11, Method: Composition-based stats.
Identities = 45/107 (42%), Positives = 61/107 (57%), Gaps = 7/107 (6%)
Query: 92 KMGRLIFRDAFLSN---ADELDSLGVRSYTEVKFENTIDRITAEANPRQIERAIRTSTFD 148
K R+ FRDA + D + G TEVK EN I+R++ EANPR +ER + S F
Sbjct: 168 KEARVAFRDAHPTTYTVNDVFERAG--EPTEVKHENAINRVSGEANPRSMERVPKGSRFG 225
Query: 149 FELIYEITDENENQVEEDSKVIRDGLKLLELDYLGGSGSRGYGKVAF 195
E++Y + D E +E D K + LKL+E +G S SRGYG+V F
Sbjct: 226 LEVVYRVEDGEE--LESDLKYLMSSLKLVEDQGIGHSTSRGYGRVEF 270
Score = 59.7 bits (143), Expect = 1e-07, Method: Composition-based stats.
Identities = 27/55 (49%), Positives = 36/55 (65%)
Query: 6 IKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNLPIIPGSSLKGKMRTLL 60
I +IRL TG IG S+ IG +D+PVI+DP++ P +PGSSLKG+ R L
Sbjct: 8 ITLVGEIRLRTGTRIGTSEEEIEIGGLDNPVIRDPVSGYPYVPGSSLKGRARALF 62
>gi|18311720|ref|NP_558387.1| hypothetical protein PAE0114 [Pyrobaculum aerophilum str. IM2]
gi|18159122|gb|AAL62569.1| conserved hypothetical protein [Pyrobaculum aerophilum str. IM2]
Length = 308
Score = 72.4 bits (176), Expect = 2e-11, Method: Composition-based stats.
Identities = 78/259 (30%), Positives = 113/259 (43%), Gaps = 71/259 (27%)
Query: 13 RLET-GLHIGGSDAFAAIG-------AIDSPVIKDP---ITNLPIIPGSSLKGKMRTLL- 60
RLET GL I A +G AI+ P + D I ++P IPGSSLKG+ R LL
Sbjct: 16 RLETYGLLIRSGKAREVLGLADIMPMAIEYPFVIDNRRYILSVPYIPGSSLKGRARALLE 75
Query: 61 -----------AKVY-NEKVAEKPSDDSDI---LSRLFGN--------SKDKRFKMG--- 94
K+Y + ++ D D + +FG +++KRF +
Sbjct: 76 TVLGLPLYTTDGKIYLHTRIVRNEIRDEDPYCPVDNVFGTPAIPPNMVAEEKRFILDCWA 135
Query: 95 --RLIFRDAFLSN---------ADELDSLGVRSYTEVKFENTIDRITAEANPRQIERAIR 143
R IFRD F S+ + + + + E K+EN IDR+T+ A PR I R
Sbjct: 136 PTRAIFRDLFPSDEYLRRLCDAKGGCEGVQLADFLEEKWENRIDRVTSTAEPRNIMRLRP 195
Query: 144 TSTFDFE---LIYEITDENENQVEEDSK-------------------VIRDGLKLLELDY 181
F LIY++ + +E S+ ++ D LKL+E Y
Sbjct: 196 GVEFKGHISFLIYDLDVCRKPVCDEKSQRRELIVGKYKGVPALYYLDMLIDSLKLVERTY 255
Query: 182 LGGSGSRGYGKVAFENLKA 200
LG SG+RGYG V F+ + A
Sbjct: 256 LGASGTRGYGTVKFKGITA 274
>gi|119720269|ref|YP_920764.1| CRISPR-associated RAMP protein, Csm3 family [Thermofilum pendens
Hrk 5]
gi|119525389|gb|ABL78761.1| CRISPR-associated RAMP protein, Csm3 family [Thermofilum pendens
Hrk 5]
Length = 331
Score = 67.4 bits (163), Expect = 5e-10, Method: Composition-based stats.
Identities = 35/87 (40%), Positives = 54/87 (62%)
Query: 112 LGVRSYTEVKFENTIDRITAEANPRQIERAIRTSTFDFELIYEITDENENQVEEDSKVIR 171
L + + E K EN IDRIT+ A+PRQ+ R F L + D ++ V+ + +++
Sbjct: 201 LAISDFLEEKGENRIDRITSAADPRQVARVKPGVVFQGALRLLVFDIDKGYVKRNLELVA 260
Query: 172 DGLKLLELDYLGGSGSRGYGKVAFENL 198
GL+L+E YLG SGSRGYG+V F+++
Sbjct: 261 KGLRLVEETYLGASGSRGYGRVKFKDI 287
>gi|73668880|ref|YP_304895.1| hypothetical protein Mbar_A1354 [Methanosarcina barkeri str.
Fusaro]
gi|72396042|gb|AAZ70315.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length = 364
Score = 55.5 bits (132), Expect = 2e-06, Method: Composition-based stats.
Identities = 52/178 (29%), Positives = 82/178 (46%), Gaps = 18/178 (10%)
Query: 32 IDSPVIKDPITNLPIIPGSSLKGKMRTLLAKVYNEKVAEKPSDDSDILSRLFGN-----S 86
ID+P+ KD + +LP + SS KG +R L ++ + ++D + R+FGN S
Sbjct: 80 IDNPLRKDKVLHLPFVSPSSWKGSLRNSLWQLNYDY-------ENDKIRRIFGNERSPIS 132
Query: 87 KDKRFKMGRLIFRDAFLSNADELDSLGVRSYTEVKFENTIDRITAEANPRQIERAIRTST 146
+D +MGRL F F + SL + + + K I E+ P+
Sbjct: 133 EDIALRMGRLCFFPTFFTK----KSLEIINPHDRKRRVGTVPILMESVPQDTTGFFTLLY 188
Query: 147 FDFELIYEITDENENQVEEDSKVIRDGLKLLELDY-LGGSGSRGYGKVAFENLKATTV 203
F+LI E QV ED +++ +GLK + Y G S GYG +EN+ T+
Sbjct: 189 VPFDLIGCDEKEIRKQVSEDIQLVSEGLKSMFTYYGFGAKTSSGYG-TTYENITDGTI 245
>gi|126465220|ref|YP_001040329.1| CRISPR-associated RAMP protein, Csm3 family [Staphylothermus
marinus F1]
gi|126014043|gb|ABN69421.1| CRISPR-associated RAMP protein, Csm3 family [Staphylothermus
marinus F1]
Length = 316
Score = 55.1 bits (131), Expect = 3e-06, Method: Composition-based stats.
Identities = 79/242 (32%), Positives = 115/242 (47%), Gaps = 63/242 (26%)
Query: 16 TGLHIGGSDAFAAIGAID----------SPVIKDPIT------NLPIIPGSSLKGKMRTL 59
TGL I A A IG D +IKD +P IPGSSLKG++R+L
Sbjct: 26 TGLLIRSPLAKATIGGADIQPMFIVKNYGKIIKDKQVYEIGEIEVPYIPGSSLKGRIRSL 85
Query: 60 L-----AKVYN------------------------EKVAEKPSDDSDILSRLFGNSKD-- 88
L A +Y+ + + PS D LS++ +K+
Sbjct: 86 LEIYEGAPLYSSDNEIFIHVRDIAKNWCTDLEHSLDNLFGSPSVD---LSKIIKENKELT 142
Query: 89 ----KRFKMGRLIFRDAFLSN------ADELDSLGVRSYTEVKFENTIDRITAEANPRQI 138
+++ RLI D + S AD+ L ++ E K ENTIDRIT+ ANPR I
Sbjct: 143 NEILEKYAPTRLIVEDLYPSEDYVVKLADK-GMLTKEAFIEEKTENTIDRITSAANPRTI 201
Query: 139 ERAIRTSTF--DFELIYEITDENENQVEEDSKVIRDGLKLLELDYLGGSGSRGYGKVAFE 196
R F +F+++ D+ +++E +++ G+KLLE YLGGSGSRGYG+V F+
Sbjct: 202 LRVKPDVEFKGNFKMLLYDVDQKLGKIKEYIELLLTGMKLLEDTYLGGSGSRGYGRVKFK 261
Query: 197 NL 198
N+
Sbjct: 262 NI 263
>gi|21229459|ref|NP_635381.1| hypothetical protein MM_3357 [Methanosarcina mazei Go1]
gi|20908057|gb|AAM33053.1| hypothetical protein MM_3357 [Methanosarcina mazei Go1]
Length = 370
Score = 54.7 bits (130), Expect = 4e-06, Method: Composition-based stats.
Identities = 57/185 (30%), Positives = 85/185 (45%), Gaps = 32/185 (17%)
Query: 32 IDSPVIKDPITNLPIIPGSSLKGKMRTLLAKVYNEKVAEKPSDDSDILSRLFG-----NS 86
ID+P+ KD + NLP + SS KG +R L ++ + ++D + R+FG NS
Sbjct: 80 IDNPLRKDKVLNLPFVAPSSWKGSLRNSLWQLNYDY-------ENDKIRRIFGNERSPNS 132
Query: 87 KDKRFKMGRLIFRDAFLSNADELDSLGVRSYTEVKFENTIDRITAEANPRQIERAIRTST 146
+D +MGRL F F S +S + N R+ P +E + +T
Sbjct: 133 EDIVLRMGRLYFFPTFFSK---------KSLEIINPHNRESRVGTV--PILMESVPQDTT 181
Query: 147 FDFELIYEITD-----ENE--NQVEEDSKVIRDGLKLLELDY-LGGSGSRGYGKVAFENL 198
F LIY D ENE QV D ++I GLK + Y G S GYG ++E++
Sbjct: 182 GYFTLIYVPFDLIGCEENEIKKQVACDIQLISKGLKSMFTYYGFGAKTSSGYG-TSYEDI 240
Query: 199 KATTV 203
T+
Sbjct: 241 TDGTI 245
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.316 0.136 0.369
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 745,523,526
Number of Sequences: 5470121
Number of extensions: 30337483
Number of successful extensions: 91662
Number of sequences better than 1.0e-05: 46
Number of HSP's better than 0.0 without gapping: 32
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 91509
Number of HSP's gapped (non-prelim): 54
length of query: 220
length of database: 1,894,087,724
effective HSP length: 128
effective length of query: 92
effective length of database: 1,193,912,236
effective search space: 109839925712
effective search space used: 109839925712
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 127 (53.5 bits)