BLASTP 2.2.18 [Mar-02-2008]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= SPy_0666 hypothetical protein
(244 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
6,515,104 sequences; 2,222,278,849 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_268907.1| hypothetical protein SPy_0666 [Streptococc... 498 e-139
ref|NP_150149.1| hypothetical protein MM1p18 [Streptococcus... 299 1e-79
ref|ZP_01831598.1| phage protein [Streptococcus pneumoniae ... 298 3e-79
ref|YP_001693511.1| conserved hypothetical protein [Strepto... 296 8e-79
ref|NP_782698.1| hypothetical protein CTC02139 [Clostridium... 182 2e-44
ref|ZP_02729390.1| hypothetical protein CdifQCD_16880 [Clos... 162 1e-38
ref|ZP_02327243.1| hypothetical protein Plarl_06290 [Paenib... 161 3e-38
ref|ZP_02330190.1| hypothetical protein Plarl_21501 [Paenib... 160 1e-37
ref|ZP_02764724.1| hypothetical protein CdifQCD-_14148 [Clo... 157 7e-37
ref|ZP_02727036.1| hypothetical protein CdifQCD_04835 [Clos... 156 1e-36
ref|YP_535159.1| hypothetical protein LSL_0261 [Lactobacill... 156 1e-36
ref|ZP_02758961.1| hypothetical protein CdifQCD-7_20008 [Cl... 155 3e-36
ref|YP_805803.1| hypothetical protein LSEI_0511 [Lactobacil... 149 2e-34
ref|ZP_02687744.1| hypothetical protein LmonFSL_07419 [List... 148 3e-34
ref|YP_001886128.1| conserved hypothetical protein [Clostri... 146 1e-33
ref|ZP_02233184.1| hypothetical protein DORFOR_00016 [Dorea... 133 8e-30
ref|ZP_01274482.1| hypothetical protein Lreu23DRAFT_0652 [L... 129 2e-28
ref|ZP_02439983.1| hypothetical protein CLOSS21_02472 [Clos... 127 8e-28
ref|YP_065761.1| hypothetical protein DP2025 [Desulfotalea ... 112 3e-23
ref|YP_001112545.1| hypothetical protein Dred_1186 [Desulfo... 112 3e-23
ref|ZP_02240721.1| hypothetical protein Plarl_00100 [Paenib... 105 3e-21
gb|AAZ17495.1| unknown [Bacillus thuringiensis] 88 6e-16
ref|ZP_02091329.1| hypothetical protein FAEPRAM212_01601 [F... 87 8e-16
gb|ABP73607.1| hypothetical protein [Bacillus thuringiensis] 87 9e-16
gb|ABP73606.1| hypothetical protein [Bacillus thuringiensis] 74 1e-11
ref|ZP_02429529.1| hypothetical protein CLORAM_02952 [Clost... 73 1e-11
ref|YP_767551.1| putative universal stress family protein [... 38 0.70
ref|NP_988566.1| Zn-finger, ZPR1 type:Zn-finger, ZPR1-relat... 37 1.0
ref|ZP_01909173.1| putative carbamoyl transferase [Plesiocy... 37 1.4
ref|ZP_02298215.1| UspA domain protein [Rhizobium leguminos... 35 5.9
ref|XP_001610630.1| DNA polymerase alpha subunit, putative ... 34 7.4
>ref|NP_268907.1| hypothetical protein SPy_0666 [Streptococcus phage 370.1]
ref|YP_600159.1| phage protein [Streptococcus phage 2096.1]
gb|AAK33628.1| hypothetical protein, phage associated [Streptococcus phage 370.1]
gb|ABF35615.1| phage protein [Bacteriophage 2096.1]
Length = 244
Score = 498 bits (1282), Expect = e-139, Method: Compositional matrix adjust.
Identities = 244/244 (100%), Positives = 244/244 (100%)
Query: 1 MTENIKEALEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKTLNLNTLDSLI 60
MTENIKEALEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKTLNLNTLDSLI
Sbjct: 1 MTENIKEALEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKTLNLNTLDSLI 60
Query: 61 DYLKSDMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSD 120
DYLKSDMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSD
Sbjct: 61 DYLKSDMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSD 120
Query: 121 FNISLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPV 180
FNISLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPV
Sbjct: 121 FNISLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPV 180
Query: 181 TLRPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKEELADQTNI 240
TLRPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKEELADQTNI
Sbjct: 181 TLRPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKEELADQTNI 240
Query: 241 TILA 244
TILA
Sbjct: 241 TILA 244
>ref|NP_150149.1| hypothetical protein MM1p18 [Streptococcus phage MM1]
emb|CAC48083.1| hypothetical protein [Streptococcus pneumoniae bacteriophage MM1]
gb|AAZ82429.1| hypothetical protein [Streptococcus pneumoniae bacteriophage MM1
1998]
Length = 245
Score = 299 bits (765), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 148/243 (60%), Positives = 188/243 (77%)
Query: 2 TENIKEALEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKTLNLNTLDSLID 61
TEN+K ALEYAV + + +DG EYYD NK++L EL K YPKTL L+TL SL+D
Sbjct: 3 TENLKSALEYAVELNEHGLEILTAADGTEYYDANKFNLKELDPKRYPKTLELSTLTSLVD 62
Query: 62 YLKSDMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSDF 121
YLK+D+NN+ ++RL+V VE EV V E+D+ +R +LV V+A IP + FG + S F
Sbjct: 63 YLKTDLNNLKNQRLIVAVEKNDEVCVWSENDEIEHRTLLVDVKARIPELSFGRFLSLEQF 122
Query: 122 NISLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPVT 181
NI LQS F++ +DR T++EFASALKIENG+EI D+G+ Q AT+K GVASLAK KAPNPVT
Sbjct: 123 NIMLQSNFIDDNDRGTLLEFASALKIENGAEIEDNGVSQVATVKTGVASLAKGKAPNPVT 182
Query: 182 LRPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKEELADQTNIT 241
LRPYRTFSEVEQP+S F+FRI++ ANMALFEADG +W DA+ NIA+YLKE+LADQ +IT
Sbjct: 183 LRPYRTFSEVEQPASLFVFRIDKQANMALFEADGKRWVADAVGNIASYLKEQLADQKHIT 242
Query: 242 ILA 244
+LA
Sbjct: 243 VLA 245
>ref|ZP_01831598.1| phage protein [Streptococcus pneumoniae SP18-BS74]
gb|EDK67435.1| phage protein [Streptococcus pneumoniae SP18-BS74]
Length = 245
Score = 298 bits (762), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 147/243 (60%), Positives = 186/243 (76%)
Query: 2 TENIKEALEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKTLNLNTLDSLID 61
TEN+K ALEYAV + + +DG EYYD NK++L EL K YPKTL L+TL SL+D
Sbjct: 3 TENLKSALEYAVELNEHGLEILTAADGTEYYDANKFNLKELDPKRYPKTLELSTLTSLVD 62
Query: 62 YLKSDMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSDF 121
YLK+D+NN+ ++RL+V VE EV V E+D+ R +LV V+A +P + FG + S F
Sbjct: 63 YLKTDLNNLKNQRLIVAVEKNDEVCVWSENDELERRTLLVDVKARVPELSFGRFLSSEQF 122
Query: 122 NISLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPVT 181
NI LQS F++ +DR T++EFASALKIENG+EI D+G+ Q AT+K GVASLAK KAPNPVT
Sbjct: 123 NIMLQSNFIDDNDRGTLLEFASALKIENGAEIEDNGVSQVATVKTGVASLAKGKAPNPVT 182
Query: 182 LRPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKEELADQTNIT 241
LRPYRTFSEVEQP+S F+FRI++ ANMALFEADG +W DA+ NIA YLKE+LADQ +IT
Sbjct: 183 LRPYRTFSEVEQPASLFVFRIDKQANMALFEADGKRWVADAVGNIAAYLKEQLADQKHIT 242
Query: 242 ILA 244
+LA
Sbjct: 243 VLA 245
>ref|YP_001693511.1| conserved hypothetical protein [Streptococcus pneumoniae
Hungary19A-6]
gb|ACA36475.1| conserved hypothetical protein [Streptococcus pneumoniae
Hungary19A-6]
Length = 245
Score = 296 bits (758), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 148/243 (60%), Positives = 185/243 (76%)
Query: 2 TENIKEALEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKTLNLNTLDSLID 61
TEN+K ALEYAV + + +DG EYYD NK++L EL K YPKTL L+TL SL+D
Sbjct: 3 TENLKAALEYAVELNEHGLEILTAADGTEYYDANKFNLKELDPKRYPKTLELSTLTSLVD 62
Query: 62 YLKSDMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSDF 121
YLK+D+NN+ ++RL+V VE EV V E+D+ R +LV V+A IP + FG + S F
Sbjct: 63 YLKTDLNNLKNQRLIVAVEKNDEVCVWSENDELERRTLLVDVKARIPELSFGRFLSSEQF 122
Query: 122 NISLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPVT 181
NI LQS F++ +DR T++EFASALKIENG+EI D+G+ Q AT+K GVASLAK KAPNPVT
Sbjct: 123 NIMLQSNFIDDNDRGTLLEFASALKIENGAEIEDNGVSQVATVKTGVASLAKGKAPNPVT 182
Query: 182 LRPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKEELADQTNIT 241
LRPYRTFSEVEQP+S F+FRI++ ANMALFEADG +W DA+ NIA YLKE+LADQ IT
Sbjct: 183 LRPYRTFSEVEQPASLFVFRIDKQANMALFEADGKRWVADAVGNIAAYLKEQLADQKLIT 242
Query: 242 ILA 244
+LA
Sbjct: 243 VLA 245
>ref|NP_782698.1| hypothetical protein CTC02139 [Clostridium tetani E88]
gb|AAO36635.1| phage-related protein [Clostridium tetani E88]
Length = 238
Score = 182 bits (461), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 102/240 (42%), Positives = 140/240 (58%), Gaps = 8/240 (3%)
Query: 6 KEALEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKTLNLNTLDSLIDYLKS 65
KEALEY V K ++ DG++Y + Y + + K P LN TL +L+DYLKS
Sbjct: 6 KEALEYLVGLGETK---VIEIDGQKYSTKELYRVRDPK----PSELNATTLTALVDYLKS 58
Query: 66 DMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPN-VQFGHYDSPSDFNIS 124
+ + +SK+L+V V+ P V +C E D +R + EA+ PN + F + FNI
Sbjct: 59 NFDAKSSKKLLVHVKSPSCVELCSELRGDRDRENYIKCEALTPNNIVFDRFLDTEQFNIM 118
Query: 125 LQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPVTLRP 184
LQS F DR +++ +K E+ DDG+ Q ATIK GVAS+ + K PNPV L P
Sbjct: 119 LQSSFAENKDRGLLLKVTGCVKDSAVKEVGDDGVSQAATIKTGVASVNEVKVPNPVVLAP 178
Query: 185 YRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKEELADQTNITILA 244
+RTF E+EQP S+FIFR+ ALFEADGG WR +A+ I YL+E+L NI I+A
Sbjct: 179 FRTFPEIEQPESKFIFRMQSGPRAALFEADGGAWRNEAMIKIKAYLEEQLKGLGNIQIIA 238
>ref|ZP_02729390.1| hypothetical protein CdifQCD_16880 [Clostridium difficile
QCD-37x79]
ref|ZP_02746228.1| hypothetical protein CdifQCD-6_10039 [Clostridium difficile
QCD-63q42]
Length = 237
Score = 162 bits (411), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 82/195 (42%), Positives = 123/195 (63%), Gaps = 1/195 (0%)
Query: 51 LNLNTLDSLIDYLKSDMNN-INSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPN 109
+ L+TLD LI Y+KS +++ + S +LMV VE P+ VI+C E + R++ + A++P+
Sbjct: 43 VRLSTLDGLIGYIKSSIDSKLKSGKLMVQVESPKSVILCSELLESGKRDIYIESRALLPH 102
Query: 110 VQFGHYDSPSDFNISLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVA 169
V F FN+ LQS FV+ +D+ +++ + +K + + DDG+ Q T+ GVA
Sbjct: 103 VNFDCSLDAEQFNVMLQSSFVDTEDKKLLLKVSGNVKETSVKTVQDDGVSQVVTMSTGVA 162
Query: 170 SLAKAKAPNPVTLRPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANY 229
S+ PN V L+PYRTF+EV+QP SEFIFR+ + L EADGG WRL+AI NI Y
Sbjct: 163 SVEDVILPNRVKLKPYRTFAEVDQPESEFIFRVKEGIAFKLIEADGGAWRLEAIKNIKEY 222
Query: 230 LKEELADQTNITILA 244
L+++L NI I+A
Sbjct: 223 LEKKLEGIKNIDIIA 237
>ref|ZP_02327243.1| hypothetical protein Plarl_06290 [Paenibacillus larvae subsp.
larvae BRL-230010]
Length = 236
Score = 161 bits (408), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 93/231 (40%), Positives = 131/231 (56%), Gaps = 9/231 (3%)
Query: 6 KEALEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKTLNLNTLDSLIDYLKS 65
++ALEY + + I+ +G++Y Y +VE P+ L + L LIDYLKS
Sbjct: 4 RDALEY---LSEQGQPEIIEVNGQQYATHQLYEVVEPT----PEPLVVRNLSGLIDYLKS 56
Query: 66 DMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSDFNISL 125
+ + L+V V P +V V ++D R ++ EA++P +FG Y DF I L
Sbjct: 57 EFDK--QAPLLVHVVSPTQVTVVSSYNNDYARKEIIKAEALLPEYRFGSYYEAEDFIIKL 114
Query: 126 QSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPVTLRPY 185
QS FV +DR +++ +K EN I DDG+ Q+ T K GVA++ K PNPV L PY
Sbjct: 115 QSGFVANEDRAKLLKVVGNVKEENVRSIGDDGVSQSVTAKTGVATVEDVKVPNPVLLAPY 174
Query: 186 RTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKEELAD 236
RTF EV QP S F+FR+ ALFEADGG WR AI+ +A YL E+L++
Sbjct: 175 RTFVEVIQPESAFVFRMKNGPLAALFEADGGAWRNAAIDAVATYLVEKLSE 225
>ref|ZP_02330190.1| hypothetical protein Plarl_21501 [Paenibacillus larvae subsp.
larvae BRL-230010]
Length = 236
Score = 160 bits (404), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 91/231 (39%), Positives = 129/231 (55%), Gaps = 9/231 (3%)
Query: 6 KEALEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKTLNLNTLDSLIDYLKS 65
++ALEY + + I+ +G++Y Y ++E P L + L L+DYLKS
Sbjct: 4 RDALEY---LSEQGQSEIIEVNGQQYATHQLYKVLEPT----PAALVVRNLSGLVDYLKS 56
Query: 66 DMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSDFNISL 125
+ L+V V P +V V ++D R ++ EA++P +FG Y DF I L
Sbjct: 57 KFDK--QAPLLVHVVSPTQVTVVSSYNNDYARREVIKAEALLPEYRFGSYYEAEDFIIKL 114
Query: 126 QSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPVTLRPY 185
QS FV +DR +++ +K EN I DDG+ Q+ T K GVA++ K PNPV L PY
Sbjct: 115 QSGFVANEDRAKLLKVVGNVKEENVRSIGDDGVSQSVTAKTGVATVEDVKVPNPVLLAPY 174
Query: 186 RTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKEELAD 236
RTF EV QP S F+FR+ ALFEADGG WR +AI+ +A YL EL++
Sbjct: 175 RTFVEVIQPESAFVFRMKNGPLAALFEADGGAWRNEAIDAVATYLTAELSE 225
>ref|ZP_02764724.1| hypothetical protein CdifQCD-_14148 [Clostridium difficile
QCD-97b34]
Length = 234
Score = 157 bits (396), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 95/245 (38%), Positives = 140/245 (57%), Gaps = 17/245 (6%)
Query: 5 IKEALEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKT--LNLNTLDSLIDY 62
I+ A+EY V LG D KY+ L+ PK L + TL L+DY
Sbjct: 2 IRNAMEYLVK---------LGKRETVCVDGFKYTTDALEIIKEPKANELEITTLSGLVDY 52
Query: 63 LKSDMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSD-- 120
+KS +++ + L++ V PR+V++ + +R + A++P+ +F D P D
Sbjct: 53 IKSGIDHEEDELLLIQVLTPRKVLLKSALRKNRDRETYIECVALLPDKRF---DRPLDLD 109
Query: 121 -FNISLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNP 179
FNI LQS F+ +DR+T+++ A ++ + DDG+ Q+ TIK GVAS+++A PN
Sbjct: 110 TFNIMLQSAFIKNNDRDTLLKIAGNIQESTVKNVGDDGVSQSVTIKTGVASVSEAIVPNR 169
Query: 180 VTLRPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKEELADQTN 239
V L PYRTF EVEQP SEFIFR++ L+EADGG W+ A+ NI YL+EEL D N
Sbjct: 170 VKLIPYRTFQEVEQPESEFIFRMSTGPAAGLYEADGGAWKNQAMLNIKAYLQEELKDFKN 229
Query: 240 ITILA 244
+ I+A
Sbjct: 230 VNIIA 234
>ref|ZP_02727036.1| hypothetical protein CdifQCD_04835 [Clostridium difficile
QCD-37x79]
ref|ZP_02747796.1| hypothetical protein CdifQCD-6_17418 [Clostridium difficile
QCD-63q42]
Length = 234
Score = 156 bits (395), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 95/245 (38%), Positives = 140/245 (57%), Gaps = 17/245 (6%)
Query: 5 IKEALEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKT--LNLNTLDSLIDY 62
I+ A+EY V LG D KY+ L+ PK L + TL L+DY
Sbjct: 2 IRNAMEYLVK---------LGKRETVCVDGFKYTTDALEIIKEPKANELEITTLSGLVDY 52
Query: 63 LKSDMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSD-- 120
+KS +++ + L++ V PR+V++ + +R + A++P+ +F D P D
Sbjct: 53 IKSGIDHEEDELLLIQVLTPRKVLLKSALRKNRDRETYIECVALLPDKRF---DRPLDLD 109
Query: 121 -FNISLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNP 179
FNI LQS F+ +DR+T+++ A ++ + DDG+ Q+ TIK GVAS+++A PN
Sbjct: 110 TFNIILQSAFIKNNDRDTLLKIAGNIQESTVKNVGDDGVSQSVTIKTGVASVSEAIVPNR 169
Query: 180 VTLRPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKEELADQTN 239
V L PYRTF EVEQP SEFIFR++ L+EADGG W+ A+ NI YL+EEL D N
Sbjct: 170 VKLIPYRTFQEVEQPESEFIFRMSTGPAAGLYEADGGAWKNQAMLNIKAYLQEELKDFKN 229
Query: 240 ITILA 244
+ I+A
Sbjct: 230 VNIIA 234
>ref|YP_535159.1| hypothetical protein LSL_0261 [Lactobacillus phage Sal2]
gb|ABD99076.1| Hypothetical protein, phage associated [Lactobacillus phage Sal2]
Length = 242
Score = 156 bits (394), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 95/241 (39%), Positives = 134/241 (55%), Gaps = 5/241 (2%)
Query: 6 KEALEYAVNTAHEKPKTILGSDGKEYY-DRNKYSLVELKTKYYPK-TLNLNTLDSLIDYL 63
KEAL+Y + L +G+EY D+N ++ Y K + LNTL L+DY+
Sbjct: 5 KEALDYLTERGIRPEERKLKINGQEYIIDKNGEPVLVEPVIYKAKEPIRLNTLSGLVDYI 64
Query: 64 KSDMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSDFNI 123
KS+++ + L V+ E E+ + + D R +L AI+P F FNI
Sbjct: 65 KSNIDGFDDLILHVVDEKLVELKGKLQPNGD--RELLAVATAIVPEFDFDSCMDIESFNI 122
Query: 124 SLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPVTLR 183
+LQS FV DDR+ +++ LK +N DDGI Q TIK G+A+ K PNPV L
Sbjct: 123 ALQSQFVKTDDRDILLKVVGNLKEDNVRSTGDDGISQAVTIKSGIATAENIKVPNPVILA 182
Query: 184 PYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKEELADQTNITIL 243
PYRTF EVEQP S+FIFR+ A+FE DGG WR++AI IA +L++ L + T + +L
Sbjct: 183 PYRTFVEVEQPESKFIFRMQSGPRGAIFEGDGGLWRVEAIKTIAKFLEKRL-EGTGVVLL 241
Query: 244 A 244
A
Sbjct: 242 A 242
>ref|ZP_02758961.1| hypothetical protein CdifQCD-7_20008 [Clostridium difficile
QCD-76w55]
Length = 234
Score = 155 bits (391), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 94/245 (38%), Positives = 139/245 (56%), Gaps = 17/245 (6%)
Query: 5 IKEALEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKT--LNLNTLDSLIDY 62
I+ +EY V LG D KY+ L+ PK L + TL L+DY
Sbjct: 2 IRNVMEYLVK---------LGKRETVCVDGFKYTTDALEIIKEPKANELEITTLSGLVDY 52
Query: 63 LKSDMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSD-- 120
+KS +++ + L++ V PR+V++ + +R + A++P+ +F D P D
Sbjct: 53 IKSGIDHEEDELLLIQVLTPRKVLLKSALRKNRDRETYIECVALLPDKRF---DRPLDLD 109
Query: 121 -FNISLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNP 179
FNI LQS F+ +DR+T+++ A ++ + DDG+ Q+ TIK GVAS+++A PN
Sbjct: 110 TFNIMLQSAFIKNNDRDTLLKIAGNIQESTVKNVGDDGVSQSVTIKTGVASVSEAIVPNR 169
Query: 180 VTLRPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKEELADQTN 239
V L PYRTF EVEQP SEFIFR++ L+EADGG W+ A+ NI YL+EEL D N
Sbjct: 170 VKLIPYRTFQEVEQPESEFIFRMSTGPAAGLYEADGGAWKNQAMLNIKAYLQEELKDFKN 229
Query: 240 ITILA 244
+ I+A
Sbjct: 230 VNIIA 234
>ref|YP_805803.1| hypothetical protein LSEI_0511 [Lactobacillus casei ATCC 334]
gb|ABJ69361.1| hypothetical protein LSEI_0511 [Lactobacillus casei ATCC 334]
Length = 245
Score = 149 bits (375), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 78/197 (39%), Positives = 121/197 (61%), Gaps = 2/197 (1%)
Query: 49 KTLNLNTLDSLIDYLKSDMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIP 108
+ L ++TL +++DY+ ++ ++ L+V V V V E DD NR VL+ +A+ P
Sbjct: 50 EALQVHTLTAVVDYI-GKVSERSASPLIVQVVDENSVFVLGELDDFGNREVLIKAKAMHP 108
Query: 109 NVQFGHYDSPSDFNISLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGV 168
+G + F I+LQ+ F+ +D +++F LK E+ +VDDG+ Q AT + GV
Sbjct: 109 CYPYGQWLDSESFIIALQAQFMTTEDGKVLLKFVGNLKDESAQTMVDDGVSQVATARSGV 168
Query: 169 ASLAKAKAPNPVTLRPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIAN 228
AS+ K PNP++LRPYRTF EVEQ S+F+FR+++ +ALF ADG WR AI+ +A
Sbjct: 169 ASVGKVVVPNPISLRPYRTFLEVEQQESDFVFRMHEGPKLALFSADGDAWRNTAIDLVAE 228
Query: 229 YLKEELAD-QTNITILA 244
YL E+L ++T+LA
Sbjct: 229 YLSEKLVKFSKHVTVLA 245
>ref|ZP_02687744.1| hypothetical protein LmonFSL_07419 [Listeria monocytogenes FSL
J2-071]
Length = 235
Score = 148 bits (374), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 91/233 (39%), Positives = 132/233 (56%), Gaps = 9/233 (3%)
Query: 5 IKEALEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKTLNLNTLDSLIDYLK 64
++EA+EY V + K ++ + +++ D + L E+K + L ++TL L+DY++
Sbjct: 2 LQEAIEYIVGLKNRKEVVMV--EDRKWIDTD---LTEVKDPLATQ-LCISTLTGLVDYIR 55
Query: 65 SDMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSDFNIS 124
S+ + ++ +V V P EV + + NRN V EAIIPN + + NI
Sbjct: 56 SNFDG--QEKCLVHVVSPTEVELLGLLNKQQNRNKFVCAEAIIPNFSYDSFYDTETLNIK 113
Query: 125 LQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPVTLRP 184
LQS+F+ DR+ +++ + EN DDGI Q I+QG+ PNPVTL P
Sbjct: 114 LQSMFLTTLDRDIILQAIGNISEENVKNTGDDGISQVVEIRQGIKK-NDVLVPNPVTLTP 172
Query: 185 YRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKEELADQ 237
RTFSEVEQPSS+FIFR+ A+FEADG WR +AI NIANYLK EL +
Sbjct: 173 IRTFSEVEQPSSQFIFRMKDGPFGAVFEADGKSWRNEAILNIANYLKAELVKE 225
>ref|YP_001886128.1| conserved hypothetical protein [Clostridium botulinum B str. Eklund
17B]
gb|ACD22593.1| conserved hypothetical protein [Clostridium botulinum B str. Eklund
17B]
Length = 237
Score = 146 bits (369), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 90/242 (37%), Positives = 136/242 (56%), Gaps = 11/242 (4%)
Query: 6 KEALEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKT--LNLNTLDSLIDYL 63
+EAL+Y V E I D+ YS V+L PK L+++TL +DY+
Sbjct: 4 QEALQYLVQLGEESEPII-------ELDQGTYSRVDLNRVKQPKAQGLSISTLTGFVDYI 56
Query: 64 KSDMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIP-NVQFGHYDSPSDFN 122
KS+++ I++K L++ V + V + + D R + A +P N+++ + FN
Sbjct: 57 KSNIDAIDTK-LLIHVASHKRVDLYGPLNADRERECYLVAGAELPTNIRYEQFLDTEQFN 115
Query: 123 ISLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPVTL 182
I LQS F + +++ ++++ ++ E DDGI Q T+K VAS+A+A NPV+L
Sbjct: 116 IMLQSSFADNGNKDVLLKYTGLVRDEAVKTTGDDGISQKVTVKTRVASVAEAVVLNPVSL 175
Query: 183 RPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKEELADQTNITI 242
PYRTF E+EQP S+FIFR+ A+FEADGG WR +I NI YLK+EL NI I
Sbjct: 176 APYRTFPEIEQPLSKFIFRMKDGPRAAIFEADGGAWRNGSILNIKEYLKKELEGLENIEI 235
Query: 243 LA 244
+A
Sbjct: 236 IA 237
>ref|ZP_02233184.1| hypothetical protein DORFOR_00016 [Dorea formicigenerans ATCC
27755]
gb|EDR48581.1| hypothetical protein DORFOR_00016 [Dorea formicigenerans ATCC
27755]
Length = 246
Score = 133 bits (335), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 88/252 (34%), Positives = 133/252 (52%), Gaps = 14/252 (5%)
Query: 1 MTENIKEALEYAVNTAHE--KPKTILGSDGKEYYDRNKYSLVELKTKYYPKTLNLNTLDS 58
M E +KEAL+Y E +PK +L +G+ Y ++ Y L++NTL +
Sbjct: 1 MMEGLKEALQYITGLKAESMEPK-LLEINGETYCTKDLTRYHRFPMASY---LSVNTLTA 56
Query: 59 LIDYLKSDMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSP 118
L+DY+K + ++ +V P +V + D+ N L+ AI+ +F Y
Sbjct: 57 LVDYIKGKPEELRESSILHVV-SPTKVFLFSGLIDERNGESLMAAGAIVNEFRFDDYYDQ 115
Query: 119 SDFNISLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPN 178
F I LQ+ FV +D T+++ A +K + DDG+ Q TIK GV LA PN
Sbjct: 116 ERFLIELQANFVETEDLTTIMQVAGNIKSGTTANYSDDGVSQKTTIKSGV-ELADVIVPN 174
Query: 179 PVTLRPYRTFSEVEQPSSEFIFRIN---QLANMALFEADGGKWRLDAINNIANYLKEELA 235
PV LRPYRTF+E+EQP S ++FRI + + L EADGG W+ + I YL+ ELA
Sbjct: 175 PVKLRPYRTFAEIEQPESSYVFRIKDSERGPHFKLVEADGGLWKNATMKKIKEYLEFELA 234
Query: 236 DQT---NITILA 244
++ +IT++A
Sbjct: 235 EELKEHHITVIA 246
>ref|ZP_01274482.1| hypothetical protein Lreu23DRAFT_0652 [Lactobacillus reuteri
100-23]
gb|EAS88534.1| hypothetical protein Lreu23DRAFT_0652 [Lactobacillus reuteri
100-23]
Length = 248
Score = 129 bits (324), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 80/224 (35%), Positives = 121/224 (54%), Gaps = 2/224 (0%)
Query: 9 LEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKTLNLNTLDSLIDYLKSDMN 68
E VN A+ + I G +G++Y + L L LN L SLI++L+S+ +
Sbjct: 12 FEQLVNDANGN-QLITGLNGRQYLLDHDGDAALLNDPIIAHPLQLNQLTSLIEWLESEGS 70
Query: 69 NINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSDFNISLQSL 128
+IN L + V P +V V R V A++ ++ Y I LQS
Sbjct: 71 SINDA-LKIHVVSPTKVEVIGNLAKGGQRPCFAEVRAVVDSLNLESYLDQESMIIMLQSH 129
Query: 129 FVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPVTLRPYRTF 188
F DDRN +++ S L+ E+ + DDG+ Q+ I GVAS+ + K PNPV L P+RTF
Sbjct: 130 FEENDDRNIILKVVSNLRDESIHQQTDDGVSQSVQINSGVASVDEVKVPNPVKLIPFRTF 189
Query: 189 SEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKE 232
EV+QP+S+FIFR+ + ALF AD +W+++A NNI Y+++
Sbjct: 190 QEVDQPASKFIFRMREGMQSALFMADNNQWQVEAKNNIKKYIQQ 233
>ref|ZP_02439983.1| hypothetical protein CLOSS21_02472 [Clostridium sp. SS2/1]
gb|EDS20838.1| hypothetical protein CLOSS21_02472 [Clostridium sp. SS2/1]
Length = 248
Score = 127 bits (318), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 79/238 (33%), Positives = 126/238 (52%), Gaps = 9/238 (3%)
Query: 5 IKEALEYAVN-TAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKTLNLNTLDSLIDYL 63
++EA+E+ T + ++ GK Y D+ S+ + + + L +L+SLIDY+
Sbjct: 6 LREAMEHIEELTDSAREPHVVEIAGKTYCDK---SMSRYDREEFAEPLTATSLNSLIDYI 62
Query: 64 KSDMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSDFNI 123
+ + +++ VE P +V + ++ NR L V F HY F I
Sbjct: 63 SGKSEELR-ESMIIHVESPTKVRLLSGLTNERNREELFRVGTNPNGFDFDHYYDQEAFVI 121
Query: 124 SLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPVTLR 183
++Q+ F +D+ ++ A ++ + + DDG+ Q ATI +G+A PNPVTLR
Sbjct: 122 NMQTAFKQSDETELILSVAGNVENKTVANYGDDGVSQKATITKGIAGKEDVIVPNPVTLR 181
Query: 184 PYRTFSEVEQPSSEFIFRINQLAN----MALFEADGGKWRLDAINNIANYLKEELADQ 237
PYRTF EVEQP S+FIFRI + +N L EADGG W+ +A++ I YL E L ++
Sbjct: 182 PYRTFLEVEQPESKFIFRIREGSNGEPLFKLVEADGGLWKYEAVDAIKKYLTENLPEE 239
>ref|YP_065761.1| hypothetical protein DP2025 [Desulfotalea psychrophila LSv54]
emb|CAG36754.1| conserved hypothetical protein [Desulfotalea psychrophila LSv54]
Length = 235
Score = 112 bits (279), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 101/184 (54%), Gaps = 7/184 (3%)
Query: 51 LNLNTLDSLIDYLKSDMNNINSKRLMVIVE--GPREVIVCEE-DDDDLNRNVLVTVEAII 107
L +++L ++ DY NN+ + + +V P +V++ D D +R +
Sbjct: 42 LEVHSLQAIADY----ANNLTEEGVAFVVNVLSPSKVVLLSGLDPDHRDRETFLESSCAY 97
Query: 108 PNVQFGHYDSPSDFNISLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQG 167
FG Y +F +SLQ+ FV + +++ + + + DDG+ Q A +K G
Sbjct: 98 DPFVFGRYYDVEEFIVSLQAQFVQDEVIGAILKMVGNMVTRSDVGLYDDGVTQIAEVKTG 157
Query: 168 VASLAKAKAPNPVTLRPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNIA 227
+A++ K PNPV++ PYRTF E+EQP S F+ R+ + + ALFEADGG W+++AI IA
Sbjct: 158 LANVEKVALPNPVSIAPYRTFPELEQPCSNFVLRLTKDHSCALFEADGGAWKIEAIEKIA 217
Query: 228 NYLK 231
++ +
Sbjct: 218 DFFR 221
>ref|YP_001112545.1| hypothetical protein Dred_1186 [Desulfotomaculum reducens MI-1]
gb|ABO49720.1| conserved hypothetical protein [Desulfotomaculum reducens MI-1]
Length = 238
Score = 112 bits (279), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 70/214 (32%), Positives = 105/214 (49%), Gaps = 4/214 (1%)
Query: 23 ILGSDGKEYYDRNKYSLVELKTKYYPKTLNLNTLDSLIDYLKSDMNNINSKRLMVIVEGP 82
I+ DG +Y R + + E P +L +++ + + + L++ + P
Sbjct: 18 IVDHDGLQYSSRALHLIPEAS----PVAFATKSLAGIVELINKEHEHKRLNDLIIHIVDP 73
Query: 83 REVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSDFNISLQSLFVNADDRNTVIEFA 142
V V +DL R L + A +P FG+Y + I L+S FV D R+ +I+
Sbjct: 74 IRVDVYSMLREDLGRFHLYSAIAELPGQNFGNYMNLETAIIRLKSTFVQTDARDKLIQVL 133
Query: 143 SALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPVTLRPYRTFSEVEQPSSEFIFRI 202
+K + DDGI QT T K G+A++ K + L PYRTF EVEQP EF+ R+
Sbjct: 134 GNIKEDAVKTSSDDGISQTVTAKTGIATVGNVKIEPIIKLAPYRTFIEVEQPEGEFLLRL 193
Query: 203 NQLANMALFEADGGKWRLDAINNIANYLKEELAD 236
ALFEADGG W++ A NI Y ++L D
Sbjct: 194 RNGPEAALFEADGGAWKMQARKNIKEYFMDKLDD 227
>ref|ZP_02240721.1| hypothetical protein Plarl_00100 [Paenibacillus larvae subsp.
larvae BRL-230010]
Length = 118
Score = 105 bits (261), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 51/104 (49%), Positives = 67/104 (64%)
Query: 133 DDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPVTLRPYRTFSEVE 192
+DR +++ +K EN I DDG+ Q+ T K GVA++ K PNPV L PYRTF EV
Sbjct: 4 EDRAKLLKVVGNVKEENVRSIGDDGVSQSVTAKTGVATVEDVKVPNPVLLAPYRTFVEVI 63
Query: 193 QPSSEFIFRINQLANMALFEADGGKWRLDAINNIANYLKEELAD 236
QP S F+FR+ ALFEADGG WR +AI+ +A YL EL++
Sbjct: 64 QPESAFVFRMKNGPLAALFEADGGAWRNEAIDAVATYLTAELSE 107
>gb|AAZ17495.1| unknown [Bacillus thuringiensis]
Length = 92
Score = 87.8 bits (216), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 39/73 (53%), Positives = 52/73 (71%)
Query: 165 KQGVASLAKAKAPNPVTLRPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAIN 224
K +A++ + PNPV+L+PYRTF EV QP S+FIFR+ +L+EADGG W+L+AI
Sbjct: 10 KVALATVDEVAVPNPVSLQPYRTFVEVAQPESDFIFRMKDGPRCSLYEADGGAWKLEAIK 69
Query: 225 NIANYLKEELADQ 237
NI YL ELAD+
Sbjct: 70 NIKEYLNAELADE 82
>ref|ZP_02091329.1| hypothetical protein FAEPRAM212_01601 [Faecalibacterium prausnitzii
M21/2]
gb|EDP21779.1| hypothetical protein FAEPRAM212_01601 [Faecalibacterium prausnitzii
M21/2]
Length = 263
Score = 87.4 bits (215), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 69/252 (27%), Positives = 125/252 (49%), Gaps = 28/252 (11%)
Query: 1 MTENIKEALEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKTLNLNTLDSLI 60
+ E K+A E + A + + GSD Y+ +E P+ + +LD+L+
Sbjct: 15 LAELGKKAAEPSFQKAEGRTFLVTGSD---------YTEIEPMELPKPEKVVTRSLDALV 65
Query: 61 DYLKSDMNNINSKRLMVIVEGPR---EVIVCEEDDDDLNR----NVLVT-VEAIIPNVQF 112
+K++ + + + I G EV +DDL+R L T + ++ NV++
Sbjct: 66 ALIKTEAASQFTDLPLYISCGSASTVEVFTKPNPEDDLHRWQPYCALATDLPTLVENVRW 125
Query: 113 GHYDSPSDFNISLQSLFVN-------ADDRNTVIEFASALKIENGSEIVDDGIGQTATIK 165
++ I L+S F +D + +I+ S + ++ + D+G+ QT ++
Sbjct: 126 TFDEA----MIKLRSAFQRPLGIPGETNDVDYIIDLLSHMSVDQSIKSDDNGVTQTVQVR 181
Query: 166 QGVASLAKAKAPNPVTLRPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINN 225
+G++ + VTL PYRTF EV+QP+SEF+FR+ + +++L ADGG W+L A +
Sbjct: 182 KGISFVQNQPVRPIVTLAPYRTFQEVQQPASEFVFRVYEDRSISLTAADGGMWKLAARDA 241
Query: 226 IANYLKEELADQ 237
YL + LAD+
Sbjct: 242 AKRYLTDALADE 253
>gb|ABP73607.1| hypothetical protein [Bacillus thuringiensis]
Length = 82
Score = 87.0 bits (214), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 38/71 (53%), Positives = 51/71 (71%)
Query: 167 GVASLAKAKAPNPVTLRPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDAINNI 226
+A++ + PNPV+L+PYRTF EV QP S+FIFR+ +L+EADGG W+L+AI NI
Sbjct: 2 ALATVDEVAVPNPVSLQPYRTFVEVAQPESDFIFRMKDGPRCSLYEADGGAWKLEAIKNI 61
Query: 227 ANYLKEELADQ 237
YL ELAD+
Sbjct: 62 KEYLNAELADE 72
>gb|ABP73606.1| hypothetical protein [Bacillus thuringiensis]
Length = 165
Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 82/163 (50%), Gaps = 9/163 (5%)
Query: 5 IKEALEYAVNTAHEKPKTILGSDGKEYYDRNKYSLVELKTKYYPKTLNLNTLDSLIDYLK 64
IKEAL+Y V + + K I G + + +VE T + + + L L+DY+K
Sbjct: 2 IKEALQYLVKLGNTETKDIDG----QTFSTQPLHVVEEPT---ARAILVRNLSGLVDYVK 54
Query: 65 SDMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPSDFNIS 124
S+ + ++LM+ + P V + + NR+ V EA+IP+ FG++ +F I+
Sbjct: 55 SNFDV--EEQLMIHIASPTTVNCFTAINGNYNRSTFVKTEALIPDFDFGYWHDVENFIIN 112
Query: 125 LQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQG 167
LQS FV DDR+ ++ + EN D+G+ Q+ K G
Sbjct: 113 LQSAFVKNDDRDVMLRVVGNITEENVKSWGDNGVSQSVVAKSG 155
>ref|ZP_02429529.1| hypothetical protein CLORAM_02952 [Clostridium ramosum DSM 1402]
gb|EDS18156.1| hypothetical protein CLORAM_02952 [Clostridium ramosum DSM 1402]
Length = 251
Score = 73.2 bits (178), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/195 (27%), Positives = 94/195 (48%), Gaps = 6/195 (3%)
Query: 49 KTLNLNTLDSLIDYLKS----DMNNINSKRLMVIVEGPREVIVCEEDDDDLNRNVLVTVE 104
++++ +L L++ +KS + N+N + ++I E+ V DD R +L+ +
Sbjct: 47 ESIDTRSLSGLVEIIKSYIDVEAKNLNIQLPLIIQADGNEIKVLSSVDDTYERQILIHCK 106
Query: 105 AIIPNVQFGHYDSPSDFNISLQSLFVNADDRNTVIEFASALKIENGSEIVDDGIGQTATI 164
I+P + + SP + I+L + +V D+ + +I S L + VD+GIG +
Sbjct: 107 PIVPRLILNEWISPEEMIINLNTCYVPTDNTDKLISTVSNLYNSKTVKQVDNGIGVNLVV 166
Query: 165 KQ-GVASLAKAKAPNP-VTLRPYRTFSEVEQPSSEFIFRINQLANMALFEADGGKWRLDA 222
+ A A NP VTL P T+ E+ Q +F R++Q +ALF AD G + +
Sbjct: 167 ESDAFAGGAGKVTINPIVTLTPVATYPELIQVERKFNLRVDQHGRVALFVADRGYFEKEV 226
Query: 223 INNIANYLKEELADQ 237
I YL EL +
Sbjct: 227 QKLIKQYLTAELPKE 241
>ref|YP_767551.1| putative universal stress family protein [Rhizobium leguminosarum
bv. viciae 3841]
emb|CAK07442.1| putative universal stress family protein [Rhizobium leguminosarum
bv. viciae 3841]
Length = 280
Score = 37.7 bits (86), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 42/89 (47%), Gaps = 9/89 (10%)
Query: 131 NADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPVTLRPYRTFSE 190
+D+ V +FA A+ E+G+ + IG A I V +A + P+PV ++ + +
Sbjct: 12 TSDNSGAVADFAFAIAAESGAHV----IGLHAEIISAVPLVAPMEIPDPVAVQALQDMAH 67
Query: 191 VEQPSSEFIFRINQLANMALFEADGGKWR 219
E + E IFR A A FE WR
Sbjct: 68 SETIAVERIFRAKAEAAGASFE-----WR 91
>ref|NP_988566.1| Zn-finger, ZPR1 type:Zn-finger, ZPR1-related type [Methanococcus
maripaludis S2]
emb|CAF31002.1| Zn-finger, ZPR1 type:Zn-finger, ZPR1-related type [Methanococcus
maripaludis S2]
Length = 190
Score = 37.0 bits (84), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 66/164 (40%), Gaps = 32/164 (19%)
Query: 28 GKEYYDRNKYSLVELKTKYYPKTLNLNTLDSLIDYLKSDMNNINSKRLMVIVEGPREVIV 87
GKE D K EL+ Y+ K + L Y KSD+ ++ K P+ +
Sbjct: 14 GKE--DSLKLITQELEIPYFGKVIETTILCEACKYKKSDIFPVDVKE-------PKRYTL 64
Query: 88 CEEDDDDLNRNVLVTVEAIIPNVQFGHYDSPS------------------DFNISLQSLF 129
ED+ DLN+ V+ I +FG SP D +L+S
Sbjct: 65 TVEDEYDLNKRVIRGSSGHISIPEFGFEVSPGPASEAYVSNVEGVLTRMEDAIKTLKSWV 124
Query: 130 VNADDRNTVIEFASAL-KIENGSE----IVDDGIGQTATIKQGV 168
N D+R E L +++ G E I++D +G +A I GV
Sbjct: 125 ENEDERKKADELIEKLEQVKLGKEKITLILEDPLGHSAIIGDGV 168
>ref|ZP_01909173.1| putative carbamoyl transferase [Plesiocystis pacifica SIR-1]
gb|EDM77944.1| putative carbamoyl transferase [Plesiocystis pacifica SIR-1]
Length = 1178
Score = 36.6 bits (83), Expect = 1.4, Method: Composition-based stats.
Identities = 23/73 (31%), Positives = 36/73 (49%), Gaps = 4/73 (5%)
Query: 118 PSDFNISLQSLFVNADDRNTVIEFASALKIENGS----EIVDDGIGQTATIKQGVASLAK 173
PS + +L + DR + FA A +E S +++ DG+ + + VASL +
Sbjct: 182 PSPEGVRQAALLAASQDRGALQWFAQAAHVEAPSGPVPDVLIDGLAWSVARRPTVASLHR 241
Query: 174 AKAPNPVTLRPYR 186
AKA PV + P R
Sbjct: 242 AKADAPVRVDPER 254
>ref|ZP_02298215.1| UspA domain protein [Rhizobium leguminosarum bv. trifolii WSM1325]
gb|EDR73407.1| UspA domain protein [Rhizobium leguminosarum bv. trifolii WSM1325]
Length = 280
Score = 34.7 bits (78), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 42/89 (47%), Gaps = 9/89 (10%)
Query: 131 NADDRNTVIEFASALKIENGSEIVDDGIGQTATIKQGVASLAKAKAPNPVTLRPYRTFSE 190
+D+ V +FA A+ E+G+ + IG A I V +A + P+PV ++ + +
Sbjct: 12 TSDNSTAVADFAFAIAAESGAHV----IGLHAEIISAVPLVAPMEIPDPVAVQALQDMAH 67
Query: 191 VEQPSSEFIFRINQLANMALFEADGGKWR 219
E + E IFR A+ A +WR
Sbjct: 68 SETIAVERIFRAKAEASGA-----SSEWR 91
>ref|XP_001610630.1| DNA polymerase alpha subunit, putative [Babesia bovis T2Bo]
gb|EDO07062.1| DNA polymerase alpha subunit, putative [Babesia bovis]
Length = 1483
Score = 34.3 bits (77), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 43/178 (24%), Positives = 72/178 (40%), Gaps = 34/178 (19%)
Query: 37 YSLVELKTKYYPKTLNLNTLDSLIDYLKSDMNNINSKRLMVIVEG--PREVIVCEEDDDD 94
YSL E + K L+ I Y K+ RL ++EG P+++ C + DD
Sbjct: 1169 YSLTEFQQK---------GLEPDIQYYKTQQLLPPILRLCGVIEGTDPQKLASCLQIQDD 1219
Query: 95 LNRNVLVTVEAIIPNVQFGHYDSPSDFNISLQSLFVNADDRNTVIEFASALKIENGSEIV 154
+NR +L N +G +D + +L SL DDR +E + ++ +
Sbjct: 1220 VNRTIL--------NTNYGSFDYAEHESKAL-SLINRTDDRYREVEITTTVR----CQFC 1266
Query: 155 DDGIGQTATIKQGVASLAKAKAP-----NPVTLRPYRTFSEVEQPSSEFIFRINQLAN 207
++G+ ++ S P N +T RT E+ S+ F RI + N
Sbjct: 1267 NNGVAANYFLRNMTCSNCGNWIPLHAMRNWIT----RTVYEIAMQSA-FCIRICNICN 1319
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
Posted date: May 10, 2008 4:54 AM
Number of letters in database: 2,222,278,849
Number of sequences in database: 6,515,104
Lambda K H
0.314 0.132 0.364
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 6515104
Number of Hits to DB: 1,023,854,714
Number of extensions: 41357554
Number of successful extensions: 99674
Number of sequences better than 10.0: 81
Number of HSP's gapped: 101130
Number of HSP's successfully gapped: 81
Length of query: 244
Length of database: 2,222,278,849
Length adjustment: 130
Effective length of query: 114
Effective length of database: 1,375,315,329
Effective search space: 156785947506
Effective search space used: 156785947506
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 76 (33.9 bits)