BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= TF1644
(933 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|147919342|ref|YP_686922.1| predicted endonuclease [uncul... 637 e-180
gi|108761850|ref|YP_631790.1| hypothetical protein MXAN_360... 586 e-165
gi|89901842|ref|YP_524313.1| endonuclease [Rhodoferax ferri... 577 e-162
gi|23014638|ref|ZP_00054444.1| hypothetical protein Magn030... 577 e-162
gi|69937048|ref|ZP_00631769.1| endonuclease [Paracoccus den... 555 e-156
gi|134095049|ref|YP_001100124.1| hypothetical protein HEAR1... 553 e-155
gi|119773419|ref|YP_926159.1| endonuclease [Shewanella amaz... 543 e-152
gi|85706636|ref|ZP_01037728.1| endonuclease [Roseovarius sp... 538 e-151
gi|118714150|ref|ZP_01566710.1| endonuclease [Burkholderia ... 521 e-145
gi|94264147|ref|ZP_01287944.1| endonuclease [delta proteoba... 473 e-131
gi|152994001|ref|YP_001359722.1| endonuclease [Sulfurovum s... 465 e-129
gi|56479068|ref|YP_160657.1| endonuclease [Azoarcus sp. EbN... 453 e-125
gi|94971293|ref|YP_593341.1| endonuclease [Acidobacteria ba... 451 e-124
gi|86360839|ref|YP_472726.1| hypothetical protein RHE_PF001... 451 e-124
gi|153941166|ref|YP_001391388.1| hypothetical protein CLI_2... 424 e-116
gi|115525254|ref|YP_782165.1| endonuclease [Rhodopseudomona... 414 e-113
gi|23097630|ref|NP_691096.1| endonuclease [Oceanobacillus i... 384 e-104
gi|93007193|ref|YP_581630.1| endonuclease [Psychrobacter cr... 362 5e-98
gi|89055354|ref|YP_510805.1| endonuclease [Jannaschia sp. C... 324 2e-86
gi|84495902|ref|ZP_00994756.1| endonuclease [Janibacter sp.... 288 2e-75
gi|154488613|ref|ZP_02029462.1| hypothetical protein BIFADO... 275 2e-71
gi|78778758|ref|YP_396870.1| endonuclease [Prochlorococcus ... 270 4e-70
gi|145593136|ref|YP_001157433.1| hypothetical protein Strop... 265 1e-68
gi|134098148|ref|YP_001103809.1| endonuclease [Saccharopoly... 258 1e-66
gi|125718616|ref|YP_001035749.1| Conserved uncharacterized ... 241 2e-61
gi|23100795|ref|NP_694262.1| hypothetical protein OB3340 [O... 225 9e-57
gi|123967965|ref|YP_001008823.1| hypothetical protein A9601... 216 8e-54
gi|88854625|ref|ZP_01129292.1| endonuclease [marine actinob... 203 5e-50
gi|115376475|ref|ZP_01463710.1| conserved hypothetical prot... 145 2e-32
gi|116748759|ref|YP_845446.1| hypothetical protein Sfum_132... 128 2e-27
gi|113939920|ref|ZP_01425766.1| hypothetical protein HaurDR... 114 5e-23
gi|119962574|ref|YP_946950.1| hypothetical protein AAur_116... 110 4e-22
gi|23127883|ref|ZP_00109742.1| COG0468: RecA/RadA recombina... 106 1e-20
gi|19552980|ref|NP_600982.1| stress-sensitive restriction s... 100 5e-19
gi|26554430|ref|NP_758364.1| hypothetical protein MYPE9820 ... 96 8e-18
gi|23100677|ref|NP_694144.1| hypothetical protein OB3222 [O... 93 1e-16
gi|59800807|ref|YP_207519.1| putative stress-sensitive rest... 89 2e-15
gi|46156670|ref|ZP_00204752.1| hypothetical protein Haso020... 69 2e-09
>gi|147919342|ref|YP_686922.1| predicted endonuclease [uncultured methanogenic archaeon RC-I]
gi|110622318|emb|CAJ37596.1| predicted endonuclease [uncultured methanogenic archaeon RC-I]
Length = 917
Score = 637 bits (1643), Expect = e-180, Method: Composition-based stats.
Identities = 402/941 (42%), Positives = 549/941 (58%), Gaps = 86/941 (9%)
Query: 20 ASVTDSEINEAAIKAKMIYPDV--DAAKLKNDLLSMYSVKIDAFQILEGRER-RDPWLKD 76
+ V D +I I A + YPD+ D LK D+ + S ID + L+G + W D
Sbjct: 23 SDVIDKKIRAVKIMASIEYPDIIIDWDMLKRDIEASCSTWIDIGKALDGDPKAHQNWFYD 82
Query: 77 FRANKMSKWEFWMRYAEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQRNEIQISKKGLV 136
R ++++ W FW RY YL++ +A L + D+++ +L +PQR + +G+V
Sbjct: 83 -RKDQIN-WRFWNRYVRYLQEENGWAEITTKGLGDNVDQIIQRLEDPQRPG-KWDCRGMV 139
Query: 137 VGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYERAY 196
VGQVQSGKTANY GLICKA DAG+ LIIVLAG+HN+LRSQTQ RIDEG LG+DT+ Y
Sbjct: 140 VGQVQSGKTANYIGLICKAIDAGYKLIIVLAGVHNSLRSQTQMRIDEGILGYDTKQSMNY 199
Query: 197 TMNNTTKIGVGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVPQPILLVVKKNAS 256
+ N +IGVG + G + A S T E GDF + N P++LV+KKN S
Sbjct: 200 STANI-RIGVGRLKGIEFFSAQSLTNRDENGDFKKHRQISI---INGNDPVILVIKKNKS 255
Query: 257 VLNRLY-------KWLQTQTINEKITNKSLLIIDDEADNASINTNRKELD---------- 299
VL LY K T + N LLIIDDEADNASINT + E D
Sbjct: 256 VLENLYYSATNVQKERDPSTNRPIVRNIPLLIIDDEADNASINTKQIEFDDDGQVSDEFE 315
Query: 300 PTTINRNICSIISLFNRSAYVGYTATPFANIFI-PQNE-----DDLFPRDFIINIPAPTN 353
PT IN I ++ F +SAYVGYTATPFANIFI P+ + +DLFPR FI+N+PA ++
Sbjct: 316 PTIINGLIRKLLDAFEKSAYVGYTATPFANIFILPRGDTDREGEDLFPRSFILNLPAASS 375
Query: 354 YIGPEKVFGTSIIPDDTNS------DLLPIIYPIKDYDFFVPQGHKKDDDKPKFEDIPES 407
YIGP+KVFG D+ NS + LPII + DY F+P HKK +P +PES
Sbjct: 376 YIGPKKVFGI----DEDNSVGIEHEEGLPIIRIVDDYTDFIPDNHKKGH-RPN--SLPES 428
Query: 408 LRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSRFQMWQNHIKELVENLFNYYKHEIE--- 464
L+ A+K FI++ A RIARGQ HNSMLIHV+R+ Q +++LV K IE
Sbjct: 429 LKRAIKSFIISSAARIARGQDKSHNSMLIHVTRYTDVQTRVRDLVSEELESLKKRIEYGD 488
Query: 465 -ASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKFSDIDKNLSVHSWEEIKPQLFKAV 523
S++ I+ E ++ + YK T+ + E F D+ ++ W++IK L KA
Sbjct: 489 GRSESNIFNELEELW------FTDYKPTTHAVIERFF---DQRITSLEWKQIKDNLQKAA 539
Query: 524 QKIEVKSINGTSGDCLTYYENEKNGISVIAIGGDKLSRGLTLEGLSVSYFLRASKMYDTL 583
KI +K++NGT D L Y E+ NG++ I IGGDKLSRGLTLEGL+VSY+LR S MYDTL
Sbjct: 540 SKIIIKTVNGTVADVLDYKES-PNGLNAIIIGGDKLSRGLTLEGLTVSYYLRTSNMYDTL 598
Query: 584 MQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEELRGEFNYLAESGGTPENYALKV 643
MQMGRWFG+RPGYVDLCRL+T++EL + ++HIT ASEEL+ EF Y+ G PE+Y L+V
Sbjct: 599 MQMGRWFGHRPGYVDLCRLYTTDELVDRYKHITKASEELKREFEYMVAMGRRPEDYGLRV 658
Query: 644 RTHPGS-LQITSVSKMRYAKQISVSWAGRLIESYQLPMDKGIKKRNLIATDNFISAQGMP 702
RT PG L IT+ +KMRY + + +S+ GRL+ES + + I ++N A + FI +G
Sbjct: 659 RTDPGDILLITAKNKMRYGRDMQLSYEGRLVESVKFNLSNAIIEKNFNAFNEFIKKRGQH 718
Query: 703 EKKGNNYLWRNVSPDDVCDYLSKFKVANSLKKVDLDMISNYIQELVKKGELTSWSVVVMN 762
+K NYLW NVS + D L+ ++ D ++++YIQ K EL W+V ++N
Sbjct: 719 SEKWGNYLWENVSASSIIDLLNNIQIEGYSAVTDSRILTHYIQAQQKHNELKQWTVALIN 778
Query: 763 KNQPAVQYTFSNSIQAGCFDRNRAEDTNWNTYY-IRKNHIVGNQTDEFIDLDEDLINAAL 821
KN YT G +R A +TN N +Y + K+HI+ ++ DE+IDL ++ + AL
Sbjct: 779 KNNADNHYTIG-GYPVGLVERRNANETNPNEFYQMAKSHII-SRRDEYIDLSDEELKLAL 836
Query: 822 ERTRQRKAELNRGWDKEYPAPEIVRQEFRPRTNPLLLIYPLNPECANVKDKHGNIQSGTI 881
R+ + ++ + P PE +R + RP LLLIYPL+ H + + I
Sbjct: 837 SRSIKSDGTMS-----DTPTPEAIR-DVRPSHRGLLLIYPLD---------HNKLDNTNI 881
Query: 882 SYSKTDDPFIGFVISFPSSSTNIAISYAVNQAA--EFAKTE 920
+ P IG ISFPSS T I Y VN EF + E
Sbjct: 882 T-----KPVIGLAISFPSSKTATRIGYRVNSVGDVEFLENE 917
>gi|108761850|ref|YP_631790.1| hypothetical protein MXAN_3602 [Myxococcus xanthus DK 1622]
gi|108465730|gb|ABF90915.1| conserved hypothetical protein [Myxococcus xanthus DK 1622]
Length = 951
Score = 586 bits (1511), Expect = e-165, Method: Composition-based stats.
Identities = 386/984 (39%), Positives = 555/984 (56%), Gaps = 118/984 (11%)
Query: 1 MNDNYQVAIEICQSIIG---RKASVTDSEINEAAIK-AKMIYPD----VDAAKLKNDLLS 52
MN+ Y A I Q ++ ++ V ++ A + A + P+ VD +L DL S
Sbjct: 1 MNNAYDSAWSIAQILLRGQVQQGQVLTRDMIAAKVDLALQMDPNSKSQVDRERLVADLES 60
Query: 53 MYSVKIDAFQILEGRERRDPWLKDFRANKMSKWEFWMRYAEYLEKHKKFAPSVILQLDEL 112
++V + LE E WL +++ W+FW RY L++ K++A + + +LDEL
Sbjct: 61 SFTVWMGNAVTLEHNEDHIAWLNSRKSD--IDWKFWKRYQRLLQQ-KRWAAASLEKLDEL 117
Query: 113 TDRVLDKLFNPQRNEIQISKKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNN 172
T L +L +P R + S++GLVVG VQSGKTANYTGLICKAADAG+ +IIVLAGIH +
Sbjct: 118 TYETLGRLEDPNRKGMW-SRRGLVVGHVQSGKTANYTGLICKAADAGYKVIIVLAGIHKS 176
Query: 173 LRSQTQNRIDEGFLGFDTQYERAYTMNNTTKIGVGLIPGFDNAI-ANSYTTSLEKGDFTS 231
LRSQTQ R+DEGFLG++++ E + + + IGVGLI D +I AN+ TT + GDF
Sbjct: 177 LRSQTQIRLDEGFLGYESKDE-SVEGSISKHIGVGLI---DPSIRANTITTRADDGDFKM 232
Query: 232 RAANTAGFNFNVPQ---PILLVVKKNASVLNRLYKWLQ-----TQTINEK----ITNKSL 279
NF++ P+L VVKKNA VL L W++ T T+ + + + L
Sbjct: 233 SVRR----NFHIGPGGIPLLFVVKKNARVLANLNSWVERFAQKTLTVRDVERRVVPDVPL 288
Query: 280 LIIDDEADNASINTNRKEL----------DPTTINRNICSIISLFNRSAYVGYTATPFAN 329
L+IDDEADNASI+T +E DPT IN+ I ++ LF R+AYVGYTATPFAN
Sbjct: 289 LVIDDEADNASIDTRAQEFNEEGRPDPDHDPTAINKQIRKLLHLFERNAYVGYTATPFAN 348
Query: 330 IFI------PQNEDDLFPRDFIINIPAPTNYIGPEKVFGTSIIP--DDTNSDLLPIIYPI 381
IFI ++ DLFPR FI N+PAP++Y+GP +VFG P + + L +I I
Sbjct: 349 IFIHDAGQTDEHGADLFPRSFITNLPAPSDYVGPVRVFGLDPDPMLNAEEREPLGLIRHI 408
Query: 382 KDY---------DFFVPQGHKKDDDKPKFE---DIPESLRIAVKCFIVTCAIRIARGQGT 429
D+ + ++P HKK+ KP+F+ ++P SLR A+ F++ CA R ARGQ
Sbjct: 409 SDHADSPNPGERNGWIPPVHKKEH-KPRFKGKSELPPSLRTALHSFLLVCAARRARGQLN 467
Query: 430 RHNSMLIHVSRFQMWQNHIKELVENLFNYYKHEIE----ASDTAIYEEFRKILEDDTANY 485
HNSML+HV+RF Q + V+ + +E A T+++ E R++ ++D
Sbjct: 468 EHNSMLVHVTRFTDVQKEVFGQVKRALKEVQDHLELGTAAGVTSLHSELRRLWDED---- 523
Query: 486 KSYKTITNEIKESKFSDIDKNLSVHSWEEIKPQLFKAVQKIEVKSINGTSGDCLTYYENE 545
+ T I D +L V SW ++ L AV I+VK INGT+GD L Y E+
Sbjct: 524 --FVPTTQSIN-------DLSLPVLSWSKVSLHLKAAVAGIKVKQINGTAGDVLDYEEHR 574
Query: 546 KNGISVIAIGGDKLSRGLTLEGLSVSYFLRASKMYDTLMQMGRWFGYRPGYVDLCRLFTS 605
G+SVIA+GGDKL+RGLTLEGLSVSYFLRAS+MYDTLMQMGRWFGYRPGY+DLCRL+ +
Sbjct: 575 ATGLSVIAVGGDKLARGLTLEGLSVSYFLRASRMYDTLMQMGRWFGYRPGYLDLCRLYMT 634
Query: 606 EELNEWFRHITIASEELRGEFNYLAESGGTPENYALKVRTHPGSLQITSVSKMRYAKQIS 665
EEL++WF+H+T+A EELR EF+ ++ GG P +Y LKVR+H +L ITS KMR +
Sbjct: 635 EELDDWFQHVTMAGEELRQEFDRMSAVGGFPADYGLKVRSH-STLLITSPVKMRNGVDLM 693
Query: 666 VSWAGRLIESYQLPMDKGIKKRNLIATDNFISAQGM------------PEKKGNNYLWRN 713
+S+ G +IE+ + + + NL +T F+ G E+ +Y+W
Sbjct: 694 ISFDGDIIETTNFDPRREVLEHNLASTVRFLEKLGRRAPGWIQSRPSGSEEWVTSYVWPE 753
Query: 714 VSPDDVCDYLSKFKVANSLKKVDLDMISNYIQELVKKGELTSWSVVVMNKNQPAVQYTFS 773
VS + V ++L+ V S K ++S YI++ + + EL+ W+VV+ + V+ +
Sbjct: 754 VSAEQVAEFLATLHVPQSSTKAVGVLLSEYIKKQLSQRELSQWTVVLFGGGE-GVKENIA 812
Query: 774 NSIQAGCFDRN-RAEDTNWNTYYIRKNHIVGNQTDEFIDLDEDLINAALERTRQR----K 828
+ G R + +D Y IR+ + + DE IDLD AL+ T+
Sbjct: 813 -GLPIGLTKRQAKNKDATNGLYKIRR---LVSPRDEAIDLDAAAYARALKLTKDNFHPDS 868
Query: 829 AELNRGWDKEYPAPEIVRQEFRPRTNPLLLIYPLNPECANVKDKHGNIQSGTISYSKTDD 888
R E P+ +RQ RP+ LL+IYPL+P + T +
Sbjct: 869 GRTKRDKMPEAPSGPFIRQ-VRPKDRGLLIIYPLDP------------AADTSKVLQLTS 915
Query: 889 PF-IGFVISFPSSSTNIAISYAVN 911
P IG+ +SFP+S T + + Y VN
Sbjct: 916 PIPIGWAVSFPASETALKVPYRVN 939
>gi|89901842|ref|YP_524313.1| endonuclease [Rhodoferax ferrireducens T118]
gi|89346579|gb|ABD70782.1| endonuclease [Rhodoferax ferrireducens DSM 15236]
Length = 966
Score = 577 bits (1488), Expect = e-162, Method: Composition-based stats.
Identities = 373/966 (38%), Positives = 521/966 (53%), Gaps = 152/966 (15%)
Query: 39 PDVDAAKLKNDLLSMYSVKIDAFQILEGRERRDPWLKDFRANKMSKWEFWMRYAEYLEKH 98
PD D + L+ +S I L+ E WL R W +W RY YLE+
Sbjct: 47 PDSDQDMAISTLIQRFSHWIGKDSTLQDTEGHVAWLVSARKKD---WRYWPRYQTYLER- 102
Query: 99 KKFAPSVILQLDELTDRVLDKLFNPQRNEIQISKKGLVVGQVQSGKTANYTGLICKAADA 158
K + V+ LDE TD +L L +P+R+ ++GLVVG VQSGKT NY+GL+CKAADA
Sbjct: 103 -KLSVDVVGALDESTDHILGMLEDPKRDG-SWDRRGLVVGHVQSGKTGNYSGLVCKAADA 160
Query: 159 GFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYERAYTMNNTTKIGVGLIPGFDNAIA- 217
G+ +IIVLAG+HNNLRSQTQ R++E FLG++T +R +GVG I D AIA
Sbjct: 161 GYKIIIVLAGMHNNLRSQTQIRLEETFLGYETSEDRI----PGKPLGVGEIDS-DTAIAP 215
Query: 218 NSYTTSLEKGDFTSRAANTAGFNFNVP---QPILLVVKKNASVLNRLYKWLQTQTINEK- 273
+ TT +KGDF + T +F + +P L VVKKN +VL +L KW++ + K
Sbjct: 216 HCATTRADKGDFNA----TVQRHFAISPEEKPWLFVVKKNKTVLTQLLKWIRNHVADSKD 271
Query: 274 -------ITNKSLLIIDDEADNASINTNRKELD----------PTTINRNICSIISLFNR 316
+T + LLIIDDEADNAS++T + +D P IN I SI+ F++
Sbjct: 272 NATGRRIVTKRPLLIIDDEADNASVDTGEQVIDSDGKPNDDHQPKAINGLIRSILHSFDK 331
Query: 317 SAYVGYTATPFANIFIPQNE------DDLFPRDFIINIPAPTNYIGPEKVFGTSIIPDDT 370
AYVGYTATPFANIFI + DLFP+ FI+N+ AP+NY+GP +VFG ++ D
Sbjct: 332 KAYVGYTATPFANIFIHRKNATTDEGPDLFPQSFIVNLAAPSNYVGPARVFG--LLTPDG 389
Query: 371 NSDLLPIIYPIKDYDFFVPQGHKKDDDK----PKF-----------EDIPESLRIAVKCF 415
LP++ + D+ F P + D+ PK E IP SL+ A+ F
Sbjct: 390 RVGGLPLMRNVSDH-FTAPDPNSADEPSGWMPPKHNKEHVPVVNGQEIIPPSLKEAIHSF 448
Query: 416 IVTCAIRIARGQGTRHNSMLIHVSRFQMWQNHI--------KELVENLFNYYKHEIEASD 467
+ CAIR RGQG H+SMLIHV+RF Q + K++V+ L HE
Sbjct: 449 ALACAIRTLRGQGREHSSMLIHVTRFANVQKEVFGQVDETVKKMVQRLTRRIDHE----- 503
Query: 468 TAIYEEFRKILEDD-TANYKSYKTITNEIKESKFSDIDKNLSVHSWEEIKPQLFKAVQKI 526
+ EE R + E D + + E ESK + W I L AV I
Sbjct: 504 -TLVEELRSLWERDFVPTSATVAQLLEEPGESKLAP--------DWPAILAALPDAVSDI 554
Query: 527 EVKSINGTSGDCLTYYENEKNGISVIAIGGDKLSRGLTLEGLSVSYFLRASKMYDTLMQM 586
+VK+INGT+ D L Y E E++G+ VIA+GGDKL+RGLTLEGL SYF+R +KMYDTLMQM
Sbjct: 555 KVKTINGTAKDALDYLEQEEHGLKVIAVGGDKLARGLTLEGLCTSYFVRTTKMYDTLMQM 614
Query: 587 GRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEELRGEFNYLAESGGTPENYALKVRTH 646
GRWFGYRPGYVDLCRL+T+E+L EWF HI ASEELR EF+ +AESG TP+ Y LKV++H
Sbjct: 615 GRWFGYRPGYVDLCRLYTTEDLVEWFGHIADASEELREEFDAMAESGATPKEYGLKVQSH 674
Query: 647 PGSLQITSVSKMRYAKQISVSWAGRLIESYQLPMDKGIKKRNLIATDNFISAQGMP---- 702
P L +TS KMR AK + +S++G L+E+ D +NL T+ I+A P
Sbjct: 675 P-VLLVTSPLKMRTAKSLQLSFSGELLETIAFFKDDARLDQNLTVTNRLIAAMQKPSEVN 733
Query: 703 --------EKKGNNYLWRNVSPDDVCDYLSKFKVANSLKKVDLDMISNYIQELV-KKGEL 753
++ N +LW V + V D+L + +KV+ +++ +++E+V K EL
Sbjct: 734 PVRRRAGVDQPTNGFLWNAVPAEHVADFLESYVTHPKARKVNSKLLAEFVREMVSKSAEL 793
Query: 754 TSWSVVVMNKNQPAVQYTFSNSIQA-GCFDRNRAEDTNWNTYYIRKNHIVG---NQTDEF 809
TSW+V ++ + ++TF + G R+ D +++ + +G + DE
Sbjct: 794 TSWTVALIGGGR-GPEFTFDGGLTVDGTLQRSADPD-------VKERYSIGRLLSPRDEA 845
Query: 810 IDLDEDLINAALERTRQRKAELNRGWDKEYPAPE---IVRQEFRPRTNP----------- 855
IDLD +AAL T+ R W K PA + +V++E P
Sbjct: 846 IDLDGPAWSAALAATK-------RAW-KPDPAKQSAGVVQKEPEVPNGPAIRRVRGKGAD 897
Query: 856 ---------LLLIYPLNPECANVKDKHGNIQSGTISYSKTDDPFIGFVISFPSSSTNIAI 906
+LL+YPL+P+ A G + P +GF +SFPSS + + +
Sbjct: 898 GVQSAPERGVLLLYPLDPKLA-----------GAGVFPDRTKPIMGFGVSFPSSESGVKV 946
Query: 907 SYAVNQ 912
Y V+
Sbjct: 947 EYKVDH 952
>gi|23014638|ref|ZP_00054444.1| hypothetical protein Magn03009084 [Magnetospirillum magnetotacticum
MS-1]
Length = 954
Score = 577 bits (1487), Expect = e-162, Method: Composition-based stats.
Identities = 370/959 (38%), Positives = 521/959 (54%), Gaps = 102/959 (10%)
Query: 19 KASVTDSEINEAAIKAKMIYPD----VDAAKLKNDLLSMYSVKIDAFQILEGRERRDPWL 74
+++VT + I+E + P +D + ++L+ +S+ I L+ + WL
Sbjct: 26 RSAVTPALISEKIDLVLTMKPKWGEGLDREAVTDELIRRFSLWIGEDTTLKSDAGHEAWL 85
Query: 75 KDFRANKMSKWEFWMRYAEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQRNEIQISKKG 134
A + W +W RY E+LE+ + + + LD+ TD VL L +P R E ++G
Sbjct: 86 V---ATRKRDWRYWQRYREWLER--RLSYKAVEALDKSTDPVLAMLEDPLR-EGAWDRRG 139
Query: 135 LVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYER 194
LVVG VQSGKT +Y+GLICKAADAG+ +IIVLAG+HNNLRSQTQ R+DE FLG++T+
Sbjct: 140 LVVGHVQSGKTGHYSGLICKAADAGYKIIIVLAGLHNNLRSQTQMRLDEAFLGYETKPNH 199
Query: 195 AYTMNNTTKIGVGLIPGFDNAIANSYTTS-LEKGDFTSRAANTAGFNFNVPQPILLVVKK 253
+T IGV I G D AI +Y T+ GDF + A G +P L VVKK
Sbjct: 200 ----EDTLPIGVSEIDG-DPAIRPNYATNRTNGGDFNAGIAQKLGITPE-QRPWLFVVKK 253
Query: 254 NASVLNRLYKWLQTQTIN--------EKITNKSLLIIDDEADNASINTNRKELD------ 299
N +VL RL +W++ N +TN L +IDDEAD+AS++T LD
Sbjct: 254 NKTVLERLLRWIRNHVANVHDPETGRRIVTNLPLFLIDDEADHASVDTGETVLDSDGKPD 313
Query: 300 ----PTTINRNICSIISLFNRSAYVGYTATPFANIFIPQNED------DLFPRDFIINIP 349
PT INR I+ F+RSAYVGYTATPFANIFI + + DLFP FI+N+
Sbjct: 314 PDHQPTAINRLTRRILHSFSRSAYVGYTATPFANIFIHERGETREEGPDLFPASFIVNLA 373
Query: 350 APTNYIGPEKVFGTSIIPDDTNSDLLPIIYPIKDY------DFFVPQGHKKDDDKPKF-- 401
AP+NYIGP KVFGTS + LP++ + DY ++P H+ + +P
Sbjct: 374 APSNYIGPGKVFGTS--GKEGRDGGLPLVRQVDDYCTSDGLGGWMPPKHR-NGHQPLVDG 430
Query: 402 ED-IPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSRFQMWQNHIKELVENLFNYYK 460
ED +P SL A+ F++ CAIR RGQ H+SML+HV+RF Q + V+ + K
Sbjct: 431 EDTLPGSLNEAIDAFVLACAIRRLRGQADGHSSMLVHVTRFTSVQKAVHSQVDERVRHIK 490
Query: 461 HEI--EASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKFSDIDKNLSVHSWEEIKPQ 518
+ + R + E D ++ T+++ E+ + N W EI+
Sbjct: 491 QRLTRRIGHEPVVAALRGLWERD------FQPTTSDMAEAHPDLV--NGDTFGWREIEAT 542
Query: 519 LFKAVQKIEVKSINGTSGDCLTYYENEKNGISVIAIGGDKLSRGLTLEGLSVSYFLRASK 578
L + I+V+ INGT+ D L Y +++ G+ VIAIGGDKL+RGLTLEGL+VSYFLRASK
Sbjct: 543 LADTISDIQVRMINGTAKDALDYADHDGAGLKVIAIGGDKLARGLTLEGLTVSYFLRASK 602
Query: 579 MYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEELRGEFNYLAESGGTPEN 638
MYDTLMQMGRWFGYRPGY+DLCRL+T+ EL EWF HI ASEELR EF+ +A SG TP
Sbjct: 603 MYDTLMQMGRWFGYRPGYLDLCRLYTTGELCEWFGHIADASEELREEFDLMAASGATPRE 662
Query: 639 YALKVRTHPGSLQITSVSKMRYAKQISVSWAGRLIESYQLPMDKGIKKRNLIATDNFISA 698
Y LKV++HP L +TS KMR A+ + +S++G+L+E+ ++ + +RNL A +S+
Sbjct: 663 YGLKVQSHP-VLLVTSRLKMRSARNLMLSFSGQLLETVAFHRERDVLERNLAAARRLVSS 721
Query: 699 QGMPEKKG-----------NNYLWRNVSPDDVCDYLSKFKVANSLKKVDLDMISNYIQEL 747
G PE+ YLW V DV D+L ++ KV+ ++S +IQ L
Sbjct: 722 LGAPEENPLRIRNGTKQVWRGYLWEGVPAADVVDFLIAYRTHPEAHKVNSVLLSQFIQSL 781
Query: 748 VKKGELTSWSVVVMNKNQPAVQYTFSNSIQAGCFDRNRAEDTNWNTYYIRKNHIVGNQTD 807
+ +GELT+W+V ++ Q F + + R A + + Y IR+ + + D
Sbjct: 782 MAEGELTNWTVALIGGGD-GKQAAFRDGVSVDML-RRAAHGAHTDRYSIRR---LMSPRD 836
Query: 808 EFIDLDEDLINAALERTRQR-KAELNRGWDKEYP----APEIVR-QEFRPRTNP------ 855
E IDLDE + AL TR A+ R E P P I R + F P
Sbjct: 837 EAIDLDEAAWSVALAVTRAAFHADPARHGSGEPPDAPNGPAIRRIRGFGGEGVPARPDRG 896
Query: 856 LLLIYPLNPECANVKDKHGNIQSGTISYSKTDDPFIGFVISFPSSSTNIAISYAVNQAA 914
+L IY ++P+ A + P + F +SFP S + + Y VN A
Sbjct: 897 VLFIYAIDPDLAGPE----------AGLPPDAPPVVAFAVSFPDSRSGTKVEYKVNNVA 945
>gi|69937048|ref|ZP_00631769.1| endonuclease [Paracoccus denitrificans PD1222]
gi|119384820|ref|YP_915876.1| endonuclease [Paracoccus denitrificans PD1222]
gi|119374587|gb|ABL70180.1| endonuclease [Paracoccus denitrificans PD1222]
Length = 955
Score = 555 bits (1431), Expect = e-156, Method: Composition-based stats.
Identities = 366/970 (37%), Positives = 542/970 (55%), Gaps = 122/970 (12%)
Query: 6 QVAIEICQSIIGRKASVTDSEINEAAIKAKMIYPDVDAAKLKNDLLSMYSVKIDAFQILE 65
++A E QS + + + E+ + +I + + VD L ++L+ S + L
Sbjct: 21 RLAAERAQSPV--TPEMIEKELTKLSIMMEDDFALVDRDALVDELIRRSSRTVGENATLS 78
Query: 66 GRERRDPWLKDFRANKMSKWEFWMRYAEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQR 125
E WL A + W +W RY+EY+E + + + LD TD VL +L +P R
Sbjct: 79 SGEDHVAWLD---AERKKGWTYWQRYSEYMEA--RIPWTALDALDVATDEVLSQLEDPTR 133
Query: 126 NEIQISKKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGF 185
E ++GLVVG VQSGKT NYTGLICKAADAG+ +IIVLAG+HNNLR+QTQ R+DEGF
Sbjct: 134 -EGAWDRRGLVVGHVQSGKTGNYTGLICKAADAGYKIIIVLAGLHNNLRAQTQIRLDEGF 192
Query: 186 LGFDTQYERAYTMNNTTKIGVGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVPQ 245
LGF T + + +GVGLI + N+ T +KGDF + A A N + Q
Sbjct: 193 LGFATIAD----ADELPAVGVGLIDKDMSVRPNAATNRSDKGDFNTAVA--AKMNISPEQ 246
Query: 246 -PILLVVKKNASVLNRLYKWLQTQTIN--------EKITNKSLLIIDDEADNASINTNRK 296
P L VVKKN +VL RL W++ + N + +TN L++IDDE+D+ S++T
Sbjct: 247 RPWLFVVKKNKTVLERLLHWIRNRVANHVDPETGRKLVTNLPLMVIDDESDHGSVDTGED 306
Query: 297 ELD----------PTTINRNICSIISLFNRSAYVGYTATPFANIFI------PQNEDDLF 340
+D P TINR I SI+ F+R AYVGYTATPFANIFI ++ DLF
Sbjct: 307 VVDEFGNPDLEHEPKTINRLIRSILHHFSRKAYVGYTATPFANIFIHDRGETQEHGPDLF 366
Query: 341 PRDFIINIPAPTNYIGPEKVFGT-SIIPDDTNSDLLPIIYPIKDYDF--FVPQGHKKDDD 397
P FI ++ AP+NY+GP +VFG+ S P+D LP++ P+ D +F ++P HK +
Sbjct: 367 PAAFITSLAAPSNYVGPGRVFGSASSTPED-----LPLVRPLSDDEFQPWMPPRHK-NGY 420
Query: 398 KPKFED---IPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSRFQMWQN----HIKE 450
+P+++ +P+SL A++ F+ CA+R RGQG++H+SMLIHV+RF QN + E
Sbjct: 421 RPRWQGEDRVPDSLAEAIRSFVYACAVRKLRGQGSKHSSMLIHVTRFTSVQNAVVNQVAE 480
Query: 451 LVENLFNYYKHEIEAS--DTAIYEEFRKILEDDTANYKSYKTITNEIKESKFSDIDKNLS 508
V +L Y IE + ++ E+++ + I + + E + L
Sbjct: 481 YVRDLKGRYTRGIELDNLEASMRTEYQETF------LPGMQRIRSALVEGE------ALE 528
Query: 509 VHSWEEIKPQLFKAVQKIEVKSINGTSGDCLTYYENEKNGISVIAIGGDKLSRGLTLEGL 568
SW +I+ L + I V+ INGT+ D L Y EN+ G+ VIAIGGDKL+RGLTLEGL
Sbjct: 529 DFSWADIRAVLPDVLSDIRVREINGTAKDALDYAENDGTGLKVIAIGGDKLARGLTLEGL 588
Query: 569 SVSYFLRASKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEELRGEFNY 628
SYFLR ++MYDTLMQMGRWFGYR GY+D+CRL+TS+E+ EWF HI A+EELR EF+
Sbjct: 589 CTSYFLRTARMYDTLMQMGRWFGYRDGYLDVCRLYTSQEMVEWFGHIADAAEELRQEFDN 648
Query: 629 LAESGGTPENYALKVRTHPGSLQITSVSKMRYAKQISVSWAGRLIESYQLPMDKGIKKRN 688
+ +G TP+ + L+V++H L +TS +KMR A+ + ++++G L+++ P K N
Sbjct: 649 MVAAGATPKQFGLRVKSH-SVLTVTSRAKMRNARAMQLTYSGDLLQTIVFPNRKDDITAN 707
Query: 689 LIATDNFISAQG----------MPE-KKGNNYLWRNVSPDDVCDYLSKFKVANSLKKVDL 737
ATD FI+A G +P+ +K N +LWR+V V +L ++ + ++
Sbjct: 708 FKATDRFINALGPSTDLNDQHFVPQGQKWNGHLWRDVPALSVISFLRDYRTHPASFRIMS 767
Query: 738 DMISNYIQELVKKGELTSWSVVVMNKNQ-PAVQYTFSNSIQAGCFDRNRAEDTNWNTYYI 796
+I+++I+E+ K ELTSW+V ++ K+ P +Y R R + + + Y I
Sbjct: 768 PLIADFIEEMNKDSELTSWTVALIGKDAGPDDKYRKVGGSSVNMLKRRRTTE-HADRYSI 826
Query: 797 RKNHIVGNQTDEFIDLDEDLINAAL---ERTRQRKAELNRGWDKEYPA----PEI----- 844
+ + + D+ IDL E AAL ++T + + N G KE P+ P+I
Sbjct: 827 KT---LISPRDQAIDLTEAEWKAALGLSQKTWRNDTDRNEG--KEPPSEPRGPQIRHILG 881
Query: 845 -------VRQEFRPRTNPLLLIYPLNPECANVKDKHGNIQSGTISYSKTDDPFIGFVISF 897
VR R LL++Y L+PE A V++ K DP + + ISF
Sbjct: 882 EGDAEAGVRAR---RERGLLMLYLLDPEGAEVEE------------LKDADPVVAWAISF 926
Query: 898 PSSSTNIAIS 907
PSS++ +S
Sbjct: 927 PSSNSERRVS 936
>gi|134095049|ref|YP_001100124.1| hypothetical protein HEAR1845 [Herminiimonas arsenicoxydans]
gi|133738952|emb|CAL61999.1| conserved hypothetical protein [Herminiimonas arsenicoxydans]
Length = 967
Score = 553 bits (1424), Expect = e-155, Method: Composition-based stats.
Identities = 358/987 (36%), Positives = 525/987 (53%), Gaps = 120/987 (12%)
Query: 3 DNYQVAIEICQSIIGR---KASVTDSEINEAAIKAKMIYPD----VDAAKLKNDLLSMYS 55
+N + + Q+++ R K+S+T I E + P+ VD + ++L+ +S
Sbjct: 13 ENLNSIVHLAQTLLLRIKDKSSITPMLIAEKVAIVLALDPEMADGVDTQNVIDELVRRFS 72
Query: 56 VKIDAFQILEGRERRDPWLKDFRANKMSKWEFWMRYAEYLEKHKKFAPSVILQLDELTDR 115
+ I IL E + WL A + W +W RY+ +LE+ K + + + L E TD
Sbjct: 73 LWIGEDSILSSDEGHEAWLN---AARKKDWRYWSRYSGWLER--KMSATAVDSLGESTDH 127
Query: 116 VLDKLFNPQRNEIQISKKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRS 175
+L L +P+R E ++GLVVG VQSGKT NY+GL+CKAADAG+ LIIVLAG+HNNLRS
Sbjct: 128 ILGLLEDPKR-EGAWDRRGLVVGHVQSGKTGNYSGLVCKAADAGYKLIIVLAGLHNNLRS 186
Query: 176 QTQNRIDEGFLGFDTQYERAYTMNNTTKIGVGLIPGFDNAIANSYTTSLEKGDFTSRAAN 235
QTQ R++E FLG YE + + + +GVG I +T GDF ++ AN
Sbjct: 187 QTQIRLEESFLG----YETSASGDAVKMVGVGEIDSDTKIFPACFTNRSNNGDFNTKVAN 242
Query: 236 TAGFNFNVPQ--PILLVVKKNASVLNRLYKWLQTQTINEK-------ITNKSLLIIDDEA 286
N P+ P L VVKKN +VL RL KW++ + ++ LL+IDDEA
Sbjct: 243 HLA---NRPEERPWLFVVKKNKTVLERLLKWIRNHVADATDSSGRRYVSGLPLLLIDDEA 299
Query: 287 DNASINTNRK----------ELDPTTINRNICSIISLFNRSAYVGYTATPFANIFIPQNE 336
D+AS++T + E P IN I I+ F++S+YVGYTATPFAN+FI +
Sbjct: 300 DHASVDTGEQIHDADGLPDDEYQPKAINSRIRKILHSFSKSSYVGYTATPFANVFIHRRN 359
Query: 337 ------DDLFPRDFIINIPAPTNYIGPEKVFG-------------TSIIPDDTNSDLLPI 377
DLFP FIIN+ AP+NY+GP ++FG T II D N+D
Sbjct: 360 ATKDEGPDLFPSAFIINLAAPSNYVGPARIFGLNRSEGRTEGLPLTRIISDHVNADKTGG 419
Query: 378 IYPIKDYDFFVPQGHKKDDDKPKFEDIPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIH 437
P K +P D +P+SL AV F++ CA+R RGQ H+SMLIH
Sbjct: 420 WMPAKHNKEHIPSVDGVDT-------LPKSLEQAVNAFVIACAVRNLRGQTAEHSSMLIH 472
Query: 438 VSRFQMWQNHIKELVENLFNYYKHEIEASDTAIYEEFRKILEDDT-ANYKSYKTITNEIK 496
V+RF Q + VEN + I R I + +T S T E
Sbjct: 473 VTRFTAVQGEVHRQVENQIRRMRQRI----------LRGIDQQETLVQLHSLWTTDFEPT 522
Query: 497 ESKFSDI---DKNLSVH---SWEEIKPQLFKAVQKIEVKSINGTSGDCLTYYENEKNGIS 550
++F ++ D+ L+ SW I L + ++ IEVK INGT+ D L Y +E G+
Sbjct: 523 YTEFCELAGRDEGLAPPMNPSWSSILDALPQVLEDIEVKMINGTAKDALDYANSEGKGLK 582
Query: 551 VIAIGGDKLSRGLTLEGLSVSYFLRASKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNE 610
+IAIGGDKL+RGLTLEGL SYF+R SKMYDTLMQMGRWFGYRPGY+D+CRL+T+E+L E
Sbjct: 583 IIAIGGDKLARGLTLEGLCTSYFVRTSKMYDTLMQMGRWFGYRPGYLDVCRLYTTEDLIE 642
Query: 611 WFRHITIASEELRGEFNYLAESGGTPENYALKVRTHPGSLQITSVSKMRYAKQISVSWAG 670
WF HI ASEELR EF+ + ++G TP++Y +KV+ H L +TS KMR A+ + +S++G
Sbjct: 643 WFGHIADASEELREEFDVMVQNGDTPKDYGMKVKAH-DILLVTSPLKMRSAETLHLSFSG 701
Query: 671 RLIESYQLPMDKGIKKRNLIATDNFISAQGMPEKK-------------GNNYLWRNVSPD 717
LI++ L D+ ++NL AT+ + + G+P ++ ++W + +
Sbjct: 702 DLIQTVVLHTDEVPLEKNLKATNKLLESMGVPTERTPTKNRDDTKDQLSGAFVWSGIGHE 761
Query: 718 DVCDYLSKFKVANSLKKVDLDMISNYIQELVKKGELTSWSVVVMNKNQ-PAVQYTFSNSI 776
V ++LS + +V+ M+ +I+++V GELTSW+V +M ++ YTF + I
Sbjct: 762 TVIEFLSAYSTHKDAYRVNSAMLVKFIEKMVIHGELTSWTVALMGVSKGNGGAYTFESGI 821
Query: 777 QAGCFDRNRAEDTNWNTYYIRKNHIVGNQTDEFIDLDEDLINAALERTR---QRKAELNR 833
+ Y I ++ + +DE IDLD+ AAL T+ + NR
Sbjct: 822 TVDSMPMRGSNSLGKGKYSI---GVLTDPSDEGIDLDDLEWTAALNMTKAAWKPDPARNR 878
Query: 834 GWDKEYPAPEIVR-------QEFRPRTNP-LLLIYPLNPECANVKDKHGNIQSGTISYSK 885
P+ +++R + P + LLL+YP++P K+ SG
Sbjct: 879 INQPTKPSGKMIRIVRGLGTSDIPPSPDRGLLLLYPVSPLSEGAKELVPKNWSG------ 932
Query: 886 TDDPFIGFVISFPSSSTNIAISYAVNQ 912
P +GF ISFP+S + ++Y V+
Sbjct: 933 ---PIMGFAISFPASESGETVAYKVDH 956
>gi|119773419|ref|YP_926159.1| endonuclease [Shewanella amazonensis SB2B]
gi|119765919|gb|ABL98489.1| endonuclease [Shewanella amazonensis SB2B]
Length = 951
Score = 543 bits (1399), Expect = e-152, Method: Composition-based stats.
Identities = 369/973 (37%), Positives = 518/973 (53%), Gaps = 99/973 (10%)
Query: 1 MNDNYQVAIEICQSIIGRKASVTDS----EINEAAIKAKMIYPDVDAAKLKNDLLSMYSV 56
+N Q + Q+++ + VT + +IN K D D + + ++L+ S
Sbjct: 3 LNPTEQKVLIFVQNMLEAGSQVTPAIIQEQINLVLNMKKEWRADTDESAVIDELIRRAST 62
Query: 57 KIDAFQILEGRERRDPWLKDFRANKMSKWEFWMRYAEYLEKHKKFAPSVILQLDELTDRV 116
+ +L E WL ++ W +W RY EY EK K + +++ LD+ TD +
Sbjct: 63 WVGDDALLTNDEGHIAWLNQ---DRKQGWRYWQRYREYQEK--KLSWNIVEGLDKSTDLI 117
Query: 117 LDKLFNPQRNEIQISKKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQ 176
L L +P+R + ++GLVVG VQSGKT NYTGLICKAADAG+ +IIVLAG+HNNLRSQ
Sbjct: 118 LGNLEDPKR-QGPWDRRGLVVGHVQSGKTGNYTGLICKAADAGYKIIIVLAGMHNNLRSQ 176
Query: 177 TQNRIDEGFLGFDTQYERAYTMNNTTKIGVGLIPGFDNAIANSYTTSL-EKGDFTSRAAN 235
TQ R+DEGFLGF T + + +GVG I D AI ++ T+ EKGDF ++AA
Sbjct: 177 TQIRLDEGFLGFTT----SVIQDEMHIVGVGEIDR-DPAIRPNFATNRSEKGDFNTKAAK 231
Query: 236 TAGFNFNVPQPILLVVKKNASVLNRLYKWLQTQTIN--------EKITNKSLLIIDDEAD 287
G + +P L VVKKN SVL RL KW+Q N + +T+ LLIIDDEAD
Sbjct: 232 HLGISPE-ERPWLFVVKKNKSVLERLLKWIQDHVANIQDPETGRKLVTHLPLLIIDDEAD 290
Query: 288 NASINTNRKELD----------PTTINRNICSIISLFNRSAYVGYTATPFANIFIPQNED 337
NAS++T + D PT INR I+ F R AYVGYTATPFANIFI +
Sbjct: 291 NASVDTGDQAYDEFGKPDLEHEPTAINRLTRRILHSFTRKAYVGYTATPFANIFIHNRSE 350
Query: 338 ------DLFPRDFIINIPAPTNYIGPEKVFGTSIIPDDTNSDLLPIIYPIKDY------D 385
DLFP FI N+ A ++YIGP KVFG S ++ LP++ + D+
Sbjct: 351 TEKEGPDLFPSAFIANLSASSSYIGPAKVFGRS--SENGREGGLPLVKVVNDHCSEDEKS 408
Query: 386 FFVPQGHKKD--DDKPKFEDIPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSRFQM 443
++P H+ + IPESLR +V F++ CA R RGQ H SMLIHV+RF
Sbjct: 409 GWMPTKHRNGYYPSSDQQHGIPESLRESVDAFVLACAARRLRGQKDEHCSMLIHVTRFTS 468
Query: 444 WQNHIKELVENLFNYYKHEI--EASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKFS 501
Q H+KE V ++ K + + + +++ E D + T I
Sbjct: 469 VQGHVKEQVAAYVDWLKRRLVRRIDHYDVINQLKQLWESD------FLATTETIHSMHPD 522
Query: 502 DIDKNLSVHSWEEIKPQLFKAVQKIEVKSINGTSGDCLTYYENEKNGISVIAIGGDKLSR 561
+D+N +W +++ +L A+ +IEV++ING + + L Y ++ G+ VIA+GGDKL+R
Sbjct: 523 MVDEN--TLTWSQVEAELSDAISEIEVRAINGFAKEALDYADSAV-GLKVIAVGGDKLAR 579
Query: 562 GLTLEGLSVSYFLRASKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEE 621
GLTLEGL VSYFLRAS+MYDTLMQMGRWFGYRPGY+DLCRL+T+ +L EWF HI ASEE
Sbjct: 580 GLTLEGLCVSYFLRASRMYDTLMQMGRWFGYRPGYLDLCRLYTTIDLVEWFEHIADASEE 639
Query: 622 LRGEFNYLAESGGTPENYALKVRTHPGSLQITSVSKMRYAKQISVSWAGRLIESYQLPMD 681
LR EF+ + SG TP+ Y LKV +HP L +TS KMR A+ + +S++G+ +E+ L D
Sbjct: 640 LREEFDLMTASGATPKEYGLKVASHP-VLMVTSRLKMRNAQSLYLSFSGQSVETVSLLKD 698
Query: 682 KGIKKRNLIATDNFISAQGMP-----------EKKGNNYLWRNVSPDDVCDYLSKFKVAN 730
K NL A D G P +K GN W NV + + +LS++K
Sbjct: 699 LDSLKSNLQAFDRLTLRLGEPKPIPDRNRNGSKKSGNGVQWTNVPAEVIVQFLSEYKTHP 758
Query: 731 SLKKVDLDMISNYIQELVKKGELTSWSVVVMNKNQPAVQYTFSNSIQAGCFDRNRAEDTN 790
+ KV+ ++S++IQ + K+ ELTSW+V ++N Q + +S + +
Sbjct: 759 AAMKVNSRLLSDFIQNMNKEHELTSWTVAIINGGQERTYSIYGSSSVTVKLVKRTPKTIT 818
Query: 791 WNTYYIRK------NHIVGNQTDEFIDLDEDLINAALERTRQRKAELNRGWDKEYPAPEI 844
+ Y I + + + L+E N + R K + PA
Sbjct: 819 EDRYSIGRLLSPADEALDLDDDAWDAALNETRNNWKNDPARSEKYNGEPPKEPNGPALRK 878
Query: 845 VRQ------EFRPRTNPLLLIYPLNPECANVKDKHGNIQSGTISYSKTDDPFIGFVISFP 898
VR P LLLIY L+PE V D SY ++ P + F ISFP
Sbjct: 879 VRGFGAGHIAGHPERG-LLLIYLLSPE--GVDD----------SYDESTAPIVSFGISFP 925
Query: 899 SSSTNIAISYAVN 911
S + + Y VN
Sbjct: 926 GSDSGTKVKYEVN 938
>gi|85706636|ref|ZP_01037728.1| endonuclease [Roseovarius sp. 217]
gi|85668694|gb|EAQ23563.1| endonuclease [Roseovarius sp. 217]
Length = 955
Score = 538 bits (1387), Expect = e-151, Method: Composition-based stats.
Identities = 360/960 (37%), Positives = 533/960 (55%), Gaps = 122/960 (12%)
Query: 19 KASVTDSEINEAAIKAKMIYPD----VDAAKLKNDLLSMYSVKIDAFQILEGRERRDPWL 74
++SVT I + K ++ D VD L ++L+ S + L E WL
Sbjct: 28 QSSVTPEMIEKELTKLSIMMEDDFALVDRDALVDELIRRSSRTVGENATLSSGEDHIAWL 87
Query: 75 KDFRANKMSKWEFWMRYAEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQRNEIQISKKG 134
A + W +W RY+EY+E + + + LD TD VL +L +P R E ++G
Sbjct: 88 D---AERKKGWTYWQRYSEYMEA--RIPWTALDALDVATDEVLSQLEDPTR-EGAWDRRG 141
Query: 135 LVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYER 194
LVVG VQSGKT NYTGLICKAADAG+ +IIVLAG+HNNLRSQTQ R++EGFLG Y
Sbjct: 142 LVVGHVQSGKTGNYTGLICKAADAGYKIIIVLAGLHNNLRSQTQIRLEEGFLG----YGL 197
Query: 195 AYTMNNTTKIGVGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVP---QPILLVV 251
+ + + +GVG I + ++ T GDFT+ A+ NV +P L VV
Sbjct: 198 SNSSDELELVGVGSIDADPSVRPHTATNRRNNGDFTTAIAS----KLNVSPEHRPWLFVV 253
Query: 252 KKNASVLNRLYKWLQTQTIN--------EKITNKSLLIIDDEADNASINTNRK------- 296
KKN +VL RL W++ + IN + +TN LL+IDDE+DN S++T
Sbjct: 254 KKNKTVLERLLHWIRNRAINHVDPATGRKLVTNLPLLVIDDESDNGSVDTGEDIVDEFGN 313
Query: 297 ---ELDPTTINRNICSIISLFNRSAYVGYTATPFANIFI------PQNEDDLFPRDFIIN 347
E +P TINR I SI+ F+R AYVGYTATPFANIFI + DLFP FI +
Sbjct: 314 PDLEHEPKTINRLIRSILHHFSRKAYVGYTATPFANIFIHDRGTTQEYGPDLFPAAFITS 373
Query: 348 IPAPTNYIGPEKVFGT-SIIPDDTNSDLLPIIYPIKDYDF--FVPQGHKKDDDKPKFED- 403
+ AP+NY+GP +VFG+ S P+D LP++ + D +F ++P HK + +P ++
Sbjct: 374 LAAPSNYVGPGRVFGSASSTPED-----LPLVRSLSDDEFQPWMPPRHK-NGYRPGWQGE 427
Query: 404 --IPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSRFQMWQN----HIKELVENLFN 457
+P+SL A++ F+ CA+R RGQG++H+SMLIHV+RF QN + + V +L
Sbjct: 428 DRVPDSLAEAIRSFVYACAVRKLRGQGSKHSSMLIHVTRFTSVQNAVVNQVADYVRDLKG 487
Query: 458 YYKHEIEASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKFSDID-KNLSVHSWEEIK 516
Y IE D + RK E KT ++ +++ ++ + L +W +I+
Sbjct: 488 RYTRGIELED--LEASMRKEYE---------KTFLPGMQGIRYALVEGETLEDFAWADIR 536
Query: 517 PQLFKAVQKIEVKSINGTSGDCLTYYENEKNGISVIAIGGDKLSRGLTLEGLSVSYFLRA 576
L + I V+ INGT+ D L Y EN+ G+ VIAIGGDKL+RGLTLEGL SYFLR
Sbjct: 537 AVLPDVLSDIRVREINGTAKDALDYAENDGTGLKVIAIGGDKLARGLTLEGLCTSYFLRT 596
Query: 577 SKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEELRGEFNYLAESGGTP 636
++MYDTLMQMGRWFGYR GY+D+CRL+TS+E+ EWF HI A+EELR EF+ + +G TP
Sbjct: 597 ARMYDTLMQMGRWFGYRDGYLDVCRLYTSQEMVEWFGHIADAAEELRQEFDNMVAAGATP 656
Query: 637 ENYALKVRTHPGSLQITSVSKMRYAKQISVSWAGRLIESYQLPMDKGIKKRNLIATDNFI 696
+ + L+V++H L +TS +KMR A+ + ++++G L+++ P K N ATD FI
Sbjct: 657 KQFGLRVKSH-SVLTVTSRAKMRNARAMQLTYSGDLLQTIVFPNRKDDITANFEATDRFI 715
Query: 697 SAQG----------MPE-KKGNNYLWRNVSPDDVCDYLSKFKVANSLKKVDLDMISNYIQ 745
+A G +P+ +K N +LWR+V V +L ++ + ++ +I+++I+
Sbjct: 716 NALGPSTDLNDQHFVPQGQKWNGHLWRDVPALSVISFLRDYRTHPASFRIMSPLIADFIE 775
Query: 746 ELVKKGELTSWSVVVMNKNQPAVQYTFSNSIQAGC----FDRNRAEDTNWNTYYIRKNHI 801
E+ K ELT+W+V ++ K+ + GC R R + + + Y I+
Sbjct: 776 EMNKDRELTNWTVALIGKDSGQDD---KHRTVGGCSVNMLQRKRTTE-HADRYSIKT--- 828
Query: 802 VGNQTDEFIDLDEDLINAALERTRQR-KAELNRGWDKEYPA----PEI-------VRQEF 849
+ + D+ IDL E AA + +++ + + +R KE P+ P+I V
Sbjct: 829 LISPRDQAIDLTEAEWKAARDLSQKTWRNDTDRNEGKEPPSEPRGPQIRHILGEGVTDVH 888
Query: 850 RP--RTNPLLLIYPLNPECANVKDKHGNIQSGTISYSKTDDPFIGFVISFPSSSTNIAIS 907
P R LL++Y L+P A V + K DP + + ISFPSS++ +S
Sbjct: 889 IPARRERGLLMLYLLDPAEAKVDE------------IKDADPVVAWAISFPSSTSERRVS 936
>gi|118714150|ref|ZP_01566710.1| endonuclease [Burkholderia cenocepacia MC0-3]
gi|118648398|gb|EAV55203.1| endonuclease [Burkholderia cenocepacia MC0-3]
Length = 964
Score = 521 bits (1341), Expect = e-145, Method: Composition-based stats.
Identities = 356/985 (36%), Positives = 516/985 (52%), Gaps = 134/985 (13%)
Query: 19 KASVTDSEINEAAIKAKMIY-----PDVDAAKLKNDLLSMYSVKIDAFQILEGRERRDPW 73
K+ +T + I E +A ++ VD + + L+ +S I L+ W
Sbjct: 31 KSKITAAYIAEKVARAADMFETDATSTVDQSLAVSTLIQRFSHWIGKATTLKDDAGHIHW 90
Query: 74 LKDFRANKMSKWEFWMRYAEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQRNEIQISKK 133
L A + W +W RY +YLE K + V+ LD+ TD +L L +P R + ++
Sbjct: 91 LN---AARKKDWHYWRRYRDYLEA--KLSDRVVDGLDDATDNILALLEDPHRTDAW-DRR 144
Query: 134 GLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYE 193
GLVVG VQSGKT+NY+GLICKAADAG+ +IIVLAG HNNLRSQTQ R++EGFLG++T
Sbjct: 145 GLVVGHVQSGKTSNYSGLICKAADAGYKIIIVLAGTHNNLRSQTQMRLEEGFLGYET--- 201
Query: 194 RAYTMNNTTKIGVGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVPQ--PILLVV 251
T+N + +G+ ++ NS TT + GDF A F+ P+ P L VV
Sbjct: 202 ---TVNRDPGLPIGVAEFGEDLKTNSATTRADNGDFNKAIAKH--FHGISPEERPWLFVV 256
Query: 252 KKNASVLNRLYKWLQTQTIN------EKITNKSLLIIDDEADNASINTNRKELD------ 299
KK +VL L W+Q++ + + +T LL+IDDEADNAS++T + D
Sbjct: 257 KKQKTVLTALLNWIQSRVFDATKDGRKVVTKLPLLMIDDEADNASVDTGEQLFDEDGTPD 316
Query: 300 ----PTTINRNICSIISLFNRSAYVGYTATPFANIFI------PQNEDDLFPRDFIINIP 349
P TIN I I+ F R AYVGYTATPFANIFI + DLFPR FIIN+
Sbjct: 317 EEHQPKTINSLIRQILHAFTRKAYVGYTATPFANIFIHHKGTTTKEGPDLFPRSFIINLA 376
Query: 350 APTNYIGPEKVFGTSIIPDDTNSDLLPIIYPIKD-YD-----FFVPQGHKKDDDKPKF-- 401
AP+NY+GP ++FG + D LP+ I D YD ++P HKK P +
Sbjct: 377 APSNYVGPARLFGR--MTKDGRKGELPLSRAILDHYDPETDSGWMPPKHKKTH-VPVYNG 433
Query: 402 -EDIPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSRFQMWQNHIKELVENLFNYYK 460
E P SLR A+ F++ CA+R RGQG H+SMLIHV+RF Q+H++ VE+ +
Sbjct: 434 QEMAPPSLRKAICAFVLACAVRELRGQGAAHSSMLIHVTRFVAVQDHVRLQVEDSVRSMR 493
Query: 461 HEI----EASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKFS-DIDKNLSVHSWEEI 515
+I EA ++ + +++ E D A K S+ S D ++ ++ +W E+
Sbjct: 494 QKICRGIEAD--SLLTQMKELWELDFA--------PTSAKVSELSPDEERPAALPTWNEV 543
Query: 516 KPQLFKAVQKIEVKSINGTSGDCLTYYENEKNGISVIAIGGDKLSRGLTLEGLSVSYFLR 575
+ L ++ IEV+SINGT+ D L Y + + VIAIGGDKL+RGLTLEGL VSYF+R
Sbjct: 544 QAALPDVLEDIEVRSINGTAKDALDY-ATPGSALKVIAIGGDKLARGLTLEGLCVSYFVR 602
Query: 576 ASKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEELRGEFNYLAESGGT 635
+KMYDTLMQMGRWFGYRPGY+DLCRL+TS +L WF HI ASEELR EF+++A + T
Sbjct: 603 TTKMYDTLMQMGRWFGYRPGYLDLCRLYTSPDLVRWFGHIADASEELREEFDFMASANLT 662
Query: 636 PENYALKVRTHPGSLQITSVSKMRYAKQISVSWAGRLIESYQLPMDKGIKKRNLIATDNF 695
PE+Y LKV +H L +TS KMR A +S++++G ++ D ++ NL ATD+
Sbjct: 663 PEDYGLKVISHE-VLTVTSPLKMRNAHTLSLTYSGTRPQTILFHRDLKTQQANLSATDDL 721
Query: 696 ISAQGMPEKKGNNY-------------LWRNVSPDDVCDYLSKFKVANSLKKVDLDMISN 742
I++ G P G + LW V V +L + + +++
Sbjct: 722 IASLGQPNVHGQKFERDGKADSWPRSRLWTGVDVSKVLAFLGAYATHPNATSAKAPVLAE 781
Query: 743 YIQELVKKGELTSWSVVVMNK-NQPAVQYTFSNSIQAGCFDRNRAEDTNWNTYYIRKNHI 801
+I+++ + G+L W+V ++ + + + + F I+ F +D N
Sbjct: 782 FIRKMNEIGQLDQWNVALLAEGSDESNPHEFPGGIRIESFPMRTPDDHGSLDQRDLANFA 841
Query: 802 VG---NQTDEFIDLDEDLINAALERTRQRKAELNRGWDKEYPAPEIVRQEFRPRTNPLLL 858
+G + DE IDLD+D+ ALE T+ W K+ PA R R +
Sbjct: 842 IGVLTDPADEGIDLDDDVWREALEITQA-------AW-KQDPA--------RGR-----V 880
Query: 859 IYPLNPECANVKDKHGNIQSGTISYSKT--------------------DDPFIGFVISFP 898
P P ++ G + G + + P + F I+FP
Sbjct: 881 TMPTVPSGKGMRMARGKL-GGAVDRGLLLLYPLAPYAGKPKSQIVPGWNKPIMAFAIAFP 939
Query: 899 SSSTNIAISYAVN---QAAEFAKTE 920
+S + I++ Y VN E+ TE
Sbjct: 940 ASDSGISVEYEVNVLYWTQEYGATE 964
>gi|94264147|ref|ZP_01287944.1| endonuclease [delta proteobacterium MLMS-1]
gi|93455405|gb|EAT05603.1| endonuclease [delta proteobacterium MLMS-1]
Length = 884
Score = 473 bits (1218), Expect = e-131, Method: Composition-based stats.
Identities = 317/852 (37%), Positives = 463/852 (54%), Gaps = 107/852 (12%)
Query: 84 KWEFWMRYAEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQRNEIQISKKGLVVGQVQSG 143
+W + Y +YL K +++ P+++ L ++ R+L L +P +E S++G+V+G VQSG
Sbjct: 89 QWTYSDAYEKYL-KTQRWHPTLVHSLSDVGSRILGHLQDPC-SEGSWSRRGMVIGHVQSG 146
Query: 144 KTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYERAYTMNNTTK 203
KTANY L+ KAADAG+ LIIV+AGIHNNLR QTQ RIDEGF+G R+ NN
Sbjct: 147 KTANYISLVTKAADAGYKLIIVIAGIHNNLRKQTQERIDEGFVG------RSSDPNNRVP 200
Query: 204 IGVGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNF-NVPQPILLVVKKNASVLNRLY 262
IGVGL+ D+ + T + DF + AN G + +PI+LV+KKN SVLN LY
Sbjct: 201 IGVGLL---DSDYPHPTTFTTIHADFNKQIANQTGLGIRDRGRPIILVIKKNVSVLNALY 257
Query: 263 KWLQTQTINE--KITNKSLLIIDDEADNASINTNRKELDPTTINRNICSIISLFNRSAYV 320
+WL+ + +I++ +L+IDDEADNASINTN+ +LDPT NR I I+ LF +S Y+
Sbjct: 258 RWLKELNVQRDGRISDVPMLLIDDEADNASINTNKPDLDPTATNRMIRQILGLFAKSCYI 317
Query: 321 GYTATPFANIFI-PQNED-----DLFPRDFIINIPAPTNYIGPEKVFGTSIIPDDTNSDL 374
GYTATPFANIFI P+N D DLFPRDFI + APT Y GPEKVF DD S++
Sbjct: 318 GYTATPFANIFINPENYDPDVYEDLFPRDFIYCLDAPTTYFGPEKVF-----LDDATSEI 372
Query: 375 LPIIYPIKDYDFFVPQGHKKDDDKPKFEDIPESLRIAVKCFIVTCAIRIARGQGTRHNSM 434
I D + ++P HKKD ++P SL A+ F++ A+R RG +H SM
Sbjct: 373 F--TEAITDCEDYIPWSHKKD---YPVTELPPSLYRALHQFVIARAVRNLRGHSNKHCSM 427
Query: 435 LIHVSRFQMWQNHIKELVENLFNYYKHEIEASDTAIYEEFRKILEDDTANYKSYKTITNE 494
+I++SRF Q ++ L+ Y+ ++ + A Y + ++ + Y
Sbjct: 428 MINISRFVAVQKTVRVLISQ----YEKKMREAVQAYYA-----MPEEISRKNQYVAALKN 478
Query: 495 IKESKFSDIDKNLSVHSWEEIKPQLFKAVQKIEVKSINGTSGDCLTY--YENEKNGISVI 552
+ E FSD H+W E++ L+ + + +N + + L Y YE E +G++ I
Sbjct: 479 VFEESFSDCG-----HTWPEVQRALWGVFDSLRLYVVNSQTDEALDYRKYEREGHGLTAI 533
Query: 553 AIGGDKLSRGLTLEGLSVSYFLRASKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWF 612
A+GG LSRGLT+EGL VSY R ++MYDTLMQMGRWFGYRP Y DLCR+ + W+
Sbjct: 534 AVGGLSLSRGLTIEGLCVSYMYRNTRMYDTLMQMGRWFGYRPDYEDLCRVHLPPDSINWY 593
Query: 613 RHITIASEELRGEFNYLAESGGTPENYALKVRTHPGSLQITSVSKMRYAKQISVS--WAG 670
HI ASEELR + + G +P + L VR+HP SL IT+ +KMR+A+ ++V ++
Sbjct: 594 AHIAEASEELRQQIKRMRRDGLSPRQFGLYVRSHPDSLLITAPNKMRHAQSVTVKVDFSD 653
Query: 671 RLIESYQLPMDKGIKKRN--LIAT---DNFISAQGMPEKKGNNYLWRNVSPDDVCDYLSK 725
++ ES+ L + I ++N LIAT + F A +K G +++R+VS D + D+L
Sbjct: 654 KIKESHALHTNPEINRKNEELIATFWRNGFGGAN--EDKTGKGWIFRDVSVDVIADFLRG 711
Query: 726 FKVANSLKKVDLDMISNYIQELVKKGE-LTSWSVVVMNKNQPAVQYTFSNSIQAGCFDRN 784
F + DM S I L+ E + V++++KN G +R+
Sbjct: 712 FCAHQDFE----DMKSAVINYLLSISENHPTGDVLLISKNSGQ-----ETKYHLGAQERS 762
Query: 785 RA--EDTNWNTYYIRKNHIVGNQTDEFIDLDEDLINAALERTRQRKAEL--NRGWDKEYP 840
+ W +R V ++ DE + L E E+ RQ +A++ R +D Y
Sbjct: 763 SEGWAGSKWTLKKLR----VASRGDEKLGLTE-------EQKRQAEADVATGRPFDTHYR 811
Query: 841 APEIVRQEFRPRTNPLLLIYPLNPECANVKDKHGNIQSGTISYSKTDDPFIGFVISFPSS 900
R R PLL+I+ L P K+K G P G +SFP
Sbjct: 812 ---------RVRNKPLLMIHILEP-----KEKPG-----------VRVPAFG--VSFPPG 844
Query: 901 STNIAISYAVNQ 912
+ + VN+
Sbjct: 845 DNDTTVEIVVNK 856
>gi|152994001|ref|YP_001359722.1| endonuclease [Sulfurovum sp. NBC37-1]
gi|151425862|dbj|BAF73365.1| endonuclease [Sulfurovum sp. NBC37-1]
Length = 923
Score = 465 bits (1197), Expect = e-129, Method: Composition-based stats.
Identities = 308/816 (37%), Positives = 462/816 (56%), Gaps = 68/816 (8%)
Query: 68 ERRDPWLKDFRANKMSKWEFWMRYAEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQRNE 127
E +PWL + ++++ + FW RY E L FAP V+ +D +T++++ +L +P + E
Sbjct: 74 ENFEPWLHNKKSSENFEPYFWNRY-ETLLLQNGFAPPVVSSIDNVTNKIVARLEDPDK-E 131
Query: 128 IQISKKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLG 187
++G+VVG VQSGKTAN+ G++ KA+D G+ LI++LAG+ LRSQTQ R+DEG++G
Sbjct: 132 GPWDRRGMVVGHVQSGKTANFIGVVNKASDVGYELIVILAGMQETLRSQTQERVDEGYIG 191
Query: 188 FDTQYERAYTMNNTTKIGVGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVPQ-- 245
D+ + + + IGVG + + I S+TT + DF R+ NF V +
Sbjct: 192 QDSSKKNSVDFAESL-IGVGNLN--HDKIPYSFTT--KHKDFDDRS------NFVVKKYD 240
Query: 246 -PILLVVKKNASVLNRLYKWLQ--TQTINEKITNKSLLIIDDEADNASINTNRKELDPTT 302
+LVVKKNA VL +L WL+ +I+ I+N LL+IDDEAD+AS+NTN + DPTT
Sbjct: 241 DATVLVVKKNARVLEKLRDWLKKNNSSIDGTISNLPLLLIDDEADHASVNTNDSDKDPTT 300
Query: 303 INRNICSIISLFNRSAYVGYTATPFANIFI-PQNEDD-----LFPRDFIINIPAPTNYIG 356
IN+ I I++LF+R Y+ YTATPFANIFI P +EDD LFP+DFII++ P+NY+G
Sbjct: 301 INKRIREILNLFHRKCYLAYTATPFANIFIDPSSEDDMYKDNLFPKDFIISLDPPSNYVG 360
Query: 357 PEKVFGTSIIPDDTNSDLLPIIYPIKDYDFFVPQGHKKDDDKPKFEDIPESLRIAVKCFI 416
EK+FG DD D I+ I D + +P HKKD +PESL+ AV I
Sbjct: 361 SEKIFGN----DDEAID---IVRDIGDIEDIIPLKHKKDQ---TISCLPESLKKAVNSHI 410
Query: 417 VTCAIRIARGQGTRHNSMLIHVSRFQMWQNHIKELVENLFNYYKHEIEASDTAIYEEFRK 476
++ AIRI RGQ +H+SML+H+SR++ Q + +L+ ++ +IE Y+
Sbjct: 411 ISKAIRILRGQSNKHHSMLVHLSRYKDVQKEVFDLIVQ----FRQDIEKGVRYYYK---- 462
Query: 477 ILEDDTANYKSYKTITNEIKESKFSDIDKNLSVHSWEEIKPQLFKAVQKIEVKSINGTSG 536
L+ + A Y + E+ F+ + W +I+ +L +A ++V ING S
Sbjct: 463 -LQKEEALNNPYMADLYSMWENDFAAM-----YSEWSDIQEKLLEAAASMQVMLINGDSP 516
Query: 537 DCLTYYENEKNGISVIAIGGDKLSRGLTLEGLSVSYFLRASKMYDTLMQMGRWFGYRPGY 596
D L Y + KNG++VIA+GGDKLSRGLTLEGL+ SYF R++ MYDTLMQMGRWFGYR G+
Sbjct: 517 DSLDY-KAYKNGLNVIAVGGDKLSRGLTLEGLATSYFYRSTSMYDTLMQMGRWFGYRDGF 575
Query: 597 VDLCRLFTSEELNEWFRHITIASEELRGEFNYLAESGGTPENYALKVRTHPGSLQITSVS 656
DLCR++ + +F HI+ A EELR E + + TP+ + LKVRTHPGSL IT+ +
Sbjct: 576 ADLCRIYLTPMTESYFAHISNAVEELREELKEMFAANLTPKEFGLKVRTHPGSLLITARN 635
Query: 657 KMRYAKQI--SVSWAGRLIESYQLPMDKGIKKRNLIATDNF------ISAQGMPEKKGNN 708
KMR ++ + +GRL+E+Y + + +K NL + I E NN
Sbjct: 636 KMRASETLLHRTDLSGRLVETYVVDLKPEHRKNNLDLFNEISEKLQKIEKAEYSEASSNN 695
Query: 709 YLWRNVSPDDVCDYLSKFKVANSLKKVDLDMISNYIQELVKKGELTSWSVVVMNKNQPAV 768
Y+W+ VS + V +++ +F + I +Y++ + W +V ++
Sbjct: 696 YVWKYVSANIVEEFVKRFDNHPRAQMTQTKPILSYLERASE--HFPKWDIVFISSRSSKE 753
Query: 769 QYTFSN-SIQAG-----CFDRNRAEDTNWNTYYIRKNHIVGNQTDEFIDLDEDLINAALE 822
+Y+ +N I G C + + E N+ R+ G E + ED I ALE
Sbjct: 754 RYSAANGDIDIGIQKRTCNPKGQYEKDNYIEISTRRRVASGTNDAERAGMAEDEIAEALE 813
Query: 823 ---RTRQRKAELNRGWDKEYPAPEIVRQEFRPRTNP 855
R RQ KA+ + A EI+ ++ + + P
Sbjct: 814 RWKRERQGKADKEKENGNSEKAKEILEKDLKISSIP 849
>gi|56479068|ref|YP_160657.1| endonuclease [Azoarcus sp. EbN1]
gi|56315111|emb|CAI09756.1| Endonuclease [Azoarcus sp. EbN1]
Length = 860
Score = 453 bits (1165), Expect = e-125, Method: Composition-based stats.
Identities = 316/890 (35%), Positives = 482/890 (54%), Gaps = 89/890 (10%)
Query: 54 YSVKIDAFQILEGRERRDPWLKDFRANKMSKWEFWMRYAEYLEKHKKFAPSVILQLDELT 113
Y + + +++ + R PWL D R N +W RY L K SVI DE+T
Sbjct: 6 YGISMGLGAVVDAEDFR-PWLHDARINGEIGDFYWSRYRRLLNL-KGLPKSVIDATDEVT 63
Query: 114 DRVLDKLFNPQRNEIQISKKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNL 173
DRVLD+L +P +N S++G+VVG VQSGKTANYTGLICKAADAG+ LI+V+AGIHNNL
Sbjct: 64 DRVLDRLGDP-KNMTPWSRRGMVVGHVQSGKTANYTGLICKAADAGYRLIVVIAGIHNNL 122
Query: 174 RSQTQNRIDEGFLGFDT-QYERAYTMNNTTKIGVGLIPGFDN-AIANSYTTSLEKGDFTS 231
R+QTQ RIDEGF+G DT + A IGVG FD S T +L + +
Sbjct: 123 RNQTQARIDEGFIGRDTGRLAHANKAQRQKIIGVGT---FDQREFPVSLTNTLRDFNKAT 179
Query: 232 RAANTAGF-NFNVPQPILLVVKKNASVLNRLYKWLQTQTINE--KITNKSLLIIDDEADN 288
NT+ +NVP ++LV+KKN+S L L +WL+ ++++ ++ ++ +L+IDDEADN
Sbjct: 180 ATTNTSQIGQYNVP--VVLVIKKNSSTLKNLLEWLKEHSVHQGTQMVSQPMLLIDDEADN 237
Query: 289 ASINTNRKELDPTTINRNICSIISLFNRSAYVGYTATPFANIFI-PQNED-----DLFPR 342
ASINT + T IN I ++SLF+RS YVGYTATPFANIFI P +D DLFPR
Sbjct: 238 ASINTAYSRDEVTRINGQIRELLSLFHRSCYVGYTATPFANIFIDPDTDDEALKQDLFPR 297
Query: 343 DFIINIPAPTNYIGPEKVF----GTSIIPDDTNSDLLPIIYPIKDYDFFVPQGHKKDDDK 398
FII + AP+NY G +KVF + D N D+LP+ + I D
Sbjct: 298 HFIIGLDAPSNYFGAQKVFLDARDRHVRLIDDNEDILPMKHKI---------------DH 342
Query: 399 PKFEDIPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSRFQMWQNHIKELVENLFNY 458
P + +P+SL AV+ FIV AIR ARGQ H SML++ SRF Q ++ V ++ ++
Sbjct: 343 P-VDVLPDSLVRAVRAFIVARAIRNARGQQASHASMLVNASRFTDVQGRLRSKVADVVSH 401
Query: 459 YKHEIEASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKFSDIDKNLSVHSWEEIKPQ 518
+ + A D + +L++ + + E++++D + W E++ +
Sbjct: 402 IRDAV-AVDGG--KGRAALLQNPEI------AALHAVWEAEYADAEGA----GWSEVQAR 448
Query: 519 LFKAVQKIEVKSINGTS-GDCLTYYENEKNGISVIAIGGDKLSRGLTLEGLSVSYFLRAS 577
L + + V +N + L Y + ++G++VIA+GG LSRGLTLEGL++SYFLR S
Sbjct: 449 LHEVLVAARVVEVNASKRSQPLDYEQGGEHGVTVIAVGGFSLSRGLTLEGLTISYFLRNS 508
Query: 578 KMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEELRGEFNYLAESGGTPE 637
MYDTLMQMGRWFGYRPGY DLCR++ + W+ HI A ++L+ + + + TPE
Sbjct: 509 MMYDTLMQMGRWFGYRPGYEDLCRVWIPADGVGWYAHIHEAMDDLQAQLKRMELAKATPE 568
Query: 638 NYALKVRTHPGSLQITSVSKMRYAKQ--ISVSWAGRLIESYQLPMDKGIKKRNLIATDNF 695
+ L VR+HP SL +T+ +KM K+ + V A LIE+ ++ +D + N+ A +
Sbjct: 569 QFGLTVRSHPESLIVTARNKMGSGKELPVKVGLAENLIETTRIVLDSPQLEANIRAGEGL 628
Query: 696 ISAQ----GMPEKKGNNYLWRNVSPDDVCDYLSKFKVANSLKKV----DLDMISNYIQEL 747
++A + + YL + V + + D+L +++ ++ V D +I++YI E
Sbjct: 629 LAAAVKAGVLVVSRPRGYLLKGVRVNSIMDFLREYRTELGVRPVNPLTDPKLINDYI-EA 687
Query: 748 VKKGELTSWSVVVMNKNQPAVQYTFSNSIQAGCFDRNRAEDTNWNTY--YIRKNHIVGNQ 805
EL W + + + + ++ + ++ + D + + + +G+
Sbjct: 688 RAASELAEWDIFIASSTRKDMKPLPFAGLDIVPYELSVDADLAKSGVLAFSGASRRIGSA 747
Query: 806 TDEFIDLDEDLINAALERTRQRKAELNRGWDKEYPA---PEIVRQEFRPRTNPLLLIYPL 862
DE L+E I A E+ R L+R DK+ P P I RQ +PLL++ +
Sbjct: 748 EDESEGLEEQQITDAKEKFR-----LSRD-DKKLPKTVNPRIYRQV--SGRHPLLILRLV 799
Query: 863 NPECANVKDKHGNIQSGTISYSKTDDPFIGFVISFPSSSTNIA-ISYAVN 911
P+ + +++ G +++ +SFPSS + Y VN
Sbjct: 800 KPKLDATRSAGEDVE-GVLAWG----------LSFPSSQIGGGTVEYVVN 838
>gi|94971293|ref|YP_593341.1| endonuclease [Acidobacteria bacterium Ellin345]
gi|94553343|gb|ABF43267.1| endonuclease [Acidobacteria bacterium Ellin345]
Length = 899
Score = 451 bits (1161), Expect = e-124, Method: Composition-based stats.
Identities = 315/859 (36%), Positives = 454/859 (52%), Gaps = 69/859 (8%)
Query: 24 DSEINEAAIKAKMIYP--DVDAAKLKNDLLSMYSVKIDAFQILEGRERRDPWLKDFRANK 81
+ I E A + + +P D D A L L + ++ +D +L G E PWL +A
Sbjct: 25 EEAILELATRLRQAFPLSDDDFASLIKRLHAKLAITMDTGVVLLGEEEHTPWLSSRKA-- 82
Query: 82 MSKWEFWMRYAEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQRNEIQISKKGLVVGQVQ 141
+ +W R+ + L++ K + P V+ L+ +TD +LD L +P + ++GLV+G VQ
Sbjct: 83 VIDPFYWQRFLQLLQR-KDWPPKVLSTLNSVTDNILDLLGDPAKPG-SWKRRGLVIGDVQ 140
Query: 142 SGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYERAYTMNNT 201
SGKTA YT L CKA DAG+ LI++L G +LR QTQ R+DEGF+GFD+ NN
Sbjct: 141 SGKTATYTALSCKAGDAGYRLIVLLTGTLESLRRQTQERLDEGFVGFDSSGILRKIRNNR 200
Query: 202 TKIGVGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFN-VPQPILLVVKKNASVLNR 260
+GVG + D + TS ++ DF+ N+ G N + +P+L+VVKKN +L
Sbjct: 201 A-VGVGTL---DARRSAGVFTSRDR-DFSKTLVNSLGIRINSIKEPVLVVVKKNRKILEN 255
Query: 261 LYKWL-QTQTINEKITNKSLLIIDDEADNASINTNRKELDPTTINRNICSIISLFNRSAY 319
L KWL + ++ + LL+IDDEAD+AS+NTN DPT IN+ I ++++LF RS+Y
Sbjct: 256 LEKWLTEYNAGDDGKIDVPLLLIDDEADSASVNTNPLSTDPTEINKRIRALLALFKRSSY 315
Query: 320 VGYTATPFANIFI-PQNE-----DDLFPRDFIINIPAPTNYIGPEKVFGTSIIPDDTNSD 373
+G+TATPFANIFI P +E DDLFPRDFI + PTNY+GP +FG P D
Sbjct: 316 IGFTATPFANIFINPDSENDMLGDDLFPRDFIYTLDPPTNYVGPVVMFGDE--PRDG--- 370
Query: 374 LLPIIYPIKDYDFFVPQGHKKDDDKPKFEDIPESLRIAVKCFIVTCAIRIARGQGTRHNS 433
I+ PI D + P HK D+P+SLR AV F++ IR RG H S
Sbjct: 371 ---ILEPISDAESVFPSRHKC---SWPINDLPQSLRDAVTSFVIANTIRDLRGDSATHRS 424
Query: 434 MLIHVSRFQMWQNHIKELVENLFNYYKHEIEASDTAIYEEFRKILEDDTANYKSYKTITN 493
ML++VSRF Q+ + L+ + N I ++ R + D A KTI+
Sbjct: 425 MLVNVSRFTAVQDQVAVLINSDLN-----------RIQQDIRNYSQLDPAIALRNKTIS- 472
Query: 494 EIKESKFSDIDKNLSVHSWEEIKPQLFKAVQKIEVKSINGTSGDC-LTYYENEKNGISVI 552
EI + S N +WE ++ L + I VK++N +G L Y N +NG+ VI
Sbjct: 473 EIHQVWRSSY--NTKEFAWEGVQRALLASALPIVVKAVNQRTGAASLDYASNRENGLRVI 530
Query: 553 AIGGDKLSRGLTLEGLSVSYFLRASKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWF 612
AIGG+ LSRGLTLEGLS SYF R S+MYDTL+QMGRWFGYR Y DLC+++ SE+ +W+
Sbjct: 531 AIGGNSLSRGLTLEGLSTSYFYRNSQMYDTLLQMGRWFGYRDNYSDLCKVWLSEDAIQWY 590
Query: 613 RHITIASEELRGEFNYLAESGGTPENYALKVRTHPGSLQITSVSKMRYAKQIS--VSWAG 670
HIT A+EELR E + TP + LKVR HP SL +T+ +KMR A I +S +
Sbjct: 591 SHITAATEELRFEVKRMRRMNATPREFGLKVRAHPDSLIVTAQNKMRLAHTIERVISIST 650
Query: 671 RLIESYQLPMDKGIKKRNLIATDNFI-----SAQGMPEKKGNNYLWRNVSPDDVCDYLSK 725
IES +L + I N N I + + NN +WR V + V +
Sbjct: 651 EAIESTRLKSSRVIISANKQVVANAIANFERAGIACESSEWNNPIWREVPKELVSALIRN 710
Query: 726 FKVANSLKKVDLDMISNYIQELVKKGELTSWSVVVMNKNQPAVQYTFSNSIQAGCFDRNR 785
F+V + +++Y + +L W VV+ N +P + + + A F R
Sbjct: 711 FEVHPLNVAFQSEDLADYFTNTTEP-KLQKWDVVLPNGGEPEIIFVRTRVRPAKRFVLPR 769
Query: 786 AEDTNWNTYYIRKNHIVGNQTDEFIDLDEDLINAALERTRQRKAELNRGWDKEYPAPEIV 845
+N VG++ E L ++ ++ + K ++ D +
Sbjct: 770 DN----GILVSGRNMRVGSRGIEREGLPSGIVREINDQAKLTKKNVS---DHAF------ 816
Query: 846 RQEFRPRTNPLLLIYPLNP 864
+E RPR PLLLI+ L P
Sbjct: 817 -RERRPR--PLLLIHVLAP 832
>gi|86360839|ref|YP_472726.1| hypothetical protein RHE_PF00106 [Rhizobium etli CFN 42]
gi|86284941|gb|ABC93999.1| hypothetical conserved protein [Rhizobium etli CFN 42]
Length = 914
Score = 451 bits (1160), Expect = e-124, Method: Composition-based stats.
Identities = 317/891 (35%), Positives = 458/891 (51%), Gaps = 83/891 (9%)
Query: 49 DLLSMYSVKIDAFQILEGRERRDPWLKDFRANKMSKWEFWMRYAEYLEKHKKFAPSVILQ 108
DL +++ V L G + PW + + F+ RY + L + +AP +
Sbjct: 59 DLEALFVVAQGPSIRLLGETQPPPW---YLGERRRPGAFFARYLQKLGE-SGWAPRALEA 114
Query: 109 LDELTDRVLDKLFNPQRNEIQISKKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAG 168
L+E T VL+ L +P R ++ + +GLVVG VQSGKTA+Y G+I +AADAG+ +IIVLAG
Sbjct: 115 LEESTADVLELLDDPNR-DVPWNWRGLVVGNVQSGKTAHYAGVINRAADAGYRVIIVLAG 173
Query: 169 IHNNLRSQTQNRIDEGFLGFDTQYERAYTMNNTTKIGVGLIPGFDNAIANSYTTSLEKGD 228
+H LR QTQ R+D+ FLGFDT +GVGL+PG + +S TTS GD
Sbjct: 174 MHKILRRQTQLRLDQDFLGFDTA--GVAGSGGRRPVGVGLLPG-PVQLVDSLTTSQLNGD 230
Query: 229 FTSRAANTAGFNFNVPQPILLVVKKNASVLNRLYKWLQTQTINEKITNKSLLIIDDEADN 288
F A + F +P + VVKKN +VL + +W+ + E+ N LL+IDDEAD
Sbjct: 231 FNRTVAQNSNFA-PANRPFVFVVKKNGAVLKNINRWIAR--LPEESRNAPLLVIDDEADQ 287
Query: 289 ASINTNRK----------ELDPTTINRNICSIISLFNRSAYVGYTATPFANIFIPQN--- 335
AS +T + + DP IN I ++S F RS YVGYTATPFANI I
Sbjct: 288 ASPDTGDQGFLPDGSFDEDYDPKRINGEIRKLLSGFRRSVYVGYTATPFANIMIHDERAA 347
Query: 336 ED---DLFPRDFIINIPAPTNYIGPEKVFGTSIIPDDTNSDLLPIIYPI-KDYDFFVPQG 391
ED DLFP FI+++ AP +Y GP VFG DD ++ LP+I + + + ++P
Sbjct: 348 EDYGADLFPSTFIVSLSAPDDYFGPLAVFGRD---DDADAVGLPVIRHLDQTAESWIPDP 404
Query: 392 HKKDDDKPKFED---IPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSRFQMWQNHI 448
H K +P + +P SL A++ F++ CA R ARGQ H +ML+HVSRFQ + +
Sbjct: 405 HDKTW-RPTYRGEARVPPSLLEAIRSFVICCAARAARGQANAHKTMLVHVSRFQDVHDPV 463
Query: 449 KELVENLFNYYKHEIEASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKFSDIDKNLS 508
VE ++ I A+D E ++ +D ++ T + S F+ +++
Sbjct: 464 HAQVEGALRSIRNAIAAADPFEMAELERLWHED------FEPTTEAMASSVFT---RSIR 514
Query: 509 VHSWEEIKPQLFKAVQKIEVKSINGTSGDCLTYYENEKNG--ISVIAIGGDKLSRGLTLE 566
W+E+ QL +I+V NG S + Y + G +S I IGGDKLSRGLTLE
Sbjct: 515 RIRWDEVALQLGDEADRIQVVVSNGRSRTGIDYDAADAAGQSLSAIVIGGDKLSRGLTLE 574
Query: 567 GLSVSYFLRASKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEELRGEF 626
GLS SYFLR S+ YD+L+QMGRWFGYR GY DLCRL+T+ ++ WFRH+ +E+LR +
Sbjct: 575 GLSTSYFLRVSRQYDSLLQMGRWFGYRRGYADLCRLYTTTDMETWFRHLATVNEDLRAQL 634
Query: 627 NYLAESGGTPENYALKVRTHPGSLQITSVSKMRYAKQISVSWAGRLIESYQLPMDKGIKK 686
++ +GGTP+ Y L + H + +T+ +K R+A V++AG L D
Sbjct: 635 AHMRVTGGTPKLYGLSIADH-SIMNVTAANKRRHAVLRPVTYAGEGKIQTVLYRDIDTTI 693
Query: 687 RNLIATDNFISAQGMPE----KKGN-----NYLWRNVSPDDVCDYLSKFKVANSLKKVDL 737
N A + + G PE + G+ ++WRNV + V L VD
Sbjct: 694 ANAAAVNGLLEELGEPEIDPARPGDRPPAAGFIWRNVRGNRVAALLEALAFPPESSDVDG 753
Query: 738 DMISNYIQELVKKGELTSWSVVVMNKNQPAVQYTFSN--SIQAGCFDRNRAEDTNWNTYY 795
++ YI + GEL+ W+V V VQ + S++ R+ +
Sbjct: 754 KRMAAYITTQLHLGELSDWTVFVPAGVGATVQVAGRDLRSVRRSPIVRST------TARF 807
Query: 796 IRKNHIVGNQTDEFIDLDEDLINAALERTRQRKAELNRGWDKEYPA-PEIVRQEFRPRTN 854
I K+ + + DE IDL +D A E+T Q A+ G + PA P I R
Sbjct: 808 ITKS--ILSPLDEAIDLSDDQYRRAQEKTDQVLAD-EGGPPADRPAGPWIRRVRGEDPRR 864
Query: 855 PLLLIYPLNPECANVKDKHGNIQSGTISYSKTDDPFIGFVISFPSSSTNIA 905
LLL+YP++P A V + G P G V+SFP+S T A
Sbjct: 865 GLLLLYPIDP--AGVSPEAG-------------IPLWGVVVSFPTSPTASA 900
>gi|153941166|ref|YP_001391388.1| hypothetical protein CLI_2133 [Clostridium botulinum F str. Langeland]
gi|152937062|gb|ABS42560.1| conserved hypothetical protein [Clostridium botulinum F str. Langeland]
Length = 927
Score = 424 bits (1089), Expect = e-116, Method: Composition-based stats.
Identities = 298/829 (35%), Positives = 457/829 (55%), Gaps = 78/829 (9%)
Query: 5 YQVAIEICQSIIGRKASVTDSEINEAAIKAKMIYPDV--DAAKLKNDLLSMYSVKIDAFQ 62
Y+ A E + + + A DS I +A + I+ V +A L+ L S+ + Q
Sbjct: 4 YKSAYEFYEGKLDKNADDIDSAIRDAVDQTMFIFSSVVLNADNLRKYLDEQISMIREQQQ 63
Query: 63 --ILEGRERRDP-WLKDFRANKMSKWEFWMRYAEYLEKHKKFAPSVILQ-LDELTDRVLD 118
+ G E R+ W +F K + E+W RY +YL ++K + S I + +D T++V++
Sbjct: 64 PVFMTGLEVRNSVWWDEFI--KENSTEYWNRYEKYLLENKGWPRSSIDKSIDNTTNKVMN 121
Query: 119 KLFNPQRNEIQISKKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQ 178
+ +P + ++ I +KG+V+G VQSGKTA+Y G+I KA DAG+ +IIVLAG+HNNLRSQTQ
Sbjct: 122 AIADPNK-KMAIERKGMVIGYVQSGKTAHYIGVINKAIDAGYKIIIVLAGMHNNLRSQTQ 180
Query: 179 NRIDEGFLGFDTQYERAYTMNNTTK----IGVGLIPGFDNAIANSYTTSLEKGDFTSRAA 234
+RIDE LGF+T + Y +N K IGVG N + + T+ EKGDF
Sbjct: 181 SRIDEEILGFETSLD--YLLNQMKKEANIIGVGKKLVVKND-SQTLTSRDEKGDFNRDRQ 237
Query: 235 NTAGFNFNVPQ-PILLVVKKNASVLN---RLYKWLQTQTINEKI------TNKSLLIIDD 284
F P+ P + VVKKN+ VL R +K + ++++ N LL+IDD
Sbjct: 238 QI----FVSPEIPTIFVVKKNSKVLEALIRFFKNNKKHSVDKDTGRKYMDANYPLLLIDD 293
Query: 285 EADNASINTNRKELDPTTINRNICSIIS--------LFNRSAYVGYTATPFANIFIPQNE 336
EAD ASINT + D N++ S I+ +F+ +Y+GYTATP+ANIFIP
Sbjct: 294 EADQASINTKYEYKDGVIKNKDKLSRINAQTRDLFHIFDCRSYIGYTATPYANIFIPSEV 353
Query: 337 D------DLFPRDFIINIPAPTNYIGPEKVFGTSIIPDDTNSDLLPIIYPIK-DYDFFVP 389
+ DLFP FI+ +P P NYIG + FG D +P+ I + FV
Sbjct: 354 ESKKYGKDLFPTHFILTLPKPPNYIGAHEYFGNESQLD------MPLRRQISINSLLFVD 407
Query: 390 QGHKKDDDKPKFEDIPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSRFQMWQNHIK 449
Q +K +++ + L+ A+K F+++ AIRI RGQ N+MLIHV+R Q +
Sbjct: 408 QKNKTVK-----KELLDDLKRAIKSFLISTAIRICRGQNGEPNTMLIHVTRLTDTQRLVH 462
Query: 450 ELVENLFNYYKHEIEASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKFSDIDKNLSV 509
+ V + + ++I + + I ++D Y+ + E DI
Sbjct: 463 KRVAHYYENIGNKIIDGHLETISDLKTIWQED---YEQTTQNMRRLHERFMKDIPDT--- 516
Query: 510 HSWEEIKPQLFKAVQKIEVK--SINGTSGDCLTYYENEKNGISVIAIGGDKLSRGLTLEG 567
+WEE+ ++ ++ +VK SING S D L Y E++ +VIAIGGDKLSRGLTLEG
Sbjct: 517 -TWEEVFQKICNLIENDQVKIYSINGKSKDALLYKEHKGKQYNVIAIGGDKLSRGLTLEG 575
Query: 568 LSVSYFLRASKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEELRGEFN 627
L+VSYF R SKMYDTLMQMGRWFG+RPGY DLCRLF ++L WFRHI+ A+++LR +
Sbjct: 576 LTVSYFTRESKMYDTLMQMGRWFGFRPGYADLCRLFVQDDLYAWFRHISFATDDLREQIE 635
Query: 628 YLAESGGTPENYALKVRTHPGSLQITSVSKMRYAKQISVSWAGRLIESYQLPMDKGIKKR 687
Y+ TPEN+ L+V THP +++I+ +K++ ++ ++++ ++ + ++ +
Sbjct: 636 YMNNIEETPENFGLRVATHP-NMKISGANKVQSGEERRITFSNVFSQTRSMDINAEKYNK 694
Query: 688 NLIATDNFISAQGMP-----EKKG-----NNYL-WRNVSPDDVCDYLSKFKVANSLKKVD 736
N A DN + G P EK G NN+L W+NV + D+L ++ + K +
Sbjct: 695 NFEAVDNLLKLCGKPLENYWEKIGRGNNRNNHLFWQNVDGHYIKDFLLNYETSRQANKAN 754
Query: 737 LDMISNYIQELVKKGELTSWSVVVMNKNQPAVQYTFSN-SIQAGCFDRN 784
+++YI++ +K G L W+V ++N + FSN S+ +G +N
Sbjct: 755 SKYMADYIEDQMKSGGLLRWTVCLLNLGDENNKIEFSNVSVASGMMRKN 803
>gi|115525254|ref|YP_782165.1| endonuclease [Rhodopseudomonas palustris BisA53]
gi|115519201|gb|ABJ07185.1| endonuclease [Rhodopseudomonas palustris BisA53]
Length = 891
Score = 414 bits (1063), Expect = e-113, Method: Composition-based stats.
Identities = 294/824 (35%), Positives = 437/824 (53%), Gaps = 90/824 (10%)
Query: 74 LKDFRANKMSKW----EFWMRYAEYLE---KHKKFAPSVILQLDELTDRVLDKLFNPQRN 126
L D A+ +W E YA+ E + + P+V+ L ++ R+L L +P +
Sbjct: 70 LVDDEADHDDEWVYKRELQTTYADAYEAFLRQDGWHPTVVRSLSDVCTRILGHLQDPTSD 129
Query: 127 EIQISKKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFL 186
+++GLV+G VQSGKTANYTG+I KAADAG+ I+V AGIH+NLR QTQ RIDE F+
Sbjct: 130 G-SWNRRGLVIGHVQSGKTANYTGVIAKAADAGYKFIVVSAGIHSNLRRQTQERIDEAFI 188
Query: 187 GFDTQYERAYTMNNTTKIGVGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFN-VPQ 245
G R+ N IGVGL D+ + T++ DF + A+ AG+ N +
Sbjct: 189 G------RSSDPENRMPIGVGL--NRDDYPHPATLTTIHS-DFNRKTADDAGWKINDFSK 239
Query: 246 PILLVVKKNASVLNRLYKWLQTQTINE--KITNKSLLIIDDEADNASINTNRKELDPTTI 303
PI++++KKN L+ L+KWL+ + +I + +L IDDEADNASINTN+++++PT
Sbjct: 240 PIVIIIKKNVRTLDSLFKWLKELNARKDGRIGDVPMLFIDDEADNASINTNKEDINPTRT 299
Query: 304 NRNICSIISLFNRSAYVGYTATPFANIFI-PQNEDD-----LFPRDFIINIPAPTNYIGP 357
N + I+ LF +S YVGYTATPFANIFI P DD LFPR FI ++ P +Y GP
Sbjct: 300 NAMVRRILGLFTKSCYVGYTATPFANIFINPDAYDDDVREELFPRHFIYSLDPPNSYFGP 359
Query: 358 EKVFGTSIIPDDTNSDLLPIIYPIKDYDFFVPQGHKKDDDKPKFEDIPESLRIAVKCFIV 417
KVF D+ +S ++ I D + ++P HK DD P D+P SL A+ FIV
Sbjct: 360 HKVF-----VDEVSS--ARVVRLIDDCENYIPSSHKNGDDVP---DLPPSLYGALDEFIV 409
Query: 418 TCAIRIARGQGTRHNSMLIHVSRFQMWQNHIKELVENLFNYYKHEIEASDTAIYEEFRKI 477
IR RGQ +H SML++VSRF Q +++ + E +I
Sbjct: 410 ARTIRNLRGQTGKHCSMLVNVSRFVSVQKQVRDFIS------------------ERQNRI 451
Query: 478 LEDDTANYKSY--KTITNEIKESKFSDIDKNLSVH--SWEEIKPQLFKAVQKIEVKSING 533
+ ANY K T+E +K V +W E++ LF + + + IN
Sbjct: 452 MLAVKANYAKSVPKEETDEHMRRLRGVFEKEFDVSGLTWREVRRALFATFEHMRIYVINS 511
Query: 534 TSGDCLTY--YENEKNGISVIAIGGDKLSRGLTLEGLSVSYFLRASKMYDTLMQMGRWFG 591
S + L Y YE E G++ I IGG LSRGLT+EGL+VSY R ++MYDTL+QMGRWFG
Sbjct: 512 KSDEVLDYKKYEREGVGLTAITIGGLSLSRGLTVEGLTVSYMFRNTRMYDTLLQMGRWFG 571
Query: 592 YRPGYVDLCRLFTSEELNEWFRHITIASEELRGEFNYLAESGGTPENYALKVRTHPGSLQ 651
YRP + DLCR+ S + W+R+I A+EELR + + + TP + L V H +L
Sbjct: 572 YRPNFEDLCRVHLSGDSINWYRYIARATEELREQIARMRRANMTPLQFGLYVEQHSDALL 631
Query: 652 ITSVSKMRYAKQISV--SWAGRLIESYQLPMDKGIKKRN--LIAT--DNFISAQGMPEKK 705
+T+ +KMR +Q++V S++G+L ES ++P+ I ++N LIA + + P K
Sbjct: 632 VTAANKMRSGQQVTVSQSFSGQLKESSKVPISDDINEKNEELIAEFWRSGFGGKVRPTTK 691
Query: 706 GNNYLWRNVSPDDVCDYLSKFKVANSLKKVDLDMISNYIQELVKKGELTSWSVVVMNKNQ 765
G + +V + + D+L +F+V + +++ + K ++ V + + +
Sbjct: 692 G--WAIDDVEQEVIQDFLGRFRVHKDFNPMKGSVLTYLDRIRDKHPKIDVLLVSLSSGGE 749
Query: 766 PAVQYTFSNSIQAGCFDRN----RAEDTNWNTYYIRKNHIVGNQTDEFIDLDEDLINAAL 821
V + + G DRN R +D W R V ++ DE + L ++ I A
Sbjct: 750 DGVAF------KLGAQDRNAGPKRIKDDGWRLSSYR----VASRGDEKLGLTKEQIEQA- 798
Query: 822 ERTRQRKAELNRGWDKEYPAPEIVRQEFRP-RTNPLLLIYPLNP 864
T + N D+ AP V FR R PL++I+ L P
Sbjct: 799 -TTNALADDANEDKDR---APSDV--HFRAVRNKPLVMIHVLRP 836
>gi|23097630|ref|NP_691096.1| endonuclease [Oceanobacillus iheyensis HTE831]
gi|22775853|dbj|BAC12131.1| endonuclease [Oceanobacillus iheyensis HTE831]
Length = 880
Score = 384 bits (986), Expect = e-104, Method: Composition-based stats.
Identities = 275/805 (34%), Positives = 428/805 (53%), Gaps = 86/805 (10%)
Query: 40 DVDAAKLKNDLLSMYSVKIDAFQILEGRE--RRDPWLKDFRANKMSKWEFWMRYAEYLEK 97
D D ++ +L + + V+++ +++G E +RD + + SK +W RY E++++
Sbjct: 62 DSDWNRMNRELETYFDVQMEQGVLVQGEEQQKRDNTWWTSKYKQESKSYYWNRYKEFMKQ 121
Query: 98 HKKFAPSVILQLDELTDRVLDKLFNPQRNEIQISKKGLVVGQVQSGKTANYTGLICKAAD 157
V+ + E TD V++ L NP+ S+ G+VVG VQSGKTANY+ L+CKAAD
Sbjct: 122 --SLPTDVVKGIGEDTDVVMNNLENPKVE--SFSRYGMVVGHVQSGKTANYSALLCKAAD 177
Query: 158 AGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYERAYTMNNTTKIGVGLIPGFDNAIA 217
AG+ I+V+AG NNLR+QTQ R++E F+G D + +GVG + +
Sbjct: 178 AGYKFIVVIAGGINNLRNQTQERLNESFVGRDER----------VPVGVGKLGNLKKELL 227
Query: 218 NSYTTSLEKGDFTSRAAN--TAGFNF-NVPQPILLVVKKNASVLNRLYKWLQTQTINEKI 274
T+ E+ DF R AN G NF N+ PI+LV+KK+ L + WL++Q ++I
Sbjct: 228 PISLTTKEQ-DFNKRDANKNAQGLNFDNIRSPIILVIKKHTRTLTNVIDWLKSQ-YGKEI 285
Query: 275 TNKSLLIIDDEADNASINTNRKELDPTTINRNICSIISLFNRSAYVGYTATPFANIFIPQ 334
++L+IDDE+D ASINT + E PT+IN+ I ++ LF + AYV YTATP+ANIFI
Sbjct: 286 PRHAMLLIDDESDYASINT-KDEDSPTSINKKIRELLYLFKKRAYVAYTATPYANIFIDH 344
Query: 335 NED------DLFPRDFIINIPAPTNYIGPEKVFGTSIIPDDTNSDLLPIIYPIKDYDFFV 388
D D+FP DFI ++ APTNY G EK+F S N+ L I D + +
Sbjct: 345 VADHDKVGRDIFPDDFIYSLEAPTNYFGAEKIFLNS------NTKYL---VEINDCENHI 395
Query: 389 PQGHKKDDDKPKFEDIPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSRFQMWQNHI 448
P HKKD D +PESL A++ FI+ IR RGQG +HNSML+H +RF I
Sbjct: 396 PIKHKKDFD---LYSLPESLHDAIRLFIINIGIRSLRGQGNKHNSMLVHATRFTRVHQQI 452
Query: 449 KELVENLFNYYKHEIEASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKFSDIDKNLS 508
+E+ Y ++ A A Y + + D+ Y Y E E++ + S
Sbjct: 453 STFIED----YLSDLTAGVVA-YGKLK-----DSHKYSLYIKTLKETFETRLPN-----S 497
Query: 509 VHSWEEIKPQLFKAVQKIEVKSINGTSGDCLTYYENEKNGISVIAIGGDKLSRGLTLEGL 568
+W E+ ++ + ++ + ++ ++ + L Y ++ + I IGG L+RG TLEGL
Sbjct: 498 EFAWVEVIDRIVETIETVIIREVHQNTKVHLEYRDDSVT--NAIVIGGTSLARGFTLEGL 555
Query: 569 SVSYFLRASKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEELRGEFNY 628
SVSYF R + YDTLMQMGRWFGYR Y DLCR++ + + E F I A+ +L +
Sbjct: 556 SVSYFYRNTVFYDTLMQMGRWFGYRTDYEDLCRIYMTPTMFENFGLIIEATTDLMDDLKR 615
Query: 629 LAESGGTPENYALKVRTHPGS-LQITSVSKMRYAKQI--SVSWAGRLIESYQLPMDKGIK 685
++ + TP ++ L + HP S LQ+T+ +K + +K I + G+L E+ L D I
Sbjct: 616 MSIAKMTPRDFGLSIIHHPDSGLQVTARNKQKNSKDIYFEMKLDGKLKETSWLHKDPRII 675
Query: 686 KRNLIATDNFI------SAQGMPEKKGNNYLWRNVSPDDVCDYLSKFKVANS-----LKK 734
+ NL+A + M + +YLWRN+ V +L+ FKV + +
Sbjct: 676 QENLVAIREIVRYLEQNKKTEMIGQSNYDYLWRNIDKSHVLRFLNSFKVVYTDPFGIQTR 735
Query: 735 VDLDMISNYIQELVKKGELTSWSVVVMNKNQPAVQYTFSNSIQAGCFDRNRAED--TNWN 792
+ +D I +Y++++ + W + + + + +N + G NR + +
Sbjct: 736 MPIDFIKDYVRDID-----SLWDIALYSGST-------NNIFKEGNVSINRQKRGVEIKD 783
Query: 793 TYYIRKNHIVGNQTDEFIDLD-EDL 816
TYY +N V + T E I L+ EDL
Sbjct: 784 TYYEIQNRQVSSGTSESICLENEDL 808
>gi|93007193|ref|YP_581630.1| endonuclease [Psychrobacter cryohalolentis K5]
gi|92394871|gb|ABE76146.1| endonuclease [Psychrobacter cryohalolentis K5]
Length = 895
Score = 362 bits (930), Expect = 5e-98, Method: Composition-based stats.
Identities = 282/902 (31%), Positives = 455/902 (50%), Gaps = 116/902 (12%)
Query: 48 NDLLSMYSVKIDAFQILEGRERRDPWLKDFRANKMSKWEFWMRYAEYLEKHKKFAPSVIL 107
N+L + K + +L+ +E PWL ++K + RY YL+ ++ V+
Sbjct: 52 NNLNLFRNTKQENAHVLDDQEI-TPWLHTVSSDKY----YSDRYYTYLQNEERLPARVVD 106
Query: 108 QLDELTDRVLDKLFNPQRNEIQISKKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLA 167
+ + +L++L NP + ++GLVVG VQSGKTANY LI AAD + LII++
Sbjct: 107 VMKRTNEDILERLGNPNI-DTPFKRQGLVVGNVQSGKTANYLSLINLAADYDYKLIILIT 165
Query: 168 GIHNNLRSQTQNRIDEGFLGFDTQYERAYTMNNTTKIGVGLIPGFDNAIANSYT---TSL 224
GIHNNLRSQTQ R+++GF+G +++ ++ T +GV P D I TSL
Sbjct: 166 GIHNNLRSQTQKRVNKGFIGVNSEEQK------LTGVGVTNNPVLDKEIKAKQPHCLTSL 219
Query: 225 EKGDFTSRAANTAGFNF-NVPQPILLVVKKNASVLNRLYKWLQTQTINEKITNKSLLIID 283
+ DF+ + T N P+++V+KKN L + KWL+ ++ N+ +LIID
Sbjct: 220 TE-DFSGKIRKTTAVRLENAKNPVIMVIKKNHHTLGNVIKWLRNNEQGQRFINEPVLIID 278
Query: 284 DEADNASINTNRKELDPTTINRNICSIISLFNRSAYVGYTATPFANIFI-PQNED----- 337
DEADNASINT++ + T IN I +LF + +Y+GYTATPFANIFI P N D
Sbjct: 279 DEADNASINTSKHPNETTKINGQIRETFNLFRQGSYIGYTATPFANIFIDPANHDEALAD 338
Query: 338 DLFPRDFIINIPAPTNYIGPEKVF-GTSIIPDDTNSDLLPIIYPIKDYDFFVPQGHKKDD 396
DLFP+ FI + P+NY+GP + F G P+ + P + I+D + +P KD
Sbjct: 339 DLFPKHFIFTLVPPSNYLGPAQFFLGEDGEPNYDS----PYVEFIQDNEDSIPLKKPKDF 394
Query: 397 DKPKFEDIPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSRFQMWQNHIKELVENLF 456
++P+SL +A+ ++VT I+ R +H SMLI++S Q + LV
Sbjct: 395 ---VLTELPKSLTLALYEYLVTIVIKSVRSVSNKHTSMLINISHKTQEQADAQYLVSQTL 451
Query: 457 NYYKHEIEASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKFSDIDKNLSVHSWEEIK 516
EE R ++ A + + + N E + +L+ + +E+
Sbjct: 452 ---------------EEIRLAVKFHGAKPQQER-LKNPHLEGLYRKFLPHLNKQNVDELF 495
Query: 517 PQLFKAVQKIEVKSINGTSGDCLTYYENEKNGISVIAIGGDKLSRGLTLEGLSVSYFLRA 576
QL K + ++V+ IN S D L Y +N +NG++VIAIGG LSRG TLEGL++SY +R
Sbjct: 496 TQLQKVIDSVKVRVINSESKDKLDY-DNYENGLNVIAIGGYSLSRGFTLEGLTISYLIRN 554
Query: 577 SKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEELRGEFNYLAESGGTP 636
+ M DTL+QMGRWFGYR Y DLC+L+ + EW+ I + EELR EF+ + ++G TP
Sbjct: 555 TAMADTLLQMGRWFGYREQYDDLCKLYIPDASFEWYAFIAQSIEELRSEFSIMEQNGLTP 614
Query: 637 ENYALKVRTHPGSLQITSVSKMRYAKQISVSWA-------------GRLIESYQLPMDKG 683
+ LKVRT L IT+ +KM + + +++S + L+ +L D
Sbjct: 615 AEFGLKVRTSNTGLLITARNKMHHTEAVTLSESFSEEAFTLRGIALNNLLSQRKLFTDFA 674
Query: 684 IKKRNLIATD---NFISAQGMPEKKGNNYLWRNVSPDDVCDYLSKFK--VANSLKKVDLD 738
+K +L N++ A E +L + V D+V + L++++ + ++ +K +
Sbjct: 675 LKSADLYGFKPVYNYVRA----ENDKQFHLIQKVGIDEVINLLNEYEHSIDDNGRK---E 727
Query: 739 MISNYIQELVKKGELTSWSVVV----MNKNQPAVQYTFSNSIQAGCFDRNRAEDTNWNTY 794
+ +YI+E ++ EL +W + + + +N + TF+ S C+ + + +
Sbjct: 728 ALISYIEE--REDELATWDIAIHKSYLGENALSQVRTFAKS----CYHSDHVKTADGGGR 781
Query: 795 YIRKNHIVGNQTDEFIDLDEDLINAALERTRQRKAELNRGWDKEYPAPEIVRQEFRPRTN 854
+R + F DE +A ++ + + + R + R R+
Sbjct: 782 ILRP------WEEAFGLTDEQYASAEQQKEKSKVKQTARFYR-------------RNRSR 822
Query: 855 PLLLIYPLNPECANVKDKHGNIQSGTISYSKTDDP----FIGFVISFPSSSTNIAISYAV 910
PLL++ + + K+ K DDP + ISFP SS + +SY V
Sbjct: 823 PLLVLKSFDLYVGSKKEGD----------KKGDDPKYQDVPAYAISFPKSSNHNPVSYLV 872
Query: 911 NQ 912
N+
Sbjct: 873 NK 874
>gi|89055354|ref|YP_510805.1| endonuclease [Jannaschia sp. CCS1]
gi|88864903|gb|ABD55780.1| endonuclease [Jannaschia sp. CCS1]
Length = 872
Score = 324 bits (831), Expect = 2e-86, Method: Composition-based stats.
Identities = 262/854 (30%), Positives = 410/854 (48%), Gaps = 105/854 (12%)
Query: 88 WMRYAEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQRNEIQISKKGLVVGQVQSGKTAN 147
W EYL K K + P I LD+ + V+ L NP R + + +GLVVG VQSGKTAN
Sbjct: 89 WTALHEYL-KTKGWNPHTIGSLDKASTEVVSLLGNPGRG--KFACRGLVVGYVQSGKTAN 145
Query: 148 YTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYERAYTMNNTTKIGVG 207
T ++ KA DAG+N+IIVL G+ N LR QTQ+R ++ L + ++
Sbjct: 146 MTAVMAKAVDAGYNMIIVLGGVTNKLRKQTQDRFEQDVLRHRSLWQL------------- 192
Query: 208 LIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNF-NVPQPILLVVKKNASVLNRLYKWLQ 266
YTTS GDF A GF + L+V+KK S L +L + +
Sbjct: 193 ------------YTTSDSAGDFVQPA--NGGFAMPSEAHAQLIVMKKEGSRLKQLRRTI- 237
Query: 267 TQTINEKITNKSLLIIDDEADNASINTNRKELDPTTINRNICSIISLFNRSAYVGYTATP 326
+T + +L+IDDE D AS+N+ R E D T IN I ++++ YVGYTATP
Sbjct: 238 ARTPAVVLRELRVLLIDDECDQASVNSARGEFDMTRINAEIRTVLAALPAVTYVGYTATP 297
Query: 327 FANIFI-------PQNEDDLFPRDFIINIPAPTNYIGPEKVFGT-SIIPDDTNSDLLPII 378
FAN+FI ++ DDL+PRDFI + P Y G +VFG+ S P+ D++ I+
Sbjct: 298 FANVFINPYPYGNDEDLDDLYPRDFITALEQPIGYFGAREVFGSDSAEPEGEERDMIRIL 357
Query: 379 YPIKDYDFFVPQGHKKDDDKPKFE-DIPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIH 437
+ D D P K +K F ++ SL A+ F+ +CAIR +RGQ H SML+H
Sbjct: 358 HQ-DDPDRLRPTSPK---NKESFHPEMTRSLEDAILWFLTSCAIRRSRGQDGEHMSMLVH 413
Query: 438 VSRFQMWQNHIKELVENLFNYYKHEIEASDTAIYEEFRKILEDDTANYKSYKTITNEIKE 497
S++ ++ L+ N K ++ A E R++ E + ++ + I E
Sbjct: 414 SSQYVRQHEYMSGLIRNWVEQRKSDLIAGTGEPAERLREVFERELE--RTAPAGRDRIPE 471
Query: 498 SKFSDIDKNLSVHSWEEIKPQLFKAVQKIEVKSINGTSGDCLTYYENEKNGISVIAIGGD 557
D D L+ L + + NG + + L + + ++ I +GG
Sbjct: 472 ----DFDTVLAY---------LPAVIDALRYPVENGETEEALR-LDYTGDPVTCIVVGGT 517
Query: 558 KLSRGLTLEGLSVSYFLRASKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITI 617
L+RGLTLEGL VS+FLR SK YDTL+QMGRWFGYR Y DL RL+T+E+L FR + +
Sbjct: 518 VLARGLTLEGLCVSFFLRTSKQYDTLLQMGRWFGYRRDYEDLPRLWTTEDLASKFRSLAV 577
Query: 618 ASEELRGEFNYLAESGGTPENYALKVRTHPGSLQITSVSKMRYAKQISVSWAGRLIESYQ 677
EE+R + E+ TP ++A+KVR+ PG + IT+ +KM++A + S+S++G+ +++ +
Sbjct: 578 IEEEIRADIAVYRENKLTPRDFAVKVRSIPG-MAITAATKMKHALRTSMSFSGKHVQTIR 636
Query: 678 LP-MDKGIKKRNLIATDNFI-----SAQGMPEKKGNNYLWRNVSPDDVCDYLSKFKVANS 731
D I N A + +A+ E +G L+R++ + +F A
Sbjct: 637 FDHRDSEIVGGNWNAASQLVDSMLETAKATEETRG--ILFRDIR----FQLVRRFVAATE 690
Query: 732 LKKVDLDMISNYIQELVKKGE--LTSWSV-VVMNKNQPAVQYTFSNSIQAGCFDRNRAED 788
+ +D+ ++ + + E L W+V V+ K+ Q R+R +
Sbjct: 691 ISGEHMDLKKPHLIRYLDEAESSLAHWNVAVIQTKSNTKSAKPLGKLGQVPTMRRSRLSE 750
Query: 789 TNWNTYYIRKNHIVGNQTDEFIDLDEDLINAALERTRQRKAELNRGWDKEYPAPEIVRQE 848
+ I+ + ++ D ID D R+ E W +
Sbjct: 751 ASPRYADIKA---LMSKADILIDAD-------------REPEGREDW--------TTYKA 786
Query: 849 FRPRTNPLLLIYPLNPECANVKDK-HGNIQSGTISYSKTDDPFIGFVISFPSSSTNIAIS 907
RP PLLL+Y ++ + + K G+ ++ D +GF I FP
Sbjct: 787 CRPEA-PLLLLYLIDAKSKPQERKSSGDKPPSRVALDAVSD-IVGFGIVFPGQKDRSGGY 844
Query: 908 YAVN-QAAEFAKTE 920
++V+ +A +TE
Sbjct: 845 FSVDIEAPALEETE 858
>gi|84495902|ref|ZP_00994756.1| endonuclease [Janibacter sp. HTCC2649]
gi|84382670|gb|EAP98551.1| endonuclease [Janibacter sp. HTCC2649]
Length = 890
Score = 288 bits (736), Expect = 2e-75, Method: Composition-based stats.
Identities = 210/603 (34%), Positives = 316/603 (52%), Gaps = 71/603 (11%)
Query: 90 RYAEYLEKHKKFA--PSVILQLDELTDRVLDKLFNPQRNEIQISKKGLVVGQVQSGKTAN 147
RY YL + +V+ +D +++V+ +P ++ KKGLV+G VQSGKTAN
Sbjct: 90 RYWGYLRSQIEGTGLATVLPDIDVASNKVVAHFADPGIRRLK--KKGLVLGYVQSGKTAN 147
Query: 148 YTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYERAYTMNNTTKIGVG 207
Y ++ KAADAG+ L IV++G+HNNLR QTQ R++ +GVG
Sbjct: 148 YAAVMAKAADAGYRLCIVMSGMHNNLRRQTQVRLNR-------------------DLGVG 188
Query: 208 LIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVPQPILLVVKKNASVLNRLYKWLQT 267
N + E DF A N + VVKKN L RL WL
Sbjct: 189 ----------NWNALTTEDRDFGEVLHGQALLQ-NPQARTIAVVKKNPMRLRRLRDWL-- 235
Query: 268 QTINEKITNKS-LLIIDDEADNASINTNRKELDPTTINRNICSIISLFNRSAYVGYTATP 326
Q I+ + ++ +L++DDEAD A+ N+ + + + IN+ I I + +YVGYTATP
Sbjct: 236 QEIDPAVRAQAPILLLDDEADQATPNSLAAKQEMSKINKLIREIWAEIPTGSYVGYTATP 295
Query: 327 FANIFI-PQNEDDLFPRDFIINIPAPTNYIGPEKVFGTSIIPDDTNS----DLLPIIYPI 381
FANIF+ P +E +++P DFI+++P Y G E++FG + D + D++ +I
Sbjct: 296 FANIFMDPNDETEMYPSDFILDLPRSDEYFGAERIFGRQSVEDADDPEPGLDMVRMISVA 355
Query: 382 KDYDFFVPQGHKKDDDKPKFE-DIPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSR 440
+ P + D+ F+ D+P SL AV F+V IR RGQ H+SML+H +
Sbjct: 356 EATSLRPPSSSQ---DRASFDPDMPSSLGDAVVWFVVATCIRRLRGQ-MDHSSMLVHTTS 411
Query: 441 FQMWQNHIKELVENLFNYYKHEIEASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKF 500
+ ++ +E++ + ++E D + FRK +D++
Sbjct: 412 YVAPHFAMQGRIESMIFKLRRQLEDGDLST---FRKSFDDESERVPGV------------ 456
Query: 501 SDIDKNLSVHSWEEIKPQLFKAVQKIEVKSINGTSGDCLTYYENEKNGI----SVIAIGG 556
L + +W+++ L + +I V NG S D L Y +G +VIA+GG
Sbjct: 457 ----TQLGMPAWDDVTSVLVDVLDEIRVVVDNGRSTDRLDYGRLSDSGRPITETVIAVGG 512
Query: 557 DKLSRGLTLEGLSVSYFLRASKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHIT 616
LSRGLTLEGL+VSYF R S YDTL+QMGRWFGYRPGY DL R++ + +L+E F+ +
Sbjct: 513 GTLSRGLTLEGLTVSYFTRTSNTYDTLLQMGRWFGYRPGYEDLPRIWMTNDLSEDFQFLA 572
Query: 617 IASEELRGEFNYLAESGG-TPENYALKVRTHPGSLQITSVSKMRYAKQISVSWAGRLIES 675
EE+R E L GG TP ++VR HPG L IT KM +A+ + VS++G+ ++
Sbjct: 573 TVEEEIRIEIRRLTAMGGITPSQMGVRVRAHPGRLSITDPKKMHHARLVQVSFSGQRHQT 632
Query: 676 YQL 678
+ L
Sbjct: 633 FLL 635
>gi|154488613|ref|ZP_02029462.1| hypothetical protein BIFADO_01920 [Bifidobacterium adolescentis
L2-32]
gi|154082750|gb|EDN81795.1| hypothetical protein BIFADO_01920 [Bifidobacterium adolescentis
L2-32]
Length = 910
Score = 275 bits (702), Expect = 2e-71, Method: Composition-based stats.
Identities = 238/791 (30%), Positives = 384/791 (48%), Gaps = 107/791 (13%)
Query: 45 KLKNDLLSMYSVKID-AFQILEGRERRDPWLKDFRANKMSKWEFWMRYAEYLEKHKKFAP 103
+L+N+L+ KID + + PWL D +++ + Y +YL K +
Sbjct: 58 ELRNELVP----KIDEGITLTDQNSAWKPWLSDMQSSDTWETPRSDSYYQYLVMDKNSSY 113
Query: 104 SVILQLDELTDRVLDKLFNPQRNEIQISKKGLVVGQVQSGKTANYTGLICKAADAGFNLI 163
S LD + ++ L +P+R++ +S+KGL++G VQSGKT Y L+ KAAD G+ LI
Sbjct: 114 ST---LDYTANEIVKLLADPRRDQPAVSRKGLILGDVQSGKTRTYIALMNKAADCGYRLI 170
Query: 164 IVLAGIHNNLRSQTQNRIDEGFLGFDTQYERAYTMNNTTKIGVGLIPGFDNAIANSYTTS 223
IVL + NLR QTQ RID F+G+ + ++G+G + IA+ +
Sbjct: 171 IVLTSDNENLRQQTQERIDTDFIGW----------QDGIRVGIG---KYQQNIAHPSQLT 217
Query: 224 LEKGDFTSRAANTAGFNF-----NVPQPILLVVKKNASVLNRLYKWLQTQTINEKITNKS 278
E DF + A + A +F N P + V+KKNAS+L++ KW + ++ +
Sbjct: 218 NEN-DFVA-AYDKAFHSFPRPTWNSAAPCVAVIKKNASILSKFNKWFDSPEFDKDL---P 272
Query: 279 LLIIDDEADNASINTNRKELDPTTINRNICSIISLFNRSAYVGYTATPFANIFIPQ-NED 337
+LIIDDE+D AS+N+ + + PT IN I + + +R++YV TATPFANIFI +E
Sbjct: 273 VLIIDDESDYASVNSAKLDDSPTRINSLIRDLCQISSRTSYVAVTATPFANIFIDDADES 332
Query: 338 DLFPRDFIINIPAPTNYIGPEKVFG-TSIIPDDTNSDLLPIIYPIKDYDF--FVPQGHKK 394
DLFP+DFI + +P YIG +K+FG +P+D++ + I + + ++P H K
Sbjct: 333 DLFPQDFIHILKSPDAYIGAKKLFGDMDSVPEDSSC-----VREIDEGELESWLPVSHGK 387
Query: 395 DDD--KPKFEDIPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSRFQMWQNHIKELV 452
+ D P+ +D ++ AV FI CA+R SMLIH+SRF Q I + V
Sbjct: 388 NYDIVDPELDD---QVKHAVCTFINACALR--PNAEDEQQSMLIHMSRFTDVQRQIADRV 442
Query: 453 ENLFNYYKHEIEASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKFSDIDKNLSVHSW 512
Y ++E + + + ++D ++S + +I SW
Sbjct: 443 SG----YLRQVENA-VRFHADGDPRIDDLQEAFESEYYTSTDI---------------SW 482
Query: 513 EEIKPQLFKAVQ--KIEVKSINGTSGDCLTYYENEKNGIS---VIAIGGDKLSRGLTLEG 567
+ ++ + VQ ++ V+ +N S D + + S I IGG++LSRG+TL G
Sbjct: 483 GMMFLRIRRLVQSSRLRVRLVNSDSDDWSLRNDVPPDLTSNECTIFIGGNQLSRGMTLSG 542
Query: 568 LSVSYFLRASKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEELRGEFN 627
L S F R DTL+QMGRWFGYRP Y +L R++ E +R+ EEL+ +
Sbjct: 543 LICSVFYRRVTASDTLLQMGRWFGYRPNYANLQRIWLLPESVLDYRYSCSIVEELKESAS 602
Query: 628 YLAESGGTPENYALKVRTHPGS-LQITSVSKMRYAKQ----ISVSWAGRLIESYQLPMDK 682
+ G TP+ + L +R +P ++IT+ SKMR A + AG +IES +L +D
Sbjct: 603 RMKHLGMTPKQFGLAIRKNPNKGVRITNASKMRNAVEGIGYQEFDMAGEIIESIKLDVDM 662
Query: 683 GIKKRNLIATDNFISAQGMPEKKGN------NYLWRNVSPDDVCDYLSKFKVANSLKKVD 736
+ +N A + P+ + +++NV V D+LS+++
Sbjct: 663 KRRNQNDEAFMKLLGVCNAPQVISSVSPLVETQVFQNVPAKAVVDFLSQYR--------- 713
Query: 737 LDMISNYIQELVKKGELTSWSVVVMNKNQPAVQYTFSNSIQAGCFDRNRAEDTNWNTYYI 796
S Y T + +M ++ S + Q C + D WN +I
Sbjct: 714 ----SGYRD--------TFFGPTLMTYRDQEIEMNTSMAEQYACTQLSENPDMTWNIGFI 761
Query: 797 RKNHIVGNQTD 807
N GN+ +
Sbjct: 762 NGN---GNEVE 769
>gi|78778758|ref|YP_396870.1| endonuclease [Prochlorococcus marinus str. MIT 9312]
gi|78712257|gb|ABB49434.1| endonuclease [Prochlorococcus marinus str. MIT 9312]
Length = 908
Score = 270 bits (690), Expect = 4e-70, Method: Composition-based stats.
Identities = 243/804 (30%), Positives = 382/804 (47%), Gaps = 98/804 (12%)
Query: 88 WMRYAEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQRNEIQISKKGLVVGQVQSGKTAN 147
W + +YL K ++ I +DE + +V++++ +P +E + +GLV+G VQSGKT+N
Sbjct: 93 WFAFKKYLSVSKSWSDEEINSVDESSTKVVNQILSPA-SEKEKRFQGLVLGYVQSGKTSN 151
Query: 148 YTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYERAYTMNNTTKIGVG 207
I KAAD G+ LII+LAG+ ++LR QTQ R+ + LG Q +R Y +
Sbjct: 152 MAATIAKAADRGYKLIIILAGLTDSLRKQTQIRMQKD-LGQHLQ-DRWYFCTD------- 202
Query: 208 LIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVPQPILLVVKKNASVLNRLYKWLQT 267
E+ DFTS N + +L++KKN +L RL K ++
Sbjct: 203 -----------------EENDFTSYTELPTWDNER--KTTILIIKKNVFILKRLLKKIKN 243
Query: 268 QTINEKITNKSLLIIDDEADNASINTNRKELDPTTINRNICSIISLFNRSAYVGYTATPF 327
Q+ K + LI+DDE D AS+NT + N+ I I+ + Y+GYTATP+
Sbjct: 244 QS-EVKRKGMTTLIVDDECDQASLNTKAYREQVSQTNKYIRQILENLRKVTYLGYTATPY 302
Query: 328 ANIFIPQN----EDDLFPRDFIINIPAPTNYIGPEKVFGTSIIPDDTNSDLLPIIYPIKD 383
ANI PQ + DL+P DFI+++ P NY G K+FG D+ N LP I +K
Sbjct: 303 ANILTPQKSVDGKLDLYPNDFIVSLDEPKNYFGARKLFGDEFDVDNDNE--LPFIRRVKA 360
Query: 384 YDFFVPQGHKKDDDKPKFEDIP---ESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSR 440
+ + + K +F+ P +SL + +++ + RGQG H MLIH +
Sbjct: 361 DEI---ENLQPPSQKARFDFTPSLTQSLVDSCDYYLLCLCAKTLRGQGKDHCCMLIHTTI 417
Query: 441 FQMWQNHIKE-LVENLFNYYKHEIEASDTAIYEEFRKILEDDTANYKSYKTITNEIKESK 499
+ +K L++ N IE D + + E KESK
Sbjct: 418 YSETHKDLKNLLLKEWLNPLIKNIENGDEFTLNRLKFLWE----------------KESK 461
Query: 500 FSDID--KNLS----VHSWEEIKPQLFKAVQKIEVKSINGT--SGDCLTYYENEKNGISV 551
D + LS + S+ EIK + K + IE+ N T S D L ++ +KN I
Sbjct: 462 VLDTNFRNKLSCPECIESFNEIKDLILKEAKSIELVIENSTVNSKDRLD-FDTDKN-IHA 519
Query: 552 IAIGGDKLSRGLTLEGLSVSYFLRASKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEW 611
I IGG+ L+RGLT+EGL S+F+R+S YDTLMQMGRWFGYR GY DL R++ + +L
Sbjct: 520 IVIGGNVLARGLTIEGLICSFFIRSSTQYDTLMQMGRWFGYRKGYEDLPRIWMTFDLEYN 579
Query: 612 FRHITIASEELRGEFNYLAESGGTPENYALKVRTHPGSLQITSVSKMRYAKQISV---SW 668
FR + +R + + + + G TP+ +++V +L IT+ +K+ K +SV S
Sbjct: 580 FRDLVNVENLIRSDISDMGKEGLTPQEMSIRVPLL-ANLNITARNKLNMNK-LSVCVGSL 637
Query: 669 AGRLIESYQLPMDKGIKKRNLIATDNFIS-----AQGMPEKKGNNYLWRNVSPDDVCDYL 723
G ++ P DK K N +N I+ + +K N+Y+ ++V + +
Sbjct: 638 YGTYKQTIAFPTDKNFHKSNFSCIENLINNSTKYTKDNFQKTDNSYVLKDVEYPPILKFF 697
Query: 724 SKFKV-ANSLKKVDLDMISNYIQELVKKGELTSWSVVVMNKNQPAVQYTFSNSIQAGCFD 782
FK +L+K+D I N + E L W++ ++ + G +
Sbjct: 698 RSFKFNEETLQKID-QFIENEVDE--DSSSLGKWNIGIIGSKTSNREIKIGKLDDVGTVN 754
Query: 783 RNRAEDTNWNTYYIRKNHIVGNQTDEFIDLDEDLINAALERTRQRKAELNRGWDKEYPAP 842
R++ + + +D +D+D N + +Q K + R WD
Sbjct: 755 RSKQFIDTASLKNKISIKALMFASDLLVDVDRQEYN----KWKQEKDDSIREWD------ 804
Query: 843 EIVRQEFRPRT---NPLLLIYPLN 863
+VRQ FR PLLLI+P+N
Sbjct: 805 -LVRQ-FREEVLGKRPLLLIFPIN 826
>gi|145593136|ref|YP_001157433.1| hypothetical protein Strop_0574 [Salinispora tropica CNB-440]
gi|145302473|gb|ABP53055.1| hypothetical protein Strop_0574 [Salinispora tropica CNB-440]
Length = 967
Score = 265 bits (676), Expect = 1e-68, Method: Composition-based stats.
Identities = 219/670 (32%), Positives = 318/670 (47%), Gaps = 83/670 (12%)
Query: 78 RANKMSKWEFWMRYAEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQRNEIQISKKGLVV 137
RA + W + RY L+K + P + LD T+ V+ +L +P R + KGLVV
Sbjct: 134 RAERTFYWAHYHRY--LLDKWRN--PDAVADLDRATEEVVRRLSDPTR-PVAYQAKGLVV 188
Query: 138 GQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDT------- 190
G VQSGKTAN+TG++ KA DAG+ L+IVL G N LR+QTQ R+D G +
Sbjct: 189 GYVQSGKTANFTGVVAKAVDAGYRLVIVLTGTTNMLRAQTQRRLDIELCGRENIEREISP 248
Query: 191 QYERAYTMNNTTK------IGVGLIPGFDNAIANSYTTSLEKGD------------FTSR 232
++A+ N + G P D + + S GD F R
Sbjct: 249 HDQKAHEYQNDPDWKGDRFVRHGGRPS-DAGYPDIHRLSNHAGDYRRLKMGFSVLEFRGR 307
Query: 233 AANTAGF---NFNVPQPILLVVKKNASVLNRLYKWLQTQTINEKITNKSLLIIDDEADNA 289
N F N + L+V KKNASVL L L I++++ +LI+DDE+D A
Sbjct: 308 ERNRKFFDPANLHTSDARLVVAKKNASVLEALVADLGR--ISDRLGEVPVLIVDDESDQA 365
Query: 290 SINT------NRKELDPTTINRNICSIISLFNRSAYVGYTATPFANIFI-PQNEDDLFPR 342
S+NT T INR I ++ + R+ YVGYTATP+AN+FI P + +D+FPR
Sbjct: 366 SVNTVSPKKWKEDSKKRTAINRLIGQLLRMMPRAQYVGYTATPYANVFIDPSDVEDIFPR 425
Query: 343 DFIINIPAPTNYIGPEKVFGTSI-IPDDTNSDLLPIIYPIKDYDFFVPQGHKKDDDKPKF 401
DF+I++P P Y+G E + +P D + K+ G + DD +
Sbjct: 426 DFLISLPRPDGYMGAEDFHDFDVELPLDQRP-----LTTSKERAHVRYLGDEPDDSE--- 477
Query: 402 EDIPESLRIAVKCFIVTCAIRIAR---GQGTRHNSMLIHVSRFQMWQNHIKELVENLFN- 457
LR AV F++T A+++ R G RH++ML+H + + EL+ +L+N
Sbjct: 478 ------LRCAVDMFVLTGALKLYRQRHGTSYRHHTMLVHEAMGKDSHRQTAELIGHLWNA 531
Query: 458 --YYKHEIEASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKFSDIDKNLSVHSWEEI 515
YY+ A +Y +L A + T N F D+ ++ + I
Sbjct: 532 AGYYQPTGLARLRDLYHT--DVLPVSRAQAPTLLTPNN------FDDLRDDIG-DALRLI 582
Query: 516 KPQLFKAVQKIEVKSING--TSGDCLTYYENEKNGISVIAIGGDKLSRGLTLEGLSVSYF 573
P I V S + L + + E I I +GG+KL+RG T+EGL+V+Y+
Sbjct: 583 SPADRNGSPVIVVNSDTDLEKQQESLDFDQRE---IWRILVGGNKLARGFTVEGLTVTYY 639
Query: 574 LRASKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEELRGEF-NYLAES 632
R + DTLMQMGRWFG+RPGY DL RL+T+ +L++ F + LR E Y A +
Sbjct: 640 RRTTAQVDTLMQMGRWFGFRPGYRDLVRLYTTPDLHDMFEAACRDEDFLRRELRQYAAPT 699
Query: 633 GG----TPENYALKVRTHPGSLQITSVSKMRYAKQISVSWAGRLIESYQLPMDKGIKKRN 688
GG TP + H L+ T +KM A + S GR +E P K N
Sbjct: 700 GGSPQLTPRQIPPLIAQHRPDLRPTGRNKMWNAMLVLKSSPGRPMEPVAFPSTVAKVKHN 759
Query: 689 LIATDNFISA 698
L I+A
Sbjct: 760 LEQWKPLIAA 769
>gi|134098148|ref|YP_001103809.1| endonuclease [Saccharopolyspora erythraea NRRL 2338]
gi|133910771|emb|CAM00884.1| endonuclease [Saccharopolyspora erythraea NRRL 2338]
Length = 966
Score = 258 bits (660), Expect = 1e-66, Method: Composition-based stats.
Identities = 207/673 (30%), Positives = 316/673 (46%), Gaps = 100/673 (14%)
Query: 72 PWLK-DFRANKMSKWEFWMRYAEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQRNEIQI 130
PW + RA S+ +W Y L K + + LD TD+V+++L +P R E
Sbjct: 129 PWYTPEIRA---SRDFYWSSYKTLLTG-KGWDADAVTALDRTTDQVIERLSDPSRPE-AF 183
Query: 131 SKKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDT 190
KGLVVG VQSGKTAN+TG++ KA DAG+ LIIVL G + LRSQTQ R+D +G +
Sbjct: 184 QAKGLVVGHVQSGKTANFTGVVAKAIDAGYRLIIVLTGTTDMLRSQTQRRLDMELIGVEN 243
Query: 191 ----------------QYERAYTMNNTTKIGVGLIPG-FDNAIANSYTTSLEKGDFTSRA 233
Y+ + G+ P D + TT GD+ S
Sbjct: 244 ILGGIDPDDADLLDEVDYQDDPDWIEGKFLSHGVQPNDMDRPAIHRLTTY--AGDYKSLK 301
Query: 234 ANTAGFNFN--------------VPQPI-LLVVKKNASVLNRLYKWLQTQTINEKITNKS 278
NF +P + +VKKN +VL +L L +
Sbjct: 302 QGIEALNFPKADRRKRFFELENLLPADTKVAIVKKNGTVLRKLASDLHRIKKKTNLGEIP 361
Query: 279 LLIIDDEADNASINTNRK------ELDPTTINRNICSIISLFNRSAYVGYTATPFANIFI 332
LI+DDE+D AS+NT+ E++ T IN I ++ R+ YVGY+ATPFAN+FI
Sbjct: 362 TLIVDDESDQASVNTSNPKNWTNGEVERTAINGLISDLLDALPRAQYVGYSATPFANVFI 421
Query: 333 -PQNEDDLFPRDFIINIPAPTNYIGPEKVFGTSIIPDDTNSDL--LPIIYPIKDYDFFVP 389
P + +D+FP+DFI+++ P Y G D SDL Y + + F+
Sbjct: 422 DPSDTEDIFPKDFILSLDPPPGYTGAAAF-------HDLESDLDEADRTYANSNRNSFI- 473
Query: 390 QGHKKDDDKPKFEDIPESLRIAVKCFIVTCAIRIAR----GQGTRHNSMLIH----VSRF 441
+ +P ED E+LR A+ F++T AI++ R + RH++ML+H +
Sbjct: 474 ----RFVREPSGED-DETLRKAIDTFVLTGAIKLYRQEHGARTFRHHTMLVHETMRTADH 528
Query: 442 QMWQNHIKELVENLFNYYKHEIEASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKFS 501
M I+ + ++ YY T + ++ E D +K +
Sbjct: 529 SMQAGRIRAMWKSA-GYYS-------TGSEQRLHRVFEGD-------------VKPVSLA 567
Query: 502 DIDKNLSVHSWEEIKPQLFKAVQKI----EVKSINGTSGDCLTYYENEKNGISVIAIGGD 557
+ + +V ++E++P + KAV ++ V +N + +++ + I +GG+
Sbjct: 568 RSESSAAVPEFDELRPYIAKAVSRVGRFDPVLVVNSDKDAVTEQLDFDRDDVWRILVGGN 627
Query: 558 KLSRGLTLEGLSVSYFLRASKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITI 617
KL+RG T+EGL+V+YFLR +KM D +MQMGRWFG+R Y DL RLF +EEL E F I +
Sbjct: 628 KLARGFTVEGLTVAYFLRKTKMADAMMQMGRWFGFRRSYADLMRLFLTEELYEAFEAIAL 687
Query: 618 ASEELRGEFNYLA-----ESGGTPENYALKVRTHPGSLQITSVSKMRYAKQISVSWAGRL 672
+ R E A + TP+ V L+ TS +KM A+ + G
Sbjct: 688 DEQYFRKELKRYARPVHGQKQITPKQVPPLVAQRLPWLKPTSANKMYNARLVERRSPGGA 747
Query: 673 IESYQLPMDKGIK 685
+E P K ++
Sbjct: 748 VEPGAYPAKKDLR 760
>gi|125718616|ref|YP_001035749.1| Conserved uncharacterized protein [Streptococcus sanguinis SK36]
gi|125498533|gb|ABN45199.1| Conserved uncharacterized protein [Streptococcus sanguinis SK36]
Length = 905
Score = 241 bits (615), Expect = 2e-61, Method: Composition-based stats.
Identities = 218/718 (30%), Positives = 340/718 (47%), Gaps = 87/718 (12%)
Query: 88 WMRYAEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQRNEIQISKKGLVVGQVQSGKTAN 147
W Y LEK + + I L + T +L L N R ISK GLV+G VQSGKTAN
Sbjct: 96 WKNYERKLEK-QGWNQISIETLKKSTVEILSYLSN-SRESNGISK-GLVLGNVQSGKTAN 152
Query: 148 YTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYERAYTMNNTTKIGVG 207
+G+I AAD G+N IVL+G NLR+QT NR+ Y +N T V
Sbjct: 153 MSGVISMAADLGYNFFIVLSGTIENLRNQTANRL----------YNDVRETSNLTWRLVD 202
Query: 208 LIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVPQPILLVVKKNASVLNRLYKWLQT 267
P + + + + N N L V KN S L L +WL +
Sbjct: 203 R-PSLRSNLPDQKISKFL-------------LNENDKDRYLTVCLKNKSRLENLIRWLYS 248
Query: 268 QTINEKITNKSLLIIDDEADNASINTNR-KELDPTTINRNICSIISLFNRSA--YVGYTA 324
K +L+IDDEAD AS+NTN+ +E DPT IN+ I +++ N A Y+ YTA
Sbjct: 249 D--ENKTRQLKVLVIDDEADQASVNTNKIEEQDPTKINKLIKELVNKGNVKAMNYIAYTA 306
Query: 325 TPFANIFIPQNEDDLFPRDFIINIPAPTNYIGPEKVFGTSIIPDDTNSDLLPIIYPIKDY 384
TP+ANI D L+P+DFI+ +P T+YIGPE++FGTS + + I+ I DY
Sbjct: 307 TPYANILNETANDSLYPKDFIVVLPKSTDYIGPEEMFGTS---EPEQVQKVDIVREISDY 363
Query: 385 DFFVPQGHKKDDDKPKFEDIPESLRIAVKCFIVTCAIRIARGQGTRHN-SMLIHVSRFQM 443
D V +G K + IP+SL A+ FI++ R G R SMLIH S
Sbjct: 364 DVQVIRGLGKGEAI----QIPKSLEEAIDWFIISTGA--MRHLGYRKPISMLIHTS---- 413
Query: 444 WQNHIKELVENLFNYYKHEIEASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKFSDI 503
++ + E + + Y +I + + + + + ++ ++ + + N +S
Sbjct: 414 FKVNDHESIAKIVTSYVKKIRNNPSEFFSKLEILYMNEKIDFSRQRFLDNM---ENYSSP 470
Query: 504 DKNLSVHSWEEIKPQLFKAVQ------------------------KIEVKSINGTSGDCL 539
D + +W +I+ Q+ + + I + + TS D +
Sbjct: 471 DAVPAYPNWVDIREQIERIFRLDDNEYLSHVQLTDNGEPKYHEGFHIAIDNSKMTSADEM 530
Query: 540 T--YYENEKNGISV----IAIGGDKLSRGLTLEGLSVSYFLRASKMYDTLMQMGRWFGYR 593
Y + N + I IGG+ LSRGLT+EGL ++FLR + DTLMQMGRWFGYR
Sbjct: 531 IRLVYPSSVNATKLAPAFIVIGGNTLSRGLTIEGLVSTFFLRNTNQADTLMQMGRWFGYR 590
Query: 594 PGYVDLCRLFTSEELNEWFRHITIASEELRGEFNYLAESGGTPENYALKVRTHPGS--LQ 651
GY R++ + E F+ ++ ELR +E TP A K++ P ++
Sbjct: 591 KGYEIFPRVWMEYDAYERFQFLSQLDYELRANLAEYSERNLTPIEVAPKIKNSPDYQLVK 650
Query: 652 ITSVSKMRYAKQISVSWAGRLIESYQLPMDKGIKKRNLIATDNFISAQGMPEKKG---NN 708
ITSV+KM+ A +++G ++ D N T F+++ G P+K +
Sbjct: 651 ITSVNKMQSAISAEFNFSGFNSQTIYFKDDTEQLVHNFRLTKKFLNSLGAPQKSALSDSK 710
Query: 709 YLWRNVSPDDVCDYLSKFKVANSLKKVDLDMISNYIQELVKKGE-LTSWSVVVMNKNQ 765
+W+ V V +L +++V + + + + I+N I+ L + + W++V+ +K +
Sbjct: 711 LVWKEVDSVQVEKFLQEYQVIS--EDIRMATINNLIEWLKENNHTINDWNIVLSSKGR 766
>gi|23100795|ref|NP_694262.1| hypothetical protein OB3340 [Oceanobacillus iheyensis HTE831]
gi|22779029|dbj|BAC15296.1| hypothetical protein [Oceanobacillus iheyensis HTE831]
Length = 927
Score = 225 bits (574), Expect = 9e-57, Method: Composition-based stats.
Identities = 219/725 (30%), Positives = 338/725 (46%), Gaps = 87/725 (12%)
Query: 83 SKWEFWMRYAEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQRNEIQISKKGLVVGQVQS 142
SK W +Y +L+K+K ++ I L+E TD +L +L N +R ++ +KGLVVG VQS
Sbjct: 106 SKGSNWYKYKLHLKKNKGWSDESISNLEESTDWILKRL-NTKRRSLEDIRKGLVVGNVQS 164
Query: 143 GKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYERAYTMNNTT 202
GKTA+ GL+ A+D GFN I+++G+ +NL+ QT+ R+ + +N+
Sbjct: 165 GKTASMAGLMAAASDHGFNFFIIMSGMISNLKKQTEERLID-------------DLNS-- 209
Query: 203 KIGVGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFN---VPQPILLVVKKNASVLN 259
D+ + SY S+ +R+ A VV KN + L
Sbjct: 210 ----------DDNVDGSYWESITNIYNKNRSHELARLKLEDNLRSDKYFTVVLKNTNHLA 259
Query: 260 RLYKWLQTQTINEKITNKSLLIIDDEADNASINT------NRKELDPTTINRNICSIISL 313
L WL+ K N +LIIDDE+D A INT + ++ D +TIN+ + I++
Sbjct: 260 GLITWLKKDKEKMKKLN--VLIIDDESDQAGINTKLMDFASDEDFDRSTINKRLIDIVNF 317
Query: 314 FNRSA------YVGYTATPFANIFIPQNEDDLFPRDFIINIPAPTNYIGPEKVFGTSIIP 367
N YV YTATP+ANI LFP FI ++ YIGP++++G P
Sbjct: 318 ANEGESFGAMNYVAYTATPYANILNENEFKSLFPHHFIHSLEPSPLYIGPKQIYGH---P 374
Query: 368 DDTNSDLLPIIYPIKDYDFFVPQGHKKDDDKPKFEDIPESLRIAVKCFIVTCAIRIARGQ 427
+DT SD I + + D K+ + + +P L A+ F+ +I +
Sbjct: 375 EDTESDNSLDIVNLINRDEI-----KEIREVEELSSLPYHLESALLWFVNGLSI-LRNAN 428
Query: 428 GTRHNSMLIHVSRFQMWQNHIKELVENLFNYYKHE--IEASDTAIYEEFRKILEDDTANY 485
+ SMLIH S+ +K L+ + FN E ++ + E K+ + D
Sbjct: 429 YKKPVSMLIHTSQKITDHESVKRLIVDWFNDITKEKFLKKCEIQFNNEKNKLSKKDFLKI 488
Query: 486 --KSYKTITNEIKESKFSDIDKNLSVHSWEEIKP-QLFKAVQKIEV-------KSINGTS 535
+ K IT +I FSD+ L E + +LF ++ S +
Sbjct: 489 LPEYEKEITIDIPNIDFSDLIPELEYIYKESLDSIKLFDEEREFSSGIHLCVDNSKHNLV 548
Query: 536 GDCLTY---YENEKNGISV------IAIGGDKLSRGLTLEGLSVSYFLRASKMYDTLMQM 586
D +Y Y N S+ I IGG LSRGLTLEGL +YF+R +K+ DTLMQM
Sbjct: 549 IDNESYRLAYPNHSQLKSMDKAPGFIVIGGATLSRGLTLEGLISTYFIRTTKIADTLMQM 608
Query: 587 GRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEELRGEFNYLAESGGTPENYALKVRTH 646
GRWFGYR GY + R++ S E E FR + E LR TP+ Y ++V
Sbjct: 609 GRWFGYRIGYELIPRIWMSNESLERFRFLVQVDENLRKIIKNYNIGDKTPKEYGMEVTNS 668
Query: 647 P--GSLQITSVSKMRYAKQISVSWAGRLIESYQLPMDKGIKKRNLIATDNFISAQGMPEK 704
P ++IT ++ + A++ +++ G + D I K N+ T+ F + G +
Sbjct: 669 PDYNFMRITGKNRSQAAQETEINFTGYKPQDINFTNDSKILKHNIDLTEEFFADLG---R 725
Query: 705 KGNNYL-----WRNVSPDDV-CDYLSKFKVANSLKKVDLDMISNYIQELVKKGELTSWSV 758
NNYL W +V+ D V DYL KF+ NS ++ + I++ + SW+V
Sbjct: 726 SQNNYLNRAVFWESVALDRVFSDYLEKFQF-NSRTAAFNNLKA--IRKWCEDNRFLSWNV 782
Query: 759 VVMNK 763
+ K
Sbjct: 783 AAVGK 787
>gi|123967965|ref|YP_001008823.1| hypothetical protein A9601_04281 [Prochlorococcus marinus str.
AS9601]
gi|123198075|gb|ABM69716.1| Hypothetical protein A9601_04281 [Prochlorococcus marinus str.
AS9601]
Length = 901
Score = 216 bits (549), Expect = 8e-54, Method: Composition-based stats.
Identities = 227/861 (26%), Positives = 382/861 (44%), Gaps = 95/861 (11%)
Query: 45 KLKNDLLSMYSVKI-----DAFQILEGRERRD-PWLKDFRANKMSKWEF--------WMR 90
K KND + + K A + LE E + + K + K W F W
Sbjct: 38 KYKNDYIKFFDSKFIIELEAAKKFLEEDEIKALDYSKTLSSRKQESWYFGPKENHKNWNS 97
Query: 91 YAEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQRNEIQISKKGLVVGQVQSGKTANYTG 150
Y YL K + L+ T V++++ NP + KGLV+G VQSGKTAN G
Sbjct: 98 YKSYLLNVKNRTKESVEILNNETTEVVNQILNPYAPSGK-KIKGLVLGYVQSGKTANMAG 156
Query: 151 LICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYERAYTMN-NTTKIGVGLI 209
+I KAAD+G+ LIIVLAG+ LR+QTQ RI L + T N N + G+ I
Sbjct: 157 VIAKAADSGYKLIIVLAGLTKALRAQTQARIQNDILQHNDILWHPLTDNENDIEEGLDAI 216
Query: 210 PGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVPQPILLVVKKNASVLNRLYKWLQTQT 269
P F + + ++KKN ++ +L ++
Sbjct: 217 PIF------------------------------TKKTTICIIKKNVQIIIKLISKMKRMP 246
Query: 270 INEKITNKSLLIIDDEADNASINTNR---KELDPTTINRNICSIISLFNRSAYVGYTATP 326
E S LIIDDE D AS+NT K + ++ N+ + ++ F YVGYTATP
Sbjct: 247 KLE--MQASTLIIDDECDQASLNTKEYKDKANEISSTNKLLKQLLIDFKNVTYVGYTATP 304
Query: 327 FANIFI----PQNEDDLFPRDFIINIPAPTNYIGPEKVFGTSIIPDDTNSDLLPIIYPIK 382
A P L+P DFI + P +Y G K+F +I + LP I I
Sbjct: 305 NAPFLTHPKAPDGLQSLYPSDFITPLEEPADYFGVNKLFANNITNESGEDLSLPFIKRIP 364
Query: 383 DYDFFVPQGHKKDDDKPKFE-DIPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSR- 440
D + +KD P F+ + SLR A F++ + R R H M+IHVSR
Sbjct: 365 DSELESLTCKRKD--LPIFKPSLTNSLRDACDYFLLVLSARSLRNLKEDHCCMMIHVSRS 422
Query: 441 FQMWQNHIKELVENLFNYYKHEIEASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKF 500
+M + + + E F K +E D I + + +K + I S
Sbjct: 423 LKMHELYRNLIFEEWFIPIKIGLENKDKEIIDRLENL----------WKIESKAINSSVR 472
Query: 501 SDIDKNLSVHSWEEIKPQLFKAVQ--KIEVKSINGTSGDCLTYYENEKNG----ISVIAI 554
S+++ L + S+++I+ L + I V++ + ++ D ++++++ I I I
Sbjct: 473 SNLNCPLKIESFQKIEKNLLNELNDLSINVENSDESNFDERLEFKDKRDKDYKTIHSIVI 532
Query: 555 GGDKLSRGLTLEGLSVSYFLRASKMYDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRH 614
GGD LSRGLT++GL S+FLR +K DTLMQMGRWFG+R GY DL R++ + ++ F +
Sbjct: 533 GGDVLSRGLTIDGLVSSFFLRETKQDDTLMQMGRWFGFRYGYEDLPRIWMTYKVELDFTN 592
Query: 615 ITIASEELRGEFNYLAESGGTPENYALKVRTHPGSLQITSVSKMRYAKQISVSWAGRLIE 674
R + N++ TP ++ + V+ + + K+ S ++ GR +
Sbjct: 593 FVSGENFYREQINWMNSQNLTPSDFPVLVQMFEQRNPTSKGKISKKVKKSSWTFFGRELW 652
Query: 675 SYQLPMDKGIKKRNLIATDNFI-------SAQGMPEKKGNNYLWRNVSPDDVCDYLSKFK 727
+ + +D+ I K N A N I + + +K+ N ++ +++ + +L +
Sbjct: 653 TLRFKLDQQIHKDNEKAIINLIDECESSTQSHFVYKKRRNAHIIKDIGFKVLLKFLDNYN 712
Query: 728 VANSLKKVDLDMISNYIQELVKKGE--LTSWSVVVMNKNQPAVQYTFSNSIQAGCFDRNR 785
+ + + N+I ++ + +W++ + K ++ + +R +
Sbjct: 713 F--HFHEDLFNNVKNFINTDLENIDSIFKTWNIAI--KGGSYLKRDIGTVLNVKTMNRTK 768
Query: 786 AED-TNWNTYYIRKNHIVGNQTDEFIDLDEDLINAALERTRQRKAELNRGWDKEYPAPEI 844
++ T+ + I + + D +D+D+ IN E ++K G A +
Sbjct: 769 IQNGTSEDDLKIIDIKSLTSPDDLLVDIDQKDINEWDESEEKKKF----GKQLRSAAKKC 824
Query: 845 VRQEFRPRTNPLLLIYPLNPE 865
+ PR PLL+IYP++ +
Sbjct: 825 REKYLGPR--PLLIIYPISKD 843
>gi|88854625|ref|ZP_01129292.1| endonuclease [marine actinobacterium PHSC20C1]
gi|88816433|gb|EAR26288.1| endonuclease [marine actinobacterium PHSC20C1]
Length = 1017
Score = 203 bits (516), Expect = 5e-50, Method: Composition-based stats.
Identities = 181/645 (28%), Positives = 288/645 (44%), Gaps = 116/645 (17%)
Query: 68 ERRDPWLKDFRANKMSKWEFWMRYAEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQRNE 127
E DPW + A + + +F+ R + + K + + L++ T ++ +L +P E
Sbjct: 127 EHWDPW---YTAERQGENDFYWRAYRGVLERKGWDVDALTNLNKSTTDIVGRLSDPSA-E 182
Query: 128 IQISKKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLG 187
+ KGLVVG VQSGKTAN+TG++ KA D+G+ LIIVL G LR QTQ R+D +G
Sbjct: 183 VGYQTKGLVVGHVQSGKTANFTGIVAKAVDSGYRLIIVLTGTIELLRGQTQRRLDMELVG 242
Query: 188 FDTQYERAYTMNNTTKIGVGLIPGFDNAIANSYTTSLEKGDFTSRAAN------------ 235
+ E + T + I D + G F A +
Sbjct: 243 EENILEGVDRTDETAIRDIDYIGNQD--------VDWQNGKFLKHAVDIHSTPGVPAIKR 294
Query: 236 --TAGFNF-------------------NVPQPI------------LLVVKKNASVLNRLY 262
TAG ++ N +P+ + V+KKN + L L
Sbjct: 295 LTTAGKDYRKLRLGRDALDFRRNGELRNPAKPVYDPENIHSVDARIAVMKKNTTALKNLL 354
Query: 263 KWLQTQTINEKITNKSLLIIDDEADNASINT---------NRKELDPTTINRNICSIISL 313
L+ I +LIIDDEAD AS+NT + + IN+ I +++
Sbjct: 355 HDLRN--IRADAREIPVLIIDDEADQASVNTVNPRSKKAAAAESKKRSAINKLINDLLNT 412
Query: 314 FNRSAYVGYTATPFANIFI-PQNEDDLFPRDFIINIPAPTNYIGPEKVFGTSIIPDDTNS 372
R+ Y+GYTATPFAN+FI P + +D+FP+DFI+++ + Y+G + + +
Sbjct: 413 MPRAQYIGYTATPFANVFISPDDSEDVFPKDFIVSLDPSSEYMGGRQFHDLFGLDPEAKG 472
Query: 373 DLLPIIYPIKDYDFFVPQGHKKDDDKPKFEDIPESLRIAVKCFIVTCAIRIAR-----GQ 427
D + + FV + +P E + +R A+ F+++ I+ R G
Sbjct: 473 DA-----SLSNEAAFVRDLRADGESEPDEER--QEIRNAIDAFVLSGGIKKWREAQDDGF 525
Query: 428 GTRHNSMLIHVSRFQMWQNHIKELVENLFNYYKHEIEASDTAIYEEFRKILEDDTANYKS 487
RH++ML+H S + + +++N + S + R++ +D
Sbjct: 526 SYRHHTMLVHESIRVAEHADLADTFRSVWNLGGY----SSPSGLARLRELYHED------ 575
Query: 488 YKTITNEIKE-SKFSDIDKNLSVHSWEEIKPQLFKAVQKIEVKS-----INGTSGDCLTY 541
+ +T +E + D +++++K + AV I S +NG+
Sbjct: 576 FLRVTESRQEWGRLPMPD------TFDDLKAYIGLAVDAITAASDPVVVVNGSKDSDYNA 629
Query: 542 YENEKNGISVIAIGGDKLSRGLTLEGLSVSYFLRASKMYDTLMQMGRWFGYRPGYVDLCR 601
+ I +GG KLSRG T+EGL++SY+ R + DTLMQMGRWFGYRPGY DL R
Sbjct: 630 MNFQVAPYWRIMVGGAKLSRGFTVEGLTISYYRRRTAAADTLMQMGRWFGYRPGYNDLVR 689
Query: 602 LF----------TSEELNEWFRHITIASEELRGE---FNYLAESG 633
L+ S +L + F I EE R + F+ L E G
Sbjct: 690 LYIGRDVLDGKKKSYDLYDAFTSIIEDEEEFRAQLRKFSELTEDG 734
>gi|115376475|ref|ZP_01463710.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
gi|115366543|gb|EAU65543.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
Length = 722
Score = 145 bits (365), Expect = 2e-32, Method: Composition-based stats.
Identities = 143/559 (25%), Positives = 237/559 (42%), Gaps = 89/559 (15%)
Query: 92 AEYLEKHKKFAPSVILQLDELTDRVLDKLFNPQRN------EIQISKKGLVVGQVQSGKT 145
+YLE+ K L+ E RV D RN ++ GL +G VQSGKT
Sbjct: 30 GQYLERIGK------LKTSEAVTRVRDDALAIVRNCRPFTSATDDTRTGLAIGYVQSGKT 83
Query: 146 ANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYERAYTMNNTTKIG 205
+ T + A D G +II+LAG+ NL Q +R + + +N+ G
Sbjct: 84 MSMTTVSALARDNGCRIIILLAGVTTNLLQQNADRFQKDLREASGRPSAWRILNSAKGFG 143
Query: 206 VGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVPQPILLVVKKNASVLNRLYKWL 265
+ A + S + D + +L +V KN + L+ L L
Sbjct: 144 EHAVRDLQRAAEDWRDGSFREDD---------------KKTLLYLVLKNHTHLDGLASLL 188
Query: 266 QTQTINEKITNKSL--LIIDDEADNASINTNRKELDPTTINRNICSIISLFNRSAYVGYT 323
K+ + + LI+DDEAD A +NT + +T + I + + Y+ YT
Sbjct: 189 S------KVDLRGIPALILDDEADQAGLNTKPNAAEASTTYKRIARVRAALPNHTYLQYT 242
Query: 324 ATPFANIFIPQNEDDLFPRDFIINIPAPTNYIGPEKVFGTSIIPDDTNSDLLPIIYPIKD 383
ATP A + I DD+ F + Y G + FG P +++ I
Sbjct: 243 ATPQAPLLIAL--DDMLSPAFAELVRPGDGYTGGQTFFGVDANP--------ALVHAIPA 292
Query: 384 YDFFVPQGHKKDDDKPKFEDIPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSRFQM 443
+D D E+ PE+L A++ F V A+ R Q R SMLIH S+ Q
Sbjct: 293 HDL----------DVSATEEPPETLLSAMRVFFVGAAVGGLRNQ-PRPISMLIHPSQRQD 341
Query: 444 WQNHIKELVENLFNYYKHEIEASD----TAIYEEFRKILEDDTANYKSYKTITNEIKESK 499
V+ + + + + D T EFR +D A
Sbjct: 342 DHKRYLAWVQAIIGRWTTGLRSQDNDERTDTLAEFRPAYDDLRAT--------------- 386
Query: 500 FSDIDKNLSVHSWEEIKPQLFKAVQKIEVKSINGTSGDCLTYYENEKNGISVIAIGGDKL 559
+ ++ +++E+ QL + V + + +N ++G+ ++N + + I +GG+KL
Sbjct: 387 ------DPTLPTFDELIVQLRRCVSDVHLSEVNSSNGNDQVDWDNAE---THILVGGEKL 437
Query: 560 SRGLTLEGLSVSYFLRASKMY--DTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITI 617
+RG T+EGL+V+Y R++ + DT+ Q R+FGY+ Y+ LCRL+ ++ +R
Sbjct: 438 NRGFTVEGLTVTYMPRSAGDWNADTIQQRARFFGYKQAYLSLCRLYLHPDVIHAYRSYVR 497
Query: 618 ASEELRGEFNYLAESGGTP 636
E++R + LA+ G P
Sbjct: 498 HEEDVRTQ---LAQHRGRP 513
>gi|116748759|ref|YP_845446.1| hypothetical protein Sfum_1320 [Syntrophobacter fumaroxidans MPOB]
gi|116697823|gb|ABK17011.1| hypothetical protein Sfum_1320 [Syntrophobacter fumaroxidans MPOB]
Length = 702
Score = 128 bits (322), Expect = 2e-27, Method: Composition-based stats.
Identities = 131/509 (25%), Positives = 212/509 (41%), Gaps = 76/509 (14%)
Query: 131 SKKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDT 190
++ GLV G VQSGKTA+ T + A D G+ +II++AG NL +Q + R++
Sbjct: 52 ARTGLVCGYVQSGKTASMTAVSALAKDNGYRIIILIAGTTTNLVAQNRERLETHLRKAAP 111
Query: 191 QYERAYTMNNTTKIGVGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVPQPILLV 250
++ N + I + Y ++ + + +
Sbjct: 112 EWSWLMLTNPRLRRNRQDIEPLAQEWRSEYYEEDDR------------------RTLFIS 153
Query: 251 VKKNASVLNRLYKWLQTQTINEKITNKSLLIIDDEADNASINTNRKELDPTTINRNICSI 310
V KN + L +L + L+ ++ +I DDEAD AS+NT EL ++ R I +
Sbjct: 154 VMKNHTHLQKLAELLEAV----DLSGFPAIIFDDEADQASLNTQPLELTASSTYRTIDEL 209
Query: 311 ISLFNRSAYVGYTATPFANIFIPQNEDDLFPRDFIINIPAPTNYIGPEKVFGTSIIPDDT 370
Y+ YTATP A + I + D DF + YIG ++F +
Sbjct: 210 RRSLPFHTYLQYTATPQAPLLITR--IDSLSADFAELVSPGEGYIGGRELFQSP------ 261
Query: 371 NSDLLPIIYPIKDYDFFVPQGHKKDDDKPKFEDIPESLRIAVKCFIVTCAIRIARGQGTR 430
+S++ I PQ D E P SL A++ F V A G
Sbjct: 262 SSNVRRI-----------PQSEIYSDGNLPVEP-PRSLISAMQAFFVGVAAG-RMGPMNE 308
Query: 431 HNSMLIHVSRFQMWQNHIKELVENLFNYYKHEIEASD---TAIYEEFRKILEDDTANYKS 487
H SMLIH S+ M L +Y+ + S + EEF + +D
Sbjct: 309 HRSMLIHPSKATMTHRQYYNWANGLKSYWHQVLLTSGPDREELLEEFHGVHQD------- 361
Query: 488 YKTITNEIKESKFSDIDKNLSVHSWEEIKPQLFKAVQKIEVKSINGTSGDCLTYYENEKN 547
+D L V +EE++ +L A+ + + +N G + + N
Sbjct: 362 ------------LADTCDQLPV--FEEVQAKLPVAINQTFITLVNSVDGREVPW----GN 403
Query: 548 GISVIAIGGDKLSRGLTLEGLSVSYFLRASKMY--DTLMQMGRWFGYRPGYVDLCRLFTS 605
G S I +GG+KL RG T++GL+V+Y R+ + DT+ Q R+FGY Y+ CR++
Sbjct: 404 GYSHILVGGEKLGRGYTIKGLTVTYMPRSPGGWTADTIQQRARFFGYHSRYLGYCRIYLH 463
Query: 606 EELNEWFRHITIASEELRGEFNYLAESGG 634
+++ + E++R LAE G
Sbjct: 464 PDVHTAYSAYVSHEEDMRSR---LAEHAG 489
>gi|113939920|ref|ZP_01425766.1| hypothetical protein HaurDRAFT_1938 [Herpetosiphon aurantiacus ATCC
23779]
gi|113898479|gb|EAU17495.1| hypothetical protein HaurDRAFT_1938 [Herpetosiphon aurantiacus ATCC
23779]
Length = 691
Score = 114 bits (284), Expect = 5e-23, Method: Composition-based stats.
Identities = 146/552 (26%), Positives = 233/552 (42%), Gaps = 121/552 (21%)
Query: 110 DELTDRVLDKLFN-PQRNEIQISKKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAG 168
D+L + +L+ + P E ++ GL++G +QSGKT +T I AAD G+ L I+L
Sbjct: 30 DKLINNILELIQELPSAEEGSQNRHGLLLGYIQSGKTFAFTTAIALAADNGYRLFIILTS 89
Query: 169 IHNNLRSQTQNRIDEGFL---------GFDTQYERAYTMNNTTKIGVGLIPGFDNAIANS 219
+ L +QT IDE G D+ ++ M T K G++
Sbjct: 90 NNLILYNQT---IDERLKQDLQSIEVEGKDSWEQKILMMTQTLKDPKGVL---------- 136
Query: 220 YTTSLEKGDFTSRAANTAGFNFNVPQPILLVVKKNASVLNRLYKWLQTQTINEKITNKSL 279
+LV KN ++L +L + L+T K+
Sbjct: 137 ----------------------------VLVTTKNTAILYKLEQTLRTIQEELKMGLPIA 168
Query: 280 LIIDDEADNASINTN--RKELDPTTINRNICSIISLFNR----SAYVGYTATPFANIFIP 333
LIIDDEAD ++TN R+ ++P S I R + TATP A +F+
Sbjct: 169 LIIDDEADEGGLDTNTRRRSVNPLIEAGPTFSAIEEIRRLVPNHVRLQVTATPQA-LFLQ 227
Query: 334 QNEDDLFPRDFIINIPAPTNYIGPEKVFGTSIIPD---------DTNSDLLPIIYPIKDY 384
+ + P F + + +Y+G E+ F D + II I +
Sbjct: 228 DSGHESRP-GFTVLLEPGADYVGSEQFFALKQEIDMIYENDDENELEERKSKIIRRIDQH 286
Query: 385 DFFVPQGHKKDDDKPKFEDIPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSR---- 440
D + + D IP+SLR A+ F + I+I TR S L H+S
Sbjct: 287 DIHMMIEQEGDS-------IPDSLRDALLTFYIGATIKIVDEPSTRF-SFLCHISARKAD 338
Query: 441 ----FQMWQNHIKELVENLFNYYKHEIEASDTAIYEEFRKILEDDTANYK---SYKTITN 493
Q+ +I L ++L +Y + I + D E KI D + Y+ S TI N
Sbjct: 339 HDKISQIINKYIGVLRKSLIDYVDNNITSEDIYYLE---KIYTDIISTYEDGISLGTIIN 395
Query: 494 EIKESKFSDIDKNLSVHSWEEIKPQLFKAVQKIEVKSINGTSGDCLTYYENEKNGISVIA 553
E++ES + K ++ IN + T Y+ +G I
Sbjct: 396 ELRES------------------------IIKTDISVINSS-----TTYQPTYSGKYNIF 426
Query: 554 IGGDKLSRGLTLEGLSVSYFLRASKM--YDTLMQMGRWFGYRPGYVDLCRLFTSEELNEW 611
IGG K++RG+T++ L V+Y+ R K+ DT++Q R +GYR ++D+ RLF +EE+ +
Sbjct: 427 IGGTKIARGVTIKNLIVTYYGRQPKVTNMDTMLQHARMYGYRKNHMDVTRLFITEEIEKR 486
Query: 612 FRHITIASEELR 623
F I + + LR
Sbjct: 487 FTVIYESEKALR 498
>gi|119962574|ref|YP_946950.1| hypothetical protein AAur_1164 [Arthrobacter aurescens TC1]
gi|119949433|gb|ABM08344.1| conserved domain protein [Arthrobacter aurescens TC1]
Length = 942
Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats.
Identities = 158/584 (27%), Positives = 240/584 (41%), Gaps = 104/584 (17%)
Query: 132 KKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRI----DEGFLG 187
++GLV+G VQSGKTA+ G+ A D G N++I+LAG +L QT R+ D G
Sbjct: 80 RRGLVMGSVQSGKTASMLGVAALALDHGVNIVIILAGTRLSLWRQTFGRLRGQLDSGPAS 139
Query: 188 FDTQYERAYTM-NNTTKIGVGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVPQP 246
+ + R + + G +P +A Y + RA + QP
Sbjct: 140 IEKERRRILVPPSGSEDYASGSVP-----LATLYRFQPAQ---IRRALHHR-------QP 184
Query: 247 ILLVVKKNASVLNRLYKWLQTQ---TINEKITNKSLLIIDDEADNASINTNRKELDPTTI 303
+++V K L L + L+ TI T LL++DDEAD+ SI R E I
Sbjct: 185 LIVVAMKQTDHLRALARSLRESVYPTIRSLDTPVHLLVLDDEADDGSILDARVEASEDPI 244
Query: 304 NRNICSII-----------------SLFNRSAYVGYTATPFANIFIPQNEDDLFPRDFII 346
N+ I +LF + YVGYTATP AN F+ + + L PR+F++
Sbjct: 245 YGNLKQIPRAIADLWAPPSLQTVPRNLF--ATYVGYTATPQAN-FLQEEHNPLAPREFVL 301
Query: 347 NIPAPTNYIGPEKVFGTSIIPDDTNSDLLPIIYPIKDY----DFFVPQGHKKDDDKPKFE 402
++ P + T P + +Y + F +G
Sbjct: 302 SLRTPLDRGELRPRSSTYFEPTG-----------LANYYTGGEVFYRRGTPAGLCVATSN 350
Query: 403 DIPESLRIAVKCFIVTCAIRIARGQ---GTRHNSMLIHVSRFQMWQN---------HIKE 450
++ E AV+ F+V AIR R G S SR ++ H
Sbjct: 351 NLDEDRADAVRAFLVAGAIRTLRASSRLGPTTASSQTFASREEVLAAIPPIHSMLVHPSA 410
Query: 451 LVENLFNYYKHEIEASDTAIYEEFRKIL----------------EDDT--ANYKSYKTIT 492
LV+ F +E + EE I+ E+D A + Y++ +
Sbjct: 411 LVDEQFQEAARILEWAGARSPEESESIMNSGGYLPEILAAKVDEEEDKWQAWVERYRS-S 469
Query: 493 NEIKESKFSDIDKNLSVHSWEEIKPQLFKAV-QKIEVKSINGTSG-DCLTYYE------- 543
E FS N ++ SW EI+ L + + V +N G D YE
Sbjct: 470 AEAVHFAFSTPTPN-AIPSWSEIRGALIQEIIPNTRVAVVNSDPGADDRPEYEPWCDADG 528
Query: 544 --NEKNGISVIAIGGDKLSRGLTLEGLSVSYFLRASK--MYDTLMQMGRWFGYRPGYVDL 599
+ +S I + G+ +SRGLTLEGL+ + FLR S + D+ MQM RWFGYR Y++L
Sbjct: 529 NWHAPRDLSTIFVSGNVMSRGLTLEGLTTTLFLRTSNQALADSQMQMQRWFGYRGAYIEL 588
Query: 600 CRLFTSEELNEWFRHITIASEELRGEF-NYLAESGGTPENYALK 642
CR+F + ++F E LR + + E P+ L+
Sbjct: 589 CRIFAPQPQLDFFTAYHEVDEALRQILTDAMQEDASAPDPVVLQ 632
>gi|23127883|ref|ZP_00109742.1| COG0468: RecA/RadA recombinase [Nostoc punctiforme PCC 73102]
Length = 724
Score = 106 bits (264), Expect = 1e-20, Method: Composition-based stats.
Identities = 138/506 (27%), Positives = 207/506 (40%), Gaps = 85/506 (16%)
Query: 134 GLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYE 193
GL+ G+VQSGKT + A + F I+L + L QT NR
Sbjct: 80 GLIYGRVQSGKTNTTIATLALAHENNFCCFIILTSDNTWLGKQTANR------------- 126
Query: 194 RAYTMNNTTKIGVGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVPQPILLVVKK 253
NN + G FD S + +T ++LV K
Sbjct: 127 ----FNNQVQGGPVF---FDWEAWKKDPDSFAETKIVPYIKDTG---------VVLVSTK 170
Query: 254 NASVLNRLYKWLQTQTINEKITNKSLLIIDDEADNASINTN--------RKELDPTTINR 305
N L+ L K L+ K+ LI DDEADNAS+NTN + +D + I
Sbjct: 171 NGHHLDNLLKVLKAA----KVRGVPTLIFDDEADNASLNTNESKQAKKGKDAIDDSKIFE 226
Query: 306 NICSIISLFNRSAYVGYTATPFANIFIPQNEDDLFPRDFIINIPAPTN-YIGPEKVFGTS 364
I I Y+ TATP + + Q+ F +P P + Y+G E F
Sbjct: 227 TIGKIRQEVANHIYIQITATPQS--LLLQSLPHFCKPKFCAALPEPGDSYMGGELFFADK 284
Query: 365 ----IIPDDTNSDLLPIIYPIKDYDFFVPQGHKKDDDKPKFEDIPESLRIAVKCFIVTCA 420
I D I +K + G+K IP+ LR+A+ CF +
Sbjct: 285 SKYCCIVDSGE------INQLKKQKGAINPGNK--------WIIPDGLRLALCCFFLGSI 330
Query: 421 IR-IARGQGTRHNSMLIHVSRFQMWQNHIKELVENLFNYYKHEIEASDTAIYEEFRKILE 479
+ ++ G+ S L HV ++ I ++E + + + + D AI E+
Sbjct: 331 YKMLSSGKEDEKYSFLAHVC----YKQDIHSVLEKVISQFVINL---DQAIREK-----S 378
Query: 480 DDTANYKSYKTITNEIKESKFSDIDKNLSVHSWEEIKPQLFKAVQKIEVKSINGTSGDCL 539
DT N ++ K + E K + + +E+K QL A+ KI IN + +
Sbjct: 379 SDTENKQARKWLEQAYDELKKTADNLPPITELIDELKIQLRHAIPKI----INANNPEKE 434
Query: 540 TYYENEKNGISVIAIGGDKLSRGLTLEGLSVSYFLRASK--MYDTLMQMGRWFGYRPGYV 597
Y N I IGG++L RG+T+ GL V+Y+ R +K + DT+ Q R +GYRP
Sbjct: 435 PNYNPGMN----ILIGGNRLGRGVTINGLMVTYYGRDAKQKVMDTVHQHARMYGYRPQLK 490
Query: 598 DLCRLFTSEELNEWFRHITIASEELR 623
D+ RLF E + + FR I A E +R
Sbjct: 491 DVTRLFLPEHILDAFRSIHEADEGMR 516
>gi|19552980|ref|NP_600982.1| stress-sensitive restriction system protein 2 [Corynebacterium
glutamicum ATCC 13032]
gi|62390657|ref|YP_226059.1| RESTRICTION ENDONUCLEASE CGLIIR PROTEIN [Corynebacterium glutamicum
ATCC 13032]
gi|549844|gb|AAC00044.1| This orf may encode a typeI or typeIII restriction endonuclease
which is stress-sensitive and ATP-dependent. It contains
a typical ATP binding region (Walker motif)
gi|21324547|dbj|BAB99171.1| Stress-sensitive restriction system protein 2 [Corynebacterium
glutamicum ATCC 13032]
gi|41325995|emb|CAF20158.1| RESTRICTION ENDONUCLEASE CGLIIR PROTEIN [Corynebacterium glutamicum
ATCC 13032]
Length = 632
Score = 100 bits (249), Expect = 5e-19, Method: Composition-based stats.
Identities = 115/501 (22%), Positives = 204/501 (40%), Gaps = 108/501 (21%)
Query: 135 LVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYER 194
L+ G VQSGKT++ G+I D+ F+ I++L + L QT +R+ + F
Sbjct: 48 LLYGDVQSGKTSHMLGIIADCLDSTFHTIVILTSPNTRLVQQTYDRVAQAF--------- 98
Query: 195 AYTMNNTTKIGVGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVPQPILLVVKKN 254
T ++ + AN P+ ++VV K
Sbjct: 99 ------------------------PDTLVCDRDGYNDFRANQKSLT---PRKSIVVVGKI 131
Query: 255 ASVLNRLYKWLQTQTINEKITNKSLLIIDDEADNASINTNRKELDPTTINRNICSIISLF 314
+VL WL+ + ++ +LIIDDEAD S+NT + D +TIN + SI L
Sbjct: 132 PAVLG---NWLRVFNDSGALSGHPVLIIDDEADATSLNTKVNQSDVSTINHQLTSIRDLA 188
Query: 315 NRSAYVGYTATPFANIFIPQNEDDLFPRDFIINIPAPTNYIGPEKVFGTSIIPDDTNSDL 374
Y+ T TP A + Q++D + + +++ +YIG + F + N+
Sbjct: 189 TGCIYLQVTGTPQAVLL--QSDDSNWAAEHVLHFAPGESYIGGQLFF------SELNNPY 240
Query: 375 LPIIYPIKDYDFFVPQGHKKDDDKPKFEDIPESLRIAVKCFIVTCAIRIARGQGTRHNSM 434
L + + + D+ +F D A+ +++T A+ RG+ +M
Sbjct: 241 LRLF------------ANTQFDEDSRFSD-------AIYTYLLTAALFKLRGESL--CTM 279
Query: 435 LIH-----VSRFQMWQNHIKELVENLFNYYKHEIEASDTAIYEEFRKILEDDTANYKSYK 489
LIH S Q +L +Y+ I+ + YE+ L +N +
Sbjct: 280 LIHPSHTASSHRDFAQEARLQLTFAFERFYEPMIQHNFQRAYEQ----LAQTDSNLPPLR 335
Query: 490 TITNEIKESKFSDIDKNLSVHSWEEIKPQLFKAVQKIEVKSINGTSGDCLTYYENEKNGI 549
I N + ++ + S+H P T E+ +G
Sbjct: 336 KILNIL-----GGMEDDFSIHIVNSDNP----------------------TVEEDWADGY 368
Query: 550 SVIAIGGDKLSRGLTLEGLSVSYFLRASK--MYDTLMQMGRWFGYRPGYVDLCRLFTSEE 607
++I +GG+ L RGLT L +++R SK DTL Q R FGY+ + D R+F
Sbjct: 369 NII-VGGNSLGRGLTFNNLQTVFYVRESKRPQADTLWQHARMFGYKR-HKDTMRVFMPAT 426
Query: 608 LNEWFRHITIASEELRGEFNY 628
+ + F+ + + +E ++ + ++
Sbjct: 427 IAQTFQEVYLGNEAIKNQLDH 447
>gi|26554430|ref|NP_758364.1| hypothetical protein MYPE9820 [Mycoplasma penetrans HF-2]
gi|26454440|dbj|BAC44768.1| conserved hypothetical protein [Mycoplasma penetrans HF-2]
Length = 721
Score = 96.3 bits (238), Expect = 8e-18, Method: Composition-based stats.
Identities = 136/560 (24%), Positives = 227/560 (40%), Gaps = 111/560 (19%)
Query: 131 SKKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFDT 190
+K L+VG+VQSGKT+ A D F +I+L N L QT R F
Sbjct: 67 NKNVLLVGKVQSGKTSFLEMFTALALDNEFQCVILLGATDNKLLGQTNERFINTFTKNIK 126
Query: 191 QYE-RAYTMNNTTKIGVGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVPQPILL 249
+ Y+M+ + ++ + S++ +F S +N P+
Sbjct: 127 DVDGELYSMD---------VKDWNKKPYFTLADSIDSTNFYSLVSNKI--------PLFF 169
Query: 250 VVKKNASVLNRLYKWLQTQTINEKITNKSLLIIDDEADNASINTNRKELDPTTINRNICS 309
V K+ ++++ +++ E K+L IIDDE D AS+N E + T I
Sbjct: 170 VSLKSKKQIDKVCDFIEKNIDGEGNFLKTL-IIDDEGDQASLNNKFTENEITATYGAISR 228
Query: 310 IISLFNRSAYVGYTATPFANIFIPQNEDDLFPRDFIINIPAPTNYIGPEKVFGTSIIPDD 369
+ L Y+ TATP AN+ + GTS+ P
Sbjct: 229 MKELLKDHLYLSITATPHANVLLT----------------------------GTSLKPG- 259
Query: 370 TNSDLLPIIYPIKDY---DFFVPQGHKKDDDKPKFEDIPESLRIAVKCFIVTCAIRIARG 426
+L +IYP Y D F H DD F IP + ++
Sbjct: 260 ----ILRLIYPADGYNGSDTF----HMADDSY--FITIPNDEKENIQ------------- 296
Query: 427 QGTRHNSMLIHVSRFQMWQNHIKELVENLFNYYKHEIEASDTAIYEEFRKI--------L 478
+G S+L FQ+ +K +E + Y S+ ++ + +KI +
Sbjct: 297 EGILTKSVLNAFYYFQIASAILK--IEGISEY-------SEMIVHNDLKKINHKNLLNSI 347
Query: 479 EDDTANYKSY-KTITNEIKESKFSDI----DKN------LSVHSWEEIKPQLFKAVQKIE 527
E+ T N K Y K E ES F ++ +KN L + + ++K + K + +
Sbjct: 348 ENKTNNLKEYCKKNNKEALESHFFELKSVYNKNYFNESILDKYKFNDLKDVIKKVIIDTQ 407
Query: 528 VKSINGTSGDCLTYYENEKNGISVIAIGGDKLSRGLTLEGLSVSYFLRASKM---YDTLM 584
V N + D + N+++ I IGG+ + RG+T + L ++F R K DT +
Sbjct: 408 VILFNSDAKD----FSNKEHMFHRIYIGGNLMQRGITFDYLITTFFTRWPKKGGNMDTTL 463
Query: 585 QMGRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEELRGEFNYLAESGGTPENYALKVR 644
Q RWFGYR Y+ +C++F + + FR++T EL +F + + + ++ +K
Sbjct: 464 QRARWFGYREKYLFVCKIFMPDSAQKEFRNLTETDTELWEQFQLVQTNALSLDSIVIK-- 521
Query: 645 THPGSLQITSVSKMRYAKQI 664
T+ L T R+ K I
Sbjct: 522 TNSKELNPTRRGVARWMKNI 541
>gi|23100677|ref|NP_694144.1| hypothetical protein OB3222 [Oceanobacillus iheyensis HTE831]
gi|22778911|dbj|BAC15178.1| hypothetical protein [Oceanobacillus iheyensis HTE831]
Length = 729
Score = 92.8 bits (229), Expect = 1e-16, Method: Composition-based stats.
Identities = 145/580 (25%), Positives = 243/580 (41%), Gaps = 104/580 (17%)
Query: 130 ISKKGLVVGQVQSGKTANYTGLICKAADAGFNLIIVLAGIHNNLRSQTQNRIDEGFLGFD 189
+++ G+++G++QSGKT + GL A D G+++IIVL L QT R+ + F G +
Sbjct: 44 MNRPGMLLGKIQSGKTRTFIGLSGLAMDNGYDVIIVLTKGTRALARQTLQRLYQEFEGIE 103
Query: 190 TQYERAYTMNNTTKIGVGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVPQPILL 249
+ L+ D S N + + Q ++
Sbjct: 104 DE------------------------------DVLQIHDIMSMPNNL--IEYELQQKQMI 131
Query: 250 VVKKNASVLNRLYKWLQTQTINEKITNKSLLIIDDEADNASINTNRKELDPTTINRNICS 309
VVKK + L RL + L + K L+IDDEAD ASI ++ + + T IN
Sbjct: 132 VVKKETNNLKRLEEVLSVSY--PALATKKTLVIDDEADFASIGFSKTKREVTEINVIAGQ 189
Query: 310 IISLFNRSAYVGY---TATPFANIFIPQN-----EDDLF---PRDFIINIPAPTNYIGPE 358
I L A V + TATP++ P +F F +P P YIG +
Sbjct: 190 IDRLRRNLADVSFLQVTATPYSLYLQPDTLKVDGSSKVFQPVKPAFTELVPVPNEYIGGK 249
Query: 359 KVFGTSIIPDDTNSDLLPIIYPIKDYDFFVPQGHKKDDDKPKFEDI-----PESLRIAVK 413
F S D +S I PI + + + K+D + K E+ SLR ++
Sbjct: 250 YYFEESEAGDSISSH---IYEPINEDELMALK--KQDRRRFKLEEALHSNKISSLRNSII 304
Query: 414 CFIVTCAIR--IARGQGTRHN--SMLIHVSRFQMWQNHIKELVENLFNYYKHEIEASDTA 469
F+V +R + QG R N S +IH + ++ HE + +T
Sbjct: 305 HFVVGGVMRRIQQKQQGRRMNKYSFIIHTEQAKL----------------SHEWQ--ETI 346
Query: 470 IYEEFRKILE--DDTANYKSYKTITNEIKESKFSDIDKNL-----SVHSWEEIKPQLFKA 522
I E R++ + DD K + N I+ S F ++ K+L + S+ E++ + A
Sbjct: 347 ILEIKRQLQQAADDQP-----KLLLNLIRAS-FENMKKSLMMLGGDIPSFAEVQFESLWA 400
Query: 523 VQK--IEVKSINGTSGDCLTYYEN----EKNGISVIAIGGDKLSRGLTLEGLSVSYFLRA 576
+QK + + +N + D +N + I IGG L RG+T+ L Y+ R
Sbjct: 401 LQKDHLLISKVNSET-DINQLLDNTGQLKLRAPLNIFIGGQILDRGITIANLIGFYYGRN 459
Query: 577 SKMY--DTLMQMGRWFGYRPGY-VDLCRLFTSEELNEWFRHITIASEELRGEFNYLAESG 633
+ + DT++Q R +G+RP + + R +T++E+ R I LR F E G
Sbjct: 460 PRRFQQDTVLQHSRMYGFRPKEDLAVTRFYTTKEIYHVMRKIYEFDTGLRKAF----EQG 515
Query: 634 GTPENYALKVRTHPGSLQITSVSKMRYAKQISVSWAGRLI 673
G + + + S +K+ +K ++ R++
Sbjct: 516 GQDQGVVFIQQDQQNEIIPCSPNKLLLSKTTTLRSYKRML 555
>gi|59800807|ref|YP_207519.1| putative stress-sensitive restriction system protein [Neisseria
gonorrhoeae FA 1090]
gi|59717702|gb|AAW89107.1| putative stress-sensitive restriction system protein [Neisseria
gonorrhoeae FA 1090]
Length = 629
Score = 89.0 bits (219), Expect = 2e-15, Method: Composition-based stats.
Identities = 135/528 (25%), Positives = 217/528 (41%), Gaps = 97/528 (18%)
Query: 103 PSVILQLDELTDRVLDKLFNPQRNEIQISKKGLVVGQVQSGKTANYTGLICKAADAG-FN 161
P + + D ++KL + E +I++ L++G VQSGKTA G++ AD G
Sbjct: 12 PELADSVKNTVDGFMEKL---SQTEPKIAQNVLLLGNVQSGKTAQVLGVLSALADDGDHK 68
Query: 162 LIIVLAGIHNNLRSQTQNRIDEGFLGFDTQYERAYTMNNTTKIGVGLIPGFDNAIANSYT 221
+ + L +L+ QT R F E A S+
Sbjct: 69 VFLYLTTDSVDLQDQTVKRAKANLKNFIVLSE---------------------ADDRSFM 107
Query: 222 TSLEKGDFTSRAANTAGFNFNVPQPILLVVKKNASVLNRLYKWLQTQTINEKITNKSLLI 281
+ +A N PIL+V+KKNA VL R +Q+ + L+I
Sbjct: 108 EVM-------KAEN----------PILVVIKKNARVLKRWRNLFASQS---SLKGYPLVI 147
Query: 282 IDDEADNASINTN--RKELDPTTINRNICSIISLFNRSAYVGYTATPFANIFIPQNEDDL 339
+DDEAD AS+NTN + D +TIN+ + I + +S ++ TATP ++ + E D
Sbjct: 148 VDDEADAASLNTNSDKPAKDASTINKLLNDIKNSCCQSLFIQLTATP-QSLLLQHEESDW 206
Query: 340 FPRDFIINIPAPTNYIGPEKVFGTSIIPDDTNSDLLPIIYPIKDYDFFVPQGHKKDDDKP 399
P +FI A YIG VF SD P Y ++ D + DD K
Sbjct: 207 QP-EFIHFFEAGEKYIGGNFVF----------SD--PPSYIVRFID------SELDDMKD 247
Query: 400 KFEDIPESLRIAVKCFIVTCAIRIARGQGTRHNSMLIHVSRFQMWQNHIKELVENLFNYY 459
+ +I E + A+ F++TCA A N L + Q Q
Sbjct: 248 ESGEIAEGAKQALLSFLITCA-EFALCDKANCNFALHPSYKIQDHQ-------------- 292
Query: 460 KHEIEASDTAIYEEFRKILEDDTANYKSYKTITNEIKESKFSDIDKNLSVHSWEEIKPQL 519
A ++ + L D + + + KES +H ++EI +L
Sbjct: 293 ---------AFSKKIQAFLNDLVQAVNNGEDLAGSFKESYLDLQKTKPDIHHFDEIYEKL 343
Query: 520 FKAVQKIEVKSINGTSGDCLTYYENEKNGISVIAIGGDKLSRGLTLEGLSVSYFLRASKM 579
++ ++ ++ S T ++ EK G ++I IGG+ + RGLT+ L Y+ R +K
Sbjct: 344 TALLENKQISTLVVNS-QTETDFDLEK-GFNII-IGGNVIGRGLTIPKLQTVYYSRTAKK 400
Query: 580 --YDTLMQMGRWFGYRPGYVDLCRLFTSEELNEWFRHITIASEELRGE 625
DT Q R FGY L RL+ ++ +F + A+ + G+
Sbjct: 401 PNADTFWQHSRIFGYDRDK-SLLRLYIPFDVYYFFVQLNQANNLIIGQ 447
>gi|46156670|ref|ZP_00204752.1| hypothetical protein Haso02000658 [Haemophilus somnus 2336]
Length = 238
Score = 68.6 bits (166), Expect = 2e-09, Method: Composition-based stats.
Identities = 72/249 (28%), Positives = 109/249 (43%), Gaps = 49/249 (19%)
Query: 127 EIQISKKGLVVGQVQSGKTANYTGLICKAADAG-FNLIIVLAGIHNNLRSQTQNRIDEGF 185
E++ ++ L++G VQSGKTA G++ AD G + + L +L+ QT R
Sbjct: 33 ELKKAQNVLLLGNVQSGKTAQVLGVLSALADDGDHKIFLYLTTDSVDLQQQTVKRAKASL 92
Query: 186 LGFDTQYERAYTMNNTTKIGVGLIPGFDNAIANSYTTSLEKGDFTSRAANTAGFNFNVPQ 245
F E + T + K D
Sbjct: 93 DKFIVLSEDD----------------------DKSFTQVMKAD----------------N 114
Query: 246 PILLVVKKNASVLNRLYKWLQTQTINEKITNKSLLIIDDEADNASINT--NRKELDPTTI 303
PIL+V+KKNASVL R ++Q E + L+I+DDEAD AS+NT N+ + D +T+
Sbjct: 115 PILVVIKKNASVLRRWRNLFKSQ---ENLKGYPLVIVDDEADAASLNTNVNKHDKDASTV 171
Query: 304 NRNICSIISLFNRSAYVGYTATPFANIFIPQNEDDLFPRDFIINIPAPTNYIGPEKVFG- 362
N+ + I + +S ++ TATP ++ + E + P DFI YIG VF
Sbjct: 172 NKLLNEIKNSCCQSLFIQLTATP-QSLLLQHLESNWQP-DFIHFFEPGEKYIGGNFVFSD 229
Query: 363 --TSIIPDD 369
+ I P D
Sbjct: 230 PPSYIFPSD 238
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.317 0.134 0.394
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,498,736,822
Number of Sequences: 5470121
Number of extensions: 153876187
Number of successful extensions: 396288
Number of sequences better than 1.0e-05: 38
Number of HSP's better than 0.0 without gapping: 28
Number of HSP's successfully gapped in prelim test: 10
Number of HSP's that attempted gapping in prelim test: 395965
Number of HSP's gapped (non-prelim): 74
length of query: 933
length of database: 1,894,087,724
effective HSP length: 142
effective length of query: 791
effective length of database: 1,117,330,542
effective search space: 883808458722
effective search space used: 883808458722
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 135 (56.6 bits)
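
Note on the statistics footer: the derived figures above can be cross-checked from the printed parameters. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bits = (lambda*S - ln K) / ln 2, E ~ search space * 2^-bits), that the first "Lambda K H" row is the ungapped set and the second the gapped set, and that X1/S1 use the ungapped parameters while X2/X3/S2 use the gapped ones. That pairing is an assumption, chosen because it reproduces the printed bit values. All function and constant names are hypothetical.

```python
import math

# Values copied from the report footer.
QUERY_LEN = 933
DB_LETTERS = 1_894_087_724
DB_SEQS = 5_470_121
EFF_HSP_LEN = 142

# (lambda, K) pairs; which footer row is gapped vs. ungapped is an assumption.
UNGAPPED = (0.317, 0.134)
GAPPED = (0.267, 0.0410)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # One HSP length is subtracted per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert an X-dropoff value to bits: lambda * X / ln 2 (no K term)."""
    return lam * raw_dropoff / math.log(2)


def expect(bits, search_space):
    """Approximate E-value for a bit score: search_space * 2 ** -bits."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
    )
    print("effective query length:", eff_q)
    print("effective database length:", eff_db)
    print("effective search space:", space)
    print("S2 = 135 in bits:", round(bit_score(135, *GAPPED), 1))
    print("X1 = 16 in bits:", round(dropoff_bits(16, UNGAPPED[0]), 1))
    # Rough E-value for the 114-bit hit; composition-based statistics may
    # shift the reported value slightly.
    print("E for 114 bits: %.1e" % expect(114, space))
```

The E-value from this simple formula should land in the same order of magnitude as the Expect values reported for each hit, not necessarily on the exact printed figure, since the report rounds bit scores and applies composition-based statistics.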