BLASTP 2.2.18 [Mar-02-2008]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= PGN_2004 hypothetical protein
(351 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
6,515,104 sequences; 2,222,278,849 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|34539927|ref|NP_904406.1| hypothetical protein PG0055 [P... 718 0.0
gi|150009348|ref|YP_001304091.1| hypothetical protein BDI_2... 157 1e-36
gi|167762899|ref|ZP_02435026.1| hypothetical protein BACSTE... 157 2e-36
gi|60682108|ref|YP_212252.1| hypothetical protein BF2629 [B... 156 3e-36
gi|53713899|ref|YP_099891.1| hypothetical protein BF2607 [B... 155 4e-36
gi|153806665|ref|ZP_01959333.1| hypothetical protein BACCAC... 151 9e-35
gi|150005295|ref|YP_001300039.1| hypothetical protein BVU_2... 145 4e-33
gi|160885694|ref|ZP_02066697.1| hypothetical protein BACOVA... 140 2e-31
gi|29345993|ref|NP_809496.1| hypothetical protein BT_0583 [... 139 2e-31
gi|160888143|ref|ZP_02069146.1| hypothetical protein BACUNI... 130 1e-28
gi|154490396|ref|ZP_02030657.1| hypothetical protein PARMER... 130 2e-28
gi|88806557|ref|ZP_01122074.1| hypothetical protein RB2501_... 114 1e-23
gi|120435346|ref|YP_861032.1| membrane protein containing P... 113 3e-23
gi|146297961|ref|YP_001192552.1| phage shock protein C, Psp... 112 3e-23
gi|149277005|ref|ZP_01883147.1| hypothetical protein PBAL39... 112 4e-23
gi|126661774|ref|ZP_01732773.1| hypothetical protein FBBAL3... 112 6e-23
gi|83857034|ref|ZP_00950562.1| hypothetical protein CA2559_... 110 2e-22
gi|163756282|ref|ZP_02163396.1| hypothetical protein KAOT1_... 108 8e-22
gi|89891646|ref|ZP_01203150.1| conserved hypothetical trans... 106 3e-21
gi|86133719|ref|ZP_01052301.1| hypothetical protein MED152_... 104 8e-21
gi|149372114|ref|ZP_01891384.1| hypothetical protein SCB49_... 102 4e-20
gi|163787924|ref|ZP_02182370.1| hypothetical protein FBALC1... 101 9e-20
gi|86131736|ref|ZP_01050333.1| hypothetical protein MED134_... 101 9e-20
gi|167752669|ref|ZP_02424796.1| hypothetical protein ALIPUT... 100 1e-19
gi|88802403|ref|ZP_01117930.1| hypothetical protein PI23P_0... 100 3e-19
gi|91214578|ref|ZP_01251551.1| hypothetical protein P700755... 99 6e-19
gi|150024770|ref|YP_001295596.1| hypothetical protein FP067... 93 3e-17
gi|169837695|ref|ZP_02870883.1| membrane protein containing... 93 3e-17
gi|88712084|ref|ZP_01106171.1| hypothetical protein FB2170_... 92 7e-17
gi|126646579|ref|ZP_01719089.1| hypothetical protein ALPR1_... 71 1e-10
gi|67941871|ref|ZP_00533857.1| PspC [Chlorobium phaeobacter... 65 1e-08
gi|167752665|ref|ZP_02424792.1| hypothetical protein ALIPUT... 62 6e-08
gi|160890553|ref|ZP_02071556.1| hypothetical protein BACUNI... 60 2e-07
gi|182416334|ref|YP_001821400.1| phage shock protein C, Psp... 55 6e-06
>gi|34539927|ref|NP_904406.1| hypothetical protein PG0055 [Porphyromonas gingivalis W83]
gi|34396238|gb|AAQ65305.1| conserved domain protein [Porphyromonas gingivalis W83]
Length = 351
Score = 718 bits (1853), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/351 (99%), Positives = 351/351 (100%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR
Sbjct: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA
Sbjct: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI
Sbjct: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
Query: 181 APPAITASQKLEMYGEPITVENIGKRVAADVDTGLHRRNASVLNTLGCVLKGLFVLVGIG 240
APPAITASQKLEMYGEPITVENIGKRVAADVDTGLHRRNASVLNTLGCVLKGLFVLVGIG
Sbjct: 181 APPAITASQKLEMYGEPITVENIGKRVAADVDTGLHRRNASVLNTLGCVLKGLFVLVGIG 240
Query: 241 LILAVVFGLFGLILELFADGYVAHAPLTSDWISIPSWMILCATISVFIIVAVPIMAIVFR 300
LILAVVFGLFGLILELFADGYVAHAPLTSDWISIPSWMILCATISVFIIVAVP+MAIVFR
Sbjct: 241 LILAVVFGLFGLILELFADGYVAHAPLTSDWISIPSWMILCATISVFIIVAVPVMAIVFR 300
Query: 301 HQGQGRNGIIPAGLKWLGLTIWLAAAVLLAVVFILITRELGLNFFLHRHSV 351
HQGQGRNGIIPAGLKWLGLTIWLAAAVLLAVVFILITRELGLNFFLHRHSV
Sbjct: 301 HQGQGRNGIIPAGLKWLGLTIWLAAAVLLAVVFILITRELGLNFFLHRHSV 351
>gi|150009348|ref|YP_001304091.1| hypothetical protein BDI_2756 [Parabacteroides distasonis ATCC
8503]
gi|149937772|gb|ABR44469.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 370
Score = 157 bits (396), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 115/356 (32%), Positives = 191/356 (53%), Gaps = 40/356 (11%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
MKKTLT+NLGG VFHIDEDAY LL++YL NL HF +E +D + +++FE +S+++ R
Sbjct: 1 MKKTLTVNLGGTVFHIDEDAYQLLDKYLSNLRIHFRKEEGSD-EIMDDFEMRISELLNER 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
+ + VI+I + EVI+++ E E+ +T Y +++ET +
Sbjct: 60 IRLGYEVITIEQVEEVIKRMGKPEEIFEEEEKST---DHEDNYRTQQQET------HAQT 110
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIG-YLAVWM 179
K++ RD D+ +L GV G AAY W+ A+R+++ L+ F+ I + Y +W+
Sbjct: 111 TKKRLMRDPDNRILGGVAGGFAAYMDWDPTAVRIVLFLLM-----FFYGITVPLYFLLWI 165
Query: 180 IAPPAITASQKLEMYGEPITVENIGKRV---------------AADVDTGLHRRNASVLN 224
I P A TA++KLEM G+ +TVENIGK V ++D ++ A V
Sbjct: 166 IVPMARTATEKLEMRGQSVTVENIGKTVTDGFEKVSNNVNDFISSDKPRNFFQKLADVFV 225
Query: 225 -TLGCVLKGLFVLVGIGLILAVVFGLFGLILELFA-----DGYVAH-APLTSDWIS-IPS 276
+G +LK +L GI L+ +V +F L++ FA G++ + +P +D I+ P
Sbjct: 226 LVIGFILKFFIILAGIFLLPPLVLVVFILVVVTFALLMGGAGFIYNLSPFGADLINGAPL 285
Query: 277 WMILCATISVFIIVAVPIMAIVFRHQGQ-GRNGIIPAGLKWLGLTIWLAAAVLLAV 331
M + I + + +PI ++++ Q + +P +W+ L +WL + VL V
Sbjct: 286 SMAIMGCIGTILFIGIPIFSLIYAICCQLFKVRPLPTQARWILLALWLISLVLCGV 341
>gi|167762899|ref|ZP_02435026.1| hypothetical protein BACSTE_01263 [Bacteroides stercoris ATCC
43183]
gi|167699239|gb|EDS15818.1| hypothetical protein BACSTE_01263 [Bacteroides stercoris ATCC
43183]
Length = 387
Score = 157 bits (396), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 117/359 (32%), Positives = 189/359 (52%), Gaps = 31/359 (8%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
MKKTLT+NLGG VFHIDEDAY LL+ YL NL HF +E D + I++ E +S++ +
Sbjct: 20 MKKTLTVNLGGTVFHIDEDAYRLLDNYLSNLKIHFRKEAGAD-EIIDDIERRISELFTEK 78
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
L + VI+I + EVI ++ E + +G + G S T + E
Sbjct: 79 LAAGSQVITIAYVEEVIARMGKPEELEPDAGDAASGSGSWDG-SNHAGNTGAGASATAEK 137
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
FYR+ D +L GV SG+AAY W+V +R+++V++ I F ++I Y+ W+I
Sbjct: 138 VSHHFYRNPDDKMLGGVVSGLAAYWGWDVTMLRLVLVIIMIFG---FKLLIPAYIICWII 194
Query: 181 APPAITASQKLEMYGEPITVENIGKRVAA---DVDTGLH-----RRNASVLNTLGCVL-- 230
P A TA++KL M GE +TV+NIGK V V G++ + + L LG L
Sbjct: 195 VPEARTAAEKLSMRGEAVTVDNIGKTVTGGFEKVANGVNDYMRSDKPRTFLQKLGDALVM 254
Query: 231 -KGLFV---LVGIGLILAVVFGLFGLILE--LFADGYVAHAPLTSDWISIPSW-MILCAT 283
GLF+ LV +I + + +FG++ LFA VA + P++ ++L A+
Sbjct: 255 VVGLFLKICLVIFAIICSPLLFVFGVVFVALLFAAVMVAIGGGAALISMFPTFNVVLPAS 314
Query: 284 --------ISVFIIVAVPIMAIVFRHQGQ-GRNGIIPAGLKWLGLTIWLAAAVLLAVVF 333
I+ ++V +P++++V+ Q + + +GLKW + +W+ +A + + F
Sbjct: 315 PLSAIVMYIAGILVVGIPLVSLVWMIFSQIFKWQPMVSGLKWTLVILWIVSACVFGICF 373
>gi|60682108|ref|YP_212252.1| hypothetical protein BF2629 [Bacteroides fragilis NCTC 9343]
gi|60493542|emb|CAH08329.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343]
Length = 359
Score = 156 bits (394), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 116/359 (32%), Positives = 184/359 (51%), Gaps = 43/359 (11%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
MKKTLT+NLGG VFHIDEDAY LL+ YL NL HF ++ + + + + E +S++ +
Sbjct: 1 MKKTLTVNLGGTVFHIDEDAYRLLDNYLCNLRLHFRKQEGAE-EIVNDIENRISELFAEK 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
L + VI+I + EVI ++ E F ED+ G + ++ T G
Sbjct: 60 LSAGSQVITIADVEEVIARMGKPEDFG--EDT---------GEEEPQKTTGQTGVQQGAT 108
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
+R+ YR+ D +L GV SG+AAY W+V +R+I+ ++ I + ++I Y+ W++
Sbjct: 109 IRRRLYRNPDDKILGGVISGLAAYLNWDVTVLRLIMFVVLICG---YGVLIPIYIICWLV 165
Query: 181 APPAITASQKLEMYGEPITVENIGKRVAAD-----------VDTG-----LHRRNASVLN 224
P A TA++KL M GE IT+ENIG+ V V++G L + ++++
Sbjct: 166 IPEARTAAEKLNMRGEDITIENIGRTVTDGFERMANGVNNYVNSGKPRSFLQKVGDALVS 225
Query: 225 TLGCVLKGLFVLVGI-----GLILAVVF--GLFGLILELFADGYVAHAPLTS-DW---IS 273
G LK V++ I +LA+VF + I G + L S DW IS
Sbjct: 226 IAGFFLKACLVVLAIICSPVLFVLAIVFVALVIAAIAVAIGGGAALYQMLPSVDWSPLIS 285
Query: 274 IPSWMILCATISVFIIVAVPIMAIVFRHQGQGRN-GIIPAGLKWLGLTIWLAAAVLLAV 331
M + +I+ ++ +P+ AI+F Q N + +GLKW L IW+ A V+ +
Sbjct: 286 TSPMMTIAGSIAGVVLAGIPLAAIIFVILRQIFNWSPMSSGLKWSLLIIWILAVVIFVI 344
>gi|53713899|ref|YP_099891.1| hypothetical protein BF2607 [Bacteroides fragilis YCH46]
gi|52216764|dbj|BAD49357.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
Length = 359
Score = 155 bits (392), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 114/359 (31%), Positives = 183/359 (50%), Gaps = 43/359 (11%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
MKKTLT+NLGG VFHIDEDAY LL+ YL NL HF ++ + + + + E +S++ +
Sbjct: 1 MKKTLTVNLGGTVFHIDEDAYRLLDNYLCNLRLHFRKQEGAE-EIVNDIENRISELFAEK 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
L + VI+I + EVI ++ E F + + + G +Q+ G
Sbjct: 60 LSAGSQVITIADVEEVIARMGKPEDFGEDTEEEEPKKTTGQTGAQQ-----------GAT 108
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
+R+ YR+ D +L GV SG+AAY W+V +R+I+ ++ I + ++I Y+ W++
Sbjct: 109 IRRRLYRNPDDKILGGVISGLAAYLNWDVTVLRLIMFVVLICG---YGVLIPIYIICWLV 165
Query: 181 APPAITASQKLEMYGEPITVENIGKRVAAD-----------VDTG-----LHRRNASVLN 224
P A TA++KL M GE IT+ENIG+ V V++G L + ++++
Sbjct: 166 IPEARTAAEKLNMRGEDITIENIGRTVTDGFERMANGVNNYVNSGKPRSFLQKVGDALVS 225
Query: 225 TLGCVLKGLFVLVGI-----GLILAVVF--GLFGLILELFADGYVAHAPLTS-DW---IS 273
G LK V++ I +LA+VF + I G + L S DW IS
Sbjct: 226 IAGFFLKACLVVLAIICSPVLFVLAIVFVALVIAAIAVAIGGGAALYQMLPSVDWSPLIS 285
Query: 274 IPSWMILCATISVFIIVAVPIMAIVFRHQGQGRN-GIIPAGLKWLGLTIWLAAAVLLAV 331
M + +I+ ++ +P+ AI+F Q N + +GLKW L IW+ A V+ +
Sbjct: 286 TSPMMTIAGSIAGVVLAGIPLAAIIFVILRQIFNWSPMSSGLKWSLLIIWILAVVIFVI 344
>gi|153806665|ref|ZP_01959333.1| hypothetical protein BACCAC_00935 [Bacteroides caccae ATCC 43185]
gi|149131342|gb|EDM22548.1| hypothetical protein BACCAC_00935 [Bacteroides caccae ATCC 43185]
Length = 365
Score = 151 bits (381), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 108/359 (30%), Positives = 186/359 (51%), Gaps = 39/359 (10%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
MKKTLT+NLGG V+HID+DAY LL+ YL NL H+ R+ + + I + E ++++ +
Sbjct: 1 MKKTLTVNLGGTVYHIDDDAYRLLDNYLSNL-KHYFRKQESAEEIINDIEMRIAELFAEK 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
+ + V+++ + EVI ++ E F ED + + E+ +S + + A
Sbjct: 60 VAAGKQVVTVQDVEEVIARVGKPEDFGITEDDAESN-------KRTEQSSSASQTYTRTA 112
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
R+ +RD D +L GV +G+AAY W++ +R++++++ + P MII+ Y+ W++
Sbjct: 113 GPRRLFRDPDSKLLGGVAAGLAAYLGWDITLVRILMIVLVFV--PYCPMIIL-YIVGWIV 169
Query: 181 APPAITASQKLEMYGEPITVENIGK-------RVAADVDTGLHR-RNASVLNTLGCVLKG 232
P A TA++KL M GE +T+ENIGK RVA V+ ++ + + L +G V
Sbjct: 170 IPEAHTAAEKLSMRGEAVTIENIGKTVTDGFERVADGVNNYVNSGKPRTFLQKIGDVFVA 229
Query: 233 L----FVLVGIGLILAVVFGLFGLILELFADGYVAHAPLTS------------DWISIPS 276
+ F + + L++ LF L + L A + A A S DW I S
Sbjct: 230 IAAVFFKIFLVALVIICCPVLFVLAIVLVALVFAAIAVAVSGGALLYELLPAIDWTPIAS 289
Query: 277 ---WMILCATISVFIIVAVPIMAIVFRHQGQGRN-GIIPAGLKWLGLTIWLAAAVLLAV 331
M L TI+ ++ +P+ A ++ Q + + GLKW L +W+ AV++ +
Sbjct: 290 VTPMMTLLGTIAGVALIGIPLGAFLYTILRQLFHWAPMGTGLKWSLLILWILGAVIMII 348
>gi|150005295|ref|YP_001300039.1| hypothetical protein BVU_2767 [Bacteroides vulgatus ATCC 8482]
gi|149933719|gb|ABR40417.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 348
Score = 145 bits (366), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 105/332 (31%), Positives = 169/332 (50%), Gaps = 55/332 (16%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
MKKTLT+NLGG V++IDEDAY+LL+ YL NL HF RE + + + + E +S++ R
Sbjct: 1 MKKTLTINLGGTVYYIDEDAYHLLDNYLTNLRIHFCREEGAE-EIVHDIELRISELFTDR 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
L+ VI+I + E+I ++ E S E +G S+K++ T+
Sbjct: 60 LNEGKQVITIEDVEEIIARMGKPEDLSDEESGEASG-------SEKQKGTT--------- 103
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
R+ +RD D+ VL GV SG+AAY W+V +R+I++++ +I+ Y+ W+I
Sbjct: 104 -MRRLFRDPDNKVLGGVASGLAAYMGWDVTWVRIILLVLGFFVHG----VILAYIIAWII 158
Query: 181 APPAITASQKLEMYGEPITVENIGKRVAADVD------------TGLHRRNASVLNTLGC 228
P A TA +KL M G I VENIGK V + + L + +++ G
Sbjct: 159 IPMARTAPEKLAMKGAAINVENIGKTVTDGFEKVNDYVRSDRPRSILQKIGEGIVSVAGF 218
Query: 229 VLKGLFVLVGIG-----LILAVVFGLFGLILELFADGYVAHAPLT-------SDWISIPS 276
++K L V + I +L +VF F L++ A G +A P +W ++ S
Sbjct: 219 LIKFLLVFIAICCAPVLFVLLIVF--FALLMA--ATGLIAALPAVLYEVLPAVNWATVGS 274
Query: 277 WMILCATISV--FIIVAVPIMAIV---FRHQG 303
L +SV +++ +PI+ ++ RH G
Sbjct: 275 SPGLTVAMSVAGILVIGIPIIGLIHMLMRHFG 306
>gi|160885694|ref|ZP_02066697.1| hypothetical protein BACOVA_03698 [Bacteroides ovatus ATCC 8483]
gi|156108507|gb|EDO10252.1| hypothetical protein BACOVA_03698 [Bacteroides ovatus ATCC 8483]
Length = 365
Score = 140 bits (352), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 105/365 (28%), Positives = 187/365 (51%), Gaps = 51/365 (13%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
MKKTLT+NLGG V+HID+DAY LL+ YL NL +F ++ + + + + E ++++ +
Sbjct: 1 MKKTLTINLGGIVYHIDDDAYRLLDNYLSNLKHYFRKQEGAE-EIVNDIEMRIAELFAEK 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFS-SNEDSTTAGYSAGAGYSQKERETSGYRYMSGE 119
+ VI++ + E+I ++ E F ++ED + + E+ +S + +
Sbjct: 60 VTEGKQVITVSDVEEIIARVGKPEDFGIADEDMDSQ--------KRTEQTSSANQGSTQT 111
Query: 120 APKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWM 179
A +R+++RD D+ +L GV +G+AAY W++ +R++++++ + P MII+ Y+ W+
Sbjct: 112 AAQRRWFRDPDNKLLGGVAAGLAAYFGWDITLVRILMIILVFV--PYCPMIIL-YIIGWI 168
Query: 180 IAPPAITASQKLEMYGEPITVENIGKRVAADVDTGLHRRNASVLNTLGCVLKGLFVLVGI 239
+ P A TA++KL M GE +T+ENIGK V G R V N + F L I
Sbjct: 169 VIPEARTAAEKLSMRGEAVTIENIGKTVT----DGFERVADGVNNYMNSGKPRTF-LQKI 223
Query: 240 G----LILAVVFGLFGLILELF------------------------ADGYVAHAPLTS-D 270
G I AV+F +F + L + + G + + L + D
Sbjct: 224 GDVFVSIAAVLFKIFLVALVILCCPVLFVLAVVLVALVFAAIAVAVSGGALLYEMLPAID 283
Query: 271 WI---SIPSWMILCATISVFIIVAVPIMAIVFRHQGQGRN-GIIPAGLKWLGLTIWLAAA 326
W+ S+ M L TI+ ++ +P+ A ++ Q + + GLKW L +W+ A
Sbjct: 284 WMPVASVSPMMTLLGTIAGVALIGIPLGAFLYTILRQLFHWSPMGTGLKWSLLILWILGA 343
Query: 327 VLLAV 331
V++ +
Sbjct: 344 VIMII 348
>gi|29345993|ref|NP_809496.1| hypothetical protein BT_0583 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29337887|gb|AAO75690.1| putative membrane protein [Bacteroides thetaiotaomicron VPI-5482]
Length = 364
Score = 139 bits (351), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 107/360 (29%), Positives = 188/360 (52%), Gaps = 42/360 (11%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
MKKTLT+NLGG V+HID+DAY LL++YL NL HF R+ + + + E ++++ +
Sbjct: 1 MKKTLTVNLGGTVYHIDDDAYRLLDDYLSNL-KHFFRKQEGAEEIVNDIEIRIAELFAEK 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
+ + VI+I + E+I ++ E F ++D + +KE+ S + +
Sbjct: 60 VSAGKQVITIADVEEIIARVGKPEDFGVSDDESEP--------HKKEQTASSGQGYTRTT 111
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVI-VVLMTILASPLFWMIIIGYLAVWM 179
R+ +RD D+ +L GV SG+AAY W++ +R++ +VL+ + P+ II Y+ W+
Sbjct: 112 TARRLFRDPDNKLLGGVASGLAAYFDWDITLVRILMIVLLFVPYCPM----IILYIIGWI 167
Query: 180 IAPPAITASQKLEMYGEPITVENIGK-------RVAADVDTGLH--------RRNASVLN 224
I P A TA++KL M GE +T+ENIGK RVA V+ ++ ++ V
Sbjct: 168 IIPEARTAAEKLSMRGEAVTIENIGKTVTDGFERVADGVNNFVNSDKPRTFLQKVGDVFV 227
Query: 225 TLGCVLKGLFVLVGIGLILAVVF--------GLFGLILELFADGYVAHAPLTS-DWISIP 275
T+ ++ +F++ + + V+F +F +I L G + + L + DW I
Sbjct: 228 TIAAIILKIFLVALVIICCPVLFVLAVVIVALVFAVIAALVGGGALLYEMLPAIDWTPIA 287
Query: 276 SW---MILCATISVFIIVAVPIMAIVFRHQGQGRN-GIIPAGLKWLGLTIWLAAAVLLAV 331
+ M L TIS ++A+P+ A ++ Q + + GLKW +W+ V++ +
Sbjct: 288 TISPVMTLLGTISGIALIAIPLGAFLYTIMRQLFHWSPMGTGLKWSLFILWVLGLVIVII 347
>gi|160888143|ref|ZP_02069146.1| hypothetical protein BACUNI_00551 [Bacteroides uniformis ATCC 8492]
gi|156862278|gb|EDO55709.1| hypothetical protein BACUNI_00551 [Bacteroides uniformis ATCC 8492]
Length = 369
Score = 130 bits (328), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 109/372 (29%), Positives = 186/372 (50%), Gaps = 57/372 (15%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
MKKTLT+NLGG VF+ID+DAY LL+ YL NL HF +E D + +++ E +S++ +
Sbjct: 1 MKKTLTVNLGGTVFNIDDDAYRLLDNYLSNLKMHFRKEAGAD-EIVDDIERRISELFAEK 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
L + VI+I + EVI ++ E + +S SA +G + + S A
Sbjct: 60 LSAGSQVITIADVEEVIARMGKPEDMEAEGESA----SADSGSTSAGGGYGAGAWNSNTA 115
Query: 121 ---PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAV 177
+R+ YR+ D +L GV SG+AAY W+V +R++++++ I +I Y+
Sbjct: 116 YGTTRRRLYRNPDDKMLGGVISGMAAYLGWDVTLLRLLLLVILICG---VGTLIPVYIVC 172
Query: 178 WMIAPPAITASQKLEMYGEPITVENIGK-------RVAADVD---------TGLHRRNAS 221
W++ P A TA++KL M GE +TVENIGK +VA V+ T L + +
Sbjct: 173 WLVIPEARTAAEKLSMRGEAVTVENIGKTVTDGFEKVANGVNDYMRSDKPRTFLQKLGDA 232
Query: 222 VLNTLGCVLKGLFVLVGI--------------GLI---LAVVFGLFGLILELF--ADGYV 262
++ +G LK V+ I L+ +AV G ++ F AD +
Sbjct: 233 LVMIVGWFLKICLVIFAIICSPVLFVFGVVFVALLFAAIAVAVGGGAALISFFPMADVVL 292
Query: 263 AHAPLTSDWISIPSWMILCATISVFIIVAVPIMAIVFRHQGQ-GRNGIIPAGLKWLGLTI 321
+PL++ + I+ ++V +P++++V+ Q + + +GLKW + +
Sbjct: 293 PTSPLSA----------IVMYIAGILLVGIPLVSLVWAIFSQIFKWQPMNSGLKWTLVIL 342
Query: 322 WLAAAVLLAVVF 333
W+ +A + F
Sbjct: 343 WIVSAACFGICF 354
>gi|154490396|ref|ZP_02030657.1| hypothetical protein PARMER_00629 [Parabacteroides merdae ATCC
43184]
gi|154089007|gb|EDN88051.1| hypothetical protein PARMER_00629 [Parabacteroides merdae ATCC
43184]
Length = 363
Score = 130 bits (327), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 118/367 (32%), Positives = 186/367 (50%), Gaps = 57/367 (15%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
MKKTLT+NLGG VFHIDEDAY LL++YL NL HF +E ++ + + +FE +S++ R
Sbjct: 1 MKKTLTVNLGGTVFHIDEDAYQLLDKYLANLRIHFRKEEGSE-EIMNDFEMRISELFNER 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
+ + VI+I + EVI+++ E E+ A Q+E G
Sbjct: 60 VRLGYEVITIEHVEEVIKRMGKPEELFEGEEEKEYKEEARTQAFQEEEIPRG-------- 111
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRV-IVVLMTILASPLFWMIIIGYLAVWM 179
+K RD D+ VL GV GIAAY W+V A+R+ +++L+ I +P+ II YL +W+
Sbjct: 112 -PKKLMRDPDNRVLGGVAGGIAAYMGWDVTAVRLAMIILLFIPYAPI----IILYLILWL 166
Query: 180 IAPPAITASQKLEMYGEPITVENIGK-------RVAADVDTGLHR-RNASVLNTLGCVLK 231
+ P A TA+ KL M G+ +T+ENIGK +V+ +V+ + + S L L +
Sbjct: 167 VMPLARTAADKLMMRGQSVTLENIGKTVTDGFEKVSNNVNDYMSSDKPRSFLQKLADLFV 226
Query: 232 GL--FVL---------------VGIGLILAVV-FGLF----GLILEL--FADGYVAHAPL 267
G+ F+L + + IL VV F L G + +L F +A AP+
Sbjct: 227 GVVGFILKFLAILIGIILLPPLLLVAFILVVVTFALIAGGTGFLYQLSPFGANLIAGAPI 286
Query: 268 TSDWISIPSWMILCATISVFIIVAVPIMAIVFRHQGQ-GRNGIIPAGLKWLGLTIWLAAA 326
+ + + I +++ +PI A+V+ Q + +P KW L +WL +
Sbjct: 287 S---------LAIMGCIGFILLIGIPIFALVYAICMQLFKAKPLPNTAKWTLLILWLVSV 337
Query: 327 VLLAVVF 333
VL + F
Sbjct: 338 VLCVIYF 344
>gi|88806557|ref|ZP_01122074.1| hypothetical protein RB2501_00756 [Robiginitalea biformata
HTCC2501]
gi|88783389|gb|EAR14561.1| hypothetical protein RB2501_00756 [Robiginitalea biformata
HTCC2501]
Length = 582
Score = 114 bits (284), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 101/360 (28%), Positives = 165/360 (45%), Gaps = 47/360 (13%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
M KT+ +NL +FHIDE+AY L YL + F +D + I + EA ++++ +
Sbjct: 1 MNKTVNINLANMLFHIDENAYQKLLRYLEAVKRSFAGTAGSD-EIIADIEARIAELFYEK 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
+ +E VI+ + VI + E + +ED G G S K R T
Sbjct: 60 MENERQVITQKEVDAVIAIMGQPEDYQVDED-IFEDVPPGTG-SAKSRTTRS-------- 109
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
+K YRD+DH + GVC+G+ Y + IR+I +++ + F I Y+ +W++
Sbjct: 110 -AKKLYRDIDHKYIGGVCAGLEHYLGLDALWIRLIFIILAVFTGFGF----IAYILLWIL 164
Query: 181 APPAITASQKLEMYGEPITVENIGKRVAADVDTGLHR---------------RNASVLNT 225
P A T +QKL+M GEP+ + NI ++V D R + +T
Sbjct: 165 VPEAATTAQKLDMTGEPVNISNIERKVKEGFDDVAERVRSVDYEKVGSRVKSSGKTFFDT 224
Query: 226 LGCVLKGLFVLVG----IGLILAVVFGLFGLILELFADGYVAHAPLTS-DWISI------ 274
LG V+ F ++G I LI+ L GL + LF G V + D I +
Sbjct: 225 LGDVIMFFFKVIGKFIGILLIIIGAATLIGLFIALFTVGVVDAVQIPGVDLIGLLNSTET 284
Query: 275 PSWMILCATISVFIIVAVPIMAIVFRHQGQGRNGIIPAG--LKWLGLTIWLAAAVLLAVV 332
P W++ ++ VF+ V +P +++ N + G K+ L +WL A + LAV+
Sbjct: 285 PVWIV---SLLVFLTVGIPFFFLLYLGLKILVNNLKSIGNIAKFSLLGLWLIAVISLAVL 341
>gi|120435346|ref|YP_861032.1| membrane protein containing PspC domain [Gramella forsetii KT0803]
gi|117577496|emb|CAL65965.1| membrane protein containing PspC domain [Gramella forsetii KT0803]
Length = 586
Score = 113 bits (282), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 100/323 (30%), Positives = 150/323 (46%), Gaps = 57/323 (17%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
M KT+ +NL G FHIDEDAY L+ YL + F D + I + EA ++++ +
Sbjct: 1 MNKTVNINLAGTFFHIDEDAYARLQRYLEAIRHSFSNTQGRD-EIISDIEARIAELFSEK 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
+ VISI + EVI + E + +E+ + + + T R +
Sbjct: 60 RKDDRQVISIKEVEEVITIMGQPEDYMVDEEI----------FEDEPKRTKSTRTIG--- 106
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
++ +RD ++ + GV SG+ Y +R++ VL+TI +S F +I Y+A W+
Sbjct: 107 --KQLFRDTENGHVGGVSSGLGHYLGIEAIWVRLLWVLLTIFSSGAFVLI---YIAFWIF 161
Query: 181 APPAITASQKLEMYGEPITVENIGKRVAADVDTGLHRRNASVLN---------------- 224
P A T + KL M GE +TV NI K++ G H + SV N
Sbjct: 162 VPEAKTTADKLAMRGEEVTVSNIEKKIRE----GFHDVSESVKNVDYGKYGKKASAGATS 217
Query: 225 ---TLGCVLK---GLFV-LVGIGLILAVVFGLFGLILELFADGY--VAHAPLTSDWISI- 274
TLG ++K LFV VGI L+L L GL + LFA G + AP T D+I +
Sbjct: 218 AATTLGDIIKFCLKLFVKFVGILLLLIAGTTLIGLFVGLFAVGTFGIVDAPWT-DYIDMV 276
Query: 275 ----PSWMILCATISVFIIVAVP 293
P W+I T F V +P
Sbjct: 277 NSGAPIWVISLLT---FFAVGIP 296
>gi|146297961|ref|YP_001192552.1| phage shock protein C, PspC [Flavobacterium johnsoniae UW101]
gi|146152379|gb|ABQ03233.1| phage shock protein C, PspC [Flavobacterium johnsoniae UW101]
Length = 578
Score = 112 bits (281), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 71/256 (27%), Positives = 130/256 (50%), Gaps = 22/256 (8%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
M KT+ +NLGG FHIDEDAY L Y + + + D + I++ E +S+++ +
Sbjct: 1 MNKTVNINLGGMFFHIDEDAYLKLSRYFDAIKRSLNNSSGQD-EIIKDIEMRVSELLTEK 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
SE V+ + + EVI + E + ++ +T+ + + Y+ +
Sbjct: 60 QKSEKHVVGLKDVDEVIAVMGQPEDYRIEDEESTS--QSNSSYNTRRH------------ 105
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
+K YRD ++ ++ GV +G+A Y + ++A+ V +V + I F I+ Y +W++
Sbjct: 106 --KKLYRDKENGMVGGVATGLAHY--FGIDAVWVKIVFL-IFVFAGFGTGILAYFVLWVV 160
Query: 181 APPAITASQKLEMYGEPITVENIGKRVAADVDTGLHRRNASVLNTLGCVLK--GLFVLVG 238
P A+T S+KLEM GEP+T+ NI K+V ++D+ + + + +G +K +
Sbjct: 161 TPEAVTTSEKLEMTGEPVTISNIEKKVREEIDSLSEKFKNADYDKMGNQVKSGAERISSS 220
Query: 239 IGLILAVVFGLFGLIL 254
G + VF +F L
Sbjct: 221 FGDFIMTVFKIFAKFL 236
>gi|149277005|ref|ZP_01883147.1| hypothetical protein PBAL39_08956 [Pedobacter sp. BAL39]
gi|149231882|gb|EDM37259.1| hypothetical protein PBAL39_08956 [Pedobacter sp. BAL39]
Length = 516
Score = 112 bits (280), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 88/317 (27%), Positives = 157/317 (49%), Gaps = 50/317 (15%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
MKKTL +N+G + H++EDAY LL+ YL+ + HF + N D + + + E ++++
Sbjct: 1 MKKTLNINIGNSIIHLEEDAYELLKGYLNEVKQHFSK-NADDFEIVTDIENRIAELFAEM 59
Query: 61 LH-SEHAVISIGLIREVIEKINTGEHFSSNE--DSTTAGYSAGAGYSQKERETSGYRYMS 117
L + VI + ++ ++ ++ + F ++E +ST Y A AGY Q
Sbjct: 60 LQVQQKQVIGVEDVQAMMLQMGSVSDFENSEADESTVEDYVAAAGYDQ------------ 107
Query: 118 GEAPKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAV 177
+K YRD D A++ GVC+G Y ++ +RV+ +L +L ++ YL +
Sbjct: 108 ----VKKLYRDTDQAMIAGVCAGFGHYLDLDIRWVRVLALLSFLLGGSG----VVAYLVM 159
Query: 178 WMIAPPAITASQKLEMYGEPITVENIGKRVAADVDTGLHRRN----ASVLNTLGCVLKGL 233
W++ P A + ++K+ MYGE + K A L R++ A ++ LG + +
Sbjct: 160 WIVIPKAESRAEKMAMYGETPNL----KGFANSHLNPLFRQSRGFIAELMVFLGNIFQKS 215
Query: 234 FVLVGIGLILAVVFGLFGLILELFADGYVA------------HAPLTSDWISIPSWMILC 281
V + A+V +FG +L LF G +A + PL+ I +M+
Sbjct: 216 SRFVFKAIAAAIV--IFGSMLLLFLIGCLAAFMGFWDASIYSYFPLS---IVNEDYMV-S 269
Query: 282 ATISVFIIVAVPIMAIV 298
TI+VF+ AVP++A++
Sbjct: 270 LTIAVFLSFAVPLLALI 286
>gi|126661774|ref|ZP_01732773.1| hypothetical protein FBBAL38_00445 [Flavobacteria bacterium BAL38]
gi|126625153|gb|EAZ95842.1| hypothetical protein FBBAL38_00445 [Flavobacteria bacterium BAL38]
Length = 592
Score = 112 bits (279), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 112/213 (52%), Gaps = 21/213 (9%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
M KT+++NLGG FHIDEDAY L Y + + + +++ E+ ++++ R
Sbjct: 1 MNKTISINLGGFFFHIDEDAYQKLSRYFDAVKRSLSPDGRDEI--MKDIESRIAELFQER 58
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
L +E V+ + I EVI + E + ++D T+ Y ++ Y Y
Sbjct: 59 LKNEKQVVGLLEIEEVISIMGQPEDYKIDDDKTS--------YQSNSTSSTNYYY----- 105
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTI-LASPLFWMIIIGYLAVWM 179
P ++ YRD D+ ++ GV +G+ Y + +R+++V++ + LF Y+ +W+
Sbjct: 106 PSKRLYRDKDNGMIGGVMAGLGHYLGVDALWLRILMVILFFGFGTGLFV-----YIVLWI 160
Query: 180 IAPPAITASQKLEMYGEPITVENIGKRVAADVD 212
+ P AIT +QKLEM G+PIT+ NI K+V D
Sbjct: 161 LVPEAITTTQKLEMKGQPITISNIEKKVKEGFD 193
>gi|83857034|ref|ZP_00950562.1| hypothetical protein CA2559_09558 [Croceibacter atlanticus
HTCC2559]
gi|83848401|gb|EAP86270.1| hypothetical protein CA2559_09558 [Croceibacter atlanticus
HTCC2559]
Length = 582
Score = 110 bits (274), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/325 (26%), Positives = 155/325 (47%), Gaps = 51/325 (15%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
M KT+ +NL G FH+DE+AYN L+ YL + F D + I + EA +S++ +
Sbjct: 1 MNKTVNINLAGFFFHLDEEAYNKLQRYLEAIKRSFTDAQGRD-EIIHDIEARISELFSEK 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
+ +++ VI++ + EVI + E + +++ YS+ ++ Y
Sbjct: 60 IETDNQVITLKEVDEVIAVMGQPEDYQLDDEIFEDDYSS--------KKQPNY------- 104
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
++ YRD DH + GV SG+ Y + +R++ VL+TI +S F +I Y+ W+
Sbjct: 105 --KQLYRDPDHKYIGGVSSGLGHYLGIDALWMRLLWVLLTIFSSGAFILI---YIVFWIF 159
Query: 181 APPAITASQKLEMYGEPITVENIGKRVAADVDTGLHRRN---------------ASVLNT 225
P A T ++KL M GE + + NI K+V DT + ++++
Sbjct: 160 VPEAHTTAEKLAMRGEAVNISNIEKKVREGFDTVASKVKDVNYDQYSNKAKSGVSAIVEA 219
Query: 226 LGCV----LKGLFVLVGIGLILAVVFGLFGLILELFADGY--VAHAPLTSDWISI----- 274
+G + LK ++G+ L+L L L + LF+ G + AP T D++ +
Sbjct: 220 IGKIIMFCLKVFVKIIGVFLLLIGGVTLIALFIGLFSVGTFGIIDAPWT-DYVEMANVGA 278
Query: 275 PSWMILCATISVFIIVAVPIMAIVF 299
P W++ +I +F V +P + +
Sbjct: 279 PIWVV---SILIFFAVGIPFFFVFY 300
>gi|163756282|ref|ZP_02163396.1| hypothetical protein KAOT1_01499 [Kordia algicida OT-1]
gi|161323634|gb|EDP94969.1| hypothetical protein KAOT1_01499 [Kordia algicida OT-1]
Length = 578
Score = 108 bits (269), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 80/255 (31%), Positives = 124/255 (48%), Gaps = 24/255 (9%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
M KT+ +NL FHIDE+AYN L+ YL+ + F D + I + EA ++++ R
Sbjct: 1 MNKTININLANMFFHIDEEAYNSLQRYLNAIKRSFTDSQGRD-EIIADIEARVAELFSER 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
+ +E VIS+ ++EVIE + E Y + E +T R S
Sbjct: 60 MKTERQVISMKEVQEVIEIMGQPED-----------YLVDDVIFEDEPKTHHRRTYSSH- 107
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRV--IVVLMTILASPLFWMIIIGYLAVW 178
RK YRD D+ L GVC+G+A Y + IR+ I+V++ + SP I+ Y+ +W
Sbjct: 108 --RKLYRDYDNKFLGGVCAGLAQYFGIDALWIRLLAIIVVLAGVGSP-----ILVYIILW 160
Query: 179 MIAPPAITASQKLEMYGEPITVENIGKRVAADVDTGLHRRNASVLNTLGCVLK--GLFVL 236
+I P A T S+KL M G+P + NI +++ +D + N + +G K
Sbjct: 161 IIIPKANTTSEKLAMAGKPANISNIEQKIKEGLDDVQEKFNDVNFDRVGQRAKSGAASFF 220
Query: 237 VGIGLILAVVFGLFG 251
IG IL V +F
Sbjct: 221 DTIGNILTVFLKIFA 235
>gi|89891646|ref|ZP_01203150.1| conserved hypothetical transmembrane protein [Flavobacteria
bacterium BBFL7]
gi|89516193|gb|EAS18856.1| conserved hypothetical transmembrane protein [Flavobacteria
bacterium BBFL7]
Length = 583
Score = 106 bits (264), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 93/355 (26%), Positives = 164/355 (46%), Gaps = 50/355 (14%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
M KT+ +NL G FHIDEDAYN L++YL + F ++ + + + E+ ++++ + +
Sbjct: 1 MNKTININLAGLFFHIDEDAYNKLQKYLAAVRRSFSGMQGSE-EIMADIESRVAELFLEK 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
+E VISI + EVI + E + +E+ + ER ++ +
Sbjct: 60 RANEQQVISITHVEEVISIMGQPEDYEVDEE-----------IFEGERRSTAQEMKT--R 106
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
R +RD + + GVC+G+ + + +R+ V+++ ++ P ++ Y+ W++
Sbjct: 107 FSRPLFRDTLNGYIGGVCAGLGHFLGIDAIWVRIFFVVVSFISFPF---NVLAYVIFWIV 163
Query: 181 APPAITASQKLEMYGEPITVENIGKRVAADVDT-------------GLHRRNASV--LNT 225
A A+T +Q+L M G+ + + NIG+ + D G + +V +
Sbjct: 164 AKDAVTTNQRLAMMGKEVNISNIGENFKSGFDDVVDGQTDADYRIVGQKGKRGTVRFFSF 223
Query: 226 LGCVLKGLFVLVG--IGLILAVVFGLFGLILELFADGYVAHAPLTSD------WISIPS- 276
LG +KG+F + IGL + + G GLI F+ + A L SD +SIPS
Sbjct: 224 LGRFIKGIFKAIAKIIGLFM-FLLGATGLIALFFSLIGIGTAQLQSDDFMLLMNMSIPSN 282
Query: 277 ----WMILCA----TISVFIIVAVPIMAIVFRHQGQGRNGIIPAGLKWLGLTIWL 323
W+ L A I FII + + +V GR + GL W+ I L
Sbjct: 283 LSTWWIYLTAFFLIGIPFFIIAVLGLRLLVSNLASIGRTAKVIIGLLWVASVIGL 337
>gi|86133719|ref|ZP_01052301.1| hypothetical protein MED152_03405 [Tenacibaculum sp. MED152]
gi|85820582|gb|EAQ41729.1| hypothetical protein MED152_03405 [Polaribacter dokdonensis MED152]
Length = 576
Score = 104 bits (260), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 102/389 (26%), Positives = 183/389 (47%), Gaps = 86/389 (22%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
M KT+ +NLGG FHIDE AY L+ YL +++ + + I + EA +S+++ +
Sbjct: 1 MNKTININLGGFFFHIDEVAYQKLKRYLESISRSLSDDPQGKNEIIADIEARISELLSEK 60
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
+ V++ G I +VI+ + E ++ E++ YS S Y Y A
Sbjct: 61 ITDARQVVNEGDIDDVIKIMGQPEDYADAEEA----------YSD-----SSYSYKRNSA 105
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
+K +RD D L GV SGIA Y ++++ I + + L+ + F +I Y+ +W++
Sbjct: 106 SGKKLFRDGDDKFLGGVASGIAHY--FDIDTIWIRLGLLALFFGAGFG--VILYIILWIL 161
Query: 181 APPAITASQKLEMYGEPITVENIGKRVAADVD---------------------------- 212
P A T ++KL+M GEP+ ++NI K++ + +
Sbjct: 162 LPEAKTTAEKLQMEGEPVNIDNIEKKIREEFNNVSENVRVVANQASEKLKEGANEFSDKM 221
Query: 213 ----TGLHRRN---ASVLNTLGCVLKGLFVLVG--IG---------LILAVVFGLF--GL 252
+G +N + +T+G ++ +F ++G IG +IL+++ G F G
Sbjct: 222 SKTFSGKTTKNNGASDFFDTIGKIILAVFKVIGKFIGVIIIFVSAAVILSLIIGGFSVGS 281
Query: 253 ILELFADG-YVAHAPLTSDWISIPSWMI-LCATISVFIIVAVPIMAI------VFRHQGQ 304
+ L DG +V + P D I +P W++ LC+ F+++ +P + + + Q
Sbjct: 282 LEWLNVDGDFVTYPPFFHDAI-LPVWLLTLCS----FLLIGIPFLVLFILGLRILSSNVQ 336
Query: 305 GRNGIIPAGLKWLGLTIWLAAAVLLAVVF 333
N P L LG IW+ + LLA++F
Sbjct: 337 KLNK--PTSLTLLG--IWILS--LLAMIF 359
>gi|149372114|ref|ZP_01891384.1| hypothetical protein SCB49_00300 [unidentified eubacterium SCB49]
gi|149354881|gb|EDM43443.1| hypothetical protein SCB49_00300 [unidentified eubacterium SCB49]
Length = 596
Score = 102 bits (254), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 173/372 (46%), Gaps = 70/372 (18%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
MKKT+ +NL G FHIDEDAY L YL + +D + I + EA ++++ +
Sbjct: 1 MKKTVNINLAGTFFHIDEDAYGKLSRYLDAIKKSLSDPQGSD-EIIRDIEARIAELFSEK 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
+ S VISI + EVI + E +S +E+ + SG R
Sbjct: 60 IESNTQVISISELDEVIAVMGQPEDYSVDEE-----------IFEDAPLNSGKR-----T 103
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
+ +RD+D+ + GV SG+ Y + + +R+I + +T++ S +F +I + +W++
Sbjct: 104 TYNQLFRDIDNKFIAGVSSGLGHYFKIDAIWVRLIWIFLTLVTSGMFILIY---ILLWIL 160
Query: 181 APPAITASQKLEMYGEPITVENIGKRVA------------ADVD---TGLHRRNASVLNT 225
P AIT S KL+M E + + NI K++ AD D +++ + T
Sbjct: 161 IPAAITVSDKLKMNREAVNITNIEKKIKEGYETVADGIKNADYDKYGKKINKGASGFFET 220
Query: 226 LGCVLKGLF-----------VLVGIGLILAVVFGLFGL-ILELFADG----YVAHAPLTS 269
LG + K +F V+VG +++++ LF L ++ DG Y+ H TS
Sbjct: 221 LGKIFKTIFKIFGKFFGVLMVIVGFATVISLIIALFTAGTLGIYKDGASMDYI-HMANTS 279
Query: 270 DWISIPSWMILCATISVFIIVAVPIMAIVFRHQGQGRNGIIPAGLKWLGLT-------IW 322
D P W++ ++ + + +P +A+ I+ L+ +G T +W
Sbjct: 280 D---APIWLV---SLLLLFAIGIPFLALAILGL-----KILIKNLRSMGWTAKVILIVLW 328
Query: 323 LAAAVLLAVVFI 334
LA+ V LA++ I
Sbjct: 329 LASLVGLAIIGI 340
>gi|163787924|ref|ZP_02182370.1| hypothetical protein FBALC1_06083 [Flavobacteriales bacterium
ALC-1]
gi|159876244|gb|EDP70302.1| hypothetical protein FBALC1_06083 [Flavobacteriales bacterium
ALC-1]
Length = 598
Score = 101 bits (251), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 114/219 (52%), Gaps = 30/219 (13%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
M KT+ +NL G FHIDEDAY L+ YL + F ++ + I + EA ++++ R
Sbjct: 1 MNKTVNINLAGIFFHIDEDAYLKLQRYLEAIKRSF-TDSQGRSEIISDIEARIAELFSER 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSN----EDSTTAGYSAGAGYSQKERETSGYRYM 116
+ +E V+ + L+ EVI + E + + ED ++R++S
Sbjct: 60 IQNEKQVVGVKLVDEVITIMGQPEDYLVDDEIFEDEPQP----------RQRQSS----- 104
Query: 117 SGEAPKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAI--RVIVVLMTILASPLFWMIIIGY 174
P +K YRD D++ + GV +G++ Y + + +I R+I +L+ I + F +I Y
Sbjct: 105 ---KPSKKLYRDTDNSYIGGVAAGLSHY--FGIESIWTRLIWLLLAIGSGGTFILI---Y 156
Query: 175 LAVWMIAPPAITASQKLEMYGEPITVENIGKRVAADVDT 213
L W + P A T ++KL M G+P+ + NI K++ +D+
Sbjct: 157 LIFWALVPEAKTTAEKLTMSGDPVNISNIEKKIKDGIDS 195
>gi|86131736|ref|ZP_01050333.1| hypothetical protein MED134_02015 [Cellulophaga sp. MED134]
gi|85817558|gb|EAQ38732.1| hypothetical protein MED134_02015 [Dokdonia donghaensis MED134]
Length = 598
Score = 101 bits (251), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 165/369 (44%), Gaps = 59/369 (15%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
M KT+ +NL G FHIDEDAY L+ YL + F+ D + I + EA +S++ R
Sbjct: 1 MNKTVNINLAGVFFHIDEDAYGKLQRYLAAIKRSFEGMQGED-EIIADIEARISELFSER 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
+ E VI + EVI + E + ++D + + +S S +A
Sbjct: 60 IKDERQVIGSQELDEVIAVMGQPEDYMVDDDI----------FEDEPAASSNSSKKSKQA 109
Query: 121 PK---RKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIG---- 173
P+ R+FYRD D+A + GV SG+ Y + +R+ VL+ L W + G
Sbjct: 110 PRIAQRRFYRDTDNAYIGGVSSGMGHYLGIDPLWVRIGWVLLIALT----WFTLGGTALI 165
Query: 174 YLAVWMIAPPAITASQKLEMYGEPITVENIGKRV------------AADVDTGLHRRNAS 221
YLA+W+ P A T + KL M G+ + ++NI ++V + D D ++
Sbjct: 166 YLALWVFVPEAQTTADKLAMRGKAVNIDNITEKVKEGFENVADTVKSVDYDKYGNKVKTG 225
Query: 222 VLNTLGCV-------LKGLFVLVGIGLIL---AVVFGLF-----GLILELFADGYVAHAP 266
G + K L ++G+ LI+ VV L G +++LF +A P
Sbjct: 226 AQGFFGTIGKIIMFFFKVLAKIIGVLLIITGATVVISLLITFLTGGVVDLFTPN-LADLP 284
Query: 267 LTSDWISIPSWMILCATISVFIIVAVPIMAIVFRH----QGQGRNGIIPAGLKWLGLTIW 322
++ +P W++L T+ V +P + + ++ + A L GL W
Sbjct: 285 WVNNETGLPIWLVLLLTL---FAVGIPFFFLFYLGLKIVSSNLKSMPVSAKLSLFGL--W 339
Query: 323 LAAAVLLAV 331
L +A+++ V
Sbjct: 340 LVSAIVIGV 348
>gi|167752669|ref|ZP_02424796.1| hypothetical protein ALIPUT_00926 [Alistipes putredinis DSM 17216]
gi|167659738|gb|EDS03868.1| hypothetical protein ALIPUT_00926 [Alistipes putredinis DSM 17216]
Length = 415
Score = 100 bits (249), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 92/325 (28%), Positives = 150/325 (46%), Gaps = 45/325 (13%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
MK+ ++ G F ++ DA+ +L YL +L + R + +E+ EA ++++++
Sbjct: 1 MKEVKKCSISGIAFTMEADAFQVLSRYLESLNDTY-RNTDGGKEIVEDIEARIAELILS- 58
Query: 61 LHSEHA-VISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGE 119
H +++ V+ + L+ E+I ++ + E RE G GE
Sbjct: 59 -HQDNSKVVELSLVEEIIAQMGSAEAI---------------------REQEGREAPRGE 96
Query: 120 AP-KRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILAS-----PLFWMII-- 171
R+ YRDMD A L GVCSG+A Y + IR+++ L IL P FW ++
Sbjct: 97 GRIPRRLYRDMDQARLGGVCSGVAKYFGTSPTWIRLLMFLPLILLLLLGWIPFFWWLVPL 156
Query: 172 ---------IGYLAVWMIAPPAITASQKLEMYGEPITVENIGKRVAA--DVDTGLHRRNA 220
I YL +W P A TA QKLE G+ ITV++I + AA DVD A
Sbjct: 157 MSNLLGVFFICYLIMWFAVPAARTARQKLEQNGQRITVQSIEEASAANHDVDADAKPVVA 216
Query: 221 SVLNTLGCVLKGLFVLVGIGLILAVVFGLFGLILELFADGYVAHAPLTSD-WISIPSWMI 279
+ LG V+ + L+ L+ ++ G LI+ LFA G ++ D I S I
Sbjct: 217 KAVYALGQVVLIVLKLLAGLLVFGLILGACALIIGLFAVGMAGPEVVSIDAGIWTVSLGI 276
Query: 280 LCATISVFIIVAVPIMAIVFRHQGQ 304
+ A + V +++ V + I R +
Sbjct: 277 MTALVPVLLLIYVLMCLIASRKPSR 301
>gi|88802403|ref|ZP_01117930.1| hypothetical protein PI23P_07435 [Polaribacter irgensii 23-P]
gi|88781261|gb|EAR12439.1| hypothetical protein PI23P_07435 [Polaribacter irgensii 23-P]
Length = 554
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 105/401 (26%), Positives = 183/401 (45%), Gaps = 92/401 (22%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
M KT+ +NLGG FHIDE AY L YL++++S ++ + I + EA +S+++ +
Sbjct: 1 MNKTININLGGFFFHIDEIAYEKLRRYLNSISSSLSDDSQGRNEIISDIEARISELLSEK 60
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
+ V+S I ++I + E +S E A YS +SQ + +G
Sbjct: 61 IKDSRQVVSESDIEDIIVIMGQPEDYSEPE----ASYSE-PNFSQSKNNATG-------- 107
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
+K +RD + L GV SGIA Y +N++ I + + L+ + F ++I Y +W++
Sbjct: 108 --KKLFRDGEDKFLGGVASGIAHY--FNMDTIWIRLGLLALFFGAGFGILI--YSILWIL 161
Query: 181 APPAITASQKLEMYGEPITVENIGKRV-----------------AAD-VDTGLH------ 216
P A T ++KL+M+GE + ++NI K++ A+D + G H
Sbjct: 162 LPEARTTAEKLQMHGEAVNIDNIEKKIREEFTNVSDSVRSAAHHASDKIKDGAHEFSEKI 221
Query: 217 --------RRN---ASVLNTLGCVLKGLFVLVG--IGLILAVVFG--LFGLILELFADGY 261
++N +NT+ +++ F + G IG +L + LI+ F+ G
Sbjct: 222 GQTFSGKKKKNNGLQDFINTIEKIIRIFFKIFGKFIGFLLVFTAATVILCLIIGGFSIGS 281
Query: 262 VAHAPLTSDWIS---------IPSWMILCATISVFIIVAVPIMAIV---FRHQGQGRNGI 309
+ S++IS +P WM+ TIS+ ++ +P + ++ R I
Sbjct: 282 FEFLNIESNFISYPDFFYAATLPRWML---TISLLTLIGIPFLILLVLGLR--------I 330
Query: 310 IPAGLKWLGLT-------IWLAAAVLLAVVFILITRELGLN 343
+ + LK T IW A LL V+F + E G N
Sbjct: 331 LSSNLKRFSKTASLTLLGIWFIA--LLIVIFTAV--EFGTN 367
>gi|91214578|ref|ZP_01251551.1| hypothetical protein P700755_16974 [Psychroflexus torquis ATCC
700755]
gi|91187005|gb|EAS73375.1| hypothetical protein P700755_16974 [Psychroflexus torquis ATCC
700755]
Length = 524
Score = 98.6 bits (244), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 84/328 (25%), Positives = 149/328 (45%), Gaps = 69/328 (21%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
M KT+ +NL FHIDEDA+ L+ YL + + +E D + +++ EA ++++
Sbjct: 1 MNKTININLANVFFHIDEDAFKKLDSYLKAIERYLSKEQSKD-EILQDIEARIAELFTES 59
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
+ VI++ + +I + E + ED SA +
Sbjct: 60 TAQVNQVITMSQVNAMIGIMGEPEAYMM-EDDEEPSSSASPKFK---------------- 102
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMI---------I 171
RK YRD++ + GV +G+ Y N IR LFW+I +
Sbjct: 103 ASRKLYRDLERKYVGGVSAGLNHYLGINTLLIR------------LFWVISAFFSVGGTV 150
Query: 172 IGYLAVWMIAPPAITASQKLEMYGEPITVENIGKRVA----------ADVDTGLHRRN-- 219
Y+ +W++ P A T +QKL+M GEP+ + NI ++V D+D + +
Sbjct: 151 AIYVIIWILIPAARTTAQKLDMQGEPVNLSNIERKVKEGYSKFADKVGDIDYEKYGQQTK 210
Query: 220 ---ASVLNTLGCVLK--GLFVLVGIGLILAVVFG--LFGLILELFADGYVAHAPLTSDW- 271
L+ LG VL+ G+F+ +G++L ++ G L GL++ LF+ G ++ L D+
Sbjct: 211 SGLTKFLDGLGEVLRALGVFLSKFLGIVLLLISGPVLIGLLIFLFSFGTISVFEL-GDFS 269
Query: 272 ------ISIPSWMILCATISVFIIVAVP 293
+ IP+W+ + F++ A+P
Sbjct: 270 QIEIFVLGIPNWI---QIVLFFLVAAIP 294
>gi|150024770|ref|YP_001295596.1| hypothetical protein FP0676 [Flavobacterium psychrophilum JIP02/86]
gi|149771311|emb|CAL42780.1| Protein of unknown function [Flavobacterium psychrophilum JIP02/86]
Length = 593
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/223 (27%), Positives = 112/223 (50%), Gaps = 24/223 (10%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
M KT+ MNLGG FHIDEDAY L Y + + + + + E+ +S++ +
Sbjct: 1 MNKTVNMNLGGFFFHIDEDAYQKLNRYFDAIKKSLSSDGREEI--MNDIESRVSELFSEK 58
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
L VIS+ + +VI + E + +++ Y Y S +
Sbjct: 59 LTGAKQVISLKEVDDVIAVMGQPEDYKIEDEAP---------------RQPNYNYASSGS 103
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
RK YRD ++ ++ GV +G+ Y + ++ + + ++++ +L S F + Y+ +W++
Sbjct: 104 --RKLYRDKENGIIGGVLAGLGYY--FGIDKVWLRLIMLVLLLS--FGTGFLLYIILWIV 157
Query: 181 APPAITASQKLEMYGEPITVENIGKRVAADVDTGLHR-RNASV 222
P A+T ++KLEM GEPI + NI K+V + + + +NA V
Sbjct: 158 MPEAVTTTEKLEMQGEPINISNIEKKVKEEFENLSEKFKNADV 200
>gi|169837695|ref|ZP_02870883.1| membrane protein containing PspC domain [candidate division TM7
single-cell isolate TM7a]
Length = 247
Score = 92.8 bits (229), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 67/251 (26%), Positives = 125/251 (49%), Gaps = 32/251 (12%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
MK+ ++L + I+ DA +L++YL + E+ T+ E EA + +++ R
Sbjct: 1 MKEITRIHLAKTPYDIELDAKEVLQKYLSEIKQMMGSED-----TMYEIEARMVELLGER 55
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
+ +I++ + ++ K+ + FS +E + E S +
Sbjct: 56 GVQNNGIITMSDVEDLRSKMGLPKEFSDSEST----------------EDSQADLAPSNS 99
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPL--FWMIIIGYLAVW 178
P ++ RD D+A+ GVC+GIAAY W +N + V ++ + SP F ++ Y+ +W
Sbjct: 100 PAKRLMRDTDNAIFGGVCAGIAAY--WGINPLWVRLLFII---SPFITFGTALLVYIIIW 154
Query: 179 MIAPPAITASQKLEMYGEPITVENIGKRVAADVDTGLHRRNASVLNTLG-CVLKGLFVLV 237
+ P A TA++KL+M GEP+T++++ K AA+ +R ++ L C+ GLF
Sbjct: 155 ISLPEAKTAAEKLQMRGEPVTLDSLKK--AANNSESKYRAKETLAKILRICLALGLF-FT 211
Query: 238 GIGLILAVVFG 248
+GL+ +V G
Sbjct: 212 TLGLLAVLVVG 222
>gi|88712084|ref|ZP_01106171.1| hypothetical protein FB2170_14383 [Flavobacteriales bacterium
HTCC2170]
gi|88709490|gb|EAR01723.1| hypothetical protein FB2170_14383 [Flavobacteriales bacterium
HTCC2170]
Length = 595
Score = 91.7 bits (226), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 80/310 (25%), Positives = 143/310 (46%), Gaps = 45/310 (14%)
Query: 16 IDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVRLHSEHAVISIGLIRE 75
+DE A+N + YL + F +D + + EA ++++ +L +E VI+I + E
Sbjct: 1 MDEVAFNKMRRYLEAIKRSFANTPGSD-EIQADIEARIAELFYEKLENERQVITIKEVDE 59
Query: 76 VIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEAPKRKFYRDMDHAVLC 135
VI + E + +ED S+ + T+G +K YRD++H +
Sbjct: 60 VIAVMGQPEDYMIDEDIFD---DEPQPKSKTSKSTTGRV--------KKLYRDIEHKYIG 108
Query: 136 GVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMIAPPAITASQKLEMYG 195
GVCSG+ Y ++ IR+I +L+ + F I Y+ +W++ P A+T SQKL+M G
Sbjct: 109 GVCSGLEHYLGFDALWIRLIFILLAVTTGFGF----IAYILLWILVPEAVTTSQKLDMTG 164
Query: 196 EPITVENIGKRVAADVD---------------TGLHRRNASVLNTLGCVLKGLFVLVG-- 238
+ + + NI ++V D + + + +TLG V+ LF ++G
Sbjct: 165 KAVNISNIERKVKEGFDDVADKVKSVDYEKMGNKVKSSSKTFFDTLGDVIMFLFKVLGKF 224
Query: 239 --IGLILAVVFGLFGLILELFADGYVA--HAPLTS-----DWISIPSWMILCATISVFII 289
I LI+ L GL + LF G + H P + S P W++ ++ F +
Sbjct: 225 IGILLIIIGAATLIGLFVGLFTVGVLDMIHIPGVDFYNMVNSTSAPVWVV---SLLAFFV 281
Query: 290 VAVPIMAIVF 299
V +P +++
Sbjct: 282 VGIPFFFVLY 291
>gi|126646579|ref|ZP_01719089.1| hypothetical protein ALPR1_17603 [Algoriphagus sp. PR1]
gi|126576627|gb|EAZ80875.1| hypothetical protein ALPR1_17603 [Algoriphagus sp. PR1]
Length = 650
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 82/150 (54%), Gaps = 13/150 (8%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFD--RENHTDCKTIEEFEAYLSDMMV 58
MKKT+++N+ G +FHI+ED Y+ L +YL + HF ++NH + I + E ++++ +
Sbjct: 1 MKKTISINISGILFHIEEDGYDSLRKYLDAINKHFSSYKDNH---EIISDIENRIAEIFL 57
Query: 59 VRLHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSG 118
L + VI+ + ++IEK+ T F++ E+ AG ++ Y+Y++
Sbjct: 58 SNLKNNKQVITAENVEKLIEKMGTIADFATVEEEKIEEEYEKAGPKSEDY----YKYVTP 113
Query: 119 EAPKRKFYRDM----DHAVLCGVCSGIAAY 144
++ Y+ + + +L GVC+GIA Y
Sbjct: 114 PHSEKGGYKKLVRLENKKILGGVCAGIAHY 143
>gi|67941871|ref|ZP_00533857.1| PspC [Chlorobium phaeobacteroides BS1]
gi|67911914|gb|EAM61777.1| PspC [Chlorobium phaeobacteroides BS1]
Length = 100
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 52/84 (61%), Gaps = 7/84 (8%)
Query: 129 MDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILA-SPLFWMIIIGYLAVWMIAPPAITA 187
MDH ++ GVCSG+ AY + R+ V+ T+ SPL ++I+ W++ PPA T
Sbjct: 1 MDHRMIGGVCSGLGAYFNTDPTWFRLAFVVATLSGLSPLVYVIL------WIVVPPARTI 54
Query: 188 SQKLEMYGEPITVENIGKRVAADV 211
++KLEM+GEP+ + NI K + ++
Sbjct: 55 AEKLEMFGEPVNISNIEKAIMEEM 78
>gi|167752665|ref|ZP_02424792.1| hypothetical protein ALIPUT_00922 [Alistipes putredinis DSM 17216]
gi|167659734|gb|EDS03864.1| hypothetical protein ALIPUT_00922 [Alistipes putredinis DSM 17216]
Length = 178
Score = 62.0 bits (149), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 41/182 (22%), Positives = 81/182 (44%), Gaps = 19/182 (10%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTDCKTIEEFEAYLSDMMVVR 60
MK+T+++NL + F D+DAY L+EYL + R D + + + EA L+++ R
Sbjct: 1 MKETISVNLASQAFTFDKDAYRRLKEYLDAIRR---RLPAGDPEILNDVEARLAEIFRSR 57
Query: 61 LHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYMSGEA 120
+ S V+++ +++ + ++ F +K E S +
Sbjct: 58 ISSPMMVVTLQMVQGAMAQMGDPSEFGE------------LPTGEKRDEASVDAESASCV 105
Query: 121 PKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTILASPLFWMIIIGYLAVWMI 180
R YR + + GVC GIA + + +R+I + + + W+ Y+ +W++
Sbjct: 106 SSRHLYRSRTNRSIAGVCGGIAEFFGIDSTILRLITLFLILFGGLSLWV----YIILWIV 161
Query: 181 AP 182
P
Sbjct: 162 IP 163
>gi|160890553|ref|ZP_02071556.1| hypothetical protein BACUNI_02995 [Bacteroides uniformis ATCC
8492]
gi|156860285|gb|EDO53716.1| hypothetical protein BACUNI_02995 [Bacteroides uniformis ATCC
8492]
Length = 48
Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats.
Identities = 27/42 (64%), Positives = 32/42 (76%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLHNLASHFDRENHTD 42
MKKTLT+NLGG VF+ID+DAY LL+ YL NL HF +E D
Sbjct: 1 MKKTLTVNLGGTVFNIDDDAYRLLDNYLSNLKMHFRKEAGAD 42
>gi|182416334|ref|YP_001821400.1| phage shock protein C, PspC [Opitutus terrae PB90-1]
gi|177843548|gb|ACB77800.1| phage shock protein C, PspC [Opitutus terrae PB90-1]
Length = 400
Score = 55.5 bits (132), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 40/166 (24%), Positives = 73/166 (43%), Gaps = 10/166 (6%)
Query: 1 MKKTLTMNLGGKVFHIDEDAYNLLEEYLH----NLASHFDRENHTDCKTIEEFEAYLSDM 56
M K +T+NL G F ++E Y+ L YL LA + DRE + + + E ++
Sbjct: 1 MNKVITINLNGTAFQLEEAGYDTLRAYLDAAAGKLAGNPDRE-----EILSDIEQAFAEK 55
Query: 57 MVVRLHSEHAVISIGLIREVIEKINTGEHFSSNEDSTTAGYSAGAGYSQKERETSGYRYM 116
+RL + V++ + I ++ + F+ TTAG +
Sbjct: 56 FRLRLSAHKNVVTTEQVHAAIAEMGPVD-FADEAAETTAGGTGADATGSAAASEESAAAA 114
Query: 117 SGEAPKRKFYRDMDHAVLCGVCSGIAAYTRWNVNAIRVIVVLMTIL 162
AP+R + + A++ GVC+GI AY + IR+ ++ ++
Sbjct: 115 DAGAPRRLYRLYPEGAMISGVCNGIGAYFNIDPTFIRLAFIVAAVI 160
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
Posted date: May 10, 2008 4:54 AM
Number of letters in database: 884,634,002
Number of sequences in database: 2,620,852
Database: /apps/blastdb/nr.01
Posted date: May 10, 2008 4:52 AM
Number of letters in database: 976,814,986
Number of sequences in database: 2,761,530
Database: /apps/blastdb/nr.02
Posted date: May 10, 2008 4:46 AM
Number of letters in database: 360,829,861
Number of sequences in database: 1,132,722
Lambda K H
0.326 0.140 0.427
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,543,678,346
Number of Sequences: 6515104
Number of extensions: 64672015
Number of successful extensions: 257436
Number of sequences better than 1.0e-04: 35
Number of HSP's better than 0.0 without gapping: 13
Number of HSP's successfully gapped in prelim test: 22
Number of HSP's that attempted gapping in prelim test: 257324
Number of HSP's gapped (non-prelim): 42
length of query: 351
length of database: 2,222,278,849
effective HSP length: 135
effective length of query: 216
effective length of database: 1,342,739,809
effective search space: 290031798744
effective search space used: 290031798744
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 122 (51.6 bits)