BLASTP 2.2.18 [Mar-02-2008]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= PGN_1834 hypothetical protein
(663 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
6,515,104 sequences; 2,222,278,849 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|34541509|ref|NP_905988.1| hypothetical protein PG1903 [P... 1384 0.0
gi|153808668|ref|ZP_01961336.1| hypothetical protein BACCAC... 760 0.0
gi|150004410|ref|YP_001299154.1| hypothetical protein BVU_1... 756 0.0
gi|160885087|ref|ZP_02066090.1| hypothetical protein BACOVA... 752 0.0
gi|29347667|ref|NP_811170.1| hypothetical protein BT_2257 [... 751 0.0
gi|160891325|ref|ZP_02072328.1| hypothetical protein BACUNI... 739 0.0
gi|150009873|ref|YP_001304616.1| hypothetical protein BDI_3... 737 0.0
gi|53711963|ref|YP_097955.1| hypothetical protein BF0673 [B... 737 0.0
gi|154494033|ref|ZP_02033353.1| hypothetical protein PARMER... 720 0.0
gi|167765039|ref|ZP_02437160.1| hypothetical protein BACSTE... 525 e-147
gi|167752357|ref|ZP_02424484.1| hypothetical protein ALIPUT... 387 e-105
gi|42526358|ref|NP_971456.1| hypothetical protein TDE0846 [... 155 8e-36
gi|15639837|ref|NP_219287.1| hypothetical protein TP0851 [T... 154 2e-35
>gi|34541509|ref|NP_905988.1| hypothetical protein PG1903 [Porphyromonas gingivalis W83]
gi|34397826|gb|AAQ66887.1| conserved hypothetical protein [Porphyromonas gingivalis W83]
Length = 663
Score = 1384 bits (3581), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 663/663 (100%), Positives = 663/663 (100%)
Query: 1 MKEKEKIYRHLTEEEISVLEGNYCQCADWTMVEVADGFDANRCLHVMFSGQIRIGSTRGS 60
MKEKEKIYRHLTEEEISVLEGNYCQCADWTMVEVADGFDANRCLHVMFSGQIRIGSTRGS
Sbjct: 1 MKEKEKIYRHLTEEEISVLEGNYCQCADWTMVEVADGFDANRCLHVMFSGQIRIGSTRGS 60
Query: 61 HLLPGGLHVPTGINIARLHNCIVGDEVVIFHVNSYLANYRIGDYTRIENIDTIAVTESTS 120
HLLPGGLHVPTGINIARLHNCIVGDEVVIFHVNSYLANYRIGDYTRIENIDTIAVTESTS
Sbjct: 61 HLLPGGLHVPTGINIARLHNCIVGDEVVIFHVNSYLANYRIGDYTRIENIDTIAVTESTS 120
Query: 121 FGNGVEVTVLNETGGREVTITDNLTAQTAYLMAMYRHDKVLTESLAAMARAYAESRRSDM 180
FGNGVEVTVLNETGGREVTITDNLTAQTAYLMAMYRHDKVLTESLAAMARAYAESRRSDM
Sbjct: 121 FGNGVEVTVLNETGGREVTITDNLTAQTAYLMAMYRHDKVLTESLAAMARAYAESRRSDM 180
Query: 181 GAIGSHVVIHGCNTIENVCIGDYTEITGASHLCNGSINSKKEAPVSVGYNVVCRDFILCS 240
GAIGSHVVIHGCNTIENVCIGDYTEITGASHLCNGSINSKKEAPVSVGYNVVCRDFILCS
Sbjct: 181 GAIGSHVVIHGCNTIENVCIGDYTEITGASHLCNGSINSKKEAPVSVGYNVVCRDFILCS 240
Query: 241 GSSVTDGATVSHCFVGQGTHLGHLFSAHDSLFFANCQGENGEACAVFAGPYTVSMHKSSL 300
GSSVTDGATVSHCFVGQGTHLGHLFSAHDSLFFANCQGENGEACAVFAGPYTVSMHKSSL
Sbjct: 241 GSSVTDGATVSHCFVGQGTHLGHLFSAHDSLFFANCQGENGEACAVFAGPYTVSMHKSSL 300
Query: 301 LIAGLYSFLNAGSGSNQSNHLYKLGPIHQGIVERGSKTTSDSYILWPSRIGPFTLVMGRH 360
LIAGLYSFLNAGSGSNQSNHLYKLGPIHQGIVERGSKTTSDSYILWPSRIGPFTLVMGRH
Sbjct: 301 LIAGLYSFLNAGSGSNQSNHLYKLGPIHQGIVERGSKTTSDSYILWPSRIGPFTLVMGRH 360
Query: 361 VHHIDSSAFPFSYIIEDRNESYLVPGINLRSVGTIRDAKKWPARDKRKDPDLLDNINFNL 420
VHHIDSSAFPFSYIIEDRNESYLVPGINLRSVGTIRDAKKWPARDKRKDPDLLDNINFNL
Sbjct: 361 VHHIDSSAFPFSYIIEDRNESYLVPGINLRSVGTIRDAKKWPARDKRKDPDLLDNINFNL 420
Query: 421 LSPFTAGKMMRGSRKLKKLRKILGSEGQTYVYHDMRIRSSALDKGLMFYEMGLNKFFGNS 480
LSPFTAGKMMRGSRKLKKLRKILGSEGQTYVYHDMRIRSSALDKGLMFYEMGLNKFFGNS
Sbjct: 421 LSPFTAGKMMRGSRKLKKLRKILGSEGQTYVYHDMRIRSSALDKGLMFYEMGLNKFFGNS 480
Query: 481 IIKRLEDCPCRTDEEVLSALKPTHNEGDGSWVDICGMIAPQQQIHRLASLIKEGKIKSLA 540
IIKRLEDCPCRTDEEVLSALKPTHNEGDGSWVDICGMIAPQQQIHRLASLIKEGKIKSLA
Sbjct: 481 IIKRLEDCPCRTDEEVLSALKPTHNEGDGSWVDICGMIAPQQQIHRLASLIKEGKIKSLA 540
Query: 541 EVNERMKEIHSRYYDYEWCWAYNALLEWYELNPEDLTRSKVAELVHTWMRSVIRLDEMLY 600
EVNERMKEIHSRYYDYEWCWAYNALLEWYELNPEDLTRSKVAELVHTWMRSVIRLDEMLY
Sbjct: 541 EVNERMKEIHSRYYDYEWCWAYNALLEWYELNPEDLTRSKVAELVHTWMRSVIRLDEMLY 600
Query: 601 EDAKKEFQMHSQVGFGIDLVGEMKHRDFDEVRGDFESNPFIVEVVEHIRKKRALGEEMLS 660
EDAKKEFQMHSQVGFGIDLVGEMKHRDFDEVRGDFESNPFIVEVVEHIRKKRALGEEMLS
Sbjct: 601 EDAKKEFQMHSQVGFGIDLVGEMKHRDFDEVRGDFESNPFIVEVVEHIRKKRALGEEMLS 660
Query: 661 RLE 663
RLE
Sbjct: 661 RLE 663
>gi|153808668|ref|ZP_01961336.1| hypothetical protein BACCAC_02967 [Bacteroides caccae ATCC 43185]
gi|149128494|gb|EDM19712.1| hypothetical protein BACCAC_02967 [Bacteroides caccae ATCC 43185]
Length = 662
Score = 760 bits (1963), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/658 (54%), Positives = 462/658 (70%)
Query: 6 KIYRHLTEEEISVLEGNYCQCADWTMVEVADGFDANRCLHVMFSGQIRIGSTRGSHLLPG 65
K YR LTE+E+ L+ C DW V VA+GF+ H FSG++++G LPG
Sbjct: 2 KDYRRLTEDEVLQLKSQSCLADDWGNVSVAEGFNCEYVHHTRFSGEVKLGVLDAEFTLPG 61
Query: 66 GLHVPTGINIARLHNCIVGDEVVIFHVNSYLANYRIGDYTRIENIDTIAVTESTSFGNGV 125
G+ +G+ LHN VGD I ++ +Y+ANY IG+ T IEN+D I V T+FGNGV
Sbjct: 62 GIRKHSGLRHVTLHNVTVGDNCCIENIQNYIANYEIGNDTFIENVDIILVDRLTTFGNGV 121
Query: 126 EVTVLNETGGREVTITDNLTAQTAYLMAMYRHDKVLTESLAAMARAYAESRRSDMGAIGS 185
E TVLNETGGREV I D L+A AY++A+YRH L + A+ Y+ S +G+IG
Sbjct: 122 EATVLNETGGREVLINDKLSAHQAYILALYRHRPELINRMKAITDYYSNKHASTVGSIGD 181
Query: 186 HVVIHGCNTIENVCIGDYTEITGASHLCNGSINSKKEAPVSVGYNVVCRDFILCSGSSVT 245
HV+I +I NV IGDY I G L NGSINS PV +G+ V+C DFI+ SGS V
Sbjct: 182 HVMILNTGSIRNVRIGDYCHICGTCRLTNGSINSNVTTPVHIGHGVICDDFIISSGSDVD 241
Query: 246 DGATVSHCFVGQGTHLGHLFSAHDSLFFANCQGENGEACAVFAGPYTVSMHKSSLLIAGL 305
DG +S CFVGQ LGH +SA DSLFF+NCQGENGEACA+FAGP+TV+ HKS+LLIAG+
Sbjct: 242 DGTMLSRCFVGQACKLGHNYSASDSLFFSNCQGENGEACAIFAGPFTVTHHKSTLLIAGM 301
Query: 306 YSFLNAGSGSNQSNHLYKLGPIHQGIVERGSKTTSDSYILWPSRIGPFTLVMGRHVHHID 365
+SF+NAGSGSNQSNH+YKLGPIHQG +ERG+KTTSDSYILWP+R+G F+LVMGRHV+H D
Sbjct: 302 FSFMNAGSGSNQSNHMYKLGPIHQGTMERGAKTTSDSYILWPARVGAFSLVMGRHVNHAD 361
Query: 366 SSAFPFSYIIEDRNESYLVPGINLRSVGTIRDAKKWPARDKRKDPDLLDNINFNLLSPFT 425
+S PFSY+IE RN +YLVPG+NLRSVGTIRDA+KWP RDKRKDP+ LD IN+NLLSP+T
Sbjct: 362 TSNLPFSYLIEQRNTTYLVPGVNLRSVGTIRDAQKWPKRDKRKDPNRLDYINYNLLSPYT 421
Query: 426 AGKMMRGSRKLKKLRKILGSEGQTYVYHDMRIRSSALDKGLMFYEMGLNKFFGNSIIKRL 485
KM +G LK+L+++ G + Y Y +I++S+L+ G+ FYE+ +NKF GNSIIKRL
Sbjct: 422 IQKMFKGRSILKELKRVSGETSEIYSYQSAKIKNSSLNNGIRFYEIAINKFLGNSIIKRL 481
Query: 486 EDCPCRTDEEVLSALKPTHNEGDGSWVDICGMIAPQQQIHRLASLIKEGKIKSLAEVNER 545
E ++++E+ LKP G G WVDI G+IAP+ +I RL I+ G I L +N
Sbjct: 482 EGINFQSNDEIRQRLKPDTEIGVGEWVDISGLIAPKSEIDRLLDGIENGSINRLKSINAS 541
Query: 546 MKEIHSRYYDYEWCWAYNALLEWYELNPEDLTRSKVAELVHTWMRSVIRLDEMLYEDAKK 605
E+H YY YEW WAYN + E+Y LNPE +T V +V W ++V+ LD+M+YEDAKK
Sbjct: 542 FAEMHENYYTYEWTWAYNKIQEFYGLNPETVTAQDVVTIVKAWQKAVVGLDKMVYEDAKK 601
Query: 606 EFQMHSQVGFGIDLVGEMKHRDFDEVRGDFESNPFIVEVVEHIRKKRALGEEMLSRLE 663
EF + S GFG+D + +DF++VRGDFESN F+ V++HI K ALG E++ R+E
Sbjct: 602 EFSLSSMTGFGVDGSRDDMKQDFEQVRGDFESNTFVTAVLKHIEDKTALGNELIKRIE 659
>gi|150004410|ref|YP_001299154.1| hypothetical protein BVU_1857 [Bacteroides vulgatus ATCC 8482]
gi|149932834|gb|ABR39532.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 661
Score = 756 bits (1951), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 353/657 (53%), Positives = 465/657 (70%)
Query: 6 KIYRHLTEEEISVLEGNYCQCADWTMVEVADGFDANRCLHVMFSGQIRIGSTRGSHLLPG 65
K YR LT+EEI L+ C DW +EV + F + H FSG++R+G L G
Sbjct: 2 KTYRSLTQEEIQQLKERSCTAVDWDEIEVVENFKTDYIYHTRFSGKVRLGVFEDEFTLAG 61
Query: 66 GLHVPTGINIARLHNCIVGDEVVIFHVNSYLANYRIGDYTRIENIDTIAVTESTSFGNGV 125
G+ +G+ A LHN VGD I ++ +Y+ANY IGDY IEN+D I V + FGNGV
Sbjct: 62 GMRKHSGLYHATLHNVTVGDNCCIENIKNYIANYIIGDYVFIENVDIILVDGRSKFGNGV 121
Query: 126 EVTVLNETGGREVTITDNLTAQTAYLMAMYRHDKVLTESLAAMARAYAESRRSDMGAIGS 185
EV VLNETGGREV I D L+A AY++A+YRH L + A+ YAE SD G IG
Sbjct: 122 EVAVLNETGGREVPIHDRLSAHQAYILALYRHRPELICRMKAIIDRYAEENASDTGTIGH 181
Query: 186 HVVIHGCNTIENVCIGDYTEITGASHLCNGSINSKKEAPVSVGYNVVCRDFILCSGSSVT 245
HV I I+NV IGDY +I GA L NGS+NS ++AP+ +GY VVC DFI+ SGS+V
Sbjct: 182 HVTIVDAGYIKNVRIGDYCKIEGAGRLKNGSLNSNEQAPIHIGYGVVCDDFIISSGSNVE 241
Query: 246 DGATVSHCFVGQGTHLGHLFSAHDSLFFANCQGENGEACAVFAGPYTVSMHKSSLLIAGL 305
DG ++ CF+ Q HLGH +SA DSLFF+NCQ ENGEACA+FAGP+TV+ HKS+LLIAG+
Sbjct: 242 DGTMLTRCFISQACHLGHNYSASDSLFFSNCQEENGEACAIFAGPFTVTHHKSTLLIAGM 301
Query: 306 YSFLNAGSGSNQSNHLYKLGPIHQGIVERGSKTTSDSYILWPSRIGPFTLVMGRHVHHID 365
+SF+NAGSGSNQSNH+YKLGPIHQG +ERG+KTTSDSYILWP+R+G F+LVMGRHV+H D
Sbjct: 302 FSFMNAGSGSNQSNHMYKLGPIHQGAMERGAKTTSDSYILWPARVGAFSLVMGRHVNHAD 361
Query: 366 SSAFPFSYIIEDRNESYLVPGINLRSVGTIRDAKKWPARDKRKDPDLLDNINFNLLSPFT 425
+S PFSY+IE +N +YLVPG+NLRSVGTIRDA+KWP RDKRKDP+ LD IN+NLLSP+T
Sbjct: 362 TSNLPFSYLIEQQNTTYLVPGVNLRSVGTIRDAQKWPKRDKRKDPNRLDQINYNLLSPYT 421
Query: 426 AGKMMRGSRKLKKLRKILGSEGQTYVYHDMRIRSSALDKGLMFYEMGLNKFFGNSIIKRL 485
KMM+G LK+LRK+ G +TY Y +I++S+L+ G+ FYE ++KF GNS+IKRL
Sbjct: 422 IQKMMKGRSILKELRKVSGETSETYSYQSAKIKNSSLNNGIRFYETAIHKFLGNSLIKRL 481
Query: 486 EDCPCRTDEEVLSALKPTHNEGDGSWVDICGMIAPQQQIHRLASLIKEGKIKSLAEVNER 545
E+ +DEE+ + L P G G WVDI G+IAP+ +I +L + I+ G + ++ ++++R
Sbjct: 482 EEVRFSSDEEIRARLIPDTEIGTGEWVDISGLIAPKSEIEKLMADIESGILTNVDQIHDR 541
Query: 546 MKEIHSRYYDYEWCWAYNALLEWYELNPEDLTRSKVAELVHTWMRSVIRLDEMLYEDAKK 605
E+H YY YEW WAY +LE+Y L +++T V +V W +V+ LD+M+Y DAKK
Sbjct: 542 FVEMHRNYYTYEWTWAYGKMLEFYNLRSDEITAKDVIAIVKKWQEAVVGLDKMVYADAKK 601
Query: 606 EFQMHSQVGFGIDLVGEMKHRDFDEVRGDFESNPFIVEVVEHIRKKRALGEEMLSRL 662
EF + + GFG D E +DF++VRG FESNPF+ V++HI K ALG E++ R+
Sbjct: 602 EFSLSAMTGFGADGSREEMEQDFEQVRGVFESNPFVTAVLQHIEAKTALGNELIERI 658
>gi|160885087|ref|ZP_02066090.1| hypothetical protein BACOVA_03085 [Bacteroides ovatus ATCC 8483]
gi|156109437|gb|EDO11182.1| hypothetical protein BACOVA_03085 [Bacteroides ovatus ATCC 8483]
Length = 663
Score = 752 bits (1941), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/658 (53%), Positives = 464/658 (70%)
Query: 6 KIYRHLTEEEISVLEGNYCQCADWTMVEVADGFDANRCLHVMFSGQIRIGSTRGSHLLPG 65
K YR LTE+EI L+ C DW V VA+GF+ H FSG++++G LPG
Sbjct: 2 KDYRRLTEDEILQLKSQSCLADDWGNVSVAEGFNCEYVHHTRFSGEVKLGVFEAEFTLPG 61
Query: 66 GLHVPTGINIARLHNCIVGDEVVIFHVNSYLANYRIGDYTRIENIDTIAVTESTSFGNGV 125
G+ +G+ LHN VGD I ++ +Y+ANY IG T IEN+D I V ++FGNGV
Sbjct: 62 GIKKHSGLRHVTLHNVSVGDNCCIENIQNYIANYEIGSDTFIENVDIILVDRLSTFGNGV 121
Query: 126 EVTVLNETGGREVTITDNLTAQTAYLMAMYRHDKVLTESLAAMARAYAESRRSDMGAIGS 185
EV VLNETGGREV + D L+A AY++A+YRH L + ++A Y+ S +G+IG+
Sbjct: 122 EVAVLNETGGREVLMNDKLSAHQAYILALYRHRPELINRMKSIADYYSNKHASAVGSIGN 181
Query: 186 HVVIHGCNTIENVCIGDYTEITGASHLCNGSINSKKEAPVSVGYNVVCRDFILCSGSSVT 245
HV+I +I+NV IGDY I G L NGS+NS APV +G+ V+C DFI+ SGS V
Sbjct: 182 HVMILNTGSIKNVRIGDYCHICGTCRLSNGSVNSNVTAPVHIGHGVICDDFIISSGSKVD 241
Query: 246 DGATVSHCFVGQGTHLGHLFSAHDSLFFANCQGENGEACAVFAGPYTVSMHKSSLLIAGL 305
DG ++ CFVGQ LGH +SA DSLFF+NCQGENGEACA+FAGP+TV+ HKS+LLIAG+
Sbjct: 242 DGTMLTRCFVGQSCKLGHNYSASDSLFFSNCQGENGEACAIFAGPFTVTHHKSTLLIAGM 301
Query: 306 YSFLNAGSGSNQSNHLYKLGPIHQGIVERGSKTTSDSYILWPSRIGPFTLVMGRHVHHID 365
+SF+NAGSGSNQSNH+YKLGPIHQG +ERG+KTTSDSYILWP+R+G F+LVMGRHV+H D
Sbjct: 302 FSFMNAGSGSNQSNHMYKLGPIHQGTMERGAKTTSDSYILWPARVGAFSLVMGRHVNHAD 361
Query: 366 SSAFPFSYIIEDRNESYLVPGINLRSVGTIRDAKKWPARDKRKDPDLLDNINFNLLSPFT 425
+S PFSY+IE RN +YLVPG+NLRSVGTIRDA+KWP RDKRKDP+ LD IN+NLLSP+T
Sbjct: 362 TSNLPFSYLIEQRNTTYLVPGVNLRSVGTIRDAQKWPKRDKRKDPNRLDYINYNLLSPYT 421
Query: 426 AGKMMRGSRKLKKLRKILGSEGQTYVYHDMRIRSSALDKGLMFYEMGLNKFFGNSIIKRL 485
KM +G LK+L+++ G + Y Y +I++S+L+ G+ FYE+ ++KF GNSIIKRL
Sbjct: 422 IQKMFKGRSILKELKRVSGETSEIYSYQSAKIKNSSLNNGIRFYEIAIHKFLGNSIIKRL 481
Query: 486 EDCPCRTDEEVLSALKPTHNEGDGSWVDICGMIAPQQQIHRLASLIKEGKIKSLAEVNER 545
E +T+EE+ LKP G G WVD+ G+IAP+ +I RL I+ G + L +N
Sbjct: 482 EGINFQTNEEIRQRLKPDTEIGLGEWVDVSGLIAPKSEIDRLLDGIENGTVNRLKSINAS 541
Query: 546 MKEIHSRYYDYEWCWAYNALLEWYELNPEDLTRSKVAELVHTWMRSVIRLDEMLYEDAKK 605
E+H YY YEW WAY+ + E+Y LNPE +T + +V W ++V+ LD+M+YEDAKK
Sbjct: 542 FAEMHENYYTYEWTWAYHKIQEFYGLNPETITAQDIIGIVKAWQQAVVGLDKMVYEDAKK 601
Query: 606 EFQMHSQVGFGIDLVGEMKHRDFDEVRGDFESNPFIVEVVEHIRKKRALGEEMLSRLE 663
EF + S GFG D + +DF++VRGDFESN F+ V++HI K ALG E++ R+E
Sbjct: 602 EFSLSSMTGFGADGSHDEMKQDFEQVRGDFESNTFVTAVLKHIEDKTALGNELIKRIE 659
>gi|29347667|ref|NP_811170.1| hypothetical protein BT_2257 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29339568|gb|AAO77364.1| lipoprotein, putative [Bacteroides thetaiotaomicron VPI-5482]
Length = 663
Score = 751 bits (1940), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/657 (53%), Positives = 464/657 (70%)
Query: 6 KIYRHLTEEEISVLEGNYCQCADWTMVEVADGFDANRCLHVMFSGQIRIGSTRGSHLLPG 65
K YR LTE+E+ L+ C DW V VA+GF+ H FSG++++G LPG
Sbjct: 2 KDYRKLTEDEVLQLKSQSCLADDWGNVLVAEGFNCEYVHHTRFSGEVKLGVFDAEFTLPG 61
Query: 66 GLHVPTGINIARLHNCIVGDEVVIFHVNSYLANYRIGDYTRIENIDTIAVTESTSFGNGV 125
G+ +G+ LHN +VGD I ++ +Y+ANY IG+ T IEN+D I V ++FGNGV
Sbjct: 62 GIRKHSGLRHVTLHNVVVGDNCCIENIQNYIANYEIGNDTFIENVDIILVDGLSTFGNGV 121
Query: 126 EVTVLNETGGREVTITDNLTAQTAYLMAMYRHDKVLTESLAAMARAYAESRRSDMGAIGS 185
E TVLNETGGREV I D L+A AY++A+YRH L + A+A Y+ S +G+IG
Sbjct: 122 EATVLNETGGREVLINDKLSAHQAYILALYRHRPELINRMKAIADYYSNKHASAVGSIGD 181
Query: 186 HVVIHGCNTIENVCIGDYTEITGASHLCNGSINSKKEAPVSVGYNVVCRDFILCSGSSVT 245
HV+I +I+NV IGDY I G L NGS+NS APV +G+ V+C DFI+ SGS V
Sbjct: 182 HVMILNTGSIKNVRIGDYCHICGTCRLTNGSVNSNVTAPVHIGHGVICDDFIISSGSEVD 241
Query: 246 DGATVSHCFVGQGTHLGHLFSAHDSLFFANCQGENGEACAVFAGPYTVSMHKSSLLIAGL 305
DG ++ CFVGQ LGH +SA DSLFF+NCQGENGEACA+FAGP+TV+ HKS+LLIAG+
Sbjct: 242 DGTMLTRCFVGQSCKLGHNYSASDSLFFSNCQGENGEACAIFAGPFTVTHHKSTLLIAGM 301
Query: 306 YSFLNAGSGSNQSNHLYKLGPIHQGIVERGSKTTSDSYILWPSRIGPFTLVMGRHVHHID 365
+SF+NAGSGSNQSNH+YKLGPIHQG +ERG+KTTSDSYILWP+R+G F+LVMGRHV+H D
Sbjct: 302 FSFMNAGSGSNQSNHMYKLGPIHQGTMERGAKTTSDSYILWPARVGAFSLVMGRHVNHAD 361
Query: 366 SSAFPFSYIIEDRNESYLVPGINLRSVGTIRDAKKWPARDKRKDPDLLDNINFNLLSPFT 425
+S PFSY+IE RN +YLVPG+NLRSVGTIRDA+KWP RDKRKDP+ LD IN+NLLSP+T
Sbjct: 362 TSNLPFSYLIEQRNTTYLVPGVNLRSVGTIRDAQKWPKRDKRKDPNRLDYINYNLLSPYT 421
Query: 426 AGKMMRGSRKLKKLRKILGSEGQTYVYHDMRIRSSALDKGLMFYEMGLNKFFGNSIIKRL 485
KM +G LK+LR++ G + Y Y +I++S+L+ G+ FYE+ ++KF GNSIIKRL
Sbjct: 422 IQKMFKGRSILKELRRVSGETSEIYSYQSAKIKNSSLNNGIRFYEIAIHKFLGNSIIKRL 481
Query: 486 EDCPCRTDEEVLSALKPTHNEGDGSWVDICGMIAPQQQIHRLASLIKEGKIKSLAEVNER 545
E +++EE+ LKP G G WVD+ G+IAP+ +I RL I+ G + L +N
Sbjct: 482 EGINFQSNEEIRQRLKPDTEIGTGEWVDMSGLIAPKSEIDRLLDGIENGSVNRLKSINAS 541
Query: 546 MKEIHSRYYDYEWCWAYNALLEWYELNPEDLTRSKVAELVHTWMRSVIRLDEMLYEDAKK 605
E+H YY YEW WAYN + E+Y LNP+++T + +V W +V+ LD+M+Y+DA+K
Sbjct: 542 FAEMHENYYTYEWTWAYNKIQEFYGLNPDEITAQDIIRIVKAWKEAVVGLDKMVYDDARK 601
Query: 606 EFQMHSQVGFGIDLVGEMKHRDFDEVRGDFESNPFIVEVVEHIRKKRALGEEMLSRL 662
EF + S GFG D + +DF++VRGDFESN F+ V++HI K ALG E++ R+
Sbjct: 602 EFSLSSMTGFGADGSHDEMKQDFEQVRGDFESNTFVTAVLKHIEDKTALGNELIKRI 658
>gi|160891325|ref|ZP_02072328.1| hypothetical protein BACUNI_03774 [Bacteroides uniformis ATCC 8492]
gi|156859546|gb|EDO52977.1| hypothetical protein BACUNI_03774 [Bacteroides uniformis ATCC 8492]
Length = 663
Score = 739 bits (1908), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/656 (53%), Positives = 456/656 (69%)
Query: 8 YRHLTEEEISVLEGNYCQCADWTMVEVADGFDANRCLHVMFSGQIRIGSTRGSHLLPGGL 67
YR LTE+EI L+ C DW V VA+ F H FSG++ +G +LPGG+
Sbjct: 3 YRRLTEDEILRLKSQSCLADDWGKVTVAEEFSTEFVHHTRFSGEVCLGVFHSEFMLPGGI 62
Query: 68 HVPTGINIARLHNCIVGDEVVIFHVNSYLANYRIGDYTRIENIDTIAVTESTSFGNGVEV 127
+G+ LHN VGD I ++ +Y+ANY IG T IEN+D I V + FGNGVEV
Sbjct: 63 RKHSGLRHVTLHNVTVGDNCCIENIQNYIANYEIGHDTFIENVDIILVDGVSKFGNGVEV 122
Query: 128 TVLNETGGREVTITDNLTAQTAYLMAMYRHDKVLTESLAAMARAYAESRRSDMGAIGSHV 187
+VLNETGGREV I D L+A AY++A+YRH L + + Y+ S +G+IG+HV
Sbjct: 123 SVLNETGGREVLINDKLSAHQAYILALYRHRPELIARMKEITDFYSNKHASAVGSIGNHV 182
Query: 188 VIHGCNTIENVCIGDYTEITGASHLCNGSINSKKEAPVSVGYNVVCRDFILCSGSSVTDG 247
+I +I+NV IGDY I G L NGSINS + APV +G+ V+C DFI+ +GS V DG
Sbjct: 183 MILNTGSIKNVRIGDYCRICGTCRLYNGSINSNEVAPVHIGHGVICDDFIISTGSHVDDG 242
Query: 248 ATVSHCFVGQGTHLGHLFSAHDSLFFANCQGENGEACAVFAGPYTVSMHKSSLLIAGLYS 307
A +S CFVGQ LGH +SA DSLFF+NCQGENGEACA+FAGPYTV+ HKS+LLIAG++S
Sbjct: 243 AMLSRCFVGQACKLGHNYSASDSLFFSNCQGENGEACAIFAGPYTVTHHKSTLLIAGMFS 302
Query: 308 FLNAGSGSNQSNHLYKLGPIHQGIVERGSKTTSDSYILWPSRIGPFTLVMGRHVHHIDSS 367
F+NAGSGSNQSNH+YKLGPIHQG +ERG+KTTSDSYILWP+R+G F+LVMGRHV+H D+S
Sbjct: 303 FMNAGSGSNQSNHMYKLGPIHQGTLERGAKTTSDSYILWPARVGAFSLVMGRHVNHSDTS 362
Query: 368 AFPFSYIIEDRNESYLVPGINLRSVGTIRDAKKWPARDKRKDPDLLDNINFNLLSPFTAG 427
PFSY+IE N +YLVPG+NLRSVGTIRDA+KWP RD R DP+ LD IN+NLLSP+T
Sbjct: 363 NLPFSYLIEQNNTTYLVPGVNLRSVGTIRDAQKWPKRDGRTDPNKLDYINYNLLSPYTVQ 422
Query: 428 KMMRGSRKLKKLRKILGSEGQTYVYHDMRIRSSALDKGLMFYEMGLNKFFGNSIIKRLED 487
KM +G L+ LR G Y +H +IR+SAL KG+ FYE+ ++KF GNS+IKRLE
Sbjct: 423 KMFKGRETLQNLRHASGELSDIYSFHSAKIRNSALVKGIRFYEIAIHKFLGNSVIKRLEG 482
Query: 488 CPCRTDEEVLSALKPTHNEGDGSWVDICGMIAPQQQIHRLASLIKEGKIKSLAEVNERMK 547
R++EE+ + LKP G G WVDI G+IAP+ +I L I+ GK+ L +N +
Sbjct: 483 IDFRSNEEIRARLKPDTAIGSGEWVDISGLIAPKSEIDALIDGIESGKVNRLKSINAEFE 542
Query: 548 EIHSRYYDYEWCWAYNALLEWYELNPEDLTRSKVAELVHTWMRSVIRLDEMLYEDAKKEF 607
++HS YY YEW WAY L E+Y + PE +T V +V W +V+ LD M+YEDAKKEF
Sbjct: 543 KMHSNYYTYEWTWAYEKLEEFYGIKPEGMTAEDVIHIVEKWKEAVVGLDRMVYEDAKKEF 602
Query: 608 QMHSQVGFGIDLVGEMKHRDFDEVRGDFESNPFIVEVVEHIRKKRALGEEMLSRLE 663
+ S GFG D K DF++VRGDFESNPF++ V++HI K ALG+E++ R++
Sbjct: 603 SLASMTGFGADGSRLEKELDFEQVRGDFESNPFVMAVLKHIEVKTALGDELIGRMQ 658
>gi|150009873|ref|YP_001304616.1| hypothetical protein BDI_3289 [Parabacteroides distasonis ATCC
8503]
gi|149938297|gb|ABR44994.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 662
Score = 737 bits (1902), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/655 (54%), Positives = 459/655 (70%)
Query: 9 RHLTEEEISVLEGNYCQCADWTMVEVADGFDANRCLHVMFSGQIRIGSTRGSHLLPGGLH 68
R+L +EI+ L C+ DW V V + FD HV FSG +++G+ R LPGGL
Sbjct: 5 RNLRPDEIATLRSQACRADDWNQVWVPEVFDIEYVNHVRFSGVVKLGAFRKIFTLPGGLV 64
Query: 69 VPTGINIARLHNCIVGDEVVIFHVNSYLANYRIGDYTRIENIDTIAVTESTSFGNGVEVT 128
+G+ LHNC +GD V+I +V +Y+ANY+IGD I+NI+ + V +FGN VEV+
Sbjct: 65 KHSGLRHVTLHNCTLGDNVLIENVQNYIANYQIGDDCFIQNINVMLVEGKATFGNNVEVS 124
Query: 129 VLNETGGREVTITDNLTAQTAYLMAMYRHDKVLTESLAAMARAYAESRRSDMGAIGSHVV 188
VLNETGGREV I D L+A AY++A+YRH L E L M AY E S G +G V
Sbjct: 125 VLNETGGREVPIYDGLSASLAYIIALYRHRPALIERLRDMITAYTEGIASTEGTVGDKVK 184
Query: 189 IHGCNTIENVCIGDYTEITGASHLCNGSINSKKEAPVSVGYNVVCRDFILCSGSSVTDGA 248
I TI NV IGDY I ++ L NGS+NSK+EAPV +G +V+ +DFI+ SG+ + D A
Sbjct: 185 IVNTGTIRNVKIGDYATIENSARLENGSVNSKREAPVFIGDSVIAQDFIVSSGAKIADAA 244
Query: 249 TVSHCFVGQGTHLGHLFSAHDSLFFANCQGENGEACAVFAGPYTVSMHKSSLLIAGLYSF 308
+ CF+GQ + H FSAHDSL F+NC ENGEACA+FAGP+TVSMHKSSLLIAG+YSF
Sbjct: 245 KIIRCFIGQACQVTHNFSAHDSLLFSNCAFENGEACAIFAGPFTVSMHKSSLLIAGMYSF 304
Query: 309 LNAGSGSNQSNHLYKLGPIHQGIVERGSKTTSDSYILWPSRIGPFTLVMGRHVHHIDSSA 368
LNAGSGSNQSNH+YKLGPIHQGIVERGSKTTSDSYILWP+RIG F+LVMGRH HH D+S
Sbjct: 305 LNAGSGSNQSNHMYKLGPIHQGIVERGSKTTSDSYILWPARIGAFSLVMGRHHHHSDTSD 364
Query: 369 FPFSYIIEDRNESYLVPGINLRSVGTIRDAKKWPARDKRKDPDLLDNINFNLLSPFTAGK 428
PFSY+IE +E+YLVPGINLRSVGTIRDA+KWP RDKR DPD LD IN+NLLSP+T K
Sbjct: 365 IPFSYLIEKDDETYLVPGINLRSVGTIRDAQKWPKRDKRTDPDRLDMINYNLLSPYTIQK 424
Query: 429 MMRGSRKLKKLRKILGSEGQTYVYHDMRIRSSALDKGLMFYEMGLNKFFGNSIIKRLEDC 488
M++ LK L+ ++G + Y Y + RI+ S+L L FY M +NKFFGNS+IKRLE
Sbjct: 425 MLKAVDILKNLQALVGETSEIYYYQNTRIKGSSLRNALNFYGMAINKFFGNSLIKRLEGT 484
Query: 489 PCRTDEEVLSALKPTHNEGDGSWVDICGMIAPQQQIHRLASLIKEGKIKSLAEVNERMKE 548
+ EEV L+PT ++G G W+D+ G+I P++ + L I++G+I SL +V +
Sbjct: 485 TYCSMEEVWEQLRPTESKGSGEWLDLAGLILPREPLEALLQDIEQGEIASLEDVECFFRL 544
Query: 549 IHSRYYDYEWCWAYNALLEWYELNPEDLTRSKVAELVHTWMRSVIRLDEMLYEDAKKEFQ 608
+H RYY EW WAY + +Y ++ ++ +++ +LV W VIRLDEMLYEDA+KEF
Sbjct: 545 VHGRYYSLEWTWAYEMIERYYGVDLRSISAAEIIDLVRRWQECVIRLDEMLYEDARKEFS 604
Query: 609 MHSQVGFGIDLVGEMKHRDFDEVRGDFESNPFIVEVVEHIRKKRALGEEMLSRLE 663
+ S +GFG+D + K +DF+ VRGDF +NPF+ V EHI KRALG+E++ RL+
Sbjct: 605 LTSMIGFGVDGSNKEKQQDFEGVRGDFVNNPFVTAVQEHIVNKRALGDELIERLK 659
>gi|53711963|ref|YP_097955.1| hypothetical protein BF0673 [Bacteroides fragilis YCH46]
gi|60680165|ref|YP_210309.1| hypothetical protein BF0599 [Bacteroides fragilis NCTC 9343]
gi|52214828|dbj|BAD47421.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
gi|60491599|emb|CAH06351.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343]
Length = 663
Score = 737 bits (1902), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 346/657 (52%), Positives = 457/657 (69%)
Query: 6 KIYRHLTEEEISVLEGNYCQCADWTMVEVADGFDANRCLHVMFSGQIRIGSTRGSHLLPG 65
K YR LTE+E+ L+ C DW V VA+ F H FSG++++G +LPG
Sbjct: 3 KTYRRLTEDEVLQLKSQSCLADDWNKVAVAEEFTTEFVHHTRFSGEVKLGVFHSDFILPG 62
Query: 66 GLHVPTGINIARLHNCIVGDEVVIFHVNSYLANYRIGDYTRIENIDTIAVTESTSFGNGV 125
G+ +G+ LHN VGD I ++ +Y+ANY IG+ T IEN+D I V T FGNGV
Sbjct: 63 GIKKHSGLRHVTLHNVTVGDNCCIENIQNYIANYEIGNNTFIENVDIILVDGLTQFGNGV 122
Query: 126 EVTVLNETGGREVTITDNLTAQTAYLMAMYRHDKVLTESLAAMARAYAESRRSDMGAIGS 185
E VLNETGGREV I D L+A AY++A+YRH L + + Y+ S +G IG+
Sbjct: 123 ETAVLNETGGREVLINDKLSAHQAYILALYRHRPELISRMKEITDYYSNKHASAVGTIGN 182
Query: 186 HVVIHGCNTIENVCIGDYTEITGASHLCNGSINSKKEAPVSVGYNVVCRDFILCSGSSVT 245
HV+I +I+NV IGD+ I G L NGSINS + APV +G+ V+C DFI+ SGS V
Sbjct: 183 HVMILNTGSIKNVRIGDFCRICGTCRLYNGSINSNESAPVHIGHGVICDDFIISSGSHVD 242
Query: 246 DGATVSHCFVGQGTHLGHLFSAHDSLFFANCQGENGEACAVFAGPYTVSMHKSSLLIAGL 305
DGA ++ CFVGQ LGH +SA DSLFF+NCQGENGEACA+FAGPYTV+ HKS+LLIAG+
Sbjct: 243 DGAMLTRCFVGQACQLGHNYSASDSLFFSNCQGENGEACAIFAGPYTVTHHKSTLLIAGM 302
Query: 306 YSFLNAGSGSNQSNHLYKLGPIHQGIVERGSKTTSDSYILWPSRIGPFTLVMGRHVHHID 365
+SF+NAGSGSNQSNH+YKLGPIHQG +ERG+KTTSDSYILWP+R+G F+LVMGRHV+H D
Sbjct: 303 FSFMNAGSGSNQSNHMYKLGPIHQGTLERGAKTTSDSYILWPARVGAFSLVMGRHVNHAD 362
Query: 366 SSAFPFSYIIEDRNESYLVPGINLRSVGTIRDAKKWPARDKRKDPDLLDNINFNLLSPFT 425
+S PFSY+IE +N +YLVPG+NLRSVGTIRDA+KWP RDKR DP+ LD IN+NLLSP+T
Sbjct: 363 TSNLPFSYLIEQQNTTYLVPGVNLRSVGTIRDAQKWPKRDKRTDPNRLDYINYNLLSPYT 422
Query: 426 AGKMMRGSRKLKKLRKILGSEGQTYVYHDMRIRSSALDKGLMFYEMGLNKFFGNSIIKRL 485
KM +G LK+L+++ G + Y Y +I++S+L+ G+ +YE+ ++KF GNSIIKRL
Sbjct: 423 IQKMFKGRSILKELKRVSGETSEIYSYQSAKIKNSSLNSGIRYYEIAIHKFLGNSIIKRL 482
Query: 486 EDCPCRTDEEVLSALKPTHNEGDGSWVDICGMIAPQQQIHRLASLIKEGKIKSLAEVNER 545
E + +EE+ LKP G G WVDI G+IAP+ ++ +L I+ G+I L +N
Sbjct: 483 EGINFKDNEEIRRRLKPDTEIGVGEWVDIAGLIAPKSEVEKLIDGIESGEINRLKSMNAC 542
Query: 546 MKEIHSRYYDYEWCWAYNALLEWYELNPEDLTRSKVAELVHTWMRSVIRLDEMLYEDAKK 605
+H YY YEW WAY+ + E+Y LNPE +T + +V W +V+ LD M+Y+DA+K
Sbjct: 543 FAAMHDNYYTYEWTWAYHKIQEFYGLNPETITAKDIIAIVRAWREAVVGLDRMVYDDARK 602
Query: 606 EFQMHSQVGFGIDLVGEMKHRDFDEVRGDFESNPFIVEVVEHIRKKRALGEEMLSRL 662
EF + S GFG D + DF +VRGDFESNPF+ V++HI K ALGEE+++R+
Sbjct: 603 EFSLSSMTGFGADGSRDEMKLDFGQVRGDFESNPFVTAVLKHIDDKTALGEELINRI 659
>gi|154494033|ref|ZP_02033353.1| hypothetical protein PARMER_03378 [Parabacteroides merdae ATCC
43184]
gi|154086293|gb|EDN85338.1| hypothetical protein PARMER_03378 [Parabacteroides merdae ATCC
43184]
Length = 664
Score = 720 bits (1859), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/659 (52%), Positives = 454/659 (68%)
Query: 5 EKIYRHLTEEEISVLEGNYCQCADWTMVEVADGFDANRCLHVMFSGQIRIGSTRGSHLLP 64
E RHL EEI+ LE N C +W ++VA F A +V FSG + +G LP
Sbjct: 3 EGYLRHLLSEEIADLEMNGCTADNWENIKVASPFHAEHVCNVHFSGSVALGLFEKEFTLP 62
Query: 65 GGLHVPTGINIARLHNCIVGDEVVIFHVNSYLANYRIGDYTRIENIDTIAVTESTSFGNG 124
GG+ +GI A LHNC +GD +I +V++Y++NY IGD I+N++ + V +SFGN
Sbjct: 63 GGVKKHSGIRNATLHNCKIGDNTLIENVHNYISNYFIGDDCFIQNVNVMYVEGRSSFGNN 122
Query: 125 VEVTVLNETGGREVTITDNLTAQTAYLMAMYRHDKVLTESLAAMARAYAESRRSDMGAIG 184
VEV+VLNETGGREV I + L+A AYL+A+YRH L L AM +AE + + G IG
Sbjct: 123 VEVSVLNETGGREVPIYNGLSASLAYLIALYRHRPALILRLQAMIADFAERQTGNYGFIG 182
Query: 185 SHVVIHGCNTIENVCIGDYTEITGASHLCNGSINSKKEAPVSVGYNVVCRDFILCSGSSV 244
+HV I T+ N I DY + + L NG++NS APV +G +V+ DFI+ SG+ V
Sbjct: 183 NHVKIINTGTVRNTVIADYATVENCTRLDNGTVNSNVNAPVYIGDSVIAEDFIISSGAVV 242
Query: 245 TDGATVSHCFVGQGTHLGHLFSAHDSLFFANCQGENGEACAVFAGPYTVSMHKSSLLIAG 304
D A + CF+GQ H+ H FSAHDSL F+NC ENGEACA+FAGP+TVSMHKSSLLIAG
Sbjct: 243 ADAAKIIRCFIGQACHVTHNFSAHDSLLFSNCAFENGEACAIFAGPFTVSMHKSSLLIAG 302
Query: 305 LYSFLNAGSGSNQSNHLYKLGPIHQGIVERGSKTTSDSYILWPSRIGPFTLVMGRHVHHI 364
+YSFLNAGSGSNQSNH+YKLGPIHQGIVERGSKTTSDSYILWP+R+G F+LVMGRH HH
Sbjct: 303 MYSFLNAGSGSNQSNHMYKLGPIHQGIVERGSKTTSDSYILWPARVGAFSLVMGRHHHHS 362
Query: 365 DSSAFPFSYIIEDRNESYLVPGINLRSVGTIRDAKKWPARDKRKDPDLLDNINFNLLSPF 424
D+S PFSY+IE +E+YLVPG+NLRSVGTIRDA+KWP RDKR D LD IN+NLLSP+
Sbjct: 363 DTSDMPFSYLIEKDDETYLVPGVNLRSVGTIRDAQKWPKRDKRTDQQRLDMINYNLLSPY 422
Query: 425 TAGKMMRGSRKLKKLRKILGSEGQTYVYHDMRIRSSALDKGLMFYEMGLNKFFGNSIIKR 484
T KMM+ LK L++++G + Y Y + RI+ S+L L Y M +NKF GNS+IKR
Sbjct: 423 TIYKMMKAVGILKNLQELVGETSEVYYYQNTRIKGSSLRTALDLYGMAINKFLGNSLIKR 482
Query: 485 LEDCPCRTDEEVLSALKPTHNEGDGSWVDICGMIAPQQQIHRLASLIKEGKIKSLAEVNE 544
LE + EEV S LKPT + G G W+D+ G+I P++++ RL ++EG+I SL + E
Sbjct: 483 LEGTDFCSMEEVWSQLKPTSSAGRGEWLDLSGLILPREELDRLIEKVEEGEITSLEAIEE 542
Query: 545 RMKEIHSRYYDYEWCWAYNALLEWYELNPEDLTRSKVAELVHTWMRSVIRLDEMLYEDAK 604
+HS YYD EW WAY+ L E+Y +N ++ +++ +LV W SVI LD +LY+DAK
Sbjct: 543 FFAAMHSNYYDMEWTWAYDMLEEYYGVNLSSISAAQIVDLVRRWQDSVIGLDNLLYKDAK 602
Query: 605 KEFQMHSQVGFGIDLVGEMKHRDFDEVRGDFESNPFIVEVVEHIRKKRALGEEMLSRLE 663
KEF + GFG+D + K DF+ VRG FESNPF+ V EHI K+ALG+E++ R+E
Sbjct: 603 KEFSLTFMTGFGVDGSDKEKQEDFEGVRGAFESNPFVTAVKEHIVVKKALGDELIERME 661
>gi|167765039|ref|ZP_02437160.1| hypothetical protein BACSTE_03433 [Bacteroides stercoris ATCC
43183]
gi|167697708|gb|EDS14287.1| hypothetical protein BACSTE_03433 [Bacteroides stercoris ATCC
43183]
Length = 423
Score = 525 bits (1351), Expect = e-147, Method: Compositional matrix adjust.
Identities = 241/418 (57%), Positives = 307/418 (73%)
Query: 246 DGATVSHCFVGQGTHLGHLFSAHDSLFFANCQGENGEACAVFAGPYTVSMHKSSLLIAGL 305
DGA ++ CFVGQ LGH +SA DSLFF+NCQGENGEACA+FAGPYTV+ HKS+LLIAG+
Sbjct: 3 DGAMLTRCFVGQACKLGHNYSASDSLFFSNCQGENGEACAIFAGPYTVTHHKSTLLIAGM 62
Query: 306 YSFLNAGSGSNQSNHLYKLGPIHQGIVERGSKTTSDSYILWPSRIGPFTLVMGRHVHHID 365
+SF+NAGSGSNQSNH+YKLGPIHQG +ERG+KTTSDSYILWP+R+G F+LVMGRHV+H D
Sbjct: 63 FSFMNAGSGSNQSNHMYKLGPIHQGTLERGAKTTSDSYILWPARVGAFSLVMGRHVNHSD 122
Query: 366 SSAFPFSYIIEDRNESYLVPGINLRSVGTIRDAKKWPARDKRKDPDLLDNINFNLLSPFT 425
+S PFSY+IE N +YLVPG+NLRSVGTIRDA+KWP RD+R D + LD IN+NLLSP+T
Sbjct: 123 TSNLPFSYLIEQNNTTYLVPGVNLRSVGTIRDAQKWPRRDQRTDTNKLDFINYNLLSPYT 182
Query: 426 AGKMMRGSRKLKKLRKILGSEGQTYVYHDMRIRSSALDKGLMFYEMGLNKFFGNSIIKRL 485
KM +G LK LR G Y +H +IR+SAL KG+ FYE+ ++KF GNS+IKRL
Sbjct: 183 VQKMFKGRETLKNLRYASGELSDIYSFHSAKIRNSALVKGIRFYEIAIHKFLGNSVIKRL 242
Query: 486 EDCPCRTDEEVLSALKPTHNEGDGSWVDICGMIAPQQQIHRLASLIKEGKIKSLAEVNER 545
E T+EE+ + LKP G G WVDI G+IAP+ +I L I+ G I L +N
Sbjct: 243 EGIGFHTNEEIRARLKPDTPIGSGEWVDISGLIAPKSEIDALIDGIESGAINRLKHINAE 302
Query: 546 MKEIHSRYYDYEWCWAYNALLEWYELNPEDLTRSKVAELVHTWMRSVIRLDEMLYEDAKK 605
+ +H YY YEW WAY L E+Y + PE++T + +V W +V+ LD M+YEDAKK
Sbjct: 303 FERMHRNYYTYEWTWAYEKLEEFYGIAPENMTAEDIIHIVEKWKEAVVGLDRMVYEDAKK 362
Query: 606 EFQMHSQVGFGIDLVGEMKHRDFDEVRGDFESNPFIVEVVEHIRKKRALGEEMLSRLE 663
EF + S GFG D K DF++VRGDFE+NPF+ V++HI K ALG+E++ R++
Sbjct: 363 EFSLASMTGFGADGSRLEKELDFEQVRGDFENNPFVTAVLKHIDVKTALGDELIGRMQ 420
>gi|167752357|ref|ZP_02424484.1| hypothetical protein ALIPUT_00601 [Alistipes putredinis DSM 17216]
gi|167660598|gb|EDS04728.1| hypothetical protein ALIPUT_00601 [Alistipes putredinis DSM 17216]
Length = 575
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 235/627 (37%), Positives = 319/627 (50%), Gaps = 58/627 (9%)
Query: 8 YRHLTEEEISVLEGNYCQCADWTMVEVADGFDANRCLHVMFSGQIRIGSTRGSHLLPGGL 67
YR T EI L +W +EVA F + G+++IG RG
Sbjct: 4 YRKPTPAEIEALTAAGNSAENWDAIEVAQDFTPAQLSGCRLEGRVQIG--RG-------- 53
Query: 68 HVPTGINIARLHNCIVGDEVVIFHVNSYLANYRIGDYTRIENIDTIAVTESTSFGNGVEV 127
ARL C + NYRIG+ IE + + +SFGNGV V
Sbjct: 54 --------ARLRRCTI-------------RNYRIGEEALIEGVTALECRRESSFGNGVRV 92
Query: 128 TVLNETGGREVTITDNLTAQTAYLMAMYRHDKVLTESLAAMARAYAESRRSDMGAIGSHV 187
+NE GGR V I D LTAQTAY++A+YR+ E++ M YA RR +G +G H
Sbjct: 93 AAINENGGRTVRIYDRLTAQTAYILAVYRYRPEAVEAIERMIERYATERRDTLGTVGPHA 152
Query: 188 VIHGCNTIENVCIGDYTEITGASHLCNGSINSKKEAPVSVGYNVVCRDFILCSGSSVTDG 247
I G I V IG+ I GAS L NG++ A VG +V RDFI G+ + G
Sbjct: 153 RITGARFIREVNIGEGATIDGASLLENGTVC----AGAYVGIDVQARDFIAAEGARIDGG 208
Query: 248 ATVSHCFVGQGTHLGHLFSAHDSLFFANCQGENGEACAVFAGPYTVSMHKSSLLIAGLYS 307
+ CF G+ L F+A DSLFFAN ENGEA ++FAGPYTVS HKSSLLIAG++S
Sbjct: 209 TLLERCFAGECCTLDKHFTAVDSLFFANSHCENGEAVSIFAGPYTVSHHKSSLLIAGMFS 268
Query: 308 FLNAGSGSNQSNHLYKLGPIHQGIVERGSKTTSDSYILWPSRIGPFTLVMGRHVHHIDSS 367
F NAGSG+NQSNHL+K G +HQ + RG K S +YI+ P+ GPFTLV+GRH H D+S
Sbjct: 269 FFNAGSGANQSNHLFKSGAVHQSVHLRGCKFGSSTYIMAPAIEGPFTLVLGRHTQHHDTS 328
Query: 368 AFPFSYIIEDRNESYLVPGINLRSVGTIRDAKKWPARDKRKDPDLLDNINFNLLSPFTAG 427
AFPFSY++E S L+PG NL S GT+RD KW RD+R D INF +P+ AG
Sbjct: 329 AFPFSYLVEQDGRSALMPGANLTSYGTVRDIGKWLERDRRTVK--RDRINFEEYNPYLAG 386
Query: 428 KMMRGSRKLKKLRKILGSEGQTYVYHDMRIRSSALDKGLMFYEMGLNKFFGNSIIKRLED 487
M+ L L + + ++YV++ IRS+ L +GL Y + G
Sbjct: 387 GMIDAVNTLNSLAEAH-PDAESYVHNHALIRSTQLQRGLKLYNKAIVASLG--------- 436
Query: 488 CPCRTDEEVLSALKPTHNEGDGSWVDICGMIAPQQQIHRLASLIKEGKIKSLAEVNERMK 547
+L +P +G G W D+ G P++++ R+ I G I SL ++
Sbjct: 437 -------AMLRNGEPGRADGTGRWNDVAGQYVPRREVKRILDAIANGGIDSLEGIDRAFD 489
Query: 548 EIHSRYYDYEWCWAYNALLEWYELNPEDLTRSKVAELVHTWMRSVIRLDEMLYEDAKKEF 607
I + Y Y WA L + P + ++AE V R+ L + +D ++
Sbjct: 490 RIAADYDHYARSWAEGVLAQLLGHAP---SPEEIAEAVTAGERTRETLRKSAEDDRARDC 546
Query: 608 QMHSQVGFGIDLVG-EMKHRDFDEVRG 633
VG+G+D E K +D+ VRG
Sbjct: 547 SPAMAVGYGVDADSEEEKMQDYHTVRG 573
>gi|42526358|ref|NP_971456.1| hypothetical protein TDE0846 [Treponema denticola ATCC 35405]
gi|41816470|gb|AAS11337.1| conserved hypothetical protein [Treponema denticola ATCC 35405]
Length = 720
Score = 155 bits (393), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 168/685 (24%), Positives = 281/685 (41%), Gaps = 68/685 (9%)
Query: 8 YRHLTEEEISVLEGNYCQCADWTMVEVADGFDANRCLHVMFSGQIRIGSTRGSHLLPGGL 67
YR L +EI L N DW+M+ V D FD + + +F+G +RI ST+ +L
Sbjct: 40 YRQLNSDEIERLIKNGNSSTDWSMILVEDPFDTDLITNSLFAGLVRIASTQNFYLKYHDF 99
Query: 68 HVPTGINIARLHNCIVGDEVVIFHVNSYLANYRIGDYTRIENIDTIAVTESTSFGNG--- 124
VP GI +++ +C +G+ I H +YL++Y IGD + ID + T + FG G
Sbjct: 100 TVPVGITNSKIISCDIGENCAI-HYCAYLSHYIIGDRVILSRIDEMCTTNHSKFGEGLVK 158
Query: 125 --------VEVTVLNETGGREVTITDNLTAQTAYLMAMYRHDKVLTESLAAMARAYAESR 176
V + +NE GGREV ++ A+L A YR D L + + + +S
Sbjct: 159 DGEDEKVRVTIDTVNEAGGREVYPFYDMITADAFLWARYRDDDKLIKKCEEITQNSKDSS 218
Query: 177 RSDMGAIGSHVVIHGCNTIENVCIGDYTEITGASHLCNGSINSKKEAPVSVGYNVVCRDF 236
R G +GS I C I++V G + GA+ L N +I S E P +G V +
Sbjct: 219 RGYYGEVGSDSAIKSCRIIKDVNFGSSVYVKGANKLKNLTIKSTAEEPSQIGEGVELVNG 278
Query: 237 ILCSGSSVTDGATVSHCFVGQGTHLGHLFSAHDSLFFANCQGENGEACAVFAGPYTVSMH 296
I+ GS V G +G L + S+ N E PY H
Sbjct: 279 IIGFGSRVFYGVKAVRFVLGNNCELKYGTRLIHSIVGDNSTISCCEVLNALIFPYHEQHH 338
Query: 297 KSSLLIAGL---YSFLNAGS--GSNQSNHLYKLGPIHQGIVERGSKTTSDSYILWPSRIG 351
+S L+A + S + AG+ GSN + G + I RG S + R
Sbjct: 339 NNSFLVAAMIQGQSNMAAGATVGSNHNTR----GNDGEIIAGRGFWPGLSSTLKHNCRFA 394
Query: 352 PFTLVMGRHVHHIDSSAFPFSYIIEDRNESYL--VPG----INLRSVGTIRDAKKWPARD 405
F ++ + FPFS ++ D +E L +P N+ ++ R+ KK+ RD
Sbjct: 395 SFVILAKANYPAELDIPFPFSLVLNDPHEDRLEIMPAYYWMYNMYALE--RNNKKFLKRD 452
Query: 406 KRKDPDLLDNINFNLLSPFTAGKMMRGSRKLKKLRKILGSEGQTYVYHDMRIRSSALDKG 465
KR +I L+P TA +++ L+K E I D+
Sbjct: 453 KRITK--TQHIETAYLAPDTAAEILNARTLLEKWICAAWIEAGNKALSIDTILKDKKDEA 510
Query: 466 LMFYEMGLNKFFGN---SIIKRLEDCPCRTDEEV---LSALKPTHNEG------------ 507
+ G N IIK +E D + ++ L ++G
Sbjct: 511 KNLFVKGENIERSKRRVKIIKAVESYAAYQDMLIWYGVTTLAEYFDKGLDKIGMSIEEFK 570
Query: 508 -----DGSWVDICGMIAPQQQIHRLASLIKEGKIKSLAEVNERMKEIHSRYYDYEWCWAY 562
D +WV++ G + P+++++ + IK GK+ + ++++ H Y + AY
Sbjct: 571 LSSDFDFNWVNMGGQLIPEKKLNSIKEDIKAGKLNTWQDIHKSYDYAHDSYCHDKAENAY 630
Query: 563 NALLEWYELNPEDLTRSKVAELVHTWMRSVIRLDEMLYEDAKKEFQMHSQVGFGIDLVGE 622
L E ++ E + + +L+ ++E ++ K++ H + +
Sbjct: 631 AVLCELMKV--EKIDAALWNKLLDKAAAIRKYIEEQIFYTKNKDYNNHFR---------D 679
Query: 623 MKHRDFDE---VRGDFESNPFIVEV 644
+ +R+ +E V G E N I+E
Sbjct: 680 ITYRNSEERKAVLGSVEDNALIIEA 704
>gi|15639837|ref|NP_219287.1| hypothetical protein TP0851 [Treponema pallidum subsp. pallidum
str. Nichols]
gi|14285954|sp|O83823|Y851_TREPA Uncharacterized protein TP_0851
gi|3323168|gb|AAC65821.1| predicted coding region TP0851 [Treponema pallidum subsp. pallidum
str. Nichols]
Length = 724
Score = 154 bits (390), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 171/659 (25%), Positives = 277/659 (42%), Gaps = 101/659 (15%)
Query: 3 EKEKIYRHLTEEEISVL--EGNYCQCADWTMVEVADGFDANRCLHVMFSGQIRIGSTRGS 60
E + +R L++EEI L +GN+C W V VAD FDA+ + F+G +RI S
Sbjct: 37 EPPRAWRPLSKEEIHTLIQKGNHCD--TWHDVLVADPFDASLIRNSSFAGLVRIASLERR 94
Query: 61 HLLPGGLHVPTGINIARLHNCIVGDEVVIFHVNSYLANYRIGDYTRIENIDTIAVTESTS 120
L VPTGI + L +C VG+ I H +Y+++Y IG++ + ID + T
Sbjct: 95 LLRYHDFTVPTGITHSTLISCDVGENCAIHHC-AYISHYIIGNHVILSRIDELCTTNHAK 153
Query: 121 FGNG---------VEVTV--LNETGGREVTITDNLTAQTAYLMAMYRHDKVLTESLAAMA 169
FG G V +T+ LNETGGR++ + A A+L A +R +L + +M
Sbjct: 154 FGAGIIKDGEQEAVRITIDPLNETGGRKIFPFVGMIAADAFLWACHRDRTLLMQRFESMT 213
Query: 170 RAYAESRRSDMGAIGSHVVIHGCNTIENVCIGDYTEITGASHLCNGSINSKKEAPVSVGY 229
+ ++RR G I + VI C I++VC G + + GA+ L N ++ S + P +G
Sbjct: 214 QQQHDTRRGYYGTIETQSVIKSCRIIKDVCFGPGSYVKGANKLKNLTVQSSLQEPTQIGE 273
Query: 230 NVVCRDFILCSGSSVTDGATV------SHCFVGQGTHLGHLFSAHDSLFFANCQGENGEA 283
V + ++ G V G ++C + GT L H S+ N E
Sbjct: 274 GVELVNGVIGYGCRVFYGVKAVRFVLGNNCALKYGTRLIH------SVLGDNSTISCCEV 327
Query: 284 CAVFAGPYTVSMHKSSLLIAGLY---SFLNAGS--GSNQSNHLYKLGPIHQGIVERGSKT 338
PY H +S LIA L S + AGS GSN N G I I RG +
Sbjct: 328 LNALIFPYHEQHHNNSFLIAALIRGQSNIAAGSTIGSNH-NTRKNDGEI---IAGRGFWS 383
Query: 339 TSDSYILWPSRIGPFTLVMGRHVHHIDSSAFPFSYIIEDRNESYL--VPG----INLRSV 392
S + R F L+ G++ FPFS + + E+ L +P N+ ++
Sbjct: 384 GLASTLKHNCRFASFVLITGKNYPAELDIPFPFSLVTNNERENRLEIMPAYYWLYNMYAI 443
Query: 393 GTIRDAKKWPARDKRKDPDLLDNINFNLLSPFTAGKM----------------------- 429
R+ KK+ ARDKRK + ++ +P T G++
Sbjct: 444 E--RNEKKFAARDKRKTKT--QTVEISVFAPDTIGEIENALALLDSAIERAWVNAGNSAL 499
Query: 430 ------MRGSRKLKKLRKILG----SEGQTYVYHDMRIRSSALDKGLMFYEMGLNKFFGN 479
++ K + + +L S T V + R + D + + L FF
Sbjct: 500 TAEDIVLKHPTKAQTIPVLLTGIEHSTRSTLVLKPLEARKAYRDILIWYCTKTLLSFFEQ 559
Query: 480 SIIKRLEDCPCRTDEEVLSALKPTHNEGDGSWVDICGMIAPQQQIHRLASLIKEGKIKSL 539
+ C + THN WV++ G + P+ + L I+ GK KS
Sbjct: 560 TT-------RCISHFTSHDPKTITHN-----WVNMGGQLVPEDKFETLLQNIETGKFKSW 607
Query: 540 AEVNERMKEIHSRYYDYEWCWAYNALLEWYELNPEDLTRSKV-AELVHTWMRSVIRLDE 597
+++ + + Y + AY L L R+++ A L+ T+++ I +++
Sbjct: 608 HHIHKEYDALAATYETDKALHAYAVLC--------SLARTRIDAPLLCTYVQHAITIEK 658
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
Posted date: May 10, 2008 4:54 AM
Number of letters in database: 884,634,002
Number of sequences in database: 2,620,852
Database: /apps/blastdb/nr.01
Posted date: May 10, 2008 4:52 AM
Number of letters in database: 976,814,986
Number of sequences in database: 2,761,530
Database: /apps/blastdb/nr.02
Posted date: May 10, 2008 4:46 AM
Number of letters in database: 360,829,861
Number of sequences in database: 1,132,722
Lambda K H
0.320 0.136 0.411
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,896,275,044
Number of Sequences: 6515104
Number of extensions: 125144075
Number of successful extensions: 260307
Number of sequences better than 1.0e-04: 13
Number of HSP's better than 0.0 without gapping: 13
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 260273
Number of HSP's gapped (non-prelim): 15
length of query: 663
length of database: 2,222,278,849
effective HSP length: 141
effective length of query: 522
effective length of database: 1,303,649,185
effective search space: 680504874570
effective search space used: 680504874570
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 125 (52.8 bits)