BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= pPI0483
(469 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|53714266|ref|YP_100258.1| tetratricopeptide repeat famil... 228 1e-57
gi|60682322|ref|YP_212466.1| hypothetical protein BF2852 [B... 226 3e-57
gi|150005406|ref|YP_001300150.1| tetratricopeptide repeat f... 222 3e-56
gi|156112341|gb|EDO14086.1| hypothetical protein BACOVA_001... 218 6e-55
gi|153808239|ref|ZP_01960907.1| hypothetical protein BACCAC... 214 1e-53
gi|156858395|gb|EDO51826.1| hypothetical protein BACUNI_043... 211 1e-52
gi|29346769|ref|NP_810272.1| Tetratricopeptide repeat famil... 205 6e-51
gi|150008164|ref|YP_001302907.1| tetratricopeptide repeat f... 191 1e-46
gi|154494836|ref|ZP_02033841.1| hypothetical protein PARMER... 182 5e-44
gi|148258470|ref|YP_001243055.1| hypothetical protein BBta_... 61 2e-07
gi|118400791|ref|XP_001032717.1| TPR Domain containing prot... 56 6e-06
gi|119357637|ref|YP_912281.1| TPR repeat-containing protein... 56 6e-06
>gi|53714266|ref|YP_100258.1| tetratricopeptide repeat family protein [Bacteroides fragilis
YCH46]
gi|52217131|dbj|BAD49724.1| tetratricopeptide repeat family protein [Bacteroides fragilis
YCH46]
Length = 568
Score = 228 bits (580), Expect = 1e-57, Method: Composition-based stats.
Identities = 152/484 (31%), Positives = 244/484 (50%), Gaps = 20/484 (4%)
Query: 1 VETMLGANEIYDLAKFKVSAKTTAAP------FAPSIDANTTAWVVIPAKAGDPIKASVS 54
VE +LGAN++YD+ KF+V P AP++ A K G+ + V
Sbjct: 85 VECILGANDMYDIIKFRVGITVKKVPALQVAALAPAVGAEVYLLPYSTQKGGNVTRGKVK 144
Query: 55 KVEKFM-EKYNYCVLATSAPEKNNGAPLLNDKGQLVGIYNNS-----GTLQSATDARYAN 108
KV+ +KY+Y L +K P+ G++ G+ S ++ A A +A
Sbjct: 145 KVDNIGGDKYHYYTLDMVLKDKMVSCPVTTADGKVFGVAQKSSGQDTASISYAAGAAFAM 204
Query: 109 DFALVGLSQNDPTLLQCGIRIGLPQSADEAILALMLSAAKND-EIRTATINDFLQKFPTL 167
+ L+ +DP L GI+ GLP+ D+A++ L +++ ++ E ++DF++ FP
Sbjct: 205 SQNISALALSDPALNAIGIKKGLPEDEDQALVYLFIASTQSTPEAYAIALDDFIKTFPNS 264
Query: 168 NNGYIALATN-LFGKGDIGETEKV---LLQAIAKVKAKDEAHYNYARLMYQGAVTPALTE 223
+GY+ A N +F D +K L A+ + KD+ +YN A+L+Y ++ T
Sbjct: 265 ADGYLRRAGNYVFADKDENHMDKAAADLEHALKVAQKKDDTYYNIAKLIYNYQLSKPET- 323
Query: 224 KAKAQGWTLDKAMSQANEAYKLNPTPIYKHLQAQITYAKGDYQKAYQDFEALTKTQLKNP 283
+ WT DKA+ A + P+Y+ L+ I +AK DY A+ ++ + +T+L +P
Sbjct: 324 --VYKDWTYDKALENVRSAIAIQSLPVYQQLEGDILFAKQDYAGAFASYDKVNQTELASP 381
Query: 284 ELYLEMAQCQENLNGNDEEILSLLNKSIELCDTPYTQTAAPYFLARAQQLDKMGKYREAI 343
+ A+ +E EE+++LL+ I C TP T APY L RAQ + KYR A+
Sbjct: 382 ASFFSAAKAKELSKAAPEEVIALLDSCIARCQTPITSDLAPYLLERAQMYMNVEKYRLAL 441
Query: 344 KDFYAYEYLYGSILEANFYYIREQCEVKGKLWQQALQDILIAARLDPAEPIYCAEAGSLL 403
D+ AY + FYY REQ K K +Q+AL DI A L+P + Y AE +
Sbjct: 442 ADYDAYFNAVKGSVNDLFYYYREQAAFKAKQFQRALDDIAKAIELNPEDLTYRAEQAVVN 501
Query: 404 LRLNKLEEAILAAQQAVALKADYAEAYLILGIAQCRNNQKQEGIANIEKAKSLGNPQADT 463
LR+ + EEA + A+A+ YAE Y +LGI Q + Q++ A+ KAK LG+P D
Sbjct: 502 LRVGRYEEAEKVLKDALAIDPKYAEGYRLLGICQIQLKQEKAACASFAKAKELGDPNVDE 561
Query: 464 FLNQ 467
+ +
Sbjct: 562 LIKK 565
>gi|60682322|ref|YP_212466.1| hypothetical protein BF2852 [Bacteroides fragilis NCTC 9343]
gi|60493756|emb|CAH08546.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343]
Length = 568
Score = 226 bits (576), Expect = 3e-57, Method: Composition-based stats.
Identities = 151/484 (31%), Positives = 244/484 (50%), Gaps = 20/484 (4%)
Query: 1 VETMLGANEIYDLAKFKVSAKTTAAP------FAPSIDANTTAWVVIPAKAGDPIKASVS 54
VE +LGAN++YD+ KF+V P AP++ A K G+ + V
Sbjct: 85 VECILGANDMYDIIKFRVGITVKKVPALQVAALAPAVGAEVYLLPYSTQKGGNVTRGKVK 144
Query: 55 KVEKFM-EKYNYCVLATSAPEKNNGAPLLNDKGQLVGIYNNS-----GTLQSATDARYAN 108
KV+ +KY+Y L +K P+ G++ G+ S ++ A A +A
Sbjct: 145 KVDNIGGDKYHYYTLDMVLKDKMVSCPVTTADGKVFGVAQKSSGQDTASISYAAGAAFAM 204
Query: 109 DFALVGLSQNDPTLLQCGIRIGLPQSADEAILALMLSAAKND-EIRTATINDFLQKFPTL 167
+ L+ +DP L GI+ GLP+ D+A++ L +++ ++ E ++DF++ FP
Sbjct: 205 SQNISALALSDPALNAIGIKKGLPEDEDQALVYLFIASTQSTPEAYAIALDDFIKTFPNS 264
Query: 168 NNGYIALATN-LFGKGDIGETEKV---LLQAIAKVKAKDEAHYNYARLMYQGAVTPALTE 223
+GY+ A N +F D +K L A+ + KD+ +YN A+L+Y ++ T
Sbjct: 265 ADGYLRRAGNYVFADKDENHMDKAAADLEHALKVAQKKDDTYYNIAKLIYNYQLSKPET- 323
Query: 224 KAKAQGWTLDKAMSQANEAYKLNPTPIYKHLQAQITYAKGDYQKAYQDFEALTKTQLKNP 283
+ WT D+A+ A + P+Y+ L+ I +AK DY A+ ++ + +T+L +P
Sbjct: 324 --VYKDWTYDQALENVRSAIAIQSLPVYQQLEGDILFAKQDYAGAFASYDKVNQTELASP 381
Query: 284 ELYLEMAQCQENLNGNDEEILSLLNKSIELCDTPYTQTAAPYFLARAQQLDKMGKYREAI 343
+ A+ +E EE+++LL+ I C TP T APY L RAQ + KYR A+
Sbjct: 382 ASFFSAAKAKELSKAAPEEVIALLDSCIARCQTPITSDLAPYLLERAQMYMNVEKYRLAL 441
Query: 344 KDFYAYEYLYGSILEANFYYIREQCEVKGKLWQQALQDILIAARLDPAEPIYCAEAGSLL 403
D+ AY + FYY REQ K K +Q+AL DI A L+P + Y AE +
Sbjct: 442 ADYDAYFNAVKGSVNDLFYYYREQAAFKAKQFQRALDDIAKAIELNPEDLTYRAEQAVVN 501
Query: 404 LRLNKLEEAILAAQQAVALKADYAEAYLILGIAQCRNNQKQEGIANIEKAKSLGNPQADT 463
LR+ + EEA + A+A+ YAE Y +LGI Q + Q++ A+ KAK LG+P D
Sbjct: 502 LRVGRYEEAEKVLKDALAIDPKYAEGYRLLGICQIQLKQEKAACASFAKAKELGDPNVDE 561
Query: 464 FLNQ 467
+ +
Sbjct: 562 LIKK 565
>gi|150005406|ref|YP_001300150.1| tetratricopeptide repeat family protein [Bacteroides vulgatus ATCC
8482]
gi|149933830|gb|ABR40528.1| tetratricopeptide repeat family protein [Bacteroides vulgatus ATCC
8482]
Length = 566
Score = 222 bits (566), Expect = 3e-56, Method: Composition-based stats.
Identities = 153/479 (31%), Positives = 255/479 (53%), Gaps = 16/479 (3%)
Query: 4 MLGANEIYDLAKFKVSA-KTTAAPFAPSIDANTTAWVVI----PAKAGDPIKASVSKVEK 58
+LGAN +YD+ KF K T A + S A+ V + KA +V+KV+
Sbjct: 88 ILGANSMYDIVKFNTETDKKTIALKSASQPASVGETVYLLPYSTQKAATCQTGTVTKVDT 147
Query: 59 FMEKYNYCVLATSAPEKNNGAPLLNDKGQLVGIYNNSGTLQS----ATDARYANDFALVG 114
+K Y LA + EK P++N G+++G+ + + ++ A A Y ++
Sbjct: 148 IGDKAYYYTLAMTTNEKTVSCPIMNANGEVLGLIQKNASDEAKESYAIGATYGASLSITA 207
Query: 115 LSQNDPTLLQCGIRIGLPQSADEAILAL-MLSAAKNDEIRTATINDFLQKFPTLNNGYIA 173
LS ND +L + GI+ GLP++ D+A++ L M S+ +N + T+NDFL+++P +GYI
Sbjct: 208 LSLNDMSLNKIGIKKGLPETEDQALVYLFMASSQQNQDEYITTLNDFLEQYPNSADGYIR 267
Query: 174 LATNLFGKGDIGET---EKVLLQAIAKVKAKDEAHYNYARLMYQGAVTPALTEKAKAQGW 230
AT G D + L +A+ K E YN A+L+Y + T +L +K W
Sbjct: 268 RATTYMGFNDDEHNALADADLKKALEVTANKSETQYNIAKLIY--SYTISLGDKKPYGDW 325
Query: 231 TLDKAMSQANEAYKLNPTPIYKHLQAQITYAKGDYQKAYQDFEALTKTQLKNPELYLEMA 290
+ DKA+S ++A + + PIY L+ I +A Y +AY +E + ++ + + + A
Sbjct: 326 SYDKALSIIHDAMQADNQPIYTQLEGDILFAMKKYPEAYAAYEKVNQSSIASAATFYSAA 385
Query: 291 QCQENLNGND-EEILSLLNKSIELCDTPYTQTAAPYFLARAQQLDKMGKYREAIKDFYAY 349
+ ++ + G D E+++L++ ++ PYT AAPYF RA+ + GKYREA+ D+ +
Sbjct: 386 KTKQLIEGTDMNEVIALMDSAVARFTKPYTSEAAPYFYERAEIKAQTGKYREAVIDYDTF 445
Query: 350 EYLYGSILEANFYYIREQCEVKGKLWQQALQDILIAARLDPAEPIYCAEAGSLLLRLNKL 409
G + A FY REQ E++ K++QQA+ DI A + P + E GS+ LR+ +
Sbjct: 446 YDAIGGRVTAAFYLQREQAEIQCKMYQQAINDINKAIEMTPEDIAMWVEKGSVHLRVGQH 505
Query: 410 EEAILAAQQAVALKADYAEAYLILGIAQCRNNQKQEGIANIEKAKSLGNPQADTFLNQF 468
EAI A ++A++L A AY +LG Q + + +E AN KAK LG+ D + ++
Sbjct: 506 NEAIEALEKAISLDPKAAAAYRMLGYCQIQLKKNKEACANFAKAKELGDEVVDGLIQKY 564
>gi|156112341|gb|EDO14086.1| hypothetical protein BACOVA_00198 [Bacteroides ovatus ATCC 8483]
Length = 571
Score = 218 bits (556), Expect = 6e-55, Method: Composition-based stats.
Identities = 151/488 (30%), Positives = 245/488 (50%), Gaps = 26/488 (5%)
Query: 1 VETMLGANEIYDLAKFKVSAKTTAAP------FAPSIDANTTAWVV--IPAKAGDPIKAS 52
V +LGAN++YD+ KF+V+ P AP A AW++ K+ +
Sbjct: 88 VSLILGANDMYDVIKFRVAITEKKVPALIVAKTAPV--AGAEAWMLPYSTQKSIACVNGK 145
Query: 53 VSKVEKFMEKYNYCVLATSAPEKNNGAPLLNDKGQLVGIYNNSGTLQSATD-----ARYA 107
V V K +Y+Y L+ +K P++N +GQ+ GI S + + T A +A
Sbjct: 146 VKDVSKVAGEYHYYTLSMHMKDKMVSCPVMNAEGQVFGIAQKSSGIDTVTTCYAAGAAFA 205
Query: 108 NDFALVGLSQNDPTLLQCGIRIGLPQSADEAILAL-MLSAAKNDEIRTATINDFLQKFPT 166
+ LS D L GIR GLP++ D+A++ L M S++ + E ++DF+++FP
Sbjct: 206 MSQKISALSLGDAALKSIGIRKGLPETEDQALVYLFMASSSLSGEDYEKLLDDFIRQFPA 265
Query: 167 LNNGYIALATNLFGKGDIGET--EKVLL---QAIAKVKAKDEAHYNYARLMYQGAVT-PA 220
+GY+ A KG +T +K + QA+ + KD+ +YN +LMY ++ P
Sbjct: 266 NADGYLRRANYYASKGKDDQTWYDKAVADFNQALKVAQKKDDVYYNIGKLMYAYQLSKPE 325
Query: 221 LTEKAKAQGWTLDKAMSQANEAYKLNPTPIYKHLQAQITYAKGDYQKAYQDFEALTKTQL 280
T K WT D A+ +A ++P PIY ++ I +A+ DY A +E + + +
Sbjct: 326 KTYK----DWTYDTALKNVRQAIAIDPLPIYTQMEGDILFAQQDYAGALAAYEKVNASNI 381
Query: 281 KNPELYLEMAQCQENLNGNDEEILSLLNKSIELCDTPYTQTAAPYFLARAQQLDKMGKYR 340
+P + A+ +E L G+ +E+++L++ I C P T APY L RAQ + R
Sbjct: 382 ASPATFFSAAKTKELLKGDPKEVVALMDSCIARCPQPITADFAPYLLERAQMNMNADQAR 441
Query: 341 EAIKDFYAYEYLYGSILEANFYYIREQCEVKGKLWQQALQDILIAARLDPAEPIYCAEAG 400
A+ D+ AY + FYY REQ +K + +Q+AL DI A ++P + Y AE
Sbjct: 442 NAMLDYDAYHTAVKGEVNDVFYYYREQAALKARQFQRALDDIAKAIEMNPTDLTYQAEHA 501
Query: 401 SLLLRLNKLEEAILAAQQAVALKADYAEAYLILGIAQCRNNQKQEGIANIEKAKSLGNPQ 460
+ LR+ + EEAI + Y EAY +LG+ Q + + E N +KAK LG+P
Sbjct: 502 VINLRVGRYEEAIQILNNILKTDPKYGEAYRLLGLCQIQLKKTDEACGNFKKAKELGDPN 561
Query: 461 ADTFLNQF 468
AD + ++
Sbjct: 562 ADELITKY 569
>gi|153808239|ref|ZP_01960907.1| hypothetical protein BACCAC_02527 [Bacteroides caccae ATCC 43185]
gi|149129142|gb|EDM20358.1| hypothetical protein BACCAC_02527 [Bacteroides caccae ATCC 43185]
Length = 568
Score = 214 bits (544), Expect = 1e-53, Method: Composition-based stats.
Identities = 147/486 (30%), Positives = 237/486 (48%), Gaps = 22/486 (4%)
Query: 1 VETMLGANEIYDLAKFKVSAKTTAAPF------APSIDANTTAWVVIPAKAGDPIKASVS 54
V+ +LGAN++YD+ KF+V+ P AP + A+ K+ + V
Sbjct: 85 VDVILGANDMYDVIKFRVAITEKKVPALNVAKAAPEVGADAWMLPYSTQKSIACVSGKVK 144
Query: 55 KVEKFMEKYNYCVLATSAPEKNNGAPLLNDKGQLVGIYNNSGTLQSATDARYANDFA--- 111
V K +Y+Y L +K P++N +GQ+ GI S + + T A
Sbjct: 145 DVSKVAGEYHYYTLGMQMKDKMVSCPVMNAEGQVFGISQKSSGIDTVTTCYAAGAAFAMA 204
Query: 112 --LVGLSQNDPTLLQCGIRIGLPQSADEAILAL-MLSAAKNDEIRTATINDFLQKFPTLN 168
+ LS D L GIR GLP+ D+A++ L M S+ DE ++DF+++FP
Sbjct: 205 QKINALSLGDAALKSIGIRKGLPEVEDQALVYLFMASSQMTDEEYGKLLDDFIRQFPNST 264
Query: 169 NGYIALATNLFGKGDIGET--EKVLL---QAIAKVKAKDEAHYNYARLMYQGAVT-PALT 222
+GYI A KG ++ +K + +A+ + KD+ +YN A+LM+ ++ P T
Sbjct: 265 DGYIRRANYYVTKGQEDQSWFDKAVADFNKALKVAQKKDDVYYNIAKLMHAYQLSKPEKT 324
Query: 223 EKAKAQGWTLDKAMSQANEAYKLNPTPIYKHLQAQITYAKGDYQKAYQDFEALTKTQLKN 282
K WT D A+ +A ++P P+Y L+ I +A+ DY A +E + + + +
Sbjct: 325 YK----DWTYDTALKNLRQAIAIDPLPVYTQLEGDILFAQQDYAGALAAYEKVNASNIAS 380
Query: 283 PELYLEMAQCQENLNGNDEEILSLLNKSIELCDTPYTQTAAPYFLARAQQLDKMGKYREA 342
P + A+ +E L + +E++ L++ I C P T APY L RAQ + R A
Sbjct: 381 PATFFSAAKTKELLKADPKEVIVLMDSCIARCPQPVTSAFAPYLLERAQMNLNANQARNA 440
Query: 343 IKDFYAYEYLYGSILEANFYYIREQCEVKGKLWQQALQDILIAARLDPAEPIYCAEAGSL 402
+ D+ AY + FYY REQ +K + +Q+AL DI A L+P + Y AE +
Sbjct: 441 MLDYDAYFKAVNGQVNDMFYYYREQAALKARQYQRALDDIAKAIELNPKDLTYKAEQAVV 500
Query: 403 LLRLNKLEEAILAAQQAVALKADYAEAYLILGIAQCRNNQKQEGIANIEKAKSLGNPQAD 462
LR+ + EEA+ + YAEAY +LG+ Q + + E N KAK LG+P D
Sbjct: 501 NLRVGRYEEAVQILNNILKEDPKYAEAYRLLGLCQIQLKKTDEACGNFNKAKELGDPNTD 560
Query: 463 TFLNQF 468
+ ++
Sbjct: 561 DLIKKY 566
>gi|156858395|gb|EDO51826.1| hypothetical protein BACUNI_04375 [Bacteroides uniformis ATCC 8492]
Length = 567
Score = 211 bits (536), Expect = 1e-52, Method: Composition-based stats.
Identities = 151/483 (31%), Positives = 248/483 (51%), Gaps = 19/483 (3%)
Query: 1 VETMLGANEIYDLAKFKVS---AKTTAAPFAPSIDANTTAWVVIP---AKAGDPIKASVS 54
VE ++GA+++YD+ KF+V K TA A A ++P K V
Sbjct: 85 VEAIMGADDMYDVVKFRVGISGKKVTALTLAAVAPAAGADVYLLPYSTQKDRSFTAGKVK 144
Query: 55 KVEKFMEKYNYCVLATSAPEKNNGAPLLNDKGQLVGIYNNSGTLQSAT-----DARYAND 109
+ +K Y+Y L +K PL+ GQ+ G+ S +AT DA +A
Sbjct: 145 EADKISGNYSYYTLDMRLKDKMVSCPLMTVDGQVFGLAQKSSGQDTATICYAIDANFAMS 204
Query: 110 FALVGLSQNDPTLLQCGIRIGLPQSADEAILAL-MLSAAKNDEIRTATINDFLQKFPTLN 168
+ LS D +L GI+ LP + ++A++ L M S+ + E T+NDF+ ++P
Sbjct: 205 QNISALSYGDMSLKGIGIKKALPDTEEQALVFLYMASSQLSPEKYMETLNDFIAQYPASA 264
Query: 169 NGYIALAT-NLFGKGDIGETEKV---LLQAIAKVKAKDEAHYNYARLMYQGAVTPALTEK 224
+GY+ A+ +LF + +KV + +A+ KD+ +YN A+++Y A+ EK
Sbjct: 265 DGYLRRASQHLFMSREDASMDKVAADMDKALEVAAKKDDVYYNRAKIIYNYAL--GKPEK 322
Query: 225 AKAQGWTLDKAMSQANEAYKLNPTPIYKHLQAQITYAKGDYQKAYQDFEALTKTQLKNPE 284
+ W+LDKA+ + +A ++ P+Y L+ I +AK DY A+ ++ + KT L +P
Sbjct: 323 V-YKDWSLDKALDEVRKAIAIDELPVYVQLEGDILFAKQDYPSAFTSYDKVNKTILASPA 381
Query: 285 LYLEMAQCQENLNGNDEEILSLLNKSIELCDTPYTQTAAPYFLARAQQLDKMGKYREAIK 344
+ A+ +E + EE+L+L++ + PYT+ AAPY L RAQ + R A+
Sbjct: 382 TFFSAAKTKELMQAPAEEVLALMDSCVARFTQPYTEEAAPYLLERAQARMNADQARNAML 441
Query: 345 DFYAYEYLYGSILEANFYYIREQCEVKGKLWQQALQDILIAARLDPAEPIYCAEAGSLLL 404
D+ AY + FYY REQ +K K +Q+AL D+ A L+P + Y AE + L
Sbjct: 442 DYDAYYNAVNGKVNDMFYYYREQAALKAKQYQRALDDMAKAIELNPEDLTYRAELAVVNL 501
Query: 405 RLNKLEEAILAAQQAVALKADYAEAYLILGIAQCRNNQKQEGIANIEKAKSLGNPQADTF 464
R+ + EEA+ + A+ YAEAY ++GIAQ + + +E A+ KAK LG+P D
Sbjct: 502 RVGRYEEALNVLKAALEKDPKYAEAYRLMGIAQLQMKKDKEACASFAKAKELGDPNVDAL 561
Query: 465 LNQ 467
+ +
Sbjct: 562 IEK 564
>gi|29346769|ref|NP_810272.1| Tetratricopeptide repeat family protein [Bacteroides
thetaiotaomicron VPI-5482]
gi|29338666|gb|AAO76466.1| Tetratricopeptide repeat family protein [Bacteroides
thetaiotaomicron VPI-5482]
Length = 568
Score = 205 bits (521), Expect = 6e-51, Method: Composition-based stats.
Identities = 146/487 (29%), Positives = 241/487 (49%), Gaps = 24/487 (4%)
Query: 1 VETMLGANEIYDLAKFKVSAKTTAAP------FAPSIDANTTAWVVIPAKAGDPIKASVS 54
V +LGA+++YD+ KF+V+ P AP+ A+ K+ + V
Sbjct: 85 VSLILGADDMYDVIKFRVAITEKKVPSLVVATTAPAAGADAWMLPYSTQKSIACVSGKVK 144
Query: 55 KVEKFMEKYNYCVLATSAPEKNNGAPLLNDKGQLVGIYNNSGTLQSATD-----ARYAND 109
V K +Y+Y L+ +K P++N +GQ+ GI S + + T A +A
Sbjct: 145 DVSKIAGEYHYYTLSMQMKDKMVSCPVMNAEGQVFGISQKSSGVDTVTTCYAAGAAFAMS 204
Query: 110 FALVGLSQNDPTLLQCGIRIGLPQSADEAILALMLSAAK--NDEIRTATINDFLQKFPTL 167
+ LS D L GIR GLP++ D+A++ L +++ + DE ++DF+++FP+
Sbjct: 205 QKISALSLGDVALKNIGIRKGLPEAEDQALVYLFMASTQMSADEYEK-LLDDFIRQFPSS 263
Query: 168 NNGYIALATNLFGKG--DIGETEKVLL---QAIAKVKAKDEAHYNYARLMYQGAVT-PAL 221
+GYI A KG D +K + QA+ KD+ +YN A+L+Y ++ P
Sbjct: 264 TDGYIRRANYYVAKGKDDQSYFDKAVADFNQALKVAAKKDDVYYNIAKLIYGYQLSKPET 323
Query: 222 TEKAKAQGWTLDKAMSQANEAYKLNPTPIYKHLQAQITYAKGDYQKAYQDFEALTKTQLK 281
T K WT D A+ +A ++P P+Y L+ I +A+ DY A +E + + L
Sbjct: 324 TYK----DWTYDTALKNLRQAMAIDPLPVYTQLEGDILFAQQDYAGALAAYEKVNASNLA 379
Query: 282 NPELYLEMAQCQENLNGNDEEILSLLNKSIELCDTPYTQTAAPYFLARAQQLDKMGKYRE 341
+ + A+ +E L + +E+L+L++ I C P T APY L RAQ + R
Sbjct: 380 SAASFFSAAKTKELLKADAKEVLALMDSCIARCPQPVTANFAPYLLERAQIYMNNDQARN 439
Query: 342 AIKDFYAYEYLYGSILEANFYYIREQCEVKGKLWQQALQDILIAARLDPAEPIYCAEAGS 401
A+ D+ AY + FYY REQ +K + +Q+AL DI+ A L P + Y AE
Sbjct: 440 AMLDYDAYYKAVNGQVNDLFYYYREQAALKARQYQRALDDIVKAVELSPKDLTYRAEHAV 499
Query: 402 LLLRLNKLEEAILAAQQAVALKADYAEAYLILGIAQCRNNQKQEGIANIEKAKSLGNPQA 461
+ LR+ + EE++ + + + Y EAY +LG+ Q + + E N KAK LG+P
Sbjct: 500 VNLRVGRYEESMKILNEILKDEPKYGEAYRLLGLCQIQLKKTDEACGNFNKAKELGDPNV 559
Query: 462 DTFLNQF 468
D + ++
Sbjct: 560 DELIKKY 566
>gi|150008164|ref|YP_001302907.1| tetratricopeptide repeat family protein [Parabacteroides distasonis
ATCC 8503]
gi|149936588|gb|ABR43285.1| tetratricopeptide repeat family protein [Parabacteroides distasonis
ATCC 8503]
Length = 575
Score = 191 bits (484), Expect = 1e-46, Method: Composition-based stats.
Identities = 144/490 (29%), Positives = 237/490 (48%), Gaps = 26/490 (5%)
Query: 1 VETMLGANEIYDLAKFKVSAKTTAA--PFAPSIDANTTAWVVIPAKAGDPIK---ASVSK 55
V +LGA+E+YD+ KFKV A P A + ++P G K VS+
Sbjct: 86 VSRILGADELYDVIKFKVEVPKKAVFLPIAREPISQGATAYLMPYSTGKITKFGEGPVSE 145
Query: 56 VEKFMEKYNYCVLATSAPEKNNGAPLLNDKGQLVGIYNNSGTLQS----ATDARYANDFA 111
V K E ++Y L+ + AP+L G++ G+ + + A A YAN
Sbjct: 146 VSKLKEPFSYYKLSMALESGQVNAPVLTADGEVFGLAQEDASGKKEDSYAVSAGYANSLT 205
Query: 112 LVGLSQNDPTLLQCGIRIGLPQSADEAILALMLSAAKND-EIRTATINDFLQKFPTLNNG 170
+ + T + GIR P A +A ++L L A+ D + AT+NDF+ FP +G
Sbjct: 206 IQSADAFNSTYSRIGIRKAWPADASQAQVSLYLMASSQDPKTYLATLNDFIATFPDSPDG 265
Query: 171 YIALATNL-FGKGDIGETEK----VLLQAIAKVKA-------KDEAHYNYARLMYQGAVT 218
Y+ A + + + D+ TE L +A+ + K + +N A+L+Y V
Sbjct: 266 YLNRANHYAYHRADLAPTEAEQGACLDKALEDINTASRFSERKGDVWFNRAKLIY--GVA 323
Query: 219 PALTEKAKAQGWTLDKAMSQANEAYKLNPTPIYKHLQAQITYAKGDYQKAYQDFEALTKT 278
A T K Q WT+D A +A P+Y+ L+ I + KGD+++A+ D+ + +
Sbjct: 324 AADTTLNKEQ-WTVDAATEAIQKAIGEEDLPVYRQLEGDIHFYKGDFEQAFADYMKVNDS 382
Query: 279 QLKNPELYLEMAQCQENLNG-NDEEILSLLNKSIELCDTPYTQTAAPYFLARAQQLDKMG 337
+ + + A+ + N+ G N +I++LL+ +I C P T AAPY L R K+
Sbjct: 383 DMASSTSWYWAAKAKANIRGANFGDIIALLDSAIAKCGNPPTNEAAPYILERVDLRLKLM 442
Query: 338 KYREAIKDFYAYEYLYGSILEANFYYIREQCEVKGKLWQQALQDILIAARLDPAEPIYCA 397
+Y+EA+ D+ Y L + F+Y REQ + + + AL DI A RL+P +P Y A
Sbjct: 443 QYKEAVDDYDLYYDLLKGQVGDRFFYYREQAKFRMNDFPGALADIQSAIRLNPGDPTYPA 502
Query: 398 EAGSLLLRLNKLEEAILAAQQAVALKADYAEAYLILGIAQCRNNQKQEGIANIEKAKSLG 457
E S+ +R+ ++A+ + + A+ + D+A Y + GI R +K E KAK LG
Sbjct: 503 EEASVYIRMENYDQALRSLENALRIAPDFASCYRLRGICYVRQGKKAEACEAFNKAKELG 562
Query: 458 NPQADTFLNQ 467
+P D + +
Sbjct: 563 DPVVDKLIKE 572
>gi|154494836|ref|ZP_02033841.1| hypothetical protein PARMER_03878 [Parabacteroides merdae ATCC
43184]
gi|154085386|gb|EDN84431.1| hypothetical protein PARMER_03878 [Parabacteroides merdae ATCC
43184]
Length = 575
Score = 182 bits (462), Expect = 5e-44, Method: Composition-based stats.
Identities = 138/492 (28%), Positives = 240/492 (48%), Gaps = 30/492 (6%)
Query: 1 VETMLGANEIYDLAKFKVSAKTTAA--PFAPSIDAN-TTAWVVIPAKAGDPI--KASVSK 55
V+ +LGA+E+YD KF+V A P A AN T A++++ + + ++++
Sbjct: 86 VKNILGADELYDAVKFQVEVPKKAVFLPIAAEPVANGTNAYLLLYSTGKNATFKSGAITE 145
Query: 56 VEKFMEKYNYCVLATSAPEKNNGAPLLNDKGQLVGIYNNSGTLQS----ATDARYANDFA 111
V K + Y Y +A + E APLL +G++ G+ + A YA +
Sbjct: 146 VSKLKDPYKYYKMAVALEENELNAPLLTPEGEVFGLAQADAGGKKDICYGLSAGYAGSLS 205
Query: 112 LVGLSQNDPTLLQCGIRIGLPQSADEAILAL-MLSAAKNDEIRTATINDFLQKFPTLNNG 170
+ I G P+ D+A +AL ++S ++ + R T+NDF+ FP +G
Sbjct: 206 IGSADYLSSAYRNINIPKGWPKELDQATVALYLISGTQDAKARLETVNDFITTFPDAPDG 265
Query: 171 YIALATNLFGKGDIGETEKVLLQAIAKVKAKDEAH-------------YNYARLMYQ-GA 216
Y+ ++L+ + QAI KA D+ YN A+L+Y +
Sbjct: 266 YLN-RSDLYAYNRAELANSMAEQAIYLQKALDDIKTASKCSDKKGDFWYNQAKLIYGVAS 324
Query: 217 VTPALTEKAKAQGWTLDKAMSQANEAYKLNPTPIYKHLQAQITYAKGDYQKAYQDFEALT 276
LT+ A WT++ AM ++A + + P Y L+A I + KG++Q+A+ ++ +
Sbjct: 325 ADSTLTDPA----WTIEAAMEALDKAIEEDNLPAYHQLRADILFNKGEFQQAFDEYMIVN 380
Query: 277 KTQLKNPELYLEMAQCQENLNG-NDEEILSLLNKSIELCDTPYTQTAAPYFLARAQQLDK 335
+ + + Y A+ +E + G N +++ LL+K+IE C T AA Y L R +
Sbjct: 381 NSDVASASSYYLAAKAKERVTGFNIGDVIDLLDKAIEKCGTNMNAEAAAYVLERINWRLR 440
Query: 336 MGKYREAIKDFYAYEYLYGSILEANFYYIREQCEVKGKLWQQALQDILIAARLDPAEPIY 395
+ +Y EAI D+ Y L G + NF+++REQ + + + AL+DI A + P+ P Y
Sbjct: 441 LAQYTEAIADYDLYYTLIGGQVLPNFFFLREQAKFRAGDLEGALKDIQAAIQGSPSTPDY 500
Query: 396 CAEAGSLLLRLNKLEEAILAAQQAVALKADYAEAYLILGIAQCRNNQKQEGIANIEKAKS 455
AE S+ +R K E+A+ + ++A+A+ D+ Y + G+ R +K E KAK
Sbjct: 501 YAEEASIYVRQQKYEDALKSIERAIAIAPDFGACYRLRGVCCVRLKKKTEACEAFNKAKE 560
Query: 456 LGNPQADTFLNQ 467
LG+P A + +
Sbjct: 561 LGDPLAGKLIKE 572
>gi|148258470|ref|YP_001243055.1| hypothetical protein BBta_7275 [Bradyrhizobium sp. BTAi1]
gi|146410643|gb|ABQ39149.1| hypothetical protein BBta_7275 [Bradyrhizobium sp. BTAi1]
Length = 385
Score = 61.2 bits (147), Expect = 2e-07, Method: Composition-based stats.
Identities = 41/140 (29%), Positives = 69/140 (49%), Gaps = 2/140 (1%)
Query: 328 ARAQQLDKMGKYREAIKDFYAYEYLYGSILEANFYYIREQCEVKGKLWQQALQDILIAAR 387
+R + +G A+ DF A L N Y R +Q+ALQD L +R
Sbjct: 35 SRGASYESLGLRDRALADFDAAIVLIPEF--PNLYLYRGVIWGDKGEYQRALQDFLTVSR 92
Query: 388 LDPAEPIYCAEAGSLLLRLNKLEEAILAAQQAVALKADYAEAYLILGIAQCRNNQKQEGI 447
L P +P+ G++ RL L++AI+ +A+ L+ADYA+AY +++ I
Sbjct: 93 LTPTDPLAFNNLGNVYDRLGDLDQAIVNFDRAIGLRADYAQAYYNRAHTYALKQERERAI 152
Query: 448 ANIEKAKSLGNPQADTFLNQ 467
A+ ++A SL +D ++N+
Sbjct: 153 ADYDQAISLQPLFSDAYVNR 172
>gi|118400791|ref|XP_001032717.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
gi|89287061|gb|EAR85054.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
Length = 721
Score = 55.8 bits (133), Expect = 6e-06, Method: Composition-based stats.
Identities = 54/194 (27%), Positives = 83/194 (42%), Gaps = 7/194 (3%)
Query: 238 QANEAYKLNPTPIYKHLQAQITYAKGDYQKAYQDFEALTKTQLKNPELYLEMAQCQENLN 297
QA + N Y HL+ Q YQKA DF+ K ++ N + CQ L
Sbjct: 77 QAIPSLSENRQAKYFHLKGQAYLGMKQYQKALFDFDTAVKQEIDNATYHASRGNCQLEL- 135
Query: 298 GNDEEILSLLNKSIELCDTPYTQTAAPYFLARAQQLDKMGKYREAIKDF-YAYEYLYGSI 356
GN +E L +++I L T +L RA+ + Y++AI+DF A +Y+ S
Sbjct: 136 GNIKEALQEFDETINL-----NPTDGHNYLNRAKVYFNLESYKKAIEDFTMAQKYIKDSN 190
Query: 357 LEANFYYIREQCEVKGKLWQQALQDILIAARLDPAEPIYCAEAGSLLLRLNKLEEAILAA 416
YY +C + K ++A+Q + A + + G L + EEA+
Sbjct: 191 SLFKIYYSLGECYRRTKQMEEAIQYLEKATLIKKEDSNAQNTLGLSLFEYGRYEEALEKF 250
Query: 417 QQAVALKADYAEAY 430
Q+A L AE Y
Sbjct: 251 QEARNLDDTKAEYY 264
>gi|119357637|ref|YP_912281.1| TPR repeat-containing protein [Chlorobium phaeobacteroides DSM 266]
gi|119354986|gb|ABL65857.1| TPR repeat-containing protein [Chlorobium phaeobacteroides DSM 266]
Length = 3035
Score = 55.8 bits (133), Expect = 6e-06, Method: Composition-based stats.
Identities = 46/140 (32%), Positives = 69/140 (49%), Gaps = 6/140 (4%)
Query: 329 RAQQLDKMGKYREAIKDFYAYEYLYGSILEANFYYIREQCEVKGKLWQQALQ--DILIAA 386
R L + +Y EA+ + L AN Y R +K K + AL+ D IA
Sbjct: 2378 RGNTLQGLRRYEEAVSSYDQAIALRSD--NANAYSNRGVAMMKLKRYADALESHDKAIAL 2435
Query: 387 RLDPAEPIYCAEAGSLLLRLNKLEEAILAAQQAVALKADYAEAYLILGIAQCRNNQKQEG 446
R D AE C+ G+ L L + EEA+++ +QA+ALK+DYAE Y G + +E
Sbjct: 2436 RPDYAEA--CSNRGNTLQELKRYEEALMSYKQAIALKSDYAEFYSNYGNVLEELKRYEEA 2493
Query: 447 IANIEKAKSLGNPQADTFLN 466
+ N E+A +L +D + N
Sbjct: 2494 LLNYEQAIALKPDFSDAYSN 2513
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.315 0.131 0.370
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,636,573,373
Number of Sequences: 5470121
Number of extensions: 66433021
Number of successful extensions: 206637
Number of sequences better than 1.0e-05: 28
Number of HSP's better than 0.0 without gapping: 10
Number of HSP's successfully gapped in prelim test: 18
Number of HSP's that attempted gapping in prelim test: 206400
Number of HSP's gapped (non-prelim): 182
length of query: 469
length of database: 1,894,087,724
effective HSP length: 137
effective length of query: 332
effective length of database: 1,144,681,147
effective search space: 380034140804
effective search space used: 380034140804
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 132 (55.5 bits)