BLASTP 2.2.18 [Mar-02-2008]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= PGN_1775 hypothetical protein
(494 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
6,515,104 sequences; 2,222,278,849 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|34541412|ref|NP_905891.1| hypothetical protein PG1790 [P... 1013 0.0
gi|150008335|ref|YP_001303078.1| hypothetical protein BDI_1... 466 e-129
gi|154492667|ref|ZP_02032293.1| hypothetical protein PARMER... 426 e-117
gi|150006353|ref|YP_001301097.1| hypothetical protein BVU_3... 350 2e-94
gi|108762884|ref|YP_632187.1| hypothetical protein MXAN_400... 206 2e-51
gi|153005936|ref|YP_001380261.1| hypothetical protein Anae1... 118 1e-24
gi|108756929|ref|YP_633429.1| hypothetical protein MXAN_527... 117 2e-24
gi|86156917|ref|YP_463702.1| hypothetical protein Adeh_0489... 111 2e-22
gi|163765551|ref|ZP_02172583.1| conserved hypothetical prot... 109 4e-22
gi|167455892|ref|ZP_02322113.1| conserved hypothetical prot... 109 6e-22
gi|163766903|ref|ZP_02173923.1| conserved hypothetical prot... 107 2e-21
gi|86159909|ref|YP_466694.1| hypothetical protein Adeh_3491... 106 5e-21
gi|167456939|ref|ZP_02323155.1| conserved hypothetical prot... 103 2e-20
gi|124265225|ref|YP_001019229.1| hypothetical protein Mpe_A... 103 3e-20
gi|145620156|ref|ZP_01776194.1| hypothetical protein GbemDR... 82 1e-13
gi|15611806|ref|NP_223457.1| hypothetical protein jhp0739 [... 69 6e-10
gi|15645422|ref|NP_207596.1| hypothetical protein HP0803 [H... 69 1e-09
gi|108563213|ref|YP_627529.1| hypothetical protein HPAG1_07... 68 1e-09
gi|6822150|emb|CAB71023.1| hypothetical protein [Helicobact... 60 5e-07
gi|157376731|ref|YP_001475331.1| hypothetical protein Ssed_... 58 1e-06
gi|42523741|ref|NP_969121.1| hypothetical protein Bd2288 [B... 58 2e-06
gi|127512116|ref|YP_001093313.1| hypothetical protein Shew_... 57 4e-06
>gi|34541412|ref|NP_905891.1| hypothetical protein PG1790 [Porphyromonas gingivalis W83]
gi|34397729|gb|AAQ66790.1| hypothetical protein PG_1790 [Porphyromonas gingivalis W83]
Length = 494
Score = 1013 bits (2620), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 491/494 (99%), Positives = 492/494 (99%)
Query: 1 MKRRFLSLLLLYILSSISLSAQRFPMVQGIELDTDSLFSLPKRPWRAIGKTIGVNLAVWG 60
MKRRFLSLLLLYILSSISLSAQRFPMVQGIELDTDSLFSLPKRPWRAIGKTIGVNLAVWG
Sbjct: 1 MKRRFLSLLLLYILSSISLSAQRFPMVQGIELDTDSLFSLPKRPWRAIGKTIGVNLAVWG 60
Query: 61 FDHFIMNEDFADISWQTIKSNFQTGFGWDNDKFVTNLFAHPYHGSLYFNAARANGLSFRH 120
FDHFIMNEDFADISWQTIKSNFQTGFGWDNDKFVTNLFAHPYHGSLYFNAAR+NGLSFRH
Sbjct: 61 FDHFIMNEDFADISWQTIKSNFQTGFGWDNDKFVTNLFAHPYHGSLYFNAARSNGLSFRH 120
Query: 121 SAPFAFFGSLMWELLMENEPPSINDLCATTIGGIALGEMGHRLSDLLIDNRTTGWERTGR 180
SAPFAFFGSLMWELLMENEPPSINDLCATTIGGIALGEMGHRLSDLLIDNRTTGWER GR
Sbjct: 121 SAPFAFFGSLMWELLMENEPPSINDLCATTIGGIALGEMGHRLSDLLIDNRTTGWERMGR 180
Query: 181 EVAIALINPMRFLNRLTAGEVTSVASRSGQIFQSVPINIVVDAGFRFLADKRHARTGATA 240
EVAIALINPMRFLNRLTAGEVTSV SRSGQIFQSVPINIVVDAGFRFLADKRHARTGATA
Sbjct: 181 EVAIALINPMRFLNRLTAGEVTSVGSRSGQIFQSVPINIVVDAGFRFLADKRHARTGATA 240
Query: 241 LTLNLRFDYGDPFRSETFSPYDFFQFKAGLSFSESQPLLSQINLIGILSGCQLLAHERTV 300
LTLNLRFDYGDPFRSETFSPYDFFQFKAGLSFSESQPLLSQINLIGILSGCQLLAHERTV
Sbjct: 241 LTLNLRFDYGDPFRSETFSPYDFFQFKAGLSFSESQPLLSQINLIGILSGCQLLAHERTV 300
Query: 301 LVGGLFQHFDYYNSEKRISKNSEEVLVTPYRISQVAALGGGLIFQHHGKFRRRPLELYAE 360
LVGGLFQHFDYYNSEKRISKNSEEVLVTPYRISQVAALGGGLIFQHHGKFRRRPLELYAE
Sbjct: 301 LVGGLFQHFDYYNSEKRISKNSEEVLVTPYRISQVAALGGGLIFQHHGKFRRRPLELYAE 360
Query: 361 TYLNVVPMGASLSDHYNVDNRDYNLGSGLSGKLYLGATYNDLWSWLLGVESYRLYTWIGY 420
TYLNVVPMGASLSDHYNVDNRDYNLGSGLSGKLYLGATYNDLWSWLLGVESYRLYTWIGY
Sbjct: 361 TYLNVVPMGASLSDHYNVDNRDYNLGSGLSGKLYLGATYNDLWSWLLGVESYRLYTWIGY 420
Query: 421 EEPHQKNTDVSSFMVQGDESKARLLVTSSEFAFHPGPWHVAIVARRFIRKTAYQFYPNVS 480
EEPHQKNTDVSSFMVQGDESKARLLVTSSEFAFHPGPWHVAIVARRFIRKTAYQFYPNVS
Sbjct: 421 EEPHQKNTDVSSFMVQGDESKARLLVTSSEFAFHPGPWHVAIVARRFIRKTAYQFYPNVS 480
Query: 481 FDTGDIQLRVGFHF 494
FDTGDIQLRVGFHF
Sbjct: 481 FDTGDIQLRVGFHF 494
>gi|150008335|ref|YP_001303078.1| hypothetical protein BDI_1707 [Parabacteroides distasonis ATCC
8503]
gi|149936759|gb|ABR43456.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
Length = 493
Score = 466 bits (1198), Expect = e-129, Method: Compositional matrix adjust.
Identities = 234/492 (47%), Positives = 326/492 (66%), Gaps = 6/492 (1%)
Query: 3 RRFLSLLLLYILSSISLSAQRFPMVQGIELDTDSLFSLPKRPWRAIGKTIGVNLAVWGFD 62
++ + + L+ +LSS + Q FP + +D+ +P+RPW A + G+N+AVW FD
Sbjct: 4 KKCMLIGLICLLSSQWMWGQHFPQMDARNYVSDTALFIPRRPWLAASEVFGMNMAVWTFD 63
Query: 63 HFIMNEDFADISWQTIKSNFQTGFGWDNDKFVTNLFAHPYHGSLYFNAARANGLSFRHSA 122
F+MNEDFA I+ TIK NF+TG WD DKF TNL AHPYHGSLYFNAAR+NGL+F S
Sbjct: 64 RFLMNEDFAKINGHTIKQNFKTGPVWDTDKFSTNLVAHPYHGSLYFNAARSNGLNFWQSI 123
Query: 123 PFAFFGSLMWELLMENEPPSINDLCATTIGGIALGEMGHRLSDLLIDNRTTGWERTGREV 182
PFA GSLMWE ME EPPSIND+ AT+ GGI LGE+ +RLSDL IDNR+ G ER GRE+
Sbjct: 124 PFAAGGSLMWEFFMETEPPSINDMLATSFGGIELGEITYRLSDLFIDNRSHGAERVGREI 183
Query: 183 AIALINPMRFLNRLTAGEVTSVASRSGQIFQSVPINIVVDAGFRFLADKRHARTGATALT 242
LI+PMR +NR+ GE +S G+++ SVP+N +V G RFLA++ ++ G T++
Sbjct: 184 LSGLISPMRAINRIITGEAWRHSSSKGRVYTSVPVNFIVGVGPRFLAEQEGSKHGTTSMH 243
Query: 243 LNLRFDYGDPFRSETFSPYDFFQFKAGLSFSESQPLLSQINLIGILSGCQLLAHERTVLV 302
++ R DYGDPF + +SPY++FQ KAG F SQPL+SQ+N +G + G Q+ + L
Sbjct: 244 VSFRLDYGDPFNDDFYSPYEWFQLKAGFDFFSSQPLISQVNAVGAIWGKQVWSKGPRSLA 303
Query: 303 GGLFQHFDYYNSEKRISKNSEEVLVTPYRISQVAALGGGLIFQHHGKFRRRPLELYAETY 362
G+FQHFDYY+SE + NS + V PYRIS+ AA+GGGLI+ G + +++YAE Y
Sbjct: 304 AGIFQHFDYYDSE--LKSNSSQT-VAPYRISEAAAVGGGLIYYKRGTPDNK-VDVYAELY 359
Query: 363 LNVVPMGASLSDHYNVDNRDYNLGSGLSGKLYLGATYNDLWSWLLGVESYRLYTWIGYEE 422
V +GAS+SD+ ++ RDYNLGSG S K + G Y+ W++L+ +E+Y ++TW GYE
Sbjct: 360 GTGVALGASISDYLRLEERDYNLGSGYSVKAFTGLAYDKRWAFLIDLENYHIFTWKGYEP 419
Query: 423 PHQKN-TDVSSFMVQGDESKARLLVTSSEFAFHPG-PWHVAIVARRFIRKTAYQFYPNVS 480
N D + VQGD ARL V S+ A+ W++A+ R F R+T Y+++ ++
Sbjct: 420 DIDWNAVDPTKLNVQGDAGNARLTVFSTTLAYMSKHKWNIALRNRYFSRRTHYKYHKDID 479
Query: 481 FDTGDIQLRVGF 492
T DI L +G+
Sbjct: 480 SSTYDIMLTLGW 491
>gi|154492667|ref|ZP_02032293.1| hypothetical protein PARMER_02302 [Parabacteroides merdae ATCC
43184]
gi|154086972|gb|EDN86017.1| hypothetical protein PARMER_02302 [Parabacteroides merdae ATCC
43184]
Length = 492
Score = 426 bits (1095), Expect = e-117, Method: Compositional matrix adjust.
Identities = 225/492 (45%), Positives = 315/492 (64%), Gaps = 7/492 (1%)
Query: 3 RRFLSLLLLYILSSISLSAQRF-PMVQGIELDTDSLFSLPKRPWRAIGKTIGVNLAVWGF 61
++F+ L+ + S + Q + P + +DS P+RPW+A +T G+N+ VW F
Sbjct: 2 KKFILTGLICLFSFPWIRGQEYIPQITHRHYTSDSTLLQPRRPWKAALETFGLNMLVWSF 61
Query: 62 DHFIMNEDFADISWQTIKSNFQTGFGWDNDKFVTNLFAHPYHGSLYFNAARANGLSFRHS 121
D +I+ ED+A I+ TIKSNF+ G WD D+F TNLF+HPYHGSLYFNAAR+NG++F S
Sbjct: 62 DRYIVKEDWAYINGHTIKSNFKKGPVWDTDQFTTNLFSHPYHGSLYFNAARSNGMNFWQS 121
Query: 122 APFAFFGSLMWELLMENEPPSINDLCATTIGGIALGEMGHRLSDLLIDNRTTGWERTGRE 181
APFA GSLMWE MENEPPSIND+ ATT GGI LGE+ +RLSDL IDNR++G ER GRE
Sbjct: 122 APFAAGGSLMWEFFMENEPPSINDMLATTFGGIELGEITYRLSDLFIDNRSSGAERVGRE 181
Query: 182 VAIALINPMRFLNRLTAGEVTSVASRSGQIFQSVPINIVVDAGFRFLADKRHARTGATAL 241
V +++P+R NR+ +GE S G+ + SVP+N +V G RFLA++ ++ G T+L
Sbjct: 182 VLAGILSPVRAFNRIISGEAWRHGSSKGRTYSSVPVNFIVTMGPRFLAEQEGSKRGTTSL 241
Query: 242 TLNLRFDYGDPFRSETFSPYDFFQFKAGLSFSESQPLLSQINLIGILSGCQLLAHERTVL 301
+N R DYG+PF + +SPY++F+F G+ +QP++SQ+N IG L G + L
Sbjct: 242 NINFRIDYGNPFNDDFYSPYEWFRFNFGMDLFSAQPVVSQVNAIGALWGKTVWTKGPRSL 301
Query: 302 VGGLFQHFDYYNSEKRISKNSEEVLVTPYRISQVAALGGGLIFQHHGKFRRRPLELYAET 361
GLFQHFD+YNSE R + ++ V PYRI+ AA GGGLI+ + ++YAE
Sbjct: 302 SAGLFQHFDFYNSELR---DGSDLTVPPYRIAAPAAAGGGLIYYKQATSGDK-TDIYAEL 357
Query: 362 YLNVVPMGASLSDHYNVDNRDYNLGSGLSGKLYLGATYNDLWSWLLGVESYRLYTWIGYE 421
Y N V +GASLSD+ + RDYNLGSG S K G YN ++LL +E+Y ++TW GY+
Sbjct: 358 YANGVALGASLSDYMLLGERDYNLGSGYSTKAGAGIIYNRRLAFLLNMENYHIFTWKGYD 417
Query: 422 -EPHQKNTDVSSFMVQGDESKARLLVTSSEFAF-HPGPWHVAIVARRFIRKTAYQFYPNV 479
E D + +QGD ARL + S + A+ W++ + R F R+T Y++YP V
Sbjct: 418 PEIDWSQADPGTLNIQGDAGNARLTIFSMKLAYLLMDKWNITLTNRYFSRRTNYRYYPRV 477
Query: 480 SFDTGDIQLRVG 491
+ T D+ L +G
Sbjct: 478 DYSTYDLMLGLG 489
>gi|150006353|ref|YP_001301097.1| hypothetical protein BVU_3872 [Bacteroides vulgatus ATCC 8482]
gi|149934777|gb|ABR41475.1| conserved hypothetical protein [Bacteroides vulgatus ATCC 8482]
Length = 491
Score = 350 bits (897), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 198/500 (39%), Positives = 289/500 (57%), Gaps = 15/500 (3%)
Query: 1 MKRRFLSLLLLYILSSI-SLSAQRFPMVQGIELDTDSLFSLPKRPWRAIGKTIGVNLAVW 59
MK + ++L+ L LSS +++AQ + TD K PWRA +T G+N+ VW
Sbjct: 1 MKLQHIALVCLLALSSAGNVTAQMLHRPDSMYTFTDPRLQ-KKHPWRAAAETFGMNVGVW 59
Query: 60 GFDHFIMNEDFADISWQTIKSNFQTGFGWDNDKFVTNLFAHPYHGSLYFNAARANGLSFR 119
FD ++MNEDFA IS +I+ N + GF WDND+F TNLFAHPYHG+LYFNAAR+NGL+F
Sbjct: 60 AFDRYVMNEDFAKISIGSIRRNIKHGFVWDNDQFSTNLFAHPYHGNLYFNAARSNGLTFW 119
Query: 120 HSAPFAFFGSLMWELLMENEPPSINDLCATTIGGIALGEMGHRLSDLLIDNRTTGWERTG 179
SAP+AF GSLMWE+ E EPP+INDL ATT+GGIALGE+ HR+S L++D+ G+ R
Sbjct: 120 ESAPYAFAGSLMWEIAAEVEPPAINDLMATTLGGIALGEVTHRMSSLVLDDSKRGFSRFT 179
Query: 180 REVAIALINPMRFLNRLTAGEVTSVASRSGQI--FQSVPINIVVDAGFRFLADKRHARTG 237
RE LI PMR LNR+ GE+ V + + +P++ + AG R+LAD + G
Sbjct: 180 REFLGTLICPMRGLNRMITGEMWKVKRSHYKYHDYDRIPVHFSIGAGDRYLADDNYLFRG 239
Query: 238 ATALTLNLRFDYGDPFRSETFSPYDFFQFKAGLSFSESQPLLSQINLIGILSGCQLLAHE 297
L R YGD F PYD+F +A S +QPL+SQINL+G L G L
Sbjct: 240 EHNPYLEFRVQYGDAFDKVNDGPYDYFTARATFGLSGNQPLISQINLMGKLWGVPLKTTT 299
Query: 298 RTVLVGGLFQHFDYYNSEKRISKNSEEVLVTPYRISQVAALGGGLIFQHHGKFRRRP--L 355
++ G+FQHF+Y++SE+ I + PY+IS+ A++G G+I+ KF R +
Sbjct: 300 GMEMMFGIFQHFNYFDSEEVIDGSGR----IPYKISEAASVGPGMIY----KFPRMNSLV 351
Query: 356 ELYAETYLNVVPMGASLSDHYNVDNRDYNLGSGLSGKLYLGATYNDLWSWLLGVESYRLY 415
L +L+ + +G SL+D+YNV +R+YN+GSG S K + + L + Y+++
Sbjct: 352 NLEQRVFLSAILLGGSLTDYYNVIDRNYNMGSGYSIKNNTILDFGRYGMFALNMHLYQIF 411
Query: 416 TWIGYEEPHQKNTDVSSFMVQGDESKARLLVTSSEFAFH-PGPWHVAIVARRFIRKTAYQ 474
TW GYE + D QGD+ L V + + + + + R T Y
Sbjct: 412 TWKGYEHKDLETIDPLYLNAQGDKGNVMLAVVNPIIELNLSSHFKANMEVSYYYRHTHYS 471
Query: 475 FYPNVSFDTGDIQLRVGFHF 494
++ ++ + T + +L + + F
Sbjct: 472 YHEDIKYKTFETRLGLIYQF 491
>gi|108762884|ref|YP_632187.1| hypothetical protein MXAN_4007 [Myxococcus xanthus DK 1622]
gi|108466764|gb|ABF91949.1| conserved hypothetical protein [Myxococcus xanthus DK 1622]
Length = 466
Score = 206 bits (525), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 139/402 (34%), Positives = 209/402 (51%), Gaps = 35/402 (8%)
Query: 9 LLLYILSSISLSAQRFPMVQGIELDTDSLFSLPKRP---WRAIGKTIGVNLAVWGFDHFI 65
LL+ +L S++ +A+ FP + + + P RP W A+G+ +NL VWGF+ +I
Sbjct: 21 LLVSLLPSLAQAAE-FP-ADAPRAEDEEAAAPPPRPRQFWWALGEVTAINLTVWGFNRYI 78
Query: 66 MNEDFADISWQTIKSNFQTGFGWDNDKFVTNLFAHPYHGSLYFNAARANGLSFRHSAPFA 125
MN+DFA + ++NF+ GF WD D F TN FAHPYHGS YFNAAR +G ++ + F
Sbjct: 79 MNKDFARVGLDAWRTNFEMGFVWDKDDFSTNQFAHPYHGSTYFNAARDHGFNYWGAMGFT 138
Query: 126 FFGSLMWELLMENEPPSINDLCATTIGGIALGEMGHRLSDLLIDNRTTGWERTGREVAIA 185
GSL WEL EN PS NDL TT+GG A GE+ +RLS +L+D R TG R REV+
Sbjct: 139 LVGSLQWELFAENHLPSYNDLINTTMGGWAHGEVIYRLSSMLLDQRATGTRRVAREVSAG 198
Query: 186 LINPMRFLNRLTAGEVTSVASRSGQIFQSVPINIVVDAGFRFLADKRHARTGATALTLNL 245
L+NP+R NR+ G+ A + + +Q ++ G+ + A T L
Sbjct: 199 LLNPIRGFNRVVRGDAWRSAP-TPKDWQPPVFAVLARTGYLNI----DAGTTLNQFFAEL 253
Query: 246 RFDYGDPFRSETFSPYDFFQFKAGLSFSESQPLLSQINLIGILSGCQL--LAHERTVLVG 303
YGD R +P+D F + E++ L+S+ + G L+ L +ER +L
Sbjct: 254 DLRYGDAVRFPVDTPFDAFNMGVQFTTGENR-LISRAEVKGALAMLPLENSGNERVLL-- 310
Query: 304 GLFQHFDYYNSEKRISKNSEEVLVTPYRISQVAALGGGLIFQHHGKFRRRPLELYAETYL 363
G FQHFDY +++ Y + ++G GL++++ + R EL +L
Sbjct: 311 GAFQHFDYNDTQA-------------YELGG-QSIGAGLVYRYLTQGGR---ELRLALHL 353
Query: 364 NVVPMGASLSDHYNVDNRDYNLGSGLSGKLYLGATYNDLWSW 405
V + S++ + RDY+ G GL + A+Y W W
Sbjct: 354 RGVVLAGITSEYADKVGRDYDYGPGLG--FHFEASYGR-WPW 392
>gi|153005936|ref|YP_001380261.1| hypothetical protein Anae109_3081 [Anaeromyxobacter sp. Fw109-5]
gi|152029509|gb|ABS27277.1| conserved hypothetical protein [Anaeromyxobacter sp. Fw109-5]
Length = 524
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/304 (29%), Positives = 143/304 (47%), Gaps = 23/304 (7%)
Query: 27 VQGIELDTDSLFSLPKRPWRAIGKTIGVNLAVWGFDHFIMNEDFADISWQTIKSNFQTGF 86
V G+E LP+ I ++ V L + G++ I ++A I+ TI N Q+ +
Sbjct: 93 VPGVEGRPARRGPLPEGHLVPIIESSVVTLVMLGWNATIGEAEWARINADTIGRNLQSAW 152
Query: 87 GWDNDKFVTNLFAHPYHGSLYFNAARANGLSFRHSAPFAFFGSLMWELLMENEPPSINDL 146
D+D F N HPY G + AAR+ G+ F S P+ F S +WEL+ E PPS+ND
Sbjct: 153 VLDDDAFWINQVGHPYQGIFPYGAARSAGVGFWASTPYPFVTSAVWELVGETTPPSVNDQ 212
Query: 147 CATTIGGIALGEMGHRLSDLLIDNRTTGWERTGREVAIALINPMRFLNRLTAGEVTSVAS 206
T++ G+ LGE HR + +++ + W R+ A ++ PM +NR G
Sbjct: 213 ITTSVAGVVLGEALHRATGMILAGPRSPW----RQAAAMIVAPMEAINRELVGTEPHA-- 266
Query: 207 RSGQIFQSVPINIVVDAGFRFLADKRHAR-TGATALTLNLRFDYGDP----FRSETFSPY 261
S P + AG KR A T + +R YG P FR E P+
Sbjct: 267 ------PSPPSRVEFRAGALVFDPKRGAPGTEGIVPEIGVRLSYGLPGDPGFRFE--RPF 318
Query: 262 DFFQFKAGLSFSESQPLLSQINLIGILSGCQLLAHERTVLVGGLFQHFDYYNSEKRISKN 321
D F+ ++ +FS + L+ + G+++G L ER + GL FD +++ +R S +
Sbjct: 319 DRFELES--TFSMEKNPLATLFARGVVAGTTLEG-ERVRGLAGLSLEFD-FSAVRRYSVS 374
Query: 322 SEEV 325
+ V
Sbjct: 375 TSAV 378
>gi|108756929|ref|YP_633429.1| hypothetical protein MXAN_5277 [Myxococcus xanthus DK 1622]
gi|108460809|gb|ABF85994.1| conserved hypothetical protein [Myxococcus xanthus DK 1622]
Length = 496
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 78/225 (34%), Positives = 110/225 (48%), Gaps = 21/225 (9%)
Query: 59 WGFDHFIMNEDFADISWQTIKSNFQTGFGWDNDKFVTNLFAHPYHGSLYFNAARANGLSF 118
WG ++ + +ADIS +T++ N G+ WD D F TN HPY GSL + AAR++G F
Sbjct: 90 WGTAK-LLGKPYADISLETMRRNVADGWEWDEDAFATNALGHPYQGSLAYTAARSSGFGF 148
Query: 119 RHSAPFAFFGSLMWELLMENEPPSINDLCATTIGGIALGEMGHRLSDLLIDNRTTGWERT 178
S PF S++WEL ME+E PS NDL T+IGG+ LGE+ HR + ++D G
Sbjct: 149 WGSVPFTVASSVLWELFMESETPSTNDLITTSIGGVVLGEVLHRTALAILDE---GEPTV 205
Query: 179 GREVAIALINPMRFLNRLTAGEVTSVASRSGQIFQSVPINIV-VDAGFRFLADKRHARTG 237
R V L+ P+ +NR G R F V N + + AGF D + G
Sbjct: 206 ARSVLSLLVEPVGGINRYLFG-------RRKNDFLPVQGNFLQLMAGFTSNVDNYNDIDG 258
Query: 238 A-------TALTLNLRFDYGDPFRS--ETFSPYDFFQFKAGLSFS 273
L + YG P + + P+D+F A + FS
Sbjct: 259 TRFDVPNKMEGHLGVEMVYGPPLLTTGDYDQPFDYFTLTARMGFS 303
>gi|86156917|ref|YP_463702.1| hypothetical protein Adeh_0489 [Anaeromyxobacter dehalogenans
2CP-C]
gi|85773428|gb|ABC80265.1| hypothetical protein Adeh_0489 [Anaeromyxobacter dehalogenans
2CP-C]
Length = 510
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 112/377 (29%), Positives = 167/377 (44%), Gaps = 45/377 (11%)
Query: 50 KTIGVNLAVWGFDHFIMNEDFADISWQTIKSNFQTGFGWDNDKFVTNLFAHPYHGSLYFN 109
++ V L + G++ ++ +ADIS ++I N ++ + D+D+F N F HPY G
Sbjct: 91 ESAAVTLVMMGWNRWVGEAPWADISGESIGRNLRSPWVLDDDQFWINQFGHPYQGMWSMT 150
Query: 110 AARANGLSFRHSAPFAFFGSLMWELLMENEPPSINDLCATTIGGIALGEMGHRLSDLLID 169
AAR+ GL F S P+ F SL+WE+ E EPPSIND T IGGI LGE+ RL+ +L
Sbjct: 151 AARSAGLGFWRSTPYTFASSLVWEIAGETEPPSINDQITTPIGGIVLGEVLFRLAGILRG 210
Query: 170 NRTTGWERTGREVAIALINPMRFLNR--LTAGEVTSVASRSGQIFQSVPINIVVDAGFRF 227
+ W RE+ L++PM LN L G R+ + P+ R
Sbjct: 211 DGRNPW----RELGAGLLSPMDALNHGLLHDGHDELDEPRA-WTLSAGPVAA------RL 259
Query: 228 LADKRHARTG-ATALTLNLRFDYG----DPFRSETFSPYDFFQFKAGLSFSESQPLLSQI 282
D G T+ + R +G FR E P+D F+ +A + S P + I
Sbjct: 260 AWDAAGPSPGWETSPEVAFRMVHGLAWHPAFRFE--RPFDHFELQAAYA-GRSNPDATLI 316
Query: 283 NLIGILSGCQLLAHERTVLVGGLFQHFDYYNSEKRISKNSEEVLVTP--YRISQVAALGG 340
L G+L+G V GL+ FD L TP +R+S +ALG
Sbjct: 317 -LRGLLAGATYGDAGGWRGVYGLYGSFD---------------LDTPGVFRVS-TSALG- 358
Query: 341 GLIFQHHGKFRRRPLELYAETYLNVVPMGAS--LSDHYNVDNRDYNLGSGLSGKLYLGAT 398
+ G+ L + ++ + MG ++ RDY LG G L L T
Sbjct: 359 --LGTSGGRLLAHDLLVEGTAIVSGILMGGGGQVARDAEGKGRDYRLGPGGQSVLELQLT 416
Query: 399 YNDLWSWLLGVESYRLY 415
+ + LG+ Y L+
Sbjct: 417 AAERAAVRLGLRQYLLF 433
>gi|163765551|ref|ZP_02172583.1| conserved hypothetical protein [Anaeromyxobacter sp. K]
gi|160353879|gb|EDP80558.1| conserved hypothetical protein [Anaeromyxobacter sp. K]
Length = 517
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 113/379 (29%), Positives = 167/379 (44%), Gaps = 49/379 (12%)
Query: 50 KTIGVNLAVWGFDHFIMNEDFADISWQTIKSNFQTGFGWDNDKFVTNLFAHPYHGSLYFN 109
++ V LA+ G++ ++ +ADI+ +I N ++ + D+D+F N F HPY G
Sbjct: 98 ESAAVTLAMMGWNRWVGEAPWADITGDSIGRNLRSAWVLDDDQFWINQFGHPYQGMWSMT 157
Query: 110 AARANGLSFRHSAPFAFFGSLMWELLMENEPPSINDLCATTIGGIALGEMGHRLSDLLID 169
AAR+ GL F S P+ SL+WE+ E EPPSIND T IGGI LGE+ RL+ +L
Sbjct: 158 AARSAGLGFWGSTPYTIASSLVWEIAGETEPPSINDQITTPIGGIVLGEVLFRLAGILRG 217
Query: 170 NRTTGWERTGREVAIALINPMRFLNR--LTAGEVTSVASRSGQIFQSVPINIVVDAGFRF 227
+ W RE+ L+ PM LN L G R+ + P+ R
Sbjct: 218 DGRNPW----RELGAGLLAPMDALNHGLLHDGHDELDEPRA-WTLSAGPVAA------RL 266
Query: 228 LADKRHARTG-ATALTLNLRFDYG----DPFRSETFSPYDFFQFKAGLSFSESQPLLSQI 282
D G T+ + R +G FR E P+D F+ +A + S P + I
Sbjct: 267 AWDAAGPPPGWETSPEVAFRMVHGLAWHPAFRFE--RPFDHFELEAAYA-GRSNPDATLI 323
Query: 283 NLIGILSGCQLLAHERTVLVGGLFQHFDYYNSEKRISKNSEEVLVTP--YRISQVAALG- 339
L G+L+G V GL+ FD L TP +R+S +ALG
Sbjct: 324 -LRGLLAGATYGDPAGWRGVYGLYGSFD---------------LDTPGVFRVS-TSALGL 366
Query: 340 ---GGLIFQHHGKFRRRPLELYAETYLNVVPMGASLSDHYNVDNRDYNLGSGLSGKLYLG 396
GG + H + +E A ++ G ++ RDY LG G L L
Sbjct: 367 GTSGGRLLAHDVR-----VEGTAIVSGILIGGGGQVARDAEGKGRDYRLGPGGQSVLELQ 421
Query: 397 ATYNDLWSWLLGVESYRLY 415
T + + LG+ +Y L+
Sbjct: 422 LTAAERATVRLGLRNYLLF 440
>gi|167455892|ref|ZP_02322113.1| conserved hypothetical protein [Anaeromyxobacter dehalogenans
2CP-1]
gi|167417986|gb|EDR84687.1| conserved hypothetical protein [Anaeromyxobacter dehalogenans
2CP-1]
Length = 517
Score = 109 bits (272), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 113/379 (29%), Positives = 166/379 (43%), Gaps = 49/379 (12%)
Query: 50 KTIGVNLAVWGFDHFIMNEDFADISWQTIKSNFQTGFGWDNDKFVTNLFAHPYHGSLYFN 109
++ V LA+ G++ ++ +ADI+ +I N ++ + D+D+F N F HPY G
Sbjct: 98 ESAAVTLAMMGWNRWVGEAPWADITGDSIGRNLRSAWVLDDDQFWINQFGHPYQGMWSMT 157
Query: 110 AARANGLSFRHSAPFAFFGSLMWELLMENEPPSINDLCATTIGGIALGEMGHRLSDLLID 169
AAR+ GL F S P+ SL+WE+ E EPPSIND T IGGI LGE+ RL+ +L
Sbjct: 158 AARSAGLGFWGSTPYTIASSLVWEIAGETEPPSINDQITTPIGGIVLGEVLFRLAGILRG 217
Query: 170 NRTTGWERTGREVAIALINPMRFLNR--LTAGEVTSVASRSGQIFQSVPINIVVDAGFRF 227
+ W RE+ L+ PM LN L G R+ + P+ R
Sbjct: 218 DGRNPW----RELGAGLLAPMDALNHGLLHDGHDELDEPRA-WTLSAGPVAA------RL 266
Query: 228 LADKRHARTG-ATALTLNLRFDYG----DPFRSETFSPYDFFQFKAGLSFSESQPLLSQI 282
D G T+ + R +G FR E P+D F+ +A + S P + I
Sbjct: 267 AWDAAGPPPGWETSPEVAFRMVHGLAWHPAFRFE--RPFDHFELEAAYA-GRSNPDATLI 323
Query: 283 NLIGILSGCQLLAHERTVLVGGLFQHFDYYNSEKRISKNSEEVLVTP--YRISQVAALG- 339
L G+L+G V GL+ FD L TP +R+S +ALG
Sbjct: 324 -LRGLLAGATYGDPAGWRGVYGLYGSFD---------------LDTPGVFRVS-TSALGL 366
Query: 340 ---GGLIFQHHGKFRRRPLELYAETYLNVVPMGASLSDHYNVDNRDYNLGSGLSGKLYLG 396
GG + H + +E A ++ G ++ RDY LG G L L
Sbjct: 367 GTSGGRLLAHDVR-----VEGTAIVSGILIGGGGQVARDAEGKGRDYRLGPGGQSVLELQ 421
Query: 397 ATYNDLWSWLLGVESYRLY 415
T + + LG+ Y L+
Sbjct: 422 LTAAERATVRLGLRHYLLF 440
>gi|163766903|ref|ZP_02173923.1| conserved hypothetical protein [Anaeromyxobacter sp. K]
gi|160352661|gb|EDP79352.1| conserved hypothetical protein [Anaeromyxobacter sp. K]
Length = 522
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 101/351 (28%), Positives = 159/351 (45%), Gaps = 41/351 (11%)
Query: 50 KTIGVNLAVWGFDHFIMNEDFADISWQTIKSNFQTGFGWDNDKFVTNLFAHPYHGSLYFN 109
+++ VNLA+ G++ ++ +ADIS +I N ++ + D+D+F N F HPY G+ F+
Sbjct: 113 ESLSVNLAMMGWNRWVGAAPWADISGDSIGRNLRSRWVLDDDQFWVNQFGHPYQGTWAFS 172
Query: 110 AARANGLSFRHSAPFAFFGSLMWELLMENEPPSINDLCATTIGGIALGEMGHRLSDLLID 169
AAR+ G F S+ +AF S +WE+ E E PS ND TT+ G+ LGE+ R SD L
Sbjct: 173 AARSAGQGFWVSSAYAFAASGLWEIAGETERPSRNDQVTTTVAGMVLGEILGRFSDAL-- 230
Query: 170 NRTTGWERTGREVAIALINPMRFLNRLTAGEVTSVASRSGQIFQSVPINIVVDAGFRFLA 229
R G G +A A+++PM +NR G + S + ++ G R
Sbjct: 231 -RAEGGTWNG--MAAAVLSPMGAVNRRLLGPTRPERATSSRWALAMGGATGDATGDR--- 284
Query: 230 DKRHARTGATALTLNLRFDYGDPFRS--ETFSPYDFFQFKAGLSFSESQPLLSQINLIGI 287
G + L F +G P E P+D F +A F+ + + G+
Sbjct: 285 -------GTWGGGVGLDFAWGLPGDPDLELDRPFDHFVLEARYGFARDPD--ATVRARGL 335
Query: 288 LSGCQLLAHERTVLVGGLFQHFDYYNSEK-RISKNSEEVLVTPYRISQVAALGGGLIFQH 346
L+G A L GLF FD+ ++ R+S ++ + + A LGGGL +
Sbjct: 336 LAGRTFDAAPARGLY-GLFLLFDFDTPQRFRVSTSALGAGASAH-----ADLGGGLALE- 388
Query: 347 HGKFRRRPLELYAETYLNVVPMGASLSDHYNVD--NRDYNLGSGLSGKLYL 395
+ V +GA+ + D RDY +G G G+ L
Sbjct: 389 ------------GDLVGAAVILGAAGTVDRGDDGKGRDYRIGPGAQGQAGL 427
>gi|86159909|ref|YP_466694.1| hypothetical protein Adeh_3491 [Anaeromyxobacter dehalogenans
2CP-C]
gi|85776420|gb|ABC83257.1| hypothetical protein Adeh_3491 [Anaeromyxobacter dehalogenans
2CP-C]
Length = 529
Score = 106 bits (264), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 101/347 (29%), Positives = 158/347 (45%), Gaps = 41/347 (11%)
Query: 50 KTIGVNLAVWGFDHFIMNEDFADISWQTIKSNFQTGFGWDNDKFVTNLFAHPYHGSLYFN 109
+++ VNLA+ G++ ++ +ADIS +I N ++ + D+D+F N F HPY G+ F+
Sbjct: 120 ESLSVNLAMMGWNRWVGEAPWADISGDSIGRNLRSRWVLDDDQFWVNQFGHPYQGTWAFS 179
Query: 110 AARANGLSFRHSAPFAFFGSLMWELLMENEPPSINDLCATTIGGIALGEMGHRLSDLLID 169
AAR+ G F S+ +AF S +WE+ E E PS ND TT+ G+ LGE+ R SD L
Sbjct: 180 AARSAGQGFWVSSAYAFAASGLWEIAGETERPSRNDQVTTTVAGMVLGEILGRFSDAL-- 237
Query: 170 NRTTGWERTGREVAIALINPMRFLNRLTAGEVTSVASRSGQIFQSVPINIVVDAGFRFLA 229
R G G +A A+++PM +NR G T+ R+ ++ +
Sbjct: 238 -RAEGGTWNG--MAAAVLSPMGAVNRRLLG--TTRPERASSSRWALAMGGAT-------G 285
Query: 230 DKRHARTGATALTLNLRFDYGDPFRSETF--SPYDFFQFKAGLSFSESQPLLSQINLIGI 287
D R G + L F +G P + P+D F +A F+ + + G+
Sbjct: 286 DATGDR-GTWGGGVGLDFAWGLPGDPDLALEKPFDHFVLEARYGFARDPD--ATVRARGL 342
Query: 288 LSGCQLLAHERTVLVGGLFQHFDYYNSEK-RISKNSEEVLVTPYRISQVAALGGGLIFQH 346
L+G A L GLF FD+ ++ R+S ++ S A LGGGL +
Sbjct: 343 LAGRSFEAAPARGLY-GLFLLFDFDTPQRFRVSTSALGAGA-----SARADLGGGLALE- 395
Query: 347 HGKFRRRPLELYAETYLNVVPMGASLSDHYNVD--NRDYNLGSGLSG 391
+ V +GA+ + D RDY +G G G
Sbjct: 396 ------------GDLVGAAVILGAAGTVDRGDDGKGRDYRIGPGAQG 430
>gi|167456939|ref|ZP_02323155.1| conserved hypothetical protein [Anaeromyxobacter dehalogenans
2CP-1]
gi|167416762|gb|EDR83473.1| conserved hypothetical protein [Anaeromyxobacter dehalogenans
2CP-1]
Length = 523
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 90/298 (30%), Positives = 141/298 (47%), Gaps = 24/298 (8%)
Query: 50 KTIGVNLAVWGFDHFIMNEDFADISWQTIKSNFQTGFGWDNDKFVTNLFAHPYHGSLYFN 109
+++ VNLA+ G++ ++ +ADIS +I N ++ + D+D+F N F HPY G+ F+
Sbjct: 114 ESLSVNLAMMGWNRWVGAAPWADISGDSIGRNLRSRWVLDDDQFWVNQFGHPYQGTWAFS 173
Query: 110 AARANGLSFRHSAPFAFFGSLMWELLMENEPPSINDLCATTIGGIALGEMGHRLSDLLID 169
AAR+ G F S+ +AF S +WE+ E E PS ND TT+ G+ LGE+ R SD L
Sbjct: 174 AARSAGQGFWVSSAYAFAASGLWEIAGETERPSRNDQVTTTVAGMVLGEILGRFSDAL-- 231
Query: 170 NRTTGWERTGREVAIALINPMRFLNRLTAGEVTSVASRSGQIFQSVPINIVVDAGFRFLA 229
R G G VA ++++PM +NR G + S + ++ G R
Sbjct: 232 -RAEGGTWNG--VAASVLSPMGAVNRRLLGPTRPERATSSRWALAMGGATGDATGDR--- 285
Query: 230 DKRHARTGATALTLNLRFDYGDPFRS--ETFSPYDFFQFKAGLSFSESQPLLSQINLIGI 287
G + L F +G P E P+D F +A F+ + + G+
Sbjct: 286 -------GTWGGGVGLDFAWGLPGDPDLELEQPFDHFVLEARYGFARDPD--ATVRARGL 336
Query: 288 LSGCQLLAHERTVLVGGLFQHFDYYNSEKRISKNSEEVLVTPYRISQVAALGGGLIFQ 345
L+G A L GLF FD +++ +R ++ + S A LGGGL +
Sbjct: 337 LAGRSFEAAPARGLY-GLFLLFD-FDTPQRFHVSTSALGAG---ASARADLGGGLALE 389
>gi|124265225|ref|YP_001019229.1| hypothetical protein Mpe_A0032 [Methylibium petroleiphilum PM1]
gi|124258000|gb|ABM92994.1| hypothetical protein Mpe_A0032 [Methylibium petroleiphilum PM1]
Length = 496
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 85/267 (31%), Positives = 124/267 (46%), Gaps = 18/267 (6%)
Query: 56 LAVWGFDHFI--MNEDFA-----DISWQTIKSNFQTGFGWDNDKFVTNLFAHPYHGSLYF 108
L + GFD + N F+ D+S +I+ N + + DND F N FAHPY GSLY
Sbjct: 73 LQIIGFDFLLNRYNRRFSGTSDYDVSRASIRRNLRGPWVVDNDPFDINQFAHPYQGSLYH 132
Query: 109 NAARANGLSFRHSAPFAFFGSLMWELLMENEPPSINDLCATTIGGIALGEMGHRLSDLLI 168
AARA+GLS+ +A + F GS+ WE+ E PPS ND A+ I G LGE R++ L++
Sbjct: 133 GAARASGLSYWEAAGYTFAGSVGWEIFGEQTPPSYNDQIASGIAGSFLGEPLFRMAHLVL 192
Query: 169 DNRTTGWERTGREVAIALINPMRFLNRLTAGEVTSVA--SRSGQIFQSVPINIVVDAGFR 226
N G R RE A I+P RLT G A + + + + A
Sbjct: 193 -NGPGGLSRPWREAIAAGISPSLGFTRLTFGSRFDGAFVDNDPVYYSRLRLGVARAAQDN 251
Query: 227 FLADKRHARTGATALTLNLRFDYGDPFRSE-TF-SPYDFFQFKAGLSFSESQPLLSQINL 284
+ R A ++ DYG P + T+ P+D+F F A S + + +
Sbjct: 252 LGTSRDLERDEA---VIDFALDYGLPGKDGYTYRRPFDYFSFHAAASTANG---VETLAT 305
Query: 285 IGILSGCQLLAHERTVLVGGLFQHFDY 311
G+L G + + GL+ +DY
Sbjct: 306 RGLLFGTDYAIGDTVRGLWGLYGSYDY 332
>gi|145620156|ref|ZP_01776194.1| hypothetical protein GbemDRAFT_2558 [Geobacter bemidjiensis Bem]
gi|144943568|gb|EDJ78655.1| hypothetical protein GbemDRAFT_2558 [Geobacter bemidjiensis Bem]
Length = 534
Score = 81.6 bits (200), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 76/149 (51%), Gaps = 10/149 (6%)
Query: 60 GFDHFIMNEDFADISWQTIKSNFQT--------GFGWDNDKFVTNLFAHPYHGSLYFNAA 111
GFD I+N+ D + +NF T + +D D F N F HPY G+ + A
Sbjct: 109 GFDRLILNDSEKD-GKKVYSTNFSTIWDHLHRQNWNFDQDSFKVNQFGHPYEGATMYGLA 167
Query: 112 RANGLSFRHSAPFAFFGSLMWELLMENEPPSINDLCATTIGGIALGEMGHRLSDLLIDNR 171
R++GL+F S ++ GS +WE+ E PSIND T G LGE R++ L++++
Sbjct: 168 RSSGLNFWQSLVYSNVGSFLWEMAGETSRPSINDQITTGNAGSLLGEALFRMAGLVLEDA 227
Query: 172 TTGWERTGREVAIALINPMRFLNRLTAGE 200
G ++A A+I+P NR+ G+
Sbjct: 228 GPN-PPLGHKLAAAVISPSTAFNRIAFGD 255
>gi|15611806|ref|NP_223457.1| hypothetical protein jhp0739 [Helicobacter pylori J99]
gi|4155309|gb|AAD06325.1| putative [Helicobacter pylori J99]
Length = 279
Score = 69.3 bits (168), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 61/184 (33%), Positives = 90/184 (48%), Gaps = 21/184 (11%)
Query: 41 PKRPWRAIGKTIG---VNLAVWGFDHFIM--------NEDFADISWQTIKSNFQTGFGWD 89
P W+ +G +IG V+L + ++M E F SW N + G D
Sbjct: 82 PNSRWKYLGTSIGILGVSLVIGIVGLYLMPESVTNWDKEKFGVKSWF---ENVRMGPKLD 138
Query: 90 NDKFVTNLFAHPYHGSLYFNAARANGLSFRHSAPFAFFGS-LMWELLMEN--EPPSINDL 146
ND F+ N HPY G++Y+ R G S+ SA F+F S L WE +E E PS DL
Sbjct: 139 NDSFIFNEILHPYFGAMYYMQPRMAGFSWMTSAFFSFITSTLFWEYGLEAFVEVPSWQDL 198
Query: 147 CATTIGGIALGEMGHRLSDLLIDN--RTTGWERTGREVAIALINPMRFLNR-LTAGEVTS 203
T + G LGE ++L+ + N + G GR +AIAL++P+ F+ R L GE
Sbjct: 199 VITPLLGSILGEGFYQLTRYIQRNEGKLFGSLFLGR-LAIALMDPIGFIIRDLGLGEALG 257
Query: 204 VASR 207
+ ++
Sbjct: 258 IYNK 261
>gi|15645422|ref|NP_207596.1| hypothetical protein HP0803 [Helicobacter pylori 26695]
gi|2313940|gb|AAD07858.1| predicted coding region HP0803 [Helicobacter pylori 26695]
Length = 279
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 91/183 (49%), Gaps = 19/183 (10%)
Query: 41 PKRPWRAIGKTIG---VNLAVWGFDHFIMNEDFADISWQT----IKS---NFQTGFGWDN 90
P W+ +G +IG V+L + ++M E + W IKS N + G DN
Sbjct: 82 PNSRWKYLGTSIGILGVSLVIGIVGLYLMPESVTN--WDKEKFGIKSWFENVRMGPKLDN 139
Query: 91 DKFVTNLFAHPYHGSLYFNAARANGLSFRHSAPFAFFGS-LMWELLMEN--EPPSINDLC 147
D F+ N HPY G++Y+ R G S+ SA F+F S L WE +E E PS DL
Sbjct: 140 DSFIFNEILHPYFGAMYYMQPRMAGFSWMASAFFSFITSTLFWEYGLEAFVEVPSWQDLV 199
Query: 148 ATTIGGIALGEMGHRLSDLLIDN--RTTGWERTGREVAIALINPMRFLNR-LTAGEVTSV 204
T + G LGE ++L+ + N + G GR V IAL++P+ F+ R L GE +
Sbjct: 200 ITPLLGSILGEGFYQLTRYIQRNEGKLFGSLFLGRLV-IALMDPIGFIIRDLGLGEALGI 258
Query: 205 ASR 207
++
Sbjct: 259 YNK 261
>gi|108563213|ref|YP_627529.1| hypothetical protein HPAG1_0788 [Helicobacter pylori HPAG1]
gi|107836986|gb|ABF84855.1| hypothetical protein HPAG1_0788 [Helicobacter pylori HPAG1]
Length = 279
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 91/183 (49%), Gaps = 19/183 (10%)
Query: 41 PKRPWRAIGKTIG---VNLAVWGFDHFIMNEDFADISWQT----IKS---NFQTGFGWDN 90
P W+ +G +IG V+L + ++M E + W IKS N + G DN
Sbjct: 82 PNSRWKYLGTSIGILGVSLVIGIVGLYLMPESVTN--WDKEKFGIKSWFENVRMGPKLDN 139
Query: 91 DKFVTNLFAHPYHGSLYFNAARANGLSFRHSAPFAFFGS-LMWELLMEN--EPPSINDLC 147
D F+ N HPY G++Y+ R G S+ SA F+F S L WE +E E PS DL
Sbjct: 140 DSFIFNEILHPYFGAMYYMQPRMAGFSWMASAFFSFITSTLFWEYGLEAFVEVPSWQDLV 199
Query: 148 ATTIGGIALGEMGHRLSDLLIDN--RTTGWERTGREVAIALINPMRFLNR-LTAGEVTSV 204
T + G LGE ++L+ + N + G GR + IAL++P+ F+ R L GE +
Sbjct: 200 ITPLLGSILGEGFYQLTRYIQRNEGKLFGSLFLGRLI-IALMDPIGFIIRDLGLGEALGI 258
Query: 205 ASR 207
++
Sbjct: 259 YNK 261
>gi|6822150|emb|CAB71023.1| hypothetical protein [Helicobacter pylori]
Length = 240
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 68/145 (46%), Gaps = 17/145 (11%)
Query: 41 PKRPWRAIGKTIG---VNLAVWGFDHFIM--------NEDFADISWQTIKSNFQTGFGWD 89
P W+ +G +IG V+L + ++M E F SW N + G D
Sbjct: 82 PNSRWKYLGTSIGILGVSLVIGIVGLYLMPESVTNWDREKFGVKSWF---ENVRMGPKLD 138
Query: 90 NDKFVTNLFAHPYHGSLYFNAARANGLSFRHSAPFAFFGS-LMWELLMEN--EPPSINDL 146
ND F+ N HPY G++Y+ R G + SA F+F S L WE +E E PS DL
Sbjct: 139 NDSFIFNEILHPYFGAMYYMQPRMAGFGWMASAFFSFITSTLFWEYGLEAFVEVPSWQDL 198
Query: 147 CATTIGGIALGEMGHRLSDLLIDNR 171
T + G LGE ++L+ + N+
Sbjct: 199 VITPLLGSILGEGFYQLTRYIQRNQ 223
>gi|157376731|ref|YP_001475331.1| hypothetical protein Ssed_3599 [Shewanella sediminis HAW-EB3]
gi|157319105|gb|ABV38203.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3]
Length = 460
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 59/223 (26%), Positives = 90/223 (40%), Gaps = 27/223 (12%)
Query: 81 NFQTGFGWDNDKFVTNLFAHPYHGSLYFNAARANGLSFRHSAPFAF---FGSLMWELLME 137
N +G WD D F N HPY G +Y+ AAR +G +R F + + WE +E
Sbjct: 121 NVSSGPVWDRDNFAINYIGHPYFGGVYYQAARKSG--YRQWDAFMYSFMMSTFYWEYGVE 178
Query: 138 --NEPPSINDLCATTIGGIALGEMGHRLSDLLIDNRTTGWERTG-REVAIALINPMRFLN 194
E PSI DL T + G GE + T W G + A+ ++P+ L
Sbjct: 179 AFAEVPSIQDLVVTPVLGWVYGEWAFNKEREIRQRGGTVWGSEGWGDTALFFLDPIDSLG 238
Query: 195 R----------LTAG----EVTSVASRSGQIFQSVPINI--VVDAGFRFLADKRHARTGA 238
R + AG +T V +G+I + V + + AG + RH
Sbjct: 239 RGVNTLFGDDVVKAGTGYLSMTEVPLDNGRIDKQVQFQVQYAIGAGKANESVSRHQSANN 298
Query: 239 TALTLNLRFDY---GDPFRSETFSPYDFFQFKAGLSFSESQPL 278
A ++ D+ G S SP + + ++G + S S L
Sbjct: 299 KAGSVEDPVDFGIVGISIGSSYMSPGEVWNLESGWAPSASVGL 341
>gi|42523741|ref|NP_969121.1| hypothetical protein Bd2288 [Bdellovibrio bacteriovorus HD100]
gi|39575948|emb|CAE80114.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100]
Length = 366
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 69/156 (44%), Gaps = 25/156 (16%)
Query: 67 NEDFADI--SWQTIKSNFQTGFGWDNDKFVTNLFAHPYHGSLYFNAARANGLSFRHSAPF 124
E D+ +WQ N + G D D + N HPY G++Y+ AR G+ P
Sbjct: 189 KEKMKDLGKNWQ---ENVKEGPVMDKDDWAINYIGHPYSGAIYYQVARHAGV-----GPM 240
Query: 125 AFFG------SLMWELLME--NEPPSINDLCATTIGGIALGEMGHRLSDLLIDNRTTGW- 175
FG + WE +E E PSI DL T I G +GE+ +R + N W
Sbjct: 241 GSFGYSVLMSTFFWEYGVEAFAEKPSIQDLFITPIIGSIMGELFYRAEMKIQKNGGKVWG 300
Query: 176 -ERTGREVAIALINPM----RFLNRLTAGEVTSVAS 206
++ G V I L+NPM +N+L +V AS
Sbjct: 301 SKKIG-SVLIVLMNPMGAFSDQMNKLFGSKVIKSAS 335
>gi|127512116|ref|YP_001093313.1| hypothetical protein Shew_1184 [Shewanella loihica PV-4]
gi|126637411|gb|ABO23054.1| hypothetical protein Shew_1184 [Shewanella loihica PV-4]
Length = 541
Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 54/112 (48%), Gaps = 7/112 (6%)
Query: 84 TGFGWD-NDKFVTNLFAHPYHGSLYFNAARANGLSFRHSAPFAFFGSLMWELLME-NEPP 141
TG W +D + + H Y G+LY+ A R +G ++ S F S +WE+ E E
Sbjct: 127 TGASWKYDDNDIGMNWGHAYAGALYYQAFRNHGFNYYESTLGTFAASTIWEVFAEYKEVV 186
Query: 142 SINDLCATTIGGIALGEMGHRLSDLLIDNRTTGWERTGREVAIALINPMRFL 193
SIND TT GG LGE + S++L GW TG L NP R L
Sbjct: 187 SINDQIVTTWGGAVLGESLFQFSEMLAAKE--GWLPTGLSY---LFNPSRTL 233
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
Posted date: May 10, 2008 4:54 AM
Number of letters in database: 884,634,002
Number of sequences in database: 2,620,852
Database: /apps/blastdb/nr.01
Posted date: May 10, 2008 4:52 AM
Number of letters in database: 976,814,986
Number of sequences in database: 2,761,530
Database: /apps/blastdb/nr.02
Posted date: May 10, 2008 4:46 AM
Number of letters in database: 360,829,861
Number of sequences in database: 1,132,722
Lambda K H
0.324 0.139 0.429
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,198,901,552
Number of Sequences: 6515104
Number of extensions: 95768063
Number of successful extensions: 170080
Number of sequences better than 1.0e-04: 29
Number of HSP's better than 0.0 without gapping: 15
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 170040
Number of HSP's gapped (non-prelim): 33
length of query: 494
length of database: 2,222,278,849
effective HSP length: 138
effective length of query: 356
effective length of database: 1,323,194,497
effective search space: 471057240932
effective search space used: 471057240932
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 124 (52.4 bits)