BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= TF3061
(373 letters)
Database: nr
5,470,121 sequences; 1,894,087,724 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
gi|20803831|emb|CAD31409.1| HYPOTHETICAL PROTEIN [Mesorhizo... 203 1e-50
gi|114707920|ref|ZP_01440813.1| hypothetical protein FP2506... 185 3e-45
gi|83956338|ref|ZP_00964764.1| hypothetical protein NAS141_... 175 4e-42
gi|85372989|ref|YP_457051.1| hypothetical protein ELI_00810... 168 7e-40
gi|119952797|ref|YP_950396.1| hypothetical protein AAur_pTC... 156 2e-36
gi|116672174|ref|YP_833107.1| hypothetical protein Arth_363... 149 3e-34
gi|152984164|ref|YP_001348405.1| hypothetical protein PSPA7... 142 3e-32
>gi|20803831|emb|CAD31409.1| HYPOTHETICAL PROTEIN [Mesorhizobium loti]
Length = 372
Score = 203 bits (517), Expect = 1e-50, Method: Composition-based stats.
Identities = 141/382 (36%), Positives = 210/382 (54%), Gaps = 23/382 (6%)
Query: 1 MNSNLANQLLASIMKWDAPTLASERAALEFMGSMKYDAYDRYMPGMRFMSSLVQWLNDIK 60
M +LA LLA IM+W A+ERA LE S KYD Y ++ PG RF+ SL WL
Sbjct: 1 MRKDLAESLLAKIMEWSDEEKAAERAYLESFASYKYDEYQQFSPGKRFIESLALWLGQFT 60
Query: 61 E-EDRDKAYKFIKEKLVFISSMQMNYLVDLLYDSKIRPILLDMATVETDMPSYKRSSKVV 119
E+R +AY F++++L+FIS+ +MN+LV+L + + IRPIL+ E + +K + V
Sbjct: 61 GGEERREAYNFVRKRLLFISTDEMNHLVELTFPTIIRPILIADTATELGVDPHKVKAIVD 120
Query: 120 RTRFEIEKRSALVIGLSDGAHTDILRRS--AGFNNEQVLTNYYPDGKKLEDMLDELRKDE 177
+ R LV+GLSDGA TD RR+ +NEQV Y K E M+ +LRK
Sbjct: 121 TVEYRTRLRQTLVLGLSDGARTDWFRRANPQEMSNEQVFHAYDVSDPKSEGMVQDLRKHL 180
Query: 178 KLKVIENPF-----FRRIFLIDDFTASGKSFIRFDESDRKYHGKLKRIIDELCIKKGQEI 232
+ P FR I L+DDFT SG SFIR ++ + GK+ +I+ L +K G
Sbjct: 181 TKILGREPHDHEARFRYIVLLDDFTGSGTSFIR-EDGQGGWTGKIAKIVSGL-MKAGNLG 238
Query: 233 EHLSYLLNPEQKIQIDILFCIATEKARKNIKDSLGNYLKSVKLQDKVEFNIHIVQILEDK 292
+ ++ ++I I+ +A ++A ++I+ L K + VEF +V L +
Sbjct: 239 DAIA-----NSGVKIIIILYVAGDQAIEHIESRLP---KLTFGKGTVEF--EVVHKLGET 288
Query: 293 LSIDIKTDEDLVKLLKKDKHFVKECVISKSYKVGKNDNPWLGFDECALPVVLAHNTPNNS 352
+D +D ++ L+ D++F + S KVG + GF C LP+VLAHNTPNNS
Sbjct: 289 TPLDDASDSPILGLVADDRYFDDDADDEHS-KVGGTSKRY-GFANCRLPLVLAHNTPNNS 346
Query: 353 LPIIW-QDAERFHGLFPRISRH 373
+ ++W +D + GLFPR+SRH
Sbjct: 347 IYLLWAEDDQSVRGLFPRVSRH 368
>gi|114707920|ref|ZP_01440813.1| hypothetical protein FP2506_00405 [Fulvimarina pelagi HTCC2506]
gi|114536698|gb|EAU39829.1| hypothetical protein FP2506_00405 [Fulvimarina pelagi HTCC2506]
Length = 379
Score = 185 bits (470), Expect = 3e-45, Method: Composition-based stats.
Identities = 128/398 (32%), Positives = 188/398 (47%), Gaps = 45/398 (11%)
Query: 1 MNSNLANQLLASIMKWDAPTLASERAALEFMGSMKYDAYDRYMPGMRFMSSLVQWLNDIK 60
MN +LA ++L IM W+ E L M +KYD Y + GMRF+ SL WL +
Sbjct: 1 MNQDLALKILGQIMNWNDDRARQEFQWLRLMARLKYDGYRDFQAGMRFIESLATWLQQFE 60
Query: 61 EEDRDKAYKFIKEKLVFISSMQMNYLVDLLYDSKIRPILLDMATVETDMPSYK-RSSKVV 119
+R+ AY F++ LV++ + +M LVD Y +R L+ E MP Y+ +
Sbjct: 61 PHERETAYAFVRHTLVYLGAGEMQRLVDQFYPRTVRDRLVRTVAAERGMPPYRVLADPDA 120
Query: 120 RTRFEIEKRSALVIGLSDGAHTDILRRS-AG-FNNEQVLTNYYPDGKKLEDMLDELRKDE 177
R E +R L +GLSDGA D +R + AG NNEQ++ D +K D+LD LR +
Sbjct: 121 RAAVERLRRQTLFMGLSDGARIDTIRHANAGLLNNEQLVQGTQVDTEKWRDLLDNLRSE- 179
Query: 178 KLKVIENPFFRRIFLIDDFTASGKSFIRFDESDRKYHGKLKRIIDELCIKKGQEIEHLSY 237
V FR ++L+DDF +G SF+R+DE K+ GKL R D
Sbjct: 180 --LVDPEARFRLVYLVDDFAGTGTSFVRYDEKKEKWKGKLLRFRDS-------------- 223
Query: 238 LLNPEQKIQIDILF------CI----ATEKARKNIKDSLGNYLKSVKLQDKVEFNIHIVQ 287
+LN + ++ D LF CI A+ A + I+D L + +
Sbjct: 224 VLNAMEALEGDALFEDCWELCIHHYVASSAAAQAIEDRLARTAEVFDAGWARAMHASFGM 283
Query: 288 ILEDKLSIDIKTD--EDLVKLLKKDKHFVKECVISKSYKVGKNDNPWLGFDECALPVVLA 345
+L L ID+ +D +KL + + + +K VG + LG+ CALP+VL
Sbjct: 284 VLPPDLPIDVVPGRYDDFLKLTQS---YYDPKIRTKHTDVGGATHLGLGYGACALPLVLD 340
Query: 346 HNTPNNSLPIIWQ----------DAERFHGLFPRISRH 373
HNTPNN++ ++W DA LF R RH
Sbjct: 341 HNTPNNAVALLWAETDGGDREGVDAPAMRPLFRRRQRH 378
>gi|83956338|ref|ZP_00964764.1| hypothetical protein NAS141_04393 [Sulfitobacter sp. NAS-14.1]
gi|83839443|gb|EAP78625.1| hypothetical protein NAS141_04393 [Sulfitobacter sp. NAS-14.1]
Length = 378
Score = 175 bits (444), Expect = 4e-42, Method: Composition-based stats.
Identities = 114/371 (30%), Positives = 189/371 (50%), Gaps = 30/371 (8%)
Query: 1 MNSNLANQLLASIMKWDAPTLASERAALEFMGSMKYDAYDRYMPGMRFMSSLVQWLNDIK 60
MN +LA ++++ IM+WD E L+ M +KYD Y + GMRF SL WL
Sbjct: 1 MNQDLALRVMSDIMQWDDDESRKEFRWLKLMARLKYDGYRDFQAGMRFTESLATWLQQFD 60
Query: 61 EEDRDKAYKFIKEKLVFISSMQMNYLVDLLYDSKIRPILLDMATVETDMPSYK-RSSKVV 119
+E+R AY+F+KE++V++ ++ LV+ + + IR ++ + Y ++
Sbjct: 61 QEERKDAYRFVKERMVYVGPGEVRRLVEQFFPNTIRQRIVQTVASNLGIKPYTVLTNPDA 120
Query: 120 RTRFEIEKRSALVIGLSDGAHTDILRRS--AGFNNEQVLTNYYPDGKKLEDMLDELRKDE 177
+ R LV+GLSDGA DI+R + +NEQ++ D +K +D+L LR+D
Sbjct: 121 AAAIKRLSRQTLVLGLSDGARMDIVRHANVGRLSNEQLVLAPQIDTEKWKDLLKNLRED- 179
Query: 178 KLKVIENP--FFRRIFLIDDFTASGKSFIRFDESDRKYHGKLKRIIDELCIKKGQEIEHL 235
+E+P F+ I+L+DDF +G SF+R+ E D+K+ GKL R L
Sbjct: 180 ----LEDPDALFKIIYLVDDFAGTGTSFLRYKEKDKKWSGKLNRFRTSL----------F 225
Query: 236 SYLLNPE--QKIQIDILFC----IATEKARKNIKDSLGNYLKSVKLQD-KVEFNIHIVQI 288
+ + +PE + D C +AT A+ + S K +K ++ E ++ +
Sbjct: 226 NAISDPEVGNIVAPDWQLCAHHYMATANAKDKMIASENTARKDMKHKNWPEEVHLSFAMV 285
Query: 289 LEDKLSIDIKTDEDLVKLLKKDKHFVKECVISKSYKVGKNDNPWLGFDECALPVVLAHNT 348
++KL I TD+ L K + + +K VG + +G+ CALP+VL HNT
Sbjct: 286 FDEKLPISASTDDAFTALANK---YYDPVIETKHTAVGGVKHLGMGYGGCALPIVLEHNT 342
Query: 349 PNNSLPIIWQD 359
PNNS+ ++W +
Sbjct: 343 PNNSVALLWAE 353
>gi|85372989|ref|YP_457051.1| hypothetical protein ELI_00810 [Erythrobacter litoralis HTCC2594]
gi|84786072|gb|ABC62254.1| hypothetical protein ELI_00810 [Erythrobacter litoralis HTCC2594]
Length = 393
Score = 168 bits (425), Expect = 7e-40, Method: Composition-based stats.
Identities = 125/404 (30%), Positives = 194/404 (48%), Gaps = 53/404 (13%)
Query: 1 MNSNLANQLLASIMKWDAPTLASERAALEFMGSMKYDAYDRYMPGMRFMSSLVQWLNDIK 60
M + LA L+A++M WD +E A L M SMKYD Y + G+RF+ SLV WL +
Sbjct: 11 MINALALNLIATVMNWDNERATAEYAWLRLMSSMKYDGYSDFRAGVRFLESLVSWLRQFE 70
Query: 61 EEDRDKAYKFIKEKLVFISSMQMNYLVDLLYDSKIRPILLDMATVETDMPSYK--RSSKV 118
++DR+ AY F++E++V+IS+ +M +++ + P L E + Y+ R+
Sbjct: 71 QQDREAAYAFVQERMVYISTAEMQRIIENFIPETVTPYLRKAVAAELGIKPYEVWRTQAG 130
Query: 119 VRTRFEIEKRSALVIGLSDGAHTDILRR--SAGFNNEQVLTNYYPDGKKLEDMLDELRKD 176
+ E ++R L +GLSDG+ D+LRR S + EQV+ D +K + + +LR +
Sbjct: 131 AKAFLEHQRR-CLFVGLSDGSRIDVLRRANSGRLSQEQVVPMLNVDNEKWKGLGKDLRGE 189
Query: 177 EKLKVIENPFFRRIFLIDDFTASGKSFIRFDESDRKYHGKL-----------KRIIDELC 225
+ ++ F+ ++LIDDFTASG +FIRF + + K GKL K + D+
Sbjct: 190 ----LGDDARFQDVYLIDDFTASGTTFIRFPDGEPK--GKLAKFEQNAQEARKALKDDFP 243
Query: 226 IKKGQEIEHLSYLLNPEQKIQIDILFCIATEKARKNIKDSLGNYLKSVKLQDKVEFNIHI 285
+ G + H+ + ++ Q Q L E K S G E +I
Sbjct: 244 LADGYTL-HIHHYVSTAQACQ--ALEGRVAEAGEKLTDPSFG------------EAHITE 288
Query: 286 VQILEDKLSIDIKTDED-LVKLLKKDKHFVKEC----------VISKSYKVGKNDNPWLG 334
L L I ED +V + D+ + C + K K + G
Sbjct: 289 GMRLPAVLPIGTSGTEDAMVAVAPSDEPYFGLCGTYYDHNLFERLEKHCKEAGQVDMRYG 348
Query: 335 FDECALPVVLAHNTPNNSLPIIWQDAERFHG-----LFPRISRH 373
+ CALP+VL HNTPNNS+PI+W + + G LF R RH
Sbjct: 349 YANCALPLVLEHNTPNNSIPILWAETQGKLGHPMRPLFRRRDRH 392
>gi|119952797|ref|YP_950396.1| hypothetical protein AAur_pTC20250 [Arthrobacter aurescens TC1]
gi|119951927|gb|ABM10836.1| conserved hypothetical protein [Arthrobacter aurescens TC1]
Length = 371
Score = 156 bits (394), Expect = 2e-36, Method: Composition-based stats.
Identities = 114/386 (29%), Positives = 182/386 (47%), Gaps = 33/386 (8%)
Query: 1 MNSNLANQLLASIMKW-DAPTLASERAALEFMGSMKYDAYDRYMPGMRFMSSLVQWLNDI 59
M + LA QLL + D L L + +KYD Y Y PG+RF+ SL WL++
Sbjct: 1 MKALLAEQLLTEVTGIEDLGRLTELTQRLRTLAGIKYDDYGGYSPGVRFIESLAVWLSEF 60
Query: 60 KEEDRDKAYKFIKEKLVFISSMQMNYLVDLLYDSKIRPILLDMATVETDMPSYKRSSKVV 119
++EDR+ A FI +L +IS+ ++++LV +Y +RP LL A E +P +
Sbjct: 61 RQEDRETALNFILNRLTYISATELDHLVSTVYKDMLRPRLLRAAAGELGIPEWAVKKIAS 120
Query: 120 RTRFEIEKRSALVIGLSDGAHTDILRRSAGFNNEQVLTNYYPDGKKLEDMLDELRKD-EK 178
F +R L++G+SDGA D LRR++ + EQ D +K DM +L + E
Sbjct: 121 SPEFISLQRRTLILGMSDGARLDKLRRASPLSTEQFHLVSTVDEEKASDMSTKLAEALES 180
Query: 179 LKVIENPFFRRIFLIDDFTASGKSFIRFDESDRKYHGKLKRIIDELCIKKGQEIEHLSYL 238
K+ P F + ++DDF+ SG + +R D + GKL +I + K S +
Sbjct: 181 QKLPGEPKFSTVVVVDDFSGSGTTMLRPD--GEGWKGKLPKIYKHVSNLK------TSGV 232
Query: 239 LNPEQKIQIDILFCIATEKARKNIKDSLGNYLKSVKLQDKVEFNIHIVQILEDKLSIDIK 298
+ + ++ + + K++ + + Y + E ++ ED + +
Sbjct: 233 VAEDARVIVVLYLLTKLAKSQLTSRMAAAGYAEP-------EVSLVAAHTFEDDFPLTEE 285
Query: 299 TDEDLVKLLKKDKHFVKECVISKSYKVGKNDNPWLGFDECALPVVLAHNTPNNSLPIIWQ 358
DE KL +F +E + S S G+ + GF ALP+VL HN PNN+ PIIW+
Sbjct: 286 ADEAFWKLCS--DYFRQEWINSHSGLAGQLSH---GFGGSALPLVLHHNAPNNAPPIIWK 340
Query: 359 D-----------AERFHGLFPRISRH 373
D + G+FPR RH
Sbjct: 341 DESIESNNRRDSQREWIGVFPRHERH 366
>gi|116672174|ref|YP_833107.1| hypothetical protein Arth_3632 [Arthrobacter sp. FB24]
gi|116612283|gb|ABK05007.1| conserved hypothetical protein [Arthrobacter sp. FB24]
Length = 371
Score = 149 bits (376), Expect = 3e-34, Method: Composition-based stats.
Identities = 112/388 (28%), Positives = 182/388 (46%), Gaps = 37/388 (9%)
Query: 1 MNSNLANQLLASIMK-WDAPTLASERAALEFMGSMKYDAYDRYMPGMRFMSSLVQWLNDI 59
M + LA QLLA + D L L + +KYD Y Y PG+RF+ SL WL++
Sbjct: 1 MKALLAEQLLAEVTGIQDLAHLTEITQRLRTLADIKYDDYGGYTPGVRFIESLAVWLSEF 60
Query: 60 KEEDRDKAYKFIKEKLVFISSMQMNYLVDLLYDSKIRPILLDMATVETDMPSYKRSSKVV 119
K+EDR+ A F+ +L +IS+ ++++LV +Y +RP LL E +PS+
Sbjct: 61 KQEDRETALNFVLHRLAYISATELDHLVSTVYKDMLRPRLLKAVAAELGIPSWSVKKIAS 120
Query: 120 RTRFEIEKRSALVIGLSDGAHTDILRRSAGFNNEQVLTNYYPDGKKLEDMLDELRKD-EK 178
F +R L++G+SDGA D LRR++ + EQ D +K DM +L + E
Sbjct: 121 SPEFLSRQRRTLILGMSDGARLDKLRRASPLSTEQFHLVSTVDDEKASDMSAKLAEALES 180
Query: 179 LKVIENPFFRRIFLIDDFTASGKSFIRFDESDRKYHGKLKRIIDELCIKKGQEIEHLSYL 238
+ + F + ++DDF+ SG + +R E D + GKL +I + + K +
Sbjct: 181 MNLPGEAKFSLVIVVDDFSGSGTTMLR-REGD-GWKGKLPKIHNHVTNLKTSGV------ 232
Query: 239 LNPEQKIQIDILFCIATEKARKNIKDSLGN--YLKSVKLQDKVEFNIHIVQILEDKLSID 296
+ + +L + T+ A+ + + YL+ E ++ +D +
Sbjct: 233 --IAEDAHVIVLLYLLTKLAQSQLAARMAEAGYLEP-------EVSLVAAHTFDDDFPLT 283
Query: 297 IKTDEDLVKLLKKDKHFVKECVISKSYKVGKNDNPWLGFDECALPVVLAHNTPNNSLPII 356
+D + KL +F +E + S G + GF ALP+V+ HN PNN+ PII
Sbjct: 284 ENSDPEFWKLCS--AYFREEWDNAHSGLAGDLSH---GFGGSALPLVIHHNAPNNAPPII 338
Query: 357 WQDA-----------ERFHGLFPRISRH 373
W+D + G+FPR RH
Sbjct: 339 WKDESIEGRKSDDSLSEWLGVFPRHERH 366
>gi|152984164|ref|YP_001348405.1| hypothetical protein PSPA7_3045 [Pseudomonas aeruginosa PA7]
gi|150959322|gb|ABR81347.1| conserved hypothetical protein [Pseudomonas aeruginosa PA7]
Length = 372
Score = 142 bits (358), Expect = 3e-32, Method: Composition-based stats.
Identities = 112/393 (28%), Positives = 186/393 (47%), Gaps = 45/393 (11%)
Query: 1 MNSNLANQLLASIMKWDAPTLASERAALEFMGSMKYDAYDRYMPGMRFMSSLVQWLNDIK 60
M S++A +LLA + W + E AL+ M + KYD Y + P RF +L+ WL+
Sbjct: 1 MKSSIALRLLADQLAWTDEEVTEEFPALQLMVTHKYDGYQGFQPASRFHVALLNWLSQFA 60
Query: 61 E-EDRDKAYKFIKEKLVFISSMQMNYLVDLLYDSKIRPILLDMATVETDMPSYKR-SSKV 118
E R A+ FIK++L++I+ +M++LV L+ + R ++ E + ++ +
Sbjct: 61 SVEQRRLAFNFIKDRLIYITQREMHHLVSLMMPTLDR-LMRKQVAAELGIQFFETWTDTA 119
Query: 119 VRTRFEIEKRSALVIGLSDGAHTDILRR--SAGFNNEQVLTNYYPDGKKLEDMLDELRK- 175
R + +R L +GLSDGA D+ RR +NEQV+ +K ++DEL K
Sbjct: 120 AERRLGLLRRRTLFVGLSDGARIDVFRRYNEGLISNEQVVPFSEMSEQKWTSLVDELGKW 179
Query: 176 DEKLKVIENP-FFRRIFLIDDFTASGKSFIRFDESDRKYHGKLKRIIDELCIKKGQEIEH 234
+K + P F + L+DDF SG S +RFDE ++++ GK+ + + K G +
Sbjct: 180 LKKHDYKDEPAIFEAVCLVDDFAGSGASLLRFDEKEQRWKGKINKFYEANVGKIGTSLA- 238
Query: 235 LSYLLNPEQKIQIDILFCIATEKARKNIKDSLGNYLKSVKLQDKVEFNIHIVQILEDKLS 294
+ Q+ +A+ +A + I L Y + + F +L +
Sbjct: 239 --------LQCQLYAHHHLASHQAEQTIAQRLTGYSAN---HTEFCFKSTFSYVLPSDIV 287
Query: 295 IDIKTD-EDLVKLLKKDKHFVKECVISKSYKVGKNDN-----PWLGFDECALPVVLAHNT 348
I +T+ +LV+L +K C Y G D+ W G+ C LP++L HNT
Sbjct: 288 IGEQTEPAELVQL-------IKTC-----YDPGIEDDHLGSAAWFGYSGCGLPLILEHNT 335
Query: 349 PNNSLPIIWQDA--------ERFHGLFPRISRH 373
PNNS+ ++W D+ + LF R RH
Sbjct: 336 PNNSIALLWADSSGARQSCPHQMKPLFARRKRH 368
Database: nr
Posted date: Sep 17, 2007 11:41 AM
Number of letters in database: 999,999,834
Number of sequences in database: 2,976,859
Database: /nucleus1/users/jsaw/ncbi/db/nr.01
Posted date: Sep 17, 2007 11:48 AM
Number of letters in database: 894,087,890
Number of sequences in database: 2,493,262
Lambda K H
0.321 0.138 0.398
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,357,972,368
Number of Sequences: 5470121
Number of extensions: 57695956
Number of successful extensions: 160893
Number of sequences better than 1.0e-05: 7
Number of HSP's better than 0.0 without gapping: 6
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 160865
Number of HSP's gapped (non-prelim): 8
length of query: 373
length of database: 1,894,087,724
effective HSP length: 134
effective length of query: 239
effective length of database: 1,161,091,510
effective search space: 277500870890
effective search space used: 277500870890
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 130 (54.7 bits)
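
The length and search-space figures in the statistics above can be cross-checked with a few lines of arithmetic. This is a minimal sketch, not part of the BLAST output. It assumes the usual relationship in which the effective HSP length is subtracted once from the query and once per database sequence; the variable names are illustrative, and all input values are copied from the report.

```python
# Inputs copied from the report's statistics section.
query_length = 373
db_letters = 1_894_087_724
db_sequences = 5_470_121
effective_hsp_length = 134

# Per-volume totals from the two "Database:" entries in the footer.
volume_letters = [999_999_834, 894_087_890]
volume_sequences = [2_976_859, 2_493_262]

# Assumed relationship: subtract the effective HSP length from the query
# once, and from the database once per sequence.
effective_query_length = query_length - effective_hsp_length
effective_db_length = db_letters - db_sequences * effective_hsp_length
effective_search_space = effective_query_length * effective_db_length

print(effective_query_length)   # 239
print(effective_db_length)      # 1161091510
print(effective_search_space)   # 277500870890

# The two volumes should add up to the totals in the report header.
print(sum(volume_letters) == db_letters)       # True
print(sum(volume_sequences) == db_sequences)   # True
```

Under that assumption the computed values match the reported effective query length (239), effective database length (1,161,091,510) and effective search space (277500870890).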