Extensive domain shuffling in transcription regulators of DNA viruses and implications for the origin of fungal APSES transcription factors.

Lakshminarayan M. Iyer, L. Aravind and E.V. Koonin *

* Address for correspondence: Eugene Koonin (koonin@ncbi.nlm.nih.gov)

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA





Comparative analysis of the protein sequences encoded in the genomes of three families of large DNA viruses that replicate, completely or partly, in the cytoplasm of eukaryotic cells (poxviruses, asfarviruses, and iridoviruses) and phycodnaviruses that replicate in the nucleus reveals 9 genes that are shared by all of these viruses and 22 more genes that are present in at least three of the four compared viral families. Although orthologous proteins from different viral families typically show weak sequence similarity, because of which some of them have not been identified previously, at least five of the conserved genes appear to be synapomorphies (shared derived characters) that unite these four viral families, to the exclusion of all other known viruses and cellular life forms. Cladistic analysis with the genes shared by at least two viral families as evolutionary characters supports the monophyly of poxviruses, asfarviruses, iridoviruses, and phycodnaviruses. The results of genome comparison allow a tentative reconstruction of the ancestral viral genome and suggest that the common ancestor of all of these viral families was a nucleocytoplasmic virus with an icosahedral capsid, which encoded complex systems for DNA replication and transcription, a redox protein involved in disulfide bond formation in virion membrane proteins, and probably inhibitors of apoptosis. The conservation of the disulfide-oxidoreductase, a major capsid protein, and two virion membrane proteins indicates that the odd-shaped virions of poxviruses have evolved from the more common icosahedral virion seen in asfarviruses, iridoviruses, and phycodnaviruses.
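The gene counts quoted above (genes shared by all four families, and further genes present in three of the four) amount to a tally over a gene-by-family presence table. A minimal sketch of that tally follows, assuming a hypothetical input that maps each ortholog cluster to the set of families encoding it. The function name and the toy gene names are placeholders, not data from this analysis.

```python
FAMILIES = ("poxviruses", "asfarviruses", "iridoviruses", "phycodnaviruses")


def classify_by_distribution(presence):
    """Split genes into those found in every family and those missing from exactly one.

    `presence` maps a gene (ortholog cluster) name to the set of viral
    families that encode it. Families outside FAMILIES are ignored.
    """
    core, near_core = [], []
    for gene, fams in sorted(presence.items()):
        n = len(set(fams) & set(FAMILIES))
        if n == len(FAMILIES):
            core.append(gene)
        elif n == len(FAMILIES) - 1:
            near_core.append(gene)
    return core, near_core


# Hypothetical toy input: names and distributions are illustrative only.
toy = {
    "gene_A": set(FAMILIES),
    "gene_B": {"poxviruses", "asfarviruses", "iridoviruses"},
    "gene_C": {"poxviruses", "iridoviruses"},
}
core, near_core = classify_by_distribution(toy)
print("all four families:", core)
print("three of four families:", near_core)
```

The same presence table, restricted to genes found in at least two families, could serve as the character matrix for a cladistic analysis; that step is not sketched here.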




------------------------------KILAN------------------------------------------------------------------------------
# kilAN + kilAC
-----------------------------------------------------
125404          KILA_BPP1     kilA N 1-128      + 143-266 (kilAC)

# kilA N + Bro-aC
 ----------------------------------------------------
9964424                     AMV110


9964414                     AMV100
9964341                     AMV027
9964550                     AMV236
9964470                     AMV156
15078718                    CIV006L 231-352 (CIV029)
9634794                     FPV124 N1R/p28 gene family prote.  + BROC (217-271)
2407299                     17K ORF [Heliothis armigera.  + BROC.
9964426                     AMV112 + BROC
9964338                     AMV024  + BROC

# kilAN+ Orf11D3
----------------------------

9635595                     Orf11 [Pseudomonas phage D3] >gi|889...
13559861                    unknown [Bacteriophage HK620] >gi|1.. + Orf11D3 C-terminus (105-161)



# kilAN + T5orf172
------------------------------
15079027  CIV315L +  Bro-E Cterminus 96-201


# kilA N (the Cs are distinct and not conserved)
  -----------
11281012             hypothetical protein NMB0900 + C- nothing
11345968             phage-related protein XF2294 + C-nothing
11290039             hypothetical protein NMA1544 [imported] .... C distinct but not conserved
9634745              ORF FPV075 N1R/p28 gene family prote..... C-distinct but not conserved
9634833              C- distinct but not conserved FPV163
9634829              ORF FPV159 N1R/p28 gene family prote..  C-distinct but not conserved
9634825              ORF FPV155 N1R/p28 gene family prote.... C distinct but not conserved... low complexity
9634906              ORF FPV236 N1R/p28 gene family prote..C distinct but not conserved... low complexity
9634918              ORF FPV248 N1R/p28 gene family prote..--- little extension.. nothing
9634831              ORF FPV161 N1R/p28 gene family prote..----little extension.. nothing
1777419              ORF4 [Fowlpox virus].........little extension.. nothing
9964446             AMV132- C matches low complexity region C -terminal to MSV199-T5orf172


#  kilaN + CIV029R/BROC
 -----------------
  15079025        313L  10-130 (kilAN) +   130-237 (MSV199) 237-353 (CIV029R/BROC)

#P63C   + kilAN
 ------------
9634189              Gp73 [Bacteriophage HK97] >gi|690161.. kilaN inserted into 9632501 (or 4499795) 933W orf12

#kilAN + RING
---------------
 9634827              ORF FPV157 N1R/p28 gene family prote.... 2 RINGS or 1 zinc + RING
 9634820              ORF FPV150 N1R/p28 gene family prote..
 9633890              gp143R [Rabbit fibroma virus] >gi|46..
 12085126             143R protein [Yaba-like disease vir..
 9633779              m143R [Myxoma virus] >gi|6523998|gb|..
 6682986              Yb-C4R [Yaba monkey tumor ..

----------------------------------MSV199 like--------------------------------------------------------------------------------------------------------------------------
#  MSV199like solo
 ------------
 9631447             MSV199 fragment of MSV198
                     CIV200R fragment

# MSV199like motif + UVRC
--------------------------------
15078859             CIV146R  1-118 (UVRC domain)  143-243 (MSV199like motif)


# N-MSV199like motif + CIV029R/BROC like
 --------------------
 15079179            CIV468L 1-177 + C  177-376 (CIV029R/BROC)
 15079099            CIV388R 1-175 + C  226-344 (CIV029)
 15078923            CIV211L 1-180 + C 259-381 (CIV029R/BROC)
 15078924            CIV212L 30-155 + C 238-360 (CIV029R/BROC)
 15078950            CIV238R  N+ 86-230 (MSV199like) + C        315-436 (29R/BROC)
 15078732            CIV019R  N + 136-274 (MSV199like)
 15078861            148R _CIV 61-191 (MSV199like???) + C _ 265-330 (414)        + BROC (330-414)

# MSV199 + C Bro-e C terminus
 --------------------------

9964508            AMV194  N? + 66-215 (MSV199like) + C Bro-e (252-358)
9631448            MSV198 1-155 (MSV199like) + C Bro-e (192-292)
9631453            MSV191  1-120 (MSV199like) + C
9964523             AMV209  N + 72-215 (MSV199like) + C Bro-e 257-356
9964521            AMV207  77-228 + C 270-369 Bro-eC
15079131          CIV420R  1-154 + C Bro-e (234-327) +
9631537          52-144(T5orf172 + MSV021 (MSV199like) 105-250 +


-------------------------------T5orf172---------------------------------------------------------------------------------------------------------------------------------

# BRO-e C-terminus Looks like a uvrC nuclease to me
  -------------------------------------------------
9631041           Ld-bro-f [Lymantria dispar nucleopol..  10-129 (solo)
281258            hypothetical protein - phage T5 >gi|579090..   65-158  probably solo
93750             hypothetical protein 172 - phage T5             65-158  probably solo
7474985           hypothetical protein yeeC - Bacillus subt..   N--nothing + 265-374
14194257          hypothetical pro..   N--nothing + 137-233 (Unidentified bacterium)
8346568|          phage P27                                                 N--nothing + 266-376
11345564          hypothetical protein NMB1132, NMB1170 [i..    2-73 + C low complexity
15079171          460R CIV                                      4-88



--------------------------------- BRON-------------------------------------------------------------------------------------------------------------------------
# P22ANT-N + BroA like N-terminus + P22ARC
  ------------------------------------------
9635550      P22-ant   130-207 + -199-272

# Bro-A N + T5orf172
 --------------------------
15079001           CIV 289L                               N-BroN (1-120)  186-299 Bro-e C
15078913           CIV 201R                               N-Bro N (1-188) 188-301 Bro-eC
13751084           (AJ309235) Bro-I protein [Bombyx mor..  1-111 (BRON) + 111-220
9630900            BRO-b [Bombyx mori nuclear polyhedro..  1-111 (BRON) + 111-218
9630956            BRO-e [Bombyx mori nuclear polyhedro..  1-111 (BRON) + 111-220
9631082            Ld-bro-k [Lymantria dispar nucleopol..  1-108 (BRON) + 108-217
9631117            Ld-bro-m [Lymantria dispar nucleopol. . 1-102 (BRON) + 137-233
9631452            ORF MSV194 ALI motif gene family pro..  1-100 (BRON) + 175-290
9631535            ORF MSV023 ALI motif gene family pro..   ~1-100 (BRON) + 145-257
12597544           Helicoverpa armigera nucleopo..   1-145 BRON+ 145-243
9635380            ORF130 [Xestia c-nigrum granulovirus..    1-84 BRON 84-197 C
9631042            Ld-bro-g [Lymantria dispar nucleopol... 17-100 (Bro-aN) + 100-222
13242588           Esv-1-117




# BroA-N + kilAC
---------------------------
13095813         bIL309       BRON + 137-247(kilAC)
1395130          LL-H _        BRON + 152-258 (kilAC)
1362213          2-139 (BRON) + 139-247(kilAC)..
1251473          prophage CP-933N  1-123 (BRON) + 117-229 (kilAC)
14246624         1-139 (BRON) + 139-252(kilAC)
14251162         BK5-T 1-138 (BRON) + 140-256 (kilAC)
9635686          phiPV83   1-143 (BRON) + 143-256 (kilAC)
1353522          ORF5_r1t   1-137 (BRON) + 139-255 (kilAC)
13622137         putative antirepressor - p...   6-92 +  kilAC

# BRON+ P63
--------------
>gi|15320633         p63 Bacteriophage Mx8 : Myxococcus xanthus

#BroA like N-terminus + P22ARC
------------------------------------------------------------
12514734         putative antire - 4-122+ 191-264
1175791          HI1418   11-124 + - 137-194

# Bro-aN
--------------------------------------------------------------------------------------------------
9964369              AMV055 [Amsacta moorei entomopoxviru...  solo (BRON)
9631040              Ld-bro-e [Lymantria dispar nucleopol... 1-82 (solo)
9631451              ORF MSV195 ALI motif gene family pro...  solo (BRON)
9631397              ORF MSV226 hypothetical protein [Mel...   3-95 (solo)
1395127              putative [Bacteriophage LL-H]   solo -- truncated?
12697190             putative antirepressor [N... 5-90 + C or probably solo
6599316              Broa-N solo
13623111             hypothetical protein - pha... 8-89 + C -- nothing
7480004              hypothetical protein SCGD3.15 - Streptomy... 17-120 + C nothing
11349554             hypothetical protein PA2423 [imported] -...  N-nothing 99-251 (bro) + C nothing
11349113             hypothetical protein PA1153 [imported] -... 1-139
9964576              AMV262 [Amsacta moorei entomopoxviru... 1-100 (BRON) + C nothing
9635312              ORF62 [Xestia c-nigrum granulovirus]... 28-121 + C nothing
#BRON duplication
------------------------
11068085            PxORF82 peptide [Plutella xylostell... 120-200(BRON), 250-325 (BRON)
13160526            F274292) unknown [Culex nigripalpus... 36-100(BRON), 149-256 (BRON)  + C nothing



# Bro-aN + BROC
-----
9799895            hypothetical protein [Antica... 1-112 + C                             \
93042              hypothetical protein ORF2, ptp-region [impo... 1-113 + C               |
10442572           38.7 kD-like pr.                          +295-390 (b                  |
347406             24 kDa ORF [Autographa califor... 45-139 + C +  147-206                       \
9627755            AcOrf-13 peptide [Autograph                 +215-326                          |
5565846            AcMNPV ORF13 homo.                        +107-207                            |
9627744            baculovirus repeated ORF [Autographa... 1-113        + C 133-          |
9629950            unknown [Orgyia pseudotsuga  BRON ?        +214-316                          |
9635364            ORF114 [Xestia c-nigrum granulovirus...  N-nothing 211-384 + C..427
9635326            ORF76 [Xestia c-nigrum granulovirus]...  N  + 143-236
9635409           ORF159 [Xestia c-nigrum granulovirus. Broa N 48-153  + 281-392 (BROC)
9631120            Ld-bro-n [Lymantria dispar nucleopol... 1-116 + C                      |
9631081            Ld-bro-j [Lymantria dispar nucleopol... 1-113 + C                     /
9631128           Ld-bro-p [Lymantria dispar nucleopol.  BRON 1-134    + 63-178 (BROC)
9630998           Ld-bro-a [Lymantria dispar nucleopol. Broa N 1-102   +216-327 (BROC)
9631113           Ld-bro-l [Lymantria dispar nucleopol. Broa N 1-108   +222-333 (BROC)
9631121           Ld-bro-o [Lymantria dispar nucleopol. Broa N 1-112   +205-316 (BROC)
9630999           Ld-bro-b [Lymantria dispar nucleopol. Broa N (1-113) +202-313 (BROC)
12597545           bro [Helicoverpa armigera nucleopo.  1-107 BRON + BRON (183-284)+391-502 (BROC)- duplication of BRON
13751087          (AJ309236) Bro-II protein [Bombyx mo. BRON (1-115)  +197-306 (BROC)
9630821            AcMNPV orf13 [Bombyx mori n    49-143 (BRON)+ 219-330                        |
13751089           Bro-III protein [Bombyx m... 1-114 + C                                 |
9630901           BRO-c [Bombyx mori nuclear polyhedro. BRON          +195-304 (BROC)
9630839           BRO-a [Bombyx mori nuclear polyhedro. BRON (1-115)  +195-304 (BROC)
9630955            BRO-d=AcMNPV orf2 [Bombyx mori nucle... 1-115 + C                      |
9635359           ORF109 [Xestia c-nigrum granulovirus. BRON 1-82     +189-296 (BROC)
7672865           bro-a [Spodoptera.                    BRON( 1-113)  +200-309  (BROC)
12597608           38.7kd [Helicoverpa armig                   + 292-382                        /
15213135           unknown [Epiphyas postvitt...    83  2e-15
9634234            ORF13 38.7kD [Spodoptera exigua nucl...                                |

9964371           AMV057                                             210-290 -(CIV029R/BROC)
9964491           AMV177                                             217-297 - (CIV029R/BROC)
9964489           AMV175                                             203-283- (CIV029R/BROC)

3510491          orf6Heliolithis BRON                            + C 190-271 (CIV029R/BROC)


# BROC solo
----------------
9630054            Orgyia pseudotsugata single. C-terminus solo-- NO BRO
15213228           unknown [Epiphyas postvitt...   146  9e-35 NO BRO-JUST THE C-terminal region and even that may be fragmented
9631089            LdOrf-122 peptide [Lymantria                + 97-176--solo
2760643          CIV029R and Bro-a C can be unified
15078742
11931724         DpAV4 (59-181)
11931708         DpAV4 (91-194)
11931709         DpAV4  (15-97)


Xylella specific family
BRON-BRON-BRON + C XF1559 (that has no bro)
------------------------------
11362500         phage-related protein XF2524 [imported] ... 32-122, 166-253, 283-371 (BRON) + C
11362477         phage-related protein XF0684 [imported] ... 6-96 (BRON) 140-226 (BRON) 256-344 (BRON) + C
11362484         phage-related protein XF1663 [imported] ... 12-120, 130-237 Only 2 Bro domains
11362478         phage-related protein XF0704 [imported] ...   17-104 + C
11362483         phage-related protein XF1645 [imported] ... N + 122-210 + C C XF0704
11362060         hypothetical protein XF2506 [imported] -... N 1-74 XF2129/XF1645 + 189-279 (BRON) + C XF0704

XF0704 + XF2129 = XF1645 and XF2506
XF1559+


#BRON- gp30 like
-----------------------
9633590          P43 [Bacteriophage APSE-1] >gi|61180...  1-96 + C
9630500          gp30 [Bacteriophage N15] >gi|7521545... 7-101 (BRON)+ C


# Broa-N -- C synapomorphic to this group
-----------------------------------------
9635310          ORF60 [Xestia c-nigrum granulovirus]... 1-114 + C + C
10442560         Orf60-like proti... 12-110 + C+C
12597590         bro [Helicoverpa armigera nucleopo...  12-110 + C + C
9635381          ORF131 [Xestia c-nigrum granulovirus... 1-93 + C +C
9631038          Ld-bro-c [Lymantria dispar nucleopol... 1-112 + C +C
9631039          Ld-bro-d [Lymantria dispar nucleopol... 1-112 + C
9631080          Ld-bro-i [Lymantria dispar nucleopol... 1-112 + C  missing the middle domain

# Broa-NC
-----------------------------------------
9635363          ORF113 [Xestia c-nigrum granulovirus... N + 146-239+ C
14602336         ORF99 similar to XcGV ORF113 [Cydia...  150-240 (BRON) + BRON


# Broa-N + C- SinR like HTH
------------------
13623110         hypothetical protein - pha... N + 88-182 + C - HTH


# BRON + Vsr
---------------
15078782         069L [Chilo iridescent virus] >gi|7...
 9631450         ORF MSV196                    4-75 BRON + C vsr nuclease  (101-196)
9631533          ORF MSV026                   1-69 (BRON) + C vsr nuclease  (101-196)
9631534          ORF MSV024                           8-79 + C vsr nuclease
9631444          ORF MSV204 ALI motif gene family pro.1-91 + C vsr nuclease


#VSR nuclease
-------------
9631531          ORF MSV028 hypothetical protein [Mel.. (solo VSR).
9964571          AMV257 [Amsacta moorei entomopoxviru...(solo VSR)
9631399          ORF MSV229 leucine rich repeat gene ...


-----------------------------------------phi31orf238N----------------------------------------------------------------------------------------------------------------------------------------------------------------
#phi31orf238N + kilAC
-------
2897108                      tr thermophilus phageTP-J34      +  112-236 (238) -- N - phi31orf238N 1-111
9632967                     Strthe_orf287_Sfi21  -             +              N--phi31orf238N      46-116
13622110                     Spy Mgas_ 116-240 (242) ---       +              N  phi31orf238N      1-116
14247767, 13701788, 8918426, 1370718,  9635199            Sa 132-350 -- + N  phi31orf238N     10-124 ORF11_phi ETA_131-246 (250) -  --   N phi31orf238N     10-123
5823644                       A118_136-260-- N 9-108                              N phi31orf238N      1-111
13701788                                         anti repressor [Staphyloc 10-123  + C kilAC
#kilA middle + kilAC   Ant1/2: may originally have had an RHA domain N-terminal to it, as its N-terminus seems to contain a region found at the C-terminus of RHA (the DELE motif)
------------------
137444, 15742          kilA middle + 225-321 (kilAC)
12514711     kila middle + kilAC


# phi31orf238N domain + shares a C-terminal domain with orf6_BPbIL285
 --------------
 gi|7239197, 12724417, 13095749,  13487806  Orf238 1-103              + C


# phi31orf238N + P22ARC
-----------------------------------------
9632546, 9633476, 1175795     hypothetical protein [Bacteriophage  8-114   + C unknown  (191-264)- CP-933N like antirepressor C
11354036  NMA1293                                              4-128   + C

11138338 wonder if this is truncated                           shares a small domain with phi31orf238N proteins + KilAC


#phi31orf238N + phiSLT orf 81a
  ---------
15024925            Cab  1-98    + C- phiSLT orf 81a (which is a solo) -12719400


------------------------------------------RHA-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
#RHA+ kilAC
------------------
9630225               SpbC2       1-109    + 133-244 (kilAC)
13559853              Roi_HK620   23-116   + 143-233 (kilAC)
1197729               Roi_HK022   24-117   + 118-234 (kilAC) -- check extreme C-terminus, 9634208 (RHAidentical)
9632502               Roi_BP-933W 48-118   + 119-235 (kilAC)
4499797               orf14_933W  kilA N 48-118   + 119-235 (kilAC)
2668765               Roi_H-19B   kilA N 23-116   + 102-233 (kilAC)
9633432               roi_VT2-Sa  kilA N 26-118   + 119-235 (kilAC)
13360660              coli        kilA N 24-116   + 117-233 (kilAC)
9634208               RHA+ KilA


 # RHA domain
 -------------
12719399             antirepressor [Staphylococcus aureu 2-108 --probably solo
2120256              rha protein - phage phi-80 >gi|1019108|gb  28-124---probably solo
12722192             unknown [Pasteurella multo 26-125  +  C-nothing
13095661             Orf3 [bacteriophage bIL311] >gi|127 14-100 \subfamily
14972611             hypothetical protein [Stre  3-102 /
13095876             anti-repressor [bacteriophage bIL31   8-127  RHA+ a C-terminal region specific to 14972611 and this protein
9633589              P42 [Bacteriophage APSE-1] >gi|61180  12-101 + kilA middle domain

#RHA+ P4ASH-N
------------
421263    hypothetical protein 179 - Shigella flexne 1-106 (P4ASH only part N)+ 106-168  (RHA)


#RHA+ Orf11D3
--------------
1175786              HYPOTHETICAL PROTEIN HI1412 >gi| 10-114 + Orf11D3 114-173
9635533              unknown protein [Enterobacteria phag       39-139 + Orf11D3 144-197

--------------------------------------------------------------------------------------------------------------------------------------




-----------------------------------------ALIGNMENTS-----------------------------------------------------------------------------------------------
 1. kilA N-terminus
PHD Sec. Structure         -EEEEE--------EEE-------------HHHHHHHHH--------HHHHH----------------------------------------EEEEEE------------HHHHHHHHHHHHH------HH--HHHHHHHHHHHHHH--
NMA1544_Nm_11290039        LIPRVESG---EIIPQRMSD-------GYINATALCKSVG----KSYSDYRQLQSTNHFLNELKAQTG---------------------LSEQQLIQQRIGGEPSL--QGSWVHPYLAINLAQ------WLSPAFAVKVSTWVHEWMSG \
NMB0900_Nm_11281012        NVSVLNFG---NTPVSFRQD-------GFLNATAIASHFG----KLPKDYLKSEQTQQYISALAENLSVRRKIL---------------TEANQIVIVKRGGSE----QGTWLHPKLAIHFAR------WLNPKFAVWCDEQIEILLNG /kilAN solo
AMV132_AMV_9964446         NYWCLHIN---DFNLIYNKKL------NLYNASRVCDIYE----KNIHIWLE-ENYDYTIKYLKIKEI---------------------NDHVSIINNNKESSL----NGLYVSEHILLGISI------WISEECYYKCINIILHNHDI-- has a C-terminus which matches the C-terminus of MSV-T5orf172
KILA_BPP1_125404           STTLPVIC---GVEITTDRA-------GRYNLNALHRASGLGAHKAPAQWLRTLSAKQLIEELEKET----------------------MQNCIVSFEGRGG-------GTFAHELLAVEYAG------WISPAFRLKVNQTFIDYRTG | kilAN + kilAC
HKBK_BPhk620_13559861      -MKAITLF---NTPIRVDES-------GMICLTDMWKASGKSESESPYHYLRNKQTKEFLAELEKN-----------------------HESVVFTERGVHG-------GTYGGKFVAYDYAA------WLNPGFKYAAYKVLDDYFTG |kilAN + ORF11D3-C
Orf11_D3__9635595          NVIPFHYQ---GKPVRFNSD-------GWINATDIAAAHG----MRLDNWLRNKETEAYIEALARHLNTSD------------------SRDLIRGQRGRGG-------GTWLHPKLAVAFAR------WISPDFAVWADLHIDALLRG |kilAN + ORF11D3-C
XF2294_Xf_11345968         TTQQLAIN---SLPIR-EQD-------GLYSLNDFHKASGGAVRHRPSEFLRLDKTKALVVELTNSPEFVSSIKGGA------------PHLFVRKEKGRAG-------STFACRELAIAYAS------WISPAFQLKVIRVFLASVVV |C-terminus
Gp73_HK97_9634189          NIIPIDFE---GHPMRFSDD-------GWFDATAAADKFN----KEPAQWLRLPETVRYIEALKSRYGNIT------------------YVKTSRARKDRGG-------GTWLHPKLAVRFAR------WLSVDFEIWCDEQIDAIIQG |Fused to p63C
                           -EEEEEEE---EEEEEEE-------------HHHHHHH--------HHHHHHHHHHHHHHHHHHHH-------------------------EEEEEE---------EEEEE---HHHHHHHHH------HHEEEEE----EEEEEEEE-
CIV315L_CIV_15079027       NFYYGLFR---DFKLVVDKNT------ECFNATKLCNSGG----KQFRQWTRLEKSKKLMEYYSRRG----------------------SQQMYEIKGDNKDQLVTQTTGTYAPIDFFEDIKR------WIQLPKASSASGVVYVVTTS   kilAN + Bro-eC
FPV124_FPV_9634794         RFCYIKYD---KFDLIMMKEN------RFINATKLCKLGG----KDFHRWKRLDGSKELMIKVNEMN-EMWKSAPPPPDL---------GGIIIEVNG-SNQYTEYDIAGSYVHQDLIPHIAS------WISPLFALKVSKIISCYVSG \
AMV112_AMV_9964426         SYYYGLFG---DFKLVIDKTT------GCFNATKLCNLSG----KRFRNWIRLDRSKQLLKYMENYRSSYV------------------SVGFYEVKGDNNNKTSKEITGQYVPKEVILDISS------WISVEFYLKCNDIIINYYNN  |
AMV024_AMV__9964338        SYYYGLFG---DFKLVIDKTT------GCFNATKLCNLGG----KKFKQWKRLEKSQELIDYIKNNRGGDP------------------HPGFYETKGDNKDENVKKITGCYVPKEVILDISS------WISVEFYLKCNDIIINYYNT  |
CIV006L_CIV_15078718       TFYKGLFG---DFPLIVDKKT------GCFNATKLCVLGG----KRFVDWNKTLRSKKLIQYYETRCDIKT------------------ESLLYEIKGDNNDEITKQITGTYLPKEFILDIAS------WISVEFYDKCNNIIINYFVN  |
CIV313L-CIV_15079025       NFYYGLFG---DFKLVVDKNT------ECFNATKLCNSGG----KRFRDWTKLEKSKKLMEYYKGRRDDHRG-----------------GSNFYEVKGDNKDDEVSKTTGQYVKKELILDIAS------WISTEFYDKCNQIVIDFFVV  |  kilAN + Broa-c
AMV110_AMV__9964424        SYYYGLFG---DFKLVIDKTT------GCFNATKLCNLGG----KQYRDWKRLEKSKELIKTLINVRRENS------------------RVWEYNIISNNNHEIHKQYTGYYVSKDLILDIAS------WIAPEFYLKCNDIIINYYNN  |
AMV100_AMV__9964414        TFYSAHIN---SYQLVIDKKT------GFFNASYVCIKNY----RKINNWLNNKKTIKLIKYYMNLLNNKNNN----------------NNKIKYKIVDKYDNIN----GIYLHPILLNHLLD------WINIKINNKYN--IIDYIIL /
FPV161_FPV_9634831         GFLILYYD---SIEIIVMSCN------HFINISALLAKKN----KDFNEWLKIESFREIIDTLDKIN--YDLGQRYCEEPYGASHSSVIIEVKASNLIDDRTA------GFYVHKDLIPYILT------CISIPFSLKVVRVLDTYIGE \
FPV236_FPV_9634906         YFMSMKLL---DVEVVIMRSN------GFVNITRLCNLEG----KDFNDWKQLESSRRLLNTLKDNN--KLHDP-----------------IINIRHTRIKIN------GEYVSQLLLDYVIP------WISPYVATRVSILMRYYRRC  |
FPV155_FPV_9634825         EFCYIQYS---GFHLVMMISN------CYINASKLCDT------KDFKKWLRLDSSLSLLQEIENTN---FPSEKKFSIKNSK------SVIILEKYYHEEVE------GYYIHPDILPHIVG------WLSPTFAISMSKFINGYISN  | C-nothing
FPV159_FPV_9634829         KFSYIIYD---KIKIIIMKSN------NYVNATRLCELRG----RKFTNWKKLSESKILVDNVKKIN---DKTNQLKTDMI--------IYVKDIDHKGRDTC------GYYVHQDLVSSISN------WISPLFAVKVNKIINYYICN  |
FPV248_FPV_9634918         NFCKLSYE---DIEIIMMKEN------EYINATRLCSSRG----RDILDWMSKESSVELINELDRIN---RSCNDYYDY----------RGIVLNVVSDSETS------ELYVHRDLILHISH------WISPLFSLKVVKFINSYIQD /
FPV163_FPV_9634833         HFCYIKYD---GITLTMMKDN------GYINATQLCMLGN----KDFKEWIKLDHSIELIKEIEKNI--NKETTKYVKAVISV------RSDYYNSETSNDIK------GFYIHGNIMPHICA------WISSKFAIKVSNIVHNYLND
FPV075_FPV_9634745         NFCFINYA---NIEVIMLKYN------GYINATKICDLGN----KNFRQWCRLESSKKLIKTLNYKN---GIYNKAVLE----------IGLASNSAYKYELV------GTYVHIDLVPHIIC------WVFPSIALNFSKILNSYLSN
FPV157_FPV_9634827         SFDSIKYR---DIKVIIMKNN------GYVNCSKLCKMRN----KYFSRWLRLSTSKALLDIYNNKS---VDNA---------------IVKVYGKGKKLIIT------GFYLKQNMIRYVIE------WIGDDFTNDIYKMINFYNAL \ + RING
FPV150_FPV_9634820         EYRVIEDN---GFSIILLKHT------EYINVTKLCKIHN----KEFYRWKRLISAGRIIETVSRDISNQGFESPL---------------VYVNRKGNKEFY------GFYAHPQLALYIAK------WISEDIFNKIKHLINSYTIS  |
p28_Ectro_1360841          LQYIDEPN---DIRLPVCIIRNINNITYFINITKINPDLA----NQFRAWKKRIAGRDYMTNLSRDT--GIQQSKL-------------TETIRNCQKNRNIY------GLYIHYNLVINVVI-----DWITDVIVQSILRGLVNWYIA  |
D6R_VAR_885801             LQYIDEPN---DIRLTVCIIQNINNITYYINITKINPHLA----NQFRAWKKRIAGRDYMTNLSRDT--GIQQSNL-------------TETIRNCQKNRNIY------GLYIHYNLVINVVI-----DWITDVIVQSILRGLVNWYID  |
YH22_VV_140731             LQYIDEPN---DIRLTVCIIRNINNITYYINITKINTHLA----NQFRAWKKRIAGRDYMTNLSRDT--GIQQSKL-------------TETIRNCQKNRNIY------GLYIHYNLVINVVI-----DWITDVIVQSILRGLVNWYIA -- no RING- truncated
PHDSec Str                 -EEEEE------EEEEEE----------EEEEHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHH----------------------EEEEEEEE----EEE------EEEHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH
1MB1_Sc_3402004            VDVYEFIH---STGSIMKRKK-----DDWVNATHILKA------ANFAKAKRTR----ILEKEVLK----------------------ETHEKV--QGGFGKY-----QGTWVPLNIAKQLAEKFSVYDQLKPLFDFTQTDGSASP
MBP1_Kla_729994            VDVYEFIH---PTGSIMKRKA-----DNWVNATHILKA------AKFPKAKRTR----ILEKEVIT----------------------DTHEKV--QGGFGKY-----QGTWIPLELASKLAEKFEVLDELKPLFDFTQQEGSASP
PCT1_Sp_11346262           VEVYECFI---KGVSVMRRRR-----DSWLNATQILKV------ADFDKPQRTR----VLERQVQI----------------------GAHEKV--QGGYGKY-----QGTWVPFQRGVDLATKYKVDGIMSPILSLDIDEGKAIA
SCT1_Sp_464742             VEVFEYTI---NGFPLMKRCH-----DNWLNATQILKI------AELDKPRRTR----ILEKFAQK----------------------GLHEKI--QGGCGKY-----QGTWVPSERAVELAHEYNVFDLIQPLIEYS---GSAFM
SWI4_Sc_666106             TDVYECYIRGFETKIVMRRTK-----DDWINITQVFKI------AQFSKTKRTK----ILEKESND----------------------MQHEKV--QGGYGRF-----QGTWIPLDSAKFLVNKYEIIDPVVNSILTFQFDPNNPP
CC10_Sp_115906             MKYMELSC---GDNVALRRCP-----DSYFNISQILRL------AGTSSSENAK----ELDDIIES----------------------GDYENV--DSKHPQI-----DGVWVPYDRAISIAKRYGVYEILQPLISFNLDLFPKFS
EFG1_Cal_1169477           TLCYQVDA---NNVSVVRRAD-----NNMINGTKLLNV------AQMTRGRRDG----ILKSE-------------------------KVRHVV--KIGSMHL-----KGVWIPFERALAMAQREQIVDMLYPLFVRDIKRVIQTG
Sok2_Sc__6323658           TLCYQVEA---NGISVVRRAD-----NDMVNGTKLLNV------TKMTRGRRDG----ILKAE-------------------------KIRHVV--KIGSMHL-----KGVWIPFERALAIAQREKIADYLYPLFIRDIQSVLKQN
Phd1-_Sc_6322808           TICYQVEA---NGISVVRRAD-----NNMINGTKLLNV------TKMTRGRRDG----ILRSE-------------------------KVREVV--KIGSMHL-----KGVWIPFERAYILAQREQILDHLYPLFVKDIESIVDAR
MGF-1_Yaly_5139660         TLCFQVEA---RGICVARRED-----NDMINGTKLLNV------AGMTRGRRDG----ILKGE-------------------------KLRHVV--KAGAMHL-----KGVWIPYDRALEFANKEKIIDLLFPLFVRDIKSVLYHP
AM1_Nc_1517923             SLCFQVEA---RGICVARRED-----NAMINGTKLLNV------AGMTRGRRDG----ILKSE-------------------------KVRHVV--KIGPMHL-----KGVWIPFERALDFANKEKITELLYPLFVHNIGALLYHP
StuA_Eni_549002            SLCYQVEA---KGVCVARRED-----NGMINGTKLLNV------AGMTRGRRDG----ILKSE-------------------------KVRNVV--KIGPMHL-----KGVWIPFDRALEFANKEKITDLLYPLFVQHISNLLYHP
SPBC19C7_10_Sp_7491471     LKCTNPES--KVPHFLMRMAK-----DSSISATSMFRS------AFPKATQEEE----DLEMRW------------------------IRDNLN--PIEDKRV-----AGLWVPPADALALAKDYSMTPFINALLEASSTPSTYAT
G6G8_4_Nc__12802359        SGIFKSSP---PSYFLMRRSQ-----DGYISATGMFKA------TFPYASQEEE----EAERKY------------------------IKSIPT--TSSEETA-----GNVWIPPEQALILAEEYQITPWIRALLDPSDIAVTATD
1MB1 Sec structure         EEEEEEEE-----EEEEEEE---------EEEHHHHHH----------HHHHHH----HHHHH----------------------------EEE---------------EEEE-HHHHHHHHHH---HH---HH------------








2. kilAC-terminus                                                   roi1                              roi2
---------------                                                      *                                *
PHD Sec Str.              ------------HHH-HHH-----EEEHHHHHHHHH----HHHHHHHHHHHH--EEE-----------HHHH---EEEEEEEEEE-----EEEEEEEE----HHHHHHHHHHH------
SA1801_SaN315_13701788    LETKIERDKPKIVFADAVATTKTSILVGELAKIIKQNGINIGQRRLFEWLRQNGFLIKRKGVDYNMPTQYSMERELFEIKETSITHSDGHTSISKTPKVTGKGQQYFVNKFLGEKQTS \
ORF11_BPETA_8918426       LETKIERDKPKIVFADAVATTKTSILVGELAKIIKQNGINIGQRRLFEWLRQNGFLIKRKGVDYNMPTQYSMERELFEIKETSITHSDGHTSISKTPKVTGKGQQYFVNKFLGETQTT |
orf238_BPTP-J34_2897108   LEAQIEADRPKVLFADAVSASKSSCLIGELAKILKQNGINIGQNKLFQWLRSNGYLISRRGDSWNQPTQKSMQLGLFELKKTNINHADGHTTTNTTTKVTGKGQQYFINKFLNQERLT |
orf287_BPSfi21_9632967    LEAQIEADRPKVLFADAVSASKSSCLIGELAKILKQNGINIGRNKLFQWLRSNGYLISRRGDSWNQPTQKSMQLGLFELKKTNINHADGHTTTNTTTKVTGKGQQYFINKFLNQERLT | phi31orf238N + kilAC
SPy0946_Spy_13622110      LEAQIEADRPKVLFADAVSASHTSILVGELAKLLKQNGVNIGATRLFTWLRKHGYLIKRNGRDWNMPTQKSVELGLIRVKETSITHSDGHITVSKTPLVTGKGQQYFINKFLNQEYLP |
ORF42_A118_5823644        ALNQIEEQKPKVIFADAVQTSENTVLVKDLATILKQNGLDIGQNRLFEWLRGSGYLLN-KGTYYNKPSQKAMNLGLFEQKTHIHTDRNGLMVTTYTPRVTGKGQVYLLNKLLEEHGLV /
ORF169a_BPmv4_11138338    LTLQLEESNKKASYLDIILGTPDLLATTQIAAD-----YGYSARTFNQLLKEVGIQH--KVNGQWILYKAYMGKGYVQSKSFAFKDRKGHDRSKPSTYWTQKGRKLIYDVLKENGTLP |shares a small domain with phi31orf238N
KILA_BPP1_125404          LEQKMLMDAPKVEFAERVATASG-VLIGNYAKV-----LGLGQNYLFTWLRDNGILIA-TGERRNVPKQEYISRGYFTLKETVIDTSNG-SRISFTTRITGKGQQWLMKRLLDAGVLV \+kilAN
yoqD_BPSPBC2_9630225      LQEQLTLAEPKVEKYDRFLNTDGLMKIGQVAKAIGI--KGMGQNNLFRFLRENKVLI--DGTNKNAPYQKYVERGFFQVKTQETS-----VGIKTITLVTPKGADFIVDLLKKHGHKR \
ROI_HK022_1197729         LENQLAIAAPKVEFADRVGEASG-ILIGNFAKV-----VGIGPNKLFAWMRDHKILIA-SGSRRNVPMQEYMDRGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKFVDNGMLK |
orf14_BP933W_4499797      LEKQLALAAPKVEFADRVGEASG-ILIGNFAKVV-----GIGPNKLFAWMRDHKILIA-SGARRNVPMQEYMDRGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKLLDNGMLK |
ROI_BP933W_9632502        LEKQLALAAPKVEFADRVGEASG-ILIGNFAKV-----VGIGPNKLFAWMRDHKILIA-SGARRNVPMQEYMDRGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKLLDNGMLK | RHA+ KilAC
ROI_BPH-19B_2668765       LEKQLALAAPKVEFADRVGEASG-ILIGNFAKVV-----GIGPNKLFAWMRDHKILIA-SGARRNVPMQEYMDRGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKLLDNGMLK |
ROI_BPHK97_9634208        LENQLAIAAPKVEFADRVGEASG-ILIGNFAKV-----VGIGPNKLFAWMRDHKILIA-SGARRNVPMQEYMERGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKLLDNGMLK |
ROI_BPHK620_13559853      LENQLAIAAPKVEFADRVGEASG-ILIGNFAKV-----VGIGPNKLFAWMRDHKILIA-SGSRRNVPMQEYMERGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKLLDNGMLK /
Ant1_BPP1_137444          LEQQLVAAAPKVDFADRVSVANG-ILIGNFAKV-----VGLKQNALFSWLRQNGILMA-FGARKNVPRQQYINAGYFTVKEVVLDDENG-YQIRLTPN-------------------- \
ANT2_BPP1_15742           LEQQLVAAAPKVDFADRVSVANG-ILIGNFAKVV-----GLKQNALFSWLRQNGILMA-FGARKNVPRQQYINAGYFTVKEVVLDDENG-YQIRLTPN--------------------  | truncated? kila middle + kilAC
Z1797_Ec_12514711         LENQLAIAAPKAEFVDNYVEASGLMGFREVAKLL-----GIKETDFRLFLLENGIMYR--LAGKMTPYSHHLDAGRFSVKTGEA----GNGHAFTQVKFTPKGVQWIAGLLAAWRATA /
ORF23_BPRLT_1353540       SITYVPIEK-K-----NIILSNQEISYSEFIELLELNNIKMSKIMFLKFMRDRRITIDEKGKFYNFPTAFSIEMGIMLLSSTTKENVQ-----KYIPKITIEGQKYFIEKFHYMIEDK | kilAC solo
ORF5_BPRLT_1353522        LNIELAAATEKTTYLDLILESPDDILITQIAQD-----YGFSAVKFNRILNELRIQR--KVNKQWVLYSRYMGKGYIGSRTQNYVDSKGQERTSITTTWKQKGRKFLYETLKKHGYLP \
ORF38_BPBK5-T_14251162    LNLELAAATEKTTYLDLILEIPDDILITQIAQD-----YGFSAVKLNRILNELRIQR--KVNKQWVLYSRYMGKGYIGSRTQNYVDSKGQERTSITTTWKQKGRKFLYETLKKHGYLP  |
SAV0855_Sa_14246624       LQQEIGELKPKADYVDEILKSTGTLATTQIAADY-----GISAQKLNKLLHEARLQRK-VNKQWVLYSEHM-GKSYTDSDTITIVRSDGREDTVLQTRWTQKGRLKIHEIMTEFGYEA  | + Broa-N
orf8_BPbIL309_13095813    LAVENQIMQPKAQYFDDLVERNLLTSFRDTAKML-----KVGQKQLIDWLLENKYIYR-DKKNKLMPYAQY-NNDLFEIKESKGATN---SWKGAQTLITPVGRETFNLLLN-EYKAS  |
ORF291_BPLL-H_1395130     AEQKLSEAKPKLDYVDKILASKKTILTTHLATDY-----GCSAVAFNRMLCDKKIQRK-VRDTYVLYSQYQ-GHGWTHTFARAIKTKHG-QEIKEQMEWTQKGNIGLYELLKDRFGLL  |
orf9_BPphiPV83_9635686    LQQQVEVNKPKVLFADSVAGSDNSILVGELAKILKQNGVDIGQNRLFKWLRNNGYLIKKSGESYNLPTQKSMDLKILDIKKRIINNPDGSSKVSRTPKVTGKGQQYFVNKFLGETQTT  |
SPy0980_Strpy_13622137    LSVENMVMKPKADYFDDLVDRNLLTSFRETAKQL-----KVKERRFIQFLLDKKYVYR-DKKGKLMPFADK-NNGLFEVKESVNEKTN---WAGTQTLITPKGRETFRLLFI------ /
consensus/85%             Lp.bl.....Kh.ahD.l..sp..h.h.phAp.......sh....h..hhpp..hbb......b....p.....shhp.+p....p.p.......ps.hp.cG...h...h.......





3. ORF11CD3
--------------
PHD Sec. Str.               ---HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH----
HI1412_Hi_1175786           GYSLMHKYNELCIEHKAKKAFASLCGKGLREW-KGDKPVLEATLKLFEDKMQIELPIK  |+ RHA
orf201_BPP22_9635533        SESYEAERNAIMLEYMKEKDVASMSGRLLNRWGKIKKPQLLARIGRLEQHGQTVIPGL  |+ RHA
orf11_BPD3_9635595          ELTEKQAFDRACKQLEDGRQLASLHGKGLADW-KFKKPMLEHRVDEMRDRLQMVLGLE  |+ kilAN
hkbK_HK620_13559861         RNSLSAQLNMKCHEFDQKKDMASFCGQGLAAW-RYTKPVLVAEINSLANQLQITIPGL  |+ kilAN
ANT_BP7888_10799917         RMSVMEELNQACADMKRDKNIASVFATGLNEW-KQVKSAHVSKIRTLINEANLLIDFV  | + P22 ANT-N- near identical or identical, 9632512, 13361651, 12516389
ANT_BP933W_9632512          KMSVMEELNQACADMKRDKNIASVFATGLNEW-KQVKAAHVSKIRTLVNEANMLIDFV  | + P22 ANT-N- near identical or identical, 9632512, 13361651, 12516389
consensus/100%              ..o....ts..h.p....+.hASh.sp.L..W.+..Ks....pl..h.pp.p..ls..

4.  P22 AR-N
------------------
ANT_BP7888_10799917        MNMMTVPFHGDSLYVVNHNGEPYVPMKPVVAGMGLAWQSQLAKL-RQRFASTITEIVMVAEDGKRRNMVSLPLRKLAGWLQTINPNKVKPEIRGKVIQYQEECDDVLYEYWTKGFVVNPR|P22ANT-N + ORF11D3C
ANT_BPP22_9635550          VNTSYVPFNGQHVLTAMVAGVAYVAMKPVVDNIGLSWSSQVQKLLKMKDKFNYVDIDMVAGDMKKRLMGCIPLKKLNGWLFSINPEKVRADIRDKLIKYQEECFTVLYDYWTKGKAENPR| P22 ANT-N + Bro-AN+ P22ANTC
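
Each alignment row in this file consists of a label, the aligned sequence, and an optional free-text annotation after a bracket character (\, / or |). The following is a minimal sketch of how such rows could be read programmatically. The names `Row` and `parse_row` are hypothetical, and treating the trailing number in the label as a database identifier is an assumption; secondary-structure and consensus lines are skipped rather than parsed.

```python
import re
from typing import NamedTuple, Optional


class Row(NamedTuple):
    name: str              # label without the trailing numeric field
    ident: Optional[str]   # trailing numeric field, assumed to be a database ID
    sequence: str          # aligned residues, '-' marks a gap
    note: str              # free-text annotation, bracket characters removed


# label, whitespace, aligned sequence (letters and gaps), then anything else
_ROW = re.compile(r"^(\S+)\s+([A-Za-z-]+)\s*(.*)$")


def parse_row(line: str) -> Optional[Row]:
    """Parse one alignment row; return None for lines that are not rows."""
    stripped = line.strip()
    # Secondary-structure and consensus lines use a different alphabet.
    if not stripped or stripped.startswith(("PHD", "consensus")):
        return None
    m = _ROW.match(stripped)
    if m is None:
        return None
    label, sequence, rest = m.groups()
    name, _, tail = label.rpartition("_")
    if name and tail.isdigit():
        ident = tail
        name = name.rstrip("_")  # some labels carry a doubled underscore
    else:
        name, ident = label, None
    # Drop the grouping brackets that precede the annotation text.
    note = rest.lstrip("\\/| ").strip()
    return Row(name, ident, sequence, note)
```

Section headings that happen to contain letters would need to be filtered out separately; the sketch only targets the sequence rows themselves.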


5. RHA                                                                                                              roi3
----------------                                                                                                    *
PHD Sec. Str.             ----------------------EE--HHHHHHHHHHHHHHHHHHHHHHHH---------EEE----------------------EEEEE--------EEEEEEEE----EEEEEEEEE----HHHHHHHHHHHHHHHHHH--
Orf3_BPbIL311_13095661    MKGLAFLTSP------DLSKAEVVTNHVVIAEYAGIERKSVRRLINNHKNDF------ENFGRL----------------RF-EITTLPDSR---GQKVKIYQLNRNQAMLMITYLDNTEVVRNFKIALVKRFDEMEKELYA \
SP1134_Spn__14972611      M-ELVY----------MDGKKEPYTTSEIIAECAEVQHHTITRLIRENKADF------EELGIL----------------GF-K-IHKLDTR---GQPKKSYILNEQQATFLITYLKNTETVRQFKLNLVKAFFEMREELS- /
orf14_bIL310_13095876     MNEITV------SLDVIIKNKNVIVSSLSVAKAFDRQHSHILRSIEDIKRDWDSLIQSKNGLNRNIIPLKSQGNKQVIFTDYFKESEYIAEN---GRLVKFFEMNRNGFMLLANSFNGKRIL-PIKLAFIERFDELE----- | shares a C-terminal domain with 14972611
PM1774_Pm__12722192       LQGAESAVNNAVFPKVFHKETVAMTDSLKVAHYFGKRHDNLLNTIKNLGC-------SDEFRLL----------------NF-KESYYLNEQ---NKKQPMFYMTQDGFTLLVMGFTGKKAM-QFKEQYIKEFNEMKKRLAT |solo
RHA_BPphi-80_2120256      MNNPSVIPAFDFREMVTTLDNKIITTSLKVADYFGKRHKDVLRAIRNLKC-------SDDFTQR----------------NF-APIDFIDKN---GDVQPMYNITRDGCMMLVMGFTGKTAA-AVKECYINAFNWMAEQLN- |solo
orf179_Sf_421263          MATILTLSHP----DATIENGRAVTTSVAVAEFFRKMHKNVIQKIETLEC-------SPEFNRL----------------NF-KPVTYTDAK---AKNAQCTKSPKTASFSW------------------------------  + ASH-N
orf182_BPphiSLT_12719399  MQAL----------QIVEQNETHYVDSREVAEMIGKRHDNLVRDIKGYIKVLED---SSKLSSH----------------NFFEESTYVNSQ---NKVQPCYLLTKKGCDIVANKMTGSKGI-LFTATYVDAFHKMDEYIKQ |solo
ORF201_9BPP22_9635533     MNELIANHDFDFRQLVTAAEGQPVTDTFQIAKAFGKRHADVLRALKNCHC-------SEDFRRA----------------HF-CVSEKINNLGIFDKKQIYYRMDFSGFVMLVMGFNGAKAD-AVKEAYINAFNWMSAELR- \ + ORF11D3-C
P42_BPAPSE-1__9633589     MQNLIT-------------FQSLTMSSLEIAELVNKRHDNVKRTIETLAK-------SEIIQLP----------------QS-EKVENKQSNSPNR-FTEVFIFEGEQGKRDSIIVVAQLC-PEFTACLVDRWQELEQKLNT|+ kilA middle
yoqD_BPSPBc2_9630225      -MESYL--------TVIEQNGQLLVDSREVAEMVGKRHTDLLRSIDGYVAILL----NAKLRS----------------VEFFLESTYKDAT---GRSLKHFHLTRKGCDMVANKMTGAKGV-LFTAQYVSKFEEMEKALKA \
hkbC_BPHK620_13559853     MNELIN-------------GNAIKMTSIEIAELVGKRHDNVKRTIETLAK-------NGVIRLP----------------QI-EVSERINNLGFNV-QYEHYVFEGEQGKRDSIVVVAQLS-PEFTARLVDRWRELEETAVN  |
Roi_BPH-19B_2668765       MNELIN-------------GNAIKMTSIEIAELVGKRHDNVKRTIETLAK-------NGVIRLP----------------QI-EVSERINNLGFNV-QYEHYVFEGEQGKRDSIVVVAQLS-PEFTARLVDRWRELEGATAK  |
ROI_BPVT2-Sa_9633432      MNELIN-------------SNAIKMTSIEIAELVGSRHDKVKQSIERLAV-------RGVIRNP----------------PM-VVFEKINNLGLLR-GVEAYVFEGEQGKRDSIIVVAQLS-PEFTARLVDRWRELEGATAK  | +kilAC
ROI_BP933W_9632502        MNELIN-------------SNAIKMTSIEIAELVGSQHGNVRISIERLAK-------RGVIQLP----------------SM-QKVENKQTISPNK-FTSVYIFEGEQGKRGSIIVVAQLS-PEFTARLVDRWRELEGATAK  |
Roi_BPHK022_1197729       MNELIN-------------GNAIKMTSIEIAELVESRHSNVKVSIDRLVK-------RGVIKPP----------------AL-QHTNIINDLGVITGKRDFYVFEGEQGKRDSIIVVAQLS-PEFTARLVDRWRELEEAAVN  |
ROI_BPHK97__9634208       MNELIN-------------GNAIKMTSIEIAELVESRHSNVKVSIDRLVK-------RGVIKPP----------------AL-QHTNIINDLGVITGKRDFYVFEGEKGKRDSIIVVAQLS-PEFTARLVDRWRELEEAAV-  /
consensus/100%            .........................sp..lAchh..b+.pl...lp.......................................................a.bp.p...b....h.s......hp..blp.a..b......
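
The consensus lines in this file summarize each column either as a conserved residue (uppercase), a residue-class symbol (for example h, p, s, +), or a dot. The sketch below illustrates one way such a line could be derived at a given threshold. It is not the procedure used to produce the lines above: the function name `consensus` is hypothetical, and the class memberships in `CLASSES` are illustrative assumptions rather than the exact sets behind this file.

```python
from collections import Counter

# Illustrative residue classes (assumed memberships, not taken from this file).
CLASSES = {
    "l": set("ILV"),             # aliphatic
    "a": set("FHWY"),            # aromatic
    "+": set("HKR"),             # positively charged
    "-": set("DE"),              # negatively charged
    "o": set("ST"),              # alcohol
    "c": set("DEHKR"),           # charged
    "s": set("ACDGNPSTV"),       # small
    "p": set("CDEHKNQRST"),      # polar
    "h": set("ACFGHIKLMRTVWY"),  # hydrophobic
}


def consensus(rows, threshold=1.0):
    """Return a consensus string for equal-length aligned rows.

    A column yields the residue itself if one residue reaches the threshold,
    otherwise the smallest class reaching it, otherwise '.'.
    Gaps count against the threshold.
    """
    if not rows:
        return ""
    width = len(rows[0])
    if any(len(r) != width for r in rows):
        raise ValueError("aligned rows must have equal length")
    # Try the most specific (smallest) classes first.
    ordered = sorted(CLASSES.items(), key=lambda kv: len(kv[1]))
    out = []
    for col in zip(*rows):
        need = threshold * len(col)
        counts = Counter(c.upper() for c in col if c.isalpha())
        symbol = "."
        if counts:
            top, n = counts.most_common(1)[0]
            if n >= need:
                symbol = top
            else:
                for name, members in ordered:
                    covered = sum(k for r, k in counts.items() if r in members)
                    if covered >= need:
                        symbol = name
                        break
        out.append(symbol)
    return "".join(out)
```

With `threshold=1.0` every sequence must agree on a residue or class for the column to be reported, which corresponds to the consensus/100% convention; `threshold=0.85` corresponds to consensus/85%.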

http://www.bmm.icnet.uk/servers/3dpssm/output/1f3d9c24585b2b85.job_summary.html


6. orf6N
-------------

PHD Sec. Str.               -----EE-----EEEEHHHHHHH----HHHHHHH------------EEEEE----HHHHH--------EEE-----------------------EEEE---HHHHH---
N.orf6_BPbIL285_13095686    MNELQITELNGQRVLTTQQIAEGYGTDSASITKNFNNNKSRFKEGKHFFLLQGADLKEFK---NNIQNLDV----------------VGNRAPKLYLWTEKGALLHAKS \ +ORF6C
N.ORF6_BPTP901-1_13786537   MNELQITELNGQRVLTTQQIAEGYGTDSASITKNFNNNKSRFKEGKHFFLLQGADLKEFK---NNIQNLDV----------------VGNRAPKLYLWTEKGALLHAKS /
orf13_GMSE-1_12276103       NTQLPVIEYQGQRVITTELLAQGYGAEVKSIHMNFTRNKSRFEETKHYFLLQGEELKAFI---NYPTNCGL----------------VDKRSPSLVLWTGRGS------       \
H0107_Ec_7649857            VETLSPITHNQIPVITTELLAQLYGTEPVRIRQNHHENKVRFVEGKHFFKVVGNDLKELRVALNYSQNLRVTLSNSQNLQPSLRGLQISPKARSLILWTERGAARHAKR /solo
ANT_BPVT2-Sa_9633431        VETLSPITHNQIPVITTELLAQLYGTEPVRIRQNHHENKVRFVEGKHFFKVVGNDLKELRVALNYSQNLRVTLSNSQNLQPSLRGLQISPKARSLILWTERGAARHAKM | + P22ARC


7. ORF6C: the proteins are almost identical, but at least two of them are fused to orf6N
-------------

PHD Sec. Str.               -------HHHHHHHHHH------HHHHHHHHHHHHHHHH-----HHHHHHHHHHHHEEEEE----HHHHHHHHHHHHHHHHHHHHHH---EEEEE------HHHHHHHHHHHH-----EEEEE------------
C.orf6_BPbIL285_13095686    EKQLPQTPEQQIALLAQGNVNLNKKVEQIENSVLDLTDRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTTLEIRGLNSQTSFDFEE \ + orf6N
C.ORF6_BPTP901-1_13786537   EKQLPQTPEQQIALLAQGNVNLNKKVEQIENSVLDLTDRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTTLEIRGLNSQTSFDFEE /
ORF55_BPpi3_12724417        QQLLPQTPEQQIALLAQGNVNLNKKVEQIENSVLDLTDRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTTLEIRGLNSQTSFDFEA \
ORF6_BPTuc2009_13487806     QQLLPQTPEQQIALLARGNVNLNKKVERIENSVLDLTDRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTTLEIRGLNSQTSFDFEE  | + phi31orf238N
orf6_BPbIL286_13095749      QHLLPQTPEQQIALLAQGNVNLNKKVEQIENSVLDLTDRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTTLEIRGLNSQTSFDFEA  |
orf238_BPphi31.1_7239197    QQLLPQTPEQQIALLAQGNVNLNKKVEQIENSVLDLTVRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTMLEIRGLNSQTSLSNYQ /
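
The near identity of the ORF6C sequences can be checked directly from the aligned rows. The following is a minimal sketch, assuming identity is measured over columns in which neither row has a gap; other conventions exist, and the function name `fraction_identity` is hypothetical.

```python
def fraction_identity(a: str, b: str, gap: str = "-") -> float:
    """Fraction of identical residues between two aligned rows.

    Columns where either row has a gap are ignored (an assumed convention).
    """
    if len(a) != len(b):
        raise ValueError("aligned rows must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == gap or y == gap:
            continue  # skip gapped columns
        compared += 1
        if x == y:
            matches += 1
    return matches / compared if compared else 0.0
```

For example, the first 20 columns of C.orf6_BPbIL285 (EKQLPQTPEQQIALLAQGNV) and ORF6_BPTuc2009 (QQLLPQTPEQQIALLARGNV) differ at four positions, so the function returns 16/20 for that window.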



8. phi31orf238N
---------------------
PHD Sec. Str.             --EEEEEEE------EE--HHHHHH------HHHHHHHHHHH----------EEEEEEEE-----------------EEEEEHHHHHHHHHHH----HHHHHHHHHHHHH--
orf238_BPphi31_1_7239197  MNQLITITQNENNEQVVSGRELHQFLGV-KTRYNDWFED-MVKYG-FTENVDFIGFTEKRV-KPQG-----GRPSVDHALKLDMAKEISMIQRNEKGKQARQYFIEVEKELK\  + ORF6C
orf6_BPTuc2009_13487806   MNQLITITQNENNDQVVSGRELHEFLGV-KTRYNDWFED-MVKYG-FTENVDFIGFTEKRV-KPQG-----GRPSVDHALKLDMAKEISMIQRNEKGKQARQYFIEVEKELK |
orf6_BPbIL286_13095749    MNQLITITQNENNDQVVSGRELHEFLDI-TERYSTWFER-MLKYG-FVENIDFVGC--KVF-NTLA-----KQELQDHALKIDMAKEISMIQRNEKGKQARQYFIEVEKELK |
P55_BPpi3_12724417        MNQLITITQNENNDQVVSGRELHEFLDI-TERYSTWFER-MLKYG-FVENIDFVGC--KVF-NTLA-----KQELQDHALKIDMAKEISMIQRNEKGKQARQYFIEVEKELK /
orf238_BPTP-J34_2897108   MNELINITLNENQEPVVSGRQLHKALGV-KTAYKDWFPR-MTEYG-FTDGEDFSSFLSKSTG---------GRPSQDHIIKLDMAKEIAMIQRTDKGKEVRQYFIQVEKDFN \ + kilAC
SPy0946_Spy_13622110      MNQLINVTLNENQEPVVSGRDLHKVLEI-KTQYTKWLER-MSEYG-FVENEDFMAISQKRL-TAQGN----QTEYTDHVLKLDMAKEIAMLQRNEKSKEVRKYFIQVEKDFN  |
SA1801_Sa_13701788        IGEMFNIQEKENGEIAISGRELHQALEV-KTPYKKWFER-MSDYG-FEENIDYIVTDIFVH-NPLGG----RQNQTDHALTLDTAKEIAMIQRSEPGKRARQYFIQVEKAWN  |
ORF11_BPphiETA_8918426    IGEMFNIQEKENGEIAISGRELHQALEV-KTAYKDWFPR-MLKYG-FEENTDYTAIAQKRA-TAQGN----MTHYIDHALTLDTAKEIAMIQRSEPGKRARQYFIQVEKAWN  |
orf34_BPphiPVL_9635199    IGEMFNIQEKENGEIAISARELYKALEV-KKRFSAWAEI-NLKH--FKENRDFTSVLTSTV-VNNGA----VRQLEDYALTLDVAKHVAMMSGTEKGFDFREYFIQVEKAWN  |
ORF42_BPA118_5823644      ANEMLPVLENEKGEKFVNARTLHEKLMT-TTKFADWIKRRIRQYG-FVENEDFFSLLKNEK-RAIG-----GTTSIDYIFTLDSGKELAMVENTEQGRAIRKYFIEVEKQAR  |
SAV1994_Sa_14247767       IGEMFNIQEKENGEIAISGRELHQALEV-STRYDKWFER-MTEYG-FENGIDFISQVEKVH-GQKRAR---TYEQVNHILTLDTAKEIAMIQRSEPGKRARQYFIQVEKAWN  |
orf287_BPsfi21_9632967    MNELINVTLDKNNEPIVSARQLHKTLEV-KTRFSQWVEQ-NFKI--FKENEDFSSVVTTTQQNQYGG----TKELQDYAVTIRMAEHLAMMSKTNKGHEVREYFIKVEKDFN /
L0142_BP933W_9632546      RIPVFNGTIANETTLLVNARDLHTFLGV-GKRFASWITERIEEYG-FVENQDYIAISQKREIGY-------GRGKKDYHLTLDTAKETAMVERNEKGRQIRRYFIECEKKLR  \  + P22ARC
orf80_BPVT2-Sa_9633476    LIPVFNGTIANETTLLVNARDLHTFLGV-GKRFASWITERIEEYG-FVENQDYIAISQKREIGY-------GRGKKDYHLTLDTAKETAMVERNEKGRQIRRYFIECEKKLR   |
HI1422_Hi_1175795         LIPVFNGLIQNQPVQLCNARELHAFVES-KQQYTDWIKNRINEYG-FIQDEDYLVITERTN----------GRPRKEYHITLDMGKELGMVERNERGRQIRQYFIRCERTLK   |
NMA1293_Nm_11354036       LIPTVSGQLDNQTQALVDAHDLHKFLGV-ETPFSKWIQRRIEEYG-FTQALDFIGVDKIVR-TEAGFFGQRDKTVQGYYLSLDMAKELCMVERNDKGRQARRYFIEMEKQAK  /
CAC1945_Cab_15024925      MENLIRIS----DKGLVSAKELYLGLGLNKTNWSRWYPKNIQSNEFFKENIDWIGVRHNDE----------GNETMDFAISIEFAKHIAMMAKTEKSHEYRNYFIKCENKLK  + phi-SLT -orf81a
consensus/100%            ................hss+pLH..l....p.a..W......p...F.ps.Da.s......................a..plc.scc.sM...sp.s..hRpYFIbhEp..p



9. P22ARC
--------------------

PHD Sec. Str.            -------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------EE--HHHHHHHHHHHHHHHHHHHH---
L0142_BP933W_9632546     QAEPQQQFTDEEIILLCYMQLWMEKAQDLSKHLYPIMKELNSSYTNKLYDIAFETIYMVTKNRDVLLREAARLD \
orf80_BPVT2-Sa_9633476   QAEPQQQFTDEEIILLCYMQLWMEKAQDLSKHLYPIMKELNSSYTNKLYDIAFETIYMVTKNRDVLLREAARLD  |
HI1422_Hi_1175795        PEKFTHEFTEFEIETLVWLLIGHHQMNTLLGQLEKPLDAIGSNLHPAVYSYWKEYGRQYKDALPTIKRLMAPFK  | + phi31orf238N
HI1418_Hi_1175791        EKKFSFEFTEYELQQLVWLWFAFMRGIVTFQHIEKAFKALGSNMSGDIYGQAYEYLSVYAQQTKS--------- /
Z1818_Ec_12514734        QEKSTNELSAKEANSLVWLWDYANRSQALFRELYPALKQIQSNYSGRCYDYGHEFSYVIGMARDVLINHTRDVD | + Bro-a N
ANT_BPP22_9635550        QEKKTNDLSAKEANSLVWLWDYANRSQALFRELYPAMRQIQSNYSGKCYDYGHEFSYIIGIARDVLINHTRDVD |P22ANTN + Broa-N + P22ARC
ANT_BPVT2-Sa_9633431     QEKKLNGLSAKETDSLVWLWDYANRSQALFRELYPALKLIQSGYSGICHDYGYEFSYIIGRARGVLINHTRDID | +orf6N at N




 12514711 1-102 start from here  (kilAC)

10. ASH
-------------------
ECs2630_Ec_13362098       TQKNRLPCRNRSGYISAAPHKTGAGILNPIQSKAHNRASGFFVRTV------------LPRLFRVRIMAGRTGPTSVGPDSLLSGVENPVRLASP-RFSTL-DGELFL---------------------------------------------------------------------  + C low complexity
ash_BPP4_75898            ------------------------------MVWCVVSRADGIPCIL-----------PASAHYAAESMVAQAGQPPGWPVSCEAGILTPVWAIAI-ERENS-GDSVICYSQEAAIMATTLTPSHPEFVFVFAAVRRADRHPRICMLRTVAGDERSARRSLVRDYVLSLAARLPVVEV
ASH_BPphi-R73_93828       ------------------------------MVWRVVCRAGMIL--------------FAIACYATESMVAQAGQPPGWPVFFEAGIPTPVWAIAI-ERRNS-GDSSYLLLEGDGLMATTLTPSHPEFVFVFAAVRRADRHPRICMLRTVAGDERSARRSLVRDYVLSLAARLPVVEV
Z0337_Ec_12513051         YLYSGLLTVVISRYSFSAVAKSAAGIGVPYNLLATIDAPCVFFYVVAQAQPFSGLWCLCLHHGSIEIMVVRAGQPSGWPVSNKAGYANPVRAATS-EIGVS-GGSNNRYLLEAAIMATILTPSHPQYVFVFAAIRRADTHPRICMLRTVSCDERSARRLLVRDYVLSLSARLPAGEV
Orf179_Sf_421263          MLNVAIENQNGWNYSAPAPHKTGAGIATPTMTTAHNRAQAVF---------------LCVKHSHIQIMVGRAGQPQGWPVSVVTGCSNPVRLTTH-EIATS-GGESFKLTIEAAIMATILTLSHPD---------------------------------------------------  + RHA; the other paralogs do not seem to have this. Possibly an artificial fusion?
ORF199_Sf_312621          MLNVAIENQNGWNYSASAPHKTGAGRGNPNVTRAHSRAEAVF---------------LCVMHSSIQIMVGCAGQSQDWPGSRVTGISTPVRLTTL-MVVENLGGELINLSLEDAIMATIPALSHPD---------------------------------------------------
gp32_BPN15_9630502        -----------------------------------------------------------------------MGPTSVGPVSSCTGVENPVWATTPIEILNS-GGSTLYKIGM-----------HTMFKFKFAAVVRTDKKSHIHRLSTIASSEREARRQFASRFVLVLSARIPVSEV

Reconstruction:

ancestor of ECs2630/Z0337 + P4 ASH protein insert = Z0337
ECs2630: secondary loss of C
C-terminal loss = ORF199
+ RHA = ORF179
gp32: secondary loss of linker region

or P4ASH plus extension = ECs2630



11. BRON
-----------------------------------
                          EEEEEEEEEEE-------------------------HHHHHHHHHHHHHH--H-HHHH---------------------------HHHHHHHHHHHHHHHHH--------------------------------EEEEEE---HHHHHHH-HHH-------HHHHHHHHHHHHHHHHHHHHHHHHHH
MSV226_MSV_9631397        EKVP----FVI------------------KK-DNETWYNMLDII-KILGYKKKL-HLHA---------------------------SLLNKNNK-KKFYQLLTK---------NTLKNKYFKY------TNVQKNRIFINEVALFYILLSSKKE--------NAII--CKNYVFG--NLFKLENLNL  |solo
AMV055_AMV_9964369        TFNEI---FKFN------NKSIDVI----GT-LNNPWFCGKDVL-NILEYEKSSFKKIL---------------------------QRLKESYK-KSYREILYKV-----------GDNLSP-----TLNGNNSKIIYINDSGLYTLIMNFNLN--------NAIV--FKEYVI-------------  |
AMV262_AMV_9964576        TIIKQ--IYISDT-----KEKYNIYIYVDIK-TKLSYFISNDIL-KILTESTDN-IY-----------------------------KYCEKSD---IFKWINIHN---------------------NIPSNISDETILINKNGLNNIISKLNNE--------KSNH--FRKWLND--IDINIIIKNE  |solo
SCGD3_15_Scoe_7480004     -DVSD---FVYA------ATGARVR-RLTMP-GGSHWFPAADVC-KELGYTTTR-KALL---------------------------DHVPEEHR-DSLETVTG------------SHSLSIPAG-----RKWRRDLQLIDLQGLILLVNACTKP--------ACAP--FKQWVA---EVVETVQREG |Clong but nothing significant
ORF1_Nm_12697190          -MNEI---FNFH------GQEVRTL---T-I-DDEPWFVGKDVA-DILGYSKAR-NAIA---------------------------LHVDEEDA-LK-QGI--------------------------PTSGGTQDMLIINESGLYSLILSSKLP--------QARE--FKRWVTS--EVLPAIRKQ-   |solo
ANT_LcBPA2_6599316        NELQH---FDFK------GRQVRTV---V-V-DNEPMFVGKDIA-EVLGYSKPA-NAVN---------------------------KYVPDKFK-GVTKL---------------------------MTPGGKQDFVVIAEPGLYKLVFKSDMP--------NADE--FTDWVAE--KVLPSIRKHG  | fragment of kilAC
PA2423_Pa_11349554        -QLAP--HYFFRQ-----QRLLRA----LLI-DDQAWFVLDDFA-RLIEHSQPE-QMLA----------------------------RLDDDQARR--ESL-------------------------RSERGEDQAQWLISESGAYAALIYQQRG--------DGGE--LRRWLSG--EVVPELRSAT |N-nothing + Bro + C nothing
PA1153_Pa_11349113        TLLQPS-RFTHH------HRVLRA---VL-L-DEEGWFVLSDLV-RLLGRYLGG-RAPAALCDEAPWPLATAEQRERLFALCHALERHLDTDQWRLAWL---------------------------HDERHGPRQDCLVSESGLYALLWLAAPG--------AARG--LRRWVSG--SVLPRLRSQS
SPy2128_Spy_13623111      NKTE-----TWN------GYTIRF----VEH-QGEWWAVLADIA-KALDL-NPK--FIK---------------------------QRLGDE----------------------VVSNNHV-----TDSLGRQQEMLIVNEFGIYETIFSSRKK--------EAKT--FKLWVFE--TIKQLRQSTG | solo
ORF5_BPRLT_1353522        KELQN---FNFN------NLPVRTV---L-I-NDEPWFVGKDVA-IAIGYKNFR-DALK---------------------------SHVKDKYK-RESRI---------------------------TTPSGVQSVTVISEPGLYQLAGESKLP--------SAEP--FQDWVYE--EVLPTIRSTE \
orf8_BPbIL309_13095813    KELQN---FT--------NGIFNLD--VKVD-GENILFSAEQAA-KAMGITQVK-NGK----------------------------EYV----K---WERVNSYL-----------PNS---------P--EVGKGSFISEPMVYKLAFKANNA--------VSEK--FTDWLAV--EVLPTIRKHG  |
orf9_BPphiPV_9635686      QALQT---FNFK------ELPVRTV----EI-ENEPYFVGKDIA-EILGYARTD-NAIR---------------------------NHVDSEDK-LTHQF---------------------------SASGQNRNMIIINESGLYSLIFDASKQSKNEKIRETARK--FKRWVTS--DVLPAIRKHG  | + kilAC
ORF38_BK5T_14251162       NELQN---FNFN------NLPVRTV---L-I-NDEPWFVGKDVA-IAIGYKNFR-DALK---------------------------SHVKDKYK-RESRI---------------------------TTPSGVQSVTVISEPGLYQLAGESKLP--------SAEP--FQDWVYE--EVLPTIRKHG  |
ORF291_BPLLH_1395130      NEVQI---FENN------GRGISLP--VKEV-GGQVYFEAEAAA-IGLGITT-----------------------------------EVNGDTY-VRWPRINSYL-------------GFATSG------KKIKKGDWITEPQFYKLAFKASND--------VAEK--FQDWVAS--EVLPSIRKHG  |
SPy0980_Spy_13622137      MELQV---FTNEQ-----FGEVRT----ATI-NNQIYFNLNDCC-QILELSNPR-KTIE---------------------------R-LNKDG--VTTSDII-------------------------DSLGRTQQANFINESNFYKLVFQSRKP--------EAEK--FADWVTS--EVLPSIRKH-  |
SAV0855_Sa_14246624       QALQT---FNFE------ELPVRTL---E-V-DGEPYFIGKDVA-DILGYANGR-DALS---------------------------KHVDEDDK-KVLTSRNTTL-----------------------ENLPNRGLTAVNESGLYSLIFSSKLE--------SAKR--FKRWVTS--DVLPAIRKYG /
Z1818_Ec_12514734         NDFTI---FKFG------DSEIRVI----NK-CGEPWFVAKDVC-DALALTNSR-KALT---------------------------ALDDDE-KGVTLSY----------------------------TLGGEQNLSIVSESGMYTLVLRCRDA---VNKGSVPHK--FRKWVTA--EVLPSIRKHG  | + p22ARC
gp30_BPN15_9630500        KALSV---FSFQE-----SHPIRVV---L-V-GGDPWFVALDIC-AALNIANPS-DALR---------------------------K-LDHDEK-LTLGLTEAQ-----------------------KLDRMAREVNVVSESGLYTIILRCRDA---VKQGTTAWR--FRKWVTN--EVLPAIRKNG  \N15 gp30 like BRON
P43_BPAPSE1_9633590       --MTT---LVFR------NTVLET----ISH-NGQIWFTSSVLA-KALQYSSSK-SV-------------------------------TDLYHK--NSDEFADHM--------SKVVDST-------TLGKSRNKTRIFSLRGAHLIAIFSRTP--------VAKE--FRKWVLD--ILDKQTVNQT  /
XF2506_Xf_11362060        QSIIP---FDFH------SHAVRVV---M-R-DGNPWFVATDVC-TALGYRNPS-KAVA---------------------------DHLDDDEK-SNQSLGLA-----------------------------GKPVIIISESGLYALVLRSRKP--------EARK--FSKWVTS--EVLPSIRKTC  \
M_XF2524_Xf_11362500      NAITP---FQFE------SHAVRT---VVDD-HGEVWFVGKDVA-DVLGYTNHN-KALG---------------------------DHCRGVTK-CYPIL---------------------------DSLGRSRETRIISEPDMLRLIVSSKLP--------AAER--FERWVFE--ELLPTLRKTG   |
C_XF2524_11362500         NAITP---FQFE------SKDVRIQ---LDE-ASAPWFNANDVC-AVLEFGNPH-QAIE---------------------------SHVDADDL-QKLEVI--------------------------DALGRTQRANHINESGLYALIMGSTKP--------AAKR--FKRWVTS--EVLPTLRKTG    | Xylella specific fusion
N_XF2524_Xf_11362500      MNAPSEFTLQFE------SHAVRVQ---VDE-AGTPWFNANDIC-TAVELLNPC-AALA---------------------------QHVGARNV--SKRKII-------------------------DTIGRTQRANYLNEPGMLTLLIGSTKE--------AAKR--LRRWLIS--EALPAAAVQK   |
XF0684_Xf_11362477        NAITP---FQFE------SHAVRT---VVDD-HGEVWFVGKDVA-DVLGYTNHN-KALG---------------------------DHCKGVPK-RYPL----------------------------QTPGGIQEIRIISEPDMLRLIVSSKLP--------AAER--FERWVTS--EVLPTIHKT-   |
XF1663_Xf_11362484        NAITP---FHFE------SQAVRT---VVDD-HGEVWFVGKDVA-DVLGYANHN-DALG---------------------------AHCKGVAK-RYPLP---------------------------DSLGRLQYFRIISEPDMFRLIAGSKLP--------AAER--FERWVFE--GVLPTIHKTG   |
XF1645_Xf_11362483        TLPAS---VDFS------DVSLTI----IDH-DGIPYLTAADLA-RALGYKDAS-AVLR---------------------------IYSRHTDE-FTSEM-------------SLTVNLTVKG---FGCGNSEKPVRLFSPRGCHLVAMFARTS--------VAAA--FRRWVLDVLEVLPSIRKTG   |
XF0704_Xf_11362478        TQLPA--AVCFS------GKSLS----IIDR-DGVPHLTAADLA-RALGYKDTS-AVLR---------------------------IYSRHTDE-FTYQM-------------SLVVNLTVKG---FGSGNSEKPVRLFSPRGCHLVAMFARTS--------VAAA--FRRWVLDVLEVLPSIRKTG  /
MSV194_MSV_9631452        MDLDN---LIFN------NKKIHIA----IY-ENKPYFKGKDIA-EILEYKDTN-DAIK---------------------------KHVDDDDK-SKYEDLINR--------PGILP----------SLTYNEKNTIYISESGLYSLILSSKKS--------EAKI--FKKWITN--EVLPNIRKHG  \  + T5orf172
BROE_BMNV_9630956         VKIGK---FKFG------EDTFTLR-YVLGG-EQPVRFVARDIA-NKLKFKNTK-KAIR---------------------------DHVDGKYK-CTFEQACI----------NISKEKHVKQG---NPLYLQTQTILLDKIGVIQLFMRSKMT--------NAAE--LQNWFYE--HVLPQCTARQ   |
orf117_ESV_13242588       DILQT---FVFN------NTRHKVV-ILRDE-NDDPLFKASDIG-KILSIKNIH-TSMI---------------------------D-LHDDDK--AIRTA--------------------------STPGGEQKTVFVTEKGVYKLIMRSRKP--------VAKP--FQDWVF---EVLKTIRKRG    |
BROM_LdNV_9631117         MALTK---VNFV------SGPLEVF-TVQDD-EQENWMAANPFA-ETLKYNNCN-KAIR---------------------------IHVSANNQ-KTLEELNID-----------------KSQ--VLPRNVQAKTKFINMNGVIELLLASQMQ--------QAKE--FRYWMTN--VKFAETSADP   |
BROK_LDNV_9631082         VKIGQ---FRFG------EDAFTLR-YVLAA-EQPVKFVAKDIA-RSLKYEKPA-NAIA---------------------------KHVDDKYK-SAFEQLCF-------------DDLRVKQG---DPLYLHKSTILIDKIGVIQLFMRSKLH--------NAAE--LQNWFYE--RVLPQCTARQ    |
BROI_BmNV_13751084        VKIGE---FKFG------EDTFTLR-YVLDA-EQQVKFVAKDIA-SSLKYVNCK-QAVI---------------------------VNVDNKYK-TTYEQACI----------NISKENRVKQG---DPLYLQSQTILLDKIGVIQLFMRSKMT--------NAAE--LQNWFYE--HVLPQCTARQ   |
Brob_BMNV_9630900         VKIGQ---FKFG------QDEFTLR-YVLGD-EQPVKFVAKDIA-RSLKYVNYE-KAVR---------------------------VHVDVKYK-TTYEQACI----------NISKENRVKHG---DPLYLSPQTILLDKIGVIQLFMRSKMH--------NAAE--LQNWFYE--HVLPQCTASA   |
BROG_LdNV_9631042         THLQH---FEASL---DDGVKFECW-GVVTP-DGKVACKLKEFM-DFLGYKEVN-SAYK----------------------------MIPKEWK-VYWHKLQDDL----------CVDS---------SVDLHPRNVFVYEPGMYAFMTRSGSP--------LAKW--CMGFLYD--VVVPTLKKNQ   |
ORF130_XnNV_9635380       --------------------------------MDKLLYTGHGVA-ESLGYKCPR-RALY---------------------------DHVKPQWR-KTWAEIKKL---------TFFNEAL-------LPSNWQPNTVFITEAGVYALINKSKLA--------GAEI--FREWLFD--TIIPQMRRAK   |
bro_HaNV_12597544         MSLTK---IQFG------DKEVET--YTVDF-NGEKWMVANPFA-EALNYSRAN-KAIL---------------------------EKVSDGNQ-KTFDQIKPYR--------IVHDGTGESSV---IPRNMKPNTKFINRAGVFELIMSSQME--------YARQ--FRYWLSS--VKLNTTVETD   |
201R_CIV_15078913         YMTIT---INGN------EHQIKLA----GI-IEDPYFCGKDVC-TILGYKDKE-QALR---------------------------KRVKSKHK-KSLSELFEKK----LPVVTTGNFFLGTQN---ELSYHEGKSIYINEPGLYNLIMSSEAP--------FAEQ--FQDMVYE--KILPSIRKYG   |
289L_CIV_15079001         YMTIT---FCNQ------EHQIKLA----GT-VDTPYFCGKDVC-KVLGYKDIK-DALK---------------------------KHVDREDK-LPLSEIKKVG-------GTAPPTFLGQTY--AYLSHNDGRAVYISEGGLYSLIMSSEAP--------FAKD--FRRLVCN--VILPSIRKFG   |
MSV023_MSV_9631535        DLIS----------------KINI----ITY-NNCSYYKAKDIA-DILNYKSVD-YFIK---------------------------KYVKNEHK-INYE-----------------------------------STIYVNNSGLYYIMFKSKKH--------EAEK--FQNWIKE--ENLPEIENNK  /
AMV175_AMV_9964489        TFNEI---FNYN------DVKIKVI----GT-INNPWFCGKNIL-KALEYSDDSHNKIL---------------------------NRLDDKFK-DNMYNILSSV-----------RDNLS------MTKNNKNKAIYLNEPGIYYIILHCTKD--------SAKG--FQDFILF--DLLPTIRKRT  \
AMV057_AMV_9964371        NFNNI---FKFN------NISINII----GS-LDNPWFKGKDILIDGLEYTDQSAKCVL---------------------------KRLNTSFK-KSYNDIISVE-------GNLPP-----------TKNNDNKAIYVNEAGLYYIILHCTKD--------SAKG--FQNYILF--DLLPSIRKRA   |
orf6_HaEPV_3510491        --MKS---FKYK------NINIDVL----GD-INYPWFNGKNILIDGLQYTEQSAKCVL---------------------------KRLESKFK-NKLSDIICVG------GNLPPTGNLDKIS--NITRHNDGKAIYINEAGLYYIIIHCTKE--------SAKP--FQDYILF--DLLPSIRKLA   |
AMV177_AMV_9964491        NFNKI---FKFK------DTDIKIN----GT-IDQPWFCLKDIIIYGFGYTKESYKSIL---------------------------KELNNSYK-KSLYDIIVEG---------------GKTP---PTKNNENKAIYVNESGLYYIVFQCTKD--------SAKD--FQKYILD--ELLPSIRKLA   |
BROA_LDNV_9630998         MALSK---VEFV------NGPLEVF-TVQDD-KQENWMAANPFA-ETLKYLNVN-RAIR---------------------------VHVSKHNQ-KTLDELQSD------------RNGL-------ITSSLHPQTKFINRAGVFELISASEMP--------AAKR--FKQWNAN--DLLPSLCREG   |         BROC
BROL_LDNV_9631113         MALSK---VEFV------NGPLEVF-TVQDE-NQEKWMVANPFA-EALGYTRLN-YAVT---------------------------QHVSVVNQ-KTYEEFKSQG-------STATDDS---SL---LPRNIQAKTKFINQAGVFELIGASEMP--------AAKR--FKTWNTN--DLLPTLCAEG    |
BroO_LdNV_9631121         MALTK---VEFV------NGPLEVF-TVQDE-NQEKWMVANPFA-ESLKYAIPH-IAIS---------------------------KFVSTVNQ-KTYEELRSMR---ITSRITSTDDS---SL---LPRNVQAKTKFINRAGVFELISASEMP--------AAKR--FKTWNTN--DLLPTLCAEG    |
BRO_HaNV_12597545         MSLTK---IQFG------DKEVETY--TVDF-NGEKWMVANPFA-EALSYSNVN-RAIR---------------------------VHVSEKNQ-QNYEEFKSDR--------VGLTDSV--TS---LPRNIQAKTKFINRAGVFELINASDMP--------GAKR--FQAWNNN--DLLPSLCQEG    |
ORF109_XnNV_9635359       -------------------------------------MVANPFA-EALNYSNVN-RAIR---------------------------VHVSNQNQ-KCMEELRSDR-------CGLTDDS---SC---LPRNIQAKTKFINRAGVFELINASEMP--------AAKR--FKAWNSN--DLLPTLCTDG   |
ORF159_XnNV_9635409       ARKQK---FLYC------NEELNVI-TQVDE-FGEPWMVANPFA-TVLQYYKPN-DAVR---------------------------KHVSEWNV-KSYEDFRSRR------IGADDSSHWVDE----ITSSLHPKTKFINRAGLFELIQSSRMP--------KAQE--FKNWVNS--DLLPKLCQEG   |
BRO-d_BmNPV_9630955       VKIGQ---FKFG------QDTFTLR-YVLEQGNPQVKFVAKDIA-SSLKYGNCK-DAVS---------------------------RHVDKKYK-YTYSESGARL-------PPSAPNSVAKQG---DPLYLQPHTVLITKSGVIQLIMKSKLP--------YAIE--LQEWLLE--EVIPQVLCT-   |  Essential for lytic infection
Bro-III_BmNPV_13751089    VKIGE---FKFG------EDTFTLR-YVLEQGNQQVKFVAKDIA-ISLKYASYE-KAVR---------------------------VHVDGKYK-STFEHAG-QI-------GHHAPNSVAKQG---DPLYLHPRTVLITKSGVIQLIMKSKLP--------YAIE--LQEWLLE--EVIPQVLCT-   |
ORF2_AcNPV_9627744        VKIGE---FKFG------EDTFNLR-YVLER-DQQVRFVAKDVA-NSLKYTVCD-KAIR---------------------------VHVDNKYK-SLFEQTI-QN-------GGPTSNSVVKRG---DPLYLQPHTVLITKSGVIQLIMKSKLP--------YAIE--LQEWLLE--EVIPQVLCT-   |
AntgemNPV_9799895         VKIGQ---FKFG------EDVFTLR-YVLDR--DIVKFVAKDIA-NSLKHTNAA-EAVR---------------------------NHVDIKYK-TTYEQGE-TV-------SHPASTSLVKRG---DPLYLQPHTVLITKSGVIQLIMKSKLP--------YAVE--LQEWLLE--EVIPQVLCT-   |
ORF153_LdNPV_9631120      VKIGE---FKFG------EDTFTLR-YVLEK-DQQVKFVARDVA-VSLRYERPA-DAVS---------------------------KHVDIKYK-STYAELGRQI-----ADPTLNVKLIVKKG---DPLYLQPHTVLITKSGVIQLIMKSKLP--------YAVE--LQEWLLE--EVIPQVLCT-   |
BRO-c__BmNPV_9630901      VKIGE---FKFG------EDTFTLR-YVLGD-EQPVRFVAKDIA-SSLKYVNCE-RAIR---------------------------VHVDGKYK-STFEHAD-QI-------QHHAPDSVAKQG---DPLYLHPHTVLITKSGVIQLIMKSKLP--------YAIE--LQEWLLE--EVIPQVLCT-   |
BRO-a_BMNV_9630839        VKIGE---FKFG------EDTFTLR-YVLEQGNLQVKFVAKDIA-SSLKYVNCK-QAVI---------------------------VNVDKKYK-TTYSESGSIP-------YTPAPDNVVKQG---DPLYLQPHTVLITKEGVIQLIMKSKLP--------YAVE--LQAWLLE--EVIPQVLCTG   |
bro-a_SpLNV_7672865       VKIGE---FKFG------EDTFSLR-YVLER-DQPLKFVAKDVA-ASLKYQDAK-RAIK---------------------------IHVDDKYR-STFEHGG-QI-------APLVSNALAKQG---DPLYLHPHTVLITKEGVIQLIMKSKLP--------YAVE--LQAWLLE--EVIPQVLCTG   |
BROB_LDNV_9630999         VKIGQ---FKFG------EEEFTLR-YVLER-DQSIKFVAKDVA-ASLKYVDCK-QAVR---------------------------INVDDKYK-FTFEQGCVP--------HTLASDSVAKQG---DPLYLHPHTVLITKEGVIQLIMKSKLP--------YAVE--LQAWLLE--EVIPQVLCTG   |
BROP_LDNV_9631128         VKMGE---FRFG------EDVFRLR-YVL---NDPVKFVAKDVA-GSLKYQDAK-RAIR---------------------------IHVDDKYK-STFEHGE-IR-------SHLASNALAKQG---DPLYLHPHTVLITKEGVIQLIMKSKLP--------YAVE--LQAWLLE--EVIPQVLCTG   |
ORF114_XnNV_9635364       RKQV----ILFQ------NEPVEVVFSDKTGPDGLVYYF-FEVT-PFARLMNVD-NPL----------------------------SKIDSQHV-IVVEEPVTA----------ADTNNW-------AVRNNTRSTTLVSEAGLYQLMFTGKPV--------TVRQGMVRNWLFD--IVLPTVKQFT    |
BROJ_LDNV_9631081         VKIGQ---FKFG------QDTFTLR-YVLGG-EQQVKFVAKDIA-SNLKHANCA-EAVR---------------------------KHVDGKYK-STFEHGE-IR-------SHLASNALAKQG---DPLYLHPHTVLVTKEGVIQLIMKSKLP--------YAVE--LQAWLLE--EVIPQVLCTG   |
ORF13_SeNV_9634234        -KTKR---LQFDD-----QFSFTVD-YIF---NDEVWIAGNKLA-EGLGFREPQ-TAID---------------------------EFVDGKYK-RTINELVFN-----------------------NSVDDTNGLVCVNKHGVLQLIDRLDFK--------NKAE--FTAWIIE--EVYVELENKF   |
38_7kd_HzNV_10442572      LERKR---INFDD-----QFSFTVR-HLTR--NQQMWMIGSDFA-SGIGFDEPE-FVVD---------------------------NYVSNHNK-ICLETLIFG-----------------KRV--EIENDDVKRSMCINRDGCLQLLNHIEFA--------NKSE--FIAWLVT--YAFDKLYSHM   /
MSV195_MSV_9631451        MNLDN---LIFN------NKKIHI---VIDN-NNKVLFKAKNCA-EILKYTNPL-KAIR---------------------------DHVRQKHQ-ISFKNINMN-------------DSF-------ILNNIHPDTIFITESDFYSLISK------------------------------------- | solo
MSV196_MSV_9631450        ---------------------MNI--YVAIF-NNKSYFRAKDCA-SILEFKHTK-DAIR---------------------------HYVSNGNK-IKFKNINIR-----------------------SKKYIHPHTVFINNFGLIELILKHKSI--------VHHN-IIDKLICK-FDLNVDLNITP \ + vsr nuclease
MSV024_MSV_9631534        MDLMQ---------------GI----HVINY-NDNLYFKAIDIA-KLLKHKNIY-RAIK---------------------------YKISDCNK-TLYKNISNT-----------------------NLSYKKNKMVYINKLGLIELIKESTTI--------VSPM-VINGLINKFNLNLDLPIKFI  |
MSV026_MSV_9631533        DIISN----------------IKT----INV-NNCLYFKGEDCA-KILKYKNTY-GAIR---------------------------NNVSKNNK-IKFE---------------------------------KNNDIYINKLGLSELIIKHKSI--------VSTN-TINTLIHNFNLNLDLFEKKK   |
MSV204_MSV_9631444        MEI-----INYN------NNQIHL---LYTT-IGEVYYKGKDIA-KILRYIDTK-KVIR---------------------------NNVLSTNK-VNYSTLIKNV----------SING--------QLHKTPHHTIFINTKGLKNLFDMP------------VKR--LSN--KEINDLIEFLNSHN   |
069L_CIV_15078782         GGLRA--IFNLD------GVTLDTP--IMGT-WDKPVFFGKEIA-EFLGFKKPK-DALQ---------------------------KHVKPKYK-TTLSKVLEKK---------LDTEPV---------SYNEGKRVLLYKEGVVELIKKTRLV--------GIEN--KIDALIE--AFELNLNVVH /
ORF62_XnNV_9635312        IVKKT---FTSD------KKKWELY-NITSC-PYHFYYEAYPIA-KLLCNKHPE-LAIK---------------------------NYVDRSCC-KIYEELKRWFRPYCIFQSVGSPCSPGPNN---QPIHWQSNTLFINKDGIISLINNSTLP--------VAHE--FKRWFLA--QRHDEAEVFK |solo
ORF60like_HzNV_10442560   LVNRK---CKLG----------EVW-ITEIE-ENRFLCSGHGVA-EALGYKCPR-RALY---------------------------DHVKPQWR-KTWAEIKGVL------NQHSLVTSSDSIE---MPLNWQPNTLFITEAGIYALIMRSKLP--------AAEE--FQSWLFE--EVLPELRRTG \
BRO_HzNV_12597590         LVNRK---CKLG----------EVW-ITEIE-ENRFLCSGHGVA-EALGYKCPR-RALY---------------------------DHVKPQWR-KTWAEIKGVL------NQHSLVTSSDSIE---MPLNWQPNTLFITEAGIYALIMRSKLP--------AAEE--FQSWLFE--EVLPELRRTG  |
ORF60_XnNV_9635310        LVNRK---CNMG------GINADIW-LTQME-MDKFLYMGHSIA-KSVGYANPQ-KAIR---------------------------DHVRPEWR-KTWSEIVDGT------NRSPLVTSFNDSH---LPANWQPNTVFITEAGVWALIIKSKLP--------AAEK--FQKWLFE--EVLPELRRTG  |
ORF131_XnNV_9635381       -----------------------------ME-MDKFLYMGHSIA-KSVGYANPQ-KAIR---------------------------DHVRPEWR-KTWSEIVDGT------NRSPLVTSFNDSH---LPANWQPNTVFITEAGVWALIIKSKLP--------AAEK--FQKWLFE--EVLPELRRTG  | BRON + C synapomorphy shared by this group
BROD_LdNV_9631039         MALQR---FEFPMSADEDESKFECW-GIVMP-DGSVAVKLKELA-EFLNYEDVK-KAYK----------------------------LVPDEWK-ITWNILQNKL-------EPSRPHLVAPST---TPANWQPETLFVLEPGVYALMARSTKP--------MAKE--KMKYVYE--TILPTIRKTG  |
BROI_LdNV_9631080         MALQR---FVFPMSADEDGAKFECW-GVVMP-DGDVAVKLKELA-LFLGYADVK-MSYK----------------------------HVPDEWK-ITWKNLQNKL-------ASKRHQLVAPPT---TPANWHPETLFVLEPGVYALLARSNKP--------LAKE--RMKFVYE--TILPTIRKTG  |
BROC_LdNV_9631038         MALQR---FEFPMSADEDESKFECW-GVVMP-DGSVAVKLKELA-LFLGYADVK-MSYK----------------------------LIPEEWK-ITWKNLQNKL-------ASKRHQLVAPPT---TPANWHPETLFVLEPGVYALMARSTKP--------MAKE--KMKFVYE--TILPTIRKTG /
CnBV__13160526            FQLQN---WDVD------DKSVVLRLYIHPI-TNEPWVVAADLA-RCLGYEKYR-QTHT----------------------------RILAAFKRKLSDLVHTEP-----FSGTVESEVARLEGAPVELSSRERDIVVVNEGGIHQMLIGSRLP--------NVQK--YKELVFG--KILPAARARG  \BRON duplication
PxORF82_PxNV_11068085     VGCSV--GILFD----------KLH--YIVI-DGVVWFKLNQIC-KYFD--------IP---------------------------KQCPD-YNIITWYTLSKRL----------------KSN-----ITWKLNTIMISDMGVYKLLIIKNEI--------IAEE--FYH------KRLHELRSTG  /
ORF99_CpGV_14602336       ESVDSVCGVL--------PSNIEF----FSV-NERTYFKGLDVA-RHLKCSPS--YTIN---------------------------KYVADTDM-VLWGDLRRYV-----------HDKYVWTN---CKNHWKDNTIFLKETGVKQLCIATQGD-------DKLYQ-EMMDGVYNYDSGDEQVVYAK   |bro duplication
SPy2127_Spy__13623110     QVITT---TNFH------GQPLDIY----GD-IQEPLFLARAVA-EMIDYTKTS-QGYY---------------------DVQAMLRKVDEDEK---LKGMAL--------------EGTTKN------FRSGQKVWFLTEHGLYEVLMRSNKP--------KAKE--FRKAVK---NILKEIRLNG  | SinR like HTH
p63_BPMx8_15320633        PTPEMPKPFLFEGS----TRIRVVVDE-----AGEPWFVAQDIA-HALEYRMAS-D-LT---------------------------RLLKPHHL-RTHAV---------------------------RTNRGERSATIISEPAMYRAVFLSKSK--------KAEP--FQEWVTS--DVLRSIRKTG  |p63C
consensus/85%             ........h............hp.............bh..pshh...L.h.p....sh..............................l..p.b........................................p..hlsc.Ghh.lh..sp...........s....hb.hh.....hls.h....












12. T5orf172
------------------------

PHD Sec. Str.             ------EE---------HHHHHHHH----------EEEEEEEE---------HHHHHHHHHHHH-------------------------HHHHHHHHHHHH
orf172_BPT5__93750        PAWKNQYKIGMSQN---PKERLAQYQTYSPY-RD--YKLEHWS-FWF---DKRKGEKLIHQYFKDLK--------------EHEWFSINSRDLSKYLERINSSSD  \
yeeC_Bs_7474985           SSIKNLYKIGFTTG--SVENRIRNAENQSTYLYAPVEIVTTYQVFNM---NASKFETAIHHALENNNLDVSILGANGKMLVPKEWFVVTLEDLQAVIDEIVMMVH   |
orf240SM63E2_14194257     MRSAKRYKIGKSNS---PSRRYREVRLDLP---DA-TILVHTI-PTD---DPSGIEAYWHRRFADKRV------------RDTEFFNLTASDVTAFKRRKYQ---   |solos or with insignificant extensions
CIV460R_CIV_15079171      YEPLDIYKIGCTKD---INRRLKTMNASRI-SFDK-FFIVNQI-QTF---HYFKLEQGLHKLLKKYRL-------------NNEFFQCNVNIIEKAISDYANNNV   |
BROF_LdNV_9631041         YRDRRIYKIGRTAS---PADRLCALNTGRA-DDF--LYFEHVS-PDLGHEASVRVERLMHDSLAPLR-------------MHGDSFN------------------   |
NMB1170_Nm_11345564       TVIKGVYKIGISDV-SNFEGRMRHLENNGYANVAG-LERILAV-KTD---NYKEKENLLHEIFSKSRI------------GDTELFAVDENLVKRLFLSLRGEIV   |
ORF1_BPP27_8346568        SFGENVYKVGMTRR-LEPMDRVKELGDASV-PFD--FDVHAMI-SCD---DAPALEKALHDYLERYRV--------NKVNLRKEFFRVELEKIIEVVKHHHGNIE  /
CIV315L_CIV_15079027      LQVHNVFKIGYTKN---FEERLKTFNDYRH-SLEPQFFAVAIY-DTD---NAKKLETTIHKKLKDFRS-------------EGEFFQVELSVIKEAFLKEDCCLK   | + kilAN
AMV209_AMV_9964523        YASINNFKVGKTDN---LSSRQSNFNSSHI-DQDE-FYICFYQ-KVY---NMSKTENLIHDLLEDFR-----------DKKRKEIFIIHYTYLLDIINLVIKNIN  \
AMV207_AMV_9964521        YAMINNFKVGKTDN---LSSRQSNFNSSHN-TEDE-FYICYYE-KVF---NISKTENLIHDLLDNFR-----------DKKRKEIFVIHYKYLLDMVNLVIKNIN   |
MSV198_MSV_9631448        YAKLNTFKIGKTDN--LISKRQSQLNNSHT-SFDK-IYICYYE-AVY---NPNKVEQIIHDVLESFR-----------DSSNNEFFILHYKYLLNIVKLIIKNIN   | +MSV199
AMV194_AMV_9964508        YAAQNRFKIGGVENNNLIKPRLSTYNSRSA-EGDE-WYYTYIK-NIN---NYKHFENRFWSVMSSFR-----------DKKDKEIIVLYYNDLINIFNFISENYN   |
CIV420R_CIV_15079131      YAAQHRFKVGGVEGRRRLRGRLSDYNGRSA-SGDE-WYFCHLI-DVA---DFRKAEGRIEDIIGKFR-----------DKKDKEIYIMPYRKLLKVIELICQNYT   |
MSV021_MSV_9631537        YKEKNIYKIGYTND---VVGKLVKMNSNRL-KFEQ-FYYVKIY-KVN---NIFSIQNYIYKKLYPYI-------------LNYPYLNCD----LNVITNAMENID  /
orf117_ESV_13242588       EDNSVLVKIGSTKN---IRARTTGLVNEFG------SMAIFRIFECD---RYEEFEKSLHKHNDIKRY---RFKKPINGKRSMEVFNMTKEELQRAVNIAGSNVC \
BROM_LdNV_9631117         LQTVDAYKIGYTHD---LHDRIAELNVASP--LD--FKPVFVY-DTA---TPRRLEQQLHNYFLDKR-------------IKREFYKLDKEDLLMLPVVCNKLCA  |
ORF59_HaNV_12597544       LQMIDAYKIGYTFD---LTARLNELNVASP--LD--FKSVFVR-ESS---NPYDLEQKLHRHFHESRI-------------KREFFKLTEEDLALLPLICDNLLA  |
CIV289L_CIV_15079001      YQQQHKFKVGGVQTFDLLKSRLTQYNSGES-DSEA-HFFIYIR-KTV---NYRSIEHAIKGLLSGFR-----------ENQSNELYIMHYDWLVKFVDAIMDGNA  |
CIV201R_CIV_15078913      YQQHHKFKVGGVQSFKDLKSRLTQYNSGES-NSEA-HFFIYVR-KTV---SYRSIEHIIKGLLSGFR-----------ENQSNELYIMHCDWLVKFLDAIMDGNA  |
MSV194_MSV_9631452        DLSKNIFKIGKTNI-NSIKNRLSTYNTGAS---DP-YYYVFYK-EVY---DATKIEKDFNTLMNRYN---INVTSPNKTKLNNELYKLYYLDLEYVLNAVIDSND  |
MSV023_MSV_9631535        DLSNNIFKIGKTNI-NSIKNRLSTYNTGAS---DP-YYYVFYK-NVY---DGNKIEKEFNYLMNRYN------VLLNNNKINVELYKLYFPDLEYVLNAVIDSND  |  +BRON
BROB_BmNV__9630900        YAERNLFKIGQTTN---LTRRLATLNCGRA-DDDQMQYVLQTE-PTV---HHTLLEKLMKQELRPYR-------------NSGEVYCTDFEHIKRALESCLPHCS  |
BROI_BmNV_13751084        YAERNLFKIGQTTN---LTRRLATLNCGRA-DDDQMQYVLQTE-PTV---HHTLLEKLMKQELRPYR-------------NSGEVYCTDFEHIKRALESCLPRCS  |
BROK_LdNV_9631082         YAERNLFKIGQTTN---LTRRLAALNCGRA-DDDQMRYVLQTE-PTV---HHTLLEKLMKQELRPYR-------------NSGEVYCTDFEHIKRALESCLRRCS  |
BROE_BmNV_9630956         -AERNLFKIGQTTN---LTRRLVSLNCGRA-DDDQMRYVLQTE-PTV---HHTLLEKLMKQELRPYR-------------NSGEVYCTDFEHIKRALETCLPHCS  |
ORF130_XnGV_9635380       YKSKHIYKIGTSRS---PAKRVRQLNCGRP-YDL--LILDHCQAAAD---QGFIVEALMLNEYKTQQ-------------LHGEWVQFADNKQYQSAKKKLDEFI  |
BROG_LdNV_9631042         NRERNLYRIGRTAS---PTALLCFLNEDRH-EDR--FYLDYVS-PDVSREGSVRAERMIREHIESLQ-------------THGDFYQFATKEALDLMREAIVKIQ /
consensus/90%             ....p.aKlG.s........R...bps..........bh..h...s....p...hE..b...b...p..............p.-hb.h.......h...h.....

http://www.bmm.icnet.uk/servers/3dpssm/output/a47ba25a21f354b7.job_summary.html







13. MSV199
----------------
PHD Sec. Str.                HHHHHH------HHHHHHHHHHHHHHHHHH------------EEEHHHHHHHH------------------HHH------------------------------------------------HHHHHHHHHHH----EEEEE----------HHHHHHHHHHH---------EEEEE---HHHHHHHHH----HHHHHHHHHHHHHHH-
CIV146R_CIV_15078859      LIDIFIEEEQN-----------FGTILNE--MTCQ-------HKIYISKKLLKWIGYEG--------------DYKK------------------------------------------------QRDSFKKLLKRHNIDFEELKSNDIECEN--YPEIKVDMANL-SNGVISQSKWLILNIYNFKYI------------------------ |+UVRC
MSV199_MSV_9631447        MLNIFEFIEQN-NFEINLG-SWFNEIWLP--LFNK-------TELLITLNILHFIHYGTSKSVLDGNTT---LNYRE------------------------------------------------LKRDFEKILNNNKIKYKKIKYEEIVNNKNYYELVKNEIKNI-TPNNLNKSTWFILDVLQFKMLIMRLSTNVAKEICEYYVTLENILH |solo
MSV198_MSV_9631448        MLNIFEFIEQN-NFDIKLG-PWFNEIWIP--LFNE-------TELLITLNILNFIHYGTSNVVLDYHPM---RNNTN------------------------------------------------LKRDFEKILNNNKINYKKIKYNDIINNEDYYNKVKEEIENI-RPCNLEKSTWFILSVDEFKMLIMRLSTNVAIEVREYFILLEKILF  \
AMV209_AMV_9964523        FVDIFTFITNN-DYDFKLG-SWFKDIWYP--LFEE-------KDVLITNDILTFIYYFPEG---SQPPP---EMFKG------------------------------------------------YKKNLIDSLNNYNIKFIEIDYKHEYVLT--NKKLKNEIKFI-TPNNILRKRWIILSVENFKLLIMRLNTKSAHYIREYYLFIEGLLY   |
AMV194_AMV_9964508        FVDIFTFIKNN-NYEFKLG-EWFIDIWYP--LFER-------KDVLITNKILYFIHYGISGG-DTHPPL---EKYRL------------------------------------------------MRKDLEKILKNYNINYIKIKYYKNIDID--YNFLIDEIKNI-TPNNIIQKTWIKLSVKNFKKLILKIRTAIADDIRDYYITLEEILY   |
AMV207_AMV_9964521        LMDVSTFITYN-NYDIELG-SWFKDIWFP--LFNK-------KNVVITNEILNFIYNFQVGKCFPTYNL---DNYIQ------------------------------------------------YKKDYRSFLKKNNIEYNIIKYDENILNK--YNILKSELKLY-DKHALVQKTWLILSVDDFKESIMMMNNNNSKMIRKYYIKIEKILF   | +T5orf172
MSV021_MSV_9631537        VNNIFSIQNY---IYKKLYPYILNYPYLNCDLNVITN-----AMENIDKSLLSNNLY---------------TEYQN------------------------------------------------LKDNFETILTTNRIKFKKLKYHEMSDEN--REMLNSEVIKL-SMSELANTTWCILKTSDFKNLILQINTLPVEEIREYYLLIEKILL   |
MSV191_MSV_9631453        MEHIYEYIENKQNENIIMN-PWIKDICLP--MYNK-------SNVLITSSILKFLYFGPKIPINDSPGYIYVDEYKKNEIYAIYYSKDNIEFPCKIKINVDDMVLVKNYLCYKLSEYKYGDSGELFKCDFDIILRAMEIPYN-----NELLTKNLEKVLIEKNITY-SKSEYNDSFRLIVHIDQFKLLINKLN---IDILSKPYEAVEKIIQ   |
CIV420R_CIV_15079131      MTDLFTYIKDK-NIAIDLNSKWFQELWYP--LSKK-------TGSIITTRLLEWMGYSG--------------EYKL------------------------------------------------QRQNFKRLLDNNNIPYEEIYHNDDRFLE--HPSMIYEIEQT-DKKQIKQKRWITLEMRNFKKAILRLNTKNAEVIRDYYLNLEEACF  /
CIV468L_CIV_15079179      LLDIFKFIEIT-NFDLD--PIMTNWFWQV--MVNN-------HSTHLGRVVLEWFGYEG--------------EDSN------------------------------------------------QKQKFIDMLKRNKIPYKQLKHTDNEIEL--YPSIKEEMTLLPHKGAIASSKWLVMEPFNIKMAMLRLNTKNADIIKRYYIKMEELIR   \
CIV238R_CIV_15078950      ILD-----SAMNESKIKLDISWFFDNYMDQELTNVMNYFDGEEPIHINTVVLEWFGYEG--------------DLRT------------------------------------------------QKRKFIDMLKRNSIPYKELTSKE-EIEL--YPTIKEEILSLPHKGAIACSKWLVMKPYDIKIAMLRLNTKNSQIIKQYYIKMEELVR    |
CIV212L_CIV_15078924      LMDLETFIDTT-GFEKD--PIMNDYFWQI--MVTK-------QRTHLSAMLLQCLGYEG--------------EFRV------------------------------------------------QQQHFKRFLKSNNIHPLELTSSDPDIKN--YPTIQDEMKLL-KPNVISNRKWLIVEPREFKKVIMKLNTKHGDRIREYYLCLEEL--    |
CIV388R_CIV_15079099      YLEIETFMDVI-GFVKD--PVMTDYFWHI--MVDN-------HCRHLATVLLECLGYEG--------------TYNK------------------------------------------------QQYAIKRFLKSNRINYSELSSDDPQIDL--YPTIKEEMKNM-KPNAIACRKWLIIEPREFKKVIMKLNTKNGDNIREYYIRLEELIK    |  +BROC
CIV019R_CIV_15078732      KLEINEFIDLFIG---------EENKWNK--MFDSDL-----SGIHISSLILNQLGYEG--------------EFKN------------------------------------------------QQTCFKRFLKRNNIIIQEFSSSNPELKL--YPSIQEEMKNM-KTNVIANRKWLIANPRDLKKIIMKLNTKNGDAIREYYICMDELVQ    |
CIV211L_CIV_15078923      LLDIPSFMKVA-GIEFD--PIMFNHFWQV--LVDNGD-----RLPHVGETTLNWLGYEG--------------VFTK------------------------------------------------QKEKFINMLKRNQISFKELSYQDNEIQL--YPSIQKEMLLLPNESAKTKSKWLLMNPDDFKMAIMGLKTKNSEKIKRYYVTLEKTMK    |
CIV148R_CIV_15078861      IVDIIKFVEIT-NFDID--PFMIDKFWHT--MYDN-------SLLYISRDILEWMGYTG--------------EFGE------------------------------------------------QRKAFKKLLKRKNINFTELSNNDPTKHL--YPEIQKDSLLL-SNAVVSQSKWIIMNSDDFKDSILMLNTKNSGKIRKYYRSFEKLLK   /
consensus/85%             h.pl.phbp....b.b.....h...ba....h.pp.......p...ls..lLphh.Y................php.................................................bbpphbphLpp.pI.a.blp.pc.......b..lbp-hb.h.p.s.l.pppWhlhp..phKbhhhblps..s..lpcYY..hEphh.

http://www.bmm.icnet.uk/servers/3dpssm/output/e8c2fb22ee6b2ed4.job_summary.html

14. CIV029R / BROC
----------------------------------------

PHD Sec. Str.             -----------------------HHHHHHHHHHHHHHHHHHHHHH---------------------HHHHHHHHHHHHHHHHHHHHH--------------HHH----------------------EEEEEE-----------EEEEE---------------------EEEEE--------HHHHHHHHHHHH----
029R_CIV_15078742         -------------------------------------------------------------------------------*MVERLGI--------------AVED-------RSPK-LRKQAIRERFVLFKKNTERVE--KYEYYAIRGQSIYINGRLSKLQSERYPKMIILLDIFCQPNPRNLFLRFKERIDGKSEW \
ORF17_DpAV4_11931709      ------------------------------------------------------------------MGVQLDETNEQLNEMNNKLDV--------------AVED-------RAPI-PEDQSKVERFVFLKRPNE-----NYPYYAIRAQAASTKTAIRK-QQKEFGAIELLLDFETHPNTKTYYNRIKWR*------  | solos
ORF116_OpNV_9630054       ---------------------------------------------------------------------MLEDKDRRIQELYASLLE---------------MSE-------RAVQYPAKGHQTP-MLCVARE-------FNCLRAITGQKVHVTKMKREL-TD---AAELVIDA-MRPNPQVDLNNFVNRV*-----  |
ORF103_EpNV_15213228      --MSPPDLTPMEKLLE-----SIENQIKIK-DEQLRKNNEMLERYIML----------------------LEEKNKRIEELYRSLME---------------MTD-------RVVQYPAKSYQTP-MLCMTRE-------FNCLRAITGQKVHVNKMKRDL-TT---AAEIIIDS-VRPNPQVDFNNIVNYVESEFKE  |
ORF3_DpAV4_11931724       GHGDHPQSQHCIG-------YEPETGGEGIGGGEIREQRRLLNRLA-------------------QMGIQLNETNEQLNEMNNKLDV--------------ACED-------RAPI-PDDCSKVERFVFLKRRTS-----DYPYYAIRAKRRARRRPIRK-QQNEFGVINILLDFETHPNTKTYYNRINWALNKRGVK  |
ORF122_LdNV_9631089       TDSGYDEDYEEEEDEE----QNAILAHLRATNASIREIQQKLQTLEKI---------GGILNRADADADADDDLSFL------DEPD--------------VEPD-KPPVGATVKF-PRDATKHPWLTVLAKEVRREGAVATEIAFATSRAA-ASARKRKYS-----DMSLIYQG-VHPNPQLAVCCITEEWQERGLS  |
P20_LsNV_2760643          FFVNTKYFVDMEAH-------IETQQYLIK-----SIADKDVIIQHK-------------DAQIAELLNAILLANSQCMSLSKRLVD--------------IVQD-------VVVKPQNCQLLHA-LAVCELS-------CNKFAFLRTQLRSLKRSIKRLQRAEQHEPTIIYQSEYVPNSINILNKIKEQLPKDKFT /
ORF10_EpNV_15213135       LAIKTDKGYDCDD-------VRDNIKTVLKHIKTLNVNSDKFINAHKLFENQVCARFEQLEQRLETLERVPDA--------PTMP---------------------------GVIF-PRDVNKHQHLAVFVNQERG----NTQIGFARGQEEYFRKRKLEFEEE---DMHKMLET-VHPNPQMAVQCIKDRFISNGYK \
ORF12_OpNV_9629950        LAVGANKDHDRDN---LLDKIEAVLNHVKTLNTNSDKFISAHKSFKLEVGARFE-QFEQRLQTLDTKLNALQCA----APTRTAP---------------------------GVVF-PRDVTKHPHLAVFMGRVEDRG--VTQIAFARGQEEHFRKRKLEFEE----GMDVDVRG-RAPNPLLAVHCIKEEFANGGHK  |
ORF13_ACNV_9627755        --LHIQTEGERDDLRDKIESVLKHVKKLNANSEKFMVTHETFKNEVGN-------RFEQFELRLHELDAKLNML-QSAEKLKTAVVAE-------------SKNG-------TVTF-PRDITKHQHLAVFSERIDD----RIKLAFVLGQERHFRKRKMRFED----DMEVLYDG-VHPNPLLAIQCINEKLYDKHYK  | solos more closely related to the ones with BRON; possibly truncated, or possibly an ancestral solo
orf13_BmNV_9630821        --LHLQTEGERDDLRDKIESVLKHVKKLNTNSEKFMVTHETFKNDVGN-------RFEQFELRLNELDAKLNML-QSAEKLKTAIVTE-------------SKNG-------TVTF-PRDITKHQHLAIFSERIDD----RIKLAFVLGQERHFRKRKMRFED----DMEVLYDG-VHPNPLLAIQCINEKLYDKHYK  |
ORF13H_MbNV_5565846       ----------------------QVIEKFDAFDRRVAELNDKMNMYEN----------VDDLYRRLREHHRTLERPQHMSF--LSSSNTIN-----------DDHDQRCIRFDTVRF-PRDTSKHPRLSVFVKPVEEG---GTKVAFVAGQQRRICALKRKYS-----DMEMIYDS-VHPNPQLAMQCINEELDLKNLD /
BROA_BmNV_9630998         DDIIVEKDKIIVAK-------TEQNQQLAS---ALQEANQNLIEANKG---------------LMTAFNMINDARKETAQLANRMAD--------------IAQD-------VITKPSDPRLCHS-LAVCSLG-------GDQYAFLRPQKRNLKRSLDRLSVD---NREIVYKSEYVPNAMNVLNKVKESLPRDKFK \
BROL_LdNV_9631113         KEIICKKDEIIAVK-------EDENKKLTI---SLQETNQNLIIANKG--------LLQAFEIINEARKDSENARKETAQLANRMAD--------------IAQD-------VITKPSDPRLCHS-LAVCSLG-------GDQYAFLRPQKRNMKRSLDRLSVD---SREIVYKSEYVPNAMNVLNKVKENLPRDKFK  |
ORF109_XnGV_9635359       KQKIVEKDTIIAVK-------DEENKKLTV---ALQDANQNLIEANKG---------------LLQAFNIINEARKETAQLANRMAD--------------IAQD-------VIAKPSDPQLLHS-LAVCAMG-------GDQYAFVRPQKRSL----DRLSVD---EKDIVYRSDYVPNAMNVLNKVKEALPKEKYK  |
ORF60_HaNV_12597545       KLMLSHKDELLAVK-------DKENEALTV---ALQNANHNLAVANQG---------------LLKAFDVVNDARKETAEIAKRMAD--------------IAQD-------VIAKPSDPQLLHS-LAVCSMG-------GDQYAFLRPQKRSLKRSLDRLSVD---EKDIVYKSDYVPNSMNVLNKVKERLPKEKYK  |
BROP_LdNV_9631128         IAEESILRNEIVAK-------TEENKQLAT---ALIEANGKIILFAGA--------LVEANAGLLLANKNLHDANQTIGQMANRMAD--------------IAQD-------VIAKPSNPNLCHS-LAVCALG-------GDQYAFLRPQKRNMKRSLDRLSVD---NREIVFKREYVPNAINVLNKVKESLPRDKFK  |
BroO_LdNV_9631121         AVHVATNEGREAPW-------MKDLEEFKV---VLAEKDRKIDKLTNA--------LIQSNEKNNTLTQALIAVTERTDKLANRIID--------------LAQD-------VVTKPSNPNLCHS-LAVCALG-------GDQYAFLRPQKRNMKRSLDRLSVD---NREIVFKSEYVPNAMNVLNKVKENLPRDKFK  |
ORF159_XnGV_9635409       TVALQESNQKLVIT-------TEKLTDANE---KLTETNNKLVTLATA--------LVSANEGLIKANTMLNDARVETAQLANRMAD--------------VAQD-------VIAKPSDPQLLHS-LAVCSMG-------GDQYAFLRPQKRSLKRSLNRLSVD---DSQILFKSDYVPNSMNVLNKVKENLPKDKFK  |
38_7_HaNV_12597608        RADHHSANENMHK---------SILGKVGDIENRLSELDHKISAIEK----------IDVLYNHLKNYHRLQTNNSN------DTALY-------------SEED---NFVNGFRL-PRDSSKHPHLGVLVRSVDQH---NTEIEFLTGQRNYYQTRKRKLK-----SGDLIYDA-VHPNPQVAVHRFNEELDMKNLS  |
BROA_BmNV_9630839         ------PAVKMDTN-YGVI--EELNKKLAFASESLAEANEKIIHFANA--------LVTANAGLVQANTMLNEARRETAQLANRMAD--------------IAQD-------VIAKPNNPQLLHS-LAVCALG-------GEKYAFLRAQKRSLNRSIKRLG-----SSDVVFSSDYVPNAMNVLNKVKETLPRNQYK  |
BROA_SlNV_7672865         ------PAVEMDAN-YGAI--EELNKKLTFASESLAKANEKIIHFANALVTANT-GLVQANAMLNEARKDCENARRETAQLANRMAD--------------IAQD-------VIAKPDNPQLLHS-LAVCALG-------GEEYAFLRAQKRSLNRSIKRLG-----SSDVVFSSDYVPNAMNVLNKVKETLPRNQYK  |  + BRON
BROC_BmNV_9630901         ------PAVEMDTN-DVIAKIDDLTQKLTVANADLAEANRSLILFANE--------MIVARRDAETARQDCENARRETAQLANRMAD--------------IAQD-------VIAKPSNPQLCHS-LAVCDVG-------NNEFAFLRPQKRSLGRSLKRLG-----SNDVIFSSDYVPNSMNVLNKVKEAIPRNKFK  |
BROII_BmNV_13751087       ------PAVEMDTN-NDIAKIDDLTQKLTVANADLAEANRSLILFANE--------MIVARRDAETARKDCENARRETAQLANRMAD--------------IAQD-------VIAKPSNPQLCHS-LAVCDVG-------NNEFAFLRPQKRSLGRSLKRLG-----SNDVIFSSDYVPNSMNVLNKVKEAIPRNKFK  |
BROB_LdNV_9630999         ------PAVKMDTS-GALVKIDDLTAKLTEANANLMEANKSLIVFANEMIVARR-DAETARQDCEAARQDCEAARRETAQLANRMAD--------------IAQD-------VIAKPADPRLRHT-LAVCEIG-------QNEYAFLRPQKRNFRQSLNRLSVD---DRNVVFKSEYVPNAMNVLNKVKESLPRDKFK  |
BRON_LdNV_9631120         ELVKKQEFIERIVA-------IKDKQIEAK-DLQVTRVMTDLNRMYTGFQETMQR----KDEIMQQKDAQVTELV-------AKVVD---------------LSE-------RAVQYPADERKHP-VLCVARD-------GTTFMAIAGQKSYVRSQKHKRNID---AASVVAEA-TRPNPTVDWNNATHRLPAKKTK  |
BRO_AcNV_9627744          ELVKKQEFIERIVA-------IKDKQIEAK-DLQVTRVMTDLNRMYTGFQETMQK----KDEIMQKKDAQVTDLV-------AKVVD---------------LSD-------RAVQYPADKRKHP-VLCVTRD-------GTTFTAITGQKTYVENQKHKRNIN---VANIVVEN-IRPNPTVDWNNATDRLQAKRSK  |
ORF2_AcNV_93042           ELVKKQEFIERIVA-------IKDKQIEAK-DLQVTRVMTDLNRMYTGFQETMQK----KDEIMQKKDAQVTDLV-------AKVVD---------------LSD-------RAVQYPADKRKHP-VLCVTRD-------GTTFTAITGQLTYVEMQLHLRMIM---VANIVVEN-IRPNPTVDWNNATDRLQAKRSK  |
BROD_BmNV_9630955         ELFKKQEFIERIIA-------IKDKQIEAK-DLQVTRVMTDLNRMYTGFQETMQR----KDEMMHKKDELLQVKDTQVSNLIAKMID---------------LSD-------RAVQYPADKRKHP-VLCVTRD-------GTTFTAITGQKTYVESQKHKRNID---AANIVVEN-IRPNPTVDWNNATDRLQSKRSK  |
BROIII_BmNV_13751089      ELFKKQEFIERIIA-------IKDKQIEAK-DLQVTRVMTDLNRMYTGFQETMQR----KDEMMHKKDELLQVKDTQVSNLIAKMID---------------LSD-------RAVQYPADKRKHP-VLCVTRD-------GTTFTAITGQKTYVESQKHKRNID---AANIVVEN-IRPNPTVDWNNATDRLQSKRSK  |
BROJ_LdNV_9631081         -----EQFQETMQK-----KDEQFKETIQKKDEQFKETIQKKDEQFQE----------IIQKKDAQLQETIQRKDEQIARLIDAAMD---------------LSS-------RAVQYPADERKHP-VLCVARD-------GTTFHGIAGQRRYVQSQKRKLGVK---DDDLVLET-RRPNPALDWTNATHTTSAVKRSK |
ORF114_XnGV_9635364       -VYRERELESKTNQ------LANKEKQLKNALSLIEFKENQLSEVISL-------TQKKDIQLEQQFTMLSSLMGKHIKKIE--ISD---------------SDD------------ELPQNHDT-VLMIVREN------NTTFKGIAAKRRYVDQQKQKLRYH---ESMIVVHS-KRPDPKRDWNAAMDIVVELGVK  |
AMV175_AMV_9964489        RKRTQKKYIDIINN------KQDKIDILSIKLDNISKQNNELLTQNQ-----------LALNKLQELGINLIETKEEIKDVKDKLNV--------------VIED-------RNVK-PKEVKLQHKYLLLKNKII-----NNEYKFIRAQDQYIKTNKSNWLE----KHNVIIDEKYNPNPIDMCSRLKSKIYELDKI  |
AMV177_AMV_9964491        ------QKCKIDELFN---QNKKIISQNNELINKTEYQNNEILKLNKQ--------NQLALNKLQELGINLIETKEEIKDVKDKLNV--------------VIED-------RNVK-PKEVKLQHKYLLLKNKII-----NNEYKFIRAQDQYIKTNKSNWLE----KHNVIIDEKYNPNPIDMCSRLKSKIYELDKI  |
AMV057_AMV_9964371        MDIISNQKDKIDD-------LFKKIDNQSLEINNISKQNNELLTQNQ-----------LALNKLQELGINLIETKEEIKDVKDKLNV--------------VIED-------RNVK-PKEVKLQHKYLLLKNKII-----NNEYKFIRAQDQYIKTNKSNWLE----KHNVIIDEKYNPNPIDMCSRLKSKIYELDKI  |
orf6_HaEV_3510491         LDIINNKQDKID-------ILTQDLEEIKNQNILTIEQNNKLLQQNQ-----------LALNKLQELGINLIESKEEIKSINNRIDT--------------IIVD-------RNIK-PSNPKLHHKYLLLKNKN------KNEYKFIRAQDKYIKNNKSLWLE----KYNTIIEEKYNPNPIDLCSRLKDKIAKLNPQ  |
ORF13_SeNV_9634234        ----------------------QVIERFEWFNVQISELNKKMSTLNN----------VDELYRRLQDYHKSNNINTTMFSNNASSSTALSSSSSYENNLMGGIVDNEHTRYETVRF-PRDTSKHPRLSVFVKPSEE----GTDIAFITAQQRRHNALKRKFN-----DMEMIYDS-VHPNPQLAMHCINEELDIKQFN  |
38_7l_HzNV_10442572       RADHHSANENMHK---------SILGKVGDIENRLSELDHKISAIEK----------IDVLYNHLKNYHRLQTNNSN------DTALY-------------SEED---NFVNGFRL-PRDSSKHPHLGVLVRSVDQH---NTEIEFLTGQRNYYQTRKRKLK-----SGDLIYDA-VHPNPQVAVHRFNEELDMKNLS /
468L_CIV_15079179         MKITNHRQENMLIE------SHNMLRSMGVEIKDIRHENNDLLDQNN------------------ELLERVDDVLQKVNTVQKKLDI--------------SVED-------RAPQ-PDKNTRRERFLLLKRNND-----TFPYYTIRAQEINARKALKR-QRNMYTDVTVLLDIVCHPNTKTFYVRIKDDLKSKGVE \  + MSV199
238R_CIV_15078950         QEEERKLDRLMLTE------SRNMLQTMGIEIKTVKYNNNNLIDQNN------------------ELLERVDEVLHKVDVVQTKLNI--------------SVED-------RAPQ-PDKNKRRERFLLLKRNDE-----NYPYYTIRAQDINAKKALKR-QKDMFSDVTILLDLICHPNTKTFYVRIKDDLKKKGVE  |
211L_CIV_15078923         RQYMQKMGITLED-------TREEVKKVNIQNKDIKAQNEEIKAQN-------------------------EDLAFDLSDVRDRLIE--------------AAED-------RSPK-LETKPLRERFVIIKRKDS-----SFPYYAIRGQDVYVKGRLTHFKNTRYPELKIIFDTNYQPNPRNLYIRFKELKDERFII  |
148R_CIV_15078861         RLERKQSEERSIK-------QEQLLLSIGYNLKELQEQKEEDTQKID-----------VLIDQNEDLKQNIEETNDKLDSVVEKLGI--------------AVED-------RAPR-LKRASIRERFVLFKKNNSTNE--IYQYYAIRGQSVYVNGRLSKLQSEKYPDMIILIDIICQPNPRNLFLRFKERIDGKPEW  |
388R_CIV_15079099         --EYSLYFKEREAQ-----------IEKQKSQFHIETLEKKLDEMK-----------LEAEKRHDELLDKVEEVQYDLNVVGEKLDI--------------AVED-------RAPK-VKAELLRERFVVLNRNDKRA---SCQYYVMRGQDHYINGKIFSYK-NLHPNLKIIFDISCQPNPRNLFVRFKELKDNRFKV  |
212L_CIV_15078924         ----CVYFKEREAKLQIT-TLEQKLEQMNITMIEMKEEMNLSMEEHAD-------KLDTLVDQNEELKLDVSEANEKLETVTHKLGI--------------AVED-------RSPR-LEQKPLRERFVLFKRNVKNA---RFQYYAIRGQSIYVNGRLTL-YNERYPNLEIIIDIFCQPNPRNLFLRFKNYVKDDERF /
313L_CIV_15079025         IYASKRQEQMLLE-------SHNLLKSMGIEVKDIKEQNNELLNEVG-----------ELREDNNELQEQVENVQEQIQKVQVKLEI--------------SVED-------RAPQ-PDKRGKKERFILLKRNDE-----HYPYYTIRAQDINAKKAVKR-QQGKYEEVLILLDLVCHPNTKTFYVRIKDDLKKKGVK  \
006L_CIV_15078718         QEKNDKIDELILFSKRMEEDRKKDREMMIKQEKMLRELGIHLEDVSSQ--------NNELIEKVDEQVEQNAVLNFKIDNIQNKLEI--------------AVED-------RAPQ-PKQNLKRERFILLKRNDD-----YYPYYTIRAQDINARSALKR-QKNLYNEVSVLLDLTCHPNSKTLYVRVKDELKQKGVV   |
AMV112_AMV_9964426        TNIIEENEITIKQK-------DDKIDELIQINKRIEEQNIKLLKLAE-----------KQNIKLDEISDELDETNYKLDTLTQTVEEN-------------ILPD-------RNIQ-PNDINLKHNLVIY-KKI------NNIIKITRAQNKYINKIKIS-------EDNIIIKE-YVPNPIDFINRMKLYCIDLNKK   | kilAN
AMV110_AMV_9964424        IKQKDDKIDELNNKLD---IIITTNKILEQKSTNLENINNKLLKLAE-----------KQNIKLDEISDELDETNYKLDTLTQTVEEN-------------ILPD-------RNIQ-PNDINLKHNLVIY-KKI------NNIIKITRAQNKYINKIKIS-------EDNIIIKE-YVPNPIDFINRMKLYCIDLNKK   |
AMV024_AMV_9964338        INIVEDKELEINDLNKKLSDIINQNNKILESNKNLENQNKKLLKLAE-----------KQNIKLDEIGDELDETNFKLDTLTQTVEEN-------------ILPD-------RNIS-PKDVNLKHNLVIY-KN-------NNEIKIIRAQNKYINKIKIL-------DENIIIKE-YVPNPIDFINRMKLYCVDINKK   |
FPV124_FPV_9634794        HKFNNKYDKDTLE-------LKELYREQRKEAKSLRKINERIEEKYDK-------DTRELKQGLKELKDENKELKFEL----KKIEER--------------LRD-------KVIN-PFSPNKHHRLVILQKKID-----NNSFKTLRLQAERLNQEMNKY------KTNILYFL-MHTNLTQYPVLIG*--------  /

consensus/95%             ...........................................................................................................................p..h.hh.............h..h.st.......t............hlhp....PNs...h.ph..........
consensus/90%             ............................................................................p......................................t.......p..hhhh.............h..h.sQp..hp..t.p.........pllhp....PNs...h.php..h......
consensus/85%             ..................................h....pph..................................p......ph...................s..........t.s....tc..hhhh.............h.hlpsQp..hp..t.p.........pllhc..h.PNst..h.phpp.h......
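
The consensus lines above summarize each alignment column at a given conservation threshold (95%, 90%, 85%). As a rough illustration only, a threshold consensus of this kind could be computed as in the sketch below. The residue classes and their single-letter symbols are assumptions made for the example and are not taken from this file; the function name is hypothetical, and the program actually used to produce the consensus lines is not stated here.

```python
from collections import Counter

# Assumed residue classes, ordered from most specific to most general.
# These groupings are illustrative and may differ from the ones used
# to generate the consensus lines in this file.
CLASSES = [
    ("-", set("DE")),            # negatively charged
    ("+", set("HKR")),           # positively charged
    ("l", set("ILV")),           # aliphatic
    ("a", set("FHWY")),          # aromatic
    ("c", set("DEHKR")),         # charged
    ("s", set("ACDGNPSTV")),     # small
    ("p", set("CDEHKNQRST")),    # polar
    ("b", set("EFIKLMQRWY")),    # big
    ("h", set("ACFGHILMTVWY")),  # hydrophobic
]


def consensus(rows, threshold):
    """Return a consensus string for equal-length aligned rows.

    A column gets the residue itself (upper case) if one residue reaches
    the threshold, otherwise the first class in CLASSES that reaches it,
    otherwise '.'. Gaps count against conservation because the threshold
    is taken over all rows.
    """
    if not rows:
        return ""
    width = len(rows[0])
    if any(len(row) != width for row in rows):
        raise ValueError("all aligned rows must have the same length")

    need = threshold * len(rows)
    out = []
    for column in zip(*rows):
        counts = Counter(ch.upper() for ch in column if ch.isalpha())
        symbol = "."
        if counts:
            residue, n = counts.most_common(1)[0]
            if n >= need:
                symbol = residue
            else:
                for name, members in CLASSES:
                    if sum(counts[r] for r in members) >= need:
                        symbol = name
                        break
        out.append(symbol)
    return "".join(out)


if __name__ == "__main__":
    toy = ["KIG", "KLG", "KV-", "RIG"]
    print(consensus(toy, 0.75))
    print(consensus(toy, 1.0))
```

Ordering the classes from specific to general means a column is labeled with the narrowest class that still satisfies the threshold, which is why a lower threshold yields a more informative consensus line.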


http://www.bmm.icnet.uk/servers/3dpssm/output/d454e5b32aaf504f.job_summary.html


#P63C
-----------------------

p63_BPMx8_15320633       QAVAERFLGVGLAPYAKRFPTPFYEGIFRLRGWPWHGPGTP--RPGVIAYWTNDLVYERLAP-ELLRLLRERNPMDKDTGRRAAKHHQLLSEDIGHPALAVAAKVDALNLPLEQNQVRVVFNWLQ  | + BRON
orf12_BP933W_4499795     --ILEAFVAKEIQPYITTFPADYYEELFRLRGLE-YPPENPRFRPQYFGVLTNDIVYKRLAP-NILEELKKQNV----KASKGTKLFQGLTPNIGYQKL                            \ + kilAN
Gp73_BPHK97_9634189      ------FLLDKSQPWEKRFSDPFYSAMFKMSGLPRHRPGR---RPSLFGMISAKWVYGPVLPPEVYAEVKRR-------LAAGDKIHQHLKPD                                  /





PHAGES:                                                                          HOST
phiPV83, phi ETA, phiSLT                                                         S. aureus
bIL285, bIL286, bIL311, pi3, BPphi31_1, TP901-1, RLT, BK5-T, LL-H, Tuc2009       Lactococcus
LcBPA2                                                                           Lactobacillus
TP-J34, Sfi21                                                                    Streptococcus thermophilus
A118                                                                             Listeria monocytogenes

HK97, N15, HK620, HK022, 933W, VT2-Sa, phi-R73, P22, H-19B, P1                   Escherichia coli
D3                                                                               Pseudomonas
APSE-1, GMSE-1                                                                   Acyrthosiphon pisum endosymbionts

Mx8                                                                              Myxococcus xanthus


XF2506_Xf_11362060
M_XF2524_Xf_11362500
C_XF2524_11362500
N_XF2524_Xf_11362500
XF0684_Xf_11362477
XF1663_Xf_11362484
XF1645_Xf_11362483
XF0704_Xf_11362478