130
U NIVERSITY OF C OPENHAGEN F ACULTY OF S CIENCE D ANISH A RCHAEA C ENTRE PhD thesis Daniel Stiefler-Jensen STABILITY OF HYPERTHERMOPHILIC PROTEINS -Post-translational modifications and other factors Academic Supervisors: Qunxin She & Kaare Teilum Submitted: December 2016

PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

  • Upload
    others

  • View
    11

  • Download
    1

Embed Size (px)

Citation preview

Page 1: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

U N I V E R S I T Y O F C O P E N H A G E N F A C U L T Y O F S C I E N C E

D A N I S H A R C H A E A C E N T R E

PhD thesis

Daniel Stiefler-Jensen

STABILITY OF HYPERTHERMOPHILIC PROTEINS -Post-translational modifications and other factors

Academic Supervisors: Qunxin She & Kaare Teilum

Submitted: December 2016

Page 2: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the
Page 3: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

i

PhD thesis

Stability of Hyperthermophilic Proteins

- Post-translational modifications and other factors

This thesis has been submitted to the PhD School of The

Faculty of Science, University of Copenhagen

By Daniel Stiefler-Jensen

Academic supervisor: Qunxin She

Co-supervisor: Kaare Teilum

Page 4: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

ii

Institute name: Biologisk Institut

Name of department: Department of Biology

Section: Functional Genomics

Author: Daniel Stiefler-Jensen

Titel(dansk): Hyper-thermofile proteiners stabilitet

- Post-translatoriske modifikationer og andre faktorer

Title(english): Stability of Hyperthermophilic Proteins

- Post-translational modifications and other factors

Subject description: This thesis aims to evaluate the importance of Post-translation

modification as a mechanism of protein stabilisation in

hyperthermophilic archaea along with the investigation of other

factors involved in protein stability of hyper-thermophiles.

Main supervisor: Qunxin She

Co-supervisor: Kaare Teilum

Submitted: December 2016

Type: Synopsis

Page 5: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

iii

Preface

This thesis in the conclusion of my PhD. Study, started March 2012, at the Faculty of Science,

University of Copenhagen. The majority of the experimental work was carried out at the Danish

Archaea Centre (DAC) and Structural Biology and NMR Laboratory (SBiNLab) of the

Department of Biology.

The thesis was supervised by Director of DAC and Associate Professor Qunxin She with a heavy

involvement of Co-supervisor Kaare Teilum, group leader at SBiNLab and Associate Professor.

During my PhD. I had the opportunity and pleasure of joining the laboratory of Simonetta

Bartolucci, at the University of Naples Federico II, where I joined the groups of Gabriella

Fiorentino and Patrizia Contursi for three months.

I also had the pleasure of supervising two bachelor students and a laboratory technician through

their final projects and internships, respectively. Additionally, I assisted with the teaching of

Biochemistry III for two consecutive years.

Daniel Stiefler-Jensen

Copenhagen, Denmark

December, 2016

Page 6: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

iv

Summary

Enzymes are not only essential for the workings of every cells in our bodies. They are also

becoming more important for our daily life as they play an increasing role in production of and

as part of everyday items. Thus, increasing the efficiently of these enzymes can result in better

and possible cheaper products. One aspect that can have a large impact on the efficiency of an

enzyme is its stability. By increasing the enzyme stability production cost and time can be

reduced, and consumers will have a better product with longer activity.

In the past it was only possible to increasing enzymes stability by randomly generate mutants

and lengthy screening processes to identify the best new mutants. However, with the increase in

available genomic sequences of thermophilic or hyperthermophilic organisms a world of

enzymes with intrinsic high stability are now available. As these organisms are adapted to life at

high temperatures so are their enzymes, as a result the high stability is accompanied by low

activity at moderate temperatures. Thus, much effort had been put into decoding the mechanisms

behind the high stability of the thermophilic enzymes. The hope is to enable scientist to design

enzymes with high stability and activity at a target temperature. This thesis presents an

investigation of an hyperthermophilic esterase, and how post-translational modifications (PTM)

and other factors affect the stability of this enzyme. As part on the ongoing effort to

understanding the mechanisms involved in the high stability of hyperthermophilic enzymes.

The thesis starts with an introduction to the field of protein and enzyme stability with special

focus on the thermophilic and hyperthermophilic enzymes and proteins. After the introduction

three original research manuscripts present the experimental data related to this study. In the first

manuscript, the effect of lysine methylation on enzyme stability is investigated. This study

makes use of two different methods to acquire enzyme without the native lysine methylation.

The effect of the methylation is subsequently evaluated by testing the stability of the different

versions of the esterase. Here we show that methylation plays a minimal, yet significant, role in

stabilising the enzyme. We also show that if the esterase is produced from a mesophilic host, an

organism that lives at moderate temperatures, in this case E. coli, that additional factors that

serves to stabilise the enzyme is either missing or that factors from E. coli destabilises the

enzyme. We were however unable to identify these factors. The second manuscript present an

investigation of the importance of dimer formation for the stability of the esterase investigated in

Page 7: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

v

manuscript one. By the introduction of a novel cross dimer disulphide bond we demonstrated a

stabilysing effect of dimer formation by increasing the half-life of the esterase by 1.62 fold at a

temperature of 90°C. However, efforts to disrupt the native dimer formation by targeting the core

elements of the dimer interface failed. The mutants were however compromised in catalytic

stability at a temperature of 90°C. The third manuscript details problem area and makes

suggestions for better design of esterase activity assay for use at temperature up to 90°C.

Page 8: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

vi

Sammenfatning

Enzymer er ikke kun en vigtig for funktionen af hver eneste celle i vores kroppe, de er ligeledes

blevet en vigtig del af vores dagligdag. De spiller en stadig stigende rolle i produktionen af og

som en del af dagligvarer. Således kan en øget effektivt af disse enzymer resultere i bedre og

muligvis billigere produkter. Et aspekt, som kan have en stor indflydelse på effektiviteten af et

enzym er, dets stabilitet. At øge enzymers stabiliteten vil forlænge den tid hvor de forbliver

aktive. Dette kan resultere i reducerede produktionsomkostningerne og bedre produkter.

Traditionelt bliver enzymers stabilitet forøget ved hjælp af tilfældige mutationer efterfuldt af en

langvarig screenings proces hvor enzymer med forbedret stabilitet identificeres. Men med øgede

tilgængelige genom sekvenser fra termofile og hyper-termofile organismer, er en verden af

enzymer med en naturlig høj stabilitet nu tilgængelige. Da disse organismer er tilpasset til et livet

ved høje temperaturer er disse enzymers høje stabilitet er oftest ledsaget af lav aktivitet ved

moderate temperaturer. Af denne grund er der blevet gjort en stor indsats for at afkode de

mekanismer der ligger bag den høje stabilitet af disse termofile enzymer. Håbet er at denne viden

vil gøre os i stand til at designe enzymer med høj stabilitet og et passende temperatur optimum.

Denne afhandling præsenterer en undersøgelse af en hyper-termofile esterase, og hvordan post-

translationel modifikation (PTM) og andre faktorer påvirker stabiliteten af dette enzym.

Afhandlingen indledes med en synopsis over proteiner og enzymers stabilitet med særlig fokus

på de termofile og hyper-termofile enzymer og proteiner. Efter synopsis præsenterer de

eksperimentelle data relateret til dette studie i tre originale forskning manuskripter. I det første

manuskript, undersøges effekten af lysin methylering på esterasens stabilitet. Denne

undersøgelse gør brug af to forskellige metoder til at producere enzym uden den naturlige lysin

methylering. Virkningen af methylering bliver efterfølgende evalueret ved at teste stabiliteten af

de forskellige versioner af esterasen. Her viser vi at, lysin methylering spiller en minimal, men

signifikant, rolle for enzymets stabilitet. Vi viser ligeledes, hvis esterasen producers af en

mesofil organisme, en organisme der lever ved moderate temperaturer, i dette tilfælde E. coli,

resulter in reduceres stabilitet på grund af mangle faktorer der er med til at stabiliser esterasen,

eller faktorer fra E. coli der destabiliserer enzymet. Vi var dog ikke i stand til at identificere disse

faktorer. Det andet manuskript præsenterer en undersøgelse af betydningen af dimer dannelse

for den samme esterases stabilitet. Ved at introducer en ikke før beskrevet disulfid bro der

forbinder de to dele af dimeren, kunne vi øge esterasens halveringstiden med en faktor på 1,62

Page 9: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

vii

ved en temperatur på 90°C. Men bestræbelserne på at bryde dimer dannelse ved at målrettet at

mutere de centrale elementer i dimer grænsefladen mislykkedes. Mutanternes katalytiske

stabilitet blev imidlertid kompromitteret ved en temperatur på 90°C. Det tredje manuskript

beskriver problemområder og giver forslag til bedre udformning af esterase aktivitets assays til

anvendelse ved temperaturer op til 90°C og kontinuert data indsamling.

Page 10: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

viii

Acknowledgements

During the last 4, almost 5, years I have learned a lot, not only in respect to my research but also

about myself. I have learned that I can be my own worst enemy, and how to overcome myself. In

addition to the scientific knowledge, I have gained over this period of my life, I count these

experiences as the most important and I would like to thank all who have been involved in my

personal and professional evolution during my PhD. If I fail to mention you by name, I do

apologise. But do know that you have been a part of an important and amazing part of my life.

I would like to thanks Qunxin She and Kaare Teilum for the opportunity to conduct my PhD

study in their laboratories and for the support and guidance I have received over the years. I will

also like to thanks Roger A. Garrett and Xu Peng for their always enlighten input to a scientific

discussion. Special thanks go to my colleague and follow PhD student Troels Schwarz-Linnet,

thanks for great team work. I will also like to thanks current and former members of the Danish

Archaea Centre. Also thanks to Hien Phan, Mariana Awayez, and Pia Skovgaard for their

technical expertise in the laboratory, without laboratory technicians the labs would fall apart.

I will like to thank my parents and my in-laws for their support during this period of my life

without your support and understanding it would not have been possible.

The conclusion of the PhD would not have been possible without my wonderful and patient

family. I wished to express my deepest gratitude to my lovely wife, Pernille and my two

wonderful sons, Milo and Sirius. Without their support and understanding I would not have been

able to conclude my work. From my heart thanks.

-Daniel

Page 11: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

ix

List of manuscripts

I. Stiefler-Jensen, D., Schwarz-Linnet, T., Lichtenberg, C. D., Nguyen, T. T. T., Rand,

K. D., Li Huan, She, Q., and Teilum, K. (2016) The extraordinary thermal stability of

EstA from S. islandicus is independent of post translational modifications,

Manuscript in preparation

II. Stiefler-Jensen, D, Teilum, K., and She. Q. (2016) Effect of dimer interface integrity

on the stability of a hyperthermophilic esterase, Manuscript in preparation

III. Stiefler-Jensen, D., Schwarz-Linnet, T., She, Q., and Teilum, K. (2016) An esterase

activity assay suitable for use at temperatures up to 90 °C, Manuscript in preparation

Page 12: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

x

Table of Contents

Preface ............................................................................................................................................ iii

Summary ........................................................................................................................................ iv

Sammenfatning ............................................................................................................................... vi

Acknowledgements ...................................................................................................................... viii

List of manuscripts ......................................................................................................................... ix

Table of Contents ............................................................................................................................ x

Synopsis .......................................................................................................................................... 1

1 Aims of the thesis ............................................................................................................................................... 1

2 Introduction ......................................................................................................................................................... 2

3 Protein stability ................................................................................................................................................... 3

Protein folding ........................................................................................................................................... 4

Stabilising factors ....................................................................................................................................... 6

Non-covalent interactions. ......................................................................................................................... 6

Ionic interaction .................................................................................................................................................. 8

Van der Waals forces .......................................................................................................................................... 9

Hydrophobic interactions .................................................................................................................................... 9

Covalent interactions ............................................................................................................................... 10

Post-translational modifications ............................................................................................................... 11

4 Protein structure determination ......................................................................................................................... 12

Computational structural biology ............................................................................................................. 14

Homology modelling ........................................................................................................................................ 14

Discrete state off-lattice models ........................................................................................................................ 15

Structural fragment modelling .......................................................................................................................... 16

5 Protein engineering ........................................................................................................................................... 17

Directed evolution, traditional ................................................................................................................. 17

Rational design, modern .......................................................................................................................... 18

6 Protein adaptation to extreme temperatures ...................................................................................................... 19

Amino acid composition .......................................................................................................................... 20

Counter selection of thermolabile amino acids ........................................................................................ 21

Increased number of charged amino acids ............................................................................................... 21

Lysine methylation: ................................................................................................................................. 22

Structural optimisation ............................................................................................................................. 23

Hydrogen bonds ....................................................................................................................................... 24

Hydrophobicity ........................................................................................................................................ 24

Increased number of disulphide bonds ..................................................................................................... 24

Page 13: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

xi

Oligomerisation........................................................................................................................................ 25

7 Enzymes of special interest ............................................................................................................................... 25

EstA, an Archaeal esterase ....................................................................................................................... 26

aKMT, an Archaeal lysine methyltransferase .......................................................................................... 28

8 Summary of results ........................................................................................................................................... 29

Optimisation of esterase assay ................................................................................................................. 29

Expression of soluble dimeric EstA for E. coli host ................................................................................ 30

Lysine methylation as a factor in protein stability ................................................................................... 30

Investigation mesophilic expressed EstA increased sensitivity to detergents. ......................................... 31

The Effect of dimer integrity on the stability of EstA .............................................................................. 31

9 Conclusion & Perspective ................................................................................................................................. 31

10 References .................................................................................................................................................... 34

Chapter 1 ....................................................................................................................................... 43

Chapter 2 ....................................................................................................................................... 65

Chapter 3 ....................................................................................................................................... 89

Appendix I ..................................................................................................................................... 99

A – Introduction to Mutations Analyser .................................................................................................................. 99

B – Mutations Analyser python code .................................................................................................................... 106

Page 14: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the
Page 15: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

1

Synopsis

Stability of Hyper-Thermophilic Proteins

1 Aims of the thesis

The work presented here were made in an effort elucidate the role of post-translational

modifications, with special focus on lysine methylation, on the stability of proteins from

hyperthermophilic archaea of the genus Sulfolobus. To achieve this an esterase native to

Sulfolobus islandicus, EstA, previously showed to have compromised stability when expressed

in Escherichia coli, used as a model for post-translational modification dependent

thermostability. Through the use of this model the following is addressed:

Characterising the difference between protein expressed from E. coli, a mesophile, and

S. islandicus, a hyperthermophile.

o Addressing key issues in previously published studies of EstA stability when

expressed from E. coli: Oligomeric state, difference in amino acid sequence.

Identify post-translational modifications of EstA expressed from the native host S.

islandicus.

Produce EstA from S. islandicus, the native host, without in vivo lysine methylation

Evaluating the stabilising effect of post-translation modification.

Investigate the importance of the EstA oligomeric state in relation to stability.

Page 16: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

2

2 Introduction

For thousands of years’ and across the world, Mankind have exploited microorganisms, most

notably for fermentation of sugar to ethanol e.g. in the making of wine and brewing of beer,

and for leavening of bread. Our utilization of nature is now at the point where we can pick

“parts” from all manner of life and use them as we see fit, these “parts” or genes can code for

enzymes. The possible uses of enzymes are almost limit less, Enzymes can be used in the

production of medicines, biofuel, chemical, food, and in paper production and recycling, as

treatment for metabolic disorders in enzyme replacement therapy (Neufeld 2006), or as active

ingredients in e.g. laundry detergents, toothpastes and dish washing detergents and much

more. The use of enzymatic biocatalysts in an industrial production can be beneficially in

several ways e.g. by lowering energy usage, increasing product yields and quality, and

reducing the environmental impact. As enzymes in many cases can replace expensive and

polluting production methods in the chemical industry with relative mild reaction conditions.

This use of enzymes is known as white biotechnology (Frazzetto 2003). In the white

biotechnology industry and in general, high enzyme stability is a valued trait for almost all

application of enzymes, one exception is DNA restriction endonucleases used in molecular

biology where thermal inactivation is a practical trait of the enzymes. The stability of an

enzyme can have a large impact on the amount of enzyme and processing time needed and as

a result on the cost of using an enzymatic catalyst. Hence maximising the stability of an

enzyme, sometimes at the cost of activity, can be high beneficially (Cao et al. 2015), as an

increase in stability allows for an increase in the processing temperature, which in turn result

in acceleration of substrate conversion (Klibanov 1983). It is however not always possible to

obtain enzymes with a high stability from organisms living at normal temperatures, known as

mesophiles. However, thermophiles and hyperthermophiles, organisms living at high

temperatures above 45°C and 65°C respectively, possess enzymes that through evolution have

acquired high stability. It is not always possible to find a thermophilic enzyme suitable as a

replacement for a mesophilic enzyme. Hence decoding the factors that make thermophilic

enzymes stable, may allow us to engineer thermophilic enzymes from mesophilic enzymes.

The research presented in this thesis investigate the effect of post-translational modifications

and other factors on the stability of hyperthermophilic enzymes.

Page 17: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

3

In the following section I will introduce determining factors in protein stability and folding

along with some background information about protein structure determination and protein

modelling.

3 Protein stability

Enzyme activity is essential for life, as enzymes are responsible for the cells metabolism.

Enzyme functionality and activity is dependent on the active site, a micro-environment

tailored for catalysis of specific chemical reactions. Thesis micro-environments are the results

of the peptide-polymer of the enzyme folding to a specific three-dimensional (3D) structure

where the enzyme is active. This structure is known as the native conformation. If for any

reason this 3D structure is disrupted the enzyme loses it activity and functionality. Hence the

longer an enzyme can maintain its native conformation the better. Both the 3D structure and it

stability is dependent on the amino acid sequence of the enzymes. This was evident as early

as 1961 where it was show that peptides can fold spontaneously in vitro (Anfinsen et al.

1961). The amino acid sequence of all proteins is encoded in DNA as genes. The DNA

encoded amino acid sequence is transcribed from DNA to mRNA and translated from mRNA

into a peptide during protein synthesis. The flow of information from DNA to protein is

known as the central dogma of information flow, see Figure 1. As the amino acid sequence is

the only deterministic feature for the protein structure, it must also be deterministic for its

stability. How the amino acid sequence is responsible for the folding and stability, is the

subject of the following sections.

Figure 1. The central dogma of information flow. 1| a DNA encoded gene or open reading frame is transcribed to mRNA

by a RNA polymerase. 2| the mRNA is translated to a peptide sequence by the ribosome and tRNA’s.

Page 18: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

4

Protein folding

As indicated above protein folding and protein

stability are not two independent concepts, both are

dependent on the amino acid sequence of the

protein. The factors that determines the protein

folding are also the factors that determines the

stability of the fold. Proteins are synthesised as

linear polymers consisting of a combination of 20

standard amino acids. Each amino acid consists of

an α-amino group, α-carboxyl group, a α-carbon

and a side chain, see Figure 2. The α-amino group, α-carboxyl group, and a α-carbon form the

peptide back-bone, where individual amino acids are joined together with a peptide bond

through a condensation reaction between the α-carboxyl group of the first amino acid and the

α-amino group of the next in the poly-peptide chain. The side chain of each amino acid is

unique, and the amino acids can be divided into four groups, based on chemical properties of

their side chain: hydrophobic, polar uncharged, charged, and special cases. As proteins are

synthesised they start interaction with its solvent, water under normal circumstances. As a

result, some hydrogen bonds between water molecules will be broken and new ones will be

formed between the protein and the water. Usually this will be thermodynamic unfavourable.

This unfavourable interaction with water drives the folding of the poly-peptide to minimise

the Gibbs free energy for the protein/water system, see BOX I. The efficiency of this

minimisation determines the stability of the protein. The average difference in Gibbs-free

energy, obtained through protein folding, ΔGF, is only about 40 kJ/mol (Dill et al. 2008).

Figure 2. The parts of the amino acid: α-amino group, α-

carboxyl group, α-carbon, and R the side chain

BOX I: Thermodynamic of state change For a state change, like that of a protein folding from a non-native state to the native state, to be spontaneously and stable the change

in Gibbs-free energy, ΔG, most be < 0 and a minimum, see Eq. 1 where. GN and GD are the Gibbs-free energy of the native fold and

the non-native folds respectively.

Eq. 1 ∆𝐺𝐹 = 𝐺𝑁 − 𝐺𝐷,

The Gibbs-free energy of each state can be expressed as a function of entropy (S) and enthalpy (H), at constant pressure and

temperature in the following manner:

Eq. 2 𝐺 = 𝐻 − 𝑇𝑆

By combining Eq. 1 & Eq. 2 we get Eq. 3. From Eq. 3 it can be seen that any loss of entropy as a result of steric constrains must

be compensated for by a loss in enthalpy.

Eq. 3 ∆𝐺 = ∆𝐻 − 𝑇∆𝑆

A spontaneous State changes, ΔG < 0, can be enthalpy or entropy driven if ΔH < 0 & ΔH < ΔS <= 0 or ΔS > 0 & ΔS < ΔH >= 0

respectively.

Page 19: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

5

Most protein folds from a linear poly-

peptide to its native conformation on a

micro second (µs) time scale. An

estimation of folding speeds for single

domain proteins can be made from the

length of poly-peptide(N) by N/100 µs

(Kubelka et al. 2004). How protein and

enzymes fold to their 3D structure in such

a short time scale have troubled

researchers. Thus Levinthal’s paradox

states that proteins can’t fold to it native

conformation by random. As his thought

experiment demonstrated: a 101 amino

acid long peptide will take 1027 years to sample all possible structures if each peptide bond

only has 3 degrees of freedom and 1013 different conformations can be sampled per second

(Zwanzig et al. 1992). To overcome this problem an initial theory where protein folding

through specific pathways (Levinthal 1968) has now evolved to a theory where protein

folding is seen as a funnel shaped energy landscape, see Figure 3. Where intermedia

structures/local energy minimums guide the conformation to the native fold through multiple

pathways (Dill & Chan 1997), and thereby greatly accelerate protein folding to a microsecond

timescale.

Three mechanisms, that are not mutual exclusive, are used to describe the progression through

the energy landscape. The frame-work

mechanism accounts on the formation of

local secondary structures independent of

tertiary interaction, with a subsequent

conglomeration to the native

conformation by random movement and

collision (Karplus & Weaver 1994;

Kim Baldwin, R.L. 1982), see Figure 4a.

In contrast the hydrophobic collapse

model relies on the initial formation of a

molten globular structure as a result of

Figure 3. Proteins have a funnel-shaped energy landscape with

many high-energy, unfolded structures and only a few low-

energy, folded structures. Folding occurs via alternative

microscopic trajectories. from (Dill & MacCallum 2012)

Figure 4. Three types of protein folding pathways (Nölting &

Andert 2000).

Page 20: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

6

stacking hydrophobic residues. From the molten globular secondary and tertiary structures

will emerge (Dill 1985), see Figure 4b. The hydrophobic collapse is believed to be one of the

earliest events in protein folding and experiments evidence place it on the Nano-second scale

(Sadqi et al. 2003). The nucleation-condensation model is based on the initial formation of

local interactions which serve as a template around which the structure folds around (Fersht

1995; Nölting & Agard 2008), see Figure 4c. The next section will focus on the factors

stabilising and driving the formation of these structures.

Stabilising factors

As protein folds to minimise their Gibbs free energy, interactions are formed within the

protein structure and between the protein and its solvent. As a result, of the folding sterically

constrains are placed on the polypeptide. This destabilises the protein fold together with

factors like desolvation of polar and charged. These destabilising effects must be

compensated for by the formation of stabilising interactions. These interactions takes many

forms and are described in following sections.

Non-covalent interactions.

The non-covalent interaction constitutes the vast majority of intra molecular interaction in the

folded protein. Non-covalent interactions are transient in contrast to covalent bonds.

Hydrogen bonds are the most common interactions found. Hydrogen bonds stabilises the

protein through bonds between adjacent polar atoms, e.g. Oxygen and Nitrogen, and hydrogen

atoms in the protein structure, as it is the case in water and especially ice, see Figure 5.

Hydrogen bonds can be classified as; sidechain-sidechain, sidechain-backbone, or backbone-

backbone. Hydrogen bonding from backbone-backbone interactions account for 68% of all

hydrogen bonds in proteins (Stickle et al. 1992). The majority of a protein structures are made

up of repeater structure motifs, known as α-helixes, β-sheets, loops and turns. Here backbone-

backbone hydrogen bonds play an important role. An α-helix is formed by hydrogen bonds

BOX II: Definition of the hydrogen bond “The hydrogen bond is an attractive interaction between a hydrogen atom from a molecule or a molecular fragment X–H

in which X is more electronegative than H, and an atom or a group of atoms in the same or a different molecule, in which

there is evidence of bond formation. A typical hydrogen bond may be depicted as X–H•••Y–Z, where the three dots denote

the bond. X–H represents the hydrogen bond donor. The acceptor may be an atom or an anion Y, or a fragment or a molecule

Y–Z, where Y is bonded to Z. In some cases, X and Y are the same. In more specific cases, X and Y are the same and X–H

and Y–H distances are the same as well leading to symmetric hydrogen bonds. In any event, the acceptor is an electron rich

region such as, but not limited to, a lone pair of Y or π-bonded pair of Y–Z.” (Arunan et al. 2011)

Page 21: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

7

between the peptide backbone of two

amino acids separated by three amino

acids and are hence local interactions, see

Figure 6A. On an α-helix the side chain of

each amino acid will protrude outwards

from the centre of the helix. β-sheets on

the other hand are formed by hydrogen

bonds between the peptide backbone of

amino acids which not necessarily are

close to each other in the primary

structure. β-sheets can consist of parallel

and/or antiparallel strands, see Figure 6B & C. Between parallel β-strands each amino acid

forms a hydrogen bond with two different amino acids on the opposing strand. In antiparallel

β-strands the hydrogen bonds are formed between two amino acids one on each strand. In

both cases each amino acid participates in two hydrogen bonds. The side chains of a β-strand

alternate between pointing up and down from a horizontal backbone. Each hydrogen bond is

estimated to contribute with 1-2 kcal/mol on average and is considered a weak interaction

(Myers & Pace 1996). The maximum length of a hydrogen bond in protein structures are

estimated to about 3 Å (Stickle et al. 1992)

Figure 6 Hydrogen bonding in Secondary Structure. A| Hydrogen bonds in an α-Helix, between amino acids with 3

intervening amino acids B| Hydrogen bonds in parallel β-sheets, 1 bond between 2 amino acids on opposite strands C|

Hydrogen bonds in anti-parallel β-sheets, 2 bonds between 2 amino acids on opposite strands. Side-chains not shown.

Figure 5 Hydrogen bond network of water. The hydrogen

atoms form covalent bonds with the oxygen atom. This results

in a dipole that allows the oxygen atom to form hydrogen

bonds with it remaining electron pairs to the hydrogen’s of

other water molecules. The same is true for the hydrogen’s of

the same molecule. In total four hydrogen bonds can be formed

per water molecule.

Page 22: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

8

Ionic interaction is non-covalent interaction between positively charged side-chains of lysine,

arginine, and in some cases histidine, and the negative charge of the side-chains of aspartate

and glutamate. Ionic interaction has been defined in 1983 as ion pairs, when Nitrogen or

Oxygen atoms from two opposing charged side-chains or termini are within 4 Å of each other.

It has also been show that at low salt concentration 0.01 M weak interaction below 0.2

kcal/mol can extent more than 10 Å (Lee et al. 2002). Ionic interactions can be further

strengthened, by hydrogen bonding between the two charged side-chains, hence forming a

salt-bridges see Figure 7, this combination of interaction result in rather strong interaction, 4

kcal/mol (Kumar & Nussinov 1999). The pure charged based ionic interaction is sometimes

referred to as coulombic interaction. The idea that proteins are stabilised by ionic interactions

is challenges by desolvation and entropy cost of the. This aspect of ionic interactions is

discussed in more detail in section 6.3.

As ionic interactions are dependent of the charge of the residues, this type of interactions is

sensitive to charge screening by ions in the solvent. As a result, the salt concentration of

solvents can affect the stability of proteins. This effect can be both positive or negative

depending of the thermodynamics of the ionic interaction e.g. screening an unfavourable

charge thereby increasing the stability or vice versa. Additionally, as the charge of these

amino acids side-chain are the result of their basic and acidic nature, the charges are

dependent on pH.

Figure 7 Example of a possible salt bridge between Arginine and glutamate.

Page 23: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

9

Van der Waals forces (VdW) are the

weakest and has the shortest range of

the non-covalent interactions. VdW

interaction are the result of electrostatic

fluctuations between atoms that comes

in to close proximity, the resulting

complementary electrostatic di-pols

attracts the atoms to each other, see

Figure 8. Hence stabilising the protein

with 0.5-1 kcal/mol per atom pair (Berg

et al. 2007). However, if the atoms

come to close to each other the VdW

switches to a repulsive force as the electron clouds of the two atoms collide.

Hydrophobic interactions are the last of the none-covalent interaction. In contrast to other

none-covalent interaction hydrophobic interaction is not based on electrostatic attractions

between the residues. In contrast, hydrophobic interactions are the result of entropy loss

minimisation. The solvation of hydrophobic residues is the result of rearrangement of water

molecules around the hydrophobic side-chain. As the water can’t form hydrogen bonds with

the side-chains and only few orientations allows for optimal hydrogen bonding, a crystal-like

shell is formed around the hydrophobic side-chain (Dias et al. 2010). This state change from

bulk water to shell water is thermodynamically unfavourable. As, the enthalpy of the state

change can be positive, zero, or negative, but the entropy will be negative and much larger

that the change in enthalpy, ∆𝐻 < ∆𝑆. Hence per Eq. 3 in BOX I this results in a positive ΔG

of the state change. However, this

unfavourable Gibbs free energy can be

reduced by a seconded state change.

Here the desolvation of the side chains,

by collapsing them in upon them self

thereby reducing their surface area and

the amount of shell water results in a

net increase of enthalpy and entropy.

As before the change in entropy is much larger than that of enthalpy. As a result, the change

Figure 8. Van der Waals interaction is the results electrostatic di-

pols induced by electrostatic fluctuations. In this simplifyed figure

the proton and electrons are ilustrated by + and – respectivly.

Before the VdW interaction is estaplished the electron cloud are

evenly distributed aroun the atoms nuclius. Once the electrostatic

di-pol is estaplished and a a VdW interaction are taking plate the

electrol cloud is displased to allowing the electrostatic atraction.

Figure 9 Schematic representation of hydrophobic collapse of a 50

amino acid long peptide, in a ball and chain model. Polar and

hydrophobic residues are coloured red and blue respectively.

Page 24: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

10

in Gibbs free energy is negative and thermodynamically favourable. This minimisation of

shell water and its high cost in entropy are the force that drives hydrophobic interaction. From

Eq.3 we see that since the hydrophobic interactions are entropy driven, as ΔS>0 and ΔH>0,

that hydrophobic interactions are strengthent by increasing temperatures. As mentioned above

collapse of a hydrophobic core, see Figure 9, is believed to be an important pathway for

protein folding, and is driven by this entropy minimisation. Hydrophobic interaction is

however not limited to the folding of the protein it also plays an important role in complex

formation (Byun et al. 2007).

Covalent interactions

Of all the interactions observed in proteins, covalent bonds between individual amino acids,

excluding the peptide bonds, are the rarest. Only 6% or 34165 of the 552884 entries in

SWISS-PROT1 are annotated with disulphide bonds. Which under normal conditions are the

only covalent bonds found between residues in proteins. This kind of covalent bonds are

considered strong interactions due to their covalent nature, the enthalpy of such a bond is 62

kcal/mol (Jursic 1996). These covalent bonds impose severe steric constrains on the peptide-

chain. This greatly reduces their contribution to the stability to the protein fold, due to a large

entropy cost of the bond formation. The Chain-entropy model suggest that disulphide bonds

lower the entropy of none native conformation there by decreasing their stability (Pace et al.

1988) rather than directly increasing the stability of the fold. Experimental data suggest that

disulphide bonds can be both stabilizing and disruptive of both the native and denatured

1 SWISS-PROT homepage: http://www.uniprot.org/ Data extraction: 23-11-2016.

BOX III: Thermodynamic of State change in hydrophobic interactions The first state change: Side-chain solvation or from Bulk water to shell water:

Eq. 4 ∆𝐺𝑆𝑜𝑙 = 𝐺𝑆ℎ𝑒𝑙𝑙 − 𝐺𝐵𝑢𝑙𝑘 from Eq. 3 in BOX I we then get

Eq. 5 ∆𝐺𝑆𝑜𝑙 = ∆𝐻𝑆𝑜𝑙 − 𝑇∆𝑆𝑆𝑜𝑙 where ∆𝑆𝑆𝑜𝑙 < 0 and ∆𝑆𝑆𝑜𝑙 ≪ ∆𝐻𝑆𝑜𝑙 As a result

Eq. 6 ∆𝐺𝑠𝑜𝑙 > 0. And the state change is not spontaneous

The seconded state change: Side-chain desolvation or “hydrophobic collapse”:

Eq. 7 ∆𝐺𝐷𝑠𝑜𝑙 = 𝐺𝐻𝑦𝑑 − 𝐺𝑆ℎ𝑒𝑙𝑙

Eq. 8 ∆𝐺𝐷𝑠𝑜𝑙 = ∆𝐻𝐷𝑠𝑜𝑙 − 𝑇∆𝑆𝐷𝑠𝑜𝑙 where ∆𝐻𝐷𝑠𝑜𝑙 & ∆𝑆𝐷𝑠𝑜𝑙 > 0 and ∆𝑆𝑆𝑜𝑙 ≫ ∆𝐻𝑆𝑜𝑙 As a result

Eq. 9 ∆𝐺𝑠𝑜𝑙 < 0. And the state change is spontaneous

Temperature dependency on the strength of hydrophobic interactions:

As ∆𝑆 > 0 & ∆𝐻 > 0 & ∆𝑆 ≫ ∆𝐻 hydrophobic interactions are entropy driven. From Eq.3 from BoxI we see that the

contribution of entropy is a factor of the temperature and hence the Gibbs free energy of the hydrophobic interaction will decrease

as temperature increase thus increasing the strength of the interaction.

Page 25: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

11

structure (Zavodszky et al. 2001). It is estimated that disulphide bond can contribute with as

much as 2-4 kcal/mol (Betz 1993).

Post-translational modifications

Inter- and intra-molecular interactions are not the only factors that can affect protein stability.

Post-translational modifications of the amino acid side-chains generate an additional diversity

to the peptide chain otherwise not possible with the use of only 20 unique amino acids. These

protein modifications can range from small modifications like addition of a methyl group to

lysine to the addition of large branched carbon hydrates also known as oligosaccharides. This

type of modification is known as glycosylation. For some proteins like the human hormone

Erythropoietin (EPO) the oligosaccharides added to the protein constitutes 50% of the final

mass (Yamaguchi et al. 1991). Protein glycosylation comes in three forms, N-, O-, and S-

glycosylation, named after the glycosyl acceptor atom of an amino acid side chain, Nitrogen,

Oxygen, and Sulphur (Stepper et al. 2011) respectively. Protein glycosylation are seen in all

three domains of life, Archaea, Bacteria, and Eukarya (Wieland et al. 1985; Szymanski et al.

1999; Neuberger 1938) but most common in Eukaryotes. The chemical diversity of

glycosylation is enormous, and estimated to consist of over 7000 unique structures as

reviewed by R. D. Cummings in 2009 (Cummings 2009). Protein glycosylation has been

show to play a part in many biological functions e.g. secretion, adhesion, function etc.

Glycosylation has also been linked to protein folding and stabilisation. As N- glycosylation

was showed to increase protein folding rates and stability (Hanson et al. 2009). It is theorised

that the addition of the oligosaccharide to the poly-peptide decreases enthalpy of the unfolded

state hence stabilising the protein fold (Shental-Bechor & Levy 2008). Both these studies

show that the modifications are more important than the size of the oligosaccharide.

Methylation is like glycosylation best known from eukaryotes, and its role in epigenetic

regulation of gene expression, through lysine methylation of histone and DNA condensation

as reviewed by (Bannister & Kouzarides 2005; Kouzarides 2007). Lysine methylation has

however, been suggested as a general mechanism for stabilisation of proteins. As Crenarchaea

has been shown to have wide spread (Botting et al. 2010) and in some cases temperature

dependent lysine methylation (Baumann et al. 1994). The implication of methylation on

protein stability will be further discussed in section 6.4. The chemical diversity of methylation

is small compared to that of glycosylation. This is due to the simple chemical composition of

the methyl group. Both arginine and lysine subjects of methylation can be both mono- and di-

Page 26: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

12

methylation while lysine also can be tri-methylated, see Figure 10. Lysine methylation is

usually catalysed by a lysine methyl transferase, EC 2.1.1, through the conversion of S-

Adenosyl methionine (SAM) to S-Adenosyl-L-homocysteine (SAH) and the simultaneously

transfer of the methyl group from the SAM to the N-ε-amino group of the lysine side-chain.

Figure 10 Lysine Methylation. A| Unmethylated-lysine. B| Monomethyl-lysine. C| Dimethyl-lysine. D|

Trimethyl-lysine.

Another common type of PTM is proteolytic cleavage of the peptide backbone, to produce

truncated proteins. It is a well-known process with a few classic examples: The removal of the

N-terminal methionine (Sherman et al. 1985), and the maturation of insulin (Docherty et al.

1982).

Beside these three types of modifications, are there countless other. From the dbPTM a data

base of post translation modification there are 602 different types of side-chain modifications.

These 60 types are in addition to glycosylation, lipidation and protein crosslinks. Covering the

complete range is outside the scope of this thesis and will not continue beyond this point.

4 Protein structure determination

With all these different forces and interactions driving and stabilising the proteins folds, the

task of modelling the structure of proteins has, so far in most cases, been out of reach. The

scope to the problem was highlighted by Levinthal’s paradox in section 3.1. However, the

structure of a protein contains vast amount of useful information. To access this information,

we need a model of the structure. Currently, are there several ways to obtain structure

information from proteins each with its own strengths and weaknesses. The most common

methods used are: X-ray crystallography and Nuclear magnetic resonance spectroscopy

(NMR).

2 http://dbptm.mbc.nctu.edu.tw/index.php -Homepage of the dbPTM. Date of data extraction: 23-11-2016

Page 27: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

13

X-ray crystallography is the most used method, 90%3 of structures in the Protein Data Bank

(PDB) are obtained by X-ray diffraction. The main challenge in X-ray crystallography usually

is obtaining the necessary high amount of protein and then producing crystals. Finding the

proper condition for the formation of crystals can be very time consuming, as it is a process of

trial an error.

NMR has been used for solving 9%3 of the structures in PDB and is the second most used

method. It utilizes the magnetic moment or spin of specific elemental isotopes e.g. 1H 13C and

15N. As the natural abundancy of 13C and 15N are low and they are required for solving large

structures, obtaining protein containing these isotopes relies on the expression of the protein

from a host grown in a defined media where the heavy isotopes, 13C & 15N, are the only

Carbon and Nitrogen source. These media are expensive and not all organisms can be grown

in these sufficiently defined media, also known as minimal media. In contrast to X-ray

crystallography, NMR is limited to proteins of a size below 50 kDa.

In addition to classic experimental based methods like NMR and X-ray crystallography are

the computational or Computer based methods. The rapid evolution of computers is pushing

us towards the point where we can model protein folding. This emerging field is described in

section 4.1.

Protein structures are an important tool in research fields. In medical research structural

information can be used for rational drug design where information about protein-protein

interaction or ligand binding sites can be instrumental in the development of inhibitors of

virulence factors and oncogenes (Guillon et al. 2014; Lessene et al. 2013). Protein structures

are also widely used in the study of protein stability. Here the structures provide information

on how the different intra-molecular interactions come together and stabilises the protein fold.

E.g. structural information of two homologues enzymes, one mesophilic and one

thermophilic, can be compared and rationalising of differences in the structures can provide

insight into the basis for the difference in stability. Additionally, Large scale structural

analysis projects comparing structural tendencies of many proteins might provide information

of general mechanisms utilised for achieving high protein stability. Protein stability modelling

3 www.rcsb.org : H.M. Berman, J. Westbrook, Z. Feng, G. Gilliland, T.N. Bhat, H. Weissig, I.N. Shindyalov, P.E. Bourne (2000) The

Protein Data Bank, Nucleic Acids Research, 28: 235-242. Data extracted: 31-07-2016

Page 28: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

14

make use of experimentally determined protein structures. For more information see the

following section, 4.1.

Computational structural biology

Computational structural biology (CSB) offers an alternative to X-ray crystallography and

NMR for determining the structure of proteins. In CSB computer modelling an All-atom

Molecular Dynamic (MD) simulation are replacing empirical experimentation and data

acquisition with computation to generate structures of folded proteins. Determining a protein

structure purely in silico by MD relays on the knowledge of the factors and forces driving the

protein folding for the simulation. As demonstrated by Leviathans paradox, see 3.1, there is a

near endless number of possible structures for even short peptides. To do modelling protein

folding from linear peptides through folding to a native structure is extremely calculative

demanding, and a major limiting factor for the success of MD. The process of modelling

protein folding from only a known amino acid sequence is known as “Ab initio” (Bonneau &

Baker 2001) from Latin for “from the beginning”4. To circumvent the high demand of

calculation power needed to model protein folding Ab initio several methods has been

developed that simplify the protein to reduce the variables, or use data from all ready solved

structures to guide the modelling of novel structures. The following is an introduction of a

few general methods for in silico determination of novel protein structures.

Homology modelling (HM) is one such “short cut” that employs sequence homology of the

target protein to proteins with solved 3D structure. As a result, the final 3D model produced

by HM is affected by the quality, resolution(Å), of the structures used for the modelling of the

novel structure. HM can be simplified to three main steps (Khan et al. 2016): 1| Identification

of a suitable template(s), which is done by searching databases for homologues peptide

sequences with solved structures. An identity >40% are needed for a good result. 2| Model

building, based on the structure of the template a new 3D model is built e.g. by using the

peptide backbone as a guide. 3| Model evaluation and refinement. Here the thermodynamic

plausibility of the model is tested, if the novel structure fails, then energy minimization

algorithms are used to refine/tweak the novel structure until the result is thermodynamic

favourable.

4 https://en.oxforddictionaries.com/ homepage of English Oxford Living Dictionaries, accessed 28-11-2016

Page 29: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

15

One method that uses a simplified model of the protein to

estimate the structure for Ab initio structure modelling is

Lattice models. Here the peptide chain is reduced to its

simplest forms, ball and chain model and placed in a

lattice structure, a systematic arrangement in 2D or 3D

space (Dill et al. 1995). With this model, it is possible to

model interactions based on sequence, not at atomic level

but at a functional level, as each amino acid is treated at a single “atom”. In its simplest form,

amino acids are reduced to hydrophobic (H) or Polar (P). The ball and chain model of the

peptide can now be placed in its lattice grid (Figure 11) and each possible conformation can

rapidly be evaluated through a simple sum of contacts between adjacent balls as each possible

case is an assigned energy contribution, e.g. -1,0,0 for HH, HP, PP respectively. In this

simplified model the most likely structure, based on hydrophobic interactions is the one with

the lowest total energy. Bond angles and lengths, are defined by the lattice used in the

simulation and not the properties of the actual bond. Lattice based models allows only the

mapping of the peptide backbone in a rapid and calculation light manner compared to

complete atomic modelling.

Discrete state off-lattice models are a simplified versions of MD, in contrast to true MD the

complexity of these models are reduced (Thirumalai & Klimov 1999). Usually by limiting the

sterically conformation, and bond lengths of the side chains, while the back-bone

conformation is also limited to specific phi/psi angels. As a result, the degrees of freedom in

the simulation is reduced and the calculation time for sampling the conformational space in

reduced. In contrast to lattice based modelling this allows for the determination of secondary

structure as well as higher order structure as the amino acid side are included in the

modelling.

All-atom Molecular Dynamic (MD), the pinnacle of off-lattice models. Here the complexity

of the model is complete, meaning that all atoms have their movement and interaction

individually modelled. Theoretically this allows for the prediction of structures at all levels of

complexity. As a result of the high complexity MD is very calculating heavy, especially as the

number of atoms increases, and this limits it success. In an effort to overcome this problem

the distributed computing project: Folding@home was started in 2000 by Vijay Pande out of

Figure 11 3D Lattice model of hydrophobic

and polar interaction. (Yue et al., 1995

from Dill et al 1995)

Page 30: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

16

Standford University, currently ~85000 users provide ~77000 Teraflops5. This has allowed

simulations to exceed the previously short time scales of nano- and micro-seconds to

milliseconds (Voelz et al. 2010)

Beside the difference in complexity between lattice and off-lattice based models they differ

significantly in how they work when they predict the structure. Off-lattice based models are

time dependent simulations of the protein folding while lattice based models often are Monte

Carlo based. Monte Carlo based models are dependent of random sampling, in this case

sampling of protein conformations. The use of this method is only possible due to the reduced

complexity of the models, otherwise Leviathan’s paradox would also apply to the simulation,

however here the time scale will depend on the calculating power available. The simulation of

protein folding in MD is done with a force field that also account for atomic momentum. As a

result, it is possible to simulate changes in the structure as a function of time (Allen 2004).

Structural fragment modelling, contrast the other Ab initio methods by not using a simplified

or constrained model to simulate the folding of the protein. Structural fragment modelling is a

statistical based method. For this method, structural information, of a large set of non-

homologues protein structures, are divided into fragments, and sorted into recurring structural

elements. These recurring elements form a library of building blocks/motifs that can be

applied to a peptide sequence to build a peptide back bone. Once a back bone has been built

side chain conformation can be estimated by using preferred rotameter conformations

databases (Baeten et al. 2008).

A common feature for all the methods described above is in the use of energy force

fields/functions. Eq. 10 is an example of such an empiric force field use to calculate the

Gibbs-free energy of folding (Schymkowitz et al. 2005). In all cases but MD they are used for

evaluating the quality of the structure. In MD the force field directly involved in the

simulation resulting in a structure.

5 Fold@home homepage: http://folding.stanford.edu/home/ Date. 09-08-2016

Eq. 10 𝛥𝐺 = 𝑊𝑣𝑑𝑤 ⋅ 𝛥𝐺𝑣𝑑𝑤 + 𝑊𝑠𝑜𝑙𝑣𝐻 ⋅ 𝛥𝐺𝑠𝑜𝑙𝑣𝐻 + 𝑊𝑠𝑜𝑙𝑣𝑃 ⋅ 𝛥𝐺𝑠𝑜𝑙𝑣𝑃 + 𝛥𝐺𝑤𝑏 + 𝛥𝐺ℎ𝑏𝑜𝑛𝑑

+ 𝛥𝐺𝑒𝑙 + 𝛥𝐺𝐾𝑜𝑛 + 𝑊𝑚𝑐 ⋅ 𝑇 ⋅ 𝛥𝑆𝑚𝑐 + 𝑊𝑠𝑐 ⋅ 𝑇 ⋅ 𝛥𝑆𝑠𝑐

Where W are weighs. vdw: Van der Wells interactions, solvH: solvation of hydrophobic residues, solvP:solvation of polar

residues, wb: Water atoms with +2 Hydrogen bonds to the protein, hbond: hydrogen bonds, el: electrostatic, Kon:

electrostatic interaction from protein-protein interaction, mc: entropy of peptide bond angels (phi-psi), sc: entropy side

chains

Page 31: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

17

The use of these force fields is not limited to modelling of ab initio structures. They also play

an important role in in silico protein engineering, to evaluate the effect of mutations on the

thermodynamic of protein folds, see section 5.2.

5 Protein engineering

While nature is vast in terms of diversity, most enzymes of not optimised for industrial

processes. With protein engineering we can modify the amino acid sequences of proteins,

thereby changing their functionality. The aim of these modifications are usually to improve

the protein function in a specific context. Hence, stability, activity and substrate specificity

are common targets. While the modifications of natural enzymes are useful, the design of

novel enzymes engineered for specific applications provide near endless possibilities for

enzyme applications. An enzyme has already been made that catalysed reactions not

previously observed in nature (Siegel et al. 2010).

Protein engineering usually made by one of two methods or a combination of the two,

directed evolution or rational design Both types of protein engineering will be introduced in

the following section.

Directed evolution, traditional

Through selective breeding, many organisms have been adapted to our preferences e.g. higher

yield, temperament, physical appeal, etc. Hence, once an understanding of the functionality of

DNA was obtained the next step was the application of methods to speed up the evolutionary

process.

Chemical mutagenesis paved the way for modern DNA based methods, by demonstrating the

acquisition of advantageous phenotypes (Cobb et al. 2013). The idea behind direct evolution

is the induction of random genetic changes, diversification, followed by a selective process

where improved mutants are identified by screening. This process can be repeated in a

circular fashion, to further increase (Eijsink et al. 2005). There are many methods by which

genetic diversification can be obtained, chemical, irradiation, error prone PCR and other PCR

based methods, gene shuffling, etc. (Cobb et al. 2013). The true power of direct evolution is

the vast number of different mutants generated, thereby increasing the chance of wanted

improvements. The effectiveness of directed evolution relies heavily on the availability of a

high-throughput screening methods for the selection of advantageous mutants. The screenings

Page 32: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

18

methods will most likely vary greatly from case to case. Directed evolution can be employed

at organism and protein level, e.g. improving the growth rate of industrial relevant organisms

on cheap mediums, or stability of an enzyme respectively.

Modern direct evolution usually follows a twostep approach, 1| library generation and 2|

library screening (Cobb et al. 2013).

Rational design, modern

Rational design contrast direct evolution, by not being random and producing a comparable

small number of mutants to screen. With rational design, knowledge about the target protein

or organisms is used for the selection of changes to be made to achieve the sought-after effect

e.g. specific amino acid substitution to increase enzyme stability (Wijma et al. 2014) or

eliminate an unwanted metabolic pathway in improve yields (Sánchez et al. 2005).

The studies by (Wijma et al. 2014) is a good case story of the use of structural information

and CSB. This also highlights the importance of structure knowledge for efficient rational

design. Wijma and colleges used a combination of computational methods to evaluate amino

acid substitution effect on protein stability, prediction of disulphide bonds, and MD

simulation to eliminate mutations that result in increased protein stability. Furthermore, they

employed a manual curation of the 248 point mutations, to eliminate any mutation with

features know to be destabilising. In total, 64 point mutations were selected by experimental

verification. Where 21 showed an increase in stability. In the end through combining the

mutations an increase of TM of 35.5°C were obtained (Wijma et al. 2014). In the case

highlight above the empirical force field, Eq. 10, were used to estimate the effect of amino

acid substitutions of the stability of the protein structure.

To be able to predict changes to increase the stability of an enzyme is all well and good. And

it can be highly successful as with the example from (Wijma et al. 2014) to generate a

hydrolase with a temperature optimum 80°C and a long half-life at temperature over 70°C.

However, evolution has produced organisms that lives comfortably at temperature above

80°C, and ever higher. The subject of the next sections is the adaptation of proteins to these

extreme temperatures.

Page 33: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

19

6 Protein adaptation to extreme temperatures

All the before mentioned factors that works to stabilise proteins are general for all proteins

whether they originated from a mesophilic or a thermophilic organism, moderate and high

temperature loving organisms respectively. As a consequence of the high temperature of their

habitat, thermophilic organisms have evolved proteins with an intrinsic high thermal stability.

This is a result of an otherwise unviable increase in protein metabolism that the organisms

would have to maintain. In order to continually replenish its pool of active enzymes, if it

enzymes had stability similar to those of mesophils. Generally speaking high thermal stability

is associated with stability in general e.g. also towards denaturants such as detergents, urea

and organic solvents and long shelf life of purified protein (Vieille & Zeikus 2001). An

example of this is seen in the rational engineering to increase thermostability of a hydrolase

from Bacillus stearothermophilus. Here the half-life of the enzyme at 70°C was improved

with 12.7 fold, by a single point mutation. This increase in stability was accompanied by an

increase in resistance to non-ionic detergents of ~2 fold (Ece et al. 2015). Due to the high

stability and resistance to e.g. detergents of their enzymes, thermophiles and

hyperthermophiles are currently experiencing an increase in interest, as they are potential host

for enzymes of significant industrial interest. Beside the identification and characterisation of

enzymes with high stability much effort has also been put into deciphering the molecular

determinants of this increased stability. The hope is to identify general mechanisms that in

time can be applied in protein engineering and cell factory optimisation to improve the

stability of all enzymes, hence not relaying on the identification or random generation of new

enzymes with high stability. Some investigations have looked at thermophilic organisms in

general, to discern how thermophilic protein achieve their stability. While other have

distinguished between thermophiles and hyperthermophiles or bacteria and archaea, in order

to filter out convergent evolution that might obscure individual mechanisms.

A consensus is emerging where no single mechanism is responsible for the high stability of

thermophilic proteins. Instead optimisations of many different interactions and mechanisms

are responsible for the increased stability (Petsko 2001). In this section I will summarise what

is known about these protein traits and their association with high stability and a correlation

with high optimal growth temperature. Cellular traits like DNA composition and

condensation, plasma membrane chemistry, and specific enzyme activities will not be

discussed here.

Page 34: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

20

Amino acid composition

As we progress further into the post genomic age, and more genomes of thermophiles become

available a logical avenue of investigation is to look for differences in the amino acid

composition of mesophiles and thermophiles. The most significant tendencies that has come

from these investigations indicates that thermophilic protein has an overrepresentation of

charged amino acids (Szilágyi & Závodszky 2000; Fukuchi & Nishikawa 2001; Nakashima et

al. 2003; Suhre & Claverie 2003; Cambillau & Claverie 2000; Tompa et al. 2016; Taylor &

Vaisman 2010).

An analysis of the localisation of the amino acids within the protein structure of 47

thermophilic, optimal growth temperatures between 65°C and 70°C, and 144 mesophilic

proteins. Here they found an overrepresentation of charged residues on the surface of ~17%

compared to the mesophilic proteins. This correlated with an 15% and 17% increase of

charged residues in the interior and the general composition of the proteins, respectively

(Fukuchi & Nishikawa 2001) indicating a general increase in charged residues. This increase

in charged residues is compensated for by a general decrease of polar residues. For both

thermophilic and mesophilic proteins ~45% of amino acids fall in to one of those two

categories, polar or charged. Of the charged residues on the surface of thermophilic proteins,

glutamate, arginine, and lysine where overrepresented while aspartate was underrepresented.

This elimination of aspartate from the surface of the protein contrast the situation in the

interior where aspartate is overrepresented compared to the three other residues (Fukuchi &

Nishikawa 2001). An increase in the usage of branched hydrophobic residues was also

observed, i.e. alanine elimination was compensated for primarily by an increase in valine but

also leucine and isoleucine content (Fukuchi & Nishikawa 2001).

Many of the same tendencies have been reported in other studies, (Cambillau & Claverie

2000) compared mesophilic and hyper-thermophilic optimal growth temperatures between

83°C and 97°C proteins. (Szilágyi & Závodszky 2000) has the same general results but fail to

identify an increase in lysine usage of thermophilic enzymes of organisms with a growth

temperature below 100°C. (Nakashima et al. 2003) were able to demonstrate an almost linear

correlation between optimal grown temperature and the amino acid composition, in the

proteome of organisms living between 10 and 103°C. The major positive contributors to this

correlation were charged and hydrophobic amino acids.

Page 35: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

21

To summarise, the charged residues, arginine, lysine and glutamate, and the branched

hydrophobic, leucine, isoleucine, and valine have been positively selected for in thermophilic

proteins. While aspartate together with the polar residues have been counter selected.

Rationalisation of these differences along with other traits related to stability of thermophilic

proteins will be highlighted in the following sections.

Counter selection of thermolabile amino acids

The above mentioned counter selection of aspartate, asparagine, and glutamine in

thermophilic enzymes might be due to the thermolability of these amino acids, as reviewed in

(Daniel et al. 1996). At high temperatures asparagine and glutamine are labile to heat induced

deamination, while aspartate and asparagine also can participate in heat induced hydrolysis of

the peptide back-bone. However, these temperature-driven degradations only become

significant at temperatures above 80°C at neutral pH. The thermolability of these amino acids

can however be reduced or prevented by their local environment e.g. sterically hindrance of

the formation of intermediates, stabilisation by hydrogen bonding, or the presence of co-

factors like Ca+ ions.

Increased number of charged amino acids

The increase in the number of lysine, arginine, and glutamate residues in thermophilic

proteins, can be an indication of an increased number of ionic interaction and salt-bridges,

that serve to stabilise thermophilic proteins. This correlation has been observed in some cases

but is not universal (Xiao & Honig 1999). The stabilising effect of ionic interaction is

challenged by a high cost upon formation, due to desolvation (Hendsch & Tidor 1994), and

loss of entropy as a result of increased sterically movement (Sun et al. 1991). The formation

of a salt-bridges can compensate for the cost by the formation of hydrogen bonds between the

charged groups. The stabilising effect of salt bridges were evaluated in a study containing 222

salt-bridges from 36 proteins, primarily from mesophilic organisms. This showed that the

majority of the tested salt bridges were stabilising (Kumar & Nussinov 1999). 32 of the 222

salt bridges or 14% were determined to be destabilising the protein fold. 30% of the salt

bridges were buried and 88% of these were deemed to be stabilizing. And once the

desolvation cost was payed, the average salt bridge contributes to the stability of the folding

with 4 kcal/mol, but much higher contribution is also possible. In an addition study an inverse

correlation between temperature and desolvation cost of charged residues, thereby making salt

bridges contribute more to the stability at high temperatures and more advantages for

Page 36: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

22

thermophiles then mesophiles (Elcock 1998). These observation correlates with an increase

in the number of salt bridges in thermophilic proteins compared to mesophilic (Kumar et al.

2000).

Optimisation of the ionic interaction have been show to play a role in the stabilisation of

thermophilic enzymes, the optimisation of ionic interaction through the elimination of

unfavourable repulsion between residues with the same charge, has been proposed as a

relative simple mechanism, compared to the introduction of additional stabilising ionic

interaction (Perl & Schmid 2001). As the elimination of unfavourable repulsion aren’t

associated with the same desolvation and entropic cost as the formation of ion-pairs and salt-

bridges (Perl & Schmid 2001). The optimisation of these interactions was also seen when the

comparing the contribution to stability of ionic interaction from mesophilic a thermophilic

enzymes, however there were many different methods to achieve these optimisations,

compensation of desolvation by polar interaction with additional non charged residues,

increase in the number of ionic interaction, reduction of buried charged groups (Xiao &

Honig 1999). The localisation of ionic interaction on the surface of thermophilic enzymes

were found to be an identifiable trait of thermophilic enzymes (Taylor & Vaisman 2010), also

indicating differences in the utilisation of ionic interactions.

Lysine methylation:

Lysine methylation has been suggested to increase the stability of protein by strengthen ionic

bond formation, and preventing aggregation (Febbraio et al. 2004). Proteins from hyper-

thermophiles archaea of the order Sulfolobales show extensive lysine methylation (Maras et

al. 1992; Botting et al. 2010; Guo et al. 2008; Baumann et al. 1994; Febbraio et al. 2004). In

the case of sul7d, a histone like protein found in Crenaraeota, the degree of lysine methylation

seemed to be dependent of the growth temperature (Baumann et al. 1994). The observed

temperature dependent methylations seemed to be independent of the amino acid sequence

around lysine’s in α-helix’s (Febbraio et al. 2004; Botting et al. 2010). These observations are

in good correlation with the identification of the first crenarchaeal lysine methyl transferase,

aKMt, which activity appears to be unspecific but tending towards α-helixs (Chu et al. 2012;

Niu et al. 2013). The widespread lysine methylation and the unspecific nature of the aKMT,

contrasts the previously observed specific methylation observed in bacteria and eukaryotes.

aKMT is responsible for the vast majority of lysine methylation in S. islandicus (Chu et al.

2016). This has led to the hypothesis that the lysine methylation might be related to increased

Page 37: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

23

thermos ability. The research presented later in the thesis show that in vivo lysine methylation

has a positive, if minor, effect on the stability of thermophilic esterase EstA from Sulfolobus

islandicus (Stiefler-Jensen et al. 2016). These data correlate with recent in vitro experimental

evidence suggest a correlation between stability and lysine methylation (Xia et al. 2015). Here

in vitro lysine methylation was reported to improve the stability and activity of MCM from S.

islandicus when purified from E. coli. In contrast β-Glycosidase from S. Sulfolobus showed

no correlation between stability and lysine methylation (Moracci et al. 1995). Additionally, it

has been show that aKMT mediated lysine methylation is dispensable for growth at

temperatures up to 75°C on optimal growth media (Chu et al. 2016). All the evidence above

suggests, that lysine methylation in S. islandicus only play a minor role in heat tolerance.

Structural optimisation

Various structural optimisations have been linked to the high stability of thermophilic

proteins. These include reduced flexibility and minimisation of the protein structure along

with cavity filling. The reduction of cavities within the protein structure has in some cases

been demonstrated to be associated with increased stability (Vieille & Zeikus 2001; Szilágyi

& Závodszky 2000; Rhee et al. 2006; Ece et al. 2015). A reduction of cavities in thermophilic

proteins was indicated to be facilitated by an increase in amino acid with extended side chains

(Tompa et al. 2016). Reducing the flexibility of the protein structure is believed to be a

mechanism employed to increase protein stability of thermophiles (Scandurra et al. 1998). A

general reduction in protein flexibility comes with a cost, the activity of enzymes is often

depended upon conformational changes e.g. upon substrate binding and release. If this

flexibility is not maintained to some degree at physiological temperatures the effectiveness of

the enzyme will be compromised. The negative effect of low flexibility can be observed

through the low activity of thermophilic enzymes at low temperatures (Závodszky et al.

1998). Is has been proposed that, to counteract this negative effect on activity the reduction of

flexibility is limited to structural areas that do not interfere with activity, this mechanism is

observed both for thermophilic and mesophilic enzymes (Karshikoff et al. 2015).

Reduction of the peptide chain and shortening of surface loops has also been proposed as a

method for increasing thermostability (Chakravarty & Varadarajan 2000; Thompson &

Eisenberg 1999; Taylor & Vaisman 2010; Walden et al. 2001). Preferences in secondary

structure related to increased stability in not clearly evident. Some data do suggest a weak

Page 38: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

24

correlation between an increased presence of α-helix and β-strands and a reduction of

irregular structures (Szilágyi & Závodszky 2000; Chakravarty & Varadarajan 2000).

Hydrogen bonds

A 2016 study did not find a significant difference in the amount of hydrogen bonding

observed in a set of 373 thermophilic and mesophilic proteins (Tompa et al. 2016). It was

however shown that hydrogen bonds were utilized very differently by the two classes of

proteins. Thermophilic hydrogen bonds were to a higher degree used for anchoring of the

protein surface to the interior. In contrast mesophilic proteins utilised hydrogen bonding for

stabilising the protein core. An earlier study, on a much smaller dataset, also concluded that

there was no difference in the number of hydrogen bonds. It did however, find indications that

hydrogen bonding tended to be better optimised in thermophilic proteins, as unsatisfied

hydrogen bond donor and acceptor minimisation seemed correlated to optimal growth

temperature (Szilágyi & Závodszky 2000).

Hydrophobicity

Optimization of hydrophobic interactions has long been considered a trait of thermophilic

proteins (Perutz & Raidt 1975). Examination of protein structures of both thermophilic and

mesophilic proteins show that most thermophilic proteins had a better minimization of apolar

atoms accessible to the solvent (Spassov et al. 1995). In a later study it was shown that

thermophilic proteins are more stabilised by desolvation of side chain carbon, minimisation of

unfavourable apolar interaction, atoms than homologues mesophilic proteins (Tompa et al.

2016). As well as a stronger hydrophobic core (Taylor & Vaisman 2010). That hydrophobic

interactions are better optimised and more widespread in thermophilic protein correlates well

with the increased strength of hydrophobic interactions at high temperatures (Baldwin 1986;

Shimizu & Chan 2002).

Increased number of disulphide bonds

Disulfide bonds are generally associated with extracellular proteins due to the reducing nature

of the cytoplasm. Nevertheless, it seems that thermophiles, especially Archaea, make use of a

disulphide bond in cytoplasmic proteins. This is evident both in the frequency of cysteines

pairs in close proximity and the presence of genes that promote cytoplasmic formation of

disulphide bonds (Beeby et al. 2005; Jorda & Yeates 2011). Both native and engineered

Page 39: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

25

disulphide of cytoplasmic proteins have been shown to be enhancing protein stability (Hattori

et al. 2015; Yin et al. 2015; Cao et al. 2015).

Oligomerisation

Oligomerisation has been mentioned as a possible mechanism for increasing protein stability

(Walden et al. 2001; Tanaka et al. 2004; Korkhin et al. 1999; Villeret et al. 1998). It is

however not necessarily detectable in studies with large numbers of proteins (Kumar et al.

2000). This indicates that oligomerisation likely is a high specialised mechanism for

thermophilic adaptation.

The stabilising mechanism of oligomerisation is very similar to that of hydrophobic

interactions, see section 3.3. As water is perturbed from is bulk state, with only water to water

hydrogen bonds, to form a hydration shell around a protein, where water to protein hydrogen

bonds are formed, result in an increase in Gibbs free energy of the shell water due to loss of

entropy, Sbulk > Sshell → 0 > ΔSbulk→shell, and a thermodynamic unfavourable state, ΔGbulk→shell.

As the amount of shell water is determined by the solvent accessible surface arear (SASA) a

reduction of this surface will, like it was the case for hydrophobic interaction, result in an

increase in entropy, 0 < ΔS, and a more favourable thermodynamic state. We know form Eq.

3 that for a state change there is an inverse correlation between ΔS and it contribution to the

stability of the state change as well as a temperature dependency, -TΔS. Hence, as for

hydrophobic interaction the stabilising effect of reducing the SASA is strengthened by

increasing high temperatures.

Comparison of mesophilic and thermophilic protein-protein interfaces have shown that

additional hydrophobic and polar interaction stabilises the interface while the number of ionic

interaction are more or less similar between the two groups of proteins (Maugini et al. 2009).

Is has also been observed that cross-linking of the protein-protein interface via disulphide

bonds increase the stability of the proteins (Cao et al. 2015; Stiefler-Jensen et al. 2016).

7 Enzymes of special interest

In the following two section, two enzymes of special interest will be introduced, as they both

play a central role in the research presented in the last part of this thesis. Both enzymes are

native to the hyper-thermophilic and acidophilic crenarchaeal archaea Sulfolobus islandicus.

The first enzyme an esterase(EstA), was first characterised in 2012 by Mei et al. 2012. The

Page 40: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

26

results from that study made the foundations for this thesis. The second enzyme a lysine

methyltransferase (aKMT/aKMT4), was first identified as the first archaeal methyltransferase

in 2012.

EstA, an Archaeal esterase

EstA, WP_014511906, from S. islandicus is an

esterase with a classic alpha/beta hydrolase

fold6. with a central twisted β-sheet of 8

strands supported by α-helixes on both side

(Ollis et al. 1992), see Figure 12. The EstA

protein consists of 305 amino acids in its

native form as encoded by the gene

SIRE_RS01465. Or when expressed

recombinantly as a 314 amino acid large

fusion protein with a C-terminal 6x histidine

tag. EstA catalyses the hydrolysis of a

carboxylic ester through the consumption of a

single water molecule, as per EC3.1.1.1, see

Figure 13.

In 2012, EstA were reported to be exceptional stable both in regards to thermal and organic

solvents inactivation (Mei et al. 2012). At 90°C the recombinant enzyme remained active

after more than 24 hours’ thermal treatment, with a half-life of 16 hours, when expressed

from its native host, the hyperthermophilic Archaea S. islandicus(SiEstA). SiEstA purified as

a dimer in solution, and had a temperature optimum at 90°C, well above the growth

temperature of S. islandicus at 80°C. The high stability of SiEstA was also characterised by

the presence of dimers during SDS-PAGE analysis. The activity of EstA was only reduced

with up to 20% in the present of 5 mM 1,4-dithiothreitol (DTT), Urea, and mercaptoethanol.

Diaminoethane-tetraacetic acid

6 Marchler-Bauer A et al. (2015), "CDD: NCBI's conserved domain database.", Nucleic Acids

Res.43(D)222-6

Figure 12 EstA Subunit. The α/β hydrolase fold of EstA,

PDB ID 5LK6

Page 41: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

27

(EDTA) at 5 mM reduced that activity to

58% while phenylmethanesulfonyl fluoride

(PMSF) was the strongest inhibitor of SiEstA

and reduced activity with 90%. SiEstA was

able to hydrolyse p-Nitrophenyl ethylester

with ethyl chain length from two to 16 carbon atoms. The highest activity was observed with

a six carbon ethyl chain, p-Nitrophenyl Hexanoate.

In the Mei et al. paper from 2012, recombinant EstA from E. coli (EcEstAI) was also

analysed. In all aspects EcEstAI preformed worse than SiEstA, except in regards to resistance

to Urea and 1 mM EDTA. Here EcEstAI showed similar performance to SiEstA. The optimal

temperature of EcEstAI was determined to be 65°C, and the half-life at 90°C were 30

minutes, 32 fold lower than that of SiEstA. EcEstAI purified as a monomer contrasting the

dimer formation by SiEstA. In the paper the differences in thermostability were hypothesized

to be a result of different or absent PTM when EstA is purified from E. coli. The stability of

esterase of other thermophiles and hyperthermophiles has been the subject of investigation a

number of times in the past and reviewed in 2009 by (Levisson et al. 2009). In general, the

stability varies a lot, half-life of esterases from the genus Sulfolobus range from

approximately 40 to 6000 minutes at 80°C, half-lives of 50 hours at 90°Chas also been

reported. The closes homology to EstA investigated previously, with a 99% identityscore, had

a half-life of 40 min at 80°C and close to 10 minutes at 90°C, when expressed from E. coli in

a multimeric form (Kim & Lee 2004). Lipases and esterases share many of the same

properties, the major difference is in the ability of lipases to utilize substrates not in solution.

Both classes of enzymes catalyse the hydrolysis of ester bonds through similar mechanisms.

Carboxylesterases are currently best

described by their amino acid sequence

and are divided into three families: LPL-,

EST-, and HSL-family, denoted by L, C,

and HSL respectively, EstA is a member

of the HSL family. Historically lipases and

esterase’s has been classified by the

substrate specificity, however the

difference is more related to the physical

state of the

Figure 13 Carboxylic-ester hydrolase reaction, EC3.1.1.1.

The reaction catalysed by EstA.

Figure 14 EstA Dimer and Dimer interface. The EstA Dimer

with the 2 subunits, cyan and green, with the interface itself in

red, PDB id 5LK6

Page 42: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

28

substrate, soluble/insoluble (Chahinian & Sarda 2009). Lipases/esterase’s have an important

place in industrial use. Esterase’s are used in multiple steps during the production of recycled

paper, and in the biosynthesis of organic compounds, the production of wines and juice in the

food industry (Panda & Gowrishankar 2005; Hasan et al. 2006; Damhus et al. 2008).

aKMT, an Archaeal lysine methyltransferase

In 2012, the first lysine methyltransferase from a

crenarchaeal, were identified and characterised from S.

islandicus (Chu et al. 2012), see Figure 15 for the

structure. The protein

had a novel N-terminal catalytic domain, and was named

aKMT for Archaeal lysine MethylTransferase, with the

one-letter abbreviation of lysine: K in place of L. aKMT

is encoded by the gene SiRe_1449, and consist of 161

amino acid and has a molecular mass of 18.2 kDa.

aKMT methylates a broad range of proteins from

Sulfolobus, when purified both from the native host and

from E. coli. bovine serum albumin (BSA) was however not methylated by aKMT. Four

month later in 2013 a different group reported the identification of aKMT through sequence

similarities with aKMT4/Dot1 methyltransferase from eukaryotes and named it aKMT4 (Niu

et al. 2013). They demonstrated aKMT/aKMT4 auto-methylation activity, which regulate its

own activity. Auto-methylation was inhibited by the presence of DNA. The sequence

unspecificmethylation of aKMT/aKMT4

were confirmed in a study from 2016, see

Figure 16A, however a bias towards α-Helix

structures were observed, see Figure 16B.

The same study also investigated the effect

of aKMT/aKMT4 deletion on the proteome

of S. islandicus and found no correlation

aKMT/aKMt4 mediated methylation and

increased thermostability (Chu et al. 2016).

Figure 15 aKMT structure. aKMT interacting with

SAM (not shown). PDB id 5FA8(Chu et al. 2016)

Figure 16 Sequence and structural bias of aKMT

methylation. A| Amino acid sequence bias around aKMT

mediated. methylation. B| Predicted structural bias around

aKMT mediated methylation. (Chu et al. 2016)

Page 43: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

29

8 Summary of results

The aim of this project was to investigate, and if possibly, identify post-translational

modifications and/or other factors, not related to the primary structure, that are responsible for

the high stability of proteins expressed from the thermo- and acido-philic crenarchaeal

organism S. islandicus.

As a representative of the proteome of S. islandicus an esterase EstA, recently characterised

and shown to possess host dependent structural stability, i.e. 32 fold reduced thermos stability

when expressed in the mesophilic host E. coli were used.

As an initial step to gain insight into the post-translational modifications were the

determination of the crystal structure of EstA when expressed from its native host. This was

done in collaboration with Kaare Teilum group from the Structural Biology and NMR

Laboratory of University of Copenhagen. The work was led by PhD. Student Troels Schwarz-

Linnet. The project resulted in a structure of EstA in a double dimer conformation at a

resolution of 2.6 Å, covering residue 2 to 303 of the 314 amino acid large his-tagged

recombinant EstA. The structure has been submitted to The Worldwide PDB7, as 5LK6 where

it awaits publication. The crystal structure did not contain any discernible post-translational

modification or co-factors of EstA.

Optimisation of esterase assay

In order to assure optimal data acquisition the assay used previously to study EstA, (Mei et al.

2012), were optimised. This resulted in a robust and minimalistic assay, with a minimum of

substrate auto-hydrolysis. The conditions for data collection were also improved by

eliminating light scattering. The performance of the buffer at high temperatures, 90°C were

also improved.

The assay has been used in volumes ranging from 250 µl to 1 ml and is suitable for real-time

data acquisition in a temperature range from 25-90°C, by directly measuring the substrate

turnover by absorption at 410 nm.

7 http://wwpdb.org/

Page 44: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

30

Reaction mix: 50 mM NaH2PO4, 25 mM Sodium lauroyl sarcosinate (Sarkosyl), 10% v/v 2-

propanol, 1 mM p-Nitrophenol, 0.02 µg/ml EstA. As an alternative to sarkosyl can 1% Triton

X-305, Triton X-305 is a non-ionic detergent, hence less disruptive of the protein structures.

Expression of soluble dimeric EstA for E. coli host

One of the previously difficulties encountered in expression EstA from E. coli was its

apparent insolubility when expressed in this host. To overcome this EstA were previously

expressed and purified from E. coli as a fusion-peptide of EstA and thioredoxin (Trx). This

resulted in difference in the amino acid sequence of the mesophilic and thermophilic

expressed EstA, even after proteolytic removal of the Trx-tag. Additionally, the enzyme

purified from E. coli was monomeric and not dimeric as when purified from the native

thermophilic host. I order to assure a proper comparison between the 2 expression systems, E.

coli and S. islandicus, the differences between the enzymes were eliminated. Result in

expression of EstA from both meso- and thermo-philic hosts with sequences identical to the

dimeric recombinant EstA, previously purified from S. islandicus (SiEstA). This resulted in

the purification of dimeric EstA from E. coli (EcEstA) by expression for 2 hours at 30 or

37°C by induction with 1 mM IPTG. In general, 0.9 mg/l EstA were obtained at a

concentration of ~300 µg/ml.

Lysine methylation as a factor in protein stability

The initial hypothesis that lysine methylation might be a factor that positively affected the

stability of EstA were investigated through 2 experimental paths; 1| Comparison between

recombinant EstA expressed from, a mesophilic host, E. coli, without any cytoplasmic lysine

methylation, and the native thermophilic host, S. islandicus, with cytoplasmic lysine

methylation. 2| Comparison between EstA expressed from the native host, S. islandicus, with

and without cytoplasmic lysine methylation. Both experimental paths compared methylated

protein from S. islandicus with a none-methylated variant while the second path also

eliminated host dependent differences.

The methylation states of EstA were confirmed by mass spectrometry analysis. All three

variants showed similar temperature optimums at 75°C and half-life at 90°C of approximately

30 minutes. Kinetically the Sulfolobus expressed variants of EstA were similar while EstA

from purified from E. coli showed a reduced kinetics. From our in vitro stability analysis of

EstA with and without in vivo, methylation it is evident that lysine methylation plays a small

Page 45: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

31

role in stabilising the enzyme. The methylation is however not a major determinator for

thermostability. The differences in half-life of methylated and none-methylated EstA purified

from the native host is only 1.08 fold in favour of the methylated enzyme. In addition, the

kinetic of the enzyme does not seem to be affected by the methylation when the enzyme is

expressed from the native host.

Investigation mesophilic expressed EstA increased sensitivity to detergents.

It was observed that EstA purified from E. coli had an increased sensitivity towards

detergents, as the presence of 25 mM sarkosyl resulted in a 2.5 fold reduction in half-life at

90°C. Anion-exchange chromatography analysis did not yield any information about any

possible factors responsible for the reduced half-life. Isoelectric focusing gel electrophoresis

showed that EstA purified from E. coli was more homogenic compared to EstA purified from

S. islandicus, regardless of methylation state, indicating a possible link between protein

charge and detergent resistance and possibly the reduced kinetic of EcEstA.

The Effect of dimer integrity on the stability of EstA

Several mutations in EstA’s dimer interface were made in an effort to generate monomeric

EstA, to determine that stabilising effect of the oligomeric state of EstA. None of the

mutations converted EstA to a monomer, however amino acid substitution of phenylalanine

269, located in the dimer interface, severely compromised the stability of EstA.

In the reverse experiment, stabilisation of the dimer interface successful through the

substitution of S295 to cysteine thereby generating an intra molecular disulphide bond. The

introduction of this novel cysteine bridge increased the kinetic stability of EstA resulting in an

increase of EstA half-life from 12.3 to 20 minutes at 90°C. Thereby demonstrating a

stabilising effect of dimer formation of EstA. Amino acid substitution of S266 & N270 and

G261 to cysteine and aspartic acid, respectively. In contrast to the substitutions of

hydrophobic residues in the dimer interface the introduction of an inter-subunit di-sulphide

bond did not affect the activity of the mutated esterase.

9 Conclusion & Perspective

From the data presented in the following chapters and highlighted above we can see that in

the case of the hyperthermophilic esterase from S. islandicus, EstA, PTM and more

specifically Lysine mono methylation, do not play a critical role in the stability of hyper-

Page 46: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

32

thermophilic enzymes from crenarchaeal. It is however, evident that the esterase is stabilised

by the methylation. That Lysine methylation not by itself provides a high degree of stability to

crenarchaeal enzymes is not surprising. As a conglomeration of small optimisations, that are

more easily acquirable overtime is the more likely explanation for the development of hyper-

stable enzymes.

If this unspecific lysine methylation serves additional biological functions, beside the increase

it does provide to the stability, this function remains unknown. As in the case of this esterase

the methylation is not causally linked to changes in enzyme activity.

Even though the positive effect of this unspecific lysine methylation only is minimal, it would

be interesting to investigate if this protective effect can be transferred.

It has already been shown that the methyltransferase aKMT4 from S. islandicus is active and

able to methylate co-expressed Sulfolobus enzymes in E. coli. However, to my knowledge it

has not been tested whether these enzymes gain any stabilising effect from the in vivo

methylation in mesophilic host. Additionally, it would be interestingly to test whether

enzymes from other thermophilic organisms, lacking this kind of lysine methylation can

benefit from lysine methylation as EstA.

The best-case scenario is that enzymes co-expressed with aKMT4 gain some stability, hence

engineering production stain expressing aKMT4 could improve enzyme stability in general.

It is known that expression of thermophilic and hyperthermophilic enzymes from E. coli often

is associated with loss of stability, and true in the case of EstA. Usually the lack of prober

PTM or co-factors is the expected cause. In our investigation, we did not find any evidence

that pointed towards differences in modification of EstA expressed in E. coli or the native

host lacking in vivo methylation. However, their stability and sensitivity to ionic detergents

are vastly different. The only differences we could find were that enzyme expressed from the

native host had a more heterogeneous charge distribution, while E. coli expressed EstA, were

homogenous. The underlying cause for the observation are still unknown. Deamidation is a

possible explanation.

During an investigation of the importance of dimer formation of the stability of EstA, were

we able to improve EstA stability through the introduction of a novel dimer spanning

disulphide bond. This single modification of the enzyme increased that stability of mesophilic

esterase by 1.62 fold. Whether this effect will persist when the mutant is expressed from S.

islandicus, remains to be tested. In any case, this result demonstrates the potential stabilising

Page 47: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

33

effect of oligomerisation and the possible benefits of targeting protein interaction interfaces

when engineering enzymes for stability.

Finally, the optimisation p-nitrophenol based activity assays for esterase activity can be help

full in future investigations of thermophilic esterases.

In addition to the data presented in this thesis, I would like to point out some observation on

protein purification of thermophilic and hyper-thermophilic proteins. Usually it is suggested

to add between 150-500 mM NaCl to IMAC buffers to avoid ionic interaction with the

column. However, with the presence of an increased number of charged residues on the

surface of these types of proteins, and an expectation that they provide stabilising ionic

interaction. Having a high ion strength in the buffer can disrupt the positive interaction

provided by the charges residues, there by potentially reducing the stability and solubility of

the protein. a problem extents to the concatenation of imidazole as it under normal operation

condition carry’s a positive charge. As the argument for using buffers contain a certain ionic

strength is valid and the use of imidazole very convenient for IMAC purification. I would

recruitment a buffer exchange to a buffer with low ionic strength, e.g. 50 mM Na/K-

phosphate or similar, after the protein is purified. Personally, this approach has in two cases

rectified problem with protein precipitation after purification.

Page 48: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

34

10 References

Allen, M.P., 2004. Introduction to Molecular Dynamics Simulation. Computational Soft

Matter: From Synthetic Polymers to Proteins, 23, pp.1--28.

Anfinsen, C.B. et al., 1961. The kinetics of formation of native ribonuclease during oxidation

of the reduced polypeptide chain. Proceedings of the National Academy of Sciences of

the United States of America, 47(9), pp.1309–14. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/13683522.

Arunan, E. et al., 2011. Definition of the hydrogen bond (IUPAC Recommendations 2011).

Pure and Applied Chemistry, 83(8), pp.1637–1641.

Baeten, L. et al., 2008. Reconstruction of protein backbones from the BriX collection of

canonical protein fragments. PLoS computational biology, 4(5), p.e1000083. Available

at: http://www.ncbi.nlm.nih.gov/pubmed/18483555.

Baldwin, R.L., 1986. Temperature Dependence of the Hydrophobic Interaction in Protein

Folding. Proceedings of the National Academy of Sciences, 83(21), pp.8069–8072.

Available at:

http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=386868&tool=pmcentrez&re

ndertype=abstract%5Cnhttp://www.pnas.org/content/83/21/8069.short.

Bannister, A.J. & Kouzarides, T., 2005. Reversing histone methylation. Nature, 436(7054),

pp.1103–6. Available at: http://www.ncbi.nlm.nih.gov/pubmed/16121170.

Baumann, H. et al., 1994. Solution structure and DNA-binding properties of a thermostable

protein from the archaeon Sulfolobus solfataricus. Nature structural biology, 1(11),

pp.808–19. Available at: http://www.ncbi.nlm.nih.gov/pubmed/7634092.

Beeby, M. et al., 2005. The genomics of disulfide bonding and protein stabilization in

thermophiles. PLoS biology, 3(9), p.e309. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/16111437.

Berg, J.M., Tymoczko, J.L. & Stryer, L., 2007. Biochemistry 6th ed., New York: W. H.

Freeman and Company.

Betz, S.F., 1993. Disulfide bonds and the stability of globular proteins. Protein Science,

2(10), pp.1551–1558. Available at:

http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2142256&tool=pmcentrez&r

endertype=abstract.

Bonneau, R. & Baker, D., 2001. Ab Initio Protein Structure Prediction: Progress and

Prospects. Annual Review of Biophysics and Biomolecular Structure, 30(1), pp.173–189.

Available at:

http://www.annualreviews.org/doi/abs/10.1146/annurev.genom.4.070802.110300.

Botting, C.H. et al., 2010. Extensive lysine methylation in hyperthermophilic crenarchaea:

potential implications for protein stability and recombinant enzymes. Archaea

(Vancouver, B.C.), 2010. Available at: http://www.ncbi.nlm.nih.gov/pubmed/20811616.

Byun, J.-S. et al., 2007. Crystal structure of hyperthermophilic esterase EstE1 and the

relationship between its dimerization and thermostability properties. BMC structural

biology, 7, p.47.

Page 49: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

35

Cambillau, C. & Claverie, J.M., 2000. Structural and genomic correlates of

hyperthermostability. Journal of Biological Chemistry, 275(42), pp.32383–32386.

Cao, L.C. et al., 2015. Enhancing the Thermostability of Feruloyl Esterase EstF27 by

Directed Evolution and the Underlying Structural Basis. Journal of Agricultural and

Food Chemistry, 63(37), pp.8225–8233.

Chahinian, H. & Sarda, L., 2009. Distinction between esterases and lipases: comparative

biochemical properties of sequence-related carboxylesterases. Protein and peptide

letters, 16(10), pp.1149–61. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/19508178.

Chakravarty, S. & Varadarajan, R., 2000. Elucidation of determinants of protein stability

through genome sequence analysis. FEBS letters, 470(1), pp.65–9. Available at:

http://doi.wiley.com/10.1016/S0014-5793%2800%2901267-9.

Chu, Y. et al., 2016. aKMT catalyzes extensive protein lysine methylation in the

hyperthermophilic archaeon Sulfolobus islandicus but is dispensable for the growth of

the organism. Molecular & Cellular Proteomics, p.mcp.M115.057778. Available at:

http://www.mcponline.org/lookup/doi/10.1074/mcp.M115.057778.

Chu, Y. et al., 2012. Identification and characterization of a highly conserved crenarchaeal

protein lysine methyltransferase with broad substrate specificity. Journal of

bacteriology, (October). Available at: http://www.ncbi.nlm.nih.gov/pubmed/23086207

[Accessed November 5, 2012].

Cobb, R.E., Chao, R. & Zhao, H., 2013. Directed Evolution: Past, Present and Future. AIChE

journal. American Institute of Chemical Engineers, 59(5), pp.1432–1440. Available at:

http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=4344831&tool=pmcentrez&r

endertype=abstract.

Cummings, R.D., 2009. The repertoire of glycan determinants in the human glycome.

Molecular BioSystems, 5(10), p.1087. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/19756298.

Damhus, T. (Novozyms A., Kaasgaard, S. (Novozyms A. & Olsen, Hans, S. (Novozyms A.,

2008. Enzymes at Work. Novozymes A/S. Available at: https://www.novozymes.com/en/-

/media/Novozymes/en/about-us/brochures/Documents/Enzymes_at_work.pdf.

Daniel, R.M., Dines, M. & Petach, H.H., 1996. The denaturation and degradation of stable

enzymes at high temperatures. The Biochemical journal, 317 ( Pt 1, pp.1–11. Available

at: http://www.ncbi.nlm.nih.gov/pubmed/8694749.

Dias, C.L. et al., 2010. The hydrophobic effect and its role in cold denaturation. Cryobiology,

60(1), pp.91–99. Available at: http://dx.doi.org/10.1016/j.cryobiol.2009.07.005.

Dill, K.A. et al., 2008. The protein folding problem. Annual review of biophysics, 37(2),

pp.289–316. Available at: http://www.ncbi.nlm.nih.gov/pubmed/18573083.

Dill, K.A. & MacCallum, J.L., 2012. The Protein-Folding Problem, 50 Years On. Science,

338(6110), pp.1042–1046. Available at:

http://www.sciencemag.org/cgi/doi/10.1126/science.1219021.

Dill, K. a et al., 1995. Principles of protein folding--a perspective from simple exact models.

Protein science : a publication of the Protein Society, 4(4), pp.561–602. Available at:

Page 50: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

36

http://www.ncbi.nlm.nih.gov/pubmed/7613459.

Dill, K. a, 1985. Theory for the folding and stability of globular proteins. Biochemistry,

24(APRIL 1985), pp.1501–1509.

Dill, K. a & Chan, H.S., 1997. From Levinthal to pathways to funnels. Nature structural

biology, 4(1), pp.10–9. Available at: http://www.ncbi.nlm.nih.gov/pubmed/8989315.

Docherty, K., Carroll, R.J. & Steiner, D.F., 1982. Conversion of proinsulin to insulin:

involvement of a 31,500 molecular weight thiol protease. Proceedings of the National

Academy of Sciences of the United States of America, 79(15), pp.4613–7. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/6750605.

Ece, S. et al., 2015. Improving thermal and detergent stability of Bacillus stearothermophilus

neopullulanase by rational enzyme design. Protein Engineering Design and Selection,

28(6), pp.147–151. Available at: https://academic.oup.com/peds/article-

lookup/doi/10.1093/protein/gzv001.

Eijsink, V.G.H. et al., 2005. Directed evolution of enzyme stability. Biomolecular

engineering, 22(1–3), pp.21–30. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/15857780.

Elcock, A.H., 1998. The stability of salt bridges at high temperatures: implications for

hyperthermophilic proteins. Journal of molecular biology, 284(2), pp.489–502.

Available at: http://www.sciencedirect.com/science/article/pii/S0022283698921593.

Febbraio, F. et al., 2004. Thermal stability and aggregation of sulfolobus solfataricus beta-

glycosidase are dependent upon the N-epsilon-methylation of specific lysyl residues:

critical role of in vivo post-translational modifications. The Journal of biological

chemistry, 279(11), pp.10185–94. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/14660666.

Fersht, A.R., 1995. Optimization of rates of protein folding: the nucleation-condensation

mechanism and its implications. Proceedings of the National Academy of Sciences of the

United States of America, 92(24), pp.10869–73. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/7479900.

Frazzetto, G., 2003. White biotechnology. EMBO reports, 4(9), pp.835–7. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/12949582 [Accessed August 29, 2016].

Fukuchi, S. & Nishikawa, K., 2001. Protein surface amino acid compositions distinctively

differ between thermophilic and mesophilic bacteria. Journal of molecular biology,

309(4), pp.835–43. Available at: http://www.ncbi.nlm.nih.gov/pubmed/11399062.

Guillon, P. et al., 2014. Structure-guided discovery of potent and dual-acting human

parainfluenza virus haemagglutinin-neuraminidase inhibitors. Nature communications, 5,

p.5268. Available at:

http://www.nature.com/ncomms/2014/141020/ncomms6268/full/ncomms6268.html.

Guo, L. et al., 2008. Biochemical and structural characterization of Cren7, a novel chromatin

protein conserved among Crenarchaea. Nucleic Acids Research, 36(4), pp.1129–1137.

Hanson, S.R. et al., 2009. The core trisaccharide of an N-linked glycoprotein intrinsically

accelerates folding and enhances stability. Proceedings of the National Academy of

Sciences of the United States of America, 106(9), pp.3131–6. Available at:

Page 51: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

37

http://www.pnas.org/content/106/9/3131.full.

Hasan, F., Shah, A.A. & Hameed, A., 2006. Industrial applications of microbial lipases.

Enzyme and Microbial Technology, 39(2), pp.235–251. Available at:

http://linkinghub.elsevier.com/retrieve/pii/S0141022905004606.

Hattori, A. et al., 2015. In Vivo Formation of the Protein Disulfide Bond That Enhances the

Thermostability of Diphosphomevalonate Decarboxylase, an Intracellular Enzyme from

the Hyperthermophilic Archaeon Sulfolobus solfataricus W. W. Metcalf, ed. Journal of

Bacteriology, 197(21), pp.3463–3471. Available at:

http://jb.asm.org/lookup/doi/10.1128/JB.00352-15.

Hendsch, Z.S. & Tidor, B., 1994. Do salt bridges stabilize proteins? A continuum electrostatic

analysis. Protein science : a publication of the Protein Society, 3(2), pp.211–26.

Available at: http://www.ncbi.nlm.nih.gov/pubmed/8003958.

Jorda, J. & Yeates, T.O., 2011. Widespread disulfide bonding in proteins from thermophilic

archaea. Archaea, 2011.

Jursic, B.S., 1996. Computation of Bond Dissociation Energy for Sulfides and Disulfides with

Ab Initio and Density Functional. , 62, pp.291–296.

Karplus, M. & Weaver, D.L., 1994. Protein folding dynamics: the diffusion-collision model

and experimental data. Protein science : a publication of the Protein Society, 3(4),

pp.650–68. Available at:

http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2142854&tool=pmcentrez&r

endertype=abstract.

Karshikoff, A., Nilsson, L. & Ladenstein, R., 2015. Rigidity versus flexibility: the dilemma of

understanding protein thermal stability. The FEBS journal, 282(20), pp.3899–917.

Available at: http://www.ncbi.nlm.nih.gov/pubmed/26074325.

Khan, F.I. et al., 2016. Current updates on computer aided protein modeling and designing.

International journal of biological macromolecules, 85, pp.48–62. Available at:

http://dx.doi.org/10.1016/j.ijbiomac.2015.12.072.

Kim, S. & Lee, S.B., 2004. Thermostable esterase from a thermoacidophilic archaeon:

purification and characterization for enzymatic resolution of a chiral compound.

Bioscience, biotechnology, and biochemistry, 68(11), pp.2289–2298.

Kim Baldwin, R.L., P.S., 1982. Specific intermediates in the folding recations of small

proteins and the mechanism of protein folding pathways. Ann. Rev. Biochem., 51,

pp.459–489.

Klibanov, A.M., 1983. Stabilization of enzymes against thermal inactivation. Advances in

applied microbiology, 29, pp.1–28. Available at:

http://linkinghub.elsevier.com/retrieve/pii/S0065216408703526 [Accessed August 27,

2016].

Korkhin, Y. et al., 1999. Oligomeric integrity--the structural key to thermal stability in

bacterial alcohol dehydrogenases. Protein science : a publication of the Protein Society,

8, pp.1241–1249.

Kouzarides, T., 2007. Chromatin modifications and their function. Cell, 128(4), pp.693–705.

Available at: http://www.ncbi.nlm.nih.gov/pubmed/17320507.

Page 52: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

38

Kubelka, J., Hofrichter, J. & Eaton, W.A., 2004. The protein folding “speed limit”. Current

opinion in structural biology, 14(1), pp.76–88. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/15102453.

Kumar, S. & Nussinov, R., 1999. Salt bridge stability in monomeric proteins. Journal of

molecular biology, 293(5), pp.1241–55. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/10547298.

Kumar, S., Tsai, C.J. & Nussinov, R., 2000. Factors enhancing protein thermostability.

Protein engineering, 13(3), pp.179–91. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/10775659.

Lee, K.K., Fitch, C.A. & García-Moreno E., B., 2002. Distance dependence and salt

sensitivity of pairwise, coulombic interactions in a protein. Protein Science, 11(5),

pp.1004–1016. Available at: http://doi.wiley.com/10.1110/ps.4700102.

Lessene, G. et al., 2013. Structure-guided design of a selective BCL-X(L) inhibitor. Nature

chemical biology, 9(6), pp.390–7. Available at:

http://www.nature.com.sire.ub.edu/nchembio/journal/v9/n6/full/nchembio.1246.html.

Levinthal, C., 1968. Are there pathways for protein folding. Journal de Chimie Physique - Et

De Physico-Chimie Biologique, 65(1), pp.44–45.

Levisson, M., van der Oost, J. & Kengen, S.W.M., 2009. Carboxylic ester hydrolases from

hyperthermophiles. Extremophiles, 13(4), pp.567–581.

Maras, B. et al., 1992. The protein sequence of glutamate dehydrogenase from Sulfolobus

solfataricus, a thermoacidophilic archaebacterium. Is the presence of N-epsilon-

methyllysine related to thermostability? European journal of biochemistry / FEBS, 203,

pp.81–87. Available at: http://dx.doi.org/ 10.1111/j.1432-1033.1992.tb19831.x.

Maugini, E. et al., 2009. Structural adaptation of the subunit interface of oligomeric

thermophilic and hyperthermophilic enzymes. Computational Biology and Chemistry,

33(2), pp.137–148.

Mei, Y. et al., 2012. Exceptional thermal stability and organic solvent tolerance of an esterase

expressed from a thermophilic host. Applied microbiology and biotechnology, 93(5),

pp.1965–74. Available at: http://www.ncbi.nlm.nih.gov/pubmed/21847512 [Accessed

January 2, 2013].

Moracci, M. et al., 1995. Expression and extensive characterization of a beta-glycosidase

from the extreme thermoacidophilic archaeon Sulfolobus solfataricus in Escherichia coli:

authenticity of the recombinant enzyme. Enzyme and microbial technology, 17(11),

pp.992–7. Available at: http://www.ncbi.nlm.nih.gov/pubmed/8541022.

Myers, J.K. & Pace, C.N., 1996. Hydrogen bonding stabilizes globular proteins. Biophysical

journal, 71(4), pp.2033–9. Available at:

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1233669/%5Cnhttp://www.ncbi.nlm.nih.

gov/pmc/articles/PMC1233669/pdf/biophysj00044-0377.pdf.

Nakashima, H., Fukuchi, S. & Nishikawa, K., 2003. Compositional changes in RNA, DNA

and proteins for bacterial adaptation to higher and lower temperatures. Journal of

biochemistry, 133(4), pp.507–13. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/12761299.

Page 53: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

39

Neuberger, a, 1938. Carbohydrates in protein: The carbohydrate component of crystalline

egg albumin. The Biochemical journal, 32(9), pp.1435–51. Available at:

http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=1264208&tool=pmcentrez&r

endertype=abstract.

Neufeld, E.F., 2006. Enzyme replacement therapy – a brief history, Available at:

http://www.ncbi.nlm.nih.gov/books/NBK11588/%5Cnhttp://www.ncbi.nlm.nih.gov/boo

ks/NBK11588/?report=printable%5Cnhttp://www.ncbi.nlm.nih.gov/pubmed/21290685.

Niu, Y. et al., 2013. A prototypic lysine methyltransferase 4 from archaea with degenerate

sequence specificity methylates chromatin proteins Sul7d and Cren7 in different

patterns. The Journal of biological chemistry, 288(19), pp.13728–40. Available at:

http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=3650410&tool=pmcentrez&r

endertype=abstract.

Nölting, B. & Agard, D.A., 2008. How general is the nucleation-condensation mechanism?

Proteins: Structure, Function and Genetics, 73(3), pp.754–764.

Nölting, B. & Andert, K., 2000. Mechanism of protein folding. Proteins: Structure, Function,

and Genetics, 41(3), pp.288–298. Available at: http://doi.wiley.com/10.1002/1097-

0134%2820001115%2941%3A3%3C288%3A%3AAID-PROT20%3E3.0.CO%3B2-C.

Ollis, D.L. et al., 1992. The alpha/beta hydrolase fold. Protein engineering, 5(3), pp.197–211.

Available at: http://www.ncbi.nlm.nih.gov/pubmed/1409539.

Pace, C.N. et al., 1988. Conformational stability and activity of ribonuclease T1 with zero,

one, and two intact disulfide bonds. Journal of Biological Chemistry, 263(24),

pp.11820–11825.

Panda, T. & Gowrishankar, B.S., 2005. Production and applications of esterases. Applied

microbiology and biotechnology, 67(2), pp.160–9. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/15630579.

Perl, D. & Schmid, F.X., 2001. Electrostatic stabilization of a thermophilic cold shock

protein. Journal of Molecular Biology, 313(2), pp.343–357. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/11800561.

Perutz, M.F. & Raidt, H., 1975. Stereochemical basis of heat stability in bacterial ferredoxins

and in haemoglobin A2. Nature, 255(5505), pp.256–259. Available at:

http://www.nature.com/doifinder/10.1038/255256a0 [Accessed September 19, 2016].

Petsko, G.A., 2001. Structural basis of thermostability in hyperthermophilic proteins, or

“there’s more than one way to skin a cat”. Methods in enzymology, 334(i), pp.469–78.

Available at: http://linkinghub.elsevier.com/retrieve/pii/S0076687901344865.

Rhee, J.K. et al., 2006. Analysis of the thermostability determinants of hyperthermophilic

esterase EstE1 based on its predicted three-dimensional structure. Applied and

Environmental Microbiology, 72(4), pp.3021–3025.

Sadqi, M., Lapidus, L.J. & Muñoz, V., 2003. How fast is protein hydrophobic collapse?

Proceedings of the National Academy of Sciences of the United States of America,

100(21), pp.12117–22. Available at: http://www.ncbi.nlm.nih.gov/pubmed/14530404.

Sánchez, A.M., Bennett, G.N. & San, K.-Y., 2005. Novel pathway engineering design of the

anaerobic central metabolic pathway in Escherichia coli to increase succinate yield and

Page 54: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

40

productivity. Metabolic Engineering, 7(3), pp.229–239. Available at:

http://linkinghub.elsevier.com/retrieve/pii/S1096717605000315.

Scandurra, R. et al., 1998. Protein thermostability in extremophiles. Biochimie, 80(11),

pp.933–41. Available at: http://www.ncbi.nlm.nih.gov/pubmed/9893953.

Schwarz, F. & Aebi, M., 2011. Mechanisms and principles of N-linked protein glycosylation.

Current opinion in structural biology, 21(5), pp.576–82. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/19756298.

Schymkowitz, J. et al., 2005. The FoldX web server: An online force field. Nucleic Acids

Research, 33(SUPPL. 2), pp.382–388.

Shental-Bechor, D. & Levy, Y., 2008. Effect of glycosylation on protein folding: a close look

at thermodynamic stabilization. Proceedings of the National Academy of Sciences of the

United States of America, 105(24), pp.8256–61. Available at:

http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2448824&tool=pmcentrez&r

endertype=abstract.

Sherman, F., Stewart, J.W. & Tsunasawa, S., 1985. Methionine or not methionine at the

beginning of a protein. BioEssays : news and reviews in molecular, cellular and

developmental biology, 3(1), pp.27–31. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/3024631.

Shimizu, S. & Chan, H.S., 2002. Erratum: “Temperature dependence of hydrophobic

interactions: A mean force perspective, effects of water density, and nonadditivity of

thermodynamic signatures” [J. Chem. Phys. 113, 4683 (2000)]. The Journal of Chemical

Physics, 116(19), p.8636. Available at:

http://scitation.aip.org/content/aip/journal/jcp/116/19/10.1063/1.1471910.

Siegel, J.B. et al., 2010. Computational Design of an Enzyme Catalyst for a Stereoselective

Bimolecular Diels-Alder Reaction. Science, 329(5989), pp.309–313. Available at:

http://www.sciencemag.org/cgi/doi/10.1126/science.1190239.

Spassov, V.Z., Karshikoff, A.D. & Ladenstein, R., 1995. The optimization of protein-solvent

interactions: Thermostability and the role of hydrophobic and electrostatic interactions.

Protein Science, 4(8), pp.1516–1527. Available at:

http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2143201&tool=pmcentrez&r

endertype=abstract.

Stepper, J. et al., 2011. Cysteine S-glycosylation, a new post-translational modification found

in glycopeptide bacteriocins. FEBS letters, 585(4), pp.645–50. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/21251913.

Stickle, D.F. et al., 1992. Hydrogen bonding in globular proteins. Journal of molecular

biology, 226(4), pp.1143–59. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/1518048.

Stiefler-Jensen, D., Schwarz-Linnet, T., et al., 2016. The extraordinary thermal stability of

EstA from S. islandicus is independent of post translational modifications. PhD thesis -

STABILITY OF HYPERTHERMOPHILIC PROTEINS Post-Translational Modifications

and Other Factors, p.Chapter 1.

Stiefler-Jensen, D., Teilum, K. & She, Q.-X., 2016. Effect of dimer interface integrity on the

Page 55: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

41

stability of a hyperthermophilic esterase. PhD thesis - STABILITY OF

HYPERTHERMOPHILIC PROTEINS Post-Translational Modifications and Other

Factors, p.Chapter 2.

Suhre, K. & Claverie, J.-M., 2003. Genomic correlates of hyperthermostability, an update.

The Journal of biological chemistry, 278(19), pp.17198–202. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/12600994.

Sun, D.P. et al., 1991. Contributions of engineered surface salt bridges to the stability of T4

lysozyme determined by directed mutagenesis. Biochemistry, 30(29), pp.7142–7153.

Available at: http://www.ncbi.nlm.nih.gov/pubmed/1854726.

Szilágyi, A. & Závodszky, P., 2000. Structural differences between mesophilic, moderately

thermophilic and extremely thermophilic protein subunits: results of a comprehensive

survey. Structure (London, England : 1993), 8(5), pp.493–504. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/10801491.

Szymanski, C.M. et al., 1999. Evidence for a system of general protein glycosylation in

Campylobacter jejuni. Molecular microbiology, 32(5), pp.1022–30. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/10361304.

Tanaka, Y. et al., 2004. How oligomerization contributes to the thermostability of an archaeon

protein. Protein L-isoaspartyl-O-methyltransferase from Sulfolobus tokodaii. The

Journal of biological chemistry, 279(31), pp.32957–67. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/15169774.

Taylor, T.J. & Vaisman, I.I., 2010. Discrimination of thermophilic and mesophilic proteins.

BMC Structural Biology, 10(Suppl 1), p.S5. Available at:

http://www.worldscientific.com/doi/abs/10.1142/S0219720010004707.

Thirumalai, D. & Klimov, D.K., 1999. Deciphering the timescales and mechanisms of protein

folding using minimal off-lattice models. Current opinion in structural biology, 9(2),

pp.197–207. Available at: http://www.ncbi.nlm.nih.gov/pubmed/10322218.

Thompson, M.J. & Eisenberg, D., 1999. Transproteomic evidence of a loop-deletion

mechanism for enhancing protein thermostability. Journal of molecular biology, 290(2),

pp.595–604. Available at: http://www.ncbi.nlm.nih.gov/pubmed/10390356.

Tompa, D.R., Gromiha, M.M. & Saraboji, K., 2016. Contribution of main chain and side

chain atoms and their locations to the stability of thermophilic proteins. Journal of

Molecular Graphics and Modelling, 64, pp.85–93. Available at:

http://dx.doi.org/10.1016/j.jmgm.2016.01.001.

Vieille, C. & Zeikus, G.J., 2001. Hyperthermophilic enzymes: sources, uses, and molecular

mechanisms for thermostability. Microbiology and molecular biology reviews : MMBR,

65(1), pp.1–43. Available at: http://www.ncbi.nlm.nih.gov/pubmed/11238984.

Villeret, V. et al., 1998. The crystal structure of Pyrococcus furiosus ornithine

carbamoyltransferase reveals a key role for oligomerization in enzyme stability at

extremely high temperatures. Proceedings of the National Academy of Sciences of the

United States of America, 95(6), pp.2801–6. Available at:

http://www.pnas.org/content/95/6/2801.long.

Voelz, V.A. et al., 2010. Molecular simulation of ab initio protein folding for a millisecond

Page 56: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

42

folder NTL9(1-39). Journal of the American Chemical Society, 132(5), pp.1526–8.

Available at: http://www.ncbi.nlm.nih.gov/pubmed/20070076.

Walden, H. et al., 2001. Tiny TIM: a small, tetrameric, hyperthermostable triosephosphate

isomerase. Journal of molecular biology, 306(4), pp.745–57. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/11243785.

Wieland, F., Paul, G. & Sumper, M., 1985. Halobacterial flagellins are sulfated glycoproteins.

The Journal of biological chemistry, 260(28), pp.15180–5. Available at:

http://www.ncbi.nlm.nih.gov/pubmed/3934156.

Wijma, H.J. et al., 2014. Computationally designed libraries for rapid enzyme stabilization.

Protein Engineering Design and Selection, 27(2), pp.49–58. Available at:

https://academic.oup.com/peds/article-lookup/doi/10.1093/protein/gzt061.

Xia, Y. et al., 2015. The Helicase Activity of Hyperthermophilic Archaeal MCM is Enhanced

at High Temperatures by Lysine Methylation. Frontiers in Microbiology, 6(November),

pp.1–12. Available at:

http://journal.frontiersin.org/Article/10.3389/fmicb.2015.01247/abstract.

Xiao, L. & Honig, B., 1999. Electrostatic contributions to the stability of hyperthermophilic

proteins. Journal of Molecular Biology, 289(5), pp.1435–1444. Available at:

http://www.sciencedirect.com/science/article/pii/S0022283698919040.

Yamaguchi, K. et al., 1991. Effects of site-directed removal of N-glycosylation sites in human

erythropoietin on its production and biological properties. Journal of Biological

Chemistry, 266(30), pp.20434–20439.

Yin, X. et al., 2015. Contribution of Disulfide Bridges to the Thermostability of a Type A

Feruloyl Esterase from Aspergillus usamii. Plos One, 10(5), p.e0126864. Available at:

http://dx.plos.org/10.1371/journal.pone.0126864.

Zavodszky, M. et al., 2001. Disulfide bond effects on protein stability: designed variants of

Cucurbita maxima trypsin inhibitor-V. Protein science : a publication of the Protein

Society, 10(1), pp.149–60. Available at:

http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2249841&tool=pmcentrez&r

endertype=abstract.

Závodszky, P. et al., 1998. Adjustment of conformational flexibility is a key event in the

thermal adaptation of proteins. Proceedings of the National Academy of Sciences of the

United States of America, 95(13), pp.7406–11. Available at:

http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=22632&tool=pmcentrez&ren

dertype=abstract.

Zwanzig, R., Szabo, A. & Bagchi, B., 1992. Levinthal’s paradox. Proceedings of the National

Academy of Sciences of the United States of America, 89(1), pp.20–2. Available at:

http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=48166&tool=pmcentrez&ren

dertype=abstract.

Page 57: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

43

Chapter 1

In this Chapter is the first of three-presenting research conduction during my PhD study. Here

we present an investigation into post-translation modifications role for the stability of

hyperthermophilic enzymes.

The manuscript is still in preparation, but in the final state before submission to Protein

science as a full-length Article of maximum 5000 words.

Page 58: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

44

The extraordinary thermal stability of EstA from S.

islandicus is independent of post translational modifications

Daniel Stiefler-Jensen2,+, Troels Schwarz-Linnet1,+, Casper de Lichtenberg1, Tam T. T. N.

Nguyen3, Kasper D. Rand3, Li Huang4, Qunxin She2 and Kaare Teilum1,*

1Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen,

Denmark 2Archaea Centre, Department of Biology, University of Copenhagen, Denmark 3Department of Pharmacology, University of Copenhagen, Denmark 4State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of

Sciences, Beijing, China

*Correspondence: [email protected] +These authors contributed equally to this work.

Abbreviation:

1,4-Dithiothreitol DTT

Circular dichroism spectroscopy CD

Electrospray ionization mass spectrometry ESI-MS

EstA expressed from S. islandicus SiEstA

EstA expressed from S. islandicus with knock out of akmt ΔSiEstA

EstA expressed from E. coli Shuffle T7 EcEstA

Isoelectric focusing IEF

Isopropyl β-D-1-thiogalactopyranoside IPTG

N-lauroylsarcosine Sarkosyl

Optical density at 600nm OD600

Polyacrylamide gel electrophoresis PAGE

Post-translation modifications PTM

Size exclusion chromatography SEC

Thioredoxin TrxA

Abstract

Enzymes from thermophilic and hyper-thermophilic organisms have an intrinsic high stability.

Understanding the mechanisms behind their high stability will be an important tool for the

engineering of novel enzymes with high stability.

Histone like proteins, sul7d, from the hyperthermophilic and acidophilic archaea have been

shown to have both unspecific and temperature dependent lysine methylation. Here we test the

effect of eliminating this in vivo lysine methylation on the stability of an esterase (EstA) purified

from the native host S. islandicus as well as the effect of expression in a mesophilic host, E. coli,

without the machinery for in vivo lysine methylation. We find that lysine mono methylation

indeed has a significant positive effect on the stability of EstA, although the effect is small.

However, the effect is secondary to that of expression in E. coli, as the enzyme expressed in E.

Page 59: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

45

coli is compromised both on stability and activity which we cannot contribute to a covalent

difference.

Introduction

High stability and activity are key requirements for the use of enzymes in most industrial

applications.1 Extremophilic organisms are natural reservoirs for enzymes with high stability,

and many enzymes with high stability characterized from extremophiles show potential as

biocatalysts in industrial processes.1,2

An esterase (EstA) isolated from the hyperthermophilic and acidophilic crenarchaeal organism

Sulfolobus islandicus was recently characterized.3 High thermostability and the resistance to

solvents makes EstA a good candidate for industrial exploitation. As protein production from the

native host is expensive, and cumbersome due to its extreme growth conditions, alternative and

well-established mesophilic expression organisms, such as E. coli are more attractive for enzyme

production.4,5 Intriguingly, when EstA was purified from the native host, the half-life of the

enzyme (in terms of catalytic activity) was 16 hours at 90°C, whereas for the same protein

recombinantly expressed in E. coli, the half-life reduced 32-fold.3 EstA expressed in E. coli was

also more sensitive to organic solvents than EstA expressed in the native host.

The increased stability of EstA expressed in S. islandicus was suggested to be a result of post-

translational modifications which are not present in the protein when it is expressed in E. coli.

Mono methylation of the ε-amino group of lysine have been shown to be extensive and sequence

unspecific in Sulfolobus and other archaea.6–10 In the case of the Sso7d a DNA binding protein

from S. solfataricus, the methylation seemed to be dependent of the growth temperature of the

native host.7 In addition, in vitro methylation of recombinant MCM purified from E. coli was

linked with increased stability and activity.11 Lysine methylation of β-Glycosidase from S.

Sulfolobus did however, not show any stabilizing effect.12

Recently, a lysine methyl transferase, aKMT4, with broad substrate specificity from S. islandicus

was characterized.13,14 Deleting the gene for aKMT4 in S. islandicus significantly reduces the

level of methylated lysine, but only moderately impairs the growth of S. islandiscus.15 The

identification of this lysine methyl transferase opens an opportunity of investigating the role of

lysine methylation on the stability of proteins from creanarchaeal archaea independent of

mesophilic expression systems.

Page 60: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

46

In this work, we have been interested in understanding the underlying reason for the previously

observed difference in the stability of EstA expressed in E. coli and S. islandicus. Our hypothesis

was that lysine methylation occurring in S. islandicus conferred stability to the enzyme. We

therefore expressed and purified EstA from wild-type S. islandicus, S. islandicus(Δakmt) and E.

coli, termed SiEstA, ΔSiEstA and EcEstA respectively. All three variants were characterized

with respect to covalent modifications, enzyme kinetic activity and thermal stability. We find a

large difference in the enzymatic activity and thermal stability between EstA expressed in E. coli

and in S. islandicus. In contrast, there is only a minimal difference in the properties of EstA from

the two S. islandicus strains. Our results thus show that the methylation of lysine only has a

small influence on the stability of EstA and thus the difference in both activity and stability must

be a result of non-covalent interactions stabilizing the wildtype enzyme from S. islandicus.

Materials and methods

Sulfolobus strains, growth condition, and transformation.

The Sulfolobus strains were propagated in a salt and vitamin base medium16 supplemented with

0.2% (w/v) vitamin free casamino acids (Difco Vitamin Assay, BD), 0.125% yeast extract

(CVy*). As carbon source, 0.02% (w/v) sucrose were used for regular growth (SCVy*), and

0.02% (w/v) D-arabinose (Sigma-Aldrich, St. Louis, MO) (ACVy*) for expression cultures.17

S. islandicus REY15A E233S18, previously transformed with pSe-estA3, were used as a

thermophilic host for the expression of EstA, (EstA from this host is appreviated SiEstA). S.

islandicus REY15A E233S with an additional knockout of the gene coding for aKMT/aKMT4,

SiRe_1449, was transformed with pSe-estA, following procedures previously described18, with

the following modifications: After electroporation the cells were inoculated at 65°C in 227 mM

(NH4)2PO4, 28.7 mM K2SO4, 13.4 mM KCl, 93.2 mM glycine. Competent cells were prepared

by growth at 70°C in sucrose media without yeast extract (SCV). EstA expressed from host line

is abbreviated ΔSiEstA

E. coli vector modification

Based on a previous vector (p32a-estA) for expression of EstA in E. coli as a 50 kDa fusion

protein with an N-terminal thioredoxin (TrxA)3, a new vector construct for the expression of

EstA in E. coli with the exact same sequence as the EstA expressed in S. islandicus was prepared

by (Figure 1). EstA expressed in E. coli is called EcEstA.

Page 61: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

47

Expression and purification

SiEstA and ΔSiEstA were purified from 1 litre cultures of S. islandicus and S.

islandicus(Δakmt), respectively, grown at 75˚C in ACVy* until OD600 ~0.8. The cells were

harvested by centrifugation at 12000 rpm for 10 minutes at room temperature (centrifuge 5810R,

Eppendorf AG, Hamburg).

EcEstA were purified from 2 litre E. coli SHuffle T7 grown at 30°C in LB medium

supplemented with 100 µg/ml of ampicillin and 10 µg/ml chloramphenicol. Protein expression

was induced by 1 mM of Isopropyl β-D-1-thiogalactopyranoside (IPTG) at OD600 ~0.8 and

allowed to proceed for 2 hours. The E. coli cells were harvested by centrifugation at 12000 rpm

for 10 minutes at 4˚C (centrifuge 5810R, Eppendorf AG, Hamburg). EstA expressed in E. coli is

appreviated EcEstA.

Both S. islandicus and E. coli cells were resuspended in immobilized metal ion affinity

chromatography (IMAC) binding buffer supplemented with 1 µg/ml DNaseI. The cells were

lysis by three passes through a french press at 1500 psi. The cell extract was clarified by

centrifugation at 12000 rpm for 20 minutes (centrifuge 5810R, Eppendorf AG, Hamburg) at

room temperature or 4˚C. The E. coli supernatant was additionally heat precipitated in Falcon

Tubes in a water bath at 70˚C for 10 minutes, followed by an additional clarification, as above.

Supernatants was filtered by 0.2 µm before chromatography.

The proteins were purified by IMAC using a HisTrap HP 1 ml column (GE Healthcare Life

Science) with a 20-500mM imidiazole gradient in pH 7.4, 1.37 mM NaCl, 27 mM KCl, 100 mM

NaH2PO4, 18 mM KH2PO4. The main peak, at approximately 200mM imidazole, was collected

before buffer exchange to 50 mM K-phosphate, pH 7.4. SDS-PAGE was used to determine the

purity of the final EstA sample. Enzyme concentration was estimated by measuring absorbance

at 280nm with an extinction coefficient 31900 M-1cm-1 and a mass of 34.5 kDa.

Native PAGE analysis

6% Native PAGE was used to analyse heat treated enzyme. 50 µg/ml enzyme in 50 mM

KH2PO4, pH 7.4 and 25 mM sarkosyl, were incubated at 90˚C for up to 1 hour while shaking.

The heat treatment was quench by dilution, 1:10, with the same buffer at room temperature. 10

µl or 50 ng enzyme were loaded to the native PAGE. The gel was stained with Coomassie Blue.

Size exclusion chromatography analysis

100 µl of purified EstA were run at 750 µl/min in 50 mM NaH2PO4, 150 mM NaCl, pH 7.8 on a

Superdex 200 10/300 GL. As a standard of molecular mass 100 µl of the following proteins were

used: Ribonuclease A 1.5 mg/ml, Ovalbumin 2 mg/ml, and Conalbumin 2 mg/ml.

Page 62: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

48

Mass spectrometry

LC-MS was performed using a NanoAQUITY UPLC setup coupled to an ESI Synapt G2 Q-TOF

mass spectrometer (Waters, Miliford, USA). Protein samples (50 pmol) were loaded onto the

UPLC system, then trapped and desalted on a C4 trap column (ACQUITY UPLC Protein BEH

C4 VanGuard Pre-column, 300Å, 1.7 µm, 2.1 mm x 5 mm column, Waters, Miliford, USA) for 3

min at 40 µL/min of 99% of mobile phase A (0.23% Formic Acid). Isocratic elution from the C4

column into the mass spectrometer was performed at a flow-rate of 40 µL/min of 95% mobile

phase B (ACN, 0.23% Formic Acid). Mass spectra were acquired in positive ionization mode

over an m/z range of 50-2000 and a scan time of 1 second. Each sample was run in duplicate.

The acquired mass spectra were deconvoluted using the MaxEnt1 algorithm of MassLynx

(Waters, Miliford, USA). The three charge states with the highest intensity were used to estimate

the average Full Width at Half Maximum. The deconvolution resolution was set to 0.1 Da.

Circular dichroism spectroscopy

Far- and near-UV spectra of the esterases in 50 mM K-phosphate, pH 7. 4 at 25°C were recorded

with a resolution of 0.2 nm on a Jasco spectropolometer J-810 equipped with a PTC-423s peltier

element.

Esterase activity assay

Esterase activity was determined as the rate of hydrolysis of p-nitrophenyl esters by following

the formation of p-nitrophenolate by the change in absorbance at 410 nm. The activity was

assayed in 50mM Na-phosphate, 25 mM N-lauroylsarcosine (Sarkosyl) where indicated, 10%

(v/v) 2-propanol, pH 7.0 with 20 ng/ml EstA. The substrate was 1 mM p-nitrophenyl hexanoate

unless otherwise states. The change in absorbance was measured with an Infinite M200 (Tecan

Trading AG, Switzerland) plate reader at 40°C except for the temperature-activity profile which

was measured in a Jasco spectropolometer J-810 with PTC-423s peltier element.

Isoelectric focusing analysis

Isoelectric focusing analysis was made with pH 3-10 Criterion IEF™ Gel (BIO-RAD

Laboratories), and IEF Anode and Cathode and loading buffers (BIO-RAD Laboratories). 125-

200 µg/ml enzyme were used and the analysis was made following the recommendation from the

manufacture. The enzyme was visualized by Bio-Rad Silver stain (BIO-RAD Laboratories) by

following the recommended protocol.

Page 63: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

49

Anion Exchange chromatography

A MonoQ HR 5/5 column was used for anion exchange chromatography in 50 mM bis-tris, pH

6.5 with a gradient from 0 to 500 mM NaCl. Elution made by a shallow gradient, 0.5 %/min.

Results

A pET32a based vector (p32a-estA) for expression of EstA in E. coli as a 50 kDa fusion protein

with an N-terminal thioredoxin (TrxA) and an S-tag has previously been constructed.3

Proteolytic cleavage with enterokinase releases EstA in a form, which is different at both the N-

and the C- termini compared to the EstA expressed from pSe-estA in S. islandicus. At the N-

terminal in the E. coli expressed EstA, Ala-Met-Ala replace the starting Met and at the C-

terminal a linker sequence and a His-tag (DKSAAALEHHHHHH), which differs from the linker

and His-tag used in the pSe-estA by the additional Leu and Glu residues underlined in the

sequence above. A new vector construct for the expression of EstA in E. coli with the exact same

sequence as the EstA expressed in S. islandicus was prepared to eliminate any difference

between primary structure of the proteins purified from S. islandicus and E. coli. The sequence

coding for EstA in the expression vectors resembles that of wild-type EstA (Uniprot: F0NDQ1)

with the addition of a C-terminal extension with the sequence Ala3His6.

SiEstA was purified from S. islandicus REY15A E233S.18 Additionally, EstA was purified as

ΔSiEstA from S. islandicus REY15A E233S Δakmt with a knockout of the gene SiRe_1449

coding for a lysine methyl transferase.15 E. coli SHuffleT7 which is engineered for improved

formation of disulphide bond formation, was used for the expression of EcEstA.

The final samples of all three variants of EstA gave rise to a single band of expected 35 kDa in

SDS-PAGE (Figure 2). Size exclusion chromatography shows similar retention volumes for all

EstA variants with an estimated size slightly above 75 kDa corresponding to EstA being a dimer

in solution (Figure 3).

Covalent characterization

To identify any post-translational modifications, we characterized the covalent structure of the

different variants of EstA. As S. islandicus is able to mono-methylate the -amino group of

lysine residues we used amino acid analysis to quantify the amount of mono-methylated Lys in

SiEstA (Table 1). On average 0.4 of the 14 Lys-residues in SiEstA are mono-methylated. Mass

spectrometry confirmed the presence of multiple methyl groups on SiEstA (Figure 4). For non-

methylated EcEstA, the experimentally determined mass is 34367.5 Da, corresponding well with

an accuracy of 0.7 Da or 20 ppm to the predicted mass of EstA without the start codon

Page 64: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

50

methionine (34368.8 Da). For methylated SiEstA, besides a species with mass corresponding to

EstA (34366.6 Da), additional species (34381.0 and 34395.3 Da) were observed in the

deconvoluted mass spectrum (Figure 4). The mass differences of 14.4 and 28.7 Da of these

species compared to the EstA correspond to the addition of one or two methyl groups,

respectively. A minor component corresponding to the addition of three methyl groups can also

be observed a low intensity.

EstA has three Cys residues in the sequence. The redox state of the Cys residues were analysed

by SDS-PAGE in the absence and presence of DTT (Figure 5). Under reducing conditions the

protein migrates slightly longer than under non-reducing conditions. EstA purified from E. coli

migrate exactly the same as when it is purified from S. islandicus. These results demonstrate that

EstA forms an intramolecular disulphide bond and since the protein migrates as a monomer

under non-reducing conditions in SDS-PAGE the EstA dimer is not linked by a disulphide bond.

To assess if there are any differences in the non-covalent structure we compared both near- and

far-UV CD spectra of EcEstA and SiEstA (Figure 6). The spectra are identical demonstrating

that there are no differences in the secondary structure (near-UV CD) or in environment of the

aromatic residues (far-UV CD).

Enzymatic activity

Following the previous report about the difference in temperature optimum for catalysing the

hydrolysis of phenyl esters we compared the temperature-activity profiles of the EstA variants.

In contrast to the previous report on EstA, the temperature optima of all the variants are

approximately 75 °C (Figure 7A), which is close to the optimal growth temperature of S.

islandicus at 80 °C. At all temperatures the measured activity of EcEstA, however, is about half

of those measured for the EstA variants expressed in S. islandicus.

To compare the catalytic characteristics of the three EstA variants we complemented the

temperature-activity profiles with an Michaelis-Menten analysis of the substrate dependence of

the catalytic activity. The analysis was performed at 40 °C as the kinetic assay at this

temperature can be performed a in 96-well plate format, which is considerably more simple to

handle experimentally than the assays carried out in single cuvettes at elevated temperatures. p-

nitrophenylhexanoate starts precipitating above 1 mM and it was therefore not possible to reach

substrate saturation. The two variants of EstA expressed in S. islandicus have almost the same

kinetic parameters whereas both kcat and KM for EstA expressed in E. coli are significantly

reduced (Figure 7B and Table 2).

Page 65: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

51

Thermostability of EstA

Attempts to measure the thermodynamic stability of EstA using chemical denaturants failed as

the protein aggregates when it unfolds, which renders the folding irreversible. Temperature

unfolding of EstA is not possible either as the unfolding transition occurs above 95 °C both for

the protein expressed in E. coli and in S. islandicus. However, incubation of EstA at 90 °C leads

to loss of enzymatic activity (Figure 7C&D). We therefore used the half-life of the catalytic

activity after incubation at 90 °C as a measure of the thermostability of EstA. In phosphate

buffer at pH 7 the half-life of the different EstA preparations are 33.9 min for SiEstA, 31.4 min

for ΔSiEstA and 26.5 min for EcEstA. Although small, the differences are statistical significant

(ANOVA, p < 0.05). Addition of 25 mM sarkosyl, a strong anion surfactant, amplifies the

differences. The half-life of the activity of SiEstA increases slightly to 36.3 min while it is

reduced to only 10.5 min for EcEstA (Figure 7D). In sarkosyl the half-life of SiEstA is slightly

reduced relative to SiEstA. Native-PAGE analysis of the esterase under these conditions shows

changes in the electrophoretic mobility for EcEstA occurring much faster than for SiEstA

(Figure 8). Whereas the methylation state of EstA only has a slight effect on the thermostability,

the effect of the expression host is much more pronounced.

As we do not see any difference in the mass from ESI-MS of EcEstA and SiEstA but still has a

significant difference in enzymatic activity and thermostability a possible explanation could be

that the protein expressed in S. islandicus at 75 °C was deamidated, which is a post translational

modification not detectable by mass spectrometry of the intact protein. We therefore analysed

the EstA preparations by anion exchange chromatography and isoelectric focusing to identify

any difference in the net charge of the protein. The chromatograms from the anion exchange

show that the major fraction of EstA elutes at the same concentration of NaCl for all three

variants (Figure 9A). SiEstA and to some extent SiEstA have additional peaks eluting at higher

concentrations of NaCl. This heterogeneity of the protein purified from S. islandicus is also seen

in IEF-PAGE (Figure 9B). Measuring the activity and thermal stability of EstA in main peaks

eluting from the anion exchange column results in the same half-life of the activity as seen

before. Thus, although there are clear differences in both the elution profile from the anion

exchange column and in the IEF-PAGE profiles, the fractions of SiEstA that elutes at the same

NaCl concentration as EcEstA still has a longer half-life of the catalytic activity and a difference

in charge cannot explain the difference in activity.

Discussion

Page 66: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

52

The thermostability of EstA depends on the organism in which it is expressed. Such behaviour is

well known for proteins that become post-translationally modified 19. In particular glycosylations

are known to increase the kinetic stability and lower the aggregation propensity.20 Our initial

hypothesis about the poor stability of EstA expressed in E. coil reported previously3 was

therefore that the native EstA was post-translationally modified, and that the lack of this

modification destabilized the protein. Expression of identical constructs of EstA in E. coli and S.

islandicus shows that the EstA expressed in E. coli is indeed lacking post-translational

methylation of lysine residues. These modifications were recently found in a proteomic analysis

of aKMT4 targets in S. islandicus and shown to occur on four lysines (K8, K21, K50 and

K138).15 In agreement with our observation that on average only 0.4 lysine residue are

methylated per EstA molecule, the modification of the four lysine residues were found to be

incomplete.15 Comparing SiEstA and EcEstA the only covalent difference, which we have been

able to find, is that lysine residues in SiEstA are methylated. Using S. islandicus REY15A

E233S Δakmt as expression host also resulted in EstA that was unmethylated and the covalent

structure of this variant of EstA is identical to that of EstA expressed in E. coli. It is therefore

puzzling that the catalytic activity and the thermal stability of the two unmethylated variants of

EstA differs. Indeed, EstA expressed in and purified from the two different strains of S.

islandicus have much more similar properties.

One explanation for this behaviour is that some unidentified co-factor only present in S.

islandicus stabilize EstA. We therefore tested if EstA purified from E. coli could gain (some of)

the thermal stability that SiEstA possesses by incubating EcEstA with S. islandicus lysate. After

purification from the lysate EcEstA retains its reduced stability (data not shown). Thus, the

difference in stability remains elusive.

Our results show that the effect of lysine methylations on protein stability, at least of EstA, is

small, whereas some other effect of the expression host is much more important. First of all,

these observations demonstrate that proteins from hyperthermophilic organisms advantageously

may be expressed in a hyperthermophilic organism. If the factor responsible for the increased

stability of EstA also have an effect on a mesophilic protein is hard test by expression in S.

islandicus, as the melting temperatures of most mesophilic proteins are below the growth

temperature of S. islandicus. Recombinant expression in S. islandicus is therefore often only

feasible for thermophilic proteins.

Page 67: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

53

The observation that lysine methylation only has a minor effect on EstA stability adds to the

speculation about the physiological role of lysine methylation.21 The most prevalent hypothesis

is that lysine methylation increases the protein stability, and is in part based on the difference in

stability of a -glucosidase from S. solfataricus purified from the natural source and the protein

recombinantly expressed in E. coli.22 It is not known if the underlying reason for the increased

stability of this -glucosidase is the same as for the increased EstA stability, but it will be of

interest to determine if the stabilizing effect we observe on EstA is general or specific to EstA.

References

1. Haki G. Developments in industrially important thermostable enzymes: a review.

Bioresour Technol. 2003;89(1):17-34. doi:10.1016/S0960-8524(03)00033-6.

2. Littlechild JA. Enzymes from Extreme Environments and Their Industrial Applications.

Front Bioeng Biotechnol. 2015;3(October):161. doi:10.3389/fbioe.2015.00161.

3. Mei Y, Peng N, Zhao S, et al. Exceptional thermal stability and organic solvent tolerance

of an esterase expressed from a thermophilic host. Appl Microbiol Biotechnol.

2012;93(5):1965-1974. doi:10.1007/s00253-011-3504-z.

4. Chen R. Bacterial expression systems for recombinant protein production: E. coli and

beyond. Biotechnol Adv. 2012;30(5):1102-1107. doi:10.1016/j.biotechadv.2011.09.013.

5. Rosano GL, Ceccarelli EA. Recombinant protein expression in Escherichia coli: advances

and challenges. Front Microbiol. 2014;5(APR):172. doi:10.3389/fmicb.2014.00172.

6. Maras B, Consalvi V, Chiaraluce R, et al. The protein sequence of glutamate

dehydrogenase from Sulfolobus solfataricus, a thermoacidophilic archaebacterium. Is the

presence of N-epsilon-methyllysine related to thermostability? Eur J Biochem.

1992;203:81-87. http://dx.doi.org/ 10.1111/j.1432-1033.1992.tb19831.x.

7. Baumann H, Knapp S, Lundbäck T, Ladenstein R, Härd T. Solution structure and DNA-

binding properties of a thermostable protein from the archaeon Sulfolobus solfataricus.

Nat Struct Biol. 1994;1(11):808-819. doi:10.1038/nsb1194-808.

8. Febbraio F, Andolfo A, Tanfani F, et al. Thermal stability and aggregation of sulfolobus

solfataricus beta-glycosidase are dependent upon the N-epsilon-methylation of specific

lysyl residues: critical role of in vivo post-translational modifications. J Biol Chem.

2004;279(11):10185-10194. doi:10.1074/jbc.M308520200.

Page 68: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

54

9. Guo L, Feng Y, Zhang Z, et al. Biochemical and structural characterization of Cren7, a

novel chromatin protein conserved among Crenarchaea. Nucleic Acids Res.

2008;36(4):1129-1137. doi:10.1093/nar/gkm1128.

10. Botting CH, Talbot P, Paytubi S, White MF. Extensive lysine methylation in

hyperthermophilic crenarchaea: potential implications for protein stability and

recombinant enzymes. Archaea. 2010;2010. doi:10.1155/2010/106341.

11. Xia Y, Niu Y, Cui J, et al. The Helicase Activity of Hyperthermophilic Archaeal MCM is

Enhanced at High Temperatures by Lysine Methylation. Front Microbiol.

2015;6(November):1-12. doi:10.3389/fmicb.2015.01247.

12. Moracci M, Nucci R, Febbraio F, et al. Expression and extensive characterization of a

beta-glycosidase from the extreme thermoacidophilic archaeon Sulfolobus solfataricus in

Escherichia coli: authenticity of the recombinant enzyme. Enzyme Microb Technol.

1995;17(11):992-997. doi:10.1016/0141-0229(95)00012-7.

13. Niu Y, Xia Y, Wang S, et al. A prototypic lysine methyltransferase 4 from archaea with

degenerate sequence specificity methylates chromatin proteins Sul7d and Cren7 in

different patterns. J Biol Chem. 2013;288(19):13728-13740.

doi:10.1074/jbc.M113.452979.

14. Chu Y, Zhang Z, Wang Q, Luo Y, Huang L. Identification and characterization of a

highly conserved crenarchaeal protein lysine methyltransferase with broad substrate

specificity. J Bacteriol. 2012;(October). doi:10.1128/JB.01535-12.

15. Chu Y, Zhu Y, Chen Y, et al. aKMT Catalyzes Extensive Protein Lysine Methylation in

the Hyperthermophilic Archaeon Sulfolobus islandicus but is Dispensable for the Growth

of the Organism. Mol Cell Proteomics. 2016;15(9):2908-2923.

doi:10.1074/mcp.M115.057778.

16. Zillig W, Kletzin A, Schleper C, et al. Screening for Sulfolobales, their Plasmids and their

Viruses in Icelandic Solfataras. Syst Appl Microbiol. 1994;16(4):609-628.

doi:10.1016/S0723-2020(11)80333-4.

17. Peng N, Deng L, Mei Y, et al. A synthetic arabinose-inducible promoter confers high

levels of recombinant protein expression in hyperthermophilic archaeon Sulfolobus

islandicus. Appl Environ Microbiol. 2012;78(16):5630-5637. doi:10.1128/AEM.00855-

Page 69: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

55

12.

18. Deng L, Zhu H, Chen Z, Liang YX, She Q. Unmarked gene deletion and host-vector

system for the hyperthermophilic crenarchaeon Sulfolobus islandicus. Extremophiles.

2009;13(4):735-746. doi:10.1007/s00792-009-0254-2.

19. Baneyx F, Mujacic M. Recombinant protein folding and misfolding in Escherichia coli.

Nat Biotechnol. 2004;22(11):1399-1408. doi:10.1038/nbt1029.

20. Høiberg-Nielsen R, Fuglsang CC, Arleth L, Westh P. Interrelationships of Glycosylation

and Aggregation Kinetics for Peniophora lycii Phytase †. Biochemistry.

2006;45(15):5057-5066. doi:10.1021/bi0522955.

21. Lanouette S, Mongeon V, Figeys D, Couture J-F. The functional diversity of protein

lysine methylation. Mol Syst Biol. 2014;10(4):724. doi:10.1002/msb.134974.

22. Febbraio F, Andolfo A, Tanfani F, et al. Thermal stability and aggregation of sulfolobus

solfataricus beta-glycosidase are dependent upon the N-epsilon-methylation of specific

lysyl residues: critical role of in vivo post-translational modifications. J Biol Chem.

2004;279(11):10185-10194. doi:10.1074/jbc.M308520200.

Figures

Figure legends

Figure 1 Schematic representation of EstA constructs. A) Previous construct used for expression

in E. coli (EcSisEstAII) and in S. islandicus (SiEstA). B) Difference in protein sequence between

EstA produced from EcSisEstAII and from SiEstA. The sequence of EstA produced from the

new E. coli construct EcEstA is identical to that produced from SiEstA.

Figure 2 SDS-PAGE analysis of purified EstA. The samples were run on a 12% acrylamide/bis-

acrylamide gels.

Figure 3 Size exclusion chromatography. The retention volumes of the three EstA variants

SiEstA, ΔSiEstA, and EcEstA are similar and correspond dimers. The samples were run on a

Superdex200 10/300 GL column in 50 mM NaH2PO4, 150 mM NaCl at pH 7.8. The elution

Page 70: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

56

volumns of a set of standard proteins used for calibration of the column are also shown. The

dashed line is a linear fit to the data for the standard proteins.

Figure 4 Mass analysis of SiEstA and ΔSiEstA by ESI MS. (A) Representative mass spectrum

of SiEstA (A) and ΔSiEstA (C) and the deconvoluted spectrum of SiEstA (B) and ΔSiEstA (D).

Figure 5 Analysis of disulfide bond formation by SDS-PAGE analysis. Samples of purified

SiEstA and EcEstA were prepared in the presence or absence of 50 mM DTT as indicated.

Figure 6 Circular dichroism spectra of SisEstA (red) and EcEstA (blue). Far-UV (top) and near-

UV spectra (bottom) were recorded at pH 7.4 and 25 C. The far-UV spectra were recorded at

approximately 0.2 mg/ml EstA in a 1 mm cuvette and the near UV spectra at approximately 7

mg/ml EstA in a 10 mm cuvette. The measured signal was normalized to the concentration and

plotted as the molar ellipticity.

Figure 7 Activity bases characterisation of EstA variation. The activity of SiEstA (red square),

ΔSiEstA (green circle), and EcEstA (blue triangle) were measured by following the rate of

formation of p-nitrophenylate produced by the hydrolysis of 1 mM p-nitrophenyl-hexanoate by

20 µg/ml enzyme unless otherwise indicated. A) Temperature-activity profile of EstA. The assay

was performed at varying temperatures in 50mM Na-phosphate, 25 mM N-lauroylsarcosine,

10% 2-propanol, pH 7.0. B) Michaelis-Menten kenetics of EstA’s by the hydrolysis of varying

concentrations of p-nitrophenyl-hexanoate as incated. The assay was performed at 40 °C in

50mM Na-phosphate, 25 mM N-lauroylsarcosine, 10% 2-propanol, pH 7.0. C) Residual activity

of EstA’s. The activity of the EstA’s were measured after incubated at 90°C for 5-120 min at 90

°C in 50 mM NaH2PO4, pH 7. D) Residual activity of EstA’s. The activity of the EstA’s were

measured after incubated at 90°C for 5-120 min at 90 °C in 50mM Na-phosphate, 25 mM N-

lauroylsarcosine, 10% 2-propanol, pH 7.0

Figure 8 Native-PAGE analysis of temperature incubated EstA. Prior to the electrophoresis,

SiEstA and EcEstA were incubated for the indicated amount of time at 90 °C in 50 mM

NaH2PO4, 25 mM Sarkosyl, pH 7.

Figure 9 Anion exchange & IEF analysis of SiEstA (red), ΔSiEstA (green), and EcEstA (blue).

A) Anion exchange chromatrography of each of the three EstA variants as eluted from a MonoQ

Page 71: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

57

HR 5/5 column by a 0.5 %/min NaCl gradient from 0 to 500 mM. In 50 mM bis-tris buffer at pH

6.5. B) Isoelectric focusing gel electrophorese of the three EstA variants, separated on 3-10 pH

gradient.

Page 72: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

58

Figure 1

Figure 2

Page 73: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

59

Figure 3

Figure 4

Page 74: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

60

Figure 5

Figure 6

Page 75: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

61

Figure 7

Figure 8

Page 76: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

62

Figure 9

Tables

Table 1 Amino acid analysis of SiEstA

Amino acid mol % Number of amino

acid residuesa

Ala 9.9 31.0

Arg 5.1 16.0

Asp 10.9 34.4

Cys 1.0 3.2

Glu 7.4 23.2

Gly 8.5 26.6

His 2.5 7.8

Ile 6.6 20.7

Leu 10.2 32.0

Lys 4.2 13.1

MeLys 0.14 0.4

Met 0.7 2.3

Phe 5.5 17.4

Pro 6.1 19.2

Ser 6.9 21.5

Thr 2.0 6.4

Tyr 4.4 13.8

Val 7.7 24.2 aCalculated for a total of 313 amino acid residues

Page 77: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

63

Table 2 Enzyme kinetic parameters for EstA variants at

40°C

Variant KM [mM] kcat [102 s-1]

SiEstA 0.66 ±0.04 192 ± 6

ΔSiEstA 1.00 ±0.05 229 ± 7

EcEstA 0.42 ±0.02 100 ± 2

Page 78: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

64

Page 79: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

65

Chapter 2

In this manuscript, the importance of dimer formation on the stability of the hyper-thermophilic

esterase EstA is investigated.

To facilitate the in silico modelling presented in this manuscript a custom python script was

written by myself for the automation of the modelling, The modelling itself was made with

FoldX4, http://foldxsuite.crg.eu. In appendix I you can find an introduction to the functionality

of the script and a guide for use, as well as the python code.

The manuscript is still in preparation, and written as a full-length article of maximum 5000

words for Protein science.

Page 80: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

66

Effect of dimer interface integrity on the stability of a

hyperthermophilic esterase

Daniel Stiefler-Jensen1, Kaare Teilum2, and Qunxin She1,*

1Archaea Centre, Department of Biology, University of Copenhagen, Denmark 2Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen,

Denmark

*Correspondence: [email protected]

Abbreviations

Amino acid AA

Immobilized metal ion affinity chromatography IMAC

Isopropyl β-D-1-thiogalactopyranoside IPTG

N-lauroylsarcosine Sarkosyl

Protein Interaction Atom Distance Algorithm PIADA

Proteins, interfaces, Structures and Assemblies PISA

Protein Structure and Interaction Analyzer PSAIA

Site-directed ligation independent mutagenesis SLIM

Solvent accessible surface arear SASA

Thioredoxin Trx

Van der Waals VdW

Abstract

Many factors influence the stability of hyper-thermophilic enzymes by different mechanisms,

and protein oligomerisation is one of them. Oligomerisation increases enzyme stability by

reducing the solvent-exposing surface area through protein-protein interaction, presumably by an

entropy driven release of hydration water. Here we investigate the importance of dimer interface

integrity for the stability of an esterase from the hyper-thermophilic and acidophilic archaea

Sulfolobus islandicus (EstA). We found that compromising the interactions at the dimer interface

by alanine substitution of hydrophobic residues present in the core of the interface drastically

reduced the stability of mutant EstA proteins although these proteins still formed dimer at

ambient temperatures, suggesting that the protein stability could be correlated with the strength

of inter-subunit interactions. Indeed, strengthening the interaction at the dimer-interface by

generating a novel inter-molecular disulphide bond resulted in an increase in protein stability at

90°C by 1.63 fold.

Introduction

Page 81: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

67

The hunt for enzymes with the highest possible stability is an ever-ongoing process in the

biotech industry. One way to obtain enzymes with a higher stability is to utilise directed or

rational evolution of mesophilic enzymes to make them more thermophilic.1,2

(Hyper)Thermophilic organisms that lives at temperatures above 50°C are another source of

thermostable enzymes.3 Their enzymes have evolved to cope with the high temperatures of their

host organism’s natural environment and are therefore intrinsically stable. Pyrococcus furiosus4

and Pyrolobus fumarii5 are both examples of hyperthermophilic organisms isolated from

environments where the temperature are up to and above 100°C. From organisms like these,

there is a potential for isolation of enzymes with high stability. For example, A β-glucosidase

from P. furiosus showed an optimal temperature of 105°C and a half-life of 13 hours at 100°C6.

Beside the isolation of thermophilic enzymes for their use in the biotechnology industry7, they

are also studied to discern the mechanisms that provide their high stability. So far research into

these mechanisms has yielded many clues as to what makes thermophilic enzymes as stable as

they are. However, no general mechanism has yet, if it ever will, been identified. The

information accumulated so far, points to a wide range of optimisation that converge to

contribute to the high stability of thermophilic enzymes8. These include, among others, increased

number charged residues on the protein surface, an under representation of heat liable amino

acids, void filling, rigidity, volume reduction and oligomerisation3,9–12.

Oligomerisation, the formation of stable transient, protein-protein interaction is a possible

mechanism for protein stability. By engaging in protein-protein interaction, the overall solvent

accessible surface arear (SASA) are reduced. The reduction in SASA provides a stabilising

effect by reducing unfavourable interaction between water and the protein surface. Comparison

of protein-protein interfaces from mesophilic, thermophilic enzymes, has shown that

thermophilic interfaces are more hydrophobic and rely less on hydrogen bonding, compared to

mesophilic interfaces, while the number of ionic interaction are similar.13 The stabilising effect

of protein oligomerisation have been experimentally observed on several occasions.14–16

One such case, is an esterase from the hyper-thermophilic and acidophilic archaea Sulfolobus

islandicus (EstA). Here a large difference in stability was observed between monomeric and

dimeric version of EstA17,18, suggesting the dimerization provides a stabilising effect on the

esterase. Both versions of EstA has been reported to have similar, but reduced enzyme kinetics

when expressed in E. coli.17,18 Hence the reduction in the enzymatic kinetics was an effect of the

expression in E. coli and not an effect of the enzyme being monomeric. Consequently, EstA

Page 82: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

68

dimer formation is not linked to function, and the dimer formation of EstA likely serve an

alternative purpose e.g. stabilisation of the protein structure. As the catalytic activity is not

affected by the oligomeric state of the esterase, this enzyme is a good model for studying the

stabilising effect of protein oligomerisation in hyper-thermophilic enzymes.

In this study, we investigate the effect of dimer interface integrity on the stability of the hyper-

thermophilic esterase EstA of Sulfolobus islandicus. The hypothesis is that weakening or

disruption of the intra-molecular interaction will result in a reduced stability, while strengthening

these interactions will improve the stability of the esterase. To accomplish this, we used in silico

and structural analysis to pinpoint target for mutagenesis to disrupt the dimer and structural

inspection for identification of likely targets for stabilising mutations, then constructed the

mutants and characterised their stability in vitro.

Material and methods

Cell lines and vectors and enzyme preparation

All expression vectors were cloned and purified from E. coli DH5α. For expression and

purification of protein E. coli SHuffleT7 was used. Expression of wildtype EstA was done from

a pET32A based, p32a-estA_v2, vector containing the complete reading frame of EstA with a C-

terminal 6x histidine tag.18 Mutants of and EcEstA were purified from 2 litre E. coli Shuffle T7

grown at 30°C in LB medium supplemented with 100 µg/ml of ampicillin and 10 µg/ml

chloramphenicol. Protein expression was induced by 1 mM of Isopropyl β-D-1-

thiogalactopyranoside (IPTG) at OD600 ~0.8 and allowed to proceed for 2 hours. The E. coli cells

were harvested by centrifugation at 12000 rpm for 10 minutes at 4˚C (centrifuge 5810R,

Eppendorf AG, Hamburg). EstA expressed in E. coli is abbreviated EcEstA.

Both S. islandicus and E. coli cells were resuspended in immobilized metal ion affinity

chromatography (IMAC) binding buffer supplemented with 1 µg/ml DNaseI. The cells were

lysis by three passes through a french press at 1500 psi. The cell extract was clarified by

centrifugation at 12000 rpm for 20 minutes (centrifuge 5810R, Eppendorf AG, Hamburg) 4˚C.

The supernatant was heat precipitated in water bath at 70˚C for 10 minutes, followed by an

additional clarification, as above. Supernatants of the heat precipitation was filtered through a

0.2 µm filter before chromatography.

Page 83: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

69

The proteins were purified by IMAC using a HisTrap HP 1 ml column (GE Healthcare Life

Science) with a 20-500mM imidiazole gradient in pH 7.4, 150 mM NaCl, 50 mM NaH2PO4. The

main peak, at approximately 200mM imidazole, was collected before buffer exchange to 50 mM

Na-phosphate, pH 7. SDS-PAGE was used to determine the purity of the final EstA sample.

Enzyme concentration was estimated by measuring absorbance at 280nm with an extinction

coefficient 31900 M-1cm-1 and a mass of 34.5 kDa.

SLIM Mutagenesis

Mutation of the estA gene on p32a_estA_v2 was made the Site-directed ligation independent

mutagenesis (SLIM). 19,20 The mutagenesis was performed as described in.20 with the following

modifications: template removal by DpnI treatment was performed after the SLIM hybridisation.

The PCR was performed by an initial denaturing step at 98°C for 30s. Followed by 35 cycles of

98°C for 30s, 50°C for 30s, 72°C for 6 minutes. And a final elongation step of 72°C for 10min

concluded the reaction. The PCR was made with 2U of Phusion Hot Start II DNA Polymerase

(Thermo Fisher Scientific, USA). See Table I for primers used.

Size exclusion chromatography analysis

100 µl of purified EstA were run at 750 µl/min in 50 mM NaH2PO4, 150 mM NaCl, pH 7.8 on a

Superdex200 10/300 GL. As a standard of molecular mass 100 µl of the following proteins were

used: Ribonuclease A 1.5 mg/ml, Ovalbumin 2 mg/ml, and Conalbumin 2 mg/ml.

Esterase activity/stability assays

Esterase activity was determined as the rate of hydrolysis of p-nitrophenyl esters by following

the formation of p-nitrophenolate by the change in absorbance at 410 nm. The activity was

assayed in 50mM Na-phosphate, 25 mM N-lauroylsarcosine (Sarkosyl), 10% (v/v) 2-propanol,

pH 7.0 with 20 ng/ml EstA. The substrate was 1 mM p-nitrophenyl hexanoate unless otherwise

states. The change in absorbance was measured with an Infinite M200 (Tecan Trading AG,

Switzerland) plate reader at 40°C. Thermal inactivation of EstA was performed by incubating

5µg/ml enzyme in 50mM Na-phosphate, 25 mM N-lauroylsarcosine (Sarkosyl), for the

appropriate time.

Page 84: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

70

Interface prediction

PIADA analysis was performed with PSAIA Version 1.0, using standard settings.21 PISA

analysis was performed using standard settings from PDBePISA homepage8.22

Stability modelling/ model building (FoldX script)

FoldX423 were used to repair, modify PDB files with the RepairPDB and BuildModel commands

respectively. All Stability and AnalyseComplex calculation was made with the following values

for temperature, pH, and ion strength: 308.15K, 7, 0.1.

Structural inspection and image generation

All structural inspection and imagen was made with “The PyMOL Molecular Graphics System”

version 1.3.

Results and discussion

In silico analysis of the dimer interface

Recently the structure of an esterase native of hyperthermophilic archaea Sulfolobus islandicus

(EstA) has been solved with a 2.6 Å resolution (PDB ID: 5LK6). The enzyme is a dimer in the

crystal, and this is consistent with previously in vitro analysis, where the enzyme was purified as

a dimer from S. islandicus.17,18 Structural analysis by PIADA and PISA algorithms revealed that

the dimer interface of the enzyme is located in the C-terminal region and consist of the C-

terminal β-strand (8), of an alpha/beta hydrolase fold and α-helix 12. A total of 29 amino acid

(AA) residues were predicted to participate in the dimer formation (Figure 1). Of these AAs are

9 polar, 8 hydrophobic, 8 charged AAs ( 5 negative and 3 positive), as well as 3 proline and a

single glycine (Figure 2), thus all types of AAs are represented. PIADA analysis predicted 188

interactions this represents an over estimation since each interaction was predicted twice, once

for each subunit, and half of the interaction between the subunits can therefore expectedly be

removed once the redundancy of the two data set was confirmed. Resulting is a total of 63 Van

der Waals (VdW), 6 hydrophobic, 20 polar, 5 ionic interaction, totalling 94 unique interaction

predicted by PIADA analysis. The PISA analysis predicted 30 interactions, 24 hydrogen bonds

and 6 salt bridges. Of the 24 hydrogen bonds, 4 are those of the 4 salt bridges detected, hence 24

hydrogen bonds and 4 ionic interaction.

8 http://www.ebi.ac.uk/msd-srv/prot_int/pistart.html

Page 85: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

71

However, the PIADA analysis do not give any information about multiple interactions of the

same type between two defined amino acids, e.g. both back-bone to back-bone and side-chain to

side-chain hydrogen bonding between two amino acids. A curation of the polar and ionic

interaction from both PIADA and PISA identified 34 unique hydrogen bonds and 5 ionic

interactions. A final count of the predicted interactions yields 102 in total: 63 VdW, 6

hydrophobic interactions, 34 polar interaction/hydrogen bonds, and 5 ionic interactions, see

Table II for a list of all non- VdW interactions and Figure 3 for the EstA structure with the

interface highlighted. A possible explanation to the differences in the hydrogen bonds and ionic

interaction predicted by PIADA and PISA can arise from differences in the criteria of detection

interactions. PISA uses a maximum distance of 3.89Å for hydrogen bonds where PIADA uses

4.7Å. For ionic interactions, the distances are 4Å and 6Å for PISA and PIADA respectively. This

can explain why only PIADA predicted the ionic interaction between OE2 of Glu1242 and

NH1of Arg2268, where the distance is 5.9Å, hence outside the maximum range of PISA. This

also explains why PIADA do not predict the symmetric amino acid pair to form an interaction, as

the distance between NH1of Arg1268 and OE2 of Glu2242 is 6.4Å. The differences in maximum

distances also explain all but one of the cases where PISA fails to predict hydrogen bonding,

compared to PIADA. The exception is the interaction between Arg287 and Arg299, as the

minimum distances her are 3.7Å and 3.8Å for the two symmetrical amino acid pairs. In contrast,

why PIADA fail to detect hydrogen bonding between: Asn255 to Asn271, 3.04-3.05Å, Val264 to

Gln285, 3.28-3.46Å, or Thr265 to Asp288, 2.44-2.62Å, which are predicted by PISA is

unknown. It should be noted that, as for the polar interactions, the number of VdW interactions

do not specify if an amino acid participate in more than one VdW interaction, only that two

unique residues form at least one VdW interaction. As such, the 63 VdW interactions predicted

are properly an underestimation. The 29 amino acids selected for further analysis, include all

amino acids predicted by at least one of the methods to participate in at least one interaction

regardless of type.

The thermodynamic contribution to the dimer formation of each of the 29 residues identified

above, was investigated by generating 29 structural models. Each representing an EstA mutant

protein with alanine substitution for each of the 29 residues. The change in thermodynamic

energy of the dimer interface (ΔΔGD) of each alanine substitution, was as calculated as the

difference in ΔG of mutant dimer interface relative to the wild-type dimer interface. The

structural modelling and the thermodynamic calculation, were made in silico with FoldX4, and

Page 86: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

72

residues with a ΔΔGD of less than 2 kcal/mol upon substitution were discarded as possible

targets for in vitro analysis. This left only 10 of the 29 amino acids predicted with PISA and

PIADA analysis, for further analysis (Table III). Unfortunately FoldX4 is unsuitable for stability

modelling above a moderate temperature of 35°C, hence the increased contribution of ionic and

hydrophobic interaction cannot be modelled at relevant temperatures.23 Therefore, the estimation

of the thermodynamic contribution of the hydrophobic and charged amino acids, are likely

underestimated.

Of the 10 residues Glu251 is the only residue predicted to engage in the dimer interface through

VdW interaction alone. It is surprising that substitution of Glu251 with alanine has such a large

effect ( 2.3 kcal/mol ) on the stability of the dimer interface since Glu251 only contribute

through VdW interaction. It is however, possible that this surprisingly large effect of G251A

substitution on the dimer interface does not arise through its direct involvement in the dimer inter

face, i.e. the VdW contact with Asn271. Glu251 is part of a ionic network close to the dimer

interface, where it neutralises Arg26824 (see Table IV for alignment). Arg268 is one of the 10

residues predicted to contribute with more than -2 kcal/mol to the dimer interface, it is possible

that the loss of this negative charge and thus the neutralisation of Arg268, act as a secondary

effect destabilising the dimer interface (Figure 4). Modifications of the interaction between the

two residues, Glu251 and Arg268, and the resulting effect on dimer stability has been tested

elsewhere.24 By substituting Arg268 with glutamate, glycine and lysine, it has been show that the

dimer remains intact and only about 40% was disrupted by 3M urea of for the substitutions of

glycine and glutamate substitution.24 Consequently, E251 and R268 was excluded from in vitro

mutational analysis.

Also among the 10 residues, are the hydrophobic amino acids Val267, Phe269, and Leu292,

located in the core of the interface (Figure 5) and the polar residues Thr265, Asn270, and

Asn271, which participate in side-chain to side-chain hydrogen bonds (Figure 6). As the strength

of both hydrophobic25 and ionic interaction26 increases with temperatures, the hydrophobic and

charged residues are prioritised over polar, as hydrogen bonds become less relevant at the

physiological temperatures, 75-80°C, and above. Hence T265, 270, and 271 was eliminated for

in vitro analysis, since the importance of hydrophobic interactions in dimer interfaces of

homologous thermophilic esterases have been demonstrated elsewhere. By alanine substitutions

of residues homologous to Val267 and Phe269 in a homologue thermophilic esterase (EstE1),

(see Table IV for alignment) the mutations were shown to convert EstE1 to a monomer,

Page 87: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

73

supporting the importance of the hydrophobic core for dimer integrity.15 Likewise, glutamate

substitution of the Phe269 homologous residue also disrupted the dimer, thereby converting

EstE1 to a monomer.15 As a result, all three hydrophobic residues were selected for in vitro

analysis.

The last two of the 10 residues, Asp288 and Arg299, form a dimer spanning salt-bridge. Arg299

was selected for in vitro analysis over Asp288, as this residue had the highest number of

interface spanning interaction, 4 versus 3 (Figure 7), thereby disrupting most of the interactions

in the interaction network consisting of Argx299, Thrx265, Gluy284, Aspy288, Sery291 (Figure

7).

The 4 residues, Val267, Phe269, Leu292, and Arg299, selected for mutational analysis are

involved in 17 of the 34 non-VdW interaction predicted by PIADA and PISA (Table II), hence

by analysing only 4 of the 24 AAs participating in non-VdW interaction 50% of the interaction

are analysed. In fact, a triple alanine substitution of Val267+Phe269+R299, disrupt 16 of 34

dimers spanning non-VdW interaction. The 17 interactions are all based on the side-chain the

mutated residues i.e. no interaction with the backbone of the 4 AAs. This indicates side-chain

connectivity as a possible identifier if residues important for protein-protein interface stability

and as targets for disruptive mutagenesis.

A progressive mutation strategy was selected for studying function of the hydrophobic core in

the dimer interface. Mutants were generated with one, two, or three alanine substitution in the

hydrophobic core. A glutamate substitution of Phe269 was also included as a single point

mutation in the analysis of the hydrophobic core. To analyse the effect of Arg299, the arginine

was substituted with an alanine. Additionally, a triple mutation where Arg299, Val267, and

Phe269 were all substituted to alanine were constructed, as well.

Additional mutants

In addition to the four residues selected for studying the effect of destabilising the dimer

interface of EstA, four additional mutants were also designed. To test the importance of an

intramolecular disulphide bond between Cys101 and Cys103 for stability and activity, a serine

substitution of Cys101 was made. It has previously been shown that the cystine are present when

EstA is expressed from E. coli Shuffle T718, a cell line engineered to facilitate cytoplasmic

Page 88: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

74

cysteine formation in E. coli.27,28 Knowledge about functional importance of this disulphide bond

can have an impact on future choice of expression systems.

As destabilisation of the dimer is expected to lead to a reduced stability of the enzyme, it is

interesting to test whether further stabilisation of the dimer will result in an increase in stability

of EstA. To test this hypothesis, EstA mutants was designed with novel inter-molecular

interaction. An ionic interaction was introduced by substituting Gly261 with aspartate, as a

partner to Arg6 (Figure 8). Introduction of two symmetrical inter-molecular disulphide bond

through cysteine substitution of Ser266 and Asn270 (Figure 9) was similarly tested. A novel

interface spanning disulphide bond on the symmetrical axis of the EstA dimer by substitution of

Ser295 to cysteine was also hoped to stabilise the protein-protein interaction (Figure 10). These

targets were each identified by visual inspection of the dimer interface. The thermodynamic

effect of three mutants designed to stabilise the dimer interface was modelled similar to the other

mutants (Table V). Their effect on the dimer interface, ΔΔGD, is estimated to only 0.7 kcal/mol.

The substitution of Ser295 is modelled to not affect to stability of the esterase significantly,

while both the mutants with substitution of Ser292 and Asn270 for cysteine, and glutamate

substitution for Gly261 are modelled to have a significant effect on the stability of EstA, by -

1.153 kcal/mol and 7.261 kcla/mol. This suggests, making an introduction of a possible

disulphide bond through cysteine substitution of Ser251 and Asn270, to be the best candidate for

improved stability.

The 12 mutants, described above, were generated by SLIM mutagenesis of p32a-EstA_V2 and

the enzyme was expressed and purified from in E. coli ShuffleT7.

Biochemical characterisation of selected EstA mutant proteins

Wildtype EstA and the 12 mutants were successfully purified to homogeneity by IMAC from E.

coli (Figure 11). The oligomeric state of each EstA variant was evaluated by size-exclusion

chromatography (Figure 12A). All variation of EstA except those carrying substitutions of

Arg299, eluted as a single peak, with a retention time, corresponding to a dimer (Figure 12B).

The mutants with Arg299 substitutions elutes with an addition peak of about 20 kDa. The size of

this population seems to increase with additional mutations. Generally, the EstA dimer remain

intact despite the introduction of respective mutants this demonstrates interaction in the EstA

dimer interface is stronger than that of EstE1, a homologous thermophilic esterase identifyed

Page 89: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

75

from metagenomic data. In this esterase homologue, alanine substitution of Val267 and Phe269

or Phe269E substitution result in a complete disruption of the dimer at room temperature.15

The effect of the mutations on the catalytic stability of EstA, was evaluated by heat treatment of

the enzymes at 90°C up to 30 minutes. Here, in contrast to the size-exclusion chromatography,

the differences between different variations of EstA are clear. As predicted by the in silico

modelling, Table V, the stability of the esterase is severely compromised as mutations

accumulates in the hydrophobic core. The strength of the destabilisation for the single

substitutions mutants, shows a correlation with the hydrophobicity of the mutated residue,

Phe269A>Leu292>Val267. The double and triple mutants are both severely compromised and

are almost completely inactive after 5 min incubation at 90°C. Their half-lives are estimated to

be between 0.7-0.9±2.2 minutes (Figure 14A). The substitution of arginine 299 to alanine only

had a moderate effect on EstA stability, compared to the other single point mutants. The half-life

of the R299A mutant is reduced to 10.9±0.6 from 12.3±0.7 minutes of the wild-type. This

indicates that the additional population, observed in size-exclusion chromatography for the

R299A mutant, is unrelated to the stability.

As seen in Table VI, substitution of only one hydrophobic residues in the dimer interface,

correlate with a decrease of at least 54% in half-life at 90°C. This is also in good agreement with

the prediction from the in silico modelling, which predicted a larger effect on the stability

compared to the dimer formation (Table V).

Of the three mutants, with substitutions aimed to stabilise the dimer interface, only substitution

of S295 by cysteine resulted in an increased catalytic stability, by increasing the half-life of the

esterase from 12.3±0.68 to 20±1.64 minutes. It was shown by SDS-PAGE analysis in the

presence and absence of DTT, that the mutation successfully has introduced a stable inter-

molecular disulphide bond. It is therefore likely, that the increase in catalytic stability, is a result

of this stabilisation of the dimer interface through this covalent bond. This demonstrates the

importance of the dimer formation on the stability of EstA. Cross-dimer bonds has also

previously been identified as a method for stabilising esterases, here the disulphide bond was the

result of random mutagenesis.16

Page 90: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

76

The role of dimer formation in EstA

From our results, we can see that the introduction of an inter-molecular disulphide bond

increases the stability of EstA. It is unlikely that this stabilisation arises from an increase in intra-

molecular interactions, i.e. that cysteine substitution of Ser295, resulted in additional stabilising

interaction compared to the original serine residue. The stabilisation is most likely a result of a

stabilisation of the dimer formation and the protection the dimer provides. However, it is also

possible, that as a result of conformational constrains placed on the unfolded state, due to the

covalent bonding of the two peptides, it increased the stability of the folded structure.

As we were unable to obtain a monomeric EstA mutant protein, through mutagenesis, it is

difficult to determine if there is a stabilising effect of the dimer formation. However, it is

possible the mutations introduced a heat liability to the dimer interface, i.e. the dimer can remain

intact at ambient temperature, 25°C, while the dimer at a higher temperature e.g. above 75°C is

disrupted. If this is the case, the reduction of catalytic stability observed in vitro, can be the

result of monomerization. To test the hypothesis, dynamic light scattering experiment was

attempted without any conclusive results (data not shown).

In a paper from 2012, Mei et al. show data for EstA in a monomeric state when expressed as a

thioredoxin (Trx) fusion protein. This version of the enzyme remains as a monomer (EcEstA-M)

after removal of Trx-tag. There are however differences between the sequence of the EstA used

here and EcEstA-M used by Mei et al. The differences are limited to the N- and C-terminal of

the protein. But as both the termini are close to the dimer interface, it is possible these

differences can interfere with the dimer formation. In any case, here the monomeric EstA was

reported to have its stability significantly reduced, as the stability of the natively expressed and

dimeric EstA was 32 fold higher. We have previously shown that EstA expressed from E. coli as

a dimer, is relatively more stable compared to the monomeric EstA, while the stability of

natively expressed EstA is only 1.27 fold higher than that of dimeric EstA expressed from E.

coli.18 These results point towards a significant effect of dimer formation, on the stability of

EstA.

One could also argue whether disrupting the dimer, is a valid way of studying the stabilising

effect of oligomerisation. E.g. if one is successfully in disrupting the dimer formation with only a

few amino acid substitutions, the dimer interface will now be exposed to the solvent. The dimer

interface has evolved as a “buried” surface, i.e. optimised for protein-protein interaction and not

Page 91: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

77

protein-solvent interaction. It is therefore highly likely, that because of the interfaces native

environment, protein-protein, that the contact between the interface and solvent, due to

monomerisation/dimer disruption, will be less thermodynamic favourable, ΔG>0, than regular

solvent exposed surface. Hence reducing the Gibb free energy between native and denatured

conformation, ΔGFOLD.

As a result, the loss of the stabilising effect of oligomerisation, has to be considered in the

combination of the possible destabilising effect of exposing the protein-protein interface to the

solvent. An experimental design as the one used here, will be unable to provide a clear answer

about the stabilising effect of the oligomerisation, due to this double destabilising effect upon

conversion to monomeric state. In contrast, with an experimental design where the stabilising

effect of the dimer is evaluated by constructive and not disruptive amino acid substitution, it is

possible to determine if there is a stabilising effect of the oligomerisation, if the mutants are

designed to only strengthen the dimer interface and not protein fold directly. As in the case of the

introduction of a novel inter-molecular disulphide bone presented here.

Conclusion

The dimer interface of EstA is not stabilised by hydrophobic interaction to the same extent as the

hyperthermophilic esterase EstE1. EstA retain it dimeric state despite elimination of the

hydrophobic core and an ionic interaction network in the dimer interface. The stability of the

EstA dimer interface is clearly demonstrated by the triple mutant V267A+F269A+R299A, where

~47% of the dimer spanning non-VdW interaction in the interface are disrupted, still maintain a

dimeric state at ambient temperature. Regardless of the stability of the dimer interface the

mutants stability was compromised when heat treated at 90°C possibly indication a stabilising

effect by protein-protein interactions. We were able to demonstrate a stabilising effect of protein

oligomerisation, through stabilisation of the dimer interface by a novel inter-molecular

disulphide bonds resulting from a cysteine substitutions of Ser295, resulting in an increase of

EstA half-life by 1.63 fold. Together these results show the important of dimer formation for the

stability of EstA.

References

1. Eijsink, V. G. H., Gåseidnes, S., Borchert, T. V. & van den Burg, B. Directed evolution of

enzyme stability. Biomol. Eng. 22, 21–30 (2005).

2. Cobb, R. E., Chao, R. & Zhao, H. Directed Evolution: Past, Present and Future. AIChE J.

59, 1432–1440 (2013).

Page 92: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

78

3. Vieille, C. & Zeikus, G. J. Hyperthermophilic Enzymes: Sources, Uses, and Molecular

Mechanisms for Thermostability. Microbiol. Mol. Biol. Rev. 65, 1–43 (2001).

4. Fiala, G. & Stetter, K. O. Pyrococcus furiosus sp. nov. represents a novel genus of marine

heterotrophic archaebacteria growing optimally at 100oC. Arch. Microbiol. 145, 56–61

(1986).

5. Blöchl, E. et al. Pyrolobus fumarii, gen. and sp. nov., represents a novel group of archaea,

extending the upper temperature limit for life to 113 degrees C. Extremophiles 1, 14–21

(1997).

6. Kengen, S. W., Luesink, E. J., Stams, A. J. & Zehnder, A. J. Purification and

characterization of an extremely thermostable beta-glucosidase from the

hyperthermophilic archaeon Pyrococcus furiosus. Eur. J. Biochem. 213, 305–12 (1993).

7. Haki, G. Developments in industrially important thermostable enzymes: a review.

Bioresour. Technol. 89, 17–34 (2003).

8. Petsko, G. A. Structural basis of thermostability in hyperthermophilic proteins, or ‘there’s

more than one way to skin a cat’. Methods Enzymol. 334, 469–78 (2001).

9. Szilágyi, A. & Závodszky, P. Structural differences between mesophilic, moderately

thermophilic and extremely thermophilic protein subunits: results of a comprehensive

survey. Structure 8, 493–504 (2000).

10. Fukuchi, S. & Nishikawa, K. Protein surface amino acid compositions distinctively differ

between thermophilic and mesophilic bacteria. J. Mol. Biol. 309, 835–43 (2001).

11. Nakashima, H., Fukuchi, S. & Nishikawa, K. Compositional changes in RNA, DNA and

proteins for bacterial adaptation to higher and lower temperatures. J. Biochem. 133, 507–

13 (2003).

12. Ece, S., Evran, S., Janda, J.-O., Merkl, R. & Sterner, R. Improving thermal and detergent

stability of Bacillus stearothermophilus neopullulanase by rational enzyme design. Protein

Eng. Des. Sel. 28, 147–151 (2015).

13. Maugini, E., Tronelli, D., Bossa, F. & Pascarella, S. Structural adaptation of the subunit

interface of oligomeric thermophilic and hyperthermophilic enzymes. Comput. Biol.

Chem. 33, 137–148 (2009).

14. Tanaka, Y. et al. How oligomerization contributes to the thermostability of an archaeon

protein. Protein L-isoaspartyl-O-methyltransferase from Sulfolobus tokodaii. J. Biol.

Chem. 279, 32957–67 (2004).

15. Byun, J.-S. et al. Crystal structure of hyperthermophilic esterase EstE1 and the

relationship between its dimerization and thermostability properties. BMC Struct. Biol. 7,

47 (2007).

16. Cao, L. C., Chen, R., Xie, W. & Liu, Y. H. Enhancing the Thermostability of Feruloyl

Esterase EstF27 by Directed Evolution and the Underlying Structural Basis. J. Agric.

Food Chem. 63, 8225–8233 (2015).

17. Mei, Y. et al. Exceptional thermal stability and organic solvent tolerance of an esterase

expressed from a thermophilic host. Appl. Microbiol. Biotechnol. 93, 1965–74 (2012).

18. Stiefler-Jensen, D. et al. The extraordinary thermal stability of EstA from S. islandicus is

independent of post translational modifications. PhD thesis - Stability of

Hyperthermophilic Proteins - Post-translational modifications and other factors

(University of Copenhagen, 2016).

19. Chiu, J., March, P. E., Lee, R. & Tillett, D. Site-directed, Ligase-Independent

Mutagenesis (SLIM): a single-tube methodology approaching 100% efficiency in 4 h.

Nucleic Acids Res. 32, e174 (2004).

20. Chiu, J., Tillett, D., Dawes, I. W. & March, P. E. Site-directed, Ligase-Independent

Page 93: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

79

Mutagenesis (SLIM) for highly efficient mutagenesis of plasmids greater than 8kb. J.

Microbiol. Methods 73, 195–8 (2008).

21. Mihel, J., Sikić, M., Tomić, S., Jeren, B. & Vlahovicek, K. PSAIA - protein structure and

interaction analyzer. BMC Struct. Biol. 8, 21 (2008).

22. Krissinel, E. & Henrick, K. Inference of macromolecular assemblies from crystalline

state. J. Mol. Biol. 372, 774–97 (2007).

23. Schymkowitz, J. et al. The FoldX web server: An online force field. Nucleic Acids Res.

33, 382–388 (2005).

24. Angkawidjaja, C., Koga, Y., Takano, K. & Kanaya, S. Structure and stability of a

thermostable carboxylesterase from the thermoacidophilic archaeon Sulfolobus tokodaii.

FEBS J. 279, 3071–3084 (2012).

25. Baldwin, R. L. Temperature Dependence of the Hydrophobic Interaction in Protein

Folding. Proc. Natl. Acad. Sci. 83, 8069–8072 (1986).

26. Elcock, A. H. The stability of salt bridges at high temperatures: implications for

hyperthermophilic proteins. J. Mol. Biol. 284, 489–502 (1998).

27. Chen, J. Chaperone Activity of DsbC. J. Biol. Chem. 274, 19601–19605 (1999).

28. de Marco, A. Strategies for successful recombinant expression of disulfide bond-

dependent proteins in Escherichia coli. Microb. Cell Fact. 8, 26 (2009).

Figures

Figure legends

Figure 1 Distribution of amino acids in dimer interface by method.

Figure 2 Amino acids distribution by type

Figure 3 EstA Dimer interface based on PIADA analysis. Dashed lines are polar interactions,

max 4Å.

Figure 4 Ionic interaction network of close to the dimer interface of EstA. A| Wild type EstA

with Glu25 and Arg2681. The charge of Glu251 and Arg268 neutralises each other allowing the

Arg268 of each subunit to remain in close proximity. B| Mutant EstA with Ala251 and Arg268.

Arg268 is not neutralised causing the positive charge of Arg268 of each of the 2 subunits to repel

each other destabilising the dimer interface.

Figure 5 Hydrophobic amino acids in the EstA dimer interface, based on PIADA analysis. In the

centre, the hydrophobic core consisting of V267, P269, and L292 can be seen.

Figure 6 Dimer spanning Side-chain to side-chain interaction of Thr265, Asn270, and Asn271

Figure 7 Polar and ionic side chain interactions on helix 12 and around R299 of EstA.

Page 94: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

80

Figure 8 Glycine 261 substitution to aspartate together with other charged residues in the close

vicinity. Arginine 6 and glutamate 9 of the adjacent subunit. Glycine 261 is coloured red, a D261

is pink. The four residues are represented by a stick model.

Figure 9 S266 and N270 of the EstA dimer. The four residues are represented by a stick model.

Figure 10 S295 of both subunits of EstA. The two residues are represented by a stick model.

Figure 11 SDS-PAGE analysis of purified of EstA and mutants. 12% SDS-PAGE, 0.78 µg

enzyme per lane.

Figure 12 Size exclusion chromatography of EstA and mutants. Left panel| Chromatograms.

Right panel| size estimation. Column: Superdex200 10/300, flow 750 ml/min. Buffer: NaH2PO4

50 mM, NaCl 150 mM, pH 7.8.

Figure 13 SDS-PAGE analysis of Disulphide bond dependent dimerization. WT:1 & 5, S295C:

2 & 6, S266CN270C:3 & 7, C101S: 4 & 8. 12 % SDS-PAGE, 2 µg protein per lane

Figure 14 Residual activity of EstA and mutants after incubation at 90°C. 50 mM NaH2PO4, 25

mM Sarkosyl, 10% v/v 2-propanol, 1 mM p-nitrophenol hexanoate. A| Wild-type and mutants

with mutation targeting the dimer-interface for disruption. B| Wild-type and mutants with

mutation aimed at future stabilization of the dimer interface. ◆ WT, ○ V267A, ● F269A, ◐L292A,

◑R299A, ◓ F269E, ◒ VF, □ VFL, ■ VFR, ◧ G261D, ◨ SN, ▲ S295C.

Page 95: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

81

Figures

Figure 1

Figure 2

Figure 3

Page 96: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

82

Figure 4

Figure 5

Figure 6

Page 97: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

83

Figure 7

Figure 8

Figure 9

Figure 10

Page 98: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

84

Figure 11

Figure 12

Figure 13

Page 99: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

85

Figure 14

Page 100: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

86

Tables

Table I Primer for mutagenesis of EstA by SLIM. C101-X-F GTCGTAGTATCAGTTGATTATAGGT

C101-Y-F ACCAATGCATCCAACTGTGTCGTAGTATCAGTTGATTATAGGT

C101-Y-R GATAGCTCTACACAATGGGTC

C101-X-R ACAGTTGGATGCATTGGTGATAGCTCTACACAATGGGTC

G261-X-F TACCAGTGTTAGGTTCAATAACGTT

G261-Y-F CAAGCTGACGTACCGGTTACCAGTGTTAGGTTCAATAACGTT

G261-Y-R TAGTAATCTGTTCGCATAAGCTTCT

G261-X-R ACCGGTACGTCAGCTTGTAGTAATCTGTTCGCATAAGCTTCT

S266C+N270C-X-F GTTATACACGGATTCCTTTCATTTTTCCCGTT

S266C+N270C-Y-F TGCGTTAGGTTCTGCAACGTTATACACGGATTCCTTTCATTTTTCCCGTT

S266C+N270C-Y-R GGTAACCGGTACTCCAGCTTGTAGT

S266C+N270C-X-R GTTGCAGAACCTAACGCAGGTAACCGGTACTCCAGCTTGTAGT

V267/F269-X-F# ACGTTATACACGGATTCCTTTC

V267A-Y-F CCAGTGCAAGGTTCAATAACGTTATACACGGATTCCTTTC

F269A-Y-F CCAGTGTTAGGGCTAATAACGTTATACACGGATTCCTTTC

F269E-X-F CCAGTGTTAGGGAGAATAACGTTATACACGGATTCCTTTC

V267A+F269A-Y-F CCAGTGCAAGGGCTAATAACGTTATACACGGATTCCTTTC

V267/F269-Y-R# TAACCGGTACTCCAGCTTGT

V267A-X-R TATTGAACCTTGCACTGGTAACCGGTACTCCAGCTTGT

F269A-X-R TATTAGCCCTAACACTGGTAACCGGTACTCCAGCTTGT

F269E-X-R TATTCTCCCTAACACTGGTAACCGGTACTCCAGCTTGT

V267A+F269A-X-R TATTAGCCCTTGCACTGGTAACCGGTACTCCAGCTTGT

L292A-X-F TGTTTTAAGACGCACATTTTATGAT

L299A-Y-F TATAAGTGCGATAGGCTCTGTTTTAAGACGC

L292A-Y-R GCGTCTCTTCCCTGCTCTAT

L292A-X-R GAGCCTATCGCACTTATAGCGTCTCTTCCCTGCT

S295C-X-F GACGCACATTTTATGATAAAAGT

S295C-Y-F AATAGGCTGCGTTTTAAGACGCACATTTTATGATAAAAGT

S295C-Y-R AAACTTATAGCGTCTCTTCCCT

S295C-X-R TTAAAACGCAGCCTATTAAACTTATAGCGTCTCTTCCCT

R299A-X-F TGATAAAAGTGCGG

R299A-Y-F TTTAAGAGCAACATTTTATGATAAAAGTGCGG

R299A-Y-R ACAGAGCCTATTAAACTTATAGCGT

R299A-X-R TAAAATGTTGCTCTTAAAACAGAGCCTATTAAACTTATAGCGT

Each mutant is made the two set of primer pairs, set ‘X’ and set ‘Y’. The PCR

product for the individual reaction are mixed, and the resulting hybridisation

product are two self-circularisation linear vectors. The primer name indicates

mutation, primer set, and direction, each separated with a ‘-‘. (#) These primers are

used to generate four separate mutants, V267A, F269A, F269E, and

V267A+F269A.

Table II. Non-Van der Waals interaction of the dimer Interface

Hydrophobic (6) Ionic (5)

[Atoms] Hydrogen Bonds (34)

Pro15-Leu2258

Leu1258-Pro25

Val1267-Val2267

Val1267-Phe2269

Phe1269-Val2267

Leu1292-L2292

Glu1242-Arg2268

Glu1284-Arg2299

Asp1288-Arg2299

Arg1299-Glu2284

Arg1299-Asp2288

Backbone to backbone

(6)

Side-chain to backbone

(11)

(Not directional)

Side-chain to side-chain (17)

Arg16-Leu2258

Leu1258-Arg26 Ser1266-Asp2270

Arg1268-Arg2268

Arg1268-Arg2268 Asp1270-Ser2266

Arg16-Gly2261

Arg16-Val2262 Gly1261- Arg26

Val1262-Arg26

Val1264-Gln2285 Gln1285-Val2264

Arg1287-Arg2299

Ser1291-Ser2295 Ser1295-Ser2291

Ser1295-Leu2292

Arg1299-Arg2287

Tyr1721-Asp2288

Asn1255-Asn2271 Thr1265-Asp2288

Ser1266-Asn2270

Asn1270-Ser2266 Asn1271-Asn2255

Glu1284-Arg2299

Asp1288-Tyr2172 Asp1288-Thr2265

Asp1288-Arg2299

Asp1288-Arg2299

Ser1291-Arg2299

Ser1295-Ser2295

Arg1299-Ser2291

Arg1299-Glu2284

Arg1299-Asp2288

Arg1299-Asp2288

Subunit is marked by a subscript number. Bold: Mutagenesis targets, underlined: Part of salt bridge

Page 101: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

87

Table III. Effect of alanine substitution.

Residues ΔΔGS [kcal/mol] ΔΔGD [kcal/mol] ΔΔG𝐷

ΔΔG𝑆⁄

E251 2.425 2.2992 95%

T265 6.503 4.4011 68%

V267 7.163 3.8166 53%

R268 3.934 2.9045 74%

F269 11.077 5.1797 47%

N270 2.095 2.2895 109%

N271 3.6 2.1606 60%

D288 1.495 3.7114 248%

L292 7.353 2.7275 37%

R299 1.558 3.0598 196%

:**:** :** :.*. :. : *.: .* : : : : :.: .*: ** * .*.**** ** . .::* *

>SiRe_EstA MPLDPRIKELLESGFIVPIGKASVDEVRKIFRQ--LASAAPKVEVGKVEDIKIPGSEANINARVYLPKANGPYGVLIYLH

>St-EstA -MIDPKIKKLLESTIQLPIGKASVEEIRSLFKQ--FSSLTPREEVGKIEDITIPGSETNIKARVYYPKTQGPYGVLVYYH

>EstE1 MPLDPQIKPILERIRALSIAASPQELRRQVEEQSRLLTAAVQEPIAETRDVHIPVSGGSIRARVYFPKKAAGLPAVLYYH

1........10........20........30........40........50........60........70........80

*****:*.:*::* :** ::. .:.*.:********* ***:** *:: * :** :. :::. . . :*:.*********

>SiRe_EstA GGGFVIGDVESYDPLCRAITNACNCVVVSVDYRLAPEYKFPSAVIDSFDATNWVYNNLDKFDGKMG-VAIAGDSAGGNLA

>St-EstA GGGFVLGDIESYDPLCRAITNSCQCVTISVDYRLAPENKFPAAVVDSFDALKWVYNNSEKFNGKYG-IAVGGDSAGGNLA

>EstE1 GGGFVFGSIETHDHICRRLSRLSDSVVVSVDYRLAPEYKFPTAVEDAYAALKWVADRADELGVDPDRIAVAGDSAGGNLA

Δ

.........90........100.......110.......120.......130.......140.......150.......160

**.::*.: : :* *:****.*.: :. *: : .: *. * : *** ***: : **: **:** **..

>SiRe_EstA AVVALLSK--GKLNLKYQILIYPAVGFDSVS-RSMIEY--SDGFFLTREHIEWFGSQYLRSPADLLDFRFSPILAQDLSG

>St-EstA AVTAILSKK-ENIKLKYQVLIYPAVSFDLIT-KSLYDN--GEGFFLTREHIDWFGQQYLRSFADLLDFRFSPILA-DLND

>EstE1 AVVSILDRNSGEKLVKKQVLIYPVVNMTGVPTASLVEFGVAETTSLPIELMVWFGRQYLKRPEEAYDFKASPLLA-DLGG

.........170.......180.......190.......200.......210.......220.......230.......240

*****::***:*****:** ** :: :* ..:*** .::***:**:*::: **:*:.* . :* : .

>SiRe_EstA LPPALIITAEYDPLRDQGEAYANRLLQAGVPVTSVRFNNVIHGFLSFFPLIEQGRDAISLIGSVLRRTFYDKS

>St-EstA LPPALIITAEHDPLRDQGEAYANKLLQSGVQVTSVRFNNVIHGFVSFFPFIEQGRDAIGLIGYVLRKVFYGK-

>EstE1 LPPALVVTAEYDPLRDEGELYAYKMKASGSRAVAVRFAGMVHGFVSFYPFVDAGREALDLAAASIRSGLQPS-

ϵ^ ^ϵ ^ ϵ ^

.........250.......260.......270.......280.......290.......300.......310. Table IV Aligment of 3 Hyper thermophilic esterases. SiRe_EstA, EstA from S. islandicus. St-EstA, esterase from

S. tokodaii. EstE1, esterase from metagenomics data. Δ:Cystine 101,

^:Valine267,Phenylalanine269, Leusine292, ^: Arganine299, ϵ:Serine266 & 295,

Asparagine270. The alignment was generated whit ClustalX 2.1. Bold residues are in the dimer interface.

Table V. Stability of mutants

Amino acid substitutions ΔΔGS [kcal/mol] ΔΔGD [kcal/mol] C101S 1.958 -0.0068*

G261D 7.261 0.2642*

S266C+N270C -1.153 0.7103

F269E 12.667 4.9871

S295C 0.425* 0.5702

V267A 7.171 3.8178

F269A 11.062 5.1728

L292A 7.293 2.7467

V267A+F269A 15.943 7.3473

V267A+F269A+L292A 21.209 9.4451

V267A+F269A+R299A 18.116 10.791

R299A 1.418 2.9559

*Not significant

Page 102: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

88

Table VI. Enzymatoc half-life at 90°C

Mutant: Half-life [min]

Wild-type 12.31±0.68

V267A 5.33±0.14

F269A 1.64±0.55

L292A 5.69±0.73

R299A 10.86±0.55

F269E 1±0.46

V267A+F269A 0.89±0.51

V267A+F269A+L292A 0.75±1.88

V267A+F269A+R299A 0.69±2.17

C101S 12.99±0.71

G261D 9.23±0.6

S266C+N270C 12.73±1.34

S295C 20.05±1.64

Page 103: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

89

Chapter 3

Here, in the third and final manuscript, where we highlight some of the pitfalls associated with

using the classically esterase assays at high temperature and suggest a set of optimized

conditions better suited for temperatures relevant for hyperthermophilic enzymes.

The manuscript in formulates as a technical note of maximum 2400 words for submission to

Analytical Biochemistry: Methods in the Biological Sciences. The manuscript is still in

preparation.

Page 104: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

90

An esterase activity assay suitable for use at temperatures

up to 90 °C Daniel Stiefler-Jensen1, Troels Schwarz-Linnet2, Qunxin She1, and Kaare Teilum2,*

1Archaea Centre, Department of Biology, University of Copenhagen, Denmark 2Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen,

Denmark

*Correspondence: [email protected]

Abbriviations

pNPE p-Nitrophenyl esters

pNPH p-Nitrophenyl hexanoate

SDS Sodium dodecyl sulfate

Abstract

The hydrolysis of p-nitrophenol hexanoat is often used as an assay for esterase activity.

However, the common protocols are not optimized for high temperatures. Here we highlight

several aspects that need consideration when designing a temperature stable p-nitrophenol

hexanoat assay for esterase activity. The conditions (50 mM sodium phosphate, 1% Triton X-

305, 10% 2-propanol, 1 mM p-nitrophenol hexanoate, pH 7) we found to be optimal resulted in

an assay that is functional between 20-90°C and suitable for real-time data collection.

Introduction

Thermophilic and hyperthermophilic organisms are natural repositories of enzymes with high

stability. As many biotechnological applications of enzymes require elevated temperatures, the

interest in isolating and characterizing enzymes from these organisms is increasing. Esterases

and their cousins lipases are enzymes often used in industrial application as they catalyse

reactions that may replace traditional organic chemical reactions. Several reports on thermophilic

esterases and lipases use activity assays developed for analysis of mesophilic enzymes, with

optimal temperatures usually between 20°C and 40°C [1–3]. In contrast, hyperthermophilic

enzymes may have activity maxima at temperatures above 75 °C and still maintain activity

above 90 °C. For prober characterisation, especially determination of temperature-activity

profiles, an assays with a stable performance over a wide temperature range e.g. 20-90°C is

required. Here we will highlight some of the pitfalls associated with using the classically esterase

Page 105: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

91

assays at high temperature and suggest a set of optimized conditions better suited for these

temperatures.

Esterase activity is often measured by following the hydrolysis of p-nitrphenyl esters (pNPE),

which results in a p-nitrophenoxide product that has an absorption maxima at 410 nm [4,5]. p-

nitrophenoxide is the deprotonated state of p-nitrophenol, which has pKa of 7.159 [6]. The

titration results in a spectral change with an isosbestic point of 348 nm, and at pH values below

7.15 the extinction coefficient is low. For the sensitivity of the assay it is therefore desirable to

work at high pH. However not all enzymes are stable or active at this pH. In addition,

spontaneous hydrolysis of pNPEs become a large problem when pH increases above 8,

especially for short alkyl chains.

The pKa of all buffers are to some extent temperature dependent. The selection of a buffer must

therefore not only be based on the working pH but also on the temperature range the buffer is

expected to operate within. For the selection of a working pH these four factors need to be

considered: the substrate stability, the signal strength, the enzyme activity, and temperature

dependency of the buffer pKa.

As a consequence of the amphipathic nature of pNPE they will form micelles in aqueous

solutions. Micelle formation will limit the substrate availability and increase light scattering. To

counteract the micelle formation, detergents and/or organic solvents are often used to make an

emulsion. In some assys a mixture of Triton X-100 and Arabic gum is used [1,7].

Some non-ionic detergents have an inverse correlating between solubility and temperature,

meaning that above a temperature threshold, known as cloud point, the solution turns

turbid/opaque which results in a drastically increase in light scattering. Generally non-ionic

detergents have low cloud points compared to ionic detergents. Therefore, many non-ionic

detergents are not suitable for assays preformed at high temperatures, e.g. Triton X-100 and

Tween 20 have cloud points of 65 °C and 76 °C respectively.

Light scattering is not only arising from the detergent. As the reaction solution is an emulsion, it

has a certain emulsifying capacity. If the concentration of the micelle-forming component

surpasses this capacity, micelles will form and result in light scattering. Hence, the substrate

9 National Center for Biotechnology Information. PubChem Compound Database; CID=980,

https://pubchem.ncbi.nlm.nih.gov/compound/980 (accessed Jan. 7, 2016).

Page 106: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

92

concentration needs to be optimised for the highest possible concentration for e.g. Michaelis-

Menten kinetics while avoiding micelle formation.

Result and Discussion

We were motivated to optimizing the pNPE hydrolysis assay because we wished to characterize

an esterase (EstA) from the hyperthermophilic and acidophilic archaea Sulfolobus islandicus that

lives in acidic volcanic hot springs at temperatures around 75-80°C and pH ~3[8]. The optimal

conditions for pNPE hydrolysis catalysed by EstA is 75°C [9] and pH 8 [1].

The initial conditions we used for assaying the activity was 50 mM TrisHCl, pH 8.0, 0.4% Triton

X-100 and 0.1% Arabic gum with p-nitrophenyl hexanoate dissolved in 2-propanol as the

substrate, which are identical to the conditions suggested by Pencreac’h and Barratti [5]

Although a pH-value of 8 would be desirable, p-nitrophenyl hexanoate hydrolyses spontaneously

at pH 8, which at high temperatures results in a prohibitively high background. The rate of auto

hydrolysis is reduced by 80% using pH 7 instead of pH 8. This comes at a cost of a 40% reduced

extinction coefficient at 410 nm. (Figure S1 & Figure S2). We chose to use 50 mM sodium

phosphate to buffer the assay at pH 7 as the pKa temperature coefficient (dpKa/dT) for phosphate

is only -0.0028 which is much smaller than for the suitable alternatives BisTris, MOPS, and

HEPES that have dpKa/dT between -0.011 and -0.017 [10]. In the temperature interval from 20

to 90 °C where we performed the assay, the pH of a phosphate buffer will only decrease by 0.2

pH units.

The use of Triton X-100 and Arabic gum as emulsifiers to avoid that pNPEs forms micelles is

problematic at high temperatures as Triton X-100 has a could point of 66 °C. As alternatives to

0.4% Triton X-100 we tried another non-ionic detergent, 1% Triton X-305, and an anionic

detergent, 25 mM lauroyl sarcosinate (Sarkosyl), which both have cloud points above 100 °C

and make stable emulsions of 1 mM p-hydroxyl hexanoate even without the presence of Arabic

gum (Figure S4), see Table 1 information on different detergents. The effect of the different

detergents on the measured enzyme activity is show in Figure 1. Whereas all three detergents

give useful data at 40 °C it is clear that nothing can be measured at 90 °C when Triton X-100 is

used. The missing change in measured absorbance is a result of high background from light

scattering when passing the cloud point for Triton X-100, and not due to inactivation or

inhibition of the enzyme as the reaction still turns yellow. In chosing between Triton X-305 and

Page 107: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

93

Sarkosyl it is important to consider the stronger denaturating effect of Sarkosyl, which may

compromise the activity of the enzyme.

The highest quality data is obviously obtained if traces of the absorbance as a function of time

can be recorded. If, however, it is necessary to stop the reaction before measuring the absorbance

(e.g. if the spectrophotometer or plate reader has no temperature control) we suggest to stop the

reaction by addition of 0.2-1.8% SDS, which will denature the enzyme [7,11].

In addition to the consideration about the chemical composition if the esterase assays the way the

assay is mixed is also important. As both the detergents and organic solvents in the assay affect

the surface tension, regular pipettes are difficult to handle reproducible. For this reason, we

recommend the use of automatic dispensing pipettes that extract an excess volume.

Thermophilic and hyperthermophilic enzymes are typically restricted in their catalytic ability at

moderate temperatures, e.g. EstA has a temperature optimum of 75°C and only retain about 20%

activity at 40°C [9], which is believed to be the result of the evolution of a protein structure with

optimal functional flexibility close to the growth temperature of the organism it originates from

[12]. Working at high temperatures is usually not possible unless specialised equipment is

available. The equipment we found best suited for the assay at high temperatures were a

spectrophotometer with a Peltier temperature control unit. With this setup, we able to obtain

reproducible data up to 90 °C. Above this temperature the buffer starts to boil. For high

throughput analysis, we have used plate readers at 40°C.

In conclusion, with the optimized conditions, 50 mM Na phosphate, (25 mM Sarkosyl or 1%

Triton X-305), up to 1 mM p-Nitrophenyl hexanoate, 10% v/v 2-propanol, pH 7, and careful

preparation of the assay we were able to acquire high quality and reproducible activity data on

EstA in the temperature interval between 20 °C and 90 °C.

Tips & tricks

Use auto pipette whenever possible, due to the surfactant and 2-propanol in the buffer

modifies the surface tension. As a result, we obtained more consistent results with the use

of auto-pipets where effects coursed by surface tension can be omitted. i.e. with a pipet

capable of dispensing, retrieve 1 ml and dispense 3x 250 µl and discard the remaining

volume.

Page 108: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

94

Avoided vigorous mixing of the buffer to reduce the formation of bubbles and foam, due

to the presents of the surfactant.

The substrate should be prepared as late as possible. First dilute the substrate in e.g. 2-

propanol and then add the dilution gently to the buffer and mix by gently inverting.

Start the reaction by adding enzyme, this allows the substrate buffer mix to become clear

after mixing.

Avoided stopping reaction with base, acid, or PMSF. Use strong detergents e.g. SDS if

necessary.

Use boil proof plastic ware.

Suggested assay conditions

50 mM NaH2PO4, 25 mM Sarkosyl, up to 1 mM p-Nitrophenyl hexanoate, 10% v/v 2-

propanol, pH 7

50 mM NaH2PO4, 1% Triton X-305, up to 1 mM p-Nitrophenyl hexanoate, 10% v/v 2-

propanol, pH 7

Assay volumes between 250-1000 µl. Enzyme concentrations ~500 nM. Temperatures between

20-90°C.

References

[1] Y. Mei, N. Peng, S. Zhao, Y. Hu, H. Wang, Y. Liang, Q. She, Exceptional thermal

stability and organic solvent tolerance of an esterase expressed from a thermophilic host.,

Appl. Microbiol. Biotechnol. 93 (2012) 1965–74. doi:10.1007/s00253-011-3504-z.

[2] L.C. Cao, R. Chen, W. Xie, Y.H. Liu, Enhancing the Thermostability of Feruloyl Esterase

EstF27 by Directed Evolution and the Underlying Structural Basis, J. Agric. Food Chem.

63 (2015) 8225–8233. doi:10.1021/acs.jafc.5b03424.

[3] K.H. Nam, M.Y. Kim, S.J. Kim, A. Priyadarshi, W.H. Lee, K.Y. Hwang, Structural and

functional analysis of a novel EstE5 belonging to the subfamily of hormone-sensitive

lipase, Biochem. Biophys. Res. Commun. 379 (2009) 553–556.

doi:10.1016/j.bbrc.2008.12.085.

[4] U.K. Winkler, M. Stuckmann, Glycogen, hyaluronate, and some other polysaccharides

greatly enhance the formation of exolipase by Serratia marcescens, J. Bacteriol. 138

(1979) 663–670.

[5] G. Pencreac’h, J.C. Baratti, G. Pencreach, J.C. Baratti, Hydrolysis of p-nitrophenyl

palmitate in n-heptane by the Pseudomonas cepacia lipase: A simple test for the

determination of lipase activity in organic media, Enzyme Microb. Technol. 18 (1996)

417–422. doi:10.1016/0141-0229(95)00120-4.

[6] D.D. Perrin, B. Dempsey, E.P. Serjeant, pKa Prediction for Organic Acids and Bases,

Page 109: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

95

Springer Netherlands, Dordrecht, 1981. doi:10.1007/978-94-009-5883-8.

[7] S. Kim, S.B. Lee, Thermostable esterase from a thermoacidophilic archaeon: purification

and characterization for enzymatic resolution of a chiral compound., Biosci. Biotechnol.

Biochem. 68 (2004) 2289–2298. doi:10.1271/bbb.68.2289.

[8] W. Zillig, A. Kletzin, C. Schleper, I. Holz, D. Janekovic, J. Hain, M. Lanzendörfer, J.K.

Kristjansson, Screening for Sulfolobales, their Plasmids and their Viruses in Icelandic

Solfataras, Syst. Appl. Microbiol. 16 (1994) 609–628. doi:10.1016/S0723-

2020(11)80333-4.

[9] D. Stiefler-Jensen, T. Schwarz-Linnet, C. de Lichtenberg, T. T. T. N. Nguyen, K. D.

Rand, L. Huang, K. Teilum, Q. She, The extraordinary thermal stability of EstA from S.

islandicus is independent of post translational modifications, University of Copenhagen,

2016.

[10] P. Link, Biological Buffers, Applichem. (2008) 1–20. www.aplichem.com.

[11] P.Y. Li, P. Ji, C.Y. Li, Y. Zhang, G.L. Wang, X.Y. Zhang, B. Bin Xie, Q.L. Qin, X.L.

Chen, B.C. Zhou, Y.Z. Zhang, Structural basis for dimerization and catalysis of a novel

sterase from the GTSAG motif subfamily of the bacterial hormone-sensitive lipase family,

J. Biol. Chem. 289 (2014) 19031–19041. doi:10.1074/jbc.M114.574913.

[12] P. Závodszky, J. Kardos, Svingor, G.A. Petsko, Adjustment of conformational flexibility

is a key event in the thermal adaptation of proteins., Proc. Natl. Acad. Sci. U. S. A. 95

(1998) 7406–11. doi:10.1073/pnas.95.13.7406.

Page 110: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

96

Figures

Figure legends

Figure 1 Comparison of buffer performance at 40°C and 90°C. The activity of EstA was tested in the three buffers 50 mM

NaH2PO4, 25 mM sarkosyl, pH 7 (red). 50 mM NaH2PO4, 1% w/v Triton X-305, pH 7(blue), 50 mM Tris, 1% Triton X-100,

pH 7 (green), at 40°C (A) and 90°C (B). A| all three buffer have similar curves, when disregarding the differences in activity. B|

Here the two buffers with a cloud point above 100°C show a nice activity curve while signal is lost in the Triton X-100 buffer

due to light scatter. The flat lining of the signal is not due to reduced activity. But a result of extensive light scatter. The

difference in graph shape from 40°C to 90°C, is a result of increased reaction rate resulting in none linear substrate conversion

soon after measurement starts this can be postponed by increasing substrate concentrations from 0.5 mM used her to 1 mM. Here

the difference serves as a reminder of the different requirements for substrate concentrations and observation at low and high

temperatures.

Figure 1

Tables

Table 1. Basic information of Surfactans.

Name: Structur: Cloud point: Group:

Brij 58

>100°C Non-ionic

Sodium dodecyl sulfate (SDS)

>100°C Anionic

Sodium lauroyl sarcosinate (Sarkosyl)

Anionic

Triton X-100

66°C Non-ionic

Triton X-305 n=30

>100°C Non-ionic

Tween 20

76°C Non-ionic

Page 111: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

97

Supplementary figures

Figure legends

Figure S1 pH dependent auto-hydrolysis of p-Nitrophenyl hexanoate.1 mM pNPH in 50 mM

NaH2PO4, 25mM Sarkosyl, 10%v/v 2-propanol at indicated pH’s at 40°C.

Figure S2 pH dependent absorbance of p-Nitrophenol. 100 µM p-Nitrophenol in 50 mM

NaH2PO4, 25mM Sarkosyl, 10%v/v 2-propanol at indicated pH’s at 40°C

Figure S3 Temperature dependency of pKa, for H2PO4 (solid) and Tris (dashed).

Figure S4 Light scattering dependency on substrate and detergent. A| Triton X-305 B| Sarkosyl.

In 50 mM NaH2PO4 10% v/v 2-propanol at pH 7.

Figure S1

Figure S2

Page 112: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

98

Figure S3

Figure S4

Page 113: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

99

Appendix I

A – Introduction to Mutations Analyser

1 Disclaimer

This script is made to function with python 2.7 in a windows 10 64-bit system environment. The

script has not been tested on other systems to date, 2016-09-06. The Script has been tested by

myself and can be crashed, and I properly haven’t found all bugs. All credit for the FoldX utility

goes to the FoldX team, please visit http://foldxsuite.crg.eu/. The FoldX utility requires a license

and must not be freely distributed. The python code, as seen in Mutational_Analysis.py or in

appendix IB is written in its entirety by myself, Daniel Stiefler-Jensen.

All use of the python script is at your own risk.

IMPORTANT that temperature range of the empirical data that forms the basis for the FoldX

calculation is limited. As a result, data from temperatures above or below 35°C and 15°C

respectively, are at best rough estimations. The Standard deviation ΔΔG from Stability

calculations relative to experimental data is 0.46 Kcal/mol (Schymkowitz et al., 2005).

2 Purpose

This script was made to facilitate easy and fast analysis of multiple defined mutations on the

stability and integrity of complex through a temperature range or at a fixed temperatur. For

systematic studies of mutations AlaScan and Pssm command are recommended, see

http://foldxsuite.crg.eu/documentation#manual for more information on these commands and

additional not utilised by the script. The script makes use of the following command from

FoldX4: BuildModel, Stability, and AnalyseComplex.

The script has three parts, Generation of Mutated PDB file, calculating the ΔG of the new PDB

files, calculating the stability of dimer interaction of the new PDB files. The action is performed

by the FoldX program, the only ability of the script is an automation of combining the three

commands into a single action and thereby avoiding manual activation of each command.

The following section is a brief description of the three FoldX command and how they are set up

and works.

Page 114: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

100

BuildModel

The BuildModel command allow for modification of peptide chains in a PDB file, (*.pdb). In the

process the conformation of the new residues is optimised. Only a single PDB file can be

modified at a time.

e.g. a PDB file, EstA.pdb, contain 2 chains C & D of a homodimer hyperthermophilic esterase,

Accession: ADX84384. If one wishes to mutate valine267 of chain C to alanine the following

line must be in individual_list.txt: VC267A;. Where V is for Valine, C is for chain C, 267 is for

the for the residue number in the chain, A in for substitution to Alanine, and finally ‘;’ to end the

line. To mutate several residues in the same PDB file simple add “,” between the target

designation, e.g. “VC267A,VD267A” to mutate V267 to A267 in both chain C & D. Each line in

the individual_list.txt will generate a new mutant, i.e. multiple different mutants can be

generated from the same run of the script.

The BuildModel generates two files for each mutation set, one with WT_ prefix and one without.

The WT_ file for control, here the same residues that are mutated in the mutated file, the one

without the WT_ prefix, are mutated to the wildtype residues to eliminate any difference

introduced in the structure as a result of the process. ΔΔG analysis should be made between the

WT_ file and the mutant file, not the original PBD file listed in

\\Mutational_Analyser\PDB_BM.txt.

The results if the BuildModel command is a PDB filer for each mutation, they are place in the

following folder, \\Mutational_Analyser\PDB. Additional data files generated by the BuildModel

command can be found in \\Mutational_Analyser\Data_BM.

Stability

The Stability analyse command calculate that stability, ΔG, of the protein structure in the PDB

files listed in \\Mutational_Analyser\Data_BM\PdbList_New.fxout. For each protein structure in

this file the script will calculate the stability in a temperature range, defined by a start and stop

temperature and an ΔT. The individual data files can be found at

\\Mutation_Analyser_Kopi\Data_ST. The script also collects all the data into a single file at

\\Mutation_Analyser_Kopi\Results\Result_Stability.dat. This file is inspected with any text

editor. The results a tabulated and each line contains the results of the stability analysis of a

single structure. The first line of the file is a header identifying each value in the lines below. A

Page 115: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

101

brief description of each value can be found in the FoldX manual at

http://foldxsuite.crg.eu/documentation#manual. For general use only the three first values are of

interest: Temperature, Pdb, and Total energy.

The Temperature is the temperature of the analysis in Kelvin. Pdb contains the path to the PDB

subject to the stability analysis. The Total Energy in the ΔG of the structure in Kcal/mol.

Calculations of ΔΔGMut as a result of mutagenesis should be made between the Total Energy of

the mutant and the WT_ prefixed file of the mutant at the same temperature see Table 1. The

remaining values in the Result_Stability.dat is the individual components of the empirical force

field used for calculating the stability these includes Electrostatic interaction and Van der Waals

interaction and many more. For an example of a ΔΔGMut see Table 1.

Table 1 Results_Stability.dat example

Temperature [K]: Pdb Total Energy

[kcal/mol]

293.15 \\Mutation_Analyser_Kopi\PDB\EstA_R_VC267A_VD267A.pdb -100.376

293.15 \\Mutation_Analyser_Kopi\PDB\WT_EstA_R_VC267A_VD267A.pdb -107.133

Example of the first three lines of a Result_Stability.dat. ΔΔG calculation should be made between such a file

pair, WT_FILENAME.pdb and FILENAME.pdb. In this case ΔΔGMut=(-107.133 Kcal/mol)-(-100.376

Kcal/mol) = 6.757 Kcal/mol. The Data shown here is the different in stability of the mutation of Valine 267 in

both subunits of an hyperthermophilic esterase of S. islandicus calculated at 293.15°K or 20°C

AnalyseComplex

AnalyseComplex calculates the contribution of intermolecular interaction for the stability of the

complex, in Kcal/mol. For the script AnalyseComplex is treated in the same manner as the

Stability analysis. The data files are located in Analyses specific folder,

\\Mutation_Analyser_Kopi\Data_AC, and the Result file is named Result_AnalyseComplex.dat.

For more information about the folder structure see the Stability section.

The result file for data analysis is however different from that of the Stability analysis.

The results are tabulated and each line contains the results of the stability analysis of a single

structure. The first line of the file is a header identifying each value in the lines below. A brief

description of each value can be found in the FoldX manual at

http://foldxsuite.crg.eu/documentation#manual. For general use only the following values are of

intrest: Temperatur, Pdb, Interaction Energy. Value 1,2, and 7 in the tabulated data set. For an

Page 116: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

102

example of ΔΔGinter calculation see Table 2. The Script as only been optimised for dimer

analysis.

Table 2 Results_AnalyseComplex.dat data extract example

Temperature [K]: Pdb Interaction Energy

[kcal/mol]

293.15 \\Mutation_Analyser_Kopi\PDB\EstA_R_VC267A_VD267A.pdb -20.2628

293.15 \\Mutation_Analyser_Kopi\PDB\WT_EstA_R_VC267A_VD267A.pdb -23.9401

Example of of a data extract Result_AnalysisComplex.dat. ΔΔGinter calculation should be made between such a

file pair, WT_FILENAME.pdb and FILENAME.pdb. In this case ΔΔG=(-20.2628Kcal/mol)-(-23.9401 Kcal/mol)

= 3.6773 Kcal/mol. The Data shown here is the different in interaction energy of the mutation of Valine 267 in

both subunits of an hyperthermophilic esterase of S. islandicus calculated at 293.15°K or 20°C

RepairPDB

Before using a PDB file in combination with FoldX4 it is recemented to ensure that the structure is

optimised. FoldX4’s RepairPDB command do just that, optimising rotamer conformation and eliminating

Van der Waals clashes. The original pdb files must be repaired before using the script. pdb files generated

by the script through the BuilModel command can be repaired during the run, by pressing y to the

question is promted after selection the BuildModel command. Repairing pdb files takes a long time so if

you are generating a large number of mutant structures, expect long processing times.

Using the Mutation_Analyser

3 Setup before run

Installing FoldX4

Simply copy the following files to the Mutation_Analyser folder: foldx.exe, .DS_Store,

rotabase.txt

Files and paths

Before running the script, the following must files must be modified for your personal setup, all

can be found in \\Mutational_Analyser:

PDB_BM.txt

Page 117: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

103

Write the name of the PDB file to be mutated. The script can only handle 1 file at the

time, as a consequence this file shall only contain a single entry in the following

formale FILENAME.pdb

PDB_Dir_path.txt

Write the complete path to the folder containing the PDB file to be used for

BuildModel. The script can only handle 1 file at the time, as a consequence this file

shall only contain a single entry in the following formate: C:\PDB_folder

Individual_list

In this file write a new line for each modified structure following this syntax unit:

AnCnnnMn. Where An is the 1-letter abbreviation of the residue in the poly-peptide

chain, Cn is the Chain name where An is located, nn is the residue number of An in Cn,

Mn is the 1-letter abbreviation of the residue An in Cn shall be mutated to. Several

mutation can be introduced to the structure at the same time simply by adding

additional syntax units after the first each separated by “, ”. finally end the line with a

semi-colon, “;” the final result of a double substitution can look like this: A1C1n1M1,

A2C2n2M2;

Conditions

The script has been coded with standard values for the following FoldX settings:

Table 3 Standard setup for config files of

FoldX used by the script

Value Line number

BuildModel:

NumberOfRuns 1 165

Stability:

ionStrength [M] 0.1 149

pH 7 150

AnalyseComplex:

ionStrength [M] 0.1 159

pH 7 160

These values can be changes by modified the Mutant_Analyser.py in a text editor, see Table 3

for linenumbers. The temperature is defined when the script in initiated, see below.

Page 118: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

104

4 Running the script

Run the script by executing Mutant_Analyser.py. This can be done both from Window and the

command prompt.

Selection FoldX commands

The script will first ask you three yes/no question that must be answered by typing “y” or “n”

followed by hitting the enter key. These question ask what analysis should be run. Any

combination of BuildModel, Stability, AnalyseComplex, can be made. However, Stability and

AnalyseComplex requires that BuildModel has been run from the same folder at least once. Each

time one of the three FoldX commands are used the previously results from the command will be

overwritten.

Selecting temperature range

After selecting the wanted FoldX commands to be used, you will be asked if you want to use a

temperature gradient. By selecting yes the following standard temperature range for Stability and

AnalyseComplex analysis: 293.15°K to 373.15°K in 10°K intervals is suggested. Answer “y” or

“n” as before, Yes will start the script. answering No will prompt new question for you to setup a

temperature range. Start by inputting the new start temperature in kelvin, press enter and input

the new end temperature, press enter and enter the new temperature interval. The temperature

interval has to be number that results in an integer when the dividing the different between start

and end temperature with the interval. Saying no to a temperature grading will allow you to input

a single temperature for calculation. Depending of the mutations and to length of the temperature

range the execution time will vary, generally continual feedback will be given in the prompt. If

the script does not end with an overview message to “Press any key to continue to run overview”

an error has occurred and you have to troubleshoot. Troubleshooting can sometime be made

easier by executing the script from a command prompt.

Output/results

Once the FoldX commands you selected has been completed, you can inspect an overview of the

tasks performed and folder location by pressing any key. Exit the script by pressing any key a

final time.

The Calculations of ΔΔG must be made from the data provided by the script elsewhere.

Page 119: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

105

5 Known bugs and errors:

The file path the Mutation_analyser.py MUST NOT contain any BLANK spaces “ “ use _

instead.

Running the script with for the first time in a new folder without model builder option will

crash the script, warnings are in place.

6 References

Schymkowitz, J., Borg, J., Stricher, F., Nys, R., Rousseau, F., & Serrano, L. (2005). The FoldX

web server: An online force field. Nucleic Acids Research, 33(SUPPL. 2), 382–388.

http://doi.org/10.1093/nar/gki387

Page 120: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

106

B – Mutations Analyser python code

1 import subprocess

2 import os

3 import glob

4 import shutil

5 import sys

6 #---------

7 T=293.15

8 T_max=373.15

9 T_int=10

10 #---------

11

12 P=os.path.realpath(__file__)#path to the script file

13 D=os.path.dirname(P)#path to the folder of the script

14 Modes=['x','x','x','x'] #list defination for the set up function to return to

15 RP=False

16 def intro():#A simple itro message to be displayed

17 print('\n----------------------------------')

18 print('This script is for automatization of mutagenesis and analysis of the

stability of the protein fold and dimer interface through a temperature range.')

19 def setup():#configuration of the FoldX command to be used

20 global RP

21 global RP_C

22 if os.path.exists(D+'\Results'):#resers Result folder.

23 shutil.rmtree(D+'\Results')

24 print('\n----- Analysis Selection -----')

25 while True:

26 ask=raw_input('\nShall we run BuildModel ?, press \'y\' for yes and \'n\'

for no: \n')

27 if ask=='Y' or ask=='y':

28 Modes[0]='BM'

29 if os.path.exists(D+'\Data_BM'):

30 shutil.rmtree(D+'\Data_BM')#resets DATA files

31 if os.path.exists(D+'\PDB'):

32 shutil.rmtree(D+'\PDB')#reset structures

33 while True:

34 ask2=raw_input('\n do you wan to repair the generated pdb files after

mutagenesis? This can take a long time but gives the best results. press \'y\' for

yes and \'n\' for no: \n')

35 if ask2=='Y' or ask2=='y':

36 Modes[3]='RP'

37 break

38 elif ask2=='N' or ask2=='n':

39 Modes[3]='x'

40 break

41 else:

42 print('\n please use \'y\' or \'n\'')

43 break

44 elif ask=='N' or ask=='n':

45 Modes[0]='x'

46 if not os.path.exists(D+'\Data_BM\PdbList_.fxout'):#checks for and if

needed creats folder for BuildModel data

47 print('\n\n-----IMPORTANT-----\nif BuildModel has not been run before

in this folder, Stability and AnalysComplex will not work.\nStop scrpit by

\'Ctrl+C\'\n')

48 ask=raw_input('\nDo you still want to skip Buildt Model, this may

crash the script. press \'y\' for yes and \'n\' for no: \n')

49 if ask=='Y' or ask=='y':

50 Modes[0]='x'

51 break

52 elif ask=='N' or ask=='n':

53 Modes[0]='BM'

Page 121: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

107

54 break

55 else:

56 break

57 else:

58 print('\nPlease use \'y\' or \'n\'\n')

59

60 while True:

61 ask=raw_input('\nShall we run Stability ?, press \'y\' for yes and \'n\' for

no: \n')

62 if ask=='Y' or ask=='y':

63 Modes[1]='ST'# only usable if BM has already been run once

64 if os.path.exists(D+'\Data_ST'):

65 shutil.rmtree(D+'\Data_ST')#resets data files

66 break

67 elif ask=='N' or ask=='n':

68 Modes[1]='x'

69 break

70 else:

71 print('\nPlease use \'y\' or \'n\'.\n')

72 while True:

73 ask=raw_input('\nShall we run AnalysisComplex ?, press \'y\' for yes and

\'n\' for no: \n')

74 if ask=='Y' or ask=='y':

75 Modes[2]='AC'# only usable if BM has already been run once

76 if os.path.exists(D+'\Data_AC'):

77 shutil.rmtree(D+'\Data_AC')#resets data files

78 break

79 elif ask=='N' or ask=='n':

80 Modes[2]='x'

81 break

82 else:

83 print('\nPlease use \'y\' or \'n\'.\n')

84 if Modes[1]=='ST' or Modes[2]=='AC':

85 if os.path.exists(D+'\PDB_RP\Repaired_pdb.txt')==True or

Modes[3]=='RP':#checks Results folder and creates if necessary

86 while True:

87 ask=raw_input('\n Do you want to use repaired PDB files for

calculating?\n')

88 if ask=='Y' or ask=='y':

89 RP=True

90 break

91 elif ask=='N' or ask=='n':

92 RP=False

93 break

94 else:

95 print('Please use \'y\' or \'n\'.\n')

96 def Temp(T,T_max,T_int):# configuration of the temperature range used i Stability

and AnalysisComplex calculations

97 TS= 0

98 if Modes[1]=='ST' or Modes[2]=='AC':

99 print('\n\n----- Temperature Setup -----')

100 while True:

101 ask=raw_input('\nUse temperature gradient ? press \'y\' for yes and \'n\'

for no: \n')

102 if ask=='N' or ask=='n':

103 while True:

104 try:

105 TS=float(raw_input('\n At what temperature ? temperatures

between 288.15 and 308.15, are sugested ? use \'.\' as decimal separator:'))

106 if TS>0:

107 break

108 else:

109 print('\nTemperature must be positive number.\n')

110 except ValueError:

111 print('\npleas type in a number.\n')

Page 122: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

108

112 break

113 if ask=='Y' or ask=='y':

114 while True:

115 ask=raw_input('\nUse standart Temperature setting, %s K to %s K in

%s K intervals ? press \'y\' for yes and \'n\' for no: \n' %(T,T_max,T_int))

116 if ask=='Y' or ask=='y':

117 break

118 if ask=='N' or ask=='n':

119 while True:# input start temperature

120 try:

121 T=float(raw_input('\nNew Start\(low\) temp: \n'))

122 if T>0:

123 break

124 else:

125 print('\nTemperature must be positive.\n')

126 except ValueError:

127 print('\npleas type in a number.\n')

128 while True:# input end temperatur

129 try:

130 T_max=float(raw_input('\nNew End(High) temperature: \n'))

131 if T_max>T:

132 break

133 else:

134 print('\nThe number must be higher that %s.\n' %(T))

135 except ValueError:

136 print('\nplease type in a number, use \'.\' as decimal

seperator: \n')

137 while True:# input temperature interval

138 try:

139 T_int=float(raw_input('\nInput new temperature interval:

\n'))

140 test=str.split(str((T_max-T)/T_int),'.')

141 if test[1]=='0' and int(test[0])>0:

142 break

143 if int(test[0])<=0:

144 print('\nThe interval must be a positive number.\n')

145 if test[1]!='0':

146 print('\nThe interval dose not fit in between the start

and stop values as a integer.\n')

147 except ValueError:

148 print('\nThe temperature interval is dose not fit between

start and end values\n')

149 except ZeroDivisionError:

150 print('\nThe temperature intervals can\'t be zero.\n')

151 while True:# confirm new temperature gradient

152 print('\n------\n')

153 con=raw_input('\nyou picked Start: %s K End: %s K interval: %s

K.\n\tPlease confirm by \'y\' or \'n\': \n' %(T,T_max,T_int))

154 if con=='y' or con=='Y':

155 break

156 if con=='n' or con=='N':

157 Temp(T,T_max,T_int)

158 break

159 else:

160 print('\n\n------\n')

161 print('\nplease use \'y\' or \'n\'\n')

162 print('\n\n-----\nTemperatures for use is: %s K, %s K, %s

K.\n' %(T,T_max,T_int))

163 break

164 break

165

166 return T,T_max,T_int,TS

167 def PDB_Path():#readig path of PDB folder

168 Dir_path=open('PDB_Dir_path.txt','r+')

169 for line in Dir_path:

Page 123: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

109

170 PDBDIR=line

171 return PDBDIR

172 def Files(FILE):#Reading file names of PDB from BuildModel

173 size=file_len(FILE)

174 list=[]

175 i=0

176 while i < size:

177 list.append('x')

178 i=i+1

179 l=open(FILE,'r+')

180 list=[line.split('.') for line in l]

181 return list

182 def Config_ST(T):#Config file for stability analysis

183 config=open('config.cfg','w')

184 config.write('command=Stability\n')

185 if RP==False:

186 config.write('pdb-list='+D+'\Data_BM\PdbList_New.fxout\n')

187 config.write('pdb-dir='+D+'\PDB\n')

188 elif RP==True:

189 config.write('pdb-list='+D+'\PDB_RP\Repaired_PDB.txt\n')

190 config.write('pdb-dir='+D+'\PDB_RP\n')

191 config.write('temperature=%s\n' %(T))

192 config.write('ionStrength=0.1\n')

193 config.write('pH=7')

194 config.close()

195 def Config_AC(T):#Config file for AnalysisComplex analysis

196 config=open('config.cfg','w')

197 config.write('command=AnalyseComplex\n')

198 if RP==False:

199 config.write('pdb-list='+D+'\Data_BM\PdbList_New.fxout\n')

200 config.write('pdb-dir='+D+'\PDB\n')

201 elif RP==True:

202 config.write('pdb-list='+D+'\PDB_RP\Repaired_PDB.txt\n')

203 config.write('pdb-dir='+D+'\PDB_RP\n')

204 config.write('temperature=%s\n' %(T))

205 config.write('ionStrength=0.1\n')

206 config.write('pH=7')

207 config.close()

208 def Config_BM():#Config file for BuildModel, generaion of PDB file of mutants

209 config=open('config.cfg','w')

210 config.write('command=BuildModel\n')

211 config.write('numberOfRuns=1\n')

212 config.write('mutant-file=individual_list.txt\n')#add Mutations to this file

213 config.write('pdb-list=PDB_BM.txt\n')#add PDB files to be mutated here

214 config.write('pdb-dir='+PDBDIR+'\n')

215 config.close()

216 def Config_RP(File_RP):

217 config=open('config.cfg','w')

218 config.write('command=RepairPDB\n')

219 config.write('pdb='+File_RP+'.pdb\n')#add PDB files to be mutated here

220 config.write('pdb-dir='+D+'\PDB\n')

221 config.close()

222 def file_len(fname):

223 with open(fname) as f:

224 for i, l in enumerate(f):

225 pass

226 return i + 1

227 def Collect_ST(T):#Collection of Data files

228 FILENAME='output_'+T+'K_ST.dat'

229 out=open(FILENAME,'w')

230 out.write('Temperatur Pdb Total Energy Backbone Hbond

Sidechain Hbond Van der Waals Electrostatics Solvation Polar

Solvation Hydrophobic Van der Waals clashes Entropy Side Chain

Entropy Main Chain Sloop Entropy Mloop Entropy Cis Bond Torsional

Clash Backbone Clash Helix Dipole Water Bridge Disulfide Electrostatic

Page 124: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

110

Kon Partial Covalent Bonds Energy Ionisation Entropy Complex

Residue Number\n')

231 i=0

232 if RP==True:

233 D1='\PDB_RP\Repaired_PDB.txt'

234 elif RP==False:

235 D1='\Data_BM\PdbList_New.fxout'

236 File_names=Files(D+D1)

237 while i<file_len(D+D1):

238 File=File_names[i][0]+'_0_ST.fxout'

239 f=open(File, 'r+')

240 for line in f:

241 out.write('%s\t' %(T))

242 out.write(line)

243 f.close()

244 i=i+1

245 out.close()

246 def Collect_AC(T):#Collection of data files

247 FILENAME='output_'+T+'K_AC.dat'

248 out=open(FILENAME,'w')

249 out.write('Temperature Pdb Group1 Group2 IntraclashesGroup1

IntraclashesGroup2 Interaction Energy StabilityGroup1

StabilityGroup2\n')

250 i=0

251 if RP==True:

252 D1='\PDB_RP\Repaired_PDB.txt'

253 elif RP==False:

254 D1='\Data_BM\PdbList_New.fxout'

255 File_names=Files(D+D1)

256 while i<file_len(D+D1):

257 File='Summary_'+File_names[i][0]+'_AC.fxout'

258 f=open(File, 'r+')

259 LineNr = 10

260 o=1

261 for line in f:

262 if o==LineNr:

263 out.write('%s\t'%(T))

264 out.write(line)

265 o=o+1

266 f.close()

267

268 i=i+1

269 out.close()

270 def Collect_BM():#Collection of data files

271 if not os.path.exists(D+'\Data_BM'):#checks for and if needed creats folder

for BuildModel data

272 os.makedirs(D+'\Data_BM')

273 print('Folder for Stability results deposit created at:\n'+D+'\Data_BM\n')

274

275 if not os.path.exists(D+'\PDB'):#checks for and if needed creats folder for

PDB files one level down

276 os.makedirs(D+'\PDB')

277 print('Folder for Stability results deposit created at:\n'+D+'\Data_BM\n')

278

279

280 for file in glob.glob(D+'\*.fxout'):#Move data files from BuildModel to

...\Data_BM

281 F=os.path.split(file)

282 shutil.move(file,D+'\\Data_BM\\'+F[1])

283

284 for file in glob.glob(D+'\*.pdb'):#Move PDB files to PDB folder one level

down.

285 F=os.path.split(file)

286 shutil.move(file,D+'\\PDB\\'+F[1])

287

Page 125: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

111

288

289 File2=open(D+'\Data_BM\PdbList_New.fxout','w')

290

291 File3=open(D+'\PDB_BM.txt')

292 for line in File3:

293 Prefix=line

294 File3.close()

295 Prefix=Prefix.split('.pdb')

296 Prefix=Prefix[0]

297 Mut=1

298 c=0

299 for file_name in glob.glob(D+'\PDB\*.pdb'):

300 c=c+1

301 cc=1

302 while cc<=c/2:

303 Mut=cc

304 for file_name in glob.glob(D+'\PDB\*.pdb'):

305 F=os.path.split(file_name)#tumple

306 F1=str.split(F[1],str(Mut)+'.pdb')

307 F0=str(F[1])

308 F0=F0.split(Prefix+'_')

309 F0=F0[1].split('.pdb')

310 try:

311 int(F0[0])

312 Test=True

313 if int(F0[0])==Mut:

314 Test2=True

315 else:

316 Test2=False

317 except ValueError:

318 Test=False

319 if F0[0]!=str('') and Test2 == True and Test == True:

320 F0=int(F0[0])

321 LN=1

322 File=open(D+'\individual_list.txt','r+')

323 for line in File:

324 if LN == Mut:

325 New_name=line.replace(',','_')

326 New_name=New_name.replace(';','')

327 New_name=New_name.replace('\n','')

328 shutil.move(file_name,D+'\\PDB\\'+F1[0]+New_name+'.pdb')

329 File2.write(F1[0]+New_name+'.pdb\n')

330 line = None

331 File.close()

332 break

333 LN=LN+1

334 cc=cc+1

335 def Collect_RP():

336 if not os.path.exists(D+'\Data_BM_RP'):#checks for and if needed creats folder

for BuildModel data

337 os.makedirs(D+'\Data_BM_RP')

338 print('Folder for Stability results deposit created

at:\n'+D+'\Data_BM_RP\n')

339

340 if not os.path.exists(D+'\PDB_RP'):#checks for and if needed creats folder for

PDB files one level down

341 os.makedirs(D+'\PDB_RP')

342 print('Folder for Stability results deposit created

at:\n'+D+'\Data_BM_RP\n')

343

344

345 for file in glob.glob(D+'\*Repair.fxout'):#Move data files from BuildModel to

...\Data_BM

346 F=os.path.split(file)

347 shutil.move(file,D+'\\Data_BM_RP\\'+F[1])

Page 126: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

112

348

349 for file in glob.glob(D+'\*Repair.pdb'):#Move PDB files to PDB folder one

level down.

350 F=os.path.split(file)

351 shutil.move(file,D+'\\PDB_RP\\'+F[1])

352 #Generate list of new PDB files

353 out=open('Repaired_PDB.txt','w')

354 for file in glob.glob(D+'\PDB_RP\*Repair.pdb'):

355 F=os.path.split(file)

356 out.write(F[1]+'\n')

357 out.close()

358 shutil.move('Repaired_PDB.txt',D+'\\PDB_RP')

359 def Collect_Result_TS(TS,Mode):

360 if Mode=='ST':

361 out=open('Result_Stability.dat','w')

362 out.write('Temperatur Pdb Total Energy Backbone Hbond

Sidechain Hbond Van der Waals Electrostatics Solvation Polar

Solvation Hydrophobic Van der Waals clashes Entropy Side Chain

Entropy Main Chain Sloop Entropy Mloop Entropy Cis Bond Torsional

Clash Backbone Clash Helix Dipole Water Bridge Disulfide Electrostatic

Kon Partial Covalent Bonds Energy Ionisation Entropy Complex

Residue Number\n')

363 File='output_'+str(TS)+'K_ST.dat'

364 f=open(File, 'r+')

365 i=1

366 for line in f:

367 if i > 1 and i < file_len(File)+1:

368 out.write(line)

369 i=i+1

370 f.close()

371 if Mode=='AC':

372 out=open('Result_AnalyseComplex.dat','w')

373 out.write('Temperatur Pdb Group1 Group2 IntraclashesGroup1

IntraclashesGroup2 Interaction Energy StabilityGroup1

StabilityGroup2\n')

374 File='output_'+str(TS)+'K_AC.dat'

375 f=open(File, 'r+')

376 i=1

377 for line in f:

378 if i > 1 and i < file_len(File)+1:

379 out.write(line)

380 i=i+1

381 f.close()

382 def Collect_Result(t,t_max,T_int,Mode):# Mode ST=stability AC=AnalyseComplex Data

collection and generation of Result files

383 if Mode=='ST':

384 out=open('Result_Stability.dat','w')

385 out.write('Temperatur Pdb Total Energy Backbone Hbond

Sidechain Hbond Van der Waals Electrostatics Solvation Polar

Solvation Hydrophobic Van der Waals clashes Entropy Side Chain

Entropy Main Chain Sloop Entropy Mloop Entropy Cis Bond Torsional

Clash Backbone Clash Helix Dipole Water Bridge Disulfide Electrostatic

Kon Partial Covalent Bonds Energy Ionisation Entropy Complex

Residue Number\n')

386 while t<t_max+1:

387 File='output_'+str(t)+'K_ST.dat'

388 f=open(File, 'r+')

389 i=1

390 for line in f:

391 if i > 1 and i < file_len(File)+1:

392 out.write(line)

393 i=i+1

394 f.close()

395 t=t+T_int

396

Page 127: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

113

397 if Mode=='AC':

398 out=open('Result_AnalyseComplex.dat','w')

399 out.write('Temperatur Pdb Group1 Group2 IntraclashesGroup1

IntraclashesGroup2 Interaction Energy StabilityGroup1

StabilityGroup2\n')

400 while t<t_max+1:

401 File='output_'+str(t)+'K_AC.dat'

402 f=open(File, 'r+')

403 i=1

404 for line in f:

405 if i > 1 and i < file_len(File)+1:

406 out.write(line)

407 i=i+1

408 f.close()

409 t=t+T_int

410 def foldx(t,t_max,t_int,Mode):# Mode ST=stability AC=AnalyseComplex BM=BuildModel

411 if Mode =='BM':

412 Config_BM()

413 cmd='foldx.exe -f config.cfg'

414 subprocess.call(cmd, shell=True)

415 subprocess.call('exit 1', shell=True)

416 Collect_BM()

417 if Modes[3]=='RP':

418 File_names=Files(D+'\Data_BM\PdbList_New.fxout')

419 i=0

420 while i < len(File_names):

421 Config_RP(File_names[i][0])

422 cmd='foldx.exe -f config.cfg'

423 subprocess.call(cmd, shell=True)

424 subprocess.call('exit 1', shell=True)

425 i=i+1

426 Collect_RP()

427

428

429 if TS!=0:

430 if Mode =='ST':

431 Config_ST(TS)

432

433 if Mode =='AC':

434 Config_AC(TS)

435 cmd='foldx.exe -f config.cfg'

436 subprocess.call(cmd, shell=True)

437 subprocess.call('exit 1', shell=True)

438 if Mode =='ST':

439 Collect_ST(str(TS))

440 if not os.path.exists(D+'\Data_ST'):#checks for and if needed creats

folder to ST data files

441 os.makedirs(D+'\Data_ST')

442 print('Folder for Stability result deposit created

at:\n'+D+'\Results\n')

443 for file in glob.glob(D+'\*ST.fxout'):#Move files from Stability to data

folder and overwrite files in folder with same name, no promt.

444 F=os.path.split(file)

445 shutil.move(file,D+'\\Data_ST\\'+str(TS)+'K'+F[1])

446 if Mode =='AC':

447 Collect_AC(str(TS))

448 if not os.path.exists(D+'\Data_AC'):#checks for and if needed creats

folder to AC data files

449 os.makedirs(D+'\Data_AC')

450 print('Folder for AnalyseComplex result deposit created

at:\n'+D+'\Results\n')

451 for file in glob.glob(D+'\*AC.fxout'):#Move files from AnalysComplex to

data folder and overwrite files in folder with same name, no promt.

452 F=os.path.split(file)

453 shutil.move(file,D+'\\Data_AC\\'+str(TS)+'K'+F[1])

Page 128: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

114

454 # T=str(t)

455 else:

456 if t!=t_max:

457 x=(t_max-t)/t_int

458 else:

459 x=1

460 i=0

461 while i <= x:

462 if Mode =='ST':

463 Config_ST(t+(t_int*i))

464 if Mode =='AC':

465 Config_AC(t+(t_int*i))

466 if Mode =='ST' or Mode =='AC':

467 cmd='foldx.exe -f config.cfg'

468 subprocess.call(cmd, shell=True)

469 subprocess.call('exit 1', shell=True)

470 T=str(t)

471

472 if Mode=='ST':

473 Collect_ST(str(t+i*T_int))

474 if not os.path.exists(D+'\Data_ST'):#checks for and if needed creats

folder to ST data files

475 os.makedirs(D+'\Data_ST')

476 print('Folder for Stability result deposit created

at:\n'+D+'\Results\n')

477

478

479 for file in glob.glob(D+'\*ST.fxout'):#Move files from Stability to

data folder and overwrite files in folder with same name, no promt.

480 F=os.path.split(file)

481 shutil.move(file,D+'\\Data_ST\\'+str(t+i*T_int)+'K'+F[1])

482

483 if Mode=='AC':

484 Collect_AC(str(t+i*T_int))

485 if not os.path.exists(D+'\Data_AC'):#checks for and if needed creats

folder to AC data files

486 os.makedirs(D+'\Data_AC')

487 print('Folder for AnalyseComplex result deposit created

at:\n'+D+'\Results\n')

488

489 for file in glob.glob(D+'\*AC.fxout'):#Move files from AnalysComplex

to data folder and overwrite files in folder with same name, no promt.

490 F=os.path.split(file)

491 shutil.move(file,D+'\\Data_AC\\'+str(t+i*T_int)+'K'+F[1])

492 i=i+1

493 else:

494 break

495 def Cleanup():#Moving filen in to folder

496 print('\n\n----------------------------------\n')

497 print('Tasks preformed:')

498 if Modes[0]=='BM':

499 print('New PDB build with the following Mutation:')

500 F=open('PDB_BM.txt','r+')

501 for line in F:

502 SPLIT = str.split(line,'\n')

503 print(SPLIT[0]+':')

504 F1=open('individual_list.txt')

505 for lines in F1:

506 SPLIT1 = str.split(lines,'\n')

507 print('\t'+SPLIT1[0])

508 F1.close()

509 F.close()

510 print('')

511 if Modes[1]=='ST':

512 print('Stability analysis preformed of the following PDB files:')

Page 129: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

115

513 F=open(D+'\Data_BM\PdbList_New.fxout','r+')

514 for line in F:

515 SPLIT = str.split(line,'\n')

516 print('\t'+SPLIT[0])

517 F.close()

518 print('')

519 if Modes[2]=='AC':

520 print('AnalyseComplex analysis preformed of the following PDB files:')

521 F=open(D+'\Data_BM\PdbList_New.fxout','r+')

522 for line in F:

523 SPLIT = str.split(line,'\n')

524 print('\t'+SPLIT[0])

525 F.close()

526 print('')

527

528 print('---- File Information ----')

529 if Modes[1]=='ST' or Modes[2]=='AC':

530 if not os.path.exists(D+'\Results'):#checks Results folder and creats if

nessesary

531 os.makedirs(D+'\Results')

532 print('Folder for result deposit created at:\n'+D+'\Results\n')

533 if Modes[0]=='BM':

534 print('PDB files of Mutants are plased in the following

folder:\n\t'+D+'\PDB\n')

535 if Modes[1]=='ST':

536 if os.path.exists(D+'\Result_Stability.dat'):#checks for and if present move

Result_stability.dat

537 shutil.move(D+'\Result_Stability.dat',D+'\Results\Result_Stability.dat')

538 print('The result from Stability analysis are available in the following

folder as Result_Stability.dat:\n\t'+D+'\Results\n')

539 else:

540 print('No Result file for Stability anlysis\n')

541

542 if Modes[2]=='AC':

543 if os.path.exists(D+'\Result_AnalyseComplex.dat'):#checks for and if present

move Result_AnalyseComplex.dat

544 shutil.move(D+'\Result_AnalyseComplex.dat',D+'\Results\

Result_AnalyseComplex.dat')

545 print('The result from AnalyseComplex analysis are available in the

following folder as Result_AnalyseComplex.dat:\n\t'+D+'\Results\n')

546 else:

547 print('No Result file for AnalyseComplex anlysis\n')

548

549

550 for file in glob.glob(D+'\*.dat'):#remove temp files,*.dat

551 os.remove(file)

552 for file in glob.glob(D+'\*.cfg'):#remove temp files, *.cfg

553 os.remove(file)

554 for file in glob.glob(D+'\*.pdb'):#remove temp files, *.cfg

555 os.remove(file)

556 for file in glob.glob(D+'\*.fxout'):#remove temp files, *.cfg

557 os.remove(file)

558 def Credits():#Credits to the FoldX team

559 print(' _______________________________________ ')

560 print('| |')

561 print('| FoldX 4 (c) |')

562 print('| |')

563 print('| code by the FoldX Consortium |')

564 print('| |')

565 print('| Jesper Borg, Frederic Rousseau |')

566 print('| Joost Schymkowitz, Luis Serrano |')

567 print('| Peter Vanhee, Erik Verschueren |')

568 print('| Lien Beaten, Jevier Delgado |')

569 print('| And Francois Stricher |')

570 print('| And any other if the 9! permutations |')

Page 130: PhD thesis - ku Stiefler-Jensen.pdf · PhD thesis Daniel Stiefler-Jensen ... Type: Synopsis . iii Preface This thesis in the conclusion of my PhD. Study, started March 2012, at the

116

571 print('| |')

572 print('| Based on an orginal |')

573 print('| |')

574 print('| concept by |')

575 print('| Raphael Guerois and Luis Serrano |')

576 print('|_______________________________________|')

577 raw_input('\n\n----- Press any key to continue to run overview. -----')

578

579 PDBDIR=PDB_Path()

580 intro()

581 setup()

582 T,T_max,T_int,TS = Temp(T,T_max,T_int)

583 i=0

584 if TS!=0:

585 while i< len(Modes):

586 foldx(T,T_max,T_int,Modes[i])

587 Collect_Result_TS(TS,Modes[i])

588 i=i+1

589 else:

590 while i< len(Modes):#Execution of the selected commands in FoldX

591 foldx(T,T_max,T_int,Modes[i])

592 Collect_Result(T,T_max,T_int,Modes[i])

593 i=i+1

594 Credits()

595 Cleanup()

596

597 print('\n----------------------------------\n')

598 print('Script by Daniel Stiefler-Jensen, e-mail: [email protected]\n')

599 Quit=raw_input('----- Press any key to Exit -----')