Skip to content

Commit c2d6053

Browse files
committed
Update simulation tutorial
1 parent 77cbed6 commit c2d6053

File tree

1 file changed

+26
-12
lines changed

1 file changed

+26
-12
lines changed

education/molmod_online/simulation.md

Lines changed: 26 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,7 @@ By the end of this tutorial, you should know the steps involved in setting up, r
2020
## Use of virtual machines (VMs)
2121

2222
For this module of the practical, we will be using [**NMR**box](https://nmrbox.org){:target="_blank"}.
23-
NMRbox offers cloud-based virtual machines for executing various biomolecular software that can complement NMR (Nuclear Magnetic Resonance) experiements.
23+
NMRbox offers cloud-based virtual machines for executing various biomolecular software that can complement NMR (Nuclear Magnetic Resonance) experiments.
2424
NMRbox users can choose from 261 already installed software packages, that focus on various research topics such as metabolomics, molecular dynamics, structure, intrinsically disordered proteins or binding.
2525
One can search through all available packages on [https://nmrbox.org/software](https://nmrbox.org/software){:target="_blank"}.
2626

@@ -298,10 +298,10 @@ that have been parameterized to reproduce the properties of biological molecules
298298
first day of molecular dynamics simulations. There are several literature reviews available in
299299
PubMed that assess the quality and appropriateness of each force field and their several versions.
300300
Some are well-known for their artifacts, such as a biased propensity for alpha-helical conformations.
301-
Here, in this tutorial, we will use the AMBER99SB-ILDN force field, which is widely used
302-
in sampling and folding simulations and has been shown to reproduce fairly well experimental data
303-
([source](https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0032131){:target="_blank"}).
304-
Another, more practical, reason behind this choice is the availability of this force field in GROMACS.
301+
Here, in this tutorial, we will use the *CHARMM36m* force field, which was specifically
302+
optimized to improve backbone conformational sampling and is particularly well suited
303+
for folded and intrinsically disordered proteins. CHARMM36m has been shown to reproduce fairly well experimental data
304+
([source](https://doi.org/10.1038/nmeth.4067){:target="_blank"}).
305305

306306
Since the simulation takes place in a solvated environment, i.e. a box of water molecules, we have
307307
also to choose an appropriate solvent model. The model is simply an addition to the force field
@@ -310,9 +310,8 @@ such as density and freezing and vaporization temperatures. As such, particular
310310
to be tied to specific force fields. Due to the difficulties of reproducing the properties of water
311311
computationally - yes, even for such a simple molecule! - some models represent water with more
312312
than 3 atoms, using additional pseudo-particles to improve characteristics such as its electrostatic distribution.
313-
The water model suggested to use with the AMBER force field, and which
314-
we will use in this simulation, is the *TIP3P* model (for Transferable Interaction Potential with 3
315-
Points).
313+
The water model suggested to use with the CHARMM36m force field, and which
314+
we will use in this simulation, is the *TIP3P* model, specifically the CHARMM-modified TIP3P water model.
316315

317316
This choice is usually limited by the force field, unless there is a specific need for a particular solvent model.
318317

@@ -349,6 +348,21 @@ capped by an additional chemical group (e.g. N-terminal acetyl and C-terminal am
349348
important since leaving the termini charged (default) can lead to artificial charge-charge
350349
interactions, particular in small molecules. If a peptide is part of a larger structure, then it
351350
makes sense to cap the termini in order to neutralize their charge, as it would happen in reality.
351+
Terminal capping should be performed prior to topology generation using the `pdb_cap.py` script.
352+
This script replaces the first residue with an ACE cap and the last residue with an NME cap
353+
by modifying atom and residue names in the PDB file, making them compatible with the CHARMM36m force field.
354+
For capping to work correctly, the input structure must include one additional residue at both the N- and C-termini
355+
(i.e. residues *−1* and *N+1* relative to the peptide of interest).
356+
These residues act as placeholders and will be converted into caps. In practice, we add two glycine residues,
357+
one at each end of the peptide sequence, before capping.
358+
Capping is performed with:
359+
360+
<a class="prompt prompt-cmd">
361+
python pdb_cap.py --pdb peptide_helix.pdb --cap
362+
</a>
363+
364+
The script produces a new file named peptide_helix_capped.pdb, which should then be used as input for pdb2gmx.
365+
Once capped, pdb2gmx will recognize the ACE and NME residues automatically when using the CHARMM36m force field.
352366
Read through the output of `pdb2gmx` and check the choices the program made for histidine
353367
protonation states and the resulting charge of the peptide.
354368

@@ -374,7 +388,7 @@ in internal parameter libraries that are defined at the very top of the topology
374388

375389
<pre style="background-color:#DAE4E7;padding:15px">
376390
; Include forcefield parameters
377-
#include "amber99sb-ildn.ff/forcefield.itp"
391+
#include "charmm36-jul2022.ff/forcefield.itp"
378392

379393
[ moleculetype ]
380394
; Name nrexcl
@@ -400,7 +414,7 @@ Protein 3
400414

401415
<a class="prompt prompt-info">
402416
Look at the partial charge that each atom carries (column 7) and note the differences between different types of atom.
403-
<a>
417+
</a>
404418

405419
<a class="prompt prompt-question">
406420
SER is in principle a neutral amino acid within a protein sequence. Can you rationalize why in this case the sum of the charges add up to +1?
@@ -779,7 +793,7 @@ particles, the pressure is kept constant by varying the volume of the simulation
779793
gmx grompp -v -f $MOLMOD_DATA/mdp/04_npt_pr_PME.mdp -c peptide-NVT-PR1000.gro -r peptide-NVT-PR1000.gro -p peptide.top -o peptide-NPT-PR1000.tpr
780794
</a>
781795
<a class="prompt prompt-question">
782-
Were you able to succesfully execute the previous command? If not, read the error message carefully.
796+
Were you able to successfully execute the previous command? If not, read the error message carefully.
783797
</a>
784798

785799
Inside `04_npt_pr_PME.mdp` we define the Berendsen barostat to be used, although this weak-coupling algorithm is not rigorously compatible with a full isothermal-isobaric (NPT) ensemble.
@@ -894,7 +908,7 @@ calculations:
894908
The simulation will run for 50 nanoseconds, which is sufficient to derive some insights on the
895909
conformational dynamics of such a small peptide. Bear in mind that a proper simulation to fully and
896910
exhaustively sample the entire landscape should last much longer, and probably make use of more
897-
advance molecular dynamics protocols such as replica exchange. In this case, since several students
911+
advanced molecular dynamics protocols such as replica exchange. In this case, since several students
898912
are expected to work on the same peptide, using different random seeds and starting from different
899913
initial conformations, we assume that individual simulations of 50 nanoseconds are informative
900914
enough.

0 commit comments

Comments
 (0)