2020 DOCK tutorial 1 with PDBID 3VJK

From Rizzo_Lab
Revision as of 19:32, 14 February 2020 by Stonybrook (talk | contribs) (Protein)


Welcome to the Rizzo lab!

This tutorial is provided by students at Stony Brook to help the community better understand the DOCK toolset.

Software packages

To follow this tutorial you will need to have the following programs installed:


At several points, this tutorial refers to these programs as commands run in a shell environment. The students who wrote it ran the programs on a UNIX (CoreOS or Ubuntu) server, although the process should generalize to your own setup. For help, please consult the documentation available for each program.

<<Where can outsiders find scripts like sphgen?>> UCLA website? rizzo lab website? There seem to be several sources on google


Object preparation

Preparing the Structure for Docking

Downloading and Opening PDB File

Download the PDB Format file from the associated RCSB page here.

Select: Download files -> PDB Format
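If you prefer to fetch the file from a script rather than through the browser, the following is a minimal sketch. It assumes the RCSB file server URL pattern `https://files.rcsb.org/download/<ID>.pdb`; the function names are our own and are not part of any DOCK or RCSB tool.

```python
from typing import Optional
from urllib.request import urlretrieve


def rcsb_pdb_url(pdb_id: str) -> str:
    """Build the assumed RCSB download URL for a PDB-format entry."""
    return "https://files.rcsb.org/download/{}.pdb".format(pdb_id.upper())


def download_pdb(pdb_id: str, dest: Optional[str] = None) -> str:
    """Download the entry and return the local file name (needs network access)."""
    dest = dest or "{}.pdb".format(pdb_id.upper())
    urlretrieve(rcsb_pdb_url(pdb_id), dest)
    return dest


# Example call (not executed here because it requires a network connection):
#   download_pdb("3vjk")   # saves 3VJK.pdb in the current directory
```

The resulting file should be the same one you get from Download files -> PDB Format.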

This file provides the 3D coordinates of the atoms within the protein and ligand as well as any co-factors (any other molecules present during the crystallization experiment, typically water and metal ions). The file can be opened and manipulated in the program Chimera.

Open Chimera.

Select: File -> Open -> (Location where you downloaded PDB file)

Dimer 3000.png

The protein should look like the image above. You can rotate the view to see it from different angles. This representation is called a ribbon diagram and shows the backbone of the protein, although some amino acid side chains are also shown by default. Also shown explicitly are the NAG amino acid modifications, the oxygen atoms of several water molecules, and M51 (the ligand that is complexed with the protein). No hydrogen atoms are shown anywhere. This is because crystal structures like this one usually do not resolve hydrogen positions, so the PDB file contains no hydrogen atoms.
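You can check what Chimera shows against the file itself. The sketch below reads ATOM and HETATM records using the fixed column positions of the PDB format (residue name in columns 18-20, element symbol in columns 77-78), lists the hetero groups, and counts hydrogen atoms. The function names and the fallback for a blank element column are our own assumptions, not part of Chimera or DOCK.

```python
from collections import Counter


def summarize_pdb(lines):
    """Return (HETATM atom counts per residue name, number of hydrogen atoms)."""
    het_atoms = Counter()
    hydrogens = 0
    for line in lines:
        record = line[:6].strip()
        if record not in ("ATOM", "HETATM"):
            continue  # skip headers, remarks, connectivity records, etc.
        res_name = line[17:20].strip()  # columns 18-20: residue name
        element = line[76:78].strip()   # columns 77-78: element symbol
        if not element:
            # Heuristic fallback (assumption): drop leading digits from the
            # atom name (columns 13-16) and use its first character.
            element = line[12:16].strip().lstrip("0123456789")[:1]
        if element.upper() == "H":
            hydrogens += 1
        if record == "HETATM":
            het_atoms[res_name] += 1
    return het_atoms, hydrogens


def summarize_pdb_file(path):
    """Summarize a PDB file on disk; the path is whatever you saved the file as."""
    with open(path) as handle:
        return summarize_pdb(handle)
```

For the downloaded file, we would expect the hetero groups to include NAG, HOH and M51, and the hydrogen count to be zero, in line with what is visible in Chimera.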

Preparation of the Protein Receptor for Docking


Sphere Selection

By this step, you should have mol2 files of the ligand and the protein, each with and without hydrogens (4 files). The next step is to create an efficient representation of the empty space inside the protein. This is done with the sphgen program, which tries to generate the largest possible sphere for any given empty space. In general, it is desirable for the spheres to overlap with each other, but not with the protein itself.

The sphgen software takes in a series of inputs from prompts to the user, but we can automate this by supplying these arguments in a file. We shall call this file INSPH. Write your INSPH file with the following syntax:

   <R flag> - enables sphere generation outside the protein surface (no overlap with the protein)
   <X flag> - uses all coordinates
   <double> - distance at which steric interactions are checked (units?)
   <double> - maximum radius of a generated sphere (Å)
   <double> - size of the sphere that rolls over the dms file surface for cavities (Å)
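As a rough illustration of the layout (not the file we used), the sketch below assembles INSPH text from the five entries listed above. It also assumes, based on our reading of the sphgen documentation, that the file begins with the name of the surface (dms) file and ends with the name of the sphere file to be written. The file names `3vjk.rec.dms` and `3vjk.rec.sph` and the numeric values are placeholders, and `build_insph` is a hypothetical helper, not part of DOCK.

```python
def build_insph(surface_file, sphere_file, side="R", subset="X",
                contact_limit=0.0, max_radius=4.0, min_radius=1.4):
    """Return INSPH text with one entry per line (layout is an assumption)."""
    entries = [
        surface_file,         # assumed first line: input surface (dms) file
        side,                 # R flag: spheres outside the protein surface
        subset,               # X flag: use all coordinates
        str(contact_limit),   # steric-contact check value (placeholder)
        str(max_radius),      # maximum sphere radius (placeholder)
        str(min_radius),      # rolling-sphere size (placeholder)
        sphere_file,          # assumed last line: output sphere file
    ]
    return "\n".join(entries) + "\n"


# Example call; write the returned text to a file named INSPH to use it:
#   from pathlib import Path
#   Path("INSPH").write_text(build_insph("3vjk.rec.dms", "3vjk.rec.sph"))
```

Keeping the values as function arguments makes it easy to regenerate the file when you want to try a different maximum radius.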

This is an example of how we wrote our file:


Does it matter if the dms is generated with the hydrogens?

This should give you an INSPH file that you can then run through sphgen:

sphgen -i INSPH -o OUTSPH

3vjk selected sphere -- super temporary (hah!) -- for good pictures, add white background

Box localization

Grid formation