2020 DOCK tutorial 1 with PDBID 3VJK
DOCK 6 is a molecular modeling software that is used for investigating ligand binding geometry and ligand interactions. Consequently, its relevance in the field of drug discovery is clear. This program was initially developed by Dr. Irwin Kuntz and colleagues at the University of California San Francisco. A major feature of DOCK 6 is the search algorithm that is used: “anchor-and-grow”; this sets the software apart from its counterparts . This method first identifies the rigid structure of a particular ligand--anchors-- then the program docks the ligand using its geometry. Following the docking, a partial conformational search is performed. In simple terms, the positions of the anchor are allowed independently however once a favored conformation is found it is retained. Once this step is completed, energy minimization is carried out Full details here.
3VJK is the PDB code for the crystal structure of human dipeptidyl peptidase IV, also known as DPP-4, with MP-513, which is called Teneligliptin . DPP-4 is a symmetrical dimer and has 729 residues per chain. The crystal has a resolution of 2.49 Å, a R-value of 0.279, and a R-free value of 0.225. To add on, the molecule in the crystal--Teneligliptin--has been approved for the treatment of type II Diabetes in Japan and has shown promising results in vivo .
=Software packages =
To follow this tutorial you will need to have the following programs installed:
This tutorial used Dock 6.9 & Chimera 1.13.1
<<Where can outsiders find scripts like sphgen?>> UCLA website? rizzo lab website? There seem to be several sources on google
The following tutorial will use the organization of directories prepared below. This specific organization is not required but is recommended. The "mkdir" command will be employed which creates a new folder in which files can be saved. To navigate into a directory use the command "cd" followed by the directory name. To change to the directory the next level up, use the command "cd .." .
Within the Bash Shell environment:
mkdir 3VJK cd 3VJK mkdir 001.structure 002.surface_spheres 003.gridbox 004.dock 005.virtual_screen 006.virtual_screen_mpi 007.cartesianmin 008.rescore
All eight directories should be created now and this can be visually confirmed with the command "ls".
Preparing the Structure for Docking
Downloading and Opening PDB File
Download the PDB Format file from the associated rcsb page here. This web page includes associated articles, files, and other meta data.
Download files -> PDB Format
File -> Open -> (Location where you downloaded PDB file)
Preparation of the Protein Receptor for Docking
Select -> Chain -> B Actions -> Atoms/ Bonds -> Delete
Only one monomer of protein should remain now.
Select -> Residue -> All nonstandard Actions -> Atoms/ Bonds -> Delete
The receptor is now "clean" and should be saved prior to the next step.
File -> Save Mol2 -> "3VJK_rec_woH.mol2"
Adding Hydrogens and Charge
Structure Editing -> Add H -> Ok
Next partial charges will be added to each atom in the receptor.
Structure Editing -> Add Charge -> (AM1BCC charges should be selected) -> Ok
Now save this as a mol2 file "3VJK_rec_dockprep.mol2" and move it to the directory "001.structure"
Surface Generation & Sphere Selection
Surface Generation In Chimera a file which represents the surface of the protein will be created. The surface will be used to create a negative image of the protein (spheres which occupy the cavities and external face of the protein). These spheres are used to guide the ligand during docking.
In Chimera open "3VJK_rec_woH.mol2" :
Actions -> Surface -> Show Tools -> Structure Editing -> Write DMS -> "3VJK_rec_surface.dms"
Move this to the directory "002.surface_spheres"
Sphere Selection By this step, you should have the mol2 extractions of ligand and protein, in both hydrogenated and unhydrogenated forms (4 files). The next activity is to create an efficient representation of empty space inside the protein. This is done with the sphgen script, which tries to generate the largest possible sphere for any given empty space. In general, it is desirable for the spheres will eclipse with each other, but not with the protein itself.
The sphgen software takes in a series of inputs from prompts to the user, but we can automate this by piping these arguments through a file. We shall can this file INSPH. Generate your INSPH file with the following syntax:
[your_receptor].dms <R flag> - enables sphere generation outside the protein surface (no eclipsing) <X flag - uses all coordinates <double> - distance that steric interactions are checked (units?) <double> - Maximum sphere radius of generated sphere (units?) <double> - Size of sphere that rolls over dms file surface for cavities (units?) [your_receptor].sph
This is an example of how we wrote our file:
3VJK_rec_surface.dms R X 0.0 4.0 1.4 3VJK_receptor_woH.sph
Does it matter if the dms is generated with the hydrogens?
This should produce an sph file that you can then run through sphgen
sphgen -i INSPH -o OUTSPH