Difference between revisions of "DOCK6 POSE Reproduction"

From Rizzo_Lab
Jump to: navigation, search
(5.Generating a graphs of the results)
(5.Generating a graphs of the results)
Line 67: Line 67:
python DOCK6_Pose_Reproduction_analysis_v3.py CSV_File 3
python DOCK6_Pose_Reproduction_analysis_v3.py CSV_File 3

Revision as of 14:27, 15 July 2020

The purpose of this tutorial is to provide people with instructions on how to perform a commonly used test, pose reproduction that evaluates DOCK6's ability to reproduce known experimental results.

I. Introduction

DOCK6 is a commonly used docking protocol which assess the affinity of a ligand, a drug candidate to a target site (Protein, Enzyme, RNA). To evaluate a software's ability to accurately reproduce experimental results, an experiment called Pose Reproduction.

Pose Reproduction used an experimental known ligand and protein complex from the PDB database and attempts to dock this ligand back into it's original location. If the lowest energy ligand(most energetically favorable) is within 2.0 RMSDh of the original target site, this is referred to as a docking success. If any of the ligands, but not the lowest energy ligand is within 2.0 RMSDh of the original target site, this is referred to as a scoring failure. If none of the ligands are within 2.0 RMSDh of the original target site this is referred to as a sampling failure.

Pose Reproduction Intro.JPG

II. Preparing DOCK directories

     The first step in this process is to prepare all of the DOCK6 directories for each of the systems present. The first step in the procedure is to prepare all a directory for each PDB system. The testset used in this docking protocol was the SB2012 test set with 1043 systems. The testset should be formatted as below with a separate directory for each PDB system.

PDB Directories

     Following this each PDB system needs it 5 files within it to properly function, the original receptor file, the original ligand file, the grid files, the sphere's generated. These files are formatted as below. 

Files in 121P directory

     These files are used to perform the flexible docking of the ligand into the receptor in the following step. The remaining steps have prepared scripts to perform the remaining steps.

3.Performing Pose Reproduction

      Following this a script titled FLX.sh is used to generate a new directory of any name that is preferred within the directory. To modify the script at all it is recommended that you make your own version of this script.
<pre>bash FLX.sh name_of_run List_of_PDB_Files
 bash FLX.sh FLX clean.systems.all
 Need to check script for specific parameters again for FLX.sh file
     This script both creates a dock parameter file and docks the ligand into the receptor target site. Recommend to submit this script to slurm.

4.Generating a CSV and DAT file of Results

     Following this a CSV file will be generated that has the results of each of the scores, that can be further analyzed. There will also be 3 dat files generated that provides which systems are successes, scoring failures, and sampling failures.

Use script in directory that provides the link to all the PDB directories.

python /gpfs/projects/rizzo/yuchzhou/RCR/DOCK_testset/bickel_laverty_calculate_dock6_results.py FLX /gpfs/projects/rizzo/yuchzhou/RCR/DOCK_testset/clean.systems.all FOR_Tutorial

This creates a FOR_Tutorial.csv for all of the systems

5.Generating a graphs of the results

These are scripts used to analyze CSV Files generated in the previous graph. 3 different graphing options are created using the subplot to be analyzed with the docking experiment.

python DOCK6_Pose_Reproduction_analysis_v3.py CSV_File 1

python DOCK6_Pose_Reproduction_analysis_v3.py CSV_File 2 FOR Tutorial 2.png

python DOCK6_Pose_Reproduction_analysis_v3.py CSV_File 3

FOR Tutorial 3.png