Difference between revisions of "Cross Docking SB2024 V1 DOCK6.10 A"
(→III.Aligning protein families) |
(→III.Aligning protein families) |
||
Line 37: | Line 37: | ||
Certain variables need to be sourced every time a new session is started to run these scripts: | Certain variables need to be sourced every time a new session is started to run these scripts: | ||
source 000.source.env.sh | source 000.source.env.sh | ||
+ | |||
+ | For each protein family, create a file with the name of the protein family, and list all PDB structures for that family: | ||
+ | vi Acetylcholinesterase.txt | ||
+ | |||
+ | The file should like this (Note: The first PDB listed will be the reference which all other proteins are aligned to - a criteria should be chosen for which is first): | ||
+ | 1EVE | ||
+ | 1GPK | ||
+ | 1GPN | ||
+ | ... | ||
+ | |||
+ | After creating a file listing PDB codes for each protein family, create a file listing protein family names: | ||
+ | vi zzz.Families.txt | ||
+ | |||
+ | The file should look like this: | ||
+ | Acetylcholinesterase | ||
+ | ... | ||
==IV.Docking molecules== | ==IV.Docking molecules== |
Revision as of 22:07, 27 February 2024
!!!!!!Under Construction!!!!!!
The purpose of this tutorial is to develop a uniform method to test cross docking across the Rizzo lab with the DOCK software. Note any data in this tutorial is solely for the purpose of example.
Contents
I.Introduction
- Cross docking is a test which is fundamentally similar to pose reproduction. If you are not experienced running pose reproduction yet, begin with:
https://ringo.ams.stonybrook.edu/index.php/Pose_Reproduction_Tutorial
- Cross docking measures pose reproduction accuracy with differing protein conformations/ structures as an additonal variable. It is a more translatable test to "real world" virtual screening, because it tests the ability to identify native poses, even when protein conformation/ sidechain packing is not induced to the particular ligand. When virual screening with a rigid receptor, the particular conformation chosen will not be ideal for all binder chemotypes, but nonetheless it is desirable to predict near native poses.
- The outcomes for cross docking are the same as pose reproduction, although there is a fourth outcome termed "incompatible". This is when a ligand is energetically incompatible in its native pose when complexed in an alternative, rigid, structure of the protein.
II.Necessary files
Scripts to run Cross Docking in batch mode are found at:
https://github.com/rizzolab/Benchmarking_and_Validation
This tutorial rebuilds systems in an aligned frame. Thus standard test set files are not applicable because proteins families will not necessarily be aligned at protein backbone. Instead of directly using a finalized and prepared test set as Pose Reproduction does, this prepares files from initial preparatory files. The list of necessary files are:
${pdb_id}.lig.moe.mol2 ${pdb_id}.cof.moe.mol2 (if applicable) ${pdb_id}.rec.foramber.pdb
For further explanation of these 3 files see section "Preliminary File Preparation":
https://ringo.ams.stonybrook.edu/index.php/Test_Set_Tutorial_V1
III.Aligning protein families
Enter CrossDocking Directory:
cd Benchmarking_and_Validation/CrossDocking/
The first step is to align protein families using the program UCSF Chimera. This tutorial will not cover how protein families are determined, although one option is to restrict a family to structures with a single "UniProt ID" and no differing mutations within an active site. Lists of protein families can be found at (Note: see each paper for how families were determined):
https://ringo.ams.stonybrook.edu/index.php/Rizzo_Lab_Downloads
A list of PDB codes for each protein family needs to be provided:
cd zzz.family_lists
Certain variables need to be sourced every time a new session is started to run these scripts:
source 000.source.env.sh
For each protein family, create a file with the name of the protein family, and list all PDB structures for that family:
vi Acetylcholinesterase.txt
The file should like this (Note: The first PDB listed will be the reference which all other proteins are aligned to - a criteria should be chosen for which is first):
1EVE 1GPK 1GPN ...
After creating a file listing PDB codes for each protein family, create a file listing protein family names:
vi zzz.Families.txt
The file should look like this:
Acetylcholinesterase ...
IV.Docking molecules
V.Cross Docking Analysis
-SEE README FILE IN GIT REPO FOR ADDTIONAL DETAILS THAT MAY NOT BE COVERED HERE
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
Tutorial Written By: Christopher Corbo, Rizzo Lab, Stony Brook University (2024)
>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>