Difference between revisions of "Virtual Screening Protocol"

From Rizzo_Lab
Jump to: navigation, search
(DOCKing)
(DOCKing)
Line 11: Line 11:
  
 
==DOCKing==
 
==DOCKing==
The version of DOCK currently being used for virtual screening is dock6_09-09-08.footprint.
+
The version of DOCK currently being used for virtual screening is dock6_09-09-08.footprint. The most up to date script is Trent's modifications, which include the database filter.  The database filter removes molecules with charges greater than +2 and less than -2, and also molecules with greater than 15 rotatable bonds. 
 +
 
 +
The script also takes the processed chunks from ZINC and splits them into two subsections: the first 60,000 and the remainder.  Because the sets are sorted by rotatable bonds, the first 60,000 should dock in fairly quickly, and be completed within the allowable wallclock limit.  The second set, or molecule 60,001 and beyond, will have higher numbers of rotatable bonds, and take exponentially more time for each molecule.  As per Sudipto's testset paper, we also have less confidence in the molecules with more rotatable bonds.  If this second job does not finish, DO NOT restart it.  The molecules that did not dock in will have higher numbers of rotatable bonds (should this be revised in light of the database filtering?)
 +
 
 +
Note that the values for the virtual screen are the default values. 
 +
 
 +
Path for script: ~balius/RCR/projects_BG/screening/run.dock2grid.max_lig_tebmod.csh
  
 
==Minimization of Poses==
 
==Minimization of Poses==

Revision as of 12:17, 29 June 2010

In this document, the current Rizzo group protocol will be described in detail. This protocol has be through successive iterations and has be used to select compounds for Flu and HIVgp41 (n=112).

ZINC Database

ZINC is a free, online database of commercially-available compounds. In June 2009, the "big" vendor libraries at a pH of 7 were downloaded and processed into chunks of approximately 100,000 molecules each. The ZINC molecules come with AMSOL charges already determined.

These chunks are then processed through MOE and sorted by rotatable bonds in ascending order. These processed chunks are available on BG, path=~pholden/RCR/projects_ZINC8/ZINC8, or on ringo, path=/media/sdb1/pholden/ZINC8.descending.rot.bonds. The lab is currently using the ChemDiv library for its virtual screen.

Downloading and Preparation of Receptor

Preparation of Reference Molecule for Footprint Rescoring

DOCKing

The version of DOCK currently being used for virtual screening is dock6_09-09-08.footprint. The most up to date script is Trent's modifications, which include the database filter. The database filter removes molecules with charges greater than +2 and less than -2, and also molecules with greater than 15 rotatable bonds.

The script also takes the processed chunks from ZINC and splits them into two subsections: the first 60,000 and the remainder. Because the sets are sorted by rotatable bonds, the first 60,000 should dock in fairly quickly, and be completed within the allowable wallclock limit. The second set, or molecule 60,001 and beyond, will have higher numbers of rotatable bonds, and take exponentially more time for each molecule. As per Sudipto's testset paper, we also have less confidence in the molecules with more rotatable bonds. If this second job does not finish, DO NOT restart it. The molecules that did not dock in will have higher numbers of rotatable bonds (should this be revised in light of the database filtering?)

Note that the values for the virtual screen are the default values.

Path for script: ~balius/RCR/projects_BG/screening/run.dock2grid.max_lig_tebmod.csh

Minimization of Poses

Pose Rescoring using Molecular Footprints

Clustering using MOE

Compound Selection