Difference between revisions of "2022 DOCK tutorial 1 with PDBID 6ME2"
(→Organizing File Directories)
|Line 153:||Line 153:|
= Organizing File Directories =
= Organizing File Directories =
It is imperative to one's ability to keep and access records, for any purpose, by having a detailed and organized directory list wherein any necessary files may be kept. While following this docking tutorial, it is recommended to have the following directories:
It is imperative to one's ability to keep and access records, for any purpose, by having a detailed and organized directory listwherein any necessary files may be kept. While following this docking tutorial, it is recommended to have the following directories :
= Dock Prep =
= Dock Prep =
Revision as of 21:51, 28 February 2022
- 1 Required Software
- 2 Chimera
- 3 DOCK
- 4 6ME2
- 5 Seawulf: Getting Started
- 6 Chimera: Getting Started
- 7 Organizing File Directories
- 8 Dock Prep
- 8.1 Initial investigation/preparation
- 8.2 Isolating the ligand and receptor
- 8.3 Surface Spheres
- 8.4 Generating Box/Grid
- 9 Docking
- 10 Virtual Screening
- 11 Virtual Screening Using MPI
- 12 Cartesian Minimization of Docked Molecules
- 13 Rescoring Docked Molecules
UCSF Chimera, UCSF DOCK 6.9
UCSF Chimera is a software which aids the interactive 3D visualization, editing, and analysis of biological macromolecules.
[insert screenshot of chimera]
Chimera will be used in this tutorial to prepare molecules for docking, as well as for simply exploring the structures of molecules of interest.
UCSF DOCK is a molecular docking program, used to find the optimal orientation of a ligand binding to a receptor. DOCK can be used for a number of different molecular modeling/design tasks, such as testing binding affinities, virtual screening, and de novo design.
6ME2 is the modeled structure of receptor MT1 of melatonin complexed with ramelteon (the ligand). Further details on the structure from the original researchers can be found at https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6696938/
Seawulf: Getting Started
Basics of the Command Line/Terminal
There are some basic commands for the command line/terminal that are necessary to know in order to complete this tutorial effectively.
When one first opens a command line window, it will begin in their home directory. To change this directory to, say, Desktop, type:
To get to a folder, say folder_1, in Desktop, type
To list out all of the directories/files contained within this current directory, type
To change the directory to one folder in the hierarchy above the current, type
to change directories a folder, say folder_2 inside the current directory, type
To create a file in the current directory, called "file_example", using vim, type
This will open up a prompt where one can type in, paste, or edit (if the file already exists in the current directory) the contents of the file. To edit the file once opened, type "i", which will change the environment to "INSERT" mode. "INSERT" is displayed at the bottom of the window when in this mode. To exit "INSERT" mode, press "Escape". If in "INSERT" mode, to save the file, press "Escape" and then type
which will save the file and open the terminal window back up to commands, in the current directory. If one types
the newly created "file_example" should be visible in the outputted list of files.
To move files between directories, say "file_example" is the file of interest, from Directory_1 to Directory_2, type
mv /Directory_1/file_example /Directory_2/file_example
The name of file_example may also be changed (to "file_new_name") during this move:
mv /Directory_1/file_example /Directory_2/file_new_name
Note this should be used to rename directories--move the file to the same directory and change the name to rename the file.
To copy files between directories, type the following
cp /Directory_1/file_example /Directory_2/file_new_name
And one can copy directories as well, by typing
cp -R /Directory_1/sub_directory /Directory_2/sub_directory_new_name
Note that the "-R" flag specifies to copy over the sub_directory folder and all of its contents from Directory_1 to Directory_2.
To create a new directory, called "new_directory" for example, type
To delete a file in the current directory, "file_example", type
To delete a directory called "directory_example", including all of its subdirectories and associated files, type
Note that if one wishes to perform the same action on multiple files at once, this can be done on the same line by separating the filenames with a space. For example to delete the files filename_1 in Directory_1 and filename_2 in Directory_2, type
rm Directory_1/filename_1 Directory_2/filename_2
Important note: the action of deleting files with rm cannot be undone. Use extreme caution when removing files/directories as there is no Ctrl + Z in the terminal.
SeaWulf is the High Performance Computing (HPC) cluster which researchers and students at Stony Brook University can gain access to use for computationally intensive tasks. Note Stony Brook students often do not automatically have access to SeaWulf, but students can request access from IT or be given access by their labs/instructors. For help getting access to SeaWulf, or questions/problems encountered when using the cluster at any point, submit a ticket to IT at https://it.stonybrook.edu/help/kb/understanding-seawulf.
If one has access to SeaWulf, they can access the cluster by typing the following into their terminal
ssh -X NETID@login.seawulf.stonybrook.edu
Note that the flag -X enables visualizations when running programs in SeaWulf, and NETID is the user's assigned NETID username, used to sign into Blackboard and all associated Stony Brook University domains. The terminal will then prompt the user for the password associated with the entered NETID, and then will give a list of options to use DUO Authentication to grant access. After cooperating with DUO, the user will be in their SeaWulf home directory.
To navigate to the AMS 536 space within SeaWulf, type
To access a user's personal scratch space, which is where users can keep files and test programs (but note all files in this directory will be automatically deleted if not modified in over 30 days), type
File transfers to/from SeaWulf
To transfer files between the user's local machine and the SeaWulf cluster, this can be done using scp. For example, to transfer the file "file_1" from the directory Directory_1 on the user's local directory to the user's scratch space on SeaWulf, type the following into the user's local terminal window (not connected to SeaWulf):
scp Directory_1/file_1 NETID@login.seawulf.stonybrook.edu:/gpfs/scratch/NETID
Likewise, to transfer file_1 from the scratch space in SeaWulf back to a local directory, type the following into the local terminal window:
scp NETID@login.seawulf.stonybrook.edu:/gpfs/scratch/NETID/file_1 /Directory_1
Chimera: Getting Started
Searching for/Downloading files from the PDB
The RCSB Protein Data Bank is a useful tool for searching for and downloading files containing the 3D structures of biological macromolecules. One can search for a molecule by using an exact 4 letter code (such as "6ME2") or by searching for a short description (such as "melatonin with ramelteon complex"). Note that there may be many molecules that fit a given description, so one may need to scroll through results shown after the search to find the one of interest.
(insert picture of PDB w/ search bar, results below after searching)
After finding the page for the structure of interest, one can download it in pdb or mol2 (or other) format by selecting the dark blue "Download Files" drop-down menu to the right of the page. Select the desired format, and the download will begin.
First-steps for visualizations in Chimera
Opening files in Chimera
Once one opens Chimera, they are met with a blue screen, and at the bottom right corner are two options called "Browse..." and "Fetch...". If the structure of interest is downloaded already to a local computer, choose "Browse..."; it is possible to download the structure from the PDB directly in Chimera using the "Fetch..." option, but note this requires knowledge of the 4-digit code for the structure listed in the PDB.
Chimera can be a helpful tool for producing publication-ready visualizations of molecules. When Chimera is first opened, the structure will be drawn against a black screen; if a different appearance is desired, click the "Presets" tab at the top, where there are a number of options. A recommended preset is "Publication 1 (silhouette, rounded ribbons)".
Note that it is possible to select portions of the current structure. To do so, click the "Select" tab at the top. Options will become visible to select by chain, residue, structure, and so on. One can also hit "Control" + "Click" to interactively select a small portion of the structure. If one presses the "up" arrow on their keyboard, this will select a larger substructure containing the initial selection. Hitting "up" again will select more, until the entire structure is selected. Alternatively, hitting the "down" arrow will go backwards in selection scope, until the initial small selection is restored.
Modifying selected substructures
One of the easiest ways to make a visualization in Chimera more clear is through color editing. If one is dealing with a protein-ligand complex, for example, it may be useful to color the two structures differently to contrast the two. To do this, select one of the structures (this can be done by interactively clicking and hitting the "up" arrow, or more easily by selecting by structure), then, while the substructure is selected, click the "Actions" tab, click "Color", and make a color choice.
If one wishes to isolate specific parts of a structure from the rest, it is necessary to understand how to delete substructures. This can be done by selecting a substructure of interest, clicking "Actions", hovering over "Atoms/Bonds", then under the shown options clicking "delete" at the bottom. This will remove the selected substructure, and cannot be immediately undone, so make sure that parts of the substructure have been colored to make their separation as clear as possible to avoid deleting any parts by accident.
If one wishes to save an entire session, with all configurations (such as visual orientation, focus, coloring, etc. which cannot be captures by just saving as a Mol2 or PDB file), one can go to "File" -> "Save Session As..." and this will save the current configurations.
(put in a picture?)
Organizing File Directories
It is imperative to one's ability to keep and access records, for any purpose, by having a detailed and organized directory list wherein any necessary files may be kept. While following this docking tutorial, it is recommended to have the following directories on both the local machine as well as in the user's designated space on SeaWulf:
000_files 001_structure 002_surface_spheres 003_gridbox 004_dock 005_virtual_screen 006_virtual_screen_mpi 007_cartesian_min 008_rescore
(put in here anything you notice when first opening up a sometimes messy structure--in 6ME2's case we had to remove detergents and water that were far from the of-interest binding site, handle non-standard amino acids, etc. Compare to the structure described in the literature to see if everything is accurate)
Isolating the ligand and receptor
(show how to create two separate files, one with just the ligand and one with just the receptor)
Prepping the ligand
(show how to add hydrogens/charges in the isolated ligand)
Prepping the receptor
(show how to add hydrogens/charges to the isolated receptor)
Generating DMS of receptor
(show how to get the surface/save as a DMS file)
Creating Surface Spheres
(show how to create the INSPH file in the correct directory, how to run sphgen using the INSPH file, show output).
Selecting Relevant Spheres
(show how to run sphere_selector and create necessary input file)
(show how to create showbox.in, how to run showbox)
(show how to create the file grid.in, how to run grid)