DOCK GA Development Goals

From Rizzo_Lab
Revision as of 10:52, 17 October 2022 by BrockBoysan (talk | contribs)
Jump to: navigation, search
Tasks src Owner Complete?
Horizontal pruning issue not catching chemically identical molecules LEP no
Fix bug that collapsed atom coordinates everywhere? nowhere? somewhere. LEP YES MUAHAHAHAHA!
Added Delimeter header conf_gen_ga LEP Yes
Fix xover only feature conf_gen_ga LEP Yes
Put in error messages for mut_rate > 1 conf_gen_ga LEP Yes
Manual user-defined mutation type conf_gen_ga LEP Yes
Remove check only option conf_gen_ga LEP Yes
Add single molecule evolution in testcase in install dir. install/test/genetic LEP yes
Add leading 0's to xover output filenames conf_gen_ga.cpp JDB no
DNM replacement unable to build list? conf_gen_ga.cpp JDB no
Multi-layer replacement for Amides conf_gen_ga.cpp JDB no
Compute delta slope of fitness score congen_ga.cpp BTB no
slow down molecular evolution so there are less drastic canges between each successive generation congen_ga.cpp BTB no
bring in new parents (e.g. from a pool of molecules) based on convergence confgen_ga.cpp BTB no
user defined point vs on-the-fly convergence confgen_ga.cpp BTB no
metropolis selection for tournament/roulette conf_gen_ga.cpp BTB no



To Do List

  1. tanimoto coefficient percent change - might be inaccurate due to tan coef behavior
  2. Rotatable bond changes
  3. Number of aromatic rings
  4. -xover (guided based on score) - Good v Good ; Bad v Good ; Bad v Bad THIS
  5. nonexhaustive xover (pick subset of xover based on probability)
  6. nonexhaustive xover for each pair of parents - have a set number of bonds for xover
  7. 2-3 point xover at once
  8. adaptive maintenance ensemble based on ensemble convergence THIS
  9. Mutations-
    1. adaptive mutation rate THIS
    2. pick location of mutation based on something
    3. pick mutation type based on behavior of ensemble
    4. molecules too large boost deletion
    5. molecules too small, add more groups
    6. change ...boost replace/sub
    7. mutation type selection based on probability vs ensemble
    8. complete x # y mutation so far so less prevalent etc
    9. 3 layer subs do no work so don't do them
    10. replace > 1 segment
  10. fitness-
    1. turn on and off niching adaptive/extinction
    2. reduce boost of fragments and all poor mols with niching
    3. pareto/mulitobjective ga
  11. selection-
    1. adaptive keep #p and #o (parents, offspring)
  12. extinction-
    1. user defined point vs on-the-fly convergence THIS
  13. stop-
    1. convergence
  14. which molecules are best-
    1. best first pruning - now uses descriptor score even if niching ned to delta to fitness/niching when used
    2. geometric diversity using Hingarian and Tan pruning
  15. Determine whether the dummy vdw file is necessary for all DN (including GA) or just DN.

Known Bugs

  1. Molecules Processed bug (dock.cpp)
  2. verbose mol stats (amber typer)
  3. molecule being renamed when going into repl even tho it's the same molecule