An effective docking strategy for virtual screening based on multi-objective optimization algorithm
Honglin Li
0
1
2
Hailei Zhang
4
Mingyue Zheng
1
Jie Luo
3
Ling Kang
0
Xiaofeng Liu
1
Xicheng Wang
0
Hualiang Jiang
1
2
0
Department of Engineering Mechanics, State Key Laboratory of Structural Analyses for Industrial Equipment, Dalian University of Technology
,
Dalian 116023
,
PR China
1
Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences
,
Shanghai 201203
,
PR China
2
School of Pharmacy, East China University of Science and Technology
,
Shanghai 200237
,
PR China
3
School of Information Science and Engineering, East China University of Science and Technology
,
Shanghai 200237
,
PR China
4
Department of Medical Oncology, Dana-Farber Cancer Institute and Harvard Medical School
,
Boston, MA 02115
,
USA
Background: Development of a fast and accurate scoring function in virtual screening remains a hot issue in current computer-aided drug research. Different scoring functions focus on diverse aspects of ligand binding, and no single scoring can satisfy the peculiarities of each target system. Therefore, the idea of a consensus score strategy was put forward. Integrating several scoring functions, consensus score re-assesses the docked conformations using a primary scoring function. However, it is not really robust and efficient from the perspective of optimization. Furthermore, to date, the majority of available methods are still based on single objective optimization design. Results: In this paper, two multi-objective optimization methods, called MOSFOM, were developed for virtual screening, which simultaneously consider both the energy score and the contact score. Results suggest that MOSFOM can effectively enhance enrichment and performance compared with a single score. For three different kinds of binding sites, MOSFOM displays an excellent ability to differentiate active compounds through energy and shape complementarity. EFMOGA performed particularly well in the top 2% of database for all three cases, whereas MOEA_Nrg and MOEA_Cnt performed better than the corresponding individual scoring functions if the appropriate type of binding site was selected. Conclusion: The multi-objective optimization method was successfully applied in virtual screening with two different scoring functions that can yield reasonable binding poses and can furthermore, be ranked with the potentially compromised conformations of each compound, abandoning those conformations that can not satisfy overall objective functions.
-
Background
With the thriving development and confirmative
significance of computational chemistry in drug discovery,
more and more medicinal chemists and pharmacologists
are using computational methods in their drug discovery
research[1, 2], and numerous drugs developed based on
the clues provided by computations (modeling and
simulation) have entered clinical trials or have been
launched into the market already[3]. For the
computational chemist, an attractive goal is to develop computer
programs capable of automatically evaluating large-scale
chemical libraries (databases). These computational
methods are referred to as virtual screening (VS)[4]. In
general, two strategies have been employed in virtual
screening: (1), using pharmacophore-based database
searching (PBDS) methods to identify potential hits
from chemical libraries, mostly in the cases where
threedimensional (3D) structures of the targets are unknown;
and (2), using molecular docking approaches to screen
the libraries in cases where the 3D structures of the
targets are available[4, 5]. Because more and more 3D
structures of drug target proteins are available, VS with
molecular docking approaches has become promising, as
demonstrated by numerous recent examples[2, 6-10].
The core steps of structure-based virtual screening (SBVS)
are docking and scoring. Since Kuntz et al.[11] published
the first docking algorithm DOCK in 1982, numerous
docking programs have been developed during the past
two decades [12-25]. Several comprehensive reviews of
the advances of docking algorithms and applications
have been published [26-30]. Scoring (ranking) the
compounds retrieved from a database is performed
simultaneously with the docking simulation. Molecular
docking is a typical optimization problem, for it is
difficult to obtain the global optimum solution. As the
fitness during the optimization process, scoring function
should be fast and accurate enough, allowing
simultaneous ranking of the retrieved poses in the optimization
process. Based on this idea, several scoring functions
have been developed [31-33]. Unfortunately, there is no
scoring function developed so far that can reliably and
consistently predict a ligand-protein binding mode and
the binding affinity at the same time[31, 32, 34].
Therefore, heuristic docking and consensus score
strategies are often used in virtual screening [34-36].
Since a huge number of compounds in a database have
to be thoroughly tested in the virtual screening process,
several crucial issues have to be addressed. One is the
computational cost; only docking programs capable of
docking a flexible ligand within a reasonable time scale
are acceptable for virtual screening[34]. The other one is
the ability to discriminate between true actives and
inactive compounds; only those docking approaches
able to distinguish the active molecules rapidly and
accurately, are suitable for virtual screening applications
in practice. Although the consensus score strategy has
demonstrated an advantage over single scores, it is
actually based on the results of single scoring
optimization. This means that consensus scoring only re-scores a
limited molecules or conformations (generally 30 or
more), thereby inevitably losing a number of true
positives[33, 34]. To some extent, consensus scoring
seems to be far-fetched and artificial [37].
Most conformational optimization methods in docking
program can only deal with a single objective, such as
binding energy, shape complementarity, or chemical
complementarity. However, real-world problems
normally involve multiple objectives (possibly conflicting
ones) or optimization criteria, which should be satisfied
simultaneously, and suitable solutions to the overall
problem cannot be found by using individual
optimization algorithms with single objectives[38]. For example,
an optimization solution for the binding affinity
(energy) between a ligand and a receptor is usually not
the optimization solution for other criteria (e.g. shape or
chemical complementarity, etc.). Similar problems in
combinatorial library design and structural
superposition of three-dimensional molecules have been reported
[39, 40]. Thus, there is a need for an optimization
algorithm compromising several objectives, which may
result in more reasonable and robust binding modes
between ligands and receptors. In fact, it is a problem of
multi-objective optimization (MO)[41], which tends to
find a set of alter (...truncated)