Flexible ligand docking to multiple receptor conformations: a practical alternative
Introduction
Molecular docking is playing an increasingly important role in lead discovery and design. Nevertheless, the docking field is still far from the goal of accurately and reliably predicting complex structures for arbitrary ligand–receptor pairs. It has long been recognized that a simplistic rigid ‘lock-and-key’ model of ligand–receptor interaction is inadequate and incorporation of ligand and receptor flexibility is required for accurate docking. While ligand flexibility has been addressed by a variety of algorithms, receptor flexibility remains a formidable challenge.
Several approaches to incorporate receptor flexibility in ligand docking were previously reviewed by Teodoro and Kavraki [1••]. They propose a classification of methods that spans five categories: (a) ‘soft’ receptors that limit penalties because of steric clashes, (b) selection of a few critical degrees of freedom in the binding site, (c) use of multiple receptor structures, (d) use of modified molecular simulation methods, and (e) use of collective degrees of freedom as a new basis of representation for protein flexibility. In this review, we focus on using static multiple receptor conformations, either experimental or computationally generated.
Direct modeling of protein movements associated with binding site flexibility represents a significant problem because of the dual challenge of high dimensionality of the conformational space and of the complexity of energy function. A typical ligand binding site for a drug-like molecule consists of ten to twenty amino acid side-chains, which may mean dozens of potentially rotatable torsions. This number can easily be several times larger than the number of degrees of freedom for the ligand (typically 6–12). Considering the backbone movements may dramatically worsen the situation since, in contrast to relatively independent side-chains, each backbone movement affects multiple side-chains. Thus, fully flexible receptor/ligand docking simulation may involve sampling of an order of magnitude higher number of degrees of freedom than typical rigid-receptor/flexible ligand simulations routinely used in current structure-based virtual screening and ligand design projects. Apart from being computationally demanding, this expansion of sampling imposes high requirements on the energy function that must be able to discriminate a small number of low energy structures that are actually realized in nature from the vast number of hypothetical conformations generated by the sampling procedures. Taken to the limit, ligand docking into the flexible receptor essentially becomes the protein folding problem in the presence of ligand. Therefore, practical approaches to receptor flexibility incorporation into docking simulations must restrict radically the subspace of the full protein conformational space that they actually search.
Limiting flexibility to side-chains makes the problem much more tractable, and an exhaustive search of the binding site side-chain conformations, similar to what has been done in protein docking [2], is possible for smaller ligands [3, 4]. As an alternative to exhaustive search, a ‘minimal rotation hypothesis’ was proposed by Zavodsky and Kuhn [5•]. Their docking algorithm, SLIDE, attempts to resolve ligand–receptor steric clashes by a minimal number of side-chain rotations, with the cost of side-chain movement evaluated as a product of the rotation angle and the number of atoms moved.
Depending on the specific system, side-chain flexibility alone may or may not be sufficient for adequate modeling. For example, conformational variability in the HIV protease binding site is apparently well described in terms of movements of several side-chains and a water molecule [6]. On the contrary, many kinases exhibit loop rearrangements as well as large-scale mutual movement of the two ‘lobes’ delimiting the active site [7]. Diversity in ligand binding mechanisms and the frequent unpredictability of receptor movement types makes the use of pre-determined (by experimental or computational means) multiple receptor conformations (MRC) an attractive practical alternative.
Section snippets
Problems beyond receptor flexibility
Detailed case analysis of a large number of incorrect ligand–receptor docking poses or inadequate binding scores finds many alternative sources of error beyond receptor flexibility. They include ‘fantasy’ (outside the electron density) positions of the ligand pocket atoms (side-chains or loops), incorrect orientations of His, Asn, and Gln side-chains, improperly assigned histidine tautomers and charged states for aspartate, glutatame and histidine, and improper proline ring puckering, among
Ensemble docking
Given the variety and success of available flexible ligand/rigid-receptor docking algorithms, the easiest way to include multiple conformations of receptor in a docking experiment is simply to run multiple independent simulations (Figure 1). However, integration of MRC sampling into the docking algorithm may offer advantages in terms of calculation speed as well as simplification of the data management. Such ‘ensemble docking’ extensions of original rigid-receptor algorithms have been reported,
Multiple receptor conformations and virtual screening
While MRC docking may improve pose prediction, each additional conformation increases the chance of a false positive in VLS. Possible increases in false positive rates with the number of MRCs are also encountered in protein–protein docking simulations, recently reviewed in reference [21••]. Furthermore, the usual tacit assumption that an MRC docking improves the results for each receptor, albeit at a higher cost, is not true at all. It is entirely possible that for those easier cases in which
Generation of receptor conformations
While significant advances have been made in the utilization of MRCs for ligand docking, automatic generation of reasonably small yet representative sets of receptor conformation remains challenging. For popular targets, this issue may be increasingly addressed by the rapid expansion of the PDB. Dozens of complex X-ray structures are available for several tyrosine kinases, HIV protease, and a number of other proteases and metalloproteases. These numbers further expand when close homologs are
Gapped models and ligand-guided conformer selection
Sometimes no atoms are better than incorrectly placed atoms. In reference [31] the alanine conversion method was proposed, side-chain conformations were successfully predicted without combinatorial search of the side-chain conformations of the neighbors by a simple conversion of the nearby side-chains to alanines. Later the side-chains can be built back and refined. The same idea can be applied to protein docking and ligand docking. Sherman et al. proposed a set of four rules to allow the
Conclusions
The nearly exponential growth in number of protein structures in PDB, the improved understanding of the induced fit, and improved conformational sampling methods make the multiple receptor conformations approach to the ligand docking increasingly attractive. This MRC approach is relatively fast and still suitable for virtual ligand screening as long as the number of fixed receptor conformations is relatively small and carefully chosen. The ligand-guided selection from a set of generated
References and recommended reading
• of special interest
•• of outstanding interest
Acknowledgements
The authors thank Irina Kufareva and Giovanni Bottegoni for helpful discussions and artistic assistance, and Kim Reynolds for reviewing the manuscript.
References (32)
- et al.
Diverse, high-quality test set for the validation of protein–ligand docking performance
J Med Chem
(2007) - et al.
Ensemble docking into flexible active sites. Critical evaluation of FlexE against JNK-3 and beta-secretase
J Chem Inf Model
(2006) - et al.
Representing receptor flexibility in ligand docking through relevant normal modes
J Am Chem Soc
(2005) - et al.
Functional plasticity in the substrate binding site of beta-secretase
Structure
(2005) - et al.
Discovery of antiandrogen activity of nonsteroidal scaffolds of marketed drugs
Proc Natl Acad Sci USA
(2007) - et al.
Conformational flexibility models for the receptor in structure based drug design
Curr Pharm Des
(2003) - et al.
Detailed ab-initio prediction of lysozyme-antibody complex with 1.6-Angstrom accuracy
Nat Struct Biol
(1994) - et al.
Molecular modelling prediction of ligand binding site flexibility
J Comput Aided Mol Des
(2004) - et al.
ROSETTALIGAND: protein-small molecule docking with full side-chain flexibility
Proteins
(2006) - et al.
Side-chain flexibility in protein–ligand binding: the minimal rotation hypothesis
Protein Sci
(2005)