Flexible ligand docking to multiple receptor conformations: a practical alternative

https://doi.org/10.1016/j.sbi.2008.01.004Get rights and content

State of the art docking algorithms predict an incorrect binding pose for about 50–70% of all ligands when only a single fixed receptor conformation is considered. In many more cases, lack of receptor flexibility results in meaningless ligand binding scores, even when the correct pose is obtained. Incorporating conformational rearrangements of the receptor binding pocket into predictions of both ligand binding pose and binding score is crucial for improving structure-based drug design and virtual ligand screening methodologies. However, direct modeling of protein binding site flexibility remains challenging because of the large conformational space that must be sampled, and difficulties remain in constructing a suitably accurate energy function. Here we show that using multiple fixed receptor conformations, either experimentally determined by crystallography or NMR, or computationally generated, is a practical shortcut that may improve docking calculations. In several cases, such an approach has led to experimentally validated predictions.

Introduction

Molecular docking is playing an increasingly important role in lead discovery and design. Nevertheless, the docking field is still far from the goal of accurately and reliably predicting complex structures for arbitrary ligand–receptor pairs. It has long been recognized that a simplistic rigid ‘lock-and-key’ model of ligand–receptor interaction is inadequate and incorporation of ligand and receptor flexibility is required for accurate docking. While ligand flexibility has been addressed by a variety of algorithms, receptor flexibility remains a formidable challenge.

Several approaches to incorporate receptor flexibility in ligand docking were previously reviewed by Teodoro and Kavraki [1••]. They propose a classification of methods that spans five categories: (a) ‘soft’ receptors that limit penalties because of steric clashes, (b) selection of a few critical degrees of freedom in the binding site, (c) use of multiple receptor structures, (d) use of modified molecular simulation methods, and (e) use of collective degrees of freedom as a new basis of representation for protein flexibility. In this review, we focus on using static multiple receptor conformations, either experimental or computationally generated.

Direct modeling of protein movements associated with binding site flexibility represents a significant problem because of the dual challenge of high dimensionality of the conformational space and of the complexity of energy function. A typical ligand binding site for a drug-like molecule consists of ten to twenty amino acid side-chains, which may mean dozens of potentially rotatable torsions. This number can easily be several times larger than the number of degrees of freedom for the ligand (typically 6–12). Considering the backbone movements may dramatically worsen the situation since, in contrast to relatively independent side-chains, each backbone movement affects multiple side-chains. Thus, fully flexible receptor/ligand docking simulation may involve sampling of an order of magnitude higher number of degrees of freedom than typical rigid-receptor/flexible ligand simulations routinely used in current structure-based virtual screening and ligand design projects. Apart from being computationally demanding, this expansion of sampling imposes high requirements on the energy function that must be able to discriminate a small number of low energy structures that are actually realized in nature from the vast number of hypothetical conformations generated by the sampling procedures. Taken to the limit, ligand docking into the flexible receptor essentially becomes the protein folding problem in the presence of ligand. Therefore, practical approaches to receptor flexibility incorporation into docking simulations must restrict radically the subspace of the full protein conformational space that they actually search.

Limiting flexibility to side-chains makes the problem much more tractable, and an exhaustive search of the binding site side-chain conformations, similar to what has been done in protein docking [2], is possible for smaller ligands [3, 4]. As an alternative to exhaustive search, a ‘minimal rotation hypothesis’ was proposed by Zavodsky and Kuhn [5]. Their docking algorithm, SLIDE, attempts to resolve ligand–receptor steric clashes by a minimal number of side-chain rotations, with the cost of side-chain movement evaluated as a product of the rotation angle and the number of atoms moved.

Depending on the specific system, side-chain flexibility alone may or may not be sufficient for adequate modeling. For example, conformational variability in the HIV protease binding site is apparently well described in terms of movements of several side-chains and a water molecule [6]. On the contrary, many kinases exhibit loop rearrangements as well as large-scale mutual movement of the two ‘lobes’ delimiting the active site [7]. Diversity in ligand binding mechanisms and the frequent unpredictability of receptor movement types makes the use of pre-determined (by experimental or computational means) multiple receptor conformations (MRC) an attractive practical alternative.

Section snippets

Problems beyond receptor flexibility

Detailed case analysis of a large number of incorrect ligand–receptor docking poses or inadequate binding scores finds many alternative sources of error beyond receptor flexibility. They include ‘fantasy’ (outside the electron density) positions of the ligand pocket atoms (side-chains or loops), incorrect orientations of His, Asn, and Gln side-chains, improperly assigned histidine tautomers and charged states for aspartate, glutatame and histidine, and improper proline ring puckering, among

Ensemble docking

Given the variety and success of available flexible ligand/rigid-receptor docking algorithms, the easiest way to include multiple conformations of receptor in a docking experiment is simply to run multiple independent simulations (Figure 1). However, integration of MRC sampling into the docking algorithm may offer advantages in terms of calculation speed as well as simplification of the data management. Such ‘ensemble docking’ extensions of original rigid-receptor algorithms have been reported,

Multiple receptor conformations and virtual screening

While MRC docking may improve pose prediction, each additional conformation increases the chance of a false positive in VLS. Possible increases in false positive rates with the number of MRCs are also encountered in protein–protein docking simulations, recently reviewed in reference [21••]. Furthermore, the usual tacit assumption that an MRC docking improves the results for each receptor, albeit at a higher cost, is not true at all. It is entirely possible that for those easier cases in which

Generation of receptor conformations

While significant advances have been made in the utilization of MRCs for ligand docking, automatic generation of reasonably small yet representative sets of receptor conformation remains challenging. For popular targets, this issue may be increasingly addressed by the rapid expansion of the PDB. Dozens of complex X-ray structures are available for several tyrosine kinases, HIV protease, and a number of other proteases and metalloproteases. These numbers further expand when close homologs are

Gapped models and ligand-guided conformer selection

Sometimes no atoms are better than incorrectly placed atoms. In reference [31] the alanine conversion method was proposed, side-chain conformations were successfully predicted without combinatorial search of the side-chain conformations of the neighbors by a simple conversion of the nearby side-chains to alanines. Later the side-chains can be built back and refined. The same idea can be applied to protein docking and ligand docking. Sherman et al. proposed a set of four rules to allow the

Conclusions

The nearly exponential growth in number of protein structures in PDB, the improved understanding of the induced fit, and improved conformational sampling methods make the multiple receptor conformations approach to the ligand docking increasingly attractive. This MRC approach is relatively fast and still suitable for virtual ligand screening as long as the number of fixed receptor conformations is relatively small and carefully chosen. The ligand-guided selection from a set of generated

References and recommended reading

  • • of special interest

  • •• of outstanding interest

Acknowledgements

The authors thank Irina Kufareva and Giovanni Bottegoni for helpful discussions and artistic assistance, and Kim Reynolds for reviewing the manuscript.

References (32)

  • F. Osterberg et al.

    Automated docking to multiple target structures: incorporation of protein mobility and structural water heterogeneity in AutoDock

    Proteins

    (2002)
  • C.N. Cavasotto et al.

    Protein flexibility in ligand docking and virtual screening to protein kinases

    J Mol Biol

    (2004)
  • N. Furnham et al.

    Is one solution good enough?

    Nat Struct Mol Biol

    (2006)
  • N. Furnham et al.

    Knowledge-based real-space explorations for low-resolution structure determination

    Structure

    (2006)
  • M. Schapira et al.

    Nuclear hormone receptor targeted virtual screening

    J Med Chem

    (2003)
  • H. Claussen et al.

    FlexE: efficient molecular docking considering protein structure variations

    J Mol Biol

    (2001)
  • Cited by (0)

    View full text