Abstract
Inhibition of p97, a key player in the ubiquitin-proteasome degradation pathway, has been proposed as a treatment of cancer. This concept was nearly realized recently when a potent p97 inhibitor, 1-[4-(benzylamino)-5H,7H,8H-pyrano[4,3-d]pyrimidin-2-yl]-2-methyl-1H-indole-4-carboxamide (CB-5083), was developed and demonstrated broad antitumor activity in various tumor models. CB-5083 functions as a competitive inhibitor that binds selectively to the ATP-binding site of the D2 domain, although both the D1 and D2 ATPase sites of p97 are highly similar. Despite its promising anticancer activity, CB-5083 failed its phase I clinical trials due to an unexpected off-target effect, which necessitates further improvement of the inhibitor. In this study, we determined the crystal structure of N-terminal domain-truncated p97 in complex with CB-5083. It provides a structural basis for the specificity of CB-5083 toward the D2 domain, offers an explanation in atomic detail for the mutations that confer resistance to CB-5083, and establishes a foundation for future structure-guided efforts to develop the next generation of p97 inhibitors.
Introduction
Targeting nononcogene pathways for cancer treatment has recently gained much attention because of the rise of resistance as a result of oncogene-targeted therapies (Kenny et al., 2017). One of the nononcogene pathways being targeted is the ubiquitin-proteasome system (UPS); the relevance of interfering with the UPS has been proven in clinical settings by the success of proteasome inhibitors for the treatment of multiple myeloma (Scalzulli et al., 2018). However, the development of resistance against (Kale and Moore, 2012) and the lack of activity in solid tumor settings (Milano et al., 2009) of the proteasome inhibitors support the need to develop inhibitors of other regulators of cellular protein homeostasis.
Valosin-containing protein, also known as p97 in mammals and cdc48 in yeast, is a member of the broad AAA family of proteins: ATPases associated with various cellular activities. p97 serves many different cellular functions (Xia et al., 2016), one of which is critical regulation of protein homeostasis pathways such as the endoplasmic reticulum–associated degradation, which is the main quality control mechanism for soluble, membrane-associated, glycosylated as well as nonglycosylated proteins as they are processed through the endoplasmic reticulum (Ye et al., 2001). Mutations in the gene of p97 have been linked to degenerative disorders named multisystem proteinopathy 1, previously known as inclusion body myopathy with Paget’s disease of bone and frontotemporal dementia (IBMPFD) (Taylor, 2015; Tang and Xia, 2016).
Inhibition of p97 has been shown to lead to cell death mediated mainly by the unfolded protein response (Wang et al., 2008, 2009), a pathway that acts both to resolve unfolded protein stress and to trigger cell death when the buildup of such unfolded proteins becomes irresolvable (Ghosh et al., 2014). Demonstrated in vitro as a molecular unfoldase (Beskow et al., 2009; Blythe et al., 2017; Bodnar and Rapoport, 2017), p97 is thought to function in vivo as a powerful extractor of proteins from complex molecular machines and various organelles. Indeed, the molecular makeup of p97 supports this proposed function. p97 is a molecular engine powered by six ATP-hydrolyzing subunits and each subunit consists of an N-terminal domain (N domain) and two tandem ATPase domains (D1 and D2) (Fig. 1A), qualifying it as a type II AAA ATPase. Both the D1 and D2 domains are capable of hydrolyzing ATP, although the D2 domain contributes more to the overall ATPase activity of p97 (Ye et al., 2003; Tang and Xia, 2013).
The essential role of p97 in the UPS led to the hypothesis that by inhibiting the activity of p97, it may be possible to circumvent the resistance encountered subsequent to the use of proteasome inhibitors. One p97 inhibitor, 1-[4-(benzylamino)-5H,7H,8H-pyrano[4,3-d]pyrimidin-2-yl]-2-methyl-1H-indole-4-carboxamide (CB-5083) (Fig. 1B), was developed as a result of an extensive lead-optimization effort, tracing its origin to DBeQ, a quinazolin-based p97 inhibitor (Chou and Deshaies, 2011; Chou et al., 2013). CB-5083 has been shown to be selective, specifically inhibiting the ATPase of the D2 domain of p97. Treatment of tumor cells with CB-5083 leads to accumulation of polyubiquitinated proteins and retention of endoplasmic reticulum–associated degradation substrates. This antitumor activity of CB-5083 is also exhibited in multiple myeloma, B acute lymphoblastic leukemia, and hematologic and solid tumor models by activating the apoptotic arm of the UPS (Anderson et al., 2015; Bastola et al., 2017; Gugliotta et al., 2017; Gareau et al., 2018). In preclinical studies, CB-5083 also showed auspicious results against proteasome inhibitor–resistant multiple myeloma patient samples (Le Moigne et al., 2017).
Based on the promising results from preclinical studies, CB-5083 was advanced to phase I clinical trials to treat patients with lymphoid hematologic malignancies (CLC-102) and advanced solid tumors. However, the trials experienced setbacks and were terminated due to cytotoxicity arising from an unexpected off-target effect (https://www.bizjournals.com/sanfrancisco/news/2018/05/31/can-viagra-help-revive-cleave-biosciences-biotech.html), suggesting that better, more selective p97 inhibitors are needed. As expected, the use of CB-5083 in laboratory settings quickly led to resistance, which has been shown to be largely a consequence of target site mutations (Anderson et al., 2015; Bastola et al., 2017). Disappointingly, not all resistance-conferring mutations confirm various models of CB-5083 binding to p97, which were created based on molecular docking experiments (Anderson et al., 2015; Bastola et al., 2017) and need to be replaced preferably by a more accurate experimental model. In this work, we report such a model, obtained by X-ray crystallography, for the interactions of CB-5083 with p97, which forms the basis for our interpretations of available mutational data. Additionally, this model offers a structural basis for the D2 domain selectivity, and more importantly provides guidance for future endeavors to develop more specific p97 inhibitors.
Materials and Methods
Protein Expression and Purification.
Expression and purification of p97 were carried out as previously described (Tang et al., 2010). Variants of p97 containing different lengths [FLp97 (residues 1–806), ND1p97 (residues 1–481), and D1D2p97 (residues 210–806)] or mutations (FLp97V474S, FLp97I479S, FLp97L526S, and FLp97I656S) were generated using the Quikchange Site-Directed Mutagenesis Kit (Agilent Technologies, Santa Clara, CA).
Determination of ATPase Activity.
The ATPase activity of p97 was determined by measuring the amount of inorganic phosphate released from ATP hydrolysis, which reacts with a complex of molybdate and malachite green (Hess and Derr, 1975; Lanzetta et al., 1979). The activity assay was performed in an assay buffer containing 50 mM Tris-HCl, pH 8.0, 20 mM MgCl2, and 1 mM EDTA. A total of 50 μl reaction mix containing 5 μg of protein and 4 mM ATP in the assay buffer was incubated at 37°C for 10–15 minutes. The reaction was immediately stopped by the addition of 800 μl dye buffer (a fresh mixture of 0.045% malachite green and 1.4% ammonium molybdate tetrahydrate in 4 N HCl in a 1:3 ratio) followed by the addition of 100 μl of 34% sodium citrate solution after 1 minute. After 10-minute incubation at room temperature, 16 μl of 10% Tween-20 was added to dissolve any precipitate. Absorbance was then measured at 660 nm. The amount of inorganic phosphate released was calculated based on a standard curve established by a known amount of KH2PO4 (50–300 μM) in assay buffer.
Crystallization and Structure Determination.
Crystals of D1D2p97 in complex with CB-5083 were grown using the sitting-drop vapor diffusion method at 16°C. A protein solution of 14 mg/ml was mixed with CB-5083 (ActiveBiochem, Kowloon, Hong Kong) (100 mM in dimethylsulfoxide) to reach a final concentration of 500 μM, and incubated overnight at 4°C with mixing. The admixture was then spun at 14,000 rpm for 30 minutes at 4°C. The supernatant was mixed with a well solution containing 0.1 M sodium citrate, pH 4.5, 250 mM Tri-sodium citrate, and 15% polyethylene glycol 3350 in a 1:1 ratio for crystallization. Crystals were cryo-protected first with the well solution followed by the same well solution supplemented with 19% polyethylene glycol 3350 and 10% glycerol, and flash-cooled in liquid nitrogen.
X-ray diffraction experiments were carried out at the SER-CAT and GM/CA beam lines of the Advanced Photon Source at Argonne National Laboratory (Lemont, IL) at 100K. Diffraction images were processed and scaled together with the HKL2000 package (Otwinowski and Minor, 1997). The D1D2p97-CB-5083 structure was determined by molecular replacement using Protein Data Bank: 3CF2 (Davies et al., 2008) as a search model using the program MOLREP (Vagin and Teplyakov, 2010) in the CCP4 program package (Collaborative Computational Project, Number 4, 1994). The structure was refined using Refmac (Murshudov et al., 1997). The structure model was manually built using the program COOT (Emsley and Cowtan, 2004).
Results and Discussion
CB-5083 Inhibits ATPase Activities of the Full-Length Protein and the D1-D2 Fragment of p97 but Not the N-D1 Fragment.
To identify a construct best for crystallizing the p97-CB-5083 complex, we tested three p97 constructs, the full-length protein (FLp97, residues 1–806), an N-D1 fragment (ND1p97, residues 1–481), and a D1-D2 fragment (D1D2p97, 210–806). These constructs were expressed and purified, and their ATPase activities were determined in the presence of CB-5083 (Fig. 1, C–E). In the presence of 2 μM CB-5083, both FLp97 and D1D2p97 retained ∼5% or less of their original ATPase activity, while that for ND1p97 was nearly unaffected. This result confirmed that the inhibitory action of CB-5083 affects the D2 domain and not the D1 domain. The IC50 values derived from the inhibition assay (Fig. 1; Table 1) also indicated that CB-5083 inhibits D1D2p97 better than FLp97, suggesting the D1D2p97 construct would be a better choice for our cocrystallization experiment.
CB-5083 Binds to the ATP-Binding Site at the D2 Domain.
Crystals of D1D2p97 in the presence of CB-5083 were successfully obtained and the structure of D1D2p97 in complex with CB-5083 was determined (Table 2). There are 12 independent D1D2p97 subunits in a crystallographic asymmetric unit, organizing into two hexameric D1D2p97 rings arranged in a head-to-head fashion (Fig. 2A). After the completion of the model building for the two rings, a difference electron density map was calculated using the Fourier coefficients mFo-dFc, which allowed us to look for any ligand bound to the protein. No difference density was observed at any of the 12 D1 nucleotide-binding pockets, demonstrating that the D1 domains are nucleotide or ligand free under the crystallization conditions. This observation is in sharp contrast to previous studies that showed complete nucleotide occupancy of the D1 domain. The bound nucleotides in the D1 ring regulate the N-domain conformation and the D2-domain ATPase activity, and vice versa (Ye et al., 2001; Tang et al., 2010; Niwa et al., 2012). It was also found that a subset of D1 domains of hexameric p97 in a purified protein preparation contains copurified ADP, also called prebound ADP (Davies et al., 2005; Briggs et al., 2008; Tang and Xia, 2013). These prebound ADPs are difficult to remove in wild-type p97 but their binding affinity is reduced in IBMPFD mutants. The lack of bound nucleotides in all of the D1 domains of the current D1-D2 structure suggests that the absence of the N domain, like the IBMPFD mutants, alters the regulation on the nucleotide state of the D1 domain and hence affects the binding of nucleotide to the D2 domain. In this work, our data show lower IC50 values for the binding of CB-5083 to the D1-D2 fragment, compared with that of wild-type p97. This indicates that removing the N domains leads to an empty D1 domain, which in turn leads to a lowered IC50 value for CB-5083 binding. This sequence of events is similar to IBMPFD mutant–induced higher ATPase activity at the D2 domain (Halawani et al., 2009; Manno et al., 2010; Niwa et al., 2012; Tang and Xia, 2013).
On the contrary, large positive densities were observed at the nucleotide-binding pockets of all 12 D2 domains (Fig. 2B). These densities have a shape and size resembling those of CB-5083 and were modeled with one molecule of CB-5083 for each domain. Subsequently, the model for the D1D2p97-CB-5083 complex was refined to a resolution of 3.77 Å, with Rwork and Rfree values of 22.2 and 26.5, respectively (Table 2).
A molecule of CB-5083 intercalates into part of the D2 ATP-binding pocket, occupying the space for the adenine base and ribose moiety of the nucleotide (Fig. 2C). The binding energy is clearly derived from both hydrogen-bonding and hydrophobic interactions (Fig. 2B). The benzylamino moiety of CB-5083 penetrates deep into the pocket, occupying a hydrophobic niche, in direct contact with five hydrophobic residues (I479, L482, L526, L527, and I656). The CB-5083 molecule is further stabilized by a strong hydrogen bond (2.5–3 Å) between the hydroxy group of residue T688 and the oxygen atom of the amide side chain of the carbamoyl-indole moiety (Fig. 2B). The orientation of the carbamoyl-indole moiety appears to be maintained by N660, which also forms a hydrogen bond with T688 (Fig. 2B). The dihydro-pyrano-pyrimidine core of CB-5083 is wedged between L526 and V474 on the one side and A659 on the other side.
Since the bound CB-5083 occupies to a large extent the same binding pocket as that for bound ATP at the D2 domain, we wondered if the observed positive difference density could possibly come from a bound nucleotide, although we did not include nucleotides in any of the experiments. We, therefore, superposed two D2 domains, one with CB-5083 bound and the other with adenylyl-imidodiphosphate (AMP-PNP) bound (Protein Data Bank: 3CF2) (Fig. 2C). The superposition shows that only the carbamoyl-indole moiety of the bound CB-5083 partially overlaps with the adenine and ribose moieties of AMP-PNP. There is no extra density in the CB-5083 structure corresponding to the phosphate groups of AMP-PNP, and the dihydro-pyrano-pyrimidine core and benzylamino moiety of CB-5083 do not correspond to any part of the nucleotide. In addition, the shape and size of the difference density do not fit well with the adenine and ribose moieties of AMP-PNP in standard geometry. Therefore, we are confident that the difference density belongs to the bound CB-5083.
The Experimental Model Is Supported by Structure-Activity Relationship and Mutagenesis Studies.
The structure of the p97-CB-5083 complex explains the following structure-activity relationship observations (Zhou et al., 2015): 1) no methyl group attaching to the nitrogen atom [−N(CH3)CH2Ph] in the benzylamino group can be tolerated; 2) no extra CH2 groups can be inserted into the benzylamino group (−NHCH2CH2Ph or −NHCH2CH2CH2Ph); 3) a hydrophilic group in place of the methyl group in the carbamoyl-indole moiety of CB-5083 is detrimental to binding affinity; and 4) no group can be tolerated at the fifth position of the carbamoyl-indole moiety.
The structure is also consistent with many key resistant mutants described in the literature (Table 1). 1) The T688A mutation led to a 58-fold increase in the IC50 value because of the loss of the critical hydrogen bond. 2) The N660K mutation led to a 29-fold increase in the IC50 value due to a broken hydrogen bond to the −OH group of the nearby residue T688. Also, the larger side chain of the lysine residue possibly destabilizes the orientation of the carbamoyl-indole moiety of CB-5083. 3) The five hydrophobic residues (I479, L482, L526, L527, and I656) form a hydrophobic niche and are in contact with the benzylamino group. We introduced three mutations to this niche, I479S, L526S, and I656S, individually, turning it into a more hydrophilic pocket. Except for the I656S mutation, either the I479S or L526S mutation led to ∼10-fold larger increase in the IC50 value. 4) Similarly, V474 is involved in contact with the dihydro-pyrano-pyrimidine core of CB-5083 and the resistance mutation V474A led to a 10-fold increase in the IC50 value, which was confirmed by a mutation (V474S) we also introduced that led to a similar enlargement in the IC50 value (Table 1).
Some Mutations that Confer Resistance to CB-5083 Are Found in the Entrance Portal to the ATP-Binding Site.
A number of mutations conferring resistance to CB-5083, which are not in direct contact with the bound inhibitor, have been identified, but were difficult to interpret in the absence of an experimental model (Table 1). A common characteristic of these mutations is the smaller increase in the IC50 values in CB-5083 binding. These mutations (E470K/D, P472L, Q473P, and V474A) were found to be localized at the D1D2 linker region (residues 460–481) and shown to alter the ATPase activities (Anderson et al., 2015; Bastola et al., 2017). Interestingly, the mutated residues are all clustered at the entrance leading to the nucleotide-binding pocket (Fig. 3). Thus, mutations of these residues may affect the accessibility of CB-5083 to the binding site.
Structural Difference in D1 Nucleotide-Binding Pocket Precludes Its Binding of CB-5083.
The two ATPase domains D1 (210–460) and D2 (481–760) of p97 share ∼40% and ∼60% sequence identity and similarity, respectively. Both domains are capable of binding and hydrolyzing ATP. Despite these similarities, biochemical and structural data showed CB-5083 binding specifically to the ATP-binding pocket of the D2 domain. To reveal the molecular details that define the binding preference of CB-5083 for the D2 domain, we looked at two possibilities for the binding difference: access of the drug to the sites and difference in molecular shapes of the two sites. The ATP-binding site of the D1 domain is accessible from either the top (Fig. 4A) or the side of the hexamer (data not shown). While the side entrance can be blocked by the movement of the N domain, the one on the top remains unchanged during the D1 ATPase cycle. Thus, it appears unlikely that access of CB-5083 to the D1 site is the cause for the drug’s preference for the D2 domain. The D2 domain, on the other hand, has only one access portal to the ATP-binding site, which is on the side of the hexamer, and the shape can be modulated by the D1D2 linker (Fig. 4B). The ATP site is less accessible from the bottom of the molecule.
To examine the molecular shape of the two sites, we superposed the two ATPase domains and compared their binding pockets (Fig. 4C). From the structure of the complex and the resistant mutations, we learned that several residues are important for interacting with and stabilizing CB-5083 bound at the D2 domain. These residues include T688 and N660 because mutations to these two residues cause 58- and 29-fold increases in the IC50 values, respectively, for CB-5083 binding (Fig. 2C; Table 1). The superposition shows that the residues in the D1 domain corresponding to T688 and N660 of the D2 domain are A412 and H384 (Fig. 4C), which are unable to form the hydrogen bonds essential for CB-5083 binding. Many D2 site residues that contact CB-5083 are not conserved for the D1 site (Table 3), indicating that the differences in size and shape between the D1 and D2 sites are sufficiently large to favor CB-5083 binding to the D2 site.
Furthermore, some residues of the D1D2 linker, such as V474, I479, and G480, are in direct contact with the bound CB-5083 (Fig. 3), being an integral part of the CB-5083-binding pocket. Indeed, mutations in these residues altered the sensitivity of p97 to CB-5083 (Table 1) (Anderson et al., 2015). This feature of the CB-5083-binding pocket of the D2 domain has no equivalence for the ATP-binding pocket of the D1 domain, although it may be approximated by the ND1 linker (residues 180–200), which has shown a large positional deviation from the D1D2 linker in the structural alignment. Moreover, the ND1 linker is conformationally variable depending upon the nucleotide states at the D1 domain, undergoing a helix-to-loop conformational switch (Tang et al., 2010) and thus introducing a dramatic change to the shape and size of the D1 ATP site.
Summary.
A recent update on the clinical trials of CB-5083, reported in the San Francisco Business Times, revealed an off-target effect of the drug interacting with phosphodiesterase-6, a protein involved in phototransduction in the eyes. While the clinical trials of CB-5083 have been halted for the safety of patients, the idea of targeting p97 for cancer therapy remains valid. To circumvent the off-target side effect of CB-5083, a new generation of drugs, preferably based on CB-5083, with enhanced specificity, is being developed by a multi-institutional consortium. Such an effort is best served with an accurate atomic model describing accurately the interactions of p97 with CB-5083. The significance of this work lies not only in providing an experimental model for the interaction between CB-5083 and p97 but also in offering guidance on how to modify the drug for better affinity and specificity. Such structure-based drug improvement has been shown to be most effective when an accurate model is available (Hao et al., 2012; Lounnas et al., 2013).
One suggestion is to modify the benzylamino moiety of CB-5083, which fits into the hydrophobic niche of the D2 nucleotide-binding site. At the end of this hydrophobic niche is the residue C522. A modification could be made to the benzylamino moiety such that it is able to form a covalent bond with C522, which could potentially increase the efficacy and perhaps the specificity of CB-5083. Also, R662 and D478 are located within 4 Å of the dihydro-pyrano-pyrimidine core. Alterations introduced to this core could potentially introduce salt bridges between the drug and p97, further enhancing existing interactions.
Acknowledgments
We thank the staff members of the SER-CAT and GM/CA beamlines at the Advanced Photon Source, Argonne National Laboratory (Lemont, IL), for assistance in data collection. All DNA sequencing services were conducted at the Center for Cancer Research Genomics Core at the National Cancer Institute, National Institutes of Health (Bethesda, MD). We also thank George Leiman for editorial assistance.
Authorship Contributions
Participated in research design: Tang, Xia.
Conducted experiments: Tang, Odzorig, Jin.
Performed data analysis: Tang, Xia.
Wrote or contributed to the writing of the manuscript: Tang, Xia.
Footnotes
- Received September 10, 2018.
- Accepted December 20, 2018.
This research was supported by the Intramural Research Program of the National Institutes of Health, National Cancer Institute, and Center for Cancer Research.
Abbreviations
- AMP-PNP
- adenylyl-imidodiphosphate
- CB-5083
- 1-[4-(benzylamino)-5H,7H,8H-pyrano[4,3-day]pyrimidin-2-yl]-2-methyl-1H-indole-4-carboxamide
- IBMPFD
- inclusion body myopathy with Paget’s disease of bone and frontotemporal dementia
- UPS
- ubiquitin-proteasome system
- U.S. Government work not protected by U.S. copyright