![]() |
|
|
Vol. 60, Issue 1, 1-19, July 2001
Novasite Pharmaceuticals, Inc. (J.A.B.), San Diego, California; Center for Molecular Recognition (L.S., J.A.J.) and Departments of Psychiatry (J.A.J.) and Pharmacology (L.S., J.A.J.), Columbia University College of Physicians and Surgeons, New York, New York
| |
Abstract |
|---|
|
|
|---|
The availability of a high-resolution structure of rhodopsin now allows
us to reconsider research attempts to understand structure-function relationships in other G protein-coupled receptors (GPCRs). A comparison of the rhodopsin structure with the results of previous sequence analysis and molecular modeling that incorporated experimental results demonstrates a high degree of success for these methods in
predicting the helix ends and protein-protein interface of GPCRs.
Moreover, the amino acid residues inferred to form the surface of the
binding-site crevice based on our application of the
substituted-cysteine accessibility method in the dopamine D2 receptor are in remarkable agreement with the rhodopsin
structure, with the notable exception of some residues in the fourth
transmembrane segment. Based on our analysis of the data reviewed, we
propose that the overall structures of rhodopsin and of amine receptors are very similar, although we also identified localized regions where
the structure of these receptors may diverge. We further propose that
several of the highly unusual structural features of rhodopsin are also
present in amine GPCRs, despite the absence of amino acids that might
have thought to have been critical to the adoption of these features.
Thus, different amino acids or alternate microdomains can support
similar deviations from regular
-helical structure, thereby
resulting in similar tertiary structure. Such structural mimicry is a
mechanism by which a common ancestor could diverge sufficiently to
develop the selectivity necessary to interact with diverse signals,
while still maintaining a similar overall fold. Through this process,
the core function of signaling activation through a conformational
change in the transmembrane segments that alters the conformation of
the cytoplasmic surface and subsequent interaction with G proteins is
presumably shared by the entire Class A family of receptors, despite
their selectivity for a diverse group of ligands.
| |
Introduction |
|---|
|
|
|---|
G
protein-coupled receptors (GPCRs) represent a very large superfamily of
receptors that are critical for signaling of a diverse group of ligands
to heterotrimeric G proteins (Gether, 2000
). Ligands for these
receptors include light, odorants, tastes, small molecule
neurotransmitters, peptides, glycoprotein hormones, proteases, and
others. Despite an enormous amount of research on the structure and function of these receptors, until very recently, no
high-resolution structure of any G protein-coupled receptor was
available. Inferences about the structure of these receptors have been
based on critical cryomicroscopy studies of rhodopsin that indicated
the existence of seven transmembrane segments (TMs) and gave an
indication of the relative disposition of these TMs (Unger et al.,
1997
). Molecular models of rhodopsin and other GPCRs have been built
based on this low-resolution structure, on additional experimental data
pertinent to structure, and on inferences from sequence alignments,
analyzed in terms of conservation and physicochemical properties
(Ballesteros and Weinstein, 1995
; Baldwin et al., 1997
; Visiers et al.,
2001
). The crystal structure of bovine rhodopsin was recently
determined to 2.8 Å (Palczewski et al., 2000
). This landmark
accomplishment provides a timely opportunity to review and reconsider
previous attempts to infer, through indirect means, the structure of
GPCRs. By analyzing the consistency of previous data from a number of different GPCRs with the recent structure of rhodopsin, we consider to
what extent the structure of other GPCRs is likely to resemble the
structure of rhodopsin. In this review, we focus such a comparison on
previous studies of amine GPCRs, with special emphasis on our work on
the dopamine D2 receptor.
The sequence identity between bovine rhodopsin and the human dopamine
D2 receptor is ~19% overall and ~25% in the
transmembrane domains. In the case of soluble proteins, for which
vastly more structural data is available, there is a clear structural
similarity between proteins with >~25% sequence identity, but below
this level of sequence identity, structural divergence increases
dramatically (Wilson et al., 2000
; Yang and Honig, 2000a
). Nonetheless,
recent work has suggested that a number of different soluble proteins can achieve the same fold with different sequence patterns
(Russell and Barton, 1994
; Jornvall et al., 1995
; Mirny and
Shakhnovich, 1999
; Yang and Honig, 2000b
). The extent of structural
similarity between rhodopsin and other GPCRs is not known, and we are
currently limited to only one high-resolution structure from this
entire superfamily of receptors. Although the presence of a number of highly conserved residues makes it possible to align the TM residues of
the class A receptors (Fig. 1), these
domains differ at ~75% of the TM positions. In a number of cases
these differences are dramatic, such as the presence or absence of a
Pro in a TM.
|
Nonetheless, based on our analysis of the data to be reviewed here, the
overall structures of the transmembrane domains of rhodopsin and of
amine receptors may be remarkably similar. We further propose that
several of the highly unusual structural features of rhodopsin are also
present in amine GPCRs, despite the absence of amino acids that might
have been thought to be critical for the adoption of these features.
Thus, through a form of structural mimicry, we propose that
different amino acids or alternate microdomains can support similar
local deviations from regular
-helical structure, thereby resulting
in very similar tertiary structures. Thus, by structural mimicry, a
common ancestor could diverge sufficiently to develop the selectivity
necessary to interact with diverse signals but still maintain a similar overall fold and the core function of signaling activation through a
conformational change in the TMs that alters the conformation of the
cytoplasmic surface and subsequent interaction with G proteins.
Transmembrane Helix Ends and Orientation
The helical net diagram of rhodopsin in Fig.
2A shows the residues comprising the
solvent accessible surface area (SASA) calculated from the
high-resolution structure of rhodopsin. In such a calculation, a sphere
the size of a water molecule is rolled along the entire receptor and
the accessible surface is quantified (Lee and Richards, 1971
; Shrake
and Rupley, 1973
). Because the crystal structure did not resolve lipid
or detergent, within the transmembrane helical domains, the surface
with the greatest calculated surface accessibility is the portion of
rhodopsin that is normally exposed to lipid. In Fig. 2A, the most
accessible residues are shown with blue circles, and the most
inaccessible are shown with red circles. Superimposed on these nets
(Fig. 2, A and B) are prior predictions regarding the lipid exposure of
residues in amine receptors (Ballesteros and Weinstein, 1995
; Visiers
et al., 2001
). Residues in the surface predicted to face lipid are
shown in blue letters, and residues predicted to face the protein
interior and/or the binding-site crevice are shown in red letters.
These predictions were based upon the pattern of conservation of
various physicochemical properties of the amino acid side chains within
this family of receptors, complemented by results from experimental
probing in various GPCRs. Except for the extracellular portion of TM1
and some residues in TM4, the predictions are remarkably consistent
with the calculated SASA from the rhodopsin structure, suggesting the
utility and reliability of these methods of analysis and prediction of
structural features, as well as the overall structural similarity among
Class A receptors. Overall, 91% of the residues with low SASA (<10%) were predicted to face the interior of the TM helix bundle, whereas 78% of the residues with high SASA (>15%) were predicted to face lipid. Whereas the divergence of the predictions from the SASA in TM1
may relate to real differences in the structure in this region because
of the nonconserved Pro, the disagreement in TM4 is more challenging,
as will be discussed below.
|
The boundaries of the interface between the membrane and aqueous
environment inferred from electron paramagnetic resonance spectroscopy
studies of spin-labeled Cys substituted for residues at the cytoplasmic
ends of each TM of rhodopsin (Farahbakhsh et al., 1995
; Altenbach et
al., 1996
, 1999a
,b
) are very consistent with the cytoplasmic ends of
the seven TM
-helices in the high-resolution structure of rhodopsin
(Palczewski et al., 2000
). As shown in the dotted circles in Fig. 2A,
however, the spin-labeling studies predicted a continuation of
-helical conformation for one to three turns at the cytoplasmic ends
of TM5, TM6, and TM7 within the aqueous milieu. Remarkably, these
residues are clearly not in an
-helical conformation in the
rhodopsin structure, with
values dramatically out of the permitted
range. Although there are a number of technical differences between
these studies, such as the differing detergent environments and the use
of Cys mutants, another possible explanation for the discrepancy is
that this region is dynamic and that a substantial conformational
reorganization may be critical to receptor activation and G protein
interaction. That the region from Gln2366.19 to
Glu2396.22 was not resolved in the rhodopsin
crystal structure also suggests that this region is highly dynamic even
in the inactive state, and the high average B values in the resolved
third intracellular loop are also consistent with such an interpretation.
Predictions of the helix ends are shown in Fig. 2B (Ballesteros and
Weinstein, 1995
; Visiers et al., 2001
). Interestingly, at the
cytoplasmic ends these predictions are more consistent with the results
of the spin-labeling studies than with the actual rhodopsin structure
(Fig. 2). At the extracellular side, both the predictions and
substituted-cysteine accessibility method (SCAM) studies (see below)
proposed helical ends that were either in agreement with or shorter
than the rhodopsin structure. A possible explanation for these
discrepancies is illustrated in Fig. 3A, in which rhodopsin is illustrated embedded in its natural environment, phospholipid molecules. As demonstrated by White and Wimley (1999)
, the
membrane is not a homogenous hydrophobic layer; instead, it consists of
a relatively constant, low dielectric region, approximately 30 Å,
sandwiched between two complex regions of variable dielectric behavior,
both of which are approximately 15 Å. The 30-Å core region
corresponds to the hydrophobic lipid chains, whereas the surrounding
15-Å regions correspond to the mixed hydrophobic/hydrophilic phase
formed by the phospholipid headgroup domains.
|
That approximately 40% of the transmembrane domain interacts with the
phospholipid headgroups (Fig. 3A) is surprising, given that the
literature on GPCRs, and most prediction methods, tends to equate the
membrane with the hydrophobic phase. Indeed, the most common methods
used to predict membrane-spanning helices have been the hydrophobicity
plot, which identifies segments with an overall high hydrophobicity,
and hydrophobic moment, which estimates the amphiphilic character of a
helical segment (
-helix with two faces, one more polar and another
more hydrophobic). Because the helical turn facing the lipid headgroups
is composed of polar residues on both sides of the helix, both methods
will fail to predict these helical turns (Ballesteros and Weinstein, 1995
; Visiers et al., 2001
). On the extracellular side, residues facing
the interior of the helical bundle at the level of the lipid headgroup
region are likely to participate in ligand recognition and thus are
divergent through evolution, so methods based on sequence conservation
may also fail to predict the extracellular end of helical segments.
Thus, we should expect the predicted helical ends at the extracellular
side to be minimal ends.
At the cytoplasmic side of the TMs, prediction methods based on the
hydrophobic plot or moment will also fail to recognize the helical
turns within the phospholipid headgroups. Arg and Lys concentrate at
the cytoplasmic boundary of membrane proteins, on one face of the
helix, where they interact with the lipid at the level of the
phospholipid headgroups (Ballesteros and Weinstein, 1992
, 1995
; Visiers
et al., 2001
). This is illustrated in Fig. 3A, in which the positively
charged side chains of the Arg and Lys residues in rhodopsin are
positioned to participate in favorable ionic interactions with the
negatively charged phospholipid headgroups. In amine GPCRs, the
cytoplasmic boundaries of the TMs contain approximately three times as
many basic residues (Arg and Lys) as acidic residues (Asp and Glu),
thereby resulting in a significant positive charge at the cytoplasmic
boundary of these receptors. This observation correlates with the known
preponderance of negatively charged phospholipid headgroups in the
cytoplasmic leaflet of eukaryotic plasma membranes. Thus, despite the
absence of conservation of the exact position of these Arg/Lys in
GPCRs, they are consistently present throughout evolution in the same
TM helices, at the cytoplasmic boundaries where they face the lipid
headgroups. What is conserved is their general location and presumed
structural/functional role.
The positions of these Arg/Lys can be used to predict the cytoplasmic
ends of the TM helices (Ballesteros and Weinstein, 1995
; Visiers et
al., 2001
). Notably, these predictions of the cytoplasmic helical ends
and a recent revision thereof (Jensen et al., 2000
) agree extremely
well with the spin-labeling studies in rhodopsin, even though the
inferred ends for TM5, TM6, and TM7 are longer than the current
rhodopsin structure. As discussed above, this might be explained by
protein dynamics. It is also possible, however, that in the crystal
structure (because the environment of these Arg/Lys-rich cytoplasmic
segments lack negatively charged phospholipid headgroups and thus may
not mimic the native environment), the result might be dynamic,
unstable, or unfolded helical ends at the cytoplasmic boundaries in the structure.
In addition to its structural significance, the portions of the
receptor embedded in the phospholipid headgroup regions perform critical functional roles. On the extracellular side, ligand passes through and binds to the portion of the receptor that resides in the
headgroup region, as demonstrated for the TRH receptor (Colson et al.,
1998
). Furthermore, there are an increasing number of GPCR ligands that
have been shown to partition from the aqueous medium into the
phospholipid headgroup region and then to diffuse across the membrane
to bind to the receptor. Examples of such a mode of binding include
lipid-like ligands such as cannabinoids (Yang et al., 1992
), less
hydrophobic compounds, such as opioid receptor ligands (Schwyzer,
1995
), and the PACAP peptide ligand (Inooka et al., 2001
), and it is
possibly a much more widespread mechanism of binding than anticipated.
At the cytoplasmic boundary, the receptor domains embedded in the phospholipid headgroup region contain the key residues inferred to interact with G-proteins. Lipid modification of G-proteins positions them within the phospholipid headgroup region as well. Thus the mixed hydrophilic/hydrophobic phospholipid headgroup regions provide an environment that may support the key interactions of GPCRs with ligands from the extracellular side and with G-proteins and other interacting proteins, such as receptor kinases, from the cytoplasmic side. From this perspective, the hydrophobic core of the membrane formed by the lipid side chains would be responsible for transducing the signal across the membrane, from ligand binding to conformational changes at the cytoplasmic boundaries that facilitate G protein coupling.
The Substituted-Cysteine Accessibility Method
We have used the substituted-cysteine accessibility method (SCAM)
(Akabas et al., 1992
; Karlin and Akabas, 1998
; Javitch et al., 2001
) to
map the surface of the binding-site crevice in the dopamine
D2 receptor (Javitch et al., 1994
, 1995a
,b
, 1998
,
1999
, 2000
; Fu et al., 1996
; L. S., M. M. Simpson, J. A. B., and
J. A. J., in preparation), a member of the class A receptor family. In
this method, each of the residues in the seven TMs is mutated to Cys,
one at a time, and the mutant receptors are expressed in heterologous
cells. After confirming that the receptor expresses and binds ligand in
a near-normal manner, we assess the water-accessibility of the
engineered Cys by determining the effects of treatment with a series of
charged sulfhydryl reagents. These reagents, derivatives of
methanethiosulfonate (MTS), are more than a billion times more reactive
with the thiolate anion than with the thiol (Roberts et al., 1986
) and
only water-accessible Cys are expected to be significantly ionized.
Because of this and because the reagents are charged, we expect that
water-accessible Cys residues will be vastly more reactive with the MTS
reagents than will Cys facing lipid or facing the tightly packed
protein interior. We infer that a wild-type residue is on the
water-accessible surface if reaction of a MTS reagent with the
corresponding engineered Cys irreversibly alters radioligand binding.
We further infer that the residue faces the binding-site crevice if
ligand is able to retard the reaction of the MTS reagents with the
engineered Cys.
The results of our SCAM studies of the dopamine
D2 receptor are shown in Figs. 2B and 3B and in
further detail below in our discussion of each TM. In general, there is
remarkably good agreement between the residues found to be accessible
in the D2 receptor and the residues that face the
protein-protein interface in the rhodopsin structure (Fig. 2B). Indeed,
of the residues that have been inferred to face the binding-site
crevice, 79% have low SASA (<10%). The patches of accessible
residues on helix faces are generally a bit narrower than those
predicted to face the protein-protein interface. This finding is
consistent with the notion that at the extreme margins of the
protein-protein interface the helices may be more tightly packed than
at the surface of the binding-site crevice. At the extracellular end of
the TMs, the SCAM results in general give a broader pattern of
accessibility than predicted, consistent with access from the
extracellular milieu to an entire turn of helix, with the greater
water-accessibility of residues in the phospholipid headgroup region
(see above) and/or with dynamic changes of the structure in these
regions broadening the experimentally determined accessibility compared
with either the predictions or the static crystal structure of the
inactive state of rhodopsin. We address below some of the detailed
results of SCAM of the D2 receptor, but the
overall results are presented in Fig. 3B, which shows an extracellular
view of the backbone of rhodopsin. The C
and backbone ribbon of the
residues for which the aligned positions in the
D2 receptor are accessible are shown in yellow
(accessible and protected) and in orange (accessible but not protected,
or protection could not be tested reliably because of small effect sizes).
Tertiary Structure Interactions
Various experimental methods have been used to explore helix-helix
interactions in rhodopsin and in rhodopsin-like receptors. These
include the creation of metal binding sites (Elling et al., 1997
),
disulfide cross-linking between adjacent helices (Farrens et al.,
1996
), and the discovery of double revertant mutations in which a
second-site mutation restores function lost by a first mutation (Zhou
et al., 1994
; Sealfon et al., 1995
). The findings from these studies
are summarized in Tables 1 and
2,
which show the actual distances
between the aligned residues in the three-dimensional structure of
rhodopsin. In Fig. 4 these interactions
are shown in an extracellular view of ideal
-helical wheels
positioned according to the rhodopsin projection structure. The vast
majority of these interactions are consistent with the high-resolution structure of rhodopsin. Certain interactions that appear to be inconsistent with the rhodopsin structure, such as 2.50 to 7.49, are in
fact compatible, because of the presence of bends and twists in the
helices that cannot be conveyed in ideal
-helical wheels. The
compatibility of tertiary structure distances among the aligned positions of different GPCRs suggests the presence of significant similarity in the overall helical packing, even in regions in which
rhodopsin contains local distortions in helical structure (see below).
Other experimentally determined interactions are not consistent with
the rhodopsin structure, and these require that the structure be
dynamic. Examples of these include the stretch of residues 6.30 to 6.34 at the cytoplasmic end of TM6 that were all cross-linked to 3.54 at the
cytoplasmic end of TM3. In this case, both the cross-linking data and
the high-resolution structure come from bovine rhodopsin, so the
differences are probably explained by protein dynamics or
conformational changes in the receptor. In other cases, such as 3.32 to
7.39 in the
2 adrenergic receptor and 3.55 to
7.63 in rhodopsin, the contacts were inferred to be present in the
active state; not surprisingly, these are incompatible with the
inactive rhodopsin structure and require a significant conformational
change.
|
|
|
To address in more detail the similarities and differences between
rhodopsin and other class A receptors, we need to focus on those
domains that show at least some significant sequence homology and could
support similarities at the structural level. The extracellular and
cytoplasmic domains of GPCRs are widely divergent, thus precluding any
direct comparison between rhodopsin and other GPCRs. We have therefore
focused our analysis on the transmembrane domain to explore the extent
to which the recently available structure of rhodopsin is a feasible
structural template for other class A receptors, with special emphasis
on amine GPCRs. We will review below each TM and examine the conserved
and nonconserved motifs and how these might be expected to affect the
resulting structure. The accessibility pattern determined in our SCAM
studies on the D2 receptor will be used to guide
this analysis. In such an analysis, special attention must be paid to
potential deviations from ideal
-helicity that result from residues
known to produce kinks and bends. Because their side chains are
covalently bonded to the backbone, prolines produce kinks in
-helices caused by steric clash of the pyrrolidine ring with the
backbone and to loss of the hydrogen bond (H-bond) between the amide
nitrogen and the carbonyl at the (i-4) position (Barlow and Thornton,
1988
; Sankararamakrishnan and Vishveshwara, 1992
; Sansom and Weinstein, 2000
). Glycines are also commonly found at kinks in
-helices, presumably because of the flexibility they induce (Kumar and Bansal, 1998
). In addition, serine, threonine, and cysteine can significantly bend and twist
-helices because of their ability to hydrogen bond
back to the backbone carbonyl of the preceding turn (Gray and Matthews,
1984
; Ballesteros et al., 2000
).
TM1
Most class A receptors do not have a Pro at position 1.48, whereas
Pro1.48 is present in most vertebrate opsins. The proline kink in TM1
of bovine rhodopsin bends TM1 inwards, toward the helix bundle. The
absence of such a kink in other GPCRs would orient an ideal
-helix
more outward, away from the bundle and toward lipid and toward TM7
(Fig. 5A). The extracellular portion of
TM1 in amine receptors is less conserved than the aligned region in the
opsin family. This is consistent with the relative lack of key
contacts, as well as with the generally minimal effect of mutations in
this region and with the SCAM analysis of the dopamine
D2 receptor in which the extracellular part of
TM1 does not seem to contribute to the surface of the binding-site
crevice (L. S., M. M. Simpson, J. A. B., and J. A. J., in
preparation). Thus, it is possible that TM1 has a more outward
orientation in the amine receptors, which lack Pro1.48 (Fig. 5A). There
may be changes in the relative disposition of TM2 and TM7 to adjust for the potentially altered packing of TM1, but it is also possible that
bulkier side chains may serve to fill the potential space. The large
number of bulky side chains of the D2 receptor in
the TM2-TM3-TM7 aromatic microdomain may be an example of such an adaptation (Javitch et al., 1999
; Simpson et al., 1999
).
|
Recent work, however, showed that removal of a conserved Pro from a
kinked
-helix in heat shock transcription factor did not prevent the
formation of the kink (Hardy et al., 2000
). Most opsins have a Pro at
1.48, but those that do not have a preponderance of Thr and Gly at the
1.49 position. The presence of these residues is also associated with
bent helices (Kumar and Bansal, 1998
; Ballesteros et al., 2000
) and
either might substitute for Pro to maintain a kink in TM1. Likewise,
Gly1.49 is very highly conserved in amine receptors, which lack a Pro
at 1.48, and the increased flexibility induced by this Gly might also
support a kink in TM1, even without the Pro.
TM2
Bovine rhodopsin has three Thr (2.59-2.61) in TM2, but other
rhodopsin sequences contain Pro at 2.58, 2.59, and/or 2.60, either individually or in combination. Although TM2 of bovine rhodopsin has no
Pro, it does contain a significant kink within the 2.56-2.60 helix
turn defined by Gly-Gly-X-Thr-Thr. In this motif, the Thr's side
chains hydrogen bond to the carbonyls at the (i-4) positions, stabilizing a bent conformation (Fig. 5B). The Gly residues probably add flexibility, further facilitating this bent structure. The D2 receptor and many other GPCRs contain Pro2.59,
so we expect a bend at the 2.55 to 2.59 helix turn formed by the
Pro-kink, at a similar position as the bend in rhodopsin. Pro-kinks are flexible and can adopt different extents of bending. We have
illustrated this flexibility and the conformational space that can be
explored by the Pro-kinked helix by performing a molecular dynamics
simulation of an
-helix with a Pro (Fig. 5C). Different GPCRs have a
single Pro at 2.58, 2.59, or 2.60, and each one of these positions
would be expected to orient the bend differently (Fig. 5C). If we
choose from the simulations the Pro-kink conformation with the lowest RMSD compared with rhodopsin (RMSD = 2.0 Å), then the resulting D2 receptor Pro-kinked structure has a backbone
orientation similar to that of rhodopsin and is consistent with the
SCAM data (Fig. 5D). Thus, the extent of bend could be quite similar in
related GPCRs, even to the point that it is achieved in TM2 with a
Gly-Gly-X-Thr-Thr motif as well as with Pro, an example of the sort of
structural mimicry we propose. Nonetheless, the fact that the Pro can
be in different positions suggests that this may be a source of
considerable structural diversity in other class A
receptors. An example involving a Thr-X-Pro motif in TM2 of the CCR5
receptor has been proposed recently (Govaerts et al., 2001
).
TM3
Compared with TM2, TM3 of bovine rhodopsin contains more subtle
bends that result from the presence of Ser, Thr, and Cys. As described
above, these residues can hydrogen bond to the backbone and thereby
bend and twist helices (Gray and Matthews, 1984
; Ballesteros et al.,
2000
). The resulting bends are likely to be somewhat different in
related GPCRs because of different positioning of Cys/Ser/Thr. Interestingly, when TM3 of the D2 receptor is
modeled as an ideal
-helix and superimposed on the conserved
cytoplasmic region of TM3 in rhodopsin, the ideal helix clashes
sterically with TM4 (white ribbon in Fig. 5E). Although the bends in
the D2 receptor may not be extreme enough to be
detected by SCAM as a gross alteration in the pattern of accessibility
in TM3, such bending may nonetheless alter the position of TM3 relative
to TM4 (see below). In Fig. 5E, we have illustrated the wide range of
bending and twisting that can result from the presence of Cys/Ser/Thr
residues in the D2 receptor. It has been shown
recently that different rotamer conformations of these side chains
differentially alter the
and
dihedral angles of the helix
backbone, resulting in different local bends (Gray and Matthews, 1984
;
Ballesteros et al., 2000
). For every combination of rotamer
conformations of Cys/Ser/Thr residues in TM3 of the
D2 receptor, we modeled the corresponding average
deviation in
and
angles starting from the ideal TM3 helix
backbone. The result is a set of alternative possible TM3 conformations
for the D2 receptor, shown in Fig. 5E
superimposed on TM3 of rhodopsin, which is also bent because of the
presence of these residues at different positions in rhodopsin.
Different TM3 conformations might represent different functional states of the same receptor, such as liganded versus unliganded or active versus inactive. Nonetheless, a conformation can be chosen for the
D2 receptor that closely mimics rhodopsin
(RMSD = 2.0 Å) (Fig. 5F), suggesting that particular sets of
interactions of otherwise divergent positioning of Cys/Ser/Thr residues
in TM3 of different receptors may lead to similar folds of the TM3 segment.
We quantified in each TM of amine receptors the number of positions at which either a Ser, a Thr, or a Cys is present in more than 50% of all amine GPCRs. TM3 contains seven of these conserved Cys/Ser/Thr positions, whereas TM5 contains 4, and the other TMs contain one to three. Thus, given the large number of Cys/Ser/Thr, the high degree of conformational heterogeneity and/or flexibility proposed above for TM3 of the D2 receptor may be shared by TM3 of other amine GPCRs. The fact that five of these positions are incompletely conserved as Cys/Ser/Thr provides for potential divergence in the conformation of TM3 in different amine GPCRs. Therefore, the set of different bends of TM3 in the D2 receptor illustrates not only the potential conformational variability within a single GPCR, but also the extent of conformational variability possible among different GPCRs in TM3 given the different patterns, yet prevalent presence of Cys/Ser/Thr in this TM.
TM4
TM4 is the shortest membrane-spanning helix in rhodopsin, and
sequence alignments indicate that it probably is also short in most
other GPCRs (Ballesteros and Weinstein, 1995
). The TM4
-helix in
rhodopsin is disrupted by a Pro-Pro motif at position 4.59, followed by
a short helical turn. The presence of Pro-Pro, or Pro-X-Pro motifs
around 4.59 is common to many class A GPCRs, whereas some GPCRs, such
as D2 or
2 adrenergic
receptors, have only a single Pro at 4.59 (Javitch et al., 2000
). The
lack of a significant conservation pattern in GPCRs beyond this motif suggests that the structure beyond these Pro motifs, in particular the
short helical turn, might not be conserved in other GPCRs. Our
accessibility studies in the D2 receptor,
however, were consistent with the presence of a short helical turn for
the equivalent residues in rhodopsin (Javitch et al., 2000
), suggesting
that such a motif might be conserved despite the absence of any
recognizable sequence conservation. Given the inherent flexibility of
all three motifs (Pro-kink, Pro-Pro, or Pro-X-Pro), the overall
orientation of a short helical turn ending the TM4 helix may either be
different in other GPCRs or, alternatively, may share a similar fold.
Trp4.50 is highly conserved, and thus we had predicted that it would
face the binding-site crevice. Moreover, we found that Cys substituted
for this Trp, as well as for Phe4.54 in the D2 receptor, were accessible to MTS reagents and protected by an antagonist (Javitch et al., 2000
). Surprisingly, in the rhodopsin structure, a significant part of the aligned Trp and of
Leu1654.54 is exposed to lipid at the margin of
TM3 and TM2. As can be seen in Fig. 6A,
the superimposition of the SCAM results on TM4 of rhodopsin seems to
place the accessible face rotated approximately 90° too far
counter-clockwise (seen extracellularly) given the relative position of
TM3. The explanation for this apparent anomaly is not clear and may
involve alterations in the packing and relative disposition of the TMs.
Nonetheless, it would be surprising for the structures to be
dramatically different in this region given the extremely high
conservation of Trp4.50. The lipid exposure of the conserved Trp4.50
may be related to the known requirement of rhodopsin activation for
polyunsaturated lipids (Gibson and Brown, 1993
), which would interact
favorably with Trp4.50. Alternatively, 4.50 and 4.54 may face outward
to participate in yet unknown protein-protein interactions within the
membrane, such as receptor dimerization, which could conceivably
explain the observed accessibility to charged MTS reagents.
|
TM5
TM5 in Class A receptors is predicted to be a Pro-kinked
-helix, because of the highly conserved Pro residue at position 5.50. However, relative to standard Pro-kink conformations, the rhodopsin structure shows a very unusual kink, characterized by a
bulge, or local unwinding, around residue
His2115.46. This His residue probably
participates in an ionic interaction or a strong H-bond with the
negatively charged Glu1223.37
in TM3 (Beck et al., 1998
), and such an interaction (Fig. 6B) in the
transmembrane domain may be responsible for perturbing the Pro-kink
backbone conformation in the unusual manner observed in the crystal structure.
The pair
His2115.46-Glu1223.37 is
not conserved in other GPCRs. The question is thus whether other GPCRs
also contain such an unusual bulge in TM5 or form a more standard
Pro-kink conformation. To address this question, we generated a
standard Pro-kinked
-helix, and we also created a set of 100 representative structures from molecular dynamics simulations of the
Pro-kinked TM5 of the D2 receptor; these were
superimposed on the relatively conserved cytoplasmic segment
(5.51-5.58) of rhodopsin. The representative Pro-kinked structure with
the best fit to the extracellular portion of TM5 of rhodopsin had an
RMSD of 2.6 Å. Nonetheless, when superimposed in this manner, neither
the ideal Pro-kink, nor any of the 100 Pro conformations, positioned
5.42 facing the binding-site crevice (Fig. 6C). This is inconsistent
with mutagenesis studies of residue 5.42 in various amine GPCRs, which
suggested the presence of a direct ligand-receptor interaction at this
locus, such as the proposed interaction with the meta-OH of
catecholamines (Liapakis et al., 2000
) and with SCAM data on the
D2 receptor showing accessibility and protection
of Cys substituted at position 5.42. Therefore, although we might have
expected a more standard Pro-kink conformation for TM5 in the
D2 and other catecholamine receptors compared
with rhodopsin, these structures cannot satisfy established
ligand-receptor contacts. In contrast, the face twist accomplished by
the bulge in rhodopsin TM5 reorients 5.42 and 5.43 toward the
binding-site crevice (Fig. 6C). This suggests that the structure in
these other GPCRs may be similar to that in rhodopsin, despite the
absence of the pair His5.46-Glu3.37.
What molecular mechanisms might be responsible for reorienting the
extracellular portion of TM5 (5.37-5.46) in the absence of the
His5.46-Glu3.37 bond present in rhodopsin? This portion of TM5 in amine
GPCRs such as the D2 receptor contains conserved Cys/Ser/Thr residues at positions 5.42, 5.43, and 5.46, all on the same
face of TM5 proposed to interact directly with catecholamine hydroxyls.
These Cys/Ser/Thr residues, as mentioned before, can hydrogen bond to
the backbone carbonyls of the preceding turn, thereby bending and
twisting the helix (Gray and Matthews, 1984
; Ballesteros et al., 2000
).
We found in molecular dynamics simulations that the local helix
distortions created by Ser residues at 5.42, 5.43, and 5.46, in
conjunction with the Pro-kink created by Pro5.50, are able to increase
the range of possible structures and accomplish the face shift
necessary to satisfy the SCAM and mutagenesis data by producing a bulge
rather similar to that seen in rhodopsin (Fig. 6D; the "best" RMSD
is only 1.2 Å). This demonstrates that the specific and unusual
features of the rhodopsin structure, such as the bulge in TM5, might be
accomplished through structural mimicry in which different molecular
mechanisms or interactions can produce similar structures in different receptors.
The high degree of conformational heterogeneity and/or flexibility
observed in the simulations for the top of TM5 of the
D2 receptor, enabled by the Pro-kink and the
multiple interactions of the Ser at 5.42, 5.43, and 5.46 with the
backbone, might also help to explain the broad pattern of accessibility
observed by SCAM in this region of TM5 (Javitch et al., 1995a
).
However, the equivalent SCAM studies on TM5 performed on the
2 receptor found a narrower face of TM5
accessible to the binding-site crevice (G. Liapakis, L. S., D. Fu, and
J. A. J., in preparation). This suggests that the extracellular
region of TM5 preceding Pro5.50 may behave differently in different
receptors, for instance being more flexible in the
D2 receptor than in the
2 receptor. The significant range of
conformational heterogeneity and/or flexibility observed at the top of
TM5 within the D2 receptor may actually be
related to different functional states of the receptor, such as
inactive versus active conformations. Furthermore, because different
amine GPCRs show different patterns of Cys/Ser/Thr in this region, and
many other GPCRs lack these Cys/Ser/Thr altogether, we might expect
some degree of structural divergence for the extracellular portion of
TM5 in different GPCRs.
TM6
TM6 in rhodopsin is a more typical
-helix with a pronounced
kink around the highly conserved Pro6.50. For other GPCRs, this conserved Pro residue is also expected to result in a Pro-kink, which
could adapt a number of conformations as illustrated in Fig. 5C for
TM2. Spin-labeling studies in rhodopsin were consistent with a
displacement of the cytoplasmic side of TM6 away from TM3 upon
rhodopsin activation (Farrens et al., 1996
), and Cys crosslinking of
residues in these regions blocked receptor activation (Sheikh et al.,
1996
). Although this movement has been interpreted as a rigid body
movement of the entire TM6 helix upon activation (Farrens et al.,
1996
), we have proposed that the extent of bending of the Pro-kink is
associated with receptor activation (Jensen et al., 2000
; Sansom and
Weinstein, 2000
; Ballesteros et al., 2001
; Visiers et al., 2001
). The
role of the conserved Pro-kink in TM6 as a flexible hinge underpinning
this conformational change has been recently supported by mutagenesis
of residues within the Pro-kink of the
2
adrenergic receptor combined with simulations, which suggest that a
straightening of TM6 about the kink and a resulting movement of the
cytoplasmic portion of TM6 away from TM3 is associated with receptor
activation (G. Liapakis, L. S., R. Xu, F. Guarnieri, J. A. B., and J. A. J., in preparation). There is, therefore, an apparent
conservation of the mechanism of receptor activation across different
receptors, involving a movement of the cytoplasmic side of TM6 away
from TM3, facilitated by the conserved Pro-kink at position 6.50 acting
as a flexible hinge. Therefore, we would expect that other GPCRs also
share a highly kinked conformation of the Pro-kink in TM6 in the
inactive conformation, which becomes less kinked upon receptor activation.
There is very good agreement between the empirically determined
water-accessible face of TM6 in the D2 receptor,
and the orientation of the aligned residues in rhodopsin (Fig. 6E),
consistent with our interpretation of the presence of a Pro-kink in the
D2 receptor (Javitch et al., 1998
). Also
corroborated in the rhodopsin structure is the orientation of the
cluster of aromatic residues in the D2 receptor
identified by SCAM and mutagenesis as facing the binding-site crevice
(Javitch et al., 1998
), where these residues form a critical interaction with the aromatic ring of catecholamine ligands. The interaction of the aromatic residues in this cluster with ligands in
the binding-site crevice was originally proposed in the
2 adrenergic (Dixon et al., 1988
) and 5HT2a
receptors (Choudhary et al., 1995
; Roth et al., 1997
) and is likely to
be a feature shared by other GPCRs as well.
TM7
TM7 of rhodopsin and of other class A receptors contains the
highly conserved Asn-Pro motif at 7.49 and 7.50, which is expected to
kink the TM7 helix. Curiously, TM7 of rhodopsin has a very unusual
kinked conformation around Pro7.50, with a turn of tightly wrapped
310 helix at position 7.43-7.46 (Fig. 6F). An
unusual conformation around the Asn-Pro motif in TM7 had been suggested based on photolabeling studies of rhodopsin (Findlay et al., 1984
). This unusual 310 helix conformation may be
related to or stabilized by the covalent attachment of retinal to
Lys2967.43 in rhodopsin (Fig. 6F). The question
thus arises whether the structure of TM7 in other GPCRs may contain a
similar motif in TM7, in the absence of the Lys7.43-retinal attachment
interaction. The first inference that the Asn-Pro motif in TM7 of other
GPCRs may be specially distorted arose from double-revertant mutant studies performed in the 5HT2a and GnRH receptors between Asp2.50 in
TM2 and Asn7.49 in TM7 (Zhou et al., 1994
; Sealfon et al., 1995
), which
suggested a direct interaction between these residues that was
inconsistent with a standard Pro-kink in TM7. The pattern of
accessibility of TM7 determined by SCAM was also more consistent with a
highly twisted
-helix around the Asn-Pro motif but not with a
standard
-helix (Fu et al., 1996
). In particular, residue Ser7.46 in
TM7 of the D2 receptor was not accessible, yet
surrounded by accessible residues in a standard
-helical
conformation (Fig. 2B).
Gouldson et al. (1997)
predicted the presence of
310 helix in TM7 to explain the
D2 receptor SCAM and other mutagenesis data, but
they placed the 310 helix one turn below its
position in the rhodopsin structure. The SCAM data in the
D2 receptor is remarkably consistent with the
rhodopsin TM7 conformation (Fig. 6G), whereas it is less compatible
with a standard Pro-kink or any of the simulated Pro-kinks (data not
shown). Indeed, the apparent lack of accessibility of Ser7.46, which
was difficult to explain even with the extreme bend and twist we
previously hypothesized to explain the TM7 SCAM data (Fu et al., 1996
)
is explained in the rhodopsin structure in which the 7.46 side chain is
completely surrounded by side chains from TM1 and TM2 (Fig. 6G).
Furthermore, residues Asp2.50 from TM2 and Asn7.49 from TM7 are nearby
in the rhodopsin structure, potentially interacting through a water
molecule, consistent with the predictions based on the double revertant
mutant studies. We suspect therefore that the structure of TM7 in the
other class A receptors that lack the
Lys2967.43-retinal is similar to that of
rhodopsin, although different molecular interactions may be responsible
for stabilizing a similar motif in different GPCRs, another example of
structural mimicry.
The cytoplasmic extension of TM7 has been proposed to form an
-helix
based on NMR studies of a peptide of this region (Albert and Yeagle,
2000
), and secondary structure predictions also suggested that this
region was
-helical (Ballesteros and Weinstein, 1995
; Visiers et
al., 2001
). Indeed, this region is an
-helix in rhodopsin, and it
has been termed H8. The linker between TM7 and H8 is highly divergent
among GPCRs, with several insertions, deletions, or nonconserved Pro
residues present, suggesting a lack of conserved secondary structure in
this region. Nonetheless, the linker in rhodopsin was inferred to be
-helical based on spin-labeling experiments (Altenbach et al.,
1999a
; Fig. 2A), suggesting that the structure of this region may be
dynamic. Although the structure of the linker is unclear, it is likely
that the H8 region also forms an
-helix in other receptors. Several
residues that face into the rhodopsin core are highly conserved,
including Phe3137.60, which is in close contact
with the cytoplasmic portion of TM1 (Thr581.53,
Val611.56, and Thr621.57).
Residues 1.53 and 1.57 in TM1 are highly conserved as
-branched residues among amine GPCRs, and in the D2
receptor, the residues at positions 1.53, 1.56, and 1.57 were found to
be accessible by SCAM (L. S., M. M. Simpson, J. A. B., and J. A. J.,
in preparation). The high conservation observed for the residues
forming the H8-TM1 interface suggests that a similar packing
arrangement may be present for H8 in other GPCRs. In addition, the
residues facing solvent are conserved in their
-helical pattern of
Arg/Lys. These basic residues have been proposed to face the
phospholipid headgroups (Fig. 3A), and are present on only one side of
H8 (every three to four residues), suggesting that the helical
conformation of H8 is maintained in other GPCRs. (An exception to this
is the GnRH receptor, which seems to lack H8.) Although H8 has been
proposed to play a role in activation, the details of such a role
remain uncertain. Crosslinking data in rhodopsin, however, suggest that the conformational movement of H8 with activation is quite large (Yang
et al., 1996
; Yu et al., 1999
; Table 1).
In summary, although the figures shown for the TMs of the D2 receptor are illustrations showing possible conformations, based on the SCAM and mutagenesis data, it appears possible that the helical packing and extent of bends and twists is quite similar in rhodopsin and in the D2 receptor and other amine GPCRs. We hypothesize that through structural mimicry these related GPCRs have evolved different microdomains (such as Thr-Thr versus Pro) to maintain their similar overall structure, while accommodating other changes necessary to make them selective for their ligands, such as the flexibility inherent in multiple Pro-kinks and Cys/Ser/Thr motifs. At the same time, divergence in some of these features, such as the position of the Pro in TM2 or of the Cys/Ser/Thr in TM3, is a potential mechanism for structural diversity in other class A receptors, and flexibility about the Pro-kinks or other induced bends in a given receptor is also a potential mechanism for conformational changes related to function.
The Binding Site
The SASA analysis of the high-resolution structure of rhodopsin
described above was used to quantify the lipid-accessible surface of
the protein (Fig. 2). We have also performed such an analysis in the
presence and absence of retinal. The difference between these two
calculated surfaces represents the surface of the binding-site crevice
that is protected from water by retinal. The residues identified by
this analysis in the TMs (see Table 3)
are highlighted in Fig. 7A, in which the
retinal is shown within the binding-site crevice formed by the
retinal-"protected" residues. The positions of these residues are
remarkably consistent with those of previously identified
ligand-receptor contact sites for amine GPCRs (Table 3), as well as
with our SCAM studies of the dopamine D2
receptor. Figure 7B illustrates this agreement with the backbone of the
rhodopsin structure and the side chains of the
2 adrenergic receptor that have been shown to
interact directly with catecholamine ligands. These residues include
Asp1133.32, which interacts with the protonated
amine of biogenic amines; Ser2035.42,
Ser2045.43, and Ser2075.46,
which interact with the meta-OH and para-OH of
catecholamines; Asn2936.55, which interacts with
the
-OH of epinephrine; and Phe2085.47 and
Trp2866.48, Phe2896.51, and
Phe2906.52; which form a cluster of aromatic
residues in TM5 and TM6 that interact with the aromatic ring of
ligands. Even without modification of the rhodopsin backbone, it is
clear that epinephrine fits remarkably well within the binding site
formed by these critical residues.
|
|
In Fig. 7C, we illustrate
p-(Bromoacetamido)benzyl-1-[125I]iodocarazolol
(IpBABC), an affinity label derivative of pindodol, docked within the
TMs of the rhodopsin structure with all the residues from Table 3
mutated to the aligned
2 adrenergic receptor residues. IpBABC is shown covalently attached to
His932.64 in TM2, the likely site of covalent
reaction (Dohlman et al., 1988
; Javitch et al., 1999
). Again, the
ligand is bound essentially to an overlapping set of the residues that
contact retinal in rhodopsin. It is important to note that although
some of the residues in the binding-site crevice of these receptors are
conserved, most are not. Thus, these residues have apparently evolved
to impart specificity within a certain receptor. Consequently, what is
"conserved" among these receptors is the positions of
the residues involved in ligand binding and thus the particular surface
that serves the role of ligand binding.
Although the location of the binding site seems to be conserved in
rhodopsin and amine receptors, this need not be the case for all class
A receptors. Agonists must stabilize (one of) the active state of the
receptor, and there is no logical necessity that they do this by making
the same contacts (Schwartz and Rosenkilde, 1996
). Indeed, there is
convincing evidence that peptide ligands bind to extracellular loops;
however, at least in certain cases, there is also evidence that parts
of peptides dip down into the TM domain and contact some of same
positions found to be critical for rhodopsin and the amine GPCRs
(DeMartino et al., 1994
; Macdonald et al., 2000
; Gerber et al., 2001
).
It is likely, however, that structurally dissimilar ligands bind to
some extent in different orientations, and these modes of binding can
be extremely difficult to predict (Shapiro et al., 2000
).
The Second Extracellular Loop
The second extracellular (E2) loop, which connects TM4 and TM5,
also contains residues that contact retinal, as evidenced both by
visual inspection and by the SASA analysis described above. This loop
also contains one of the two highly conserved Cys that are thought to
be disulfide-bonded and thus also link the E2 loop to the top of TM3 in
rhodopsin and in amine GPCRs (Savarese et al., 1992
). The E2 loop dives
down into the TM domain and contains two stretches of
-strand, one
of which,
4, lays directly over retinal. The E2 loop thus forms a
lid over retinal and protects it from the extracellular milieu. Given
the high degree of conservation of the amino acids in the
4 strand
in vertebrate opsins, and the variability within this region in other
class A receptors, it has been suggested that the
4 strand might
serve specifically to define the retinal binding-pocket in vertebrate
opsins and not other GPCRs (Menon et al., 2001
).
The sequence of the E2 loop is highly variable across class A
receptors; as we have noted previously, however, the sequence from 4.59 to 4.68 is highly conserved among functionally related receptors and
among species variants of these receptors (Javitch et al., 2000
). It is
likely, therefore, that this region plays a functional role. This
inference is supported by the identification of this region as the site
of covalent attachment of photoaffinity derivatives of agonist and
antagonist ligands of the
2 adrenergic receptor (Matsui et al., 1989
), by the observed ligand-specific effects
of mutations in this region (reviewed in Javitch et al., 2000
), and by
the predicted spatial proximity of this region to known ligand binding
sites in other TMs. Thus, although we do not yet know of any structural
similarity of the E2 loop between rhodopsin and other class A receptors
beyond the shared disulfide bond, there is evidence for an important
functional role of this loop in the entire family of receptors. Indeed,
in the
1 adrenergic receptor, Zhao et al.
(1996)
have demonstrated that several residues in the E2 loop partially
determine the pharmacological specificity of particular adrenergic
ligands, again suggesting that this loop may contact bound ligands in
other Class A receptors in addition to rhodopsin. Remarkably, we have
noted that these residues seem to be aligned with the retinal-contact
residues in the
4 region of bovine rhodopsin near the
disulfide-bond, raising the possibility that there may be some
structural similarity in this loop region in other class A receptors
and rhodopsin.
Summary
In conclusion, based on our analysis of the data reviewed, the
structures of rhodopsin and of amine receptors may be very similar. We
further propose that several of the highly unusual structural features
of rhodopsin are also present in amine GPCRs, despite the absence of
amino acids that might have been thought to be critical to the adoption
of these features. Thus, different amino acids or alternate
microdomains can support similar deviations from regular
-helical
structure, thereby resulting in similar tertiary structure. Such
structural mimicry is a mechanism by which a common ancestor could
diverge sufficiently to develop the selectivity necessary to interact
with diverse signals, while still maintaining a similar overall fold.
Through this process, the core function of signaling activation through
a conformational change in the TMs that alters the conformation of the
cytoplasmic surface and subsequent interaction with G-proteins is
presumably shared by the entire Class A family of receptors, despite
their selectivity for a diverse group of ligands.
It is important to note, however, as we have discussed above, that there are differences in sequence that might result in some extent of divergence in the resulting structure of some Class A receptors. An example of this is the presence of Pro in TM2 at different positions, where they might bend the helix differently (Fig. 5C). Furthermore, a significant range of conformational diversity could be generated within the binding-site crevice of different GPCRs by the presence of Pro-kinks and Cys/Ser/Thr residues. Conserved Pro-kinks in TM 5, 6, and 7 could adopt different conformations that could change significantly the binding sites of different GPCRs. Nonconserved Pro residues in TM 2 and 4, or nonconserved Cys/Ser/Thr in TM 3 and other helices, are another source of potential structural divergence in the binding-site crevice. Combined, these sources of structural diversity may be capable of generating a large diversity of binding site conformations. It seems, therefore, that these receptors have evolved to maintain an overall fold by means of alternative molecular mechanisms (structural mimicry) that enable localized variations within their binding sites suitable for recognizing a wide variety of ligands. This scenario supports the use of the crystal structure of rhodopsin as a template to model the transmembrane domain of other rhodopsin-like GPCRs. The initial template, however, may require substantial modifications to refine the particular conformation of the binding site to explore ligand-receptor interactions at the molecular level.