Review
The surprising complexity of signal sequences

https://doi.org/10.1016/j.tibs.2006.08.004Get rights and content

Most secreted and many membrane proteins contain cleavable N-terminal signal sequences that mediate their targeting to and translocation across the endoplasmic reticulum or bacterial cytoplasmic membrane. Recent studies have identified many exceptions to the widely held view that signal sequences are simple, degenerate and interchangeable. Growing evidence indicates that signal sequences contain information that specifies the choice of targeting pathway, the efficiency of translocation, the timing of cleavage and even postcleavage functions. As a consequence, signal sequences can have important roles in modulating protein biogenesis. Based on a synthesis of studies in numerous experimental systems, we propose that substrate-specific sequence elements embedded in a conserved domain structure impart unique and physiologically important functionality to signal sequences.

Introduction

The discovery that secreted proteins are encoded with short, removable N-terminal ‘signal sequences’ that earmark them for export [1] was one of the great breakthroughs in cell biology in the 1970s. Much work in the subsequent 30 years has shown that signal sequences are recognized by dedicated factors that catalyze the transport of secretory precursors (preproteins) across the endoplasmic reticulum (ER) or bacterial cytoplasmic membrane. The key molecules in this pathway are found in all living cells and include a cytosolic targeting factor called the signal recognition particle (SRP), the membrane-bound receptor for SRP, an integral membrane protein conducting channel called the Sec61p complex (in eukaryotes) or the SecY complex (in prokaryotes), and a membrane-bound peptidase that removes signal sequences from preproteins on the lumenal (or periplasmic) face of the membrane.

The signal hypothesis originally predicted that all signal sequences would share a distinctive sequence motif; however, the absence of any ‘consensus’ quickly became apparent when several preproteins were sequenced. Nevertheless, the concept of ‘signal sequence equivalence’ prevailed. An early comparative sequence analysis [2] showed that signal sequences have a typical size (∼20–30 residues) and a recognizable three-domain structure (a basic ‘N domain’, a ∼7–13 residue hydrophobic ‘H domain’ and a slightly polar ‘C domain’), but otherwise lack any significant homology. By the early 1980s, it had become clear that signal sequences are often readily interchangeable, tolerant of a wide range of mutations [3] and even capable of directing secretion in evolutionarily distant organisms 4, 5. A remarkable study published in 1987 showed that ∼20% of random sequences can promote the secretion of invertase in yeast [6], strengthening the conclusion that the only essential feature of signal sequences is a hydrophobic core that is uninterrupted by charged residues. These studies implied that the signal recognition machinery has a high degree of tolerance, but they also created the impression that signal sequences have only a very circumscribed role in protein biogenesis. Thus, it has become common ‘wisdom’ that signal sequences are simple, interchangeable domains with a low information content.

Although the discovery of species-specific features of signal sequences [7] and a distinct motif in bacterial lipoprotein signal sequences [8] suggested >20 years ago that not all signal sequences are equivalent, hints that signal sequence diversity reflects an underlying functional complexity have emerged only in more recent studies. These studies, often from seemingly unrelated fields and disparate experimental systems, share one important feature: they have each examined the biosynthesis of a preprotein or membrane protein that differs from the ‘model’ substrates that were originally used to define the basic principles of signal sequence function. Individually each study might seem to describe an ‘exception’ to the well-established paradigm, but viewed together they suggest that differences among signal sequences could ultimately prove to be as physiologically important as their similarities.

Recent experiments have demonstrated that sequence variation among signal sequences can affect protein targeting and translocation, signal sequence cleavage and even postcleavage events. Perhaps more importantly, these studies have shown that the modulation of specific steps in the recognition and processing of signal sequences can have an essential role in protein biogenesis. Here, we review evidence showing that signal sequences encode far more information than is commonly thought, and we synthesize these studies into a cohesive framework to explain signal sequence diversity.

Section snippets

Signal sequences influence the selection of protein targeting pathways

Polypeptides with a signal sequence are first recognized in the cytosol. Early work on mammalian cells identified SRP as the only factor both necessary and sufficient for signal sequence recognition [9]. SRP is a ribonucleoprotein complex that binds to signal sequences as they emerge from the ribosome, slows translation, and targets the ribosome–nascent chain complex to the ER via SRP receptor. By ensuring that preproteins reach the ER at an early stage of translation, SRP is thought to prevent

Signal sequence variation affects interactions with the translocon

Once a substrate is targeted to the ER or cytoplasmic membrane, the signal sequence must interact with and engage a translocon. Even though the composition of translocons can differ widely among different organisms and even within a single cell (Box 1), most signal sequence recognition is thought to involve interaction with the Sec61p or SecY complex 26, 27. Because this gating event is a prerequisite for initiating protein translocation, one might imagine that all substrates interact with the

Signal sequence variation influences events that occur after translocon gating

Two signal sequences that are equally efficient in their ability to initiate translocation can nonetheless differ in other, still poorly understood aspects of their interaction with either the translocon or the signal sequence cleavage machinery. This conclusion has emerged from various studies in which a specific signal sequence has been shown not to influence translocation per se, but rather to affect a cotranslational or posttranslational maturation event. The autotransporter EspP, a

Postcleavage roles of signal sequences

Given the tremendous functionality of peptides in general, it is intriguing to consider the possibility that signal peptides (or fragments thereof) could have important roles in the cell after they are removed from translocating polypeptides. Although it has long been assumed that cleaved signal peptides are degraded rapidly, the pathways and machinery involved in postcleavage processing are only now being explored. Studies following the fate of the cleaved preprolactin signal peptide have

A unifying framework for signal sequence diversity

The principal concept that has emerged here is that the highly degenerate sequence requirements for signal sequence recognition and processing by the core translocation machinery provides an incredibly elastic opportunity to embed additional information. This flexibility in primary sequence is likely to have been exploited during the course of evolution to maintain and to fine-tune any beneficial substrate-specific features of signal sequences that might have arisen by random chance [66]. In a

Acknowledgements

We thank Julie Donaldson, Erik Snapp, Cheryl Woolhead and Yihong Ye for critically reading the manuscript and providing helpful suggestions. Work in the authors’ laboratories is supported by the intramural research programs of the National Institute of Child Health and Human Development and the National Institutes of Diabetes and Digestive and Kidney Diseases.

References (79)

  • S.J. Kim

    Signal sequences control gating of the protein translocation channel in a substrate-specific manner

    Dev. Cell

    (2002)
  • M.G. Coppolino et al.

    Calreticulin

    Int. J. Biochem. Cell Biol.

    (1998)
  • K.L. Shaffer

    Regulation of protein compartmentalization expands the diversity of protein function

    Dev. Cell

    (2005)
  • D.T. Rutkowski

    Signal sequences initiate the pathway of maturation in the endoplasmic reticulum lumen

    J. Biol. Chem.

    (2003)
  • X. Chen

    Signal peptidase and oligosaccharyltransferase interact in a sequential and dependent manner within the endoplasmic reticulum

    J. Biol. Chem.

    (2001)
  • G. Kurys

    The long signal peptide isoform and its alternative processing direct the intracellular trafficking of interleukin-15

    J. Biol. Chem.

    (2000)
  • M. Tokunaga

    Prolipoprotein signal peptidase in Escherichia coli is distinct from the M13 procoat protein signal peptidase

    J. Biol. Chem.

    (1982)
  • M.K. Lemberg et al.

    Requirements for signal peptide peptidase-catalyzed intramembrane proteolysis

    Mol. Cell

    (2002)
  • M. Froeschke

    Long-lived signal peptide of lymphocytic choriomeningitis virus glycoprotein pGP-C

    J. Biol. Chem.

    (2003)
  • R.S. Hegde

    Targeting and beyond: new roles for old signal sequences

    Mol. Cell

    (2002)
  • S. Panzner

    Posttranslational protein transport in yeast reconstituted with a purified complex of Sec proteins and Kar2p

    Cell

    (1995)
  • H.A. Meyer

    Mammalian Sec61 is associated with Sec62 and Sec63

    J. Biol. Chem.

    (2000)
  • P. Holden

    Secretion of cartilage oligomeric matrix protein is affected by the signal peptide

    J. Biol. Chem.

    (2005)
  • G. Blobel et al.

    Transfer of proteins across membranes. I. Presence of proteolytically processed and non-processed nascent immunoglobulin light chains on membrane-bound ribosomes of murine myeloma

    J. Cell Biol.

    (1975)
  • L.M. Gierasch

    Signal sequences

    Biochemistry

    (1989)
  • K. Talmadge

    Eukaryotic signal sequence transports insulin antigen in Escherichia coli

    Proc. Natl. Acad. Sci. U. S. A.

    (1980)
  • C.A. Kaiser

    Many random sequences functionally replace the secretion signal sequence of yeast invertase

    Science

    (1987)
  • L. Abrahmsén

    Analysis of signals for secretion in the staphylococcal protein A gene

    EMBO J.

    (1985)
  • G. von Heijne

    The structure of signal peptides from bacterial lipoproteins

    Protein Eng.

    (1989)
  • P. Walter et al.

    Purification of a membrane-associated protein complex required for protein translocation across the endoplasmic reticulum

    Proc. Natl. Acad. Sci. U. S. A.

    (1980)
  • M.G. Waters et al.

    Secretory protein translocation in a yeast cell-free system can occur posttranslationally and requires ATP hydrolysis

    J. Cell Biol.

    (1986)
  • M.A. Poritz

    An E. coli ribonucleoprotein containing 4.5S RNA resembles mammalian signal recognition particle

    Science

    (1990)
  • R.T. Batey

    Crystal structure of the ribonucleoprotein core of the signal recognition particle

    Science

    (2000)
  • D.T. Ng

    Signal sequences specify the targeting route to the endoplasmic reticulum membrane

    J. Cell Biol.

    (1996)
  • J.-W. de Gier

    Differential use of the signal recognition particle translocase targeting pathway for inner membrane protein assembly in Escherichia coli

    Proc. Natl. Acad. Sci. U. S. A.

    (1998)
  • H.C. Lee et al.

    The targeting pathway of Escherichia coli presecretory and integral membrane proteins is specified by the hydrophobicity of the targeting signal

    Proc. Natl. Acad. Sci. U. S. A.

    (2001)
  • J.-W. de Gier et al.

    The ribosome and YidC. New insights into the biogenesis of Escherichia coli inner membrane proteins

    EMBO Rep.

    (2003)
  • S. Grallath

    L25 functions as a conserved ribosomal docking site shared by nascent chain-associated complex and signal-recognition particle

    EMBO Rep.

    (2006)
  • D. Huber

    Use of thioredoxin as a reporter to identify a subset of Escherichia coli signal sequences that promote signal recognition particle-dependent translocation

    J. Bacteriol.

    (2005)
  • Cited by (320)

    View all citing articles on Scopus
    View full text