|
|
||||||||
Guest Access | Sign In via User Name/Password |
|||||||||
* From the Department of Medicine (Drs. Bowler and Ellison), National Jewish Medical and Research Center; and the Department of Anesthesia (Dr. Reisdorph), University of Colorado Health Sciences Center, Denver, CO.
Correspondence to: Russell P. Bowler, MD, PhD, National Jewish Medical and Research Center, K729a, 1400 Jackson St, Denver, CO 80206; e-mail: BowlerR{at}njc.org
Abstract
Proteomics is the study of the entire protein complement of the genome (the proteome) in a biological system. Proteomic studies require a multidisciplinary approach and have only been practical with the convergence of technical and methodologic improvements including the following: advances in mass spectrometry and genomic sequencing that now permit the identification and relative quantization of small amounts (femtomole) of nearly any single protein; new methods in gel electrophoresis that allow the detection of subtle changes in protein expression, including posttranslational modifications; automation and miniaturization that permit high-throughput analysis of clinical samples; and new bioinformatics and computational methods that facilitate analysis and interpretation of the abundant data that are generated by proteomics experiments. This convergence makes proteomics studies practical for pulmonary researchers using BAL fluid, lung tissue, blood, and exhaled breath condensates, and will facilitate the research of complex, multifactorial lung diseases such as acute lung injury and COPD. This review describes how proteomics experiments are conducted and interpreted, their limitations, and how proteomics has been used in clinical pulmonary medicine.
Key Words: bioinformatics electrophoresis mass spectrometry
Similar to genomics, there are > 27 different definitions of "proteomics," yet all encompass the central concept of studying a nearly comprehensive set of proteins (as opposed to genes) expressed by a cell or organism. There are multiple advantages to studying protein expression rather than gene expression. First, it is primarily proteins, not genes, that determine how a cell functions. Second, a single gene or even a single mature messenger RNA may be associated with multiple proteins due to splicing, RNA editing, or posttranslational modifications, and there may be little correlation between messenger RNA levels and protein expression. Third, acellular compartments such as plasma and lung epithelial lining fluid (ELF) have little DNA or RNA but may have abundant proteins that are important markers of disease. A major disadvantage to proteomics vs genomics is technical. Nearly all genes are represented by only four nucleic acids, which can easily be detected and quantified using complementary nucleic acid sequences, whereas proteins are represented by > 20 amino acids and hundreds of distinct posttranslational modifications that require complicated identification and quantification (see following).
These technical limitations have been overcome, and proteomics research and publications have grown exponentially, primarily due to advances in protein separation, mass spectrometry (MS), and bioinformatics. This review will summarize common proteomics methods, discuss their limitations, and illustrate how proteomics has been used to study pulmonary biology.
Proteomic Methods
A typical workflow for a proteomics experiments includes the following: (1) sample acquisition and storage, (2) sample preparation and fractionation, (3) protein quantification and identification, and (4) bioinformatics.
Sample Acquisition and Storage
Proper study design is the first step and has been written about in depth elsewhere.1 Sample accessibility is the reason that most clinical studies use blood draws. Other factors are dependent on methodological approaches and may not be specific to proteomic studies (eg, subsets of proteins have diurnal expression patterns, heparinized blood tubes preserve different proteins from citrated tubes, increased freeze-thaw cycles may lead to differential protein degradation). Problems specific to proteomics include the use of certain reagents (eg, polyethylene glycol) or contamination (eg, by keratins). Inattention to these details is a frequent criticism of proteomics experiments; for example, it has been speculated that inconsistent sample processing has led to false ovarian cancer biomarker discovery using surface-enhanced laser desorption ionization (SELDI).2 Thus, investigators should be cautious and include appropriate experimental controls because any sample manipulations prior to proteomic analysis could add notable variation to protein measurement.
Preparation and Fractionation of Proteins
A typical complex biological sample such as serum or lung tissue has many thousands of proteins in concentrations that span > 10 orders of magnitude. For instance, serum albumin has a normal concentration range of 35 to 50 mg/mL (35 to 50 x 109 pg/mL), yet interleukin 6 has a normal range of just 0 to 5 pg/mL. Since no current technologies are capable of simultaneously resolving this number of proteins at such a great magnitude of concentrations, additional steps are needed to simplify the complexity of the sample, including separation by size, pH, and chromatographic properties, or by immunodepletion and enrichment strategies.
Size Separation: The most common laboratory method for separating proteins by molecular weight is sodium dodecyl sulfate-polyacrylamide gel electrophoresis (PAGE). Size separation by sodium dodecyl sulfate-PAGE relies on the property that protein migration in a constant electromagnetic field is roughly proportional to the molecular weight of a protein. Other methods include size exclusion (eg, column and membrane) and differential centrifugation. However, some reagents used with differential centrifugation are not compatible with direct downstream MS methods (eg, cesium chloride).
pH Separation: All proteins are made of acid, basic, and neutral amino acids. By altering the hydrogen concentration (pH) of a protein solution, one can force acidic and basic amino acids to change charge. By creating a pH gradient within an electric field, proteins will migrate to the point at which they are neutrally charged (ie, the isoelectric point [pI] of the protein). Advances in the manufacture of immobilized ampholyte gradients have revolutionized this approach by creating reproducible gel strips that include the pI of nearly all proteins (pI 310). Separation by pH (first dimension) is typically followed by PAGE (second dimension) to obtain a two-dimensional electrophoresis (2-DE) separation map. Certain classes of proteins are not well-suited for 2-DE, including membrane and insoluble proteins, low abundant proteins, and proteins with high molecular mass or extreme pH. In addition, several compounds interfere with the first dimension including salts, ionic detergents, and lipids. A major problem to 2-DE experiments is gel-to-gel variations that make comparisons difficult. A solution to this problem has been to label proteins from different samples with small molecular weight, electrically neutral fluorescent dyes, and then combining the samples before 2-DE separation.3 This technique has been termed differential in-gel electrophoresis (DIGE) and is useful for comparing two to three samples, although the cost of dyes can make these studies cost-prohibitive. MS is typically used downstream of 2-DE and DIGE to identify proteins of interest. Protein stains vary in their compatibility with MS, although protocols are available for processing commonly used stains (eg, silver, Coomassie, and Sypro Ruby).
Chromatographic Separation: Chromatographic separation relies on a the inherent difference of a protein in affinities for chemical substances. Reverse-phase chromatography is the most frequently used technique for proteomics and refers to the principle that hydrophobic substances (peptides and proteins) will elute from hydrophobic columns (eg, C4, C8, and C18) at progressively higher concentrations of organic solvent. Column-based liquid chromatography (LC) can be directly coupled to a mass spectrometer. Alternatively, surface chromatography (ie, SELDI) can be used to selectively capture proteins to a chip surface quantified using a mass spectrometer. A limitation of SELDI is that it is not always easy to identify the proteins that make up the profiles.
Immunodepletion and Enrichment Strategies: Immunodepletion, immunoprecipitation, and ultracentrifugation are strategies that improve detection on nonabundant proteins. Immunodepletion strategies remove high-abundance proteins such as albumin and Ig in order to detect medium-abundance and low-abundance proteins. Enrichment strategies rely on the ability of a specific antibody to pull out a single protein or related proteins from a complex solution. Ultracentrifugation of plasma has been successfully used to identify the proteins of plasma microparticles, which are spherical cell membrane fragments derived from either apoptotic or activated cells.4
Protein Identification and Quantification Using MS
MS:
MS is one of the most common technologies used in proteomics (see reviews567) and refers to an instrumental method for identifying the chemical constitution of a substance by means of the separation of gaseous ions according to their differing mass and charge (Fig 1
). Mass spectrometers can be used for profiling both peptides and small and large proteins. In a typical protein identification workflow, a protein typically is first digested using a proteolytic enzyme (eg, trypsin) that cleaves reproducibly at arginines and lysines. The resulting peptides are then ionized to produce charged (protonized) molecules, and travel through a mass analyzer and then to a mass detector. The two ionization techniques most commonly used are (1) matrix-assisted laser-desorption ionization (MALDI) and (2) electrospray ionization (ESI). With MALDI, proteins and peptides are mixed with an energy-absorbing matrix (eg, cinnamic acid) and then are ionized using a laser. With ESI, a sample is introduced in liquid form, and the application of a very high voltage forms a fine spray through a hypodermic needle to ionize peptides. An electromagnetic field causes the ionized peptides to travel through the mass analyzer to a detector. Mass analyzers include time-of-flight (TOF), ion traps, Fourier transform and quadrupoles or combinations (eg, quadrupole-TOF). In TOF MS, the time that it takes the peptide to reach the detector is converted into a mass/charge ratio and is visualized as a mass spectrum. MALDI is generally paired with TOF analyzers (MALDI-TOF). This is excellent for high-throughput studies and is somewhat forgiving of contaminants such as salts, although only single proteins or very simple mixtures can be analyzed. In tandem MS using an ion trap, peptides of a single mass are "trapped" in an electromagnetic field, fragmented, and the resulting fragments are detected and a second mass spectrum is formed. Similarly, fragmentation can occur in a quadrupole prior to TOF during a tandem MS experiment in a quadrupole TOF. Because the fragmentation of a peptide often results in sequence information, a single, high-quality peptide can often be used to identify a protein. In addition, this information can be used to localize posttranslational modifications. However, long run times make these instruments medium-throughput to low-throughput devices, and ESI is generally not as tolerant of salts as MALDI. Therefore, a typical workflow may include 2-DE followed by MALDI-TOF for high-throughput, single-gel-spot analyses. When additional sensitivity, complex sample analysis, or sequence information is required, ESI-MS/MS can be used.
|
|
|
|
The majority of pulmonary proteomic studies have been descriptive and have focused on the following major compartments in the lung: ELF, lung tissue, airway cells, and blood. For more detail, readers are referred to more recent exhaustive reviews.1112
The ELF Proteome
ELF represents the thin layer of fluid covering airway epithelial cells and is readily obtained as BAL fluid (BALF). Newer bronchoscopic microsampling techniques that do not dilute ELF may supplant the BALF procedure for proteomics experiments.13 One of the first studies of the BALF proteome was published in 197914 and used 2-DE to identify BALF proteins in patients with alveolar proteinosis. These early studies were typically limited in their ability to identify specific proteins; however, with recent MS advances, Guo et al15 have used one-dimensional electrophoresis and 2-DE LC coupled to a mass spectrometer to identify 297 unique proteins in the mouse BALF proteome. Most proteins in BALF are plasma proteins such as albumin and Ig; however, a small subset of proteins such as surfactant protein-A and glutathione-S-transferase are significantly more abundant in BALF compared to plasma or serum.1617 Descriptive BALF proteomics studies1112 have been published in healthy subjects, smokers, sarcoidosis, cystic fibrosis, pulmonary fibrosis, asbestosis and mesothelioma, hypersensitivity pneumonitis, immunosuppression, ozone exposure, and acute lung injury.
Proteomics of Lung Cells and Tissue
BAL can also be used to obtain alveolar macrophages for functional proteomics studies. For example, alveolar macrophages have a distinct proteomic profile that relates to their physiologic role in proteolysis, actin reorganization, and cellular adaptation compared to those of blood mononuclear cells.1819 Obtaining lung tissue requires invasive sampling techniques, and the majority of studies have focused on the differences between the normal lung and lung cancers. These studies have relied on both 2-DE and MALDI-TOF approaches. For instance, Yanagisawa et al20 examined 79 lung tumors and tissue from 14 normal lungs using MALDI-TOF and found 15 peaks that could distinguish non-small cell lung cancer patients with a good prognosis from those with a poor prognosis. Three of these peaks were identified as small ubiquitin-related modifier-2 protein, thymosin-ß4, and ubiquitin. MALDI-TOF has also recently been used21 to identify proteomic patterns that differentiate invasive lesions from normal bronchial epithelium. A disadvantage to many of these studies is that they included the surrounding stromal tissue. To overcome this limitation, laser capture microdissection of tumor specimens has been used to improve the specificity of 2-DE for biomarker discovery.22 Although many of these studies have identified promising biomarkers for lung cancer, large validation studies will be required before the biomarkers can be used clinically.
The Blood (Plasma and Serum) Proteome in Pulmonary Diseases
Serum and plasma proteomics are attractive areas of study because they are less invasive than other methods. The SELDI approach has been used to study plasma profiles of lung cancer23 and COPD.24 Other methods, such as 18O metabolic labeling, have recently been used25 to identify 211 proteins that were up-regulated and 246 proteins that were down-regulated in the sera of mice with lung adenocarcinomas (Lewis lung carcinoma), including vascular endothelial growth factor receptor 1.
Emerging Proteomic Technologies
Proteomic technologies are rapidly evolving new ways to look at the proteome. For example, the study of phosphorylated proteins, or phosphoproteomics, not only provides an indication of what proteins and pathways are involved in a particular disease, but can indicate which proteins are likely drug targets.26 More global approaches include phosphoprofiling, which combines enrichment using immobilized metal affinity chromatography with differential labeling of proteins by esterification for quantitative analysis. Various proteins have been detected in exhaled breath condensate by 2-DE,27 and exhaled breath condensate might be a noninvasive source for monitoring respiratory diseases. Finally, several respiratory diseases such as asthma, cystic fibrosis, and bronchitis are characterized by quantitative and qualitative changes in glycoproteins in which samples are enriched for glycoproteins using lectin.28
Conclusion
Although comparative and quantitative proteomics studies may be more technically difficult compared to gene expression studies, they provide orders of magnitude more qualitative information. As with genomics, all proteomics techniques have the advantage of being able to simultaneously study a subset of all proteins as opposed to a single protein. Although clinical proteomics is a technology that is still in its infancy, it has great potential to improve our understanding and treatment of lung disease by identifying patterns of protein expression. These protein expression profiles can reveal broad pathologic processes such as altered proteolytic processing or glycosylation that may not have been evident with other technologies or may reveal complex patterns that can serve as new diagnostic tools (eg, early cancer detection). However, the routine clinical use of proteomic technologies is likely several years away, since, as with other clinical studies, large studies will be needed to validate the clinical utility of each pattern.
Footnotes
Abbreviations: BALF = BAL fluid; CART = classification and regression tree; 2-DE = two-dimensional electrophoresis; DIGE = differential in-gel electrophoresis; ELF = epithelial lining fluid; ESI = electrospray ionization; LC = liquid chromatography; MALDI = matrix-assisted laser-desorption ionization; MS = mass spectrometry; PAGE = polyacrylamide gel electrophoresis; pI = isoelectric point; SELDI = surface-enhanced laser desorption ionization; TOF = time of flight
This work was supported by Flight Attendant Medical Research Institute (R.P.B.) and the Kenneth W. Monfort Program for Research in COPD (M.C.E.).
The authors have reported to the ACCP that no significant conflicts of interest exist with any companies/organizations whose products or services may be discussed in this article.
Received for publication February 6, 2006. Accepted for publication May 7, 2006.
References
This article has been cited by other articles:
![]() |
E. Kipnis, K. Hansen, T. Sawa, K. Moriyama, A. Zurawel, A. Ishizaka, and J. Wiener-Kronish Proteomic Analysis of Undiluted Lung Epithelial Lining Fluid Chest, August 1, 2008; 134(2): 338 - 345. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Farre and A. T. Dinh-Xuan Translational research in respiratory medicine Eur. Respir. J., December 1, 2007; 30(6): 1041 - 1042. [Full Text] [PDF] |
||||
![]() |
E. Kipnis, B. Guery, and J. Wiener-Kronish Promises and Limitations of the Bronchoscopic Microsampling Probe Chest, October 1, 2007; 132(4): 1414 - 1414. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |