Instrumental analysis

sama123
massspecarticle2019.pdf

Nano-LC FTICR Tandem Mass Spectrometry for Top-Down Proteomics: Routine Baseline Unit Mass Resolution of Whole Cell Lysate Proteins up to 72 kDa Jeremiah D. Tipton,†,‡,§,⊥,¶ John C. Tran,§ Adam D. Catherman,§ Dorothy R. Ahlf,§ Kenneth R. Durbin,§

Ji Eun Lee,§,⊥ John F. Kellie,§ Neil L. Kelleher,§ Christopher L. Hendrickson,†,¶ and Alan G. Marshall*,†,¶

†Ion Cyclotron Resonance Program, National High Magnetic Field Laboratory, Florida State University, 1800 East Paul Dirac Drive, Tallahassee, Florida 32310-4005, United States §Department of Molecular Biosciences, Northwestern University, 2205 Tech Drive, Evanston, Illinois 60208-3500, United States ¶Department of Chemistry and Biochemistry, Florida State University, 95 Chieftain Way, Tallahassee, Florida 32306-4390, United States

*S Supporting Information

ABSTRACT: Current high-throughput top-down proteomic platforms provide routine identification of proteins less than 25 kDa with 4-D separations. This short communication reports the application of technological developments over the past few years that improve protein identification and characterization for masses greater than 25 kDa. Advances in separation science have allowed increased numbers of proteins to be identified, especially by nanoliquid chromatography (nLC) prior to mass spectrometry (MS) analysis. Further, a goal of high-throughput top-down proteomics is to extend the mass range for routine nLC MS analysis up to 80 kDa because gene sequence analysis predicts that ∼70% of the human proteome is transcribed to be less than 80 kDa. Normally, large proteins greater than 50 kDa are identified and characterized by top-down proteomics through fraction collection and direct infusion at relatively low throughput. Further, other MS-based techniques provide top-down protein characterization, however at low resolution for intact mass measurement. Here, we present analysis of standard (up to 78 kDa) and whole cell lysate proteins by Fourier transform ion cyclotron resonance mass spectrometry (nLC electrospray ionization (ESI) FTICR MS). The separation platform reduced the complexity of the protein matrix so that, at 14.5 T, proteins from whole cell lysate up to 72 kDa are baseline mass resolved on a nano-LC chromatographic time scale. Further, the results document routine identification of proteins at improved throughput based on accurate mass measurement (less than 10 ppm mass error) of precursor and fragment ions for proteins up to 50 kDa.

Top-down proteomics comprises the direct mass spectro-metric (MS) isolation of intact protein(s) followed by fragmentation (MSn) for protein identification and amino acid site location of post-translational modifications.1−3 Bottom-up proteomics relies on proteolytic digestion of complex protein mixtures, prefractionation of peptides, and nanoliquid chroma- tography (nLC) MS/MS.4−6 Although the bottom-up approach can achieve extensive proteome coverage, sequence coverage of the identified proteins is typically incomplete, often lacking specific isoform information.7 Top-down proteomics can potentially provide 100% sequence coverage by accurate precursor mass measurement and efficient dissociation and measurement of resulting fragment ions.2,3,8 The strength of top-down proteomics is the ability to characterize protein truncations, isoforms, and combinatorial posttranslational modifications and even detect errors in annotated genome databases.7,9−15

However, top-down characterization of proteins (above 25 kDa in mass) is often limited to a targeted approach; i.e., an

off-line purified sample is directly infused into the mass spectrometer. Prior to recent developments in protein separation, platforms that automated the fraction collection/ direct infusion process improved throughput and number of identified proteins.14,16−19 Recent advances in solution-based electrophoresis and nLC separations,20−22 instrumenta- tion,23−29 and informatics28,30 have enabled an increase in the number of identified proteins. Furthermore, an increased number of reports indicate that high-throughput character- ization of proteins below 25 kDa can be routinely achieved by nano-LC linear quadrupole ion trap (LTQ)/Fourier transform ion cyclotron resonance mass spectrometry (FTICR MS).21,24,31−34 Recently, high-throughput top-down proteo- mics with a 4-D separation platform identified over 1000 proteins from whole cell lysate, thus increasing the capacity by

Received: October 6, 2011 Accepted: February 13, 2012

Letter

pubs.acs.org/ac

© XXXX American Chemical Society A dx.doi.org/10.1021/ac202651v | Anal. Chem. XXXX, XXX, XXX−XXX

20× for human while also extending the upper molecular weight limit per “discovery” type experiment.35 Further, the number of proteins was increased by 5× for the yeast proteome.36 In those studies, there was fragmentation evidence for 2−3× as many species for every gene product identification. In the human proteome study, protein fractions above 25 kDa were measured by low-resolution precursor mass measurement, followed by in-source dissociation. This strategy resulted in identification of many protein species above 25 kDa but without precursor ion measurement. Thus, a strategy for accurate mass measurement of proteins above 25 kDa remains a goal for top-down proteomics. The main reason for the need for high resolution measurement is for post-translational modification (PTM) and isoform characterization: i.e., protein species often coelute, and abundance may be directly measured for a particular system. Increased throughput for top-down proteomics is also related

to the chosen fragmentation method. Top-down protein characterization was originally implemented with electron capture dissociation (ECD).37−39 With ECD, a protein ion is dissociated into multiple fragment ions of lower abundance. Thus, coaddition (or signal averaging) is typically required to improve the signal-to-noise ratio for de novo sequencing of resulting fragments. Recent introduction of electron transfer dissociation (ETD)40−42 enabled ECD-type fragmentation in a quadrupole ion trap. ETD fragmentation with the linear quadrupole trap may improve throughput and fragmentation efficiency as advanced instrumentation continues to develop. In contrast to ECD or ETD, “thermal” activation [i.e., collision- induced dissociation (CID), infrared multiphoton dissociation (IRMPD), source-induced dissociation, etc.] heats an ion to a higher Boltzmann temperature and breaks the weakest bond (the peptide C−N linkage).43 For example, IRMPD44−46 dissociates a precursor peptide ion at fewer sites and, thus, typically provides less sequence information than ECD.3,8,47

Although CID provides less sequence coverage, throughput and sensitivity are increased because dissociation typically produces fragment ions that have populations required for single measurement events. Thus, the main difference between ECD and thermal activation (with regard to high-throughput analysis with online nLC) is rapid fragmentation into fewer modes, thus becoming the method of choice for fragmentation to increase the number of identified proteins by 20×.35

“Hybrid” instruments generate fragment ions in a quadrupole ion trap through collision-induced dissociation, source-induced dissociation, or high-energy collisional dissociation (HCD) prior to transfer to a high resolution mass ana- lyzer.14,17,24,27,48−51 In this report, we apply source-induced dissociation to identify whole cell lysate proteins between 25 and 50 kDa in molecular weight. The progression of high-throughput top-down proteomics to

higher molecular weight is limited by several sources of signal reduction.3,52 Even a relatively small protein (e.g., carbonic anhydrase, C1474H2286N412O439S14) contains heavy isotopes (13C, 15N, 34S) that generate a broad isotopic distribution for a given charge state. However, the complex isotopic distribution may be “deisotoped” to provide accurate mass for a large protein.53 Also, electrospray ionization (ESI),54,55 the method of choice for coupling nLC with a mass spectrometer, imparts multiple charges, creating a charge state distribution (each with its own isotopic distribution) for a protein.56 Moreover, a higher molecular weight protein typically exhibits higher (and more) charge states, further reducing the observed mass

spectral signal magnitude.52 Finally, the mass spectrum of a large electrosprayed protein may be further broadened by substitution of alkali metal ions (Na+, K+) for one or more of the protons and by nonspecific binding of solvent or small molecule ligands. In fact, the solvation state of a large protein is suggested to be one of the main limitations for consistent fragmentation by ECD, CID, or nozzle-skimmer dissocia- tion.57,58

Despite the challenges associated with analyzing large polypeptides, we report a simple top-down platform35 that can provide accurate precursor measurement of proteins (derived from whole cell lysate) up to 72 kDa. Results include rms mass errors of less than 10 ppm for identified proteins. The implementation of the previously reported 4-D separation platform with the addition of FTICR “zoom mapping” precursor detection allow us to resolve intact protein isoforms and posttranslational modifications up to 72 kDa.

■ EXPERIMENTAL PROCEDURES M-Phase-Arrested HeLa Cell Preparation, Solution

Isoelectric Focusing, and Gel-Eluted Liquid Fraction Entrapment Electrophoresis. M-phase-arrested HeLa S3 cells were prepared by treating asynchronous cells (3 × 105 cells/mL) with 1 μM colchicine for 18 h. Cells (5 × 108 cell/mL) were harvested, lysed, and split into cytosolic and nuclear fractions as previously described.21 Isolated protein pellets were acetone-precipitated after lysis and dissolved (3 mg each) in 3.2 mL of solution isoelectric focusing (sIEF) buffer (8 M urea, 2 M thiourea, 50 mm DTT, 1% w/v Biolyte 3/10 carrier ampholytes (Bio-Rad Laboratories, Hercules, CA)) and focused by use of a custom designed eight-chamber sIEF system as previously described.59 Acetone-precipitated sIEF fractions were resolved in denaturant buffer and separated by gel-eluted liquid fraction entrapment electrophoresis (GF) as previously described.20,60 The GF separation was visualized by silver stained SDS-PAGE (1/15 loading amount of GF fractions corresponding to low μg) to provide sample representative gel images.

Standard Protein Analysis by Automated Prefractio- nation and Direct Infusion. Prefractionation of standard proteins was performed with a Triversa Nanomate (Advion Biosciences, Ithaca, NY), followed by automated direct infusion of fractions collected in 96 well plates.14,19,33 Specifically, characterization of transferrin included separation and fraction collection of 5 pmol with a reversed-phase C4 column (GraceVydac, Deerfield, IL): 30 min gradient delivered at 50 μL/min. The Triversa Robot was trained to collect 30 s fractions, followed by direct infusion of the fractions for 10−15 min. Similar experiments were performed with protein samples from GF only or sIEF-GF (fractions visualized to be above 25 kDa). Results from that experiment indicated that sample lifetime deteriorates rapidly due to exposure to air and light and nonoptimal solvents.

Nano-Liquid Chromatography of sIEF/GF Fractions. Nano-LC was performed with a 1-D plus Nano-LC system (Eksigent, Dublin, CA), delivering a gradient (400 nL/min) of 5% B to 80% B in 30−50 min (Solvent A: 94.8% H2O, 5% acetonitrile, 0.2% formic acid; Solvent B: 4.8% H2O, 94% acetonitrile, 0.2% formic acid). Samples were loaded onto a 30 mm ×150 μm precolumn (1 μL/min) and separated over a 100 mm × 75 μm i.d. pico-Frit column with a 15 μm i.d. tip. The columns were packed with PLRP-S (5 μm, 1000 Å) material (New Objective, Woburn, MA).

Analytical Chemistry Letter

dx.doi.org/10.1021/ac202651v | Anal. Chem. XXXX, XXX, XXX−XXXB

Fraction Collection, Direct Infusion ESI FTICR Mass Spectrometry. Mass spectrometry was performed with a custom-built 14.5 T LTQ FTICR mass spectrometer (LTQ- FT, Thermo Fisher Corp., Bremen, Germany).23,25 Direct infusion (automatic timed infusion by the Triversa Nanomate) MS and MS/MS analysis were performed by ESI 14.5 T FTICR MS (1.8 s transient duration). Two modes of FTICR acquisition were used during the 10−15 min automatic ESI time frame. First, broadband detection was signal averaged for 3 to 5 min. Second, MS/MS by CID activation was performed for 7 to 10 min. Further, the instrument is also equipped with a wired octopole,61 thus increasing the capacity for collection of ions prior to transfer to the ICR cell.25 For precursor ion measurement, one million charges were accumulated in the LTQ prior to transfer to the wired octopole; 10 cycles were accumulated in the wired octopole prior to transfer to the ICR cell. Further, for CID MS/MS, 0.25 million charges were activated for dissociation in the LTQ prior to 15 cycles and transfer to the ICR cell. Data was compiled and analyzed with custom cRAWler software.62

Online NanoLC MS Thermal Activation with Source- Induced Dissociation. Protein fractions (above 25 kDa) were analyzed by nLC (i.e., analytical column, 75 μm i.d.) “zoom- mapping” (5 × 106 charges with 4 microscans) source-induced dissociation (6 × 106 charges with 6 microscans) Simply, “zoom-mapping” is a sequential “walking window” in which ions spanning an m/z range of 80 to 100 are chosen on the basis of the direct infusion experiments (i.e., highest m/z envelope signal magnitude). On the basis of the direct infusion experiments, the sequential scheme was optimized to be: measurement window 1 is m/z 890 to 950, and measurement window 2 is m/z 940 to 1000, followed by broad mass range source-induced dissociation. The sequence was repeated during the duration of nLC MS period. Data was extracted with custom cRAWler software62 and searched against differently annotated databases with ProSight PC 2.0.30 A two-tier search tree included a 200 Da absolute mass search followed by a 50 000 Da (delta mass turned on, fragment ions with a search tolerance of 10 ppm). The cutoff score for accepting a protein as a true identification was an E-value of 1 × 10−4. The feature map for visualization of the nLC MS data was generated with Proteome Display.62

■ RESULTS AND DISCUSSION The continued development of high-throughput top-down proteomics increases the number of isoforms and PTMs characterized from a single biological system during a single experiment. A recent example of this improvement was reported for human cell lines: a 20-fold increase in the total number of identified proteins over any prior report.35 Accurate precursor ion measurement precedes the characterization of increased numbers of isoforms and PTMs: i.e., an intact protein and its phosphorylated form are measured in the same spectrum, differing by 80 Da. If higher molecular weight proteins are accessible with high resolving power and high mass accuracy, multiple posttranslationally modified protein species may be resolved and identified simultaneously. A specific example of the need for high mass accuracy includes proteins that have been deamidated (a mass increase of 1 Da). The typical experiment for proteins above 25 kDa includes

prefractionation with reversed-phase chromatography followed by direct infusion mass spectrometry.14,19,33 Initial experiments for whole cell lysate proteins (above 25 kDa in mass) were

performed by that method. The constant infusion experiments resulted in the definition of two “zoom-mapping” windows for nLC MS. Transferrin, a commercially available 78 kDa protein (18 disulfide bonds, 1 site for phosphorylation, and 1 site for glycosylation) was fractionated off-line and directly infused. Signal averaging for 5 min at mass resolving power m/Δm50% = 400 000 at m/z 800 revealed the full baseline unit mass- resolved isotopic distribution (Figure 1a). Heterogeneous

superposition of phosphorylation/glycosylation and oxidation modifications generates unresolved “fine structure” for each resolved peak, in addition to fine structure caused by overlap of heavy isotopes (e.g., 13C vs 15N). Despite spectral complexity, dissociation of transferrin by LTQ CID and FTICR fragment ion measurement resulted in a sequence tag that was identified by ProSight 2.0 (Figure 1b). Transferrin represents a class of proteins currently difficult to analyze by top-down methods, namely, heterogeneous species with multiple sites for glycosylation, phosphorylation, and salt adducts. For character- ization of recombinant biologics, the fraction collection/direct infusion strategy is still very useful. The transition of the direct infusion experiment to nLC FT

MS for transferrin (∼1.2 pmol loaded on column; ∼20 s elution time) is shown in Figure 1c. In contrast to the full direct

Figure 1. (A) Baseline unit mass-resolved ESI 14.5 T FTICR mass spectrum of bovine transferrin after LC fraction collection and direct infusion. (B) Signal averaged CID product ion mass spectrum, providing a sequence tag for confident identification. (C) Baseline unit mass-resolved nLC ESI FTICR precursor ion mass spectrum from ∼1.2 pmol of transferrin loaded onto a 75 μm i.d. nano-LC (see text). Each “cycle” represents a repeated LTQ ion accumulation and transfer to an external octopole before injection into the ICR cell.

Analytical Chemistry Letter

dx.doi.org/10.1021/ac202651v | Anal. Chem. XXXX, XXX, XXX−XXXC

infusion experiment (10 min for MS and MS/MS data collection), the nLC elution time for less material was only 20 s. Online nLC MS obviously allows for multiple components to be analyzed, depending on the peak capacity of the chromatography material. Further, nonreduced transferrins were analyzed by nLC. Nonreduced transferrin resulted in the identification of transferrin with a ProSightPC sequence tag score of 3 × 10−10. Protein extracts were derived from mitotically arrested HeLa

S3 cells by cell lysis, fractionated into nuclear and cytosolic, and separated by solution isoelectric focusing a GF. Initial experiments for whole cell lysate proteins (above 25 kDa in mass) were performed by the same fraction collection/direct infusion method described above. Although the “targeted” fractionation collection/direct infusion approach for character- ization of proteins is typical for proteins above 30 kDa, that method is not considered high throughput. With improved throughput, an increased number of protein forms may be characterized and linked to phenotypes. Also, many of the proteins that were directly infused resulted in broad isotopic distributions due to oxidation, salt, and other solvent adducts. When the same types of samples were analyzed by nLC, broad isotopic distributions due to adducts were typically not observed. For intact protein analysis, samples deteriorate with longer exposure to air and MS compatible solvents. Finally, the constant infusion experiments resulted in the definition of two “zoom-mapping” (see Experimental Procedures section for definition) windows for nLC MS.63,64

Fractions that were visualized to fall between 25 and 50 kDa (representative gel image, Figure S1, Supporting Information) resulted in 32 confidently identified proteins. In all, 25 fractions greater than 25 kDa were collected from GF or sIEF-GF. Generally, for each sIEF fraction, 5 fractions between 25 and 50 were generated by GF fractionation with a first generation

commercial device. The number of proteins that were identified is apparently low, due to the dynamic range problem associated with all whole cell lysate proteomic studies. In the 1000 protein report, improved GF separations were achieved with a custom device and the use of low resolution mass spectrometry to detect intact masses greater than 25 kDa. In this study, many of the abundant proteins, such as GAPDH, enolase, and actin were found across multiple fractions: e.g., the “housekeeping” protein, GAPDH, appeared in 8 fractions. Further, highly abundant large proteins (as is also the case for gel-based bottom up proteomics) are known to “smear” throughout a gel. The continued improvements to selection and separations will improve top down proteomics in the same way as for bottom- up proteomics. The limit in the number of identified proteins is due to highly abundant proteins in the same molecular weight fractions across sIEF fractions. With regard to reversed-phase nLC, the peak width for proteins less than 25 kDa is typically between 10 and 60 s, whereas the peak width for proteins greater than 35 kDa is typically 1−2 min. Improvements in the separation resins and load/peak capacity have reduced peak widths for proteins up to 25 kDa and recently up to 50 kDa.35

If the next generation of materials provides improved peak resolution, improved analytics will be observed for the number of proteins above 25 kDa and their protein species and PTM relatives. In terms of the instrumentation and platform as a whole,

the identified proteins from whole cell lysate were baseline unit mass-resolved by online nLC (Figures 2b and c) and confidently characterized with mass errors of less than 10 ppm (Table 1). Mass measurements differing by more than 10 ppm could result from in vivo modifications, such as deamidation, assignment algorithm error (e.g., by 1 Da shift in fitting a given isotopic distribution), or other adducts (greater than 1 Da mass shift). A feature map presents many baseline

Figure 2. (A) Selected mass chromatograms for proteins baseline unit mass-resolved by nano-LC ESI 14.5 T FTICR MS. (B) Chromatographic separation and baseline unit mass-resolved mass spectrum for keratin 18 (54+ charge state). Proteins identified by activation of the protein in the LTQ are listed in Table 1. (C) Identification of phosphoglycerate kinase 1 and an isoform 80 Da higher in mass (corresponding to a single phosphorylation). The blue star indicates its location in (A).

Analytical Chemistry Letter

dx.doi.org/10.1021/ac202651v | Anal. Chem. XXXX, XXX, XXX−XXXD

resolved proteins above 10:1 S/N ratio, on the basis of signal averaging over 20−40 s of the total ion chromatogram (Figure 2a).62 Many proteins up to 48 kDa were identified from a single time-domain acquisition. For example, baseline unit mass resolution for keratin 18 (48 272 and 48 216 Da) (Figure 2b) demonstrates the platform’s ability to chromatographically separate isoforms with highly similar sequence identity. Source- induced dissociation of those isoforms generated 55 fragments from the N- and C-termini and a ProSightPC expectation value of 1.61 × 10−64 (i.e., essentially certain identification).65

As mentioned earlier in the discussion, bottom-up proteomics analysis of M-phase arrested HeLa S3 cells has been shown to be up-regulated with phosphoproteins.66 Figure 2c shows identification of phosphoglycerate kinase 1 and a second peak 80 Da higher in mass. The result is based on accurate measurement of precursor and fragment ions. The result is further corroborated by other M-phase arrested HeLa S3 cell studies.66 The result is significant because many phosphopro- teins are known to be between 30 and 50 kDa in mass. Thus, as selection techniques improve along with online separations, current instrumentation may be sufficient to characterize protein form complexity. In this study, the important metric is the accurate precursor

mass measurement of larger proteins on a nLC time scale. The

duty cycle is typically 33−60 ms for each collection and transfer of 5 to 10 million charges to the ICR. Each data-dependent MS−MS/MS cycle requires ∼10 s; the overall duty cycle for the experiment is limited by the FTICR data acquisition period (0.7 s) required to produce a mass resolving power, m/Δm50% = 100 000 at m/z 800, to baseline unit mass-resolve proteins up to 60 kDa. For a given mass resolving power, the detection period varies inversely with magnetic field strength.67 Thus, at 21 T, it should be possible to achieve the same resolving power at an increased duty cycle (shorter transient acquisition period) for nLC MS. Further, at higher magnetic field, an increased number of ions may be analyzed.68 For proof of concept at 14.5 T, we were able to obtain unit mass resolution from a cell- derived 72 kDa protein (Figure 3). At a given magnetic field strength, the sensitivity, mass accuracy, and duty cycle of new ICR MS cells further improve protein identification.29,69,70

In summary, we have presented several examples of routine (and reliable) identification of proteins up to 50 kDa by 2D solution electrophoresis nLC MS top-down proteomics, with 72 kDa as the current upper limit for baseline unit mass- resolved MS at a nano-LC time scale. Standard proteins up to 78 kDa (∼1 pmol loaded on column) were baseline unit mass resolved identified with nLC MS, thereby improving the likelihood of accurate mass detection of proteins derived from

Table 1. Results from the Analysis of 25 Fractions from Isoelectric Focusing/Gel Electrophoresis Fractions

gene accession fragments E-value calc’d mass exptl mass Δ mass ppm error protein description

K1C17 Q04695 47 1.6 × 10−64 48272.10 48271.02 −1.08 −22.33 keratin, type I cytoskeletal 17 K1C18 P05783 14 4.4 × 10−17 47939.50 47938.62 −0.88 −18.30 keratin, type I cytoskeletal 18 ENOA P06733 10 8.0 × 10−6 47391.40 47391.49 0.09 1.90 alpha-enolase PGK1 A8K4W6 8 4.2 × 10−7 44896.20 44896.20 0.01 0.20 phosphoglycerate kinase ACTB Q1KLZ0 27 1.5 × 10−39 41962.89 41962.81 −0.09 −2.14 actin ALDOA P04075 13 1.5 × 10−7 39720.40 39720.40 0.03 0.77 fructose-bisphosphate aldolase A ANXA2 P07355 7 8.8 × 10−4 38718.90 38718.90 0.01 0.22 annexin A2 LDHB Q5U077 32 8.9 × 10−52 36811.20 36811.10 −0.10 −2.71 L-lactate dehydrogenase MDHC P40925 18 2.0 × 10−28 36542.10 36542.11 0.01 0.19 malate dehydrogenase, cytoplasmic ALDR P15121 8 8.7 × 10−5 36140.60 36140.52 −0.08 −2.27 aldose reductase ROA2 P22626 32 8.3 × 10−36 36082.90 36082.83 −0.07 −1.94 heterogeneous nuclear ribonucleoproteins A2/B1 G3P P04406 42 3.5 × 10−37 36070.40 36070.30 −0.05 −1.40 glyceraldehyde-3-phosphate dehydrogenase SFXN1 Q9H9B4 10 5.7 × 10−7 35792.40 35792.51 0.11 3.07 sideroflexin-1 GBLP P63244 20 5.4 × 10−37 35421.70 35421.63 −0.07 −2.05 guanine nucleotide-binding protein subunit

beta-2-like 1 RM44 Q9H9J2 16 3.0 × 10−11 34639.10 34639.15 0.05 1.44 39S ribosomal protein L44, mitochondrial RLA0L Q8NHW5 11 6.1 × 10−14 34382.70 34570.40 187.7a 5429.44 60S acidic ribosomal protein P0-like ROA1 P09651 17 3.3 × 10−24 34200.31 34281.11 80.1c 5467.89 heterogeneous nuclear ribonucleoprotein A1 MTCH2 Q9Y6C9 5 5.1 × 10−5 33847.10 33846.94 −0.16 −4.68 mitochondrial carrier homologue 2 PHB2 Q99623 9 1.5 × 10−4 33186.90 33186.85 −0.05 −1.51 D-prohibitin IPYR Q15181 18 7.1 × 10−25 33006.30 33006.31 0.01 0.36 inorganic pyrophosphatase CAZA1 P52907 8 1.5 × 10−5 32984.40 32984.45 0.05 1.52 F-actin-capping protein subunit alpha-1 NPM P06748 21 4.3 × 10−13 32725.90 32904.86 178.96a 5438.66 nucleophosmin DECR Q16698 10 4.8 × 10−9 32485.80 32486.95 1.15b 35.40 2,4-dienoyl-CoA reductase VDAC2 P45880 5 3.1 × 10−5 31970.70 31970.69 −0.01 −0.31 voltage-dependent anion-selective channel protein 2 PHB Q6PUJ7 19 6.8 × 10−12 29769.90 29752.83 −17.07 −573.58 prohibitin EXOS6 Q5RKV6 10 1.6 × 10−8 28371.71 28371.67 −0.04 −1.41 mRNA transport regulator 3 homologue HDGF Q5SZ07 19 2.2 × 10−25 28172.90 28140.37 −32.53 492.58 hepatoma-derived growth factor ETFB P38117 10 6.8 × 10−8 27965.20 27981.26 16.06 573.96 electron transfer flavoprotein subunit beta HCD2 Q99714 30 1.4 × 10−36 27045.20 27045.26 0.06 2.22 3-hydroxyacyl-CoA dehydrogenase type-2 TPIS P60174 21 2.6 × 10−20 26806.80 26806.90 −0.01 −0.37 triosephosphate isomerase RTN3 O95197 10 1.4 × 10−17 25674.60 25675.82 1.22 47.52 neuroendocrine-specific protein-like 2 PRDX6 P30041 10 5.0 × 10−5 25002.20 25002.22 0.02 0.77 peroxiredoxin-6

aIndicates that these mass errors could not be attributed to specific adducts or polymorphism. bCorresponds to a mass error of 1 Da that can be attributed to peak extraction errors. cCorresponds to identification of a phosphoprotein pair.

Analytical Chemistry Letter

dx.doi.org/10.1021/ac202651v | Anal. Chem. XXXX, XXX, XXX−XXXE

cells. Recent encouraging improvements in resolving power for samples analyzed off-line include 0.01 Da mass resolution for bovine serum albumin (66.5 kDa)69 and 0.25 Da mass accuracy for a monoclonal antibody (148 kDa).70 The relatively small number of proteins listed in Table 1 is limited mainly by the separation efficiency for large biopolymers (greater than 30 kDa in mass) and by the overabundance of “housekeeping” proteins. Moreover, the number of MS-resolved precursor ions is much greater than the number of identified proteins, limited by the current fragmentation efficiency. However, the barriers for analysis of large intact proteins should continue to fall with improvements to separation methods, control of ionization, and instrumentation.

■ ASSOCIATED CONTENT *S Supporting Information Additional information as noted in text. This material is available free of charge via the Internet at http://pubs.acs.org.

■ AUTHOR INFORMATION Corresponding Author *E-mail: marshall@magnet.fsu.edu. Phone: +1-850-644-0529. Fax: +1-850-644-1366. Present Addresses ‡Florida Center for Drug Discovery and Innovation, University of South Florida, 3702 Spectrum Blvd, Suite 175, Tampa, FL 33612−9444. ⊥Center for Theragnosis, Korea Institute of Science and Technology, Seoul, South Korea. Notes The authors declare no competing financial interest.

■ ACKNOWLEDGMENTS We gratefully acknowledge support from the NSF Division of Materials Research through DMR-06-54118, the State of

Florida, and NIH GM-067193-08. We also thank Leonid Zamdborg and Paul Thomas for helpful discussions.

■ REFERENCES (1) Kelleher, N. L.; Lin, H. Y.; Valaskovic, G. A.; Aaserud, D. J.; Fridriksson, E. K.; McLafferty, F. W. J. Am. Chem. Soc. 1999, 121, 806−812. (2) Kelleher, N. L. Anal. Chem. 2004, 76, 197A−203A. (3) McLafferty, F. W.; Breuker, K.; Jin, M.; Han, X.; Infusini, G.; Jiang, H.; Kong, X.; Begley, T. P. FEBS J. 2007, 274, 6256−6268. (4) Link, A. J.; Eng, J.; Schieltz, D. M.; Carmack, E.; Mize, G. J.; Morris, D. R.; Garvik, B. M.; Yates, J. R. Nat. Biotechnol. 1999, 17, 676−682. (5) Washburn, M. P.; Wolters, D.; Yates, J. R. Nat. Biotechnol. 2001, 19, 242−247. (6) Wisniewski, J. R.; Zougman, A.; Nagaraj, N.; Mann, M. Nat. Methods 2009, 6, 359−362. (7) Tipton, J. D.; Tran, J. C.; Catherman, A. D.; Ahlf, D. R.; Durbin, K. R.; Kelleher, N. L. J. Biol. Chem. 2011, 286, 25451−25458. (8) Sze, S. K.; Ge, Y.; Oh, H.; McLafferty, F. W. Proc. Natl. Acad. Sci. U.S.A. 2002, 99, 1774−1779. (9) Pesavento, J. J.; Kim, Y.; Taylor, G. K.; Kelleher, N. L. J. Am. Chem. Soc. 2004, 126, 2286−3387. (10) Ge, Y.; Rybakov, I. N.; Xu, Q.; Moss, R. L. Proc. Natl. Acad. Sci. U.S.A. 2009, 106, 12658−12663. (11) Siuti, N.; Kelleher, N. L. Nat. Methods 2007, 4, 817−821. (12) Ayaz-Guner, S.; Zhang, J.; Li, L.; Walker, J. W.; Ge, Y. Biochemistry 2009, 48, 8161−8170. (13) Whitelegge, J. TrAC Trends Anal. Chem. 24, 576−582. (14) Patrie, S. M.; Ferguson, J. T.; Robinson, D. E.; Whipple, D.; Rother, M.; Metcalf, W. W.; Kelleher, N. L. Mol. Cell. Proteomics 2006, 5, 14−25. (15) Ferguson, J. T.; Wenger, C. D.; Metcalf, W. W.; Kelleher, N. L. J. Am. Soc. Mass Spectrom. 2009, 20, 1743−1750. (16) Johnson, J. R.; Meng, F.; Forbes, A. J.; Cargile, B. J.; Kelleher, N. L. Electrophoresis 2002, 23, 3217−3223. (17) Du, Y.; Meng, F.; Patrie, S. M.; Miller, L. M.; Kelleher, N. L. J. Proteome Res. 2004, 3, 801−806.

Figure 3. Cell-derived proteins baseline unit mass-resolved by nLC ion-marching/nozzle-skimmer experiment. Mass resolving power = 400 000 (1.6 s transient) at m/z 400.

Analytical Chemistry Letter

dx.doi.org/10.1021/ac202651v | Anal. Chem. XXXX, XXX, XXX−XXXF

(18) Patrie, S. M.; Robinson, D. E.; Meng, F.; Kelleher, N. L. Int. J. Mass Spectrom. 2005, 234, 175−184. (19) Wenger, C. D.; Boyne, M. T. II; Ferguson, J. T.; Robinson, D. E.; Kelleher, N. L. Anal. Chem. 2008, 80, 8055−8063. (20) Tran, J. C.; Doucette, A. A. Anal. Chem. 2008, 80, 1568−1573. (21) Lee, J. E.; Kellie, J. F.; Tran, J. C.; Tipton, J. D.; Catherman, A. D.; Thomas, H. M.; Ahlf, D. R.; Durbin, K. R.; Vellaichamy, A.; Ntai, I.; Marshall, A. G.; Kelleher, N. L. J. Am. Soc. Mass Spectrom. 2009, 20, 2183−2191. (22) Vellaichamy, A.; Tran, J. C.; Catherman, A. D.; Lee, L. E.; Kellie, J. F.; Sweet, S. M. M.; Zamdborg, L.; Thomas, P. M.; D.R., A.; Durbin, K. R.; Valaskovic, G. A.; Kelleher, N. L. Anal. Chem. 2010, 82, 1234− 1244. (23) Syka, J. E. P.; Marto, J. A.; Bai, D. L.; Horning, S.; Senko, M. W.; Schwartz, J. C.; Ueberheide, B.; Garcia, B.; Busby, S. A.; Muratore, T.; Shabanowitz, J.; Hunt, D. F. J. Proteome Res. 2004, 3, 621. (24) Parks, B. A.; Jiang, L.; Thomas, P. M.; Wenger, C. D.; Roth, M. J.; Boyne, M. T. II; Burke, P. V.; Kwast, K. E.; Kelleher, N. L. Anal. Chem. 2007, 79, 7984−7991. (25) Schaub, T. M.; Hendrickson, C. L.; Horning, S.; Quinn, J. P.; Senko, M. W.; Marshall, A. G. Anal. Chem. 2008, 80, 3985−3990. (26) Makarov, A.; Denisov, E.; Kholomeev, A.; Balschun, W.; Lange, O.; Strupat, K.; Horning, S. Anal. Chem. 2006, 78, 2113−2120. (27) Karabacak, N. M.; Li, L.; Tiwari, A.; Hayward, L. J.; Hong, P.; Easterling, M. L.; Agar, J. N. Mol. Cell. Proteomics 2009, 8, 846−856. (28) Cobb, J. S.; Easterling, M. L.; Agar, J. N. J. Am. Soc. Mass Spectrom. 2010, 21, 949−959. (29) Tolmachev, A. V.; Robinson, E. W.; Wu, S.; Pasǎ-Tolic,́ L.; Smith, R. D. Int. J. Mass Spectrom. 2009, 287, 32−38. (30) Zamdborg, L.; LeDuc, R. D.; Glowacz, K. J.; Kim, Y. B.; Viswanathan, V.; Spaulding, I. T.; Early, B. P.; Bluhm, E. J.; Babai, S.; Kelleher, N. L. Nucleic Acids Res. 2007, 35, W701−W706. (31) Lee, S. W.; Berger, S. J.; Martinovic,́ S.; Pasa-Tolic,́ L.; Anderson, G. A.; Shen, Y.; Zhao, R.; Smith, R. D. Proc. Natl. Acad. Sci. U.S.A. 2002, 99, 5942−5947. (32) Zabrouskov, V.; Senko, M. W.; Du, Y.; Leduc, R. D.; Kelleher, N. L. J. Am. Soc. Mass Spectrom. 2005, 16, 2027−2038. (33) Roth, M. J.; Parks, B. A.; Ferguson, J. T.; Boyne, M. T. II; Kelleher, N. L. Anal. Chem. 2008, 80, 2857−2866. (34) Wynne, C.; Edwards, N. J.; Fenselau, C. Proteomics 2010, 10, 3631−3643. (35) Tran, J. C.; Zamdborg, L.; Ahlf, D. R.; Lee, J. E.; Catherman, A. D.; Durbin, K. R.; Tipton, J. D.; Vellaichamy, A.; Kellie, J. F.; Li, M.; Wu, C.; Sweet, S. M. M.; Early, B. P.; Siuti, N.; LeDuc, R. D.; Compton, P. D.; Thomas, P. M.; Kelleher, N. L. Nature 2011, 480, 254−258. (36) Kellie, J. F.; Catherman, A. D.; Durbin, K. R.; Tran, J. C.; Tipton, J. D.; Norris, J. L.; Witkowski, C. E.; Thomas, P. M.; Kelleher, N. L. Anal. Chem. 2011. (37) Zubarev, R. A.; Kelleher, N. L.; McLafferty, F. W. J. Am. Chem. Soc. 1998, 120, 3265−3266. (38) Horn, D. M.; Ge, Y.; McLafferty, F. W. Anal. Chem. 2000, 72, 4778−4784. (39) Cooper, H. J.; Hak̊ansson, K.; Marshall, A. G. Mass Spectrom. Rev. 2005, 24, 201−222. (40) Reid, G. E.; McLuckey, S. A. J. Mass Spectrom. 2002, 37, 663− 675. (41) Coon, J. J.; Ueberheide, B.; Syka, J. E. P.; Dryhurst, D. D.; Ausio, J.; Shabanowitz, J.; Hunt, D. F. Proc. Natl. Acad. Sci. U.S.A. 2005, 102, 9463−9468. (42) Udeshi, N. D.; Compton, P. D.; Shabanowitz, J.; Hunt, D. F.; Rose, K. L. Nat. Protoc. 2009, 3, 1709−1717. (43) McLuckey, S. A.; Goeringer, D. E. J. Mass Spectrom. 1997, 35, 461−474. (44) Woodlin, R. L.; Bomse, D. S.; Beauchamp, J. L. J. Am. Chem. Soc. 1978, 100, 3248−3250. (45) Little, D. P.; Speir, J. P.; Senko, M. W.; O’Connor, P. B.; McLafferty, F. W. Anal. Chem. 1994, 66, 2809−2815.

(46) Li, W.; Hendrickson, C. L.; Emmett, M. R.; Marshall, A. G. Anal. Chem. 1999, 71, 4397−4402. (47) Zabrouskov, V.; Whitelegge, J. P. J. Proteome Res. 2007, 6, 2205−2210. (48) Chanthamontri, C.; Jian Liu, J.; McLuckey, S. A. Int. J. Mass Spectrom. 2009, 283, 9−16. (49) Armirotti, A.; Benatti, U.; Damonte, G. Rapid Commun. Mass Spectrom. 2009, 23, 661−666. (50) Macek, B.; Waanders, L. F.; Olsen, J. V.; Mann, M. Mol. Cell. Proteomics 2006, 5, 949−958. (51) Zhai, H.; Han, X.; Breuker, K.; McLafferty, F. W. Anal. Chem. 2005, 77, 5777−5784. (52) Compton, P. D.; Zamdborg, L.; Thomas, P. M.; Kelleher, N. L. Anal. Chem. 2011, 83, 6868−6874. (53) Horn, D. M.; Zubarev, R. A.; McLafferty, F. W. J. Am. Soc. Mass Spectrom. 2000, 11, 320−332. (54) Fenn, J.; Mann, M.; Meng, C.; Wong, S.; Whitehouse, C. Science 1989, 246, 64−71. (55) Loo, J. A.; Udseth, H. R.; Smith, R. D. Anal. Biochem. 1989, 179, 404−412. (56) Hendrickson, C. L.; Emmett, M. R. Annu. Rev. Phys. Chem. 1999, 50, 517−536. (57) Han, X.; Jin, M.; Breuker, K.; McLafferty, F. W. Science 2006, 314, 109. (58) Breuker, K.; McLafferty, F. W. Proc. Natl. Acad. Sci. 2008, 105, 18145−18152. (59) Tran, J. C.; Doucette, A. A. J. Proteomic Res. 2008, 7, 1761− 1765. (60) Tran, J. C.; Doucette, A. A. Anal. Chem. 2009, 81, 6201−6209. (61) Wilcox, B. E.; Hendrickson, C. L.; Marshall, A. G. J. Am. Soc. Mass Spectrom. 2002, 13, 1304−1312. (62) Durbin, K. R.; Tran, J. C.; Zamdborg, L.; Sweet, S. M. M.; Catherman, A. D.; Lee, J. E.; Li, M.; Kellie, J. F.; Kelleher, N. L. Proteomics 2010, 10, 3589−3597. (63) Tipton, J. D.; Kellie, J. F.; Thomas, P. M.; D., A.; Hendrickson, C. L.; Kelleher, N. L.; Marshall, A. G. 57th ASMS Conf. on Mass Spectrometry & Allied Topics, Philadelphia, PA, May 31−June 4, 2009. (64) Tipton, J. D.; Lee, J. E.; Catherman, A. C.; Durbin, K. R.; Hendrickson, C. L.; Emmett, M. R.; Marshall, A. G.; Kelleher, N. L. 58th Amer. Soc. for Mass Spectrom. Annual Conf. on Mass Spectrometry & Allied Topics, Salt Lake City, UT, May 23−27, 2010. (65) LeDuc, R. D.; Taylor, G. K.; Kim, Y. B.; Januszyk, T. E.; Bynum, L. H.; Sola, J. V.; Garavelli, J. S.; Kelleher, N. L. Nucleic Acids Res. 2004, 32, W340−W345. (66) Olsen, J. V.; Vermeulen, M.; Santamaria, A.; Kumar, C.; Miller, M. L.; Jensen, L. J.; Gnad, F.; Cox, J.; Jensen, T. S.; Nigg, E. A.; Brunak, S.; Mann, M. Sci. Signal. 2010, 3, ra3. (67) Marshall, A. G.; Hendrickson, C. L.; Jackson, G. S. Mass Spectrom. Rev. 1998, 17, 1−35. (68) Marshall, A. G.; Hendrickson, C. L. Int. J. Mass Spectrom. 2002, 215, 59−75. (69) Nikolaev, E. N.; Boldin, I. A.; Jertz, R.; Baykut, G. J. J. Am. Soc. Mass Spectrom. 2011, 22, 1125−1133. (70) Valeja, S. G.; Kaiser, N. K.; Xian, F.; Hendrickson, C. L.; Rouse, J. C.; Marshall, A. G. Anal. Chem. 2011, 83, 8391−8395.

Analytical Chemistry Letter

dx.doi.org/10.1021/ac202651v | Anal. Chem. XXXX, XXX, XXX−XXXG