Many research questions in fields such as personalized medicine, drug screens or systems biology depend on obtaining consistent and quantitatively accurate proteomics data from many samples. SWATH‐MS is a specific variant of data‐independent acquisition (DIA) methods and is emerging as a technology that combines deep proteome coverage capabilities with quantitative consistency and accuracy. In a SWATH‐MS measurement, all ionized peptides of a given sample that fall within a specified mass range are fragmented in a systematic and unbiased fashion using rather large precursor isolation windows. To analyse SWATH‐MS data, a strategy based on peptide‐centric scoring has been established, which typically requires prior knowledge about the chromatographic and mass spectrometric behaviour of peptides of interest in the form of spectral libraries and peptide query parameters. This tutorial provides guidelines on how to set up and plan a SWATH‐MS experiment, how to perform the mass spectrometric measurement and how to analyse SWATH‐MS data using peptide‐centric scoring. Furthermore, concepts on how to improve SWATH‐MS data acquisition, potential trade‐offs of parameter settings and alternative data analysis strategies are discussed.
The complete and specific proteolytic cleavage of protein samples into peptides is crucial for the success of every shotgun LC-MS/MS experiment. In particular, popular peptide-based label-free and targeted mass spectrometry approaches rely on efficient generation of fully cleaved peptides to ensure accurate and sensitive protein quantification. In contrast to previous studies, we globally and quantitatively assessed the efficiency of different digestion strategies using a yeast cell lysate, label-free quantification, and statistical analysis. Digestion conditions include double tryptic, surfactant-assisted, and tandem-combinatorial Lys-C/trypsin digestion. In comparison to tryptic digests, Lys-C/trypsin digests were found most efficient to yield fully cleaved peptides while reducing the abundance of miscleaved peptides. Subsequent sequence context analysis revealed improved digestion performances of Lys-C/trypsin for miscleaved sequence stretches flanked by charged basic and particulary acidic residues. Furthermore, targeted MS analysis demonstrated a more comprehensive protein cleavage only after Lys-C/trypsin digestion, resulting in a more accurrate absolute protein quantification and extending the number of peptides suitable for SRM assay development. Therefore, we conclude that a serial Lys-C/trypsin digestion is highly attractive for most applications in quantitative MS-based proteomics building on in-solution digestion schemes.
Mycobacterium tuberculosis remains a health concern due to its ability to enter a non-replicative dormant state linked to drug resistance. Understanding transitions into and out of dormancy will inform therapeutic strategies. We implemented a universally applicable, label-free approach to estimate absolute cellular protein concentrations on a proteome-wide scale based on SWATH mass spectrometry. We applied this approach to examine proteomic reorganization of M. tuberculosis during exponential growth, hypoxia-induced dormancy, and resuscitation. The resulting data set covering >2,000 proteins reveals how protein biomass is distributed among cellular functions during these states. The stress-induced DosR regulon contributes 20% to cellular protein content during dormancy, whereas ribosomal proteins remain largely unchanged at 5%-7%. Absolute protein concentrations furthermore allow protein alterations to be translated into changes in maximal enzymatic reaction velocities, enhancing understanding of metabolic adaptations. Thus, global absolute protein measurements provide a quantitative description of microbial states, which can support the development of therapeutic interventions.
SUMMARY Research advancing our understanding of Mycobacterium tuberculosis (Mtb) biology and complex host-Mtb interactions requires consistent and precise quantitative measurements of Mtb proteins. We describe the generation and validation of a compendium of assays to quantify 97% of the 4,012 annotated Mtb proteins by the targeted mass spectrometric method selected reaction monitoring (SRM). Furthermore, we estimate the absolute abundance for 55% of all Mtb proteins, revealing a dynamic range within the Mtb proteome of over four orders of magnitude, and identify previously un-annotated proteins. As an example of the assay library utility, we monitored the entire Mtb dormancy survival regulon (DosR), which is linked to anaerobic survival and Mtb persistence, and show its dynamic protein-level regulation during hypoxia. In conclusion, we present a publicly available research resource that supports the sensitive, precise, and reproducible quantification of virtually any Mtb protein by a robust and widely accessible mass spectrometric method.
The consistent detection and quantification of protein post-translational modifications (PTMs) across sample cohorts is an essential prerequisite for the functional analysis of biological processes. Data-independent acquisition (DIA), a bottom-up mass spectrometry based proteomic strategy, exemplified by SWATH-MS, provides complete precursor and fragment ion information of a sample and thus, in principle, the information to identify peptidoforms, the modified variants of a peptide. However, due to the convoluted structure of DIA data sets the confident and systematic identification and quantification of peptidoforms has remained challenging. Here we present IPF (Inference of PeptidoForms), a fully automated algorithm that uses spectral libraries to query, validate and quantify peptidoforms in DIA data sets. The method was developed on data acquired by SWATH-MS and benchmarked using a synthetic phosphopeptide reference data set and phosphopeptide-enriched samples. The data indicate that IPF reduced false site-localization by more than 7-fold in comparison to previous approaches, while recovering 85.4% of the true signals. IPF was applied to detect and quantify peptidoforms carrying ten different types of PTMs in DIA data acquired from more than 200 samples of undepleted blood plasma of a human twin cohort. The data approportioned, for the first time, the contribution of heritable, environmental and longitudinal effects on the observed quantitative variability of specific modifications in blood plasma of a human population.
As the SARS-CoV-2 pandemic continues to spread, thousands of scientists around the globe have changed research direction to understand better how the virus works and to find out how it may be tackled. The number of manuscripts on preprint servers is soaring and peer-reviewed publications using mass spectrometry-based proteomics are beginning to emerge. To facilitate proteomic research on SARS-CoV-2, this report presents deep-scale proteomes (10,000 proteins; >130,000 peptides) of common cell line models, notably Vero E6, Calu-3, Caco-2, and ACE2-A549 that characterize their protein expression profiles including viral entry factors such as ACE2 or TMPRSS2. Using the 9 kDa protein SRP9 and the breast cancer oncogene BRCA1 as examples, we show how the proteome expression data can be used to refine the annotation of protein-coding regions of the African green monkey and the Vero cell line genomes. Monitoring changes of the proteome upon viral infection revealed widespread expression changes including transcriptional regulators, protease inhibitors, and proteins involved in innate immunity. Based on a library of 98 stable-isotope labeled synthetic peptides representing 11 SARS-CoV-2 proteins, we developed PRM (parallel reaction monitoring) assays for nano-flow and micro-flow LC-MS/MS. We assessed the merits of these PRM assays using supernatants of virus-infected Vero E6 cells and challenged the assays by analyzing two diagnostic cohorts of 24 (+30) SARS-CoV-2 positive and 28 (+9) negative cases. In light of the results obtained and including recent publications or manuscripts on preprint servers, we critically discuss the merits of mass spectrometry-based proteomics for SARS-CoV-2 research and testing.
Accurate measurements of cellular protein concentrations are invaluable to quantitative studies of gene expression and physiology in living cells. Here, we developed a versatile mass spectrometric workflow based on data-independent acquisition proteomics (DIA/SWATH) together with a novel protein inference algorithm (xTop). We used this workflow to accurately quantify absolute protein abundances in Escherichia coli for > 2,000 proteins over > 60 growth conditions, including nutrient limitations, non-metabolic stresses, and non-planktonic states. The resulting high-quality dataset of protein mass fractions allowed us to characterize proteome responses from a coarse (groups of related proteins) to a fine (individual) protein level. Hereby, a plethora of novel biological findings could be elucidated, including the generic upregulation of low-abundant proteins under various metabolic limitations, the non-specificity of catabolic enzymes upregulated under carbon limitation, the lack of largescale proteome reallocation under stress compared to nutrient limitations, as well as surprising strain-dependent effects important for biofilm formation. These results present valuable resources for the systems biology community and can be used for future multi-omics studies of gene regulation and metabolic control in E. coli.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.