pFind Studio: a computational solution for mass spectrometry-based proteomics
2021
Molecular & Cellular Proteomics2021. Chen, ZW et al.
Univ Wisconsin, Dept Chem, 1101 Univ Ave, Madison, WI 53706 USA.
ABSTRACT:As the body fluid that directly interchanges with the extracellular fluid of the central nervous system (CNS), cerebrospinal fluid (CSF) serves as a rich source for CNS-related disease biomarker discovery. Extensive proteome profiling has been conducted for CSF, but studies aimed at unraveling site-specific CSF N-glycoproteome are lacking. Initial efforts into site-specific N-glycoproteomics study in CSF yield limited coverage, hindering further experimental design of glycosylation-based disease biomarker discovery in CSF. In the present study, we have developed an N-glycoproteomic approach that combines enhanced N-glycopeptide sequential enrichment by hydrophilic interaction chromatography (HILIC) and boronic acid enrichment with electron transfer and higher-energy collision dissociation (EThcD) for large-scale intact N-glycopeptide analysis. The application of the developed approach to the analyses of human CSF samples enabled identifications of a total of 2893 intact N-glycopeptides from 511 N-glycosites and 285 N-glycoproteins. To our knowledge, this is the largest site-specific N-glycoproteome dataset reported for CSF to date. Such dataset provides molecular basis for a better understanding of the structure-function relationships of glycoproteins and their roles in CNS-related physiological and pathological processes. As accumulating evidence suggests that defects in glycosylation are involved in Alzheimer's disease (AD) pathogenesis, in the present study, a comparative indepth N-glycoproteomic analysis was conducted for CSF samples from healthy control and AD patients, which yielded a comparable N-glycoproteome coverage but a distinct expression pattern for different categories of glycoforms, such as decreased fucosylation in AD CSF samples. Altered glycosylation patterns were detected for a number of N-glycoproteins including alpha-1-antichymotrypsin, ephrin-A3 and carnosinase CN1 etc., which serve as potentially interesting targets for further glycosylation-based AD study and may eventually lead to molecular elucidation of the role of glycosylation in AD progression.
Use: pGlyco
Glycobiology2021. Zhou, DP et al.
Tongji Univ, Sch Med, 1239 Siping Rd, Shanghai 200092, Peoples R China.
ABSTRACT:Coronaviruses hijack human enzymes to assemble the sugar coat on their spike glycoproteins. The mechanisms by which human antibodies may recognize the antigenic viral peptide epitopes hidden by the sugar coat are unknown. Glycosylation by insect cells differs from the native form produced in human cells, but insect cell-derived influenza vaccines have been approved by the US Food and Drug Administration. In this study, we analyzed recombinant severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike protein secreted from BTI-Tn-5B1-4 insect cells, by trypsin and chymotrypsin digestion followed by mass spectrometry analysis. We acquired tandem mass spectrometry (MS/MS) spectrums for glycopeptides of all 22 predicted N-glycosylated sites. We further analyzed the surface accessibility of spike proteins according to cryogenic electron microscopy and homolog-modeled structures and available antibodies that bind to SARS-CoV-1. All 22 N-glycosylated sites of SARS-CoV-2 are modified by high-mannose N-glycans. MS/MS fragmentation clearly established the glycopeptide identities. Electron densities of glycans cover most of the spike receptor-binding domain of SARS-CoV-2, except YQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQ, similar to a region FSPDGKPCTPPALNCYWPLNDYGFYTTTGIGYQ in SARS-CoV-1. Other surfaceexposed domains include those located on central helix, connecting region, heptad repeats and N-terminal domain. Because the majority of antibody paratopes bind to the peptide portion with or without sugar modification, we propose a snake-catching model for predicted paratopes: a minimal length of peptide is first clamped by a paratope and sugar modifications close to the peptide either strengthen or do not hinder the binding.
Use: pGlyco
ANALYTICA CHIMICA ACTA2021. He, FX et al.
Sun Yat Sen Univ, Guangdong Prov Key Lab Biomed Imaging, Affiliated Hosp 5, Zhuhai 519000, Peoples R China.
ABSTRACT:Gut ecosystem has profound effects on host physiology and health. Gastrointestinal (GI) symptoms were frequently observed in patients with COVID-19. Compared with other organs, gut antiviral response can result in more complicated immune responses because of the interactions between the gut microbiota and host immunity. However, there are still large knowledge gaps in the impact of COVID-19 on gut molecular profiles and commensal microbiome, hindering our comprehensive understanding of the pathogenesis of SARS-CoV-2 and the treatment of COVID-19. We performed longitudinal stool multiomics profiling to systemically investigate the molecular phenomics alterations of gut ecosystem in COVID-19. Gut proteomes of COVID-19 were characterized by disturbed immune, proteolysis and redox homeostasis. The expression and glycosylation of proteins involved in neutrophil degranulation and migration were suppressed, while those of proteases were upregulated. The variable domains of Ig heavy chains were downregulated and the overall glycosylation of IgA heavy chain constant regions, IgGFcbinding protein, and J chain were suppressed with glycan-specific variations. There was a reduction of beneficial gut bacteria and an enrichment of bacteria derived deleterious metabolites potentially associated with multiple types of diseases (such as ethyl glucuronide). The reduction of Ig heave chain variable domains may contribute to the increase of some Bacteroidetes species. Many bacteria ceramide lipids with a C17-sphingoid based were downregulated in COVID-19. In many cases, the gut phenome did not restore two months after symptom onset. Our study indicates widely disturbed gut molecular profiles which may play a role in the development of symptoms in COVID-19. Our findings also emphasis the need for ongoing investigation of the long-term gut molecular and microbial alterations during COVID-19 recovery process. Considering the gut ecosystem as a potential target could offer a valuable approach in managing the disease. (c) 2021 Published by Elsevier B.V.
Use: pGlyco
Annals of Neurology2021. Ligezka, AN et al.
Mayo Clin, Dept Clin Genom, 200 First St SW, Rochester, MN 55905 USA.
ABSTRACT:Objective Epalrestat, an aldose reductase inhibitor increases phosphomannomutase (PMM) enzyme activity in a PMM2-congenital disorders of glycosylation (CDG) worm model. Epalrestat also decreases sorbitol level in diabetic neuropathy. We evaluated the genetic, biochemical, and clinical characteristics, including the Nijmegen Progression CDG Rating Scale (NPCRS), urine polyol levels and fibroblast glycoproteomics in patients with PMM2-CDG. Methods We performed PMM enzyme measurements, multiplexed proteomics, and glycoproteomics in PMM2-deficient fibroblasts before and after epalrestat treatment. Safety and efficacy of 0.8 mg/kg/day oral epalrestat were studied in a child with PMM2-CDG for 12 months. Results PMM enzyme activity increased post-epalrestat treatment. Compared with controls, 24% of glycopeptides had reduced abundance in PMM2-deficient fibroblasts, 46% of which improved upon treatment. Total protein N-glycosylation improved upon epalrestat treatment bringing overall glycosylation toward the control fibroblasts' glycosylation profile. Sorbitol levels were increased in the urine of 74% of patients with PMM2-CDG and correlated with the presence of peripheral neuropathy, and CDG severity rating scale. In the child with PMM2-CDG on epalrestat treatment, ataxia scores improved together with significant growth improvement. Urinary sorbitol levels nearly normalized in 3 months and blood transferrin glycosylation normalized in 6 months. Interpretation Epalrestat improved PMM enzyme activity, N-glycosylation, and glycosylation biomarkers in vitro. Leveraging cellular glycoproteome assessment, we provided a systems-level view of treatment efficacy and discovered potential novel biosignatures of therapy response. Epalrestat was well-tolerated and led to significant clinical improvements in the first pediatric patient with PMM2-CDG treated with epalrestat. We also propose urinary sorbitol as a novel biomarker for disease severity and treatment response in future clinical trials in PMM2-CDG. ANN NEUROL 2021
Use: pGlyco
JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY2021. Wang, ZY et al.
Chinese Acad Sci, Dalian Inst Chem Phys, CAS Key Lab Separat Sci Analyt Chem, Dalian 116023, Peoples R China.
ABSTRACT:Protein N-glycosylation in human milk whey plays a substantial role in infant health during postnatal development. Changes in site-specific glycans in milk whey reflect the needs of infants under different circumstances. However, the conventional glycoproteomics analysis of milk whey cannot reveal the changes in site-specific glycans because the attached glycans are typically enzymatically removed from the glycoproteins prior to analysis. In this study, N-glycoproteomics analysis of milk whey was performed without removing the attached glycans, and 330 and 327 intact glycopeptides were identified in colostrum and mature milk whey, respectively. Label-free quantification of site-specific glycans was achieved by analyzing the identified intact glycopeptides, which revealed 9 significantly upregulated site-specific glycans on 6 glycosites and 11 significantly downregulated sitespecific glycans on 8 glycosites. Some interesting change trends in N-glycans attached to specific glycosites in human milk whey were observed. Bisecting GlcNAc was found attached to 11 glycosites on 8 glycoproteins in colostrum and mature milk. The dynamic changes in site-specific glycans revealed in this study provide insights into the role of protein N-glycosylation during infant development.
Use: pQuant; pGlyco
Analytical Chemistry2021. Liu, LY et al.
Chinese Acad Sci, Dalian Inst Chem Phys, CAS Key Lab Separat Sci Analyt Chem, Dalian 116023, Peoples R China.
ABSTRACT:Bottom-up proteomics has been increasingly applied in clinical research to study the disease pathophysiology and to discover disease biomarkers. However, glycoproteomic analysis always requires tedious experimental steps for intact glycopeptide enrichment, which has been the technique bottleneck for large-scale analysis of clinical samples. Herein, we developed an automated glycopeptide enrichment method for the analysis of serum site-specific N-glycoproteome. This automated method allowed for processing one sample within 20 min. It showed higher enrichment specificity, more intact glycopeptide identifications, and better quantitative reproducibility than the traditional manual method using microtip enrichment devices. We further applied this method to investigate the serum site-specific N-glycosylation changes between four patients with pancreatic cancer and seven healthy controls. The principal component analysis of intact N-glycopeptides showed good clustering across cancer and normal groups. Furthermore, we found that the site-specific glycoforms, monofucosylated and nonsialylated oligosaccharides, on IgG1 site 180 expressed a significant decrease in pancreatic cancer patients compared to healthy controls. Together, the automated method is a powerful tool for site-specific N-glycoproteomic analysis of complex biological samples, and it has great potential for clinical utilities.
Use: pGlyco
Analytical Chemistry2021. Fang, P et al.
Max Planck Inst Biophys Chem, Bioanalyt Mass Spectrometry Grp, D-37077 Gottingen, Germany.
ABSTRACT:The heterogeneity and complexity of glycosylation hinder the depth of site-specific glycoproteomics analysis. High-field asymmetric-waveform ion-mobility spectrometry (FAIMS) has been shown to improve the scope of bottom-up proteomics. The benefits of FAIMS for quantitative N-glycoproteomics have not been investigated yet. In this work, we optimized FAIMS settings for N-glycopeptide identification, with or without the tandem mass tag (TMT) label. The optimized FAIMS approach significantly increased the identification of site-specific N-glycopeptides derived from the purified immunoglobulin M (IgM) protein or human lymphoma cells. We explored in detail the changes in FAIMS mobility caused by N-glycopeptides with different characteristics, including TMT labeling, charge state, glycan type, peptide sequence, glycan size, and precursor m/z. Importantly, FAIMS also improved multiplexed N-glycopeptide quantification, both with the standard MS2 acquisition method and with our recently developed Glyco-SPS-MS3 method. The combination of FAIMS and Glyco-SPS-MS3 methods provided the highest quantitative accuracy and precision. Our results demonstrate the advantages of FAIMS for improved mass spectrometry-based qualitative and quantitative N-glycoproteomics.
Use: pGlyco; pQuant
molecules2021. Hackett, WE et al.
Boston Univ, Bioinformat Program, Boston, MA 02215 USA.
ABSTRACT:Protein glycosylation that mediates interactions among viral proteins, host receptors, and immune molecules is an important consideration for predicting viral antigenicity. Viral spike proteins, the proteins responsible for host cell invasion, are especially important to be examined. However, there is a lack of consensus within the field of glycoproteomics regarding identification strategy and false discovery rate (FDR) calculation that impedes our examinations. As a case study in the overlap between software, here as a case study, we examine recently published SARS-CoV-2 glycoprotein datasets with four glycoproteomics identification software with their recommended protocols: GlycReSoft, Byonic, pGlyco2, and MSFragger-Glyco. These software use different Target-Decoy Analysis (TDA) forms to estimate FDR and have different database-oriented search methods with varying degrees of quantification capabilities. Instead of an ideal overlap between software, we observed different sets of identifications with the intersection. When clustering by glycopeptide identifications, we see higher degrees of relatedness within software than within glycosites. Taking the consensus between results yields a conservative and non-informative conclusion as we lose identifications in the desire for caution; these non-consensus identifications are often lower abundance and, therefore, more susceptible to nuanced changes. We conclude that present glycoproteomics softwares are not directly comparable, and that methods are needed to assess their overall results and FDR estimation performance. Once such tools are developed, it will be possible to improve FDR methods and quantify complex glycoproteomes with acceptable confidence, rather than potentially misleading broad strokes.
Use: pGlyco
Molecular Omics2021. Saraswat, M et al.
Mayo Clin, Dept Lab Med & Pathol, Rochester, MN 55905 USA.
ABSTRACT:Several plasma glycoproteins are clinically useful as biomarkers in a variety of diseases. Although thousands of proteins are present in plasma, >95% of the plasma proteome by mass is represented by only 22 proteins. This necessitates strategies to deplete the abundant proteins and enrich other subsets of proteins. Although glycoproteins are abundant in plasma, in routine proteomic analyses, glycopeptides are not often investigated. Traditional methods such as lectin-based enrichment of glycopeptides followed by deglycosylation have helped understand the glycoproteome, but they lack any information about the attached glycans. Here, we apply size-exclusion chromatography (SEC) as a simple strategy to enrich intact N-glycopeptides based on their larger size which achieves broad selectivity regardless of the nature of attached glycans. Using this approach, we identified 1317 N-glycopeptides derived from 266 glycosylation sites on 154 plasma glycoproteins. The deep coverage achieved by this approach was evidenced by extensive heterogeneity that was observed. For instance, 20-100 glycopeptides were observed per protein for the 15 most-glycosylated glycoproteins. Notably, we discovered 615 novel glycopeptides of which 39 glycosylation sites (from 38 glycoproteins) were not included in protein databases such as Uniprot and GlyConnectDB. Finally, we also identified 12 novel glycopeptides containing di-sialic acid, which is a rare glycan epitope. Our results demonstrate the utility of SEC for efficient LC-MS/MS-based deep glycoproteomics analysis of human plasma. Overall, the SEC-based method described here is a simple, rapid and high-throughput strategy for characterization of any glycoproteome.
Use: pGlyco
Infectious Microbes & Diseases2021. Sun, ZY et al.
State Key Laboratory for the Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, No.79 Qingchun Road, Shangcheng District, Hangzhou 310003, China
ABSTRACT:The devastating coronavirus disease 2019 (COVID-19) pandemic has prompted worldwide efforts to study structural biological traits of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and its viral components. Compared to the Spike protein, which is the primary target for currently available vaccines or antibodies, knowledge about other virion structural components is incomplete. Using high-resolution mass spectrometry, we report a comprehensive post-translational modification (PTM) analysis of nucleocapsid phosphoprotein (NCP), the most abundant structural component of the SARS-CoV-2 virion. In addition to phosphoryl groups, we show that the SARS-CoV-2 NCP is decorated with a variety of PTMs, includingN-glycans and ubiquitin. Based on newly identified PTMs, refined protein structural models of SARS-CoV-2 NCP were proposed and potential immune recognition epitopes of NCP were aligned with PTMs. These data can facilitate the design of novel vaccines or therapeutics targeting NCP, as valuable alternatives to the current vaccination and treatment paradigm that is under threat of the ever-mutating SARS-CoV-2 Spike protein.
Use: pGlyco