Combined analysis of cleavage sites and glycosylation in a complex mixture

 

In one of our analyses of human serum samples using the patented ProQuant® DDA proteomics platform designed to investigate peptide, PTM and cleavages in a non-hypothesis manner, we noticed that the serum protein hemopexin was found with a range of residues lost from its N-terminus. We detected and quantified the following peptides:

TPLPPTSAHGNVAEGETKPDPDVTER
LPPTSAHGNVAEGETKPDPDVTER
PPTSAHGNVAEGETKPDPDVTER
PTSAHGNVAEGETKPDPDVTER
TSAHGNVAEGETKPDPDVTER
SAHGNVAEGETKPDPDVTER
AHGNVAEGETKPDPDVTER
HGNVAEGETKPDPDVTER
GNVAEGETKPDPDVTER
NVAEGETKPDPDVTER
VAEGETKPDPDVTER

In addition to these unmodified peptides, some of the same peptides were found O-glycosylated, with the following saccharides identified as being present in this region, consistent with published data.

 

  • Hex(1)HexNAc(1)NeuAc(1)
  • Hex(1)HexNAc(1)NeuAc(2)

The following peptides were found glycosylated: 

TPLPPTSAHGNVAEGETK          *
TPLPPTSAHGNVAEGETKPDPDVTER  *
TPLPPTSAHGNVAEGETKPDPDVTER **
  LPPTSAHGNVAEGETKPDPDVTER  *
  LPPTSAHGNVAEGETKPDPDVTER **

Some peptides have two separate glycosylations, as denoted by asterisks. While the software we use determined a location for the glycosylations, careful investigation of the MS2 fragmentation patterns revealed that the locations of the glycosylations were ambiguous, with multiple potential O-linked glycosylation sites (e.g. Thr-1, Thr-6, Ser-7, Thr-17 and Thr-24 on the longest peptide). We reflect that by showing peptides that contain one or more glycosylations but not their location.

Identities of all the above peptides were checked manually using our purpose-built suite of tools.

The precision with which the abundance of individual peptides can be determined using our ProQuant ® platform means into that we can investigate potential interesting biology associated with our observations. We started by determining whether there was a relationship between residues being lost at the N-terminus and the O-glycosylation. We quantified the proportion of the protein that has had residues lost from its N-terminus, and looked for the potential effect of glycosylation of the region. We can see clearly that this loss of residues is much more frequent in aglycosylated protein than it is on the glycosylated form.

Each data point represents the fraction of hemopexin that has lost N-terminal residues in each serum sample, each from a different study subject.

What is more, the precision with which we have determined the peptide abundances using ProQuant® enables us to investigate whether there is an association between the fraction of N-terminal residue loss between glycosylated and aglycosylated proteoforms even within an individual. We observe a strong positive correlation between the fraction of glycosylated peptides with N-terminal residue loss and the fraction of aglycosylated peptides with N-terminal residue loss, despite the difference in average fractional residue loss between these two peptide types.

Each data point represents the fraction of hemopexin that has lost N-terminal residues in each serum sample, each from a different study subject.

We inferred from this data that there may be biological relevance to the interplay between glycosylation and N-terminal residue loss. We therefore compared the fraction of protein that has lost residues at its N-terminus with a huge range of biochemical, immunological, medical and biomarker data from the same subjects. There is a potentially exciting association between the loss of residues at the N-terminus of hemopexin and both a medical phenotype and other proteomics data, which has opened up a whole new field of research for us, but which we cannot currently share publicly.

This technical note highlights a number of key features of RxCelerate’s ProQuant® platform. Firstly, observations such as these cannot be obtained without the incredible underlying precision inherent in our data at the peptide level. Secondly, that data has to be interrogated in a way that is open to identifying unusual patterns. Outputting a list of proteins with their relative abundances in the samples is simply insufficient in a world in which the importance of proteoforms is apparent and obvious. The proteomics team here at RxCelerate have a background in the research and investigation into proteoforms carried out as part of Methuselah Health, where we investigated the role of proteoforms in ageing and ageing-related diseases. When you are interested in finding out as much as possible about the proteins, PTMs and proteoforms in your sample, speak to the experts!

Details of the calculations

The calculations of fraction of protein that has lost N-terminal residues were made as follows: 

Fraction of aglycosylated protein that has lost N-terminal residues = Sum of abundance of aglycosylated peptides that contain VAEGETK but not the initial two residues TP divided by the sum of abundance of aglycosylated peptides that contain VAEGETK.

Fraction of glycosylated protein that has lost N-terminal residues = Sum of abundance of glycosylated peptides that contain VAEGETK but not the initial two residues TP divided by the sum of abundance of glycosylated peptides that contain VAEGETK.

 

Read more case studies