Site-specific glycosylation (SSG) analysis can be used to profile recombinant protein therapeutics, as glycans N- or C-attached to amino acid residues affect folding, structure and function. SSG analysis will reveal which glycosylation sites are occupied on a protein and will determine the microheterogeneity at each glycosylation site. Glycans at specific sites may have different effects on protein function so determining which glycoforms are present at which site can reveal how they may affect drug performance and safety. Pinpointing glycans on specific amino acid locations is achievable through mass spectrometry, including intact glycopeptide analysis or tandem fragmentation.
The most common post translational modification in recombinant biotherapeutics is glycosylation. Protein glycosylation affects protein folding, protein stability, PK and protein function. Glycosylation can occur co- or post-translationally by enzymatic addition of glycans to either asparagine residues (N-linked) or serine/threonine residues (O-linked). Glycosylation often leads to heterogeneity with multiple glycoforms of a protein. Glycosylation can affect protein solubility, protein folding stability, and protease sensitivity. Protein glycan binding can change protein function by modifying receptor binding, effector function or clearance rate.
Glycosylation is assigned as a CQA as changes in glycan composition or occupancy at any site will impact the clinical performance and safety. Some regulatory authorities have acknowledged that changes in glycosylation that may appear small, such as altering levels of sialylation, changes to fucosylation patterns or changes to antennary types, can have large effects on immunogenicity, potency of effector function, and clearance. Because of this variability, controls must be identified including cell line, culture conditions, and purification methods to ensure batch-to-batch consistency. Glycosylation is defined as a CQA, thus it must be qualified via validated methods with established acceptance criteria. This ensures that product released has met all defined criteria for glycan specification.
Fig. 1 Protein glycosylation classification.1,5
Recombinant protein therapeutics are characterized by significant structural complexity due to N-linked glycosylation and O-linked glycosylation. The former is attached to Asn residues in certain sequons while the latter is attached to Ser or Thr residues. The occupancy of each glycosylation site can vary leading to macroheterogeneity. Additionally, each glycosylation site can have variable glycoforms attached ranging from high mannose to complex types with different terminal sugars leading to microheterogeneity. Both high macro- and micro-heterogeneity can be present in therapeutic proteins and lead to heterogeneous populations that need to be thoroughly analyzed batch-to-batch.
Table 1 Impact of Site-Specific Glycosylation on Therapeutic Protein Properties
| Glycosylation Aspect | Functional Impact | Analytical Consideration |
| Protein folding | Conformational stability | Site occupancy assessment |
| Biological activity | Receptor binding modulation | Microheterogeneity mapping |
| Pharmacokinetics | Clearance rate determination | Terminal sugar analysis |
| Immunogenicity | Epitope presentation | Non-human glycan detection |
To avoid immunogenicity issues, glycosylation must also be controlled so as to avoid sugar sequences that are immunogenic such as high mannose glycans, incomplete glycans, galactosealpha1,3-galactose (gal), or glycans containing glycostructures not present in humans. Glycan safety evaluations are needed to rule out hypersensitivity reactions and anti-glycan antibody responses. Process robustness is also expected by regulatory agencies, demonstrated through batch-to-batch consistency of glycosylation. Glycan analysis should demonstrate throughout multiple batches during the life-cycle of a product that glycosylation is consistent and within acceptable limits. Biosimilar manufacturers must match glycosylation patterns of their reference product in order to claim therapeutic equivalence.
SSG analysis is a method that determines which glycans are attached to each amino acid on a therapeutic protein. SSG, in contrast to released glycan analysis, leaves intact each glycan's linkage to its place of attachment. SSG analysis provides information on what glycoforms are present at each glycosylation site. SSG can be used to better understand a protein's structure and function through detailed structural analysis, control the protein production process, and confirm bio-similarism.
In general, glycopeptide LC-MS/MS analysis involves the digestion with protease enzymes (commonly trypsin or Glu-C) to produce glycopeptides, separation of glycopeptides by RPC or HILIC, ionization and detection of eluents with HRMS for accurate mass measurement of precursor ions. The ions are then fragmented by CID or ETD. Fragment ions are then analyzed to determine the structure of the glycan and peptide sequence allowing for site-specific glycan identification.
Site occupancy concerns determination of which sites along the peptide sequence that can potentially be glycosylated are actually modified. Fully occupied sites can be distinguished from partially glycosylated/unoccupied sites, which cause protein macroheterogeneity. Relative quantification methods determine the relative abundance of each glycoform present at each site by integration of extracted ion chromatograms or spectral counting. This can be used to determine the microheterogeneity landscape commonly observed for glycosylation on recombinant proteins. Determination of low abundant glycoforms necessitates sensitive detection and enrichment strategies to identify glycoforms that may be present at low levels but still retain critical biological importance.
Site resolved profiling circumvents limitations associated with released glycan methods by maintaining attachment site information allowing linkage between the saccharide composition and its location on the protein structure. Functional correlation can then be made by linking a specific site-glycan with a function (e.g. receptor binding or half life). Comparability studies are enhanced since site similarity can be shown instead of just showing distributions of glycans are similar. This allows regulatory agencies to see proof of glycosylation biosimilarity needed for approval.
Fig. 2 MALDI MS spectrum of the glycans released from the fusion protein.2,5
Glycosylation profiling has been used as a basic analytical tool during development of recombinant proteins. During development, glycosylation characterization is employed to choose structural candidates for advancement, optimize production methods and control production. Creation of glycosylation profiles allow researchers to determine how glycosylation affects protein therapeutics and ensures glycosylation remains consistent when increasing production. Understanding glycosylation structure can affect a protein's efficacy, safety and regulatory concerns.
Early profiling allows for the identification of detrimental glycosylation that may impact efficacy or manufacturability. These glycans can consist of site specific microheterogeneity or high mannose/non-human glycan epitopes that have the potential to alter PK, immunogenicity or effector function. The glycosylation profiling data can then be used to aid molecular engineering decisions by identifying which glycosylation sites need to be mutated/altered or deleted to optimize the developability of the lead candidate before significant resources are invested into process development.
Confirmation that glycoengineering strategies have had their desired effect can be validated through glycosylation profiling. Glycosylation profiling can confirm that structural changes have occurred at desired locations (i.e. site-specific afucosylation or increase in sialylation). Furthermore, glycosylation profiling helps guarantee that new glycosylation sites are correctly decorated with the specific glycoforms, while also confirming no unwanted changes occur in glycan processing elsewhere on the protein.
Glycosylation profiling during process development can identify how changes in media composition shift intracellular pools of nucleotide sugars or Golgi enzyme kinetics that control terminal glycan processing. Effects of temperature and culture conditions on glycosyltransferase activities can be used to delineate design spaces where these critical activities are maintained and glycoform distributions remain consistent. The effect of bioreactor size on glycosylation can be characterized to identify potential mass transfer limitations or mixing heterogeneities that could shift glycan processing during scale-up.
Batch-release studies for process validation can be performed by longitudinal glycosylation profiling. Multiple lots over time are monitored to ensure site occupancy distributions do not drift outside predefined specifications. Since this type of release studies evaluate product consistency over time, they are able to detect small process drifts that could eventually affect product quality. Longitudinal glycosylation profiling also can be used during product stability studies to fully characterize possible remodeling of glycans over time due to deamidation, loss of sialic acid, or aggregates that may alter glycosylation patterns. Using this data you can also establish specifications and acceptance criteria since you will have determined the range of variability. These studies thoroughly document consistency and can be used to support regulatory filings to demonstrate control of glycosylation as a quality attribute.
The glycosylation of the recombinant protein product will differ based on the host cell line or organism expression system used as each species will have varying glycosyltransferases and pathways of glycan biosynthesis. Alterations to host cell lines and expression systems impact traits of N- and O-linked glycosylation including sialylation, fucosylation, and non-human epitopes. Glycosylation affects activity, half-life, and immunogenicity of therapeutic proteins therefore host cells and expression systems must be stringently validated.
CHO cells and HEK cells are the two mammalian expression systems most frequently utilized in therapeutic protein production. Glycoproteins produced from CHO cells tend to be more highly sialylated and extensively branched when compared to glycoproteins made in HEK cells. In contrast, N- and O-glycosyltransferase activities were found to be greater in HEK systems. This leads to differences in terminal glycan processing and molecular weight distribution. These differences can impact receptor interaction, clearance, and efficacy, and should be considered when choosing a cell line as glycosylation may play a role in the drug's mechanism of action. Glycosylation patterns should also be consistent to meet regulatory expectations at each stage of drug development.
Expression systems in non-mammalian cells allow for advantages with expression of recombinant proteins. However there are some glycosylation patterns that are possible with these systems that should be taken into account. Methylotrophic yeast hypermannosylates glycoproteins and the resulting glycan structures are foreign compared to glycans found in mammals. This can result in unwanted immunogenicity or clearance. Expression with insect cells and a baculovirus will hypermannosylate or add paucimannose or oligomannose glycans without sialic acid capping the terminal position. Alterations can be made to these systems to more closely mimic mammalian glycosylation. Plant derived glycoproteins contain alpha linked fucose and xylose residues on the core of the glycan. Neither of these residues are seen in glycans from mammals and can cause unwanted immunogenicity. Alterations need to be made in order to produce usable glycans.
Host-cell-specific glycan analysis is done to identify glycans that are considered undesirable for safety or efficacy reasons. Terminal Gal-α-linkages or host-cell specific sialic acids not found naturally in humans can lead to immunogenicity or differences in PK because they can be identified as foreign by the immune system. Host-cell-specific glycan analysis will identify these types of glycans. To ensure consistency, regulatory bodies mandate host cell glycan analysis to confirm that these glycans stay stable and meet predefined standards throughout the manufacturing process. Factors taken into consideration during risk assessment include neutralizing epitopes, clearance by glycan receptors, and product quality attributes relative to comparator proteins.
SSG analysis using glycopeptides enables determination of glycosylation profiles of various classes of recombinant therapeutic proteins such as monoclonal antibodies, cytokines, growth factors, immunoglobulins and fusion proteins. Glycosylation profiles influence potency, PK/PD, immunogenicity and aggregation characteristics of protein therapeutics. Glycopeptide analysis can be used to verify that glycosylation is appropriate for the mechanism of action of the therapeutic and is consistent between production runs. SSG analysis is also useful for demonstrating biosimilarity.
Table 2 Glycopeptide Analysis Applications Across Therapeutic Protein Classes
| Protein Class | Glycosylation Impact | Analytical Focus | Development Stage Application |
| Hormones/growth factors | Receptor binding modulation | Site occupancy verification | Potency optimization |
| Enzyme replacement therapies | Targeting and uptake efficiency | Mannose-6-phosphate content | Efficacy enhancement |
| Fusion proteins | Domain-specific function | Multi-site microheterogeneity | Structural consistency |
| Cytokines/immunomodulators | Immunogenicity risk | Non-human epitope detection | Safety assurance |
Characterization by glycopeptide analysis of recombinant hormones or growth factors can confirm that the pattern of glycosylation is consistent with proper orientation or presentation of these proteins for receptor binding and biological activity. In some cases, particular sialylation patterns may be needed for a recombinant protein to have acceptable serum half-life and prevent rapid clearance from the liver. SSG can also confirm reproducibility of manufacturing process that produce glycoforms needed to achieve maximum potency for therapeutic proteins when carbohydrates are directly involved in ligand-receptor binding or protein folding stability necessary for biological activity.
One example application of glycopeptide analysis is for quality control purposes during enzyme replacement therapies for lysosomal storage diseases. Molecules used for these therapies need to contain mannose-6-phosphate in order to bind to receptors involved in importing the therapeutic enzymes into cells. Analysis may also determine whether high mannose or complex glycans are targeted to diseased cells. Glycopeptide analysis confirms that glycoengineering techniques like synthesis of mannose capped glycans reach the desired glycan endpoints.
Fc-fusion proteins, consisting of an immunoglobulin Fc region and another protein domain such as a cytokine or a receptor extracellular domain, have complex glycosylation patterns with multiple glycosylation sites. Mapping of glycopeptides can be used to distinguish glycosylation on the Fc fragment versus the fusion partner in such constructs. The fusion partner may display glycosylation microheterogeneity unique from the Fc portion of the protein, which can have implications for PK, stability, or immunogenicity. Additionally, glycopeptide mapping can be used to show biosimilarity between a proposed biosimilar and its reference molecule, as fusion partners may glycosylate differently depending on the microenvironment and accessibility.
When analyzing cytokines and immunomodulators the intent of glycopeptide analysis is often to look for the presence or absence of foreign glycan epitopes that might sensitize patients and cause immunogenicity with administration of the therapeutic protein. Terminal sugar modifications need to be characterized for evidence of potentially immunogenic galactose or sialic acid linkages not found on human proteins. Site-specific analysis helps confirm that the expression system is glycosylating the protein in a way that appropriately humanizes the protein.
Regulatory expectations for glycosylation characterization include compliance with global guidelines, detailed submission and comparability studies. Current good manufacturing practice regulatory agencies expect glycosylation to be characterized as part of chemistry, manufacturing and controls information provided. Agencies require demonstration of product structure and consistency between production batches.
ICH Q6B require the thorough characterization of all aspects of composition, physical characteristics and structure of biotech products. Analysis of glycosylation is further discussed in the Physicochemical characteristics section of this document. As you know biologics are naturally heterogeneous. While consistency of heterogeneity from batch to batch used in preclinical and clinical studies is necessary, it should not impact safety or efficacy profiles. Characterization of structure needs to determine "the extent of oligosaccharide content, glycosylation sites present and the structure(s) of carbohydrate chains attached. Orthogonal characterization methods should be used to understand the glycan profile." While this may sound complicated, what it really means is you need to define all N-linked and O-linked glycosylation with regards to glycosylation site occupancy and microheterogeneity.
INDs and BLAs typically include comprehensive glycosylation characterization studies necessary to demonstrate the carbohydrate structures of the product are suitable for its mechanism of action. In-process controls should be developed along with structure characterization included in the Chemistry, manufacturing and controls section of the IND/BLA. Summary data on glycan heterogeneity including glycoform distributions, occupancy of each site, and batch-to-batch variations are required. Agencies generally expect multiple lots to be trended to help define the acceptable range of glycosylation variations. Specifications need to be developed and justified based on their impact on clinical activity. When glycosylation impacts potency or pharmacokinetics descriptions of which glycans structures are present on the product and associated with biological activity should be included in the submission. Batch-to-batch manufacturing consistency should be described to demonstrate glycoforms are within acceptance criteria. Comparison to reference material or clinical materials used for safety and efficacy should be assessed.
For manufacturing changes such as scale up, transfer of manufacturing sites, or alteration of cell lines, comparability studies guided by ICH Q5E should be performed. They should demonstrate that the glycosylation profile is unchanged from the pre-change product. At least one batch made before and several batches made after the change should be compared head-to-head using analytical methods that are sensitive to small changes in glycan patterns. If changes in glycosylation are beyond historical ranges, or could affect product safety or efficacy, bridging nonclinical or clinical studies may need to be conducted to support the change. Regulatory agencies must decide if any observed differences are meaningful changes or acceptable variations within the design space. This often requires risk assessment of how any glycosylation changes impact product function to assure quality.
ICH Q8 designates glycosylation as a quality attribute. A quality attribute is defined as any physical, chemical, or biological property that should be within an appropriate limit to provide the desired product quality, safety and efficacy. Glycosylation is known to have great effects on protein folding, activity, immunogenicity and clearance, thus glycosylation would be designated as a critical quality attribute (CQA). However, it should be noted that certain glycan attributes affect protein therapeutic function. For example, core fucosylation has been shown to affect the effector functions of antibodies, and high-mannose glycans affect the serum half-life of biotherapeutics. Regulatory agencies require proper control and validation that glycosylation remains within a set range, with justifications on how those ranges were chosen (such as clinically relevant differences and process capabilities). This control should be placed into the quality by design framework by designating glycosylation CQAs and controlling them through validated analytical assays and process parameters.
Enhanced glycopeptide analysis has increased resolution of microheterogeneity including identification of site-specific glycan structures. Utilizing high-resolution mass spectrometry (MS), improved sample prep, and robust bioinformatics analysis allows for glycan structural characterization required for biopharmaceutical product quality assessment, biosimilar comparability studies, and regulatory reporting.
High resolution mass spectrometry (MS) instruments such as Orbitrap and quadrupole-time-of-flight (qToF) MS have sufficient accuracy and resolution to resolve isomeric glycopeptides that differ by a single monosaccharide or small mass differences. Glycan composition and peptide identities can be determined with exact mass measurements. Fragmentation methods such as collision-induced dissociation (CID) coupled with electron transfer dissociation (ETD) provide complementary fragmentation that breaks the glycosidic bonds to generate carbohydrate structure while maintaining the peptide-glycan bond for confident localization.
Automated workflows allow selective enzymatic digestion to produce glycopeptides amenable to chromatographic separation. Glycopeptides are enriched because often the relative amount is low. Glycopeptides are often ion suppression and therefore trace amounts are purified by hydrophilic interaction chromatography or Lectin enrichment. Minor glycosylation that might not be represented in the overall glycome but can have high significance are not overlooked.
Bioinformatic software packages that include auto-searching functions allow users to align experimental tandem MS data with in silico glycopeptide databases. This process helps to streamline the analysis of larger datasets. Users may wish to manually inspect important spectra to verify the assigned glycan structures and localization. Programs that deconvolute fragmentation can report isomeric differentiation and relative abundance to help normalize reports between experiments.
Method validation defines parameters such as sensitivity and linearity by testing known standards. Robustness should be assessed by applying the method to samples across multiple days, analysts and reagent batches to demonstrate that SSG can be quantified consistently. Data produced by validated methods should align with regulatory guidelines for the intended quality control application.
Proteomic analysis of recombinant glycoproteins is technically challenging due to glycan heterogeneity, protein modifications diversity, and complex data analysis needed for biological interpretation. This requires specialized approaches, comprehensive sample preparation and rigorous quality control for accurate SSG profiling.
Therapeutic proteins produced via recombinant DNA techniques often bear numerous N- and O-glycosylation sites located within various protein domains that each have distinct microenvironments and processing tendencies. As such, complex combinations of glycopeptides are formed with the possibility that same glycans may be added to each site with different occupancy and that each site may have its own unique microheterogeneity profile. As such thorough mapping is necessary to achieve coverage of all glycosylation sites while resolving site-specific differences in glycosylation that could have meaningful implications for protein efficacy.
Rare glycoforms are difficult to detect since they are often suppressed during ionization by more abundant glycoforms, and there is a limited amount of absolute material typically available for analysis. While some of these low abundance glycoforms may represent specific afucosylated, sialylated, or phosphorylated forms that make up only a fraction of the total glycans present, they can have significant functional implications. Enhancing detection of these glycoforms may be possible through enrichment methods such as hydrophilic interaction chromatography (HILIC) or lectin enrichment to isolate the glycoform of interest, as well as utilizing more sensitive mass spectrometry methods to detect low stoichiometric peptides.
Two structural isomers with the same molecular weight but different glycosidic bondages, monosaccharide anomeric configurations, or branching can be difficult to distinguish using typical mass spectrometry methods. Isomers that differ in sialic acid linkage type, galactose position, or fucose linkage have been shown to have varying biological effects but co-elute as one peak during typical chromatographic methods. These isomers can usually be distinguished using certain fragmentation techniques like electron transfer dissociation or MS^n as well as orthogonal separation techniques such as ion mobility, which separate the isomers based on differences in their shapes in the gas phase.
Processing of mass spectral datasets produces large lists of experimental features (potentially thousands per glycopeptide sample) that must be assigned to peptide and glycan compositions. Computational tools are needed to differentiate signal from noise, deconvolve competing interpretations of fragment ions, and quantify composite isotope envelopes. Spectral assignments typically still require manual verification, especially in cases of atypical glycans or when investigating samples from unique expression platforms or modified cell lines.
The detailed outputs generated from glycopeptide characterization takes mass spectrometric information and packages it into files suitable for inclusion into biopharmaceutical product development reports or regulatory filings. These results allow for definitive assignment of glycosylation sites which can be leveraged to make judgements on product quality, process robustness, and structural similarity during the lifetime of a biotherapeutic.
Table 3 Standard Deliverables for Glycopeptide Analysis
| Deliverable Type | Content Description | Strategic Application |
| Site-specific maps | Visual glycan distribution per site | Structure-function correlation |
| Quantification data | Relative glycoform abundance | Batch consistency assessment |
| Annotated spectra | Fragment ion assignments | Structural verification |
| Technical reports | Comprehensive documentation | Regulatory submission support |
Maps show glycan structures at each individual residue position along the peptide backbone. The map visually displays occupancy and microheterogeneity at each site. You can quickly determine whether specific glycoforms are common or rare at each site. Localization of glycans to the sequence enables association with protein function and allows for comparison of biosimilar and reference product.
Relative quantification identifies how many of each glycans structures are attached to each glycosylation site, which can be used for Critical Quality Attribute assessment. Relative quantification reports the percentage of each glycoform (ie; high mannose, complex with varied terminal groups) at each site, allowing for batch-to-batch comparisons and process monitoring. This quantitative values can be used to determine allowable windows for each glycan population associated with activity which can be used to set specifications with clinical justification.
Verified tandem mass spectra are often supplied as evidence for glycopeptide assignments. These spectra are typically annotated with proposed fragment ions and mechanism for fragmentation. Annotations indicate glycosidic bond breaks which verify sugar content and peptide backbone ions which confirm amino acid sequences and localization. These notes provide insight into how the glycoform assignment was determined and should allow an auditor to verify the conclusion.
Regulatory-ready package reports summarize glycopeptide characterization results and are designed for inclusion in chemistry, manufacturing and controls (CMC) sections of INDs and BLAs. They report analytical methods and sample preparation techniques used along with acceptance criteria, making assay strategies completely transparent. Summary tables of site-specific glycoform abundance and batch comparison statistics are included to convey structural data needed for agency reviewers to make conclusions about product quality and control.
Accurate, site-specific glycosylation characterization is essential for recombinant therapeutic proteins, where even subtle glycan variations can impact biological activity, pharmacokinetics, and regulatory acceptance. Beyond technical capability, successful glycopeptide analysis requires a partner who understands the broader CMC strategy, development milestones, and regulatory context of biologics. Our team combines advanced LC-MS expertise with deep biopharmaceutical development experience to deliver data that is not only analytically robust, but also strategically aligned with your program objectives.
Glycosylation is frequently defined as a Critical Quality Attribute (CQA) for recombinant therapeutic proteins. We design and execute glycopeptide studies with full awareness of how site-specific glycosylation data supports CMC documentation, comparability assessments, and regulatory filings. Our experience spans:
We understand how regulators evaluate glycosylation variability, host-cell-specific glycan signatures, and microheterogeneity. Our analytical strategy ensures that glycopeptide data integrates seamlessly into risk assessments, control strategies, and overall quality documentation.
Recombinant protein programs vary significantly depending on molecule type, expression system, and stage of development. A one-size-fits-all analytical approach is rarely appropriate. We tailor each site-specific glycosylation analysis project to match your development goals and regulatory strategy. For early discovery and preclinical programs, we focus on:
For process development and scale-up phases, we emphasize:
For late-stage or commercial programs, we provide:
By aligning analytical depth with development stage, we deliver meaningful insights without unnecessary complexity or cost.
High-quality analytical data must be accompanied by clear, structured documentation suitable for regulatory review. Our glycopeptide analysis reports are prepared with regulatory integration in mind, facilitating direct incorporation into CMC modules. Deliverables typically include:
Biopharmaceutical development timelines are often tight, especially during scale-up, comparability assessments, or regulatory submission preparation. We prioritize transparency, proactive communication, and milestone-driven project management throughout every engagement. Clients benefit from:
Our approach ensures that you receive not only high-resolution glycosylation data, but also a collaborative analytical partnership that supports confident decision-making at every stage of recombinant therapeutic protein development.
References