Automated Analysis report for rMpL_CFG_1 evaluated at 20 ug/mL

List of Primary rMpL_CFG_1 Motifs

Primary Motif ID Primary Motif Minimal Structure Primary Motif Complete Structure Relative Binding Number of Glycans
A 1.00 33
B 0.38 79
C 0.76 3
D 0.45 3
E 0.01 68
F 0.34 6
0 0.01 416

Minimal and complete motif definitions match the same set of glycans defined in the array. Components found in the complete motif but excluded in the minimal motif are not tested by the array. Monosaccharide identities and subsitution intolerance may or may not have been tested by the array, they are retained from the complete motif for readability.

Boxplot of Primary Motifs

List of Fine-Specificity rMpL_CFG_1 Motifs

Motif ID Nearest Common Name (Accuracy%**) Motif Graphic Structure Relative Binding Number of Glycans P-Value***
A3* Type 2 TriLacNAc/i Antigen (97%) 0.99 8 <0.001
A2 Type 2 TriLacNAc/i Antigen (96%) 0.60 3 <0.001
A0* Terminal Type 2 LacNAc N-Glycan (97%) 0.47 18 <0.001
A1 Terminal Type 2 LacNAc O-GalNAc Core 4 (100%) 0.28 4 <0.001
B6 Terminal Type 1 LacNAc (97%) 1.00 3 <0.001
B1 Terminal beta-Galactose O-GlcNAc Core (96%) 0.86 5 <0.001
B5 Type 1 -Type 2 DiLacNAc (100%) 0.76 4 <0.001
B0* Terminal Type 1 LacNAc (97%) 0.19 20 <0.001
B3 Neolacto Glycosphingolipid (99%) 0.16 3 0.211
B2 Terminal Type 2 LacNAc O-GlcNAc Core (99%) 0.16 6 0.015
B4* Terminal Type 2 LacNAc (91%) 0.05 38 0.415
C0 Terminal Type 2 LacDiNAc (100%) 0.45 3 <0.001
D0 Terminal Hexuronate (100%) 0.26 3 0.208
E0 0.00 68 1.000
F1 O-GlcNAc Core a6 Sialyl Type 2 LacNAc (99%) 0.49 3 0.001
F0* O-GlcNAc Core a3 Sialyl Type 2 LacNAc (99%) 0.01 3 1.000
0 Non-Binders (100%) 0.00 416 NA

Key:

See Symbol Nomenclature for Glycans (SNFG) for complete key: https://www.ncbi.nlm.nih.gov/glycans/snfg.html

*Motif indicates the remaining glycans not matched by motifs which are a subset. Motif definition needs to be taken in the context of the model.

**Accuracy describes the consistency between common-name definition of the motif and the formal, text-based definition of the motif, in terms of percent agreement in the glycans containing the two motifs. Common Name label definitions given here.

***P-Value refers to difference from Non-Binders with multiple testing correction (Dunnet’s Test)

Motifs with a red motif ID fail to show a logistic response to protein concentration in the range of concentrations analyzed. These motifs may be nonbinding motifs (motifs which define nonbinding exceptions) or simply fail to fit. Nonbinding motifs are determined based on concentration dependent response when available or the average binding of non-motif glycans otherwise.

Boxplot of Fine-Specificity Motifs

Figure 1. Glycan binding grouped by motif and motif family. Individual glycans are given as points on the plot.

Motif Intensity Map

Figure 2. Glycan intensity and motif distribution plot. The top half of the plot presents the observed glycan binding intensity of various glycans used in the array over their rank binding intensity; only the top glycans are shown. The second plot indicates the position of glycans containing the various motifs in the top plot with a yellow tick.

Motif Family Membership Map

*Motif indicates the remaining glycans not matched by motifs which are a subset. Motif definition needs to be taken in the context of the model.

Figure 3. Treemap of glycan binding grouped by motif and family structure. The model structure can be represented as nested boxes where box size is proportional to the number of glycans with the motif and color changes with change in average relative binding of glycans with the motif. Only three layers of data splitting are included here, though further splitting may be possible.

Detailed Model Breakdown

Motif Glycan Examples:

Motif ID Motif Minimal Graphic Motif Complete Graphic Highest Glycan Moderate Glycan Lowest Glycan
A1
A2
A3
A0
B1
B2
B3
B4
B5
B6
B0
C0
D0
E0
F1
F0
0

Key:

All Concentration Plots:

Figure 4. Boxplots of glycan binding grouped by motif for each dataset in the model. Motifs are listed in ascending average binding intensity (for the selected concentration) and colored by family.

Model Structure:

*Motif matches the remaining glycans not matched by earlier motifs in the model.

Figure 5. Tree representation of the regression tree model trained on array data. Data flows through the tree (top-down) and is split by the various motifs. The motif used the split the data at each point has the id “family+split number” except when further split. In the case of futher splits the id of the motif used to split the data is denoted with an asterisk.


Curve Fitting:

Figure 6. Logistic curves fit to average motif binding. Curves are fit with as many parameters as possible given the data. All curves are based on the logistic curve with a fixed intercept of 0. Nonbinding motifs are excluded from the plot.

Motif Curve Parameter Scatterplots

Figure 7. Scatterplots of motif curve parameters. Plotting the asymptote of the curve and the slope of the curve over the log 10 Kd of the motif allows for critical assesment and comparison of motif curves. Note that the apparent Kd of the motif is the concentration at which the curve is half of the asymptote. A standard logistic slope of 1 is marked in a dashed line for reference. Nonbinding motifs are excluded from the plot. The highest affinity motifs which show the best concentration dependent response are in the top left of the top plot. Curves which have very low slopes may also indicate nonspecific binding while higher slopes may indicate higher order binding.

Motif ID Motif Graphic Structure Estimated Kd (ug/mL) Curve Asymptote Hill Slope
A1 110.610 32949 1.149
A2 51.211 35626 1.204
A3* 43.880 50516 1.187
A0* 70.967 39194 1.234
B1 12.298 18956 1.307
B2 133.504 29208 1.290
B3 79.372 25147 1.649
B4* 79.423 8286 1.762
B5 55.602 44044 1.086
B6 27.621 36793 1.352
B0* 66.358 16451 1.319
C0 63.036 28278 1.066
D0 10.921 5633 1.193
F1 7.233 8308 1.708

Motif Text Structures:

Motif ID Motif Graphic Motif Text
A1 <2f3f4f6f>GalB1-4<3f6f>GlcNAcB1-2Skip?1-6(<3f6f>GlcNAcB1-2Skip?1-3)<2f4f><GalNAcorMan>A
A2 <2f3f4f6f>GalB1-4<3f6f>GlcNAcB1-3<2f4f6f>GalB1-4<3f6f>GlcNAcB1-3<2f4f6f>GalB1-4<3f6f>GlcNAcB1-2<3f4f6f>ManA1-6<2f4f>ManB
A3* <2f3f4f6f>GalB1-4<3f6f>GlcNAcB1-3<2f4f6f>GalB1-4<3f6f>GlcNAcB1-3<2f4f6f>GalB1-4<3f6f>GlcNAcB1-<2or3><4f><GalorMan>?
A0* <2f3f4f6f>GalB1-4<3f6f>GlcNAcB1-2Skip?1-3(<2f3f4f6f>GalB1-4<3f6f>GlcNAcB1-2Skip?1-<4or6>)<2f><GalorGalNAcorMan>?
B1 <2f3f4f6f>GalB1-<3or4><6f>GlcNAcA
B2 <2f3f4f6f>GalB1-4<3f6f>GlcNAcB1-<3or6><2f4f>GalB1-4<3f6f>GlcNAcB
B3 <2f3f4f6f>GalB1-4<3f6f>GlcNAcB1-3<2f4f>GalB1-4<1f2f3f6f>GlcB
B4* <2f3f4f6f>GalB1-4<6f>GlcNAcB
B5 <2f3f4f6f>GalB1-3<4f6f>GlcNAcB1-3<2f4f6f>GalB1-4<3f6f>GlcNAcB1-<2or3or6><4f><GalorMan>?
B6 <2f3f4f6f>GalB1-3<4f6f>GlcNAcB1-2<3f4f6f>ManA1-3<2f>ManB1-4<3f6f>GlcNAcB1-4<1f3f6f>GlcNAcB
B0* <2f3f4f6f>GalB1-<3or4><6f>GlcNAc?
C0 <3f4f6f>GalNAcB1-4<3f6f>GlcNAcB1-<2or3><4f6f><GalNAcorMan>?
D0 <2f3f4f>GlcAB1-<3or6><1f2f4f><GalorGlcNAc>B
E0 <6f><GlcNAcorMan>?1-2<3f6f>ManA1-3<2f4f>ManB1-4<3f6f>GlcNAcB1-4<1f3f>GlcNAcB
F1 <3f4f6f8f><GlcNAcorNeu5,9Ac2orNeu5Gc>A<1or2>-6<2f3f4f>GalB
F0* <3f4f6f8f><GlcNAcorNeu5,9Ac2orNeu5Gc>A<1or2>-<3xor6><2f4f>GalB1-4<1f3f6f>GlcNAcB
0 Non-Binders

Key: