Automated Analysis report for MAL-I_ZBioManyArray_Vector_HighComp evaluated at 10 ug/mL

List of Primary MAL-I_ZBioManyArray_Vector_HighComp Motifs

Primary Motif ID	Relative Binding	Number of Glycans
A	0.84	109
B	0.05	43
C	0.10	17
D	0.73	4
0	0.00	450

Minimal and complete motif definitions match the same set of glycans defined in the array. Components found in the complete motif but excluded in the minimal motif are not tested by the array. Monosaccharide identities and subsitution intolerance may or may not have been tested by the array, they are retained from the complete motif for readability.

Boxplot of Primary Motifs

List of Fine-Specificity MAL-I_ZBioManyArray_Vector_HighComp Motifs

Motif ID	Nearest Common Name (Accuracy%**)	Relative Binding	Number of Glycans	P-Value***
A1	a3 Sialyl Type 2 LacNAc (96%)	1.00	82	<0.001
A0*	a3 Sialyl Type 2 LacNAc Neolacto Glycosphingolipid (99%)	0.95	27	<0.001
B1	5-Glycolyl Neuramic Acid (93%)	0.04	40	0.987
B0*	5-Glycolyl Neuramic Acid Non-N-Glycan (94%)	0.02	3	1.000
C0	Neolacto Glycosphingolipid Terminal Type 2 LacNAc (98%)	0.10	17	0.930
D0	Terminal 3’ Sulfated Galactose (100%)	0.42	4	0.046
0	Non-Binders (100%)	0.00	450	NA

Key:

See Symbol Nomenclature for Glycans (SNFG) for complete key: https://www.ncbi.nlm.nih.gov/glycans/snfg.html

*Motif indicates the remaining glycans not matched by motifs which are a subset. Motif definition needs to be taken in the context of the model.

**Accuracy describes the consistency between common-name definition of the motif and the formal, text-based definition of the motif, in terms of percent agreement in the glycans containing the two motifs. Common Name label definitions given here.

***P-Value refers to difference from Non-Binders with multiple testing correction (Dunnet’s Test)

Motifs with a red motif ID fail to show a logistic response to protein concentration in the range of concentrations analyzed. These motifs may be nonbinding motifs (motifs which define nonbinding exceptions) or simply fail to fit. Nonbinding motifs are determined based on concentration dependent response when available or the average binding of non-motif glycans otherwise.

Boxplot of Fine-Specificity Motifs

Figure 1. Glycan binding grouped by motif and motif family. Individual glycans are given as points on the plot.

Motif Intensity Map

Figure 2. Glycan intensity and motif distribution plot. The top half of the plot presents the observed glycan binding intensity of various glycans used in the array over their rank binding intensity; only the top glycans are shown. The second plot indicates the position of glycans containing the various motifs in the top plot with a yellow tick.

Motif Family Membership Map

*Motif indicates the remaining glycans not matched by motifs which are a subset. Motif definition needs to be taken in the context of the model.

Figure 3. Treemap of glycan binding grouped by motif and family structure. The model structure can be represented as nested boxes where box size is proportional to the number of glycans with the motif and color changes with change in average relative binding of glycans with the motif. Only three layers of data splitting are included here, though further splitting may be possible.

Detailed Model Breakdown

Motif Glycan Examples:

Motif ID	Motif Minimal Graphic	Motif Complete Graphic	Highest Glycan	Moderate Glycan	Lowest Glycan
A1
A0
B1
B0
C0
D0
0

Key:

All Concentration Plots:

Figure 4. Boxplots of glycan binding grouped by motif for each dataset in the model. Motifs are listed in ascending average binding intensity (for the selected concentration) and colored by family.

Model Structure:

*Motif matches the remaining glycans not matched by earlier motifs in the model.

Figure 5. Tree representation of the regression tree model trained on array data. Data flows through the tree (top-down) and is split by the various motifs. The motif used the split the data at each point has the id “family+split number” except when further split. In the case of futher splits the id of the motif used to split the data is denoted with an asterisk.

Curve Fitting:

No curves were fit for model

Motif Text Structures:

Motif ID	Motif Graphic	Motif Text
A1		<4f5f6f8f>SiaA2-3<2f4f6f>GalB1-4<3f6f>GlcNAcB1-2<3f4f>ManA
A0*		<4f5f6f>SiaA2-3<2f4f6f>GalB1-4<3f6f>GlcNAcB
B1		<4f6f8f>Neu5GcA2-<3or6><2f4f>GalB1-<3or4><6f>GlcNAcB
B0*		<4f6f8f>Neu5GcA2-<3or6><2f4f>GalB1-<3or4><6f><GalNAcorGlcNAc>?
C0		<2f4f6f>GalB1-4<3f6f>GlcNAcB1-3<2f4f6f>GalB
D0		(3S)<2f><GalorGlcNS>?
0		Non-Binders

Key: