Automated Analysis report for GNA EY Labs AsymN

Figure 1. Barcharts of raw glycan binding data. Glycans are grouped by family (automatically if not provided) and sorted by binding strength. Mouseover to see glycan structures.

*Motif indicates the remaining glycans not matched by motifs which are a subset. Motif definition needs to be taken in the context of the model.

**Accuracy describes the consistency between common-name definition of the motif and the formal, text-based definition of the motif, in terms of percent agreement in the glycans containing the two motifs. Common Name label definitions given here.

***P-Value refers to difference from Non-Binders with multiple testing correction (Dunnet’s Test)

Motifs with a red motif ID fail to show a logistic response to protein concentration in the range of concentrations analyzed. These motifs may be nonbinding motifs (motifs which define nonbinding exceptions) or simply fail to fit. Nonbinding motifs are determined based on concentration dependent response when available or the average binding of non-motif glycans otherwise.

Figure 4. Glycan intensity and motif distribution plot. The top half of the plot presents the observed glycan binding intensity of various glycans used in the array over their rank binding intensity; only the top glycans are shown. The second plot indicates the position of glycans containing the various motifs in the top plot with a yellow tick.

*Motif indicates the remaining glycans not matched by motifs which are a subset. Motif definition needs to be taken in the context of the model.

Figure 5. Treemap of glycan binding grouped by motif and family structure. The model structure can be represented as nested boxes where box size is proportional to the number of glycans with the motif and color changes with change in average relative binding of glycans with the motif. Only three layers of data splitting are included here, though further splitting may be possible.

*Motif matches the remaining glycans not matched by earlier motifs in the model.

Figure 6. Tree representation of the regression tree model trained on array data. Data flows through the tree (top-down) and is split by the various motifs. The motif used the split the data at each point has the id “family+split number” except when further split. In the case of futher splits the id of the motif used to split the data is denoted with an asterisk.

0 5000 10000 15000 0.1 1.0 10.0 Concentration (ug/mL) Observed Binding Motif Curves 0 5000 10000 15000 0.1 1.0 10.0 Concentration (ug/mL) Observed Binding Motif A1 A2 A3 A0 B1 B0 0 Glycan Curves

Figure 7. Logistic curves fit to average motif binding and glycan binding. Curves are fit with as many parameters as possible given the data. All curves are based on the logistic curve with a fixed intercept of 0. Mouse-over to view curve info and highlight glycans/motifs on the other plot.

A1 A2 A3 A0 B1 B0 20000 40000 60000 1e-02 1e-01 1e+00 1e+01 1e+02 Kd Asymptote A1 A2 A3 A0 B1 B0 1e-06 1e-03 1e+00 1e-02 1e-01 1e+00 1e+01 1e+02 Kd Slope

Figure 8. Scatterplots of logistic curve parameters for glycans (points) and motifs (text).

ABCDEFGHIJ0123456789
ID
<dbl>
Motif
<fct>
Kd
<dbl>
Asymptote
<dbl>
Slope
<dbl>
31A27.672220e-0119481.2301.1168360
88B17.897627e-0115951.2301.1075590
85A08.363977e-0120168.9900.6666811
19A09.414033e-0118014.8300.7732704
35B01.332686e+0015144.5201.1850310
32A21.518877e+0019783.0001.1583590
20A01.529524e+0018643.5900.7964527
53B01.686377e+0021370.2700.8213691
90B01.718521e+0012318.4201.2260670
37B12.192069e+0016919.4400.9025222
Motif ID Motif Graphic   Motif Full Graphic Kd Asymptote Slope Motif Text
A1 2.535 8288 1.616 {<4f6f8f>Neu5AcA2-<3or6><2f4f>GalB1-4<3f6f>GlcNAcB1-2<3f4f6f>ManA1-6(<2f3f4f6f>ManA1-3)<2f4f>ManB1-4<3f6f>GlcNAcB1-4<1f3f6f>GlcNAcB} {Spacer:AEAB}
A2 1.093 19681 1.103 {<3f6f>GlcNAcB1-2<3f4f6f>ManA1-6(<2f3f4f6f>ManA1-3)<2f4f>ManB1-4<3f6f>GlcNAcB1-4<1f3f6f>GlcNAcB} {Spacer:AEAB}
A3 2.753 9594 1.197 {<4f6f8f>Neu5AcA2-<3or6><2f4f>GalB1-4<3f6f>GlcNAcB1-2<3f4f6f>ManA1-3(<2f3f4f6f>ManA1-6)<2f4f>ManB1-4<3f6f>GlcNAcB1-4<1f3f6f>GlcNAcB} {Spacer:AEAB}
A0 1.098 19085 0.726 {<3f6f>GlcNAcB1-2<3f4f6f>ManA1-<3or6>(<2f3f4f6f>ManA1-<3or6>)<2f4f>ManB1-4<3f6f>GlcNAcB1-4<1f3f>GlcNAcB} {Spacer:AEAB}
B1 3.724 2706 0.988 {<6f>GlcNAcB1-2<3f4f6f>ManA1-3<2f4f>ManB1-4<3f6f>GlcNAcB1-4<1f3f>GlcNAcB} {Spacer:AEAB}
B0 1.840 13700 0.982 {<3f4f6f>ManA1-3<2f4f>ManB1-4<3f6f>GlcNAcB1-4<1f3f>GlcNAcB} {Spacer:AEAB}

See Symbol Nomenclature for Glycans (SNFG) for complete key: https://www.ncbi.nlm.nih.gov/glycans/snfg.html