Automated Analysis report for GSL-I Vector CFG

Array Chart

Figure 1. Barcharts of raw glycan binding data. Glycans are grouped by family (automatically if not provided) and sorted by binding strength. Mouseover to see glycan structures.

Primary Motifs

Figure 2. Boxplots of glycans grouped by primary motif. Individual points are glycans. Mouseover to see glycan structures.
Primary Motif ID Motif Graphic Structure Relative Binding Number of Glycans
A A0p 0.94 7
B B0p 0.75 8
C C0p 1.00 6
D D0p 0.22 23
E E0p 0.18 35
0 0.00 530
Figure 2. Boxplots of glycans grouped by primary motif. Individual points are glycans. Mouseover to see glycan structures.
Primary Motif ID Motif Graphic Structure Relative Binding Number of Glycans
A A0p 0.94 7
B B0p 0.60 8
C C0p 1.00 6
D D0p 0.10 23
E E0p 0.04 35
0 0.00 530
Figure 2. Boxplots of glycans grouped by primary motif. Individual points are glycans. Mouseover to see glycan structures.
Primary Motif ID Motif Graphic Structure Relative Binding Number of Glycans
A A0p 1.00 7
B B0p 0.55 8
C C0p 0.88 6
D D0p 0.05 23
E E0p 0.01 35
0 0.00 530
Figure 2. Boxplots of glycans grouped by primary motif. Individual points are glycans. Mouseover to see glycan structures.
Primary Motif ID Motif Graphic Structure Relative Binding Number of Glycans
A A0p 0.61 7
B B0p 0.39 8
C C0p 1.00 6
D D0p 0.02 23
E E0p 0.01 35
0 0.00 530
Figure 2. Boxplots of glycans grouped by primary motif. Individual points are glycans. Mouseover to see glycan structures.
Primary Motif ID Motif Graphic Structure Kd Number of Glycans
A A0p 215.265 7
B B0p 103.663 8
C C0p 140.887 6
D D0p 575.172 17
E E0p 690.151 31
0 740.037 13

All Motifs

Figure 3. Boxplots of glycans grouped by motif. Individual points are glycans. Mouseover to see glycan structures.
Motif ID Nearest Common Name (Accuracy%**) Motif Graphic Structure Relative Binding Number of Glycans P-Value***
A0 Terminal alpha-Galactose (92%) A0p 0.53 7 0.000
B1 Terminal alpha-Galactose N-Glycan (98%) B1p 1.00 1 0.000
B0* Terminal alpha-Galactose (92%) B0p 0.34 7 0.000
C0* Tn Antigen (100%) C0p 0.91 1 0.000
C1 Terminal N-Actyl Galactosamine O-GlcNAc Core (98%) C1p 0.86 1 0.000
C4 Terminal N-Actyl Galactosamine (91%) C4p 0.76 1 0.000
C5 Terminal N-Actyl Galactosamine O-GalNAc (99%) C5p 0.47 1 0.000
C2 Blood Group A Type 2 (99%) C2p 0.23 1 0.000
C3 Blood Group A (97%) C3p 0.14 1 0.058
D0 Blood Group A (99%) D0p 0.12 23 0.000
E0 Terminal alpha-Galactose (97%) E0p 0.10 35 0.000
0 Non-Binders (100%) 0.00 530 NA
Figure 3. Boxplots of glycans grouped by motif. Individual points are glycans. Mouseover to see glycan structures.
Motif ID Nearest Common Name (Accuracy%**) Motif Graphic Structure Relative Binding Number of Glycans P-Value***
A0 Terminal alpha-Galactose (92%) A0p 0.49 7 0.000
B1 Terminal alpha-Galactose N-Glycan (98%) B1p 1.00 1 0.000
B0* Terminal alpha-Galactose (92%) B0p 0.22 7 0.000
C4 Terminal N-Actyl Galactosamine (91%) C4p 1.00 1 0.000
C1 Terminal N-Actyl Galactosamine O-GlcNAc Core (98%) C1p 0.97 1 0.000
C0* Tn Antigen (100%) C0p 0.81 1 0.000
C5 Terminal N-Actyl Galactosamine O-GalNAc (99%) C5p 0.23 1 0.000
C2 Blood Group A Type 2 (99%) C2p 0.10 1 0.451
C3 Blood Group A (97%) C3p 0.03 1 1.000
D0 Blood Group A (99%) D0p 0.05 23 0.000
E0 Terminal alpha-Galactose (97%) E0p 0.02 35 0.236
0 Non-Binders (100%) 0.00 530 NA
Figure 3. Boxplots of glycans grouped by motif. Individual points are glycans. Mouseover to see glycan structures.
Motif ID Nearest Common Name (Accuracy%**) Motif Graphic Structure Relative Binding Number of Glycans P-Value***
A0 Terminal alpha-Galactose (92%) A0p 0.51 7 0.000
B1 Terminal alpha-Galactose N-Glycan (98%) B1p 1.00 1 0.000
B0* Terminal alpha-Galactose (92%) B0p 0.18 7 0.000
C0* Tn Antigen (100%) C0p 0.92 1 0.000
C1 Terminal N-Actyl Galactosamine O-GlcNAc Core (98%) C1p 0.83 1 0.000
C4 Terminal N-Actyl Galactosamine (91%) C4p 0.77 1 0.000
C5 Terminal N-Actyl Galactosamine O-GalNAc (99%) C5p 0.13 1 0.471
C2 Blood Group A Type 2 (99%) C2p 0.06 1 0.994
C3 Blood Group A (97%) C3p -0.00 1 1.000
D0 Blood Group A (99%) D0p 0.02 23 0.704
E0 Terminal alpha-Galactose (97%) E0p 0.01 35 1.000
0 Non-Binders (100%) 0.00 530 NA
Figure 3. Boxplots of glycans grouped by motif. Individual points are glycans. Mouseover to see glycan structures.
Motif ID Nearest Common Name (Accuracy%**) Motif Graphic Structure Relative Binding Number of Glycans P-Value***
A0 Terminal alpha-Galactose (92%) A0p 0.21 7 0.000
B1 Terminal alpha-Galactose N-Glycan (98%) B1p 0.48 1 0.000
B0* Terminal alpha-Galactose (92%) B0p 0.08 7 0.000
C1 Terminal N-Actyl Galactosamine O-GlcNAc Core (98%) C1p 1.00 1 0.000
C4 Terminal N-Actyl Galactosamine (91%) C4p 0.68 1 0.000
C0* Tn Antigen (100%) C0p 0.26 1 0.000
C5 Terminal N-Actyl Galactosamine O-GalNAc (99%) C5p 0.10 1 0.073
C2 Blood Group A Type 2 (99%) C2p 0.00 1 1.000
C3 Blood Group A (97%) C3p -0.00 1 1.000
D0 Blood Group A (99%) D0p 0.01 23 0.991
E0 Terminal alpha-Galactose (97%) E0p 0.00 35 0.999
0 Non-Binders (100%) 0.00 530 NA
Figure 3. Boxplots of glycans grouped by motif. Individual points are glycans. Mouseover to see glycan structures.
Motif ID Nearest Common Name (Accuracy%**) Motif Graphic Structure Kd Number of Glycans P-Value***
A0 Terminal alpha-Galactose (92%) A0p 215.265 7 0.019
B1 Terminal alpha-Galactose N-Glycan (98%) B1p 20.1104 1 0.379
B0* Terminal alpha-Galactose (92%) B0p 115.599 7 0.003
C4 Terminal N-Actyl Galactosamine (91%) C4p 7.80831 1 0.357
C1 Terminal N-Actyl Galactosamine O-GlcNAc Core (98%) C1p 15.3475 1 0.370
C0* Tn Antigen (100%) C0p 35.0639 1 0.407
C3 Blood Group A (97%) C3p 77.6507 1 0.493
C5 Terminal N-Actyl Galactosamine O-GalNAc (99%) C5p 254.595 1 0.845
C2 Blood Group A Type 2 (99%) C2p 454.858 1 0.996
D0 Blood Group A (99%) D0p 575.172 17 0.877
E0 Terminal alpha-Galactose (97%) E0p 690.151 31 1.000
0 Non-Binders (100%) 740.037 13 NA

*Motif indicates the remaining glycans not matched by motifs which are a subset. Motif definition needs to be taken in the context of the model.

**Accuracy describes the consistency between common-name definition of the motif and the formal, text-based definition of the motif, in terms of percent agreement in the glycans containing the two motifs. Common Name label definitions given here.

***P-Value refers to difference from Non-Binders with multiple testing correction (Dunnet’s Test)

Motifs with a red motif ID fail to show a logistic response to protein concentration in the range of concentrations analyzed. These motifs may be nonbinding motifs (motifs which define nonbinding exceptions) or simply fail to fit. Nonbinding motifs are determined based on concentration dependent response when available or the average binding of non-motif glycans otherwise.

Motif Intensity Maps

Figure 4. Glycan intensity and motif distribution plot. The top half of the plot presents the observed glycan binding intensity of various glycans used in the array over their rank binding intensity; only the top glycans are shown. The second plot indicates the position of glycans containing the various motifs in the top plot with a yellow tick.

Treemap Diagram

*Motif indicates the remaining glycans not matched by motifs which are a subset. Motif definition needs to be taken in the context of the model.

Figure 5. Treemap of glycan binding grouped by motif and family structure. The model structure can be represented as nested boxes where box size is proportional to the number of glycans with the motif and color changes with change in average relative binding of glycans with the motif. Only three layers of data splitting are included here, though further splitting may be possible.

Model Diagram

*Motif matches the remaining glycans not matched by earlier motifs in the model.

Figure 6. Tree representation of the regression tree model trained on array data. Data flows through the tree (top-down) and is split by the various motifs. The motif used the split the data at each point has the id “family+split number” except when further split. In the case of futher splits the id of the motif used to split the data is denoted with an asterisk.

Binding Curves

Figure 7. Logistic curves fit to average motif binding and glycan binding. Curves are fit with as many parameters as possible given the data. All curves are based on the logistic curve with a fixed intercept of 0. Mouse-over to view curve info and highlight glycans/motifs on the other plot.

## Function `ggiraph()` is replaced by `girafe()` and will be removed soon.

Figure 8. Scatterplots of logistic curve parameters for glycans (points) and motifs (text).

Motif Details

Motif ID Motif Graphic   Motif Full Graphic Kd Asymptote Slope Motif Text
A0 A0p A0Full 29.607 27734 0.697 {<2f3f4f6f>GalA1-4<2f3f6f><GalorGlcNAc>B}
B1 B1p B1Full 19.898 47947 0.733 {<2f3f4f6f>GalA1-3<2f4f6f>GalB1-4<3f6f>GlcNAcB1-2<3f4f6f>ManA1-3(<2f4f6f>GalB1-4<3f6f>GlcNAcB1-2<3f4f6f>ManA1-6)<2f4f>ManB1-4<3f6f>GlcNAcB1-4<1f3f6f>GlcNAc?} {Spacer:KVANKT}
B0 B0p B0Full 151.581 29374 0.695 {<2f3f4f6f>GalA1-3<2f4f6f>GalB1-4<6f>GlcNAcB}
C1 C1p C1Full 15.191 39719 0.737 {<3f4f6f>GalNAcA1-4(<2f3f4f>FucA1-2)<3f6f>GalB1-4<1f3f6f>GlcNAcB} {Spacer:-NCCCO}
C2 C2p C2Full 282.637 28024 0.797 {<3f4f6f>GalNAcA1-3(<2f3f4f>FucA1-2)<4f6f>GalB1-4<1f3f6f>GlcNAcB} {Spacer:-NCCCO}
C3 C3p C3Full 36.798 6227 1.956 {<3f4f6f>GalNAcA1-3(<2f3f4f>FucA1-2)<1f4f6f>GalB} {Spacer:-NCCCO}
C4 C4p C4Full 7.754 31180 0.892 {<3f4f6f>GalNAcA1-3<1f2f4f6f>GalB} {Spacer:-NCCCO}
C5 C5p C5Full 215.640 49003 0.780 {<3f4f6f>GalNAcA1-3<1f4f6f>GalNAcB} {Spacer:-NCCCO}
C0 C0p C0Full 33.349 48847 0.685 {<3f4f6f>GalNAcA} {Spacer:-NCCCO}
D0 D0p D0Full 200.918 13030 0.858 {<3f4f6f>GalNAcA}
E0 E0p E0Full 1577.501 65536 1.016 {<2f3f4f6f>GalA1-<2or3or4><6f>GalB}

See Symbol Nomenclature for Glycans (SNFG) for complete key: https://www.ncbi.nlm.nih.gov/glycans/snfg.html