Automated Analysis report for AAL Vector CFG

Array Chart

Figure 1. Barcharts of raw glycan binding data. Glycans are grouped by family (automatically if not provided) and sorted by binding strength. Mouseover to see glycan structures.

Primary Motifs

Figure 2. Boxplots of glycans grouped by primary motif. Individual points are glycans. Mouseover to see glycan structures.
Primary Motif ID Motif Graphic Structure Relative Binding Number of Glycans
A A0p 0.64 201
B B0p 1.00 1
0 0.00 407
Figure 2. Boxplots of glycans grouped by primary motif. Individual points are glycans. Mouseover to see glycan structures.
Primary Motif ID Motif Graphic Structure Relative Binding Number of Glycans
A A0p 0.62 201
B B0p 1.00 1
0 0.00 407
Figure 2. Boxplots of glycans grouped by primary motif. Individual points are glycans. Mouseover to see glycan structures.
Primary Motif ID Motif Graphic Structure Relative Binding Number of Glycans
A A0p 0.47 201
B B0p 1.00 1
0 0.00 407
Figure 2. Boxplots of glycans grouped by primary motif. Individual points are glycans. Mouseover to see glycan structures.
Primary Motif ID Motif Graphic Structure Relative Binding Number of Glycans
A A0p 0.32 201
B B0p 1.00 1
0 0.00 407
Figure 2. Boxplots of glycans grouped by primary motif. Individual points are glycans. Mouseover to see glycan structures.
Primary Motif ID Motif Graphic Structure Relative Binding Number of Glycans
A A0p 0.18 201
B B0p 1.00 1
0 0.00 407
Figure 2. Boxplots of glycans grouped by primary motif. Individual points are glycans. Mouseover to see glycan structures.
Primary Motif ID Motif Graphic Structure Kd Number of Glycans
A A0p 57.8762 160
B B0p 0.25986 1
0 1.03851 3

All Motifs

Figure 3. Boxplots of glycans grouped by motif. Individual points are glycans. Mouseover to see glycan structures.
Motif ID Nearest Common Name (Accuracy%**) Motif Graphic Structure Relative Binding Number of Glycans P-Value***
A0* A0p 0.91 88 0.000
A5* Blood Group H (94%) A5p 0.89 39 0.000
A4 Blood Group H (95%) A4p 0.55 17 0.000
A6 Sialyl Lewis X (98%) A6p 0.23 6 0.010
A3 A3p 0.19 5 0.111
A2 Blood Group H a-Series Ganglioside (100%) A2p 0.11 4 0.846
A1 A1p 0.06 42 0.166
B0 beta-Fucose (100%) B0p 1.00 1 0.000
0 Non-Binders (100%) 0.00 407 NA
Figure 3. Boxplots of glycans grouped by motif. Individual points are glycans. Mouseover to see glycan structures.
Motif ID Nearest Common Name (Accuracy%**) Motif Graphic Structure Relative Binding Number of Glycans P-Value***
A0* A0p 0.87 88 0.000
A5* Blood Group H (94%) A5p 0.87 39 0.000
A4 Blood Group H (95%) A4p 0.54 17 0.000
A3 A3p 0.20 5 0.059
A6 Sialyl Lewis X (98%) A6p 0.18 6 0.075
A2 Blood Group H a-Series Ganglioside (100%) A2p 0.15 4 0.486
A1 A1p 0.07 42 0.102
B0 beta-Fucose (100%) B0p 1.00 1 0.000
0 Non-Binders (100%) 0.00 407 NA
Figure 3. Boxplots of glycans grouped by motif. Individual points are glycans. Mouseover to see glycan structures.
Motif ID Nearest Common Name (Accuracy%**) Motif Graphic Structure Relative Binding Number of Glycans P-Value***
A5* Blood Group H (94%) A5p 0.81 39 0.000
A0* A0p 0.65 88 0.000
A4 Blood Group H (95%) A4p 0.24 17 0.000
A3 A3p 0.07 5 0.983
A6 Sialyl Lewis X (98%) A6p 0.06 6 0.985
A2 Blood Group H a-Series Ganglioside (100%) A2p 0.02 4 1.000
A1 A1p 0.02 42 0.999
B0 beta-Fucose (100%) B0p 1.00 1 0.000
0 Non-Binders (100%) 0.00 407 NA
Figure 3. Boxplots of glycans grouped by motif. Individual points are glycans. Mouseover to see glycan structures.
Motif ID Nearest Common Name (Accuracy%**) Motif Graphic Structure Relative Binding Number of Glycans P-Value***
A5* Blood Group H (94%) A5p 0.67 39 0.000
A0* A0p 0.39 88 0.000
A4 Blood Group H (95%) A4p 0.20 17 0.000
A3 A3p 0.06 5 0.976
A6 Sialyl Lewis X (98%) A6p 0.03 6 1.000
A2 Blood Group H a-Series Ganglioside (100%) A2p 0.02 4 1.000
A1 A1p 0.01 42 0.999
B0 beta-Fucose (100%) B0p 1.00 1 0.000
0 Non-Binders (100%) 0.00 407 NA
Figure 3. Boxplots of glycans grouped by motif. Individual points are glycans. Mouseover to see glycan structures.
Motif ID Nearest Common Name (Accuracy%**) Motif Graphic Structure Relative Binding Number of Glycans P-Value***
A5* Blood Group H (94%) A5p 0.53 39 0.000
A0* A0p 0.16 88 0.000
A6 Sialyl Lewis X (98%) A6p 0.05 6 0.962
A4 Blood Group H (95%) A4p 0.02 17 0.995
A2 Blood Group H a-Series Ganglioside (100%) A2p 0.00 4 1.000
A1 A1p 0.00 42 1.000
A3 A3p -0.01 5 1.000
B0 beta-Fucose (100%) B0p 1.00 1 0.000
0 Non-Binders (100%) 0.00 407 NA
Figure 3. Boxplots of glycans grouped by motif. Individual points are glycans. Mouseover to see glycan structures.
Motif ID Nearest Common Name (Accuracy%**) Motif Graphic Structure Kd Number of Glycans P-Value***
A3 A3p 1.47777 2 1.000
A4 Blood Group H (95%) A4p 2.03304 14 1.000
A0* A0p 24.9195 85 1.000
A5* Blood Group H (94%) A5p 26.8774 38 1.000
A6 Sialyl Lewis X (98%) A6p 266.905 4 0.381
A1 A1p 286.954 14 0.161
A2 Blood Group H a-Series Ganglioside (100%) A2p 334.766 3 0.234
B0 beta-Fucose (100%) B0p 0.25986 1 1.000
0 Non-Binders (100%) 1.03851 3 NA

*Motif indicates the remaining glycans not matched by motifs which are a subset. Motif definition needs to be taken in the context of the model.

**Accuracy describes the consistency between common-name definition of the motif and the formal, text-based definition of the motif, in terms of percent agreement in the glycans containing the two motifs. Common Name label definitions given here.

***P-Value refers to difference from Non-Binders with multiple testing correction (Dunnet’s Test)

Motifs with a red motif ID fail to show a logistic response to protein concentration in the range of concentrations analyzed. These motifs may be nonbinding motifs (motifs which define nonbinding exceptions) or simply fail to fit. Nonbinding motifs are determined based on concentration dependent response when available or the average binding of non-motif glycans otherwise.

Motif Intensity Maps

Figure 4. Glycan intensity and motif distribution plot. The top half of the plot presents the observed glycan binding intensity of various glycans used in the array over their rank binding intensity; only the top glycans are shown. The second plot indicates the position of glycans containing the various motifs in the top plot with a yellow tick.

Treemap Diagram

*Motif indicates the remaining glycans not matched by motifs which are a subset. Motif definition needs to be taken in the context of the model.

Figure 5. Treemap of glycan binding grouped by motif and family structure. The model structure can be represented as nested boxes where box size is proportional to the number of glycans with the motif and color changes with change in average relative binding of glycans with the motif. Only three layers of data splitting are included here, though further splitting may be possible.

Model Diagram

*Motif matches the remaining glycans not matched by earlier motifs in the model.

Figure 6. Tree representation of the regression tree model trained on array data. Data flows through the tree (top-down) and is split by the various motifs. The motif used the split the data at each point has the id “family+split number” except when further split. In the case of futher splits the id of the motif used to split the data is denoted with an asterisk.

Binding Curves

Figure 7. Logistic curves fit to average motif binding and glycan binding. Curves are fit with as many parameters as possible given the data. All curves are based on the logistic curve with a fixed intercept of 0. Mouse-over to view curve info and highlight glycans/motifs on the other plot.

Figure 8. Scatterplots of logistic curve parameters for glycans (points) and motifs (text).

Motif Details

Motif ID Motif Graphic   Motif Full Graphic Kd Asymptote Slope Motif Text
A1 A1p A1Full 2.053 2566 1.623 {<2f6f><GalorGalNAc>?1-3(<2f3f4f>FucA1-2)<4f6f>GalB1-<3or4><6f><GalNAcorGlcNAc>?}
A2 A2p A2Full 1.173 4908 11.545 {<2f3f4f>FucA1-2<3f4f6f>GalB1-3<4f6f>GalNAcB}
A3 A3p A3Full 2.074 7752 1.109 {<2f4f6f><GalorGalNAc>A1-3(<2f3f4f>FucA1-2)<4f6f>GalB1-4<1f2f6f>GlcB}
A4 A4p A4Full 1.845 21430 0.942 {<2f3f4f>FucA1-2<3f4f6f>GalB1-3<6f><GalNAcorGlcNAc>?}
A5 A5p A5Full 0.352 30854 1.174 {<2f3f4f>FucA1-2GalB}
A6 A6p A6Full 2.940 8055 1.038 {<2f4f6f>GalB1-<3or4>(<2f3f4f>FucA1-<3or4>)<6f>GlcNAcB1-<3or6><4f>GalNAcA} {Spacer:Thr}
A0 A0p A0Full 0.691 31929 1.120 {<2f4f>FucA}
B0 B0p B0Full 0.259 35292 1.189 {<2f3f4f>FucB1-3<1f4f6f>GlcNAcB} {Spacer:NCCCO}

See Symbol Nomenclature for Glycans (SNFG) for complete key: https://www.ncbi.nlm.nih.gov/glycans/snfg.html