Home-Download-Recalibrate-Pre-process-Annotate1-Annotate2-Coregister-Statistics
Statistical analysis
Note that this is only relevant for files which have been pre-processed, annotated and coregistered. See the previous sections for further information.
Single file
For performing statistical analysis on a single file, use the workflow file StatsWorkflow.m
in the main folder. The following sections are briefly described here with more information available in the file itself.
The code is slightly different to what has been prepared previously. The file MSData.m
contains a class definition and various methods associated with that class. On importing a file, we create an instance of the class called on object. The methods can be run on the object fairly easily.
Import data
To import the data, provide a file path and name to a coregistered .mat file. The various parts are imported and an MSData class object with name `d’ is returned. The name can be changed as you wish. When the file is imported, the tissue/background mask is taken from the coregistration stage; this can be checked in the next section and amended as required.
Normalisation
There are four options available for normalisation: 'tic' | 'vec2' | 'pqn' | 'none'
. Following this stage, a log transformation can be applied by setting the value to true | false
.
Principal components analysis
Perform PCA over either all pixels or just the tissue pixels. Images can be plot as either single components or as RGB images of 3 components.
Cross-validated predictions
The annotations extracted from the H&E image in a previous section are used, along with (if selected) a series of random background pixels. These are split into 4 groups and a leave region out cross validation is performed, whereby one ‘rectangle’ is omitted each turn and the other regions are used to predict that omitted rectangle. Results can be presented as a confusion matrix or an image.
Whole image predictions
This uses logistic regression to predict the class of all pixels in the image. A model is trained using the annotated regions and subsequently applied to all pixels.
Univariate statistical analysis
ANOVA is performed for each variable to determine differences between annotation groups (requires more than 1). The results can be output as a table sorted according to the FDR-corrected p values, which provides basic variable information such as mean intensity of the annotated regions.
Variable visualisation
Plot ion images or box plots for specific variables. Ion images can be either individual or in RGB form (i.e. 3 ions) and intensities for each ion are modified, whereby intensities above the 95th percentile are set to that value. In RGB form, the individual images are scaled between 0-1 for visualisation purposes.
Box plots can be generated in a few different ways, either classically as box and whisker plots with outliers, or as scattered plots to show the variable distribution.
Multiple files
No further information will be included to this section. Further development will be in another repository.
Home-Download-Recalibrate-Pre-process-Annotate1-Annotate2-Coregister-Statistics