Skip to the content.

Gemma website

The Gemma website provides access to curated transcriptomic datasets and differential expression analysis results. From the main page, you can search for a specific dataset using its GSE accession number or explore a dataset using the Gemma browser, which includes advanced search options.

Dataset pages

Each dataset page in Gemma includes four main tabs: Overview, Experimental Design, Visualize Expression and Diagnostics. Below we describe the content and functionality of each.

Overview

The Overview tab summarizes key information about the dataset:

It also includes dataset tags and status indicators.

Dataset tags

There are three types of tags shown:

Dataset status

Colour-coded badges and emoticons indicate dataset quality and processing state, such as batching information and whether data have been reanalyzed. Emoticons correspond to the dataset’s GEEQ score (see Data curation for details). Hovering over the emoticon will reveal the numerical value of the score.

Differential expression analysis

Differential expression results are summarized using several visualization options:

A complete table of differential expression results (i.e. log2 fold change, t-statistics and P-value) can be downloaded for further inspection.

Experimental design

The ‘Experimental Design’ tab displays the layout of the dataset’s experimental design in a tabular format:

This view allows you to quickly understand how the study was structured.

Visualize expression

The ‘Visualize expression’ tab shows a heatmap of 20 randomly selected expressed genes (platform elements). Heatmap columns correspond to samples, while the color bars above the heatmap show the distribution of factor values for each experimental factor (including the batch, if present) across the samples. The heatmap can be redrawn for a different selection of genes by clicking on the ‘Visualize’ button.

The user has options to toggle sample name labels, switch to a line plot or download the displayed data in a tab-delimited format, by clicking on the icons at the bottom of the page.

Diagnostics

The ‘Diagnostics’ tab provides plots for assessing dataset quality:

Clicking on a plot opens a larger version with additional details, such as sample names and color legend.

This tab also includes information about removed outlier samples, which can be observed as ‘grayed’ out rows/columns in the sample correlation heatmap.

Gemma browser

The Gemma browser uses a familiar “shopping-style” interface, offering multiple ways to search for datasets. The search parameters are described in the left panel:

Behind the scenes, your selections are translated into API queries that retrieve the most relevant matches. The results are displayed in a table on the right, which lists the identified experiments along with key information for each.