A Molecular Atlas of the Adult Mouse Brain

Licensing

You are welcome to reuse the data in any way as long as you provide proper reference to our work.

Download coronal plates

Expand +

A coronal atlas of the molecular clusters in a vectorized format. It can be easily modified with a vector graphics editor.

This .zip folder contains 41 .pdf files: one per coronal plate et a merged file with all plates combined together. The Allen Brain Atlas is displayed on the left side and the molecular clusters as spots on the right side.

This .zip folder contains 41 .pdf files: one per coronal plate et a merged file with all plates combined together. The Allen Brain Atlas is displayed on the left side and the molecular clusters as continuous regions on the right side.

This .zip folder contains 41 .pdf files: one per coronal plate et a merged file with all plates combined together. The molecular clusters are displayed as spots on the left side and the molecular clusters as continuous regions on the right side.

info iconAll files are in vector format so they can easily be edited. Neuroanatomical definitions and borders can be selected and manually corrected. In order to do so, it is recommended to first delete the clipping masks covering the plots. On Adobe Illustrator, proceed as follow: first, Select > All. Then Object > Clipping Mask > Release

info iconSpots and regions are color coded based on their cluster assignment using the categorical color scheme. In order to facilitate selection, each cluster color was made unique by small adjustment of the RGB value. You can select all shapes belonging to a cluster (including in the legend) by using the selection tool. First select one spot, then go to Select > Same > Fill Color or use the “Select similar object” shortcut.

Download processed data

Expand +

The data in an easy to use format (matrix of raw counts, meta table, etc.) after minimal processing.

download iconThe meta table

See info +

A meta table in .tsv format with spots as rows. Rows are indexed by a unique spot identifier. Columns represent respectively:

  1. The section ID (section_index)
  2. The Medial-Lateral stereotaxic coordinate in mm (stereo_ML)
  3. The Dorsal-Ventral stereotaxic coordinate in mm (stereo_DV)
  4. The Antero-Posterior stereotaxic coordinate in mm (stereo_AP)
  5. The X coordinate on the HE picture (HE_X)
  6. The Y coordinate on the HE picture (HE_Y)
  7. The acronym of the region in which the spot was registered according to the Allen Atlas (ABA_acronym)
  8. The name of the region in which the spot was registered according to the Allen Atlas (ABA_name)
  9. The name of the region in which the spot was registered according to the Allen Atlas at a coarse scale (ABA_parent)
  10. The number of nuclei segmented in the spot (nuclei_segmented)
  11. The estimated radius of the spot in pixels (spot_radius)
  12. A boolean indicating whether the spot successfully passed the quality control (passed_QC)
  13. The ID of the cluster to which the spot was assigned (cluster_id)
  14. The name (cluster_name) of the cluster to which the spot was assigned (cluster_name)

The matrix of UMI raw counts in .tsv format. Each row corresponds to one spot and is indexed by a unique spot identifier. Each column corresponds to the gene indicated in the header.

A matrix of normalized expression in .tsv format. Data is log-normalized using the computeSumFactors from the scran package and is batch corrected for animals. Each row corresponds to one spot and is indexed by a unique spot identifier. Each column corresponds to the gene indicated in the header.

A .zip file containing microscopy images of coronal sections with Hematoxylin-and-Eosin (HE) staining. Sections are cropped around the spatial transcriptomics array.

A .tsv matrix with t-SNE and UMAP coordinates based on molecular expression, both in 2D and 3D space. The columns respectively correspond to:

  1. Cols 1,2: 2D t-SNE coordinates
  2. Cols 3,4,5: 3D t-SNE coordinates
  3. Cols 6,7: 2D UMAP coordinates
  4. Cols 8,9,10: 3D UMAP coordinates

A matrix containing the scores of every spot on each Independent Component (IC) in .tsv format. Each row corresponds to one spot and is indexed by a unique spot identifier. The 80 columns correspond to the first 80 components obtained with Independent Component Analysis (ICA).

A matrix containing the loads of every gene on each Independent Component (IC) in .tsv format. Each row corresponds to one gene used as row name. The 80 columns correspond to the first 80 components obtained with Independent Component Analysis (ICA).

info iconIf you are using R, the .tsv files can be parsed using the following instruction:
read.table(path, sep = '\t', stringsAsFactors = F, header = T, row.names = 1)

info iconParsing the expression matrix might be long due to its size. Using lazy loading will reduce initial loading time to few seconds. This can be done by calling the fread function from the data.table package or by using the vroom package.

Download raw data

Expand +

The raw data sequences in FASTQ format.

The raw data was deposited to the GEO platform. It can be accessed using the accession number GSE147747. You will find the following files for each section :

  • The 2 associated FASTQ raw files from paired-end experiment
  • The matrix of counts
  • A meta table with information about spots
  • The HE-stained microscopy image

Download intermediary data

Expand +

Intermediary data objects. They can be used to reproduce specific steps or figures using the code available on GitHub.

A directory containing all the data required to run the registration script.

download iconData for figures

See info +

A directory containing all the data required to reproduce the figures.

info iconAs most of the analysis was conducted in R, many of these objects are in .RData format.

Download ST viewer data

Expand +

The data required to load the molecular atlas in the ST Viewer software.

download iconMolecular data file

See info +

This file should be input in the ST Viewer as "data file". It contains the expression matrix with spots as rows and genes as columns. This is exactly the same file as the expression matrix of raw counts from the “Download processed data” section.

download iconSpots coordinates

See info +

This file should be input in the ST Viewer as "spot coordinates file". It contains a table in .tsv format with spots as rows. The three columns correspond to mediolateral (ML), dorsoventral (DV) and anteroposterior (AP) stereotaxic coordinates in mm from the bregma reference point.

download iconGlass brain

See info +

This file should be input in the ST Viewer as “mesh". It contains a mesh representing the brain outline provided by the Allen Institute in .obj format.

download iconClusters table

See info +

This file should be input to the ST Viewer when loading spots colors from a file. It contains a table in .tsv format with spots as rows. The two columns correspond to the cluster ID and the cluster name.

This .zip folder contains individual meta files for each section. They should be inputted in the ST Viewer as "spot coordinates file". Expression matrices and HE images can be downloaded from the GEO repository.

Download coordinates files

Expand +

The coordinates of the molecular regions for visualization purposes.

download iconCluster name table

See info +

A table in .tsv format with clusters as rows. Columns represent respectively:

  1. The cluster name (cluster_name)
  2. The cluster ID (cluster_id)
  3. The number of spots assigned to the cluster (spots_count)
  4. A boolean indicating if there is a mesh associated with this cluster (has_mesh)

There is one .obj file per molecular cluster in this .zip folder. Each file contains the triangular mesh corresponding to the cluster with coordinates being expressed in the Common Coordinate Framework 2017 from the Allen Institute. Files are indexed with an ID which can be mapped to a cluster name using the cluster name table (see top of this section). The meshes for the Allen Brain Atlas can be downloaded here.

info iconThe .obj files can be easily visualized using the open source software MeshLab. Please note that some shading might be inverted.

info iconNot all clusters have a mesh associated: applying the Support Vector Machine (SVM) classifier erased 4 small clusters.

Reference: Ortiz et al., Molecular atlas of the adult mouse brain. Sci. Adv. 6, eabb3446 (2020)
Website contact: Cantin Ortiz