4DN FISH Omics Format - Chromatin Tracing (FOF-CT)
Background
A key output of the (4D Nucleome) project is the open release of datasets that measure the spatial arrangement of DNA, RNA, and proteins within the human cell nucleus, thereby uncovering the functional dynamics of the genome in three dimensions and over time (referred to as 4D).
FISH Omics techniques description
Figure 1: FISH omics methods can be utilized to map chromatin structures across multiple genomic scales (Figure credit: Sarah Aufmkolk; Dekker et al., 2023, Mol.Cell.
Specific genomic sequences can be visualized by fluorescence in situ hybridization (FISH) using fluorescently labeled DNA probes, and their location in individual cells can be imaged using either traditional microscopy techniques (e.g., widefield or confocal) or super-resolution microscopy as reviewed by (Fraser et al. 2015) and (Jerkovic and Cavilli 2021).
Evolutions of these techniques expanded the number of imaged targets to multiple and potentially all DNA segments across the genome (see recent reviews: Jercovic and Cavalli 2021, Boettiger and Murphy 2020, Hu and Wang 2020, Maslova and Krasikova 2021, Zhuang 2021, and Bouwman et al. 2022).
Collectively these technologies can be called interchangeably multiplexed FISH or FISH omics, which emphasize the visualization of multiple or ideally all genomic targets, respectively. These methods provide an expanded understanding of how higher-order chromosome structure relates to transcriptional activity and cell development.
A variety of protocols have been developed in the past few years, and they can be divided into two main categories called respectively ball-and-stick or volumetric Chromatin Tracing, depending on whether the targeted genomic segment is visualized as a centroid (i.e., fitting the imaged spot of a detected fluorescent signal) or a cloud of single-molecule localizations (i.e., often rendered as a volume; see Figure 2 in Dekker et al. 2023).
FISH Omics Format rationale
Despite the advances described in the previous section, FISH-based image-data is not yet routinely made publicly available upon publication because of the sheer size of the data and the lack of community specifications for data exchange. This challenge is experienced across the bioimaging community, as a result a solution built, tested and proven in 4DN can have a wide impact all over the world.
This document describes the 4DN FISH Omics Format - Chromatin Tracing (FOF-CT), a community data format designed for capturing and exchanging the results of chromosome imaging experiments produced within the context of the 4D Nucleome project and beyond.
FOF-CT is directly compatible with several ball-and-stick FISH-omics techniques including, but not limited to, Optical Reconstruction of Chromatin Architecture (ORCA), Multiplexed Imaging of Nucleome Architectures (MINA), Hi-M, DNA Sequential Fluorescence In Situ Hybridization (seqFISH+), Oligonucleotide Fluorescent In Situ Sequencing (OligoFISSEQ), DNA Multiplexed error-robust fluorescence in situ hybridization (DNA-MERFISH), and In-situ Genomic Sequencing (IGS).
In addition, the format is designed to be consistent with the ongoing development of extensions that will encompass volumetric chromatin tracing methods, such as OligoSTORM and OligoDNA-PAINT (Belivau et al. 2017, Bintu et al. 2018, Nir et al. 2018, Luppino et al. 2020)
Note
FOF-CT is designed to work with both unmodified genomes and genomes containing INSERTIONS or DELETIONS. For guidance on capturing genomic variants using FOF-CT, refer to the instructions in :ref:core, and include a description of the custom build using the Variant Call Format (VCF) as outlined in The Variant Call Format (VCF).
FISH Omics Format summary
Figure 2: Schematic representation of 12 tables composing the Fish Omics Format for Chromatin Tracing (Figure credit: Sarah Aufmkolk).
In ball-and-stick Chromatin Tracing experiments, polymer tracing algorithms are used to string together the localization of individual DNA bright Spots to reconstruct the three-dimensional (3D) path of chromatin fibers.
In order to capture chromatin traces as well as additional metrics captured during the course of the experiment, the FOF-CT format is organized around multiple tables.
The root of the format consists of the mandatory DNA-Spot/Trace Data core table table that defines Chromatin Traces as ordered lists of individual DNA-FISH bright Spots. Additional Tables support the integration of this core table with:
Supplementary Spot properties such as:
Quality metrics.
Physical coordinates placing the Spot/Trace in the context of cellular space.
The results of multiplexed RNA-FISH experiments
Supplementary global data that is better captured at level of:
Trace (e.g., expression level of nascent RNA transcripts associated with a given Trace or overall localization of the Trace with respect to cellular or nuclear landmarks).
Cell (e.g., boundaries and volume).
Sub-cellular Region of Interest (ROI; e.g., Nuclear feature or Nucleolus).
Extracellular ROI (e.g., Tissue).
FOF-CT: Chromatin Tracing
- Background
- Tables
- Format description: overview
- Downloadable Templates
- DNA-Spot/Trace Data core table
- Spot Demultiplexing table
- Trace Data table
- RNA Spot Data table
- Spot Quality table
- RNA Spot Quality table
- Spot Biological Data table
- RNA Spot Biological Data table
- Cell Data table
- Sub-Cell ROI Data table
- Extra-Cell ROI Data table
- Cell/ROI Mapping table
- Example published datasets
- Additional Information
- Allowable value lists
- Contributors