Skip to content

Latest commit

 

History

History
110 lines (84 loc) · 13.3 KB

imaging-rounds-configuration-file.md

File metadata and controls

110 lines (84 loc) · 13.3 KB

looptrace imaging rounds configuration file

The imaging rounds configuration file is where, in JSON format you tell looptrace about the rounds of imaging you did during a sequential FISH experiment. It's from this file that a name becomes associated with each imaging round, rounds are conceptually distinguished from one another (e.g., blank vs. regional or local FISH), and the relationships between rounds are defined for analysis.

Imaging timepoints sequence: imagingRounds

This section specifies the time sequence of imaging rounds, with details about each round like whether it was for a regional barcode, if it was a blank, or if it was a repeat.

Guidance

  • imagingRounds must be present, and it enumerates the imaging rounds that comprise the experiment. For every entry, time is required (as a nonnegative integer), and either probe or name is required. No time value can be repeated, and the timepoints should form the sequence (0, 1, 2, ..., N-1), where N is the number of imaging rounds in the experiment. Each value in this array represents a single imaging round; each round is one of 3 cases:
    • Blank round: specify time as a nonnegative integer, and name as a string; set isBlank to true.
    • Locus FISH round: specify time as a nonnegative integer, omit isBlank or set it to false, and specify probe as a string. If this is a repeat of some earlier timepoint, specify repeat as a positive integer. name can be specified or inferred from probe and repeat; isRegional must be absent or false.
    • Regional FISH round: this is as for a locus-specific round, but ensure that isRegional is set to true, and this cannot have a repeat (omit it.)
  • For any non-regional round, repeat may be included with a natural number as the value. In that case, if name is absent (derived from probe), the suffix _repeatX will be added to the probe value to determine the name.
  • In general, if name is absent, it's set to probe value (though this is not possible for a blank round, which is why name is required when isBlank is true).
  • The set of names for the rounds of the imaging sequence must have no repeated value (including a collision of a derived value with some explicit or derived value).
  • locusGrouping can be present as a top-level key, if and only if there's one or more locus imaging rounds in the imagingRounds.
    • Each key must be a string, but specifying the timepoint of one of the regional rounds from the imagingRounds sequence.
    • Each value is a list of locus imaging timepoints associated with the regional timepoint to which it's keyed.
    • The values in the lists must be such that the union of the lists is the set of locus imaging timepoints from the imagingRounds sequence.
    • The values lists should have unique values, though as an ensemble they may not necessarily be disjoint.
  • checkLocusTimepointCovering should generally be absent or set to true (the default). If, however, your experimental design is such that you don't want to trace all of the locus-specific timepoints, then you should set this to false so that a locusGrouping section which doesn't cover all of the locus-specific timepoints in the imagingRounds section of that file won't be considered an error.

Relating regional and locus-specific timepoints: locusGrouping

This section declares how the regional and locus-specific timepoints from the imaging rounds sequence relate to one another. More specifically, for each regional barcode timepoint, you declare a corresponding collection of locus-specific imaging timepoints. This, determines which image volumes are extracted for which regional spot, and therefore how traces are generated as well as how downstream analysis and visualisation works.

This section may be absent or empty, in which case the every timepoint is fit/traced for every regional timepoint; call this "all-by-all", or "universal tracing", maybe, as a way to quickly conceptualise what omission of this section implies.

If present, however, the data in this section must comply with certain rules:

  • The data must be a JSON object (key-value) mapping.
  • Each key must be a (quoted) regional barcode imaging timepoint (corresponding to an entry in the imaging rounds section).
  • Each value must be a list (JSON array) of locus-specific timepoints, again corresponding to entries in the imaging rounds sequence. A locus imaging timepoint may be present in multiple lists, but it may not occur multiple times within the same list. The regional timepoint itself will also be traced, but it (nor anything else other than locus timepoints) may be present in any list.

If this section is present and nonempty, only the data present will be used! Any regional or locus imaging timepoint which is not present will not be traced! Note, however, that any data not present here must be present in the tracing exclusions section. Otherwise, if this section's nonempty and missing data isn't in the tracing exclusions, there will be an error.

Proper handling of the interaction between this section and the tracingExclusions is done here, with some example config files here.

Proximity filtration section: proximityFilterStrategy

This section specifies how to filter regional barcode spots for being too close together. When doing filtration, note that the threshold for minimum separation distance must be violated in all three dimensions for spots to be considered proximal, or "neighboring". This is because spots that are still well separated in at least one dimension are not really close together in how we normally conceive of distance. Note, however, that because of how this threshold is used in the filtration, it's not a "distance" in the typical Euclidean sense in which we usually think.

Note also that there's some unfortunate asymmetry between $z$ vs. $x$ and $y$ that's not analogously asymmetrically treated by the software. More specifically, although image acquisition is generally conducted in much more discrete fashion in $z$ than in $x$ and $y$ (generally far fewer planes/slices in $z$ being imaged, compared to the number of pixels which comprise the length and width of each slice), there's a single threshold value, applied identically in every dimension. This means that in general, you'll want to choose a threshold for minimum separation that's relatively small; in particular, it would make little sense to have this value be any more than about half of the "depth" (number of slices) in $z$ that you've acquired.

Other things to note:

  • Field of view matters: For obvious reason, two spots could have identical coordinates but remain even after filtration, and this would be because they're from different fields of view. The coordinate system is specific to each field of view, therefore neighbors/proximity consideration is done only within each FOV.
  • Imaging timepoint matters: Two spots from the same imaging timepoint, in the same FOV, never cancel each other for proximity; instead, this special case of proximal spots will be handled by attempting to merge the ROIs for the respective spots.

Guidance

  • proximityFilterStrategy is required and should be a mapping, with at minimum a semantic key.
  • Other key-value pairs are conditionally required or prohibited, depending on the value given for semantic.
  • The value for the semantic must be one of the following:
    • "UniversalProximityPermission": This says that no proximity-based filtration of regional spots should be done, essentially making the proximity filtration step of the pipeline a no-op. This behavior corresponds to what, in previous versions of the software, would have been achieved by setting the minimum spot sepearation distance (min_spot_dist) to 0. Because of the meaning of this value, nothing additional is permitted in the section.
    • "UniversalProximityProhibition": This says that any spots which are closer than some threshold should mutually cancel one another. This requires minimum separation distance, to be provided as a positive number for the minimumPixelLikeSeparation key.
    • "SelectiveProximityPermission": This says that spots from imaging timepoints which are grouped together may violate the minimum separation threshold. Put differently, the grouping grants permission; that is, spots from timepoints which are grouped together are given permission to violate the minimum separation threshold. As such, this requires groups in addition to minimumPixelLikeSeparation. The value for groups must be a list of lists, with each element being the integer corresponding to a regional barcode imaging round, in the imaging rounds section. This strategy is useful, for example, if you're imaging at different timepoints DNA regions that you expect to be in very close proximity (e.g., to to being very close in genomic distance).
    • "SelectiveProximityProhibition": This is as for the "SelectiveProximityPermission", but says instead that spots from timepoints which are grouped together are the ones which should mutually exclude each others' spots if they're too close. This strategy is useful, for example, when you're multiplexing and need the regional barcodes to disambiguate loci being targeted with the same locus-specific barcode (i.e., when 2 separate barcodes function jointly as a uniue identifier of a locus).
  • If you're using a strategy for which groups is specified, the value for groups must satisfy these properties:
    • No repeats: Each group must have no repeat value.
    • No overlaps: The groups must be disjoint.
    • Covering: The union of the groups must cover the set of regional round timepoints from the imagingRounds.
    • Covered: The set of regional round timepoints from the imagingRounds must cover the union of groups.

How to group regions for tracing: mergeRulesForTracing

This section is optional: grouping of spots/ROIs for tracing will be done if and only if this config section is present.

If present, this section must specify several things: * Which regional spots should be grouped together for tracing when sufficiently close (groups key) * What defines "sufficiently close" (distanceThreshold) * Whether a spot without a full complement of group members nearby should kept be discarded (requirementType) * Whether ungrouped spots should be kept or discarded (discardRoisNotInGroupsOfInterest)

The groups key maps to a list-of-lists, each list representing a group of imaging timepoints. * Each sublist must contain at least two values. * Each value must correspond to a regional timepoint declared in the imaging rounds section. * No value may be repeated, neither within a sublist nor between sublists.

The distanceThreshold should be a positive value representing a threshold on Euclidean distance. As of v0.11.0, the units are nanometers, but future versions will aim to allow specification of units. If the respective center of each of two spots are such that they're separated by less than this threshold, they will be considered part of the same structure for tracing. Proximity is transitive, such that spots can be "chained" together in a larger structure, rather than every single spot needing to be pairwise-proximal with a spot from other timepoints in its group.

The requirementType must map to either "Conjunctive", Disjunctive, or "Lackadaisical". "Conjunctive" means that any spot from a timepoint declared in a grouping must be close to a spot from each of its group members, otherwise it's discarded. "Disjunctive" relaxes this requirement so that the presence of at least one group partner's spot in close proximity is sufficient to retain the spots for analysis. "Lackadaisical" relaxes this completely, keeping any spot from a timepoint in a declared grouping, regardless of whether a spot from any group partner's is present nearby.

discardRoisNotInGroupsOfInterest must map to either true or false if present; if absent, false is the default. This indicates what to do with spots from regional timepoints which aren't in any of the declared groupings for merger for tracing.

Tracing exclusions section: tracingExclusions

This sections specifies which imaging timepoints should be ignored for chromatin fiber tracing, interlocus distance measurements, and generally other downstream analysis. Note that these timepoints will still have image volumes extracted and have a Gaussian fit done, but they won't be written to the "filtered" traces file (and therefore will not consumed by downstream analyses).

Guidance

  • tracingExclusions should generally be specified, and its value should be a list of timepoints of imaging rounds to exclude from tracing, typically the blank / pre-imaging rounds, and regional barcode rounds. The list can't contain any timepoints not seen in the values of the imagingRounds.

Examples

To see some examples, please refer to the folder for data for tests of this configuration file.