Graphical-model framework for automated annotation of cell identities in dense cellular images

Abstract
Data availability
Article and author information
Metrics

Abstract

Although identifying cell names in dense image stacks is critical in analyzing functional whole-brain data enabling comparison across experiments, unbiased identification is very difficult, and relies heavily on researchers' experiences. Here we present a probabilistic-graphical-model framework, CRF_ID, based on Conditional Random Fields, for unbiased and automated cell identification. CRF_ID focuses on maximizing intrinsic similarity between shapes. Compared to existing methods, CRF_ID achieves higher accuracy on simulated and ground-truth experimental datasets, and better robustness against challenging noise conditions common in experimental data. CRF_ID can further boost accuracy by building atlases from annotated data in highly computationally efficient manner, and by easily adding new features (e.g. from new strains). We demonstrate cell annotation in C. elegans images across strains, animal orientations, and tasks including gene-expression localization, multi-cellular and whole-brain functional imaging experiments. Together, these successes demonstrate that unbiased cell annotation can facilitate biological discovery, and this approach may be valuable to annotation tasks for other systems.

Data availability

All data generated or analysed during this study are included in the manuscript and supporting files. Source data files are provided at https://github.com/shiveshc/CRF_Cell_ID.git.

Article and author information

Author details

Shivesh Chaudhary

Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, United States

Competing interests
The authors declare that no competing interests exist.
Sol Ah Lee

Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, United States

Competing interests
The authors declare that no competing interests exist.
Yueyi Li

Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, United States

Competing interests
The authors declare that no competing interests exist.
Dhaval S Patel

Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, United States

Competing interests
The authors declare that no competing interests exist.
Hang Lu

Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, GA, United States

For correspondence
hang.lu@gatech.edu

Competing interests
The authors declare that no competing interests exist.

"This ORCID iD identifies the author of this article:" 0000-0002-6881-660X

Funding

National Institutes of Health (R21DC015652)

Hang Lu

National Institutes of Health (R01NS096581)

Hang Lu

National Institutes of Health (R01GM088333)

Hang Lu

National Science Foundation (1764406)

Hang Lu

National Science Foundation (1707401)

Hang Lu

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

2,543

views
311

downloads
35

citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Article PDF

Open citations (links to open the citations from this article in various online reference manager services)

Mendeley

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Shivesh Chaudhary
Sol Ah Lee
Yueyi Li
Dhaval S Patel
Hang Lu

(2021)

Graphical-model framework for automated annotation of cell identities in dense cellular images

eLife 10:e60321.

https://doi.org/10.7554/eLife.60321

Categories and tags

Research organism

C. elegans

1. Neuroscience
Automated cell annotation in multi-cell images using an improved CRF_ID algorithm

Hyun Jee Lee, Jingting Liang ... Hang Lu

Research Advance Jan 24, 2025
Cell identification is an important yet difficult process in data analysis of biological images. Previously, we developed an automated cell identification method called CRF_ID and demonstrated its high performance in Caenorhabditis elegans whole-brain images (Chaudhary et al., 2021). However, because the method was optimized for whole-brain imaging, comparable performance could not be guaranteed for application in commonly used C. elegans multi-cell images that display a subpopulation of cells. Here, we present an advancement, CRF_ID 2.0, that expands the generalizability of the method to multi-cell imaging beyond whole-brain imaging. To illustrate the application of the advance, we show the characterization of CRF_ID 2.0 in multi-cell imaging and cell-specific gene expression analysis in C. elegans. This work demonstrates that high-accuracy automated cell annotation in multi-cell imaging can expedite cell identification and reduce its subjectivity in C. elegans and potentially other biological images of various origins.
1. Computational and Systems Biology
2. Neuroscience
Neural population dynamics underlying evidence accumulation in multiple rat brain regions

Brian DePasquale, Carlos D Brody, Jonathan W Pillow

Research Article Updated Apr 17, 2025
Accumulating evidence to make decisions is a core cognitive function. Previous studies have tended to estimate accumulation using either neural or behavioral data alone. Here, we develop a unified framework for modeling stimulus-driven behavior and multi-neuron activity simultaneously. We applied our method to choices and neural recordings from three rat brain regions—the posterior parietal cortex (PPC), the frontal orienting fields (FOF), and the anterior-dorsal striatum (ADS)—while subjects performed a pulse-based accumulation task. Each region was best described by a distinct accumulation model, which all differed from the model that best described the animal’s choices. FOF activity was consistent with an accumulator where early evidence was favored while the ADS reflected near perfect accumulation. Neural responses within an accumulation framework unveiled a distinct association between each brain region and choice. Choices were better predicted from all regions using a comprehensive, accumulation-based framework and different brain regions were found to differentially reflect choice-related accumulation signals: FOF and ADS both reflected choice but ADS showed more instances of decision vacillation. Previous studies relating neural data to behaviorally inferred accumulation dynamics have implicitly assumed that individual brain regions reflect the whole-animal level accumulator. Our results suggest that different brain regions represent accumulated evidence in dramatically different ways and that accumulation at the whole-animal level may be constructed from a variety of neural-level accumulators.
1. Biochemistry and Chemical Biology
2. Computational and Systems Biology
Allosteric modulation by the fatty acid site in the glycosylated SARS-CoV-2 spike

A Sofia F Oliveira, Fiona L Kearns ... Adrian J Mulholland

Short Report Apr 10, 2025
The spike protein is essential to the SARS-CoV-2 virus life cycle, facilitating virus entry and mediating viral-host membrane fusion. The spike contains a fatty acid (FA) binding site between every two neighbouring receptor-binding domains. This site is coupled to key regions in the protein, but the impact of glycans on these allosteric effects has not been investigated. Using dynamical nonequilibrium molecular dynamics (D-NEMD) simulations, we explore the allosteric effects of the FA site in the fully glycosylated spike of the SARS-CoV-2 ancestral variant. Our results identify the allosteric networks connecting the FA site to functionally important regions in the protein, including the receptor-binding motif, an antigenic supersite in the N-terminal domain, the fusion peptide region, and another allosteric site known to bind heme and biliverdin. The networks identified here highlight the complexity of the allosteric modulation in this protein and reveal a striking and unexpected link between different allosteric sites. Comparison of the FA site connections from D-NEMD in the glycosylated and non-glycosylated spike revealed that glycans do not qualitatively change the internal allosteric pathways but can facilitate the transmission of the structural changes within and between subunits.