New Tool Brings Order to Single-Cell Data Chaos

A new methodology that allows for the categorization and organisation of single-cell data has been launched. It can be used to create a harmonized dataset for the study of human health and disease.

Researchers at the Wellcome Sanger Institute, the University of Cambridge, EMBL's European Bioinformatics Institute (EMBL-EBI), and collaborators developed the tool, known as CellHint. CellHint uses machine learning to unify data produced across the world, allowing it to be accessed by the wider research community, potentially driving new discoveries.

In a new study, published today (21 December) in Cell, researchers applied CellHint to reveal underexplored connections between healthy and diseased lung cell states. They looked at eight diseases, such as interstitial lung disease and chronic obstructive pulmonary lung disease, and showed the possible benefits of this tool. They also applied CellHint to 12 tissues from 38 datasets, providing a deeply curated cross-tissue database with around 3.7 million cells.

Cellhint is freely available worldwide and was created as part of the Human Cell Atlas initiative which aims to map every cell type in the human body to transform understanding of health and disease.

Single-cell genomics enables the understanding of every cell in the context of the human body at high resolution. Currently, a challenge in assembling the diverse datasets produced by single-cell research is that there is no unified system for naming and organising data.

To address this, researchers from the Wellcome Sanger Institute, and collaborators developed CellHint, which can unify cell types produced by independent laboratories. CellHint then places the data into a defined graph that shows the relationships between cell subtypes, giving a full picture of all the cells identified across different datasets.

The team applied CellHint to current data and revealed underexplored relationships between healthy and diseased lung cell states in eight diseases. It also identified cell types in adult human hippocampus that could be of potential interest for future research.

The researchers also applied CellHint to 12 tissues from 38 datasets, providing a deeply curated cross-tissue database with around 3.7 million cells. Each cell was annotated, which is the process of labelling cells with particular information. They also showed how it can create various models for automatic cell annotation across human tissues.

CellHint stands out from other tools because it makes full use of the often inconsistent but valuable cell annotation information from individual studies, to achieve biologically-driven data integration. We are excited that with CellHint, cells from independent laboratories can be re-annotated and researchers can utilise the resulting information to put each cell into different contexts beyond the original study. We hope that this tool will greatly facilitate the reuse of molecular and cellular data and information across laboratories, potentially driving new discoveries in biology."

Dr Chuan Xu, first author from the Wellcome Sanger Institute

Dr Sarah Teichmann, senior author from the Wellcome Sanger Institute and co-founder of the Human Cell Atlas, said: "The Human Cell Atlas is creating detailed reference maps of all cells in the human body to transform our understanding of biology, health and disease, and single-cell technologies underpin this hugely ambitious project. Global collaboration and open data sharing are vital to achieve the aim of a representative Human Cell Atlas that will benefit humanity worldwide. CellHint enables the unification and sharing of single-cell data, which allows the global research community to contribute to and benefit from the ongoing research that is happening around the world, and help drive advances in health and healthcare."

Journal reference:

Xu, C., et al. (2023) Automatic cell-type harmonization and integration across Human Cell Atlas datasets. Cell.


The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of AZoLifeSciences.
Post a new comment

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.

You might also like...
KIT Researchers Uncover New Cell Type Responsible for Vascular Growth