Professor Ruibang Luo from the School of Computing and Data Science and his team worked on the research for the topic “ClairS-TO: a deep-learning method for long-read tumor-only somatic small variant calling”. The research findings were published by Nature Communications on October 31, 2025.

Details of the publication:
ClairS-TO: a deep-learning method for long-read tumor-only somatic small variant calling
Lei Chen, Zhenxian Zheng, Junhao Su, Xian Yu, Angel On Ki Wong, Jingcheng Zhang, Yan-Lam Lee & Ruibang Luo
Article in Nature Communications
https://www.nature.com/articles/s41467-025-64547-z
Abstract
Accurate detection of somatic variants in tumors is of critical importance and remains challenging. Current methods typically require matched normal samples for reliable detection, which are often unavailable in real-world research and clinical scenarios. Without a matched normal sample, more proficient algorithms are required to distinguish true somatic variants from germline variants and technical artifacts. However, existing tumor-only somatic variant callers that were designed for short-read sequencing data are not able to work well with long-read data. To fill the gap, we present ClairS-TO, a deep-learning-based method for long-read tumor-only somatic variant calling. ClairS-TO uses an ensemble of two disparate neural networks trained from the same samples but for opposite tasks—how likely/not likely a candidate is a somatic variant. Benchmarks using COLO829 and HCC1395 cancer cell lines show that ClairS-TO outperforms DeepSomatic and smrest in ONT and PacBio long-read data. ClairS-TO is also applicable to short-read data and outperforms Mutect2, Octopus, Pisces, and DeepSomatic. Extensive experiments across various sequencing coverages, variant allelic fractions, and tumor purities support that ClairS-TO is a reliable tool for somatic variant discovery. ClairS-TO is open-source, available at https://github.com/HKU-BAL/ClairS-TO.