TY - JOUR
T1 - Semi-supervised Assisted Multi-Task Learning for Oral Optical Coherence Tomography Image Segmentation and Denoising
AU - Liao, Jinpeng
AU - Zhang, Thomas
AU - Shepherd, Simon
AU - Macluskey, Michaelina
AU - Li, Chunhui
AU - Huang, Zhihong
N1 - Publisher Copyright:
© 2025 Optica Publishing Group (formerly OSA). All rights reserved.
PY - 2025/3/1
Y1 - 2025/3/1
N2 - Optical coherence tomography (OCT) is promising to become an essential imaging tool for non-invasive oral mucosal tissue assessment, but it faces challenges like speckle noise and motion artifacts. In addition, it is difficult to distinguish different layers of oral mucosal tissues from gray level OCT images due to the similarity of optical properties between different layers. We introduce the Efficient Segmentation-Denoising Model (ESDM), a multi-task deep learning framework designed to enhance OCT imaging by reducing scan time from ∼8s to ∼2s and improving oral epithelium layer segmentation. ESDM integrates the local feature extraction capabilities of the convolution layer and the long-term information processing advantages of the transformer, achieving better denoising and segmentation performance compared to existing models. Our evaluation shows that ESDM outperforms state-of-the-art models with a PSNR of 26.272, SSIM of 0.737, mDice of 0.972, and mIoU of 0.948. Ablation studies confirm the effectiveness of our design, such as the feature fusion methods, which enhance performance with minimal model complexity increase. ESDM also presents high accuracy in quantifying oral epithelium thickness, achieving mean absolute errors as low as 5 µm compared to manual measurements. This research shows that ESDM can notably improve OCT imaging and reduce the cost of accurate oral epithermal segmentation, improving diagnostic capabilities in clinical settings.
AB - Optical coherence tomography (OCT) is promising to become an essential imaging tool for non-invasive oral mucosal tissue assessment, but it faces challenges like speckle noise and motion artifacts. In addition, it is difficult to distinguish different layers of oral mucosal tissues from gray level OCT images due to the similarity of optical properties between different layers. We introduce the Efficient Segmentation-Denoising Model (ESDM), a multi-task deep learning framework designed to enhance OCT imaging by reducing scan time from ∼8s to ∼2s and improving oral epithelium layer segmentation. ESDM integrates the local feature extraction capabilities of the convolution layer and the long-term information processing advantages of the transformer, achieving better denoising and segmentation performance compared to existing models. Our evaluation shows that ESDM outperforms state-of-the-art models with a PSNR of 26.272, SSIM of 0.737, mDice of 0.972, and mIoU of 0.948. Ablation studies confirm the effectiveness of our design, such as the feature fusion methods, which enhance performance with minimal model complexity increase. ESDM also presents high accuracy in quantifying oral epithelium thickness, achieving mean absolute errors as low as 5 µm compared to manual measurements. This research shows that ESDM can notably improve OCT imaging and reduce the cost of accurate oral epithermal segmentation, improving diagnostic capabilities in clinical settings.
UR - http://www.scopus.com/inward/record.url?scp=85219112456&partnerID=8YFLogxK
U2 - 10.1364/BOE.545377
DO - 10.1364/BOE.545377
M3 - Article
C2 - 40109516
SN - 2156-7085
VL - 16
SP - 1197
EP - 1215
JO - Biomedical Optics Express
JF - Biomedical Optics Express
IS - 3
ER -