UAlbertaVISUAL Video Image Signal Understanding and Learning Laboratory at University of Alberta

Time-Embedding U-Net for Temporally Consistent Left Ventricular Segmentation in 3D Echocardiography

Key Result Image
Key Result Image

YEAR:

2026

KEYWORDS:

Echocardiography, Neural Networks, Transformers, Time-embedding, Left-Ventricle, Image-Segmentation

Abstract

Three-dimensional (3D) echocardiography is an inherently noisy modality, but allows for capture of temporally complex cardiac motions. Existing segmentation methods often treat cardiac frames independently, resulting in inconsistent delineations across the cardiac cycle, reduced segmentation accuracy, and, in turn, impacting the estimation of key clinical metrics. To address this, our work introduces temporal positional embeddings based on the cardiac cycle to improve 3D echocardiography segmentation. By encoding cardiac phase information into sinusoidal functions, we inject temporal embeddings into the bottleneck and decoder of a U-Net. Our approach (TU-Net) outperforms state-of-the-art models, including nnU-Net and Transformer baselines, UNETR and SwinUNETR. TU-Net achieved a Dice score of 84.7% along with improved temporal consistency, evaluated over two test cases with 18 and 16 frames across the cardiac cycle. Testing this on a lightweight U-Net, which is both efficient and suitable for clinical settings, demonstrates the significance of temporal information in enhancing segmentation quality without complex models. These results highlight the potential for further improvements, not only in 3D echocardiography but also in other dynamic medical imaging modalities.

Team