PAPER_TITLE

FIRST_AUTHOR_LAST, FIRST_AUTHOR_FIRST; SECOND_AUTHOR_LAST, SECOND_AUTHOR_FIRST

U-Bench: A Comprehensive Understanding of U-Net through
100-Variant Benchmarking

Fenghe Tang, Chengqi Dong, Wenxin Ma, Zikang Xu, Heqin Zhu, Zihang Jiang, Rongsheng Wang, Yuhao Wang, Chenxu Wu, Shaohua Kevin Zhou^*

University of Science and Technology of China
^*Corresponding author

Paper Code arXiv 🤗 Datasets 🤗 Weights

Abstract

Over the past decade, U-Net has been the dominant architecture in medical image segmentation, leading to the development of thousands of U-shaped variants. Despite its widespread adoption, there is still no comprehensive benchmark to systematically evaluate their performance and utility, largely because of insufficient statistical validation and limited consideration of efficiency and generalization across diverse datasets. To bridge this gap, we present U-Bench, the first large-scale, statistically rigorous benchmark that evaluates 100 U-Net variants across 28 datasets and 10 imaging modalities. Our contributions are threefold: (1) Comprehensive Evaluation: U-Bench evaluates models along three key dimensions: statistical robustness, zero-shot generalization, and computational efficiency. We introduce a novel metric, U-Score, which jointly captures the performance-efficiency trade-off, offering a deployment-oriented perspective on model progress. (2) Systematic Analysis and Model Selection Guidance: We summarize key findings from the large-scale evaluation and systematically analyze the impact of dataset characteristics and architectural paradigms on model performance. Based on these insights, we propose a model advisor agent to guide researchers in selecting the most suitable models for specific datasets and tasks. (3) Public Availability: We provide all code, models, protocols, and weights, enabling the community to reproduce our results and extend the benchmark with future methods. In summary, U-Bench not only exposes gaps in previous evaluations but also establishes a foundation for fair, reproducible, and practically relevant benchmarking in the next decade of U-Net-based segmentation models.

Top-10 variants ranked by performance (IoU) under in-domain and zero-shot settings.

Your image here Second research result visualization

Top-10 variants ranked by efficiency (U-Score) under in-domain and zero-shot settings.

Performance analysis under varying foreground properties.

Performance trends of SOTA models over the past decade.

Statistical significance analysis against U-Net across 28 datasets across 10 modalities.

Ranking of the top 10 models

U-Score

U-Score Calculator

Select Dataset:

IoU:

Params (M):

GFLOPs:

FPS:

BibTeX

@article{tang2025u,
  title={U-Bench: A Comprehensive Understanding of U-Net through 100-Variant Benchmarking},
  author={Tang, Fenghe and Dong, Chengqi and Ma, Wenxin and Xu, Zikang and Zhu, Heqin and Jiang, Zihang and Wang, Rongsheng and Wang, Yuhao and Wu, Chenxu and Zhou, Shaohua Kevin},
  journal={arXiv preprint arXiv:2510.07041},
  year={2025}
}

Rank	BUSI (IoU)	BUSI (U-Score)	BUSBRA (IoU)	BUSBRA (U-Score)	TNSCUI (IoU)	TNSCUI (U-Score)
	RWKV-UNet	LGMSNet	RWKV-UNet	CMUNeXt	RWKV-UNet	LGMSNet
	PraNet	CMUNeXt	EViT-UNet	LGMSNet	MEGANet	LV-UNet
	Mobile U-ViT	MBSNet	CaraNet	MBSNet	MADGNet	CMUNeXt
#4	DA-TransUNet	LV-UNet	MADGNet	LV-UNet	TA-Net	MBSNet
#5	MEGANet	Mobile U-ViT	TA-Net	TinyU-Net	UACANet	TinyU-Net
#6	TransResUNet	MDSA-UNet	FAT-Net	Mobile U-ViT	EViT-UNet	Mobile U-ViT
#7	MADGNet	U-RWKV	UACANet	U-RWKV	CaraNet	DCSAU-Net
#8	CFFormer	U-KAN	FCBFormer	DCSAU-Net	Swin-umamba	U-KAN
#9	ESKNet	DCSAU-Net	MEGANet	U-KAN	FAT-Net	CFPNet-M
#10	CASCADE	CFPNet-M	AURA-Net	CFPNet-M	UTANet	MDSA-UNet

Rank	ISIC2018 (IoU)	ISIC2018 (U-Score)	SkinCancer (IoU)	SkinCancer (U-Score)
	RWKV-UNet	LV-UNet	RWKV-UNet	LGMSNet
	CFFormer	LGMSNet	DA-TransUNet	CMUNeXt
	MEGANet	Mobile U-ViT	MSLAU-Net	LV-UNet
#4	Swin-umamba	MBSNet	PraNet	U-KAN
#5	PraNet	DCSAU-Net	FCBFormer	ULite
#6	TransResUNet	CMUNeXt	EMCAD	MUCM_Net
#7	TA-Net	CFPNet-M	MCA-UNet	Mobile U-ViT
#8	CE-Net	MDSA-UNet	CaraNet	MBSNet
#9	CaraNet	RWKV-UNet	TransNorm	RWKV-UNet
#10	AURA-Net	U-RWKV	AURA-Net	Polyp-PVT

Rank	Kvasir (IoU)	Kvasir (U-Score)	Robotool (IoU)	Robotool (U-Score)
	Swin-umamba	LV-UNet	MEGANet	LV-UNet
	VMUNet	LGMSNet	RWKV-UNet	LGMSNet
	UACANet	Mobile U-ViT	AURA-Net	Mobile U-ViT
#4	CFFormer	MambaUnet	TA-Net	MBSNet
#5	RWKV-UNet	MBSNet	EViT-UNet	CMUNeXt
#6	FCBFormer	U-KAN	TransResUNet	RWKV-UNet
#7	PraNet	CMUNeXt	MADGNet	CFPNet-M
#8	CASCADE	DCSAU-Net	CE-Net	CE-Net
#9	CENet	VMUNetV2	PraNet	U-KAN
#10	MADGNet	RWKV-UNet	UACANet	TA-Net

Rank	CHASE (IoU)	CHASE (U-Score)	DRIVE (IoU)	DRIVE (U-Score)
	CMU-Net	LGMSNet	FCBFormer	TinyU-Net
	AttU-Net	CMUNeXt	MT-UNet	ULite
	U-Net	MBSNet	ColonSegNet	LFU-Net
#4	UNet3plus	U-RWKV	UTNet	SimpleUNet
#5	Perspective-Unet	TinyU-Net	ESKNet	LGMSNet
#6	UCTransNet	Mobile U-ViT	CMU-Net	CMUNeXt
#7	ESKNet	U-KAN	Swin-umamba	MBSNet
#8	ColonSegNet	UNeXt	UNet3plus	U-RWKV
#9	MT-UNet	ULite	RollingUnet	CFPNet-M
#10	Swin-umamba	LV-UNet	D-TrAttUnet	UNeXt

Rank	DSB (IoU)	DSB (U-Score)	Cell (IoU)	Cell (U-Score)
	MT-UNet	LGMSNet	EMCAD	TinyU-Net
	DoubleUNet	ULite	RWKV-UNet	LGMSNet
	TransAttUnet	U-RWKV	CASCADE	LV-UNet
#4	DCSAU-Net	MBSNet	MSLAU-Net	MBSNet
#5	UTNet	CMUNeXt	UTANet	CMUNeXt
#6	D-TrAttUnet	TinyU-Net	DDANet	SimpleUNet
#7	ESKNet	DCSAU-Net	MERIT	UNeXt
#8	AURA-Net	LFU-Net	MBSNet	U-RWKV
#9	LGMSNet	Mobile U-ViT	CENet	CFPNet-M
#10	DDANet	LV-UNet	U-Net	LFU-Net

Rank	Synapse (IoU)	Synapse (U-Score)
	CENet	MBSNet
	Swin-umambaD	TinyU-Net
	DoubleUNet	LGMSNet
#4	RWKV-UNet	CMUNeXt
#5	DDANet	SimpleUNet
#6	AttU-Net	LV-UNet
#7	EViT-UNet	CFPNet-M
#8	FCBFormer	Mobile U-ViT
#9	G-CASCADE	U-RWKV
#10	MSRFNet	MambaUnet

Rank	CHASE-STARE (IoU)	CHASE-STARE (U-Score)
	RWKV-UNet	ULite
	DS-TransUNet	LV-UNet
	SwinUNETR	SwinUNETR
#4	FCBFormer	CMUNeXt
#5	MCA-UNet	MUCM-Net
#6	UNETR	CFPNet-M
#7	DA-TransUNet	UNeXt
#8	MSRFNet	LGMSNet
#9	DoubleUNet	TinyU-Net
#10	Swin-umamba	MBSNet

More Works

Mobile U-ViT: Revisiting large kernel and U-shaped ViT for efficient medical image segmentation

Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation Booster

Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation

MambaMIM: Pre-training Mamba with State Space Token-interpolation and its Application to Medical Image Segmentation

HySparK: Hybrid Sparse Masking for Large Scale Medical Image Pre-Training

U-Bench: A Comprehensive Understanding of U-Net through 100-Variant Benchmarking