The paper tackles the challenge of generalizing neural networks to domains unseen during training. The authors introduce a simple framework for domain-generalized semantic segmentation that uses language as the source of randomization. The framework rests on three key components: preserving the intrinsic robustness of CLIP through minimal fine-tuning, language-driven local style augmentation, and randomization by locally mixing the source and augmented styles during training. The approach achieves promising results on various domain generalization benchmarks.
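
The core idea of mixing source and augmented styles can be illustrated with feature-statistics manipulation in the AdaIN style. The sketch below is a minimal, hypothetical illustration of that style-mixing step, not the authors' implementation: the function name `mix_styles`, the mixing weight `alpha`, and the tensor shapes are assumptions, and the paper applies the mixing locally (patch-wise) rather than over the whole feature map as shown here for brevity.

```python
import torch

def mix_styles(feat_src: torch.Tensor, feat_aug: torch.Tensor,
               alpha: float = 0.5, eps: float = 1e-6) -> torch.Tensor:
    """Hypothetical sketch of style mixing via channel statistics.

    feat_src: (B, C, H, W) features of the source image.
    feat_aug: (B, C, H, W) features carrying a language-driven augmented style.
    alpha:    interpolation weight between the two styles.
    """
    # Channel-wise mean and std act as a proxy for "style".
    mu_s = feat_src.mean(dim=(2, 3), keepdim=True)
    std_s = feat_src.std(dim=(2, 3), keepdim=True) + eps
    mu_a = feat_aug.mean(dim=(2, 3), keepdim=True)
    std_a = feat_aug.std(dim=(2, 3), keepdim=True) + eps

    # Interpolate the statistics of the source and augmented styles.
    mu_mix = alpha * mu_s + (1 - alpha) * mu_a
    std_mix = alpha * std_s + (1 - alpha) * std_a

    # Keep the source content, re-dress it with the mixed style.
    normalized = (feat_src - mu_s) / std_s
    return normalized * std_mix + mu_mix
```

In this reading, the segmentation network is trained on features whose style statistics are randomized toward language-mined styles, which is what encourages robustness to unseen domains.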
Publication date: 30 Nov 2023
Project Page: https://astra-vision.github.io/FAMix
Paper: https://arxiv.org/pdf/2311.17922