October 21, 2023

Putting the Object Back into Video Object Segmentation

The paper introduces Cutie, a video object segmentation (VOS) network that reads memory at the object level. Previous VOS systems rely on bottom-up, pixel-level memory reading; Cutie instead takes a top-down approach in which object queries act as a high-level summary of the target object. This makes it easier to separate the foreground object from the background. On the challenging MOSE dataset, the authors report significant improvements over other methods.
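To make the contrast between the two reading styles concrete, here is a toy sketch in plain Python. It is not Cutie's architecture or code: the function names (`attend`, `pixel_level_read`, `object_level_read`), the two-channel foreground/background features, and the tiny example values are all assumptions chosen for illustration. It only shows the general idea of pixels matching memory directly versus object queries summarizing the pixel readout and feeding that summary back.

```python
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def attend(queries, keys, values):
    """Scaled dot-product attention: each query gets a weighted average of values."""
    scale = math.sqrt(len(keys[0]))
    out = []
    for q in queries:
        weights = softmax([dot(q, k) / scale for k in keys])
        out.append([sum(w * v[i] for w, v in zip(weights, values))
                    for i in range(len(values[0]))])
    return out

def pixel_level_read(frame_keys, memory_keys, memory_values):
    # Bottom-up: every pixel of the current frame matches memory pixels directly.
    return attend(frame_keys, memory_keys, memory_values)

def object_level_read(object_queries, pixel_features):
    # Top-down: the object queries first summarize the pixel readout...
    summaries = attend(object_queries, pixel_features, pixel_features)
    # ...then each pixel is refined by attending to those summaries.
    refined = attend(pixel_features, summaries, summaries)
    return summaries, refined

# Hypothetical toy data: two memory pixels, three pixels in the current frame.
memory_keys = [[1.0, 0.0], [0.0, 1.0]]
memory_values = [[1.0, 0.0], [0.0, 1.0]]  # channels: (foreground, background)
frame_keys = [[0.9, 0.1], [0.1, 0.9], [0.5, 0.5]]

readout = pixel_level_read(frame_keys, memory_keys, memory_values)

# One query that prefers the foreground channel, one that prefers the background.
object_queries = [[4.0, 0.0], [0.0, 4.0]]
summaries, refined = object_level_read(object_queries, readout)
```

In this sketch the ambiguous third pixel gets an evenly split pixel-level readout, since it matches both memory pixels equally; the object-level step then mixes in the per-object summaries. A real system would use learned projections and many more queries and pixels.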

Publication date: 19 Oct 2023
Project Page: hkchengrex.github.io/Cutie
Paper: https://arxiv.org/pdf/2310.12982

Tags: Cutie, object transformer, object-level memory reading, pixel-level memory reading, video object segmentation
