DS1 spectrogram: From Local Matches to Global Masks: Novel Instance Detection in Open-World Scenes

From Local Matches to Global Masks: Novel Instance Detection in Open-World Scenes

March 3, 20262603.03577

Authors

Qifan Zhang,Sai Haneesh Allu,Jikai Wang,Yangxiao Lu,Yu Xiang

Abstract

Detecting and segmenting novel object instances in open-world environments is a fundamental problem in robotic perception. Given only a small set of template images, a robot must locate and segment a specific object instance in a cluttered, previously unseen scene.

Existing proposal-based approaches are highly sensitive to proposal quality and often fail under occlusion and background clutter. We propose L2G-Det, a local-to-global instance detection framework that bypasses explicit object proposals by leveraging dense patch-level matching between templates and the query image.

Locally matched patches generate candidate points, which are refined through a candidate selection module to suppress false positives. The filtered points are then used to prompt an augmented Segment Anything Model (SAM) with instance-specific object tokens, enabling reliable reconstruction of complete instance masks.

Experiments demonstrate improved performance over proposal-based methods in challenging open-world settings.

Resources

Stay in the loop

Get tldr.takara.ai to Your Email, Everyday.

tldr.takara.aiHome·Daily at 6am UTC·© 2026 takara.ai Ltd

Content is sourced from third-party publications.