International Conference on Learning Representations (ICLR) 2024

SLiMe

Segment Like Me

Using just one user-annotated image with various granularities (as shown in the leftmost column), SLiMe can segment different test images in accordance with those same granularities (as depicted in the other columns).

Abstract

SLiMe: Segment Like Me

Aliasghar Khani, Saeid Asgari, Aditya Sanghi, Ali Mahdavi-Amiri, Ghassan Hamarneh

Significant strides have been made using large vision-language models, like Stable Diffusion (SD), for a variety of downstream tasks, including image editing, image correspondence, and 3D shape generation. Inspired by these advancements, we explore leveraging these extensive vision-language models for segmenting images at any desired granularity using as few as one annotated sample by proposing SLiMe. SLiMe frames this problem as an optimization task. Specifically, given a single training image and its segmentation mask, we first extract attention maps, including our novel “weighted accumulated self-attention map”, from the SD prior. Then, using the extracted attention maps, the text embeddings of Stable Diffusion are optimized such that each of them learns about a single segmented region from the training image. These learned embeddings then highlight the segmented region in the attention maps, which in turn can be used to derive the segmentation map. This enables SLiMe to segment any real-world image during inference with the granularity of the segmented region in the training image, using just one example. Moreover, leveraging additional training data when available, i.e., few-shot, improves the performance of SLiMe. We carried out an extensive set of experiments examining various design factors and showed that SLiMe outperforms other existing one-shot and few-shot segmentation methods.
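The optimization idea in the abstract can be illustrated with a deliberately tiny sketch. This is not the SLiMe implementation: it assumes each pixel is already described by a fixed feature vector (standing in for features from a frozen SD model), replaces the real cross-attention and the weighted accumulated self-attention map with a plain dot-product attention, and assumes a cross-entropy objective. All function names (`attention`, `optimize_embeddings`, `segment`) are hypothetical. It only shows the core loop: keep the features frozen, learn one embedding per segmented region from a single annotated example, then read the segmentation off the attention maps.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [v / total for v in exps]


def attention(pixel_feats, embeddings):
    """Toy attention: for each pixel, a softmax over region embeddings
    of the dot product <pixel feature, embedding>."""
    return [
        softmax([sum(f * e for f, e in zip(feat, emb)) for emb in embeddings])
        for feat in pixel_feats
    ]


def optimize_embeddings(pixel_feats, mask, num_regions, steps=200, lr=0.5, seed=0):
    """Learn one embedding per region from a single annotated example.

    The pixel features stay frozen; only the embeddings are updated, using
    gradient descent on a cross-entropy loss (an assumed objective) between
    the attention maps and the ground-truth mask labels.
    """
    rng = random.Random(seed)
    dim = len(pixel_feats[0])
    embeddings = [[rng.gauss(0.0, 0.01) for _ in range(dim)]
                  for _ in range(num_regions)]
    count = len(pixel_feats)
    for _ in range(steps):
        attn = attention(pixel_feats, embeddings)
        grads = [[0.0] * dim for _ in range(num_regions)]
        for feat, probs, label in zip(pixel_feats, attn, mask):
            for k in range(num_regions):
                # Gradient of softmax cross-entropy w.r.t. the logit of region k.
                err = probs[k] - (1.0 if k == label else 0.0)
                for d in range(dim):
                    grads[k][d] += err * feat[d] / count
        for k in range(num_regions):
            for d in range(dim):
                embeddings[k][d] -= lr * grads[k][d]
    return embeddings


def segment(pixel_feats, embeddings):
    """Assign each pixel to the region whose embedding attends to it most."""
    return [max(range(len(probs)), key=probs.__getitem__)
            for probs in attention(pixel_feats, embeddings)]


# One "training image": four pixels with 2-D features and a two-region mask.
train_feats = [[1.0, 0.1], [0.9, 0.0], [0.1, 1.0], [0.0, 0.8]]
train_mask = [0, 0, 1, 1]
learned = optimize_embeddings(train_feats, train_mask, num_regions=2)

# An unseen "test image" is segmented with the same learned embeddings.
test_feats = [[0.8, 0.2], [0.2, 0.9]]
print(segment(test_feats, learned))
```

In this sketch, adding more annotated examples (the few-shot setting) would simply mean concatenating their pixel features and mask labels before calling `optimize_embeddings`.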

Associated Researchers

Aliasghar Khani

Machine Learning Research Scientist

Saeid Asgari

Former Autodesk

Aditya Sanghi

Principal AI Research Scientist

Ali Mahdavi-Amiri

School of Computing Science, Simon Fraser University

Ghassan Hamarneh

School of Computing Science, Simon Fraser University
