ELITE is out! We need this as an extension. #8592

bbecausereasonss · 2023-03-13T18:11:25Z

bbecausereasonss
Mar 13, 2023

Given an image indicates the target concept (usually an object), we propose a learning-based encoder ELITE to encode the visual concept into the textual embeddings, which can be further flexibly composed into new scenes. It consists of two modules: (a) a global mapping network is first trained to encode a concept image into multiple textual word embeddings, where one primary word (w0) for well-editable concept and other auxiliary words (w1···N) to exclude irrelevant disturbances. (b) A local mapping network is further trained, which projects the foreground object into textual feature space to provide local details.

https://github.com/csyxwei/ELITE/raw/main/assets/results.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ELITE is out! We need this as an extension. #8592

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

ELITE is out! We need this as an extension. #8592

Uh oh!

bbecausereasonss Mar 13, 2023

Replies: 0 comments

bbecausereasonss
Mar 13, 2023