
Save and display per-token attention maps #1866


Merged: 8 commits, Dec 10, 2022

Conversation

damian0815
Contributor

@damian0815 damian0815 commented Dec 8, 2022

This pull request enables the display of per-token attention maps after generating an image.

Done:

  • Collect and return attention maps to generate.py
  • Pass attention maps and tokens to the webUI - currently pushed as the following fields on the object emitted to the webUI socket with generationResult:
    • attentionMaps (a base64 image of size width/8 x 77*height/8), and
    • tokens (see below).
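A hypothetical sketch of how a consumer might split the stacked attentionMaps image back into per-token maps, assuming the layout implied above: 77 strips of width/8 x height/8, stacked vertically into one width/8 x 77*height/8 image. The function name and use of PIL are illustrative assumptions, not part of this PR:

```python
import base64
import io

from PIL import Image

MAX_TOKENS = 77  # CLIP's context length, per the 77*height/8 dimension above


def split_attention_maps(b64_png: str) -> list[Image.Image]:
    """Decode the base64 image and return one crop per token position.

    Assumes per-token maps are stacked vertically, top to bottom.
    """
    stacked = Image.open(io.BytesIO(base64.b64decode(b64_png)))
    strip_h = stacked.height // MAX_TOKENS
    return [
        stacked.crop((0, i * strip_h, stacked.width, (i + 1) * strip_h))
        for i in range(MAX_TOKENS)
    ]
```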

Todo:

Typical content of the tokens array, e.g. for the prompt a fluffy miyazaki dog, is ['a</w>', 'fluffy</w>', 'mi', 'yaz', 'aki</w>', 'dog']. The </w> strings represent "end-of-word". With this implementation, to match tokens to fragments of the prompt text in the input box, the frontend code will have to crawl through the prompt and do a best-fit match of these tokens against the prompt string.
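The crawl described above could look something like the following sketch (written in Python for illustration, though the frontend would implement it in TypeScript). The function name and the greedy left-to-right strategy are assumptions; a real implementation would need to handle tokenizer normalization (lowercasing, unicode folding) more carefully:

```python
def match_tokens_to_prompt(tokens: list[str], prompt: str) -> list[tuple[int, int]]:
    """Greedily align each token to a (start, end) character span in `prompt`.

    '</w>' marks end-of-word, so a token's text is everything before it.
    """
    spans = []
    pos = 0
    for token in tokens:
        text = token.removesuffix("</w>")
        # skip whitespace between words
        while pos < len(prompt) and prompt[pos].isspace():
            pos += 1
        start = prompt.lower().find(text.lower(), pos)
        if start == -1:
            # tokenizer normalization defeated the naive match; emit empty span
            spans.append((pos, pos))
            continue
        end = start + len(text)
        spans.append((start, end))
        pos = end
    return spans
```

For the example prompt, this maps 'mi', 'yaz', 'aki</w>' onto three adjacent slices of the word "miyazaki".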

@blessedcoolant
Collaborator

Generation error: AttributeError: 'CrossAttention' object has no attribute 'cached_mem_free_total'

@damian0815 damian0815 force-pushed the feat_save_attention_maps_redo branch from 8b8a43e to a054fae Compare December 9, 2022 13:03
@damian0815 damian0815 marked this pull request as ready for review December 9, 2022 14:46
@damian0815
Contributor Author

This should be merged into main ASAP, even if that means the frontend isn't using it yet.

Attention map collection is always-on in this PR, but the memory/performance impact is negligible.

@psychedelicious psychedelicious self-requested a review December 10, 2022 10:52
@psychedelicious (Collaborator) left a comment


One simple change requested: adding types to a util function. I haven't been consistent with this at all, but I want to make an effort to add types whenever possible going forward.

This is not related to this particular PR, but we will need to save the attention map data somewhere besides just the gallery. The gallery image arrays are not persisted across reloads, and even if they were, resetting localStorage would of course clear them.

So we need to think about how to store them as metadata.

@damian0815 damian0815 merged commit 786b887 into invoke-ai:main Dec 10, 2022
lstein pushed a commit that referenced this pull request Dec 10, 2022
lstein pushed a commit that referenced this pull request Dec 11, 2022
* fix for crash using inpainting model

* prevent crash due to invalid attention_maps_saver