Feature/save preview latent #672

ltdrdata · 2023-05-18T15:37:04Z

add SavePreviewLatent node.

use .png as container of latent.
exif=latent_tensor, pnginfo=same as saveimage -> you can load workflow on frontend
image_opt is optional: use logo.png if None -> logo.png is temporary image for testing. it must be changed to proper image.

LoadLatent

additional support for .latent.png

WASasquatch · 2023-05-18T16:08:29Z

Absolutely should not use PNG. It's a massive wasteful format, that even with optimization, will only save you anywhere form nothing to 10%.

One of the main points here was something that is tiny and can be shared, but not just a straight thumbnail because we're using a terrible format.

levaleureux · 2023-05-18T16:54:57Z

If you want to keep the data with not loss you can use .bmp format as a no compression row format.
https://en.wikipedia.org/wiki/BMP_file_format

ltdrdata · 2023-05-19T02:01:09Z

If you want to keep the data with not loss you can use .bmp format as a no compression row format. https://en.wikipedia.org/wiki/BMP_file_format

The reason I chose PNG is not because it is lossless. From a preview perspective, it is clearly a disadvantage because it sacrifices the advantages of file size. However, it is simply because the default image format in the current ComfyUI is PNG, which makes it easier to share the codebase. For example, there is no need to do any additional code work for tasks like workflow load using PNG.

Currently, it can be seen as an implementation that is close to a PoC. We can perceive the usability of both the safetensor format alone and its usage as an image container.

ltdrdata · 2023-05-19T02:05:48Z

I'm planning to improve it by applying a method demonstrated on how to decode without a VAE that was introduced last night. Instead of using a logo, I want to generate a basic thumbnail using this approach.

WASasquatch · 2023-05-19T03:34:09Z

I added PNG support to my class earlier, and also a form of PNG compression. Loss of colors heavily impacts PNG filesize. It could be applied to latent to RGB previews for further optimization. However the RGB previews of latents are pretty bad looking. Kinda worse then heavy jpeg compression. Probably why it hasn't been thougtht about for thumbnailing compression.

https://github.com/WASasquatch/ComfyLatentImage

ltdrdata · 2023-05-19T03:45:41Z

The consideration of using latent to RGB as a preview is solely intended as a convenient method for individuals who have no intention of connecting VAE for image visualization and storage. It would be much more useful than a meaningless logo, at the very least.

Furthermore, I am contemplating the idea of incorporating a marker indicating the presence of "latent" into the image, rather than simply providing it as a thumbnail.

I added PNG support to my class earlier, and also a form of PNG compression. Loss of colors heavily impacts PNG filesize. It could be applied to latent to RGB previews for further optimization. However the RGB previews of latents are pretty bad looking. Kinda worse then heavy jpeg compression. Probably why it hasn't been thougtht about for thumbnailing compression.

https://github.com/WASasquatch/ComfyLatentImage

The consideration of using latent to RGB as a preview is solely intended as a convenient method for individuals who have no intention of connecting VAE for image visualization and storage. It would be much more useful than a meaningless logo, at the very least.

Furthermore, I am contemplating the idea of incorporating a marker indicating the presence of "latent" into the image, rather than simply providing it as a thumbnail.

WASasquatch · 2023-05-19T05:15:36Z

You mean connecting a optional image to store as the preview? Shouldn't need VAE there. That is probably better than a placeholder image. My idea was an overlay of some basic information, as well as branding like link to repo for exposure since a1111 dominates all

Also I hope you know you are talking to WAS, who proposed this original idea in chat.

You should also consider all the uses. The latent to RGB image is tiny, which means a lot of viewers will be upscaling it to fit within their minimum width/height containers, leading to further degraded viewing. It should be a small image, and compressed, but also not just really bad. Civitai does this fors example to display images within their template correctly. Large image thumbnails in win11 which famously blur upscales too.

ltdrdata · 2023-05-19T06:16:51Z

You mean connecting a optional image to store as the preview? Shouldn't need VAE there. That is probably better than a placeholder image. My idea was an overlay of some basic information, as well as branding like link to repo for exposure since a1111 dominates all

Also I hope you know you are talking to WAS, who proposed this original idea in chat.

You should also consider all the uses. The latent to RGB image is tiny, which means a lot of viewers will be upscaling it to fit within their minimum width/height containers, leading to further degraded viewing. It should be a small image, and compressed, but also not just really bad. Civitai does this fors example to display images within their template correctly. Large image thumbnails in win11 which famously blur upscales too.

If the creator intentionally connects a decoded image and provides it, then it will be used as the preview. If it's not provided, then the intention is to generate a preview by simply pixelating it using latent to RGB.

The reason for making the provision of the image optional is twofold. Firstly, it allows avoiding VAE decoding unless a preview is genuinely needed for identification purposes during the intermediate process. Secondly, it enables the creator to provide high-quality previews if they desire to do so for the purpose of sharing with a large number of users.

And is the suggestion to enhance the pixelated image generated by latent to RGB for the preview by applying post-processing techniques such as blur and upscale?

WASasquatch · 2023-05-19T14:37:52Z

And is the suggestion to enhance the pixelated image generated by latent to RGB for the preview by applying post-processing techniques such as blur and upscale?

I think a small upscale (simple resize) is all that's needed. Just so other viewers don't apply their horrendous "optimized" upscalers that just make their upscale from thumbnails blurry and jpegy.

apply optimize for png attach format text for preview

ltdrdata · 2023-05-19T15:23:01Z

I applied the result of latent_to_rgb as the default preview and added a text at the bottom explaining the format called "ComfyUI LATENT." Additionally, I applied the optimize option for slight size optimization. For latent_to_rgb, I limited the size within the range of 128 to 512 for preventing meaningless high-resolution previews or excessively small previews. When intentionally providing image_opt, it is structured in a way that users are responsible for resizing to allow high-resolution output.

ltdrdata · 2023-05-19T15:56:55Z

Changed the upscale method to nearest-exact in order to achieve a more pronounced feeling of latent rawness.

morphles · 2023-05-19T20:41:38Z

I was not aware latent can be extracted like that without vae, frankly image looks quite amazing.

ltdrdata · 2023-05-20T03:01:14Z

I was not aware latent can be extracted like that without vae, frankly image looks quite amazing.

I was the same. It was possible with a very simple code provided by Comfy.

morphles · 2023-05-20T07:26:01Z

This somehow just convinces me even more that my hi res/multi-sampling idea is good :) I though latients would be something more abstract, and not directly convert-able to pixel values, thus making that multi scale combine some weird as thing.

WASasquatch · 2023-05-23T03:52:51Z

Mind just making this a plugin that hijacks latent saving in ComfyUI? I don't think Comfy is interested unfortunately.

dennwc

@ltdrdata Great idea! I'd really like this one merged eventually. Do you still plan to update the PR?

dennwc · 2024-06-02T12:47:57Z

nodes.py

+    @staticmethod
+    def save_to_file(tensor_bytes, prompt, extra_pnginfo, image, image_path):
+        compressed_data = BytesIO()
+        with zipfile.ZipFile(compressed_data, mode='w') as archive:


Maybe gzip or zstd would be better? It's unlikely that it will contain other files, right?

dennwc · 2024-06-02T13:05:49Z

nodes.py

+        with zipfile.ZipFile(compressed_data, mode='w') as archive:
+            archive.writestr("latent", tensor_bytes)
+        image = image.copy()
+        exif_data = {"Exif": {piexif.ExifIFD.UserComment: compressed_data.getvalue()}}


As an alternative to EXIF, it's also possible to write arbitrary data after the end of a PNG.

Can't find a good link at the moment, but TL;DR is: PNG decoder must stop after seeing IEND chunk, any data after that will not be read.

Thus, you could take a PNG encoded image and append a safetensors file to it. Decoder is also simple - PNG chunks encode the length, so it will just skip all of them until IEND and then will read the rest as a safetensors file. I could write a PoC encoder/decoder, if you want.

Is there any advantage over using EXIF?

It will likely has less encoding overhead.

Also, in theory, you'll still get the benefits of safetensors - the file can still be memory-mapped, since it's just added at the end. You just need to adjust the offset for tensor data. Although latents are pretty small, so I doubt it will be used that way.

ltdrdata added 3 commits May 18, 2023 23:49

support preview latent

3564ee8

clear parameter name

1ccedd5

add default image for test

499bc70

Merge branch 'Main' into feature/preview-latent

448208e

apply latent_to_rgb for default preview

1ff09e5

apply optimize for png attach format text for preview

Prevent becoming blurry for default preview image.

4489f45

Merge branch 'comfyanonymous:master' into feature/preview-latent

f7c1e8b

ltdrdata and others added 7 commits May 21, 2023 00:54

Merge branch 'comfyanonymous:master' into feature/preview-latent

b3a11be

Merge branch 'comfyanonymous:master' into feature/preview-latent

ea351c6

Merge branch 'comfyanonymous:master' into feature/preview-latent

cb7ab2f

Merge branch 'comfyanonymous:master' into feature/preview-latent

02bf3cb

Merge branch 'Main' into feature/preview-latent

261dc9c

Merge branch 'comfyanonymous:master' into feature/preview-latent

9cedbbb

Merge branch 'comfyanonymous:master' into feature/preview-latent

8a1fe96

Merge branch 'comfyanonymous:master' into feature/preview-latent

133e80f

ltdrdata added 17 commits July 25, 2023 15:12

Merge branch 'comfyanonymous:master' into feature/preview-latent

40c6832

Merge branch 'comfyanonymous:master' into feature/preview-latent

3c5286c

Merge branch 'comfyanonymous:master' into feature/preview-latent

6050c51

Merge branch 'comfyanonymous:master' into feature/preview-latent

6ba820a

Merge branch 'comfyanonymous:master' into feature/preview-latent

391eab6

Merge branch 'comfyanonymous:master' into feature/preview-latent

7c8b755

Merge branch 'comfyanonymous:master' into feature/preview-latent

48a1d6e

Merge branch 'comfyanonymous:master' into feature/preview-latent

9a7380e

Merge branch 'comfyanonymous:master' into feature/preview-latent

a12543b

Merge branch 'comfyanonymous:master' into feature/preview-latent

3700a4d

Merge branch 'comfyanonymous:master' into feature/preview-latent

d551c0e

Merge branch 'comfyanonymous:master' into feature/preview-latent

baa1f67

Merge branch 'comfyanonymous:master' into feature/preview-latent

f898003

Merge branch 'comfyanonymous:master' into feature/preview-latent

e48f0f4

Merge branch 'comfyanonymous:master' into feature/preview-latent

e8a9847

Merge branch 'comfyanonymous:master' into feature/preview-latent

479b3cd

Merge branch 'comfyanonymous:master' into feature/preview-latent

3354ad5

ltdrdata requested a review from comfyanonymous as a code owner August 15, 2023 00:24

ltdrdata added 10 commits August 15, 2023 15:06

Merge branch 'comfyanonymous:master' into feature/preview-latent

6d77cfe

Merge branch 'comfyanonymous:master' into feature/preview-latent

ff1ffc2

Merge branch 'comfyanonymous:master' into feature/preview-latent

289beec

Merge branch 'comfyanonymous:master' into feature/preview-latent

d078465

Merge branch 'comfyanonymous:master' into feature/preview-latent

f0fdc09

Merge branch 'comfyanonymous:master' into feature/preview-latent

4d7aabc

Merge branch 'comfyanonymous:master' into feature/preview-latent

1121061

Merge branch 'comfyanonymous:master' into feature/preview-latent

461d765

Merge branch 'comfyanonymous:master' into feature/preview-latent

72502aa

Merge branch 'comfyanonymous:master' into feature/preview-latent

d3e3c01

dennwc reviewed Jun 2, 2024

View reviewed changes

Merge branch 'main' into feature/preview-latent

6bf5e1e

Feature/save preview latent #672

Are you sure you want to change the base?

Feature/save preview latent #672

Conversation

ltdrdata commented May 18, 2023

Uh oh!

WASasquatch commented May 18, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

levaleureux commented May 18, 2023

Uh oh!

ltdrdata commented May 19, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ltdrdata commented May 19, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

WASasquatch commented May 19, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ltdrdata commented May 19, 2023

Uh oh!

WASasquatch commented May 19, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ltdrdata commented May 19, 2023

Uh oh!

WASasquatch commented May 19, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ltdrdata commented May 19, 2023

Uh oh!

ltdrdata commented May 19, 2023

Uh oh!

morphles commented May 19, 2023

Uh oh!

ltdrdata commented May 20, 2023

Uh oh!

morphles commented May 20, 2023

Uh oh!

WASasquatch commented May 23, 2023

Uh oh!

dennwc left a comment

Choose a reason for hiding this comment

Uh oh!

dennwc Jun 2, 2024

Choose a reason for hiding this comment

Uh oh!

dennwc Jun 2, 2024

Choose a reason for hiding this comment

Uh oh!

ltdrdata Jun 2, 2024

Choose a reason for hiding this comment

Uh oh!

dennwc Jun 2, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

WASasquatch commented May 18, 2023 •

edited

Loading

ltdrdata commented May 19, 2023 •

edited

Loading

ltdrdata commented May 19, 2023 •

edited

Loading

WASasquatch commented May 19, 2023 •

edited

Loading

WASasquatch commented May 19, 2023 •

edited

Loading

WASasquatch commented May 19, 2023 •

edited

Loading