SD_API_Pictures: Character json tags processing, suffix processing and translations triggers #1034

GuizzyQC · 2023-04-11T02:21:12Z

@Brawlence Moved the character json and NSFW tag processing to a separate suffix processing function. Created a function that, upon matching specific words in text model prompt or text model response will add specific tags to the SD prompt. With a populated translations.json file, this can make for a much more seamless experience than adding tags manually to the prefix when something more complex and precise is requested from SD.

The new character description request to the text model I have in there is giving good results with Alpaca in getting the character to first state what it's wearing, then describe its environment and finally itself doing an action.

Added suffixes to the SD side prompts, which can be a toggle for specific baked-in tags in the nsfw_prompts and anti_nsfw_prompts parameters. Added character self detection in prompt. On self detection, prompt change to ask character to describe its clothing and a marker is set to add positive_sd tag in character json to positive SD prompt and negative_sd tag in character json to negative SD prompt. If SD translations is toggled, extension will look into extensions/sd_api/pictures/translations.json and will add tags to SD side prompt if a string in the descriptive_word array matches a string in either the prompt sent to the text generation AI or the response from the text generation AI. This is useful, for instance, if you want the extension to recognize the word "tennis" to trigger a tennis focussed LORA in SD and add tags you would always want to be in a tennis-related image.

Added a file with sample words to SD tags translation

Fixed a bug that made the character focussed prompt be overwritten

ClayShoaf · 2023-04-11T05:30:20Z

Oh man, I went a completely different direction: #1038
I like the idea of adding Stable Diffusion tags to the character json, though. Gobbling up tokens by putting physical traits in the character description seems like a waste.

GuizzyQC · 2023-04-11T13:15:39Z

@ClayShoaf It goes a long way for weak/simple models where having it describe the character thoroughly, its clothing/accessories, its environment and then its action is too much. Offloading the character description helps a lot, and it makes sure that the character won't forget part of its self description. It allows people to carefully craft their character's appearance in Auto1111, and then copy the prompt to their character sheet and reliably get a similar looking character. And it also allows for embedding specific LORAs for the character in the character sheet.

ClayShoaf · 2023-04-11T15:30:46Z

LGTM

Tested it out and it seems to be working correctly. The only thing I would mention is that the examples in translations.json seem to rely on booru tags, but they won't really work as intended for things that haven't been tagged that way in boorus (i.e. 1girl/2boys doesn't really translate to 1ball/2rackets). Might be better to have something like this so people aren't confused:

{
"descriptive_word": ["tennis"],
"SD_positive_translation": "tennis ball, rackets, (net)",
"SD_negative_translation": ""
}

I know that's a little nitpicky and people who make translations files can edit them to work as intended.

Great work on this PR!

GuizzyQC · 2023-04-11T17:03:50Z

@ClayShoaf Noted for the translations.json file, I'm mostly copying examples from better prompters than me for the sample translations, I'm not sure what are the best examples. I do have my own tuned translations for personal usage at this point, including some that load SD LORAs (the main use for translations for me) that work very well, but I wouldn't want to put these in source code (and the LORAs wouldn't load anyway if the user doesn't have them). Unless someone has a better example, I'll use yours.

Your name detection logic in your PR just gave me the idea that it should also be possible to have the extension pull and add positive_sd and negative_sd from a second character sheet if that character is mentionned in the description or request. It would be a cool feature, but I'm not going to work on it soon because it's unlikely to give satisfactory results, as it would be subject to the standard pitfalls you often see with SD when two detailed characters are requested. What's particularly great with this field though is that we can expect that much of the work done now will get us better results later, as people hook it up with better and better models on the text gen side and on the Auto1111 side.

GuizzyQC · 2023-04-11T17:11:15Z

Another point to note: I've been testing the extension and tuning my prompts with Alpaca-7b. I would be surprised if the character self description prompt ("Describe what you are currently wearing, your environment and yourself performing the following action: ") would work significantly worse on a better model, but I haven't yet managed to get anything better to run on my paltry RTX3070 yet, so it would be great if people could test it on more recent models like Vicuna and GPT 4 x Alpaca, and with more parameters, to check that it behaves satisfactorily with them too.

@ClayShoaf

Added @ClayShoaf 's suggestions to make the sample tags more general.

@ClayShoaf

Adjusted the sample for the translations, to follow the suggestions from @ClayShoaf , and to add a LORA to the example. I prefer adding the LORA example to the readme than the translations.json file as it could cause issues if applied by users who don't have that LORA, or by users who don't edit the translations.json file.

ClayShoaf · 2023-04-11T18:16:59Z

I'm testing it a little more and I'm getting some little errors. I'll send a PR to your repo soon.

I had issues using yaml.safe_load with tabs in my JSON character so I had it switch to json.loads when it detects a json extension.

If using the sd_api_pictures extension, these tags will be forwarded directly to the prompt for Auto1111's API if the extension detects the character is sending a picture of itself.

This mostly fixes a few bugs, namely that yaml/yml files were not loading when I was testing with the Example character. There were also error messages being kicked if there was no `positive_sd` or `negative_sd` in the character's json file. I also put some boilerplate nsfw params, since I couldn't see anywhere that `params['nsfw_prompts']` was being updated.

ClayShoaf · 2023-04-11T19:30:44Z

Alright, I think I did that right. I'm still not completely familiar with how github works. Apparently forking oobabooga and your repo at the same time is not possible. I need to spend the time to sit down and do a formal educational session on how to push, pull, and merge all of this stuff correctly.

Anyways. Let me know what you think.

Removed NSFW prompts from negative suffix creation; I made this update earlier in my repo as I found this would give counterintuitive results if a user added a LORA in the NSFW parameter (the LORA would be loaded even if it is in the negative prompt).

Fixed some bugs and added nsfw_params

GuizzyQC · 2023-04-11T20:05:09Z

@ClayShoaf Excellent, I had some weird results testing that I couldn't identify and couldn't rule out as just being a quirky model or messy context/history, and I wasn't set up to inspect the final prompt sent to the language model. In hindsight, part of the action being stripped out makes a lot of sense.

I made a slight edit to the non-nsfw yes-characterfocus negative suffix to remove the nsfw tags; I had edited that out a bit earlier (maybe before you started to work on your changes) as 1) it's not in-line with the non-nsfw non-characterfocus result and 2) it can give counterintuitive results if a user puts a LORA in the nsfw_prompt (according to the Auto1111 documentation: "Lora cannot be added to the negative prompt.", so we should probably make sure we avoid users inadvertently doing it).

Moved string evaluation outside of input_modifier, changed input_modifier so that characterfocus would not interfere with picturebook / adventure mode.

GuizzyQC · 2023-04-11T21:21:52Z

Moved string evaluation outside of input_modifier, as it was interfering with picturebook / adventure mode. For now I'm leaving characterfocus outside of picturebook / adventure mode, so the entire weight of describing itself for that mode is on the language model. Maybe in the future a toggle for that mode or string evaluation for that mode could allow it to trigger characterfocus.

ClayShoaf · 2023-04-11T21:54:33Z

I'm mostly copying examples from better prompters than me for the sample translations, I'm not sure what are the best examples

A little unrelated, but I've probably done over 100K SD generations at this point. From what I've seen, a lot of the "better prompters" are very good at making one specific thing with one specific model and setup. A lot of the stuff that is included in generations is superfluous. I've done a lot of XYZ grids testing out the different popular tags, and while they have some effect it's not necessarily "better" in most cases, it works more like an extra bit of random entropy.

I am, by no means, the authoritative voice on the matter, but I have a much better understanding of how SD generation parameters/prompts work than I do, for example, the mechanics of git, haha.

I look forward to having even an XY grid for oobabooga. I would try to write it myself, but I'm worried that by the time I have something presentable, someone else will have made one that is better and all my time will have been wasted.

Moved string evaluation outside of input_modifier, as it was interfering with picturebook / adventure mode. For now I'm leaving characterfocus outside of picturebook / adventure mode, so the entire weight of describing itself for that mode is on the language model. Maybe in the future a toggle for that mode or string evaluation for that mode could allow it to trigger characterfocus.

I'm ashamed to say, I hadn't even tested picturebook mode. I don't have the code up right now, but it seems like something that could be handled with an if statement, maybe?

EDIT: I see it now. I won't have time to test it until sometime tomorrow

Brawlence

One small bug introduced with toggle_generation() (which renders force_pic and suppress_pic buttons non-operational), other comments are my personal preferences in naming and defaults.

Other than that, LGTM, great work!

Anyone else is willing to look at those changes?

characters/Example.yaml

extensions/sd_api_pictures/script.py

extensions/sd_api_pictures/README.MD

extensions/sd_api_pictures/script.py

Brawlence · 2023-04-12T08:43:19Z

Also cross-linking: #1038 (comment) .

In part I asked to factor out the logic for if_of_is_in to incorporate those future changes easier. Any thoughts on em?

Also, I really need to plan the separation of script.py into submodules as it's already too big to grasp on a glace.

Renamed is_of_is_in to string_evaluation, removed force toggle generation off in mode 2 and mode 0, changed default nsfw string to nsfw, changed check for existence of tags in character sheet to look for negative tags in negative suffix, renamed character sheet tags sd_tags_positive and sd_tags_negative

Changed sd tags positive/negative

Moved out request string generation from the string evaluation

GuizzyQC · 2023-04-12T15:25:17Z

Also cross-linking: #1038 (comment) .

In part I asked to factor out the logic for if_of_is_in to incorporate those future changes easier. Any thoughts on em?

Also, I really need to plan the separation of script.py into submodules as it's already too big to grasp on a glace.

@Brawlence I like the idea of those changes, it would help make the experience more immersive than a character being completely submissive to every request for pictures. I'm not sure of the flow of the inputs from the main text-generation-webui so I'd be out of my depth writing that for now. Of course there's also many improvements we could write for a more immersive experience around input evaluation. For now I just separated out the generation of the request to textgen from the evaluation of the input, that should make that new change you were talking around easier as you can add a case that won't toggle generation but will toggle a trigger for special inspection on the next message for terms of agreement or disagreement.

I've renamed "if_of_is_in" to a more helpful "string_evaluation". With regards to separation in submodules, it's getting to that point yes. I think input evaluation, UI, payload generation could all be separate.

GuizzyQC · 2023-04-14T14:39:06Z

I think I'm done with improvements on this PR for now, unless it's bug fixes. If it's merged the community will probably come up with other ideas for improvements. For the couple of weeks I've been playing with it, this extension has been such a game changer for local models; hopefully these improvements will take it a bit further

Would fail if character was set to None and the character was still asked to send a picture of itself.

If merged, character sheet is now best used to describe the look of the character.

Also fixed issues with picturebook mode and forcing generation not detecting translations

Hires options are made visible or invisible with toggle of HiRes

GuizzyQC · 2023-04-19T22:19:59Z

~~Fixes #1316 and obsoletes #1358~~

Fixed many bugs found by @altoiddealer and has latest improvement recommended by @Brawlence

This reverts commit e21db99.

Fixed README.md changes to reflect the change of the "nsfw" options to "secondary prompt"

GuizzyQC · 2023-04-20T16:14:52Z

Alright, so that this PR stops growing I'll hold off until merged for further adjustments.
@oobabooga Once you find this ready to merge, it should ideally be merged first, then #1358 and finally #1400 .

oobabooga · 2023-04-20T16:39:36Z

Thanks for the roadmap @GuizzyQC, I'll try to test everything today.

Will now only load translations file once per request instead of 3.

oobabooga · 2023-04-21T20:47:20Z

I think that this PR has valid changes that can improve the immersion of the SD extension, but it

Adds a lot of complexity to the extension that will make it harder for me to maintain it in the future.
Departs from the core functionality of the extension, which is to allow the LLM to generate stable diffusion prompts on its own.
Depends on modifications to the character yamls, when stable diffusion integration is not a core functionality of the web UI.

I encourage you to create your own customized fork of the extension and submit it here for others to download: https://github.com/oobabooga/text-generation-webui-extensions

GuizzyQC · 2023-04-21T22:30:16Z

Makes a lot of sense, I've moved it to its own repo and am submitting it

GuizzyQC added 3 commits April 10, 2023 21:37

Added translations.json

610352f

Added a file with sample words to SD tags translation

Update README.MD

1105794

GuizzyQC marked this pull request as ready for review April 11, 2023 02:21

Made some fixes to character focus prompt

ea4ce7d

Fixed a bug that made the character focussed prompt be overwritten

ClayShoaf mentioned this pull request Apr 11, 2023

update to make sd_api_pictures more dynamic #1038

Closed

GuizzyQC added 2 commits April 11, 2023 13:15

Update translations.json

e4a0611

Added @ClayShoaf 's suggestions to make the sample tags more general.

GuizzyQC and others added 3 commits April 11, 2023 14:29

Added support for YAML character sheets

105391e

I had issues using yaml.safe_load with tabs in my JSON character so I had it switch to json.loads when it detects a json extension.

Added fields for Stable Diffusion tags

622f017

If using the sd_api_pictures extension, these tags will be forwarded directly to the prompt for Auto1111's API if the extension detects the character is sending a picture of itself.

GuizzyQC added 2 commits April 11, 2023 15:47

Update script.py

d4175af

Removed NSFW prompts from negative suffix creation; I made this update earlier in my repo as I found this would give counterintuitive results if a user added a LORA in the NSFW parameter (the LORA would be loaded even if it is in the negative prompt).

Merge pull request #1 from ClayShoaf/patch-1

dfc14e5

Fixed some bugs and added nsfw_params

Changed string evaluation order

76cb932

Moved string evaluation outside of input_modifier, changed input_modifier so that characterfocus would not interfere with picturebook / adventure mode.

Restored default for mode

6b98c53

Brawlence reviewed Apr 12, 2023

View reviewed changes

GuizzyQC added 3 commits April 12, 2023 09:24

Changed sd tags positive/negative in README

9d0e4be

Changed sd tags positive/negative

Changed sd_tags_positive and negative in character

b54bf1c

Moved out request generation

820b31a

Moved out request string generation from the string evaluation

GuizzyQC added 4 commits April 14, 2023 12:16

Bugfix for character None

76553bb

Would fail if character was set to None and the character was still asked to send a picture of itself.

Changed Prompt Prefix label

1f736e9

If merged, character sheet is now best used to describe the look of the character.

Update script.py

ea694ab

Renamed NSFW to Secondary Prompt

467e252

Also fixed issues with picturebook mode and forcing generation not detecting translations

oobabooga added the extensions Pull requests concerning extensions and not the core functionality of the web UI. label Apr 19, 2023

GuizzyQC added 2 commits April 19, 2023 15:05

Avoid double triggering of translations

ffad133

Hires options change

e21db99

Hires options are made visible or invisible with toggle of HiRes

GuizzyQC mentioned this pull request Apr 19, 2023

SD_API_Pictures: Fix default mode setting, add upscaling, denoising and faces fixing options #1358

Closed

GuizzyQC added 3 commits April 20, 2023 00:20

Stylistic changes, fix secondary prompt trigger

ce36186

Revert "Hires options change"

749cd27

This reverts commit e21db99.

Reflect renaming of nsfw to secondary prompt

93c1613

Fixed README.md changes to reflect the change of the "nsfw" options to "secondary prompt"

Reduced file loading

cd60755

Will now only load translations file once per request instead of 3.

oobabooga closed this Apr 21, 2023

GuizzyQC mentioned this pull request Apr 21, 2023

Submitted sd_api_pictures_tag_injection oobabooga/text-generation-webui-extensions#1

Merged

GuizzyQC deleted the patch-2 branch April 22, 2023 02:58

SD_API_Pictures: Character json tags processing, suffix processing and translations triggers #1034

SD_API_Pictures: Character json tags processing, suffix processing and translations triggers #1034

Uh oh!

Conversation

GuizzyQC commented Apr 11, 2023

Uh oh!

ClayShoaf commented Apr 11, 2023

Uh oh!

GuizzyQC commented Apr 11, 2023

Uh oh!

ClayShoaf commented Apr 11, 2023

Uh oh!

GuizzyQC commented Apr 11, 2023

Uh oh!

GuizzyQC commented Apr 11, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ClayShoaf commented Apr 11, 2023

Uh oh!

ClayShoaf commented Apr 11, 2023

Uh oh!

GuizzyQC commented Apr 11, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

GuizzyQC commented Apr 11, 2023

Uh oh!

ClayShoaf commented Apr 11, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Brawlence left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Brawlence commented Apr 12, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

GuizzyQC commented Apr 12, 2023

Uh oh!

GuizzyQC commented Apr 14, 2023

Uh oh!

GuizzyQC commented Apr 19, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

GuizzyQC commented Apr 20, 2023

Uh oh!

oobabooga commented Apr 20, 2023

Uh oh!

oobabooga commented Apr 21, 2023

Uh oh!

GuizzyQC commented Apr 21, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

GuizzyQC commented Apr 11, 2023 •

edited

Loading

GuizzyQC commented Apr 11, 2023 •

edited

Loading

ClayShoaf commented Apr 11, 2023 •

edited

Loading

Brawlence left a comment •

edited

Loading

Brawlence commented Apr 12, 2023 •

edited

Loading

GuizzyQC commented Apr 19, 2023 •

edited

Loading