EnvyBetterHands LoCon - beta2

7/1/2024, 9:25:25 PM
A dark fantasy portrait of an elven woman with red eyes and medium brown hair, adorned with ornate jewelry and glowing arcane third eye symbol, surrounded by floating planets and purple psychic energy.
Elven woman with auburn hair and freckles, dressed in intricate medieval armor, holding a candle at a wooden table surrounded by candles and small red flowers in a fantasy workshop setting.
A detailed illustration of an Asian woman dressed in dark scholar robes with intricate gold embroidery, sitting at a desk surrounded by magical texts, scrolls, and books, deeply focused on writing and studying ancient magic under filtered light.
Blonde woman with freckles smiling and sitting outdoors wearing a maroon v-neck t-shirt and floral plated microskirt
Close-up portrait of a female warrior with blue eyes and dark hair wearing intricately decorated heavy armor, set against a softly blurred forest background.
Smiling young woman wearing a red Santa hat and embroidered festive pajamas stands outdoors at night with a snowy park and aurora borealis in the background.
Close-up portrait of an adult Spanish woman with light brown quiff hair, dark brown eyes, dark goth makeup, shiny skin, and blue gemstone earrings against a bokeh background.
Portrait of a demon girl with black horns, short black hair, striking blue eyes, wearing a black lace outfit and a white cloak with floral designs in monochrome style.
A beautiful fairy with silver twintail hair and bright blue eyes, wearing a golden tiara and a tight green dress, standing in a moonlit forest surrounded by glowing yellow butterflies and luminous wings.
A platinum blonde man in a blue suit concentrating at a desk surrounded by glowing magical symbols and ancient books in a vast library with a magical aura.
A gothic vampire woman with platinum blonde wavy hair and glowing purple eyes stands amidst a misty graveyard under a full moon, wearing a dark leather corset and cape stained with blood.
A detailed portrait of a military commander blending Napoleonic era style with cyberpunk elements, dressed in luxurious black and gold uniform, in a smoky urban background.

Recommended Prompts

nice hands,perfect hands,beautiful hands,fingernails

(masterpiece,best quality:1.3),nice hands

Recommended Negative Prompts

(extra fingers, deformed hands, polydactyl:1.5), (worst quality, low quality, poor quality, bad quality:1.35)

(deformed hands,polydactyl:1.3),(worst quality,low quality,poor quality,bad quality:1.35)
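The prompts above rely on the `(terms:weight)` emphasis grouping. As a minimal sketch, assuming you assemble prompts in a script, a small helper can keep the parentheses balanced. The helper name `weighted` is hypothetical and not part of any tool.

```python
def weighted(terms, weight=None):
    """Join terms with commas; wrap them in an (a, b:weight) group when a weight is given."""
    joined = ", ".join(terms)
    return joined if weight is None else f"({joined}:{weight})"


# Positive prompt: quality tags with emphasis, "nice hands" at no emphasis.
positive = ", ".join([
    weighted(["masterpiece", "best quality"], 1.3),
    weighted(["nice hands"]),
])

# Negative prompt: one group for hand defects, one for general quality terms.
negative = ", ".join([
    weighted(["extra fingers", "deformed hands", "polydactyl"], 1.5),
    weighted(["worst quality", "low quality", "poor quality", "bad quality"], 1.35),
])

print(positive)
print(negative)
```

Building each group from a list makes it harder to drop an opening or closing parenthesis when adjusting weights one term at a time.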

Recommended Parameters

samplers

DPM++ 2M Karras

steps

32 - 60

cfg

9.5 - 15

resolution

512x768

vae

vae-ft-mse-840000-ema-pruned.vae

other models

7th_anime_v3_B (b000309cca), revAnimated_v12 (02aecf0c7d), EnvyCuteMix01, EnvyMix_V11 (c0c4ed6b84), EnvyMix_v1 (53c86ec36e), theovercomer8sContrastFix_sd15, applesugarJam_applesugarJamV10 (3c5afac440)
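One way to keep these recommendations at hand is a small settings object that flags values outside the listed ranges. This is only a sketch: the class name `GenSettings` and the method `out_of_range` are made up for illustration, and the defaults simply take the low end of each recommended range.

```python
from dataclasses import dataclass

# Ranges copied from the recommended parameters listed above.
STEPS_RANGE = (32, 60)
CFG_RANGE = (9.5, 15.0)


@dataclass(frozen=True)
class GenSettings:
    sampler: str = "DPM++ 2M Karras"
    steps: int = 32
    cfg: float = 9.5
    width: int = 512
    height: int = 768
    vae: str = "vae-ft-mse-840000-ema-pruned.vae"

    def out_of_range(self):
        """Return the names of the settings that fall outside the recommended ranges."""
        problems = []
        if not STEPS_RANGE[0] <= self.steps <= STEPS_RANGE[1]:
            problems.append("steps")
        if not CFG_RANGE[0] <= self.cfg <= CFG_RANGE[1]:
            problems.append("cfg")
        return problems


print(GenSettings().out_of_range())                  # []
print(GenSettings(steps=20, cfg=7).out_of_range())   # ['steps', 'cfg']
```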

Recommended Hi-Res Parameters

upscaler

4x-AnimeSharp

upscale

2

steps

13 - 18

denoising strength

0.44
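With the 512x768 base resolution and an upscale factor of 2, the hi-res pass produces a 1024x1536 image. The sketch below computes that size and collects the hi-res values into a dictionary. The key names are an assumption: they are meant to match the AUTOMATIC1111 web UI txt2img API, so check them against your installed version before relying on them.

```python
BASE_WIDTH, BASE_HEIGHT = 512, 768  # base resolution from the recommended parameters
UPSCALE = 2


def hires_size(width, height, scale):
    """Return the output size after the hi-res pass scales both dimensions."""
    return int(width * scale), int(height * scale)


# Key names are assumed to match the AUTOMATIC1111 web UI txt2img API;
# they are not confirmed by this model page.
hires_payload = {
    "enable_hr": True,
    "hr_upscaler": "4x-AnimeSharp",
    "hr_scale": UPSCALE,
    "hr_second_pass_steps": 15,   # any value in the recommended 13 - 18 range
    "denoising_strength": 0.44,
}

print(hires_size(BASE_WIDTH, BASE_HEIGHT, UPSCALE))  # (1024, 1536)
```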

Tips

Strength should be set to around 1.0.

Freely mix with other LoRAs for better results.

Avoid using negative embeddings like badhandv4 for improving hands.

Experiment with dynamic thresholding for better results.

Version Highlights

Restarted training from scratch, because training on vanilla 1.5 apparently produces models that don't overcook things or change the style very much. This new version still needs more training, so it's not quite as effective as the old one, but on average it does seem to improve things a bit. It also works across a lot more models and doesn't mess with style at all, so I think this is probably the right direction to go in. I'll play around with prompting a bit and update the main description with advice.

Creator Sponsor

I'm using Lora Block Weight. I believe you can also use Additional Networks and SD Webui Lycoris.

This model is a LoCon. You MUST install the Lycoris extension for it to load.

UPDATE 4/27/2023: I've hit a training plateau so I'm in the process of adding a bunch more images to the dataset, including some more complicated stuff like intertwined fingers. I'm probably going to have to drop the learning rate some more, so things may be slower from here on. I'll keep everyone posted as things progress.

UPDATE: Prompting advice for beta 2:

  • This is a completely new train on top of vanilla Stable Diffusion 1.5. I did this based on the advice of a fellow enthusiast, and it's surprising how much more compatible it is with different models. It doesn't mess with the style of your model at all as far as I can tell, and it really only affects hands and occasionally arms, leaving everything else untouched.

  • It seems to work best at a strength of 1, although turning it up higher than that (1.5, 2, etc.) can help it on some images at the cost of making it worse on others. No need to mess with your CFG scale, as it doesn't cause things to overcook at these levels.

  • Freely mix it with other LoRAs.

  • I've had best results putting "nice hands, perfect hands" in the positive prompt (increasing the weight makes things worse), and "(extra fingers, deformed hands, polydactyl:1.5)" in the negative prompt. This is on EnvyMix v1 (and probably RevAnimated), but YMMV for other models.

  • "Bad hands" negative embeddings appear to make it worse, although I haven't tested this extensively.

  • As usual, this won't work miracles, but I do find that over a large number of images, it does make things generally better on average. Hopefully this will continue to improve with a few more nights of training.
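Since higher strengths help some images and hurt others, it can be useful to render the same seed at several strengths and compare. The sketch below only builds the prompt strings. The `<lora:name:strength>` tag syntax and the file name are assumptions; depending on which extension loads the LoCon, the tag prefix may differ (for example `lyco` instead of `lora`).

```python
# Hypothetical file name; use whatever name the LoCon has in your models folder.
NETWORK_NAME = "EnvyBetterHandsLoCon-beta2"


def network_tag(name, strength, kind="lora"):
    """Build a <kind:name:strength> prompt tag (the tag syntax is an assumption)."""
    return f"<{kind}:{name}:{strength}>"


def prompts_for_strengths(base_prompt, strengths):
    """Return one prompt per strength so the same seed can be compared across them."""
    return [f"{base_prompt}, {network_tag(NETWORK_NAME, s)}" for s in strengths]


for prompt in prompts_for_strengths("nice hands, perfect hands", [1.0, 1.5, 2.0]):
    print(prompt)
```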

Prompting advice for alpha 3 and beta 1:

  • Note that this advice is for RevAnimated 1.2. YMMV with other models.

  • It overcooks things a bit, but you need the strength set to 1.0 for it to really work well. You can work around this by reducing the CFG value to 5 or 6 or so. I've had good luck with enabling the dynamic thresholding extension and setting it to mimic CFG 5, and then I can set my CFG value to 9 or 10 and things come out fine.

  • I tried using it with another LoRA and got some pretty strange results, so YMMV there as well. Right now I'm just trying to get it to work consistently in a simple use case.

  • Oddly, I think it's regressed a bit on hands in neutral positions, but it's noticeably better at more complicated interactions, such as holding objects (which is why I have so many pictures of blacksmiths and librarians in the example images).

  • Keep your prompts simple and it tends to do better.

  • With RevAnimated, I tend to get 1 or 2 usable images out of every 8, with a bunch of other ones that are pretty close and can probably be fixed with inpainting.
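The "mimic CFG 5" workaround mentioned above can be pictured roughly as follows: guide at the high CFG value, then shrink the result so its spread matches what the lower CFG value would have produced. This is a simplified sketch of that general idea on toy numbers, not the dynamic thresholding extension's actual code. All function names are hypothetical.

```python
def cfg_combine(uncond, cond, scale):
    """Classifier-free guidance: push the unconditional prediction toward the conditional one."""
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]


def peak_deviation(values):
    """Largest absolute distance of any value from the mean of the list."""
    mean = sum(values) / len(values)
    return max(abs(v - mean) for v in values)


def mimic_cfg(uncond, cond, cfg_scale, mimic_scale):
    """Guide at cfg_scale, then rescale around the mean to match the spread at mimic_scale."""
    high = cfg_combine(uncond, cond, cfg_scale)
    low = cfg_combine(uncond, cond, mimic_scale)
    high_peak = peak_deviation(high)
    if high_peak == 0:
        return high
    ratio = peak_deviation(low) / high_peak
    high_mean = sum(high) / len(high)
    return [high_mean + (v - high_mean) * ratio for v in high]


# Toy numbers standing in for one channel of a model prediction (not real data).
uncond = [0.0, 0.1, -0.1, 0.2]
cond = [0.5, -0.3, 0.2, 0.4]
out = mimic_cfg(uncond, cond, cfg_scale=10, mimic_scale=5)
print(out)
```

The output keeps the mean of the CFG 10 result, while its extremes are no larger than those of the CFG 5 result, which is the sense in which the overcooked look is reduced.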

Prompting advice for alpha 2:

  • It's getting stronger now, and it works best around strength 1. Setting it to 1.3 like the previous version will make things look bad.

  • My negative prompt is still "(extra fingers, deformed hands:1.15), (worst quality, low quality, poor quality, bad quality:1.35)"

  • I had good luck just putting "nice hands" in the main prompt.

Prompting advice for alpha 1:

  • Your prompt should contain these words: "beautiful hands, perfect hands, fingernails". I've had the best luck with them towards the middle, and at no emphasis.

  • The alpha1 LoCon seems to work best at a strength of around 1.3 (on RevAnimated 1.1, where I'm testing it right now -- YMMV for other models)

  • Don't use negative embeddings for improving hands. When I removed badhandv4 from my negative prompt, things improved noticeably. You may want to try without any negative embeddings at all. I haven't used them for a while now.

  • My negative prompt is: "(extra fingers, deformed hands:1.15), (worst quality, low quality, poor quality, bad quality:1.35)", which I arrived at through a lot of experimentation adjusting strengths and terms one at a time. It should work decently well.

  • This all gives me hope that there's a real shot at solving hands on SD 1.5. Even with good prompting, I'm generally not getting perfect results, but things are close. I'll consider this done when it creates well-formed hands without having to add anything to the positive or negative prompt.

Now back to your regularly scheduled readme...

I'm testing the theory that maybe the reason MidJourney's hands are so much better now is that they just took the time to specifically train a network on a high quality set of pictures of hands, and literally nobody else has actually tried. This LoRA definitely isn't at MidJourney levels yet, but I've been training it overnight for several nights now and adding to the dataset where it appears deficient, and quality seems to be steadily improving. As such, I'm going to post this now so people can start using it. Consider this an early alpha -- I'll only stop updating once it stops getting better.

Example images are cherry-picked. Please don't expect this model to make all of your hand generations better. It may even make some of them worse, so you should evaluate its usefulness on a large number of images and not just one. If it works for you like it does for me, then a lot of your results should be the same or better quality (some will just be bad in different ways).
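One simple way to evaluate over many images is to generate the same seeds with and without the LoCon, mark each image as usable or not by eye, and tally the difference. The sketch below assumes that workflow. The function names are hypothetical and the ratings are placeholders made up to show the call, not real results.

```python
def usable_rate(ratings):
    """Fraction of images judged usable (True) in a list of manual ratings."""
    return sum(ratings) / len(ratings)


def compare(without_locon, with_locon):
    """Pair up ratings made on the same seeds and count per-image changes."""
    assert len(without_locon) == len(with_locon), "rate the same seeds in both runs"
    pairs = list(zip(without_locon, with_locon))
    return {
        "rate_without": usable_rate(without_locon),
        "rate_with": usable_rate(with_locon),
        "improved": sum(1 for before, after in pairs if after and not before),
        "worsened": sum(1 for before, after in pairs if before and not after),
    }


# Placeholder ratings for eight seeds, invented purely to show the call.
baseline = [False, False, True, False, False, False, False, False]
with_net = [False, True, True, False, False, False, True, False]
print(compare(baseline, with_net))
```

Counting improved and worsened images separately matters here, because the overall rate can hide cases where the LoCon fixes some hands while breaking others.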

Model Details

Model type

LoCon

Base model

SD 1.5

Model version

beta2

Model hash

ba43b0efee

Model Collection - EnvyBetterHands LoCon

Images tagged concept

Side profile of a girl with a hair bun wearing a transparent glass helmet against an orange background with delicate Japanese text above.
Portrait of a pink-skinned cyborg girl with yellow eyes and black sclera, featuring steampunk elements, mechanical limbs, an antique clock background, and a mechanical heart.
Ethereal woman with long flowing hair in a dark lace dress standing amid glowing fireflies at sunset above clouds with a glowing orb and streak of blue light in the background.

Images tagged hands

A surreal portrayal of a person's face held by two hands, with one side showing a cracked skull and the other a distressed expression, all in dark, muted tones.
A hyperrealistic detailed line-art portrait of a woman with closed eyes and hands gently touching her face, created using geometric triangular patterns in black and white.
A highly detailed blue watercolor and line-art portrait of a female face with intricate patterns and a serene expression, featuring meditative designs and realism.
A detailed, beautifully rendered elf girl with aqua eyes and colorful silver-grey twintails sitting on a wooden bench in an outdoor garden with flowers and stone pathways under soft light.
Futuristic android girl with chrome skin sitting inside a luxurious Rolls-Royce Phantom, wearing a floral designer sweater and blue jeans, with a cybernetic arm resting on a plush red leather seat and city lights visible through the window.
Close-up abstract portrait of a young woman with textured brushstrokes and fragmented geometric overlays dissipating into a muted, surreal background.
Close-up of slender hands gently touching a wine glass illuminated by warm lamplight, with a softly smiling face nearby in deep shadow.
Close up view of a stylized eye with blue shades, held delicately by two hands, reflecting a silhouette of a person inside the eye.
Close-up of a goth geisha adorned with neon glowing makeup and a holographic skull headdress, featuring intricate line art and a dark cyberpunk aesthetic in red and gold tones.
Highly detailed Mongolian warrior princess wearing white leather armour with fur collar kneeling in red cloth, set against vast Mongolian fields and mountains

Images tagged photorealistic

Floating biomechanical harvester machine platforms with rotating rings, situated in wide corn fields under foggy atmospheric light.
A colossal mechanical tiger with armor-plated limbs rests on a rusty brutalist apartment building under a foggy, green-gray sky in a dystopian urban environment, with a lone human figure standing below.
Three women wearing nautical sweaters and slacks standing together on the deck of a luxury yacht at night with Caribbean ocean and palm trees in the background.
Photorealistic image of a young Japanese woman in a colorful sundress walking on a forest path surrounded by glowing jellyfish under natural lighting.
Portrait of a gothic Japanese woman with long white hair and blue eyes wearing a black kimono with intricate silver embroidery, standing in a traditional wooden setting.
An army of mice dressed in tiny coats holding pieces of cheese, gathered on a cracked post-apocalyptic street with ruined buildings and smoke in the background, captured with dynamic lighting and film grain effect.
Photorealistic image of a girl sitting in a comfy chair in a library, surrounded by candles with warm mood lighting, reading a sheet of paper.
Detailed vector illustration of a woman with black orchid flowers and intricate floral patterns on her clothing, featuring blue eyes and elegant hair accessories.
Elegant young woman in traditional Chinese Hanfu attire with flower blossom jewelry and long dark hair, posing surrounded by pink blossoms and soft white smoke.