A red-haired sorceress wearing a large black witch hat, black corset with white lacing, and a flowing purple skirt with a slit stands outdoors at night holding a twisted wooden staff.
A blonde fairy with wings, pointy ears, and a green dress is floating at night surrounded by glowing light particles and blue butterflies.
Portrait of Miyuki Inaba with short brown hair, brown eyes, and a grey tank top, hand resting on forehead against a white background.
Portrait of a girl with white long hair and brown eyes wearing a black corset and lace collar, styled in Vanillaware art with blue flowers and feather on black background.
Photorealistic portrait of a white-haired girl with blunt bangs and brown eyes wearing a black corset and detached sleeves, surrounded by floating blue feathers and light particles against a black background.

Recommended prompt

score_9, score_8_up, score_7_up, <lora:vanillawareStyle:1>, 1girl, solo, looking at viewer, full body, light particles

Recommended negative prompt

thumbnail,3d

3d, bad anatomy, watermark

Recommended parameters

samplers

Euler a

steps

20

cfg

7

clip skip

2

resolution

768x1344

vae

sdxl_vae.safetensors

other models

vanillawareStyle (272b477439ee), ponyDiffusionFor_v10 (7ad8ce957e)

Recommended hi-res parameters

upscaler

4xUltrasharp_4xUltrasharpV10

upscale

1.2 - 2

denoising strength

0.2
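
The recommended parameters and hi-res parameters listed above could be gathered into one request body. This is only a sketch: it assumes generation through the AUTOMATIC1111 web UI `txt2img` API, which the page does not mention, and the key names follow that API as I understand it. The function name `build_txt2img_payload` is hypothetical, and the upscaler and VAE names must match whatever is installed locally.

```python
import json

def build_txt2img_payload(prompt, negative_prompt, hr_scale=1.5):
    """Collect the page's recommended settings into one request body.

    Key names are an assumption (AUTOMATIC1111 web UI txt2img API);
    the values come from the parameter list on this page.
    """
    if not 1.2 <= hr_scale <= 2.0:
        raise ValueError("hr_scale is outside the recommended 1.2 to 2.0 range")
    return {
        "prompt": prompt,
        "negative_prompt": negative_prompt,
        "sampler_name": "Euler a",
        "steps": 20,
        "cfg_scale": 7,
        "width": 768,
        "height": 1344,
        "enable_hr": True,
        "hr_upscaler": "4xUltrasharp_4xUltrasharpV10",
        "hr_scale": hr_scale,
        "denoising_strength": 0.2,
        "override_settings": {
            "CLIP_stop_at_last_layers": 2,  # clip skip
            "sd_vae": "sdxl_vae.safetensors",
        },
    }

payload = build_txt2img_payload(
    "score_9, score_8_up, score_7_up, <lora:vanillawareStyle:1>, 1girl, solo, "
    "looking at viewer, full body, light particles",
    "3d, bad anatomy, watermark",
)
print(json.dumps(payload, indent=2))
```

The sketch only builds and prints the body; sending it to a running web UI is left out on purpose.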

Tips

Compose prompts as [character traits] + [style] + [expression] + [clothing] + [camera and action] + [background], modifying as needed.

If images appear blurry, add 'thumbnail' to negative prompts and increase its weight for clarity.

Adding '3d' to negative prompts can improve results.

Including 'realistic' or 'realism' in prompts can enhance figure features.

Adjust the LoRA weight between 0.6 and 1.0 to refine character appearance.

The training of this model and the images it generates are solely for learning purposes.

I did nothing, I'm just a porter.

This model is more like a character pack, and its side effect is the style it brings.

It took more than 30 hours of repeated attempts, during which I almost gave up, but in the end, I achieved a more balanced effect. Most importantly, my training hypothesis was verified. In the future, I might organize these experiences into an article.

However, bad hands are still an issue.

Trigger word: vanillastyle

You can find example prompts in the images above.

Prompts from the previous model version mostly work too.

My prompts are basically composed in the order of [character traits] + [style] + [expression] + [clothing] + [camera and action] + [background], and you can delete or modify them as needed.
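
The ordering above can be sketched as a small helper. The function name `compose_prompt` and the example tags are illustrative assumptions (the tags are loosely based on the first example image), not the author's actual prompt.

```python
def compose_prompt(character="", style="", expression="", clothing="",
                   camera_action="", background=""):
    """Join prompt parts in the suggested order, skipping empty ones."""
    parts = [character, style, expression, clothing, camera_action, background]
    return ", ".join(p.strip() for p in parts if p and p.strip())

prompt = compose_prompt(
    character="1girl, solo, red hair",
    style="vanillastyle",
    clothing="witch hat, black corset, purple skirt",
    camera_action="full body, holding staff",
    background="outdoors, night",
)
print(prompt)
```

Leaving a part empty (here, the expression) simply drops it, which matches the advice to delete or modify parts as needed.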

If there is a particularly blurry situation, consider adding "thumbnail" to the negative prompt and increasing its weight until the image becomes clear.
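
One way to raise the weight step by step is sketched below. It assumes the `(tag:weight)` emphasis syntax used by AUTOMATIC1111-style front ends; the page does not say which UI is used, and the weight values are illustrative.

```python
def weighted(tag, weight):
    """Format a tag with (tag:weight) emphasis; a weight of 1.0 needs no wrapper."""
    return tag if weight == 1.0 else f"({tag}:{weight:g})"

def negative_prompt(thumbnail_weight=1.0, extra=("3d", "bad anatomy", "watermark")):
    """Build a negative prompt with 'thumbnail' at the given weight."""
    return ", ".join([weighted("thumbnail", thumbnail_weight), *extra])

# Try increasing weights until the image is no longer blurry.
for w in (1.0, 1.2, 1.4):
    print(negative_prompt(w))
```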

Adding '3d' to the negative prompt may give a better result, while adding tags such as 'realistic' or 'realism' to the prompt can enhance the figure's features.

Recommended weight: 0.6~1.0. Adjust as needed until the character's appearance meets your requirements.
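
To compare weights across the recommended range, a sweep like the following may help. It reuses the `<lora:name:weight>` tag form from the recommended prompt; the helper names and the 0.1 step are assumptions.

```python
def lora_tag(name, weight):
    """Format a LoRA tag in the <lora:name:weight> form used on this page."""
    return f"<lora:{name}:{weight:g}>"

def weight_sweep(base_prompt, name="vanillawareStyle",
                 weights=(1.0, 0.9, 0.8, 0.7, 0.6)):
    """Return one prompt per LoRA weight, from strongest to weakest."""
    return [f"{lora_tag(name, w)}, {base_prompt}" for w in weights]

for p in weight_sweep("1girl, solo, looking at viewer, full body"):
    print(p)
```

Generating each prompt with a fixed seed makes the effect of the weight easier to see.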

The recommended upscale value is around 1.2~2.0, with a denoising strength of 0.2.
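
For reference, the output sizes implied by that range can be worked out from the recommended 768x1344 base resolution. Snapping each side to a multiple of 8 is an assumption of this sketch, not something the page states.

```python
def hires_size(width, height, scale, multiple=8):
    """Scale a base resolution and snap each side to a multiple of `multiple`."""
    def snap(value):
        return int(round(value / multiple)) * multiple
    return snap(width * scale), snap(height * scale)

for scale in (1.2, 1.5, 2.0):
    print(scale, hires_size(768, 1344, scale))
```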

The dataset mainly focuses on the works of George Kamitani.

20240907v0.2

In this version, I tagged more images. For the rest, I removed their tags and left only the trigger word, to prevent conflicts with the carefully tagged images. (This method may be wrong.)
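
A rough sketch of that caption split is shown below. The file names and tags are hypothetical, and it assumes the trigger word is also kept at the front of the detailed captions, which the note does not confirm.

```python
TRIGGER = "vanillastyle"

def reduce_captions(captions, detailed):
    """Keep full tags for carefully tagged images; others get only the trigger word."""
    reduced = {}
    for name, tags in captions.items():
        if name in detailed:
            kept = [t for t in tags if t != TRIGGER]
            reduced[name] = [TRIGGER, *kept]  # assumption: trigger word goes first
        else:
            reduced[name] = [TRIGGER]
    return reduced

# Hypothetical dataset: one carefully tagged image, one auto-tagged image.
captions = {
    "img_001.png": ["1girl", "witch hat", "staff"],
    "img_002.png": ["1girl", "blurry", "simple background"],
}
for name, tags in reduce_captions(captions, detailed={"img_001.png"}).items():
    print(name, ", ".join(tags))
```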

During training, too many images in the dataset could not be reproduced accurately through the prompt. I tried changing various tags and retraining, with the same result. These images are also not repeated much in the dataset, so they lack continuity.

Finally, I read an article that suggested increasing the number of training repetitions for certain characters so that the model learns those images sufficiently.

So I placed all the images that appear only once in the dataset into a subfolder and set its training repetitions to 2, leaving the images that were already well learned unchanged.
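
The effect of the extra repetitions on each epoch can be estimated as follows. The `<repeats>_<name>` folder naming follows the convention of kohya-ss style trainers, which is an assumption since the trainer is not named here; the folder names, the image counts, and the repeat count of 1 for the unchanged images are all hypothetical.

```python
def parse_subset(folder_name):
    """Split a '<repeats>_<name>' folder name into (repeats, name)."""
    repeats, _, name = folder_name.partition("_")
    return int(repeats), name

def samples_per_epoch(image_counts):
    """Total samples seen per epoch: image count times repeats, per folder."""
    return sum(parse_subset(folder)[0] * count
               for folder, count in image_counts.items())

# Hypothetical counts: 80 well-learned images, 30 images that appear only once.
image_counts = {"1_well_learned": 80, "2_single_images": 30}
total = samples_per_epoch(image_counts)
boosted = parse_subset("2_single_images")[0] * image_counts["2_single_images"]
print(total, round(boosted / total, 2))
```

With these made-up counts, the repeated subfolder makes up a larger share of each epoch than its image count alone would suggest, which is consistent with the style shift described in the next paragraph.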

However, these discontinuous images have quite a few quality issues that I have not yet repaired, so increasing their training repetitions has had some impact on the overall style.

To improve the next version, the most fundamental approach is to raise the quality of the dataset. Captioning can also help: add the same tag to the slightly lower-quality images, then put that tag in the negative prompt when running the model.
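
The captioning idea could look roughly like this. The tag name `lowquality_src` and the file names are hypothetical, and this is a sketch of a planned approach rather than something the author reports having done.

```python
QUALITY_TAG = "lowquality_src"  # hypothetical tag name, not from the source

def tag_low_quality(captions, low_quality, tag=QUALITY_TAG):
    """Append a shared tag to the captions of lower-quality images."""
    tagged = {}
    for name, tags in captions.items():
        tags = list(tags)
        if name in low_quality and tag not in tags:
            tags.append(tag)
        tagged[name] = tags
    return tagged

def negative_with_quality(base_negative, tag=QUALITY_TAG):
    """Add the shared tag to a comma-separated negative prompt at inference time."""
    parts = [p.strip() for p in base_negative.split(",") if p.strip()]
    if tag not in parts:
        parts.append(tag)
    return ", ".join(parts)

captions = {"a.png": ["vanillastyle", "1girl"], "b.png": ["vanillastyle"]}
print(tag_low_quality(captions, low_quality={"b.png"}))
print(negative_with_quality("3d, bad anatomy, watermark"))
```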

20240715v0.1

This model can only be considered v0.1. It is not very easy to use, and I think tagging more of the dataset images in detail would give better results. In the future, I may gradually finish training this model.

The performance of this version is not very good; the images it generates may often appear chaotic.

I collected 100+ images as a dataset, which is still too many to tag manually. I initially used wd1.4 to tag all the images, but the tagging quality was not good (maybe I am not using it correctly, and I welcome suggestions).

Because I wanted to see results quickly, I manually tagged only the images in this dataset that match my personal preferences, so the model's output will be better for those images.

Model details

Model type

LORA

Base model

Pony

Model version

v0.1

Model hash

33ee0b8061
