Stable Diffusion: Difference between revisions

Revision as of 12:17, 23 August 2022

Stable Diffusion is an open-source diffusion model for generating images from textual descriptions. Note: at the time of writing, both the software and the way people use it are developing rapidly. Take everything you read here with a grain of salt.

How to Use

beta.dreamstudio.ai: official web service
Official Github page
K DIFFUSION RETARD GUIDE: step-by-step instructions for running Stable Diffusion on Windows with the newest features.
basujindal fork: fork that uses less VRAM at the cost of speed
waifu-diffusion fork: fork that ???

Example Prompts

Prompt Design

Guidelines for creating better prompts.

Prompt Length

Be descriptive. The model does better if you give it longer, more detailed descriptions of what you want. Use redundant descriptions for parts of the prompt that you care about.

Note, however, that there is a hard limit on prompt length. Everything after a certain point (75 or 76 CLIP tokens, depending on how you count) is simply cut off. As a consequence, it is preferable to use keywords that concisely describe what you want and to avoid keywords that are unrelated to the image you want. Words that use non-ASCII Unicode characters (for example, Japanese characters) require more tokens than words that use only ASCII characters.
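The cut-off described above can be sketched as follows. This is a minimal illustration, not Stable Diffusion's actual code: naive_tokenize is a hypothetical stand-in that splits on words and punctuation, whereas the model uses CLIP's own tokenizer, so real token counts will differ. The limit of 75 is taken from the text above. The UTF-8 byte count is included only as a rough hint at why non-ASCII words tend to cost more tokens, on the assumption that the tokenizer operates on bytes.

```python
import re
from typing import Callable, List, Tuple

# Limit taken from the text above (75 or 76 depending on how you count).
MAX_PROMPT_TOKENS = 75


def naive_tokenize(prompt: str) -> List[str]:
    """Hypothetical stand-in tokenizer: words and punctuation marks.

    This is NOT CLIP's tokenizer; it only approximates the idea.
    """
    return re.findall(r"\w+|[^\w\s]", prompt)


def split_at_limit(
    prompt: str,
    tokenize: Callable[[str], List[str]] = naive_tokenize,
    limit: int = MAX_PROMPT_TOKENS,
) -> Tuple[List[str], List[str]]:
    """Return (tokens that are kept, tokens that are cut off)."""
    tokens = tokenize(prompt)
    return tokens[:limit], tokens[limit:]


def utf8_length(word: str) -> int:
    """Number of bytes the word occupies in UTF-8."""
    return len(word.encode("utf-8"))


if __name__ == "__main__":
    prompt = ", ".join(["keyword"] * 50)  # 50 words + 49 commas = 99 tokens
    kept, dropped = split_at_limit(prompt)
    print(len(kept), "kept,", len(dropped), "cut off")
    print(utf8_length("anime"), utf8_length("アニメ"))
```

To get real counts, pass the actual CLIP tokenizer as the tokenize argument in place of the stand-in; anything that ends up in the second list never reaches the model, so the keywords you care about most belong at the front of the prompt.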

Punctuation

Use it. Separating keywords with commas, periods, or even null characters ("\0") improves image quality. It's not yet clear which type of punctuation or which combination works best; when in doubt, punctuate in whatever way makes the prompt more readable to you.
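If you assemble prompts in a script, the separation can be done in one place. The helper below is a small sketch; join_keywords and its default comma separator are illustrative choices, not part of any Stable Diffusion tool.

```python
from typing import Iterable


def join_keywords(keywords: Iterable[str], separator: str = ", ") -> str:
    """Join non-empty keywords with a separator, trimming stray whitespace."""
    cleaned = [k.strip() for k in keywords if k.strip()]
    return separator.join(cleaned)


if __name__ == "__main__":
    print(join_keywords(["anime", " Studio Ghibli ", "", "light novel illustration"]))
```

Changing the separator argument makes it easy to compare commas against periods on the same keyword list.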

Emphasis

Putting a keyword in square brackets or appending an exclamation mark increases its effect. Putting a keyword in round brackets decreases its effect. Using more brackets or exclamation marks results in a stronger change.
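The notation above can be generated mechanically. The sketch below follows the convention exactly as this page describes it (square brackets or exclamation marks to strengthen, round brackets to weaken); the function names are hypothetical, and whether a given fork or frontend interprets the brackets this way is an assumption you should verify.

```python
def emphasize(keyword: str, level: int = 1, use_brackets: bool = True) -> str:
    """Strengthen a keyword with square brackets or exclamation marks."""
    if level < 0:
        raise ValueError("level must be non-negative")
    if use_brackets:
        return "[" * level + keyword + "]" * level
    return keyword + "!" * level


def deemphasize(keyword: str, level: int = 1) -> str:
    """Weaken a keyword with round brackets."""
    if level < 0:
        raise ValueError("level must be non-negative")
    return "(" * level + keyword + ")" * level


if __name__ == "__main__":
    parts = [emphasize("Studio Ghibli", 2), "landscape", deemphasize("clouds")]
    print(", ".join(parts))
```

A higher level simply nests more brackets or appends more exclamation marks, matching the statement that more of them produce a stronger change.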

Image Content

If you want your image to contain specific things, the less abstract your wording, the better. If at all possible, avoid wording that leaves room for interpretation or that requires an "understanding" of something that is not part of the image. Even concepts like "big" or "small" are problematic because they are indistinguishable from objects being close to or far from the camera. Ideally, use wording that is highly likely to appear verbatim in a caption of the image you want.

Miscellaneous

Capitalization does not matter.

Keywords

The most reliable way to find good keywords is to look at the keywords that were used to generate images similar to what you want. Below are some (unconventional) known good keywords (as determined by using each keyword as a prompt on its own, without other keywords).

"anime": generic anime-style images, looks somewhat like the 2000s. For style variations try "アニメ" (Japanese way to write anime, looks more modern), "chibi", "Kyoto Animation", "light novel illustration", "shonen", "Studio Ghibli", or "visual novel CG". Avoid "manga" and "waifu". The style keywords are listed alphabetically, in no other particular order.
"bobs and vagene": do not redeem the prompt
"ikemen": handsome Japanese men. Avoid "イケ面" (Japanese spelling).
"Gothic Lolita": frilly black dresses.
"oneshota": cute anime boys.
"Sweet Lolita": frilly pink dresses.
"Touhou", "Touhou Project": characters from the franchise. Avoid "東方".
"waifu": modern Japanese women.
"Zettai Ryouiki": short skirt in combination with stockings or socks, visible thighs. Avoid "絶対領域" (kanji spelling).
"美女", "美人": Japanese women, classical beauty standard.
"巨乳", "爆乳", "おっぱい": Japanese women with large breasts, either topless or wearing a bra.

Useful Links

GFPGAN: Tool for fixing faces
krea.ai: Website that lets you explore keywords
promptoMANIA prompt builder
clip-retrieval: Project that lets you determine the relationship between images and keywords, works in either direction. Online version here

