[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>108076014

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: 00003-49464576.jpg (1.18 MB, 2560x2048)
1.18 MB
1.18 MB JPG
>>
>>108078691
>same gartbage collage
at least its a fagollage I guess
where the fuck are the 1 girls faggot
>>
>>108078782
>>108078788
It's "@pigeon666, masterpiece, realistic" and the description of everything you see in the picture there. Now what?
>noo I need the entire prompt
Well I'm not giving it to, that's my OC donut steal, I'm not donating my OC to your dirty grabby hands. This is the bit that's relevant to the style
>>
File: 0158331.png (1.15 MB, 928x1120)
1.15 MB
1.15 MB PNG
>>
File: 45246.png (1.71 MB, 928x1120)
1.71 MB
1.71 MB PNG
>>108078817
was meant to be like this
>>
>>108078817
good gen but
>no slight pantyshot
sad
>>
File: 1.jpg (394 KB, 1320x1568)
394 KB
394 KB JPG
>>108078817
See how soulless that style is compared to the awesome style of the old models I use
>>
>>
File: 10523217.png (1.56 MB, 1024x1024)
1.56 MB
1.56 MB PNG
>>
File: 00005-3411327429.jpg (2.32 MB, 2560x2048)
2.32 MB
2.32 MB JPG
>>
>>108078813
>no qwen boilerplate
>highly simplistic prompt
Duh it "looks good" on whatever slopmix. With raw finetunes such as Anima you need to be autistic with your prompt. I'm assuming you weren't one who used regular Noob and instead preferred some downstream mix.

It's fine to prefer slopmixes because they are easier to handle, just don't whine about not getting good outputs from raw finetunes because you only use very simply prompts.

You will call this cope but the real cope is needing to wait for someone to mix a bunch of models together so that "1girl, standing" doesn't look like ass.
>>
>>
>>108078863
without metadata its pretty pointless to continue honestly
>>
>forced tags
>@
>"you are a helpful..."
tripleslop
>>
>>108078856
yo who is this?
>>
>>108078863 (cont)
The entire idea of mixes is a tradeoff between ease of use and elasticity/expression, this is not a controversial concept. It is simply a fact.
>>108078882
Combined with the fact that they aren't even the same resolution. But I'm guessing it's moreso stubbornness than purposeful trolling. I saw the same thing when anon would refuse to acknowledge that Illust was better than pony.
>>
File: 9357.png (3.16 MB, 1088x1904)
3.16 MB
3.16 MB PNG
>>
>>108078845
how can u like this illuslop
u have shit taste
kill urself
>>
>>108078897
jay effkay
>>
File: i4u-kero.jpg (1.86 MB, 1800x1260)
1.86 MB
1.86 MB JPG
>>
>>108078863
>no qwen boilerplate
NTA, but do people really? It's making no noticeable difference in my testing so far, other than feeling stupid.
>>
File: 1759446052394249.png (3.65 MB, 1152x1888)
3.65 MB
3.65 MB PNG
okaeri!!!!!!!!
>>
File: 1764651025344789.png (3.35 MB, 2016x1120)
3.35 MB
3.35 MB PNG
yuribros!!
>>
>>108078913
>>>/g/dalle
>>
>>108078845
This is more of a GPU RNG vs CPU one, GPU RNG is superior at expression
>>
File: 1766896355957496.png (3.98 MB, 1728x1248)
3.98 MB
3.98 MB PNG
today its frieren friday
>>
File: 1753276062350623.png (3.86 MB, 1152x1888)
3.86 MB
3.86 MB PNG
>>
>>108078914
The more complicated the prompt the more the effect is noticed.
>>
File: 1769682403629668.jpg (655 KB, 1536x1536)
655 KB
655 KB JPG
>>
>>108078920
>>
File: 1754847444097549.png (3.69 MB, 1216x1824)
3.69 MB
3.69 MB PNG
ready for the stark date
>>
>>108078934
Is that still the case for tag-style prompts?
>>
>>108078955
Yeah. I don't use any NLP with Anima anyway. But my prompts are still paragraphs long.
>>
File: 1.jpg (296 KB, 1320x1568)
296 KB
296 KB JPG
>>108078863
>simple prompts
I want the style of the artist, dumbass. I don't want that
>digital painting, highly detailed, cinematic lighting, sharp focus, concept art, trending on artstation, award winning, unreal engine 5, deviantart, octorender, 8k, 4k, 16k, alphonse mucha, ilya kuvshinov, artgerm, greg rutkowski, magic the gathering art, d & d character
prompt that you think is the peak of style, I'm just not interested.
>>108078905
If this is slop lock me up in a pigsty
>>108078898
>he thinks style changes based on resolution
You have literally never used a booru model have you
>>
>>108079005
illuslop is 100% recognizable because all illu gens have the same style/shading
its uncanny
SLOP
>>
>>108079005
The idea of longer prompts extends to the overall ability and adherence of a model. Again, it is not new information that raw finetunes necessitate autistically tagged verbose prompts. If you do not wish to take advantage of the elasticity provided by non mixed models than that is your prerogative. The situation you find yourself in, preferring old mixes over newer better finetunes, is neither new or unique - given enough time, unless a better model drops before, I'm sure someone will release a mix you will be happy with. That's just how model timelines go.

>You have literally never used a booru model have you
It's just one more thing to knock you on, it's not a true 1:1 comparison.

I won't dog on you for preferring slopmix styles but you shouldn't be surprised when very simple prompts don't "look good" on non mixed models. Again slopmixes are unironically predicated on being easier to use at the expense of a stronger "default" (read: slop) style.
>>
>108079093
nice llm reply shill
>>
>>108079116
It is a little humorous considering this same kind of conversation took place during the early days of Illust, but unfortunately you are incorrect
>>
File: 1740104729141163.png (1.78 MB, 1312x1568)
1.78 MB
1.78 MB PNG
>>108079005
>>
>>108078944
>dongload
>>
>>108078928
Plastic ears
>>
https://github.com/sdbds/ACE-Step-1.5-for-windows/tree/qinglong?tab=readme-ov-file#-installation

acestep 1.5 with cover functionality, get the portable zip, like comfyui portable.
>>
File: o_00197_.png (1.55 MB, 1280x768)
1.55 MB
1.55 MB PNG
>>
I don't think cutting out artist styles is a good idea but I don't get why people care about a model having too much of a specific look. Every single model does that, even SaaS slop like NBP that probably trained on everything. I think the future is going to be a bunch of artists training their own individual finetunes to be nothing but their own style rather than there being a single "do everything" model. At least once the current generation of anti-AI luddites die off.
>>
>>108079264
>native Comfy nodes are still crap
>kana112233/ComfyUI-kaola-ace-step doesn't work
Might as well try this.
>>
>>108079276
>I don't get why people care about a model having too much of a specific look
it makes the model less interesting. predictable.
>>
>>108079300
But all AI models do this like I said. Complaining about it seems a little pointless and it's probably going to become the norm in the future because copying someone else's style is even less interesting.
>>
>>108079276
Okay, catjak.
>>
uh oh meltie incoming
>>
>>108079295
try it, there is a shitload more functionality than the comfy workflow. also not that large (zip)
>>
>>108079351
It isn't but it's downloading all safetensors again. Fucking hell.
>>
>>108079361
it's worth it for the cover/repaint options, pretty funny even though im still figuring it out.
>>
File: KekstoneDoesItAgain.jpg (3.75 MB, 2496x1664)
3.75 MB
3.75 MB JPG
>>
>>108079320
some have less a default style than others, i dont get why youd want more of a specific default look even if you assume its inevitable, which i disagree with
>>
>antichroma schizo again
>>
>>108079005
how do these models handle the (actually a lot of) older Booru artists who never published any work at even 1 megapixel? I.e. the native resolution of their entire body of work is below that
>>
>>108079320
Emulating and combining multiple artists styles will ALWAYS be more interesting than relying on a models default look kek
>>
File: 1709947290459.png (12 KB, 715x174)
12 KB
12 KB PNG
>>108079371
More interested in picrel there but I hate Gradio and don't have patience to prepare an actual audio dataset.
>>
File: 1755387787780410.jpg (274 KB, 1179x1627)
274 KB
274 KB JPG
>pulled
>ModuleNotFoundError: No module named 'comfy_aimdo'
>>
>>108079399
update the python deps retard
>>
File: laughing oiran.jpg (57 KB, 852x480)
57 KB
57 KB JPG
>>108079399
>he didnt update requirements.txt
>>
>>108079379
It's less about wanting it to happen and more about it being inevitable. If even Google can't stop their model from being easily detectable after its gens get spammed billions of times I don't get why people expect some random finetuner will figure it out.
>>108079390
That's what model merging is for. Combining artist styles from just prompts is just a band-aid for our current era where it's still really difficult to train full models without being rich and hoarding terabytes of data.
>>
>>108079374
I would fund this
>>
>>108079393
>Setting constrained decoding max_duration to 240s based on GPU config (tier: tier5)
Well shit. In Comfy I tried to do 480s on 16GB and oom'd, if my limit is this low it isn't happy news.
>>
>>108079414
Google actually has an incentive to make their outputs homogenized so if they are brought to court they can easily say "the average person would know that's a gen because all our gens look the same".
>>108079414
>That's what model merging is for.
No, models like NoobAI or Anima are adept at combining artists styles BECAUSE they do not suffer from the same default look as something like WAI. I'm having trouble understanding what you're trying to say...
>>
>>108079371
also might try this:

https://www.reddit.com/r/StableDiffusion/comments/1qxs5qv/acestep_15_full_feature_support_for_comfyui_edit/



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.