动态网络自由门 Edition Discussion of Free and Open Source Diffusion ModelsPrev: >>107889954https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/tdrussell/diffusion-pipe>Z Image Turbohttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>Flux Kleinhttps://huggingface.co/collections/black-forest-labs/flux2>WanXhttps://github.com/Wan-Video/Wan2.2>LTX-2https://huggingface.co/Lightricks/LTX-2>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>NetaYumehttps://huggingface.co/duongve/NetaYume-Lumina-Image-2.0https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
this is the real and correct continuation of the /ldg/ general
based maintainer of thread quality
>>107892557Need to fix collage script but in the meantime this was supposed to be included
Blessed thread of frenship
alt collage
>>107892589And this >>107891027>>107891808
flex klein 4b is so good. i hope we get at least one noob tier finetune out of it.
tehe
https://www.youtube.com/watch?v=uHQZNNajxOk
In Wan2.2, is there a way to fix wan mouth flapping on anime characters? NAG isn't consistent.
>>107892614would rather a bigger model get trained since theres a bunch of copes to run them even without a lot of vram now
>>107892614>flex klein 4b is so good. i hope we get at least one noob tier finetune out of it.sd3.5m tier
thanks mods
>>107892667Is this supposed to mean something?
>>107892667Are you retarded or something?
What's the deal with manchild japanese cartoon addicts who have 0 creativity and absolutely need to use their beloved copyrighted characters and styles?
>>107892702benchod
>>107892708kurwa
>>107892613but big black is a punk band anon https://www.youtube.com/watch?v=jtPFzBLDSPk
>>107892680They will continue finetune lumina 2.0 instead.
>>107892641>>107892667you need to compare it with XL, once it's finetuned it will be great we had no good quality models in this range before 4b ~ 6b. the reason bigger models won't work is because of the cost to train them. it always end up having a bunch of bs settings to cope with the cost of hardware to create a free model. i would rather press F to dance on XL's grave than spend another year coping about that supposed bigger model finetune.
>>107892738what's experimental about that?
>>107892754it's just how i organize my gens
>>107892557Whoever posted this, thank you. Gorgeous.
>>107892831my wife
Even the klein base has terrible variance in seeds. That's why it looks so good I guess?
Yeah I suggest we all just move to /bant/ for a week or two until they get bored.
>>107892757
should i throw 100 grand at finetuning flux2 4b or z-image-turbo or wait for future z-image-copium releases etc
>>107892850
>>107892831
>>107892850Neat thanks
>>107892880The 9b model is great for removing watermarks, logos etc. I put it trough batch of 275 anime-illustrations, only had to manually edit 3 (!!!) images afterwards
>>107892980the flux vae is pretty good, but your images are still getting fucked by it, i would be cautious if you want to use them for training
How do I prompt for Z? I want new camera angles and shit but majority of the time it's just the default one or like if i use "off angle" it shifts it slightly but using prompts that are obvious like birds eye view or other stuff does nothing. I rarely truly get the actions I want through my prompts even when explaining them pretty good.
>>107892995ai slop
If I merge WAI 16 with a Chris Chan lora at low weight, will i have the same quality as NoobAI?
>>107893039maybe
>>107893010A trick is to have whichever LLM you prefer to caption an image with the angle you want and use that in your prompt.
>>107892998I didn't see any degradation, some got slight zoom effect. But it's good to keep in mind, you are right.
Just a bit of banter :]
>this other guy is so obsessed with me>proceed to post 80 posts about himyep another day another melty
If I download ggufs, do I need to grab clip/text encoder files from the main versions for best results anyway?It's my first day
>>107893126Ye
what no z image base does to a general
>>107893160there's still a lot to explore with turbo
>>107893183i mostly care about the fine tunes/loras, after the disappointment of v7 there has been nothing exciting in that segment
has anyone tried training flux2 klein loras so far? for example with this PR https://github.com/Nerogar/OneTrainer/pull/1261
>>107893055What is an easy to use free LLM? How do I even get/use them I am a brainlet at this stuff clearly.
>>107893183box?
>>107893252i use this https://github.com/comfyanon/llm-utilities
>>107893259https://files.catbox.moe/winvi8.png
>>107893272thanks
>>107892869
>>107893272preciate it anon
yes posting such images is totally normal and not schizo behavior
I propose this solution to the schizo problem: All schizos fight to the death, and the survivor then gets tortured to death
>>107893272doing the bamboozling
>>107893341proof?
its ok guys its safe to post
>>107893272>>107893305lmao kekd u got me
>>107893288You are welcome.
holy schizo nuke, based mods
what causes a person to do this for mroe than a year
>>107893393reaL?
finally, my paradise
>thread blesser anon is not one of the schizosbased and frenship pilled
>>107893411their faces were swapped with flux klein
>klein demolished in a single comparison imagejust as reminder the current year is now 2026
qwen 8b Q8 working good with klein. Any sense in looking for a fp16/bf16/whatever? How does it work with these LLM's?
>>107893412switch, top, bottom
if god were real he'd make mondo girls real
>>107893436what are you comapring here?
>>107893128Thanks
>>107893461Left: z-imageRight: Klein
>>107893469oh okay. can you post an edit comparison between the two?
>>107893393
>if god were real he'd make mondo girls real
>>107892557thx 4 bake
>>107892602ty 4 2nd fagollage
What do I have to prompt to get sharp backgrounds in ZIT? I tried describing the backgroudn as sharp, crisp, shot with high depth of field etc but get blurry backgrounds most of the time.
>>107893519it's better than setting a bush on fire, any jackass with a bic can do that
>>107893519
>>107893539use nag and put 'blurry background'
>>107893547nag?
try 8 steps with 9b klein distilled, can fix any issue with text/give more detail. it's already super fast anyways (8 steps was 15s)
>>107892602based alert
>>107893543
>>107893545NIGGA THAT'S CUTE!
>>107893562change the text at the bottom to "I'm in a heckin reddit game, rick!". the man is holding a bag of "onions chips"
>>107893562
>>107893581i dont see any onion chips, FAIL
>>107893590
>>107893459
is it just me or are 90% of the Z loras on civitai total garbage
Colorize the black and white photo
uh oh
>>107893627duhno base
>>107893633is that a kid on the right?
>>107893440
>>107893635yeah but I've tried a few test loras and the results looked at least halfway decent, the example images on most Z loras look like they were done with early XL shitmixes, you have to wrangle Z to do shit this bad
>>10789362798% of loras on civitai are shit which means z is actually doing well comparatively
>>107893412
the anime girl with white hair is wearing a white hoodie with a picture of hatsune miku on it, blue jeans, and white sneakers. keep her head the same and blindfold the same. she is holding a bottle of water instead of a tea cup. change the background to a sunny beach.
>>107893638idk much about history, i think that photo is world war 2 and not sure what the conscription age was
>>107893667kek
>>107893667klein is racist
one more
>>107893687Who is this old geezer?
>>107893668make a low polygon 3d render of the anime girl in the style of a playstation 1 game.2B version 0.1:
>>107893379
>>107893689isaac newton
>>107893689i think he invented physics or some shit
>>107893705before that people would just float?
>>107893689thats me
>>107893689it's me
>>107893668
make a low polygon 3d render of the anime girl in the style of a playstation game. keep her black blindfold the same.low poly thighs: also I think 8 steps is the way to go, generally better results and it's already fast as a model. 15s vs 10s or so.
>idk much about history, i think that photo is world war 2 and not sure what the conscription age was
>>107893689it's him>>107893715
>>107893353
>>107893725make the anime girl in a pixel art style like a nintendo game. keep her black blindfold, white hair, and dress color the same.
no Z-BaseBFL wonchinese century cancelled
>>107893760now make it an anime figure with a bikini outfit, same pose and sword and pod droid but black lace sling bikini
>>107893687>"Hmm, my senses are tingling"
>>107893725
>>107893739I liked stuff like "Remove sepia filter. Improve image with flat anime colors. Improve shading with chromatic aberration."
Why is it almost impossible to make a petite girl in z-image without going full degen? Like petite girl small breasts is fucking impossible.
>>107893804just use the lora
>>107893786
>>107893590This is the gayest shit I’ve seen here, im fucking puking rainbows.
So as someone new with edit models, whats the best practice for prompting.Say I'm just doing faceswaps. Do I reference the original picture? Is there a preference for picture order? Does it prioritize things like resolution or anything? Having success with some prompts, but issues with other, and I feel like it boils down to how some things are easier to prompt in language than the others. Thoughts?
>>107893827prompt?
>update diffusers>cant import Flux2KleinPipelinehuh?
>make her into my wife pretty please
>>107893834>>107893783Just time stuff and pull gacha
>>107893782make a plastic anime figure of the white hair anime girl, remove the black dress and change her clothing to a black bikini. keep her black blindfold, and white hair the same.
>>107893827
>>107893847very cool
>>107893837oh wait its not officially in there yet>pip install git+https://github.com/huggingface/diffusers.gitwerks
>>107893840
the man is holding a black and white polaroid picture of hatsune miku with one hand, and is pointing to it with the other.check em
>>107893847cover it in egg white
>>107893791
>>107893791>improve>anime colorsyou can only choose one, anon
>>107893876this is a job for the 2 image gen though:the man is holding a black and white polaroid picture of the girl in image 2 with one hand, and is pointing to it with the other.
do weez got 9b q8 yet fp8 is booty
>>107893909fp8 is still good, but yeshttps://huggingface.co/unsloth/FLUX.2-klein-9B-GGUF
>>107893938prompt: make it into slop
>>107893875
>>107893729oh hey what's up dude. i remember getting vacations with you like two years ago here.
>>107893996why she crang?
Want to install reforgeShould I install the latest Python or should I install the 3.7 like the github says?
>>107893964yes
replace the head of the man in image 1 with the man in image 2 in the same proportions. make the image black and white.so BFL is fine with face swaps now I guess (even though you can do it with reactor anyway)
>>107894036they're probably fine with it cause it looks like a bad photoshop
>>107894036todd:
>>107893938
>>107894048replace the head of the man wearing a blue shirt in image 1 with the man in image 2. replace the black man on the floor with a cardboard box that says "SKYRIM".
Using the not-yet-merged klein branch of stable-diffusion.cpp with Flux2 klein 4BWhen image editing, it OOMs rather quickly at Q8_0 on 12GB VRAM, so I'm limited to very low resolutions. The editing capabilities are great but the output quality tends to be poor. I assume that's down to the low resolution and being the 4B model?
>>107894062yeah the other anon wasn't kidding about bad photoshop lmao
>>107893996
>>107894070the fent man image is potato quality, I swear these fucking news orgs cant put up a single good image of it, despite how much they cried about it.regardless this is for memes not high art
>>107894087who
>>107894062
replace the head of the man in image 1 with the man in image 2.kek, drive but blade runner:
>>107894105You were not born back then when Bateman was a thing.
>>107893930
>>107894122see, high quality source yields a better output:
>>107894128i was watched dark knight in the cinemas zoomer
>>107893999good times. I started genning again like a month ago.
>>107893883
>>107893814Chadolf Hitler.
>flux-2-klein-9b-fp8>t2i>CFG 1>5s/it >single input imgedit >1.5s/itwat
>>107894067Which is the base resolution?
>>1078941544 and 6 are the best alt Lauras
>>107894160retard
klein training when?
>>107893885model / lora?
>>107894172zit no lora (loras are for faggots)
literally Miku
>>107894144Ok.
>>107894169when khoya is done adding support for qwen layered for some goddamn reasonwhat does that model even need loras or tuning for?
>>107894067I couldn't find the recommended resolution for 4B on the huggingface or their blogpost.If I don't specify a resolution when editing, it default to 512x512.
>>107894179thanks, i should have guessed a chink model would know xianxia
>>107894167please help me not be a retard
>>107894206ok
>>107894196was for >>107894162also lol Barbie anatomy (it could have just put panties there, I didn't specify otherwise.)
>>107894192>what does that model even need loras or tuning for?penus and vagooper
>>107894189
whoever you are, you told me that flux was always fine with boob, but I'm not gettin flux2dev to boob the same pictures I boobed with kleinhang your head in shame
>>107892557ACEStep 1.5 got me thinking about a possible way one could leverage a dataset of music without copying the artist's voice, and that's where music cover solutions come in handy. Does anyone know if RVC is still SOTA? Perhaps using https://github.com/Mangio621/Mangio-RVC-Forkwouldn't be too bad. I also know ACEStep can do music in a reference style, but I assume tunes or LoRAs are more flexible and lead to better quality.
>>107894256damn that's crazy
>>107893883catbox?
almost but still so far
>>107894267here you go https://files.catbox.moe/8jqkb7.png
>>107894215
I it just my shitty loras or does ZIT still have some difficultires with maintaining likeness when the character is further away? I've tried some celeb loras and the likeness is spot on portraits up to cowboy shots, but the further the character is away, the more generic the facial features seem to become. There's still some resemblance but not nearly as close as in closer views.
>>107894280cheeky
>>107894226Slopped as it may be, Qwen is most accurate to the model's face. That's why a 2 pass, one with Qwen then next with Flux is better.
>>107894308tldr retard
>>107894330nigger
>>107894336useless
>>107894345faggot
>>107894267here is real versionhttps://files.catbox.moe/uny7m8.pnghttps://files.catbox.moe/89wfqk.pnghttps://files.catbox.moe/y7hx60.jpghttps://files.catbox.moe/mglace.pnghttps://files.catbox.moe/yi8fvb.png
>>107893412dev got that... flux look
me too haha...
>>107893725>>107893760>>1078938479B or 4B?
>>107893436>a single datapoint is all you need
>>107893436damn everyone start training Z-Base NOW
>>107894368thanks, appreciate it
>>107894036>>107894048this looks so bad
>>107894424its the ad schizo making them
should I run distilled or non distilled klein for best quality?
>>107894445retard
>>107894445Distilled retard kun
>>107893814These faces are all the same but none look like the original lol
The Z Base model they have is so bad they're embarrased to share it. Otherwise they would've done so a long time ago. It's simple. That's the part about "Chinese culture" you must understand. Wonder where's DeepSeek R2? Chinese simply are not good as the West.
hm...
Is there a node that pauses the current cycle until user input? eg., I want to pause+notify after 1 video is created before it works on the 2nd, and so forth.
>>107894464>Chinese simply are not good as the West.Yet z-image turbo is still the curent goat
>>107894463hmm i wonder why
>>107894477yeap
>>107893412make the characters wear identical black and yellow colored cheerleader outfits
>>107894445>>107894393
>>107894477just generate one video at a time? then it stops till you hit generate again
>>107894532how do i do that?
>>107894532fool of atuk
>>107894506>fluxhands
>>107894551ai slop
>>107894459>>107894506thank you anons
>>107894551is this a gen or some program stuff?
>>107894563loooooool
>>107894464big finetunes are unlikely to use it either way cause it uses a far worse vae (flux 1) instead of klein's flux 2 vae which is 2x as accurate and should converge much faster
>>107894563klein with a pic from the /hr/ playboy thread and this prompt https://pastebin.com/drZ0yjyb
>>107894579I'm pretty normie>>107894592that's cool, looks like that old vector tank game or the intro to the 90s Jonny Quest
>>107894605isnt that comfy?
>>107894617no I'm still using Illustrous
>>107894624
>>107894605nice
its a little too clean but still neat