Discussion of Free and Open Source Text-to-Image/Video ModelsPrev: >>107683139https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/ostris/ai-toolkithttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/musubi-tunerhttps://github.com/kohya-ss/sd-scriptshttps://github.com/tdrussell/diffusion-pipe>Z Image Turbohttps://huggingface.co/Tongyi-MAI/Z-Image-Turbo>WanXhttps://github.com/Wan-Video/Wan2.2https://comfyanonymous.github.io/ComfyUI_examples/wan22/>NetaYumehttps://civitai.com/models/1790792?modelVersionId=2485296https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd>Chromahttps://huggingface.co/lodestones/Chroma1-Basehttps://rentry.org/mvu52t46>Illustrioushttps://rentry.org/comfyui_guide_1girlhttps://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe|https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/r/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debohttps://rentry.org/animanon
Blessed thread of frenship
>>107687569thanks for the bread anon
>>107687569Thank you for baking this thread>>107687582Thank you for blessing this thread
, anon
comfy should be dragged out on the street and shot
>>107687600i love that interior, stylish af
>>107687609yeah, I love the fruitiger aero style :Dhttps://www.youtube.com/watch?v=Cz2YCRmDOFk
Looks like Qwen v2 will be a reasoning model (so an autoregressive model?)https://xcancel.com/cherry_cc12/status/2004741644810383684#mhttps://xcancel.com/cherry_cc12/status/2004162177083846982#m
mirrors can be tricky
What happens to Z image base and LTX2? They were supposed to release by Christmas. Do we still think Z is king? How has Lora training come along. Very well, shit, or so so?
>>107687595Extremely nice 2D to 3D conversation.
>>107687646No one knows, yes still king, and very well
>>107687663Can it train on anime well or is SDXL still best for that.
>>107687646>What happens to Z image baseit's inference code PR got merged 4 days ago, for the new version of Qwen Image Edit, the PR got merged 2 weeks before they released the model, make that what you willhttps://github.com/huggingface/diffusers/pull/12857
>>107687595she's cute but the smile is a bit creepy
>>107687600>>107687609Reminds me of that game where you're stranded in another water planet and you have to build a base.
>>107687707cool gen
>>107687646Z-base is the next Pony v7. People hype it up as only 'two more weeks' away and then when it finally releases in July 2026 it will look outdated.It has been over a month since Turbo released, they stated that base was to release 'that weekend'.
>>107687691>>107687653trying to get side body view. don't remember it being a annoying get a proper side view from sdxl.
>>107687738>they stated that base was to release 'that weekend'.you forgot that they said "I guess", so it wasn't 100% sure... yeah I know I'm coping a bit but still
>>107687646more like next Christmas
>Z image base was supposed to drop on christmas>they didnt release itgoddamnit
add the character on the right in image2 over the water, in the sky.got some ART here, anons.
>>107687669It's superior with grabbing onto styles but you give up general booru tags. Best to recaption with NLP. But IMO still better despite that drawback.
>>107687764kek
>>107687764okay, now gigachad is intact.
>>107687646>They were supposed to release by Christmas.>>107687763>supposed to drop on christmasThis was a rumor and never actually stated by the devs. I know they keep saying "soon" and because of this I will kill the next chinaman I see but they did not actually say the birth of christ.
>>107687569>>Maintain Thread Quality>https://rentry.org/debo>https://rentry.org/animanonis there a reason these are in the OP? they are off topic and encourage flame wars
déja vu
>>107687669fwiw ive left XL behind entirely
>>107687783kek, and a solo act:I will make new gens, just testing 2511 edit with 2509 8 step lora, seems good.
>>107687669nobody serious is bothering until a noob fine-tune is ready
https://www.youtube.com/watch?v=8mNwEqTlRoM
>>107687827Didn't they say they'd make an anime finetune themselves? Would be good to make an nsfw finetune from that.
>>107687835no nsfw. still have to wait
the old man is holding a sign saying "LDG". Hatsune Miku is standing to his left.
>>107687801if you actually cared, you would go back to sdg
>>107687847why don't you since all you do is shit up the general with your drama?
>>107687847don't interact with the schizo, no one loves him and he feeds on (You)s
>>107687835>didn't they saythey said a lot of things since z-turbo, and delivered on none of them. what will happen is radio silence regarding this 'anime finetune' and months will pass with people assuming that they're still working on it only for it to never release. "why bother finetuning it if they're doing it for us first??"expect nothing to happen as everyone sits around waiting for a mystical finetune that isn't even being developed.
>>107687827for serious finetunes sure but loras are fantastic in the meantime
>>107687867sorry, meant for >>107687855
>>107687904loras have always been a cope and they are over it and don't have much variation on zit
Charlie Brown with hair is so uncanny lol
>>107687919mmmmokay anonie
>>107687919>loras have always been a copetruth nuke
>>107687891Yeah I can see the issue with that.
the old man is wearing a white t-shirt with Hatsune Miku on it, and is holding two green glowsticks.this one turned out good, I think the 2509 8 step lora is more consistent than the 4 step 2511 one.
remember nunchaku? me neither
bombardino crocodilo hehe
Everything less than drawing by hand is a cope btw
Everything is a cope btw
>>107687948https://github.com/nunchaku-tech/nunchaku/releases/tag/v1.1.0it has implemented Z-image turbo recently but the quality is so ass, I guess small models don't like quantizations, like LLMs>>107687956if AI was cope and irrelevant the artists wouldn't piss and shit themselves over it
>loras have always been a copesaid by someone whos never trained
>>107687968is nunchaku for vramlets mainly? never used it myself
>activate 2 character loras>they bleed into a mess of blended features>prompt 2 characters the model knows>they work perfectly fineloras are outdated copium, technology that hasn't improved since 2019.
>>107687976it's a 4bit quant, but better than Q4 so it's definitely for vramlets who want a bit of quality
Issue of the skill desu
>>107687999another truth nuke, nothing will replace having the style/character directly trained in the model
>>107688005this 100%
>>1076880183dpd brownoid boomersloppers don't understand this because they settle for plastic crap. realism is literally a single style and they STILL can't get that right
Has anyone made a Danbooru tag LoRa for Z Image yet?
>>107688005anon is often apt to blame the free models as if theyd pay for it at all. very sad
>loras are just as good as finetunes!>loras work on distilled models (flux dev, flux 2)EPIC LIFEHACK DISCOVERED! why doesn't lodestone and noob just train loras instead? they're just as good!
Who is anon quoting? No one said that ITT
>ZiT loras are better than XL >WHAAAAATT??? LORAS ARE NOT AS GOOD AS FINETUNES HURRDURRholy retardation
>>107687595>>107688029remind me of the good old days of 2022 when we could only post close up of 1girl because that's the only thing SD1.5 could do correctly kek
where base
>>107688087>where baseStill in China obviously :(
>>107688158pretty cool
>>107665458>>107679533>>107679033I decided to use this to test some different settings. I used a deepthroat LoRA with the wan 2.2 FLF template.son = sage attention onsoff = sage attention offlon = lightning LoRA onloff = lightning LoRA offEverything else stayed the same. I changed the steps/cfg to the comfy recommended settings when changing the lightning LoRA.The gen time is at the end of the filename. You can see the difference in quality. The lightning LoRA looks like it gives a smoother zoom in effect on the background, but turning it off makes the foreground action a lot more intense.There's no telling if these differences will stay consistent across various seeds. I used to do thousands of gens of different SD settings back in the day and when you think you see a pattern, it could be totally a byproduct of that particular seed. More testing is needed.I will say that my anecdotal experience is that characters rotating/manipulating objects is much worse with the lightning LoRA on. It is much more likely to just morph the object around until it gets to a stable position.
>>107688088this but unironically
>>107688165>I will say that my anecdotal experience is that characters rotating/manipulating objects is much worse with the lightning LoRA on. It is much more likely to just morph the object around until it gets to a stable position.obviously, distillation hurts the model's quality and those lora apply distillation to make it faster, they improved a lot though since the SDXL turbo days, there's even better techniques than lora that are begging to be tried, we'll see how they fare in the future
>30 days since z-turbo was releasednot looking good, chinakeks
>>107688165KEK. Thank you for the breakdown, I personally like the one with sage on and the lightning lora on to be honest. Appreciate you!
>>107688194I think they know the eventual goontunes that will be made will collapse society as we know it. That's a heavy burden.
>>107688088
they started training the hentai finetune and realized how much money they could make with saas
>>107688233I won't mind if they keep the finetune API, we just need the base model, we can figure out the rest by ourselves
>>107688198That's the worst one, imho. He doesn't even turn the gun around, it just morphs to a different position. From what I can tell, everything off yields the best results, as expected.
>>107688194Only 30? Feels like it's been twelve months
>>107688278ai makes lesbians look hot again
the anime girl in image2 is outside the door in image1. keep the man's expression and face the same.
they're training more cunny which is the reason for the delay
>>107688394Have you had any luck replacing race of someone? e.g. westoid to jap and vice versa
>>107688403Sorry only Bane, Miku, Drive, and Floyd
>>107688394the anime girl in image2 has her arm around the man in image1. keep the man's expression and face the same.aww.
>>107688452>gooseling as emotionless as everqwen is so fucking accurate desu
>>107688403oh, absolutely.netflix rush hour. "make the asian man black."
>>107688467it just changed the skin color, he still has the face of an asian man
>>107687595model?
>>107688480ok here is an example with gosling:
>>107688491holy shit it would never change race and face for me. guess i downloaded the cucked qwen. 2509 or 2511?
>>107688491chinese rice farmer:>>1076885022511, but with the 8 step 2509 lightning lora, works well imo
>>107688165kek
>>107686065Download NunifRun install.batRun update.bathttps://github.com/Westlake-AGI-Lab/Distill-Any-Depthscroll to pretrained models, download the 97mb one and the largest one and shove them into the nunif-windows\nunif\iw3\pretrained_models\hub\checkpointsLoad up iw3-gui.bat (takes ages to load btw, so just wait like 5 minutes for the window to appear after the cmd box opens and closes)3D strength 1.5 to 4.0 (higher is better 3D depth but causes more artifacts around edges of foreground to background objects which looks like shit, I usually keep on 2)Convergence 0.5Depth Model (Distill any L (slower better quality) or Distill any S (worse quality but way fucking faster))Edge Fix 2Full TB = More horizontal resolutionF/Full SBS = More vertical resolution (though a 4k file becomes 8k, so if you want to lose half resolution and keep 4K (so 2k 3D) do Half SBS or Half TB.If you have a Nvidia Graphics Card newer than a 1080ti series, then do fp16.Depth Batch size = Depends on graphics card (chatgpt it)Worker threads = Depends on graphic card (chagpt it)Those 2 settings basically will be how fast your graphics card processes the image/video.There, you can convert movies/videos and images to 3D for VR or a 3D TV.
>>107688572not even close, QiE is so bad at keeping the original style
>>107688579I didn't prompt to keep the style the same I just said miku. you can keep pixel art style if you prompt for it for example.
>>107688565Oh for images as well, you can definitely do higher 3D strength and play around with foreground scale. Read the readme to see what they dohttps://github.com/nagadomi/nunif/blob/master/iw3/README.md
>>107688607do it for this >>107688572
Wish I could get face swap to work (some python shit is fucking up nodes) so I don't have to deal with every anime-real model having full asian faces.
>>107688572Shit why did it go for XII, it was so close lol
bf16 vs fp32 lora, is there a major difference or not so much?
https://www.youtube.com/watch?v=s4FnAOg6N5c
did the mods kill tranfag got yet or do I have to wait?*edit: schizo rentries are still there so I take that as a no. cya*
I dont watch this show but apparently some guy said he was gay.
>>107688761also bf16 qwen edit 2511 lora seems good, they say you can do 8 steps instead of 4, outputs are good:
can wan 2.2 do cartoon inbetweening well?
Should've put more trust into Newbie but nooooooo we HAVE to wait in this magical Z Image Base that doesn't exist, wait for this censored Anime finetune of the non-existent Z Image Base, and wait for someone to (actually) uncensor it without adding in their own schizo rules like ... artist clustering. Remember that shit?
>>107688825>artist clusteringthat was some insane idea indeed
>>107688746he's going to kill the threads if this keeps up
>>107689053the bowl...
Seems like multiple people in this thread are having mother issues.
the anime girl is wearing a white crop top and denim shorts, and white adidas sneakers.
>>107689086Hair and hands are fucked.Some more steps would be nice.
change the text from "IT'S TIME" to "OH SHIT". The man is pointing a gun at his head.actually more funny given it's not him holding the gun desu
give the girl wearing green on the left a green bikini.not really necessary for stellar blade but still a valid test case.
>>107689195remove all clothesctrl enter
>>107689240hey that's good and it fixed the awful baked shadows on her tummy
>>107689251using 2511 qwen edit, but 2511 lightning lora at 8 steps, the huggingface discussion said it works well at 8 too (devs were replying)
the girls are wearing a red bikini and red santa hat. there are christmas presents on the floor in front of them.pretty good, 8 steps is the way I think, 4 is fine but 8 is better detail/results
>the same meme on repeat for an entire yeari hate this place so much
>>107689274yea 8 steps is good. with 4 its too easy to notice background and depth of field looking undersampled
>>107689274business suits with a short skirt and black heels:
>>107688002thank you for making a comparison using qwen image between bf16, Q8_0 GGUF, Q4_K_M GGUF, and Nunchaku FP4 quantsif it's not too much effort i'd be interested in knowing more stats like cosine similarity and how similar certain tensors are between quantsnunchaku looks better than i expected for a 4 bit quant. this is also a great reminder that Q8_0 is good enough
I keep seeing people using the full qwen edit model, which is like 40gb.Isn't that overkill? Can't I just use a Q6 quant instead?Also I've been generating vids of girls twerking and getting facial blasted for 4 days.
>>107689292I have 16GB and I use Q8 which is like 20GB, even if you cant load the main model fully into memory it still works, it will just load some into RAM.
replace the text "Cyberpunk" with "Cyber Miku". Replace the man with the pistol with Hatsune Miku holding a green leek vegetable instead of a pistol.not bad
>>107689292>I keep seeing people using the full qwen edit model, which is like 40gb.>Isn't that overkill? Can't I just use a Q6 quant instead?Q8_0 is 99.97% similar to fp16/bf16 so yeah it's good enough, but technically not perfect. Q6_0 hurts a lot more and if you can run the Q6 you can probably run the Q8