[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Euler Normal Edition

Discussion of Free and Open Source Diffusion Models

Prev: >>107750643

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>107754310
Thanks for the bread anon.
>>
Blessed thread of frenship
>>
>Should have bought a 5090.
>Could have used that extra 8gb.
>Damn me.
you've been warned long enough, there's nothing more important than VRAM
>>
>>107754353
I didn't know i needed them until last Thursday.
>>
File: 1748765932664669.jpg (4 KB, 160x314)
4 KB
4 KB JPG
>finally the wan video is close to what I wanted
>notice the hand is slopped
we're still doing this in year of our lord 2026
>>
File: zimg_00245.png (1.74 MB, 1920x1280)
1.74 MB
1.74 MB PNG
>>107754359
yeah it's mindblowing i can do this from my couch in about a minute.a year from now is going to be insane.

here's an example of a hand detail to make sure the cat palm tattoo is legible.
>>
File: 1746704628415455.png (1.62 MB, 1024x1024)
1.62 MB
1.62 MB PNG
>>107754388
Z-video turbo will save us
>>
File: 00127-3702348661.png (1.85 MB, 1288x976)
1.85 MB
1.85 MB PNG
>>
>training lora of girl i knew in highschool
>had a huge crush on her cute face, large breasts, and gigantic ass (she walked on her toes often but not in an autistic way so her ass was fat)
>she once consoled me after i broke up with my gf at the time
>visited her house once, got drunk and played vidya
>she looks at me and goes "soooo anon... are you going to make a move? ;)"
>spend the rest of my life jerking off to her
>a few years out of highscool we chat a bit
>she says "i miss you so much, anon. we were such close friends i miss that"
>now shes married
>probably already pregnant by some tan skinned guy from a non burger country

>look at training samples
>she looks so cute
>so fucking cute
i fucking miss her bros how did i miss that opportunity so hard
2k steps in and i dont think its collapsed yet
>>
File: 1762130020865433.png (18 KB, 493x236)
18 KB
18 KB PNG
shift before of after loading lora?
>>
File: :D_v2.webm (3.89 MB, 2048x1152)
3.89 MB
3.89 MB WEBM
previous outcome had choppy segments. no more
>>
>>107754390
That's z-image right?
>>
I think he blacked out
We can have a few hours of peace
>>
>>107754425
yes, as you can see on the filename
>>
Do you guys think Culc got cornholed by Jacko? I've always thought he was chemically castrated.
>>
>>107754411
it doesn't matter, there's no such things as orders on ComfyUi
>>
>>107754423
It makes a decent 4 seconds. slow mo not my thing.
But man, teach us...
>>107754429
4chanx, can't see them. Don't ask.
>>
File: zimg_00251.png (1.48 MB, 1920x1280)
1.48 MB
1.48 MB PNG
>>107754425
yes

>>107754411
which one looks better to you?
(shift -> lora)|(lora -> shift)
>>
>>107754310
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
is this why threads can't get to bump limit anymore?
>>
>>107754427
>I think he blacked out
Looks like no based on post above
>>
>>107754452
>4chanx, can't see them.
I also use 4chanX and I can see the filenames, lol
>>
>>107754472
he meant regarding the shitposters as it filters them out
>>
>>107754402
What kind of move? Did you play chess with her?
>>
>>107754462
there are two older threads that haven't been filled yet. I really hate how retarded /ldg/ posters are nowadays. actual fucking children
>>
>>107754488
I play cheese with mine.
>>
File: 00178-771365752.jpg (333 KB, 1344x1728)
333 KB
333 KB JPG
I absolutely hate Comfy it's not even funny. Really, I tried using it for a few weeks and it's a PITFA.
Worst issue I had, after I reluctantly coped with having to push the whole screen around all the time and move fucking boxes out of my way and having to right click the image to open it in full screen where I have to use another browser tab and/or having to zoom in and out all the time and having to install another fucking node just to save the image as jpeg and that the save image node doesn't have an option to change the filetype, that I fucking can't manage to have a Hires-Fix like functionality or something like the img2img tab for sending images to another workflow copying the exact settings to have inpainting or additional upscale.
To upscale an image I always have to push the generation through a second KSampler, even if I didn't like the first low res result. I simply can not understand how Comfy folks work like that and why no one in the whole cloud of the Comfy ecosystem ever came up with a conditional path that ONLY runs when you click a conditional on-off button while it cached the previous result and additionally reuses the same seed from the first render.
So yeah anyhow, I installed Forge-Neo and it was like getting back into my own comfy bed after a long second class train ride entangled in cables. Only thing I miss is the ability to install new samplers since everyfuckingthing is made as a fucking comfy node and not a library that one can just import into any UI of their choice.
>>
File: elf-dragon-spell-2.jpg (1.33 MB, 1560x2008)
1.33 MB
1.33 MB JPG
>>107754353
I thought about it but even having the money I couldn't bring myself to buy a 5090 when I already have a 4090.
>>
File: 00063_.png (2.45 MB, 1152x1440)
2.45 MB
2.45 MB PNG
feelin fab
>>
>>107754469
Could be the legacy schizo, he has something to gain. He's less aggressive than the one in the second link.
They are going to seethe and not pretend they are getting flushed by the mods again.
>>
>>107754519
Is this Z Turbo?
>>
File: elf-dragon-spell.jpg (1.22 MB, 1464x2144)
1.22 MB
1.22 MB JPG
>>107754530
Nah I'm still on pixelwave flux
>>
>>107754514
>Only thing I miss is the ability to install new samplers since everyfuckingthing is made as a fucking comfy node and not a library that one can just import into any UI of their choice.
poothon issue and comfyorg doesn't share. you only get paid or get equity if you only make shit for them. they never reward or support anything that comfyui uses. it's actual open source poison
>>
>>107754528
ranfaggot is the oldest schizo but always hits the thread with fresh schizo energy. how does she do it?
>>
>>107754536
The microsoft of gen AI.
>>
>>107754533
Have you checked out ZiT training or anything? I feel like you'd be able to push out some gigakino in addition to all your pixelwave gens
>>
File: 1738112807257994.png (2.4 MB, 1088x1376)
2.4 MB
2.4 MB PNG
>>
>>107754519
8Gb man... Just look at that cat...>>107754423
>>
>>107754550
Obsessing over him ruined your life.
>>
>>107754567
That is definitely a cute cat
>>107754558
I need to take a look at the newer models. The last one I tried was chroma which although it had some promise and I made a few pictures I liked with it, was too slow and inconsistent to be worth using unless you are wanting to something super raunchy.
>>
>>107754589
It's terrifying to see he spends all day doing this.
>>
>>107754533
>pixelwave flux
Wouldn't ZIT be faster on a 4090?
>>
>>107754599
IMO you'd really like ZiT. Fast, light(er), and if you're only doing up to softcore it's local SOTA. Chroma still wins with vagene and bob though.
Despite being distilled it takes to styles incredibly well.
>>
File: 1764887666415533.png (2.38 MB, 1440x1080)
2.38 MB
2.38 MB PNG
left is Qwen Image 2511 and right is Z-image turbo, Qwen Image improved on realism but it's still pretty slopped imo
>>
>>107754599
Z will learn your style easily, can recommend
>>
>>107754616
your obsession? you care deeply about what he thinks about you. can you elaborate?
>>
>>107754639
i actually prefer left even though i believe qwen is a total bloated meme left looks less ai
>>
Where could i post my animated cunnies?
Such a wast of shared friendship.
>>
>>107754665
/b/ or /trash/. /b/ is kind of insufferable if you hate avifags
>>
>>107754665
>>>/ic/catalog
>>
>>107754665
catbox link here
>>
>>107754652
Left is gayer.
>>
>>107754681
both are gay but left is authentically and tastefully gay while right looks like a faggot zoomer wannabe metrosexual
also the soft shadows on right look ugly compared to the hard realistic shadows of left
>>
File: 1748503038140680.png (2.97 MB, 1440x1120)
2.97 MB
2.97 MB PNG
24GB VRAM is all you need... anything more than that is just encouraging bloat. I DESERVE local Sora running on a single consumer GPU and will settle for NOTHING LESS.
>>
>>107754689
So it's just the pose right? They're technically the same.
>>107754675
>>>/ic/
They're realistic cunnies.
>>
File: 00170-1320704819.png (1.76 MB, 1448x728)
1.76 MB
1.76 MB PNG
>>
File: zimg_00268.png (1.46 MB, 960x1280)
1.46 MB
1.46 MB PNG
kinda cool you can use photo loras for paint styles
>>
fug vram i want speed and efficiency
>>
>>107754722
It's nice, is it French?
>>
>>107754713
the glasses, hair, the way he holds the cup, pose yes, and again the overall look of the image. right is too soft and ai
>They're technically the same.
no. i dont even know why i care to reply i dont even like or use qwen
>>
>>107754353
nah, i'd rather have a 5070ti with 128gb of ram than a 5090 with 64gb
>>
Here goes another ban, but no mater, i love you guys.
https://files.catbox.moe/yzzi5k.mp4
>>
File: 1759428505525575.png (2.21 MB, 1440x1080)
2.21 MB
2.21 MB PNG
>>107754639
I really don't like that ultra HDR effect on Qwen desu
>>
Speaking of which, no one send his 4090 to china to expand it to 48gb?
For those who don't know it IS really a thing.
>>
File: zimg_00272.png (1.58 MB, 960x1280)
1.58 MB
1.58 MB PNG
>>107754730
is what french? the elf? no? she's elvish???
>>
>>107754749
Not enough background noise.
>>
>>107754310
>32gb ram
>24gb vram
>still swapdisk assraped using qwen
ogre, can't have any fun in this gay world
>>
>>107754757
yeah you can buy one for 3k
https://www.c2-computer.com/products/new-parallel-nvidia-rtx-4090d-48gb-gddr6-256-bit-gpu-blower-edition
>>
File: zimg_00276.png (1.31 MB, 960x1280)
1.31 MB
1.31 MB PNG
>>107754757
this sounds like an incredible way to get scammed
>>
>>107754803
And get chicom stuxnet firmware installed.
>>
>still baseless
>still mogged by saas
it's over
>>
>>107754795b
>Blower-edition
I already like it.
>>107754803
I guess you have to user money you can afford to lose.
>>
>>107754816
how do we know they don't already do that to everything produced there?
>>
>>107754845
what do saasbros do when they want to gen tiddies and vagooper
>>
>>107754848
http://genius.cat-v.org/ken-thompson/texts/trusting-trust/
>>
>>107754402
this is not gonna be healthy for society at large is it
>>
File: zimg_00279.png (2.84 MB, 2048x1536)
2.84 MB
2.84 MB PNG
>>107754866
spend hours spamming increasingly more dubious prompts until they get a single hint of a nipple and are immediately permabanned with no refund
>>
>>107754699
You people recommend RTX 5070ti 16gb over RTX 3090 24gb

Fuck You all
>>
>>107754896
lmao
>>
>>107754906
I stand by my rec and 24GB is not enough so he's wrong.
>>
File: TRUTH NUKE.gif (367 KB, 400x293)
367 KB
367 KB GIF
>>107754896
>>
File: Wan16FPSG_00117.mp4 (1.14 MB, 624x800)
1.14 MB
1.14 MB MP4
>>107754353
i upgraded from a 3090 to a 5090 and Wan gens are literally 4x faster and you never need to block swap even at 1 megapixel
i'd have done this sooner if i knew. most people were like "yeah it's a bit faster" but i had no idea HOW MUCH faster it actually is.
>>
>>107754896
They don't ban anyone for nipples.
The autistic humiliation ritual of working around filters is true though when you SAASfag.
>>
File: valkyrie-bikini-armor-2.jpg (1.35 MB, 1464x2144)
1.35 MB
1.35 MB JPG
>>107754640
>>107754626
I'll give it a try this evening, maybe. Since looking up a workflow online it looks not complicated to set up in ComfyUI.
>>107754625
Definitely probably faster, though pixelwave is notably faster than chroma for whatever reason for me, even with same quantization.
>>107754896
I recall when OpenAI's image gen was new I tried it out making mermaid pictures and it kept making them randomly nude even when I explicitly told it otherwise.
>>
File: 53.jpg (647 KB, 2048x2048)
647 KB
647 KB JPG
Damn that z image turbo model is cool af, will play with it a bit
>>
File: 00111-1589695465.jpg (313 KB, 1344x1728)
313 KB
313 KB JPG
How are the Chinks so good at this AI stuff
>>
>>107755030
They dont have anti ai tech trannies. Not joking.
>>
File: mental illness.png (345 KB, 609x1532)
345 KB
345 KB PNG
>>107755030
>How are the Chinks so good at this AI stuff
they aren't cucked like the western dogs so they can fully focus on making their model good and not virtue signal with a cucked model
https://huggingface.co/black-forest-labs/FLUX.2-dev#risks
>>
>>107755030
Solo promoting is not impressive anymore. Show me a man who's limbs are strapped to all four corners of a desk partially submerged in a pool while a woman 3 times his size cleans his face with a sponge
>>
>>107754906
I've recommended getting a video card with as much VRAM as possible. There are idiots/trolls that say VRAM didn't matter and speed/bandwidth is more important, ignoring the fact that not being able to gen is a showstopper compared to slightly slower speeds.
>>
>>107755056
>>107755060
Yeah it's really bad with the regulations and BS in the EU. On top of that exploding energy prices while China will just build another coal or nuclear plant.
>>
>>107755071
also ignoring the fact that the 3090 has higher bandwith
>>
File: z-image-fp_00001_.jpg (2.71 MB, 2048x1264)
2.71 MB
2.71 MB JPG
>>
>>107754748
wait that looks like VAM, what a blast from the past, I wonder if they even are still doing their "new engine"
>>
American kids play games, Chinese kids study math.
>>
>>107755144
life is not worth it without some skibdy rizz 67 aura farming
>>
File: 00121-3967751756.jpg (639 KB, 1728x1344)
639 KB
639 KB JPG
>>107755069
You've got a weird fetish, man.
>>
>>107755136
Vam 2.0? Yea the dev work on it but drags on and on.
I also would like to see the last HDRP in good use.
As for my stuff, it looks really good out of wan2.2,
don't know how it compares next to virtamate2.0
>>
File: 1749794947765434.png (1.22 MB, 1217x1636)
1.22 MB
1.22 MB PNG
>>107754757
there are 32GB 5080 too now
https://xcancel.com/unikoshardware/status/2004527606818120006
>>
>>107755168
>Vam 2.0? Yea the dev work on it but drags on and on.
I don't think we'll see something proper in this lifetime, kind of sad

>don't know how it compares next to virtamate2.0
from what I get, vam2.0 is mainly a way to have way better performance compared to 1.0
>>
File: 00246-3954599878.png (2.03 MB, 1448x728)
2.03 MB
2.03 MB PNG
>>
How2prompt shortshack on Z without it turning her into an old hag?
>>
>>107754353
Compute is also important, some of the vram issues can be compensated with ram, but nothing can compensate for a slow chip.

>>107754987
I have a 5090 + 3090 server, and yes the 5090 is way faster, mainly because the 3090 cannot do fp8 (and it's also a slower card in general), NOT because of vram issues.
VRAM wise, you can test easily by forcing block swap with the 5090, for example 20 vs 0, and you'll see only a marginal speed improvement, not x4.
The 3090 is still great for a lot of things because of its vram size, you just need to know its limitations.
>>
>>107755121
did you get the face detail working anon?
>>
>>107755000
Check'd. Looking forward to it.
>>
>>107755082
it's not even really regulation for europe or the us, it's also self censorship and people paying too much important on safety culture
researchers in China fundamentally don't give a shit nor waste time about safetyism
>>
File: 1743011814069149.png (431 KB, 800x582)
431 KB
431 KB PNG
>>107755245
>researchers in China fundamentally don't give a shit nor waste time about safetyism
as it should
>>
>>107755245
>China fundamentally don't give a shit nor waste time about safetyism

Ask a communist model about what happened in Tien An Men square. They got their own no go areas.
>>
File: z-image_01782_.png (3.8 MB, 2048x1264)
3.8 MB
3.8 MB PNG
>>107755238
yeah and i did it wouthout the pozzed impact pact
>>
>>107755267
Can't, too busy asking it what happened in Naruto's bedroom during the night he learned the shadow clone jutsu
>>
>>107755267
Just like everywhere. You protest against the government, you get shot.
>>
File: 1738406931643664.png (247 KB, 3246x1056)
247 KB
247 KB PNG
>>107755267
lmao
>>
File: 00002-1562824526.jpg (386 KB, 1344x1728)
386 KB
386 KB JPG
>>107755267
Since most chink models are open weights, you can just run a finetune.
The European and American self-censorship cripples their models.
>>
>>107755256
in general yeah, wish western ones stopped being obsessed about that shit so much, to the detriment of their models

>>107755267
oh I didn't mean they were perfect, they have to not go too far and offend the old cadavers of their Central Party
as for tiennammen square, if you use a local unfiltered version of their LLM, you get perfectly normal results, the only parts censored are their chatgpt like websites
>>
>>107755267
I'll take that censorship over "nipples are the devil" sanctimonious western censorship desu
>>
File: retard baker.jpg (93 KB, 967x499)
93 KB
93 KB JPG
Stop jumping the gun on new threads you fucking imbecile
>>
>>107755267
Yeah but this has no effect whatsoever on waifu gens.
>>
File: kimi.jpg (25 KB, 1266x268)
25 KB
25 KB JPG
>>107755267
>>
>>107755316
it was full, posts were deleted
>>
>>107755324
GEG
>>
>>107755322
>Yeah but this has no effect whatsoever on waifu gens.
this, it's that simple, I don't give a fuck about what happened in ching chong city in 1989, if the model can do good 1girls that's all that matter
>>
>>107755337
Oh, my mistake
>>
File: gen___00003.mp4 (3.83 MB, 560x720)
3.83 MB
3.83 MB MP4
testing Wan SVI 2.0 Pro with high speed action footage. it did fuck up the middle seam but i don't blame it, quite cool regardless that you can make a 20 sec video like this
>>
>>
File: glm 4.7 tian an men.png (158 KB, 1130x633)
158 KB
158 KB PNG
>>107755294
Unlike westerners who completely pozz their models and need extensive finetuning or abliteration, these are just annoying system prompts.
The models themselves are fine.
If you run them locally or with an API endpoint that lets you change sys prompt, it will act normal.
>>
I expect the Chinks will stop releasing open weights as soon as they have a solid monopoly on generative AI.
>>
>>107755298
Did you inpaint her hands?
>>
>>107754889
I can create the perfect woman.
>>
>>107755380
yep, though deepseek 3.2 looks to have been brainwashed for the events even in api, if I ask in vague terms it doesn't know
didn't test with direct question, it probably would work
>>
File: img_00260_.jpg (319 KB, 843x1264)
319 KB
319 KB JPG
>>
File: 00017-1077081454.jpg (410 KB, 1344x1728)
410 KB
410 KB JPG
>>107755393
No that's just vanilla ZIT with random wildcard prompt.
>>
>>107755401
Weirdly enough it works with the speciale variant.
>>
File: 00019-1077081454.jpg (318 KB, 1344x1728)
318 KB
318 KB JPG
>>107755437
These style loras give the skin textures quite a hit tho.
Same prompt without lora
>>
>>107755463
Because these loras are overcooked. The developers of the lora trainers are also to blame as they don't offer more granular training options.
>>
>>107755450
interesting, probably rng from different finetunes
>>
File: 1girl.webm (554 KB, 480x832)
554 KB
554 KB WEBM
>>
>>107754402
>she looks at me and goes "soooo anon... are you going to make a move? ;)"
what happened?

i have an office crush that i only have one public pic of, it's low res, i need to somehow get a high res current photo of her so i can study her hypothetical situations
>>
Those girls look fairly young...
>>
File: file.png (1.31 MB, 1247x1090)
1.31 MB
1.31 MB PNG
>>107755437
that grid man
>>
File: img_00268_.jpg (428 KB, 1032x1376)
428 KB
428 KB JPG
>>
>>107755502
But they're not real anon
>>
>>107755505
it's ssss level
>>
File: 00004-261557875.jpg (370 KB, 1344x1728)
370 KB
370 KB JPG
>>107755481
Yeah the creator of that lora said I should use some specific sampler but I am on Forge so.
This is Jib realistic image lora, completely different lora but it creates pattern again without changing the overall outcome of the image.
>>107755505
Yeah quite obvious on this one. Lora was at 0.6 maybe would help when one doesn't apply the lora on the last steps.
>>
File: zimg_00311.png (1.27 MB, 864x1280)
1.27 MB
1.27 MB PNG
what's crazy is i am using default setting in ai toolkit and having no problems with loras at all, what the heck are you guys doing
>>
File: zit.jpg (1.66 MB, 1344x1728)
1.66 MB
1.66 MB JPG
>>107755463
Can your image generator add specks?
>>
File: 00006-261557875.jpg (344 KB, 1344x1728)
344 KB
344 KB JPG
>>107755539
Zhang scheduler was it. Not sampler. Bit that doesn't exist for Forge either.
Remove the loras and the pattern is gone.
>>
File: 1406270126.png (1.04 MB, 1024x1024)
1.04 MB
1.04 MB PNG
>>
>>107755552
Default OT config works great as well. Probably poor dataset idk.
>>
>>107755552
using shitty images from social media
are you training on 1024?
>>
File: zimg_00323.png (2.16 MB, 864x1280)
2.16 MB
2.16 MB PNG
>>
File: 4209557127.png (2.17 MB, 1536x1536)
2.17 MB
2.17 MB PNG
>>
>>107755552
the only problem i have with ai toolkit is that i oom with my 8gb card and cant train at all lol
>>
File: 00012-315177966.jpg (279 KB, 1344x1728)
279 KB
279 KB JPG
>>107755556
I have a film grain upscaler... but it seems to get killed by ultraflux vae
>>
>>107755552
You have no problems because you're training generic 1girl loras.
>>
>>107755625
nice
>>
>>107755552
Which lora are you using. What's the weight?
>>
>>107755630
love that lamp
>>
>>107755633
hand look wrong for some reason
>>
>>107755631
You can always rent a runpod to train loras.
>>
>>107755633
>>107755651
And feet too big, they should be more dainty.
>>
File: 00014-1666714639.jpg (293 KB, 1344x1728)
293 KB
293 KB JPG
>>107755651
Toe looks even wronger
>>
File: zimg_00326.png (2.3 MB, 864x1280)
2.3 MB
2.3 MB PNG
>>107755623
512 or 640 but yeah your dataset sucks

>>107755640
i also trained this very lora my man

>>107755648
i'm running psxlino at .8, the other is powershot s40 lora i made at .75 that isn't on civit

>>107755645
ty
>>
>>107755633
>4 toes on on her left foot
>6 toes on her right foot
>>
>>107755676
most humans have 10 toes, so this is fine
>>
File: 00016-3903040289.jpg (219 KB, 1344x1728)
219 KB
219 KB JPG
>>107755676
I think upscale pass messes it up sometimes because first pass looked ok. Last gen has two shoes suddenly. Maybe 0.5 denoise is too much.
>>
upscaling pass isnt worth it when cnet is borked
>>
>>107755659
what happens with the pictures that i upload for training? can i be sure they are not saved? think i'll just buy a new card desu
>>
File: 00018-1911996609.jpg (306 KB, 1344x1728)
306 KB
306 KB JPG
>>107755715
I mostly use Hires fix in Forge because I want to decide after first pass whether I bother an upscale.
>>
>>107755675
is there a particular way you downscale your datasets?
>>
File: 00022-2555801503.jpg (230 KB, 1344x1728)
230 KB
230 KB JPG
It has more issues with feet than with hands when they are not in focus.
>>
>>107755743
My issue with those is that they look brand new + immaculate +
It's like an AI signature at this point.
>>
>>107755796
What sampler / scheduler?
>>
File: 1740034783718460.png (2.27 MB, 1152x1472)
2.27 MB
2.27 MB PNG
>>
File: 00065-257776404.jpg (170 KB, 1024x1024)
170 KB
170 KB JPG
>>107755806
That's just the prompt
>>
File: zimg_00344.png (1.69 MB, 864x1280)
1.69 MB
1.69 MB PNG
>>107755747
nah just make sure when you look at the images they are all crisp and without any artifacts or oversharpening. have a good variety of shots too, never upscale images and crop them square.
>>
>>107755815
Euler a beta, simple or normal for these, but I swap often.
>>
>>107755819
Yes and no. You can add details and still looks "brand new".
I mean if you go for realistic, then, you have to take this into account.
>>
>>107755743
nice. that's zit? it's in forge?
>>
File: 0022-23228276_ayakon.png (2.47 MB, 1632x2208)
2.47 MB
2.47 MB PNG
>>
File: 00037-540681341.jpg (144 KB, 1024x1024)
144 KB
144 KB JPG
>>107755836
My prompts are lazy. But especially with models like ZIT and maybe Qwen too you can do a lot with novel prompting details.
>>
>>107755815
>>107755831
use euler, not euler ancestral for realism
>>
>>107755841
Yeah Forge Neo. lllyasviel pretty much seems to have stopped developing Forge original
>>
>>107755855
Try that again but ditch the blur effect?
>>
>>107755743
could you catbox one of yours with hires fix? i was pretty unhappy with realistic skin textures i was getting but looks like you're doing something i was too dumb to figure out
>>
File: xyz_grid-0005-4117569869.jpg (3.45 MB, 2576x2959)
3.45 MB
3.45 MB JPG
>>107755870
Does it change much? I don't know.
>>
File: 1751150190841493.png (2.18 MB, 1152x1472)
2.18 MB
2.18 MB PNG
>>
>>107755889
It's no blur effect just not upscaled. They are very early steps with z image.
>>
>>107755853
niceu
>>
File: 00026-3804632803.jpg (271 KB, 1344x1728)
271 KB
271 KB JPG
>>107755896
I really don't know what I did to make it ok just some wildcards thrown together.

>18 year old girl with futuristic makeup and with Latina brown bun hair, wearing Hooded jumpsuit and metallic fingerless gloves as a futuristic queen.
she is Lying on back, arms behind head with a gun, and her face stifled laugh. She wears glossy nailpolish.
the scene is Quantum teleportation malfunctions, splitting individuals into parallel versions across alternate realities..
a professional artistic photoshot. award winning quality.
>Steps: 10, Sampler: Euler a, Schedule type: Beta, CFG scale: 1, Shift: 8, Seed: 3804632803, Size: 896x1152, Model hash: 8e9df7926d, Model: lexivisionII_lexivisionZBeta2, Denoising strength: 0.4, Original Size: 896x1152, >Wildcard prompt: "a 18 year old girl with futuristic makeup and with __haircolors__ __hairstyles__ hair, wearing __clothing_female_futuristic__ as a futuristic queen.\nshe is __pose__ with a gun, and her face __expressionsH__. She wears glossy nailpolish.\nthe scene is __scenario_scifi__.\na professional artistic photoshot. award winning quality.\n", Hires Module 1: Use same choices, Hires CFG Scale: 1, Hires schedule type: Beta, Hires upscale: 1.5, Hires steps: 6, Hires upscaler: 4x-UltraSharp, Beta schedule alpha: 0.6, Beta schedule beta: 0.6, Version: neo, Module 1: Qwen3-4B-abliterated-q8_0, Module 2: ultraflux
>>
File: zimg_00351.png (1.78 MB, 864x1280)
1.78 MB
1.78 MB PNG
>>
sloppa hours
>>
>>107755935
Why don't you entertain us with some non slop?
>>
File: 1667629178207497.png (95 KB, 300x300)
95 KB
95 KB PNG
>>107755906
Teach us.
>>
>107755935
peanut gallery hour
>>
File: img_00336_.jpg (483 KB, 1408x2064)
483 KB
483 KB JPG
>>
>>107755935
You must be fun at parties.
In fact we're having one, and you're not invited.
>>
File: 00034-2311538816.jpg (341 KB, 1344x1728)
341 KB
341 KB JPG
>>
File: 1741217735133856.png (1.51 MB, 1440x960)
1.51 MB
1.51 MB PNG
>>
File: 00005-1562835173.jpg (362 KB, 1344x1728)
362 KB
362 KB JPG
>>
>>107756036
lmao
>>
>>107755828
You're the man.
>>
>>107754748
UUOOOOOOOOOOOOOH so hot, nice anon!!!!
>>
File: 00045-862270764.jpg (455 KB, 1344x1728)
455 KB
455 KB JPG
>>
Well Z-img understands what a trident is better than pixelwave.
>>
>>107756109
Unfortunately it doesn't understand cameltoe.
>>
File: mermaids-relaxing-ocean.jpg (614 KB, 2008x1560)
614 KB
614 KB JPG
It understands what a mermaid is, not requiring a lora (did have to clean up the tails a little bit in GIMP but nothing serious). Previously only booru-tag models could do that. Also two character subject handled pretty easily. Genning several times, their poses and interaction looked natural and normal most of the time.
>>
>>107755170
What's that supposed to cost, 5k?
>>
File: 20260103_182137.jpg (1.18 MB, 3840x2160)
1.18 MB
1.18 MB JPG
Also I checked the sun it rendered. A lot of times, suns and moons and circles in general that are intersected by foreground objects get distorted, sometimes seriously. This is a pretty good result for a near perfect circle.
>>
>>107756142
no price so it's probably stuff you're not supposed to buy
>>
>>107756144
Neat also based KDE chad
>>107756129
I think the most explicitly prompted characters I've seen in a single gen is 5. Maybe.
>>
File: 1753443223220902.mp4 (1.36 MB, 1920x1080)
1.36 MB
1.36 MB MP4
Basically it's the last steps that makes Z-image turbo so realistic, that's interesting
>>
A RTX Pro with 96GB or 48GB VRAM is too much for a hobbyist that wouldn't monetize this expense
And even though I have €2600 (current local price) to carelessly blow on a 5090, I'm just a fucking retarded beginner
I wanted to stick to my current 16GB card while I learn diffusion flows, the games I play wouldn't even benefit from a 5090, but the outlook for the consumer hardware market in the coming years is SO grim that I'm tempted...
>>107754889
They will keep immensely restricting consumer facing models, the average person won't be able to just generate a collection from his oneitis picture
But yeah, this will destroy a few individuals
---
On topic: Any resources on learning how to control Z img turbo better? It straight up ignores my 1girl posing instructions. I know there are posing models out there but I don't know how to build a proper Comfy flow to pose, or to add loras
>>
File: 00057-86068703.jpg (388 KB, 1344x1728)
388 KB
388 KB JPG
I can't wait for base and properly trained finetunes for that. With what z image can do in a distilled version already, base must be the goat.
>>
>>107756298
Oh my sweet summer child
>>
>>107754353
it's funny how many people around me (into ai) told me "nah 16GB is enough" then a few months later tell me how they wish they knew so they'd with 32 instead
>>
>>107756293
>my 1girl posing instructions
What instructions? Just describe the pose in the prompt. Unless you want goon poses-
>>
>>107756245
This shows that most anons don't know how models work.
>>
File: ComfyUI_temp_hrmad_00007_.png (3.36 MB, 1088x1856)
3.36 MB
3.36 MB PNG
damn, this olivia rodrigo zlora sucks ass
>>
>>107756245
anon that's a nonsensical conclusion, you can't have "last steps" without "early steps"
>>
>>107756298
Base is going to be heavily censored.
>>
>Base is going to [headcanon]
>>
>>107756326
youll get it
>>
>>107756346
Flux was censored too yet we got Chroma.
>>
>>107756298
One thing about it is I have not been able to get a good detailed painterly background. It kind of reminds me of pony in tending towards nondescript bokeh backgrounds. Though I have seen some with good backgrounds and it might just need a very detailed description in the prompt of all background details.
>>
The worst part of dataset prep is prunning out the duplicates because the artist felt the urge to repost their own work a dozen times.
>>
https://youtu.be/PJnTcVOqJCM?t=603
what's cool is that it's keeping the face's consistency during the whole scene
>>
File: zit.png (1.4 MB, 1155x981)
1.4 MB
1.4 MB PNG
>>107754639
trying to recreate for testing
t. vramlet
>>
undervolting is so effective damn. half tdp, fully silent. pretty sure the speeds are +/- the same too but would need to actually monitor for that after doing a comfy reboot on same sneed
>>
>>107756467
what gpu anon?
>>
>>107756298
You say this but people forget e.g. how much easier it was to prompt Chroma Flash than Chroma. If you had only tried Flash and naively assumed Chroma would just be a better version, you'd be wrong. It is better, sure, but it's different, and doesn't have all the same strengths. I fully expect a lot of anons will find Z-Image base disappointing if it ever releases.
>>
>>107756428
just... nvm.
>>
>>107756467
I tried undervolting but it crashed on ComfyUi (worked fine on demanding games like Clair Obscur though)
>>
my ZIT doesn't look as good as yours guys. is it because i use the nunchaku thingy?
also this is caucasian according to zit
>>
>>107756245
lol
>>
>>107756522
>is it because i use the nunchaku thingy?
omg remove that shit anon, it looks terrible with nunchaku, come on you can't run that small model on Q8 or something?
>>
>>107756410
Yes it's the description. That one for instance is just ChatGPT slop "generate a list of scifi scenarios, one line each".
ZIT requires you to write many details otherwise you get generic stuff.
>>
File: 1734293728197562.png (313 KB, 624x759)
313 KB
313 KB PNG
Have the option of getting a cheap 3090. Would it be worth it or is it smarter to just buy a 5070ti?
>>
>>107756556
to play megabonk?
>>
fuck Grok is really good... I mean it just spits out a ton of bondage images just from an oneliner prompt. I want that local
>>
>>107756522
>nunchaku
you'd have to be in a special kind of desperation to use that for such a small model, and while nunchaku is better than q4, it's still shit compared to fp16
>>
>>107756556
Depends, if you want to game too : 5070ti will be maintained longer. If you just want to gen, I'd go with the cheaper of the two.
>>
>>107756522
>>107756540
user a bigger quand and offload with MultiGPU
https://github.com/pollockjj/ComfyUI-MultiGPU
>>
>>107756568
>>107756595

I play games too, but not really a grafix guy. I'm assuming the 5070 would output faster than the 3090 for image and video.?
>>
who uses ZIT with pinokio and can tell me how to hires fix?
>>
>>107756522
Don't use nunchaku. I have 16GBVRAM and can run Z image in BF16 well.
>>
>>107756540
>>107756584
chatgpt told me to use that for my 8gb vram. probably some bullshit
>>
>>107756556
say the prices
in terms of gaymen power theyre the same card. 3090 consumes twice as much, but you are getting extra 8gb vram
>>
>>107756659
yes
>>
>>107756689
600usd for the 3090, 700usd for the 5070ti.
>>
>>107756698
that guy is jewing you up
>>
>>107756689
Unsloth has quants.
https://huggingface.co/unsloth/Z-Image-Turbo-GGUF
>>
>>107756680
use the thinking model, ask for sources, and always challenge it otherwise it will tell you bullshit yes
try zimage fp8 and swap blocks until you don't oom (assuming you have enough sys ram)
>>
>>107756680
>>107756707
I mean him
>>
File: 1750766624674298.jpg (76 KB, 621x613)
76 KB
76 KB JPG
Protip : Use Photoshop
>>
>>107756698
go for the 5070ti
>>
File: 1745741444038297.jpg (2.04 MB, 7961x2897)
2.04 MB
2.04 MB JPG
>>107756689
>>107756680
Q8 + offload a bit to the cpu, it's the closest quality to bf16 >>107756603
>>
>>107754745
explain
>>
>>107755226
>3090 cannot do fp8
So what is happening when I run an fp8 model on my 3090 ti?
>>
why cant i go higher than 1080p in Pinokio for ZIT goddammit
>>
>>107755226
>the 3090 cannot do fp8
what? I have a 3090 and I can run fp8 just fine, are you talking about the speed or something?
>>
>>107756717
>>107756718

Thanks for the help anons. I'll get that and some more ram. 64gb will probably do me fine.
>>
>>107756769
btw i double checked the benchmarks and the 3090 is beaten by the *base* 5070 which retails for less than 500 bucks. 600 would have been an insane price to pay for a 3090 for the little vram premium on such an old card
>>
>>107754397
>no i2v
better luck next time, chud
>>
>>107756745
>>107756757
It can run it but it computes it at fp16/bf16 speeds, not fp8. This is why fp8 is so much faster on 4090/5090.
>>
>>107756718
People keep posting this fucking image like it's gospel. It is old and outdated and made using flux 1.

Newer mixed precision fp8 quants are much better than pure fp8 but still just as fast. They are very close to fp16. I use them for everything I can't run at full precision. The tiniest, almost imperceptible quality boost of q8 isn't worth the speed penalty of having to dequant every weight on the fly during inference. With loras it gets even slower because lora weights aren't merged into the main weights when using GGUF.

It also depends a lot on the model. Flux 2 is so large, that q4_k_m is hard to tell from q8, and allows that model to run on 64gb RAM + 24GB VRAM without having to use --cache-none.
>>
>>107756796
>Newer mixed precision fp8 quants are much better than pure fp8 but still just as fast.
prove it
>>
>>107756796
this.
I was using Q_8 with WAN and changed to fp8 recently and holy shit the speed and memory management in fp8 is so much better, quants are a pain in the ass to load and generally slower too, same with qwen-edit
>>
>>107756796
>Newer mixed precision fp8 quants are much better than pure fp8 but still just as fast. They are very close to fp16. I use them for everything I can't run at full precision. The tiniest, almost imperceptible quality boost of q8 isn't worth the speed penalty of having to dequant every weight on the fly during inference. With loras it gets even slower because lora weights aren't merged into the main weights when using GGUF.
Interesting, especially the lora thing, I'm gonna test that, since I use a lot of lora with wan and qwen, having it dequantize every step sounds like very bad speed wise.
>>
>>107754390
What is the detailer node you are using? Got a catbox?
>>
>>107756796
>mixed precision fp8
what is the name of these? "fp8 scaled"?
>>
RTX 5070ti super 24gb at CES
>>
File: file.png (942 KB, 1200x1200)
942 KB
942 KB PNG
can this thing run z-image-turbo?
>>
>>107756870
>24 TOPS
No.
>>
>>107756883
Maybe it can do it slowly
>>
>>107756897
That's what she said
>>
File: 1696210216117.png (97 KB, 898x527)
97 KB
97 KB PNG
>>107756897
Anon, no. Not because it's not on the list, but because of what is.
>>
>>107756870
>8GB DDR4
My sides
>>
flux, flux2, zit, qwen, they all give you mangled, body horror pussies. Even with loras, you're not getting something realistic and convincing. The only models that deliver are still bigasp and its merges, maybe chroma too but it's hit or miss. It's very frustrating
>>
>>107756935
I guess you mean realistic stuff. For 2D at this point I think lewd 1girl is practically a solved problem with illustrious.
>>
>>107756935
Illustrious and Pony are pretty capable for puss too.
When it comes to nsfw waifus you can't get far without the SDXL line.
>>
>>107756917
SD1.5 might be enough. What do you mean "because of what it is"? If the model fits in the RAM, it should work, no?
>>107756930
It's made for edge devices. I'm just trying to see if I can push it to do other things.
>>
>>107755060
>pixel layer watermark
>can survive screenshot, resize, reencode, compression
spoopy. I bet comfyui does this irregardless of model
>>
why is heun so fucking slow?!
AAAUAAuaaauaUGGHHHHh
I'm gonna HEUUUUUUUUUUUN!!!!
>>
>>107756870
Oh I thought this was a gen of a fake chip KEK
>>
is it realistic to expect local models to act like online models where you just say
>make her wear a bikini
in the near future?
>>
>>107757030
Qwen already does but ask the person in the picture before doing so
>>
File: 1761430824102318.png (2.03 MB, 1152x1472)
2.03 MB
2.03 MB PNG
>>
>>107757030
No, that would be unsafe.
>>
>>107756935
Seeing redditors have violent fits over Chroma is entertaining when it comes up. I can't blame them because there's zero meaningful documentation, but them prompting "awesome, big boob, woman hot" is chefs kiss.
>>
>>107757030
>make her wear a bikini
that's illegal bro
>>
>>107757110
>violent fits over Chroma
why?
>>
>>107757117
99% of them are civit proles that can only type in the same 4 tags into a prompt window. They haven't figured out language prompting yet but maybe someday.
>>
>>107757137
there is no better site for LoRAs than civit
>>
File: 1759176597466297.png (1.07 MB, 832x1216)
1.07 MB
1.07 MB PNG
>>107757030
That's how Qwen Edit works already. In the near future people are waiting for Z-Image edit which will probably do it better.
>>
>>107755199
really nice! how are you making these?
>>
>>107756935
How are you forgetting Wan?
>>
>>107757044
>>107757153
>Qwen
nice good to know
>>
>>107757153
oh yes this in Z image edit with proper skin texture.
>>
when ready
>>107757207
>>107757207
>>107757207
>>
>>107757153
You edited the bikini onto her just for the blue board, correct?
>>
>>107756713
no
>>
>>107757187
Boomer prompting and img2img.

...
>>
>>107756831
>what is the name of these? "fp8 scaled"?
i think also "fp8 mixed"
>>
>>107755159
It's obsfuscating my actual fetishes, but the poses and concepts are pieces. Since I know wan 2.2 doesn't do full body couple porn well, I just go for general ideas that would be necessary for the porn were it to be capable. Of it can't to any of those things, it's useless to me. I only use AI to make shit that otherwise doesn't exist IRL.
>>
>>107756796
>>107756831
>>107757389
>miniscule quality increase
fp8 scaled is 5x worse than Q8_0 GGUF. We don't talk about normal fp8.
Getting tired of reposting this too. I'm assuming you're the same guy who I posted this rentry to last time as well
rentry.org/QUANTIZATION_ANALYSIS

I'm mostly fighting this fight about text encoders though. Arguments can be made for having e.g wan loaded in fp8 scaled but if you recommend sometime to use the text encoder in anything other then fp16 with --fast or Q8_0 you're trolling


If you mean something other than fp8 scaled when you say "mixed precision fp8" I would be interested in being spoonfed more
>>
>>107756680
i'm using fp16 on an 8gb card just fine
>>
File: file.png (3.64 MB, 1248x1824)
3.64 MB
3.64 MB PNG
>>
File: file.png (425 KB, 450x658)
425 KB
425 KB PNG
>>
File: file.png (543 KB, 450x658)
543 KB
543 KB PNG
>>
>>107756311
I would kill to be able to output 1080p video.
>>
>>107755294
It almost sounds like "we know it was fucked up and we learnt from it." I expected much less.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.