[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107481484

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
no u
>>
I'll just keep having fun with ZIT until base releases :)
>>
>best way I can describe using comfyui moving forward is like sitting on a 12" dildo leaving it in and saying "fuck it, I'm gay now" instead of pulling it out with some dignity and saying "what's the next steps"
>>
Wretched thread of faggotry
>>
File: Z-image turbo.png (3.25 MB, 1536x1536)
3.25 MB
3.25 MB PNG
>>
Z-Image Base
>>
Anons, ultimately I'm not feeling comfortable in this thread anymore
>>
>>107484430
never EVER
>>
The real reason why these threads are so quiet is this: they are insufferable and this base model release begging spam etc
If you want to affect something go spam the devs twitter or something for fuck sake
I don't want to post a single image here
"discussion" is also impossible
>>
>>107484435
just pull the dildo out of your ass with some dignity and ask what's the next steps?
>>
Thoughts on Chroma?
>>
>>107484435
And Forgetable?
>>
>>107484450
I don't know, ultimately ldg transformed into some weird energy. I don't feel comfortable anymore
>>
>>107484443
>If you want to affect something go spam the devs twitter or something for fuck sake
they spam them on discord but they stopped responding, they're ignoring everyone lol
>>
>>107484442
gonna give you up
>>
File: Miku.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
>>
Hey guys, Mr. Baba from Alibaba here. Z-Image Base was actually finished, and we put it on a flash drive to have it shipped over to Huggingface so that it could be published. Unfortunately, that flash drive was lost in transit over the Pacific and we don't have any backups of the model. We're going to have to retrain it from scratch. Sorry!
>>
>>107484478
Nice style
>>
>>107484492
no problem, thanks for trying!
>>
>>107484456
the 50k it cost to train would've been better spent on coke and hookers
>>
>>107484478
omg it migu!
>>
>>107484456
it's great
>>
>>107484478
Missed you miku tester anon
>>
File: ComfyUI_00243_.mp4 (3.72 MB, 640x832)
3.72 MB
3.72 MB MP4
>>
I accidentally WTC
>>
>>107484537
jet beams and steel fuel
>>
>>107484456
it's meh
>>
>>107484443
The real reason is that I personally got bored and relapsed on multiple addictions thereby causing a psychic cascade. My bad.
>>
>>107484456
If porn is your fuel then it's god since you can gen megadegen without a single lora.
>>
File: 1754923310635148.png (1.48 MB, 1280x720)
1.48 MB
1.48 MB PNG
>>107484416
>>
>>107484568
Nice style
>>
>>107484456
Blurry crap
>>
File: file.png (279 KB, 947x594)
279 KB
279 KB PNG
progress from pixel art VAE finetune
>>
File: FUCK ME.png (1.57 MB, 1280x720)
1.57 MB
1.57 MB PNG
Come on Alibaba, end our suffering, if you want to make it API only, have some balls and say it now.
>>
>>107484416
Classic symptom of not understanding Chinese culture.

The westoid cannot comprehend the idea of someone strongly implying they will do something then walking back on it.
>>
api is so heavily censored now. sora consistently rejects all my prompts, whereas i only got a few warnings weeks ago. why do these api fags persist with this shit? local is the boss
>>
File: 1765233710.png (870 KB, 1024x1024)
870 KB
870 KB PNG
>>
i updated comfyui and now my gens are 2x slower
>>
>>107484728
price of progress
>>
File: ZImage_01202_.png (287 KB, 512x768)
287 KB
287 KB PNG
>>
>>107484456
Best for NSFW hands down, and you can easily improve it with NSFW loras.

For everything else, Z-Image Turbo is the only thing worth using unless you are a weeb, then it's SDXL finetunes.

Most likely we will see a Z-Image Base (or even Turbo) finetune that adds uncensored NSFW, which will then be the SOTA
>>
>>107484783
kek
>>
>>107484399
As a Flux loyalist it pains me to say this, but Z Image is the new standard. It's so fast compared to Flux, or as I like to call it, Flush it in the toilet.
>>
File: ZIT_i2i_00011_.jpg (926 KB, 2048x2048)
926 KB
926 KB JPG
Tried some i2i upscaling old gens with ZIT. Had to stick to low denoising because it wanted to overwrite the face entirely and no amount of description did much to bring her back.
>>
File: whut.png (169 KB, 231x312)
169 KB
169 KB PNG
This is the freakiest random unprompted person I've seen so far.
>>
File: Z-image turbo.png (772 KB, 1280x720)
772 KB
772 KB PNG
>>
>>107484456
>>107484786
Which Chroma version/tune is best for NSFW?
>>
YOU CANT HANDLE THE TOOTH
>>
Chinese ________.
>>
File: ComfyUI_00255_.mp4 (459 KB, 640x832)
459 KB
459 KB MP4
>>107484908
oops
>>
File: ZImage_01276_.png (297 KB, 768x512)
297 KB
297 KB PNG
comfyui ui laggy after latest update or just me?
>>
File: 00011-3524689323.png (1.14 MB, 896x1152)
1.14 MB
1.14 MB PNG
what's the difference between base and Turbo anyway? Is it really gonna make our lives better?
>>
>>107484952
yep it's laggy as fuck, the frontend team is a bunch of incompetent hacks I swear
>>
>>107484955
one is based and the other is fast
>>
>>107484952
why do you think people are so pissed off at it more recently?
>>
File: ComfyUI_00256_.mp4 (693 KB, 640x640)
693 KB
693 KB MP4
>>
>>107484928
Can you animate this for me.
>>
>>
Hello friend, I'm getting started with image generation and I'm kind of aimlessly floundering around with comfy and forge and and have the basics figured out but I have no idea how to do things like upscaling, controlnet, faceswaps and other plugins that I've heard people talk about. Can anyone point me towards resources for what to do beyond downloading checkpoints and loras?
>>
File: ComfyUI_00257_.mp4 (466 KB, 640x640)
466 KB
466 KB MP4
>>107484983
>>
>>107484999
Impressive. Do more.
>>
File: ComfyUI_00258_.mp4 (299 KB, 480x640)
299 KB
299 KB MP4
>>
File: Z-image turbo.png (3.4 MB, 1920x1080)
3.4 MB
3.4 MB PNG
>>
>>107485033
kek.
>>
File: 1516376234133.jpg (82 KB, 500x371)
82 KB
82 KB JPG
When you look at the sampling preview and see an interesting result mid-sample is there a way to fish it out and continue from that stage? There's sometimes random gold on step 3 of 8 and it's getting lost.
>>
>>107484998
>upscaling
In forge after you have genned an image there's the row of buttons under it. One of those is hires fix which is upscaling with the hires fix settings. You can also use the i2i tab to upscale.
In comfy you can use upscale with model or download custom nodes.

>controlnet
I haven't really gotten into this myself.

>faceswaps
In comfy you can get some custom nodes to do it. There's an uncensored version of reActor around.
>>
>>107485104
intruiging
>>
Fuck off Kris
>>
>>107484998
Sameish

I got neoforge and some XL models installed, as well as zit, but I'm looking for a guide on how to set the various settings and just what to do, how to prompt
>>
File: 1757514067862207.png (166 KB, 1049x798)
166 KB
166 KB PNG
>>107485091
chain samplers
>>
>>107485104
I think you should kill yourself.
>>
>>107485119
lore?
>>
>>107485117
luiging
>>
File: 1755022794942247.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
>>
>>107485131
that doesn't look comfy
>>
>>107485133
nta but cris is a spammer of many boards, mainly known as a /vg/agdg/ schizo and for ruining /3/

last few years he's been spamming AI shit a lot but I don't know if that furfag is cris
>>
File: ComfyUI_00913_.png (1.33 MB, 1664x712)
1.33 MB
1.33 MB PNG
>>
>>107484899
v50 (final release) or v49 IMO, it already knows all the NSFW concepts so you are just adding that extra quality to a known concept that the base model can't reach since it is a jack-of-all-trades master of none
>>
File: ram2.png (37 KB, 803x464)
37 KB
37 KB PNG
I love being able to use Flux 2 as a vramlet. Linux is a miracle
>>
File: 1734654517890628.png (1.9 MB, 1024x1024)
1.9 MB
1.9 MB PNG
>>
>>107485104
>thoughts
it's bad. I'm sure only the most hardcore or furries could stomach such an abomination.
I urge you to cease and desist.
>>
>>107485198
post results. lets see what flux 2 can do.
>>
File: ZIT_i2i_00028_.png (2.89 MB, 1536x1536)
2.89 MB
2.89 MB PNG
>>
File: ComfyUI_00264_.mp4 (798 KB, 640x640)
798 KB
798 KB MP4
sometimes wan is just retarded
>>
File: 1745898479357629.png (942 KB, 1024x1024)
942 KB
942 KB PNG
>>
wan 2.2 sure loves bodyhorror, like turning a dick into a foot or having a girl vomit up cum
>>
File: Flux2_00414_.png (3.11 MB, 1824x1248)
3.11 MB
3.11 MB PNG
>>107485224
>>
>>107485241
kekkkkkkkk
>>
>>107485230
Hahaha.
>>
File: ComfyUI_00270_.mp4 (1.16 MB, 832x640)
1.16 MB
1.16 MB MP4
>>
is python itself retarded or just the people using it
>>
>>107485312
both
>>
>>107485311
kek they chucked a stick at him
>>
File: ZImage_01391_.png (238 KB, 896x640)
238 KB
238 KB PNG
>>
>>107485312
hi ani
>>
>>107485311
>tfw you have a 32b armor but a 6b stick kills you
>>
>>107485330
apropos
>>
File: 1743678304550526.png (1.9 MB, 1536x864)
1.9 MB
1.9 MB PNG
>>
>>107485312
Rite of passage
>>
>>107485323
B
>>
File: ComfyUI_00923_.png (1.27 MB, 1664x712)
1.27 MB
1.27 MB PNG
>>
Onetrainer should have the ability to train ZiT loras soon, nice
>>
File: ComfyUI_00273_.mp4 (1.18 MB, 832x640)
1.18 MB
1.18 MB MP4
>>
>>107485311
can it be a crossbow arrow?
>>
>>107485363
i hope it makes me download the entire hf repo instead of using my local diffusers and encoder :)
>>
File: 00029-305586167.png (1.15 MB, 1152x896)
1.15 MB
1.15 MB PNG
>>107485311
not bad. >>107485363
how long does it take to gen?
>>
File: ComfyUI_00271_.mp4 (1.19 MB, 832x640)
1.19 MB
1.19 MB MP4
>>107485374
close but not quite
>>
>>107485365
another plane has hit the knight
>>
>>107485363
didn't one anon already train something in zit? what did they use?
Couldn't one theoretically use a flux trainer since they use some of the same encoders?
>>
File: Wanimate_00137.mp4 (673 KB, 720x720)
673 KB
673 KB MP4
>>107484999
Thoughts?
>>
File: ComfyUI_00930_.png (1.33 MB, 1664x712)
1.33 MB
1.33 MB PNG
>>
File: 1747969781264149.png (1.99 MB, 1536x864)
1.99 MB
1.99 MB PNG
>>
>>107485381
if it's anything like with the other models you just need to drop the checkpoint and encoder into the same folder without any additional buggy humiliation rituals
>>
File: ZImage_01411_.png (227 KB, 896x640)
227 KB
227 KB PNG
>>107485352
>>
>>107485404
lold
>>
File: ComfyUI_00931_.png (1.28 MB, 1664x712)
1.28 MB
1.28 MB PNG
>>
>>107485413
Kill Jester
>>
File: 00030-3881377132.png (1.65 MB, 1152x1152)
1.65 MB
1.65 MB PNG
>>107485387
holy shit it went right through him.
>>
>>107485404
What have you done.
>>
File: ComfyUI_00935_.png (1.29 MB, 1664x712)
1.29 MB
1.29 MB PNG
>>
>>107485401
>what did they use?
It was probably this
https://github.com/ostris/ai-toolkit
>>
>>107485401
>what did they use?
AI toolkit, which sucks
>>
File: ZImage_01433_.png (345 KB, 896x640)
345 KB
345 KB PNG
>>107485442
>>
File: Wanimate_00138.mp4 (1.09 MB, 832x528)
1.09 MB
1.09 MB MP4
>>107485387
Thoughts?
>>
aitoolkit looks so much better than onetrainer but it actually sucks ass. onetrainer looks like ass but at least you dont have to do a humiliation ritual to train.
>>
>>107485506
damn, right in the tit
>>
>>107485401
>what did they use?
Ai-Toolkit and Diffusion-Pipe have support

OneTrainer has a branch for testing, should be merged any day now
>>
>>107484999
This is an old avert for a Barbarian game, right ?
>>
>>107485485
>>107485490
>>107485518
Thanks. I usually use Invoke-training but I doubt they'll have support anytime soon. git looks abandoned
>>
File: file.png (1.02 MB, 1924x1190)
1.02 MB
1.02 MB PNG
Fucking hell
>>
>>107485584
what are we looking at here?
>>
>base
never
>but...
no
>>
>>107485584
NICE NOODLES FAGGOT
>>
File: ComfyUI_00950_.png (1.19 MB, 1664x712)
1.19 MB
1.19 MB PNG
>>
>>107485600
is that chinese culture?
>>
There is NO fucking WAY I just got a fucking fat ugly bastard memeguy out of nowhere as a hallucinated person.
>>
>>107485599
painful uncomfy 1girl installation
>>
>>107485600
>mid model
>released

>grounbreaking model
>never
>>
File: 1563400429238.jpg (43 KB, 446x456)
43 KB
43 KB JPG
>>107485584
Niggas should've just packed The Sims character editor as a prompter.
>>
>>107485506
noooo not the boob
>>
>>107485609
looks like my apartment in nyc
>>
>>107484728
time to upgrade gpu
>>
File: ComfyUI_00958_.png (1.12 MB, 1664x712)
1.12 MB
1.12 MB PNG
>>
>update ComfyUI for first time in a month
>Nodes 2.0 disabled by default, can opt-in if I want
>cancel workflow button is gone from the bottom, but it's very obviously visible in the new progress bar in the top right, not a big deal
>performance is fine, same as it's ever been
>everything works the same, no issues
What the fuck is everyone complaining about.
>>
File: ComfyUI_00960_.png (1.22 MB, 1664x712)
1.22 MB
1.22 MB PNG
>>
>>107485708
shut the fuck up comfy and bring back the stop button
>>
File: ComfyUI_00962_.png (1.38 MB, 1664x712)
1.38 MB
1.38 MB PNG
>>
File: ComfyUI_00964_.png (1.29 MB, 1664x712)
1.29 MB
1.29 MB PNG
>>
>>107485712
The stop button is right fucking there in the top right next to the progress bar.
I swear I'm the only person who just adapts to minor inconsequential changes like this without any problems.
Are you guys all boomers or what.
>>
>>107485584
Is this really what comfers have been doing all this time just to gen some generic chibis?
>>
>>107485678
fapping
>>
>>107485764
>The stop button is right fucking there in the top right next to the progress bar.
it's not retarded monkey
>>
>>107485401
I used ai-toolkit
>>
File: 18.jpg (180 KB, 750x519)
180 KB
180 KB JPG
WHY DO ONLY SUBGRAPHS HAVE CUSTOM COLORS
DRAGGED AND SHOT
TOTAL COMFY ERADICATION
BY ALLAH THE FRONTEND TEAM WILL BE STONED
>>
>>107485802
because you touch yourself all day
>>
File: ComfyUI_00968_.png (1.23 MB, 1664x712)
1.23 MB
1.23 MB PNG
>>
Can I run zit's text encoder on a different machine? I have an ollama box
>>
>>107485599
bunch of chinamen thought it would be a good idea to spend $70000 to train a mediocre deep fried anime model that requires XML prompting. they ran out of money and are now hoping someone sponsors them even more compute to keep training it because just a little more cooking will surely fix the problems
>>
>>107485834
no it will produce mustard gas
>>
File: ComfyUI_00972_.png (1.24 MB, 1664x712)
1.24 MB
1.24 MB PNG
>>
>>107485709
>>107485725
>>107485730
Is this your character lora or just base Z-model?
>>
>>107485839
is this ZiT? With lora?
>>
>>107484624
neat
>>
DRAGGED
AND
SHOT
>>
>>107485834
You mean as an llm? Why? It's just base qwen.
>>
>>107485866
No, for zit. I'm low on vram here
>>
>>107485876
Just offload. Or get Q8 which is ~4,5G
>>
File: ComfyUI_00977_.png (1.29 MB, 1664x712)
1.29 MB
1.29 MB PNG
>>107485839
>>107485847
just ZiT, no loras
>>
File: z-turbo_00085_.png (3.72 MB, 1280x1920)
3.72 MB
3.72 MB PNG
>>
File: ComfyUI_00982_.png (1.31 MB, 1664x712)
1.31 MB
1.31 MB PNG
>>
>>107485791
The 'X' button, are you blind ?
>>
>>107485955
don't try to lie, this problem hasn't be resolved, get back to work subhuman
https://github.com/Comfy-Org/ComfyUI_frontend/issues/7108
>>
File: 00000-2272334071.png (1.48 MB, 1024x1024)
1.48 MB
1.48 MB PNG
>>
>>107485881
I think it's already offloading everything and having to swap the models mid-run. How much vram does zit need to run comfortably?
>>
>>107485836
For what it's worth, it's not bad as a beta. As we're never getting ZIB, i see the vision.
That installation is aids though
>>
>>107485839
i want a roastie like this so bad
she won't ever judge me
>>
>>107486021
she's a midget
>>
>>107485972
This must be some Nodes 2.0 shit, because I'm running Comfy right now and there's an 'X' to stop the current generation
>>
File: ComfyUI_00989_.png (1.25 MB, 1664x712)
1.25 MB
1.25 MB PNG
>>
File: 1752106270978553.png (26 KB, 712x270)
26 KB
26 KB PNG
>>107485764
you mean it's on this pointless progress bar that I have to hover over to even see because someone at comfyorg thought this was a good idea
>>
>>107486032
I would go to AA meetings with her and then cuddle and maybe have her peg me too
>>
File: ComfyUI_00991_.png (1.24 MB, 1664x712)
1.24 MB
1.24 MB PNG
>>
>>107486034
>good idea
sir! it is a great idea!
>>
>>107486065
saar
>>
File: ComfyUI_00295_.mp4 (1.52 MB, 720x1280)
1.52 MB
1.52 MB MP4
>>
>>
File: ComfyUI_00993_.png (1.43 MB, 1664x712)
1.43 MB
1.43 MB PNG
>>
>>107481706
honestly I don't see why not

some additional concerns from what I've gathered:
>lpddr5x slow
for imagen it's compute bound and 5070-like speed is fast enough

>but llm and mem bandwidth
alternative costs much more and muh power efficiency and muh room heater

>people report thermal throttling
just buy the asus one with better ventilation at the bottom and put a fan under it

>it idles at 30 to 40w
I guess it can be fixed with software but it's really not that high

>you can't plug another dgpu on it
maybe it's not needed, or just use another PC

it would be a great multi purpose lil box for experimenting stuff
>>
>>107485987
What's your system anyway? Full ZiT is 13G with 8G TE. Halve everything for Q8. Use multigpu node to dump the encoder before sampling.
>>
File: z-image-leak.png (1.87 MB, 1920x1080)
1.87 MB
1.87 MB PNG
GUYS I FOUND THE Z-IMAGE BASE
>>
>>107486164
>or just use another PC
Or I can just use another PC instead of this shite. Size doesn't matter unless you live in a singaporean apartment.
>>
>>107486182
lul
>>
>>107486164
or just pay for credits on openrouter and save your money and get good results with whatever the hell model you want
>>
File: ComfyUI_01000_.png (1.39 MB, 1664x712)
1.39 MB
1.39 MB PNG
>>
>>107486164
If you need anonymous users to approve your financial decisions, this computer won't make you happy
>>
>>
>>107486181
For now 16gb ram and 8gb vram (3060ti). I bought 16 gigs of extra ram and a 5070ti with 16gb vram but they're not here yet.
>>
>>107486202
>can do nsfw because muh cloud
>>
File: me coding.gif (2.47 MB, 320x240)
2.47 MB
2.47 MB GIF
>Kohya_SS Musubi tuner supports Z-image Turbo now
>mfw when try to run the code

It's cooking now. I'll see if it's better than Ostris in several hours.
>>
>>107486164
>arm
>>
>>107485584
>this cancer prompting format
Made by literal bugmen
>>
>>107486220
At least musubi-tuner works for me. I couldn't get aitoolkit to train at all.
>>
>>107484456
Love it and had to force myself to stop since it was eating up all my free time.
>>
>>107486225
how's that an issue for AI only purposes?
>>
>>107486240
Shouldn't all trainers train good enough providing the same data set?
>>
>>107486287
nah, they own add their own codes and stuff, from example training a chroma lora with ai-toolkit will give you different results than training with diffuse-pipe or OT, depending how they interpret the layers issue
>>
>>107486287
I was getting a bug where it just hangs at the start of training and doesn't do anything. There's no error logs so I don't know what it's doing
>>
How do you guys prompt when you're not on your computer?
I vibe coded a little web server so I can prompt for my 1girls at the toilet, at work, whenever and wherever!
>>
>>107486298
Witch kit is best then?
I've had some issues getting any to work but after a while they just do.
>>107486300
Did you check the terminal or terminal emulator?
>>
>>107486164
yeah
buy one before ram price fuck it up
>>
>>107486332
Didn't see anything but I kind of gave up already. Will just try musubi-tuner or onetrainer when it supports it
>>
Well then /ldg/ What lora trainer to rule them all?
>>
>>107486374
Doing it by hand manually
>>
whats the best img to vid for porn gening? I just got a 5090 and i want to put it to work.
>>
>>107486435
wan 2.x
>>
>>107486446
any specific models? recommendations?
>>
File: 00001-339569152.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>
>>107486461
I recommend you return the 5090 and get a 5070 ti
>>
>>107486374
OneTrainer often lags by a couple of weeks when it comes to supporting new models, but I'd say it's the overall best, it's the fastest and least vram hungry, supports training practically every model even at as low as 8gb vram with its extremely optimized block swapping, also has a new quantization update in final stages of testing which cuts training time by half with practically zero quality loss.
>>
Sucks that you cna't use more than 1 lora at a time with ZIT, it fucking blows everything else out of the water in every other aspect.
>>
>>107486478
>OneTrainer
Never tried it as it doesn't have a webui
>>
>>107486476
why
>>
File: 00065-2356388029.png (1.65 MB, 1152x1152)
1.65 MB
1.65 MB PNG
>>
>>107486231
Being able to prompt multiple characters without crosstalk is gamechanging, I'll take it.
>>
>>107486510
I just think buying a 5090 to generate 5 second slop videos is a poor choice, but if you have the money then whatever, download wan 2.2 and look on civitai for loras
>>
File: ComfyUI_temp_onkru_00005_.png (2.27 MB, 1400x1800)
2.27 MB
2.27 MB PNG
>>107486332
mm I can only talk for Chroma, ai-toolkit works but it will give you image artifacts (those damn lines), diffuse-pipe works good but everyone on their discord is using OT, even one of the devs is a regular there so thats what I've been using lately, is also the fastest for training,
>>
>the base is coming in a few we...
no it isnt
>but this anime poster sa...
i dont care
>>
File: ComfyUI_00383_.png (2.57 MB, 1248x1920)
2.57 MB
2.57 MB PNG
>>107486542
Pretty sure that was always possible. You can just name your characters and then keep referring to them to describe them.
>Two girls named Ashley and Kathy sitting on a bench in a snowy field. Ashley is on the left and wearing a white beanie and brown jacket with a red scarf and looking at Kathy. Kathy is on the right and holding a cup of coffee, wearing pink ear muffs, a large blue hoodie, and black gloves. Ashey has long blonde hair and long bangs. Kathy has short black hair tied in a ponytail.
>>
Bro just takes the whole door!
>>
>>107486508
I'm just reading the git and if by 'webui' you mean a graphical interface, it does have one.
it's called start-ui
>>
>>107486620
>>anime poster sa...
that guy is clearly being paid to fluff and anon falls for it
>>
>>107486622
damn that's sharp af
>>
>>107486633
I think anon means a UI you can run remotely, probably because he's training on runpod or something
>>
>>107486616
I never noticed those lines.
>>107486633
That's a python based ui. I run my server in the basement so I don't have to get the heat nor sound in my office.
>>
>>107486682
>That's a python based ui
so what do you even use for inference? all of this shit is python launched on a server
>>
>>107486708
I think most ones default to gradio or npm. They made a post on how a browser is wasting VRAM when training and I agree on that *if* the browser is on the same machine as the training happens.
>>
File: 1735069124702983.png (796 KB, 1201x1482)
796 KB
796 KB PNG
wait what?
https://xcancel.com/LodestoneRock/status/1998215045118112029
>>
Is it just me or quality of Wan 2.2 FLF2V with lightning LoRA is really bad especially the last keyframes? Is VACE just a better way to go for long videos?
>>
just tried z-image. Why is every one of my images so noisy/grainy. I tried the official and some custom workflows and both have this issue.
>>
>>107486749
looks like lodestone implemented a new full pixel method or something
https://github.com/LTH14/JiT
>>
>>107486749
No matter how hard she begs the Chinese will never accept her
>>
>>107486784
kek
>>
>>107486779
jeet method?
>>
>>107486778
the "official" workflow probably still has flow bypassed. enable it and set it to anywhere between 3 and 7, helps somewhat. depends on the prompt as well.
>>
>>107486749
this is the vae-less stuff he's doing with racdiance right?
>>
>>107486793
yes, but with something new now >>107486779
>>
File: ComfyUI_00551_.png (2.08 MB, 1280x1536)
2.08 MB
2.08 MB PNG
>>
File: file.png (307 KB, 2328x1009)
307 KB
307 KB PNG
>>107486749
>>107486779
https://arxiv.org/abs/2511.13720
hmm...
>>
File: Capture.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
>>107486749
>https://xcancel.com/LodestoneRock/status/1998215045118112029
that image looks fine and I'm not seeing the squares patterns like on previous radiance models
>>
File: poohicide.png (117 KB, 304x304)
117 KB
117 KB PNG
>>107484624
You are training this? Is there some place you are planning on posting it? I love pixel art.
>>
>>107486865
Details still look fucked up kek but I wish him well on his endeavor
>>
>>107486843
why are researchers so uncreative with names. jit is already "just in time" compilation
>>
File: ComfyUI_00566_.png (2.82 MB, 1280x1536)
2.82 MB
2.82 MB PNG
>>
File: 1355139830646.png (178 KB, 500x500)
178 KB
178 KB PNG
>check HF to try that T2I wan shitmix that is out
>404
The fuck? Shit came out few days ago.
>>
>>107486843
im retarded and interested, explain pls
>>
File: 1742502700969310.png (156 KB, 2282x748)
156 KB
156 KB PNG
>>107486996
they simplified the architecture and don't use VAE's anymore, and the model is happy since it has to work with simplier shit
>>
>>107486843
this won't be usable on regular peoples' hardware probably tho
>>
>>107487010
how is that different from the radiance he already had? even simpler?
>>
Even at low res/settings my videos look deep fried. What am I doing wrong?
>>
>>107487031
Anything load of shit stones makes is basically bunk. Wasted money on a bad idea. Disregard anything his does.
>>
>>107487039
using lightx2v? if so, are you using anything but 1.0 for cfg?
>>
>>107487051
Wan 2.1 or 2.2
>>
what's your ratio of shit to good gens?
>>
>>107487010
The downside is that it takes significantly longer to generate images. Back to square one.
>>
>>107487104
If I posted all my shit gens I could turn this thread in to /sdg/ in the space of 20 minutes.
>>
>>107487147
is the butterfly guy still spamming /sdg/? amazing how one tweaking schizo can permanently ruin a general
>>
>>107487157
>butterfly guy
Ima need a qrd. I'm still seething over purple witch and the quokka.
>>
Oh wow I just checked in on /sdg/. What a fucking wasteland.
>>
File: z-image_00636_.png (3.19 MB, 2048x1152)
3.19 MB
3.19 MB PNG
>>
File: z-image_00637_.png (3.19 MB, 2048x1152)
3.19 MB
3.19 MB PNG
>>
>>107486478
OneTrainer is great. I've been training SDXL loras on a 4GB GTX1650S. I train at 768 res and below and I plug my monitor into my integrated graphics while I'm training and I manage to avoid OOM. Probably takes like 10 times longer to train anything than everybody else, but whatever, it works and getting a new GPU isn't an option.
>>
>>107487191
yeah it just seems to be one jeet spamming indian pictures
>>
File: Wanimate_00139.mp4 (782 KB, 832x528)
782 KB
782 KB MP4
Thoughts?
>>
>>107487236
best timeline
>>
Woah
>https://thaoshibe.github.io/relsim/retrieve/index.html
>>
File: z-image_00639_.png (2.76 MB, 2048x1152)
2.76 MB
2.76 MB PNG
>>
>>107487236
fucked up hand
>>
>>107486164
yeah but which oem is least shit?
>>
File: Wanimate_00140.mp4 (600 KB, 832x528)
600 KB
600 KB MP4
>>
File: file.png (69 KB, 788x496)
69 KB
69 KB PNG
>primitive node connected to scheduler on ksampler
>primitive node connected to scheduler on facedetailer
whyyyyyyyyyyyyyyyyyy
>>
File: z-image_00642_.png (3.56 MB, 2048x1152)
3.56 MB
3.56 MB PNG
>>
File: z-image_00643_.png (577 KB, 2048x1152)
577 KB
577 KB PNG
>>
>>107487215
When I had gtx 1650 I never even thought about sdxl lora training because I thought it would have been a disaster. Good to know that's not the case...
>>
File: z-image_00645_.png (987 KB, 2048x1152)
987 KB
987 KB PNG
>>
>>107486478
>it's the fastest and least vram hungry

It's the opposite. OneTrainer has the slowest and most bloated UI of them all.
>>
>>107487356
I thought the same thing for a long time because the one time I downloaded something to try training an SD1.5 lora even THAT turned out to be impossible. I finally decided to try again a few months ago and see if things had improved optimization-wise and it seems they have.
>>
>>107486865
the arm
>>
>>107487316
>if I sit still no one will notice I'm a chicken wing
>>
File: 1739395320472403.png (2.65 MB, 1216x1728)
2.65 MB
2.65 MB PNG
>check /ldg/
>it's still not out
yeah it's over
>>
>>107487446
You're all so deeply immersed in Chinese culture right now.
>>
The base model will be released once Tongyi finally finishes updating README.md
>>
>>107487446
it's been 4 days they've been ignoring everyone and pretending there shouldn't be any update about the date release of base, so yeah fair to say they're trying to pull it under the rug, it's over
>>
>>107487493
God I want a prize for seeing it and calling it. I also want the people who said I was a schizo or posted screenshots of non-commital discord responses to make video recordings of themselves apologizing to me.
>>
>>107487518
>God I want a prize for seeing it and calling it.
it's not officially over, it'll be once they'll announce it as an API only model
>>
I fell for it again...
>>
SDXL Eternal
>>
>>107487528
Baiting people with promises of open sourcing then close sourcing is Chinese culture.

>Hunyan 2.5
>Hitem 3D
>Plethora of video models
>Wan 2.5 (yes I count this one too, fuck you)
and now: Zit Base
>>
All that excitement over finally getting a good model that wasn't bloatmaxxed, all for nothing...
>>
>>107487545
>yes I count this one too
so you're a disingenuous retard? got it
>>
File: 1759006243758442.png (3.37 MB, 1216x1728)
3.37 MB
3.37 MB PNG
>b-b-but the tranime poster on reddit!!
>>
don't worry, lodestones will save us
>>
>>107487567
ramtorch node for christmas?
>>
>>107487576
no, z-chroma radiance flash 2kdc
>>
z image base when?
z image 30B when?
>>
new radiance just dropped and was already merged. And apparently he is going to test making it a 1 step model as well then wants to do z image if base drops
https://x.com/LodestoneRock/status/1998215045118112029
https://huggingface.co/lodestones/Chroma1-Radiance/blob/main/latest_x0.pth
https://github.com/comfyanonymous/ComfyUI/pull/11197
>>
>>107487662
cool. i'm gonna try it out. last time i tried radiance it sucked ass and took 5 hours to gen with
>>
>>107487662
already talked here >>107486749
I think I need to see more than 2 images to see the improvement though lol
>>
>>107487662
who's going to bite and post some test gens?
>>
>>107487700
It's almost like vae compression was being done for a reason hmm....
>>
i hope she keeps begging in their server its funny to witness
>>
>>107487740
that place is a wasteland since the tongui devs decided to hide and not talk to anyone anymore lmao
>>
File: 1742170382078921.png (30 KB, 785x349)
30 KB
30 KB PNG
>>107487700
rip
>>
>>107487725
no, several papers have come out stating vaeless training was possible, that is just the first model that has tried it that wasn't some small test for a paper
>>
File: 251208235927_00001_.png (1.19 MB, 896x1152)
1.19 MB
1.19 MB PNG
Trying ZiT to SDXL. I like it so far.
>>
>>107486332
diffusion pipe, ai tool kit is only good for short / small dataset loras, musibi trainer seemed good the one time I tried it long ago
>>
>>107487662
>latest_x0.pth
a fucking pickle? in the year 2025 of our lord?
>>
>>107487662
>if base drops
if hes actually still waiting for base then he is far more idiotic than even his most envious shitposters have led me to believe
>>
>>107487813
it seems like he's able to talk to the tongyi devs directly so he knows better than most if the mood is towards the release or not
>>
>>107487813
are you stupid? the undistill ostris made falls apart like a few thousand steps in. You can not train anything serious off of that
>>
>>107487662
>https://github.com/comfyanonymous/ComfyUI/pull/11197
it's been merged, is there a safetensors version of that shit?
>>
>>107487818
>talk to the tongyi devs directly
if that was true he wouldnt be @ing them in the main channel everyone has access to
>>107487819
the first dedistilled flux didnt take off either but my reply wasnt about that dummy
>>
File: 1758986549282692.png (21 KB, 220x154)
21 KB
21 KB PNG
>>107487662
>Z-Image Edit Chroma Radiance, a perfect edit model that only works on pixels so you don't compress the image for each edit iteration
just imagine
>>
>muh safetensors
>worried 1girl gens getting sent to fbi
kek
>>
>>107487848
It looks like he plans to finish radiance first, though he had paused training to get ramtorch going so he can do it with a 8x 4090 build instead of needing to throw money at h100 rentals
>>
Is it possible to inpaint in wan 2.2, to fix hands etc?
>>
File: 1755473120320248.jpg (1.1 MB, 3628x1514)
1.1 MB
1.1 MB JPG
https://yejy53.github.io/RealGen/
poor them, they thought they solved realism but then Z-image turbo existed lol
>>
>>107487908
>bro please this new set of benchmark images is totally real and unbiased this time for real I swear
>>
>>107487662
>plz bro download my pickle, don't you want to test that out? my close up of 1girl and a fucking dog photo are enough to convince you
lol, lmao even
>>
>>107487908
just get us out of this lora spiral of death
>>
>>107487961
its sad how easily some are impressed
>>
>>107487961
I mean I doubt he suddenly decided to throw everything away for a chance to maybe get a virus on some gooner's pc
>>
>>107485323
>>107485413
>>107485492
You could make a CYOA game with this sort of stuff.

>>107486034
Click view all jobs and it doesn't hide away. The one thing they need is an option to have it stay open by default.

>>107485708
I dislike the assets panel. It's weird and buggy and deleting something through it doesn't delete the file so like what's the point. It's just a list of shit you've genned this session. The queue is basically the same thing but with tiny thumbnails. And like I can drag images from assets into the workflow but I can't drag videos. What the fuck is up with that?
>>
File: 1763348740377845.png (831 KB, 2248x609)
831 KB
831 KB PNG
>>107486843
>https://arxiv.org/abs/2511.13720
that part is interesting, since the hidden size is constant, you can go for high images and the speed stays the same
>>
>>107487961
just wait for the goofs nigga. The furfag has been dumping pickles for the entire duration of the training so idk what's the issue now.
>>
>>107487662
we love pickles around here don't we folks? love pickles and cucubers and such. great vegetables i always say.
>>
File: ComfyUI_temp_xdoia_00003_.jpg (694 KB, 1280x1920)
694 KB
694 KB JPG
>>
File: 1745711315356115.png (1.94 MB, 2016x1290)
1.94 MB
1.94 MB PNG
here's what we're training tonight
>>
>>107488079
getty images lora?
>>
>>107488086
yeah i need to refine the watermark for my gens
>>
>>107488079
What trainer is that?
>>
>>107488105
https://www.presize.io/
its for preparing datasets. you wont find anything better
>>
>>107488086
>average Civitai trainer
>>
>>107488160
>>107488160
>>107488160
>>107488160
>>
Do anon batch gen with zit?
>>
>>107488118
i was honestly surprised how hard it was to find high quality photographs of him.
>>
>>107487662
>pth
I know that since 2.6 pytorch only load tensors but seriously, why?

>>107488079
Run fluxfill on those watermarks
>>
>>107488198
>Run fluxfill on those watermarks
i'm captioning them and z-image knows what a getty watermark is. it's fine
>>
>>107487557
Not nearly as disingenuous as Alibaba themselves when releasing it. If you can't see what they were doing I'd get checked for retardation.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.