/g/ - Technology






My Works BTW Edition

Discussion and development of local image and video models

Prev: >>106571086

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: 1750734059266854.mp4 (1.51 MB, 832x480)
>>
File: AnimateDiff_00286.mp4 (2.42 MB, 1280x720)
>>
Blessed thread of frenship
>>
File: ComfyUI_00225_.png (2.74 MB, 1080x1328)
>>
What is the meta now for making realistic LoRAs and using that LoRA with a mix for anime images?

I always get uncanny valley images or generic anime people that don't resemble the character
>>
>>106575478
model!?!?
>>
File: ComfyUI_00229_.png (2.28 MB, 1080x1328)
>>106575564
I mentioned it before. It's Qwen.
>>
File: Chroma2k-test_00033_.jpg (571 KB, 1464x1608)
>>
File: ComfyUI_00234_.png (2.18 MB, 1080x1328)
>>106575587
there's always nothing to eat
>>
>>106575533
Pretty please
>>
File: ComfyUI_00314_.png (2.15 MB, 1024x1536)
>>
File: file.jpg (2.04 MB, 1360x2048)
>>
File: 00034-2834829260.jpg (192 KB, 1824x1248)
>>
>>106575656
what model? for flux I recall training a few anime character specific loras and then using a kodak film stock lora to bring out realism.
>>
File: comfy69.jpg (674 KB, 1280x1280)
>>
>>106575809
That's what I'm asking, what model do people use now, what model is good. I've heard that flux is very censored
>>
>>106575444
trips of truth, but asses are not thicc enough and women are not black and jiggly enough

too bad all the lora creators only care about PAWGs instead of just making good big asses for all races
>>
https://huggingface.co/bytedance-research/HuMo
Why is /ldg/ sleeping on this?
>>
>>106575755
clean
>>
>>106575961
because it's from bytedance, they suck, they always sucked, and they will always suck
>>
File: 250913-235149.mp4 (1.64 MB, 1024x1376)
I spent all day adapting to Wan2.2 3 steps workflow only to find out it doesn't work well with most nsfw loras.
>>
File: 00059-1218566832.jpg (216 KB, 1824x1248)
>>
Out of memory is OOM? The famous 'ooming'?
>>
>>106576047
yes
>>
>>
>>106575882
What model
>>
>>106575998
>they suck
how can they suck? that's the company that made Seedream 4.0
>>
do i need to crop the images into 1024x1024 for training or can i just throw them in as is?
>>
File: 00069-3089369207.jpg (145 KB, 1824x1248)
>>
>>106576010
i'm assuming you mean 3 samplers, and yeah that's because 3 sampler workflows are a retarded meme
>>
File: 1746210843876573.png (568 KB, 1018x1099)
>HunyuanImage and SRPO on the trending page
man... people have shit taste I swear to god
>>
>>106576142
all that's left is hobbyist redditors that must download the new thing. stagnant models, shitty uis, grift culture and a circle jerk community made this shit so fucking lame it's actually unbelievable considering you can just command a model to show you what you want.
>>
>>106576142
I don’t know if you’ve noticed anon but outside of 4chud it seems like the main consoomers of all this AI stuff are jeets and chinks. It’s not surprising they constantly prop slop up, they have no souls. Just look at the trending page on sora kek
>>
>>106576142
i bet the trending page is sponsored or gets kickbacks in some other way, just like all trending pages on all websites
>>
File: 1745681947403856.png (1.58 MB, 1024x1024)
>>106576142
Hunyuanimage is pure slop
SRPO is actually good for realism
I mean just look at this cute cato
>>
>>106576122
Crop
>>
>>106576185
it's fine on close up, it's when you want to go for full body shot that you can see it's worse on details than the original Flux dev, it's not as horrible as chroma when it comes to details, but it's bad
>>
>>106576122
No, basically every trainer will automatically crop the image to one of the supported aspect ratios

Things that are important though:

If you train at 1024 resolution, make sure each image has at least 1024x1024 total pixels (about 1.05 MP). Otherwise the trainer will likely upscale the images before training on them, which looks ugly and introduces artifacts.

The aspect-ratio cropping might not crop the way you prefer and can cut out parts of the image you wanted to keep. Manually cropping the images to the supported aspect ratios is ideal, but often just not practical when you are training on lots of images.
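
A rough way to pre-check a dataset against the two points above, sketched in Python. The bucket list here is hypothetical (every trainer builds its own), and `needs_upscale` / `nearest_bucket` are illustrative names, not functions from any trainer:

```python
# Hypothetical aspect-ratio buckets for a 1024-resolution run.
# Real trainers generate their own list; treat these as placeholders.
BUCKETS = [(1024, 1024), (896, 1152), (1152, 896), (832, 1216), (1216, 832)]


def needs_upscale(width, height, target=1024):
    """True if the image has fewer total pixels than target x target."""
    return width * height < target * target


def nearest_bucket(width, height, buckets=BUCKETS):
    """Return the bucket whose aspect ratio is closest to the image's."""
    ratio = width / height
    return min(buckets, key=lambda b: abs(b[0] / b[1] - ratio))


if __name__ == "__main__":
    # Image sizes are made up for the demo.
    for size in [(800, 1000), (2000, 3000), (1920, 1080)]:
        print(size, "upscale needed:", needs_upscale(*size),
              "bucket:", nearest_bucket(*size))
```

Anything flagged by `needs_upscale` is a candidate for replacing with a higher-resolution source, and the bucket it lands in hints at how much the trainer will have to trim.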
>>
File: 00079-63304773.jpg (342 KB, 1824x1248)
>>
>>106575755
v
>>
File: 1741312392543212.png (165 KB, 1184x619)
https://github.com/comfyanonymous/ComfyUI/pull/9682#issuecomment-3287541987
>Instead of adding a new node it might be best to just add the chroma radiance "vae" in the list of the "Load VAE" node
https://github.com/comfyanonymous/ComfyUI/pull/9682#issuecomment-3287560251
>Might be unintuitive for users though.
is Comfy fucking retarded?
>>
>>106576185
SRPO is only good for the new training method. It's still Flux.
>>
>>106576199
>>106576216
I see, that information helps. Also, is there a way to crop images in a batch (locally) like this site can?

https://www.birme.net/
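
For doing this locally without any UI, a minimal Python sketch of a batch center crop. It assumes Pillow is installed (`pip install pillow`); `center_crop_box` and `crop_folder` are made-up helper names, and a plain center crop will not follow faces or subjects the way a smart cropper does:

```python
from pathlib import Path


def center_crop_box(width, height, ratio_w=1, ratio_h=1):
    """Largest centered box with aspect ratio_w:ratio_h as (left, top, right, bottom)."""
    if width * ratio_h > height * ratio_w:
        # Image is too wide for the target ratio: trim the sides.
        new_w, new_h = height * ratio_w // ratio_h, height
    else:
        # Image is too tall (or already matches): trim top and bottom.
        new_w, new_h = width, width * ratio_h // ratio_w
    left = (width - new_w) // 2
    top = (height - new_h) // 2
    return (left, top, left + new_w, top + new_h)


def crop_folder(src, dst, ratio_w=1, ratio_h=1):
    """Center-crop every image in src to the given ratio and save it into dst."""
    from PIL import Image  # third-party dependency, imported lazily

    out_dir = Path(dst)
    out_dir.mkdir(parents=True, exist_ok=True)
    for path in sorted(Path(src).iterdir()):
        if path.suffix.lower() not in {".png", ".jpg", ".jpeg", ".webp"}:
            continue
        with Image.open(path) as img:
            box = center_crop_box(img.width, img.height, ratio_w, ratio_h)
            img.crop(box).save(out_dir / path.name)
```

Usage would be something like `crop_folder("raw", "cropped", 2, 3)` for 2:3 portraits; the folder names are placeholders.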
>>
>>106576142
>lets not try cutting edge models guys!
>SDXL sucks, lets keep using SD 1.5!
>>
>>106576372
>nooo, I don't believe those hundreds of images on the internet showing plastic humans, I have to download that 20gb model and see it by myself!!
what mental illness is this?
>>
>>106576238
he isn't a front end developer at all
>>
>>106576349
reforge is the simplest, it's in the Extras tab. It even has face recognition and you can get a plug-in to use an LLM for auto captioning.
>>
>>106576404
I see the option, thanks again. Very helpful.
>>
Cozy Saturday
>>
>>106575437
Anyone got a catbox version of this? >>106574719
>>
File: ComfyUI_temp_zqapt_00002_.jpg (1.77 MB, 1600x2096)
For chroma training what to pick?
adam/prodigy?
batch 1/2?
150/100 epochs?
Also can you pick a resolution that isn't just either 512 or 1024 but something in between?
>>
What models are you using for realistic characters? Is flux still censored?
>>
File: 65672577.mp4 (3.65 MB, 1216x960)
>>106576010
>>106576131
I did that yesterday and also gave up on it.
>>
>>106576529
Chroma Base
>>
if you fell for the 3 sampler meme you should honestly give up.
>>
File: bruh.png (602 KB, 1280x720)
https://flux-reason-6m.github.io/
>We introduce FLUX-Reason-6M and PRISM-Bench. FLUX-Reason-6M is a 6-million-scale synthesized dataset
>>
>>106576391
Are you fucking with me? Go try SDXL 1.0 base with refiner. It is one of the most sterile plastic models ever. Hunyuan 2.1 may have issues but I've noticed it adheres to prompts better and is less censored than Qwen Image. I think once the refiner issues are sorted out and a couple of LoRAs come out it could be great in a month or so. It sucks at text, but so does Qwen, which seems to just paste it over everything in a sterile way.
>>
this field is being run by complete amateurs. I'm fucking speechless how fucking awful the production pipeline is for all these open source companies. fucking hell WHY!!!!!??!!?! WHY DID YOU SUPPORT THESE FUCKING NIGGERS!!!!!
>>
>>
>>106576517
All those can work
>>
>>106576529
Chroma is the best
>>
Is sage attention plug and play? I tried adding it, the gen time got even longer
>>
>>106576529
chroma, not because I love it, but it's the only one local that accepts natural language and is uncensored
there is nothing else
>>
File: AnimateDiff_00290.mp4 (2.15 MB, 1280x720)
>>
>>106576588
I don't know for windows, but in linux it is plug and play kinda, and sage attention 2 is a big boost in speed on my workflows for free so it's worth the effort.
>>
>>
File: Seedream My Beloved.jpg (2.78 MB, 2868x3824)
>>106576529
>What models are you using for realistic characters?
nothing, chroma can have good skin texture but its details and anatomy are fucked, my standards are now uncensored seedream 4.0, and I hope I won't die of old age before getting something like that
>>
>>106576216
>If you train on 1024 resolution, make sure the images are at least 1024x1024 in total pixel space
nta, what if the images are 2048x2048, do they get accepted as 1:1 or do they still get a 1024x1024 chunk cropped out of it by the trainer?
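
The arithmetic behind this, as a hedged Python sketch. It assumes the common resize-then-trim bucketing behaviour (scale the image until it covers the bucket, then trim the overhang), which is an assumption about trainers in general, so check your own trainer's docs. `fit_to_bucket` is a hypothetical helper:

```python
def fit_to_bucket(width, height, bucket_w=1024, bucket_h=1024):
    """Scale the image so it covers the bucket, then report what gets trimmed.

    Returns (scale, (scaled_w, scaled_h), (trim_w, trim_h)).
    Assumes resize-then-trim bucketing; not every trainer works this way.
    """
    scale = max(bucket_w / width, bucket_h / height)
    scaled_w, scaled_h = round(width * scale), round(height * scale)
    return scale, (scaled_w, scaled_h), (scaled_w - bucket_w, scaled_h - bucket_h)


if __name__ == "__main__":
    print(fit_to_bucket(2048, 2048))  # square source into a square bucket
    print(fit_to_bucket(3000, 2000))  # 3:2 source forced into a square bucket
```

Under that assumption a 2048x2048 image is simply downscaled by half to 1024x1024 with nothing trimmed, while a 3000x2000 image pushed into a square bucket loses 512 px of width after scaling.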
>>
>>106576615
Seedream 4 is insane. It's crazy to think we'll have local models this good in 10 years
>>
>>106576608
which model are we talking? big boys model or SDXL model?
>>
File: 2406631319.jpg (3.41 MB, 2048x2048)
>>106576529
Flux-krea, and yes it is still censored.
>>
>>106576625
just enough time to ensure they're extra safe
>>
>>106576627
wan2.2/720p
>>
I'm done with video gen for the moment, 3-5 seconds video doesn't do it for me anymore.
Gonna wait some years till they figure out actual ways to have a big context window and unlimited length gens.
>>
>>106576615
lil jeet didnt gen a single actual image with the model he is shilling for days on end, brutal
>>
File: the goat.jpg (525 KB, 1536x2048)
>>106576665
I did though
>>
>>106576654
not sure why you needed to announce this
>>
>>106576665
He's not a jeet he's a white Californian NEET, stop taking the bait
>>
>>106576677
Not sure what you proved other than seedream sucks at text lol
>>
>>106576686
>"sucks at text"
>can make perfectly fine kanjis at very small resolution
lmao, there's no other model that can do this, nice bait though, I give it a 7/10
>>
>>106576692
1.5 could give random kanji, you should know ffaze
>>
>>106576677
That's impressive, what the fuck?
>>
SwarmUI anons
Anyone know how to set the denoise/"creativity" values in the refining process?
>>
>>106576692
Lol my friend learn japanese first
>>
File: AnimateDiff_00292.mp4 (1.06 MB, 1280x720)
>>
Glad to see fellow API node users. Seedream absolutely BTFO BFL. China won the API war
>>
File: 1727669568347518.png (367 KB, 479x720)
>>106576715
>m'lady
>>
>>106576709
Please stop arguing with the low IQ neet, he will always find some inane thing to bitch about because he dedicates his life to being a human gnat
>>
>>106576709
>>106576734
>samefag
>>
>>106576717
I keep seeing seedream gens pop up here, and over /sdg/ they’ve got UIs and tutorials for running local. Is it just one schizo pumping all this out? Pretty hilarious.
>>
>>106576734
>human gnat
surprisingly accurate, and now I can't help but think he's maybe the american cousin of that cockroach furk
>>
>>106576717
Yeah, Sneedream is supreme.
>>
>>106576745
>and over /sdg/ they’ve got UIs and tutorials for running local.
ok debo, time to go back in there
>>
>>106576750
We have what we have in the OP for a reason, him and his crew of retards have been defeated and rejected so while their thread languishes into nothing outside of two dedicated schizos posting for hours saying GM they come here to start shit.
The less attention you give the more it hurts them too.
>>
>>106576745
/sdg/ has local news, local tutorials, chroma posters and some are genning anime
>>
>>106576717
>China won the API war
it won the local war as well, Wan 2.2 and Qwen Image are the best local models so far
>>
>>106576745
>>106576782
>/sdg/ shilling on my /ldg/
YOU NEED TO GO BACK.
>>
>>106576783
they didn't win the safety war though, you can generate titties on qwen
>>
>106576782
See what I mean
Every time like clockwork he does this and his other few friends take turns. Just ignore him and watch him shit his pants and go nuclear like he tried a few weeks ago with ani.
>>
China please, defeat OpenAI and Claude
>>
>>106576795
/ldg/?
And that api gens, anon?
>>106576798
Kek

You are not true local and Im not that schizo that you think
>>
>>106576799
they can't because they keep training off their outputs
>>
File: god bless the chinks.png (431 KB, 800x582)
>>106576796
>they didn't win the safety war though
based Chinks
>>
Stop replying to him he's a waste of space
>>
>>106576819
>they keep training off their outputs
how do you explain Seedream then? they managed to beat everyone else on t2i
>>
Comfy status?
Did someone drag him into the street and shoot him?
>>
t...titties!?? Guys I feel unsafe
>>
>>106576839
Busting autistic loads in Chinese pussy while Ani beats off in the corner crying
>>
>>106576838
seedream is not an llm retard

>>106576839
he shot himself
>>
>>106576833
Meds
You will never be local, the api gens here endorse my words
>>
File: 00073-161251504.png (246 KB, 400x400)
>>
>>106576869
comfy's girlfriend looks cute
>>
>>106576857
>seedream is not an llm retard
what does that have to do with anything? if they managed to be on top with image models, they can do the same with llms, are you this dumb or something?
>>
>>106576878
ESL retard
>>
>>106576883
Concession Accepted.
>>
>>106576878
Claude and gpt are llms, if you want to beat them you have to make a better llm. seedream is still behind whatever Google cooks up anyways because they are the only company that understands garbage in, garbage out
>>
>>106576630
https://files.catbox.moe/uvxo7s.mp4
>>
>>106576892
>seedream is still behind whatever Google cooks up anyways
fair, I also believe Google has monster models in their lab, they just don't release them because they can still destroy the competition with their B-tier model
>>
>anon spamming sneed in the non sneed thread
>"well this must be the sneed thread"
>>
>>106576464
It was until the post below >>106576612
>>
i want to train a chroma lora.
i'm not sure what to train, as i can gen most of what i want already.

taking requests, will train if it's not retarded or hard to caption.
>>
File: 1731714488286566.png (127 KB, 380x332)
anon please use Seedream version 4.0 anon, it's the most realistic model anon, candid "actual" realism doesn't matter, contrast boosted incremental improvement of Flux Krea is all you need anon you have to trust me, please use the model Seedream 4.0 at replicate dot com slash bytedance or my family will be homeless anon... you generating images on your own local hardware is economic terrorism anon... please... look at this brown arab promotional image generated by the image model Seedream 4.0 again, just one more time anon... i dont want to have to post it again the next entire week again anon, i need this... please.... please dont talk about censorship i cant reply to that comment just go to the website and use it anon... use Seedream 4.0 generative ai image model.
>>
File: AnimateDiff_00296.mp4 (1.22 MB, 1280x720)
>>
>>106576981
>pircel
So unrealistic, Seedream could make a more realistic man than this!
>>
>>106577000
that's quite impressive, Wan is such a great model
>>
>>106576981
Kek
>>
>>106576981
okay fine i will try krea
>>
>>106576010
try using default resolution - it's the only way I get good results with Lora

>>106576185
It's a real cat!
>>
I'm so lost on the best sampler for chroma
>>
File: Captcha.png (5 KB, 298x80)
This threw me for a loop.
>>
File: WanVid_00012.webm (781 KB, 560x800)
never fear
>>
File: no more lies.jpg (53 KB, 600x600)
I've heard GGUF files are not safe to use unlike safetensors.
Is that true?
>>
File: 12613271272.jpg (3.67 MB, 4096x4096)
>>
File: 1730078242119399.png (2.58 MB, 1328x1328)
>>
>>106577155
how do they manage to render such high resolution though? I know they have monster GPUs but still, 4096x4096 is tough to do, attention is quadratic
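
To put a number on "attention is quadratic", a back-of-envelope Python sketch. It assumes an 8x VAE downscale and 2x2 latent patches, which is typical of Flux-style DiTs but is only a guess for the model in question; both function names are illustrative:

```python
def token_count(width, height, vae_downscale=8, patch=2):
    """Number of latent patches (tokens) for an image.

    vae_downscale and patch are assumed values, not known specs of any model.
    """
    step = vae_downscale * patch
    return (width // step) * (height // step)


def attention_pairs(tokens):
    """Full self-attention compares every token with every token."""
    return tokens * tokens


if __name__ == "__main__":
    small = token_count(1024, 1024)
    large = token_count(4096, 4096)
    print("tokens:", small, "->", large, f"({large // small}x)")
    print("attention pairs:", f"{attention_pairs(large) // attention_pairs(small)}x")
```

Under those assumptions, going from 1024x1024 to 4096x4096 means 16x the tokens and 256x the attention pairs per layer, which is why native 4K generation is expensive even on big hardware.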
>>
>>106577183
secret chinese magic
>>
*yawn*
>>
>>106577140
Chinese snake oil paper
>>
I kinda wanna try out video generation but I only got 8 gigs of VRAM
Should I even bother or is it gonna look like total ass?
>>
>>106577186
>secret chinese magic
I think they released the paper though, they didn't give us the model but they told us how they achieved this
https://arxiv.org/abs/2509.08826
>>
>>106577206
>Crucially, we resolve the persistent challenge of "reward hacking": Our large-scale RMs exhibit and maintain high reward variance during RL fine-tuning
that's funny, they also talked about defeating the reward hacking on SRPO
>>
>>106577183
That's not really 4K. Just a cheaply upscaled 1k image. Case in point
>>106568389
The API shill made it quite obvious. The model isn't even capable of proper 2k.

Chroma is more groundbreaking than what they have.
>>
>>106577219
>The model isn't even capable of proper 2k.
>6144x2634
that's 2k no?
>>
holy fucking localcope
>>
>>
File: 1742341438885419.png (2.5 MB, 1328x1328)
>>
>c-chroma can do it too!
embarrassing
>>
>>106577219
Also their model is not really realistic. Flux base is still stronger, which is why Chroma/Krea run laps around it. They are training strictly on SDXL/3.5 based images.
>>
>>106577091
Heun has the best quality from my experience but I haven't tried all the samplers.
>>
File: 1728148710009182.png (2.57 MB, 1328x1328)
>>106577234
>>
>>106577251
>Flux base is still stronger, which is why Chroma/Krea run laps around it
Krea was finetuned on the undistilled Flux (which we don't have)
>>
I wish I could train Karomah in a reasonable amount of time :-(
>>
>sub 2k res
poorfags gtfo
>>
>>106577246
>shits on Chroma
>>106577258
>shits on Seedream
the enlightened centrist
>>
Childish tantrums are sad to see.
>>
>>106577242
We need you in /adt/ please 1 gen, only one will be sufficient
>>
>>
>>106577219
>Chroma is more groundbreaking than what they have.
This

The /saasdg/ shills are working overtime in the /ldg/ threads of late, pure desperation

Nobody wants the padded cell SAAS AI, it's the ultimate cucking made for limp dicks and trannies
>>
This is how cloudfags cope?
>>
>>106577273
>>106577281
I hate that these faggots are advertising in a LOCAL general, but the chroma cope is kinda cap
>>
chromacope is the exact same thing sdtards did with SDXL vs Dall-E 3. it's freetardation at its finest
>>
>>106577286
>the chroma cope is kinda cap
they've been coping since March, they're fucking insufferable, can't wait to see an actual good local model released so that it makes this Chroma shit completely irrelevant
>>
>>106577135
sloppa hooker is here?
>>
>>106577255
I'm really warming up to DPM++ 2M for the speed and quality balance. I do agree with you that Heun is great, but it is slow and requires higher step counts (60+) for good quality with Chroma imo.
>>106577278
You can deal with whatever without me
>>106577281
You're dealing with a schizo, he has no plans to stop doing this if not this it will be another thing.
>>
>>106577292
qwen is decent for local, it has great prompt following that is better than seedream 3.0 from my testing. it would be a way better base than the schnellshit chroma is using now
>>
please stop posting seedream gens. im tired of witnessing local’s defeat. let us just gen in 1k ignorance
>>
>>106577301
for one nanosecond I considered making her a theme tune on ace, but fuck that
>>
>>106577259
>Krea was finetuned on the undistilled Flux (which we don't have)

Krea itself is still a distillation of its actual model.
>>
>>106577291
someone was coping yesterday claiming SDXL beats qwen image for shitposting purposes. People are this delusional
>>
>>106577303
the issue with Qwen is that it's a 20b model, it took lodestone 6 months to finetune a 9b model so...
>>106577309
yeah, we got Krea dev, BFL is not dumb, we'll only get distilled shit from them
>>
another day without loras for qwen nunchaku
without wan nunchaku
without loras for wan nunchaku
>>
Local celeb NSFW is so good bros
>>>/b/939756129
>>
File: 00096-2399848765.png (2.05 MB, 896x1152)
>>
You guys! Yes you guys!
Go fuck yourselves for wanting and desiring a local general that covers everything.
Don't you want that to be the case?
'Yes here we love local api nodes'
'Yes here we love local anime'
'Yes here we love local videos'
'Yes here we love local audio'
'Yes here we love local text to speech'

You greedy cunts deserve the mess you got. Stop being such resource whores and pick a lane. Fuck you, you deserve it for being gluttons.
>>
>>106577321
what the fuck are the chinks doing right now. my dopamine is burning with the speed im genning at now with both the normal and edit models, where the fuck is the lora support?
I want to do the same with WAN, FUCKING CHINKS WHAT THE FUCK ARTE YOU DOIGNGG
>>
>>106577195
Which paper, and how do I know?
>>
>>106577302
>You're dealing with a schizo, he has no plans to stop doing this if not this it will be another thing.
always talking about yourself kek
>>
>>106577328
thanks for reminding me that current celebs look like shit nowadays
>>
>>106577338
>what the fuck are the chinks doing right now.
they're busy working on their new API models
>>
>>106577328
>Genning with cloudshit and inpainting
Like actual jeets.
>>
proprietary cuckies having an absolute melti
>>
>>106577292
You are never getting a better model unless another rich schizo funds it.
>>
>>106577321
>>106577338
Remember they wanted to make an SVDQuant of Wan 2.1. Not only did they not do it, but now they have twice the work with 2.2.
It's been months, and nothing, outside of releasing qwen but with no lora support which makes it kind of useless for nsfw.
Huge disappointment for me.
>>
>>106577357
No that OP is entirely local
>>
>>106577286
Chroma is easily the overall best local model at the moment, unless you are satisfied with censored slop, at which point you might as well use SAAS

Great composition variation, great detail in skin, hair, fabrics, artstyles

Easy and fast to train, much faster than Flux and MUCH faster than Qwen / Wan, the training results are also of great quality

Loras combine extremely well, particularly NSFW which makes sense since Chroma is uncensored and understands all NSFW concepts meaning you don't have to fight the model
>>
>>106577328
I'll never understand the fascination with celebs, they're not even looking that good compared to a random instagram pretty girl. Especially since they decided that being pretty wasn't required anymore to be in the movies.
>>
File: 1736292795245621.png (2.89 MB, 1328x1328)
>>106577373
>
>>
File: SHUT THE FUCK UP.png (151 KB, 500x268)
>>106577292
>they're fucking insufferable
case in point, they always need to write bibles to justify that their turd smell good, nigga, we have eyes, we know what looks good or not >>106577373
>>
>>106577291
Dalle 3 was actually groundbreaking. Chroma was the only model that managed to catch up to it (because everything that was thrown at Dalle realism wise, and yes, I have confirmed, there's an archive, Chroma could do, but better).
>>
>even the fursuit is plastic
lol
>>
Alright, after actually trying out Chroma, I'm concluding that the hype only exists if you're able to train on it since it knows more uncensored things. Otherwise, it's kinda meh.
>>
File: 1750557875863399.png (2.55 MB, 1328x1328)
>>106577391
>
>>
>>106576612
is that Luz?
>>
File: arrow.png (10 KB, 299x228)
Are GGUF files safe to use or not?
how can I know?
>>
>>106577397
everyone with eyes confirmed this 4 months ago. the only people using it are blurryanalog boomersloppers
>>
>>106577405
safe? what the fuck do you mean retard? go jeet in the cloud threads
>>
these are the cloud threads, it's in the OP
>>
File: the sovl is gone.png (686 KB, 1080x524)
>>106577409
>the only people using it are blurryanalog boomersloppers
kek, I was one of those guys, I left the ship after v30, he killed that model when he decided to go for that low step method trash
>>
I was able to get my time down to 5 minutes
>>106577405
I have no idea I try to keep to one thing at a time and chroma is my focus
>>
at least we can all agree base models are superior to slopmixes
>>
>>106577410
I've heard they're like ckpt files. that they can execute malicious stuff like ckpt files.
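
For context: classic `.ckpt` files are Python pickles, and unpickling can run arbitrary code. GGUF and safetensors are data-only containers by design (a fixed header plus raw tensor data), so loading them does not involve executing anything stored in the file, though a buggy parser is still a theoretical risk and the download source still matters. A heuristic Python sketch for checking what a file actually is from its first bytes; `sniff_model_format` is a made-up helper and this is not a security scanner:

```python
import struct


def sniff_model_format(path):
    """Guess a model file's container format from its first bytes (heuristic only)."""
    with open(path, "rb") as f:
        head = f.read(9)
    if head[:4] == b"GGUF":
        # GGUF starts with the ASCII magic "GGUF" followed by a version number.
        return "gguf"
    if len(head) == 9 and head[8:9] == b"{":
        # safetensors: little-endian u64 header length, then a JSON header.
        header_len = struct.unpack("<Q", head[:8])[0]
        if 0 < header_len < 100_000_000:
            return "safetensors"
    if head[:2] == b"PK" or head[:1] == b"\x80":
        # Zip-wrapped torch checkpoint or raw pickle: can run code when loaded.
        return "pickle"
    return "unknown"
```

If a file named `.gguf` or `.safetensors` comes back as `pickle`, that is a reason not to load it.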
>>
File: Chroma-out-0000.jpg (1.56 MB, 3072x2048)
>>106577383
It's not Chroma's fault that you can't prompt for shit

I'm thinking AI image generation just isn't for you, promptlet
>>
File: 1757730954977680.jpg (3.47 MB, 2325x4096)
Why is this anime website getting spammed with 3DPD garbage? Take your realistic gens back to /aco/ or wherever you came from and stop shitting up the thread. This is supposed to be an anime website, not your photorealistic slop.
>>
>>106577427
>5 minutes
any quality loss tho?
>>
>>106577438
>look at my 90s looking 1girls!!
dude literally go rope
>>
>>106577438
Omg it can do close up of women? THATS INSANE, THE REVOLUTION IS HERE
>>
>>106577451
Not really, it actually behaves pretty good at like 30-50 and slops itself at higher unlike the other samplers
>>
>>106577439
there's a thread for tranime fags like you, go back >>106575014
>>
File: rb.png (2.21 MB, 995x847)
>>106577427
Bro does your dynamic thresholding have any advanced settings? I keep getting rainbow artifacts even on a 60-step second pass.
>>
>>106577438
It is bait. Notice how he and other "cloudshills" are only posting nonstop art and not "realism". They are still dealing with plastic skin. They know Chroma mogs.
>>
>>106577467
Try banding, color banding, in negatives, I threw that in last night and I stopped seeing that issue so much.
>>
>>106577473
>It is bait.
says the baiter, have a (You)
>>
>>106577409
Yeah but I had to confirm, and I don't want to train all the shit I want on it just because it's some special fork of Flux.
I'll probably do one passthrough on some obscure shit and if it's better then maybe ... MAYBE ... otherwise nah
>>
Flux is unsalvageable. If Flux was actually usable for finetuning, Krea wouldn't have had to ask BFL for the raw model, they would've just used Dev.
>>
>>106577409
and let's not forget the fact he promised Chroma would have artist tags, it didn't happen, he lied to us (or he got brainwashed by the pony fag to not include them)
>>
>>106577467
Surely it can't be that easy? Because that looks like the sampler shitting itself from the massive CFG
>>
>>106576964
i wish i had a more interesting suggestion for you other than like artists or celebs kek
>>
>>106577480
>Mogged at anime (thanks to Noob, Neta, etc...)
>Mogged at realism (thanks to Chroma, Krea)

>Somewhat ahead in art, not by much (thanks to Qwen)
That's the state of cloudshit.
>>
>>106576964
Literally anything you want lol. I'm doing a diaper package lora.
>>
Are GGUF files safe or not?
>>
>>106577503
>Mogged at realism (thanks to Chroma, Krea)
oh he's funny
>Qwen does good art
LMAO WHAT???
>>
>>106577507
Why do you keep saying that every day?
>>
>>106577510
Don't download them from pakistani websites, retard.
>>
File: lol.png (388 KB, 800x551)
>>106577519
>pakistani websites
wait they have internet?
>>
>>106577452
This level of desperation, meanwhile everything posted by SAAS and Qwen in these threads are nothing but slop, (((safe))) boring slop

No wonder you have to go to /ldg/ and shill, who the hell would use this slopped shit for entertainment values ? You can't do anything remotely edgy, and certainly nothing even bordering on NSFW, want to do style X, 'sorry, we didn't train on style X, and even if we did we didn't caption it in any way so it's impossible to specify it'

SAAS is a dead end for image / video generation since it will always be cucked and censored
>>
>>106577400
no anon, it would be weird for a 25yo man to watch 2020 cartoon show.
>>
>>106577510
it's a grift to get retards to download the official repo models so they look better. it also forces the vramlet to have poor quality precision so they are more inclined to pay for a new card. it's all marketing bullshit
>>
>>
>>106577511
The Qwen Chinks trained their model on all those aesthetic Seedream 3 images. That's why it does good art.
>>
File: 1617773156729.png (160 KB, 500x374)
>>106577531
>>
>>106577473
Yeah I know, they complain of 1girl, yet only post 1girl, but slopped censored and plastic

But they're desperate, nobody who is interested in AI image / video generation gives a shit about SAAS
>>
>>106577527
why do you always have to write all those walls of text? are you getting paid by lodestone or something? you seem way too motivated to protect this turd ngl
>>
>>106577531
Is it more difficult to make a quantized safetensors model file than a GGUF?
>>
File: 1750087946623521.png (1.71 MB, 1328x1328)
>>106577527
can chroma do this in 8 seconds?
>>
>>106575437
I wish you guys would stop arguing with him, we know he's just trying to piss you off with off topic shit
>>
>>106577541
sounds like a gooner deluded into thinking the industry revolves around nsfw
weird little bubble he lives in
>>
>>106577463
That place is full to the brim with pedos.
>>
Also what's the go-to Chroma trainer for someone with 32GB of VRAM?
>>
>>106577542
>quantized safetensors
they don't do this. they just want you to get the base model so they don't have to do shit because they are lazy python chinks
>>
>>106577540
>nobody who is interested in AI image / video generation gives a shit about SAAS
https://getlatka.com/companies/midjourney#:~:text=Midjourney%20Revenue,2023%2C%20%2450M%20in%202022.
>>
>>106577550
OneTrainer worked with my 5090 out of the box. It wasn't possible a month ago, but they fixed it.
>>
>>106577463
NTA but this is an anime board that belongs to an anime website and you realistic slop posters are just visitors here. Don't get bratty about it since you know where you actually belong.
You have to go back.
>>
>>106577545
It gets fuzzy with samefagging. Also lots of newfriends fall prey.
>>
>>106577550
>32GB of VRAM?
OneTrainer has presets for multiple VRAM sizes.
>>
>>106577558
Don't have a 5090, but thanks
>>
>>106577541
You don't have to read them

Since you have problems with words, when you see many of them, just scroll past them, there, a life hack for retards like you
>>
>>106577553
shouldn't chroma have quantized safetensors?
>>
>>106577547
Says the SAAS shill desperately posting in /ldg/

...
>>
File: retard.jpg (3.67 MB, 6144x2634)
>>106577580
you're not gonna convince anyone with your word salad, what's your motive? this is about images, if you want to show how "superior" chroma is, make comparisons you fucking moron
>>
>>106577591
>singular shill theory
>>
>>106577594
Yes cloud uses synthetic datasets. We know.
>>
>>106577591
>noooo, how dare they make comparisons between local and API, NIGERMANN SAVE ME!!
that's what they do all the time on /ldg/, I don't see what's the big deal, you want to see how close you are to the best models, not putting your head inside the sand
>>
>>106577378
>I'll never understand the fascination with celebs
celeb worship is associated with lower intelligence. you don't have the brainwaves required to understand it

>they're not even looking that good compared to a random instagram pretty girl.
fwiw, "celeb" threads usually included internet famous girls too

>Especially since they decided that being pretty wasn't required anymore to be in the movies.
it's not about that. it's about the fact that they're in the movies. it's about the fact that they have status and you're degrading them etc.

for me, celebs are exactly the same as anime characters since they're just as "real". if you understand anime waifus you can understand celebrity worship
>>
>>106577613
>>106577591
*/lmg/
>>
>putting your head inside the sand
not beating the jeet allegations
>>
>>106577438
can you share the rest of the celebs? on gofile or whatever
>>
>>106577581
nobody fucking cared before about .pt format able to be compromised, nobody cares now about gguf exploits. if anything I wouldn't trust anything with nom in it since it's just getting fucked by exploits. python is also shit at being secure
>>
File: sad.png (45 KB, 302x167)
>>106577438
it looks like plastic dolls of celebs, not actual humans, Chroma lost its edge on the skin texture, now it looks like any other random slop model
>>
>>106577594
Was this comparison made to 'prove' that all models tested here can produce shit ? Because these images all look like shit.
>>
>all it took was a single seedream gen to send chromakeks into a meltdown
i guess that's the difference between 512 and 4096
>>
>>106577594
Damn seedream is slopped to hell, midjourney boomer tier opinionated gen lmao
>>
>>106577613
If you want to see SAAS models, you go to /sdg/

Nobody goes to /sdg/ so the SAAS shills come to /ldg/ which is for local models only

Go back to /saasdg/ you pathetic loser
>>
>single seedream gen i mean the multiday shill campaign
>>
>>106577663
>>106577670
>Damn seedream is slopped to hell
nope >>106576615
>>
>>106577634
All these celebs were shared in the previous thread, with prompts etc

I'm uploading some more later today, need to check which epochs are the best first, as in the sweet spot right before the lora becomes overtrained, so that it has the best likeness while it also remains flexible
>>
>>106577640
I'm more interested in why people are converting to compromisable GGUFs if they can convert to safetensors instead.
>>
>>106577691
>3587x3587
is this an upscale or can Seedream really make 3k images?
>>
>>106577680
>>106576615

Those look even worse, seedream is so slopped and censored it literally cannot generate anything other than corpo product photoshoot overprocessed images like Krea does and can't make an actual candid realistic image to save its life lol
>>
Not even pixshartfags were this annoying
>>
>>106577650
These were combined with a Miles Aldridge lora, his style IS making people look like barbie dolls

Do a web search if you know how, which I somehow doubt
>>
>>106577699
it's cropped from 4k
>>
>>106577691
The implication being it can't actually do amateur photography only professional styled shots that are literally gaussian blurred
>>
>>106577726
>only professional styled shots that are literally gaussian blurred
>blurred
that one has no blur though >>106577594
>>
>>106577700
Well, they're trained initially on stock imagery, and then they generate generalised images from the model trained on stock imagery, and train on those

More generalised and more product placement with each iteration

At least it's not as bad as Flux dev, which was ONLY trained on synthetic data, that's the most plastic model ever
>>
sounds like headcanon copium
>>
>>106577736
And it's more professional than amateur lol like a pic from a magazine or company instagram post
>>
File: 1734799334750104.png (658 KB, 1080x412)
>>106577746
I also love candid shots, but modern chroma has lost its edge there... I much prefer its earlier versions for candid shots
>>
>>106577754
>I much prefer its earlier versions for candid shots
Regardless, sneed can't into it
>>
How am I supposed to use chroma 2k? Can I gen natively at 2k by 2k as the name implies? Seems awful when I do. Should I just use the official chroma workflow?
>>
>>106577759
that's fair
>>
>>106577765
It's a 2 Mpx pixel count, not 2k
>>
I like chroma, but it's worse at sex positions and anatomy than fucking Pony, and Noob makes it look like a joke in that regard. Even Neta Lumina, half baked, doesn't have this problem.

Want missionary sex?
>Noob: missionary position, sex, vaginal, penis, pussy. Bam.
>Chroma: use a VLLM to generate a prompt from an existing picture because the model STILL doesn't comprehend basic fucking tags, you might get missionary sex if you're lucky
It's almost as if the furry removed all tags from the training data and trained solely on LLM-generated slop captions.
>>
how bad is the quality loss with fp16_accumulation enabled?
>>
>basic fucking tags
retard
>>
>>106577794
This option does nothing on 5000 cards.
It seems slightly worse, judging by earlier gens.
>>
>>106577804
"slightly"? have a comparison?
>>
File: chromacopee.png (81 KB, 1061x826)
>>106577787
we pointed this out as well, but chromaschizo pretended not to notice. chroma is a mess of a model, it's unusable for anything more than blurry analog 1girls (which is why it's all chromaschizo generates)
>>
>>106577780
What the hell. Terrible name
>>
>>106577787
Pony and Noob are large finetunes with TONS of sexual positions.

Chroma is a base model, like SDXL, Flux, and Qwen. As such it's not specifically focused on anything but knows some of practically everything, which means a finetune or even a lora trained with a focus will improve it.

Doing an apples to oranges comparison is pointless.
>>
File: G0wipyRWQAA81bq.jpg (327 KB, 2048x2048)
>>
>>106577787
>It's almost as if the furry removed all tags from the training data and trained solely on LLM-generated slop captions.
he did, he used Gemini to caption his images
>>
>>106577809
>we pointed this out as well
lel, the voices in your head are still only you
>>
>>106577809
>going by epochs instead of steps
Retard alert
>>
>>106577820
>pony and noob are large finetunes with TONS of sexual positions
Wasn't the point of Chroma to make it uncensored?
Jeez, what are the odds. BOTH boomer-prompt furry models failed.
>>
>>106577831
the dataset size of chroma and every sdxl finetune (noob, illust) is the same (5m images). do your research first, tourist.
>>
File: file.jpg (3.89 MB, 4096x4096)
>>106577754
lol what was your prompt for that, kinda wanna see what seedream puts out
>>
>>106577809
schizos were right about chroma. its a failure
>>
>>106577844
>SDXL learns styles in under 10 epochs
Is what I was referencing faggot doomer
>>
File: G0weeXRa0AAuzQM.jpg (587 KB, 2048x1536)
>>
>>106577852
>schizos were right about chroma. its a failure
this, our only cope now is Qwen image with SPRO to remove the slop and maybe we'll get something decent that'll motivate a rich coomer to make it NSFW
>>
>>106577859
Sexy.
Where's the anon training a model on his two consumer gpus? I need a status update.
>>
>>106577845
A candid image taken using a disposable camera. The image has a vintage 90s aesthetic, grainy with minor blurring. Colors appear slightly muted or overexposed in some areas. It is depicting:

A woman dressed as Megumin is looking at the viewer and holds a real-life version of Pikachu with her hands, Megumin is standing on a floating levitating Hoverboard, sunset, lots of neons, Tokyo. On the background there's multiple marble statues of Hatsune Miku, and people are gathering to pray towards those statues
>>
File: G0uNkN7XAAAX0xH.jpg (3.91 MB, 3072x4096)
>>
File: 1757065053963247.jpg (702 KB, 1920x1080)
Are there any good realistic models that don't try to force every chick to have her boobs out?
>>
>>106577843
>Wasn't the point of Chroma to make it uncensored?
De-distilled and uncensored

It is de-distilled and uncensored

The point was not to make a model with a focus on porn, but a base model that didn't censor porn

You don't seem to understand what a base model is, it's a model made to have as little bias as possible and know of as many concepts as possible so it can be used as a BASE for further finetuning
>>
>>106577845
it looks good but the motion blur seems a bit exaggerated
>>
NEW
>>106577883
NEW
>>106577883
NEW
>>106577883
>>
File: 1734384852716937.gif (1.86 MB, 228x170)
>day 2 of trying to train flux model
>try onetrainer
>errors everywhere
>try googling and ai search for solutions
>follow everything slowly step by step
>still doesnt work
>try flux trainer comfy
>comfy bitching about flux dev fp8 (the one it recommends to use)
>doesnt work and try flux krea
>works
>freezes and locks computer
>try fluxgym
>error about metadata

dont have time for this nonsense, tempted to just use a runpod at this point

>urrr skill issu

i dont care
>>
>>106577885
>The point was not to make a model with a focus on porn, but a base model that didn't censor porn
>You don't seem to understand what a base model is
If you have to finetune a very common and simple concept that should exist with no issues, then it fails as a base model.
>>
>>106577804
>This option does nothing on 5000 cards.
Incorrect. I get noticeably increased generation speed with it enabled on my 5090 using fp16 models. Perhaps it's just vramlets that use lower precision models that don't see any improvement.
>>
wait i might know why my gens are slower now
if this is the reason why I might be the biggest retard

so i gen on Q8 but use the fp16 t5xxl since it's better, but i got psyopped into turning off --fast because people said it would affect the quality of the gen, but it obviously was just speeding up my text encoder processing and not affecting my GGUFs at all right?
>>
File: w.mp4 (2.1 MB, 640x1120)
>>106574064
yes

>>106574201
art
>>
>>106576185
cattus* Learn how to spell in latin anglo scum
>>
>>106576349
I made a tool for it but don't wanna dox myself lol.
>>
>>106578470
>I made a tool for it but don't wanna dox myself lol.
this is the worst part of being in the hobby. i'd probably contribute a lot of stuff code-wise and make PRs (there's one for Kijai's nodes I want to fix regarding the preview vae_approx) but I'm not interested in even slightly associating myself with this stuff due to the nature of what I am interested in creating and sharing with the world
>>
>>106577885
and who the fuck is going to finetune it?
>>
>>106577918
no not really, have you seen images from the early days of sd 1.4? i'm talking about pre NovelAI leak(fine tune of sd 1.5O, and then after the NovelAI leak, then the sd 1.5 release.
this is a base sd 1.5 gen btw, and i got this image after 50-ish gens with random seeds. most stuff is just gobbledygook like twisted limbs/torso/hips, and it doesn't even follow my prompt about her pose, "lying on her back in a bed" or something like that, i can't remember.
>>
File: comfy332.jpg (887 KB, 1280x1280)
>>
>>106579042
Post in new >>106577883
>>
>>106579051
ups...
>>
>>106579025
*(finetune of sd 1.4) not sd 1.5O
>>
>>106576964
do a fatalpulse lora
>>
>>106576964
whipping with optional sfx


