[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


As a Cockroach Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106429545

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
AniStudio: https://github.com/FizzleDorf/AniStudio/tree/dev

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://github.com/Wan-Video
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>You're a "Scholar" or a "Connoisseur" of Technology
>The Joy is in the Learning, Not the Output
>Analysis Paralysis & The Curse of Knowledge
how many anons ITT suffer from this too?
if i was a kid i'd be able to make countless projects with the current diffusion models
yet i cant be bothered to do anything besides make a few gens when a new model releases
>>
comfy kind of stole the branding of invoke, no? or was at least heavily influenced by it.
>>
>>106435697
I don't suffer from that.
That's what I enjoy doing. Tinker with workflows, plot stuff, then abandon the model since something else came along.
>>
>>106435682
>AniStudio
Troll op, sad!
>>
>>106435702
yes. the comfyorg CEO has no creative vision whatsoever and just wants to sell off the company as soon as possible with stolen ideas. he's Chinese btw
>>
>>106431486
>>106435516
>>106435702
>no support for vPred
Dropped.
>>
File: image.jpg (766 KB, 1920x2160)
766 KB
766 KB JPG
Thanks anon for let me wake up from this hell!
>>106431486
>>106435516

Please, with all the love of the world, don't sleep in invokeai if you are a txt2img or img2img genner. ITs litteraly the best tool now!
>>
Blessed thread of frenship
>>
>>106435735
>stole the comfy logo from a bike company
>stole invoke UI
>steals your data
lol it really is adding up
>>
>>106435737
For that I use Forge or Reforge, I wont touch that wanna be monopolizingUI ever again
>>
File: 1746797287453367.png (57 KB, 976x556)
57 KB
57 KB PNG
this will surely fix all the ram leaks now, right?
>>
>>106435737
probably no cfg++ either
>>
>>106435752
I don't know, try it?
>>
>>106435752
Nope not updating, you can't make me
>>
>>106435767
Come on, update +0.1 seconds of speed
>>
File: 00036-2356067035.jpg (264 KB, 1536x1536)
264 KB
264 KB JPG
>>
>don't sleep in invokeai
ESL shills
>>
I think I have spaghetti dependency indigestion.
>>
invokeai is so much better than comfy, comfy doesn't even run on lower configurations
get invokeai, it's the right choice
>>
File: image15.png (2.82 MB, 1443x974)
2.82 MB
2.82 MB PNG
Is correcting 200+ Joycaption captions manually worth the effort?
raw joycaption example:
>A comic-style illustration shows a young woman with dark hair, wearing a sleeveless black top and light gray shorts, sitting in shallow water. She holds a broken wooden staff in her right hand and has dark, tangled vines wrapped around her legs. Her expression is tense. In the background, two large, serpentine creatures with gray scales and yellow eyes emerge from the water. The creatures have multiple heads and are partially obscured by green foliage and a rocky cliff on the right. The water is greenish with reflections of the surrounding greenery. The scene is set in a dense, jungle-like environment.
corrected version
>A man with long dark hair wearing a sleeveless tunic floating on a loose thicket of mangrove roots. He is paddling away from a giant tentacled monster, a kraken with a yellow egg sac. In the background are two enormous and thick mangrove trees.

I've masked away the speech bubbles (though not all of them in the dataset).
>>
>>106435810
>Is correcting 200+ Joycaption captions manually worth the effort?
I've been thinking about popping a few addy and chugging away at my dataset for 12 hours straight.
That's the only way I could do it.
>>
>>106435802
based, also supports nodes in json format, and their node system is more like to the unreal engine one, I obviously dont use nodetrash
>>
>>106435810
For that many images, better use gemini
>>
>>106435810
>manually worth the effort
Yes, but it needs serious autism and it's dry as fuck, based thorgal connoisseur.
>>
>>106435810
can't you give it some context?
>>
>>106435802
I agree, you can get it here :
https://github.com/invoke-ai/InvokeAI
>>
File: ComfyUI_00032_.mp4 (153 KB, 512x320)
153 KB
153 KB MP4
>>
>single man responsible for both joycaption and bigasp
basado
>>
File: 3.jpg (181 KB, 1920x1080)
181 KB
181 KB JPG
Imported Comfy node workflow imported for .json, much more organized and visual appealing than Comfy
>>
File: ComfyUI_06366_.jpg (1.74 MB, 2000x1148)
1.74 MB
1.74 MB JPG
>>
File: ComfyUI_07582_.jpg (2.1 MB, 1920x1102)
2.1 MB
2.1 MB JPG
>>
File: ComfyUI_07576_.jpg (2.27 MB, 2000x1148)
2.27 MB
2.27 MB JPG
>>
>>106435885
>>106435878
>>106435870
Neta?
>>
Running a lightweight model and running into an issue where if I try to gen an image with two characters, I can't really get it to properly assign their features. Like if I want to gen an image with two characters, one tall with red hair and one short with black hair, it's pretty much a crapshoot whether I'll get two short characters, two black-haired characters, one short character with red hair and one tall character with black hair, etc. Is this just an issue of an inadequate model or can I mitigate this with different prompting?
>>
>>106435895
Nah it has that weird Flux artifacting
>>
>>106435905
use invokeai
>>
Would anyone of you happen to have this?
https://civitai.com/models/137781/new-era-new-esthetic-retro-anime
https://boosty.to/girlsai/posts/29c18d14-bd36-4a15-a943-3bfb02ef371e
Or at least do you know when he intends to upload it to civit?
>>
this is now a /invokeai/ general
>>
Can I use this invoke whatever remotely? Invoke on main PC + Comfy on local server? If not, it's trash.
>>
wake me up when the schizo leave
>>
>>106436040 you schizo
>>106436030 you sane person
>>
>>106435839
It's possible to just vibe code an UI on top of Comfy or whatever backend you like.

I don't get the UI autism.
>>
File: itwasmeallalongbigbro.mp4 (2.08 MB, 832x640)
2.08 MB
2.08 MB MP4
You couldn't just quit while you were ahead, could you... big bro?
>>
>>106435827
I did the most effort I could muster: extracted panels with imagemagick, went through them and deleted the worst ones, masked away speech bubbles and borders, and let joycaption caption them. But after training it, I realized all the captions are wrong and that you have to give some negative examples (of speech bubbles, etc.) because suddenly it no longer knows the concept of a speech bubble.
But now it requires some real effort. I'm going to do it, just wanted to whine a bit.
Also, I think these comics might be naturally too "low resolution" for the purpose of loras. Like there's not going to be any detailed background characters and anything other than a portrait is going to be pic related. But learning French by reading Thorgal is my current project, so anyhow..
>>
>>106436068
I always knew she was the real culprit!
>>
>>106435895
>>106435908


SDXL noobai vpred finetune

https://files.catbox.moe/kcbsoi.png

Here is a catbox with a raw gen
>>
>>106436065
just do it over diffusers so it's easy to plug in new models. no reason to make it complicated and wasteful with confy
>>
File: crawlingchaos.mp4 (2.68 MB, 832x640)
2.68 MB
2.68 MB MP4
>>106436068
>>
now that the dust has settled and invokeAI is the new king, why would anyone ever use comfy again
>>
>>106436091
Not really related, but what do you guys use for making weapons to not look like regurgitated trash? I keep getting half-broadsword, half-katana abominations with melty hilts.
>>
>>106436068
Which model do you use for anime?
>>
File: 1751373554823206.jpg (1.36 MB, 3148x4143)
1.36 MB
1.36 MB JPG
Is this usable in ComfyUI yet?
>>
>>106435682
Any AI and workflow i can use to generate perfect sprite sheets?
>>
i gen'd like 2000 tortas today
i dont know why i did this
like they aren't hot-thick they're obese fat
>create anything machine
>folder full of thousands of obese latinas
is this what jensen had in mind????
>>
all SOTA VLMs suck
>>
Are all of the default workflows in comfyUI stuck in the past?

I'm trying them and I'm getting pure garbage.
>>
best interpolation node for wan in terms of quality?
>>
>>106436091
Kino.
>>106436153
>what do you guys use
A good base model.
>>
>>106436317
FILM VFI
>>
>>106436133
Will share it in /adt/ thanks, the thread is good and fun and there is interesting technical discussion but we need more people like you to fight the lolis.
>>
File: 1751329968236201.png (892 KB, 1360x768)
892 KB
892 KB PNG
Change the text "Zelda" to "Miku". Replace the round egg with Miku Hatsune.
>>
>>106436215
type: "GOOD MORNIN! ;3" into pos field
>>
File: 1747901853496792.png (898 KB, 1360x768)
898 KB
898 KB PNG
>>106436400
closer font:
>>
huh
>>
>>106435739
About invoke two questions.
What is missing in the communitie edition?
Is it true that there is hardly any choice of models and that it takes an absurdly long time for new releases to be integrated?
>>
File: 1755137549401016.png (892 KB, 1360x768)
892 KB
892 KB PNG
>>
>>106436153
This is with the "moor" replaced with the "streets of medieval Kyoto". A foot shorter, clasp morphing into a chrysanthemum, katana? (I don't know Japanese swords).
So.. I don't know. Don't mention any Eastern stuff? I don't know if anime and manga are inherently Eastern. They might not be, I mean Miku bends to my Western lora pretty easily.
>>106436324
Thank you.
>>
>>106436402
Ok... i guess tech is not there yet.
>>
Has anyone else tried the new InvokeAI Hires Fix yet? It uses a tiled control net, so there are no more artifacts, and you can actually control creativity now.

It has 2 parameters:
1- A structure parameter for ControlNet tile weight
2- A creative parameter for denoise,
Extra: the upscaler model and the same options that a K sampler has!

Inpainting is exelent too. No more downsizing to 1024x por consistency inpaiting!
Just select the BBox, set it to 1024x, and paint the mask and region prompt inside!.

With these editing tools, SDXL is back on top. It's better than Adetailer.

Everything is on one screen. You never have to navigate through 8k pixels of nodes or touch a node again. Its roght there stable and simple, and above all not buggy
>>
lol
>>
File: ComfyUI_00034_.mp4 (353 KB, 320x512)
353 KB
353 KB MP4
>>
File: ComfyUI_39226_.png (1.45 MB, 1024x1536)
1.45 MB
1.45 MB PNG
>>106436435
Invoke is dead lol their only MLE dev left the company months ago so they have only been doing paid api models since.
>>
Daily reminder that Invoke is a failed AI startup that recently tried (and failed) to secure additional funding, and will soon go bankrupt. They are now relentlessly shilling here, Reddit, Discord channels, anywhere they can in a hopeless attempt to turn things around. It's so fucking obvious.
>>106436463
your company is crashing and burning and your shares will be worth nothing lmao
>>
File: 1728188342872853.png (839 KB, 1360x768)
839 KB
839 KB PNG
change the location to an alien planet with a futuristic spaceship in the distance. a large moon is visible in the background. keep the character in green, in the same style.

neat, input was pixel art (kings quest), output is pixel art
>>
>Exception: INSTALL NEW VERSION OF PYAV TO USE API NODES.
can comfy just fuck off with his paid api slop shilling already
>>
>>106436507
>your company is crashing and burning and your shares will be worth nothing lmao
>>106436497 this was for you btw
>>
>>106436455
But I find it more interesting that when the gens don't conform to Japanese culture (though note the golden seal and the binding on the hilt) it reminds me of Junji Ito (the large head I mean).
I hope I don't get banned for posting cartoon gore.
>>
>>106436507
i still remember when it was a one man project by "lstein", much like auto1111, and was initially focused on being "the mac UI". weird how it grew and turned into a buyout/investor bait project
>stein
oh.
>>
>>106436507
>They are now relentlessly shilling here, Reddit, Discord channels, anywhere they can in a hopeless attempt to turn things around. It's so fucking obvious.
Nah, it's just the same anon who is salty about reForge development being canned. They tried shilling Wan2GP a few days ago. Guess they're trying Invoke now. Tomorrow it'll probably be SwarmUI or something.
>>
File: 1744709840611102.png (930 KB, 1360x768)
930 KB
930 KB PNG
>>106436521
change the location to a medieval era in a grass field with a castle nearby, and a red dragon in the sky shooting fire. in the distance is an army with plate armor and swords. keep the character in green, in the same style.
>>
I love anime fennec girls, they're good with technology
>>
File: 1738357514050521.png (675 KB, 1360x768)
675 KB
675 KB PNG
change the location to the surface of the moon, with a space shuttle on the surface. in the distance are several moons of varying sizes in the sky, and stars. keep the character in green, in the same style.

qwen edit is a good pixel art generator, just take an old game screenshot and poof, similar style.
>>
>Last thread devolves into 'muh professional workflow' and 'muh clients' with perfect reddit shill vernacular
>Invoke AI gets brought up and as THE TOOL
>Opportunistic troll Anons latch on to it
>[(You) are here]
This is actually really funny to me.
>>
>>106436435
Community edition has everything except API nodes and services. Only that. Its for gpulets tgats want to run SD 3.5 for example. But the samplers, utilities ans extencions are the same. I did tgat research myself some hours ago before downloading
>>
>>106436595
>>106436619
pretty cool
>>
>>106436435
Invoke runs any SD variant and Flux
Not qwen yet
Not any video model yet
>>
>>106436604
>python/jeetscript abomination shitheap
>good at technology
>>
>>106436507
Meds
>>
>>106436639
you forgot comfy crawling out of the griftspace to fud invoke because he's jealous
>>
>no bounce lora for 2.2
>no triple sampler no lora to light loras workflow for 2.2
never began.
>>
>>106436639
im not here im there
>>
>>106436525
CREATE AN ACCOUNT TO CORRECT BUGS
ALL ERRORS AND COMPATIBULITY IS BECAUSE YOU DO NOT HAVE A GMAIL ACCOUNT WITH COMFY YOU FUCKING BIN LADEN! ATTACH YOUR GMAIL
>>
>>106436153
Wait, do you mean like vanilla Chroma? My advice is to just train any small (non-realistic) style lora. I've found that suddenly concepts that it cannot express in photorealistic images just suddenly bubble up from its ancestral memory. V48.
>>
can someone port forward their comfy instance so I can make groid porn please?
>>
>>106436660
i thought the narrative was that comfy paid Panchovix to shut reforge now. now he's funding invoke because he's jealous..? jesus christ these shitposters lmao
>>
>>106436650
>cannot enjoy cute anime fennec girls without seething about a dev
:(
>>
>>106435739
Simple question for Comfy txt2img or img2img users.
Can comfy do pic related in two clicks?
Haters
>>
>>106432611
I find traditional media difficult to illicit from Neta. At least, more so than Noob.
>>
File: 1735247078061997.png (877 KB, 1360x768)
877 KB
877 KB PNG
change the location to a Japanese temple in Tokyo, with several lit lanterns. the sky is blue and it is cloudy. keep the character in green, in the same style.

in theory you could make an old school rpg with qwen edit backgrounds.
>>
>>106436710
no fennec girl that is good at technology plays in a city sewer getting covered in fecal matter
>>
>>106436507
Ok.
First: show url or some proof of that? Because the Invoke community its growing day by day.

If you dont like the Ui because has a paid service also, 'daily reminder' that Comfy has one too.
>>
File: 1753461754039276.png (925 KB, 1360x768)
925 KB
925 KB PNG
>>106436725
>>
>>106436497
Shut up you, go and fix you own bugs before speaking about another UI!
>>
>>106436497
>Invoke death
Url?
Proof?
Stock market graph?
Some randon twiter comment?
Anything?
>>
File: screenshot.1756591394.jpg (925 KB, 1645x787)
925 KB
925 KB JPG
lora manager for comfyUI was updated to include much better folder navigation. If you use tons of loras, this is simply a must-have. honestly it should be included by default
>>
>>106436714
https://github.com/Acly/krita-ai-diffusion
Anywany, not the point of /ldg/. Go to /ic/ and be the change you want to see in the world.
>>
>>106436760
Fix your bugs first
>>
File: 1728114092161742.jpg (1.61 MB, 2016x1152)
1.61 MB
1.61 MB JPG
>>
>>106436760
>If you use tons of loras
promptlets you mean
>>
File: ComfyUI_00035_.mp4 (205 KB, 320x512)
205 KB
205 KB MP4
>>
>>106436766
actually you have to fumble around with the stupid ass krita config for little things like changing the negative prompt which is really annoying. nice try though
>>
>>106436766
There is a whole post in the last thread why Krita sucks and is not reliable for actual proffesional work an. Team client compatibility.
You AI hobbyist.
>>
>>106436766
>diffusing locally is not the point of the local diffusion thread
this is the level of retardation in these generals.
>>
File: 1739572712912423.png (1 MB, 1360x768)
1 MB
1 MB PNG
>>106436725
change the location to a sunny beach, with women in bikinis walking on the sand. The ocean is nearby and a large vacation resort is in the background. the sky is blue and it is cloudy. keep the character in green, in the same style.
>>
>>106436766
>peopke who uses AI on weekends to have fun
Fuck you neet
>>
File: 1.jpg (335 KB, 992x1456)
335 KB
335 KB JPG
>>106436781
No amount of prompting is going to get you niche/western/unpopular characters.
>>
>>106436760
can you disconnect it from civitai entirely?
>>
>>106436760
Does the loras come attached to a node or I have to click the lora and then click a node and then make the connection?
You silly shiller, think before you post something this stupid.
FIX YOUR BUGS!
>>
What happened to the Chroma schizo? Evolved to InvokeSchizo?
>>
>>106436811
>prompting existing characters instead of unique creations.
it's beyond promptlet at this point, its brainlet.
>>
File: screenshot.1756591846.jpg (82 KB, 494x364)
82 KB
82 KB JPG
>>106436814
It's not really "connected" to civitai? It only uses civitai to fetch metadata.

>>106436816
You click a button and it sends the lora to the lora manager node.
>>
File: 1741832994229967.png (993 KB, 1360x768)
993 KB
993 KB PNG
>>106436792
one more, just to test something silly

change the location to a sunny beach near a volcano. The ocean is nearby and a large vacation resort is in the background. The sky is cloudy. The volcano is erupting lava and is filling the ocean with lava. keep the character in green, in the same style.

neat
>>
>>106436833
Yes Anon some people want to create art of existing characters not OC's. That has nothing to do with prompting. You can fuck off with your retarded logic now.
>>
>>106436785
For actual professional work you are told what tools to use. Did people in the 90s go on tantrums about layers or whatever photoshop introduced?
>>
>>106436854
and that's why ai chuddies will never be artists, all you can do is consume someone else's creations.
>>
File: screenshot.1756592044.jpg (695 KB, 1754x762)
695 KB
695 KB JPG
It also has a Firefox/Chrome extension that lets you easily download loras and have them automatically updated in Lora Manager.
>>
comfy should be dragged out on the street and shot
>>
>>106436880
Artists draw other's creations all the time. Again, you are retarded.
>>
>>106436834
>You click a button and it sends the lora to the lora manager node
>>
File: Qwen_WhiteLady_Output.png (2.38 MB, 1536x1536)
2.38 MB
2.38 MB PNG
Flux had Flux Girl
Qwen has Qwen Girl
despite not being distilled
I dunno know what this derives from
it's almost like it REALLY just wants to make everyone Asian even when prompted otherwise but is toeing the line or something lol
>>
>>106436834
>It's not really "connected" to civitai? It only uses civitai to fetch metadata.
Yeah ... I don't want that.
>>
File: ComfyUI_00247_.png (2.96 MB, 1264x1856)
2.96 MB
2.96 MB PNG
>>
>>106436882
>unnecessary extensions while your UI shits itself
It reminds me of the eve of the French revolution when France was starving and the kings were doing unnecessary things that did not dialogue with the reality of the population.
>>
File: screenshot.1756592276.jpg (183 KB, 497x576)
183 KB
183 KB JPG
>>106436900
It doesn't get any easier than that. I don't know what more you want.

>>106436906
I'm confused. Perhaps you are implying you are making illegal LoRA's and are afraid it is sending metadata to Civitai or something? That's not how it works.
>>
>>106436913
Vpred sloooop
>>
>>106436882
Do people actually run Comfy like pip install -e everything.txt? I read about that malware in a custom node just before getting into local and decided to run all AI related stuff in rootless podman.
>>
>>106436927
How many screens must I have to use this extension nd node with a simple workflow?
No, I wont scrooool
No, I wont zoooooming out
>>
anyone like to share their workflow for qwen image edit for styletransfer with 2 input images?
I can't figure it out right now. I am attaching two latents together, but it should only output one image instead of two next to each other.
Looks like I'm too retarded.
Ty anon
>>
>>106436955
https://www.reddit.com/r/StableDiffusion/comments/1myr9al/use_a_multiple_of_112_to_get_rid_of_the_zoom/
>>
>>106436955
Anon listen to me.
Qwen works PERFECTLY in Comfy
Qwen works PEEFECTLY in a third world GPU.
But they will never share a Qwen workflow.
They will throw you rentrys and github tutorials.
>>
>>106436927
No anon I literally just want to edit my own lora metadata because civit likes to purge models out of the blue and relying on them is a recipe for disaster.
You know what, I'll just steal this code and do it myself.
>>
i wish neta was good already
>>
>>106436903
Do normalfags really find this attractive?
>>
File: screenshot.1756593121.jpg (155 KB, 743x742)
155 KB
155 KB JPG
>>106437022
>I literally just want to edit my own lora metadata
you can do that.
>>
>>106437046
no
>>
whats the goto shit for controlnet setups these days bros?
ControlNetPlus looks the most complete, anything newer?
>>
>>106436934
pip installs are perfectly safe. It's the unreviewed extension code that is the problem.
>run all AI related stuff in rootless podman.
Good call.
I am doing the same with docker.
>>
>>106437053
the fuck is that trigger word
>>
>>106437104
forge
>>
>>106437124
I guess they wanted it to be really unique. Either way, you can edit any and all metadata if you don't want the metadata from Civitai.

>>106437022
>civit likes to purge models out of the blue and relying on them is a recipe for disaster.
Also forgot to mention, the metadata.json is of course saved locally so even if civitai purges something, you'd still have it. You can also optionally download all examples the creator used

Everything is saved locally.
>>
>>106436934
docker is a pretty popular method too. shit is too amateur to trust the software to detect threats. continue what you are doing. there was malicious npm repos lurking about recently too
>>
>>106436231
The only thing in his mind is money and real bitches
>>
I really want to dump $2.5k for a 5090, but honestly I should just wait, save my money for a 6090 in 2027, which will probably cost $4k. The 6090 will probably be way better than the 5090 since NVIDIA is 100% focusing on AI now. I'm expecting 48gb vram+ at minimum.

Buying a 5090 now, especially with the connector heating issues it has, would be something only a retard would do.
>>
>>106436827
Chroma is abandoned, unfortunately. There was a big meeting and everyone decided that Chroma, Qwen, Flux, Lumina, SDXL, Pony v7, Wan, and HiDream all suck and that we're moving back to SD1.5.
>>
>civitai still doesn't have a category for chroma
>qwen got it basically from day 1
>wan got it after 2 days
what the fuck is taking so long? are they simply afraid of chroma because it's uncensored?
>>
File: 1730274559533347.png (994 KB, 1360x768)
994 KB
994 KB PNG
change the location to a pond in a forest. a tent is nearby with a campfire outside it. keep the character in green and in the same style.

all based off a kings quest game screenshot, and in quen edit. pretty cool imo
>>
>>106436176
Link to paper?
>>
>>106437182
>The 6090 will probably be way better than the 5090 since NVIDIA is 100% focusing on AI now. I'm expecting 48gb vram+ at minimum.
Lol no. They deliberately VRAM gimp consumer cards to segment them out of enterprise/server cards that cost a lot more. It was surprising when 5090 got revealed to be 32gb. I wouldn't expect any bump till 7090 or whatever.
>Buying a 5090 now, especially with the connector heating issues it has, would be something only a retard would do.
Well you can decrease power limit to lower the odds of that happening. But yes Nvidia fucked up big time with 12vhpwr design.
>>
>>106437217
No one is making LoRA for it (al least not releasing it there), I just saw a furry LoRA and they trained it on flux lol
>>
>>106437252
>Well you can decrease power limit to lower the odds of that happening.
Forgot to add that up to a point it shouldn't affect slopping performance too much.
>>
>>106437252
>They deliberately VRAM gimp consumer cards to segment them out of enterprise/server cards that cost a lot more
Yes, but what's stopping them from upgrading enterprise/server card VRAM and making 48gb vram the new baseline for consumer cards? That's what I'm saying.

Or they can just increase the cost of the 6090 to be I dunno, 30~50% the price of enterprise/server cards.

>Well you can decrease power limit to lower the odds of that happening
You shouldn't have to worry about decreasing the power to prevent your house from burning down though. It's just a flawed design.
>>
>>106437217
Qwen releasing sorta overshadowed Chroma. Don't get me wrong. It's worse than Chroma for NSFW realism no question, but Qwen Edit makes it appealing to normies.

The ones making the real sketchy NSFW realism loras are not sharing them. No one is using Chroma for anime of course.
>>
>>106437217
>chroma is uncensored and they don't want to draw the ire of payment processors again by officially supporting it in any way
>virtually nobody outside of this thread has switched over to chroma, for various reasons
>>
>>106437291
Nah, they are afraid of what it can do out of the box. It needs no tune.
>>
>>106437276
>Yes, but what's stopping them from upgrading enterprise/server card VRAM and making 48gb vram the new baseline for consumer cards? That's what I'm saying.
I am interested in armchair GPU design discussion. Long story short, they most likely won't.
>You shouldn't have to worry about decreasing the power to prevent your house from burning down though. It's just a flawed design.
I am NOT excusing anything. But realistically you don't have too many options if you want >24GB Vram for slopping right now.
Alternatively you can get those Chinese 4090s with 48gb or 96gb VRAM.
No warranty and you gotta kick rocks if you are scammed but it's an option.
>>
>>106437300
"switch over"? what does that even mean. each model has their weaknesses and strengths. you dont need to only use 1. sdxl for anime, wan for video, qwen for image edits, chroma for nsfw realism.

these are the golden ratios
>>
>>106437291
I don't think that the "it's not popular enough" and "it was overshadowed" lines of thinking explain the entire reason. All they would have to do is add a tag for it in the filters. People have explicitly asked them to do that and they've ignored it. CivitAI seem like they have an actual motive for not wanting to support it.
>>
>>106437217
have you checked if there's even a feature request open for it on the CivitAI bugtracker thing? It could be that nobody ever officially asked for it
>>
>>106437313
Chroma is like SD 1.5 but on steroids. Every model that is "controversial", "bstaber" etc... combined. That scares (them)
>>
>>106437316
nooooooooo you can ONLY use one model, there HAS to be a unform /ldg/ meta at all times! Conform conform conform, things can NEVER co-exist
>>
>>106437325
>CivitAI seem like they have an actual motive for not wanting to support it.
you can generate a realistic naked child sucking dick without doing any additional bullshit. that's why.

>>106437332
this
>>
>>106437332
it has precisely zero raw knowledge that numerous previous models didn't also have. The only thing that sets it apart is having been captioned with high-quality natural language descriptions.
>>
>>106437350
That's probably the reason, yeah. Doubt CivitAI wants to deal with everything that would involve.
>>
>>106437316
>chroma for nsfw realism
Barring a handful of exceptions, people outside of this thread that are interested in this are still using SDXL or Flux. Chroma hasn't caught on much.
>you dont need to only use 1
You don't, and it's retarded to do so, but a lot of people do.
>>
We need a manhattan project for AI.
And no, not for AGI.
We need the new architecture, we need market-ready neural chips.
Right now they're trying to rebuild a steam-powered locomotive to fly to the moon with enough steam instead of finishing the rocket engine that's already in the drawer.
I am the only one in my circle of friends who studies the state of development and the new development paradigms. Most of them have heard about it, but don't understand the revolutionary nature of this development.
You?
>>
>>106437350
"realistic" is a stretch anon, people gotta stop pretending that Chroma just shits out Flux Krea type gens at the drop of a hat as opposed to being more in line with Pony Realism unless you put in a decent amount of prompting effort.

It's a furry model that he just happened to train on other stuff too, it was never intended to be some kind of Ebin Real model, people just can't help but assign their personal desires to things they aren't involved with developing.

And like it would have been understandable if Chroma WAS literally just FluffyRock Flux Edition with no other kinds of data, given what Lodestones was previously known for. The guy gave us more than he "had" to by any metric.
>>
>>106437399
>as opposed to being more in line with Pony Realism unless you put in a decent amount of prompting effort
pony doesn't come close to a chroma lora trained on realism.
>>
>>106437361
The parameter count alone makes it much better at depicting what it knows than anything came before it. The model is as good as Dalle 3 in coherence (which was already pretty "dangerous" at prompt following capability) and also photorealistic on top of that. While other models, even if they saw same data, wouldn't get close to this prompt following ability, and it would look like a render if we're talking basic prompts where it did.
>>
Something I've been meaning to ask: is there a need for models to be getting larger and larger? We're at the point where newer models are so big that the people known for doing large scale finetunes are saying "fuck that." If someone made a model that was basically just an SDXL-sized Qwen, would it actually be that much worse than the current 20b model?
>>
File: ComfyUI_00035_.mp4 (411 KB, 640x640)
411 KB
411 KB MP4
ty to anon who recommended the quantized wan2.2 14B
>>
File: succ-treasure.jpg (978 KB, 2688x1536)
978 KB
978 KB JPG
civitai won't even open for me. I wonder if my state is blocked like with porn sites. At any rate after using chroma for a while I find it useful but accept a lot of the complaints about it. It's inconsistency reminds me of an undertrained lora. But I can get the same effect with a lora (or rather embedding is where I saw it in the past) that has a dataset tagged in a contradictory manner so that it can't properly converge. It's generation is for some reason noticeably slower on my machine than pixelwave flux. A pull might help with that (I am quite out of date).
>>
>>106437414
how is this a sensible response to what I actually said lmao
>>
>>106437437
noice
>>
>>106437439
you're comparing it to pony realism, which annoys me. not remotely close in quality
>>
>>106437438
Chroma is just slow, I don't think a pull would help all that much.
>>
>>106437394
Only people building this would be glows and only use it for oppression.
>>
>>106437438
A 4k step Chroma lora at rank 68 at 1028 res takes me like 8 hours to train. Worth the wait though. Just run it overnight, problem solved.
>>
>>106437217
Yeah it's pretty strange. If I had to guess there's probably some petty disagreement with the usual suspects which would also explain the hysterical hatred of Chroma.
>>
>>106437380
>Barring a handful of exceptions, people outside of this thread that are interested in this are still using SDXL or Flux. Chroma hasn't caught on much.

Most of them are also blissfully unaware of Chroma's existence, what it can do, or how to prompt it properly.
>>
>>106437462
I'm uninformed. Who are the usual suspects?
>>
>>106437217
That still doesn't really add up, because it's not like you can post sketchy loras on Civitai even if the model allows you to make that type of content.
>>
>>106437182
This kind of thinking never EVER works out. I've been seeing this kind of magical thinking that the next generation of GPUs will somehow be worth it over the current one. The fact is you're just scared to take the financial plunge now and think you will be in a better position in a few years.
>>
>>106437475
Tech trannies
>>
>>106437426
> DALLE 3 revisionism
it was never that good anon, it has SDXL base tier aesthetic at BEST in even the most unrestrained scenario, and the spatial awareness really isn't that good compared even to Flux Dev in many cases

Next you'll be asking "when will local catch up with Midjourney" despite Midjourney V7 being fairly behind numerous local and non-local models that already exist on basically any benchmark chart or leaderboard you could view
>>
>>106437467
>Most of them are also blissfully unaware
What if I'm blissfully aware of how shit Chroma is and I become actively annoyed when people act like it isn't a deeply flawed model that trades literally everything that makes an image good for the ability to generate blurry vaginas and penises?
>>
>>106437399
Literally just asking for a photograph of anything gives you good results 99% of the time. Not even using additional style keywords. Just don't suck at prompting.
>>
>106437492
promptlet
>>
>>106437481
5090 is 40% faster than the 4090 so there's some truth to it. Of course, that's mostly because it has higher vram capabilities, but still.

>The fact is you're just scared to take the financial plunge now and think you will be in a better position in a few years.
IMO, if you buy a gpu, you should plan on using it for 3-4 years before upgrading. If I buy a 5090 now, I'd basically have to skip the 6000 series in order for it to be worth it.
>>
File: 20250817_153418.jpg (28 KB, 640x415)
28 KB
28 KB JPG
How long do you guys think that something like Nano Banana will be done on a local level? Image editing with AI that keeps the image consistent is what I've been waiting for and it's almost here. The biggest issue is how pozzed everything is with "muh ethics"
>>
File: 1741736327502777.png (1.02 MB, 1360x768)
1.02 MB
1.02 MB PNG
>>106437235
>>
>>106437451
"realistic" gens with Chroma LITERALLY resemble Pony Realism quite strongly in many cases unless you really prompt hard and use exactly the right sampler, and even then it has primarily a super raw "muh amateur iphone selfies" type aesthetic, which isn't the be-all-end-all of realistic gens
>>
>>106437500
>Still no chroma category on civit
>No more epochs to burn cash on
>Nobody discussing Chroma outside of a few sunk cost fallacy retards here.

I bet even the discord has forgotten chroma and is hyping up the qwen tune by this point.
>>
>>106437235
Qwen edit is really good. Too bad it's slow as shit.
>>
>>106437520
it's not purely about looks but prompt adherence and how well it can do real content. chroma has much more flexibility in what it can do. any model can make a realistic 1girl just standing doing nothing.
>>
File: ComfyUI_00036_.mp4 (217 KB, 512x320)
217 KB
217 KB MP4
>>
Chinese Communists launch offensive against Nvidia: https://e.huawei.com/cn/products/computing/ascend/atlas-300i-duo
>>
>>106437544
Yeah but those added prompt adherence comes with the cost of genning mutants half the time on chroma
>>
>>106437399
You could just say "I couldn't get the model to work how I wanted" instead of projecting.

>>106437438
What's your prompt? Chroma is slow as hell but it's giving me the best results for anything even vaguely creative these days.
>>
>Chroma is slow as hell
This is the reason it hasn't caught on it. It really is that simple. The #1 complaint about Chroma outside of this thread is that it's slower than shit.
>>
>>106437559
already posted on reddit. those huawei gpus are shit by comparison.
>>
>pc keeps running out of hard drive space every time i generate
what the fuck? is it normal for usage to fluctuate and swing this wildly? 300+mb????
>>
>>106437559
>LPDDR4X 96GB
the fuck is this
>>
>>106437580
nunchaku support for chroma soon
>>
>>106437583
>those huawei gpus are shit by comparison
For now.
>>
>those amd gpus are shit by comparison
For now.
>>
>>106437587
2 more weeks?
>>
The Chroma cope is off the chart today.
It's done. Everyone is moving on. I'm glad you enjoyed genning on the epochs while they were being trained but it's obvious to everyone now that Qwen will be base to train on going forward.
>>
>>106437580
Being slow is one thing.
Waiting through all the genning to get greeted with deformed anatomy is whole other.
It's the only NLP and porn (out of the box) capable model out there.
People would wait if it wasn't broken.
>>
>>106437604
This is bad bait, you should have said Wan instead of Qwen.
>>
>>106437604
Qwen is censored.
Chroma is not.
Why can't you get that through your thick skull? No shitmix/lora is going to make Qwen comparable.

Go on Reddit and look up every post asking for the best model for realism. Chroma is always at the top. Now Fuck off.
>>
>>106437580
>Chroma is slow as hell
3 minutes of gening is really boring. and all the fast chromas give me monsters.
>>
File: alpha.png (1.24 MB, 896x1152)
1.24 MB
1.24 MB PNG
Have you ever tried to write a prompt that includes every letter of the alphabet?
>absurdres, bare shoulders, closed mouth, dress, earrings, full body, green eyes, hand on own hip, interior, jewelry, kneehighs, lips, miniskirt, narrow waist, ojou-sama pose, pointy breasts, qingxin flower, ribbon, smirk, tight clothes, underwear, very long hair, wet, x hair ornament, yuri, zettai ryouiki
>>
>>106437614
>you should have said Wan instead of Qwen.
Read the whole post. I said base to TRAIN on.
>>106437616
>Qwen is censored.
As I said, read the post.

What a bunch of retards.
>>
>>106437624
I was able to get decent results pre-v50 with some of silveroxide's loras, but they don't work well with HD and they're all located in his obscure huggingface repo without much further information.
>>
File: 1735551008230237.png (1.67 MB, 1328x1328)
1.67 MB
1.67 MB PNG
it miku bday :)

An anime style Miku Hatsune sitting at a table in a kitchen. A vanilla birthday cake with strawberries is on the table, with lit candles and "8/31/25" in red icing on the cake. Miku is smiling. On her arm is the text "01" in red text.

with qwen q8: usually I use waiv14 or hassaku for anime but I wanted to see what qwen can do. text is good. but you have to add the 01 prompt otherwise you get a barcode on her arm.
>>
>>106437569
way to miss my point which was that people are dead set on insisting Chroma is what they personally want it to be and not what it actually is (and this applies both to realismfags and animefags)
>>
>>106437616
I only ever see people give the qwen to wan combo tip.
>>
This troll will say any model is better than Chroma. Even SDXL. Or SD 1.5 for that matter. All his arguments are nonsensical.
>>
File: probe-alien-lake.jpg (1.09 MB, 2688x1536)
1.09 MB
1.09 MB JPG
>>106437569
> What's your prompt?
I didn't save it but it is using the Royo lora I posted a while back.
>>106437627
I have not but I noticed that with pixelwave flux when I give prompts in latin it seems to default to old European-style woodcut paintings for some reason.
>>
If we just get a millionaire to bankroll people's finetune attempts and then wait a year for them to finish then surely the Qwen ecosystem will truly thrive. It's the perfect base model.
>>
File: muh dik.png (1.18 MB, 896x1152)
1.18 MB
1.18 MB PNG
>>
File: d.webm (2.51 MB, 720x1248)
2.51 MB
2.51 MB WEBM
>>
>>106437559
this is old news, these cards have existed for a while. the fact that people are only mentioning them now just shows how useless they are
>>
File: QwenBJ.png (14 KB, 1831x137)
14 KB
14 KB PNG
>>106437616
this has to be bait
Chroma is an architecture mod + heavy finetune of Flux Schnell, a VERY censored model. Which is fine, I like Chroma a lot for some use cases, any sort of weird hardcore NSFW you'd previously have maybe used Pony for it can do way easier, for example.

I'm not overly "sold" on Qwen personally for several reasons but I'm the sort of guy who always tries to give every model a chance, hence why I'm currently baking this blowjob lora for example
>>
>>106437663
>This troll
I'm not the one making the arguments that 1.5 is better. I'm just saying that Chroma is over and there will be no mass adoption because better models to train on exist now and it is would be pointless to spend any more computer on a model with busted anatomy no matter how long you train on it.

Flux schnell being a distill and actively resisting training from the get go was a huge indicator that the whole venture was ill conceived and it was a miracle that Chroma got as far as it did.
I know that hurts your ego as you've spent a long time molding your personality around Chroma, but it's the objective truth.
>>
very poor attempt
>>
>>106437704
I'm not attempting anything I'm just laying out the facts.
>>
>>106437694
You like genning slop, we get it. You don't have to derail every discussion on Chroma just because you like slop.
>>
>>106437683
I don't care at all about the Chroma argument, but could you share your training settings?
>>
>Chroma was made by putting Schnell in a blender and then trying to piece it back together, leading to obvious issues in the final model
>Qwen is aesthetic-tuned to such a retarded extent that overcoming its base slop tendencies is almost impossible, and it's so large that nobody will ever properly finetune it
SD1.5 is literally better
>>
File: rina.webm (1.9 MB, 832x640)
1.9 MB
1.9 MB WEBM
>>
>>106437715
I don't get why you're getting so mad at me?

Are they still training Chroma? No
Will they move to a Qwen based model? Most likely.
Has any space outside of a few people here adopted Chroma in any capacity? Not really, no.
Is there a Chroma category on Civit? Not yet.
Are there serious and systemic issues with anatomy, background coherence, noise and nonsensical objects in Chroma gens? Yes.

I don't get why I'm a troll for pointing this all out to you when you ask why it seems Chroma is failing.
>>
https://arxiv.org/abs/2502.12154
So was this model guidance just Chinese bullshit?
>>
But why do you need to gen anything but 1girl?
>>
>>106437627
Kneehighs and ZR don't really go together as does solo yuri.
>>
>>106437674
fucking horrible
>>
>>106437741
No one outside of any small space has significantly adopted local models anon. Don't make me remind you of MJ numbers compared to local.
>>
>>106437757
I feel like 1girl, especially anime 1girls, is basically a solved problem since sdxl-based fine tunes. And realistic 1girl is mostly solved. The video models are I guess useful too if you masturbate to five second gifs but they have no other value.
>>
File: 1756464658555363.webm (2.06 MB, 640x640)
2.06 MB
2.06 MB WEBM
>>106437650
and of course, you can use wan 2.2 to make it move.
>>
>>106437741
>>
>>106437781
I know that it's a solved problem. It's also the only thing worth genning, so what's the point of all of these new models?
>>
File: 00234-1476748619.png (3.83 MB, 1432x2144)
3.83 MB
3.83 MB PNG
>>106437757
anon, have you thought about ... 2girl?
>>
File: ComfyUI_00037_.mp4 (265 KB, 512x320)
265 KB
265 KB MP4
>>
>>106437801
Retarded. Stupid. If I wanted 2girls I would just gen 1girl twice. Are you even thinking?
>>
>>106437800
It seems like the second biggest use case based on the thread are dumb political memes. I suppose these need newer models.
>>
>>106437243
https://github.com/bytedance/USO
>>
File: 1756319723537883.png (377 KB, 2729x763)
377 KB
377 KB PNG
>>106437716
I think I have a couple of times, here they are again as far as what's worked well for me on TensorArt so far.

Only things to note beyond what's in the pic are:
- that their default "Qwen" trainer option there is the FP8 ComfyOrg safetensors, but you can choose the BF16 safetensors upload through the "Custom" menu for no extra credits cost, so I typically do that just cause why not.
- the number of scheduler restarts is 3, that's the one thing they don't list for some reason in the external settings breakdown.
- your number of epochs might vary, of course, 54 there was arbitrary (I personally never use any repeats though for any model as I find the impact is objectively worse always than just doing more epochs)
- their trainer backend is custom AFAIK but it seem to strictly use "Kohya scaling" for Dim, and for Qwen what that means is a Dim 16 lora will come out around 280 MB, a Dim 32 one will be like 590 MB, and a Dim 64 one (which would probably cause the run to fail if you even tried it lol) would presumably come out at like more than 1 GB theoretically
>>
File: 1755183505753226.mp4 (1.08 MB, 640x640)
1.08 MB
1.08 MB MP4
>>106437786
cutting a cake slice:
>>
File: 789.jpg (184 KB, 614x637)
184 KB
184 KB JPG
>>106437580
>>106437624
Use chroma cache. Big savings on 30 steps.
>>
>>106437741
These generals have a very very strong contingent of people who can’t just be happy with the method they’ve found comfortable, they also NEED to police others into doing it their way as well. (Chroma, ani, comfy, anti forge, and on and on and on). This space has enough going on that we can all just come together to goon in harmony, but no, you CANT USE anything besides comfy, kys if you do, seems to be the idea instead. It’s “my dad works at Nintendo and he could beat up your dad” for the zoomer generation I guess
>>
>>106437846
I'd just use noob instead of going through that shitty wait time
>>
My uncle works for Nvidia and he said he'll double GPU prices if you don't start using Invoke
>>
>>106437827
nice, very crisp, hard to tell it's only 640x640 at a glance
>>
>>
>>106437868
tell him namaste ringwald, he'll know what it means
>>
>>106437741
Why would Qwen succeed if the masses refuse to run Flux? There's not going to be any mass post-SDXL model adoption until Nvidia makes 24GB the gaming standard.
>>
>512x512
>no artist tags
>no characters
>mangled limbs
>melted noisy nonsensical details
chroma isn't slow for me, but it's simply not worth using.
>>
>>106437580
Unfortunately nunchaku failed us. But Chroma 1 HD Flash is fast and good enough for VRAMlets.
>>
If someone would make a T5 natural language SDXL, or an equivalently sized model, then that would be the new standard.
>>
>>106437752
it's just snake oil
>>
>>106437896
That's due to your fundamental misunderstanding that Chroma resisted adoption due to its size.
I assure you, most people downloaded it, genned a few nonsense pieces and deleted it. Take a look at any workflow to make it even generate anything passable. BONG TANGENTS? what the fuck is that.
>>
>>106437904
>Flash
Stop shilling this nonsense. It dumbs down the results too much. Not as bad as the earlier versions, but still
>>
pros/cons of running comfy in docker instead of natively on the host? security benefits? performance impact?
>>
>>106437904
>Chroma 1 HD Flash
>good enough
lol
>>
Did nunchaku ever say they weren't going to do chroma or is it still on their roadmap?
>>
>>106437917
u look like a bitch if u use docker
>>
>>106437900
As opposed to what?
>>
>>106437886
>>106437788
if this lil nigga left 4chan forever this general and board would be greatly improved overnight
>>
>>106437896
It wouldn't succeed. It's 20B so training that instead of Chroma would be impractical. The reason Chroma doesn't know artist tags is because Flux was so massive and it's hard to make a dent on these models. Qwen would be way worse. It'a so hard baked that every seed looks the same.
>>
>>106437683
Semantics I know, but calling it a finetune of Schnell really isn't accurate either.

Also for anybody dumb enough to fall for the bait, there is no qwen-based Chroma. Lode is working on switching from the t5 encoder to a Qwen2.5 based encoder.
>>
>>106437930
It's still there. 90% of this thread is bait.
>>
>>106437942
>Qwen2.5 based encoder.
Wait really? I was trying to figure out how to do this shit myself. Good shit.
>>
>>106437825
sample output for epoch 10 looks relatively promising already I think. I'd hope it would be though as I was schizo tier autistic about making sure the captions were perfect down to like, ensuring they didn't mix up hand side or say that someone was licking when they were really sucking and so on and so forth. Every single image is of a completely different chick (or pair of chicks for a few) with a pretty big race / age range so it should be about as appearance-agnostic as it possibly could be, also.
https://files.catbox.moe/20fonu.png
>>
>Talking with Qwen users
>"Qwen is pretty slopped"
>"Haha yeah, but it's got its upsides too"

>Talking with Chroma users
>"Chroma has a few issues"
>"WHAT? How fucking DARE you insult my model? I'll have you know it's the ONLY uncensored model in the world. It's literally perfect in every single way. Workflow? Post gens?"
>>
>>106437908
>SDXL
>4b params (extra 1b)
>16ch vae
>1.2k base res
>fixed vpred/color implementation
it's that simple, yet furries decided to throw $150k at flux schnell only to encounter the same anatomical issues everyone who even attempted to train a flux lora warned them about.
>>
File: WanVideo2_2_I2V_00243.webm (225 KB, 1248x720)
225 KB
225 KB WEBM
>>
>>106437912
Not an argument. Qwen has even less local adoption.
>>
Why does every Chroma argument sound like a cornered dog viciously lashing out at anything approaching it?
>>
>>106437825
How many images do you have for this one? And I see batch size is undefined, do you know what's being used?
>>
>>106437990
This should be easy to verify. I'll just check the Qwen usage against the Chroma usage on Civi- oh wait.
>>
You should use positivity about your favored model and post good gens people want to imitate to get other to use it rather than attacking the models you don't like via negativity. If you do this, everyone will just leave the model you don't like behind without you even having to discuss it if the other one is actually better. But I think what is going on is that outside of a few use cases all the models are so similar in capabilities that you can't really do this so we get console war crap.
>>
>>106438011
I'll just check qwen downloads vs chroma on hf. oh my.
>>
>>106437967
terrible. throw it in the collage!
>>
>>106437908
no it wouldn't lol

Kolors is and was effectively that (a ground up model on a different much more aesthetically pleasing dataset than Base SDXL using the SDXL arch, with ChatGLM 3 6B as the encoder) and nobody fucking gave a shit (despite the fact that it trained great, just like SDXL basically but with proper NLP understanding, really easy to teach it NSFW in the lora experiments I did).

Do not even start trying to tell me the license was "bad", either, it wasn't to anyone who actually understands fucking English, you can go to their HuggingFace and see that it basically just has a clause saying SaaS operations who have in excess of 100 MILLION monthly users need to request a special commercial use license. Which is whatever, I don't understand why randos who literally only care about porn loras and shit pretend like these commercial use clauses are even relevant to them for any model anyways.

TLDR There is no way you could modify actual base SDXL to be better than what base Kolors already is, I promise you.

Also why are you pretending like Lumina 2 doesn't exist? It does, it's got a 16 channel VAE, it's only a 2.6B param model, Gemma has a context of 8192 vs T5's 512, and we've even seen at least one full scale proper anime finetune of it already with relatively promising third party improvements on top of that finetune so far.
>>
File: Untitled.png (79 KB, 1271x224)
79 KB
79 KB PNG
>>106438020
huh?
>>
>>106438029
I didn't read this but you're wrong, someone needs to make SD1.5 use T5.
>>
>>106438029
>much more aesthetically pleasing dataset than Base SDXL
KEK I still remember the hypersloppa
>>
File: ComfyUIPIL_00528_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
Would something like this work, and if so, has anyone already done it?
>embed RAG database with all booru tags and their semantic meanings
>natural language prompt uses the database to transform it into a list of relevant tags
>no more needing to learn and look up magic booru words to gen on booru trained models
>>
>>106438011
I was confused once when bunch of people were sperging out in one lora's comments, "WE WANT VERSION X BACK", etc., yet it was there, free to download. It took like two minutes for me to realize 99% of civitai users are not local.
So what's your point?
>>
File: 1748066909717402.webm (2.39 MB, 640x640)
2.39 MB
2.39 MB WEBM
using the kijai 2.2 wan loras with 1.5 strength high, 1.0 low, now I get actual candle flicker:
>>
>>106438036
God all those numbers are so small really in the grand scheme of things. Local gen is basically irrelevant.
>>
>>106438058
Understanding local gen is basically a superpower.
>>
tfw i am a super human
>>
>>106438005
it's 150 autistically curated images captioned with jailbroken Gemini 2.5 pro lol.

Batch size is undefined because they use Gradient Accumulation Steps instead of Batch Size in their trainer for big chungus models like Qwen in order to save memory / avoid crashing on big runs. It's technically set to 1, though, behind the scenes. So if you were training somewhere else or locally and actually using normal batch size the closest equivalent to my settings would be batch size 4.
>>
>>106435985
>https://civitai.com/models/137781/new-era-new-esthetic-retro-anime

Login and you can download it 6.6GB
>>
>>106438071
>>106438071
>>106438071
>>106438071
>>
I make glorious titties, kneel before me saas-lets
>>
File: 1726586191678608.webm (2.26 MB, 640x640)
2.26 MB
2.26 MB WEBM
>>106438053
2 strength high, 1 low

seems a good starting point.
>>
>>
>>106438043
it was pretty objectively a more capable base model both due to the dataset and due to it just having proper NLP support. You sound like the sort of person who will just complain until the heat death of the sun about nothing that comes out ever being good enough while simultanously crying about why nothing new is coming out though lol
>>
>>106438082
SLOP>>106438021
per usual
>>
>>106437683
how does it compare to pony graphically? that's what I'm using now but I'll check out chroma



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.