As a Cockroach Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106429545

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
AniStudio: https://github.com/FizzleDorf/AniStudio/tree/dev

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://github.com/Wan-Video
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>You're a "Scholar" or a "Connoisseur" of Technology
>The Joy is in the Learning, Not the Output
>Analysis Paralysis & The Curse of Knowledge
how many anons ITT suffer from this too?
if i was a kid i'd be able to make countless projects with the current diffusion models
yet i cant be bothered to do anything besides make a few gens when a new model releases
comfy kind of stole the branding of invoke, no? or was at least heavily influenced by it.
>>106435697
I don't suffer from that.
That's what I enjoy doing. Tinker with workflows, plot stuff, then abandon the model since something else came along.
>>106435682>AniStudioTroll op, sad!
>>106435702yes. the comfyorg CEO has no creative vision whatsoever and just wants to sell off the company as soon as possible with stolen ideas. he's Chinese btw
>>106431486
>>106435516
>>106435702
>no support for vPred
Dropped.
Thanks anon for letting me wake up from this hell!
>>106431486
>>106435516
Please, with all the love of the world, don't sleep in invokeai if you are a txt2img or img2img genner. It's literally the best tool now!
Blessed thread of frenship
>>106435735
>stole the comfy logo from a bike company
>stole invoke UI
>steals your data
lol it really is adding up
>>106435737
For that I use Forge or reForge. I won't touch that wannabe monopolizing UI ever again
this will surely fix all the ram leaks now, right?
>>106435737probably no cfg++ either
>>106435752I don't know, try it?
>>106435752Nope not updating, you can't make me
>>106435767Come on, update +0.1 seconds of speed
>don't sleep in invokeai
ESL shills
I think I have spaghetti dependency indigestion.
invokeai is so much better than comfy, comfy doesn't even run on lower configurations
get invokeai, it's the right choice
Is correcting 200+ Joycaption captions manually worth the effort?
raw joycaption example:
>A comic-style illustration shows a young woman with dark hair, wearing a sleeveless black top and light gray shorts, sitting in shallow water. She holds a broken wooden staff in her right hand and has dark, tangled vines wrapped around her legs. Her expression is tense. In the background, two large, serpentine creatures with gray scales and yellow eyes emerge from the water. The creatures have multiple heads and are partially obscured by green foliage and a rocky cliff on the right. The water is greenish with reflections of the surrounding greenery. The scene is set in a dense, jungle-like environment.
corrected version:
>A man with long dark hair wearing a sleeveless tunic floating on a loose thicket of mangrove roots. He is paddling away from a giant tentacled monster, a kraken with a yellow egg sac. In the background are two enormous and thick mangrove trees.
I've masked away the speech bubbles (though not all of them in the dataset).
>>106435810>Is correcting 200+ Joycaption captions manually worth the effort?I've been thinking about popping a few addy and chugging away at my dataset for 12 hours straight. That's the only way I could do it.
>>106435802
based, also supports nodes in json format, and their node system is more like the Unreal Engine one. I obviously don't use nodetrash
>>106435810For that many images, better use gemini
>>106435810>manually worth the effortYes, but it needs serious autism and it's dry as fuck, based thorgal connoisseur.
>>106435810can't you give it some context?
>>106435802I agree, you can get it here : https://github.com/invoke-ai/InvokeAI
>single man responsible for both joycaption and bigasp basado
Imported a Comfy node workflow from .json, much more organized and visually appealing than Comfy
>>106435885>>106435878>>106435870Neta?
Running a lightweight model and running into an issue where if I try to gen an image with two characters, I can't really get it to properly assign their features. Like if I want to gen an image with two characters, one tall with red hair and one short with black hair, it's pretty much a crapshoot whether I'll get two short characters, two black-haired characters, one short character with red hair and one tall character with black hair, etc. Is this just an issue of an inadequate model or can I mitigate this with different prompting?
>>106435895Nah it has that weird Flux artifacting
>>106435905use invokeai
Would anyone of you happen to have this?
https://civitai.com/models/137781/new-era-new-esthetic-retro-anime
https://boosty.to/girlsai/posts/29c18d14-bd36-4a15-a943-3bfb02ef371e
Or at least do you know when he intends to upload it to civit?
this is now a /invokeai/ general
Can I use this invoke whatever remotely? Invoke on main PC + Comfy on local server? If not, it's trash.
wake me up when the schizo leave
>>106436040 you schizo>>106436030 you sane person
>>106435839It's possible to just vibe code an UI on top of Comfy or whatever backend you like.I don't get the UI autism.
You couldn't just quit while you were ahead, could you... big bro?
>>106435827
I did the most effort I could muster: extracted panels with imagemagick, went through them and deleted the worst ones, masked away speech bubbles and borders, and let joycaption caption them. But after training it, I realized all the captions are wrong and that you have to give some negative examples (of speech bubbles, etc.) because suddenly it no longer knows the concept of a speech bubble.
But now it requires some real effort. I'm going to do it, just wanted to whine a bit.
Also, I think these comics might be naturally too "low resolution" for the purpose of loras. Like there's not going to be any detailed background characters and anything other than a portrait is going to be pic related. But learning French by reading Thorgal is my current project, so anyhow..
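If the manual correction pass is going ahead anyway, a small triage script can at least sort out which captions to fix first. This is only a sketch under assumptions: it presumes each image has a sidecar .txt caption in the same folder, and the suspect word list (taken from the raw joycaption example earlier in the thread) and the function name are made up for illustration.

```python
from pathlib import Path

# Hypothetical words that signal a known mis-caption in this particular dataset.
SUSPECT_TERMS = ("woman", "serpentine", "comic-style")


def captions_needing_review(dataset_dir, suspect_terms=SUSPECT_TERMS):
    """Return (path, matched_terms) for every .txt caption containing a suspect term."""
    flagged = []
    for txt in sorted(Path(dataset_dir).glob("*.txt")):
        text = txt.read_text(encoding="utf-8").lower()
        # Plain substring match; good enough for a first triage pass.
        hits = [term for term in suspect_terms if term in text]
        if hits:
            flagged.append((txt, hits))
    return flagged


if __name__ == "__main__":
    for path, hits in captions_needing_review("dataset"):
        print(f"{path.name}: {', '.join(hits)}")
```

It does not fix anything on its own; it only shrinks the pile you have to read through by hand.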
>>106436068I always knew she was the real culprit!
>>106435895
>>106435908
SDXL noobai vpred finetune
https://files.catbox.moe/kcbsoi.png
Here is a catbox with a raw gen
>>106436065just do it over diffusers so it's easy to plug in new models. no reason to make it complicated and wasteful with confy
>>106436068
now that the dust has settled and invokeAI is the new king, why would anyone ever use comfy again
>>106436091Not really related, but what do you guys use for making weapons to not look like regurgitated trash? I keep getting half-broadsword, half-katana abominations with melty hilts.
>>106436068Which model do you use for anime?
Is this usable in ComfyUI yet?
>>106435682Any AI and workflow i can use to generate perfect sprite sheets?
i gen'd like 2000 tortas today i dont know why i did thislike they aren't hot-thick they're obese fat>create anything machine>folder full of thousands of obese latinasis this what jensen had in mind????
all SOTA VLMs suck
Are all of the default workflows in comfyUI stuck in the past?I'm trying them and I'm getting pure garbage.
best interpolation node for wan in terms of quality?
>>106436091
Kino.
>>106436153
>what do you guys use
A good base model.
>>106436317FILM VFI
>>106436133Will share it in /adt/ thanks, the thread is good and fun and there is interesting technical discussion but we need more people like you to fight the lolis.
Change the text "Zelda" to "Miku". Replace the round egg with Miku Hatsune.
>>106436215type: "GOOD MORNIN! ;3" into pos field
>>106436400closer font:
huh
>>106435739
Two questions about Invoke.
What is missing in the community edition?
Is it true that there is hardly any choice of models and that it takes an absurdly long time for new releases to be integrated?
>>106436153
This is with the "moor" replaced with the "streets of medieval Kyoto". A foot shorter, clasp morphing into a chrysanthemum, katana? (I don't know Japanese swords).
So.. I don't know. Don't mention any Eastern stuff? I don't know if anime and manga are inherently Eastern. They might not be, I mean Miku bends to my Western lora pretty easily.
>>106436324
Thank you.
>>106436402Ok... i guess tech is not there yet.
Has anyone else tried the new InvokeAI Hires Fix yet? It uses a tiled ControlNet, so there are no more artifacts, and you can actually control creativity now.
It has 2 parameters:
1- A structure parameter for ControlNet tile weight
2- A creative parameter for denoise
Extra: the upscaler model and the same options that a KSampler has!
Inpainting is excellent too. No more downsizing to 1024x for consistent inpainting! Just select the BBox, set it to 1024x, and paint the mask and region prompt inside!
With these editing tools, SDXL is back on top. It's better than Adetailer.
Everything is on one screen. You never have to navigate through 8k pixels of nodes or touch a node again. It's right there, stable and simple, and above all not buggy
lol
>>106436435Invoke is dead lol their only MLE dev left the company months ago so they have only been doing paid api models since.
Daily reminder that Invoke is a failed AI startup that recently tried (and failed) to secure additional funding, and will soon go bankrupt. They are now relentlessly shilling here, Reddit, Discord channels, anywhere they can in a hopeless attempt to turn things around. It's so fucking obvious.>>106436463your company is crashing and burning and your shares will be worth nothing lmao
change the location to an alien planet with a futuristic spaceship in the distance. a large moon is visible in the background. keep the character in green, in the same style.
neat, input was pixel art (kings quest), output is pixel art
>Exception: INSTALL NEW VERSION OF PYAV TO USE API NODES.can comfy just fuck off with his paid api slop shilling already
>>106436507>your company is crashing and burning and your shares will be worth nothing lmao>>106436497 this was for you btw
>>106436455But I find it more interesting that when the gens don't conform to Japanese culture (though note the golden seal and the binding on the hilt) it reminds me of Junji Ito (the large head I mean).I hope I don't get banned for posting cartoon gore.
>>106436507
i still remember when it was a one man project by "lstein", much like auto1111, and was initially focused on being "the mac UI". weird how it grew and turned into a buyout/investor bait project
>stein
oh.
>>106436507>They are now relentlessly shilling here, Reddit, Discord channels, anywhere they can in a hopeless attempt to turn things around. It's so fucking obvious.Nah, it's just the same anon who is salty about reForge development being canned. They tried shilling Wan2GP a few days ago. Guess they're trying Invoke now. Tomorrow it'll probably be SwarmUI or something.
>>106436521change the location to a medieval era in a grass field with a castle nearby, and a red dragon in the sky shooting fire. in the distance is an army with plate armor and swords. keep the character in green, in the same style.
I love anime fennec girls, they're good with technology
change the location to the surface of the moon, with a space shuttle on the surface. in the distance are several moons of varying sizes in the sky, and stars. keep the character in green, in the same style.
qwen edit is a good pixel art generator, just take an old game screenshot and poof, similar style.
>Last thread devolves into 'muh professional workflow' and 'muh clients' with perfect reddit shill vernacular
>Invoke AI gets brought up as THE TOOL
>Opportunistic troll Anons latch on to it
>[(You) are here]
This is actually really funny to me.
>>106436435
Community edition has everything except API nodes and services. Only that. It's for gpulets that want to run SD 3.5, for example. But the samplers, utilities and extensions are the same. I did that research myself some hours ago before downloading
>>106436595>>106436619pretty cool
>>106436435
Invoke runs any SD variant and Flux
Not qwen yet
Not any video model yet
>>106436604>python/jeetscript abomination shitheap>good at technology
>>106436507Meds
>>106436639you forgot comfy crawling out of the griftspace to fud invoke because he's jealous
>no bounce lora for 2.2>no triple sampler no lora to light loras workflow for 2.2never began.
>>106436639im not here im there
>>106436525CREATE AN ACCOUNT TO CORRECT BUGSALL ERRORS AND COMPATIBULITY IS BECAUSE YOU DO NOT HAVE A GMAIL ACCOUNT WITH COMFY YOU FUCKING BIN LADEN! ATTACH YOUR GMAIL
>>106436153Wait, do you mean like vanilla Chroma? My advice is to just train any small (non-realistic) style lora. I've found that suddenly concepts that it cannot express in photorealistic images just suddenly bubble up from its ancestral memory. V48.
can someone port forward their comfy instance so I can make groid porn please?
>>106436660i thought the narrative was that comfy paid Panchovix to shut reforge now. now he's funding invoke because he's jealous..? jesus christ these shitposters lmao
>>106436650>cannot enjoy cute anime fennec girls without seething about a dev:(
>>106435739Simple question for Comfy txt2img or img2img users.Can comfy do pic related in two clicks?Haters
>>106432611
I find traditional media difficult to elicit from Neta. At least, more so than Noob.
change the location to a Japanese temple in Tokyo, with several lit lanterns. the sky is blue and it is cloudy. keep the character in green, in the same style.
in theory you could make an old school rpg with qwen edit backgrounds.
>>106436710no fennec girl that is good at technology plays in a city sewer getting covered in fecal matter
>>106436507
Ok.
First: show a URL or some proof of that? Because the Invoke community is growing day by day.
If you don't like the UI because it also has a paid service, 'daily reminder' that Comfy has one too.
>>106436725
>>106436497
Shut up you, go and fix your own bugs before speaking about another UI!
>>106436497
>Invoke death
URL?
Proof?
Stock market graph?
Some random Twitter comment?
Anything?
lora manager for comfyUI was updated to include much better folder navigation. If you use tons of loras, this is simply a must-have. honestly it should be included by default
>>106436714
https://github.com/Acly/krita-ai-diffusion
Anyway, not the point of /ldg/. Go to /ic/ and be the change you want to see in the world.
>>106436760Fix your bugs first
>>106436760>If you use tons of loraspromptlets you mean
>>106436766actually you have to fumble around with the stupid ass krita config for little things like changing the negative prompt which is really annoying. nice try though
>>106436766
There is a whole post in the last thread about why Krita sucks and is not reliable for actual professional work and team/client compatibility.
You AI hobbyist.
>>106436766>diffusing locally is not the point of the local diffusion threadthis is the level of retardation in these generals.
>>106436725change the location to a sunny beach, with women in bikinis walking on the sand. The ocean is nearby and a large vacation resort is in the background. the sky is blue and it is cloudy. keep the character in green, in the same style.
>>106436766>peopke who uses AI on weekends to have funFuck you neet
>>106436781No amount of prompting is going to get you niche/western/unpopular characters.
>>106436760can you disconnect it from civitai entirely?
>>106436760
Do the loras come attached to a node, or do I have to click the lora, then click a node, and then make the connection?
You silly shiller, think before you post something this stupid.
FIX YOUR BUGS!
What happened to the Chroma schizo? Evolved to InvokeSchizo?
>>106436811>prompting existing characters instead of unique creations.it's beyond promptlet at this point, its brainlet.
>>106436814
It's not really "connected" to civitai? It only uses civitai to fetch metadata.
>>106436816
You click a button and it sends the lora to the lora manager node.
>>106436792
one more, just to test something silly
change the location to a sunny beach near a volcano. The ocean is nearby and a large vacation resort is in the background. The sky is cloudy. The volcano is erupting lava and is filling the ocean with lava. keep the character in green, in the same style.
neat
>>106436833Yes Anon some people want to create art of existing characters not OC's. That has nothing to do with prompting. You can fuck off with your retarded logic now.
>>106436785For actual professional work you are told what tools to use. Did people in the 90s go on tantrums about layers or whatever photoshop introduced?
>>106436854and that's why ai chuddies will never be artists, all you can do is consume someone else's creations.
It also has a Firefox/Chrome extension that lets you easily download loras and have them automatically updated in Lora Manager.
comfy should be dragged out on the street and shot
>>106436880Artists draw other's creations all the time. Again, you are retarded.
>>106436834>You click a button and it sends the lora to the lora manager node
Flux had Flux Girl
Qwen has Qwen Girl
despite not being distilled
I dunno what this derives from
it's almost like it REALLY just wants to make everyone Asian even when prompted otherwise but is toeing the line or something lol
>>106436834>It's not really "connected" to civitai? It only uses civitai to fetch metadata.Yeah ... I don't want that.
>>106436882>unnecessary extensions while your UI shits itselfIt reminds me of the eve of the French revolution when France was starving and the kings were doing unnecessary things that did not dialogue with the reality of the population.
>>106436900
It doesn't get any easier than that. I don't know what more you want.
>>106436906
I'm confused. Perhaps you are implying you are making illegal LoRA's and are afraid it is sending metadata to Civitai or something? That's not how it works.
>>106436913Vpred sloooop
>>106436882
Do people actually run Comfy like pip install -r everything.txt? I read about that malware in a custom node just before getting into local and decided to run all AI related stuff in rootless podman.
>>106436927How many screens must I have to use this extension nd node with a simple workflow?No, I wont scrooool No, I wont zoooooming out
anyone like to share their workflow for qwen image edit for styletransfer with 2 input images?I can't figure it out right now. I am attaching two latents together, but it should only output one image instead of two next to each other.Looks like I'm too retarded.Ty anon
>>106436955https://www.reddit.com/r/StableDiffusion/comments/1myr9al/use_a_multiple_of_112_to_get_rid_of_the_zoom/
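Going only by the title of that Reddit link (the actual reason for the 112 figure is not explained in this thread and is not verified here), the tip comes down to snapping your width and height to a multiple of 112 before feeding Qwen Image Edit. A minimal sketch, with hypothetical function names:

```python
def snap_to_multiple(value, multiple=112):
    """Round value to the nearest multiple of `multiple`, never below one multiple."""
    return max(multiple, round(value / multiple) * multiple)


def snap_resolution(width, height, multiple=112):
    """Snap both sides of a resolution independently."""
    return snap_to_multiple(width, multiple), snap_to_multiple(height, multiple)


if __name__ == "__main__":
    print(snap_resolution(1024, 1024))
```

Snapping each side on its own shifts the aspect ratio slightly, so crop or pad the input to the snapped size instead of stretching it.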
>>106436955
Anon listen to me.
Qwen works PERFECTLY in Comfy
Qwen works PERFECTLY in a third world GPU.
But they will never share a Qwen workflow.
They will throw you rentrys and github tutorials.
>>106436927
No anon I literally just want to edit my own lora metadata because civit likes to purge models out of the blue and relying on them is a recipe for disaster.
You know what, I'll just steal this code and do it myself.
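For the do-it-yourself route, the metadata embedded in a LoRA can be read without touching civitai at all, assuming the file is a .safetensors. The format starts with an 8-byte little-endian header length, followed by a JSON header whose optional `__metadata__` key holds free-form string pairs. This is a read-only sketch with a hypothetical function name; it says nothing about how Lora Manager lays out its own metadata.json.

```python
import json
import struct


def read_safetensors_metadata(path):
    """Return the __metadata__ dict from a .safetensors header (empty dict if absent)."""
    with open(path, "rb") as f:
        # First 8 bytes: unsigned 64-bit little-endian length of the JSON header.
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len).decode("utf-8"))
    return header.get("__metadata__", {})


if __name__ == "__main__":
    import sys

    for key, value in read_safetensors_metadata(sys.argv[1]).items():
        print(f"{key}: {value[:80]}")
```

Editing means writing a new header and copying the tensor bytes after it unchanged, so keep a backup of the original file before trying that.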
i wish neta was good already
>>106436903Do normalfags really find this attractive?
>>106437022>I literally just want to edit my own lora metadatayou can do that.
>>106437046no
whats the goto shit for controlnet setups these days bros?ControlNetPlus looks the most complete, anything newer?
>>106436934
pip installs are perfectly safe. It's the unreviewed extension code that is the problem.
>run all AI related stuff in rootless podman.
Good call.
I am doing the same with docker.
>>106437053the fuck is that trigger word
>>106437104forge
>>106437124
I guess they wanted it to be really unique. Either way, you can edit any and all metadata if you don't want the metadata from Civitai.
>>106437022
>civit likes to purge models out of the blue and relying on them is a recipe for disaster.
Also forgot to mention, the metadata.json is of course saved locally so even if civitai purges something, you'd still have it. You can also optionally download all examples the creator used. Everything is saved locally.
>>106436934docker is a pretty popular method too. shit is too amateur to trust the software to detect threats. continue what you are doing. there was malicious npm repos lurking about recently too
>>106436231The only thing in his mind is money and real bitches
I really want to dump $2.5k for a 5090, but honestly I should just wait, save my money for a 6090 in 2027, which will probably cost $4k. The 6090 will probably be way better than the 5090 since NVIDIA is 100% focusing on AI now. I'm expecting 48gb vram+ at minimum.Buying a 5090 now, especially with the connector heating issues it has, would be something only a retard would do.
>>106436827Chroma is abandoned, unfortunately. There was a big meeting and everyone decided that Chroma, Qwen, Flux, Lumina, SDXL, Pony v7, Wan, and HiDream all suck and that we're moving back to SD1.5.
>civitai still doesn't have a category for chroma
>qwen got it basically from day 1
>wan got it after 2 days
what the fuck is taking so long? are they simply afraid of chroma because it's uncensored?
change the location to a pond in a forest. a tent is nearby with a campfire outside it. keep the character in green and in the same style.
all based off a kings quest game screenshot, and in qwen edit. pretty cool imo
>>106436176Link to paper?
>>106437182>The 6090 will probably be way better than the 5090 since NVIDIA is 100% focusing on AI now. I'm expecting 48gb vram+ at minimum.Lol no. They deliberately VRAM gimp consumer cards to segment them out of enterprise/server cards that cost a lot more. It was surprising when 5090 got revealed to be 32gb. I wouldn't expect any bump till 7090 or whatever.>Buying a 5090 now, especially with the connector heating issues it has, would be something only a retard would do.Well you can decrease power limit to lower the odds of that happening. But yes Nvidia fucked up big time with 12vhpwr design.
>>106437217
No one is making LoRA for it (at least not releasing it there), I just saw a furry LoRA and they trained it on flux lol
>>106437252>Well you can decrease power limit to lower the odds of that happening.Forgot to add that up to a point it shouldn't affect slopping performance too much.
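For anyone who wants to try the power limit route, the cap is set with `nvidia-smi -pl <watts>`, which normally needs admin rights. A small sketch that only builds the command; the helper name is hypothetical, and the 575 W in the usage line is an assumed stock limit, so check your own card with `nvidia-smi -q -d POWER` first:

```python
def power_limit_command(stock_watts, fraction, gpu_index=0):
    """Build an nvidia-smi argv that caps board power at fraction * stock_watts."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    watts = round(stock_watts * fraction)
    # -i selects the GPU, -pl sets the power limit in watts.
    return ["nvidia-smi", "-i", str(gpu_index), "-pl", str(watts)]


if __name__ == "__main__":
    # Assumed 575 W stock limit, capped to 80%; prints the command instead of running it.
    print(" ".join(power_limit_command(575, 0.8)))
```

The driver rejects values outside the range the card allows, and the setting usually does not survive a reboot, so it has to be reapplied.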
>>106437252
>They deliberately VRAM gimp consumer cards to segment them out of enterprise/server cards that cost a lot more
Yes, but what's stopping them from upgrading enterprise/server card VRAM and making 48gb vram the new baseline for consumer cards? That's what I'm saying. Or they can just increase the cost of the 6090 to be I dunno, 30~50% the price of enterprise/server cards.
>Well you can decrease power limit to lower the odds of that happening
You shouldn't have to worry about decreasing the power to prevent your house from burning down though. It's just a flawed design.
>>106437217Qwen releasing sorta overshadowed Chroma. Don't get me wrong. It's worse than Chroma for NSFW realism no question, but Qwen Edit makes it appealing to normies.The ones making the real sketchy NSFW realism loras are not sharing them. No one is using Chroma for anime of course.
>>106437217
>chroma is uncensored and they don't want to draw the ire of payment processors again by officially supporting it in any way
>virtually nobody outside of this thread has switched over to chroma, for various reasons
>>106437291Nah, they are afraid of what it can do out of the box. It needs no tune.
>>106437276
>Yes, but what's stopping them from upgrading enterprise/server card VRAM and making 48gb vram the new baseline for consumer cards? That's what I'm saying.
I am interested in armchair GPU design discussion. Long story short, they most likely won't.
>You shouldn't have to worry about decreasing the power to prevent your house from burning down though. It's just a flawed design.
I am NOT excusing anything. But realistically you don't have too many options if you want >24GB VRAM for slopping right now.
Alternatively you can get those Chinese 4090s with 48gb or 96gb VRAM.
No warranty and you gotta kick rocks if you are scammed but it's an option.
>>106437300"switch over"? what does that even mean. each model has their weaknesses and strengths. you dont need to only use 1. sdxl for anime, wan for video, qwen for image edits, chroma for nsfw realism.these are the golden ratios
>>106437291I don't think that the "it's not popular enough" and "it was overshadowed" lines of thinking explain the entire reason. All they would have to do is add a tag for it in the filters. People have explicitly asked them to do that and they've ignored it. CivitAI seem like they have an actual motive for not wanting to support it.
>>106437217have you checked if there's even a feature request open for it on the CivitAI bugtracker thing? It could be that nobody ever officially asked for it
>>106437313Chroma is like SD 1.5 but on steroids. Every model that is "controversial", "bstaber" etc... combined. That scares (them)
>>106437316
nooooooooo you can ONLY use one model, there HAS to be a uniform /ldg/ meta at all times! Conform conform conform, things can NEVER co-exist
>>106437325>CivitAI seem like they have an actual motive for not wanting to support it.you can generate a realistic naked child sucking dick without doing any additional bullshit. that's why. >>106437332this
>>106437332it has precisely zero raw knowledge that numerous previous models didn't also have. The only thing that sets it apart is having been captioned with high-quality natural language descriptions.
>>106437350That's probably the reason, yeah. Doubt CivitAI wants to deal with everything that would involve.
>>106437316>chroma for nsfw realismBarring a handful of exceptions, people outside of this thread that are interested in this are still using SDXL or Flux. Chroma hasn't caught on much.>you dont need to only use 1You don't, and it's retarded to do so, but a lot of people do.
We need a manhattan project for AI.And no, not for AGI.We need the new architecture, we need market-ready neural chips.Right now they're trying to rebuild a steam-powered locomotive to fly to the moon with enough steam instead of finishing the rocket engine that's already in the drawer.I am the only one in my circle of friends who studies the state of development and the new development paradigms. Most of them have heard about it, but don't understand the revolutionary nature of this development.You?
>>106437350"realistic" is a stretch anon, people gotta stop pretending that Chroma just shits out Flux Krea type gens at the drop of a hat as opposed to being more in line with Pony Realism unless you put in a decent amount of prompting effort. It's a furry model that he just happened to train on other stuff too, it was never intended to be some kind of Ebin Real model, people just can't help but assign their personal desires to things they aren't involved with developing.And like it would have been understandable if Chroma WAS literally just FluffyRock Flux Edition with no other kinds of data, given what Lodestones was previously known for. The guy gave us more than he "had" to by any metric.
>>106437399>as opposed to being more in line with Pony Realism unless you put in a decent amount of prompting effortpony doesn't come close to a chroma lora trained on realism.
>>106437361
The parameter count alone makes it much better at depicting what it knows than anything that came before it. The model is as good as Dalle 3 in coherence (which was already pretty "dangerous" at prompt following capability) and also photorealistic on top of that. Other models, even if they saw the same data, wouldn't get close to this prompt following ability, and where they did on basic prompts, the result would look like a render.
Something I've been meaning to ask: is there a need for models to be getting larger and larger? We're at the point where newer models are so big that the people known for doing large scale finetunes are saying "fuck that." If someone made a model that was basically just an SDXL-sized Qwen, would it actually be that much worse than the current 20b model?
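To put rough numbers on the size question: the memory needed just to hold the weights is parameter count times bytes per parameter. The sketch below assumes the 20b figure from the post and roughly 2.6b for the SDXL UNet (an approximate figure, not from this thread), and ignores text encoders, VAE and activations entirely.

```python
BYTES_PER_PARAM = {"fp32": 4, "bf16": 2, "fp8": 1}


def weight_gib(params_billion, dtype="bf16"):
    """GiB needed to store the weights alone at the given precision."""
    return params_billion * 1e9 * BYTES_PER_PARAM[dtype] / 2**30


if __name__ == "__main__":
    # 20b is from the post; 2.6b is an assumed approximate SDXL UNet size.
    for name, size in (("20b model", 20), ("SDXL-sized", 2.6)):
        for dtype in ("bf16", "fp8"):
            print(f"{name} @ {dtype}: {weight_gib(size, dtype):.1f} GiB")
```

That gap is a big part of why large scale finetuners balk: training also needs gradients and optimizer state on top of the weights, so the real requirement is several times these numbers.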
ty to anon who recommended the quantized wan2.2 14B
civitai won't even open for me. I wonder if my state is blocked like with porn sites. At any rate after using chroma for a while I find it useful but accept a lot of the complaints about it. Its inconsistency reminds me of an undertrained lora. But I can get the same effect with a lora (or rather embedding is where I saw it in the past) that has a dataset tagged in a contradictory manner so that it can't properly converge. Its generation is for some reason noticeably slower on my machine than pixelwave flux. A pull might help with that (I am quite out of date).
>>106437414how is this a sensible response to what I actually said lmao
>>106437437noice
>>106437439you're comparing it to pony realism, which annoys me. not remotely close in quality
>>106437438Chroma is just slow, I don't think a pull would help all that much.
>>106437394Only people building this would be glows and only use it for oppression.
>>106437438A 4k step Chroma lora at rank 68 at 1028 res takes me like 8 hours to train. Worth the wait though. Just run it overnight, problem solved.
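Those figures work out to about 7.2 seconds per step, which is handy for guessing how long a different step count would take on the same setup. A trivial sketch with hypothetical helper names, assuming step time stays constant across the run:

```python
def seconds_per_step(total_hours, steps):
    """Average wall-clock seconds per training step."""
    return total_hours * 3600 / steps


def hours_for(steps, sec_per_step):
    """Estimated hours for a run of `steps` at a fixed step time."""
    return steps * sec_per_step / 3600


if __name__ == "__main__":
    rate = seconds_per_step(8, 4000)  # figures from the post above
    print(f"{rate:.1f} s/step, 2000 steps would take about {hours_for(2000, rate):.1f} h")
```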
>>106437217Yeah it's pretty strange. If I had to guess there's probably some petty disagreement with the usual suspects which would also explain the hysterical hatred of Chroma.
>>106437380>Barring a handful of exceptions, people outside of this thread that are interested in this are still using SDXL or Flux. Chroma hasn't caught on much.Most of them are also blissfully unaware of Chroma's existence, what it can do, or how to prompt it properly.
>>106437462I'm uninformed. Who are the usual suspects?
>>106437217That still doesn't really add up, because it's not like you can post sketchy loras on Civitai even if the model allows you to make that type of content.
>>106437182This kind of thinking never EVER works out. I've been seeing this kind of magical thinking that the next generation of GPUs will somehow be worth it over the current one. The fact is you're just scared to take the financial plunge now and think you will be in a better position in a few years.
>>106437475Tech trannies
>>106437426
> DALLE 3 revisionism
it was never that good anon, it has SDXL base tier aesthetic at BEST in even the most unrestrained scenario, and the spatial awareness really isn't that good compared even to Flux Dev in many cases
Next you'll be asking "when will local catch up with Midjourney" despite Midjourney V7 being fairly behind numerous local and non-local models that already exist on basically any benchmark chart or leaderboard you could view
>>106437467>Most of them are also blissfully unawareWhat if I'm blissfully aware of how shit Chroma is and I become actively annoyed when people act like it isn't a deeply flawed model that trades literally everything that makes an image good for the ability to generate blurry vaginas and penises?
>>106437399Literally just asking for a photograph of anything gives you good results 99% of the time. Not even using additional style keywords. Just don't suck at prompting.
>106437492promptlet
>>106437481
5090 is 40% faster than the 4090 so there's some truth to it. Of course, that's mostly because it has higher vram capabilities, but still.
>The fact is you're just scared to take the financial plunge now and think you will be in a better position in a few years.
IMO, if you buy a gpu, you should plan on using it for 3-4 years before upgrading. If I buy a 5090 now, I'd basically have to skip the 6000 series in order for it to be worth it.
How long do you guys think that something like Nano Banana will be done on a local level? Image editing with AI that keeps the image consistent is what I've been waiting for and it's almost here. The biggest issue is how pozzed everything is with "muh ethics"
>>106437235
>>106437451"realistic" gens with Chroma LITERALLY resemble Pony Realism quite strongly in many cases unless you really prompt hard and use exactly the right sampler, and even then it has primarily a super raw "muh amateur iphone selfies" type aesthetic, which isn't the be-all-end-all of realistic gens
>>106437500>Still no chroma category on civit>No more epochs to burn cash on>Nobody discussing Chroma outside of a few sunk cost fallacy retards here.I bet even the discord has forgotten chroma and is hyping up the qwen tune by this point.
>>106437235Qwen edit is really good. Too bad it's slow as shit.
>>106437520it's not purely about looks but prompt adherence and how well it can do real content. chroma has much more flexibility in what it can do. any model can make a realistic 1girl just standing doing nothing.
Chinese Communists launch offensive against Nvidia: https://e.huawei.com/cn/products/computing/ascend/atlas-300i-duo
>>106437544Yeah but that added prompt adherence comes at the cost of genning mutants half the time on chroma
>>106437399You could just say "I couldn't get the model to work how I wanted" instead of projecting.>>106437438What's your prompt? Chroma is slow as hell but it's giving me the best results for anything even vaguely creative these days.
>Chroma is slow as hell
This is the reason it hasn't caught on. It really is that simple. The #1 complaint about Chroma outside of this thread is that it's slower than shit.
>>106437559already posted on reddit. those huawei gpus are shit by comparison.
>pc keeps running out of hard drive space every time i generate
what the fuck? is it normal for usage to fluctuate and swing this wildly? 300+mb????
>>106437559>LPDDR4X 96GBthe fuck is this
>>106437580nunchaku support for chroma soon
>>106437583>those huawei gpus are shit by comparisonFor now.
>those amd gpus are shit by comparisonFor now.
>>106437587
2 more weeks?
The Chroma cope is off the chart today.
It's done. Everyone is moving on. I'm glad you enjoyed genning on the epochs while they were being trained but it's obvious to everyone now that Qwen will be the base to train on going forward.
>>106437580
Being slow is one thing.
Waiting through all the genning to get greeted with deformed anatomy is a whole other.
It's the only NLP and porn (out of the box) capable model out there.
People would wait if it wasn't broken.
>>106437604This is bad bait, you should have said Wan instead of Qwen.
>>106437604Qwen is censored. Chroma is not.Why can't you get that through your thick skull? No shitmix/lora is going to make Qwen comparable.Go on Reddit and look up every post asking for the best model for realism. Chroma is always at the top. Now Fuck off.
>>106437580
>Chroma is slow as hell
3 minutes of genning is really boring. and all the fast chromas give me monsters.
Have you ever tried to write a prompt that includes every letter of the alphabet?
>absurdres, bare shoulders, closed mouth, dress, earrings, full body, green eyes, hand on own hip, interior, jewelry, kneehighs, lips, miniskirt, narrow waist, ojou-sama pose, pointy breasts, qingxin flower, ribbon, smirk, tight clothes, underwear, very long hair, wet, x hair ornament, yuri, zettai ryouiki
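For anyone who wants to check a prompt like that without counting by hand, here is a rough sketch. It assumes "includes every letter" means one tag starting with each letter, which is how the prompt above reads; the function names are made up for the example.

```python
import string

# Tag list copied from the prompt above.
prompt = (
    "absurdres, bare shoulders, closed mouth, dress, earrings, full body, "
    "green eyes, hand on own hip, interior, jewelry, kneehighs, lips, "
    "miniskirt, narrow waist, ojou-sama pose, pointy breasts, qingxin flower, "
    "ribbon, smirk, tight clothes, underwear, very long hair, wet, "
    "x hair ornament, yuri, zettai ryouiki"
)


def split_tags(prompt_text):
    """Split a comma-separated booru-style prompt into stripped tags."""
    return [t.strip() for t in prompt_text.split(",") if t.strip()]


def missing_initials(prompt_text):
    """Return the letters a-z that no tag starts with, in alphabetical order."""
    initials = {tag[0].lower() for tag in split_tags(prompt_text)}
    return sorted(set(string.ascii_lowercase) - initials)


tags = split_tags(prompt)
print(len(tags), "tags; missing initials:", missing_initials(prompt))
```

An empty "missing" list means every letter is covered by a tag initial.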
>>106437614>you should have said Wan instead of Qwen.Read the whole post. I said base to TRAIN on.>>106437616>Qwen is censored.As I said, read the post.What a bunch of retards.
>>106437624I was able to get decent results pre-v50 with some of silveroxide's loras, but they don't work well with HD and they're all located in his obscure huggingface repo without much further information.
it miku bday :)
An anime style Miku Hatsune sitting at a table in a kitchen. A vanilla birthday cake with strawberries is on the table, with lit candles and "8/31/25" in red icing on the cake. Miku is smiling. On her arm is the text "01" in red text.
with qwen q8: usually I use waiv14 or hassaku for anime but I wanted to see what qwen can do. text is good. but you have to add the 01 prompt otherwise you get a barcode on her arm.
>>106437569way to miss my point which was that people are dead set on insisting Chroma is what they personally want it to be and not what it actually is (and this applies both to realismfags and animefags)
>>106437616I only ever see people give the qwen to wan combo tip.
This troll will say any model is better than Chroma. Even SDXL. Or SD 1.5 for that matter. All his arguments are nonsensical.
>>106437569> What's your prompt?I didn't save it but it is using the Royo lora I posted a while back.>>106437627I have not but I noticed that with pixelwave flux when I give prompts in latin it seems to default to old European-style woodcut paintings for some reason.
If we just get a millionaire to bankroll people's finetune attempts and then wait a year for them to finish then surely the Qwen ecosystem will truly thrive. It's the perfect base model.
>>106437559this is old news, these cards have existed for a while. the fact that people are only mentioning them now just shows how useless they are
>>106437616this has to be baitChroma is an architecture mod + heavy finetune of Flux Schnell, a VERY censored model. Which is fine, I like Chroma a lot for some use cases, any sort of weird hardcore NSFW you'd previously have maybe used Pony for it can do way easier, for example.I'm not overly "sold" on Qwen personally for several reasons but I'm the sort of guy who always tries to give every model a chance, hence why I'm currently baking this blowjob lora for example
>>106437663
>This troll
I'm not the one making the arguments that 1.5 is better. I'm just saying that Chroma is over and there will be no mass adoption because better models to train on exist now and it would be pointless to spend any more compute on a model with busted anatomy no matter how long you train on it.
Flux schnell being a distill and actively resisting training from the get go was a huge indicator that the whole venture was ill conceived and it was a miracle that Chroma got as far as it did.
I know that hurts your ego as you've spent a long time molding your personality around Chroma, but it's the objective truth.
very poor attempt
>>106437704I'm not attempting anything I'm just laying out the facts.
>>106437694You like genning slop, we get it. You don't have to derail every discussion on Chroma just because you like slop.
>>106437683I don't care at all about the Chroma argument, but could you share your training settings?
>Chroma was made by putting Schnell in a blender and then trying to piece it back together, leading to obvious issues in the final model>Qwen is aesthetic-tuned to such a retarded extent that overcoming its base slop tendencies is almost impossible, and it's so large that nobody will ever properly finetune itSD1.5 is literally better
>>106437715
I don't get why you're getting so mad at me?
Are they still training Chroma? No
Will they move to a Qwen based model? Most likely.
Has any space outside of a few people here adopted Chroma in any capacity? Not really, no.
Is there a Chroma category on Civit? Not yet.
Are there serious and systemic issues with anatomy, background coherence, noise and nonsensical objects in Chroma gens? Yes.
I don't get why I'm a troll for pointing this all out to you when you ask why it seems Chroma is failing.
https://arxiv.org/abs/2502.12154
So was this model guidance just Chinese bullshit?
But why do you need to gen anything but 1girl?
>>106437627Kneehighs and ZR don't really go together, and neither does solo yuri.
>>106437674fucking horrible
>>106437741No one outside of any small space has significantly adopted local models anon. Don't make me remind you of MJ numbers compared to local.
>>106437757I feel like 1girl, especially anime 1girls, is basically a solved problem since sdxl-based fine tunes. And realistic 1girl is mostly solved. The video models are I guess useful too if you masturbate to five second gifs but they have no other value.
>>106437650and of course, you can use wan 2.2 to make it move.
>>106437741
>>106437781I know that it's a solved problem. It's also the only thing worth genning, so what's the point of all of these new models?
>>106437757anon, have you thought about ... 2girl?
>>106437801Retarded. Stupid. If I wanted 2girls I would just gen 1girl twice. Are you even thinking?
>>106437800It seems like the second biggest use case based on the thread are dumb political memes. I suppose these need newer models.
>>106437243https://github.com/bytedance/USO
>>106437716
I think I have a couple of times, here they are again as far as what's worked well for me on TensorArt so far. Only things to note beyond what's in the pic are:
- that their default "Qwen" trainer option there is the FP8 ComfyOrg safetensors, but you can choose the BF16 safetensors upload through the "Custom" menu for no extra credits cost, so I typically do that just cause why not.
- the number of scheduler restarts is 3, that's the one thing they don't list for some reason in the external settings breakdown.
- your number of epochs might vary, of course, 54 there was arbitrary (I personally never use any repeats though for any model as I find the impact is objectively worse always than just doing more epochs)
- their trainer backend is custom AFAIK but it seems to strictly use "Kohya scaling" for Dim, and for Qwen what that means is a Dim 16 lora will come out around 280 MB, a Dim 32 one will be like 590 MB, and a Dim 64 one (which would probably cause the run to fail if you even tried it lol) would presumably come out at like more than 1 GB theoretically
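The Dim vs file size numbers above line up with how LoRA works in general: each adapted layer gets two small matrices whose parameter count grows linearly with the rank, so doubling Dim roughly doubles the file. A rough sketch of that arithmetic follows. The only real inputs are the two reported sizes (280 MB at Dim 16, 590 MB at Dim 32); the layer shape in the example is a toy value, not Qwen's actual architecture, and the Dim 64 figure is an extrapolation, not a measurement.

```python
def lora_param_count(rank, layer_shapes):
    """Parameters added by LoRA: per layer, A is (rank x d_in) and B is (d_out x rank)."""
    return sum(rank * (d_in + d_out) for d_in, d_out in layer_shapes)


# Sizes reported in the post above, in MB, keyed by Dim (rank).
reported_mb = {16: 280, 32: 590}


def estimate_size_range_mb(rank, reported):
    """Extrapolate a (low, high) size range assuming size is linear in rank."""
    per_rank = [size / r for r, size in reported.items()]
    return rank * min(per_rank), rank * max(per_rank)


# Toy layer shape, purely illustrative (NOT the real Qwen layer sizes).
toy_layers = [(3072, 3072)]
print(lora_param_count(16, toy_layers), lora_param_count(32, toy_layers))

low, high = estimate_size_range_mb(64, reported_mb)
print(f"Dim 64 estimate: {low:.0f} to {high:.0f} MB")
```

Under that linear assumption a Dim 64 file would land somewhere a bit over 1.1 GB, which matches the "more than 1 GB" guess.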
>>106437786cutting a cake slice:
>>106437580>>106437624Use chroma cache. Big savings on 30 steps.
>>106437741These generals have a very very strong contingent of people who can’t just be happy with the method they’ve found comfortable, they also NEED to police others into doing it their way as well. (Chroma, ani, comfy, anti forge, and on and on and on). This space has enough going on that we can all just come together to goon in harmony, but no, you CANT USE anything besides comfy, kys if you do, seems to be the idea instead. It’s “my dad works at Nintendo and he could beat up your dad” for the zoomer generation I guess
>>106437846I'd just use noob instead of going through that shitty wait time
My uncle works for Nvidia and he said he'll double GPU prices if you don't start using Invoke
>>106437827nice, very crisp, hard to tell it's only 640x640 at a glance
>>106437868tell him namaste ringwald, he'll know what it means
>>106437741Why would Qwen succeed if the masses refuse to run Flux? There's not going to be any mass post-SDXL model adoption until Nvidia makes 24GB the gaming standard.
>512x512
>no artist tags
>no characters
>mangled limbs
>melted noisy nonsensical details
chroma isn't slow for me, but it's simply not worth using.
>>106437580Unfortunately nunchaku failed us. But Chroma 1 HD Flash is fast and good enough for VRAMlets.
If someone would make a T5 natural language SDXL, or an equivalently sized model, then that would be the new standard.
>>106437752it's just snake oil
>>106437896That's due to your fundamental misunderstanding that Chroma resisted adoption due to its size.I assure you, most people downloaded it, genned a few nonsense pieces and deleted it. Take a look at any workflow to make it even generate anything passable. BONG TANGENTS? what the fuck is that.
>>106437904
>Flash
Stop shilling this nonsense. It dumbs down the results too much. Not as bad as the earlier versions, but still
pros/cons of running comfy in docker instead of natively on the host? security benefits? performance impact?
>>106437904
>Chroma 1 HD Flash
>good enough
lol
Did nunchaku ever say they weren't going to do chroma or is it still on their roadmap?
>>106437917u look like a bitch if u use docker
>>106437900As opposed to what?
>>106437886>>106437788if this lil nigga left 4chan forever this general and board would be greatly improved overnight
>>106437896It wouldn't succeed. It's 20B so training that instead of Chroma would be impractical. The reason Chroma doesn't know artist tags is because Flux was so massive and it's hard to make a dent on these models. Qwen would be way worse. It's so hard baked that every seed looks the same.
>>106437683
Semantics I know, but calling it a finetune of Schnell really isn't accurate either.
Also for anybody dumb enough to fall for the bait, there is no qwen-based Chroma. Lode is working on switching from the t5 encoder to a Qwen2.5 based encoder.
>>106437930It's still there. 90% of this thread is bait.
>>106437942>Qwen2.5 based encoder.Wait really? I was trying to figure out how to do this shit myself. Good shit.
>>106437825
sample output for epoch 10 looks relatively promising already I think. I'd hope it would be though as I was schizo tier autistic about making sure the captions were perfect down to like, ensuring they didn't mix up hand side or say that someone was licking when they were really sucking and so on and so forth. Every single image is of a completely different chick (or pair of chicks for a few) with a pretty big race / age range so it should be about as appearance-agnostic as it possibly could be, also.
https://files.catbox.moe/20fonu.png
>Talking with Qwen users
>"Qwen is pretty slopped"
>"Haha yeah, but it's got its upsides too"
>Talking with Chroma users
>"Chroma has a few issues"
>"WHAT? How fucking DARE you insult my model? I'll have you know it's the ONLY uncensored model in the world. It's literally perfect in every single way. Workflow? Post gens?"
>>106437908
>SDXL
>4b params (extra 1b)
>16ch vae
>1.2k base res
>fixed vpred/color implementation
it's that simple, yet furries decided to throw $150k at flux schnell only to encounter the same anatomical issues everyone who even attempted to train a flux lora warned them about.
>>106437912Not an argument. Qwen has even less local adoption.
Why does every Chroma argument sound like a cornered dog viciously lashing out at anything approaching it?
>>106437825How many images do you have for this one? And I see batch size is undefined, do you know what's being used?
>>106437990This should be easy to verify. I'll just check the Qwen usage against the Chroma usage on Civi- oh wait.
You should use positivity about your favored model and post good gens people want to imitate to get others to use it rather than attacking the models you don't like via negativity. If you do this, everyone will just leave the model you don't like behind without you even having to discuss it if the other one is actually better. But I think what is going on is that outside of a few use cases all the models are so similar in capabilities that you can't really do this so we get console war crap.
>>106438011I'll just check qwen downloads vs chroma on hf. oh my.
>>106437967terrible. throw it in the collage!
>>106437908
no it wouldn't lol
Kolors is and was effectively that (a ground up model on a different much more aesthetically pleasing dataset than Base SDXL using the SDXL arch, with ChatGLM 3 6B as the encoder) and nobody fucking gave a shit (despite the fact that it trained great, just like SDXL basically but with proper NLP understanding, really easy to teach it NSFW in the lora experiments I did).
Do not even start trying to tell me the license was "bad", either, it wasn't to anyone who actually understands fucking English, you can go to their HuggingFace and see that it basically just has a clause saying SaaS operations who have in excess of 100 MILLION monthly users need to request a special commercial use license. Which is whatever, I don't understand why randos who literally only care about porn loras and shit pretend like these commercial use clauses are even relevant to them for any model anyways.
TLDR There is no way you could modify actual base SDXL to be better than what base Kolors already is, I promise you.
Also why are you pretending like Lumina 2 doesn't exist? It does, it's got a 16 channel VAE, it's only a 2.6B param model, Gemma has a context of 8192 vs T5's 512, and we've even seen at least one full scale proper anime finetune of it already with relatively promising third party improvements on top of that finetune so far.
>>106438020huh?
>>106438029I didn't read this but you're wrong, someone needs to make SD1.5 use T5.
>>106438029>much more aesthetically pleasing dataset than Base SDXLKEK I still remember the hypersloppa
Would something like this work, and if so, has anyone already done it?>embed RAG database with all booru tags and their semantic meanings>natural language prompt uses the database to transform it into a list of relevant tags>no more needing to learn and look up magic booru words to gen on booru trained models
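The retrieval half of that idea is easy to prototype. Below is a toy sketch, not a real RAG stack: the tag descriptions are invented for the example (a real database would come from the booru wiki), and plain bag-of-words cosine similarity stands in for a proper embedding model. All names here (TAG_DB, prompt_to_tags, the stopword list) are hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical mini "database": tag -> short meaning. Descriptions are made up.
TAG_DB = {
    "very long hair": "hair that reaches the waist or lower",
    "green eyes": "character has green colored eyes",
    "hand on own hip": "character rests a hand on their own hip",
    "miniskirt": "a very short skirt",
    "smirk": "a smug half smile",
}

STOPWORDS = {"a", "an", "and", "has", "in", "of", "on", "or", "that", "the", "their", "with"}


def bag_of_words(text):
    """Lowercase word counts with stopwords dropped (stand-in for an embedding)."""
    words = re.findall(r"[a-z]+", text.lower())
    return Counter(w for w in words if w not in STOPWORDS)


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def prompt_to_tags(prompt, top_k=3, min_score=0.1):
    """Return up to top_k tags whose name plus description best match the prompt."""
    query = bag_of_words(prompt)
    scored = [(cosine(query, bag_of_words(tag + " " + desc)), tag) for tag, desc in TAG_DB.items()]
    scored.sort(reverse=True)
    return [tag for score, tag in scored[:top_k] if score >= min_score]


print(prompt_to_tags("a girl with green eyes and a smug smile"))
```

Swapping bag_of_words for real sentence embeddings is the part that would make it handle paraphrases; word overlap only catches prompts that reuse words from the descriptions.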
>>106438011
I was confused once when a bunch of people were sperging out in one lora's comments, "WE WANT VERSION X BACK", etc., yet it was there, free to download. It took like two minutes for me to realize 99% of civitai users are not local.
So what's your point?
using the kijai 2.2 wan loras with 1.5 strength high, 1.0 low, now I get actual candle flicker:
>>106438036God all those numbers are so small really in the grand scheme of things. Local gen is basically irrelevant.
>>106438058Understanding local gen is basically a superpower.
tfw i am a super human
>>106438005it's 150 autistically curated images captioned with jailbroken Gemini 2.5 pro lol. Batch size is undefined because they use Gradient Accumulation Steps instead of Batch Size in their trainer for big chungus models like Qwen in order to save memory / avoid crashing on big runs. It's technically set to 1, though, behind the scenes. So if you were training somewhere else or locally and actually using normal batch size the closest equivalent to my settings would be batch size 4.
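The batch size vs gradient accumulation equivalence described above comes down to simple arithmetic: the effective batch is batch size times accumulation steps. A sketch follows, with assumptions flagged: the accumulation value of 4 is inferred from the "closest equivalent would be batch size 4" remark, the 54 epochs come from the earlier settings post, and the step count assumes a leftover partial batch still triggers an optimizer step at the end of each epoch, which trainers handle differently.

```python
import math


def effective_batch_size(batch_size, grad_accum_steps):
    """Samples contributing to each optimizer update."""
    return batch_size * grad_accum_steps


def optimizer_steps(num_images, epochs, batch_size, grad_accum_steps, repeats=1):
    """Total optimizer updates, assuming partial batches are flushed each epoch."""
    samples_per_epoch = num_images * repeats
    micro_batches = math.ceil(samples_per_epoch / batch_size)
    steps_per_epoch = math.ceil(micro_batches / grad_accum_steps)
    return steps_per_epoch * epochs


# Hosted-trainer style: batch size 1 with (assumed) 4 accumulation steps.
print(effective_batch_size(1, 4), optimizer_steps(150, 54, 1, 4))

# Local equivalent: real batch size 4, no accumulation.
print(effective_batch_size(4, 1), optimizer_steps(150, 54, 4, 1))
```

Both setups give the same effective batch and the same number of updates; accumulation just trades speed for lower peak memory, which is why it gets used on large models.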
>>106435985
>https://civitai.com/models/137781/new-era-new-esthetic-retro-anime
Login and you can download it 6.6GB
>>106438071>>106438071>>106438071>>106438071
I make glorious titties, kneel before me saas-lets
>>106438053
2 strength high, 1 low
seems a good starting point.
>>106438043it was pretty objectively a more capable base model both due to the dataset and due to it just having proper NLP support. You sound like the sort of person who will just complain until the heat death of the sun about nothing that comes out ever being good enough while simultaneously crying about why nothing new is coming out though lol
>>106438082SLOP>>106438021per usual
>>106437683how does it compare to pony graphically? that's what I'm using now but I'll check out chroma