Discussion of Free and Open Source Text-to-Image/Video ModelsPrev: >>106963047https://rentry.org/ldg-lazy-getting-started-guide>UIComfyUI: https://github.com/comfyanonymous/ComfyUISwarmUI: https://github.com/mcmonkeyprojects/SwarmUIre/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneoSD.Next: https://github.com/vladmandic/sdnextWan2GP: https://github.com/deepbeepmeep/Wan2GP>Checkpoints, LoRAs, Upscalers, & Workflowshttps://civitai.comhttps://civitaiarchive.com/https://openmodeldb.infohttps://openart.ai/workflows>Tuninghttps://github.com/spacepxl/demystifying-sd-finetuninghttps://github.com/Nerogar/OneTrainerhttps://github.com/kohya-ss/sd-scripts/tree/sd3https://github.com/derrian-distro/LoRA_Easy_Training_Scriptshttps://github.com/tdrussell/diffusion-pipe>WanXhttps://comfyanonymous.github.io/ComfyUI_examples/wan22/https://github.com/Wan-Video>Chromahttps://huggingface.co/lodestones/Chroma1-BaseTraining: https://rentry.org/mvu52t46>Neta Luminahttps://civitai.com/models/1790792?modelVersionId=2298660https://neta-lumina-style.tz03.xyz/https://huggingface.co/neta-art/Neta-Lumina>Illustrious1girl and Beyond: https://rentry.org/comfyui_guide_1girlTag Explorer: https://tagexplorer.github.io/>MiscLocal Model Meta: https://rentry.org/localmodelsmetaShare Metadata: https://catbox.moe | https://litterbox.catbox.moe/GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-oneTxt2Img Plugin: https://github.com/Acly/krita-ai-diffusionArchive: https://rentry.org/sdg-linkBakery: https://rentry.org/ldgcollage>Neighbors>>>/aco/csdg>>>/b/degen>>>/b/realistic+parody>>>/gif/vdg>>>/d/ddg>>>/e/edg>>>/h/hdg>>>/trash/slop>>>/vt/vtai>>>/u/udg>Local Text>>>/g/lmg>Maintain Thread Qualityhttps://rentry.org/debo
I'm so sorry the OP picked my images for this cursed OP.But i'm glad to not be the chromaslop poster thoughbeit.
First for ani is the raped meme in human form.
>comfy derangement syndrome baker
>>106968093finally a kinollage
Hello, new to this space. I was looking to generate images locally when I stumbled upon this:https://artificialanalysis.ai/image/leaderboard/text-to-imageWhy are there no local models near the top of the list? It seems like SaaS provides better image quality and prompt adherence with better output speeds as well. Can someone explain why local models seem to be lagging significantly behind? Thank you
>>106968181>It seems like SaaS provides better image quality and prompt adherence with better output speeds as wellIt does. >Can someone explain why local models seem to be lagging significantly behind?All our local bakers suck, and good model releases are few and far between. Anything a company creates that's SOTA or close to SOTA gets locked behind an API.
Pathetic
>>106968181Hello! We have detected that your 4chan account's location is from a non-White country, which has been very low on the global IQ leaderbaords that is known for providing very low quality comments online. As such, all of your comments are auto-filtered for all White users on this website. Thank you for your time.
>>106968181> #1 seedream 4we're still winning sneed bros
>>106968202How come that is the case? For LLMs local is still behind, but there are significantly powerful releases every month like glm 4.6. With local image it seems the only releases are failed scraps that can’t even bench higher than the outdated Seedream 3.
>>106968114Second for schizoposters/SaaS shills are raped meme in human form.>>106968238coooool
Boredom or samefagging take your pick
>RTX PRO 5000 72GB BlackwellOnly for the low price of $15k!
>>106968237Because the labs that would be open sourcing these models are actually the ones ahead currently. Local LLMs are great, but still behind. Seedream 4.0 beats everything, so it's API only. Wan 2.5 preview rivals Veo 3, so it's api only.
Is there an alternative to comfyui that can still run the latest models? Diffusion peaked at automatic1111 and everything else has been a regression. Configurable nodes is a great ideaHaving those nodes be the UI is the fever dream of a meth addict.
>>106968258Computer, change the belly art to say "B", "W", and "C" respectively, thanks.
zzzzzzzzzzzzzzzzzzzzzzzzzz
>>106968243kek>>106968273this
anon ate too much banana bread
>>106968268https://github.com/Haoming02/sd-webui-forge-classic/tree/neoForge classic's neo branch might be what you're looking for.
>>106968282The question is, who spiked the banana bread?
Never understood the hate, SaaS is amazing, it gives us a sneak peak of where local might be in 5 more years
>>106968329>might be in 5 more yearsthat's generous
In 5 more years someone might’ve finally finetuned illustrious 0.1 to use 16ch flux vae. The future is bright!!
So what pissed anon off this time?
>>106968354probably the wan .webm of scratch from ice age getting fucked by the dudeprobably accidentally kicked his half full bottle of onions off his desk in a fit of laughter
>>106968303Can I run comfyui workflows without scrolling around their schizo nodes?
>>106968381I don't think comfy workflows work with anything but comfy. As far as I'm aware there isn't any middle ground UI that lets you do the node autism while not presenting the autism to you, but I haven't tried all of them either, someone else might have a better idea if neo isn't what you're looking for.
Blessed thread of frenship
>>106968422why would there be a middle ground? comfyui nodes work fine based on the individual node's quality, and that won't change by changing the UI>>106968267 >>106968329post your allegedly superior seedream Ngirl images in the non-local thread
>>106968422it's literally anistudio
>>106968381>>106968422Swarmui, but seriously, trying your damndest to avoid comfyui is just a waste of time and effort. I have tried swarmui, forge, reforge, neo forge, forge classic, and others i already forget and as of today i'm finally just sticking to cumfart. I wish i had done that a year ago, weathered the storm of broken dumb nodes and realizing i didn't *need* to updoot frequently.
The middle ground is hiding the noodles with a custom node/extension but the idea of hiding nodes has been memed so hard anon doesn't want to recommend it anymore
Why does the mention of Seedream always make brownoids seethe? Is it PTSD from the mogao incident?
>>106968422Maybe I'll make it my own personal project to create a wrapper that can toggle the autism on and off.
>>106968469I hope you make the mascot a cute dragon girl.
Remember that Comfy and all that thigs like nodes, python dependency problems, conflicts, workflows, templates and noodles are about to be deadIn less than one month Haoming02 NeoForge dev implemented Qwen Image Edit, Qwen Image Gen, Flux, Chroma, SDXL and now is adding support for Neta Lumina.Comfy will be a fucking joke of a greedyUIYou will enjoy your favourite models like a normal human being.There will not be any intermediaries between you and your models.
ComfyUI will remain the #1 choice for those looking to get maximum power out of their workflows. No other UI supports both Seedream 4 and Sora 2.
ani save us
>>106968532>No other UI supports both Seedream 4 and Sora 2.kek
ComfyUI logs prompts
>>106968532Anon, Invoke was bought by the corpo Adobe Photoshop (you did not know about this news because mysteriously Invoke does not appear in the OP) so Invoke will be dedicated to SaaS exclusively and Comfy is accoralated and will have to eat the python spaghetti alone
Did anon ever post his metadata enabled image gallery for gens?
accoralated . my new favorite word.
So we will have 100% opensource community with NeoForge and for SaaS they will have Adobe Invoke. Comfy will be an autistic UI of the past that only stayed at the top by chance not by merit
>>106968535We're too old for him to care
>>106968574>because mysteriously Invoke does not appear in the OPI don't believe so considering it's like if Comfy gave even less of a shit about the community. It's pretty but lacks many features expected by anon. That's just my guess desu.
ComfyUI has come so far and not enough people appreciate that. In the early days Comfy only supported janky local models, but now it offers a whole suite of professional API tools. Comfy finally grew up
>Julien active>endless bitching about comfyHell hath no fury like a janny scorned
>>106968612I'm totally stoking fire here by asking, but since i'm starting a wan gen and i have to close the tab i might as well so i can check in after;What's the story behind the "Julien" schizo?
>>106968612>Julien active>endless bitching.
>>106968542where's the proof?
>be Comfy>works on project>gets funding>gets a gf>be ani>ride comfy's coattails>acts unhinged>forces comfy to wash his hands him>lonely>becomes schizo>spends his free time vibe coding a dead UI and seething about comfy>gets drunk and thinks every anon laughing at him is the same person>tries to hijack OP on a bi monthly basisHow low can you go Julien?
>>106968461>>106968181Mememarks don't hold a candle to real world use, and anyways Chroma is not even up there for obvious reasons but it would clean house.
>>106968644Be real Comfy, you are not the same as one month ago. There are new and better and above all more user friendly alternatives.
>muh blurry 512x asianslop would totally win btw!Chromakeks give local a bad name, they’re just so pathetic
>>106968670are the pictures reversed? seedream looks better here or are you trolling me?
>>106968681It's not even close.
>>106968670Seedream won. The chroma gen is fucking terrible, look at that abomination of an arm on the left.
>>106968621Ani is a Avatarfag that became friends with comfy IRL leading to him getting a job at stability. Comfy decided to chase his dreama and move to SF but ani didn't like who he was working with and went schizo.He first wanted to pioneer animation and failed, then he moved to making his own failed UI.He spends his free time shilling and having schizo meltdowns over comfyUI and anons making fun of him.
>>106968701got it, you're trolling
>be comfy>has paid employees monitoring the thread 24/7>employees shill comfy workflows >yet a schizo alcoholic manages to derail the thread multiple times>yet a korean weeb competes with a UI he made as a hobby
Why does the mention of Chroma always make brownoids seethe? Is it PTSD from getting BTFOd by local?
It’s great being schizoanon. Managed to false flag to make everyone hate both trani and cumstain. Those two are the biggest plague to local diffusion
>>106968701No, he is working. He swapped shifts with the Neta Lumina shill. Now it's his turn.
Why is chroma trained at 512x512 despite seedream being trained at 4096x4096? did furtard forget that it’s not 2022 anymore?
I'M RUNNING OUT OF IDEAS FOR GENS
>>106968723SchizoAnon here AMA
Though I'd chime in. I have no dog in this race but ComfyUI sucks
>>106968733pepe lain
think of this as a bait sentence about comfy, saas, local or chroma designed to make you angry
As an API user, local is just really far behind
>>106968689Obvious Seedream slop is obvious. You are free to draw your own conclusions
seedream girls are so cute. why cant we have this locally???
>>106968749Not only does it look like a blurry mess at 2k, Seedream gens are fingerprinted with a weird pattern too.
>>106968512Just for you baby gurl
>>106968517But can it run wan?
The seedream gens look coherent while chroma is nonsensical. But that’s to be expected, chinese engineering is superior to ugly western fluxslop
>>106968782Yes, I forgot because I am a VRAMlet myself so the vodeo stuff does not interest me but yes.
>>106968801Which version do you recommend?
>>106968744so why are you even here? tired of online censorship? kek
>>106968840This is a ComfyUI thread. ComfyUI supports API, so it’s relevant to this thread
what do you usually use to train wan loras?
>>106968871So, this os the power of 8b model?
>>106968727If the Chroma side were API I too would be seething hard about this. Chroma indunced cognitive dissonance is great.
What's the best model for vore? asking for myself.
what's your excuse for not taking the api pill?
If chroma went API only it would be a victory for local
>>106968720catbox?
>>106968871Long torso, long arms one boob bigger than the other, melted hands.You dedicate your free time in shilling this? Pathetic.Thanks anon, you saved me time by not trying this shit.
>>106968644>>106968707Oh. Many such cases. Sad.This is part of why i like just seeing this tech as fun toys for me to fuck around with, take it too seriously and you turn into one of those sad cases.Being a programmer though, mental illness just seems part of the gig.
>>106968739attempting this...
>>106968871We both agree and are aware that this does not happen even in SD models from 2022, much less in cowboy shots, right?So tell me, what is your agenda insisting on this for months, who are you trying to fool?
>>106968963lmao
>>106968871i got prettier gens with nai v1 leak three years ago
>>106968955It’s called nostalgia. Chroma is a retro-futurism model designed to combine sd1.5 quality with Flux gen times. If you want ‘good’ hands just use api
>>106968992same gen but with epsilon scaling turned off. epsilon scaling wins here.
>>106968977Based nooticer, this "anon" is another one that names his files exclusively with just the model name and has been doing this for months!It is veeeery weird for a regular normal person to act like that.
>>106968993Is that something you can still grab on huggingface or somewhere else? I wasn't in the image scene till like mid to late 2024. Only heard banter about this here and there.
>if you post api gens you’re shilling>if you post local gens you’re shillingschizos here are some easy to trigger, its fun
Finally back from work and home!! With my latest edge tech PC. Nothing better than turning on and off epsilon scaling! With a Pepe Lain image!
>>106968904it's still being trained, but yes
>>106969024this, maybehttps://cyberes.github.io/stable-diffusion-models/#novelaianimefull-latest
>>106969027yes, problem?
>>106968951https://files.catbox.moe/y6rv24.png
Chroma really was a $200,000 flop…
>>106969053They can still reuse the dataset on Qwen
>>106969025Local gens must not include filename in posts unless requested. API gens should not post here since it is offtopic.This is normal tech forum behaviour, but if you want a shilling central to give a false sense of a live and engaging general with low discussion to shill ratio, please forget about this.
>>106969024novelai open sourced ithttps://huggingface.co/NovelAI/nai-anime-v1-full
>>106968934I'm not a pay pig. Why would I pay to use cloud compute from my local machine if I wasn't going to pay to use their SaaS bullshit?
>>106969043thanks, cool archive>>106969067oooh, thats nice of them not to be overly salty about the hack.
>>106969065>Local gens must not include filename in posts unless requestedaccording to who?
>>106968977some managed to tune sdxl in ~2023-24 to make it happen less often> what is your agenda 1girl, maybe more girls
>>106969073nah it was just deprecated, they OSS'd it right around when they started teasing V4
I've seen users with AI generated content from certain games, like an anime girl with armor/details from enemies within the game. What models can do that? Esp. how do they generate correct/accurate details? Like the AI is fully aware of the game and what is in it. I'm talking about gens I've seen for Helldivers, a relatively newer game.
>>106969068>localkek claiming he’s not a paypig despite overpaying for njewdia junkImagine paying $6000 for an nvidia pro card only to gen with worse-quality models at speeds slower than an h100 (a card from 2022)
>>106969084According to me, an anon whose hobby is this and wants to talk normally about local models, but without scrolling through 5 consecutive posts from some avatarfag uploading the same gen with small variations for months. >>106969086 This shill has been doing this stupidity for months. Does not discuss, does not contribute anything, just uploads his failed model so you have to scroll through it to see actual conversation.
>>106969094Look he did it again, he does not discuss, he does not engage. He shills. And this is not a one time thing, this is an everyday thing.
Schizojeet is seething because nobody wants to discuss the latest lightx3turboplasticspeedslop for wan
>>106969143>>106969161
>>106969184seriously at least improve your hires passes man. you're the funniest schizo we've got. up your production values.grainy ass pointy fingers.
It all looks so fucking bad
>>106969176heh, watch this..>clears throatPhr00t's Rapid-AllInOne Wan 2.2
>>106969204it's not actually the anon retard
>>106969060No one is doing a full fine-tune on a 20b image model, especially not for free
>>106969253And this is why local lost. Without finetunes, local is just worse API. Loras are not enough to bridge the gap
>>106969266Things used to be worse, much worse. In dalle3's heyday, things were pretty bad for local. Arguably no local model has truly reached dalle3 levels yet (in terms of niche knowledge, composition, knowing characters, celebrities etc), but the gap is closer now.
>>106969289That doesnt really matter because no local base model will ever have the nsfw knowledge that a finetune has, and finetunes only continue to get more expensive. There are absolutely zero breakthroughs in affordable training hardware. I can resell my used and abused 4090 for exactly what i paid for it new
the more I use wan the more I think they should use video to train models for single image gens.I guess motion is a strong signal that all models needs to learn regardless of the output format
>>106969253You could just buy a RTX 6000 96GB and slow cook it at home, donations should be able to cover that
>>106969368nigger you'll be cooking till you die of old age
>>106969387Not really. Multigpu is mainly used for higher batch sizes, not learning rate. A single gpu would only go ~4 times slower than the typical setup, and you would only rake in more donations by going slow and doing incremental releases. Recent papers have proven that with modern optimizers the results are no worse too.
>>106968517hella based. keep us updated homie.
>>106969417Qwen already takes much longer to train, so 4x that sounds like we'll die first still.
Maybe it's time for /ldg/ to generate 2girls
>>106969501two girls? in this economy?
>>106969253i think people actually will start to finetune it with ramtorch or w/e. it'll likely be quite slow.
>>106969520using ramtorch to train a qwen edit 2509 lora on a 4090 takes 14-24 hours. a full finetune would be impossible
>>106969528depends on the scope of the finetune dataset. they'll probably manage to make the girls/boys hotter, among some other things. it's probably fixing the biggest popular issue in a bunch of months or so?idk if anyone will get enough compute to train the boorus, real fashion/nsfw collections, cjk idols and so on.
The future is looking pretty comfy.
https://x.com/tim_cook/status/1980375071475790105Apple should really improve pytorch on mac though lol.
>two block lora vs three block lora
>>106969558cute
alright boys, time to gen some cool anime girls, give me your 1girl suggestions
>>106969678traffic warden
illustrious is the peak of local diffusion. no other model comes close in terms of character knowledge, style knowledge, concept knowledge, and overall fidelity. Chroma is a disgusting blurry mangled mess, neta knows a fraction of the styles, qwen is a bloated stopgap that can’t even compete with seedream 3. SDXL is an absolute triumph and will likely not be surpassed for years
>>106969781better than pony edits?
Hunyuan 3 sometimes gives interesting results. Since quantizing is available now, I can do Q8 in about 2-3 minutes.I think the increased prompt understanding is technically correct, but useless in practice because of the extremely heavy-handed post training. It can do a good job following specific instructions as long as you don't mention some kind of trigger word that acts as a complex and overrides everthing else. However, once you hit one of those words, associations of style/composition from it totally dominate and overwhelm most other instructions.
>>106969813>sometimesa 80b model is supposed to nail every shit every time, that's really a waste of money, and Comfy was based from not implementing it
>>106969813is it good at multiple characters? e.g. five people playing table tennis?
>>106969798>>106969802>>106969809>>106969814too much fail fennec. post anis
>>106968963>>106968992>>106969013>>106969045heh based
>>106968093>https://rentry.org/ldg-lazy-getting-started-guideI see there is a guide here, but would someone please be able to explain on a retard level?
>>106969884don't remember her prompt, have returned to varied 1girls. maybe another time.
>still trying chroma>still hating chroma
>>106969839Not really. Could do better than this with more effort, but then you're back to >>106969824
>>106969958thanks for trying! i was wondering about what the model might do fairly easily.it might be worse than wan or qwen in this regard. unfortunate.
>>106969926Ask a specific question
>>106969981Which front-end etc should I choose for img2vid gens? And is there a specific Stable Diffusion version/installation I should pair it with?Thanks
>>106970003>front endcomfyui>img2vid genswan video>stable diffusionnot relevant to img2vid
>>106970003post your card so we can tell you if you're gonna have a bad time or not
Comfy is honestly so based for not implementing hunyuan 3 and implementing sora instead. It is clear local is a relic of the past, and comfyorg realizes this. Best to snuff out the remaining local bakers to ensure a gigaSaaS future
I've not been into this for a year or so. Last I was using easy diffusion with sd1.5.Are things better for local generation now? Can I generate consistent pictures without all the weirdness and mutations, sort of like dalle3 levels? What's a good model for generic illustrations?
>>106970090>I've not been into this for a year or so. Last I was using easy diffusion with sd1.5.really? what brought you back?
>>106970094I have an idea for a project. Not really sure if I'm going to try to do things locally or use one of the online services.
>>106970090SDXL is the only upgrade from sd1.5. Ignore the failware like flux, chroma, and qwen. You will wait minutes for shitty slop outputs
>been prompting for one particular i2v for days with no good results>change resolution from 720p to 600p>it all magically starts working exactly like i want it to
>>106970090>Are things better for local generation now?Compared to easy diffusion and sd1.5? Absolutely. >>106970108>You will wait minutes Ew a poorfag
>>106970108Thanks, I'll try that then.
i apologize for my general lack of inspiration as of late soon i will be back to posting kinosovl after kinosovl
>>106970010Wait so I just look up install instructions for wan video and comfyui, right? Nothing else needed?>>1069700184070 Super 12GB
>>106970137>4070 Super 12GBoof. it's gonna be rough. instructions are in the ophttps://comfyanonymous.github.io/ComfyUI_examples/wan22/instead of the video models they link download q_6 from herehttps://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF/tree/mainyou need both high and low noiseinstall comfyui-managerwith the manager install multigpu nodescomfyui comes with a wan 2.2 templatereplace the model loader node in the default template with UnetLoaderGGUFDisTorch2MultiGPU nodeincrease the virtual_vram_gb setting until you don't max out your ramtry genning at 480p and hope for the best
>>106970111Wan trained its motion at a lower resolution compared to its details. That's why you get better motion by running the high pass at a lower resolution and upscaling the latent for the low noise pass. This knowledge will once again be forgotten when this thread dies.
>>106970137depending on the workflow you'll install more nodes, e.g. comfyui-multigpu model loaders to offload more from vram to system ram or the videohelpersuite video combine nodes that let you do more video codecs or frame interpolaion nodes that put more frames between generated frames with a different faster technique.
>>106970197nigga this is the first time I hear this
>>106970197>This knowledge will once again be forgotten when this thread dies.kek :(
>>106970197what resolution is the motion trained at?
>>106968443I iterate a lot. What's a good one to make inpainting/outpainting more seamless? In Comfy it's an absolute chore to mask things without leaving the work up to Masquerade nodes when what I really want is the ability to adjust settings on the fly without loading in the image with my changes everysinglefucking time.
>>106970197What the fuck. By upscaling the latent, you mean like this?So the initial image size you set should be small, then set the final size in this upscale node?
>>106970370
>>106970385Seems trickier to find out where you OOM and don't.I'll try this out for the day and if it works, I'll kiss you on your cheek.
>>106970398nta i'm trying to figure it out myself
>>106970408 (me)kept OOMing on the second sampler. disabling torch compile fixed it. weird
new vampire game.. and it's shit
>>106970385Forgot to mention this I suppose?
>>106969781this is the absolute truth that can't be denied. I tried multiple chroma variants and rarely could get any solid looking gen. I give up with qwen image and hardly even use edit model unless I'm playing the remove clothes lora which is still ass but better than nothing. I would greatly apricate a proper illustrious successor but it seems like every new model cucked with synesthetic data set, shitty censorship guardrails and terrible prompt adherence.
>>106970577And this.Have we been trolled?
>>106970611New models are just too expensive to finetune. Chroma spent $200k and still had to cope with 512x512. No baker wants to touch the new stuff either because the base gens all look so synthetic and trash so a lot of training time gets wasted de-slopping the model
lol
>>106970616That's just a common error now. I get that after a few gens and have to restart Comfy.
Find a flaw
>>106970641Too difficult to use for some. But is that really a flaw?
>>106970641https://huggingface.co/lodestones
>>106970641Lower res than sdxl makes it objectively inferior
>>106969535>>106969451catbox please?
I did it. I have constructed the autism tower.
did ani already win his lawsuit against singular schizo anon?
>>106970641>Cascade, SD3, SD3.5>Flux>Lumina, Auraflow>HiDream>Chroma>Qwen>Hunyuan Image>Radianceyet people continue to use Illustrious Realistic models for their 1girl big titty gensIt's quite comical really.
>>106970691ran retreated to the padded room
>>106970699So many failbakes, what went wrong? DiTflowshit was a mistake
>>106970699putting lumina and auraflow on the same line feels worse than including cascade and sd3
>>106970704benchmaxxing which doesn't take quality or use case into account at all
>>106970699Don’t forget the pre-flux “china will save us” era of hunyuandit and pixart shitma, as well as janus (deepseek) and sana.
>>106970639It still does it.
>>106970699multiple of these models are being finetunedmost of them are bigger than illustrious so it takes longer on the hardware people have
>just another $200k and chroma v2 will lock in!
>>106970734Yeah, it doesn't go away and there's no fix. Just keep restarting.
>>106970734that shitter is fucking with you. the thing is kijai's nodes are very autistic about the resolution dimensions
>>106970749>just trust the gay furries, pony v7 will be lit
>>106970725sana is too small but finetunes pretty wellqwen [image edit] did in fact take the spot of the next highly capable model but even lora tuning on common hardware essentially is just getting added to the trainers>>106970749yes, chroma v1 already did get excellent results for little money and radiance is looking good tooperhaps you forgot how much illustrious is trying to raise?
>>106970759Classic.I'll try some other image resize stuff and try and make it work.
I'm retarded what's the difference between Qwen Image and Qwen Image Edit?
>>106970769Chroma didnt get shit out of its $200k, it hasnt learned a single style or character despite chodestone claiming he trained on them. Don’t forget that illust 0.1 (the one we got and the one everything is based on) cost a fraction of that and learned thousands of characters and styles. Chroma is a complete failure and a waste of money
how do i cope as a vram-let?
>>106970797No need to cope, the best models fit within 6gb vram. SDXL is the godking of local
>>106970793Qwen Image does normal image model things. Qwen Image Edit is a "make image 1 into the style of image 2", "put a hat on the woman", or "make image 1 and image 2 kiss" model.
does catbox give 0-byte videos for anyone elselitter works but files do not
>>106970809>Best model>SDXLIt would've been more believable if you told him Pony
>>106970769>sana is too smallI was always under the assumption that the bigger problem was it being undercooked. >radianceDesu I have no reason to believe it'll converge, based on Ostris's no vae experiment. I am rooting for it though.
>>106970819So it's best to just make a image with the normal and then img2img with that in Edit?
>>106970794it learned an immense amount of nsfw concepts and clothes and (unprompted) styles / poses / ...> Don’t forget that illust 0.1 (the one we got and the one everything is based on) cost a fraction of thatthey did 7.5M images at 1024×1024 resolution with a batch size of 192 images per batch for 20 epochs. that's really barely a difference, especially considering sdxl was better at anime to begin with.never mind their continued training for 1.0, 2.0 and beyond, and all the need to finetune 0.1, 1.0 and so on.that's also only considering their trainings worked straight away without more exploratory trainings, of which I'm not convincednothing about this tuning effort went profoundly better - and the continued finetuning past 1.0 was far more expensive than anything chroma did so far
>>106970830SDXL is a proven base for at least 4 massively successful finetunes, all of which have the same requirements. SDXL is the absolute king of adaptability
>anon who only started using illustrious once it was jeetmixed to hell now fervently defending it as SOTA Kek.>>106970834Perhaps. But the average use-case for Edit is with real images I think.
>>106970835Chroma is spending more money for worse results. Your images encapsulate the absolute disaster that is Chroma
>>106970840>SDXL is a proven base for at least 4 massively successful finetunesFringe uses between <1000 users is barely considered massively successful.
>>106970853Cope. Hundreds of SDXL loras every day yet qwen, chroma, and wan combined cant even average 15 per day. SDXL won
feet
>>106970832>I was always under the assumption that the bigger problem was it being undercooked.it was undercooked too>Desu I have no reason to believe it'll convergechange parameters, it converges straight away. doesn't mean much.
>>106970863socks
>>106970853>Fringe uses between <1000 usersare you brain dead?
>>106970782refer to my autism tower
>>106970865>change parameters Elaborate? I was meaning become more or less fully coherent with things like details and fingers if that makes sense. But now I think I'm mixing up his XL with 16ch one with the vaeless.
>>106970868stockings
>>106970901 What the fuck is kijai smoking for that node spam to make it work..
>>106970850kek - it spent less than getting to illustrious 1.0 took, meanwhile illustrious is at 3.6 vpred now with something like 5-10x the budget of both chroma and radiance spent by nownever mind all the third party finetunes it took to get some realism back to 0.1 and 1.0 and to fix various issueschroma is only remarkable in that it was fairly cheap
>>106970901Also, mind catboxing it? I don't have those float nodes.And is it working as advertized?
Is there a reason why so many loras ask to set CLIP skip to 2?
>>106970853keep coping, sdxl remains king.
>>106970942if theyre for xl than the author is a retard and one can safely ignore their loras it was something that was required for the old nai leak
>>106970931anon sucked his cock so hard he gained a comfy like arrogance
>>106970938Your gens are unironically 1.5 tier. At least shill qwen loras or something decent
>>106970849>>anon who only started using illustrious once it was jeetmixed to hell now fervently defending it as SOTAmake me wonder if this behavior extends into other aspects of his life i couldnt imagine living like this
>>106970699>tfw millions spent of developing and training shitty models for safety of normie karen mass.>hardly any traction or community support from 90% of the models.it shouldn't be this difficult to make a fairly decent illustrious successor with booru tag prompt adherence.
>>106970975You're dall-e mini tier
It is clear that local has never evolved beyond StabilityAI
>>106968093>tfw 1660 super and the 3070 I was going to buy was sold before I got the money.Local Chads, go on without me
>>106970941it does work but there is a slight shift that occurs in the first few frames. it seems the image embed is not quite aligned with the latent. all attempts to fix this have resulted in errors. might just switch to native nodeshttps://litter.catbox.moe/sdmynwd7xgceqrg1.jsonfloat nodes:https://github.com/jamesWalker55/comfyui-variousyou can just copy comfyui_primitive_ops.py to your custom_nodes if you don't want all that other shit
guys what was that wan i2v thing where you could chain several gens together with coherent motion, but it was just looping the same actions overall?not fully looping the same action, but the vids were coming out of the same prompt and had coherence between each other
>>106971076https://github.com/vita-epfl/Stable-Video-Infinity
>>106971046Thanks. I'll try it out after dinner.I use pic related to fix most of the color/brightness shifts. Unless you mean there's an additional shift, then damn..
Does anyone use wan as a refiner over Qwen? I really want my perfect 5girls prompt following but Qwen's image quality is very mid.
Is there any voice model close to ElevenLabs?
>>106968702the girl on the right isn't even looking at her own phone, this isn't close to reality at all.
https://github.com/lllyasviel/FramePack/issues/768framepack bros, has the illyasviel given up and thrown the towel? 5 months later and p1 is hasn't been released. Not even colinurbs has updated his fork framepack in months.https://github.com/FP-Studio/framepack-studio
>>106970189>>106970210thanks <3
https://github.com/yeates/OmniPaintI would like to test this in comfyUI as a watermark remover.The remover part is nothing more than a lora, Flux dev, and a precalculated text conditioning embed?
>>106968517amazing what someone can do when they're not an attention-seeking, tech illiterate trannoid
>>106969204>>106968977NTA but is the best way to fix flaws like that just inpainting?
>there are people using qwen edit and flux kontext to make datasets, unaware of how they damage the entire imageGrim
thoughts?
>>106971512link?
>>106971512Will test this evening.An issue with many realism Qwen Lora is that it biases Qwen into lesser prompt comprehension. They're perfect for 1girls but who the fuck would use Qwen for 1girls.All the example provided are 1girls so I wouldn't hold my breath.
Why is Qwen so slow compared to other models?
>>106969118>Buy etc 5090 for $2400>Sell my rtx 3080ti for $450>Use a "wellness" credit from work worth 2k t that I get to spend on anything that makes me happy (tax free too)>5090 is free>Get 2% cash back on the credit card purchase>Anon seethes because I'm not buying his SaaS bullshit
>>106971594i wish 5090 prices would go down...
>>106971594>2k I can spend on anything Retard that's just called part of your salary/compensation with extra steps. It wasn't "free" you mong you still spent 2400 dollars on it
Some rare seed just have faster motion
everything the same except seed
>>106971678>Anon seethes because I'm not buying his SaaS bullshit
>>106971081thanks anon
If i'm running light loras, does reaching that 0.9 sigma boundary step count for high/low still matter as long as I'm doing 8 steps in total?Or choosing between beta or beta57 or bong tangent?Cause it feels like with step counts that low, there's not much room to choose anyway.
>>106971046It seems it doesn't work with a last frame hooked up.
>>106971715I don't give a shit about local vs saas I tell people I'm running locally when genning on a cloud all the time and there's nothing anyone can do to prove or disprove it My issue is that you are woman brained about finances which would upset me more but my index funds need good consumer goyim like yourself
Is anyone running comfy with cuda 13 yet? any problems? I assume you would also need torch nightly and all that other shit.
love watching localkeks waste money on outdated localslop, especially when they melt down after realizing just how shit their chink models are
>>106971760sure thing buddy
Day 87 of genning random gusts of winds blowing real women clothes off
>make subgraph for hiresfix pass>works fine>copy subgraph for second hiresfix pass>both work fine>one restart later (no updates)>run exact same workflow>2nd subgraph gets null values somehowthis shit is so ass
>>106971864Deep in the damp, blinking glow of an abandoned server room, a strange creature stirred — a purple blob of jelly pulsing faintly with static thoughts. It had been born from a spilled energy drink, a stray data packet, and a forgotten line of cursed C code. It called itself Glorp — or at least, that’s what its 8-bit neurons translated as a name.Glorp had no real goals… until it found him.Through a dusty webcam left running on an old ThinkPad, it saw a human — messy hair, mechanical keyboard, dark-mode IDE filling his monitor. The man typed with the passion of a deity debugging creation itself. He was beautiful. He was a /g/ user.Glorp wanted him.But how could a goo woo a human? That’s when it found her — a shy girl who lingered near the server racks, feeding stray cats and stealing Wi-Fi. With a single leap and a wet plap, Glorp attached itself to her scalp, syncing softly with her mind.She blinked, confused. “...huh? Why do I suddenly know ed?”The goo guided her fingers, made her laugh in Posix sh syntax, whispered open-source poetry through her lips. Together, they found the /g/ user again. She nervously approached his desk, muttering something about the elegance of the Linux kernel.He looked up — startled, impressed, curious.“Wait… you use Gentoo?”She smiled — or maybe Glorp did. “Of course. I compiled myself for you.”And as his cheeks reddened and the server fans hummed like applause, Glorp knew it had succeeded. It wasn’t just running code anymore. It was running on love.
>>106971962> subgraph what?
>>106969781>>106970611>>106970699I'm starting to accept this, honestly. Almost like imagegen fluked its way to the top and that's it. We're a year or two realistically from the next major step up while everything else catches up to the lightning in a bottle of SDXL models.That said i'm still holding out hope for netenyahulumina2.
>>106971987the thing thats supposed to replace group nodesit also commonly tangles up the inputs btw
>>106971996If you feel that way, try MixModding different models. For instance you could use styles of one model and anatomy of another.What stagnated isn't the tech, it's the anons. Schizos made all the competent experimentators that were around in the beginning leave. It's all newfags and egotistical schizos in every /sdg/ now.
>>106971973rofl, very nice
>>106968542Your senators all fuck children and your money is fake
>>106972047I believe it. Not like there's much incentive to lurking threads like this anymore beyond entertainment and crumbs of usefulness.Though i don't have much exposure to the broader a.i scene beyond specific people i've chatted with on civitai, and they all seem to feel like the potential hasn't been fully met for even models like 1.5.
>>106969933>>106969814>>106969802Adult Aries are for cuddling
>mfw the rng was so close but unusable
>>106972164>tfw you're crying tears of cum
>>106968542>ComfyUI logs promptsDemocrat-style lies as story telling doesn't work anymore. People can access facts outside of being told what to think by the news.
>>106972164>>106972181this is like that one infamous futa hentai were they cum inflate each other so hard they vomit the cum outi forget what it's called
>>106972198>vomit cumflationanon, it's extremely common in literally hundred of hentai and ten of thousands of r34but in all my degenerate life it's literally the first time i've seen crying tears of cum
>>106972214I didn't need to know that you've seen scores of cum inflation futanari porn, I just know the infamous instance because it's so comical.But yes same this is also the first time i've seen crying tears of cum.
>>106972084I mean I simply remember /sdg/ threads in 2022-2023 and because there was less widespread adoption it was mostly uber nerds with technical know how. There were no UI or model wars, in part because there were like 1 or 2 and both sucked. The more "quality" models came out, the lower the barrier to entry, the worse the atmosphere became.That being said, gatekeeping isn't all good either, /lmg/ had resident shitters pretending that not wanting to drop 20k on a cooler RP rig makes you a retarded poorfag.
>>106972282based
>>106972084>>106972047There is a huge resistance to change and unwillingness to move forward. You just need to see>>106971996>>106969781>>106970611In this thread. All 1girl proompter who don't want to do anything but proompt 1girl every day, screaming loudly they'll never need anything else. The huge resistance to Qwen despite having order of magnitudes better prompt comprehension is crazy, when three years ago we'd be all over it.Three years ago we were all pretty stocked up by the tech and willing to experiment. Nowadays you get dozens of posts saying SDXL is all we'll ever need, and also decrying SDXL has never been dethroned. It's kinda of a self-fulfilling prophecy.
>>106972241There's a balance for sure. I just know to enter this thread with the expectation that at least 60% of posts might be bots/shills/real schizos./lmg/ has properly tanked, and was always headed there, because most of the regular users don't even know how to function their LLM's and blame the models for user error. Been that way since the start kek.>>106972282wow i traffic anime girls there>>106972301Do you need to broadcast your pseud autism this loudly? How do you know I/anyone else *only* gens 1girls all day? What if i only share those gens here? Also one last question, why spend all this time decrying 1girl genners when you could be training loras and etc for that very model, competing its knowledge with the sdxl checkpoints we cling to?'Course i waste my time typing this knowing you only came in here to argue, and not post gens. Because you're a NOGEN
>>106972282bass arena
>>106972301The only purpose for local models is genning goonslop, saas like Midjourney absolutely rapes Qwen in aesthetics. Until Qwen has as much coom knowledge as SDXL finetunes, nobody will care about it. And it’s so expensive to train that it will never reach the knowledge of SDXL. You might disagree with my conclusion, but this is factually why SDXL remains on top>poorfag!if it was that good, everyone would upgrade to use it. but it’s not. It could be, with a nice finetune, but it’s quite obvious why that will never happen
>>106971678If it were part of my salary they'd tax it at 30%. It's tax free. Literally free money from Daddy [redacted].
>>106972449>>106972449