/g/ - Technology

Move the Latents Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106678738

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: combined_image.jpg (2.78 MB, 4096x2435)
Here's the swordfighting chroma comp.
>>
>>106681299
2k-2 looks the best although its all trash
>>
i installed comfyui and all of the workflows are empty, i can't add nodes, it keeps flashing the message "YOU'RE GAY" on the screen
>>
>>106681318
It means you have a radeon gpu
>>
>>106681318
It means it detected you patched out the API nodes and fixed the memory leaks, breaking your install with its hidden DRM
>>
jewlibaba api announcement, in a few hours
>>
>>106681318
I got news for you, that means you're gay
>>
>>106681318
Did you open an issue in the repo?
>>
Who will replace alibaba as the open source king now that they've gone full closed source?
>>
>>106681383
not a burger thats for sure
maybe the french?
>>
File: jessiebig.webm (468 KB, 720x480)
Anyone got super old gens? Here's a really early Cogvideo gen I made like 2 years ago. Sure has been a lot of progress.
>>
What would I need if I wanna gen cool videos and animations like you guys?
>>
>>106681357
>In a few hours
Isn't it in 20 minutes?
>>
>>106681401
learn to read
>>
so is neta lumina dead or
>>
Blessed thread of frenship
>>
More like thread of yellow fever losers
>>
File: CogVideoX-I2V_00006.webm (634 KB, 720x480)
>>106681398
Yes, I always thought the progress would be slower, its amazing
>>
File: ComfyUI_00003_.webm (94 KB, 512x512)
>>106681398
i would show my first wan tries when i had my 8gig card but they're all extremely nsfw
well besides this one, its hilarious because i think it took an hour or two and this is all i got because i stupidly set the seconds too low.
>>
File: output.webm (3.87 MB, 800x1168)
>>106681401
You'd need to read the guide in the OP. If you wanna make anything of decent quality you need a 3000+ nvidia gpu with at least 12gb VRAM hardware wise.

>>106681357
please be opensource please be opensource please be opensource
>>
File: AI-Payprocessors.png (261 KB, 1011x1037)
>>
>>106681398
this is from 2023
>>
>>106681477
I think I'm good on the gpu front. From what I'm reading, the one I want is WAN2GP, is that the one you guys are using?
>>
>>
>>106681437
blessed thread of friendzoned ;3
>>
File: 1757934031486111.gif (26 KB, 220x143)
>the month of our lord september 2025
>still no good spanking lora
>>
>>106681398
Just over 3 years since I posted this.
>>
File: API NODES WON.png (258 KB, 967x608)
TOTAL API NODE VICTORY!!!!!!!!!!!!!!!!!!!!!
>>
>>106681477
make her cast a fireball at an off-screen target
>>
>want to finally try Wan 2.2
>need to download more big files
>comfy ui is already over 200gb
How do I know what I can safely delete? Do I just take the plunge and backup a couple of models/ loras, delete it and start over?
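
A rough way to see where the 200gb went before deleting anything: a minimal Python sketch that walks a folder and lists the biggest files. The `ComfyUI/models` path is an assumption about where the install lives, so point it at your own folder. It only reports sizes and deletes nothing.

```python
import os

def largest_files(root, top_n=20):
    """Return (size_in_bytes, path) pairs for the biggest files under root, largest first."""
    found = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                found.append((os.path.getsize(path), path))
            except OSError:
                # Skip broken symlinks and files that vanish mid-scan.
                continue
    found.sort(reverse=True)
    return found[:top_n]

if __name__ == "__main__":
    # Hypothetical location; change it to wherever your models are stored.
    for size, path in largest_files("ComfyUI/models"):
        print(f"{size / 2**30:8.2f} GiB  {path}")
```

Checkpoints and LoRAs you can re-download are the obvious candidates; anything you trained yourself is worth backing up first.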
>>
It's time we have a serious discussion about removing ComfyUI from the OP. It is clear that Comfyfaggot's push for API nodes is a major factor in convincing model providers to switch to API-only. A project whose primary focus is in raising money for SaaS while botching the past 2 local implementations (chroma and qwen) has no place in a thread about local diffusion. Anyone who insists on keeping ComfyUI in the OP is unironically a SaaS shill attempting to sabotage local development.
>>
File: 6-2023.png (1.6 MB, 800x1080)
>>106681398
>>
FurkUI will save local
>>
File: dancedance4 - Copy.webm (3.45 MB, 1080x1920)
fixed the phasing issue and the face inconsistency, the only problem is the stupid stitching. a bit more and I can create a cucktok
>>
File: G0edv6OWoAAtm4C.jpg (642 KB, 1664x2496)
Posting some Seedream nostalgia to celebrate another model joining the API nodes family!
>>
>>106681585
This.
REMOVE COMFYUI FROM THE OP IF YOU STILL CARE ABOUT LOCAL YOU FAGGOTS
>>
File: 1436576952547.jpg (89 KB, 1280x720)
>>106681585
But does comfyanon intend to eventually abandon local gen in favor of API-only? Or are the API nodes simply just another set of tools added to ComfyUI's arsenal?
>>
File: 1745830155620851.png (986 KB, 808x1072)
>>106681398
6-2023
>>
File: G1iKqscWsAAkMC_.jpg (1.25 MB, 4096x2304)
>>
>>106681585
>implying replying to bait
You can just fork it and run it forever local. Kys.
>>
>>106681585
You literally haven't mentioned the replacement, anon
>>
can wan generate george floyd getting kneeled on by George Washington?
>>
File: 1741406015975277.mp4 (1.58 MB, 1024x576)
WAN is best when merging concepts into novel outputs to get stuff you wouldn't see in the real world.

like Star of David sunglasses kek, even though they're heptagrams in this gen

interesting that most attempts had CONDEMN written as CONDENN so there's obviously some similar "vector brain space" being shared between the visual concept of an M glyph and an N glyph.

>>106681554
>still no good spanking lora
spanking is a tough concept to teach since its fully related to the speed of arm movements compared to just caressing the butt

>>106681584
dude a 1tb nvme or even sata SSD costs like 50 bucks stop being poor if you want to be in this hobby
>t. too lazy to dish out $120 for 64gb of ram
>>
>>106681588
yeah only the last second or two was jarring, you're almost there. Then just mask the wan graininess of the hair with some blocky bitrate lossy compression and it'll look even more legit
>>
>>106681604
ComfyUI choosing to offer API nodes is what killed local in the first place because companies see no need to release weights when the tools are all API compatible anyway. This is precisely why Wan is SaaS now
>>
>API nodes killed local
But (you) said local was dead way before that...
>>
>>106681629
Forking does nothing to change the fact that companies are switching to API-only thanks to ComfyUI's push for it. Keeping ComfyUI in the OP caused this. We would've had way more open-weight models if you retards noticed this 5 months ago and dropped it from the OP
>>
>>106681637
Don't glorify. Make uglier.
>>
Total Seedream LocalSaaS API Node Comfy Diffusion Victory
>>
>>
>>106681566
i'm going to generate humiliated chinese people, until xmas!
>>
File: dancedance6.webm (3.35 MB, 1080x1920)
>>106681640
actually I posted the wrong one, this is the latest version
>>
>aniseethe
>>
I wonder how well Noob can into Total Drama Island
>>
>>106681666
Yeah I see what you mean, the motions in this one are better but the stitching issue is more apparent, like its missing a few frames in the dance
>>
>>106681585
lel

Comfy has the BEST support for local models, practically all local models supported days after release, often within hours

Even supports local models still heavily in training
>>
>>106681683
broken garbage implementation is not support, it's sabotage
>>
>>106681636
yeah this can be done in a single pass with Wanimate using a headshot of GW and the floyd video

>>106681652
>Make uglier
You will never have freckles and green eyes abdul, kill all the non-ashkenazis you want but if you start touching European genetics even slightly it's nuke time
>>
>>106681690
truthnuke that comfykeks dont want to hear
>>
>106681690
get more vram
>>
>>106681669
it can into Total Drama Island pretty well from what I saw on /b/degen. a whole generation of zillenials grew up with Gwen being the prototypical goth egirl jailbait
>>
ETA until local dies on live stream?
>>
File: 0.png (144 KB, 540x630)
What kind of artist name / lora / prompt should I use if I want to generate images like this?
>>
>>106681704
>ETA until local dies on live stream?
whenever Nvidia puts backdoors and always-on requirements for their 7000 series for """""national security purposes""""" and we have to start living in a Gibson novel where we risk 10 years in jail to smuggle in GPUs from China
>>
>>106681704
already happened
https://www.youtube.com/watch?v=NEXn1RgCf3Y
no more wan for local. local has literally ZERO support anymore, it's actually dead in the water now.
>>
>>106681703
Probably via LoRA, there's less than 200 images on both e621 and Danbooru... actually that's not a terrible amount but many anon are promplets.
>>
wow this seething is so organic
>>
File: 1745914415129175.webm (2.98 MB, 512x512)
>>106681398
September 2022.
>>
Shill thread. Remove comfyui
>>
>>106681725
initially i hated these types of animations but they grew on me especially modern highres versions
>>
>>106681714
I knew it would happen at some point, but damn, I thought it would've happened on wan 3.0, I'm so scared it means I'll have to spend the rest of my life with a MoE meme model, fuck :(
>>
This wan 2.5 presentation is some awful slop goodness.
>>
It's unironically over for local development. Holy shit what a fucking awful year
>>
>>106681720
just wait for them to fuck off somewhere else this always happens anytime a cloud model releases
>>
>>106681762
Qwen Image Edit is literally the best porn model.
>>
File: df.png (967 KB, 832x1216)
>>106681705
This is the best I can do
There's got to be a good artist name or something for this
>>
>>106681767
this shit is so fucking slopped, fuck that, I want nano banana at home
>>
>>106681772
Oh you're poor, sorry. Maybe save up money so you can learn how to properly use a model.
>>
>JUST PRETEND NOTHING IS HAPPENING!! IT WILL ALL BE FINE
I shall remind them: >>106481726
>SAI died and switched to saas
>Kolors switches to saas after making a good model
>Hidream switches to saas after making a good model
>Hunyuan switches to saas with Hunyuan Image 2
>BFL intentionally sabotages local releases to promote their api
>Pony is now trying to shill his dead saas app
>Illustrious is now purely saas
>Deepseek's attempt at an image model was a complete disaster
>Pixart is dead, sana was complete shit
>Chroma spent over $100k training at 512x512 because finetuning even 8b flux is too expensive
>ComfyUI implements API nodes after realizing local is dead
>CivitAI follows suit, offering GPT and Gemini now
Now just today Wan went SaaS as well. Sticking your head in the sand doesn't change the fact that local is on a non-stop losing spree and all previous local model providers have switched to SaaS
>>
>>106681566
Local model when?
>>
>>106681778
>big model = perfect model
no, far from that, you need to read this
https://en.wikipedia.org/wiki/Garbage_in,_garbage_out
>>
holy cloudcuck cope
>>
localpajeets lost, no surprise there
>>
File: 1737242466769155.png (146 KB, 640x354)
>>106681784
>Local model when?
>>
File: 00020-4078008795.png (721 KB, 832x1216)
>>106681782
I'm out of the loop. What the hell is "SaaS"?
>>
>>106681785
If you weren't poor you'd make a LoRA and it does literally anything you can imagine at the quality of your training data. That's all you need to know.

Yes, Qwen Image Edit with a LoRA is better at porn than Chroma.
>>
>>106681795
short for Sucks Ass And Socks
>>
>SaaS is literally cuckolding local, stealing their models and locking them behind API while local can only watch
KEK
>>
>>106681796
workflow for new qie?
>>
>>106681799
API can't gen futas pegging shotas
>>
GET IN HERE FAGGOTS, WAN 2.5 STREAM STARTING SOON. PREPARE YOUR API NODES
https://www.youtube.com/watch?v=hyRFWDEX_EA
>>
>>106681800
I haven't played with the new model, just the old one which I've made some LoRAs for and it's quite good, and as I said, will easily train to output without the Chinese slop aesthetic. But I imagine the upgraded model will be even better for this.
>>
>>106681804
NovelAI does it way better than any local model
>>
how many steps does it take for there to be diminishing returns for realistic gens using euler a?
>>
>>106681810
Yeah I'm sure having everything you ever typed along with your credit card number will never backfire.
>>
File: sad.png (234 KB, 888x499)
>>106681799
being a localkek is a humiliation ritual ngl
>>
>>106681813
What model? Also doing a second pass is more beneficial in general.
>>
>>106681799
Good, we already have a good set of tools, aijeets are always seeking the ultimate model that makes everything for them with fast speeds that also needs to be low on vram and needs to have an open license that grands me the rights of everything if I try to monetize it, ok? In reality all they want is an easy to use tool so they can vomit some grift content with no minimal effort.

If you can't use what you already have then I'm afraid you're just some poor vramlet with no skills
>>
so you gunna post the chinese commercial level gens or what
>>
At least 15 of the last 25 posts were made by the same 1 person samefagging who is almost certainly a vramlet and wouldn't have been able to run the 1080p model even if it was released.

Probably a shitskin who sees a sentient piece of poop when the look at themselves in the mirror too

>>106681813
Usually by the time you're doing twice the recommended you're doing enough or too much. So 40 for sdxl, 100 for wan etc
>>
>i-i love sd 1.4! it's all we ever needed!
holy localcope
>>
>>106681718
>there's less than 200 images on both e621 and Danbooru.
Total Drama is a western series. You'd probably find better stuff on reddit and unironically Tumblr and shit like that than boorus
>>
>>106681856
Poorfaggot nomodel reslet cope. you lost to api
>>
File: 1663788336970283.gif (2.17 MB, 360x238)
vramlets won. it was useless to buy high high high computa, kek
>>
>>106681856
cringe
>>106681871
this
>>
File: 1737561734271477.png (1.51 MB, 1416x2120)
>>106681703
base model at least knows the character and style to an extent but obviously >>106681718 could use more data
>>106681869
yeah ill probably scrape r34 or the ones you mentioned. its the style i want more than the characters desu
>>
>>106681872
Based. All local gens from Wan 2.2 look like complete garbage, and every single thread is poorfags (5090 is poor btw) coping with quants and light/speed loras just to produce ugly plastic contrast-blown garbage. Either use the models the way they were meant to (on cloud compute with 200GB VRAM) or go back to SDXL. Nobody wants to see your ugly Qwen/Wan abominations.
>>
>>106681864
I mean it's kinda true that saas will never ever be able to do the stuff we can do on local. It's like having the strongest man alive but restricted to only lift stuff in a gym and no where else
>>
File: 1745686360238230.png (1.2 MB, 1416x2120)
>>
wan 2.2 finally works without ooming and its using very little vram FFFFFIIIINAAALLLLYYY

thank you big booty latina enjoyer for blessing my cumfartui
>>
>>106681885
he's rude but he has a point, some faggots here have spent thousands of dollars to get wobly plastic images as their best result lol
>>
>>106681875
I dont need anything else, I can generate Image, Video, Audio, Music, Lipsync,TTS locally with no censorship, API faggots think they won when in reality they will be monitored and censored with no freedom lol

>imagine celebrating using an API tool
lmao
>>
File: 1745580682056626.mp4 (1.63 MB, 976x560)
hatsune miku rides nyan cat through space

neat
>>
>>106681904
migu save us
>>
>schizoboomers fall back to 'THEYRE WATCHING ME' conspiritard theories because even they know their models are objectively just worst so it's not even worth attempting the argument
grim
>>
>>106681910
Show me SAAS tits and pussy or gtfo
>>
>>106681888
>>106681902
Just ignore the trolls
>>
>>106681876
what are you even doing just search on civitai, it has the TDI characters and style
>>
>>106681902
>I can generate Image, Video, Audio, Music, Lipsync,TTS locally with no censorship
it looks like ass though, API models have won the aesthetic battle, we only got slopped shit, but it can do plastic penis and boobs though 1!1!!!1!1!
>>
File: VirginApi.png (469 KB, 1892x1038)
>>
>>106681923
kekk
>>
>>106681917
>he thinks blurrydream looks good
man thank god I have eyes
>>
File: 1737512774180109.png (1.22 MB, 1416x2120)
>>106681916
id rather bake my own / just seeing how well base can handle it
>>
>chromakeks spend 6 months making blurry analog asian slop
>seedream aces it far better than chroma ever could
>NOOOO I N-NEVER WANTED THE BLUR IN THE FIRST PLACE!!!
textbook example of localcope
>>
File: 00073-3113839384.png (929 KB, 1344x768)
>>
>>106681856
>ranfaggot insulting other users
Why are you such an insufferable faggot?
>>
>>106681923
>no chad
>>
>>106681923
lmao
>>
>>106681952
who do you think's holding the chain? api providers are the gigachads, making models 10x better than anyone else, having full uncensored access to the weights (you just know sam GODman gooned to uncensored dall-e 3 with loras), and enslaving poorjeets and localkeks alike through API nodes.
>>
Is this a bot or are cloudkeks this delusional
>>
If it doesn't live inside my VRAM it doesn't exist to me
>>
>>106681971
this
>>
>>106681963
imagine using your gigasaas 256x b100 cluster to generate 1000 HD goonslop videos in less than a minute. api providers are the kings of the ai age
>>
>>106681971
what if I put all the model on the RAM only?
>>
>>106681796
proofs?
>>
File: 00021-3940128768.png (1.13 MB, 1192x736)
>>106681622
>>
File: Capture.png (957 KB, 893x909)
Interesting choice of preview videos
>>
File: ss.png (72 KB, 1252x186)
>>106681816
wtf
>>
>>106682009
what's wrong? it's a known meme that is remixed a lot
>>
>>106682009
wtf are mclaren doing
>>
>>106681588
Workflow?
>>
File: 1728610647680549.mp4 (1.71 MB, 560x736)
>>106681612
>>
>>106681971
so true king
princess peach's bouncy quadruple d's big butt and perfect pussy live on my 16gb of vram (and offloaded ram)
>>
4k Seedream gens living comfortably inside my machine locally diffused with Comfy API Nodes
>>
Is local really that much of a threat that they need to send shills to shit up the thread?
>>
>>106682072
Apparently
>>
>sends shills to the thread to distract while also kidnapping the best local models and mindbreaking them into API
how is saas so powerful? we lost so many of our top models to them...
>>
What happened that made these threads speed up so much in the last 24 hours?
>>
>>106682072
>taking shitposting seriously
autism is a hell of a drug, glad that Trump found the cure top kek
>>
>>106681971
ramtorchbros..........
>>
>>106682101
Wan 2.5 announced to be API only
>>
>I was only pretending
>>
Before anybody starts to have copium about Wan 2.5 getting an open weights release, I shall remind you that Alibaba recently released behind API (no open weights) their 1T param LLM (Qwen-Max), Qwen ASR, Qwen TTS, and a better API-only version of their Coding model. Some were hopeful Qwen-Max would be released since it was a Preview, but they released the final version today and it's API only. The same will likely happen to Wan 2.5.

It's ogre.
>>
>>106682122
Let's not forget what Hunyuan did with 3D models. The very second they got good enough it was API only.
>>
>>106682104
i wish ramtorch was real...
>>
>>106682122
I'm sure wan 2.5 is a giant model no one could run anyway so...
>>
>ramtorch
What's the catch?
>>
>>106682046
Kewl
>>
>>106682122
>i-i never wanted it anyway!! stop shilling!!! wan 2.2 is all we need, 2.5 is likely slop!!!
>>
>>106682107
Nuclear take: this community is the biggest bunch of entitled leeches I've ever seen. People expect all work to be done for them, all models to be released for free. I've seen redditors highly upvoted for kvetching that Flux-Dev isn't "open source" because of the non-commercial restriction. The entirety of inference / training code and support is done by like 5 people total, most of them not paid. There have also been a grand total of about 6 large-scale finetunes of open models, fucking 4 of them SDXL. There is a tiny group of people actually doing anything of note, and 99.999% of people just constantly complaining for free handouts. You guys deserve companies going API-only to milk you.
>>
>>106682101
alibaba team bought rabbi clothes
>>
>>106682133
brain-havers itt called this out months ago, but retards here insisted china would make everything local to pwn the west or some bullshit.
>>
stale pasta
>>
>>106682168
>NOOO YOU'RE NOT ALLOWED TO FEEL DISAPPOINTED THAT SOMETHING YOU ENJOYED AND ANTICIPATED IS THE COMPLETE OPPOSITE THING OF WHAT YOU WERE EXPECTING BECAUSE THEY WERE NICE TO YOU IN THE PAST.
>>
File: 1736388993490372.png (784 KB, 1257x1456)
>>106682168
-> >>106675735
>>
>>106682177
>ENJOYED
*Enjoy
2.2 is enough for me
>>
>>106682171
I was one of them. I'm pretty annoyed it happened with this timing though.
>>
>>106682177
bruh, we warned you, it was obvious that they were going to go for the API route if they happened to make a SOTA model
>>
File: 1745293914905587.jpg (48 KB, 720x720)
Being at the top of FOSS is very important as you get insane amount of eyes on your company and free work done on your products,
and since this is an industry where winner takes all, if you aren't #1, publishing your already done work as FOSS is essentially free.

The more time passes, the more research is published and thus can be implemented, lowering the bar for effort required to publish a new good model for all people and companies that want a slice of the pie but aren't at the top, creating high pressure in the industry for someone to gain huge popularity by publishing FOSS.
Even just quickly mass training on the output of the top new proprietary model will allow you to create a model of 70-90+% of its quality for very low cost, meaning all competition will never be too far behind the top no matter what.

There must always be a FOSS king.
>>
>>106682184
>I'm pretty annoyed it happened with this timing though.
pretty much this, I think it was too soon, it's not like wan 2.5 has destroyed veo 3 or something
>>
>>
ComfyUI needs to be removed from the OP. why do you retards insist on shilling a platform that continues to push for apis, logins, and cloud workflows (monitored). you cant cry about api models while actively shilling them in the op
>>
>>106682168
not reading all that
>>
Reminder that we are fucked on the music gen front too. AceStep 1.5 will be the final open weights model they will release, at least the dev said so on their Discord server. He implied that 2.0 and beyond will not be open.
>>
>add one (1) extra lora to my wan 2.2 workflow making it 2 including lightning
>OOM's every time now
oh.
>>
>>106682197
Saved. Always an inspiration. Thanks.
>>
>>106682223
just offload a bit more to your ram
>>
How do you guys upscale your videos?
>>
>>106682006
They deleted the lora kek
>>
>>106682245
topaz, esrgan
>>
>>106682188
>88
trvth statvs:?
>>
>>106682245

i dont but ffmpeg -s
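
For the plain ffmpeg route, a hedged sketch: `-s WxH` sets the output frame size, and the `scale` filter does the same job with control over the resampling algorithm. This is ordinary resampling, not a learned upscaler like ESRGAN, so it will not invent detail. The function names and the 1920 default below are made up for illustration, and it assumes `ffmpeg` is on your PATH.

```python
import subprocess

def build_upscale_cmd(src, dst, width=1920):
    """Build an ffmpeg command that rescales a video to `width` pixels wide with Lanczos."""
    # scale=W:-2 keeps the aspect ratio and rounds the height to an even number,
    # which most H.264 encoders require.
    vf = f"scale={width}:-2:flags=lanczos"
    # -c:a copy passes any audio stream through untouched.
    return ["ffmpeg", "-i", src, "-vf", vf, "-c:a", "copy", dst]

def upscale(src, dst, width=1920):
    """Run the command; raises CalledProcessError if ffmpeg fails."""
    subprocess.run(build_upscale_cmd(src, dst, width), check=True)
```

Usage would be something like `upscale("gen.mp4", "gen_1080.mp4")`, with both filenames being placeholders.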
>>
>>106682202
Cum UI has overstayed its welcome, dev is a giant faggot.
>>
File: 1749060083177816.jpg (1.04 MB, 1080x1696)
I was messing with my controlnet a while ago and messed up my install. I think I moved the tile model over to an Invoke install and I just noticed that the "tile_resample" preprocessor vanished back in Forge. How do I get that back? What do I rename my tile.safetensors to and where should I put it?

I'm a very stupid idiot so hopefully one of you guys can help me.
>>
>>106682234
oh yeah lol clip loader has an offload thing thanks if this doesnt work im pulverizing my frontal lobe then downloading a lower quant
>>
>>106682035
https://files.catbox.moe/6hoe9a.json
>>
How do I go on living knowing that the direct upgrade to the model I enjoy the most just got locked up behind an API and likely will never leave their servers again?
>>
>>106682294
You don't, there should be a rope in the shed anon, good luck
>>
>>106682294
https://docs.comfy.org/tutorials/api-nodes/overview
transition now while you still can.
>>
>>106682290
NTA but much appreciated man
>>
File: 1757310971354749.jpg (548 KB, 832x1216)
>>106682282
and I'm going to post elves every now and then until I get help.
>>
>>106682188
I am actually surprised and grateful we got as many great open weights models as we did. The fact that some of these companies did a 180 and became API only is only surprising for the fact that they actually took as long as they did to pull that move.

But I believe we are only in this situation because Nvidia has virtually no competition in the GPU/accelerator space, and also the fact that distributed training still isn't a thing (and I don't think it ever will). So we will forever only get good large AI models out of the goodwill of these companies, or the occasional furfag with lots of money to spare like Lodestone/Chroma (and even there, it was just a fine-tune).
>>
>Qwen Image Edit -> Goes from shit to still shit but better
>Youtubers: "OMG, THIS IS IT, NANO BANANA AT HOME"
https://www.youtube.com/watch?v=YDJ9TEgcWPU
bruh...
>>
>>106682317
based
>>
File: 1750548043234657.mp4 (1.13 MB, 912x720)
>>106682197
>>
File: 1750660121258949.jpg (529 KB, 768x1280)
>>106682335
No you don't understand. It's a bad thing. I'm going to post low resolution elves because I can't upscale them without tile_resample.
>>
>>106682332
In its defense it is crazy good at posing people and keeping their likeness provided you have only the reference and the controlnet. Like absurdly good.
That point got overlooked because they also introduced a half-assed multi-subject feature that doesn't work at all.
The single reference stuff is amazing though.
>>
>>106682348
your lowres gens are better than some anons highres gens
sorry i wish i could help i dont use forge
>>
>>106682348
Wouldn't this be faster if you just uninstall controlnet extension and re-install?
>>
>>106682354
What is lowres?
>>
>>106682224
Thanks
>>106682346
Neat
>>
>>106682197
Damn! That shit looks sick!
>>
>chro-magnons seeing sdxl waislop for the first time
>>
If the next Qwen-Image turns out to be good and unslopped but API-only, it would further rub salt in the wound. I would be so upset I'd avoid local for a long time
>>
File: 1758234680989051.mp4 (3.32 MB, 912x720)
>>106682376
Have a 5 second one as well. The left hand broke unfortunately.
>>
File: 1730679843027070.jpg (1.35 MB, 1352x1440)
>>106682354
thanks

>>106682366
I tried that but it says it's built into forge so it wasn't in the extension list. I might have to do a clean install.
>>
File: combine_00003.mp4 (2.52 MB, 1920x640)
Wan2.2 loras are completely broken on native comfy nodes, run the default template lightx2v example and you'll see. Meanwhile on KJwrapper nodes things are working as expected.
Picrel Example template - Comfy nodes - KJ nodes
Can someone please run the template workflow and prove I'm not just being retarded? I've been driving myself insane with this for the last couple days
>>
File: combine_00005.mp4 (3.63 MB, 1440x1280)
>>106682430
Also the problem gets MASSIVELY exacerbated at 720p
Comfy - KJ
>>
>>106681286
Is there an actual quantization of qwen-edit for 12gb vramlets out there?
I tried whatever could fit from huggingface or civitai and it spat out errors that model type wasn't recognized.
Both gguf and safetensors. I made sure to link the text encoder and VAE too.
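
Back-of-envelope check for whether a quant can fit at all: weight size is roughly parameter count times bits per weight. The sketch below is only that arithmetic. The 20e9 parameter count and the bits-per-weight values are assumptions for illustration (check the model card and the quant's actual file size), and it ignores the text encoder, VAE, and activation memory.

```python
def weight_size_gib(num_params, bits_per_weight):
    """Approximate size of the model weights alone, in GiB."""
    return num_params * bits_per_weight / 8 / 2**30

if __name__ == "__main__":
    ASSUMED_PARAMS = 20e9  # hypothetical figure, not taken from the thread
    for bits in (16, 8, 4.5):  # illustrative precisions, not exact quant specs
        print(f"{bits:>4} bits/weight -> {weight_size_gib(ASSUMED_PARAMS, bits):.1f} GiB")
```

If the estimate lands near or above your VRAM, you are relying on offloading part of the model to system RAM, which is a separate problem from the "model type wasn't recognized" error.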
>>
>>106682440
>that model type wasn't recognized.
How old is your comfy because this was fixed when the first qwen came out
>>
File: 1742214474849353.jpg (1.07 MB, 1080x1568)
>>
File: 1756463757347617.mp4 (1.91 MB, 720x1040)
>>106682317
>>
We doing FF Tactics now?
>>
>>106682317
Why do I think she's cuter than she is sexy?
>>
>>
File: x9.png (2.21 MB, 1316x1367)
>>106681424
They ran out of money or whatnot. It's quite telling looking at the ancestry chart at https://www.neta.art/blog/neta_lumina/ and their FAQ response.
>Why did we choose to continue training from AES-1-e100 instead of RAW’-e7?
>We believe that adopting a training curriculum that alternates between large and small datasets (a “large-small-large-small” strategy) yields better overall model performance.
>Switching to AES-1 let us complete the final “large-small” stages and ship the best trade-off we could (stability, looks, short-prompt performance).
>If you are interested in training with more abundant GPU resources, RAW’-e7 checkpoint is a strong raw starting point.
I really also take issue with the fact that 1.) The Lumina folks didn't open source their captioning software and 2.) Neta Lumina didn't even try and replicate that captioning structure and instead decided to implement their own without the structure that was used in the base model.
{
  "gemini_caption_v10": {
    "master_player_detailed_caption_en": "",
    "compress_nl_en": "",
    "Tag_mix_sentence_en": "",
    "Medium_caption_en": "",
    "short_summary": "",
    "designer_caption_en": "",
    "structured_summary_en": "",
    "midjourney_style_summary_en": "",
    "chinese_translation": "",
    "midjourney_style_summary_zh": "",
    "designer_caption_ja": ""
  },
  "wd_tagger": "",
  "wd_tagger_metadata": {
    "character": [""],
    "series": [""],
    "artist": [""],
    "rating_tag": [""],
    "quality_tag": [""]
  }
}

Like yeah, no shit, half of the training time was spent on the model trying to reconcile your new structured captioning with the base model's trained captions. The minimal change would've been to just add a ja version of the long, medium, short and tag captions and then put booru tags in all languages inside those 4 prompts. Instead, you throw another 4 captions at the EN side, leave the Chinese and Japanese sides inadequate, and add in booru tags unnaturally.
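To put a number on the language imbalance, a quick sketch that tallies the caption fields from the structure above by language suffix. The field names are copied from the post; bucketing by `_en`/`_zh`/`_ja` suffix is my own assumption about how to read them.

```python
from collections import Counter

# Field names under "gemini_caption_v10", copied from the posted structure.
CAPTION_FIELDS = [
    "master_player_detailed_caption_en",
    "compress_nl_en",
    "Tag_mix_sentence_en",
    "Medium_caption_en",
    "short_summary",
    "designer_caption_en",
    "structured_summary_en",
    "midjourney_style_summary_en",
    "chinese_translation",
    "midjourney_style_summary_zh",
    "designer_caption_ja",
]


def count_by_language(fields):
    """Bucket field names by their language suffix (ASSUMPTION: suffix = language)."""
    counts = Counter()
    for name in fields:
        suffix = name.rsplit("_", 1)[-1]
        counts[suffix if suffix in ("en", "zh", "ja") else "other"] += 1
    return dict(counts)


if __name__ == "__main__":
    print(count_by_language(CAPTION_FIELDS))
```

Seven explicitly English fields against one each for zh and ja, with `short_summary` and `chinese_translation` carrying no suffix.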
>>
File: 1743982183164001.jpg (511 KB, 1184x1592)
>>106682474
Man, video is getting really good. I wish I didn't cuck myself with a 4070ti.

>>106682476
The FFT style is like a black hole for me, it always pulls me back. I've probably unironically generated 100k images using loras from it and related styles.
>>
>Qwen 3 max: API only
>Wan 2.5: API only
just another sad day for local models
>>
File: 86677671.mp4 (3.68 MB, 960x1216)
>>
File: 1729027892854092.jpg (578 KB, 832x1216)
>>106682483
it's the large breasts
>>
>>106682529
I don't think so, I call them "hug breasts" because I wanna hug them.
>>
File: 1744197414002039.png (623 KB, 832x1248)
>>
File: 1733675960643985.jpg (924 KB, 1024x1504)
>>106682539
"hug breasts" is a working prompt that I've never tried before, thanks anon
>>
>>106682439
soul to the left, fried trash to the right. Which way, brown man?
>>
>>106682569
You're welcome. It's a joke I created with a friend and it somehow works depending on the model.
>>
>>106681424
The Yume version is still being worked on, but more to stabilize the results than to finish up the incomplete training it has. That being said, V3 is pretty usable now.
>>
>>106682577
>my cum-smeared finger slipped on the keyboard, misspelling the prompt

Happy little accident, coomer.
>>
>>106682596
Nah, it's a joke with a friend making fun of another friend.. I was drunk and accidentally typed "huge breats", and we kept saying it.
>>
File: 1742208840218791.jpg (586 KB, 832x1216)
>>106682596
I've typed "huge breasts" thousands of times and never made the typo myself but I don't use semen as a prompting conduit. I have a lot to learn it seems.
>>
>>
>>106682626
how do you get it to be a scenic view? every time i try, it's zoomed the fuck in to the character.

i've tried scenic, scenery, wide shot, distant shot, but no dice
>>
>>106682596
lol spotted the non-local tranny
>>
File: 1750280277193710.png (703 KB, 832x1248)
>>
>>106682635
I used 'wide shot'. You sure there's nothing else in the prompt that's causing it? If you prompt for a term that only makes sense in a close up shot, it's going to force it in that direction.
You could try 'landscape' or 'scenery' to force a wider shot maybe, but it obviously may not be what you want.
>>
what would y'all consider to be the best local model for texture generation for use in 3D/games? i've been goofing around with SDXL and it works nicely enough, but things move quickly and i have no idea how up-to-date a lot of information online is. i have a 3060 TI.
>>
ramtorch doko?
>>
File: 1743127669521192.jpg (1.37 MB, 1184x1728)
>>106682626
That looks nice, catbox?

>>106682635
What I do is "scenery, outdoors/indoors" to get a background and then the rest of the prompt is about the character. Having "upper/full body" or "cowboy shot" or whatever helps keep it from zooming out too much.
>>
>>106682666
it's always sdxl or sd1.5. none of the models since have ever been usable practically.
>>
>>106682657
What strength? I have used "zooming in" and it's still too far away. "Zooming out" without "wide shot" is too close, and using "wide shot" has a large chance of being too far away. Yours is perfect.
>>
>>106682666
flux
>>
Does qwen edit work for neoforge?
>>
>>106682666
I'm a drunk retard and fucked up many prompts, you can get some fun failures with SDXL.
>>
>>106682676
word, okay - thank you

>>106682687
i'll give it a look, thanks

>>106682696
before i had gotten the hang of the settings and learned there were specific resolutions the model worked best at, i got lots of insane and fucked up images
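On the resolution point, a small sketch that snaps a target size to the closest aspect ratio in a list of roughly 1 megapixel SDXL sizes. The list is an assumption (commonly cited values, not taken from this thread), so swap in whatever your model card recommends.

```python
import math

# ASSUMPTION: commonly cited ~1 megapixel SDXL resolutions (width, height).
SDXL_BUCKETS = [
    (1024, 1024),
    (1152, 896), (896, 1152),
    (1216, 832), (832, 1216),
    (1344, 768), (768, 1344),
    (1536, 640), (640, 1536),
]


def nearest_bucket(width: int, height: int, buckets=SDXL_BUCKETS):
    """Return the bucket whose aspect ratio is closest (in log space) to width/height."""
    target = math.log(width / height)
    return min(buckets, key=lambda wh: abs(math.log(wh[0] / wh[1]) - target))


if __name__ == "__main__":
    print(nearest_bucket(1920, 1080))  # a 16:9 wallpaper target
```

Generate at the bucket size and upscale or crop afterwards instead of asking the model for the odd size directly.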
>>
File: 1739934323244792.jpg (714 KB, 832x1216)
>>
>>
>>106682679
I usually avoid strength > 1 most of the time, only as a last resort.
Use 'wide shot', but then add some terms that describe the character's expression or finer details - the model should be forced to zoom in a bit in order to render those details. Failing that, maybe try 'very wide shot' with a strength of like 0.2 to 0.4 in the negative prompt.
It can also be model or prompted artist dependent, like the image here - same prompt mostly, but different model and artist.
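A minimal sketch of the weighting advice above, assuming a UI that takes the `(term:weight)` emphasis syntax. The helper names and the example character tags are made up for illustration.

```python
def weighted(term: str, weight: float = 1.0) -> str:
    """Format a term with (term:weight) emphasis; leave it bare at weight 1."""
    if weight == 1.0:
        return term
    return f"({term}:{weight:g})"


def build_prompt(terms) -> str:
    """Join (term, weight) pairs into a comma-separated prompt string."""
    return ", ".join(weighted(term, weight) for term, weight in terms)


if __name__ == "__main__":
    # 'wide shot' at default strength plus detail terms that pull the camera in.
    positive = build_prompt([("wide shot", 1.0), ("smile", 1.0), ("detailed eyes", 1.0)])
    # Low-strength 'very wide shot' in the negative, per the 0.2 to 0.4 suggestion.
    negative = build_prompt([("very wide shot", 0.3)])
    print(positive)
    print(negative)
```

Start at the low end of the negative weight and only raise it if the subject still ends up too far away.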
>>
>>106682731
Greta-anon...
>>
This place feels like purgatory now that API won.
>>
File: 1755140067960394.png (569 KB, 832x1248)
>>
File: 00029-1815616887.png (1.47 MB, 1024x1024)
>>106682706
I accidentally made wall textures trying to gen images of some characters.
>>
amd is never going to fix 6950xt performance with diffusion, so I have to wait to buy something else to bother.
>>
>>106682754
It was very wise of you to type "amd" as the first word in your sentence posted in an AI thread so people know it's safe to skip the rest of it
>>
File: beach.jpg (2.56 MB, 2462x1382)
>>106682731
Is this good enough? I did this one a long time ago.
>>
File: 1735086161544492.png (583 KB, 832x1248)
>>
Is Qwen Edit the best local model we have so far?
>>
>>106681286
So I haven't done local genning in like 2 or 3 ish years. I want to get back into it a little bit to make some wallpapers for my tablet and TV and phone.
Last I was big into it, SDXL was just coming around. I used A1111 for forever and started winding down around control net being a thing.

I briefly took a look and saw SDXL got a Pony model or something and that reForge is sort of the A1111 successor.

I tried comfyui briefly. Got some short video gens. Cool for that but still an A1111 guy.


What are the big new things I should know about or update?
On a 3080 TI if it matters.
>>
>>106682734
it's basically just a cool retro hobby hangout zone where us boomers can talk about outdated vintage tech.
>>
File: 1738133978830396.jpg (638 KB, 832x1216)
>>106682796
Noob or Illustrious is like a better base model than Pony was.
>>
File: 1756797630204381.webm (1.85 MB, 768x1024)
from the river to the beverly hills pool

>>106682006
lol was this just an abby shapiro lora
sucks that there's not really any true bimbo loras for wan. the bimbo lips lora is okay but tends to go too crazy even at low weights

>>106682171
>retards here insisted china would make everything local to pwn the west or some bullshit
wan 2.5 is only SaaS because it pwns china, if veo 3 was an option in china it would have been open sourced

what if i told you that they DID pwn the west by making everything local? think about it, midjourney et al are not making their own base models but just finetuning wan to compete. the demoralization of the superiority of wan, and the opportunity cost of making a new base model versus just finetuning an apache 2 model, combined with the fundamentals of short-sighted venture capitalism has destroyed open model development in the West that isn't absolute meme companies like Mistral (who arent even that open) or Meta (who can't make a good model to save their lives even when offering researchers literally a billion dollars to work for them)
>>
>>106682832
>midjourney et al are not making their own base models but just finetuning wan to compete
absolute schizojeet headcanon
>>
You are supposed to tag accordingly at training. But what happens if you tag one thing with an unrelated tag? Will that tag morph into the thing in the dataset, or will it get ignored?
>>
File: 1746109585265728.png (489 KB, 1248x832)
>>
https://huggingface.co/nunchaku-tech/nunchaku-qwen-image-edit-2509/tree/main

NUNCHAKU BROS, WE WON!!!
>>
>>106682783
It's good enough as long as you're happy with it.
>>
>>106682852
So how do these versions differ from each other?
>>
>>106682839
basic economics isn't a headcanon you communist ethopian jew, the ashkenazi princesses i generate are whiter than you

hey that's a pretty good rap lyric can someone AceStep that for me thanks
>>
schizo moment
>>
>>106682857
I am, but I think it could look better for people who want a background.
>>
>>106682783
i wonder how many [white, Western] people have actually seen the milky way like this. i have never seen more than maybe 50 stars in the sky at the absolute most in my life, and that was at a cottage 3 hours away from the major city i live in
>>
>>106682847
as long as your captions are literally random or not related to the image at all then itll be fine
obviously the better the caption is the better the output will be but foundational models themselves are kinda shittily captioned anyway
>>
>>106682876
If you live in the city, it's hard. Wait until fireworks are shot. A day after you can see many stars somehow.
>>
>>106682879
*as long as your captions aren't literally random
>>
>>106682876
Thank you for contributing to the Local Diffusion discussion.
>>
File: 1750848651245376.webm (1.5 MB, 768x1024)
>threadshitter immediately backs down from slinging (You)s
damn what a compliment

>>106682884
I'll make it out to somewhere remote in my lifetime. I want to see the northern lights, and fjords

>>106682891
Yes i was commenting on a locally diffused gen. Just because you only got electricity in your village last year and you could always see the Milky Way doesn't make my comments off topic niggerfaggot
>>
File: ComfyUI_03952_.png (3.35 MB, 2560x2560)
>>106682891
lol I say "Kill'em with kindness". They'll get so angry and call us "trannies" or "eurocucks". I laugh my ass off at how butthurt they get.
>>
real schizobrown hours
>>
>>106682914
Good for you anon! That shit looks nice. I've only seen something like it once in rural Pennsylvania on vacation with my folks.
>>
>>106682914
I'd venture to say you're the brownest of them all considering how often you hurl that insult at others. Like a "whoever smelt it, dealt it" kinda thing. But that's neither here nor there.
>>
File: 1758077029263568.mp4 (825 KB, 480x832)
t2v fun times
>>
>>106682925
>I'd venture to say you're the brownest of them all
toddlercon is a white man's fetish. of course if you doubt my passion for the beauty of adorable little sweethearts i can queue up some stuff right now
>>
nobody asked lil brownysaar
>>
>>
i dont understand how sometimes an anon gets a few compliments, a few catbox requests, and suddenly they are compelled to treat this thread like a blog of sorts
>>
>>106682932
Mr President, another /ss/ raceplay enthusiast has hit the general

i'm gonna need a whole series of this, and i'm gonna need you to replace the boards 4chan org with awoo dot cf (thats awoo(.)cf) so you can keep sharing those thick black butts

>>106682946
the actual problem is that 4chan should have never had a Name field.
>>
File: 1739868417330504.jpg (1.21 MB, 1728x1024)
>>
>>106682914
It's obvious you know you're posting off topic with that reply kek


