[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


🎉 Happy Birthday 4chan! 🎉


[Advertise on 4chan]


I'm Sore Too Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106751247

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
!:FOREWARNING:!

The past three threads have been full of trolls. Tread lightly, anon.
>>
>Wan2.5 API only
>Hunyuan 80b too big to run + no comfyui support
>Sora 2 drops
heh.... local victory any day now.........
>>
>>106753029
:(
>>
File: realmiku.png (1.18 MB, 1192x736)
1.18 MB
1.18 MB PNG
>>
>>106753029
>too big to run
even if it was the right size, no one would want to run it, it's slopped, and no one ran that HunyuanImage 2.1 model because of that, those chinks need to understand that they need a more serious dataset if they want to reach the next level
>>
>>106753010
I claim ldg for nofap.

>>106753045
wipe your face off girl
>>
Blessed thread of frenship
>>
File: WanVid_00002.webm (495 KB, 560x800)
495 KB
495 KB WEBM
finishes my work, but now I need sleep
>>
>>106753029
I'm having a lot of fun with Sora 2 right now, and the feed of other people's gens are hilarious. I'll be stuck on this until they censor it, then its back to Wan.
>>
>>106753029
>>Wan2.5 API only
I really don't get that move desu, they will be destroyed by Veo 3 and Sora 2
>>
>>106753048
Slop is beautiful.
>>
>>106753084
It's a ComfyUI powerplay, just look at all the ads and dicksucking he did after it went API only. This was part of the $17m investment agreement. The same investors blew 100x that investing in API-only models that failed to make profit. Now they're banking on Comfy to make API profitable.
>>
>>106753073
I DO NOT FEEL SAFE ANON I DO NOT FEEL SAFE
>>
>>
Serious question. Does Invoke suck? I'm on Linux.
>>
>>106753220
Here’s the black pill about all this stuff that some of the turbospergs in these threads hate to hear, all of these interfaces suck. They’re all shit to varying degrees and in different ways. So if it does what you want it to and you can get on with it well, use it.
>>
File: n30moanon.gif (3.43 MB, 540x469)
3.43 MB
3.43 MB GIF
>>106753022
the duality of man! ;3
>>106753061
OF friendship thread, blessed
>>
>>106753094
>>106752393
>>
>>106753235
I was using comfyui previously, not sure if I want to go that route again.
>>
>>106753276
Put on leggings, jogging pants, or a longer skirt. 4 inches below the knee.

This is [NOFAP COUNTRY]
>>
>>106753358
it just turned into an API wrapper anyways
>>
File: 1751550110596879.mp4 (777 KB, 640x640)
777 KB
777 KB MP4
will sam altman let you do this? didnt think so
>>
File: 1746761650496566.png (1.96 MB, 1536x1536)
1.96 MB
1.96 MB PNG
>>
File: 1728948717111829.png (2.14 MB, 1536x1536)
2.14 MB
2.14 MB PNG
>>
>>106753481
apparently he's allowing celebrities again, since he literally added his face too
>>
File: 1749841912069919.png (2.59 MB, 1536x1536)
2.59 MB
2.59 MB PNG
>>106753525
>>
File: 231878417988136.jpg (465 KB, 1496x2024)
465 KB
465 KB JPG
>>
This is probably cope but Sora 2 being really good may convince Alibaba to release the weights of WAN 2.5.

I'm not 100% convinced they're API only yet anyway.
>>
>>106753600
nah, if they decided to put this shit on API while knowing Veo 3 exists and is way better, it means that they still want to be destroyed or whatever
>>
>>106753530
Sora for some reason has always been a lot more lax than 4o through ChatGPT. You see celebrities and copyrighted shit on there all the time
>>
>>106753575
localkek sissies...
>>
uh oh... blurrychromasfeetschizo won't like that...
>>
>>106753634
Sora has a different team behind it and an 18+ service level agreement to use it, that's why.
>>
>>106753600
they can't go back to local. they've already ruined their reputation. besides, people have already spent money on their api. let them just get humiliated by american studios
>>
>>106753640
local needs generated ytp damn
>>
File: 1733104881169168.mp4 (1.54 MB, 640x352)
1.54 MB
1.54 MB MP4
>>106753640
https://files.catbox.moe/1seqwp.mp4
>MLP x FF7
so much kino...
>>
>>106753640
Anime ones are legit bad but the gameplay ones are pretty kino. Not gonna get me again though made the mistake with 4o 1st time around before they cucked the shit out of it.
>>
File: Sora is the GOAT.png (72 KB, 220x214)
72 KB
72 KB PNG
>>106753640
IT CAN DO YOUTUBE POOP? LMAOO LOCAL IS FUCKING DEAD
>>
lmg doesnt get the same trolling desu
>>
>>106753703
>Not gonna get me again though made the mistake with 4o 1st time around before they cucked the shit out of it.
yeah that's the problem with API models, since it's not on your computer they can cuck the shit out of it and you don't recognize it anymore
>>
the difference is not the trolling but the trolled.
>>
>>106753220
If Invoke had the same level of community support Comfy does it would be the best.
>>
the crossover between the two is very high
>>
>>106753640
https://files.catbox.moe/wiqjfo.mp4
So I guess that settles it, One Punch Man is the best fighter in the universe
>>
File: RA_NBCM_00029.jpg (1.04 MB, 1872x2736)
1.04 MB
1.04 MB JPG
>>
>>106753640
Judging from the open weight models coming out of China, these videos aren't aesthetic enough, therefore they remove it from the dataset. But hey, at least the bench scores are through the roof!
>>
>>106753762
which model for gundams?
>>
>>106753772
>Judging from the open weight models coming out of China, these videos aren't aesthetic enough, therefore they remove it from the dataset.
https://youtu.be/37XeFwHi3mU?t=10
>>
>>106753772
> these videos aren't aesthetic enough, therefore they remove it from the dataset
I would say good because they should stop using synthslop but alas I know Chang's mind is set on that.
>>
>>106753640
>>106753775
omg it migu and pepe!
https://files.catbox.moe/zynsee.mp4
>>
>>106753743
this gen looks fucking better than all of season 2, of one punch man anime
>>
i sure hope it's safe though
>>
So what happens now?
>>
>>106753820
we cope with chroma
>>
>>106751840
>be scam altman
>no violence
ACK >>106753743
>>
>>106753773
It's just Noob with some mecha artist tags thrown in the prompt
>>
File: 539977816130.jpg (542 KB, 1496x2024)
542 KB
542 KB JPG
>>
It's refreshing to see non-slowmo videos
>>
>>106753640
Why does Sam Altman always have to be the guy to innovate? Is local even trying?
https://files.catbox.moe/isse3d.mp4
>>
>>106753831
prompt "sam altman sucks MS dick for a shekel"
>>
>>106753726
uh huh. sure.
>>
>>106753844
>all the compute in the world
>can barely do more than wan 2.2 + talk
wow...

and it's censored...and limits your prompts...whoa...
>>
>>106753801
Unrelated to the thread but you just reminded me that the guy in charge of season 3 already came out and announced that fans should lower their expectations because he’s publicly telling them season 3 will not live up to season 1. No “it might not” or “we hope”, straight up it won’t live up to it lol. and may be worse than season 2. What kind of cucked backwards Japanese cultural bs is that lol.
>>
File: 1742054549373949.png (905 KB, 798x1327)
905 KB
905 KB PNG
>>106753844
kek
>>
File: 1744720542029542.png (2.08 MB, 1536x1536)
2.08 MB
2.08 MB PNG
>>
>>106753839
>it’s just noob
Bullshit, what’s your post processing? You’re telling me it spit out such a clean result just from your prompt?
>>
>>106753855
>can barely do more than wan 2.2+ talk
bait used to be believable
>>
>>106753866
kek noob is the best anime model anon come on now
>>
>>106753869
openAI has a literal shitload of compute and i'm not blown away, this isn't even better than wan 2.2 i2v in many cases

it even fucks up text
>>
>>106753869
it's not bait, it's localcope
>>
>>106753871
Must be some serious skill issue cause I can’t get it to produce nice clean sharp results like that. You didn’t img2img upscale or anything?
>>
File: oh no no no noooo.png (208 KB, 686x386)
208 KB
208 KB PNG
>>106753844
>Is local even trying?
SILENCE WESTERN DOG, EAT MY SYNTHETIC SLOP AND BE HAPPY!
>>
File: 1752656112820440.png (2.36 MB, 1536x1536)
2.36 MB
2.36 MB PNG
>>106753863
>>
>>106753869
Bros mad nobody wants to send him an invite code.
>>
>>106753881
Train the models with outputs from worse models, what could possibly go wrong?!?
>>
>>106753876
>this isn't even better than wan 2.2 i2v in many cases
it's true that I haven't seen any i2v gens from sora 2 so far, how do they look like?
>>
>>106753880
look at his res of course he upscaled and did a second pass
the only time you dont need that is if its simple and or bold lineart stuff
>>
>>106753877
>over 100000 high end AI gpus
>not exponentially better than wan 2.2
that's bad anon.
>>
>>106753894
As a certified phoneposting shitter all I see is file size and name
>>
I member when it was controversial to say training on synthetic outputs is bad.
>>
>>106753888
>oh man I can't wait to make censored slop
>porn
no, go prompt sam altman sucking off israel for shekels, see what it says.
>PAID censorship
lmao, the state of GPUlets
>>
>>106753911
also, despite Elon only giving a shit about stuff like grok recently, it's beating GPT5 in code and other benchmarks

Scam Altman is a RETARD.
>>
>>106753901
>>106753911
>couldn't wait a bit more than the 1 minute mark to seethe again
come on anon, you'll get that invite code soon, don't be jealous like that
>>
File: video arena 2.png (106 KB, 1422x1116)
106 KB
106 KB PNG
it's exponentially better than wan2.2. in fact, almost everything is. this reminds me of when keks coped over de3/4o by saying sdxl could do all that with controlnet. you just have to accept that, like all things, local models eventually become outdated. sora 1 is outdated and shit compared to wan but now wan is outdated and shit compared to sora. no need to cope and pretend your melty light2x quanted outputs are anywhere near as good
>>
its over, sam won
https://files.catbox.moe/6uyqxf.mp4
https://files.catbox.moe/g1nw2g.mp4
https://files.catbox.moe/3jmasp.mp4
https://files.catbox.moe/5bvruz.mp4
https://files.catbox.moe/1seqwp.mp4
https://files.catbox.moe/wa72uc.mp4
https://files.catbox.moe/2uoi3q.mp4
https://files.catbox.moe/os8t5k.mp4
>>
>>106753923
so far I'm unsure if it's better than veo 3 or not, but I think they're really close, I'll give an edge for sora 2 because it can do cameo shit and it looks less slopped
>>
>>106753922
imagine actually paying for censorship or limited prompts
>insert $5.99 for your Miku singing video, sir
>>
>>106753029
>heh.... local victory any day now.........
Before this local was king though. That's just how it works, they pass the torch back and forth.
>>
https://files.catbox.moe/odbake.mp4
https://files.catbox.moe/wiqjfo.mp4
https://files.catbox.moe/syu0xw.mp4
https://files.catbox.moe/wgeck8.mp4
https://files.catbox.moe/ede7y0.mp4
https://files.catbox.moe/lrh3yl.mp4
https://files.catbox.moe/4m6wn4.mp4
>>
>>106753934
except now ComfyUI steals the torch before it can be passed back to local. see: wan2.5
thank you api nodes!
>>
>>106753928
yeah we already talked about it -> >>106753640
>>
>>106753923
>it's better
it's SAAS shit and paid. And censored. And you are at the mercy of that fag altman.
>>
https://files.catbox.moe/zynsee.mp4
https://files.catbox.moe/7lmv0x.mp4
https://files.catbox.moe/8xgejs.mp4
https://files.catbox.moe/isse3d.mp4

>>106753942
eh, reposting anyways
>>
>>106753928
>horsefucker
opinion ignored.
>>
LMG does not get this same type of posting lel
>>
>>106753940
I completely lost interest in cumfart. it's a whore for telemetry and memory fuck-ups
>>
>>106753951
>not a horsefucker
opinion ignored
>>
Is sora 2 the new king? Is local now occupied gaza?
>>
>>106753932
>limited prompts
there's nothing more limited by local models, with sora you can literally make an avatar scene
https://files.catbox.moe/lrh3yl.mp4
what can local do? low poly 3d miku? that's all it can do
>but muhh coomm
shut the fuck up and try to understand not everyone want to use local models just for this shit
>>
what's the point?
>>
>>106753934
openAI have literal football fields filled with H100s and other shit, and the best that compute could do is 10 second videos that wan 2.5 can also do, locally.

if openAI is so good why are they losing to everyone in AI benchmarks? GPT5 sucks more dick than sora.
>>
>>106753960
for the (you)s
>>
>>106753923
It's better of course but come one we done this song and dance with all the saas model. They initially released free of all chains to generate hype than lock it up harder day by day till it's unusable slop machine. It's dumb to fall for the same tactic multiple times.
>>
>>106753938
it's impressive how Sam doesn't give a fuck about copyright shit, they have way more balls than the localkeks
>>
>invoke
lmao

>bitsandbytes library load error: Configured ROCm binary not found at /media/[path]/Invoke/.venv/lib/python3.12/site-packages/bitsandbytes/libbitsandbytes_rocm63.so

heheheheheh this is great lol hehehehe wow such combuddder
>>
>>106753964
> that wan 2.5 can also do, locally.
Based API nodes enjoyer. But don't worry, Sora 2 will be local soon once it's out of beta and added to ComfyUI API collection
>>
File: 1732935096848654.mp4 (711 KB, 672x480)
711 KB
711 KB MP4
wan is better, cause I dont have to pay fag altman a dime.

FUCK saas.
>>
>>106753964
>and the best that compute could do is 10 second videos
millions of people are using their services, of course they can't go for 1 mn videos, I thought it was quite obvious no?
>>
https://files.catbox.moe/xoq6md.mp4
>>
>>106753974
Curious can any sorafags do this?
>>
>>106753911
Even local models will prevent you from generating this degeneracy.
>>
>>106753971
found it. And so apparently all is fucked as per the usual with amd

https://github.com/bitsandbytes-foundation/bitsandbytes/issues/1608
>>
>>106753938
make clopper smut. go on... DO IT
>>
>>106753974
This. All my money goes to timmy tencent and Comfyorg instead. Wan 2.5 is crazy good on the API right now
>>
>>106753982
>Curious can any sorafags do this?
it's already done, with One Punch Man and San Goku
https://files.catbox.moe/wiqjfo.mp4
>>
>>106753982
theres like a dozen of those on the sora page
>>
File: 1752012712737938.mp4 (1.14 MB, 672x480)
1.14 MB
1.14 MB MP4
>pay for our prom-
>ACK!
>>
>>106753010
Is this ai slop?

https://www.instagram.com/thegraciehiggins/
>>
>>106753992
I mean punching altman
>>
>>106753995
yes anon kek
>>
>>106753992
Oh what the fuck. I just woke up and found out that SaaS can do THIS?
Fuck local. How the fuck haven't you all just roped yet?
>>
(invoke)

gonna install stable.

Also, I tried installing on another drive. I guess it has to be your primary drive, or maybe so.
>>
File: 1735712310819105.jpg (650 KB, 2000x2000)
650 KB
650 KB JPG
>>106753974
>wan is better, cause I dont have to pay fag altman a dime.
that's not a fucking argument, you have to pay thousands to daddy Jensen to run a subpar model (relative to Sora 2)
>>
This content was flagged as inappropriate and dangerous.
>>
File: 1750099465310681.png (3.56 MB, 1416x2120)
3.56 MB
3.56 MB PNG
>>
>>106754000
counterpoint, maybe she has a dslr with high dynamic range, and a bio-mod that locks her spine to the camera.
>>
>>106754001
>I just woke up and found out that SaaS can do THIS?
I want to root for local but at this point we're getting humiliated, and we're not reaching this level pretty much soon >>106753928
>>
>>106754005
I dont have to spent a penny to a company. I pay for my hardware and electricity. I win, people putting money into a website for prompts are retards.
>>
>>106754006
>This content was flagged as inappropriate and dangerous.
Welcome.
>>
>>106754005
>bam
>gamers wanna suck me for my paper
>>
https://files.catbox.moe/f49yc8.mp4
>>
>>106754016
its the most ai face ai face. also, if you want nodes use comfy invoke sux
>>
>>106754020
>I dont have to spent a penny to a company.
you spent a lot of pennies to Nvdia lol
>>
>>106754020
*send a penny, even

why would anyone ever spend $100 on prompts when you could buy a literal NVME or something that lasts years? That's a worse investment than gacha even. At least you get a JPG girl out of it.
>>
>>106754005
delete this
>>
>>106754002
nope. invoke is kill

>bitsandbytes library load error: Configured ROCm binary not found at

hahahah I guess it's ComfyUI or bust.
>>
kijai, please implement the long video loras so i can be free from these shit generals
>>
File: 1737119629773647.mp4 (725 KB, 672x480)
725 KB
725 KB MP4
>you did business with CHAYNA? now die.
>>
[EMERGENCY MEETING]
[Trigger warning - SaaS, Ope*AI]
Alright team, it seems the new S**a 2 model is making the rounds. Lots of shills are invading our thread tonight, but let me make it clear to them that we do NOT support any API that isn't ComfyUI. So take your shilling elsewhere.
For those of us still interested in advancing this space, let this be a room for open discussion about how we can improve. I already thought of a brilliant plan to minimize costs while maximizing quality. We can just download millions of outputs from that new model and train on those. That way we already have video-text pairs and audio too! I have already sent this idea to Alibaba's Wan team, and we should be seeing massive improvements in Wan 2.6 when it launches on ComfyUI API next summer.
>>
>>106754029
I have such a love/hate relationship with OpenAI, everytime I think they're done they release an insane model, and I can only shut my mouth and clap, this is impressive
>>
>>106754031
we need government to make free GPUs for the people.
>>
>>106754030
I guess so, their retarded installer can't even find the rocm binary lmao
>>
>>106754005
It's almost like gpus have more than one use. While you pay a sub to have altman grab you by the balls. Without actually being able to gen that of course, way too unsafe, only one way ball grabbing allowed.
>>
FUCK OFF SAAS SHILLS THIS IS AN API NODES THREAD
>>
File: 1743586487616360.png (3.18 MB, 2312x1304)
3.18 MB
3.18 MB PNG
i need to find a new artstyle to train on
>>
>>106753982
https://sora.chatgpt.com/d/gen_01k6expe32e0fvzcfx55hg8c7d
>>
Holy fuck I'm so blackpilled right now I don't even want to be seen with you people.
>>
>>106754049
how much are you being paid to shill here, jeet
>>
>>106754053
>censorship is only for API
oh boy, that's why our base local models have so much concept in there isn't it? we definitely don't live in an universe where our local models can only render Trump and Miku, right?
>>
https://files.catbox.moe/1c3h2s.mp4
>>
You can't handle the truth.
>>
>>106754046
As a bonus, invoke screwed up my system. lol lmao or whatever.
>>
>>106754068
if only there was a framework that allowed literally infinite content to work with existing models

we could even call it a "lora"
>>
>>106754068
Have you tried chroma? It's insane, it even knows megumin from konosuba and a bit of shrek as well.
>>
>>106754066
desu, you have to be paid to shill such a shitty model like Chroma if I have to be perfectly honest
>>
>>106754066
NTA, but all of the outputs I have seen have been genuinely impressive and were I not aware they were AI when being shown them I would have just assumed they were edit clips. Acting like what you see here isn't impressive does nothing but prove you ideologically poisoned to the point of being unable to to recognize something impressive for what it is.

Your next argument will be.
>But it's censored.

True but it's still impressive.
>>
File: 1753704814445845.png (1.92 MB, 2312x1304)
1.92 MB
1.92 MB PNG
>>
so all this good for is lolkeks?
>>
>>106754078
>muhh lora cope
great, now go for a FF7 render style + 5 MLP characters in one video with loras, that's 6 loras that have to work together in harmony, good luck with that lmaooo
https://files.catbox.moe/1seqwp.mp4
>>
>>106754083
ive never cared for chroma, illustrious for anime (realism models are ok) and qwen and maybe flux for realism are fine.

qwen edit to me is far, far more impressive than this sora2 is.
>>
Actually, probably the stupid retards at ubuntu fucked up amd again.

Any idea if arch works better with amd?
>>
>>106754100
>qwen edit to me is far, far more impressive than this sora2 is.
I don't get the appeal of this model, it slops shit too much, unable to keep the original style
>>
>>106754098
all the compute in the world and they can't even get basic fucking text right. even people with 8gb vram can get that to work on wan q2 or q4. sad
>>
>>106754102
what about debian? ubuntu's been effing up for years.
>>
>>106753048
>those chinks need to understand that they need a more serious dataset if they want to reach the next level

Let's be frank now. The Chinks are not good at training models. Like, they lack the innovation required to train a decent model from a proper dataset. Which is why they need to cheat with synthetic datasets.

Local's only hope to catch up to Sora 2 is now another company similar to BFL (if BFL themselves don't want to give us anything).
>>
File: 1736324859253270.png (62 KB, 640x360)
62 KB
62 KB PNG
>>106754089
>NTA, but all of the outputs I have seen have been genuinely impressive
we don't do that here, in the localkek cult you're supposed to say API models are bad, and HunyuanImage slop is good!
>>
>>106754061
Tried on multiple browsers, just gives me an error..
>>
>>106754123
>he doesnt know
ermmm, we don't discuss hunyuan image here. comfyanonymous said it's not coming to comfyui nor api nodes, so that makes it a saas model aka off-topic.
>>
>>106754117
>BFL
they suck at training their models too, Flux is slopped as fuck :(
>>
>>106754117
>Sora 2
If someone leaked it, how big would it be? Could I run it on my 5090?
>>
has anyone made anything funny yet with it, all the videos linked are cringe so far.
>>
>>106754073
sounds like a MLP song lol
>>106754129
I said "HunyuanImage", it could be HunyuanImage 2.1 which is supported by ComfyUi kek
>>
>>106754126
>https://sora.chatgpt.com/d/gen_01k6expe32e0fvzcfx55hg8c7d
worked on edge, motion but silent on ff.
>>
>>106754117
You don't actually think 4o piss filter is good do you? This model seem ok at a glance but I ain't gonna judge this until a month or two from now when non-shills get their hand on it and to see if cuckman gimps the model to hell and back.
>>
>>106754131
>If someone leaked it, how big would it be?
I doubt it's a giant model like 500b, they have to deploy this shit for millions of API users, but I guess it's pretty big, like 80b or some shit
>>
File: 1730950124402030.png (3.51 MB, 1416x2120)
3.51 MB
3.51 MB PNG
>>
>>106754145
>You don't actually think 4o piss filter is good do you?
we're talking about sora 2 here, it has no piss filter
>>
>>106754073
kek, I love the lyrics, I can tell how much funnier memes would be if we were able to have a local model that produces sound
>>
This is pretty fucked desu. The anime outputs genuinely looked like anime. OAI probably going around right now signing hundreds of contracts with studios for this.
>>
>>106753968
>we done this song and dance with all the saas model.
And it will continue forever every time.
>>
File: kill me please.png (608 KB, 821x665)
608 KB
608 KB PNG
>>106754073
>>
>>106754152
That's what I said, this model looks impressive at first glance so it may be well trained but we won't know until the layman tries it. nobody apparently saw the piss in 4o until much later.
>>
File: ComfyUI_01481_.png (987 KB, 1248x832)
987 KB
987 KB PNG
>Local is dea-
Oh yeah. I just turned Trump into a labubu with Qwen image edit. LOCALLY.
Your move, shitman.
>>
Ubuntu broke my environment variable, because that's of course what an update should do.
>>
>>106754180
>we won't know until the layman tries it
dude, there are a lot of examples made by regular people in there, even the anons are doing the migu memes, it still looks good (but you have a point, it won't stay that way, OpenAI loves to destroy their API models after some time)
>>
https://files.catbox.moe/ab5mww.mp4
>>
File: 1752508169817839.jpg (940 KB, 2120x1416)
940 KB
940 KB JPG
>>
>>106754109
unbelievable.
>>
>>106754144
Ahh well tried it on edge still nothing, maybe region blocked (I am in maple land). I am gonna take your word on it that the fist made contact with face.
>>
File: 1753717799807569.mp4 (971 KB, 672x480)
971 KB
971 KB MP4
hm, anon's wan 2.2 setup seems decent

high: 2.2 high lora 1 str into 2.1 lora 3 str

low: 2.2 low lora 1 str into 2.1 lora 0.25 str
>>
File: 1753405985125478.png (100 KB, 225x225)
100 KB
100 KB PNG
>>106754193
ngl this looks better than some low budget anime, some artists should really be worried about their jobs right now lol
>>
>>106754178
nice demon hands, miku
>openAI failed the hand test
is this SD 1.5?
>>
>>106754209
holy cope
https://files.catbox.moe/syu0xw.mp4
>>
File: 1730897198939998.png (1.79 MB, 1248x832)
1.79 MB
1.79 MB PNG
>>
bro posted that same shitty miku video like 5 times now
>>
File: 1740788959408446.mp4 (987 KB, 672x480)
987 KB
987 KB MP4
>>106754201
get fucked you gay pay per prompt asshole.
>>
https://files.catbox.moe/5og55f.mp4
>>
>>106753970
>they have way more balls
A monopoly*
Sam's lawyers, partners, shills, plants are ready to bully anyone who does not comply, while he himself can do whatever he wants.
>>
>>106754225
see, you can only add Trump, that's all this local model knows
>>
>>106754223
It was the same with sneeddrum. Reposting the same gen that someone else did over and over again.
>>
File: ComfyUI_01482_.png (875 KB, 1248x832)
875 KB
875 KB PNG
Local wins again. SAAS BTFO.
>>
>>106754226
also the broke sora logo thing is due to upscaling with wan btw, showing how shit wan is at lext
>>
>>106754226
>HOMORra with a guy voice
kek that's cursed
>>
File: 1756599680550278.png (146 KB, 273x303)
146 KB
146 KB PNG
>>106754215
>>
File: Altman_001.webm (3.91 MB, 1500x1000)
3.91 MB
3.91 MB WEBM
>>
https://files.catbox.moe/btaq5y.mp4
>>
>>106754230
I can add any character by plugging a lora in. infinite possibilities while you are limited by sam and gayAI. and your wallet. enjoy paying for your vids!
>>
>>106754241
>look at that pixel, it's wrong!
where was that energy when Chroma was rendering a third arm behind the woman's back?? you think local can do better than this?
>>
>>106754207
the timing is all off but low budget anime is low budget anime ai or not. shit tier
>>
Bets on which team will top this? Can't be China, no way.
>>
File: 1746750080266171.mp4 (943 KB, 672x480)
943 KB
943 KB MP4
does it have the big guy? didn't think so
>>
File: 1751685973027207.jpg (962 KB, 2120x1416)
962 KB
962 KB JPG
>>
>>106754248
you should not be satisfied of base models that has zero knowledge, loras are fucking cope, it'll always learn the styles/characters better if it's finetuned onto the model
>>
>>106754251
only google and openai I think, hopefully in 5-10 years china will have the compute
>>
>>106754249
Chroma isn't trained by the world's largest GPU farm and billions in compute.
>>
Anyway, got my gpu working with Blender, so my actual install of rocm is fine.

https://github.com/invoke-ai/InvokeAI/issues/8590

Looks like invoke is moar trash where none of the devs have any amd gpu at all, nor have they ever owned one, or even seen one in their lives.
>>
File: 1755049558495777.png (1.97 MB, 1440x1120)
1.97 MB
1.97 MB PNG
>>106753946
>>106753938
seethe vramlet, seethe :)
>>
>>106754268
why should I give a fuck? I just want to run good models, and so far, the only good models are the API ones
>>
File: 1745498729280896.jpg (1.09 MB, 2120x1416)
1.09 MB
1.09 MB JPG
>>
>>106754263
i'm not impressed when openAI has billions in compute and aren't making something far better than wan, who are probably using smuggled 4090s on an IKEA desk.
>>
>>106754263
If you wait on a base model to know everything when new shit gets released everyday. You would wait years before you could gen a character you like. LoRAs are pretty much necessity for customization cause no way a base model can know it all.
>>
>>106754234
Why can't ai do a likeness?
>>
>>106754251
Team Portugal.
>>
File: 1748319224730209.png (121 KB, 201x251)
121 KB
121 KB PNG
>>106754272
imagine giving Jensen an insane amount of money to get enough VRAM to run HunyuanSlop 3.0 lmao
>>
>>106754281
>If you wait on a base model to know everything when new shit gets released everyday.
that's an excuse you know it, there's something between "it only knows miku and trump" and "it should know every single danbooru characters"
>>
>>106754293
Not in this general there isn't.
>>
https://files.catbox.moe/l4zm03.mp4
https://files.catbox.moe/chltah.mov
https://files.catbox.moe/nc0odu.mp4
>>
>>106754273
>why should I give a fuck? I just want to run good models, and so far, the only good models are the API ones

Chroma is still the only image model that can do proper photorealism by far. Do you have eyes? Have you seen what real pictures look like? Even GPT 4o does not compare.
>>
https://files.catbox.moe/opj4fg.mp4
https://files.catbox.moe/q80gb6.mov
>>
the question is why would you take out your wallet, to make a few videos of goku? is it worth actual money? I can gen 10000 mikus and all I pay is the normal electric bill monthly.
>>
i have to pee dont do anything while im gone
>>
>>106754312
I have not paid a penny and ive been using it for like 6 hours
>>
File: 1755086956274883.png (1.22 MB, 1080x906)
1.22 MB
1.22 MB PNG
>>106754302
>Chroma is still the only image model that can do proper photorealism by far. Do you have eyes?
do you? because even Chroma got slopped after v30
>>
Serious question, why can't diffusers just werk like llm do?
>>
chroma looks like blurry melted shit which is why you cope with 'analog' gens. it cannot do convincing photography
>>
>>106754312
>I can gen 10000 mikus
that's the problem imo, I love Miku but I have the Miku fatigue, I want to gen something else...
>>
>>106754316
you think it will be free forever? why does sam charge people for limited prompts a day? it's a business.
>>
File: ComfyUI_01483_.png (1.14 MB, 1248x832)
1.14 MB
1.14 MB PNG
>Saas cucks sweating right now after seeing this.
>>
https://files.catbox.moe/kwtso1.mp4
https://files.catbox.moe/etvzmx.mp4
>>
>>106754309
Twas foolish of me to hope for anything close to rekt vids
>>
>>106754330
Omg a local model can render Trump? NO WAY
>>
>>106754193
looks like shit
for anime these models aren't really there yet
>>
>>106754329
id pay for wan if it was worth using, why would I use a worse model, and im hoping the $20 sub is enough for some gens a month since I already do that anyways
>>
>>106754317
>>106754321
Proper photographs:
https://www.dpreview.com/sample-galleries

Let me know if you still need your eyes checked.
>>
File: 1755405652191528.png (144 KB, 498x281)
144 KB
144 KB PNG
>>106754332
>https://files.catbox.moe/kwtso1.mp4
Ok now we're talking
>>
https://files.catbox.moe/h091el.mp4
>>
>>106754351
LMAOOOOOOOO, localkeks BTFO
>>
>>106754348
>sora 2 has the old wan slowdown bug
and people said china was behind.
>>
I have not clicked on a single .webm or .mp4 file in this thread. I'm waiting for those sweet Sora catboxes. Frankly posting any Wan 2.2 right now is embarrassing.
>>
>>106754321
>chroma looks like blurry melted shit
this, I don't get why people are still defending this piece of shit
>>
File: 1734820784519506.png (109 KB, 1035x72)
109 KB
109 KB PNG
>>106754351
>trillions in compute and vram/gpus
>no money for a text encoder
>>
>>106754089
>genuinely impressive
it's alright, barely better than wan but censorship is a valid argument. telemetry and no other options than prompting is something to be concerned with as well. lastly, you will own nothing and be happy, this model will be phased out and I'd hate to be in a production using it then it evaporates before it's finished. saas is a cancer for good reasons, not shitting on the model
>>
>>106754355
why do you say that? she moves pretty fast to me
>>
>>106754361
>a text encoder
the text quality you see on the video has nothing to do with the text encoder, is this a bait or something?
>>
https://files.catbox.moe/rfy20s.mp4
>>
>>106754362
>barely better than wan
it's better than veo 3, you're coping hard son, it's one thing to hate the SaaS process (I hate it too), it's another to lie about the true capabilities of an API model
>>
back when everyone creamed over sora, wan and other models released to piss on it. will history repeat itself?
>>
Why is python used so much still?
>>
>>106754367
so why did they fail to depict a basic string of text then?
>>
File: sora text.png (562 KB, 1173x688)
562 KB
562 KB PNG
what's with the excessive amount of wanjeet cope? wan can't handle anything more than text on a sign or a static banner at the bottom. meanwhile sora is generating text that sometimes only lasts 0.3 seconds then generating more unique text for the next shot. wan doesn't come close to this
>>
File: ComfyUI_01484_.png (1.05 MB, 1248x832)
1.05 MB
1.05 MB PNG
>Local only knows trump.
Saas shills on suicide wathc.
>>
>>106754368
kek, that one was funni, gone are the days of silent movies
>>
>>106754373
duh
>>
>>106754362
>barely better than wan
Shut the fuck up until you get a written declaration from an optometrist that your eyes work.
>>
File: 1728130267638889.png (799 KB, 1717x838)
799 KB
799 KB PNG
>>106754375
>a basic string of text
good luck finding a video model that is able to be so close to real text, I mean it, you won't find it
>>
>>106754281
You don't get it at all. Yeah LoRAs will be required for new concepts AFTER the cutoff date, but the model should know most things before it. Like the other anon said, the base model should know all danbooru tags by default, there shouldn't need to be an anime model at all.

It's almost 2026 and local is still struggling with this. Wtf is happening.
>>
>>106754376
>dream-sdxl
please stop inducing my PTSD
>>
openAI models are literally behind grok for code and other tests and elon hasn't given a shit about LLMs (he cares about AI, with tesla, but that's different) for many years. yet grok is winning on code tests.
>scam altman
>>
>>106754373
>back when everyone creamed over sora, wan and other models released to piss on it. will history repeat itself?
Oh god I hope so, Wan is better than Sora 1, if they can do it again my life will be fully fulfilled
>>
Python is such obvious absolute trash, I can't believe people still use this fucking garbage.
>>
>>106754388
This model knows all danbooru characters as of now? If so I will be impressed. Seems to be only the big guys, most model knows the big ones.
>>
>>106754388
>It's almost 2026 and local is still struggling with this. Wtf is happening.
localkek cope is happening, some people just can't accept the fact they are lagging behind hard, so they find excuses on why we're only getting slop models that doesn't know any characters at all
>>
>>106754388
The fact that chroma, a model 4x the size of illustrious with a more advanced architecture, still doesn't understand a single booru artist tag is baffling. What the fuck was the point of spending $150k just to produce a 512x512 'base model' that nobody will further finetune because it's not even properly coherent?
>>
File: 1758366804107653.mp4 (939 KB, 672x480)
939 KB
939 KB MP4
>wan 2.5 to embarrass altman soon
another deepseek moment inc to embarrass the Americans.
>>
>>106754370
>it's better than veo 3,
Veo doesn't have the crappy artifacting so no
>>
https://files.catbox.moe/6jh6nr.mp4
>>
>>106754402
Sora/4o knows artist tags?
>>
>>106754399
>most model knows the big ones.
lol, local base models only know Migu and Trump, I would love to render HOMOra as well >>106754299
>>
>>106754393
welcome to the anistudio waiting room
>>
>>106754412
>Sora knows artist tags?
how can it know artist tags? that's just for images, not videos
>>
IT EXISTS

https://github.com/leejet/stable-diffusion.cpp
>>
>>106754411
this is movie quality with light editing
>>
>>106754402
>What the fuck was the point of spending $150k just to produce a 512x512 'base model' that nobody will further finetune because it's not even properly coherent?
I have no idea, this guy is absolutely retarded
>>
>>106754420
Oh no he was complaining about artist tags cause I thought he meant sora/4o knew them. Damn got excited for a bit
>>
>>106754388
uh. and what API meme model will do all the danbooru tags?

most of them do or will censor the shit out of everything.
>>
>>106754412
yes, sora shows it can clearly understand the style of different animation studios, like madhouse (one punch man), shinkai, ghibli, and even western studios like mlp's.
>>
>>106754424
we know, welcome to the anistudio waiting room
>>
>>106754430
>what API meme model will do all the danbooru tags?
it can do much more than localkeks, and that shouldn't be happening at all, we don't have big enough models to compete on quality videos, but we should beat them on the number of concepts, and even for that we failed hard, we really suck
>>
>>106754430
NovelAI, they even have a custom architecture to properly segment characters to avoid prompt bleeding. Meanwhile regional prompter is 3 years old and barely functions
>>
>>106754406
It's a watermarking safety mechanic. They said it in the livestream. Thing is, Chinese researchers will assume its "bad quality", and say that their model has better quality in benchmarks.

>>106754420
It does. When the guardrails were looser at launch, the reasoning llm didn't rewrite the tags if you asked it not to. Can't do it anymore cause muh safety.
>>
https://files.catbox.moe/xipsho.mp4
>>
>>106754435
then why does the style change so much between cuts?
>>
>we
>we
>we
>we
>we
>we
>we
>>
>>106754435
yeah, it even knows Hibike Euphonium style (which is my favorite kyoani style)
https://www.youtube.com/watch?t=1181&v=gzneGhpXwjU&feature=youtu.be
>>
File: 1757033794940612.mp4 (1 MB, 480x672)
1 MB
1 MB MP4
can sora do this?
>>
>>106754441
We have models that can do all anime concepts on danbooru? What are you even arguing anymore lol
>>
https://files.catbox.moe/qmzo5d.mp4
>>
>>106754446
>we made it look shit on purpose
sorry but the shills have gone too far with this cope
>>
>>106754455
>talks about image models when we talk about competing against a video model (Sora 2)
I'm tired of arguing with retards, take your (You) or whatever
>>
>>106754454
also after some testing the anon recommendation/fix seems to work pretty decent

high: 2.2 high lora 1 str into 2.1 lora 3 str

low: 2.2 low lora 1 str into 2.1 lora 0.25 str

fixes the 2.2 speed issue a bit, better motion than before.
>>
>>106754462
That model can't do danbooru shit either? Like dude wtf?
>>
>>106754446
thing is it would be easy to fix with upscale models, so more likely its just to save compute
>>
>>106754441
>it can do much more than localkeks
where is the openpose control? what about control loras? scheduling? do you even know what local is capable of if you aren't a retarded neet?
>>
>>106754424
Holy shit, why is it impossible to just supply a binary? Every linux dev has turbo autism
>>
https://files.catbox.moe/4o5nm9.mp4
she talks to herself but damn
>>
>>106754436
:( It doesn't exist, and you know this.
>>
>>106754471
how can a video model do danbooru artist styles retard? it's only for images, are you braindead or something? I was talking about concepts only related to video like characters, or rendering style, this shit can do fucking FF7 styles, MK64 style, gta san andread style...
https://files.catbox.moe/1seqwp.mp4
https://files.catbox.moe/8xgejs.mp4
https://files.catbox.moe/1seqwp.mp4
>>
>>106754454
sora anon, prompt "anime style miku hatsune kneeling on a photorealistic george floyd"

I wanna see the model's brilliance.
>>
>>106754491
>>106754491
>>106754491
>>106754491
>>106754491
>>
>>106754478
who cares about your cope shit, I just want to prompt something and it does it, that's what AI is supposed to do, make your prompts reality, and that API model does that, if you want to spend hours working on your render because your local model is too shit to do it in one try, that's your fucking problem, but don't project your masochism to others
>>
>>106754478
nobody has used that cope since sd1.5. the "just inpaint and controlnet to get masterpiece scenes" cope existed since SDXL vs Dall-E 3. meanwhile nobody is making any complex masterpieces with it.
>>
>>106754441
danbooru, one of the many nsfw boorus? definitely not for most api models, not sora either.

>>106754445
it does not really seem to do all that well with 3+ characters and often even 2 with danbooru nsfw interactions either as far as I could tell



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.