/g/ - Technology


Thread archived.




Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106770350

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
first for anistudio
>>
>be transexual
>name your own self trAni
is lil bro fr?
>>
>>106772679
No, it's just a common tactic used by people in the same situation as him. He's a dime a dozen in other generals; there was a guy like him on /vg/ who, after getting rejected, created a spam bot that would post disgusting porn in every Guild Wars 2 thread to the point everyone gave up.
Guild Wars 2 is absolute dogshit but they didn't deserve that and he did it for years.
>>
>>106772695
no lil bro anistudio is kino af ong it has svol bro fr fr the vibes bro bro
>>
Blessed thread of frenship
>>
File: ChromaDC-2K_00027_.jpg (634 KB, 1080x1576)
>>
File: AniStudio-0003.jpg (483 KB, 768x1280)
I don't browse /ldg/ but this thread is biased
>>
>>106772731
fyck yoy bitchg mf
>>
once ani pushes the next build itll be in OP
>>
>>106772738
>gilled w/ fags
HOaW cud dis happems????
>>
File: AniStudio-00009.jpg (270 KB, 1024x1024)
>>
comfy should be dragged out on the street and shot
>>
File: 00023-1276433757.png (1.87 MB, 1024x1280)
>>
File: ChromaDC-2K_00030_.jpg (558 KB, 1080x1576)
>>
File: 00073-534935028.png (2.9 MB, 1240x1240)
Based on his behavior he most likely got banned playing on his main IP. He's not even typing properly anymore but using the same tactics, the last time he got like this it was because of the rentry being put in the old /sdg/ OP.
I'm guessing not enough anons are posting in his thread now so he's in full rage mode.
>>
>>106772773
lol
>>
File: IMG_2924.jpg (992 KB, 1106x2242)
>>106772671
>puttinng spiteful shit in the collage again
you will PAY 4 what u done
>>
>>106772786
you're flaccid af
>>
File: IMG_2925.jpg (837 KB, 1125x1408)
>>106772778
u WILL suffer
>>
>>106772778
go bump /sdg/ again real quick for me pls, at least try to be useful while lolcowing
>>
>>106772671
Thank you for baking this thread, anon
>>106772731
Thank you for blessing this thread, anon
>>
>>106772773
interesting style
>>
i wish i could make up imaginary persons in my head so i could have fun with them
>>
>>106772767
pretty gen. model?
>>
File: 00008-3184870179.png (1.25 MB, 1024x1240)
This is an educational lesson for the new posters though, the only problem is he might do this more often because his traffic is slowing to a crawl.
>>
File: AniStudio-00010.jpg (312 KB, 896x1024)
>>106772738
It's an injustice that SDNext is in the OP and not Ani, considering that most people here don't know how to code
>>
File: IMG_2918.jpg (798 KB, 1125x1348)
what do you yhink anom???
>>
File: IMG_2930.jpg (757 KB, 1125x1430)
>>106772804
bitck azs moderfyucker
>>
>>106772825
can you point out the meltdown posts so I know what to look for next time
>>
File: IMG_2923.jpg (818 KB, 1125x1073)
>i will never stop
>i will be here forever
>wayching you ruining what you love
I will not stop until this general goes away forever
>>
>>106772860
Ok but at least use a SaaS model, what model are you using?
>>
*yawn*
>>
File: AniStudio-00011.jpg (253 KB, 896x768)
>>
File: 00025-3484567926.png (1.92 MB, 1024x1280)
>>
> chroma v1 q4 at 5s/it on laptop 5070 ti 12G w/ sageattention
I should be expecting more than this right?
>>
File: 1737987888621677.png (1.23 MB, 1152x896)
WANCHADS

have you ever tested the 5b model? is it really that much worse than the big version?
>>
>>106772896
no it's slow as fuck no matter what you do
>>
File: AniStudio-00018.jpg (377 KB, 1280x1280)
>>
File: IMG_2917.jpg (626 KB, 1125x1338)
>>106772876
I WILL FUVK YPU UPP
>>
>106772910
>106772884
>106772827
stop the spam faggot
>>
File: ComfyUI_temp_snhsx_00052_.png (2.49 MB, 1568x1024)
>>
>>106772897
yeah, it's dogshit
>>
>>106772897
it may work for certain subjects but generally YES, it is much worse

>>106772896
idk your hardware or your quant but it is going to be slower than SDXL
>>
>>106772910
can it do vidgens?
>>
>>106772606
AAAAHHHH DUDE I DIDNT REALIZE THEY HAD A NEW SPECIFIC 2509 EDIT TEMPLATE
IT WORKS SO FUCKING WELL
I LOVE YOU SO MUCH AAAAAAAAAA
THANK YOU ANON THANK YOU THANK YOU THANK YOU YOUR SUGGESTION MAY HAVE SEEMED OBVIOUS BUT IT UNBLOCKED MY SHIT
YAAAA QWEN EDIT YAAAAAA AMD YAAAAAAAA FAST GENS YAAAAAAA
Love you /ldg/
>>
https://xcancel.com/javilopen/status/1973658327533101221
it was bound to happen, at some point the models will be so good it'll be able to recreate 1:1 the dataset
>>
>>106772896
what is the gpu power usage? I know some of those laptop gpus get super power limited.
>>
>>106772942
>>106772854
>>
File: AniStudio-00019.jpg (411 KB, 768x1280)
>>106772933
pretty cute, how is sharing gens from my local model spam, biased anon?
>>
>>106772954
none of those were me I have not commented on the ani autism I'm not even sure what it is
I am just very happy because anon has really helped me out with my ComfyUI issue, which is now completely resolved after I thought it was OVER.
>>
File: IMG_2936.jpg (884 KB, 1125x1433)
>>106772933
oh ehats dat u domt liek it? getting pestered and fukced with 24/7?
too bad so sad little faggot it's never gonna stop
>>
>>106772968
stop spamming lil bro
>>
>>106772964
I think the gens are very aesthetic
>>
>>106772942
sounds good? what did you achieve with that?
>>
>>106772860
>I will not stop until this general goes away forever
why do you hate this general?
>>
File: 00027-423969464.png (1.6 MB, 1024x1280)
>>
>>106772979
>>106772981
Why are there always 2 brown demons for every aryan angel in these generals
>>
File: ComfyUI_temp_snhsx_00053_.png (2.58 MB, 1568x1024)
>>
File: 1728308129246790.png (1.23 MB, 896x1152)
>>106772896
that's shockingly bad, my 7900 XTX gets 2.76s/it in chroma, completely uncompressed too. either you're doing something wrong, or the 5070 is a terrible AI card even getting mogged by AMD or a used 3090
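
For a rough sense of what that gap means per image, here is a quick back-of-the-envelope sketch. The 5 s/it and 2.76 s/it figures are the ones quoted in this thread; the step count of 26 is purely a hypothetical value picked for illustration, and this ignores model load, text encoding and VAE decode time.

```python
def gen_seconds(sec_per_it: float, steps: int) -> float:
    """Sampling time only: seconds per iteration times number of steps."""
    return sec_per_it * steps


STEPS = 26  # hypothetical step count, not taken from the thread

laptop_5070ti = gen_seconds(5.0, STEPS)   # laptop 5070 Ti figure quoted above
rx_7900xtx = gen_seconds(2.76, STEPS)     # 7900 XTX figure quoted above
slowdown = 5.0 / 2.76                     # how many times slower per iteration

print(f"laptop 5070 Ti: {laptop_5070ti:.1f}s per image")
print(f"7900 XTX:       {rx_7900xtx:.1f}s per image")
print(f"ratio:          {slowdown:.2f}x")
```

The ratio does not depend on the step count, so the laptop card comes out roughly 1.8x slower per image whatever sampler settings are used, assuming both numbers were measured at the same resolution.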
>>
>>106772991
pls we don't need to do another christ cuck hour
>>
File: 00049-2569226510.png (3.05 MB, 1432x2144)
>>106772997
repent!
>>
File: IMG_2933.jpg (600 KB, 942x1138)
you will atone for what you have done you will pay the price for the things you have done this general will be poo pood
>>
>>106773018
>you will atone for what you have done
what have I done? ;-;
>>
>>106773018
if you want to beat the singular schizo anon you need to do this now for 3+ years and hope he stops meanwhile
>>
>>106772827
is SDNext any good? been mostly using invokeai ever since I went from 3090 -> 7900xtx and now I'm back on Nvidia
gave up on Wan2GP when I tried it because it wasn't really set up well for bleeding edge blackwell cards or some shit
>>
>>106772945
Sora especially seems overfitted to shit, a lot of the dialogue, music and animation cuts for the anime gens are pretty much 1:1 from the actual anime. It will be a miracle if this model survives the year lol
>>
File: IMG_4672.jpg (1.43 MB, 1179x1203)
>it
is
>never
gonna
>fuckim
STOP
>>
>>106773018
>>106772969
>>106772917
Honestly I'm more worried that you're not using local models, because your gens aren't disgusting
>>
File: 1744446769075101.png (704 KB, 695x1114)
>>106773041
>It will be a miracle if this model survives the year lol
it will, Trump will protect him (unironically), and the Anthropic lawsuit showed that using copyrighted dataset to train their model is "fair use" (as long as they buy the data instead of pirating it)
>>
>>106773051
kek, this, at least this schizo has some creative images
>>
>>106772945
Crazy how only SaaS models use copyrighted shit. You'd think by now that at least one smaller open source company that wants to disrupt and compete would do the same.

>so good it'll be able to recreate 1:1 the dataset

I disagree with that though. Unless I prompt for "Pirates of the Caribbean" it should not in any way mimic the sounds. That just means their model is overcooked or prompted wrong for those movies.
>>
>>106773060
I mean sure it will be fine here but other countries will definitely try to ban the model. I can't see most animation studios in Japan or Nintendo doing nothing about it.
>>
File: AniStudio-00020.jpg (363 KB, 768x1280)
>>106773051
Oh, so he wanted love after all. Did you see how he stopped spamming?
>>
File: ComfyUI_temp_snhsx_00066_.png (1.98 MB, 1024x1280)
>>
>>106773086
>other countries will definitely try ban the model.
good luck with that, Japan is gonna be nuked again I guess lol
>>
File: 00097-2680183150.png (1.66 MB, 1024x1024)
>>
>>106773091
>>106772940
status?
>>
>>106773086
Studios can seethe all they want ZOG is so entrenched they got them to import jeets by the shipload after resisting for so long. Owari da, Godzilla has fallen.
>>
>>106773074
>Crazy how only SaaS models use copyrighted shit.
that infuriates me, the local models we have have no concepts and the SAASkeks can prompt fucking "spongebob fighting against Homura, WWE style", that's so unfair man :(
https://files.catbox.moe/55wi6g.mp4
>>
File: 1737546303855304.png (1.27 MB, 1152x896)
>>106772935
>>106772938
thanks for the advice. yeah, the 5B is pointlessly bad.

WAN T2I QUESTION:
Have wanchads noticed any differences between 2.1 and 2.2 for t2i? I really don't care about video
>>
>>106773111
based, fuck copyright, that's an outdated concept, I swear to god if the "progressive" want to ban copyright I'll vote democrat and deal with trannies twerking in the white house for 4 years like on the Biden era, if that's the price to pay I'll all for it
>>
File: AniStudio-00021.jpg (494 KB, 1152x1920)
>>106773108
6gb Vram here!
>>
>>106773116
closedAI can do it because they're protected by the tribe.
>>
But what they can't do is train new concepts into the model which is where local excels.
>but they dont need to
New concepts and images are produced everyday. Only a local model that can be trained can keep up.
>>
File: 00030-334737389.png (1.98 MB, 1024x1280)
>>
>>106773144
it's funny because Midjourney does the same and the CEO is also part of the tribe, I think I'm noticing something
>>
>>106773136
oh nice gens tho
>>
>>106773060
Naa this time they are also egging on the music industry like that tweet said. Those guys are notoriously litigious. I doubt they can prove they bought that music
>>
File: 1750881189479804.mp4 (1.66 MB, 796x480)
now that qwen edit and wan 2.2 are working smoothly, time to mess with wan animate.
>>
>>106773156
Idk, udio and suno seem to be fine even though it's obvious they used copyrighted music to get this quality
>>
>>106773157
oh god that's so terrible lol
>>
>>106773165
Yeah but I think they are in the gray area where it doesn't create something 1:1 like Sora does.
>>
File: 1744711292017406.mp4 (1.82 MB, 796x480)
>>106773157
worked much better with a cropped face.
>>
>>106773173
desu if there's a lawsuit, I want OpenAI to win, so that the companies making local models will stop being terrified of copyright
>>
What's the best model for generating femboys that aren't too cartoony
>>
File: ComfyUI_temp_snhsx_00080_.png (1.95 MB, 1024x1280)
>>
>>106773157
you try the newer version kijai put out? i haven't used it since he first put out the wananimateprocess repo, looks like he's done a little bit of work since then.
>>
>>106773193
Chroma is best for all photoreal stuff.
>>
>>106773128
I don't have much of a comparative T2I perspective. I'm guessing it's somewhat different since it is somewhat different for video.
>>
>>106773210
yeah i've been mostly doing qwen edit and wan 2.2 stuff, need to see what's new for animate.
>>
>>106772896
use fast fp16. sage does fuckall on chroma
>>
File: QwenEdit_00053_.png (942 KB, 1336x776)
Bonjour, Monsieur Kong.
>>
>>106772940
>>106773108
https://github.com/FizzleDorf/AniStudio?tab=readme-ov-file#features
>>
>>106773220
honestly not sure if it made much of a difference since i was mainly messing around with other settings.
looks like he added some new sampler or some shit too, haven't messed with that one.
>>
>>106773136
is this actually anistudio????
>>
File: tent_coil_1.webm (3.85 MB, 896x1160)
Made my first wan 2.2 lora and since I'm a degenerate it's a lora trained on tentacles coiling around women's breasts. Took forever to bake, having to bake on two models is hell.
>>
File: 00034-2761764272.png (1.92 MB, 1024x1280)
>>
>>106773257
theres a new workflow in the comfy template section, gonna try it out.
>>
>>106773048
>"I won't stop!"
>it stopped
oh man, those schizo hours ain't what they used to be :(
>>
>>106773220
>>106773257
https://github.com/kijai/ComfyUI-WanVideoWrapper/issues/1336
i don't use the low steps so i kinda just skimmed over it
>>
>>106773261
impressive, technology is amazing.
>>
File: AniStudio-00016.jpg (125 KB, 512x768)
>>106773171
We need to talk about this person /ldg/, why do you have a schizo trying to recruit people from other threads to your threads like jehovah's witnesses?
>>
>>106772326
Regardless of how I feel about oai/sora, I'll never understand cheering for the musical industry IP grift.
>>
>>106773281
just how new are you?
>>
>>106773285
>I'll never understand cheering for the musical industry IP grift.
cattle behavior, they hate having power somehow, or they hate OpenAI and will go for any excuse to hurt them maybe?
>>
>>106773261
yea the training time is ass, but it at least seems to have worked - congrats
>>
>>106773261
what was your setup? been building a dataset but clueless how to actually train
>>
>>106773281
This has been a /sdg/ thing for the past few years
>>
>>106773296
They have no idea what they're cheering for, at least the zoomies don't, since they weren't there in the pre-spotify days.
It's suicidal behavior.
>>
Is the implication that the entire thread is a collective hive mind that operates in lockstep
>>
File: 00036-1497493606.png (2.12 MB, 1024x1280)
>>
File: 1355139830646.png (178 KB, 500x500)
Does QIE have a way to "tile" edit anomalously big pics with noodle aspect ratio like >>>/d/11388389
>>
>>106773285
it's been a bizarre thing in social media, people cheering for companies with the most abusive copyright practices just because they have a hysteric hate of anything AI

>>106773344
>suicidal behavior
they would happily give up any right to fair use without understanding the actual consequence on their "fan arts"
>>
File: _you_.jpg (157 KB, 1024x768)
>>
>>106773285
>I'll never understand cheering for the musical industry IP grift
Such a "scene" will be nonexistent once models like Udio/Suno become the standard. It's already the future.
>>
File: 00107-2109742012.png (580 KB, 1024x1024)
>>
File: 00150-2868255923.jpg (885 KB, 1536x1536)
blong & sance
>>
File: 00038-3222694720.jpg (949 KB, 1920x1536)
>>
File: 1752048464353973.mp4 (865 KB, 640x640)
okay the new comfy wananimate template works, it also seemingly extends the clip 5s?

char aznable, but miku:
>>
File: ComfyUI_00014_.webm (314 KB, 720x720)
>>106773451
I'm not having much luck with it
>>
File: he looks like him though.png (1.36 MB, 1600x1200)
https://files.catbox.moe/wa06zi.mp4
the comedic timing is on point
>>
>>106773462
start the gen and stop, updates the points editor

add green points to the figure (head, body, limbs)

red dot outside their figure, then it should detect ok.
>>
>>106773392
Link to Hasan Piker Lora?
>>
>>106773472
>https://files.catbox.moe/wa06zi.mp4
lol
>>
>>106772897
it's good for text-to-image
not as much for video
>>
>>106773472
you can run sora local?
>>
>>106772897
>is it really that much worse than the big version?
it's terrible, 5b isn't enough to get quality videos, it is what it is
>>
>>106773498
yes
>>
the VRAM optimizer goat is back
https://github.com/comfyanonymous/ComfyUI/pull/10139
https://github.com/comfyanonymous/ComfyUI/pull/10141
>>
>>106773507
link? :)
>>
>>106773516
you want to be spoonfed on everything don't you?
>>
>>106773510
Test conditions:

768x768x13f WAN 2.1 VAE Encode using regular VAE encode (latent saved to file to terminate the flow)
NVIDIA GeForce GTX 1660 SUPER (6GB)

wow, wan on 6gb is quite a feat.
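
To put the 768x768x13f test case in perspective, here is a minimal sketch of the latent size that encode would produce. It assumes the commonly cited Wan 2.1 VAE compression factors (4x temporal, 8x spatial, 16 latent channels); those numbers are not stated in the post or the PR quote above, so treat them as assumptions. `wan_latent_shape` is a hypothetical helper name, not a ComfyUI function.

```python
def wan_latent_shape(frames: int, height: int, width: int,
                     t_stride: int = 4, s_stride: int = 8,
                     channels: int = 16) -> tuple[int, int, int, int]:
    """Estimate (channels, latent_frames, latent_h, latent_w).

    Assumes the first frame is encoded on its own and every following
    group of `t_stride` frames collapses into one latent frame, which is
    why valid frame counts look like 4n + 1 (13, 81, ...).
    """
    if (frames - 1) % t_stride != 0:
        raise ValueError("frame count should be of the form 4n + 1")
    if height % s_stride or width % s_stride:
        raise ValueError("height and width should be multiples of 8")
    return (channels,
            (frames - 1) // t_stride + 1,
            height // s_stride,
            width // s_stride)


print(wan_latent_shape(13, 768, 768))  # the test case quoted above
print(wan_latent_shape(81, 768, 768))  # a typical 81-frame clip
```

Under those assumptions the latent itself is tiny; the memory pressure in a VAE encode comes from the intermediate activations at full pixel resolution, which is presumably what the PRs are optimizing.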
>>
>>106773516
https://huggingface.co/openai/sora-v2-oss-pruned
>>
>>106773520
yes please
>>
>>106773526
>https://huggingface.co/openai/sora-v2-oss-pruned
thanks anon, but I'll wait for the GGUF, 400b is a bit too big for my gpu
>>
File: 1736494892627610.mp4 (727 KB, 640x640)
hell yeah, the new template after updating comfy for animate does in fact work.

the video was too tall and I didnt change the 640x640 but..it does work, set the green points on the first frame and it's good.
>>
>>106773526
>distilled
>safetymaxxed
Im good
>>
>>106773281
This only happens when the /sdg/ schizo loses control. He did this during the thread anniversary too because he was ass hurt that everyone migrated here. It's obvious he flew too close to the sun which is why he's been crashing out for hours now.
He will calm down once he is able to post good morning to his 3 friends but until then it's war in his strange little mind.
We've been dealing with this same pattern for years and there are zero signs of him stopping.
>>
File: 1758498127748600.mp4 (1.46 MB, 640x640)
>>106773537
and, by default it also extends it 5s (can just do 5s, but you can do as much as you like)
>>
File: ComfyUI_01611_.png (1.22 MB, 1280x704)
>>106773488
got a WF for that? I tried it for t2i and got garbage..
https://files.catbox.moe/1cty8j.png
>>
>>106773526
>404
:(
>>
>>106773526
and of course we got the distilled version... and look at that licence it's shit, good luck finding a suspiciously rich furry to finetune that
>>
>>106773541
it's not a real link anyways lmao
>>
File: NO WAY.png (248 KB, 437x330)
>>106773558
>>
>>106773147
>But what they can't do is train new concepts into the model which is where local excels.
they can, I don't know why you believe API companies don't use Loras, they do
>>
File: 1743827090315432.mp4 (620 KB, 544x640)
>>
https://files.catbox.moe/acnrhf.mp4
>>
https://files.catbox.moe/7aowv3.mp4
bruh, the YTP community will thrive again with this shit
>>
>>106773631
https://files.catbox.moe/3jmasp.mp4
>>
>>106772786
>>106772799
>>106772828
>>106772844
Alibaba, stop harassing this thread because no one is buying your api slop
>>
He spent a full day doing this spam and hurt himself in the process.
>>
>>106773645
>He spent a full day doing this spam
based
>>
File: RA_NBCM_00039.jpg (1.35 MB, 1872x2736)
>>
>it even knows watamote
that is insane
https://files.catbox.moe/8ol31c.mp4
>>
>>106773526
just tried this one out
>1374s/it
i read that wrong and got hyped for a moment...
anyone know when the api node is coming?
>>
>Nier Automata X Bobobo-bo bo-bobo crossover
lmaooo
https://files.catbox.moe/cq0sme.mp4
>>
>>106773537
good body, very bad face
>>
File: 00041-4213940353.png (1.75 MB, 1024x1280)
>>
>>106773698
if it started with the face in view it'd work well, like the char one worked very good.
>>
>>106773661
Thank you for maintaining the purification ritual, I'm worried EU won't be able to maintain it this go around.
>>
https://files.catbox.moe/d444c0.mp4
if he's not the father, then who is?
>>
>>106773667
They obviously uploaded entire anime seasons.
They probably censor post hoc any fan service or generally lewd things.
>>
>>106773510
impressive work by that magician!
>>
>>106773594
but YOU the user cannot which is the point i was trying to make
>>
footfag anon, if you're here, look at this lmao
https://files.catbox.moe/hsr66y.mp4
>>
>>106773747
>but YOU the user cannot
I remember some API services letting you create a lora if you give it some image training and then you can use that lora on the API model
>>
File: 00147-732327419.png (2.22 MB, 1240x1240)
Watching loneliness like this in real time is kind of sad not gonna lie.
>>
File: 00043-3399876799.png (3.83 MB, 1536x1920)
>>
>>106773688
>OpenAI: fully understands that high-quality data with accurate captions is essential for success
>Tencent: "jUsT StAcK oNe mOrE lAyEr BrO!!"
I think I overestimated the chinks, they aren't that smart
>>
>>106773812
Who cares they are going to steal from them again and they won't do shit because they stole first.
>>
>>106773549
i mean if it also generates videos that look exactly like this throughout the whole thing then something's wrong in general on your end
>>
File: 1732180572163556.jpg (799 KB, 2016x1152)
>>
>>106773812
ClosedAI has access to all the data. Chinks are forced to scrape. They aren't dumb, but they are lazy.
>>
I was called a faggot for saying synth data is bad to train on last year. Fuck you.
>>
File: 1746934865434844.mp4 (495 KB, 640x640)
need a diff miku source image but a good start:
>>
>>106773847
>ClosedAI has access to all the data.
how? it's google that owns youtube, not OpenAI
>>
>>106773846
holy based
>>
>>106773849
>I was called a faggot for saying synth data is bad to train on last year.
you were right, and you are still right, retards are gonna retard, it is what it is lol, just be happy you got the last laugh
>>
>>106773812
There are good Chinese AI researchers. See for example Deepseek. I think everywhere the demand for quality AI researchers way exceeds supply so a lot of companies are wasting huge amounts of time, money, and compute creating worthless crap. Even OpenAI isn't immune to it, see for example GPT 4.5.
>>
File: 00046-3814303884.png (3.54 MB, 1536x1920)
>>
>>106773849
I hope the faggot who instructed the Sana team to use synthetic data instead of danbooru rots in hell.
>>
>>106773852
Google has been selling that data to third party brokers for ages.
>>
File: 1739392794226828.mp4 (994 KB, 640x640)
>>106773851
diff image, gonna try a less toonish one next
>>
>>106773883
OpenAI admitted they used "publicly available data" for their training, they probably scraped that shit, I doubt Google is gonna help OpenAI, Google is their rival after all
https://www.youtube.com/shorts/M0QyOp7zqcY
>>
>>106773876
deepseek is kind of slop desu. it would tell you it was chatgpt trained by openai when asked originally. they clearly just did the same thing of scraping chatgpt outputs. just like how the next chinese video model will inevitably be trained on scraped sora 2 outputs.
>>
>>106773881
same, those faggots are setting the local AI space years back with their retarded lazy ideas
>>
>>106773876
Deepseek is only decent with LLMs, any multi modal attempts are garbage
>>
fascinating how all the spam did stop at once
>>
i would have more hope for local projects if chroma wasn't such a fucking monumental disaster. if it was a $10k project it would be understandable, but holy shit how can you spend $150k on this? and that was just on the smallest flux model. a finetune for something like qwen would likely cost millions.
>>
>>106773902
training on the outputs of another video model doesn't sound like it makes sense at all DESU
>>
File: 1745334324639230.mp4 (1.02 MB, 640x640)
fanart miku, works nice, I think animate is best with anime -> anime or realism -> realism though.
>>
>>106773927
>a finetune for something like qwen would likely cost millions.
yep, let's not forget that furry fag had to train on 512x512 and it took him 6 months, we'll never get that one guy savior that'll train a giant model by himself, it's just too expensive to do
>>
>>106773927
I mean Chroma is WAY more immediately usable by itself than either Pony or Illustrious 0.1 ever were, for example.
>>
File: tent_coil_3.webm (3.85 MB, 896x1160)
>>106773303
Not sure what specifics you need. I used diffusion pipe with the prodigy optimizer. It was 25 clips from 3d and anime videos. I had chatgpt write a script that would take a timestamp and extract the clip using ffmpeg and then convert it to 16 fps so they were exactly 81 frames long. I just hand wrote the descriptions instead of using any sort of VLM.

I'd recommend seeing if your trigger collides with anything already in the model. I originally used the word coil/coils/coiled but that already really strongly associates with the character's whole body getting wrapped in a snake or tentacle so I had to retrain the whole damn thing. I switched to using "spiral/spirals/spiraled" instead for much better effect. The actual config that's relevant for wan is in the docs/supported_models.md file in the diffusion-pipe repo.
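
For anyone who wants the clip extraction step spelled out, here is a minimal sketch of that kind of script. It is not the script from this post (that one was never shared); the function names and file paths are made up for illustration. It only uses standard ffmpeg options: `-ss` to seek, `-vf fps=16` to resample, `-frames:v 81` to cap the output length, and `-an` to drop audio.

```python
import subprocess

FPS = 16      # target frame rate mentioned above
FRAMES = 81   # target clip length in frames (81 / 16 is just over 5 seconds)


def build_ffmpeg_cmd(src: str, start: str, dst: str,
                     fps: int = FPS, frames: int = FRAMES) -> list[str]:
    """Build an ffmpeg command that cuts one fixed-length training clip.

    `start` is a timestamp ffmpeg understands, e.g. "00:01:23.5".
    """
    return [
        "ffmpeg", "-y",
        "-ss", start,               # seek to the timestamp in the input
        "-i", src,
        "-vf", f"fps={fps}",        # resample to the target frame rate
        "-frames:v", str(frames),   # stop after this many output frames
        "-an",                      # training clips don't need audio
        dst,
    ]


def extract_clip(src: str, start: str, dst: str) -> None:
    """Run ffmpeg; raises CalledProcessError if ffmpeg exits non-zero."""
    subprocess.run(build_ffmpeg_cmd(src, start, dst), check=True)


if __name__ == "__main__":
    # Hypothetical file names, just to show the call shape.
    print(" ".join(build_ffmpeg_cmd("source.mkv", "00:01:23.5", "clip_001.mp4")))
```

Note the clip only comes out at exactly 81 frames if the source has at least that much footage after the timestamp, so it's worth checking frame counts on the outputs before training.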
>>
>>106773902
My opinion is that even if they deliberately tuned it to match o1, a lot of companies would like to have done the same thing but mostly didn't pull it off.
But I do think that if the end of the year comes without v4, or if it comes and majorly disappoints, it will be fair to say their hype was overblown.
>>
>>106773940
If some day it can be Folding@Home style computed, maybe.
>>
>>106773902
they got away with LLMs because synthetic text isn't as bad as synthetic images/videos, dunno how to explain it properly but let's say that synthetic text is less likely to give you nonsensical shit, and it's easy to notice since they're only training it on objective question/responses like 2+2. for images/videos there's always nonsensical shit on AI renders, so all this does is make the model learn mistakes and believe that's how the world should be rendered
>>
>>106773924
He's going to change IPs and start again, he wants us all to suffer from his own retardation.
>>
>>106773940
noobai was the closest thing to a supercluster-level finetune. but they have decided to not continue because they don't feel any models right now are promising enough. and they're right. the base models we have right now are simply not well-rounded enough to finetune on.
>>
>>106773947
i see, mostly was asking in terms of setting up the repo, since i'm probably going to use runpod
>>
>>106773957
The uninformed way I describe it is that there are a million ways to render 1girl but only one way to answer 2+2
>>
>>106773960
Neta Lumina could be a monumental jump over illustrious
>>
>>106773960
nah the guy who made the model, LAX, got scooped up by a corpo. that's why its been basically radio silence from him
>>
>>106773940
it isn't the money, plenty of retards have money, it is intelligence. You have to understand how to actually acquire/prune/tag your dataset, setup the scripts, monitor and refine, etc. Retards pretend you can just "hire someone lol" but actually training a model is really really really hard.
>>
File: 1753625078955355.mp4 (770 KB, 640x640)
>>106773936
yeah. swapped the girl with an anri photo and as you can see, it's working better.
>>
>>106773940
I don't think the hardest part is the compute, the hardest thing to do in AI is having good quality data with the right annotations, this shit is a monstrous task, that's why I can't help but be impressed by what OpenAI is doing, they decided to tackle that shit instead of going for the easy path which is synthetic shit
>>
>>106773981
which one?
>>
>>106773960
>because they don't feel any models right now are promising enough
Was this explicitly stated in their discord or somewhere else? Can you post a screenshot?
>>
File: 1747148485513138.webm (2.79 MB, 850x850)
>>106773984
original:
>>
>>106773927
>Chroma is among the only local models that can stand tall among the influx of SaaSshit
>In fact it singlehandedly destroys every APIshit model due to their censored nature in just about any kind of NSFW prompt
>>
>>106773982
>>106773987
compute is easily the biggest limiting factor in local finetuning. if compute was unlimited we would have new experimental full-scale qwen finetunes every week.
>>
>>106773960
NoobAI ran out of money, what are you talking about kek. It's also bullshit that they couldn't do their own finetune of something like Lumina 2.0 or Cosmos-2 2B or even SD 3.5 Medium if they didn't want to go with a larger model, and actually had the money to train for as long as they needed.
>>
>>106773978
even though the newest version of yume is really good i think the fact that the base model sucks ass put many off. maybe they dont even know about yume.
>>
>>106774006
yes, but even if the compute wasn't an issue it would be slopped, the only way to get sovl is to annotate your billions of data pairs manually (or by using an almost perfect model captioner, which does not exist, or if it exists, only OpenAI has it lol)
>>
>>106774009
that doesn't really make sense, Neta Lumina 1.0 before the Yume guy picked it up was still better in every possible way than base Illustrious 0.1 without loras was
>>
OpenAI won.
(YOU) lost.
>>
File: 1728268470114032.jpg (752 KB, 2016x1152)
>>106774009
I hope yume won't become abandoned, shit's nuts. I think in certain aspects its prompt comprehension is better than flux.
>>
>>106774020
the only people to ever show care about a local dataset were local hobbyists. so again, compute is the limiting factor because if hobbyists had access to compute the dataset issue could easily be solved. nobody wants to make a gigantic community dataset project because the only people who would have the resources to train on it would be SaaS. it would amount to months of work just to get cucked in the end.
>>
>>106774006
lol no. Think about all the fucked up models that get released with major funding and massive dedicated compute. Retards will mouth off about "censoring" but that has very little to do with it. It's why I have insane respect for the noob guy and the animagine guy. Training a model that actually works is insanely hard.
>>
>>106774003
The BigASP guy said he might do his next training on top of Chroma, I bet it'd turn out pretty good given how much more of the conceptual knowledge is already there versus in Base SDXL
>>
>>106774047
just do what they do and release it under a hyperspecific license
>>
>>106774044
>no, I can't do that
censorship = big loss.
>>
>>106774045
Gemma 3's max context length is 16 times longer than T5's so that kind of makes sense
>>
>>106774047
>if hobbyists had access to compute the dataset issue could easily be solved.
how? you still need to hire thousands of slaves to annotate your shit properly and manually (no I don't want your gemini VLM saying that will smith is just a "black man"), I don't think you realize how much billions of data pairs actually is
https://time.com/6247678/openai-chatgpt-kenya-workers/
>>
>>106774061
You're not a gooner, are you?
>>
>>106774029
>was still better in every possible way than base Illustrious 0.1 without loras was
i recall hands and fine details being pointed at but i do agree with its initial (and current) potential
>>106774045
>I hope yume won't become abandoned
i swear i saw a recent post from the author talking about how he uses noob and doesnt really wish to continue with yume but maybe im bullshitting idr
>>
>>106774007
noob was training on illustrious, the way he trained wouldn't work on a base non anime model. We know he could (likely) make a new finetune on ill 2.0, but doing a full anime finetune on a base model is much much much harder.
>>
>>106774061
>censorship
local models are censored anon, (I'll quickly brush off the fact that most of the base models we have don't know nudity at all), and the fact that Flux or Qwen can only render Miku meant that they didn't allow other characters to be in the dataset, or if they did they removed the names in the caption, that's censorship, a more subtle censorship but censorship nonetheless
>>
>>106774069
you think censorship is only about tits? no. ask sora to have sam altman praising his jewish masters for a shekel. or sam altman grabbing a baby out of a pod cause he used a surrogate.
>>
>>106774051
name them. the only shit we get is disposable slop shat out by citationmaxxing research labs. not a single base model we received was actually made by a team that spent more than 1 hour prompting it. what i am saying is compute is the limiting factor when it comes to fixing up these shitty base models. SDXL is a terrible model but finetunes made it decently usable. chroma too might've been half-decent if he trained it above 512x but he didn't because again, even his $150k wasn't enough for compute. compute is THE limiting factor, there is simply no arguing around this because it's objectively the only thing preventing people from just schizobaking shit in their basement
>>
but pony v7 thobeit
>>
>>106774083
we get it, API models won't allow us to do edgy jokes (some are allowed though, like blackface)
https://files.catbox.moe/71d8no.mp4
but local models won't allow us to stack a lot of concepts (like Madoka x Watamote fighting each other on a WWE match, Jojo drawing style) since it doesn't know those concepts (and no, lora cope ain't it, you can't stack a lot of loras at the same time it quickly starts to shit itself, I want the concepts to be on the model)
https://files.catbox.moe/55wi6g.mp4
>>
Labs should unironically hire the best of the best prompters from the various AI threads (myself included obviously) so that we can tell them if their models suck or not since they cannot prompt a decent 1girl to save their life.
>>
File: 1753852811311076.mp4 (998 KB, 672x480)
>sora is the future of video gener-
>ACK!
>>
>localcope begins
dont worry bros we'll surely catch up in 6 months
>>
>>106774095
you are using words you do not understand and pretending technology works in a way it does not. Go back to arguing with people in your head, nowhere man
>>
File: 1732713925783583.mp4 (970 KB, 672x480)
take it, sam!
>>
>>106774110
I'm not sure if Sora is the future, but it definitely is the present, it has beaten Veo 3 for me
>>
>>106774116
>no argument
typical. compute remains the limiting factor, you can ask any finetuner this and they will tell you the same thing. you clearly have not worked on anything beyond a lora and it's painfully apparent
>>
>>106774137
>compute remains the limiting factor
it is one factor, but having compute doesn't mean you win, look at Tencent, they trained an 80b model with 5 billion images, yet no one wants to touch that shit because their dataset is so slopped, synthetic and lazy, that their model is just fucking ass
>>
>>106774129
>soundless
>slow motion
wan 2.2 slop
>>
>>106774148
I'm fine coping with no sound but the millions of slow mo videos makes me want to rip my eyeballs out.
>>
File: ComfyUI_00026_.webm (833 KB, 704x544)
anyone hazard a guess what I'm doing wrong
>>
>>106773982
ultimately, the only way for local to be saved is to get a dataset "leak", just imagine if Sora 2's dataset was leaked, it would be a revolution, I'm not kidding, creating a high quality dataset is always the hardest part when training a model
>>
>>106774162
in the point editor for animate, put several green dots on the character you want swapped (head, body, arms)
>>
>>106774162
the masking's goosed, check your points
any chunks of color, can't have too many
>>
>>106774148
>slow motion
to be fair, a slow motion punch is always funny to look at
https://www.youtube.com/shorts/bCuB7vskNBI
>>
>>106774147
i agree, but maybe what i am trying to say is that the people with compute are clueless, and the people with a clue lack compute/expertise. hiring a couple kenyans to caption images like oai did for dall-e 3 costs a fraction of the price of a supercluster required to train something like hunyuan 80b. but it's not like any of these shitty chinese labs will bother, and they barely speak english to begin with.
the only people who care about the actual quality of local models is the community of prompters who discuss them daily. and said community is practically penniless which is why finetunes take forever (novelai dropped one month after SD1.5, pony dropped 6 months after SDXL, chroma dropped 11 months after Flux). training takes longer and longer, experimentation becomes increasingly costly, and more and more corners need to be cut because it's just too damn expensive.
>>
>>106774179
imagine trying to leak a multi hundred TB dataset
>>
File: 1735734747844437.mp4 (3 MB, 672x480)
>>106774148
so just set the speed to 200%?
>>
File: 1737833531039417.png (153 KB, 1604x1242)
>>106774205
holy shit you're right, goddam!
>>
>>106774205
This. Data's not even stored on their servers. It's on several layers of encrypted servers at discrete locations worldwide.
>>
>>106773927
why not hope for anistudio?
>>
File: RA_NBCM_00042.jpg (797 KB, 1872x2736)
>>
>>106774209
From one local bro to another, you should bump it up another hundred percent or so.
>>
File: 1735855507884881.mp4 (999 KB, 672x480)
I can abuse scam altman with wan

I can't do that with sora

wan wins.
>>
>>106774205
>imagine trying to leak a multi hundred TB dataset
yeah, that sounds unrealistic as fuck, so far only anything v3, llama1 and Miqu 70b got leaked, those are fairly small in size relative to a dataset
>>
Why arent there more leaks of lesser-tier models? I can’t think of a notable leak since novelai, yet there are hundreds of api providers. they cant all have good security
>>
>>106774251
>hundreds of api providers
A vast majority of them run open models and simply charge for ease of use and compute.
>>
>>106774243
if you think Sam doesn't get humiliated on Sora 2 you are not ready at all lol (that's what he wants actually, he has enough self-deprecation to let people pay to throw tomatoes at him, genius marketing)
https://files.catbox.moe/6p1rbn.mp4
https://files.catbox.moe/cuoc8e.mp4
>>
File: 1746531375235879.mp4 (768 KB, 672x480)
24fps from 16, interpolated to 48 (from 32) is decent:
>>
>>106774271
>just a punch
boooring, Sora 2 lets you shoot people!
https://files.catbox.moe/nr3fk0.mp4
>>
>>106774251
>they cant all have good security
turns out that they all do, they're not dumb, they know their company is dead if their golden goose is leaked
>>
File: 1749217675895491.mp4 (660 KB, 672x480)
score, home run!
>>
>>106774201
Based on your post you think training a model is just compute, tagging, and pressing 'run', which is a huge fucking joke. Training a model, or even just doing a finetune, is much more complex than training a lora. The idea that devs at major AI labs aren't talking about AI all day everyday and trying to make the best model they can is comical.
>>
File: Tencent employee.png (59 KB, 275x183)
>>106774312
>b-but I thought training a giant model on only synthetic data was da waeeeee!!!
>>
>>106774312
> The idea that devs at major AI labs aren't talking about AI all day everyday and trying to make the best model they can is comical.
yet you keep repeating that slopped datasets are the problem. if they're trying to make the best model they can, why do they continuously fuck up the dataset again and again?
>>
>>106774312
yes by filling it with synthetic data that looks like complete garbage
>>
>>106774271
>>106774292
can you add one punch man punching sam though? of course you can't, Wan doesn't know anime characters lol
https://files.catbox.moe/n37cop.mp4
>>
>>106774312
theres only a single anon who posts here thats trained a model from scratch
>>
>>106774371
Must be rough not being able to post in your thread lmao
>>
>>106774388
>*local cope noises*
that's what I thought
>>
>>106774392
You spent 12 hours so far seething at this thread to the point you have to ban evade by changing your IP on multiple occasions
Who hurt you?
>>
SaaS fags are uppity
>>
>>106774292
>no blood
>no face deformation
damn this model is so fucking boring
>>
cloudkeks need not apply
>>
>>106774399
>if you make posts I don't like you're the legendary single schizo poster debo
that again?
>>
I think one great effect Sora 2 can have in the industry (and by extension, local) is by forcing labs to stop doing the slop "cinematic" aesthetic in every fucking model. Normies are loving Sora 2 for how organic ("real") and unslopped it is. I hope this sends a strong message to competitors who fine-tune on slop, Sora 2 essentially looks like a pretrained model where the footage resembles raw youtube videos (as it should be).
>>
>>106774399
>seething at this thread
What a strange way to say “acknowledge the weaknesses of local models.” Sorry if not everyone subscribes to this cult that pretends we are close to the API fags.
>>
they get one good model and pretend like theyve been kings the whole time lel
>>
>>106774432
Don't worry, they'll start training on Sora 2 outputs. Only the slop meme ones though, to look good in promotional materials.
>>
>>106774432
Shame it (hypothetically) takes a wave of normies to get them to pay attention to something anons have been shouting since forever.
>>
>>106774432
yeah, that's the most impressive part for me, the videos look real, they don't have that classic film lighting slop you see in marvel movies, they just look like random ass videos some random filmed with a random camera, you can see how real it is when you compare a real twitcher and its Sora 2 equivalent side by side
https://files.catbox.moe/o77vah.mp4
>>
File: debogged.png (941 KB, 1344x776)
Activate George Droid. Push fent reactors to 105%. Remove all safeties from ze Narcan exchangeur.
>>
>>106774355
>>106774356
>>106774338
you are just betraying the fact you think training a lora is == to training a model lol. Several papers have come out showing how removing shit data from training actually makes it worse. The reason models look slopped is because of the training method, but because your only reference is slop merges and loras that is your only way to understand this shit.
>>
>>106774442
OpenAI has always been the precursor of things
>they got ahead of the curve with chatgpt and gpt 4
>they showed how powerful edit models can be with 4o
>now Sora 2 (desu that one is less impressive per se since Veo 3 exists and was the first one having sound but it's still the best model so far)
>>
>>106774442
this. localkeks only ever had one good model, and that was SDXL. it's been years and they still think someone is coming to save them
>>
>>106774479
retard
>>
>>106774476
>The reason models look slopped is because of the training method
nah, it's because chinks are lazy as fuck and can't help themselves and put a shit ton of synthetic data in there, they do that because it has been proven that using synthetic data increases the mememarks scores, it's the case for LLMs, it's also the case for image/video models
>>
peak localcopey hours. i think you all need a timeout, why not try seedream with comfyui API?
>>
>>106774476
kill yourself
>>
>>106774476
this is bullshit and you know it, are you a tencent employee per chance?
>>
>>106774476
weird how all the shitty models are trained with generated data then
>>
>>106774483
>put a shit ton of synthetic data in there
>citation needed
not even saying you are wrong, but there is a lot of real world runs showing having gold mixed with shit works better than just having gold.
>>
>>106774499
>but there is a lot of real world runs showing having gold mixed with shit works better than just having gold.
>
>
>citation needed
>>
>>106774494
weird how no richfag has trained a good model aside from noob guy. No dis to lodestone.
>>
>>106774432
>Normies are loving Sora 2 for how organic ("real") and unslopped it is.
you don't have to be a normie to appreciate how real it is, only weird fucks would prefer slopped renders over something that genuinely looks real, Chroma got that hype because it's one of the few models that could render humans with skin that doesn't look plastic
>>
>>106774476
Retards train LoRAs on synth data all the time and they suck ass what are you on about
>>
I wonder how many rupees Sam pays the indians to shill on 4chan.
>>
>>106774442
This autist has been doing this since the anniversary, he started with the new wan model, then this. It's kind of bizarre how he's unable to grasp that other people can single him out after doing the same exact thing with zero deviation. He's ban evading to do this too which makes it even more pathetic.
>>
>>106774503
>https://arxiv.org/html/2505.04741v1
you won't understand a word but I am sure chatgpt can summarize it for you
>>
>>106774519
>>106774519
>>106774519
>>106774519
>>106774519
>>
>>106774521
>In large language model (LLM)
You are retarded beyond comprehension
>>
File: 1755594254243302.png (62 KB, 842x270)
>>106774521
this is about """""toxic"""" unfiltered data, not about low quality synthetic slop in image gen you absolute monkey Dunning-Kruger
>>
File: 1739265933455434.mp4 (828 KB, 480x672)
>>
>>106773275
even debo has to sleep sometime
>>
>>106773261
>>106773947
Neat. Nice twirling shish kebab. How many epochs did you end up training?
>>
>https://aaxwaz.github.io/Ovi/
>https://github.com/character-ai/Ovi

Breadcrumbs for local that want a Sora 2 at home.
>>
>>106772670
why does flux krea have that plastic look?


