[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106793003

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: 1755319364434425.png (984 KB, 1248x832)
984 KB
984 KB PNG
replace the text "HOLY FREAKING CRAP!!!" with "Oh...okay, I guess...". Replace the text "POSTING IN A STICKY!!!" with "something happened...". The anime girl has her arms at her side and is looking at the camera and looks bored.
>>
File: 1737142077876736.png (808 KB, 1080x1080)
808 KB
808 KB PNG
>>106795304
OH MAIH GAHHH
>>
Blessed thread of frenship
>>
File: ComfyUI_0005.png (1.84 MB, 1296x1728)
1.84 MB
1.84 MB PNG
>>
File: 1732899637282067.png (892 KB, 960x1080)
892 KB
892 KB PNG
the anime girl is pointing silver pistols to the left and right, in opposite directions.
>>
>>106795267
Is there some way to check if the model you’re using is an eps-pred model?
>>
are there any vibevoice implementations with ram offloading?
>>
>>106795414
Not that I'm aware desu. Usually just quanted.
>>
>>106795321
back in the day I got banned for the suggestion of cameltoe.
>>
File: WanVideo2_2_I2V_00468.webm (961 KB, 768x1312)
961 KB
961 KB WEBM
>>
File: image 11 Large.jpg (202 KB, 1280x768)
202 KB
202 KB JPG
I love genning sloppa and I’m tired of pretending otherwise.
>>
>>106795805
don't try to pretend that loving slop is a rare occurance, it's not, it's the norm, that's why HunyuanSlop 3.0 is 1st on the leaderboard >>106795208
>>
>>106795675
shame, existing wrappers have terrible ram management
>>
>>106795825
Yeah I'm not sure how easy it is to implement either considering it's not a traditional image model.

>>106795820
Fucking hell. Bets on them having bots voting their images when they pop up in the a/b test?
>>
File: image 13.jpg (880 KB, 2560x1536)
880 KB
880 KB JPG
>>106795805
For me, it’s the forbidden robot love
>>
File: WanVideo2_2_I2V_00469.webm (1.28 MB, 768x1312)
1.28 MB
1.28 MB WEBM
>>
File: 1738214509491745.png (2.49 MB, 1882x1246)
2.49 MB
2.49 MB PNG
https://arxiv.org/pdf/2510.02315
wen comfyui?
>>
>gens loops suddenly smooth as fuck with minimal color shift

?????
>>
>>106795996
>ctrl+f flux
>35 hits
Didn't even read.
>>
>>106796015
Nobody's going to pick qwen as the training model nigga
>>
>>106796015
qwen is a flow matching model so it'll work for it as well
>>
>>106796022
I never mentioned qwen. Stop fighting with phantoms.
>>
>>106796030
then what are you complaining about?
>>
>>106796036
Flux
>>
what other model would you pick as a testbed retard? everything else is too big
>>
>>106796053
Flux flux flux flux It's always flux and always will be fuck flux. There I said it. Boycott flux.
>>
This was included in a wan 2.2 i2v workflow, but I get this error. What gives? It downloads the model but I don't know where.
>>
File: 1754373129466223.png (31 KB, 282x310)
31 KB
31 KB PNG
>>106796062
based
>>
>>106796073
idk. Just disable those nodes. idk why those would even be in the i2v workflow in the first place. Someone was clearly doing acid when they added them.
>>
>>106796086
What are some alternatives? What are they even called, reading an image and adding prompts?
>>
>>106796146
>What are some alternatives
Just type the prompt yourself. Why the fuck is the Florence node even there? It's bizarre.
>>
File: 1754292948732458.jpg (1.78 MB, 1248x1824)
1.78 MB
1.78 MB JPG
>>
nsfw: https://litter.catbox.moe/gdnw8reaounyg5zn.mp4

Comparison between regular wan 2.2 q8 with nsfw loras on the left and on the right is the smoothmix 2.2 without loras.

Has anyone done sfw comparisons?
>>
File: ComfyUI_19292.png (3.5 MB, 1200x1800)
3.5 MB
3.5 MB PNG
>>106796062
I like Flux, it's very... malleable.

>>106796214
The speed there reminds me of the old dial-up days. Very nostalgic!
>>
File: cropping.jpg (748 KB, 3247x1632)
748 KB
748 KB JPG
Does this benefit lora training for NL models or I shouldn't bother? I only see tags there.
>>
Where is AniStudio general? I don't see it in the catalog
>>
>>106796437
Ani studio general is pending Softbank funding for management.
>>
>SD1.5 cyber realistic
>excellent photorealism out of the box
>celebrities, styles prompted for easily
>actually decent prompt adherence, in that it would gen what you asked for, even if imperfectly
>gens in seconds, hires fix included
>gacha for non-deformed anatomy

>chroma
>"next gen" prompt adherence
>can do sex
>gens take well over a minute with hiresfix
>gacha for actual, non-slop photorealism
>still might get bad anatomy

they call this progress?
>>
>>106796594
>they call this progress?
vu will own nothing vu will be happy
>>
>SD1.5 cyber realistic
>excellent photorealism out of the box
>celebrities, styles prompted for easily
>actually decent prompt adherence, in that it would gen what you asked for, even if imperfectly
>gens in seconds, hires fix included
>gacha for non-deformed anatomy

>chroma
>"next gen" prompt adherence
>can do sex
>gens take well over a minute with hiresfix
>gacha for actual, non-slop photorealism
>still might get bad anatomy

they call this progress?
>>
The Sora videos floating around have demoralized me.
That is all.
>>
>>106796214
>smoothmix 2.2
What's this? Just a Wan 2.2 with nsfw lora baked in? Did you try with light2x?
>>
>>106796649
Yeah, I think so. It's overall quality is worse, but the motion is just unbeatable for nsfw.
I am using the speed lora on the high noise. It does not like the latest t2v 2.2 lora which works just fine on the q8 model.
>>
The poster of this post is asking for model recommendations that can create better backgrounds/landscapes than SDXL, while working within their 12GB VRAM limitation. They're considering whether Stable Diffusion 3.5 Large would be a good option.
>>
time to return to /sdg/ boys
>>
Why won't this accept the fp32 clip? fp16 is fine.
>>
yeah this thread it's trash, nothing new
>>
Have there been any Wan 2.2 speedups in the past month?
>>
when will they invent models that can moan
sora 2 doesnt count
>>
>>106796826
Because KJ is a retard who forces HIS way of scaling down everyone's throats.
>>
I have been out for a while.

Did the Mayli anon ever deliver? 6 onthes passed.
Where is the Mayli dump?

What is the current best to make celeb AI porn?
Did the CHroma ever get finetunes?
>>
Stop caring about celebs
>>
>>106796859
Fuck open source, jfc.
>>
File: 1752685757896998.mp4 (916 KB, 928x640)
916 KB
916 KB MP4
>>
So I'm taking a look at smooth mix. What is it exactly? Just LoRAs baked into the model? Like it does look smooth and contrasty. But what's making that happen?
>>
>/ldg/
>/sdg/
>/adt/
all identical please get your shit together
>>
>>106797142
That's is like saying India, Pakistan and Nepal are the same. But they are not.
>>
>>106797156
>india and pakistan not the same

But they are the same shithole. Two different hemorrhoids.
>>
>>106796594
> >actually decent prompt adherence, in that it would gen what you asked for, even if imperfectly
> 1girl, black hair, standing
> 1girl, blonde hair, standing
>>
>>106797156
ok Sukdeep
>>
After giving smoothmix a go I can definitely recommend it. Extremely idiotproof. Even for stuff that isn't strictly sex related. It's just animated for interestingly.
>>
>>106797338
it's fucking garbage, it has the nsfw lora baked in which makes every girl fucking HUMP everything
>but im a coomer and I only gen coom shit
good for you, but people might also want to do funny/softcore stuff and even work stuff (I do animated avatars for some of our ai agents with this crap) and it's fucking garbage
>>
>>106797352
>I do animated avatars for some of our ai agents with this crap
I don't give a shit about your fake scam job.
>>
>>106797364
and I dont give a shit about having a model with prebaked nsfw loras when I can just turn them on/off on demand.
it's fucking retarded, it's the same as using the AIO model that one guy made on HF phroot or something, WHY would you even do that? are you literally incapable of chaining loras? kys
>>
>>106797381
>Hey guys I like this model, you should try it too
>FUCK YOU KYS IT'S USELESS FOR MY SPECIFIC THING FUCK YOU KYS!

I found the thread schizo everyone.
>>
>>106797118
If you mean the WAN based model, yeah it looks like they basically added loras to the base model.
What i can't find is which lora, it's a bit annoying.
At least write what it can do now.
>>
I finished my first wan 2.2 14b i2v lora. 101 15-second 640x360 clips, 5.5 days on a 4090d 48GB. I used Misumi-tuner. I'd expected it to output both the high and low files based on my options, but only got 1 file. It does seem to work as a high-noise lora though.
https://huggingface.co/quarterturn/wan2.2-14b-i2v-city-the-animation
TL;DR i2v not recommended unless you're on an H100 DGX cluster.
>>
>>106797393
but you claimed you can even do non sex related stuff, it's just not true. you're probably the author of this shitmix anyway kys
>>
>>106797352
>girl fucking HUMP everything
Did he add everything at weight 1?
>>
>>106797404
>it's just not true
But it can?
You're insanely defensive about this. I'm blocking you.
>>
>>106797393
>Hey guys, waste your time with my shitty slopmerge
>>
>>106797118
loras are more flexible and results retain more characteristics of the base model they're applied to. finetunes permanently add, remove, or narrow traits of the base model, like dog breeds.
smooth mix is so any smooth brain can horny gen without needing to go find and test nsfw loras. it's probably great for firsttimers, but it's pointless if you already know what you're doing.
>>
>You're insanely defensive about this. I'm blocking you.
>>
>>106797423
I bet any lora mixing gen you make will be shit on all over by smoothmix in an a/b test.
>>
>>106797414
I can confirm, this is what it produced with the prompt 'the girl is laughing at the viewer' when I tried it yesterday when you started shilling your merge.
>>
>>106796214
left is better
>>
>>106797433
Your gens suck. Skill issue on your part.
>>
>>106797410
OK I checked on the civitai, and unless the videos are fake, it seems able to do perfectly sfw things.
Only things with women is that it makes their breasts bounce more, which is fine by me.
I'd keep both the mix and the base model anyway.

The issue I'm seeing is that a lot of videos are slow motion, which from the look of it is due to inserting the lightning loras to the model.
Why though, it would have been way better to give the choice on that.
>>
>>106797452
yes! now put that all together. it's a reduction of the base model for gpulets/vramlets.
base model - flexibility + low step lora + coom loras = "finetune"
>>
>>106796707
I tried it but got OOM on a 480p res with 16gb vram, so I guess I ain't running it unless someone do a quant of this
>>
>>106797433
use case for NetaLumina? Do you have to inpaint your gens? I can visualize this image in tags and give it to SDXL I and get identical output in 30 seconds.
>>
>>106796707
>I am using the speed lora on the high noise.
I'm pretty sure the speed LoRA is baked in to both models. There's no point using it.
>>
>>106797480
good for you. I just like the addition of boom prompting. I don't use detailers/inpainting
>>
>>106797478
It's a perfectly competent mix. I don't know why you're choosing this hill to die on.
>>
>>106797499
nice legs nigga
>>
>>106797437
It's a matter of taste, I prefer the lesser high quality too, but being able to go full goonrot is nice.

>>106797479
Well the model is almost 20gb.

>>106797498
Ah, that makes sense as to why it's much faster.
>>
>>106797509
>he doesnt like his 'loids 1 legged
skill issue
>>
>>106797478
There is a workflow shared in the page, he can just add the lightning loras as requirements so at least they can be tuned to minimize the forced slow motion effect.
Also the model is clearly labelled as mix and not finetune.
A finetuned WAN would be awesome, but it looks like no one has the money and curated dataset for that.
>>
File: 00000-514158711.png (2.42 MB, 1248x1824)
2.42 MB
2.42 MB PNG
>>
Is there any tool that lets you read an image and it tells you with how much accuracy does it have a booru tag?
>>
File: 00074-2704194836.png (2.36 MB, 1824x1248)
2.36 MB
2.36 MB PNG
>>106796594
i tried chroma on reforge2 with a 5090 and it takes 3.5 minutes to generate an image. I found the model overall pretty shitty and got too many gens with deformity issues and bad prompt adherence despite plug in loras.
>>
>>106797499
Ok so NetaLumina is good for +1 character gens Boomer prompting No inpaint after you generate
What base resolution? Did you try inpaint as an experiment? What about outpainting? Is Neta technically superior in architecture to SDXL? Which components do you know about? UNET? Does it not use CLIP?

If you have to choose between Neta and Chroma as the "next local anime model" could you compare them?
>>
>>106797567
I think the Chroma hardline believers have quietly sunk back to the festering rotten logs they crawled out of. You don't have to humor them anymore.
>>
>>106797533
Yes, your brain
>>
>>106797601
So AntiChromaSchizo(me) was right?
>>
>>106797608
You can't be him because I am (You)
>>
File: file.png (27 KB, 529x436)
27 KB
27 KB PNG
i don't even want to know what these retards are up to again.
honestly filtering out all common troll trigger words and all namefags has increased this shithole's appeal.
>>
>>106797567
How'd you get it working, what model should I use, I've got NeoForge open right now what do I need to download?
>>
>>106797644
first, you take a spoon and then shove it right up your ass.
>>
>>106797533
Your eyes, your brain, a sheet of paper, and a pencil. Lazy fuck.
>>
>>106797585
>what base resolution
I'm specifically using neta yume, a neta lumina finetune. At least he calls it that, but sometimes he says it's a different pretrain and for the upcoming v4 he's saying that it's basically a different pretrain again but I dont have the means to verify his claims.
as for resolution, you can check his official WF in HF, which is the resolution im using (1024x1536). From my tests it can also do landscape and square at the same 'megapixellage' without any major problems.
Never tested inpainting (I just reroll), never tested outpainting.
Neta uses a 16ch VAE, I think this alone should make it worthy of consideration. It also doesnt use clip, it uses a full fledged LLM (gemma by google), which is the reason why it's actually able to understand both tags and natural language at the same time. No dual clip crap too.
I think comparing neta with chroma is not right. Chroma has better realism, and seems to be tuned towards that type of content. I don't care about realism at all. I think Neta yume is better suited for anime for the reason that you can still booru prompt with it, which is something you cant do with chroma. It's also EASIER to run compared to chroma.
>>
File: WanVideo2_2_I2V_00472.webm (1.12 MB, 1248x704)
1.12 MB
1.12 MB WEBM
God smoothmix is so shit. Look at the way the woman on the left's as desperately craves cock, any cock, before she snaps to attention and starts walking. Shit model.
>>
>>106797638
what the hell
>>
>>106797673
but this image just doesn't look good. no neta image posted here has looked good.
>>
>>106797686
thats what happens when you shitmix 40 loras overfit on pornography
>>
>>106797686
I don't see the issue.
>>
>>106797686
here is a little helper :
- when sfw use base wan
- when softcore or hardcore use loras OR use the mix
>>
>>106797673
Thanks for your answers
>>
fyi, this general is gettin astroturfed by that shitty wan slopmix

no idea what their endgoal is but apparently using loras is too hard for their widdle little brains so they have to use a wan """finetune""" with all loras built in which rapes your output
>>
>>106797724
i found it funny someone was in here the other day desperately claiming it was a finetune and NOT a shitmix
>>
people use models I don't like AND software I don't like! what's next, they gen characters I don"t like??? fuck this!
>>
smoothmix haters giving out autistic manual cars are actually better than automatic car vibes. When literally everyone but creeps drive automatics.
>>
>>106797758
cool simile bro
too bad your automatic car is constantly in reverse
>>
File: WanVideo2_2_I2V_00473.png (1.28 MB, 1248x704)
1.28 MB
1.28 MB PNG
Look at this absolute cock crazed harpy walking through the village. She'd take anyone. Even a fucking owl. I prompted her to be a good girl. Not this tramp.

This mix has been nothing but a disaster.
>>
>>106797773
Oops, posted the .png teehee.
>>
>>106797773
do you really think anyone is actually retarded enough to download it based on your "oh nooo it gens porn haha but i totally didn't mean to haha. please clap"

die.
>>106797783
double die to death right now.
unexist.
>>
smoothmix author seething REAL bad rn
>>
This place really is just a hive of schizos these days. Ani general soon?
>>
>>106797798
where do you think you are
>>
>>106797783
her breasts bounce way too much, original wan should have made them bricks so as to not incite incredible lust from anons only wanting sfw
>>
>>106797783
why does the owl lean in for a kiss
she's seducing the birds
>>
>>106797758
actual schizo nonsense
>>
>>106797783
Sexo but the bg is too static, shifting trees don't help.
>>
>>106797825
Shifting trees are probably because I upscale the latent in-between desu. I usually cheat a minute or two or gen time that way.
>>
>>106797853
>I upscale the latent in-between
??
>>
i finally understand. this is the actual containment thread and not adg
>>
It's been a week since t2v lightning v2 got released, no news about i2v yet...
>>
>>106797853
share the secret sauce workflow
>>
>>106797866
I gen at 832x480 then between samplers I upscale the latent by 1.5x.
Just a hacky little thing I do. I don't recommend desu because it can create artifacts.
>>
Wan 2.2 vace when.
>>
>>106797880
Are you using native nodes?
>>
This one honestly did exactly what I asked it so no complaints here.
>>
>>106797880
ok so high noise : 832x480 then low noise x1.5?
never thought it would be viable
>>
>>106797903
Hold on let me clean up the workflow and I'll box it.
>>106797887
KJ. I hope that's not a dealbreaker.
>>
>>106797878
i mean, is it really needed if the t2v lora works fine with i2v?
>>
File: WanVideo2_2_I2V_00475.webm (1.49 MB, 1248x704)
1.49 MB
1.49 MB WEBM
https://files.catbox.moe/llkbt5.png

There's the workflow.
But yeah. Smoothmix is a bit horny. Like >>106797783
I told it to pet the owl, not kiss it. But it also looks really good? So, idk. I worth a try in my opinion. I don't know why the schizo hates it so much.
>>
Why did no one mention theres a v2 of wan 2.2 animate?
>>
>>106798004
I guess we all got tired of making Donald Trump dance. Is it good?
>>
>>106797986
It didn't for me, it looked really bad.
>>
>>106797398
5.5 days damn
>>
>>106798000
thanks anon, will check once home
>>
>>106798008
oh wait I was using i2v. it already exists?
https://huggingface.co/Kijai/WanVideo_comfy/tree/main/LoRAs/Wan22-Lightning/old
>>
is there a way, for i2v, to let a node maintain the aspect ratio of the image and just let me change the one height/width so I can skip figuring out the needed value when swapping between 480/720p?
>>
>>106797915
>KJ
How do you deal with kijai's loader's autistic resolution requirements?
>>
>>106798046
By making sure images are resized to the multiples. Idk why he can't just fix that without the autistic nodes.
>>
>>106798029
those are the original ones, hence being in the "old" directory.
>>
File: xa.mp4 (1.97 MB, 1056x1856)
1.97 MB
1.97 MB MP4
>>106798000
smoothmix is fine. yea of course it's horny.
>>
>>106798029
Yes, but the team made new t2v last week and promised i2v too.
It fixed a few things, especially that shitty forced slow motion.
>>
>>106798043
KJ's resize image node (there's others too). Set it to resize image (not crop). Then you can for example put 0 in the height and 720 in the width and it will scale the height accordingly. You can also set both to 0 then use an integer node to quickly set the value.
>>
>>106798057
Ghost maid
>>
File: 1746004095499467.png (29 KB, 452x467)
29 KB
29 KB PNG
>>106798043
Personally I use the advanced node picrel.
It basically locks in the same aspect ratio as the image, and resizes to the desired megapixel.
For example :
720p = 0.93Mpixels
480p = 0.41Mpixels
>>
File: xb.mp4 (1.64 MB, 1056x1856)
1.64 MB
1.64 MB MP4
>>106798076
i'm guessing it's video game type clipping because not enough porn was trained
>>
>>106798000
HunyuanVideoWrapper this node refuses to work for me. I'd love to try out videoenhancers. Are there any other ones?
>>
>>106798147
What's the deep lore behind this character. What does she does that warrants her living in a castle and having a touchy feely maid while also dressing like a tramp?
>>
>>106798158
Not that I'm aware of. I'm not even sure it's doing anything desu. You can probably leave it off.
>>
>>106797686
image made with ai?
>>
File: WanVideo2_2_I2V_00581.webm (3.61 MB, 704x1280)
3.61 MB
3.61 MB WEBM
saars i cant stop
>>
>>106798071
That worked nicely, thanks.

>>106798091
Working in megapixels seems like an extra step and restrictive, doesn't it?

>>106798171
I'm really missing proper upscaling from working solely with images before.
I'm getting blurry results, I'll have to trial and error some settings.

The latent upscale goes very fast, tried it on my own workflow and it gets stuck at double decode, so I guess it's not compatible.
>>
>>106798315
>Working in megapixels seems like an extra step and restrictive, doesn't it?
Not really, it automatically picks the resolution and aspect ratio I want, it's very "fire and forget".
>>
why was sora getting shilled and spammed here but not wan2.5? really makes the ol' noggin rattle
>>
File: radiance.png (2.86 MB, 848x1488)
2.86 MB
2.86 MB PNG
>>106798159
perhaps a mansion of a family or an individual powerful femdom domme with a lot of staff that is largely hired for entertainment including the sexual kind

or a vampire with a maid sidekick that keeps up a strict and somewhat restrained appearance but really is an extremely horny and needy gf

better video models yet and it'll get very horny
>>
File: radiance.png (2.35 MB, 848x1488)
2.35 MB
2.35 MB PNG
>>106798306
no need to stop
>>
>>106798364
lines AND blocks? bro is radiance just never going to work?
>>
File: file.png (18 KB, 1018x248)
18 KB
18 KB PNG
its over for comfyui
>>
>>106798380
it's not going to work but it's not pixel space's fault
>>
>>106798380
What does lines and blocks mean?
>>
>>106798429
are you like super blind?
like, do you genuinely not see it?

>>106798404
a surprise to literally no one.
that 17m was just the start of the datamining.
>>
>>106798404
the jew needs to know what you're genning
>>
>>106798444
I’m not the guy who posted it, but it looks totally fine to me. I’m not trolling like what are you seeing? I wanna know.
>>
>>106798404
ohhhh onnononononoo
>>
>>106798404
And you guys are fucking idiots, all day licking this idiot's and Ani's balls.
>>
File: radiance.png (2.79 MB, 848x1488)
2.79 MB
2.79 MB PNG
>>106798380
i think it's largely working tho
>>
>>106798467
if you zoom in like crazy you will see lines
>>
>>106798555
Oh damn, OK yeah I see it, like right on the titties. Looks like Banding or something
>>
File: ComfyUI_0070-leiji.png (2.07 MB, 1296x1728)
2.07 MB
2.07 MB PNG
>>
>>106798404
what alternatives are there? anything just using code/command line tools?
I'm gonna be honest I look at those diagrams people sometimes post with five hundred arrows between different bubbles, it just looks like something a marketing or sales roastie would have come up with. just screams loq IQ.
>>
>>106798612
Very nice style, what model is it?
>>
File: AniStudio.png (17 KB, 277x362)
17 KB
17 KB PNG
>>106798404
Message to AniCHADS!
NEW UPDATE!
HE IS COOKING!
>>
>>106798623
every model can be used with python / cli directly. look at their hf pages.
>>
>>106798623
>what alternatives are there?
SDNext
Swarm
NeoForge
InvokeAI
>>
>>106798404
Each gen has a unique ID, has had them for a while. Did all of you not notice?
lol.
>>
>>106798755
Forge chads wya?
>>
surely you have proofs
>>
File: dmmg_0118.png (1.32 MB, 832x1216)
1.32 MB
1.32 MB PNG
are controlnets the only way to avoid bodyhorror via prompting? a woman lying down with her legs up should not be a complicated concept
>>
File: ComfyUI_0078.png (2.42 MB, 1296x1728)
2.42 MB
2.42 MB PNG
>>106798640
https://civitai.com/models/1916505/waijfu
Plus a few loras at different stages of base gen vs hiresfix
>>
ani will fuck up comfy so bad lmao
>>
>literalwho schizos are on
see you tomorrow bros!
>>
File: 1730328686174870.mp4 (1.92 MB, 720x1248)
1.92 MB
1.92 MB MP4
>>106798526
>>
>>106798817
LMAO is this chroma?
>>
>debo trying to throw ani under the bus after the warning
Great friend you are
>>
File: AnimateDiff_00001.mp4 (3.21 MB, 480x832)
3.21 MB
3.21 MB MP4
tested out wan 2.2 animate
>>
>>106798817
not impossible for a gymnast
>>
>>106798881
dude that's not possible, look where the feet and head are facing
>>
Is he unable to realize how easy he is to point out after years of the same pattern?
We already pointed out a copycat a few days ago you can't even give us that you retarded piece of shit.
>>
>>106798888
yeah looking a second time, I can see the issue
>>
File: dmmg_0076.png (1.42 MB, 832x1216)
1.42 MB
1.42 MB PNG
>>106798871
flux, which is generally fine as long as no one is upside down or sideways
>>
so I was looking into ai-toolkit and how to prepare a dataset and shit... is there like a newbie guide I can look at?
>>
Let me guess, can't latent upscale with this sampler? I get stuck on 'double encode' and it just chugs forever.
>>
>>106798965
you must have this much ram to generate ->
>>
>>106798973
I have another 64gb arriving wednesday, guess I'll wait till then. But it doesn't even work for 480p.
>>
>>106798940
What model?
>>
>>106798992
I'd be interested in flux/chroma and qwen
>>
>>106798306
base image made with res_2s bong chroma?
>>
File: Chroma_00005_.jpg (545 KB, 1296x1656)
545 KB
545 KB JPG
>>
>degenerate fetish furfag makes a new account to upload the same shit on civitai

Reported acc for hatespeech.
>>
>>106799041
can you drop this prompt i would like to compare it through flux
>>
File: ChromaAmateurphoto_00008_.jpg (640 KB, 1296x1656)
640 KB
640 KB JPG
>>
>>106798879
Girl name if real?
>>
>>106799048
If you hate the guy give us a reason not comment into the void like some pussy. We don't want to hear some random void speak when it's clear you have no one else to talk to about this.
>>
File: 1759432067749514.jpg (95 KB, 900x1200)
95 KB
95 KB JPG
>>106799122
>>106798879
should be matsumoto nanami
>>
>>106799188
Ah the real tits aren’t as huge and squishy looking as the video. Still nice tho. That’s the magic of ai, “make the tits bigger!’
>>
>>106799206
Get her pregnant and you'll have the same thing
>>
File: ChromaAmateurphoto_00009_.jpg (621 KB, 1296x1656)
621 KB
621 KB JPG
>>106799114
>a photo of large breasted hottie, drunk 22yo heavy metal chick who's having good time. North European, Nordic, possibly Swedish or Finnish. She was photographed outdoors, outside music festival during summer. Standing pose, upper body visible. Wearing small tight fit tubetop that barely covers exaggerating her busty body. Her wavy platinum blonde hair rests on her shoulders. She is looking at the viewer in skeptical manner her left eyebrow raised and grinning while holding a red beer can that says: "Karjala". Lighting is natural, and the focus is on her face and cleavage. She wears light makeup with some moody eyeshadow which highlights her blue eyes.
>>
>>106799229
What I like most is how she doesn’t have a perfect face, like real people
>>
>>106799256
tits are kinda comical tho
>>
File: 1742661132358378.jpg (660 KB, 1248x1824)
660 KB
660 KB JPG
>>
>>106799263
Lmao what sad place do you live at where that is comical to you?
>>
File: 1746494487768845w.jpg (203 KB, 1536x2048)
203 KB
203 KB JPG
>>106799188
>>106799206
Don't know who it is, just picked a random image in my downloads. What that guy posted looks about right. Picrel.
>>
File: dmmg_0024.png (1.44 MB, 832x1216)
1.44 MB
1.44 MB PNG
>>106799229
ty anon

i think yours looks way more "amateur" and has a better understanding of what regular person looks like
>>
File: image 16.jpg (793 KB, 2560x1536)
793 KB
793 KB JPG
>>106795920
Huh, thread is slow today, robot girls before bed, robot girls for lunch lel.
>>
File: 1747386223985292.png (1.44 MB, 1360x768)
1.44 MB
1.44 MB PNG
https://gofile.io/d/sfoxub

remove_clothes_qwen_edit_2509_v2_000005250.safetensors

if you still need it, cause civitai/github can't host it cause "durr it can potentially make lewds"
>>
>>106799341
Is it a lora?
can't qwen edit already do that by default?
>>
>>106799358
it can, but this makes more detailed images when you do lewds.
>>
File: 645.jpg (324 KB, 768x768)
324 KB
324 KB JPG
>>
File: 3623.jpg (232 KB, 768x768)
232 KB
232 KB JPG
>>
File: ChromaAmateurphoto_00011_.jpg (641 KB, 1296x1656)
641 KB
641 KB JPG
>>106799320
np np
>>
File: 7754674.jpg (262 KB, 768x768)
262 KB
262 KB JPG
>>
File: 76575675.jpg (260 KB, 768x768)
260 KB
260 KB JPG
>>
File: ChromaAmateurphoto_00020_.jpg (648 KB, 1296x1656)
648 KB
648 KB JPG
>>
File: WanVideo2_2_I2V_00580.webm (3.63 MB, 704x1280)
3.63 MB
3.63 MB WEBM
>>106799034
base image is an sdxl checkpoint
>>
>>106799387
nice scifi book cover vibes
>>
>>106799436
>>106799508
>>106799562
WTF is this? why noboady told me that there were a squad of rotating anons giving atention to the thread schizo so he doesnt go out of his thread?
>>
>>106799570
thanks! the model it's https://perchance.org/ai-text-to-image-generator , it's free!
>>
>>106799581
that's right anon, it's all a conspiracy!
>>
>>106799589
oh a random prompt thing
>>
File: dmmg_0042.png (1.5 MB, 832x1216)
1.5 MB
1.5 MB PNG
>>106799411
is this using any chroma loras? i might have to actually give it a try
>>
File: Untitled.jpg (281 KB, 768x768)
281 KB
281 KB JPG
>>106799611
yeah but for the consistency at low resolution I think that is a SD3.5 model
>>
>>106799581
>only one guy loved sora 2 and it's the legendary singular schizo
at least debo has some taste lol, those were funni ones
>>
>ran is seething again
>>
ran is another schizo?
>>
>>106799589
>>>/g/de3
>>
The pastebin existed for 2 years now and he has yet to be able to find a lie and will actively avoid talking about it despite begging us to make one after years of being a pest.
That story is done do not bring that up here please.
>>
>>106799624
Yeah I'm using lora. There should be similar stuff for Flux, amateur photo stuff
>>
>nobody
>ran
He really broke you
>>
File: dmmg_0045.png (1.55 MB, 832x1216)
1.55 MB
1.55 MB PNG
>>106799662
that makes a lot of sense. will investigate.
>>
I know the story of debo, now please tell me de lore of Ran
>>
>>106799736
*MrCatJak
>>
>>106798665
>2000 commits behind comfyui
>>
File: radiance.png (2.64 MB, 848x1488)
2.64 MB
2.64 MB PNG
>>106798870
cute
>>
>search /adg/
>no results
who bakes the next ani studio thread?
>>
>>106799830
/adg/ used to be "API diffusion general", but after the 4chan hack it died lol
>>
*yawn*
>>
>>106798347
>why was sora getting shilled and spammed here but not wan2.5? really makes the ol' noggin rattle
Sora 2 is actually a SOTA model way ahead of the rest and can make a shit ton of characters to play with, wan 2.5 is none of those, are you retarded or something?
https://files.catbox.moe/98xufa.mp4
>>
>>106797381
>>106797393
Phr00ts all in slop is great (depending on which version, 1, 4 and 9 are best)

>provides wild unhinged movement and wobble from 2.1
>provides some of the consistency from 2.2
>doesnt have to stop generating to switch between high and low samplers like 2.2 (2.2 adds like an additional 2 minutes between high then low)

Best of both worlds. You may proceed to seethe.
>>
>>106799946
>https://files.catbox.moe/98xufa.mp4
How am I supposed to live my life now, knowing that I will probably never have this marvel locally?
>>
>rolling forcing waiting room
https://github.com/TencentARC/RollingForcing
>>
>>106800011
By using local models for what the proprietary ones are afraid of - making porn and depictions of real people. It's also educational and fun, gets you involved in the space.
>>
>>106800069
>depictions of real people
sora 2 can do that
https://files.catbox.moe/xln1xs.mp4
>>
>>106800033
>he still believes in Tencent
dude, the only good thing they made was HunyuanVideo, and it was 1 year ago
>>
File: 1742289512799147.png (146 KB, 640x640)
146 KB
146 KB PNG
>>106800011
>Schopenhauer: "We seldom think of what we have, but always of what we lack. Therefore, rather than grateful, we are bitter."
Translation: you're fucked lol

>Epictetus (Stoic): "He is a wise man who does not grieve for the things which he has not, but rejoices for those which he has. Freedom is the only worthy goal in life. It is won by disregarding things that lie beyond our control."
Translation: since you can't control shit, don't worry too much about it

pick your poison
>>
>>106800011
local will catch up to this in a year or two
>>
He's upset again I guess community service anons wounded his pride
>>
>>
File: SAY THE LINE BART.png (39 KB, 1066x259)
39 KB
39 KB PNG
>>106800132
>He's upset
This, do not reply to his schizo meltdown!
>>
>update rocm
>first gen takes forever
what the fuck is rocm doing to take so long on the first gen in sdxl? It does this whenever you gen at a resolution it hasn't done before. after that it's fine. It's like it's compiling something, but what??
>>
>>106800168
comfyui bug
>>
>zero deviation from behavior
>everything burning around him
>still continues
How many years until he hits bedrock?
>>
>>106800184
>stop pretending, we all know who poopdickschizo is
>>
>>106800184
>How many years until he hits bedrock?
i think ran hit that a long time ago lil bro
>>
File: 1738180174141239.png (872 KB, 874x1035)
872 KB
872 KB PNG
https://huggingface.co/FabioSarracino/VibeVoice-Large-Q8

pretty good with the comfy template for vibevoice single speaker

https://voca.ro/15Lsaf2P7jE3
>>
>>106799117
>>106799229
>>106799320
>>106799411
>>106799624
Perkele. You lost the Winter war.
>>
whats the best checkpoint for non anime images that can do various races well?
>>
>>106800033
> nvidia exclusive
>>
>FETCH ComfyRegistry Data:
any way to disable this on start, I know how to check for new nodes
>>
>>106800080
>https://files.catbox.moe/xln1xs.mp4
can it do i2v with people i know irl?
if it can't then i don't care
>>
>>106800360
Surprisignly, there's not much i2v examples on the internet, maybe the t2v one is so good people don't care about i2v on sora
https://xcancel.com/tamami_smile_vt/status/1974805775613849971#m
>>
>>106800360
>>106800391
I think it's really mid on i2v, that's not what they focused on I guess
https://xcancel.com/vu0tran/status/1973959852386136084#m
>>
>>106800238
lmao, with the narrator from who killed captain alex:

https://voca.ro/1e6B09tEhfGl
>>
>>106800450
even better:

https://voca.ro/1nvCbPXeIi9c

I love how it can be emotive and actually copy an accent.
>>
>>106795268
>>
holy shit
if you actually know what you're doing inpainting is sooo good.
I was genning hundreds like a fucking monkey hoping for proper hands but with inpainting this shit was done in a few seconds.
I wasted so much money and time for basically the same genn...
>>
>>106800503
now try qwen image edit 2509 which is inpainting on steroids.
>>
before I go all in can anons who upgraded from a rtx 20 or worse to a 5090 tell me by how much image gening improved?
like how long did it took you to genn a high res image before and after?
>>
>>106800509
I tried regular Qwen Edit once with the instruction "remove extra fingers" and it responded by replacing both hands with empty plastic sockets on the wrists
>>
>>106800557
the new one is really good, it does stuff inpainting can't.
>>
>>106800539
that's like 8x faster vs a good 2080ti desktop or something on many models but of course not all models are the same

nvidia gpu basically have the AI pricing
>>
>>106800539
its worth it
>>
>>106800539
The 5090 is so much faster especially for inference.
>>
>>106800539
comfy is full of spyware, ani makes it easy for Rocm retards, neoforge is the latest dead horse on life support. as for models qwen image/edit are the current top tier but lack variation and takes forever. people just cope with either illustrious, chroma or lumina and those don't need as much VRAM but having a better card to make them go faster is usually a good idea
>>
>>106800577 (cont'd)
also don't forget that many nationalities will care that the power cost per gen is like 2-4x lower
>>
>>106800033
https://www.youtube.com/watch?list=TLGG6Lg6eujbUsEwNTEwMjAyNQ&time_continue=89&v=Qa6nhbiEvME&embeds_referring_euri=https%3A%2F%2Fself-forcing-plus-plus.github.io%2F&source_ve_path=MTM5MTE3LDEzOTExNywyMzg1MQ
this is ass wtf, and it's supposed to be a cherry picked video
>>
File: triton.jpg (77 KB, 1664x885)
77 KB
77 KB JPG
trying to get started with video gen using a workflow an anon shared. sorry for the tourist questions. how can I solve this? I'm on a virtual environment install atm. google is telling me triton is a linux thing?
>>
too many 1girls ITT
>>
>>106798347
because sora 2 is the new king. there's no way china is going, to beat the us this year. they can only release something for local, if they want some glory. kek
>>
>>106800577
>8x faster
holy shit
in that case it's really worth it.
>>
File: I like this quote tbh.png (303 KB, 700x441)
303 KB
303 KB PNG
>>106800674
China is just too lazy to make a quality dataset, it is what it is, you want glory? then caption your data proprely or go home
>>
File: lol.png (152 KB, 708x800)
152 KB
152 KB PNG
>>106798347
>Guys, why is the good model so highly praised and the mediocre model less so? It's strange, isn't it?
>>
>>106800620
You need it, I don't use windows to gen but check here : https://github.com/woct0rdho/triton-windows
>>
>>106799830
discussion about it here was never not allowed
>>
File: ComfyUI_02072_.png (1.73 MB, 1024x1024)
1.73 MB
1.73 MB PNG
>>
>>106800011
Just train a lora bro. /s
>>
File: 1733239277958140.jpg (666 KB, 2160x1177)
666 KB
666 KB JPG
https://www.reddit.com/r/StableDiffusion/comments/1nyz76z/prompt_adherence_comparison_between_two_hunyuan/
I still can't believe this slopped shit is 1# on the leaderboard, I lost faith in humanity
>>
For those wondering, yes it can do will smith eating shaghetti
https://files.catbox.moe/3e2gda.mp4
>>
>>106801007
That just looks like too few steps vs too many...
>>
>>106800238
worked for me, thanks

How come no "Load Audio" (upload) nodes can load .mp3? All I see is wav files listed. On Linux
>>
>>106801044
somebody call the smith estate and then it won't
>>
>>106801132
Will actually likes AI, he literally made a real video of him making fun of that meme (bottom)
https://www.tiktok.com/@crozzbonez0/video/7337606950290591022
>>
>>106799754
>implying a commits changing the version number is worth anything
>>
>>106801074
try this + workflow in workflow dir

https://github.com/wildminder/ComfyUI-VibeVoice

seems to work well even with 1.5b, just connect audio node to the middle node
>>
File: s.png (1.61 MB, 1024x1024)
1.61 MB
1.61 MB PNG
>>106801007
not too surprising tho

most corporations and nations can't really into art anyhow and it's not a matter of competence
>>
>>106801156
>>106801156
>>106801156
>>106801156
>>
>>106801074
>How come no "Load Audio" (upload) nodes can load .mp3?
>>106801157
It's me being retarded. There were NO .mp3 in that folder in the first place! LOL

I'll stick to https://github.com/Enemyx-net/VibeVoice-ComfyUI for the time being, bc it loads Q8 from this example >>106800238

Thank you regardless
>>
>>106797567
>>106799117
seems it gets non-generic faces really well
no idea why other models cant do it at all or without bunch of time-wasting/super good lora training



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.