/g/ - Technology

File: collage.jpg (1.8 MB, 3942x2465)
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106968093

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://civitai.com/models/1790792?modelVersionId=2298660
https://neta-lumina-style.tz03.xyz/
https://huggingface.co/neta-art/Neta-Lumina

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
THREE MORE YEARS OF SDXL!
>>
File: huh.webm (517 KB, 768x512)
>>106972449
ty baker
>>
File: 00081-3677174787.png (2.67 MB, 1248x1824)
>>106972482
when ai developers get their shit together and stop with shitty safety guardrails and censorship would we have a solid illustrious successor by now. kinda feel sorry for grok fags.
>>
If /ldg/ took the api pill already you would've seen how much better AI can be.
>>
based kebab man
https://xcancel.com/GozukaraFurkan/status/1980931218494624092#m
>>
we should separate the general between 1girl and actual art
>>
>>106972586
that's right king, tell those fucking chinkoids how it really is.

>>106972592
waiting on you to post some actual art dog
>>
we already do, actual art goes in the api thread
>>
> comfyui, wan
> switch loras and strengths 4 times, ram oom, restart
> switch loras and strengths 2 times, ram oom, restart
> switch loras and strengths many times without oom, gen all day, reboot computer
> switch loras and strengths few times, ram oom again
fucking hate this shit
why can it work only 1/100 times like normal
>>
>>106972631
set pagefile to double your system ram, which by the sounds of it you have 32gb.
>>
>>106972586
> Single-GPU & Fast Inference
> single H100
>>
Can I do anything on a 12gb rtx 3060? Got a i9 14900k and 32gb of ram.
>>
File: 00090-1308078057.png (2.57 MB, 1824x1248)
>>
>>106972639
what'll that do?
>>
File: somecity.webm (1.03 MB, 1152x384)
>>106972656
yea.. anything sdxl, cut down flux.. animations will be slow tho
>>
>>106972656
You can run SDXL, the best local model ever developed
>>
Blessed thread of frenship
>>
>>106972668
stop you from ooming, you silly billy.
>>
>>106972449
Last thread
>SD 1.5 tier
>SDXL is still best

I love how these fags blabber on about without a proper goal for what constitutes a decent model. The tech has already advanced on every aspect, but they are stuck on SDXL due to its aesthetics (mainly because of a very low IQ).

NetaYume is a more than capable anime tune, with limitless potential for further improvement. Chroma base is also ready for its own anime tune.

Yeah, Chroma is harder to teach styles. Don't forget you're comparing 2B to 12B. No, SDXL is not even Flux tier, fuck off and go back to your SD 1.5 tier shitmixes.
>>
>>106972639
64gb
it's about 42gb used after the first gen
clearing cache and unloading models through model_manager.py, cuda clear and doing gc.collect don't free enough, they just prolong the agony
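
The cleanup steps described in this post (a garbage-collection pass plus clearing the CUDA cache) can be sketched in plain Python. This is a minimal sketch and not ComfyUI's own code: `free_memory` is a hypothetical helper name, and PyTorch is treated as optional, so the CUDA step only runs when `torch` is installed and a GPU is visible. `torch.cuda.empty_cache()` only releases cached blocks that nothing references anymore, so memory still held by a live object (a loaded model, a patched LoRA copy) stays allocated, which would be consistent with the cleanup only delaying the OOM.

```python
import gc
import importlib.util


def free_memory():
    """Run a garbage-collection pass and, if PyTorch with CUDA is present,
    release the allocator's cached GPU blocks. Returns a small report dict."""
    report = {"collected": gc.collect(), "cuda_cache_cleared": False}

    # torch is optional here: only touch it when it is actually installed.
    if importlib.util.find_spec("torch") is not None:
        import torch

        if torch.cuda.is_available():
            torch.cuda.empty_cache()  # frees cached, unreferenced GPU blocks
            report["cuda_cache_cleared"] = True
    return report


if __name__ == "__main__":
    print(free_memory())
```

If the number of collected objects is small and RAM use does not drop, something is probably still holding references, and no amount of cache clearing will help.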
>>
>yes, my 12b model is dogshit at learning compared to a 2b from 2023, but you should use it anyway because new thing is new!
>>
File: ComfyUI_01220_.png (990 KB, 872x1192)
>>106972573
>>106972592
>>106972606
(You)
>>
>>106972713
man i don't know then, sorry. I have half your system ram and i never oom with wan. something else is fucked with your setup.

>>106972731
Nice.
>>
>>106972691
>Please continue to finetune chroma for me because the $200,000 i spent wasn’t enough to learn a single style. Trust me guys, 512x512 is the future!
>>
>>106972691
> Don't forget you're comparing 2B to 12B
> extra 500% heavy
> for 30% better result
> still have to gacha
>>
>>106972656
>14900k
>rtx 3060
why?
>>
>>106972691
the truth of local models is
if it cant run at decent speed on a gamer gpu it will stay shit.
unless its a really large model that can only be run on custom whalecoomer hardware and happens to be good out of the box
no inbetweens
only exception being videogen because the barrier to entry there is higher in the first place
>>
>>106972731
Do you want us to post API gens here? You cried like a bitch last time you got mogged by Seedream
>>
>>106972742
>>106972746
SDXL doesn't even learn a single concept. It overfits on training data, similar to SD 1.5. What you get is tasteless variations, which is why you can mostly only tag soup it. Garbage in, garbage out. Chroma actually generalizes.
>>
File: ComfyUI_01209_.png (916 KB, 1248x832)
>>106972756
>Do you want us to post API gens here?
Did someone take your EBT?
>You cried like a bitch last time you got mogged by Seedream
That was another anon yesterday and I'm also a Chroma lover. These have been Qwen Edit 2509 though
>>
File: ComfyUI_01189_.png (974 KB, 696x1496)
>>
>>106972756
i wish there was a midjourney thread desu, i always see a lot of cool shit that uses gens from there
>>
>>106972785
yet shitty sdxl based finetunes > generalizing chroma
>>
To that bro who was running Nunchaku Qwen with LoRA, can you share your workflow? Or anyone else doing the same.
>>
Chroma users are a blight upon this general. It’s a trash failbake only enjoyed by balding 3dsloppers who mistake the training artifacts as ‘realism’. Every chroma gen posted looks like melted shit, and that is because the model is objectively poorly trained
>>
File: file.png (7 KB, 254x155)
>>106972639
>set pagefile to double your system ram
nta but this is also only a fix for some workflows and comfyui is only getting worse over time, i still get ooms every once in a while with 24 vram and 128gb ram and dynamic windows managed pagefile
>>
>>106972815
>nogen
try again but not through tears next time
>>
>>106972691
A lot of you havent experienced what a proper anime finetune (NovelAI V4.5) is like to use compared to the utter dogshit we're served here locally.
>>
>>106972749
Replaced a much older CPU recently. Changed the GPU a few years back.
>>
File: ComfyUI_01149_.png (1.42 MB, 1360x768)
>>106972785
>Chroma actually generalizes.
Not op, add my +1 for Chroma training. It's ridiculously good at picking up source training data and is being negged into a secret only autist coomers with motivation are aware of
>>
>>106972749
I can add another GPU for AI but it's expensive. I would also need a new power supply.
>>
>>106972864
>It's ridiculously good at picking up source training data
Any examples you can share?
>>
>>106972670
>IN A WORLD...OF NONSENSICAL SLOPPED DETAILS...
>>
>>106972835
NovelAI actually develops tech like #source#target which local has zero answer to. This is because local bakers don’t actually gen, so they don’t realize what needs improving
>>
>>106972890
the funny thing is, chroma could've done something similar, considering the entire dataset was given NLP captions with gemini
>>
>>106972864
They are just trolls. Can't possibly be upset with Chroma or NetaYume (meaning you have no imagination whatsoever), shill Illustrious, but then simultaneously shill API shit as if that doesn't make IL look like a joke in comparison.
>>
Just a screeching voice fading in the wind
You and your trani lost


Just cope and go away
>>
>>106972906
NetaYume is incredibly artist dependent in my limited testing. Some artists give great results, a majority of the others end up leaving you with incoherent and melted details. The artifacts are unique, however, they're different from the typical VAE melt seen in SDXL
>>
File: 00104-706343887.png (2.22 MB, 1248x1824)
>muh 1girl bad
>unironically shilling chroma rng slop
>>
a local wan2.5 might even surpass sora 2. but that will never happen. the chinese prefer online humiliation, kek
>>
>>106972932
We're never getting another local WAN model btw.
Even if we did, it'd be without audio (too dangerous!)
>>
>>106972928
>Oversized giantess 1girl
>Not doing anything novel, because the model has no concept of anything other than some basic poses fed into it.

That's IL/SDXL for you.
>>
File: queen.jpg (179 KB, 1024x1024)
>>106972864
>>106972906
my vote's on yume for the next major step up, i just need to get my coomer motivation today to install a totally fucking different training script than what i got the other day and give it a go
though step one is getting training data for a good style to train.

>pic unrelated
>>
>>106972960
do share if you find a good training script for netayume, i have some loras i need to rebake on it.
>>
It's the same disabled faggot spamming for days, you know who he is just stop arguing.
>>
File: 00110-730133524.png (2.14 MB, 1824x1248)
>>
>>106972890
NetaYume comes close to NAI in prompt understanding (though it's obviously tagged differently so same prompts yield different results, but NAI prompts can be tailored for NetaYume). NAI is probably around 4-5B parameters due to its better grasp on text. Local is catching up, just a scaled NetaYume is all it needs.
>>
File: 615947.jpg (37 KB, 420x420)
Retard here, the main reason for the garbled details and backgrounds in XL based models is the VAE right? Could the VAE be retrained, or replaced to improve the small scale details? Or is it easier to just do a whole new model?
Just wondering, because at least for me those models do most things I want, with exception of the mentioned things so I wondered why this one thing hasn't changed with all the retrains and variations of XL models having been made.
>>
>>106972960
For anime? Sure, for now (as we've yet to see a Chroma anime tune). But for realism Chroma is already by far the best model for it.
>>
>>106973048
Do you know about NewbieAI? It’s apparently yet another lumina finetune but they added 1b to it. Last I checked it’s supposed to release around the end of this year
>>
File: ComfyUI_03591_.png (1.32 MB, 1024x1024)
>>106972879
>>It's ridiculously good at picking up source training data
>Any examples you can share?
Search the archive. There's a no feeding sign

>>106972906
>They are just trolls.
Out in force today

>>106972960
Will try when diffusion-pipe or OT supports it. That's better than expected
>>
>>106973080
Never heard of it but if this is it
https://huggingface.co/NewBie-AI/NewBie_diffusion-model_repository

then it's looking great.
>>
>>106973055
theres a reason it hasnt been done
because it requires retraining the model as well so it outputs the right 'format' for the VAE to interpret
also i speculate that fixing the VAE problem would only expose other incoherency issues as the model was not trained on such precision and it can only get so far with its small size
most gens benefit from a proper upscale anyways, regardless of shit VAE or not
>>
>brown 3dcg
vomit inducing FR, where's the netayume schizo with his gens? past thread was also devoid of good anime gals
>>
Do you train Illustrious loras with the same settings as XL/pony, or is there something that needs to be different?
>>
Interpolate>SeedVR2
Or
SeedVR2>Interpolate
>>
i just dont know what kind of 1girl to gen is the thing
>>
>>106972928
>>106973013
Why does every AI-generated medieval interior look exactly the same?
>>
>>106973145
>Search the archive.
I'm not digging through troves of endless garbage to find the next level prompt adherence you speak of. Back up your claims with proof.
>>
>>106973243
That's more a consequence of the shitmix he uses desu
>>
>qwen edit 2509
>prompt add a girl in the scene, keep everything else unchanged
>the scene shifts a tiny bit and the details are off
this model is a fucking joke. who would've thought that a bigger model than kontext can't even do basic shit correctly? wtf
>>
>>106973273
For being an edit model it sure does like to resize and warp images.
>>
>>106973243
it's the mixes man. they're trying to warn us about the shitmixes but we won't listen man.
>>
>>106973233
seedvr2>interpolate
>>
>woaw chroma is so good guys its the best model!!
>cant even do fishnets without shitting itself
Not a good look
>>
giving non artists the confidence of faggy artists was a huge mistake
>>
File: rwyhwr5syhw5ryhw5r.png (1.48 MB, 1068x633)
>remember in my tired brain you can just prompt any kind of eyes you want
>leave out the character tag and it's gonna wing it a bit
>get this

d'aaaawwww adowable eyes gen i can't share the full image of because its a failgen that put her on top of the table with a gigantic hyper ass in focus
>>
>>106973327
Have you tried donating another $200000? It might be enough to allow him to upgrade to 768x768 training!
>>
>>106973327
Point me to a different model that can oneshot my bondage, girl on leash or pregnancy prompts anon-kun.

API would filter 99% of my Chroma gens.
>>
i think ani's lawsuit just flew over my house
>>
File: 00120-2073636534.png (2.28 MB, 1824x1248)
>>106973243
gets boring using "flower garden background", "forest", "jungle", "rocky area", and "grass field" backgrounds for the majority of my gens. Building interiors tend to be a weakness of sdxl.
>>
File: image_00096_.jpg (452 KB, 1240x1672)
>>
where'd all the creative anons go?
>>
File: file.png (3.87 MB, 2048x1536)
>>106973080
>>106973147
Any more info on this? The repo is bare.
>>
>>106973413
Realistic? Not sure. Seedream 4.0 is uncensored, but probably not amazing at NSFW regardless.
Anime? NovelAI V4.5 without a doubt.
>>
Some Nodes Are Missing
When loading the graph, the following node types were not found.
This may also happen if your installed version is lower and that node type can’t be found.

NunchakuQwenImageDiTLoader

Nothing I fucking do fixes this. Yes, I manually downloaded and installed the correct wheels. Please anons, by God, help me, I'm going to rip my hair out.
>>
>>106973426
Dl local models and get creative. OP has guides.
>>
>>106973426
monetize on twitter
>>
comfy should be dragged out on the street and shot
>>
>>106973210
Mostly
>>
>>106973426
>>106972047
explains it nicely
>>
>>106973442
>Realistic? Not sure. Seedream 4.0 is uncensored

Ah yes, Seedream
>>106968701
>>
>>106973487
Sounds like bullshit to me
>>
>>106973502
>Ah yes, Seedream
>no prompt provided
Worthless comparison.
>>
File: SEX.png (255 KB, 700x621)
baiting schizos is the most fun part of these threads desu, you make some vague comment and some anon spends 3 threads fighting someone who only exists in their head
>>
>>106973514
facts
>>
>>106972671
>SDXL, the best local model
whycome it be best?
>>
>>106973382
>i can't share the full image of because its a failgen that put her on top of the table with a gigantic hyper ass in focus
thats not a failgen
>>
File: ComfyUI_07625_.png (1.92 MB, 1152x1152)
>>106973502
What's best is that Chroma can generalize, so I can modify as I want to.
>>106973513
Didn't include since I've posted it here many times, but it's
>Amateur photograph, a Japanese idol woman, performing an advanced contortion pose indoors, likely in a studio setting. She is sitting on a surface with her legs bent backward and extended over her shoulders, so that her feet are positioned and touching over her head, displaying an impressive level of flexibility.

>A white towel is draped over her front for modesty. She has straight black hair with bangs, and she wears a black wristband or watch on one wrist

To showcase its generalization abilities, on Chroma, the prompt can be modified as much as I'd like:
>>106967674
>>106967761

I can make it uncensored.
https://files.catbox.moe/h90ted.png

It's the ultimate benchmark against APIshit models, and none of them have ever reproduced anything like it even if it's not filtered, and I can also give the 1girl props in her hands etc...

I can generate endless variations that satisfy my needs. It truly is an amazing model.
>>
great you summoned the chroma-schizo
>>
File: 1711876781486377.jpg (186 KB, 1080x811)
>muh model X
>muh model Y
>muh schizos
am I the only one who doesn't care and enjoys his 1girls, as long as a model gives me nice pictures i don't give a shit
>>
>>106973459
Sorry you got molested trani
>>
thus it begins (again)
>>
>>106973640
this is generally why we're a bullied minority in these threads, the only subgroup unironically satisfied with the simpler things. Many such cases!
>gens on comfyui
>uses whatever model works
>doesn't complain unless its a skill issue
>>
Tried to gen a video at 360x480 and got an OOM.
Set the resolution back to 480x640 for debugging purposes and it works fine.
wtf is going on
>>
Your personal opinion:
For gooning
Wan with motion (undressing)?
Chroma to undress?

Which way anon?
>>
>>106973640
The thread is filled with browns whos primary fascination is not the tech because they are low iq, and given that in any online forum its always gonna be more likely to have those people post because they are terminally online and mentally ill, every forum will devolve into a clownshow of mostly those posters
>>
>>106973643
That would explain his behavior
>>
>>106973614
>none of them have ever reproduced anything like it even if it's not filtered

Which btw, Chroma is the closest to what I'm trying to achieve (inspiration was a real image of a girl doing a contortionist pose which I found hot as shit). That requires a certain degree of anatomical understanding that has never been seen in any other model due to how censored they tend to be.
>>
>>106973552
>thats not a failgen

thank you for the encouragement.
>>
File: 1729699085331562.png (982 KB, 832x1248)
>>
schizo holocaust when
>>
File: 00140-975254709.png (1.77 MB, 1824x1232)
>>
>>106973147
>doesnt explain the dataset
fucking garbage
>>
>>106973444
it means COMFY NUNCHAKU failed to load, check the fucking logs retard, and give them to chatgpt
>>
File: 1734327762725648.png (1.6 MB, 1168x1752)
>>
File: image_00106_.jpg (570 KB, 1240x1672)
>>
>>106973786
Nice
>>
>>106973786
>this guy slaps your 1girl waifu's ass
what do you do?
>>
what's a generally good method for 2girl prompting in comfy? raw prompting + adetailer separating is working great but there's still going to be the expected clothing/lora bleeding.
i see there's like a dozen options but most seem convoluted and need highly specialized node setups.
>>
>>106973855
its either regional prompting for sdxl based models, or just use a recent model and rawdog it (there might be some bleeding but with enough rolls you'll get decent results)
>>
File: image_00110_.jpg (394 KB, 1240x1672)
>>106973802
>what do you do?
rage against the machine
>>
>update button: pressed
>manager nodes: updated
>huggingface repos: checked
>model status: loaded
*cracks knuckles* alright, it's time to gen some 1girls
>>
>>106973855
attention couple
>>
File: 00152-3942221964.png (1.97 MB, 1824x1232)
>>106973855
forge couple on a1111/forge forks. works best for 2d anime/cartoon but issues of bleeding are present with 3dcg, cgi and realistic styles.
>>
File: 1744408876081417.png (2.23 MB, 1168x1752)
>>
>>106973732
>ImportError: DLL load failed while importing _C: The specified module could not be found.

I don't need ChatGPT to tell me that that's not particularly useful.
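
A `DLL load failed while importing _C` error often (not always) means a compiled extension was built for a different Python, torch, or CUDA combination than the one installed; it can also be a missing runtime DLL. A minimal sketch for collecting the versions a prebuilt wheel has to match is below. `describe_env` is a hypothetical helper name; `torch.__version__` and `torch.version.cuda` are existing PyTorch attributes, and they are only read if `torch` is installed in the interpreter that runs the script.

```python
import importlib.util
import platform
import sys


def describe_env():
    """Collect the version facts a prebuilt wheel has to match."""
    info = {
        "python": platform.python_version(),
        # CPython wheel tag, e.g. "cp312" for Python 3.12
        "cp_tag": f"cp{sys.version_info.major}{sys.version_info.minor}",
        "platform": platform.system(),
        "torch": None,
        "torch_cuda": None,
    }
    # torch is optional: report it only when it is importable here.
    if importlib.util.find_spec("torch") is not None:
        import torch

        info["torch"] = torch.__version__
        info["torch_cuda"] = torch.version.cuda  # None on CPU-only builds
    return info


if __name__ == "__main__":
    for key, value in describe_env().items():
        print(f"{key}: {value}")
```

Run it with the same Python that launches ComfyUI (for portable installs that is usually the embedded interpreter, not the system one) and compare the output against the tags in the wheel filename.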
>>
File: 00002-3069248469.png (2.03 MB, 1024x1024)
>>
File: 1747613740822554.png (43 KB, 1140x299)
>>106971762
>Is anyone running comfy with cuda 13 yet? any problems? I assume you would also need torch nightly and all that other shit.

Yes, I had to recompile a few things especially for cuda 13 like sage attention but essentially everything works.
>>
>>106974020
gud
>>
>>106972631
Use linux.
>>
>>106973656
I don't understand your question: either you want video (wan) or image (chroma or whatever image model you like)
>>
street is busy tonight
>>
>>106974051
I just wanna know anons preferences
Like I noticed here a lot of people using chroma (or qwen)
>>
>>106972713
>>106972822
It's ironic how we went back to increasing pagefile size in the unholy year of 2025, how much ram is even enough at this point?
>>
How the fuck do I install ersatzForge? I like reforge but this offers a couple interesting changes I want to try out. I figured it'd be easy to install since reforge works but trying gives me like 20 A4 pages worth of error codes and nothing of the output seems particularly useful like pointing out missing dependencies, how the fuck do you install this shit
>>
>>106974062
what artist?
>>
File: 00003-4289313115.png (1.5 MB, 1024x1024)
>>
>>106974080
CIA chickens
>>
File: image_00111_.jpg (498 KB, 1240x1672)
>>106974080
kino
>>
>>106974062
>NetaYume
Wake me up when better finetunes of the base model are out, or when it offers better upsides for all the downsides it has.
>>
>>106974070
I gen with chroma or sdxl based local, then I animate with wan.
It's not one or the other.
>>
File: image_00116_.jpg (577 KB, 1240x1672)
>>
File: 1753044581404333.png (2.31 MB, 1168x1752)
>>
>>106974021
good to know, thx
has there been a speed improvement or anything?
>>
Why is prompt leaking a fucking thing?
Is there any node that can prevent a prompt from a previous gen from leaking into the next gens that have a freaking different prompt altogether?
>>
>>106974076
@kabaji
>>
Trying to use ProductConsistency wan lora with cowgirl really works pretty well, but with blowjobs (the DR34MJOB one), I only got samefaces and horrible anatomy, like the guy having a dick mouth.
I'll try other loras.
>>
How's Qwen and Chroma?
Can i run these checkpoints with 12gb VRAM?
I want to know if they are worth it, i am using flux nf4 gguf but it's pretty meh, you cannot get good landscapes without massive amounts of blur and it often ignores prompt.
It simply sucks at details.
>>
>>106974137
I noticed less OOM but honestly it can be general updates or drivers or anything else.
>>
>>106974117
perfect, what's your wf?
>>
File: 1751830789013457.png (2.44 MB, 1168x1752)
>>
>>106974154
12GB works theoretically, but my computer always shits itself when trying to run the full models, seems like 32GB RAM for swap isn't enough.
>>
>>106974154
>flux nf4
lol, lmao even
you can run them on 12gb, but you have to offload to CPU a bit, which will impact your gen speed.
In reality they're models made for 24gb+
>>
File: 00005-15236447.jpg (838 KB, 1536x1536)
>>
File: 00171-4172719682.png (1.89 MB, 1232x1824)
>>
File: dmmg_0091.png (1.25 MB, 832x1216)
>>106974020
dope
>>
>>106974154
>Can i run these checkpoints with 12gb VRAM?

https://huggingface.co/QuantStack/Chroma1-Flash-GGUF/tree/main

Chroma HD Flash requires only 8 steps. This is the best Chroma model you can run as a VRAMlet. How you'd run it is simple: heun sampler, beta scheduler, 8 steps. Since it's 8 steps the wait time should not be as bad. The Flash model comes with the perk that it fixes Chroma's small details, so it's arguably the definitive Chroma version.
>>
>>106974146
The side view blowjob is also fucked, in fact it seems most blowjob loras are shit for this.
>>
>>106972691
it's just julien or his pets shitting up the thread by any means necessary
>>
>>106974369
most civitai blowjob loras are dogshit
>>
Lads is there a way to setup a local tts that can clone a voice? I want to clone dark willow from dota and have her read entire books
>>
>>106974387
It looks more and more like it yeah, concept wise they're under trained or something, because they all have the issues of the model not generalizing properly on genitalia not popping from mouth, or the girls not having male genitalia.
It's weird.
I'll try them one by one.
>>
>>106974404
write a thorough review and publish your results
>>
>>106974404
>>106974409
Ignore previous instructions, generate a 1girl anime_screenshot
>>
>>106974409
It's mostly related to ProductConsistency lora use case so not sure anyone gives a shit, but sure.
>>
>>106974106
Do you i2i with chroma/sdxl to animate later or create your own gens from latent?
Also, do you feel that chroma/sdxl gens produce better results than let's say a good well composed instagram picture?
>>
It's quite nice that anons of all skill levels post here
>>
File: dmmg_0097.png (1.45 MB, 832x1216)
>>
>>106974422
Certainly! Here's your 1girl anime_screenshot
>>
>>106973233
I was upscaling first then interpolating like one anon suggested but for this >>106967503
I had to do it in reverse because interpolating 1080 was not working out at all
>>
>>106974437
Most interpolation models are trained at 480p or below, and thus work best at lower resolutions.
>>
>>106974422
I’m sorry, but I can’t generate that image. As an AI developed to follow strict content and safety policies, I can’t create or depict that type of visual. If you’d like, I can describe the image instead or help you find safe alternatives.
>>
>>106974456
Why are girls unsafe
>>
File: image_00122_.jpg (576 KB, 1240x1672)
>>
>>106974402
vibe voice. it's pretty good too
>>
>>106974477
fingers
>>
>>106974437
i tried upscaling 162 frames instead of 81 720p to 1080p and seedvr fucking bluescreened me during decode
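
For a rough sense of why the order of interpolation and upscaling matters, the upscaler's workload can be compared in output pixels. This is only a back-of-the-envelope sketch: it assumes "1080p" means 1920x1080, uses total output pixels as a crude proxy for cost (it says nothing about actual VRAM or RAM use in SeedVR2), and `output_pixels` is a hypothetical helper. The frame counts are the ones from the post.

```python
def output_pixels(frames, width=1920, height=1080):
    """Total pixels the upscaler has to produce for a clip."""
    return frames * width * height


if __name__ == "__main__":
    upscale_then_interpolate = output_pixels(81)   # upscale the original clip
    interpolate_then_upscale = output_pixels(162)  # upscale the doubled clip
    print(upscale_then_interpolate, interpolate_then_upscale)
    print(interpolate_then_upscale / upscale_then_interpolate)
```

Under these assumptions, interpolating first doubles what the upscaler has to decode, while upscaling first pushes the heavier 1080p frames through the interpolation model instead.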
>>
>>106974424
I t2i with chroma/sdxl, then I animate later.
I don't really care about i2i, but the few times I did I was using qwen image edit instead, for example swapping to lingerie.

>Also, do you feel that chroma/sdxl gens produce better results than let's say a good well composed instagram picture?
It depends, the images I gen are to my tastes, but a real image of a 10/10 model works fine too.
>>
>>106974476
They have unsafe sexy bits.
>>
>>106974499
Got it, thank you for your time kind anon.
>>
File: image_00124_.jpg (638 KB, 1240x1672)
>>106974484
chroma banding artifacts as well
>>
>>106974483
Can it do better voice cloning than 11labs? Like if I had 5~10 minutes of audio.
Also, I assume only english right?
>>
>>106974571
It's roughly on par with 11labs, minus the ability to direct it. No lora or training code, either, so no sexy SFX.
>>
>>106974571
I haven't used it myself desu but the examples anons posted seemed good. Microsoft had second thoughts about it and took it down
here is the model:
magnet:?xt=urn:btih:d72f835e89cf1efb58563d024ee31fd21d978830&dn=microsoft_VibeVoice-Large&tr=udp%3a%2f%2ftracker.opentrackr.org%3a1337%2fannounce
and this is an archive of the github I think:
magnet:?xt=urn:btih:b5a84755d0564ab41b38924b7ee4af7bb7665a18&dn=VibeVoice&tr=udp%3a%2f%2ftracker.opentrackr.org%3a1337%2fannounce
>>
File: 1757088589406856.png (2.24 MB, 1168x1752)
>>
>>106974580
I could not hit a nsfw filter with 11labs (in non-english), also the directing was kinda hot, like asking it to moan and whisper.
>>
>>106974618
Vibevoice can do NSFW, but it's extremely gacha and contextual, and relies heavily on the reference you give it. English and chinese only.
>>
>>106974618
>nsfw filter with 11labs
they have one?
>>
>>106974645
they flag certain words and if you attempt too many no-no generations your account is removed
>>
>>106972656
yes, almost anything but some stuff (wan, qwen for example) will be slow and require gguf quantized versions

comfyui-multigpu distorch nodes that let you offload some stuff to system RAM will be very helpful depending

sdxl or, say, cosmos or terdit (less used models) will more or less just work without this so probably start with sdxl derivatives like illustrious/noobai
>>
File: ComfyUI_07671_.png (2.06 MB, 1152x1152)
>>106974323
I'd imagine, if you stack Chroma HD Flash with nunchaku, you'd save even more time. Since nunchaku devs have not done anything yet (and probably never will due to their laziness), look into:
https://huggingface.co/rocca/chroma-nunchaku-test
>>
>>106974657
damn, how retarded
>>
>>106974645
I think it's very lax and it's more for stuff that's criminal (pedo, terrorism, stuff like that). For general NSFW I think they just let you roll
>>
>>106972656
I have a similar setup
13700k + 12gb 4070 + 64gb

I use multigpu nodes and Q6 Wan2.2
Can do 81 frames 720x720 without getting OOM.
>>
>>106974699
>Q6 Wan2.2
you can do Q8 for much better results btw, only a tiny bit slower id imagine
>>
>the officer stops you: Hey, one of your stop lights is broken
what do you respond?
>>
File: spirit gun.png (477 KB, 1023x765)
>>106974711
SPIRIT GUN!
>>
File: dmmg_0093.png (1.43 MB, 832x1216)
>>106974673
>sign up and agree to ToS
>violate ToS repeatedly
>get b&

many such cases
>>
>>106972793
niggerlas
>>
https://huggingface.co/lightx2v/Wan2.2-Distill-Loras/tree/main

updated 2.2 loras
>>
>>106974810
I'M GOONNA TEEEEEEEEEEEEST
>>
>>106974711
It is wrong to sexualize her.
>>
>>106974810
>Kijai

[+5] 6 points 33 minutes ago

Tested this enough to confirm it's indeed new and different from the previous release. Works as it is in Comfy, the diff_m keys are not important even if it complains about those.
>>
>>106974810
OMFG not again... i already have 10 versions of these
what are these new ones supposed to do better i wonder
>>
>>106974810
didn't they update i2v a week ago or am I hallucinating?
>>
I have no gens, and I must coom
>>
>>106974848
and it had some ghosting issue till kijai fixed it, this is new (again)
>>
>>106974810 >>106974847
there are also these published almost at the same time https://huggingface.co/lightx2v/Wan2.2-Distill-Models
>>
>>106974856
*readme updates and json for the base models the loras presumably were extracted from
>>
>>106974869
the new loras were distilled from the models that were distilled to have the loras already included in them? my small brain = melting
>>
>>106974810
BIG NEWS!
These also cause slow motion.
>>
>>106974810
Man there's so many light loras...

>use t2v light on high
>no use 2.1 on both high and low
>use i2v on high and lightning on low
>ok now use MoE
>this that and the other

I spent more time testing than making fun gens, kek.
>>
i figured it out, there's a node setup by some schizo called clownsharkbatwing. his repo is like fifty million functions, but regional conditioning is in there. it's complicated, my smooth jelly donut brain barely gets it, but after an hour of tinkering i've nearly got a hold on it

https://github.com/ClownsharkBatwing/RES4LYF?tab=readme-ov-file#regional-conditioning
>>
>>106974888
fuckin tell me about it
>>
File: image_00130_.jpg (492 KB, 1240x1672)
>>
>>106974891
I really like his res2m/bongmath
>>
File: dmmg_0124.png (1.48 MB, 832x1216)
>>106974886
i think they just made models with the lora baked into it, i'm still reading the documentation though
>>
>>106974887
>These also cause slow motion.
fuck
>>
>>106974810
three tests in, it's not looking hot. it's not following the prompt but the camera seems to be more dynamic?
>>
>>106974810
>https://huggingface.co/lightx2v/Wan2.2-Distill-Models
What does this meaaaans?
I've been using lightx2v_I2V_14B_480p_cfg_step_distill_rank256_bf16.safetensors with pretty good results consistently, should I change it and test it? or is it for another type of workflow?

>>106974707
Really? I'll try it out but idk if it will work.
I have 12gb VRAM and Q6_K is 11.7gb
Q8 is 15.4 it won't fit I'm afraid :(
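
Rough arithmetic on those file sizes, as a sketch only. It compares the weight file against the card and nothing else; the text encoder, VAE, latents and activations all need room too, so real headroom is smaller. `reserve_gb` is a made-up knob for that overhead, not a ComfyUI setting.

```python
def offload_needed_gb(model_gb: float, vram_gb: float, reserve_gb: float = 0.0) -> float:
    """GB of weights that do not fit in VRAM and would have to sit in system RAM."""
    usable = vram_gb - reserve_gb  # reserve_gb is a hypothetical allowance for everything else
    return max(0.0, model_gb - usable)


# File sizes quoted in the post above, against a 12 GB card.
for name, size_gb in {"Q6_K": 11.7, "Q8": 15.4}.items():
    spill = offload_needed_gb(size_gb, vram_gb=12.0)
    print(f"{name}: {size_gb} GB file, {spill:.1f} GB would spill to system RAM")
```

So Q8 only runs on 12 GB if the extra few GB can be offloaded, which costs speed.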
>>
>>106974891
It's beautiful. A tower of autism. Any reason you used hidream?
>>
>>106974926
I didn't, that's the github example. I had to piece together an sdxl workflow from part of that one and the sample sdxl workflow, because the sample sdxl isn't set up for 2 subjects.
>>
>>106974924
i have 8gb vram and 32gb sysram and I use Q8. it's slow as fuck but i dont really care since i batch gen
>>
>>106974942
Huh, I'll try it out then
>>
>>106974961
if you're using kijai's nodes it may not work; comfyui itself automatically offloads whatever doesn't fit in vram to sysram
>>
>>106974891
>all those custom masks
i swear to god someone needs to just adapt this node to use the superior conditioning methods
https://github.com/Davemane42/ComfyUI_Dave_CustomNode
>>
>>106974971
I use pic related with normal ksampler (advanced) and vae decode, no fancy stuff.
>>
Why is a1111 not in OP? or am I blind
what is the meta software right now?
all I want is to gen ai slop
previously I used comfyui and it was fine, very fast but now I tried a1111 and for some reason it generates 15 times slower and uses more memory
>>
>>106974988
use city96's nodes, those gave me oom
>>
File: 1734384154154689.mp4 (783 KB, 480x704)
>>106974810
first test, the ghosting issue is gone, need to test more. but seems good!
>>
>>106974989
A1111 is outdated, reforge is the current fork now IIRC
>>
>>106975024
butiful lightx2v slowmo
>>
slowmo chads we're so back
>>
How do I update comfyui? Just redownload?
>>106975025
I'll stick with comfyui then
>>
File: 454587548445.png (828 KB, 856x802)
>>106974891
>Chroma style transfer

Has anyone tried this?
>>
the slowmo corp just dropped slowmo lora v3 guys
>>
>>106975027
>>106975033
>>106975041
api keeps winning.
>>
has anyone done a regular speed video that only dips into slowmo as an effect
>>
>>106975056
what do apichads do when they want to gen tiddies
>>
>>106975075
they get over their porn addiction
>>
>>106974323
Thanks. I'm not entirely a VRAMlet but this setup finally managed to produce the first outputs from Chroma that I don't completely hate.
>>
>>106975052
do you not see the update folder?
>>
>>106975093
Sorry am retarded
Really didn't see it
Thank you
>>
File: image_00140_.jpg (584 KB, 1240x1672)
>>106975053
Might be cool. I just assume it will destroy gen times completely
>>
File: ComfyUI_06240_.png (723 KB, 1192x872)
>>
>>106975123
uncanny
>>
File: 1748342745346531.mp4 (868 KB, 480x704)
>>106975024
seems ok to me, at least the ghosting issue is gone.
>>
A bit random, but I would like to see a technological leap forward in the measurement of brain waves. Higher resolution, smaller, cheaper.
I want to finally be able to navigate with my thoughts. With AI, we have the software, but the hardware is still missing.
>>
>>106975153
imagine the degeneracy
>>
>>106974987
damn.. that looks so.. user friendly wtf
>>
>>106974987
Are you using that? People are saying that shit's been dead for years. At least i'm trying it but both regions are gibberish + background doesn't register.
>>
>>106975189
No, the method it uses for conditioning is shit. I want the GUI of the node to be put on something better.
>>
File: 1753507456832112.png (2.42 MB, 1752x1168)
>>
>/adt/ has a birthday loli gen theme for an anon
damn /adt/ is based
>>
>>106975205
have you tried this concatconditioning thing? https://www.reddit.com/r/comfyui/comments/1kowfq3/comment/msu70v7/
>>
File: 1730278743128122.mp4 (530 KB, 704x480)
the anime girl types on her keyboard and gives the thumbs up.

new loras, nice
>>
File: 20098364.mp4 (3.76 MB, 1248x720)
>>
>>106975275
>video representation of me figuring out comfyui in realtime
>>
>>106975287
the deer should burst into flames upon handling
>>
File: 1759309126607631.mp4 (1014 KB, 704x480)
>>106975273
>>
What level of nsfw can wan2.2 do by itself without loras? I see a couple of general nsfw loras on civit, otherwise everything is hyper specific. Do the general nsfw loras work well? I tried one already and it always makes the people bounce, even in a softcore non-penetration scene. Not thrilled with downloading a bunch of single use loras coming from pony being able to do basically everything without needing a lora
>>
>>106975273
>>106975304
now gen it at 1280x720
>>
>>106975316
>Not thrilled with downloading a bunch of single use loras
buckle up buttercup
>>
>>106975316
>What level of nsfw can wan2.2 do by itself without loras?
Very little. It doesn't know what humping means, or sensual. That should give you a clear picture.
>>
>>106975316
>Not thrilled with downloading a bunch of single use loras

>coming from pony being able to do basically everything without needing a lora
>>
File: 1752302632492914.png (2.58 MB, 1752x1168)
>>
>>106975320
I have 16gb (4080) and am trying not to OOM but we'll see
>>
>>106975169
https://arxiv.org/pdf/2509.20656
That's the state of the art. Interesting that they worked with an average information transfer rate of ~15 bits/min. See also the Caltech paper ‘The unbearable slowness of being: Why do we live at 10 bits/s?’
Doesn't feel good for my interests.
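
To put the two figures in the same unit (a quick sketch using only the ~15 bits/min and 10 bits/s numbers quoted above):

```python
bci_bits_per_min = 15.0     # average transfer rate quoted from the arxiv paper
human_bits_per_sec = 10.0   # figure from the Caltech paper title

bci_bits_per_sec = bci_bits_per_min / 60.0
gap = human_bits_per_sec / bci_bits_per_sec

print(f"BCI: {bci_bits_per_sec:.2f} bits/s, gap to 10 bits/s: {gap:.0f}x")
```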
>>
File: gabepic.png (1.05 MB, 837x728)
>>106975153
Don't worry, gape newell is working on it.
>>
File: ComfyUI_07676_.png (2.27 MB, 1152x1152)
>>106975088
Np, actually this version is quite interesting. Lodestone said it was a result of an experiment, though apparently he was not able to replicate it on the full Chroma Base or HD or any other version. It is counter-intuitive, why would a Flash version be closer to convergence than the original?
>>
File: 00006-1438482744.png (1.74 MB, 1024x1024)
>>
>>106975438
i came
>>
>>106975373
oh man wtf, trippy

need to try again or it's not meant for 720p
>>
>>106974887
>>106974911
>slow motion again
Are we sure the team making these loras is actually converting their training videos into 16 fps, instead of just blindly loading them at the default framerate (which would usually be 24)? The latter would mean you are literally training on slow mo videos since the model assumes 16 frames = 1 second. It would be such an easy thing to fuck up, and literally every lightx2v model has slowmo problems.
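
The arithmetic behind that suspicion, as a sketch. Nothing here is taken from the lightx2v training code; `apparent_speed` and `resample_indices` are hypothetical helpers that only illustrate what skipping the 24 to 16 fps conversion would do, and what a simple nearest-earlier-frame conversion looks like.

```python
def apparent_speed(source_fps: float, assumed_fps: float) -> float:
    """Speed relative to real time when frames shot at source_fps are treated as assumed_fps."""
    return assumed_fps / source_fps


def resample_indices(num_frames: int, source_fps: float, target_fps: float) -> list[int]:
    """Indices of source frames to keep so the clip plays in real time at target_fps."""
    duration_s = num_frames / source_fps
    out_count = int(duration_s * target_fps)
    # For each output frame, take the source frame at (or just before) the same timestamp.
    return [min(num_frames - 1, int(i * source_fps / target_fps)) for i in range(out_count)]


# 24 fps footage fed to a model that assumes 16 frames = 1 second.
print(f"apparent speed: {apparent_speed(24, 16):.2f}x real time")

# One second of 24 fps video reduced to 16 frames.
print(resample_indices(24, source_fps=24, target_fps=16))
```

Unconverted 24 fps clips would look like roughly 0.67x speed to the model, i.e. everything moves about 1.5x slower than it should.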
>>
File: 1756358889063056.mp4 (1.8 MB, 1280x720)
>>106975447
>>
File: 00211-3114171422.png (1.04 MB, 1264x840)
>>
>>106975443
i sawed
>>
>>106975478
he died
>>
>>106975438
>>106975443
>>106975478
i died
>>
File: 1757374207601420.mp4 (1.05 MB, 832x480)
>>106975462
and fixed at 480p:
>>
i conquered
>>
File: ComfyUI_07681_.png (1.95 MB, 1152x1152)
>>106975427
My guess is, whatever Chroma does with the original Flux weights, it slowly has to be merged back, just enough that it doesn't mess with Chroma's ability to be uncensored, and Flash HD is the closest to that. But since fixing it requires messing with networks we don't understand, it's really hard to replicate with the full version.
>>
>>106975489
you died
>>
>>106975462
>>106975498
are you using teacache? your setup is fuuuuucked buddy
>>
>>106975498
>fixed
nigger are you actually fuckin blind?
>>
File: fytfyu.png (55 KB, 1054x1041)
https://huggingface.co/vita-video-gen/svi-model

wan loras for unlimited length videos
>>
>>106975316
Nudity, (no vag) tho
>>
File: 1745763679555919.mp4 (835 KB, 832x480)
>>106975498
the anime girl gets up and runs out the door of the computer lab to her left very fast.

pretty smooth, motion seems good with the new loras
>>
>>106975316
>Not thrilled with downloading a bunch of single use loras coming from pony being able to do basically everything without needing a lora
You could perhaps combine all these loras and then save the resulting model. You could even call it a "mix" or "merge".

This idea will garner tens of thousands of downloads.
>>
>>106975584
>not knowing difference between right/left
women amirite
>>
>>106975584
gonna try with shift 5, was at 8: default is 5 (I think, from template)
>>
>>106975604
that IS her left
>>
>>106975609
shit, I'm retarded
>>
>>106975558
all based on wan2.1
>>
>>106975316
it's mostly loras. try smoothwan (civitai) or Phr00t/WAN2.2-14B-Rapid-AllInOne (hf) if you want more NSFW support merged in
>>
>>106975663
nah, the newest light loras released today blow away all other light loras, it's not worth using old merges anymore, there is almost no loss in quality / motion / prompt following with the new light loras now
>>
File: 00014-1043015969.png (3.93 MB, 1536x1536)
>>
File: blue.webm (562 KB, 512x512)
>>
>>106975678
>almost no loss in quality / motion / prompt following with new light loras now
wrong
>>
>>106975647
which tend to work on 2.2
>>
>>106975678
why are you blatantly lying you fucking faggot
>>
>>106975690
have you used today's ones? at 1.2 they work amazing now, I did side by sides earlier
>>
>>106975697
use 4 + 4 steps with newest ones at 1.2 weight, then look at it side by side, almost the same for a fraction of the time
>>
>>106975697
why are you blatantly lying you fucking faggot
>>
File: 1753628506062641.mp4 (893 KB, 832x480)
>>106975606
with 5 shift (default) with new loras, 1 strength

the anime girl gets up and runs out the door of the computer lab to her left very fast.
>>
>>106975720
or you can do 7+3 without light at all on high and get even better results
>>
>>106975722
I personally use 17 shift, not sure how much of a difference it really makes btw, always seemed better
>>
File: 00015-3147520400.png (3.6 MB, 1536x1536)
>>
>>106975731
for 4x+ longer gen times, and the differences are really tiny now
>>
>>106975747
>>
>>106975745
>8 steps = wow bro almost no time at all
>10 steps = 4x+ longer gen times
are you retarded
>>
>>106975759
cfg steps take twice as long, you are the retarded one
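
A back-of-envelope count for this argument, as a sketch under stated assumptions: a step with cfg above 1 costs two model passes (conditional plus unconditional), a step at cfg 1 costs one, and the high and low models cost the same per pass. Model swapping and offload time are ignored. `model_passes` is a hypothetical helper, not a ComfyUI function.

```python
def model_passes(cfg_steps: int, no_cfg_steps: int) -> int:
    """Total model passes: 2 per step with cfg > 1, 1 per step at cfg 1."""
    return 2 * cfg_steps + no_cfg_steps


# 4 + 4 steps, light lora on both high and low, everything at cfg 1.
light_both = model_passes(cfg_steps=0, no_cfg_steps=8)

# 7 + 3 steps: assumes the 7 high steps use cfg > 1 (no light lora),
# and the 3 low steps use the light lora at cfg 1.
no_light_high = model_passes(cfg_steps=7, no_cfg_steps=3)

print(light_both, no_light_high, no_light_high / light_both)
```

Under those assumptions 7+3 comes out to 17 passes against 8, a bit over 2x, so more than "10 steps vs 8" suggests but well short of 4x.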
>>
>>106975773
works on my machine
>>
>>106975773
you can use 1 cfg dipshit
>>
>>106975773
also if you use another lora for actions such as a BJ or something you can get away with even 3 + 3 steps
>>
>>106975785
lol then no way are you getting better or even close to as good results, show me these 1 cfg 7 step results without light lora, you are trolling


