/g/ - Technology






Eat Your Spinach Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107154826

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Neta Yume (Lumina 2)
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd
https://gumgum10.github.io/gumgum.github.io/
https://neta-lumina-style.tz03.xyz/
https://huggingface.co/neta-art/Neta-Lumina

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>nigbo
>>
>>107162296
>that middle top image
>same height
>same breast sizes
>same hair style
>same hair color
>same face
Are people really this tone deaf and narrow minded?
>>
Blessed thread of frenship
>>
>>107162315
Anon has a clone harem
>>
>generated the perfect fucking cute girl poop video
>it’s entirely subtle. She lays there eying the camera curled up pajama bottoms down with a small poop log sliding down her butt from her perfect chocolate starfish
If this wasn’t a blue board I’d make the college easily.
>>
>>107162472
can wan actually do scat without a lora?
>>
>tfw no more girl pointing and laughing at user gens
what happened???
>>
>>107162490
I did it quite easily without one. As long as you have a starter image of a girl with a nice exposed butthole. It’s so good I’m impressed.
>>
>>107162068
>do you get better prompt adherence with fp16 clip compared to fp8 scaled?
fp16 fast is better than Q8, which is strictly better than fp8, so yes

>>107162472
you need to prompt or catbox the gen right now because I have tried to do poop coming out and it always just mostly turns into a brown bubble right around her asshole and never plops down fully
>>107162490
>can wan actually do scat without a lora?
its better than you think but still not good enough imo but then again i didnt try the angle anon is referring to (only squatting from behind)
>>
>>107162315
I thought they were supposed to be his sisters and he was simply living out his incest dream.
>>
>>107162530
>>107162546
I see. Not into scat, but always curious what it is capable of when it comes to NSFW/fetishes.
>>
>>107162580
>always curious what it is capable of when it comes to NSFW/fetishes
pissfags are eating well because the concept of a stream of water dripping from a vagina is a ridiculously simple concept that probably doesn't need any piss content in the training data at all

and you don't need to be into scat to appreciate a voluptuous muslim woman squatting and pooping on a flag of israel

>>107162530
oh it's I2V. never thought about i2v for scat but it's probably godlike making anyone you want be able to go poopy
>>
Anyone have link to the neta lumina style example site?
>>
>>107162602
>and you don't need to be into scat to appreciate a voluptuous muslim woman squatting and pooping on a flag of israel
you absolutely need to be into scat to find that hot, you're just aroused by political bullshit or something so it counters that I guess
>>
>>107162613
it's in the op
>>
>>107162664
people look at the op?
>>
>https://github.com/wallen0322/ComfyUI-Wan22FMLF
Anyone tried this for continuing videos and reusing not just the last frame but the last x ones?
>>
>>107162315
>gen image with a detailer
>show my friends on ldg
>vramlet gets mad
>>
>>107162530
Post it in /s*g/ they need the traffic
>>
>>107162750
>vramlet
>needs a detailer
>he can't gen at a high base res
lol lmao. all trolling aside, lrn2detailer dingus, you can use [SEP] and left-to-right detailing priority to make different faces.
>>
>>107162772
>spending 2 hours to improve shitposting gens
hmmm nyo~!
>>
>despite 5 links of shilling, still nobody wants to use neta
shit model lol
>>
>>107162546
>fp16 fast is better than Q8, which is strictly better than fp8, so yes
NTA but is there an nsfw fp16 clip? I only found a bf16 that didn't work with my checkpoint
>>
>>107162780
wow, here on display the average intelligence of someone that gens slop like yours. many such cases!
>>
Threadly reminder that Qwen-Image is shit and can't make fucking centaurs, as seen in OP's collage. Chroma (which is a hated model here) can do it with flying colors
>>
>>107162872
Sounds like a skill issue
>>
>>107162872
Why are you trying to make centaurs? Are you an animalfucker?
>>
>>107162772
i dont think its worth it to use any of those region prompting tools or whatever, you need to perfectly engineer your prompt then roll a seed you are happy with. its better to manually plan out and photoshop images if you are trying to make a good one. you wouldnt even be able to adetail a brunette to a blonde without ruining the consistency of the perspective and lighting
>>
>>107162872
>>
>tfw vramlet
when will the based chinks drop a vram monster
fuck ngreedia
>>
Threadly reminder that you should be angry 24/7
>>
>>107162879
>>107162880
It boggles my mind that a fucking 20b model doesn't know such a common concept
wtf were those guys doing
>>
We all know qwen is shit, just check the leaderboards. Local cant even manage to get a top-10 model anymore
>>
File: 1742349858142729.jpg (1020 KB, 1248x1824)
>>
>>107162917
What were they THINKING?!
>>
>>107162933
Sanakeks be like
>>
>each subsequent run of flux/chroma in comfyui is getting slower and slower, up to 11s/it
>vram isn't even maxed
>cpu barely being touched
>no custom nodes
nani the fuck?
>>
>>107162872
just train a lora that will work better than chroma out of the box
>>
>>107162967
You are supposed to use ComfyUI with API nodes
>>
>>107162977
and i should be using a condom with your mom, but neither are happening in our lifetimes.
>>
>>107162977
Go back
>>
What's the point of running a local model if you're gonna use API nodes?
>>
>>107163062
hes just shitposting, ignore
>>
>>107162933
That's a lot of claws for one bear.
>>
>>107162930
Did the last hunyuan fall off?
>>
Any changes to the wan workflow meta since August?

What about these lightx2v models? Are they worth using over what kijai supplies?

https://huggingface.co/lightx2v/Wan2.2-Distill-Models/tree/main
>>
>>107163135
>workflow meta
there's a workflow meta? thought it was every man for himself out here. different strokes for different fellas.
>>
File: ComfyUI_00196.png (1.2 MB, 896x1152)
Nothing better than genning MILFs I know on Tropical vacations while I use my PC as a space heater on the cold Winter days, peak cozy.
>>
File: 071033_00001.mp4 (2.26 MB, 656x960)
>cumfart suddenly atrociously slow
>wan gen times nearly doubled
fuck this retarded noodle soup niggerlicious slop shit. not even reinstalling ((dependencies)) fixed this sudden heat death.
well. at least i can still gen at all.
>>
>>107163271
is this real?
>>
best local model to bulk caption images for lora datasets? need quality, vram/ram not an issue
>>
>>107163271
AniStudio.
>>
Anyone had luck getting a wan 2.2 aio to work with high noise loras
>>
>>107162802
Nah it's good
>>
>>107163406
fuck off
>>
>>107163271
I’m just replying to say I kekked to tears.
>>
File: 00048-2852057480.png (2.19 MB, 1824x1248)
>>
File: 00062-1567798864.png (2.04 MB, 1824x1248)
>>
>>107162971
If you need to train a lora, then it isn't "out of the box"
>>
>>107163368
For dataset captioning, if you have less than, say, 300 images, you won't want to use local models, just use the Gemini API. It's free anyway and can caption NSFW too as long as it isn't too spicy
>>
>>107163502
same. i needed some catharsis after the bullshit. crazy what wan can do at low resolutions.
>>
File: psxpsf_00001_.png (2.71 MB, 1792x1152)
another anon recently posted a 2000s photography lora for Chroma with an over 600 image dataset, which made me want to experiment with flux. here's a lora based on the Canon PowerShot s40, one of the most popular cameras around that time.
>>
File: 1749748633173921.jpg (78 KB, 832x1216)
>>107162296
Anons, what about stable diffusion do you love so much?
>>
>>107163603
i meant the lora will work better vs default chroma
>>
>>107163645
For Flux you might consider doing a rank 32 lora or higher, otherwise you won't get rid of the buttchins and plastic skin slop
>>
File: lora_00029_.jpg (694 KB, 1264x1656)
>>
File: 1739888771350868.png (39 KB, 667x304)
what language is this
>>
>>107163701
king
>>
>>107163701
would marry and creampie 20 times while tending to her bird nest
>>
>>107163802
damn bro
>>
should i use booru dataset manager to tag my images? i used an auto tagger but i dont think its good, it tags literally everything, i think i need to do it manually
>>
File: lora_00035_.jpg (843 KB, 1264x1656)
>>
File: dmmg_0025.png (1.3 MB, 1216x832)
>>107163681
this is rank 128. i have other loras i apply for faces and skin if i need to.
>>
>>107163888
Then the dataset is pretty bad. I've done 2000s photography loras before and they actually looked like images you'd see in Myspace and blogs back in the day
>>
Wondering if anyone has gotten manga / comics to work in any model

The loop is pretty simple so thinking of building and doing some end-to-end learning to optimize.
>>
File: local.png (1.49 MB, 1328x1024)
Reminder that local will not be getting Nano Banana 2 tier models in a long time. If the final release is unslopped and high quality like that, then it will bfto all chink models, local or otherwise

The good news at least is that they will finally notice people hate plastic skin garbage and they will scrape from quality synthetic data (since all they do is scrape anyway)
>>
File: 1739062167922515.jpg (652 KB, 1572x773)
>>107163938
>>107156804
The better proprietarycuck edit models are, the better outputs the new qwen image edit model can be easily trained on, thanks for spending millions for local to snatch it all up for free before training a clothes remover lora within a couple hours lol
>>
File: chroma___0001.png (3.99 MB, 1264x1656)
>>107163912
to reiterate, this is not a 2000s photography lora, this is a canon powershot s40 lora.
>>
was wonderin' when them there anon would start poastin tiny fruit 2 gens
>>
>>107163956
idk they look like regular smartphone photos to me
the chroma sony cybershot lora I trained looked like photos straight from 2004~2005 down to red eyes and strong flash on the faces, squashed colors and some minor blurriness / grain
>>
>>107163956
Fingers technically make sense but damn what a weird position kek
>>
File: de-destilled flux.jpg (1.5 MB, 960x1440)
Do qwen image loras reach the same level of quality as de-distilled flux when making celeb loras? Asking so I know whether it's worth training
>>
File: 7558492.webm (875 KB, 960x720)
never considered 'young hot women bullying older women for not being as young and hot' as an avenue for gooning, but here we are
>>
>>107164037
did you make the lawnmower gen? it was epic
>>
File: 1740913023859141.jpg (292 KB, 467x568)
>>107164072
reverse is more kino if you're not low test
>>
>>107164084
Nah
>>
>>107164088
harder to get across imo. You don't get formerly hot young women, unless they're acid attack victims or some shit, and that's just depressing
>>
>>107164072
cringe
>>107164088
based
>>
well gen it then, god
>>
File: dmmg_0065.png (1.41 MB, 832x1216)
>>107163991
fair enough, this is still flux and it's a 9mb minimal lora. might need some more steps

>>107164036
chroma gonna chroma
>>
>Computer, take the thousands of generated bimbos in my folders and create a tiktok clone social media site filled with those women making videos for me to scroll through.
Soon.
>>
>>107164233
I want a life sized version of my 1girl to cook, clean, and take care of me when I'm sad.
>>
Yooooo! PonyV6 just dropped!!
>>
>>107163645
I don't remember that thing having anywhere near that shallow depth of field.
>>
>>107164314
pony v7 bros... our response!??!!?
>>
File: chud.png (1.31 MB, 1328x1024)
>>
>>107164314
Personally, I prefer Pony 9999999 in 1
>>
>>107164314
abstract bullshit bros we eatin good
>>
>>107164479
CFG 0.7 bros...
>>
File: nano banana 2.png (1023 KB, 1704x953)
local is dead, the technological gap that saas managed to gain this year is insane
>>
File: chud.mp4 (834 KB, 736x568)
>>107164459
>>
File: ggufed.png (37 KB, 376x513)
fp8 e4m3n is garbage doodoo caca and should never be used ever under any circumstance. q8_0 all the way

the fact that OP suggests it by default by linking to https://comfyanonymous.github.io/ComfyUI_examples/wan22/ is stinky

the fact that we use fp8 scaled for the WAN model itself at all instead of Q8_0 is stinky. i will make an opinionated guide to t2v shortly

bonus slop: https://rentry.org/QUANTIZATION_ANALYSIS


oh and Q8_0 and the fp8 quants still use the full vram of fp16 since they "unpack" the weights into fp16, read the rentry to understand why
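
For anyone wondering what "unpacking" means here, a rough sketch follows. It assumes the GGML-style Q8_0 layout (blocks of 32 int8 values sharing one scale). The function names are made up for illustration, and the scale is kept as a plain Python float rather than fp16, so this is not the actual ComfyUI or GGUF loader code.

```python
BLOCK = 32  # assumption: Q8_0 groups weights into blocks of 32


def quantize_q8_0(block):
    """Quantize one block of 32 floats to (scale, list of int8-range ints)."""
    assert len(block) == BLOCK
    amax = max(abs(x) for x in block)
    if amax == 0:
        return 0.0, [0] * BLOCK
    scale = amax / 127.0  # one shared scale per block
    q = [max(-127, min(127, round(x / scale))) for x in block]
    return scale, q


def dequantize_q8_0(scale, q):
    """'Unpack' a block back to floats; this is what the compute path sees."""
    return [scale * v for v in q]
```

The stored form is small, but the matmul runs on the dequantized values, which is why memory use during compute depends on whether the unpacked weights are kept around.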
>>
>>107164801
is either one actually right? i dont speak nerd
>>
how do i make a good self insert that isnt an ugly man?
>>
>>107164828
>should never be used ever under any circumstance
this is a lie because q8 actually uses slightly more vram than fp8: it's not true 8 bit, Q8_0 works out to about 8.5 bits per weight once you count the per-block scale
>still use the full vram
to clarify, they still use the full vram in this testing. read the #why-this-benchmark-doesnt-use-memory-efficient-inference section
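
The overhead can be worked out by hand. This is a back-of-the-envelope sketch that assumes the GGML Q8_0 layout (32 int8 values plus one fp16 scale per block); the helper names are hypothetical, and the 14B parameter count is just the Wan 2.2 size mentioned elsewhere in the thread.

```python
def bits_per_weight(block_size=32, value_bits=8, scale_bits=16):
    """Average storage cost per weight, including the shared per-block scale."""
    return (block_size * value_bits + scale_bits) / block_size


def weight_gib(n_params, bpw):
    """Size of the stored weights in GiB for a given bits-per-weight figure."""
    return n_params * bpw / 8 / 2**30


q8_bpw = bits_per_weight()           # 32 int8 values + one fp16 scale
fp8_gib = weight_gib(14e9, 8.0)      # plain 8 bits per weight
q8_gib = weight_gib(14e9, q8_bpw)    # Q8_0 with scale overhead
```

This only counts stored weights. Activations, text encoder, VAE and any dequantized copies come on top.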
>>
>>107162677
I've tried and the video has flashes.
>>
>>107164828
I care not for these faggy "numbers" and "data". Show me pretty images to convince me.
>>
>>107164801
saar...
>>
>>107164901
What is this homo "text". Show me a tiktok dance
>>
I asked Gemini and it said I am right and you're a retard. What do you have to say now?
>>
>>107164828
>fp8 quants still use the full vram of fp16
fp8 quants dont do that if you have fp8 support gpu

Although yes, people suggest and use fp8 because they have to cope somehow that they wasted money buying 2k+$ gpu instead of a 3090
>>
>>107164941
now we tokkin
>>
I asked deepseek_q_0.1 and it spat out some gibberish, but it's local so it must be right
>>
hi i'm a dumbass with a question
i've been using the comfy ui wan workflow to gen porn vids from pics. when i do short vids the quality is great. when i do long vids, the quality gets blurry af.
why? is it a memory issue or something? i have 64gb ram 16gb vram. i figured it's just making a new frame every time, why would the length of the vid be a factor?
>>
>>107164995
>i figured it's just making a new frame every time
it's not, it generates all frames at once
>>
>>107164995
did you try asking grok
>>
>>107164995
because you touch yourself at night
>>
>>107165006
well that is amazing, shows what i know. guess i gotta learn to stitch vids together
>>107165007
it told me to come here
>>
>>107165017
that's the general idea, yes
>>
>>107165023
if it generated a frame at a time we wouldn't be having such a hard time making continuations seamless
>>
>>107165037
that makes sense. i'm a newfag with vids but it's fun as fuck
>>
i can do the matrix multiplication faster in my head than these stupid gpus.. i just can't write the answers as quickly
>>
i pulled down that localsong project and started messing around with it. gonna take 36 hours to train the lora from the 10 mp3s I provided on my old intel macbook. was a bitch to get working too, doesn't work at all on windows. this shit better be worth it
>>
>>107164995
how long are you making them?
>>
File: 1748119024227701.png (2.34 MB, 1536x1536)
pi-Flow qwen, 20s gen time

there's definitely a quality tradeoff
>>
>>107164995
when you image to video, wan uses details in the base image so you can get away with lightning loras, but if you prompt for a girl to walk around and wan has to generate new details, it will look like crap and you'll need to increase the steps and bump up the cfg. it's a really delicate process to get something that looks good and doesnt take 40 minutes to gen
>>
>>107165329
think length was 200-300? took about an hour to gen each one, let it run overnight. was very disappointed lol
>>107165359
yeah it's been a trial with me messing with random params and hoping for the best, right now i'm using 20 steps and 3.5 cfg. haven't even tried lightning loras yet. is there a good one to start with if you don't mind?
>>
File: kkk.mp4 (793 KB, 864x480)
>>
>>107165400
lol the poof on left's hair
>>
>>107165329
i checked, blurry outputs are 18 seconds. can't figure out how to read the comfy metadata on the vids yet to get the exact length.
>>
Does wan context window actually extend T2V gens or is it snake oil?
>>
>>107165502
wan is trained for five second videos. you can get away with 10 seconds without visual degradation but the motion will suffer.
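
To turn seconds into a frame count, something like the sketch below works. It assumes the 14B models run at 16 fps and expect frame counts of the form 4n+1 (so the usual five second clip is 81 frames); both numbers are assumptions to check against your own workflow, and the function name is made up.

```python
def wan_frame_count(seconds, fps=16):
    """Nearest frame count of the form 4n+1 for a clip of the given length."""
    raw = round(seconds * fps)
    n = max(0, round((raw - 1) / 4))
    return 4 * n + 1
```

Under these assumptions 5 s is 81 frames, 10 s is 161, and an 18 s clip is around 289 frames, well past what the model was trained on.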
>>
>>107165546
man good to know, i gotta rtfm
>>
>>107165396
i use kijai's work flow from this guide https://rentry.org/wan22ldgguide. people have made other lightning loras but i have not tried them. for no lightning at all you should start with 20 steps high 20 steps low and 5 cfg
>>
>>107165400
hotties, they should have guns like they infiltrated the terrorist org and came to clean house
>>
File: Astaroth & Talim.jpg (264 KB, 1900x2103)
Any of you up for doing some funny animations with this image.
>>
File: ComfyUI__00049_.mp4 (655 KB, 480x832)
my sides
>>
>>107165743
Noice. Keep em coming. I want outrageous ones to wind up someone with them.
>>
File: file.mp4 (2.29 MB, 720x1280)
Is this AI?
>>
>>107165783
can't be.. its too realistic
>>
>>107165810
>THAT jiggle
>realistic
>meanwhile the melting dog in the background
this shit is DOA tier. which is why i love A.I.
>>
>>107165783
It is but it's getting very hard to tell. I only figure it is because the person's instagram page who posted it has way too many girls doing a similar walk with no description or anything. Someone real would have a way more active profile.

https://www.instagram.com/kanakrajput.54/
>>
>>107165849
>Kanak Rajput
come on now
>>
>>107165858
there's a bunch of indian women with very fat asses which is laughable. 100% ai
>>
>>107165627
thank you so much anon, going through this now
>>
>>107165783
it is real but perhaps enhanced
>>
>>107165872
pajeetas go to KFC now
>>
>>107165872
>indian women with very fat asses which is laughable
indian women have no ass? (no actual idea)
>>
>>107165872
the hindu GODS using A.I to enhance their 5/10s to 6.5/10s, gotta respect the 6gb gpu hustle.
https://civitai.com/images/25399462
>>
>>107165922
hell no they're like asians. these girls asses are like they all got Brazilian butt lifts.
>>
>>107164826
kek
>>
>>107165821
There was a dog in the video?
>>
File: 1761688352986768.jpg (258 KB, 1167x364)
saars this is very real
>>
tfw i have too much shame to start an ai baddie w huge ass account and gain one gorillion followers
>>
>>107165965
second and fourth look good, fifth can exist, rest is body horror material
>>
>>107165980
the only way you'd make money is pay walling it on onlyfans or something, and even that is very oversaturated now. unless you personally get a kick out of baiting horny men, i wouldnt do it.
>>
File: 1742790208785985.jpg (136 KB, 979x1211)
>>107165938
damn, though asians can compensate with smaller proportions and nice legs
or win the genetic lottery like picrel
>>
>>107165965
typical chudai/wataa account
>>
>>107165965
I'll never understand the appeal of hyper proportions.
>>
File: ComfyUI_00006_.mp4 (498 KB, 640x640)
>>107165762
>>
>>107166050
Hehe.
>>
>>107164801
Looks like they finally found a way to somewhat catch up to Chroma's realism. Grats to them, but it's still censored.
>>
XL until the heat death of the universe
>>
>>107165783
Looks like Sora 2 gen if it's AI.
>>
>>107164801
local devs made the mistake of chasing paramemeters instead of going for small easily trainable models that can be readily modified.
>>
>>107154883
The sound is too low quality. Seems to be a model in the midst of pretraining; it still needs a lot more pretraining to sound good.
>>
>>107164801
The whiteboard math crap is pure benchmaxxing in response to one of OpenAI's examples when GPT-4's image generator was released. They're obviously training it specifically for this task and it's practically useless.
>>
>>
>>107166242
looks pretty bad desu, very artificial and the face is pretty weird and a bit horrifying
>>
>>107166201
the local coper
>>
>>107164801
>still uses vae
kek, trash
>>
>>107164801
every CS grad should know this. it's just basic linear algebra
>>
>>107166301
does it?
>>
>>107166242
Also no fat chicks.
>>
File: 55545114545.png (1.14 MB, 1406x766)
>>107166266
I checked Twitter and can tell the model will probably be insane, but they're neutering the crap out of it. There's a reason we can't try it out ourselves yet.

https://x.com/synthwavedd/status/1987259262322749784
https://x.com/marmaduke091/status/1987474311691768059
https://x.com/synthwavedd/status/1987507356834427355/photo/1
https://x.com/rpnickson/status/1987353691167264792

Whiteboard crap aside this is pretty impressive. Giving me early Flux vibes, except the model is 200x smarter and more knowledgeable.
>>
File: 454554645214.png (1.08 MB, 1399x760)
>>107166351
>>
>>107166363
is this real?
>>
What's the meta for local vid gen now? I haven't tried it in like a year but I want to try again
>>
>>107166380
wan 2.2
>>
>>107166351
>but they're neutering the crap out of it
of course, but it's always good to see it's possible so it's kind of a goal on local at some point
>>
File: 4551125145454.png (845 KB, 661x936)
>>107166369
Raw txt2img from Nano banana 2, same as pic rel
>>
>>107166266
I wouldn't call it horrifying. But the clothes look super artificial yes.
>>
>>107166380
Wan 2.2. 14B
>>
>>107166380
cope with wan2.2 because wan2.5 is actually good so they made it closed-source.
>>
Glad the cloudkeks got a little something, they really need it. Was worried about them for a second there.
>>
>>107166351
>>107166363
very excited to see these same images posted a couple dozen times in the future over and over again
>>
>>107166412
>kind of a goal on local at some point
Google probably trained this newer model on the entirety of their indexed images. It knows every celebrity on the planet. It even knows your face if you've ever posted a picture out there. There's no way local will ever catch up to a dataset of this scale.
>>
>>107166434
>>107166436
>>107166386
Is it even worth trying if i only have 32 gbs of RAM?
>>
File: 1762704086968138.jpg (1.14 MB, 3094x2168)
>>107166483
Though, deep down we all know that the reason local won't ever catch up is because we rely on China. China loves synthetic slop. They're blind to it, as in they can't tell the difference between an AI image and realism. They will claim another Seedream clone (which itself is also trained on synthetic slop) is on par with Nano Banana 2.
>>
>>107166507
what gpu do you have
>>
File: minus social credit.png (59 KB, 512x174)
>>107166538
arr rook da same
>>
>>107166507
It will be moderately slowed down but can still be worth it if you have a good enough GPU (3090 and above)
If you have the patience to wait I can run it in 10 minutes with 3060 + 32.
>>
File: roman farm plebs.png (1.77 MB, 1344x768)
i unironically have doms from these 12 hour goon sessions. my biceps, should and upper back are getting ripped
>>
What is the difference between /sdg/ and /ldg/?
>>
>>107166597
the first letter is different
>>
>>107166597
/ldg/ actual local diffusion model discussion in between coom slop and drama.
/sdg/ dead discord server of the schizos exiled from here.
>>
>>107166592
>he finally figured out what peasant women are supposed to look like
proud of you son.
>>
>>107166615
>exiled
no. /ldg/ was made to get away from them
>>
>>107166597
sdg is the original diffusion thread.it was overrun by cancerous namefags that loved to circlejerk each other, so ldg was created for actual local gen discussion and sdg remains as a containment thread.
>>
File: 78784545.jpg (2.83 MB, 4032x2304)
It's interesting that Flux dev SRPO solved plastic skin, but only at FP32, or at least it's very evident that BF16 really downgrades the skin. Unless something is wrong with this random comparison I found online.
>>
>>107166483
Sure but even something 80% as good would be nice, I really think we'll have something equivalent but local so nsfw friendly and non censored.
Flux/Qwen aren't the end goal of local after all.
>>
>>107166592
Couldn't you just find this normie shit on PornHub or something.
>>
>>107166710
>PornHub
wat
>>
File: SRPO Plot 1.jpg (220 KB, 820x469)
>>107166699
Another one. I don't have the full pic, but even at low quality you can see it really messes with the detail.
>>
>>107166615
Ranfaggot wasn't happy with his discord server insisted on creating a new thread. He's permanently obsessed with this **** guy (see the op link, that's been made by ranfaggot). Ironically he's even bigger loser and avatarfag than the people he's so obsessed about.
>>
>>107166710
find me futa peasant girl porn on pornhub
>>
>>107166682
I wish LDG had a news subject like /SDG/ had and like /LMG/
>>
>>107166840
there is no meaningful news surrounding local. reminder that the retard who posted them was so desperate for any scrap of progress that he included literal malware just so things didn't look dry. the news posts are full of nothing but shitty snakeoil
>>
>>107166840
just lurk and youll see news as it happens anon
>>
>>107166840
any news actually worth something that is usable in the local space will be naturally brought up and discussed as anons test and experiment with it.
>>
>>
>there is no meaningful news surrounding local
Demonstrably false btw
>>
is anytest controlnet the best way to copy an existing image but change the character to the prompt, or is there a better way
>>
File: ComfyUI_07069_.png (1.21 MB, 1008x1032)
>>
> Download Wan2.2 github
> Download Wan2.2-I2V-A14B from huggingface
> Download lightx2v/Wan2.2-Lightning
> Try to run the example command from the Wan2.2-Lightning readme.md
> "--lora_dir is not a recognized arg"

what the fuck is this code rotten shit just sitting there wasting my time with this horseshit

Wan2.2's generate.py just doesn't support loras or something? Why the fuck is it in the readme? Is the entire readme an LLM hallucination or some shit?

> Can only find people using it via ComfyUI

how is it possible that ONLY a visual programming interface is functional? All visual programming corresponds to a fucking library on the back end, how the fuck is there no python example of wan2.2-lightning working?

seriously, I thought web developers had a retarded community, but ML academics are a unique brand of code-illiterate
>>
>>107167010
i can't even find a python example of how to run Wan2.2 in two stages, so I don't need both 32gb models (low/high) loaded into memory at the same time.
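
A minimal sketch of the two-stage idea, for what it's worth. Everything here is hypothetical: `load_high`, `load_low` and `denoise_step` are placeholder callables you would have to write yourself around the actual repo code, and none of them are part of the Wan2.2 API. The only point is the ordering: run the high-noise expert, drop it, then load the low-noise expert.

```python
import gc


def run_two_stage(load_high, load_low, denoise_step, latents,
                  total_steps, boundary_step):
    """Run the high-noise expert first, free it, then run the low-noise expert.

    load_high / load_low: hypothetical zero-argument loaders returning a model.
    denoise_step: hypothetical callable (model, latents, step) -> latents.
    """
    model = load_high()
    for step in range(boundary_step):
        latents = denoise_step(model, latents, step)
    del model      # drop the only reference to the first expert
    gc.collect()   # with PyTorch, torch.cuda.empty_cache() would usually follow

    model = load_low()
    for step in range(boundary_step, total_steps):
        latents = denoise_step(model, latents, step)
    del model
    gc.collect()
    return latents
```

Only one expert is referenced at any time, so peak memory is roughly one model plus latents, at the cost of a second load from disk per gen.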
>>
>>107166351
The google AI pipeline:
>lab creates it
>legal neuters it
>PR kills it
>>
>Qwen falls down the leaderboard to rank 26, local models knocked out of the top-20
>WAN announces WAN 2.5 will not be local
>ComfyUI stated that recent local models are shit and not worth implementing
>Rumored top-tier local model 'mogao' turns out to be API-only
There is a lot of local news actually.
>>
File: ComfyUI_00038_.png (1.2 MB, 1024x1024)
>>107167064
Then
>I rape it with a jailbreak/system instruct that tells it to ERP
>>
>>107166146
nope
>>
>>107167010
>>107167025
I used the stock repo for a while. You can definitely modify it to unload/load whenever you want. No idea about loras, but the lightning lora will make the results garbage so don't bother.
>>
File: qwen_00134_.png (1.82 MB, 1024x1536)
>>
File: ComfyUI_21317.png (2.69 MB, 1200x1800)
>>107166699
SRPO at FP16/Q8 needs a ton of steps (I use 65 steps w/ res_2m currently) and high resolutions to overcome the naturally grubby output. Considering how little low frequency noise is in the FP32 images, that's probably the real difference at full precision (fewer errors there), but none of those images are the same either, so there might be other factors at play.
>>
>>107167010
if you're too cool to use a UI then you should be smart enough to figure it out yourself.
>>
>>107167281
mating press jenny
>>
>>107167080
>>WAN announces WAN 2.5 will not be local
My gut feeling is that they will eventually release the weights for the 480p and 720p models but will keep 1080p API-only
>>
>>107167551
even that's wishful thinking.
>>
>>107167560
The dev himself said on a livestream that they intend to release the weights after the preview ends. He also said the model's requirements are much larger this time and implied that most people who could run it on gaming gpus so far would not be able to run 2.5 (it now has an audio component, which is another model itself). They said they will at the very least release the code and paper (it would be weird to release code without any weights)
>>
>jeeterboard
>rumor
>who cares
>ancient rumor
What did he mean by this
>>
https://github.com/kijai/ComfyUI-WanVideoWrapper/commit/d0ef3b5601ae72c49e735a324daf300567fac68d

After any update that modifies the model code and when using torch.compile it's common to run into issues with VRAM, this can be caused by using older pytorch/triton version without latest compile fixes, and/or from old triton caches, mostly in Windows. This manifests in the issue that first run of new input size may have drastically increased memory use, which can clear from simply running it again, and once cached, not manifest again. Again I've only seen this happen in Windows.

To clear your Triton cache you can delete the contents of following (default) folders:

`C:\Users\<username>\.triton`
`C:\Users\<username>\AppData\Local\Temp\torchinductor_<username>`
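
If you'd rather script the cleanup than delete by hand, here is a minimal Python sketch. It assumes the caches sit in the two default folders listed above and have not been relocated; the helper names `default_cache_dirs` and `clear_dir_contents` are made up for illustration and are not part of the wrapper, PyTorch, or Triton.

```python
import getpass
import shutil
import tempfile
from pathlib import Path


def default_cache_dirs():
    """Return the two default cache folders named above (assumed locations)."""
    return [
        Path.home() / ".triton",
        Path(tempfile.gettempdir()) / f"torchinductor_{getpass.getuser()}",
    ]


def clear_dir_contents(folder):
    """Delete everything inside `folder` but keep the folder itself.

    Returns the number of top-level entries removed (0 if the folder is missing).
    """
    folder = Path(folder)
    if not folder.is_dir():
        return 0
    removed = 0
    for entry in folder.iterdir():
        if entry.is_dir() and not entry.is_symlink():
            shutil.rmtree(entry)  # remove a cached subfolder and its files
        else:
            entry.unlink()  # remove a plain file or a symlink
        removed += 1
    return removed
```

To use it, close ComfyUI first, then run `for d in default_cache_dirs(): print(d, clear_dir_contents(d))`. The next run with torch.compile will recompile from scratch, so expect the first generation to be slow again.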
>>
Alibaba really hasn't been nice lately, though. Qwen-Max (the 1T param LLM) is API-only, their ASR model is API-only, Wan 2.5 is API-only, and probably more stuff I don't remember right now.
I hope in Wan's case they reconsider and at the very least go the BFL route and release the weights as non-commercial if competition is the issue. They won't really lose money just because some gooners run their models in their basement
They should def reconsider the Apache license though, because Midjourney and some other labs whose names I forgot are using their own Wan fine-tunes. I don't care if some grifters get fucked as long as I can run local
>>
File: 1737235828052870.mp4 (2.26 MB, 720x1280)
>>107165783
found this online
>>
>>107167732
where's the dog?
>>
Thinking of buying a 5080 for local image gen and fine tuning. Also probably need a RAM upgrade since I only have 16GB RAM. Worth it or is there a better meta? For example I thought about buying gpu time from some cloud service and training on their super machines. But I'm not sure if there could be legal issues there since I don't really own the images. Also what is the best model to train right now? From the look of it, it seems like qwen is popular here right now.
>>
>>107167762
5k series is kind of a waste of money if you can find a 3090 or 4090 used
>>
>>107167757
there seem to be many similar videos from this girl
>>
>>107167762
if you're gonna go 5080 you might as well go 5090, skip going used for older architectures, the prices aren't worth it.
still extremely glad i went for the 16gb 5060 ti, but i am probably skipping until like 2030 because nvidia is going to go nuclear holocaust (hehe) jew with 6000 and beyond for sure. anyone saying otherwise is nuclear holocaust coping that it "Couldnt get worse"
>>
File: media_1762814400.png (1.32 MB, 768x1280)
>>
>>107162750
no but its low effort slop that shouldnt have made the collage
but then again the collages are totally random so its hit n miss
>>
>>107167769
The 4090 seems way more expensive even used, we're talking 2x the price. The 3090 is cheaper and has more vram, but its performance, at least according to the benchmarks linked here, is lower than the 5080's, and I'm not sure if more vram is just better for this use case. Like >>107167787 says, may as well go 5090 at this point, but that is even worse in terms of price, closer to 3x. Not sure what the nuclear holocaust comment means. You mean they just burn power, right? Yeah, unfortunately gpus these days just get sloppier with power use and burn more and more watts for 20% more performance
>>
>>107167912
If you have a high hz 4K monitor the 5090 is totally worth it for vidya alone
>>
>>107167912
by nuclear holocaust i meant a metaphor for "going all the way". if you think these prices are bad, just wait for next year.
5090 pretty sure you can get for MSRP, which is a better value than paying 2x+ msrp for the 4090 and the gouges of the 3090.
i got the 5060 because i wanted sage attention 2 + 1280x720 genning without OOM'ing. and given i upgraded from pre-AI architecture, one hell of a jump.
the 6000 series will be way way worse and for identical or even lower vram.
>>
>>107167762
>training
>on 16gb vram
if you're just genning it's fine. 100% need a ram upgrade
>>
>>107167932
Not too interested in gaming right now, only really play hearthstone and that runs on everything. My only reason to upgrade right now is for AI image gen.
>>107167939
Yea these prices are fucking retarded basically the top cards are super inflated probably because of AI and NVIDIA knows this and plays into it because of how they play these vram games. Looking at the 50 series lineup everything from the 5060, 5070, 5080 are gatekept at 16 GB vram because they want to sell more 90 cards. I'm thinking this is just to sell to people who want to load AI models and the only difference is getting the images faster.
>>
>>107168015
what's the minimum for training the best local Image only models right now? Is 32gb ram and 16 vram not enough for lets say sdxl training or sd3.5 or whatever the best is? Have no idea about any other models like qwen since I haven't followed the scene for a few months.
>>
>>107168092
NTA, but that's fine for SDXL. Qwen is a fucking monster and you will probably need at least 24gb to train it. Train it meaningfully I mean. And by train I mean LoRA
>>
>asus shop 5090 msrp restocks getting rarer and rarer
is it actually over?
>>
why is there no universal wan genitals lora that just werks?
>>
>>107168102
So I wonder if I should just get the 5080 + ram upgrade for generating images. Then train on runpod or some other service. And just use their H200 or some other bullshit level card for training really heavy image models. I mean consumer cards seem like a joke for training anyways.
>>
>>107168167
because it would be complicated. you'd have to train it on a bunch of different angles and sex positions and most people making loras would rather just do the sex position since that's easier.

your best bet is a general NSFW lora
>>
>>107167939
3090 around $600-$700 and it seems stable for months, honestly it's the best card to start with on a budget
5090 msrp for now (less volume soon so prices will go up for sure)
what about the 4090 though, is it finally somewhat around used at ok prices?
>>
>>107168167
What anon, you don't enjoy girls having testicles (just that, no dick), and men having dick coming out of their mouth?

>>107168227
It's the most annoying thing with face change in i2v because of a lora, especially with anything blowjob.
>>
>>107167281
got dam thats hot
>>
>>107168273
Are there really no new features in the newer gens that make them better for AI? I vaguely remember something about FP4 advertised by NVIDIA. Just marketing or what? All I'm paying for is more/faster vram?
>>
>>107168289
yeah some loras are basically unusable because they change the appearance of the person. whoever made them doesnt know how to make concept loras. probably no reg datasets.
>>
>>107168333
of course there is, just that it's not something worth getting at any price
just in raw compute, a 5090 will be way faster than a 3090, at 3.5 times the cost...
>>
>>107166597
/sdg/ is the original thread. /ldg/ was the split created by a troll. You're being held hostage here.
>>
man it's so good just telling qwen image edit to get rid of the text on an image, and it working out of the box
>>
>>107168413
What do you mean?
>>
>>107168421
just what I wrote anon, I had text over images from very old stuff, usually either speech bubbles, titles and book covers with images I never found the origin of, and it's really nice just asking it to make a cleaned image out of them
>>
>>107168413
>qwen image edit
>working out of the box
I assume you're using comfy
is there a way to make it do that with swarmUI?
>>
>>107168525
no idea sorry, I use comfy and everything works well, I just modified the example wf to use res3m/bong tangent and automatic input resolution and that's it
>>
I make so much cool AI art and none of it gets any attention. It hurts deeply. I'm ecstatic to get a single upvote on civitai. I don't know how people get so much clout online, while my stuff is totally invisible. But I think it's healthy. I don't want to grow dependent on upvotes for self-validation. I'll just continue making art that no one will ever see
>>
>>107168608
No one is impressed by what you generate, not even "ai artists". They're only interested in your prompt/workflow.
>>
>>107168608
>I'm ecstatic to get a single upvote on civitai
>posting art on civitai which is heavily botted
anyway post your civitai profile
>>
>>107168630
I think the real issue is no one even sees it, it's like I'm being censored by default
>>
File: file.png (1.56 MB, 698x1329)
>>107168608
is it that important to get nonsensical laughing emojis over every gen
(seriously, wtf is with the laughing emojis on civit, it's insane)
>>
>>107168643
civitai gets flooded with so much slop that unless you advertise it on another platform you won't be seen.
>>
File: pipeorgan1.png (2.51 MB, 1824x1248)
>>107168638
I'd rather remain anonymous but here's one work in progress
On here I will only post things once because if I do a repost the schizo who lives here 24/7 will immediately throw a fit
>>
File: dmmg_0299.png (1.21 MB, 832x1216)
>>
>>107168644
Obviously a hilarious gen.
>>
>>107168662
>1girl
>"art"
this better be a joke
>>
>>107168644
the proportions of the boat are all fucked up, how does this get thousands of votes
>>
File: _you_.jpg (157 KB, 1024x768)
>>107168676
ok here's one for you
>>
>>107168608
Much like the Faggollage, one should not concern themselves with the opinions of others
>>107168676
Nah that one's pretty cool
>>
>>107168608
stop caring and post what you like
>>
>>107168676
of course 1girl is art, I bet the first art made by humanity were hunted animals and 1girls
>>
>>107168690
nice Furk
>>
File: dmmg_0263.png (1.42 MB, 832x1216)
>>107168608
just keep posting through it, i barely get likes but i post anyway
>>
>>107168690
pretty good. i like it. cool concept. its refreshing to see stuff that isnt anime 1girl
>>
File: _mood_.jpg (462 KB, 1600x1104)
>>
1furk
>>
>>107168690
Wait a fucking minute
>Grok
Are you fucking serious? You posted API shit in a local thread? Fucking asshole
>>
>>107168800
Retard detected.
>>
File: 1731980932764328.jpg (128 KB, 1024x1536)
>>
1turk, massive breasts, wide hips
>>
>>107168662
>he actually believes the schizo "lives here" and doesnt post to multiple diffusion threads across 4chinz not caring about which thread is which
holy
>>
File: FluxKrea_Output_251512.png (2.33 MB, 1176x1752)
>>
>>107168690
>1roach, massive table, masterpiece
>>
File: 990941155.png (17 KB, 1344x768)
>>
>>107168824
change it to "the world is a joke" pls
>>
>>107162933
chroma?
>>
>>
>Neta Lumina Lightning LoRA
https://civitai.com/models/2115586
impatientbros we eat
>>
>>107169186
I just noticed the CatTower guy made a version https://civitai.com/models/2072920/neta-cat-tower I'm surprised anon hasn't talked about it since it seems many like his Illustrious model.
>>
>>107164826
kek, API model cuck BTFO
>>
What does Neta Lumina have over Noob/Illust?

>>107169186
>>107169211
>>
>>107169211
interesting, wonder how it compares to v3.5
>>
File: QuestionableHoneyAd.mp4 (1.86 MB, 464x688)
I bet you could convince boomers this was a real ad kek
>>
File: ComfyUI_00009_.mp4 (595 KB, 640x832)
>>
>>107169259
noob is still SOTA for soul but yume clears ilu by virtue of its 16ch vae and spatial understanding among other things
>>
>>107168663
hot
>>
>>107167091
>giving FAGMAN access to all of your weird ass fetishes
I dont care how good they make it, i will ONLY sext my PC.
>>
Impatientlets complaining about generation times while I need half a day to go through multiple thousand qwen image images that generated in the other half of the day every day baka
>>
>>107169348
i kneel, patiencechad
>>
File: file.png (65 KB, 623x731)
>>107169211
I wonder how long it took him
https://huggingface.co/nukomasshigura/Neta-Cat-Tower
>>
>>107167939
>if you think these prices are bad, just wait for next year.
I need to upgrade my RAM from 32gb of ddr5 but prices are so insanely retarded the 64gb sticks I was flirting with a week ago before i learned about RAM shortages are now 2x in price. It's only going to get worse so I feel like I should just spread my ass now and pray for a good Christmas bonus.

>4090 and the gouges of the 3090
3090s have come down - 4090s are still retarded.
5060ti 16gb are coming in at sub MSRP if you find the right sale. probably the best NEW entry point.
>>
>>107169375
2.1k images at 1280x1280 with that high learning rate (compared to qwen image anyway) on a 5090, probably not too crazy, should be less than two weeks 24/7 unless im missing something, but im not an expert nor do i know that architecture
>>
>>107164852
>1girl + 1handsome retard
I'm sorry anon but it's just not possible with the current technology
>>
>>107169404
nevermind the learning rate is actually low, e-5, qwen image trains well for 3k steps at 2e-4
>>
>>107169433
>>107169433
>>107169433
>>107169433
>>
>>107169348
>Thousand gens of Qwen
How many actually different outputs is that, thirty?
>>
>>107169647
Using qwen image edit 4 step lightning lora on the qwen image model fixes the low seed difference problem, i think it was posted about first here https://civitai.com/models/2093591
>>
>>107169186
Finally!


