[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106866715

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>made it to the collage twice
collage bros, we're eating good
>>
>>106873109
Good gen style mix, me likey.
>>
>>106873109
Trannybake
>>
Blessed thread of frenship
>>
>>106873136
>1girl
>1girl
>1girl
>1animal
>1girl
Right.
>>
>their shitty tryhard "art" gens didn't get picked
aww, better luck next time diddums!
>>
>>106873158
Artfaggotry genners are the absolute worst and I have zero respect for them.
Go post them on reddit with some pretentious twat name like "Obsidian Reverie Over a Vermilion Horizon"
>>
That cancel prompt early node for comfyui is great. Doesn't matter so much for image gen, but for video gen, its a game changer given each epoch can take upwards of a minute
>>
>>106873193
>epoch
step, I mean
>>
>>106873046
>In this gen, the prompt is leaking into the speech for some reason.
welcome back SDXL
>>
>>106873193
Wish there was one which added a button to fully purge models from VRAM. I know manager has those two buttons, but they don't work properly, there's always shit leftover, ie click it, dumps, still have a few GB being used. Restart comfy, you see it immediately free it up. Fucks you up sometimes when switching workflows/large models, ie from Chroma 2K to SeedVR2
>>
>>106873175
You are jealous because some anons are more sensitive than you.
>>
Blessed thread of Radiance shilling.

Though if I am being honest it does not surprise me that Lodestones having so much money to throw around (as already shown with Chroma) would pay people to promote their finetune to different web communities.

It is the only way I can make sense of seeing the same person shilling "1girl, cowboy shot" for days straight without stopping, showing quality that is barely Pony v7 tier or worse.
>>
>>106873243
The worst of both worlds, hated by 1girlers and hated even more by actual artists
>>
>>106873193
Link.
>>
>>106873234
Nu-comfy has memory issues. How do I know? I've used it for so long and had pretty long pauses between updates. New versions have clearly declined in quality, I'd say since 6 months or so.
>>
>>106873263
To add: of course this could partly be because of new pytorch/cuda versions too. It's risky to point fingers I guess.
>>
>>106873109
Why do landscapes get discriminated against and never get picked for the collage?
>>
>>106873268
It's not pytorch, I've tried various nightlies and older versions. Its core memory management is just ass.
>>
File: ComfyUI_07280.png (2.96 MB, 1280x1600)
2.96 MB
2.96 MB PNG
>>106873270
Have you tried adding big boobs to them?
>>
>>106873270
When I pick collages, I just go with what catches my eye. If a landscape does that, I'd pick it. If you want more of the things you like, start baking.
>>
>>106873270
Protip; don't ever gen just to get included in the collage. That's gay as fuck. Gen what you like, then if your stuff does get picked, cool. If not, who cares, you're still genning what you like and having fun doing it, and not everyone is gonna have your taste
>>
use case for subgraphs in comfyui?
>>
>>106873262
https://gist.github.com/blepping/99aeb38d7b26a4dbbbbd5034dca8aca8

Put that in custom_nodes, restart, add ModelPatchFastTerminate and connect it between your model loader and your lora loader, or whatever you have after your model loader.
>>
>>106873307
Shit. Nobody cares. What users want is the ability to create our own frontends in comfy. Noodles and nodes should be backend stuff the user can make, then you tie it into a custom UI. You could recreate the forge UI and enhance it if you wanted to.
>>
File: ComfyUI_temp_urpoq_00001_.png (3.42 MB, 1920x1280)
3.42 MB
3.42 MB PNG
>>106873175
>Obsidian Reverie Over a Vermilion Horizon
>>
File: 65462.gif (1.67 MB, 260x200)
1.67 MB
1.67 MB GIF
>>106873319
>>
>>106873316
Having workflow dependent UI's would be great and that's how you know Comfy will never do it
>>
>>106873316
By combining a bunch of switches and conditionals you can make your own pseudo-frontend already anyway. It is nice to have the option to just prompt and forget or to noodlewrangle depending on the needs of a given gen.
>>
>>106873309
>it works perfectly
should be a core feature instead of a node
>>
What is the best local anime to realism transformation model? Can they come close to 4o and nano banana
>>
>>106873466
Qwen Image/edit for example.
>>
File: WanVid_00019.webm (1.27 MB, 720x960)
1.27 MB
1.27 MB WEBM
did offices ever have formal dress or was this always a movie/porn trope

yes I'm a neet, what makes you ask
>>
A cute catgirl lies on her tummy on a bed in a brightly lit bedroom.She says <S> Look at my butt! <E> She humps the bed while making a lewd face. While her tail wags and her butt shakes, she says <S> It's so fat!<E>. <AUDCAP>clear female teen speech, load thumping bass techno music plays<ENDAUDCAP>

It seems you have to be pretty specific with Ovi or it will speak your prompt.
>>
>>106873537
Many offices have dress code for both men and women
>>
>>106873109
What is the best setup for on-style cartoon characters? I know of pony diffusion, but I feel like it didn't quite cut it on style accuracy.
>>
File: 00052-2338309234.png (2.64 MB, 1536x1536)
2.64 MB
2.64 MB PNG
>>
OH MY GOD

You can COLLAPSE nodes..
>>
>>106873599
Noobai with loras
>>
>>106873606
are you the same retard from yesterday that didnt know theres a button dedicated for masking?
>>
>>106873609
No, I'm a different retard.
>>
>>106873109
smoothpilled slopmaxxers represent
>>
Bros, I'm so happy. I finally solved the ugly artifacting for faster movement with my workflow. Go below cfg 1, I'm currently at .5 and it's looking so much better, and the motion still remains.
>>
>>106873636
Proof?
>>
File: 00000-692690760.png (1.83 MB, 1152x896)
1.83 MB
1.83 MB PNG
>>
>>106873636
It's the cfg in the low noise btw.
I didn't even think it would produce results below 1, that's why I never tried it.

>>106873682
You've been warned, nsfw.
https://files.catbox.moe/f1eopd.mp4
>>
>>106873786
Puke
>>
>>106873821
Haha, made you look!
>>
File: 00075-1833731849.png (2.65 MB, 1248x1824)
2.65 MB
2.65 MB PNG
>>
File: 00001-41554332.png (1.35 MB, 768x960)
1.35 MB
1.35 MB PNG
>>
qwen image edit would be so good if it didnt arbitrarily squish or stretch the image
>>
>>106873842
Add that as Last Frame and First Frame is just a pov over the field.
>>
>>106873786
kys
>>
what's the most potato setup i can use for wan2.2?
>>
File: 00084-4225980036.png (2.47 MB, 1248x1824)
2.47 MB
2.47 MB PNG
>>
File: 00002-708461488.png (1.44 MB, 768x960)
1.44 MB
1.44 MB PNG
>>
>>106873277
prompt ?
>>
>>106873924
bro hit me up with anime style, these 3dcg sluts are GROSS
>>
File: 00264-3327435714.png (1.18 MB, 936x1240)
1.18 MB
1.18 MB PNG
>>106874063
2d is boring and doesn't turn me on anymore. Not really big in doing stylish stuff anymore.
>>
>>106874081
i guess cartoon style is also fine
>>
>>106873924
catbox?
>>
File: 00097-4219372414.png (2.48 MB, 1248x1824)
2.48 MB
2.48 MB PNG
3d just has more sense of depth that bring me better imaginative immersion to a gen despite the annoying ai artifacts and defects.
>>
File: rec (1).jpg (75 KB, 1200x675)
75 KB
75 KB JPG
>>106873842
It'd be a real shame if someone used this lora on /gif/ https://huggingface.co/Remade-AI/Jumpscare/tree/main
>>
File: 00105-2967528178.png (2.38 MB, 1824x1248)
2.38 MB
2.38 MB PNG
>>106874093
https://files.catbox.moe/4urogs.png
https://civitai.com/models/1243990/edith-up-rayman-origins-il
https://civitai.com/models/784543/nova-animal-xl
https://civitai.com/models/715287/nova-3dcg-xl
https://civitai.com/models/1045588?modelVersionId=1767015
>>
File: dmmg_0020.png (1.46 MB, 832x1216)
1.46 MB
1.46 MB PNG
attempt to make a character lora based on two celebs, but i think it would still get b& if i upped it because the likeness is too strong. going to try a third face in the soup.
>>
>>106874162
Ahh, Miss Konoru-chan desu!
>>
>>106874116
ijk
>>
>>106874116
Pony also did 3d pretty well despite it being a broken model.
>>
>>106874162
what the fuck is that resolution
>>
File: dmmg_0017.png (1.3 MB, 832x1216)
1.3 MB
1.3 MB PNG
>>106874177
so desu ne~
>>
are there any nodes for comfyui that ensure an image is divisible by a certain number? looking to pad images before sending them to qwen
>>
alexandra daddario?
>>
File: photogen_00020_.jpg (667 KB, 1152x1720)
667 KB
667 KB JPG
>>106874206
daddario courteney cox hybrid
>>
File: Screenshot_1.png (41 KB, 507x689)
41 KB
41 KB PNG
>>106874227

>captcha: R0RGY
>>
File: dmmg_0005.png (1.3 MB, 832x1216)
1.3 MB
1.3 MB PNG
>>106874201
idk it works well for flux

>>106874227
kjnodes resize node has a divisible by property on it

>>106874229
>>106874249
daddario/jennifer connolly, so not far off, wanted the dark hair/light eyes combo
>>
>>106874299
This is flux?! such quality! how??
>>
>>106873537
When i worked in offices in the early 90's women would generally only dress in that fashion if there were new men who had started that week or who were going out after work or were a week away from having their period start.
I had a couple of women ask me to rate their recently shaved legs, Note: I have an obviously larger penis and ballz than average, not cut of trousers hides it, I've asked tailors and they all suggested wider thigh cuts that are like shorts built into suit trousers.
Anyway, I digress, yes regular women did used to come in looking like that. It was great, they moved with unconcious grace as well. Women these days rarely know how to walk in a style which highlights their underlying femine bone structure.
>>
How to retain the style of the image in wan i2v? It get's plastic the longer the video goes on.
>>
File: ComfyUI_00014_.png (1.45 MB, 896x1152)
1.45 MB
1.45 MB PNG
>>106874383
this is literally from the default comfyui workflow, just upped the steps to 32 and changed the resolution. usually i throw face/hand detailer/upscaler on but it's not that hard
>>
>>106874507
but this gen is not as good as the previous one..
>>
>>106874162
daddario or whatever her name is and um some bint i cant recall ummm a younger courtney cox?
>>
>>106874552
ok i should have finished reading the thread. I'm leaving now.
>>
>STILL no lactation lora
Is there a guide for training a Wan2.2 lora? I think I've waited long enough now
>>
what can't comfyui do!
>>
>>106874589
There's several of them dumbass
>>
File: file.png (162 KB, 1851x1483)
162 KB
162 KB PNG
>>106874600
please do tell where I can find them anon
>>
File: ComfyUI_temp_hbola_00001_.jpg (653 KB, 3584x1152)
653 KB
653 KB JPG
>>106874552
>>106874249
gonna use courtney as the third face and try again

>>106874513
face, hand detailer and upscale are not present on this, as was said in the post. here is the gen -> face detail -> hand detail -> upscale
>>
File: file.png (1.73 MB, 1328x1328)
1.73 MB
1.73 MB PNG
>>
>>106874623
the flux buttchin strikes again
>>
What are some alternatives to seedvr2? I am OOMing hard on a 5090 trying to maintain the entire batch of frames in one. There's so much colorshifts in the batches, making them useless.
>>
File: screenshot.1760366746.jpg (198 KB, 593x727)
198 KB
198 KB JPG
>all of a sudden everyone is posting about the cancel comfy node despite it being around for months purely because anon posted it a few days ago
really makes you think the type of people lurking here
>>
File: 00003-423793190.png (1.18 MB, 768x960)
1.18 MB
1.18 MB PNG
>>
File: 111.jpg (118 KB, 837x392)
118 KB
118 KB JPG
>>106874649
>What are some alternatives to seedvr2?
None. SeedVR2 is currently the best, even better than Topaz upscaling models.

>I am OOMing hard on a 5090 trying to maintain the entire batch of frames in one.
I can run in on my 3090 by using a batch_size of 161/257 frames(7 second video). You don't need to put the full amount of frames for decent temporal consistency.

>There's so much colorshifts in the batches, making them useless.
color match node helps with that
>>
>>106874687
LDGs influence cannot be overstated.
>>
>>106874588
wtf there's not even a nipple
>>
>>106874687
the crazy thing about this is that it actually just works. its not snake oil.
>>
>>106874716
Huh, I'm trying a 256 tile right now, only using up half the vram. Added the color match stuff, thanks.
>>
File: IMG_20251013_151541.png (2.12 MB, 768x1254)
2.12 MB
2.12 MB PNG
>>
>>106874787
i have my tiled vae disabled though. i found it doesnt help much. in any case, you're shit out of luck for trying to upscale videos longer than 10 seconds.

its basically only suitable for wan upscales.
>>
>>106874830
Damn, the color match actually worked, thanks a lot. I'm saving as proress raw 4444 also, the filesize is exponential, lol, from 68mb to 680mb, 720p to 1080p. I should stick to mp4.
>>
File: 00005-2001155487.png (1.16 MB, 896x1152)
1.16 MB
1.16 MB PNG
>>
>>106874927
cringed irl
>>
>>106874600
please let me know >>106874608!
>>
>>106873624
theres an alarming number of them desu
>>
>>106874927
i'm chicken
>>
File: ComfyUI_temp_vtoag_00005_.jpg (666 KB, 3328x1216)
666 KB
666 KB JPG
>>106874927
hell yeah

>>106874639
chill
>>
>want to do generic cyberpunky ladies with the cyberpunk edgerunners lines all over the bodies
>the second I put cyberpunk(series) in the tags, it gives me a lucy
SAD, I just wanted robo looking ladies
>>
File: 00006-3218638317.png (1.49 MB, 896x1152)
1.49 MB
1.49 MB PNG
>>
>>106874993
Maybe try combining it with tattoo and putting Lucy in the negative prompt? My guess is how those lines would be tagged in most boorus.
>>
nigga learn a new concept already
>>
File: 00007-3722643462.png (1.96 MB, 1152x896)
1.96 MB
1.96 MB PNG
>>
>>106875120
nyo~
>>
just got started this week with AnythingV4 using AUTOMATIC1111 web-ui, is there any telemetry or anything? I'm on Windows 10, is this truly private or do I need to switch to a Gentoo box with network stack deleted in a cabin in the woods?
>>
>>106875229
>AnythingV4
holy throwback
>>
>>106875229
just kys
>>
>>106875268
Also have XL downloaded but it's slower and I need more thicc anime bitches per second

>>106875272
There will be time for this later
>>
>>106875281
go to >>/sdg/
this is the big VRAM general
>>
>>106875284
kk thx
>>
>>106875229
Everything is telemetry unless you use a firewall to block it
>>
>>106875229
look into an application on your machine called "Windows Firewall" if you want to prevent certain applications from accessing the network

if you haven't done it already, and you feel the need to ask this question, it's probably too late. do you have a method to quickly render your hard drives unrecoverable? how about fully encrypted at rest?
>>
>>106875229
nobody cares about your thicc anime bitches. you can probably generate that normie shit right on google's servers with no auth or encryption.
>>
>>106875305
so far everything i've done is technically legal so i should be alright, thank you
>>
File: 1749784980345202.png (423 KB, 750x1000)
423 KB
423 KB PNG
>>106873122
>Holy fuck hahaha. We really were blessed with Wan
aktually, Ovi is Wan 2.2 5b
>>
When are we getting local models than can generate animated VR gaussian splats in real time?
>>
File: dmmg_0045.png (1.55 MB, 832x1216)
1.55 MB
1.55 MB PNG
>>106875157
>>
File: dmmg_0048.png (1.55 MB, 896x1152)
1.55 MB
1.55 MB PNG
>>
>>106873309
why not github it?
>>
>>106872969
O LAWDY, DAT SWEET BOOTY
>>
>>106875655
gotta keep it secret. if comfy finds out he's gonna FLIP
>>
>>106874780
kek, true dat
>>
What's next? Let me guess...
>1girl
>1girl
>1girl
>1animal
>>
>>106875699
User end improvement, in MY UI? I DON'T THINK SO.
>next update breaks the script
You MUST wait for the graceful termination.
>>
>>106873537
>did offices ever have formal dress or was this always a movie/porn trope
Yes but obviously not in latex skirts, just the same as you would see any OL in Japan or China or SK nowadays, knee length skirt and blouse.
The only time I see proper "office" dresses like that is when I work with luxury brands, it's kind of out of fashion everywhere else, too bad, it was hot.
Though people were thinner too back then, so it helped.

>>106874458
>Women these days rarely know how to walk in a style which highlights their underlying femine bone structure.
It's also the heels, without them the sway is not as pronounced.
>>
>>106875712
in a slopmix style of course sir
>>
>grok animation
Ok AIfags, you win. I'm a believer now. I hope this shit becomes easy to do locally soon because I'm seriously considering giving that fucking muskrat money.
>>
File: dmmg_0059.png (2 MB, 896x1152)
2 MB
2 MB PNG
>>106875712
>>
>>106875771
Isn't it censored?
>>
>>106875771
>I'm a believer now.
sora 2 wasn't enough for you to realize that?? lol
>>
File: 1744258777319782.mp4 (1.94 MB, 416x752)
1.94 MB
1.94 MB MP4
>>106875803
I haven't been able to get porn to work but others have. It will do lewd stuff though.
>>
>>106875832
Never tried it. Besides some porn galleries I've been mostly ignoring AI.
>>
>>106875866
based coomer
>>
>>106875840
I've seen porn, from what I understand it's censored but with the right words and luck it goes through, so way less anal than oai stuff.
It's good but not amazing compared to wan with loras, but boy is it so much better at anime/2d.
No yapping, good animation.
>>
>ram filled 100%
>vram 58%

Bitch what are you doing
>>
>>106875840
That is a child.
>>
>>106875783
>1flower
>>
>>106875930
>the app
They made an app?
>>
>>106875937
Apparently
https://apps.apple.com/us/app/grok/id6670324846
https://play.google.com/store/apps/details?id=ai.x.grok&hl=en_US
>>
>>106875840
>>106875930
Is that Roll? Delete that smut!
>>
File: DiscoElysium_00001_.jpg (1.37 MB, 2360x1848)
1.37 MB
1.37 MB JPG
>>
File: 1759702607679898.mp4 (1.29 MB, 560x560)
1.29 MB
1.29 MB MP4
>>106875952
>>
>>106875872
>but boy is it so much better at anime/2d.
https://litter.catbox.moe/np5mq5szkbi5cljq.webm
I've been amazed at what is has been able to do to my gens. I cannot wait for local to have something like this, with more control over the voices.
>>
>>106875930
>Robot*
Nevermind. Carry on my good anon.
>>
>>106875969
super cute!
>>
File: 1247223689563.gif (1014 KB, 640x526)
1014 KB
1014 KB GIF
>>106875973
Jesus the audio.
>>
>>106875973
it's worse than sora 2 but makes up for it by being less censored (which is not hard, sora 2)
which means it's an amazing advertisement for people to get the premium x thing, quite clever

I'll keep using wan but this is something to strive for
>>
File: IMG_20251013_201550.png (1.78 MB, 1248x741)
1.78 MB
1.78 MB PNG
My sense of humor maybe fucked but this is the funniest image I generated
>>
File: 00014-1950319551.png (1.05 MB, 768x960)
1.05 MB
1.05 MB PNG
>>
>>106875991
Yeah I got that one on the first try. The daily limit's annoying but maybe it's a good thing, otherwise I'd waste the whole day animating shit.
>>
>>106875995
I wrote "genki babbling" and boy was the prompt adhered to.

>>106876014
Yeah, I caught myself on multiple days debating whether or not to shell out for it. Just running random old gens with random prompts and seeing what came out. Whatever the next generation of local is, if it can even do half of this, it'll be revolutionary.
>>
>>106876038
>waste the whole day animating shit.
Your not doing any work, a program is doing it.
>>
>>106876075
No shit, Sherlock. Is English not your first language?
>>
>>106875917
OUT OF 10
>>
>>106876075
I don't think he meant he was animating the stuff himself, bait anon.
>>
>>106876025
is bro gonna b ok??
>>
File: 00015-1515998327.png (661 KB, 768x960)
661 KB
661 KB PNG
>>
>>106876025
Turn man around so he's facing woman for maximum air flow
>>
>>106876075
It's prooompting, and it's very hard work.

I will tell my grandchildren of my hours of toil in the prompting mines.
>>
File: 1752595993005876.png (1.63 MB, 832x1248)
1.63 MB
1.63 MB PNG
>>106875695
"change the image into realistic photo"
qwen image edit 2509
>>
>>106876143
Can you force it to make the face realistic? It's doing the photoshop thing that asian cosplayers do.
>>
File: DiscoElysium_00004_.jpg (1.32 MB, 2360x1848)
1.32 MB
1.32 MB JPG
>>
File: 547547474.png (1.43 MB, 832x1248)
1.43 MB
1.43 MB PNG
>>
>I want to into qwen edit
>but don't want to rape my pc with docker or raw comfy
>>
>>106876143
>muh realism
SLOP
>>
>>106876179
inpaint
>>
File: 768526245788567537.png (1.43 MB, 832x1248)
1.43 MB
1.43 MB PNG
>>
1flower my beloved
>>
>>106876199
kontext q8 does work alright, but qwen image edit seems next level.
>>
>>106876143
face doesn't correspond to body
>>
>>106876204
can you make the background less blurry ie in focus, in qwen image edit?

Also related, can you fix a flux chin?

sincerely asking, since as I said, the security risks of docker & raw dog comfyui are too great for me.
>>
>>106876199
>responding to the obnoxious namefag
kys
>>
File: 00016-3810499685.png (434 KB, 960x768)
434 KB
434 KB PNG
>>
File: 1745190758917799.png (1.82 MB, 1160x896)
1.82 MB
1.82 MB PNG
>>106873783
"show the back of the flower."
qwen image edit 2509
>>
>>106876243
Yeah, but consider that anime heads are really huge an unnatural. idk what ai is supposed to do about it.
>>
File: 6457546756758.png (1.04 MB, 832x1248)
1.04 MB
1.04 MB PNG
>>
>>
File: DiscoElysium_00007_.jpg (945 KB, 1728x1352)
945 KB
945 KB JPG
>>
File: apple not ai.png (93 KB, 761x960)
93 KB
93 KB PNG
>>106876250
Again, the features are perfectly placed. This is a problem for ai, it can't drawl wrong. or drawl "pretty good".>>106876250
>>
>>106876277
lmao ahhhh keep them both in focus lol
>>
File: kpop.jpg (647 KB, 1795x1152)
647 KB
647 KB JPG
body type loras are the way
>>
>>106876347
model? lora?
>>
so with 24gb vram you can pretty much only do Q6 qwen right? Q8 is 22gb and the text encoder is like 15gb
>>
>>106876379
qwen edit 2509**
>>
>>106876303
Very cool
>>
>>106876359
wan 2.2 t2i, custom lora (tommy king)
>>
>>106876306
the unhinged gemini 2 flash can sometimes get half way there
>>
>still no sph pinching gesture wan lora
im going to have to make one myself arent i
>>
>>106873316
ani already doing it kek
>>
File: 1740556194399734.png (198 KB, 1356x689)
198 KB
198 KB PNG
>>106876379
no, you can go for Q8 if you offload some of the model to the ram
https://github.com/pollockjj/ComfyUI-MultiGPU
>>
File: 1756078544460758.png (1.27 MB, 832x1248)
1.27 MB
1.27 MB PNG
>>106876296
"show the character standing in a T-pose"
qwen image edit 2509

>>106876379
I'm using a Q8 gguf on a 16GB 4080, gens in ~60s
>>
File: 00000-400149594.png (986 KB, 624x936)
986 KB
986 KB PNG
>>
>>106876482
>>106876385
>>
>>106875712
1girl is fine as long as it's not one dude spamming a dozen minor variations of the same gen day in day out
>>
File: S L O P.png (1.29 MB, 640x1221)
1.29 MB
1.29 MB PNG
https://www.reddit.com/r/StableDiffusion/comments/1o5o3ka/hunyuan_30_second_atempt_6_minutes_render_on_rtx/
>6 minutes render on rtx 6000 pro
imagine paying a 10000 dollars gpu, and waiting 6 minutes for this shit lmao
>>
> So, what are good settings for wan i2v? I've noticed that the same settings give different quality on i2v and t2v. And taking the same time, higher res with less steps is worse than lower res and higher number of steps.
Wansisters? Thoughts on 3 ksamplers workflows?
>>
>>106875973
>I cannot wait for local to have something like this
We're going to need to discover something better than just pushing brute force methods to their absolute limits if you want something that can run on a consumer GPU. Who knows when that breakthrough is going to happen.
>>
Nigger he POSTED ABOUT THAT HERE ALREADY
>>
>still no long video news

*sigh
>>
>>106876562
these visions from the eternal kingdom of heaven are directly transmitted to his graphics card via the power of chinese engineering and youre laughing?
>>
>>106876255
Eversion sequel looking lit
>>
>>106876598
you're wrong you western dog, don't waste your time on optimization, just stack more layers!
>>
>>106876460
damn the vae loss is so fucking bad, we need edit models in pixel space
>>
>>106876617
suspicious lack of 1girls in glorious chinese heaven
>>
>>106876490
Larping as the thread moderator again? Drink bleach, faggot.
>>
File: 1712316727798443.jpg (45 KB, 620x413)
45 KB
45 KB JPG
>>106873319
Beautiful
>>
>>106876643
>I want this general to end up like /sdg/
kys
>>
File: 00017-2007643454.png (961 KB, 768x960)
961 KB
961 KB PNG
>>
File: holo 6_.mp4 (1.53 MB, 512x960)
1.53 MB
1.53 MB MP4
I just started with wan last night on my local machine, never done video before. Using start and end image I often just get a pseudo fade out for the transition, but it is trying to animate it.

I am doing 81 frames at 16fps. boosting FPS just makes it faster but with the same fadeout at the midpoint. is this a prompting issue or do I need a more advanced character animation workflow?
>>
I'm almost more interested in audio generation than video generation at this point, and I'm surprised that it seems to be so much harder than video generation.
The day a good nsfw audio model becomes available locally will be a very dangerous day for me indeed.
>>
File: jpop.jpg (1.06 MB, 2688x1152)
1.06 MB
1.06 MB JPG
>>106876595
I use basically the same settings for i2v/t2v with different model sampling values. take this all with a grain of salt though

I run 12 steps.
3 steps: high with character/style loras only
3 steps: high with previous loras plus 4step lightx2v lora
6 steps: low with previous loras plus 4step lightx2v lora

this give pretty good outputs and fixes the slow-mo problem for me
>>
>>106876720
its not harder there's not the same level of interest to do it.
>>
>>106876720
fuck porn audio.
I want to train a SNES audio lora and make some memorable tunes.
>>
>>106876490
So NetaSpammer, RadianceSchizo and Qwen Miku tester are spammers? Two days ago there was an anon flooding with PixAI images, KEK.
>>
>>106876720
this minus the nsfw part
i genuinely want a model that doesnt focus on producing shitty lyrics and can make weird (in a good way) sounding instrumentals instead so i can sample it
>>
>>106876385
great!

The essence of bad drawling is like this, you drawl a nose, and then drawl an eye (from the nose, getting about right) and so on, all across the face. errors accumulate, scale changes accumulate, and you wind up with the "dead reckoning" drawling. It's the basic naturally instinctive (wrong) way to drawl. ai does it correctly, it starts with the blurry wide awareness of the frame, and sets everything up.
>>
>>106876767
Train a transformer on MIDI event streams then
>>
>>106876767
It's not really the right way to do it. ie sonos, udio, they do it the diffusion way.

I mean, it's fun or whatever, but the right way to do it is, since it's game audio, you need
>isolated sounds (these are ok to use diffusion on - but probably better to teach the ai to change settings)
>mixingrules (if you aren't just running the ai settings on real metal, you'll need to have processing, because if you just mix individual sounds on modern software, it sounds wrong)
>the midi files - this is the real thing, and I think most people are using llms for this. I can search for it. midi, because of how universal it is, but idk maybe tracker files are a better choice for this.

and probably a mixed mode could
>>
>>106876682

Try using Wan2.2 Kijai. Native comfyui seems to produce weird ghosting for me.
>>
>>106876856
nice thanks.
ghosting, I wasnt sure of the proper term.
>>
>>106876720
mmaudio although not trained for nsfw, can mimic most nsfw sounds just fine. you just need to be creative with how you prompt it to get something that sounds similar.

it also works better when you layer it. for example, if you want the bed creaking, only prompt that, nothing else. then run it again with ambient only prompt, or other sex noises. then you merge all the audio into a single file.

https://vocaroo.com/1eRET1zSTeHL

i cant post the video it goes to for uhh, reasons.
>>
>>106876925
Can you make longer sound gens?
>>
>>106876941
I believe so, but I've only ever used it for wan gens, so 7 seconds. I need to get around to trying it.
>>
seedvr2 is so fucking good. i wish i had an rtx 6000. the vram requirements are insane for this
>>
>>106876925
There is a nsfw finetune, could you please test it?
https://huggingface.co/phazei/NSFW_MMaudio
>>
File: NeoLumina.png (58 KB, 809x723)
58 KB
58 KB PNG
Exclusive /ldg/ news!!!
Haoming02, the developer of ForgeClassic and NeoForge, is going to implement Neta/YumeLumina into NeoForge after adding support for Qwen Image Edit!

WE ARE BACK!
>>
>>106877023
I tried it but kept getting too many voices in the audio, despite having talking/etc in the negatives. Maybe I need to experiment more with it.
>>
>>106877054
*yawn*
>>
>>106876925
the bed creaking sounds are always so erotic by themselves
>>
I'm howling.
>>
File: 24524.png (446 KB, 604x451)
446 KB
446 KB PNG
Anyone have any benchmarks for laptop GPU?
I will be buying new one and I would like to get one that could handle gen of some images or even videos. I just don't have space for tower and well I might need to move quite often so only such machine is possible for me.
>>
>>106877099
>I might need to move quite
Unironically use web services of various kinds, and get an ipad.
>>
File: chroma hands.png (3.31 MB, 1600x896)
3.31 MB
3.31 MB PNG
Chroma is pretty good at bad hands.
>>
>>106877122
I would prefer way more to have my own laptop to do that work. If there is no benchmarks I will just try one with best GPU possible and pray to not melt it.
>>
>>106877184
If you absolutely insist, then do the external gpu thingy. That way you can change it out, and it'll be possible to actually use it for anything real ai wise.
>>
File: Video_00027.mp4 (1.26 MB, 592x720)
1.26 MB
1.26 MB MP4
>>106876595
>workflow in image
disable the sharpening, as you can see, it causes problems.
https://files.catbox.moe/bwi19c.png
>>
>>106877068
hello julien!
>>
File: 00058-3502328901.png (1.37 MB, 768x960)
1.37 MB
1.37 MB PNG
>>
>>106873046
>>106873110
i can use ovi with comfy on a 3090 using kijai's models and implementation. it's not great though
https://github.com/kijai/ComfyUI-WanVideoWrapper/issues/1343#issuecomment-3382969479
>>
File: Chroma 27 chads.png (2.89 MB, 1600x896)
2.89 MB
2.89 MB PNG
>>106877150
Chroma 27 is better :^)
>>
File: 00059-572243652.png (920 KB, 960x768)
920 KB
920 KB PNG
>>
>>106873109
so what can I do with a 9060xt?
>>
File: 00010-2315374717.png (1.02 MB, 1224x768)
1.02 MB
1.02 MB PNG
>>
chat is it true? >>106877709
>>
>>106878073
You'll have to pay China for your tastes in porn.
>>
>>106878073
who cares. this is the local general. why would you care about cloud gen shit.
>>
File: 1748351684074420.png (217 KB, 716x659)
217 KB
217 KB PNG
>>106878088
but china refuses nsfw. I have no problem, spending money on nsfw on paid video models. but they are all censored. you spend money on flower gens...
>>
are there local video models with sound?
>>
>>106878073
Local general faggot
>>
Are there any loras or nodes that makes movement cartoonishly wobbly? Closest I found is schizo prompting, resmultistep, cfg between 1.5 and 1.8 and (by accident) stacking lightx2v loras. It kinda almost does what I want but completely cooks the gen. Also adjusting shift does literally nothing.
>>
>>106878114
>>106878359
yet that post is on /lmg/ which is a local general, and no one pissed and shitted themselves over it lol
>>
File: 1728913563232816.png (52 KB, 1884x366)
52 KB
52 KB PNG
>>106878073
Yes.
Probably a move to get them some cash back since the disaster of losing their payment processors.
Don't really care as long as the website stays as is to get models and loras.
>>
speaking of Civitai, how come there's so little qwen edit loras? there's only like 3 useful ones. the rest are meme/garbage.
>>
>>106878506
civitai is 95% people with 3060
>>
>>106878506
I guess people don't want to make lora on a model that'll have a new version each month
>>
>>106878506
good training and dataset preparation procedures are still unknown.
if you think you have a good dataset/configuration, post it and maybe I'll try processing it for you.
>>
>>106878506
Qwen Image Edit doesn't have its own category yet, AND the useful ideas tend to be banned (like the nudify loras for example).
>>
File: 00047-931072244.jpg (1.08 MB, 2560x2048)
1.08 MB
1.08 MB JPG
>>
>>106878506
>there's only like 3 useful ones. the rest are meme/garbage.
that's because the good ones are getting removed from civitai
>>
File: what.png (37 KB, 1188x141)
37 KB
37 KB PNG
>>
>>106878610
>discord
why would you decide to go to gay hell anon? :(
>>
>>106878610
what's even the context there
>>
>>106878669
I decided to go defend the civitai change to the discord that is fuming because it's funny and they decided I hated minorities
>>
File: ms.png (1.89 MB, 1696x1296)
1.89 MB
1.89 MB PNG
hope I'm able to gen on Linux. this is gonna suck
>>
File: 00049-3232986652.jpg (1.1 MB, 2048x2560)
1.1 MB
1.1 MB JPG
>>
everything that has to do with local is being enshittified and it's all free and/or open spurce. what the fuck happened?
>>
>>106878673
I don't see the link between pony, civitai, the change of free tier, and minorities
are they all high there or something
>>
File: mad.png (37 KB, 1197x155)
37 KB
37 KB PNG
>>106878702
it's the people mad the free stuff is no longer free they pretty easy to get (you)s from
>>
>>106878715
Wan can do that (probably even 2.1 with the right loras). Depends on your settings, the loras, the prompt, what optimizations you have installed, etc.
>>
civitai is crap. my animation of a large breasted peasant woman has been in "pending for review" for a week
>>
>>106878796
they simply never review anything said as "waiting for review"
>>
>>106878715
call me when it's able to do a realistic undressing and not just clothes magically gone
I wonder if it's even possible to make an "undress lora" on wan
>>
>>106878815
I dont want it to be that realistic and involved, I just want it for quick shit posting/rage baiting
>>
>>106878449
lmg anons are so smart and cool and rich and awesome and have huge dicks and
>>
File: 1760275711389765.png (22 KB, 1008x689)
22 KB
22 KB PNG
I havent do any local generation since 2 years ago. But after CIVITAI Demise, anything that i should know of ? What generator should i use, etc etc ???

Image for easy (You)
>>
>>106875771
i also bit the bait. im convinced they got porn in the dataset because holy shit
>>
>>106878610
>leaf
>fag
>retarded
Not uncommon. Dare I say a majority of them are that way.
>>
>>106878973
shit included fluids unprompted for a mating press image to video pic
>>
File: 00051-2930974564.jpg (986 KB, 2048x2560)
986 KB
986 KB JPG
>>
>>106878815
If you think about the geometry actually involved in a realistic undressing, I am not surprised models are currently not equipped for it. Timelines in AI are always funky, but I suspect this is one of those things like "character A hands object to character B" that is so much harder than it looks it will not he quite "right" for a long time.
>>
>>106878995
SD1.5? I see a hint of bruising on the top right
>>
>>106875803
it is but with a bit of work you can keep re-rolling a prompt or change it a bit and one will go through.

ive had a new fully nude images rendered and a few vids with partial nipple exposed.

biggest issue is the limits. there seems to be 2 limits, first limit is 50 clips per day with some kind of rolling limit.

I suspect theres a way to grab the render before you get the "content moderated" error but I have not figured it out yet. like there has to be a way to capture all the frames as they happen
>>
File: 1737664629115408.png (1.57 MB, 1272x816)
1.57 MB
1.57 MB PNG
is flux still the best local for prompt comprehension?
>>
>>106878946
It's very erotic if done well, it mainly depends if it's done in a feminine way or simply "get rid of clothes asap"

>>106878997
honestly I think it's just lack of training for that, just like models are shit at understanding what underwear are and sometime make them a second skin because not enough material was fed to the model
>>
File: 1743103781181674.jpg (932 KB, 1248x1824)
932 KB
932 KB JPG
>>106879100
Qwen is way better in terms of comprehensio now, but chroma is more diverse in what it can achieve.
>>
>>106879120
>honestly I think it's just lack of training for that
Oh sure, it ultimately boils down to that, but the next time you take your shirt off, imagine the movements of your skeleton/rig in coordination with each other (multiple limbs moving at once) and how the fabric needs to move and deform. That's when physics enforces the "no tunneling" rule and there is no computational cost. Of course a model that is only x gigabytes big with all these other concepts sometimes gives up and makes fabric warp and disappear.
>>
File: 00101-3171878302.png (328 KB, 768x960)
328 KB
328 KB PNG
>>
File: chr27 tries.png (2.74 MB, 1600x896)
2.74 MB
2.74 MB PNG
>>106879132
Qwen is way better on hands. Here's 27...
>>
Bro i coping so hard right now Im gonna ragebuy RTX 5080. Anyone stop me....
>>
>>106879146
makes sense
>>
>>106879178
just use a credit card and make small monthly payments goyim, its ez. you'll barely incur much interest. you could die in a car acciddent tomorrow and never experience all those glorious gens. you only live once, do it.
>>
>>106879178
STOP
YOU VIOLATED THE LAW
>>
>>106879178
>16GB
LMAO
>>
>>106879203
yea its pretty stupid to buy anything below 24gb now. even 24gb is barely cutting it with newer models. in 2-3 years you will definitely need 48gb.
>>
When ready

>>106879215
>>106879215
>>106879215
>>106879215
>>
File: 1743866589854065.png (1.67 MB, 1272x816)
1.67 MB
1.67 MB PNG
>>106879132
downloading now. thanks
>>
File: neat.png (3.09 MB, 1600x896)
3.09 MB
3.09 MB PNG
>>106879172



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.