[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>106910887

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
long live illustrious
>>
File: dmmg_0002.png (1.52 MB, 896x1152)
1.52 MB
1.52 MB PNG
>>106915050
these loras are for standard flux, and annie lora is fighting for its life with krea
>>
Blessed thread of frenship
>>
>>106915115
is that the community woman? she barely resembles her. i wouldnt have noticed if you didnt say annie
>>
>>106915118
do you bless your shota collection too?
>>
File: ComfyUI_06133_.png (1.29 MB, 920x1136)
1.29 MB
1.29 MB PNG
>>
File: 1752545549912599.webm (3.89 MB, 2048x867)
3.89 MB
3.89 MB WEBM
>>
Reminder that Chroma has significant structural errors with most content types. It only works reliably for furry content and NSFW realism.
Chroma it's a broken model.
>>
have you tried being better at prompting?
>>
File: dmmg_0008.png (1.42 MB, 896x1152)
1.42 MB
1.42 MB PNG
>>106915120
yeah i'm still experimenting, this is one had to go to 1.4 to get a result at all without warping the image entirely
>>
File: ComfyUI_06088_.png (867 KB, 880x1184)
867 KB
867 KB PNG
>>
>It only works reliably for furry content and NSFW realism
so, it's good? the only bad thing about chroma is the genning times outside of flash
>>
what happens if i train a chroma lora with booru tags?
>>
File: ComfyUI_06126_.png (1.11 MB, 960x1088)
1.11 MB
1.11 MB PNG
>>
File: ComfyUI_06137_.png (1.06 MB, 944x1096)
1.06 MB
1.06 MB PNG
>>
File: ComfyUI_06134_.png (1.37 MB, 976x1072)
1.37 MB
1.37 MB PNG
>>
File: ComfyUI_06139_.png (1.24 MB, 1176x888)
1.24 MB
1.24 MB PNG
>>
File: dmmg_0021.png (1.37 MB, 896x1152)
1.37 MB
1.37 MB PNG
>>
File: ComfyUI_06123_.png (2.06 MB, 1328x1328)
2.06 MB
2.06 MB PNG
>>
File: zstyle.jpg (473 KB, 1792x1216)
473 KB
473 KB JPG
zstyle lora (zishy) tests are not bad
>>
Experiments with using SD1.5 junk outputs as latent images with low initial denoises for more interesting compositions in IL. Would need a lot of inpainting and corrections to actually work, since at low denoises it struggles to know what to do with what you give it, but the 1girls sure look neat.
>>
File: ComfyUI_06159_.png (1.25 MB, 1048x992)
1.25 MB
1.25 MB PNG
>>
>>106915407
What model does text so clear?
>>
File: ComfyUI_06160_.png (1016 KB, 952x1096)
1016 KB
1016 KB PNG
>>106915415
qwen with the lora that turns drawings into realistic pictures.
>>
File: ComfyUI_06045_.png (1.04 MB, 1016x1024)
1.04 MB
1.04 MB PNG
>>
Speaking of text, why can't qwen translate the text? Translating and coloring doujins would be so easy.
>>
>>106915421
kek

>>106915430
this kinda goes hard
>>
File: ComfyUI_06154_.png (1.04 MB, 1176x880)
1.04 MB
1.04 MB PNG
>>
File: dmmg_0015.png (1.25 MB, 896x1152)
1.25 MB
1.25 MB PNG
>>
File: ComfyUI_06144_.png (1.2 MB, 856x1216)
1.2 MB
1.2 MB PNG
>>
File: dmmg_0039.png (1.44 MB, 896x1152)
1.44 MB
1.44 MB PNG
>>
File: ComfyUI_06164_.png (1011 KB, 1120x928)
1011 KB
1011 KB PNG
>>
how do prompt good
>>
File: ComfyUI_06147_.png (1.21 MB, 1168x896)
1.21 MB
1.21 MB PNG
>>
>>
File: ComfyUI_06153_.png (1.3 MB, 1080x968)
1.3 MB
1.3 MB PNG
>>
File: ComfyUI_06176_.png (1.15 MB, 760x1368)
1.15 MB
1.15 MB PNG
kek
>>
>>106915556
need smart
>>
>>106915421
lora name?
>>
best frame interpolation method now?
>>
File: 1729824932508543.mp4 (2.39 MB, 720x912)
2.39 MB
2.39 MB MP4
>>106915455
>>
>>106915702
Still film. All the others have issues with high motion.
>>
File: WanVid_00004.webm (1.23 MB, 720x960)
1.23 MB
1.23 MB WEBM
halloween is a non holiday, buuuut...
>>
How has everyone been coping during the current local [video but also in general] diffusion models drought/slump we are in

I got like 3 cooms off of sora lewds (I love thick women with thicker accents so the sound goon was a unique and enjoyable new goon experience) but then I gave up due to too much moderation. I think I gave myself a fetish for that type of honk-laugh ghetto Brazilian women make in the process so that's nice

I literally read an entire chapter of Livy today because I was that bored. Wan 2.5 demoralized me so hard that even though it's getting colder and the heating hasn't turned on I'm still not able to get back into genning

>>106911896
>if anybody is interested in playing with animate more these look interesting, even has a cunny showcase
I've seen clips of that girl on ads in my gym so I'm assuming it's stock footage. Neat, there's more shots of her doing the splits and stretching and stuff
If platinum hair wasn't such a turn off to me I'd track down the source of the video it shouldn't be too hard since it seems popular enough, just "platinum hair girl bodysuit acrobatics stock footage" is probably enough

>>106915617
>qrd?
jews
but like unironically, in the sense that they think they know better than you about how to serve you and give you "safe and aligned" AI. Anthropic will never, ever release an open model fundamentally because of their perspective on safety.

>>106915736
Wow that water looks like absolute dogshit even for wan wtf?
>>
>>106915643
nta but I think it's this
https://civitai.com/models/1906441/qwen-edit-reality-transform-by-aldniki
>>
File: AnimateDiff_00003-1.mp4 (3.76 MB, 498x720)
3.76 MB
3.76 MB MP4
Haha holy shit, tested that autistic light lora mix. 127s for 121frames. Not a single slow motion.
>>
>>106915828
>127s for 121frames
Wew
hardware?
>>
File: realisticlora.png (2.4 MB, 1680x1240)
2.4 MB
2.4 MB PNG
>>106915779
wow, not bad actually
and also adding that lora for some reason gave me my best ever qwen edit time, 55s/it with the 4 step lora
>>
File: ComfyUI_temp_ucbis_00001_.jpg (522 KB, 1727x1320)
522 KB
522 KB JPG
>>
File: me.png (1.13 MB, 1072x976)
1.13 MB
1.13 MB PNG
>>106915779
Kek I fucking love this stupid thing.
This lora and it's consequences have been a major league swagout for the frog edits.
>>
What will be the next big model?
>>
File: QwenEdit_00099_.png (958 KB, 1368x760)
958 KB
958 KB PNG
I will be shitting up the thread for a little bit sorry. Join me.
>>
>>106915865
>55s/it holy VRAMLET
>>
>>106915929
>What will be the next big model?
HunyuanImage 4.0, 400b model, very big saar!
>>
>>106915949
dude, on the last one it got down to 39
my SDXL is like 1.5 but qwen edit + amd = its ripping and tearing time
but you have no idea how excited I am by these numbers, I've seen upwards of 150 running complicated ones
It's suffering but it will help me appreciate the power when I upgrade
>>
>>106915929
money's drying up, even altman is pivoting towards trying to earn a legit profit now, the beginning of the end
>>
File: 1747807675213734.png (90 KB, 277x182)
90 KB
90 KB PNG
>>106915969
>the beginning of the end
right at the moment of the 2 MoE models meme on Wan 2.2, FUCK
>>
>>106915929
maybe we'll be saved with bitnet, we could technically run giant models if they worked on 1.58bit precision >>106915856
>>
>>106915969
>legit profit now
money ouroboros is not legit profit
>>
File: AnimateDiff_00013.mp4 (1.79 MB, 848x480)
1.79 MB
1.79 MB MP4
Cars are impossible, also hitting people with the vehicle. But retaining the original face is kinda ok.

>>106915837
5090.
>>
File: QwenEdit_00102_.png (1.44 MB, 968x1072)
1.44 MB
1.44 MB PNG
>>106916018
So is it true JLo's dead?
>>
KEK, tried the 2 light lora setup for nsfw: https://files.catbox.moe/x6d2pv.mp4

>>106916058
Bullshit.
>>
File: QwenEdit_00103_.png (1.03 MB, 1176x880)
1.03 MB
1.03 MB PNG
>>106916076
manmade horrors
>>
>He still uses upscalers
>>
File: QwenEdit_00106_.png (1.7 MB, 1120x928)
1.7 MB
1.7 MB PNG
>>106915779
this lora is fucking incredible man
>>
>>106915929
>What will be the next big model?
for local? Wan 2.2 without lightning lobotomy because there is a legitimate chance we go into a 1 year+ freeze for SOTA local videogen and hardware upgrades will unlock wan's full potential. All the small video labs ran out of money and the big Chinese companies are SaaSing

Maybe some 2.2 tunes, maybe some more garbage video+audio releases

We need better hardware almost more than we need better local models at this point unless you want to enter the rentmaxxing and borrowing-from-work era. Just look at the resident AMDlet and all the tourists that show up with 6-8gb of vram on their laptop GPUs. Kids these days don't even have computers and just play Roblox on their phones
>>
File: QwenEdit_00107_.png (881 KB, 1376x752)
881 KB
881 KB PNG
is it safe ?
>>
>>106916183
I will stick with my 12GB and you will accommodate for me
>>
8GB needs to be the standard for models
>>
File: This Is Why We Draw.jpg (153 KB, 1080x1350)
153 KB
153 KB JPG
This is what AI "artists" will never understand.
>>
>>106916240
I can just tell from that style it's that guy with the OnlyFans girlfriend isn't it?
>>
How do you / Can you prompt qwen edit to leave the background transparent or remove a background to render something?
>>
>>106916240
>I don't draw for likes and fame chud!
>OMG AI IS TAKING MY JOB, WHY ARE YOU PEOPLE LEAVING!! ;-;
>>
>>106916205
I'm sticking with my 5070ti, I'm really happy I didn't buy a 5090 just for wan. Honeymoon phases always end even if it's the technology of your dreams

Since during my peak addiction phase I stated that I would pay $500 for wan 2.2, Iwould probably pay $2000 for the weights of a locally runnable sora 2 at this moment in time. Unfortunately sora2 is nipple and genital censored but it would be amazing for all audio goons and anything non nude

>>106916252
Oatmeal is a cuck like idubbz? I love learning about Internet people I don't care about. I literally had no idea who charlie kirk was until he got necced and I'm on 4chan like all the time
>>
>>106916263
>Oatmeal is a cuck like idubbz?
dunno who this fucker is, but if he's a leftist, I'd say likely
>>
>>106916261
There's plenty of other AI's that can remove backgrounds though, and far better than qwen can
>>
>>106916275
Locally?
>>
>>106915828
I've tried the new kj high lora, tried the lora combo, i still get slow motion.
The only way to not get it for me is to add a 1 step high pass without any loras right at the start. Works every time.
>>
>>106916281
>https://github.com/john-mnz/ComfyUI-Inspyrenet-Rembg
>https://github.com/PramaLLC/BEN2_ComfyUI
There might be newer ones, I haven't had to use one for a bit.
>>
>>106916281
https://github.com/GeekyGhost/ComfyUI-GeekyRemB
>>
>>106916284
what cfg do you use?
>>
File: G3VfuYGXQAAQH_0.jpg (57 KB, 917x499)
57 KB
57 KB JPG
>>106916240
>>
>>106916297
1 cfg for the entire process
>>
File: 1759194856156686.png (131 KB, 832x891)
131 KB
131 KB PNG
>>106916240
this guy is crashing out lool, only yoko taro can make coomer art while being seeing as a serious philosophical pundit
>>
>>106916290
>>106916296
You guys are posting slop.
This is what you should get.
https://github.com/1038lab/ComfyUI-RMBG
>>
>>106916323
GeekyRemB uses RMBG, dumb-dumb.
>>
>>106916290
>>106916296
>>106916323
Thanks fellas
>>
>>106916320
He should take up AI
>>
>>106916336
If what you are saying is that it uses BiRefNet, then that is true. It doesn't use the nodes in the repo I linked. The repo I linked supports more models, including inspyrenet. It has over 1k stars. You posted AI slop repos.
>>
Is there a better color matching extension for Comfy than this? It's kind of shitty, especially for video. Premiere's Lumetri is a lot better, but having to load it up and set the clips/images up manually is a pain in the ass.
>>
File: QwenEdit_00112_.png (1.04 MB, 1024x1024)
1.04 MB
1.04 MB PNG
>>
>>106916365
I use it but reinhard. And then I slap on that rgb matching node as well. It's rgb or something color, lets you set the color profile.
>>
>>106916365
https://github.com/regiellis/ComfyUI-EasyColorCorrector
>>
any local video upscaler model that gives better results than 4xultrasharp for realistic gens?
>>
>>106916395
SeedVR2 is the best video upscaler currently available imo. Better than Topaz, which used to be the goat. Gonna need a beefy GPU though.
>>
File: file.png (2 KB, 274x55)
2 KB
2 KB PNG
>>106915956
bro, I'm really fucking sorry for you, like no cap hope it'll get better for you
pic rel is the s/it I get with my 4080 with qwenedit 2509 + 4 step lora
>>
>>106916336
Sorry for being harsh.

They might not be slop, but early implementations (aside from Geeky). But yeah, I've tested a lot of different models for background removal and BEN2 and Inspyrenet did the best, but it probably depends on the image. However BiRefNet, also called rmbg at times, is also supported, among other models. Where the confusion comes in at is the rmbg can also just generically stand for remove background.
>>
>>106916419
jealous
hopefully amd chinks can figure out rocm
probably not
>>
File: SS051022.png (185 KB, 1632x979)
185 KB
185 KB PNG
>>106916395
If you do end up trying it, these are my settings. Make sure you get the nightly of the videoupscaler, otherwise you won't have the extra args node. Uses about 20GB VRAM peak. You can reduce usage by lowering batch size (50 or so) and vae tile size (512), but the lower you go, the more artifacts you'll get.
Don't use the FP8 model, it's broken and causes seams. And don't feed it interpolated video, gen at 16 raw, then interpolate after you run it through the upscaler, that way it has half as many frames to upscale.
>>
File: file.png (1.18 MB, 928x1152)
1.18 MB
1.18 MB PNG
>>106916240
what a coper
>>
>>106915929
>What will be the next big model?
>>106915969
>money's drying up,
the next big model is unironcally going to be the next small model that is just as smart as the big models and can by hyperfocused on a single task and/or knowledge area.
The future of LLMs is lots of smaller, smarter hyper-focused models that communicate with each other, not some god model that knows all.
>>
>>106916459
based
>>
>>106916459
soulless
>>
File: tls.gif (1.86 MB, 300x164)
1.86 MB
1.86 MB GIF
>>106916459
>the model obscured the pantyshot
>>
>>106916380
>>106916390
Nice, I'll try both. Thanks bros.
>>
>>106916459
should've left the text
>>
>>106916459
She ain't blueskinned.
>>
>>106916447
are those the best settings for 24gb vram?
>>
File: file.png (1.25 MB, 928x1152)
1.25 MB
1.25 MB PNG
>>106916494
>>106916504
>>
How many steps is generally good for Chroma?
>>
>>106916511
Works for me, but it'll depend on your base video res and what you want to upscale it to, so you might have to adjust it as you go to keep it under 23-23.5.
>>
>>106916540
26-35
>>
>>106916540
res_2s / bong_tangent = 20 steps OR euler / sigmoid_offset = 35 steps
>>
>>106916540
about tree fiddy
>>
>>106916541
thanks anon
>>
>>106915556
>load brian.json in venv
>link to t5 clip in workflow
>eggsecute
It not hrad
>>
File: QwenEdit_00121_.png (1.01 MB, 912x1144)
1.01 MB
1.01 MB PNG
>>106916240
>>106916459
>>
>>106916594
try to change the text into something more funni too
>>
>proving his point with every soul extracted gen
lawl
>>
>>106916613
cope
>>
>>106916613
>soul extracted
it just looked like rough shitty unfinished work
and for people who bitch about ai not knowing how to draw fingers...
>>
File: QwenEdit_00122_.png (1.29 MB, 1048x992)
1.29 MB
1.29 MB PNG
>lawl
>>
>>106916613
didn't know that cuck with the OF girlfriend was lurking on ldg lawl
>>
File: file.png (1.37 MB, 1056x992)
1.37 MB
1.37 MB PNG
>>106916613
is this u perchance?
>>
File: oh he mad.png (102 KB, 472x468)
102 KB
102 KB PNG
>>106916613
>lawl
>>
Quality bait
>>
imagine if bitnet wasn't a meme...
>>
File: ComfyUI_temp_rfbnz_00010_.png (2.09 MB, 1152x1536)
2.09 MB
2.09 MB PNG
This was a mistake caused by getting my noodles crossed, but I like the effect.
>>
File: file.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>106916688
>>
>>106916741
kek
>>
File: QwenEdit_00126_.png (1.2 MB, 1152x904)
1.2 MB
1.2 MB PNG
has science actually gone too far
>>
>>106916787
lmaooo, that's a good one
>>
>>106916461
Ever read a paper on deep neural quantum networks?
I would rather go with the god-like model.
>>
Weird, wan t2i, same prompt, just different seeds. Gens are different too, but have small areas with very similar features. Noticeable when you scroll gens fast enough, like some parts of images don't changed.
Maybe because of these meme res2 and beta ksampler options.
>>
>>106916636
Pretty funny
>>
File: ComfyUI_temp_tqmxg_00002_.png (2.19 MB, 1296x1728)
2.19 MB
2.19 MB PNG
>>106916728
Depending on how exactly you cross the wires, the effect is increased or decreased. Seems obvious that a majority black input image produces a dark output image, but I'm still surprised by the effect.
>>
File: QwenEdit_00130_.png (1.1 MB, 1192x872)
1.1 MB
1.1 MB PNG
>/ldg/
>>
>>106916545
res multistep + beta works too
>>
File: WanVideo2_1_T2V_00012.mp4 (1.88 MB, 960x528)
1.88 MB
1.88 MB MP4
any idea why the masking stops working completely when the scene changes? I'm using sam2 segmentation
>>
>>106915334
> 2026
> 22b model
it's funny how 3b sdxl finetunes can do the same or better while taking 10x less resources
>>
>>106916240
How tf is a drawing gonna ‘help someone in need’?
Come down from your high horse faggot ‘artist’, you draw somewhat stimulating lines but that’s it
>>
>>106916320
What a wanker
>>
>>106916390
This ended up being the best option for img2img upscaling and inpainting. Leagues better than KJ's Color Match.
>>
>>106917321
>How tf is a drawing gonna ‘help someone in need’?
this nigger thinks his coomer drawings helps someone philosophically or something, he's so up his ass it's comical
>>
>>106917321
>>106917331
It's just engagement rage baiting. X encourages that kind of behavior.
>>
File: QwenEdit_00144_.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
>>
>>106917345
He just wanted some water
>>
>>106917335
that's what I hate about twitter, Elon said he wanted to bring back "free speech", but is it really free speech when your speech is supposed to be as rage baiting as possible to get money?
>>
>>106917329
>histogram natching
nta, but that's genius
t. initiated in digital imaging
it'd work even better if you input a very specific histogram as refference, for example right arm reference for right leg
>>
how can I save the intermediate state with the workflow json?
>>
>>106917369
That’s the formula with the 200x character limit
Only quick shouting
I agree with your sentiment
>>
All I could think of when doing my sets to failure, was how much I wanted to gen..
Maybe I need a break.
>>
>>106917369
Use it like you'd use reddit and even 4chan, only for niche shit you're into. Avoid places infected with or highly susceptible to politics/culture war trash/rage baiting.
ie, my X feed is AI tech, space stuff and dogs being dogs. Much better than half a billion pajeets larping as British patriots, or the latest Netflix adaption filled with brown people.
>>
hw requirements for wan 2.2 i2v lora training?
>>
>>106917449
>All I could think of when doing my sets to failure, was how much I wanted to gen..
I have squatting rack and bench press setup next to my pc so I have something to do while training/tuning. Train while you train
>>
>>106917460
if you have to ask
>>
>>106915366
uh link to lora?
>>
>>106917485
ldg is so useless
>>
>>106917481
I do that on arm days.
>>
File: ComfyUI_temp_ajxyi_00001_.png (1.52 MB, 1344x1024)
1.52 MB
1.52 MB PNG
>>
>>106917460
Basically anything, thanks to block swapping, but it will be slow for vramlet
>>
File: media_1760144848.png (1.37 MB, 768x1280)
1.37 MB
1.37 MB PNG
>>
File: 00000-3496102137.png (371 KB, 512x640)
371 KB
371 KB PNG
>>
>>106917731
Yoda plays not with me anymore. Yoda thinks me not worth it.
>>
>>106917329
My output video is just 1 single image across all frames when I hook the vae color corrector up. It goes in between the vae decode and the video save node, right?
>>
>>106917769
Nah, that's just for img2img and inpainting, like I said. There's a similar node in the pack for video, though I haven't tested it yet.
>Batch Color Corrector
>>
>>106917796
Oh I thought it was all for video. I see now, I'll try the batch one.
>>
>>106917837
Meh, it doesn't fix the brightness shift at all.
>>
>>106917870
Set it to manual mode, play with brightness/contrast/gamma.
>>
File: 3516619052.png (1.79 MB, 896x1152)
1.79 MB
1.79 MB PNG
>>
File: ComfyUI_temp_ifkup_00004_.png (3.79 MB, 1600x1200)
3.79 MB
3.79 MB PNG
Cool hose.
>>
I have been trying qwen-edit and I think my setup is somehow wrong. The resulting image is quite wrong/weird. It looks like a blurred merge of the base image and the modified one, for example I will write a prompt about removing an object from an image while preserving the rest of it. The resulting image will have that object transparent and the image will mostly be blurred/smoothed compared to the original. I'm using the Qwen Image Edit template that come with ComfyUI, only replaced the clip and model loader with GGUF one.
>>
>>106916390
Thanks for the recommendation, I will try it.
But man, what's with so many project descriptions having a deluge of emojis, it's really annoying.
>>
Best vagina/pussy loras for w2.1 or w2.2? What do you guys use? I2V they always look kinda weird.
>>
>>106918150
Autism
>>
>>106918151
Try "MysticXXX", I haven't done much testing, but I think it might work pretty well.
>>
File: ol.png (386 KB, 1000x572)
386 KB
386 KB PNG
>>106918151
I trained my own for 2.1, then 2.2, since the ones available on civit were and are ass. And no, I won't share it, BEGONE
>>
>>106918151
I'd rather have someone make an i2v "tip of penis" one, right now every blowjob changes the penis to some kind of infinite flesh tube whenever the girl gets it out of her mouth
>>
>>106918198
True, and when they can make a good penis they change the woman's face.

>>106918186
Aw man :(
Can you at least share the training method?

>>106918178
It looks nice, will test this one.
>>
>>106916914
you can add noise to the diffusion process in patterns or waves of different colors/brightness to modify your images

>>106917301
high strength loras degrade the model anon

>>106917460
i've done it with the 5B model on a 3090, the 14B is just too damn slow
>>
>>106917976
Much like llm smiles, the brushstrokes don't quite reach the eyes.
>>
File: file.png (2.51 MB, 1328x1328)
2.51 MB
2.51 MB PNG
>>
File: dmmg_0050.png (1.13 MB, 896x1152)
1.13 MB
1.13 MB PNG
>>106917536
gonna upload to civit over the weekend maybe, i still need to gen all the samples and give it a write up
>>
>>106918227
>Can you at least share the training method?

>1998 images
>150 videos
>focus on diversity of body types and camera angles
>1/3 of images are extreme closeups of genitalia, 1/3 of videos are spreading labia to teach it the physics, otherwise pussy flaps can randomly flutter like wings
>joycaption images, hand caption videos
>spend two weeks cleaning up captions
>spend a couple hundred bucks to train it, fail a few times, then succeed
>horde it because fuck you I did all the work and its mine
>refuse to elaborate ever again
>>
>>106917382
> intermediate state
wdym
>>
>>106918343
Missouri
>>
>>106917460
I've heard 12gb for pics, 16gb for short clips.
>>
>>106917629
ramtorch promised batch size to ignore swap slowdown
>>
File: 1750593553513977.png (197 KB, 1280x1278)
197 KB
197 KB PNG
>>106918186
>join a community with a selfish attitude
>>
File: media_1760144877.png (1.34 MB, 768x1280)
1.34 MB
1.34 MB PNG
Make her tits bounce. It is very important mission foe the betterment of humanity.
>>
what does it mean when a WAN prompt comes out blurry
>>
>>106918343
it means you are a retard
>>
>>106918426
res too low, not enough steps, retarded settings, etc
>>
>>106918434
figure out by yourself then, nigger
>>
>>106918426
What's your ksampler config?
I usually do 9 steps
6 switch from high to low
cfg 1.5~3.5 on high depending on the level of movement you need or the original image or if it's flashing too much or whatever
1.1 on low to get the negatives
I'm not saying this is the best its just what I use and it works enough
Euler / Simple
>>
>>106918464
Ksampler 1:
noice seed: random numbers
control after gen: random
steps: 2
cfg: 1
euler
sched: simp
start at step : 0
end at step : 2
return with leftover noice enable

Ksampler 2:
noice seed: random numbers
control after gen: random
steps: 2
cfg: 2
euler
sched: simp
start at step : 2
end at step : 4
return with leftover noice disable
>>
>>106917295
https://www.reddit.com/r/StableDiffusion/comments/1o2sves/
https://www.reddit.com/r/StableDiffusion/comments/1o7r7sb/
try this, looks p good. haven't tried it myself tho as i deleted all my shit currently.
>>
>>106918488
>noice
hmm thirdie learn to write english maybe before trying to do stuff thats too hard for your little brainy? :)
>>
>>106918488
>>
>>106918488
the problem is steps: 2.
you need to put steps: 4 in both, fucking sopa de macaco retard UMA DELICIA kys
>>
>>106918504
why end at 10,000 steps ? does the sampler just ignore after step 9 ?
>>
File: wan.png (76 KB, 314x500)
76 KB
76 KB PNG
>>106918504
Just use this. It works out optimal steps for high/low depending on the total.
>>
>>106918524
yeah so I don't need to change it if I do 4-6-9 steps total
>>
>>106918450
you don't have the answer anyway because you are a retard
>>
>>106918526
doesnt this try to load both models at once? isnt it better to just use the custom scheduler this nodepack provides?
>>
>>106918426
>>106918488
you need more steps
also it depends on the gen, dynamic will be blurry

>>106918501
please consider doing a kys to yourself
>>
>>106918537
>doesnt this try to load both models at once
Do you even have to ask?
No. That would be retarded as fuck.
>>
im trying to get a picture of a woman in a bikini in WAN 2.1 I2V to have her breasts expand and her bikini rip and fling off.

it kind of works in grok imagine but I guess I cant use the same key words in WAN ?
>>
>>106918544
ur gay
>>
>>106918534
> assumptions of a nigger
>>
>>106918555
You'll need a LoRA obviously, base Wan can't do that shit
>>
>>106918526
Never seen this, will test it
>>
File: 1725111900484011.png (108 KB, 362x295)
108 KB
108 KB PNG
how do I get FLUX schnell to start with a picture ?
>>
HOLY FUCK CODE RED THEY'RE COMIN OUTTA THE GOD DAMN WALLS NOW REEEEEEE
>>
File: 81649384.mp4 (3.18 MB, 848x480)
3.18 MB
3.18 MB MP4
We still doing these?
>>
>>106918526
people still recommending this when you could already do that with kijais nodes really shows how retarded some of you niggers are
>>
>2025
>no good general audio models with training code
beyond owari
>>
>just use shitty closed ecosystem nodes bro
>>
>>106918597
shitters dont know better. sad!
>>
>>
>>106918585
>no good general audio models
always wondered why thats the case, like you'd think making AI music would be even above making pics and videos and yet basically no open source stuff exists for it.
>>
chroma emma lora is fucking great. do you guys have more celebs?
>>
Bros, stomach bulges are impossible to prompt for. It just doesn't understand it at all.
>>
>>106918611
because music is a copyright nightmare worse than image/video, and the holders are VERY aggressive usually.
For now we gotta cope with mmaudio and songbloom, both of which are NOT very good.
>>
>>106918611
it's a lot quicker and easier to ascertain the quality of a video/image model. anything beyond robotic TTS is deemed "too dangerous" to release. i just want a model that can moan
>>
>>106918623
>because music is a copyright nightmare
and yet udio and suno faggots are still in business doing better than ever.
>>
>>106918642
>udio and suno
They both have pending lawsuits
>>
>>106918642
but they didnt release weights, training methods or papers related to it. You cannot train open models on copyrighted music, this is the issue, they're instead probably doing it behind closed doors.
>>
>>106918567
img2img
>>
How the FUCK do I stop Wan 2.2 with lightx2v from over brightening/exposing the source image?
>>
>>106918654
which apparently go nowhere and have zero impact on them, they just keep launching more advanced shit all the time lol.
>>106918661
>doing it behind closed doors.
how is that making this any better?
I wish one of the niggers working there would just leak the model.
>>
>>106918687
You don't. Some starting images are worse than others. It's just how it is
>>
>>106918693
>leak the model
this isn't happening ever again
>>
>>106918693
The only closed source model leak that I can think of was NAI. The really big companies haven't had a single leak and likely won't, data security for these guys is tighter than Satan's asshole
>>
>>106918698
>>106918709
cmon niggers let a man dream ok?
>>
>>106918697
That's fucked. Wan 2.1 had the same issue, but it was easy to post process afterwards, 2.2 over exposes it to fuck and I can't really fix it.
>>
File: ComfyUI_06175_.png (1.24 MB, 1104x944)
1.24 MB
1.24 MB PNG
>>
any speed up lora is cope
>>
>>106918526
This is fucking shit, by the way. I just tested it and it blew the exposure out on every gen compared to the regular 2 sampler setup
>>106918721 if you're using that node, go back to dual ksamplers
>>
>>106918731
You can just say you're retarded.
>>
>>106918731
this
lightx2v is dogshit
>>
You wont be getting Sora 2 at home.
You wont be getting UDIO at home.
(You) lost.
>>
>yeah bro just take 30 minutes per gen for barely any difference in quality trust me bro its worth it
>>
fuck
I can't remember the name of the best local vidgen model we had before wan
>>
>>106918770
hunyuan
mochi-1
>>
>>106918774
>hunyuan
alien doing pushups
>>
>>106918526
>>106918736

Had the same experience. Generation took a little bit less (with cfg 1/1) time but quality drastically decreased. With my previous cfg 2/1.1 was the same time and quality decreased too.
>>
>>106918736
>if you're using that node, go back to dual ksamplers
Yeah, I *was* using it. It was recommended here a couple threads back. I tried two k sampler nodes just now though and the difference is huge. That's what I get for trusting you homos.
>>
>>106918774
was mochi-1 a thing
>>
>>106918504
k I used these settings and the static blur is gone but now the character is not following the prompt.

do I have to crank the LORA ? its at 1
>>
File: 1695679128574695.jpg (66 KB, 720x480)
66 KB
66 KB JPG
>mfw after a day of trolling grok imagine I got it to finally do a video tits and vagina
>>
>>106918822
that's not legal saar please read grok terms and conditions, no bob and vagene
>>
>>106918810
it beat hunyuan to the punch but was worse and also required datacenter gpus to even run. was one of the more genuinely cinematic vidgen models though
>>
>>106918813
Maybe increase the CFG of the high?
More noise at high may improve prompt following afaik. Or maybe prompt better, idk.
>>
File: FOUND THE RETARD.jpg (21 KB, 828x409)
21 KB
21 KB JPG
>using higher than 1 cfg with a lightning lora
>>
how do I save uncompressed video as avi in comfyui
>>
>>106918849
>avi
GO BACK TO THE 90'S, FAGGOOOOOOT
>>
>>106918847
>using higher than 0 cfg for anything
>>
>>106918860
W-what happens when you use cfg 0? I've never done it
>>
>>106918865
https://www.youtube.com/watch?v=rGmiPTmj8eM this in AI form
>>
>>106918833
I once managed to inflate the tits to like balloon size. then I tried to get milk to come out but it would never come out of the nipple.

grok imagine is fucking weird. some times it will not allow a prompt because the word "breast" other times it wont.
>>
>>106918327
Cant blame you. You'd barely get a like for your trouble on civitai
>>
File: file.png (18 KB, 310x278)
18 KB
18 KB PNG
>>106916447
Why don't i have this node?
>>
>>106918555
>grok imagine
I remember when that faggot Elon said xAI would be all about open source, he specifically called out OpenAI for acting like dragons hoarding gold piles when it came to their models. Still waiting for his image and video model to be open sourced, or the latest versions of Grok. Any day now, right?
>>
>>106918941
Go to custom_nodes, delete old SeedVR2 directory, cmd.exe
>git clone -b nightly https://github.com/numz/ComfyUI-SeedVR2_VideoUpscaler.git
>>
>>106918946
they release models 2 generations older than the flagship, and their first image model was just flux.
>>
>>106918941
you're using the old implementation, here open wide:
https://github.com/AInVFX/ComfyUI-SeedVR2_VideoUpscaler/tree/nightly
>>
>>106918946
grok imagine based on some version of flux.
>>
is there like an structured guide that teaches you what most of the settings in comfyUI nodes even mean? Or did you guys just learn by fucking around ?? this stuff is kind of overwhelming
>>
>>106918969
Learn by fucking around. Never use other people's workflows except to work out some feature they have that you'd like to integrate into your own. Learn how to make your own workflows, even if it feels like pulling teeth.
I started with this guy's tutorials, they're pretty good :

https://www.youtube.com/watch?v=Yk8aS233HP0&list=PLn4FL274ScykR8q0C4UD6mm0K74BAd-pO
>>
>>106918956
'clone' is not recognized as an internal or external command,
operable program or batch file.
>>
>>106918983
Install git, you git.
>>
>>106918956
>>106918961
thanks
>>
>>106918941
>>106918961
>here open wide
kek what is a noob going to do with that?

cd custom_nodes/ComfyUI-SeedVR2_VideoUpscaler
git checkout nightly
cd ../..
>>
File: image_00040_.jpg (591 KB, 1264x1712)
591 KB
591 KB JPG
>>
File: WanVideo2_2_I2V_00814.webm (2.48 MB, 704x1280)
2.48 MB
2.48 MB WEBM
her feet are way too small but im glad it didnt shrink her
>>
File: 1652032026221.jpg (187 KB, 728x727)
187 KB
187 KB JPG
>>106918755
it's not just sora. we don't have anything at home. not even the old hailuo 1. nvidia and china have ripped off ai users. i'm glad I only bought 12gb settings. imagine being poor for WAN 2.2 kek
>>
>>106918993
I mean, people should put in a little effort instead of being completely spoonfed, no? Or do you want complete retards coming here asking for tutorials for everything?
>>
is there a place to anonymously upload low effort NSFW AI slop ?
>>
>>106919042
and generate a bit of money, like enough for grok imagine heavy user to pay for its self.
>>
>>106919042
Twatter for realistic, any booru which accepts AI slop if you do anime instead
>>
>>106919042
>>>/gif/vdg/
>>
>>106919050
>generate a bit of money
post lewds to twitter. build a following.
create a patreon. post nsfw there.
>muh effort
no effort no money
>>
>>106919050
Patreon after getting word of mouth on X or somewhere else.
>>
>>106919075
wait you cant get banned on Twitter for straight up posting porn ?
>>
File: image_00043_.jpg (543 KB, 1264x1712)
543 KB
543 KB JPG
>>
>>106919086
post lewds and not porn so you dont get shadowbanned. if you post porn you will simply never get into the algorithm and will not grow
>>
>>106919086
this ain't 2009, dumbo
>>
>>106918969
dont listen to this retard >>106918978

always just use the most popular workflow or the one provided by the model creator and change that if you need to
>>
>just pick up other people's used smokes from the ground, don't learn to roll your own
>>
>>106919114
>always just use the most popular workflow
malicious advice
>>
>>106919095
what exactly is the cutoff point of a lewd? the smallest but of areola
>>
>>106919114
dont listen to this retard >>106918969

always just use your own workflow never the most popular or one provided by the model creator
>>
>>106919095
>shadowban
That explains my very low view count. I should delete my acc and re-use the name, but it's a whole month for it all to get deleted.
>>
>>106919120
there are too many parameters that are arbitrary beyond the fact that the specific model was trained with it in mind, so randomly trying things from literally nothing is low iq nigger advice
>>
>>106919139
be aware that Grok is visually inspecting everything you post to determine if it should be boosted or not.
>>
>>106919138
>>106919121

>>106919149
>>
>>106919114
>most popular workflow
Enjoy anything everywhere aids and other assorted cancer
>one provided by the model creator and change that if you need to
Generally okay for smaller models or niche models. For stuff like SDXL, Flux/Chroma, always make your own workflows rather than using some civit bloatware. So the advice to follow tutorials on how to actually make an SDXL/Flux workflow from scratch, including txt2img, img2img, inpainting, etc, is solid.
>>
>>106919150
So basically treat it like only simple ecchi is allowed, areolas, camel toes, asscracks? Anything that suggests nudity.
>>
>>106919172
If it'd get your gen removed from this thread, Grok will probably derank you
>>
>>106918978
>Never use other people's workflows
Reminder these are the browns that give their opinions here after genning with random resolutions, samplers, cfg, padding, steps and then saying the model sucks
>>
File: 00004-1463701549.jpg (842 KB, 1728x1344)
842 KB
842 KB JPG
>>
>use my hyperslop megaworkflow with 80 custom nodes xir
im good thanks
>>
>>106919194
dude have you seen the MONSTRUOS workflows posted in civitai with a bajillion custom nodes? are you fucking retarded?
>>
>>106919194
Not learning how to do something yourself and being fully reliant on other people is literally brownoid behavior.
>>
>>106919180
>>106919172
You can still post porn but you rely on other users to RT your content, but yeah its true, your account will never grow/grow really slow if you get shadowbanned from search/feed/etc.

Mainstream social media sucks for posting lewd content, its better to post your content on niche sites that with the same degenerate userbase, aka pixiv, deviantart, reddit, etc.

Also you have to deal with anti-ai retards that will mass report your content just because of spite and that also will get you shadowbanned
>>
does patreon have an algo? do people actively look for stuff or does it have to be linked and shilled ?
>>
>>106919215
This. I wouldn't tell people to never use workflows made by other people, that's retarded, but you should know the basics of how nodes and connections work, and what most of the settings on the workhorse nodes do. Otherwise, when the time comes that you need a custom function, or to do anything beyond what someone else's premade workflow does already, you're kind of fucked.
>>
>>106919249
agree.

start with a basic T2I and learn prompting first.


diving right in and playing with numbers sucks
>>
>>106919215
>Not learning how to do something yourself and being fully reliant on other people is literally brownoid behavior.
sounds more like amerimutt behavior
>>
>>106919271
There's a difference?
>>
>>106919012
>her feet are way too small
disagree
>>
>>106919225
I've accumulated over 200k views on pixiv, but on twitter it's like a total of 1500, lol. Posted porn since day 1.
Guess I'll re-think my strategy, I'd love to get a patreon going.
>>
>>106919323
>I've accumulated over 200k views on pixiv
I got banned the moment I got popular
>>
>>106919334
I got a copyright strike for doing umamusume stuff. Japs are fucking weird with copyright, probably because yakuza runs it all.
>>
my strategy:

troll grok imagine for nude:
take last frame of nude
use as starting frame next nude
>>
>>106919402
local general. fuck off
>>
few minutes
>>
>>106919520
>>106919520
>>
>>106915482
i like this
>>
>>106916320
i am words
words big
good big words
not really big
but words anyway
small words mostly
i am
thats all
>>
>>106916741
ay lmao



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.