/g/ - Technology




Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106891121

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: 00101-3171878302.png (287 KB, 768x960)
>>106868523
> So is SDXL still the best model to do this?
> I am trying to make Wan 2.2 porn videos based on photorealistic NSFW images of fictional characters in different sex positions. There are loras for different positions and acts for Wan 2.2, but as I understand they need the reference image to be in that position already for good results.
Have you figured it out?
Some wan 2.2 loras work well with 1character reference image and "switches to the next scene:" prompt. I wish there were specific concept lora for such transitions: 1-4 frames of selfie/portrait/1girl standing and the next frames are the same character in different scenarios. Something like next shot lora for qwen edit.
Speaking of qwen edit: there is nsfw lora for regular qwen on civitai, claimed by someone in comments to work with qwen edit - you could try it.
>>
>a vast, sun scorched desert at golden hour, a lone tumbleweed rolling gracefully across cracked, arid ground, casting long, dramatic shadows. Dust swirls in its wake, catching the warm, amber light of the setting sun. Cinematic composition, wide-angle shot, moody atmosphere, soft lens flares, rich textures of dry earth and brittle tumbleweed, vibrant orange and purple sky, evoking solitude and timelessness, hyper-realistic, 4K resolution, inspired by Sergio Leone’s Westerns.
>>
If you think about it, the Chroma downers are keeping normies from ruining things. Booba-capable means booba-only to them
>>
File: ComfyUI_03357_.png (1.42 MB, 1024x1024)
>>
File: ComfyUI_03370_.png (1.51 MB, 1024x1024)
>>
File: ComfyUI_03381_.png (1.75 MB, 1024x1024)
>>
>increase cfg
>oom

???
>>
>>106896214
Don't forget flatties-capable
>>
I need compute... I can't gen or train...
>>
File: 1759822449004081.jpg (952 KB, 1248x1824)
>>
File: sdgsdgsfsfw.png (218 KB, 486x713)
I will report all of you poojeets doing this, to the authorities.
>>
>>106896432
whats wrong with it?
>>
>>106896474
Clearly some mentally ill fuck making loras of girls he's stalking.
>>
>>106896502
saar please do the needful and link lora next time so we can report faster, thank you kind sar
>>
>>106893590
any help
>>
>>106896995
I've been trying to figure it out for like 2 months. It's just inherent to wan. It's a delicate mix of settings for each individual image.
The people who tend to not get it are the ones using horrible 3d/realistic slop because it's so contrasty and highly visible details etc.
>>
>https://github.com/Nerogar/OneTrainer/pull/1056
Cool. I wanna see how Chroma2k takes loras
>>
>>106897014
FUCK
>>
>>106897027
The chroma loras are interchangeable. It depends more on the resolution you train the dataset at than on the model.
>>
>>106897043
A workaround that is sometimes successful is to do the loop in two videos.
Video 1 is just First frame.
Video 2 uses the last frame of video 1 as first frame and then the original first frame as last frame.
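Rough sketch of how the two clips from the workaround above could be joined afterwards. It assumes each clip is already decoded into a list of frames, and that the boundary frames repeat (clip 2 starts on clip 1's last frame and ends on clip 1's first frame), so those repeats get dropped. `stitch_loop` is a made-up helper name, not part of any UI.

```python
def stitch_loop(clip1, clip2):
    """Join two clips into one loop, dropping the repeated boundary frames.

    Assumes clip2[0] repeats clip1[-1] and clip2[-1] repeats clip1[0],
    which is what the first-frame/last-frame setup is aiming for.
    """
    if len(clip2) < 2:
        raise ValueError("clip2 needs at least a first and a last frame")
    # Keep all of clip1, then only the in-between frames of clip2.
    return list(clip1) + list(clip2[1:-1])


# Frames are stand-in strings here; real frames would be images.
clip1 = ["A", "b", "c", "D"]
clip2 = ["D", "e", "f", "A"]
print(stitch_loop(clip1, clip2))
```

Since generated boundary frames only approximately match the conditioning images, it may look better to keep or blend them instead of dropping them.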
>>
File: 00091-2502442138.png (2.32 MB, 1248x1824)
>>
File: ComfyUI_00102_.webm (319 KB, 960x720)
you gotta get on that plane
>>
File: ComfyUI_temp_lsjjl_00002_.png (3.86 MB, 1440x1440)
>>106897071
>>106897027
Actually does it mean I can load q8 and keep the spare vram for 1024 training instead of 640? Hmm hmm
>>
>>106896995
There's no easy fix with ComfyUI. You need to use Adobe Premiere and adjust the brightness and contrast for those specific frames in order to match them to the other frames. Fool proof, but irritating.
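For what it's worth, the same kind of per-frame brightness/contrast match could probably be scripted instead of done by hand. A minimal sketch, assuming frames are flat lists of luminance values in 0..1 (in practice you would use image arrays); `match_brightness_contrast` is a hypothetical helper and this only does a global linear match, nothing smarter.

```python
from statistics import fmean, pstdev


def match_brightness_contrast(frame, reference):
    """Rescale `frame` so its mean (brightness) and standard deviation
    (contrast) match those of `reference`. Values are clamped to 0..1."""
    f_mean, f_std = fmean(frame), pstdev(frame)
    r_mean, r_std = fmean(reference), pstdev(reference)
    # A flat frame has no contrast to scale, so leave the gain at 1.
    gain = r_std / f_std if f_std > 0 else 1.0
    return [min(1.0, max(0.0, (p - f_mean) * gain + r_mean)) for p in frame]


dark_frame = [0.2, 0.4, 0.6]
good_frame = [0.3, 0.5, 0.7]
print(match_brightness_contrast(dark_frame, good_frame))
```

Whether a global match is enough depends on the footage; the flicker described above may need per-region correction.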
>>
>>106888911
> Please can anyone help, I'm trying to generate batches of images with the same seed for the wildcard generator and also the ksampler.

> This so far seems impossible. (generate every n images change the seed) Even with a counter node I can't do it because you can't reset the counter.

> You also cant generate a seed with a node then use that seed for the ksampler or the wildcard processor as when you change it it doesn't register until you start and stop generating.

> what are seed node with increment on run and math node with `seed // step`
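The `seed // step` idea in plain Python, as a sketch: a counter that goes up by one on every run, integer-divided by the batch size, gives a seed that stays put for n images and then moves on. `batch_seed` and `base_seed` are names made up for illustration; in a workflow the counter would be the incrementing seed node and the division a math node.

```python
def batch_seed(run_index: int, images_per_seed: int, base_seed: int = 0) -> int:
    """Seed that stays fixed for `images_per_seed` consecutive runs, then advances by 1."""
    if images_per_seed < 1:
        raise ValueError("images_per_seed must be at least 1")
    # Integer division groups consecutive runs into blocks sharing one seed.
    return base_seed + run_index // images_per_seed


# Ten runs, new seed every 4 images.
print([batch_seed(i, 4, base_seed=1000) for i in range(10)])
# -> [1000, 1000, 1000, 1000, 1001, 1001, 1001, 1001, 1002, 1002]
```

Feed the same result to both the wildcard processor and the ksampler so they change together.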
>>
>>106897134
Just ask Gemini Pro to code what you need.
>>
how come ComfyUI doesn't output any warning when I use a Flux lora with a SDXL model? It used to.
>>
File: 00103-1260433749.png (2.48 MB, 1248x1824)
>>
>>106897151
this, I've got like a dozen custom nodes for small little things specific to my workflows or needs
>>
File: RaMu ScArEd.webm (3.92 MB, 960x1280)
Why does Wan only give me French skeletons, is that what they look like over there?

>>106897027
Cool. With Flux it was either 512@fp16 or 1024@fp8 if you wanted everything to fit in 24GB of VRAM. 1024@Q8 seems like it could be a nice compromise (assuming it fits). I don't know if anyone's ever tested/compared the various precisions in training though, could be just another bottle of snake oil.
>>
bros gimme inspiration for my 1girls
>>
>>106897316
Black, obese, disabled, lesbian, wheelchair, single mother, communist, queen, girl boss.
>>
File: 00118-2677389798.png (2.22 MB, 1824x1248)
>>
>>106897403
give her socks
>>
>crank up enhance-a-video to 400% to see what happens

Lol, did not expect that. What the fuck?

https://files.catbox.moe/pxfnni.mp4 nsfw
>>
File: 00125-30977266.png (2.19 MB, 1824x1248)
>>106897414
defeats the purpose of why she's appealing to begin with.
>>
File: image_00006_.jpg (582 KB, 1264x1680)
>>
>>106897498
chroma?
>>
File: image (3).jpg (273 KB, 1024x768)
>>106891747
> What is the proper way of doing video continuation without being an independently glued i2v mess with no proper continuity?
> My mind is ripe with degenerate goon ideas for genning multi-staged stuff where each section of the video has its own prompt and lora. It has potential to make me abandon real porn forever but its sucking ass the way I'm doing on wan2gp

Load a video -> take last 4-30 frames, fill the rest, mask (important) -> input to vace node as control images -> denoise with new prompt, decode -> repeat with new video.
Automate it by asking an LLM: "please write a python program to run a comfyui api workflow with each prompt, modify nodes XYZ, copy the result video to the input folder"

The problem with video continuation is that quality degrades with each successive cut. You could try to avoid decoding/encoding by working with latents, but that will require you to modify the vace node code (and read the paper first to find out whether it's possible at all).
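A minimal sketch of the automation part, assuming a workflow exported in ComfyUI's API format (a dict of node id -> `class_type` / `inputs`) and a local server on the default `127.0.0.1:8188`. The node id `"6"` and the `text` input name are placeholders for whatever your prompt node actually is. It only covers swapping the prompt and queueing; waiting for each segment to finish and copying the result video to the input folder is not handled here.

```python
import copy
import json
import urllib.request

COMFY_URL = "http://127.0.0.1:8188/prompt"  # assumed default local address


def build_jobs(workflow, prompts, prompt_node_id, input_name="text"):
    """Return one request payload per prompt, leaving `workflow` untouched."""
    jobs = []
    for text in prompts:
        wf = copy.deepcopy(workflow)
        wf[prompt_node_id]["inputs"][input_name] = text
        jobs.append({"prompt": wf})
    return jobs


def queue_job(job, url=COMFY_URL):
    """POST one payload to the ComfyUI queue and return the decoded JSON reply."""
    data = json.dumps(job).encode("utf-8")
    req = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())


# Toy workflow with a single text node; a real export has many more nodes.
workflow = {"6": {"class_type": "CLIPTextEncode", "inputs": {"text": "", "clip": ["4", 1]}}}
jobs = build_jobs(workflow, ["first scene", "second scene"], prompt_node_id="6")
print([job["prompt"]["6"]["inputs"]["text"] for job in jobs])
# queue_job(jobs[0])  # uncomment with a running server
```

Because each segment depends on the previous output, the jobs have to be queued one at a time rather than all at once.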
>>
>>106897529
Stable Video Infinity's solved this, just waiting for a ComfyUI implementation. Only 2.1 for now, but 2.2 is in progress
>>
File: ComfyUI_06010_.png (1.13 MB, 1048x992)
>>
>>106897545
> racial attention solved this, bro
> two more weeks, bro
>>
File: chroma___0008.png (1.58 MB, 832x1216)
>>
File: 00127-1275124544.png (2.38 MB, 1824x1248)
>>
>>106897561
>racial attention
kek
>>
File: ComfyUI_06003_.png (1.29 MB, 1024x1016)
top kek
>>
>>106897475
give her socks anyways
>>
>>106897403
>>106897475
cute
>>
File: 275246866963124_00001_F.jpg (1.08 MB, 2000x3000)
>>
>>106897571
>[Deleted]
Why?
>>
>>106897673
"BOSS, NO, THAT WAS NOT A PENIS, IT WAS A TINY GIRL INSIDE A FORESKIN, IT'S NOT THE SAME, DON'T FIRE ME AAAA"
>>
File: 00142-2540135358.png (2.83 MB, 1824x1248)
https://files.catbox.moe/kgq333.png
https://files.catbox.moe/glqvbi.png
>>
>>106897673
It's one of life's mysteries.
>>
>>106897647
I've got something she can suck if you catch my drift
>>
File: image_00018_.jpg (525 KB, 1264x1680)
>>106897524
yeah
>>
File: 1622413540833.png (1.77 MB, 1920x1080)
>>106897027
Does it work OOB or is it gonna explode my entire install? Anyone tested?
>>
File: wan22___0091.png (1.22 MB, 832x1216)
>>106897316
>go to any image board (hint: you're on one)
>find a picture of a woman
>skip generic hole pics, find something with a setting
>try to prompt the full scene from scratch
>change it up a little bit

ain't nothing new under the sun
>>
>>106897838
holy sameface
>>
>>106897673
>>106897725
top kek
>>
File: image_00020_.jpg (534 KB, 1264x1680)
>>
File: 1738543166269111.png (929 KB, 1920x1080)
Looks like I asked in the wrong thread (?) so I'll post here too
>>106897810
>>106897810
>>106897810
Can somebody gibe de pusi pls?
>>
>>106897923
Not your personal army
>>
>>106897923
just use ChatGPT
>>
>>106897938
I was directed specifically to this thread to ask for help. Nigger.

>>106897952
That shit is poisoned to hell with ghibli style. Dumbfuck.
Also
>/ldg/ - Local Diffusion General
>"Just use the corpo model bro"
Nigger.
>>
>>106897970
Gotta learn to suck a little dick to get the help you need here, bro.
>>
>>106897970
too retarded to coom 2025
>>
>>106897970
>>106897923
>>106897907
>>106897810
>>106897897
I replied to you newfag
>>
>>106897983
I'm downloading it. I just don't know which thread is the good one. Why does /g/ have like 4 AIgen generals??
>>
>>106897987
Try /sdg/, seems about right for you
>>
>>106898001
>Have to embarrass myself by asking a fourth time
If I get a vacation for spamming it'll be your fault.
>>
im completely new here, just got a new gpu. i just want to faceswap a video, preferably with two faces. can someone point me in the right direction. slowly going through the links, but im not sure which ones are relevant.
>>
If I would like to rent a GPU from a cloud for video generation, what would be a good provider? Something that I could use for porn generation? Legal stuff, of course.
>>
>>106898046
LOCAL DIFFUSION MOTHERFUCKER
>>
>>106898010
wan 2.2
>>
>>106898010
Sorry sarr, that's illegal.
>>
>>106898010
>just got a new gpu.
why cant dumb newniggers just say what GPU they got? am I supposed to read your mind?
>>106898046
wrong thread saas nigger, this thread is for LOCAL chads only.
>>
>>106898081
I don't have the hardware at the moment. It takes 30 minutes to generate a shitty video.
>>
File: WanVid_00013.webm (732 KB, 720x960)
>>
>>106898107
>wrong thread saas nigger, this thread is for LOCAL chads only.

I thought the point of this thread is
>Discussion of Free and Open Source Text-to-Image/Video Models and UI
I am not trying to use someone else's video generator service.
>>
>>106898141
nobody here knows about server renting because we don't use it, you won't find an answer
>>
File: ComfyUI_06022_.png (1.24 MB, 1032x1008)
>>106897853
lol
>>
File: 00013-66846285.png (1.03 MB, 1024x1024)
>>
>>106897072
the problem is i'm not trying for a loop, i'm trying to use WAN to tween a series of keyframes
>>
>>106898093
ty
>>106898107
hehe my bad 5070ti
>>
>>106898204
and fix the wrong lane
>>
File: ComfyUI_06025_.png (1.27 MB, 1024x1016)
>>106898204
>>
>>106898124
nice
>>
>>106897574
Just use HD flash bro. It for the most part fixes background details, especially in crowded gens.
>>
File: ComfyUI_06026_.png (1.32 MB, 1024x1024)
>>106898204
>>
I will not plap.
>>
File: ComfyUI_05996_.png (1.25 MB, 1024x1024)
>>
>>106897923
If you use merges then there is no helping you, really.
>>
File: ComfyUI_06005_.png (1.38 MB, 968x1080)
>>
File: ComfyUI_06020_.png (1.16 MB, 1240x840)
>>
File: ComfyUI_06028_.png (1.25 MB, 944x1104)
>>
File: ComfyUI_06027_.png (1.21 MB, 824x1264)
>>
>>106898372
busa rape ojisan... I kneel
>>
im pulling bros
>>
I'm caching the gradients
>>
>>106898651
Be bold, pull nightly for all nodes, don't be a pussy.
>>
reticulating splines
>>
File: ComfyUI_06021_.png (1.04 MB, 1176x880)
>>
File: radiance.png (3.05 MB, 864x1488)
>>
fuckin shame that bg characters are fucking GARBAGE with neta, using qwen fucking spoiled me
>>
Idūs Octobres
>>
File: radiance.png (2.77 MB, 864x1488)
>>106898115
get the hardware. until then go see the other threads that do SaaS or IaaS? we're intentionally not doing that here, it is too different.
>>
File: radiance.png (2.78 MB, 864x1488)
>>106898951
qwen or better will eventually be trained too
>>
>>106898983
I'll finish my batch and unironically pass it through qwen to try and make the background not fucking garbage and a sharpener pass to make up for the vae loss.
>>
>>106898372
now do the full picture
>>
File: radiance.png (3.28 MB, 864x1488)
BTW chroma radiance 0.4 was pushed some hours ago:
https://huggingface.co/lodestones/Chroma1-Radiance/tree/main

>>106899006
that is probably the best option for now.

i like radiance and the backgrounds are pleasant + still getting nicer, but I doubt you'll be able to prompt multiple characters as separately with as relatively little feature bleed as with qwen even when it's trained.
>>
waiting for based kijai to implement lora for long vid https://github.com/vita-epfl/Stable-Video-Infinity
>>
>>106899261
He's wasting time with some dogshit useless upscaler, I don't get it, the infinite gen time is like orders of magnitude more important, especially when they have 2.2 in the roadmap.
>>
File: radiance.png (3.06 MB, 864x1488)
>>
File: radiance.png (2.63 MB, 864x1488)
>>106899261
>see the 10‑minute “Tom and Jerry” demo
is that published as output rather than a demo script somewhere?

if you can actually generate an only half insane 10 minute tom and jerry demo, that's damn impressive
>>
>>106899371
Why even post these when the model is half baked if even that? Complete eyesore
>>
File: radiance.png (2.38 MB, 864x1488)
>>106899435
because they look nice, nogen anon
>>
>>106899452
These look like shit. Safe to assume that you are trolling.
>>
>>106899435
Don't blame the tools when the artistic talent is not there.
>>
>>106899491
>These look like shit
this
>>
>>106899452
nta

You would have become a celebrated indie artist on internet 10 years ago by posting crap like this
>>
>>106899491
>Safe to assume that you are trolling.
don't underestimate the chroma cult anon, those guys are really mentally ill
>>
>>106899561
>celebrated indie artist
ldg is a celebrated indie art collective.
>>
What do you recommend for 3D models?
>>
>>106899561
You can be one again as 99% current AI gens are sdxl slop and future models will be trained on said sdxl slop making anyone with a brain stand out even more.
>>
File: prostrate princess.mp4 (2.64 MB, 928x640)
>>
>>106899749
>fake pixelation
>framerate not fitting old style
2/10
>>
File: 00037-2466542931.png (534 KB, 1024x1024)
>>
>>106897594
very nice
>>
How do I remove the watermark from Sora vids?
I don't want that blurry removal shit...
>>
>>106899364
kek, in all fairness, it seems the models he does are from enough people talking about it and i cant find any threads on leddit mentioning it, so can only assume he doesn't know it exists but, just a matter of time

>>106899434
i'm not 100% sure but yeah, its decent for what it is. i've been collecting long vid/wan optimizers that are in the works

https://github.com/TencentARC/RollingForcing
https://github.com/NVlabs/LongLive
https://github.com/vita-epfl/Stable-Video-Infinity
https://github.com/dc-ai-projects/DC-VideoGen
https://github.com/mit-han-lab/radial-attention
https://github.com/dvlab-research/Jenga
https://github.com/DualParal-Project/DualParal
https://github.com/philipy1219/ComfyUI-TaylorSeer
https://github.com/justincui03/Self-Forcing-Plus-Plus
>>
File: rocking.mp4 (3.1 MB, 928x640)
>>
>>106899951
>the car turns on
audible kek
>>
File: image (24) (1).jpg (2.03 MB, 3072x1536)
I dunno how I feel about Pony V7 quite yet. Neither it nor Chroma is ever going to be worth using for anime over NetaYume so I don't really care about that aspect, and in the sense of just general boomerprompting or whatever it's typically at least somewhat similar to Chroma, assuming you use the same schizo negatives for both of them. Threw in SD 3.5 Medium here just for fun also.
>>
>>106899943
you have to ask your question here anon >>>/wsg/5999080
>>
File: ComfyUI_00017_.png (1.31 MB, 1024x1024)
>>106899189
>chroma radiance 0.4 was pushed some hours ago
woo!
>>
File: ComfyUI_00340_.png (1.76 MB, 1024x1472)
>>106899006
if you have to pass a chroma image through another model to get usable images, why are you using chroma? honest question
>>
File: 00043-3213665854.png (439 KB, 1024x1024)
>>
>>106899951
lel
>>
File: ComfyUI_00023_.png (1.22 MB, 1024x1024)
>>
Nanobana
https://files.catbox.moe/lcwk43.png
https://files.catbox.moe/4hpj6y.png
https://files.catbox.moe/ar0q22.png
https://files.catbox.moe/sbwb27.png

Vs 4o.image with same prompt and IMG to img reference
https://files.catbox.moe/csvbkq.webp
https://files.catbox.moe/qk0rq4.webp
https://files.catbox.moe/qzx9aa.webp
https://files.catbox.moe/9jkwxv.webp
Reference
https://files.catbox.moe/ojbdmu.png
>>
File: 1748369203695295.png (95 KB, 508x500)
>>106900267
>API vs API
on my local thread??
>>
>>106898919
as always freckles do weird things to me
>>
File: 1743199070027191.png (176 KB, 500x281)
>>106900287
>as always freckles do weird things to me
same, and I'm still pissed they removed the freckles on Max in the Life is Strange remaster
>>
>>106898951
Still a pretty nice gen IMO. Hi-res-fix tends to clean that kind of stuff up on NetaYume too, same as any other model.
>>
>>106900267
>titty monsters with api
how the fuck
>>
So what's going on with CivitAI? Are they on their way to banning NSFW?
>>
File: image_00029_.jpg (563 KB, 1344x1728)
>>
>>106900267
4o.image has bad fingernails in the first gen, but a less generic face. Colors are too red. Overall winner though, just because it's a believably plain woman.
>>
>>106900502
Yep. Every single NSFW model and lora is getting deleted and the accounts banned. Same with every single person posting NSFW images
>>
>>106900279
lol

It's interesting, because now we need to see what qwen does.

but I'm totally not doing that for my first qwen.
>>
>>106900502
This time no, they are trying to monetize their cloud users somehow by paywalling nsfw gens.
I don't really care, as long as I can download loras and models.
>>
File: ComfyUI_00025_.png (1.46 MB, 1024x1024)
I believe you should fly!
>>
average finger count: nominal
>>
File: ComfyUI_00001_.png (1.48 MB, 1024x1024)
>>106900557
>>
File: rwan.mp4 (994 KB, 640x1096)
>>106899561
yea, that might have been enough. even better gens soon-ish I presume.

>>106899671
heh, why not
>>
Is there a way I can buy anti-offsets?
>>
File: radiance.png (2.65 MB, 864x1488)
>>106899944
>i'm not 100% sure
i'm not sure either, but it looks pretty good. thanks for the linkage. that's a lot of stuff.
>>
no previews for me. HOTROD MODE
>>
File: radiance.png (3.22 MB, 864x1488)
>>106900076
increasingly woo, yes
>>
>>106900639
is that a 1girl or a 1transgril?
>>
File: radiance.png (2.62 MB, 864x1488)
>>
File: ComfyUI_00003_.png (1.39 MB, 1024x1024)
go play basket fall

>>106900646
surprising how much a wig helps sell it
>>
>>106900437
https://files.catbox.moe/9cplcp.txt
>>
>>106900541
>>106900502
I believe that, Wan 2.2 has barely any new NSFW recently, it doesn't capture the energy Wan 2.1 had, and that's because of civitai's current rules, not the model itself
>>
>>106900681
Why is radiance always oversharpened?

>>106900680
for genning just images of mid women which wan?
>>
>>106900537
In my tests nanobana cannot generate cellulite even when specifically asked to do so, while the 4o image model does it unprompted.
>>
File: image_00034_.jpg (1.53 MB, 1680x2160)
>>
>>106900703
I mainly use nanobanana because I have a basic bich sub (mostly to voice chat with the flash llm model). and debug random things, from win11 to car stuff.
>>
>>106900719
irl there are often fuzzies and stuff you have to shoop out. I wonder if any models will get good at adding those in if you want. adding faults in.
>>
>>106900703
Nanobanana doesn't seem better than Imagen 4 Ultra in any way for straight text-to-image, especially because it doesn't have the same 2K output support.
>>
File: ComfyUI_00119_.jpg (320 KB, 1280x1920)
>>
File: radiance.png (2.94 MB, 864x1488)
>>106900287
they usually work on this model

>>106900416
i can imagine there will be some realtime qwen to fix this, augmented reality style
>>
I am retarded and have only used the online civitai UI. What is the easiest one for a beginner to get started? I only have a 2070 super 8 gig card...
>>
File: image_00038_.jpg (1.43 MB, 1680x2160)
CAME seems like a decent sidegrade for Adamw, just need to dial learning rate way down.
>>
File: 1754476565247994.mp4 (2.82 MB, 720x1024)
>>106900087
>>
>>106900801
how much system ram?
>>
>>106900838
screenshot of it in use?
>>
>>106900855
16gb.
>>
>>106900874
I'm leaning towards no... check with others, I found I needed 32gb. Maybe comfyui needs less now?
>>
64gb is more pleasant than 32 was, so I think maybe 48 is advised, not 100% sure.
>>
>>106900892
>>106900900
So no generation possible?
>>
>>106900900
ram is not expensive. just get 128gb and be done with it.
>>
>>106900910
I should.
>>
File: radiance.png (3.27 MB, 864x1488)
>>106900645
https://www.youtube.com/watch?v=ooOELrGMn14

>>106900660
not uncommon anime 1girl hair, it works
>>
>>106900905
You can certainly try, the minimum spec says 16gb. I was not successful at getting it to work for me, at 16gb. Some tweaks may help your chances. If you are on Windows, supposedly increasing the page file works, idk.
>>
>>106900931
nice
>>
File: image_00040_.jpg (697 KB, 1344x1728)
An Yujin
>>106900801
You could use some 1.5 checkpoints, but it will be rough. 12gb vram is probably minimum for smooth sdxl use if that's what you are after. ReForge & comfy both have learning curves and there's no way around it.
>>
File: radiance.png (2.97 MB, 864x1488)
>>106900801
might want to try a bunch of the UIs with https://github.com/LykosAI/StabilityMatrix

probably just start with some common Illustrious checkpoint
>>
File: ComfyUI_00007_.png (1.72 MB, 1024x1024)
lol
>>
I want to buy a basketball team so bad
>>
>>106900838
Adamw just works so why even bother
>>
>>106900842
spoopy
>>
File: radiance.png (2.97 MB, 864x1488)
>>106901157
don't ask me why that one was removed, wasn't nude or anything

>>106901182
i myself think it'll work ok for sdxl and direct derivatives, and even quite a few other models with the right settings

hires fix and other stuff isn't very required with modern noob/illustrious/sdxl tunes anyhow. even that could probably be done with tiled methods
>>
File: ComfyUI_00009_.png (1.74 MB, 1024x1024)
>>106901195
>>
>>106901247
CAME is for some reason the only optimizer that ever gave me good results on SD 3.5 Medium
>>
half price day at ball-mart
>>
>>106901264
the camel thing is banned, could be that.
>>
>>106901286
Did you use restarts?
>>
File: 1737721749211047.mp4 (3.1 MB, 720x1072)
>>106900267
>>
That's it boys, Wan 2.2 has finally reached Veo's level... veo 3.1 that is lmao
>>>/wsg/6000191
>>>/wsg/6000194
>>
File: image_00049_.jpg (511 KB, 1264x1680)
>>106901286
I got the best results with Illustrious & NoobAI with CAME. Just pain in the ass to find correct settings because it fries so easily
>>
Is there a local model to do grok image/sora 2 image to video stuff?
>>
>>106901264
I have a 4060 8GB and I mainly use illustrious checkpoints. It works in ~80 seconds.
>>
>>106901398
Wan
>>
File: radiance.png (3.25 MB, 864x1488)
>>106901286
IMO it is a good optimizer. i also like prodigy schedulefree. nothing wrong with adamw if /when it works

>>106901297
don't see such a thing even if i zoom in. maybe just a technology issue. good thing we can do quick freckled 1girls.

>>106901195 >>106901278
quite artistic. i'm guessing we're not seeing this more on the internet because <filtered>?
>>
File: 1751031081700901.mp4 (3.82 MB, 640x640)
>>
File: 1745920196749770.png (31 KB, 516x298)
Best look vs time to gen I have when genning on wan i2v :
- using picrel for shift node
- a total of 10 steps

- 1 step high without lightx2v + high cfg (18+using skimming at 3) + res3s/bong_tangent
- 3 steps high using the latest lightx2v i2v lora + res2m/bong_tangent cfg1

- 1 step low without lightx2v + res3s/bong_tangent cfg1
- 5 steps low using i2v rcm nvidia lora + res2m/bong_tangent cfg1

It works well for me, I gen 129 frames/720p in 7 minutes on a 5090 and it avoids some detail blurriness and slow motion remaining even with the new lora.
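The 10-step split above written out as data, just to make the step windows explicit. This is a sketch: the `Stage` class and `step_windows` helper are made up for illustration, sampler/scheduler/lora names are copied as written in the post, cfg 18 is taken as the lower bound of "18+", and the skimming and shift settings are not modelled.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Stage:
    expert: str            # "high" or "low" noise model
    steps: int             # number of steps spent in this stage
    lora: Optional[str]    # speed lora used in this stage, if any
    sampler: str
    scheduler: str
    cfg: float


STAGES = [
    Stage("high", 1, None, "res3s", "bong_tangent", 18.0),
    Stage("high", 3, "lightx2v i2v", "res2m", "bong_tangent", 1.0),
    Stage("low", 1, None, "res3s", "bong_tangent", 1.0),
    Stage("low", 5, "rcm i2v", "res2m", "bong_tangent", 1.0),
]


def step_windows(stages):
    """Return (start_step, end_step) for each stage, end exclusive."""
    windows, start = [], 0
    for stage in stages:
        windows.append((start, start + stage.steps))
        start += stage.steps
    return windows


print(sum(s.steps for s in STAGES))  # -> 10
print(step_windows(STAGES))          # -> [(0, 1), (1, 4), (4, 5), (5, 10)]
```

Each window would map to one sampler pass that starts and ends at those steps, with the matching model, lora and cfg loaded for that pass.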
>>
>>106901444
Thanks. I'll look into it.
>>
>>106901458
yap yap yap yap yap
>>
File: radiance.png (2.72 MB, 864x1488)
>>106900801
yea, so I'm thinking this >>106901414 is just fine for sdxl and even other models with more patience or somewhat reduced quality/resolution.

if you constantly end up genning, maybe get a faster gpu and/or more system RAM then. just try it for now.
>>
>>106901477
kijai 2.2 new lora + rCM low lora (also on his huggingface) at 1 str seems decent
>>
File: ComfyUI_06042_.png (1.03 MB, 1216x856)
>>
>>106901509
Yes that's what I use here. I still go relatively higher steps because it's not that much longer on blackwell, and it gets rid of a lot of annoyances like weird blurry parts (from low noise) and some remaining slow motion (from high noise).
The crisp results are worth the trade off.
>>
>>106901507
I do 1280x1280 (and sometimes 1440x1080) with 48GB RAM and it works really well.
>>
File: image_00053_.jpg (1.39 MB, 2100x1580)
that chair

>>106901517
damn that's old image
>>
File: 1743845279711446.mp4 (2.07 MB, 640x640)
>>106901536
seems to work well
>>
File: 1760251463690977.png (1.7 MB, 1536x1024)
qwenedited. sadly it tried to retain the original BG characters. still better 'scene' than whatever neta lumina came up with
>>
>>106901621
I enjoy this technology
>>
I need to get on the VIAGRA mailing list.
>>
File: 1745107096342797.png (316 KB, 600x453)
>>106901666
>Not having comfyui support for image models is equivalent of not having llama.cpp support for text models. If you don't have it, your model will not get popular.
kek, he's not wrong, ComfyUI has the power to kill a model's momentum if he wants. So far I agree with him: when he decided to not implement HunyuanImage 3.0, that sent a strong message that the local ecosystem doesn't want giant slopped models that can't be run. I just hope he won't use that power to kill a model that deserves to be loved
>>
>>106899763
yet they pay $15 for this bro
>>
File: 1740484371050915.mp4 (450 KB, 704x480)
>>106901639
the anime girl turns the paper in her hand to reveal a picture of hatsune miku.
>>
>>106901712
>sono no cringe doll
exit life
>>
>>106901712
blue haired Mike. I heard he got fat and started drinking again.
>>
File: 1743305507991310.mp4 (400 KB, 480x704)
>>106901736
>>
>>106901766
relax what
>>
File: 1754232552464792.mp4 (680 KB, 704x480)
>>106901771
relax bro, is just 1girl gen
>>
>>106901777
Why is she dressed like a hooker?
>>
*looks at the screen*
I don't think she should look like a hooker.
*returns to default pose*
>>
File: 1746309329168205.mp4 (551 KB, 480x704)
>>
>>106901704
I actually like that he stopped that bullshit in its tracks
>>
File: ComfyUI_00019_.png (1.48 MB, 1024x1024)
>>106901799
>>
Did lodestones ever give any updates on Chroma using qwen 2.5 vl as a text encoder? I remember an anon said he was thinking about doing that.
>>
>>106901907
he should train on gwen 3
>>
>>106901780
How's life in Afghanistan?
>>
>>106901918
I wish he would do that but unfortunately he's going to work with the ponyfag to butcher Qwen instead of doing his own thing.
>>
>>106901920
They are growing demographically, which is what life is. How's your country's white tfr?
>>
>>106901934
What is tfr?
>>
>>106901929
I want to believe. I'd rather he teamed up with the noobai guys tho, v7 was such a fucking failure idk how anyone can trust the ponyfag again
>>
>>106901704
>>106901833
is that why he never implements the optimizations? he wants to kill them off?
>>
deja vu...
>>
>>106901946
total fertility rate. It's the average number of children born per woman over her lifetime.
>>
>>106901666
YES. this is why SD 3.0 is the most important model of this generation and tencent stinks
>>
File: ComfyUI_00021_.png (1.33 MB, 1024x1024)
>>106901855
>>
ive been genning on linux, the only issue is that comfyUI crashes OOM unless I disable hardware acceleration in the browser, but then I get browser latency. Sigh
>>
>>106901947
A Qwen-based NoobAI that could do realism and anime would be cool.
>v7 was such a fucking failure idk how anyone can trust the ponyfag again
I'm still waiting for the v7 weights that were supposed to be released a few days ago
>>
>>106901976
offload bruh
>>
File: file.png (26 KB, 1296x450)
I CANT HOARD AAAAAAAAAAAH
>>
>>106901994
offload cumfartui off your computer you mean
>>
File: file.png (16 KB, 787x152)
>>106901995
oh so that's why
>>
>>106901999
yeah bro go back to neoforge, it's suited for the mentally handicapped
>>
>https://www.reddit.com/r/StableDiffusion/comments/1o7bk44/i_made_nunchaku_svdquant_for_my_current_favorite/
>https://huggingface.co/Tiwaz/CenKreChro
Anyone tried this?
>>
>>106901995
Civit staff finally had enough of slopgooners and decided to shut it down effective immediately
>>
>>106902005
I thought any web crap is for the mentally checked out? then again your head seems to be vacant
>>
Is it better to just use the loras or the fp8 of the MoE 2.2? https://huggingface.co/silveroxides/Wan2.2-I2V-A14B-Moe-Distill-Lightx2v-fp8_scaled_hybrid/tree/main/distill_models
>>
>>106902013
>Krea+Chroma merge
what the hell
>>
File: ComfyUI_00023_.png (1.3 MB, 1024x1024)
>>106901969
>>
>>106902013
>krea+chroma
for what fucking purpose?
>>
>>106901948
>is that why he never implements the optimizations?
like what?
>>
>>106902013
I couldnt get it to work using either regular chroma or krea workflows. As for the svdquant, never tried it.

>>106902030
Yes, both are fun models
>>
>>106902013
Is there a reason you wouldn't just extract Krea as a lora and use that with Chroma? Why merge them?
>>
>>106902013
do you have examples of that franken merge?
>>
File: radiance.png (3.5 MB, 864x1488)
>>106901676
>sadly it tried to retain the original BG characters.
maybe if you indicate their position in the image, you could replace or remove some more of them

but a qwen anime training would help a lot

>>106901712
cute
>>
File: ComfyUI_00024_.png (1.14 MB, 1024x1024)
>>106902040
>>
>>106902051
infinite video? nunchaku? keeping tensorrt alive?
>>
>>106902082
I played around a bit more and got slightly better results, but im not gonna flood here with my tests, I only do that with 1 girl, pointing at viewer, laughing.
>>
>>106902060
>Is there a reason you wouldn't just extract Krea as a lora and use that with Chroma? Why merge them?
Merge is better if it works.

>>106902066
>do you have examples of that franken merge?
Nope, nothing. I wonder if it's possible to turn it into gguf
>>
File: 1751117347698166.mp4 (745 KB, 640x640)
the anime girl turns to face the camera and gives a thumbs up.
>>
COMING IN HOT

100 ITERATIONS OF SLOP
>>
>>106902189
it sucked. moar its <> better
>>
File: 1751179010905621.mp4 (548 KB, 640x640)
the man drinks a can of beer.

qwen edit for original char image, wan 2.2 to animate, with new 2.2 kijai lora + rCM low lora
>>
>>106897923
Comfyui + comfyui-multigpu custom node for RAM cache
>>
>>106902221
Pretty good. Try Wan ti2v. It's blazing fast.
>>
>>106902232
>Comfyui
That shit is everything except "comfy".
>>
File: image_00072_.jpg (685 KB, 1344x1728)
>>
>>106902060
people have done Krea "extract" loras already, the results are absolutely nowhere even remotely close to the same as actually using Flux Krea itself (which was a huge finetune of a rawer version of Flux Dev than the actual released Flux Dev weights)
>>
>>106902239
Yeah but it's the most powerful text to image, or image to video tool around. The other tools can't compete at all.
>>
>>106902268
What does it do that reForge can't? (Other than video)
>>
>>106902273
neoforge does video
>>
>>106902283
That's not what I asked...
>>
>>106902273
If you need to ask you don't need to know. It's way above your pay grade. Need to know basis.
>>
File: ComfyUI_06036_.png (1.18 MB, 1320x792)
>>
>>106902295
What will Anon do?
>Pokémon
>Bag
>Deflect <----
>Run
>>
File: ComfyUI_00028_.png (1.23 MB, 1024x1024)
VIDEO IS GAY
>>
>>106902273
Custom nodes. Can reforge use RAM as VRAM cache? Run ONNX models? Video 2 video for character pose extraction then putting another character on top?
>>
>>106900842
waow
>>
File: 1736917094152219.png (810 KB, 896x1168)
qwen edit is neat cause you can use any image like a lora that you can manipulate, essentially.
>>
File: ComfyUI_0275.jpg (2.68 MB, 1664x2432)
>>
>>106895979
honestly the nsfw stuff with qwen is pretty mediocre, i am still getting better results with flux kontext.
maybe i'm doing something wrong though?
>>
>>106902365
Yeah it's very powerful. It's the kinda model that is the reason AI gets banned or censored.
>>
>>106902365
Wow! Amazing! Tell me more about this "qwen".
>>
File: output.webm (3.88 MB, 720x1280)
>>106902318
I will take your bait.
>>
>>106902508
use the qwen clothes remover lora if you want nsfw. makes it nicer.
>>
>>106902578
wtf is wrong with your settings
>>
>>106901639
Anyone's reaction to "walking garbage" would be shock or confusion. Or I take it this was some "Engrish" mistranslation of some funky Japanese colloquialism?
>>
File: f.mp4 (1012 KB, 1088x1856)
>>106902145
>>
File: i2v_all-in-slop2.X.mp4 (2.56 MB, 512x512)
>load 240 frames
>ooms
>months and a few updates later
>load 321 frames
>works

huh? anyway, wan context + rcm is interesting
>>
File: 1748786482742439.mp4 (2.29 MB, 720x912)
>>106900838
>>
>>106900838

CAME, Prodigy, AdamW... Where can I learn how these actually work?
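
For a rough idea of what one of these does, here is a minimal sketch of a single AdamW update on one scalar weight in plain Python. It follows the usual textbook form (moving averages of the gradient and squared gradient, bias correction, decoupled weight decay). The function name `adamw_step` and the default hyperparameters are assumptions for illustration, not taken from any trainer mentioned in this thread, and CAME and Prodigy are not covered.

```python
import math

def adamw_step(param, grad, m, v, step, lr=1e-3, beta1=0.9, beta2=0.999,
               eps=1e-8, weight_decay=0.01):
    """One AdamW update for a single scalar parameter (illustrative sketch).

    m and v are the running first and second moment estimates;
    step is the 1-based update count used for bias correction.
    """
    # Decoupled weight decay: shrink the weight directly,
    # instead of adding an L2 term to the gradient.
    param = param - lr * weight_decay * param
    # Exponential moving averages of the gradient and the squared gradient.
    m = beta1 * m + (1.0 - beta1) * grad
    v = beta2 * v + (1.0 - beta2) * grad * grad
    # Bias correction compensates for m and v starting at zero.
    m_hat = m / (1.0 - beta1 ** step)
    v_hat = v / (1.0 - beta2 ** step)
    # Step size is scaled per parameter by the gradient's running magnitude.
    param = param - lr * m_hat / (math.sqrt(v_hat) + eps)
    return param, m, v

def minimize_square(x0=5.0, steps=300, lr=0.1):
    """Toy usage: minimize f(x) = x^2, whose gradient is 2x."""
    x, m, v = x0, 0.0, 0.0
    for t in range(1, steps + 1):
        x, m, v = adamw_step(x, 2.0 * x, m, v, t, lr=lr)
    return x
```

On the very first step the update is roughly `lr` in the direction opposite the gradient sign, whatever the gradient's size, which is a big part of why Adam-style optimizers are less sensitive to gradient scale than plain SGD.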
>>
>>
File: 1759186067212718.png (1.25 MB, 1360x768)
>>
>>106902578
Kek
>>
>>106902675
MY EYES. If you're on comfyui use the multiGPU plug in for ram cache. Also try the wan ti2v model which is super low on vram usage.
>>
File: ComfyUI_00328_.mp4 (1.81 MB, 1024x1024)
>>106902621
There's several issues, I used a bad lora and bad interpolation and bad sampler. The interpolation really destroyed it.
>>
>>106902869
Which multigpu nodes? theres a shit ton to pick from
>>
File: ComfyUI_0317.jpg (3.18 MB, 1664x2432)
>>
>>106902732
the funny thing about this kind of aznslop is that it has totally taken over the internet while having 0 appeal for white men. A little reminder that we don't run this place anymore and are a minority on "the global web"
>>
>>106903023
>while having 0 appeal for white men
yet people here post shit like "white man's kryptonite" in reply to a pic of an asian girl, which I find bizarre because I don't think most white guys like asian girls
>>
File: ComfyUI_0330.jpg (3.36 MB, 1664x2432)
>>
File: 1730511270752845.png (1.06 MB, 1360x768)
change the text saying "to be in" to "to generate". change the text "romantic" to "1girls".
>>
File: jubilthree.png (2.68 MB, 1296x1728)
>>
>>106903126
I'm a white guy and I gen basically every ethnicity, and also go out of my way to make my loras be able to properly do different ages / ethnicities
>>
>>106903126
>because I don't think most white guys like asian girls
No you have to understand, we have our own particular taste in Asian girls. It has some overlap but it's really not the same as the current pop culture beauty ideas in east asia... I think e.g. Zhang Ziyi in 2046 (2004) is very beautiful but I don't care for kpop idols etc
>>
File: 1744250334311447.png (1.92 MB, 856x1208)
change the style of the purple hair anime girl from anime to a black and white japanese manga style, with halftone shading.

neat, it kept some in color.
>>
File: 1733492411484778.png (1.06 MB, 1360x768)
>>
>>106902956
https://github.com/pollockjj/ComfyUI-MultiGPU
Then use .gguf Quant models with the node that lets you add RAM cache
>>
>>106902675
>wan context
does it work with i2v and wan 2.2 now?
>>
>>106903248
"""accidental"""" panty peek/shots are always nice
>>
File: 564654.png (389 KB, 512x512)
>>
File: Harpysing.png (989 KB, 816x816)
Hey, before I dive too deep into the rabbit hole myself, I just wanna ask: Can I use my 1 year old Mid-grade laptop (not gaming, no dedicated graphics card but it has that NPU thing Microsoft tells me is good for AI) for local image generation or am I stuck using web stuff? I poked my head into AI image generation a while back but things have gotten SO much better since last time I checked in.
>>
>>106903194
excellent

>>106903842
you can https://github.com/rupeshs/fastsdcpu

but it has limitations, and of course almost nobody here does this; most people run on nvidia cuda or such
>>
>>106903877
Good to know, thanks!
>>
File: x.png (3.03 MB, 864x1488)
>>106903888
np. hope you can do some nice stuff until you either get the faster hardware or go with IaaS/SaaS (not this thread)
>>
>>106903842
all the good stuff requires a big GPU like a 4090 or a 5090.
you also need all the ram you can get.
if you dont have this you will never produce anything in a reasonable amount of time and it will be boring.
>>
>>106902578
bit concerned about her bone health.
>>
>>106902675
a man of concentration
>>
>>106904041
a 3090 works well enough. with ram swapping I don't see why a 5080 or 5070 ti wouldn't work either
>>
>>106903529
anime is DEAD
(mcdonald's usa has anime figurines)
>>
File: ComfyUI_temp_xnbet_00010_.png (2.86 MB, 1296x1728)
>>106903690
With so many flying heroines wearing so many skirts and so many fans looking up to them, """accidents""" are bound to happen.
>>
>>106904070
3090, what it/sec or sec/it do you get with Chroma HD bf16, 1024x1024, euler, cfg=1?
>>
>>106902578
>>./pol/518991752
>>
File: ComfyUI_00033_.png (1.39 MB, 1024x1024)
>>106904192
uh. trying again
>>>/pol/518991752
>>
new
>>106904218
>>106904218
>>106904218
>>106904218
>>
NAGCFGguider uses cfg 4?

very confusing, I thought the point of nag was to use cfg 1.

using on Chroma HD
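
For context, a minimal sketch of the standard classifier-free guidance mix, assuming the usual formula `uncond + scale * (cond - uncond)`. This is the textbook version, not a claim about what the NAGCFGGuider node does internally; the function name `cfg_combine` and the plain-list inputs are hypothetical stand-ins for the model's noise predictions.

```python
def cfg_combine(cond, uncond, scale):
    """Blend conditional and unconditional predictions (textbook CFG).

    cond, uncond: equal-length lists of floats standing in for the
    model outputs with the positive and the negative/empty prompt.
    """
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]

# At scale 1 the unconditional term cancels, so the result is just cond.
at_one = cfg_combine([1.0, 2.0], [0.5, 0.0], 1.0)
# At scale 4 the result is pushed away from uncond, past cond.
at_four = cfg_combine([1.0, 2.0], [0.5, 0.0], 4.0)
```

Since the negative prediction drops out entirely at cfg 1, a negative prompt has no effect there through this formula alone. As far as I understand, NAG gets around that by applying the negative guidance inside attention, so whether a given guider node also expects cfg above 1 depends on that node.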
>>
>>106902732
i came
>>
>>106902942

she looks like gwengwiz and mexican


