[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Not Safe For Wife Edition

Discussion and Development of Local Image and Video Models

Previous: >>108855256

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: ComfyUI_00015_.jpg (2.6 MB, 3128x4096)
2.6 MB JPG
blessed thread of animaship
>>
>>108864310
absolutely disgusting
>>
File: ANIMA_bface_bad_00007_.png (803 KB, 896x1152)
803 KB PNG
>>108864310
Heard you like 1girls.
>>
File: 1777613504636595.jpg (1.58 MB, 3584x4608)
1.58 MB JPG
>>
File: 1760583584546182.jpg (1.28 MB, 3584x4608)
1.28 MB JPG
which anime girl should I gen next?
>>
File: 1771434715320203.jpg (1.48 MB, 3584x4608)
1.48 MB JPG
>>
>>108864310
vageeeeeeeeeeeeeeen
>>
>>
File: 00039-1392129503.jpg (514 KB, 1536x2496)
514 KB JPG
>>
File: 1761953393225871.jpg (1.14 MB, 3584x4608)
1.14 MB JPG
>>
>>108864437
aren't we all?
>>
File: 00063-2490104966.jpg (473 KB, 1536x2496)
473 KB JPG
>>
File: categories.png (587 KB, 1240x620)
587 KB PNG
>>108863648
>>108863781

I was getting closer with just a NL prompt for the features, and a "3D CGI" style with base anima. I mostly meant the actual art style (cel shaded with comic book style outlines).
>>
1girl, sitting
>>
>>108864535
oops wrong text box
>>
>>108864513
*vomits*
>>
>new custom node that apparently fixes a thing I have issues with
>look at its own workflow to start with
>copy the node over to my own
>error
>error
>15min later I realize the values reset themselves in the node upon pasting it in

Thanks, open source.
>>
new lora for enhancing ltx gens is out to mess around with:

"LTX2.3 OmniNFT RL-LoRA generates high-quality video/audio + visuals and sound are perfectly synchronized, no laggy or mismatched audio.

- realistic Lip-Sync

- action-matched sound

- reduces synchronization errors by 52%

huggingface.co/Kijai/LTX2.3_comfy/tree/main

seems decent, my first test:

https://files.catbox.moe/d6u41h.mp4
>>
File: 1029981966473579.png (1.91 MB, 1024x1536)
1.91 MB PNG
>>
File: Qwen-Image-2512_00047_.png (1.18 MB, 832x1216)
1.18 MB PNG
>>
>>108864765
https://files.catbox.moe/lpat8v.mp4

seems to be more dynamic than no lora, pretty neat
>>
>Haven't touched local in a year
>Research for a while
>It's still shit

a year and still almost nothing new
>>
File: klein1.jpg (749 KB, 1400x1569)
749 KB JPG
trained brutalism slider for klein 9b
>>
File: coomer.png (17 KB, 250x208)
17 KB PNG
>>108864796
>>
>>108864841
looks cool
>>
https://github.com/TenStrip/10S-Comfy-nodes

Has anyone tried the tiled sampler out of this one for ltx 2.3?
I'm getting MORE distorted colors and brightness with it.
>>
File: Flux2-Klein_00087_.jpg (475 KB, 896x1456)
475 KB JPG
>>108864844
one of the worst things mankind has created
>>
>>108864850
haven't noticed anything wrong with it.
>>
>>108864796
what is this sitting pose called?
>>
>>108864867
Built like a brick house.
>>
https://huggingface.co/RuneXX/LTX-2.3-Workflows/blob/main/Video-2-Video/Extend-Any-Video/LTX-2.3_-_V2V_Extend_Any_Video_Multi-Extend_long_video.json

lot of interesting workflows by this guy, this one is a video extend one.
>>
File: Flux2-Klein_00098_.jpg (374 KB, 1136x1136)
374 KB JPG
>>108864878
>Built like a brick house.
>>
File: 1759288608121560.jpg (1022 KB, 3584x4608)
1022 KB JPG
>>
>>108864893
been using those for a while. testing ltx director wf's now.
>>
File: 1761303043881275.jpg (1.28 MB, 1344x2240)
1.28 MB JPG
>>
File: Flux2-Klein_00110_.jpg (509 KB, 1008x1296)
509 KB JPG
>>
>>108864893
working out the kinks for the extend prompts but lol, it does work: using the 2.3 q8 distilled model.

https://files.catbox.moe/h4hvru.mp4
>>
File: Flux2-Klein-9bfp8_00208_.png (2.5 MB, 1616x1280)
2.5 MB PNG
>>
>>108864994
i2i upscale?
>>
File: 1778841876664347.png (86 KB, 1000x796)
86 KB PNG
>>108865001
it just this "make it 3d"
>>
File: 1747776368598536.jpg (299 KB, 1651x1227)
299 KB JPG
What's the best local tts now? Last thing I checked out was Qwen3-TTS.

are
Fishaudio S2P
Kokoro
Chatterbox
scenema-audio
VibeVoice
DramaBox

any better?
>>
>>108865012
>>108864994
Is… is this… SFW? I’m so confused, my pant so confused (and hard)
>>
File: hbgu.png (303 KB, 682x606)
303 KB PNG
>>108865073
did u find out how to give emotions to a cloned voice in Qwen3?
apparently there's a workaround but it seems complicated
>>
>>108865073
IndexTTS2 and dramabox.
>>
File: drb.png (40 KB, 941x128)
40 KB PNG
>>108865106
oh apparently there's this new thing based on LTX 2.3 audio which is great, and it has a low vram mode. i'll try
>>
Hey this thing worked wonderfully. No more node chaining for long videos.
>>
>>108865147
IndexTTS2 is good for emotions but requires a lot more work. DramaBox is good for fast voice cloning.
>>
>>108864992
video extend can clone audio too apparently.

https://files.catbox.moe/qhmut1.mp4
>>
File: 3.jpg (1.52 MB, 1568x2091)
1.52 MB JPG
>>
>>108865288
brutal.
>>
>>108865382
miles_teller_turning_head.gif
>>
File: ComfyUI_00184_.png (1.61 MB, 1504x1000)
1.61 MB PNG
>>108864098
thank you so much! 2girl has been achieved!
>>
https://huggingface.co/modernjack3/Dramabox_DiT_Sulfur/tree/main
>>
File: 1758791956188602.png (1.2 MB, 1024x1024)
1.2 MB PNG
yo yo my niggas
how do I train an artist's art style for illustrious?
give me some tips
like what tags should be used
Thanks
>>
>>108865505
post a non garbage gen first
>>
File: ComfyUI_00044_.png (1.36 MB, 1184x880)
1.36 MB PNG
>>108865537
here's Germansuka
>>
there we go, extend workflow success. fun workflow (you can extend multiple times, just did the basic one)

https://files.catbox.moe/qgwors.mp4
>>
File: ComfyUI_00185_.png (1.72 MB, 1000x1504)
1.72 MB PNG
>>108865617
what's you rig looking like for video slopping?
>>
>>108865505
>100+ images
>As much as content diversity as possible in the same style
>Use some tagger like wd14 and remove style related tags
>TW first, don't shuffle the tw
>1024p
>rank 32
>Probably want to train for Anima instead in 2026 but whatever
>>
One of the worst OP collages of all time
>>
>>108865419
>>108865635
Very clean and looks like the game, it's just straight up anima base?
>>
>>108865636
>>TW first, don't shuffle the tw
trigger warning?
>>Probably want to train for Anima instead in 2026 but whatever
me too but civitai doesn't support it yet
Thanks btw
>>
>>108865459
Interesting. How's Sulfur? I've heard people praising it
>>
File: ComfyUI_00186_.png (1.79 MB, 1000x1504)
1.79 MB PNG
>>108865659
I stuck in a greg lora with anima preview 3
>>
>>108865660
trigger word, and I said that for if you enable tag shuffle
>>
>>108865662
Haven't tested Sulfur much I've been using Eros. You can do good stuff with it but the prompt adherence is awful.
>>
anything similar to njudea canvas? I liked painting and seeing changes immediately...
>>
>>108865635
regular comfyui, it's just a 4080 (16gb), it's nice that video gen works and is quick (ltx2.3) even without needing 24 or 32gb vram.

I think ltx can work fine on 12+ in general?
>>
>>108865617
kek, enabled the first extend tab, added this:

"the masked man runs to the left and sits at a computer and starts typing on the keyboard."

https://files.catbox.moe/7zpug0.mp4
>>
>>108865714
Seems like you can recreate something similar enough with: decent GPU paired with small enough model + maybe some step distill lora + some controlnet + low denoising + some vibecoding.
>>
>>108865659
no it doesn't
nova gens did
>>
File: ComfyUI_00144_.png (1.86 MB, 1504x1000)
1.86 MB PNG
>>108865727
I have a 9060 xtx 16 gig.. and 16 gig regular ram, do I need to upgrade to 64 gigs for video slopping?
>>
>>108865754
ok vibecode it and then link me the GH proj
>>
>>108865778
Nah
>>
>>108865777
16 gigs of regular ram is fucking brutal.
Technically not necessary but using abysmal amount of swap will not be a pleasant time.
>>
File: ComfyUI_00194_.png (1.77 MB, 1000x1504)
1.77 MB PNG
>>108865807
yeah I got nothing but errors, guess I'll have to save up lol
>>
>>108865777
I can do wan 2.2 on similar specs without a swapfile, low res and low fps though. Hopefully the ram prices go down in the next year or so.
>>
>if I pull that off would you die?

https://files.catbox.moe/hm2891.mp4
>>
File: ComfyUI_00195_.png (1.64 MB, 1000x1504)
1.64 MB PNG
>>108865846
how low are we talking? but yeah we have to wait, cuz I wanna do nsfw video gens
>>
File: mugen.png (756 KB, 1024x1024)
756 KB PNG
>>
>>108865777
i videoslop with 8gb vram, 16gb ram, 12 minutes for a 20sec 720p video
>>
fact or fiction?

these workflows are great: https://huggingface.co/RuneXX/LTX-2.3-Workflows/blob/main/Video-2-Video/Extend-Any-Video/LTX-2.3_-_V2V_Extend_Any_Video_Multi-Extend_long_video.json

https://files.catbox.moe/hsl2qb.mp4
>>
>>108865971
make one of Goku standing on top of Bejita and Bejita is saying "3-0"
>>
>>108865971
based
>>
File: ComfyUI_00197_.png (1.62 MB, 1000x1504)
1.62 MB PNG
>>108866001
tell me your secrets, cuz I want to do image to video
>>
File: 00098-2870274387.png (3.78 MB, 1536x2304)
3.78 MB PNG
>>
File: anima__0002_.png (865 KB, 1024x1024)
865 KB PNG
>>108866006
>>
I'm a (sfw) vageeeeeeeeen appreciator, bls gib, also believe niples proturding is very respectful
>>
>>108866065
:(
I wish I could make collages like this
care to share a catbox?
>>
>>108866004
even better, whats also nice is ltx can clone voices too regardless what you edit. the extend/remix workflows are pretty fun to mess with.

https://files.catbox.moe/pkmfj6.mp4
>>
File: 00106-346842418.png (2.57 MB, 2304x1536)
2.57 MB PNG
>>
File: anima__1_.png (1.42 MB, 1024x1024)
1.42 MB PNG
>>108866091
It's dead simple.

Prompt: 2koma, Two panels, first panel shows Bulma's back turned, looking at a computer. Second panel shows Bulma in same position but looking at the viewer and she says "No"

Euler a, 12 steps, turbo lora.
>>
File: 287954196609631.png (2.61 MB, 1024x1536)
2.61 MB PNG
>>
>>108865971
The troon should have cheap gray plastic cat ears
>>
>>108865971
Local IS dead
>>
local ist ToT
>>
File: miku kills gooners.png (1.33 MB, 1216x832)
1.33 MB PNG
I should follow my own advice more.
>>
>>
File: thanks.jpg (29 KB, 746x512)
29 KB JPG
>>108866196
>>
File: 430198398808667.png (1.87 MB, 1024x1536)
1.87 MB PNG
>>108866275
smol hands
>>
>>108866317
She's small
>>
>>108866024
pinokio app with wan2gp and LTX 2.3 Q4 K M. it's plug and play
>>
I still like qwen2512, has there been any word of something like 2606
>>
File: 85703445626177.png (1.85 MB, 1536x1024)
1.85 MB PNG
>>108866321
And cute.
>>
>muh akira slide
>>
>>108866360
Sorry, Alibaba's tight open-weight pussy is now CLOSED. They have stopped releasing image models and now only drip-feed their LLM releases based on xitter polls.
>>
>>108866360
They made Qwen 2 as a smaller model but they also converted API Judaism before releasing it.
So, nothing /ldg/.
>>
>>108866360
I recently got into Qwen and it's a very capable model.
The plastic look is rough though.
Any good loras you use to get less plastic shit?
>>
>>
File: ComfyUI_0612.png (2.36 MB, 1024x1536)
2.36 MB PNG
>>108866401
i've been training my own, only got 24gb vram though so I have to use a quant. It also takes a pretty long time to train a lora, like 5k steps on ai-toolkit to start to get it to behave. I do like it for how well it eventually lets you make pretty detailed things in the trained style though
>>
>>108866427
my wife almost saw this
>>
>>108866458
Ask her about her opinion on thighs
>>
>>108866431
Well good luck and feel free to share if it finishes.
>>
I am a goy who is noticing things.

https://files.catbox.moe/g4cyy5.mp4
>>
The thing ComfyUI does where once in a while my gens slow to 60s/it and my whole computer gets sluggish is almost worse than when it used to just OOM error
>>
>>108866586
>60s/it
which checkpoint?
your specs?
>>
ltx extend workflow but with the first extra extend enabled (can do another prompt)

https://files.catbox.moe/e7d8lo.mp4
>>
I've seen a lot of stuff from poe2 in d4, and now I'm seeing stuff from d4 in poe2.
>>
File: ld_00056_.png (1.94 MB, 1024x1536)
1.94 MB PNG
>>108866046
nice
>>
>>108864346
Nobara
>>
>>108866004

If you like to goongen then the absolute best local method is to make an initial 5 second video with wan2.2 then extend that video with LTX2.3 eros to make a 20-30 sec clip. WAN is much better at physics than LTX but using a WAN video as a base helps LTX to maintain that quality across a long format. And the whole thing gets sound obviously. Using the Runexx extend V2V workflow (with an added RTX super res node just before VideoCombine) I can gen a 25 second 1080p clip in around 120-150s with a 5080, 12700k, 96g DDR4.
>>
>>108866606
Chroma fp8 on a 12gb card doing ~0.5mp gens. I'm not surprised it's exceeding the available memory sometimes, I just don't like the particular solution they use.

If my gens take a minute or so on average, and opening a twitter video for five second causes it to go OOM and switch over to the 60s/it mode, and now my computer runs like shit for ten minutes while it finishes a single gen, it feels like it could have achieved a much better result by just unloading the model and waiting until the vram situation looks better...
>>
Anima rules!
Hater drools!
>>
double extend worked. we can fix shows with AI now.

https://files.catbox.moe/q78eh7.mp4
>>
File: 1752726232554809.jpg (1.24 MB, 3584x4608)
1.24 MB JPG
>>108866641
>>
>>108866702
niceu
>>
>>108866702
nice nobara, are you using anima or illustrious?
>>
>>108866702
Oof, sfw vageen, nice nice
>>
>>108866722
anima
>>
What's the best lora for NSFW for flux 2 klein 9b EDIT?
Don't really care much about 3d/realism and care more about 2d/anime.
>>
>>108866586
le dynamic vram meme
>>
Should I use a fp32 model over the same fp16 model if I have the vram?
>>
yo yo my niggas
what's the best auto tagger for lora training right now?
I've got 160 images
anime/comic book style
>>
>>108866586
It means it's raping your ssd for genning purposes and is killing it faster (no I am not joking). You need to disable that shit immediately
>>
>>108866722
>illustrious
People still use this ?
>>
>>108866734
Gets boring quick but the undress her lora is well made imo (have fun finding non-jannied link)
Most of the NSFW loras that are still around are t2i loras. Some still work fine for i2i without extreme amount of seed lottery but the facial likeness is often the real seed lottery using them.
>>
>>108866810
Not saying you're wrong because what the hell do I know, but it looks like Comfy explicitly said the system doesn't do that anymore
https://github.com/Comfy-Org/ComfyUI/discussions/12699

Is he wrong?
>>
File: 1633248578447.png (128 KB, 400x349)
128 KB PNG
Finally got face detailer to work on Comfy. It's been a journey learning how to use this thing. Now all that is left is learning how to set it to save images in different folders organized by date. Also, the AMD GPU I got to replace my GTX 1080 works like a charm. I thought it would be completely broken. Today was a good day.
>>
>>108866734
>>108866829
Welp just read the 2d/anime part now.
Yeah I have no clue about that.
>>
>>108866782
qwen or gemma
>>
>>108866834
Good now compare s/it of a model that takes less than 8 gigs of memory with 1080 and see how well it scales with TFLOPS difference.
>>
Anima is such a blessing
Crazy we got this
>>
File: 1767222938221507.gif (302 KB, 632x451)
302 KB GIF
>>108866857
Hell no, I rather keep the illusion. I will stick to AMD regardless since I'm on linux, so there's no point.
>>
File: 1749131732188017.jpg (1.42 MB, 3584x4608)
1.42 MB JPG
is there a way to use wd-eva02-large-tagger-v3 without using ComfyUI-WD14-Tagger? Any other GUI?
Thanks.
>>
>>108866700
How do I make this not look like absolute dogshit for stuff with fast movements like goon stuff?
It just creates a weird blur, I can't find where the fuck can I change the steps amount for better quality, can I even do that with this workflow?
>>
>>108866841
so stick to Qwen edit for nsfw edits for now?
>>
>>108866919
klein edit 9b Q8 is best so far imo (one or two images)

both are fine though
>>
>>108866900
taggui, Jelosus2/DatasetEditor, https://github.com/kohya-ss/sd-scripts/blob/main/finetune/tag_images_by_wd14_tagger.py
>>
File: IMG_3381.png (920 KB, 1024x1536)
920 KB PNG
>>
>>108864266
>>108864266
>>108864266
holy fuck can we chill with the porn OP images everywhere all the fucking time i am at fucking work
>>108864266 >>108864266
>>108864266
>>108864266 >>108864266 >>108864266
>>108864266
>>
>>108866993
anime website
>>
>>108866993
We should do whatever makes less posters like you come here
>>
Tbh I love anima but I do miss it when this general was more realism focused.
Hopefully it will return back to its roots in a few weeks when the honeymoon is over
>>
>>108866993
which one's porn?
>>
he's just baiting
>>
>>108867090
Tbh you missed your chance to post realism focused gen
>>
>>108867018
why are you watching that garbage in the first place?
>>
File: 468674.png (33 KB, 168x130)
33 KB PNG
Thoughts, anime bros? >>108866932
>>
>>108867179
im not. I unfortunately saw posts about it on social media. idk why TV is so bad now when there was good stuff in the past. glad I only care about videogames or anime.
>>
File: 1777560574033607.png (1.33 MB, 1024x1024)
1.33 MB PNG
klein edit 9b is very good. just say "give them a bikini" and voila, bodysuit to swimwear. (test image was a random emilia lora test image)
>>
>>108866900
Use a modern vision model like qwen or gemma
>>108867210
Anon discussed this ITT weeks and weeks ago
>>
>>108867210
inpainting is very good, rest needs time in the oven.
>>
>>108867254
>Use a modern vision model like qwen or gemma
but I'm training anima, wouldn't borutags work better?
>>
>>108866932
>>108867210
does it allow LoRA training?
>>
File: Qwen-Image-2512_00055_.png (1.44 MB, 832x1216)
1.44 MB PNG
>>108867143
>>
>>108867290
Anima was also trained with NLP but even qwen/gemmas faux booru tags are better than three year old WD models. It gets tags mostly right anyway.
>>
>>108865971
catbox? Please!
>>
File: hmm.jpg (48 KB, 741x568)
48 KB JPG
>>108867327
what if I use both booru tags and NLP to tag my pics?
>>
>>108867296
https://github.com/kohya-ss/sd-scripts/blob/main/docs/anima_train_control_net_lllite.md
>>
File: Juggernaut_Z_V1_00197_.jpg (542 KB, 1344x1728)
542 KB JPG
>>
>>108867365
Yeah the meta for lora training is to have qwen or gemma spit out both tags and NLP for every caption
Good luck have fun anon
>>
>>108867227
hot, hot as fuck
>>
>>108867365
been taken care of.
>>
>>108867458
is this Kohya? (never used something like it before)
>>
>>108865147
ive integrated that into my local ai gf and it's not too bad, vram hungery tho
>>
File: file.png (37 KB, 1228x207)
37 KB PNG
>civjeet has been extra shit lately
>investigate why
oh...
>>
>>108867476
No
>>
>>108867493
what is it then friendo? OneTrainer?
>>
desu it is trivial to vibecode your own caption software
>>
what sampler/cfg/steps for sulphur2?
>>
>>108867476
It's a fun little time wasting vibe coding project. Training data preparation app with captioning and editing for videos and images.
>>
>>108867480
civ became worthless once then introduced on-site generation and faggot ass "buzz"
>>
did already start coding your own again? :)
>>
File: Juggernaut_Z_V1_00244_.jpg (392 KB, 1344x1728)
392 KB JPG
>>
>>108867876
Is Juggernaut Z less melty than regular Z?
>>
File: 1768037621332574.jpg (1.4 MB, 3584x4608)
1.4 MB JPG
>>
File: z-image-bf16_00002_.jpg (476 KB, 1344x1728)
476 KB JPG
>>108867887
I don't think there's any difference. RescaleCFG helps
>>
how do you delete your beautiful porn? it feels like deleting the mona lisa sometimes
>>
>>108867876
share catbox?
>>
>>108867968
1girl, asian
you really need a prompt for this?
>>
>>108867975
yes :)
>>
File: Animaistoocreative.png (37 KB, 849x247)
37 KB PNG
Anima is TOO creative, that is bad
>>
You will not make me reply to plebbit content.
>>
>>108868010
why not just increase the cfg
>>
File: 1765664074370176.png (166 KB, 606x655)
166 KB PNG
>>108868010
>Top 1% Commenter
>>
>>108868026
its probably someone who prompts three tags only like "1girl, [flavor of the month gacha], standing" and is upset that it wont follow a single style cross seeds
>>
>>108868010
this is your brain on vpred
>>
File: ComfyUI_25490.png (2.98 MB, 1920x1080)
2.98 MB PNG
>>108867090
The other 75 anime threads are just too fast, they won't get noticed in the flurry of posts!
>>
File: Anima_00799_.png (544 KB, 896x1152)
544 KB PNG
>>
File: 991519409873443.png (1 MB, 1128x880)
1 MB PNG
>>
File: Anima_00811_.png (469 KB, 1216x896)
469 KB PNG
>>
https://github.com/Stability-AI/stable-audio-3 Anyone trying this?
>>
File: Juggernaut_Z_V1_00239_.jpg (510 KB, 1344x1728)
510 KB JPG
>>108868162
Was pretty damn rough.
>>
File: 0002-3792105265.png (1.53 MB, 1152x768)
1.53 MB PNG
>>
Does Anima know Mastema or is it lora time? The only lora for this guy is for SD1.5..
>>
>>
>>108868177
How is Juggernaut comparable to Z Image? The last NSFW checkpoint I downloaded was not that creative or provocative and was more NSFW overfitted.
>>
>>
Anons who are playing around with Anima, is it worth using the turbo lora? I feel like the sloppification effect is a bit too strong for the speed increase at strength 1. I've been playing around with using it with a lower weight and still getting 30% faster gens compared to base only, but not sure if that is the best approach.
>>
>>108868241
z juggernaut doesn't do nsfw any better than z base. https://huggingface.co/RunDiffusion/Juggernaut-Z-Image
>>
File: Anima_00836_.png (2.55 MB, 1136x928)
2.55 MB PNG
>>
>>108868216
searching danbooru for the proper tag gives mastema_(megami_tensei), which should be written as mastema \(megami tensei\)
>>
>>108868295
Is this i2i using anima? If yes can you share the workflow pretty please?
>>
>>108868285
>Anons who are playing around with Anima, is it worth using the turbo lora?
I use both the turbo lora and the cosmos dmd2 lora at 1.0 strength 4 steps and it looks fine to me.
>>
>>108868310
i2i with anima is no different than normal i2i
>>
File: 1006354682274307.png (1.97 MB, 1088x1472)
1.97 MB PNG
>>
>>108868314
>and it looks fine to me
no way man
>>
>>108868285
>is it worth using the turbo lora
Not for me, no. I'm fine with waiting a minute or two for 2+ megapixel gens
>>
File: Anima_00124_.png (404 KB, 896x1152)
404 KB PNG
>>
File: ComfyUI_35060_.png (1.43 MB, 2368x1760)
1.43 MB PNG
>>
/r/ing' (once again) sfw vageeeeeeeeen
>>
File: ComfyUI_35017_.png (1.13 MB, 832x1248)
1.13 MB PNG
>>
File: Juggernaut_Z_V1_00258_.jpg (435 KB, 1248x1824)
435 KB JPG
>>
>>108868233
>>108868263
HOLY KINO
>>
File: Anima_00027_.png (672 KB, 768x1024)
672 KB PNG
>>Anima saved local.

Anon, how to chose the right Klein9B?
https://huggingface.co/silveroxides/FLUX.2-dev-fp8_scaled/tree/main
>>
Captain Snudd stands proudly in a dim cheese vault beside his prized Emmentaler collection, surrounded by tall wheels, cracked wedges, labeled shelves, and golden crumbs scattered across an old stone floor. He wears a weathered nautical coat with brass buttons, a lopsided captain's hat, and a stern expression, holding one pale yellow wheel of Emmentaler like a sacred artifact.
Behind the shelves, Gogurtus the spaghetti demon lurks in partial shadow, a tangled creature of glossy noodles, red sauce, meatballs, and melting cheese, peering through gaps in the cheese racks. The foreground is crowded with cheese knives, wax paper, crumbs, and a small brass lantern. The background fades into a cool, cellar-like darkness with arched stone walls, hanging hooks, and dusty shelves, creating a surreal humorous horror scene with rich texture and dramatic warm lighting.

>>108868428
ty
>>
File: 690396558370360.png (971 KB, 832x1216)
971 KB PNG
>>
>>108868459
>>
>>108868453
q8 > fp8
>>
>>108868472
gguf gets slower with every lora added tho
>>
>>108868307
>mastema \(megami tensei\)
doesn't seem to know him, lora time then
>>
>>108868285
how long does an anima gen take without turbo?
>>
>>108868116
moar sexo jebby
>>
>>108868042
i still dont know what the fuck vpred and eps is
>>
>>108868599
I never liked vpred precisely because it was making look every seed the same and the proposed solution for wanting more variety was "add garbage tokens to the end"
And I don't know if it was all placebo or not either, but that "feature" was already a deal breaker
>>
>>108868516
use INT8, 2X speed even on 20xx GPU
>>
>>108868516
Without turbo I have to gen at least 42 ish steps to avoid detail artifacts (eyes, etc). Takes me 70 seconds at 2MP.
With full turbo (strength 1) you only need about 12 steps so ends up being around 20 seconds.
The downside is that the turbo lora is very biased towards a certain "high score on danbooru" look, is worse at certain concepts, less creative, etc.
>>
>>108868630
damn, 70 seconds is rough. it takes me 12 seconds on noob with 1.5 upscaling.
>>
File: hfdstey.png (475 KB, 3808x3384)
475 KB PNG
i told you guys for close to 2 months that ltx2.3 was kino, but only now you finally agree?
>>
which models generate the best and fattest asses?
zit looks better, but klein 9b looks fatter.
>>
File: Anima_00035_.png (2.75 MB, 1536x2048)
2.75 MB PNG
>>108868472
>>q8 > fp8
Ok, but mixed? dare? learn? wtf all thoses?
>>
>>108868679
entirely subjective, retard. now post an example
>>
File: debo_cs-f_anima1_00088_.png (2.48 MB, 1792x1140)
2.48 MB PNG
>>108868162
there's a new audio model out from facebook research
>>
>>108868650
tell your GPU and output resolution anon
>>
oh sweet they chose to release anima base while I'm away for two weeks
qrd? i hope it's easily trainable
>>
>>108868704
mixed usually means weights have a mixture of different quantizations. don't know about the rest.
>>
>>108868722
896x1152
25 steps
Upscale 1.5
5090 (teehee)
>>
Do you guys have any pixel landscape gens?
>>
1girl, standing,
>>
>>108868727
>qrd? i hope it's easily trainable
the release model seems ok

trainability seems neither especially easy nor especially hard with anima-standalone-trainer, but most training UI didn't add support yet so no prodigy schedulefree optimizer or CAME or other good stuff IIRC
>>
>>108868704
Mixed has some weights preserved in bf16 everything else is a meme
>>
>>108868751
and a flower lora and give each 1 girl a different flower
>>
>>
>>108868704
Schizo bruh repo
>>
>>108868472
same for 16gb vram?
>>
File: my cpu-berg.jpg (53 KB, 500x500)
53 KB JPG
>>
>>108868814
Modern cards should opt for fp8
>>
>>108868780
you need a lora for flowers? kek
>>
>>108868939
yeah, for consistency
>>
>>108868978
>>108868780
>>108868742
butt cheeks.
>>
is anima worth a try for someone who doesn't care about anime (except very specific artist loras)
>>
>>108868857
Fp8 for speed. There's a noticeable divergence vs BF16
>>
File: 451.png (73 KB, 2119x221)
73 KB PNG
lets see if it works
>>
>>108869005
If you don't care about anime and have limited time, chroma is lot more funny for experiment
>>
>>108869005
your question is beyond fucking retarded but sure why not, its like 3gbs, retard.
>>
>>108868453
Convrot int8 > mxfp8 > q8 > rest
some mixed fp8 variant but I don't recall precisely which > bunch of other fp8 variants and rowwise int8 > tensorwise int8
But I don't know different lesser fp8 and int8 variants rank among each other.
>>108868472
q8 is slow as shit due to need to dequant and plays horribly with loras.
Old unc advice in the age convrot and mxfp8
>>
>>108869001
>>
>>108869048
>>108868978
>>108868791
>>108868780
what do I prompt for similar underwear?
>>
>>108869063
color lace, color lace trim, lace stockings, lace thong, highleg, lace bra, bridal gauntlets, covered nipples, floral print
>>
>>108869097
>>
why are you spamming your 1girl, standing, flower, pinups?
>>
File: Herr Coomer.png (259 KB, 860x945)
259 KB PNG
>>108869097
>>108869110
thanks
>>
>>108869115
for this guy
>>108869116
>>
>>108869121
nice, make sure he finishes.
>>
>>108869042
Isn't Int8 have issue with most CIVITAI LoRA in comfyui?
>>
>>108869142
is this how ppl get you to pay for patreon?
>>
>>108869159
No? It's "messy" to deal with admittedly, but you have multiple options:
You can prebake lora into bf16 and convert on the fly > you wait through conversion when model loads and it get can very memory hungry for larger models, but then it works, with very low delta from the bf16 baseline
Dynamic lora loading > lora is applied at the run time with very low delta from the baseline and no need to requant (so memory friendly if you already have prebaked checkpoints lying around) but you take some performance hit >10% compared to usual int8 speeds
Stochastic and None (old default) > Larger delta from the baseline but most of the time one or both will perfectly fine with only minor changes compared to baseline. Faster than dynamic at inference. BUT will require dequanting (if checkpoint is already int8) and then requanting with lora applied so memory hungry like the first option. And like it's very unpredictably fucking random which model/lora will like which one of the two options (or both) to apply the lora.
>>
>>108869115
why not? nice 1girls
>>
Do periods work same way as commas in comfy? I been pulling some pngs and some use a mix
Is it just model dependant?
>>
>>108869276
Depends on the model, usually they're fairly comparable.
>>
>>108869276
no if clip encoder. yes if llm encoder
>>
>>108869160
patreon is finished for commissions
>>
Is it possible to use comfyui locally but use a cloud platform for the GPU only? I dont want a cloud platform seeing my nsfw content.
>>
>>108869269
Anon, you've fried my last braincell..
>>
>>108869367
they see your data
>>
>>108869367
lmao
>>
>>108869367
>I dont want a cloud platform seeing my nsfw content.
that's not how any of this works
not until cryptographic computing is a thing
>>
>>108869367
what are you afraid of weirdo
>>
>>108869367
No. This is a rich man's hobby now. You will need to spend thousands of dollars if you want enjoy non-normie AI porn.
>>
Anyone got good workflows that do backgrounds and forgrounds in separate passes then you can i2i together?
>>
i'm disappointed in dramabox. i wanted to use it for generating background audio for videos, but it only wants to generate people talking
>>
>>108869367
just boughted it bro
I looked into it too but just gaved in and buyinged a 5080 2 months ago
>>
File: mahi smart.png (299 KB, 1024x1024)
299 KB PNG
>>
Help me understand the difference between addetailer, hiresfix pre detailed, hiresfix post detailer then all these seperate adetailer nodes (hands/face/nsfw)

Im assuming its reduntant to do all and the hiresfix ones only worth if im actually running an upscale node (like hiresfix pre detailer same thing as addetailer if im not upscaling) ?
>>
File: ComfyUI_67111_.jpg (330 KB, 1664x2432)
330 KB JPG
>>
>>
>>108869560
Detailers locate an area of your image (hands, body, pussy, cock, etc), enlarge it, and run an i2i pass over that area. Highres pre and post simply tell the program to run an upscale / second pass over the entire image either before or after the detailer pass.

Some prefer to run all the detailers they desire before the highres fix pass, others prefer it happen after, and some like to do it both before AND after. It's really user preference.
>>
File: ComfyUI_Upscaled_00022_.jpg (2.68 MB, 4096x6144)
2.68 MB JPG
>>108869659
Thanks thats very helpful. I ran this with sdupscale and hiresfix pre detailer, i guess i didn't realize the hiresfix pre detailer was an actual upscaler like it is in the webui?
I expected just two outputs, first image at 1024x1536 and second at 2048x3072 but I got a third at 6k height also.
>>
File: 869890829.jpg (80 KB, 1000x1024)
80 KB JPG
i just got the same random seed twice in a row. what does this mean?
>>
>>
File: ComfyUI_00293_.jpg (2.84 MB, 2048x3072)
2.84 MB JPG
>>108869688
>>
>>108869736
The problem with lingerieprompting is you end up very very frustrated with the current state of lingerie. It becomes obvious fast that what is holding back your gens is the utter poverty of our art of ornamenting the nude body, and you discover the need for new forms of clothing that you cannot communicate to the prompt box. They are innovating such things in hentai, maybe; and for certain the art was known in some ancient societies; but you are chained to The Dataset, it is your mill-stone, you will never advance one step forward in this medium. The uselessness of AI.
>>
>>108869523
benchooood
>>
(Go ahead, reply "skill issue", you insect.)
>>
>>108869725
a djinn will visit you at 3am and install anistudio
>>
>>108869836
skill issue, whole advantage of ai is you can put whatever medium into whatever you want
if you see great hentai lingerie, prompt for that
detail it better
>>
>>108869859
Don't worry though, it won't compile.
>>
File: thousand-yard-stare80985.jpg (1.93 MB, 3008x3861)
1.93 MB JPG
how do i be as stoic as him?
https://files.catbox.moe/5fu0h2.mp4
>>
File: ComfyUI_temp_nrkgs_00001_.png (1.89 MB, 1248x1824)
1.89 MB PNG
>>
>>108869936
Long thumb that goes through the dress
>>
Fresh

>>108869976
>>108869976
>>108869976
>>108869976

Fresh



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.