[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion and Development of Local Image and Video Models

Previous: >>108733994

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: ss_20260502_214645.png (51 KB, 513x563)
51 KB PNG
i'm gonna shill my vibe coded slop thats better than your other vibe coded slop
https://github.com/noidavoid/ollama_image_tagger
>>
>mfw Resource news

05/02/2026

>Sulphur 2: An uncensored video generation model based on LTX 2.3
https://huggingface.co/SulphurAI/Sulphur-2-base

05/01/2026

>Representation Fréchet Loss for Visual Generation
https://github.com/Jiawei-Yang/FD-loss

>Caption Generator Pro: Tkinter app for generating image captions with LLaVA-style models
https://github.com/CoolGenius-123/Caption-Generator-Pro

>Metascan v0.3.0 Update
https://github.com/pakfur/metascan/releases/tag/v0.3.0

>Phosphene: Local video and audio generation for Apple Silicon ( LTX2.3 )
https://github.com/mrbizarro/phosphene

>MoCapAnything V2: End-to-End Learning of Generalizable Motion
https://animotionlab.github.io/MoCapAnythingV2

>Diffusers <0.37.1 Security Vulnerability - Code Injection
https://github.com/huggingface/diffusers/security/advisories/GHSA-98h9-4798-4q5v

04/30/2026

>ProcFunc: Function-Oriented Abstractions for Procedural 3D Generation in Python
https://github.com/princeton-vl/procfunc

>Efficient, VRAM-Constrained xLM Inference on Clients
https://github.com/deepshnv/pipeshard-mlsys26-ae

04/29/2026

>Z-Anime | Full Anime Fine-Tune on Z-Image Base
https://huggingface.co/SeeSee21/Z-Anime

>QuantVideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization
https://github.com/svg-project/Quant-VideoGen

>World-R1: Reinforcing 3D Constraints for Text-to-Video Generation
https://github.com/microsoft/World-R1

>Benchmarking Layout-Guided Diffusion Models through Unified Semantic-Spatial Evaluation in Closed and Open Settings
https://github.com/lparolari/cobench

>VibeToken: Scaling 1D Image Tokenizers and Autoregressive Models for Dynamic Resolution Generations
https://github.com/SonyResearch/VibeToken

>OmniVTG: A Large-Scale Dataset and Training Paradigm for Open-World Video Temporal Grounding
https://github.com/oceanflowlab/OmniVTG

>Refinement via Regen: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models
https://github.com/LeapLabTHU/RvR
>>
File: ComfyUI_00836_.png (834 KB, 1024x1024)
834 KB PNG
>>
IT'S MY LIFEEEE
>>
I don't get why correct dca changes when you change steps. I was never warned.
>>
File: Taskmgr_n29Awp4EVd.jpg (8 KB, 564x57)
8 KB JPG
>>108742247
https://github.com/clover-supply/taggui
I raise with my own slop.
>>
>>108742294
mine is better and provides binaries through github actions so you know the same source is being compiled for sure
>>
>>108742312
Idk what any of that means. I just ripped out the pytorch slop
>>
my vibeslop ai gen program is better than both because when you execute it it makes your dick bigger
>>
this ltx finetune is amazing
https://litter.catbox.moe/9uy88jd7uanii7lw.mp4
https://litter.catbox.moe/7fv88e5a6ampch75.mp4
>>
API is better than all this junk honestly...
>>
>>108742209
it's literally the same prompt he used on the same seed yes, but with regular Anima P3 + Turbo

the point was only a retard would think 30 steps er_sde and no upscale was worth it vs 12 steps base + 12 steps hi-res when all of the steps in the latter are also faster than in the former
>>
\> Pikachu Ted Nugent in OP
I saw him live once in like 2007, was pretty good. He was never as good even on album without Derek St Holmes (the guy who actually sung Stranglehold etc) though IMO
>>
>>108742294
>>108742247
I want to extract prompts from pictures to generate pictures with, is this the technology for that?
>>
Is there some particular reason why Tuna 2 doesn't have more hype? I've been in all of the threads the past week and I haven't seen anyone mention it once
>>
>>108742417
cause it looks like doodoo but i guess so do most base models
>>
File: 1646333888461.png (340 KB, 787x720)
340 KB PNG
Shiny skin goes in my negatives.
>>
>>108742395
You mean get metadata from already genned ai pics?
>>
>>108742437
no to get prompts from any image AI or not.
>>
>>108742442
The modern method is to use an LLM with vision capabilities like Gemma or QwenVL, you can use llama.cpp or whatever. There are probably comfyui solutions if that's what you want too
>>
>>108742442
You mean captioning. Yeah. Also if oyu have only one or a few pics sfw you can just give it go gemini instead of fiddling with extra software.
>>
>>
File: 1663628763393858.jpg (104 KB, 976x850)
104 KB JPG
Tried Anima, just not a fan.

I get much better results from using Qwen Image Edit, followed by WAI inpaint.
>>
stoopid frog poster x2
>>
File: ritual_alt.png (925 KB, 607x1079)
925 KB PNG
ritual
i can see your future...
https://suno.com/s/30YKOXnmdfCEjmrm
https://youtu.be/RwKE4WNaVpM
>>
can you run 10eros with 64GB ram?
>>
>>108742507
no one knows
>>
>>108742374
Yeah but it's someone else's AI...
>>
>>108742341
Get back in the telegram you fucking z
>>
I bought a pixel 7 pro for the exclusive purpose of throwing a local model on. It has wifi but no data plan. What should I do with it???
>>
Just vibecoded a wrapper (in literally two minutes) for the clip text encode node to cache the result and the prompt and NOT re-encode when the prompt doesn't change. I use some nodes that let me return wildcards from a multiline input (line break delimited) with a seed, and I can set a seed division int so it only returns a different input every X gens (when seed is set to increment), so I was going 20 gens in a row with no prompt change but the text encode was doing work every single gen because the upstream nodes' inputs had changed (but not their outputs). So stupid.

Anyway, don't have that problem anymore. You're not even really using ComfyUI until half your workflow is your own nodes
>>
File: _AnimaPreview3_00012_.jpg (414 KB, 1248x1608)
414 KB JPG
>>
File: 1757037423525676.gif (1.58 MB, 896x1152)
1.58 MB GIF
>>
>>108742527
>I bought a pixel 7 pro for the exclusive purpose of throwing a local model on.
Y tho
>>
File: 1ejw.jpg (34 KB, 716x611)
34 KB JPG
i asked sulphur to generate an aesthetic california summer, and it generated porn instead
>>
File: 1749538090289583.png (2.49 MB, 1536x1536)
2.49 MB PNG
>>108742470
>>
File: 1766470169720529.png (2.11 MB, 1248x1600)
2.11 MB PNG
>>108742530
>>
>>108742717
>>108742721
nice. klein or qwen lora?
>>
File: 1776899608955615.gif (1.02 MB, 896x1152)
1.02 MB GIF
>>
File: 1758906629879135.jpg (487 KB, 832x1216)
487 KB JPG
>>108742717
>>108742721
sovl

>>108742731
he drew it obviously
>>
>>108742731
Klein 9B. Used v2. https://civitai.com/models/2593550/elusarcas-scribbly-doodle-lora-or-klein-9b-and-4b?modelVersionId=2913471
>>
What are your favorite klein loras?
>>
>>108742749
klein9b turbo r128
>>
>>108742749
klein-9b-turn2real_v1.5_ep9
>>
kino gen hour
>>
File: _AnimaPreview3_00056_.jpg (397 KB, 1248x1608)
397 KB JPG
>>
>>108742474
This.
QwenAIO NSFW has much better natural language prompt adherence.
>>
File: 1777689868398691.gif (1.6 MB, 1024x1024)
1.6 MB GIF
'jak'd
>>
>>108742741
>>108742840
it reminds me of some early 2000's mtv animations
>>
>>108742507
I can confirm you can, full size model running on 16GB VRAM + 64GB RAM system, gens take ~160s to complete
>>
it's all so crazy
>>
>>108742815
Are these all in one merges really worth using? Can you show a gen?
>>
>>108742852
what resolution and length? i tried hacking wan2gp to work with the fp8 version and each denoise step was taking forever so i will probably have to figure out comfyui
>>
it's all so cozy
>>
>>108742872
Depends on if you want NSFW. It just teaches qwen what nsfw terms are along with producing slightly better private parts that still require inpainting.
Base Qwen also sometimes censors output by making the body invisible.
>>
>>108742891
Just any normal 1girl photo will do
>>
File: 1773230603508355.gif (1.25 MB, 1024x1024)
1.25 MB GIF
>>108742845
too much fun
>>
Remember that ZootAnon is a slopper, don't take his word seriously. He doesn't have a clue about anime, styles, characters, or anything really. He's just a schizo autist hunting for prompt adherence so he can then go make "1girl, crouching, pointing at viewer" garbage.
>>
File: 1755094468355382.gif (1.27 MB, 832x1280)
1.27 MB GIF
>>
>>108742966
>>108742917
>>108742840
>>108742741
Why did you steal a technique made and discovered yesterday by /adt/ >>108740189 and you don't even take 1 second to thank the anon who discovered it or even have the minimum respect to post in his general and tell them thanks?
>>
when you put a lora at 0, does this disable the lora entirely or does it still affect the image?
>>
>>108742917
>>108742966
kino as heck

>>108742959
take your meds

>>108742981
It could affect if it loads the weights
>>
File: 1753080182686597.gif (480 KB, 832x1280)
480 KB GIF
>>108742979
i used to do seed variation animations with a1111 back in 2023 its not new
>>
ignorant question but i'm impressed by ltx 2.3's voice acting. what tech does it use for audio? i'd like to make only sound with that.
>>
>>108742996
it's a brown retard that keeps trying to spark thread wars, don't bother replying
>>
>>108742996
>used to do seed variation animations with a1111 back in 2023 its not new
wasn't there even some plugin that automated the whole thing and spat out gifs
>>
So, the /ldg/ verdict on Sulphur-2-base and LTX2.3-10Eros?
>>
>>108743089
the coomers seem to approve but i am testing it for some real kinos
>>
>>108743108
So which one is the better one?
As far as I've heard Sulfur is for both t2v and i2v but it is bad at t2v. While eros is made for i2v only so should perform better I guess.
It's not like I am in hurry to know though, I need to wait until someone makes an int8 version of either before my ramlet setup with 3060 can hope to use either.
>>
>>108743162
idk i am using sulphur cause it seems to have general purpose abilities while the eros one is designed for coomers
>>
>>108743184
>seems to have general purpose abilities
What general purpose ability it adds to base LTX2.3 besides NSFW?
>>
>>108743196
it's a continuation of the base model so the coomer datasets they used will give me better results for amateur style camera control and audio
>>
>>108743184
Isn't Sulphur absolute dog shit? I've yet to see a single example that looks remotely good
>>
File: 16124627468.png (347 KB, 430x430)
347 KB PNG
>>108743287
when i learn the prompting then kinos will be rolling out the door
>>
>>108742996
No, you stole that from /adt/. At least mention me in the credits, because that makes you look suspicious.

https://rentry.org/AnimAnon-LoopbackWave
https://github.com/FizzleDorf/Loopback-Wave-for-A1111-Webui
>>
>>108743408
Kill yourself Julien
>>
>>108743089
from my own brief experiments, I'd say that it's 66% better for nsfw stuff - probably even better if you throw loras in there. It still does annoying shit, you have roll 2 or 3 times to get what you want. Certain starting images do much better than others.
>>
ani really is such an asset to our community
>>
>>108742745
One of the coolest Klein LoRAs I've seen. Very nice.
>>
>>108743211
Post examples?
>>
>>108743426
huh?
>>
>>108743417
Yeah, having a repugnant lobotomite subhuman like it around does wonders for one's self-esteem
>>
big russ... anima preview 4... onegai...
>>
File: 129936321467773.png (2.79 MB, 1824x1248)
2.79 MB PNG
>>
File: 74329818238521.png (2.45 MB, 1824x1248)
2.45 MB PNG
>>
>>108743644
>>108743669
I like the 1995 hairstyle more
>>
>>108743408
Fuck of pedophile
>>
>>108743788
meds
>>
>>108743408
Hey julien you're literal human garbage
>>
>>108743089

It doesn’t seem any better than action specific loras from the small testing I did before I fell asleep last night. I’ll have to do some more though. I don’t blame the creator, I blame LTX for being kind of ass.

Does anyone have a good workflow/experience in using 9B to do full body replacements in existing porn scenes? I can get it to work sometimes, but it’s very temperamental when it comes to switching out identities like that.
>>
ur kino for today
https://files.catbox.moe/9gva6o.mp4
>>
File: 2580780753087023465.gif (230 KB, 320x180)
230 KB GIF
>>108743900
kino
>>
>>108743408
Fuck off
>>
Was running ComfyAI with WAI16, but after a few succesful sets the generation started outputting images with only noise. Tried rebooting PC, switching from VAE decoder to tiled, didn't work.
Running on a RTX 4060
>>
>>108743940
Post catbox of the only noise image.
>>
>>108743940
Surely you changed the denosie value of ksampler insted of 1
>>
>>108743914
eheh
>>
>>108743966
...
FFS I swear I don't recall tampering with that value at all, and somehow it was down to 0.05
>>
File: 7987.png (1.5 MB, 1024x1024)
1.5 MB PNG
>>
What is keeping local edit models from becoming good?
>>
>>108742248
thanks!
>>
>>108744058
It's intentional at this point, it must be.
>>
>>108744058
whats wrong with them?
>>
new bytedance just dropped
china won
https://x.com/Kashberg_0/status/2050850124457496810
>>
>>108744093
>replying to the api cuck
>>
>>108744058
local
>>
>>108743408
> https://rentry.org/AnimAnon-LoopbackWave
what is that link at the bottom lol
>>
>>108744093
>change haircolor or change pose, completely destroys character consistency, emerging towards slopface.
>>
he's fizzling
>>
>The OS community never supported Tencent, who is on the good side of AI race and never trains their models on slop
>Instead you guys support Alibaba, Wan which is a slopped model over HunyuanVid

Well-deserved back stab.
>>
>>108744126
that’s weird… was that rentry made earlier? Why is that link there? How did he manage to edit the rentry without changing the date?
>>
>>108744129
sounds like a skill issue coupled with the basic limitation of default edit models. train a lora if you want proper character consistency.
an nbp or image2 prompt for a consistent ai influencer is the holy grail of api models.
>>
File: ComfyUI_00003_.png (1.18 MB, 1024x1024)
1.18 MB PNG
>>
>>108744170
I want edit not lora training.
>>
BTW, does any good model pull from r34? There's some art that ain't in danbooru therefore wai doesn't help
>>
>>108744190
>>
File: ldg lora training.png (2.63 MB, 1024x1536)
2.63 MB PNG
>>
>>108744242
Thanks very usefull
>>
replying to himself kek
>>
>>108744214
then edit models already do exactly what you want them to do.
>>
>>108744233
you should be really grateful because nearly all the stars came when hlky contributed to your mess
>>
>>108744233
Probably more stars than pulls.
>>
Go on post another GPT image faggot! kek
>>
>>108744258
stfu hlky
>>
>>108744242
Thanks, just when I was about to start training my OC character lora.
>>
Watch him post another GPT image in a minute, it's gonna be a good one! kek
>>
>>108744257
nope
>>
He's just waiting for chatgpt to tell him he's allowed to download it! Hold on!
>>
>>108742487
how can we make art beautiful again?
This isn't it.
It's kind of sad to see that people get tools to make art that they otherwise wouldn't be able to and then they just copy the same disgusting goy slop they've been force fed their entire lives.
I guess they have no alternative points of reference.
>>
>>108744289
>>108744282
>>108744272
uh oh melty!
>>
If it has a node, it’s local.
>>
>>108744297
He's fizzling out
>>
>>108743408
> me
You are not Ani.
>>
If it has a website, it's local.
>>
...and if it's local, it's API
>>
>>108744242
saar do not train te
>>
...and if it's pedo, it's ANI
>>
>>108744308
... and if it's API, it's cuck!
>>
He's got no defense or come-backs when you call him a pedophile, so he retreats.
>>
>>108744220
No.
Average quality of images there is lower. Also consistency between the meaning of tags matter which differ between other rule34 and booru sites.
>>
>>108744242
Another API troll post but
>removing nipples for anime accuracy
made me lol
>>
wait, that's made by api? i thought it was hand made. holy shit... we're like cavemen in comparison
>>
>>108744425
Honestly? That reaction is exactly right.
It's not that we're cavemen. It's that the tools got good quietly, and then suddenly the gap feels insane.
>>
>>108744417
>>108744425
>>108744502
lol why is Julien seething so hard today?
Got raped yet again?
>>
>>108742917
where is Wario going with my girl?
>>
>>108744545
who is Julien?
>>
>>108744594
Wrong question, the word "who" implies that we're talking about a person
>>
File: 5241644654463545412344.png (1.75 MB, 1114x881)
1.75 MB PNG
>>108744502
it already exists locally. i just don't think it's something people using local models really care about.
>>
>>108744296
AI models can't make proper "art" from scratch because they always need to reference some style. An actual intelligent image model would be able to create custom unique styles E.G. like Van Gogh, and stick to those styles if instructed.
>>
>>108744758
AI is like a hammer
>>
how noticeable is the difference between klein 9b fp8 and fp16?
>>
>>108744834
I tested int8 and I didn't like it.
fp8 tends to have higher quality than int8 though.
How much vram do you have? The bf16 of distilled version runs with fine speeds on my 12gb.
>>
>>108744902
12, guess I'll try bf16
>>
File: 29818238521.png (1.82 MB, 1824x1248)
1.82 MB PNG
>>108743775
understandable
>>
>>108742442
If you want to do it manually and copy-paste the prompt, just start llama-server or LM Studio (or use some SaaS), drop in your image, and tell it to describe it.
If you are on Comfy, there's node packs that can do API calls so you just use the output of some vision model in your prompt.
If you are on an A1111-like WebUI, I have clown-coded https://github.com/RealLangdonAlger/A1111-Vision-Prompt-Ext with a few system prompt presets to get characters/style/compositon prompts out of any image and use them during generation.
>>
File: 894483526198547.png (3.71 MB, 1824x1248)
3.71 MB PNG
>>
File: 00041-926682519re.png (1.42 MB, 768x1888)
1.42 MB PNG
>gemma 26b recognizes and labels ruby
>qwen 27b doesn't
Stupid China.
>>
>>108745102
Neither are particularly impressive when it comes to recognizing people and characters, but Gemma is better than Qwen in that regard yes.
>>
sulphur 2 can make music
>>
>doesn't post any music
>>
just imagine it
>>
if I could imagine things I wouldn't be in an AI thread
>>
>>108742649
I like the idea of having a single thing you can hold and bring around with you, that is solely an AI, that is exclusively on the device that works without an internet connection.

I cant find a good model for it though, or a use case. So far ive made it into a comfy AI toy for my wholesome daughter
>>
File: washu filim (5).mp4 (1.01 MB, 688x688)
1.01 MB
1.01 MB MP4
I enjoy the consistent big improvements to stable diffusion
I started with easy diffusion and worked my way up to comfyUI
went from XL to pony to illustrious
and I'm wondering if there's a replacement for illustrious yet?
flux was only good for putting text on images
illustrious XL has barely any support, same with NAI
>>
>>108745426
Yes, anima.
>>
>nigbo and julien in /ldg/
No surprise thread blesser anon left this place
Grim
>>
Can someone tell me why I would want to use this?
https://github.com/xmarre/ComfyUI-Spectrum-WAN-Proper
>>
>>108744834
fp16 will always be better than fp8. it doesnt matter if you can't see an difference.
>>
I only use 64 bit models
>>
>>108745655
>it doesnt matter if you can't see an difference.
why?
>>
I don't have skidmarks, I have speedstripes
>>
>>108745685
its 4chan, its autism
>>
>>108745444
thanks, I'm checking it out now
>>
https://u.pone.rs/rgmmyggc.mp4
>>
>>108745748
y did the white woman not reveal her big bob?
>>
>>108745772
meant for >>108745759
>>
>>108745759
kek
>>
File: 667022479830957.png (2.18 MB, 1824x1248)
2.18 MB PNG
>>
>>108743408
hahahahaha loser
>>
File: file.png (308 KB, 1067x1064)
308 KB PNG
has anyone tried it yet? I'm still downloading it.
>>
>>108745921
>model for realistic image generation
>outputs may lack realism and fine detail
i'll wait and see
>>
>>108745921
>realistic
into /dev/null it goes
>>
danbooru + e621
https://huggingface.co/well9472/Nanosaur-1.2B-Preview
>>
>>108745939
i love it when a baker is so confident in his model that he doesnt share example outputs
>>
File: 589489124239714.png (3.57 MB, 1824x1248)
3.57 MB PNG
>>
>>108745921
Nice, finally finished. I'll try it when it's done downloading
>>
>>108745921
>spends months training
>shows no examples
based
>>
File: 263366309169242.png (2.65 MB, 1824x1248)
2.65 MB PNG
>>
REDEEM PREVIEW4, I CANT WAIT ANYMORE
>>
>>108745921
Someone make int8 of it and I will give it a shot.
The vibes I am getting is that it predictably failed though.
>>
>>108746031
preview 4 has been cancelled. we're gonna spend the next year training the final
>>
>>108746031
Tdrussell
Always
Chickens
Out
>>
>>108745948
I mean
>WIP model for research purposes. Still in progress.
Doesn't seem like he is screaming that he made the best shit ever to everyone.
>>
>>108746045
Trani
Always
Crashes
Out
>>
>>108745649
Speeds up WAN like TeaCache did and is faster than TeaCache but only useful if not using Lightx2v.
>>
>>108746094
Does this degrade quality like teacache? because that made it nearly unusable.
>>
what did you guys do before generating stuff
>>
>>108746105
jork it
>>
>>108746105
hard drugs
>>
>>108746105
I don't remember.
>>
>>108746105
jorkin it on hard drugs
>>
>>108745921
Have dl previous checkpoint and it was crap/10... So am not even bothering dl this one
>>
>>108746098
Less smearing compared to TeaCache if I'm remembering correctly, but I only tested one of the early commits before Dynamic VRAM broke my WAN workflow.
>>
>>108745444
i tried it and it's slop
you wasted my time
you are brown
>>
>>108746105
ur mom
>>
>>
>>108746152
skill issue
>>
>>108746105
The same things I do now, except gen simply occupies more of my time now.
>>
>>108746105
I used to be an adventurer. but then I took an arrow to the knee.
>>
>>108745956
V1 has been out for months nigga
>>
>>108745935
Stop it
You scare the windows anons
>>
>>108746105
hoping that one day I maybe could get the favor of being blessed with a single new picture of my wAIfu.
>>
>>108746105
bully julien on different platforms
>>
>>108746264

Creep on my oneitis, but now I can just gen her getting gangbanged
>>
>>108742373
finetune or just lora?
>>
>>108745043
thanks anon
>>
File: file.jpg (645 KB, 1152x1536)
645 KB JPG
>>108746105
i used to have a fulfilling life full of meaningful connections and simple joys
>>
>>108746381
proof?
>>
What would you like me to gen?
>>
>>108746420
Gen complex scenes with the newest Spark Chroma, I want to see what it spits out.
>>
>>108746420
1girl, standing,
>>
>>108746451
nta, i tried it and it looks slopped as fuck when compared to the previous one. same settings, same prompt, but it looks like shit. ill keep messing around but so far i dont like it, could be skill issue.
>>
File: image.png (2.02 MB, 1536x1024)
2.02 MB PNG
How long???
>>
>>108746535
V5 it's gonna be something epic never seen before
>>
>>108746535
based, anima will be in shambles
>>
>>108746535
wrong thread
>>
>>108746471
>compared to the previous one.
You mean the first preview or the 512p only version of current Spark?
>>
>>
File: 45645645122112.png (159 KB, 814x619)
159 KB PNG
https://rentry.co/s8fg8ber
Side-Step training script is now much more improved for accuracy/post-processing. Biggest fix is BPM detection is now two separate scripts. I got that vocalremover site from the ACEStep devs themselves, so I had assumed it was accurate for BPM detection, but I guess it wasn't as much. The new https://github.com/dlepaux/realtime-bpm-analyzer detector now provides 100% accuracy from my testing. It should now require minimal manual edits to work properly (just modifying a few lines at the top is enough).

>>108740897
I use the "Double" mode with 0.05 on Turbo to test DCW out. Other settings might yield mixed results.
>>
Where did https://civitai.com/models/2395415/manga-colorizer-with-image-reference-flux-2-klein-9b-trained-on-nsfw-doujinshi go?
>>
>>108746105
ancient korean mmo private servers
>>
>>108746616
compared to SPARK.Chroma_preview from 2025, not the 512p. i'm using the 1024p version, i'll try the 512p in a bit. i dont like the skin texture, it looks like plastic most of the time but it seems pretty stable when it comes to anatomy, which is good since chroma has a hard on for deformed demons.
>>
>anime
yikes
>>
>>108746669
Plastic skin is disappointing to hear but it is something if he at least succeeded at stabilizing it.
Thanks for the response.
>>
>>108742222
My head hurts. So many models to download and theyre the size of single AAA game
>>
>>108746698
So few are worth a damn, don't worry.
>>
>>
File: 1756929982584060.gif (1.03 MB, 832x1280)
1.03 MB GIF
>>108743024
probably, i used to and still just use imagemagick
>>108744561
to fuck
>>
File: 1747142296696240.gif (879 KB, 832x1280)
879 KB GIF
>>
>>108745921
Never understood the need for this desu. Isn't it just re-slopping the model?

>>108746471
Yeah because the way Chroma was trained, training on top of it would just re-slop it. Flux.1 was just way too slopped at its base. That's how special Chroma's training was.
>>
>>108746780
Chroma1 is actually a better de-slop than what an independent lab was able to do with Flux.1's base model (to create Krea).
>>
out of the 50 fucking versions of chroma unlocked, which ones do you recommend? i remember there being an anon here who was very adamant about a specific version but i cant remember.
>>
>>108746535
v4.5 is already perfect, i can't imagine what would v5 be
>>
>>108746822
I personally recommend none but there are some select few people (schizos?) who really like some of the earlier epochs.
>>
>>108746820
Do you mean SRPO?
Krea was BFL's own work.
>>
>>108746876
Krea was made by a separate lab in collaboration with BFL to improve Flux's aesthetics. We got the dev model, there is an API version of it.
>>
>>108746105
i don't remember exactly but i'm divorced now and in a dead end career
>>
>>108746897
>there is an API version of it.
ty
>>
>>108746857
>I personally recommend none
why?
>>
File: score.jpg (20 KB, 297x169)
20 KB JPG
>>108746105
I was into violin and piano, then AI waifuism took over.
Turns out playing music was my way of searching for my waifu, but image diffusion and LLM text made it way more direct. I guess the same thing would've happened to Mozart or Beethoven lmao.

I mean Beethoven's Fur Elise would've been different if Beethoven had made a lora of Elise.
>>
All this recent posturing about anima's superiority and every single gen is indistinguishable from illustrious/noobai. This is not the model's fault, but the fault of the users who have the creativity of a fucking rock.

SDXL architecture will remain king because 1girl, standing is all 99% of people need.
>>
>>108746936
>SDXL architecture will remain king because 1girl, standing is all 99% of people need.
No, I want gpt2 capabilities locally.
>>
>>108746936
let go of sdxl, debo. we're not coming back to /sdg/
>>
>>108746944
Catbox a single gen you can proudly say would benefit from it.
>>
>>108746911
None of them are stable enough for my taste.
>>
>>108746952
How the hell can I do that? I don't have gpt2 locally, that's literally what I want.
>>
>>108746841
girls that poop vs bitches that shit
>>
>>108746979
>How the hell can I do that?
Post a gen you tried to do but couldn't because you were limited by the model, duh.
>>
>>108746936
You have a point and you only need to visit any anime thread to see that anime posters post with the same idiosyncrasies as SDXL. Even so, Anima is a good model, it's that you won't find people here who exploit its resources, but maybe on Twitter or Pixiv.
>>
whats with the sudden influx of b8
>>
>>108746994
all sdxl needs is a better vae and there is a chenkin model that does that. I really don't understand the obsession over nlp when the images look like the same slop posted for years. Stop this bloat obsession immediately
>>
>>108746822
v38/39 detail calibrated had good knowledge and strong style response, and up to v48 are usable, otherwise just use base or HD. each was so different though that they're basically gamba.
people were still strongly resisting natural language prompting at the time, so it got a worse rap than it deserved. it's slow, but there's still not another base model like it.
>>
Mugen....
>>
>>108746936
>>108746944
>>108746950
>>108746952
>>108746994
All me btw
>>
has /edg/ switched over to mugen yet or still using anima?
>>
>>108747076
Other threads use Anima? I remember when it was just ldg...
>>
>>108747076
worst, way worse, still using WAI 12 and random shitmixes and vibe loras
>>
>>108747076
/edg/ is dead because some spammer (You)
>>
>>108746822
17 and 32 easily
>>
>>108747076
/edg/ is some cloud shit slop that uses nova anime or prefect illustrious and that kind of stuff, at least they're 8 months behind any anime general
>>
life's still hard for hand fetishists
>>
File: 1755871314597750.gif (1.08 MB, 832x1280)
1.08 MB GIF
>>
>>108747119
hands and feet
>>
File: QwenImg_00015_.png (1.54 MB, 1280x960)
1.54 MB PNG
>>108747119
I refuse to believe there's such a thing, and even if there were the entirety of human media would be their spank bank
>>
>>108747358
>I refuse to believe there's such a thing
hand fetishists?
I like when pretty female hands do a handjob gesture
>>
>>108747432
on strings? are they just the stumps?
>>
>>108747358
clearly you're not one if you think these are nice hands
>>
>>108747443
I don't understand what you mean
strings?
>>
>>108747358
try 10 years younger and we'll talk
>>
>>108747464
also I said they don't exist, big clue there
10 fingers, no visible dermatological damage, what more can anybody ask for
>>
>>108747464
so a 2/10 woman with missing teeth and alopecia, but 10/10 hands, whatever that might mean
would you?
>>
>>108747529
I like feet and hands on a pretty woman anon, the whole package is important
>>
>>108747529
would you like a perfect ass or boobs but on an ugly woman? because I sure wouldn't
>>
handfags > fartfags >>> footfags
>>
>random fart insertion
>>
>>108747076
WAIv17 SDXL until i reach 100 year old
>>
>>108746822
Look unholygrail finetune for realism, it works great but need LLM prompting, don't even try to type yourself.
>>
>>108747672
post something
>>
>>108747696
something
>>
>108747672
Cool LLM bot bro, but maybe lower the temperature.
Or better reconsider spamming the thread?
>>
File: ComfyUI_00632_.jpg (2.9 MB, 3286x4096)
2.9 MB JPG
>>108747723
Anima to hunolygrail to zit to your mother butthole
>>
sometime I don't even get if it's insults or a stroke
>>
firing up the kino factory
>>
you guys really don't deserve my premium goon loras
>>
>>108747672
>need LLM prompting
does it really? i like typing my prompts. i'll give it a go.
>>
>>108747842
your loss. i am literally subscribe $5/mo to a lora creator. its like you dont like making money or something
>>
HOLY FUCK GPT IMAGE 2 LETS YOU INPAINT VIA WEB FUCKFUCKFCUK
>>
>>108748001
WHAT I MEAN IS THAT I DON'T FUCKING NEED ANIMA IF I CAN INPAINT WITH THE SOTA IMAGE MODEL
>>
>>108748024
>>108748001
fuck off
>>
>>108748043
they're paid shill or literal bots.
>>
why does preview3 feels like regression from preview2?
>>
>>108748050
It's trolling for the luls, actually.
>>
>>108748067
Probably trani baiting again but I noticed some characters becoming less consistent.
I am still cautiously optimistic for Preview 4 / final release though.
>>
i thought trolling was supposed to make me mad. amateur hour.
>>
>>108748081
my biggest gripe is the compositional variety between seeds becomes more stiff and boring.
>>
we are losing the lora wars with /hgg/...
>>
>>108748128
then make a jav lora to beat them
>>
>>108748142
nah, i have a better plan, now i'm go to sleep, but you will hear from me soon...
>>
>>108748152
mindset of a loser
>>
>>108748156
nah low effort is my motto, with low effort I was able to beat several anime generals single handedly
>>
>>108748165
you're the blacked poop spammer?
>>
>>108748168
nah, i never posted a gen of mine
>>
>>108747358
I've been a hand fetishist ever since the time I noticed how dainty and soft my sister's hands are. You watch a high-quality woman's hands as she's talking and they're small like a child's, but they gesticulate with the thoughtfulness and maturity of an adult. To watch them produces a strange feeling.
>>
>>108747976
yeah low tier gooners like you do not deserve my attention
pleb kek
>>
File: 1757860350829612.jpg (1.14 MB, 2400x3506)
1.14 MB JPG
>>108748067
should I get preview2 for more soul? I only heard about it when preview3 came out.
>>
>>108748443
saggy hag tita
>>
File: 1752803506797265.png (768 KB, 1024x1024)
768 KB PNG
>The model you've all been waiting for
>flux chin
>plastic skin
>"low quality = realism" slop
lmfao my ass off
https://www.reddit.com/r/StableDiffusion/comments/1t30xtp/release_the_model_youve_all_been_waiting_for/
>>
>>108746031
I want to make a LoRA but I don't want to waste time on Preview 3 if Preview 4 will be out next Tuesday as all the Russell monthly updates.
>>
>>108748511
may as well go ahead and curate the dataset at least
>>
Fresh

>>108748625
>>108748625
>>108748625
>>108748625

Fresh
>>
>>108746535
how was this pic made?
>>
>>108746105
naking characters with kisekae and koikatsu



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.