[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion and Development of Local Image, Video, and Music Models

Previous: >>109195112

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://huggingface.co/models
https://civitai.com
https://civitaiarchive.com
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Krea 2
https://huggingface.co/krea/Krea-2-Raw
https://huggingface.co/krea/Krea-2-Turbo

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Feet gens only
>>
>inb4 n*gbo
>>
when will we be rid of the lora scourge?
>>
>>109198744
>the #1 benefit of local models is that you can train them
>loras are the easiest way to train them
>this is actually a bad thing
>>
I have finally cracked the code.
These threads are created and maintained by civitai employees.
>>
best realistic goon model?
i'm not a vramlet
>>
>>109198798
Krea2
>>
>>109198798
I'm leaning towards SAManima+krea combo.
but SAManima alone is also probably just good enough and 3x faster
>>
I want to extend my wan 2.2 gens, 15-30 seconds if thats possible on a 3090. I read something about SVI it sounded wrong, the extend/animate nodes confused and scared me. what's the consensus.
>>
File: 160024CUI_00001_.png (1.61 MB, 1152x1536)
1.61 MB PNG
I wanted to try Krea2 but the 12gb AMD card plus 16gb system RAM combo is brutal.
>>
>>109198968
>16gb system RAM
wtf it's not 2010 anymore
>>
can you train krea loras with 12GB VRAM and 32GB RAM?
>>
>>109199006
(model)krea2_raw_bf16.safetensors is 24GB.
You have 12 VRAM.
24-12 = 12GB is off loaded to system ram.
You have 32 System ram.
32-12 = 20 system ram left over.

(text encoder) qwen3vl_4b_bf16 is 8.2GB.
You have 20GB System ram.
20-8.2 = 11.8 system ram left over.

Windows 11 typically uses around 6GB ram.
You have 11.8GB System ram.
11.8 - 6 = 5.8 system ram left over

Yes, though it will be very slow, approximately 6 hours.
>>
>>109199040
>(text encoder) qwen3vl_4b_bf16 is 8.2GB.
you can cache the text encoder thougheverbeit, just like vae latents
>>
>>109199046
cached means its sitting in ram. this is why i mentioned system ram and not vram, because it wont fit
>>
>>109199057
when you cache the TE you store only the resulting embeddings, not the text encoder itself, no?
>>
can you train krea loras with 96GB VRAM and 64GB RAM?
>>
>>109199006
Ask Claude.
>Yes — it's been done, though which tool you use matters a lot right now, and it won't be fast.

>**ai-toolkit currently has a documented problem at 12GB.** As of late June 2026, at least one person hit an immediate OOM trying to load Krea 2 into ai-toolkit on a 12GB card — not during the heavy forward/backward compute, but literally while the model was still being loaded into a trainable state. Their conclusion was that ai-toolkit's current Krea 2 integration doesn't yet have effective VRAM offloading during the main training loop, and they gave up on that path for 12GB specifically.

>**musubi-tuner is the documented working path at 12GB.** Someone else successfully trained a Krea 2 Raw LoRA on exactly your VRAM budget using musubi-tuner — the official Raw checkpoint plus the Qwen3-VL text encoder in bf16 (not even quantized) and the Qwen Image VAE, on 20 training images. This tracks with what we found earlier in this conversation: musubi-tuner's block-swapping/offloading design (built originally for video models like Wan and HunyuanVideo, which are much heavier than this) seems to be more mature for squeezing large DiTs into small VRAM budgets than ai-toolkit's Krea 2 support currently is.
>>
What are some recommendations for clothes remover workflows or pipelines? What checkpoints and models are being used by the sites that do these things?
>>
>>109199072
absolutely not. sorry bud
>>
>>109199086
>clothes remover workflow
what
klein 9b can do that easily with any nsfw lora. its that simple. no fancy tools needed.

t. undressed hundreds of women
>>
File: ComfyUI_00767_.jpg (1.75 MB, 2024x2696)
1.75 MB JPG
>>
>>109198980
>In 2010, mainstream consumer PCs were utilizing DDR3 RAM, which cost an average of $12 to $15 per gigabyte ($100 for a standard 8GB kit). Fast-forward to 2026, and the dominant technology is DDR5 RAM, which has ballooned to $11 to $18 per gigabyte ($350 to $600 for a standard 32GB kit) due to an intense, AI-driven global memory drought.
He's lucky he has that much.
>>
i cannot get krea to print the word slut
whore works sometimes but slut has never worked
>>
>>109199102
>$12 to $15 ballooned to $11 to $18
which dumb ass LLM wrote this?
>>
>>109199105
why waste time using krea to generate words or text. use klein for that. its an edit model. it can write any word or phrase you want without modifying the existing image.
>>
>>109198838
>SAManima
??
>>
File: 914615713.png (2.08 MB, 1536x1280)
2.08 MB PNG
>>
>>109199096
any that makes them wear sexy underwear instead? (and it doesn't looks like samebody)
>>
>>109199112
it would be convenient because it's doing everything else, but fair point. if it can't do it then it can't do it
>>
>>109199105
try alternative characters that look the same.
>>
>>109199116
SAM = Meta's Segment Anything Model. It's used with detailers. Anon is retarded and thinks Anima has better realism than Krea2.
>>
>>109199105
Wait for someone to make a finetune or lora for that, it seems like "cum" or "slut" and so on were censored either in the dataset or by whatever they did before releasing the model.
>>
>>109199105
>he still doesn't understand how the censorship was trained into krea2
>>
>>109199127
Yes, clothing loras exist, though you should be able to use whichever outfit you want as a reference image and it can put it on the person/character.
>>
File: 163237CUI_00001_.png (989 KB, 1152x1152)
989 KB PNG
>>109198980
>>
>>109199116
https://civitai.red/models/2409949/sam-anima-realistic?modelVersionId=3060073
>>109199133
You're retarded.
>>
>>109199138
I thought it was fixed already with the decensor lora
>>
Where is Debo, I want to see his cool spaceships
>>
File: 1776777048321654.png (65 KB, 333x498)
65 KB PNG
>>109199142
Let me guess, you fell for the "I'll never need that much ram" meme. Now you suffer.
>>
>>109199133
thanks but I don't get how segmenting would help here

>>109199150
ok now it makes sense, thanks anon
>>
>>109198423
>Kiwi
Tell me you're a tranny without telling me you're a misinformation spreading tranny
Fuck off this is disabled Californian neet spam that has been a fixture in all these threads for almost 5 years now
We are the top space in image generation and it makes faggots upset.
>>
>>109199150
SAM in the AI space is referring to segmentation.
You said SAManima, not Sam Anima(Not acronymed). Not my problem an idiot called their shitmix the same as a popular AI utility term.
>>
>>109198697
>mfw Resource news

07/04/2026

>Qwen3.5 INT8 ConvRot Text Encoders for ComfyUI
https://huggingface.co/Winnougan/Qwen-3.5-INT8-Convrot-Comfy

07/03/2026

>Krea-2 Depth ControlNet-LoRA
https://huggingface.co/Patil/Krea-2-depth-controlnet

>Multi-Resolution Flow Matching: Training-Free Diffusion Acceleration via Staged Sampling
https://github.com/Xingyu-Zheng/MrFlow

>DiffRGD: An Inference-Time Diffusion Guidance Through Riemannian Gradient Descent
https://diffrgd.github.io

>Representation Distribution Matching for One-Step Visual Generation
https://alan-lanfeng.github.io/rdm

>SAB-LVLM: Significance-Aware Binarization for Large Vision-Language Models
https://github.com/LyuQi127/SAB_LVLM

>Style-CCL: Content-Preserving Style Transfer via Curriculum Continual Learning
https://github.com/witcherofresearch/Qwen-Image-Style-Transfer
https://github.com/Tele-AI/TeleStyle

>ByteDance-Seed / PAR
https://huggingface.co/ByteDance-Seed/PAR

07/02/2026

>PAPA: Online Personalized Active Preference Alignment
https://github.com/NasikNafi/papa

>Condensing Large-Scale Datasets Directly with Minimal Information Loss
https://github.com/LINs-lab/CIM

>VisReason: A Large-Scale Dataset for Visual Chain-of-Thought Reasoning
https://y-research-sbu.github.io/VisReason

>Asset Generator for 2D & 3D: Blender add-on that generates assets from text prompts
https://github.com/tin2tin/Asset_Generator-2D-3D

>ComfyUI-TrixLoader: All-in-One Image Loader, Editor, and Resizer node for ComfyUI
https://github.com/trx7111/ComfyUI-TrixLoader

07/01/2026

>Elastic Diffusion Transformer: Accelerating SOTA generation models
https://github.com/wangjiangshan0725/Elastic-DiT

>Boogu-Image-0.1-Edit-Turbo
https://huggingface.co/Boogu/Boogu-Image-0.1-Edit-Turbo

>GEAR: Guided End-to-End AutoRegression for Image Synthesis
https://github.com/Tencent-Hunyuan/GEAR

>SpheRoPE: Zero-Shot Optimization-Free 360 Panorama Generation with Spherical RoPE
https://orhir.github.io/SpheRoPE
>>
>>109199177
>mfw Research news

07/04/2026

>Visual Semantic Entropy: Do Vision Language Models Recognize Visual Ambiguity?
https://arxiv.org/abs/2606.31407

>PhotoQuilt: Training-Free Arbitrary-Resolution Photomosaics via Bootstrapped Tiled Denoising
https://kooroshrh.github.io/photo-quilt

>MindFlow: Harmonizing Cognitive Semantics and Acoustic Dynamics for Facial Animation Generation in Dyadic Conversations
https://arxiv.org/abs/2606.27779

>Gradient Smoothing: Coupling Layer-wise Updates for Improved Optimization
https://arxiv.org/abs/2606.30813

>Rank-Aware Hyperbolic Alignment for Vision-Language Dataset Distillation
https://andyj1.github.io/raha

>On Test-Time Scaling for Vision-Language Models
https://arxiv.org/abs/2606.28864

>Clearer Sight, Fewer Lies: Oriented Pickup Preference Optimization for Multimodal Hallucination Mitigation
https://arxiv.org/abs/2606.29805

>Steal the Patch Size: Adversarially Manipulate Vision-Language Models
https://arxiv.org/abs/2607.00174

>Spatially Localized Image Degradation Embeddings for Image Quality Assessment
https://arxiv.org/abs/2606.29162

>NURBS Splatting: A Unified Differentiable Rendering Framework for Vector Graphics
https://arxiv.org/abs/2606.31764

>$μ$Flow: Leveraging Average Images for Improving Generalisation of Deepfake Faces Detectors
https://opontorno.github.io/MuFlow

>SPECSIA: Stylization Dataset for Novel-View Enhancement in Drawing-based 3D Animation
https://arxiv.org/abs/2607.00525

>Resonant Brane Splatting for Arbitrary-Scale Super-Resolution
https://arxiv.org/abs/2606.29453

>When Sinks Help or Hurt: Unified Framework for Attention Sink in Large Vision-Language Models
https://arxiv.org/abs/2604.03316

>Stateful Token Reduction for Long-Video Hybrid VLMs
https://arxiv.org/abs/2603.00198

>Universal Image Immunization against Diffusion-based Image Editing via Semantic Injection
https://arxiv.org/abs/2602.14679
>>
>>109199177
Fuck off
>>
File: acktually.jpg (31 KB, 512x512)
31 KB JPG
>>109199166
>SAM in the AI space is referring to segmentation.
>>
>>109199177
>>109199184
>family left him with his caretaker during the holidays again
Lol suffer
Also stop spreading malware and we might feel bad for you
>>
File: ComfyUI_00797_.jpg (1.71 MB, 2696x2024)
1.71 MB JPG
>>109199105
>>
File: 164428CUI_00001_.png (1.52 MB, 1152x1152)
1.52 MB PNG
>>109199154
It is what it is. The prices will go down soon, I'm sure.
>>
File: 1746897372678997.png (1.91 MB, 1024x1536)
1.91 MB PNG
>>109199209
>soon
>RAM prices expected to rise another 40-50% in Q3 2026, and then 30% more in Q4 as AI demand outpaces supply


https://www.techspot.com/news/112934-ram-prices-expected-rise-another-40-50-q3.html
>>
>>109199108
The shit they use for Google now, allegedly 3.5 Flash. And here's the link it took the data from https://dam.stanford.edu/memory-prices.html
>>
File: slut.jpg (857 KB, 1760x1760)
857 KB JPG
>>109199195
>>109199105
>>
Krea 2:
>>109197616

extra finger.
>>
>>109199222
no way.

short humerus.
>>
File deleted.
>>109199222
isnt future RAM production all bought out til 2030 or something crazy?
>>
>>109199265
Yes
https://www.reddit.com/r/homelab/comments/1uhsrew/ram_ssd_prices_will_remain_high_atleast_till/
>>
delayed gratification is a high IQ trait
>>
File: 1777378168419359.png (240 KB, 1691x742)
240 KB PNG
Does the new manager in ComfyUI not support managing models anymore? I can only see options for Node Packs and Nodes. Tried it with security_level=weak but I still couldn't see models.
Idk what I'm doing wrong.
>>
Two years ago we had Flux1, wow what a time to be alive!
>>
Four years ago we had SD1.5, that peak has never been surpassed.
>>
>>109199140
I'll try it, hopefully it won't give the same body
>>
Remember when this was mind blowing? https://thispersondoesnotexist.com/
>>
>>109199222
Bullshit.
>>
>local
uhm, maybe you have more fun swimming in shit?
>>
>>109199369
It's still kind of cool, but I'd rather gen an impossible genetic lottery winner than my neighbor.
>>
File: ComfyUI_00744_.jpg (1.72 MB, 2696x2024)
1.72 MB JPG
>>
>single reddit post has more info on krea2 than lurking the thread for a whole week
wtf I thought we were supposed to be the knowledgeable ones?
>>
>>109199451
This isn't sharpness or improved texture, this filter is some kind of cheap magic trick.
Come on, do better!
>>
>he thinks 4chin is knowledgeable
>>
What finetunes or loras are good at improving realism and skin texture for Z Image Base? IJuggernaut Z is better, but still leaves alot of room for improvement
>>
File: xi.png (1.02 MB, 1024x768)
1.02 MB PNG
>>
>>109199478
cant' you hi res fix in Z?
>>
>>109199451
Her humerus shrank in the second panel.
>>
>>109199453
why would anyone waste their time posting long informative messages here when it'll just get drowned out by low effort 1girl slop.
>>
>>109199478
don't necessarily need different tunes just different sampler settings.
>>
>>109199177
>>109199184
Thank you for the news!
>>
>>109199096
I've undressed hundreds of women irl, no lora required
>>
even comfy anon doesnt bother with this place anymore
>>
>>109199514
yeah fuck julien for that
>>
>>109199184
MALWARE spam, careful people!
>>
>>109199495
Noo noo not the tiny humerus!!!
>>
Hopefully someone will post a short humerus female-ish thing with a realistic head and ambiguous torso anatomy, and legs which do not match.

I'm keeping my hopes up.
>>
>>109199453
Nah, you are just a fucking idiot
>>
asymmetrical small humeri are my fetish
>>
>>109199542
lmao
>>
>>109199542
You think that speaking Italian makes you better than the rest of us?
>>
>>109199559
oui
>>
>>109199473
>>
>>109199567
that was written by an anon
>>
>>109199588
this was written by a homosexual
>>
zitface woman consented.
>>
>>109199588
https://arxiv.org/pdf/2604.12190
>>
File: x.png (1.51 MB, 1216x832)
1.51 MB PNG
>>
File: ComfyUI_temp_yeepp_00050_.png (3.44 MB, 1400x1800)
3.44 MB PNG
Is it safe to say that all krea bypass loras, enhancers, etc. suck? All of them I've tried change the prompt adherence too much, or fuck up the text and even the slightest change of any value changes the output compared to just using a nsfw/regular lora.

So far I've tried
bypasslora-vector3
skc3vo
Conditioning Krea2 Balance node
ComfyUI-Krea2T-Enhancer
>>
>>109199644
Yeah they suck. Ive had better results using few loras with low str
>>
>>109199644
https://huggingface.co/Beinsezii/Krea-2-Turbo-Projector-Scale-LoRA-Diffusers
it makes prompt adherence better for me even at just 0.1 strength, with actual expressions and spicy actions that will not go through without the lora.
>>
>>109199644
Funnily you kinda don't even need any loras or bypass filters with Krea if you just i2i with low enough denoise. The fun parts won't be so detailed but they are there and it knows the poses very well.
>>
what are the best modern models, both image, and I2V, for a 12GB VRAM, 64GB RAM haver. I've been away for so long. the training community seems like a divided mess between a half dozen different base models.
>>
>>109199665
The lora anon made and shared here is way better than this
>>
>>109199673
where
>>
File: debo_iu_k2_00007.png (2.73 MB, 930x1024)
2.73 MB PNG
>>109199511
:)
>>
File: ComfyUI_00803_.jpg (1.89 MB, 2024x2696)
1.89 MB JPG
>>109199644
For me, the battle is trying to keep my gens sfw so i can keep posting kinos.
Stack a couple loras and you get nudity without even trying
>>
>>109199709
>seven toes
a record?
>>
File: Krea2_turbo_00201__80.png (2.83 MB, 1338x2010)
2.83 MB PNG
>posting on 4chan means you smart
lol
>>
>>109199099
>>109199644
Stop using the Turbo model. Use RAW model with a turbo LoRA. It does not suffer from this terrible noise.

>bypasslora
Try vector2.
>>
>>109199511
>>109199701
What is the point of this ritual posting? Every thread like bots.
>>
>>109199721
>Use RAW model with a turbo LoRA. It does not suffer from this terrible noise.
drop a flow
>>
>>109199721
It does to some degree when you use the NSFW loras.
>>
>>109199720
>meat tube anatomy

what a joke
>>
File: ComfyUI_temp_xpcyr_00058_.png (2.66 MB, 1120x1440)
2.66 MB PNG
>>109199667
i2i what exactly?, I'm talking straight txt2image

>>109199662
yeh, that doesn't work for me, krea insists on censoring words, for example I want to write "cum eater" in the forehead of a 1girl, the model deletes it

>>109199709
I cannot unsee the krea2 1girl face, also multiple people in one photo have the same issues that all image models have and is that you get cloned faces
>>
File: krea2censorship.jpg (1.91 MB, 2170x1440)
1.91 MB JPG
>>109199721
I already do that, in the 2nd pass the turbo model censors my gen and doing low denoise i2i gives you shitty quality

Hopefully we get a Tile ControlNet model for Krea and that will fix this problem I think
>>
>>109199721
>>109199762
>he has to use 2nd pass
>>
>>109199762
God damn how are people so bad at this shit. Are you underage?
>>
>>109199762
>in the 2nd pass
why a second pass? you can gen at higher res, it won't get all distorted like other models
>>
>>109199746
Well obviously if you are doing t2i that won't work.
>>
>>109199746
>>109199762
It's the text encoder refusing your prompts. The text encoder has already been decensored. Someone posted this a while back, have you tried that?
https://huggingface.co/huihui-ai/Huihui-Qwen3-VL-4B-Instruct-abliterated
>>
I'm gonna take a guess that most of what Krea 2 knows about nudity is from artwork, not photos
it explains why it gets more and more cartoonish as you add more NSFW terms
>>
>>109199786
nta, but I use 2pass to increase output diversity and it really reduces the slop.
>>
he never drops a flow
>>
>>109199826
gg
>>
>>109199826
drop the flow
>>
File: file.jpg (1.03 MB, 4120x1270)
1.03 MB JPG
>>109199721
i prefer turbo, it gives the crisp realism i'm looking for. Raw is too soft for my taste
>>
>>109199762
>2nd pass

What? RAW with a turbo LoRA is single pass.
>>
>>109199839
>>
File: 1782514966937.jpg (830 KB, 1536x1024)
830 KB JPG
>>109199567
>>
>>109199841
>Raw is too soft for my taste
Increase the step count.
>>
>>109199850
all your gens are dog shit bro
>>
>>109199858
cry
>>
>>109199857
I'm not waiting any longer for a gen
>>
>>109199858
ok what gens are mine in the thread?
>>
>>109199865
the dog shit ones
>>
>>109199841
Try negpip
(illustration:-1.2)
And turn up the Turbo LoRA weight.
>>
>>109199869
Point to a not dogshit gen then.
>>
>>109199887
that would be anything in my output folder
>>
>>109199841
>fucked up bedframe
You're using depth controlnet aren't you?
>>
>>109199865
You need 2 pass to make gen image on krea2 and you produce results like this
>>109199746
>>109199762
>>
>>109199903
not my gen.
>>
>>109199721
>RAW model with a turbo LoRA
how is it different from directly using a turbo model?
unless you mean a different weight for the lora?
>>
>>109199948
>unless you mean a different weight for the lora?
yes.
>>
File: ComfyUI_temp_xpcyr_00060_.png (2.52 MB, 1120x1440)
2.52 MB PNG
>>109199777
>>109199786

>>109199826
this anon knows, it should be known by you that any distilled model (aka turbo models) reduce the variability of your gens, aka what you people cried time ago when using ZIT, >muh seed diversity problem


>>109199846
yeh, sorry I missread


>>109199809
Thanks, i'll try that, this is another example If I use a low denoise 2nd pass, the turbo model still tries to censor my gen
>>
>>109199671
Krea 2 for NSWF. Ideogram for some cases.
>>
>>109199809
that only matters if you use the prompt enhancer, it does nothing to the image generation itself
>>
>>109199970
works for me, single pass
https://files.catbox.moe/npieq0.jpg
>>
>>109199952
what weight do you use, anon, and how many steps does it need to compensate for lower weight?
>>
>>109200010
see >>109199850
>>
>>109199841
lowk just looks like diff sampler combo lmoa
euler sgm_uniform is pretty based on the turbo
also somehow nvfp4 with those shitty loras looks better than the bf16 model because that detail adds nothing but crap, and fucks up skin way faster because of the nsfw bypass and jeetbake loras
>>
File: debo_iu_k2_00015.png (2.73 MB, 1024x930)
2.73 MB PNG
>>
>>109200013
thanks anon, so ~0.5 weight, and ~1.5x the steps
>>
>>109200043
It's not an exact science. feel free to experiment and find what works best for you.
>>
File: krea2censorship2.jpg (2.16 MB, 1680x2160)
2.16 MB JPG
>>109199990
Nice anon
>>
>>109199644
>try conditioning control node
>uncensored but shiny wet slopped skin
>completely ignores all style prompt
pain.
>>
File: ComfyUI_00841_.jpg (1.09 MB, 2024x2696)
1.09 MB JPG
Krea 2 grid removal, wire it after vae decode
https://pastebin.com/v7y1z0SH
>>
File: Ideogram__00836_.jpg (2.06 MB, 2880x2160)
2.06 MB JPG
>>
>>109199977
ty anon. I'll give Krea a shot and also check out ideo.

Is wan2.2 still the best when it comes to video gen? or have LTX and the other options caught up? I remember 2.2 having great gens but being very slow on my hardware at launch.
>>
>>109200080
you remove the blur and you'd get an image almost imperceptible from a real one, keep posting those ideogram images anon, it's probably the only local model that can reach the quality of the best API models, really impressive
>>
>>109200080
slopped shiny plastic skin
>>
>>109200122
real images have blur
>>
>>109200126
?????
>>
>>109200070
the author of that repo went full schizo mode, this is his latest update, but I'll try it
>>
>>109200122
>API models
>Quality
Lol
>>
>>109200126
where?
>>
File: 1762336086085334.png (2.1 MB, 999x999)
2.1 MB PNG
>>109200130
not all real images have blur, you know that right?
>>
>>109200077
>grid removal
what grid?
>>
>>109200146
>posts image with tons of blur
>>
>>109200146
notice the lack of the word "all" in my comment
>>
>>109200126
https://www.instagram.com/i_am_kiko/
https://www.instagram.com/nozomisasaki_official/
https://www.instagram.com/hikari/
https://www.instagram.com/yuka/
>>
>>109200143
>>
>>109200152
then I'll say it for you, an amateur photo of people who aren't close to the camera don't have background blur, ALL of them
>>
>>109200154
gross why would you send me gore
>>
>>109200160
slop is realism lil bro
>>
>>109200159
post a non blurry image
>>
>>109200143
Seems similar to Redux, not bad. but not exactly SOTA at keeping identity.
>>
>>109200154
>7.4M followers
>The Instagram handle @i_am_kiko belongs to Kiko Mizuhara (born Audrie Kiko Daniel), a highly prominent American-Japanese model, actress, singer, and designer.

10/10 on muttland
kek
>>
File: 913457362035731.png (1.88 MB, 1152x1728)
1.88 MB PNG
>>
>>109200169
slopism filtered asian body horrors
>>
>>109200170
>I don't know how real photos look like, can you show me the way?
jesus dude, that's sad, I don't even want to argue with you anymore, what are you doing with your life?
>>
File: ComfyUI_temp_xpcyr_00070_.png (3.91 MB, 1344x1728)
3.91 MB PNG
>>
>>109200179
>can't post a non blurry image
>gets mad
haha get epically pwned
>>
File: 471316570161938.png (1.77 MB, 1024x1600)
1.77 MB PNG
>>
>>109200184
>mad
I said "sad", do you know how to read english?
>>
>>109200179
that was a different anon
is that tiny amount of blur really impossible at that distance?
>>
>>109200192
BOOM HEADSHOT.
you have been pwned by the pwnmaster.
get real.
>>
>>109200183
can you give her a vax scar on the side of her shoulder and bimbo lips pls
>>
File: Ideogram__00840_.jpg (2.3 MB, 2880x2160)
2.3 MB JPG
>>109200122
I like the grungy aesthetic, makes it feel more authentic. But here's the same seed with the analog photo references removed
>>
File: real image.jpg (510 KB, 2880x1800)
510 KB JPG
>>109200196
>is that tiny amount of blur really impossible at that distance?
I think the background blur is too strong to be a real image yeah, they really ought to train their models with more amateur photos imo
>>
>>109200216
that looks better thanks
>>
>>109200224
I liked the first one better.
>>
>>109200218
nice blurry photo
>>
>>109200228
I liked the second one better.
>>
kinda crazy how many hoops we have to jump through just to make the model write the word "CUM"
>>
>>109200080
>>109200216
How long does it take to make an image anon? They look good.
>>
>>109200117
I don't gen video that much but ltx 2.3 distilled seems better than the wan 2.2 equivalent.
>>
>>109200246
20 minutes
>>
>>109199841
>crisp realism
ahahahahah lmao what
>>
>>109200218
>I think the background blur is too strong to be a real image yeah
...thought you'd have some technical knowledge on lenses and shit to prove that blur was unlikely or something
>>
>>109200254
>20 minutes
Reminds me of my sd1.5 days where it took my 1080 20min to latent upscale my gens.
>>
>>109200183
lora hand. pasted look.
>>
I prefer the third one which is about to drop
>>
File: Ideogram__00689_.jpg (2.42 MB, 3840x2160)
2.42 MB JPG
>>109200246
4 Minutes for 2880x2160. Full 4k takes 6 minutes kek. 16GB VRAM
>>
>>109200254
Kreashill please
>>109200284
didn't know it could do native 4k, nice
>>
File: 1762101702360070.jpg (247 KB, 824x1912)
247 KB JPG
>>109200245
>>
>>109200246
>They look good
wat
>>
>>109200290
>Kreashill please
dem niggas live rent free in your head
I'm a cosmos chad.
>>
File: you heard me.png (691 KB, 1080x783)
691 KB PNG
>>109200294
>>
>>109200183
dog shit image with two pass
stop posting bro
>>
I unironically no longer know what the average real photograph looks like
>>
>>109200292
>CUMM
it still can't write it bro I'm crine ;-;
>>
>>109200301
dog shit post with two asses
stop posting troon
>>
File: ComfyUI_temp_iibop_00002_.png (2.72 MB, 1568x1340)
2.72 MB PNG
>>
File: 04-38-2026.jpg (1.87 MB, 1236x1848)
1.87 MB JPG
>>
File: 915851230646688.png (1.9 MB, 1024x1600)
1.9 MB PNG
>>
>>109200320
this shit node makes it even more slopped though, Krea isn't known to have great skin but with that you end up into some Flux territory in terms of plastic slop, definitely not worth it
>>
File: ComfyUI_temp_yhhjv_00005_.png (3.2 MB, 1286x1916)
3.2 MB PNG
>>109200320
>>109200330
Strength 1.
still giga slopped
>>
>>109200330
your slop is slopped sloppy joe
>>
File: 250653243920887.png (1.63 MB, 1024x1600)
1.63 MB PNG
>>
File: ComfyUI_temp_oqodh_00003_.png (2.77 MB, 1120x1440)
2.77 MB PNG
>>109200292
Krea2 hates the word cum

>>109200320
Whats the prompt? It seems that it fucked up the background, broke the handcuffs, change the style of the gen, conditioning changes the image too much if you only wanted a coherent text, I wonder if we could pinpoint what sigma controls the text or something like that
>>
>>109200352
>A gritty police mugshot photograph of a Hatsune Miku cosplayer standing against a grey height-marker wall in a dimly lit, industrial precinct room. The cosplayer features her iconic long teal twin-tails which are now messy and slightly frizzy from distress, paired with her signature futuristic outfit that appears disheveled. Her face is contorted in deep sorrow, eyes brimming with tears and visible wet tracks running down her cheeks as she sobs under the weight of the situation. Her wrists are bound in heavy metallic silver handcuffs, held out to the sides as she grips a white cardboard placard with both hands. The sign clearly displays the words "CUM SLUT" written in thick, jagged black marker. The lighting is harsh and overhead, creating a stark, high-contrast aesthetic that emphasizes her raw facial expression and the cold atmosphere of the booking area. The shot is a centered medium close-up, capturing the sharp details of her smudged makeup and the realistic texture of the cardboard sign.
>>
File: 249564648787181.png (1.92 MB, 1024x1600)
1.92 MB PNG
>>
>>109200349
Who hurt you, anon?
>>
>>109200154
Nta, and that is that nice film photograph you created with Ideogram, but can you create realistic looking amateur photography with it, similar to
https://www.instagram.com/nozomisasaki_official/p/DYzFT1mD_7r/?img_index=1

Pic rel is from my Chroma-Krea wf

Also there's lots of variation of detail on skin tone just by changing seed slightly, other parameters
https://files.catbox.moe/y8r62r.png
https://files.catbox.moe/5rq8jt.png

This is sort of thing the base model can absolutely not do on its own without additional help from a model like Chroma.
>>
>>109200328
dogshit fuzzy hair every single time
>>109200320
two pass, still looks like shit

stop posting
>>
https://github.com/blue-pen5805/ComfyUI-krea2-negpip
https://files.catbox.moe/n5j4no.jpg
https://files.catbox.moe/02pf5e.jpg
the only difference is having (close-up, illustration, anime, cartoon:-1.0) in the prompt
Krea 2 defaults to cartoonish proportions and actual cartoons/anime with certain combinations of words for some reason
it can be mitigated with photography terms but that clutters the prompt when "photograph of" should be all that is needed.
>>
I love 1girl.
>>
>>109200395
>https://files.catbox.moe/n5j4no.jpg
>https://files.catbox.moe/02pf5e.jpg
My kind of proportions right there.
>>
File: ComfyUI_temp_juqtg_00026.jpg (1.68 MB, 1400x1800)
1.68 MB JPG
>>
>he has to add grains to convince people it can generate realistic image
>>
>>
>>109200387
Can you point on the doll where the 2pass touched you?
>>
>>109200410
I agree with you on this.
>>
File: ComfyUI_00852_.jpg (947 KB, 2024x2696)
947 KB JPG
>>109200148
Every Krea 2 gen has a 2px grid for some reason
>>109200216
Ideogram is really good at making people look like action figures
>>109200257
Don't worry about it
>>
>he has to add low quality phone image on prompt because it can't generate high quality image
>>109200411
look at the dog shit fuzzy hair
>>
>>109200411
catbox?
>>
>>109200418
i'm a god master of gens. you cannot harm me with your pathetic human words
>>
>>109200418
>look at the dog shit fuzzy hair
low quality cosplay wigs look like that
>>
>>109200417
krea is garbage. What you re posting looks incredibly bad, so bad that my limbic system can't even respond.
>>
>>109200440
>he still has a limbic system
ngmi, lil blud
>>
krea makes deformed meat bags.

Real porn is not threatened by krea, that's for certain.
>>
>>109200417
That's pretty ridiculous she has 3cm hair growing out of her ass? lol
>>
File: ComfyUI_00759_.jpg (1.77 MB, 2024x2696)
1.77 MB JPG
>>109200449
Some women don't shave
>>
that anon just told on himself so hard
>>
>>109200462
u r virgin
>>
>>109200417
>that ass hair
DISGUSTING
>>
You're just autistic, and autisim disrupts the ability to see in a reliable way.

That's why you can't tell the limbs are distorted, it's because autism is a form of brain damage.
>>
1 and a half year old SDXL models can do better job than Krea2.
>>
>>109200369
prompt?
>>
>>109200483
until it can't
>>
>>109200483
now try having a real background
>>
>>109200483
You'd have to upscale this at least 4x for that argument even start to be legit
>>
>>109200483
If you're joking and that's Krea 2, then this looks nice, and it's a first for it not to be so ugly I had to hide the thumb
>>
>>109200483
I'm a Krea 2 hater but c'mon, SDXL is deprecated for realistic shit, it has a dogshit VAE
>>
File: Krea2_turbo_00211__80.png (2.47 MB, 1338x2010)
2.47 MB PNG
>>
>>109200511
sdxl will randomly manage zingers.
>>
>>109200517
I used hide post. It's ugly.

This is unhidden:
>>109200483

It looks like an actual human being.

krea is so bad, like ugly women in porn are better, at least they aren't goop-boned.
>>
>>109200523
i'm a male pornstar and you're wrong
>>
I want to get paid to hate on Krea too
>>
File: ComfyUI_00859_.jpg (1.35 MB, 2024x2696)
1.35 MB JPG
>>
>>109200507
you can reverse search it. It's on civitai
>>
File: damn.png (233 KB, 460x336)
233 KB PNG
>>109200533
Wait you guys are getting paid?
>>
File: 1781315130216050.mp4 (3.39 MB, 1824x2048)
3.39 MB
3.39 MB MP4
>"she is naked from behind"
>hmm, what's that? You want me to invent lots of clothing from behind?
le sigh
>>
>>109200507
https://civitai.red/images/125496376
>>
>>109200190
hot cha cha cha
>>
BLUE 10 done.
Fixing
RED 10.

In the coming days, Lord willing, I'll have done YELLOW 10 and GREEN 10, plus 12 PEOPLE.
>>
>>109200535
>man hands

I'm afraid you will never make it kid
>>
>>109200548
no alien hebe ass for you!
>>
>>109200555
https://youtu.be/68ugkg9RePc?t=33
>>
>>109200555
ACEStep XL is neat but local music needs a sound quality upgrade
>>
>>109200548
Not repulsive so I have to hide it, like krea.

>>109200550
Sure enough.
>>
File: DAMN IT.gif (513 KB, 220x220)
513 KB GIF
>>109200558
>>
>>109200541
supergirl is so hot
>>
File: Krea2_turbo_00216__75.png (2.43 MB, 1254x1884)
2.43 MB PNG
>>
>>109200535
catbox?
>>
>>109200555
based music sperg
>>
>>109200533
Next time, don't go for Qwen VAE when Flux.1 vae existed, don't put safety filters, and give us the style transfer adapter, and maybe your model wouldn't be as hated
>>
>>109200567
suno and udio have way worse vocals, it's not even close.

I'm really singing focused.
>>
File: Ideogram__00849_.png (3.38 MB, 1392x1728)
3.38 MB PNG
>>109200377
I didn't post those links, I don't try get drawn into thread slap fights and I don't know why Ideogram sets off a lot of anons in these threads. I like both Krea and Ideogram. Genuine amateur looking isn't one of Ideogram's strong points
>>
>>109200573
I guess it's empowering for t-rex to get a job as a model.
>>
>>109200535
You can tell it's realism engine because the tears look like cum.
>>
The shitty frizzy hair on Krea 2 is starting to piss me off. Also second pass (or a high resolution) sometimes frying the image and starts to add moles or droplets of water/sweat randomly.
>>
>>109200580
>Next time, don't go for Qwen VAE when Flux.1 vae existed, don't put safety filters, and give us the style transfer adapter,
They have all of this in their internal models which to them is all that matters.
>and maybe your model wouldn't be as hated
It is a massive success as far as they're concerned.
>>
>>109200597
>It is a massive success
keep telling yourself that lilbro
>>
>>109200596
skill issue maybe anima is more your speed big bubba
>>
>>109200596
>The shitty frizzy hair on Krea 2 is starting to piss me off.
You can't do miracles with this mid VAE
>>
>>109200548
Scail2?
>>
chinese vae drones are blasting my american flags.... ON MY 4TH OF JULY?
>>
I wonder what kind of flesh-style tube krea enthusiasts will come up with next.
>>
>>109200597
>They have all of this in their internal models which to them is all that matters.
The ideogram team didn't do that btw, they gave us the same exact product as their API, Ideogram is based, and Krea is cringe, facts.
>>
>>109200602
Did you miss the next part where I said "as far as they're concerned"?
>>
zitjeet woke up to cry about a model he cant run all weekend (exactly as predicted)
>>
File: Ideogram__00852_.png (3.03 MB, 1392x1728)
3.03 MB PNG
>>109200582
This one is a bit better but still looks like a bit too much like a glossy magazine print
>>
>>109200619
idiotgram and kreange
cosmos chads on top
>>
File: kekekekek.png (86 KB, 1617x386)
86 KB PNG
>>109200620
>as far as they're concerned
how do you even know something like that?
>inb4 some cope twitter posts
do you seriously believe they're gonna say that the reception was not as good as expected?
https://www.reddit.com/r/StableDiffusion/comments/1ucqr8x/comment/ot8qntr/?context=3
Oh wait he did say something like that lmao
>>
>>109200630
I really really wanna set fire to that building
>>
>>109200619
>they gave us the same exact product as their API
where are the BF16 (or higher) weights then?
>>
>>109200642
You don't like asparagus?
>>
>>109200646
makes my piss smelly
>>
>>109200643
maybe they trained it on fp8?
>>
File: file.png (14 KB, 939x142)
14 KB PNG
https://huggingface.co/krea/Krea-2-Turbo
desu
>>
>>109200668
krea bros, how do we respond?
>>
>>109200581
>suno and udio have way worse vocals, it's not even close.

Lol. I love ACEStep but try creating this with ACEStep XL
https://www.udio.com/songs/pHLn2TLb2jSsQfa9CC7S7J

or perhaps
https://www.udio.com/songs/heDdLtrK4wJcbQxShJW7sG

Or perhaps just something like

https://www.udio.com/songs/wzKtEP7gHsuFNGnjvL9GKp

The instrumental variety will never be there...

Even if you train a LoRA, the vocals won't be that quality. SA3 makes good instruments that are on par if not better than Udio, but unfortunately that model can't do vocals. I'm looking forward to the 2.0 ACEStep devs said they will release.
>>
File: The goat.png (159 KB, 2701x617)
159 KB PNG
>>109200668
ZiT won, get over it.
>>
the ball sack
>>
>>109200671
>SA3 makes good instruments that are on par if not better than Udio
But its composition is still nowhere near as ACEStep XL, hence ACEStep is still king of local music (though unfortunate with instruments so far)
>>
>>109200681
you are contending with one of the greatest autists to grace this thread i suggest you retreat
>>
>>109200614
yes
>>
>>109200581
>suno and udio have way worse vocals, it's not even close.
You guys don't know this, but the actual goat is sonauto
https://sonauto.ai/song/2e99edfa-da93-496f-adfd-6b49e9507893
https://www.reddit.com/r/aivideo/comments/1t1tb37/the_jerkson_5_dont_step_on_my_shoes/
>>
I filled your room up with balls, anon!
>>
>>109200707
bitch where's my desk and computer?
>>
>>109200639
>day 1 comment
But I will concede because I thought download stats >>109200668 would be higher. Even taking into account the fact that we don't have numbers for the original magnet (probably low) nor the comfy packaged versions (at most would only double the hf number), I don't think it'll get much higher than what it is now kek.
>>
File: 1592601758587.gif (994 KB, 500x277)
994 KB GIF
you guys say us, to stop nsfw addiction, but don't share your sexy prompts. so i stay with my nsfw prompts, kek. blame yourself, if internet is dirty
>>
>>109200728
>I thought download stats >>109200668 would be higher.
desu I'm not that surprised, and it's not because of what's being argued here, the model is simply too big, if someone wants to make a serious finetune, he won't go for a 12b model that has Qwen VAE when a 9b model that has Flux.2 vae already exists, it's a model with no future
>>
>>109200742
>>109200742
>>
>>109198697
Was pretty good with local a year ago. Now I'm using flow ai for pretty much everything except porn and some dark horror since they have consistent characters and can easily edit pictures with small details.

Will we ever have this kind of control on stable diffusion? I feel like this shouldn't be so hard.
>>
>>109200651
yeah, terrible smell
>>
File: ComfyUI_temp_nxbpe_00007_.png (2.95 MB, 1129x1688)
2.95 MB PNG
Anima into Krea is probably still better for NSFW 1girl type stuff.
>>
File: 1.jpg (263 KB, 1024x1184)
263 KB JPG
Man, Krea is great for multiple characters. This is with 0 character loras.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.