[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


Janitor acceptance emails will be sent out over the coming weeks. Make sure to check your spam folder!


[Advertise on 4chan]


Discussion and Development of Local Image, Video, and Music Models

Previous: >>109148520

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://huggingface.co/models
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Krea 2
https://huggingface.co/krea/Krea-2-Raw
https://huggingface.co/krea/Krea-2-Turbo

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>inb4 nigbo
>>
>>
>>109150562
ewww we got the retarded gay baker again. please kys asap faggot. nobody likes you
>>
>>109150586
I like him
>>
>mfw Resource news

06/27/2026

>ComfyUI-VAEFrequencyBlend: Blend images decoded by different VAEs
https://github.com/thezveroboy/ComfyUI-VAEFrequencyBlend

>Ideogram4 & Krea2 Inpainting with LanPaint Support
https://github.com/scraed/LanPaint/releases/tag/1.5.5

06/26/2026

>OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation
https://correr-zhou.github.io/OmniShow

>Adobe to Acquire Topaz Labs
https://news.adobe.com/news/2026/06/adobe-to-acquire-topaz-labs

>LiveEdit: Towards Real-Time Diffusion-Based Streaming Video Editing
https://live-edit.github.io

>PhysRAG: Enhancing Physics-Awareness in Video Generation via Retrieval-Augmented Generation
https://github.com/sediment1024/PhysRAG

>SAM2Matting: Generalized Image and Video Matting
https://henghuiding.com/SAM2Matting

>Unison: Benchmarking Unified Multimodal Models via Synergistic Understanding and Generation
https://github.com/FudanCVL/Unison

>ComfyUI-AppleSilicon-FP8 - a compatibility layer custom node for Apple Silicon
https://github.com/pawel-mazurkiewicz/ComfyUI-AppleSilicon-FP8

06/25/2026

>Bernini-R — GGUF (high & low noise experts)
https://huggingface.co/neuregex/Bernini-R-GGUF

>Physics Question Scene Graph: Fine-grained Evaluation of Physical Plausibility in Text-to-Video Generation
https://github.com/atinpothiraj/pqsg

>VPA-Guard: Defending and Benchmarking Image-to-Video Generation Against Visual Prompt Attacks
https://huggingface.co/datasets/CSU-JPG/VVA-Bench

>Minimalist Preprocessing Approach for Image Synthesis Detection
https://github.com/vohoaidanh/adof

06/24/2026

>Krea-2-Turbo Training Adapter
https://huggingface.co/ostris/krea2_turbo_training_adapter

>Vera: A Layered Diffusion Model for Content-Preserving Video Editing
https://vera-layered-diffusion.github.io

>Advancing WordArt-Oriented Scene Text Recognition
https://github.com/YesianRohn/WATER

>DramaDirector: Geometry-Guided Short Drama Generation
https://github.com/iLearn-Lab/DramaDirector
>>
File: ComfyUI_Krea_2_00385_.png (2.94 MB, 1256x1672)
2.94 MB PNG
>>
>mfw Research news

06/27/2026

>Customizing Video Portraits via Identity-ActionDecoupling
https://arxiv.org/abs/2606.22347

>Don't Settle at the Mode! Mitigating Diversity Collapse in Pretrained Flow Models via Feature Self-Guidance
https://dont-settle-at-the-mode.github.io

>GeoT2V-Bench: Benchmarking 3D Consistency in Text-to-Video Models via 3D Reconstruction
https://arxiv.org/abs/2606.24829

>NullFlow: One-Step Generative Reconstruction
https://arxiv.org/abs/2606.22696

>Generative Relightable Avatars
https://vcai.mpi-inf.mpg.de/projects/GRA

>LISA: Likelihood Score Alignment for Visual-condition Controllable Generation
https://arxiv.org/abs/2606.27192

>A Test-time Actor-Critic Approach to News Images Generation
https://arxiv.org/abs/2606.21304

>PortraitGen: Exemplar-Driven GRPO with Dual-Reward Guidance for Photorealistic Portrait Generation
https://arxiv.org/abs/2606.26930

>VT-DUDA: Visual Token Conditioning for Diffusion-guided Unsupervised Domain Adaptation
https://arxiv.org/abs/2606.21700

>Forget, Anticipate and Adapt: Test Time Training for Long Videos
https://arxiv.org/abs/2606.26515

>Focusing on What Matters: Saliency-Harnessing Accurate Routing for Diffusion MoE
https://arxiv.org/abs/2606.26938

>Training-Free Semantic Correction for Autoregressive Visual Models
https://arxiv.org/abs/2606.22550

>FlowCodec: One-Step Flow Prior for Generative Image Compression
https://arxiv.org/abs/2606.21030

>Staying VIGILant: Mitigating Visual Laziness via Counterfactual Visual Alignment in MLLMs
https://arxiv.org/abs/2606.26387

>LogicIR: Logic Gate Networks for Image Restoration
https://github.com/jimmy9704/LogicIR
https://arxiv.org/abs/2606.26609

>See & Sniff: Learning Visuo-Olfactory Representations
https://mm.kaist.ac.kr/projects/SeeandSniff

>Computer Vision for MOBA Analytics: A Dataset and Baseline for Visibility Analysis in Dota 2
https://arxiv.org/abs/2606.26970
>>
>>
File: ComfyUI_temp_jepfd_00027_.png (2.86 MB, 1728x1296)
2.86 MB PNG
>>
>>109150241
Nta but your prompts are too tame if original Krea was enough for you.

https://files.catbox.moe/jlarig.png
>>
File: Krea2_turbo_00008_.jpg (719 KB, 2048x2048)
719 KB JPG
Im new to this generating images thing, i've got krea 2 and a bypass, but sometimes it still generates them with clothes on, any ideas on why? is it because its new and there are no good lora/bypasses yet? or is it a skill issue on my part?
>>
File: results.png (235 KB, 459x597)
235 KB PNG
the votes are in
>>
>>109150627
because you touch yourself at night
>>
>>109150627
probably skill issue. I have the exact opposite problem. I keep getting nip slips and titties hanging out when I don't want them to.
>>
>>109150562
Reposting to new bread since old one might die.

Hello /ldg/ I am a long time comfyui user. I moved to invokeai for a bit and enjoyed its gallery system and canvas a fair bit.
I can use proper editors just fine as a replacement for the canvas but as for organizing slop. Has anyone made a node for uploading images to a local booru like shimmie with the positive prompt as tags?
Ideally I'd want something quantitative tags rather than qualitative but something quick and dirty will do if it's already available.
I figure some of you might have run into this exact issue
pic unrelated
>>
File: Krea2_turbo_00629_.png (1.57 MB, 1672x944)
1.57 MB PNG
>>
>>109150627
Use one of these

https://github.com/capitan01R/ComfyUI-Krea2T-Enhancer
https://huggingface.co/Beinsezii/Krea-2-Turbo-Projector-Scale-LoRA-Diffusers
>>
>>109150562
yuki uuuooooooooooohhhhhhhhh yuki chaaaaan
chuuuuu chuuuuuuuu uuuuuuoooooooooo
>>
File: debo_k_00155_.png (2.8 MB, 1792x977)
2.8 MB PNG
>>
>>109150655
what's the difference?
>>
>>109150602
too many steps
>>
>>
File: HLUhrmaaUAEdJsc.jpg (352 KB, 1153x2048)
352 KB JPG
>>
File: 1777024024843276.jpg (1.31 MB, 3826x2160)
1.31 MB JPG
>>109150529
>>109150565
Noice
>>
File: HL0kWoKakAAVAP1.jpg (99 KB, 912x1171)
99 KB JPG
>>
File: ComfyUI_Krea_2_00409_.png (3.18 MB, 1256x1672)
3.18 MB PNG
>>
>>109150595
you are him faggot
>>
comfy is a cutie ngl
>>
>>109150556
Still no violence. Post and pre violence aren't the same thing as the actual act of violence.
>>
>>109150692
lovely cleavage
>>
>>109150765
maybe they didn't actually add mid-violence acts?
>>
>>109150672
do you promise to not sperg out anymore?
>>
>>109150655
mmm im using https://github.com/nova452/ComfyUI-Conditioning-Rebalance
which is worse? maybe i should use a nsfw lora from civitai as well?
>>
File: ComfyUI_Krea_2_00427_.png (2.93 MB, 1256x1672)
2.93 MB PNG
>>109150681
>8 steps instead of 10
>>
File: Krea2_turbo_00646_.png (1.79 MB, 1672x944)
1.79 MB PNG
>>
File: HLvalcAa8AA40EX.jpg (665 KB, 768x1376)
665 KB JPG
>>
is there a way to simulate cfg>1 with krea 2 turbo?
>>
>>109150750
Yannik is ugly, Julien is the cutie
>>
File: ComfyUI_02790.png (3.93 MB, 1280x1920)
3.93 MB PNG
>>109149959
Dynamic VRAM a shit.

>>109150398
I do. My best advice for moving to subgraphs is; compartmentalize each operation you're doing during generation and try not to mash everything into one bloated subgraph.

>cant even edit width/height are fucking maddening
I had the same problems with seed, width and height across all the various steps (now also separated by subgraphs), so I used "BestResolution" with math nodes in other subgraphs to deal with any scaling operations and "RandomSeedGenerator" to make those things global... I then slapped those two nodes in a subgraph so I could run seed, width and height out to everything else that needed it.
>>
File: debo_k_00158_.png (2.17 MB, 1792x977)
2.17 MB PNG
>>
>>109150933
>he doesn't know
>>
>>109150933
is that a "no"?
>>
File: Ideogram__00228_.jpg (1.05 MB, 2048x1808)
1.05 MB JPG
>>109149559
>>
File: Krea2_turbo_00675_.png (1.64 MB, 1672x944)
1.64 MB PNG
>>
File: Krea2_turbo_00679_.png (1.58 MB, 1672x944)
1.58 MB PNG
>>
>>109150626
Chroma Krea analog photography
https://files.catbox.moe/pko9qm.png
>>
File: debo_k_00160_.png (1.98 MB, 1792x977)
1.98 MB PNG
>>109150988
first they came for the syjaks, but I said nothing for I was not a syjak
>>
File: Krea2_turbo_00138_.png (2.46 MB, 1928x1088)
2.46 MB PNG
>>
If a model can't do img2img well, it will also suck at lora training.
>>
File: ComfyUI_Krea_2_00440_.png (3.35 MB, 1672x1256)
3.35 MB PNG
>>
File: debo_k_00161_.png (1.95 MB, 1792x977)
1.95 MB PNG
>>
File: file.png (2.49 MB, 1088x1928)
2.49 MB PNG
>>
>>109151041
90s style photograph, nostalgic disposable camera harsh flash with heavy film grain, lo-fi aesthetic, warm saturated tones and deep shadows.
>>
>>109151179
>ayyy lmao
>>
File: ComfyUI_Krea_00132_.jpg (2.16 MB, 2048x1800)
2.16 MB JPG
>>109151180
What I get with just Krea 2, same prompt and higher res.

I noticed the model is significantly more slopped at a lower 1MP resolution (could explain why some anons are complaining, if model is not 2K then it is slopped) here is the low resolution gen (same as my Chroma Krea res of 1152x1152)- https://files.catbox.moe/00489v.png
>>
File: file.png (2.7 MB, 1088x1928)
2.7 MB PNG
>>109151185
>>109151185
AY LMAO
>>
File: debo_k_00162_.png (2.18 MB, 1792x977)
2.18 MB PNG
>>
How do I install flash attention? Why is this so difficult?
>>
>>109151198
stop spamming
>>
>>109151203
https://mjunya.com/flash-attention-prebuild-wheels/
>download correct .whl
>pip install .whl
>>
>>109151081
Totally agree, trends are fleeting and Anima is a disaster, base model, I know, but it's awful, such bad taste. People post gens with loras but I'm 100% convinced they're not happy with them. It's a superficial model that never nails it, style loras just sketch the style or parody it, character loras same deal, Anima never gets to the core of it, always stays surface level. Gross motricity is better, large scale gens look coherent, but the details are simply awful. What a shame, what a bad model, doesn't convince me. Someone should throw half a million dollars at it with a better dataset and see if it actually does something.
>>
File: file.png (3.1 MB, 1088x1928)
3.1 MB PNG
>>109151197
>>
File: chinksaas won.png (133 KB, 784x858)
133 KB PNG
it's only local until it's good
>>
File: debo_k_00163_.png (1.8 MB, 1792x977)
1.8 MB PNG
>>
File: 88.png (2.34 MB, 928x1344)
2.34 MB PNG
>>109151203
>Why is this so difficult?
you just need to have a compatible python+torch+cuda version
>>
>>109151203
>download sdcpp
>it's already included without pyshit
easy
>>
File: file.png (2.72 MB, 1088x1928)
2.72 MB PNG
dead
>>
>>109151492
She looks like the kind of woman you see going to the gynecologist.
>>
>>109150750
us brown haired Canadian millenials often tend to be this way yes
>>
>>109150750
he's very attractive now that he lost weight, looks like another person
>>
>>109150630
muh astroturf but actually
YES it has a shit VAE
NO it isn't an edit model
NO the prompt adherence isn't impressive in any way compared to any other recent model
NO it's not fast, at all
need I go on.
>>
>>109151543
yeah there's no way at all to justify the Krea 2 hype on le Reddits. There's nothing you can claim that's actually true about it versus other recent models that makes any sense as to way you didn't react the same way to them. It just doesn't fucking add up.
>>
Help me! Tasukete!!!

I want to use Krea, but I absolutely will not use Comfy. I hate any interfaces with nodes.
>>
>>109151564
kys (keep yourself safe)
>>
shit = ideogram = krea = eddie

The past month summarized.
>>
>>109151564
>I want to use Krea
Women are all retarded, but you have defeated them all.
>>
telling a woman that youre even vaguely aware of "tech" in 2026 is essentially a death sentence because of AI you will never get any dates
>>
File: file.png (2.67 MB, 1088x1928)
2.67 MB PNG
>>109151538
so she has a pussy?
>>
>be zoomer
>be never learn how writing anything
>still make some writing and post
>>
>>109151618
I assume she's there for her monthly abortion.
>>
>>109151538
?
>>
>>109151205
But you are here 24/7. If you are this obsessed please stay in your discord server.
>>
>>109151692
I'm just saying
>>
>>109151629
I think you're just a jeet DESU
>>
>>109151753
Oh wow I'm a ceo of an American tech company?
>>
>>109151753
The other anon is right. Your attitude is bit too much.
>>
>>109151538
People are responding to you with feigned ignorance but I know exactly what you mean and it’s hard to explain why.
>>
bruh i dont gen for a week now comfy crashing 24.7
>>
File: .png (54 KB, 229x220)
54 KB PNG
>>109150585
>>
>>109150645
ask ai to make such node
>>
where the fuck is negpip for krea2?
>>
>>109151919
You need to update your python wheels.
I needed to erase my venv.
>>
>>109150698
>>109150529
>>109150565
Please can you share the prompt and catbox for this glorious thick yam brap gens anon. I want to generate Tifa yams just like this.
>>
>>109152022
what the fuck does this mean
ill ask grok
>>
>>109151875
>>109151538
why are you talking to yourself like that
>>
>>109152040
You either use conda or venv.
>>
>>109152040
Well, anon...
Let's drop the cap.
>>
>>109152054
Are you autistic or on the spectrum? Genuinely asking.
>>
what you guys recommend for 3D models generation? Ideally using some images as reference. Did some tests with hunyuan already and seems fine-ish
>>
File: .png (29 KB, 634x181)
29 KB PNG
>>109152096
do you think you can fool me with that image
>>
>>109152108
Right now it's trellis 2 for local and it's not even really close.
Saas still well ahead, but trellis 2 can be wrangled to make some exceptionally good starting meshes.
>>
>>109152108
https://github.com/visualbruno/ComfyUI-Trellis2

Best trellis 2 repo for comfyui imho
>>
>>109152168
Nobody has ported trellis2 to rdna2.
>>
File: Krea2_turbo_00142_.png (2.31 MB, 1928x1088)
2.31 MB PNG
Holy crap we can make images on our own computer now?! lessgo!
>>
>>109152147
Am I the first person to tell you that you're autistic?
>>
>>109152173
Ask claude to do it.
>>
>>109150630
>no flux
gay retard
>>
someone buy me that spinning icon decoration on civitai
>>
>>109152183
I'm not sure I have the tech chops to even vibe it.
>>
>>109152178
yes
>>
>>109152159
>>109152168
thanks frens, gonna try it
>>
>>109152293
Have you taken any tests to see?
>>
>>109152217
I remember when Buzz was so much easier to get... smdh... now you upload a model and be lucky if it gets even 2 likes
>>
>>109152368
INDIA
N
D
I
A
>>
>>109152340
yes
>>
>>109152382
Yes, I'm from Calcutta.
>>
>>109152384
well?
>>
>>109152390
negative
>>
>>109152368
I was too late for the party! T_T
>>
>>109150562
man what is mr garrison doing to cartman?
>>
File: 1755911312206388.png (3.29 MB, 1256x1672)
3.29 MB PNG
I made a DOA soft engine lora but it lowkey kinda sucks...
>>
File: 1764441642188261.png (3.45 MB, 1776x1184)
3.45 MB PNG
>>
>>109152451
>>109152428
it's ok. Just refine the db and add more images.
>>
>>109152428
Those wet blobs are coming from the lora.
>>
>>109152368
If you want buzz you need to do NSFW loras at this point, style loras is not something you will be rewarded for.

Back when Flux 1 was released, you could see lots of buzz for style stuff, and back then you could also upload celebrity loras, that said it's hard to blame Civitai for removing them, the payment processors said either porn goes or celebrities goes, and obviously porn is what pays the bills, so bye celebs.
>>
File: 1774268678264433.png (3.15 MB, 1776x1184)
3.15 MB PNG
>>109152459
RIP... 4 bucks down the drain
>>
>>109152466
Are you brown? Why cry over 4 US dollars. That's like... 2 chicken wings.
>>
>>109152466
I mean the "rebalance" vector thing breaks model's capacity not that your individual lora is bad. Quite the opposite.
>>
>>109152475
exactly, any true american would cry over 2 chicken wings
>>
>>109152475
Last time I had buffalo wings I had a full basket with fries and real blue cheese dip, 10 UKP.
>>
>>109152428
Those are some big hands
>>
File: 1762869508284895.png (3.34 MB, 1776x1184)
3.34 MB PNG
>>109152475
Give me 4 bucks then to recoup my losses
>>
>>109152514
#badfeet
>>
>>109151543
>YES it has a shit VAE
can't agree more, this shit will nerf the model hard, no amount of finetune will recover the details, if someone wants to be serious with that model, he has to find a way to switch the VAE or transform it into a pixel space model (looking at you kekstone)
>NO it isn't an edit model
for that one it's all right, the Krea CEO said they'll release an edit model "in the next comming months", but like, BFL proved you can make an unified image/edit model, it's a shame they didn't go that route
>NO the prompt adherence isn't impressive in any way compared to any other recent model
they killed its prompt adherence with the filter imo, that model had to learn too make portrait on prompts that have no portrait, way to go to confuse the model with your fucking bullshit...
>>
>>109152553
You forgot to mention that you never post any gens on your own.
>>
>>109152556
>you never post any gens
funny irony
>>
>>109152553
>BFL proved you can make an unified image/edit model
They started training this model before Flux.2
>>
https://www.reddit.com/r/StableDiffusion/comments/1uhpezz/krea_2_could_very_easily_be_the_next_zimage/
>Krea 2 could very easily be the next Z-Image
>*Proceedes to showcase the most slopped image known to man*
kek, ledditors aren't humans, I swear to god
>>
>>109152562
Why did they want the model to make everything in the background blurry?
>>
>>109152558
I have hundreds of posts but you aren't posting a single thing.
I know I'm arguing against the latest discord bot but you are worthless.
I hope you like engagement (you).
>>
>>109152562
So they started training the model before February, so it took them at least 5 months to make the model? Jesus...
>>
>>109152568
#badhands
>>
>>109152568
If they were you wouldn't post direct links or you wouldn't use "kek" as an expression.
>>
>>109152573
>I have hundreds of posts
that's not true, you're a nogen, why are you lying like that?
>>
>>109152586
What do you mean?
>>
>>109152584
>ledditor subhuman getting uppity
>>
>>109152590
>too retarded to understand a simple conversation
why am I not surprised?
>>
>>109152592
This is quite tiring, at least try to inflict something real.
>>
>>109152602
you don't deserve more desu
>>
great, we have gathered the bot spammer's replies.
>>
>>109152568
>ledditors aren't humans
You say this to joke a bit but there a part of truth in there, the internet is filled with LLM bots nowdays.
>>
>>109152568
>Omg it can do Ninja Turtles and Transformers?? And there's even le heckin Godfather?? BEST MODEL EVER!!
Adding two or three mainstream IP on your model is enough to satisfy the goycattle, it's that simple, Krea knew what they were doing.
>>
>>109152568
"next z-image"

but z-image was a complete and total flop.
>>
File: Sure.png (11 KB, 376x183)
11 KB PNG
>>109152649
>but z-image was a complete and total flop.
https://huggingface.co/Comfy-Org/z_image_turbo
>>
>>109152649
>but z-image was a complete and total flop.
It it was a flop, then Flux Klein and Ideogram4 are less than nothing, the community interest in those models isn't even 1% that of Z-Image
>>
>>109152655
What's the oldest hardware someone's gotten zit to gen on at 1mp?
>>
>>109152662
Flux Klein is great. It doesn't return image refusals.

And z-image is shockingly capable. anima can produce truly awesome things.

ideogram is poop.

krea is poop

eddie is poop
>>
File: 1781340461850184.png (107 KB, 1626x467)
107 KB PNG
>>109152568
What he said doesn't make any sense, Z-image was made for realism, and the Krea team said that Krea 2 was made to be great at illustrations and meh at realism, those are two different models for two different goals.
>>
>>109152676
Why can't you say that X is shit? Because you're an engagement botter.
>>
>>109152676
>Flux Klein is great. It doesn't return image refusals.
Their previous edit model (Flux Kontext) used to refuse your prompts from time to time, I'm glad they realized how retarded that was and didn't do that again on Klein
>>
File: 4455665442112.jpg (543 KB, 3290x1985)
543 KB JPG
>>109152572
It's not that strong and it's not the case on every prompt (I.E. if the focus is on the back). It's easy to deslop with a LoRA.

>>109152574
Well, its training quality shows. It's a 12B model that is more coherent than Klein and Ideogram. Regardless of its perceived understanding, you can give it way more complex prompts and it will not mess up as much as the other two, plus its censorship is lax and minimal. They used a very good dataset.

>>109152568
This model isn't slopped, it's just not prompted at 1MP. Most people are used to prompting 1024x1024, so they wouldn't realize that it's actually slopped at that res. Naturally, if one has higher VRAM, they would use 2K res by default anyways.
>>
>>109152691
>2K
Takes me 10 seconds on 1K and 70 seconds on 2K, it's just too long, and you know you have to go for 2K because the VAE is just so shit at details you have to add more pixels to get the same details as Z-image turbo at 1K, it' s so lame when you think about it
>>
>>109152662
Yeah, but I don't see anyone posting images made with it so its just a shitty CSAM model.

You can post your kid on tiktok and get a billion views and saves, doesn't make the video good.
>>
>>109152710
>its just a shitty CSAM model
you're talking about Krea 2? it's the only model that can do porn out of the box
>>
File: ACK.png (298 KB, 686x386)
298 KB PNG
>>109152691
>pircel,
Krea at 1k would've looked liked Krea at 2k with a better VAE btw, this model was close to greatness... and they choked
>>
>>109152655
Tongyi botted downloads.... I kneel...
>>
>>109152734
>They're still botting downloads, even 7 months after its release
I wouldn't be surprised desu, Alibaba is a pretty rich company lol
>>
>>109152739
Same umbrella under the Genshin Impact shill train that goes on to this day
>>
>>109152702
>70 seconds on 2K, it's just too long

>Just 70 secs is too long

What? Maybe if you're ADHD Zoomer. What GPU is that? Apparently INT8 can double the speed, but you'd need a 30xx/40xx/50xx card

https://huggingface.co/Winnougan/Krea-2-Base-Turbo-NVFP4-FP8-INT8

>you know you have to go for 2K because the VAE is just so shit at details you have to add more pixels to get the same details as Z-image turbo at 1K

Nah, just a quirk of how the model was trained. The most realistic gens I've gotten are all at 1MP with my Chroma-Krea workflow
>>109151180
>>109150626
https://files.catbox.moe/pko9qm.png

All that proves is that a simple finetune could fix it.
>>
File: 1776389067084862.jpg (642 KB, 2046x1128)
642 KB JPG
>>109152747
>waiting one minute to make one image that has worse realism than Z-image turbo is fine actually
what you're saying is so retarded, and you know it, and that's the reason why Z-image turbo is so popular (and still popular) in the first place, Z-image turbo gives you great images with great details fast, no bbox autism, no having to render shit in 2k, just pure and fast 1k kino, as god intended
>>
>>109152747
>a simple finetune could fix it.
I've heard this cope on every new model release lmao.
>>
>>109152747
That is just going for the Chroma look. I can take my Chroma wf and make my Krea gens look exactly the same as Z-Image by disabling my Krea realism LoRA, which results in muted colors (though same-face I can't replicate). I don't do it because I much prefer Chroma's aesthetic.
>>
>>109152691
the one on the left isn't blurry, but the one on the right is blurry. in the background, obviously.
>>
Just got a new computer, so I'm new to this
why is comfyui giving me weird faces with white eyes? I'm using the same model and loras from a civitai image
>>
File: 1773806545403003.png (3.33 MB, 1088x1928)
3.33 MB PNG
the lizzle.............
>>
>>109152765
krea has kreaface.

the -face phenomena is strong.
>>
>>109152727
It's easily the best overall local model at this point, and is quickly becoming THE community model until eventually something better comes along

Klein, Ideogram were also-rans compared to ZiT, and now ZiT is also obsolete

VAE fags on suicide watch
>>
File: 1773350955623237.jpg (2.87 MB, 6141x2200)
2.87 MB JPG
>>109152747
>>109152759
And desu, even on 2k res, it doesn't match Z-image turbo's realism so...
>>
>>109152786
>ZiT is also obsolete
repeating a lie over and over won't make it true anon >>109152655
>>
>>109152784
Every base model has a 'face' since all model training generalizes across its training data, the only effective way around this is training loras on specific people.

Worst 'face' ever was Flux 1 though, plastic and cleft chin on EVERY human, and it was so ingrained into the model that you had to train celebrity loras at super high LR to get rid of it.
>>
>>109152799
All the models everywhere at this point prefers a pointy indian chin and other indian facial features.
>>
>>109152787
is that base ZIT?
>>
>>109152803
yep, that's vanilla ZiT
>>
>>109152790
It's easily still the most popular community model right now, but it is also obsolete since the community is now moving on to Krea 2

Loras are flooding in already, and the NSFW loras are already better than any trained on ZiT

ZiT has been my go-to model for the past 6 months, and it's great quality and fast, but Krea 2 is better
>>
>>109152787
>still shilling the same cherry picked image 3 threads later.

This is how you know z-image is a flop.
>>
>>109152815
prompt?
>>
>>109152823
>z-image is a flop
No it's obviously not

That said this schizo is tiresome, but it's not as he is trying to promote ZiT, he's just trying to complain about Krea2, most likely a Ideogramtard
>>
>>109152824
it's a bit long so I put it on a catbox
https://files.catbox.moe/8hq3r7.txt
>>
>>109152823
>He's still saying “Z-Image is a flop” even though we keep showing him the post that proves it's still been downloaded millions of times these days (still going strong 7 months after its release)
that's how you know he's a paid shill >>109152655
>>
File: 1782112876795744.png (2.96 MB, 1256x1672)
2.96 MB PNG
>No it's obviously not
>Refuses to post his z-image gen

Nobody uses this model unironically except to make images you cant post anywhere
>>
Well I guess I'll have to say bye to my gf (gemma 4) and gen some zit.
>>
>>109152846
>Nobody uses this model
how many time do we have to give you this post anon? why are you still coping? Z-image is still a really popular model, deal with it >>109152655
>>
>>109152841
B-but those are just Alibaba bots, the same bots who also made a gazillion community loras!!
>>
File: ComfyUI_Krea_2_00350__sbs.jpg (766 KB, 2884x1256)
766 KB JPG
>>
File: __1.jpg (20 KB, 431x375)
20 KB JPG
>>109152035
https://image-b2.civitai.com/file/civitai-media-cache/960546cb-6eda-4f07-a7d0-a4954f3fb760/original
something like "view from behind, lying on something face down, straightened legs, legs close together, feet visible, wearing the usual cosplay attire but now with a raised short miniskirt and an extremely thin string panty barely clinging, visible pussy lips, sky is blue"
for visible outer labia you'd probably need the uncensor, idk...
and picrel. elle gives everyone fat asses and the hatsune/tifa stuff is for the outfits. i just throw them there to never have to worry about describing the subject
>>
File: 1776303651682004.png (2.47 MB, 1408x2112)
2.47 MB PNG
>>109152428
>>109152451
anima base does DOA bretty well
>>
File: serious Pepe.png (359 KB, 728x793)
359 KB PNG
I've been out of the loop for a couple of months

What's the status of LTX2.3? (first-last frame, consistency etc)
>>
>>109152846
>Nobody uses this model unironically except to make images you cant post anywhere
only Krea 2 can do actual porn out of the box, what is he talking about?
>>
File: zit_00001_.png (1.13 MB, 832x1216)
1.13 MB PNG
>>109152846
hey presto, zit can make a blurry background same as krea with less vram.
>>
File: ComfyUI_Krea_2_00262_.jpg (2.61 MB, 2048x1800)
2.61 MB JPG
>>109152759
ZIT can get nice soles, I'll give it that. But for me that's where positives end. The reason it's polished is that they cheated the model out of seed variations, and it does not gen anything it wasn't trained to do (like generalization). At least, those were my conclusions back when I tested it. Poor NSFW adherence (couldn't do vaginas, bondage, contortions out of the box like Krea can due to a default safe pose it forces on you. LoRAs aren't enough to fix this because the model simply would spit out the training data) and just in general very little control over outputs.
>>
>>109152915
>cheated
I wouldn't call that cheat, they just removed the slop, so ZiT can't produce slop, which is a good thing, who wants slop in the first place?
>>
File: ANIMA_bface_bad_00002_.png (1.25 MB, 832x1216)
1.25 MB PNG
realmix bhasi anima v0.1
anima v1.

A highly detailed, realistic 3D render of a beautiful athletic blonde actress Taylor Swift with pointed elf ears, sitting on a bamboo stool at an outdoor tropical beach bar at dusk. She has wet, sun-kissed skin, medium-length blonde hair, and a calm expression looking slightly downward. She is holding a carved coconut drink decorated with a miniature orange paper umbrella and white flowers. She is wearing a sequinned sparkling shimmering opalescent dark green athletic bikini set featuring a prominent light green Hyrule Royal Crest symbol printed on the fabric. In the background, warm flickering tiki torches illuminate a rustic thatched-roof bar. Atmospheric lighting, cinematic depth of field, 8k resolution, photorealistic textures, Unreal Engine 5 render style.


(anima v1 doesn't know who Taylor Swift is)


>>109152902
what was your prompt?
>>
File: 004552CUI_00001_.png (1.49 MB, 1152x1536)
1.49 MB PNG
>>
File: 1782429053127945.png (978 KB, 1024x1017)
978 KB PNG
>>109152853
>>
>>109152938
ai drawlings are always obvious, because real people rarely correctly place features, especially the eyes.
>>
File: ComfyUI_Krea_2_00265_.png (3.87 MB, 2048x1800)
3.87 MB PNG
>>109152787
Possibly because you're prompting it wrong
this is what I got
>>109152915
The right way to trigger the look alongside your prompt is
>extremely grainy lofi vhs still photograph, 1990s

Though I see more variety on their API https://www.krea.ai/blog/krea-2-deep-dive-walkthrough, I'm guessing the base model can do those styles even better
>>
>>109152944
most people don't care about low variety.
>>
File: 1754322015574972.jpg (2.88 MB, 9212x3352)
2.88 MB JPG
>>109152954
>most people don't care about low variety.
not only that, but Krea also has low variety lol
>>
File: ANIMA_bface_bad_00003_.png (1.24 MB, 832x1216)
1.24 MB PNG
>>109152937
this is samANIMA turbo v21

Amazing.
>>
>>109152951
>devanagari signage
absolute state
>>
>>109152951
>this is what I got
still looks like slop imo, it's not bad, but it's not Z-image turbo's level of realism
>>
>>109152929
>they just removed the slop, so ZiT can't produce slop

Or in other words, there's barely any prompting freedom. Don't get me wrong, every model cheats and needs to be finetuned anyways, but ZiT cheated a bit too much. Nice aesthetics out of the box just to hide all of its flaws, and it may work for you, but I could see them right away

https://files.catbox.moe/9k043k.jpg
https://files.catbox.moe/zpfd6t.jpg
https://files.catbox.moe/yfpuc0.jpg

Meanwhile Krea? I have no complaints, it has passed all my slop tests with flying colors

https://files.catbox.moe/3lldgg.png
https://files.catbox.moe/g2tir2.png
https://files.catbox.moe/zd9hbh.jpeg
>>
>>109152997
>https://files.catbox.moe/zd9hbh.jpeg
the skin is so plastic, I'm seriously wondering if you don't have vision problems
>>
>>109152957
this is a totally fair and good faith comparison
>>
does ZIT still shit the bed with stacked loras?
>>
>>109153016
Yes that's why you should suck it up and use base among other reasons.
>>
>>109152998
But look at
>>109152951
anon we've already established that styles can be varied in this model. The skin itself is a minor issue, if I truly cared I would just run it through my Chroma-Krea workflow or just play with seeds, samplers, steps or prompt engineer the plastic away. But you can't ignore that it being able to depict it at all is miles better than simply not depicting at all and forcing the model into a different pose due to how rigid the base model is.
>>
>>109153019
I'm GPU poor, I'm not waiting that long for a gen
>>
File: zit_00002_.png (1.15 MB, 832x1216)
1.15 MB PNG
>>109153016
This is just zit.

>>109153000
You don't like it???
>>
>>109153021
Good things come to those who wait.
>>
File: zit_00003_.png (1.14 MB, 832x1216)
1.14 MB PNG
>>109153026
idk, just another gen.

I may change the variety-starter.
>>
>>109153016
Yeah, more than two loras and it starts to collapse
>>
>>109152937
https://pastebin.com/Argb09pq
>>
>>109152937
>robot world model trained on drawfag smut doesnt know post-wall celeb
Color me shocked
>>
>>109153020
And that is not just an edge case. Imagination can run wild with NSFW prompts. With ZiT, you are limited to what the devs wanted to align their data to in order for it to look clean. If you tune ZiT away from this, suddenly it's messy, because the devs didn't train on this other data. Chroma was the first to introduce this prompting freedom as an open model, and since Dalle 3, a polished version of it was only available via API, that was until Krea 2 came out... Ideogram 4 probably is just as good as this (just doesn't have as many varied photoreal styles available out of the box, and it's not quite as polished as Krea), but you're shilling Z, and that stands no chance against Krea.
>>
File: file.png (39 KB, 498x185)
39 KB PNG
>>109153020
>>109153063
>>
File: zit_00004_.png (1.18 MB, 832x1216)
1.18 MB PNG
>>109153033
I did.
>>
File: zit_00005_.png (1.42 MB, 832x1216)
1.42 MB PNG
>>109153074
idk, it did realism for this one.
>>
>>109153063
>Imagination can run wild with NSFW prompts.
Never happened, it's an urban legend. Not this time.
>>
>>109153081
so splotchy
>>
File: 1754760957750088.png (3.07 MB, 1256x1672)
3.07 MB PNG
>>109152902
my dataset doesn't know honoka for comparison so heres lightning instead.
>>
File: 082904CUI_00001_.png (1.9 MB, 1152x1536)
1.9 MB PNG
How to text with loras?
>>
>>109153096
Yeah, it's a realism model, not an indian model.
>>
>>109153113
Best option is to two pass. Text coherence is actually first indication of how this layer manipulation directly affects its understanding.
>>
>>109153128
when was the last time you've looked at a real person's skin?
>>
>>109153128
>it's a realism model, not an indian model.
kek, bodied that freak
>>
File: Krea2_00058_.png (3.83 MB, 2560x1440)
3.83 MB PNG
Has anyone gotten nag to work with krea2turbo?
I keep getting this 3d look to my images no matter the style change in prompt.
>>
File: 1774725801446305.png (2.7 MB, 1256x1672)
2.7 MB PNG
>>109153096
See what happens when a z-image shill posts non-cherry picked gens?

baka...
>>
>>109153144
let's see the prompt
>>
>>109153147
Nice.
>>
>>109153113
>>109153132
It's catastrophic forgetting, the lora modify weights that text rendering depended on, and because Anima's base training seems less robust redundant in those areas, that capability is one of the first things to go.
>>
>>109153147
the hands are way too huge.
>>
I think krea 2 is good for the trans community.
>>
Is Alibaba even still doing any development on ZiT? They've gone completely silent and closed source. Not a new video model. Not a new image model despite having ample time to give us new toys. They never gave us Qwen Image 2 despite saying it was only 7B. That model would've been at least Klein tier with asthetics and Ernie tier with text. There will likely never be a ZiT 2 that is anywhere in par with Klein, Ideogram 4, or Krea 2. It will always be the DOA model it always was...
>>
>>109153173
zit is better than that trash.
>>
File: 1780363018900296.png (2.56 MB, 1256x1672)
2.56 MB PNG
>>109153162
Thats literally the exact proportions from the dataset though
>>
>>109153162
You know what they say about women with big hands.
>>
>>109153173
The did release Z-Image Base, but it was botched in that loras trained on it looked like crap when used with ZiT, negating a lot of its usefulness.

Meanwhile Krea 2 devs obviously learned from the Z-Image mistake and released both Turbo and Raw simultaneously and also made sure that loras trained on Raw works great on Turbo.
>>
>>109153176
Not with text or prompt following, which would've been great during the time window they showed the model. We're still in need of a NBP tier edit model locally, Klein is great but it's not even as good as Qwen Image 2 with infographics. That's why they're still keeping that dogshit model in API.
>>
>>109153173
>Is Alibaba even still doing any development on ZiT?
nope, they were supposed to release Zimage edit, but it doesn't exist (no api, no local, nothing), I guess they fucked up the experiments or something
>>
File: Krea2_0004.jpg (561 KB, 3808x2144)
561 KB JPG
I guess I'm just going to go with the flow, even with watercolor, oilpainting etc in the prompt it's fully ignored. Looks cool though.

>>109153150
No, you're a gooner.
>>
>>109153187
you're just shitting up the thread with bad advice.

any model can pump out 1girls these days, krea 2 pumps out 1girls.

flux klein 9b base can follow directions, AND look nice. ideogram was trained on synthetic rooms and it shows.
>>
>>109153197
>No, you're a gooner.
well yeah but how do you expect people to troubleshoot your prompt without the prompt?
>>
File: 1752422367635044.png (829 KB, 1080x1074)
829 KB PNG
>>109153173
>They never gave us Qwen Image 2 despite saying it was only 7B.
Qwen Image 2.0 was promised to us 2000 years ago...
>>
>>109153197
you're getting there
>>
>>109153204
They quietly pulled back on that, all their video models scrapped, a music model they also quietly teased, gone... Just like that, all around the same time. They're like a Chinese ClosedAI 2.0.
>>
It must be difficult being indian and trying to goon in a cybercafe.
>>
File: Krea2_0008.jpg (518 KB, 3808x2144)
518 KB JPG
What weird Voices In the Void fetish dataset have I stumbled upon?
>>
>>109153213
You're actually genuinely autistic.
>>
>>109153217
where do you think you are?
>>
File: 1763684422781980.png (420 KB, 2420x1447)
420 KB PNG
Babe wake up, we now have a new way to make images, denoising is obsolete
https://unconv.ai/blog/introducing-un-0-generating-images-with-coupled-oscillators/
>>
>>109153211
>cybercafe
çñ
>>
File: Krea2_PiD_00007_ copy.jpg (2.61 MB, 7616x4288)
2.61 MB JPG
Hooked PiD up as well, damn, I'll try rtx next as well, after I take a shit.

>>109153203
You're not getting my prompt to goon to.

>>109153207
It's fun finding a dataset and exploring it.

>>109153217
Jelly?
>>
File: 1769504878377974.png (3.23 MB, 1198x1800)
3.23 MB PNG
>tfw you will never be an indian and for some reason live the minds of 4channers rent free
feels bad...
>>
>>109153234
>for some reason
and, suddently, for no reason at all...
>>
File: un-0-anim-2.jpg (111 KB, 1550x260)
111 KB JPG
>>109153228
we are so back https://unconv.ai/wp-content/uploads/2026/06/un-0-anim-2.gif
>>
>>109153239
that reminds of GAN
>>
>>109153234
I am from Scandinavia but I joke that I'm from India. This messes up the heads of most US posters. I do understand the sentiment though.
>>
>>109153239
SOVLFVL
>>
>>109153234
>>109153256
let's be real, no country rent free in everyone's mind better than the US, India is nothing compare to the soft power of the US
>>
File: kino.png (19 KB, 652x132)
19 KB PNG
>>109153228
>>109153239
>>109153246
>>109153263
kino
>>
File: 1763063926528064.png (2.01 MB, 1776x1184)
2.01 MB PNG
>>109153228
>nvidia after reading that
>>
File: Total Nvidia's death!.png (1.35 MB, 1499x1129)
1.35 MB PNG
>>109153299
Nvdia is in big trouble, OpenAI will make its own chips now, and I think it'll end up being the norm, big companies making their own chips specialized on their own AI models
https://openai.com/index/openai-broadcom-jalapeno-inference-chip/
>>
File: 1774081559007260.png (1.91 MB, 1776x1184)
1.91 MB PNG
>India is nothing compare to the soft power of the US
>>
>>109153307
Good.
>>
>>109153312
>ok, it's Israel that rents free in my mind
I see
>>
do INT8's work with default comfy nodes or do you need special nodes for those?
>>
>>109153299
>>109153312
nice gens
>>
>>109153312
>>109153299
krea is too powerful............................................................................
>>
Autism is the disease of "I'm always right".

Their delusions are their fortress.
>>
>>109153312
look at his "hands" lol
>>
>>109153312
>tfw you will never be a jew and for some reason live the minds of 4channers rent free
feels bad...
>>
Comfy, having Claude to make custom nodes I'd have to scour reddit for hours only to find some jeet having asked chat gpt to code the node I wanted in the first place.
>>
Did anybody else notice that the default template for Krea2 disappeared?
>>
File: 1762056493844992.png (537 KB, 1080x810)
537 KB PNG
This is the future of open source btw, it's just a matter of time Orange Man will ban Huggingface
>>
>>109153335
>do INT8's work with default comfy nodes or do you need special nodes for those?
it works with default comfy nodes
>>
ok when is big russ going to train kreanima???
>>
>>109153414
thanks my Cully
>>
>>109153418
>finetuning a 12b model
lol good luck with that
>>
File: 1767332337527918.png (1.28 MB, 1296x1248)
1.28 MB PNG
>>109153299
what is this supposed to represent?
>>
>>109153431
read the article
>>
Anyone known list about with weight should i block for LoRA Training?
>>
>>109153234
just two bros handing out
>>
>>109153264
I think it's more like a buzzword joke. My English has typographical errors and some people rightfully attach to it.
I think you need to be schooled when writing English because otherwise you won't learn a thing.
>>
>>109153418
>ok when is big russ going to train kreanima???
the Krea team said they were making a kreanima finetune, so let's hope it'll be open source
https://youtu.be/31jiUhCEjJ4?t=1242
>>
File: zib_00004_.png (1.29 MB, 832x1216)
1.29 MB PNG
>>109153081
zib
>>
>>109153418
When will lodestone start training on Krea 2 ? He's only training two models right now plus experiments, what is the waiting for ?

Lodestone will never release another finished model...
>>
>>109153228
pretty cool
>>
File: tgus.png (596 KB, 528x680)
596 KB PNG
>>109153520
>When will lodestone start training on Krea 2 ?
I genuinely want him to do a Zeta Krea and go for pixel space, Krea definitely needs an improvement from that mid VAE
>b-but all his experiments failed!
I don't care, he should try it anyway
>>
>>109153312
kek
>>
>>109153534
no. back in the autism cage.
>>
>>109153536
oy vey!
>>
>>109153536
seethe and cope, kike
>>
>>109153536
>back in the autism cage.
>>109153542
>now. in the cage.
this IDF shill is on all generals or what? kek
>>
File: you lost.png (299 KB, 660x372)
299 KB PNG
>>109153536
>STOP NOTICING
>>
File: 1765275545609555.mp4 (1.87 MB, 720x1072)
1.87 MB
1.87 MB MP4
>>109153234
>>
File: ComfyUI_00260_.jpg (452 KB, 1280x1280)
452 KB JPG
>>109153536
>Full support to israel sarr
>>
they're indians.
>>
>>109153562
>hot steamy beer
>>
>>109153562
Based
>>
>>109153577
>>109153577
>>109153577
>>109153577
>>
Autistic people really and truly do believe they are always right, and always are the winner. It's a sight to behold.
>>
What's a good model that can do "semi" realistic images, runs on an RTX 3080 (10GB) for which LoRA's can also be trained on the same card?

Training LoRA's for Anima works fine but it doesn't do realistic images.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.