[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


Janitor acceptance emails will be sent out over the coming weeks. Make sure to check your spam folder!


[Advertise on 4chan]


Discussion and Development of Local Image, Video, and Music Models

Previous: >>109188075

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://huggingface.co/models
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Krea 2
https://huggingface.co/krea/Krea-2-Raw
https://huggingface.co/krea/Krea-2-Turbo

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>inb4 n*gbo ban evading yet again
>>
Blessed thread of frenship
>>
>mfw Resource news

07/03/2026

>Multi-Resolution Flow Matching: Training-Free Diffusion Acceleration via Staged Sampling
https://github.com/Xingyu-Zheng/MrFlow

>DiffRGD: An Inference-Time Diffusion Guidance Through Riemannian Gradient Descent
https://diffrgd.github.io

>Representation Distribution Matching for One-Step Visual Generation
https://alan-lanfeng.github.io/rdm

>SAB-LVLM: Significance-Aware Binarization for Large Vision-Language Models
https://github.com/LyuQi127/SAB_LVLM

>Style-CCL: Content-Preserving Style Transfer via Curriculum Continual Learning
https://github.com/witcherofresearch/Qwen-Image-Style-Transfer
https://github.com/Tele-AI/TeleStyle

>ByteDance-Seed / PAR
https://huggingface.co/ByteDance-Seed/PAR

07/02/2026

>PAPA: Online Personalized Active Preference Alignment
https://github.com/NasikNafi/papa

>Condensing Large-Scale Datasets Directly with Minimal Information Loss
https://github.com/LINs-lab/CIM

>VisReason: A Large-Scale Dataset for Visual Chain-of-Thought Reasoning
https://y-research-sbu.github.io/VisReason

>Asset Generator for 2D & 3D: Blender add-on that generates assets from text prompts
https://github.com/tin2tin/Asset_Generator-2D-3D

>ComfyUI-TrixLoader: All-in-One Image Loader, Editor, and Resizer node for ComfyUI
https://github.com/trx7111/ComfyUI-TrixLoader

07/01/2026

>Elastic Diffusion Transformer: Accelerating SOTA generation models
https://github.com/wangjiangshan0725/Elastic-DiT

>Boogu-Image-0.1-Edit-Turbo
https://huggingface.co/Boogu/Boogu-Image-0.1-Edit-Turbo

>GEAR: Guided End-to-End AutoRegression for Image Synthesis
https://github.com/Tencent-Hunyuan/GEAR

>SpheRoPE: Zero-Shot Optimization-Free 360 Panorama Generation with Spherical RoPE
https://orhir.github.io/SpheRoPE

>ADAPT: Attention Dynamics Alignment with Preference Tuning for Faithful MLLMs
https://github.com/yao-ustc/ADAPT

>Phase-Aligned RoPE for Mixed-Resolution Diffusion Transformer
https://hao-yu-wu.github.io/mixed_res
>>
KREA 2 > Z IMAGE TURBO
>>
>mfw Research news

07/03/2026

>OrbitQuant: Data-Agnostic Quantization for Image and Video Diffusion Transformers
https://arxiv.org/abs/2607.02461

>Anti-Prompt: Image Protection against Text-Guided Image-to-Video Generation
https://arxiv.org/abs/2607.01499

>NeoMap: Training-free Novel-View Synthesis from Single Images and Videos
https://arxiv.org/abs/2607.01962

>PixGS: Pixel-Space Diffusion for Direct 3D Gaussian Splat Generation
https://arxiv.org/abs/2607.01803

>DetailAnywhere: Fashion Detail Generation via Cross-Modal Feature Alignment Distillation
https://arxiv.org/abs/2607.02220

>ICDepth: Taming Video Diffusion Models for Video Depth Estimation via In-Context Conditioning
https://xuanhuahe.github.io/ICDepth

>Alignment Is All You Need For X-to-4D Generation
https://arxiv.org/abs/2607.02516

>QWERTY: Training-Free Motion Control via Query-Warped Video Diffusion Transformers
https://arxiv.org/abs/2607.01869

>DisciplineGen-1M: A Large-Scale Dataset for Multidisciplinary Visual Generation and Editing
https://arxiv.org/abs/2607.02290

>MIBE: Multi-subject Interaction Benchmark and Evaluator for Personalized Image Generation
https://arxiv.org/abs/2607.01383

>Optimizing Visual Generative Models via Distribution-wise Rewards
https://arxiv.org/abs/2607.02291

>Wavelet-Guided Semantic Signal Compensation for Inversion-Free Image Editing
https://arxiv.org/abs/2607.02421

>Efficient PEFT Methods with Adaptive Checkpointing for Vision Models and VLMs on Resource Constrained Consumer-GPUs
https://arxiv.org/abs/2607.02158

>From SRA to Self-Flow: Data Augmentation or Self-Supervision?
https://arxiv.org/abs/2607.02508

>KathaTrace: Diagnosing Semantic Trajectory Collapse in Generated Visual Narratives
https://arxiv.org/abs/2607.01312

>LASER: A Corrective Lens for LVLMs via Visual Attention Preservation and Sink Suppression
https://arxiv.org/abs/2607.01707
>>
>>109192143
>>109192148
Fuck off off topic spammer
>>
>>109192148
>>109192143
thanks!
>>
>>109191553
>The datasets are being made public
BWAHAHAHAHAHAHAHAH we all fucking wish that would happen but there's no way.
>>
is 50-70 images a overkill for character likeness training for krea2 and z image base?
>>
File: ComfyUI_00470_.jpg (1.46 MB, 2024x2696)
1.46 MB JPG
>>
File: debo_sf_k2_uv_00139.jpg (825 KB, 3168x1312)
825 KB JPG
>>109192156
my pleasure :)
>>
>>109192160
no
>>
>>109192168
Crazy how proper upscaling is way too hard for you after 4 years lol
>>
>>109191443
funny init
>>
File: ComfyUI_00358_.jpg (1.88 MB, 2024x2696)
1.88 MB JPG
>>
>>109192178
mama
>>
File: ComfyUI_temp_lrgjv_00013_.jpg (583 KB, 1568x2344)
583 KB JPG
Whoever said use anima as first pass for krea2 you might be on to something.
>>
File: comfyui_00027_.png (384 KB, 640x512)
384 KB PNG
>>
>>109192178
do you have a prompt for this beautiful jungle worrior? also are you using the regular krea2 turbo model or a mixed tuned model?
>>
is debo ban evading again
>>
I just saw this a krea 2 controlnet for depth
https://huggingface.co/Patil/Krea-2-depth-controlnet

requires depthanything2
https://github.com/kijai/ComfyUI-DepthAnythingV2

for the retarded it needs to be placed in.
ComfyUI/models/depthanything/

we are eating so good
>>
>>109192160
No, but they should all be different. You should remove those redundant images in your dataset (same outfit, similar pose/composition).
>>
>>109192160
no such thing. the more the better, assuming they are all varied.
>>
>>109192192
can you share the workflow for it?
>>
>>109192178
what sampler/scheduler are you using?
>>
you cant make me use two different models for a single output i wont do it
>>
krea users in with their astonishing deformities.

It makes sense that anti-racism is so popular. If you can't even comprehend anatomical mass, how can you comprehend political mass?
>>
>>109192221
Euler/Simple.
>>
File: debo_sf_k2_uv_00142.jpg (2.75 MB, 6192x2580)
2.75 MB JPG
>>109192210
nice find
>>
>>109192210
nice thanks
>>
>>109192210
ah wait a second i don't know how the fuck this works, i don't think the node can see it hmm.
>>
>>109192238
ty
>>109192216
https://files.catbox.moe/vkr3yc.png
sorry it's messy.
>>
>>109192192
You could've made this without using Anima? I don't get it.
>>
>>109192210
>>109192244
ah i should have read the setup, it seem its need some node to be installed.
>>
cozy breas
>>
>>109192212
>>109192215
>>109192171
thanks anons for the advice.
>>
File: 1779370030213746.jpg (1.46 MB, 3840x1640)
1.46 MB JPG
>>
>>109192247
thanks, hope comfy will unload anima before loading krea
>>
>>109192254
Show us, then.
>>
>>109192258
i dont think this works with comfyui
>>
>>109192192
Anima knows anatomy.
>>
>>109192269
Show you what, a basic goth thot taking a selfie with saggy tits? Seriously?
>>
>>109192240
>>109192239
https://github.com/Tanmaypatil123/Krea-2-controlnet

i don't think its for comfyui at this stage what a shame...
>>
Why does Krea start to fry really bad anytime the prompt includes a person and is for a photograph or real image
>>
File: Krea2_turbo_00113_.png (3.89 MB, 2512x1672)
3.89 MB PNG
>>
>>109192272
yeah, someone needs to make it work with depthanything, i wonder if simply adding it to the directory and changing the code to include it would work some how? They say you need depthanything2
>>
>>109192279
Sounds like you can't. :(
>>
>>109192302
maybe ask claude to built a custom node to do it. im busy at the moment
>>
>>109192308
You're right. I can't be assed to waste my time on trivial 1girl gens.
>>
>>109192266
I can fit both at the same time. if it doesn't there's a custom node that exists to force unloading. idk if it's worth doing if you have to unload tho.
>>109192254
Anima is uncensored by default, has a lot more seed variation and has booru knowledge. Basically I find krea2 to be quite boring in its output compared to Anima so this way you get the best of both worlds. Animas NSFW knowledge and creativity + Krea2 rendering.
>>
I assume 12GB VRAM and 32GB RAM is too weak to train a Krea2 lora right?
>>
>>109192335
i've seen a few anons claim you can
>>
>>109192312
I'm being retarded anon, it would need to be loaded with the actual controlnet nodes, the depthmap can be generated from anything but this file is what controls the conditioning from what i gather.
>>
is it possible to do image editing like face or background swapping with local models like grok does or is that still needing manual inpainting
>>
>>109192291
>

cringe inducing. idk. visceral. I could inspect to find out why.
>>
>>109192320
I'll find you a good example.
>>
>>109192335
it'll be slow due to ram offloading
>>
File: Krea2_turbo_00116__90.png (3.33 MB, 1505x2261)
3.33 MB PNG
>>
>>109192353
The right leg is too long.
>>
>>109192198
Can you do her but in the same post and outfit and body as >>109192178
>>
>>109192160
No. it can work ok with half the images but this is fine. Just try it.

Don't hesitate to drop photos that seem to cause issues tho, you have plenty.
>>
File: Krea2_turbo_00119__90.png (3.28 MB, 1505x2261)
3.28 MB PNG
>>109192363
You are right, it is
>>
>>109192160
i suddenly want this lora
>>
>>109192288
Pls anon halp
>>
>>109192210
the guy who made it left a post on reddit
https://www.reddit.com/r/StableDiffusion/comments/1uia5v4/krea2_control_net_pose_or_depth/
>>
File: 6.png (2.35 MB, 1664x1280)
2.35 MB PNG
txt2img slop is boring, I want a Flux Klein level DranGAN
>>
File: debo_sf_k2_uv_00144.jpg (3.57 MB, 6192x2580)
3.57 MB JPG
>>109192348
there are a bunch of different ways to do it but you'll need to set up a workflow that uses reference images and stuff
>>
>>109192387
Is that sydney sweeney lora with strength turned down?
>>
>>109192165
catbox?
>>
>>109192401
he's a faggot but he's right, it's been a while we haven't gotten a new decent edit model
>DranGAN
what's that?
>>
Working on Chroma-Krea wf optimization. The RAW model is really good at rendering humans with the distill lora and filter bypass. It's also very diverse like OG Chroma was. The only issue is that depending on the output, the limbs/fingers may be slightly inaccurate and in wrong location, I guess that's the cost of diversity, or I could test lowering or increasing weights to see where it goes. I'm very close. Perhaps a third pass through regular Turbo would fix that.
>>
File: 783.gif (2.53 MB, 780x526)
2.53 MB GIF
>>109192414
https://dragganai.io/
>>
File: lmao.png (996 KB, 1109x938)
996 KB PNG
>>109192415
jesus wtf is that
>>
File: Krea2_turbo_00121__90.png (3.36 MB, 1505x2261)
3.36 MB PNG
>>109192407
I did put Sydney Sweeney in the prompt but Krea doesn't seem to know her very well
>>
>>109192404
so i need a shitload of reference pictures for the person i want to faceswap in? i wonder how the big cloud models make do without it
>>
>>109192415
There also is one extra knob tweaked to improve realism, Krea2NegPip with (illustration:-1.2) to avoid anime/drawings since the RAW model doesn't naturally do it, works to some extent, but needs improvement
>>
>>109192387
If she tried to touch me I would break out my tailor's tape measure from my murse and give her a proper dressing down.
>>
>>109192424
In that case the Turbo LoRA was turned down, so output was a little messy, but increasing its weight to 1 it becomes much more coherent. The expressiveness on her face is from the filter bypass.
>>
>>109192414
>>109192423
neat
>>
>>109192462
it's just a bundle of bad loras.
>>
>>109192429
Tifa's fine too.
>>
File: ComfyUI_02220_.png (1.81 MB, 1296x976)
1.81 MB PNG
So I decided to try the int8 convrot for krea 2 and it is actually much slower (30 seconds on a gen that will take 70) 16 GB 4060 Ti + 64 GB Ram, I don't understand it was supposed to be faster for any modern architecture not only Ampere. Any clue? I am not using custom nodes, just the official convrot model. Is comfy's implementation wrong again?
>>
>>109192462
nice
>>
>>109192488
desu i thought convrot was poorfag only
im still using fp8 scaled
>>
man, those are shit feet.
>>
>>109192491
no. absolutely not. horrid
>>
>>109192478Setup generally works, just needs a bit more tweaking to improve fine details. Maybe the filter bypass isn't flawless and it's generally better without, will test that next
>>
>>109192497
mxfp8 here. pretty sure they mention what quant you should use in the repo.
>>
>>109192478
>>109192511
>>109192521
post something better (you wont)
>>
>>109192488
I wanted to use NVFP4 but I think only turbo has that available right now
>>
>>109192523
This one, right?
https://huggingface.co/Winnougan/Krea-2-Base-Turbo-NVFP4-FP8-INT8
It looks like FP8 is the best for me
>>
>>109192571
Yeah that's the one.
>>
File: debo_sf_k2_uv_00145.jpg (788 KB, 3168x1312)
788 KB JPG
>>109192439
no no, you just need one reference image. getting the workflow you want set up is the heavy lift but the input images are straight forward
>>
>>109192530
:^) I'm genning music with Ace Step 1.5 XL SFT.
>>
File: 1779646699541740.png (1.15 MB, 2240x1264)
1.15 MB PNG
>>
File: Krea2_turbo_00133__75.png (2.76 MB, 1254x1884)
2.76 MB PNG
>>
>>109192522
8 or 9 steps seem optimal, past it and it will hallucinate details. Filter bypass I don't think was doing much, skin stays realistic without it (though it clearly helps in the case it's a single generation through Krea as opposed to a 2 pass workflow).
>>
>>109192488
I have the 4060 Ti 16GB too and int8-convrot is faster than fp8 for me
is your ComfyUI up to date?
>>
>>109192694
people are afraid of updating comfyui because they might add a feature they don't use
>>
:^) guess who actually owns 4chan
>>
>>109192719
Ah yes it's definitely not because you were derailing the thread with an off-topic discussion...
>>
File: Krea2_turbo_00136__75.png (3.23 MB, 1254x1884)
3.23 MB PNG
>>109192719
4chan is a honeypot
>>
>>109192373
i going to try and trim it down to 60, captioning is going to be repetitive but I'm getting good at it.
>>109192389
i will share the gofile of it when its finished, she's a DNP so posting her huggingface is a high risk. going to take awhile.
>>
>>109192744
shit caption desu
>>
>>109192738
krea is shit not honey
>>
>>109192768
It's not shit, I like it
>>
>>109192744
why the fuck are you using booru style captioning for newer models? dude, use joy caption or some LLM. those tags are useless
>>
>>109192768
I like shit, the steamier the better
>>
>>109192634
Unlike OG version of the workflow, not a lot of issues left after 2nd pass. Initial Chroma gen still plays a bit of role in final output though, E.G. it should be close enough but not entirely inaccurate with things such as feet size and positioning etc... Though higher denoise values help, I tend to stick to around 0.5-0.65 for best results. At 0.65 it generalizes and doesn't break the realism style. It cleans up background really well. No stack of LoRAs will ever compare to doing this a 2 pass with Chroma. Similarly with anime, no stack of LoRAs compare to a 2 pass with Anima. The base models being passed are too strong at what they do, and only full finetunes could ever hope to compare. Krea 2 is very good at approximating styles with img2img, and its RAW model is even better at it than Turbo, which is why all this works. Hence, the wf is very useful for realism NSFW.
>>
>>109192744
why the fuck are you using ugly women? dude, use an attractive girl or something. those bitches are useless
>>
>>109192694
Yes. I updated to test convrot. I don't know what can it be. I am using Krea 2 turbo + 4-5 loras.
ComfyUI 0.27.0
ComfyUI_frontend v1.45.20
Templates v0.11.1
Discord
ComfyOrg
rgthree-comfy v1.0.2606200020
EasyUse v1.3.7
ComfyUI-Manager V3.41
>>
File: 1760132832195392.png (237 KB, 604x810)
237 KB PNG
>>109192719
>makes an off topic post
>gets warned because of the off topic post
"huhhh??? I dun get ittt, I didun du nuthin"
>>
>>109192786
post wf?
>>
>>109192792
>ugly women?
nta but I thought she was pretty hot.
>>
>>109192786
I too get that stretched artifact on the bottom some times, like the resolution is wrong or something
>>
File: Krea2_turbo_00139__75.png (2.94 MB, 1254x1884)
2.94 MB PNG
>>109192799
Get your eyes checked then
>>
>>109192497
int8 convrot is higher quality than fp8 scaled. comfy said the default comfy repo will only include convrot in the future
>>
>>109192793
>I am using Krea 2 turbo + 4-5 loras.
okay, what about the speed with no loras?
>>
>>109192794
He objectively didn't say "kys kys". Run md5 on the string.
>>
base krea doesn't know what the inside of a pussy looks like.
>>
>>109192822
What's ideo do if you dig a hole there?
>>
>>109192818
kek, good point
>>
>>109192822
neither do you
>>
>>109192799
she looks like mrs potato head that got hit with down syndrome
>>
File: 1764643857333743.png (387 KB, 547x497)
387 KB PNG
I can tell this retard used Qwen Image Edit kek
>>
>>109192851
I'm eternally grateful for the fact I don't need to see this sort of excrement because I don't use linkedin.
>>
>>109192860
Don't need to use it if you're unemployed
>>
>>109192851
Don't be silly anon, it's clear that he had a great co-working day!
>>
>>109192868
Your post hit me like a physical blow.
>>
File: 1774879184758443.png (1.21 MB, 1376x2064)
1.21 MB PNG
>>
>>109192841
>>109192807
Ok? more fun for me?
>>
File: ComfyUI_02259_.png (1.51 MB, 976x1296)
1.51 MB PNG
>>109192816
Slow, the loras are not related.
>>
>>109192851
Do wagecucks enjoy humiliating themselves?
>>
>>109192896
no... anon we're making FUN of (You)
>>
File: ChromaVsAnima.jpg (1.7 MB, 2688x1728)
1.7 MB JPG
Training a real person character lora on Anima Base and then running it at like 0.8 strength on top of the official Turbo lora at around 0.7 strength gives similarly good results to training one on Chroma HD and then running it on the Chroma Flash checkpoint at 0.8 strength
>>
>>109192177
Pedo board
>>
>>109192902
Spooky how she looks like the ex I was dating in college. I wish she stayed this cute though, she became an alcoholic junkie and aged like shit.
>>
>>109192922
Did the blog factory explode
>>
>>109192917
>>109192177
normies need to be gate kept. stay on reddit. you never belonged here
>>
>>109192922
You could have saved her
>>
>>109192830
kek
>>109192808
In your dream?
>>
File: samus.jpg (384 KB, 841x1264)
384 KB JPG
>>109192632
>>
>>109192935
>You could have saved her
She got empowered by feminism, 3 months of dating her was depressing enough.
>>
>>109192910
idk man, it's kinda your loss if you don't find someone attractive.
>>
>>109192941
https://www.reddit.com/r/comfyui/comments/1uk6q5m/comfyui_v0270_now_officially_supports_convrot/

>It has better quality than fp8 while generally being faster.
>It has better quality than fp8 while generally being faster.

>We will likely be updating our default templates to use the convrot int8 models by default because they give better quality and performance for most people.
>We will likely be updating our default templates to use the convrot int8 models by default because they give better quality and performance for most people.
>>
File: 00014-1575866754.png (2.07 MB, 1344x1728)
2.07 MB PNG
>>
https://github.com/Comfy-Org/ComfyUI/issues/14735
>Finally, the Comfy community currently only integrates ConvRot W8A8. I want to emphasize that the real advantage of ConvRot lies in W4A4. I look forward to the Comfy community using ConvRot W4A4 to provide users with an even more outstanding performance experience.
It seems we have to look forward to INT4, which will mean even greater speeds
>>
File: 00041-1645631207.png (3.07 MB, 2560x1344)
3.07 MB PNG
>>109192781
>>109192759
what? the booru style captioning worked well with my previous loras for z image base and krea 2. i find it better and more precise than a long ass continuous run off sentence structure and paragraph structure.
>>
>>109192955
All forms of fp8 are dogshit
How does convrot int8 compare to Q8_0 GGUF?
>>
>>109193014
>than a long ass continuous run off sentence
did you mean run-on?
just don't write run-on sentences you fucking retard
>>
>>109193018
>How does convrot int8 compare to Q8_0 GGUF?
https://github.com/BobJohnson24/ComfyUI-INT8-Fast/blob/main/Metrics.md
>>
>>109193018
>How does convrot int8 compare to Q8_0 GGUF?
They're supposed to be basically the same.
>>
>>109192178
Unreal
>>
>>109193018
q8 gguf has slightly better quality in some circumstances but int8 is faster, but take my word with a grain of salt and compare/contrast yourself if you are interested. I personally didnt find much of any difference between q8 and int8
>>
>>109193030
>>109193029
>>109193038
Ok cool q8_0 is still better and remains the greatest quantisation format of all time. I can't imagine how you would fundamentally do better so it makes sense thanks anons
>>
>>109193051
did you skip the part about convrot being faster and providing nearly identical quality. that makes it better
>>
>>109192955
Didn’t know comfyfags were plebbitors
>>
when int4-convrot rolls out, gguf will be obsolete.
>>
>>109193064
>not keeping up with ai tech on various platforms
imagine /g/ being your only source of news
>>
>>109192797
WF
https://files.catbox.moe/u8t01a.png

You need to grab models from here

INT8 Convrot Chroma 1 HD Flash
https://gofile.io/d/QlXI2i

T5 Text encoder
https://gofile.io/d/l93BuN

Downside so far is it doesn't work well at 2k. Chroma 1 Flash also suffers from a strange issue with counting females (which the HD version doesn't suffer from) It's one area that Lodestone could've improved for the Flash version, but I don't see any newer Flash models for Radiance, 2K, etc...

The benefits this wf are immense, same as last time, all the bad details from the Chroma gens are gone, nonsensical backgrounds, extra fingers, etc...

Before
https://files.catbox.moe/y7p7mk.png
After
>>109192786
Before
https://files.catbox.moe/yc8wf6.png
After (Pic rel)
Before
https://files.catbox.moe/6s8m05.png
After
https://files.catbox.moe/ewble1.png

And of course the model can do much more, and NSFW is here at full capacity.
>>
ok we pull
>>
>>109193051
it's a bit worse than Q8, but it's 2x faster, there's no way I'm gonna ignore that
>>
File: 00325-445185681.png (1.39 MB, 904x680)
1.39 MB PNG
>>
>>109193051
Q quants lose speed with loras
>>
mxfp8 > convrot
Cuda is especially tuned for it.
>muh speed
only poorfags with old hardware want dogshit lower quality models
>>
>>109192210
Anima bross? Why they aren't doing controlnet for us!?!?!
>>
>>
>>109193075
>2x faster
*On 30xx cards. Why do people keep leaving this out? iIt's objectively worse on 40 and 50 cards
>>
>>109193060
I'm gonna need to do my own tests, the cosine similarity of only 0.98xxx for Q8_0 GGUF sounds low

>>109193075
>2x faster than Q8_0
Oh I see it's GPU optimized, I guess this is the q8 killer then yeah
Maybe this will make me get into LTX finally
>>
>>109193128
>*On 30xx cards. Why do people keep leaving this out?
not only on 30xx cards, it also works on 20xx cards, where did you get the info it's only for the 30xx cards?
>>
>>109193128
I have yet to see the speed benefits being claimed on my 3060.
Either VRAM is not enough and voids any speed gains or my comfy install is shit.
>>
>>109193113
https://huggingface.co/TaihoC/Anima-ControlNet-VACE-Depth
Umm retard?
>>
>>109193158
40 and 50 cards natively support FP8
>>
File: 00352-2271551540.png (1.3 MB, 800x856)
1.3 MB PNG
>>
>>109193190
int8 is faster than fp8 on my 4060 thobeit
>>
>>109193190
you still get a speed increase on the 40 and 50 cards, 10 to 20% speed increase, and since int8 is a better quality quant than fp8, fp8 is basically deprecated
>>
>>109193128
>Which Format Should I Use?
>RTX 30xx INT8 ConvRot for best quality, >INT8 for fastest
>RTX 40xx FP8
>RTX 50xx Blackwell NVFP4, MXFP8, or FP8, your choice

https://huggingface.co/Winnougan/Krea-2-Base-Turbo-NVFP4-FP8-INT8

I guess it makes a difference (likely speed or accuracy, who knows) but I use INT8 since I'm a 3090let.
>>
what local models do i need for video to video
>>
>>109193222
scail2
wan vace
ltx2.3
>>
>>109193128
It's still 2x over all the other variants on my 5090. My ComfyUI is on cu130 with matching pytorch so I don't know what's going on there.
>>
File: 1776995139921870.png (591 KB, 614x552)
591 KB PNG
>>109193231
gracias
>>
>>109193126
chroma krea? did lodes train a krea finetune?
>>
>>109193282
Nope, I wish. Rn he's focusing on finishing radiance for Chroma.1 and other models. Maybe he will train Krea, but its license is not the best. That is just a very fancy wf where I'm doing a 2 pass, first genning with Chroma, then passing it to Krea as img2img to fix mistakes which works well and is very fast on my 3090 >>109193069
>>
made a custom node that sends a toast notification if a fixed seed is being used. i think its a pretty neat idea dont you think? sometimes i save a workflow with a fixed seed, open it up the next day and forget it's not randomized then wonder why my gens all are similar.
>>
>>109192210
Retard here, how do I use this for gens? Please post workflow.
>>
>>109193354
It's not made for Comfy yet.
>>
>>109193397
Then why was he yapping about ComfyUI/models/depthanything/?
Fucking LLM bot spam.
>>
>>109193006
The original Convrot paper was actually mostly about int4, it specially advocated for keeping weights primarily in 4 bit while keeping a handful of important layers in 8 bit. (Both convrot variants) It even envisioned that this could be combined with a nunchaku style outlier decompose lora for much more efficient results.
>>
>>109193460
Cornrot is another germick, psycho hump.
>>
>>109193460
Oh he actually said these in his post.
Well maybe I should have read his post before posting, lol.
>>
>>109193302
Is he actually starting radiance training again? I thought he was still trudging through zeta.
>>
>>109193505
He restarted Radiance "training" weeks ago.
He is "training" Zeta, Radiance and some useless 2.5B pixel space model from scratch simultaneously (Thanks to Comfy being a retard gifting him 2 6000 pros).
>>
>>109193415
>filtered
ngmi
>>
>>109193069
FUCK OFF FAGGOT STICK CHROMA UP YOUR ASS. YOUR CHROMA EXAMPLES ARE SHIT. WHY DO YOU KEEP INSISTING ON USING THAT FAILED, YEAR OLD MODEL WITH A DEV WHO IS MORE OF A CLOWN THAN AN ACTUAL DEV? CHROMA IT'S OUTDATED, BUILT ON THE FLAWED SCHNELL ARCHITECTURE, UNSTABLE AS HELL, AND ONLY SEEMS TO UNDERSTAND SLOPPY PROMPTS. BY INSISTING ON GOING BACK TO THAT MODEL, ALL YOU'RE DOING IS PROLONGING THE DAMAGE CHROMA HAS ALREADY DONE TO THE LOCAL COMMUNITY.
>>
Krea 2 is a realistic /ss/ machine, it wonned
>>
File: Ideogram__00741_.jpg (2.99 MB, 3840x2160)
2.99 MB JPG
>>
>>109193588
proofs?
>>
>>109193639
Ideogram is really impressive ngl
>>
>>109193648
nice try fedboy
>>
Help me understand Krea 2 better, anons.
>Prompt structure
So what's up with it? To my understanding, it's supposed to be
>Subject, background/environment and mood, camera angle/framing, style and lighting
In that order, right? And I should be verbose and descriptive instead of throwing a tag soup.
However, at times I have trouble forcing the model to adhere to my prompts. For context, I'm using both a the filter bypass and the "instruct abliterated" clip for prompt adherence, as well as nsfw loras. When I prompted for "youthful woman", "thin wasp-waist" and "massive tits" it always gave me very petite results with small breasts, it wasn't until I removed "youthful" that it started generating something closer to my prompt.
There's also face: How well does it understand describing facial features, and what's the best way to do so? Would ethnicity work? Should I get a lora for this too?
>Posing
This is another wall I'm regularly hitting. I've tried my best to describe a pose, even get gemini and claude to describe it in order to prompt Krea 2 and still don't get anything remotely similar.
From what my uneducated ass gathers, I can't use controlnet with this model. So what's the best way to get it to adhere to a reference pose?
>>
>>109193666
but it is too big and slow for my hardware so actually it is pretty bad and no one should want it
>>
>>109193666
nah
>>
>>109193676
>There's also face: How well does it understand describing facial features, and what's the best way to do so? Would ethnicity work? Should I get a lora for this too?
use a celeb, it knows many
>>
File: ComfyUI_temp_bzkba_00002_.png (3.4 MB, 1129x1688)
3.4 MB PNG
>>
>>109193678
it's pretty small as far as the newer models are concerned and most people can run it
>>
>>109193694
18.6B+8B is not small
>>
>>109193685
What if I don't necessarily want my result to look like a celeb?
[spoiler]What if I want the face of someone it doesn't know?[/spoiler]
>>
>>109192160
You get better likeness with fewer images.
>>
>>109193708
then you have to train a lora for it know
>>
>>109193703
it's a MoE model, no one says Wan 2.2 is a 28b model
>>
>>109193720
it's not a MoE model just because it is split into two files
>no one says Wan 2.2 is a 28b model
high IQ people do
>>
File: ComfyUI_temp_bzkba_00006_.png (3.15 MB, 1129x1688)
3.15 MB PNG
>>
>>109193676
Krea2's prompt interpreter works almost like SDXL. The model prioritizes the words in the front of the prompt. The order of the words in the prompt matter more than the content of your prompt. There's often concept bleeding.
>>
>>109193666
we can't use other loras with the turbo lora. and and the standard version is fucking slow. also, complex prompts...
>>
>>109193505
Seems to be doing 1024p training Radiance alongside Zeta.

>>109193575
Why does the SOTA at NSFW realism trigger some anons so much? Lol, also Chroma 1 Flash is special and fixes many problems that plagued its OG version. Chroma.1 HD unfortunately suffers too many issues for me to go back to it, but a mix with HD Flash does fix them (at the cost of being huge to run, I'll figure out if that can be fixed).

>UNSTABLE AS HELL, AND ONLY SEEMS TO UNDERSTAND SLOPPY PROMPTS

Skill issue. Its NSFW understanding is far beyond Krea, good luck prompting any of these safetymaxxed models for "panties slightly visible", and unlike Krea it understands every single expression that girls can do from nuanced prompts without needing to resort to bypass tricks.

>PROLONGING THE DAMAGE CHROMA HAS ALREADY DONE TO THE LOCAL COMMUNITY.

Your screeching is akin to
>why do you keep using what's available to you
There's no reason why someone who's seeking to decensor Krea 2 shouldn't use just use Chroma in 2-pass setup, otherwise they'd be limited to the sameface, poses, lack of coherence, lack of prompting freedom, and a bunch of other issues that are limitations of LoRAs.

Also, it goes without saying that a Chroma-Krea wf doesn't suffer from most of the limitations of Chroma except for the limited areas where it falls short. Even then, just increase denoise, adjust prompt slightly on Krea side and you're good again.
>>
>>109193772
I thought they all worked like that.
>>
Prompt understanding/control of Krea, but with Chroma's uncensored and unblurred backgrounds and subjects, both fixed. It's a win/win for realism.
>>
Controlnet works with this actually.
https://github.com/facok/comfyui-krea2-controlnet
Damn I hoped it would overpower censorship when it comes to naughty stuff but alas.
Besides that it's meh. It's too loose for my taste when it comes to following the controlnet image, results are a bit weird, it also adds stuff to image that wasn't in the control image.
Thanks to the person who made it for the effort nonetheless.
>>
>>109193871
I'm not running two large models
>>
>>109193879
You don't need to. Just generate the Chroma image separately and then do I2I in Krea2 with low denoise.
>>
>>109193825
I've yet to see any evidence that Krea 2 without loras is in any way "good" at NSFW. No one has ever posted a pic made with vanilla Krea 2 that went beyond basic solo booby, anywhere that I've ever seen.
>>
File: 5ohetc.png (890 KB, 1216x832)
890 KB PNG
>>
>>109193887
yeah, that's running two large models
>>
File: file.png (819 KB, 1426x983)
819 KB PNG
>>109193397
>It's not made for Comfy yet.
>>
File: Fuck niggers.png (97 KB, 1863x632)
97 KB PNG
Based Klein
>>
>>109193895
I assumed you already had chroma gens you wanted to improve. Either way, the NSFW Krea loras don't match what all chroma can do. For example none of them can do anal good, so it just made my results worse
>>
>>109193906
Even for ZiT like, the Mystic NSFW ones were pretty good, if Krea 2 is just more of that IDK why people are hyping it up so much
>>
>>109193900
he literally just made this an hour ago
https://github.com/facok/comfyui-krea2-controlnet
>>
Comfy really needs to work on making the UI just werk if this is going to become a meaningful industry standard.
>>
>>109193913
Mystic NSFW changes the appearance too much for me.
>>
>>109193925
SNOFS is better
>>
>>109193924
actual enterprise companies use cloud models so i dont think it matters
>>
File: WetMuscleTest.jpg (2.54 MB, 3072x4096)
2.54 MB JPG
>A muscular beautiful young Japanese woman with long dark hair, wearing a wet black high-cut one-piece swimsuit, clear platform high heels, and gold earrings. Her skin is shiny with oil or water, highlighting her muscular physique. She poses with one hand on her hip and the other extended flexing, looking directly at the camera.
>A small golden triangular logo with an eye symbol and the word 'Milly' in cursive script, located in the top-left corner.
>gray painted brick wall with visible mortar lines, light-colored tiled floor.
>studio lighting, even and bright, highlights muscle definition and sheen
>>
>>109193879
Not that large. It's just 8 steps an INT8 version for Chroma 1 HD Flash (a flash version quantized at INT8 and running 2x the speed, only 30 secs on 3090), plus the usual Turbo speeds with Krea for 8 steps (also 30 secs). Think of it this way, the cost of the workflow assuming you have at least 64GB of RAM (the more, the better) to fit both models is just 2x the speed of your current Krea workflow (during inference), and the benefit is deslopping the model for almost all NSFW tasks you'd need to.
>>
>>109193925
Yeah it's not perfect, but it was still quite good as far as ZiT is concerned IMO. Anyways BigAsp V3 on Klein 9B is looking good IMO, hope it gets either a Turbo version or at least Turbo lora of some kind eventually when it comes out.
>>
>>109193928
eh, i'm really not a big fan of any AIO NSFW lora. I prefer targeted concept loras. Those tend to work the best.
>>
>>109193932
ideogram looks like it has some shitty grainy filter applied, no thanks.

krea2 is better.
>>
>>109193934
Starting Chroma image.
https://files.catbox.moe/4898v0.png

Same seed as >>109193934, slightly higher denoise from 0.50 to 0.55.
https://files.catbox.moe/exphbd.png

My usual denoise level of 0.60 (most slopped in this case)
https://files.catbox.moe/9eddx8.png

The model also doesn't understand as many hand signs as Chroma it seems and it needs decensor.
>>
>>109193968
better if you also thought pre-2512 Qwen Image was somehow good at producing realistic images maybe lol
>>
File: ComfyUI_temp_bzkba_00026_.png (2.97 MB, 1129x1688)
2.97 MB PNG
>>
>>109193986
it doesn't matter either way. ideogram is abandoned as far as the community goes. everyone is making loras for krea 2. you can keep using a dead model all you like though. good luck
>>
Why are there still people praising Ideogram when it can't even do SFW gens? It censors bread for christ sakes.
>>
>>109193998
Ideogram is abandonned because it decided to kill itself by not allowing coom loras, not because Krea 2 is better, that's a huge difference
>>
>>109193996
troon hands
>>
>>109194017
https://civitai.red/models/2688234/realism-engine-ideogram-4-krea-2?modelVersionId=3047310
>>
>>109194017
ok anon you win. now can you stop posting useless ideogram comparisons?
>>
when will we finally get a model that's small enough to be widely used and receive community development but still enough of an improvement to replace XL instead of these bloatmodel abortions that are forgotten as soon as the novelty wears off
>>
>>109194022
why? you're afraid to measure Krea 2 to other models? that doesn't scream "this is the best model ever" to me
>>
>>109194027
comparing things to dead models is a waste of time.
>>
>>109194021
You still need to do BBOX gymnastics to get that lora to work.
>>
>>109194023
Anima is kinda that DESU
>>
The comparisons will continue. I like both models btw
>>
>>109193998
My point is moreso I do not understand how Krea 2 is supposed to be better than really any vaguely recent model, not just Ideogram
>>
>>109194056
Uhh it knows celebrities or something
>>
>>109194021
>That json prompting

Into the trash it goes. I'm a human being, not a REST API. No, LLM prompt enhancing are not as good as just raw prompting (the latter is much more creative than the former as LLMs use an autoregressive architecture)
>>
File: Ideogram__00010.jpg (120 KB, 1024x1536)
120 KB JPG
literally me
>>
File: a-piece-of-bread.png (889 KB, 1726x848)
889 KB PNG
>>
File: WetMuscle-zit.jpg (289 KB, 1536x2048)
289 KB JPG
>>109193932
>>
Where are the america gens for America's birthday
>>
>>109193354
>>109193415
cool it fucking retards i just found it when it was just released and no support for comfyui and jumped the gun. But here you miserable fucks.

https://github.com/facok/comfyui-krea2-controlnet

Things are moving fast it seems.
>>
>>109194114
nice toes
>>
>>109194162
Image blocked by safety filter.
>>
File: ComfyUI_00235_.png (733 KB, 1024x1024)
733 KB PNG
no. i dont think i will
>>
>>109193874
>Damn I hoped it would overpower censorship when it comes to naughty stuff but alas.
I was already 70% there just denoising an image from anima, disabling safety filter and using some nsfw lora's for krea. with the controlnet I think I'll be able to 100% denoise the image or at least a little higher without losing too much composition. There was not many images it couldn't do at 80% denoise, so maybe its skill issue on your part? I'll maybe share a workflow if I get it working but do not pester me.
>>
File: fgcer3.png (926 KB, 1024x768)
926 KB PNG
>>109194162
>>
File: new.jpg (709 KB, 1520x1952)
709 KB JPG
>>
File: file.png (2.88 MB, 2910x840)
2.88 MB PNG
>>109193919
>>109194180
This node seems superior to the brute force workaround i made. I'm about to have a lot of fun with this
>>
>>109194361
I get RuntimeError: No compatible Krea2 control LoRA block weights were found in the selected file. when running the node =/ don't know why because I checked the hash of the file and it is the correct file. any hints?
>>
File: debo_sf_k2_uv_00156.jpg (217 KB, 1574x656)
217 KB JPG
>>
File: Krea2_00223_.png (3.32 MB, 1536x2048)
3.32 MB PNG
mondo girl clocking off
>>
>>109194361
>>109194381
well they changed something 16 minutes ago, they might have broken it because its not loading that lora at all. Unless I'm being retarded which I don't think I am because I followed everything exactly.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.