[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: collage.jpg (3.43 MB, 5000x4410)
3.43 MB JPG
Discussion and Development of Local Image, Video, and Music Models

Previous: >>109146216

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://huggingface.co/models
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Krea 2
https://huggingface.co/krea/Krea-2-Raw
https://huggingface.co/krea/Krea-2-Turbo

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
ideogram WARRIORS run this thread
>>
https://github.com/capitan01R/ComfyUI-Krea2T-Enhancer
this is better than Krea Rebalance imo
>>
File: ComfyUI_Krea2__00142_.png (1.68 MB, 1296x1296)
1.68 MB PNG
>>
indiagram users are here and they're upset
>>
File: 00026-1666684830.jpg (515 KB, 2304x2304)
515 KB JPG
>>
>>109148550
i shit on you timmy
>>
Let's settle this once and for all!

Poll:
https://poal.me/kl1qfy
https://poal.me/kl1qfy
https://poal.me/kl1qfy
https://poal.me/kl1qfy
https://poal.me/kl1qfy
>>
>>109148538
How so?
>>
>>109148550
I hate both idiotgram 4 and Kreatarded 2
>>
File: 1761024767689697.png (22 KB, 786x241)
22 KB PNG
?

Is there really no regional prompting, sketch input or anything like that for ZIT?
>>
>>109148577
No anima, no vote.
>>
>>109148592
you get less slopped images and less artifacts, it doesn't feel like you're losing something else while removing the filter
>>
I'm gonna try something with the collage and bernini, had to resize though due to oom 5000+ resolution for the reference images is too much it seems on my hardware. It will be interesting if it can do a 5 second montage or even have all the characters from that image in one scene. If it can do that then i will be amazed but I have doubts on scene complexity from such a shitty image res.
>>
File: 1773876293059924.png (567 KB, 1566x903)
567 KB PNG
>>109148577
STOP THE COUNT
>>
>inb4 nigbo
>>
File: Krea2_turbo_00480_.png (1.86 MB, 1672x944)
1.86 MB PNG
>>109148577
I don't think any one of them is the best. It's good to have all of them. You can test them, learn their strengths and weaknesses and use them to achieve whatever it is that you want to create. They can exist peacefully together augmenting each other. They are tools in your arsenal.
>>
>>109148601
I intentionally left Anima out because it's an Anime-specific fine-tune and the other models are all-arounders foundational models
>>
>>109148601
50k status?
>>
>>109148619
rent free.
>>
>>109148619
35 stars status?
>>
flux2klein9b vs krea2
>>
>>109148618
Is indiagram an all rounder? i only ever see one style coming from it and that's cinematic film stills. can indiagram even generate anime or cartoons?
>>
>>109148614
wow this place is buzzing.
>>
>>109148614
ZITgods wonned as expected
>>
File: ZiT still da goat.png (1.38 MB, 1647x1363)
1.38 MB PNG
>>109148614
As god intended.
>>
>>109148638
the thread losted. nobody is here
>>
>>109148644
Cope
>>
>>109148641
Vramlets dominate the hobby
>>
>>109148625
flux2klein9b is retarded if its the one on the left, its not even the correct mediation pose ffs, the one on the right is the correct way, palms not touching. I hate klein9b so much i might just delete it from my machine.
>>
>>109148646
copeding is what you is
>>
File: 679353003264701.png (1.84 MB, 1984x832)
1.84 MB PNG
>>
>>109148641
So in /ldg/ are there only 12 GPUtards? Lmao
>>
I voted 6 times for ZIT.
>>
File: AIEEEEEEE.png (818 KB, 1522x1092)
818 KB PNG
Kreasissies..
>>
>>109148650
>the one on the right
my right, i'm autisic so putting myself in their position is an after thought.
>>
bros i just took a bunch of photos of someone i know, make a lora for krea2, and with a simple nsfw lora i have perfect nudes, like i wouldn't even tell if they are real or not.
With ltx2.3 you can probably animate them to do something simple. This can be addictive and I'm not sure I like it...
>>
>>109148650
kek you right.
Flux is getting old. it needs an upgrade.
>>
>>109148673
now get to blackmailing
>>
File: 2026-06-27_krea2_29.jpg (1.86 MB, 2160x3840)
1.86 MB JPG
>>109148656
I didn't vote because I have no strong preference yet.
>>
>>109148054
>>109146932
>You turn up the realism
>The model becomes sloppy and too realistic
>Can't do amateur photography or even vary its styles as much as other models can
>>
>mfw Resource news

06/27/2026

>ComfyUI-VAEFrequencyBlend: Blend images decoded by different VAEs
https://github.com/thezveroboy/ComfyUI-VAEFrequencyBlend

>Ideogram4 & Krea2 Inpainting with LanPaint Support
https://github.com/scraed/LanPaint/releases/tag/1.5.5

06/26/2026

>OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation
https://correr-zhou.github.io/OmniShow

>Adobe to Acquire Topaz Labs
https://news.adobe.com/news/2026/06/adobe-to-acquire-topaz-labs

>LiveEdit: Towards Real-Time Diffusion-Based Streaming Video Editing
https://live-edit.github.io

>PhysRAG: Enhancing Physics-Awareness in Video Generation via Retrieval-Augmented Generation
https://github.com/sediment1024/PhysRAG

>SAM2Matting: Generalized Image and Video Matting
https://henghuiding.com/SAM2Matting

>Unison: Benchmarking Unified Multimodal Models via Synergistic Understanding and Generation
https://github.com/FudanCVL/Unison

>ComfyUI-AppleSilicon-FP8 - a compatibility layer custom node for Apple Silicon
https://github.com/pawel-mazurkiewicz/ComfyUI-AppleSilicon-FP8

06/25/2026

>Bernini-R — GGUF (high & low noise experts)
https://huggingface.co/neuregex/Bernini-R-GGUF

>Physics Question Scene Graph: Fine-grained Evaluation of Physical Plausibility in Text-to-Video Generation
https://github.com/atinpothiraj/pqsg

>VPA-Guard: Defending and Benchmarking Image-to-Video Generation Against Visual Prompt Attacks
https://huggingface.co/datasets/CSU-JPG/VVA-Bench

>Minimalist Preprocessing Approach for Image Synthesis Detection
https://github.com/vohoaidanh/adof

06/24/2026

>Krea-2-Turbo Training Adapter
https://huggingface.co/ostris/krea2_turbo_training_adapter

>Vera: A Layered Diffusion Model for Content-Preserving Video Editing
https://vera-layered-diffusion.github.io

>Advancing WordArt-Oriented Scene Text Recognition
https://github.com/YesianRohn/WATER

>DramaDirector: Geometry-Guided Short Drama Generation
https://github.com/iLearn-Lab/DramaDirector
>>
File: 180429CUI_00001_.png (1.97 MB, 1536x1152)
1.97 MB PNG
>>
>>109148676
>Flux is getting old.
Z-image turbo is older than Klein though lol
>>
can ideogram do this?
>>
>mfw Research news

06/27/2026

>Customizing Video Portraits via Identity-ActionDecoupling
https://arxiv.org/abs/2606.22347

>Don't Settle at the Mode! Mitigating Diversity Collapse in Pretrained Flow Models via Feature Self-Guidance
https://dont-settle-at-the-mode.github.io

>GeoT2V-Bench: Benchmarking 3D Consistency in Text-to-Video Models via 3D Reconstruction
https://arxiv.org/abs/2606.24829

>NullFlow: One-Step Generative Reconstruction
https://arxiv.org/abs/2606.22696

>Generative Relightable Avatars
https://vcai.mpi-inf.mpg.de/projects/GRA

>LISA: Likelihood Score Alignment for Visual-condition Controllable Generation
https://arxiv.org/abs/2606.27192

>A Test-time Actor-Critic Approach to News Images Generation
https://arxiv.org/abs/2606.21304

>PortraitGen: Exemplar-Driven GRPO with Dual-Reward Guidance for Photorealistic Portrait Generation
https://arxiv.org/abs/2606.26930

>VT-DUDA: Visual Token Conditioning for Diffusion-guided Unsupervised Domain Adaptation
https://arxiv.org/abs/2606.21700

>Forget, Anticipate and Adapt: Test Time Training for Long Videos
https://arxiv.org/abs/2606.26515

>Focusing on What Matters: Saliency-Harnessing Accurate Routing for Diffusion MoE
https://arxiv.org/abs/2606.26938

>Training-Free Semantic Correction for Autoregressive Visual Models
https://arxiv.org/abs/2606.22550

>FlowCodec: One-Step Flow Prior for Generative Image Compression
https://arxiv.org/abs/2606.21030

>Staying VIGILant: Mitigating Visual Laziness via Counterfactual Visual Alignment in MLLMs
https://arxiv.org/abs/2606.26387

>LogicIR: Logic Gate Networks for Image Restoration
https://github.com/jimmy9704/LogicIR
https://arxiv.org/abs/2606.26609

>See & Sniff: Learning Visuo-Olfactory Representations
https://mm.kaist.ac.kr/projects/SeeandSniff

>Computer Vision for MOBA Analytics: A Dataset and Baseline for Visibility Analysis in Dota 2
https://arxiv.org/abs/2606.26970
>>
>>109148692
Who would want to render this though?
>>
>>109148702
So... it can't?
>>
>>109148673
escapist chads are going to inherit AI
>>
File: this.png (263 KB, 620x337)
263 KB PNG
>>109148703
>>
File: bernini testing_00004.mp4 (1.54 MB, 656x656)
1.54 MB
1.54 MB MP4
>>109148608
that didn't work so well.

You are a helpful assistant specialized in image-to-video generation. Using the reference image. create a 5 second montage from all the characters in the image provide.
>>
>>109148673
>I'm not sure I like it...
Sure Jan...
>>
>>109148711
>posts non-generated jewish man from a reddit movie to make a point
Interesting.
>>
>>109148679
A spineless anime fag, how unusual!
>>
>>109148719
So... I'm right?
>>
>>109148673
as long as you aren't bothering anyone with it what's the harm? not that I condone such a thing.
>>
>>109148727
No you're an idiot and ideogram is not an all-rounder it is highly specialized towards one thing and that's cinematic still frames.
>>
>>109148688
nice style
>>
File: Ideogram__00144_.jpg (1.58 MB, 2048x2048)
1.58 MB JPG
>>109148734
That's not true
>>
File: kek.png (61 KB, 1116x336)
61 KB PNG
>>109148734
>Ideogram is highly specalized on things that matter while Krea 2 can do Lois from Family Guy
that's why you lost btw
>>
File: Krea2_turbo_00484_.png (1.65 MB, 1672x944)
1.65 MB PNG
>>
>>109148742
>look here's ONE extremely mediocre example of it doing something else
Uh-huh
>>109148744
Losted within 16 votes! Nooooooo! Such large!
>>
File: debo_k_00137_.png (2.68 MB, 1792x977)
2.68 MB PNG
>>
>>109148770
I mean, were you retarded enough to expect this place to be filled with thousands of people? it's mostly 10 schizos who are samefagging lol
>>
>>109148770
>It's not an all-rounder
>NOOOO NOT LIKE THAT
>>
>>109148767
kek, i have to try this krea2 turbo later, i have it downloaded just haven't used it yet. I only downloaded because people said it is as fast as anima is when anima is running at 30 steps. And its 12b with better prompt adherence
>>
>>109148780
Why is every comparison between the two not trying to test it's capabilities but only testing realism?
>>
>>109148692
how lewd can it get thou?
>>
File: 2026-06-27_krea2_32.jpg (1.44 MB, 2160x3840)
1.44 MB JPG
>>109148721
I've started genning Anime literally yesterday for the first time. Don't even know what I'm doing, to be quite honest with you.
Still holding out my judgement on Krea since I don't know if my workflow isn't quite there yet or if it's the model.
>>
>>109148799
because only realism hasn't been solved yet, anime shit has been solved since 2023
>>
>>109148820
>anime shit has been solved since 2023
kek. yeah there only exists anime and realistic photographs. smartest indiagram user.
>>
>>109148821
>there only exists anime and realistic photographs
duh
>>
>>109148820
>>109148825
Ideogram can't even do anime so it's missing 50% of all art.
>>
File: Untitled design.mp4 (3.58 MB, 1440x1440)
3.58 MB
3.58 MB MP4
>>109148520
>>109147187
>>
File: Ideogram 4.png (820 KB, 700x1024)
820 KB PNG
>>109148837
>Ideogram can't even do anime
It can though?
https://www.reddit.com/r/StableDiffusion/comments/1u0lc5g/ideogram_4_80s_anime_lora/
>>
>>109148673
I dont see why anyone would do this unless you were underageb&
>>
File: 00030-3098469184.jpg (502 KB, 1728x2880)
502 KB JPG
>>
>>109148847
pfft. lol that looks like straight nigga jizz
>>
>>109148854
>immediatly thinks of BBC
straightest Kreakek user
>>
>>109148863
I'm black and gay. Highest percentile IQ too, ytboy.
We made AI models and you stole them.
>>
File: Ideogram__00212_.png (2.09 MB, 1264x1680)
2.09 MB PNG
>>109148692
Yea
>>
Terrible gen but I wasn't expecting this out of krea without a NSFW LoRA.

https://files.catbox.moe/wni9y2.jpeg
>>
>>109148883
Uh oh, KreaBlackGaySissies, what's our next cope?
>>
>>109148883
>he actually responded instead of coping and seething
Nice. Thank you although I have my doubts it's ideogram due to the nature of the shills for it but I'll take it.

>>109148887
Rape.
>>
>>109148884
Looks pretty nice actually. Tits and nipples look real which is very surprising
>>
>>109148884
Yeah, it has trained on a lot of NSFW, so it knows the concepts, as long as you bypass the censorship with a bypass-lora or node setup.

Best results are still with NSFW loras, but it's so much easier training them when the base model already knows NSFW concepts.
>>
some anon have extremely poor or average taste and it upsets me
>>
>>109148904
now THAT's a proper melty
>>
>>109148896
>coping and seething
Only the Kreakeks lose their shit, sign of low IQ btw >>109148904
>>
>>109148932
now THAT's a proper melty
>>
File: HunyuanVideo_00043.mp4 (497 KB, 640x480)
497 KB
497 KB MP4
seethe
>>
>>109148932
>>109148917
>>109148938
its same fagging its a robot llm for sure.
>>
>>109148938
What do you mean?
>>
File: pixel-0000-745445542.png (538 KB, 2560x2048)
538 KB PNG
>>
File: 1781789106412427.jpg (549 KB, 1536x1024)
549 KB JPG
gpt-image-2 lora for anima is really good. It feels like turbo lora (better consistency, more details) which doesn't break artist styles. You don't need to add @gpt-image tag, as it does its job even without the tag.
>>
File: 1760442889243966.jpg (227 KB, 1651x1052)
227 KB JPG
Also, I finally made a detailer that works with anything that SAM3 can detect. No more dozens of yolo models. No more impact pack.
>>
File: lmao.png (205 KB, 480x360)
205 KB PNG
>>109148870
>I'm black and gay.
>>109148904
>I appeal to the kys theory with people like you
are you actually LowTherGod?
>>
File: peach.mp4 (827 KB, 1024x1024)
827 KB
827 KB MP4
>>
File: Ideogram__00216_.png (1.91 MB, 1264x1680)
1.91 MB PNG
>>109148692
>>109148883
Better one after bounding box autism
>>
>>109148896
>I have my doubts it's ideogram
>>
File: 185739CUI_00001_.png (1.69 MB, 1152x1536)
1.69 MB PNG
>>109148740
@diathorn
>>
File: 1770389099602110.png (373 KB, 500x559)
373 KB PNG
>>109148998
nahh, this shit's way too autistic, come on lol
>>
>>109148965
It doesn't look like anything
>>
why does lilbro keep posting stale memes
>>
>>109149015
Akasaka Aka is just underrepresented in training data. The crux is that if you pick an artist whose style is well known by Anima, it'll be preserved.
>>
File: Krea2_turbo_00496_.png (1.71 MB, 1672x944)
1.71 MB PNG
>>109148793
idk for me it's always been faster than anima. These images complete in 10 seconds.
It's a pretty fun model, it's true strength being control. Not perfect but the best I've experienced. Haven't tried ideogram yet.
>>
File: 551353183134252.png (2.18 MB, 1728x1024)
2.18 MB PNG
>>
>>109149114
nice until you see the out of place adetailed face
>>
>>109148975
post catbox workflow pls
>>
File: 1123762693168305.png (2.36 MB, 1728x1024)
2.36 MB PNG
>>109149126
It's not actually. WLOP art style just kinda looks like that.
>>
File: Krea2_turbo_00500_.png (1.96 MB, 1672x944)
1.96 MB PNG
>>
can you use krea 2 for face details?
>>
File: Ideogram__00205_.jpg (827 KB, 2048x1536)
827 KB JPG
>>109149009
It's the thinking man's model
>>
>>109149158
Nice, which style prompt did you use for this?
>>
File: 352316074213457.png (1.91 MB, 1152x1600)
1.91 MB PNG
>>
File: Ideogram__00201_.jpg (910 KB, 2048x1536)
910 KB JPG
>>109149163
Err... It's a long story
>art_style: An authentic 1980s amateur analog photograph
>aesthetics: visible film grain, muted yet warm colors, subtle color shifts, gentle fading from age, imperfect framing. Natural candid composition. Printed photo aesthetic with tiny dust specks, light scratches, rounded print corners. The image has the unmistakable texture and imperfections of a real developed film print from the 1980s,
>medium: Analog photography
>>
File: 1775495953563408.png (40 KB, 185x127)
40 KB PNG
>>109148965
>more details
>>
File: 405253322595526.png (2.13 MB, 1152x1600)
2.13 MB PNG
>>
>>109149158
>>109149180
it looks good, probably even better than ZiT for realism
>>
File: animatunetest1_00400_.jpg (389 KB, 1600x1200)
389 KB JPG
>>
>>109149168
looks like chroma
>>
File: 1764869177658710.jpg (554 KB, 1536x1024)
554 KB JPG
>>109149131
https://litter.catbox.moe/m47pse.png

>>109149188
It's not perfect, but without the lora, this prompt rturns into pic related.
>>
File: 871537876642043.png (2 MB, 1024x1728)
2 MB PNG
>>109149204
That's a hot obaasan.
>>
File: ComfyUI_Krea2__00154_.png (2.42 MB, 1688x1688)
2.42 MB PNG
>>109149152
>>
File: 552612053701329.png (2.22 MB, 1600x1152)
2.22 MB PNG
>>109149226
It's Krea 2.
>>
File: animatunetest1_00405_.jpg (388 KB, 1600x1200)
388 KB JPG
>>109149234
>obaasan
>>
File: ComfyUI_temp_gtvtv_00002_.jpg (341 KB, 1104x1920)
341 KB JPG
>>
>>109149234
catoboxo?
>>
>>109148815
About this lewd

https://files.catbox.moe/d9kqhr.png
>>
File: 00034-231479589.png (1.84 MB, 1344x1728)
1.84 MB PNG
>>
>>109149283
wow
>>
File: 614433788868478.png (1.97 MB, 1728x1024)
1.97 MB PNG
>>109149250
Damn, she was hotter in the first one, looked a bit like Hikaru Utada.
>>109149257
https://files.catbox.moe/eniy0x.png
>>
>>109149283
That's lewd
>>
CAN THE WLOP SHIZO FUCK OFF, THANK YOU!
>>
>>109149311
now that's what I call a MELTY!
>>
>>109149311
qui?
>>
File: ComfyUI_Krea_2_00259_.jpg (2.49 MB, 2048x1800)
2.49 MB JPG
>>109148847
Needing a LoRA for what Krea 2 excels at (just to get a mediocre version, because it's just a LoRA) is pretty bad. It can do almost every anime style with its eyes closed.
>>
File: animatunetest1_00417_.jpg (369 KB, 1600x1200)
369 KB JPG
>>109149296
I'm trying to get ridd of the babyface problem I had with this dataset. Any woman under 40yo, non mature, would be pure jailbait. Also the base asian faces are terrible
>>
File: output.jpg (695 KB, 2752x912)
695 KB JPG
Krea 2 can learn styles, after abject failures to train artist loras for Flux 2 Klein and Z-Image, with 1250 steps, you can see the effect of the Lora (John William Waterhouse style).
>>
>>109149316
damn, she's horny
>>
>>109147317
>actual ass when lying prone
finally we have a model that understands anatomy
>>
File: New.jpg (938 KB, 1696x1696)
938 KB JPG
>>109149180
converted to jpg because size
>>
>>109149314
>It can do almost every anime style with its eyes closed.
nothing from danbooru tho
>>
>>109149316
>John William Waterhouse style
Krea 2 can do him out of the box though
>>
btw, is the south park style available on these new models?
>>
>>109149324
>>109149180
i wanted to say have you tried Flux?
>>109149322
prompt was detailed. But Kera2 did produce a good result compared to others.
>>
>>109149316
Based pre raphaelite enjoyer but it should look more like a painting than an illustration if you catch my drift
>>
>>109149314
the thing is that Ideogram can do anything it wants, can't say the same with Krea, it'll always be nerfed by its subpar VAE
>>
Is Krea 2 the closest "Dalle 3 at home" experience we got to date? Considering what it can do out of the box (knows lots of characters well, artstyles etc)
>>
File: 416848433870771.png (2.43 MB, 1728x1024)
2.43 MB PNG
>>109149315
>this dataset
>Any woman under 40yo, non mature, would be pure jailbait

Hmm, that doesn't speak very well for your dataset lol.
>>
>>109149328
>>
>>109149346
wtf, based
>>
File: 438338797562472.png (2.06 MB, 1728x1024)
2.06 MB PNG
>>109149316
I also had good results with style training. Training on RAW, inference on Turbo and it reproduces the style pretty well.
>>
File: ComfyUI_Krea_2_00260_.jpg (2.72 MB, 2048x1800)
2.72 MB JPG
>>109149325
Perhaps. Btw has anyone tried an Anima to Krea 2-step refinement workflow? For Chroma, it already feels almost as if we got an Edit model, it can borrow styles pretty well with just img2img.
>>
File: ComfyUI_Krea2__00160_.png (2.86 MB, 1920x1080)
2.86 MB PNG
>>
>>109149391
>>
File: output.jpg (1.02 MB, 1824x1376)
1.02 MB JPG
>>109149327
Yes, it clearly recognises the name, but the effect is not very strong or accurate enough. With other artists that I have tested it the style comes in even less than with Waterhouse, with some it comes in stronger. It is in any case a good foundation for the lora to work so well when the concept is not so alien apparently.
This is more subtle trigger word vs. no trigger word at medium strength.
>>
File: 201135CUI_00002_.png (1.88 MB, 1152x1536)
1.88 MB PNG
>>
I think Flux2K9 can still pass. It has power on realism department. But it needs more data on base model.
>>
>>109149415
its too aesthetically tunned
>>
>>109149391
>>109149398
it is. Also it can do a lot better if it stops adding extra hands and legs to the character.
>>109149442
>>
>>109149415
I have good news for you anon
https://huggingface.co/fancyfeast/bigasp-3
>>
>>109149459
we iz eetin guud
>>
>>109149459
I saw it in previous thread. Someone said it's not properly cooked yet.
>>
>>109149473
>In-development models for bigASP 3
>bigASP 3 is currently undergoing training.
I wonder how they got that idea
>>
>>109149459
i'll try it because why not
>>
>>109149345
>Hmm, that doesn't speak very well for your dataset lol.
I used only pretty much the perfect looking young women with symmetrical faces. Adding milfs solved the situation. Thank you milfs
>>
>>109149315
You are a helpful assistant specialized in image-to-video generation. Using the reference image. create a video with the woman wearing the short black skirt doing a gun fight on a busy street with police using an AR15 assault style rifle. The woman stands crouching down behind the door of a roofless car which is open on the passenger side of the car, she is firing her rifle at police officers out in front of her at distance over the top of the car door. The woman hold the rifle like a professional, the rifle butt is pressed firmed into her shoulder as she looks down its sight, she firing controlled 3 round bursts, the vibration of the rifle firing causes her upper body to vibrate with natural movement and physics and the rifles muzzle flashes realistically with empty brass bullet casing ejecting out the side of the rifle. The woman's face is of serious focus and determined aggression. The police officers are rapidly running for cover behind police cars, the police car lights are flashing intensely blue illuminating the surrounding area. realistic style.

Lets see how it goes, i got some GTA lora that might work good for the high noise at least.
>>
boys... I deleted my anima folder. waste of space.
>>
File: Ideogram__00221_.jpg (993 KB, 2048x1808)
993 KB JPG
>>109149314
I say this without a hint of irony. SKILL ISSUE.
No lora required
>>
ideojeets on suicide watch
>>
All the things that bernini claims to be able to do and their system prompts:

You are a helpful assistant.
You are a helpful assistant specialized in text-to-image generation.
You are a helpful assistant specialized in text-to-video generation.
You are a helpful assistant specialized in image editing.
You are a helpful assistant specialized in subject-to-image generation.
You are a helpful assistant specialized in image-to-video generation.
You are a helpful assistant specialized in video editing.
You are a helpful assistant specialized in video editing on content propagation.
You are a helpful assistant specialized in video editing with reference.
You are a helpful assistant specialized in ads insertion.
You are a helpful assistant for editing. You may need to adjust the subject's action or position.
You are a helpful assistant for editing. You might need to adjust the video's style, lighting, colors, textures, and the subject's pose or action.

consider that it can edit images it could then be used to make last frame best quality in a wan workflow all-in-one using only for all the actions. I was a bit shocked not many spoke of it, well i'm having some fun with it.
>>
File: debo_k_00140_.png (1.94 MB, 1792x977)
1.94 MB PNG
>>109149152
was this the end of the reddit dystopiaverse story?
>>
>>109149519
still keeping mine until there's a danbooru krea2 finetune. dont have vram to make loras
>>
File: animatunetest1_00253_.jpg (695 KB, 1152x1568)
695 KB JPG
>>109149559
That's pretty impressive gen, especially if it's without lora
>>
>>109149574
>danbooru tags

I literally just prompt:
>fat black nigger woman getting pounded in her vagina by a white man with a large penis
and it works.
>>
>>109149459
sampler and scheduler?
>>
test
>>
File: 203452CUI_00001_.png (1.45 MB, 1152x1536)
1.45 MB PNG
>>
>>109149584
doesn't know more niche characters or artists though. like gbf ones
>>
>>109149599
sounds like you're an NTR cuck and shouldn't be genning anyways.
>>
File: 453665373646657.png (2.41 MB, 1152x1472)
2.41 MB PNG
>>109149501
I see, I see.
>>
File: Ideogram__00184_.png (3.79 MB, 2048x1536)
3.79 MB PNG
Best realism, competitive anime/illustrations, unmatched control. The Krea distraction was fun but it's time to come home, genning man
>>109149583
No lora. https://files.catbox.moe/d0nmdr.png
>>
>>109148695
>>See & Sniff: Learning Visuo-Olfactory Representations
>https://mm.kaist.ac.kr/projects/SeeandSniff
Sniff Diffusion when
>>
>>109149589
>>109149412
Hello catjack how are you catjack!
>>
been a while.. since when can AI do hands?
>>
i wish i made a mediocre idiot proof trainer so when my GPU dies comfyorg would buy me a new one
>>
>>109149636
the only people who actually want this are stinky fat people and jeets
>>
>>109148988
oh momma
>>
>>109149473
It's great.

But it's a base model, so 30-40 steps, CFG 4-6, decent negative prompts.
>>
File: Krea2_turbo_00535_.png (914 KB, 1672x944)
914 KB PNG
>>109149157
Not sure. I'm a Forge-Neo native and mainly use Comfy for video stuff. I can't inpaint in comfy so I hope we get Krea-2 support in Forge-Neo soon
>>109149236
nice. quality is pretty good. did you do a second pass or did you render at that resolution?
>>109149572
no, it might take several days to execute. really struggled with that last gen. seems like it doesn't want to do nudity even though I tell it that the nudity is censored by floating bubbles.
>>
File: may I see it?.png (262 KB, 640x480)
262 KB PNG
>>109149677
>It's great.
you know the deal
>>
>>109149625
That's very impressive. Looks like they actually gathered dataset from some blu-ray remasters

>>109149663
That software is like virtual cock and ball torture
>>
the most important lora just dropped, boys
https://civitai.red/models/1899877/trans-femboy-or-klein-9b-zit-qwen?modelVersionId=3076122
>>
>>109149677
got it
>>
>>109149625
If ideogram didn't decide to go for the bbox autism, it would have been the current king and Krea 2's release would have been a flop like the Boogus and the Ernies, I'm not joking
>>
>>109149374
I just used the defaults in ai toolkit, 3000 steps (it's almost finished). What did you use for parameters?
>>
>>109149236
one pass on default. it added glow to the text by itself tho.
>>
File: 00037-3127376569.png (3.39 MB, 1344x1728)
3.39 MB PNG
>>
File: debo_k_00142_.png (1.28 MB, 1792x977)
1.28 MB PNG
>>109149684
>it might take several days to execute
keep it pumpin, anon. I'm diggin the project
>>
File: 154618058667166.png (2.28 MB, 1344x1344)
2.28 MB PNG
>>109149752
Same.
>>
Since there are so many kreatards here, did any one of you train on krea 2 turbo with ostris's adapter instead of on raw?
>>
File: ComfyUI_Anima_Krea_00002_.png (2.79 MB, 1152x1152)
2.79 MB PNG
>>109149389
Anima to Krea style transfer workflow test. Lower denoise values may preserve more of the style. Only issue I've noticed so far with this is that it mutes the colors a bit, but adding a punch filter on Windows photo settings help mitigate that

Original
https://files.catbox.moe/dxcdtg.png

Output
https://files.catbox.moe/3tni2d.png

After punch filter
https://files.catbox.moe/g0mbis.png

Original
https://files.catbox.moe/ygb6p4.png

Output (wan 2.1 vae, better lines)
https://files.catbox.moe/vpnq87.png

After punch filter
https://files.catbox.moe/tyniui.png
>>
>>109149830
make a grid we ain't clicking all these links
>>
>>109149846
>t. phone user
>>
How many people want to see the investors for comfyui and the comfyui org get shit on for making commercialization the standard over open source?
>>
>>109149830
Note it should be possible to replicate that filter with any post processing node.
>>
File: 1760362750972543.jpg (462 KB, 1313x1157)
462 KB JPG
>>109149830
Bros, I'm so tired of Krea noise (best visible in uniform areas).
>>
>>109149861
If someone forks it or makes a better version of it I'll use that but otherwise I simply don't care. We can vibecode our own interface/backend if we really needed to.
>>
>>109149880
>forking cumfart
nobody wants to do that and it will never happen. cumfart is destined to die because the devs have no idea how to do ui/ux
>>
its up
https://www.youtube.com/watch?v=v6KRngGo10U
>>
>>109149896
Retard
>>
>>109149905
nobody cares shill faggot
>>
>>109149906
Care to explain? Python is a literal cancer on tech stacks. Python apps don't stick around for a reason
>>
>>109149685
I can't post why exactly it is great on a blue board. It's the first giant NSFW finetune for a modern model.
>>
>>109149908
melty
>>
File: 00038-2083173392.png (1.7 MB, 1344x1728)
1.7 MB PNG
>>
>>109149929
Oh your the baker troll. Go suck a cock faggot
>>
>>109149918
you can put a catbox, it's allowed
>>
https://www.reddit.com/r/StableDiffusion/comments/1uh0yk1/wan22_oom_after_comfy_update_on_workflow_that_ran/

comfysisters... even predditors are laffin at our dynamic vram...
>>
>>109149914
>Python apps don't stick around for a reason
Yeah it's the language that's the problem. Maybe if everything were written in Rust everything would be better!
>>
>>109149959
>comfysisters
Does this actually exist? I haven't met anybody that didn't have a problem with comfy
>>
>>109149625
Ideogram is so big and slow tho, I'm GPU poor
>>
>>109149962
Sorry, did you name any long lasting python "application" in your post? Something that lasted over a decade at least?
>>
>>109149978
Netflix. Spotify. Reddit. Calibre.
>>
File: bigasp_0.jpg (853 KB, 1248x1824)
853 KB JPG
>>109149951
I'll post some sfw stuff, just need a bit to come up with prompts
>>
so whats the point of people converting these models to gguf?
>>
>ai toolkit implemented the partial offloading optimization long ago, ramtorch, from the creator of chroma
>it works for most models, but not chroma
lol
>>
>>109149861
>>109149896
desu if i didnt have to reimplement a bunch of stuff i wouldve already switched to sd.cpp
>>
File: ComfyUI_Anima_Krea_00003_.png (2.07 MB, 1152x1152)
2.07 MB PNG
>>109149846
https://files.catbox.moe/uzwqtm.png
Filtering is a bit of cheating and overcompesating for the img2img not having as dynamic range for some reason, the strategy isn't perfect but it somewhat works kek
https://files.catbox.moe/xyz8y9.png

I guess if you really want to maximize likeness, you could go right back to Anima for 1 or 2 steps at low denoise, or perhaps optimizing the Krea prompt to be as close as possible to the intended output.
>>
File: debo_k_00144_.png (2.64 MB, 1792x977)
2.64 MB PNG
>>109149932
I watched a youtube video yesterday about how bald eagles will sometimes kidnap chicks from other birds, forget it was supposed to be a snack, then raise it as its own.
>>
>>109149874
That is a qwen 1 vae image, Wan 2.1 fixes those lines, though subtle.
>>
>>109150046
who said you would be the one doing it? Sending devs that direction seems to be the way forward because cumfart is just stagnant corpo garbage
>>
>ai toolkit has a bug that forces text encoder to run on cpu if training chroma
looooooool
>>
>>109150097
Is there even any point in running the text encoder? I always just unload it.
>>
>>109149804
the bat can't flap it's wings if he is hold them. he should be holding the fur on it's back. gen it again
>>
>>109150111
No, he's gliding.
>>
>>109150109
making and caching the initial text embeddings
>>
Impressive how much stuff this thing almost knows. It makes huge variety of things with default ai-slop aesthetic.
>>
>>109150120
Well yeah, but that's fast even on CPU isn't it?
>>
File: chu.jpg (325 KB, 1184x888)
325 KB JPG
>>
>>109150133
im emailing this to nintendo
>>
File: debo_k_00145_.png (2.68 MB, 1792x977)
2.68 MB PNG
>>
>>109150130
not if you have hundreds of images
>>
File: chu2.jpg (352 KB, 1184x888)
352 KB JPG
>>109150138
Tell them to send their cutest lawyer.
>>
>>
actual quote from a Redditor on Krea 2:
>I already created dozens of 8k pictures (3840x2160) but ok… it can‘t do it. Your argument sounds valid.

why does bro think that's "8K" lmao
>>
>>109150170
eeewwwwwww
>>
File: hmm.png (2.48 MB, 1838x917)
2.48 MB PNG
>>109150129
sus
>>
>>109150186
you found out that the ai generated image was in fact
ai generated?
>>
>>109150195
whys there's so much Flux.2 in there is the question moreso. And WAN
>>
i think krea just fulfils everything i want from a local model and i'd be happy with nothing new releasing ever again
>>
>>109149677
yeah AspGuy doesn't train at retardedly low resolutions like Kekstone so I feel like it's almost impossible for him not to come out with something fairly useable if Klein 9B is the underlying model
>>
>>109150211
>whys there's so much Flux.2 in there is the question moreso. And WAN
Using similar ""realistic"" pony generated dataset
>>
>>109149398
this is a good example of how hard the Flux.2 VAE mogs
>>
>>109150221
I got better results from the original Flux Krea for every prompt I tried
>>
>>109150221
I want a simple finetune to get rid of the baked refusals
>>
can you technically use another vae with krea 2 or are you stuck with whatever they used?
>>
File: 1772865016693297.jpg (834 KB, 1944x1456)
834 KB JPG
7 minutes blah
>>
>>
>>109150233
why the fuck do all you people need to gen abnormal proportioned women? You gens are fucking ugly and trashy.
>>
>>109150263
Someone posted this earlier:
https://x.com/PhotogenicWeekE/status/2070641554784768187
>>
>>109150277
You're right, alas some people really do have terrible taste
>>
>>109150272
not worth it, like the style looks authentic but no human would ever draw the dude's bare foot like that or draw the sword-holding hand like that
>>
>>109150286
yeah this thread is like always either the same generic pale skinny Asian Waifu or some Voluptuous Giant Booba Lady of various ethnicities lol
>>
>>109149559
Much better than the SDXL tier gens that other anon was posting. I already see a few NSFW LoRAs for Krea 2 on civitai that make it really good, I see the equivalent of Ideogram 4. How come? Whether a model trains well should also factor into the decision of which model to choose.
>>
File: debo_k_00146_.png (2.32 MB, 1792x977)
2.32 MB PNG
>>
>>109148538
>>109148592
You can just use the ~300 byte lora.
>>
>>
Joschek if you post here. Fuck you, your loras are dog shit; Hang up the towel retard.
>>
>>109150278
black magic
>>
>>109149559
tried a straight text-to-image remake on Klein 9B with a Gemini caption of this
>>
>>109150278
open the comparison at full size lol, the right-side one might be more "detailed" but it also has crazy edge aliasing that isn't present in the original left-side output
>>
>>109150290
i actually dont like the style
>>
File: file.png (185 KB, 797x676)
185 KB PNG
>>109150278
my main issue is the grid pattern on the left, if that can even be solved
>>
somebody fix the grid of pixels at 100x zoom level!
>>
File: ComfyUI_temp_jepfd_00005_.png (2.87 MB, 1728x1296)
2.87 MB PNG
>>
do you guys make your own workflows? subgraphs where i cant even edit width/height are fucking maddening.
who are the fucking normies who upload enshittified comfyui workflows with subgraphs
>>
>>109150393
>cumpster
>>
>>109150398
skill issue
>>
File: ComfyUI_Krea_2_00362_.png (2.88 MB, 1672x1256)
2.88 MB PNG
>>109150381
>>109150368
I don't see it, maybe I'm blind
>>
>>109150393
\> cum dumpter
\> cumpster
>>
>>
File: aliasing.png (1.03 MB, 927x649)
1.03 MB PNG
>>109150404
this is full size (not bigger, exactly 100%) on the right side. Look at her glasses. If you scroll back and forth you'll see the original isn't like that
>>
File: debo_k_00148_.png (2.31 MB, 1792x977)
2.31 MB PNG
>>
File: ComfyUI_temp_jepfd_00007_.png (2.75 MB, 1728x1296)
2.75 MB PNG
>>
>>
>>109150415
you're complaining about this but it butchered her earring
>>
File: zitsloppa.png (772 KB, 865x451)
772 KB PNG
>>109150422
I beg you to get even one output that actually says "CUM DUMPSTER" spelled correctly. Also lol @ ZitSloppa background
>>
>>109150355
Not Klein's strong suite
>>
localplastic thread
>>
>>109150439
I mean yeah that looks wack too. Either way if you're gonna use a different VAE on Krea 2 I'd use just the regular Wan 2.1 VAE if anything, not this shitty upscaler VAE
>>
>>109150422
can you do one where her opposite life is her getting her law degree
>>
>>109150445
I mean I'd try the original prompt too if I knew what it was DESU.
>>
>>109150393
>>109150422
prompto pls?
>>
>>109150305
Hmm?
>>
>>109150305
if this person is actually THE Debo they don't seem to meet the hype at all frankly, they seem to just post images without saying anything
>>
>>109150452
Catbox is here. >>109149625 It's ID4 bounding box autism though
>>
File: Krea2_00181_.png (2.64 MB, 1400x1872)
2.64 MB PNG
i love plastic
>>
>>109150302
>I see the equivalent of Ideogram 4
I don't see*, or at least nearly as many models.
>>
File: 2026-06-28_krea2.jpg (1.87 MB, 2376x4504)
1.87 MB JPG
>>109150471
I love plastic and square pupils.
>>
File: debo_k_00150_.png (2.35 MB, 1792x977)
2.35 MB PNG
>>109150457
don't believe the rumors. I'm pretty well behaved
>>
>>109150482
Civitai delayed adding an ideogram 4 category to their site for several weeks, killed it's momentum I guess. A lot of people had moved onto Krea by the time they added it
>>
>>
>>109150393
I want a way to get in taboo words on the model, this is ridiculous
>>
File: ComfyUI_temp_hgcje_00015_.png (2.45 MB, 1920x1200)
2.45 MB PNG
>>109150441
this anon likes to overanalyze images from a /g/ thread, holy autistic behavior
>>
File: ComfyUI_temp_hgcje_00016_.png (2.3 MB, 1920x1200)
2.3 MB PNG
>>
Krea2 censored violence
>>
>>109150539
I got it to show a bunch of dead niggers bleeding out on the floor behind asuka holding a glock
what exactly is censored? I just think it's not trained on a lot of violence. big difference from censorship.
>>
>>109150539
use this to remove the filter
https://github.com/capitan01R/ComfyUI-Krea2T-Enhancer
>>
File: ComfyUI_temp_hgcje_00017_.png (2.29 MB, 1920x1200)
2.29 MB PNG
>>
>>109150562
>>109150562
>>109150562
>>109150562
>>
>>109150457
he got domesticated
>>
File: kissed.jpg (61 KB, 929x1342)
61 KB JPG
>>109148520
Hello /ldg/ I am a long time comfyui user. I moved to invokeai for a bit and enjoyed its gallery system and canvas a fair bit.
I can use proper editors just fine as a replacement for the canvas but as for organizing slop. Has anyone made a node for uploading images to a local booru like shimmie with the positive prompt as tags?
Ideally I'd want something quantitative tags rather than qualitative but something quick and dirty will do if it's already available.
I figure some of you might have run into this exact issue
pic unrelated
>>
>>109150133
>>109150168
>AGPeach



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.