[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


Janitor acceptance emails will be sent out over the coming weeks. Make sure to check your spam folder!


[Advertise on 4chan]


Discussion and Development of Local Image, Video, and Music Models

Previous: >>109133256

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Krea 2
https://huggingface.co/krea/Krea-2-Raw
https://huggingface.co/krea/Krea-2-Turbo

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
first for anima coughed and there was a death rattle
>>
krea 2 danbooru status?
>>
>>109136560
A rather small collage, no?
>>
>mfw Resource news

06/25/2026

>Bernini-R — GGUF (high & low noise experts)
https://huggingface.co/neuregex/Bernini-R-GGUF

>Physics Question Scene Graph: Fine-grained Evaluation of Physical Plausibility in Text-to-Video Generation
https://github.com/atinpothiraj/pqsg

>VPA-Guard: Defending and Benchmarking Image-to-Video Generation Against Visual Prompt Attacks
https://huggingface.co/datasets/CSU-JPG/VVA-Bench

>Minimalist Preprocessing Approach for Image Synthesis Detection
https://github.com/vohoaidanh/adof

06/24/2026

>Krea-2-Turbo Training Adapter
https://huggingface.co/ostris/krea2_turbo_training_adapter

>Vera: A Layered Diffusion Model for Content-Preserving Video Editing
https://vera-layered-diffusion.github.io

>Advancing WordArt-Oriented Scene Text Recognition: Datasets and Methods
https://github.com/YesianRohn/WATER

>DramaDirector: Geometry-Guided Short Drama Generation
https://github.com/iLearn-Lab/DramaDirector

>PG-MAP: Joint MAP Optimization for Inference-Time Alignment of Diffusion and Flow-Matching Models
https://github.com/sophialanlan/PG-MAP

>Safe Few-Step Generation via Velocity Editing
https://uzn36.github.io/VESFlow

>Co-occurring associated retained concepts in Diffusion Unlearning
https://github.com/damilab/CARE

>MeshFlow: Mesh Generation with Equivariant Flow Matching
https://qiisun.github.io/MeshFlow

>Arbor: Explicit Geometric Conditioning for Controllable 3D Asset Generation
https://arbor.jdihlmann.com

>VideoAgent: All-in-One Framework for Video Understanding and Editing
https://github.com/HKUDS/VideoAgent

>Krea 2 GGUF
https://huggingface.co/molbal/krea2-gguf

>Anima updates license to version 1.2. Clarifies commercial use restrictions
https://huggingface.co/circlestone-labs/Anima/commit/8cca6bb7b35f7f6abb2e21616ae44de083dbb8fa

>ComfyUI-Krea2T-Enhancer
https://github.com/capitan01R/ComfyUI-Krea2T-Enhancer

>PiD: Plug-and-play diffusion decoder that replaces VAE/RAE decoders
https://github.com/nv-tlabs/PiD
>>
>>109136575
The price for collage real estate has gone through the roof. Collage maker can charge what they want and still proft.
>>
>no krea2 style transfer
DOA
>>
>mfw Research news

06/25/2026

>DomainShuttle: Freeform Open Domain Subject-driven Text-to-video Generation
https://arxiv.org/abs/2606.26058

>Causal-rCM: A Unified Teacher-Forcing and Self-Forcing Open Recipe for Autoregressive Diffusion Distillation in Streaming Video Generation and Interactive World Models
https://arxiv.org/abs/2606.25473

>Chorus II: Cross-Request Sparsity Reuse for Efficient Image-to-Video Generation
https://arxiv.org/abs/2606.25040

>EchoStyle: Unlocking High-Fidelity Video Stylization with Reverse Data Synthesis
https://arxiv.org/abs/2606.25465

>Concept Removal for Frontier Image Generative Models
https://arxiv.org/abs/2606.25548

>Structuring Sparsity: Block-Sparse Featurizers Capture Visual Concept Manifolds
https://arxiv.org/abs/2606.25234

>MIMFlow: Integrating Masked Image Modeling with Normalizing Flows for End-to-End Image Generation
https://arxiv.org/abs/2606.26016

>FreeStory: Training-Free Character Consistency for Free-Form Visual Storytelling
https://arxiv.org/abs/2606.25079

>TryOnCrafter: Unleashing Camera Trajectories for Realistic Video Virtual Try-on via a Renderable 4D Try-on Proxy
https://sunhao242.github.io/TryOnCrafter_web.github.io

>In-context Region-based Drag: Drag Any Region to Any Shape
https://arxiv.org/abs/2606.25907

>Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models
https://wan-streamer.com

>Steering Vision-Language Models with Joint Sparse Autoencoders
https://arxiv.org/abs/2606.25657

>Brevity is the Soul of Inference Efficiency: Inducing Concision in VLMs via Data Curation
https://arxiv.org/abs/2606.25432

>Do vision-language models search like humans? Reasoning tokens as a reaction-time analog in classic visual-search paradigms
https://arxiv.org/abs/2606.25066
>>
File deleted.
>>109136560
turbo
>>
uhhhhh.....
>>
>>109136584
Censor bars missed the mark brutha, janman won't like this one
>>
>>109136584
holy kreap, lois
>>
File: ComfyUI_00012_.png (1.92 MB, 1088x1448)
1.92 MB PNG
my bad didnt realize it was nsfw
>turbo
>>
The best uncensor is the weight shifting thing and not the lora right?
>>
>>109136576
>>109136581
Fuck off debo
>>
File: 1752746842941025.png (1.77 MB, 1024x1024)
1.77 MB PNG
an anime figure of hatsune miku standing on a pedestal, wearing a two piece swimsuit and summer hat on a wooden table.

krea 2 is very good desu
>>
>>109136572
Sorry I'm gunna need a couple mil to do that
>>
Can someone explain the psyche of a person who donates to lodestones for a finetune when it's statistically guaranteed in a week that they will drop said fine tune within a few days and autistically jump to another finetune?
>>
>>109136614
>someone explain the psyche of a person who donates
No, I don't get it.
>>
File: 1772036428765376.png (2.05 MB, 1088x1632)
2.05 MB PNG
>>
>>109136632
art
>>
>>109136632
this is incredible I don't care what model made it that has a special weirdness to it
>>
>>109136632
is he pissing in the orange juice
>>
Could someone share a workflow for Krea2 please? Don't like the comfy built in one the slightest and just want to play with prompts again
>>
>>109136614
except it's really more like 6-10 months and he doggedly tries quite a lot of stuff. obviously not all of which worked out, like it is for anyone who trains never mind tries new model architectures.

and the license is good and the expense so far was extremely low (although you seem like it's stuck in your head somehow and i can't tell why the fuck that is)
>>
>>109136657
>Don't like the comfy built in one the slightest
Explaining why you don't like it is the first step to fixing it.
>>
>>109136672
Sub graphs are gay and not in the fun way
>>
>>109136632
This belongs in a museum.
>>
>>109136657
My biggest gripe is how much it could be paired down. It should just be normal, regular t2i, nothing else.
>>
>>109136632
thanks, its krea raw https://files.catbox.moe/w3u1ii.png
>>
>>109136675
This is a low IQ take and actually outs you as a person incapable of abstract thinking. Being unable to understand a graph within a graph. Embarrassing.
>>
>>109136698
I'm prompting from my phone. Entering and leaving those is an adventure in it self. Prompt enhancing is also the most cringe thing ever
>>
>>109136711
>phone
die die die
>>
>>109136657
>>109136682
https://files.catbox.moe/ih4a3b.png
Minaj Asuka from the Collage. Very simple T2I workflow. Lora is this
https://huggingface.co/Beinsezii/Krea-2-Turbo-Projector-Scale-LoRA-Diffusers
Or just remove the node if you don't want to use it
>>
>>109136735
Thank you that one looks much cleaner.
>>109136717
Excuse me princess for not wanting to sit next to my computer when prompting.
>>
new kernel on arch, did it fix the comfyui regression?
>>
>>109136770
You're the princess. You can't sit at your computer like a man? You need to find a soft couch or bed to prompt in? Fuck outta here, princess.
>>
>>109136560
shit collage desu
>>
>>109136777
>Getting excuse me princessed in the lords year of 2k26 and malding over it
>>
>>109136785
Get on your computer like a man, bitch tits
>>
forgive ani
>>
I sit on my couch in the living room prompting on my computer which is connected to my large television because I have taste and am not poor
>>
File: Krea2_turbo_00020_.png (1.1 MB, 1368x768)
1.1 MB PNG
Hey guys why am I getting a weird pattern at the bottom of my images?
>>
>>109136795
You're fat
>>
File: 1782423321738651.jpg (2 MB, 3515x4096)
2 MB JPG
>>109136560
I need his veredict on Krea... his tests on Z Image and on Anima helped me so much, where is he when he is needed most?
>>
>>109136799
cutie
>>
>>109136799
because you touch yourself at night
>>
>>109136799
on chroma that was because of weird resolutions
havent seen that happen in krea tho
>>
>>109136802
Can i request some muscle mommies?
>>
>>109136802
I'm too stupid to even notice the details
>>
>>109136802
Kek, I remember that day, it was funny when he later realized he had the adetailer checkbox unchecked and it had been off the whole time.
>>
>>109136832
learn to samefag unironically
>>
File: 166825378985560.png (3.38 MB, 1088x1379)
3.38 MB PNG
Okay, Krea2 turbo seems pretty good, and the prompt enhancer seems to work pretty well too.
>>
File: 1729191436559.jpg (1.22 MB, 3440x3440)
1.22 MB JPG
>>109136575
>>109136778
I miss the anon who'd did these https://desuarchive.org/g/thread/102862167/#q102862524
>>
>>109136872
pov me throwing away all my anima loras
>>
File: 1765599924458865.png (1.03 MB, 1336x1336)
1.03 MB PNG
>>
File: 505406872123577.png (1.9 MB, 1088x1600)
1.9 MB PNG
>>
File: 915714227080436.png (2.16 MB, 1152x1472)
2.16 MB PNG
>>
>>109136799
catbox it so we can try to replicate/fix?
>>
>dozens of amazing gens in last thread
>abomination op collage
I think I know who is behind this...
>>
File: ComfyUI_Krea2__00003_.png (2.07 MB, 944x1672)
2.07 MB PNG
>>
File: 114672861419445.png (1.75 MB, 1152x1472)
1.75 MB PNG
>>
>>109136909
>>109136917
Neat.
>>
>>109136955
cool stuff, that's Polanski's Lost in Mars film
might try to do some film gens too
>>
>>109136955
>>109136956
Also neat.
>>
File: ComfyUI_U_00019__cw.png (2.4 MB, 1920x1272)
2.4 MB PNG
When's the full version of Krea 2 coming?
>>
File: 871.png (2.71 MB, 1344x1280)
2.71 MB PNG
>>
>>109136985
It's already available through ComfyCloud API
>>
when pony v8
>>
File: 745024401773931.png (1.63 MB, 1728x1024)
1.63 MB PNG
>>
>>109136991
the subtle non-square ar of this really pulls it all together
>>
File: Widow_Test.jpg (2.77 MB, 3843x4688)
2.77 MB JPG
>>109135571
>>109135599
Wow it actually has some insane quality at 4k. I wonder why they lied
>>
File: 1761680203147973.png (1.72 MB, 1784x1000)
1.72 MB PNG
>>
File: animatunetest1_00010_.jpg (382 KB, 1512x1152)
382 KB JPG
>>
File: Krea2-_00065_.png (2.36 MB, 1824x1088)
2.36 MB PNG
>>
>>109137055
>>109137061
Nice
>>
>>109136560
Civitai has been worthless and HuggingFace is comfy as fuck to browse. Replace civ with hf naow.
>>
>>109136775
i confirm that it does but now the VAE decode step takes forever, for fuck's sake man
>>
>>109137143
i'm on artix, what regression was there? I'm on Linux 7.0.12
>>
>>109137152
models just didn't load. i don't know if it happened on other UIs or just comfy. i think if i ever build a server for this i'm gonna use nix and fucking pin every package.
>>
>>109137159
weird, I have not experienced this issue using anima or krea2 in comfy at all
>>
File: debo_k_00003_.png (3.01 MB, 1792x977)
3.01 MB PNG
>>
File: Widow_Test2.jpg (1.32 MB, 2560x2880)
1.32 MB JPG
>>
>>109137180
I'm assooming the aspect ratio of your gens is the same as your monitor.
>>
File: debo_k_00004_.png (2.89 MB, 1792x977)
2.89 MB PNG
>>109137206
no, its just the happenstance of some math
>>
File: 1020326851606064.png (1.25 MB, 832x1216)
1.25 MB PNG
>>
>>109137198
ideogram is great but everything it makes feels like it came out of a gritty christopher nolan movie
krea is such a good all rounder



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.