/g/ - Technology

Catbox Host Seething Edition

Discussion and Development of Local Image, Video, and Music Models

Previous: >>109015348

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
gm saars
>>
>inb4 n*gbo
>>
>>109020001
https://rentry.org/LDG_vital_info
>>
File: Ideogram_00210_.png (3.57 MB, 1680x944)
I made a fun challenge for you /ldg/ idiogram is amazing
>>
where is ideogram anima???
>>
i would be using ideogram right now if it didnt have a piss filter...
>>
>>109020027
ideogram looks awful to me.
>>
maintain thread quality

https://rentry.org/LDG_vital_info
https://rentry.org/LDG_vital_info
https://rentry.org/LDG_vital_info
>>
>>109020033
based openai poisoning their own models with a piss filter in order to sabotage all of local
>>
File: ComfyUI_00037_.png (1.54 MB, 1024x1024)
>>
>>109020027
But you say this exact same thing about Ernie, Kontext, Klein, Z Image, Qwen, WAN, and every single one of the 40 Chroma versions and Anima these last 6 months. Hard to take you seriously at this point anon :(
>>
File: .png (646 KB, 2449x1978)
>>109020033
I would try if I had enough vram
>>
File: 47145346543076.png (2.23 MB, 1088x1600)
Can you recognize which character this is supposed to be?
>>
>>109020033
No model at all does what I want. I want such an extensive amount of face descriptors that you can create at least a very near likeness, and with enough spamming of the gen button, a likeness.

That's not the case right now. No model has such a capacity.

The funniest thing right now is how models can't even produce a complete range of chins. Noses are also just impossible, you can't get an appropriately large nose, not too large, but not model small either. You can't control these things.
>>
>9.3B
yeah *burp* she could lose some weight mhm
>>
found him right in the middle
>>
I met my wife on ldg
>>
>>109020049
you do have enough ram, anon showed us in the previous thread >>109019809
>>
Almost did my first Ideogram gen until I realized I'd have to update Comfy.
>>
>>109020071
Idiogram on the first couple of gens is the most disappointing thing I can imagine until you learn how it works
>>
>>109020063
He either re-named his safetensors, changed them after he started or actually magically found a way to fit 17GB in 16GB.
It's funny considering I can easily run WAN2.2 and LTX here but not ideogram4
>>
File: 5647587.png (2.66 MB, 1920x1088)
how do (You) solve the severe kino drought problem?
>>
Didn't know my wife was inside my GPU
>>
>>109020001
Other than the Touhou one every single one of these images is an indefensible inclusion. Yes I was snubbed of course. None of these had any right to take a place that could have belonged to my gens (very high quality).
>>
>>109020077
>the first couple of gens is the most disappointing thing I can imagine until you learn how it works
this describes literally every single model
>>
>>109020077
Where's the good gen? I'm waiting.
>>
>mfw Resource news

06/09/2026

>SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning
https://teal024.github.io/SCAIL-2

>BLM-SGAN: Bidirectional Language Modeling for Semantic-Spatial Text-to-Image Generation
https://github.com/haidy-maher/BLM-SGAN-Text-to-Image-Generation

>SwiftVR: Real-Time One-Step Generative Video Restoration
https://h-oliday.github.io/SwiftVR

>Property-Informed Diffusion-Based Text-to-Microstructure Generation
https://github.com/hongsong-wang/PropDiff-TMG

>OmniTryOn: Video Try-On Anything at Once!
https://github.com/xcltql666/OminTryOn

>IEA: Amateur-Friendly Conversational Image Editing Agent via Three Stages of Multitask Alignment
https://github.com/OpenDFM/Image_Edit_Agent

>CapRL++: Unified Reinforcement Learning with Verifiable Rewards for Dense Image and Video Captioning
https://github.com/InternLM/CapRL

>CHROMA: Detecting AI-Generated Images through Inter-Channel Color-Space Correlations
https://github.com/JPSoteloSilva/CHROMA

>VideoWeaver: Evaluating and Evolving Skills for Agentic Long Video Generation
https://github.com/JianhuiWei7/VideoWeaver

>Built to benefit everyone: our plan
https://openai.com/index/built-to-benefit-everyone-our-plan

>China Preps $295 Billion Plan to Fund Nationwide AI Buildout
https://www.bloomberg.com/news/articles/2026-06-09/china-prepares-295-billion-plan-to-fund-nationwide-ai-buildout

>Z-Image-Engineer V6 (4B)
https://huggingface.co/BennyDaBall/Z-Image-Engineer-V6

06/08/2026

>Unified Safe In-context Image Generation in Multimodal Diffusion Transformers via Restricting Unsafe Information Flows
https://github.com/deng12yx/UVR

>GuideCAD: A Lightweight Multimodal Framework for 3D CAD Model Generation via Prefix Embedding
https://github.com/mskimS2/GuideCAD

>Consistency-Preserving Diverse Video Generation
https://github.com/XinshuangL/Diverse-Video

>Ideogrammar — Ideogram 4 Prompt Editor
https://github.com/rlemson7/ideogrammar
>>
>>109020090
the OP is a mentally ill tranny dude
>>
>>109020062
No, she's my Chinese researcher wife, not yours
>>
>mfw Research news

06/09/2026

>Ultra Flash: Scaling Real-Time Streaming Video Generation to High Resolutions
https://arxiv.org/abs/2606.09150

>Seeing is Believing: Aligning Prompt Rewriting with Visual Anchors for Text-to-Image Generation
https://arxiv.org/abs/2606.08492

>MilliVid: Hierarchical Latents for Long-Range Consistency in Video Generation
https://davidcharatan.com/millivid

>OmniGen-AR: AutoRegressive Any-to-Image Generation
https://arxiv.org/abs/2606.09156

>TIDE: Task-Isolated Diffusion for Unified Video Editing and Generation
https://LittleWork123.github.io/tide

>LiteVSR: Lightweight Adaptation of Frozen Diffusion Transformers for Video Super-Resolution
https://arxiv.org/abs/2606.09250

>CineDance: Towards Next-Generation Multi-Shot Long-Form Cinematic Audio-Video Generation
https://aliothchen.github.io/projects/CineDance

>CoVEBench: Can Video Editing Models Handle Complex Instructions?
https://arxiv.org/abs/2606.08415

>ZIPP:Zero-shot Image Personalization from Personas
https://arxiv.org/abs/2606.08841

>TUDSR: Twice Upsampling-Diffusion for Higher Super-Resolution
https://arxiv.org/abs/2606.09608

>Beyond Raw Signals: Undecoded Generative Latents as Privileged Synthetic Data
https://arxiv.org/abs/2606.08336

>Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions
https://arxiv.org/abs/2606.09076

>HACK++: Towards More Effective Head-Aware Key-Value Compression for Efficient Visual Autoregressive Modeling
https://arxiv.org/abs/2606.08302

>Understanding Quantization-Aware Training: Gradients at Quantized Weights Bias to the Low-Loss Basin
https://arxiv.org/abs/2606.09012

>Diffusion Image Generation with Explicit Modeling of Data Manifold Geometry
https://arxiv.org/abs/2606.00094

>Real-Time AttentionBender: Granular Interactive Network Bending of Video Diffusion Transformers
https://arxiv.org/abs/2606.06497

>Optimizing Few-Step Generation with Adaptive Matching Distillation
https://arxiv.org/abs/2602.07345
>>
>>109020092
Ideogram felt way worse to me what with its retarded prompting schema and the filter that actually isn’t really a filter if you prompt it right.
>>
>>109020092
wrong FLUX 9B Distill is perfect right away
>>
>>109020104
fact
>>
STOPPPPP trying to convince me to use ideogram i dont wanna right now
>>
Is nucomfy compromised? Do I do the needful and upgrade to try the new FOTM?
>>
(repost)
>>109020005
>>109020000
How do you know if a 1girl is a 1girl tho?
>>
>>109020128
my model would never lie to me
>>
>>109020140
As long as she identifies as a 1girl it's definitely not gay.
>>
>>109020080
Does this look like a qwen workflow? You could just ask for help instead of accusing everyone of lying.
Workflow here, obviously remove the lora loaders: https://files.catbox.moe/fvmcf3.png
If that doesn't work, try changing the system memory fallback setting in the NVIDIA Control Panel and add --reserve-vram 2 to your .bat
>>
File: Ideogram_00211_.png (2.88 MB, 1840x1040)
>>
no lie, I've never really had any women flirt with me, so one day a tranny did, and honestly he was passing, but not, because like what are the odds lmao. A "tomgirl" style tranny.
>>
>>109020144
I quite literally dropped my entire graphical interface and opened the comfyui web interface on another machine just to be sure it had as much ram as possible and it still doesn't work
I'm not going to waste hours of my life just because you want to troll people into running a model it's impossible for them to run
>>
File: Ideogram_00212_.png (1.59 MB, 1840x1040)
>>
>>109020168
I have yet to see anything nice come out of ideogram.
>>
>>109020168
holy shit i was going to call this bullshit but then i noticed that >>109020144 isn't using the Ideogram 4 Scheduler
>>
File: 32be5e94e2f69177.gif (939 KB, 320x327)
>>109020190
>>
>>109020190
Pretty sure that scheduler is important.
>>
File: neko ui 2.png (130 KB, 1304x711)
>>
>>109020113
>Is nucomfy compromised
for like a year now yeah
>>
>>109020215
Where in the code?
>>
>>109020144
Is flash attention worth the hassle?
>>
File: ComfyUI_00041_.png (2.24 MB, 1536x1536)
>>
>>109020220
check it out y'all. c:/users/desktop/comfyui/network.py
>>
>>109020223
>Is flash attention worth the hassle
I don't know how you can have comfy on your PC for more than a week and not have inadvertently installed flash attention like 20 times.
>>
File: idogram_00003_.png (1.78 MB, 848x1264)
>>
>>109020238
>ido

idk man. trained on plasters.
>>
>>109020238
Flash attention *whore*
>>
>>109020233
I'm sorry I'm not a vramlet
>>
no lie, I've never really had any interest in API models, but one day I tried grok, and honestly it was uncensored, because like what are the odds lmao. A "uncensored" style API.
>>
do you guys accept refugees?
>>
not your workflow not your waifu
>>
>>109020256
only if you are the kino kind of refugee
>>
>>109020249
>I'm not a vramlet
>having flash attention 2 means you're a vramlet

That's a new one
>>
File: Ideogram__00196_.png (1.88 MB, 992x1504)
>>109020190
>>109020201
I know I'm getting baited now but here's the same workflow with ideogram4 scheduler. https://files.catbox.moe/w0s0cg.png
>>109020223
5.43s/it with
6.09s/it without
Not massive gains but it takes 30 seconds to install so might as well
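
For scale, the two timings quoted above work out to roughly a 12% speedup. A minimal Python sketch of that arithmetic, using only the s/it figures from this post (the function names are made up for illustration, and a single pair of timings says nothing about other cards or workflows):

```python
# Seconds per iteration as quoted in the post above.
WITH_FLASH_ATTENTION = 5.43
WITHOUT_FLASH_ATTENTION = 6.09


def speedup(baseline: float, improved: float) -> float:
    """How many times faster `improved` is compared to `baseline`."""
    return baseline / improved


def time_saved_pct(baseline: float, improved: float) -> float:
    """Percentage of per-iteration time saved relative to `baseline`."""
    return (baseline - improved) / baseline * 100


ratio = speedup(WITHOUT_FLASH_ATTENTION, WITH_FLASH_ATTENTION)
saved = time_saved_pct(WITHOUT_FLASH_ATTENTION, WITH_FLASH_ATTENTION)
print(f"{ratio:.2f}x faster, {saved:.1f}% less time per iteration")
# prints "1.12x faster, 10.8% less time per iteration"
```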
>>
>>109020256
Depends where from.
>>
File: Ideogram_00217_.png (2.48 MB, 1840x1040)
>>
AI is for chumps.
>>
>>109020258
>>109020275
/adt/ :'(
>>
>>109020250
I have Grok and Nano Banana, through their like um real basic plans. Like I think the lowest is a decoy product, idk.

Anyway, they're both pretty decent 1girl generators overall.

I do something I think is amusing, instead of photographing assorted hos, I attempt to memorize their appearance and then type it in and gen it. It's harder than it sounds, very amusing imo. So far, I go through basic colors like as layers top down, then styles, then key extras. So like
straw - hair
white - shirt
white - shorts
black - shoes

with extra layers if they are there, like a belt etc.

then a types layer
double tied pony tail
hourglass-ish 80's style shorts pantsuit *I'll look this up*
black roller skates (neon green accent)

sometimes there is extra info that's like maybe add, maybe don't, because it depends on the perspective, so if a freckled face, then you can't really have the pony tail.

And, if we consider the face, there are shapes to the face that might be observed.

I will also observe if something is dirty, and if she's sweaty, but these really are just basically optional according to what you think.

If you save it to the gallery in your phone, you may do a double take, because it may look like you took their photo, sort of. It's highly funny.
>>
>>109020285
Yes, this is an anime general, you know tdrussell? Well, he posts here.
>>
I'm sitting naked after I mowed the lawn, so my balls smell like a dead rat.
>>
File: ComfyUI_00042_.png (2.93 MB, 1536x1536)
>>
in case any ldg hot babes are into that.
>>
>>109020285
I was worried you were gonna say /sdg/ or something. That's like hearing about someone who survived living next to the elephants foot and decided to move in after all this time.
>>
File: Ideogram_00218_.png (2.57 MB, 1840x1040)
>>
>>109020050
IT'S PIKACHU
>>
>>109020285
We like anime too, what's your favorite anime?
>>
File: Neko UI 3.png (283 KB, 1920x1080)
>>
I'll make the perfect ux for myself!
>>
>>109020319
>>109020279
I like these, catbox?
>>
>>109020325
NTA but i like Evangelion and Frieren.
>>
>>109020337
cringe and cringe
>>
File: oldfuk.png (81 KB, 198x182)
Drag and drop is not feasible on comfy anymore, what happened?
>>
>>109020285
Welcome ^^
>>
File: 01311-3546849431.png (2.28 MB, 1024x1536)
>>109020285
Are you a gacha fag? here is a Furina, enjoy. Welcome to the thread anon!
>>
>>109020325
That's kind of a loaded question, but to name a few:

Katanagatari, shoujo shuumatsu ryokou, 3-gatsu no lion, made in abyss, kaguya-sama wa kokurasetai

>>109020380
I played a lot of genshin but got too bored in natlan and stopped.
>>
>>109020300
Wait, tdrussell? The actual dev of Anima lurks this general? Based. Good info anon.
>>
File: ComfyUI_00044_.png (1.77 MB, 1024x1024)
>>
>>109020331
https://files.catbox.moe/eo3s33.png
>>
>>
>>109020425
damn, ideogram is pretty good
>>
>>109020459
>damn, ideogram is pretty good
samefag

this shit is easy, bring it.
>>
what causes schizsaar to seethe so hard about ideogram?
>>
>>109020504
iffy license and rocky launch created a polarizing opinion on the model. People who never moved on from the initial impression are baffled people enjoy it now.
>>
>>109020517
we just need tdrussell to finetune it to make everyone happy
>>
>>109020476
well these;

>>109020319
>>109020279
>>109020172
(prompt plz...)
look cool for wallpapers but it could be a fluke, since all I hear is it is tarded model trained on closed source outputs. is it?
>>
oh anonie...
>>
File: debo_vn_fia_00045_.png (2.15 MB, 1792x977)
>>
>>109020560
>it is tarded model trained on closed source outputs
it is. but 60% of nano banana pro is still 200% better than shit like z image or hidream. training on sloppa api outputs won't get you nearly as good as the models you're copying, but it's still better than everything else. local is just that pozzed right now.
>>
[[[[[[[[[[[[cleft chin]]]]]]]]]]]]]]
>>
>>109020053
Why didn't they just copy the Chinese? Bloated mess
>>
>>109019569
Is it anima Samv2
>>
6b is ideal. more than sdxl and can fit on most cards with decent quants. anima doesn't have enough parameters.
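
A rough way to sanity-check the "fits on most cards with decent quants" claim is to estimate weight memory from parameter count and bits per weight. The sketch below is a back-of-the-envelope estimate, not a measurement: it counts the diffusion model's weights only and ignores the text encoder, VAE, activations, and the per-block overhead of real quant formats. `weight_gib` is a hypothetical helper, and the bit widths are illustrative choices rather than anything stated in the post.

```python
def weight_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


# 6B parameters at a few common precisions (assumed, for illustration).
for label, bits in [("fp16", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"6B @ {label}: {weight_gib(6, bits):.2f} GiB")
# prints:
# 6B @ fp16: 11.18 GiB
# 6B @ 8-bit: 5.59 GiB
# 6B @ 4-bit: 2.79 GiB
```

By this estimate the fp16 weights alone would already crowd a 12GB card, while an 8-bit or 4-bit quant leaves headroom for everything else.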
>>
>>109020614
Easier to fix than moot chin tho
>>
>>109020617
>Why didn't they just copy the Chinese?
desu because they are retarded and lazy
>>
>Why didn't they just copy the Chinese?
you mean why didn't they sell out to API? thank god they didn't.
>>
calm down anon no need to start getting upset
>>
>>109020635
Yes.
>>
>>109020560
>>109020441
>>
>negative prompt: ugly
do you really need more?
>>
>he doesn't even notice mootchin
>>
The very best 1girl ever thought of is coming right up.
>>
>>109020782
stop spying on my generations
>>
File: Ideogram_00224_.png (2.6 MB, 1376x1376)
>>109020782
Sorry to keep you waiting
>>
File: 973.jpg (32 KB, 680x686)
>>109020800
>>
File: output_1781058796.png (2.22 MB, 832x2048)
>>109020800
>no feet

uh. that can't be it.

HERE IT IS!!!

*square toenails are normal.
>>
boobs too small now wtf anon >>109020807
>>
File: FK9B__00001_.png (1.78 MB, 832x2048)
>>109020810
oops, here it is. lmao
>>
>>109020812
I got scared that the giant cone nipples would be too much for a blueboard so I made the bboxes for the boobs smaller.
>>
>>109020822
>want to make nipples smaller
>instruct model to make entire booba smaller
??????????
>>
>>109020829
The bigger the boxes got the more ridiculous and bovine the nipples got.
>>
>>109020822
herro im hiroshima nagasaki, i give you gaijin permission to postu
>>
File: Ideogram_00227_.png (3.28 MB, 1376x1376)
>>109020837
>>
File: ComfyUI_00047_.png (1.67 MB, 1024x1024)
>>
so what happens now?
>>
>>109020941
show me something
>>
File: Ideogram_00233_.png (3.22 MB, 1376x1376)
>>
ideogram sux for nsfw
>>
>>109019368
Catbox?
>>
>>109021051
Depends what your poison is.
>>
>>109021053
monster-girls, futas, massive nips, normal shit
>>
>>109021051
it's more uncensored than zit
>>
File: tmpveu9wjoc.png (744 KB, 1376x1944)
How do I make my gens better? They're not up to par, imo.
>>
>>109021203
put satan into the negative prompt


