[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


Janitor acceptance emails will be sent out over the coming weeks. Make sure to check your spam folder!


[Advertise on 4chan]


File: collage_1782869265_1.jpg (1.93 MB, 2995x2757)
1.93 MB JPG
Data Hunters, Scrapers, And Hoarding Fags Edition

Discussion and Development of Local Image, Video, and Music Models

Previous: >>109169231

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://huggingface.co/models
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Krea 2
https://huggingface.co/krea/Krea-2-Raw
https://huggingface.co/krea/Krea-2-Turbo

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: 1756160266543630.png (288 KB, 1360x752)
288 KB PNG
that last thread was horrible.
do better
>>
>>109172561
This collage features images from several threads ago
>>
>>109172581
Yes because the last thread only had pornslop and useless images.
>>
>mfw API news

>Google’s new Nano Banana 2 Lite image model is its fastest and cheapest yet
https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-omni-flash-nano-banana-2-lite/

>Seedance 2.0 Mini and 4K is now available in ComfyUI
https://blog.comfy.org/p/seedance-20-mini-and-4k-is-now-available

>ByteDance launches Seed Audio 1.0 Unified AI Audio Generation for Speech, Music and Ambient Sound Creation
https://fal.ai/models/bytedance/seed-audio-1.0

>Midjourney goes from generating cat images to full-body ultrasound scans
https://www.theverge.com/ai-artificial-intelligence/952011/midjourney-medical-ai-ultrasound-scan

>Alibaba releases HappyHorse 1.1 Available on Alibaba Cloud
https://www.alibabacloud.com/blog/happyhorse-gets-stronger-motion-expressiveness-higher-generation-consistency-and-enhanced-visual-quality_603293

>ByteDance's New AI Video Model Can Make 30-Second Clips From a Single Prompt
https://www.cnet.com/tech/services-and-software/bytedance-introduces-new-seedance-2-5-video-model/

>Luma Introduces Ray3.2 Model & API: Complete Creative Control for Video Generation
https://lumalabs.ai/news

>The Layout Bet — Reve 2.0
https://blog.reve.com/posts/the-layout-bet

>Introducing Gemini Omni — Google’s multimodal video creation/editing model
https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-omni/

>Nano Banana 2 and Nano Banana Pro are generally available via Gemini Enterprise Agent Platform
https://cloud.google.com/blog/products/ai-machine-learning/nano-banana-2-and-nano-banana-pro-are-generally-available

>Grok Imagine 1.5 Preview
https://x.ai/news/grok-imagine-1-5

>Seedance 2.0 in Runway API
https://docs.dev.runwayml.com/api-details/api_changelog/
>>
>>109172583
Fair enough
>>
>>109172581
>>109172583
That thread didn't get a collage as well.
>>
>>109172583
based collage baker making sloppers seethe.
>>
schizo link baker has shit taste
>>
File: 1764186554524154.png (2.72 MB, 1275x1650)
2.72 MB PNG
cant wait for a krea 2 tier edit model so I can finally make more comic stuff.

right now training LoRAs for dozens of characters is stupid
>>
>edit model
shouldn't even be a thing anymore. hopefully by that they mean an upgraded klein-like model with edit capabilities. zero reason to have separate models for edit when klein proved it can remain smart and efficient combining them in one.
>>
File: 1717445385642.jpg (3.86 MB, 3264x1740)
3.86 MB JPG
>03 Jun 2024
The first /ldg/ collage, have we regressed
>>
File: 2112671.png (2.1 MB, 1216x1600)
2.1 MB PNG
>>
>>109172694
damn, gens used to have soul.
>>
>>109172587
thanks!
>>
>>109172694
it's all abstract art to hide how poor the gens were
>>
>>109172694
when the barrier for entry lowers, so does the standards.
>>
>>109172672
There's very clearly been two different bakers for a while now with the random anon doing it here and there.
>>
>>109172694
I've made some gens that are so good I'm honestly hesitant to share them. The fear that someone could steal them and monetize it is real. I bet a lot of you feel the same
>>
^nobody is gonna steal your ai slop lil guy.
>>
>>109172715
That's fake and gay. Imagine genning "art" trained on stolen "art" then crying about the possibility of someone stealing your "art". lmao some real ouroboros shit
>>
>>109172679
do they end up having sexo
>>
>>109172715
Post em bro, come on
>>
>mfw Resource news

06/30/2026

>OmniDance: Multimodal Driven Dance Video Generation with Large-scale Internet Data
https://github.com/AMAP-ML/OmniDance

>SAFE-DiT: Semantics-Aware Fast-path Execution for High-Resolution Diffusion Transformers
https://github.com/xuanhuayin/SAFE-DiT

>EcoVideo: Entropy-Orchestrated Video Generation Paradigm in Cloud-Edge Dynamics
https://github.com/IF-LAB-PKU/EcoVideo

>See Only When Needed: Context-Aware Attention Intervention for Mitigating Hallucinations in LVLMs
https://github.com/Iris1946/CAI

>Spanning the Visual Analogy Space with a Weight Basis of LoRAs
https://research.nvidia.com/labs/par/lorweb

>Krea 2 LoRA Trainer
https://github.com/CaptainGrock/Krea2Trainer

>Ideogram JSON Captioner Kit - making ID4 datasets slightly less painful
https://github.com/Adudeguyman/Ideogram-fantastic-upgraded-captioning-kit

06/29/2026

>Krea 2 Base & Turbo — NVFP4 / FP8 / MXFP8 / INT8 / ConvRot INT8
https://huggingface.co/Winnougan/Krea-2-Base-Turbo-NVFP4-FP8-INT8

>Local Dream 2.8.0 with Anima support
https://github.com/xororz/local-dream/releases/tag/v2.8.0

>OSOR: One-Step Diffusion Inpainting for Effect-Aware Object Removal
https://github.com/Zhouqm-Git/osor

>Diffusion Model Attribution via Spectral Coupling of Denoiser Responses
https://github.com/Pragati-Meshram/SGS

>OrthoTryOn: Geometric Orthogonalization for Conflict-Free Unified Fashion Generation
https://github.com/NJU-PCALab/OrthoTryOn

>CSD: Content-aware Speculative Decoding for Efficient Image Generation
https://github.com/aderfebr/CSD

>Dismantling Pathological Shortcuts: A Causal Framework for Faithful LVLM Decoding
https://github.com/Cc2021start/Fox

>Extra CFG++ Samplers
https://github.com/xxiiyu/extra_cfgpp

>VNCCS 3.0 release
https://github.com/AHEKOT/ComfyUI_VNCCS/releases/tag/3.0.0

>forgeModelPatch: Add ZImage and Anima to Forge
https://github.com/croquelois/forgeModelPatch

>Flux2-Klein-9B-True-V3
https://huggingface.co/wikeeyang/Flux2-Klein-9B-True-V3
>>
>mfw Research news

06/30/2026

>Nemotron-Labs-Diffusion-Image: Advancing Masked Discrete Diffusion for High-Resolution Image Synthesis
https://arxiv.org/abs/2606.29814

>Intermediate Text Representation Guided Text-to-Image Generation for Enhancing One-and-Only Alignment
https://basedoun-won.github.io/one-and-only-ir-guidance

>Your Data Manifold is Secretly a Reward Model: Shell-LCC for Text-to-Video Generation
https://arxiv.org/abs/2606.30248

>Mural: Transferring LLM knowledge to image generation via Mixture-of-Transformers
https://arxiv.org/abs/2606.29013

>Concept Removal Guidance: Evidence-Calibrated Negative Guidance for Safe Diffusion Sampling
https://arxiv.org/abs/2606.29801

>Goku: A Million-Scale Universal Dataset and Benchmark for Instruction-Based Video Editing
https://arxiv.org/abs/2606.30599

>Illuminating Unified Multimodal Model for Free-form Interleaved Text-Image Generation
https://arxiv.org/abs/2606.30054

>MuseBench: Benchmarking Intent-Level Audiovisual Arts Understanding in MLLMs
https://musebench.github.io

>Rigel: Self-Distilled Score Adaptation for Image and Video Captioning Evaluation
https://arxiv.org/abs/2606.29997

>MAVIN: Multi-Shot Audio-Visual Generation with Narrative Control
https://arxiv.org/abs/2606.29473

>DreamForge-World 0.1 Preview: A Low-Compute Real-Time Controllable World Model
https://trydreamforge.com

>ScaleErasure: Inference-Time Minimal Intervention for Precise Concept Erasure in Next-Scale Autoregressive Image Generation
https://arxiv.org/abs/2606.29282

>The Human Creativity Benchmark
https://arxiv.org/abs/2606.30561

>What Color is the Sky (for a non-human)?
https://arxiv.org/abs/2606.28912

>W4A4 Quantization for Inference on Wan2.2-I2V-A14B
https://arxiv.org/abs/2606.29337

>Self-Evolving Agentic Image Restoration via Deliberate Planning and Intuitive Execution
https://arxiv.org/abs/2606.28971

>StackingNet: Collective Inference Across Independent AI Foundation Models
https://arxiv.org/abs/2602.13792
>>
>>109172715
pussy.
>>
>>109172720
Let's be honest, everyone here is just posting their throwaway gens. It’s been obvious for a while now.
The ones who are actually talented post their best work on twitter/instagram
>>
>>109172749
and you're not allowed to post those on 4chan because twitter and instagram won't allow it? makes no sense. fartists going to fart
>>
File: 1773144064837849.png (3.6 MB, 1280x1928)
3.6 MB PNG
>>109172749
are you kidding? some of the best stuff Ive seen posted here. its twitter and insta holding all the slop

they literally spam these sites through an API, thousands of accounts less than a year old with literally thousands of images gtfo here
>>
>>109172707
wow, two faggots that screech about drama. so much better than one
>>
>Generate a detailed drawing reference showing hands grabbing and kneading soft smooth balls of fat from various angles
yeah, five year gap in knowledge and quality, at minimum
>>
>>109172776
Wow where do I download that local model?
>>
>>109172776
now get it to add a nipple
>>
>>109172587
>>109172729
>>109172733
Fuck off debo
>>
>>109172769
That makes three with (you).
>>
>>109172822
Proof you aren't Debo?
>>
>>109172825
which one are you? One of the two drama rentry faggots or the faggot that wants them to fuck off and stop making shitty bakes?
>>
File: debo_sf_k2_uv_00036.jpg (2.74 MB, 6192x2580)
2.74 MB JPG
>>109172822
yea prove you're not debo
>>
>>109172478
prompt and lora please
>>
> >109172842
melted space slop
>>
How does Nvidia's PID upscaling compare to using SeedVR?

https://huggingface.co/Comfy-Org/PixelDiT
>>
>>109172838
It sounds like (you) are a fourth.
>>
File: 1667632287024443.png (377 KB, 512x512)
377 KB PNG
These were peak.
>>
>krea2 release
>ideogram2 release
>no gens
>tardbo generating sdxl-tier sloppa
>localkeks coping and seething
did everyone give up?
>>
>>109172939
fuck off.
>>
>>109172939
Hey man I'm cheesin' here don't ya see? Take a break and be the change you want by uploading your gennies.
>>
File: 0938sp.png (1.38 MB, 1024x1024)
1.38 MB PNG
>>
Only 97% you say?
>>
apiGODS accept there is always room to improve
>>
Krea is too limited.
>>
Last chance for Pride Month gens.
>>
>>109173075
we need per-pixel reasoning. i'd pay for comfycloud if we had nano banana uncensored. comfy, it's your move
>>
>>109173094
it's july 1 in india already
>>
File: 1756523081658475.png (3.59 MB, 1920x1088)
3.59 MB PNG
>>109172939
I have gen fatigue because the only stuff that gets praise is pornslop but ironically that content is the most forgettable because once you coom it just sits there in shame.

Plus I'm not getting the same kind of dopamine from people interacting with my stuff on other websites because they might as well just be bots.
>>
>>109173106
I don't think I have ever coomed to an AI image or video and I gen those things sometimes lol.
>>
File: debo_sf_k2_uv_00037.jpg (3.8 MB, 6192x2580)
3.8 MB JPG
>>
>>109173116
you dont have to coom *to it*
cooming washes all that stuff away - real, ai, hentai, etc. and pushes your mind back to a normal state where you can identify your behavior as abhorrent.
>>
File: ComfyUI_temp_nbacg_00095_.png (2.7 MB, 1152x1920)
2.7 MB PNG
>>
File: ComfyUI_00928_.png (1.34 MB, 832x1216)
1.34 MB PNG
>>109172561
What's /lmg/'s consensus on Anima? How does a compare to illustrious?
>>
>>109173139
Nah. Nothing would ever make me think looking at naked women as abhorrent behavior. Sitting down and only genning porn or actively gooning to hardcore shit all day would be or any measure of long lengths of time (wasted) doing that.
>>
>>109172939
Everyone left to let the handful of schizos have their little containment general to shout in.
>>
>>109173183
you are not allowed to mention an*ma here
>>
>>109173183
It generates the same poses over and over.
>>
File: ComfyUI_temp_nbacg_00097_.png (3.01 MB, 1152x1920)
3.01 MB PNG
>>
File: ComfyUI_temp_nbacg_00098_.png (3.45 MB, 1152x1920)
3.45 MB PNG
>>
>>109173208
Box?
>>
>>109172706
Not true at all. You're just trying to justify your classism/racism or whatever other mental defect you have.
>>
>>109173237
What do you mean?
>>
File: ComfyUI_temp_nbacg_00099_.png (3.29 MB, 1728x1296)
3.29 MB PNG
>>109172694
>no edit model
>no text
>shit composition
>random gatcha seed gens
>boring le random abstract no detail style



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.