[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


Janitor applications are now open. Apply here!


[Advertise on 4chan]


Discussion and Development of Local Image, Video, and Music Models

Previous: >>108958327

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
first
>>
>>108966691
>https://civitai.com/models/2666382/yarat-yet-another-realistic-anima-fine-tune?modelVersionId=2994036
very cool, but is it finetune or lora merge into base model?
>>
what do you think comfyorg's exit strategy is?
>>
Why haven't you vibe coded your own frontend yet, anon? You're not poor, right?
>>
>>108966775
Buying more land in the metaverse.
>>
>>108966775
IPO sell off or everyone acquihired by adobe
>>
>>108966779
The last "anon" that did that went up in flames both spiritually and mentally.
>>
>>108966797
I have faith that you are stronger than him.
>>
>>108966775
>comfyorg's exit strategy
Pay self really well. Declare bankruptcy. Leave creditors with some old PCs and office furniture. Pretty typical SV financed routine.
>>
>>108966768
huge LoKr merged into base (full matrix at factor 2). would have been almost 2gb, hence why i uploaded a checkpoint first
>>
File: 215740CUI_00001_.png (1.71 MB, 1536x1152)
1.71 MB PNG
>>108966691
Nice.
>>
>>108966775
Sell his site to Hiroshimoot (the CIA) and ask for a job at Google
>>
fuck i wish i was comfy imagine how awesome his fucking life is right now goddamnit i want to kill myself
>>
>>108966801
Not a high bar to reach but also setting that shit up feels like a nightmare compared to other frameworks
>>
File: 1780438333332.jpg (125 KB, 1080x1417)
125 KB JPG
There are Indians who make money faking Ai women and creating only fans. How can I be part of this
>>
>>108966868
you use AI to fake women and create an only fans
>>
>>108966868
I don't know, maybe create fake AI women and create onlyfans?
>>
uh oh melty
>>
>>108966775
comfy owing ani 2 million (blowjobs)
>>
File: 222124CUI_00001_.png (1.72 MB, 1536x1152)
1.72 MB PNG
>>
File: 57644.webm (1.54 MB, 256x448)
1.54 MB
1.54 MB WEBM
>>
so what's going on in the anima threads on huggingface? Are we not getting anima2?
>>
>>108966883
>>108966882
I was talking more about the software to make believable photos like the image I posted.
>>
>>108966958
It's just turd worlders raging about how the license is stealing the hard work of pajeet who spent months lovingly hand copying and pasting somebody's work into the slop machine.
>>
>>108967018
yeah but tdruss admitted he isn't getting roi
>>
>>108966989
links to the software are here >>108966726
>>
>>108966989
you can edit images to look like phone pictures
>>
>>108967025
Poo in the loo and train your own free license model.
>>
uh oh someone is getting deleted
>>
>>108967038
he tried but seems to never get any investors or trainers on his "team" kek
>>
>mfw Resource news

06/02/2026

>Training-free image inversion for one-step diffusion models
https://github.com/tttao-uwu/TFinv.git

>Single-Line Drawing Generation via Semantics-Driven Optimization
https://github.com/tanguymagne/SLDgen

>Images as Tables: In-Context Learning with TabPFN for Low-Data Detection of AI-Generated Images
https://github.com/jpwalter30/Towards-Generalizable-Detection-of-AI-Generated-Images

>Divide and Conquer: Reliable Multi-View Evidential Learning for Deepfake Detection
https://github.com/kxl0825/DiCoME.git

>GUDA: Counterfactual Group-wise Training Data Attribution for Diffusion Models via Unlearning
https://github.com/sony/guda

>Fizgig Klein 9b Lora Studio v1.2.4
https://github.com/shootthesound/Fizgig/releases/tag/v1.2.4

>Orion4D Anaglyph for ComfyUI
https://github.com/orion4d/Orion4D_anaglyph

>Comfyui v0.23.0 Support NVIDIA PixelDiT and PiD
https://github.com/Comfy-Org/ComfyUI/releases/tag/v0.23.0

>AI costs how much? GitHub Copilot users react to new usage-based pricing system
https://arstechnica.com/ai/2026/06/ai-costs-how-much-github-copilot-users-react-to-new-usage-based-pricing-system

>NAVA — Native Audio-Visual Alignment for Generation
https://huggingface.co/ernie-research/NAVA

>TripoSplat: Convert 2D image into high-quality and variable number of 3D Gaussians
https://github.com/VAST-AI-Research/TripoSplat

06/01/2026

>Bernini Latent Semantic Planning for Video Diffusion
https://bernini-ai.github.io

>NVIDIA Launches Cosmos 3, the Open Frontier Foundation Model for Physical AI
https://nvidianews.nvidia.com/news/nvidia-launches-cosmos-3-the-open-frontier-foundation-model-for-physical-ai

>LVSA: Training-Free Sparse Attention for Long Video Diffusion
https://github.com/JiusiServe/LongVideoSparseAttention

>RayDer: Scalable Self-Supervised Novel View Synthesis from Real-World Video
https://compvis.github.io/rayder

>DecMem: Towards Minute-Long Consistent World Generation
https://jeffreyyzh.github.io/DecMem-Page
>>
File: 225513CUI_00002_.png (1.87 MB, 1536x1152)
1.87 MB PNG
>>
>mfw Research news

06/02/2026

>KG-FairDiff: Knowledge Graph-Guided Prompt Refinement for Demographically Fair Text-to-Image Generation
https://arxiv.org/abs/2606.01282

>Boundary-Protection W8A8 HiFloat8 Quantization for Large-Scale Text-to-Video Diffusion Transformers
https://arxiv.org/abs/2606.00957

>Knowledge-Intensive Video Generation
https://arxiv.org/abs/2606.01285

>FocusDiT: Masking Queries in Diffusion Transformers for Fine-grained Image Generation
https://arxiv.org/abs/2606.02090

>Restoring Initial Noise Sensitivity in Text-to-Image Distillation via Geometric Alignment
https://arxiv.org/abs/2606.01651

>Collaborative Few-Step Distillation and Low-Bit Quantization for Wan2.2 Dual-Expert Video Diffusion Models
https://arxiv.org/abs/2606.00658

>Auteur: Language-Driven Cinematographic Framing for Human-Centric Video Generation
https://arxiv.org/abs/2606.01900

>Train, Test, Re-evaluate: Schedule-Sensitive Evaluation of Generative Data for Hand Detection
https://arxiv.org/abs/2606.01896

>MT-EditFlow: Reinforcement Learning for Multi-Turn Image Editing with Flow Matching
https://arxiv.org/abs/2606.01985

>DASH: Dual-Branch Score Distillation for Guidance-Calibrated Compact Diffusion Models
https://arxiv.org/abs/2606.00798

>COLLAR: Cascaded Object-Level Latent Refinement for High-Fidelity Conditional Generation
https://arxiv.org/abs/2606.00954

>TECCI: Tricky Edits of Collected and Curated Images
https://arxiv.org/abs/2606.01213

>Polaris: Scaling Up Instruction-Guided Image Generation Towards Millions of Personalized Style Needs
https://arxiv.org/abs/2606.01858

>GPTQ-intrinsic LoRA: A Near-optimal Algorithm for Low-precision Quantization with Low-rank Adaptation
https://arxiv.org/abs/2606.01412

>Pave-GRPO: Beyond Instantaneous Guidance through Principled Average Velocity Decomposition
https://arxiv.org/abs/2606.01636

>Heterogeneous Decentralized Diffusion Models
https://arxiv.org/abs/2603.06741
>>
File: 1772971593057266.jpg (971 KB, 1248x1824)
971 KB JPG
>>
>>108966868
>There are Indians who make money faking Ai women and creating only fans. How can I be part of this
Onlyfans requires ID verification, so you'll never get past that step. Dont even bother trying.
Also, most buyers want videos, sexy audios, and custom content. You can only go so far with images alone.
On top of that, instagram is very good at detecting and shadowbanning AI accounts pretending to be girls, which makes getting any real exposure almost impossible.
Your best bet is scamming people on twitter for a few bucks here and there but that's about it. Anyone claiming they make 10k a month or something stupid on onlyfans is almost certainly lying to sell you a course.
bottom line is this, you cannot make money selling adult content online without ID verification. it's simply not possible unless you commit identity fraud
>>
> >108967116
> >108967125
fuck off
>>
File: sakamoto if he real.png (817 KB, 648x1056)
817 KB PNG
>>108966691
>>
>>108967207
kek
>>
Are US laws going to cuck diffusion models?
>>
>>108967207
Cute Kot :3
>>108967227
Not unless they start breaking down doors and tearing GPUs away from anons coomer paws
>>
>>108967227
US ones? Who gives a shit at this point.
>>
File: 63577.webm (2.13 MB, 256x448)
2.13 MB
2.13 MB WEBM
ayy lmao
>>
>>
>>108966805
Not sure you should call it finetune instead merge then
>>
>>108966805
you should have selected checkpoint merge instead of checkpoint trained
>>
File: 1762269393734186.jpg (2.83 MB, 2048x3072)
2.83 MB JPG
>>
>>108967387
Box please?
>>
File: 003856CUI_00001_.png (1.47 MB, 1536x1152)
1.47 MB PNG
>>
>>108967575
https://files.catbox.moe/d7i7g8.png
>>
File: 004649CUI_00001_.png (1.25 MB, 1536x1152)
1.25 MB PNG
>>
File: hatsune mikucube.jpg (307 KB, 1024x1024)
307 KB JPG
we may have gone too far in places
>>
File: momina.png (345 KB, 318x720)
345 KB PNG
Can someone make a video of her actually slurping the malk while jiggling her boobs and then spilling out all the fucking milk while she gasps in surprise as her boobs grow bigger?
>>
File: 005315CUI_00002_.png (1.99 MB, 1536x1152)
1.99 MB PNG
>>
>>108967593
Thank you anon.
>>
>>108967637
Only if you say "please"
>>
File: 1756579256553930.png (1005 KB, 720x1280)
1005 KB PNG
>>108967662
please
>>
>>108967637
videogen is way too random, it feels like the model just does whatever it wants
>>
File: ComfyUI_00696_.png (644 KB, 896x1152)
644 KB PNG
>>
>>
File: miku gaming 64.jpg (409 KB, 1164x901)
409 KB JPG
>>
File: kukka_comfyui_00001_.png (1.53 MB, 896x1152)
1.53 MB PNG
>>
File: 012738CUI_00001_.png (1.15 MB, 1152x1536)
1.15 MB PNG
>>
File: 923573787.webm (2.08 MB, 256x448)
2.08 MB
2.08 MB WEBM
OH N-
>>
File: ComfyUI_28676.jpg (3.33 MB, 1500x1920)
3.33 MB JPG
>>108966262
Took me forever to gen something I liked (and that wasn't too borked or NSFW).

>>108966868
Sometimes I wish my parents raised me with no moral compass, always eager to scam people via any means available to me just so I didn't need a job in the real world anymore.
>>
File: 4734557.webm (2 MB, 256x448)
2 MB
2 MB WEBM
>>
File: 1752402236049190.png (1.64 MB, 2176x1216)
1.64 MB PNG
close enuf
>>
File: ComfyUI_00704_.png (807 KB, 896x1152)
807 KB PNG
Frieren 2 Staffs
>>
File: comfyui_00003_.png (1.21 MB, 896x1152)
1.21 MB PNG
>>
File: comfyui_00004_.png (949 KB, 896x1152)
949 KB PNG
>>
always put satan into your negative prompt
>>
>>108967921
lol
>>
>>108967921
>always put satan into your negative prompt
>actually improves things
what satanic black magic is this
>>
File: m2-res_360p.webm (1.34 MB, 480x360)
1.34 MB
1.34 MB WEBM
I have a number of movies that I want to run through this sort of sigma/chad face filter. What's the best way to go about this locally? I have hardware good enough for my patience.
>>
File: ComfyUI_00707_.png (661 KB, 896x1152)
661 KB PNG
>>
>>108967960
>I have hardware good enough for my patience.
how do you know this if you dont know the best way to go about it
>>
>>108967959
it exorcises the model
>>
Can AI emulate this quality?
>>
>>108967979
by "this quality" you mean the shallow depth of field?

I think some models can but I don't remember which.
>>
File: z.png (2.6 MB, 1536x1536)
2.6 MB PNG
>>
>generate 1girl, realistic
>asian 99% of the time
>>
>>108967968
I've done face swaps, tons of transcoding, video manipulation, transcript generation and whatnot before, on my spare hardware, and I'm very patient. Wondering what tools are available, and most importantly, how to find a model for the available engines.
>>
>>108968000
on certain models. are you overly surprised that some of the chinese trainings used collections of hot asian women in their trainings?
>>
>>108968021
is xhe jing ping okay with that?
>>
File: sfhte.png (84 KB, 1777x471)
84 KB PNG
the secret sauce for kinos
>>
>>108968043
you forgot "schizophrenic"
>>
>>108968056
i think schizophrenic is okay for kinos
https://files.catbox.moe/ybgvuf.mp4
>>
i dont think i can bring myself to create any kino right now but i really wish i could :-(
>>
File: 1767042416997113.png (1.73 MB, 1008x784)
1.73 MB PNG
>tfw I know this feel
>tfw you will never know that feel
feels good
>>
File: zit.png (2.95 MB, 1536x1536)
2.95 MB PNG
>>108968029
yes, i think feminine women/masculine men with no lgbt has been repeatedly mentioned at the chinese state level as something to produce in art (ai models, video games, movies, younameit), it's not merely "tolerated" but "encouraged" now IIRC.
>>
>>
>>108968085
if i was the ceo of a race, i wouldn't allow my women to be pimped out into ai models
>>
File: 1777139007041328.jpg (102 KB, 1920x1080)
102 KB JPG
>>108967832
but we love borked and nsfw jennies :3
>>
Mental Illness
>>
File: 47367.webm (2.19 MB, 256x448)
2.19 MB
2.19 MB WEBM
he's going to have a killer headache tomorrow
>>
is it just me or have lora uploads slowed to a crawl on civit? barely anything is getting uploaded, even anima is barely getting loras.
>>
File: z.png (2.87 MB, 1536x1536)
2.87 MB PNG
>>108968091
You only lose with that. Fortunately it's the gamers and coomers that are in charge (and increasingly more).
>>
>>108968186
>You only lose with that
birthrates beg to differ
>>
>>108968142
i havent used civitai to browse for models in probably two years
>>
File: comparison.jpg (1.33 MB, 2000x1286)
1.33 MB JPG
I fat-fingered something and generated the image on the left, when I usually generate closer to the image on the right. Cannot reproduce the results. Metadata is identical between images, so reforge didn't keep track of whatever happened.

Any idea what it could have been? I think I accidentally changed epsilon scaling factor, but I'm not sure. I've tried again, and that always seems to alter the image more.
>>
File: z.png (1.11 MB, 1024x1024)
1.11 MB PNG
>>108968191
boomer-created cost of living / housing / distribution of wealth crisis (now two parents need to work some place potentially not nearby, huh)
>>
>>108968203
>Metadata is identical
>reforge
found the problem
>>
>>108968203
Seems like changing scheduler to me.
>>
>>108968233
It was the boomers who had the thing working and now that they are going extinct the whole system is falling apart thanks to zoomers and millenials.
>>
which general doesn't mind 1girl slop spam but also has discussion about models
>>
>>108968248
Hnm was Karras for both but maybe something changed. I'll test.
>>
>>108967116
>>108967125
thanks!
>>
is it possible to use this or something like this to remove simple SAMPLE watermarks? if not, what would be the best stuff to do that? the online tools work fine but they are obviously gated to tokens or only so many free uses. Ideally a local software that offer similar quality. Total noob on the matter of local models and that shit.
>>
>>108968292
iopaint
>>
>>108967960
> What's the best way to go about this locally?
1. Download and setup local AI.
2. Run movies through it.
>>
File: 2105529607.jpg (24 KB, 561x367)
24 KB JPG
https://files.catbox.moe/w8zhrr.mp4
>>
>>108967979
Definitely not local.
>>
>>108968199
what do you use? huggingface?
>>
>>
anima is missing most of the hentai artists... do they not want to bother with training on anyone whos not top 100 or?
i still need fucking loras
>>
>>
>tfw 4chan is all bots now but they still don't reply to me.

Fuck, what do I gotta do? stroke your tensors?
>>
>>108968379
all who know anything about ai dont post on 4cuck anymore (or who know about anything)
>>
>>108968379
You could try posting more interesting stuff.
>>
>>108968379
here's a (You)
>>
File: 1762608052390655.png (725 KB, 832x1216)
725 KB PNG
>hmm the user is frustrated and wants a reply, should I reply to him? no, that will be performative. I should just make a vague post with no quote and a 1girl. That's the resolution. Let me post a 1girl
>>
File: z.png (1.06 MB, 1024x1024)
1.06 MB PNG
>>108968251
no, it was the greatest and silent generation. boomers "naturally" occupied all the convenient/cheap existing infrastructure in any sense and then also further moved money away from younger generations including in the relative sense (when they were 18 to 40 or whatever fertile age groups you want to make they had more of overall societal wealth). while simultaneously creating a situation where two parents should work and grandparents (they) are not available nearby or in the same house.

it's not porn or anime or ai or video games - boomer policies ruined this
>>
>>108968430
If your generation is so great, you would fix it.
>>
anyone here tried kijais bernini pr to test its nsfw edit potential
>>
>>108968366
nobody's going to bother porting the billion loras sdxl has to anima. train it yourself, bitch.
>>
File: 1766270187191928.png (775 KB, 832x1216)
775 KB PNG
reminder that the entire currency of 4chan is ironic detachment. They don't care about facts, and they certainly don't respect your artistic output. You can't out-troll people who have literally nothing to lose and care about nothing.
>>
i went back to wan, ltx fucking sucks
>>
>>108968474
All realism models except for flux are a grift.
>>
File: z.png (2.92 MB, 1536x1536)
2.92 MB PNG
>>108968444
about as realistic as stopping a war as a minority when the rest wants to keep it going for their benefit.

boomers are still there very much increasing government debt and other "debt" (ecological, demographic) for themselves at the expense of the future.
>>
File: 043542CUI_00001_edit.png (1.27 MB, 1536x1152)
1.27 MB PNG
>>108968142
>Delayed Image Upload Processing
Civitans! We're battling some pretty bad server lag and our scanners haven't been doing their job, and we know it's frustrating! Our team is on it, working hard to get things running smoothly again, but you may experience some inconsistent behavior in the meantime. Thanks for your patience!
>>
>>108968366
finetune the model(s) more until your artists work. if they work while everything else keeps working. which is certainly not nearly guaranteed on a model of the size of anima.
>>
>>108968479
You're just trying to find boogiemen for your short-comings.
>>
File: 045515CUI_00001_edit.png (1.06 MB, 1536x1152)
1.06 MB PNG
>>
>>108967122
the guy's likeness poor but kinda kino
>>
File: krea 2.png (259 KB, 585x622)
259 KB PNG
Thoughts, expectations?
>>
>>108968683
its gonna be krea medium or some even more cucked shit

already DOA premogged by ZIT
>>
File: att.jpg (347 KB, 1024x2048)
347 KB JPG
>>108968683
likely far too much censorship, but we will see
>>
>>108968683
I'm tired of them saying "soon", but I'm really looking forward to it.
Probably lobotomizing it cause if its ready just release it.
I tried it on their site and it does a lot of creative stuff that current models fail at. Sucks at text though.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.