[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: collage_1774843063_1.jpg (1.89 MB, 4072x2412)
1.89 MB
1.89 MB JPG
Recently Pregnant Edition

Discussion and Development of Local Image and Video Models

Previous: >>108478554

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>108482974
I trained one for Qwen that I never even released where it was like, ~250 images captioned with Gemini 2.5 Pro with no woman ever appearing in aaaaaaaaaaaaaaaaaaaaaaaaaamore than one image. Mabes I redo for Klein
>>
>>108483450
woops held down the a key by accident, fucking kek
>>
>>108483401
Why bake 20 posts early? Need to insert your tranny drama links faggot?
>>
>>108482706
wtf are you talking about, 9B distilled can literally remaster old porn pics without fucking up the dicks. It can't make new dicks sans lora but if you have this problem your prompt is likely retarded.
>>
Blessed thread of frenship
>>
>mfw Resource news

03/29/2026

>HybridScorer: CUDA-powered image triage tool
https://github.com/vangel76/HybridScorer

>Calgary artists debate AI's role in creativity as library launches new residency
https://calgaryjournal.ca/2026/03/12/calgary-artists-debate-ais-role-in-creativity-as-library-launches-new-residency/

03/28/2026

>Seedance 2.0 ComfyUI Nodes
https://github.com/Anil-matcha/seedance2-comfyui

>ComfyUI-DreamScene360
https://github.com/jfirma1/ComfyUI-DreamScene360

>ComfyUI-Foundation-1: Structured Text-to-Sample Diffusion for Music Production
https://github.com/Saganaki22/ComfyUI-Foundation-1

03/27/2026

>ComfyUI Enhancement Utils
https://github.com/phazei/ComfyUI-Enhancement-Utils

>SDXS - A 1B model that punches high
https://huggingface.co/AiArtLab/sdxs-1b

>ComfyUI-DaVinci-MagiHuman
https://github.com/mjansrud/ComfyUI-DaVinci-MagiHuman

>ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
https://luo0207.github.io/ShotStream

>Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration
https://v-gen-ai.github.io/Calibri-page

>Free-Lunch Long Video Generation via Layer-Adaptive O.O.D Correction
https://github.com/Westlake-AGI-Lab/FreeLOC

>MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data
https://macro400k.github.io

>EagleNet: Energy-Aware Fine-Grained Relationship Learning Network for Text-Video Retrieval
https://github.com/draym28/EagleNet

>PMT: Plain Mask Transformer for Image and Video Segmentation with Frozen Vision Encoders
https://github.com/tue-mps/pmt

>PixelSmile: Toward Fine-Grained Facial Expression Editing
https://ammmob.github.io/PixelSmile

>RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models
https://yfyang007.github.io/RealRestorer

>Google AI breakthrough is pressuring memory chip stocks from Samsung to Micron
https://www.cnbc.com/2026/03/26/google-ai-turboquant-memory-chip-stocks-samsung-micron.html
>>
>mfw Research news

03/29/2026

>FontCrafter: High-Fidelity Element-Driven Artistic Font Creation with Visual In-Context Generation
https://arxiv.org/abs/2603.22054

>From Part to Whole: 3D Generative World Model with an Adaptive Structural Hierarchy
https://arxiv.org/abs/2603.21557

>GenMask: Adapting DiT for Segmentation via Direct Mask
https://arxiv.org/abs/2603.23906

>Dress-ED: Instruction-Guided Editing for Virtual Try-On and Try-Off
https://arxiv.org/abs/2603.22607

>Efficient Coarse-to-Fine Diffusion Models with Time Step Sequence Redistribution
https://arxiv.org/abs/2603.21348

>Efficient Zero-Shot AI-Generated Image Detection
https://arxiv.org/abs/2603.21619

>MultiBind: A Benchmark for Attribute Misbinding in Multi-Subject Generation
https://arxiv.org/abs/2603.21937

>WorldCache: Content-Aware Caching for Accelerated Video World Models
https://umair1221.github.io/World-Cache

>MSRL: Scaling Generative Multimodal Reward Modeling via Multi-Stage Reinforcement Learning
https://arxiv.org/abs/2603.25108

>PosterIQ: A Design Perspective Benchmark for Poster Understanding and Generation
https://arxiv.org/abs/2603.24078

>Getting to the Point: Why Pointing Improves LVLMs
https://arxiv.org/abs/2603.21746

>TRINE: A Token-Aware, Runtime-Adaptive FPGA Inference Engine for Multimodal AI
https://arxiv.org/abs/2603.22867

>ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models
https://arxiv.org/abs/2603.21105

>Visual Attention Drifts,but Anchors Hold:Mitigating Hallucination in Multimodal Large Language Models via Cross-Layer Visual Anchors
https://arxiv.org/abs/2603.25088

>AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection
https://arxiv.org/abs/2603.23115

>When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning
https://dingwu1021.github.io/SelfJudge
>>
File: 1757222339774685.png (2.34 MB, 1256x1616)
2.34 MB
2.34 MB PNG
>>
File: 1751120624166566.png (786 KB, 578x767)
786 KB
786 KB PNG
>>
>>108483526
>https://github.com/vangel76/HybridScorer
seems broken, uses some (IDK which) old version of transformers and so on
>>
File: 1750091082545580.png (2.1 MB, 1664x2560)
2.1 MB
2.1 MB PNG
>>
File: deJZ_zi_00047_.jpg (1.04 MB, 1745x1920)
1.04 MB
1.04 MB JPG
>>108483669
they've been pushing a lot of fix commits
>>
File: kJXrg20B298.jpg (172 KB, 742x777)
172 KB
172 KB JPG
>started waking up in the middle of the night because my training run oomed and I could resume it
cyberpunk 6th sense
>>
>>108483829
Had similar experience few times. Good thing loras train so fast nowdays there's no need to leave pc running over night.
>>
File: 1767602104834922.png (2.38 MB, 1664x2560)
2.38 MB
2.38 MB PNG
damn words
>>
File: 1765702170766508.png (1.36 MB, 1168x1792)
1.36 MB
1.36 MB PNG
>>
>>108483882
*cringes*
>>
>>108483401
why did you bake 20 posts before the bump limit and insert off topic links into the op?
>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
????
>>
>>108483526
>>108483531
why is this schizo still here? is he admiting that /ldg/ is the superior thread?
https://rentry.org/debo
>>
>>108483460
>>108483465
>>108483919
You have no idea how fun it is to see anifart having melties like that, he's such an entertaining schizo
>>
>>108483950
are you ok?
>>
>>108483942
god forbid we talk about diffusion. nah, let's talk about ani's farts
>>
>>108483919
are you ok?
>>
>>108483957
>let's talk about ani's farts
what? you're responding to a post talking to debo, you aren't the center of the world, there's other schizos we also make fun of
>>
>>108483968
>we
you and your headmates?
>>
>>108483972
why are you talking about ani's farts though? you wanna smell them?
>>
35 stars status??
>>
File: 1755851760859784.png (236 KB, 595x619)
236 KB
236 KB PNG
wtf I love orange man now!
>>
>>108483977
that wasn't me, dummy. but hey since we're on the topic what's your opinion on it? what do you think they smell like?
>>
File: 1760894255929414.mp4 (603 KB, 640x832)
603 KB
603 KB MP4
>>108483977
>>
>>108483998
Based
>>
>>108483988
Problem is, Trump is building a fascist ethnostate where creating adult content could result in death. We're not there yet, but we're not far off.
>>
>>108483988
That's great
>>
>>108484006
You have to be over 18 to post here
>>
>>108484006
>Trump is building a fascist ethnostate where creating adult content could result in death.
By preventing banks from refusing to provide services to companies that want to offer adult content? How does that work?
>>
>>108484014
banks can still refuse
>>
>>108483988
>wtf I love orange man now!
If he really had any balls, he should have proposed a new amendment prohibiting banks from refusing to cooperate with businesses for reasons unrelated to legality.
>>
>>108484026
Cant every business choose their customers in usa of merica
>>
>>108484022
but Trump's action definitely indicates that he's supporting businesses that want to provide adult content, that's the exact opposite of "a fascist ethnostate where creating adult content could result in death"
>>
>>108484030
Banks should be the exception, you can't do business online without going through them, they have way too much power.
>>
>>108483988
Huh, positive if true. I'll take it.
>>
>>108483998
>Ani supposedly hates trannies
>But he definitely loves roleplaying as an anime girl
you will never be a woman anitroon
>>
Reminder Suno 5.5 was leaked
>>
File: lul.png (392 KB, 1200x516)
392 KB
392 KB PNG
>>108484071
>Suno 5.5 was leaked
as true as the "leak" of Seedance 2.0
>>
>>108484071
you leaked into your pants
>>
File: 00016-3117488846.png (2.13 MB, 1920x1080)
2.13 MB
2.13 MB PNG
>>108484071
>>108484080
>>108484085
i leaked your mum's pants m8
>>
>>108484080
this one was real though?
>>
>>108484006
>a fascist ethnostate where creating adult content could result in death.
why are you so islamophobic anon? :(
>>
>>108484094
>this one was real though?
yes, I'm actually running Seedance 2.0 9b locally right now
https://huggingface.co/ByteDance/Seedance2.0-9b
>>
Local AI wouldn't be so cucked if they had age verification built into the models.
>>
>>108484113
it's cucked because they're preventing people to make illegal images, it doesn't matter if the guy making them is an adult or not
>>
>illegal
>>
>>108484137
>>108484120
>>
>>108484006
reddit is that way anon ->>
>>
File: Wanimate_00002-audio-2.mp4 (3.94 MB, 590x800)
3.94 MB
3.94 MB MP4
Has there been an update to wan animate? This ain't good.
>>
>>108484103
>9b
imagine
>>
File: 1774605157996884.png (1.55 MB, 1072x1880)
1.55 MB
1.55 MB PNG
>>
FUCKING LIAR
>>
>>
File: 1773620513449207.png (1.51 MB, 1072x1968)
1.51 MB
1.51 MB PNG
>>
>>108484408
chinese culture strikes again
>>
>>
>>108484408
can't believe it's been 4 months we've been waiting for Z-image edit, lmao
>>
>>
>>
Can we generate bathtubs filled with sausages yet? Pls respond, this is the most important test.
>>
File: etc-nbp-2026-03-30_00002_.jpg (2.83 MB, 1696x2528)
2.83 MB
2.83 MB JPG
>>108484473
that technology is beyond us
>>
>>108484445
We have klein.
>>
File: 1768921710293762.png (238 KB, 600x600)
238 KB
238 KB PNG
Please tell me that a newer version of ZIT or a better model for us VRAMlets has been released since last month. I havent been lurking since February, just cooming non-stop.
>>
>>108484481
Damn. We finally did it.
>>
File: etc-nbp-2026-03-30_00011_.jpg (2.66 MB, 1696x2528)
2.66 MB
2.66 MB JPG
>>108484533
it is done
>>
File: etc-nbp-2026-03-30_00019_.jpg (2.91 MB, 2400x1792)
2.91 MB
2.91 MB JPG
>>
>>108484169
skill issue
>>
>>108484560
Can other models do it?
>>
>>108484595
got a catbox?
>>
>>108484618
you won't like it
https://files.catbox.moe/q6kxb3.png
>>
Is ai-toolkit still garbage?
>>
>>
>>108483460
>>108483465
>>108483919
kek what a worthless raped retard
>>
>>108484680
Thanks, I've seen that exact type of workflow before
>>
>>108483741
that's the current state after fix commits for me

requirements says current transformers, code contains references to methods and imports that have been removed from transformers quite a few versions ago

haven't been able to figure out which version
>>
>>108484696
it's still a more simplified tool... not actually that bad

but also yes there are far more of the zoo of optimizers in onetrainer and such and that is an advantage if you think you need to try one of the other optimizers. and such.
>>
File: 1772624366099987.png (178 KB, 640x392)
178 KB
178 KB PNG
>>108484511
>we have plastic generator
>>
>>108483829
> 2026
> can't ask a llm to write a shell script resuming ooming training
>>
How can I do tiled vae encode when I'm using nodes like painter etc?
>>
>>108483953
ani are you ok
are you ok
are you ok ani
>>
>>108484977
https://github.com/BigStationW/ComfyUI-PainterI2Vadvanced
>This custom fork has an additional PainterI2VAdvanced (Tiled) node
>>
>>108484026
>>108484039
> banks
they are payment processors
>>
>>108485006
Same thing, payment providers rely on banks like Visa or Mastercard, so the real driving force behind all of this is still the (((banks))).
>>
>>108484969
>believing any vibecoded garbage
>>
Why doesn't any Ai trainer support creating loras for MMAudio?
>>
>>108484999
Hmm, it says I already have it, but I don't see a tiled version. Is it possible to clone it into a separate folder and have both?
>>
>>108485027
>Is it possible to clone it into a separate folder and have both?
I don't think so
>>
>>108485028
Oh it didn't have first last frame options..
>>
I'm trying to make latent upscale work again and can't figure out why the quality is much worse compared to native.

Latent, video>latent, different upscale settings, more steps. Nothing brings it close.
>>
>>108485062
I guess it's time for you to learn how to vibecode that, use Claude
>>
File: 1764775989442707.png (431 KB, 800x582)
431 KB
431 KB PNG
>>108484511
>We have klein.
Z-image edit will blow this shit out of the water, which is why they're taking their time, they want the MOG to be as impactful as possible
>>
>>108485079
>inb4 turbo and base loras won't be compatible with edit again
>>
>>108485086
I find base loras to work better on turbo than turbo loras on turbo desu
>>
>>108485023
you've just proved again it's skill issue
>>
>>108485067
Upscale doesn't work with my wife Julia.
>>
File: API cucks please.png (115 KB, 1988x652)
115 KB
115 KB PNG
kek
>>
>>108485470
just monkeys pushing buttons to make the shiny appear
>>
>>108485470
>openai being frugal
lol. laugh out loud.
I dunno whey they axed it but blowing cash in record numbers has never been an issue before
>>
>>108485067
i'd love to get your workflow where latent upscale actually works, seriously
>>
>>108485486
>but blowing cash in record numbers has never been an issue before
I guess the AI bubble is popping and they can't afford burning money anymore
>>
>>108485511
that's wishful thinking
openai are fine financially, they have a government contract now
>>
>>108485515
>openai are fine financially
then why are they shutting sora 2 off?
>>
>>108485523
I said I don't know, nobody else is cuting back, and the only thing to happen to OAI recently is the gov involvement
maybe the compute is being redirected to palantir or something. Compute is in short supply but vc isn't
>>
Saar, do the needful.
>>
>>108485562
biuteful desi girl open bobs
>>
>>108485562
how the fuck did he manage to make it so plastic with fucking ZiT?? jeets are on another level I swear
>>
is gguf loader a forbidden custom node that will completely break everything, or has it been sanctioned by our lord and master as worthy to use
>>
>>108485562
what does the comments say please, sar
>>
File: 1747318799937985.png (451 KB, 2902x1270)
451 KB
451 KB PNG
No Z-image edit for you gweilo, how about some gift cards instead saar?
>>
is flux 2 klein supposed to be (ai-generated:1.9)? even 9b one?
>>
>>108485645
>thinking in tag weights
cope
>>
>>108485645
I get good results when using it for edits. It sucks for T2I and most klein loras suck.
>>
>>108485655
so it's only good as a photoshop for lazy fucks? that sucks
>>
File: AnimateDiff_00002.mp4 (1.52 MB, 720x912)
1.52 MB
1.52 MB MP4
>>108485227
She's my wife, fuck off.

>>108485505
I'm trying an old kijai workflow, but I forgot how it works, webm related, broken.

>>108485600
No idea, I block these fags.
>>
>>108484169
there is wan scail
>>
>>108485633
>shit
>shit
>actual usable item
How do I aim for 3rd?
>>
File: Untitled1231231fqwf-2.mp4 (3.66 MB, 1200x514)
3.66 MB
3.66 MB MP4
Meh, the kijai latent upscale workflow I had suffers greatly from the discoloration and is slowmo, can't make use of all the new tech like painter etc.

Left my shitty 480>720 "latent upscale"
Mid default 720p
Right kijai.
>>
>>108485787
>gift card
>"actual usable item"
SAAR DO NOT REDEEM THE CARD
>>
File: AnimaPreview2_00013_.png (1.55 MB, 1264x976)
1.55 MB
1.55 MB PNG
>>
Guys, I can't stop thinking about Ani. How come he gets to go to Japan while I sit in a basement...
>>
>>108485808
How is iGoy and ps5 better than straight cash?
>>
>>108485812
kek
>>
>>108485812
rent free
>>
>>108485816
Trip to Japan for 3 days or so is less than $5k.
>>
>>108485835
yeah, Comfy and ran definitely rent free inside of your schizo head
>>
File: 1_00023_.jpg (3.05 MB, 2616x3556)
3.05 MB
3.05 MB JPG
>>
You're all wasting your lives on digital heroin. I know because I've been there.
>>
>>108485841
did you check under your bed for ani? he could be anywhere you know
>>
I wish Comfy would just become closed source already.
>>
>>108485850
that's not how legs work
>>
File: Schroeder.png (189 KB, 628x420)
189 KB
189 KB PNG
>>108485863
>that's not how legs work
tell that to peanuts characters
>>
>>108485812
absolute kino
>>
>>108485856
pretty sure gen'ing doesn't lock me in a trance like coma for the entire day. i queue up a few hundred gens then go fuck off to work on my actual hobbies.
>>
>>108485918
Cope. It's the same as gambling.
>>
>>108485925
What is the wager I am gambling? What is the risk? I'm not losing anything.

>Your time
I'm not losing additional time either because I can do other things while the gens are processing.
>>
>>108485925
>gambling
I'm not losing money if I get a bad image though?
>>
>>108485941
Why do you queue up "hundreds of gens" then every day? Explain me why can't you just quit, what do you do with the gens? Do they benefit you? What is your long-term end goal with the gens?
>>
>>108485638
>>108485939
>Bottom Line: Qwen maintains a dual-track strategy—open weights for research/community (Apache 2.0) and closed API for enterprise/flagship capabilities. Qwen3.5-Omni follows the latter pattern as of March 2026.
Sad, let's not expect anything good from Alibaba anymore, it's over
>>
>>108485841
seems ani and debo live rent free inside the baker's head since the cretin is hellbent on having these links in the op
>https://rentry.org/debo
>https://rentry.org/animanon
>>
>>108485951
>why can't you just quit
why are you still here if you decided to quit?
>>
>>108485951
>Why do you queue up "hundreds of gens" then every day?
So I can see a multitude of different girls being fucked and sucking dick.

>Explain me why can't you just quit
Why quit something that is fun and has no impact on my physical or mental health?

>what do you do with the gens?
I masturbate to them. Sometimes I post the art I make in threads.

>Do they benefit you?
Yes. Masturbation feels good. Getting (You)'s from my art also feels good.

>What is your long-term end goal with the gens?
To make even better goon material.
>>
>>108485953
Chroma still undefeated
>>
>>108485961
I see, good luck with your long-term future.
>>
File: beachplease.png (3.33 MB, 1080x1920)
3.33 MB
3.33 MB PNG
>>
>>108485967
Thank you. I have progressed significantly since my journey began and am excited with each passing day as I trial and error my ideas.
>>
File: 1764341321211584.png (3.13 MB, 1536x864)
3.13 MB
3.13 MB PNG
>>108485962
Z-image turbo destroys Chroma though
>>
>>108485973
retard.
>>
File: Video_00001.mp4 (2.2 MB, 720x912)
2.2 MB
2.2 MB MP4
I'm making progress on latent upscaling, but now I'm getting this fucked up first few frames. Never seen it before.
Anyone?
>>
>>108485953
>open weights for research/community (Apache 2.0) and closed API for enterprise/flagship capabilities
its only open source while it sucks
>>
File: 1355139830646.png (178 KB, 500x500)
178 KB
178 KB PNG
Do you always want to train on as high batch size as you can manage? Even on smaller datasets ~50 pics?
>>
when are we gonna get a better video model than WAN for porn?
>>
>>108486039
Grok
>>
>>108486051
thanks, I'll check it out
>>
File: 1758177622139665.png (1.62 MB, 1024x1024)
1.62 MB
1.62 MB PNG
>>108485997
It's open source until it's good.
>>
>>108485562
>ZIT
>weird over-defined plasticy butt-chin early SDXL face

bengali excellence at work
>>
>>108486006
most generally would more or less do that (whatever is fastest)

unless the training configuration is doing a stupid thing (most aren't) when batched and you just essentially need batch 1 or 2. that was a thing for some configurations, IDK if it still is.
>>
File: _AnimaPreview2_00006_.jpg (339 KB, 992x1456)
339 KB
339 KB JPG
>>
>>108485939
I'm guessing that still doesn't mean they can now more freely do more NSFW/questionable on the now non-enterprise "research" models?
>>
File: _AnimaPreview2_00016_.jpg (242 KB, 992x1456)
242 KB
242 KB JPG
>>
File: 402564739816582.png (1.34 MB, 1216x832)
1.34 MB
1.34 MB PNG
>>
>>108486287
me on the left
>>
>>108486326
how do you type
>>
File: 1_00025_.jpg (2.25 MB, 2616x3556)
2.25 MB
2.25 MB JPG
>>
File: lmao.png (252 KB, 2040x1151)
252 KB
252 KB PNG
That's right you ungreatful fucks, you should've sucked Sam's big cock and said thank you!
>>
where horse face
>>
>>108486502
I did! I did and he STILL took Sora away from me!!
>>
File: sowwy.gif (1.33 MB, 498x312)
1.33 MB
1.33 MB GIF
>>108486558
Sam: "I remember you, you didn't swallow, so no Sora for you!"
>>
Remember how
 simple background 
was fucked on Noob
>>
File: Untitled.png (178 KB, 474x678)
178 KB
178 KB PNG
>>108485816
Poor Ani tried to team up with Laxhar and Iodestone, but realized Laxhar works at Comfy and was involved in Anima the whole time and Iodestone got funded by Comfy and Yoland, lol.
>>
>>108486982
/lgg/ - Local Gossip General.
>>
>>108486993
You said this, but /ldg/ it stopped being a diffusion thread a while ago.
>>
:(
>>
>>108486982
The local scene kinda sucks lately it's lonly catfights over slop and money. At least NovelAI doesn’t have as much drama.
>>
i hate discordtroons
>>
>>108487023
Oh, you...
>>
>>108487023
Take a look at /de3/ and it’s all gens talk and gens posts. That local catfight only happens when local troons are indirectly chasing money, Ani as a primary example.
>>
>>108487002
But anon diffuses here all the time
>>
>>108487032
You might hate them, but they’re the ones pulling the strings that make things happen.
>>
>>108487032
hey this is ranjak's and his discordtroon friends' place so you can FUCK OFF and post your gens elsewhere
>>
>>108487043
rent gratis
>>
>>108487050
>pulling the strings
>ERP with a mentally ill diaper fag so he can groom you into his secret channel
grim
>>
>>108487067
This is how you get the Comfy fund.
>>
we need /ic/ to raid us.
>>
>>108487055
>>108487063
>>
>>108487080
>>108483401
>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
>>
File: 1771478095929915.png (140 KB, 1150x312)
140 KB
140 KB PNG
>>108487055
>discordtroon
but Ani (you) are a discordtroon though
>>
>>108487090
rent free
>>
>>108487055
rent free
>>
>>108486982
>laxhar is on Anima team
I don't know about anything else in that screenshot, but the Anima team is me and me alone.
>>
>>108487128
we can tell. your dataset is lacking and you jumped the gun thinking it was hot shit. you are literally the new pony dev
>>
>>108487128
preview3 when sar
>>
>trani still assblasted about comfy working with actual talented people
based
and don't forget the flip
>>
>>108487165
about a week
>>
>>108487147
>you are literally the new pony dev
weird thing to say since pony dev managed to make one good model (v6), you're a bit retarded anifart don't you?
>>
>>108487128
can me and my chinese researcher gf join you? plox?
>>
>>108486982
WTF? L-A-X works at Comfy? And Noob 2 is built on GLM Image which is a 9B model with a GLM LLM as text encoder, 40GB+ VRAM minimum... the only way anyone's running this is through Comfy Clo... oh OH It all makes sense now.
:^)
>>
>>108487168
what do you mean?
>>
>>108486982
Thanks, I love WAI and ReForge now
>>
https://www.reddit.com/r/accelerate/comments/1s7twd8/byteplus_is_selling_exclusive_seedance_20_access/
>BytePlus is selling exclusive Seedance 2.0 access to studios at a $2 million commitment. For that price, buyers get what nobody else can: zero queue times, real-face uploads with no content restrictions, and priority compute allocation. Approximately 400 US companies have signed up already.
wtf??? they literally won 8 billion just like that, HOLY SHIT
>>
>>108487209
calm down sperg
>>
Yep, API models are an infinite money printer. We have to face the fact that there will never be another good open source model released in the future.
>>
>>108486982
LMAO what a fucking seething retard failed dev holy fucking shit, the melty is DELICIOUS too
>>
I'm very disappointed in julien.
That's why i take my github star back.
>>
File: 1761638120183731.jpg (707 KB, 1536x1536)
707 KB
707 KB JPG
>>
>>108487337
nice gen. what do you think about the last two lines in the OP? you think they should be there?
>>
>julien
>>
File: 1745135001935744.jpg (710 KB, 1840x1328)
710 KB
710 KB JPG
>>108487352
here's a (you) ;^)
>>
>https://civitai.com/models/2495369/kirazuri-anima
>Known Limitations & Issues: Catastrophic Forgetting Any of the base model knowledge outside of that dataset will have significant forgetting, and LoRA trained on the base model are not expected to function very well with this finetune.
Ah yes, finetuning Anima, it destroys the model knoweldge and breaks everything, and Loras works with nothing but the base model it was trained on, we are so back!
>>
>just noticed consistent anatomy error in my dataset
uggh was so distracted by the big penis didnt even see it
>>
>>108487420
>distracted by the big penis didnt even see it
>>>/h/hgg
>>
Let's take some guesses on what his next FUD will be when it turns out loras and finetunes work just fine on the final version of Anima. Here's my guess: "creativity". Anima isn't "creative" but SDXL tunes are. Why would anyone use an uncreative model for art?
>>
>>108487451
>and finetunes work just fine on the final version of Anima.
Sorry, but that’s an architectural limitation of the Cosmos model. More training won’t fix it, just like SDXL still struggles with multiple characters and hands.
>>
>>108487391
Why would I trust someone who clearly trains on synthetic data?
>>108487451
LoRAs do already work an anon in a previous thread shared his and said there wasn't any "catastrophic forgetting"
>>
>>108487475
>>108487467
More people are noticing Anima's forgetting and lora incompatibility across checkpoints. It's been known for a while. You can cope like you did with Z Base, but the memory issues and compatibility problems aren't going away. It's an architectural issue, it's baked in and won't get fixed.
>>
>>108487501
>More people are noticing Anima's forgetting and lora incompatibility across checkpoints.
any examples of this forgetting? some specific character prompts? I could do some testing
>>
>>108487510
That would ruin the thing he has going on
>>
>>108487391
>based on preview version 1
Found your problem.
>>
>>108487510
No I was making the whole thing up.
>>
>>108487501
who would have thought a model made to be the eyes of some warehouse roomba isn't a good base model for imagen
>>
I think at a certain point you need to realize you're the only person posting in this thread
>>
>>108487501
Preview 2 made Anima stable but dilluted a bunch of artist tags. Only fix is releasingera specific checkpoints every year with new characters and artists. Pick the checkpoint that matches your gen. Anima 1980s, Anima 1990s, Anima 2000s, etc simple as.
>>
What jealousy does to a man.
>>
File: _AnimaPreview2_00043_.jpg (264 KB, 1072x1376)
264 KB
264 KB JPG
>>
>>108486993
Still doesn’t compare to what you’d read inside rs2 clan wars white portal.
>>
File: _AnimaPreview2_00066_.jpg (320 KB, 1456x992)
320 KB
320 KB JPG
>>
File: 1746580829260970.jpg (575 KB, 1536x1536)
575 KB
575 KB JPG
>muh anime
ill take my zitslop wife
>>
>>108487451
who are you talking about? your bull?
>>
>>108487741
bro change your comebacks please, they're pretty fucking stale
>>
>>108487768
it's crazy that every general has at least one of these people.
they post the same shit almost verbatim everyday and then act completely oblivious when they get made fun of.
>lol looks like -insert name- is having a meltie
0.0000001 seconds later
>who? what does this even mean? no one even knows who he is? why are you so obsessed with me? i mean him! who ever he even is??!???!?
>>
>>108487814
yeah it's insane what ran has down to this general
>>
File: ComfyUI_temp_pdtgy_00323_.png (2.96 MB, 1280x1600)
2.96 MB
2.96 MB PNG
>>
>>108487814
fist point, yeah. second point, lol no. I think people pick on him to drive him further into his own madness
>>
>>108487266
SaaS won. All those studios will be inferencing through ComfyUI API too, for maximum workflow flexibility
>>
>>108487846
seems like you're just projecting your own suicidal depression as you sit tormented with the knowledge of how to use AI tools but having no creative spark because you're a sad sack of shit
>>
>>108487822
35 stars status?
>>
>>108487723
This prompt looks very familiar.
>>
>>108487858
kind of based honestly.
>get cucked by hollywood
>in a last ditch effort to recoup your losses you sell "access" for 2m to smaller studios who are willing to risk the legal repercussions
>all those small studios get cease and desist letters the second they generate anything
>you get to scrooge mcduck your new found fortune while those smaller studios go bankrupt
>>
long dick general
>>
>>108487960
based. currently working on bulges in leggings and jeans
>>
>>108486361
ty
>>108486384
https://files.catbox.moe/campnf.png
>>
What hardware are you guys on? Is a 5070 enough?
>>
>>108488424
if you have an nvidia gpu that was made in the last 5 years you can gen locally.
less vram = more gguf + older models
more vram = less gguf + newer models
>>
>>108488474
Yeah but is it good?
>>
>>108488485
better than grok slop
>>
>>108485020
>banks like Visa or Mastercard
>>
File: Z-image-00739_.png (885 KB, 800x1280)
885 KB
885 KB PNG
>>108487732
then i'll take zitslop anime wife
>>
>>108488424
I'm running a 4070 and have 32GB of ram,
I can run Z-Image Turbo and gen a 1440*1440 pic in 60 seconds, Wan 2.1 ranges from 2 min to 4 min depending on reso and length, but i usually stay between 480 - 640 pixels @ 81 - 192 frames
>>
File: 1744853888184329.png (719 KB, 1152x896)
719 KB
719 KB PNG
>>
File: Z-image-00689_.png (874 KB, 800x1280)
874 KB
874 KB PNG
>>
File: 1769694049716015.png (762 KB, 1152x896)
762 KB
762 KB PNG
>>108488529
>>
>>108488424
It's enough. You'll need to use quants for video models.
>>
>>108488541
>>108488529
If /g/ released it's own operating system, this would be the logo.
>>
anon what is the model you used for this gen? very love the skin texture here. did you use seedvr2?
>>
>>108488629
anon what is the model you used for this gen? very love the skin texture here. did you use seedvr2?
>>108484408
i just want that god damn qwen image 2.0 to go open source already.
>>
File: 1765783057668538.png (1.35 MB, 1168x1704)
1.35 MB
1.35 MB PNG
>>
>>108488424
5090/64gb of ddr5. Would love to upgrade 128gb of ram but my pc doesn't function well with 4x ram slots being used and two sticks of ddr5 ram is $2.2k. A 5070 is good entry point for local but it wont be enough in the long run especially as models and text encoders get bigger in size. You will struggle with generating videos at 720p with just 16gb of vram.
>>
File: _AnimaPreview2_00074_.jpg (254 KB, 1072x1376)
254 KB
254 KB JPG
>>
File: _AnimaPreview2_00077_.jpg (195 KB, 1376x1072)
195 KB
195 KB JPG
>>
File: _AnimaPreview2_00086_.jpg (472 KB, 1072x1376)
472 KB
472 KB JPG
>>
File: 00195-2874974734.png (2.71 MB, 1824x1248)
2.71 MB
2.71 MB PNG
for basic talking head content, ltx 2.3 with the ic loras is good at that. Notice a lot more improvements compared to my previous gens made with the previous model. I still wish there was more diverse variation to the voice audio but it's a improvement of the previous 2.0 model.
https://files.catbox.moe/qu1i8q.mp4
https://files.catbox.moe/c89wqs.mp4
>>
has anyone tried davinci?
>>
Is wan still the king for local video porn?
>>
File: 1754184720077841.png (1.19 MB, 1520x1344)
1.19 MB
1.19 MB PNG
>>
>>108482706
>>108483466
I think it isn't censorship but I am having a bug with Reference Conditioning node that causes all white images. Anyone know any alternatives I can plug into my workflow?
Using current comfy is genuinely painful.
>>
File: Flux2-Klein_01079_.png (299 KB, 592x720)
299 KB
299 KB PNG
>>108488960
never experienced that, workflow?
>>
>>108488927
What's this, MS Paint style lora?
>>
>>108488967
Sure.
https://litter.catbox.moe/uujxqrx4264cjfod.png
I git pull'd yesterday.
>>
File: 1765640592487378.png (1.01 MB, 1520x1344)
1.01 MB
1.01 MB PNG
>>108488969
nope just @minuspal, oekaki, jaggy lines
>>
>>108488977
Eh, workflow looks normal to me, probably just bumfart. My install is months old and staying that way.
>>
File: 1754008835212199.png (979 KB, 1520x1344)
979 KB
979 KB PNG
>>108488997
same prompt and seed but with microsoft paint \(medium\),
>>
File: Z-image_00727_.png (1.08 MB, 800x1280)
1.08 MB
1.08 MB PNG
>>
>>108489013
Very wise. I ran into a minor issue with something else and updated, hoping for a fix. Not only that thing wasn't fixed, now my workflow is getting randomly crippled. I am certain the Reference Conditioning node is the clue somehow, it keeps randomly disappearing (says node not installed despite being part of the core) and reappearing when I load old jsons. I opened the exact workflow I uploaded to catbox in a different browser and it worked. It's a lottery. This shit is driving me crazy.
And no my main browser isn't the cause, I tried clearing cache before.
>>
>>108489055
might be related to nodes 2.0. try disabling.
>>
>>108489244
Already disabled but thanks for the suggestion nonetheless.
>>
Will LTX ever be able to deliver "undressing"?
>>
>>108489361
it sucks with cutscenes too
>>
>>108488887
yeah and i dont think it'll be topped unless they release wan 2.5+
>>
noobanima when
>>
>>108489417
the funds are being spent finetuning GLM-Image so that we can autoregressively generate our images in 4 minutes

please understand
>>
>>108489495
can someone go to chinaman and hit him on the head
>>
>>108488774
Just use a good voice cloner model and use that on top of the video. It's a lot better at lipsyncing with custom audio now.
>>
heyyy i have a question! i have a ryzen 7 AI 350 AMD laptop running linux, what's the easiest way to run diffusion models on my npu? and if anyone doesnt mind, chat models as well. arigatou gozaimasu!
>>
thunder_pony
>>
File: 1725545289666750.jpg (4 KB, 233x216)
4 KB
4 KB JPG
i updated comfyui and ltx while waiting for davinci. and now, ltx is much faster. thx i guess kek
>>
>>108489567
sell it and buy one with at least RTX A5000 16 gb
>>
> >108489611
shill
>>
>108487266
>Source: Just trust me bro
>>
File: ComfyUI_18243.jpg (3.48 MB, 1500x2000)
3.48 MB
3.48 MB JPG
>>108488774
Use VibeVoice to clone a voice. Like the other guy said, it's pretty good at lip syncing now... although, that does just add another reel to the slot machine when genning (voice, image and combining them into a video).
>>
Fresh when ready

>>108489653
>>108489653
>>108489653
>>
>>108489573
You are limited to SDXL sized models probably. For even them I am skeptical of you getting any decent performance. I don't know any backend that uses the NPU, so can't help you about it.
For LLMs, you are bandwidth bound so just run them on the CPU. You can only run very small models or MoE models with decent performance. GPT-OSS 20B (I don't know what the current SOTA small MoE model is) or similar might work.
>>
comfy breas
>>
>>108489611
>updooted comfy
it's a trap do not listen to this xir
>>
>>108490248
works on my machine



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.