[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion and Development of Local Image and Video Models

Previous: >>108538676

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: o_00923_.png (595 KB, 1280x768)
595 KB
595 KB PNG
>>
File: Gemma 4 31b it.jpg (1.06 MB, 3105x1600)
1.06 MB
1.06 MB JPG
Google saved local!
>>
>local gets saas scraps
>OMG SO BASED
it's just a smaller version of the API model. just use the API
>>
>mfw Resource news

04/06/2026

>UNICA: A Unified Neural Framework for Controllable 3D Avatars
https://github.com/zjh21/UNICA

>WSVD: Weighted Low-Rank Approximation for Fast and Efficient Execution of Low-Precision Vision-Language Models
https://github.com/SAI-Lab-NYU/WSVD

>When Negation Is a Geometry Problem in Vision-Language Models
https://github.com/fawazsammani/negation-steering

>Take-Two laid off the head its AI division and an undisclosed number of staff
https://www.engadget.com/gaming/take-two-laid-off-the-head-its-ai-division-and-an-undisclosed-number-of-staff-182824338.html

>SilkStack Image Browser: Browser with ComfyUI metadata support
https://github.com/skkut/SilkStack-Image-Browser

04/05/2026

>ComfyUI-ZImage-Triton: Triton-accelerated W8A8 quantization
https://github.com/newgrit1004/ComfyUI-ZImage-Triton

>ComfyUI Assets Manager v2.4.4 update
https://github.com/MajoorWaldi/ComfyUI-Majoor-AssetsManager/releases/tag/v2.4.4

>From RTX to Spark: NVIDIA Accelerates Gemma 4 for Local Agentic AI
https://blogs.nvidia.com/blog/rtx-ai-garage-open-models-google-gemma-4

>FLUX.2-klein-9B — PolarQuant Q5: 9B rectified flow transformer
https://huggingface.co/caiovicentino1/FLUX.2-klein-9B-PolarQuant-Q5

>Qwen3.5-9B-Neo-PolarQuant-Q5: 9B on any GPU with PolarQuant
https://huggingface.co/caiovicentino1/Qwen3.5-9B-Neo-PolarQuant-Q5

04/04/2026

>STAGE: Storyboard-Anchored Generation for Cinematic Multi-shot Narrative
https://github.com/escapistmost/Storyboard-Anchored-Generation

>Regularizing Attention with Bootstrapping
https://github.com/ncchung/AttentionRegularization

>LTX2.3-Multifunctional: Functionality optimization based on LTX desktop version
https://github.com/hero8152/LTX2.3-Multifunctional

>Gemma 4 31B IT NVFP4 model is quantized with NVIDIA Model Optimizer
https://huggingface.co/nvidia/Gemma-4-31B-IT-NVFP4

>AP Netflix VOID – ComfyUI Custom Nodes
https://github.com/adampolczynski/AP_Netflix_VOID
>>
>>108543580
You dropped >>108543595 btw
>>
>>108543601
Someone used Gemma 4 26B A4B to do real time translations of a VN, fucking impressive
https://streamable.com/ug9ddy
>>
>mfw Research news

04/06/2026

>Not All Frames Deserve Full Computation: Accelerating Autoregressive Video Generation via Selective Computation and Predictive Extrapolation
https://arxiv.org/abs/2604.02979

>VOSR: A Vision-Only Generative Model for Image Super-Resolution
https://arxiv.org/abs/2604.03225

>LumaFlux: Lifting 8-Bit Worlds to HDR Reality with Physically-Guided Diffusion Transformers
https://arxiv.org/abs/2604.02787

>Evaluating AI-Generated Images of Cultural Artifacts with Community-Informed Rubrics
https://arxiv.org/abs/2604.02406

>MMPhysVideo: Scaling Physical Plausibility in Video Generation via Joint Multimodal Modeling
https://shubolin028.github.io/MMPhysVideo-Page

>Salt: Self-Consistent Distribution Matching with Cache-Aware Training for Fast Video Generation
https://arxiv.org/abs/2604.03118

>Learning from Synthetic Data via Provenance-Based Input Gradient Guidance
https://arxiv.org/abs/2604.02946

>Gram-MMD: Texture-Aware Metric for Image Realism Assessment
https://arxiv.org/abs/2604.03064

>Can Nano Banana 2 Replace Traditional Image Restoration Models? An Evaluation of Its Performance on Image Restoration Tasks
https://arxiv.org/abs/2604.03061

>VERTIGO: Visual Preference Optimization for Cinematic Camera Trajectory Generation
https://arxiv.org/abs/2604.02467

>CAMEO: A Conditional and Quality-Aware Multi-Agent Image Editing Orchestrator
https://arxiv.org/abs/2604.03156

>Can VLMs Truly Forget? Benchmarking Training-Free Visual Concept Unlearning
https://arxiv.org/abs/2604.03114

>QAPruner: Quantization-Aware Vision Token Pruning for Multimodal Large Language Models
https://arxiv.org/abs/2604.02816

>LumiVideo: An Intelligent Agentic System for Video Color Grading
https://arxiv.org/abs/2604.02409

>VLMs Need Words: Vision Language Models Ignore Visual Detail In Favor of Semantic Anchors
https://arxiv.org/abs/2604.02486

>Forget Many, Forget Right: Scalable and Precise Concept Unlearning in Diffusion Models
https://arxiv.org/abs/2601.06162
>>
>>108543607
the api is cucked slop
>just train a Gemini Gem so it isn't complete trash
why do that when you can train it properly with open weights?
>REEEE LORAS ARE COPE!!!!
>>
would using an uncensored text encoder help with prompting nsfw wan gens?
>>
File: GEMINI BTFO.png (294 KB, 2611x1570)
294 KB
294 KB PNG
>>108543607
>it's just a smaller version of the API model.
it beat gemini on one benchmark kek
https://foodtruckbench.com/leaderboard
>>
File: 1773073149303325.png (35 KB, 324x78)
35 KB
35 KB PNG
>>108543580
>>108543601
I don't come here often, is this an inside joke I'm missing?
>>
>>108543675
nobody here knows japanese, that's all
>>
File: 051137.png (19 KB, 115x115)
19 KB
19 KB PNG
>sir how do you track your model's training progress?
<I test how well its generating 1girls standing from gacha games
I wish this was a bad joke
>>
>>108543666
satan bringing the receipts
>>
File: _AnimaPreview2_00334_.jpg (403 KB, 1160x1696)
403 KB
403 KB JPG
>>
>>108543666
>5k waste
>>
can i be runnings the gemma 31b on my 3050ti laptop?
>>
File: o_00930_.png (1.29 MB, 1280x768)
1.29 MB
1.29 MB PNG
>>
File: _AnimaPreview2_00341_.jpg (317 KB, 1160x1696)
317 KB
317 KB JPG
preview 3 when
>>
>>108543917
for a model specialized on anime it's already more realistic than Qwen Image keek
>>
>>108543917
My penis is ready
>>
Anon share jenny LoRA. I need to coom
>>
>>108543917
let him cook!
>>
File: _AnimaPreview2_00349_.jpg (392 KB, 1160x1696)
392 KB
392 KB JPG
>>
it was fun genning with you all but I won't entertain the thread schizo further by posting in his demented drama thread. bye
>>
>>108543592
Model?
>>
File: _AnimaPreview2_00356_.jpg (413 KB, 1160x1696)
413 KB
413 KB JPG
>>
File: _AnimaPreview2_00361_.jpg (509 KB, 1248x1608)
509 KB
509 KB JPG
>>
Blessed thread of frenship
>>
File: 00006-1681208019.png (1.95 MB, 1024x1280)
1.95 MB
1.95 MB PNG
>>
File: ComfyUI_00383_.png (3.95 MB, 1280x1920)
3.95 MB
3.95 MB PNG
>>
finally more content and less talk
>>
File: HDh-fmta4AEMq84.jfif.jpg (1.17 MB, 3058x4096)
1.17 MB
1.17 MB JPG
The asian community are big fans of Grok and Nano Banana
>>
>>108544509
>local lost
we know, maybe we could get a miracle and get google to release a local image model, they saved the llm fags with gemma 4...
>>
>>108544492
>>108544525
Continue crying
>>
>>108544525
what happened to based china? i heard all last year how they would save local and surpass sora 2 while releasing everything locally as part of a 4d chess plan to destabilize the west????
>>
>>108544509
yeah indians fucking love grok and banana.
>>
>>108544547
>sora 2
bruh
>>
Test
>>
>>108544509
imagine her braps
>>
>>108543574
> I can't wait for the day when we'll have VNs that will be automatically translated by LLMs, at this point they are good enough to replace those fucking translatorTroons
Why wait, you vibecode shit, capturing your screen, sending to a LLM and overlaying with translation.
>>
what's an id (ic?) lora
>>
File: ComfyUI_00116_.png (1.22 MB, 1280x960)
1.22 MB
1.22 MB PNG
Always remember to crank it before bedtime anon.
>>
File: file.png (1.62 MB, 2048x1792)
1.62 MB
1.62 MB PNG
TURD RUSSELL POST THE REALISM LORA!
>>
>>108545055
Godspeed anon and have fun on your break
>>
>>108544888
https://ali-vilab.github.io/In-Context-LoRA-Page/
>>
i await preview3
>>
>>108545150
I hope the FUD gets changed up a bit when it drops
>>
It's been hyped too much. Expectations are too high. (not mine, yours)
>>
its not even releasing any time soon
>>
>>108545175
but enough about openanima
>>
>>108545186
you can't rush perfection. i'm sure OpenAnima is still in the planning stage and they're working hard on establishing proper dataset collection tools so they don't run into the problem that tdrussel did with an outdated dataset
>>
File: ComfyUI_04649_.png (3.39 MB, 2048x1792)
3.39 MB
3.39 MB PNG
Test.
>>
>>108545197
but enough about openanima
>>
File: asuka.png (1.2 MB, 832x1216)
1.2 MB
1.2 MB PNG
I haven't tried to generate a video in a year.
Has video generation gotta way, way better over the past year?
Can we gen videos faster, in higher quality, with less VRAM, than we could last year?
>>
Damn gemma is pretty useful, it's nice being able to upload a gen and ask it for suggestions on how to get the model to behave how you want.
>>
>>108545255
Faster with more VRAM and lower quality.
>>
>>108543580
Whats the best model for decensoring/reducing moasic on j
JAVs?
>>
>>108544817
You need to be a bit smarter than that. Have it create character profiles, keep track of writing styles, etc.
>>
File: ComfyUI_02999_.mp4 (1.2 MB, 840x1232)
1.2 MB
1.2 MB MP4
>>108545255
About this time last year we were getting wan 2.2 which was a bit better quality but slower due to model swapping if you're a VRAMlet. Then we got LTX 2.0 which can do lip sync, has frame rate control, and can be quite fast but the quality of almost all movement is dogshit. Then we got a new version of LTX (2.3) which has somewhat better quality but it's still really bad overall. It's been pretty stagnant. Especially compared to what we've seen with API models in the last few months. Alibaba has said they're going to do more open source shit but no one is optimistic that they're gonna open source any more of their video models.
>>
>>108545407
Well, I love the way my image looks when animated, so I think it's time to check out img2vid again.
What model did you use for that great animation?
>>
>>108545407
> Alibaba has said they're going to do more open source shit but no one is optimistic that they're gonna open source any more of their video models.
compute is the limit anyway
>>
File: ComfyUI_02998_.mp4 (1.18 MB, 840x1232)
1.18 MB
1.18 MB MP4
>>108545432
It's wan 2.2 with the testbounce lora.
>>
>>108545407
Alibaba will only open source their LLMs. They recently asked what they should open source next, and the list was only LLMs. Nobody releases image/video models anymore, instead they just release textslop. Local diffusion is quite literally dead and stagnant
>>
>>108545504
there have been like 10 different models released in the last 2 weeks.
all the big companies just watched bytedance get bent over a barrel by hollywood, they are all going to move local. just look at the fucking state of wan 2.7, they didn't even train it. those weights aren't staying closed.
>>
File: 00126-95173719.png (1.08 MB, 832x1216)
1.08 MB
1.08 MB PNG
>>108545503
I might be a retard here, but
What site do you use to find loras? I can't find anything named testbounce in my usual spots
>>
>>108545583
It's this one https://civitai.com/models/1944129/slop-bounce-wan-22-i2v

The filename is bounce_test for whatever reason.
>>
can you guys tone it down
>>
>>
File: Whywhy.jpg (46 KB, 597x131)
46 KB
46 KB JPG
Fuck, this message is from December 2024. Why didn't he update it at least once? It wouldn't have cost him anything.
>>
>>108545640
I stopped caring about Noob a long time ago.
>>
>>108545564
>there have been 10 civitai pony shitmixes released in the last week!
>wan will release 2.5... i mean 2.6... well they will release 2.7 if we say it's bad!!
holy fucking localcope
>>
>>108545640
NoobAI is the Claude Opus 3 of anime models.
>>
>>108545652
Good for you, Noob ruined me for everyone else, I don't know how LAX did it but in aesthetics he won.
>>
>>108545668
idk why you have to come here to have melties over local models, just enjoy whats left of saas while you still can. kling is kind of fun. a bit mid if we are being honest, but it's better than nu-grok.
>>
>>108545712
I was a day0 user.
>but in aesthetics he won.
I agree but it's time to move on.
>>
File: 1769356921345905.png (3.09 MB, 1664x1216)
3.09 MB
3.09 MB PNG
eh
>>
>>108545640
Use Anima as the edit model for Noob it meets all requirements.
Can't wait for preview 3 tomorrow.
>>
File: 1749916924309847.jpg (131 KB, 483x448)
131 KB
131 KB JPG
>>108545583
dude, you were literally living in a cave to not know about civitai
>>
>have a list of frame indexes
>select frames from that list given an image batch
do nodes like this exist?
>>
>>108546050
likely not, make by yourself it's very easy
>>
File: 1770236195521730.png (3.69 MB, 1152x1728)
3.69 MB
3.69 MB PNG
>>
File: 1758565878388185.png (2.69 MB, 1920x1080)
2.69 MB
2.69 MB PNG
hello retards
>>
File: 1753068430768409.png (2.96 MB, 1920x1080)
2.96 MB
2.96 MB PNG
>>
File: 1753110701412960.jpg (364 KB, 1920x1080)
364 KB
364 KB JPG
Work Harder-1
https://youtu.be/Zgm9gJPAebk
https://suno.com/s/2rKeV3YWwy7rRHgN
>>
File: 1763506626809360.png (2.85 MB, 1920x1080)
2.85 MB
2.85 MB PNG
>>
>>108546333
>>108546338
https://github.com/ggml-org/llama.cpp/pull/21543
>AUTOMATIC1111
takes me back...
>>
File: 1744788154618674.png (657 KB, 1878x1214)
657 KB
657 KB PNG
https://xcancel.com/Designarena/status/2041275779204735047#m
>Sora 2 is 11th
this is probably the most worthless benchmark I have ever seen
>>
>>108546779
https://youtu.be/RERsGjQrQ6E?t=612
Wan 2.2 didn't have that plastic skin, I guess the Alibaba niggers couldn't help themselves but to use synthetic data shit to train their subsequent versions
>>
File: 1745796527771312.png (2.64 MB, 1920x1080)
2.64 MB
2.64 MB PNG
>>
>>108541775
>>108542153
>probably another nothingburger like "Klein KF cache" or some shit
And... it was a nothigburger, now they distill the fucking VAE lmao
https://github.com/Comfy-Org/ComfyUI/pull/13314
>>
File: 1767606645880108.png (2.55 MB, 1920x1080)
2.55 MB
2.55 MB PNG
>>
File: 1769595107989982.png (919 KB, 782x1032)
919 KB
919 KB PNG
Ace step XL still sounds like ass desu
https://github.com/Comfy-Org/ComfyUI/pull/13317
https://huggingface.co/ACE-Step/acestep-v15-xl-sft
https://ace-step.github.io/ace-step-v1.5.github.io/

https://files.catbox.moe/mgvdgt.mp3
>>
>https://huggingface.co/collections/ACE-Step/ace-step-15-xl
It's up.
>>
File: 1747859329496416.png (2.7 MB, 1920x1080)
2.7 MB
2.7 MB PNG
everyone lies!!!
>>
File: 1745437635859282.png (2.47 MB, 1920x1080)
2.47 MB
2.47 MB PNG
>>
File: 1757577071451003.png (18 KB, 680x129)
18 KB
18 KB PNG
>>108546865
:skull:
>>
>>108546876
>>108546880
if you want to spam your slop there's /sdg/ for that, go back
>>
File: 1747971551177570.png (2.14 MB, 1636x1507)
2.14 MB
2.14 MB PNG
>>108546890
Look at them playing it safe. They’re losing the long game by cucking their own models. The second one company decides to ignore the copyright shit and go all-in, every other model will become a footnote in history.

API models have no issue training their models with copyrighted IP, why the fuck is local more cucked than them??
>>
File: Gemma 4 31b.png (760 KB, 1869x1392)
760 KB
760 KB PNG
>the LLM fags got gemma 4, a model small enough to be run locally while being good enough to be competitive with the best API models
>While we're still in the Stone Age, still finetuning SDXL in the year of our lord 2026
What went so wrong...
>>
>still finetuning SDXL
jeet pandering
>>
>>108546907
>The second one company decides to ignore the copyright shit and go all-in
Mongoloid take. Imagine thinking that a company would chose this as a business strategy.
>>
>>108546954
>Imagine thinking that a company would chose this as a business strategy.
retarded take, look at OpenAI when they release a new image/video model, at the begining it's always allowing copyright shit, because they know it's the biggest thing to do to attract people, no one want to play with a boring model that doesn't know shit about the pop culture surrounding us
>>
>>108546927
local LLMs are like, 75% of the way towards API-level and they surpass the previous SOTA within 6 months
local image models are maybe 30% of the way towards GPT-2/Nano Banana level, and at least 2 years behind. dall-e 3 still has more character/style knowledge than any local base model.
but we can't even complain about shit datasets anymore because local doesn't get any new image models anyway
>>
File: 1749073638231186.png (1.25 MB, 1024x1024)
1.25 MB
1.25 MB PNG
>>108546927
qwen3vl 8b is better for captioning (from my tests, even better than qwen3.5 9b and of course gemma4 e4b)
Didn't test the 31b
>>
Any of you actually make money genning sloppa?
>>
>>108547183
do you get paid for asking stupid questions?
>>
>>108547185
I wish
>>
>>108545407
>>108545503
How much VRAM do you need to gen these?
Can a vramlet like me with only 12gb do it?
I see a bunch of different Wan 2.2 options - which one did you use? Image2video 14B? Or TextImage2video 5B? FastWan?
>>
Man, /g/ really is fucking dying.
>>
>>108547243
The whole site is
>>
>>108547243
>/g/ really is fucking dying
/lmg/ is having the most activity since a long time ago, turns out that releasing good local models helps the ecosystem thrive or something, waiting for you Alibaba, time for your next move
>>
>>108547286
zit release was the best time in recent memory. threads were actually hitting the image limit. too bad 'base' turned out to be lame shit and turbo is incredibly boring
>>
Anima preview 3 when
>>
NovelAI v5 when?
>>
>>108547387
What do you expect if not the same flop?
Today I requested the day off from work so when Anima 3 comes out I can flood the thread with Mugen gens ;3
>>
>>108547415
Uhh based department?
>>
>>108547415
Ultra based, and me wih NovelAI gens
>>
>>108547415
I will colaborate with some Nano Bananana ones!
>>
File: 1658707268383823.webm (2.9 MB, 636x636)
2.9 MB
2.9 MB WEBM
>haven't updated comfy in a couple months
>pull
>it's slower than ever
>ui somehow looks worse
>"Some nodes are missing required inputs" spam on large workflows with multiple image inputs (ie combined i2i, upscale, inpaint workflows with bypass switches for each
>some gay dynamic vram bullshit which is slowing down first gens on models that don't fucking need "dynamic" vram
>uses a version of transformers which breaks a bunch of nodes
The enshitification will continue until there's nothing left to shitify
>>
>>108547441
fuck off ani
>>
Anina will be the greatest flop ever made.
It will be the last anime local model.
It will break the moral of anime genner.
It will be the model that makes people realize that every time local models give you 2 new things, they take away 3 other things in exchange.
>>
>>108547183
yes
>>
>>108547446
I was just thinking to myself, "I sure am glad that all those namefagging schizos are gone, and now we can have threads in peace"
But there are some schizos that will never be gone: the schizos who think every anon they don't like is one of the namefagging schizos, so they're constantly jumping at shadows and yelling at the ghost of people who don't even post here anymore
>>
>>108547471
And?
>>
>>108547455
wait, the last anime local model?? just a month ago you said it was going to be a saas model
>>
>>108547484
Anima will break the morale and hearts of local posters. Tdrusell will learn the hard way the harsh reality of open weight models, becoming the dead apostle of the local scene.
>>
told you long ago turdrussell is a disgrace
>>
Comfy somehow had to convince his childhood friend tdrusell that local models are doomed.
What better way than to gift him a model so he could finetune it and experience the disillusionment firsthand?
>>
>not even released yet
>anon already seething
>>
>>108546850
Klein KV cache sped up editing by a lot though.

>>108547286
Most of it is Ani cycling through his troll repertoire though, which adds negative value.

>>108547471
Just how long have you been gone?
>>
>>108547183
i easily could but i don't want to turn my personal hobby into a side hustle because then it would no longer be enjoyable.
>>
>>108547441
>uses a version of transformers which breaks a bunch of nodes
this one was super annoying but I figured out how to band aid it.

python_embeded\python.exe -m pip install transformers==4.57.6

this version of transformers doesn't break any nodes
>>
>>108547680
It's mostly for the neets and third-worlders.
>>
Haha, look at all these lmg sloppers. They are happy because a SaaS company gave them a bunch of 2022 GPT 2 tier quality model. It was not Drummer, DrShogun, Noromaid, Euryale, or any local LLM fine tuner.
It was Google who made the most important change in their hobby.
>>
>>108547106
Did you test 8b vs 35b moe?
>>
>>108547721
35b moe is indeed better than the 9b(which is basically unusable since it doesnt even follow the prompt), but I want my captioning to be blazing fast. I would say it's more or less on par with the 8b.
>>
>>108547729
I am guessing 35b should have better world knowledge when it comes to recognizing lesser known people/characters and less hallucinations trying to do that, even if their overall visual reasoning abilities are similar.
Again didn't test it myself though.
>>
>>108547741
I never tested for knowledge really, just captioning and the ability to describe characters, clothing, composition, lighting, etc...
>>
>>108547746
Alright thanks for the input nonetheless.
>>
>>108547286
Everyone moved to >>>/vg/aicg/
>>
>>108547793
I identify as a male and I'm not an attention whore, so no thanks
>>
>>108546850
Who tf actually cares about VAE speed. Even tiled on 128 is like 5 seconds max on big upscales.
>>
Insider news: Anima preview 3 will be released in /adt/. Tdrusell is tired of being treated badly here and I partly understand it desu.
>>
>>108547888
>5 seconds
gotta go faster
>>
>>108547898
makes sense
>>
fud status?
>>
Even in /ldg/ its F.U.D!
>>
>>108547856
>I identify as a male
you can't identify as a male, you are biologically observed as a male
>>
>>108547959
meant to say as my birth assigned gender, you're right sorry.
>>
>>108547964
you weren't assigned a gender, it was ascertained from your anatomy
and it wasn't even at birth, they can tell the sex by the third trimester
>>
>>108547998
You're absolutely right! My embryo-developed sex is what I meant!
>>
>>108548004
a troon and reddit tier sarcasm, find a better duo
>>
>>108548004
"you can't have a sex as an embryo, you aren't human you are a clump of cell", that's the argument feminists use to kill babies when they're still as the form of an embryo, can't be a murder if you're not human after all right? kek
>>
yep ... it's about that time of the day
>>
Julien Lubimiv the broke and raped transexual retard is crying again
>>
>>108548124
>Biologically, even a month old fetus is a human.
so it's a murder, that's what I said, desu the reason why conservatives don't push on that subject too much is because they know the people who do abortions the most are black people, so to them it's a good way to prevent the increase of % of black people in murica lol
>>
File: friendly reminder.png (3.02 MB, 3554x2000)
3.02 MB
3.02 MB PNG
>>108548242
>we don't base our laws on pure biology alone
we should, objective reality always supersedes feelings
>>
bot raid
>>
How can I outpaint with qwen image edit and forge? I was just making the image bigger and then masking and inpainting but I can't manage to generate something that's doesn't look badly stitched.
>Your ip range has been banned due to abuse...
This fucking page is so fucking retarded
>>
sir please let redeem preview3 to save thread thank u gladly
>>
>>108548284
sirs before redeemage of anima3 you must be of fudding while promotions of your friends model mugal
>>
>>108548299
sorry sir no firend with dalit mugal
>>
>>108548253
You're terrified of objective reality, which is why you think it's murder and that murder means anything inherently.
>>
File: 00000-3558044508.jpg (1.43 MB, 2048x2048)
1.43 MB
1.43 MB JPG
>>
File: 1760028137035273.png (640 KB, 1000x1000)
640 KB
640 KB PNG
>>108548321
murder is killing humans, and a fetus is a biological human, I don't need to do mental gymnastics to justify murder anon
>>
>>108548402
No no, humans are thinking things. You can't murder a rock. It's not murder until the age of, say 2?
>>
local diffusion?
>>
>>108548440
>It's not murder until the age of, say 2?
try killing a 1 year old baby and see if you won't go to jail lol
>>
>>108548457
Lmao, so know society suddenly determines "biological" "objective" reality? Enjoy your trannies i guess
>>
File: 00001-2515562191.jpg (1.06 MB, 2048x2048)
1.06 MB
1.06 MB JPG
>>
>>108548491
Your stance? Doesn't have the right? Bro, what the fuck are you talking about? We're talking about reality, what's intrinsically, objectively true. Laws of the universe. Why the fuck would your "stance" matter lmao
>>
>>108548508
>Your stance?
yes, my stance, my stance which is that we should follow objective reality, it's a stance because not everyone agrees with it, you disagree with it for example
>>
>>108548520
You just wrote a diatribe about how much you didn't want to follow objective reality, garbage about rights and positions and ideas lmao
Are you really that delusional my man? Do you believe that you're feelings decide "objective reality"?
>>
>>108548536
>You just wrote a diatribe about how much you didn't want to follow objective reality
?
>>108548253
>objective reality always supersedes feelings
??
>>
>>108548545
all of this
>>108548491

Why are you pretending the things you want believe and think matter?
>>
File: 00032-1517354133.jpg (392 KB, 1536x2592)
392 KB
392 KB JPG
>>
>>108548574
disgusting slop (and shitty upscale to boot), learn to prompt and use your tools, faggot
>>
File: 00024-944316091.jpg (469 KB, 1536x2592)
469 KB
469 KB JPG
>>
File: 8C4Whddg.jpg (80 KB, 728x504)
80 KB
80 KB JPG
what are y'all baking, anima finetunes?
>>
>>108543580
How come character loras for anima are basically non-existent (at least on Civitai)? I've seen people claim that training for it is a pain in the ass or harder or something but it..... isn't. Pic rel is a style Lora I trained easily in like 3 hours. One of the biggest complaints I see with anima Is its perceived lack of stylistic versatility and knowledge. (I'm pretty sure I recall someone posting screenshots from a Discord showing trainers complaining about having difficulties with it). Why not just train a Lora of whatever specific style you want to generate with it?
>>
File: o_00938_.png (329 KB, 968x1080)
329 KB
329 KB PNG
>>
>>108548635
probably waiting on the full release
>>
File: 00038-1417801373.jpg (384 KB, 1536x2592)
384 KB
384 KB JPG
>>
>>108548635
because every anima lora nukes the model's knowledge
what do you think your generic 1girl standing even proves?
>>
>>108548635
Do any GUIs support it yet? I tried this one but I couldn't even start the training: https://github.com/gazingstars123/Anima-Standalone-Trainer I'm not python literate enough to troubleshoot this shit, I need something that "just works." OneTrainer support WHEN?
>>
>>108548609
Klein and Chroma loras
>>
>>108548681
>because every anima lora nukes the model's knowledge
Such as?
>>
>>108548722
ignore him, tdrussy has debunked this.
>>
>>108548697
I got it working the other night and trained two anima loras back to back with it. What errors were you getting? I might be able to help you troubleshoot it. The culprit when mine failed to start was torchvision not being installed (that was missing from the requirements.txt so I guess the crater assumed whoever would be using it what have all dependencies pre-installed)
>>
>>108548732
he'll be posting that realism lora any time soon, right?
>>
>>108548165
Based
>>
>>108548739
it'll be released around the same time as openanima, keep an eye out!
>>
>>108548739
I can attempt one of those myself if you want. I'm assuming you would just need a bunch of real life photos of people and landscapes that are high quality, right?
>>
>>108548786
i don't need loras from a rando
i just want to know what parameters tdrusell used to avoid the problems every anima lora trainer has reported
>>
>>108548732
sure we should all deny our experiences with this model because the pedo baker turdy posted one comparison image with a lora that doesn't exist
get better shills
>>
File: AniPedo.jpg (1.73 MB, 979x2558)
1.73 MB
1.73 MB JPG
>>108548805
ani is the pedo though
>>
File: 9911.jpg (1.89 MB, 1256x1672)
1.89 MB
1.89 MB JPG
I kill the pedos
>>
>>108548813
obsessed retard
>>
>>108548826
why are you a pedo ani though?
>>
>>108548826
>Signature use
>>
>>108548823
zased, generate more uoh ToT killers
>>
>everybody who disagrees with me is poopdickschizo
>>
>>108548838
not ani, there are people who can think for themselves and call out this thread's idols on their bullshit
>>
>WanVideoWrapper
>33 conflicts
>>
File: bigfan.jpg (88 KB, 480x624)
88 KB
88 KB JPG
>>
File: Really makes you think.jpg (69 KB, 2763x399)
69 KB
69 KB JPG
>>108548868
>not ani
>>
>>108548801
What specific issues do they report?
>>
>>108548881
>>108548893
>every post I disagree with is this ugly boogeyman I made up
convenient is it
>>
If I were to generate say a 100 images by using prompts for each different image, what's the cleanest way of doing it?

I'm currently using CR Prompt List node but differentiating prompt is messy as fuck seeing how I'd need to add like 50-60 tags on each line.
>>
>>108548902
yeah dude keep promoting your shitty mugen abortion BRUH lmao.
whats next in today's FUD campaign btw?
>>
>>108548902
tell that to ani who believes everyone who disagrees with him is ran kek
>>
It's not going to release today.
>>
>>108548894
the model's character, style and concept knowledge gets degraded when you train loras on it, especially style loras
>>
>>108548927
bro try ANY illushit lora right now and it'll be similar/worse
stop spreading this retarded FUD
>>
>>108548904
so you have 100 lines of prompt?
>>
>>108548881
lmao
>>
samefagging accusations in 3...2...
>>
>>108548935
that was never a problem with any of the illustrious loras i trained
i don't give a shit if you think i'm poopdickschizo or whatever, i'm just talking about my own experiences
>>
File: 1766222866032715.png (253 KB, 301x371)
253 KB
253 KB PNG
>>108548927
That's....how adapters work anon. You gain one thing but something else suffers. This applies to LLM adapters merges too (And usually to a much worse degree). Are you using the STYLE loras at full strength and then acting surprised when the Lora meant to generate a specific STYLE. Cannot generate other STYLES well?
>>
>>108548881
>>108548941
i'm another anon and think this is hilarious!
nobody even mentioned ani before the faggot but yeah there totally is some "ani fan" here
>>
>>108548962
ughhh bro muh catastrophic loss
>>
>>108548964
>im another anon
LMAO!
>>
>>108548918
the very comprehensive statistics of 2 whole past data points say its a few hours away
>>
>>108548980
very excited for new regressions
>>
>>108548940
I'll build up on it yes. I want it to then be easily tweakable also.
>>
File: 00045-1392570032.jpg (463 KB, 2592x1536)
463 KB
463 KB JPG
>>
>>108548993
wdym?
what's your input texts?
>>
>>108548980
Big Russ would've corrected it if it wasn't true... He was here posting ITT!
>>
>>108549022
wowww my idol posting here with us merely mortals? my panties are so wet!!!
>>
File: 00043-1482177456.jpg (424 KB, 2592x1536)
424 KB
424 KB JPG
>>
>5e-4
>"It forgets everything!"
kek
>>
>>108549022
>>108549040
-> >>108548826
>>
>>108549062
Real, Anima has exposed how many retards there are in this community. KEEP USING MUGEN SAAAARS
>>
File: 00047-1993847902.jpg (480 KB, 1536x2688)
480 KB
480 KB JPG
>>
>>108549101
bro whats the purpose of these random insta thots? get better material, boooooo
>>
>>108549105
shut up ani
>>
>>108549084
are those mugen users in the room with us right now?
>>
>>108549118
users? unlikely, faggots part of the so called 'cabal' 100% without doubt.
faggot.
>>
>>108549125
is the cabal in the room with us right now?
>>
File: 1752932617526531.png (3 KB, 134x50)
3 KB
3 KB PNG
>>108549133
yes
>>
>>108549139
scary
>>
No unironically it's not dropping today you can leave
>>
>>108549022
Maybe. Or he just let it slide like most shitposts.
>>
>>108549010
I'm going individual prompts, just a ton of them in one go.

Right now I load it from a .txt, but the line separation doesn't seem to work as it does with the nodes on the left into the text list node.
>>
>>108549152
Big Russ, is we be getting v3 today
>>
File: 1769547901145119.jpg (29 KB, 400x400)
29 KB
29 KB JPG
>ltx2.3 22B t2v
>literally bad or missing medieval content, even with pro version
>can't even make elves and shit
>22B...
>>
>>108549157
try the string list to string from the impact pack
>>
>>108549206
Trained on bollywood. Try hindu mythos.
>>
Not muh heckin medieval!!!
>>
>>108549236
Oh come on.. The issue was that comfyui needs to be rebooted in order for the changes in the .txt file to be updated.. The R refresh doesn't work for it..
>>
File: 00053-1196419650.jpg (375 KB, 1536x2688)
375 KB
375 KB JPG
>>
>>108546769
BASED
>>
>>108549287
you can't even articulate. how do you expect for help?
>>
Is there another way to make goofs that isn't the lcpp memefork? I want to try the new spark but there's no goofs yet.
https://huggingface.co/SG161222/SPARK.Chroma_v1/tree/main
>>
File: o_00943_.png (1.26 MB, 1536x512)
1.26 MB
1.26 MB PNG
>>
>>10854633
Holy shit AUTO111 returned!!
Haoming02 and Panchovix KILL YOUR FUCKING SELF THE KING HAS COME TO RECLAIM HIS THRONE! YOU ARE NOTHING BUT TRASH!
>>
File: deSA_zi_00044_.png (2.15 MB, 1792x977)
2.15 MB
2.15 MB PNG
>>
>>108549459
Fuck off and go back to your containment general
>>
File: deBU_zi_00025_.png (2.04 MB, 1536x922)
2.04 MB
2.04 MB PNG
>>108549470
>>
File: o_00944_.png (1.22 MB, 1536x512)
1.22 MB
1.22 MB PNG
>>
File: ComfyUI_00634_.png (863 KB, 832x1216)
863 KB
863 KB PNG
>>108548927
>>108548935
>>108548958
>>108548962
>>108548970
Looks like Anima works with two loras (Style + Character)

https://files.catbox.moe/2w5jfb.png
https://civitai.com/posts/27803776
>>
Anima can’t be released today. Tdrusell still needs to fix bugs, create an official Anima lora, and share the settings and tutorials. Many experienced fine tuners and lora makers are hesitant to use it again, even if versions 3, 4, 5, or a final version are released, due to ongoing bugs.
>>
>>108549555
>diffusion model
>"bugs"
define the bugs
>>
>>108549542
omg that goes against the narrative please remove your post immediately!!!!!
>>
>>108549569
he meant fuds
>>
>>108549542
Your examples are incorrect and misleading.
1)Does your character actually exist, or is it an OC?
2)If it does exist:
1-it has a very basic design;
2- it’s a portrait photo, which is one of the easiest types to create.
3)if you had been paying attention and genuinely cared about Anima, you would have read the forum most of the examples there are full body shots.
4) You are sharing your account information as a form of backup, but given my previous points, your account, your help, and your Catbox links all seem completely misinformative and misleading.
>>
>>108549542
>AI_Art_Factory
oh, so that's who it was. lol.
>>
This meltie is getting stale. We should pivot back to UIs. Oh, make sure to randomly mention sd.cpp
>>
>>108544509
Turn her into a loli
>>
File: ComfyUI_00636_.png (736 KB, 832x1216)
736 KB
736 KB PNG
>>108549589
>1)Does your character actually exist, or is it an OC?
If you would have checked the link, I posted you would've known that was a Zelda character.

What do you mean misinformation? What do you think I'm trying to disprove? All I'm showing is that training style and character loras that work is indeed possible.

>3)if you had been paying attention and genuinely cared about Anima, you would have read the forum most of the examples there are full body shots.

Like this?

https://files.catbox.moe/18xjn4.png
>>
File: o_00946_.png (1.82 MB, 1920x1080)
1.82 MB
1.82 MB PNG
>>
Hey guys, tdrussell here. When I said preview 3 was releasing tuesday, I meant the tuesday 2 weeks from now. Sorry for the confusion.
>>
>>108549708
>>108549542
Your posts aren’t helpful. Again, check the Hugging Face discussion, characters discussed there have many visual details.
Your characters are conveniently simple, presumably to be used as examples.
Once again, you’re not helping and are only harming your reputation further.
>>
>>108549708
He's trolling you
>>
>>108549763
As stated earlier if the model by itself sucks at doing a particular complex, niche character, just use a well trained Lora. Standalone models, the murder how well trained they are, How well tagged the data set is, etc, is going to accurately generate each and every single niche character from some obscure anime No one even bothers to seed on torrents anymore. This applies to styles too. I at least kind of understand people wanting the thing to know characters but I don't get people acting surprised it does not know niche artists either. I'll take a look at the discussion page, but I recommend you just do what I did and be the change you want to see
>>
File: img_00003_.jpg (749 KB, 1520x1824)
749 KB
749 KB JPG
>>
>>108549555
If they are so experienced why are they using high SDXL-era LRs with it?
>>
>>108549708
>>108549542
People there https://huggingface.co/circlestone-labs/Anima/discussions/112 are discussing this maturely, explaining their training process and showing before and after results.
You come here saying, “lol it works for me” which adds nothing and only fuels fanboyism.
Your input is biased and useless.
You’re not contributing to the discussion, you’re not posting in the official diffusion thread, and if you actually know how to make a lora , you’re still not explaining your method or process.
You’re not helping and you’re poisoning the discussion and damaging your own credibility.
If you want to help, go to the official page and post your method.
>>
>>108549708
>>108549797
hes trolling you anon
>>
File: o_00949_.png (1.32 MB, 968x1080)
1.32 MB
1.32 MB PNG
>>
>>108549625
he seems really passionate right now probably due to the rumors of v3 he feels he needs to up the fuding
>>
>>108549843

>Point is, I'm not in anyway knowledgable or experienced about training and finetuning

>The training program is https://github.com/67372a/LoRA_Easy_Training_Scripts , which adopted Anima training code from duongve implementation on sd script, which was in turn originated from Bluvoll's code.

kek
>>
all this FUD and yet anima is still on my pc lol
>>
Mugen status?
>>
>>108549863
I earnestly don't understand why they don't use the trainer created by the one who authored the model...
>>
>>108549797
>>108549763
>>108549708
>>108549589
>>108549581
>>108549571
>>108549542
Mour comparisons

https://files.catbox.moe/wb26mw.png
https://civitai.com/images/126722206


>You’re not helping
I'm not trying to "help" what you THINK I'm trying to help. Anima was brought up in discussions. So I'm contributing.

>You come here saying, “lol it works for me”

yes, people tend to say that when things work well for them. I already told you I would read the discussion. Quit having mental breakdowns when people do not think or respond the exact ways you want or expect them to. You're making a niche hobby way more serious than it needs to be
>>
>methani is still shitting herself unironically
>>
>>108549863
>https://github.com/67372a/LoRA_Easy_Training_Script
Can this be used fresh out of the box to train Anima loras or will it require modifications like this one

>>108548697
>>
>>108549842
are the people training anima with >1e-4 LR in the room with us right now?
>>
File: img_00013_.jpg (627 KB, 1520x1824)
627 KB
627 KB JPG
>>108549863
>https://github.com/67372a/LoRA_Easy_Training_Scripts
works great with Anima, dont see whats the problem
>>
>>108549930
why would you use something that very clearly doesn't work instead of what the model creator himself is using to train the model?
>>
File: ComfyUI_00637_.png (684 KB, 832x1216)
684 KB
684 KB PNG
>>108549926
>methani
>>108549950
Can you link posts to them stating what they used so i can take a look at it? I'd prefer to use whatever they used if it's easier
>>
>>108549950
nobody uses diffusion pipe
>>
>>108549937
Just look here >>108549843
>>
File: deMS_zi_00019_.png (3.44 MB, 1663x1164)
3.44 MB
3.44 MB PNG
>>108549853
>>
>>108549892
Anyone diffusing anime still uses SDXL. Don’t believe me? Check any anime thread on 4chan. Anima is a dead model anons here didn’t adopt it in time and it has various problems. But what would you know? You’re a /ldg/ slopper.
>>
File: animalora.png (70 KB, 873x385)
70 KB
70 KB PNG
>>108549961
>>
File: img_00019_.jpg (635 KB, 1520x1728)
635 KB
635 KB JPG
>>
Fresh when ready

>>108550008
>>108550008
>>108550008
>>108550008
>>
>>108549867
Yeah, alongside Chroma, Qwen Image, Edit and Flux Kontext, you don't gen, you’re an /ldg/ slopper who collects models on your hard drive and never uses them again, we know the type of person you are.
>>
>>108549995
>Anyone diffusing anime still uses SDXL. Don’t believe me? Check any anime thread on 4chan.
the only one that still uses SDXL is >>>/g/adt kek
>>
Why aren't loras loaded when I use the <> prompt for it in the text or a .txt file?
>>
Help a brother out
where to get all the uncensored text encoders, gguf preferably
>>
fudder in complete tilt lmao
>>
File: ComfyUI_00640_.png (1.05 MB, 832x1216)
1.05 MB
1.05 MB PNG
>>108549998
Clarification. I was asking what trainer they used, not what data he used to train THEIR lora or what settings. I thought you were saying I should've used whatever trainer they used instead of what I used;

https://github.com/AiArtFactory/Anima-Standalone-Trainer

(speaking of which I should probably update the requirements file to have torchvision as a dep so nothing breaks the next time I need to use it.
>>
>>108550042
what frontend are you using?
>>
>>108550085
its mentioned in the last part, diffusion-pipe which the model creator created
>>
File: microbikinisupremacy.jpg (305 KB, 1160x1608)
305 KB
305 KB JPG
I believe in microbikini supremacy.
>>
>>108549998
>>108550096

is that from this page?

https://huggingface.co/circlestone-labs/Anima/discussions

Which discussion?
>>
>>108550147
https://huggingface.co/circlestone-labs/Anima/discussions/112#69d337b5bb1ba652fb6522e6
>>
File: owl.png (1.24 MB, 968x1080)
1.24 MB
1.24 MB PNG
>>108540110

Beautiful specimen.
>>
>>108550094
Comfy.
I just found out it can't load loras through prompts like forge can. Trying out some custom nodes.
>>
>>108550141
So you say, but you still had to make the 1girl interesting otherwise.
>>
Reminder fresh

>>108550008
>>108550008
>>108550008
>>108550008



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.