[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


AIDS and Vaseline Edition

Discussion and Development of Local Image and Video Models

Previous: >>108575392

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>china sold out and local is now left with zero developers
is it over? we'll never get a seedance model locally
>>
File: _AnimaPreview3_00044_.jpg (350 KB, 1608x1160)
350 KB
350 KB JPG
>>
no one remembers stabilityai
>>
File: Flux2-Klein_00669_.png (3.18 MB, 1920x1072)
3.18 MB
3.18 MB PNG
can we tone the seethe down a bit
>>
>>
>>108585044
that was diferent
>>
>>108585044
>no one remembers stabilityai
and Mochi (still waiting for MochiHD kek)
>>
>mfw Resource news

04/11/2026

>ComfyUI-RookieUI: The ultimate A1111-style sidebar
https://github.com/rookiestar28/ComfyUI-RookieUI

>Qwen3.5-4B-Base-ZitGen-V1: Image captioning fine-tune of Qwen 3.5 4B optimized for Z-Image Turbo
https://huggingface.co/lolzinventor/Qwen3.5-4B-Base-ZitGen-V1

>ComfyUI Memory Visualization
https://github.com/kijai/ComfyUI-MemoryVisualization

04/10/2026

>JoyAI-Image-Edit now supports ComfyUI
https://github.com/jd-opensource/JoyAI-Image#-news

>Two Front Doors: Civitai.com, Civitai.red, and What's Next
https://civitai.com/articles/28369/two-front-doors-civitaicom-civitaired-and-whats-next

>Uni-ViGU: Towards Unified Video Generation and Understanding via A Diffusion-Based Video Generator
https://fr0zencrane.github.io/uni-vigu-page

>PrivFedTalk: Privacy-Aware Federated Diffusion with Identity-Stable Adapters for Personalized Talking-Head Generation
https://github.com/mazumdarsoumya/PrivFedTalk

>AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation
http://aka.ms/avgenbench

>Cross-Modal Emotion Transfer for Emotion Editing in Talking Face Video
https://chanhyeok-choi.github.io/C-MET

>ChenkinNoob-XL-V0.5
https://modelscope.ai/models/ChenkinNoob/ChenkinNoob-XL-V0.5

>Control Order & Free Memory: Controls the order of node execution with device-agnostic memory management
https://github.com/mkim87404/ComfyUI-ControlOrder-FreeMemory

>DMax: Aggressive Parallel Decoding for dLLMs
https://github.com/czg1225/DMax

04/09/2026

>MAR-GRPO: Stabilized GRPO for AR-diffusion Hybrid Image Generation
https://github.com/AMAP-ML/mar-grpo

>HybridScorer: Score, sort, and cut large sets down fast with GPU-accelerated AI review
https://github.com/vangel76/HybridScorer

04/08/2026

>OrthoFuse: Training-free Riemannian Fusion of Orthogonal Style-Concept Adapters
https://github.com/ControlGenAI/OrthoFuse

>MIRAGE: Benchmarking and Aligning Multi-Instance Image Editing
https://github.com/ZiqianLiu666/MIRAGE
>>
>>108585044
well yeah, because all their employees left to form BFL and continued releasing the same censored API-first goyslop. all they did was change names
>>
>mfw Research news

04/11/2026

>M2StyleGS: Multi-Modality 3D Style Transfer with Gaussian Splatting
https://arxiv.org/abs/2604.03773

>SafeCtrl: Region-Aware Safety Control for Text-to-Image Diffusion via Detect-Then-Suppress
https://arxiv.org/abs/2604.03941

>SymphoMotion: Joint Control of Camera Motion and Object Dynamics for Coherent Video Generation
https://grenoble-zhang.github.io/SymphoMotion

>NavCrafter: Exploring 3D Scenes from a Single Image
https://arxiv.org/abs/2604.02828

>Reinforcement-Guided Synthetic Data Generation for Privacy-Sensitive Identity Recognition
https://arxiv.org/abs/2604.07884

>Collaborative Multi-Mode Pruning for Vision-Language Models
https://arxiv.org/abs/2604.02956

>GENFIG1: Visual Summaries of Scholarly Work as a Challenge for Vision-Language Models
https://arxiv.org/abs/2604.04172

>Stochastic Generative Plug-and-Play Priors
https://arxiv.org/abs/2604.03603

>Symbiotic-MoE: Unlocking the Synergy between Generation and Understanding
https://arxiv.org/abs/2604.07753

>Differentiable Stroke Planning with Dual Parameterization for Efficient and High-Fidelity Painting Creation
https://arxiv.org/abs/2604.02752

>Graphic-Design-Bench: A Comprehensive Benchmark for Evaluating AI on Graphic Design Tasks
https://arxiv.org/abs/2604.04192

>Token-Efficient Multimodal Reasoning via Image Prompt Packaging
https://arxiv.org/abs/2604.02492

>DINO-QPM: Adapting Visual Foundation Models for Globally Interpretable Image Classification
https://arxiv.org/abs/2604.07166

>Do Audio-Visual Large Language Models Really See and Hear?
https://arxiv.org/abs/2604.02605

>AutoSOTA: An End-to-End Automated Research System for State-of-the-Art AI Model Discovery
https://arxiv.org/abs/2604.05550

>Neural Network Pruning via QUBO Optimization
https://arxiv.org/abs/2604.05856

>Beyond Fixed Inference: Quantitative Flow Matching for Adaptive Image Denoising
https://arxiv.org/abs/2604.02392
>>
File: pixel-0008-1554111285.png (1.17 MB, 1344x1728)
1.17 MB
1.17 MB PNG
who remembers disco diffusion
>>
>>108585098
nice try kek
>>
>>108585112
are people unhappy with BFL?
>>
File: _AnimaPreview3_00052_.jpg (318 KB, 1608x1160)
318 KB
318 KB JPG
>>
yucky saas swamp
>>
File: _AnimaPreview3_00075_.jpg (313 KB, 1608x1160)
313 KB
313 KB JPG
>>
>>
>>
File: _AnimaPreview3_00082_.jpg (307 KB, 1248x1608)
307 KB
307 KB JPG
>>
>>
File: _AnimaPreview3_00086_.jpg (244 KB, 1248x1608)
244 KB
244 KB JPG
>>
File: _AnimaPreview3_00091_.jpg (419 KB, 1248x1608)
419 KB
419 KB JPG
>>
>>
>>108585079
>>108585221
>>108585236
>>108585258
chroma sucks
>>
>>108585296
bruh
>>
kino hour
>>
File: 555.jpg (1.77 MB, 1152x2048)
1.77 MB
1.77 MB JPG
>>
>>
File: _AnimaPreview3_00104_.jpg (276 KB, 1248x1608)
276 KB
276 KB JPG
>>108585317
bruhs @ u

>>108585378
the bold and the beautiful
>>
>>108585305
How can you tell it's Chroma? Maybe it's ZiT
>>
File: ComfyUI_temp_pivpu_00033_.png (2.85 MB, 1200x2032)
2.85 MB
2.85 MB PNG
>>
File: _AnimaPreview3_00119_.jpg (364 KB, 1696x1160)
364 KB
364 KB JPG
>>
File: images.jpg (11 KB, 198x255)
11 KB
11 KB JPG
Which anime model is most used on 4chan threads right now Anima, SDXL or NAI?
Vote!
https://strawpoll.com/B2ZB9rDajgJ
>>
File: ComfyUI_temp_pivpu_00034_.png (2.98 MB, 1200x2032)
2.98 MB
2.98 MB PNG
>>
File: ComfyUI_temp_iuvrh_00001_.png (2.14 MB, 1248x1728)
2.14 MB
2.14 MB PNG
>>
face is fixed on anima. fucking finally. that's the boring part of illus-pony
>>
good job face
>>
>>108585500
>>108585512
>/^ComfyUI_temp_/i;type:filename;
you'll thank me later
>>
>strawpoll
kek
>>
>>108585114
Gib us pixel merge
>>
File: 666.jpg (1.91 MB, 1152x2048)
1.91 MB
1.91 MB JPG
z-image making big feet again
>>
File: _AnimaPreview3_00121_.png (1.88 MB, 1696x1160)
1.88 MB
1.88 MB PNG
>>
File: pixel-0009-2084854573.png (1.12 MB, 1344x1728)
1.12 MB
1.12 MB PNG
>>
cozy breas
>>
File: 00013-3076487574.jpg (1.7 MB, 2016x2592)
1.7 MB
1.7 MB JPG
>>
File: ComfyUI_temp_lheqp_00001_.png (3.62 MB, 1552x2160)
3.62 MB
3.62 MB PNG
>>
Lodestone ZIT projects and their status?
>>
>>108585744
Diaper: filled
>>
File: ComfyUI_temp_uoeie_00002_.png (2.03 MB, 1920x1152)
2.03 MB
2.03 MB PNG
>>
File: ComfyUI_temp_lheqp_00006_.png (2.37 MB, 1920x1152)
2.37 MB
2.37 MB PNG
>>
>>108585744
Anteater cocks: gleaming
>>
>>108585485
>illust retards not knowing it's based on SDXL
Still?
>>
File: ComfyUI_temp_lheqp_00008_.png (1.55 MB, 1920x1152)
1.55 MB
1.55 MB PNG
>>
File: 00018-1158535645.jpg (1.52 MB, 2016x2592)
1.52 MB
1.52 MB JPG
>>
File: 1762160217564214.png (3.12 MB, 1168x1704)
3.12 MB
3.12 MB PNG
>>
>>108585815
I look like this
>>
File: 00020-4042355821.jpg (2.22 MB, 2016x2592)
2.22 MB
2.22 MB JPG
>>
>>108585891
Come on, ugly male feet, ugly male toes, ugly male ankles, ugly male thighs, ugly male buttocks.
>but it looks like it was drawn by hand!
Fuck off. Artists are learning too, and many of the artists on Danbooru are amateurs and have slop eyes.
>>
File: 1744668127760498.jpg (978 KB, 3024x2268)
978 KB
978 KB JPG
can someone fix this
>>
File: 00022-2281910583.jpg (2.22 MB, 2016x2592)
2.22 MB
2.22 MB JPG
>>
>>108585891
If I covered her from the waist up, would I still be able to tell it was Frieren just from the waist down? What would her legs and feet look like, considering she is a small elf, and what about her toes and ankles?

None of this would happen if you used NoobAI based models, but it is Saturday, it is your free day, it is your "casual animu genning day" you turned on your PC and chose the lowest effort model of all and posted this troon crossdressing as Frieren in the least anime general of all:
ANIMA and /LDG/
>>
>>108585891
Nice feetos
>>
File: deJA_zi_00039_.png (2.57 MB, 1792x977)
2.57 MB
2.57 MB PNG
>>108586032
thanks i puked
>>
File: 1771032806653727.webm (2.89 MB, 1920x1080)
2.89 MB
2.89 MB WEBM
what upscaler do you guys use?
seedvr2 changes the face too much and looks like slop.
using z image as a 2 pass completely messes up the skin and adds a weird white haze like someone smeared cum all over the pic.
im really at a loss for good upscalers
>>
File: 00024-2218325491.jpg (1.46 MB, 2016x2592)
1.46 MB
1.46 MB JPG
>>
File: deJA_zi_00046_.png (2.56 MB, 1792x977)
2.56 MB
2.56 MB PNG
>>108586182
this upscaler is basically instant
https://github.com/Comfy-Org/Nvidia_RTX_Nodes_ComfyUI
>>
>
>>
>>108586234
You can’t post in two threads at the same time. You have to choose, /ldg/ or /sdg/. We demand exclusivity to our schizos.
>>
File: Z-image_00132.png (1.15 MB, 827x1131)
1.15 MB
1.15 MB PNG
what is there left to look forward to now that china has officially abandoned us?
>>
>>108585415
i can
i'm a resident schizo and so he is
>>
>>108586348
the inevitable concentration camps
>>
File: 1749595308754266.png (2.99 MB, 1248x1920)
2.99 MB
2.99 MB PNG
>>
https://civitai.com/models/2536147?modelVersionId=2850290

Style lora example for Anima, full captioned dataset and all config files are shared. The model trains extremely well I don't know why some people say otherwise.
>>
is there any LTX 2.3 workflow that doesn't have a hundred random custom nodes? Why do these faggots feel the need to install every random piece of shit node set rather than making things work with the most popular nodes?
>>
>>108586449
>Rutkowski
based bigruss
>>
File: 00026-1292305904.jpg (1.74 MB, 2016x2592)
1.74 MB
1.74 MB JPG
>>
>>108586348
wait for the next company to do the same thing.
>here are a bunch of great open source models
they build up a userbase and then try to monetize a new model and another company comes in and fills the void.
or some rich neckbeard like notch or kim dotcom throw a bunch of money into a new model just because they can.
>>
>>108586449
>I don't know why some people say otherwise.
They were using sub-optimal configs and blaming it on the model.
>>
>>108586348
we just accept that we're no longer a part of the cutting edge of tech, we're retro tinkertroons now who enjoy fiddling with outdated hardware. like the people who try to push the limits of the nintendo 64. we will be seeing if we can push out models to get 1/10 as good as seedance 2.0, or if loras can get local models to properly fill a wine glass to the brim.
>>
https://github.com/ClownsharkBatwing/RES4LYF

snake oil?
>>
https://huggingface.co/Lightricks/LTX-2.3/blob/main/ltx-2.3-spatial-upscaler-x2-1.1.safetensors

use the updated upscaler with 2.3, helps a lot it seems.

https://files.catbox.moe/gpmk06.mp4
>>
>>108586485
https://huggingface.co/RuneXX/LTX-2-Workflows/tree/main

I use these, they work well with ltx 2.3 distilled
>>
>>108586601
All the good stuff from that schizo paradise was brought to mainline Comfy DESU
>>
>>108586573
I just want a good API that isn't completely gimped after the first week.
Seedance2 looked good when it was first showcased, what we have now is a joke.
Bad physics, plastic skin, inconsistent generations, stiff animations.
I guess we wait for Happyhorse, but it will probably get hit with a cease and desist on day one.
Fucking bleak.
>>
>>108586449
Fuck you!
Why don't you post in anime generals!?!?!?!
POST IN ANIME GENERALS YOU FUCKING FAGGOT !
WHY DO YOU IGNORE US!?!?
I HATE YOU!
>>
rocketbrown is melting down
>>
>>108586032
This reminded me of when I went to the ENT doctor and he had a colossal scar on his throat, it shocked me because I thought the scar was from him slicing his throat but it was from a thyroid surgery lmao
>>
Anything except actually building skills.
>>
>>108586112
the star wars we need. This is beautiful.
>>
>>108586741
how do you know that?
>>
>>108586621
thanks, amigo
>>
>>108586731
kek
>>
>>108586449
Based, retards on suicide watch
>>
>>108586449
I HAVE TO POST MY ANIME NEWS TO THIS 3DPG SLOP GENERAL!!!
MUH CATJACK MUST READ MY ANIME NEWS OR I WILL LOSE MY MIND!!!
MUH CATJACK!!! MUH MEAT!!!
THEY ARE VERY VERY IMPORTANT!!! NOT THE 200 ANIME POSTERS OF ALL 4CHAN, NO NO NO, THEY ARE TRASH, WORTHLESS, BENEATH CONTEMPT!!! ONLY MUH CATJACK AND THE ZIT AND THE CHROMA SLOPPERS MATTER TO ME!!!
/LDG/ MUST STOP EVERYTHING AND READ MY ANIME NEWS RIGHT NOW THIS INSTANT!!
>>
>>108586449
Onegai, realism lora kudasai!
>>
>>108586449
blessed thread of frenship
>>
I hope the chroma and zit sloppers here enjoy the great anime news of this faggot
>>
File: image-4.jpg (240 KB, 688x1504)
240 KB
240 KB JPG
What's the easiest local hardware I can use to make slop like this where I'm just going to take pictures and say "Give her a silver dress" or "Give her blue eye shadow" like you can do with cloud tools like Gemini and Grok
>>
>>108586841
>>108585019
>>Klein
>https://huggingface.co/collections/black-forest-labs/flux2
>>
>>108586841
16GB VRAM gpu
>>
>>108586851
I've got a 5080 and this is the most exciting use case of it now that I've beaten RE9 on Nightmare
>>108586844
thank you will give it a shot
>>
>>108586807
>pls saar give csam lora i must be generating the cunny, kindly do the needful
>>
>>108586860
>5080
Qwen Edit is larger than Klein but would still fit on your card. Try that out instead desu.
>>
>>108586869
Goodness, my cup runneth over
Alright I'll try it
>>
>>108586880
>>108586869
Is there a way to get Qwen edit running locally on Linux instead of the hugging face version?
>>
>>108585744
is he still claiming to have done the first vaeless model
that doesnt seem accurate if hes never got any to converge
>>
>>108586449
>Posting on civitai.

For shame.
>>
>>108586902
nvm figured it out, didn't realize it worked in comfy
>>
>>108586902
Three versions of Qwen are listed here https://comfyanonymous.github.io/ComfyUI_examples/qwen_image/ godspeed my celeb gooner
>>
>>108586449
I spent 8 hours today of my saturday using your model and sharing artist tag and comparisons on /h/, /e/ and /adt/ with other anons who use Anima. Watching you ignore us makes me want to never use your model again.
>>
>>108586921
praying to elohim that it isn't censored crap
>>
>>108586449
with these settings how long would it take on a regular card
>>
File: 1758948263662488.png (1.46 MB, 1024x1024)
1.46 MB
1.46 MB PNG
>>
>>108586449
>I don't know why some people say otherwise.
If you spend 5 minutes on any place where people discuss lora training you discover why, most people train to overfit because they caption poorly and prompt poorly, so for the loras to work for them they have to imprint the DNA of the image in the model
>>
>>108586449
>Caption with Gemma 4 31b. If you have less VRAM, JoyCaption or one of the medium sized Qwens would work almost as well.
joycap and medium sized qwens suck ass tho bruh fr
>>
>>108586783
Those "retards" will never read his message, because those retards are the ones using his model and posting in dedicated anime generals, not here.
>>
Complete meltdown
>>
>>108586943
a lot of people use shit tier datasets too, but they don't know they are shit so they just keep on doing the same thing over and over.
garbage in, garbage out.
>>
Can the LARPers stop samefagging and replying to Tdrssell's post to simulate interest? Here we all use Zimage and Chroma. We never cared about anime models, stop pretending engagement.
>>
File: projecting.png (471 KB, 1056x456)
471 KB
471 KB PNG
>>108586862
>>
>>108585744
Most of the discord seems to have moved on from this thread. I had to stop chroma gooning but if I remember correctly all three are still in training epochs. Kaleidoscope is fairing better than Zeta, to the surprise of nobody.
>>
King Russ
>>
>>108587004
Cut the act. An anime model dev shows up and suddenly you care. You never did before, I know exactly what you're doing.
>>
>>108586998
Other way around, my bad.
>>
Russ God
>>
how do I use z image turbo/base with tensorrt?
>>
>>
>>108587056
You never cared about anime models, LARPER!
>>
what's the best method of training LTX-2.3? ai-toolkit is useless
>>
https://higgsfield.ai/original-series/zephyr/episode-1
>Traditional directors flimmaxxxing using Seedance 2.0 on Higgsfield. Watch “Zephyr” FULL Ep.1 – this is what happens when filmmakers face ZERO gatekeeping. With Unlimited Seedance 2.0 now LIVE everywhere for anyone with up to 70% OFF* - YOU can build your next viral AI movie. 2 minute intro got MILLIONS in a day. Now see how full Zephyr takes over your feed.
>Dir. by ILYA KARCHIN & the team.
>Zephyr (2026)
>>
>>108586609
see, quality is better vs the 1.0 one (both use the new one):

https://files.catbox.moe/5pid7f.mp4
>>
File: big eyed freak boy.png (1.63 MB, 1024x1024)
1.63 MB
1.63 MB PNG
>>108586931
>>
File: ComfyUI_temp_jutls_00002_.png (2.6 MB, 1152x2016)
2.6 MB
2.6 MB PNG
>>
>>108587167
kind of underwhelming
>>
File: ComfyUI_09207_.png (1.86 MB, 1024x1024)
1.86 MB
1.86 MB PNG
guess ill try out the greg lora...
>>
File: ComfyUI_temp_jutls_00003_.png (2.39 MB, 1152x2016)
2.39 MB
2.39 MB PNG
>>
so are we stuck at wan 2.2 forever?
>>
File: ComfyUI_temp_jutls_00005_.png (2.5 MB, 1152x2016)
2.5 MB
2.5 MB PNG
>>
>>108587286
welcome to local
>>
>>108587237
>>108587278
>>108587318
Too old
>>
>>108587286
people are going to be using wan for the next 2-3 years regardless of what comes along.
>>
File: promptmaxing.png (118 KB, 393x1235)
118 KB
118 KB PNG
who else is promptmaxxxing?
>>
>wan 2.2 i2v -> lose detail over frames
>flux klein -> cannot move camera too much
ways to combine best of both worlds?
>>
>>108587318
>>108587278
Very, very impressively realistic
>>
>>
File: ComfyUI_temp_jutls_00015_.png (2.87 MB, 1248x1824)
2.87 MB
2.87 MB PNG
>>
>>108585019
need download for the knight bitch on the horse
>>
File: ComfyUI_temp_jutls_00022_.png (2.25 MB, 1248x1728)
2.25 MB
2.25 MB PNG
>>
>>108585160
>>108585250
>>108585250
>>108585268
>>108585394
>>108585480
I like the 90s anime filter, but you gotta train your model to stop making Frieren look so unhinged/retarded
>>
>>108586348
Is that Nautilus?
>>
>>108586348
pic would be cool without FNAFfag
>>
>>108587140
Be quiet Bluvoll.
>>
Can anyone make some recommendations for why I'm not getting the results I'm expecting to get? I know that if I were doing this with something that had natural language processing it wouldn't change anything about the image other than her outfit
Is Qwen 2512 not a good fir for what I'm trying to do?
>>
File: ComfyUI_temp_jutls_00024_.png (3.23 MB, 1248x1728)
3.23 MB
3.23 MB PNG
>>108587646
its better to inpaint if you wanna add specific stuff
>>
>>108587661
That makes sense, I'm just spoiled by Gemini. I'll poke around and see if I can figure out how to add an inpainting mask with Qwen
>>
>>108587646
Could also use qwen image edit instead of the regular qwen image.
>>
>>108587646
>>108587728 this kek
>>
>>108587728
Sorry I'm super new to this- Where do I find that under templates? Is it just called Qwen Image Edit or is it this one?
>>
If you're doing SFW stuff just stick to API. It's not worth using Qwen Edit for basic shit that API models can do 100x faster and better. You can get Qwen Edit here, but the model itself is outdated https://huggingface.co/Qwen/Qwen-Image-Edit-2511
Flux Klein 9b is the best edit model available locally, and it can gen/edit without needing separate models. Qwen was working on a model that could gen/edit in one, but they decided to abandon local for API like everyone else.
>>
Haven't tried upscaling since the pre-AI days (waifu2x back in 2017 I think). Is it still a meme or is it good now?
>>
>>108587773
Sorry, API?
I was recommended Klen earlier, Qwen doesn't seem nearly as easy as the online stuff so maybe I'll try that
>>
>>108587450
Let's see Paul Allen's kissy face girl.
>>
>>108587775
I've never upscaled anything that didn't feel like a sidegrade. You always lose something you liked about the original gen, unless you keep the denoise so low that you wonder "what am I even wasting my GPU cycles on? It looks the same."
>>
>>108587784
API = 'the online stuff'. If your task is just putting a fur scarf on a girl, then there's no reason not to just use google AI or whatever. The benefit of local is nsfw stuff and niche use-cases that API cannot achieve (loras which are trained on specific concepts or styles).
>>
lol
>>
>>108587801
The main thing is I just don't want to get constrained by stuff when I am doing something NSFW or if it's just some random guideline it doesn't agree with. I also despise paying for cloud software in general
I feel like what I'm asking for isn't necessarily outlandish, I'm sure it would be slower on local hardware even with a 5080, basically just something that can interpret prompts and then apply them to images.
I mean look at this, why would this be moderated? It's something I'm sure local hardware is capable of, I'm trying to find the best tool for the job
>>
>>108587834
Another example, this stuff seems like it's what people have been working on for years, I'm surprised there isn't a consensus best tool for something like this. Granted there are a million different directions that people are working on
>>
>>108587834
Use Flux Klein 9b, it should be more than capable of doing this
>>
>>108587749
>>108586921
>>
>>108587856
Okay will be trying klein in the morning, thanks Anon
>>108587860
Sadly after playing for a couple hours I think Qwen might not be what I'm looking for, what I'm looking for is a really simple tool that's kinda hard to fuck up
>>
>>108587855
There isn't a single best tool because all companies are competing. Here is a list of pretty much all the relevant edit models:
https://artificialanalysis.ai/image/leaderboard/editing
>>
>>108587873
This is also very handy, thank you
>>
>sneederboard
>>
File: 1765336873951395.png (3.88 MB, 1328x1640)
3.88 MB
3.88 MB PNG
my wife seira
>>
File: 1772319093603289.jpg (832 KB, 2048x1328)
832 KB
832 KB JPG
>>
File: Why.png (8 KB, 390x223)
8 KB
8 KB PNG
Sorry if this is a dumb question, I'm new to this. Why does this keep popping up? I already downloaded and selected a VAE and put it in the VAE folder. (vaelsem). What else do I need to do?
>>
>>108586449
but these are only using natural language, how should it look if you want natural language and booru tags?
>>
>>108588022
What UI/Model?
>>
File: What.png (32 KB, 1674x205)
32 KB
32 KB PNG
>>108588068
I downloaded Stability Matrix and just downloaded the first package available there (WebUI Forge NEO). And these are the other stuff
>>
we wuz snape (ltx 2.3)

https://litter.catbox.moe/l359g17k1ba76zhz.mp4
>>
>>108588117
more action:

https://litter.catbox.moe/n1p1j6e4ns8bxn83.mp4
>>
File: comfy__520.jpg (990 KB, 1098x1605)
990 KB
990 KB JPG
>>108588117
>>108588132
instant classics
>>
File: 1761981410156904.png (131 KB, 1851x749)
131 KB
131 KB PNG
https://github.com/Comfy-Org/ComfyUI/pull/13369
What is this model? Are we saved?
>>
File: 1770027281971472.png (350 KB, 1121x879)
350 KB
350 KB PNG
>>108588222
looks like it's a 8b model and uses ministral 3d as a text encoder, but so far it doesn't seem there's anything else about that model on the internet
>>
my body is ready for nu chink slop
>>
>>108587130
catbox? looks crisp
>>
File: 1750705985290236.png (3.18 MB, 1880x1072)
3.18 MB
3.18 MB PNG
>>
File: 85673838227.jpg (2.32 MB, 1664x2432)
2.32 MB
2.32 MB JPG
>>
https://youtu.be/1_5sSJK2rU0?t=1761
>A 665k subscribers channel is talking about Anima
lmaoo wtf??
>>
>>108588320
Stop wasting your time watching "news youtubers"
>>
>>108588320
he talks about even the smallest most random niche ai that only get github announcement pages too
>>
>>108588320
"saar, what i scam with this week, saar? thanks saar, have my bell ring thank you saar"
>>
>>108588222
it's from baidu right? it might be good
>>
>>108588320
omg some scam artist trash tuber talks about tranima?!?!? turd russy bros we won so fucking hard!!!!
>>
>>108588341
>>108588331
the seethe is delicious
>>
>>108588339
>baidu
what should i know them from other than china
>>
File: 1774752637044401.png (348 KB, 1473x1688)
348 KB
348 KB PNG
>>108588349
baidu's site is one of the most visited in the world
>>
>>108587400
nevermind issue solved
>>
>>108588358
but what should anon know them from
>>
>>108588330
that's why I'm subscribed to that channel, he always finds something cool and niche at some point, he's not just like those sloptubers who only present things everyone knows about
>>
File: 1756511195721900.jpg (822 KB, 2048x1128)
822 KB
822 KB JPG
so many new kinos to watch, so little time
>>
>>108586449
maybe bluvoll and tranifag will shut the fuck up now with their fudding
>>
File: 1753423010278987.jpg (653 KB, 1328x1640)
653 KB
653 KB JPG
>>
File: 1751022907777741.jpg (558 KB, 1328x1744)
558 KB
558 KB JPG
>>
>>108588473
the faggots new hobby is trying to start a flame war between ldg and hgg
gotta do something to satisfy the mental illness i guess
>>
>>108588473
You don't train loras for Anima. Bluvoll already made 2 finetunes this week. Stop pretending to be interested in anime.

>>108588320
It's the only anime model successor of Illustrious, and you don't even post in anime generals. You're trash. I know it's you because you're the only one who cares about your model in this 3DPG general.
>>
File: 56275426272472.jpg (2.05 MB, 1664x2432)
2.05 MB
2.05 MB JPG
>>
>>108588320
our boy made it.
fucking BASED to see a homegrown hero landing such a massive W
>>
>>108588581
Yes, remember tdrusell's first posts? He didn't even know how to make a lora and asked us for help. How he has grown!
>>
>>108588624
you always start knowing nothing yes, and the sky is blue
>>
>>108588551
boss please!
>>
anons, im using sd.next. which model/checkpoint comes closest to commercial cloud imagegens. i have 128GB of vram. also are there models which can run at float16 instead of bf16 ? v100 is slow on bf16
>>
>>108588733
ask ai how to convert locally bf16 to fp16
>closest to commercial cloud imagegens
for what? ZIT for realism, noobai/illustrious for tranime, klein for editing
>>
>>108586449
>Greg Rutkowski
I remember.
>>
>>108588761
thanks - will check it out
>>
>>108588076
I never heard of that vae, but you are using anima so check the anima huggingface page. in that VAE/ Text Encoder you will need to select qwen_image_vae and qwen_3_06b_base
>>
>>108588076
>cattower
>>
>>108588733
Change sd.next settings to FP16, read the wiki (Compute setting) https://github.com/vladmandic/sdnext/wiki/Performance-Tuning
>>
>>108588843
basado
>>
File: 8653786373.jpg (1.94 MB, 2432x1664)
1.94 MB
1.94 MB JPG
>>
File: 554557516881795.png (460 KB, 832x1216)
460 KB
460 KB PNG
>>108586449
It do work.
>>
What LM to use for enhancing nsfw image prompt? Some says Qwen3 is better than Qwen3.5?
>>
>>108588922
take an english 101 class
>>
>>108588922
go for gemma 4, it's the new hot thing in town, way better than qwen imo
>>
>>108588922
Gemma 4, yeah. Mistral models are one of the least censored out of the box. Qwen has higher chances of refusal.
>>
>>108588922
gemma 4 + that system prompt >>108588368
>>
>>108588922
Test Heretic models. Gemma base won't spit any nsfw.
>>
>>108588974
it can with that jailbreak prompt >>108588960
>>
File: images.jpg (28 KB, 492x406)
28 KB
28 KB JPG
>>108588985
>>
Never did vidgen, is there a tutorial for vid for not poorfags like in the op?
>>
>>108588222
>embedding_key='mistral3_24b'
jesus fucking christ? a fucking 24b text encoder??
>>
has anon used gemma 4 in your workflow? how?
>>
File: 777.jpg (2.27 MB, 1328x1944)
2.27 MB
2.27 MB JPG
>>108589090
sir pls understand
>>
File: r7qbi2yz8qqg1.png (1.8 MB, 1152x1536)
1.8 MB
1.8 MB PNG
saw this guy on reddit using generated pics for a fake onlyfans. any idea what model he could be using?
>>
>>108589159
Why are you generating literal children?
>>
>>108587141
how's ai toolkit useless?
>>
>>108589179
its teebs
>>
>>108589184
nta, but its retarded hidden console is awful. What the fuck were they thinking?
>>
File: 1758174372714758.png (69 KB, 1991x348)
69 KB
69 KB PNG
>>108588222
https://github.com/huggingface/diffusers/pull/13432
based, there will be a base model and its turbo variant
>>
File: 1685272533435.png (27 KB, 487x104)
27 KB
27 KB PNG
>>108589177
Lustify maybe.
>picrel
HATE
>>
>>108589268
https://github.com/HsiaWinter/diffusers/blob/3aec976fc30347e4ea70e5f97c1bb4123cc218fd/docs/source/en/api/pipelines/ernie_image.md
>ERNIE-Image is designed with a relatively compact architecture and solid instruction-following capability, emphasizing parameter efficiency. Based on an 8B DiT backbone, it provides performance that is comparable in some scenarios to larger (20B+) models, while maintaining reasonable parameter efficiency.
big if true
>>
Where BERT image though?
>>
Big Bird Image or bust
>>
>>108589184
it doesn't have any effect if you train it with a starting frame
>>
>>108589284
>not an edit model
come on dude, unified edit/image models are the future
>>
>>108589307
>Big Bird Image
When bird game 3 image??
https://www.tiktok.com/@ancient_meme_archive/video/7557971057102114079
>>
>>108589318
I trained ltx23 lora in AITmostly with videos. It worked decently but I think the images were not close enough in style and slowed the learning and made the model stiffer. I think I'll try videos entirely next time.
>>
>>108588222
So Comfy implemented that baidu model one but not this one?
https://huggingface.co/jdopensource/JoyAI-Image-Edit
why?
>>
>>108589385
What do you mean by "close enough in style"? You just take frames from the video and use that as your input images
>>
File: 1751917912441562.mp4 (1.94 MB, 1280x720)
1.94 MB
1.94 MB MP4
how did wan 2.7 fuck up this much?

https://xcancel.com/ChrisGwinnLA/status/2039960196458680366
https://www.youtube.com/watch?v=RERsGjQrQ6E

wan 2.5/6 was marginal improvement if even that, and now this is just trash.
>>
>>108589402
I'm not talking about the starting frame. It was a character lora. I had two datasets, one video dataset and one image dataset. The image dataset had images from photoshoots etc. which didn't match the style of the videos.
>>
>>108589423
anon, Alibaba actually has a good video model, it's called HappyHorse
https://xcancel.com/AlibabaGroup/status/2042530517799887326#m
https://xcancel.com/lovart_ai/status/2043282414605332813#m
>>
>>108587834
nigga look for site that has flux klein, qwen image 2.0 or wan2.7. Budget pixel is my favorite it because of it variety of models and mentions the various levels on strictness a of a model. I find local image generation at the moment to very stale and boring at the moment.
https://budgetpixel.com/
>>
>>108589423
I've used wan2.7 image and video generation. its absolute censored dogshit that even makes wan2.5 look a lot better. The shit model has a filter that re-writes your prompt to be sfw pg13. Basically making de-clothing and nudity prompts difficult to get right. Many people are disappointed with wan2.7 and it's basically DOA saas model.
>>
>Image
Lightyears behind SAAS
>Video
Saas is literally in another universe
>3D
Local has completely given up.

Grim
>>
>>108589506
>Image
>Lightyears behind SAAS
maybe baidu will save us >>108588222
>>
>>108588222
>Flux2 VAE
now we're talking
>>
>>108589523
let's hope the licence is good too, but the most important part is that the images are good and not slopped
>>
>>108589523
Like Mugen
>>
File: local definitely lost.gif (15 KB, 250x200)
15 KB
15 KB GIF
https://xcancel.com/obscaries/status/2043304041053397437
>>
File: woof.png (223 KB, 400x400)
223 KB
223 KB PNG
>>108589523
>now we're talking
*yawn*, pixel space or gtfo
>>
>>108589561
can seedance 2 do anything other than capeshit fighting slop?
>>
>>108589585
obviously
https://xcancel.com/AzeAlter/status/2043027227374436827
>>
>>108589592
holy shit. wish local was able to produce something interesting like this
>>
File: Untitled.png (356 KB, 728x703)
356 KB
356 KB PNG
Trellis 2
>>
>>108589618
It's out of scope for local hardware. Currently local is only good for low-quality goonbait but it's fine, it'll get better along with hardware.
>>
>>108589179
He is literally generating not literal children.

>108589265
I make videos of sexy kids, not images, because while still images work for big tits hags since you can appreciate them as meat, little girls are more of a vibe so video works better for that. It's why I'm waiting for video+audio so eagerly to add more dimensionality to that vibe.

Unfortunately it's looking like the odds of the evasi@n website being shut down due to costs before a local video+audio model comes out go up every month
>>
>>108589592
it's really impressive, but it won't be true democratization as long as this level is not local, you're still dependant on API censorship shit, it kills creativity
>>
>>108589159
ZIT Laura B???
>>
>>108589629
did they fix the workflow somewhere? seemed much worse when i tried it on release
>>
File: Gemma 4 31b caption.png (1.4 MB, 1878x1337)
1.4 MB
1.4 MB PNG
>>108589637
>it'll get better along with hardware.
I used to believe that, but Z-image turbo and Gemma 4 proved to me that you can get insane quality with a relatively small model, the future is bright
>>
File: Untitled.png (376 KB, 801x776)
376 KB
376 KB PNG
>>108589670
If you look up the latest repos, they use something called DINO lock that helps a lot. But it's still meh.
>>
>>108589689
>you can pay $10
That's the whole concern, his stripe got banned and most of his potential market are retarded sooner nocoders who don't understand crypto so I'm worried about the financial health. I've gotten many times the worth of my gold donation and have shared hundreds of videos of youthful beauty with the world and have done my small part in displacing and substituting demand for the real thing.

No seriously, a couple of times on some pedo-adjacent forum I see some guy post an old Gen and it makes me happy knowing that it is inarguable that this person has consumed something AI generated instead of the real thing

The monthly begging prompt is back though and unlike previous years there's less momentum and the project is much more private, and 4chan is getting less and less popular.

Video+audio doesn't share well on /g/ anyways but I'm just excited for the extra world knowledge the audio dimension brings. I will FINALLY be able to actually prompt for something like a home family vlog where the dad is holding the camera because 1000% that training data is in current models but there's no way to express that knowledge given the relationships you make when captioning videos with just text and not learning the audio information
>>
File: 00001-1260334831.jpg (2.24 MB, 2016x2592)
2.24 MB
2.24 MB JPG
>>
File: PLEASE.png (34 KB, 246x205)
34 KB
34 KB PNG
>>108588222
please be good, we haven't gotten anything decent this year so far (except klein I guess)
>>
File: 00004-1077369241.jpg (2.13 MB, 2016x2592)
2.13 MB
2.13 MB JPG
>>
File: 1738142509041313.jpg (80 KB, 604x604)
80 KB
80 KB JPG
>>108589423
>>108589439
i'm really fed up with chinks. i wouldn't have said anything if they'd released something, but now, fuck them. i don't even touch wan anymore for sfw content. ltx is much better
>>
>>108589453
If it's the same model though, why is doing it on the website better? Does it just understand prompts better?
>>
File: image.jpg (172 KB, 2000x2000)
172 KB
172 KB JPG
>only enough vram to train 1024 with batch 2
It's over. I should've gotten a 4070TiS back then and not just the 12gb abortion
>>
>>108589985
Depending on how many steps you need and which model (I am presuming something like SDXL with that res + batch size combo) it can cost less than a dollar to train a lora with online compute. (vast, runpod, etc.)
>>
File: 1773673613343768.jpg (686 KB, 1536x1536)
686 KB
686 KB JPG
OWO
>>
>>108590008
uwu, what's this?
https://www.youtube.com/watch?v=7mBqm8uO4Cg
>>
>>108590008
The bubbles aren't doing a good enough job hiding those hands:(



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.