[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion and Development of Local Image, Video, and Music Models

Previous: >>109001708

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>mfw Resource news

06/07/2026

>Ideogram4 GGUF quantized files
https://huggingface.co/leejet/ideogram-4-GGUF

>‘A driver of political violence’: how the breakneck AI boom is fueling anti-tech extremism
https://www.theguardian.com/technology/2026/jun/07/anti-ai-tech-extremism-violence

>Ideogram 4 NF4 integration for Forge Neo with a visual JSON layout builder
https://github.com/Whatwhatio/forge-neo-ideogram4

>Huihui-gemma-4-12B-it-abliterated
https://huggingface.co/huihui-ai/Huihui-gemma-4-12B-it-abliterated

06/06/2026

>HugginFace VFS Plugin: Native Total Commander file system for Hugging Face models
https://github.com/mikinko/HuggingFace_WFX

>ComfyUI Lance AIO: Custom nodes to run Lance-3B
https://github.com/SteveImmanuel/comfyui-lance-aio

>Cube: Generative AI System for 3D
https://github.com/Roblox/cube

>The token bill comes due: Inside the industry scramble to manage AI’s runaway costs
https://techcrunch.com/2026/06/05/the-token-bill-comes-due-inside-the-industry-scramble-to-manage-ais-runaway-costs

06/05/2026

>RhymeFlow: Training-Free Acceleration for Video Generation with Asynchronous Denoising Flow Scheduling
https://simon-dcs.github.io/Website-of-RhymeFlow

>Complexity-Balanced Diffusion Splitting
https://noamissachar.github.io/CBS

>Can We Predict The Human Preference For Text-to-Image Content Prior To Generation And Is It Even Useful To Do So?
https://github.com/LSU-ATHENA/HPM-Predict

>SAM-Flow: Source-Anchored Masked Flow for Training-Free Image Editing
https://github.com/chwbob/Sam-Flow

>Geometry-Aware Dataset Condensation for Diffusion Model Training
https://github.com/2018cx/GADC

>StoryVideoQA
https://github.com/nercms-mmap/StoryVideoQA

>Lightricks to split into two companies as it cuts 75 jobs
https://www.calcalistech.com/ctechnews/article/r1dgjt5gmg

>Akium Sampler: Custom k-diffusion sampler for Stable Diffusion Forge / A1111
https://github.com/AkiumAI/akium-sampler
>>
lowest effort collage of all time lmao
>>
File: 1760481521858266.png (1.99 MB, 941x1672)
1.99 MB PNG
Astronaut playing violin on the moon, by Greg Rutkowski.
>>
>by Greg Rutkowski
lost technology unless you're midjourney (it's probably lost there too)
>>
>>109003994
i was gonna make the top right image real fore fun but its a literal chibi...

fuck it lets just get it done anyway and she how they like realistic loli...
>>
>>109004104
Do you think he's still mad about AI even though his name is basically associated with "good art"?

He's one of the largest beneficiaries of the technology.
>>
>>109004110
standing. looking at pooer
>>
>>109004026
hmmm
>>
File: animaHighres_00038_.png (1.38 MB, 1024x1024)
1.38 MB PNG
>>
>>
File: animaHighres_00040_.png (1.37 MB, 1024x1024)
1.37 MB PNG
>>109003927
there u go faggot is that what you meant and want? its OK you don't have to fucking hide it...

what you gonna do fucking mald and seethe about it for the next 24 hours?
>>
localmeltie
>>
>>109004188
what ever man what ever man what ever
>>
the problem is mine is artist and legit the other shit is i'm gonna hide it and its fucking cringe as fuck. people see right through it.
>>
reprompted the whole collage itself as a new image on Klein 9B with Gemini caption lmao, came out better than expected honestly

the prompt is very long so I put it here:
https://pastes.io/aywh1DMS
>>
File: Anima-00018-387790566.jpg (750 KB, 2432x2944)
750 KB JPG
>>
File: animaHighres_00041_.png (1.32 MB, 1024x1024)
1.32 MB PNG
why not. its just a young girl with a bear suit in front of her computer.
>>
File: animaHighres_00042_.png (1.29 MB, 1024x1024)
1.29 MB PNG
>young girl wearing a cute bear suit in front of her computer in her messy bedroom, fast food, coke drink from typical fast food with clear plastic lid, disbelief and depression about what she is seeing on her computer monitor. Night time, dark, blue light from the screen, realistic, photo, high quality

dropped the reference and controlnet just to see how the prompt worked.

not bad.
>>
File: Ideogram_00082_.png (2.21 MB, 1680x944)
2.21 MB PNG
>>
Bro seems to believe that this image was an example of something only Nano Banana could do, impossible to reproduce without Le Ebin Json Ideogram (it is not and was not)

https:/reddit.com/r/StableDiffusion/comments/1tzr6ci/an_experiment_recreate_jsonprompted_closed_model/
>>
>>109004220
this is what Neon Genesis Evangelion will look like in 2013
>>
Are most using the workflow provided by the lakers of ltx 2.3 for training loras or is 3rd party stuff better now?
>>
>>109004220
>misato cosplaying as aska
cute hag
>>
>>109004351
Naomi Wu before boobjob
>>
>>109002285
Bot you don't talk about that stuff after sex? Do you never ask each other about random lore of things you don't know about the opposite sex?
>>
>>
>>109004405
I am fascinated by your world view
>>
>>109004424
AY BITCH WHERE DEM HORI NIP NIPS COME FROM HUH

- me, post coitus
>>
File: Ideogram_00087_.png (2.83 MB, 1680x944)
2.83 MB PNG
>>
Haven't touched SD since 2023
Is in painting still required for editing a face or are models today smart enough to handle requests like "give the woman on the right blone hair"?
>>
>>109004479
>are models today smart enough to handle requests like "give the woman on the right blone hair"?

Short answer yes.

Slightly longer answer, not all models.
>>
File: nbp.png (1.6 MB, 1024x1024)
1.6 MB PNG
>>109004351
>Three anime figures on or near the PC tower
there's four
>"stance": "standing in a slight contrapposto pose",
not really, she's just leaning
>negatives: Any appearance of pink/magenta anywhere
your troony keyboard and pride flag in the background??

not bad, a few more training runs on nano banana and gpt outputs and local should be there by 2027.
>>
>>109004502
??? wat? my picrel Klein one is closer overall to the original Gemini pic than his Ideogram one was. Had had rando curtains on the wall and four figures on top with none inside, instead of three plus one inside.
>>
File: Ideogram_00092_.png (2.7 MB, 1088x1456)
2.7 MB PNG
>>
>>109004533
why are Ideo gens so Ernied, like high contrasty grainy
>>
>Ideogram 4
fucking kill yourself... fucking brown skinned low iq cunt...
>>
buy a fucking ad for there is nothing that shit model can do that i can't with a simple controlnet faggot mouth breather
>>
>>109004532
the ideogram one is terrible. json prompting is a meme anyway, the only reason it works on nano banana is because their 3 trillion parameter LLM is re-writing the prompt in the background. jeets think it's some kind of 'computer language' that 'better represents how models think' but it's really just slop.
>>
File: animaHighres_00043_.png (1.52 MB, 1024x1024)
1.52 MB PNG
>>109004502
he really cares about this, its his whole life...
>>
wtf are horizontal nipples?
>>
>>109003927
Have an RTX 5080 with 16GB VRAM. Can I even run WAN?
>>
Why is this guy having a meltie?
>>
>>109004588
>json prompting
i told gemini to fuck off with that shit, it seems to leaking more and more into mainstream cloud models, its is fucking garbage. The more they move away from us the more they will self destruct, so its win win.
>>
>>109004614
you'd know if you asked a woman about it immediately after railing her
>>
>>109004619
because you're a fucking faggot and i'm tired of pretending to be nice...
>>
>>109004619
I mean I just posted this:
>>109004351
where the ATTACHED pic here on le 4Chinz (made with Klein) was just showing that the Gemini pic (left side of the Reddit thread) was not in any way an example of something that was difficult to gen to begin with. IDK about anything past that lol
>>
>>109004600
why does this have a gemini watermark
>>
>>109004636
show me
>>
>>109004636
oh so it does, i guess they trained this model on some images from gemini, so fucking sue me? its not like those other companies didn't steal everyone else's shit and then charge money to use it...

"its okkay when we do it... "
>>
>>
>>109004636
KEEEEEK remember to thank your api overlords, localkeks. without google and openai, you wouldn't have any synthetic outputs to train your slop on.
>>
File: 56754.webm (3.09 MB, 768x576)
3.09 MB
3.09 MB WEBM
would you look at the time
>>
>>109004657
sure its not like every real image wasn't already trained into the local models, its not like those datasets are magically gone either.
>>
>>109004671
pathetic man truly pathetic, you could even see the start point before the explosion.

Tip: start from empty white image, then make the prompt.
>>
>>109004619
Four hours ago someone posted a cute 1girl and he's been melting down ever since.
>>109003582

It's been a couple of months since this guy has had this particular brand of sperg out ITT. He used to do it like every other weekend.
>>
File: Ideogram_00099_.png (2.6 MB, 944x1680)
2.6 MB PNG
>>
>>109004689
i wonder why my glow nigger
>>
>>109004700
neat
>>
File: debo_s_fia_00016_.png (2.01 MB, 1792x977)
2.01 MB PNG
>>
we're going to hell and you're all coming with us.
>>
the darkness will consume you. our mission complete it was our purpose.
>>
>>109004700
Prompt? I'm close to wanting to use i4 even if just to say I have
>>
>>109004671
Nice ehh... vignette
>>
>>109004636
because he just i2i'd >>109004502
>>
https://www.youtube.com/watch?v=3wTl8DrC240
Amen
>>
>>109004728
https://pastebin.com/MMGXzBv0

It's llm slop and too big to post here
>>
>>109004743
the plot thickens
>>
>>109004760
Are you having it write that based on an existing image or are you just typing out ideas and having it expand upon them?
I'm really close to trying out i4.
>>
Japanese Folk Metal ACEStep XL LoRA. Trained on just 10 songs.
https://vocaroo.com/1hOnOf8ZWn71
https://vocaroo.com/18pRgXxfm3tj
https://vocaroo.com/15AMD9XrQ4Xl
>>
File: Ideogram_00096_.png (1.71 MB, 944x1680)
1.71 MB PNG
>>109004783

The latter. My pathetic human attempts ended up looking like shitty romance novel covers.
>>
in the end who really cared this much, it was all just a dream.

peace
>>
nightmarishly terrible thread, a new low for local. good job lads.
>>
File: Ideogram_00106_.png (2.61 MB, 1936x1088)
2.61 MB PNG
>>
File: Ideogram_00107_.png (3.32 MB, 1936x1088)
3.32 MB PNG
>>
>>109004838
>that photo on the wall behind migu and homer
kekk
>>
>>109004797
Hows the speed and on what card are you running it on?
>>
>>109004871
3090. Slow as fuck desu. Like a minute 30 seconds per image.
I think there are ways to speed it up, but I haven't checked yet.
>>
>>109004789
easiest way to train acestep loras?
>>
>>
>>109004890
NTA, but side step.
>>
>>109004816
Anon can make it up in the back half dw
>>
https://files.catbox.moe/ek5jwc.mp4
facebook ai content is so unhinged lmao
>>
>>109004890
Wrote a detailed guide here
https://rentry.co/s8fg8ber

But it uses custom scripts alongside Side-Step. By far easiest way is to just use Side-Step's options to caption the dataset, but my script is what I use since I can work around bugs once everything is fetched. The most tedious part is just the data curation, with doubledouble top and the web to curate lyrics, and the help of Gemini to structure them, it should not be too bad (done manually), though I still use a script to reformat my lyrics. For training, I use a cloud GPU (Modal) since it's free $30 in monthly credits, good for a few runs (this one was just $5). I will update the guide later to include a Modal training script. It can be done locally, but since I have a 3090 it would have to be left running overnight to get the 800 epochs. I don't like to use 60 sec chunks on Side-Step because I think it converges better without, so I train with the full songs which takes exponentially more time.
>>
>>109004946
Thanks anon!
>>
File: Ideogram_00116_.png (3.44 MB, 1264x1680)
3.44 MB PNG
>>
>>109004990
how nsfw can ideogram get? Can it do boobs? What about underwear, lingerie?
>>
>>109004946
nice ty anon
>>
>>109004996
It can do most things. It's that fucking filter that's the problem.
>>
>>109004996 >>109005000
try https://pastebin.com/xpYezwZp as workflow
>>
>>109004990
>>109005000
it has sd1 face

so it seems all these three news one ernie,animu and ideogram are fails then

cant beat zimage and klein and even qwen aint that bad
>>
there's one thing that isn't a failure
>>
>>109005019
Interestingly I just tried my regular workflow but wrote naked lady with no clothes in Japanese and it did it no issues.
>>
File: 1764654624217817.png (2.82 MB, 1728x1728)
2.82 MB PNG
>>
>>109005000
ok, so it can do breasts
https://civitaiarchive.com/models/2679521?modelVersionId=3008701

I'm just not impressed with the weird grainy quality. I'm trying to find a reason to pull the trigger and start downloading the models but I just don't see how it's better than zit or k9 with some realism loras thrown in. Skin and faces look very, very, sloppy
>>
File: ig.jpg (268 KB, 896x1120)
268 KB JPG
>>109005043
good too.

anyhow if the censorship gives you trouble that's the least affected sensible workflow I've seen so far. makes the model usable.

i forgot to add "masterpiece" but it almost is
>>
>>109004671
I used to work with a girl who looked and acted like this. We both worked in a Japanese company so she was a rice hunter and a bit of a cunt. But your of reminds me of her.
>>
>>109005072
the 1girls are less pretty than in other models overall but not terrible

> I just don't see how it's better
you can prompt far more characters/objects in defined regions, that's the main thing IMO
>>
>>109005080
what like she craved yellow d lmao
>>
>>109005089
Yeah lol. I assume all white women I meet here are into Asian dick. It’s hell on earth for them here otherwise
>>
>>109005080
you should have fixed her
>>
File: ig.jpg (348 KB, 896x1120)
348 KB JPG
>>109005072
also the best model for text. maybe if you want to do some visual storytelling.
>>
>>109005078
have you tried it with an abliterated clip?
>>
File: ig.jpg (306 KB, 896x1120)
306 KB JPG
>>109005152
no. i just recommend trying >>109005019, it works quite well
>>
why does the comfyui desktop app run at 5 FPS? the gen speeds are fine, but the interface is so slow. i'm not even using one of those jeeted 500 node workflows either
>>
>>109005072
this is jeetslop

>This workflow uses an uncensored text encoder: Qwen3VL-8B-Uncensored-HauhauCS-Aggressive, plus a latent upscaler before the image sampler. Right now, it works successfully around 30% of the time

bro thinks the text encoder has jack diddly fucking squat to do with anything here (it does not, "uncensored" text encoders are NOT a thing that serves any purpose in the context of image models)
>>
anyone got ideogram gguf to work in comfy?
>>
File: ComfyUI_00746_.png (503 KB, 896x1152)
503 KB PNG
How can local diffusion be used to create video game? How do I ensure my OC has a consistent face?
>>
File: file.png (1.88 MB, 768x1376)
1.88 MB PNG
>>
>>109005240
Why would you use gguf?
It's quantized from 8bit, it will have shit quality.
Just run the fp8 if you can, nf4 if you are a hyper vramlet with I dunno like 6 gigs of vram.
>>
>>109005198
i'm guessing by far most here (still) use the webui, it's probably also the most obvious workaround tion if that desktop ui has some bug
>>
>>109005263
this shit so ass
>>
>>109005265
nvfp4 is blackwell only
>>
>>109005273
no saar, very good model saar. please watch the /r/stablediffusion postings
>>
>>109005259
many options. world models
https://github.com/Tencent-Hunyuan/HY-World-2.0
https://github.com/robbyant/lingbot-world
https://over.world/

might be the most direct way sooner or later

or try to use 3d object generation splat models, idk which is currently best maybe try https://github.com/VAST-AI-Research/TripoSplat https://github.com/IgorAherne/TRELLIS.2-stableprojectorz etc, obviously this is for 3d engines

or use blender/krita/whatever with plugins to work with 3d or 2d textures

it's not what most people here usually do tho.
>>
File: idiotgram.jpg (257 KB, 1055x788)
257 KB JPG
>>109005273
>>109005263
>>109005301
saar you only need to draw more bounding boxes
>>
>>109005263
anyone have a workflow/node to get around the salocy filter?
>>
File: ComfyUI_00747_.png (442 KB, 896x1152)
442 KB PNG
>>109005303
>>
File: ga.jpg (228 KB, 896x1120)
228 KB JPG
>>109005322
see >>109005019
it seems to mostly work for me. it may be that you do have to define some extra boxes. idk, add a /ldg/ logo
>>
>>109005291
nf4 and nvfp4 aren't related at all.
>>
Also nvpf4 will "work" on 4000 and 3000 series, the speed will be ass due to lack of acceleration, similar to how fp8 works on 3000.
But that is also true, and possibly worse for Q quants.
Regardless it's not nvfp4 anyway.
>>
File: 1778593395456.webm (1.65 MB, 464x688)
1.65 MB
1.65 MB WEBM
>>
>>109005019
Error running sage attention: Unsupported head_dim: 256, using pytorch attention instead.

am I supposed to switch off sage on start-up?
>>
File: ComfyUI_00750_.png (538 KB, 896x1152)
538 KB PNG
>>
>>109005380
idk if anything is better than the fallback it already chose
>>
>>109005335
\>slop
apt
>>
>>109005366
OUUUGGHH
>>
>>109005198
>desktop
stop using desktop
>>109005272
>i'm guessing by far most here (still) use the webui,
i hope one day this is not true. its trivial to create your own native frontend
>>
File: xy.jpg (299 KB, 1120x896)
299 KB JPG
>>109005366
smooth dancing

>>109005399
imagine what we can do with horrible taste in everything now!
>>
File: bimbows cyber.png (2.39 MB, 1672x941)
2.39 MB PNG
>>109003927
>>
>>
I'm pretty sure there's not ever gonna be an Ideogram 4 image I can't remake with Klein 9B natural language prompting
>>
File: 95..jpg (326 KB, 1120x896)
326 KB JPG
>>109005496
at least it won't be that easy
>>
File: igram.jpg (225 KB, 1120x896)
225 KB JPG
>>109005496
regional prompting is obviously powerful
>>
File: igram.jpg (303 KB, 1120x896)
303 KB JPG
not saying it's an extremely aesthetic tune but it's probably the best control over composition yet
>>
>>109005541
did you get reference images working yet?
>>
File: igram.jpg (255 KB, 1120x896)
255 KB JPG
>>109005585
just playing with regional prompting on the prompt builder node that kj boss made. didn't try anything else yet (and I don't know what is possible overall)
>>
File: igram.jpg (336 KB, 1120x896)
336 KB JPG
>>109005599
variant. it's really not that bad.
>>
>>109005677
>>109005599

I’m still not sure if this model can do real alpha or not or it’s just a limitation of comfyui
>>
>>109005599
box? I get nothing but the filter, even with the workflow that was posted
>>
use case for renting a GPU to use Ideogram 4 instead of paying for better quality API services?
>>
NF4 quantized model :24 GB VRAM
FP8 model :48 GB VRAM
CPU/RAM offloading RAM (64–128 GB+)

This is not local, renting a GPU isn't local.
>>
File: 1759304904691262.png (46 KB, 581x372)
46 KB PNG
i'm trying something new for this lora i'm training. i had a pretty big dataset of varying resolutions. so instead of removing all the low res ones, i sorted them up using my trainers bucket logic into their own 512,768 and 1024 datasets, and then started training using all three resolutions at the same time.

is this a good idea or is it gonna fuck it up completely? to me it seems like a good idea, but this shit is black magic so it probably isnt.
>>
>>109005496
Kek...I've seen the same thread. Was that suppose to be a challenge?
>>
File: 1769429388170587.png (282 KB, 505x870)
282 KB PNG
>>109005782
1400 steps in. doesnt seem to have shat itself yet
>>
>>109005723
sure
https://litter.catbox.moe/erenjfkjsdr8zzzx.png
>>
>>109005914
thanks
>>
File: file.png (318 KB, 463x933)
318 KB PNG
>>
has anyone tried using ideogram as a low denoise upscaler?
im hoping it can remove the cloudy cum filter from zit images but i dont want to download a huge model just for a quick experiment
>>
File: fact.jpg (57 KB, 716x687)
57 KB JPG
reminder to put satan into your negative prompt
>>
Is tdrusell here?
If Anima doesn't forget concepts, why don't realistic Anima finetunes retain Anima character knowledge?
>>
File: u_00024_.png (1.09 MB, 1024x1024)
1.09 MB PNG
>>109006018
I would like to know that too... im currently playing with anima.
>>
File: 3itches.png (938 KB, 1024x768)
938 KB PNG
I want Anima with input so I can gen random cute girl and then put her in many cute poses
>>
>>109006018
Because at the end of the day, Anima always forgets concepts, except in some particular cases where it does not, and the entire pro Anima crowd uses those cherry picked wins as their primary excuse.
>>
File: file.png (455 KB, 706x648)
455 KB PNG
>>109006077
Yup but at least it can do racist shit which i like
>>
https://civitai.com/models/2645333/photanima?modelVersionId=3013998
what do yall folx think?
>>
>>109005782
>>109005892
Yeah the idea seems "ok". Though some moderate amount of lanczos doesn't really hurt the training too much, or warrant this kind of esoteric bucketing.
I think your lora for the certain butterface actress will turn out ok.
>>
File: 1528756500577.png (110 KB, 429x410)
110 KB PNG
What if I train anima lora only on NL captions?
>>
>>109006018
>yet another Catastrophic Faggotmeltie from Julien
Lmao
>>
>>109006100
It looks less plastic than most other realism attempts (and without using grainy analog/90s/2000s/candid/iphone slop to cover up for plasticity) but the details are rough. Maybe the base is better but haven't seen enough images to judge that.
>>
>>109006110
It works fine in my experience.
The official lora was trained NL only.
>>
File: Ideogram_00139_.png (1.4 MB, 1024x1024)
1.4 MB PNG
ARRRGGGHHHGGHH
>>
>>
>>109006136
>lifting armpit and not doing anything with it
Lame!
>>
File: genw.png (930 KB, 1021x968)
930 KB PNG
>>109006110
dew it.
>>
i gotta fix my sleep schedule so i'm awake during the spam hours
>>
File: Ideogram_00143_.png (2.81 MB, 2224x960)
2.81 MB PNG
>>
File: 1759591335135061.jpg (2.41 MB, 1344x2240)
2.41 MB JPG
>>
>>109006273
not the Toyota Prius on wow again
>>
File: Ideogram_00145_.png (3.91 MB, 1936x1088)
3.91 MB PNG
>>
File: Ideogram_00146_.png (2.98 MB, 1936x1088)
2.98 MB PNG
>>
can it do shrek
>>
now do one where she's bald
>>
>>109006326
>>109006349
migu :(
>>
Is anyone else able to use svi2pro? It seems to have been bricked by some update the past few months.

Once it goes into low noise pass, it just turns to shit, like the light lora is 10x as strong.
>>
>>109006326
>>109006349
vax status?
>>
>>109006387
Still getting weekly boosters up till that image was taken.
>>
File: Ideogram_00148_.png (3.66 MB, 1936x1088)
3.66 MB PNG
>>
but can it do shrek with only 8GB of vram?
>>
do you guys put "6 fingers" into the negative prompt? or does it do nothing
>>
>>109006131
klein9b, remove all text
>>
>>109006458
It's not the text. it was supposed to be a topless Japanese woman in a fundoshi.
>>
File: Ideogram_00149_.png (2.95 MB, 1936x1088)
2.95 MB PNG
>>109006440
>>
File: Ideogram_00150_.png (3.71 MB, 1936x1088)
3.71 MB PNG
>>
>>109006478
Fingers are all wrong (the gpu fingers).
>>
File: test.png (2.85 MB, 1856x1536)
2.85 MB PNG
for some reason my prompt scheduling + negpip workflow broke yesterday (negpip stopped working) and i kept trying to figure out what was happening instead of just reopening the workflow (couldn't understand why it broke in the end)

halfway through i decided to make my own "prompt scheduler" where i just chain ksamplers with the steps i want for each artist since the asagi4 one only supports floats and it doesn't give me proper control with the turbo lora since that's only 12 steps

i ended up learning a bit about comfy and how the steps affect anima and i think i can mix artists properly now

also the linear quadratic scheduler seems pretty cool. didn't see anyone ever bring it up with anima, i don't know if it's only useful with the turbo lora

>>109006444
if you're using a turbo lora for anima for example, the negative prompt is ignored. it depends on your setup. you can try negpip (https://github.com/pamparamm/ComfyUI-ppm)
>>
File: Ideogram_00153_.png (3.53 MB, 1936x1088)
3.53 MB PNG
>>
the second method actually works. It seems to almost kill the grey filter completely (at least for me)
https://old.reddit.com/r/StableDiffusion/comments/1tz4fnf/ideogram_4_a_solution_for_removing_the_annoying/
>>
File: Ideogram_00156_.png (2.92 MB, 1184x1776)
2.92 MB PNG
>>109006559
>>
>>109006649
weird aspect ratio
>>
File: Ideogram_00155_.png (2.83 MB, 1184x1776)
2.83 MB PNG
>>109006654
Yeah it was being weird.

I have a good one but the areolas are visible so I gotta box it.

https://files.catbox.moe/bd4i10.png
>>
File: Ideogram_00159_.png (3.37 MB, 2224x960)
3.37 MB PNG
>>
>ideogram
>24gvram + 64gb ram
yeah, renting GPUs isn't local, you can fuck yourself ideogram spammer
>>
File: 1777377802210324.png (230 KB, 509x755)
230 KB PNG
>>109005782
>>109005892
>>109006101
yeah klein had zero problems with this. pretty funky seeing the step time all over the place
>>
>>109006711
>t. poorfag vramlet
>>
File: Ideogram_4.0_00094_.png (1.6 MB, 1376x768)
1.6 MB PNG
>>
File: Ideogram_00161_.png (3.89 MB, 1088x1936)
3.89 MB PNG
>>
make some ideogram nuclear kinos
>>
>>109006770
This isn't even ideogram. This is just a regular modern WoW screenshot.
>>
>>109006827
The intention is what counts, chud.
>>
>>109003927
civitai is kill
>>
>>109004552
probably stole flux tech
>>
>>109006942
We use tensor tho
>>
>>109006942
.red works for me
>>
>>109006478
>8gb version
Grim.
>>
>>109006018
Because you're using the mixes and merges. You're not using the bare tunes.
>>
File: 1775441606477677.png (3.97 MB, 1408x1760)
3.97 MB PNG
I think I'm getting the hang of id4. Hopefully, people will start releasing loras for it
>>
With ltx 2.3, how do I fix the smearing in between frames? More steps doesn't seem to fix it..
>>
File: Ideogram4_00036_.jpg (160 KB, 896x1120)
160 KB JPG
>>
File: 8418.png (2.37 MB, 912x1376)
2.37 MB PNG
kek
>>
File: 151612CUI_00001_.png (1.76 MB, 1152x1536)
1.76 MB PNG
>>
>>109007256
Only gen that wasnt blocked by safety filter. Photo of a castle was blocked, but not this.
>>
>>109007300
the gothic asshole? i thought they closed that place down
>>
>>
Fuck waiting for forge and other ui to keep up I'll do it myself
>>
File: 1771699728246418.png (3.77 MB, 1200x1808)
3.77 MB PNG
>>
They're trying to kill /adt/ now, you can't have shit on this bitch of an imageboard.
>>
cozy bread
>>
>>109007440
>0 stars
cumfart won't give you millions
>>
>>109007440
Already 200% better than Ani or Comfy.
Keep going champ.
>>
>>109007459
This is for me, this is a comfy wrapper why the fuck would I waste my time shilling when the pain point is something anyone who's not a vramlet couldn't do themselves?
I don't have schizo illusions of grandeur anon. Entire project is what I want
>>
>>109007453
Natural /g/ selection. Pretentious narcissists disguising themselves as AI artists won't make it on 4chan.
>>
File: u_00040_.png (653 KB, 1024x1024)
653 KB PNG
>>
>>109007469
what astounds me is how many of these wrappers keep popping up. Confyorg just cannot into ui/ux. Comfy is a fucking liar
>>
>>109007520
sittin on the toilet
>>
>>109007527
My problem is with the forgeUI branch failing at basic shit. I can now just add things as I need it and I'm not going to beg for help or acceptance unlike some other loons that are trying to push vibecoded shit as something that needs extra hands for ego reasons. With comfy 90% of the work is done the rest is translating it to your workflow
>>
File: 541848.png (2.37 MB, 976x1296)
2.37 MB PNG
>>
>>109007654
But muh ComfyCloud. If you're using Ideogram and ComfyCloud, you're supporting the local ecosystem
>>
File: 4145401356.png (24 KB, 200x150)
24 KB PNG
AAAAAAAAARRRRGGHH I LOST
>>
>>109007442
>guy on left
Vagina from Chroma dataset?
>>
>>109007677
comfy really fucked local by choosing cloud as the main revenue. fuck him
>>
Completely stock Klein 9B Distilled
https://files.catbox.moe/ldd75p.png
>>
File: z-image-turbo_00002_.png (1.06 MB, 1024x1024)
1.06 MB PNG
>>109007535
>>
>>
File: debo_vn_fia_00004_.png (2.27 MB, 1792x977)
2.27 MB PNG
>>
>>109007713
>BLM fist
>>
File: Ideogram__00009_.png (2.77 MB, 1088x1440)
2.77 MB PNG
>>
Even GPT Image can't avoid the finger problem.
>>
File: Ideogram__00014_.png (2.33 MB, 1088x1440)
2.33 MB PNG
>>
>>109009098
why does the image look like it's literally melting?
>>
File: 08-19-2026_01.jpg (1.26 MB, 1248x1824)
1.26 MB JPG
>>
File: 296639373572133.png (2.51 MB, 1152x1728)
2.51 MB PNG
>>
File: debo_vn_fia_00006_.png (2.25 MB, 1792x977)
2.25 MB PNG
>>
File: 1007272930385581.png (1.51 MB, 1088x1600)
1.51 MB PNG
>>
>>109009098
Do mamako oosuki
>>
Bro you could not do comic before Ideogram 4 bro, plz bro believe it was impossible bro
>>
File: 254464231643963.png (1.69 MB, 1600x1024)
1.69 MB PNG
>>
File: 337728118956942.png (1.68 MB, 1024x1600)
1.68 MB PNG
>>
File: Ideogram__00029_.png (3.18 MB, 1248x1664)
3.18 MB PNG
>>109009610
not sure who the character is so who knows how good it ended up
>>
i like ideogram, i trained a lora on 100 nsfw images and the lora immediately overrode the nsfw filter and worked perfectly, idk what people are complaining about
and json bbox prompting means i don't have to do Shakespearean prompts to get the model to position things the way i want
>>
File: anima1_00009_.jpg (387 KB, 1272x1760)
387 KB JPG
>>109009858
dl link for lora?
>>
>>
File: 465748966342467.png (2.26 MB, 1152x1472)
2.26 MB PNG
>>
>>109009858
I hate to say it but reddit was right. It is a giant leap forward for local. Getting around the filter is trivial.
>>
ideogram mogs bigly but the insistence on json + bbox prompting is mega cringe
>>
>>109009968
bbox would be a problem if it didn't work or stitch things together properly but it gives you what you exactly want. It's the best way to control what ends up in the image.
>>
File: 854906296377090.png (1.65 MB, 1152x1600)
1.65 MB PNG
I'm ootl, what does "bbox" prompting mean?
>>
File: 1099069460619817.png (1.17 MB, 1152x1600)
1.17 MB PNG
>>
>>109009971
It looks so ass though, it's like Ernie, way too many grainy NBP and Gippity 1.5 images in the dataset
>>
File: 820688013760192.png (1.66 MB, 1600x1152)
1.66 MB PNG
>>
File: debo_vn_fia_00011_.png (2.8 MB, 1792x977)
2.8 MB PNG
>>109010032
>>109010152
nice style
>>
>>
File: gooseyou.jpg (294 KB, 896x1120)
294 KB JPG
>>109010032
you draw a bounding box on a canvas, assigning a description and/or color to it to direct the gen, for instance placing a box where you want the subjects head to be and describing their expression, or placing items into the scene, basically full creative control
>>
>>109010187
So basically just regional prompting.
>>
File: 346569720558861.png (1.42 MB, 1152x1472)
1.42 MB PNG
>>109010187
I see, that's cool.
>>109010170
Thanks.
>>
>>109008626
>Did you call "moi" a dipshit ?
>>
fuck
the ideogram shilling is working, should i take the plunge with only 16gb of vram?
>>
File: z_00498_.jpg (517 KB, 1264x1800)
517 KB JPG
>>
>>109010268
mootchin
>>
its up
https://www.youtube.com/watch?v=21t316WkUl4
>>
>>109009852
>ideogram has buttchin

I'm an unc, I can gimp any text.
>>
>>109009654
ideogram is trash because of the baked in refusals. poop.
>>
>>109007440
Share the repo when you're done.
>>
>>109010238
Yeah, I'm using it smoothly in ComfyCloud, very recommended.
>>
>>109010229
>>109010170
>>109010152
>>109010032
Oh, cartoons, very neat
>>
>update comfy
>it's broken again
>try to fix it for an hour
>give up, go back to the old version
>update nvidia driver
>old version stops working too
I hecking love ComfyUI and its developers
>>
>>109010411
>update nvidia
>update comfy
>update all custom nodes
>everything works perfectly fine
you suck.
>>
>>109010409
Fuck off
>>
>>109010422
where?
>>
>>109010420
>update nvidia
>update comfy
>update all custom nodes
>nothing works
Thanks
>>
>>109010436
(you) problem. works on my machine
>>
>>109010437
(you) solution, doesn't work on my machine
>>
File: z_00506_.jpg (482 KB, 1264x1800)
482 KB JPG
>>
>>109010440
>doesn't mention what version they're upgrading from
>doesn't mention if it was the backend or front end that broke
>doesn't mention post any logs
can't provide a solution if you don't mention anything
>>
Half of you roaches are technically retarded, why update for the fuck of it when nothing worth updating for is out
With that said the fucking torchaudio is a serious breaking issue and comfy needs to fucking figure it out.
>>
File: output_1780952392.png (1.97 MB, 832x1216)
1.97 MB PNG
19 day Nofap report, no woman talked to me.
>>
>update cumfart
>run some gens, not really paying attention
>gens start to take forever
>suddenly my nvme dies
very comfortable
>>
anyone knows where is catjack?
>>
>>109010463
torch audio is a Python problem. It has longstanding issues.

Usually, you have a version conflict that you aren't realizing.
>>
>>109010474
He went to church, clearly.
>>
File: z_00508_.jpg (556 KB, 1264x1800)
556 KB JPG
banana for scale
>>
>>109010481
mootchin
>>
>>109010463
Some of us are developers. I keep a stable version, bleeding edge version and daily backups. Anything that I can confirm is broken on the latest update, I submit bug reports so they can be fixed.
>>
how come there isn't a catch-all workflow for everything? something that has control net, adetailers, inpainting and whatever you can think of into a single workflow, where you can easily bypass the things you don't use. surely some autist has put something like this together.
I've been browsing some anima workflows on civit and I wanna shoot myself. I have very basic knowledge of things, and some flows are either unnecessarily convoluted or too basic. I've tried frankensteining something myself but I can't do it
>>
File: 0.jpg (3.1 MB, 3758x2963)
3.1 MB JPG
>>109010436
I have 140 custom nodes. Currently the only broken node is comfyui-ppm, which will be fixed soon by the dev.

GeForce Game Ready Driver: 610.47 (May 26th, 2026)
Latest ComfyUI version
>>
>>109010498
when have you done this for any other "professional" software?
>>
>>109010498
have a cookie anon, you are a good boy^^
>>
>>109010545
This man ain't got no legs
>>
File: z_00512_.jpg (704 KB, 1264x1800)
704 KB JPG
>>
>>109010545
slap them out of her hand. Look carefully at the reflection in the glass of the photo on the wall on the right. You will see the lighting is consistent with a movie set. For this reason, you have become aware that you are being set up, run!
>>
>>109010498
Honest question, as a developer, do you find it acceptable to release as many buggy version in a row as they do? I'm a developer too and I've been thinking about contributing to Comfy but I just cant make myself work for people who are as unproffesional as they are.
>>
>>109010554
better chin. What's up with those feet tho?
>>
>>109010553
The legs come with the fp8 version :(
>>
File: output_1780953673.png (1.95 MB, 832x1216)
1.95 MB PNG
>>109010470
her contact lens is falling out. Help her find it, and she'll reward you with a kiss.
>>
>>109010545
>only the best cookies for the best comfycloud users
>>
test
>>
>>109010436
want to know a tip? the main thing that utterly borks your comfy is updating python dependencies, typically torch, torchvision & torchaudio. you almost never need to update them unless comfy needs it.
>>
File: z_00515_.jpg (565 KB, 1264x1800)
565 KB JPG
>>109010560
she's a real battleaxe
>>
Fresh when ready

>>109010630
>>109010630
>>109010630
>>109010630

Fresh when ready
>>
File: output_1780954261.png (1.84 MB, 832x1216)
1.84 MB PNG
>>109010575
gonna try "chin" in the negative.
>>
>>109010508
https://github.com/Haoming02/sd-webui-forge-classic



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.