/g/ - /ldg/ - Local Diffusion General - Technology

[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]

Board

▼ Settings Mobile Home

/g/ - Technology

Return Catalog Bottom Refresh

[Post a Reply]

Name
Options
Comment
Verification	4chan Pass users can bypass this verification. [Learn More] [Login]
File
Please read the Rules and FAQ before posting. You may highlight syntax and preserve whitespace by using [code] tags.


08/21/20	New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17	New trial board added: /bant/ - International/Random
10/04/16	New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]

Janitor applications are now open. Apply here!

[Advertise on 4chan]

[Return] [Catalog] [Bottom]

Anonymous
/ldg/ - Local Diffusion Genera(...) 06/03/26(Wed)13:56:58 No.108972752

File: highlights_g_108966726_17(...).jpg (3.1 MB, 5355x5203)

3.1 MB JPG

/ldg/ - Local Diffusion General Anonymous 06/03/26(Wed)13:56:58 No.108972752

The Secret Sauce For Kinos Edition

Discussion and Development of Local Image, Video, and Music Models

Previous: >>108966726

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon

Anonymous
06/03/26(Wed)13:58:28 No.108972764

Anonymous 06/03/26(Wed)13:58:28 No.108972764

Can Krea2 do N64 kino OOTB?

Anonymous
06/03/26(Wed)13:59:04 No.108972768

Anonymous 06/03/26(Wed)13:59:04 No.108972768

File: 00104-8016_ayakon_2026060(...).png (3.43 MB, 1418x2123)

3.43 MB PNG

oops fox posted after thread over

Anonymous
06/03/26(Wed)13:59:16 No.108972772

Anonymous 06/03/26(Wed)13:59:16 No.108972772

how long before we have dream diffusion?
Turn my nightmares into a reality

Anonymous
06/03/26(Wed)13:59:37 No.108972775

Anonymous 06/03/26(Wed)13:59:37 No.108972775

Blessed thread of frenship

Anonymous
06/03/26(Wed)14:00:17 No.108972781

Anonymous 06/03/26(Wed)14:00:17 No.108972781

File: 1760367433244970.png (138 KB, 984x408)

138 KB PNG

ai image detectors are quick with updating their models
holy smokes

Anonymous
06/03/26(Wed)14:01:41 No.108972791

Anonymous 06/03/26(Wed)14:01:41 No.108972791

File: 1779034870481081.png (701 KB, 930x465)

701 KB PNG

klein edit 9b is pretty neat, can do all kinds of stuff "use colored pencils like a sketch", change color, etc.

qwen edit is good but I seem to get better results with klein edit.

Anonymous
06/03/26(Wed)14:02:50 No.108972801

Anonymous 06/03/26(Wed)14:02:50 No.108972801

>>108972791
>klein edit 9b
what's the difference between klein 9b and klein edit 9b? Also which one can be used as kontext?

Anonymous
06/03/26(Wed)14:04:47 No.108972816

Anonymous 06/03/26(Wed)14:04:47 No.108972816

>>108972801
edit workflow for edits, im using klein edit 9b (distilled), very fast even at 8 steps (4 default)

Anonymous
06/03/26(Wed)14:08:12 No.108972829

Anonymous 06/03/26(Wed)14:08:12 No.108972829

>>108972305
Are you thinking about releasing your kinoapp? I would love to try it out.

Anonymous
06/03/26(Wed)14:09:16 No.108972840

Anonymous 06/03/26(Wed)14:09:16 No.108972840

nb4 niggas who dont remember the safeycucking of ideograms last model complaining about the safetycucking of their new model

Anonymous
06/03/26(Wed)14:09:24 No.108972841

Anonymous 06/03/26(Wed)14:09:24 No.108972841

File: 1780151980273053.png (1.02 MB, 1056x976)

1.02 MB PNG

>>108972816
also, klein edit seems to be pretty good at copying font styles:

Anonymous
06/03/26(Wed)14:09:24 No.108972842

Anonymous 06/03/26(Wed)14:09:24 No.108972842

>>108972763
honestly the best nipples i've seen in a base model

Anonymous
06/03/26(Wed)14:12:16 No.108972859

Anonymous 06/03/26(Wed)14:12:16 No.108972859

File: ComfyUI_00720_.png (1.23 MB, 896x1152)

1.23 MB PNG

Anonymous
06/03/26(Wed)14:12:26 No.108972861

Anonymous 06/03/26(Wed)14:12:26 No.108972861

File: 1766037634923393.png (950 KB, 1080x1080)

950 KB PNG

SD3 chads eating good tonight

Anonymous
06/03/26(Wed)14:14:08 No.108972870

Anonymous 06/03/26(Wed)14:14:08 No.108972870

File: 1758705451492465.png (1.36 MB, 1024x1024)

1.36 MB PNG

>>108972841
better quality image as source.

Anonymous
06/03/26(Wed)14:19:03 No.108972901

Anonymous 06/03/26(Wed)14:19:03 No.108972901

how have they not fired the greasy snake already?
https://www.reddit.com/r/comfyui/comments/1tvttzv/ideogram_40_just_open_sourced/

Anonymous
06/03/26(Wed)14:20:35 No.108972914

Anonymous 06/03/26(Wed)14:20:35 No.108972914

Comrades! It's Pride Month. Show us your pride !

Anonymous
06/03/26(Wed)14:22:53 No.108972925

Anonymous 06/03/26(Wed)14:22:53 No.108972925

File: 1769030986209693.png (1.21 MB, 1440x960)

1.21 MB PNG

>>108972914

Anonymous
06/03/26(Wed)14:25:23 No.108972940

Anonymous 06/03/26(Wed)14:25:23 No.108972940

File: spaghetti pixelization.png (225 KB, 1635x767)

225 KB PNG

>>108972829
Maybe when I get the spaghetti under control. This could all be done by a single wrapper.

Also there are features not yet implemented, like figuring out a design for a better manual palette node with a color picker. I really don't like how unwieldy it all is right now.

You can do all of what I'm doing with some basic math right now. The only 'hard' part is picking a gen resolution close to your target aspect ratio and target image size (in total pixels) which divides cleanly by 8 and the target pixel size. You could figure out how to do that yourself and probably come up with something good enough, or ChatGPT could come up with an algorithm in five seconds that does it. For quantizing to black and white an easy trick with the default nodes is to just composite the image onto a large black and white image, quantize to 2 colors, then crop it back to the original dimensions. Downscaling and upscaling can both be done through the default nodes trivially.

Also remember it's very easy to have any LLM make you a comfyUI node from scratch to do whatever you want if you give it this link: https://docs.comfy.org/llms.txt

All my time presently is absorbed by trying to make this app for tag correction on images for LoRA training, I'm just working on some UI elements for that. Since someone a few threads ago said the reason I don't like LoRA training is because of sour grapes or whatever. I'm getting a bit sidetracked on the way to making my LoRA...

Anonymous
06/03/26(Wed)14:26:23 No.108972946

Anonymous 06/03/26(Wed)14:26:23 No.108972946

File: 1778590750619831.png (1.47 MB, 1024x1024)

1.47 MB PNG

>>108972870
shalom (the people who decided to change the protagonist)

Anonymous
06/03/26(Wed)14:26:55 No.108972951

Anonymous 06/03/26(Wed)14:26:55 No.108972951

File: Klein-9b-197522610437817_(...).png (505 B, 84x85)

505 B PNG

Anonymous
06/03/26(Wed)14:29:02 No.108972966

Anonymous 06/03/26(Wed)14:29:02 No.108972966

File: 1766974471594411.png (254 KB, 448x336)

254 KB PNG

>84x85

Anonymous
06/03/26(Wed)14:29:27 No.108972972

Anonymous 06/03/26(Wed)14:29:27 No.108972972

>>108972940
>>108972951
Based and pixel pilled

Anonymous
06/03/26(Wed)14:29:32 No.108972973

Anonymous 06/03/26(Wed)14:29:32 No.108972973

Wonder how well it does as a second pass? Use other model like ZIT to get a rough composition, decode, re-encode latent, continue the steps from around 20% to completion so that the safety text doesn't get a chance to spawn in.
Because from the few booba gens I've tried if the text goes away within the first 10-20% steps it doesn't come back.

Anonymous
06/03/26(Wed)14:34:04 No.108973009

Anonymous 06/03/26(Wed)14:34:04 No.108973009

File: 5875.png (1.66 MB, 1504x848)

1.66 MB PNG

>>108972781
It probably just has the same features that most generative models have, no need to update, specially since its diffusion which is really easy to spot (as opposed to GANs or anything specifically trying to avoid detection), its also using flux VAE, which isn't new.
>>108972914

Anonymous
06/03/26(Wed)14:40:47 No.108973049

Anonymous 06/03/26(Wed)14:40:47 No.108973049

File: 1776615456186897.png (1.25 MB, 1024x1024)

1.25 MB PNG

>>108972946
okay, here is the most accurate cover.

Anonymous
06/03/26(Wed)14:42:55 No.108973063

Anonymous 06/03/26(Wed)14:42:55 No.108973063

File: 1girl.png (549 B, 76x76)

549 B PNG

Maybe this will be the one. Combined a few gens to make it.

Anonymous
06/03/26(Wed)14:45:29 No.108973081

Anonymous 06/03/26(Wed)14:45:29 No.108973081

File: 1girl_fix.png (547 B, 76x76)

547 B PNG

>>108973063
removed a 'detail' on the face which felt like meaningless noise. Better I think.

(the text extending out of the page is a stylistic choice I made, not a mistake. Maybe an artistic mistake.)

Anonymous
06/03/26(Wed)14:55:15 No.108973142

Anonymous 06/03/26(Wed)14:55:15 No.108973142

File: ComfyUI 2026-05-16-18_00074_.jpg (130 KB, 1398x484)

130 KB JPG

>>108972973
Tried it at 0.9 denoise and it unsurprisingly distorts the vagene into a tumor but could be worthwhile for partially clothed gens.
https://litter.catbox.moe/x7u5vprbixpzljfo.png

Anonymous
06/03/26(Wed)15:00:05 No.108973179

Anonymous 06/03/26(Wed)15:00:05 No.108973179

File: 1771110777435.jpg (18 KB, 398x376)

18 KB JPG

Reply blocked by safety filter

Anonymous
06/03/26(Wed)15:00:14 No.108973182

Anonymous 06/03/26(Wed)15:00:14 No.108973182

File: 1760794671931890.png (1.38 MB, 1024x1024)

1.38 MB PNG

>>108973049

Anonymous
06/03/26(Wed)15:04:09 No.108973206

Anonymous 06/03/26(Wed)15:04:09 No.108973206

Ouch 2 minutes for Turbo on my 3060, could have been worse but not a great start.
Anyway does anyone know why there are two checkpoints? What is "unconditional"?
Are different parts of the cfg equation calculated by different models here? Is it what it is referring, why?
I would also ask for what mu and std (standard deviation?) stand for but I doubt anyone can make sense of that comfy spaghetti.

Anonymous
06/03/26(Wed)15:08:26 No.108973232

Anonymous 06/03/26(Wed)15:08:26 No.108973232

File: 1766074143812898.jpg (40 KB, 500x500)

40 KB JPG

>>108972325
>>108972707
>>108972726
>mfw reading this
filtered again...

Anonymous
06/03/26(Wed)15:10:12 No.108973246

Anonymous 06/03/26(Wed)15:10:12 No.108973246

so what is this new model people are having censorship issues with?

Anonymous
06/03/26(Wed)15:18:21 No.108973285

Anonymous 06/03/26(Wed)15:18:21 No.108973285

>>108973246
https://huggingface.co/Comfy-Org/Ideogram-4

Anonymous
06/03/26(Wed)15:20:04 No.108973299

Anonymous 06/03/26(Wed)15:20:04 No.108973299

i'm 1girling
and i'm happy

Anonymous
06/03/26(Wed)15:22:55 No.108973311

Anonymous 06/03/26(Wed)15:22:55 No.108973311

>>108973285
why the fuck would they train censorship into it
>it GENERATES A CENSORED IMAGE instead of censoring it

Anonymous
06/03/26(Wed)15:25:00 No.108973324

Anonymous 06/03/26(Wed)15:25:00 No.108973324

>>108972768
catbox?

Anonymous
06/03/26(Wed)15:26:31 No.108973339

Anonymous 06/03/26(Wed)15:26:31 No.108973339

>>108973311
>>108972840

Anonymous
06/03/26(Wed)15:26:39 No.108973341

Anonymous 06/03/26(Wed)15:26:39 No.108973341

>>108973311
So that you can feel S̵̨̧̛̼̫͖͇̳̝͕̣̜̖̱̻̯̤̰̝̭͕͖̗͕̮̟̰͙̤̟͙̪͉̻̯͕̘̬͖̪̰͙͚̈́̾̈͗͗͐͛͗̈̋̐̀̂̍̈́͋̽́̄͂͗̋̔́̎̀͐͒̒̉̋͗̊̆͘̚͜͠͝͠ͅÀ̵̢̢̢̡̧̡̛̛̪̹͇̬͙͈̻̻͎̗̠̱̰̬̜̝̙͈̟̪̰͕̤͕̦͇̖͈̫̞͈̻͙̣̳̻̥͓̰̠͍͚͕͖̦͍̄͗͑̑̌̑̿́̉͆͒̍̿̾̀́̀̿̎́͑̀͂̽͗̂͗̓̓̃͑̋̌͗̎͂̇̑͌̽͆̿́͛́̃̐̽̓̋̈́́͊̈̐̾̏̍̌͗͋̆̒̿͊̍͐̉̊̉̈̀͋̄̓͜͜͜͝͝͝͝͠͠͝ͅͅF̵̢̨̧̧̡̡̧̢̢̧̛̛̤̝̺̻͔̝̣̼͈̣̪̭̜͚͕͙̟̫̝̮̹̥̫̙͙͉̺͚̦͍͍͍̰͕̪͕͎̩̝͙̘̠͚̙̞̠̻̬͖̱̯͖̟̙͇̪̦̬͍͍͙̣͕͑̽̈́̂̂͗͑̋͆̈̊́̿̂̐̏͛͒̌̐̽͗͊͛̏͊͋͐̽̑͛̂͌͐̓͐̾́̽̋̐͑̎͛̈̽̓̔̌̿͛̀̃́̀̿͌̋͌̆̄̽̂͌̇͂͂̓̾̄͆͛́̒̓́͊͘̚̕̕͘͜͠͝͠͝͠͝͠͝͝ͅȨ̸̡̙̫̝̘͔̟̝̳̙͇͚̭̪̦͚̬̤̼̫̖̗͇̈́̀̃̆̐̈́͑̇̃̓̒̈́̿̄̈́́̌̓̒̐̈́̇̚͠ chud, be grateful for once.

Anonymous
06/03/26(Wed)15:29:03 No.108973358

Anonymous 06/03/26(Wed)15:29:03 No.108973358

This might sound unbelievably retarded but how do you make people visually breath hard or take deep breaths then exhaling repeatedly with Wan or LTX?

I've tried describing the act of repeated inhaling and exhaling but it doesn't seem to work. Genning hyperventilating seems hard, or i'm retarded.

Anonymous
06/03/26(Wed)15:30:32 No.108973372

Anonymous 06/03/26(Wed)15:30:32 No.108973372

Remember when localroaches wouldn't shut up about their local uncensored models?

Anonymous
06/03/26(Wed)15:37:11 No.108973409

Anonymous 06/03/26(Wed)15:37:11 No.108973409

Cry more SaaSfag

Anonymous
06/03/26(Wed)15:41:29 No.108973434

Anonymous 06/03/26(Wed)15:41:29 No.108973434

File: 1749256176615482.png (1.37 MB, 1024x1024)

1.37 MB PNG

censorship is retarded.

we have klein edit + undress loras, even ltx 2.3 lewd finetunes if you want lewds.

Anonymous
06/03/26(Wed)15:43:10 No.108973442

Anonymous 06/03/26(Wed)15:43:10 No.108973442

>>108973372
we can just use another model anon
and you are still out of credits after 2 gens kek

Anonymous
06/03/26(Wed)15:56:43 No.108973521

Anonymous 06/03/26(Wed)15:56:43 No.108973521

Judging by the facts that its sensitivity is prompt dependent (probably one style of prompting was more over-represented than the other during the finetuning) and neither comfy nor diffusers code contain anything funny about censorship, we can conclude that this censorship was probably just post-training finetuning.
They thought the model to draw the grey censorship image when asked for no-no prompts. Clearly not enough regularization so it's fried as fuck when it comes to generating that even for most benign prompts. I am not sure it being less sensitive for json is a case of it being trained on json so that it is able to distinguish between "good" and "bad" prompts better when given json prompts or if it is a case of being trained on NL prompts so that it can pick up thought crimes easier with NL and json slips past. (I am inclined believe the latter as some anon was able to gen shitty nipples with json last thread.)
So anyway this might be salvageable with finetuning and I expect NSFW loras to work for the specific type of shit they are trained for, although they might be less versatile and less reliable to use than non-safety cucked models. It has a great vae and text encoder, so if it responds well to training it might be still worthwhile to thinker with it.

Anonymous
06/03/26(Wed)15:57:31 No.108973527

Anonymous 06/03/26(Wed)15:57:31 No.108973527

File: 1755038994684352.png (1.38 MB, 1024x1024)

1.38 MB PNG

>>108973434

Anonymous
06/03/26(Wed)15:58:16 No.108973532

Anonymous 06/03/26(Wed)15:58:16 No.108973532

>>108973521
Sounds like the exact same situation as their last model. Check out ponyfags Auraflow model to see how that turned out kek. No harm no foul tho I don't really give a shit about Ideogram.

Anonymous
06/03/26(Wed)15:59:26 No.108973544

Anonymous 06/03/26(Wed)15:59:26 No.108973544

>>108973232
I'm going to make a single standalone node that does the basic pixel calculations which I can upload to a pastebin or something, then give you a workflow showing the idea. Going to assume you have the popular 'ComfyUI-custom-scripts' thing from pythongosssss, because you'll need a node that does custom math

Almost done

Anonymous
06/03/26(Wed)15:59:44 No.108973545

Anonymous 06/03/26(Wed)15:59:44 No.108973545

>mfw Resource news

06/03/2026

>Ideogram 4.0: Open model at the forefront of design
https://ideogram.ai/blog/ideogram-4.0

>JoyAI-Echo: Pushing the Frontier of Long Audio-Visual Generation
https://echo-team-joy-future-academy-jd.github.io/Echo-LongVideo-Page

>Follow-Your-Preference++: Rethinking Preference Alignment for Image Inpainting
https://github.com/shenytzzz/Follow-Your-Preference

>LongLive-RAG: A General Retrieval-Augmented Framework for Long Video Generation
https://github.com/qixinhu11/LongLive-RAG

>MAI-Image-2.5
https://microsoft.ai/models/mai-image-2-5

>AAD-1: Asymmetric Adversarial Distillation for One-Step Autoregressive Video Generation
https://aad-1.github.io

>Inference-Time Scaling for Joint Audio-Video Generation
https://jung-jaemin.github.io/ITS-AVGen-Proj

>Video-Mirai: Autoregressive Video Diffusion Models Need Foresight
https://y0uroy.github.io/Video-Mirai

>Order within Chaos: Capturing Intrinsic Energy Anomalies for AI-Manipulated Image Forgery Localization
https://github.com/phoenixnir/FLAME

>VISReg: Variance-Invariance-Sketching Regularization for JEPA training
https://haiyuwu.github.io/visreg

>HumanNOVA: Photorealistic, Universal and Rapid 3D Human Avatar Modeling from a Single Image
https://HumanNOVA.github.io

>Cosmos 3: Omnimodal World Models for Physical AI
https://research.nvidia.com/labs/cosmos-lab/cosmos3

>TGV-KV: Text-Grounded KV Eviction for Vision-Language Models
https://github.com/Danielement321/TGV-KV

>JAVEDIT: Joint Audio-Visual Instruction-Guided Video Editing with Agentic Data Curation
https://ryanchenyn.github.io/projects/JAVEdit

>Any2Poster: Any-Source Poster Generation Across Modalities and Domains
https://github.com/Any2Poster/Any2Poster

>Martin Scorsese faces industry backlash over AI company partnership
https://www.independent.co.uk/bulletin/culture/martin-scorsese-ai-black-forest-labs-b2988639.html

Anonymous
06/03/26(Wed)16:00:45 No.108973550

Anonymous 06/03/26(Wed)16:00:45 No.108973550

>mfw Research news

06/03/2026

>Training-Free Multi-Concept LoRA Composition with Prompt-Aware Weighting
https://arxiv.org/abs/2606.03792

>Text-to-Image Models Need Less from Text Encoders Than You Think
https://nsping13.github.io/contextless-TTI

>Qwen-Image-Flash: Beyond Objective Design
https://arxiv.org/abs/2606.03746

>Bootstrap Your Generator: Unpaired Visual Editing with Flow Matching
https://research.nvidia.com/labs/par/byg

>Initialization is Half the Battle: Generating Diverse Images from a Guidance Potential Posterior
https://arxiv.org/abs/2606.02453

>Inverting the Generation Process of Denoising Diffusion Implicit Models: Empirical Evaluation and a Novel Method
https://arxiv.org/abs/2606.03111

>Retrieve What's Missing: Coverage-Maximizing Retrieval for Consistent Long Video Generation
https://arxiv.org/abs/2606.02479

>Drifting Preference Optimization for One-Step Generative Models
https://arxiv.org/abs/2606.02521

>Equilibrated Diffusion: Frequency-aware Textual Embedding for Equilibrated Image Customization
https://arxiv.org/abs/2606.02129

>Geometry-Aware Implicit Memory for Video World Models
https://gim-world.github.io

>GuidedBridge: Training-freely Improving Bridge Models with Prior Guidance
https://arxiv.org/abs/2606.03119

>MemoGen: Can Past Experience Improve Future Text-to-Image Generation?
https://arxiv.org/abs/2606.03243

>UniVerse: A Unified Modulation Framework for Segmentation-Free,Disentangled Multi-Concept Personalization
https://universe-personalization.github.io

>Diffusing in the Right Space: A Systematic Study of Latent Diffusability
https://arxiv.org/abs/2606.03578

>$A^2$: Smaller Self-Supervised ViTs Localize Better than Larger Ones
https://arxiv.org/abs/2606.03148

>Attention, May I Have Your Decision? Localizing Generative Choices in Diffusion Models
https://arxiv.org/abs/2604.06052

>You Don't Need All That Attention: Surgical Memorization Mitigation in Text-to-Image Diffusion Models
https://arxiv.org/abs/2603.00133

Anonymous
06/03/26(Wed)16:05:13 No.108973581

Anonymous 06/03/26(Wed)16:05:13 No.108973581

File: 23f.png (6 KB, 335x261)

6 KB PNG

Can I tell an LLM to use booru tags and it will do it properly or hallucinate her own?

Anonymous
06/03/26(Wed)16:06:59 No.108973590

Anonymous 06/03/26(Wed)16:06:59 No.108973590

>>108973532
He didn't train on the last Ideogram though.
He trained on an extremely underbaked model someone else trained on Ideogram outputs, without pruning any of the censored images.
Anyway I am just saying it's probably worth experimenting with. You are probably not going to get rid of almost 100% of the "image blocked by safety filter"s without finetuning on millions of images but 90-95% might be doable on a very small scale finetune, or hell even lora, without spending a fortune.

Anonymous
06/03/26(Wed)16:08:15 No.108973600

Anonymous 06/03/26(Wed)16:08:15 No.108973600

File: anima1_00004_.jpg (462 KB, 1344x1728)

462 KB JPG

Anonymous
06/03/26(Wed)16:08:59 No.108973602

Anonymous 06/03/26(Wed)16:08:59 No.108973602

>>108973581
Depends on LLM but aside from booru tags that have special meanings that aren't trivially obvious, most llms will cope fine with tag style instructions. They are smart enough to just guess what you might be meaning even though they have no proper booru training.

Anonymous
06/03/26(Wed)16:11:45 No.108973622

Anonymous 06/03/26(Wed)16:11:45 No.108973622

>>108973581
I've done this. Try preprocessing a list of Booru tags with their wiki entries and build a prompt like "below is a list of tags with descriptions, only output the tags, follow their descriptions". You can't do ALL tags out there, that will blow up the context window even of the larger open source models out there, but you could always just do more than one pass and split them up.
This worked halfway decent with Qwen 3.5, I haven't tried Gemma 4 but assume it would work even better as it's ridiculously easy to get it to uncensored with a system prompt.
Also look up Xgrammar where you can literally force the model to only output certain formats. Not sure whether it's compatible yet with Gemma4 though

Anonymous
06/03/26(Wed)16:12:00 No.108973625

Anonymous 06/03/26(Wed)16:12:00 No.108973625

> >108973545
> >108973550
fuck off

Anonymous
06/03/26(Wed)16:14:24 No.108973645

Anonymous 06/03/26(Wed)16:14:24 No.108973645

File: aa.png (1.9 MB, 1400x704)

1.9 MB PNG

Anonymous
06/03/26(Wed)16:17:50 No.108973667

Anonymous 06/03/26(Wed)16:17:50 No.108973667

File: anima1_00010_.jpg (483 KB, 1344x1728)

483 KB JPG

Anonymous
06/03/26(Wed)16:19:44 No.108973679

Anonymous 06/03/26(Wed)16:19:44 No.108973679

>>108973600
>>108973667
those are really good. base anima or lora?

Anonymous
06/03/26(Wed)16:24:10 No.108973703

Anonymous 06/03/26(Wed)16:24:10 No.108973703

File: ComfyUI_temp_dzunn_00002_.png (1.17 MB, 1024x1472)

1.17 MB PNG

Thank you euler cfg pp, very cool

Anonymous
06/03/26(Wed)16:24:58 No.108973705

Anonymous 06/03/26(Wed)16:24:58 No.108973705

File: anima1_00018_.jpg (466 KB, 1344x1728)

466 KB JPG

>>108973679
photo lora test, took fraction of dataset and tried how it learns. pretty good so far

Anonymous
06/03/26(Wed)16:26:03 No.108973714

Anonymous 06/03/26(Wed)16:26:03 No.108973714

>>108973705
How many images for test and in total? How many steps have you trained for?

Anonymous
06/03/26(Wed)16:27:27 No.108973727

Anonymous 06/03/26(Wed)16:27:27 No.108973727

>newest localkek slopware has built-in censorship
>saas remains decades ahead due to their uncensored all-knowing base models
local is an absolute embarrassment

Anonymous
06/03/26(Wed)16:28:29 No.108973733

Anonymous 06/03/26(Wed)16:28:29 No.108973733

>>108973705
really nice so far, keep it up

Anonymous
06/03/26(Wed)16:33:10 No.108973762

Anonymous 06/03/26(Wed)16:33:10 No.108973762

>>108973703
It's a based sampler desu. It's certainly more difficult to use than others but once you get the hang of it the results are incredible.

Anonymous
06/03/26(Wed)16:33:57 No.108973767

Anonymous 06/03/26(Wed)16:33:57 No.108973767

>>108973714
54 epochs with batch 8, around 600 images something like that

>>108973733
TY only gonna get better

Anonymous
06/03/26(Wed)16:34:29 No.108973772

Anonymous 06/03/26(Wed)16:34:29 No.108973772

>>108973703
Just reconfigure your pp, Anon!

Anonymous
06/03/26(Wed)16:36:01 No.108973787

Anonymous 06/03/26(Wed)16:36:01 No.108973787

Is there a good target for number of images in a lora dataset for anima? I've been making some style loras with 100-200 images and usually get good results around 1500 steps. Is it best to use the largest dataset possible with good image diversity and style consistency?

Anonymous
06/03/26(Wed)16:38:29 No.108973803

Anonymous 06/03/26(Wed)16:38:29 No.108973803

>>108973727
I can tell you're not putting your heart into it this time. Sad desu.

Anonymous
06/03/26(Wed)16:42:04 No.108973824

Anonymous 06/03/26(Wed)16:42:04 No.108973824

>>108973787
quality>quantity always, been training for a long time and the two things that consistently improve the quality the most were purging the dataset of garbage and increasing the dim/rank

Anonymous
06/03/26(Wed)16:43:30 No.108973833

Anonymous 06/03/26(Wed)16:43:30 No.108973833

File: anima1_00033_.jpg (833 KB, 1344x1728)

833 KB JPG

Anonymous
06/03/26(Wed)16:44:04 No.108973836

Anonymous 06/03/26(Wed)16:44:04 No.108973836

File: b0dcq5.png (2.04 MB, 1216x832)

2.04 MB PNG

Anonymous
06/03/26(Wed)16:49:23 No.108973870

Anonymous 06/03/26(Wed)16:49:23 No.108973870

File: anima1_00038_.jpg (521 KB, 1344x1728)

521 KB JPG

one for battlestation thread

Anonymous
06/03/26(Wed)16:50:26 No.108973877

Anonymous 06/03/26(Wed)16:50:26 No.108973877

I thought api was supposed to be censored? How come local models are spitting out safety images? That never happened with Grok

Anonymous
06/03/26(Wed)16:50:54 No.108973880

Anonymous 06/03/26(Wed)16:50:54 No.108973880

35?

Anonymous
06/03/26(Wed)16:51:17 No.108973882

Anonymous 06/03/26(Wed)16:51:17 No.108973882

>>108973870
where i post my kinosovl from

Anonymous
06/03/26(Wed)16:51:44 No.108973892

Anonymous 06/03/26(Wed)16:51:44 No.108973892

>>108973787
Above 100 is a good target for style loras, pretty much any model. I expect the difference between 100 and 1000 images to be not worth it for most loras.

Anonymous
06/03/26(Wed)16:59:04 No.108973934

Anonymous 06/03/26(Wed)16:59:04 No.108973934

File: ComfyUI_temp_qkigd_00003_.png (1.97 MB, 768x1408)

1.97 MB PNG

Anonymous
06/03/26(Wed)17:02:14 No.108973961

Anonymous 06/03/26(Wed)17:02:14 No.108973961

File: pixel_00015_.png (1 KB, 64x96)

1 KB PNG

>>108973232
Done, here you go
https://files.catbox.moe/rksrik.zip

Put that node in your custom nodes folder. You should also have this installed:
https://github.com/pythongosssss/ComfyUI-Custom-Scripts
Just for the math node. I have my own math node that I like better but most people have this one installed already, I tried to make something you could use right away

Anonymous
06/03/26(Wed)17:02:48 No.108973965

Anonymous 06/03/26(Wed)17:02:48 No.108973965

Is anon trying to pretend like it's all local models and not just Ideogram?

Anonymous
06/03/26(Wed)17:03:56 No.108973976

Anonymous 06/03/26(Wed)17:03:56 No.108973976

>>108973824
>>108973892
ty fellas I appreciate the help

Anonymous
06/03/26(Wed)17:05:48 No.108973991

Anonymous 06/03/26(Wed)17:05:48 No.108973991

>>108973877
imagine spending 10k on a rig just to get cockblocked by a safety filter when you try to gen "1girl, standing"
LOOOOOOOOOOOOOL

Anonymous
06/03/26(Wed)17:07:07 No.108974000

Anonymous 06/03/26(Wed)17:07:07 No.108974000

y did anon reply to himself ?

Anonymous
06/03/26(Wed)17:08:25 No.108974014

Anonymous 06/03/26(Wed)17:08:25 No.108974014

File: anima1_00044_.jpg (650 KB, 1344x1728)

650 KB JPG

>>108973882

Anonymous
06/03/26(Wed)17:09:38 No.108974023

Anonymous 06/03/26(Wed)17:09:38 No.108974023

>>108974000
uh oh melty

Anonymous
06/03/26(Wed)17:11:52 No.108974039

Anonymous 06/03/26(Wed)17:11:52 No.108974039

>moving forward all local models will be censored
How does that make you feel, anon?

Anonymous
06/03/26(Wed)17:13:03 No.108974043

Anonymous 06/03/26(Wed)17:13:03 No.108974043

>his seething grows quieter and quieter

Anonymous
06/03/26(Wed)17:18:09 No.108974068

Anonymous 06/03/26(Wed)17:18:09 No.108974068

my Anima upscales all like shit compared to the base image for some fucking reason
when i used illustrious my upscales were always objectively better than the base image

Anonymous
06/03/26(Wed)17:18:49 No.108974072

Anonymous 06/03/26(Wed)17:18:49 No.108974072

>>108972752
HUGE NEWS EVERYONE I COMPILED SDCPP AND IT DOESN"T CRASH

(but gguf don't work, comes out all white, preview is all black, apparently it's a bug)

Anonymous
06/03/26(Wed)17:19:50 No.108974078

Anonymous 06/03/26(Wed)17:19:50 No.108974078

>>108974039
doesn't matter. we have the weights for flux dev 2.

Anonymous
06/03/26(Wed)17:21:15 No.108974085

Anonymous 06/03/26(Wed)17:21:15 No.108974085

>>108974068
latent upscale?

Anonymous
06/03/26(Wed)17:21:25 No.108974088

Anonymous 06/03/26(Wed)17:21:25 No.108974088

>>108974068
User error also anima can do larger resolutions out of the box anyway so highresfixing is just a cope

Anonymous
06/03/26(Wed)17:24:17 No.108974104

Anonymous 06/03/26(Wed)17:24:17 No.108974104

File: anima1_00052_.jpg (687 KB, 1344x1728)

687 KB JPG

>>108974068
I can't get good upscales with it either

Anonymous
06/03/26(Wed)17:26:18 No.108974119

Anonymous 06/03/26(Wed)17:26:18 No.108974119

are people making any celeb/streamer lora's for illustrious anymore? or do i bite the bullet and use sd 1.5 and pony

Anonymous
06/03/26(Wed)17:29:58 No.108974141

Anonymous 06/03/26(Wed)17:29:58 No.108974141

>>108974119
you have to be the change you want to see in the world. you do know how to train loras, right?

Anonymous
06/03/26(Wed)17:30:37 No.108974143

Anonymous 06/03/26(Wed)17:30:37 No.108974143

File: anima1_00055_.jpg (867 KB, 1344x1728)

867 KB JPG

Anonymous
06/03/26(Wed)17:31:47 No.108974150

Anonymous 06/03/26(Wed)17:31:47 No.108974150

>>108974039
KEKSTONE WILL UNCENSOR IT!!! JUST DONATE $500000 SO HE CAN TRAIN AT 256x256 ON FURRY DIAPERSCAT SLOPPA

Anonymous
06/03/26(Wed)17:31:50 No.108974151

Anonymous 06/03/26(Wed)17:31:50 No.108974151

>>108974068
Try genning with a natively higher resolution first. I stopped using upscalers with Anima. I like the native look better.

Anonymous
06/03/26(Wed)17:32:17 No.108974155

Anonymous 06/03/26(Wed)17:32:17 No.108974155

File: ComfyUI_Anima_03198_.png (1.33 MB, 1344x960)

1.33 MB PNG

Anonymous
06/03/26(Wed)17:34:10 No.108974174

Anonymous 06/03/26(Wed)17:34:10 No.108974174

File: 153246486.png (328 KB, 800x778)

328 KB PNG

im going back to dalle mini. thats where the sovl is at

Anonymous
06/03/26(Wed)17:34:49 No.108974175

Anonymous 06/03/26(Wed)17:34:49 No.108974175

>>108974104
>>108974068
well you might be in luck, nvidia dropped the pid checkpoints for qwen today, this could
https://huggingface.co/Comfy-Org/PixelDiT
the comfyui master doesnt have it yet but should drop soon, support is already in the nightly

Anonymous
06/03/26(Wed)17:35:09 No.108974180

Anonymous 06/03/26(Wed)17:35:09 No.108974180

File: anima1_00059_.jpg (1.06 MB, 1344x1728)

1.06 MB JPG

Anonymous
06/03/26(Wed)17:35:14 No.108974182

Anonymous 06/03/26(Wed)17:35:14 No.108974182

I really thought anon would get more trolling out of the Ideogram release. I'm pretty disappointed in him he's barely trying.

Anonymous
06/03/26(Wed)17:35:42 No.108974184

Anonymous 06/03/26(Wed)17:35:42 No.108974184

File: smdyyn.png (1.94 MB, 1216x832)

1.94 MB PNG

Anonymous
06/03/26(Wed)17:35:49 No.108974186

Anonymous 06/03/26(Wed)17:35:49 No.108974186

>>108974175
>this could
im retarded
this could enable upscaling for anima *

Anonymous
06/03/26(Wed)17:37:57 No.108974204

Anonymous 06/03/26(Wed)17:37:57 No.108974204

File: ComfyUI_00001_.png (607 KB, 1024x1024)

607 KB PNG

Anonymous
06/03/26(Wed)17:42:46 No.108974243

Anonymous 06/03/26(Wed)17:42:46 No.108974243

File: 213751CUI_00001_.png (1.24 MB, 1536x1152)

1.24 MB PNG

Anonymous
06/03/26(Wed)17:44:27 No.108974257

Anonymous 06/03/26(Wed)17:44:27 No.108974257

>>108974175
there was a z-image section in https://huggingface.co/nvidia/PiD , is that not released for comfyui yet or is it pixeldit_1300m_1024px_bf16.safetensors ?

Anonymous
06/03/26(Wed)17:50:34 No.108974301

Anonymous 06/03/26(Wed)17:50:34 No.108974301

>>108974257
zit was released initially already, what they dropped yesterday are checkpoints for SDXL and qwen vae (so what anima uses) as well, as well as a fixed flux2 one
https://huggingface.co/nvidia/PiD/tree/main/checkpoints/PiD_res2kto4k_sr4x_official_qwenimage_distill_4step
https://huggingface.co/nvidia/PiD/tree/main/checkpoints/PiD_res2kto4k_sr4x_official_sdxl_distill_4step
support is only in comfy nightly so far though

Anonymous
06/03/26(Wed)17:52:42 No.108974318

Anonymous 06/03/26(Wed)17:52:42 No.108974318

File: ComfyUI_00004_.png (1.14 MB, 1024x1024)

1.14 MB PNG

Anonymous
06/03/26(Wed)17:53:28 No.108974325

Anonymous 06/03/26(Wed)17:53:28 No.108974325

>>108974141
I saw an old git repository for local gen. the LoRA_Easy_Training_Scripts linked in some posts. but setting anything up on nvidia 50 series is a bitch, but it's been a minute since i've tried again.

Anonymous
06/03/26(Wed)17:56:24 No.108974346

Anonymous 06/03/26(Wed)17:56:24 No.108974346

File: ctma9p.png (1.31 MB, 1024x1024)

1.31 MB PNG

Anonymous
06/03/26(Wed)17:58:22 No.108974365

Anonymous 06/03/26(Wed)17:58:22 No.108974365

>>108974182
>Ideogram
what release?

Anonymous
06/03/26(Wed)17:59:23 No.108974374

Anonymous 06/03/26(Wed)17:59:23 No.108974374

>>108974301
>support is only in comfy nightly so far though
There is support in sdcpp :^) not sure which models it works with yet...

https://github.com/leejet/stable-diffusion.cpp/pull/1585

Anonymous
06/03/26(Wed)18:00:02 No.108974376

Anonymous 06/03/26(Wed)18:00:02 No.108974376

>>108974301
>support is only in comfy nightly so far though
ah. i guess I'll wait for a bit then, but it seems promising

Anonymous
06/03/26(Wed)18:00:08 No.108974378

Anonymous 06/03/26(Wed)18:00:08 No.108974378

>localkeks have more safety filters than SaaS
LOOOOOOOOL

Anonymous
06/03/26(Wed)18:00:24 No.108974381

Anonymous 06/03/26(Wed)18:00:24 No.108974381

>>108974365
>>108974182
https://huggingface.co/Comfy-Org/Ideogram-4
this???

I don't understand, not joking lol

Anonymous
06/03/26(Wed)18:01:13 No.108974391

Anonymous 06/03/26(Wed)18:01:13 No.108974391

File: 215430CUI_00001_.png (1.13 MB, 1536x1152)

1.13 MB PNG

What sampler do you guys use? I keep switching between er_sde and dpmpp 2m sde.

Anonymous
06/03/26(Wed)18:02:14 No.108974397

Anonymous 06/03/26(Wed)18:02:14 No.108974397

>>108974374
also, is this an upscale model? idk

>pid_flux1_512_to_2048_4step_bf16.safetensors

There's always so much new stuff.

>>108974376
>ah. i guess I'll wait for a bit then, but it seems promising
It looks like it can be tested out using sd cpp.

I'll have to try it out.

Anonymous
06/03/26(Wed)18:03:55 No.108974411

Anonymous 06/03/26(Wed)18:03:55 No.108974411

>>108974085
I think that helped a lot actually, thanks.
I was doing VAE Decode --> Encode inbetween each sampler. Apparently Anima doesn't like that and it makes faces slightly uglier. Illustrious was fine with it.

Anonymous
06/03/26(Wed)18:04:32 No.108974414

Anonymous 06/03/26(Wed)18:04:32 No.108974414

>>108974391
euler_cfg_pp

Anonymous
06/03/26(Wed)18:04:58 No.108974416

Anonymous 06/03/26(Wed)18:04:58 No.108974416

>>108974397
>also, is this an upscale model?
it can be used for ZIT upscaling at least
https://github.com/Comfy-Org/ComfyUI/pull/14103

Anonymous
06/03/26(Wed)18:06:44 No.108974427

Anonymous 06/03/26(Wed)18:06:44 No.108974427

>>108974416
It's extremely confusing.

found the doc:
https://github.com/leejet/stable-diffusion.cpp/blob/master/docs/pid.md

Anonymous
06/03/26(Wed)18:07:16 No.108974431

Anonymous 06/03/26(Wed)18:07:16 No.108974431

why is vae decode such a resource hog?

Anonymous
06/03/26(Wed)18:07:18 No.108974432

Anonymous 06/03/26(Wed)18:07:18 No.108974432

>>108974391
res_2m with sgm_uniform and er_sde with bong_tangent

Anonymous
06/03/26(Wed)18:07:21 No.108974433

Anonymous 06/03/26(Wed)18:07:21 No.108974433

>>108974378
More? Anon and math are enemy

Anonymous
06/03/26(Wed)18:08:35 No.108974440

Anonymous 06/03/26(Wed)18:08:35 No.108974440

>>108974431
goes from tiny little ultra compressed latent into full pixel image that needs to fit inside your GPU's VRAM

Anonymous
06/03/26(Wed)18:09:39 No.108974446

Anonymous 06/03/26(Wed)18:09:39 No.108974446

>>108974427
>>108974416
ok.

>In stable-diffusion.cpp, PiD currently runs as an image edit pipeline

so sdcpp doesn't have proper pid support yet. if you want that, nightly.

But, this sounds like a cool use of pid.

Anonymous
06/03/26(Wed)18:10:41 No.108974449

Anonymous 06/03/26(Wed)18:10:41 No.108974449

>>108974440
Just say "because math"

Anonymous
06/03/26(Wed)18:15:16 No.108974470

Anonymous 06/03/26(Wed)18:15:16 No.108974470

>>108974346
Cool. What model?

Anonymous
06/03/26(Wed)18:17:40 No.108974484

Anonymous 06/03/26(Wed)18:17:40 No.108974484

It's really weird to hear about ideogram. I did a lot of gens with that back then, but I ditched it when Flux dev 1 came out.

Anonymous
06/03/26(Wed)18:18:25 No.108974488

Anonymous 06/03/26(Wed)18:18:25 No.108974488

File: 221138CUI_00001_.png (939 KB, 1152x1536)

939 KB PNG

>>108974432
>bong_tangent
lol
that's new

Anonymous
06/03/26(Wed)18:19:21 No.108974498

Anonymous 06/03/26(Wed)18:19:21 No.108974498

File: ComfyUI_00012_.png (258 KB, 1024x1024)

258 KB PNG

Anonymous
06/03/26(Wed)18:19:43 No.108974499

Anonymous 06/03/26(Wed)18:19:43 No.108974499

>>108974431
Related to this but I don't get how tiled decode works compared to regular decode. Why does my RAM usage shoot up with normal vae decode but tiled decode doesn't when they're both still storing everything in RAM while decoding?

Anonymous
06/03/26(Wed)18:19:56 No.108974502

Anonymous 06/03/26(Wed)18:19:56 No.108974502

>>108974470
It's img2img with flux1-dev, but t2i does pretty much the same thing (see strength param).

./sd-cli.exe \
--diffusion-model ../models/flux1-dev-q8_0.gguf \
--t5xxl ../models/t5xxl_fp16.safetensors \
--clip_l ../models/clip_l.safetensors \
--vae ../models/ae.sft \
-H 1024 -W 1024 \
-i ../Documents/1.png \
--strength 0.78 \
-o $ofile \
-p "$P" \
-s -1 \
--sampling-method euler \
--steps 20 \
--guidance 3.5 \
--cfg-scale 1.0 \
--clip-on-cpu \
-t 8

Anonymous
06/03/26(Wed)18:25:10 No.108974532

Anonymous 06/03/26(Wed)18:25:10 No.108974532

Has anybody switched from using comfyui manager (maybe still using it just for reference but not to start the download) in order to manage comfyui using pixi instead of pip?

Anonymous
06/03/26(Wed)18:26:32 No.108974540

Anonymous 06/03/26(Wed)18:26:32 No.108974540

Has nvidia pid been adapted for wan 2.2 instead of sdxl?

Anonymous
06/03/26(Wed)18:28:44 No.108974551

Anonymous 06/03/26(Wed)18:28:44 No.108974551

>https://civitai.red/models/2668799/cyberrealistic-anima
slop time

Anonymous
06/03/26(Wed)18:30:36 No.108974563

Anonymous 06/03/26(Wed)18:30:36 No.108974563

>figured out how to use the extra model paths after 3 hours before chatgpt immediately corrected me
>finally fire up comfyui
>have to tinker around just to save images on another drive as something other than png
ah, so it begins

Anonymous
06/03/26(Wed)18:35:47 No.108974589

Anonymous 06/03/26(Wed)18:35:47 No.108974589

File: 175568.jpg (1.1 MB, 1532x3165)

1.1 MB JPG

Anonymous
06/03/26(Wed)18:36:46 No.108974591

Anonymous 06/03/26(Wed)18:36:46 No.108974591

>>108974499
pretty sure economizing vram is the purpose of decoding tiles instead of the whole target image in one go.

Anonymous
06/03/26(Wed)18:37:30 No.108974595

Anonymous 06/03/26(Wed)18:37:30 No.108974595

File: ComfyUI_00016_.png (1.27 MB, 1024x1024)

1.27 MB PNG

Anonymous
06/03/26(Wed)18:41:09 No.108974615

Anonymous 06/03/26(Wed)18:41:09 No.108974615

File: anima1_00078_.jpg (540 KB, 1344x1728)

540 KB JPG

nsfw https://files.catbox.moe/xwik7e.jpg

Anonymous
06/03/26(Wed)18:43:49 No.108974630

Anonymous 06/03/26(Wed)18:43:49 No.108974630

File: 1758341443714999.png (3.33 MB, 2048x1024)

3.33 MB PNG

>>108974301
from a few tests with a slop custom node, it can be pretty neat. seems like the more your image looks like an illustration the worse it gets though.
also slightly changes colors

Anonymous
06/03/26(Wed)18:44:07 No.108974635

Anonymous 06/03/26(Wed)18:44:07 No.108974635

>>108974551
>early access 7k buzz
>"semi realistic"
holy kek

Anonymous
06/03/26(Wed)18:47:31 No.108974654

Anonymous 06/03/26(Wed)18:47:31 No.108974654

File: 7de92e2e-a6ae-4484-a923-e(...).png (3.78 MB, 1536x2304)

3.78 MB PNG

>>108974551
the ai stare

Anonymous
06/03/26(Wed)18:47:47 No.108974656

Anonymous 06/03/26(Wed)18:47:47 No.108974656

Instead of 30 FPS videos, is there a way to setup a workflow so you instead can generate frames in continuity with each other one at a time? What if 4 FPS is enough for me?

Anonymous
06/03/26(Wed)18:50:07 No.108974673

Anonymous 06/03/26(Wed)18:50:07 No.108974673

>>108974589
Nice

Anonymous
06/03/26(Wed)18:51:15 No.108974680

Anonymous 06/03/26(Wed)18:51:15 No.108974680

>>108974635
Yeah. Looks like he tuned the worst ai slop in

Anonymous
06/03/26(Wed)18:52:19 No.108974691

Anonymous 06/03/26(Wed)18:52:19 No.108974691

>>108974551
>"i cant get proper realism into the model so ill call it "semi-realistic"
>unironically charge people money for it
who is even the audience for this

Anonymous
06/03/26(Wed)18:52:20 No.108974692

Anonymous 06/03/26(Wed)18:52:20 No.108974692

https://youtu.be/XogoQnkQUO8?si=Ah7Nb_pE49-CLGG2
why has noone talked about this?

Anonymous
06/03/26(Wed)18:53:20 No.108974698

Anonymous 06/03/26(Wed)18:53:20 No.108974698

>>108974589
i respect the idea but its so blurry and burnt

Anonymous
06/03/26(Wed)18:54:19 No.108974707

Anonymous 06/03/26(Wed)18:54:19 No.108974707

>>108974698
filmed on nokia

Anonymous
06/03/26(Wed)18:59:40 No.108974737

Anonymous 06/03/26(Wed)18:59:40 No.108974737

File: ComfyUI_Anima_03226_.png (1.39 MB, 1344x960)

1.39 MB PNG

Anonymous
06/03/26(Wed)19:05:26 No.108974759

Anonymous 06/03/26(Wed)19:05:26 No.108974759

>>108974630
how do you make stereo images like this?

Anonymous
06/03/26(Wed)19:08:55 No.108974772

Anonymous 06/03/26(Wed)19:08:55 No.108974772

File: 1751148517247041.png (98 KB, 526x954)

98 KB PNG

>>108974759
should be a default node

Anonymous
06/03/26(Wed)19:10:21 No.108974785

Anonymous 06/03/26(Wed)19:10:21 No.108974785

File: ComfyUI_Ideogram_0001.png (73 KB, 703x1251)

73 KB PNG

Anonymous
06/03/26(Wed)19:15:16 No.108974807

Anonymous 06/03/26(Wed)19:15:16 No.108974807

>>108974785
And a new record on you in their Redis database. A.I watch you!

Anonymous
06/03/26(Wed)19:16:43 No.108974817

Anonymous 06/03/26(Wed)19:16:43 No.108974817

>>108974785
Do these niggas really think people should only use models for generating pictures of dogs?

Anonymous
06/03/26(Wed)19:16:50 No.108974819

Anonymous 06/03/26(Wed)19:16:50 No.108974819

File: 1769651642728811.png (3.03 MB, 2048x1024)

3.03 MB PNG

>>108974630
photograph example with lora

Anonymous
06/03/26(Wed)19:16:54 No.108974821

Anonymous 06/03/26(Wed)19:16:54 No.108974821

>>108974772
cool, thanks
Just had an idea that maybe one could use qwen image edit to maybe change the angle on one of these images with like 1 degree and then stitch them together. Could maybe make the 3d effect a little stronger.
Something to experiment on some other day

Anonymous
06/03/26(Wed)19:21:51 No.108974848

Anonymous 06/03/26(Wed)19:21:51 No.108974848

File: z052dt.png (1.74 MB, 1024x1024)

1.74 MB PNG

Anonymous
06/03/26(Wed)19:23:22 No.108974858

Anonymous 06/03/26(Wed)19:23:22 No.108974858

Im out of the loop. But just looking at what I'm seeing here this looks like what happened when Aura flow go to released (anyone remember that) the service the model used to get its training data from had a cat that would appear the model blocked the prompt. It looks like they did the same here but instead of a cat it just generates text.

Anonymous
06/03/26(Wed)19:25:12 No.108974868

Anonymous 06/03/26(Wed)19:25:12 No.108974868

>>108974785
do apikeks really? This would never happen with a local model, local is free and uncensored!

Anonymous
06/03/26(Wed)19:25:35 No.108974869

Anonymous 06/03/26(Wed)19:25:35 No.108974869

>>108974817
it's not that unreasonable actually.
Cats are also great source material for fun images.

Anonymous
06/03/26(Wed)19:26:47 No.108974879

Anonymous 06/03/26(Wed)19:26:47 No.108974879

File: 232133CUI_00001_.png (1.08 MB, 1536x1152)

1.08 MB PNG

>>108974785
catbox?

Anonymous
06/03/26(Wed)19:30:44 No.108974904

Anonymous 06/03/26(Wed)19:30:44 No.108974904

https://research.nvidia.com/labs/par/byg/
bygots, rise up

Anonymous
06/03/26(Wed)19:40:26 No.108974957

Anonymous 06/03/26(Wed)19:40:26 No.108974957

File: 1758277066069077.png (16 KB, 613x207)

16 KB PNG

>>108972914
gayest thing i could find

Anonymous
06/03/26(Wed)19:42:17 No.108974968

Anonymous 06/03/26(Wed)19:42:17 No.108974968

Censorship aside, how are the output from Ideogram that do get through? Are they good?

Anonymous
06/03/26(Wed)19:43:32 No.108974975

Anonymous 06/03/26(Wed)19:43:32 No.108974975

File: ComfyUI_Anima_03259_.png (1.28 MB, 1344x960)

1.28 MB PNG

Anonymous
06/03/26(Wed)19:44:23 No.108974981

Anonymous 06/03/26(Wed)19:44:23 No.108974981

>>108974957
this looks like the brightness adjust for a horror game

Anonymous
06/03/26(Wed)19:56:00 No.108975041

Anonymous 06/03/26(Wed)19:56:00 No.108975041

https://www.reddit.com/r/StableDiffusion/comments/1tw6c4y/sorry_not_sorry_ideogram_jailbroken_in_1_easy_step/

seems people have figured out the censor slop?

Anonymous
06/03/26(Wed)20:07:27 No.108975105

Anonymous 06/03/26(Wed)20:07:27 No.108975105

>>108974968
I am getting shit. Some anatomy errors too.
I only tested a few images in Turbo and Default though.
It seems you need to do some really fucking tedious json autism if you want good results.

Anonymous
06/03/26(Wed)20:10:17 No.108975119

Anonymous 06/03/26(Wed)20:10:17 No.108975119

File: 4689876.webm (2.5 MB, 256x448)

2.5 MB WEBM

Anonymous
06/03/26(Wed)20:10:56 No.108975122

Anonymous 06/03/26(Wed)20:10:56 No.108975122

File: Ideogram_00012_.png (1.51 MB, 1024x1024)

1.51 MB PNG

Such a shame because it's textual capabilities are impressive.
Mogs anything else local and most API models too.
This is default. (20 steps)

Anonymous
06/03/26(Wed)20:12:50 No.108975129

Anonymous 06/03/26(Wed)20:12:50 No.108975129

File: Screenshot from 2026-06-0(...).png (163 KB, 1249x898)

163 KB PNG

>>108974488
it's totally not lol. Here's what it looks like.

always visualize your sigmas. then you watch in the preview, whatever step, you know what level of detail is being worked on. All public models act like this, big to small.

Anonymous
06/03/26(Wed)20:13:51 No.108975134

Anonymous 06/03/26(Wed)20:13:51 No.108975134

>>108975122
unc here. Learn gimp, really, this is dumb lol

but ideogram brings back memories. I did the whole 360 degree Janduz (sp) cycle.

Anonymous
06/03/26(Wed)20:15:14 No.108975143

Anonymous 06/03/26(Wed)20:15:14 No.108975143

File: Screenshot from 2026-06-0(...).png (163 KB, 1249x898)

163 KB PNG

>>108975129
Here's beta57. It's flatter.

Anonymous
06/03/26(Wed)20:17:01 No.108975156

Anonymous 06/03/26(Wed)20:17:01 No.108975156

File: Ideogram_00013_.png (1.57 MB, 1024x1024)

1.57 MB PNG

>>108975122
Even works at turbo.
>>108975134
I know gimp unc, I am just testing shit, practical value be damned.
Usually when they say "our model has great text" they mean some stupid benchmeme but this model actually has great text.

Anonymous
06/03/26(Wed)20:17:55 No.108975164

Anonymous 06/03/26(Wed)20:17:55 No.108975164

>>108975122
>>108975134
here
https://archive.org/details/new-360-symbolic-degrees

These are a good source of test material. It's very interesting to see that since that time attitudes and laws relating to nudity are both more strict, and at the same time homosexuality is very legal and common. I prefer the time when they were in the closet and nudity wasn't a high crime punishable by systemic rape.

Anonymous
06/03/26(Wed)20:18:56 No.108975170

Anonymous 06/03/26(Wed)20:18:56 No.108975170

>>108975156
I'll respect it for its speed, at least.

Anonymous
06/03/26(Wed)20:19:35 No.108975172

Anonymous 06/03/26(Wed)20:19:35 No.108975172

i'm out of the loop. why is this ideogram release noteworthy? is it supposed to be the best local image model? or are people just interested because it's new

Anonymous
06/03/26(Wed)20:22:26 No.108975184

Anonymous 06/03/26(Wed)20:22:26 No.108975184

File: Klein-9b-734066213960284_(...).png (635 B, 84x85)

635 B PNG

Been trying to get a decent icon of a hand pulling a photo out of a librarian's drawer... this is the best so far and it's still shit

Skill issue, I know.

Close to switching back to black-on-white just because the icons are so much easier to make

Anonymous
06/03/26(Wed)20:23:32 No.108975190

Anonymous 06/03/26(Wed)20:23:32 No.108975190

>>108975122
there is something wrong with the inference, the images are all so bad, is shouldn't be because fp8 right?

Anonymous
06/03/26(Wed)20:28:08 No.108975210

Anonymous 06/03/26(Wed)20:28:08 No.108975210

>>108975190
I am suspecting something might be off with the schizo workflow Comfy ships as well, but I am not sure.
No, I don't think because it's FP8. FP8 only release sucks for training and making more quants, but it shouldn't tank the inference this much.

Anonymous
06/03/26(Wed)20:28:09 No.108975211

Anonymous 06/03/26(Wed)20:28:09 No.108975211

>Discussion and Development of Local Image, Video, and Music Models
>Music models
Was that always there?

Anonymous
06/03/26(Wed)20:28:39 No.108975213

Anonymous 06/03/26(Wed)20:28:39 No.108975213

File: Tired-Meme41-1248604943.jpg (63 KB, 720x523)

63 KB JPG

>>108974785
So, on top of wrestling with samples, complex multi pass workflows, plastic skin, melted hands and feet, catastrophic forgetting and failed LoRAs just to pull off what should be a stupidly simple concept on local models ,now we also have to fight this brand new flavor of censorship? Fantastic.

Anonymous
06/03/26(Wed)20:31:35 No.108975228

Anonymous 06/03/26(Wed)20:31:35 No.108975228

>>108975211
We're dying out here, we need to bring in some fresh anons. For some reason Catjack has been spamming his gens all week long.

Anonymous
06/03/26(Wed)20:33:22 No.108975232

Anonymous 06/03/26(Wed)20:33:22 No.108975232

>>108975211
:) good to see.

Anonymous
06/03/26(Wed)20:33:45 No.108975235

Anonymous 06/03/26(Wed)20:33:45 No.108975235

>>108975228
Catjak just constantly spergs at random anons. Actual thread lolcow

Anonymous
06/03/26(Wed)20:37:16 No.108975252

Anonymous 06/03/26(Wed)20:37:16 No.108975252

>>108975041
>Arbitrarily ablating so many layers at random weights
Enjoy the body horror and incomprehensible AI nightmares.
Maybe if you could figure out a way to disable censorship by only slightly changing (0.8-0.9) small amount of layers it could be useful.
Like it seems to work but results aren't good.
This sigma crap another redditor linked in the thread:
https://www.reddit.com/r/StableDiffusion/comments/1tw6gmq/ideogram_safety_filter_is_removed_by_using/
didn't work for me but other timestep shenanigans seem worth experimenting.

Anonymous
06/03/26(Wed)20:37:35 No.108975254

Anonymous 06/03/26(Wed)20:37:35 No.108975254

>>108975235
I love Jack the Cat, I wish we had more anons dedicated to this general. This general has so much potential but it needs more love desu

Anonymous
06/03/26(Wed)20:39:00 No.108975264

Anonymous 06/03/26(Wed)20:39:00 No.108975264

>>108975232
It was added only 2 threads ago....did any new models come out?

Anonymous
06/03/26(Wed)20:41:38 No.108975283

Anonymous 06/03/26(Wed)20:41:38 No.108975283

>>108975264
i like my ltx music

Anonymous
06/03/26(Wed)20:42:13 No.108975288

Anonymous 06/03/26(Wed)20:42:13 No.108975288

>>108975041
>>108975252
just wait for the sarah peterson patch, she'll fix it

Anonymous
06/03/26(Wed)20:43:05 No.108975295

Anonymous 06/03/26(Wed)20:43:05 No.108975295

>>108975283
LTX can do music?
At what length?

Anonymous
06/03/26(Wed)20:45:03 No.108975300

Anonymous 06/03/26(Wed)20:45:03 No.108975300

>>108975264
We have ace step xl sft, with dcw.

I'll go ahead and start work on my next song, I guess lmao.

The key thing is to realize that prompting is very important. If your prompt is bad, you can do a2a with cover strength at 0.3. this can mitigate some issues. you need good audio equipment to hear ace step 1.5 xl gens in their full glory, though they only are essentially at idk 48kz mp3 maybe in total quality, ok? a paradox! like one of those women that has a wang.

Anonymous
06/03/26(Wed)20:45:20 No.108975302

Anonymous 06/03/26(Wed)20:45:20 No.108975302

>>108975295
it can go on forever, but i haven't tried making a full song with it since you can only extend it in short increments in order to give enough memory for the context window to fit enough of the song to remain consistent

Anonymous
06/03/26(Wed)20:47:45 No.108975315

Anonymous 06/03/26(Wed)20:47:45 No.108975315

>>108975300
I'm not fully familiar with that functionality, any guide guides or recommended baby's first settings?

Anonymous
06/03/26(Wed)20:48:46 No.108975320

Anonymous 06/03/26(Wed)20:48:46 No.108975320

>>108975302
That sounds tedious

Anonymous
06/03/26(Wed)20:52:05 No.108975336

Anonymous 06/03/26(Wed)20:52:05 No.108975336

>>108975320
yes, it takes forever when you want actual lyrics since it takes a long time to generate the extension, and then you can find that the singer doesn't say the correct thing. but i think it sounds okay considering it's a video model
https://files.catbox.moe/fuxnsb.mp4

Anonymous
06/03/26(Wed)20:53:22 No.108975337

Anonymous 06/03/26(Wed)20:53:22 No.108975337

>>108975252
>>108975041
>>108974785

I personally hate all you and your "I have money and a GPU" vibe and I fantasize daily about watching you suffer BUT EVEN SO I think you should not be using Ideogram or messing around trying to jailbreak it because:

->If Ideogram sees people deliberately avoiding their model because of this new censorship, it will pressure the people who came up with this nonsense to rethink their approach. Hit them where it hurts and that is the usage stats.

2: If you keep trying to jailbreak it, you are making things worse for everyone. Every new model they drop will be harder to crack, more labs will start copying this censorship playbook, and the whole local ecosystem becomes a giant headache for all of us.

The best move here is to go on strike and stop supporting anti gonner models.

Anonymous
06/03/26(Wed)20:54:10 No.108975344

Anonymous 06/03/26(Wed)20:54:10 No.108975344

>>108975336
why does local attract so many blind and deaf people?

Anonymous
06/03/26(Wed)20:55:37 No.108975349

Anonymous 06/03/26(Wed)20:55:37 No.108975349

>>108975315
steps 100, cfg scale between 6 and 13, shift as high as 11, dcw mode double, dcw scaler 0.0008, dcw high scaler 0.0005, ode euler

that's where I've settled. There's another person who is ahead of me on this.

other models need rhyming, ace step doesn't need it, idk if it even helps - much.

but it still works better with quatrains of about the same number of syllables.

the biggest tip is don't use "compose" and keep the audio codes thing blank. well, imo it's better. that keeps it squared up, but I don't like it. But when you do this, realize your prompt is kind of sequential. It relates to how clip works. I have not fully figured out how prompting works, because it's weird how it works. It knows descriptions of sounds.

Anonymous
06/03/26(Wed)20:56:15 No.108975353

Anonymous 06/03/26(Wed)20:56:15 No.108975353

you will never date your ai generated 1girl

Anonymous
06/03/26(Wed)20:56:44 No.108975356

Anonymous 06/03/26(Wed)20:56:44 No.108975356

>>108975337
this is whats going to happen
redditors will find a trivial jailbreak then everyone will use it for a while, realize its shit then forget about it and go back to klein/zit

Anonymous
06/03/26(Wed)20:57:24 No.108975359

Anonymous 06/03/26(Wed)20:57:24 No.108975359

>>108975353
Meanie

Anonymous
06/03/26(Wed)20:58:57 No.108975368

Anonymous 06/03/26(Wed)20:58:57 No.108975368

>>108975337
>I personally hate all you and your "I have money and a GPU" vibe and I fantasize daily about watching you suffer
Why???????

Anonymous
06/03/26(Wed)20:58:58 No.108975369

Anonymous 06/03/26(Wed)20:58:58 No.108975369

>>108975353
the trick is to become the 1girl

Anonymous
06/03/26(Wed)21:04:11 No.108975393

Anonymous 06/03/26(Wed)21:04:11 No.108975393

Has anyone gotten the comfy workflow to work for ideogram4 using the fp8 models? I have a 4090, I get "mul_cuda" not implemented for 'Float8_e4m3fn'. I'm pretty sure Ada supports that so I'm confused. I'm on nightly. I guess I'll have to wait more. Very strange of them to not release the fp16 model. If I had that, it would work, since I have the VRAM.

Anonymous
06/03/26(Wed)21:06:01 No.108975401

Anonymous 06/03/26(Wed)21:06:01 No.108975401

whats so cool about ideogram4?

Anonymous
06/03/26(Wed)21:07:13 No.108975409

Anonymous 06/03/26(Wed)21:07:13 No.108975409

>>108975401
its new and powerful

Anonymous
06/03/26(Wed)21:07:21 No.108975410

Anonymous 06/03/26(Wed)21:07:21 No.108975410

>>108975401
its a cuckold simulator

Anonymous
06/03/26(Wed)21:07:58 No.108975414

Anonymous 06/03/26(Wed)21:07:58 No.108975414

>>108975369
whats the best prompt for this

Anonymous
06/03/26(Wed)21:08:03 No.108975415

Anonymous 06/03/26(Wed)21:08:03 No.108975415

>>108975393
Why are you supporting a model that doesn't want (You) using it? Use Anima cuckie.

Anonymous
06/03/26(Wed)21:08:32 No.108975416

Anonymous 06/03/26(Wed)21:08:32 No.108975416

File: 1752555687172871.png (1.5 MB, 1024x1024)

1.5 MB PNG

Anonymous
06/03/26(Wed)21:09:29 No.108975424

Anonymous 06/03/26(Wed)21:09:29 No.108975424

>>108975349
I see I have been using the gradio UI and just settled at 200 steps at heun. For me it's been best for whatever they call the guidance in that interface to sit a 2.5-3.
>>108975336
What made you decide to try this?
I guess there isn't much discussion on music gen, I didn't see much of it in the threads

Anonymous
06/03/26(Wed)21:10:17 No.108975427

Anonymous 06/03/26(Wed)21:10:17 No.108975427

>>108975295
https://files.catbox.moe/8s5ca3.mp4 (warning - nudity)
Yes, and sometimes it's pretty good, but it doesn't follow the prompt all that well when using 8-step distilled, and I don't have the patience to wait longer. I take the music as a nice freebie when it comes out OK.

Anonymous
06/03/26(Wed)21:11:09 No.108975432

Anonymous 06/03/26(Wed)21:11:09 No.108975432

>>108975356
Well, yes.

There's nothing here to be excited about, unless you are REALLY interested in generating text, as in less than 1% of local users.

Eventually we will have a ZiT/Klein killer model, but this sure ain't it.

Looks like anima is finally dethroning the SDXL finetunes for anime stuff though.

Anonymous
06/03/26(Wed)21:13:33 No.108975447

Anonymous 06/03/26(Wed)21:13:33 No.108975447

>>108975432
Aside from that Anima is overtaking Zeta Image and Klein, I keep noticing more and more realistic loras and finetunes for Anima.

Anonymous
06/03/26(Wed)21:13:40 No.108975448

Anonymous 06/03/26(Wed)21:13:40 No.108975448

>>108975432
>Looks like anima is finally dethroning the SDXL
all I've seen is half sticking with IL and half going with anima. mostly speed complaints

Anonymous
06/03/26(Wed)21:14:10 No.108975450

Anonymous 06/03/26(Wed)21:14:10 No.108975450

>>108975427
Interesting
I'm going to hone my skills with Ace step 1.5 I think it has a lot of potential the only problem I have is that there is a noticeable quality jump with the stl or whatever top end model between 80 where there's not much gain then it just jumps up around 180- 200 step I guess it's 400 when you use heun.

Anonymous
06/03/26(Wed)21:14:17 No.108975452

Anonymous 06/03/26(Wed)21:14:17 No.108975452

File: image.png (1.04 MB, 1024x1024)

1.04 MB PNG

>>108975415
Anima is undertrained garbage which gens hands like 2023 dall-e. Also, I tell me not to gen porn with your model, I'm going to gen porn with it, you know?

Anonymous
06/03/26(Wed)21:16:01 No.108975457

Anonymous 06/03/26(Wed)21:16:01 No.108975457

>>108975424
>What made you decide to try this?
sometimes i get random music in my videos, so i thought it had lots of music in the data

Anonymous
06/03/26(Wed)21:16:40 No.108975458

Anonymous 06/03/26(Wed)21:16:40 No.108975458

>>108975129
>>108975143
Very interesting, thanks.
I'll take a look.

Anonymous
06/03/26(Wed)21:17:35 No.108975464

Anonymous 06/03/26(Wed)21:17:35 No.108975464

>>108975432
>Looks like anima is finally dethroning the SDXL finetunes
All the actual skilled prompters and artist moved to it after preview 1 released desu

Anonymous
06/03/26(Wed)21:18:37 No.108975467

Anonymous 06/03/26(Wed)21:18:37 No.108975467

>>108975447
how easy is it to finetune anima? i have over 100k real photos

Anonymous
06/03/26(Wed)21:19:46 No.108975471

Anonymous 06/03/26(Wed)21:19:46 No.108975471

I want to use Ideogram alongside windows 12 and nodes 2.0

Anonymous
06/03/26(Wed)21:21:43 No.108975480

Anonymous 06/03/26(Wed)21:21:43 No.108975480

>>108975467
if you can afford to fine-tune just do the base cosmos model and get rid of the grifter licence

Anonymous
06/03/26(Wed)21:23:07 No.108975484

Anonymous 06/03/26(Wed)21:23:07 No.108975484

>>108975480
Anime dataset is actually good for NSFW realism, it gives more hot and creative compositions

Anonymous
06/03/26(Wed)21:23:21 No.108975486

Anonymous 06/03/26(Wed)21:23:21 No.108975486

>>108975467
Default training params work fine
>>108975480
Since he can tune he's probably not a jeet so he doesn't have to worry about the licence

Anonymous
06/03/26(Wed)21:24:41 No.108975494

Anonymous 06/03/26(Wed)21:24:41 No.108975494

>>108975452
id be more than willing to help you get better outputs with anima but you strike me as the kind of anon who doesnt want help and would rather complain

Anonymous
06/03/26(Wed)21:24:41 No.108975495

Anonymous 06/03/26(Wed)21:24:41 No.108975495

>>108975467
I don't know

Anonymous
06/03/26(Wed)21:25:42 No.108975500

Anonymous 06/03/26(Wed)21:25:42 No.108975500

>>108975452
Why you follow me everywhere i go?

Anonymous
06/03/26(Wed)21:26:43 No.108975504

Anonymous 06/03/26(Wed)21:26:43 No.108975504

>grifter licence
Has anon found any proof of this yet or is he still just trolling

Anonymous
06/03/26(Wed)21:27:21 No.108975509

Anonymous 06/03/26(Wed)21:27:21 No.108975509

>>108975467
easy

Anonymous
06/03/26(Wed)21:27:38 No.108975511

Anonymous 06/03/26(Wed)21:27:38 No.108975511

>>108975494
why did you even bother replying to him lol

Anonymous
06/03/26(Wed)21:37:45 No.108975558

Anonymous 06/03/26(Wed)21:37:45 No.108975558

>>108975424
>settled at 200 steps at heun
ace step cpp doesn't have heun, and is capped at 100 steps. idk why.

dcw needs to change depending on how many steps you use.

If I ever go back to comfyui for ace step, I would use exp_heun_2_x0_sde for the sampler, and I always use tan2 for my scheduler now. It's like a double Z shape. basically, bong tangent "rushes" through the mid sigmas, but tan2 has an adjustable plateau in the middle, or wherever you want it. anyway, these seem to be the best sampler + scheduler, so I think. Shame the sampler isn't on sdcpp, and shame I have to use comfyui to collect my sigmas, not that it takes that long.

Anonymous
06/03/26(Wed)21:38:42 No.108975562

Anonymous 06/03/26(Wed)21:38:42 No.108975562

>>108975504
Just so you know, whoever you are, it looks really sus from the outside when someone jumps to defend one specific model this fast always. We were talking shit about Ideogram earlier and nobody said a word, but the second Anima comes up, suddenly there's a white knight in the thread. That's a little too convenient bro.

Anonymous
06/03/26(Wed)21:38:42 No.108975563

Anonymous 06/03/26(Wed)21:38:42 No.108975563

>>108975504
tdruss was bitching about not making enough money abloobloo

Anonymous
06/03/26(Wed)21:39:35 No.108975568

Anonymous 06/03/26(Wed)21:39:35 No.108975568

>>108975558
Wait there's a c++ version well fuck!
What do you gain from that version?

Anonymous
06/03/26(Wed)21:40:48 No.108975571

Anonymous 06/03/26(Wed)21:40:48 No.108975571

File: output_1780533942.png (1.56 MB, 832x1216)

1.56 MB PNG

>>108975134
>>108975164
Here's zit doing the first degree, I didn't prompt or prompt enhance, just pasted it in.

Anonymous
06/03/26(Wed)21:41:48 No.108975577

Anonymous 06/03/26(Wed)21:41:48 No.108975577

>>108975568
It runs on my rdna2 card, and has dcw, and a2a.

rdna2 is kind of the runt of the cards, amd has partially dropped support.

Anonymous
06/03/26(Wed)21:44:19 No.108975585

Anonymous 06/03/26(Wed)21:44:19 No.108975585

>>108975577
>AMD shitting the bed
It's all so tiresome it's like they throw the match on purpose

Anonymous
06/03/26(Wed)21:45:54 No.108975590

Anonymous 06/03/26(Wed)21:45:54 No.108975590

guys guys
remember
anima is le bad >:^(

Anonymous
06/03/26(Wed)21:48:01 No.108975594

Anonymous 06/03/26(Wed)21:48:01 No.108975594

>>108975585
Yep, I'm through with all the companies. It's clear they have conspired to limit ram, and to limit "cuda" matrix math.

Anonymous
06/03/26(Wed)21:51:08 No.108975604

Anonymous 06/03/26(Wed)21:51:08 No.108975604

>>108975041
>>108975252
Thinking again, how much does the theory even make sense here?
They modified the relevant weights during post-training process and taught the model to draw the grey image when presented with forbidden conditioning.
So we want to disrupt the layer weights in such a way that:
1) It doesn't completely cripple the model or make it too weird, so essentially a small enough delta on as few layers as possible
2) No longer draws the safety filter image
3) Instead draws whatever it knew about the naughty conditioning before post-training?
The latter doesn't seem very possible through ablation. I guess the realistic goal here is to make it less prone to ludicrous false positives. If we assume that they fucked up the training and unintentionally fried the model and that's why it is so trigger happy, it might be possible to moderately clamp select few amount of probably the composition related middle layers and no longer get so many safety filter images, without also raping the model.
But I dunno finetuning seems like a much better way out of this mess.
I was thinking about putting some combinations into comfy oven before going to bed, and see if anything interesting comes up when I wake up, but I am now reconsidering if this is worth it.

Anonymous
06/03/26(Wed)21:54:00 No.108975609

Anonymous 06/03/26(Wed)21:54:00 No.108975609

The last remaining hope is Celestial, it's obvious that amd intentionally nerfed the matrix math on rdna4.

Anonymous
06/03/26(Wed)21:54:26 No.108975610

Anonymous 06/03/26(Wed)21:54:26 No.108975610

>>108975562
>>108975563
So no proof yet? Shame.

Anonymous
06/03/26(Wed)21:55:00 No.108975611

Anonymous 06/03/26(Wed)21:55:00 No.108975611

what do we want?
matrix math
when do we want it?
NOW!

Anonymous
06/03/26(Wed)21:56:35 No.108975622

Anonymous 06/03/26(Wed)21:56:35 No.108975622

>>108975471
theyre calling him the most pozzed genner known to anon

Anonymous
06/03/26(Wed)21:57:06 No.108975625

Anonymous 06/03/26(Wed)21:57:06 No.108975625

>>108975622
I'm disappointed he's going to use bare metal instead of cloud.

Anonymous
06/03/26(Wed)22:03:55 No.108975654

Anonymous 06/03/26(Wed)22:03:55 No.108975654

>>108975563
dont worry bro im sure soon someone will join your team to make apache2 anima

Anonymous
06/03/26(Wed)22:05:20 No.108975663

Anonymous 06/03/26(Wed)22:05:20 No.108975663

>>108975654
Funny how that became a nothing burger.

Anonymous
06/03/26(Wed)22:08:51 No.108975673

Anonymous 06/03/26(Wed)22:08:51 No.108975673

File: output_1780537152.png (1.63 MB, 832x1216)

1.63 MB PNG

>>108975571
aries 2 of Janduz, not prompted, maybe I should come back and prompt.

Anonymous
06/03/26(Wed)22:10:04 No.108975677

Anonymous 06/03/26(Wed)22:10:04 No.108975677

>>108975663
Somtimes, a model can do one thing well, and feels like it should be a lora. Like ovis image whatever, it can do cartoon text really well. but I don't think it's good enough to do i2i, so pointless, I guess.

Anonymous
06/03/26(Wed)22:10:39 No.108975678

Anonymous 06/03/26(Wed)22:10:39 No.108975678

File: ideo.jpg (444 KB, 1088x1936)

444 KB JPG

Anonymous
06/03/26(Wed)22:12:42 No.108975683

Anonymous 06/03/26(Wed)22:12:42 No.108975683

>https://echo-team-joy-future-academy-jd.github.io/Echo-Infinity/
New model released that lets you generate 24 hour long videos based on Wan.

Anonymous
06/03/26(Wed)22:17:30 No.108975696

Anonymous 06/03/26(Wed)22:17:30 No.108975696

Everyone's favorite model is released.
https://civitai.red/models/2544636?modelVersionId=2983680

Anonymous
06/03/26(Wed)22:18:20 No.108975703

Anonymous 06/03/26(Wed)22:18:20 No.108975703

>>108975683
>24 hour long videos
They're trying to kill the coomers, aren't they ?

Anonymous
06/03/26(Wed)22:24:32 No.108975720

Anonymous 06/03/26(Wed)22:24:32 No.108975720

>>108975696
At least this time he's not lying and has labeled it as a merge

Anonymous
06/03/26(Wed)22:25:24 No.108975723

Anonymous 06/03/26(Wed)22:25:24 No.108975723

>>108975696
Why is everyone so hyped about this? Legit question.

Anonymous
06/03/26(Wed)22:27:24 No.108975727

Anonymous 06/03/26(Wed)22:27:24 No.108975727

>>108975723
>Why is everyone so hyped about this?
It's Pride Month. Let 'em get loud.

Anonymous
06/03/26(Wed)22:27:27 No.108975728

Anonymous 06/03/26(Wed)22:27:27 No.108975728

>>108975723
>everyone
Sure Jan...

Anonymous
06/03/26(Wed)22:27:36 No.108975729

Anonymous 06/03/26(Wed)22:27:36 No.108975729

>>108975723
Same reason people get hyped about dogshit popular music. Probably a mixture of shilling and some kind of viral snowball effect past a certain point.

Anonymous
06/03/26(Wed)22:28:51 No.108975736

Anonymous 06/03/26(Wed)22:28:51 No.108975736

>>108975723
>Why is everyone so hyped about this?
some struggle to "stabilize" outputs from raw finetunes so they need something like WAI with a rigidity that compensates for their lack of prompt-fu

Anonymous
06/03/26(Wed)22:33:08 No.108975754

Anonymous 06/03/26(Wed)22:33:08 No.108975754

music enthusiasts here? what's the meta for local musicgen now? i'm interested in melodic instrumentals. was having fun with audiocraft a year ago.

Anonymous
06/03/26(Wed)22:37:31 No.108975780

Anonymous 06/03/26(Wed)22:37:31 No.108975780

>>108975754
I am but AI will never come anywhere close to generating anything I could remotely enjoy (Classical) so I don't even bother

Anonymous
06/03/26(Wed)22:37:31 No.108975781

Anonymous 06/03/26(Wed)22:37:31 No.108975781

File: 023341CUI_00001_.png (1.31 MB, 1536x1152)

1.31 MB PNG

>>108975736
Hm, the output is a indeed a lot cleaner: >>108974243

Anonymous
06/03/26(Wed)22:39:26 No.108975787

Anonymous 06/03/26(Wed)22:39:26 No.108975787

>>108975781
sovl vs slop, as always with these shitmixes

Anonymous
06/03/26(Wed)22:39:49 No.108975788

Anonymous 06/03/26(Wed)22:39:49 No.108975788

>>108975754
>what's the meta for local musicgen now?
FL Studio

Anonymous
06/03/26(Wed)22:41:59 No.108975797

Anonymous 06/03/26(Wed)22:41:59 No.108975797

>>108975787
wai pretty much always was a nice finetune tho

Anonymous
06/03/26(Wed)22:42:40 No.108975802

Anonymous 06/03/26(Wed)22:42:40 No.108975802

>>108975494
I don't do a whole lot of imagegen anymore except to feed into ltx-2.3 i2v, and nsfw sdxl models are good enough for that. Maybe anime will improve with more training. I dunno, I just don't have the free time I used to.

Anonymous
06/03/26(Wed)22:43:07 No.108975805

Anonymous 06/03/26(Wed)22:43:07 No.108975805

>>108975797
it was never a finetune tho. he just shitmixes random loras into slopshit

Anonymous
06/03/26(Wed)22:43:56 No.108975812

Anonymous 06/03/26(Wed)22:43:56 No.108975812

>>108975797
>finetune

Anonymous
06/03/26(Wed)22:44:48 No.108975815

Anonymous 06/03/26(Wed)22:44:48 No.108975815

>>108975781
i wont convince you to NOT use it i dont care about that. my only point was that mixes and merges are designed to give a default style which some prefer and others do not. if you like the style it brings than by all means use it, but i dont really care for models that have their own built in style. it usually leads to them looking less like real images and more like generated outputs.
>>108975797
>finetune
lel

Anonymous
06/03/26(Wed)22:46:28 No.108975823

Anonymous 06/03/26(Wed)22:46:28 No.108975823

What if I train a lora with the "image blocked by safety filter" images it outputs, tagged with a diverse set of the prompts it rejects and apply it with -1 strength at runtime? What would happen?

Anonymous
06/03/26(Wed)22:48:43 No.108975833

Anonymous 06/03/26(Wed)22:48:43 No.108975833

>>108975802
>I just don't have the free time I used to.
the future belongs to the zoomers and gen alpha, old man

Anonymous
06/03/26(Wed)22:53:09 No.108975855

Anonymous 06/03/26(Wed)22:53:09 No.108975855

>>108975823
negative loras dont work, you may prevent that from being generate but it will generate garbage

Anonymous
06/03/26(Wed)22:57:39 No.108975871

Anonymous 06/03/26(Wed)22:57:39 No.108975871

File: 024743CUI_00001_.png (1.39 MB, 1536x1152)

1.39 MB PNG

>>108975815
Ok, so this is the same input from the collage in the OP. I used a sketch lora for that look and WAI seems to completely ignore that. I assume it's because it's a merged model and the lora doesn't work very well and not because it is overriding the effect it is supposed to have on the image? I don't know much about how this works, sorry.

Anonymous
06/03/26(Wed)22:58:28 No.108975880

Anonymous 06/03/26(Wed)22:58:28 No.108975880

File: ComfyUI_Anima_03269_.png (1.28 MB, 1344x960)

1.28 MB PNG

Anonymous
06/03/26(Wed)23:01:35 No.108975894

Anonymous 06/03/26(Wed)23:01:35 No.108975894

json prompting is the gayest thing on the planet

Anonymous
06/03/26(Wed)23:01:50 No.108975895

Anonymous 06/03/26(Wed)23:01:50 No.108975895

>>108975855
What if I apply it at +1 strength at the unconditional model then?
Could this model's unique structure make it work?

Anonymous
06/03/26(Wed)23:02:08 No.108975897

Anonymous 06/03/26(Wed)23:02:08 No.108975897

>>108975871
does your lora have trigger word? have you tried increasing the lora strength?

Anonymous
06/03/26(Wed)23:03:12 No.108975902

Anonymous 06/03/26(Wed)23:03:12 No.108975902

File: 025827CUI_00001_.png (1.55 MB, 1536x1152)

1.55 MB PNG

The N64 lora seems to work really well.

>>108975897
It does. I can also try increasing the lora strength. Hold on.

Anonymous
06/03/26(Wed)23:07:17 No.108975930

Anonymous 06/03/26(Wed)23:07:17 No.108975930

File: ComfyUI_Anima_03277_.png (1.29 MB, 1344x960)

1.29 MB PNG

Anonymous
06/03/26(Wed)23:09:33 No.108975941

Anonymous 06/03/26(Wed)23:09:33 No.108975941

>>108975894
It's a humiliation ritual for sure. Great prompt adherence has questionable value when no will sit through the tedium.
I think it's meant for some agentic loop with LLM in the middle, I don't think they expect us to type that garbage by hand.
Still sucks though.

Anonymous
06/03/26(Wed)23:10:37 No.108975947

Anonymous 06/03/26(Wed)23:10:37 No.108975947

File: 030506CUI_00001_.png (1.44 MB, 1536x1152)

1.44 MB PNG

from 1.00 strength to 1.30. Doesn't look much different. I'll crank it up to 2

>>108975930
Sick

Anonymous
06/03/26(Wed)23:11:28 No.108975951

Anonymous 06/03/26(Wed)23:11:28 No.108975951

>>108975754
I'm still new to it but Ace Step seems to do well if you can prompt things out. There's a fuck ton of settings I don't understand but so far so good
https://vocaroo.com/1gFu5B3LIcBC

Anonymous
06/03/26(Wed)23:12:53 No.108975953

Anonymous 06/03/26(Wed)23:12:53 No.108975953

File: 030905CUI_00001_.png (1.13 MB, 1536x1152)

1.13 MB PNG

lol it added femkuna to the image
I guess it does look lil bit sketchier though

Anonymous
06/03/26(Wed)23:14:54 No.108975965

Anonymous 06/03/26(Wed)23:14:54 No.108975965

>>108975941
I just use an llm to generate the prompt. They have the system prompt they use in their github.

https://github.com/ideogram-oss/ideogram4/blob/main/src/ideogram4/magic_prompt_system_prompts/v1.txt

I have been able to generate nsfw images. I don't think I have triggered the safety image once.

I don't have much opinion so far on quality, but it is decent enough.

Anonymous
06/03/26(Wed)23:16:49 No.108975975

Anonymous 06/03/26(Wed)23:16:49 No.108975975

File: 031344CUI_00001_.png (1.32 MB, 1536x1152)

1.32 MB PNG

2.50

Anonymous
06/03/26(Wed)23:18:35 No.108975983

Anonymous 06/03/26(Wed)23:18:35 No.108975983

File: ComfyUI_Anima_03292_.png (1.07 MB, 1344x960)

1.07 MB PNG

>>108975947
Thanks.

Anima is actually pretty fun once you get the hang of it.

Anonymous
06/03/26(Wed)23:22:06 No.108975993

Anonymous 06/03/26(Wed)23:22:06 No.108975993

>>108975965
it doesn't look better than zit though. I don't give a fuck about text

Anonymous
06/03/26(Wed)23:27:37 No.108976002

Anonymous 06/03/26(Wed)23:27:37 No.108976002

Are the luddites dead yet

Anonymous
06/03/26(Wed)23:30:00 No.108976013

Anonymous 06/03/26(Wed)23:30:00 No.108976013

File: 032512CUI_00002_.png (1.4 MB, 1536x1152)

1.4 MB PNG

>>108975983
Check the wlop lora on civitai. It has a similar artstyle.

Anonymous
06/03/26(Wed)23:32:31 No.108976018

Anonymous 06/03/26(Wed)23:32:31 No.108976018

>>108974431
because you didn't
--disable-dynamic-vram

Anonymous
06/03/26(Wed)23:34:59 No.108976025

Anonymous 06/03/26(Wed)23:34:59 No.108976025

>>108975369
ywn
baw

Anonymous
06/03/26(Wed)23:36:43 No.108976032

Anonymous 06/03/26(Wed)23:36:43 No.108976032

File: ComfyUI_Anima_03320_.png (1.44 MB, 1344x960)

1.44 MB PNG

>>108976013
funny enough, I tried "@wlop", but didn't like the result, so I removed him from the mix.

I'll try the lora though.

Anonymous
06/03/26(Wed)23:46:47 No.108976075

Anonymous 06/03/26(Wed)23:46:47 No.108976075

File: 1778558574961652.png (1.57 MB, 1344x960)

1.57 MB PNG

>>108976032
Man, the plastic slop people tolerate...

Anonymous
06/03/26(Wed)23:48:24 No.108976080

Anonymous 06/03/26(Wed)23:48:24 No.108976080

>>108976075
desu still very much plastic

Anonymous
06/03/26(Wed)23:49:24 No.108976083

Anonymous 06/03/26(Wed)23:49:24 No.108976083

>>108976080
Get your eyes checked, sis

Anonymous
06/03/26(Wed)23:50:22 No.108976089

Anonymous 06/03/26(Wed)23:50:22 No.108976089

>>108976018
Comfy keeps threatening that they will remove this option, but I'm starting to doubt it ever will since they suck so bad at keeping a decent vram threshold with dynamic-vram, meaning you will OOM

People will rather use more system ram and take the ~5% perfomance hit

Anonymous
06/03/26(Wed)23:51:04 No.108976093

Anonymous 06/03/26(Wed)23:51:04 No.108976093

File: ComfyUI_Anima_03341_.png (1.47 MB, 1344x960)

1.47 MB PNG

>>108976075
stop ruining my slop

Anonymous
06/03/26(Wed)23:58:49 No.108976113

Anonymous 06/03/26(Wed)23:58:49 No.108976113

>>108976002
Nope but they are well on their way out. They're in the violently attacking high profile figures phase. People don't suffer that kind of behavior long.

Anonymous
06/04/26(Thu)00:06:08 No.108976136

Anonymous 06/04/26(Thu)00:06:08 No.108976136

File: BasevsWAI.jpg (521 KB, 3075x1146)

521 KB JPG

Yeah, I think I get what Anonymous meant when he said 'mixes and merges are designed to give a default style'. The checkpoint seems to steamroll most of the loras I've tried,

Anonymous
06/04/26(Thu)00:07:29 No.108976140

Anonymous 06/04/26(Thu)00:07:29 No.108976140

>>108976136
holy shit wai is such a shitty slop, how can anyone like this

Anonymous
06/04/26(Thu)00:13:52 No.108976149

Anonymous 06/04/26(Thu)00:13:52 No.108976149

>>108975349
>dcw mode double, dcw scaler 0.0008, dcw high scaler 0.0005,
This does make a difference way better separation

Anonymous
06/04/26(Thu)00:16:11 No.108976158

Anonymous 06/04/26(Thu)00:16:11 No.108976158

>>108976149
Yeah, I remember the anon posting about dcw. dcw is supposedly originally for images, but you don't see anyone doing it.

Anonymous
06/04/26(Thu)00:19:21 No.108976163

Anonymous 06/04/26(Thu)00:19:21 No.108976163

>>108976083
its okay you can still try again

Anonymous
06/04/26(Thu)00:26:01 No.108976185

Anonymous 06/04/26(Thu)00:26:01 No.108976185

whats wai?

Anonymous
06/04/26(Thu)00:31:51 No.108976203

Anonymous 06/04/26(Thu)00:31:51 No.108976203

File: output_1780546927.png (1.89 MB, 832x1216)

1.89 MB PNG

best 1girl, you can't even compete.

Anonymous
06/04/26(Thu)00:37:28 No.108976218

Anonymous 06/04/26(Thu)00:37:28 No.108976218

someone make Bitcoin-chan hanging from a noose
thanks

Anonymous
06/04/26(Thu)00:39:42 No.108976226

Anonymous 06/04/26(Thu)00:39:42 No.108976226

>>108976218
you do it

Anonymous
06/04/26(Thu)00:43:58 No.108976237

Anonymous 06/04/26(Thu)00:43:58 No.108976237

File: 1777152399008990.png (559 KB, 896x1152)

559 KB PNG

>>108976226
I can't get the ₿ logo on the shell
someone edit it

Anonymous
06/04/26(Thu)00:44:02 No.108976238

Anonymous 06/04/26(Thu)00:44:02 No.108976238

>>108973324
it's a grifter

Anonymous
06/04/26(Thu)00:44:04 No.108976240

Anonymous 06/04/26(Thu)00:44:04 No.108976240

>>108976218
>still bag-holding in 2026

Anonymous
06/04/26(Thu)00:45:10 No.108976246

Anonymous 06/04/26(Thu)00:45:10 No.108976246

>>108976240
no, I tethered up when it was above 100k
but I plan to start DCA once it goes below 60k

Anonymous
06/04/26(Thu)00:45:17 No.108976247

Anonymous 06/04/26(Thu)00:45:17 No.108976247

>>108976240
>being chinese
never be chinese

Remember the adage "a chinaman's chance"

Anonymous
06/04/26(Thu)00:46:18 No.108976251

Anonymous 06/04/26(Thu)00:46:18 No.108976251

>>108976246
>>>/biz/
fuck off

Anonymous
06/04/26(Thu)00:47:21 No.108976255

Anonymous 06/04/26(Thu)00:47:21 No.108976255

File: output_1780547872.png (1.9 MB, 832x1216)

1.9 MB PNG

>>108976203

Anonymous
06/04/26(Thu)00:47:32 No.108976257

Anonymous 06/04/26(Thu)00:47:32 No.108976257

File: ComfyUI_00129_.png (1.04 MB, 1024x1024)

1.04 MB PNG

>>108976251

Anonymous
06/04/26(Thu)00:49:23 No.108976266

Anonymous 06/04/26(Thu)00:49:23 No.108976266

>>108976257
Fine, retail stonks will get smacked in the summer. You want your biz, there it is.

Anonymous
06/04/26(Thu)01:07:19 No.108976313

Anonymous 06/04/26(Thu)01:07:19 No.108976313

What should I generate?

Anonymous
06/04/26(Thu)01:08:31 No.108976319

Anonymous 06/04/26(Thu)01:08:31 No.108976319

>>108976313
>What should I generate?

>>108975164

also, you can throw it into an llm and add a modifier, like steampunk, videogame scene, renaissance painting, dark anime.

Anonymous
06/04/26(Thu)01:10:48 No.108976331

Anonymous 06/04/26(Thu)01:10:48 No.108976331

>>108973545
>>108973550
thanks!

Anonymous
06/04/26(Thu)01:12:33 No.108976338

Anonymous 06/04/26(Thu)01:12:33 No.108976338

>China, Germany, USA, Singapore, Israel, China, China, China...
When will see see a French image model? Or one from South America? Australia? Sweden?

Anonymous
06/04/26(Thu)01:15:41 No.108976351

Anonymous 06/04/26(Thu)01:15:41 No.108976351

>>108976338
right after they import more muslims and jeets

Anonymous
06/04/26(Thu)01:19:54 No.108976371

Anonymous 06/04/26(Thu)01:19:54 No.108976371

>>108976338
>French
Can anyone besides maybe Mistral train anything worthwhile there?
>South America
Lol
>Australia
Interesting how no major global tech corporation ever came out of there, AI is no exception. In theory has the ingredients. Too much regressive tax/regulation like Europe?
>Sweden
If you thought BFL's German safetrooning was bad Swedes will probably invent a whole new level of cuckoldry.

Anonymous
06/04/26(Thu)01:20:55 No.108976373

Anonymous 06/04/26(Thu)01:20:55 No.108976373

https://xcancel.com/thepatch_kev/status/2062140772942774681?s=20
musicgen chads...

Anonymous
06/04/26(Thu)01:22:48 No.108976385

Anonymous 06/04/26(Thu)01:22:48 No.108976385

>>108976373
https://github.com/betweentwomidnights/sa3-ableton-extension
this looks really cool

Anonymous
06/04/26(Thu)01:34:40 No.108976436

Anonymous 06/04/26(Thu)01:34:40 No.108976436

File: file.png (1.31 MB, 1320x776)

1.31 MB PNG

>>108972764
idk

Anonymous
06/04/26(Thu)01:50:20 No.108976501

Anonymous 06/04/26(Thu)01:50:20 No.108976501

>>108976436
krea 2 medium is so cucked lmao

Anonymous
06/04/26(Thu)02:00:31 No.108976532

Anonymous 06/04/26(Thu)02:00:31 No.108976532

>>108976373
waiting for the lmms version

Anonymous
06/04/26(Thu)02:11:52 No.108976568

Anonymous 06/04/26(Thu)02:11:52 No.108976568

File: ComfyUI_temp_blyix_00009_.png (1.74 MB, 896x1184)

1.74 MB PNG

I wish /k/ wasn't full of retarded boomers so we could have good gun loras for every model.

Anonymous
06/04/26(Thu)02:14:54 No.108976574

Anonymous 06/04/26(Thu)02:14:54 No.108976574

>>108976501
Almost got baited by this.
Let's just hope the model release doesn't get safetymaxxed.
It would be a shame cause the model is really creative.

Anonymous
06/04/26(Thu)02:18:32 No.108976589

Anonymous 06/04/26(Thu)02:18:32 No.108976589

>>108976574
>Let's just hope the model release doesn't get safetymaxxed.
its over

Anonymous
06/04/26(Thu)02:18:40 No.108976590

Anonymous 06/04/26(Thu)02:18:40 No.108976590

>>108976574
the only thing that would be good is an uncucked krea 2 large, maybe a slightly older checkpoint of it compared to the one they can sell on the api, otherwise its DOA like krea 1, which is what it will be, since they won't ever release krea 2 large

Anonymous
06/04/26(Thu)02:21:53 No.108976601

Anonymous 06/04/26(Thu)02:21:53 No.108976601

File: 1772515159983707.jpg (314 KB, 2048x2048)

314 KB JPG

Anonymous
06/04/26(Thu)02:25:59 No.108976609

Anonymous 06/04/26(Thu)02:25:59 No.108976609

File: ComfyUI_temp_jbcjs_00003_.png (3.29 MB, 1664x2496)

3.29 MB PNG

Also anyone has Nihei lora? According to booru, he has only <150 pics and this sure ain't it after prompting for him.

Anonymous
06/04/26(Thu)02:27:21 No.108976615

Anonymous 06/04/26(Thu)02:27:21 No.108976615

File: feels good man.png (432 KB, 1024x1024)

432 KB PNG

>local is in such a good spot that no anon bats an eye when the new SOTA textgen model is ruined by safetycucking

Anonymous
06/04/26(Thu)02:28:14 No.108976619

Anonymous 06/04/26(Thu)02:28:14 No.108976619

What if train image models with not just tokens output from LLMs encoding text, but also tokens output from LLMs encoding images? Or both? Or audio?

Anonymous
06/04/26(Thu)02:28:38 No.108976622

Anonymous 06/04/26(Thu)02:28:38 No.108976622

>>108976615
nobody cares about textgen, we use text to gen 1girl instead here. you'll have your abliterated fix soon anyway

Anonymous
06/04/26(Thu)02:29:49 No.108976625

Anonymous 06/04/26(Thu)02:29:49 No.108976625

>>108976619
that's called ltx

Anonymous
06/04/26(Thu)02:30:28 No.108976629

Anonymous 06/04/26(Thu)02:30:28 No.108976629

>>108976622
no ideogram4

[Return] [Catalog] [Top]

Post a Reply

Return Catalog Top Refresh

[Advertise on 4chan]

Delete Post: [File Only] Style:

[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.