[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Have a Cookie Anon Edition

Discussion and Development of Local Image, Video, and Music Models

Previous: >>109003927

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>109010629
THE LIGHTING IS INACCURATE

checkmate.
>>
cumfart devs are grifter faggots
>>
reposting my kino
>>
>>109010664
slap the cookies away. note the reflection in the photo on the right. Clearly the lighting is the result of a greenscreen. You are wearing surgically implanted augmented reality (AR) contact lenses. She is an imposter.
>>
>>109010675
> She is an imposter.
but pytorch is real
>>
>mfw Resource news

06/08/2026

>Unified Safe In-context Image Generation in Multimodal Diffusion Transformers via Restricting Unsafe Information Flows
https://github.com/deng12yx/UVR

>GuideCAD: A Lightweight Multimodal Framework for 3D CAD Model Generation via Prefix Embedding
https://github.com/mskimS2/GuideCAD

>SS-TPT: Stability and Suitability-Guided Test-Time Prompt Tuning for Adversarially Robust Vision-Language Models
https://github.com/sunoh-kim/SS-TPT

>Consistency-Preserving Diverse Video Generation
https://github.com/XinshuangL/Diverse-Video

>Ideogrammar — Ideogram 4 Prompt Editor
https://github.com/rlemson7/ideogrammar

06/07/2026

>Ideogram4 GGUF quantized files
https://huggingface.co/leejet/ideogram-4-GGUF

>‘A driver of political violence’: how the breakneck AI boom is fueling anti-tech extremism
https://www.theguardian.com/technology/2026/jun/07/anti-ai-tech-extremism-violence

>Ideogram 4 NF4 integration for Forge Neo with a visual JSON layout builder
https://github.com/Whatwhatio/forge-neo-ideogram4

>Huihui-gemma-4-12B-it-abliterated
https://huggingface.co/huihui-ai/Huihui-gemma-4-12B-it-abliterated

06/06/2026

>HugginFace VFS Plugin: Native Total Commander file system for Hugging Face models
https://github.com/mikinko/HuggingFace_WFX

>ComfyUI Lance AIO: Custom nodes to run Lance-3B
https://github.com/SteveImmanuel/comfyui-lance-aio

>Cube: Generative AI System for 3D
https://github.com/Roblox/cube

>The token bill comes due: Inside the industry scramble to manage AI’s runaway costs
https://techcrunch.com/2026/06/05/the-token-bill-comes-due-inside-the-industry-scramble-to-manage-ais-runaway-costs

06/05/2026

>RhymeFlow: Training-Free Acceleration for Video Generation with Asynchronous Denoising Flow Scheduling
https://simon-dcs.github.io/Website-of-RhymeFlow

>Complexity-Balanced Diffusion Splitting
https://noamissachar.github.io/CBS
>>
>mfw Research news

06/08/2026

>TrioPose: Native Triple-Stream Diffusion Transformers for Pose-Guided Text-to-Image Generation
https://arxiv.org/abs/2606.07053

>AsyncPatch Diffusion: spatially-flexible image generation
https://arxiv.org/abs/2606.07079

>Streaming Video Generation with Streaming Force Control
https://neu-vi.github.io/StreamForce

>CULTURESCORE: Evaluating Cultural Faithfulness in Video Generation Models
https://arxiv.org/abs/2606.07311

>3DMorph: Single-Image-Guided Local 3D Shape Editing and Morphing
https://arxiv.org/abs/2606.07115

>Breaking the Lock-in: Diversifying Text-to-Image Generation via Representation Modulation
https://arxiv.org/abs/2606.06813

>FreeAnimate: Training-Free Human Image Animation with Preview-Guided Denoising
https://freeani.github.io

>DRIFT: From Robustness Gaps to Invariance Manifolds for AI-Generated Image Detection
https://arxiv.org/abs/2606.06918

>AdaTok: Self-Budgeting Image Tokenization with Quality-Preserving Dynamic Tokens
https://arxiv.org/abs/2606.07185

>An Adaptive Data cleaning Framework for Noisy Label Detection
https://arxiv.org/abs/2606.07086

>Diagnosing Visual Ignorance in Vision-Language Models
https://arxiv.org/abs/2606.06890

>AdaGRPO: A Capability-Aware Adaptive Enhancement for Flow-based GRPO
https://bujiazi.github.io/adagrpo.github.io

>TEVI: Text-Conditioned Editing of Visual Representations via Sparse Autoencoders for Improved Vision-Language Alignment
https://arxiv.org/abs/2606.07451

>MidSteer: Optimal Affine Framework for Steering Generative Models
https://arxiv.org/abs/2605.05220
>>
>>109010686
Do you use some of this shit?
>Unified Safe In-context Image Generation in Multimodal Diffusion Transformers via Restricting Unsafe Information Flows
https://github.com/deng12yx/UVR
>GuideCAD: A Lightweight Multimodal Framework for 3D CAD Model Generation via Prefix Embedding
https://github.com/mskimS2/GuideCAD
>SS-TPT: Stability and Suitability-Guided Test-Time Prompt Tuning for Adversarially Robust Vision-Language Models
https://github.com/sunoh-kim/SS-TPT
>Consistency-Preserving Diverse Video Generation
https://github.com/XinshuangL/Diverse-Video
>Ideogrammar — Ideogram 4 Prompt Editor
https://github.com/rlemson7/ideogrammar
06/07/2026
>Ideogram4 GGUF quantized files
https://huggingface.co/leejet/ideogram-4-GGUF
>‘A driver of political violence’: how the breakneck AI boom is fueling anti-tech extremism
https://www.theguardian.com/technology/2026/jun/07/anti-ai-tech-extremism-violence
>Ideogram 4 NF4 integration for Forge Neo with a visual JSON layout builder
https://github.com/Whatwhatio/forge-neo-ideogram4
>Huihui-gemma-4-12B-it-abliterated
https://huggingface.co/huihui-ai/Huihui-gemma-4-12B-it-abliterated
06/06/2026
>HugginFace VFS Plugin: Native Total Commander file system for Hugging Face models
https://github.com/mikinko/HuggingFace_WFX
>ComfyUI Lance AIO: Custom nodes to run Lance-3B
https://github.com/SteveImmanuel/comfyui-lance-aio
>Cube: Generative AI System for 3D
https://github.com/Roblox/cube
>The token bill comes due: Inside the industry scramble to manage AI’s runaway costs
https://techcrunch.com/2026/06/05/the-token-bill-comes-due-inside-the-industry-scramble-to-manage-ais-runaway-costs
06/05/2026
>RhymeFlow: Training-Free Acceleration for Video Generation with Asynchronous Denoising Flow Scheduling
https://simon-dcs.github.io/Website-of-RhymeFlow
>Complexity-Balanced Diffusion Splitting
https://noamissachar.github.io/CBS

Or you are just larping?
>>
>>109010664
Thanks for baking, grandma.
>>
File: DEBO_00067_.png (1.99 MB, 1728x1344)
1.99 MB PNG
>>109010703
both
>>
>>109010630
>bottom left
I don't usually follow these threads, but is the tech already able to take some collage of photos like photobashing you make badly and then the AI fixes it for you?
>>
Stop the spam already
>>
how come there isn't a catch-all workflow for everything? something that has control net, adetailers, inpainting and whatever you can think of into a single workflow, where you can easily bypass the things you don't use. surely some autist has put something like this together.
I've been browsing some anima workflows on civit and I wanna shoot myself. I have very basic knowledge of things, and some flows are either unnecessarily convoluted or too basic. I've tried frankensteining something myself but I can't do it
>>
File: z_00519_.jpg (531 KB, 1264x1800)
531 KB JPG
>>109010657
no!
>>
>>109010664
reptilian skin
>>
File: output_1780955642.png (1.99 MB, 832x1216)
1.99 MB PNG
>>109010657
>>
>>109010787
I totally called it. RUN!
>>
GUYS STOP ENJOYING IDEOGRAM 4.

The license is bad
>>
>>109010880
You forgot to post the fun image you generated
>>
>>109010880
I mean I dislike it because it's slow, cannot edit at all, doesn't look that great, has an annoying mandated way of prompting that I've yet to see be proven well as actually a significant advantage over other models, and is ultra censored.
>>
> >109010686
> >109010696
Fuck off
>>
I will never even download ideogram.

Ever.
>>
File: output_1780957470.png (2 MB, 832x1216)
2 MB PNG
>>109010838
>>
File: ComfyUI_00377.jpg (1.94 MB, 1500x1920)
1.94 MB JPG
>>109010766
Recently, I migrated my workflow over to subgraphs and even knowing exactly how my workflow functioned, it took me a while to fix the problems that the new subgraph-flow introduced because with those, Comfy doesn't tell you where an error is occurring, just that it is. A catch-all workflow (even avoiding subgraphs) would be a nightmare to build and maintain and you, the new/learning user, would have no hope of fixing things when they break. Convenience and complexity rarely play nice together.

Also, you're not doing everything at once so there's no need to have everything in one place.

>error: file too large
>3.7MB
>error: file too large
>2.4MB
WAT!? Is /g/ just fucking with me now?
>>
>>
It's basically pointless asking this here but has anyone run into the issue with Forge where Adetailer doesn't seem to trigger during inpainting or img2img?

I tried putting detecting at 0.1 and it still does nothing.

How 2 fix?
>>
File: z_00537_.jpg (281 KB, 984x1264)
281 KB JPG
>>109011007
try other adetailer model
>>
>>109010703
Read the OP and just ignore him.
>>109010557
I don't think comfy is something that should be looked at as proper user interface, the backend is great the rest is scared by opinionated bullshit head by a dev that needs a tighter leash. I only pull as needed because until they practice proper software development it feels like a crapshoot
>>
File: z_00541_.jpg (284 KB, 984x1264)
284 KB JPG
>>
>>109011040
Okay somehow toggling the on/off button a bunch of times seems to have fixed it.
Thanks anyway Anon.
>>
Maybe it's just me, but prompting age ranges seems to not work in Ideogram, atleast not like it does with Klein/ZIT. It seems to completely ignore it and either gives me a fucking toddler or adult. No in-between.
>>
https://civitai.com/models/2570290/sonic-the-hedgehog-anima-il-nathalie-fourdraine-style
>>
>>109011046
>the backend is great
it's beleaguered by python itself which should not be used to code an app at this scale.
>>
>>
>>109011169
Seems like nobody follows this with AI and every C++ fanatic is a broken schizo that can't accomplish shit.
So unless someone is willing to make the change we just have to cope.
>>
File: Ideogram_4.0_00008_.png (1.75 MB, 768x1376)
1.75 MB PNG
>>
>>109011191
Comfy doesn't even implement common graph layout features.
>>
>>109011236
don't care.

text isn't why we gen.
>>
anyone know if there is a way to generate prompts based on text? for example if I have a character generator app that can output text like:
>Gender: Female
>Race: Elf
>Class: Mage
>Eye Color: Blue
etc is there any way to automatically turn that into a prompt like
>1girl, elf, elf ears, mage, blue eyes,
etc? I know some non-local website apps turn user prompts into prompts optimized for their AI so i'm sure it exists just not sure what it's called or if there's a local version of it
>>
ideogram is a snake that must be opposed as fully as possible.

No model, no matter its features, is acceptable that includes outright censorship. Instead, the acceptable "censorship" of a model is by exclusion of the material from the training.

we do not accept whatsever any model which fails to adhere to this rule.
>>
To be clear, we treat ideogram like Theodosian I treated the Arians.

ideogram is a false local model - it smuggles corporate control into its design, it's not acceptable, and we do not allow it.

In fact, we recommend that it be deleted from hugginface. It's an anti-open model.
>>
>>109011259
this but also anima does this
>>
How do we get the NSA and CIA to delete all existing copies of ideogram?
>>
>>109011267
What prompts do it, and what happens? Can it be mitigated?
>>
>>109011255
>Puts lightweight LoRA on model
>No longer censored

Oh no. I now how sota level image generation and no censorship and the company the made the model has plausible deniability for any damage caused by users.
>>
>>109011279
it steals the models you train so he can sell them and you cannot
>>
>>109011279
I tried farting_on_prey and nothing happened
>>
>comfy fuding
>anima fuding
*sniffs* ew... is that... literal human garbage i smell?
>>
>>109011191
>So unless someone is willing to make the change
No he just seethes about it every thread. No plan to help fix it or anything baka.
>>
File: output_1780961051.png (2.1 MB, 832x1216)
2.1 MB PNG
I'm crazy about this sort of bluescreen aesthetic.

>>109010965


>>109011285
We have to take a hard line. Absolutely ruthless. No snakes. No poison.

Once you start allowing it, it gets greater and greater until you wind up with the beast like Apple with hardware that literally doesn't work with Linux.
>>
I mean, I literally have no idea why id made her dress a cutout. No clue...

maybe "(((cropped)))" in the negative???
>>
>>109011305
... or maybe "contouring" in the negative??? idk.
>>
>>109011305
People are going to censor their model no matter what and you're forced to use it.
>>
>everyone moved on to ideogram
looks like anima was catastrophically forgotten
>>
>>109011317
They need us, we don't need them. Think about it, do you *need* ai? but they need popularity.

Not recommended: ideogram is a model that had potential, but it was ruined by certain corporate decisions. essentially, their censorship makes it unreliable, and maintains undue influence of corporate culture which we can't tolerate. As such, we recommend you have nothing to do with the company. Avoid. 0/10 cudastars.
>>
>>109011332
Only anima can do this:
>>109011305

We have all received the papAI bull: no ideogram.
>>
>>109011332
do you make these blatantly false posts just to try and piss off "anima fanboys"? keke
>>
>>109011334
I refuse to use inferior models to satisfy your reddit outrage.
>>
>>109011339
Go lick a windows 11.
>>
btw, my laptop still has Windows 11, and I can report that it's not cool.

One of the worst things about Windows 11 is that it doesn't show a clock on the lock screen.
>>
>>109011339
why are you here if you don't use local models?
>>
>>109011373
The biggest black pill I ever found was that shills are real, common, and everywhere (like aids)
>>
>>109011379
damn thats crazy
>>
ideogram is legit better than closed source models, wtf
>>
>>109011362
>it doesn't show a clock on the lock screen.
Yes it does? what.
>>
I wish the drunken sperg had a different hobby so his insane ramblings would be posted in another thread instead of here
>>
>>109011402
It's good but I kinda hate using bboxes to setup images because sometimes I don't know how I want the composition of the image to be.
>>
>>109011300
Seems like everyone in that camp are mentally ill, I don't really care about python vram is the resource that matters in this space anyways
>>
>>109011388
Wait until you find out about the thing that never happened.
>>
>>109011423
well actually it's just ram with offloading and cumfart takes too much of it
>>
just give up jules
>>
why is catjack still schizoing out? his Boogeyman left months ago
>>
>gets called out
>proceeds to screech into the void
uber kek
>>
>>109011412
if you jiggle the mouse, but if you hit a key you go straight to typing the password. No clock. And escape doesn't take you to the clock. And there is no back arrow (maybe there's supposed to be?)
>>
>>109011463
>offloading ram
At that point use a different model
>>
>>109011496
bruh if I could buy hardware I would but I still want to try newer models in the meantime
>>
https://civitai.red/models/2681698/ideogram-missionary-sex-mi55ionary
>>
>>109011334
its the least censored model in forever, you just have to follow the prompting format.
>>
>>109011504
You're upset about something when it's a hardware issue, I wouldn't even bother if I had the ram to spare because of the sheer slowdown you get by doing that. You either have to cope or make more money or look for older cards with more vram
>>
>>109011513
if this dynamic vram tumor feature actually worked I would be fine
>>
>update comfy
>speed halved
why
>>
>>109011496
Retard. Ram offloading has become insanely efficient. There's a reason all ai tech companies are buying all the ram they can find. Buying GPUs with lots of ram is REALLY expensive, not only that but they ship with comparatively small amounts of ram, so offloading is where it's at.
>>
I have 300 gigs of ram
>>
File: 1758153242270000.mp4 (3.41 MB, 920x2000)
3.41 MB
3.41 MB MP4
how the fuck can a phone do this?
this is alien tech
>>
>>109011642
Those companies run hardware with five times your memory bandwidth (and ten times the CPU speed), which is the only way to make offloading to system RAM not run at 7 t/s.
>>
>>109011706
>anon discovers gaussian splatting
>>
File: 4ecd61e8d4636bbd.jpg (11 KB, 169x270)
11 KB JPG
>>109011706
Api noses
>>
>>109011706
>error reframing photo
But anon... it didn't do it
>>
File: 1767747800403656.mp4 (3.52 MB, 920x2000)
3.52 MB
3.52 MB MP4
>>109011716
theres nothing that comes close to this on computer
>>109011720
>>109011726
this is all done locally on the phone, you can toggle off cloud mode for simple tasks
>>
>>109011706
Unironically easily vibecoded in a single evening
>>
>>109011642
Then why is it so shit and why are you on your knees begging comfy to save you?
>>
File: ComfyUI_11033_.png (2.19 MB, 1280x2048)
2.19 MB PNG
>Ideogram 4 is trained exclusively on structured JSON captions
wtf is this shit?
>>
>>
>>109011762
>theres nothing that comes close to this on computer
The first part is quite literally gaussian splatting. The second part is just regular image generation. Just because it's magic to you does not mean it is actually magic.
>>
>>109011787
why are people literally too retarded to just put a prompt refiner node or a fucking deterministic json formatter into their workflow
>>
>>
>>109011806
it feels intimiate to type the prompt yourself as if you are handcrafting your very own personal 1girl.
doing it any other way is cuckoldry
>>
>>109011806
They shouldn't if you peg yourself as the top solution for image generation. User friendliness is lost on comfy and the more educated and advanced I become with AI in general the more hate I have for the direction this project has taken.
I already defended comfyUI today but the state of things is mind numbing and the easiest solution is to use a llm to help navigate all the poorly documented changes made to this fucking thing which means you need deep pockets.
>>
File: ComfyUI_00037_.png (1.47 MB, 1088x960)
1.47 MB PNG
>>
File: ComfyUI_00034_.png (1.04 MB, 1008x1032)
1.04 MB PNG
>>
>>109010985
>>109010985
my jenny you've enhanced yourself
>>
>>109011806
what I meant is, why have they done it this way?
are they trying to reinvent the wheel or what?
>>
>>109011831
its literally one more node in theory anon
>>109011833
>User friendliness is lost on comfy
its almost as if youre using a bleeding edge image model released literally 2 days ago or something
its not comfyuis tasks to dictate the standard people should use to prompt this model, my point is that it takes literally 5 minutes to vibecode a node that formats whatever text you input as a caption to { "high_level_description": "insert your prompt here" }, and probably less than that to set up one of the existing llm nodes to create that format given a free text
the only reason people complain is because they got used to writing every single detail of their style manually when this format essentially is a superset of it
>>109011869
yes (and heres why thats a good thing)
>>
so now that anime was an absolute SUCCESS, what's next for local?
>>
>>109011799
Anon will say shit like this but never reproduce it.
>>
>>109011869
It's for controllable generation, product photos, etc.
It also forces it to not be creative which is why ledditors love it so much.
>>
>>
>>109011799
regular gaussian splats require many real photographed angles to determine the depth and appearance of things, this may use gaussian splats but turning a single photo into a gaussian splat would require a whole ton of AI guessing
>>
>>109011894
it would be a success when Russ announces the sequel. he currently bitches about how TA is getting a better deal than everyone else
>>
It's not that I don't think the license for idiogram isn't draconic and highly restrictive. It's the sweaty out of touch autistic reddit tier retards telling me that I'm not even allowed to enjoy or post outputs from the model.

They can't let people enjoy thing. Everyone knows ideogram will disappear the moment a new thing comes along.
>>
>>109011910
it would be so fucking funny if sdcpo was the reason why cumfartorg falls apart
>>
>>109011915
https://github.com/VAST-AI-Research/TripoSplat

Oh?
>>
So much models, so little finetunes desu
>>
If sd.cpp is so much better than pyshit, why isn't everyone using sd.cpp? People would use it if it's better, right?
>>
File: Ideogram_00170_.png (3.59 MB, 1936x1088)
3.59 MB PNG
>>
>>109011916
and to think, whatever money anima makes will just go to fund comfycloud anyway...
>>
>>109011901
>>109011915
It's okay anon don't be scared by the "magic". I know it's new and scary but trust me it won't hurt you.

>>109011918
Did the blog factory explode?
>>
>>109011916
you sound jealous that you dont have a deal with TA
>>
>>109011924
yea, that's exactly what i said
>>
>>109011930
I think people are switching. They are keeping up with model releases unlike neo forge
>>
>>109011934
These newer models excel at stylistic approximation but lack the structural consistency needed to reproduce a defined character accurately across different seeds.
>>
>>109011955
>I think people are switching.
lol
>>
imagine if voldy never gave up webui
>>
>>109011986
>even more retards who don't understand basics like "second pass" or what a VAE is
Hell on earth
>>
did anyone else's comfy just break for absolutely no reason. god damn what the fuck happened.
>>
>>109011986
what happened to him?
>>
>>109011952
you didnt even look at the repo
>TripoSplat converts a single 2D image into high-quality and variable number of 3D Gaussians, developed by TripoAI.
>single 2D image
>>
>>109011986
What's the real reason he stopped? Don't give me that "he wanted to play vidya instead" or whatever surely it was some dumbass drama
>>
>>109012005
oh wait false alarm. lm retarded and had a feedback loop going
>>
>>109011894
a currently unknown or little known tunerchad is going to do some tests with anima and e621, get some funding based on the tests and his connections, release a really cool model, and then 6 months later itll get shitmixed to the point where retards are able to use it. and then we start this cycle all over again
>>
>>109011986
I had to go out of my way and edit a file by hand with a fix I got in the github issues just to get it to work on my GPU (1650 super), I'm glad I don't have to do that anymore.
>>
>>109012008
last I saw him contributing to llama.cpp. even voldy took the c pill
>>
>>109011986
I don't want another webui thoughever
>>
>>109012013
and you didn't read my whole post
>>
>>109011910
>>
>>109012049
you could elaborate on it
>>
File: Ideogram_4.0_00003_.png (132 KB, 768x1376)
132 KB PNG
damn, what a good pic this model generated!
>>
>>
>>109012070
yeah it feels like I get the safety bullshit 30% of time despite not making any changes to my prompt or workflow. ideogram is definitely the best local model for realism but seed hunting is even worse than chrome
>>
File: 1780970448955000.jpg (1.95 MB, 2796x1821)
1.95 MB JPG
crayon, children's drawing, children's art, children's book illustration, forest, trees, hills, subdued, abstract
>>
if only it didnt have the piss filter :(
>>
>>109012079
so why the fuck they released this turd?
shit is useless.
>>
how can we help sdcpp?
>>
>>109012067
first guy says it's "quite literally gaussian splatting", which doesn't really explain anything the tricky part is making the splat, not displaying a splat.
i said that regular gaussian splats involve taking photos with many angles, basically regular splats are derived from photogrammetry information and doesn't involve AI.
i also added that to take a single picture and making a splat of it would require AI, as a simple explanation why that's because a single photo doesn't provide any means of providing depth or hidden area information. (though thinking about it, don't iphones now have some kind of lidar camera for depth? that would certainly help a lot, but that alone can't fill in the gaps created by shifting the perspective of a single photo)
>>
>>109012099
it's not meant to be used uncensored.
>>
>>109012103
anything more than simply spamming about it here it too much for anon
>>
File: why not.jpg (45 KB, 977x666)
45 KB JPG
>>109012110
>>
>>109012109
>which doesn't really explain anything the tricky part is making the splat
yeah this repo is where you see it creates the splat from a single image >>109011924
>i also added that to take a single picture and making a splat of it would require AI,
are you saying apple doesnt use ai?

i dont see how you are still confused about why anon wouldnt really think that much of apples thing since we already have the tools to get something like it
>>
>>109012099
It's marketing. Bots out in full force.
They also made sure to only release fp8 and nf4.
>>
>>109012117
hard to get investors/support with an uncensored model.
>>
>>109012130
to be fair, fp8 competes with fp16 ZIT/Klein easily. makes me wonder just how good the full model is.
>>
>>109012130
>It's marketing.
marketing for what? to announce to the world they are retards?
everyone involved in making this retardogram4 garbage should have their hands chopped off, should be gelded with a hot iron and and should be thrown into the ocean.

if you agree give me (You)s.
>>
>>109012145
why so angry? no one is forcing you to use it. its also free
>>
did they add input images to ideogram yet?
>>
>>109012079
you only get the "safety" image if you dont use enough bboxes. But also the slightest of loras gets rid of it. Looks like they did it as light as possible so people could remove it easily. And it knows boobs and even vulvas better than anything else base model wise ever did. Its the least censored model since sd1.5
>>
>>109012150
check under your foreskin
>>
>>109012149
>its also free
other free models are better and arent completely useless trash with gay safety filters.
nobody wants >muh safety.
>>
>>109012156
I hate using bboxes because it forces me to setup a composition. I'll try using loras, but there's only a few, like 3? the penis ones don't look too promising.
>>
>>109012099
it's fun for apiGODS to humiliate localkeks now and then, watching them wait 60+ seconds for a censored output and then start coping about how loras, finetunes, and workflows will fix it.
>>
>>109011955
Not a lot of people are aware and most of them are deep into their Comfy node soups, so I doubt there are a lot of people switching.

But it is powerful and makes automation of workflows / tests SO much easier than Comfy where you need custom nodes for anything remotely more complex than queueing generations.
>>
>>109011874
Why are you assuming I'm talking about the most recent model?
People like you are the main reason why issues like this never get resolved.
>>
>162 replies
>29 images
grim
>>
>>109012172
>it's fun for apiGODS
are you dumb? you get the exact same grey safety picture you fucking cloud cuck moron.
>>
>>109012182
>But it is powerful and makes automation of workflows / tests SO much easier than Comfy where you need custom nodes for anything remotely more complex than queueing generations.
sd.cpp has builtin tools for grids and stuff?
>>
>>109011955
Do they even have feature parity?
>>
>>109012186
doesnt happen on grok or nano banana, and our spidermans-on-a-unicycle still look better
>>
>>109012186
Even worse, they have less opportunity of circumventing it than local does.
>>
ZIT is fast and works well, 2 min for an image that may be censored is retarded
>>
File: z-image_00776.png (3.6 MB, 1160x1744)
3.6 MB PNG
>>109012185
much to discuss
>>
>>109012183
i think you replied to the wrong anon originally
>>
>>109012191
kys you fucking retard
>>
>>109012190
>feature parity?
Hah. We all wish.
>>
>>109011955
>I think people are switching
No. No they are not. sd.cpp will never be popular until someone builds a better frontend than Comfy for it. You're legit psycho if you think your average person who can barely use python is going to be typing raw commands in terminal to generate content.
>>
>>109012190
the average slopper writes some shitty prompt and hits generate. 90% of the community use nothing extra outside loras
>>
File: image (10).png (106 KB, 1059x386)
106 KB PNG
>>
>>109012212
nb4 anon mentions the sdcpp frontend that is literally worse than auto1111
>>
>>109012215
90% of the community barely tweak the default workflow and wants to be spoonfed for every single gen they see because they think it uses some magic workflow
>>
>>109012212
it has a webui...
>>
>>109012215
So nothing changed from day one....But lets not pretend comfy feeds the worst type of midwitts
>>
>>109012217
Yes based Russ informed anon of this the other day https://desuarchive.org/g/thread/109001708/#q109003014
>>
>>109012217
does it dissolve the licence? They will probably change it to something more draconian
>>
>>109012217
Cool. So can someone just make a dummy lora that has no impact on the model other than getting rid of the safety filter already?
>>
>>109012215
>>109012220
so just to be clear it does not have feature parity? why did you even reply then?
>>
File: shrek-smile.gif (1.67 MB, 498x281)
1.67 MB GIF
>>109012218
>>109012221
>>
File: 1777354250692264.png (1.27 MB, 1024x1024)
1.27 MB PNG
>>109012198
see, zit just works
>>
>>109012228
not everything but it doesn't have a bunch of snake oils. It has controlnet, masking, loras, slg, etc. no idea on nag
>>
File: z-image_00769.jpg (843 KB, 1432x1432)
843 KB JPG
>>
>>109012198
ZIT is still too 'clean' and produces model tier people. ideogram has far better variety
>>
>>109012238
>not everything...
you couldve just said "no it does not have feature parity" instead of what you posted. not sure why you are doing that
>>
>>109012242
He's mentally ill seeing how he calls those things snake oil
>>
>>109012218
there's this one too
https://github.com/fabricio3g/FlaxeoUI/
>>
>>109012256
no need to get upset anon unless you have some sort of stake in sdcpp?
>>
>>109012258
>built with Electron and Node.js.
I'd rather use AniStudio and get hacked by a homosexual Canadian man than use this. Holy LOL.
>>
>>109012256
Anon you should stop replying to him seeing at this point
>>
does ideogram pass the loli test
>>
just to be clear sd.cpp is lacking in many features present in comfy?
>>
>>109012278
I'm 99% sure that's the case, I don't know why the other anon is having a schizophrenic breakdown.
>>
>>109012278
no it won't crash when trying to open it. that was my favorite comfy feature
>>
File: z-image_00760.png (3.24 MB, 1160x1744)
3.24 MB PNG
>>
>>109012266
Comfy is also Jabbascript though
>>
>>109012292
If that electron app had the same features as Comfy frontend then your argument would be persuasive.
>>
Flux.2 Dev vs Flux.2 Klein - who was the jannetty?
>>
>>109012305
wat
>>
>>109012268
yes
>>
>>109012308
which one's worse?
https://www.urbandictionary.com/define.php?term=jannetty
>>
>bigasp guy is training klein 9b
>lodestone considering training ideogram
You stupid fucks, literally just train on Anima. What's easier, teaching a censored corpo-friendly base model all the nsfw concepts and tags, or teaching Anima to have slightly better textures and details when doing realism? Like I could understand it if they had a hard requirement to train only permissively licensed models, but if we're going for fucking flux and ideogram just use Anima as the base instead ffs.
>>
still waiting for ideogram nuclear kinos
>>
>>109012482
>2b vramlet garbage
lol
>>
>>109012482
none of what either of them or really any notable trainer does right now matters. the Next Big Thing always comes from a literal who
>>
>>109012538
still better than 9b bloated slow as shit garbage (Chroma)
>>
>>109012482
>realismsloppa datasets
dead on arrival. enjoy struggling to generate basic penetration because their shitty 2023 joycaption tags don't know what the fuck they're looking at.
3dkeks KNEEL to animeGODs. wait for your next anime booru finetune, then you can leech off the superior tagging knowledge and cope with your instagram loras.
>>
>>109011510
ideogram is trash and must be abolished, let the CIA and NSA wipe it from all the hard drives.
>>
File: output_1780963674.png (2.05 MB, 832x1216)
2.05 MB PNG
This is anima, it looks good, that's how you know it's not ideogram.
>>
File: debo_vn_fia_00026_.png (2.23 MB, 1792x977)
2.23 MB PNG
>>
why do we still have vram limits in 2026? why has no one simply made it so each layer gets loaded one by one instead of trying to push it all through large shards?
>>
feels like wan2.2 has been abandoned. almost no one makes loras for it anymore. im looking at LTX 2.3, and the examples look so fucking shit compared to wan. i refuse to use this.
>>
>>109012727
Wan2.2 is feature completes with LoRa already. There isn't much left to add on. If you want a special LoRa, you need to bake it yourself.
>>
File: 1767052596570758.png (110 KB, 1065x735)
110 KB PNG
>>109012217
don't worry, lodestone has got this covered
>>
>>109012743
"scribble hands" lodestone
>>
>>109012743
>jumping to ideogram before he even finishes his zeta-chroma model
this damn furry will constantly be playing catch up. by the time ideogram-chroma is done, something much better will already be out
>>
>>109012758
He never trains hands, because his models aren't intended for non-furry content.
>>
>>109012743
Can lodestones ever see one thing through to completion?
>>
File: 34757.gif (3.99 MB, 320x222)
3.99 MB GIF
>>109012727
>i refuse to use this
>>
File: Ideogram_00176_.png (3.71 MB, 1264x1680)
3.71 MB PNG
>>
>>109012826
garbage
>>
>>109012727
>feels like wan2.2 has been abandoned.
>im looking at LTX 2.3, and the examples look so fucking shit compared to wan. i refuse to use this.
Same desu. Next to nothing worthwhile released in a year.
Local video scene is so rough compared to images or /lmg/.
>almost no one makes loras for it anymore
Tbh that's because based i2v loras keep getting jannied from everywhere.
>>109012789
Technically he "finished" Chroma.
>>
>>109012873
>he "finished" Chroma.
he trotted across the finished line.
>>
WTF is Wan 2.5? Is it API shit? Why don't I see Wan 2.5 videos?
>>
>>109012887
all wan versions after 2.2 are api
>>
>>109012887
Wan 2.5 was a dark day for local video.
>>
>>109012727
Someone trained an image reference lora for ltx recently. Haven't tried it yet.
https://huggingface.co/LiconStudio/LTX-2.3-Multiple-Subject-Reference
There's still some advancements going on but I guess people here are only talking about more sex poses when they complain here.
>>
>Prompt executed in 315.66 seconds
>censored nipples
What is Klein 9b good for then?
>>
>>109012914
>people here are only talking about more sex poses
I never understood this. Sex poses are so low concept. I can't imagine having a fetish as simple as girl bent in unusual position. I need a whole story arc to enjoy it.
>>
>>109012921
>What is Klein 9b good for then?
making kinos
>>
>>109012921
>Klein 9b
>315.66 seconds

What?
>>
>>109012931
20 steps
vramlet
>>109012926
post examples
>>
File: Ideogram_00182_.png (3.78 MB, 2224x960)
3.78 MB PNG
>>
>needing to ask your groomcord before making any compute decisions
kek
>>
>>109012933
i deleted them
>>
>>109012914
>I guess people here are only talking about more sex poses when they complain here.
Correct. If I wanted to do SFW content I'd just use cloud shit.
>>
>>109012921
>Prompt executed in 315.66 seconds
I assume you are using base? Just use the distill if you don't have a beefy computer.
>censored nipples
There are lots of NSFW loras already.
>What is Klein 9b good for
Edits.
>>
>>109012803
He is right though, LTX has audio, but for everything else it is inferior to Wan 2.2
>>
File: Ideogram_00185_.png (3.62 MB, 1680x1264)
3.62 MB PNG
I was making this image as a joke but then I realized how real it felt and I got shivvers down my spone.
>>
3deep5me
>>
>>109012996
being able to generate past 5 seconds already btfos wan. i don't know what people are doing wrong. maybe they should just use wan2gp since it seems to be the only correct implementation
>>
who's the retard?
>>
>>109013020
ltx looks like vomit
>>
>>109013020
ltx is essentially a talking head model. It can do 1 person looking at the camera and talking. The moment you add in human movement, it starts to rapidly break down in prompt adherence. Even the eros/sulphur models with loras can't come close to what wan22 can do.

It's a pain in the ass but genning a 20 second clip in wan and then using the frames as guides for ltx seem to help, but even then, it's pretty retarded.
>>
>>109013040
it must be a prompting issue. i assume the coomers are trying to do something that involves the target interacting with the camera. you have to change your language and be very explicit with a bunch of things related to the scene to avoid ltx misinterpreting your script. i started using ltx after grok closed its doors to free users, and i remembered struggling with prompting ltx because grok is a lot better at following natural language as opposed to writing like a script writer. i guess that is what people are struggling with
>>
>>109013104
>it must be a prompting issue
You are so delusional I believe you have never used wan 2.2 at this point.
ltx is so fucking dogshit is hilarious, is the sd 1.5 of video models, it can't do shit, the anatomy is basically completely fucked and the proportions are completely wrong 99% of the time, you have to be a braindead retard as not to see how shit ltx is.
>>
>>109013020
5-7 seconds is all i need to goon buddy. just a nice loop. I have alread generated 3,400 wan videos.
>>
>>109013126
why not make a mega folder, fren
>>
>>109013129
its all cunny
>>
>>109013122
>sd1.5 if video models

Extremely diverse adaptable and trainable with long lasting applications that still pop up today simply due to how lightweight and powerful it was.

Comparing ltx 2.3 to sd1.5 is not the burn you think it is
>>
File: 1757271183423874.gif (511 KB, 840x488)
511 KB GIF
>>109013134
>>
File: 24577.gif (4 MB, 320x245)
4 MB GIF
>>109013122
>You are so delusional I believe you have never used wan 2.2 at this point.
you're right, i'm the crazy one here
>>
Lil queefy feelin niggly I see
>>
>>109010985
somebody make shartifAI/scatAI lora or whatever it is so we can make the butts shart and fart
>but why
4 just eez
>>
Is it realistic to fully fine-tune ideogram to make anime coom model that's comparable to Anima? I remember people hyping up ZImage Base saying someone could do that, then nothing really came of it execpt a few shitty LoRAs and useless fine-tunes that barley teach the model anything or are just LoRA merges. Instead, eventually we got Anima instead based off Cosmos 2B. Please nobody talk out of your ass like last time like with ZImage Base, I've already been burned
>>
>>109013177
I haven't seen enough examples to see how well it responds to training to judge that.
>>
I still have no idea how not to get shit images with Ideogram.
I am trying kj json prompt builder workflow and either get segmented weird stuff, or I get a single cohesive image but it adds bunch of BS I didn't ask for, or I get plain shit images. I can't figure out how this model is supposed to click.
>>
I wanna step away from cartoons and animes
Whats your best tip for realism prompting
>>
>>109012217
So there is hope.
But I will never forget the retard redditors defending a model with a safety filter.
>>
>>109013200
The thing with the bboxes is that any space that's not labeled for anything, the model will just make something up. Usually it tries to create something that makes sense but I tend to see it generating clones or nonsense in unlabeled areas. Also if you're using kj's prompt builder, I suggest using the the Grab BG button to insert your last gen in the background. Then adjust the boxes around if it's giving distorted outputs.
>>
What's the verdict on Anima base-v1 vs preview3-base?
>>
File: 36778.webm (3.66 MB, 420x321)
3.66 MB
3.66 MB WEBM
there can only be one
>>
>ltx fired their whole team
>local still coping with wan2.2
AHAHAHAHAHAHAHA
>>
>>109013237
I don't get it though. I am fine with some bbox autism, but it's fucking unusable if I have to specify some BS to everywhere. What if I just want some empty scenery in some part of the image?
Why is this the case, did they fuck up the training or something? You would expect the model to know no bbox = fluff, absolutely nothing not mentioned in the high_level_description or background parts.
>>
>>109013268
Not a massive difference but it's more stable, minor details are better, slightly better backgrounds, it copes better with 1536p.
Preview 3 was weird in some ways, base doesn't give me the same distasteful feeling. I like it.
>>
>>109013271
*whips out badge*
SFW vageen department, slowly place that sfw vageen real close to the camera where I can see it
>>
File: file.png (2.65 MB, 1184x1776)
2.65 MB PNG
>>
File: ComfyUI_00756_.png (321 KB, 896x1152)
321 KB PNG
>>
File: 17121982615612.jpg (740 KB, 1080x1621)
740 KB JPG
>>
>>109013321
>ELiS PURNITEL
>FALLOUT GAS STATION
Did you attempt to prompt Ella Purnell and this is what the model spit out? Lel.
>>
File: file.png (1.01 MB, 572x1024)
1.01 MB PNG
>>109013390
I just copied a prompt from gemini, and yes it doesn't know that IP, nor ella it seems. The feet still looks nice.
>>
>>109013422
Pretty much all local models are safety cucked against real people.
The odds of Fallout Lucy, Lucy MacLean, Lucy from Fallout TV show, etc. working are higher since this model knows some IP.
>>
File: 2134567.webm (3.76 MB, 420x321)
3.76 MB
3.76 MB WEBM
>>109013304
*sends you to the rape dungeon*
>>
Smart App Control is blocking my aimdo.dll in python_embedded so I can't launch comfy
It doesn't have an exceptions system which is retarded, so am I just better off turning it off?
>>
>>109013512
well I tried relaunching and now it got through
windows is so funky..
>>
File: Anima_03072_.png (883 KB, 896x1152)
883 KB PNG
>>
File: ComfyUI_00497.png (3.44 MB, 1500x1920)
3.44 MB PNG
>>109013150
Girls don't do... "that"!
>>
File: sssaasasasasasa.png (2.05 MB, 1024x1536)
2.05 MB PNG
>>109013150
gross
>>
Is ideogram already in comfy or do we have to wait?
>>
>>109013472
Oh no, donut rape me pleeees
>>
why isn't /ldg/ losing its shit over ID4? No vram? Too slow on shitty systems? Bbox concept too complicated to understand?
It's a higher res, more realistic zit that knows more concepts, more characters, is better at text, and allows finer control over objects that are generated in the image.
>>
>>109013810
Image per second rate too slow for gooning for me personally. Not denying it's good.
>>
File: ideogram(11).jpg (390 KB, 1600x1200)
390 KB JPG
>>
File: ideogram(12).png (3.25 MB, 960x1296)
3.25 MB PNG
>>109013825
its 11 seconds for 1024 x 1024 for 20 steps on a 5090. Use flash attention
>>
File: ideogram(6).jpg (1.53 MB, 1648x2544)
1.53 MB JPG
>>
>>109013872
Unfortunately I have a RTX4070. I'll try flash though, ty for the tip.
>>
File: 0_00374_.jpg (1.07 MB, 1360x2048)
1.07 MB JPG
>>109013891
use int8 convrot as well, even faster https://huggingface.co/bertbobson/Ideogram-4-INT8-ConvRot/tree/main
>>
>>109013810
getting there, I had to reinstall an os on the server, so its a looong process to even think about getting the new stuff in, edit: just started downloading now and will post proof soon enough, but I think im going straight for perceptual training of something soon after.

Anyone done it for ID4 yet? https://github.com/BuffaloBuffaloBuffaloBuffalo/ai-toolkit-perceptual/tree/main

God speed lmoa
>>
File: ideogram(17).png (3.21 MB, 1918x1277)
3.21 MB PNG
>>
File: ideogram(23).jpg (803 KB, 2522x1234)
803 KB JPG
12, 20, 48 steps
>>
File: ComfyUI_00657_.png (1.41 MB, 1024x1024)
1.41 MB PNG
>>109013941
whats with the buttchin?
>>
>>
File: migu.jpg (442 KB, 1024x1216)
442 KB JPG
>>
I've been out of the loop for a few weeks because I went into a Claude AI induced psychosis to make an app no one wanted on GitHub. What's the qrd on the sdcpp situation, is it actually good now? It was awful last time I tried it but that was a long time ago.
>>
>>109014089
it's still shit
>>
File: migu2.jpg (733 KB, 1536x1024)
733 KB JPG
>>
Is there a good flow to turn sketches into more complete drawings? I once did it with controlnet on forge UI among some other settings that I don't really remember but it needed a lot of manual tweaking and it was a bit of a pain.
It's been a couple years since I last tried this so I wonder if it has been streamlined better at all.
>>
File: 1777139007041328.jpg (102 KB, 1920x1080)
102 KB JPG
>>109013698
>>109010985
insane quality jennies
>>
Don't know how to install flash attention for comfy, dam
>>
>>109014060
>>109014137
plastic. is this flux?
>>
>>109014203
https://mjunya.com/flash-attention-prebuild-wheels/
and use kijai's flash attention node
>>
>>109013698
they do, I've seen vids of it
>>109013758
It depends, if the girl is hot then it is hot as well
>>
>>109014213
you reminded me of my childhood trauma when i saw that old "girls dont fart" video on youtube where she shoots diarrhea all over the shower walls
>>
what model are they using?
https://www.instagram.com/mikuu.cosplay/
hive doesnt detect her pics as ai for some reason
>>
https://github.com/zai-org/SCAIL-2
>>
>>109013810
Because the resident schizo convinced everyone it was shit and censored
>>
>>109013810
VRAMlets. This is FP8 ID4.
>>
>>109012727
use sulpur then train your own lora on top of it retard. LTX blows away wan, both quality wise and the range of what it can do. On top of audio, minute long gens that are coherent, speed, video editing, audio synching, ID lora...

https://files.catbox.moe/h2nqay.mp4
https://files.catbox.moe/pqh1n7.mp4
https://files.catbox.moe/mjb54h.mp4
https://files.catbox.moe/xt15uu.mp4

https://files.catbox.moe/mjb54h.mp4
>>
How does stable diffusion work on gaming laptop?s?

In the sense that if I have an insufficient PSU in PC land it causes the PC to power down.

Does the battery have sufficient power capabilities or is the laptop just more able to manage limited resources and extend gemeration times to prevent this?
>>
File: ComfyUI_00578.jpg (3.79 MB, 2560x1440)
3.79 MB JPG
>>109013810
I haven't seen anything that makes me want to wade through all their bullshit to start using yet. Making Mario fist-bump Sonic or whatever just doesn't peak my interest.

>>109014168
I've discovered latent preview and can speedrun changes now. Fast iteration like that helps when there's so many dials and knobs to adjust.
>>
>>109014365
what loras are you using. I notice you didn't post a workflow
>>
>>109014412
that is actually dev + the sulphur experimental lora https://huggingface.co/SulphurAI/Sulphur-2-base/blob/main/experimental/sulphur_experimental_lora_v1.safetensors

the pony animation is being trained on top
>>
what do i put into the ideogram positive conditioning?
chatgpt tells me this is where my character descriptions should go, but i always thought the positive box was just guidance constraints?
>>
>>109014395
It works like any other machine if you are asking these questions you should educate yourself on gaming laptops and watch videos from Jarrod's tech. Typically the most you can get will be a 60ti class card with more vram at the top end skew if we talk about performance.
>>
>>109014412
example WF:

Characters:
[PRIMARY — Photorealistic woman, long dark hair, tanned skin, athletic build, wearing a tight gray ribbed tank top and black shorts, expression is playful and heavy-lidded.]

Setting: [Brightly lit indoor room with soft natural light coming from a window behind her.]

[0s–7s]
The shot opens as a low-angle close-up on her face, her hand reaching toward the camera as if setting a phone down on a surface. As her hand pulls away, the camera settles into a steady, slightly shaky handheld view looking up at her. She smiles mischievously, her chest heaving slightly, and she reaches for the hem of her gray tank top. She pulls the fabric upward in one smooth motion, exposing her breasts to the lens.
Sounds: [rustle of fabric, soft thud of a phone being placed, heavy breathing]
Woman (playful, breathless) : "My tits are so fuckin' heavy..."

[7s–13s]
The camera, now a tight close-up on her torso, follows her hands as they reach down to grab her breasts. She squeezes them firmly with both hands, her fingers sinking into the skin, then she lifts them upward toward her chin before letting them drop heavily. The framing remains tight and shaky, capturing the weight and movement of her skin.
Sounds: [slapping skin, heavy rhythmic panting, soft fleshy thuds]
Woman (lust-drunk, teasing) : "Look at them."

[13–20s]
The camera pushes in closer, the frame becoming an extreme close-up as she leans her body forward, her breasts pressing against the lens and filling the entire view. She reaches up to the collar of her gray tank top, grabs the fabric, and pulls it over her head in one continuous motion, tossing the garment to her side. The shot remains a macro-style close-up on her skin as she settles closer to the lens.
Sounds: [fabric sliding over skin, heavy breathing, soft rustle of the shirt hitting the floor]
Woman (ecstatic, breathless) : "Ahhh~ Haaah~"
>>
>>109014430
>>109014420
thanks
your examples were i2v, right?
>>
>>109014395
laptops do not have sufficient cooling to be able to maximize their psu anyway. Usually they are designed at like 150W TDP but their psu is rated for 240W. running on battery would be retarded as you would have maybe an hour of genning, also to protect the battery they rarely allow you to even pull as much power as needed from it as it heats up too much. I gen on a 4060 gaming laptop, sometimes even with only 100W via USB-C and it works perfectly fine. I get a like 10% boost on the actual PSU.
>>
>>109014446
No, all T2V
>>
File: cover4-345074123.jpg (59 KB, 800x450)
59 KB JPG
>>109014399
one of these days I'll have a workflow deserving of clever iteration methods
>>
>>109014430
a lot of retard's issues is they try to prompt LTX with "Woman shows vageen and booba, man fucks her"
when it needs much more detailed prompts. Its like ideogram but for video
>>
File: ideogram-1.jpg (416 KB, 1024x1536)
416 KB JPG
>>109014426
https://civitai.com/models/579280/new-ideogram-4-in-workflows-collection

if you have 24gb, you might oom every second try like I have, but this gathers both the gemma4 formatting for you, and works to bypass 'thinking about json', pic related is my result

I have no advice on your question, I just know from the logs that this workflow just werks
>>
When training a LoRA for a concept (object or body part) should you have all your training dataset in the same style or is it fine to have various styles?
>>
>>109014462
have you tried eros? Is just using the lora better? I know eros is purely for i2v
>>
>>109014521
just use the new experimental lora, its night and day better. I merged it into dev so it dosent have to load it each time
>>
>>109014399
It doesn't just know Mario and Sonic. It knows a shit ton of characters. You can make those characters interact with each other by just smashing their bboxes together. Someone just needs to create good NSFW loras and you can essentially make lewd fancomics.
>>
>>109014517
do you want to train a concept lora or a style lora
>>
>>109014517
you for sure want separate styles or it will associate whatever with that style as well
>>
File: 00004-2708940517.png (2.03 MB, 1920x1472)
2.03 MB PNG
>>109011332
only reddit moved to ideogram.
>>
>>109014490
>retard's
>>
can someone explain ideogram, is it actually local, why are so many people saying they are getting blocked by safety filters on a local program
>>
>>109014365
>LTX blows away wan, both quality
>Post absolute dogshit daz3d looking examples that get mogged by even the shittiest wan 2.2 gens
Of course it has to be a ponygger
lmaooooooo
>>
>>109014545
yes and because people are stupid, it works fine as long as you follow its prompting format. Use kijai's nodes which does it for you.
>>
>>109014557
https://files.catbox.moe/h2nqay.mp4
show me these gens
>>
nasty ahh
>>
File: 00009-3457505663.png (2.18 MB, 1920x1088)
2.18 MB PNG
anima will never surpass sdxl shitmixes.
>>
>>109014558
is there a comfy workflow for that I can try, and is this another sidegrade to flux/qwen/zimg or should I actually care and try it
>>
>>109014568
Workflow?
>>
>>109014627
https://pastebin.com/VU0PcdtS
yes, its legit better than closed source models. I can hardly believe they actually released the weights
>>
>>109014627
ive just now been trying it out, its fucking insane. that being said i've been out of the loop since z-image base so maybe its not that much better compared to other recent models.
>>
>>109014656
nah, everything has been trash till this, amina is ok but only for anime
>>
Do controlnets exist for anima yet?
>>
can i be runnings the ideagram on 4 gb?
>>
>>109014679
yes saar
>>
so much ideogram shilling but ive yet to see anything that makes me want to try it
people have only posted stock images and posters so far, is that all the model can do?
>>
what's the vram requirement for ideogram?
>>
>>109014693
Not sure about absolute minimum but it runs on 12gb.
>>
File: 00013-2517934150.png (2.65 MB, 1920x1088)
2.65 MB PNG
>>
>>109014691
>shilling ideogram
for good reason. Its the first model actually on par or better than closed source now. And check reddit for tons of examples

https://www.reddit.com/r/StableDiffusion/comments/1u13gqu/some_cinematic_ideogram_4_tests/

https://www.reddit.com/r/StableDiffusion/comments/1u0e1g0/ideogram_40s_understanding_of_characters_and_ip/

https://www.reddit.com/r/StableDiffusion/comments/1u0n1fy/cant_stop_using_idg4/

https://www.reddit.com/r/StableDiffusion/comments/1u0m7hw/some_ideogram_4_results/

https://www.reddit.com/r/StableDiffusion/comments/1u037qa/crt_screen_game_on_ideogram4_on_civitai/

https://www.reddit.com/r/StableDiffusion/comments/1tzva1m/ideogram_4/
>>
>>109014693
>>109014711
I can confirm it runs on 8 too, 2.5-3 minutes for 1 megapixel
>>
>>109014724
>gpt-tier baked slop
>omagawd le mario and sanic **soifaces*
>>
>>109014724
theres just something very off putting about that model
i suspect that its entire training data was just stock photos and movie screenshots because nothing feels "real" about any of the images
>>
>>109014724
legit better prompt following than gpt image 2 while looking better than z image. The quality is insane
>>
File: 00021-3354509339.png (2.73 MB, 1856x1280)
2.73 MB PNG
>>109014724
that model is heavily being shilled hard by the bots and paid influencers. The complete 180 degree flip-flop is not genuine sentiment at all.
>>
File: 54875.png (154 KB, 236x723)
154 KB PNG
>2026
>every single thread
>replying to bait still
oh hell naw
>>
I am a creative man, I can't be satisfied with 1girl gens forever
I need to make stories!
>>
so the full weights for ID4 are not out yet? guess ill wait before trying to train for it.
>>
>>109014772
have you been on discord? all the big model trainers are hyped about it, its literally all people everywhere are talking about
>>
>>109014783
what models have they trained?
>>
>>109014781
training the fp8 works fine

>>109014788
they are still exploring the best way to handle the cond and uncond but a quick tune was shown off on lonestone's server. It was learning penis / simple sex acts super quick
>>
cant wait for the aitoolkit update so i can train a lora for id4
>>
>>109014788
>>109014799
https://files.catbox.moe/iam5k7.png
this was not that many steps done incorrectly on it
>>
bake new NOW!!!!!!!!
>>
>>109014799
>they are still exploring the best way to handle the cond and uncond
Qrd on the current consensus?
Seems like it can a bit messy to train.
>>
>>109014799
you said all the big model trainers are hyped about it. what models have they trained previously? you did not answer my question
>>
>>109014799
>they are still exploring
Heh, local, not even once.
They will forget it like Z Image, like Qwen, like Klein,
>>
>>109014822
lonestone has like 4 models training, imo he should abandon all those and focus on ideogram. And stop trying to do fancy model modifications. Just train the damn model itself
>>
>>109014817
Damn. We might eat good with this model in the future.
>>
>>109014821
https://x.com/ostrisai/status/2063199111525052741
the ai-toolkit guy likes it. he has trained:
>Image
black-forest-labs/FLUX.1-dev (FLUX.1)
black-forest-labs/FLUX.2-dev (FLUX.2)
black-forest-labs/FLUX.2-klein-base-4B (FLUX.2-klein-base-4B)
black-forest-labs/FLUX.2-klein-base-9B (FLUX.2-klein-base-9B)
ostris/Flex.1-alpha (Flex.1)
ostris/Flex.2-preview (Flex.2)
lodestones/Chroma1-Base (Chroma)
Alpha-VLLM/Lumina-Image-2.0 (Lumina2)
Qwen/Qwen-Image (Qwen-Image)
Qwen/Qwen-Image-2512 (Qwen-Image-2512)
HiDream-ai/HiDream-I1-Full (HiDream I1)
OmniGen2/OmniGen2 (OmniGen2)
Tongyi-MAI/Z-Image-Turbo (Z-Image Turbo)
Tongyi-MAI/Z-Image (Z-Image)
ostris/Z-Image-De-Turbo (Z-Image De-Turbo)
stabilityai/stable-diffusion-xl-base-1.0 (SDXL)
stable-diffusion-v1-5/stable-diffusion-v1-5 (SD 1.5)
baidu/ERNIE-Image (ERNIE-Image)
NucleusAI/Nucleus-Image (Nucleus-Image)
HiDream-ai/HiDream-O1-Image (HiDream O1)

>Instruction / Edit
black-forest-labs/FLUX.1-Kontext-dev (FLUX.1-Kontext-dev)
Qwen/Qwen-Image-Edit (Qwen-Image-Edit)
Qwen/Qwen-Image-Edit-2509 (Qwen-Image-Edit-2509)
Qwen/Qwen-Image-Edit-2511 (Qwen-Image-Edit-2511)
HiDream-ai/HiDream-E1-1 (HiDream E1)

>Video
Wan-AI/Wan2.1-T2V-1.3B-Diffusers (Wan 2.1 1.3B)
Wan-AI/Wan2.1-I2V-14B-480P-Diffusers (Wan 2.1 I2V 14B-480P)
Wan-AI/Wan2.1-I2V-14B-720P-Diffusers (Wan 2.1 I2V 14B-720P)
Wan-AI/Wan2.1-T2V-14B-Diffusers (Wan 2.1 14B)
Wan-AI/Wan2.2-T2V-A14B-Diffusers (Wan 2.2 14B)
Wan-AI/Wan2.2-I2V-A14B-Diffusers (Wan 2.2 I2V 14B)
Wan-AI/Wan2.2-TI2V-5B-Diffusers (Wan 2.2 TI2V 5B)
Lightricks/LTX-2 (LTX-2)
Lightricks/LTX-2.3 (LTX-2.3)

>Audio
ACE-Step/Ace-Step1.5 (Ace Step 1.5)
ACE-Step/acestep-v15-xl-base (Ace Step 1.5 XL)

>Experimental
lodestones/Zeta-Chroma (Zeta Chroma)
ideogram-ai/ideogram-4-fp8 (Ideogram 4 FP8)
>>
>>109014845
>single lora
>zero bleed
>Regional captioning and prompting is amazing
Big if true.
>>
did they add reference images to ideogram yet?
>>
>>109014819
for loras same lora trained on both, the amount is still being tested
>>
does ideogram take reference images?
>>
File: 1759797895275459.png (3 KB, 500x500)
3 KB PNG
>109014845
holy shill melty
>>
>>109014845
so one guy is excited about it

>all the big model trainers are hyped about it

yeah ok. lol, lmao
>>
>>109014845
he meant tunes, so like lodestone's chroma, realistic wtf its called guy and others
>>
>>109014858
why does it matter?
the fact is that everyone is hyped about id4 and people are posting crazy good results
>>
>>109014865
he is trolling, that is why I did not respond
>>
>>109014865
the only local model that matters as far as i'm concerned is anima. if something is trained on this using the booru dataset and produces better results, i'll care

>>109014869
i hope you're getting paid
>>
File: 1759243031958404.png (1.42 MB, 1024x1344)
1.42 MB PNG
can i train loras for ID4 anywhere yet?
>>
>>109014874
so you're a vramlet then? That's the issue? And yes, a corporation is paying people to shill making porn off of their model breaking their TOS / license, of course.
>>
Shill or the roach himself?
>>
>>109014885
any model that hasn't been trained to generate specific artists is worthless
>>
File: ComfyUI_ayakon_.jpg (3.14 MB, 2120x3277)
3.14 MB JPG
>>
File: 1631394802022.png (1.74 MB, 701x843)
1.74 MB PNG
Reminder that ai toolkit is total garbage made by a patreon scammer.
>>
Can anima imitate Frank Cho?
>>
its looking like ideogram is far less censored than we thought. This is with the unmodified model:
https://files.catbox.moe/4mi0gw.png
>>
File: 7jtjqs.png (113 KB, 526x277)
113 KB PNG
>>
>>109014931
white women are supposed to have pink holes, not black holes
>>
>>109014961
the top one is hot
>>
>>109014931
there's no way it gapes out of the box
>>
File: 46785.gif (1.94 MB, 320x222)
1.94 MB GIF
>>109014974
>>
>>109014928
No but shouldn't be difficult to train a lora in his style.
I might train one myself one day.
>>
>>109014975
it can do giant boobs with veins, facesitting, cameltoe, it wouldn't surprise me. But all I've used is prompting. I'd like to know exactly how you get genitalia like that
>>
>>109014994
This proves that censoring a model makes it worse. The model is incredible and even knows porn, it was just buried in the model in a smart way under images with the naughty bits made featureless. That is likely why it trains it right back in so fast
>>
>>109014923
nooo not the heckin patreon attached to free software
fuck off moron, let devs make money in the least obstructive way possible if they are providing free stuff
>>
>>109015019
i don't know much about how it works, but i think when they censor it, it basically tells the future trainers exactly what weights need to be flipped around
>>
>>109014984
I wish girls like these were real.
>>
File: saria watches direct.jpg (342 KB, 1353x1047)
342 KB JPG
>>
>>109015067
who is this
>>
bruh I thought the thread 404'd because it shows 404 posts in red lmao
>>
File: 3556678.gif (3.88 MB, 320x222)
3.88 MB GIF
>>109015056
>>
File: ideo-2.jpg (350 KB, 1680x944)
350 KB JPG
some sour grapes ITT! Join the bandwagon, Zimage is abandoned.

I don't like the excessive veins all the time though, but it's a good indication of detail output
>>
>>109015085
I am not baking because I feel like but the time I am ready to post someone else has already baked the new one.
Happened more than once.
>>
bros what's the current model meta for anime diffusion?

I see in the OP post links to anima and an outdated meta link in which illustrious etc are recommended.

VRAM is not an issue, thank you!
>>
>>109014931
>uncond manipulation
He is doing what with the uncond model precisely?
>>
>>109015147
Anima.
Most of the rentries are indeed outdated.
>>
>>109014931
Bottom image is from bigASP 3 and neither gen was posted by silveroxide
>>
Fresh
>>109015348
>>109015348
>>109015348
>>
someone else bake so we can bully troonjak out of the thread



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.