[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Perfidious, Manipulative, Cock Sucking, Brain Damaged Midget With A 15 Inch Dick Edition

Discussion and Development of Local Image, Video, and Music Models

Previous: >>109020001

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>109024741
https://rentry.org/LDG_vital_info
>>
long dick general
>>
File: z_00089_.jpg (335 KB, 912x1336)
335 KB JPG
>>
Blessed thread of frenship
>>
>>109024687
Listen man, I know how to troubleshoot, i just don't know why or what the fix does, it just fixes it.fuck blackwell cards.
>>109024716
I think if i run some shitty sd1.5 model it will be fine on top of a llm, but i see your point.
i would probably only be able to run a 13b model
>>
Is there a finetune of Ideogram4 that gets rid of the piss filter yet?
>>
>>109024708

what is the point in lying about something that is demonstrably false by every conceivable metric?

other than misleading people that don't know anything, like some kind of perfidious, manipulative, cock sucking faggot?

current anima is closer to pony that it is to current illustrious
>>
>>109024819
anon you need to relax
>>
>>109024819
>what is the point in lying about something that is demonstrably false by every conceivable metric
indeed, what is the point anon?
>>
now that local is dead, what the fuck do we do now??
>>
File: igram.jpg (388 KB, 1584x1264)
388 KB JPG
>>109024808
i don't think so, we just use a workaround with json bboxes
>>
>fudnon is back
>>
>>109024827
i enjoy the part where he provided demonstrations of the thing that is demonstrably false
>>
>>109024831
get in the circle, jerk to the right
>>
Thinking about how long after their release that random anons would do seethe drive by posts about how illust sucked and pony was better or (even further back in history) how pony sucked and whatever 1.5 shitmix was better. I wonder how long that'll go on for with anima until they come to terms.
>>
>>109024853
i don't need to post literally fifty percent of civit in this thread so that you can deny it all the same, fag
>>
>>109024819
How the fuck are you prompting it if you actually think this
>>
File: z_00113_.jpg (443 KB, 912x1336)
443 KB JPG
>>
>>109024883
1girl, standing in the positives
lowres in the negatives
Euler A 20 steps
Nothing else
>>
>>109024891
>>
>>109024891
yeah but you're literally describing anima though
>>
>>109024870
>civitai jeets have terrible taste
STOP THE PRESSES!
>>
>>109024869
yeah this hypothetical thing that never happened because pony was total dogshit would be a great rebuke to what i'm saying if it were true and it would have ever happened
even the biggest pony users said it's shit
>>
>>109024927
You can just say you weren't around during that time it's okay anon we won't judge you
>>
>>109024833
Is that supposed to be an example? Not shitting on you but that still has the essence of the piss filter on it. Sorta reminds me of when NoobVpred dropped and anon kept coping by doing post processing on the earlier version saying it's just as good when clearly it was not.
>>
File: AnimaVsIllustrious.png (2.56 MB, 1664x1216)
2.56 MB PNG
>>109024891
I figured I'd actually try this kek
They both produce basically extremely different compositions / styles / environments / etc every time you run it on a new seed, which is better than them having low variety I guess
>>
File: 944175595580848.png (1.85 MB, 1152x1472)
1.85 MB PNG
>>
File: z_00118_.jpg (309 KB, 912x1336)
309 KB JPG
>>
>>109024967
Anima mogs here ngl
>>
File: 796325632740220.png (2.11 MB, 1152x1472)
2.11 MB PNG
>>
>mfw Resource news

06/10/2026

>EvoQuality: Self-Evolving VLM for Image Quality Assessment
https://huggingface.co/ByteDance/EvoQuality

>ComfyTV: Turn ComfyUI into a TapNow / LibTV-style canvas app
https://github.com/jtydhr88/ComfyTV

>PathRelax: Parallel-Path Relaxed Speculative Jacobi Decoding for Accelerating Auto-Regressive Text-to-Image Generation
https://github.com/Haodong-Lei-Ray/PathSpec

>SSR-Merge: Subspace Signal Routing for Training-Free LoRA Merging in Diffusion Models
https://github.com/nagara214/SSR-Merge

>SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning
https://teal024.github.io/SCAIL-2

>IDEAL: In-DEpth ALignment Makes A Discrete Representation AutoEncoder
https://github.com/Row11n/IDEAL

>Image to Prompt: Web app to turn an image into Ideogram 4 JSON prompt
https://github.com/cocktailpeanut/image-to-prompt

>Simple Diffusion XS (sdxs-2b alpha version)
https://huggingface.co/AiArtLab/sdxs-2b

>Bernini-R: Repackaged model files for ComfyUI
https://huggingface.co/Comfy-Org/Bernini-R

06/09/2026

>SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning
https://teal024.github.io/SCAIL-2

>BLM-SGAN: Bidirectional Language Modeling for Semantic-Spatial Text-to-Image Generation
https://github.com/haidy-maher/BLM-SGAN-Text-to-Image-Generation

>SwiftVR: Real-Time One-Step Generative Video Restoration
https://h-oliday.github.io/SwiftVR

>Property-Informed Diffusion-Based Text-to-Microstructure Generation
https://github.com/hongsong-wang/PropDiff-TMG

>OmniTryOn: Video Try-On Anything at Once!
https://github.com/xcltql666/OminTryOn

>IEA: Amateur-Friendly Conversational Image Editing Agent via Three Stages of Multitask Alignment
https://github.com/OpenDFM/Image_Edit_Agent

>CapRL++: Unified Reinforcement Learning with Verifiable Rewards for Dense Image and Video Captioning
https://github.com/InternLM/CapRL
>>
>mfw Research news

06/10/2026

>HarmoView: Harmonizing Multi-View Constraints for Identity-Consistent Video Generation
https://conallwang.github.io/HarmoView_Pages

>STEDiff: Strengthening Text Embedding for Text-to-Image Alignment in Diffusion Model
https://arxiv.org/abs/2606.10653

>FadeMem: Distance-Aware Memory Consolidation for Autoregressive Video Diffusion
https://arxiv.org/abs/2606.10671

>Making Time Editable in Video Diffusion Transformers
https://arxiv.org/abs/2606.10183

>BiWM: Advancing Open-Source Interactive Video World Models with Bidirectional Autoregression
https://arxiv.org/abs/2606.10135

>Mean Flow Distillation: Robust and Stable Distillation for Flow Matching Models
https://arxiv.org/abs/2606.11155

>Improving Text-Instance Alignment Of Foreground Conditioned Out-Painting Via Customized Concept Embedding
https://arxiv.org/abs/2606.10892

>FoA-SR: Faithful or Aesthetic? Profile-Aware Preference Optimization for Real-World Image Super-Resolution
https://arxiv.org/abs/2606.10275

>Dissect and Prune: Enhancing Robustness in AI-Generated Image Detection
https://arxiv.org/abs/2606.10309

>AnimaSpark: A Feed-Forward Method for Animating Arbitrary 3D Objects
https://arxiv.org/abs/2606.10988

>Lip Forcing: Few-Step Autoregressive Diffusion for Real-time Lip Synchronization
https://cvlab-kaist.github.io/LipForcing

>PTL-Diffusion: Manifold-Aware Diffusion with Periodic Terminal Laws
https://arxiv.org/abs/2606.09816

>Evaluating the Representation Space of Diffusion Models via Self-Supervised Principles
https://arxiv.org/abs/2606.09718

>Few-step Generative Models as Lossy Compression
https://arxiv.org/abs/2606.10450

>Pose-ICL: 3D-Aware In-Context Learning for Pose-Controllable Subject Customization
https://arxiv.org/abs/2606.10902

>Don't waste SAM
https://arxiv.org/abs/2606.10696
>>
>>109024967
>>109024967
cool you used the base that i already said anima was better than
now use literally any finetune/merge/checkpoint more recent than q2 2025, i'll let you pick one with under 500 downloads
>>
>anima only one finetune away from its base model (cosmos)
>illustrious was two finetunes away from its base model (SDXL)
I was going to finish this green text but got bored halfway through since this thread has already discussed it a million times
>>
>>109024741
can zimage generate porn images. I cannot make a woman gag on a cock. the dick does not show up.
>>
> >109025001
> >109025007
fuck off debo
>>
>>109025043
welcome to every non-anime model out there. animegods spent decades curating boorus just for this moment, while realismbrowns grovel for onlyfans leaks
>>
File: Ideogram_4.0_00007_.png (1.61 MB, 1024x1024)
1.61 MB PNG
Shouldn't this model be beyond body horror. Often get more people than what was prompted too. Sumtin Wong.
>>
>>109025035
i didn't say anima doesn't have the potential to become better than illustrious through community support, i'm saying it's been almost half a year and you have to be generous to say that we're close, and people here unironically believe anima is not close to becoming equal, but somehow categorically better, which is just wrong
i don't remember the last time an illust batch gave me gore, i have yet to see an anima batch that didn't
>>
>>109025073
I say skill, you say issue
skill
>>
>>109025058
is this the fucking state of ai on /g ?
>>
>>109025078
ISSUE!
>>
File: z_00133_.jpg (260 KB, 984x1264)
260 KB JPG
>>109025043
needs lora, very easy to make for base z
>>
>>109025060
Did you use a separate bbox for face?
>>
>>109025130
where do i get porn lora. It was not in civitai. Or should I switch to flux. I am sure you are not using lora for zimage are you?
>>
Anybody here who's trained Ace-step Loras? How hard is it?
>>
>>109025151
Use Chroma instead. If you wanna use Zbase for porn etc you better be prepared to train stuff yourself
>>
>>109025131
Nope.
>>
>>109025168
A better method is to use ZImage to generate the image, then use Chroma with bbox penis detector to fix the genitalia. it works very well. Penetration might be more tricky since ZImage without NSFW loras will make it so shit the detector might not recognize it
>>
File: 678777414732766.png (1.78 MB, 1152x1600)
1.78 MB PNG
>>
File: z_00137_.jpg (400 KB, 984x1264)
400 KB JPG
>>
>>109024967
why do people use either of this?
>>
File: z_00139_.jpg (300 KB, 984x1264)
300 KB JPG
>>109025194
Too bad there are so few finetunes
>>
>>109025073
i'm saying that anima can't do much more than boorish pixiv waifu slop or degenerate "anime-inspirted" instagram (((art))) (100% of everything ever posted in this thread) and then having the nonspecific sex that breaks hard if it's made to follow any kind of directorial vision
prompt adherence for anything other than multiple characters is worse
illustrious with multiple subjects(not characters) is better than anima with multiple subjects, and that's going to limit how useful anima's identity retention is in a zero sum way
it has serious problems with framing in general as well, it barely responds to view angle/composition tags that fall out of 5pp EPIC/vanilla pinup pixiv dreck, everything is too zoomed out or too zoomed in and seems to care very little for medium focal length, you know, the thing you need for 90% of shit
it is better at TWO things, spoken text and correctly collecting tokens for multiple defined characters without loras; it is as good at some things, and worse at more than some things, and many of those won't necessarily improve with finetunes
by all means, if this sounds like something caused by me not using it correctly, i hope i've given you enough of a pattern for you to tell me how to get better results
>>
File: ComfyUI_temp_hbzkm_00001_.png (3.26 MB, 1152x1920)
3.26 MB PNG
>>
yeah man sorry but literal skill issue
>>
>>109025242
>illustrious with multiple subjects(not characters) is better than anima with multiple subjects
what
lol
lmao
>>
Is this >>109024737 true?
All my workflows work better with sage anyway so I never bothered with flash but I wanted to try idiotgram and sage doesn't work with it
>>
File: z_00140_.jpg (432 KB, 984x1264)
432 KB JPG
>>
>>109025280
you know what they say about girls with big wingspans
>>
>>109025264
anima trades its compositional capabilites for character retention
anima is very good at putting characters in the scene, worse at making them do things
illustrious is very bad at putting characters in the scene, better at making them do things
>>
>>109025001
>>109025007
Why do you post this to an audience of none?
The anons that you hate don't even post here anymore
>>
>>109025315
long sleeves?
>>
>>109025017
>finetune/merge/checkpoint
I know you really just mean shitmixes and to that I say lol
>>
>>109025242
>>109025317
i like how you still have not posted any comparison images to support your argument
>>
>>109025327
i don't care if you call them my wife's newborn nigger child, they're all better than anima right now
>>
>"finetunes will fix it!"
>finetunes never arrive
>sell out to api
brilliant psyop by big china, localkeks fell right for it
>>
no surprise
>>
>>109025343
proof?
>>
File: 4Way.jpg (1.74 MB, 3328x1216)
1.74 MB JPG
Did the comparison again with a slightly more favorable negative on all models. Used the same seed for all of them too. TLDR if you want "nice" images to be output with totally arbitrary elements that the prompt doesn't mention then just use the Anima Turbo Lora lol
>>
z image edit when?
>>
>>109025378
never ever
>>
>>109025372
you have to prompt for hair, eye color etc to make a comparison that makes sense. also the turbo 2 is better than the first one imo
>>
>>109025378
I have it
>>
>>109025372
>turbo looks like sd1.5 sloppa
>just use that!
the absolute state
>>
>>109025343
yeah nah im never going back to a shitty 4ch VAE
>>
>>109025383
Well yeah that's the point of a turbo lol. You can run it lower, like 0.7ish, to keep the low step / 1 cfg ability but bring back more variety though.
>>
torch for cu133 when?
>>
File: z_00142_.jpg (382 KB, 984x1264)
382 KB JPG
>>109025315
we enjoy large tracts of land
>>
>>109025406
no, the point is to make images faster. it still has prompt adherence
>>
Why would I want a model that clearly looks ai generated, uses an antiquated vae, and adds random shit to the output that I didn't prompt for though? Honestly asking
>>
>>109025396
You misunderstood me, a lot of people do not in fact want there to just randomly be e.g. a whole ass beach that the extremely short positive prompt never talked about
>>
>>109025419
Nothing I said implied it didn't have prompt adherence. But you're still wrong, distillation requires trading off variety and flexibility for quality to be able to do the cfg 1 / low step thing. This is why Turbo models often look better than their bases in the average case.
>>
File: z_00154_.jpg (391 KB, 1336x912)
391 KB JPG
>>
>>109025383
Where is your comparison? You've spent like over two hours bitching and moaning about how illust shitmixes are better without a single output from yourself. I have a hard time believing you're not trolling.
>>
>>109025494
not only is that person not me, but he's also trying to defend anima
>>
>>109025478
woah
>>
File: debo_vn_fia_00071_.png (2.37 MB, 1792x977)
2.37 MB PNG
>>109025319
everyone appreciates having the latest news available at a glance, especially in threads with a very high noise-to-signal ratio where its hard to find useful posts amongst all the
>I hate you and wish you would die
nonsense
>>
>>109025532
You spam links to jeet papers. Not seeing this spam would be better.
>>
>>109025319
i do read it, unironically kill yourself
>>
>>109025426
what you're describing is exactly how i would describe qwen
i find it takes control away and creates random bullshit far more frequently, SDXL just tends to lock up on denoising or print a completely different style of image when it fails, which i prefer because i at least know that there's a serious issue with the prompt; i feel like i'm left guessing with qwen much more
>>
>>109025319
i also read it, and I'm neither of the other guys quoting this post
>>
>>109025548
Ok fag put the phone away
>>
File: 213633CUI_00002_.png (1.65 MB, 1152x1536)
1.65 MB PNG
>>
>>109025246
Lynchian, holy hell
>>
>>109025532
You wanted to play this stupid game so enjoy the prize and now fuck off debo
>>
>>109025532
>I don't like how anon acts in this thread so I post two to three full length posts where only one or two links in them may be relevant at the start of every thread
lmao
>>
File: 00015-1026850138.png (2.22 MB, 1824x1248)
2.22 MB PNG
>>109024819
sdxl community finetunes are way better than anima in its current state.
>>
>>109025646
kek he mentions noise-to-signal when his "news" posts are unironically 90% nothing burgers
>>
please stop referring to shitmixes as finetunes it is disingenuous and equates the skill of tuning to the brainless slop of mixing
>>
File: anima in 2028.jpg (3.35 MB, 4096x1024)
3.35 MB JPG
>>109024967
results when you don't deliberately go out of your way to sabotage illust, 4 random checkpoints
>>
>>109025716
slop times four
>>
My computer is hot. My gens are not.
>>
>>109025532
Why do you post here when you still have a thread that you wanted?
Why are you still doing this after all these years?
>>109025648
Which ones?
>>
>>109025666
>equates the skill of tuning to the brainless slop of mixing
They don't understand the difference is the thing
>>
>>109025772
there are brainless tunes and there are skillful mixes
>>
trellis2 is still pretty much the thing to use for 3d models right?
>>
File: 1766188361584200.jpg (2.78 MB, 2048x3072)
2.78 MB JPG
>>
>>109025739
oh no the retarded shit eating nigger called my one token prompt slop, i guess the blurry smear is better after all
>>109024870
>>109024870
>>
he mad
>>
dude just get a better gpu you don't even need a crazy vram for anima
>>
>>109025809
cutie
>>
**********
**********
**********
*****
>>
>>109025783
>brainless tunes
true
>skillful mixes
oxymoron
>>
>>109025716
Were you too scared to label which is which or were you too stupid
>>
>he doesn't merge scat loras to improve anatomy
>>
>>109025822
why would anybody with a better gpu use anima?
>>
>>109025866
Look at the low quality of those gens and his arguments. You know who this is.
>>
File: 1763610612906130.webm (880 KB, 480x832)
880 KB
880 KB WEBM
>>109025809
oh dang bernini actually pretty good
>>
so who is finetuning ideogram...?
please don't say kekstone
>>
Is whatdreamscost-comfyui safe? Why won't manager let me install it without security warnings?
>>
>>109025915
Woot. Isnt it lightweight? Looks decent
>>
>>109025917
he is
>>
>>109025917
Yes, he has 200gb dataset of canine penises training with 128×128 res
>>
>>109025922
nothing is safe
>>
>>109025917
some dude nobody has heard of with an actual job using his six month savings
as it always is
>>
>>109025716
It was the literal same prompt with the other anon's joke verbatim for positive / negative / steps / sampler, on both. I already did a second comparison where I used a more forgiving negative on all models, also.
>>
>>109025934
Indeed:
>>109024539
>>
File: ComfyUI_temp_kgupm_00038_.png (2.45 MB, 1088x1792)
2.45 MB PNG
>>
File: 1774325419918352.jpg (2.99 MB, 2048x3072)
2.99 MB JPG
>>109025915
Ngl, that is pretty neat.
>>
>>109025917
AstraliteHeart
>>
File: ComfyUI_temp_lheqp_00007_.png (2.85 MB, 1088x1792)
2.85 MB PNG
>>
>>109025915
I've heard a lot about bernini except how big it is and I used the last remaining motivation to look it up to make this snarky post so tell me.
>>
>>109025319
>>109025548 (you)
>>109025546 (you)
I also read it, so fuck off
>>
>>
>>109025997
yes, so were mine, verbatim

euler a/simple, 20 steps, one sampler, no upscaling, no post, 10 second gen
1girl, standing
-
lowres
this is just what a normal quality current illust popular checkpoint does
>>
>>109025001
>>109025007
unemployed thread schizo
>>
File: ComfyUI_temp_xsnsm_00009_.png (2.65 MB, 1088x1792)
2.65 MB PNG
>>
>>109026061
i did forget to match the resolution but i assure you portrait aspect ratios would make even better images, certainly not worse
>>
Is this thread always this useless and unhelpful or did I show up at a bad time?
>>
File: cui gen_00009_.png (533 KB, 512x512)
533 KB PNG
I tried using ComfyUI like once and I genned some pretty bad but funny things. I miss the old DALLE1 looking shit, its an aesthetic I think that isn't tapped into well. What are your favorite or best protoAI/funny weird looking gens?
>>
>>109026010
>>109026022
>>109026059
>>109026089
Im literally leaking right now
>>
>>109026089
i wish I could figure out how tf to use comfyUi without it turning into a jumbled dog shit mess
>>
>>109026098
Why not both?
>>
>>109026098
We're at that weird point where something interesting happened recently (ideogram 4) so there's a burst of activity, but we've exhausted discussion until the tools, loras and finetunes start coming out, so all the spergs are busy having melties at each other (it's how they flirt)
>>
Do you know any cool models and loras, something more tool oritented, like pose models, upscalers, unblur, something that increases image quality or things that aren't obvious at first.
>>
File: ai6.png (1.03 MB, 769x834)
1.03 MB PNG
>>109026103
not local but still absolute kino (also I saved this file in June 2022, can't believe how far we've come)
>>
>>109026139
You're probably in more need of honing your prompting skills than anything else
>>
>>109026139
>negpip
>cfg normalization
>shift scheduling
>a good prompt
desu all u need
>>
>>109025931
>>109026025
same cost as wan22, so if you can run that you can run bernini
>>
>>109026142
they're all clearly fucked up, but it's amazing what coloring a human face weirdly can do to make it look scary or concerning.
>>
"Princess Leia is bent over a table wearing her iconic Slave Leia costume looking shocked while a monstrous terrifying fat sluglike grotesque green creature resembling Jabba The Hutt holds her waist as it has aggressive standing rough sex from behind with her. The feeling of the creature's penis entering her from behind appears to cause Princess Leia significant pain."

https://files.catbox.moe/d8wcjj.png

No Illustrious checkpoint will actually generate precisely this for the same prompt. But with Anima Turbo I can do 12 steps gen + 12 steps high res denoise and get it even faster than I would even if any Illustrious checkpoint just knew what Slave Leia looked like inherently. There's just tons of little reasons like this I'm never going to use Illu again when Anima (and particularly Anima Turbo) is a thing now.
>>
File: ComfyUI_temp_axxiq_00004_.png (3.14 MB, 1088x1792)
3.14 MB PNG
>>
>>109026142
if you tried this prompt on any modern 'intelligent' model, you'd a result 100x less accurate btw
technology has regressed for the sole purpose of allowing indians to generate infographics
>>
File: 234249CUI_00002_.png (1.97 MB, 1152x1536)
1.97 MB PNG
>>
>>109026200
>>109025588
Hello Catjack
>>
>hard drive anheroed and deleted all my workflows
>>
its pretty nuts how hard and how long anon has tried to kill this thread yet it remains the place on 4chinz for local image generation
>>
Has anybody attempted any nsfw loras on ideogram? I did, and it did not go well. I'm surprised considering how it took my style lora. I had the idea of bboxing specific body parts in the captions with the idea that it would help it learn but it really didn't.

The much-hyped output censoring turned out to be nonexistent in my experience, but I wonder if it has some brain damage that makes nsfw harder to train. I'm not a great lora trainer so maybe somebody else can pull it off
>>
File: 69673605.png (2.99 MB, 1232x1632)
2.99 MB PNG
lol
>>
>latex in the negs

lifesaver
>>
>>109026056
excuse me?
>>
>>109026261
there's this, which looks alright
https://huggingface.co/RazzzHF/Realism_Engine_Ideogram_4/tree/main
i think some furries also messed around with nsfw loras. what isn't going well? are you looking for just nudes or sex?
>>
am i retarded, i cant get anima models to work on comfy
what the fuck
>>
Anyone know what the best perceptual hashing algorithm is ?

Tried phash and pdq, they're decently fast but the results are kinda crap, where images taken in sequence with hardly any difference has a higher distance than totally compositionally unrelated images, like wtf.

I don't care if it is slow, as long as it does a good job.
>>
>>109026283
Pretty much just nudes. Gotta have pubes though so I practically have to train it myself because most people are philistines.
>>
>>109026288
What do you mean by work? It's generating garbage? Nothing? The workflow is failing?
>>
File: 4.png (2.3 MB, 1232x1632)
2.3 MB PNG
>>109026288
considering there's a working workflow in their sample image yes indeed you are
>>109026299
if you want to do what I think you want to do look into this https://github.com/verlab/accelerated_features
>>
>>109026310
Tried multiple workflows from civitai because i realised just plugging the model into my basic bitch one wouldnt work, they are all so convulted and all require 10+ diff things i dont have
so i get all those things, then i just get more errors and bs
>>
>>109026315
>workflows from civitai
there's your problem

>>109026312
>workflow in their sample image
Use this or the template included in comfy
>>
tried making an xl character lora for 3 days and still can't get it right, do i just nut up and take the easy way out and not do xl...
>>
>>109026333
If you can't even train xl I got bad news for you...
>>
>>109026333
Skill issues. There are like 2 dozens LoRa config floating on the web, what the fuck are you doing?
>>
>>109026341
huh?
Isn't Zit training like super easy?
>>
>>109026253
I'm confused to why he's posting here when he has a full general and all of his enemies stopped posting.
What's the point?
He has even moved to anon posting bait like this
>>109025716
When it's obvious it's his prompting style as shown in
>>109025532
>>
>>109026352
I'm using this one, https://www.youtube.com/watch?v=d4QJg4YPm1c
but its like two years old, maybe i should look harder.
>>
https://rentry.org/LDG_vital_info
>>
>>109026384

First of all, nobody use the official SDXL release around here. It's shit. Broken anatomy, censored data, etc. Use a tuned checkpoint instead.
>>
>>109026385
What does this prove when we have a documented malware spreader actively spamming this general after forcing us to migrate here?
>>
>>109026384
by "sdxl" i assume you mean illustrious/noob or something, right..?
if so, surely you've used a character lora before and though "damn this one is good, i want to train my own". well most loras contain a lot of the training metadata and you can just copy their params. i think forge tells you, but I also use this to inspect them https://lora-inspector.rocker.boo/
for tagging, you need to use booru tags if training illust, and you need a character activation tag. i usually pin mine to the front and make sure to either set it as a protect tag or set keep tags to 1 so it doesn't drop out.
>>
>>109026410
Yea I heard, I think i used pornmasterpro or Ultra_Realistic_BSY_Illus_V2_Patreon_Pro_DMD2_ALT_C44.
but honestly i forgot since i started my training like last night and it just finished lol
>>
>>109026322
>just use the shitty template from comfyUI
that's even worse advice than just using random civitai workflows
>>
>>109026174
first of all, my dude, that is a dogshit boilerplate slop image you could get without very much effort even on pony
second, not only will any popular illustrious model do a better job and offer FAR more flexibility while doing so, every, any, illustrious model knows what the slave leia outfit is, which tells me you haven't even tried and are literally just fucking lying
illust won't make a good image for this subject matter either, but illust has freely available loras that will make perfect images in perpetuity, and it sure as shit makes images better than that on its own
>>
>>109026431
Are you retarded? It's the same as the official workflow, they just put it in a subgraph.
>>
>>109026423
yea, it was eaither pornmasterpro or an ultra realistic illustrious one i grabbed from kemono before it went down. Ill try looking for one to copy, im trying to make a realistic character.
>>
>>109026127
>we
There is no 'we', retard.
>>
>>109026459
official workflows never include any form of optimization of quality or speed
I was using WAN official workflow like a fucking retard for weeks generating videos that looked like garbage at 480p that took 2 minutes per video, 6 minutes for 720p when it didn't OOM
I started using a proper workflow and now 720p takes 150 seconds and all the ghosting and all other crap is gone and fixed.

You have to be a literal retard to use ComfyUI default workflows
>>
>>109026480
sorry I have a boyfriend
>>
File: ComfyUI_15428_.png (1.23 MB, 1024x1024)
1.23 MB PNG
>>
>>109026481
I'm not recommending them in general, I'm recommending the anima one because it's the same as the official one with a subgraph and different prompt.

It's definitely worse to download a random jeet workflow off of civitai with a bunch of custom nodes with multi-pass upscaling and 10 detailers.
>>
>>109026333
>XL
anon... its time to upgrade to a modern anime model like anima
>>
>>109026498
but you will need all of those things once you start creating polished image sets whose quality you actually care about beyond the next nut
>>
>>109026498
So don't just pick ANY random jeet workflow, look for ones with lots of downloads, get them, check if it asks you to install some random custom node with 3 stars or not, hell just open them up to see what they are doing and adapt it to your own workflow.
>>
File: br7qat.png (2.58 MB, 1024x1024)
2.58 MB PNG
>>
>>109026520
You really don't for anima. You can already generate above 1MP natively and faces aren't fucked like they are on sdxl.

Also, anon was just trying to get a working workflow because his random jeet workflow wasn't working.
>>
I'm the guy who was bitching and inventing conspiracy theories about nsfw ideogram lora training and I just realized I selected the wrong fucking dataset and didn't actually train on a single nude image. This may explain my poor results...
>>
>>109026579
>anon fucks up
>blames the model
a tale as old as time. i wont hold it against you since youve repented with this post but be aware many anons do not go on to admit their wrongdoings so you should take pride in this.
>>
>>109026579
that's a relief, excited to see the actual results then
>>
What do we think about capybara ?
>>
>>109026515
I'll switch to anima when it has as many lora's as Illust and XL, and can do realism.
I'm going to get my racist asian and huge ass streamer illust lora if it's the last thing i do.
>>
>>109026646
You could build all of that now you do know that right?
>>
>>109026584
>>109026617
Yeah, honestly now I'm kind of impressed with how ideogram took the training. What happened was that I had accidentally trained it on a style and some things in my dataset vaguely applied to things in my test nsfw prompts. I could see there were some slight changes that I thought had come from my nsfw dataset, but they were very slight and it clearly wasn't learning like I expected so I doubled the learning rate and let it train for longer. It was probably ultimately four or five times more training than it should have got, but I figured it's an unfamiliar model so I'd just experiment and keep training until I saw progress or it completely fried.

By rights it should have been completely braindead but it really wasn't. The outputs were worse than base, but not to the degree you would expect with a massively overcooked style lora active, which to me indicates that the model does a pretty good job of compartmentalizing. I trained it on a style but wasn't prompting that style, so it didn't hurt the model that badly.
>>
>>109026646
>I'm going to get my racist asian and huge ass streamer illust lora if it's the last thing i do.
A life well spent
>>
>>109026656
I'm trying to do that now, I'm still at the sucking part of learning.
>>
does the number of loras really matter tho if 83% of them are unironic jeetslop
>>
>https://huggingface.co/sensenova/SenseNova-U1-A3B-MoT-SFT
>39b
ACK
>>
File: file.png (167 KB, 512x512)
167 KB PNG
https://xcancel.com/knshtyk/status/2064610520452440558
>One of the reasons why AI-generated images are "instantly recognizable" is this: when you extract the spatial frequencies in the image by filtering them into frequency bands using a tool, the amount of information is distributed too uniformly across the entire screen. It's a property derived from the latent space and the process, but it differs greatly from the state where, like in human creations, intentional craftsmanship resides in parts, resulting in variations in information density.

Are there any tools available to get around this?
>>
>>109026712
The answer is no. The barrier to training a LoRA is unironically so low and the LoRAs created so hyper specific to the needs to the creator that it's baffling that people even care beyond a few select general purpose ones with proper documentation.
>>
>>109026733
>Are there any tools available to get around this?
Pencil and paper.
>>
>>109026733
I think that's the problem.
>>
>>109026733
make an issue on github
>>
>>109026733
>prompt for blurry background
problem solved
>>
>>109026733
give us a way to run the same "FFT filters" on our own images or else bunk
>>
>>109026733
>filtering them into frequency bands using a tool,
How do you do this?
>>
>>109026733
https://i.imgur.com/zvZ3OHr.png
https://i.imgur.com/kjQPZeT.png
Well, its an AI image and seems to have pretty distinct levels of details to the main character, secondary character and background
This is a terrible method anyway
>>109026862
https://www.djmannion.net/img_freq_web/
>>
>>109026862
>>109026863
Just vibecode it
>>
File: Ideogram_0007.jpg (954 KB, 1936x1088)
954 KB JPG
>>
>>109026866
give us your CLAUDE.md .
>>
>>109026733
>One of the reasons why oil paintings are "instantly recognizable" is this
How do we get around this, oilbros? I don't want my paintings to look like pastels.
>>
File: milfeve.png (925 KB, 1024x1024)
925 KB PNG
1hag
>>
>>109026876
Moar spatial frequency!
>>
>>109026865
but is the style of that gen tradart? it looks like its possibly more realistic
>>
File: web_img_freq_export.png (189 KB, 512x512)
189 KB PNG
>>109026876
>>
File: web_img_freq_export.png (157 KB, 512x512)
157 KB PNG
>>109026867
>>
File: lol.jpg (231 KB, 1242x1003)
231 KB JPG
>>109026865
>>
>>109026894
>>109026889
You just gonna sit there pointing it out or are you gonna fix it?
>>
>>109026914
max out orientation width
>>
File: widthmax.jpg (231 KB, 1226x1000)
231 KB JPG
>>109026940
>>
>>109026142
friday night in an english pub
>>
File: booo.jpg (304 KB, 1220x1346)
304 KB JPG
i'm not convinced is fun to test tho
>>
>>109026975
what if i test my hedge clippers on ur balls
>>
>>109026733
>>109026865
I tested these on old images I made back in SD1.5 as well as recent ones in zit and none of this applies at all. I get pretty dynamic "waves" where it dips between black and light gray. Xitterfags being retarded like usual.
>>
>>109026865
Mine wasn't so flat, is this just for anime?
>>
File: Ideogram_0012.jpg (643 KB, 1936x1088)
643 KB JPG
>>
>>109026975
Real life photos were the least dynamic I noticed. I think it just has to do with the contrasts so a well-lit photo looks flat in these.
>>
This manual tagging program thing is sucking the life out of me. I don't even care about LoRA training... why am I so set on making this stupid app
>>
File: Ideogram_0015.jpg (490 KB, 1088x1936)
490 KB JPG
>>
>>109027075
catbox?
>>
Can we wobble or randomize or apply presets or fractalize or whatever operation on the frequency domain and transform back into RGB ?
>>
File: PhotoAnima21_Turbo_Output.png (2.82 MB, 1344x1728)
2.82 MB PNG
PhotoAnima 2.1 Turbo is pretty decent IMO, still needs a bit of fine detail tuning but it's like, much better for realism than something like Chroma Flash while being six gorillion times faster
>>
>>109027072
You're simply autistic
>>
>>109027082
https://files.catbox.moe/91qjg7.jpg
>>
>>109027102
armpit skin wrinkles/folds lookin pretty rough
>>
>>109027139
Oh shit I forgot it was a jpg. Hold on.

https://files.catbox.moe/jofgap.png
>>
>>109027102
Did this guy train on elderly women exclusively to get armpits like that?
>>
>>109027150
Thanks my man!
>>
>Git pull comfyui
>My workflow going backwards, creating infinite process loop.

What the fuck. Any nodes to prevent backflow?
>>
>>109026999
Living the dream.
>>
>>109027182
ask ai
>>
File: qant1c.png (2.04 MB, 1024x1024)
2.04 MB PNG
>>
>>109027160
>>109027144
Funny thing about diffusion models is details are basically synonymous with old people and ugly people. Details = skin imperfections, acne, wrinkles.
>>
>>109027144
yeah, I'm still fiddling with samplers and schedulers. Different ones give pretty different results even on the same seed and prompt, e.g. picrel only the sampler and scheduler changed, nothing else
>>
>>109027217
based, that means i am highly detailed
>>
How to get neo forge prompt from a png in comfy ui?
>>
>>109027182
what do you mean backwards?
>>
>>109027102
hmmm, getting somewhere now. This one is res_2s / bong_tangent with "bong math" turned off, whereas the one I'm replying to was with it turned on. I think off seems better.
>>
>>109027247
ok, found the sweet spot I think. res_2s / bong_tangent (and bong math turned off) but with 10 steps gen + 10 steps high-res denoise instead of 8 + 8.
>>
>>109027075
this sold me on ideogram, something like this is still impossible with anima (which is fine, its a 2b param model). if only big russ didn't blow his load, maybe he should increase the api taxes some more
>>
>>109027357
he gets money from use so I think it's safe to say he's pretty much fucked
>>
File: 1753695063909701.png (2.16 MB, 1448x1086)
2.16 MB PNG
>>
>>109027365
dr. he, it is time
>>
File: debo_s_fia_00022_.png (1.91 MB, 1792x977)
1.91 MB PNG
>>
>>109027240

Traced the issues back to custom node ComfyUI Layerstyle Purge Vram activating again in upstream process. Broken after git pull. Fuck.
>>
File: wtf.jpg (886 KB, 4556x1132)
886 KB JPG
Trying to train my own LoRA for generating ballbusting fetish videos
>>
>>109027357
what does this even mean
>>
>>109027459
big russ is probably tdrussle. The guy who trained anima and the guy who also owns diffusion pipe WHICH SHOULD HAVE HAD LTX 2.3 MONTHS AGO IF YOURE READING THIS.
>>
whatever happened to those "as a 80s Dark Fantasy Film" videos
>>
>>109027482
I think the new trend is "As a wigger"
>>
>>109027469
nobody uses diffusion pipe lol
>>
>>109027494
I-i do...
>>
File: Ideogram__00313_.png (3.19 MB, 1376x1824)
3.19 MB PNG
What's the license like on ideogram? Still no ideogram lora section on civitai
>>
File: he git pulled.jpg (45 KB, 1974x118)
45 KB JPG
>>109027240
>>109027182

Found why it goes "backwards". The flow cache reset when CPU run max out. Which means your flow upstream get lost and run again. Fucking comfy. May a thousand fleas infest his couch. Going how to disable this bullshit, CPU spikes all the time.
>>
>>109027502
Do the prompt again but have a bbox for each breast being massive resting on the table completely at the mercy of gravity.
>>
File: debo_s_fia_00025_.png (2.14 MB, 1792x977)
2.14 MB PNG
>>
>>109027507
bro the project is a bloated mess of tech debt. cumfart is dying
>>
File: ComfyUI_00016_.png (2.16 MB, 2000x2000)
2.16 MB PNG
>>
how do i make money from training loras (sdxl or anima)
>>
>>109027514
you ui do you use?
>>
>>109027514
weird I'm always on the latest git and never have any issues
maybe stop using shitty custom nodes
>>
>>109027518
by selling them
>>
>>109027102
>>109027247
>>109027345
Why not just use Klein?
>>
>>109027518
>Invest in Nvidia
>Enjoy aislop, share LoRas freely
>Make other become addicted to AI cocaine
>???
>Profit
>>
>>109027520
sdcpp is the future. fuck everything python

>>109027527
you always say this but never have an image to show it works. curious
>>
>>109027510
whats bbox for left breast
>>
>>109026733
>One of the reasons why AI-generated images are "instantly recognizable"
What are the other reasons?
>>
File: cute.png (826 KB, 2213x1260)
826 KB PNG
>>109027557
>>
File: debo_s_fia_00028_.png (1.64 MB, 1792x977)
1.64 MB PNG
>>
>>109027502
https://files.catbox.moe/c39f7d.jpg
Whoops
>>
File: Ideogram__00318_.png (3.18 MB, 1376x1824)
3.18 MB PNG
>>109027510
https://litter.catbox.moe/j3sd3tnlhg8pze2u.png
Came out nsfw despite not prompting for nipples. Funny how when you sneak past the safety filter it's a very horny model
>>109027582
Grotesque
>>
>>109027542
I do, but Anima has the inherent NSFW support already so it serves a purpose too
>>
File: Ideogram_0036.jpg (647 KB, 1616x1616)
647 KB JPG
>>109027585
I want them to ooze over the table...

But yeah the model is full of filth. I need to dedicate an afternoon and train a LoRA for it.
>>
which realism model is best if i wanna turn illustrious or anima gens into realistic with i2i? one that can handle nude/non nude well.
>>
>>109027591
>inherent NSFW support
what good is inherent nsfw support if you're interested in realism and the model is not trained for it?
>>
>>109027579
why do you use comfy when it's bloat for your use case?
>>
>>109027606
I actually do a lot more with it than simple t2i with flux. Nothing else comes close when it comes to WAN/LTX i2v. Or actual i2i editing.
Why do you lie about ComfyUI here just to shill some crap UI you probably don't even use?
>>
File: debo_s_fia_00030_.png (2.25 MB, 1792x977)
2.25 MB PNG
>>
>>109027585
the detail is so shit in this thing for using the Flux.2 VAE DESU
>>
>>109027602
what? the concepts still work in this realism finetune, that's the point
>>
What does ideogram default to when you just prompt for "woman"?
Chinese models are usually plastic asian bimbos, Flux is generic AI face buttchin white women and LTX 2.3 is Indians
>>
>>109027663
Sometimes I like to say inane and totally wrong things for a reaction too.
>>
>>109027672
>ideogram default to when you just prompt for "woman"?

Content blocked by safety filter.
>>
>>109027676
kek
>>
Does sdcpp have feature parity with Comfy?
>>
File: 123.png (2.31 MB, 1536x1024)
2.31 MB PNG
>>109027628
nice prompt
>>
>>109027585
I feel like for tits this size the nipples are kind of unrealistic.
>>
File: debo_s_fia_00035_.png (1.73 MB, 1792x977)
1.73 MB PNG
>>109027697
:)
>>
>>109027695
it has more features like not crashing or never having to download a pytorch wheel ever again
>>
>>109027625
you can do all that without gay retarded python shit yaknow. you get nothing out of a workflow and it shows
>>
>>109027625
the saddest part is he makes those posts for free
>>
>>109027625
>Why do you lie about ComfyUI
where was the lie?
>>
>>109027715
like how you self reply?
>>
>>109027709
>>109027713
>>109027720
>>109027723
Why did you make four seperate replies instead of combining them into one?
>>
there's jeets in here filtered by python software?
>>
File: ComfyUI_00056_.png (1.75 MB, 1024x1024)
1.75 MB PNG
>>
>>109027730
you mean jeets here supporting python right?
>>
>>109027763
this takes me back to the day when sd 1.4 was released as a backend package with a basic example script, and i was the only person in the thread who was posting generations since /g/ is full of jeets that didn't know how set up a venv
>>
File: debo_s_fia_00040_.png (2.08 MB, 1792x977)
2.08 MB PNG
>>109027751
>agi is just a human
deep
>>
Interesting
https://www.reddit.com/r/StableDiffusion/comments/1u2eb48/ideogram_4_character_reference_workflow/
>>
>>109027805
Tried it. Not so good. OP's image is cherrypicked because everything else looks like the temu version of the subject. We already have better models for this.
>>
>>109027814
That's a shame. Ideogram would be amazing if it had edit capabilities
>>
>>109027822
i thought it did. i checked the website and it seems to showcase that ability. i'm assuming all the UI scripts people are using are not fully implementing the model at the moment
>>
>>109027785
and yet none of you can into C. welcome to the codelet side
>>
>>109027834
NTA but that's how I read it. The method above was hacky anyway.
>>
File: ideogram4_00028_.png (2.63 MB, 1184x1776)
2.63 MB PNG
>>
Imagine generating in 2026 and not using sage attention 3.
>>
>redditors posting anima hate
lol
>>
>>109027848
absolutely demented woman in the back
>>
>>109027849
imagine still generating with poothon in 2026
>>
File: debo_s_fia_00045_.png (2.09 MB, 1792x977)
2.09 MB PNG
>>
>>109027849
give me the ampere version saar
>>
>>109027856
Your gens still look shit, after all this time
>>
>>109027864
i think its cute
>>
File: debo_s_fia_00048_.png (2.06 MB, 1792x977)
2.06 MB PNG
>>109027864
you can say my gens are bad
but you can never post a better gen
even after all these years
>>
>>109027876
:]
>>
File: referenceimage.jpg (170 KB, 1383x873)
170 KB JPG
>>109027834
Yeah it shows reference inputs right on their main page. Would be nice if the devs themselves explained something but they're probably keeping quiet because they hate coomers
>>
>>109027889
Also why has no one created transparent images yet?
>>
>>109027883
I just don't use avatar :)
>>
>>109027889
You think they panicked when they realized how easy their safety filter was got around?
>>
>>109027900
you're the troon that seethed when i said your laura lora was not good
keeeeeeeeeeeeeeeeeeeeeeeeek
>>
>>109027858
idk, my ram is being taken up running a single illust prompt, i can't load up my 5b parameter llm to ask it what that means
>>
>>109027902
It would be really dumb of them to not notice since it only takes a few bboxes to bypass it.
>>
>>109027849
I can use sage 3 and i still have no idea what it's good for.
>>
>>109027907
you need blackwell to use it
>>
>>109027902
No. I think they did the bare minimum to cover their asses and are legally now able to claim any trouble caused by their model is in violation of their license.
>>
File: Ideogram__00338_.jpg (910 KB, 1376x1824)
910 KB JPG
>come with us white boi, we're saving america
>>
is ass cleavage a real term
what the fuck is it called that nicki minaj has in the anaconda video
>>
>>109027894
iirc someone tried and it did fake transparency (generated the checkered background).
Looks like image edit and image segmentation are api only, no idea if they're coming later or simply too powerful for us mere mortals
>>
>>109027929
it will come. i have patriots on the inside. trust the plan
>>
File: Ideogram_0059.jpg (320 KB, 1088x1456)
320 KB JPG
Oops got the bboxes wrong
>>
File: Ideogram_0061.jpg (247 KB, 1088x1456)
247 KB JPG
>>
>>109027928
Haven't and won't watch that video but I assume you are referring to the ass crack?
>>
>>109028009
>>109028009
>>109028009
>>
>>109028004
yeh, its ok i know you have seen it and know what im talking about
>>
>>109027129
I'm not, as it happens



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.