[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: collage.jpg (3.02 MB, 3506x4400)
3.02 MB JPG
Discussion and Development of Local Image, Video, and Music Models

Previous: >>109063941

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
first :)
>>
Reminder it doesn't take a year and a half to make a working imagegen frontend and that you too can have more features than that cobbled together mess without much effort.
Don't ever flex your so called contributions again you ankle biting sperg and stop insulting devs that give us shit FOR FREE.
>>
>>109068617
oh no... I'll verify the version number when I get home, because I see that checkpoint name too. I cloned it off a repo I was directed to for manual download, I thought it was up to date considering its repo had things updated within the hour of cloning it. Surely I can just update it, right? Or do I even want to update?
>>
>>109069227
Meds dude
>>
>>109069282
Go back two threads if you want to see someone that needs meds. Sorry that I'm being honest about someone that uses a low skill dead project as a reason for why they should be listened to while they harass devs endlessly
>>
>>109069305
Meds dude, everyone, from the most capable to the most schizo can have whatever opinion they want in this anons, public, and free discussion thread
>>
File: singing_cat.jpg (464 KB, 1104x1472)
464 KB JPG
>>
can someone share an ideogram workflow with bboxes and shit?
>>
>>109069333
An opinion is fine but actively spreading misinformation and attacking people disagreeing with you is something entirely different is it not?
>>
File: shut up bitch.gif (2.28 MB, 404x378)
2.28 MB GIF
>>109069227
>>109069282
>>109069305
>>109069333
>>109069361
>>
>>109069231
updating is mandatory if you want to use the latest models, such as ideogram and scail2.
its generally good practice to stay up to date anyway.
and dont listen to people who say their comfy got bricked after pulling, they're either trolling or were using an outdated (inefficient) workflow
>>
File: anima1_00001_.jpg (416 KB, 1152x1576)
416 KB JPG
Shitmix marches on, eyes almost fixed
>>
>>109069231
>Surely I can just update it, right?
Yeah, every model uses it's own workflow, so you can just grab a basic one for whatever model you want from the Comfy menu and go with something else.
>>
>>109069227
Where's the repo?
>>
>>109069438
No need for one this isn't a hard thing to do. I'm just highlighting how that crab acted like a simple task was some great feat that was going to change the local landscape.
When you see how easy it is to make what he made you fully understand why his project is dead and forgotten.
>>
>>109069458
Okay but why post it here and gloat if you're not going to share? Where's the repo?
>>
>>109069465
To prove a point, it's just a wrapper that makes api calls this is a trivial task for anyone with hardware to run AI. Even more someone that was a so called professional in the space to begin with. Many anons on /lmg/ have already built their own frontends using local tools only this is basic shit.
>>
File: cat_hole.jpg (265 KB, 896x1600)
265 KB JPG
It sure is hard to prompt Rabbit Hole reference with text only.
>>
>>109069404
>>109069395
thanks. Biggest reason I procrastinated to learn Comfy last time was getting acclimated to these workflows and UI, but now I am ready.
>>
File: file.png (420 KB, 909x907)
420 KB PNG
Do /ldg/ save their own creations in a folder if they come up good? how do you organize it?
>>
So does Ideogram have edit capabilities or not?
>>
>>109069562
i'm keeping a bunch of gens, yes.

no organization in particular. one time I found something by mass tagging via AI model but this is still becoming better and it's ending up as a hydrus database (you can have multiple there) that almost certainly will get dropped for better tagging eventually.
>>
>>109069600
it does not but people started hacking up the capability, e.g.

https://huggingface.co/BitPoet/Ideogram4-Inpaint-LoRA/tree/main
>>
>>109069600
API version does, not the local model. I tested someone's workflow for character references and it barely works. Only gets the most superficial details about the character but doesn't look as accurate as just using klein
>>
>>109069562
I coded my own personal image viewer / collage generator with ranking system and whatnot for goon sessions
>>
File: file.mp4 (1.66 MB, 576x1056)
1.66 MB
1.66 MB MP4
>>109069490
we can only win 1girl step by step, I suppose
>>
>>109069803
Based /g/oonmaxxer
>>
>>109069803
what is the ranking like? https://huggingface.co/Qwen/Qwen-Image-Bench or something older?
>>
>>109069562
I save everything by default and then delete after the session. If I did the opposite and decided what to save after each generation then that takes too much time.
>>
>>109069624
That's trained on like 70 image pairs?
You need orders of magnitude more to properly hack in edit capability.
Not that I am trying to ankle bite people doing cool shit.
Specialized edit loras like "remove her clothes" are more doable on smaller datasets.
It would be cool if some autist somehow pulls off hacking klein tier edit capability into a model though.
>>
>>109069892
I mean manual ranking them via UI so i can filter out, group in views, random shuffle only the best ones in a collage on click etc
>>
>spent $20 to train ZIT LoRA of oneitis
>it looks like shit
>>
>>109069562
Comfy output folder.
Images are saved as model name followed by number.
I copy more kino gens among them to a separate AI folder in Pictures. Sometimes I rename them more descriptively, sometimes not.
>>
>>109069922
>Spending $20 to train a ZIT lora
Wtf, did you get scammed retard? Should be doable for a dollar or less, even if you need to hire a cloud GPU.
>>
>>109069395
as of the current version, every single comfy custom node works as well, so anyone complaining about borked updates doesnt know what theyre doing. they're probably updating their python dependencies, which absolutely will fuck your shit up.
>>
File: 1777217182966853.png (218 KB, 529x472)
218 KB PNG
>>109069937
I'm richie rich
>>
i think its time for me to abandon SDXL for Anima. What makes Anima better outside of using a natural language?
>>
File: 1761999132577051.gif (7 KB, 128x128)
7 KB GIF
give me a sexy prompt to work with klein 9b t2i
Just downloaded it
>>
>>109069982
better te, better vae, modern flow model resulting in:
>better details
>better text
>better backgrounds
>better style knowledge
>>
>>109069692
>API version does, not the local model.
What the fuck
>>
>>109070015
>>better backgrounds
>mfw
agree with the rest
>>
>>109070005
Get the snofs lora and just describe the action.
>>
>>109070029
The backgrounds are definitely one of the weakest parts of the model but they are still better than SDXL imo.
>>
>>109070031
https://huggingface.co/forum50622/SNOFS
this?
>>
>>109070059
That's Qwen. It says so on the filename.
https://civitai.red/models/1972981/sex-nudes-other-fun-stuff-snofs?modelVersionId=2818111
>>
File: anima1_00084_.jpg (493 KB, 1152x1576)
493 KB JPG
>>109069922
Should not be possible to fail zit lora training
>>
>>109069922
i'll do it better for $30
>>
so anima 1.0 still has artifacts and general issues on highres genning/upscaling particularly in portrait formats. the official highres lora does the best job mitigating these but is also a net negative for overall quality when loaded onto 1.0 imo
>>
>>109069982
unless you're making "an porn" low IQ prompts, natural language is mostly a gimmick

better character props, knows more characters, better character identity that applies better to multiple characters(still not exactly reliable though), and pretty much absolutely nothing else, replete with a total absence of loras and quality checkpoints
until the latter is addressed first and foremost sdxl will continue to outperform anima in 90% of cases
>>
>>109070224
and by high res i typically mean 3mpix and up. and generally i see best base gen results at 1.56 mpix (1.25x scale)
>>
File: anima1_00094_.jpg (448 KB, 1576x1152)
448 KB JPG
>>
File: 1061129676464399.png (1.77 MB, 1472x1152)
1.77 MB PNG
>>109069562
Yes, just sort by date. Has about ~1600 files after ~4 years.
>>
File: anima1_00115_.jpg (456 KB, 1576x1152)
456 KB JPG
>>
>>109070296
"flustered" and "angry" have significant, if not close to absolute, visual overlap in many cases, this is a very bad prompt on any model
a better example would have been something like telling it to put one character on the left and one character on the right, which anima can do more reliably, except using a regional prompting workflow would produce even better results and be more token efficient, hence gimmick
it's not actually a gimmick, it does improve these types of prompts, but it's not a massive leap forward
>>
>>109070248
Boomer prompts are the future, deal with it chud.
>>
>>109070462
Bounding boxes are, actually. And we'll go back to tag prompting because it's more digestible in JSON formatting when the location of each element is already determined.
>>
File: flux-2-klein-9b-kv-fp8.png (2.43 MB, 1184x864)
2.43 MB PNG
I'm trying flux-2-klein-9b-kv-fp8, gens look good in the preview but get completely fucked up in the end, pic related. Any ideas?
>>
>>109070507
>collage made of porn thumbnails
shiggydiggy
>>
I used to train with 1024p cosine but now I want to experiment with something like this to see if I can get a decent speed boost without hurting quality too much.
First I will train on 512p once, 768p thrice, 1024p once per epoch mixed res with constant lr. I will stop when the lora starts to get fried.
I will choose an epoch before the lora starts to get fried.
Then after loading its weights, I will do a shorter 1024p only cosine run to finish the lora off.
Does "--network_weights" options in sd-scripts work good enough for this? Anyone here has experience training loras in a similar way?
>>
File: preview.png (814 KB, 480x720)
814 KB PNG
>>
File: debo_ccg_fia_00022_.png (1.32 MB, 1792x977)
1.32 MB PNG
>>
>>109070507
Wrong vae probably
>>
>>109070488
>Anima assumes that flustered mean blushing.
it isn't: the overwhelming majority of asuka images will depict her as angry in the way you envision, and the overwhelming majority of shinji images will depict shinji as flustered in the way you envision, on top of the fact that "asuka pointing at shinji" is another concept that will be heavily reinforced by these characters' training data

if you make asuka point at shinji and have shinji be the one that's angry and asuka be the one that's flustered and scared, it will start failing and going in bizarre directions in an "sdxl-like" manner

this is what i meant by "low IQ an porn," anima does extremely common stuff well, better than base sdxl models(which are a waste of disk space at this point), but it's not really a significant improvement for prompts longer than one paragraph where you need to have control over backgrounds, character framing, positions and angles; it is still "better," but it's not better, and sdxl has stuff already made for it that makes it better
>>
Niggly wiggly!
>>
Why are they seething about anima?
>>
File: 104573489467735.png (1.67 MB, 1024x1728)
1.67 MB PNG
>>
>>109070529
Yeah this should work. I've trained several loras with hard restarts. Started with 512, as much batch size as possible, stop and start with 768, smaller batch, and finally with 1024
>>
>>109070602
i said asuka pointing at shinji, not shinji pointing at asuka
i didn't say it was impossible, i said that prompt retention would be much weaker and more in line with other models
>>
>>109070689
Faggot just shut the fuck up, you're reaching for straws at this point and promptlet retards like you need stop talking so much
>>
>>109070696
>does the opposite what i said
>faggot you're grasping at straws
kill yourself
>>
https://files.catbox.moe/djofxs.mp4
scail 2 nsfw

Idk, maybe there is potential.
>>
>>109070715
It's funny how people made a fuss about deepfakes years ago and now you can literally do this on your own computer
>>
>>109070534
Fuck off thread schizo
>>
>>109070711
FAGGOT STOP CRYING YOU'RE UNABLE TO POST ANYTHING AND JUST SIT HERE WHINING LIKE A FUCKING BITCH
I'm not even the anon you're arguing with I'm fucking tired of your autistic low IQ whining rituals over anima that you do daily.
>>
>>109070728
i'm similarly tired of your autistic low IQ worship rituals over anima because it's the only model forgiving enough to let you generate a basic bitch stock image that you've already seen, that already exists, because you're a braindead golem that can't see the apple
>>
necessary:
https://rentry.org/LDG_vital_info
>>
>>109070744
YOU ARE A FUCKING LOSER YOU WAKE UP HERE TO POST EVERYDAY BITCHING AND MOANING ONLY TO GET PROVEN WRONG JUST TO WAKE UP TO DO THE SAME FUCKING THING AGAIN JUST GO BACK TO YOUR CONTAINMENT BOARD YOU WORTHLESS SACK OF SHIT
>>109070748
Also you with the cope rentry faggots like you cry about gatekeeping but not gatekeeping allows faggots like the malware spreader and anima whiner to stay in this general
>>
>>109070737
He will never stop remember how he embarrassed himself yesterday he had every anon tell him to fuck off and he kept at it for hours, these faggots need a fucking job
>>
>>109070568
>Wrong vae probably
FluxKVCache seems broken, skipping makes it work
>>
>>109070715
saar workflow pls
>>
File: debo_ccg_fia_00021_.png (2.13 MB, 1792x977)
2.13 MB PNG
>>109070727
>>
How are the local models doing? Are we still stuck in pony hell?
>>
>>109070897
anon its 2026 not 2023
>>
>>109070900
Yeah, but last time I checked civitai, all the models where based on pony or illustrious (which is just as shit)
>>
>>109070875
Why are you here is your dead piece of shit thread only being kept alive by your for vanity?
You're so buck broken you'll post here after years of attacking and screeching over anons not wanting you here?
>>109070897
Models are much better now look how much lifting they are doing for OG schizo that can't even do basic prompting. Inpainting is also in a amazing spot thanks to klien and other models that can make complex task easier. Big things are happening and the next play in my opinion is working with llms in tandem
>>
>>109070912
anima/waiNSFW(illustrious) for 2d/anime
klein 9b for editing images
chroma/zit for nsfw realism
wan22/ltx2.3 for nsfw video

this is the current local meta
>>
File: tinker_chan_0.mp4 (3.47 MB, 512x896)
3.47 MB
3.47 MB MP4
>>109070897
Pony was like 3 models back. We're on Anima now.

New thread, new scailslop.
>>
>>109070912
>civitai is full of jeets stuck in 2024
yes this is not new information
>>109070936
you dont have to keep recommending WAI its okay anon you can just say anima
>>
>>109070932
Don't expect a reply from him anon, he's just a human gnat basically and will do the same thing again and again and again
>>
>>109070914
hooking your local text and image models together is a lazy afternoon of prompting away, it's already here.
>>
>>109070968
agreed
>>
>>109070744
>anima is forgiving
Huh? The whole rhetoric from XL sloppers up to this point has been that the illust shitmixes are actually easier to use kek. Why has this argument suddenly changed?
>>
>>109071016
it's pure spite, all this faggot has is spite and it's clear. There's a small clique of faggots that have been driven out from this general multiple times and they are reduced to bitter gnats that just lash out spreading misinformation non stop.
They have /sdg/ for a reason but realize nobody cares what goes on in that shit heap which why multiple members of that clique post here shitting things up.
>>
File: forgeneopostneo.jpg (258 KB, 2376x1782)
258 KB JPG
show me your vibevcoded AI slop for working with Ideogram
this is my hotkey based workflow, with comfyui/llama.cpp backend
>>
>>109071063
>black guy
>>
>>109071049
This is the most unemployed post I've ever seen in my life.
>>
>>109071063
Can you show the gen though? Looks super complicated but I haven't explored id4 yet
>>
>>109071107
>projecting
You know we can just go back 2 to 3 thread right?
You know this same exact method is why you and your camp has lost all reputation and power in this space right?
But go on keep failing anyway and blaming anons and devs that have accomplished more than any of you have despite desperately wanting recognition and attention.
The other anon is still waiting for your xl gen btw
>>
File: ComfyUI_00260_.jpg (452 KB, 1280x1280)
452 KB JPG
>>109071083
Sarr please we like the bbc
>>
>>109070669
Like when you load network weights and start training again with them, it's building on the information from the previous training run, right? I was worried this kind of "hard restart" might disrupt training too much, I hope it works as well as you say.
Also do you know if warmup is necessary after loading previous weights? I think I will have some warmup steps on the second run regardless.
>>
>>109070845
>>109070845
https://files.catbox.moe/d775nc.json
>>
>>109071191
thanks
>>
File: 564122705498760.png (1.53 MB, 1088x1600)
1.53 MB PNG
>>
>>109071191
>>109071212
be aware that the original >>109070715 gen was 480 × 832

used this video as reference
https://www.erome.com/a/LFpXkYM3
>>
>>109069227
>>109069305
>>109069361
Pretty transparent move. This general owes its existence here and everywhere else to Ani, yet you are taking shots at him, but in reality you are chasing yesterday's thread engagement hoping another anons joins in to keep this thread and Anima discussion breathing.
It won't work, happy slow thread, worthless bitch.
>>
trying way too hard with the bait, only a special dent could be up to this
>>
>>109071256
0/10
>>
>>109071256
>Ani, yet you are taking shots at him
I give him backshots
>>
>>109071256
we know
most of us here just filter the trolls who try to ruin the thread with their petty drama
>>
>>109071344
not to mention people have already been charged and convicted specifically for AI stuff they made
>>
Its pretty wild that they closed up the "no real victims" loophole.... How did they do that?
>>
>This general owes its existence here and everywhere else to Ani
Source?
>>
>>109071389
it wasn't just the AI stuff, there were real people involved
>>
>>109071418
It used to be this way but at least some states have changed the wording to allow them to prosecute just AI. Not sure where it stands federally thoughever.
>>
>>109071413
You're falling for wheelchair boy's bait. Don't listen to it and ignore the other discussion that's taking place that is only made to make this general look bad.
On a brighter note they are losing biggly and have been for years now
>>
>>109071418
no, there are many, many cases that involved no real people. if you're so confident, go ahead and post that stuff on Twitter and see just how quickly the cops will show up at your door.
>>
>>109071418
https://enoughabuse.org/get-vocal/laws-by-state/
https://law.washu.edu/ai-policy-and-regulation-resources/explicit-images/

These laws will be ubiquitous pretty much everywhere soon. Do not pretend you're safe because nobody was harmed.
>>
>>109071437
>no, there are many, many cases that involved no real people
link?
>>
Why are we having a bluesky discussion on 4chan take your nounce shit elsewhere.
>>
>>109071451
4chan is the one place you can freely talk about it you fucking normie. this isnt your reddit safebox.
>>
>>109071463
Either go to /b/ or go back to your home over a bluesky troon
>>
File: pricked.jpg (129 KB, 480x720)
129 KB JPG
>>109071089
atm mines tool calling for random words and making me artsy slop on repeat since i could not be bothered to think today.
but also, because they're vision and edit models, it can get a lot fancier and have them interact with the lm proofing the results and iterating on its own, and they'll use references for scenes and chars and accept crude ms paint scribbles and vague direction and make things work. shit's wild now.
>>
Just don't upload your shit anywhere. Read any of these cases and you'll see they were caught because they uploaded pics to their cloud storage account or drop box. Just don't be retarded.
>>
>no links
yeah, I thought so
>>
>>109071489
>expects me to google cp cases just to prove a point
im already probably on a list. i dont need google searches of a bunch of cases making me look even more sus
fuck off
>>
>>109071502
>nothing
yeah, I thought so
>>
This conversation reeks of a Canadian fail dev schizo
>>
For those who run dedicated LLMS along side image models, do you actually have enough computer to have both loaded at the same time without much slowdown or are you loading and unloading them every time you want to switch?
>>
>>109071489
https://www.nbcnews.com/tech/security/ai-child-exploitation-crisis-rcna259409

>Pfefferkorn said she has reviewed over 60 state and federal AI-related CSAM cases and found that obscenity charges have largely been used in cases that do not feature real children. However, she said that most people found to possess AI-generated CSAM possess real CSAM as well. “You can nail them to the wall for that.”
>>
>>109071521
more like glowies
>>
File: uniimom - Tifa Lockhart.jpg (79 KB, 832x1216)
79 KB JPG
>>109069190
>@UniiMom, uniimom \(Artist\), tifa lockhart, suspender skirt, low-tied long hair, white tank top, cleavage, shiny skin, wet skin, light smile

https://civitai.com/models/2704672/uniimom-illustrious-or-ponyxl-or-sd15
>>
>>109071556
buy an ad, loser
>>
NOBODY ASKED YOU FUCKING SCHIZO TAKE THIS SHIT TO /B/ OR BLUESKY
STOP GLOWING BRIGHTER THAN A FUCKING SUPERNOVA
>>
>>109071566
Kewl. I only have 12 though, maybe I can find a quant and have them both running at the same time. I think.
>>
>imaginary boogiemen and endless rambling
Seems like someone is having their monthly psychothic/manic episode. This happens every couple of weeks or so.
>>
>>109071153
I'm not sure how it actually works, but it did. I used warmup steps, I had it set up to 5% steps were warmup. I think I used CAME 8bit + rex for it
>>
>>109071522
gemma, yeah, it slacks off a lot and cuts corners by default. you just gotta throw more hand holding in the sysprompt, it's not a job they trained it for after all.
>>
File: 1508432108.png (2.27 MB, 1536x768)
2.27 MB PNG
schizo hours
>>
>ai-toolkit-perceptual
Does this not fully work on Windows? I can get to the UI without errors, but then nothing works. All the preflights flash a window and tell me they can't find shit and trying to train without that stuff (dunno why you'd want that) doesn't work either.

I could never install AI Toolkit without an installer either, but that would just flat-out refuse to run anything.
>>
https://files.catbox.moe/2cvlhb.mp4
396 secs to gen
added a cowgirl lora to help a little bit, but it seem to lose a little bit of the likeness too
>>
>>109071764
Pretty smooth. Love the headless horseman
>>
>>109071764
I don't really see the point of this type of cheating. If finding a real porn vid is necessary, it's better to just jerk off to the actual porn video. It's just scope for not having a local Seedance 2 tier model.
>>
>>109071784
>scope
cope*
>>
>>109071784
NTA but it does increase the quality and lets you control the girl to your specific tastes. This is effectively porn customization.
>>
>>109071784
It's not often I find myself feeling this disconnected to another human. I wish you could feel the surge of hormones I feel when face-swapping sometimes. It's a rush completely out of the ordinary.
>>
>>109071795
But it's lame. I much prefer giving it a picture and the AI figures out the rest, no cheating, but only Seedance 2 is good at that.
>>
>>109071764
the headless whoresman
>>
>>109071784
Why did they cheat us with animated dragon Smaug when we could have just watched Cumberbatch crawling on the floor?
>>
>>109071804
this is what I like too but I cant give a 12-way anal bukkake scene to Seedance
>>
>>109071784
Depends on how you can use the original footage tho, I changed a kissing video with two different characters and it worked, is not replacing really but what you can really do with the motion footage, I read somewhere that any video with movement works with SCAIL
>>
>>109071607
Alright thanks for the info anon.
>>
File: ill2.jpg (397 KB, 2048x2048)
397 KB JPG
>>109070932
first of all that style sucks, that's not megumin
it looks nothing like the actual character from the anime, and also looks nothing like the mishima kurone illustrations, it's just generic pixivcore in a megumin cosplay, this is important
aesthetically, it also sucks, people excuse it because it isn't tuned, that doesn't mean shit to me, i'm not in this because i like the stock, anime is the chainlink of t2i right now
yes, the prompt adherence is very impressive and i hope someone can improve the aesthetics so that it actually becomes usable, but this result sucks
>>109071016
why are you so absolutely hell-bent on gaslighting yourself into believing that only one person would prefer another model over anima? you sound deranged
the fact that third party checkpoints are required in the first place is already enough to assert that anima is the easiest to use
>>
>>109071732
I couldn't get it to work on Linux neither. I didn't use the UI but the CLI. Was getting OOMs regardless of what I do.
The concept looks like it has potential but I am waiting until something else besides ai-toolkit supports it.
>>
>>109071784
>train 3D animation model on porn animations
>generate 3D sex on basic placeholder models
>transfer to SCAIL2 to animate actual characters
There now you have infinite porn.
>>
>>109071807
>>109071777
he gained his head back
https://files.catbox.moe/s5xaxp.mp4
>>
File: SMMX_avatar_0002.png (41 KB, 200x250)
41 KB PNG
>>109071948
I actually liked the headless one better.
Or better yet, can the man be Max?
>>
File: debo_ccg_fia_00028_.png (1.73 MB, 1792x977)
1.73 MB PNG
>>
go back
>>
>>109071992
Him posting here is basically kissing the ring because he has nothing else. He's lonely
>>
I appreciate him posting the news to be desu
>>
>>109071935
I managed to LLM my way through it with a reinstall; had to force pip to use binaries (because it kept trying to compile wheels) and change scipy in requirements.txt to 1.17.1. Everything else but those two changes completely mangled the install.

God I hate Python...
>>
>>109072049
Eat shit
>>
Everybody is a critic.
>>
>>109071784
>I don't really see the point of this type of cheating. If finding a real porn vid is necessary, it's better to just jerk off to the actual porn video.
None of the cutest girls from my collection of elementary school yearbooks I have been collecting from thrift shops and eBay from different European countries over the last 5 years have done porn at the ages I care about.
>>
File: 1754545554578308.jpg (656 KB, 2880x1440)
656 KB JPG
>>109072078
No
>>
File: ill3.jpg (448 KB, 2048x2048)
448 KB JPG
>>109072047
>infinitely less resembling of Mishima Kurone's style?
i didn't go for mishima kurone style, i didn't use any loras, it used the anime style for the megumin character, which it nailed perfectly, much to your apparent chagrin
>Your gen has worse style
no it doesn't
>worse details
lol
>worse prompt adherence.
yes, but not pictured here
with the obvious exception that i didn't remove the panties to keep it board legal
>But the fact that you're admitting that prompt adherence of Anima is better is a big step forward.
i never denied it, retard
>>
>>109071784
its showcasing that they can replace the woman with a child (or record themselves to swap with the child, or swap it with themselves, who knows)
>>
>>109072047
I told you he was wasting your time, look how long it took for him to make that piece of shit.
>>109072101
Can your caretaker take away your computer access?
Seriously you're a burden on us all
>>
>>109072073
delulu
>>
Now show me three characters
>>
>>109072095
Faggot
>>
File: 993982.png (3.13 MB, 1216x1280)
3.13 MB PNG
>>
what happened to tdrussell? did he abandon local?
>>
>>109072148
let him rest he literally created local anime SOTA
>>
>>109072148
Seeing how mad he made a small circle of do nothing faggots I'm sure he's chilling laughing at them shit and piss their pants.
Mr 38 gave up on his project and is reduced to a bitter mess and an anon in the beginning of the thread basically made his project in days not years basically showing how much of a low IQ fraud he really is.
>>
The period from Qwen 2.1/2.2 to whichever will be the next actually good successor will be the longest time in history of the more important AI fields (text/images/video) where there were no better models for this long.
>>
File: 443.png (2.04 MB, 1216x960)
2.04 MB PNG
>>109072124
>>
>>
>>109072118
https://www.telegraph.co.uk/news/2017/12/06/new-yorks-met-museum-refuses-remove-balthus-painting-despite/
Seethe.
>>
>>109072210
>delulu thinks of himself on par with an artist in 1938 because he pressed a button on his computer
>>
Complete retard here.
If I want to make something that looks like Final Fantasy / Stellar Blade characters (realistic but heavily idealized, that still looks like a 3D model), what model should I be using?
>>
File: 2026-06-16_ideogram_30.jpg (2.65 MB, 1440x2560)
2.65 MB JPG
Finally a model that knows what a swastika is. Geez.
>>
>>109072235
find a lora for what you want
https://civitai.red/
>>
>>109072242
this needs to be reported and have the makers retrain this. unacceptable
>>
>>109072242
>tattoos
>>
>>109072229
>strawman
Concession accepted
>>
>Concession accepted
this is what retarded delulu's say when the lose
>>
File: ill4.jpg (544 KB, 2560x1792)
544 KB JPG
>>109072164
>Anima can replicate Mishima Kurone close enough,
that doesn't look even a little bit similar to any konosuba covers on google, not even remotely, not in colors, not in proportions, not in any details, nowhere, cool though
>Where are ornate windows? Where are bookshelves? Where is a mirror with a figure in it?
i didn't bother fucking prompting all this inane background garbage that apparently you and my boomer dad think is somehow not entirely trivial for any image model to churn out, gonna be honest, here you go
>>
both your guys fags gens look like god awful slop tbdesu
>>
>>109071255
>cowgirl lora

did you use wan21 lora? im trying with the workflow and it works..but the guy seems to generate poorly. he's either missing entirely or only his legs are visible. maybe my reference is bad?
>>
>>109072404
A vision of hell, two retards fighting about their ntrslop.
>>
File: 2026-06-16_ideogram_32r.jpg (2.09 MB, 2160x4096)
2.09 MB JPG
>>109072254
It's just a test, alright.
>>109072252
Agreed. I'm crying in the fetal position right now, rocking back and forth.
>>
>>109069562
i delete everything after session, 100k+ gens and not a single one worth saving
>>
>>109072056
>click a random menu option
>A fatal hardware error has occurred
>Error Type: Cache Hierarchy Error
Hmm, maybe I didn't get it working...
>>
>>109072495
saved by model type/date/seed. thats the most organization I have. eventually i'll put them in a proper program
>>
File: 1778679677125137.png (17 KB, 1386x193)
17 KB PNG
>why does my lora keep coming out completely torched
>oh
>>
>>109072410
https://civitai.red/models/1337157/wan-cowgirl?modelVersionId=1660695

this one
this is no way the intended use of scail lol, I'm just experimenting
>>
File: screenshot.1781648425.jpg (27 KB, 187x345)
27 KB JPG
>>109072571
hah, thats the exact lora im trying right now. the problem im having is the male generates poorly and i dont know why. i can already see in my current preview his head/torso generated, but now his bottom half is missing. the mask looks fine though

for your ref, are you using a full body image of the person, or just a close-up of their face?
>>
>>109072593
a nude full body
>>
File: 2026-06-17_ideogram_01r.jpg (1.91 MB, 4096x2160)
1.91 MB JPG
>>109072447
Bokeh. So hot right now.
>>
File: file.png (1.3 MB, 1024x1536)
1.3 MB PNG
hermes fag here back again for another day of rendering
i fucked up. ive been running grok in hermes and using my 4070 to render comfy ui images. welllll turns out grok isnt unlimited and i ran out of my monthly tokens in 1 day. lesson learned. trying different local llm models that will fit on my 4070. i had a old 1060 6gb laying around that i pointed comfyui to for rendering. obviously a downgrade. still fucking with things but here is the last thing hermes back grok created with anima in comfyui before tokens went kaput 
>>
File: debo_ccg_fia_00030_.png (2.1 MB, 1792x977)
2.1 MB PNG
>>
File: file.png (1.67 MB, 1024x1536)
1.67 MB PNG
>>109072751
one more. i really like this one 
>>
File: 1755508023052888.jpg (554 KB, 1024x1536)
554 KB JPG
>>109072751
Nice pose.
>>
>>109072782
So ugly so low effort
>>
>ai-toolkit OOM's my 16gb
>diffusion-pipe has 2gb to spare
epic for the win
>>
> >109072782
fuck off debo
>>
where did everybody go?
>>
Multi reference PR open for SCAIL-2.
This makes me wish they built it on Wan 2.2 rather than 2.1.
Then it would probably compete with Bernini.
>>
What is the best model for concept art? Like creating character refs and level concepts?

I know WAI is optimized for anime characters, but is there something more tailored for environments? Or something that can create original characters with consistent front, side and rear views?
>>
Guys, where did it all go wrong for Pony?
>>
>>109073252
scail on wan2.2 isnt realistic because it would require consistent training on both high and low models
also it wouldnt provide much benefit since the reference video already supplies motion and the main improvement from wan2.2 is motion quality
>>
>>109073287
probably during the training
>>
>>109073280
Anima and Klein
>>
>>109073280
>original characters with consistent front, side and rear views?
if you're expecting actual consistency(e.g. some object or detail is not exactly the same as it was in the previous image) then the answer is nothing, nothing can do what you want, these are diffusion models, diffusion is a destructive process

>I know WAI is optimized for anime characters
WAI *claims* to be specialized for anime characters in the sense that they built on existing, identifiable anime character sets for their checkpoint, guess what, so did every single other fucking illust/anima checkpoint that exists, when they mean "characters" they mean "our checkpoint maybe sort of has better retention for an asuka langley or a kirino kousaka(it doesn't, really)," not that it will be useful to you and your specific 1girl OC, nothing right now is going to be
the only way to get what i assume you want is to press forward with mind numbing trial and error and sift through literally ten thousand gens until you have enough usable images to create enough data to make your own lora for that character, which will somewhat but definitely not entirely make generating more of that character somewhat easier
as for backgrounds, if a color palette/general style/texture isn't enough for you (e.g. you want persistent landmarks, structures and geography), forget about it
>>
>>109073333
I know I know, but Wan 2.2 has better prompt adherence and motion.
Lowering the pose strength with that would make room for some cool gens.
>>
>>109073405
cont. things like dialed in detailers and/or detailer loras, controlnet shit, ipadapter style pseudoloras and regional prompting/mask inpainting can be useful but all of these come with their own enormous headaches that will at many points make you wonder if it's even worth fucking with any of that shit at all in the first place
tl;dr you aren't going to shit out a VN on a local model in a week, or a month, or probably even a year, unironically learning to draw well enough to guide diffusion models with your own work is probably your best bet
>>
>>109072242
>>109072447
Everything I try to gen is "blocked by safety filter" is there an uncensored model everyone's using?
>>
>>109073567
are you using the bounding box shit? this image has that in metadata, grabbed it a few threads ago
https://files.catbox.moe/w0s0cg.png
>>
>>109073610
raped asscroft
>>
Y'all better not be making pornog with these fancy AI models.
>>
In Gemini, I can feed it a shitty MS Paint drawing and turn it into a good pic.
Can I do that with a local model? Or is that a Gemini only thing?
>>
>>109073674
>Can I do that with a local model?
Yes
>>
File: new.jpg (409 KB, 992x1504)
409 KB JPG
>>109073610
>>
File: obsession.webm (3.87 MB, 512x896)
3.87 MB
3.87 MB WEBM
>>109073616
>not gaped asscroft
Ya blew it.
>>
>>109073768
what the shit
>>
>>109073674
That's jut basic img2img
>>
>>109073674
You've been able to do that locally for years, LONG before it was available on SAAS
>>
>>109073280
I've used Chroma for that, don't know if it's the best tho
>>
>>109073280
>What is the best model for concept art? Like creating character refs and level concepts?
Ideogram gives you by far the best control for things like character refs etc, not sure if it has the styles you need though.

That said Civitai recently added a Ideogram section so there will likely be lots of loras soon.
>>
my loras keep turning out like shit. does anyone wanna share their diffusion-pipe toml?
>>
>>109073280
Klein 9B lets you literally prompt character references with one image. Ignore the schizo rambling about WAI.
>>109073851
It doesn't accept image inputs. Maybe if you're creating a new character directly from the model then sure.
>>
>>109073287
base model choice, not wanting to wait for something new to come out
>>
my god do i want a fucking good new video model
>>
>>109073864
>It doesn't accept image inputs.
Ahh, see what you mean. Well, you could train a lora of your character, but image edit functionality would be easier.
>>
made a lora for this cat, computer scene next
>>
>>109073864
>Klein 9B lets you literally prompt character references with one image.
acceptable for your subnormal monkey brain, maybe
you are like the target audience for terrible 2000s sameface anime porn
>blue hair on a copy pasted samefaced balloon tittted freak
>oh my gosh you guys it's literally ayanami rei just like in the show
that's the limit of your nigger brain, AI is a threat to you and your standing in society and you're going low
>>
>>109073949
Take your meds. Are you Canadian by any chance?
>>
>>109073961
I'm Lithuanian
>>
>>109073927
>didn't edit out the patreon watermarks before training
>>
>>109073988
oops those normally appear up top didn't see them, imagine the image cropped with that out
>>
>>109073961
sorry anon i can't make you see the apple
>>
>>109073993
Too late, it's over.
>>
>>109073963
what's the difference?
>>
>>109074020
I'm white. That's the difference.
>>
>>109073252
Kijai god delivers yet again.
>>
>https://huggingface.co/circlestone-labs/Anima/discussions/194#6a316716058ae35977a43d43
cash money
>>
Can one of the scail anons tell me how to keep the character stylized in the output? It is making my human cartoon characters into realistic humans and it doesn't look good.
>>
>>109074052
Don't worry anon will still lose his mind and sperg out over it being a "broken model".
>>
>>109074052
>TLDR: if you are an individual (not a company), you may sell Derivatives (loras, finetunes, etc) that you create. The intention is to allow small-scale monetization from selling model weights, for example paid commissions to make a lora.

he had to do this since people were right that tranima is a model that would not achieve great heights without actually big tunes, the base model is not that great for end users in making good and detailed compositions OOTB, although it seems like the big tune trainers already complained that anima also catastrophically forgets? is that true?
>>
>>109074058
Replacement mode false. It will animate your image instead. If you want a new background, the new PR SCAIL-2 allows background reference as well.
>>
>>109073818
>>109073777
>>109073677
Damn...
Have I been paying for shit that's supposed to be free? Would my old 3060 be enough?
>>
>>109073927
Is it bad that, as much as I dislike this bitch, after seeing she looks like a fucking slampig IRL I want to fuck her?
Like, I really want to make that disgusting pinopig burn calories through the most depraved fuck I can imagine.
>>
>>109074052
Cool. If that one anon who is training a realism Lora for anima releases his checkpoint instead of tinkering with it forever I'll give him some spare ETH (but he won't)

>>109074089
They didn't have to do anything but this does help get people with delusions of grandeur making a living off of training Loras into the game which isn't something I hate

>>109074099
>Have I been paying for shit that's supposed to be free?
All software should be free, so yes
But Google's models are indeed better and faster, you can cope by imagining you were renting the server hardware from them (still cloud capitalism slavery though)
>Would my old 3060 be enough?
At this point almost certainly. If you're used to Gemini quality it might be useful as a prototype/test before you spend real money
>>
>>109074091
thanks king
>>
>>109074112
>>109074099
Gemini is also super censored and cannot use loras.
So if you wanna make porn or use very specific artstyles Gemini won't do.
>>
>>109074089
lole kys
>>
>>109074112
>All software should be free
>I'll give him some spare ETH
pick a lane bud
>>
>>109074152
>what are donations
>>
>>109074111
I want to fuck the avatar I don't care what they look like irl, it's like expecting voice actors for your anime waifu to also be hot, they mostly are not
>>
>>109074112
>If that one anon who is training a realism Lora for anima releases his checkpoint
It will be released, one merge left
>>
>>109074089
You now have nothing to seethe about and you still haven't made your Apache2 Anima
GGs
>>
>>109074175
Most of the time that's my exact mentality with vtubers.
But for some reason her IRL looks inspire a primal need to fuck. Normally I don't go for the fatties but this one does the trick.
>>
>>109074158
payment
>>
>>109074089
I didn't have to do anything. I added a carve-out for individuals selling loras because a bunch of people emailed me wanting a commercial license for lora commissions on their patreon or whatever. There's no feasible way for me to collect any kind of license fees from a bunch of randos, plus it would never amount to any significant revenue anyway. No reason not to allow it.
>>
>>109074158
Thanks for saving me time because I would have spent longer than 3 words explaining the nuance

>>109074187
>It will be released, one merge left
Can I pay you to do a chroma and dump all your checkpoints though or even the training materials / information if it's not a private/illegal dataset? Teach a man to fish and all that
>>
>>
Remember the faggot that started the seething crusade against anima for it's license is the owner of a dead project where he expects people who make profit using his software (lol) to pay him for his cobbled together vibe coded garbage that uses other peoples shit poorly.
What a fucking lolcow
>>
>>109074091
>new PR SCAIL-2 allows background reference as well
Link to that?
>>
>>109074228
What happened? I'm new here
>>
https://civitai.red/articles/31483/civitai-nodes-for-comfyui-the-whole-toolbox-in-your-graph
>>
>>109074195
the fact that it's a real woman is what makes it bad, simp, real women shouldn't act like anime children while pretending to be blissfully unaware of the unsurvivable existential hellworld that fifty percent of their audience goes through, as they take those people's money
vtubers are unusually charismatic and talented women, that are somehow more evil than regular high value women
>>
>>109074253
time for your dose of outside lil chuddie
>>
>>109074240
When Anima dropped one of our special personalities that earned his own rentry shit his pants in a jealous rage because comfy gave the creator funding to get the project off the ground. He began to seethe over the license of the project and posted on multiple discords including this thread bitching and complaining about it. Like many of his schizophrenic delusions he claimed that he was making a version of anima that had a better license but like everything he claims he would do nothing came of it. Now all we get are unhinged schizo rants about everything wrong with anima and how it's a failure next to XL. It's pathetic and this guy used to work for SAI and since comfy wouldn't give his low skilled ass welfare money by letting him join comfy org he has regressed to a bitter jobless schizo with no feats despite claiming all of his dead unused projects are the reason why he should be a figure of authority.
>>
>>109074234
https://github.com/Comfy-Org/ComfyUI/pull/14509
>>
How does Ideogram compare to ZiT and Klein?
>>
>>109074272
you write this in response to someone criticising the act of simping for and gooning to vtubers?
if you could write nothing in response to anyone for the foreseeable future, and then consider hitting your head against concrete really hard after jumping off of a bridge or building so that you die, that would be great
>>
>>109074293
gooning to vtubers is so weird, they should just goon to asuka like I have for the past 15 years instead. Asuka is completely not a real woman at all so actually it's LESS weird that I'm masturbating to that!

I think vtubers are japanese autism weeb cringe too, but c'mon man
>>
>>109074291
Thank you!
>>
File: debo_ams_fia_00016_.png (2.19 MB, 1792x977)
2.19 MB PNG
>>
>>109074339
nice 6 finger takes a special kind of stupid to make that in 2026
>>
File: debo_ams_fia_00017_.png (1.94 MB, 1792x977)
1.94 MB PNG
>>109074357
>>
>>109074357
it's intentional. she provides more grippy handjobs like that
>>
>>109074366
With his condition I doubt that
>>109074365
That's better I guess random seeds can save you. How does it feel to kiss the ring of /ldg/?
You're free to stay but we will always look down on you until you actually take accountability and put in effort.
>>
File: debo_ams_fia_00018_.png (2.5 MB, 1792x977)
2.5 MB PNG
>>109074377
>>
>>109074306
yes it's less weird when someone masturbates to an idealized version of a woman whose true personality and inner perception are defined from the ground up by the masturbator
how are we defining "weird" here? it's weird that anyone would suffer a real woman that has existed in the real world and has become extremely wealthy through puppeteering sexualized cartoons in the most bitter, emasculating flavor of current year hoeflation social media prostitution without even having to reveal her identity or show you her actual tits, whose entire career is to profit off of sexually frustrated children and men that are as intelligent as children, that have no ability to compute or comprehend their personal motivations and driving forces, let alone those of other people
couldn't be me, crack whores are pillars of society compared to this shit
>>
>>109074222
>the training materials
might be a bit too much work just to hand over, over 2 year project
>>
>>109074390
See you're now trying was it that hard?
Every day you post here fills me with joy btw, The best part is I don't have to do anything and you still lose in perpetuity. But seriously after all these years you should apologize for malware spreading and apologize to the anons you obsessed over, we all moved on but you haven't, which is sad.
Still proud that you finally figure out hands after all these years give your caretaker a pat on the back for never giving up on you.
>>
File: debo_csa_fia_00002_.png (1.96 MB, 1792x977)
1.96 MB PNG
>>109074412
>>
This doesn't work when the images look decent. Modern models have taken away your annoyance factor with gens not because they are good but they hit the bare minimum which makes this more sweet!
Cheers schizo you're now in the pure noise bucket....like a fading voice in the wind
>>
File: sturgeon.jpg (1.23 MB, 4096x4096)
1.23 MB JPG
>>
File: mafw.png (254 KB, 900x806)
254 KB PNG
>>109074279
idgi, why are you here if he made this thread and you dont like him?
>>
>>109074444
This doesn't work the community made the thread and the OP documents his mental decline.
Try harder with the bait will ya?
You guys are like that one Japanese guy that didn't know the war was over and you keep at it. He hasn't even updated his repo in months since the c++ boys made a proper functional frontend for their project.
>>
File: debo_csa_fia_00003_.png (2.12 MB, 1792x977)
2.12 MB PNG
>>109074444
checked
>>
File: Wan21_SCAIL2_00129.mp4 (1.1 MB, 1248x352)
1.1 MB
1.1 MB MP4
>>109074291
Starting video from the back view is now doable much easier. Before it was kind of a pain in the ass to use back reference first, then immediately switch to front ref.
>>
Finally got Comfy updated and working (we are still on ver .24, right?), and I made my first practiced local gen! Thanks for the help, /ldg/. Now the journey really begins.
>>
File: debo_csa_fia_00005_.png (2.22 MB, 1792x977)
2.22 MB PNG
>>109074486
congrats
>>
>>109074486
Good job anon
Just ignore this schizo
>>109074499
He spreads malware and it's documented other than that you'll do great
>>
>>109074467
workflow?
>>
>>109073851
it is a medium tier model with only strength that via json you can skip controlnet and regional prompt extensions to precisely direct the generation
properly prompting for that would take some time
training stuff for that which i will try will be tedious
ostris and kohya have released code for it that seems somewhat stable
>>109073333
how do you guys prompt on scail?
i never used wan before but got this scail to check if i can expand my image dataset for lora training.

running the image through the testing workflow produces meh results it does not preserve image quality.
or does one needs some of those wan2.1 loras?
>>
I tried using scail2 on blowjob scene, the dick disappeared
>>
Migrate
>>109074528
>>109074528
>>109074528
>>109074528
>>109074528
>>
>>109074291
>>109074509
>>
>>109074253
Never said I liked her or wanted to give her money.
I just want to fuck her. That's it. No simping here.
Unless you're monkmaxxing and think sex drive is simping, in which case you should get off the internet and recite some mantras.
>>
Is this GPU ranking still relevant?
https://chimolog.co/bto-gpu-stable-diffusion-specs/
>>
>>109074027
>lithuanians
>white
lol
>>
>>109074752
>12gb vram wins all price efficiency rankings

interesting

i'd say if given benchmarks (mostly sdxl and upscaling) are relevant to your usecase then yes, edit models and video are missing though



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.