[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Anime Segmentation Edition

Discussion and Development of Local Image and Video Models

Previous: >>108489653

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Blessed thread of frenship
>>
blessing, very nice.
now lets go for two news posts and then a meltie over early bake.
>>
feed me gens
>>
>>108494546
*hugs you*
>>
>>108494555
The schizo"news" are like nigbo: disabled
>>
File: anima_preview2_00263.png (1.62 MB, 1359x701)
1.62 MB
1.62 MB PNG
Interesting, Anima knows the concept o peeling potatoes
>>
File: anima_preview2_00264.png (1.7 MB, 1366x768)
1.7 MB
1.7 MB PNG
SEXOOOO
>>
>>108494605
>>108494599
Washed up colors, no true blacks, Anima epsilon slop
>>
File: _AnimaPreview2_00074_.jpg (379 KB, 896x1152)
379 KB
379 KB JPG
>>
File: _AnimaPreview2_00067_.jpg (383 KB, 832x1216)
383 KB
383 KB JPG
>>
>>108494637
nice
>>
Does anyone have a signed Lora Manager add-on for Firefox with a modified license-manager.js file?
>>
File: anima_preview2_00265.png (2.14 MB, 1366x768)
2.14 MB
2.14 MB PNG
>>108494614
What are you talking about? for anime we don’t need true blacks, they aren’t stylistically faithful to the medium. The v-pred and rectified flow approach was mainly implemented for realistic or semi realistic styles, not for anime.
>>
>>108494648
Why are you managing your LoRAs in your web browser
>>
LTX2.3 i2v

What are the limits for duration? I tried 20 sec, it kind of worked
>>
>>108494599
>>108494605
>>108494652
comfortable gens anon
>>
>>108494666
those are screenshots from the anime that are being passed off as gens
>>
File: 1758799380357376.png (760 KB, 1024x1024)
760 KB
760 KB PNG
>>108494634
And gave her off model garbage
>>
>>108494672
>off model
you have to actually prompt for the things you want to see i know thats hard but its required
>>
File: anima_preview2_00267.png (1.14 MB, 1366x768)
1.14 MB
1.14 MB PNG
Can SDXL cucks make anime feet that arent rotoscoped?
>>
File: retardo.jpg (18 KB, 250x203)
18 KB
18 KB JPG
>>108494659
the add-on is handy; if it werent, he wouldnt use it as a paywall
>>
>>108494680
I did. Even extreme prompt like "12 years old, flat chest, young, child"
>>
cooozy
>>
File: _AnimaPreview2_00057_.jpg (315 KB, 832x1216)
315 KB
315 KB JPG
>>
can Anima do img to img ?
>>
File: anima_preview2_00271.png (981 KB, 1366x768)
981 KB
981 KB PNG
Frieren feet >>108494688
Pic related Fern feet
pic your poison
>>
>>108494728
Yes
>>
File: MUGEN_example.png (554 KB, 414x617)
554 KB
554 KB PNG
so this is the power of rectumflow
>>
>>108494736
Can Anima edit fully clothed lolis to wearing microbikinis ?
>>
File: 1759937274328900.png (2.85 MB, 1344x1616)
2.85 MB
2.85 MB PNG
>>
>>108494599
Cute
>>
>>108494744
Yes, remember to change your clothes promptmake a good mask. But technically, yes, I use Anima to change my sloppy SDXL backgrounds or fix hands or feet, and it just works as intended.
>>
what's with the animetroon spam
didn't they get their own fucking containment
>>
File: 1756095137926968.png (3.82 MB, 1344x1520)
3.82 MB
3.82 MB PNG
>>
>>108494691
anon, none of those are real tags. do you have any idea what you're doing?
>>
>>108494764
ADG is dead thread
>>
>>108494765
>>108494750
>>108494740
>>108494732
Go back to /adt/, tranny.
>>
>>108494599
>an anime model that finally renders anime images that look like real anime and not some 2.5d slop
that's nice actually
>>
>>108494774
>>108494764
>>108494770
Stop shilling your shit thread retard
>>
>>108494764
I won’t post in /adt/, sorry. /ldg/ has always been anime friendly since the beginning, but you can segregate yourself if you want.
>>
>>108494783
>>108494784
This.
>>
Why does /adt/ get so insecure when anons post anime here? What are they afraid of?
>>
File: _AnimaPreview2_00094_.jpg (354 KB, 992x1456)
354 KB
354 KB JPG
>>
File: anima_preview2_00272.png (1.54 MB, 1366x768)
1.54 MB
1.54 MB PNG
>>
File: 1752066706449552.jpg (1.28 MB, 1641x1960)
1.28 MB
1.28 MB JPG
>>108494760
So where i can download Anima ?
but before i kill my 10gb of write on my SSD. Prove me Anima is good.

Turn her into microbikini, with accurate bodytype
>>
>>108494810
even the hands look good, looks like the anima devs know what they're doing
>>
>>108494795
The worst part is that /adt/ anons didn’t come from /ldg/, /adt/ was created by Julien to disrupt /ldg/ and put his malware there.
>>
File: 1746687325049682.png (2.91 MB, 1344x1616)
2.91 MB
2.91 MB PNG
>>
>>108494795
I don't care about anime when posted as a tech example but spam does get annoying, too much of this shit everywhere
>>
>>108494814
>before i kill my 10gb of write on my SSD
holy autism
>>
>>108494814
Do you think I’m replying from my PC with my GPU running? No, anon, you’re lucky I’m constipated in the bathroom and still replying to you. But if you’re the Grok one, API models are better than local ones if they’re uncensored, so keep using Grok.
>>
>>108494814
i will not spoonfeed a tard
>>
>>108494868
>>108494860
>>108494856
Nevermind then
>>
>>108494838
It's just Meat spamming.
>>
>>108494880
figured
>>
>>108494850
Yeah, and why did you decide to start here? Why not start with the other generals you visit ;)
>>
File: 1753046725796754.png (3.06 MB, 1168x1704)
3.06 MB
3.06 MB PNG
>>
>>108494985
Because they don’t want anime posted here. What would be the point of /adt/ if anime were posted here too?
>>
>>108494850
>noooooo stop posting gens and having fun reeeee
>>
File: anima_preview2_00270.png (2.15 MB, 1366x768)
2.15 MB
2.15 MB PNG
>>
isn't he tired of doing this shit
>>
>>108494810
>>108495017
What are doing, retard?
>>
>>108494860
Anon this is /ldg/
>>
>>108495055
He is contributing dataset, you ungratefull bastard
>>
>>108494814
we get it, you're retarded and you just want someone to give you your /r/equest on a silver platter
>>
>>108495105
>We
Dont be rude, anon.
>>
>>108495105
I could fire up Illustrious + Photoshop to turn her into microbikini with accurate bodytype and artstyle. But i want to find a easy edit, similar to what Grok Imagine does
>>
what's the point with these blurry shit gens?
>>
If I was janny I would ban every discord post, auto-ban any mention of any fucking namefags and the endless gossip posts 24/7 in this general. You've completely ruined this entire thread for months. We used to have gens, gen feedback, tips and tricks, GOOD FAITH model discussions, etc. Now it's all namefag this, discord screenshotterio this and endless off-topic garbage.
>>
>>108495140
And why don’t you use Grok? Did you make it a personal goal not to use APIs or something like that? Just use Grok, anon.
>>
>>108495188
All this because anime posting? how rude /adt/ raider
>>
File: 1749424784466207.png (3.1 MB, 1168x1704)
3.1 MB
3.1 MB PNG
>>
>>108495188
uh oh, meltie
>>
>>108495188
tbf not much happening in image gen recently. There's been no real hype and no excitement since Zimage base turned out to be a nothingburger and Klein being just Flux with everything bad that's bad with Flux.
So what would the thread be talking about? Everything already has been said.
>>
>>108495190
Havent you heard the news ? Grok is dead. The censorship is similar to Gemini now
>>
>>108495205
/adt/ is mad that they have zero community and personalities
>>
>>108495188
meat meltie
>>
This retard fucking attacking every single person he replies
>>
File: 1756763441766431.webm (2.15 MB, 1000x1250)
2.15 MB
2.15 MB WEBM
atack me :3
>>
>>108495319
disgusting creature
>>
File: 1773787292157462.png (312 KB, 2269x1432)
312 KB
312 KB PNG
https://prismml.com/news/bonsai-8b
>1bit LLMs are a thing
now imagine if we would make 1bit diffusion models, we would be able to run 30+b models and they would finally get competitive with API models
>>
>New model
>censored
>new model
>censored
>dead
>repeat

Its tiresome. All these years and Illustrious still the best. This is why AI bubble pops. Finally i can afford RAMs
>>
>>108495472
but the gap between local and SaaS models is due to the size of the dataset not the parameter count
>>
>>108494342
based jenner always posting right before the thread dies so it's hard to reply with words of thanks and encouragement
>>
Did you people unironically PAY for api ? isnt this /LDG/ ?
>>
File: anima_preview2_00273.png (1.18 MB, 1366x768)
1.18 MB
1.18 MB PNG
>>
File: 1744559495231005.mp4 (1.7 MB, 704x928)
1.7 MB
1.7 MB MP4
>>108495319
:3
>>
File: anima_preview2_00274.png (1.85 MB, 1366x768)
1.85 MB
1.85 MB PNG
>an abandoned wooden cart lodged high in the branches of a large tree, low-angle view, weathered wood, broken shafts and wheels, dense green leaves, pale gray overcast sky, distant forest and soft mountain silhouettes,
Neat
>>
Fuck off Animatranny
>>
File: Screenshot004-1.png (67 KB, 1572x512)
67 KB
67 KB PNG
Why GEMMA-3 for LTX2.3?

Can any other sizable LLM do the trick too?
>>
>>108495750
yeah just use whatever
>>
>>108495750
>Can any other sizable LLM do the trick too?
no, it's like giving you chinese text, you'll never make it since you've only been trained to read and understand english
>>
>>108495728
where?
>>
>>108495750
It's trained on precise embeddings Gemma-3-12b spits out. You can't use any other LLM and get coherent gens, even if you pick one where tensor shapes match.
>>
File: 00189-3401236421.jpg (512 KB, 1792x2304)
512 KB
512 KB JPG
why does ldg like such simple styles or anime screenshots with anima anyway

where is my masterpiece, best quality, 1girl, looking at viewer noobslop?
>>
>>108495988
i'm more the greg rutkowski kind of guy
>>
File: anima_preview2_00275.png (1.2 MB, 1366x768)
1.2 MB
1.2 MB PNG
>>108495988
I like anime tho
>>
>>108495750
>IQ4-XS
OHNONONONONONONO
>>
I found out where the "Anima architecture is dramatically flawed" meme is coming from. And now I never want to spend time dredging through noobcord ever again.

Cosmos uses T5XXL. Anima replaces this with Qwen3 0.6b + LLM adapter module, because it's smaller. Because the DiT expects T5 embeddings, the Qwen3+adapter combo mostly recreates those T5 embeddings, at least early on in training. But, the adapter is trainable, so over the course of a huge finetune it can shift the embeddings to more easily represent the anime data.

Weeks ago, Bluvoll took Anima, replaced Qwen3+adapter with the original T5XXL, and generated some images with it. This predictably caused a huge degradation in artist, character, and general anime knowledge. Bluvoll's conclusion was "the DiT barely learned anything, almost all the knowledge is in a 130M parameter adapter module". He spread this all over noobcord, everyone started believing it and calling tdrussel incompetent, and it continues to be repeated there to this day.

Fucking ridiculous. Of course running a model on text embeddings it wasn't trained against is going to fuck up the outputs. That doesn't mean all the knowledge is in the text encoder / adapter, it literally just means you're feeding wrong text embeddings to the model.
>>
>>108496025
Thanks tdrusell for explainging this, we keep beliving in ainima here
>>
>>108496025
>noobcord
This Anima is bad is Ani or a Noobcord raid?
>>
I keep wanting to like anima, but nothing has felt like the jump from SD1.5 to SDXL yet. It just feels like yet another sidegrade that is better in some ways but not enough to matter. When do we get a model with actual prompt adherence and tag understanding?
>>
i NEED my tags
>>
>>108496068
Anima can’t reach the same level of composition as Zit or Zib. I like generating epic battle scenes with lots of soldiers, dragons, and orcs, elves, arrows but Anima starts repeating elements and mangling characters once things get a bit too complex. Zit and Zib handle that much better. Both Anima and Zimage eventually break, but Zimage has a much higher tolerance for complex scenes of this kind.
>>
File: 04488-4129453782.png (2.5 MB, 1213x2159)
2.5 MB
2.5 MB PNG
It's over, isn't it?
>>
i NEED my hags
>>
Is there a way to extend a wan 2.1 video seamlessly? I can img2vid the last frame but lets be real that sucks ass and doesn't preserve trajectory.
>>
>>108496161
svi
>>
File: Discord.jpg (447 KB, 714x1047)
447 KB
447 KB JPG
>>108496121
Local image gen its doomed like local text gen, I’m thinking of going back to NAI since they upgraded their uncensored RP model
https://medium.com/@novelai/welcome-novelais-newest-writing-model-xialong-ecde7d21d111
>but this is SaaS!
Fuck off, /ldg/ cord has a cloud gen discussion thread and they talk about cloud gen
>>
>>108496161
Another option is ltx 2.3 to extend.the video, it's less terrible than straight up ltx 2.3 img 2 vid
>>
>>108496217
C u c k
Make your own thread
>>
>>108496217
Online models even more cucked anon ??? Sora ?? Grok ??? Flux ???
>>
>>108496253
He said NAI, which isnt cucked
>>
File: purged.png (2.42 MB, 1824x1248)
2.42 MB
2.42 MB PNG
>>
>>108496217
localkeks fear uncensored saas. novelai is both faster, cheaper, and better than hosting an equivalent LLM locally. soon local will be nothing but posturing, it's already so far behind models like nano banana 2 and uni-1
>>
>>108496267
youre a banana
>>
>>108496267
>better
I can run GLM4.6 locally at q4_k_m quant, and don't have to send my cunny prompts to someone else's server.
>faster, cheaper
True, and this will always be the case with giant MoE models since inference providers can do dynamically batched inference for efficiency. With local you always have a batch size of one.
>>
>>108496267
>localkeks fear uncensored saas
It's true. I wanted to make some sfw gens so I started booting comfyui but then I realized I would just be wasting my time and I used nano banana instead. Porn is the only thing localkeks have but it's only a matter of time before they are eclipsed by an uncensored API model.
>>
>>108496321
your API node???
>>
>>108496260
Yea but you have to pay $$$$$$ for it. Still cucked
>>108496267
How much you get paid for this post ?
>>
>>>/SaaSdg/
>>
>>108496346
>>108496245
Why? Also Comfy Cloud exists and API nodes exists
>>
>>108496504
Make your own thread. Anything that cost $$$ and Online is not allowed here
>>
>>108496521
how bout fuck your trooncord drama circlejerk and we'll just post what we want you stupid bitch
>>
>>108496521
Who the fuck are you? Fuck off newfag. LDG has always been open to cloud and anime. If you wanted to segregate yourselves that's your own business, but don't come here trying to inveting rules to LDG into something it never was.
>>
>>108496025
Doesnt Klein also switch t5 for qwen
>>
>Discussion and Development of Local Image and Video Models
>>
so much heckin "development of local image and video models" here between all the posts viciously attacking a software developer
>>
>>108496025
Thanks James Bond, you found 1 counter argument for why Anima is shit. Now you need to find the other three. The most important one is the memory loss issue.

Proof 1 >>108487391
Proof 2 >>108489823
Proof 3 >>108492625
Proof 4 >>108493262 (this is the one you're addressing)
>>
>>108496576
/ddlivm/ - Discussion and Development of Local Image and Video Models
>>
>>108496615
Proof 1 is easy to debunk. They trained on the wrong model. Proofs 2, 3 and 4 are handled by the post you are replying to.
>>
>>108496576
Ok, and what about UIs? GPUs? Python dependencies? Guess we need 3 more threads!
>>
>>108496615
>now you need to dance for me, monkey
no
>>
>>108496019
Can Anima actually do this? No joke, this is scene is complex.
>>
>>108496667
Pwease!
>>
So what models you guys developing? Me personally, I've been working on this thing. It's a video model that will put Seadance and Sora to shame and it will be fully uncensored. It's gonna be litty.
>>
>>108496686
Working on this insane uncensored image model called NovelAI. We even get paid to post about it here! Check it out sometime
>>
>>108496019
Hmm, okay. I just like nice pictures and maybe trying to sneak into a collage here and there.
>>
I can't stop thinking...
>>
bout dem beans
>>
>>108496650
>>108496552
Lmao seething. Your online models is next ;)
>>
>>108496025
Man, I'm a nontrainer, and even I know swapping out the text encoder a model was trained on is just asking for trouble.

>>108496677
They're actual anime screenshots you can find on Google. Is bait.
>>
good evening sirs
one of my favourite lewd AI "artist" quit recently, and decided to release his metadata in the form of a png, and his lora/safetensors file
i've never delved into imagegen before, what's the simplest way to use these to gen stuff similar to what he did?
i got as far as installing invoke AI and getting his lora added to it but i have no clue what im doing from here and everything i try to generate using invoke is just "basic" stereotypical ai type images even when i fiddle with the weight of the lora or whatever
>>
>>108496834
Use forge
Or
Use ComedyUI and import the png to the workflow
>>
>>108496811
>They're actual anime screenshots you can find on Google. Is bait
I know but can Anima do this?
>>
>>108496711
I'm looking at this and, hold on, you're telling me I get a SOTA lewd image generator AND story writer for one subscription? Wow, and it frees up my gpu so I don't have to sweat in a sauna, or I can still use local models to animate images as I'm genning new ones. This is great.
>>
>>108496878
You also get SOTA tooling like inpainting, character slots, vibe transfer, style and character references and much more.
>>
>>108496834
Don't use Invoke. Use Forge Neo or ComfyUI, links in OP. Copy your question to GPT for guidance and check YouTube tutorials, there are a lot, and the UI github readme,
>>
Which is better, NovelAI or Anima through ComfyCloud API?
>>
>>108496895
think theyre about the same at the current moment for anime images. novelAI is better if you cant train anima loras, want to use their LLM, or want to make furry images. Anima with a controlnet should be way better though for pure anime genning once it gets one
>>
>>108496895
NovelAI pros :^) better than Anima in every aspect
NovelAI cons :^( no API nodes

Anima pros :^) has API nodes, Comfy support, and stays up to date with most characters
Anima cons :^( lower quality than NovelAI
>>
>>108496861
It can get surprisingly close.
>>
File: Flux2-Klein_00385_.png (981 KB, 768x1344)
981 KB
981 KB PNG
>>
>>108496853
>>108496894
thanks, i've installed comfyUI and am fiddling around with it now
this shit looks way more complicated than text gen (text gen 3-4 years ago, anyway), i hope its as straightforward as GPT is saying and its just a case of plugging in the correct model and the lora then dropping the metadata into the thing lol
>>
>Grok makes local gens obsolete.....
>.....until they cucked themselves

Laws and regulations really killed AI.
Were lost
>>
>>108497145
maybe they shouldn't have let people make csam on their service, fucking retards
>>
>>108497174
Why should they care ? No real children is involved.
>>
File: 1770023263841792.png (2.92 MB, 1880x1072)
2.92 MB
2.92 MB PNG
>>
>>108497145
ltx2.3 is not bad at all. any big nsfw loras can easyly kill grok
>>
>>108497222
I mean for image gens and edit. Pre cucked Grok is absolutely good for it
>>
>>108497125
it's the constant bloating and breaking every update that will make you wish there was something else
>>
>>108497285
i've got the workflow and the lora plugged in, but the images im getting are all black, googling a bit it seems like im using the wrong checkpoint model or something
the png i have with metadata has a model name and hash inside it, i thought just grabbing the same model name from civitai would be good enough but do i need to use the hash to find the specific version the guy used?
>>
>>108497298
nevermind, i just looked up the hash and it's the same safetensors file i downloaded
not sure what i'm doing wrong, GPT said to try lowering the image resolution and disabling the lora but neither of those did anything
>>
>>108496834
>>108497298
You should really figure out the basics of t2i before you go around attempting to emulate other peoples gens / use their complicated workflows
>>
>>108497322
with his metadata and lora i assumed it would just be as simple as plugging the right things in and tweaking the proompts for my own purposes
>>
>>108497335
Just upload the workflow to catbox.moe and share that link here. I'll tell you what's wrong.
>>
>>108497341
https://files.catbox.moe/2djac5.json
>>
>>108497348
Huh. Must be something related to your install or hardware.
>>
>>108497385
Nvm, I stopped reading too soon.
>>
>>108497388
was messing around and tried just pasting in everything manually using the default workflow and it seems to be working, just need to tweak ksampler settings and shit i guess, huh
>>
>>108497417
actually, just removing "CLIP Set Last Layer" or setting it to -2 instead of -1 seems to fix the whole imported workflow
i have no idea what i'm really doing but this is progress i guess
>>
Multi-GPU bros I want to take the plunge on a second GPU, a 3090, to pair with my 4090 so I can have more breathing room for what I can load and run in parallel, and maybe also run models split between the two.

I'm planning to mount and plug outside my current case since its already claustrophobic in there with the 4090.

My shopping list as of now:
- the used 3090
- a pci riser
- a new, second PSU exclusive to the 3090 (I guess my current 1000W corsair wouldn't handle both at full load)
- an add2psu module

Is that it or am I missing something? Guys who run 2+ GPUs do you think it was worth it? My main usecase would be running shit is parallel like LLM + media gen, edit model + gen model, power batching stuff, etc.
>>
File: 2026-01-20-15-36-55-003.jpg (1.49 MB, 2082x1440)
1.49 MB
1.49 MB JPG
>>108497443
yeah I guess, but I already had my 3060, I got a 3090 because I could, and I knew I wouldn't get another chance
I was already set up, and I've tested the power draw and it's never been higher than 500w
it's not useful for image gen at all, at best you can still play games while training or inferencing
>>
>>108497502
>it's not useful for image gen at all, at best you can still play games while training or inferencing
Shouldn't SDXL run pretty ok? Also the same big LLM models that I run now, but slower, and fast for small/intermediate ones.

Also forgot to include the frame/case in the shopping list, I'm still not sure what to pick. What would be the best options just to mount a GPU + second PSU + maybe a fan? I hope it doesn't takes another big ATX case just for this.
>>
>>108497571
At this point make a small Linux arch headless only to run comfy
>>
>>108497502
looks hot in there. I thought about pairing my GPU with an RTX 4000 PRO but not sure it would work
>>
>>108497641
it's fine. Might be an issue in a month or two though
>>
how would you go about transforming a digital painting of an oc character into a character sheet? the character is at an angle in the painting and has a bunch of shit in the background.
>>
>>108497659
nano banana 2 can do it, just ask it to extract the character and turn it into a character sheet
>>
>>108497611
This PC already function like this pretty much, I only actually sit there for work.
I have it set up with many home server like services for me to connect and fuck around while laying around with my thinkpad or traveling.
>>
>>108497571
you can run two separate batches on each gpu but you can't have the compute from both cards work on the same batch. Even in training all the compute happens on the main card but you can split the model across both cards.
>>
>>108497679
>but you can split the model across both cards.
Is this efficient? Never messed with multi GPU loading. I'm currently on 128 GB RAM, can I expect a bump in token speed running inference on glm46 and the likes? Or at least be able to run slightly bigger quants.
>>
File: ComfyUI_jpg_00413_.jpg (3.9 MB, 4288x7680)
3.9 MB
3.9 MB JPG
My attempt to multi gpu was a total failure.
>>
>>108497741
My condolences.
>>
>>108497700
This is more a question for /lmg/
>>
File: 1747612452452658.png (3.15 MB, 1168x1792)
3.15 MB
3.15 MB PNG
>>
File: Flux2-Klein_00171_.png (2.82 MB, 1072x1920)
2.82 MB
2.82 MB PNG
>>108497745
Thank you. I will buy a cheap second hand box and put the weaker gpu in it, use it as a thrasher to test agents without fear of them leaking my credentials, so it's not a total loss even though my pride took a hit.
>>
>>108497771
This 2000s aesthetic is neat, really brings be back. Checkpoint? Prompt?
>>
>>108497812
Anima. Prompt: https://pastebin.com/SB2dGvYN
>>
>>108497828
Thanks senpai!
>>
>>108497766
I thought I was there lmao
>>
File: 975225.jpg (58 KB, 900x960)
58 KB
58 KB JPG
If I'm training at 768px and a pic in the dataset has the shorter side below that, does it get resized or dropped?
>>
File: 1771299844276202.png (2.91 MB, 1168x1704)
2.91 MB
2.91 MB PNG
>>
>>108497794
Prompt for this?
>>
>>108497980
That's not how it works. 768px = 768x768 so it will target roughly 589824 pixels. Images will be bucketed based on aspect ratio and min bucket steps.
How this exactly works depends on the trainer. It might do something like scale down with even ratio to closest pixel target match and crop out excess pixels on the edges.
>>
File: Flux2-Klein_00172_.png (2.8 MB, 1072x1920)
2.8 MB
2.8 MB PNG
>>108497991
3D digital art rendering with a photorealistic aesthetic, depicting a bright, naturalistic outdoor scene. The setting is a panoramic vista from a high stone or concrete ledge, overlooking a vast, lush green forest landscape of rolling hills that extend to the distant horizon. In the foreground, to the left, vibrant pink and white roses bloom amidst rich green foliage. The sky above is a brilliant azure with soft, fluffy white cumulus clouds. Bright, slightly diffused natural daylight illuminates the scene, creating gentle shadows and pronounced, glossy reflections on metallic surfaces.The central subject is a **futuristic angelic android** with a slender frame and balanced, sculpted feminine proportions, suggesting a refined, advanced design rather than human. Its exterior is a smooth, matte white material, akin to a sleek exoskeleton or form-fitting suit. The head is partially visible in profile, featuring a delicate jawline. Short, immaculately styled white hair with a clean, sleek bob cut frames the head, which is adorned with a prominent white headset featuring large, reflective chrome earcups. The figure’s overall aesthetic is one of elegant technology and serene contemplation.The **android is standing with its body angled approximately 35-40 degrees to the right from the viewer's perspective, its head slightly lowered and turned an additional 25-30 degrees to the right**, gazing out towards the expansive landscape with a contemplative demeanor. Its form-fitting white base suit is augmented by intricately designed, highly reflective chrome-plated armor components. These include large, rounded shoulder pads, segmented armbands on the upper arms, full forearm guards, and a segmented belt-like structure around the waist, which features a discreet black module on the right hip.

(cntd)
>>
>>108498022
The legs are protected by robust, segmented shin and calf guards, leading down to sleek white boots detailed with reflective chrome toe caps and distinctive red accents on the back of the heels. Large, exquisite angelic wings, crafted from a transparent, multi-faceted crystalline or holographic material, are mounted on its back. These wings exhibit extreme iridescence, reflecting and scattering light into a dazzling spectrum of greens, blues, pinks, yellows, and oranges, while appearing to emit countless sparkling, multi-colored bokeh-like lights. The camera captures a medium full shot, positioned at a slightly low angle, emphasizing the subject's stature against the horizon. The aspect ratio is square (1:1), with a shallow depth of field that keeps the android and immediate foreground in sharp focus, while the distant forested hills are softly blurred.


Flux2Klein9b btw
>>
davinci whennnnnnnnnnnnnnnn
>>
File: file.png (310 KB, 1891x1109)
310 KB
310 KB PNG
Currently vibecoding a video generation tool because managing clips and shit from ComfyUI is a huge pain in the ass for a large project. You import API workflows from comfy and select which inputs you want to be able to change between generations. It uses a comfyUI backend to actually do the inference. Generated assets are downloaded and managed in a graph so you can see origin images/clips and join them together. This is mainly focused for gamedev where you might have lots of looping animations or transitions between keyframes and shit and screwing up a video early in a pipeline can lead to tons of wasted time if not caught.
>>
>>108498045
Based bespoke front end dev
>>
>>108494992 >>108495233 >>108497197
now This is what I call technology and related topics
>>
>>108498045
Nice. I did something similar setting up some keybindings to send stuff from mpv to comfy.
>>
File: ComfyUI_18577.png (3.2 MB, 1500x2000)
3.2 MB
3.2 MB PNG
>>108495188
You can't "out loser" these guys, Anon. They post hundreds of times per day and are gonna easily steamroll any actual discussion going on. You just have to sorta have a thread around them (ignoring them completely) if that's what you want. They're not going away.

>>108495550
Well, I'm really only here for news/info.
>>
File: 1746672044298602.png (2.42 MB, 1432x1432)
2.42 MB
2.42 MB PNG
>>
>>
>>108496217
unironically end your life, marketer
>>
>>108498022
the wings are wrong shes missing a heel which tanks the image
>>
>>108498106
the quality of these jen gens is really good
how hard was it to get it trained and what model did your train for?
>>
https://huggingface.co/Qwen/Qwen-Image-2
new model
>>
>>108498176
Funny
>>
>>108498318
fuk u
>>
File: 1764791606297232.jpg (817 KB, 2048x1128)
817 KB
817 KB JPG
>>
comfy does not feels comfy at all
>>
>>108498106
what model/workflow is that?
>>
File: 1769287940051024.jpg (801 KB, 1840x1328)
801 KB
801 KB JPG
i like cake
>>
File: 1773252494310492.jpg (632 KB, 2000x848)
632 KB
632 KB JPG
Babe wake up, bytedance released another useless shit
https://github.com/ByteVisionLab/DreamLite
>>
>>108498511
can't they just release Seedance 2.0 as an april's fools? would be funnier :(
>>
File: lol.png (162 KB, 1631x1142)
162 KB
162 KB PNG
>>108498511
>A 0.39b model is supposedly better than Kontext (12b)
who believes this? I mean I get it it's bytedance they made fucking seedance 2.0 but come on man
>>
>>108498511
>https://github.com/ByteVisionLab/DreamLite
>DreamLite is built on a pruned mobile U-Net backbone
Why are they still using U-Net in the year of our lord 2026?
>>
File: 1761566745818639.jpg (668 KB, 2048x1128)
668 KB
668 KB JPG
>>108498511
bro I love the plastic look!
>>
>>108498567
I wonder how fast it is. anyway for 0.39b looks like edge stuff (stuff to run on shitty devices) which actually makes it interdasting
>>
File: 43958734865893769.png (418 KB, 749x323)
418 KB
418 KB PNG
just transitioned from forge to comfy and im having trouble using detailer for eyes, which has been leaving them mismatched. i dont have a set up for manual inpainting yet, so im wondering if there is something else i can do to get better results?
>>
File: python_tImf5UagkR.jpg (26 KB, 700x300)
26 KB
26 KB JPG
Do I just rotate the 3:2 to make them 2:3?
>>
>>108498720
what is that?
>>
>>108498720
No
>>
>>108498741
onetrainer buckets
>>
>>108493092
The segmentation looks really clean, can the segmenter output just background removal?
>>
>>108498657
post workflow retardokun
>>
File: 1771595371229932.jpg (529 KB, 1536x1536)
529 KB
529 KB JPG
>>
File: Video_00001.mp4 (2.95 MB, 1056x1600)
2.95 MB
2.95 MB MP4
>tfw I solved latent upscaling issues
>>
File: 1756369696410569.png (148 KB, 640x562)
148 KB
148 KB PNG
https://xcancel.com/thejobchick/status/2039032800452723034
>Oracle will fire 30000 employees
now that AI is replacing jobs, when will we get Universal Basic Income?
>>
>>108498991
I gen in 480p
>>
>>108499009
UBI is one bowl of rice per day
>>
>>108499009
western countries already have UBI essentially. you won't starve while unemployed.
>>
File: Video_00001.mp4 (2.6 MB, 1056x1600)
2.6 MB
2.6 MB MP4
>>108499029
Yeah, so do I.
>>
>>108499133
>western countries already have UBI essentially.
it's not real UBI, you have to proof you're looking for a job or else they'll take it for you, real UBI is admitting there's not enough jobs for everyone so they leave the unemployed alone
>>
>>108499133
haha in my western country you have to pay around 100$ for mandatory """private""" health care no matter what or you are going into debt
if you are looking for a job they pay it for you, but look for too long and they kick you out that system
nothing better than to get the worst of both worlds
>>
>>108499139
but upscaling kinda gay?



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.