[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: autocollage 2.jpg (486 KB, 1554x1964)
486 KB JPG
Discussion and Development of Local Image and Video Models

Previous: >>108891692

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
My art was not included..:(
>>
>inb4 n*gbo
>>
File: goth.jpg (534 KB, 1216x1536)
534 KB JPG
>>
mine was included
>>
>>108897509
Sorry I was in a hurry, possible I skipped over them in haste

If it was the sci fi landscapes I thought they had too many elements, an overcrowding that was in my opinion in bad taste.
>>
File: elves7.png (852 KB, 768x768)
852 KB PNG
>>
File: ComfyUI_00447_.png (684 KB, 896x1152)
684 KB PNG
>>108897541
what is that dress called?
>>
File: anima4.png (877 KB, 768x1024)
877 KB PNG
>>
troll collage
>>
File: Juggernaut_Z_V1_00289_.jpg (603 KB, 1344x1728)
603 KB JPG
>>
File: ComfyUI_00449_.png (909 KB, 896x1152)
909 KB PNG
>>
>>108897550
elf, revealing_clothes, pelvic_curtain, armored_dress, cleavage
>>
File: 1765439668826780.png (2.83 MB, 1536x1536)
2.83 MB PNG
>>
File: ComfyUI_00450_.png (1.55 MB, 896x1152)
1.55 MB PNG
>>108897608
>>
File: Juggernaut_Z_V1_00325_.jpg (485 KB, 1248x1824)
485 KB JPG
>>
>>108897541
>>108897550
elf, revealing_clothes, pelvic_curtain, armored_bra, cleavage_cutout, breast_cutout, gold_trim, gold_chain
>>
>>108897624
now gen Griffith in that pose
>>
>>108897550
+ diamond-shaped_brooch
>>
File: Video_00004 (9)-2.mp4 (3.83 MB, 696x1000)
3.83 MB
3.83 MB MP4
Trying prompt relay but wan will still go full schizo after 81 frames.

Is there any tech against that?
>>
File: 1771335945942336.png (1.22 MB, 966x621)
1.22 MB PNG
>>108897664
>>108897684
thanks
>>108897672
tried
>>
>mfw Resource news

05/24/2026

>L2P: Unlocking Latent Potential for Pixel Generation
https://huggingface.co/tsolful/Z-Image-L2P-INT8

>MooshieUI: Beginner-friendly interface for ComfyUI
https://github.com/Mooshieblob1/MooshieUI

05/23/2026

>Klein Tiled Upscaler for ComfyUI
https://github.com/Gavr728/ComfyUI_KleinTiledUpscaler

>Anima AI Character & Artist search engine with 49,000 sample images
https://animadex.net

>ComfyUi-Untwisting-RoPE (Training-Free Style Transfer)
https://github.com/BigStationW/ComfyUi-Untwisting-RoPE

>LongCat-Video-Avatar-1.5
https://huggingface.co/meituan-longcat/LongCat-Video-Avatar-1.5

>IMG Dataset Refiner v4.3
https://github.com/NyxAwroo/IMG-Dataset-Refiner/releases/tag/v4.3

>Sulphur-2-base
https://huggingface.co/SulphurAI/Sulphur-2-base

05/22/2026

>[real] Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models
https://github.com/microsoft/Lens

>L2P: Unlocking Latent Potential for Pixel Generation
https://nju-pcalab.github.io/projects/L2P

>GenEvolve: Self-Evolving Image Generation Agents via Tool-Orchestrated Visual Experience Distillation
https://ephemeral182.github.io/GenEvolve

>EasyVFX: Frequency-Driven Decoupling for Resource-Efficient VFX Generation
https://easy-vfx.github.io

>FashionLens: Toward Versatile Fashion Image Retrieval via Task-Adaptive Learning
https://github.com/haokunwen/FashionLens

>PALLAIDIUM Refactored
https://github.com/tin2tin/pallaidium_refactor

05/21/2026

>Follow the Mean: controlling flow-matching generative models by shifting endpoint means with reference examples
https://github.com/pedrocurvo/follow-the-mean

>LiTo: Surface Light Field Tokenization
https://github.com/apple/ml-lito

>Q-ARVD: Quantizing Autoregressive Video Diffusion Models
https://github.com/tsa18/Q-ARVD

>StreamGVE: Training-Free Video Editing
https://dsl-lab.github.io/StreamGVE

>PGC: Peak-Guided Calibration for Generalizable AI-Generated Image Detection
https://github.com/xiaoyu6868/PGC
>>
>mfw Research news

05/24/2026

>Structural Energy Guidance for View-Consistent Text-to-3D Generation
https://arxiv.org/abs/2605.19876

>Simple Approximation and Derivative Free Inference-Time Scaling for Diffusion Models via Sequential Monte Carlo on Path Measures
https://arxiv.org/abs/2605.17850

>ROAR-3D: Routing Arbitrary Views for High-Fidelity 3D Generation
https://arxiv.org/abs/2605.21121

>MARQUIS: A Three-Stage Pipeline for Video Retrieval-Augmented Generation
https://arxiv.org/abs/2605.17640

>RiT: Vanilla Diffusion Transformers Suffice in Representation Space
https://arxiv.org/abs/2605.21981

>LiteFrame: Efficient Vision Encoders Unlock Frame Scaling in Video LLMs
https://jjihwan.github.io/projects/LiteFrame

>Landscape-Awareness for Geometric View Diffusion Model
https://arxiv.org/abs/2605.19865

>CineMatte: Background Matting for Virtual Production and Beyond
https://arxiv.org/abs/2605.18328

>DEVIS-GRPO: Unleashing GRPO on Dynamic Extreme View Synthesis
https://arxiv.org/abs/2605.16937

>Dropout Universality: Scaling Laws and Optimal Scheduling at the Edge-of-Chaos
https://arxiv.org/abs/2605.21648

>Degradation Frequency Curve: An Explicit Frequency-Quantified Representation for All-in-One Image Restoration
https://arxiv.org/abs/2605.17506

>Reducing Object Hallucination in LVLMs via Emphasizing Image-negative Tokens
https://arxiv.org/abs/2605.21300

>Attention Hijacking: Response Manipulation Across Queries in Vision-Language Models
https://arxiv.org/abs/2605.17310

>Comparative Evaluation of Deep Learning Models for Fake Image Detection
https://arxiv.org/abs/2605.20971

>HEED: Density-Weighted Residual Alignment for Hybrid Vision-Language Model Distillation
https://arxiv.org/abs/2605.17093

>Artificial Intelligence can Recognize Whether a Job Applicant is Selling and/or Lying According to Facial Expressions and Head Movements Much More Correctly Than Human Interviewers
https://arxiv.org/abs/2605.17461
>>
>>108897699
Thank you Debo for being here.
>>
>>108897699
>>108897703
go back
>>
>>108897601
plap plap plap
>>
File: captured5.png (1012 KB, 768x1024)
1012 KB PNG
>>
>>108897502
oh hello where are the age 40+ 1girls at?
>>
File: captured6u.png (3.21 MB, 1536x2048)
3.21 MB PNG
>>108897868
>>
>>108897868
defo sfw
>>
>>108897901
anime website
>>
>>108897910
ye?
>>
File: salad girls.png (849 KB, 1024x768)
849 KB PNG
>>
File: salad girls2.png (817 KB, 1024x768)
817 KB PNG
>>
File: 202224CUI_00001_.png (2.4 MB, 1248x1824)
2.4 MB PNG
>>
File: zimage base.png (1.35 MB, 768x1344)
1.35 MB PNG
whats wrong with my zimage base? im using 30 steps and 5 cfg. my workflow works fine with turbo
>>
File: salad girls4.png (924 KB, 1024x768)
924 KB PNG
>>
>>108898098
cfg normalization, negpip, shift scheduling, and 50 steps
>>
>>108898096
Amazing, prompt?
>>
>>108898134
@seapall, @qunqing123, blue theme, painterly, 1boy, short hair, sitting, knee up, floating hair, high grass, lake, close-up, dim lighting, film grain, white birds
>>
File: debo_tm-m_anima1_00015_.png (1.78 MB, 1792x977)
1.78 MB PNG
>>108897868
>>108897904
I feel like leaving a sword in the cell with her is a potential security oversight
>>
>>108898149
thank you for your valuable opinion.
>>
File: 203130CUI_00001_.png (2.22 MB, 1248x1824)
2.22 MB PNG
>>108898134
>>108898146
btw I accidentally used some cartoon lora on that one, this is the prompt without it
>>
File: saw_mHkZWca.png (221 KB, 800x450)
221 KB PNG
>>108898149
Only if that's not part of your evil plan
>>
>>108898131
can you recommend a good simple work flow for text to image? i am used to a1111 and comfy is a nightmare
>>
>>108898149
How do you structure your prompt? Care to share examples?
>>
>>108898189
>Cut off your leg, faggot.
>>
>>108897933
Not that into uh. pol word music.
>>
>>108898215
Fuck off idiot
>>
>>108898131
>>108898098
>>108898192
He's unusual. Most people don't use negpip.

Your broken gens aren't explained by a lack of advanced techniques.

You didn't give a full set of info.

Anyway, you should use the EXAMPLE wf first. Is it busted? If you don't get bit identical gens with the wf (it should be literally identical to the .png image part, it will be obvious, you don't need to do an xor in gimp to check, with diffusion divergence is obvious).
>>
>>108898192
https://comfyanonymous.github.io/ComfyUI_examples/z_image/
replace the model with base, add the other things, change the appropriate params and there you go
>>108898245
desu more people use negpip than shift scheduling
>>
>>108898162
>cartoon lora
Didn't really change the style.
>>
File: debo_tm-m_anima1_00018_.png (2.18 MB, 1792x977)
2.18 MB PNG
>>108898215
this prompt isn't particularly well structured. its just a collection of wildcards
>>
>>108898225
which word?
faggot?
tranny?
nigger?
kike?
dyke?
roach?
sandnigger?
mudslime?
boong?
cracker?
niggerfaggottrannykike?

There should be a new rule on this site:
>you must say kike once in your post to make sure you're not an agent or a bot
It also makes your posts more important for people that are stupid enough to profile people and list them according to the type of hate speech they find niggerlicious.
>>
>>108898263
Why isn't there any AI users that actually can make a scene and use their glorified AI brush (because that's what these are) to make actual fucking art?
Why do they instead spam their garbage without checking and then force the entire art culture to change to the point of crappy art becoming the culture?

Honestly, I thought DeviantArt and furryfaggot artists were bad enough.
>then they proceed to force paywalls and authentication for fucking everything because it's how they control the markets
>>
lmao
>>
File: Juggernaut_Z_V1_00341_.jpg (754 KB, 1248x1824)
754 KB JPG
>>
>>108898308
>then force the entire art culture to change to the point of crappy art becoming the culture?
what do you mean by this

how is zimage forcing crappy art culture
>>
Anima.
>>
>>108898332
It was more general local AI use.
>>
File: ComfyUI_00048_.png (1.27 MB, 832x1216)
1.27 MB PNG
>>108898263
>desu more people use negpip than shift scheduling

I dont know what either of these mean
>>
any tips for not getting blotchy skin with z image turbo? it seems great besides this feature
>>
>>108898347
Vague with no examples. Got it.
>>
File: Juggernaut_Z_V1_00366_.jpg (724 KB, 1824x1248)
724 KB JPG
>>108898355
using lora is the only thing that helped me
>>
File: dedede.png (898 KB, 1672x1522)
898 KB PNG
Is something going wrong with my detailers if this is all that shows up in the preview box?
>>
>>108898357
It's so obvious I shouldn't need any, faggot.
>>
>>108898286
Please share it
>>
File: 2geZR.png (1.21 MB, 1024x768)
1.21 MB PNG
>>
File: ComfyUI_00023_.jpg (1.78 MB, 1536x2688)
1.78 MB JPG
>>108897664
nice prompt
>>
>>
>>108898454
jumbled legs
>>
do you think they will fuck up krea 2 by releasing another cucked version of their model that will be DOA again? i think they've learned.
>>
File: 213912CUI_00001_.png (785 KB, 1024x1536)
785 KB PNG
>>
>>108898500
If it's based on Flux2 then it's cucked to start with and it's intentional.
>>
File: hghf.png (1.24 MB, 1024x768)
1.24 MB PNG
>>108898499
>>
File: debo_tm-m_anima1_00023_.png (2.3 MB, 1792x977)
2.3 MB PNG
>>108898430
https://files.catbox.moe/7oyp2e.png
>>
File: lkhjgf.png (1.17 MB, 1024x768)
1.17 MB PNG
>>
File: Juggernaut_Z_V1_00379_.jpg (816 KB, 1824x1248)
816 KB JPG
>>
>>108898517
cool
>>
File: 214729CUI_00001_.png (814 KB, 1024x1536)
814 KB PNG
absurdres and highres tags are pretty good for cleaning up artifacts
>>
>>108897621
Model anon?
>>
>>108898580
zit
>>
>>
>>108898586
A shitmix? That's very crisp.
>>
File: custom sigma test.jpg (1.78 MB, 3584x3104)
1.78 MB JPG
I have been experimenting with micromanaging sigmas today, and in my opinion there is untapped potential.
I am using a modified version of kijai's custom sigmas node, I removed annoying stuff about interpolating from it. I also added ModelSamplingAuraFlow 1.0 after model load to prevent default 3.0 shift from interfering with the sigmas.
One possible idea that comes to my mind is that by focusing on high sigmas like >0.85 you can set most of the structure of the image in less than a dozen steps. It will be noisy (much less than you think if you micromanage it right) and minor details will differ but this can cut a lot of time for upscale or other i2i workflows where the base image needs to be structurally set but not needed to be "fully cooked".
Regardless I am not a i2i guy so I focused on something else. I was trying to figure out how sigmas click so I tried "how far I can push until it breaks" stuff with sigmas, removing values or changing them to see the effect on the image. This gave me an idea to better approximate higher step count output on lower step counts.
It's still a huge WIP, I've been primarily focused on sampling with euler 1024p (larger sigmas become even more valuable with higher res). It has also been primarily tested on 40 step simple shift 3 cfg 5. I will see how well it generalizes to other distributions. And on Anima because it's fast to test:
> 0.9 keep all (important for structure.)
< 0.9 and > 0.75 drop every other
Between 0.75 and 0.45 keep only ~step count/10 (3-5)
Between 0.45 and > 0.25 drop every other.
Last 3 sigmas are 0.25, 0.13, 0.0
I am doing this because in my opinion the way we approach sigmas is extremely unscientific. Just vibe with different sampling parameters until some scheduler and shift value seems to work good enough with the model/sampler/step combo. Someone smart should be able prove an optimal distribution for given step/resolution/cfg/etc. target but alas I am not that guy so I am doing good old trial and error.
>>
File: megafloods.jpg (50 KB, 499x471)
50 KB JPG
i just want to make cute dickgirls, i dont want to trial and error all these fucking settings in all these custom nodes and have to update comfy every day
>>
>>108898617
1536 1536 res2s beta57
https://civitai.com/models/2093591/dall-e-3-like-girls
>>
File: images.jpg (28 KB, 492x406)
28 KB JPG
>>108898637
>>
>>108898619
Here are values if anyone wants to copy them:
1.0, 0.9915254712104797, 0.9827585816383362, 0.9736842513084412, 0.964285671710968, 0.9545454382896423, 0.944444477558136, 0.9339621663093567, 0.9230769872665405, 0.9117646217346191, 0.8999999761581421, 0.8749999403953552, 0.8478260636329651, 0.8181818127632141, 0.7857143878936768, 0.75, 0.6891892552375793, 0.6176469922065735, 0.5322580933570862, 0.4285714626312256, 0.3461538851261139, 0.25, 0.13636364042758942, 0.0
And before you ask, why not just use same 23 steps with same euler simple shift 3, pic related.
It generates closer results. (More efficient distribution?)
Perhaps I should have added this to initial comparision, oh well.
>>
is using multiple sampler passes and playing with sigmas just autism?
>>
File: asuka.jpg (489 KB, 1536x1024)
489 KB JPG
>>
File: RND_A_002.png (1.12 MB, 832x1216)
1.12 MB PNG
>>
>>108898653
dual sampler with a zit lora is very good, first stage use the lora, and last few steps use base zit to get more details
>>
>>108898653
yes, although there is a benefit to it its not something you can find out by doing it by hand and looking at single examples like that: https://arxiv.org/abs/2311.06845
>>
>>108898653
It might help to think about what the beta distribution is doing mathematically and why it works good

Your first few steps determine image composition the most and are for coarse denoising, while steps toward the end are for fine denoising. You actually don't want to be denoising the same uniform amount all throughout the diffusion process, you should be putting the finishing touches on a already defined image toward the end.

https://arxiv.org/html/2407.12173v1
>>
>>108898675
Did you experiment on sigma thresholds for SDE > ODE switch? The paper doesn't seem to get into details of that.
>>
what % of gens do you keep?
>>
File: 00006-328313067.png (1.33 MB, 1536x1024)
1.33 MB PNG
>>
File: 1778968382318151.png (43 KB, 1120x797)
43 KB PNG
challenge: turn this meme into something cool using your model of choice
>>
>>108897557
I looked at every image in the thread, copied the ones I enjoyed into a folder, chose appropriate relative display sizes, and automatically generated a collage.

It happened to be arguably all 1girl. So what? It's hard to make good 0girl, and there are fewer gens to choose from. Sometimes I don't like any.

I don't lie in a collages or include people just so they feel included. 1girl has been the language of genning since the earliest days, all of those images are participating in a shared discourse. I stand by my choices.
>>
I got an RTX 4060 with 8GB vram, can that handle any of the newer video gen models?
>>
File: Flux2-Klein_00130_.png (109 KB, 336x336)
109 KB PNG
>>108898877
>>
>>108898877
https://github.com/deepbeepmeep/Wan2GP
>>
File: 00009-462630324.png (2.19 MB, 1536x1024)
2.19 MB PNG
>>
File: kay-figure.jpg (1.36 MB, 1477x3059)
1.36 MB JPG
What is the best way to create figurine gens? Model? Lora? Tag?
>>
>>108898497
>>108898587
Is this anima or what?
Can you share the workflow?
>>
>>108898894
search for lora
>>
>>108898877
yea. i have a 3050 and i use wan 2.2 and the q4 k m version of ltx 2.3.
>>
File: Juggernaut_Z_V1_00412_.jpg (758 KB, 1344x1728)
758 KB JPG
>>
>>108898877
>video gen
video is the ultimate midwit fly trap
>>
>>108898913
what the hell?
>>
>>108898913
can you share the prompt so i can see what it does on klein 9b? curious if that thing can do art
>>
>>108898930
use google's ai. It's pretty good.

>Zdzisław Beksiński.
idk, maybe that's what's in the prompt?
>>
>>108898930
It can do art. It knows some styles. It doesn't know artists.
>>
>>108898877
yeah that should be more than enough to load the seedance2 web page
>>
>>108898930
Used Beksinski lora which does the lifting, prompt is just random from scraped list
>by paul klee and hieronim bosch, a long and dreary night of eerie confusion, a sentinel keeps watch, but the devil is approaching, precise,fancy, magnificent, high detail, highres, masterpiece, pastel whimsical excellence
>>
File: Juggernaut_Z_V1_00422_.jpg (661 KB, 1344x1728)
661 KB JPG
It's over for artists
>>
File: a1111.jpg (50 KB, 469x579)
50 KB JPG
>>108898855
this is after deleting the grids folder
>>
>>108899015
oh ok thx
>>
>>108899032
based gachamaxxer
>>
We need to solve hands because the longer I can't gen them the more they become a fixation

Hands touching hands...
>>
>>108899097
"hands touching hands" isn't a normal description.
>>
>>108899107
Baffling reply
>>
File: 1771528508913736.jpg (1.52 MB, 1248x1824)
1.52 MB JPG
>>
>>108899116
I'll try it.
>>
>>108899097
I don't have a solution but it depends on positions and tags, for some the model struggles a lot.
hands on own hips > most likely going to be okay
holding hands, pov > good luck
It is a modern model with decent vae, te and supports high res
At least some should be possible to fix with lora in theory, unlike say SDXL where most of such shit was snakeoil
>>
>>108899097
hands are mostly solved in newer models and will be better and better in the future, its mostly shit loras trained on datasets with fucked up synthetic hands that are the problem

only low iqs care about fingers and other details that dont add anything to the point of the image since good pixelspace edit models will fix all these issues vey easily soon, and even now you can easily inpaint it if you really care
>>
>>108899127
Very nice
>>
>>108898896
https://files.catbox.moe/fnn352.png
>>
>>108899149
Thanks.
>>
can model authors find out if you used their models in ur paypig comms and shit
>>
>>108899141
>>108899142
No no you don't understand, my consective-numbered-friends, I am a veteran prompter, I know the limitations. I'm not a hand-nitpicker, and I know how some models "solve" this problem—they do not solve it to my satisfaction.

I am frustrated because we are at the point where you can *almost* see the thing, just out of reach. Disregard the many excess fingers, something is almost happening here.. An image like this, if it could have succeeded, would have been sensuous in a way we haven't been able to do yet
>>
>>108899206
No model is trained on a hand dataset that's satisfying. It likely only needs a lora, actually, as usual.

The same is true of human faces. They generat *a* face. Or, with a lora or the tags, *the* faces.

But they can't generate faces based on description, not really. You don't have enough word types. A face typology isn't used (but this is an available dataset type)
>>
>>108899195
Possibly. Very unlikely to happen though. (About as unlikely as someone getting decent money doing slop commissions.)
>>
where do you upload your lewdgens? i have a twitter but i dont like it, it rapes the quality sometimes, and now i am locked out of dms because i did not connect it to my cell phone
>>
>>108899195
>can they
yes
>will they
no
>can they prove it in court
no
>>108899229
>where do you upload your lewdgens?
i hoard them to use them in a game one day otherwise i would catbox and post them in ldg
>>
>>108899245
>>can they
>yes
>>will they
>no
>>can they prove it in court
>no
ai is still a ? in the courts imo.
>>
>>108899229
Overwhelming majority I keep to myself. What's the point? Once I ejaculate it fulfills its purpose. I sometimes post on /aco/. Dead, slow and little engagement but eh, you are technically sharing it.
/b/ is just rapid porn spam and too degen. I expect /d/ and /trash/ to be too degenerate too. /h/ has a reasonably fast thread but I don't vibe with the "thread culture" there.
Well you could also izzat farm on civit, but yeah I don't do that neither.
>>
>>108899315
i gave civitai a shot but they locked everything to "pending review" and never made my gens public
>>
>>108897664
Is there an equivalent to pelvic curtain to the breasts?
>>
>>108899225
bruh i see people with patreons making 8k a month
>>
>>108899373
string with curtain covering breasts
>>
what resolutions does anima work with, does it survive x2 high-res passes at like 0.5 deionise as well?
>>
>>108899431
>what resolutions does anima work with,
512-1536p
>does it survive x2 high-res passes at like 0.5 deionise as well?
Not an upscaling guy but most people don't seem to like it for that.
>>
i love ldg
>>
I'm a complete brainlet that's new to this Stable Diffusion stuff and I couldn't find anything about this in the Github or Rentry, but I'm trying to run Forge UI and I found a model to use and after updating the program I get this error:
>


RuntimeError: CUDA error: no kernel image is available for execution on the device CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1. Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.
What can I do to fix this?
>>
>>108899478
Fuck it didn't copy right
>RuntimeError: CUDA error: no kernel image is available for execution on the device CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1. Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions
>>
>>108899478
let me guess, you're on amd
>>
>>108899485
an AMD CPU, yeah. I have a Nvidia graphics card.
>>
>>108899478
>What can I do to fix this?
unironically install linux
>>
>>108899478
Are your drivers up to date?
>>
>>108899027
Fucking kek
>>
>>108899509
Yeah they are
>>108899500
Dangit. In that case I'll just wait to get a Steam Machine
>>
>>
>>108899478
Last forge commit was a year ago. Maybe something finally broke down.
But this sounds like a driver issue.
Also this: >>108899500
>>
>>108899556
Steam Machine will perform absolute ass for AI.
>>108899557
Kino gen.
>>
how do i add a grid output to an alr complex workflow
>>
tried out anima for a bit, going back to wai, thanks for listening.
>>
okay???
>>
>>108899586
Bye. Enjoy your 2.5d plastic jeetmix slop.
>>
I don't threaten violence. However, is it really violence when someone insults our lady Anima?
>>
Telling a chick that the reason my cock looks messed up is because it's not pixel space
>>
>>108899647
did you try upscaling it by 4x?
>>
>>108899653
No cnet so it just looks even more fucked up but she understands
>>
>>108899661
he will never be...
>>
>>108899586
i know you're shitposting for (you)s but i also went back to noob, but like, unironically.
>>
File: ComfyUI_06094e_.png (2.88 MB, 1362x1751)
2.88 MB PNG
>>
whats the best anima model for realism?
>>
>>108899844
https://civitai.red/models/2585622/ultrareal-fine-tune-anima
However the human anatomy in this model is quite shit
>>
>>108898096
first image of a male i have ever downloaded from here. No homo but that is aesthetically stunning
>>
>>108898096
I think this would be a better image without the birds.
>>
>>>/pw/20532294
slop
>>
File: TP_Zelda_Figure_004.png (1.01 MB, 768x1280)
1.01 MB PNG
>>
chances that LTX wont fuck up their seedance 2.0 mass destillation for LTX 3.0 by the end of the year?
>>
File: x.png (158 KB, 896x1152)
158 KB PNG
>>
>>
Why is LTX still worse than Wan2.2?
>>
>>108899158
Danke
>>
>>108900032
I don't know why did you think it will ever be better in any way shape or form.
ltx is the biggest pile of shit out there, the sd1.5 of video models.
>>
>>108900032
When the fuck will we get a new version of wan released? I'm sick of waiting
>>
>>108900092
wan 2.5 2.6 2.7 are all tiny incremental improvements if that, we need something big.
>>
>>108900102
doesn't wan have sound now?
>>
>>108900032
Trash video dataset and captions.
Seriously nobody knows how to caption or prompt it and their docs don't make sense.
>>
>>108900107
ltx has sound too and more things on paper but what matters is end result quality
>>
is it impossible to train loras for zimage turbo? i trained this thing for 3 hours and it cant make a penis. the base z image is kinda working
>>
File: WW_00011_.png (1.22 MB, 1152x896)
1.22 MB PNG
>>
File: 00071-3746505865.png (1.04 MB, 1536x1024)
1.04 MB PNG
>>
>>108899880
>However the 3d trash doesn't conform to 3d garbage opinions
shit take, 3d gooner opnions discarded. do you think the swastika is 3D? It's 2D, like God intended.

>platonic solid

get out of my face you pedo
>>
File: debo_tm-m_anima1_00037_.png (1.99 MB, 1792x977)
1.99 MB PNG
>>
do all of the flux 2 models have the same image editing capabilities? or only klein 9b?
>>
>>108900119
Yes, ZiT is both step and aesthetically distilled meaning it is impossible to train to such a degree that it would start producing passable peanus.
ZiB is neither of those things.
>>
has anyone been able to compile cublas_ops for windows11?
>>
>>108900021
CERAMIC PUSSY
>>
What should I generate?
>>
>>108900209
THANK
YOU

>not 3d
>not perverse
>>
>>108900309
see if you can get a picture of a skinny white woman with blonde hair, who is pretty.
>>
>microsoft/Lens-Base
uh.
>>
>>
>>108898855
Models will improve, and I'll get better at prompting too. Everything I can do now, I will be able to do with much less effort next year
>>
File: debo_tm-m_anima1_00038_.png (2.19 MB, 1792x977)
2.19 MB PNG
>>108900325
glad you like it
>>
So, 0
>>
are there any good z-image tunes that can do convincing nsfw?
>>
>>108899880
For some reason this model is hidden from me in searches but it shows up if I go to the direct link. Real weird.
>>
>>108900392
Big Love Zuna but it gens pretty high contrast images. They have almost SDXL+ feel to them.
>>
is it worth to try and tweak Shift in Anima?

Does it visually do anything?
>>
>>108900412
I hate his shit. It's like he uses the exact same shitty data set on everything he trains and it always creates weird, anorexic, elongated torsos.
>>
>>108900347
also
>Contributors (Alphabetical Order):
Baining Guo, Chong Luo, Dong Chen†, Dongdong Chen, Fangyun Wei†, Ji Li, Jianmin Bao, Jiawei Zhang*, Jinjing Zhao*, Lei Shi, Qinhong Yang, Sirui Zhang*, Xiuyu Wu, Xuelu Feng, Yan Lu, Yanchen Dong, Yang Yue*, Yitong Wang, Yunuo Chen, Zhiyang Liang*, Ziyu Wan†

How is that even legal. I want a country, idgaf about which one, just a country.
>>
Jesus Christ
>>
>>108900426
yea
>>
>>108900431
what's his race
>>
>>108900431
He almost certainly uses the same dataset. It's still the best zimage tune for actual nsfw I know in terms of anatomical knowledge.
>>
>>108900328
Come on
>>
>>108900470
ZiT needs an oriental female fine-tune.
>>
File: ComfyUI_00457_.png (661 KB, 896x1152)
661 KB PNG
>>
>>108900477
Ok, maybe say "kind of pretty"
>>
>>108900572
this is my new favorite outfit
>>
>>108900032
ltx shits on wan
>>
has flux2 klein character lora training been fixed by now?
>>
>>108900750
I've trained only two and they both turned okay so I don't know what the problem was to begin with. I have to use high strength though.
>>
Is there a way to make zit (or any image gen algorithm for that matter) use 16-bit HDR scrgb instead of plain 8-bit sdr srgb?
>>
>>108900869
If you have ever tried color correcting AI images you would know that the color space is baked in.
>>
>>108900748
Elaborate.
Explain why literally everyone I see besides you prefers wan.
>>
>>108892550
post extension .json? looks neat
>>
>>108899097
>>108899206
Just inpaint, anon.
>>
>>108898855
I've updated pytorch, cuda, whatever enough times to know that previous generations are no longer possible
>>
>>108898855
I keep the keepers, 1/10 of gens. Problem with AI stuff is that it's just a joke and offers no real value outside of making your little shitty cum ui workflow. People who spam this shit every day are mentally deranged and have no real life skills.
Oops.
>>
>>108900936
if you don't enjoy it, you should quit. if you enjoy it, it has value.
>>
>>108900947
You sound like a chronic masturbator. Perhaps get a job first and then you'll understand the perspective involved.
>>
>>108900955
what's the perspective?
>>
how to prompt for detailed cat ears on top of a girls head without getting a cat on her head @_@
>>
>>108901016
What model?
>>
>>108900960
What do you mean?
>>
>>108900890
When I used wan it was so much work and the pinkio client thing was so clunky for no reason, tiny menus in different tabs.
This was back when I used A1111, so, that could be why my opinion of it soured. LTX inside comfy is a much better experience.
>>
>>108901018
illustrious
>>
>>108901023
If you enjoy stuff you enjoy it. Not everything needs to bring in money.
>>
>>108901031
You need to use Japanese or Chinese. Kemonomimi works, for example. Or (Nekomimi:1.5).
>>
>>
adding score_x tags and quality tags makes your images more plastic and AI generated looking
>>
https://music.youtube.com/watch?v=t6zhuFAoC6Y&si=9UuNhtOa37Zoamfl
this is a piece of a human soul
>>
>>108901076
no shit
>>
how many pieces can you chip off before you're someone else?
>>
>>108901041
>Nekomimi
thanks this worked fantastic i was getting tired of those little shits
>>
>>108901076
i just wanan coom, i dont care if it looks ai because im not autistic, i care that it looks visually arousing
if i wanted lowq hentai id go browse nhentai
>>
>>108901041
>>108901041
What are some other common silly problems that can be solved easily by using another language? I didn't think of that before.
>>
>>108901085
sounds like jeffrey cantu ledesma in a depressed day
>>
>>108901168
Problem is that "mimi" means ear in Jap but if you google it, you'll get anime degeneration and also fake translation.
>>
>>108901168
kodomo doushi
>>
>>108901219
feds got me

on another note, where can i learn how this works
>>
>>108901235
First step is to stop using workflows from reddit/civitai.
>>
>>108900890
vramlets
>>
File: nvidiaa.mp4 (2.9 MB, 720x720)
2.9 MB
2.9 MB MP4
>nvidia solved vae
>>
>>108901352
https://research.nvidia.com/labs/sil/projects/pid/
>>
>>108901245
this is from here
also, so learners should just throw nodes from the manager into default workflow to see what sticks?
>>
>>108901360
No, you should do the opposite: do not default to 3rd party nodes until you understand what you are doing.
Cum ui is problematic but use the software as they intended, and only then add some useless shit nodes if you have to.
>>
File: 1755723277288254.jpg (240 KB, 1200x900)
240 KB JPG
>>108901359
Fugg this is actually biggg, I kneel
>>
>>108901352
>>108901359
Huge
>>
>>108901352
>>108901359
>no anime girls
>no anime girls, with dicks, naked
Meh. Remind me when there is Anima PiD
>>
>>108901370
so is this the incorrect way to do regional prompting?
>>
>>108901452
You don't need regional prompting and if you do, you would be so advanced that you wouldn't care.
>>
>>108901359
>>108901352
this is decoding and upsampling btw
>>
>>108901457
but i want to generate very 3 different monstergirls in the same image
>>
>>108901452
If you still are concerned about this, I would look at latent mask node...
>>
>>108901462
Just prompt it you stupid motherfucker.
>>
Holy fuck how to solve man-hands with klein?
>>
If face detailed is only showing this as the preview thing, is it working incorrectly?
>>
>>108900371
I love how there's supposed to be a role for humans on a space ship :)
>>
>>108901498
looks fairly normal woman's hand to me.
>>
>Performance: It can decode 512×512 latents into 2048×2048 high-resolution images in under 1 second using consumer GPUs, like the RTX 5090.

heh
>>
>>108901521
ikr. He probably thinks Bridget Macron has a wang, which is legally untrue in many countries.
>>
>>108901521
Look at a larger pool and you'll see that the hands are manly 99% of the time.
>>
>>108901528
who knows who cares.
>>
>>108901546
ikr, but also how brave and we're proud
>>
>>108901545
Are you insecure about something?
>>
Didn't expect klein to be so good at bronze statue.

>>108901556
jUSt BeCAUsE i DiSLikE gAY sTUff IT mAKeS mE GAy
>>
>>108901556
I'm not insecure, I'm fat.
>>
>>108901586
Nothing gay about normal women's hands my friend.
>>
>>108900572
Hawt bidel
>>
I'm so glad I've saved a ton of memes.

>>108901622
Women's hands, no, male hands on women, yes.
>>
>>108901656
lol get a hold of yourself, you're pretending to be straight too hard.
>>
GOOD NEWS EVERYONE
>>
IT'S MONDAY
>>
>>108901677
Yay!!

>>108901682
Fuck off!
>>
>>108901696
That's the good news.
>>
>>108901656
Aside from realistic jak looking extremely uncanny without human proportions, it ruined xir hair.
Poorly dyed sparse balding troon hair is where its soul lies.
Also the discolored post-mortem tongue becoming normal.
Not that it matters too much but also its strange how it hallucinated the tattoos.
>>
repost. genning another one, so hold on, ok.
https://files.catbox.moe/ycyclz.mp3
>>
>>108901500
probably not, is it detailing what you expect?
>>
>>108901736
I remember genning stuff like that year ago. Thank god I've grown up a little.
>>
>>108900032
It's not worse, they both have own strengths and weaknesses.
>>
>>108901799
X

NOT GOD

(demons)
>>
>>108901838
lmao I'm pantheist I just enjoy the cringe stuff aesthetically and musically.
>>
>>108901846
Really? Right there in front of my Septuagint pdf?
>>
>>108901876
sorry don't know what you are talking about.
>>
I was literally dragging it with my mouse. Unbelievable.
>>
>>108901888
checked.

It's the pre-Christian Greek version of the Old Testament.

https://www.logos.com/product/188040/the-lexham-english-septuagint-2nd-ed?ssi=0
>>
>>108901907
interesting stuff as long as you know you're reading old philosophies and fairy tales and don't take it too seriously.
>>
>>108898098
disable sageattention if you're using it
>>
File: failsuba.png (998 KB, 1024x768)
998 KB PNG
>>108901467
NTA, but how exactly do you do this? How do you specify tags for the girl on the left? Anima doesn't do it well with natural prompts:
>Two girls from Konosuba, Aqua wearing Megumin's clothes on the left, Megumin wearing Aqua's clothes on the right
inb4 learn2prompt, that's exactly what I'm trying to do, dammit
>>
cooking up some kinos
>>
>>108901917
You're really a fucking idiot.
>>
>>108901947
how come?
>>
aids, probably.
>>
>>108901661
>>108901720
Just kys, ywnbaw.
>>
File: failsuba2.png (1 MB, 1024x768)
1 MB PNG
>>108901937 (me)
And you can't do it with just tags:
>aqua (konosuba), megumin (cosplay), megumin, aqua (konosuba) (cosplay)
>>
>>108901960
That's funny
>>
like it just knows it's steppemen
>>
has anyone been able to compile --fast cublas_ops for pytorch 2.10+ on cuda13.1?
>>
Impressive.
I should go through all klein loras to see if I can experiment with styles of other mediums with finetuning.
>>
>>108901990
hammer's shadow. hand.
>>
>>108901937
>>108901970
You just can't do this kind of stuff with anima (or sdxl for that matter)
The model just isn't smart enough to separate clothing from character properly.
Your best bet is describing their appearances and clothing details individually instead of "draw character x but like character y".
>>
>>108901990
Where is the original image from?
>>
>>108902021
Range Murata is the artist, been a huge fan of him for a long time.
>>
>>108902017
it was very easy with regional prompting:
>aqua (konosuba), megumin (cosplay)
>BREAK
>megumin, aqua (konosuba) (cosplay)
but I can't use it anymore because I'm on sd.cpp
so >>108901467 is a lying faggot? You can't just prompt?
>>
>>108902021
That's Ranxerox or something like that, been ages since I've seen any comic books.
>>
>>108901952
just minding my own business. stop being such a weirdo.
>>
Does git pull and comfy manager update do the same thing?
>>
File: 1779710282391173.jpg (220 KB, 1472x1104)
220 KB JPG
>>108902100
You can just stitch 2 images together at this point
>>
File: 8092382305802349-5.png (1.73 MB, 1280x720)
1.73 MB PNG
>>
shots fired
>>
File: hugsuba.png (862 KB, 1024x768)
862 KB PNG
This works too, apparently:
An anime artwork of two characters hugging each other.
The character on the left is Aqua with blue hair, blue eyes, large breasts, wearing red robe, witch hat, black panties.
The character on the right is Megumin with brown hair, red eyes, flat chest, wearing blue and white jacket, blue and gold skirt, green ribbon, white sleeves, no panties., aqua (konosuba), megumin (cosplay), megumin, aqua (konosuba) (cosplay), white background, @ebi 193
>>
File: ComfyUI_00465_.png (481 KB, 896x1152)
481 KB PNG
>>108901641
>>
>>108902017
>You just can't do this kind of stuff with anima
why?
rei from neon genesis evangelion, she is wearing a white plugsuit and has messy blue hair. she is hugging goku from dragon ball z.
>>
>>108902278
oops, don't report, I'll delete
>>
File: ComfyUI_26414.jpg (3.54 MB, 1500x1920)
3.54 MB JPG
>>108901352
>>108901359
Neat!
>>
>>108902282
but that's goten
>>
>>108900891
Haven't made it into a browser extension yet, still requires dropping images into a folder and opening a tkinter GUI. That's probably the next step if I want it to be properly usable, but I have a few other ideas I might work on first
>>
File: file.png (2.95 MB, 1248x1824)
2.95 MB PNG
>>108902228
Sometimes it hallucinates the hair, lol.
>>
>>108902305
For me, all the time
>>
>>108899559
forge neo is actively updated
https://github.com/Haoming02/sd-webui-forge-classic
>>
File: baki8.png (943 KB, 768x1024)
943 KB PNG
>>
>>108902324
>worst quality, low quality, score_1, score_2, score_3, blurry, jpeg artifacts, sepia, text, watermark, bad hands, painting \(object\), signature, patreon username, artist name, watermark, text, username, logo, brand name, creator name, handwritten text, web address, copyright name, artist signature, pixiv id, twitter username, tumblr username, deviantart username, text bubble, caption, subtitle, (signature:2), (watermark:2), (text:2), cropped signature, visible text, corner watermark
>>
File: Anima be like.jpg (10 KB, 276x182)
10 KB JPG
>>108902326
cfg: 1
>>
File: file.png (3.07 MB, 1248x1824)
3.07 MB PNG
>>108902310
Are you using the turbo lora?
>>
>>108902340
there's some negpig extension for that
>>
>>108902305
It hallucinated the panties too. This raises a philosophical question: should Aqua, who doesn't wear them, keep doing so in Megumin cosplay, or should Megumin be faithful to Aqua's cosplay and not wear them?
>>
>>108901038
I'm jewish.
>>
>>108902388
Me too.
>>
File: baki11.png (1.02 MB, 1024x768)
1.02 MB PNG
>>108902375
>two characters from different series wear clothes of third characters not present in the image
>>108902324
>>
>>108902388
I don't talk to jews, homosexuals, atheists, uh... you call em whata.... idk. what do you call them fellers what are stupid. and I don't talk to mexicans, or indians.

Pretty much just white straight Christians.
>>
>>108902401
I talk to everyone, even if they are bad people.
>>
>>108902375
ditch natural prompt and get back to tags: char1, char2, char3 (cosplay)
>>
Are the local models still all pony tier shit?
It's been over 2 years now, surely there are better things now, right?
>>
File: 73915331899617.png (2.06 MB, 1152x1600)
2.06 MB PNG
>>
>>108902434
There is Anima but it forgets everything and Cumfart is going to make it a SaaS model very soon.
>>
>>108902420
2 characters wearing same clothes, souryuu asuka langley, megumin, albedo (overlord) (cosplay), white dress
>>
>>108902407
Yeah so that's a really bad idea, unless you're a bad person, then fuck right off.
>>
File: 78690885576671.png (2.22 MB, 1152x1600)
2.22 MB PNG
>>
>>
>>108902445
tard fangers
>>
>>108902456
Tis a free world and nothing bad happens from civil conversation. Besides I did fuck off.
>>
>>108902447
that sounds catastrophic...
>>
>>108902420
>>108902451
>>
THIS IS IT BOYS. THIS IS THE BIG GEN.

THIS IS THE ONE.

I FEEL IT COMING. IT'S COMING!!!!

"For the length of days"
>>
File: askuta and asuna cosplay.png (1.11 MB, 768x1024)
1.11 MB PNG
>Two characters from different series cosplay as characters from two other different series
Anima handles some hard concepts effortlessly, and completely flops things that seem easy
>>
>>108902351
https://github.com/pamparamm/ComfyUI-ppm

pamparamm updated his repo just yesterday
>>
File: api.jpg (596 KB, 2000x1552)
596 KB JPG
>>
>>108902574
>>108902574
>>108902574



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.