/g/ - Technology


Thread archived.
You cannot reply anymore.




Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106812600

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: 1747748540838193.mp4 (302 KB, 1280x720)
302 KB
302 KB MP4
Reposting this here, this looks like a very interesting paper >>106815714
https://xcancel.com/du_yilun/status/1975178086325883100#m
https://arxiv.org/pdf/2510.02300
>>
>>106815707
not really
>>
>>106815738
Snake oil newfag
>>
>>106815759
>everything is a snakeoil
I hope not, life would be so boring if you were true :[
>>
/schizohours/
>>
>>106815767
Anon, I...

You might have to sit down for this.
>>
gonna try and glitch kontext
>>
>>106815790
Specifically, the mesoamerican nine lords of the night.
>>
Why does everybody seem to get such great results with qwen? When I use it, it can't recreate the likeness. Sure, it gets the outfit right, the hair color, even the expression, the overall shape. The likeness just isn't there.
Do I just need to keep genning for that lucky seed?
What am I doing wrong?
>>
>>106815866
>What am I doing wrong?
Not loading the full model
>>
File: 1754596240302851.jpg (814 KB, 2584x1119)
814 KB
814 KB JPG
>>106815866
>What am I doing wrong?
nothing, QIE isn't that good
>>
>>106815882
I've got the Q8. Shouldn't it be almost as good as the original?
>>
>>106815904
Retard
>>
>>106815866
Probably the LoRA or not enough steps
>>
>>106815899
Yeah, this is exactly what I'm talking about.
I guess it's only good for cartoons.
>>
new snakeoil just dropped: https://www.reddit.com/r/StableDiffusion/comments/1o09lwm/fsampler_speed_up_your_diffusion_models_by_2060/

welcome back teacache. again.
>>
For wan 2.2, when large motion across the frame comes out distorted (not necessarily blurry) while smaller motion looks fine, what's the main indicator of what needs to be adjusted?
>>
I wanted it to be good so bad
>>
>>106815899
what prooompt?
>>
File: 1758847003425542.jpg (67 KB, 1024x916)
67 KB
67 KB JPG
I've been using automatic1111 for like three years, is comfy really that much better? I tried it out and it doesn't treat my prompts quite the same. It does seem to be more efficient with resources though.
>>
>>106815967
expectations
>>
>>106815982
I kind of hate comfy. It supports a lot of stuff though.

>can you?
comfy generally can. more than anything else ever so far.
>>
>>106815982
No it's not that much better. Everything requires more effort.
It has more support though and a lot of fancy gimmicks.
>>
File: goy_detected.png (872 KB, 856x1216)
872 KB
872 KB PNG
>>106815920
all I can suggest is to maybe break up the edit into steps and see if it helps, but I'm not sure
pic related is how it did Bibi. It's kinda slopped but not unrecognizably so.
Maybe it also struggles with small faces in the image but I truly don't understand how this actually works.
>>
>>106815960
Is it really snake oil when it works?
>>
>>106816060
All these snakeoils look good until you figure out what the horrible catch is.
>>
>>106815982
>it doesn't treat my prompts quite the same
AFAIK it doesn't support token weighting (aka (()) and (breast:1.5)). Despite this people are still using them, by pure cargo cult thinking. This is the only difference with automatic1111: if you take the same seed, model, sampler, and prompt, you should get the same image.

I used automatic and forge before, and I hated the Comfyui node based workflows at first, but the fact is that there are now thousands of custom nodes to handle literally everything under the sun. Which makes Comfyui always the first platform to get XXX and more or less necessary for me.

It also allowed me to create pure abominations of workflows where the guidance was on a loop with itself and a controlnet and other monstrosities, which wouldn't have been possible with more closed type software. I hate the fact that it's in python/web, in dependency hell, takes 10 mins to start up, sometimes simply refuses to start up, breaks randomly on update, and is a resource hog, but the nodes stuff really grew on me.

I heard SwarmUI is good too if you don't want to fiddle with workflows.
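
For anyone unsure what the weighting syntax above actually encodes: `(text:1.5)` attaches a numeric weight to a span of the prompt. ComfyUI's README does describe a `(text:1.2)` emphasis syntax, so any mismatch between UIs is more likely in how the weights get applied than in whether they are parsed. Below is a toy sketch of splitting a prompt into (text, weight) pairs. It is not the parser from either UI; `parse_weights` is a made-up name, and it only handles the explicit `(text:weight)` form with no nesting.

```python
import re

# Matches the explicit "(some text:1.5)" form only; nested parentheses are not handled.
_WEIGHTED = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")

def parse_weights(prompt):
    """Split a prompt into (text, weight) pairs; unweighted text gets weight 1.0."""
    parts = []
    pos = 0
    for m in _WEIGHTED.finditer(prompt):
        # Plain text between the previous match and this one.
        before = prompt[pos:m.start()].strip(" ,")
        if before:
            parts.append((before, 1.0))
        parts.append((m.group(1).strip(), float(m.group(2))))
        pos = m.end()
    # Whatever is left after the last weighted span.
    tail = prompt[pos:].strip(" ,")
    if tail:
        parts.append((tail, 1.0))
    return parts
```

Feeding the same pairs through two different UIs and comparing the outputs on a fixed seed is the quickest way to see whether they treat the weights the same.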
>>
>>106816031
You mean like a face inpainting step?
I guess that's worth a shot.
I mean your pic looks like it has the likeness.
>>
If only comfyui wasn't such a pain in the ass to set up.
>>
File: 00146-2586795257.png (2.24 MB, 1248x1824)
2.24 MB
2.24 MB PNG
>>106815982
my nigga just switch to using reforge2 and wan2gp.
>>
>>106816219
and having a keylogger for """telemetry purposes"""
>>
>>106815738
In layman's terms?
>>
>elated to have a gpu with more vram
>even more elated to go back to my history of over a year of genning and remake everything with higher clarity and a proper desktop resolution

>mfw realizing i have to organize a broad spectrum of categories across literal tens of thousands of gens
fuck. there has to be a better alternative to Diffusion Toolkit this thing is niggerliciously buggy.
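
If the job is mostly bucketing old gens by what made them, the metadata embedded in the PNGs may be enough to script it. A rough sketch using only the standard library: as far as I know, A1111-style UIs write the settings into a PNG `tEXt` chunk named `parameters`, and ComfyUI writes `prompt` and `workflow` chunks, but treat those key names as assumptions and check your own files. `read_png_text_chunks` and `guess_source` are made-up helper names, and compressed or international text chunks (`zTXt`, `iTXt`) are ignored here.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"

def read_png_text_chunks(data):
    """Return {keyword: text} for every tEXt chunk found in raw PNG bytes."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    chunks = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        # Each chunk: 4-byte big-endian length, 4-byte type, body, 4-byte CRC.
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        if ctype == b"tEXt" and b"\x00" in body:
            key, _, text = body.partition(b"\x00")
            chunks[key.decode("latin-1")] = text.decode("latin-1")
        if ctype == b"IEND":
            break
        pos += 8 + length + 4  # header + body + CRC (CRC is not verified here)
    return chunks

def guess_source(chunks):
    """Very rough bucket name based on which metadata keys are present (assumed key names)."""
    if "parameters" in chunks:
        return "a1111-style"
    if "prompt" in chunks or "workflow" in chunks:
        return "comfyui-style"
    return "unknown"
```

From there it is a matter of walking the folder, calling `read_png_text_chunks(open(path, "rb").read())`, and moving files by whatever field you care about (model name, LoRA, keyword in the prompt).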
>>
>>106816241
Download my shit
>>
Any use of running a video enhancer node on the high noise model?
>>
File: 84^|+|.png (1.81 MB, 1024x1024)
1.81 MB
1.81 MB PNG
kontext has SOVL.
>>
tell me about the namefag, why does he pick his scabs?

>>106816316
if i picked that scab off would you die?
>>
>>106816244
it's apparently spamware
>>
>>106816322
I had to do it

bruh
>>
>>106816241
>In layman's terms?
the quality is better during training and it's faster to make the image (you need less steps)
>>
File: 00152-1577203312.png (2.41 MB, 1248x1824)
2.41 MB
2.41 MB PNG
>>
>>106816326
>>106816332
you're a pretty scabby guy
>>
>>106816234
You're kidding of course
>>
>>
>>106816363
does she win? I couldnt keep up with the mango
>>
>>106816322
It would be extremely delicious
>>
File: WanVideo2_2_I2V_00617.webm (1.5 MB, 704x1280)
1.5 MB
1.5 MB WEBM
>>
File: 00157-2694143227.jpg (161 KB, 1824x1248)
161 KB
161 KB JPG
>>106816394
i only watched the anime plus the ova. Heard the ending was shit so i didn't bother reading the manga.
>>
>>106816463
*sniff*
>>
>>106816168
> it doesn't support token weighting (aka (()) and (breast:1.5))
you could just test by yourself
>>
>>106816463
>>106816480
*SNIIIIIFF*
>>
> scatPICKLER
>>
>>106816316
Try changing bruh to bruv. I feel the image is right for it.
>>
>>
File: ComfyUI_02553_.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
>>
File: ComfyUI_02557_.png (1.57 MB, 1024x1024)
1.57 MB
1.57 MB PNG
>>
File: baby_detected.jpg (329 KB, 856x1216)
329 KB
329 KB JPG
>>106815967
Do you use interpolation? Because that can cause distortion with movement.

>>106816031
Fyi, you can try adding film grain (node) to the image in an attempt to reduce the slopped skin look.
>picrel
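
For anons without the node: the basic idea of film grain is just per-pixel noise, which presumably helps because it breaks up the overly smooth gradients that read as "slopped" skin. A minimal sketch under that assumption, not the implementation of any particular ComfyUI node. `add_film_grain` is a made-up name and it works on a flat list of 0-255 channel values so it needs no image library.

```python
import random

def add_film_grain(pixels, strength=8.0, seed=None):
    """Add Gaussian noise to a flat list of 0-255 channel values and clamp the result."""
    rng = random.Random(seed)  # seeded so the same grain can be reproduced
    out = []
    for v in pixels:
        noisy = v + rng.gauss(0.0, strength)
        out.append(max(0, min(255, int(round(noisy)))))
    return out
```

`strength` is the standard deviation of the noise in pixel units; real grain filters usually also scale the noise by luminance and blur it slightly, which this sketch skips.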
>>
File: 00175-1352841745.png (2.15 MB, 1824x1248)
2.15 MB
2.15 MB PNG
>>106816495
can't believe its been 10 years since i watched this anime back in senior year of high school. time flies.
>>
>time is running out
it sure is.
>>
File: ComfyUI_02580_.png (1.51 MB, 1024x1024)
1.51 MB
1.51 MB PNG
>>
File: ComfyUI_02583_.png (1.8 MB, 1024x1024)
1.8 MB
1.8 MB PNG
>>
File: 00180-1954112427.png (2.43 MB, 1824x1248)
2.43 MB
2.43 MB PNG
>>
>>106812995
Creating the "posing sheet" from a real photo with Qwen Edit?

It's a deal I guess
>>
File: 00184-3497891073.png (2.4 MB, 1248x1824)
2.4 MB
2.4 MB PNG
>>
File: ComfyUI_02608_.png (1.63 MB, 1024x1024)
1.63 MB
1.63 MB PNG
>>
File: QwenImg_00028_.png (2.28 MB, 1152x1440)
2.28 MB
2.28 MB PNG
suck fummer
>>
File: ComfyUI_02613_.png (1.21 MB, 1024x1024)
1.21 MB
1.21 MB PNG
>>
File: ComfyUI_02615_.png (1.49 MB, 1024x1024)
1.49 MB
1.49 MB PNG
>>
>>
>>106816804
catbox pls?
>>
File: ComfyUI_02619_.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>106816830
https://files.catbox.moe/pq35kk.png
>>
>>106816827
how many more same images are in the queue?
>>
>>106816841
queue is empty right now. wanna see any particular anime gal pointing at you and laughing?
>>
>>106816844
nah, bro, i'm good, i think i've seen enough
>>
File: ComfyUI_02633_.png (1.73 MB, 1024x1024)
1.73 MB
1.73 MB PNG
>>
File: 00190-2145660996.png (2.4 MB, 1248x1824)
2.4 MB
2.4 MB PNG
>>
File: ComfyUI_02640_.png (1.5 MB, 1024x1024)
1.5 MB
1.5 MB PNG
BRAAAAAP
>>
>>106816852
>ive seen enough
impossible
>>
is there a illustrious/noob model that's capable of realistic amateur cosplay photos of anime characters with working illustrious loras (like real realistic, not this slopped 2.9d look)
>>
Is the new neta yume available and is there a good example workflow provided by the creator or do I just do it from scratch?
Going to take a crack at it
>>
File: input1.png (123 KB, 293x265)
123 KB
123 KB PNG
Playing with Kontext. the input, genned by SOVLful sd1.4
>>
File: output.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>106816907
>>
>>106816886
v3 is on civitai/HF
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0/tree/main
the HF repo has a basic example workflow

v4 TEST is closed access:
https://huggingface.co/duongve/NetaYume_test_version

I got access after 3~ hours
>>
File: 00198-4008304862.png (2.5 MB, 1248x1824)
2.5 MB
2.5 MB PNG
>>106816878
https://civitai.com/models/1045588/pornmaster-pro-illustrious-and-noob
>>
>>106816941
Thanks, I don't get closed access for a beta but I'll test v3 and see what I can do. Does it include a high res pass or do I need to do the noodle humiliation ritual and set it up myself?
>>
>>106816953
no hires, but im not using it, im natively genning at the resolution you see (res also used in the example workflow)
>>
>>106816959
Cool thanks
>>
is 4/8step lora on lumina models possible?
>>
File: 00203-1541715990.png (2.57 MB, 1248x1824)
2.57 MB
2.57 MB PNG
>>
What would be the best driver version for a RTX 3060?
>>
File: ComfyUI_02694_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>
>>106816789
Very cool
>>
>>106816789
Not cool
>>
File: ComfyUI_02702_.png (1.64 MB, 1024x1024)
1.64 MB
1.64 MB PNG
>>106817050
>>106817054
And everything was in balance
>>
>>106817062
Did she jeeted?
>>
File: ComfyUI_02721_.png (1.39 MB, 1024x1024)
1.39 MB
1.39 MB PNG
>>106817085
>Did she jeeted?
F
>>
It's almost 2026, how come there's still not a model to img2img turn anime into photoreal pictures that are actually consistent and faithful recreations of the reference? I see people posting Qwen Edit pics that supposedly do that but I've never managed to recreate it, even using the lora. My attempts at most produce a slightly more semi-realistic version, and barely that. Send help
>>
File: downloadffd.png (2.06 MB, 1080x1920)
2.06 MB
2.06 MB PNG
>>106817159
>>
File: clean bruh.png (2.01 MB, 1024x1024)
2.01 MB
2.01 MB PNG
>>
File: 1639431584000.jpg (16 KB, 348x312)
16 KB
16 KB JPG
>>106817159
>It's almost 2026
thanks for the reminder, dick.
>>
>>106817204
Why does it keep doing the tictac speech bubbles lol? It even has a shadow.
>>
>>106817196
>that flux chin
>>
I don't see many people talking about Lumina compared to Flux and Qwen, what is Lumina's deal? It seems to really punch above its weight for having only 2B parameters, but then again maybe not seeing as the gens that people do post look really slopped.
>>
IM GONNA DO IT
>>
>>106816943
her three rows of teeth look so cute
>>
>>106817241
OK IT DIDNT WORK
>>
>>106817215
no idea, this is me trying to change the style. I think the hands are really creepy.
>>
>>106817265
512x512 is the minimum for flux iirc
>>
>>106817228
>but then again maybe not seeing as the gens that people do post look really slopped.
Heavily underbaked because the guys who trained it ran out of budget not even midway in
Might have replaced noob/illust if they had enough money, but we will never know. Someone is doing a finetune of their finetune, but it ain't doing much
>>
File: WanVideo2_2_I2V_00631.webm (1.5 MB, 1280x704)
1.5 MB
1.5 MB WEBM
>>
>>106817223
>That flux chin
It really is a tragedy.
Just imagine what might have been.
Speaking of. I really should convert one of those Sam Altman "sneed" (very popular on /wsg/ right now) videos to a gif for use as a reaction image.
>>
>>106817265
Ofc. Anything multiplied by 0 is still 0
>>
File: 00223-3951280407.png (2.54 MB, 1824x1248)
2.54 MB
2.54 MB PNG
>>
File: kek3.mp4 (346 KB, 192x960)
346 KB
346 KB MP4
>>106817265
You can do noodle gens but it has limits.
>>
File: 00237-3493631236.png (2.48 MB, 1248x1824)
2.48 MB
2.48 MB PNG
>>
>>106817196
Why don't you tell the fucking method? If I get home and this png doesn't have the workflow I'll kick you in the nuts.
>>
>>106817336
desu funny
>>
>>106817336
also I like how everyone is static unless prompted
>>
File: TensorArt_00043_.png (1.57 MB, 1024x1344)
1.57 MB
1.57 MB PNG
My honest thoughts,
Text is way less accurate than flux/chroma and while I see the vision I won't bother deep testing this until this lands on some forge fork properly. I have better things to do with my time than play around with nodes to do a basic checkbox on other UI.
I do see the vision and I'm sure a second pass will improve it greatly. I did the 5 batch test and it failed with text still in a waaaaaaaaaaaaaaaaaay better spot than current chroma
I also had the same problems using the example prompt
>>
File: TensorArt_00005_.png (1.51 MB, 1024x1536)
1.51 MB
1.51 MB PNG
>>
is sage attention working with qwen yet?
>>
File: 1744710925282689.png (1.6 MB, 1152x896)
1.6 MB
1.6 MB PNG
>>
forge is just too good. i'd invite forge out to a seedy hotel and make wild fuck with her while comfyui watches from the cuck seat as i knock her up
>>
File: ComfyUI_02734_.png (1.44 MB, 1024x1024)
1.44 MB
1.44 MB PNG
>>
File: ComfyUI_02754_.png (1.56 MB, 1024x1024)
1.56 MB
1.56 MB PNG
>>
is this sdg? why so many images?
>>
File: ComfyUI_02760_.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>106817665
I'm testing Chroma1-HD-Flash. No avatars in sight. You feeling okay anon?
>>
>>106817503
here is your answer: >>106817665
only avatarfags are allowed to post here
yes: sad
>>
This is why we can't have good things.
>>
>>106817691
>sad
lay off the estradiol
>>
what an unironically dead shithole
>>
File: ComfyUI_02765_.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>
>>106817716
The retard woke up so he's going to do his projection bit again, we both should just ignore him. I noticed the drunk decided to bother EU and they ignore him probably in a similar fashion to the people in his actual life.
>>
>>106817740
all I can hear is "im gay im gay im gay im gay"
>>
Just remember the rentry, he tried so hard to remove it, from mass reporting to splitting threads to seething non stop for months. Ignoring him is a victory because he fought so hard to stop it.
>>
>he's trying to shit up the blessed thread again
>>
File: radiance.png (2.57 MB, 848x1488)
2.57 MB
2.57 MB PNG
>>
File: radiance.png (2.95 MB, 848x1488)
2.95 MB
2.95 MB PNG
>>
i miss ani
>>
File: radiance.png (2.51 MB, 848x1488)
2.51 MB
2.51 MB PNG
>>
File: screenshot.1759847944.jpg (201 KB, 1080x863)
201 KB
201 KB JPG
>>106817791
ldg will never be as shit as sdg
>>
File: radiance.png (3.01 MB, 848x1488)
3.01 MB
3.01 MB PNG
>>
>>106817830
Let's not trigger them please, he's part of that small circle of loser schizos and we just had a meltdown yesterday. I appreciate you posting it but they are extra sensitive as of late.
>>
File: radiance.png (2.96 MB, 848x1488)
2.96 MB
2.96 MB PNG
>>
I'm a winner. I'm studying [ANCIENT GREEK].
>>
File: radiance.png (2.25 MB, 848x1488)
2.25 MB
2.25 MB PNG
>>
very stinky bread
>>
File: file.png (2.75 MB, 848x1488)
2.75 MB
2.75 MB PNG
>>106817813
check some threads back, i think he reported yesterday about work going into anistudio. some decisions about how to build package libs for linux and sdcpp and such
>>
File: radiance.png (3.05 MB, 848x1488)
3.05 MB
3.05 MB PNG
>>
File: radiance.png (2.96 MB, 848x1488)
2.96 MB
2.96 MB PNG
>>
File: radiance.png (3.31 MB, 848x1488)
3.31 MB
3.31 MB PNG
>>
File: radiance.png (2.54 MB, 848x1488)
2.54 MB
2.54 MB PNG
>>
File: radiance.png (2.77 MB, 848x1488)
2.77 MB
2.77 MB PNG
>>
>>106817893
Nobody cares about your vibe coding sperg that ERPs with other men on discord while posting bondage shota
>>
File: radiance.png (2.73 MB, 848x1488)
2.73 MB
2.73 MB PNG
>>
That's a lot of images anon
>>
i think we should invite debo for one thread for once
he's lonely :(
>>
>>106817969
judging by the terrible SD1 tier quality, im going to assume they're some cretin that crawled out of /sdg/.
>>
File: radiance.png (3.04 MB, 848x1488)
3.04 MB
3.04 MB PNG
>>106817964
some anon asked. i personally don't disapprove in the least of someone trying to make open sauce tools.

comfy and others were there when voldy ended making webui. it's good.
>>
>>106817969
he's trying to show you scrubs what an actual good gen looks like
>>
File: jules.png (939 KB, 1024x1024)
939 KB
939 KB PNG
everytime I check these threads I get hit with a wave of mental illness
>>
>>106817964
>Nobody cares
quite a few anons care
>>
>>106818007
>"anons"
>>
>>106817740
sounds like you are very concerned
you are thinking about this subject every single day
>>
>>106818010
yes anonies
>>
>>106817987
He's not welcome here after the shit he pulled he should keep making his own thread. He found a way to piss anons off on every timezone by being a absolute faggot and I'm not talking about his boy lust either.
>>
>>106815960
Can anyone confirm this gives 20~30% speed boost with minimal quality loss?
>>
>>106814841
wtf is in the back
>>
i hate julien for what he has done in the past
>>
File: crane.png (2.73 MB, 1024x1024)
2.73 MB
2.73 MB PNG
Sometimes failed gens are kind of nice.
>>
>im a ranphile
>>
ani is the sovl of /ldg/ really wish he'd post more updates here desu
>>
>>106818069
Tried it with wan2.2 at 20 steps (10 high + 10 low + 5 cfg). I had accidentally left the light loras in on the 1st try and results were pretty damn good. So I took out the loras and it just slopped out. I tried like 10 different tests, different steps, cfgs, it kept giving me very sloppy results. There must be some settings I'm missing. As for speed boost, yes it is like a couple minutes faster than regular 20 steps.
>>
>>106815960
>welcome back teacache. again.
It's always that way. There's no free lunch.

The only tolerable degrading is quants, and only because it's literally necessary to get the gb down.
>>
We have sdxl/flux/chroma/qwen for images, wan for videos, but the real tragedy is there is nothing worthy for audio - Sora 2 is just AUs ahead of anything available locally, they are sd0.5 level of quality or even worse. On top of that there is no lora training for them.
Not a Sora 2 ad btw.
>>
>>106818156
mmaudio works good enough for nsfw audio for me. just wish I knew how to make loras for it.
>>
File: crane2.png (2.07 MB, 1024x1024)
2.07 MB
2.07 MB PNG
>>106818079
>>
>>106818211
> good enough for nsfw audio for me
Any example? Because I tried and it's dogshit for me, both in sound and sync.
>>
>>106818002
we have a new place for sane people my friend, come join us >>>/wsg/5989007
>>
>>106818211
>mmaudio works good enough for nsfw audio for me
like pretty shit but good enough for your purposes? is my opinion of mmaudio

>>106818156
seeing a wan gen is like looking at an sdxl gen after seeing what saas models can do
>>
>>106818290
I'm sorry you only have 12gb of vram, I bet if you put your mind to it you could suck enough dicks to afford a 5080 super by the time it comes out.
>>
>>106818303
>ran is always thinking about male genitalia
>>
>>106818303
truth hurts i know but have some pride in being a localcuck
>>
>>106818290
> seeing a wan gen is like looking at an sdxl gen after seeing what saas models can do
Yes, and you can work with SDXL at least. But compared to SaaS, MMAudio is at the level of what came before SD.
>>
>He's seething again
Time to leave the thread because that hurts him more
>>
youre here forever
>>
bros I need a feet detailer
>>
>Maintain Thread Quality https://rentry.org/debo
>>
>>106818341
you can always try the /ldg/ discord. we're having a shota erp party this friday, don't miss out!
>>
>>106818362
bruh
>>
>>
>>106818360
check your DMs
>>
so what happens now
>>
>>106818077
>>>/gif/29585416

I'm trying to unfuck smoothmix.
>>
wtf i slept on neta yume but it's pretty good, way better adherence and multicharacter pose, though my biggest complaint is that "official styles" that follow a character's franchise artstyle are a bitch to prompt and often wrong compared to illustrious
>>
File: file.png (2.51 MB, 848x1488)
2.51 MB
2.51 MB PNG
>>106818156
for sound effects matched to video yes, but it really doesn't sound like one of the best TTS. multiple are smoother than that.
>>
>>106818591
Is it faster than chroma?
>>
>>106818615
NTA but I can't judge that until I see how it behaves with a high res pass, but so far it's faster and less step hungry compared to chroma
>>
>>106818615
dunno, on 4060 i get 55s per 832x1216 at 30 steps compared to ~15s on SDXL, there's nothing like DMD2 to make it low step sadly
>>
>>106818591
>"wow guyz, this [obscure model no one talks about] is soo guud"
>*never provides any examples*
why??
>>
>>106818763
It's not really obscure for those in the know, which I take it you are not.
>>
File: ComfyUI_temp_txrit_00004_.png (3.59 MB, 1400x1800)
3.59 MB
3.59 MB PNG
>>
File: ComfyUI_temp_txrit_00006_.png (3.52 MB, 1400x1800)
3.52 MB
3.52 MB PNG
>>
File: ChromaPainterly_00019_.jpg (881 KB, 1176x1792)
881 KB
881 KB JPG
>>
>>106819273
now I understand why we call him agent 47, his forehead measures 47 centimeters kek
>>
File: radiance.png (2.59 MB, 848x1488)
2.59 MB
2.59 MB PNG
>>
File: ChromaPainterly_00025_.jpg (931 KB, 1176x1792)
931 KB
931 KB JPG
>>106819364
big brain
>>
File: radiance.png (2.63 MB, 848x1488)
2.63 MB
2.63 MB PNG
>>
>>106819553
this looks worse than SD1.4 lawl
>>
File: ChromaPainterly_00028_.jpg (857 KB, 1139x1709)
857 KB
857 KB JPG
https://www.youtube.com/watch?v=m9M9YeLDnfI
>>
stay mindfoo of the tier list
>>
File: radiance.png (3.06 MB, 848x1488)
3.06 MB
3.06 MB PNG
>>106819571
the detailed patterns on the clothes and other things are pretty obviously better than some general purpose sd1.4
>>
File: radiance.png (2.72 MB, 848x1488)
2.72 MB
2.72 MB PNG
>>
File: ComfyUI_02805_.png (1.66 MB, 1024x1024)
1.66 MB
1.66 MB PNG
>A crystal-clear mountain lake reflects snowcapped peaks and a sky painted pink and orange at dusk. Wildflowers in vibrant colors bloom at the shoreline, creating a scene of serenity and untouched beauty.
>>
Do any models actually know pepe? I only get generic frogs when i try to prompt him.
>>
>>106819693
wow that's bad. it made me raff around
>>
>>106819693
>>106819709
I'll try Chroma HD Flash.

I have a slow card, so it won't happen like very quickly or anything.
>>
>>106819623
It's an ok image, but where is her dog?
>>
>>106819709
a couple anons were posting kino pepes made with noobai awhile ago
>>
>>106815982

comfy is too autistic for me, use forge or reforge
>>
File: qwenEditFail.png (851 KB, 2847x1618)
851 KB
851 KB PNG
Am I doing something wrong here?
>>
File: wow.png (2.26 MB, 1024x1024)
2.26 MB
2.26 MB PNG
>>
>>106819801
try to be more precise by saying "image 1" and "image 2"
>>
>>106819805
Yeah, that's where qwen is better than kontext.
>>
>>106819693
>creating a scene of serenity and untouched beauty
still can't believe we let LLMs write useless descriptions like this
>>
this namefag is really shitting the place up, huh
>>
>>106819853
There is no namefag here.

https://en.wiktionary.org/wiki/namefag
>>
>>106819853
hes also shitting up /lmg/. i dont understand this mental illness
>>
>>106819882
He has been posting on multiple threads just pay him no mind or filter him.
>>
File: ComfyUI_02486_.png (987 KB, 768x1344)
987 KB
987 KB PNG
>>
>>106819888
Yeah, but he's
>>106819882
not using a name, so I can't block him - no ids either.
>>
>>106819968
don't mind them bro fuck the namefag haters
>>
>>106820031
IMPOSTER! POLICE!
>>
>>106819968
You implied I was fucking a tranny because I actually get pussy in another thread. I will not hold a grudge but I will turn a blind eye when the cleaners come for you.
>>
>>106820043
I've never had a woman flirt with me, but a tranny did once.
>>
>>106820043
>You implied I was fucking a tranny because I actually get pussy
sounds like that really bothered you anon so maybe there's some truth to it
>>
>>106816623
oh that does look much better
>>
>>106819755
>noobai
why would something trained on danbooru and e621 be good at pepes?
>>
File: 1738476653182456.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>106819953
show a 3d breakdown of the scene, screenshots of a 3d modeling program
>>
>>106820123
this is what ani should be aiming for. tired of flip flopping apps
>>
>>106820123
KRITA GOD I KNVVL
>>
>>106820123
Come back when it can do actual 3d
>>
File: 1730309474724105.jpg (358 KB, 625x619)
358 KB
358 KB JPG
I love seeing what these models generate in the parts of the image you dont specify
>>
File: 1729892730276243.jpg (593 KB, 878x807)
593 KB
593 KB JPG
So have people already written something to OCR output images with text and drop the gens that aren't accurate?
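
Not aware of a ready-made one, but the filtering part is simple enough to sketch. The idea: run OCR on each gen, fuzzy-compare the result against the text that was prompted for, and keep only the close matches. Everything here is a hypothetical sketch. `text_matches` and `filter_gens` are made-up names, the 0.8 threshold is an arbitrary starting point, and the OCR engine is left as a pluggable callable (`ocr`) rather than assuming any specific library.

```python
from difflib import SequenceMatcher

def _normalize(s):
    """Lowercase and collapse whitespace so OCR spacing noise matters less."""
    return " ".join(s.lower().split())

def text_matches(expected, ocr_text, threshold=0.8):
    """True if the OCR'd text is close enough to the text that was prompted for."""
    ratio = SequenceMatcher(None, _normalize(expected), _normalize(ocr_text)).ratio()
    return ratio >= threshold

def filter_gens(paths, expected, ocr, threshold=0.8):
    """Keep only the image paths whose OCR output matches the expected text.

    `ocr` is any callable mapping a path to a string; which OCR engine to
    plug in is left open.
    """
    return [p for p in paths if text_matches(expected, ocr(p), threshold)]
```

A fuzzy ratio rather than exact equality is deliberate, since OCR itself misreads stylized fonts; tune the threshold against a handful of gens you have checked by eye.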
>>
>>106820123
which UI?
>>
File: 1742393665932850.png (3.59 MB, 1416x2120)
3.59 MB
3.59 MB PNG
>>106820097
you ever search for pepe in either of those websites? heres some from anon
https://desuarchive.org/g/thread/105888667/#q105894461
https://desuarchive.org/g/thread/105888667/#q105894802
https://desuarchive.org/g/thread/105895325/#q105895642
https://desuarchive.org/g/thread/105895325/#q105897432
desu i cant think of another """base""" model that knows him without a lora
>>
File: 1741079271354304.png (753 KB, 1186x968)
753 KB
753 KB PNG
uh oh... localsissies, how do we cope?
>>
>>106820246
it's so over not even china slop can save us
>>
*yawn*
>>
*brap*
>>
File: 1737384282856159.png (896 KB, 1088x1079)
896 KB
896 KB PNG
So how long till models with Sora 2 quality will be running local? 2 more years? Never ever?
>>
>>106820313
i'm hoping next year but that's probably wishful thinking
>>
A bit /lmg/ but does anyone here use something like a small LLM that can be run locally to enhance their prompts? I am asking for flux/chroma t5 prompting in particular. Grok 4 seems to do an okay job, but I don't think I want to send all of my prompts to someone else's servers.
>>
So how long till large cocks will be in my ass? 2 more years? Never ever?
>>
File: bobina_2.webm (3.9 MB, 1024x1016)
3.9 MB
3.9 MB WEBM
I like sexualizing market forces.
>>
>>106820246
we don't have to cope with sora's censorship, that's one of the points of local
>>
File: 1738095692447971.mp4 (781 KB, 640x640)
781 KB
781 KB MP4
Can any local models do this level of animation or is it all SaaS?
>>
>random anon makes a joke about debo not being able to walk
>trolling begins
I don't think anyone believes he has a physical disability just a mental one.
>>
>>106820411
this much is covered no problem by wan/hyvid and maybe even framepack or such
>>
File: 1752451616662629.png (1.29 MB, 1360x760)
1.29 MB
1.29 MB PNG
the anime girl is holding a sign saying "waiting for WAN 2.5". keep her expression the same.

new qwen edit 8 step lora works pretty good. before this, the qwen image 2.0 lora was better than the v1 qwen edit lora.
>>
>>106820313
My guess is something like two gpu gens, so 6 years.
>>
File: ComfyUI_00124_.mp4 (1.11 MB, 1024x1024)
1.11 MB
1.11 MB MP4
>>106820411
That looks awful. Here's a wan version. I told it to have him bring the mug up to his mouth and take a sip but I think the existence of the straw is preventing that because it's nonsensical.
>>
>>106820516
neat, how long did the gen take? I'll have to setup wan today
>>
>>106820516
3 minutes on a rtx 6000 pro.
>>
File: 1755308807942261.png (1.29 MB, 1360x760)
1.29 MB
1.29 MB PNG
>>106820449
works good with Q8 qwen edit 2509 too

how much better is Q8 vs fp8 scaled, in general?
>>
File: 1737956002033711.png (951 KB, 1104x944)
951 KB
951 KB PNG
the girl in image1 is shaking hands with the man in image2. keep the man's expression the same.

Q8 seems pretty good, only slightly longer gen time and a few seconds for higher quality is a fair tradeoff
>>
File: 1739275542561425.png (788 KB, 1176x888)
788 KB
788 KB PNG
>>106820616
the girl in image1 is shaking hands with the man in image2. keep the man's expression the same. keep the girl's expression the same.
>>
>>106820411
the neat thing about local models is that you can turn that straw into a penis. YOUR penis if you like.
Well? what are you waiting for? These workflows aren't going to install themselves. You've got dick videos to generate.
>>
>>106820246
I guess this is why people become religious, when there's no hope of faggots like sama being punished by anything real you can only fantasize of a higher power striking them down.
>>
File: 1750200651510256.png (2.56 MB, 1716x1216)
2.56 MB
2.56 MB PNG
Change the text "We will" to "LDG General". Change the text "Kessoku band tour 2024" to "LDG general tour 2025". Replace the blonde anime girl with Hatsune Miku.

yeah, Q8 is more consistent and giving better results.
>>
File: 1751779863333434.png (674 KB, 1112x936)
674 KB
674 KB PNG
replace the black man on the left with Hatsune Miku wearing a black trenchcoat.

text is less blurry than fp8 test of this too, pretty good
>>
>schizo slop testing hours
see you in 4 hours!
>>
>>106820792
you dont even have a GPU to gen, nogen
>>
>>106820692
that's just a cope though, if he doesn't get punished now he won't be punished ever, there's no afterlife
>>
>>106820809
but thoughts and prayers...
>>
>>106820616
>>106820799
Real talk, what's your deal? Is this your hobby? Running unnecessary tests with Miku and the Deus Ex guy FOR WEEKS EVERY DAY shaking hands, eating burgers, changing hats?
Come on dude, I'd rather be a GPUlet and a genlet than have a 1k GPU just to throw my free time in the garbage
>>
File: ComfyUI_02809_.png (1.69 MB, 1024x1024)
1.69 MB
1.69 MB PNG
What's wrong with anon's tests? I learn a lot without having to do stuff
>>
>>106820637
(silently counting the fingers)
>>
>>106820809
Obviously. Now see what op asked.
This is literally a bubble, given the speed they're pumping it up at. But with never-before-seen clarity, we know how OAI gets money from nothing for nothing, how they break all the intellectual property laws that would normally draw fines on the scale of billions and don't get even a slap on the wrist. When it's that clear it makes my butt hurt.
>>
Are there any workflows that can allow me to drop in a 3DPD video and convert it to something more tasteful?
>>
>>106820823
so if you don't gen, why visit the thread about gens
>>
File: 1755457709549395.png (820 KB, 1248x832)
>>
File: 1756952512367714.png (2.9 MB, 1827x1149)
Prompt: "make the woman face the viewer"
Model: Qwen-Image-Edit-2509-Q8_0.gguf
LORA: Qwen-Image-Lightning-8steps-V1.0.safetensors
8 steps, 1.0 cfg, euler, beta
42s, RTX 4080 16GB
>>
>>106820889
It would be reasonable for the government to force them to stop calling it "open ai", because it's false advertising.
>>
>>106820841
>>106820873
Come on, let's drop the act. Don't you find it weird that a human being would do this every single day for weeks? Plus if you look at previous days the pattern is the same

[prompt]
[comment and sloppy stupid empty conclusion]

and no, I don't learn anything from this because I'm never going to use these situations in my workflow. If I actually care about my generations, I want maximum control, so I'll probably do inpainting manually where I can control the process and blending. Only a slopper with no aesthetic sense would rely on an automatic editing model.
>>
>>106820901
meds
>>
>>106820901
new models/loras come out. I test them and post results. new anons may try the new models.
>>
>>106820889
>censored slop
that's literally local, we can only make plastic humans and miku
>>
File: 1745652742043698.png (731 KB, 1248x832)
>>106820998
>paying for censorship and prompts
>>
File: EIGHTY BILLION.png (742 KB, 640x822)
>>106821004
>paying 10000 dollars for a gpu for this
>>
>>106821029
no one with a brain thinks hunyuan 3.0 is worth it

wan 2.2 can run on like 8-12gb even, yet is infinitely better.
>>
File: 1746405627147787.png (3.21 MB, 1765x1185)
Prompt: 'create an in-store advertisement display for the "robo gf" product in image1, on sale for $99.99'
Model: Qwen-Image-Edit-2509-Q8_0.gguf
LORA: Qwen-Image-Edit-2509-Lightning-8steps-V1.0-fp32.safetensors
8 steps, 1.0 cfg, euler, beta
47s, RTX 4080 16GB
>>
>>106821029
You pay $500-1000 so she'll spread her legs at cafe ooh la la
>>
File: 1755596055306689.png (829 KB, 1248x832)
>>
>>106821066
>switch: censor sora 2
but he's bringing IP back I guess? >>106820246
>>
>>106821046
are you using the comfy 2509 workflow? I have a 4080 too and I'm getting 30-35s gen times, but I'm using the simple scheduler, so maybe that's the difference
>>
File: 1737516248847296.png (866 KB, 1248x832)
>>106821095
but that's still pretty fast, given what the model is capable of it's worth it.
>>
>>106821066
That would have been good though. Sadly the reality is much much worse. He steals, he hypes, everyone pretends he didn't steal and agrees to make deals to let the original theft slide. Happened with text, then images, now video. It's a perfect grift since he convinced the government only this way will allow it to """win""" something. And the rich get richer.
>>
File: 1750830724151382.png (2.91 MB, 1538x1365)
Prompt: "make the woman and the monster on the ceiling stand side-by-side with their arms around each other's shoulders, smiling happily"
euler simple
41s

>>106821095
yeah i'm using the comfy example, switching to simple shaved off a few seconds
>>
>>106821156
nice

yeah q8 edit is really nice, they say Q8 is comparable to fp16 so it's the most efficient option in general.
>>
File: 1756566619532533.png (824 KB, 1248x832)
the man is standing in front of a large sign saying "ClosedAI". He is looking at a large TV screen with Hatsune Miku holding a sign "fuck you Altman".

you tell him!
>>
>>106821132
>>106821205
>made tens of images seething about Sam
damn, he's not living rent free in your head, he's literally having a party in it, holy sheet
>>
>>106821230
it's just for fun, and because there is no censorship I can do it. that's why open source is better.
>charging money for prompts
>>
File: 1731506958109662.png (524 KB, 1136x912)
the green cartoon frog is sitting at a computer with a CRT monitor and is typing. The text "LDG" is on the monitor.
>>
File: 1732604618795560.png (841 KB, 1136x912)
>>106821251
the green cartoon frog is sitting on a chair at the beach holding a tropical drink, and a bag of chips with the text "SIPS" on the bag.
>>
File: 1755330292737620.png (3.45 MB, 1672x1254)
Prompt: "make the woman wear a white button-up shirt and a pencil skirt, make her sitting on an office desk, keep the pose the same"

It kept the fucked up chroma fingers too
>>
change the headline "police take adoptive son (15) into custody" to "Illegal immigrant kills someone again". Replace the man in the blue coat with a monkey.

lmao
>>
File: 1737662301733099.png (1 MB, 1056x984)
>>106821366
helps if I add the image.
>>
>>106821366
/int/ subject matter expert here
monkey means brazilian, who are not a common crime committing refugee in europe
t. /int/
>>
> qwen_image_edit_2509_bf16.safetensors
> Huggingface, 40GB ETA 5 fucking days.

Wtf? Is there somewhere else to download? My internet isn't that slow. Something is fucked up.
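For scale, a quick back-of-envelope check of what that ETA implies. This is only a sketch: it assumes decimal gigabytes (40 GB = 40e9 bytes), and the 100 Mbit/s link is a made-up comparison figure, not anyone's measured speed. The function names are hypothetical.

```python
# Back-of-envelope transfer math. Assumes decimal units (1 GB = 1e9 bytes).

def implied_speed(size_bytes: float, eta_seconds: float) -> float:
    """Average speed (bytes/s) needed to move size_bytes in eta_seconds."""
    return size_bytes / eta_seconds


def eta_seconds(size_bytes: float, speed_bytes_per_s: float) -> float:
    """Time (seconds) to move size_bytes at a steady speed."""
    return size_bytes / speed_bytes_per_s


SIZE = 40e9                 # the 40GB bf16 file from the post
FIVE_DAYS = 5 * 24 * 3600   # the reported ETA, in seconds

speed = implied_speed(SIZE, FIVE_DAYS)
print(f"implied speed: {speed / 1e3:.1f} kB/s")

# Hypothetical comparison: a steady 100 Mbit/s link is 12.5e6 bytes/s.
print(f"at 100 Mbit/s: {eta_seconds(SIZE, 12.5e6) / 60:.0f} minutes")
```

A five-day ETA on 40GB works out to under 100 kB/s on average, so if the connection is normally much faster than that, the bottleneck is probably somewhere other than the line itself.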
>>
>>106821406
>>
>>106819622
these benchmarks were made by retards. 7900 XTX >>> 9070 XT
>>
>>106821407
i had claude code write a dl script for me yesterday and it only took a few hours
are you using huggingface-cli?
>>
>>106821454
Git bash curl.
>>
>>106821470
they might throttle downloads that aren't using their client or something shady like that.

huggingface-cli download Comfy-Org/Qwen-Image-Edit_ComfyUI split_files/diffusion_models/qwen_image_edit_2509_fp8_e4m3fn.safetensors

It's the fp8 but that runs at full speed for me even behind mullvad.
what are you running in curl? I want to see if it's throttled here too
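For comparing the two paths, here is a rough sketch of the direct URL that corresponds to the huggingface-cli command above, plus a resumable curl invocation for it. Assumptions: the file lives on the `main` revision, and the helper name `hf_resolve_url` is made up for illustration. Whether curl actually gets throttled is not something this shows; it only gives both sides the same file to test against.

```python
# Sketch: build the direct-download URL for a file in a Hugging Face repo
# and print a resumable curl command for it. Stdlib only, no network access.
import shlex
from urllib.parse import quote


def hf_resolve_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Return the huggingface.co 'resolve' URL for one file in a repo.

    revision="main" is an assumption; pass a branch, tag, or commit if needed.
    """
    return (
        f"https://huggingface.co/{repo_id}/resolve/"
        f"{quote(revision, safe='')}/{quote(filename)}"
    )


# Repo and path taken from the huggingface-cli command in the post.
url = hf_resolve_url(
    "Comfy-Org/Qwen-Image-Edit_ComfyUI",
    "split_files/diffusion_models/qwen_image_edit_2509_fp8_e4m3fn.safetensors",
)

# -L follows redirects, -C - resumes a partial download, -O keeps the remote name.
cmd = shlex.join(["curl", "-L", "-C", "-", "-O", url])
print(cmd)
```

Running the printed command and the huggingface-cli one back to back on the same connection should show whether the client makes any difference.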
>>
>>106821342
did you gen the pudgy one on the left? if so, what model can do pudge?
>>
>>106820901
>If I actually care about my generations I just manually craft them
very cool.... any other tips? If you go into Photoshop you can get even finer control over every pixel with crazy tools, and the shit is instant
>>
>>106821594
tips, I have some, but you better be fucking respectful, as I am a genius, and a person of considerable note.



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.