[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 1743201255976125.jpg (1.51 MB, 2366x2223)
1.51 MB
1.51 MB JPG
Discussion of Free and Open Source Diffusion Models

Prev: >>107802907

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg
>>
>>
>>107805478
gross catjak inclusion. glad we didn't go with it
>>
>>107801830
54
>>
>>107805478
>posting the slop version of the 2nd last instead of the actual good version
did you make that op? are you the slopper? you sloppy boy
>>
>>107805483
lmfao he never improved after 4 years
>>
File: ComfyUI_temp_penru_00002_.png (2.47 MB, 1120x1400)
2.47 MB
2.47 MB PNG
>>107805407
lets see how chroma does

>>107805465
Yeah its true, I guess, i'll post them, I just don't like giving out my metadata for free, don't you have a chroma version for the retro magazine scans?
>>
File: 1760110890438086.jpg (1.27 MB, 1248x1824)
1.27 MB
1.27 MB JPG
>>
https://old.reddit.com/r/mildlyinteresting/comments/1q70cjx/apparently_i_got_my_son_an_acab_puppy_for/
??
>>
>go to chroma discord to steal images like told to
>its all newbie
lol
>>
File: ComfyUI_temp_penru_00003_.png (2.28 MB, 1120x1400)
2.28 MB
2.28 MB PNG
>>
>>107805503
I was bamboozled
>>
File: ComfyUI_temp_penru_00004_.png (2.16 MB, 1120x1400)
2.16 MB
2.16 MB PNG
>>
>>107805470
thx for the bake anon
>>
File: img_00304_.jpg (919 KB, 1520x1728)
919 KB
919 KB JPG
>>107805465
>great eye anon, just the first chapter
Hard to miss, the style is so timeless
>>
>People who have plastic stuff might be using the base 512 res model like idiots
its posted everywhere that the main huggingface one is the base pretrain to train loras off of, are people actually using that one to gen with and are complaining how it looks at higher res than 512?
>>
>>107805483
bro i don't know who these nanocelebrities are
>>107805488
i knew it was going to trigger someone that's why i picked it
>>
Can you catbox more fucked up chroma shit so i can wan?
>>
>>107805493
i think she might have regressed. the slop this troon is posting is just repulsive
>>
>>107805527
here is ltxv 2 failgen instead
https://files.catbox.moe/k6caj3.mp4
>>
>>107805522
give lora then, in fact give full workflow, even the prompt
>>
>>107805566
last thread, it was posted like 3 times
>>
File: 1764784446485500.jpg (1.76 MB, 1248x1824)
1.76 MB
1.76 MB JPG
>>
File: flux1_0003.png (1.77 MB, 832x1216)
1.77 MB
1.77 MB PNG
>>107805498
just remove the metadata. i only keep in prompts etc because i want people to be able to use the loras correctly. i have a flux version but it's kinda crap. lemme see if it even works on chroma

>>107805520
i have to watch 2000 again soon
>>
File: radiance.jpg (112 KB, 768x1344)
112 KB
112 KB JPG
>>107805498
>lets see how chroma does
dunno, something like this if you don't include too much sexy in the prompt
>>
File: 1738651203473326.jpg (1.67 MB, 1248x1824)
1.67 MB
1.67 MB JPG
>>
>>107805582
>radiance
that is still pretraining and is also 512 res
>>
File: 1739220335994299.jpg (1.48 MB, 1248x1824)
1.48 MB
1.48 MB JPG
>>
qrd on the last 24h?
>>
File: 1737673024506618.jpg (1.38 MB, 1248x1824)
1.38 MB
1.38 MB JPG
>>
>>107805611
z image base maybe soon, ltxv 2 buggy as fuck T2V is great, I2V is fucked by might be fixable with 48 fps and 40 steps
>>
>>107805611
asking base when and finally not having garbage ltx2 spam. honorable mention to some lumina and chroma banter
>>
File: 3457345.png (1.81 MB, 1024x1536)
1.81 MB
1.81 MB PNG
>>
man we really need something to rescue us from comfy and pyshit
all those models are getting bigger, my ram can't handle all this
>>
base is going to be uncensored right
>>
>>107805620
>>107805627
appreciate it
>>
File: zimg_00055.png (1.65 MB, 960x1280)
1.65 MB
1.65 MB PNG
>>
>schizobake
>>
>>107805640
i dont like comfy but you converted me to a comfy believer with your nonstop seethe, congrats
>>
I'm so impressed by this lora. The 2509 version, even the 2511 baked in version couldn't do this much precision.
>>
>>107805688
what makes it a schizobake?
>>
>>107805574
catbox the image asshole
>>
>>
File: ComfyUI_temp_ktrgs_00001_.png (3.16 MB, 1152x1728)
3.16 MB
3.16 MB PNG
>>
>>107805470
Why is AniStudio not in OP?
>>
>>107805711
its great for padding out datasets for sure if you have a front and back image
>>
>>107805711
Does it work on stylized images?
>>
File: ComfyUI_temp_ktrgs_00002_.png (3.23 MB, 1152x1728)
3.23 MB
3.23 MB PNG
>>
>>107805739
He was too busy including random reddit posts in the image collage
>>
>>107805711
that's really good stuff, do you have a link to the lora and list of views?
>>
>>107805711
Thanks for the link anon? Anon?
>>
File: mlady.jpg (292 KB, 1408x840)
292 KB
292 KB JPG
>>
>>
>try chroma with workflow from last thread
>get body horrors half the time
eh...
>>
File: good news.png (44 KB, 1084x362)
44 KB
44 KB PNG
ltx team knows the issues are are planning later versions to fix them
THAT SAID THE 48 FPS FIX WORKS FOR I2V ITS NIGHT AND DAY BETTER!
>>
File: file.png (30 KB, 735x285)
30 KB
30 KB PNG
Where do I find this?
>>
>>107805745
Oh yeah, I didn't think of that, I don't train loras.

>>107805762
Qwen changes the image too much for my taste, I'd only use this for low res so you can fix it via upscaling.

>>107805776
>>107805780
https://huggingface.co/fal/Qwen-Image-Edit-2511-Multiple-Angles-LoRA
>>
>>107805828
Chroma is best with full boomer positive prompts and schizo SD 1.5 negatives, mostly
>>
File: file.png (1.02 MB, 1432x837)
1.02 MB
1.02 MB PNG
why is this picture from reddit in the OP collage?
>>
>>107805886
what i'd like to know is why so many of you reddit faggots come here
>>
>>107805891
i browse both reddit and 4chan and have been for years, deal with it. that's how i noticed we've been bamboozled by that fake gen
>>
cozy bread
>>
>>107805739
unironically ranfaggot does her best to not include it in her schizo bakes
we should add AniStudio because that would make her melt for weeks
>>
AGAIN, PUBLIC SERVICE ANNOUCEMENT.
LTXV TEAM CONFIRMED WHAT I SAID
TEMPORAL COMPRESSION IS FUCKED FOR I2V, USE 48 FPS TO OFFSET IT.

Temporal upscaler kind of helped but not as much and it causes other issues.
>>
>>107805835
you don't need it really. Last time I tried Flan with Chroma it produced slightly worse results than regular T5 XXL for every single gen.
>>
>>107805919
no
>>
File: file.png (742 KB, 998x429)
742 KB
742 KB PNG
only three ltx 2 loras on civit so far
>>
>>107805957
>slop x3
>>
>>107805919
Does the model actually take into account the fps setting or won't the video just be twice as fast and half as long?
>>
>>107805919
>use 48 FPS
but there's like three places to set FPS, do you mean only in the last node?
>>
>>107805891
i only check that place for comfyui and stablediffusion news. there's nothing more effective that can aggregate all the new stuffs the same way
>>
>>107805957
oh wow, the phone camera lora examples. This is gonna trick so many facebook moms / boomers
>>
and if 2.1 actually fixes the audio and I2V.. we are just getting started
>>
>>107805891
I don't like reddit either but it's the only place I know to get diffusion news besides discord, which I know nothing about, and this god-forsaken thread full of retards
>>
File: image (30).jpg (113 KB, 753x938)
113 KB
113 KB JPG
>>107805978
use LTX WF not comfys, and replace their gemma text encoder and disable the prompt enhancer
>>
>>107805995
what about the frying over time?
>>
>>107806070
wtf does that even mean. the max is 20 secs but it starts getting worse after 15 secs. Is that what you mean?
>>
>>107805957
What's the hardware requirement for training ltx2 loras? I think it's civitai's chance to earn a big buck if they allow uses to train on their servers
>>
so did anyone try image gen with ltx2?
>>
What happens if you go for 7-9 seconds in WAN 2.2? No merging etc
>>
Recommended non-lightning steps and cfg for qwen edit 2511?
>>
>>107806091
13GB vram with ramtorch, 32GB vram if int8. It trains VERY fast. Turn down learning rate
>>
>>107806091
very difficult / time consuming on consumer hardware, you need at least H100 or RTX Pro 6000 ideally
>>
>>107806106
stop talking out your ass. You can get decent speed on 13GB, ramtorch is only a small slow down, and you can do it at int8 on a 5090
>>
>>107805911
You really sound like a winner in life anon
>>
>>107806153
why are you false flag replying to yourself troonjak? is it because everyone thinks you are schizo on discord?
>>
>>
File: ComfyUI_00001_.jpg (1.6 MB, 2028x1183)
1.6 MB
1.6 MB JPG
trying out qwen2512
>>
God I hope the next booru model uses updated tags at the very least. Been some welcome changes and additions since Noob's dataset.
>>
>>107806214
>he fell for it
>>
>>107806172
They pity you, little lolcow
>>
>>107806213
>everything still oversaturated and plastic
sigh... they need to make a qwen 2 already using what they learned from z image instead of these finetunes
>>
>>107806219
You know what would be funny. If they used an updated booru dataset except only the new images get the new tags while they leave the old image with the old tags.
>>
I hope the next big booru train excluded all loli and toohus again just to make "that" group seethe
>>
Chroma hands and feet are fucked 9/10...
Ok gens are just cherry picking
>>
>>107806213
>correct reflections
>lots of details
>nothing obviously broken like 6 fingers
cute, especially as anime styles are often terrible in qwen/flux
>>
>>107806322
If it's trained on booru, then loli is likely excluded anyway
>>
>>107806390
Based
t. loli, toddler, child in negatives
>>
>>107806397
I usually put them in the positives
>>
>>107806098
If your prompt is complex then it will loop over seven seconds, meaning if you prompt for the camera to pan or for characters to enter from off screen then the camera will pan back to the starting position or the character will enter and then leave, etc. You can usually do 113 frames without looping, maybe a little more depending on the prompt, it depends, but 113 usually doesn't loop. But if you're doing something that involves a repetitive motion like a woman dancing or a man thrusting his dick into a woman then you can easily do ten seconds or a little more even. Past ten seconds the motion gets less and less dynamic and eventually the video quality starts to degrade.
>>
>>107806397
faggot
>>
File: file.png (1.24 MB, 768x1152)
1.24 MB
1.24 MB PNG
>>107806410
Same
>>
File: file.jpg (164 KB, 848x1488)
164 KB
164 KB JPG
>>107806213
it is very good in terms of prompt adherence
>>
>>107806428
Hey I recognize this
>>
File: 1738075912781350.jpg (523 KB, 2016x1152)
523 KB
523 KB JPG
>>
>>107806450
Post box please
>>
File: radiance.jpg (146 KB, 848x1488)
146 KB
146 KB JPG
>>
File: 1756344605863629.png (2.01 MB, 1152x1312)
2.01 MB
2.01 MB PNG
more 1girl pls
>>
File: 1744143518159731.jpg (1.29 MB, 2016x1152)
1.29 MB
1.29 MB JPG
>>107806466
It's an akihiko yoshida lora i'm tinkering with for zit, catbox won't do anything. I will probably post it once i feel it's good enough.
>>
>>107806489
Mind posting an example prompt then? Thank you.
>>
>107806322
Why does the existence of /adt/ make the schizo seethe so much?
>>
File: 1737974294937891.png (1.75 MB, 1024x1536)
1.75 MB
1.75 MB PNG
>Chroma hands and feet are fucked 9/10...

Thats why oldGODS win, my mind has been trained by months of absolute body horror of sd 1.4/5 that i dont even register the hands anymore even if they are not hidden out of frame or behind the 1girl's (huge breasts) head, or even if they are good... "we can use the adetailer / inpainting for hands later on one day" a tiny voice said in my mind around 2022... and now, its just "we can use nano banana pro edit model at home in a year or two later on anyway"...

The only thing that matters is how an image makes you feel about it, makes you appreciate that one hidden gem idea that you will always be able to notice within that image, even if there is face body horror, a year later you will be able to fix all of the thousands of gens in all of your folders in one pass and enjoy that one gem that is still hidden inside each.
>>
>>107805919
I haven't seen a single ltx2 gen that isn't instantly recognisable as slop.
>>
File: 1742876407520037.jpg (1.17 MB, 2016x1152)
1.17 MB
1.17 MB JPG
>>107806496
Subject1: describe the subject
Subject2: describe the subject
Describe what Subject1 and Subject2 are doing together.

Zit will remember the names so you don't need to copypaste the description of their appearance all over prompt for difficult scenes.
>>
>>107806025
dunno if i trust you
>>
>>107806516
see >>107805891
>>
>>107805771
maybe they got it from here?
>>
>>107806499
they removed some useless frontend from their OP or something
>>
File: 1754492724952813.png (2.39 MB, 1152x1280)
2.39 MB
2.39 MB PNG
>>
>>107806562
cumfart is still there
>>
File: file.png (168 KB, 700x493)
168 KB
168 KB PNG
project ava + comfy acess for nudes whenever she feels like sending it
>>
>>107806363
USE THE 2K RES MODEL NOT THE 512 RES ONE!
>>
>>107806625
which one
>>
Also be careful of the prompt enhancer and this gemma model.
It tends to completely NULL the prompt for anyone still using the node and give you static, strange or garbage gens because once it detects a NSFW word, it completely kills the whole prompt.
>>
>>107806629
go back a thread, the silveroxides one
>>
>>107806643
gemma ablit helps but hopefully someone can get that heretic one working with it
>>
>>107806643
Many times it would replace my prompt with a "/" sign whenever it detected nsfw. total cancellation. This is the source of the static ltxv gens
>>
>>107806643
>>107806661
they spent a month censoring you gooners out, you aren't winning this
>>
>>107806603
What is the benefit of the cumjar as opposed to say using a tablet/laptop/mini pc taped to a large monitor? There is a selection of VRM enabled llm frontends.
>>
Modern hatred for porn is so fucking weird. At least when it came from Christian fundamentalists it made sense. Nowadays, just let me fucking goon, why the fuck do payment processors and some blue-haired landwhales have a fucking say in this?
>>
>>107806672
You misunderstood, the model itself is not censored just the prompt enhancer, just use a different llm for prompt enhancement or dont do it at all, you don't need it
>>
>>107806709
because if the normies can easily goon 24/7 the cattle slaves will focus less on being good cogs at work, waiting instead to go home and goon and living minimally instead of tryharding at work and then consooooming the propagandaslop in their free time instead of being happy alone with an internet connection
>>
>>107806739
isn't the schizo theory the opposite? That they are trying to make everything degen to destroy family values and make men not have sex anymore to lower the population?

Its like you could twist anything at all into anything you wanted to fit your viewpoint
>>
>>107806734
no you minsunderstood, the mode is censord
>>
>>107806557
It's pretty clearly older
>>
>>107806749
Both are true at the same time because the point is the balance the two and the slow boiling of the frog. You want to degenerate family values because if society has strong connections you cant subvert and control it easily. But you also can't go too hard too fast or else the frog will jump out since it will feel the quick shift.

Low IQ's are unable to comperhend any nuance to anything and eternally live in the world of false dialectics. "It's EITHER this 100% OR it HAS to be the exact opposite 100%"
>>
>>107806786
benchod
>>
>>107806786
The other anon is right, you are a schizo
>>
so where base
>>
javascript:quote('107806835');
china
>>
>>107806603
Could be fun if it's engages when it sees you gooning.
>>
>>107806901
why do you think its jar shaped, she needs food
>>
File: 659979369.png (84 KB, 683x471)
84 KB
84 KB PNG
Why this doesnt work with GITS scheduler and ZIT/Qwen 2512? Other schedulers work fine
>>
soooo anyhow

I heard z-img base is in the baking?
>>
>>107806926
It's been scientifically proven that euler/euler ancestral + simple/normal/beta is all you ever need
>>107806938
Yes, in just two weeks we'll get our hands on it
>>
>>107806938
copium
>>
>>107806986
proof?
>>
File: image (31).jpg (120 KB, 1191x898)
120 KB
120 KB JPG
prompt enhancer censorship fixed:
https://files.catbox.moe/7ywnpm.py

Uses llama.cpp server (way faster, can use any model, tho I just use Gemma 3 for now) and support for prefill (which is basically a cheat code against refusals)
>>
>>107807014
no base
>>
My wan2.2 girls have penis...
Negative prompts or Loras vagen do nothing.
>>
>>107807056
based
>>
>>107806986
@grok is that true?
>>
Why did they use gemma-3-12b instead of the smaller models? What difference does it make?
>>
>>107807046
that's a baseless assumption
>>
File: thedick.jpg (73 KB, 642x642)
73 KB
73 KB JPG
>>107807056
>>
>>107807017
there are lots of prompt enchancers, which one is this for
>>
File: proooompt.jpg (344 KB, 807x1055)
344 KB
344 KB JPG
are y'all writing one prompt at a time or blasting them through a local model then refining them en masse?
>>
>>107807102
wildcards
>>
>>107807097
ltxv but you can use it for any model, it literally is just to enhance your prompts
>>
ltx might be ass in my opinion but atleast we aren't in a wan chokehold anymore and they have every intention of staying open source
>>
>>107807017
is there some way to use the qwen clip model to also rewrite the prompt, instead of having to load two different models into vram? they have the same name so i assume they're the same, but i'm probably wrong.
>>
>>107807111
Maybe I'm misunderstanding, I thought you modified a prompt enchancer node pack, is that py file a single node you made standalone or what?
>>
>>107806603
bad idea. comfy already destroys ssds and giving an llm agentic access just sounds like bad news. both softwares phoning home is the icing on the shit cake
>>
>>107807066
>>107807089
Halp.
>>
>>107807130
comfy phones home?
>>
>>107807122
run llama.cpp with whatever model you want, this just connects to it
>>
>>107807130
>comfy already destroys ssds
qrd
>>
>huggingface usually has super quick download speeds
>download a huge file
>speed is suddenly slow as shit
Their servers love fucking with you
>>
>>107807102
>>107807109
this + batch size 4 + 32 queued jobs
>>
File: 09843534.png (1.6 MB, 1024x1536)
1.6 MB
1.6 MB PNG
>>
>>107807109
so you're just making random images or do you refine the wildcard prompts?
>>
>>107807144
you fuck lots with them
>>
Any updates on the anon training on wikiart?
>>
>>107807140
>swap runs out of memory
>uses ssd for allocation automatically
>this rapes the ssd
>>
>>107807102
>>107807154
you guys gennin 1girl are under 25 aren't you
>>
>>107807150
This + comfy oom + queue lost
>>
>>107807159
Yeah
>>
>>107807114
now we just need that same energy for an uncensored image gen model that will stay open source
>>
>>107807168
thankfully doesnt happen with just 4 here
>>
>>107807171
illustrious
>>
File: ComfyUI_00042_.jpg (705 KB, 2816x1536)
705 KB
705 KB JPG
>>
>>107807150
wild, i refine a prompt down then gen maybe 4 images since i know exactly what i'm going for. maybe i should try this out
>>
>>107807193
how is it legal for amd and nvidia ceos to be related?
>>
>>107807204
good dna breeds success
>>
>>107807193
You can't convince me that the AMD CEO isn't just the Nvidia CEO in drag
>>
>>107807217
racist
>>
>>107807193
That is not accurate. No one in the audience cheered. In fact the keynotes were awkwardly silent. They hype no one with that corporate AI shit and their entire economy is circular.
>>
very early / low sample ltxv BJ lora. Can't wait for better audio versions.
https://files.catbox.moe/vkahfc.mp4
>>
>>107807292
I don't know what I expected...
>>
>>107807304
something lame probably
>>
>>107807292
Very nice
>>
>>107806587
how does that relate to my post at all?
the fact that you're a repulsive retard with a vendetta has nothing to do with /adt/
>>
base never ever
>>
>>107807292
come on you asshole we can't watch that, i even tried but i just cant, do a normal one
>>
>>107807292
i am vomit
>>
>>>/wsg/6068160
>>
>>107807292
Sexo.
>>
>>107807292
based
>>
>>107807154
>>>/wsg/6068169
>>
>>107807412
horse fucker
>>
>>107807114
Just remember they're Israelis so take it with a grain of salt
>>
>>107807292
gay
>>
>>107807490
They are our greatest ally, cope
>>
https://nitter.net/ltx_model/status/2009328901559050559
kikes are already dabbing on chinks on twatter
>>
>>107807511
>can se 60% of nipple
prepare your anus
>>
>>107807491
technically straight
>>
>>107807523
the horse is a futa
>>
>>107807529
lol what?
>>
>>107807547
watch zootopia 2
>>
>>107806213
Why is she eating a hamburger with an eyelash curler
>>
File: LTX_2.0_i2v_00068_.webm (1.05 MB, 1088x832)
1.05 MB
1.05 MB WEBM
>>
>>107807608
if sd 1.5 was video
>>
base not in a million years
>>
>>107807638
1.5 was kino, just like wan is. Ltx is like sd2
>>
File: file.png (1.19 MB, 992x1048)
1.19 MB
1.19 MB PNG
>>107807638
>>
File: 1748187062444806.png (156 KB, 414x340)
156 KB
156 KB PNG
>>107807685
>mfw
>>
File: ComfyUI_Image_00007_.jpg (620 KB, 2176x1072)
620 KB
620 KB JPG
https://files.catbox.moe/eeyxnw.png
>>
>>107807698
wtf is this real?
>>
>>107807608
increase fps to 48 or stop complaining about a fixable problem
>>
>>107807709
i din complain, im having fun
>>
File: wan (3).png (3.22 MB, 2304x960)
3.22 MB
3.22 MB PNG
I'll be doing a collage with head-to-head competition between Spark Chroma, WAN, and Z-Image; just need to get that burst of Motivation(tm) to create it

https://files.catbox.moe/6dtk4j.png
>>
>>107807721
Do it benchod
>>
>>107807707
No. We live in a simulation.
>>
>>107807166
Is your penis still functional?
>>
File: spark chroma.png (2.32 MB, 1920x800)
2.32 MB
2.32 MB PNG
>>107807707
yes, this is my waifu
https://files.catbox.moe/xh36zg.png
>>
File: wan.png (2.26 MB, 1824x1248)
2.26 MB
2.26 MB PNG
>>107807728
yes saar i'll do the needful soon(tm)
https://files.catbox.moe/yvl0qa.png
>>
Thoughts on Qwen Image 2512? Is it a significant improvement?
>>
File: gsdgsddsd.png (65 KB, 1090x453)
65 KB
65 KB PNG
Its refreshing to see a company actually try to make architectural improvements instead of just "make it bigger". Also them recognizing the current issues and planning to be fully opensource with more releases coming is promising.
>>
File: zimage.png (2.87 MB, 1456x1072)
2.87 MB
2.87 MB PNG
https://files.catbox.moe/8euk1l.png
>>
>>107807721
Go anon go!
>>
File: zimage (2).png (3.93 MB, 2176x1072)
3.93 MB
3.93 MB PNG
>>107807791
The imgs I'm publishing this thread are selected ones from the montage; I should have the montage ready for next thread

https://files.catbox.moe/mv4b4o.png
>>
File: angry.png (93 KB, 1164x910)
93 KB
93 KB PNG
They angry, point at them and laugh!
>>
File: file.png (10 KB, 401x119)
10 KB
10 KB PNG
>>107807812
saar...
>>
>>107807812
one is based not like the others
>>
>>107807812
>production-ready
>>
>>107807812
>production ready
bruh
>>
>>107807850
you ask that to the CEO you get stuff possibly more looked into / censored
>>
>>107807841
literally the only sane person in that screenshot
>>
>>107807812
>AI automates menial tasks
>WTF AI is taking our joobs!!
>AI automates porn
>WTF why not make it do something useful???
They really do remind me of political goalpost movers
>>
File: 1741943998045432.png (86 KB, 1425x640)
86 KB
86 KB PNG
zzzzzz
>>
>>107805631

https://files.catbox.moe/ag7dwf.mp4
>>
>>107807966
I'm sure that's against terms and conditions
>>
>>107808007
>catbox
You must be new here

https://files.catbox.moe/jivtj0.mp4
>>
>>107808054
No no, he must mean model's ToS.
>>
>>107808054
give lora
>>
File: montage.jpg (780 KB, 3522x1500)
780 KB
780 KB JPG
Once all montages are completed, I'll glue them in together in one pic and share the catbox with all the gens in them
>>
>>107808071

https://civitai.com/models/1648982/wan-nsfw-posing-nude
>>
>>107808153
It's a joke bro are you actually autistic?
>>
>>107808145
>Turbo has a tasteful hint of butt
Noice
>>
>>107807056
if you're using the general NSFW lora, lower the strength until that doesn't happen.
>>
File: zimage.png (3.89 MB, 1072x1840)
3.89 MB
3.89 MB PNG
>>107808165
The chicoms are odd fellas, but I must say they can train a good model
>>
>>107808159
it's for him >>107808097
>>
File: montage.jpg (945 KB, 5293x1156)
945 KB
945 KB JPG
>>
how come sometimes you get audio and a still frame that slides? it works amazing in general but what causes that?
>>
>>107808216
all your results are wrong, your prompt is shit mate
>>
>>107808185
rather: it's fucking weird that various from the west or other places often refuse to use sadpanda/*booru and other nsfw collections

that fellow chinese coomers would use them unless stopped hard is quite obvious
>>
>>107808216
With all due respect Wan sux, possibly due to prompting. Turbo is pretty but RLHF'd to hell and Chroma is aesthetically pleasing as expected. Can't say this is a very sensible comparison when in essence Chroma was RLHF'd to normal human being tastes instead of the generic sterile stock image look. But still a great idea.
>>
File: desenhando.jpg (2.14 MB, 4005x4916)
2.14 MB
2.14 MB JPG
>>107808226
Img-in-img prompt is pretty hard to get right; half the time there wasn't even a painting, the guy was painting the sky of an actual airport

>>107808249
I used JoyCaption to generate the prompts; I selected the imgs and prompts both from the best ones I managed to gen. I'm trying to gen as close as possible as to how a "normal" user would, if that makes sense. It's a very informal thing I started doing, on a whim
This one failed even more spetacularly, not one of them got the helicopters right imo
>>
the man is singing with passion. he says "ooh I love fent, gotta get me that fent", to uptempo dance music.

lmao, using the latest ltx2 distil workflow. what a time to be alive, unlike floyd.

https://files.catbox.moe/iue2wl.mp4
>>
File: zimg_00248.png (1.63 MB, 960x1280)
1.63 MB
1.63 MB PNG
>>107808145
>>107808226
no kidding
>>
>>107808285
*template workflow within comfy, works well
>>
>>107808285
even better floyd song
>anon will complain about floyd and miku
I AM TESTING OKAY?

https://files.catbox.moe/0jehyq.mp4
>>
>>107808289
very cool gen
>>107808237
safetyism and its consequences
>>
>>107808285
>>107808326
>skin instantly turns to blurry slop
>shit audio quality
comfy and these workflows must be fucking up the model. If not why would they release a model like this?
>>
File: zimg_00252.png (1.53 MB, 1280x960)
1.53 MB
1.53 MB PNG
>>107808331
thx i stole the inspo myself
>>
File: montage.jpg (905 KB, 2799x4909)
905 KB
905 KB JPG
>>
File: 1686881806498851.jpg (4 KB, 218x231)
4 KB
4 KB JPG
>>107808285
>>107808326
how can you fags tolerate that ugly face, even for a few seconds. american taste, yikes
>>
>>107808366
Would you please share your workflow? For some reason the ones I'm using keep giving me a sepia tone to the image and/or not working at all, and the only one I'm using that neither freezes nor sepiamaxx uses a very bizarre method to gen the images
>>
adjusted resize images from longer edge node from 1500 to 1024 and now it runs smoother on 16gb vram/64 physical ram.

behold, the LORE of floyd:

https://files.catbox.moe/vao1ap.mp4
>>
Why anon keep testing chroma?
>>
>>107808372
ok ZIT wonned all previous but WAN really wonned this time
>>
File: montage.jpg (865 KB, 2058x2923)
865 KB
865 KB JPG
>>107808372
I agree, it mogged the others pretty hard
>>107808467
It's the only one that does porn out of the box and it can generate male genitalia correctly, without having them be eunuchs
>>
>>107808486
Z looks best overall but details on the gun and hand are wrong
>>
>>107808467
Now that lodestone gave up and jumped ship from his own model, someone is still hoping....bless his heart
>>
Lodestone is the same tier as Anistudio for failed shilled garbage, and I have zero issue nagging you fucks to make a Lodestone rentry and put it in the OP.
>>
File: z-image_00082_.png (1.3 MB, 864x1280)
1.3 MB
1.3 MB PNG
>>107808393
i use this with a good-ass prompt
files.catbox.moe/a4kkdn.png
>>
I think the LTX provided distill fp8 workflow on huggingface might work better than the comfy template one.

https://files.catbox.moe/0zjl8g.mp4
>>
>>107808505
>Now that lodestone gave up and jumped ship from his own model,
Did he stop training that Radiance meme?
>>
>>107808507
>being butt blasted about a model
corny
>>
File: montage.jpg (1018 KB, 3541x4909)
1018 KB
1018 KB JPG
>>107808498
My Z-Image flow upscales the img 4 times from a 68x40 initial image; fast and surprisingly decent, but you sacrifice anatomical details for the speed
>>107808520
Thanks a lot, fren
>>
>>107808543
Turbo is the most coherent one even if you think that the others have better artistic direction.
>>
>>107808505
>jumped ship from his own model
Yeah but before that he had to shill it here for the entire last year. That guy should seriously get an IP range ban at this point.
>>
File: montage.jpg (689 KB, 2080x2947)
689 KB
689 KB JPG
>>107808560
I agree with you
>>
I'm trying to run gay-assed Chroma-2K-QC-fp8mixed-blockwise. It works with silveroxide's node but at every step it stops momentarily and gives me

FP8 dynamic quant failed, falling back to dequant: at 48:16:
b_s_k_blocks = tl.cdiv(K, input_block_size)
b_s_base = b_s_ptr + pid_n * b_s_k_blocks

# Accumulator
accumulator = tl.zeros((BLOCK_SIZE_M, BLOCK_SIZE_N), dtype=tl.float32)

# Main loop
for k_idx in range(k_blocks):
k_start = k_idx * BLOCK_SIZE_K

mask_k = offs_k < K - k_start
a_fp8 = tl.load(a_ptrs, mask=mask_k[None, :], other=0)
^
cannot cast int32[constexpr[128], constexpr[128]] to <['128', '128'], fp8e4nv>
>>
File: montage.jpg (873 KB, 1710x2945)
873 KB
873 KB JPG
>>
File: montage.jpg (705 KB, 1529x2923)
705 KB
705 KB JPG
Generated WAN in horizontal because I'm a dumb fucking whore, sorry
>>
is it just me or is Qwen image edit 2511 awful when it comes to an anime style images? it worked fine on 2509.
>>
>>107808636
Does it load completely into the memory?
Does you run it on 4000 or newer GPU?
Also where did you get that?
>>
File: desenhando.jpg (599 KB, 1446x2947)
599 KB
599 KB JPG
Final one. Now to generate the montage of montages and prepare the catbox with all gens and shit
>>
>>107808719
*Do you run it
Sorry wrote a different sentence first and changed it halfway
>>
>>107808719
https://huggingface.co/silveroxides/Chroma-Misc-Models/tree/main/Chroma-2K-QC

I think it loads into memory. I'm using a 5090
>>
>>107808750
Yes you have the hardware for it. Try this maybe if you are fine with using HD:
https://huggingface.co/silveroxides/Chroma1-HD-fp8-scaled/blob/main/Chroma1-HD-fp8mixed-final.safetensors
Another anon claimed (I guess there is some chance that you are the same guy?) that it worked fine for it. I am yet to test.
>>
>>107808770
Fuck it's past midnight in Europe and I am speaking like a jeet.
Sorry anons, I should go to bed.
>>
>>107808781
Sleep well good sir
>>
File: 81.png (2.2 MB, 1120x1584)
2.2 MB
2.2 MB PNG
I'm so glad zit and qwen are a thing, good quality local images is finally real, maybe there's a future for us after all
>>
>try out kandinsky i2v on an A100 cloud gpu (80gb vram)
>takes nearly 2 hours for a 5 sec video
>output is in slo-mo and barely moving
Damn
>>
File: bitmap.jpg (1.16 MB, 1877x2526)
1.16 MB
1.16 MB JPG
All the gens I made are in this file, enjoy
https://files.catbox.moe/k59ajl.rar
>>
>>107808950
can't you just do a simple portrait of a girl? I'm more interested in realism, skin tone, the ability to generate a unique face, getting the age right
>>
the green character on the right says "hold on, that dumb karen bitch is trying to hit police officers!". then the green character fires their gun several times.

what a model.

https://files.catbox.moe/rk0nj7.mp4
>>
>>107809068
Wtf is this man?
How is anyone coping that this shit isn't broken?
Is it because it can do audio?
Is it because it runs fast?
Or is it simply getting shilled by (((certain people)))?
>>
>>107809068
holy sovl
>>
So onetrainer for training? Anyone got any tips for concept training? Can 24gb vram x2 do it?
>>
>>107809125
I love how everything is expressive, even without prompting specific details.

the green character on the right says "hold on, that dumb karen bitch is trying to hit police officers!". then the green character fires their gun several times, which sounds like a 50 caliber rifle.

https://files.catbox.moe/b2q9lh.mp4
>>
>>107809135
for z-image*
And if I should even bother without base
>>
File: 00015-4027596825.png (3.24 MB, 1536x1536)
3.24 MB
3.24 MB PNG
>>
File: 00017-3566244399.jpg (261 KB, 1536x1536)
261 KB
261 KB JPG
>>
why so dead
>>
>>107801257
Would you please share some more useful prompts for gathering a good data set?
>>
File: 00021-2482767602.png (2.95 MB, 1536x1536)
2.95 MB
2.95 MB PNG
>>
File: 00018-2892329109.png (3.26 MB, 1536x1536)
3.26 MB
3.26 MB PNG
>>
How do I move the queue button back to the left bar on comfy? Where's the stop button??
>>
>>107809151
I had a bad time trying to train a style lora.
Some can make it work but even the best ZIT loras fuck the anatomy and text a bit at least. I would honestly wait.
>>
Whoo hoo. Got sdcpp going.
>>
the green character on the right says "hold on, that dumb karen bitch is trying to hit police officers!" in a cartoonish voice. the pink character on the left says "get that bitch!". the green character on the right fires their gun several times, which sounds like an AK-47 rifle.

KINO. this model can do so much.

https://files.catbox.moe/1opb5i.mp4
>>
put me in the screencap
>>
New Flux2 vae Noob epochs
https://huggingface.co/CabalResearch/NoobAI-Flux2VAE-RectifiedFlow-0.3/tree/main
Not there yet TM but seems more hopeful that this will converge into something worthwhile before Z anime model.
>>
nu thread
>>107809385
>>107809385
>>107809385

>>107809364
should probably repost in the new thread



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.