/g/ - Technology


Thread archived.
You cannot reply anymore.




Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106784371

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
Cursed thread of Schizophrenia
>>
File: 00002-3349445212.png (1.42 MB, 1024x1280)
>>
Blessed thread of frenship
>>
>spend all this time trying to set up Ovi
>realize the model barely fits on my card
KIJAI!!!!!!
>>
File: 00005-3158338820.png (2.4 MB, 1024x1280)
>>
>>106787696
it's dogshit genuinely do not bother
>>
how much does torch compile degrade quality?
>>
That wan 2.2 smoothmix is excellent at nsfw.
>>
>>106787739
stop shilling your slopmix
>>
guys, what's the best txt2img model for realistic amateur 1girl gooning?
>>
File: 1758082816635826.mp4 (3.21 MB, 816x640)
>>106787629
you can try kijai's workflow. it's worked better for me than native nodes for flf2v
https://files.catbox.moe/zft83o.json
>>
>>106787754
chroma
>>
>>106787754
various realistic sdxl finetunes or chroma base.

depending on what you do also wan with lora as an image model
>>
File: hearts2.png (2.63 MB, 1296x1728)
>>
how are people training loras with a 1 image dataset?
>>
If I want to edit an image to give a guy fat fucking honkers, what do I use?
>>
File: 1756220687394372.png (1.74 MB, 1440x1088)
guess I should just sit around with my thumbs up my ass
>>
>>106787883
poorly
>>
>>106787925
enjoy wasting your time
>>
guys i think we need to support ani more
she is building a revolutionary ui from the ground up, just for us
and i think as a community she deserves our help
i think she is right here: >>106784731
we just need to bully comfy devs enough to make them switch to improving sd.cpp, that's the least we can do for her
>>
>>106787892
A pencil
>>
File: 00009-3763538821.png (1.89 MB, 1024x1280)
>>
>>106787977
Sadly I am a retard that can't draw a straight line with a ruler and need to be tool-assisted to have that.
>>
File: ComfyUI_0070.png (2.47 MB, 1296x1728)
>>
What's the best way to save metadata to an image in comfyui so that I can extract the workflow later? Something similar to the PNG Info tab in automatic1111
>>
>>106788053
metadata is already saved dumbo
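
For reference, a minimal sketch of pulling that embedded metadata back out of a saved PNG with only the Python standard library. It assumes the stock save node writes the graph as uncompressed PNG text (tEXt) chunks under the keys `prompt` and `workflow`; the helper name `read_png_text` is made up for illustration.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text(data: bytes) -> dict:
    """Return the tEXt chunks of a PNG as a {keyword: text} dict."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    chunks = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        # Chunk layout: 4-byte big-endian length, 4-byte type, payload, 4-byte CRC.
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        payload = data[pos + 8:pos + 8 + length]
        if ctype == b"tEXt":
            # tEXt payload is "keyword\0text", both Latin-1 encoded.
            keyword, _, text = payload.partition(b"\x00")
            chunks[keyword.decode("latin-1")] = text.decode("latin-1")
        elif ctype == b"IEND":
            break
        pos += 12 + length  # 8 header bytes + payload + 4 CRC bytes
    return chunks
```

Reading a file's bytes, passing them to `read_png_text`, and feeding the `workflow` value to `json.loads` should give back the graph, if the assumption about the keys holds. Dragging the PNG onto the ComfyUI canvas loads the same workflow without any code.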
>>
File: 00011-1751455610.png (1.68 MB, 1024x1280)
>>
>>106787950
i saw some on civit and they come out fine. so i assume they mean 1 image but flipped and turned a bunch of times
>>
>Wan2.2-T2V-A14B-4steps-lora-250928 at higher res

I get blurry gens at 1280x720 resolution while 720x480 works perfectly fine. Admittedly, I have a couple of other loras as well.

Am I supposed to double the steps if I double the resolution?
>>
>>106788146
>Am I supposed to double the steps if I double the resolution?
no. are you using 2.1 480p loras or something?
>>
File: white_dress_1.webm (3.91 MB, 880x1176)
>>106788040
>>
>>106788203
The way she moves her shoulders looks like an exaggerated live2d animation.
>>
>>106788188
All parts are of wan2.2 (high/low noise etc)
Only VAE is 2.1
>>
why do people complain about local? We got every tool we need, shit is so cash.
>>
>>106788272
mental illness
>>
>>106788272
Avatarfags from /sdg/ troll this general because they became irrelevant.
>>
Ever since I started playing with wan 2.2, I've neglected my daily tests of new loras..
>>
>>106788203
what loras? aside from your custom one
>>
File: mickey_bath.webm (3.88 MB, 912x1144)
>>106788289
Just the tentacle one.

>>106787978
>>
>>106788287
>>106788272

BASED
>>
>>106787811
>>106787795
any idea which sdxl one is good? i'm not waiting 3 minutes for a gen on chroma
>>
>>106788300
poor miggey. I hear he couldn't afford that GPU that he really wanted.
>>
>>106788203
Excellent
>>
File: 00130-23228276_ayakon.jpg (329 KB, 1632x2208)
>>
File: L.O.V.E.png (1.94 MB, 1024x1024)
>>106787771
that is incredibly horrifying.
>>
File: 1579182181958.png (91 KB, 240x262)
Had a thought.
The AI/diffusion trend has only been going on for a couple of years. Imagine what type of shit we can do in like 10
>>
>>106788300
It's really weird how we just accept that yeah totally you can just type to your computer and it will make an animated movie for you.
>>
File: chroma 50... is special.png (2.39 MB, 1024x1024)
>>106788376
Chroma 50. ahhhhhhhh
>>
>>106788389
Me and your mom. Just imagine in a few years when ur obsolete.
>>
>these midwit namefags shitting up the thread with their stupid retarded opinions
>>
File: pek7.png (72 KB, 247x248)
>>106788410
She's dead, so whatever
>>
>>106788389
>this isnt the first thing he thought about the moment he saw dall-e 3 mini demo online and every single time any new tech or optimization drops
ngmi
>>
File: 1753446986482358.png (208 KB, 1810x1160)
>Looking forward to the Chinese open source ai that will give a middle finger to all these companies.
ledditors are so blissfully ignorant :(
>>
Do the new models do hook noses yet? I've been a bit afk for a few months.

Also, am I being really dumb planning to get an Arc B60 (24gb of vram).
>>
>>106788415
>he still hasn't filtered tripfags with 4chanX
ngmi
>>
>>106788389
we're only on year 3 lmao
>>
>>106788432
I prefer calling them out, gotta keep these autists in check
>>
>>106788419
Always found the AI gen video tacky..
..
until I started deepfaking my friends, now I am embracing it like an atheist embracing gods grace or some shit.

>>106788433
Exactly.
>>
>>106788415
I'm not a namefag, I just typed something in the name field. You can type the same thing.

btw I compiled stable-diffusion.cpp for rocm. It does alright. It's slower than Comfyui. But, somehow, it's more comfy (but definitely not ui).
>>
>>106788430
>Also, am I being really dumb planning to get an Arc B60 (24gb of vram).
yes
>>
>>106788441
>I want to feed the trolls
that's what they want, you are falling to their traps and you're happy about that
>>
>>106788432
I don't have a tripcode

>>106788416
a dead is fine too
>>
>im not a namefag but I filled the name field
below room temp iq
>>
21 seconds per iteration is a bit slower than my comfyui wf was, with Chroma.

I remember it was supposed to get fixed so it doesn't need cfg > 1. Did that happen?
>>
>>106788430
>Also, am I being really dumb planning to get an Arc B60 (24gb of vram).
Aren't the intel gpus only good for LLMs since you can chain them?
>>
>>106788461
(he lives in hell)
>>
>>106788473
>21 seconds per iteration is a bit slower than my comfyui wf was, with Chroma.
What's your gpu and workflow? Do you have fast fp16 enabled?
>>
File: 00014-3541703162.png (1.45 MB, 1280x1024)
>>
Nunchaku for chroma yet?
>>
>>106788499
>copechaku
>>
>GGUF is so good bro!!!
>nunchaku? heh.... c... c-copechaku
when will they figure out the narrative
>>
File: 1733664958022995.png (37 KB, 811x294)
>>106788461
I was talking about this, if you only let "Anonymous" talk you filter out a lot of main character syndrome schizos
>>
>>106788512
I gen on a 4gb laptop and so I like GGUF
>>
Until you realize there is a small group of retards still bitter about the split the easier it is to ignore the daily bullshit they do. There will always be some monster of the week because these people are scorned that you don't want to post GM to them and call them by name.
The api trolling died so expect some other random time wasting faggotry
>>
>>106788512
nunchaku is good
gwen lora support soon fellow nunchakuers
>>
>>106788512
>nunchakek isn't Q4 quality I swear!!
prove it
>>
>>106788517
20 minutes per gen?
>>
>>106788524
there's a storied catalogue you can sift through if you'd like to check the archives
>>
>>106788524
I have qwen edit Q8, FP8 and nunchaku. no fp/bf16 cause im poor. Give a me a prompt and ill test
>>
>>106788537
>no fp/bf16 cause im poor.
so it's useless, the goal of Quant comparisons is to see which one is the closest to fp16, if you can't make that image you can't see anything
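
A rough sketch of what such a comparison measures, assuming every quant was run with the same seed, prompt and settings and each output was decoded to an equally sized flat list of 0-255 pixel values. The helpers `mean_abs_error` and `rank_quants` and the toy pixel data are made up for illustration; none of it is a real measurement.

```python
def mean_abs_error(reference, candidate):
    """Average per-pixel absolute difference between two flat pixel lists."""
    if not reference or len(reference) != len(candidate):
        raise ValueError("images must be non-empty and the same size")
    return sum(abs(r - c) for r, c in zip(reference, candidate)) / len(reference)


def rank_quants(reference, candidates):
    """Sort (name, error) pairs from closest to farthest from the reference."""
    scores = {name: mean_abs_error(reference, px) for name, px in candidates.items()}
    return sorted(scores.items(), key=lambda item: item[1])


# Toy 4-pixel "images" (hypothetical values); real use would load full gens.
fp16_reference = [10, 200, 30, 128]
outputs = {
    "quant_a": [11, 199, 30, 127],
    "quant_b": [14, 190, 35, 120],
}
for name, err in rank_quants(fp16_reference, outputs):
    print(f"{name}: mean abs error {err:.2f}")
```

Without the fp16 output there is nothing to pass as the reference, which is the point being made: swapping in Q8 as the reference only ranks the others against Q8.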
>>
uh oh i smell a goalpost mover
>>
>>106788534
nah, no one has compared nunchaku with Q8 and fp16 to see how far off it is relative to Q8, it didn't happen yet
>>
>>106788533
Depends on the model, but sometimes in that ballpark.
>>
>>106788547
you're right, but I would do the test under the assumption that Q8 is the highest quality, so we could at least see if nunchaku is better than fp8.
I've even read claims that nunchaku is better than Q8 but I think that's pure cope. I had no way to test this sadly.
>>
File: ComfyUI_01874_.png (1.62 MB, 1024x1024)
>Prompt executed in 13.45 seconds
Chroma1-HD-Flash ddim/simple 12 steps
>>
>>106788562
XLOP
>>
>>106788562
how do people like this garbage again?
>>
File: ComfyUI_01876_.png (1.49 MB, 1024x1024)
>>106788566
>>106788567
Nice gen. You beat me
>>
>>106788567
>how do people like this garbage again?
they don't, it's just lodestone shilling his shit because he still can't accept he spent 150k on a failed project lol (I won't be too disrespectful towards him though, he did his best to save local)
>>
>>106788562
I like the pink/purple reflections
>>
>>106788581
It's literally producing 480p quality images, do these retards think jpeg artifacts/jagged edges = WOW PURE SOVL QUALITY?
>>
File: 00016-107905163.png (1.22 MB, 1280x1024)
>>
>>106788594
It's called "aesthetic".
>>
>>106788562
>Chroma1-HD-Flash
stop using this shit
>>
File: ComfyUI_01881_.png (1.77 MB, 1024x1024)
>>106788581
>lodestone
Not him or debo

>>106788584
>I like the pink/purple reflections
Same

>>106788666
>Chroma1-HD-Flash
>stop using this shit
I do what I want
>>
>>106788476
>b60 vs b580
>24gb vs 12gb
>Both use the Battlemage BMG-G21 GPU die with 20 Xe-cores and 160 XMX AI engines.

"Procyon Stable Diffusion XL (FP16)" is 100 steps at 1024 x 1024.
>>
File: file.png (3 KB, 271x84)
>>106788605
me when i'm fish
"heh"
and then eh swam
>>
comfy should be dragged out on the street and shot
>>
>>106788701
> arbitrary score
> no sampler settings or any settings

total benchmark death
>>
>>106788562
Images gen that way when denoise is reduced somewhat, at least in early steps.
>>
>>106788723
paying for sd is funny too.
>>
what's with all the runninghub spam on civitai recently? do they pay well to advertize?
i'll immediately sell out and add their retarded faggy website to my loras if the chink pay is good.
>>
please save me from 12s/it on chroma..
>>
>>106788825
buy a gpu made from this century
>>
>>106788825
I'm at 20s/it, stop complaining :^)

Seriously considering a b60.
>>
File: 00018-583636848.png (2.18 MB, 1024x1280)
>>
>>106788713
fizzledorf should be dragged out on the street and shot
>>
https://civitai.com/models/2015171/
(download and then) report this for ez buzz
>>
unable to masturbate to any of these
>>
File: 00019-1940005087.png (1.57 MB, 1024x1280)
>>
File: music.png (2.35 MB, 1024x1024)
>>106788879
>>
File: ComfyUI_01892_.png (1.62 MB, 1024x1024)
>>106788734
>>>106788562(You)
>Images gen that way when denoise is reduced somewhat, at least in early steps.
16 steps is definitely better. I have denoise 1.0 and no post-processing to see what the base models can do. 4090 definitely helps make it fast (cached text)
>Prompt executed in 10.05 seconds
>>
yeah just what qwen needed, even more slopped shit https://civitai.com/models/1706513/
>>
>>106788825
ask for the lightning fags to make a lightning chroma lora lol
>>
>>106788926
There's several already
>>
File: Cloudkeks are so funny.png (165 KB, 1733x884)
STOP NOTICING GOYIM
>>
>>106788879

this one?
>>
how bad is it to use videos of the same girl for the wan lora? she does this specific dance i want, i have like 12 clips of only her
>>
Dis lil disabled nigga goin fishing after his last can of bait expired
>>
>>106788928
there's none for Chroma
https://huggingface.co/lightx2v
>>
File: fren.png (2.12 MB, 1024x1024)
>>106788879
or this one?
>>
File: light.jpg (748 KB, 2304x1152)
>>106788825
What gpu?
>>106788912
Stop using ddim, it skews your gens towards very bright
>>106788956
https://huggingface.co/silveroxides/Chroma-LoRAs/tree/main
>>
>>106788947
nice bulge
>>
File: file.png (6 KB, 642x86)
>>106788860
I FEEL SO SAFE RGHT NOW LIKE HOLY SHIT
even though the world is falling apart
>>
File: moar steps.png (2.02 MB, 1024x1024)
>>106788879
How about this one?
>>
File: Well job gentlemen.png (449 KB, 600x603)
>>106788860
>>106789011
>working as a red team for civitai for free
lmao
>>
>>106789011
Did you whisper that to yourself after getting stretched out in summer camp?
>>
>>106788860
>>106789011
¿what was it?
>>
>>106787739
OK but where is the list of included loras?
>>
File: gamora.jpg (83 KB, 794x794)
are there any photoreal models that can do orcs and goblins and fantasy creatures? im having a hard time finding one that works. if it can make an orc it's very unrealistic, like clay skin, but photoreal models do a terrible halloween costume type of orc. i cant find one that can make a modern special fx makeup type orc with skin like pic, or can even do the orcs from lord of the rings well
>>
File: dmmg_0045.png (1.62 MB, 832x1216)
>>106788879
>fapping to slop
ishygddt
>>
>>106789125
you telling me people are producing these lewd images for purposes other than cooming?
>>
File: dmmg_0267.png (1.63 MB, 832x1216)
>>106789141
i exclusively gen to scam boomers
>>
started underclocking my gpus to keep them surviving until 4090s are not 2k each...
>>
>>106789173
when my gpu dies i will go back to drawing futas
>>
>ctrl+f amd
>Phrase not found
hey guys, are W7900 48G or MI210 64G usable for video/image gen on linux?
can they match nvidia's similar cards?
>>
>>106789173
Just power limit to 70% and blast fans at 100%
>>
Any good comfyui workflow for nsfw image tagging?
>>
>>106789202
>match
no

I have amd. The answer is basically just get nvidia.
>>
>>106789221
https://github.com/jhc13/taggui
>>
File: plsliveMr3090.png (90 KB, 841x423)
>>106789213
that is part of what i meant with underclocking.
>>
File: file.png (1.6 MB, 768x1280)
>>106789089
Chroma probably does it best but you gotta wrangle the shitty hands and stuff. I haven't tried it on the photoreal sdxl models so maybe those do well?
>>
>>106789224
well how bad is it? I don't know if I can stomach nvidia
>>
File: rdr 001.png (620 KB, 512x512)
>1.4 still mogging with SOVL
>>
>>106789226
Thank you anon, this seems like a better solution to be honest.
>>
File: 1735801808493843.png (931 KB, 992x1048)
>>106788935
look at the SAAS and laugh.
>>
File: 1734529500817972.png (822 KB, 1064x984)
>>106789141
for fun

also qwen image edit is also a free meme generator since it can do almost anything.
>the pink hair anime character is dressed as a spaceman
>>
>>106789173
power limit, less power draw and quieter fans and basically 95% of the performance

my 4080 is at 70%. FPS difference in games is like 5 or so, and at high fps you dont notice 5 fps.
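
A small sketch of the arithmetic behind a 70% limit, assuming an NVIDIA card with `nvidia-smi` on the PATH, where `-i` selects the GPU and `-pl` sets the power limit in watts (usually needs admin rights). The helper `power_limit_command` and the 320 W default are illustrative; check the card's real default limit before applying anything.

```python
def power_limit_command(default_watts: float, fraction: float = 0.70, gpu_index: int = 0) -> list:
    """Build the nvidia-smi call that caps a GPU at a fraction of its default limit."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    target_watts = round(default_watts * fraction)
    # The command is only built here, not run; pass it to subprocess.run to apply it.
    return ["nvidia-smi", "-i", str(gpu_index), "-pl", str(target_watts)]


# Example with an assumed 320 W default limit.
print(" ".join(power_limit_command(320)))
```

The helper only returns the argument list, so the target wattage can be checked before anything touches the driver.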
>>
>>106789291
girl is so fuckable
>>
Did anyone save early examples from like 2020-2021? Really miss those shitty gens, something nostalgic about them now.
>>
What the fuck. Nodes can just disappear in a workflow? I had two Get nodes vanish and brick the genning.
>>
>>106788723
they also need to be tracking watts, because if a card is under-utilized, you need to ask why
>>
>>106789321
no anon, you fucked up.
>>
>>106789278
Nevermind, it doesn't work with the 5090.
>>
File: 00023-1596002716.png (2.99 MB, 1024x1536)
>>
File: 1742610213922169.mp4 (1.52 MB, 528x784)
>>106788947
>>
>>106789224
come on friend complain a little. I want to know EXACTLY why amd is such a bad decision
>>
>>106788512
> nunchaku
nvidia only
>>
>>106788430
> Also, am I being really dumb planning to get an Arc
just basic inference, sdxl training only (limited too, no fp8)
>>
File: 1739976329075812.jpg (204 KB, 1440x1517)
>>106789309
indeed, which is why she's in the asian qt pile
>>
>>106789321
when you have a good workflow, export it to a folder so you can import it in case anything is up. it shouldnt just vanish though.
>>
So are we just not gonna get a good VACE for 2.2?
>>
>>106788430
get Nvidia. CUDA is king. 16gb is enough to do wan Q8 video even or any model (I use a 4080).
>>
>>106789439
Source on the girl?
>>
>>106789468
480p?
>>
>>106789470
no idea, just a random photo, google lens is your best bet
>>
>>106789476
I very rarely like asian women but damn, that one is beautiful
>>
>>106789474
i've made vids at 832x480 just fine, 24gb+ is good but not "necessary", it's just a bonus and you can generate larger stuff if you like.
>>
File: 1747148661794432.jpg (523 KB, 2880x960)
remove the black guy from the picture.

I kneel qwen edit.
>>
File: 1742333471684538.png (1.26 MB, 1248x832)
>>106789489
replace the black guy with a japanese man with light skin in a business suit.
>>
>>106789484
>I very rarely like asian women
/ldg/ is a White man's general, lower your tone while speaking here.
>>
File: Untitled.png (1.69 MB, 896x1056)
>>
>>106789510
based
>>
File: 1749716577978022.png (1.61 MB, 1024x1024)
replace the ocean in the background with lava, and add an erupting volcano on the hills in the distance.
>>
File: 1757258135222910.png (1.54 MB, 1024x1024)
>>106789527
replace the ocean in the background with ice, and add snow all over the structures in the picture.
>>
File: 1733783388271618.png (1.21 MB, 896x1160)
give the man a mexican sombrero, mexican poncho, and curly moustache.

el literally me
>>
How long until veo3/sora2 level videos run on consumer hardware?
>>
>>106789568
Two weeks.
>>
>>106789568
After the bubble burst when corpos will have to focus on efficiency
>>
>>106789568
download VLC and you're set
>>
File: 00030-3705629445.png (986 KB, 1344x768)
>>
File: 1740197168740743.png (115 KB, 1876x581)
>>106789568
>How long until veo3/sora2 level videos run on consumer hardware?
never, sora 2 is great because it knows so many things, the chinks are only interested with mememarks they don't care if humans are plastic or we can't render Homora or every characters from family guy, once OpenAI decided to close the gate and not allow people to render characters anymore, the hype died down instantly
>>
File: 1738096736595088.png (1.19 MB, 1008x1032)
after qwen edit, memes will never be the same. they are evolving.
>>
>>106789606
>fun while it lasted
>less than a week
kek
>>
>>106788287
watch as half of them are now 404
>>
>>106789579
>After the bubble burst
Buy When There’s Blood in the Streets
>>
>>106789510
if you are american you are not white
>>
>>106789611
>after qwen edit, memes will never be the same.
this shit still zooms in the image, it still compresses the original pixels (since it has a vae) and once you want to use a character to do something completely different it slops it, it's not nano banana dude, wake up lol
>>
File: 1733802272678886.png (1.16 MB, 1008x1032)
>>106789611
transform the character on the left into a sexy brunette girl with large breasts, with her hands at her side.

problem solved.
>>
>>106788552
it did happen a few days ago
>>
File: GPW3dB5bsAASKim.jpg (77 KB, 1080x1033)
>>106789510
I dont even understand what you ment by this post, if im white im allowed to like asian women or not, according to you? But anyway, american shitskins dont get to call the shots about this topic
>>
File: 00026-874235526.png (2.23 MB, 1024x1280)
>>
>>106789644
source?
>>
>>106789630
>if you are american you are not white
We are mutts.
>>
File: 1755058274275360.png (1.22 MB, 928x1120)
replace the text "Starcraft" with "Mikucraft" in the same font style. Replace the woman in the picture with Hatsune Miku in the same style. she has a red "01" on her arm.

sometimes Qwen will give Miku a barcode instead of the number, the last part fixes that.
>>
>>106789661
based (f)re(d)tard
>>
>>106789669
look at the archive
>>
>>106789708
its not there
>>
File: 1758357919250084.png (501 KB, 936x1112)
the cartoon character with white skin is sitting at a desk with a computer and CRT monitor, wearing a "did absolutely nothing" tshirt. the image is black and white. keep his expression and pose the same.

qwen edit = free chud generator. from a cropped headshot, still works.
>>
>>106789711
not with any proper complex prompt comparisons i mean
>>
File: 1735962267880583.png (488 KB, 936x1112)
>>106789716
>>
>>106789716
Literally me except add a high temp pc in the back thats genning coom
>>
>>106789721
there was one, plenty chinese text
>>
File: 1740962271850766.png (520 KB, 936x1112)
>>106789728
mouse chud
>>
File: 1747958481436990.png (636 KB, 1116x1432)
>>106789716
>>106789728
keep doing nothing, fellow chuds!
>>
AAAAA
>>
File: 1751319952596946.png (564 KB, 936x1112)
>>106789733
the cartoon character with white skin is sitting at a desk with a computer and CRT monitor, wearing a "did absolutely nothing" tshirt. the image is black and white. keep his expression and pose the same. A computer monitor behind him has a blonde anime girl in a bikini with large breasts.

works
>>
>>106789745
>wants to buys components to not use them fully
>>
>>106787650
lol the twerking star wars alien is so retarded. new fetishes that can't exist in real 3d space are going to get created or already are
>>
>>106789755
it's not fully there, it's more than fully, 100% means "my gpu is too small for that model" kek
>>
>>106789489
I guess it doesn't work on stable-diffusion.cpp
>>
>>106789764
offloading a part of the model is not a large speed penalty or one at all depending on what is being offloaded
>>
File: 1748414690012886.png (777 KB, 936x1112)
>>106789749
>>
File: disagreement.png (2.33 MB, 1024x1024)
>>106789755
Well, multitasking is impossible at that point.
>>
File: 1738844882937381.png (512 KB, 936x1112)
>>106789733
there we go, now the temp is there.
>>
File: 1740179737856136.png (11 KB, 640x734)
>>106789787
>>
>>106787771
That MP4 looks damn fantastic

>>106788203
Would look even better with the belly button removed.
>>
>>106789716
qwen edit sounds great.

I may have to break down and do comfyui...
>>
File: Rey.jpg (905 KB, 2160x3500)
>>106787650
Can any one share those Darth Talon webms?
>>
>>106789809
it's a very fun ai model. you can manipulate any gen or image you have, and it works well with noob/illustrious, or qwen/flux/etc gens, and you can use these images for wan i2v too.
>>
File: 1729740013597379.png (346 KB, 936x1112)
multi image test:

the cartoon character with white skin is shaking hands with the green cartoon frog wearing a blue shirt and red shorts and white sneakers in image2. the background is white.

just put a pepe in image2 and enabled the node (disable 2/3 if you arent doing multi image stuff)
>>
File: Rey2.jpg (965 KB, 2160x3840)
>>106789813
Bump!!
>>
File: 00030-1980144515.png (2.12 MB, 1280x1024)
>>
File: Rey3.jpg (1.06 MB, 2160x3840)
>>106789813
>>106789833
Bumping!!
>>
>>106789843
All these vaguely similar gens are great but I'm sad because it's unlikely to be illustrious.
>>
File: 1743950580353434.png (410 KB, 952x1096)
>>106789830
the man in image2 who is 6 feet tall is shaking hands with the green cartoon frog wearing a blue shirt and red shorts and white sneakers in image2. the background is white.

if it's a cropped image stating general size helps with combining gens.
>>
>>
File: 1759184564177366.png (663 KB, 952x1096)
the green cartoon frog is wearing a blue shirt and red shorts and is sitting at a computer with a CRT monitor and typing. keep their expression the same. On the screen is the text "LDG".

mind you this is from a cropped pepe face with no body.
>>
>>106789395
Everything AI is built and optimized for nvidia cards, simple as
>>
File: 1734967529601804.png (630 KB, 952x1096)
>>106789883
>>
File: 1752991757131826.png (693 KB, 1136x912)
the green cartoon frog is wearing a blue shirt and red shorts and is sitting at a computer with a CRT monitor and typing. keep their expression and pose the same. On the screen is the text "LDG".

this time source is peepo
>>
File: 1752406327263662.png (828 KB, 1136x912)
>>106789907
better colors
>>
File: 1759497087188587.png (968 KB, 1176x880)
the cartoon character is holding up a newspaper saying "LDG NEWS: Sam Altman kneels to SAAS copyright!", with a picture of a man below it.

qwen edit is so good. can fix the typo with another edit but the functionality works.
>>
File: 1748039332340260.png (1004 KB, 1176x880)
>>106789986
the cartoon character is holding up a newspaper saying "LDG NEWS: Sam Altman begs for more money!", with a picture of a cartoon man holding a bag of money below it.
>>
File: 00032-3975052202.png (2.17 MB, 1024x1536)
>>106789854
i am using (a mix of) illustrious. mostly using oekaki tag.
>>
File: 1732772556267923.png (1.03 MB, 1376x760)
>>
>gen image of girl with cum on her face
>animate it with wan22
>cum drips like water
>try various loras, none of them help
>>
BUUUULSHITTTTTTTTTT
https://www.youtube.com/watch?v=NRifKEf0xr8
>>
>>106790102
have you tried prompting thick fluid etc
>>
>>106790106
>votes 3608
>>
what is the best current way (besides using the OG code from wan github repo) to use Wan Animate?
There was a KJ workflow, and then a native workflow, but it was missing the "preprocessing" part that the original github code had, so the results were not very good.
Has there been any improvement?
>>
>>106790128
yes have been proompting. ill try some wan21 loras. there has to be something that works
>>
>>106790142
>95% CI +- 10
it means even with the worst case scenario (1167 -> 1157) it'll still stay number one
>>
>>106790155
So? It's a botted score for a model that everyone knows is shit.
>>
>QwenImageEditPipeline

Looks like stable-diffusion.cpp will have to port that.
>>
>>106790106
Wow who could've guessed that jeeterboards are worthless. This is a huge surprise no one saw it coming.
>>
>>106790106
yes brother xi, we numba wan
>>
>>106790160
>It's a botted score for a model that everyone knows is shit.
that I agree with
>>
>>106790106
kek, I'm glad HunyuanImage has made LMemeArena completely useless and destroyed its reputation, at least this giant slopped model was useful at something
>>
it really took hunny 3.0 for anon to realize its all bullshit KEK
>>
>>106789784
Get help!
>>
File: ComfyUI_temp_siahh_00005_.jpg (884 KB, 2016x1152)
>>
>>106790106
>https://old.reddit.com/r/StableDiffusion/comments/1nxyk0j/for_the_first_time_ever_an_open_weights_model_has/

why are they shilling it so hard?
do they really not understand it's not actually good?
>>
File: 1741847546323706.png (102 KB, 1619x417)
>>106790251
to be fair the comments aren't really agreeing with the post
>>
File: 1728300846523939.png (3.11 MB, 1744x1200)
give the anime girl long teal twintails.

this is after "swap the outfit with the outfit of the girl in image2" which was haruhi.
>>
>>106790270
4 mn late anon >>106790251
>>
>>106790267
what model are you using for these?
>>
r/StableDiffusion has a lot of incorrect information about Hunyuan Image 3.0. People ran a simple comparison between Hunyuan Image 3.0 and Qwen Image and concluded that it is not much better than Qwen Image, or even SDXL. That conclusion has misled many people. Hunyuan Image 3.0 is an autoregressive LLM that integrates a diffusion model. Because of that it has "concepts" and can handle tasks that traditional diffusion models cannot, such as creating comics from simple prompts or customizing every part of the image (it understands far more of a prompt than Qwen Image does). Currently only four image generation models have this capability: Google's nano banana, ByteDance's Seedream 4, OpenAI's image1, and finally Tencent's Hunyuan Image 3.0. So the open-source community has, for the first time, a model that is extremely close to the closed-source SOTA models. Unfortunately, it has not sparked much hype.
>>
Holy go back
>>
File: 1735283187821221.png (939 KB, 872x1200)
>>106790267
give the anime girl short black hair and a black goth style outfit.
>>
>>106790282
qwen edit v2 (2509), with the 8 step qwen image lightning lora v2.0. fp8 model. the comfy template has the model links on the side, im using the 8 step lora over the 4 step one, it's fast enough anyway.
>>
>>
>>106789606
copyright cant be gone soon enough, what absolute aids
>>
File: 1746732841224723.png (1.06 MB, 872x1200)
>>106790304
the anime girl is dressed as a spaceman in a white spacesuit.

neat.
>>
>>106790289
>Unfortunately, it has not sparked much hype.
wait, you're telling me a giant 80b model that needs at least a 10000 dollar gpu and that can only produce slop and Miku hasn't sparked hype? NO WAY
>>
>>106787892
qwen edit 2509
>>
>>106790207
I'm not the woman, I'm the MAN.
>>
File: 1735143416385509.png (1.24 MB, 872x1200)
the anime girl is dressed as a music conductor.
>>
>>106788420
Why are the chinks cucking so hard in this regard though
>>
>>106790317
we need to teach ai how to put eyes and mouths in the wrong places so it looks amateurish.
>>
File: 1735939345653413.png (1.18 MB, 896x1160)
>>106787892
qwen edit.
>change the man into a blonde woman with very large breasts
literally booba.
>>
>>106790367
I don't think they're scared of copyright, they just don't give a fuck, all they want is to be numba wan in mememarks, so it doesn't matter to them if the humans are plastic or it can only make Trump, if it can put a blue square on top of a red circle they think they reached success
>>
>>106790246
skyscrapers like that would be odd. Without heavy transport traffic, it would indicate basically people spend at least the whole month in a single tower.
>>
File: ComfyUI_01018_.png (1.18 MB, 1024x1024)
>80B model
>outputs look just like sloppier versions of qwen image and hunyuan image 2.1

When are these chinese companies going to get better datasets?

pic related is chroma and it looks 100x better.
>>
>>106790384
ya it's the dog girl
>>
>>106790002
Oh, cool. Hope up.
>>
>>106790384
But what about prompt adherence.
All I see is slop here, slop there, but does it actually understand more concepts?
>>
File: 1730840075612668.png (990 KB, 912x1144)
show the anime character from behind. they have a large ass.

source is the fairy from metaphor, gallica. see, inpainting is neat but it can't do all the stuff edit can. you'd need to use openpose, make a pose, and then hope your output at high denoise matches the original character/outfit.

very cool model.
>>
>>106790394
>more concepts?
such as?
>>
>>106789813
in the previous thread retard
>>
File: 1734304251207082.jpg (95 KB, 850x1063)
>>106790412
and the reference was this:
>>
>>106790369
holy plastic
>>
File: black.png (606 KB, 2276x847)
Is it just me, or does Qwen Edit still produce black outputs because of sage attn?
>>
>>106790421
>>106790412
model?
>>
>>106790368
>namefag doesnt know
typical
>>
File: 00045-2339911991.png (1.88 MB, 1448x920)
>>
File: 1736243665745770.png (1.01 MB, 912x1144)
>>106790421
a view of the anime character from the front.

no other model can do all this manipulation, kontext did some but not as much as qwen edit.
>>
>>106790431
>qwen edit.
latest?
>>
>>106790427
qwen edit 2509, use the comfy template, swap 4 step lora for the 8 step one (better quality, still fast)
>>
File: hunyuan_image_3_fennec.png (1.66 MB, 1024x1024)
>>106790394
It doesn't. It can't even do a proper fennec girl.
They look even worse than the already bad qwen image and hunyuan image 2.1 fennec girls.

80B for pic related, this model is just a large piece of shit.
>>
>>106790413
Anything that Qwen/Flux are too retarded to gen correctly
>>
File: 1755044909452726.png (1.52 MB, 1686x590)
a view of the anime character from the side.

pretty good desu
>>
File: no_sageattn.png (1.79 MB, 2300x1156)
>>106790425
>high level of retardation confirmed
>>
>>106790394
No, it's mostly worse. Sometimes it produces interesting output, but only through RNG. If you have a complex prompt you'd like to test, post it and I'll run it, but it'll be half an hour for the result.
>>
File: settings.png (217 KB, 1593x976)
if anyone is fiddling with bigASP 2.5, I find the general setup in the pic really brings out the advantages of "flow matching SDXL" quite nicely.

The clip-l is this LongClip finetune from zer0int:
https://huggingface.co/zer0int/LongCLIP-KO-LITE-TypoAttack-Attn-ViT-L-14/tree/main
>>
File: 00025-3938720238.png (2.73 MB, 1248x1824)
>>
File: 1730230046543935.png (81 KB, 1118x599)
>>106790425
>>106790469
you can make it work with sageattention, you have to remove the sageattention command and use this node instead
>>
>>106790486
Convince me with a gen
>>
>>106790446
Oh nooooo it can't do a fennec girl!!! Throw it in the trash!@!!
>>
is wan 2.2 lightning good enough at 2 steps per pass or am i just fucked to need 4 steps a pass? i gen at 720p at a high quant so quality's pretty good but i hate that this is twice as slow as 2.1 because i need twice the steps from what i've tested before.
>>
>>106790452
Can it do a person? Try someone good looking and white.
>>
>>106790428
There is nothing.
>>
>>106790446
if you told me this was a Qwen Image render I would've believed you, it has exactly the same anime style wtf?
>>
>>106790429
Do any non-whites romanticize about lonely jobs in space?
>>
>>106790446
API node status?
>>
File: ComfyUI_19201.png (3.47 MB, 1200x1800)
>>106788420
>>106788935
That's one way to save on GPU resources!
>>
>>106790446
Do you think AniStudio will eat into your market share?
>>
>>106790502
it can do anything, use "keep their expression the same" to make the face unchanged more or less.
>>
>>106790544
>>
File: cumra.jpg (83 KB, 1012x624)
>>106790542
doubt [x]
>>
>>106789805
>Would look even better with the belly button removed
covered_navel, impossible_clothes on semi-real outputs is the promise of AI.
>>
>>106790532
How did you prompt mid face, or is it a lora?

also props for seasonality
>>
>>106789395
>>106789255
it's alright for image gen, haven't tried video yet because I assume it'll be aids and the results for what's possible don't seem great anyway.
16gb vram. image gen is actually pretty decent on my hardware but the aids part is you have to deal with amd's rocm, which is the definition of jury-rigged if you want to use it with comfy. It works but... there are fewer amd users so it will always be behind on performance and what's possible unless AMD really gets their shit in gear.
>>
File: ComfyUI_03745.png (2.43 MB, 1344x1344)
>>106790560
MID!? Jenny's face is cute as heck!


