[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


File: collage_1766776530_1.jpg (951 KB, 2192x1472)
951 KB
951 KB JPG
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107671064

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>107675287
Thank you for baking this thread, anon.
>>
Blessed thread of frenship
>>
>ani still seething
>debo Gasing him up on discord
Pathetic
>>
>>107675287
Thank you for baking this thread, anon
>>107675320
Thank you for blessing this thread, anon
>>107675296
Thank you for thanking this baker, anon
>>
>>107675381
>Thank you for thanking this baker, anon
lol. Anyway, since Comfy said Tongyi is making an anime model that's not fully uncensored, any educated guess as to whether their anime finetune will be easy to finetune further to support NSFW? I imagine there will be at least a few guys willing to do train Tongyi's anime tune on NSFW, but I'm not sure if that will even work. Does the NoobAI team even have the compute to do a full anime tune on Z-Image-Base?
>>
>>107675381
Now thank me you little bitch.
>>
>56 images in the last thread
is this a new low?
>>
>>107675403
kek no not even close
>>
>>107675407
They only use SDXL merges thoever
>>107675394
Let's see about Bayse releasing first. One step at a time
>>
>>107675420
>They only use SDXL merges thoever
It's the spammer being an asshole, the IP nuked posts contain this kind of shit.
>Let's see about Bayse releasing first. One step at a time
I'm just going crazy with the waiting!
>>
>>107675420
Because there's still nothing better for anime???
>>
what does a 2ch VAE look like
>>
>>107675440
lol, im also curious to know
>>
ZiT with an animu LoRA is a million times better than any SDXL checkpoint. I will allow you to continue coping.
>>
^ bait
>>
>>107675446
Yeah desu you're not wrong, I agree that ZiT with style loras is superior in terms of precise composition adherence. But with IL models I can just put in some kino tags at high strength and immediately up the output quality, while with ZiT if parts of your prompt aren't very effective you don't have much recourse.
>>
>>107675446
i love using 5 loras for every gen too, who even needs finetunes when you can just bake a lora
>>
>>107675446
yeah i cant go back to 4 channel vaes they suck so much ass
>>
>>107675491
on the genning side you can cope with high res inpainting. ofc this doesn't help on the training side
>>
>>107675440
One channel for each booba
>>
You got me curious and I'm checking Civit rn for those Zit anime loras and every single gen with every anime lora I see looks as janky if not jankier than Noob - melted eyes, even more meltier fine detail, terrible hands. How is it any better than Noob? Or what specific loras are you guys talking about that make anime shit good on Zit?
>>
>>107675536
civitai sucks ass in general, train your own lora. you do have the compute right anon ??
>>
>>107675536
This guy's ZiT style loras are really good, he posts gens here.
https://civitai.com/user/xixxix/models
>>
>>107675536
>checking Civit rn for those Zit anime loras and every single gen with every anime lora I see looks as janky
That's not unique to Turbo
Civ is full of jeets what did you expect kek
>>
>i-it's just Civit!
>still no anime gens posted
How unexpected. Made me check though.

>>107675552
Nothing in there looks better than Noob. What exactly the advantage if the base gen quality is more or less the same, if not worse? Natural language prompt adherence?
>>
clearly you are new if you have yet to see good ZiT anime gens
>>
>>107675569
>Natural language prompt adherence?
This, and things like interior coherence (like the IL fucked up beds, room corners, etc), as well as small details.
I still prefer IL overall, to be sure.
>>
>>107675434
>It's the spammer being an asshole, the IP nuked posts contain this kind of shit.
Told ya. >>107675407
>>
>>107675569
>Nothing in there looks better than Noob.
Are you unable to see the difference between a 4 channel and 16 channel VAE? Besides the better prompt adherence. XL needs regional prompting cope while ZiT can simply be prompted with "the boy is to the left of the girl"
>>
I've been sleeping under a rock, what is z-image?
>>
>>107675616
no you havent
>>
>>107675605
>Are you unable to see the difference between a 4 channel and 16 channel VAE?
You can upscale 4ch vae gens.
>>
>>107675622
Kek not the same at all
>>
>i just checked the shithole and its full of shit what gives?
>>
>>107675626
4ch vae won't be able to capture some very high-frequency data, but other than that, upscales can give great results if you do them right
>>
>>107675642
Which one?
>>
I went home for christmas, expecting to phone lurk ldg to learn about the new base version. But here we are. Im traveling back tomorrow and we still dont even have a release date. Doom posters were right, as usual
>>
Who are worse? Schizos or vramlets?
>>
>>107675680
china traitor
>>
File: 1759656960630208.png (2.05 MB, 1088x1536)
2.05 MB
2.05 MB PNG
give the anime girl with a red christmas dress. she is wearing a santa hat. there is snow on the ground. keep her expression and hair the same. keep the location the same.

it turned the visor into a santa hat.
>>
File: ZMr69PCuSIKuCyeMJ0TS2.png (1.47 MB, 1692x1438)
1.47 MB
1.47 MB PNG
Qwen AnyPose lora
https://huggingface.co/lilylilith/AnyPose
>>
Can I make a request? I'm using SwarmUI and can't really do much more than use a model the way it's intended, without any special parameters and Loras.
Can someone create a picture of my WoW character? Female human paladin, tan skin, freckles around the nose, gold-blonde hair, tied into a neat bun, brown eyes. Would be cool to show her leaning on a large zweihander sword, maybe a cowboy shot. Bonus points for having it look similar to the WoW comic art style.
>>
>>107675864
>pussy lips
those are balls which are desirable desu
>>
>>107675864
also someone suggest this anon how to remove the tiling jpg artifacts most noticeable in the background
>>
>>107675680
>Who are worse?
ESLs
>>
>>107675842
Just gen it on Nano Banana Pro.
>>
>>107675864
grok it
>>
>>107675886
can qwen do nsfw?
>>
I think ani's pastebin should be posted in all threads. It isn't fair we only single out debo when ani is a smarter and more unstable schizophrenic than he could ever be. I don't want to post in the anime thread so I hope one of them sees this post
>>
>>107675918
I've seen this recommended before. It can do genitalia as well? Not just breasts?
>>
File: 1748899793792122.webm (685 KB, 384x288)
685 KB
685 KB WEBM
What is the best local model and workflow for face swapping videos on a RTX 5090?
>>
File: 1746336625492234.png (1.89 MB, 1088x1536)
1.89 MB
1.89 MB PNG
>>107675741
the anime girl is sitting at a table with a black CRT monitor with "LDG" on the screen in stylish text. behind her is Hatsune Miku wearing a black trenchcoat.

2511 is pretty good with the new workflow, use this one cause the 2509 template needs the slightly updated one.

https://docs.comfy.org/tutorials/image/qwen/qwen-image-edit-2511
>>
File: 1741795062331132.png (1.93 MB, 1088x1536)
1.93 MB
1.93 MB PNG
>>107675935
oops, meant to add this one, but both are fine I guess.
>>
Finally got around to trying PersonaLive and it's total shit
>>
>>107675942
you guess wrong
>>
File: 1765946119795942.png (212 KB, 966x765)
212 KB
212 KB PNG
>>107675980
def something off with workflow, plus there are the same artifacts in every image of yours
>>
File: file.png (335 KB, 594x654)
335 KB
335 KB PNG
>>
>>107676084
Why is it rounded?
>>
>>107676095
easier to put in your ass
>>
>>107676095
g spot
>>
>>107676084
Way more expensive for the same performance as the cheapest 5090.
>>
File: ComfyUI_00071_.jpg (898 KB, 2048x2624)
898 KB
898 KB JPG
I been checking all the ai thread on different boards, and I notice you guys hardly post shit or never max the thread, what gives?
>>
>>107676176
its about status, i don't drive a ferrari because its better, i drive it so people understand who i am
>>
>>107676216
A ferrari is visible outside for everyone to see, and it also legitimately feels different to drive.
A gpu would behave the exact same no matter what you get unless you want silence and get the watercooled model, or have enough money and get the 6000 pro which is actually more powerful.
>>
>>107676255
slut
>>
>>107676261
Thanks, you too.
>>
File: 287401044.jpg (2.64 MB, 2432x1664)
2.64 MB
2.64 MB JPG
>>
will a 4070 ti work better than using google colab?
>>
>>107676216
same reason hood niggers drive BMW and wear gold chains
>>
>>107676216
Lol fag. One of my colleagues got shot and died because some guys tried to rob his cars in his home. This isn't even America btw.
>>
>>107676220
incorrect. tran is a nocoder
>>
File: ComfyUI_00077_.jpg (1.56 MB, 2048x2624)
1.56 MB
1.56 MB JPG
>>
>>107675805
too much bleeding
>>
>>107676576
can you stop posting off topic?
>>
File: 1764950062091102.png (793 KB, 2057x1208)
793 KB
793 KB PNG
wtf i love qwen now
>>
>>107676675
but you made the image less desirable
>>
File: 1754385430199181.jpg (171 KB, 784x1168)
171 KB
171 KB JPG
>>107676685
:^)
>>
>>107675741
>>107675935
>>107675942
I'm so hype for when this finally drops desu
Here's a LoRa btw

https://civitai.com/models/1473885/motoko-kusanagi-or-gits-2026-or-il
>>
I've only just started using wan2.2. but is i2v generally lower quality than t2v?
>>
>>107676869
i2v is better cause you can pick the style from the source image, t2v is fine but is random.
>>
>>107676216
That you have money? That doesn't make people respect you.
>>
>>107676897
not about respect, its about knowing i am above you and have way more power than you, you will either stay out of my way or you will try to please me for favors
>>
File: 1751679529820156.png (1.56 MB, 1504x1112)
1.56 MB
1.56 MB PNG
side profile of the anime girl. she is smiling.

neat what edit can do desu
>>
>>107676469
can you stop posting off topic?
>>
File: 1747335535246475.png (1.73 MB, 1504x1112)
1.73 MB
1.73 MB PNG
>>107676910
the anime girl is using a military salute. change the background to a military base.
>>
>>107676911
it's a bot
>>
>>107676906
You still have to follow traffic laws. You have no more power over mean than a homeless person has power over me.
>>
>>107676923
stop the cope, if you were to cause me trouble i could afford lawyers just to waste your time, can the homeless guy do that?
>>
File: 1755281754631912.png (1.62 MB, 1504x1112)
1.62 MB
1.62 MB PNG
read the book!
>>
>>107676938
You're the one that's coping. You attach imaginary value to a consumer product. You won't get power just by buying a ferrari. Buying a ferrari doesn't imply anything about your financial situation except you spent money buying it. You could also be renting it.
>>
How to increase output resolution on QIE? I remember trying in the past but it fucks the picture
>>
>>107676984
truth nuke
>>
which Lightx2v loras am I meant to use for i2v?
>>
>>107676958
I wont read the book provided by someone with 3 hands!
>>
File: 1763288579265328.png (2.67 MB, 1264x1320)
2.67 MB
2.67 MB PNG
>>
>>107676938
>stop the cope, if you were to cause me trouble i could afford lawyers just to waste your time, can the homeless guy do that?
Fool.
The homeless guy is untouchable. What are you going to do, sue him? Waste his time? He has nothing but time and he has nothing for you to take.
He has no property.
He has no dignity.
Prison is just the promise of three square meals a day and a roof over his head.

Even if you were to kill him somehow and get away with it, it would not undo the damage that he is capable of doing to you permanently just by shitting in your lambo.
>>
>>107677047
high: kijai MoE distil lora 1 str

low: lightx2v 2.2 lightning low noise, 1 str

seems good for me
>>
>>107677089
>The homeless guy is untouchable. What are you going to do, sue him? Waste his time?
https://www.youtube.com/watch?v=0wBkN2mSGJg
kek
>>
>>107677089
gee anon what could one do to someone nobody cares about, would be a shame if they disappeared
>>
>>107677099
But he didn't kill anybody in that film.
>>
>>107676923
You should ask him how much it costs for a likable personality, since he clearly can't afford one
>>
File: 1739982896849666.png (1.8 MB, 1264x1320)
1.8 MB
1.8 MB PNG
the man is doing a karate pose, while the text "FENT MAN" is at the top.
>>
>>107677110
what? he killed a lot of people, his father was powerful enough to put all of that shit under the curtain
>>
>>107677124
You clearly didn't watch the film.
>>
>>107677124
but I thought money didn't give power according to anon?
>>
>>107677123
holy plastic, Qwen Image Edit fucking sucks at humans dude :(
>>
>>107677132
it's just a movie though? when you do bad shit in real life you can be in trouble, even if you're rich, ask Diddy and Epstein about it lol
>>
>>107677137
it's actually pretty good but with some transforms if you dont specify "keep their expression the same" it can vary
>>
>>107677147
lol
>>
>>107677137
welcome to local!
>>
File: 1738311156262017.png (2.2 MB, 1264x1320)
2.2 MB
2.2 MB PNG
gentlemen, affluent white male here, where could I find the best bottle of wine, perchance?
>>
File: 1765064493701459.jpg (20 KB, 400x400)
20 KB
20 KB JPG
>>107677123
is that only the dead guy you have on your pc? even if you have mental problems, there are so many other funny people out there
>>
>>107677243
fent man is just a test case. I have a very large folder/folders of various content.
>>
File: this.png (153 KB, 498x410)
153 KB
153 KB PNG
>>107677235
>welcome to local!
if Z-image edit is as unslopped as Z-image turbo we might be saved though
>>
>guys look at me im so racist haha do I fit in?
So cringe. Grow up, man
>>
Vote.
>Chinese are all liars
>Base will be released this year
>>
>>107677261
>t. brown
>>
>>107677254
a couple dozen images is not a large folder
>>
>>107677158
That's a troll. There are indians who are really mad we can locally gen, being rich white people who own several GTX 5090 gpus in sli.
>>
File: 1751664494228392.png (2.02 MB, 1720x968)
2.02 MB
2.02 MB PNG
replace the man in the center holding a sword with hatsune miku wearing plate armor and holding a sword. Change the text "kingdom come" to "Hatsune Miku".

neat how AI can figure out the missing letters and emulate the style.
>>
>>107677261
how is that racism though? he's making fun of floyd because he was a fentonyl addict who held a gun to the belly of a pregnant woman, not because he's black
>>
>>107677261
it's just not relevant at all. it was almost six years ago, like give it a fuckin rest
>>
File: 1736074824198664.png (1.95 MB, 1720x968)
1.95 MB
1.95 MB PNG
>>107677306
fixed the hats thing. prompted "plate helmet":
>>
File: 1763778963119629.mp4 (1.9 MB, 2048x730)
1.9 MB
1.9 MB MP4
>>107677306
>emulate the style.
you have more chance of getting that with that custom node
https://github.com/BigStationW/ComfyUi-TextEncodeQwenImageEditAdvanced
>>
>>107677324
>it was almost six years ago
like Epstein's ""suicide""
>like give it a fuckin rest
do you think we should stop talking about the Epstein's files because the man died 6 years ago then?
>>
>>107677348
k
>>
Got some crispy yummy SONGBLOOM in the oven cookin up real nice.
>>
File: 1744480582259680.png (1.77 MB, 1720x968)
1.77 MB
1.77 MB PNG
that text is CLEAN, qwen edit is amazing.
>>
the brotherman bill is a brother living at the top of the bill
the brother manbill is the brother with brotherman bill skills
>>
>>107677456
>can't keep the style of the original image
>it doesn't look like floyd anymore
>big giant head for some reason
ay yes, I can see how "amazing" it is
>>
File: 1761859303134721.png (1.94 MB, 1720x968)
1.94 MB
1.94 MB PNG
replace the man in the center with the anime girl in image2 who is holding a sword in the air. Change the text "kingdom come" to "Miku Miku". leave the text "deliverance" intact. keep his expression the same.
>>
>>107677502
can you try to edit your taste so it isn't shit or spammy?
>>
File: 1762920718932740.png (1.6 MB, 1288x1288)
1.6 MB
1.6 MB PNG
>>107677539
hey, calm down pal
>>
>>107677682
>calm down pal
Said Migu calmly, holding a pistol in her hands.
https://www.youtube.com/watch?v=IdoD2147Fik
>>
>>107677682
Hey look, it's the suicide troon mascot.
>>
>>107677456
even photoshop gives better results. edit poop ai is meh
>>
>>107677705
nah that's teto
>>
>>107677705
troons like bridges, not miku
>>
>>107677733
>>107677718
color scheme
>>
>I just can't stop loving you, I just can't stop loving you
>And if I stop, then tell me just what will I do

When will we have the ability to generate Michael Jackson songs at home? So far anything I tried is so far worse than udio...
>>
>107677764
retarded perchance?
>>
>>107677780
It's Eggnog Guzzling Fascist
>>
>>107677780
I don't know why 4chan has tripfag shit in the first place, we're supposed to be all anons
>>
Switching to AceStep, for Spanish 8^) May have some feliz navinazi songs.
>>
>>107677805
Based nonnie.
>>
>>107677805
It's not a tripcode. Don't you have to be 13 to be groomed on this site by chinese homosexuals?
>>
btw comfyui's audio decode is garbage.

comfyui is garbage and needs a replacement.
>>
btw I am trans, if that even matters.
>>
>>107677805
>we're all supposed to be anons
>just because, okay?
>>
>>107677860
if you want to be an avatarfag you have discord for that
>>
File: 1758970628607599.png (1.79 MB, 1288x1288)
1.79 MB
1.79 MB PNG
the anime girl in image1 and the anime girl in image3 are standing beside the man in image2.

interesting compilation
>>
File: 1749162671786405.png (1.76 MB, 1288x1288)
1.76 MB
1.76 MB PNG
>>
edit models are so fucking bad, the local ones at least
>>
File: 1741637295502946.png (1.64 MB, 1288x1288)
1.64 MB
1.64 MB PNG
the anime girl in image1 is sitting at a desk with the character in image2.

kek
>>
>>107677936
Is that what the talmud says?
>>
File: 1738894415929253.png (1.54 MB, 1552x1072)
1.54 MB
1.54 MB PNG
the old man with glasses is wearing a fedora and is holding a katana.

he studied the blade.
>>
>>107677905
kek these are all good. i love the fent shit. keep it up
>>
File: 1761710397168794.png (1.48 MB, 1288x1288)
1.48 MB
1.48 MB PNG
make a plastic anime figure of the image, on a round pedestal. keep her expression the same.
>>
>>107677998
this is so bad there's 0 shadow on the sword lol
>>107677936
this, let's hope Z-image edit will make QiE irrelevant
>>
How much do you guys charge per 1200x1200 prompt?
>>
File: 1742008043503711.png (2.41 MB, 1288x1288)
2.41 MB
2.41 MB PNG
make a sketch of the anime girl on a medieval papyrus, that is resting on a table.

pretty neat output
>>
>>107678058
about tree fiddy
>>
>>107678063
I mean, QiE 2509 could do that as well, at this point we're just consoming the same product again just because it's labeled "new" and not because it has improved and can do shit that the previous version couldn't
https://www.youtube.com/watch?v=-JmVjdYE7qY
>>
>>107678083
new one seems more responsive for certain prompts and transformation prompts, it's not a huge leap but it is better imo
>>
>>107678083
It's a bit better but not a revolution, which makes sense since it's an iteration of the same model, not "Qwen image edit 2.0".
>>
some guy fixed long vid jank!

https://www.reddit.com/r/StableDiffusion/comments/1pwh4gw/new_implementation_for_long_videos_on_wan_22/
>>
>>107678138
yeah I'm not gonna try his schizo workflow, if he does a custom node that simplifies this shit then maybe
>>
And sometimes broken is better than good, more gooder.

Behold, healing entrainment.
https://files.catbox.moe/viq09l.mp3

Vocaroo is mono, so no bueno. USE HEADPHONES, RELAX, GET HEALED. lyrics (Lope de Vega https://www.poeticous.com/lope-de-vega/versos-de-amor-conceptos-esparcidos?locale=es):
[verse]
[es]Versos de amor conceptos esparcidos.
[es]engendrados del alma en mis cuidados.
[es]partos de mis sentidos abrasados.
[es]con más dolor que libertad nacidos.

[verse]
[es]expósitos al mundo en que perdidos.
[es]tan rotos anduvistes y trocados.
[es]que sólo donde fuistes engendrados.
[es]fuérades por la sangre conocidos.

[chorus]
[es]pues que le hurtáis el laberinto a Creta.
[es]a Dédalo los altos pensamientos.
[es]la furia al mar las llamas al abismo.

[chorus]
[es]si aquel áspid hermoso nos aceta.
[es]dejad la tierra entretened los vientos.
[es]descansaréis en vuestro centro mismo.
>>
>>107676176
mine was $2k, which i sold my 4090 for, so was a pretty sweet deal overall
>>
>>107678147
you can probably simplify it yourself if you're not stupid. besides, been waiting for this since wan 2.1 released, those old daisy chained workflows will come in handy
>>
>>107678186
>been waiting for this since wan 2.1 released
you couldn't came up with this by yourself? what are you stupid?
>>
>>107678138
pretty cool but this doesn't address character consistency
>>
>>107678194
>bla bla bla
not music, tell me why I should care.
>>
>>107678170
3090, 4090, 5090, the only respectable nvidia cards. Sort of like trannies that pass.

If you haven't heard this, you're missing out:
>>107678157
>>
>>107678194
>guy makes breakthrough
>on christmas
>YEAH BUT IT DOESNT-

kek, never change /ldg/, never change
>>
File: 1759452659423556.jpg (416 KB, 1328x1944)
416 KB
416 KB JPG
>>
>>107678244
shut up, turn on grayscale, put on the hifi headphones, and listen:
>>107678157
>>
>>107678248
nice 2.5d
>>
File: T W O W E E K S.png (640 KB, 1280x720)
640 KB
640 KB PNG
>>107678244
>breakthrough
call me back when making a 40 seconds video doesn't take 2 weeks on inference though
>>
>>107678248
Forensic tip

You can tell it's not a real japoshit wacom painting in grayscale. Highly obvious somehow.

My screen is in grayscale. In this moment I am erudite.
>>
>>107678252
bla bla bla, not video, i dont care
>>
>>107675616
Z-Image-Turbo is an ultra-fast, distilled AI model for generating high-quality, photorealistic images, developed by Alibaba Tongyi Lab. It's known for its exceptional speed (sub-second inference), low resource requirements (runs on 16GB VRAM consumer GPUs), and strong performance in following detailed instructions, handling bilingual text (English & Chinese), and achieving realism, making it ideal for high-volume, cost-effective content creation.
>>
File: 1740750762394417.jpg (287 KB, 1328x1944)
287 KB
287 KB JPG
>>107678274
thanks

>>107678279
what is this schizo babble?
>>
>>107678286
absolute cringe.

>>107678316
Looks like ass, in other words.
>>
>>107678277
>he hasn't heard of svi and speed boosts
>>
>>107678347
they make quality worse though, and you add that to his cope technique it's a perfect recipe for disaster
>>
>>107678333
>absolute cringe.
yes, you are
>>
>>107678355
i wike chineeees if woo pweeeze
>>
>>107678354
its ok, just say you have a 6gb card, no judgement here
>>
>>107678372
>the guy telling you he's using "speed boosts" because he has a slow GPU is calling you a vramlet
the irony is off the charts
>>
File: file.png (1.58 MB, 1278x839)
1.58 MB
1.58 MB PNG
>>107675805
>>107677123
meh. It's better than the base model, but if you want something unique, it's probably not gonna do great. I had to add extra prompting to get the background to stay the same.
>>
>>107678310
for once what's written on the model card isn't some marketing exageration, the model is literally as good as they claimed lmao
>>
>>107677256
their repo already said the edit model is gonna be shit.
>>
>whole thread is a retard spamming his shitty qwen edit gens
lmao
>>
>>107678398
no, they said that for the base model, edit will be fine (even though I don't know why they don't want to use RLHF (aka the secret sauce) on that one)
>>
File: file.png (2.96 MB, 1248x1872)
2.96 MB
2.96 MB PNG
>>
>>107678277
hey, just calling you back to let you know that sageattention exists.
>>
>>107678381
HUH!? SORRY SWEATY, I CANT HEAR YOU OVER THE SOUND OF THESE FANS! HEH, YEAH THEYRE SOMETHING ELSE. THAT KIJAI FELLA, HIS NODES ARE NUTS, GOT A 30 SECOND VIDEO UNDER 4 MINUTES!
>>
>>107678417
Looks bad in grayscale. 2/10.
>>
File: file.png (48 KB, 775x408)
48 KB
48 KB PNG
>>107678404
>not putting it at the same level as ZIT
drop the scales from your eyes, anon. It's over
>>
File: nice one.png (249 KB, 512x512)
249 KB
249 KB PNG
>>107678440
keek
>>
>>107676198
Funny how that is true while this thread is still the defacto imggen tech thread on the chanz
>>107678403
Everyone's going home tomorrow so anon will have access to his puter then
>>
File: reeeee.gif (419 KB, 220x220)
419 KB
419 KB GIF
>>107678449
that's because those niggers refuse to apply RLHF to it whyyyyyyy
>>
File: file.png (1.64 MB, 896x1152)
1.64 MB
1.64 MB PNG
>>
>>107676198
>you guys hardly post shit or never max the thread, what gives?
pretty much only happens when theres a new and exciting model drop, the rest of the time is philosophical debate which is kino
>>
>>107678449
even if it won't look at good as turbo, they still trained their model with only real data so it'll always be less slopped than QiE, there's no way it's gonna be worse, it won't be nano banana pro of course, but it'll be a significant improvement
>>
>>107678483
>the rest of the time is philosophical debate which is kino
this
>>
>>107678467
In grayscale, it doesn't look like a photo.
>>
File: ahahah.png (606 KB, 3057x1480)
606 KB
606 KB PNG
>>107678449
remember when they wanted to be honest and said on github that base genuinely looked bad (as it should it's just a base model) but then they had to change it because the truth nuke pill was too big to swallow? keek
>>
>>107678461
don't you need a bunch of people giving feedback to do that?
>>
File: 00019-1756343922.png (1.96 MB, 1048x1432)
1.96 MB
1.96 MB PNG
Anyone else run into this issue? I'm using ForgeUI, it was working fine about a month ago. Today, the generated faces look terrible. Even re-running a gen that previously looked fine now produces artifact on the face
>>
File: 00049-1756343922.png (1.95 MB, 1048x1432)
1.95 MB
1.95 MB PNG
>>107678630
this is the gen from a month ago. It didn't need Adetailer on face to fix.
This random artifact does not just appear on face, it randomly affects some parts on the body in other gen
>>
>>107678580
fair enough, I assume they gave them just enough money to do RLHF on the image gen part (turbo) but didn't bother for the edit part, now that Z-image base is the most anticipated model ever maybe they'll take this project more seriously and pour some more money to it, but I suspect they'll only do that for API purposes only (not gonna blame them, when you have the secret sauce the best thing to do is to keep it for yourself first)
>>
>>107678630
Honestly I'd move to Cumfy and try there, Forge is abandonware. That's what I did and the transition wasn't as painful as I thought it would be (still hate the noodle interface though).
>>
File: file.png (1.54 MB, 1750x750)
1.54 MB
1.54 MB PNG
>>107675805
qwen absolutely refuses to have someone put a gun in their mouth
>>
>>107678708
use ablit
>>
File: -.mp4 (1.31 MB, 720x720)
1.31 MB
1.31 MB MP4
>>
What percentage of gen problems do you think are skill issues vs model issues
>>
>>107678736
every issue is a model issue
>>
>>107678705
I tried using Comfy but I can't get Inpainting to work there, every guides I followed didn't work, while Forge just have working inpainting without doing any extra steps
>>
>>107678736
every issue is a skill issue
>>
>>107678748
>every guides I followed didn't work
Did u try the 1girl guide in OP?
>>
>>107678736
every problem is a skill model
>>
>>107678716
You mean an abliterated model for the clip? I didn't even consider that.
>>
>>107678736
every skill is problem model
>>
ai should spoonfeed us, its literally its purpose
>>
>>107678716
got a recommended one that you use?
>>
anon is so goddamn retarded hed rather sperg out than learn how to prompt the model
>>
>>107678781
this. 'prompt engineers' are brown tourists who don't understand the purpose of ai
>>
>>107678748
Try looking again, there are some guides on YT, I followed one when I was setting shit up months ago and everything worked (still does). I would share the workflow but it has some custom nodes, so you might get confused by that and shit won't work out of the box. Actually working better cause there's no issue there that I've been having on Forge where no matter the settings my inpaints always left a seam-like outline. Or at least it's not as visible as on Forge.
>>
>>107678781
>ai should spoonfeed us, its literally its purpose
truth nuke
>>
every issue is the seed of a seminal vessel.
>>
>>107678805
show semen
>>
File: file.png (10 KB, 769x84)
10 KB
10 KB PNG
>>107678716
>>107678767
>>107678784
why do they break up the models like this? How the hell am I supposed to use this in comfy?
>>
the "purpose" of ai is to make my penis hard faggot
>>
>>107678824
downood then use the merger
>>
>>107678832
proof?
>>
>>107678646
>>107678630
Don't be a fucking pussy about it... holy ****, zoomers these days are so goddamn soft
>>
>>107678859
Cooked his zoomer ahh.
>>
>>107678630
>>107678646
>>>/e/edg/
>>
>>107678835
>merger
is that a node, or a separate piece of software?
>>
>>107678870
https://github.com/soursilver/safetensors-merger
>>
>>107678824
Hey what model is that? I was trying to find one too
>>
can someone just upload the abliterated model?
>>
>>107678887
https://huggingface.co/huihui-ai/Huihui-Qwen3-VL-4B-Instruct-abliterated-FP8

>>107678878
Thanks, anon. Guess it's time to
>python3 -m venv env
>source env/bin/activate
>pip install -r requirements.txt
yet again
>>
>>107678859
>they are so goddamn soft
>holy ****
say the word pussy
>>
>>107678933
fuck you n****r
>>
File: N I G G E R.png (26 KB, 220x182)
26 KB
26 KB PNG
>>107678940
NIGGER
>>
>>107678824
Just ask ai to write you a python script for safetensor merging. It's literally 4 or 5 lines.
>>
>>107678708
>>107678824
>>107678878
Well, now I'm getting tensor shape errors. I'm not going to spend all night working on this for something that probably won't work in the end anyways

>>107678962
I was not aware that it was this simple.
>>
>>107678984
hey bud, you can also use gguf, that's what I do
https://huggingface.co/prithivMLmods/Qwen3-VL-4B-Instruct-abliterated-v1-GGUF
>>
>>107678940
nagger? yeah I am gonna nag you nigger, clean you room you little bitch
>>
File: ComfyUI_00604_.jpg (2.42 MB, 3584x5120)
2.42 MB
2.42 MB JPG
>>107678708
>>107678716
Turns out I could just translate "the gun is pointing into his mouth" into chinese and it kind of worked. Not his gun thoughbeit.

>>107679006
Thanks, fren. I'll give it a shot. Does this load natively in the Load CLIP node or do I need to load it in a gguf loader and pass it through somehow?
>>
with qwen image edit how can you get actual nudity? it stops at breasts and he hasnt seen pussy once in his life
>>
File: 1766526019909745.png (185 KB, 640x446)
185 KB
185 KB PNG
>>107678940
>n****r
>>
>>107679062
>it stops at breasts and he hasnt seen pussy once in his life
qwen image edit and me have a lot in common.
>>
>>107679006
>>107679033
>Does this load natively in the Load CLIP node
nevermind, I found the gguf clip loader. I'm retarded. Still giving tensor shape errors, though
>>
>>107679079
nta but are you using it with scail or what?
>>
>>107679062
even if you get total nudity you'll get wax body nudity so I don't really see the point lol
>>
>>107679091
qwen 2511
Do you know if I need the qwen-image-edit model to be gguf as well?
>>
File: z-image_00053_.png (2.47 MB, 1696x1280)
2.47 MB
2.47 MB PNG
is clownshark snake oil?
>>
File: 1755078190787802.png (947 KB, 1296x808)
947 KB
947 KB PNG
>>107678384
edit is still lots of fun, it can copy fonts too.
>>
>>107679103
no you dont
>>
>>107679119
it has "clown" in its name, the writing was on the wall kek
>>
>>107678887
So, I ended up using this model.
https://huggingface.co/silveroxides/Qwen2.5-VL-7B-MixedPrecision-ComfyUI/tree/main
It was the only one that wasn't kicking back shape errors. It also did not work. I noticed that the latents in early iterations were trying to show what I wanted, though, but then they completely flipped the gun around at the end. I took a screenshot, before it flipped the gun around. pic related.
>>
>>107679208
try reducing steps?
>>
Drawn Together? More like genned together, HA HA HA
>>
File: z-image_01298_.png (3.14 MB, 1440x1440)
3.14 MB
3.14 MB PNG
>>
File: file.png (2.87 MB, 1536x1536)
2.87 MB
2.87 MB PNG
>>107679208
this is what it actually shit out at the end
>>107679214
I'm only doing 4 steps. I don't think going lower is going to turn out well.
>>
Which Diffusion and Lora models work with 16GB VRAM?

Im using ComfyUI Portable with Manager.

I want to generate Image to Video. 5 - 10 seconds. 720p ideally.

Does anyone have a workflow they can share or know of one on civitai that can do what Im asking?

Wouldnt hurt if you can do two images, of say two people that will then partake in whatever is generated.

Ideal if it doesnt take an hour to generate a 5 second clip.

The various tutorials and packages on youtube are strictly for 24GB GPUs.
>>
>>107678759
>>107678798
reinstalling Forge didn't do anything. I'll try ComfyUI again
>>
File: file.png (1.7 MB, 1844x862)
1.7 MB
1.7 MB PNG
>>107679208
once again, this is the latent on step 2/3 vs the end result. It knows what I want. It's capable of generating it, but it completely flips, right at the end.
>>
File: file.png (1.25 MB, 1016x1024)
1.25 MB
1.25 MB PNG
>>107679253
yeah i cant get it either
>>
>>107679253
>It's capable of generating it, but it completely flips, right at the end.
shieet, do you think it has some censorship layers or some shit?
>>
>>107679208
ok but do you know why it flipped? It's because you have a good sigma on the step where it's good, but the sigma's too high on the next step.
>>
File: file.png (1.17 MB, 1016x1024)
1.17 MB
1.17 MB PNG
its something
>>
File: file.png (2.98 MB, 1536x1536)
2.98 MB
2.98 MB PNG
>>107679253
>>107679273
>>107679274
I decided to turn off the lightning 4 step lora and change the settings back to 20steps/4.0cfg and this is what I got.
>>107679276
no idea what that means
>>
>>107679304
Isn't it interesting how homosexuality is so tightly coupled with self destruction generally?
>>
>>107678824
Get a goof or a merged file.
>>
File: 1751941308463057.png (486 KB, 1362x686)
486 KB
486 KB PNG
>>107679304
OH OH OH, PERSONA
>>
>>107679308
toilet
>>
File: zimg_0011.png (1.77 MB, 960x1280)
1.77 MB
1.77 MB PNG
>>107679308
sampler/scheduler?
>>
>>107679119
It really depends on what you want to do with it. There is no universally 'good' sampler for everything, because more often than not we want to jump off a few stops before we reach some arbitrary optimal state.
>>
>>107679320
is your pic real?
>>
>>107679320
>euler/simple
comfy template default
>>
>>107679235
Guess I expected too much from 4chan its like linux threads: ask for help or info get nothing.
>>
>>107679308
what extension did you use to extract the pose?
>>
>>107679334
Sorry bud, I ignore all reddit posts.
>>
>>107679119
If you have no use for the latent blending/mixing/whatev then yes. But it has some interesting nodes.
>>
File: zimg_0015.png (1.9 MB, 960x1280)
1.9 MB
1.9 MB PNG
>>107679324
no

>>107679325
i've only seen issues with SDE samplers swapping the shit out of my gens mid-diffuse. huh, weird
>>
File: 1754336060003134.mp4 (2.22 MB, 720x720)
2.22 MB
2.22 MB MP4
>>107679219
>>
>>107679339
I'm using the pose lora posted earlier in this thread
>>
>>107679346
Bro is that steak real I'm so hungry rn
>>
>>107679344
>Reddit posts
>Using caps
Self reported.
>>
>>107679363
I'm from ifunny, file that into your report.
>>
>>107679119
ksampler sucks, no matter the flavor, vibecoded or monkeycoded.
>>
>>107679308
>>107679308
Will you share?
>>
>>107679381
share what? It's the qwen 2511 template in comfy with the lora from this thread added in
>>
>>107679392
No it isn't liar benchod
>>
File: zimg_0129.png (1.5 MB, 960x1280)
1.5 MB
1.5 MB PNG
>>107679361
no

>>107679345
>>107679321
i think it gives more variation but that's about it for what i'm doing

>>107679369
about five words away from being a useful contribution
>>
File: file.png (1.18 MB, 1016x1024)
1.18 MB
1.18 MB PNG
>>
>>107679431
Annnnd now we know why.
>>
>>107679431
why is the hand on backwards
>>
File: file.png (1.19 MB, 1016x1024)
1.19 MB
1.19 MB PNG
>>107679443
>>107679435



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.