[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


🎉 Happy Birthday 4chan! 🎉


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106830604

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: cuda_throne.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
first time qwen failed text this bad for me altho I didn't prompt it
>>
File: 1733666995063777.png (1.14 MB, 1376x760)
1.14 MB
1.14 MB PNG
the dog on the left is wearing a chinese rice farming hat, and has glowing white eyes. the dog is shooting lightning from its paws at the man in the middle who looks surprised.

qwen image edit 2509 Q8 quants + new edit lora, 8 steps

kek
>>
>>106833555
Lyra wins! Cockroachlity!
>>
File: WanVideo2_2_I2V_00511.webm (742 KB, 704x1248)
742 KB
742 KB WEBM
>Ugughugh
>I'm a street fighter
>>
File: 1742981747010385.png (1.15 MB, 1376x760)
1.15 MB
1.15 MB PNG
>>106833555
the dog on the left is wearing a chinese rice farming hat, and the dog has glowing white eyes. keep the dog's appearance the same. a large lightning bolt is approaching the man in the middle from the right.

better
>>
>>106830631
strange biting pattern, probably needs to be re-genned?
>>
>>106833588
what's the news on that story? Hasan made a video "showing" that it wasn't a dog shock collar (that is was only a vibrating one), then some twitter posts showed that it was a dog shock collar
>>
>>106833627
he lied about a shock collar not being a shock collar and /pol/ already found the exact model he used.
>>
>>106833627
It was obviously a shock collar, but the story will be spinned as Hassan wants, his fans will follow his every word, and others will just have more disgust on how utterly horrible this human being is.
>>
>>106833633
>/pol/ already found the exact model he used.
/pol/ is always useful when dealing with shit like this, that's why I can't hate this board, they fight the right fight
>>
File: 1741544148475389.png (1.02 MB, 1376x760)
1.02 MB
1.02 MB PNG
remove the man with glasses. replace the blue "HASAN" text with "KAYA". Add a large dog bone and dog treats to the room.

saved.
>>
>>106833652
qwen edit even copied the neon sign style, such a neat tool.
>>
oh, oh no
https://civitai.com/models/1901521
>>
>>106833686
https://en.wikipedia.org/wiki/Shock_collar
>>
>>106833689
wake me when i can download and shit on it properly
>>
Anyone else wish we could place shock collars on some of the posters here?
>>
>>106833689
Everyone saw this coming.
>>
Local Diffusion?
>>
File: 1754463850207224.png (1.08 MB, 1024x1024)
1.08 MB
1.08 MB PNG
the anime girl is giving a thumbs up and wearing pixelized black sunglasses.

cleansing my system with a Miku cause fuck that dog abusing faggot.
>>
File: 1750570060379494.png (54 KB, 568x202)
54 KB
54 KB PNG
>>106833689
What a sad and pathetic ending.
>>
>>106833737
they said its for a week, im talking about the shitty gens
>>
File: file.png (2.79 MB, 1280x1536)
2.79 MB
2.79 MB PNG
>>106833689
>>106833743
SD1.5 tier
>>
>>106833689
https://civitai.com/images/104983866
kek'd
>>
>>106833582
If someone trained, for example Sniper elite type of x-ray bulletcams for WanVideo, where could they publish this type of lora? (I didn't trained just curious if someone did.)
>>
>>106833758
Still cute freckles.
>>
>>106833750
>subject expands to include one more example that mentions a rabbi
>a regular anonynous poster (tm) starts kvetching like his dad was accused of something
smartest brown lmao
>>
>>106833743
"Style clusters" clearly participate on the god awful results.
>>
File: WanVideo2_2_I2V_00512.webm (699 KB, 1248x704)
699 KB
699 KB WEBM
>When you left the ez bake oven on.
>>
>>106833737
surpassed by noobai vpred and illustrious. wai v15 does literally everything you can ask for and you rarely need a lora cause it knows so many characters natively.

https://github.com/DominikDoom/a1111-sd-webui-tagcomplete

with this extension I can get exact tags for characters/styles/concepts with ease, no need to look it up. add in adetailer and controlnets and you can do basically anything easily.
>>
>>106833821
*also note it works with reforge too, you dont just need auto1111 GUI.
>>
File: 00002-3815787921.png (2.23 MB, 1024x1280)
2.23 MB
2.23 MB PNG
>>
in the grand scheme of things you are just a faggot boot licker an anti jewish and christian person. Just following orders.
>>
File: 00003-323737886.png (2.89 MB, 1024x1280)
2.89 MB
2.89 MB PNG
>>
https://old.reddit.com/r/StableDiffusion/comments/1o0v232/pony_v7_release_imminent_on_civitai_weights/nih4khl/
https://old.reddit.com/r/StableDiffusion/comments/1o0v232/pony_v7_release_imminent_on_civitai_weights/nih4h1z/
>ponyfag working with lodestones to train v8 on qwen
fucking lol. i guess chroma came and went.
>>
>>106833928
and you complain?
>>
File: 104970654.jpg (230 KB, 1024x1536)
230 KB
230 KB JPG
https://civitai.com/images/104970654
>With big text "It's so real" on top and small text "You can almost taste it!" on bottom.
lel
>>
>>106833945
yes
>>
enough talking to the retarded hate everything crowd tonight.
>>
File: WanVideo2_2_I2V_00513.webm (658 KB, 704x1248)
658 KB
658 KB WEBM
>Oh noo. I can't stop worlding my warcraft!
>>
File: 1750262188069430.png (2.12 MB, 1040x1560)
2.12 MB
2.12 MB PNG
>>
how excruciatingly slow would doing something like this >>106833582 on rtx 20xx 8gb be ?
>>
>>106834030
At that res? You probably could have saved up for a new gpu in that time otherwise what like... ten minutes? idk.
>>
>>106834030
probably quicker with cpu
>>
>>106834033
>... ten minutes? idk.
lol no.
>>
File: qwen syuen bf16 test.png (1.33 MB, 1584x1312)
1.33 MB
1.33 MB PNG
I downloaded Qwen 2509 BF16 and tested it. It's capable of matching Nanobanana in terms of creating backside view for weird and complicated costumes. The Qwen 2509 FP8 e4m3fn quantize version did not have good understanding of avant garde character designs. Very happy that I can kick nanobana to the curb with its bullshit filters.

DisTorch2 Model Final Device/Layer Assignments
--------------------------------------------------
Device Layers Memory (MB) % Total
--------------------------------------------------
cuda:0 (<0.01%) 485 2.33 0.0%
cuda:0 447 20543.58 52.7%
cuda:1 396 18421.99 47.3%
--------------------------------------------------
[MultiGPU_DisTorch2] DisTorch loading completed. Total memory: 38967.90MB
100%|| 8/8 [00:20<00:00, 2.51s/it]
Requested to load WanVAE
loaded completely 4538.394530296326 242.02829551696777 True
Prompt executed in 48.43 seconds


https://github.com/pollockjj/ComfyUI-MultiGPU
>>
>>106834033
>>106834037
is wan iq3_km adequate?
>>
>>106834053
No idea, but in my experience anything below Q4 is basically unusable and even Q4 itself is a huge compromise.
>>
>>106834030
maybe 10-15 mins? you might have to find out.
>>
Anyone use Pinokio? I hear it is very messy to uninstall and akin to bloatware.
>>
>>106834049
Huh, did multigpu get better or something?
>>
>>106834078
What is pinokio doing that you can't?
>>
>>106834100
Makes installing much easier.
>>
>>106834078
Its bloatware. But so the others. The only thing thats not bloatware is if the program is written in C/C++ and then uses dll libraries for clean usage.

Like stable-diffusion.cpp. Clean/simple shit.
>>
>>106834111
I've only ever seen pinokio in reference to issues with it. I feel it's better to spend five minutes writing the basic command line arguments to install a repo rather than obfuscate it behind whatever pinokio is doing.
>>
>>106834049
I had no speed difference between using ram + gpu and second gpu vram instead.
So it was faster (twice as fast) for me to use 2 separate comfyui sessions, each one with one gpu and offloading to ram.
Now I had 2 x 3090, not sure what you use.
>>
I am genuinely confused as to wheter ramtorch is actually a real thing or just the furfag dipping his toes where they don't belong again and causing a fuss over nothing.
>>
>>106833928
More like Auraflow came and went.
>>
>>106834128

Interesting strategy, but I don't prompt fast enough for the need for batch size 2 gen.

5090+4090+ 124gb sysram.
>>
>>106834133
He's training Radiance with it and Ostris seems to mange to train Qwen with it.

Also when did he cause a fuss over nothing? Altering the Flux Schnell model and training Chroma on it also worked. Radiance works too after adopting the PixNerd scientific paper into Chroma.
>>
>>106834164
>Also when did he cause a fuss over nothing?
I don't like this tone. I'm going to assume you're one of his lackeys and hide you response. Can I get a response from someone who isn't a member of his patreon?
>>
>>106834133
>>106832028
>>
File: WanVideo2_2_I2V_00514.webm (965 KB, 1248x704)
965 KB
965 KB WEBM
>Braps then leaves.
>>
>>106834142
It's just a way to prompt more, I copy paste the workflow in both, it's essentially giving me twice the speed.
Of course, this is useful for slow stuff like wan.
>>
Can anyone post a funny?
>>
File: 1739774532460391.png (1.28 MB, 1040x1560)
1.28 MB
1.28 MB PNG
>>
File: AnimateDiff_00329.mp4 (1.91 MB, 1280x720)
1.91 MB
1.91 MB MP4
>>
>>106834049
Did you try the GGUF Q8? I got the impression it's one of the most used variants around here.

>>106834182
> this tone
don't make asspulls if you don't want to be challenged on them.
>>
>>106834269
>Challenged.
Don't take a accusatory tone with me. Blocked.
>>
>>106834256
Apparently wan doesn't understand how violence works. Battle Royale with Cheese?
>>
>>106834287
yes, and I'd guess most chinese video models will filter violence
>>
File: AnimateDiff_00331.mp4 (1.91 MB, 1280x720)
1.91 MB
1.91 MB MP4
>>106834287
yes, but lightx2v makes it worse
this one is without lora
>>
File: 1735082152733117.png (1.91 MB, 1000x1127)
1.91 MB
1.91 MB PNG
>>106833689
>in the past:"guys buy a high pc, let's goooo"
>now:"only api bros, no one care about your rtx 500gb vram"
>>
>>106834360
No, I think it'll work on rather normal hardware. It just probably isn't very good overall.
>>
>>106833928
now he's gonna finetune a 20b model? jesus, how much money does he have?
>>
>>106834414
He could slap his supporters in the face and fuck their sister and they would still donate to him.
>>
File: 1730278165465832.png (1.55 MB, 1728x1344)
1.55 MB
1.55 MB PNG
>>
File: WanVideo2_2_I2V_00515.webm (2.09 MB, 1488x848)
2.09 MB
2.09 MB WEBM
>>
>>106834424
he DID release pony and you can complain about AuraFlow - but it wasn't an impossible idea and it's not a rugpull or anything.

most commercial models devoured more money behind the scenes too, we just didn't see it.
>>
>>106834445
bruh, it took him more than a year to get this piece of shit lol, by the time he has finished finetuning Qwen, we'll probably have a stronger base model, he's taking way too long
>>
https://www.reddit.com/r/StableDiffusion/comments/1o1u2zm/text_encoders_in_noobai_are_dramatically_flawed_a/
Uh oh, noobai sissies, how do we respond?
>>
File: 1744868365595611.png (2.91 MB, 1728x1344)
2.91 MB
2.91 MB PNG
>>106833928
holy fuck v8 is going to be a disaster. the furry has been ok but flawed. the horsefucker is going to drag him down.

>>106834455
exactly, qwen is fun but too fucking bloated. we need a new model that has a local-friendly architecture that uses newer optimizations.
>>
>>106834466
Is this the same clip schizo as always?
>>
>>106834472
no, looks like someone else
>>
>>106834476
Is it possible to have a clip schizo get to their point in under three paragraphs?
>>
>>106833928
>ponyfag working with lodestones
I mean...
>>
>>106834486
kek you have a point, everytime a guy talks about clip he writes a fucking bible about it, I don't know why
>>
File: radiance.png (2.94 MB, 848x1488)
2.94 MB
2.94 MB PNG
>>106834455
so far it is not looking like anyone is publishing local and open nsfw (including furry and most of *booru) base models

these finetuning efforts that currently take months will likely usually be first for now
>>
>>106834443
How is it such high quality?
>>
>>106834506
BUNDA BUNDA BUNDA
>>
>>106834506
>such high quality
there's a lot of motion weirdness goin on though, and after looking at thousands of sora's videos, Wan looks like shit now :(
>>
>>106833928
fuck it we needed a qwen finetune anyway
>>
>>106833928
desu if I was lodestone I would take it personally, the ponyfag is literally making him understand that his chroma model is shit and is not deserving of a finetune lmao
>>
>The anti chroma schizo was right about the qwen finetune all along
>>
>>106834466
Stopped reading at "Noob 1.1" as the last and defacto release is vp 1.0
>>
File: radiance.png (2.65 MB, 848x1488)
2.65 MB
2.65 MB PNG
>>106834471
>we need a new model that has a local-friendly architecture that uses newer optimizations.
that was likely what he tried with auraflow but obviously it didn't go as well as hoped?

yes, neta yume lumina and chroma worked out better but that's also somewhat larger models... still it's fairly obvious to me why they'd want qwen also with the edit mode features.
>>
File: 1756338247316909.png (137 KB, 2092x714)
137 KB
137 KB PNG
>>106833928
I don't get it, he wants to transform Qwen Image into an edit model? then why not finetuning Qwen Image Edit instead?
>>
>>106834501
Neta Yume Lumina isn't even a month old and has great potential.
>>
this pony faggot is intentionally making his models worse by censoring the dataset and removing artists time and time again and yet you dumbasses still have some hope in him?
>>
>>106834589
I don't, I'm passing time desu kek
>>
>>106834589
>and yet you dumbasses still have some hope in him?
this thread is not a collective consciousness dont lump me in with the rewards
>>
>>106834599
you're such a reward anon!
>>
>>106834584
unless that notorious fraud actually makes a noobxl size finetune i don't see it actually being something besides a 1girl simulator
>>
>>106834589
the lode of stones will set him straight.
>>
File: WanVideo2_2_I2V_00516.webm (2.7 MB, 848x1488)
2.7 MB
2.7 MB WEBM
>>
File: radiance.png (2.63 MB, 848x1488)
2.63 MB
2.63 MB PNG
>>106834584
yes, certainly, that one still seems to be learning useful stuff!

the overall tuning of lumina, neta lumina to neta yume lumina took a while tho. it is simply good that it continued working until now and therefore likely more into the future.

for auraflow from what I gather it hit a wall and retries and trying to "go around the blockage" figuratively didn't seem to work, so unless you had ideas how to fix the architecture I guess it's better to abandon auraflow
>>
this level of ESL has to be SEA
i know its you, jungle asian
>>
>>106834638
You have to link to the posts you're accusing, schizo.
>>
File: radiance.png (2.85 MB, 848x1488)
2.85 MB
2.85 MB PNG
>>106834506
yea, wan is great. it could be even a little better if you burn more gpu time... most don't have the patience tho
>>
Does the different scaled models change animation dramatically or is it mainly the quality that differs?
When trying to make a prompt work, would it make make sense to spam them on the lower quality models to then get final results on tbe bigger models?
>>
File: WanVideo2_2_I2V_00517.webm (1.76 MB, 848x1488)
1.76 MB
1.76 MB WEBM
>When you forgot the alter the framerate
>>
>>106834615
>i don't see it actually being something besides a 1girl simulator
Some of the examples show multiple. In fact, one of them shows 5girls all individually prompted. https://civitai.com/images/99675719
>noobxl size finetune
Obviously desired but not crucial for adoption. Illustrious was great even before it was dicked down with e621.
>>
File: file.png (2.88 MB, 848x1488)
2.88 MB
2.88 MB PNG
>>106834680
the quants can affect what happens in the animation pretty strongly. it isn't just "the same with less visual fidelity".
>>
>>106834680

Video gen is a wild horse. 1 settings different and the whole motion flow changes. I doubt you can swap quantization and get the same motion. There is some semblance of control if you use start/end frames. But even that doesn't guarantee reproducibility.
>>
>>106834696
>>106834711
Damn, guess I'll stick to 3-4min gens.
>>
File: file.png (1.27 MB, 1280x720)
1.27 MB
1.27 MB PNG
Diella lora when.
>>
>>106834689
i'm afraid without e621 you can forget about complex porn positions. and i don't want another model that's only capable of standing and pov cowgirl.
>>
File: catbox_mzdun7.png (1.72 MB, 1040x1520)
1.72 MB
1.72 MB PNG
Beginner here, using SwarmUI. Trying to replicate civitai images and it always comes out slightly wrong.

https://civitai.com/images/105020580 as a random example.

It always has this weird yellow tint, it's blurrier even with me using remacri as an upscaler, and it's less detailed. The detail and blur I guess I can solve by upping the refiner upscale, but I can't get rid of that weird yellow tint.
>>
File: WanVideo2_2_I2V_00518.webm (677 KB, 1488x848)
677 KB
677 KB WEBM
>>
>>106834745
What's an example of a position Noob can do that Illustrious can't? But if we're talking about which model has the easiest path to noob-but-better while inference still being within the reach of the majority, I can't think of another besides Yume.
>>
>>106834443
oh shiiii
>>
File: WanVideo2_2_I2V_00519.webm (1.78 MB, 848x1488)
1.78 MB
1.78 MB WEBM
>>
cozy
>>
File: WanVideo2_2_I2V_00520.webm (1.25 MB, 1488x848)
1.25 MB
1.25 MB WEBM
>>
File: ComfyUI_0003-urd.png (2.4 MB, 1728x1296)
2.4 MB
2.4 MB PNG
>>106832615
Model isn't quite as good for loraless Urd as it is for Bell.
>>
How come China is so based

https://files.catbox.moe/ztpndq.mp4
>>
File: ComfyUI_temp_auxpg_00001_.png (2.83 MB, 1728x1296)
2.83 MB
2.83 MB PNG
>>106835058
>>
Is this node banned or something? It won't download and other people report it missing as well but the links to those pages are fucking deleted.
>>
File: ComfyUI_temp_irfic_00001_.png (3.17 MB, 1296x1728)
3.17 MB
3.17 MB PNG
>>106835116
>>
Is a gpu with 16gb of vram much faster than one with 12 when using wan?
>>
File: radiance.png (3.13 MB, 848x1488)
3.13 MB
3.13 MB PNG
>>106835115
getting wan 2.2 and animate was extremely nice, yea
>>
>>106835171
Extremely cool texture.
>>
File: radiance.png (2.66 MB, 848x1488)
2.66 MB
2.66 MB PNG
>>106835157
depends on the exact settings, but usually not a huge amount. it's more noticeable for the 4090 and 5090 because they have both more RAM and quite a lot more processing power.

same could again be said for H200 presumably but no one here seems to have that.
>>
>>106835132
There were and there always will be node which fail to be installed via Manager. Use git clone manually
>>
>>106835132
the official seedvr2 node is garbage, you gotta install from here:
https://github.com/AInVFX/ComfyUI-SeedVR2_VideoUpscaler/tree/nightly

youre welcome
>>
can I use qwen lora on qwen edit?
>>
>>106834016
>>106833582
lora for this feel?
>>
With large enough batch, you can even use nvme ssd on ramtorch.
>>
>>106834745
>i'm afraid without e621 you can forget about complex porn positions
Bbbut... Don't animals have different porn positions than humans, including those reverse joints/elongated feet, somesuch. Won't it mess up concepts?
>>
File: WanVideo2_2_I2V_00521.webm (2.35 MB, 848x1488)
2.35 MB
2.35 MB WEBM
omg migu?!
>>
>>106835115
Any thoughts on how to smooth oth the tremor in the DW "bones"?
>>
>>106834745
idk about yume but neta lumina did have e621 in it's dataset
>>
>>106835658
I can't believe she feels empowered
>>
Realistic models for i2i refinement after qwen edit?
>>
>>106835058
>>106835116
>>106835135
very nice
>>
>>106835796
maybe you can use a flux finetune but even then I'm not sure you won't lose more than you gain unless you limit it to masked/segmented areas.
>>
>>106832401
Sometimes colors work, others it doesn't get recognized and gives you different colors each time.

Something like coral gives you actual corals sometimes. Pantone could work but I'm not sure, has no one tested this?
>>
holy fuck v7 is worse than v6
how did he manage to fuck it up that badly
aside from removing artists, using auraflow



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.