/g/ - Technology

File: highlights_g_106609272.webm (1.99 MB, 2048x1184)
Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106609272

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>>106613605
If it weren't for that crying girl the entire bottom row would be mine. I'd have a collage bingo.
>>
File: 1754245415326462.png (878 KB, 1024x1024)
https://files.catbox.moe/qphnpf.jpg
Repeated reminder to not use Chroma HD/Flash HD. Base/2K + flash lora is a good speedy starting point. Base is also the most suited for second pass/upscale.
>>
File: radiance.png (1.48 MB, 1488x832)
>>106613615
not bad, anon
>>
>>106613629
Kek that glitch skirt.
>>
nunchaku team, wtf are you doing, where is the promised wan support??
>>
File: radiance.png (2.44 MB, 1488x832)
>>
whats better, scaled fp8 or q8??? BROS??
>>
>>106613641
Sorry some literal who just released a model nobody will use so we've diverted all our resources to making that work.
>>
>>106613648
same quality, scaled is pretty good
>>
>>106613647
nice SD1.4 image anon, I too love nostalgia
>>
>>106613641
Bro Wan3 is dropping soon. Give up.
>>106613648
Q8
>>106613655
No
>>
>>106613663
>Wan3 is dropping soon
source??
>>
File: radiance.png (3.27 MB, 1488x832)
>>106613648
q8 might be better but it's not guaranteed
>>
>>106613668
The blue dragon probably. Some say he is wisest in all of China.
>>
>>106613648
I prefer scaled, but both are fine.
>>
>>106613663
>>106613668
let's hope they got rid of the dual model meme. With the lightx2v lora, it takes more time to unload/reload the second model than to do the inference part
>>
File: WanVideo2_2_I2V_00425.webm (677 KB, 768x1056)
>>
Has anyone experimented with the wanvideo context options node? Supposedly allowing you to gen a bit longer stuff?

>>106613648
Scaled fp8/16 seems to give me better results for more static videos for loops, while q8 can do a lot of motion. This is for a first frame-last frame loop workflow.
>>
>>106613702
>Has anyone experimented with the wanvideo context options node? Supposedly allowing you to gen a bit longer stuff?
that doesn't seem to be a usable thing with the way wan 2.2 works
>>
File: radiance.png (3.19 MB, 1488x832)
>>106613661
perhaps so, it'd however be very inflexible

train 1.4 model to support more fetish boots with ballet outfit, probably only get these after that
>>
File: radiance.png (2.98 MB, 1488x832)
>>
so is small/flat chests impossible on wan2.2? I want to gen some porn of fit track runners.
>>
>>106613729
do it with qwen
>>
>>106613709
The bane of open source I guess. New things come out and the previous thing doesn't work.
>>
File: radiance.png (3.22 MB, 1488x832)
>>106613729
in i2v as far as I can tell it's almost only that huge breasts shrink, not that small ones grow (specific lora excluded)
>>
File: 1750591726684038.png (66 KB, 279x181)
>>106613758
>It's a testament to the perils of the sunk cost fallacy. He's burnt so much money and obviously hasn't released v7 just because the results were so shockingly bad that it would instantly make ponysisters rope. This can't end well.
I want him to release v7 though, it would be so funny
>>
https://www.reddit.com/r/comfyui/comments/1niddkv/the_comfy_oath_carved_in_stone_free_forever/

holy cringe.
>>
>>106613808
10-year-old me would have been very impressed.
>>
File: 1750277244298310.png (113 KB, 655x621)
is this snakeoil?
>>
>>106613808
Its reddit so they need to pander to their brand of retardation a bit
>>
>>106613850
Nag isn't. But delete torch compile.
>>106613853
Not even reddit is buying it lol.
>>
>>106613850
NAG works, but radial attention is piss
>>
File: 1740528890859938.png (3.42 MB, 3828x1133)
>>106613850
nag works really well on kontext, dunno for wan though
>>
>>106613850
No, it's WanVideo
>>
>>106613808
Cringe yes, but at least he kept his word
>>
>>106613872
>he kept his word
... yet
>>
File: 1735480823767862.mp4 (980 KB, 480x672)
the man carrying boxes on his back runs to his left into an amazon warehouse, where a large amazon logo is above the door.

amazon stranding is real.

disabled high 2.2 lightx2v, low enabled, 6 steps. works like a charm, high enabled kills the motion.
>>
>>106613872
to appease the peasants while they laugh, sure.
>>
>>106613882
Ehh, ok
>>
File: the real GOTY.mp4 (3.63 MB, 864x608)
>>106613883
top kek, if this game wins the GOTY it won't be funny at all though
>>
File: file.png (3.05 MB, 1488x832)
>>106613808
the wording is definitely a bit... but the core of it is great

i suppose one day we can generate eminence in the shadows: comfy edition
>>
File: WanVideo2_2_I2V_00427.webm (752 KB, 768x1056)
>>
File: file.png (3.01 MB, 1488x832)
>>106613883
that turned out great.
>>
File: 1755021396344206.jpg (2.05 MB, 2432x3984)
>>106613648
Forget the 4/5xxx series copers, Q8 is basically fp16 while fp8_scaled is quite different every time.
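For anyone who wants to poke at the Q8 vs fp8_scaled claim numerically instead of by eye, here is a toy sketch. It assumes "Q8" means GGUF Q8_0-style blockwise int8 (one scale per 32 weights) and that "fp8_scaled" means E4M3 with a single per-tensor scale. Both are simplifications, the function names are made up for illustration, and random Gaussian weights are not a real checkpoint, so treat the output as rough intuition only.

```python
import numpy as np

def quantize_q8_0(w, block=32):
    """Blockwise int8 round trip: one scale per `block` values (Q8_0-style).
    Simplification: the real format stores the scale in fp16; here it stays fp64."""
    blocks = w.reshape(-1, block)
    scale = np.abs(blocks).max(axis=1, keepdims=True) / 127.0
    scale[scale == 0] = 1.0  # avoid dividing by zero on all-zero blocks
    q = np.clip(np.round(blocks / scale), -127, 127)
    return (q * scale).reshape(w.shape)

def round_to_e4m3(x):
    """Round to the nearest fp8 E4M3 value (3 mantissa bits, max magnitude 448)."""
    x = np.clip(x, -448.0, 448.0)
    _, exp = np.frexp(x)             # |x| = m * 2**exp with m in [0.5, 1)
    e = np.maximum(exp - 1, -6)      # unbiased exponent, clamped at the smallest normal
    step = np.ldexp(1.0, e - 3)      # spacing between neighbouring values at that exponent
    return np.round(x / step) * step

def quantize_fp8_scaled(w):
    """fp8 round trip with one per-tensor scale (assumed scheme, see lead-in)."""
    scale = np.abs(w).max() / 448.0
    if scale == 0:
        return w.copy()
    return round_to_e4m3(w / scale) * scale

def rel_rmse(ref, approx):
    """Root-mean-square error relative to the RMS of the reference."""
    return float(np.sqrt(np.mean((ref - approx) ** 2)) / np.sqrt(np.mean(ref ** 2)))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w = rng.normal(0.0, 0.02, size=(256, 256))  # stand-in for a weight matrix
    print("Q8_0-style  rel. RMSE:", rel_rmse(w, quantize_q8_0(w)))
    print("fp8 scaled  rel. RMSE:", rel_rmse(w, quantize_fp8_scaled(w)))
```

This only measures weight reconstruction error. How much of that survives into the final image or video depends on the model and the prompt, which is what the same-seed grids are for.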
>>
>>106613989
yep, nothing can beat Q8, I wish the nunchaku guys focused on making Q8 fast instead of coping with some fp4 shit
>>
File: 1735065982504345.jpg (444 KB, 3456x1221)
>>106613989
>b-b-but fp8_scaled CAN look OK!!!
Yeah, you can RNG your way into something that looks OK, since images have a high capacity for absorbing error in places where it doesn't matter. But none of that is relevant when moving away from the base fp16 model is objectively gonna be worse in general, and especially for details.
>>
>>106614009
damn it's basically a different model
>>
File: 1726700437058009.webm (1.65 MB, 480x672)
go amazon man go!
>>
>>106613688
you don't have enough ram, it should only take half a sec
>>
File: 1754092875565198.jpg (2.04 MB, 7961x2897)
>>106614023
and it gets worse the further you go
>>
>>106614005
nunchaku is even better. The parts of it that would degrade are fp16
>>
File: 1742966409880417.webm (1.33 MB, 480x672)
before the issue was slow motion. now it can be sanic fast with the high 2.2 lora disabled.
>>
>>106614059
I think three steps is too low desu. 4 was around what the paper outlines.
>>
>>106614056
>nunchaku is even better.
it's not better than Q8 you're delusional
>>
>>106614080
it legit is closer to fp16 than Q8
>>
>>106614056
>nunchaku is even better.
You should be in an insane asylum. Nunchaku is good, but it's not better than q8
>>
>>106614094
Q8 has more detail degradation, nunchaku only looks different style-wise per seed
>>
>>106614092
prove it, show a comparison image between bf16, Q8 and nunchaku
>>
>>106614005
They did in the paper, their 8 bit method is basically perfect and also supports SDXL
>>
>>106614112
>in the paper
nigga
>>
>>106614112
they compare that to INT8, this shit is worse than fp8 (and even worse than Q8), it's not a good comparison
>>
>>106614112
Sir, I'm from the asylum. Please come with us, you need help.
>>
>>106614112
our 0.7b LLM model beats the <latest top trillion param model> on this benchmark we specifically finetuned it for its basically better than that model now!!!!!! tier retardation
>>
All the AI papers are fucking useless. Only thing that holds any value is same seed comparison between models.
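If same-seed comparisons are the yardstick, it helps to put a number on them rather than squinting at a grid. A minimal sketch, assuming you already have the reference (bf16/fp16) output and each quant's output for the same seed and prompt as equally sized uint8 arrays. `psnr` and `rank_against_reference` are illustrative names, and PSNR is only a crude pixel-level metric that will not capture style drift.

```python
import numpy as np

def psnr(ref, test, peak=255.0):
    """Peak signal-to-noise ratio in dB between two same-shaped images."""
    ref = np.asarray(ref, dtype=np.float64)
    test = np.asarray(test, dtype=np.float64)
    if ref.shape != test.shape:
        raise ValueError("images must have the same shape")
    mse = np.mean((ref - test) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return float(10.0 * np.log10(peak ** 2 / mse))

def rank_against_reference(ref, candidates):
    """Return (name, psnr) pairs sorted from closest to furthest from `ref`."""
    scores = [(name, psnr(ref, img)) for name, img in candidates.items()]
    return sorted(scores, key=lambda item: item[1], reverse=True)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    ref = rng.integers(0, 256, size=(64, 64, 3), dtype=np.uint8)

    def noisy(sigma):
        # Stand-in for a quantized model's output: the reference plus noise.
        return np.clip(ref + rng.normal(0, sigma, ref.shape), 0, 255).astype(np.uint8)

    for name, score in rank_against_reference(ref, {"quant_a": noisy(2), "quant_b": noisy(10)}):
        print(f"{name}: {score:.1f} dB")
```

Run it over several seeds and prompts before drawing conclusions; a single pair can land either way.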
>>
>>106614112
don't cite papers here, they can tell the truth, only trust your gut and tell stupid shit with confidence
>>
>>106614159
>only trust your gut
*eyes >>106614050
>>
>>106614159
>qwen has high aesthetic quality! the paper said so!!
>>
>>106614140
are you retarded? it's their own int8 method not naive int8
their int4 and nvfp4 are better than q4
>>
File: qwen-image.jpg (2.81 MB, 5924x5708)
>>106614110
you'll have to wait till I get home but they have this
>>
>>106614165
there is neither scaled nor nunchaku stuff there, so you're right, be even more confident!
>>
>>106614159
>they can tell the truth
30% of the time yes
https://en.wikipedia.org/wiki/Replication_crisis
>A 2016 survey by Nature on 1,576 researchers who took a brief online questionnaire on reproducibility found that more than 70% of researchers have tried and failed to reproduce another scientist's experiment results
>>
>>106614175
>be even more confident!
>>106614056
>nunchaku is even better.
yep, that's confidence, always trust a random anon, if he says so, that's true
>>
I trust myself.
I made videos with fp8 scaled, and ones with q8, no difference in output, but the fp8 scaled was faster.
>>
File: 00008-1320046499.png (1.57 MB, 896x1152)
>>
Assuming I've got a shitrig of an old server from the 2010s running Nextcloud and Lyrion, how viable would putting a modern GPU in there for SD be?
I'm wondering how much of a bottleneck the old chipset/CPU/RAM would be.
>>
Have you ever made a claim so retarded the entire general fell into chaos?
>>
>>106614173
the problem with this comparison was always that it's too basic, with huge room for error in the image. you can fuck it up during inference a lot and as long as it's vaguely a book shop of books with correct words on it, it's good

gen a realistic crowd of different people of different clothes/races all holding different objects engaged in battle for example or other similar complex prompts, it will shit itself
>>
>>106614222
aesthetic af
>>
>>106614200
Surely you tested it on multiple seeds on complex motion and action prompts, right... right? Oh...
fp8 scaled blurs the motion
>>
>>106614200
>no difference in output
if you only asked for "1girl, walking" then yeah you don't need a solid quant to do this, it depends on each case
>>
>>106614241
>aesthetic
Lucky gen. had to (badly) airbrush the little man out of it.
>>
>>106614228
not hard, when the entire general already has below average intelligence.
>>
>>106614225
2010s is a bit vague. Probably most important is that it's at least PCIe 4.0, and you want your models on a fast NVMe SSD. If you offload to the CPU (you most likely will for video gen unless you get a 5090 at minimum), then system RAM speed matters a lot, and after that CPU speed.
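To get a feel for the bus part of that advice, here is a back-of-envelope sketch. The x16 bandwidth figures are theoretical per-direction maxima (real transfers come in lower), and the 14 GB model size is just an assumed example, not a measurement of any particular checkpoint.

```python
# Theoretical one-direction bandwidth of an x16 slot, in GB/s.
PCIE_X16_GBPS = {"2.0": 8.0, "3.0": 15.75, "4.0": 31.5}

def transfer_seconds(size_gb, gen, lanes=16):
    """Best-case time to push `size_gb` of weights across a PCIe link."""
    per_lane = PCIE_X16_GBPS[gen] / 16
    return size_gb / (per_lane * lanes)

if __name__ == "__main__":
    model_gb = 14.0  # assumed model size, purely illustrative
    for gen in PCIE_X16_GBPS:
        secs = transfer_seconds(model_gb, gen)
        print(f"PCIe {gen} x16: {secs:.2f} s per full model swap (best case)")
```

That is the bus alone. If the weights have to come from disk rather than system RAM, the drive will usually be the slower link.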
>>
>>106614273
>when the entire general already has below average intelligence.
It's your fault, your score is so low that it brought the average down to a ridiculous level.
>>
File: screenshot_50210.jpg (69 KB, 600x764)
just unfucked my lora thanks /g/
>>
Radiance is strange because it loves to slap super fine threads throughout the image.
>>
File: 1749568837981.jpg (171 KB, 1187x1944)
So is it a better idea to train a character lora and a pose lora, or train the character and the pose in one lora?
>>
My friend is an architect and he wants to use AI to enhance his images. I haven't image genned since the Dreambooth days (I primarily just video gen now), how should I go about this? SD with some realism loras + control net with depth map?
>>
There's not gonna be a real VACE 2.2 is there?
>>
>>106614364
What model? For poses you can use controlnet.
>>
not sure if this is the right place to ask this, but can image to video gens be profitable? or is there a good chance that the original owner can sue your ass into oblivion?
>>
>>106614409
You'd be the first.
>>
>>106614335
>furshit
>>
>>106614409
Do you mean like having a porn patreon focused on i2v content? In that case I think you'd want to gen your own images.
>>
nunchaku wan WHEN WHEN WHEN WEHN


