[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


Janitor applications are now being accepted. Click here to apply.


[Advertise on 4chan]


File: highlights_g_106609272.webm (1.99 MB, 2048x1184)
1.99 MB
1.99 MB WEBM
Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106609272

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>>106613605
If it weren't for that crying girl the entire bottom row would be mine. I'd have a collage bingo.
>>
File: 1754245415326462.png (878 KB, 1024x1024)
878 KB
878 KB PNG
https://files.catbox.moe/qphnpf.jpg
Repeated reminder to not use Chroma HD/Flash HD. Base/2K + flash lora is a good speedy starting point. Base is also the most suited for second pass/upscale.
>>
File: radiance.png (1.48 MB, 1488x832)
1.48 MB
1.48 MB PNG
>>106613615
not bad, anon
>>
>>106613629
Kek that glitch skirt.
>>
nunchaku team, wtf are you doing, where is the promised wan support??
>>
File: radiance.png (2.44 MB, 1488x832)
2.44 MB
2.44 MB PNG
>>
whats better, scaled fp8 or q8??? BROS??
>>
>>106613641
Sorry some literal who just released a model nobody will use so we've diverted all our resources to making that work.
>>
>>106613648
same quality, scaled is pretty good
>>
>>106613647
nice SD1.4 image anon, I too love nostalgia
>>
>>106613641
Bro Wan3 is dropping soon. Give up.
>>106613648
Q8
>>106613655
No
>>
>>106613663
>Wan3 is dropping soon
source??
>>
File: radiance.png (3.27 MB, 1488x832)
3.27 MB
3.27 MB PNG
>>106613648
q8 might be better but it's not guaranteed
>>
>>106613668
The blue dragon probably. Some say he is wisest in all of China.
>>
>>106613648
I prefer scaled, but both are fine.
>>
>>106613663
>>106613668
let's hope they got rid of the dual model meme, with the lightvx lora, it's taking more time to unload/reload the second model than doing the inference part
>>
File: WanVideo2_2_I2V_00425.webm (677 KB, 768x1056)
677 KB
677 KB WEBM
>>
Has anyone experimented with the wanvideo context options node? Supposedly allowing you to gen a bit longer stuff?

>>106613648
Scaled fp8/16 seems to give me better results for more static videos for loops, while q8 can do a lot of motion. This is for a first frame-last frame loop workflow.
>>
>>106613702
>Has anyone experimented with the wanvideo context options node? Supposedly allowing you to gen a bit longer stuff?
that doesn't seem to be a usable thing with the way wan 2.2 works
>>
File: radiance.png (3.19 MB, 1488x832)
3.19 MB
3.19 MB PNG
>>106613661
perhaps so, it'd however be very inflexible

train 1.4 model to support more fetish boots with ballet outfit, probably only get these after that
>>
File: radiance.png (2.98 MB, 1488x832)
2.98 MB
2.98 MB PNG
>>
so is small/flat chests impossible on wan2.2? I want to gen some porn of fit track runners.
>>
>>106613729
do it with qwen
>>
>>106613709
The bane of open source I guess. New things come out and the previous thing doesn't work.
>>
File: radiance.png (3.22 MB, 1488x832)
3.22 MB
3.22 MB PNG
>>106613729
in i2v as far as I can tell it's almost only that huge breasts shrink, not that small ones grow (specific lora excluded)
>>
File: 1750591726684038.png (66 KB, 279x181)
66 KB
66 KB PNG
>>106613758
>It's a testament to the perils of the sunk cost falacy. He's burnt so much money and obviously hasn't released v7 just because the results were so shockingly bad that it it would instantly make ponysisters rope. This can't end well.
I want him to release v7 though, it would be so funny
>>
https://www.reddit.com/r/comfyui/comments/1niddkv/the_comfy_oath_carved_in_stone_free_forever/

holy cringe.
>>
>>106613808
10 years old me would have been very impressed.
>>
File: 1750277244298310.png (113 KB, 655x621)
113 KB
113 KB PNG
is this snakeoil?
>>
>>106613808
Its reddit so they need to pander to their brand of retardation a bit
>>
>>106613850
Nag isn't. But delete torch compile.
>>106613853
Not even reddit is buying it lol.
>>
>>106613850
NAG works, but radial attention is piss
>>
File: 1740528890859938.png (3.42 MB, 3828x1133)
3.42 MB
3.42 MB PNG
>>106613850
nag works really well on kontext, dunno for wan though
>>
>>106613850
No its WanVideo
>>
>>106613808
Cringe yes, but at least he kept his word
>>
>>106613872
>he kept his word
... yet
>>
File: 1735480823767862.mp4 (980 KB, 480x672)
980 KB
980 KB MP4
the man carrying boxes on his back runs to his left into an amazon warehouse, where a large amazon logo is above the door.

amazon stranding is real.

disabled high 2.2 lightx2v, low enabled, 6 steps. works like a charm, high enabled kills the motion.
>>
>>106613872
to appease the peasants while they laugh, sure.
>>
>>106613882
Ehh, ok
>>
File: the real GOTY.mp4 (3.63 MB, 864x608)
3.63 MB
3.63 MB MP4
>>106613883
top kek, if this game wins the GOTY it won't be funny at all though
>>
File: file.png (3.05 MB, 1488x832)
3.05 MB
3.05 MB PNG
>>106613808
the wording is definitely a bit... but the core of it is great

i suppose one day we can generate eminence in the shadows: comfy edition
>>
File: WanVideo2_2_I2V_00427.webm (752 KB, 768x1056)
752 KB
752 KB WEBM
>>
File: file.png (3.01 MB, 1488x832)
3.01 MB
3.01 MB PNG
>>106613883
that turned out great.
>>
File: 1755021396344206.jpg (2.05 MB, 2432x3984)
2.05 MB
2.05 MB JPG
>>106613648
Forget the 4/5xxx series copers, Q8 is basically fp16 while fp8_scaled is quite different every time.
>>
>>106613989
yep, nothing can beat Q8, I wished the nunchaku guys focused on making Q8 fast instead of coping with some fp4 shit
>>
File: 1735065982504345.jpg (444 KB, 3456x1221)
444 KB
444 KB JPG
>>106613989
>b-b-but fp8_scaled CAN look OK!!!
Yeah, you can RNG your way into something that looks OK since images have a high capacity of containing error but in places where it doesn't matter. But none of that is relevant when you going away from base fp16 model is objectively gonna be worse in general and especially for details.
>>
>>106614009
damn its basically a different model
>>
File: 1726700437058009.webm (1.65 MB, 480x672)
1.65 MB
1.65 MB WEBM
go amazon man go!
>>
>>106613688
you dont have enough ram, it should only take half a sec
>>
File: 1754092875565198.jpg (2.04 MB, 7961x2897)
2.04 MB
2.04 MB JPG
>>106614023
and it gets worse the further you go
>>
>>106614005
nunchaku is even better. The parts of it that would degrade are fp16
>>
File: 1742966409880417.webm (1.33 MB, 480x672)
1.33 MB
1.33 MB WEBM
before the issue was slow motion. now it can be sanic fast with the high 2.2 lora disabled.
>>
>>106614059
I think three steps is too low desu. 4 was around what the paper outlines.
>>
>>106614056
>nunchaku is even better.
it's not better than Q8 you're delusional
>>
>>106614080
it legit is closer to fp16 than Q8
>>
>>106614056
>nunchaku is even better.
You should be in an insane asylum. Nunchaku is good, but it's not better than q8
>>
>>106614094
Q8 has more detail degradation, nunchunu only looks different style wise per seed
>>
>>106614092
prove it, show a comparison image between bf16, Q8 and nunchaku
>>
>>106614005
They did in the paper, their 8 bit method is basically perfect and also supports SDXL
>>
>>106614112
>in the paper
nigga
>>
>>106614112
they compare that to INT8, this shit is worse than fp8 (and even worse than Q8), it's not a good comparison
>>
>>106614112
Sir, I'm from the asylum. Please come with us, you need help.
>>
>>106614112
our 0.7b LLM model beats the <latest top trillion param model> on this benchmark we specifically finetuned it for its basically better than that model now!!!!!! tier retardation
>>
All the AI papers are fucking useless. Only thing that holds any value is same seed comparison between models.
>>
>>106614112
don't cite papers here, they can tell the truth, only trust your gut and tell stupid shit with confidence
>>
>>106614159
>only trust your gut
*eyes >>106614050
>>
>>106614159
>qwen has high aesthetic quality! the paper said so!!
>>
>>106614140
are you retarded? it's their own int8 method not naive int8
their int4 and nvfp4 are better than q4
>>
File: qwen-image.jpg (2.81 MB, 5924x5708)
2.81 MB
2.81 MB JPG
>>106614110
youll have to wait till I get home but they have this
>>
>>106614165
there is neither scaled nor nunchaku stuff there, so you're right, be even more confident!
>>
>>106614159
>they can tell the truth
30% of the time yes
https://en.wikipedia.org/wiki/Replication_crisis
>A 2016 survey by Nature on 1,576 researchers who took a brief online questionnaire on reproducibility found that more than 70% of researchers have tried and failed to reproduce another scientist's experiment results
>>
>>106614175
>be even more confident!
>>106614056
>nunchaku is even better.
yep, that's confidence, always trust a random anon, if he says so, that's true
>>
I trust myself.
I made videos with fp8 scaled, and ones with q8, no difference in output, but the fp8 scaled was faster.
>>
File: 00008-1320046499.png (1.57 MB, 896x1152)
1.57 MB
1.57 MB PNG
>>
Assuming I've got a shitrig of an old server from the 2010s runnin nextcloud and lyrion, How viable would putting a modern gpu there for SD be?
Im wondering how much of a bottleneck old chipset/cpu/ram would be?
>>
Have you ever made a claim so retarded the entire general fell into chaos?
>>
>>106614173
the problem with this comparison was always that its too basic with a huge room for error in the image, you can fuck it up during inference a lot and as long as its vaguely a book shop of books with correct words on it, its good

gen a realistic crowd of different people of different clothes/races all holding different objects engaged in battle for example or other similar complex prompts, it will shit itself
>>
>>106614222
aesthetic af
>>
>>106614200
Surely you tested it on multiple seeds on complex motion and action prompts, right... right? Oh...
fp8 scasled blurrs the motion
>>
>>106614200
>no difference in output
if you only asked for "1girl, walking" then yeah you don't need a solid quant to do this, it depends on each case
>>
>>106614241
>aesthetic
Lucky gen. had to (badly) airbrush the little man out of it.
>>
>>106614228
not hard, when the entire general already has below average intelligence.
>>
>>106614225
2010s is a bit vague. Probably most important is that it's at least PCIE 4.0, and you want your models to be on a fast nvme ssd. If you offload to the CPU (you most likely will for video gen unless you get a 5090 at minimum) then the System RAM speed matters a lot and then the CPU speed.
>>
>>106614273
>when the entire general already has below average intelligence.
It's your fault, your score is so low that it brought the average down to a ridiculous level.
>>
File: screenshot_50210.jpg (69 KB, 600x764)
69 KB
69 KB JPG
just unfucked my lora thanks /g/
>>
Radiance is strange because it loves to slap super fine threads throughout the image.
>>
File: 1749568837981.jpg (171 KB, 1187x1944)
171 KB
171 KB JPG
So is it a better idea to train a character lora and a pose lora, or trian the character and the pose in one lora?
>>
My friend is an architect and he wants to use AI to enhance his images. I haven't image genned since the Dreambooth days (I primarily just video gen now), how should I go about this? SD with some realism loras + control net with depth map?
>>
There's not gonna be a real VACE 2.2 is there?
>>
>>106614364
What model? For poses you can use controlnet.
>>
not sure if this is the right place to ask this, but can image to video gens be profitable? or there’s a good chance that the original owner can sue your ass into oblivion?
>>
>>106614409
You'd be the first.
>>
>>106614335
>furshit
>>
>>106614409
Do you mean like having a porn patreon focused on i2v content? In that case I think you'd want to gen your own images.
>>
nunchaku wan WHEN WHEN WHEN WEHN
>>
File: ComfyUI_temp_fkvpb_00017_.png (2.77 MB, 1280x1024)
2.77 MB
2.77 MB PNG
>>
File: ComfyUI_00255_.mp4 (593 KB, 832x480)
593 KB
593 KB MP4
>>106614605
neat style
>>
>>106614542
when you buy a real gpu.

>>106614605
nice style. what was the prompt? it's chroma right? it has those lil noisy (in a good way) details that look like chroma
>>
>>>/h/8723568
I've come a long way with my i2v loops. I realize that keeping your image bright helps a ton with the quality for some reason.
Can finally start doing postprocess editing.

Really makes me want to start using flux etc to make some funky fantasy stuff.
>>
File: ComfyUI_temp_fkvpb_00027_.png (3.68 MB, 1664x1088)
3.68 MB
3.68 MB PNG
>>106614675
https://files.catbox.moe/d684h8.png
>>
>>106614408
Chroma. Pose is just an example really. It's a complex interaction that needs a lora because control net doesn't understand it.
>>
>>106614742
Prolly two loras since your character lora wants to be diverse in angles and shit. Unless you literally don't do anything but same character in one pose forever.
>>
>>106614228
>I was only pretending to be retarded
>>
File: 0_00120_.mp4 (744 KB, 832x480)
744 KB
744 KB MP4
>>
>>106614675
ur BLACK
>>
how to controlnet with gwen edit
>>
frankenstein monster
https://files.catbox.moe/fz74oz.mp4
>>
File: 00094-2300254381.png (2 MB, 1200x1520)
2 MB
2 MB PNG
>>106614029
dude better have my gpu in that delivery load,he's really goin for it!
>>
>>106614837
"the large mecha robot in the background lifts up its arm holding a futuristic gun and points it towards the viewer. a large purple laser beam shoots out of the futuristic gun filling the view with a purple explosion. the two women in the foreground starts burning with fire. the two burning women transforms into two skeletons standing in the foreground. the two skeletons fall apart and fall down."
>>
For a chroma lora can you reuse your datasets for XL loras as is with the same captions?
>>
You Will Never Be Ani
>>
>controlnet lora for qwen
>nunchaku qwen still doesnt support loras
SUFFERING BROS
>>
>>106614891
fucking disgusting my dude, kys
>>
File: 1757813590629606.mp4 (1.24 MB, 480x672)
1.24 MB
1.24 MB MP4
the white hair anime girl wearing a black blindfold, stands up and walks out the door to her right.

lightx2v 2.2 high lora off, low lora on, kijai workflow. works better than 2.1 with high/low (with wan 2.2)
>>
File: 0_00134_.mp4 (738 KB, 832x480)
738 KB
738 KB MP4
>>106614904
kek, I'll work on it.
>>
>>106614924
Right, this is more up your alley >>106614878
I understand.

>>106614932
still blows my mind you don't even have to prompt wan for nudity much of the time, it just fills in blanks.
>>
>>106614782
Was just wondering if overlapping 2 loras on each other would fuck both of them up since they weren't trained together, but I guess it works fine for Chroma then? Ty if true, will save me big on effort since all I have to do is the character which is way less images and time.
>>
We are diffusing seedream locally through comfyui’s powerful api. who could ask for more?
>>
File: 00025-2368599741.png (1.67 MB, 1024x1024)
1.67 MB
1.67 MB PNG
>>
>>
>>106614228
easy when you post it over and over again for multiple days
>>
>was playing around with wan 2.2 workflow I made
>I had the qwen edit workflow open
>save all workflows and close everything
>after a couple hours reopen comfy
>qwen edit WF was for some fucking reason overwritten by wan 2.2 wf
thanks cumfart, ur a fucking nigger
>>
>>106614991
small indie cumfarter dev pls understand
>>
>>106614991
>not exporting workflows
>trusting the built in "save" option
ISHYGDDT
>>
>>106615012
I think it happened when I had the wan wf open and decided to close the qie wf and clicked on save.
>>
>>106614976
me, I want an UI that isn't the digital equivalent of getting your teeth pulled without anaesthetics
>>
File: ComfyUI_00206_.png (561 KB, 1024x1024)
561 KB
561 KB PNG
>>
>>106614710
wonderful, thank you anon.
>>
File: ComfyUI_00211_.png (330 KB, 1024x1024)
330 KB
330 KB PNG
>>
File: ComfyUI_00215_.png (650 KB, 1024x1024)
650 KB
650 KB PNG
>>
File: WanVid_00007.webm (1011 KB, 544x704)
1011 KB
1011 KB WEBM
could be worse
>>
>>106614906
Yeah it can work
>>
>>106615176
gay
>>
making a lora of yourself so you can literally diffuse your cock in other peoples asses is peak local diffusion
>>
>>106615202
I'd not submit my likeness to the silicone spirit
>>
>>106615202
>make nsfw lora of yourself
>release it on civit
>>
File: ComfyUItest_00015_.png (1.78 MB, 1024x1024)
1.78 MB
1.78 MB PNG
>>
>neo"forge"
>neo"vagina"
>>
File: ComfyUI_00218_.png (705 KB, 1024x1024)
705 KB
705 KB PNG
>>
>neon knights
>>
Ronnie James Dio
>>
>>106614159
the comparisons posted here are far more academic than the grifting faggots making useless papers
>>
whers ranfag?
>>
>>106615246
deep and profound
>>
>make nsfw model of your mom
>release it on civit
>???????
>profit
>>
>>106615165
cute

>>106615202
i was tempted by the demon which suggests this same idea to you, but, it's probably not a good idea.
>>
>>106615269
rent free
>>
>>106615269
hopefully went back to /sdg/ with the other avatarfags
>>
File: 1741631419695817.mp4 (1.06 MB, 480x672)
1.06 MB
1.06 MB MP4
almost seamless.
>>
>>106615165
I hate how magic effects just kind of fade it in like a Photoshop layer having it's opacity turned down. With enough prompting you can get it to be more magic like with a particle effect obscuring it but it's still no sailor moon transformation sequence.
>>
>>106615306
and breathless!
>>
>>106615289
>but, it's probably not a good idea.
what could go wrong?
>>
>>106615286
But how do you get the training material for the nsfw part?
>>
>>106615330
Ngl I wouldn't like to look at a distorted figure of myself with three legs, melted face and head rotated 180°
>>
>>106615306
this is breathtaking!
>>
>>106615330
i don't know brotha that's why i won't risk it.
https://youtu.be/4Wulc0enY4M?si=vE5sa9EYoIJiexV3
>>
>>106615293
He really has nothing else to live for, I wonder how years will he can keep doing this, almost at the 5 year mark
>>
>>106613605
>https://comfyanonymous.github.io/ComfyUI_examples/wan22/
Retarded question: Are those videos in the OP done by Wan or is it even local txt/img-to-vid model?
>>
>local diffusion general
>>
>>106615397
Yes



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.