[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


🎉 Happy Birthday 4chan! 🎉


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106826868

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: 1753534225207814.jpg (721 KB, 1344x768)
721 KB
721 KB JPG
>>
>shill in collage
>>
File: WanVideo2_2_I2V_00690.webm (1.36 MB, 704x1280)
1.36 MB
1.36 MB WEBM
>>
File: WanVideo2_2_I2V_00691.webm (1.41 MB, 704x1280)
1.41 MB
1.41 MB WEBM
>>106830631
wan struggles with tongues
>>
Blessed thread of frenship
>>
>>106830607
anyone else notice when you use the comfyui workflow for qwen image editing that it gets progressively more retarded the more images you generate until you reload the workflow?
what
>>
>>106830631
ok now make her pull one boob out and show it to me
>>
>>106830687
If that's really the case you should be able to take the workflow from the "retarded" gens, rerun it after reloading and get different results
>>
File: 1744457502903438.jpg (1.03 MB, 1248x1824)
1.03 MB
1.03 MB JPG
Neta is fun
>>
>>106830102
Huh I hadn’t even thought about using these LLMs for custom tuning my interfaces. Maybe I’ll fire up my ChatGPT plus when I get home and make a script for forge finally, I’ve always wanted to be able to pipe from txt2img into img2img latent upscale with controlnet instead of the hiresfix but that’s not possible in default, but I can’t code for shit beyond “hello world”. Thanks for opening my eyes anon.
>>
What is the current face swap/head swap meta?
>>
>>106830687
Sounds like something to do with memory maybe? I’ve noticed sometimes in forge when it gets buggered up upscaling too big or too many Loras everything starts looking more and more fried even with the same cfg and such
>>
What's your WAN 2.2 workflow?
>>
>>106830753
no problem anon
i was skeptical on AI too but claude code is really good at doing this agentic shit with bash/python
I was thinking today of having it write up something to OCR my output images for quality control when doing text edits/generations
if you have a chatgpt subscription i think codex is the equivalent, i would look into that. it's 100x nicer than having to copy-paste code and error messages back and forth to a web interface (i did that for far too long)
>>
File: 00032-1399991597.png (3.42 MB, 1080x1920)
3.42 MB
3.42 MB PNG
>>
File: 1742865493728877.jpg (1.74 MB, 1920x1080)
1.74 MB
1.74 MB JPG
>>106830837
>deepdream dogs were 10 years ago
fuck
>>
>>106830850
I got it working on Docker. It was neato. I had no idea what docker was and just followed the guide.

I now hate docker and refuse to install it, and comfyui, because it's awful too.
>>
File: 00033-1644118011.png (3.27 MB, 1920x1080)
3.27 MB
3.27 MB PNG
>>
>>106830738
based
>>
>>106830913
it barely even looks like her. enough to get my penis hard though.
>>
>>106830924
did he run it through the "refiner"?
>>
File: ChromaGiger_00004_.jpg (949 KB, 1472x1896)
949 KB
949 KB JPG
>>
>>106830765
probably still reactor faceswap? not sure tho, almost no one here seems to care for swapping faces.
>>
>>106830924
looks like who? this is an ai generated woman
>>
File: 1756939138552239.png (401 KB, 1258x1587)
401 KB
401 KB PNG
how do you know the right number of batch size to get that equilibrium?
https://xcancel.com/LodestoneRock/status/1975921870412325016#m
>>
>>106831079
trial and error depending on what you train. Btw does batch size affect quality of the final lora or is it purely about speed?
>>
>>106831088
>Btw does batch size affect quality of the final lora or is it purely about speed?
it's all about speed
>>
File: 00034-1539108598.png (1.24 MB, 792x992)
1.24 MB
1.24 MB PNG
>>
File: watermark?.png (2.08 MB, 2042x1357)
2.08 MB
2.08 MB PNG
lmao, at least nano banana doesn't make your image piss yellow like 4o imagegen
>>
>>106831088
Yes it affects quality
>>
>>106831079
based Dr. Furk
>>
File: please bro....png (34 KB, 250x144)
34 KB
34 KB PNG
>>106831079
>it's 100% identical to speed bro you have to believe me, please bro it's no different to native non offlading bro just increase your batch size bro I got this, I killed Nvdia bro you have to trust me on that one bro
>>
File: cum.jpg (120 KB, 1098x816)
120 KB
120 KB JPG
its weird how wan seems to know about penises pretty well without a lora.
>>
File: 00035-121072309.png (1.62 MB, 1104x880)
1.62 MB
1.62 MB PNG
>>
>>106831118
does that encode user info or is it just a generic watermark on all images is the question.
>>
>>106831088
It definitely affects quality but you can approximate the effects with gradient accumulation
>>
>>106831150
it knows this shit way less than hunyuanvideo though
>>
Coming from using automatic1111... ComfyUI is filtering me hard. I'm too retarded to use this shit.
>>
>>106831199
you had to learn new stuff to use automatic.
you'll learn new stuff to use comfy.
it's just different. don't sweat it.
>>
What's the best AI to gen anime l*li image to video with?
>>
>>106831199
don't bother until all the webapps using pytorch are dead. don't except anything that isn't a exe. anyone still recommend comfy is either a paid shill or a baby duck fearing change
>>
ok so where is my payment?
>>
>>106831227
>exe
>paid shill or a baby duck fearing change
ironic
>>
why do people hate comfy?
>>
>>106831258
not everyone, I'm sure there are a lot of people happy his org sells their data. hell, they probably pay for api nodes as well!
>>
>>106831267
What data does comfy get?
>>
>>106831278
we would all like to know. has to be something juicy if people pay for it
>>
>>106831286
it's open source so surely you can point to the part of the code which is sending your data out?
>>
File: ComfyUI_00085_.png (1.01 MB, 1280x800)
1.01 MB
1.01 MB PNG
>>106831241
guess you'll have to bill them
>>
>>106831308
electron, login (not disabled if you add the no API flag), and the custom node manager. verified by wireshark. nobody forked it so I guess they don't care enough about the shitty software to keep a privacy respecting fork updated
>>
he still hasn't posted a single screenshot about any of his claims btw
>>
>>106831343
can you?
>>
burden of proof
>>
>>106831327
>no lines of source
i accept your defeat
>>
comfyui calls home, prove me wrong
>>
File: 1735486212494323.mp4 (1.28 MB, 704x480)
1.28 MB
1.28 MB MP4
so a faggot was tazing his dog on a stream, named hasan.
>dog casts: thunder 1
>>
>>106831375
the dog on the left grabs the man with glasses with his paws and a lightning bolt strikes the man wearing glasses, and he falls down on the ground.
>>
Dog taze man.
>>
File: 00045-3749320490.png (3 MB, 1920x1080)
3 MB
3 MB PNG
>>
File: Kontext_00002_.png (963 KB, 1344x848)
963 KB
963 KB PNG
>>106831308
>>
File: ChromaGiger_00022_.jpg (1.29 MB, 1472x1896)
1.29 MB
1.29 MB JPG
>>
File: 00034-2655164237.png (1.37 MB, 896x1152)
1.37 MB
1.37 MB PNG
>>
File: ChromaGiger_00023_.jpg (1.37 MB, 1472x1896)
1.37 MB
1.37 MB JPG
>>
>>106831199
Select the C icon in the top left corner and browse templates. Its very common to find workflows with examples on Youtube and Reddit too.

Nodes are simply Python programs and ComfyUI is the retard ranch where you wrangle them together. Some of them will be killed and eaten next year while new ones will be created. Don't get too hung up on needing to know what every single one does.
>>
File: 1730785197425755.mp4 (1.14 MB, 704x480)
1.14 MB
1.14 MB MP4
>>106831386
wan 2.2 was made for days like this.
>>
best models:
neta lumina for anime
chroma for realistic
qwen image for prompt adherence
Change my mind.
>>
File: 00047-623757031.png (1.43 MB, 768x960)
1.43 MB
1.43 MB PNG
>>
just run comfy in firejail if you're autistic?
>>
>>106831550
or just use opensnitch and block the network access requests
>>
>>106831550
>>106831557
then comfy won't open at all
>>
>>106831587
you don't block local network dummy
>>
>>106831589
But what if trani hacks xemxilf by doing so??
>>
>>106831621
no idea what you're saying but it sounds like you might have other issues
>>
File: ChromaGiger_00029_.jpg (1.11 MB, 1472x1896)
1.11 MB
1.11 MB JPG
>>
File: 1731212266727686.png (112 KB, 1650x608)
112 KB
112 KB PNG
>>106831118
chat is that true?
>>
File: QwenEdit_00024_.png (491 KB, 800x592)
491 KB
491 KB PNG
tried the 4step
carried the face over head and shoulders beyond the 8step with same ksampler (steps changed obviously)
>>
Finally, was about time!
>>
>>106831735
we're eating good now
>>
File: ChromaGiger_00032_.jpg (1.53 MB, 1472x1896)
1.53 MB
1.53 MB JPG
>>
File: 1746742817279617.mp4 (1.22 MB, 704x480)
1.22 MB
1.22 MB MP4
>>
>>106831529
>qwen image for prompt adherence
hunyuan 3 beats qwen image with the bonus that no one can use it.
>>
>>106831786
good doggy. fuck hassan
>>
File: ComfyUI_0039-re.png (2.75 MB, 1296x1728)
2.75 MB
2.75 MB PNG
>>
File: NOO IN REAL LIFE.png (227 KB, 480x360)
227 KB
227 KB PNG
>>106831786
Sam Hyde invoked this dog to do some damage
>>
File: cuteCUTiEEE.webm (640 KB, 704x884)
640 KB
640 KB WEBM
>>106830661
>>106830631
ok now make her pretend to actually like me </3
>>
>>106831836
how she doing that with her lips...
>>
>>106831764
Not usually a chroma guy but nice, catbox?
>>
>>106831787
is there some computer magic the chinese can summon which turns my graphics card into the extremely tight undersized bathing suit?
>>
File: ChromaGiger_00036_.jpg (818 KB, 1472x1896)
818 KB
818 KB JPG
>>106831847
lora, still needs cooking
>>
File: 1751366292633572.png (489 KB, 1241x761)
489 KB
489 KB PNG
LOOOOOOL
>>
>>106831859
pretty sure I saw posts like that here too
>>
>>106831854
Respect
>>
File: fun(1).mp4 (1.51 MB, 560x560)
1.51 MB
1.51 MB MP4
Turning my ancient proompts into vids is fun.
>>
>>106831846
unprompted, but the starting frame already had a lil tongue blep out ;3
"smirk, eyes closed, sultry" etc you know by now
>>
>>106831883
slopped but cute
>>
File: n30moanon.gif (3.43 MB, 540x469)
3.43 MB
3.43 MB GIF
>>106831883
has a dreamlike stop-motion quality to it
whats fucked up is i am a schizoid &
her being a lookalike to someone
i used to spend a lot of time with
is so fuggin weird\strange i uh
i gotta take some time off
from all of this worldly
stuff man, like truly
i think i do mean
my mind is uhh
getting um
pookied
;_;
>>
>>106831883
what model is that video model
>>
File: fun2.mp4 (1.88 MB, 640x480)
1.88 MB
1.88 MB MP4
>>106831883
Don't worry, it's not all 1girl!

>>106831906
slopped?
>>
>>106831909
what the FUCK is "pookied"?
>>
>>106831914
plastic
>>
File: ChromaGiger_00037_.jpg (786 KB, 1816x1208)
786 KB
786 KB JPG
>>106831872
it's a challenge
>>
File: 00051-322425743.png (1.74 MB, 960x960)
1.74 MB
1.74 MB PNG
>>
>>106831807
>NO, IN REAL LIFE
>>
>>106831863
remember when people said open source wouldnt get stuff like dall-e

now we have wan i2v, illustrious/noobai, qwen/flux, and qwen edit to make an image do basically anything.
>>
File: fun3(1).mp4 (2.16 MB, 640x480)
2.16 MB
2.16 MB MP4
>>106831912
A good one

>>106831923
I'm just getting into this video stuff. Give me some tiiime
>>
File: 1753049711325018.mp4 (881 KB, 704x480)
881 KB
881 KB MP4
the dog on the left punches the man with glasses in the face with his paws and a lightning bolts strike the man wearing glasses from above, and he falls down on the ground.

DORYA!
>>
>>106831925
Kino
>>
>>106831947
airbud yes!
>>
>>106831940
more grills

>>106831912
I doubt it's local
>>
File: fun4.mp4 (3.01 MB, 752x416)
3.01 MB
3.01 MB MP4
>>106831940
>>
>>106831940
>>106831973
local diffusion general
>>
>>106831973
>>106831940

These are pretty good, so I won't hate. But shouldn't you be posting SAAS shit over on /sdg/?
>>
File: ComfyUI_0045-re.png (2.78 MB, 1296x1728)
2.78 MB
2.78 MB PNG
I am just so entranced by how this model shades the fabric folds. It is impossible to resist just giving random characters gossamer shawls.
>>
is it possible at all to prompt lighting adjustments to qwen edits?
>>
>>106831980
Comfyui supports SaaS and this is basically the Comfyui general so...
>>
*yawn*
>>
>>106832002
strawman argument, and its false
this is local diffusion general
>>
File: 00054-1482257944.png (2.27 MB, 1024x1024)
2.27 MB
2.27 MB PNG
>>
>>106832002
Which is why /sdg/ exists.
>>
Why do you keep saying if you use a non danbooru tag with illustrious it doesn't recognize it?
That's a lie, you can change some tags a little, like changing colours or singular and plural elements. The model understands that, even if a tag doesn't exist in the booru.
>>
File: 1732007050890640.png (131 KB, 591x554)
131 KB
131 KB PNG
Lodestone antisissies...our response?
>>
>>106832028
Does ostris support chroma?
>>
>>106832012
There are plenty of posts about comfyui on /ldg/ that have nothing to do with local diffusion specifically though
>>
>>106832025
2.0 (base not whatever shitmix you use) is better at it desu
>>
when anistudio comes out, webslop isn't allowed because saas sissies keep getting the wrong idea
>>
fucking ani hoarding all the good shota gens
>>
wan is amazing.

https://files.catbox.moe/e14okh.mp4
>>
>>106832037
then they have no right to be here? i dont see a contradiction
>>
File: 1757869592341799.png (40 KB, 576x310)
40 KB
40 KB PNG
>>106832028
looks like it will get faster too
>>
>DAEY HAB NO RIGHT TO COME ON LDG AN 4CHINZ
truly unhinged
>>
I need to know anons,
If you are training and you use an unrecognized tag, does the lora learn the tag for a concept or does it get ignored completely?
>>
>>106831097
That's not true, higher batch sizes learn slower at the same learning rate. It also impacts output diversity, overly high batch size can cause the results to be too averaged out sometimes.
>>
Ovi is fucking great, why didn't they did it for wan 2.2? Are they dumb? It would be like x10 better.
>>
>>106831925
Incredibly badass.
>>
How to make Qwen actually change the image with img2img workflow? No matter what I prompt it outputs practically the same image barely more realistic.
>>
>>106832094
>don't gatekeep anons, keep it cool and let everyone in there
that worked very well on /sdg/ right??!
>>
>>106832028
I have no response, if it works it's cool, I welcome the local ecosystem being stronger
>>
>>106831836
collagebait
>>
>>106832163
theres something fucky in your wucky
>>
>>106831079
I refuse to believe Furk is only one person.
>>
>>106832202
He is, but because he's a turkroach it gives the impression of fast multiplication.
>>
>dejavue during captioning
AIEEE
>>
>>106832202
for a PhD guy he really acts like the most unemployed person ever lol
>>
What are colours stable diffusion can understand?
I want to see what colours does it know, I don't just wanna use "blue" or "red", I'm talking RGB values here and their respective names.
>>
File: 1745626541010751.mp4 (957 KB, 704x480)
957 KB
957 KB MP4
be free doggo!



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.