[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107693072

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg
>>
blessed thread of frenship
>>
ramtorch when
>>
is there like a beginner friendly guide to get started with your own model rig like the kind that Pewdie uses?
>>
File: 1758369591983134.png (1.3 MB, 1378x693)
1.3 MB
1.3 MB PNG
>>
I need to seamlessly blend someone into a group of people. Gemini Banana Pro can't do it apparently.. :(
>>
guys how do I upscale, please discuss local diffusion with me I thought you were my friends
>>
File: file.png (4 KB, 365x70)
4 KB
4 KB PNG
how do I get rid of this stupid piece of shit?
>>
it's over...
>>
>>107700541

Chinese culture’d
>>
>>107700522
What, you don't like clicking three times to cancel the queue and back out or have several things piled on top of each other? That's $17mil worth of design right there, Anon!
>>
>>107700576
I thought you had to go, fuck off already
>>
>>107700582
You must be mistaken, I'm not anybody in these threads.
>>
>>107700604
I would love to discuss the pros and cons of all software pertaining to local image diffusion, but exactly half of that discusion will cop me a ban, so I'd rather not discuss any of it, hence this empty thread
>>
>>107700568
What do you mean? Is it actually out?
>>
>>107700612
anon you are free to discuss anything on-topic and constructive if you keep personalities and schizo drama out of it
>>
>>107700629
you're not allowed to impersonate janitors or moderation staff
>>
>>107700522
>>107700576
>>107700582
>>107700604
>>107700612
>>107700629
...I just wanted to know if I could disable it
>>
>>107700637
that's not what i'm doing. just stating the obvious. nobody ever got banned for discussion local diffusion in a local diffusion general, you're spreading disinfo
>>
Where rentry
>>
>>107700643
You can actually roll the frontend back a bit, I'm not sure how far though.
>>
>>107700656
https://www.youtube.com/watch?v=X1osnpVqY_k
>>
File: 1611498912553.jpg (4 KB, 160x314)
4 KB
4 KB JPG
Finally got Latentsync 1.6 working, spent all fucking morning on it. It doesn't replace random shit like InfiniteTalk does. But infinitetalk is better at the lipsync.

I can't find jack shit on how to mask a face, or even mouth, with infinitetalk. Does anyone know?
>>
>>107700676
So, there's no way to turn it off with the current frontend?
>>
so what are the current top dog local models
when last I checked it was I think either flux or chroma for realistic stuff
and noobai/illustrious for weeb

I saw that pony 7 came out but I have no idea if that had any impact or not
>>
>>107700409
based
>>
>>107700763
No, unfortunately.
>>
>addicted to watching avr_loss go up and down for hours
>bouncing between its over and we are back
>>
Why was this thread created 2 hours before the previous thread hit bump limit?
>>
FUCKING WHY
>>
Why is Anistudio is such an useless piece of shit software?
>>
>>107700879
things made out of spite often are, it's not a good motivator for quality
>>
>>107700883
This
Most of his actions are out of spite plus he never delivers any of the software he promises. Still no high res fix and most UI had that in weeks not months.
>>
>>107700883
I feel genuinely sorry for whoever uses it in their workflow
>>
>>107700866
>2025-02-27
I can think of why perhaps
>>
>>107700866
>>107700927
AAAAAAAAAAA

WHY ISN'T REQUIREMENTS INSTALLED AUTOMATICALLY WITH COMFY ANY MORE REEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE

FUCK OPEN SOURCE
>>
nooooticing
>>
>>107700957
noticing? noticing what? the time? the weather? that's so ambiguous it could mean anything, most of which are on topic
>>
>>107700951
Yeah we really need an alternative UI that preferably isn't Python. Do any exist?
>>
>>107700966
There's always stable-diffusion.cpp :)
Not many other good alternatives, though
>>
>>107701015
are you using light2x? some wan shitmixes already have that embeded in them, so using light2x on top of it is like doubling the strength. only thing i can think of is try running it without it
>>
Any model to edit pixel art sprites to do different actions like, walking and so on?
Maybe qwen edit? Is there even a lora for it?
>>
Given the insane amount of hype around base its obviously going to flood the thread when it drops. I think we should preemptively come up with a temporary containment thread for it like how /wait/ was made to stop deepseek taking over /lmg/. Thoughts on /zog/ - z omni general?
>>
Ok legit what the fuck is the proper way to generate videos locally?
I just tried generating text to video 720p with 20 steps in comfyUI with hunyuanvideo1.5 model and this shit came out:
https://streamable.com/nag39n

I have no idea why those black squares are on the video.

This took like 1h30m to make. If i wanted to use like 50 steps it would probably take like 3-4h. Also there doesn't see to be any proper multi-gpu support either.

Is image-to-video any better in this regard?
>>
>>107701135
Comfy is still faster than the failure you're trying to shill
>>
>>107701135
never gonna use your trash ui
comfyui live forever, ani seethe forever
>>
holy mother of seethe anon. calm down
>>
>>107701135
yeah researchers are already discussing the pain of python instability and it's effect on performance. the only people that want to keep python are the vibe coders so they seem smart when they slop a custom node together. year of the snake is almost over
>>
>>107700879
skill issue
>>
>>107701141
i can do wan2.2 i2v with my shitty 8gb card in 1-5 mins depending on the settings, prompts etc and i dont see any artifacts
>>
>>107701322
I think that's a good idea.
>>
>>107701311
Agreed, whoever developed it isn't very good at coding
>>
>>107701320
i'll try that next then...
>>
File: 1743242427874675.jpg (498 KB, 1248x1872)
498 KB
498 KB JPG
>>
>>107701374
>reposting old gens again
kys
>>
cozy
>>
File: 1755281754631912.jpg (408 KB, 1504x1088)
408 KB
408 KB JPG
>>
>>107701471
please resolve your skill issues. she has three arms and the promo text is clipping through the third arm. you always post complete slop then a slightly improved one. there isn't any rush anon. take the time to make something good for once
>>
File: 00007-2344595850.png (2.65 MB, 1824x1248)
2.65 MB
2.65 MB PNG
>>
File: 1759466559310756.jpg (652 KB, 1302x1936)
652 KB
652 KB JPG
>>
>>107701496
>shemale enjoyer
knew it
>>
>>107701502
this show sucked.
Do a real girl like asuka.
>>
>>107701496
damn she hot!
>>
>>107701508
the ending sucked but first season was kino
>>
>>107701487
Anistudio is lacking, it still can't do high resolution fix and it has been out for half a year
>>
File: 1743822248734865.png (841 KB, 833x573)
841 KB
841 KB PNG
>>107701516
I was somewhat interested in it but it tried wayy too hard to be something like eva and deeper than it actually was. By the end they just gave up.
>>
>>107701526
basically why everything after first season sucked. lost it's own identity
>>
File: 1737989131631263.jpg (418 KB, 1525x854)
418 KB
418 KB JPG
>>
>>107701417
yeah it feels just like im still on reddit. this place has a bad rep over there but looks like its misplaced and the mods are taking care of the trolls.
>>
File: 00178-3380301330.png (2.72 MB, 1344x1728)
2.72 MB
2.72 MB PNG
>>107701505
>>107701510
i unironically use to wank to her character model of sfxt and old fanart before the whole mainstream "tranny and newhalf" retconning of her. perfect character design.
>>
>>107701559
Go back
>>
>>107701560
if you don't prefer the Dommy mommy cock, you can't enjoy poison to her fullest
t. poison enjoyer
>>
>>107701560
cool gen anon, love the pose
>>
>>107701559
i hate reddit because i can't anonymously praise myself on there :(( they have good advice for open source text-to-image/video models though
>>
>>107701559
>yeah it feels just like im still on reddit. this place has a bad rep over there but looks like its misplaced and the mods are taking care of the trolls.
This, the new /ldg/ is a redditfriend approved thread
>>
>>107701560
Don't pay attention to the schizo, your gens are really neat and always pretty high quality
>>
File: 1740887924436956.jpg (399 KB, 814x1208)
399 KB
399 KB JPG
>>
>>107701723
>another repost
>>
>>107701723
megumin a cute!
>>
>>107701129
Anyone?
>>
>nearly 2026
>no model can do convincing facesitting
>>
>>107701814
wan but it can't do small sprites and the motion is too fluid for most pixel art aesthetic.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.