/g/ - Technology



File: 1747145924870626.jpg (2.89 MB, 3942x2508)
Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107594109

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
https://comfyanonymous.github.io/ComfyUI_examples/z_image/

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Template: https://rentry.org/ldg-template
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg
>>
>NBP in collage
i want the other baker back
>>
Alright, I'm noticing it now
>>
File: Z__00012_.png (2.58 MB, 1664x936)
z-image does absolutely not understand goat pupils. even if i prompt "they have rectangular shaped eye pupils" it will make them round. don't listen to people who tell you otherwise
>>
>>107596979
that gen jesus christ how horrifying
>>
>>107596979
try slit-type pupil or horizontal slit pupil
>>
File: z-turbo_00034_.png (1.85 MB, 1024x1536)
>>107596979
>>
File: 3627315182.jpg (634 KB, 1536x2688)
>>
>>107596291
>I'm not sure if this is bait or not.

https://www.reddit.com/r/comfyui/comments/1poulr3/psa_the_save_image_as_type_chrome_extension/

If it's not this extension, then it's some other extension.
>>
File: Z__00029_.png (2.27 MB, 1664x936)
>>107596998
thanks, that worked much better!
>>
>>107597089
lmfao. I guess this is the number of r's problem but for an encoder
>>
File: ComfyUI_00807_.png (2.09 MB, 1200x1600)
Tard wrangled ZIT.
>>
fishy bread
>>
>julienstudio in the collage
grim
>>
did anon find the node?
>>
>>107597261
Which node?
>>
>>107597271
multigpu encoder loader for wan. he was trying to concat them from separate loaders and just kept getting tensor errors
>>
File: grid-0009.png (610 KB, 896x1152)
>>
>>107597277
pretty lit desu
>>
File: 00114-2132615272.png (1.09 MB, 896x1152)
>>
>>107596792
Where's the real thread?
>>
>>107597189
>>107597223
>>107597293
https://rentry.org/ranfaggot
>>
>>107597298
it's like a parody of the ones he wrote. I think I finally get it kek
>>
>julien
>>
File: z-image_nag_00214_.png (1.77 MB, 1024x1536)
>>
>>107597422
lol still going at it?
>>
dont overlook Flux 2. it encourages a lot of prompt engineering and it knows more things than z
>>
>>107597481
>json
>>
>>107597481
it also encourages you to buy another gpu to gen with z while a single flux 2 gen finishes

whats the time with 4 step lightning or whatever is the meta for it anyway?
>>
>>107597496
>VRAM: 10GB
>RAM: 64GB
>fp8
>1824x1248
>300 seconds
>>
>>107597293

>>107597478
>>107597478
>>
>>107597514
finally thank you
>>
>11k forks of comfyui
>but none are worth switching to
We had Forge/reForge as a replacement for A1111, why isn't there a good alternative for comfy?
>>
multithreading
>>
>>107597514
you lost, nigger
>>
Opinions on https://github.com/zai-org/SCAIL ?
>>
>>107597532
you should ask in the thread not made for shilling TraniStudio
>>107597478
>>107597478
>>107597478
>>
>>107597536
based
>>
>>107597540
Why do you repost from the real thread anon? >>107597525
>>
>>107597557
julien is smooth brained that's why
>>
You know what, based. Last thread was fucking weird, I see why it's needed now
>>
>>107597514
>>107597544
Thanks
Imagine a collage with the vibecoded wrapper lmao
>>
>>107597583
shoot. what if the universe was vibecoded?
>>
>>107597540
looks neat but I am not able to use it unfortunately
>>
>>107597583
>>107597601
you too should discuss that right now here
in the thread that will again not hit bump limit again because no one cares about julien
enjoy and ran won
>>
>>107597557
I posted in both threads but I can see I'm in the real thread
>>
File: horselo.png (1.07 MB, 1500x1026)
>>
why is every random promotion of wan 2.6 on twitter calling it open source lol
>>
>>107597617
Weird when you reposted my post
Are you a schizo by chance?
>>
>>107597636
a real tour-de-horse
>>
>>107597653
heh
>>
File: z-image_nag_00216_.png (2.16 MB, 1024x1536)
>>
>>107597636
this is gud, i accept it
>>
File: taxi.png (1.84 MB, 1536x864)
>>
File: 00114-2132615272.png (1022 KB, 896x1044)
>>
File: htjud1.png (1.8 MB, 1024x1024)
>>
>>107597672
and no centaurs either... remember what they took from you
>>
File: mists1.png (2.11 MB, 1824x1248)
>>
>>107597697
These look like shit
>>
>>107597706
they aren't that bad.
>>
>>107597668
this is pretty kino. prompt?
>>
>>107597707
For 2022 maybe
Why the sudden drop in quality? Weird
>>
MrSchizo you should rethink where you're putting your target on. Don't you think that if your aggressor came from here, he would have already stopped doing this because he doesn't want to see his beloved general being constantly attacked?
>>
>>107597723
What do you mean?
>>
File: 1735775740419650.png (1.97 MB, 1472x1216)
i thought we were supposed to get the base model yesterday?
>>
>>107597722
gguf? NAG? other optimizations? you have no idea what people are using but prompter game can recognize prompter game no matter the quality
>>
>>107597736
MrShizo I'm saying it's valid that you want to take revenge, sometimes the truth is in what people keep silence about and not what they yap the most about
>>
>>107597770
yesterday isn't christmas
>>
>>107597779
>revenge
I unironically don't get what you mean
Explain?
>>
>>107597722
I don't think base SD1.5 could generate swords like that
>>
>>107597797
this
>>
[Prompt Rewriter] Error: HTTP 500
Server response: {"error":{"code":500,"message":"failed to process image","type":"server_error"}}
[Prompt Rewriter] Note: This may be a VLM context issue. Current context: 4096
Prompt executed in 6.98 seconds

why does this keep happening with the fucking prompt rewriter node
>>
>>107598048
Are you exceeding a character limit greater than 4096?
>>
>>107598091
Not character limit, token limit. 4096 context is more than enough for a prompt though, like the size of a large essay.
>>
You guys enjoying being strung along by Chinese culture?
>>
>>107598116
we are in the Christmas culture long game
>>
do anon hoard models? I'm out of disk space
>>
>>107598142
I hoard models and gens. still have 4 TB free
>>
>>107598048
that means your context size isn't big enough, increase that number
>>
>>107598091
no, I'm just trying to caption a small dataset. it goes well for 90% of the photos but then it gives out that error for some images and I have to rerun it several times until it works. too bad the dev has the issues section disabled on the repo, there is no way to report bugs
>>
>>107598048
>>107598107
images consume tokens anon, one 1MP image is already eating ~1000 tokens, increase your context size
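
A rough way to check this before a captioning run, assuming the ~1000 tokens per megapixel figure quoted above holds for your VLM (it varies by model and preprocessor). The function names and the prompt/reply sizes are made up for illustration:

```python
import math

# Assumption taken from the post above: roughly 1000 tokens per megapixel.
# Real VLMs tile and resize images differently, so treat this as a ballpark.
TOKENS_PER_MEGAPIXEL = 1000


def image_tokens(width, height, tokens_per_mp=TOKENS_PER_MEGAPIXEL):
    """Estimate how many context tokens one image consumes."""
    megapixels = width * height / 1_000_000
    return math.ceil(megapixels * tokens_per_mp)


def fits_context(width, height, prompt_tokens, reply_tokens, context=4096):
    """Return (fits, tokens_needed) for one image plus prompt and reply."""
    needed = image_tokens(width, height) + prompt_tokens + reply_tokens
    return needed <= context, needed


# Hypothetical budget: 300 tokens of instructions, 500 reserved for the caption.
print(fits_context(1024, 1024, prompt_tokens=300, reply_tokens=500))
print(fits_context(3000, 2000, prompt_tokens=300, reply_tokens=500))
```

If only some images in a dataset fail, the large ones are the first suspects: either downscale them before captioning or raise the context size.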
>>
if we dont post images theyre going to start saying we're the troll thread
>>
>>107598181
>too bad the dev has the issues section disabled on the repo, there is no way to report bugs
I just opened the issue tab, but like I said, do that >>107598182
>>
>>107598188
it's just tRan replying to herself. who honestly uses chroma anymore now that zit is out?
>>
File: z-image_nag_00223_.png (2.36 MB, 1024x1536)
>>
File: 1766116248819.png (1.1 MB, 1350x900)
Is more than 32GB of VRAM necessary for image/video generation? I was thinking of getting a 5090 but now I'm considering a framework desktop or even a mac studio for the 128GB of shared memory. I also thought about an RTX 6000 but that card alone is just about triple the price of every other option (in my country, at least).
>>
>>107598206
I was going to say I don't believe you but it really is just him spamming the same prompt as usual
>>
>>107598228
necessary no, desirable yes
>>
>>107598228
no you can offload to memory and only need to process each layer with your GPU. it is still worth getting one with a lot of cores though
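
Back-of-envelope version of why that works, as a sketch only: if weights are streamed from system RAM one block at a time, peak VRAM is roughly the largest block plus activations instead of the whole model. The block sizes and activation figure below are hypothetical, and real offloaders keep extra buffers around:

```python
def peak_vram_gb(layer_sizes_gb, activations_gb, streamed):
    """Estimate peak VRAM for a model made of sequential blocks.

    streamed=False: all weights resident on the GPU at once.
    streamed=True:  only one block resident at a time (rest stays in RAM).
    """
    if streamed:
        weights = max(layer_sizes_gb)
    else:
        weights = sum(layer_sizes_gb)
    return weights + activations_gb


# Hypothetical model: 40 blocks of 0.5 GB each, 4 GB of activations.
layers = [0.5] * 40
print(peak_vram_gb(layers, 4.0, streamed=False))  # everything on the GPU
print(peak_vram_gb(layers, 4.0, streamed=True))   # one block at a time
```

The cost is that every block has to cross the bus on each pass, which is why compute speed and RAM bandwidth still matter.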
>>
>>107598234
So the 395 APU is worth it over the 5090?

>>107598238
Or is the 5090 worth it over the APU?

The main things I want to do are generate video and run an LLM to act as a foreign language tutor.
>>
>>107598228
shared mem isn't really the performance boost people memed it would be. anon was right, cuda cores are more important for speed now than vram capacity (considering you have a healthy amount of ram)
>>
>>107598268
5090 and 128 gb RAM is a really good spot to be in.
>>
>>107598268
APU is useless for video gen.
>>
File: 1766116870930406.jpg (63 KB, 910x586)
https://www.reddit.com/r/StableDiffusion/comments/1ppa8x9/zimageedit_news
>>
>>107598228

This dude is ready to drop like fucking....10-12k just to goon, I respect it honestly.
>>
>>107598336
I don't think he is going to get the rtx 6000 nonie
>>
File: file.png (127 KB, 693x173)
>>107598336
>>107598369
man you guys are casuals compared to us /lmg/chads
>>
>>107598392
Duh. diffusion models have a lower bar of entry so of course most people here don't have multiple gpus or even 64 gb of RAM
>>
File: honest reaction cat.jpg (84 KB, 939x1065)
>bought 5060ti for 450 bucks
>wan 2.2 image to video still takes 5 minutes for a 5 second clip 640x640

Guess you really gotta throw money at this shit if you want to do anything. I was really debating buying a 3090, but then I would need a new PSU as mine is only 700w, and I don't even think a 3090 would fit in my matx case, so then I would need a case too. That would be like easily 1500+ bucks. I will settle for the 5060ti for now. My 3060 12gb was doing okay gaming even on my 4k monitor but it was pretty bad for AI. I know I should probably not just be using the default templates in comfyui as there are probably all kinds of stuff to speed things up. But I have no idea how to set up a workflow, and whenever I try to use these custom things from civitai I rarely get them to work. I try to download and clone the things you need to use them but they still don't work.
>>
>>107598413
>>107598238
if you can find a used 4090 that would probably be the best bang for your buck
>>
>>107598413
do multigpu with 3060
>>
File: 1763289253783748.gif (1.48 MB, 190x200)
>>107598428

Hmm that's a big think, I think I would still need a new mobo as my matx board only has one full PCI slot but I could re-check that. New case and motherboard wouldn't be that bad like 250-300 bucks less if I cheap out on the case just for function and a full atx motherboard. 28gb of vram would be pretty sick probably, i'm assuming there are no like issues really with multigpu like in gaming as all the AI is doing is filling up the ram and using CUDA cores and shit you don't need to sync frames and all other kinds of nonsense.
>>
File: trellis2.png (437 KB, 865x771)
Trellis 2
>>
>>107598228
You should get a 5090 right now. If not get a 5070ti right now. You should also be prepared to rent an H200 to generate video+audio when it comes out in 2026 because spending 500-1000 dollars renting for your audio-video goon will be cheaper than spending 10k on a graphics card that will gen 4x slower and lower res than you can get renting a H200 cluster
>>
>>107598445
you might not need an additional full pcie slot but a riser
>>
>>107598448
printable gakis for hot glue
>>
File: 1766118716439308.mp4 (3.86 MB, 2048x1152)
https://xcancel.com/aisearchio/status/2001365588980175153#m
snake oiled again
>>
>>107598466

i just looked up my motherboard, it actually does have a second full size PCI slot but it's not PCI x16 it's x4 so I assume the GPU would be gimped as fuck? I would just need a new case or like you said get one of those risers or extensions and I could just like run the card all ghetto outside the case or on top of it or something.
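
For a rough idea of what x4 costs, here is the theoretical link math. The per-lane rates are the standard PCIe 3.0/4.0/5.0 figures with 128b/130b encoding; the 14 GB model size is a made-up example, and the slot generation is unknown here so all three are listed. Real throughput is lower than these ceilings:

```python
# Raw transfer rate per lane in GT/s for each PCIe generation.
GT_PER_SEC = {3: 8.0, 4: 16.0, 5: 32.0}
ENCODING = 128 / 130  # 128b/130b line encoding used by PCIe 3.0 and later


def link_gb_per_s(gen, lanes):
    """Theoretical one-direction bandwidth in GB/s for a PCIe link."""
    return GT_PER_SEC[gen] * ENCODING / 8 * lanes


def transfer_seconds(size_gb, gen, lanes):
    """Best-case time to move size_gb of weights across the link."""
    return size_gb / link_gb_per_s(gen, lanes)


MODEL_GB = 14  # hypothetical checkpoint size
for gen in (3, 4, 5):
    for lanes in (4, 16):
        secs = transfer_seconds(MODEL_GB, gen, lanes)
        print(f"gen{gen} x{lanes}: {link_gb_per_s(gen, lanes):.2f} GB/s, "
              f"{secs:.2f} s to load {MODEL_GB} GB")
```

So an x4 slot moves data at a quarter of the x16 rate. That mostly shows up when loading or swapping weights; if everything stays in VRAM for the whole gen, the link is used far less.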
>>
>>107598477
never thought it wouldn't be another snake oil
>>
>>107598477
>erm but achtually it's only a 2X speed up
>erm but actually that's only on [niche hardware]
>and uhm err actually it's only a 3 second speedup over normal times on regular hardware and hurts the output.

Every time and I have no reason to believe this is different.
>>
>>107598455
>rent
never
>>
Why are you reposting from the other thread?

>>107598325
>>107598284

>>107598456
>>107598477
>>
File: 1765814453579586.png (1.13 MB, 1216x832)
>>107598534
begone nigger
https://rentry.org/ranfaggot
>>
>>107598534
its pretty damning that he can't answer your question and just calls you his scapegoat boogeyman
>>
how do you make money with this stuff?
>>
What is the current go to I2V model
>>
>>107598565
be good at networking, social media trends and keep a steady patreon with the audience you cultivate

>>107598569
still wan but runner up being the latest hyvid which is a bit less censored
>>
>>107598576
>still wan
wan2.2?
>>
>>107598597
yes
>>
>>107598565
I make a few hundred every week. I sell my gens to Japanese sararymen.
>>
>>107598620
based. what are they into?
>>
>>107598543
prompt?
>>
>>107598627
They really love transsexuals.
>>
>>107598672
omg just like /ldg/
>>
>>107598709
Catpiss-anon works in Japan and look what sort of nasty stuff he spams.
>>
>>107598534
there are people who come here for new information and to help people occasionally. They don't care about the absolute, unmitigated, retarded faggotry that's going on with the this /ldg/ split. Sometimes, people will want to go back several /ldg/ threads, maybe they remember a workflow or link or a node recommendation, and they'll try to track it down. But now, everything is split into two threads. So you can't remember, did I see that workflow in this faggot's thread or that other faggots thread? And so it doubles your time searching.
>>
File: 1766121230236715.png (3.39 MB, 1336x2008)
>>107598750
true
>>
>>107598750
desuarchive search is really bad too unless you know the exact text pattern to search with.
>>
>>107598750
>>107598774
it's all that stupid catjak retard's fault for starting this split bake trend over some stupid fucking off topic rentries in the OP
>>
File: 1736295707701347.png (2.76 MB, 1280x1408)
>new model comes out
>realism like you've never seen before
>have to come up with smart ways to get around the censorship
>lewd fine tune of the model comes out
>can finally gen good looking nudes and properly train nsfw loras, but you've gotten used to the level of realism and gimmicks the model originally had
>new model comes out...
will we ever escape this cycle?
>>
>>107599235
It will end soon. Future new models won't be possible to run on consumer hardware. Computing will shift towards the cloud.
>>
>>107599854
yep
they're already trying to make personal computing unaffordable
getting an actual powerful pc will be like getting a car in ~10 years if they succeed


