[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107570316

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: 1743842532698128.jpg (3.67 MB, 5895x3140)
3.67 MB
3.67 MB JPG
For those who missed it, DFloat11 is making absolutely identical images as BF16, down to the pixel lmao
https://huggingface.co/mingyi456/Z-Image-Turbo-DF11-ComfyUI
https://github.com/BigStationW/ComfyUI-DFloat11-Extended
https://imgsli.com/NDM1MDE2
>>
Blessed thread of blessed thread of blessed thread of blessed thread of blessed thread of blessed thread of blessed thread of blessed thread of blessed thread of b
>>
>>107576396
why did you pick your own slop images with the shitty lora that fucks up hands?
>>
>>107576418
GPT please translate me in vramlet language
>>
im gunna traaaaaaaaaaiiiinnnnnnn
>>
>>107576428
you already know why https://rentry.org/ranfaggot
maintain the thread quality we have another tranny bake
>>
>>107576396
can we get a rebake? this has a bunch of off topic drama links in it
>>
>>107576396
>>107576437
are you still going on with this flamewar shit? both of you should kill yourselves already
>>
/ldg/- Limp Dick General
>>
okay now post again but this time try not to sound like youre seething and malding
>>
>>107576477
sure
>>
>>107576418
Shouldn't this be physically impossible? Did the Chinese use some black magic?
>>
File: ComfyUI_09480_.png (1.78 MB, 864x1280)
1.78 MB
1.78 MB PNG
>>
>>107576487
>Did the Chinese use some black magic?
I have no idea how they did that but they did lmao, my guess is that models don't care if the precision is 16 bits or 11
>t. didn't read the paper but should since it's really cool that they managed to pull this shit off
>>
I don't understand why OneTrainer won't develop a web UI. Some people rent GPUs for training and can't see the window. I guess I could use the CLI mode but at that point I may as well just switch to musubi tuner.
>>
>>107576437
>https://rentry.org/ranfaggot
lmfao ran is probably seething about this
>>
File: leaderboards.png (108 KB, 1597x1021)
108 KB
108 KB PNG
>local keeps falling behind
kek, saas wins again
>>
>>107576476
Oh no!
>>
File: 1753669176211519.png (168 KB, 498x498)
168 KB
168 KB PNG
>>107576513
>Flux 2 [max]
what?
>GPT Image 1.5 1st
WHAT??? what is this shit? I feel I'm missing some episodes there
>>
>>107576418
does it work with loras tho
>>
>>107576513
>>107576523
https://youtu.be/DPBtd57p5Mg?t=3
lmao it changes the image during edit, even Kontext dev doesn't do that, NOOB
>>
File: file.png (180 KB, 1139x547)
180 KB
180 KB PNG
>>107576430
Here's the chart.
>>107576487
Just read the paper, it's been out since April.
>>
File: ComfyUI_09478_.png (1.46 MB, 864x1280)
1.46 MB
1.46 MB PNG
>>
>>107576533
no ;-;, we need some coding wizard to improve that node
>>
>>107576497
>>107576546
nice
>>
File: ComfyUI_09448_.png (1.18 MB, 864x1280)
1.18 MB
1.18 MB PNG
>>
Oh no
>>
>>107576418
>>107576487
https://github.com/LeanModels/DFloat11
>How It Works
>DFloat11 compresses model weights using Huffman coding of BFloat16 exponent bits, combined with hardware-aware algorithmic designs that enable efficient on-the-fly decompression directly on the GPU. During inference, the weights remain compressed in GPU memory and are decompressed just before matrix multiplications, then immediately discarded after use to minimize memory footprint.
>Key benefits:
>No CPU decompression or host-device data transfer: all operations are handled entirely on the GPU.
>Decompression overhead is constant per forward pass and independent of batch size, making DFloat11 increasingly efficient at larger batch sizes.
>DFloat11 is much faster than CPU-offloading approaches, enabling practical deployment in memory-constrained environments.
>At batch size = 1, inference is approximately 2× slower than the original BF16 model, but the performance gap narrows significantly with larger batches.
>The compression is fully lossless, guaranteeing that the model’s outputs are bit-for-bit identical to those of the original model.

Note the second-last point. Might still be nifty for VRAM-constrained setups.
>>
Thoughts on Reddit for learning about open source models?
>>
File: 1729735186901635.png (453 KB, 828x765)
453 KB
453 KB PNG
>>107576513
In what world is the new GPTslop better than NBP? I swear every single time a new corpo model gets released it rises to the top regardless of the quality of its outputs. These benchmarks are nothing but a huge meme.
>>
>>107576570
His ass is surprisingly shapely
Also
>retarded captcha is broken and locks you out of posting if you don't need verification
What the fuck is this
>>
File: 1760704379781037.png (273 KB, 2287x1088)
273 KB
273 KB PNG
>>107576501
basically on bf16 you let the model chose between -123 and +123 (2^8), but in reality they never go for such high numbers, they actually never reach the max of 8 (2^3), so basically if you tell them to go for a max value of 8 nothing will change and you basically won 5 bits, 16 bits -> 11 bits
>>
>>107576640
>In what world is the new GPTslop better than NBP?
I have no idea dude, NBP has almost completly solved realism and this new OpenAI isn't even close to that
>>
>>107576640
They're just lying now because it's all a game to appease boomer shareholders
>>
>>107576641
>>retarded captcha is broken and locks you out of posting if you don't need verification
you have to update 4chanX, they just fixed that, and yeah it was annoying as fuck
>>
File: 1765941548.png (2.04 MB, 1024x1024)
2.04 MB
2.04 MB PNG
This is gold.
>>
>>107576640
Is this your first time realizing leaderboards are literal memes and should not be taken seriously
>>
>>107576696
looks like a baked potato to me anon...
>>
>>107576702
nah, that leaderboard used to be pretty accurate, like if I were to make my own rankings of image models it would be pretty close to that one... until today I guess
>>
File: ComfyUI_09390_.png (2.48 MB, 864x1280)
2.48 MB
2.48 MB PNG
>>
>>107576551
vibe code it
>>
File: 1741493991551134.png (2.01 MB, 1168x1752)
2.01 MB
2.01 MB PNG
>>
>>107576396
>>
File: Z-image turbo.png (3.74 MB, 1920x1080)
3.74 MB
3.74 MB PNG
>>
Where comfy node for treillis 2?
>>
File: 1749119132904285.jpg (657 KB, 1920x1088)
657 KB
657 KB JPG
holy fuck the new captchas are a nightmare.
>>
File: 1737838292361703.png (2.23 MB, 1168x1752)
2.23 MB
2.23 MB PNG
>>
>>107576868
I like them desu, they tickle my brain.
>>
>>107576868
if you cant solve them in under 5 seconds it means you are most likely nonwhite and low IQ
>>
>>
File: 1759369506007860.png (1.88 MB, 1752x1168)
1.88 MB
1.88 MB PNG
shitty fingers were a part of the artists dataset
>>
>>107576632
do not redeem
>>
File: 1744160159718230.png (2.15 MB, 1168x1752)
2.15 MB
2.15 MB PNG
>>
chinese culture
>>
WHERE IS TRELLIS 2 COMFYUI?
Can someone give me comfy's phone number so I can ask him directly?
>>
File: ComfyUI_09230_.png (1.39 MB, 864x1280)
1.39 MB
1.39 MB PNG
>>
File: 1746942160172203.png (1.58 MB, 1168x1752)
1.58 MB
1.58 MB PNG
>>
File: 1759971936983067.mp4 (1.31 MB, 720x1072)
1.31 MB
1.31 MB MP4
>>
File: ComfyUI_00766_.jpg (1.94 MB, 2147x3138)
1.94 MB
1.94 MB JPG
>>107577065
May I have a cat box good sir?
>>
Can I load two different wan models at the same time?
Using SCAIL alongside the wan 2.2 low for example.
>>
File: 1757163593213217.png (1.89 MB, 1168x1752)
1.89 MB
1.89 MB PNG
>>107577129
>>
File: ComfyUI_01387_.jpg (3.12 MB, 2458x2458)
3.12 MB
3.12 MB JPG
>>107577230
i have not, willing to try.
>Still no /ai/ board.
>>
File: 1740691470037648.png (2.81 MB, 1120x1440)
2.81 MB
2.81 MB PNG
>>
File: 1753193272636742.png (1.63 MB, 1168x1752)
1.63 MB
1.63 MB PNG
>>107577247
https://files.catbox.moe/pezxmg.safetensors
https://files.catbox.moe/hv28nx.json
>>
>>107577294
Based
>>
File: 1747742322052045.png (1.72 MB, 1168x1752)
1.72 MB
1.72 MB PNG
>>
>>
File: 1750553648075410.png (1.97 MB, 1168x1752)
1.97 MB
1.97 MB PNG
>>
Bruh, I'm browsing the thread about kuroba app being fucked over by the captcha.

Then I see a familiar pattern.

This fucking schizo might also be the dev for the chance app.
>>
>>107577665
It's definitely the same schizo.
Surely this can be triangulated to find some more info that results in a permanent removal.
>>
>>107577665
>>107577689
Did you mean to post this to another thread?
>>
File: ZIT_00008_.jpg (796 KB, 1392x2400)
796 KB
796 KB JPG
Neat, this breast slider lora works nicely.

>>107577752
No, it's about whatever schizo that's been plaguing us the past week.
>>
File: 1752018008035589.png (263 KB, 1200x630)
263 KB
263 KB PNG
>>107576513
when z-image base releases, everything will be alright
>>
tried out wan 2.6 and it can do a good blowjob. neat. praying this shit is announced open source as a surprise please.
>>
File: ZIT_00036_.png (3.33 MB, 1392x2400)
3.33 MB
3.33 MB PNG
Even at .75 weight, the v2 controlnet for zit obliterates the quality..

>>107578110
We're getting two chink models based on grok, this month? What a time to be alive.
>>
What's the best UI to get into if you just want to make meme/porn videos and images?

Hopefully easy to use. I have 16 vram if that's a factor.
>>
>>107578342
comfyui.

whether it's easy or medium hard is a matter of personal perspective. it's not a smartphone app for the lowest common denominator.
>>
I hate to install a whole node pack to use just one node and not have enough information of how to use many of them.
Comfy is so fucking awful to use, wish we had a better UI for generating images
>>
File: 1747613399816693.png (87 KB, 1165x620)
87 KB
87 KB PNG
>>107578453
https://github.com/comfyanonymous/ComfyUI?tab=readme-ov-file#comfyui-manager
>>
>>107578453
there's already good solutions for the cumfart poothon dependency hell problem
https://github.com/FizzleDorf/AniStudio
one of the anons is creating a c ui fully independent from poothon node hell, it's very solid. the only reason it's not in the op is because comfy schizo is pooping her pants because of its existence
>>
>>107578479
I wasn't asking you, schizo
>>
>>107578482
t. schizo
ldg will be free from poothon hell
>>
>>107578488
>t. schizo
t. guy who made 70 posts in one thread and got all of them nuked
>>
>>107578491
not your imaginary "schizo", schizo
everybody is fed up with comfy and tRan niggerbakes
>>
>>107578500
https://desuarchive.org/g/search/tnum/107570316/deleted/deleted/
yeah that's totally not you haha you got me there
>>
>>107578511
yeah its different anons that got spam reported by ran (so meaning you troonjak schizo)
>>
>>107578500
>everybody is fed up with comfy and tRan niggerbakes
Still don't get whats the purpose of those schizo rentries. should remove them already, they are not related to the thread topic
>>
I love blonde foxes
I'm going to use comfyui
it's that simple
>>
>>107578581
gen them on interfaces that are not complete piece of shit. comfy is trash
>>
>>107578587
Give me a reason to, and I will
So far, these other outdated/abandoned/schizo options haven't given me a reason to switch.
It's really that simple.
>>
>>107578601
>Give me a reason to, and I will
he's too lazy for that
he'd rather delusionally hope that by spamming threads enough times, he will eventually get free labor and sell commercial licenses
>>
>>107578612
>he'd rather delusionally hope that by spamming threads enough times, he will eventually get free labor and sell commercial licenses
i'm tranjak schizo and i make up shit by the way
>>
>>107578617
yeah anon, the 70-message spam that was nuked in that thread and 80 nuked messages in two other threads were totally not you, hahaha!
>>
Guys we should support the violent african gangs that probably roam Trani's home city, and pay them to beat him up
>>
>>107578652
aren't posts like that against 4chan rules? sounds like instigation
>>
>>107578659
Nah, for it to be instigation I'd need to be talking about a human being
>>
>>107578659
I thought you were okay with that kind of stuff though? >>107559634
>>
>>107578672
retard
>>107578674
not me
>>
>>107578680
yeah haha the 150 nuked posts were definitely not you haha that's right you tell em
>>
>>107578680
We don't sign our posts here
>>
>>107578110
>praying this shit is announced open source as a surprise please.
they won't even release the deprecated wan 2.5 lol
>>
Why are Anistudio """fans""" so mentally ill?
>>
it's sad that the schizo tranjak is so deranged that ani even stopped posting here. he used to be so happy sharing his progress but ldg is not what it used to be. he's on discord now
>>
>>107578708
Cool, you should fuck off and stuff your mouth with trani's cock as much as you wish
>>
I think the schizo actually cried when the first nuke hit the thread. It's heartwarming.
>>
>>107578717
i disagree, we should be more united against ran's schizophrenia. she's very damaging to ldg with her niggerbakes
>>
>>107578724
Why?
The links warning about some avatarfaggot reatards make your axewound ache THAT hard?
>>
So... local diffusion, huh?
>>
>>107578798
we should use lmg until things calm down and niggerjak takes her meds finally
>>
File: anons_lora_00009_.png (764 KB, 832x1248)
764 KB
764 KB PNG
>>
>>107578806
Please keep making retarded posts
Makes tge moment you get jannied even funnier
>>
>>107578825
based. ldg will overcome schizo tranjak meltdowns and will be better than ever
>>
>>107578260
v2 cnet is broken, use v1 while we wait for the new one
>>
keep seething tRan. ldg will get through this
>>
File: anons_lora_00032.png (721 KB, 832x1248)
721 KB
721 KB PNG
>>
STOP
REPLYING
TO
IT
>>
>>107578863
niggerjak is crying
>>
File: anons_lora_00040.png (853 KB, 832x1248)
853 KB
853 KB PNG
>>
File: 1759218643590802.png (2.13 MB, 1344x1344)
2.13 MB
2.13 MB PNG
Where the fuck is the base model unironically?
>>
>>107578875
ranfaggot will just samefag if you don't reply to his threads as usual. just ignore the drama spitebakes
>>
File: file.mp4 (576 KB, 576x416)
576 KB
576 KB MP4
>>
>>107578863
>>107578875
>>107578930
Why are you replying to your own posts?
>>
>>107578920
post lora
>>
>>107578941
only one of those posts is mine, don't know about others
>>
>>107578967
Sure, I'll remember to act surprised when they all get deleted at the exact same time
>>
>>107576396
>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
local diffusion?
>>
"a woman is looking at herself in a mirror"

Impressive.
>>
File: 1758544964955559.png (205 KB, 1005x777)
205 KB
205 KB PNG
>>107578981
yes, local diffusion
>>
>>107578986
second one is hot
>>
For zimage turbo is FlowMatchEulerDiscrete the best? FlowMatchEulerDiscrete and clownshark sampler some people say

In my experience euler simple just works, there was one more similar combo that seemed like 3% more clear on details that I found but didn't want to change to in case it makes some other seeds or concepts worse.
>>
get it before it's deleted
https://civitai.com/models/1696517/ig-hottie?modelVersionId=2511469
>>
>>107579031
>3 comically bad example gens
>trained on pony outputs
hmmm... nyo
>>
File: ZIT_00038_.png (3.25 MB, 1392x2400)
3.25 MB
3.25 MB PNG
>>
https://apple.github.io/ml-sharp/
>We present SHARP, an approach to photorealistic view synthesis from a single image. Given a single photograph, SHARP regresses the parameters of a 3D Gaussian representation of the depicted scene. This is done in less than a second on a standard GPU via a single feedforward pass through a neural network. The 3D Gaussian representation produced by SHARP can then be rendered in real time, yielding high-resolution photorealistic images for nearby views. The representation is metric, with absolute scale, supporting metric camera movements. Experimental results demonstrate that SHARP delivers robust zero-shot generalization across datasets. It sets a new state of the art on multiple datasets, reducing LPIPS by 25–34% and DISTS by 21–43% versus the best prior model, while lowering the synthesis time by three orders of magnitude.
>>
>>107576396
BOOBA
Also sick ass RoboCop art.
>>
>>107579036
Why do Indians refuse to understand the concept of GIGO?
If you train a better model with shit images, it won't "uplift" the shit data, you would only ruin the better model.
But I suppose their thoughts aren't more complex than bob and vagen.
I also think this shit got worse last few months. Civit always had a sea of garbage loras, but I feel like I am not imagining the uptick of 512p, slop dataset loras.
>>
File: 1764423350851144.jpg (379 KB, 1024x1536)
379 KB
379 KB JPG
>>107576513
nice to see openai topping the charts
>>
>>107579065
ENHANCE
>>
File: WanVideo2_2_I2V_00024.mp4 (710 KB, 832x480)
710 KB
710 KB MP4
>>
>>107579031
actual garbage
>>
Qwen with 8 step lora is about as fast as Z and is better with some prompts. But it also has sloppier visuals.

What's your favorite Qwen realism lora?
>>
>>107579113
qwen cant do realism, it looks like plastic no matter what
>>
>>107579031
this is the worst thing ive ever seen in my life
>>
wan 2.6 live

https://www.youtube.com/watch?v=Dp-CMOo-kOc
>>
>>107579135
>implying i have that much vram
>>
is 12gb vram enough for video generation or will I be severely limited?

16 is entry level right?
>>
File: qfqfqgg.jpg (1.87 MB, 2784x4800)
1.87 MB
1.87 MB JPG
Can't wait for base and exploring artists.
>>
Does ComfyUI work well on Cachy+AMD GPU?
>>
im training a lora of a building for the first time. wish me luck
>>
File: Untitled.png (3 KB, 419x59)
3 KB
3 KB PNG
wat do? 0 results on the internet.
>>
>>107579135
holyfuck! Does anyone speak Chinese? Will it be open-sourced? What are the requirements?
>>
>>107579220
For wan 2.2 24gb (as well as 64gb of sys mem) is needed for sane speeds.
16gb will be better but don't expect it to run fast.
12gb is doable I suppose if you can wait 10 minutes for a video but I find it not worth bothering.
>>107579236
>Cachy
That's just Arch under the hood, I run it fine on Arch so yes
>AMD GPU
Ayyymd sucks for AI
>>107579237
GL anon, unusual concept for a lora.
>>
>>107579270
what UI should you be using if you are running 16vram 32memram?

can this do 1080 or 720p?
>>
>>107579135
>api
>>
what if all these Chinese models are intended as a weapon to turn generations of Western men into useless gooners?
>>
>>107579292
it is what it is
>>
>>107579292
when we reach realtime 100% convincing generations of anything of any length civilisation will be over.
>>
how can i do this shit in image into video shit in comfyui without looking like ass
>>
>>107579340
someone will just make an AI that identifies AI gens

or someone will make a law that requires AI gens to have a watermark symbol or something
>>
>>107579340
>when we reach realtime 100% convincing generations
give it a year or two
>>107579374
>someone will just make an AI that identifies AI gens
then someone will just make an ai that can avoid being identified, the computational cost of this cat and mouse game will not be worth it
in the end, the internet will be viewed as a casino where everything is assumed to be fake and if you get tricked by an AI then thats just part of the experience
>>
>>107579270
>Ayyymd sucks for AI
Fuck, sucks that's still the case. Hoped they would have something going for it by now.
>>
File: 1748493105905617.png (2.01 MB, 1344x1344)
2.01 MB
2.01 MB PNG
>>107579292
i hope that's true because then they would have a final solution to the random access memory problem
>>
>>107579292
They've already achieved that with tiktok.
>>
File: Z-image turbo.png (1.6 MB, 1280x720)
1.6 MB
1.6 MB PNG
>>
File: 1741635413785080.png (395 KB, 668x390)
395 KB
395 KB PNG
has anyone trained a style lora for z-image? how many images do you generally need? i was thinking of making one of this swedish comic book illustrator i like
>>
>>107579574
Animate it and it's basically a mobile game ad



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.