/g/ - Technology


File: 1761253481986025.jpg (2.65 MB, 3835x3471)
Cautious GenFlare Anticipation Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107548966

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
1~h before the meltie starts.
>>
File: 1750304252349694.png (1.79 MB, 1536x1024)
>>
Asian women fuck NIGGERS they are RICE BUNNIES they want BIG BLACK COCK


This message has been sponsored by Chroma™
>>
>>107557293
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
baste
>>
File: 1749804876345632.png (209 KB, 2244x1009)
https://github.com/huggingface/diffusers/pull/12839
Qwen Edit 2511 is coming
>t. don't care that much since I know Z-image edit will be better and unslopped
>>
>>107557330
what a faggy pr, I think ZIE is going to be after christmas, so this should tide us over for a bit
>>
File: 1752484342058259.png (2.14 MB, 3202x1422)
>>107557330
>>107557344
Z-image base will be able to do edit and I bet even this unspecialized model will be better than QiE kek
>>
File: 00298-1393174673.png (1022 KB, 896x1152)
>>
File: zimage dalle.png (2.29 MB, 1536x1024)
>>
>>107557406
i actually vomited
fuck you
>>
>>107557360
>I bet even this unspecialized model will be better than QiE
Won't even be close. We're getting the pretrained Omni base before it goes through SFT and before the steps that make it look photorealistic: RLHF and GRPO.
Refer to pages 21-24 of the paper for the full process
>>
>>107557413
>We're getting the pretrained Omni base before it goes through SFT
i surely hope so. it's been a while since we had a true non-distilled base model release that can be properly trained and used as a blank canvas. midwits that want something that produces photoreal 1girls out of the box don't understand
>>
>>107557413
it's good news desu, a completely untamed Base model ready to be trained the way we want. now the question is, can we reach Turbo's level with community finetuning?
>>
File: 1763277967267835.png (75 KB, 286x301)
>>107557115
>>107557432
https://xcancel.com/bdsqlsz/status/1984268208594104544
holy
>>
>>107557413
wait, so they distilled z-image first and then applied RLHF? how did they do that? I thought finetuning a distilled model was almost impossible
>>
>>107557495
Agree. I do like a good polished model (was hoping they'd release something like turbo without distillation alongside the foundational omni) but on a larger scope to truly move away from SDXL for good, the foundation model is required.
>>
chroma z when?
>>
>>107557413
I wonder how great Z-image turbo would've been if it wasn't distilled before doing the RLHF stuff
>>
>>107557413
will whatever they're gonna be releasing be of any use to us normies or is it only for the vramchads who're gonna use it to train their own checkpoints?
>>
How do we fight the splitbaking menace?
>>
>>107557614
Normal seed variation, less sameface and pose, better uncommon concept understanding, though realism would suffer and it would take more steps. If they wanted to train it for higher quality output at fewer steps they could but:
>"It [turbo] achieves 8-step inference that is not only indistinguishable from the 100-step teacher but frequently surpasses it in perceived quality and aesthetic appeal."
>>
>>107557643
baking right now
>>
rebaking
>>
File: file.png (88 KB, 424x863)
what's the deal with this?
i'm relatively new to comfyui, does it unload the model after every batch or something?
is there a way to keep it in memory if so?
>>
rabaking

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon

is off topic.
>>
>>107557302
you were right on the money
>>107557781
share your workflow
>>
>>107557730
>It [turbo] achieves 8-step inference that is not only indistinguishable from the 100-step teacher
I don't believe that, there was this new distillation paper showing that D-DMD is not the optimal distillation method, let alone something that would be equal to something not distilled
>>
>>107557807
>>107557807
>>107557807

move. also tell the ranfaggot he is a a fag for starting a flamewar
>>
https://rentry.org/ranfaggot
>>
>>107557815
Thanks for exposing yourself in the collage.
>>
>>107557820
see >>107557818
>>
>>107557820
ani is right. stop using troll bakes. just let ran shit his diapers i stead
>>
>>107557826
>>107557835
>the repost spam has started
another day of ani choosing to spend his valuable time on earth having a melty instead of turning off the computer. blessed
>>
>>107557781
you're probably caching your prompt.
>>
File: file.png (1.34 MB, 2241x1527)
>>107557790
i made a custom DeepSeek API-connected conditioner node that fleshes out prompts and translates them to chinese automatically, but the issue was happening prior to that.
the Load LoRA (DistillPatch) is this https://modelscope.cn/models/DiffSynth-Studio/Z-Image-Turbo-DistillPatch -- just a normal Load LoRA node renamed.
i'm using an RTX 3060 12GB.
>>
>>107557850
>>107557824
>>107557831
>>107557820
>>107557815
>>107557790
>>107557788

go here this thread is radioactive
>>107557807
>>107557807
>>107557807
>>
>>107557850
plz read before posting in dramabakes
https://rentry.org/ranfaggot
>>
>>107557859
based
>>
>>107557850
My guess is that you don't have enough VRAM to hold DeepSeek, Qwen and ZiT all at once, so when it takes less time, it's because you're re-using the same prompt and don't need to load DeepSeek (which would dump ZiT from VRAM).
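A toy sketch of the guess above, not how ComfyUI actually manages memory: a fixed VRAM budget with least-recently-used eviction, plus a prompt cache that skips the prompt model when the prompt repeats. The model names and the sizes in `SIZES_GB` are hypothetical placeholders, not measurements.

```python
from collections import OrderedDict

# Hypothetical sizes in GB, chosen only for illustration (not measured).
SIZES_GB = {"prompt_llm": 6.0, "text_encoder": 4.0, "diffusion_model": 7.0}


class ToyVram:
    """Toy VRAM model: evicts least-recently-used models to stay under budget."""

    def __init__(self, budget_gb):
        self.budget_gb = budget_gb
        self.resident = OrderedDict()  # name -> size in GB, oldest first
        self.loads = 0                 # number of (slow) loads from disk

    def use(self, name):
        if name in self.resident:
            # Already in VRAM: mark as recently used, no load needed.
            self.resident.move_to_end(name)
            return
        size = SIZES_GB[name]
        # Evict the oldest models until the new one fits.
        while self.resident and sum(self.resident.values()) + size > self.budget_gb:
            self.resident.popitem(last=False)
        self.resident[name] = size
        self.loads += 1


def simulate(prompts, budget_gb=12.0):
    """Return how many model loads each generation triggers."""
    vram = ToyVram(budget_gb)
    seen_prompts = set()
    loads_per_run = []
    for prompt in prompts:
        before = vram.loads
        if prompt not in seen_prompts:
            # New prompt: the prompt model has to run (and be loaded).
            vram.use("prompt_llm")
            seen_prompts.add(prompt)
        vram.use("text_encoder")
        vram.use("diffusion_model")
        loads_per_run.append(vram.loads - before)
    return loads_per_run


if __name__ == "__main__":
    print(simulate(["cat", "cat", "dog"], budget_gb=12.0))
    print(simulate(["cat", "cat", "dog"], budget_gb=24.0))
```

With the 12 GB budget a repeated prompt triggers no reloads, while a new prompt pushes everything out and reloads it; with a budget large enough to hold all three models, only the first run pays the load cost.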
>>
Imagine working for a year to defeat Comfy with your own UI and you only get 33 stars
>>
File: file.png (142 KB, 1292x747)
>>107557859
>>107557854
i'm gonna be honest, i don't give a fuck.
it's 4chan. there's no "official" threads.
the other thread is claiming this one is fake too.
who gives a fuck one way or another, really?
just post useful shit. what's the point in discouraging people to participate?
what are you trying to control?
are you upset that you're not getting the dopamine hit from your thread of choice or the one you started doing well?
people are so fucking weird.

you can't put "started many successful 4chan /ldg/ threads" on a job resume.
>>
>>107557891
that makes a lot of sense actually.
time to research a way to fix this shit without buying a new GPU.
>>
>>107557896
This is the local schizo. /ldg/ is, unfortunately, spammed to oblivion by him. Best to just ignore him. Use whichever thread you prefer.
https://rentry.org/animanon
>>
>>107557788
>>107557815
>>107557818
>>107557844
>>107557854
you are such a subhuman faggot
>>
>>107557293
>>107557293
>>107557293

see >>107557859 and >>107549797

stop using dramafaggot bakes or this will continue
>>
>>107557922
what will continue?
>>
>>107557902
>chroma was a bust but what was the best version before it got bad? Is it v48?
New versions are working just fine. Mostly sidegrades to each other.
>>
>>107557894
when Comfy first started, i hated it
the creator and his simps were fucking annoying
constantly berating the pioneers of webui that came before them
advertising relentlessly on every AI thread
just being annoying shitters

i still don't like it compared to A1111 because it's still shit to use, but i conceded that it's where all the new shit is
Comfy is like the Windows of user interfaces.
>>
>>107557896
>i'm gonna be honest, i don't give a fuck.
then you allow it to continue by supporting a welfare nigger

>>107557929
you feeding the troll
>>
>>107557922
We can do this shit forever, Julien
And soon, I will find you and make your life so miserable you will spend the rest of it wishing for death
>>
>>107557938
>Julien
>>
>>107557936
>then you allow it to continue by supporting a welfare nigger
i got my question answered already in this thread.
it's what i was here for.
so... yea, i don't give a fuck about your welfare nigger.
>>
>>107557413
are you sure it's pre-SFT? it would be good for finetuning, but the community doesn't know how to finetune it correctly. overloading the model with booru 1girl, close up, esoteric background causes the model to be addicted to close ups and lose the ability to generate stable backgrounds
>>
>>107557933
>Comfy is like the Windows of user interfaces.
It really isn't. Comfy being the default UI for image gen is a disaster. It's as if Arch without install script somehow became the default OS for home use.
>>
>>107557946
>>107557955
you are such a subuman retard faggot
the rest of your life will be pain, and so much brain damage you won't be ablecomprehend it
>>
Ok how do we move forward? What are our options to salvage the general from tRANny?
Option 1: Placate him, add rentries to OP, don't splitbake. Could work for a shortwhile but unfortunately ran's dramafaggotry is well documented with his long, cursed history in this general so he will find something else to have a schizo meltdown about and the peace will be very short-lived.
Option 2: Status quo, keep splitbaking whenever ran shits up the OP. Self-explanatory, inefficient and annoying, not really a solution but at least we would be standing up to him.
Option 3: Some fucking how convince the jannies (who notoriously give zero fucks about AI threads) to range ban ran. I don't think he is tech savvy at all so it should shut him up. Not gonna happen but one can dream I suppose.
Do we just like keep calling him a faggot here or is there anything we can actually do?
>>
>>107557954
Nobody is sure of anything, it's speculation and assumptions based on their naming - "Omni", and the position of "Omni" in the graphs from the paper that place it before the finetuning steps, and the wording from the blog post.
https://tongyi-mai.github.io/Z-Image-blog/
>A foundation model designed for easy fine-tuning
What concerns me is that all that amazing autistic prompt following capability comes from both SFT and, according to the paper, the GRPO, DPO and RLHF. If the community cannot do a basic SFT, I doubt many know how to implement the later RL stages.
>>
>>107557972
4. you fucking kill yourself, Julien
>>
>>107557520
lmao chinks waste no time making videos of their women getting railed by blacks
>>
>>107557954
>>107557967
>>107557986

>>107557807
>>107557807
>>107557807
>>
>>107557997
Why should I go there? I'm a tourist from lmg. Last time I was here was over a year ago
>>
File: 1753781375892166.png (98 KB, 757x428)
>>107557997
why you keep reposting shit from this thread or the archive, you subhuman retard? >>107558002
>>
>>107557360
Julien copypasted your post >>107558010 >>107558016
Don't forget to report his dogshit thread
>>
>>107558009
You're talking to the local spammer. You can get the essentials about him in the OP rentry https://rentry.org/animanon
>>
File: 1748591240190997.png (155 KB, 792x696)
more shamelessly copied posts from Julien >>107558017


