[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107675287

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>Z Image Turbo
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>WanX
https://github.com/Wan-Video/Wan2.2
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2485296
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe|https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: Qwen Image v2.png (1.71 MB, 928x1664)
1.71 MB
1.71 MB PNG
>>107679926
Qwen Image v2 soon(TM)
https://xcancel.com/bdsqlsz/status/2004771274573381772#m
https://xcancel.com/cherry_cc12/status/2004036904417927376#m
>>
>>107679975
if they call it v2 it means it's a new model right? I wonder if they managed to make it less than 20b this time
>>
>>107679975
it still looks quite fake. why do they all get so glossy and wax-y around the chest?
>>
>>107679975
>another bloated piece of shit
i don't care. z-image when
>>
>>107679975
>>107680001
>it still looks quite fake.
yeah, that won't beat z-image turbo, especially if the model is bigger
>>
File: 1757148853163608.mp4 (501 KB, 832x480)
501 KB
501 KB MP4
man, wan 2.2 is so good.

8 steps, high lora: kijae MoE distilled, low lora: wan 2.2 lightning low noise

it just works.
>>
>>107679975
>bdsqlsz
posting things from this guy should be an instant ban.
>>
>>107680032
Complaining about news should be an instant ban.
>>
>>107680044
Not understanding Chinese culture should be an instant ban.
>>
>>107680026
what does low lore mean
>>
>>107680066
lora, wan 2.2 uses high and low noise models separately + individual loras
>>
File: Z-image turbo.png (2.18 MB, 1536x864)
2.18 MB
2.18 MB PNG
>>107679975
>>
Does nag actually work for wan?
>>
>>107680026
Workflow?
>>
>>107679957
Tanks 4 bake
>>
>>107680162
it's just the wan 2.2 template workflow in comfy, check templates
>>
>>107680026
wassup with kijae on high instead of using lighting for both
>>
>>107680162
load image -> wan 2.2 video -> save video
load prompt -> |
load high lora -> |
load low lora -> |
>>
>>107680171
No it isn't fuck you liar piece of shit
>>
>>107680172
there was a new 2.2 lightx2v lora and it had issues, kijai fixed it (the high one)

https://huggingface.co/Kijai/WanVideo_comfy/blob/main/LoRAs/Wan22_Lightx2v/Wan_2_2_I2V_A14B_HIGH_lightx2v_MoE_distill_lora_rank_64_bf16.safetensors
>>
File: zit_lora_00038_.png (1.15 MB, 1280x720)
1.15 MB
1.15 MB PNG
>>
>>107680179
post workflow benchod
>>
>>107680176
it literally is, just use the MoE lora for high and 2.2 lightning for low. 4 steps high (so 0 to 4) and 4 low (4 to 8)

models are wan 2.2 high and low Q8.
>>
>>107680186
Do you even understand how easy it would be to prove your point? You could post a paste or the image with the workflow, but you didn't because you are lying
>>
>>107680198
Are you the that jeet tranny spamming, but now changing tactics?
There's nothing remarkable about that video gen.
>>
File: 1764892011083487.png (720 KB, 2307x948)
720 KB
720 KB PNG
>>107680198
why do people refuse to listen, just use the fucking template. stupid adderall dependent generation.
>>
File: 1736233028505742.mp4 (870 KB, 832x480)
870 KB
870 KB MP4
anyways.
>when your friend says he wants to suck your dick:
>>
>>107680217
I'll assume its the same schizo that gets mass deleted, so nothing new
>>
>>107680217
he right though i can't find this template
>>
>>107680222
nah its just a lazy newfag
>>
Switchover between diffusion on gpu and cpu vae, thought it looked neat.
>>
>>107680181
:)
>>
>>107679975
qwen tongue my anus
>>
has anybody tried feeding danbooru tag .csv to an llm to compose noobxl prompts?
>>
>>107680495
>noobxl
>>
>>107680516
yeah? it's still the best local anime model
>>
>>107680495
I haven't tried, but since I do not trust LLM's complex information retrieval in such large data, I think I would do it a little differently.

I'd have a text file with one tag per line, and then I'd ask an agentic AI (e.g. claude-code, codex, gemini-cli) to come up with a prompt, and then check if each tag it used actually exists by grepping the text file, and if it doesn't exist it needs to try grepping different synonyms/alternate wordings.

If it can't find any synonym, you can just let them as is hoping it's in the training data anyway, or you can ask it to try something different instead.
>>
>>107680529
Actually, for the first verification, you could simply write a script that checks if every tag in a prompt actually exists. Then for whatever tag doesn't exist, the AI can try grepping various synonyms, I guess.
>>
File: 1751821676488552.mp4 (1.04 MB, 640x640)
1.04 MB
1.04 MB MP4
behold, fent science: (qwen edit to make it, wan to animate it)
>>
>>107680575
Wow! Hilarious!
>>
File: 1760483640933255.mp4 (834 KB, 640x640)
834 KB
834 KB MP4
>>107680579
do not question the science anon
>>
>>107680026
Can it run on 24+32 GB?
>>
>>107680583
Yes.
>>
>>107680583
well im using a 4080 which is 16gb so easily, using q8 even: you can use ram to load the rest of the model into memory and the speed is still fine. (q8 is like 20gb or something)
>>
>>107680582
>>107680575
Handsome astronaut *****/*****
>>
File: 1744992644848690.mp4 (687 KB, 640x640)
687 KB
687 KB MP4
>>
hilarious gens
>>
File: JLC Lewd.webm (3.91 MB, 1024x1024)
3.91 MB
3.91 MB WEBM
Hey, what ever happened to that guy that bought a $2,000 (or whatever) "RTX 6000 Pro" from eBay?
>>
>>107680666
He's dead.
>>
>>107680666
>666
>>
>>107680521
I mostly use Wai. I've never gotten decent-looking pics when I tried Noob, and Newbie looks similarly bad. Am I supposed to actually include dumb terms like masterpiece, 4k, best quality, etc?
>>
>>107680682
wai is based on noob, not illustrious. people made some tag checks like year tags and "very awa" tags and it's pretty much confirmed that wai model creator is just a liar
>>
>>107680666
He's body parts in someone's freezer
>>
>>107680682
you're supposed to use artists. fuck any genner who only use base style slop with wai or any other illustrious shitmix
>>
>>107680696
Nah thanks, I don't care about this shit that much. I'm creating slop, not art pieces, I don't give a shit about "artists" . Go to /ic/ or /a/ if you're into this
>>
>>107680575
>>107680582
That watermelonium is too brightly colored he should have let it congeal
>>
>>107680705
I'm doing God's work, perfecting acestep, since the chinks have abandoned it.
>>
>>107680705
your kind disgusts me
>>
File: 1740393412042229.webm (2.67 MB, 640x640)
2.67 MB
2.67 MB WEBM
okay enough of fent man

I think 8 steps (4/4) vs 4-6 is a good tradeoff. more fluid/less blurry.
>>
>>107680705
you've never shared a gen here, what makes you think you can tell others if they belong lmao
>>
>>107680705
trvke
failed wannabe artists on suicide watch
>>
I wonder if God wants me to make AceStep135 into Udio at home. I'm getting tired, and I've basically worked all day trying to improve gens.
>>
>>107680179
Why lora, can't it be merged into the model?
>>
>>107680696
I do use artists, but it seems to work better with Wai still, so I was wondering if I was just missing a trick somewhere for Noob usage. Maybe I'll play with it again tomorrow.
>>
>>107680734
[horrible noises]
ok that wasn't it.

The thing about audio is you have to listen, you can't just see an xy plot and know what's happening. And there's no preview.
>>
>>107680779
the REAL chad is tuning acestep. None of that faggy video noncense
>>
>>107680779
i'd use neutral style shitmix in your case then
https://civitai.com/models/1208658/manticore-v-pred-traditional-artstyle-merge-noobillustrious
https://civitai.com/models/1201815?modelVersionId=1491533
>>
>>107680790
huh. that sure isn't MUSIC
>>
this spammy retard is the scab guy right? why doesnt he drop the namefag? is this a sign common for all low iq retards like him?
>>
File: inpainted_00025_pp.png (3.21 MB, 1344x1728)
3.21 MB
3.21 MB PNG
>>107680779
I've tried NoobAI a bunch and I have a much better experience with stable slopmixes. Maybe it's because I don't seed hunt much and instead tweak my tags and tag weights a lot.
>>
File: 1730142790107818.png (705 KB, 832x1248)
705 KB
705 KB PNG
>>
>>107680815
>posts the absolutely sloppiest anime garbagèwith the worst chara of any anime ever
wow gj bro
>>
This is the base (or large finetune) model prompters general we established this a thousand threads ago
>>
File: file.png (1.41 MB, 1056x1344)
1.41 MB
1.41 MB PNG
>>107680819
#inspirational
Finally I see the light!!!!
>>
>>107680815
The image is comfy but shitmixes suck ass compared to Noob v-pred 1.0. The only thing they are better at is backgrounds, but SDXL sucks for backgrounds anyway.
You and >>107680682 should read this:
https://d0xb9r3fg5h.feishu.cn/docx/YpOQdtHTDoetcZxIO9fc33onnee
>>
>>107680815
ani himself said that noobai is outdated and too hard to use, he usually leans on illustrious mixes. we're all tired of sdxl though, newbie shows promise but the inference is slow
>>
>>107680852
Silence, noobschizo.
>>
>>107680857
LMAOOOO
>>
>>107680857
>implying anyone cares about your pathetic opinions
lolcow and i don't even use noob
>>
>SDXL
where did all these newfrens come from ?
>>
"Noobschizo" was right. You laughed at him yet now you cling to your downstream mixes. Next time heed his words.
>>
sdxl won. 10b+ params and 3dkeks are still generating plastic garbage. sdxl holds 1000+ styles yet flux/z hold 10 at best. chroma is melted garbage that isn't even relevant anymore. 3 more years of sdxl
>>
File: image.png (2.07 MB, 2304x1792)
2.07 MB
2.07 MB PNG
>>107680852
I think it's just something about the way I prompt.
>>
File: testing.png (3.12 MB, 2000x1300)
3.12 MB
3.12 MB PNG
I need some background character creation for something I'm working on. I don't have much experience using AI but I tried google geminai and I'm blown away at how good it does what I want with pic related as an example of "add more people". However, there's a huge issue of the image being downscaled. This example isn't that great to demonstrate that but if I render a 4k image and do the same thing, it is downscaled to 1k which is unusable. Would running a model locally fix that? Are local models good enough to replicate this? It's essentially perfect besides not remaining in the same resolution as my original image
>>
>>107680914
>his neck

lmao



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.