/g/ - Technology

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107040459

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Neta Yume (Lumina 2)
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd
https://gumgum10.github.io/gumgum.github.io/
https://neta-lumina-style.tz03.xyz/
https://huggingface.co/neta-art/Neta-Lumina

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
Cookies
>>
FIBO (new image gen model, json-based) is out:

https://huggingface.co/spaces/briaai/FIBO
https://huggingface.co/briaai/FIBO

From my tests it's a bit less aesthetically slopped than Flux and Qwen, but can't do anything "copyrighted"
>>
>>107045555
>generate a beautiful representation of the local diffusion general thread from the 4chan image board
>>
>>107045562
Considering it is using an LLM under the hood, it looks like incoherent garbage
>>
Cursed thread of anti-comfy samefag
>>
>>107045562
Suspicious lack of cunny in this representation desu
>>
>>107045599
maybe but you aren't human if you don't hate on comfy
>>
>>107045551
This isn't true just because you say it is lmao, there's tons of nodes that do all sorts of different shit that I can just run stuff through as a batch in a very specific order. The only programs that can actually do the same shit I'm talking about are like e.g. ChaiNNer which literally also uses a node based system.
>>
I HATE SPAGHETTI
>>
I noticed some models (variations of noobai) will do expressive/creative pics without overcooking the prompt, but look like shit aesthetics wise, or a lot of malformed shit, and on the other side of the spectrum (illustrious variations) they do a very clean and polished artstyle, but come out with dead expressions and basic angles and such.

Why is AI so dead bros?
>>
File: 1735622926127100.png (51 KB, 806x404)
>>107045555
IT'S
SHIT
>>
stoopid frog poaster
>>
>>107045665
I love Apu posters so much bros
>>
>>107045555
the json thing seems like a good fit for danbooru style tags so you could hotswap parts of the composition etc, too bad the general tags are not labeled in any way aside from wiki groups
>>
>>107045628
so just ai stuff? Gimp can batch edit shit faster and easier than noodling some garbage together. this also sounds like custom nodes, comfy alone is barren when it comes to doing anything else
>>
>>107045672
run it locally, then?
>>
>>107045720
I have 16gb (4080) and am trying not to OOM but we'll see
>>
>>107045533
Imagine how much better the world would be today if the top left corner of pic related was accurate. Shit would get done.
>>
File: briaai, only for fagets.png (52 KB, 1078x834)
>>107045555
Wow it's great
>>
>>107045732
but how would they suck dick
>>
>>107045746
the deer should burst into flames upon handling
>>
File: media_1761616262.png (1.07 MB, 768x1280)
>>
>>107045555
Seems OK I guess, I don't think it's quite as detailed and crisp as Flux Krea though. The JSON thing seems a bit cumbersome also.
>>
>>107045555
Does it run locally in anything other than Diffusers? (Their Comfy node is an API node)
>>
>>107045562
>trained exclusively on long structured captions.
Cum on.
>>
>>107045672
>>107045746
Can you guys cut it out? Feeling pretty unsafe here...
>>
>>107045794
>thats not a failgen

thank you for the encouragement.
>>
Some rare seeds just have faster motion
>>
>>107045533
Is it safe to use local diffusion on my local/only computer or could the power required for it fuck my computer up?
>>
>>107045665
Because it's a tool of interpolating between known images.
>>
>>107045819
>Their Comfy node is an API node
it's over
>>
>>107045750
there's other holes.
>>
File: chroma___0111.png (1.57 MB, 832x1216)
>>107045200
i did this before and it came out crap. i want to give it another shot. did you use screencaps from a blu-ray rip or old vhs caps?
>>
A few threads back some anons were asking for AceStep 1.5 gens (from the current training run), so here goes:

https://vocaroo.com/16OrwGV5Wqi5
https://vocaroo.com/1b2iSwhkZN3y

I strongly believe it will match or even surpass Suno once they finish the later training stages
>>
>>107045901
They did the same thing with Bria 3.2 when they released the weights: provided local Diffusers code but a Comfy node that just called the API on their commercial website, not sure why
>>
>>107045890
have a good quality psu and nothing will blow up
>>
>>107045899
Sometimes I do an img2img with one of the creative pics I like to try and improve the style, and the following model completely kills the soul in it.

For example illunoob 2.0 > diving illustrious
>>
>>107045941
>Buy etc 5090 for $2400
>Sell my rtx 3080ti for $450
>Use a "wellness" credit from work worth 2k that I get to spend on anything that makes me happy (tax free too)
>5090 is free
>Get 2% cash back on the credit card purchase
>Anon seethes because I'm not buying his SaaS bullshit
>>
>>107045941
> one of the creative pics
Generated?
>>
>>107046046
amazing what someone can do when they're not an attention-seeking, tech illiterate trannoid
>>
>>107045914
I used blu-ray screencaps. The best possible quality for dataset, no matter what
>>
>>107045899
Source?
>>
>all the piss and golden shower loras on civitai are gone
it's over
>>
>>107046032
>>107046065
Wtf are these two niggas talking about.

>>107046046
Yeah, lets put as example, "asanagi bukkake", noobai ones will tend to do more expressive and dynamic pics than illustrious, yet are messy and not so good artstyle wise as diving illustrious
>>
>>107046085
> yet are messy and not so good artstyle wise
Soul.
>>
>>107046076
the girl on the right isn't even looking at her own phone, this isn't close to reality at all.
>>
what's with random comments from older threads being spammed here, is this one of them schizos I keep hearing about?
>>
>>107046095
It's souly but it's chroma-esque in how there's always something entirely too fucked up, like hands. I wish I could get "diving illustrious flat anime" levels of polish without sacrificing creativity. I'd post some examples here but blue board.
>>
>"clean and polished" style = good
I hate that I share this earth with you.
>>
>>107046123
Then kill yourself and stop sharing it, duh :P
>>
>>107046109
spambot that comes and goes
>>
File: chroma___0118.png (1.76 MB, 832x1216)
>>107046068
thx, maybe that's where i messed up trying to do too much at once.
>>
File: FluxKrea_Output_26621.png (2.4 MB, 1248x1872)
>>
>>107046168
Take your time with dataset. Gather around 50-60 good images first, crop them by hand to 1:1 and then caption whatever way you want. Start training and don't stop until it works. It's much better when you have success under your belt, it will demystify the whole thing
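
If you want a starting point before adjusting each 1:1 crop by hand, here is a minimal sketch that only computes the largest centered square for a given image size. The function name is made up for illustration, and it assumes the subject is roughly centered, which is often not true for screencaps, so treat the result as a first guess and not a replacement for manual cropping.

```python
def center_square_box(width: int, height: int) -> tuple[int, int, int, int]:
    """Return (left, top, right, bottom) of the largest centered square.

    Hypothetical helper: it only does the arithmetic, it does not touch files.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    side = min(width, height)
    left = (width - side) // 2
    top = (height - side) // 2
    return (left, top, left + side, top + side)


# A 1920x1080 screencap keeps the middle 1080x1080 region.
print(center_square_box(1920, 1080))  # (420, 0, 1500, 1080)
```

The tuple is in (left, upper, right, lower) order, which is the order Pillow's `Image.crop` expects, so it can be passed straight to an image library if you use one.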
>>
File: chroma___0139.png (1.71 MB, 832x1216)
>>
File: media_1761616219.png (1.38 MB, 768x1280)
>>
ive been making videos exclusively of 1girl or 2girl but more and more i am liking the idea of ugly fat bastard with girl or pathetic nerd with girl for the self insert
>>
>>107046483
>1girl
>picture of ugly pathetic bastard nerd (You)
>stitch them side by side
>????
>profit
>>
>>107046314
nigger just give up with this shit
>>
It's not him most of it is his disabled attack dog. When he's actually active his bitching and moaning is more targeted and he seethes out another anon's name especially when he's drunk
>>
File: ComfyUI_20584.png (2.83 MB, 1200x1800)
>>107045890
When I first got my particular 4090 I had no idea it could draw 700W or that Stable Diffusion (even back in the simpler 1.5 days) would suck as much power as it could out of the system during inference and it repeatedly tripped the breaker because my PSU wasn't up to snuff. Upgraded to a 1.3kW from an 850 and haven't had a problem since.

So like >>107045928 said, just make sure you have a quality PSU.
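
A rough way to sanity-check PSU size, as a sketch only. The 700 W GPU peak is the figure from this post; the 250 W for the rest of the system and the 80% maximum sustained load are assumptions picked for illustration, not measurements or a vendor recommendation.

```python
def recommended_psu_watts(gpu_peak_w: float,
                          rest_of_system_w: float,
                          max_load_fraction: float = 0.8) -> float:
    """Estimate a PSU rating that keeps peak draw under a chosen load fraction.

    max_load_fraction is an assumed rule of thumb (0.8 = run the PSU at
    no more than 80% of its rating during peaks).
    """
    if not 0 < max_load_fraction <= 1:
        raise ValueError("max_load_fraction must be in (0, 1]")
    return (gpu_peak_w + rest_of_system_w) / max_load_fraction


# 700 W GPU peak (from the post) plus an assumed 250 W for CPU, drives, fans.
estimate = recommended_psu_watts(700, 250)
print(f"suggested PSU rating: about {estimate:.0f} W")
```

Under those assumptions the estimate lands well above 850 W and below 1.3 kW, which is consistent with the upgrade described above.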
>>
File: me.png (1 MB, 768x1344)
>>107046498
hmm
>>
Why does Stable Video Infinity exist? Examples I've seen of people using it still have the stutter between clips. Zero consistency with motion throughout the video. It's far worse than SkyReel.
>>
>>107046582
Wait for the proper comfyui integration. The loras we have now only kinda work with kijai's wrappers
>>
>>107046631
why isn't it in yet? why is it always API nodes first priority for these fucks
>>
>>107046653
why do you care, you don't gen?
>>
>>107046562
visible square grid
>>
>>107046709
>elbows too pointy
>>
>illustrious still king after more than a year
what happened to all the progress?
>>
>>107046298
Good tips
>>
>>107046841
all of the new models are huge and most people are still vramlets so the majority sticks to XL based models
>>
>>107046841
NetaYume is slower than SDXL but basically as close as you can get size-wise and speed wise while actually still having a worthwhile arch. A lot of it is due to the better text encoder and VAE too.
>>
>>107046884
No, dont act like theres an alternative, chroma IS GARBAGE FOR ANIME AND YOU KNOW IT, qwen I never tried but it cant be much better

t. 24gb VRAM still sticking to 6gb illustrious models
>>
>>107046946
This. There's little reason to use Illustrious anymore.
>>
File: 132.png (809 KB, 844x763)
>>107047015
Can it do proper artist styles like asanagi? Im gonna download it and if its worse than illunoob 3.0/diving illustrious im gonna shit on it till the end of days.
>>
>>107046946
I hope there's a chance of Netayume realistic finetunes, realistic XL models all kinda suck
>>
>>107047046
With 1.6k images on Danbooru I assume yes. But now I have a feeling it'll be too difficult a model for you to handle so please ignore my post and do not use it.
>>
File: 1746266180296430.png (1.96 MB, 1728x1344)
>>107046841
because it has sovl. technological progress alone can't create models with sovl and can cause regressions too

>>107045555
this actually looks really good IF you are using it for corpo or government work where you need to abide by copyright laws. The JSON captions sound awesome if you're building an API around this model.
>>
>>107047075
>its not bad you just cant use it
C'mon now. Anyway I'll give it a chance, its still downloading.
>>
>>107047075
>too difficult a model for you to handle
I think "shit" is the word you're looking for. The model is shit.
>>
>>107047046
it can do asanagi. could do better though, this model needs more training.
>>
Are there "long video" tweaks but for wan2.2?
Everything released seems to be for wan2.1 and it's a bit annoying.
>>
File: 1759233081363475.mp4 (1.07 MB, 832x480)
the man stands up and runs down a sunny beach on the left.

2.2 MoE kijai lora high, 2.2 lightning low, 1 str each, new high lora seems to work well. shift is 8 (from 5)
>>
File: blk.jpg (209 KB, 832x1216)
>>
I'm trying out an XL model (snakebite 2.1) and all outputs I get with the suggested settings look like fluorescence microscopy, any idea what could cause that? Obviously all other XL models and XL based ones I used so far worked.
>>
>>107047101
Many were/are filtered by Noob for this reason, especially in the beginning, even though now it's not controversial to say it's the local anime SOTA.
Anyway, good luck and make sure to check out the links in OP for the Yume prompt book and what not.
>>
File: media_1761776205.png (1.1 MB, 768x1280)
>>
>>107047067
it would make more sense to just do a realistic tune of the Lumina 2.0 base model without the significant amount of anime-specific fine tune on top of it I think

>>107047121
some artists work better at different resolutions than others TBQH, it depends on how their work was originally uploaded. NetaYume works best at a bit higher than typical SDXL resolutions though overall, I like 968x1322, 1024x1536, and 1280x1536 for portrait at least. Generally it's coherent up to at least 1536x1536 for one-shot gens, sometimes higher.
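
To put those sizes side by side, here is a small sketch that prints pixel count and reduced aspect ratio for the resolutions named above, with 1024x1024 added as a stand-in for a "typical SDXL resolution" (that baseline is an assumption, the post does not name one). The `describe` helper is made up for this example.

```python
from math import gcd

# Resolutions mentioned in the post, plus 1536x1536 as the stated one-shot ceiling.
NETAYUME_SIZES = [(968, 1322), (1024, 1536), (1280, 1536), (1536, 1536)]
SDXL_BASELINE = (1024, 1024)  # assumed baseline for comparison


def describe(width: int, height: int) -> dict:
    """Return megapixels (2 decimals) and the reduced width:height ratio."""
    divisor = gcd(width, height)
    return {
        "megapixels": round(width * height / 1_000_000, 2),
        "aspect": f"{width // divisor}:{height // divisor}",
    }


for w, h in [SDXL_BASELINE, *NETAYUME_SIZES]:
    info = describe(w, h)
    print(f"{w}x{h}: {info['megapixels']} MP, aspect {info['aspect']}")
```

The portrait sizes listed work out to roughly 1.3 to 2.0 megapixels, against about 1.05 for the assumed 1024x1024 baseline.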
>>
>>107047159
this image does not look better or worse than the last one you posted really. Also you can't just post coom here, you're gonna get three-dayed lol
>>
File: 1752484266541100.png (48 KB, 1746x202)
New optimization seems broken.
Can someone try to gen anything using --fast pinned_memory with the latest comfy nightly?
>>
How the shit do I do video with neoforge?
>>
>>107047110
nah it's good, and not really hard to use, just leave the boilerplate prompts where they are in the pos and neg and don't forget to put `@` in front of your artist tags. Pretty simple.
>>
oh brother. not another one
>>
>>107047219
what if it's just glue?
>>
did I get banni
>>
>>107047227
Clearly you haven't read the documentation.
>>
>>107047227
how is she standing looking at viewer hands on own hips in one and lying on stomach in the other if it's the same prompt?
>>
File: ComfyUI_20596.png (3.15 MB, 1200x1800)
>>107046709
Yeah, it's the VAE I use. While it enhances my LoRA that was trained with an EQ VAE, it "sharpens" the output just enough to pull that grid out of SRPO/other LoRAs. The guy suggests not using his EQ VAE for inference, but the color is much more vibrant and my LoRA holds up much better (even when the face is very small!) that I prefer it to the default even with the extra noise.
>>
>>107047261
hollero im gonna cook an actual nsfw comparison
>>
>>107047267
sfw*
>>
>>107047267
we all already know that both of these models can do NSFW properly when you prompt them in a not retarded way, we don't need a comparison lol
>>
File: illunoob vs netayume.png (2.17 MB, 2560x1280)
>asanagi, blonde, milf, frontal shot, white tank top, jeans, upper body in frame
>>
>>107047289
He's gunna do it anyway and then whine because his horrible prompt and settings don't look good. Such is the way of imagen.
>>
>>107047315
>>107047289
please stop shilling this netayume garbage
>>
>>107047312
oof nice tits on the right
downloading it right now
>>
>>107047312
read https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd
>>
>>107047344
not even the pics they have in that site are good lmao
>>
>>107047312
these are both plausibly asanagi TBQH, I dunno what your point here is really.

if that was the actual prompt you're still not really prompting NetaYume quite right though. Just go look at the user-upload gallery on Civit for V3.5.

>>107047327
>won't say why or how exactly it's garbage
>>
>>107047312
damn yume obviously has far superior prompt comprehension in that comparison.
you should listen to this guy >>107047344 though.
>>
>>107047386
>Just go look at the user-upload gallery on Civit for V3.5.
It's all garbage, all of it. I think this model might just be useless.
>>
>>107047386
>>won't say why or how exactly it's garbage
because when he types "1girl big boobs" into his shitmix it looks sooooooo good but when he does it in yume it suckss!!!!! >:(
>>
>>107047344
i wouldn't say EVERYTHING in this guide totally applies to NetaYume's most recent versions honestly, but it's a good starting point. The model is more flexible prompt-wise than they imply I think. Overall the most important things IMO are: don't delete the Gemma prompt boilerplates, and do use either Res Multistep Linear Quadratic or DPM++ 2S Ancestral Linear Quadratic at around CFG 4.5 - 5.5.
>>
https://huggingface.co/morphic/Wan2.2-frames-to-video
>>
>>107047399
the guy who actually posted the comparison literally posted images that could believably have been from the same model they were so aesthetically similar though, I don't really get it
>>
>>107047413
I agree though I use a different sampler and scheduler. Once you get the hang of it you can really do whatever you want promptwise just like other models.
>>
>>107046946
Still fucks up things such as hands though right? What's the point.
>>
>>107047474
Not as much with a good prompt. But to imply XL does good hands more often without detailers is funny.
>>
>>107047496
I mean, why should I stop using IL if the other one is just as bad? It's like picking up a fancy SDXL finetune.
>>
>>107047515
If you feel it's bad then you shouldn't use it, I don't care. But the only thing IL has on Yume is that it's been out for longer and thus more people have raped it. In all other respects, Yume is superior. Your same argument was made when IL first released and look at what everyone's using now.
>>
File: 00024-4174496661.png (3.54 MB, 1560x2328)
>>
File: illunoob vs netayume 2.png (2.44 MB, 2560x1256)
>>107047424
>the guy who actually posted the comparison literally posted images that could believably have been from the same model they were so aesthetically similar though, I don't really get it
Bro what? WHAT? That is absolutely, positively not what im seeing here
>>
>>107047312
>>107047560
This comparison means nothing unless you use regular Illustrious 0.1 or even 2.0. You're comparing a mix to a bare finetune. It's not apples to apples.
>>
>>107047590
hmm, fair enough
>>
>>107047560
Also I hope you're using 3.5 and not 2 like your filename implies. Regardless, all Lumina models, not just Yume, don't look as good unless you use the initial boilerplate prompt, because it's using an LLM as the text encoder and not CLIP like XL.
>>
File: 1750623449579380.mp4 (3.92 MB, 744x808)
>>107046578
heheh
>>
if this is still asanagi have you actually looked through his 79 or so pages of shit on Danbooru? His style isn't THAT consistent, these images all fall within stuff he's uploaded at various times to my eye
>>
>>107047142
Post catbox.
>>
>>107047515
>why should I stop using IL if the other one is just as bad? It's like picking up a fancy SDXL finetune.
IL is just a fancy XL finetune fucking KEK
>>
>>107047142
based on the CivitAI page the person who made this model doesn't seem to understand that BigASP V2.5 did not train the text encoders AT ALL (whereas BigASP V2.0 did, a lot). Beyond that he's merged a flow-matching SDXL model with a not-flow-matching-one in a weird way so there could be lots of things wrong.
>>
File: cat bateman.jpg (56 KB, 1170x1132)
Did you ever get "what the fuck am I doing with my life" feeling while slopping?
Happened to me today.
I fear wasting my life on garbage.
Like even when you can gen a genuinely good looking image no one gives too much shit since it is slop.
No amount of wf/prompt optimization changes the underlying fact that it is a slot machine.
And for coom gens, same principle why gooners are mocked applies.
I keep getting a strong urge to delete everything AI related on my computer and do NNN.
>>
>>107047678
The point is that unless I see a tangible difference, swapping over my workflow feels like too much work. I don't want to swap unless the images I get from swapping are pristine quality, but that's not what's gonna happen. They will be equal, and maybe they might be better, but there's a speed cost. A good model should need as little supervision as possible from me, I shouldn't have to fix its mistakes.
>>
File: emberassing.png (14 KB, 448x658)
>>107047708
And yes sorry for the faggy blogpost but I needed that out.
>>
File: 00273-1925284612.jpg (261 KB, 1248x1848)
>>
Been trying all day to get comfyui to work on my
>gaming laptop

lmao takes 30s to render a shitty 480x720 clip after using an all in one work flow with no adjustable loras.

Fml is there no way to generate good shit as fast as grok with my setup?
>>
>>107047726
>tangible difference
I mean, even the fact that its vae is VASTLY superior to XL's... how is that not a tangible difference? Not even mentioning Gemma kicks CLIP's ass or the fact that it can do higher res outright.
>swapping over my workflow
I use the exact same workflow as I did for Illust. The only difference is the prompt, it was painless.
>>
File: 1761774105935020.png (1.37 MB, 1024x1344)
>>107047708
no, I gooned my way into 6 figure pay and I hope you get the same opportunity at some point.
>>
>>107047764
I meant 30 MINUTES lol
>>
>>107047764
genning videos on a laptop is like tryna fuck an obese bitch with your micropenis; pointless
>>
>>107047708
do it
>>
someone got tired of waiting for nunchaku devs to implement wan2.2
https://github.com/Disty0/sdnq?tab=readme-ov-file
https://huggingface.co/Disty0/Wan2.2-I2V-A14B-SDNQ-uint4-svd-r32
>>
>>107047708
its fun to mess around with high creativity settings and just spin the wheel for a fun goon, but if you're putting in excessive amounts of effort into it, you gotta rethink your life
>>
ani and comfy are basically examples of when you win the jackpot
>>
>>107047708
>I fear wasting my life on garbage.
no need to fear anon you are actually wasting your life on garbage.
>>
>>107047764
>as fast as grok with my setup
that's a hard no
>>
Oh shit this guy is based kek

https://huggingface.co/Disty0/Chroma1-HD-SDNQ-uint4-svd-r32

Chroma bros we are so back!
>>
I'll wait until FurkGOD weighs in on the issue.
>>
>>107047838
chroma cannot afford to be dumbed down any further
>>
File: hmmm.jpg (2.1 MB, 2560x1280)
>>107047726

NetaYume positive:
`You are an assistant designed to generate anime images based on textual prompts. <Prompt Start> @asanagi, a cartoon of an overweight nerdy man wearing a fedora and sweating and shaking as he sits at a desk in his darkened bedroom and stares at his computer monitor with a deranged expression on his face. The computer monitor is facing away from the viewer and seen from behind. There is a pepe the frog poster on the wall behind the man. A speech bubble coming directly from the man's mouth reads "OH FUCK, I'M COOOMING!"`

NetaYume negative:
`You are an assistant designed to generate low-quality images based on textual prompts. <Prompt Start> worst quality, lowres, sketch, greyscale, monochrome`

IlluNoob positive / negative: same as above but with Gemma boilerplate removed and no @ sign for the artist tag

IlluNoob didn't even succeed at NOT making the image black and white here lol
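
For anyone scripting this, a minimal sketch of the prompt layout used above: the Gemma boilerplate, then `<Prompt Start>`, then `@`-prefixed artist tags and the description. The two prefix strings are copied from the prompts in this post; the function names are made up for illustration and are not part of any NetaYume or ComfyUI API.

```python
POSITIVE_PREFIX = (
    "You are an assistant designed to generate anime images "
    "based on textual prompts. <Prompt Start> "
)
NEGATIVE_PREFIX = (
    "You are an assistant designed to generate low-quality images "
    "based on textual prompts. <Prompt Start> "
)


def artist_tag(name: str) -> str:
    """Prefix an artist name with '@' unless it already has one."""
    name = name.strip()
    return name if name.startswith("@") else "@" + name


def build_positive(description: str, artists: tuple[str, ...] = ()) -> str:
    """Boilerplate + artist tags + free-text description."""
    tags = ", ".join(artist_tag(a) for a in artists)
    body = f"{tags}, {description}" if tags else description
    return POSITIVE_PREFIX + body


def build_negative(tags: list[str]) -> str:
    """Boilerplate + comma-separated negative tags."""
    return NEGATIVE_PREFIX + ", ".join(tags)


print(build_positive("a cartoon of a man at a desk", artists=("asanagi",)))
print(build_negative(["worst quality", "lowres", "sketch"]))
```

Keeping the prefixes as constants makes it harder to delete the boilerplate by accident when swapping prompts.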
>>
>>107047838
I don't get
>>
>>107047795
Maybe the gayming laptop guy has a chance after all
>>
>>107047881
near lossless 4bit quants for a bunch of models
https://huggingface.co/collections/Disty0/sdnq
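
Back-of-the-envelope numbers for why 4-bit matters to vramlets, as a sketch only. The 14e9 parameter count is an assumption loosely read off the "A14B" in the Wan2.2 repo name, and the estimate covers raw weights only: it ignores quantization scales, any low-rank branch, activations, the text encoder and the VAE. It says nothing about how SDNQ itself works.

```python
def weight_gib(num_params: float, bits_per_weight: float) -> float:
    """Raw weight storage in GiB for a given parameter count and bit width."""
    if num_params < 0 or bits_per_weight <= 0:
        raise ValueError("num_params must be >= 0 and bits_per_weight > 0")
    return num_params * bits_per_weight / 8 / 2**30


ASSUMED_PARAMS = 14e9  # hypothetical figure for illustration

for bits in (16, 8, 4):
    print(f"{bits:>2}-bit weights: {weight_gib(ASSUMED_PARAMS, bits):.1f} GiB")
```

Under that assumption, going from 16-bit to 4-bit cuts raw weight storage to a quarter, which is the difference between not fitting and fitting on a 16 GB card before any other overhead is counted.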
>>
>>107047795
>>107047838
These are not SVD (A4W4) quants though.
It says SVD is implemented on the github but I am not seeing it among the two examples you posted.
>>
>>107047838
Someone now just needs to tell the guy about HD Flash, or maybe we could do it ourselves? https://github.com/vladmandic/sdnext/wiki/SDNQ-Quantization

>>107047855
Oh yes it can. HD Flash -> nunchaku for SDXL speeds.
>>
>>107047903
https://github.com/vladmandic/sdnext/wiki/SDNQ-Quantization
>>
Here are some benchmarks
https://github.com/vladmandic/sdnext/wiki/Quantization#benchmarks
not as fast as nunchaku but still a 2-3x speed up
>>
>>107046946
Last time I tried 3.5 it couldn't gen gacha characters with sub 500 pic datasets
>>
>>107047943
like?
>>
>>107047943
>sub 500 pic
did you also describe the characteristics or did you only prompt the name
but i also wouldnt be surprised if some random gacha characters werent included in his dataset considering the way he describes it
>>
>>107047838
how is this any better than other chroma projects?
>>
>>107047997
read nigga read
>>107047914
>>107047930
>>
>>107047943
How reliably can you gen sub 500 characters on base il 0.1 tho
>>
>>107047890
>XL quants
holy poorfag i couldnt imagine needing that
>>
>>107046946
It needs a lightning lora asap.
>>
>>107047708
>Did you ever get "what the fuck am I doing with my life" feeling while slopping?
No, it's fun tinkering. Anon, there are people who enjoy making legos or solving random puzzles, I just consider what i'm doing as a hobby.

>Like even when you can gen a genuinely good looking image no one gives too much shit since it is slop.
Do you make images/videos for your hobby enjoyment or to get a dopamine hit of people congratulating you?

>And for coom gens, same principle why gooners are mocked applies.
Again, who gives a shit about "mocking" if you do something you enjoy.

>do NNN.
Ah yes, the latest protestant culture ramadan.
>>
>>107047890
>HunyuanImage3 still too big
reee!
>>
File: ComfyUI_06578_.png (1.48 MB, 1200x896)
>>
>>107047874
NovelAI 4.5 didn't do as well as I expected on this one. More accurate Pepe (but in the wrong place), that's about it
>>
>>107048123
novelai still uses shitty T5. They will prob switch to gemma / another VLM for 5, I assume that is why they only went from 4 to 4.5
>>
File: images(1).jpg (19 KB, 457x437)
>>107047621
Stop coping, illustrious just works, netayume doesnt look good even when you throw a whole bible of prompts onto it, how exactly is the text encoder in SDXL worse when it just works the way its intended?
>>
>>107047874
If you wanted to do this type of non nsfw stuff you'd use chroma, not netayume or illustrious
>>
>refresh civitai
>ball punching lora and ball kicking lora for wan i2v
can someone explain why someone would want their balls punched or endure any kind of physical abuse targeted towards their balls in any capacity?
>>
>>107047874
Soul vs soulless
>>
File: ComfyUI_06590_.png (1.46 MB, 1200x896)
>>
>>107046841
The two big models chroma and pony were trained with yolo retard methods based off of vibes instead of following the documentation. Neither model should have the issues that they are having but they both decided to obscure tokens and do random shit. Pony v7 is a fucking embarrassment and the horse fucker faggot scammed the retards that supported him
>>
>>107048227
The real cope is thinking CLIP is as good as Gemma.
>how exactly is the text encoder in SDXL worse when it just works the way its intended?
So does Gemma. It seems as if you don't really understand the discussion.
>>
>>107048289
I agree with pony, you are wrong about chroma, chroma is great and we had no idea a better model that was not distilled that is qwen image was going to release after
>>
https://huggingface.co/valiantcat/Qwen-Image-Edit-MeiTu
>>
>>107048290
You've been hammering down your point of netayume being better for nsfw/anime but its just not better, like not even close to illustrious.
>>
File: ComfyUI_06596_.png (1.65 MB, 1200x896)
>>
>>107048296
>can't stick to a single style because it's slopped after talking and following pony fag
>lied about artist tags being present when he obfuscated them
>fails at 2D due to consistency
Chroma failed at doing what it set out to do
>>107048306
You're comparing a base model to a finetuned model with way more iterations of training, the only real flaw is the text encoder which should be better
>>
newfag here. Where exactly do you start with turning images into videos with prompts like grok imagine? Is there like a popular model everyone likes to use now or something
>>
>>107048306
Cool comparison you posted to illustrate your point.
>>
>>107048324
cool furk
>>
File: ComfyUI_06598_.png (1.17 MB, 1200x896)
>>107048340
kek'd
>>
>>107048241
does chroma now manage to pull off consistent-across-seeds anime styles?
>>
File: chroma.jpg (570 KB, 1376x2304)
>>107048328
>can't stick to a single style
That is the point of a 'base' model, if it preferred a style that would be a huge issue
>lied about artist tags being present when he obfuscated them
you are thinking about pony, not chroma
>fails at 2D due to consistency
wha?
>>
>>107048331
what GPU do you have? if its not a 5090/4090/3090 you can forget about it.
>>
>>107048367
>this fucking retard again
We're not doing this, we already had this conversation. You already got told and you fucked off when asked to post 5 back to back images of the same seed and you shat your pants after another anon proved you wrong.
>>
>>107048227
no one is coping lmao, the only person other than me who actually did a comparison just posted several very simple side-by-side examples where both images looked like something the 1000+ upload artist in question could have actually done
>>
>>107048379
>You already got told
lol rewriting history did we? You could not show me a single better gen from any other model that is not a style finetune
>>
>that reddit gen
I hope he doesn't seriously believe that a model being able to generate a consistent artist style is the same as it being overtuned to whatever is the authors preference kek
>>
>>107048332
>post comparisons
>noo not like that thats just better cause its fine tuned you see that model had more iterations and fine tunes and this and that-
>>
>>107048241
now THIS is coping kek
>it's actually bad that the model has excellent prompt adherence, which could never be relevant for NSFW surely!
>>
>>107048328
Not this shit again
>You're comparing a base model to -
No one is going to finetune this shit, just like how no one finetuned chroma. Bonus failure points since this was supposed to be "great" at weebshit and it isn't.
>>
>>107048399
Dude theres not even a single pic in civitai where netayume looks better than illustrious, and when I did nsfw it was UGLY, do you not understand what that means? If I want to make meme shit then ill just get a 22gb chroma model and it will be infinitely better than netayume, but STILL INFERIOR FOR HENTAI TO ILLUSTRIOUS
>>
>>107048394
Which legit comparison proved that base illustrious is better?
The rest of your post further illustrates that you don't understand the discussion if you are the same anon kek
>>
>>107048394
the only actual comparisons in this thread were by me (who did "i'm cooming" guy, and isn't the same anon you just replied to) and the anon who did booba lady earlier.
>>
>>107048373
How bad is it on a 3070...
>>
>>107048289
If it's so easy to deslop Flux or train an 8B model why don't you do it anon?
>>
>>107048413
>No one is going to finetune this shit
So you don't realize that Yume is already a finetune of Neta which is a finetune of Lumina? There's no reason for anon to take your post seriously.
>>
>Faggot is recycling his old bit to derail the thread
You really need a new fucking hobby dude you recycle the same bait every fucking day
>>
its just one guy attacking literally every new model, they are just trying to discourage new models period, ignore the troll
>>
>>107048418
are you the same guy who actually seemed to believe that his asanagi 1girls showed any kind of noteworthy difference at all? If so there's your problem, if not I don't really understand what you're talking about. If you give a "known good results to you on XYZ Illu model" NSFW prompt I'll try it on Neta though.
>>
>>107048437
Fact of the matter is, last model we got worth a damn for artist styles was SD 3.5 Medium, but it was a broken weight. Still, if community figured out way to work around it we'd be eating good.
>>
>>107048435
>8gb
>slow
OH IT'S BAD. better get a job boy
>>
>>107048423
>>107048433
These two:
>>107047560
>>107047312
Stop coping, I dont need to "understand" anything, you just kept saying "nyooo use netayume its better than illustrious" but then I ran a bunch of pics and it sucks.
>>
>>107048435
its not going to be fun for you.
>>
File: ComfyUI_06519_.png (1.55 MB, 896x1200)
>>
>>107047708
>I fear wasting my life on garbage.

same it needs to be at least twice as fast so I have life left over
>>
>>
>>107047289
>when you prompt them in a not retarded way
>>107047315
>He's gunna do it anyway and then whine because his horrible prompt and settings don't look good. Such is the way of imagen.
>>107048477
>Stop coping, I dont need to "understand" anything, you just kept saying "nyooo use netayume its better than illustrious" but then I ran a bunch of pics and it sucks.

jej
>>
>>107048477
(again I'm not the same person you just replied to) as I said before you don't know what you're talking about if you really believe that ANY of the four images you posted are actually particularly "bad" or "good" outputs for a very simplistic `asanagi` tag positive prompt, kek
>>
>>107048497
migu :(
>>
File: 105865337.jpeg.png (3.62 MB, 1536x2304)
If you put in the work, you can make netayume look like a quantized version of low step illustrious pic, believe in your dreams and never give up!!
>>
>>107048514
>heh youre just not smart enough to use my model
Yeah, I think ill stick to chroma for the retarded shit you do, and illustrious for quality anime, those seem to work just fine on my tiny brain
>>
File: 1759468372916869.png (3.93 MB, 1416x1888)
>>
>>107048597
STILL not the same guy as the guy you just replied to, you ARE dumb though if you literally refuse to pay any attention to the most basic recommendations on how to use a given model, even down to sampler choices.

>>107048581
I dare you to box this
>>
File: 1747996523859824.png (3.68 MB, 1416x1888)
>>
Neta yume is not good
There, I said it
>>
File: FluxKrea_Output_121515.jpg (3.1 MB, 2048x2048)
>>
I'm torn between linear quadratic and bong tangent desu
>>
I wish your caretaker would pull the plug on your internet
>>
>>107048649
1schizo, trolling at reader, vramlet psyop
>>
>>107048655
for Yume? I think res2s bong tangent was actually even better for fingers and text than DPM++ 2S Ancestral Linear Quadratic when I tried it once. Slower still though. And both of those are slower (but better) than Res Multistep Linear Quadratic.
>>
>>107048639
>I dare you to box this
Anon, it's from civitai, they all look like low step quants of better models
>>
Who knew skill issues could manifest in such a way
>>
>>107048743
true, whoever made netayume has a severe skill issue
>>
>>107048743
Ran cannot post anything positive but he needs to snicker and criticize others.
>>
>>107048639
he scrolled down past all the good gens to the bottom of the civitai page to find it kek
>>
File: 1745645481851985.png (3.86 MB, 2640x1328)
>>107048303
why are they comparing against fp8, that's not a fair comparison at all
>>
>>107048804
they're comparing prompt adherence not general quality
>>
File: 1740861920763018.jpg (2.04 MB, 7961x2897)
>>107048815
the prompt adherence gets worse with worse quants though, that's the fucking problem
>>
>>
File: 107878412.jpg (954 KB, 1248x1824)
>>107048782
>netayume
>good gens
Bro? Where is it? This is like 2gb illustrious tier
>>
>>107048671
What do you keep your sampling shift at or do you even use it at all?
>>
>>107048671
>>107048835
nta but normal sampling shift doesn't do anything to res/bongmath samplers, you have to use "ModelSamplingAdvancedResolution" node
>>
>>107048829
I dare you to box this one too
>>
>>107048871
I dare you to find one pic on civitai that doesnt suck
>>
>>107048671
>res2s
res3m should be faster for an equivalent result
>>
File: FIBO.jpg (3.08 MB, 1664x1216)
>>107045794
tried a few more, it's coherent enough. I sincerely doubt it has any NSFW knowledge at all though.
>>
use case for flux krea?
>>
>>107048891
Just use it
>>
File: 1757269461063003.png (248 KB, 1956x1636)
>>107045555
lol
>>
>>107048876
I mean like every other similar troll you're never going to actually show a specific example of what YOU supposedly think a "good gen" is
>>
>>107048828
Nice. It's refreshing to see a Radiance gen that doesn't look like it's only half converged.
>>
>>107048906
>HiDream superior to flux dev
that's how you know it's a mememark
>>
>>107048870
>"ModelSamplingAdvancedResolution" node
>
 cannot access local variable 'sampling_base' where it is not associated with a value 

Dang it
>>
>>107048906
Now compare them all to SDXL
>>
>>107048303
is it a finetune of the original QIE or the 2509 version?
>>
>>107048891
better prompt adherence than normal Flux, relatively better understanding of styles, MUCH better out of the box realism (in most cases it's kinda like what Asian Waifu Chroma Schizo claims Chroma is for gens that aren't NSFW enough to be disallowed on /ldg/, except actually and without negative prompts existing at all)
>>
>>107048946
can you say that again in English?
>>
>>107048931
the Full was better than Flux Dev but not in a way that really justified it since it was still extremely similar looking to Flux Dev
>>
>>107048941
what the hell, works out of the box for me
>>
>>107048945
Original.
>>
New, I have a general question so apologies if this was already answered fifteen-thousand times before.

I'm trying to generate Img2Vid stuff (furry slop lmao) and after days of fucking about with Comfy I came to this from a youtube video.

https://files.catbox.moe/x5uk9r.json

I think it might work but I keep getting torch out of VRAM errors trying to run it. I know the 4070ti and 64GB RAM I'm rocking isn't as powerful as a 4080 or 4090, but I figured 12gig of VRAM was enough if I did some scraping. Should I download a Q4 version of Wan, would that help? Anything else I could do to get this thing working?
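
For a rough sense of whether a Q4 file would help with the out-of-VRAM errors, here is a back-of-the-envelope sketch of weight size at different precisions. The 14B parameter count, the ~4.5 bits per weight for a Q4 GGUF, and the 2 GiB headroom are all assumptions for illustration, not numbers taken from the linked workflow; activations, the text encoder and the VAE need memory on top of the weights.

```python
def weight_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


def fits(params_billion: float, bits_per_weight: float,
         vram_gib: float, headroom_gib: float = 2.0) -> bool:
    """True if the weights plus an assumed headroom fit in VRAM.

    headroom_gib is a hypothetical allowance for activations and
    other buffers; the real figure depends on resolution and length.
    """
    return weight_gib(params_billion, bits_per_weight) + headroom_gib <= vram_gib


if __name__ == "__main__":
    ASSUMED_PARAMS_B = 14.0   # assumption: 14B-parameter video model
    VRAM_GIB = 12.0           # the 12 GB card mentioned in the post
    # Bits per weight; the Q4 value is an assumed average for a Q4 GGUF.
    for name, bits in [("fp16", 16.0), ("fp8", 8.0), ("q4", 4.5)]:
        size = weight_gib(ASSUMED_PARAMS_B, bits)
        verdict = "fits" if fits(ASSUMED_PARAMS_B, bits, VRAM_GIB) else "does not fit"
        print(f"{name}: {size:.1f} GiB of weights, {verdict} in {VRAM_GIB:.0f} GiB")
```

Under those assumptions only the Q4 weights fit fully on the card; anything larger needs offloading part of the model to system RAM.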
>>
File: 1736538819902629.png (876 KB, 1750x1554)
have you guys seen this?
https://huggingface.co/lightx2v/Autoencoders
>>
>>107048907
Right right, im the troll, its not you who is shilling netayume like you got paid for it, despite there being 0 instances where its good
>>
>mentions ran for no reason at all
Like I said he always reveals himself, he's upset that his general is dead much. He will always engage in circular time wasting arguments because his life has no meaning.
>>
>>107048983
Don't care
>>
File: 1742938262905896.png (263 KB, 2513x1332)
>>107048983
you have to use this custom node to use it
https://github.com/ModelTC/ComfyUI-LightVAE
>>
>>107048983
you won't see anywhere near good performance unless you have a H100.
>>
>>107048995
if you weren't an extremely generic, mediocre troll you would lead in with an actual "positive" baseline for comparison
>>
>>107048974
even after updating it still doesnt work and theres no related github issues lole fuck me dang it
>>
>>107049027
Stop arguing with him, he does this every fucking day. He moved from the api spam to do this. Read the rentry in OP for Christ sake. He literally did this not even a week ago and tried his chroma troll with the same exact images only to gear shift after anon called him out. He's a low functioning autist
>>
>>107048983
>encode speed
>decode speed
the encode speed is when it's making the video and the decode is when it's converting the latent to pixels right?
>>
>>107048983
Wouldn't anything be fast with over 3TB/s of memory bandwidth?
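
A minimal sketch of the bandwidth argument: the time to read a given number of bytes from VRAM once can't be lower than bytes divided by bandwidth. The 1 GB working set and the 0.5 TB/s consumer-card figure are made-up illustration values, and this is only a floor for memory-bound work; compute-bound layers won't scale this way.

```python
def min_read_time_ms(num_bytes: float, bandwidth_bytes_per_s: float) -> float:
    """Lower bound, in milliseconds, on reading num_bytes once from memory."""
    if bandwidth_bytes_per_s <= 0:
        raise ValueError("bandwidth must be positive")
    return num_bytes / bandwidth_bytes_per_s * 1000.0


if __name__ == "__main__":
    working_set = 1e9          # assumption: 1 GB touched per pass
    fast_card = 3e12           # the "over 3TB/s" figure from the post
    consumer_card = 0.5e12     # assumption: roughly 0.5 TB/s
    print(f"fast card floor:     {min_read_time_ms(working_set, fast_card):.2f} ms")
    print(f"consumer card floor: {min_read_time_ms(working_set, consumer_card):.2f} ms")
```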
>>
File: GL_t98JW8AACFiV.jpg (6 KB, 250x250)
>>107049027
Diving illustrious flat paradigm shift > netayume

I have to go to sleep, tomorrow I'll shit on it a bit more, so you can cope about prompts and training some more, hope youre being paid well, shill.
>>
File: fibo.png (470 KB, 678x386)
>>107045555
Introducing... the sloppifier!
>>
File: a_beautiful_quee_c5ea3.jpg (817 KB, 896x1152)
>>107048471
>A beautiful queen wearing an ornate silver crown with diamonds and fleur d'lis. The queen is crying with her head in her hand, side view, tears running down her cheeks. She is lit by a candle with a castle wall in the background. A hint of Gothic in an Art Nouveau drawing.

SD3.5L.
Remember what they took from you.
>>
File: cute_skunk_in_a_0736f.jpg (585 KB, 896x1152)
>>107049073
>cute skunk in a wildflower field, whimsical style with small details and organic patterns, stylized, hand-drawn, vibrant, warm colors, #handcrafted #handmade

Actual sovl not possible with any of these modern models. It's so rare/impossible to see.
>>
>>107049073
>artist styles
>a hint of Gothic in an Art Nouveau drawing
erm... anon thats not a specific artist
>>
File: 1758581291413242.png (21 KB, 544x305)
>>107048982
this looks way too complicated, just use the default i2v model and add picrel node with offloading half the blocks to ram (blocks to swap value of 20), use two of them, one for high one for low
start with that and see if it works, then add whatever lora you want
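
The blocks-to-swap suggestion can be sketched as simple arithmetic: whatever fraction of the transformer blocks sits in system RAM is roughly the fraction of weight memory freed on the GPU. This assumes 40 equally sized blocks holding essentially all the weights and a ~13 GiB checkpoint, both illustrative guesses rather than values read from the node, and `resident_weight_gib` is a hypothetical helper, not part of any ComfyUI API.

```python
def resident_weight_gib(model_gib: float, total_blocks: int,
                        blocks_to_swap: int) -> float:
    """Estimate GiB of weights left on the GPU after offloading blocks.

    Assumes every block is the same size and that the blocks account
    for all of model_gib, which is only approximately true in practice.
    """
    if not 0 <= blocks_to_swap <= total_blocks:
        raise ValueError("blocks_to_swap must be between 0 and total_blocks")
    per_block = model_gib / total_blocks
    return per_block * (total_blocks - blocks_to_swap)


if __name__ == "__main__":
    MODEL_GIB = 13.0      # assumption: size of one high/low checkpoint
    TOTAL_BLOCKS = 40     # assumption: number of transformer blocks
    for swap in (0, 10, 20, 30):
        left = resident_weight_gib(MODEL_GIB, TOTAL_BLOCKS, swap)
        print(f"blocks to swap = {swap:2d}: about {left:.1f} GiB of weights on the GPU")
```

The trade-off is speed: swapped blocks have to be copied back over PCIe whenever they are needed, so a higher value saves VRAM but each step takes longer.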
>>
>>107049053
You never posted a gen because you know you would get exposed disabo
>>
>>107048982
yea if you correctly use block swap/vram management and unload models at the right time it should presumably work?

but i don't remember the details of a kj wrapper workflow, I use non-kj wan
>>
Is telling anon you'll be back tomorrow supposed to be a diss
>>
>>107049041
share your wf, I can take a look
>>
>>107049092
He's too disabled to know how to troll and gets mad when called out
Ignore him and read OP
>>
>>107049050
There is an infinite amount of newfags or just people who can't stop themselves from taking the bait.
>>
File: portrait_collag_8aaf5.jpg (706 KB, 896x1152)
>>107049080
>Portrait, collage of black block letters on a white background, letters of different sizes draw a dream portrait, light and shadow, low angle, fog, monochrome, kinetic art, Victor Vasarely, hyper detailed

>>107049081
Not my gens, saw them in SD3 post that I searched for due to nostalgia

https://wiki.monai.art/en/models/SD3_5_Large_and_Medium

So sad how far local has fallen. There used to be a time when local devs seriously cared about catching up with Midjourney, but they stopped caring.
>>
>moved to the next troll
>still unable to post his gens
We see your gens in /sdg/ it's obvious you don't have a pot to piss in
>>
>>107049084
nta but is this a new node and assuming I have to update something? I dont see it in my nodes list. Hopefully I can add more than 300 frames with this
>>
>>107049093
https://files.catbox.moe/bn2v7i.json
>>
>>107049008
>LightVAE nodes depend on WanVideoWrapper for main model support
pass
>>
>>107048983
yes ive seen many asian women specifically in this thread even
>>
>>107049127
it's a custom node : https://github.com/orssorbit/ComfyUI-wanBlockswap
>>
>>107047890
>>107048045
Hunyuan 3 at 4 bit is very bad, don't bother. It's not like it's easy to get good results even at 16, but at 4 it's a lot of low-detail mutants.
>>
>localpajeets coping with speedslop to make their already outdated models look even worse
Kek!
>>
>>107049008
lEmao
https://github.com/ModelTC/ComfyUI-LightVAE/issues/5
>>
>>107049186
Thanks, been looking for something like this. Not sure why comfy hasn't implemented a native version by now, not a fan of kijai's wan.
>>
>>107048982
install multigpu nodes
https://files.catbox.moe/p1n5b2.json
>>
>>107049103
SD 3.5 Medium could have been a good base but it was tricky to train, very tricky, so people got fed up quickly I think.
>>
File: file.png (3.96 MB, 1920x1080)
>>107045555
you neglected to post this sick pic from their huggingface
>>107048885
for 8B i dont think that looks too bad? idrk
>>
>>107045555
>json
why? what's wrong with plain text?
>>
File: 1745740118726756.png (1.35 MB, 1633x821)
Stress test prompt on FIBO.

>Detailed photograph RAW of seven smiling friends of different races that are at a nightclub concert with dim lighting that is shining on their faces, behind them is a crowd of people dancing while fighting with large swords, everyone is holding a sword in their left hand and an intricate beer glass with differently colored beer in the right hand. Far behind them above the DJ there is a sign which has "Minimum drinKing age 021!" written on it in stylized cursive letters.

Not too bad.
>>
>>107049217
it works with native workflows
>>
File: 1753013917378892.png (47 KB, 160x160)
>>107049239
damn this shit is slopped, and lmao at that
>>
File: 1730484764182722.png (1.67 MB, 1152x896)
>>107049239
>>
File: 544244564.jpg (244 KB, 1024x1024)
>>107049103
Krea output
>>
File: 1752989515459405.png (1.68 MB, 1152x896)
>>107049252
>>
File: 1761180927719504.mp4 (3.83 MB, 584x880)
>>107047265
jenny yum
>>
File: 121121151112.jpg (457 KB, 1024x1024)
>>107049263
>>107049073
>>
>>107049251
it doesn't seem like it's trained on synthetic data at all from my tests DESU
>>
File: Qwen Image.png (1.22 MB, 1664x928)
>>107049239
Qwen Image with the same prompt
>>
>>107049263
is this supposed to be bad? or are you a different anon. I think this one is cool TBQH
>>
File: qwen image 2.png (1.31 MB, 1664x928)
>>107049275
>>
>>107048982
>>107049084
here a catbox of a very simple 2.2 i2v workflow based on the example one for you to start with, you can enhance it down the line with more stuff if you want :
https://files.catbox.moe/on2e3y.json
>>
>>107049284
>>107049284
>>107049284
>>107049284
>>
>>107049239
>Not too bad.
Compared to most other models, that is, but still not great
>>
>>107049252
>>107049273
>>107048885
Great, we have caught up to Dalle 3 with this model, except there's no nudes or copyrighted content in the dataset so it isn't Dalle 3 tier...
>>
>>107049214

based


