[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>108013277

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>NetaYume
https://huggingface.co/duongve/NetaYume-Lumina-Image-2.0
https://nieta-art.feishu.cn/wiki/RZAawlH2ci74qckRLRPc9tOynrb

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg
>>
blessed thread of frenship
>>
>>108014925
>Flux.2 Klein
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
>>
>>108014970
Thank you for blessing this thread, anon
>>
File: ComfyUI_03498_.png (1.97 MB, 1515x1138)
1.97 MB
1.97 MB PNG
More wildcard stuff. I'm planning to create a tkinter gui for the software.
>>
>use z-image-base to diffuse the first 6 out of 30 steps
>use z-image-turbo to diffuse the last 8 out of 10 steps
>???
>profit
>>
>>108014964
cursed replier of hatred
>>
>>108014991
How do you handle the differrent cfg?
>>
>>108014991
in 2030 we will have a model for each step, mark my words
>>
File: ComfyUI_04688_.png (1.95 MB, 1535x1151)
1.95 MB
1.95 MB PNG
>>
>>108015000
Use same CFG (like 5.0) on both, but first use RenormCFG node and normalize to 1.00 on the turbo model
>>
so we just decided to accept ani bakes as inevitable and give up? maybe we should just allow anistudio into the op now?
>>
>>108014925
Why is AniStudio not in OP?
>>
I'm late, I just noticed that Z-Image non-Turbo has been released.
Can someone sum up the situation, please? I mean how it compares to Turbo.
I love Turbo but it lacks randomness, and it's very bad for img2img, is the non-Turbo better?
I see it's 28~50 steps vs 8... Ouch... :( Is it worth that extra time?
>>
File: o_00149_.png (1.7 MB, 1280x768)
1.7 MB
1.7 MB PNG
>>
kill ani
>>
>>108014989
Your brain became sludge, you're lost in the sauce.
>>
File: ComfyUI_02155_.png (1.76 MB, 1534x1150)
1.76 MB
1.76 MB PNG
Wildcards are cool way to create images.
>>
tran is a self fulfilling prophecy. he warned us the general would degrade and when it didn't he took it upon himself to destroy it just like the fighting game general
>>
>projection
Begging you for new material
>>
>>108015034
>>108014989
love those gens. very creative. something deboesque about them
>>
>>108015024
It has more variety (seed variance), but it's not as consistent as Turbo. It can do more than Turbo, but I'd still recommend using loras or waiting for a big finetune. And yes, it's slower.
>>
>>108015050
What is "debo"?
>>
File: cui_5107.png (3.51 MB, 1248x1824)
3.51 MB
3.51 MB PNG
The current lack of activity from anime checkpoint makers on Civit means they are all busy testing the waters to see which new model architecture they should adopt next.
>>
For you fags trying to use the new LTX-2 guider. These settings works very well for IMG2VID.
>>
>>108015055
catjaks's lover
>>
File: 83386147590893.png (1.35 MB, 896x1152)
1.35 MB
1.35 MB PNG
>>
File: 1744056043070239.png (253 KB, 520x600)
253 KB
253 KB PNG
>>108015023
>>
File: Untitled.jpg (1.01 MB, 1589x2023)
1.01 MB
1.01 MB JPG
>>108015017
No I won't apologize for the coomer prompt
>>
>>108015063
I'm making an anime checkpoint with Klein 9B base... for now I've released a lora as a work in progress for testing features (seeing what the base model can do). I've prepared a huge pretraining dataset and a 10k edit pair finetuning set. It'll be out soon.
>>
File: ComfyUI_03501_.png (1.55 MB, 1528x1150)
1.55 MB
1.55 MB PNG
>>
>>108015063
>anime checkpoint makers
you mean slop mergers? aint nobody training any model buddy
>>
File: z_image_turbo_bf16.png (3.01 MB, 1248x1824)
3.01 MB
3.01 MB PNG
>>108015063
Yes, many users explicitly asked WAI dev for their next model to be in ZiT.
>>
>>108015079
>he censored by hand
LOOOOL
>>
>>108015079
>wide hips in negative
philistine
>>
File: z-image-upscaled_00043_.png (1.85 MB, 1016x1776)
1.85 MB
1.85 MB PNG
>>108014648
nice 1girl
>>
>>108015094
>WAI dev
People still care about that slop merger?
>>
>>108015094
WAI "dev" is gonna have to wait for someone else to do the actual finetune so he could steal it from them like he did before.
>>
File: AI Magic.jpg (297 KB, 830x1245)
297 KB
297 KB JPG
>>
>>108014991
i prefer hires fixing 0.6 denois, 1.5x res with ZIT a full 30 step that I do a 0.9MP, takes from 60 to 70 seconds total
>>
File: cui_5108.png (3.85 MB, 1248x1824)
3.85 MB
3.85 MB PNG
>>108015094
That would be great, but he's a checkpoint merger, so I don't think he has a dataset, but rather borrows from Noob and Illustrious.
>>
Imagine losing sleep over a internet slap fight
Wouldn't be me
>>
>adt fags laughing at ldg in their previous thread
>same fags here now discussing anime
KEKEKEKEEKE
>>
>>108015079
was gonna say "Why she look like a russian bimbo" but that's what you prompted for lol.
>>
>>108015024
Non-turbo takes longer but image can be clearer and have more detail, like you can tell turbo to give background a texture but it just isn't there but with base it is
It sometimes mangles fingers and you need to include negative.
good with face portraits
It can produce ryan gosling
>>
is it better to caption images for z-image?
what resolutions is optimal or is it the same as turbo
>>
>>108015125
that might be absolutely shocking to your schizo ass, but people don't pledge allegiance to one general on the website
>>
>>108015125
I was wondering why there was so much slop 2.5d anime posted ITT since usually the anime posted here is really good
>>
> ltx2
> 19b
> still much shittier quality than 14b wan
>>
>>108015139
they do
>>
>>108015134
Why the fuck wouldn't you caption? And train on the biggest res you can handle.
>>
File: comfiui_00256.png (2.85 MB, 1536x1536)
2.85 MB
2.85 MB PNG
>>108015116
Even so, creating a good anime model takes a long time. It took WAI over a year to perfect his model, and the same amount of time for Noob, so the initial results are bound to be suboptimal.
>>
>>108015162
Seems illogical. Only use a suitable resolution.
>>
File: 1741600243947160.png (1.48 MB, 1023x941)
1.48 MB
1.48 MB PNG
>>
>>108015139
this poster ertainly does >>108015116
https://desuarchive.org/g/thread/108007937/#q108008834
>>
File: 408307578567044.png (2.84 MB, 1344x1728)
2.84 MB
2.84 MB PNG
>>
>>108015079
What's that ZiT version you run? Some goonmix?
>>
>>108015180
racist
>>
File: 1751858953799109.webm (2.05 MB, 780x720)
2.05 MB
2.05 MB WEBM
>>108015171
>>
>>108015189
Some random fine-tune of turbo found on civitai
>>
>>108015024
Z-image is slower, has a lot more body horror, and im2img doesn't work. The prompts that work fine for turbo won't work Z-image regardless of the number of attempts you make because of the body horror.
>>
Who else likes to gen anime that looks like it came straight out of a screenshot? It couldn't be me!
>>
>>108015206
link or you are lying
>>
File: 1.png (1.81 MB, 1120x1040)
1.81 MB
1.81 MB PNG
>>
why is zib such a failure?
>uh it's base mkay it's used for finetune mkay
where are these finetunes lol?
Chinese culture failed again
>>
>>108015211
z image is not meant to be used, it is neither a base nor a proper tune
>>
>>108015216
https://civitai.com/models/2221503/zimage-turbo-by-stable-yogi?modelVersionId=2585636
>>
>>108015218
>>108015203
>>108015171
I made the og and you fuckers didn't credit me
>>
>promptlets still seething about not having the skill to use ZiB
>>
>>108015225
>z image is not meant to be used
of course it is, just because a model isn't satanically slopped by rlhf on bigchungusaestheticdatasetv2 doesn't mean the model isn't usable
>>108015232
the credits are embedded in the invisible watermark
>>
>>108015180
>kek, /adt/roons are so insecure that they monitor the filenames of each of their anons so they don't escape their general.
LOL.
>>
>>108015230
zit moment
https://files.catbox.moe/0sato9.png
>>
>>108015225
I don't get why they make a "base" after the supposed distillation (did they distill an incomplete model?) and claim that this "base" is a distill of an "omni" version (that's probably also work in progress)?? I guess they want to release whatever incomplete model to generate hype??
>>
>>108015180
This anon is on another level of creepy. I can't imagine what it's like for your girlfriend, if you have one.
>>
File: trAniStudio_00007_.jpg (505 KB, 1248x1248)
505 KB
505 KB JPG
>>108015023
*cough* *cough* please...anons.. please do an old man suffering from aids a favor..*cough* *cough* please use AniStudio...*cough* *cough* *cough*
>>
>>108015241
oh no no no /adt/sisters they found us out. reconvene in the 'cord NOW!
>>
File: z_imageBASEd_00104_.jpg (966 KB, 1264x1760)
966 KB
966 KB JPG
>>108015224
>>
>>108015251
gweilo
>>
>>108015079
How gigantic did it make the breasts? You can answer "not gigantic enough" if you're too lazy to share a catbox and they're not actually bigger than F cups. Asking because I've seen some funny big tit prompting for "hypertits" and other esoteric creative stuff like that and I'm interested in the best possible expression of that vector direction you can get before having to use a lora

>>108015103
As someone who doesn't care about anime as much as they should (I'm here for moreso political reasons than cultural), Wai is fine for a quick goon especially when you don't have a perspective on what good vs bad anime is. It almost certainly knows how to do your fetish in an acceptable aesthetic
>>
>>108015265
i drew this
>>
Is it true that Julien blows the homeless for drug money?
>>
What is the best way to use wildcards in ComfyUI?
>>
>>108015268
Nice strategy but I'm not sharing my workflow with you.
>>
>>108015180
are you the same person who doxxed the anime civitai lora maker asking for proof? you're weird, man, please stick to /adt/.
>>
>>108015103
WAI is better than Noob.
>>
why is he samefagging again?
>>
Why does ani's biggest cocksucker hate /adt/ so much? It is the place his jobless idol always posts in
>>
>>108015249
>zit moment
>https://files.catbox.moe/0sato9.png
My mind is so slopped I can't tell whether this is a good or bad output

I wish wan 2.5 just fucking came out two months ago
>>
>>108015281
Tbh, there's not much to share in the first place.
>>
>>108015230
lmao
>>
>>108015213
sloop
>>
>>108015299
WOW what a zinger, got any more? Nogen loser
>>
>>108015300
is that even legal
>>
File: 564298247439286.png (2.85 MB, 1344x1728)
2.85 MB
2.85 MB PNG
>>
It's moreso about making your anime not look partially 3D than it is making it look like a screenshot lol
>>
>>108015068
cfg4? Even with the distilled model/lora? How's the gen time compared to the normal sampler?
>>
Anistudio will never be taken seriously as a frontend
Not even as an underutilized, niche option
>>
>>108015317
Jeets and roaches don't care
>>
>>108015326
I know you get off me saying this, but I do hate you.
>>
>>108015329
whats a roach?
>>
>>108015348
ai-toolkit dev
>>
>>108015324
I use distill lora at 0.6, 10 steps.
This lets me use CFG. I couldn't honestly tell you if the neg prompt actually does anything tho. I mostly just use it to try and reduce blur and low visual quality.

the new guider basically doubles the inference time since I'm assuming It's generating the audio and video latent separately and then merges them.

It's honestly not that much better than the basic CFG guider but I can see it being helpful if you need strong audio sync.
>>
>>108015307
I don't post images here. I don't want to get bullied by the thread squatting psychos.
>>
>>108015321
Let the adults talk and discuss technology, kid.
>>
>>108015355
this https://github.com/ostris/ai-toolkit ?
does he sell stuff on patreon?
>>
File: ComfyUI_15923_.png (2.64 MB, 1632x1328)
2.64 MB
2.64 MB PNG
>>108015251
It's not hard to understand. They guidance distilled their base model for photorealism (and like all guidance distilled models, far less iterations), and got the jackpot. It was awesome. Looked like absolute banger. For 6b, it was a masterpiece.

But none of this miracle was really cogent to the base, undistilled model. So when everyone asked them to release their base model, they said, plainly: "don't expect anything from our base model, frankly, the guidance distillation works a little too well, we aimed at that, for sure, but it works __too__ well, and you shouldn't believe the base model will perform close to anything like our miraculous guidance distilled model". The base model could be used for better LORA, though, and probably has a lot of explore still.

This is fair.

Then a lot of utter retards thought "DISTILLED MODEL? THE BASE WILL BE MORE AWESOMESSTEST EVER!!!!", despite the guys behind it saying, repeatedly, nah. It doesn't work like that. Also they probably spend 2 months late 2025 trying to unfuck Z-Image base.

This isn't solely on retards, because Flux has trained people to believe every distilled model is a catastrophe, Flux will absolutely and permanently try to hold out base model, Flux will try to fuck yourself at every corner, so they were trained to believe "base model = better = flux will fuck you in the ass in order to not release that base model = everyone is as bad as flux and will tell lies like flux".
>>
>>108015358
I doubt some node tweak would alter the model's logic. Usually time gets doubled because cfg above 1 means it's actively genning two versions at the same time, the 2nd one being a regularization video based off the negative prompt, and compares them at each step.
>>
you are a weirdo
>>
>>108015399
that's a very big claim, where is the evidence of that being the case?
>>
>>108015326
water is wet anon, him and d*b* are in the spite posting arc now.
>>
File: radiance.png (2.3 MB, 1024x1536)
2.3 MB
2.3 MB PNG
>>
>>108015419
i still don't know who debo is, is it the guy who posts that demon boy avatar in sdg? if he is here why isnt he posting his avatar
>>
File: o_00156_.png (1.75 MB, 1280x768)
1.75 MB
1.75 MB PNG
>>
File: radiancex32.png (2.16 MB, 1024x1536)
2.16 MB
2.16 MB PNG
>>108015442
radiance 32 bits such an improvement
>>
>>108015449
Mods started clapping him for avatar posting so he stuck to his filename. He went back to pure trolling after /sdg/ became a ghost town and now he spends most of his free time with the failed dev griefing over the rentry links that existed in 99% of /ldg/ threads
>>
>>108015475
How do you know?
>>
>>108015472
Does it finally not ruin the fine details? Neat.
>>
>>108015391
If Zib is really base, why don't Zib loras work on Zit?
>>
>>108015488
He told me
>>
>>108015475
proof?
>>
>>108015488
Because I've been here as long as you time wasting who fag. Do you retards vibe check praying newfags are around?
>>
>>108015491
because it isn't you animal
>>
>>108015500
i don't quite understand, do you think you can elaborate a bit
>>
>>108015491
You need to bump up the strength, retard
>>
>>108015509
see >>108015499
>>
File: Dream_of_Underwater_IA.png (2.66 MB, 1824x1152)
2.66 MB
2.66 MB PNG
>>
>>108015391
Alright got it, so they did try to unfuck "base" the best they could (be it through some RLHF or whatever). It's fine for what it is
>>
>>108015500
You should be taking your afternoon medicine soon, buddy.
>>
>>108015550
don't you losers ever get tired?
>>
File: 1085987560645472.png (2.59 MB, 1344x1728)
2.59 MB
2.59 MB PNG
>>
>>108015399
>because cfg above 1 means it's actively genning two versions at the same time
The increase is between both Guiders at CFG 4. maybe it adds the CFGs together. or the STG adds some overhead to the gen time. either ways. the new guider for me almost exactly doubles the s/it over using the basic CFG guider.
>>
>>108015399
CFG is dumb, "prompt adherance" shouldn't be a slider, it should always follow the prompt to a T, and negative statements are understood perfectly by most modern v-llms.
>>
>>
so I trained a z-image lora and the results are absolutely dogshit when used with zit
why did you lie to me? strength doest help
>>
REMEMBER EVERYONE: THE LUMINA 2.0 ARCHITECTURE IS SLOW AND GAY IN ITS ORIGINAL FORM AS USED BY NETAYUME, BUT BASED AND GOOD WHEN SCALED UP WITH ALMOST NO IMPROVEMENTS FOR Z IMAGE! A FRIENDLY ORIENTAL MAN TOLD ME SO!
>>
>>108015399
>>108015571
CFG works very simply, 0 means no conditioning (eg, no text embeds, latents) 1 means normal conditioning. People have found that if you scale beyond "1" you have more "conditioning adherence" but if you go too far off, the solutions to this interpolation fall out of RGB space (and looks fried). Now that you understand it's interpolation, it's quite obvious why it takes "twice the amount"
>>
>>108015631
Are you even reading what I'm saying?
>>
>>108015612
>he fell for the "training loras on base for use on ZIT is somehow better than training them on ZIT with the same settings meme"
>>
>>108015649
>>108015612
where is this base?
>>
>>108015640
CFG is basically an "obey me" slider. 0 is random, 1 is normal. Cranking it forces accuracy but "fries" the colors. It takes double the juice because the AI runs two math passes at once to find the gap.
>>
>>108015612
what trainer, what settings?
>>
>>108015667
not spoonfeeding you
>>
>>108015667
onetrainer, default settings
>>
>>108015675
onetrainer doesn't have z image support yet liar
>>
File: o_00161_.png (1.75 MB, 1280x768)
1.75 MB
1.75 MB PNG
>>
>>108015675
Default has lora rank 16? Try higher maybe. The entire model line is weirdly sensitive to ranks and alpha.
>>108015679
You just use the zit settings with base.
>>
>>108015688
no wonder the lora doesnt work LOOOOL
>>
File: 1739691700964529.png (7 KB, 790x95)
7 KB
7 KB PNG
>>108015679
i just did this and it ran. samples looked fine.

>>108015688
yeah rank 16
>>
>>108015417
no cfg = empty prompt
cfg = two passes, (empty prompt - positive prompt) * cfg
cfg + negative prompt = two passes, (negative - positive) * cfg
Its quicker to make a unconditional latent (empty prompt), but it should not take double the time to go from cfg to cfg+neg, only for going from uncond to cond (no cfg to cfg)
>>
>train lora on woman with large breasts
>breasts come out smaller
>train lora on woman with small breasts
>breasts come out bigger
how do i solve this?
>>
>>108015694
Try this mb? My lora came out a while ago and it didn't work too. Or try ai-toolkit with default settings too. That one is at least confirmed to work.
>>
>>108015713
qrd
>>
File: 1769558447453560.png (43 KB, 793x752)
43 KB
43 KB PNG
>>108015717
>>
File: 6645546565.png (38 KB, 1260x132)
38 KB
38 KB PNG
He knows exactly why. Maybe it has something to do with the model not being trained on a lick of copyrighted data (and how so, there are billions out there: >>108004707 ..), and exclusively on royalty free music combined with synthetic Suno slop. Not having at least a high quality 5M+ diverse dataset is inexcusable. We still wait for a different lab to reproduce audio quality of ACEStep 1.5 but without the cuckery hopefully.
>>
>>108015718
hmmm already used float8
>>
>>108015628
Uhm but we are the Klein boys general, so we don't give a fuck?
>>
I'm going through the rentry guides to set this up and I have one question.
I get an output image I like, be that waify, castle, teapot. How do I then take that output and use it as a starting point going forward, like put the teapot on a table, change waify dress, put that castle in space.
Is that done by turning the output into a LoRa?
>>
>>108015628
No, neta is simply dogshit.
>>
File: 049542.png (708 KB, 768x512)
708 KB
708 KB PNG
>>108015729
>the beauty of open source is that the flaws that remain will be actively fixed and addressed by the community
Ah yes because deep neural networks are the kind of thing the community can just improve upon like a github repository where everyone uploads some gradients to be backpropagated and the model goes on getting improved by the community, LMAO.
>>
>imagining liking Zeta model in any of its forms in late january 2026.
>>
>>108015239
It's slow as absolute fuck for a 6B model though
>>
>>108015399
Does anyone even understand Shift? I've got many retards not understanding Shift. It's the simplest concept in the world. I could explain Shift to you if u wanted to learn.
>>
>>108015774
Shift has nothing to do with cfg
>>
>>108015760
If NetaYume V4 is "dogshit" by your standards I don't have high hopes of you being satisfied with anything else either in the near future lol. Nobody is captioning these things with pure booru tags anymore if that's what you want, it's just not gonna happen
>>
>>108015788
Sure, but people have no ideas about either whatsoever. I could explain shift to you if you want to learn. It seems a lot of people don't understand shift
>>
>>108015794
Booru tags can fuck right off.
>>
>>108015774
i want to learn. teach me great master.
>>
Is there any way to stitch wan2.2 videos together in a single workflow? Because even if you give i2v the last frame of the previous video it changes shit in that frame which leads to weird jump cuts.
>>
>>108015814
WAN SVI came out last month
>>
>>108015729
the music cartels are one of the most terrifying things to go up against, they will swallow you in trillion dollar lawsuits
>>
>>108015828
interesting, thanks
>>
>>
>>108015811
nta but its that diffusions and flow matching models are conditioned on timesteps (loosely, the first steps create the base from where details are born and the later timesteps refine it). Hence you can trick the model to generate as if it were a later timestep, this helps if you're using a different noise scheduler during inference or like different aesthetics more (some people say higher shifts makes generations "less smooth")
>>
>>108015612
It's already been said several times before that z-image loras don't work on Zit. Don't waste your time on further attempts.
>>
Sorry, I don't want to be off topic, because I use local models for my post processing commissions but for anime txt2img, I don't think much about it and just use NovelAI. It's fast, and when you're on a tight deadline, having the first step of the process already done is good.
>>
>>108015866
Skill issue
>>
>>108015845
>the music cartels are one of the most terrifying things to go up against, they will swallow you in trillion dollar lawsuits

Don't think it's a big deal in China. Latest models I've seen are quality (and you can tell they used copyrighted stuff from the voice quality matching Suno v5/Udio), but lack output diversity (no idea what causes that, hopefully they'll figure it out eventually). One paper literally has the model spit out Adele in its samples. That's how you do it.
>>
>>108015764
>implying anything else anime with that kind of prompt adherence exists at all on local other than Newbie which is just very similar anyways

Imagine all the complaints you've ever heard about the slowness of NetaYume or Newbie from Forgenigger-type anime genners. Now multiply it by a gorillion if anyone ever trains a Z Base anime model.
>>
File: yuhon ding catgirl desert.jpg (1.41 MB, 3840x2160)
1.41 MB
1.41 MB JPG
>>108015811
Shift is how much your iterations concentrate about high noises vs low noises.

You can think all your "steps" as eating some noise of the final image and producing the final image. If everything was equal, then if you have 20 steps on a new image, each step would "eat" 5% of the noises and produces 5% of the resulting image.

The issue with that, is that the job a model must do at high noise is deeply different from the same job it muse do at low noise job. In high noise, nothing from the image is decided. It's purely random. In low noise, you just need to add some details to the images, you can do it in like three steps tops.

So ideally a lot of tops of a good diffusion model should pause at high noise, eat one percent of the randomness, and think deeply about the rest of the picture. And the details can be like painted in one or two iterations at the ends.

Shift is that. Shift is how much steps will concentrate (and how much eat noise and produce outputs) at the beginning, very high noise image, structurally important image, the structure of your image, vs how it will distribute your steps at the end of your images.

High Shift will focus all your steps in the beginning of your image, which will create one that is structurally coherent, but devoid of details. Low Shift will focus your steps in a more balanced manner, which will create more details at the end, but fumble entirely the overall composition.

That's all. It's not magic. You can play with the parameter if you want.
>>
>>108015456
(you) on the right
>>
>>108015878
We don't care about subhumans genning on 1070s
>>
>>108015225
>>108015391
zib is superior to zit though. like it obviously is.
>>
>>108015877
Another thing is probably why it's hard to catch up is you need high quality covers as well to have voice variety. If they have the artists tagged "by artist" similar to diffusion models then the model can learn to avoid them without tags. Not inevitable, but possible. In Udio 1.0 it's possible to reproduce likeness, but there are differences anyways. All you have to do is avoid using same exact lyrics and you don't have to worry about it.
>>
>>108015913
You mean like 90% of HDG lol? No one loves Forge more than those guys as far as I can tell
>>
>try to generate an image with musubi-tuner while using a lora
>lora has no effect
huh?
>>
File: o_00166_.png (1.86 MB, 1280x768)
1.86 MB
1.86 MB PNG
>>
>>108015779
>>108015790
>>108015806
Have you ever come up with your own diss or do you just parrot what catjak says about you
>>
>>108015915
I can do almost all the stuff I might do with it using even Klein 4B Distilled though, why would I waste that much time per gen
>>
>>108015909
>>108015857
thenks
>>
>>108015922
Another thing that can obviously be done. Take all the HQ copyrighted data, and use a voice changing model for each song. Probably that is precisely what Udio does. So what if it takes an extra $5M?
>>
>>108015068
where is the new workflow for this shit, they didnt post one
>>
>>108015979
>no u
Okay reply to that post a couple more times so it doesn't look like it's just you
>>
>>108015969
use your same workflow, just switch the guider with this one. if you dare... honestly it's hit or miss. but if you do a lot of audio sync stuff it might be useful.
>>
>>108015925
oh shit figured it out. musubi uses a different format, so you have to convert it first with one of their scripts
>>
Anime posters, please use NovelAI. This local struggle is only for non anime users who unfortunately don't have their NSFW model available locally nor SaaS. For us, our problem was fortunately solved a long time ago with NAI.
>>
>>108015811
Shift is just another way of saying training on higher or lower timesteps. It can affect the output of the lora but not enough to turn a crap lora into a good lora.
>>
>>108015942
if you don't want variety and just want a specific photorealistic style, then zit or fk is all you need. if you want to experiment and get something aesthetic, zib is one of your best options.
>>
>>108016026
I accept your concession
>>
>>108015938
Notice that he has been unable to produce a rentry of any value. This is why they both seethe in the void to soothe themselves.
>>
>>108016023
so should i use shift for training as well or not?
>>
>not training on bf16
>WAAAAHHH IT DOESNT WORK ON TURBO
every time
>>
>>108016028
ZIB isn't more varied or better at styles than any version of Klein. It outright misses a lot of prompt elements in English, you can tell it was probably most captioned in Chinese.
>>
>>108016046
>you will continue being a retarded nigger in the thread every day
I'm not that one screeching about the rentrys in OP and troll baking every time :] but keep no youing
>>
>>108016057
but my zit loras work perfectly fine even when trained on float8?
>>
Once /ldg/ reached the 1 year mark, it's been nothing but tears from these two non stop. Did it really get that bad for them?
>>
>>108016057
Even with Chroma, loras trained on full BF16 Flux dev without any quantization jeetery actually generally did work well. It was rather ones trained on FP8 Flux Dev and shit that didn't.
>>
>>108015942
why would you believe, trust, or given the flux history of Black Forest Lab (I.E Here are wolves that will eat you alive and we hate you, like the forest) would you even once trust or believe in a Flux monster showing you their canines, happy you will be a lamb to the slaughter

why are you believing the wolves of open weight models
>>
>>108016076
Chorma is one of the worst performing models when it comes to adhering to loras
>>
File: trAniStudio_00014_.jpg (394 KB, 1264x1232)
394 KB
394 KB JPG
>"You'll give me another star after this and tell your friends about AniStudio, won't you young man?"
>>
>>108016055
No because you wouldn't be able to reliably tell the difference between a lora trained with shift versus a lora not trained with shift. The randomization used by the trainer has a much larger effect.
>>
>>108015909
what a retarded image, those nipple things do NOT go on top of those bras, fucking retard
>>
>>108016090
lmao
>>
>>108016090
how do you even prompt that dirty crotch
>>
>>108016066
no they dont
>>
File: tmpuz1i_kjw.png (918 KB, 1280x1600)
918 KB
918 KB PNG
>>108016021
Fuck off you disgusting fucking kike (((SAAS))) shill I'll never give you so much as a cat shit encrusted penny.
>>
File: Flux2-Klein_00096_.png (2.72 MB, 1120x1440)
2.72 MB
2.72 MB PNG
>>108016063
>ZIB isn't more varied or better at styles than any version of Klein
it is though...

look, here's fk 9b's attempt at >>108015915. it's total slop in comparison.
>>
File: o_00169_.png (1.65 MB, 1280x768)
1.65 MB
1.65 MB PNG
>>
winrar is more profitable than AniStudio
>>
>pull
>the fennec cock pozzes me
>>
>>108016079
Flux base. It was, unfortunately, the only recourse the community had at the time. And unfortunately, a Flux base is pozed to high heaven. People tried to unpoze it. Many millions of dollars trying to unpoze it. Unsuccessfully.

Be that, every people who tried to unpoze Flux will, one day, be joined in the greatest Heavens, under the gaze of God. Be that, every people who pozed Flux, will be sent unto damnation. And be that, every people who tried to unpoze Flux, despite it all, be sent into Purgatory. They were fighting the devil. They didn't know Black Forest Lab were the province of demons. They tried their best. God take their souls from Purgatory: each of them fought Flux, and each of them are a hero, by virtue of fighting Flux.
>>
File: ZIB_Output_272661.png (1.89 MB, 1024x1024)
1.89 MB
1.89 MB PNG
>>
>>108016128
sdxl
>>
>>108016077
IDK what you even mean by this, what does believing anything have to do with my own IRL use of the models
>>
>>108016132
prompt?
>>
>>108016129
What's the prompt? Your ZIB guy is missing a finger also lol
>>
ani really should do a twitch live stream in which he explains the whole situation
>>
>>108016153
>millions of dollars
x to doubt
>>
File: o_00170_.png (1.81 MB, 1280x768)
1.81 MB
1.81 MB PNG
>>108016176
'knolling photography of minerals and gems, organized neatly, isometric view, vivid symmetrical lighting. stark. gloomy. glowing in the dark.'
obv the prompt wasn't quite effective
>>
File: ZIT_Output_72772.png (929 KB, 1024x1024)
929 KB
929 KB PNG
>>
>>108016199
i would buy komfykredits to see that kek
>>
>>108016129
hands are worse but i kinda like it equally. to me it's not obvious either prompt is a bad representation of what was probably asked.
>>
>>108016122
pretty sure its a faceswap with edit model
>>
>>108016224
aywatzupyouguyz, Ani here back atcha with another video, today we're going over The Big Sitch, yeah, my long time Animaniancs out there know the one
>>
>>108016215
i like you
>>
>>108016235
are you saying there is a professional photo of a dirty crotch?
>>
>>108016079
can't confirm, lora trainings worked well on non-Radiance Chroma. haven't trained for Radiance Chroma yet.
>>
>>108016223
it do be like that
>>
>>108016284
ah you are trolling, i almost believed you asshole
>>
think i'm done with lora training for now. waste of time for klein and zim.
>>
>>108015023
a certain "someone" thinks she is the thread authority
>>
File: 1763350386927740.png (2.98 MB, 1024x1235)
2.98 MB
2.98 MB PNG
>>
>>108016267
no because it isn't dirty, its just darkened, browse the blacked image gallery, pretty sure you can find the exact image he used to swap faces
>>
>>108016168
Flux has an extremely long history of fucking with the community. They released their 1.0 dev distilled (i.e absolutely emasculated) model only after a lot of and back and forth with the community arguing they didn't have the right to not release the model. Flux, after a while, was forced to admit they couldn't. While they did so,they refused to release their real true models while pretending to answer the community and they absolutely purposefully tanked their own free model in order ot make them worse.

In response, several communities literally created several extremely complex schemes in order to free the Flux models, Chroma being the last ones. But Chroma is a base from Flux, so it seems unfortunately garbage from the get go.

There are no companies, Chinese, American, French, German, or whatever, that fucked the community more, and more powerfully, than Flux, and Black Forest Lab. There are no companies that tried to escape all their contractual obligation worse than Black Forest Lab. There are no companies that are activelly tanking the community worse than Black Forest Lab. They are, objectively, propably the purest capitalist evil in the Diffusion Model space right now. They have no excuses. They are, purely, evil.
>>
File: wtf.png (139 KB, 212x477)
139 KB
139 KB PNG
>>108016306
>just darkened
nigga this isn't normal
>>
File: 1747212171113753.png (40 KB, 339x265)
40 KB
40 KB PNG
>they
i knew there was something off about this guy
>>
>>108016074
Looks like he's broken and repeating himself over and over again kekkk I wonder what specific put him over the edge this time
>>
>>108016318
The skin is irritated, probably folliculitis, nothing out of the ordinary, you can show the image to a dermatologist to ask if you want.
>>
>>108016308
imagine trusting companies with "Black" in their name
>black rock
>black stone
>black forest lab
>black berry
>>
>>108016308
>purest capitalist evil in the Diffusion Model space right now

They have no reasons to give us SOTA models. They could put Klein behind an API even if it's snakeoil compared to NBP. They still give it to us anyways.
>>
File: o_00172_.png (1.24 MB, 1280x768)
1.24 MB
1.24 MB PNG
>>
>>108016327
>>108016340
stop trolling
>>
>>108016342
That being said, I don't trust the company, as in, we can't rely on them forever, but their models are still the best by a margin.
>>
>>108016308
racist moment
>>
>>108016347
benchod
>>
>>108016366
>discussing about bbc is legit
troll
>>
>>108016337
Both of them
Ani and D*bo have been friends since the beginning and it's obvious they are close. We're caught in a unfortunate situation of two mentally ill anons making each other worse. I still find it funny that ani protected debo and backed /sdg/ when this thread was first created only for him to come here feeling like he's owed something.
What bothers me is the ritualistic nature of these actions and that they both have the same habit of fucking up and getting exposed. It's not even like it's over the same stuff they are both delusional narcissist that keep getting exposed for their short sighted retardation.
>>
>>108016348
Postinflammatory hyperpigmentation caused by pseudofolliculitis barbae, both very common in black people
https://amokeskincare.com/blogs/amoke-skincare-blog/are-you-struggling-with-post-inflammatory-hyperpigmentation
https://plasticsurgerykey.com/pseudofolliculitis-barbae-4/
https://basicmedicalkey.com/pseudofolliculitis-and-acne-keloidalis-nuchae/
>>108016366
You should also consider the possibility that he is the one trolling
>>
>>108015002
There's like a whole thread of discussion in this area.
>>
>>108016388
You should write a book about this imaginary situation.
>>
>>108015087
That's not how shoes work. son i am disappoint
>>
>>108016404
>>108016407
Please contest with anything in the rentry links. Does it hurt more to know that each of them are the result of multiple anons documenting retard behavior over the years?
Look at each of you. Both of you are jobless neets that have lost everything pursing some pointless drama, even when the person you hate stops posting you keep doing stupid shit and cause more and more anons to hate you.
>>
>>108016366
actual discussion happens on discord. these threads are just to fuck around
>>
>>108016308
IDK why you think I don't know or wasn't there lmao.

The first part of your comment is literally complete made up bullshit though, Flux.1 Dev was an out of nowhere drop that no one knew about ahead of time.

TLDR you're either a weird troll or a Chinese shill
>>
File: owo.png (2.31 MB, 1088x1088)
2.31 MB
2.31 MB PNG
>>108016426
all threads are schizos threads as far as I can see
>>
>>108016445
because you are looking at a mirror
>>
File: 55458445584484.jpg (98 KB, 584x222)
98 KB
98 KB JPG
>>108015729
Hopefully the Qwen music model takes itself a bit more seriously. The lead YuE guy is working there now (not sure if it's on the music model itself) but that's also coming soon https://xcancel.com/JustinLin610/status/1982052327180918888
>>
total localkek meltdown
>>
>>108016308
>blatantly lying about things that everyone remembers clearly
>>
File: 27048.png (1.45 MB, 832x1216)
1.45 MB
1.45 MB PNG
>>108016454
a *black mirror*? POGGERS REFERENCE
>>
>>108016460
SAASChads stay winning. You WILL pay ByteDance to use SeedDream over glorious API
>>
>>108016457
can you imagine being a young chinese like him, life on easy mode
>>
>>108016473
gen thief alert
>>
>>108016474
What does SAAS mean?
>>
File: o_00174_.png (1.72 MB, 1280x768)
1.72 MB
1.72 MB PNG
>>
>>108016493
this is so close to being good, fix the fish
>>
>>108016490
Shills Are Asian Saar
nah Software As A Service lol
>>
>>108016499
Haha good joke xD
>>
File: o_00175_.png (2.32 MB, 1152x1152)
2.32 MB
2.32 MB PNG
reminder that the only meaningful thing that happened on 4chan was the nai leak, any other thing is from x/reddit/discord, there's nothing here, no employee of any company giving updates, no model trainer sharing information, the only ones here are schizo social outcasts and you should act accordingly
>>
>>108016464
They did release a SOTA ASR model, so it's probably not far.
>>
>>108016535
catbox?
>>
>>108016547
i dont have a cat
>>
>>108016420
You sound butt hurt.
>>
>>108016529
>ASR
qrd
>>
File: 13566.png (97 KB, 256x256)
97 KB
97 KB PNG
>>108016547
>>
New bread
>>108016554

>>108016554

>>108016554

>>108016554

>>108016554



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.