[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>108090101

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>108094778
stop including your cuck porn and schizobabble
>>
File: Flux2-Klein_01250_.png (2.36 MB, 1616x896)
2.36 MB
2.36 MB PNG
>>
>>108094796
based overzealous thread quality maintainer
>>
Well, thanks to Comfy for releasing Anima, /ldg/ now seems a lot more artistic, much more fitting for a weeb website in general. When VRAMlets win gens quality and variety are better.
The WAN and Qwen Image eras, where it was the same two anons with powerful GPUs making Mikutest DeusExtest and celeb 5s videos, were awful and boring.
>>
>>108094821
top lels, lore behind it?
>>
Blessed thread of frenship
>>
>>108094858
nobody is using it in the anime thread and the shill keeps making shit and posting it here expecting us to worship him for destroying open source models
>>
Comfy dragged into my house and patted
>>
Is he still upset that Comfy didn't give him the 1mil to make an animu model?
>>
>>108094866
>destroying open source models
You don't have to use it. You can act like it doesn't exist and keep using SDXL models. Or you can spend 100 grand and train your own Anima and then give it away for free.
>>
>>108094872
isn't comfy taking revenue share from the model?
>>
>>108094872
well... hes also upset that comfy chose yoland as his business partner instead of him. like a scorn ex buttbuddy or something.
>>
>>108094890
then why are you so aggressive against an open anima model that is better?
>>
File: 37697740459341.png (2.41 MB, 1184x1744)
2.41 MB
2.41 MB PNG
>>108094821
I believe her.
>>
>>
>an open anima model that is better?
>still doesnt exist
Wake me when you actually publish the model, Julien.
>>
>>108094897
proof?
>>
>>108094899
I don't know what schizo you think I am or why you're assuming I'm against that. By all means make a completely open apache 2 licensed Anima. The community would be better off for it. I just don't think it's gonna happen.
>>
>>108094921
ani is a better candidate. he makes kino and smut so he knows what's best
>>
File: Makoto_Bianca_027.png (1004 KB, 896x1152)
1004 KB
1004 KB PNG
>>
File: 1743445454185273.jpg (674 KB, 2336x1876)
674 KB
674 KB JPG
If you made a character lora for klein where they likeness felt off, try combining the lora with one or more reference images of the same person.
>>
>>108094930
>I just don't think it's gonna happen.
No one does desu.
>>
>>108094866
>nobody is using it in the anime thread
kek you say that like it means anything
>>
File: 1748706631210183.png (1.59 MB, 1704x792)
1.59 MB
1.59 MB PNG
>>108094963
>>
File: 1744876968091142.jpg (516 KB, 1536x1536)
516 KB
516 KB JPG
slopped a bit
>>
>>108094963
>>108094991
What's the point of an edit model if you need a LoRA for likeness?
>>
File: 1747182085296217.png (3.58 MB, 1728x1248)
3.58 MB
3.58 MB PNG
mr sasuga himself
>>
>>108094941
Lucky boy.
>>
File: 1742862539967240.png (3.67 MB, 1664x1312)
3.67 MB
3.67 MB PNG
>>
File: 1769073749183083.png (3.69 MB, 1824x1216)
3.69 MB
3.69 MB PNG
>>
>>108094963
>>108094991
So now I need to double bloat? Kino nostalgic Pamela Anderson my first cums.
>>
File: 1744217800903535.jpg (453 KB, 1536x1536)
453 KB
453 KB JPG
>>
File: 1760795138969700.jpg (2.64 MB, 4608x1536)
2.64 MB
2.64 MB JPG
which way, 1girlstanding bros?
>>
File: 1761267814059075.jpg (685 KB, 2336x1876)
685 KB
685 KB JPG
>>108094991
>>
File: Flux2-Klein_01339_.png (2.33 MB, 1920x1072)
2.33 MB
2.33 MB PNG
>>
what is the best model to make hot cosplayer girls to masturbate. I want something that can make nice thighs and ass, perhaps tiddies.
>>
>>108095020
Number 2 looks sloped
>>
>>108095020
1, pure, 2 thot, 3 mix
>>
>>108095027
Is she alive? Remeber her blowjob videos from Ares
>>
>>108095041
she played a mummy in the new naked gun movie
>>
>>108094515
>An OpenAI employee made tagexplorer...
You can set your git email to a fake one
>>
>queue up an acestep batch this morning before running errands
>come back to see its only half done
>execution time for each prompt is varying wildly
>set duration to random after generate like a retard and it has made multiple 10 minute songs about something supposed to be 2min
>>
AceStep anons, please join:
>>108095075
>>108095075
>>108095075
>>
>>108095095
oh they made a new one? was using the old one but saw it died when i got home
>>
File: z_image_bf16_00213_.png (1.68 MB, 920x1608)
1.68 MB
1.68 MB PNG
>>
File: WORKFLOW__00056_.jpg (699 KB, 1456x2048)
699 KB
699 KB JPG
>>
>>108095102
Pretty good. Is this Klein9B?
>>
File: 742668097575703.png (2.43 MB, 1440x1440)
2.43 MB
2.43 MB PNG
>>
File: o_00325_.png (1.95 MB, 1152x896)
1.95 MB
1.95 MB PNG
>>
>>108095072
i can't tell you how many times i queued up a batch of files only to find they all came out shit because of a wrong setting.
>>
>>108095095
wasn't there a local voice general a long time ago? you should revive that
>>
File: o_00326_.png (1.85 MB, 1152x896)
1.85 MB
1.85 MB PNG
>>
File: 813566357730734.png (1.74 MB, 1024x812)
1.74 MB
1.74 MB PNG
>>
>>108095183
Kinematic foreground blur
>>
>>108095020
3, needs to just be slightly younger
>>
File: o_00327_.png (1.92 MB, 1152x896)
1.92 MB
1.92 MB PNG
>>
>>108095159
That general was usually baked by a single opinionated autist that was obsessed with finding out how ElevenLabs works, and he vanished once local got "okayish" TTS models. There isn't much interest about TTS around here since every thread on the subject dies quickly as there isn't much to show or discuss. AceStep/musicgen at least is a new thing and it would be useful to discuss how to get the most out of it
>>
File: o_00328_.png (1.9 MB, 1152x896)
1.9 MB
1.9 MB PNG
>>
>>108095102
well faggot? >>108095111
>>
>>108095020
1>3>2
>>
File: 1ZGD-h5IZIU.jpg (633 KB, 2258x2160)
633 KB
633 KB JPG
Is there a way to edit a source video and replace the character with the one in a reference image? Like the hitler walking on stage meme?
>>
alright. i decided to hop off z-image. the inference times... are mind numbing.
can i get a qrd on klein? distilled worth using at all? its not bad.
4b or 9b? how about lora training?

>>108095027
what is it that you're doing exactly?

i have many questions. there should be a rentry for this.
>>
File: 1563932145047.jpg (10 KB, 325x325)
10 KB
10 KB JPG
>>108095339
>the inference times... are mind numbing
>>
>>108095125
Can I get uhhh Ana de Armas on the right and uhh Samara Weaving on the left?
>>
>>108095339
>what is it that you're doing exactly?
just feed it a reference image like you normally do for i2i. describe how you want klein to use the reference image, and then just follow up with your normal 1girl prompt.
>>
>>108095339
9b is fine for basic editing and if your standards aren't the highest then regular t2i MIGHT be serviceable.
>>
what is the best model to make hot cosplayer girls to masturbate. I want something that can make nice thighs and ass, perhaps tiddies.
>>
File: 107304876388990.jpg (2.59 MB, 2304x1792)
2.59 MB
2.59 MB JPG
>>
>>108095356
are you autistic, check out /anime diffusion general/, if you're not then check out chroma
>>
im looking for inspiration
>>
>comfy talked in /ldg/cord
comfy talked in /ldg/cord
>comfy talked in /ldg/cord
IT'S HAPPENING!
>>
File: 56545.png (2.01 MB, 1728x960)
2.01 MB
2.01 MB PNG
>>108095339
klein is good, 9b, distilled, training is just as shit/hit or miss (at least from what I'v seen, haven't tried myself so don't know if its just a skill issue)
>>
>>108095365
i said cosplay dude. I want real looking women, not anime...
>>
>>108095375
>/ldg/cord
no such thing
you're simply a nigger
goodbye
>>
>>108095375
omg post the screenshot!!!
>>
File: 688308361196148.png (2.31 MB, 1440x1440)
2.31 MB
2.31 MB PNG
>>108095348
I guess?
>>
>>
can the anime fags move to their containment please?
>>
File: 279467.png (3.24 MB, 1248x1248)
3.24 MB
3.24 MB PNG
>>108095502
tranime is 4chan culture, chud
>>
>>108095480
Yeah, NetaYume v4 has a better aesthetic imo, and Newbie is heavily underbaked. What I like about Anima is how universal it is to prompt, since Newbie and NetaYume’s prompt syntax is awkward.
>>
File: IMG_1946.png (1.58 MB, 1216x832)
1.58 MB
1.58 MB PNG
>>
File: WORKFLOW__00058_.jpg (862 KB, 1456x2048)
862 KB
862 KB JPG
>>108095102
>>108095321
This is Z-base
>>
File: 621691549665930.jpg (2.78 MB, 1664x2432)
2.78 MB
2.78 MB JPG
>>108095339
It's pretty good.
>>
File: 00000-2074038299.jpg (1.73 MB, 1792x2304)
1.73 MB
1.73 MB JPG
>>
wtf... why are random posts getting deleted. is some fag reporting anime gens?
>>
File: zit_00007.png (1.71 MB, 960x1536)
1.71 MB
1.71 MB PNG
what's the current meta for making videos? we using wan 2.2 still?
>>
>>108095785
you deleted them yourself to claim someone reported it (and the mods went through with it)
>>
>>108095801
could you be more specific? there's several tools for different jobs
>>
Training the ZIM lora at 1e-4 and Timestep shift 1.33 resulted in much better likeness, close but still no cigar. I guess I'll retry with a higher timestep shift.
>>
>>108095339
Lora training for Klein is very bad.
>>
File: file.png (1.29 MB, 896x1152)
1.29 MB
1.29 MB PNG
>>108095480
nice gen desu
>>
Anima has a stronger grasp on western styles over Eastern, it's pretty clear when some artist automatically give way better hands ect, same issue with the other hybrid models so move accordingly.
Also token weighs go into full numbers like 1-5 not really 1.5-2. Artist more sensitive but the eastern artist seem really weak vs western stuff so far, you can work around it just be mindful. Also score tags in negs make things worse from what I can tell but I have a feeling you might need to fish score to get the artist to show better if not worse.
I'm new to swarmUI and the fucking adetailer is bugged so this is basic two pass gens.
>>
>>108095809
trying to get into it from images, i just want to do i2v, maybe up to 10s?
>>
>>108095813
>I guess I'll retry with a higher timestep shift.
isn't there a correct value? lmao even more hyperparameters to tune
>>
>>108095823
>isn't there a correct value
likely yes but no one knows it yet
>>
File: ZImageBase_Output_125515.jpg (2.31 MB, 1536x1536)
2.31 MB
2.31 MB JPG
Trained a Tim Jacobus style lora for ZIB
>>
>>108095813
>>108095823
There's no correct value. It just becomes more img2img when you shift. Unless you can control all of the RNGs, it's hard determine if changing your settings would make any difference.
>>
File: 247750634804028.png (648 KB, 896x1152)
648 KB
648 KB PNG
>>
>>108095815
not really, most of the Klein loras on Civit are pretty good
>>
>>108095818
what's wrong with the adetailer? you just do <segment:yolo-Anzhc Face seg 640 v3 y11n.pt,0.35,0.6> {prompt}
>>
what are those american jail things called, it's a placard with their name on it, or a serial number, or something, they hold it during mugshots
>>
>>108095842
oooo nice
>>
>>108095353
9B Distilled is better than Z Image in every way
>>
>>108095852
>>
>>108095842
Link?
>>
>>108095339
The distilled is mostly better and easier to use, the Klein Bases don't have any SFT or RL training
>>
>>108095827
>likely yes but no one knows it yet
It should've been in the model card
>>108095847
>There's no correct value. It just becomes more img2img when you shift.
huh? This may be a side effect but that is not what it was made for https://huggingface.co/blog/MonsterMMORPG/decoding-the-shift-and-diffusion-models-training
>>
>>108095842
another
>>
>>108095860
>in every way
kekd
>>
>>108095874
>It should've been in the model card
yeah it should've
>>
>>108095851
I'm getting a cuda dsa error
I'll try again it just keeps failing on me
>>
>>108095887
though now that I think about how it works the timestep shift for loras is likely different than for fine-tuning anyway so... yeah, another parameter to tune
>>
File: 478260455908418.png (1.82 MB, 1344x1728)
1.82 MB
1.82 MB PNG
>>
>>108095874
Nothing in the article contradicts with what I said. If you introduce a bias in the timestep, you'll reduce the ability of the lora to generate likeness. That would reduce overcooking but also means your lora will work better when doing img2img.
>>
>>108095902
As far as I can tell we basically know nothing about Z Image training whatsoever, the only thing that's fore sure is that what worked for Turbo doesn't work for Image. I'll try out a a few more things and suggestions I've seen in the onetrainer discussion then call it a day and then wait until someone else finds out more
>>
>>108094963
I tried that before. Just using the reference image produced better results than using the reference and lora.
>>
>>108095486
>>>/g/dalle
>>
i have a 12yo computer whats the minimum amount of money i need to spend to get a decent enough machine that can run this shit and make pics locally? is $600 enough?
>>
>>108095989
$3.50
>>
File: 926.png (2.38 MB, 960x1344)
2.38 MB
2.38 MB PNG
>>108095921
>we basically know nothing about Z Image training whatsoever
so much for "open" models, welp, better than nothing that's for sure
>>
>>108095989
2-6k because gpu and ram prices have skyrocketed and you'll need a new motherboard platform
>>
>anima hyped to hell
>author no longer feels the need to continue training it
>>
>>108095989
I'll give you $10 if you let me smash it with a bat
>>
>>108096003
>saying random retarded shit
>>
>>108095989
You could technically get away with a 12gb 3060 but some newer speedups and gimmicks are locked to 40XX or newer.
>>
>>108096001
>so much for "open" models, welp, better than nothing that's for sure
I guess someone will find out eventually but it sucks, with ZIT we know how to train it within a couple days of it coming out, but this, idk how this shit works at all
>>
>>108095818
Nice, you can use YOLO models <segment:yolo-face_yolov8m-seg_60.pt,0.3,0.25> {prompt}
This after <refiner>
And pic rel is similar to the Adetailer tab of Forge
>>
>>108096033
Got it, i'll try again lol
>>
>>108095183
>fag
nick fe moment
>>
>>108095921
I trained a Z-image lora using same settings and dataset I used for a Zit lora. The results were more diverse but mostly the same.
>>
File: o_00336_.png (1.54 MB, 896x1152)
1.54 MB
1.54 MB PNG
>>
>>108095981
Thanks, I know, but I mostly use local models primarly, so I’ll stay, just sharing a cool gen while Klein loads.
>>
>>108094963
i tried it on a somewhat good lora and it actually improved the results further.
i also tried it on a borked lora and it improved the results as well, but could not salvage it enough to be usable.
>>
>>108096096
yeah i think this is the 9b meta for sure
>>
>>108094993
It's not a very good edit model is the thing it's just the most lightweight one right now
>>
>>108096033
>>108095851
Ever encounter this on swarm?
ComfyUI execution error: CUDA error: no kernel image is available for execution on the device
Search for `cudaErrorNoKernelImageForDevice' in https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html for more information.
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.
Never had this issue on other UIs but this only happens when I use the adetailer segmentation here. Obviously I have cuda and whatnot installed
>>
>>108096063
I tried that too at first, same settings, dataset, but the results are abysmal.
>>
>>108095881
cool
>>
>>108096191
thanks
>>
damn klein is... dank...
>>
>>108096199
uh thank you for impersonating me to thank them i guess kek
>>
>>108096119
>not a very good edit model
yeah it is lmao
>>
File: Flux2-Klein_01345_.png (2.11 MB, 1616x896)
2.11 MB
2.11 MB PNG
>>
>>108096220
if you dont really care about style or likeness then sure
>>
>>108096229
an issue of skill
>>
>>108096233
im sure you meant to reply to the anon using a lora for likeness >>108094963



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.