[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


[Advertise on 4chan]


Discussion of Free and Open Source Diffusion Models

Prev: >>108104750

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl
https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
It's so over
>>
CHROMACORD RAUS
>CHROMACORD RAUS
FURTROONS RAUS
>FURTROONS RAUS
SHILLS RAUS
>SHILLS RAUS
>>
File: 105665937814290.jpg (3.66 MB, 1664x2432)
3.66 MB
3.66 MB JPG
>>
>>108110995
so true, only shills with permit like anima get the pass :P
>>
File: 1204001anima-preview.png (3.04 MB, 1280x1552)
3.04 MB
3.04 MB PNG
>>108111004
>anime model that can follow basic instruction and styles is a shill model.
Meds?
>>
>defence forfce worth 1M
hmmmm
>>
>>108110980
thx 4 fagollage and bake
>>
File: radiance_x32.jpg (105 KB, 736x1280)
105 KB
105 KB JPG
>>
>>108110980
Ty for baking anon
>>
how does tranjak keep making his already awful gens worse every time
>>
>>108111056
jealous?
>>
File: z_image_bf16_00310_.png (2.53 MB, 1088x1088)
2.53 MB
2.53 MB PNG
>>
Klein for editing
Z image for t2i non anime
Anima for anime

Simpleas
>>
okay just doing these identity swap tests and so far:
1. nanobanana pro - just werks. literally no mistakes
2. seedream - not bad, but loses likeness a little bit
3. flux klein - can't follow the prompt very inconsistent results - does strange perspective things
4. flux pro - does an indentity swap, but the new person is completely random
5. nanobanana - completely fails
>>
Did a few Qwen Image 2.0 vs. Flux Klein 9B Distilled comparisons here:
https://www.reddit.com/r/StableDiffusion/s/Zyqa2DUfuB
>>
File: 903937244686875.png (2.27 MB, 1024x1024)
2.27 MB
2.27 MB PNG
>>108111090
That makes sense to me.
>>
File: Flux2-Klein_01287_.png (585 KB, 704x768)
585 KB
585 KB PNG
>>108111111
>>
>>108111111
What's the prompt for your test? You can't use super vague Kontextesque prompts with Klein, it won't work
>>
>>108111037
I'm still experimenting with Z Image so far and I've finally gotten to the point where I can a lora is not total garbage. LR is 1e-4 but it feels like what makes or breaks it is the timestep shift, the attempts at 1.0 were a disaster, at 2.0 it's starting to look serviceable, I increased by 0.33 between attempts and it improved accordingly. Works decently on ZIT as well.
>>
>>108111131
>High-end fashion editorial. PERFORM A FULL IDENTITY SWAP. Take the EXACT FACE, facial features, eyes, nose, hair, skin tone, and head of the person in the FIRST IMAGE and place them onto the person in the SECOND IMAGE. The final result must be the person from the first image. Ensure the identity of the person from the first image is perfectly preserved and recognizable. The body type and physical proportions should also match the person in the first image. The clothing, pose, lighting, and background MUST stay exactly as they are in the second image. The final result must look like the person from the first image is wearing the outfit from the second image in the same environment. Do not change the dress or the background.

tried a couple of other ones as well, it was too all over the place
going to put seedream through more tests
>>
>>108111114
Where do I use Qwen 2? It's not on Fal.
>>
File: ComfyUI_02108_.jpg (294 KB, 993x1489)
294 KB
294 KB JPG
okay i guess it is time to stop posting for real
5+ times and i can't solve this captcha because it is broken or something
i don't have time for this bullshit
>>
File: anima__00900_.png (1.46 MB, 832x1216)
1.46 MB
1.46 MB PNG
>>108111171
damn bro, sounds rough, sorry to hear about that
>>
File: Malware.png (101 KB, 1248x693)
101 KB
101 KB PNG
A lesson for everyone
Some gullible idiot on the /ldg/cord installed AniStudio on their PC out of pity
>>
okay seedream tests:
2/5 very good
2/5 random person
1/5 mostly preserves the wrong identity
unfortunate. i guess i'll stick to nb pro for now.

more prompt tests on klien just results in random parts of both people and environment being mashed together. gg.
>>
>>108111171
what model is this?
>5+ times and i can't solve this captcha because it is broken or something
for me it was the new one where you are supposed to find the difference between outputs and not within a single output unlike the other ones so far
>>
File: 1958001anima-preview.png (2.1 MB, 1280x1552)
2.1 MB
2.1 MB PNG
>>108111305
Why the fuck are you posting things from the discord?
Trying to false flag?
>>
File: z_image_bf16_00314_.png (1.79 MB, 1088x1088)
1.79 MB
1.79 MB PNG
>>
File: Anima_00112_.png (1.12 MB, 832x1296)
1.12 MB
1.12 MB PNG
>>
https://files.catbox.moe/gwyysy.png
>>
File: Malware2.png (36 KB, 709x507)
36 KB
36 KB PNG
>>108111305
Nobody wants to try AniStudio
>>
File: ComfyUI_temp_vdoco_00074_.png (2.77 MB, 1824x1248)
2.77 MB
2.77 MB PNG
https://files.catbox.moe/9xaz22.png
>>
>>108111319
it's z base, still figuring out the samplers
basic euler plus normal with big schizo negative prompt seems to work at least
>>
File: ComfyUI_temp_kyrpm_00014_.png (3.57 MB, 1824x1248)
3.57 MB
3.57 MB PNG
https://files.catbox.moe/qt6fb5.png
>>
>>108111370
while hilarious I don't appreciate you dragging this out here
>>
>>108111149
Do you have your sample images? This prompt is like, way too far in the opposite direction of complexity lol
>>
File: ComfyUI_temp_kyrpm_00028_.png (3.26 MB, 1824x1248)
3.26 MB
3.26 MB PNG
https://files.catbox.moe/3i1qhw.png
>>
>>108111161
It's the default image generator in Qwen Chat now
>>
File: ComfyUI_temp_kyrpm_00037_.png (3.25 MB, 1824x1248)
3.25 MB
3.25 MB PNG
https://files.catbox.moe/yxgk42.png
>>
File: Flux2-Klein_00691_.png (445 KB, 704x768)
445 KB
445 KB PNG
gay boy discord drama hours
>>
>>108111383
i do desu
>>
>>108111305
>>108111370
How can someone larping as dev be that bad at coding kek
It's a rather trivial project and i'm really wondering how you can fuck up everything all the time with each "release" happens (if you call uploading mysterious binary blobs as release)
>>
>>108111370
I don't get it. it's two people that want to try in your image
>>
>>108111403
reminds me of Tree of Life film
>>
>>108111383
i do, because it's discord shitters that ruin every thread, a few weeks (months?) back when the trolling was the worst of the worst with the bots, d*bo and quokka posted images as if to say "i am here". these fuckers keep ruining threads while giggling in their discords and this applies to you too. 4chan is in this state because of this shit.
>>
File: ComfyUI_temp_kyrpm_00041_.png (3.6 MB, 1824x1248)
3.6 MB
3.6 MB PNG
https://files.catbox.moe/mfd4dl.png
>>
File: 1770730096107608.jpg (160 KB, 1079x1180)
160 KB
160 KB JPG
They will release Opus3?
>>
>>108111409
idk. maybe you should make a UI we can try. id rather that than whatever gaycord drama you want to stir up
>>
>>108111421
is it good? The no
>>
>>108111421
There's more chance of me shitting out an RTX 6000
>>
>>108111415
While I'm sure people like that have a discord that place has nothing to do with that
>>
>>108111385
lel, i tested simpler ones as well.
if you want to give it a go just pull some random images from zara and use a random ig girl image as user image.
the use case here is for ultra-normies so it needs to just work 9/10 times and i don't think any of the models except nb pro are going to be robust enough to do this for scenarios where the input images could be from anywhere.
i haven't tested gpt image, but i think that's more expensive than nb pro
t bh i think by june/july there'll be a few more models that'll just-werk.
>>
File: ComfyUI_temp_kyrpm_00129_.png (1.98 MB, 1248x1824)
1.98 MB
1.98 MB PNG
>>108111412
Terrence Malick is pure kino
https://files.catbox.moe/w55nph.png
>>
>>108111305
>>108111321
>>108111370
kill yourself niggerjak
>>
File: 00005-3080378337.jpg (1.8 MB, 2560x1536)
1.8 MB
1.8 MB JPG
>>
File: ComfyUI_temp_kyrpm_00021_.png (3.42 MB, 1248x1824)
3.42 MB
3.42 MB PNG
https://files.catbox.moe/vkn83f.png
>>
>>108111443
this
>>
>>108111421
Another Microsoft Clippy tier office assistant model I guess (gpt-oss et al)
>>
>>108111305
>>108111370
Hilarious to think that this is the guy bothering actual devs like illyasviel and crying at github support to get usable uis deleted
>>
File: ComfyUI_temp_kyrpm_00112_.png (3.38 MB, 1824x1248)
3.38 MB
3.38 MB PNG
>>108111445
Love this image a whole bunch, congrats
https://files.catbox.moe/7zaww7.png
>>
>>108111450
>>108111443
Crazy how the guy caught posting discord logs thinks another person would do what he does when the reality is a ton of people hate him
>>
>>108111448
>>108111403
love these, reminds me of my Hitler Jugend days
>>
>julien discord drama
>>
>>108111421
So I can run it Q1? lol. Into the dumpster
>>
>>108111421
The odds of them releasing a local model? (I would bet more /lmg/ than /ldg/ though)
>>
File: ComfyUI_temp_juqfx_00015_.png (3.45 MB, 1824x1248)
3.45 MB
3.45 MB PNG
>>108111477
Erm... thank you for your service? o7
https://files.catbox.moe/mwz0n3.png
>>
>>108111445
That's a really nice style
>>
File: 021273859174047744]-2.webm (3.91 MB, 576x1024)
3.91 MB
3.91 MB WEBM
>>
File: 1300002anima-preview-2.png (1.98 MB, 1280x1552)
1.98 MB
1.98 MB PNG
>>108111478
He got caught doing this a month or so ago
He really needs help
>>
>>108111480
>>108111421
If they release Opus 3 or Claude 2.1 (my beloved), I’d be willing to rent a GPU just to use it.
>>
>>108111501
this is great
>>
>>108111502
>He got caught doing this a month or so ago
where?
>>
>>108111501
Pure cinema
>>
>>108111514
ugly person, ugly attitude, ugly gens
>>
File: 1742536674285705.png (579 KB, 1000x778)
579 KB
579 KB PNG
>>108110980
Posting in an epic bread.
>>
File: 117689547.jpg (503 KB, 1248x1824)
503 KB
503 KB JPG
Glad I’m only into retro anime mecha aesthetics and this model solved all my problems https://civitai.com/models/860092?modelVersionId=2584964
>>
>>108111536
pdf
>>
File: fromY.png (1.08 MB, 800x1000)
1.08 MB
1.08 MB PNG
>>
File: toY.png (1.14 MB, 1000x800)
1.14 MB
1.14 MB PNG
>>
File: z_image_bf16_00315_.png (3.53 MB, 2016x1152)
3.53 MB
3.53 MB PNG
>>
>>108111514
Yes. It's an obedience test. You have to praise his gens or you out yourself as an enemy.
>>
>>108111466
wait he is actually trying to nuke the competition? kek
>>
>>108111445
I'm gonna make you look like a samefag, but I also really like it.
>>
>>108111517
read the OP
>>108111598
You act like I'm a god and ani is some good boi who dindu nuffin
fuck off
>>
>>108111502
I really like this gen.
>>
File: 75674634.png (5 KB, 90x258)
5 KB
5 KB PNG
nice, i’m testing anima on forge neo haoming delivered fast i’m gonna try hires.fix and img2img.
>>
>>108111370
>please try my binaries *sad cat*
Pathetic, why would anyone download and run his binary blobs when he declared dozens of times he wants to doxx his "schizo anon"?
>>
>>108111644
>forge
hahaha noob
>>
File: ComfyUI_temp_juqfx_00005_.png (3.04 MB, 1824x1248)
3.04 MB
3.04 MB PNG
https://files.catbox.moe/bmeal9.png
>>
File: LikenessSwap.jpg (1.97 MB, 1920x3216)
1.97 MB
1.97 MB JPG
>>108111436
IDK anon, seems like an issue of skill. Here's a random stock photo lady replaced with Margot Robbie, for example. Klein 9B Distilled, 4 steps.
```Completely replace the face of the woman in photographic image 1 with the exact same face of the woman from photographic image 2. Keep all other aspects of the composition and layout and lighting and color palette in photographic image 1 exactly the same as they are.```
>>
>>108111629
the op rentries are schizobabble so I stopped paying attention to those. show us where ani was "outed" or you are a schizophrenic loser
>>
>>108111667
where is the fent?
>>
>>108111672
Nta but what do you want proof of? Comfy making fun of him? Julien posting realistic cute and funny? His crashouts on github? His alcohol problems? The depression posting? All the lies he told over the years (and never delivered)? ...
>>
File: AniStudio-00010.jpg (1.32 MB, 979x2558)
1.32 MB
1.32 MB JPG
>>108111672
nta but let's see you try to get out of this one again ani
>>
>>108111696
proof?
>>
File: 1320001anima-preview-2.png (2.42 MB, 1280x1552)
2.42 MB
2.42 MB PNG
>>108111650
He won't be stupid enough to pull that on a anon, but for all the shit he talks you would think the unemployed fuck spin up a VM with gpu pass through or dual boot on linux to test out his broken slop.
>>108111656
Skill issue
>>108111672
>schizobabble
outed yourself
We're done here, your self soothing mechanism has no power here especially since you can't contest anything
>>
>>108111383
it's on topic so fuck off
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
>>
>>108111704
>you would think the unemployed fuck spin up a VM with gpu pass through or dual boot on linux to test out his broken slop.
it's a fucking embarrassment that after over a year of work he doesn't have a CD pipeline, as if he never heard of the concept. subjunior dev and he thought he deserves investor bucks lol
>>
>>108111656
Superior hi.resfix
Superior GPU RNG
Superior token weighting
Superior lora weighting
>>
File: 905444977929085.png (2.04 MB, 905x1248)
2.04 MB
2.04 MB PNG
Whatever happened to that lady that saw some motherfucker that wasn't real?
>>
>>108111717
maybe you should open a pr. not sure how he had Linux binaries before but you can ask him and help the project instead of being a jaded retard on 4chan. just some food for thought
>>
>>108111668
hmm i was hitting 4b via openrouter so maybe that's it?
can you try a full outfit + body preservation please?
>>
>>108111741
also, if you can do a random person instead of celeb would appreciate it - just on the off chance that celebs being in the training set changes anything
>>
>>108111717
And at the same time he shat on hlky for trying to help him setting up a basic github actions pipeline
What an insufferable asshole ngl
>>
>>108111738
ranfaggot is a nocoder
>>
File: 1329002anima-preview-2.png (2.54 MB, 1280x1552)
2.54 MB
2.54 MB PNG
>anima hit forge main
Goodbye swarm, time to go home
>>
>>108111738
why would I give you free labor when you've stolen my gens and insulted those I posted in threads you don't like?
>>
File: ComfyUI_temp_cnsni_00019_.png (2.58 MB, 1824x1248)
2.58 MB
2.58 MB PNG
>>108111667
>>108111695
me 0.001 seconds after I setup my GPU to calculate illegal math
>>108111737
Tiffany Gomas, the woman behind the viral "That motherfucker is not real" outburst on an American Airlines flight, has spoken publicly about the incident.

In a video posted in August 2023, Gomas apologized for her behavior, calling it her "very worst moment" and expressing regret for her emotional outburst and use of profanity. She acknowledged that her actions were unacceptable, even if distressing, and specifically apologized to passengers, especially those with children.
She clarified on the Pardon My Take podcast in November 2023 that she did not see anything unusual or non-human. The phrase was an expression of speech used during a heated argument with another passenger, which she described as having "bad energy" after switching seats.
The incident began after she accused a relative of stealing her AirPods, leading to an altercation that escalated. She refused to board the plane, claiming the flight wasn't safe, and was eventually removed by authorities.
Gomas was issued a criminal trespass notice but was not arrested. The flight was delayed by three hours due to deplaning and re-screening.
She has since launched a website, tiffanygomas.com, and expressed support for anti-cyberbullying efforts, though she described the online backlash as "invasive and unkind."
Despite the viral fame and memes, including conspiracy theories about shapeshifters and government threats, Gomas has maintained that her outburst was a personal moment of distress, not a sighting of anything supernatural.
https://files.catbox.moe/egdxp2.png
>>
>>108111748
Does that claim magically improve your predicament?
Imagine being so fragile, you project your frail ego on to others and wonder why you get caught samefagging every day
>>
>>108111741
4B Base or Distilled? Whether it's 9 or 4 the Distills are better for editing, the Bases are raw models with no SFT or RL training for aesthetics or coherency
>>
>>108111764
whatever they're serving here:
https://openrouter.ai/black-forest-labs/flux.2-klein-4b

i don't even see distills on the official bfl api
>>
File: 00006-3807541216.jpg (1.45 MB, 2560x1536)
1.45 MB
1.45 MB JPG
>>
File: 615947.jpg (37 KB, 420x420)
37 KB
37 KB JPG
Dramas and schizos aside, Z Image seems to require a timestep shift value of at least 2, more probably better, to halfway reproduce the likeness for celeb loras. There's that anon that did a lora with some youtube personality/dead wife who said he used stock settings except bumping up an "experimental value" which was probably timestep shift. Works very well with ZIT too.
>>
>>108111782
thanks for sharing your findings anon
>>
>>108111777
It's free?
>>
>>108111741
4b is quite a bit worse than 9b
>>
what caused the tran meltdown this time?
>>
>>108111832
nope.

>>108111845
hmm, for some reason they don't like 9b on openrouter - let me see if i can hit it anyway.
>>
>>108111832
>The first generated megapixel is charged $0.014. Each subsequent megapixel is charged $0.001.
>>
>>108111858
And how do I use it? Is there an API node in Comfy for Openrouter?
>>
>>108111875
i just get gemini cli to write the scripts and run the tests for me
>>
File: FaceSwaps2.jpg (2.52 MB, 3216x1920)
2.52 MB
2.52 MB JPG
>>108111741
Same prompt as before, Klein 9B Distilled, 4 steps. Both inputs are random stock photos from Pexels.
>>
File: 1711876781486377.jpg (186 KB, 1080x811)
186 KB
186 KB JPG
>>108111792
Most info on ZIM lora training I've seen online is complete garbage, only way to get anywhere is to try things yourself. LR I'm using on these experiments is 1e-4 btw, with AdamW. I'm training a bit further to see if things improve further, otherwise I'll try a new run with Timestep Shift value 2.33(I started at 1.0 and worked my way up in steps by 0.33).



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.