[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: collage_1759780042_1.png (1.48 MB, 1083x582)
1.48 MB
1.48 MB PNG
Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106807358

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
Ignore off topic time wasting post that are only made to be inflammatory
>>
Blessed thread of frenship
>>
>>106810439
is that lora just for anime edits? I want to do edits on photos of real people
>>
Dunno why it made the multithread collage so tiny. Oh well.
>>
>>106810419
Thank you baker, I'm not cut out for baking
>>
>>106810419
>Ignore off topic time wasting post that are only made to be inflammatory
again? get rid of that shit already!
https://www.youtube.com/watch?v=ThWTOuiMV7M
>>
File: 1740920304657784.png (1.12 MB, 880x1184)
1.12 MB
1.12 MB PNG
>>106810452
no, it works on anything

original photo was just a bike.
>>
>>106810440
skill issue
>>
File: 1732194351678025.png (892 KB, 1360x768)
892 KB
892 KB PNG
the man with white hair on the right in image1 is sitting with the man in image2 at a mahjong table.
>my tsumo is augmented
>>
>haven't genned since SD 1.6 was the new hotness
>try out the new high vram, super slow model (HiDream)
>it doesn't look any better at all
gonna try the latest SD but this isn't very impressive honestly
>>
who the schizo here in /sdg/?
>>
>>106810471
>skill issue
the model has skill issue yes, now what? you'll have to prompt "don't forget that hands have 5 fingers" or "don't put a 3 leg please"? there's some shit that shouldn't be said, of course we want quality, that shouldn't be written on the prompt, quit sucking off Alibaba's cock it's embarassing
>>
Any decent guide on training wan loras? I'm trying to find some but info seems to be scattered as fuck.
I did some initial tests already and the thing seems to OOM very easily on a 4090, anything bigger than 2 seconds at 480p and dim 8 seems to go out of vram. Iteration time also seems shitty compared to training SDXL loras.
>>
>>106810484
cliff notes version
>get wan 2.2 and the lightx2v lora, 4 or 6 steps
>get noobai/illustrious models for anime like base noob, and wai v15 off civitai
>get qwen image and qwen image edit
there, you are now equipped to make images and video.
>>
>>106810493
>mad about based unpozzed chinks
hello kike
>>
/sdg/ is having a melty currently
>>
>>106810419
post second from top left image
>>
File: file.png (18 KB, 693x234)
18 KB
18 KB PNG
lmao
>>
>>106810484
>HiDream
wtf are you doing nigga
>>
>>106810516
it's a big vram model which came out recently
i presumed it would be good
what should i use instead?
>>
>>106810502
>unpozzed
Qwen image is slopped and can't do private parts, is that what you define unpozzed you cocksucking kike?
>>
File: ComfyUI_00017_.png (1.76 MB, 1328x1328)
1.76 MB
1.76 MB PNG
>>106810240
My first actual gen put into the collage. I'm honored.
>>
File: 1751648851507509.jpg (813 KB, 2584x1119)
813 KB
813 KB JPG
Qwen Image Edit sucks
>>
File: 1753113460899822.png (865 KB, 1248x832)
865 KB
865 KB PNG
the man in image2 standing beside the man in image1 and is pointing a black pistol at the head of the man in image1. keep the appearance of the man in image2 the same.
>I know you work for the illuminati, Altman
>>
>>106810511
>subjective bias
nothing subjective about that, he knows a tiny population can run a 80b model, so he's putting pressure on Tencent by forcing them to release normal sized model (or else no implementation), that's based actually
>>
File: 1VmvE6Gsjgk.jpg (77 KB, 608x698)
77 KB
77 KB JPG
>>106810524
lol
>>106810511
Hunyan is simply bloated shit. Good call to drop it. Even with quants it's useless.
>>
>>106810561
>keep the appearance of the man in image2 the same.
who would want that? it's so weird to have a low poly 3d render mixed with a badly photoshoped photo of a human loool
>>
>>106810593
but that was exactly what he ordered? comfy has been pushing for larger generalistic image models for years, gets exactly what he ordered and says he don't like it. fuck comfy
>>
>>106810511
you keep forcing this narrative that Comfy made a bad call here, and everytime everyone answer to your post and call you retarded, when will you give up Ani?
>>
comfy shills forcing python for another 5 years. how about cut it out? fuck comfy
>>
>>106810616
ComfyOS is going to be the BEST anon!
>>106810617
ComfyUI never 'phones home' you can't prove it!
>>
File: 1739375605604164.png (220 KB, 1353x859)
220 KB
220 KB PNG
>>106810616
>but that was exactly what he ordered?
we'll get a smaller model, the multibillion dollar company Tencent bent the knee to Comfy
https://xcancel.com/T8star_Aix/status/1972934185624215789#m
>>
File: 1754139852048736.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>106810551
works on my pc, use the comfy template workflow, use the new 8 step lora

the anime girl Hatsune Miku in image1 is pointing a black pistol at the man in image2. the background is a stage with a "ClosedAI" logo behind them.

and it was just a cropped pic of OG Miku. still works fine.
>>
>>106810593
>he's putting pressure on Tencent by forcing them to release normal sized model (or else no implementation)
>that's based actually
No. He's killing local by making sure no one will release an actually capable local model that can reach the capability of API models (all larger than 80b).
Local will be stuck forever with slop with this mindset. Fortunately there isn't no one like this sabotaging this in LLMs and we can enjoy 600B models locally.
>>
>>106810616
>comfy has been pushing for larger generalistic image models for years,
not at all, he didn't implement StepVideo (30b) because the model is too big
>>
>>106810622
if ani is actually successful and gets ggml into the limelight like llms, I'll jump ship. it's all we have right now so even though I hate it, I have to use it
>>
>>106810639
hunyuan is bloated shit. what a weird hill to die on
>>
>>106810605
i haven't done anything with imggen in 2 years, i just asked a chatbot to search the web and make a table comparing all of them and picked the most recent high vram chinese model (i assumed western ones would be gimped due to censorship)
it's damn slow, gonna try SD 3.5 next.
>>
>>106810639
>API models (all larger than 80b)
let me guess, you saw that in a dream?
>Local will be stuck forever with slop with this mindset.
dude, you think local will be saved if we get a giant model no one can run? how?
>>
>>106810631
>Tencent bent the knee to Comfy
they don't listen to comfy, they just noticed nobody cared. if anyone unironically listened to comfy things would be much worse
>>
File: ComfyUI_temp_magnf_00001_.png (1.65 MB, 1040x1280)
1.65 MB
1.65 MB PNG
>just a gorillion more parameters bro
>>
>>106810656
try qwen image
>>
File: 1741706060838100.png (1.73 MB, 1360x768)
1.73 MB
1.73 MB PNG
>>106810637
>works on my pc
no it doesn't, try to go for 2 humans and make them do something completly different to their original image, and at this moment it won't work anymore
>>
>>106810659
>if anyone unironically listened to comfy things would be much worse
oldfags know, newfags can't understand. a tale as old as time
>>
>>106810657
>you think local will be saved if we get a giant model no one can run?
They said the same when Deepseek models came out. And nowadays many have bought the hardware to run it (and similar size models) locally.
>>
>i'm a vramlet mad about highparams
kek
>>
>>106810659
>they don't listen to comfy, they just noticed nobody cared.
they care even less since they can't run it on ComfyUi, hello?
>>106810671
ah yes, the local model Deepseek, the exact same model that created a split between /lmg/ and a new thread based only on Deepseek because it wasn't deemed local due to its size?
>>
Tencent provides the least cucked datasets, one day they will shit a decent quality model
>>
>>106810676
post your hunyan gens
>>
now that the dust has settled are edit models good enough to enhance data sets for lora training?
>>
>>106810690
nano banana, always
>>
>>106810683
>they care even less since they can't run it on ComfyUi
technically kijai is the authority on getting shit into a wrapper. comfy just steals code nowadays. he's just phoning it in and isn't important anymore
>>
>>106810676
Debo only has 12GB of vram which explains his api trolling
>>
>>106810687
>Tencent provides the least cucked datasets
their dataset is the same as Alibaba's, that's why their drawing style is so similar, at some point I thought I was looking at a Qwen Image drawing
>>
>>106810690
no it still does off style shit, fucked up anatomy and sometimes (rare) has outright refusals. inpainting still required
>>
>>106810669
if you really want an exact face, just use the reactor node to do a 1:1 face swap on that result.

https://codeberg.org/Gourieff/comfyui-reactor-node

there is an open source tool for everything.
>>
Stop summoning nibgo by mentioning him he needs to stay in his containment thread
>>
>>106810676
>>106810713
seriously though, is there someone in there that has 96 gb of vram? you need a rtx 6000 (10000 dollars) to get that
>>
>>106810715
>ask for vagina in tencent model
>get vagina
>ask for vagina in Alibaba model
>pink blob
>>
File: 1731680149545414.png (939 KB, 1024x1024)
939 KB
939 KB PNG
>>
>>106810690
even the SOTA model (nano banana) isn't that impressive desu, that's expected, Image models is a pretty new field, you need time to perfect shit
>>
File: ChromaPainterly_00051_.jpg (1.52 MB, 2312x1616)
1.52 MB
1.52 MB JPG
>>
fizzledorf and debo should be dragged out on the street and shot
>>
>>106810734
we get it
it's shit
>>
>>106810736
>you need time to perfect shit
you mean "make safe" right?
>>
>>106810721
>just use the reactor node to do a 1:1 face swap on that result.
then what's the point of using that edit model? you could use a regular image model + use reactor
>>106810747
kek, this
>>
>>106810723
I've seen at least a dozen have showed photos/task manager/nvidia smi screenshots of rtx 6000 blackwell on /ldg/ and /lmg/ combined.
>>
>>106810747
go back to begging for shekels Sam
>>
>>106810742
nigbo, cumfart, trani and niggerjak should be dragged and shotted
>>
>>106810734
>draws the lower half of his body anime style
looooooool, this is so bad
>>
File: ComfyUI_temp_gonas_00002_.jpg (601 KB, 2703x2703)
601 KB
601 KB JPG
>>106810723
We had an anon running the full version of Hunyan and on it and it still looked like shit.
>>
comfy should get dragged down an alley by a kitsune and blown
>>
>>106810750
real
>>
>>106810769
trannie already did this several times though
>>
>>106810742
I get debo but ani isn't here man. he only comes by when he updated anistudio
>>
you are automatically culling your synthetic dataset by running cv and some multimodal LLM on them for QC, right?
>>
>>106810764
okay, then go pay money for your SAAS shit

enjoy spending $1000 on videos, even gambling has a chance to win money.
>>
>>106810761
>reputation so bad he drags others hoping it will work
You do this all day, don't you have anything else going on in life?
>>
File: ChromaPainterly_00053_.jpg (1.64 MB, 1584x2360)
1.64 MB
1.64 MB JPG
>>
>all these API shills defending cumfart
>>
Debo is only here because he finally turned /sdg/ into his paradise (a dead shithole)
>>
>>106810782
I don't want to shill SAAS, I just don't want to pretend that the local models we have so far are good, is that too hard to accept?
>>
Use case for image generation?
>>
>>106810761
Line 'em up
>>
>>106810804
unchaining yourself from jewish-owned pornsites
>>
>>106810808
that's what yt-dlp and stash are for
>>
>>106810815
I mean total stop to flesh hoes content
>>
>>106810803
stuff like noob/illustrious, wan, and qwen/qwen edit are amazing for being local, you can't expect local to compete directly with $1000000000 in H100s on AI farms.
>>
>>106810824
>you can't expect local to compete
you lack ambition then, at some point we'll improve the architecture + the training method to the point a smaller model will have the same quality of current SaaS, it's just a matter of time, Tencent is giving up by stacking layers, there's so much to improve but that requires to not be a lazy fuck, they could first of all put more effort on their dataset, adding synthetic shit is a poison that makes your model slopped, the chinese fucks don't want to deal with this, it's their loss
>>
>>106810846
still, wan 2.2 is far better than a lot of AI video sites and generate in less time (with lightx2v)

if anything SAAS is stagnating cause of censorship bullshit and "ethical AI" concerns.
>>
>>106810846
>their loss
image gen isn't worth even a fraction of 1% of what llms are worth
>>
>>106810766
what's gonas?
>>
>>106810853
the only thing new in comfy nowadays is saasshit. it's poisoning local too
>>
>>106810856
I don't get Tencent, they don't want to spend money on a more quality dataset, but they spent money on a 80b model, training such a big model is soo expensive
>>
File: ChromaPainterly_00055_.jpg (1.69 MB, 1584x2360)
1.69 MB
1.69 MB JPG
>>
>>106810853
nobody who isn't a poor neet pedophile cares about censorship
>>
>>106810862
they dont really care about image gen, they just want to say they have a model and it has some big number of parameters, the actual outputs from it are basically irrelevant to them
>>
>>106810869
then why did everyone drop sora2 after censorship was implemented? is everyone a poor neet pedo?
>>
>>106810879
no idea what you're talking about, you mean the openai one? everyone still uses their product, especially normies
>>
>>106810869
censorship isn't about tits. the reason you should care is OpenAI should not decide what you can prompt.
>no you can't do that!
imagine PAYING for censorship.
>>
>>106810874
this, it's just benchmaxxing behavior for money. if only investors knew all the benchmarks don't actually translate to use case
>>
>>106810853
>if anything SAAS is stagnating cause of censorship bullshit
our video model is censored too, censorship isn't just "we can't do coom" it also means we can't render celebrities, characters, styles, Sora 2 can do that without any issue (you want a 80's avertisment about ff7 characters mixed with Love Live characters and Will Smith? you got it!), for Wan we are lucky it can render Miku at all, pathetic really
>>
I just want to thank /sdg/ for being completely useless when I was new to local diffusion. All my questions were ignored in favor of some avatar retard spamming their garbage and getting complimented. This angered to the point of doing my own research and now, 6 months later, I can do anything I want.

thank you /sdg/
>but this is /ldg/
yes, but I know those faggots lurk here and I refuse to ever post in that general again.
>>
File: radiance.png (3.09 MB, 848x1488)
3.09 MB
3.09 MB PNG
>>106810862
maybe they want to get/test the technology rather than the actual model
>>
>>106810902
Lora or does it know such recent genshin characters?
>>
>>106810895
:^) proving the sanctions aren't working may be the whole point.

Consider that it might not actually be the advertised size.
>>
>>106810895
Are you the same guy who was disappointed in anime diffustion thread too?
>>
>>106810908
it knows some, not all
>>
File: RUN.jpg (91 KB, 566x806)
91 KB
91 KB JPG
any loras for skinless flesh/flayed skin
>>
>>106810913
You're a very annoying schizo and you should fuck off this board. It's bad enough you post here now but you spill your mental illness in other threads
>>
File: chroma dreams.png (2.29 MB, 1024x1024)
2.29 MB
2.29 MB PNG
>>
>>106810869
lol go try to gen anything on sora
>>
why does qwen insist on making women brown? it's genuinely fighting against making white people wtf is this
>>
File: 1748246620556869.png (330 KB, 3102x1544)
330 KB
330 KB PNG
>>106810892
>if only investors knew all the benchmarks don't actually translate to use case
it'll be hard to break that cycle, that bloated model is currently 1st on the Llarena ranking
>>
>>106810948
>3608
>>
>>106810869
censorship is a slippery slope. eventually you will not be able to generate anything remotely copyrighted. this means forget doing anything involving licensed anime.

people like you are so stupid.
>>
>>106810948
the day people realize these benchmarks are all gamed is the day we can have good models again.
>>
>sd3.5 wants you to dox yourself on HF to download it
lel
>>
>>106810953
>95% cl +-10
>>
>claims it has its knowledge update to september
>doesnt know best girl from last anime season
trash (or at least only partial)
SAD. Also tried some other gens with basier (way older) and got shitty results
>>
>>106810894
with i2v it doesn't matter what the character is, you have a source and it just works

for t2v it matters knowing all the people natively more.
>>
File: 1758525295574911.mp4 (3.88 MB, 800x656)
3.88 MB
3.88 MB MP4
>>106810976
>with i2v it doesn't matter what the character is
it matters even on t2v, if you want to make someone appear on the screen, the problem is the same, you only have Miku as a choice, it's not fun at all
>>
>>106810943
the model is thirsty for BWC
>>
China NEEDS to make Wan 2.5 available locally to undercut Sora 2.
>>
>>106810993
I think the biggest problem is the lack of IP characters, when OpenAI decided to remove this, people said that this model was dead and hype died off
>>
>why is a company currently being sued for IP infraction and under court order to preserve all user interactions for discovery being cagey about IP
gee i wonder
>>
>>106811000
yeah, that's all what people were doing with sora 2, crossover of characters and shit
>>
File: IMG_1642.gif (19 KB, 220x220)
19 KB
19 KB GIF
>anal sex image
>with lightx2
>very basic linear motion. she barely reacts. the guy barely reacts. both kinda just bored

>without lightx2
>the girl is fucking bouncing up and down his dick
>she's looking back at the guy and twirling her hair while smiling
>the guy is actually thrusting and getting up in there
>the whole fucking couch is moving with their rough sex
>>
File: AnimateDiff_00376_1.mp4 (3.75 MB, 720x720)
3.75 MB
3.75 MB MP4
trying 20 seconds with the 'vantage' mod for wan
>>
>>106811004
>I'll just pretend that they haven't used IP as an effective bait and then switched once the hype was at its max, and I'll just pretend they have done this before with 4o generation as well
oh hi Sam
>>
File: 1750339041684681.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
another neat thing of the edit model is you can copy a typeface without knowing the font. what do you do if you dont have the .ttf? you aren't going to try and make the letters, so AI can do it.
>>
File: raadiance.png (3.13 MB, 848x1488)
3.13 MB
3.13 MB PNG
>>106810908
just a current chroma radiance snapshot, no lora

but don't expect characters to be as reliable as on noob/illustrious so far
>>
>>106811024
>manlet Miku
:(
>>
>>106811025
horrid quality
if i wanted lines like that i'd ask michael j fox to draw it
>>
>>106811025
I thought the dataset is way older. I can do some degen shit with Furina now.
>>
File: 1750306230104214.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>106811043
well the source image was low quality, would be better with a larger image.
>>
>>106811064
it just looks like she has been copy pasted on the image without any care on making her mix with the original background, it's not good, maybe you're having fun with that model (and good for you, that's all that matters), but I find the work really amateur
>>
how come they are allowed to infringe on all the IPs all over again when they release another model? I don't get it
>>
File: radiance.png (3.32 MB, 848x1488)
3.32 MB
3.32 MB PNG
>>106811057
i *did* say and want to show that it's not as reliable/equally good as noob/illustrious, yes

>>106811058
haven't tried her. maybe it'll be possible, I wouldn't fully count on it yet tho
>>
>>106811080
the better question is: "how the fuck is Midjourney still surviving after letting people render copyrighted IP for 3 years straight at this point"
>>
>>106811080
>conveniently forget
>everyone gets hooked
>rugpull
it's the only way to get paid users, it's by tricking them
>>
File: 1733252401139087.png (1.35 MB, 1360x768)
1.35 MB
1.35 MB PNG
replace the man in the image with Hatsune Miku. The bridge is made out of sand. Change the color of the red dragon in the background to green.

neat
>>106811074
there are better examples desu
>>
>>106811090
>Midjourney still surviving
it's still alive? I don't know anyone that uses it anymore
>>
File: 1747896007780304.png (359 KB, 1886x1355)
359 KB
359 KB PNG
>>106811095
it's making more money than ever actually
https://electroiq.com/stats/midjourney-statistics/
>>
File: 1759064656779805.png (3.96 MB, 2555x720)
3.96 MB
3.96 MB PNG
>>106811093
the red dragon in the distance is perched on the side of the bridge near the camera.
>>
>>106811118
>the dragon had that subtle desaturated red
>now it's just dark red
slop
>>
File: My PC.png (47 KB, 1301x259)
47 KB
47 KB PNG
>>106808737
If you want actual storage space (and are someone who can't afford 30TB SSDs), mechanical is the way to go.
>>
>>106811131
the source image is potato quality and has almost zero details of the dragon, so how is it supposed to know specifics
>>
>>106811136
>67tb of space but poorfag tier 250$ ram
lmao
>>
File: 1748099602911699.png (2.14 MB, 1344x768)
2.14 MB
2.14 MB PNG
>>106811138
>iz impossibruu
stop saying that, nano banana had no issue with your image
>>
File: 1755901169670250.png (1.6 MB, 1360x768)
1.6 MB
1.6 MB PNG
>>106811118
change the location to a festival with balloons, and kiosks with food. tables of food are placed on the bridge. remove the red dragon. keep the user interface the same.

a silly example but still works.
>>
>>106811136
Yeah but plates are too slow for loading gigarillion GB models.
>>
File: 1742786985715037.png (823 KB, 1248x832)
823 KB
823 KB PNG
SAASfags will never learn
>>
>>106811166
Yeah, now... but it was built 5 years ago!

>>106811185
Just don't use them for work. Use them for storage.
>>
File: 1743004747703933.png (375 KB, 1341x1626)
375 KB
375 KB PNG
>>106811194
ahahah imagine being an APIkek, they really act battered wives desu
https://xcancel.com/AndrewCurran_/status/1974468940517753336#m
>>
>>106811170
this just demoralized me, local will never catch up...
>>
How many years after ASI will we need before every big workflow won't need Unload All Models from RAM node before the end? Is it even physically possible to achieve this level of technological advancement outside of emulating it within the Matrix we will one day create?
>>
File: 1757478375259991.png (943 KB, 1248x832)
943 KB
943 KB PNG
>we get them to pay for our "non profit" then rug pull them
>>
>>106811225
He's going to have another meltdown. The truth is the api schizo is just mad that he can't afford a new card even with his disability check.
>>
>>106811246
imagine paying a 10000 dollars card to render this slop lol
https://www.reddit.com/r/StableDiffusion/comments/1nt22sm/hunyuanimage_30_t2i_examples/
>>
what is a decent video upscaler (not frame interpolation, kill yourself) that isn't seedvr?
>>
>>106811006
>no links
>>
>>106811263
they are all a meme, get cracked topaz and play with the different models
>>
>>106811259
I wouldn't pay for this in any way shape or form. Looks like sloped trash.
>>
https://files.catbox.moe/z2z7ue.mp4
>>
>>106811259
These niggas keep hyperfixating on the text placement and ignore that the image overall looks like ass. Qwen must've mindbroken them.
>>
>>106811285
post this here anon >>>/wsg/5989007
>>
>>106810955
>anything involving licensed anime.
and nothing of value was lost
>>
File: ChromaPainterly_00065_.jpg (1.17 MB, 1608x2216)
1.17 MB
1.17 MB JPG
>>
File: 1739221185868457.png (803 KB, 1248x832)
803 KB
803 KB PNG
openAI board meeting:
>>
>civitai jannies wont review my posts
how the fuck am i suppose to grow and leave the 9 to 5 rat race?
>>
>>106811308
posts have to be reviewed? wtf
>>
>>
> funny how the haters always talk about me
>>
File: 1730238855203914.png (816 KB, 1248x832)
816 KB
816 KB PNG
>>106811300
the man is in front of a TV monitor behind him saying "non-profit = charge money to generate cats", with the image of a cartoon cat beside the text. He is pointing up at the monitor. keep his expression the same.
>>
File: 987032184001034.mp4 (871 KB, 264x480)
871 KB
871 KB MP4
>>106811000
I still use it. It's still the best creative tool in existence right now. I can prompt something like:
>"This character does the same actions from xyz movie, and the humor and pacing is similar to xyz TV show from the 90s. The character fights similar to blah blah..."
There's no local or other SaaS model that can do this, it's the only one. OpenAI probably spends a ton of R&D on creative output rather than benchmaxxing and slop.

What makes it even better is that if I want to fix things up or change things around, I can use VACE or Wan i2v locally. It's a win win for me, but man I still wish we had a local equivalent.
>>
>>106811316
are you trying to be the new debo?
>>
>>106811318
>xyz movie
>xyz TV show
those are also copyrighted IP stuff, if OpenAI removes them it's over
>>
File: 1754185309591777.png (980 KB, 1248x832)
980 KB
980 KB PNG
>>106811316
>>106811321
no
>>
File: ComfyUI_I2V_00257.mp4 (1.71 MB, 1176x784)
1.71 MB
1.71 MB MP4
>"Sir, this is our latest chip. It featu—"
>"Let's have a taste!"
>>
>>106811327
>he will post 150 slight variations of the same prompt
/sdg/ is here for that, go spam your shit here
>>
>>106811336
relax sam, some people will still pay
>>
>>106811330
top kek, that slow motion is so unfortunate though, waiting for the lightvx fags to finally make a functioning I2V lora (like they did with T2V)
>>
File: 1733460557336630.png (997 KB, 1041x600)
997 KB
997 KB PNG
keep the box of text on the left. replace the girl with a manga style Hatsune Miku who looks disgusted. the image is in a black and white manga style with halftone shading.

this one's pretty cool.
>>
File: escape.jpg (367 KB, 1733x1318)
367 KB
367 KB JPG
>>
File: 1729781264898490.png (1.46 MB, 952x1096)
1.46 MB
1.46 MB PNG
>>106811377
by itself:
>>
>>106811377
it did its own thing though, the pose isn't the same, the drawing style isn't the same (look at the eyes and nose)
>>
>>106811393
well, it understands controlnet data, just put in a canny/depth or openpose of the original image as image2 source, that works too.
>>
>>106811324
Lemme explain. If I were to use an analogy, it's like a style LoRA but for motion and storytelling. You can use stuff like "inspired by... dances similar to..." and it will do it. If the genned video contains the IP or copyright like the character or likeness, it won't make it past the filter.

If "rightsholders" were to copyright storytelling, motion, and pacing, we have a bigger issue that would essentially screw over everything outside of image/video gen.
>>
File: wan22___0153.png (1.54 MB, 832x1216)
1.54 MB
1.54 MB PNG
>>106811393
are you retarded? you read that prompt and expected what?
>>
>>106811409
>You can use stuff like "inspired by... dances similar to..." and it will do it.
no it won't, if you say "similar to studio ghibli" you used studio ghibli's name, and that's copyright infrigment
>>
>>106811418
to keep the pose and artist drawing style you fucking retarded NIGGER, you are a low IQ nigger, you have to understand that
>>
>>
>>106811418
not gonna lie, footfag chroma anon is probably the most annoying guy on /ldg/, everytime he opens his mouth, he says retarded shit
>>
>fascinating how someones have so much free time to document the posting habits of others
>>
File: wan22___0195.png (1.64 MB, 832x1216)
1.64 MB
1.64 MB PNG
>>106811430
imagine the model just made up what it thought you meant when you didn't prompt something. use your last two brain cells and attempt to consider why the model doesn't assume things that are not in the prompt.
>>
>>106811508
>oh please dear model, the character must keep her 2 arms, oh yeah she has 5 fingers, you must keep her 5 fingers, and she has a head, you should keep her head, and she has hair, you should keep her hair
if you want this, you can take a rope if you want, call me crazy but the model should know implicitly that it should keep some shit, retarded moron
>>
File: ChromaPainterly_00071_.jpg (1005 KB, 1608x2216)
1005 KB
1005 KB JPG
>>
>>106811448
hes one of the more based posters as he always posts his prompts when asked, posts comparisons, and discussed the actual tech instead of whining like the rest of you worthless niggers every fucking minute of the day
>>
File: 1736798404350189.png (745 KB, 1015x627)
745 KB
745 KB PNG
>>106811528
>he's defending himself
>>
>>106811508
>imagine the model just made up what it thought you meant when you didn't prompt something.
The anon you're replying to is probably used to shitmixes and can't into base model prompting, don't mind him.
>>
>>106811506
>I didn't get the pattern of the anon spamming female asians
congrats anon, it means your IQ is on the 2 digits scale
>>
File: 1750552235905374.png (39 KB, 1066x259)
39 KB
39 KB PNG
>>106811536
and the retard classic: everyone that disagrees with me is a single person, definitely the kind of behaviour of cognitive disonnance seethe that contributes greatly to the thread's quality and isnt what made this general into this shitshow that it is, low iq zoomer retard
>>
>>106811550
you're still defending yourself though
>>
>>106811552
>nuh huh
thanks for proving me right again with that high iq argument, now continue seething in the thread for 3 more years
>>
the funny thing about samefag accusations is one doesnt really feel the need to defend against them unless they are in fact samefagging desu
>>
>>106811556
>for 3 more years
there you go, you're assuming I'm poopdickschizo, you should read the image you sent 1 post ago and see the irony of your sentence >>106811550
>>
File: wan22___0180.png (1.23 MB, 832x1216)
1.23 MB
1.23 MB PNG
>>106811542
>i will pray for his chromosome
>>
File: 1759732560289494.png (282 KB, 535x402)
282 KB
282 KB PNG
>>106811572
and I will pray for better hands
>>
>>106811566
this
>>
I was using kling and other AI sites for a long time, but finally installed wan 2.2 locally. And I have to say I'm impressed. It's image to video is as good as any of those online generators. Plus, totally uncensored
>>
>>106811570
of course, you're definitely just a regular newfag that just joined the thread instead of one of the low iq browns shitting it up every day despite speaking out of your ass about the posters in the thread while shitting it up over and over in each reply

sorry you got exposed as having no arguments and needing to resort to accuse everyone disagreeing with you as being one person, better luck next thread brown
>>
holy meltie
>>
>>106811570
>>106811566
>>106811595
samefag
>>
>>106811556
>for 3 more years
Meaning: He thinks I'm a regular schizo seething in sdg/ldg for 3 years at this point
>>106811610
>you're definitely just a regular newfag
So... which one is it retard? Am I an old fag or a newfag? Am I a shrodinger-fag?
>>
>disabo continues his seething despite 80% of the thread being fully aware of his antics
>anons now make fun of his ritual post
What's next ffaze going to pretend to be a newfag to ask for a recap for the 1000th time?
Going to randomly seethe about ran like anyone cares?
Going to go on your phone go agree with yourself?
Going to deflect and try to push heat on ani?
Or are you going to call everyone making fun of you schizo?
Oh! I know you're going to spam reports again only for the mods to ignore you because they are tired of you doing that whenever you feel "unsafe"
You need new material... better yet you need a new hobby because you're not improving and you don't have the intelligence to hang with anyone in this thread.
The avenge anon myself included has put in a fraction of the time you put into this hobby but you still can't make anything good.
>>
I know it's Monday and this is the Employed Thread but fellas let's settle down
>>
>>106811625
>so low iq he cant even read
kek, cant make it up
now reply to this again confirming for the third time you are low iq
>>
>>106811641
Concession Rejected.
>>
>>106811644
there we go, thanks for dancing for me monkey, now do it again
>>
>>106811629
>Going to randomly seethe about ran like anyone cares?
You are literally randomly seething about debo like anyone cares?
>>
>>106811639
>the Employed Thread
could have fooled me. too many neets are still around
>>
>>106811629
lmao this is the best lolcow post yet
>>
>>106811656
>>106811629
>>106811625
>t. samefagging chud
>>
>>106810451
this
>>
File: 00291-4267933873.jpg (1.1 MB, 2480x2480)
1.1 MB
1.1 MB JPG
Endless cycle
>>
>>106811651
>he's still replying
nice
>>
>>106811679
>no u
weak, try again monkey
>>
>>106811688
>he keeps replying
noice
>>
>>106811698
>repeats the line
kek, npcs just can't help themselves, dance more monkey
>>
When will we reach this level of sovl? >>>/wsg/5989414
>>
File: file.png (2.88 MB, 848x1488)
2.88 MB
2.88 MB PNG
>>106811672
endless cycle of what?
>>
>>106811718
Endless cycle of garish semi-furry gens I suppose.
>>
>>106811711
>>repeats the line
>kek, npcs just can't help themselves

>>106811651
>thanks for dancing for me monkey
>>106811711
>dance more monkey
ahah I love irony
>>
>>106811720
>no u
you already failed with that one, i said try again monkey
>>
>>106811688
>try again monkey
>>106811723
>try again monkey

>>106811711
>>repeats the line
>kek, npcs just can't help themselves
>>
gay lovers quarrel on /g/
>>
Jannies. Clean it up.
>>
isn't china tired of being humiliated by the us? they betrayed us for nothing. and now, they have to defeat two kings
>>
is it necessary to do FETCH ComfyRegistry Data every time you start, I know how to check for new nodes/updates.
>>
>>106811724
did i finally buck break your npc mind?
dont give up dancing yet monkey, again
>>
>>106811728
>they have to defeat two kings
I thought Alibaba had a chance against veo 3, but against sora 2? 0% chance
>>
>>106811718
Anons taking the bait and still feeding.
>>
>>106811729
yes. how else does comfyorg sell your data other than the login?
>>
File: file.png (2.75 MB, 848x1488)
2.75 MB
2.75 MB PNG
>>106811712
that is pretty good. dunno, 0.3-3 years? it probably depends on both model and hardware releases

or we're on parity within 0.0something of a year by doing nothing as the SaaS censors the sovl, lewd and politics out of the model
>>
/adt/
4 hours 30 minutes
26 / 11 / 4
>>
File: radiance.png (3.23 MB, 848x1488)
3.23 MB
3.23 MB PNG
>>106811719
it will be essentially endless, yes

>>106811753
i guess we'll consider the ai that talks to the ai some anon too over here
>>
>>106811724
kek
>>
File: 1740043493347549.png (2.68 MB, 2352x888)
2.68 MB
2.68 MB PNG
keep the white textbox on the left and leave the text in the textbox unchanged. the anime girl in a school uniform is pointing to the right with one hand, and has the same expression. the image is in a black and white manga style.

from an osaka image.
>>
>>106811802
why is it zooming out though? you never asked the model to do that
>>
>>106811770
Same thing sdg or ldg would have if you took away the shilling, schizo wars and slop comparisons.
>>
>>106811809
>is pointing to the right with one hand
>>
File: 1752312189393619.png (608 KB, 1176x888)
608 KB
608 KB PNG
>>106811809
if you really want to you could prompt "keep the pose the same" or use an openpose img or canny/depth to keep it exactly the same

anyways, what did Osaka mean by this?
>>
>>106811817
>slop comparisons
those are important though
>>
File: 1743555085122838.png (820 KB, 1176x888)
820 KB
820 KB PNG
>>106811821
im not going to finish it but you get the idea.
>>
>>106811724
>monkey conceeded
there we go, thanks for the dance bro
>>
>>106811819
>your arm can only be extended, there's no way to flex it so that you can only see her hand pointing out
(You)
>>
>>106811839
Where the fuck would you put a pointing arm in the original pic?
>>
File: 1753927120963367.png (781 KB, 1176x888)
781 KB
781 KB PNG
you anons should relax. make a 1girl.
>>
>>106811845
she said she is pointing to the right with her hand, not that she's poiting her arm, do you know how to read or something?
>>
what's my best bet for a video model to put john cleese's face on sexy body and lipsync "and now for something completely different"
>>
>>106811775
>>106811768
Chroma? Could you please share to me your workflows I want to do experiments
>>
>>106811833
>im not going to finish it
tch
>>
>>106811821
>>106811833
>N I G ht
>>
>>106810824
doubt qwen is that far off in terms of params, the dataset is just total synthetic trash
>>
>>106811900
yeah, I think 20b is a good spot to get serious quality, the only thing lacking is a great dataset
>>
>>106811953
I'd say less than that. around 12B with an unslopped dataset
>>
File: file.png (4 KB, 378x96)
4 KB
4 KB PNG
yo i installed anistudio and started it but now my computer is glitching what is going on man? fuck this
>>
SD3.5 sucks absolute ass at making flags
>>
>>106811718
moar
>>
how censored is qwen-edit? Not for sexual stuff, I want it to simulate people being gored and things like that for shitposting
>>
File: file.png (11 KB, 194x247)
11 KB
11 KB PNG
>>106812089
im done with this piece of shit fuck you for recommending it
>>
File: ComfyUI_0034.png (2.82 MB, 1248x1872)
2.82 MB
2.82 MB PNG
>create elaborate spaghetti maze with multiple custom subgraphs
>use it to gen generic 1girl slop
Genuinely wouldn't have it any other way. The programming lego itself is fun.
>>
File: 00007-204781246.png (2.26 MB, 1248x1824)
2.26 MB
2.26 MB PNG
>>
File: 1731904610487977.png (1.26 MB, 1552x672)
1.26 MB
1.26 MB PNG
I will now buy your game.

replace the clothes of the girl in the blue and white dress with a black bra and panties. she has large breasts.
>>
File: 1748141016923583.png (109 KB, 258x544)
109 KB
109 KB PNG
>>106812089
>>106812109
>>
File: 00332-3767886975.jpg (1.04 MB, 2480x2480)
1.04 MB
1.04 MB JPG
I don't see anything furry about this but this guy is far gone anyways
>>
>>106812136
>not adding "and wide hips"
FAG
>>
>>106812135
she was always the worst of the warden trio
>>
File: 1728465450729756.png (1.28 MB, 1552x672)
1.28 MB
1.28 MB PNG
>>106812136
replace the clothes of the girl in the blue and white dress with a black bra and panties. she has light skin, and large breasts.

there, forgot to mention skin tone. now it matches the arms.
>>
>>106812136
>>106812156
What was the original?
>>
File: 1731969992717320.png (1.25 MB, 1552x672)
1.25 MB
1.25 MB PNG
>>106812143
there you go.
>>
File: brokeKEK.mp4 (511 KB, 608x400)
511 KB
511 KB MP4
>>>106807115
;3
>>106810451
blessed flanzone for FRENS!
>>
>>106812136
>>106812156
the skin tone didnt change...

>>106812165
I guess it's harder for it to do big hips instead of big tits.
thanks tho anon
>>
File: 1759433734221078.png (1.21 MB, 1393x606)
1.21 MB
1.21 MB PNG
>>106812163
original (which is good, pokemon take notes):
>>
>>106812156
>>106812165
>>106812172
just stitch the images together bro, you don't have to make 3 posts for something that could be summed in one
>>
File: 1751023136412078.png (960 KB, 816x1280)
960 KB
960 KB PNG
the anime girl is holding a sign saying "_ IGGERS". At the bottom of the image is white subtitle text with black outline saying "what things do you dislike, Pippa?"
>>
File: radiance.png (1.49 MB, 848x1488)
1.49 MB
1.49 MB PNG
>>106811873
that's the chroma radiance snapshot, i update my copy every few days so just pick the latest

> Could you please share to me your workflow
sure, here
https://litter.catbox.moe/swkx2wil9t8g71u6.json

arguably you might as well use the standard workflow with 30-60 steps euler normal/beta or such
>>
File: 1757694453355292.png (994 KB, 816x1280)
994 KB
994 KB PNG
>>106812201
better arms:

this model is pure gold.
>>
File: 1756410343146901.png (1.15 MB, 816x1280)
1.15 MB
1.15 MB PNG
>>106812219
one more pippa.

the anime girl is smoking Marlboro cigarettes and is holding a bottle of jack daniels. She looks tired.
>>
>>106812201
if you wrote "NIGGERS" would it not work?
>>
>>106812172
>>106812165
she already has big breasts in the original, did you need to prompt for big tits to get it to make them big in the edit, or was it already aware of her chest size?
>>
File: 1745369209095634.png (1.03 MB, 816x1280)
1.03 MB
1.03 MB PNG
>>106812236
kek if you type "turn this character into a nigger" this is what happens:

china doesn't care about political correctness.
>>
>>106812172
>no navel outline
Sad!
>>
>>106812249
LMAO
i gotta clone this repo asap
>>
>>106812249
Nigkin Nigga
>>
>>106812247
usually it will just use the proportions of the character. like if you use a japanese gravure model, it will keep their proportions, unless you specify otherwise.
>>
>>106812249
that's pipkin pipper with a hard R
>>
>>106812254
make sure it's the 2509 model (v2), and use the new lora:

https://huggingface.co/lightx2v/Qwen-Image-Lightning/tree/main/Qwen-Image-Edit-2509

comfy template for qwen edit is updated and works, just connect the lora and 8 steps or 4 is all you need (8 is fast so I use 8)
>>
>>106812249
>has "nigger" in the dataset
>but not "teto"
FUCKING CHINKS
>>
>>106812138
You are absolutely right: this image is 100% gay.
>>
>>106812089
>>106812109
the next level of compute requires sacrifice
>>
File: 1735413361862104.png (1011 KB, 816x1280)
1011 KB
1011 KB PNG
sheeeeeeeeeeit

actually hilarious it has the tag trained and associated with images.
>>
File: radiance.png (2.84 MB, 848x1488)
2.84 MB
2.84 MB PNG
>>106812096
>>
Been messing with Neta Lumina and it's really impressive. I have no clue why the Chroma dude didn't use Lumina as a base. The Schnell rankensteining was a bad move IMO.
>>
>>106812273
based chinks as usual
>>
File: 1750652513397650.png (719 KB, 640x1632)
719 KB
719 KB PNG
>>106812273
here anon

the anime girl is holding a sign saying "fucking chinks!" with one hand, and is pointing to the sign with the other hand.

no teto in dataset doesn't mean teto can't do stuff from an image.
>>
>>106812273
whats teto
>>
imagine if they censored things in chinese but not english lel
>>
>>106812310
the least generic and least random tranimegirl the most mentally sane weeb has an obsession with
>>
File: 1754080943903397.png (618 KB, 640x1632)
618 KB
618 KB PNG
>>106812309
>>
>>
>>106812317
oh so it's not "censored" it's just "not in the training data because nobody cares" kek
>>
>>106812292
>Been messing with Neta Lumina and it's really impressive.
can you show some examples?
>>
File: 1744237885456236.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
for science

blackune blacku

same prompt as >>106812249
>>
Hey, everyone. If I use the inpaint function, can I kind of force some things in video generation? For example, if I generate a paper airplane in a guy's hand with inpaint, is it easier to then make a video of that same guy throwing the paper airplane into the air?
Thanks.
>>
>>106812344
yeah, it'd be easier with wan i2v to have them throw an airplane if you had the starting image with a paper airplane in their hand.
>>
>>106812340
xhe... xhe's just like me... i... thank you.
>>
>>106812167
kekked
>>
>>106812340
OMG IT NEGRU
>>
>>106812216
Thanks!
>>
File: radiance.png (3 MB, 848x1488)
3 MB
3 MB PNG
>>106812292
why didn't the neta lumina people train flux schnell, qwen, hunyuanimage, hidream, cosmos or something else?

probably mostly because you can't know how well you'll comparatively succeed with a larger training
>>
File: 1735064391276724.png (1.04 MB, 824x1264)
1.04 MB
1.04 MB PNG
replace the man in blue with Hatsune Miku in the same pose.

neat
>>
File: radiance.png (3.23 MB, 848x1488)
3.23 MB
3.23 MB PNG
>>106812450
i even think i saw miku doing this before
>>
File: 1729264457513242.png (1.19 MB, 1280x816)
1.19 MB
1.19 MB PNG
replace the man in image1 with the man in image2. keep the lion on the right the same.

not quite but still neat
>>
File: 1351329179938.png (37 KB, 331x224)
37 KB
37 KB PNG
Do the image captions for lora training need to include the depth of field/rule of thirds/symmetry or can I cut it from the prompt? Seems like a waste of tokens.
>>
>non-locals need 51-50
>non-locals need checking up on this week
>check up on all your non-local frens ;3
"Content Restricted!
A resource you selected does not allow the generation of content rated above PG level. If you attempt to generate sexualized content with this resource the image will not be returned, but you will be charged."
>>
>>106812544
The image caption needs to be tailored to how you expect the end user to want to manipulate the lora. It depends on what the lora does, how you expect people to prompt for it, what model you're using, etc. There is no one answer, only what levers you think the audience should have access to and which they shouldn't.
>>
baking
>>
>>106812600
>>106812600
>>106812600
>>106812600
>>
>>106810930
You have nothing to offer me, for I have a lover and she is Truth.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.