[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


Janitor applications are now being accepted. Click here to apply.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106542135

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://github.com/Wan-Video
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>>106545184
How is someone who commissions porn off of random artists considered normal? That is highly abnormal behavior.
>>
sex with jenny
>>
>>106545210
this. normalfags dig for shit that already exists
>>
File: NeoForge.png (3 MB, 1328x1535)
3 MB
3 MB PNG
>>106545199
THIS IS NOT A DRILL!!
NeoForge SUPPORTS QWEN!
https://github.com/Haoming02/sd-webui-forge-classic/tree/qwen

>NeoForge compatible models:
SDXL
Flux
Chroma
Qwen
WAN

>But, Neo Forge stil doesn't support:
API services
Telemetry
Nvidia sponsorship
24/7 Chinese support
>>
>>106545210
most of the retards here are so completely disillusioned from reality it's absurd
>>
File: 1729408844510706.png (324 KB, 638x747)
324 KB
324 KB PNG
>>106545242
>Telemetry
>>
>>106545242
WHAT!? Are you telling me that a 100% local UI, not sponsored by Nvidia or any Chinese company, but solely by an anime fan, is creating a UI compatible with all SOTA models?
>>
>>106545094
Forge is so convenient I am baffled as to why it isn't maintained. So many QoLs by using a GUI.
Neoforge seems to be working at least. How is the extension capabilities, compatible with normal forge extensions?
>>
File: Jenny Seat Expansion.webm (3.92 MB, 1280x720)
3.92 MB
3.92 MB WEBM
I like how creative video generation can be sometimes. The strange reactions always make giggle.

>>106545229
Wouldn't that be great?
>>
30% of men each generation don't reproduce. That's normal. Or is it? 25% of Gen Z are virgins. That's normal. Or is it? Maybe it's normal just for Gen Z

The problem with normal vs abnormal is the same problem as the Ship of Theseus, just like how the argument hinges on the definition on what a "new ship" is and once you define it there's nothing to discuss, if we define what "normal" is then there's nothing to discuss


The actual question is whether paying for porn/sexual gratification or not is "normal", which can be argued endlessly. Normal people go on dates and pay for women's dinners. Normal people pay for sex. Normal people spend money to increase the power of their orgasms. Or do they?
>>
File: NeoForge2.png (1.68 MB, 1328x1328)
1.68 MB
1.68 MB PNG
>>106545254
>>106545242
>>106545269

Exactly, NeoForge can't support API services and big company sponsors.

NeoForge is a local UI supported by a single dev who lives off other work and doesn't make a living from his UI.

For now NeoForge only supports local models, unfortunately NeoForge can't connect to the internet from its UI nor attach email accounts or credit cards, please excuse this.

But for now we are supporting all %99 local models!
>>
>>106545298
buy an ad
>>
AUGH
>>
switched to resmultistep with wan on a whim and photoreal gens look so much better. cant believe the noise was just euler
>>
>most of the retards here are so completely disillusioned from reality it's absurd
I think another aspect of it is the fact that most of "reality" is not considered during discussions because the people discussing the "reality" are not interested in engaging with it

You see this all the time in stuff like incelism where they say "all women have too high standards" and they ignore all the women that are in their league because the incel doesn't even consider a woman below 5/10 a woman at all and therefore is not part of the "all women" he's talking about
>>
is qwen even good for anything?
>>
File: 1746285669109365.jpg (967 KB, 2839x1333)
967 KB
967 KB JPG
Babe wake up, Tencent has finetuned Flux dev and made it more realistic
https://xcancel.com/bdsqlsz/status/1965695387676946720#m
https://huggingface.co/tencent/SRPO
>>
>>106545303
Why don't you tell that to your Node Overlord? Dear Chinese employee #654253
>>
>>106545321
did... chroma just get BTFO?
>>
>>106545321
Is it nsfw?
>>
>>106545321
Too many models lately
Something is wrong
>>
File: Jenny Bath Time.webm (3.92 MB, 1280x720)
3.92 MB
3.92 MB WEBM
>>106545321
>47.6GB
*cries in 4090*
>>
>>106545321
Yeah! More realism baby!!!! I want to feel that im paying taxes!!! Tired of fantasy cope, give me that soul crushing bureaucracy immersion.

Feminist woman Lora when?
>>
>>106545351
it's just a finetune of flux dev anon
>>
File: file.png (140 KB, 1414x469)
140 KB
140 KB PNG
>>106545321
>realsitc
>>
>>106545351
its just flux dev, wait for a quant
>>
>>106545351
Will be perfectly usable with Q8
>>
File: 1750219490259135.png (1.52 MB, 1508x734)
1.52 MB
1.52 MB PNG
>>106545352
>Yeah! More realism baby!!!!
>Tired of fantasy cope
nuh uh
>>
>>106545355
>flux dev
Is that the one that can't draw women? Was the point of this tune to unfuck it?
>>
>>106545321
looking at the examples, i think this is how they got SeeDream 4.0 to not be slopped. very cool
>>
>>106545371
>i think this is how they got SeeDream 4.0 to not be slopped
can you elaborate on that? looks interesting
>>
>>106545365
>not anime
based
>>
>>106545375
im just skimming the paper currently, but you can see the progression as they do direct comparisons to flux and different reward methods
>>
>>106545344
The guys at the top must be baking some crazy shit that even the trash we get is at this level. We may be close to some major milestone
>>
File: Screenshot_216.png (3.46 MB, 2131x844)
3.46 MB
3.46 MB PNG
>>106545375
>>106545380
example
>>
File: 1727623905745384.png (486 KB, 1722x1340)
486 KB
486 KB PNG
>>106545321
please god, tell me tencent has also trained on boobs and vagene on that one
>>
>>106545385
oof, DanceGRPO seems like to be the best method to sloppify a model lol
>>
>>106545321
so is it undistilled now?
>>
File: 1741183002745550.png (170 KB, 1718x1307)
170 KB
170 KB PNG
>>106545406
>so is it undistilled now?
>guidance scale 3.5
maybe?
>>
>>106545417
>512
Are those max tokens? No way it's that little.
>>
File: 1735260536875377.png (15 KB, 651x112)
15 KB
15 KB PNG
>>106545321
https://huggingface.co/tencent/SRPO
what the fuck, can they really stick their own licence on top of the flux's licence?
>>
>>106545321
ok yeah judging by the examples this is pretty incredible. i need quants for this asap
>>
>>106545428
t5xxl was always 512 tokens max, desu that's plenty of words, I never reach that threshold
>>
>>106545439
Ok, but if you are already doing surgery on a model why not slap a more modern TE on it?
>>
>>106545321
why would tencent release something like that? it's way better than their HunyuanImage slop lmao
>>
>>106545321
im a newfagretardmoron, do i just drag this to the flux folder and use it like other flux checkpoints?
>>
>>106545451
I think so, it's just a finetune
>>
>>106545445
I guess they just wanted to show their new SPRO method, if they added more shit into it we wouldn't be sure that the improvement would be from SPRO or the new text encoder
>>
>>106545321
https://tencent.github.io/srpo-project-page/
>This approach enables online adjustment of rewards in response to positive and negative prompt augmentation
those mf really undistilled Flux dev, let's fucking gooooooo
>>
>>106545321
So this is gonna work with all existing flux loras day 1?
>>
File: file.png (1.71 MB, 1024x1024)
1.71 MB
1.71 MB PNG
cat
>>
>>106545321
>>106545390
KREA BTFO
>>
>>106545482
@furk
Can we get a test on this?
>>
Wow uhh ... so about that chroma base model ...
>>
>>106545495
Still better for porn unless the chinks uncensored it completely.
>>
>>106545473
>Faster Training. By rolling out only a single image and optimizing directly with analytical gradients—a key distinction from GRPO—our method achieves significant performance improvements for FLUX.1.dev in under 10 minutes of training. To further accelerate the process, our method supports replacing online rollouts entirely with a small dataset of real images; we find that fewer than 1500 images are sufficient to effectively train FLUX.1.dev.
imagine they managed to beat chroma with just 1500 images, if I were on lodestone's shoes I would kms lol
>>
>>106545504
I wonder if you could unfuck chroma with this method.
>>
>>106545504
isn't this big for finetuners? could it be that tencent saved local?
>>
>>106545510
Can we just... let chroma go already?
>>
>>106545510
probably, chroma had absolutely zero reward alignment
>>
anybody tried it yet?
>>
File: file.jpg (667 KB, 3180x1704)
667 KB
667 KB JPG
>>106545321
that looks great desu
>>
That's what they get for not including anime
>>
>>106545523
waiting on a gguf
>>
File: Tencent vs Lodestone.png (184 KB, 621x351)
184 KB
184 KB PNG
>>106545504
>train smarter not harder
thanks for reminding us this Tencent Sama!
>>
>>106545527
they just wanted to humiliate the silly efforts of the local community after we made fun of hunyuan
>>
File: 1730480356315799.png (1.73 MB, 1668x1736)
1.73 MB
1.73 MB PNG
>>106545321
this is an excellent example, this method shows that it won't overtrain if it sees the same image too often (like those known paintings)
>>
>>106545482
it sounds like a major finetune so the usual answer would be "no, only some"
>>
File: Sorry :(.gif (2.55 MB, 379x213)
2.55 MB
2.55 MB GIF
>>106545321
I apologize Tencent, I spent so much time making fun of you I actually wasn't familiar with your game
>>
File: ComfyUI_00644_.png (1.77 MB, 768x1536)
1.77 MB
1.77 MB PNG
>>106545527
nta, Chroma version of the man
>>
>>106545242
can qwen inpaint with this?

Or am I better off learning to inpaint with comfy.

Using edit fucking sucks and it fucks up all the time
>>
>>106545574
can you go for a 1:1 image instead
>>
>>106545574
lmfao chroma got mogged
>>
>Several large models get released today
>/ldg/ : \

>Another fine tune of flux gets released
>/ldg/ :O

You people are simple.
>>
File: Lodestone right now.png (168 KB, 680x328)
168 KB
168 KB PNG
>>106545527
>>106545574
>>106545582
>>
>>106545598
all he cares for is the furry shit so in his mind choma is superior
>>
With neoforge, should I use --cuda-malloc? Does it change anything with quality or just optimization?
>>
>>106545566
>our method achieves significant performance improvements for FLUX.1.dev in under 10 minutes of training.

I mean, how much training did they actually do?
>>
>>106545587
>garbage slop releases
>general reacts appropriately
>new research to de-slop models releases (with open source model as proof)
>general reacts appropriately
>>
>>106545587
more like
>slop model released
:[
>finetune that removes the slop of Flux dev
:D
>>
>>106545269
because ai has exhausted open source devs due to the pace features and models come out. Comfyui is not a good product, but until ai stops progressing (unlikely to ever happen) quick deployments drain away the userbase of enthusiasts killing all competition.
>>
File: ComfyUI_00646_.png (2.13 MB, 1152x1152)
2.13 MB
2.13 MB PNG
>>106545579
Sure

>>106545582
Your gen matches your personality.

Chroma isn't picking up the grain at all
>>
>>106545626
that looks so plastic, Tencent has won
>>
>>106545587
what if we used this SPRO thing to unslop all those several large models that got released today?
>>
>>106545587
>Several large models
several? there was just HunyuanImage that got released no?
>>
>>106545642
sir, your 8xH100 cluster?
>>
>>106545602
probably is for booru content too? surely tencent didn't manage to train major *booru in 1500 images?

if anything this finetune method -if it works- sounds like something that could polish booru finetunes or whatever else adds content..
>>
>>106545647
they unslopped flux with only 1500 images, and that took them 10 mn, if you rent 8xH100 for 10 mn it'll only cost like 2 dollars?
>>
>>106545626
It's crazy how chroma fucking lost all semblance of being good in the last few epochs
>>
prompt some nsfw to see how bad it is
>>
File: ComfyUI_00647_.png (1.99 MB, 1152x1152)
1.99 MB
1.99 MB PNG
>>106545626
Oops that was only 20 steps. Here's 26

>>106545671
>It's crazy how chroma fucking lost all semblance of being good in the last few epochs
For me, it got more and more sharp, but I do 1280p gens

In other news, radiance stopped getting updates 12hrs ago.. maybe a release of that soon?
>>
>>106545671
What's crazy is how long people kicked and screamed holding on to hope despite it being dead.
>>
>>106545676
>prompt some nsfw
i tried, its almost fucking impossible to do nsfw. this shit sucks
>>
File: 1753014146916908.png (1.66 MB, 2690x605)
1.66 MB
1.66 MB PNG
>>106545321
https://arxiv.org/pdf/2509.06942
really impressive, it's like we went from ps2 to ps5, it doesn't look AI anymore
>>
>>106545690
proofs?
>>
>>106545321
I never thought a chink company was interested on unslopping their models (since their models are so slopped in the first place), color me surprised, for real
>>
>>106545321
>wen GGUF
you can make your own with this lol
https://github.com/lum3on/ComfyUI-ModelQuantizer
>>
Is it possible it is a "reverse qwen" and it is so hardbaked into realism it can't do anything else?
>>
>>106545697
i tried a bunch of times, even saying the word penis like 5 times and they kept showing up in underwear. i deleted it already, so disappointed.
>>
>>106545690
yeah titties are alright but haven't had much luck with anything else
>>
>>106545742
all they did was do new RL to flux what did you expect. the model has no concept of genitals
>>
>>106545748
>what did you expect
futa lolis
>>
>>106545671 >>106545689
you see more realistic patterns of analog image defects from vintage cameras (film grain) on sfw portraits

you then declare that a nsfw/booru/furry booru focused model (almost the entire training data was that) is bad and dead. the same technique can probably even be applied to the nsfw/booru/furry booru focused model too, although that remains to be seen.

how is that making any sense?
>>
File: ComfyUI_00649_.png (2.1 MB, 1152x1152)
2.1 MB
2.1 MB PNG
>>106545689
>holding on to hope
This angered anon
>>
>>106545748
futa oneeshota
>>
>>106545748
>what did you expect
dunno, but not this dogshit.
>>
The only good outcome from this is lodestone applies the same methods to chroma to make it not shit
>>
If you merged it with chroma would it inherit the porn knowledge or are the architectures too far removed?
>>
>>106545786
cant wait for the first true flux shitmix
>>
>>106545786
It's not a flux schnell (chroma is using that, with one pretty silly part trimmed down to a more reasonable alternative) but a flux dev derivative, right? I really don't think you can just merge them but feel free to try.

I'm guessing if anything it makes sense to try to finetune Chroma with the published method instead.
>>
Good morning vramlets
>>
>>106545321
>remarkable training efficiency-converging in just 10 minutes using 32 NVIDIA H20 GPUs
does this mean that we can unslop any models ourselves?
this amount of compute should be doable locally, shouldn't take more than a month on consumer gpus
>>
>>106545827
Start finetuning chroma for us
>>
>>106545691
offline-HPSbros, we won.
Now... What is offline-HPS, again?
>>
>>106545842
online is the good one
read the paper
>>
>>106545830
>shouldn't take more than a month on consumer gpus
no reason to not rent unless you have free electricity
>>
>>106545832
From lurking in these threads i made an impression chroma is kinda not oke
>>
>>106545854
chroma is just a mediocre base model with zero RL
but thankfully a new technique was developed to avoid the slopped look that was derived from RL
>>106545321
>>
>>106545854
Think of it as a fancy and horrifically slow SD 1.5.
>>
>>106545854
It's not the best, but it's the most creative and most depraved. Most of the doomers are butthurt schizos.
>>
>>106545854
should be fine after an aesthetic finetune
>>
>beggars out in full force already
>>
>>106545887
we are all beggars in this hobby
>>
>>106545748
>what did you expect
that they used 1500 images of porn, DUH
>>
i didn't realize how impressive SRPO is until i looked at the full sized pictures
>>
>>106545898
Ikr, I showed this to my brother and he asked me why I showed him a real image kek
>>
>>106545898
>>
>>106545898
it's honestly a revolution, the shackles binding us to Slop have been cut
>>
>>106545827
>Good morning vramlets
take qwen image and apply the SPRO technique, save local anon
>>
>>106545912
>>
>>106545919
chroma has the nsfw knowledge doe...
>>
>>106545919
Can you even unfuck qwen with how baked in the slop data is?
>>
>>106545871
>but it's the most creative
out of the 50, which one?
>>
>>106545940
qwen image seems less slopped than flux dev though?
>>
>>106545955
can qwen image do 1girl 1boy SEX
>>
>>106545927
on some images it still has some graininess, washed out colors, and the usual problem of older, smaller, low iq image models of mangling some harder to do details up, but given this is just a quick flux finetune, its incredibly good, using this with whatever would be the next gen model would be prod ready for a lot of things
>>
>>106545947
If you aren't autistic enough to play with the autism experimental versions, then the Base or HD
>>106545955
The slop is just polished into technical precision. I never used old flux. Did it also have zero seed variety like qwen?
>>
File: 1752062313327643.png (320 KB, 2396x908)
320 KB
320 KB PNG
>>106545842
>Now... What is offline-HPS, again?
>>
>>106545691
seems like going from offline to online made the difference, they struck gold with that, I wonder why they didn't try "online" on PickScore and CLIP aswell?
>>
>>106545968
>autism experimental versions
talking about the detail calibrated versions?
>If you aren't autistic enough
i'm so goddamn bored that i might download all of them and test a single seed in each one.
>>
>>106545993
>i'm so goddamn bored that i might download all of them and test a single seed in each one.
Are you sure about that?
https://huggingface.co/silveroxides/Chroma-Misc-Models/tree/main
>>
>>106545365
Now say "Greg Rutkowski"
>>
>>106545968
old flux? a lot of samepose, sameface for a whole lot of prompts, yes
>>
>>106545487
It's a real cat!
>>
>>106545321
>Gigachad Tencent "I unslop flux with only 1500 images" vs Virgin Lodestone "I spent 150k dollars fucking up Flux with 5 millions of images"
>>
>>106546004
damn, my autism...waivers...
>>
File: Tencent saved local.png (998 KB, 1108x1151)
998 KB
998 KB PNG
>>106545321
sovl!
>>
What do you guys use to stitch videos together? Just started with Wan, I can merge it with an external program but I wanted to know if I can do it within Comfy.
>>
>>106545854
The shitposting it attracts is honestly kind of baffling. No other model is quite like it and it's truly uncensored. It's demanding and the prompting takes some getting used to but it's a lot of fun once you get the hang of it.
>>
>>106545321
now the question is, why didn't Tencent use this SPRO method on their own fucking HunyuanImage model???
>>
>>106546067
>muhh 3 arms and 7 fingers coom
you lost Lodestone
>>
>All those realistic models all coming out at once
>Not a single new good anime model with artstyle and characters baked in
Why must we suffer, anime bros...
>>
>>106546065
kdenlive is a video editing software 5hat will help you take your videos the rhe next level
>>
>>106546073
Maybe the dual model setup or the refiner vae caused issues?
>>
>>106546081
just $25
>>
>>106546056
>AI can't make kin-
>>
>>106546081
Isn't anime kind of a solved problem?
>>
>>106546084
>kdenlive
thanks, ill take a look. im currently using avidemux. maybe there's a workflow that can do them but i'm not a comfywizard.
>>
>>106546103
1girl is. Anime isn't.
>>
>>106546056
what if Midjourney knew about SPRO all that time, maybe that's the method that gave them that real feel and made them unique compared to their competitors?
>>
>>106545321
Has anyone managed to run it? I'd like to see more examples
>>
File: 1743067905086835.png (3.59 MB, 1248x1824)
3.59 MB
3.59 MB PNG
>>106545854
I've been using chroma for several months now, and i'm yet to find a better mix of medium knowledge and creativity. Qwen is too slopped in comparison, despite being too detailed and requiring minimal tard wrangling.
>>
File: chroma.png (2.33 MB, 832x1488)
2.33 MB
2.33 MB PNG
>>106546081
yea, most clearly don't get to tune everything in depth yet
>>
>>106546118
it looks like shit though, that's the fucking problem
>>
>>106546135
because it hasn't had... wait for it... reinforcement learning!
>>
We've got the whole world in our hands,
We've got the whole wide world in our hands!
We've got the whole world in our hands,
When we have Chinese AI deepfake spyware local models in our hands!
>>
File: file.png (2.41 MB, 832x1488)
2.41 MB
2.41 MB PNG
>>106546118
won't really change with hunyuanimage either.

the issue is that they weren't broadly nsfw/questionable *booru/furry booru/porn/[...] finetuned
>>
File: 1738484199767688.jpg (1.48 MB, 2272x1552)
1.48 MB
1.48 MB JPG
>>106546081
Anime models are in a good place as long as you don't stick to WAIshit.
>>
>>106546154
catbox?
>>
File: 1755186455968333.jpg (628 KB, 3177x911)
628 KB
628 KB JPG
>>106546081
this method improves anime too
>>
File: file.jpg (113 KB, 899x336)
113 KB
113 KB JPG
>>106545199
>https://rentry.org/wan22ldgguide
I followed this guide and used the Kijai fast workflow at the bottom, can anyone explain what this means by "connect this" on a note? Guide doesn't talk about it.
>>
File: chroma.png (1.77 MB, 832x1488)
1.77 MB
1.77 MB PNG
>>106546164
i really hope this is more broadly applicable.
>>
>>106546154
yeah if you do not have eyes
>>
>>106546065
you can do it in comfy, i like to use shotcut for quick editing or adding audio. davinci resolve is good too and easy to pirate if you want to really get into it
>>
File: file.png (2.44 MB, 832x1488)
2.44 MB
2.44 MB PNG
>>106546167
pretty sure it means torch_compile_args from the output to another node's input. it looks like it is connected tho.
>>
>>106546193
Ah alright thanks.
>>
>>106545199
Is anyone else having the issue with wan2x that the last frame in the video is darker than the end frame you set as conditioning image
>>
>>106546183
thanks for the recs, i mostly just want to stitch them for now while i mess with wan but ill keep those programs in mind if i want to go ham on it.

>you can do it in comfy
do you have a workflow for it that you can share?
>>
>>106545316
Like how woman don’t see me for not being top tier
>>
>>106546218
>>106546183
https://civitai.com/models/1818841/wan-22-workflow-t2v-i2v-t2i-kijai-wrapper
>>
>>106546162
He is the Neta Shiller, he gens and heavily inpaints his pics, claims he made them with Neta Lumina, but refuses to share workflows or installation instructions.
>>
>>106546164
Are these your prompts or theirs?
>>
>>106546264
theirs
https://tencent.github.io/srpo-project-page/
>>
>>106545321
Let me guess it still doesn't know more concepts and is just as limited as flux?
>>
>>106546065
i just use ffmpeg with batch scripts written by chat gpt
>>
>>106546118
It's the only model I use at this point for the same reasons. Wish it was a little faster, but nothing else really comes close in terms of prompt adherence/creativity. Looking forward to the qwen2.5 TE being integrated as well.
>>
>>106546272
its quite literally the same model except for much better aesthetics
>>
>>106546270
What's the name of the model? SPRO?
>>
>>106546288
yes >>106545321
>>
File: crache.png (12 KB, 349x217)
12 KB
12 KB PNG
>>106546279
>Wish it was a little faster
Use this when you are just spamming for good seed.
>>
>>106546167
it's already connected
>>
File: 🩺💉🎀.png (2.45 MB, 1344x1728)
2.45 MB
2.45 MB PNG
why didn't they use srpo on hunyuan image?
it would make it a kino nsfw model
>>
50GB model? I have to buy new SSD...
>>
>>106546303
maybe making HunyuanImage took some time, and they discovered SPRO later, but since you can get this kino result with only 1500 images, maybe we can do it by ourselves
>>
>>106546297
I keep forgetting to install that.
>>
>>106546297
what is this
>>
the SPRO is based on what? Flux? SD? Or it's entirely their own thing?
>>
>>106546318
Makes it go fast in exchange for more scuffed output. So turn it off if you want to rerun a good seed properly. You can also set the top value to 0 if you aren't using samplers that need time at the beginning.
>>
>>106546337
Flux
>>
>>106546337
read the paper retard
>>
>>106546337
it's a training method, and they used it to finetune flux and unslop it
>>
>>106546349
read?
>>
>>106546337
read the model page retard
>>
>>106546349
no thanks, I am not a homosexual
>>
File: hyimage.jpg (296 KB, 2048x2048)
296 KB
296 KB JPG
>>106546337
fluxdev finetune with new method that (mainly? exclusively? idk) seems to quickly train aesthetics
>>
>>106546318
it reads from other comfyui instances that have the node and checks to see if they have a similar prompt to deliver the similar latents to speed up generation. it's why it doesn't work without internet.
it's like p2p but images
>>
retard
>>
File: 1757511075042.jpg (97 KB, 550x535)
97 KB
97 KB JPG
>>106546349
>>
File: eTdhpfCBfxU.jpg (512 KB, 1179x1162)
512 KB
512 KB JPG
Is there an image loader node that automatically pulls the prompt from metadata into a text window?
>>
>>106546450
five seconds in google
https://github.com/receyuki/comfyui-prompt-reader-node
>>
>>106545445
You do realize each token has quadratic attention requirements right?
>>
>>106542948
Ahahaha that’s so fucking great, we truly are in a new age lads. Elastigirls voice is damn spot on.
>>
File: I'm waiting.png (727 KB, 1113x629)
727 KB
727 KB PNG
>>106545321
>wen GGUF?
>>
>>106546550
>>106545724
>>
>>106546554
I don't have enough RAM to make my own :(
>>
Hey /g/uise.
I'm having issues with upscaling images. I might be retarded but I can't get a good upscale no matter the workflow or models I use.
Any recs?
>>
Fresh install of neoforge and my noobai checkpoints come out like this. Huh?
>>
>>106545242
Finally! I can be free of nodes (assuming it all works right)
>>
>>106546564
how much vram/ram u got
>>
>>106546607
24gb vram + 64gb ram
>>
>>106546626
should be enough to just run the fp32 then. i've been running it fine on 24gb vram and rams only hitting 23gb-25gb.
>>
>>106546573
Do you have your own workflow or are you using some premade template?
>>
>>106546584
skill issue, as expected of someone who uses a name on a hungarian pipe market
>>
>>106545330
ani is unironically doing more for moving forward. this chink stagnates on deprecated software
>>
>>106546669
>hungarian pipe market
havent seen this one before is it an original
>>
File: ComfyUI_00657_.png (2.52 MB, 1024x1536)
2.52 MB
2.52 MB PNG
>>
>>106546647
oh nvm ram peaks up to 33gb
>>
>>106546659
I've been following tutorials and been using the workflows there, while trying with some models/controlnet/loras I like.
>>
>>106546691
> 1 more experiment
>>
>tfw antchromaschizo was right
>>
>>106546692
how are the results? can you show some of your outputs
>>
>>106546691
>no vae
>still looks like sd 1.5
woaw
>>
>>106546723
I think he just used Qwen Image to make fun of lodestone
>>
File: 1749916924309847.jpg (131 KB, 483x448)
131 KB
131 KB JPG
i think it's time for chang to sell the local models. the free locals are dogshit, and the chinks are really lazy. i don't mind buying something, like Kling or Hailuo. we'll be waiting years for a good local at this point...
>>
>>106546757
>the free locals are dogshit, and the chinks are really lazy.
not the good day to say something like that anon lol >>106545321
>>
>>106546757
Would you accept kling or hailuo if it meant the models came with irremovable censorship?
>>
>>106546720
>how are the results?
a slightly better flux i guess, still has the issues flux always had.

>can you show some of your outputs
nothing really great to share desu
>>
>>106546669
It's been like a year since I had to reinstall shit. I don't recall having to do anything extra for a particular checkpoint.
>>
>>106546799
oh so you were just trolling, got it
>>
if you dont do as i say, you are trolling.
>>
>Get scraper software
How can these artist realistically stop me when I have a cyber penis that can astroproject through barriers into the artssuy?
>muh water mark
I can fuck that unprotected too
>>
> Moving model(s) has taken 134.31 seconds

death to all forge derivatives
memory management is the DEVIL

haoming PLEASE
>>
>>106546842
haoming should have used reForge as a base for neo.
forge is a dead end. he will go insane before he fixes anything of value or makes it even halfway usable. neo forge is doomed.
>>
>>106546814
you could just do your own...
>>
>>106545242
>>106545298
Ummmmm
>>106545321
I think some people are trying to memory hole this important news by giving attention to a new mediocre model..Nobody is going to fall for this distraction, the timing is too convenient...
>>
>>106546757
It's gonna be long ride buddy, strap in because it's never gonna end until AGI happens or wars and other distractions suck the money away. It's also worth noting qwen image at q4, a 12gb model can do basically everything chatgpt image gen had at the start of this year. Stop being such a baby.

the price difference is too vast tho. It costs ~30 cents to generate 2 minutes of wan video on my 5090 on expensive east coast electricity. Generating that on the cloud would be 72 fucking dollars and it's censored to boot.

If they ever get the price and speed at reasonable level, sure, maybe it will be a cool alternative, but as it is fuck that.
>>
>>106546842
>>106546872
Fun Fact: ReForge 2 it's build upon NeoForge.
How can he be doomed if he today he implemented Qwen Image?
>>
How do I into video generation? Do I have to follow the Wan 2.2 guide in OP? Or can I do it any other way? I know ComfyUI docs had a tutorial on video gen but I don't know how reliable it is
>>
File: AnimateDiff_00316.mp4 (2.07 MB, 1104x624)
2.07 MB
2.07 MB MP4
>>106546550
>GGUF NOW
>>
>>106546893
there is no reforge 2 unless you mean the "new" reforge that was killed not even 3 days later and also came out before neo forge.
also you would still be wrong because it was just based on reforge.

it's doomed because it is completely unusable. it takes nearly two minutes inbetween gens for the memory management to do whatever the fuck it's doing.
and the larger the model the more busted it becomes.
>>
File: qwenedit_00075_.png (2.13 MB, 1712x1216)
2.13 MB
2.13 MB PNG
>>106546831
>watermark
qwen image edit
>adversarial noise
qwen image edit

problem, artists?
>>
>>106546916
>pircel
kek, too bad the table didn't broke though
>>
>>106546915
>do i have to read the guide that tells me how to do what i'm asking about? tell me right now for i cannot choose the obvious answer without someone holding my hand
>>
>>106546937
correct
>>
>>106546919
just wait 2 weeks retard, even with the delay its still faster than learning nodes
>>
>>106545321
Someone please check if flux is finally uncucked.
Technically it cost them like less than 20 bucks to do this? Maybe jailbraking their flux finetune would be just that easy? Like rewarding it with non mutant peepee and vagoo.
Also, couldn't this method be used to train loras to stop before overfitting? Doesn't seem like a complicated implementation.
>>
>>106546968
But chroma?
>>
>>106546919
Memory management is on the to do list as far as I can see on the github.
>>
>>106546926
Basado
>>
>>106546926
fuck them up the ass no lube
>>
>>106546926
>>adversarial noise
Am I wrong or does this only make the image look worse for the viewer while not affecting training
>>
File: 1726082020948997.png (3.74 MB, 2702x1197)
3.74 MB
3.74 MB PNG
>>106546968
>Also, couldn't this method be used to train loras to stop before overfitting?
it does, like it shows that it's not overfitting concepts that are too frequent in the dataset like some known paintings and shit
>>
>>106547045
I dunno, they're just completely deranged. Maybe they hide their un-noised art on their patreon or some shit lmao
>>
>>106547045
Yeah but artists are artists for a reason, critical thinking is not in their set of tools
>>
>>106546954
> learning nodes
my guy it's not that hard. you take a thingy and plug it into another thingy. i've made dozens of working wf's and i am a certified retard.

>>106546987
right, he added that recently. i really genuinely hope he doesn't go insane trying to fix whatever cancerous shit is going on with it.
i'm not one of the nutcases here that are all like "reee comfyui bad" but some things are just so much faster and cleaner in forge.
>>
>complaining about visual scripting
Holy lel
>>
>>106547045
I'm going to let you in on a secret:
all these claimed "adversarial noise" and "ai poison" tech are all snake oil. Everyone knows it's snake oil. They are designed to appease governments and artists who are completely clueless about how AI works.

It's a game of pretend where only the above mentioned are clueless.
Microsoft has a "ai posion" tech that literally does not work at all.
>>
File: Chroma_00002_.jpg (460 KB, 1248x1824)
460 KB
460 KB JPG
>>
File: file.png (1.03 MB, 653x847)
1.03 MB
1.03 MB PNG
let me use forge with chroma
you have to save us all haoming-chan
>>
>>106547117
>Microsoft has a "ai posion" tech that literally does not work at all.
it does something though, it destroys the drawing quality, so it makes the artist look like he doesn't know how to draw, and we're training on those shitty images too so...
>>
>>106547133
what were your training settings and tag approach to making that lora?
>>
>>106547136
https://github.com/maybleMyers/chromaforge
>>
>>106547144
just another ploy to destroy everything nice baka.

>>106547166
same issue, been using that since it released. the mem bugs are less pronounced there though.
>>
File: Chroma_00003_.jpg (435 KB, 1248x1824)
435 KB
435 KB JPG
>>106547151
tags: natural language + booru tags. This image is just with booru tags compared to previous. adamw8bit 2 batch 1 step. 0.0003 LR, 100 epoch / saved every 5 after 50th epoch and tested which works the best. Dataset cropped and cleaned manually
>>
>>106547091
Honestly at this point not like comfy is any better at memory management so if he can at least slightly more manageable will be more than enough for me to switch
>>
>>106545321
https://youtu.be/256_QUGhEj8?t=733
he explains why we got those slopped looks on recent models, before SRPO they were using GRPO and the reward system was flawed, like the model was hacking this shit to get more rewards by saturating the color more, and you had that overbright plastic look to it
>>
>>106547192
well comfyui doesn't need to needlessly un/reload the model between every single goddamn generation. it is absolutely shit at mem management too though.
sure would be cool if unloading the vram and sys ram actually worked.
>>
>>106547192
memory management is a lot better in the new comfy and the unload models and node cache buttons actually work now
>>
is the nu comfy node look thing in main yet i want the upgraded look already
>>
>>106547237
you just put in the checkout for the frontend but it's very shit. I just went back to legacy and I save like 300mb of RAM
>>
>>106547229
Which update was that, the newest one? Last update I did was a week ago and had my computer hard hanged at least 5 times since then lol.
>>
>>106547256
>I just went back to legacy and I save like 300mb of RAM
grim. tanks anon
>>
File: 00010-2720728034.png (2.43 MB, 1248x1758)
2.43 MB
2.43 MB PNG
>>
>>106547275
what was your prompt for that?
>>
why is everything that's added to comfy just horrid, unstable and wasteful jeeted slop?
>>
Wait, why is comfy using my RAM for the wan 2.2 f16 models when I have 32gb vram?
>>
>>106547320
convolutions are probably forced to cpu even though the model is loaded into gpu
>>
>>106547292
masterpiece, sexy fine ass night elf with huge titties goddamn, wearing red cloak with cleavage, black background
>>
>>106547275
>noobslop
>>
>>106547349
i just wanted it to see what this flux shit would spit out i'll try that prompt tho sounds good
>>
HDM has really inspired me to make a tiny model:
I wonder how fast you can train a 500m booru model with all the tricks but also with the Hunyuan 32x VAE
>>
>>106547332
Is there a way to get rid of that?
It froze my pc until render was done.
>>
>>106547320
it has to unload the frames or latents or whatever. also aren't the fp16 models like 30 gigs by themselves?
>>
File: 1526495802456.jpg (25 KB, 462x430)
25 KB
25 KB JPG
>>106547190
>saved every 5 after 50th epoch and tested which works the best
Are you telling me I am not suppossed to just let it run the entire cycle and then just use the result?
>>
File: Jenny Cat-Dance.webm (3.99 MB, 960x1280)
3.99 MB
3.99 MB WEBM
Damn, every native 720p gen was in slo-mo... anyone know where the resolution limit is for "normal speed" with 2.2? 960*xxx seemed to work well, but there's still trouble with eyes at that res.
>>
>>106547385
idk, comfy is a hot mess nowadays, I dunno what setting or flag would do anything but I'd check out the launch flags docs first
>>
>>106547292
its just a basic noobvpred prompt with a few style tags, its more or less this what the other guy said lol, except I use sagging breasts instead of cleavage
>>
>>106547396
>>106547402
27.9gb.

I have a fresh install of comfy, so there's probably stuff I can set to launch.
I've seen some workflows include unload nodes, but I figured that was done natively..
>>
>>106547400
>Are you telling me I am not suppossed to just let it run the entire cycle and then just use the result?
You can totally do it, I like to test and find the one that works for me.
>>
tagui isn't working on fedora what are some good alternatives
>>
>>106547438
it's normal for the ram to fill up especially when genning at 720p
>>
File: IMG_2966.jpg (132 KB, 1206x522)
132 KB
132 KB JPG
>>106546584
Haoming is updating like auto1111 did, stuff will break, don’t pull. Picrel
>>
>>106546919
>completely unusable
Works fine for 1girl gacha. If you need the newer models just use comfy already lol
>>
>>106547525
>you, the user, should-
Retard alert. Learn what a git branch is. But that's too much for a tranimefag
>>
File: ComfyUI_151671_.jpg (59 KB, 1024x1024)
59 KB
59 KB JPG
>>106547431
probably one of the better ones
>>
>>106547542
>Works fine for 1girl gacha
forge classic and neoforge have no speed difference for 1girl slop with sdxl/pony/illustrious/noob so there is not even a need to "upgrade" if you have forge.
>>
how can i coomgen more creatively
>>
>>106547498
Is there something specific with Fedora that makes it not work ?

I had some problem installing it on Arch at first, but it was solved by installing torch torchvision manually in the venv first and then do pip install -r requirements.txt
>>
>>106547582
free your mind
>>
>>106547629
>>106547629
>>106547629
>>106547629
>>106547629
>>
>>106547551
He’s based though he shits on complainers in his GitHub issues tracker, good for a laugh
>>
>>106546111
It's not just Midjourney, all the industry has secret sauce they never publish.
Kind of sad in a way.
>>
>>106546937
>>106546948
nta but I tried folllowing the guide and got errors left and right. also is the guide uptodate? everything i've tried still doesn't show "pytorch version reads either 2.7.1 or 2.8.0dev" I get pytorch version: 2.8.0+cu128 is that the same as dev?
>>
>>106546926
It removed the halo from this BA character which is not at all what you want and some better prompting would've preserved it. You do lose detail but yeah, it's pretty much all there.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.