/g/ - Technology

Ongoing FUD Campaign Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106593668

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2122326
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
>>106596594
>Ongoing FUD Campaign Edition
which fud? I feel like I'm going crazy for who's fussing who anymore
>>
Comfy hate anchor
>>
Qwen SRPO waiting room:

https://github.com/Tencent-Hunyuan/SRPO

Qwen (and local) will be saved once someone trains SRPO on it, trust the plan
>>
>>106596608
>>106594681
>>
I hate Comfy
>>
>>106596621
how did this not make the OP? it's like one of the trumpet sounds of the saaspocalypse
>>
>>106596648
Because all the OPs are comfy workers
>>
>>106595289

All will submit to localGOD'S BWC
https://files.catbox.moe/bqigvh.mp4
>>
Reminder that Comfy going SaaS is inevitable and they'll force it on us as the standard whether we like it or not.
Get ready
>>
>>106596666
giga satan out here with his dogshit fried workflow looking down at us all
>>
>>106596648
Comfy's gonna push SaaS whether we want it or not. It's happening. They're making it the default for us local users and there's nothing we can do about it except keep using Comfy
>>
>>106596688
Only option is to keep using the old Comfy builds or fork it. Typical corpo bs trying to monetize everything.
>>
>>106596688
cumboi answered the call
>>
Seedream marches on
>>
Anyone else planning to jump ship or are we all just gonna bend over and take it? Heard some new UI called AniStudio or whatever is making progress
>>
>>106596727
we have other options so it's a no brainer.
>>
>>106596727
Already jumped ship to a python script (local) to inference with Bytedance Seedream API
>>
>>106596716
very impressive, the anatomy is on point and the style is good
>>
>>106596716
gm local bro!
>>
>>106596716
can i get a surprise dick in the mouth on this one but don't prompt for who
>>
File: seedream v3-olb.jpg (295 KB, 1024x1024)
>>106596732
seecum is pretty impressive for certain things I think
>>
>>106596731
The maximal safe option
>>
Remember that if you want real local, /sdg/ is the real local experience >>106595919 >>106596084 >>106595051 , they actually discuss diffusion instead of whatever this is
>>
nice try nigboid
>>
what a depressing day
>>
I don't know where to go, why did comfy do this to us?
>>
File: saas marches on.png (117 KB, 1390x493)
this was foretold btw
>>
File: ComfyUI_38232_.jpg (3.38 MB, 1872x2496)
>build on the web
>revel in the web
>the spiders are your friends
good work comfy, stick it to those antisocial freaks! let's connect everyone and everything together!!! into one convenient package!
>>
>>106596774
might as well, fucking cancerous saas shills constantly shitting up a LOCAL general...
>>
Reminder: Forge and SDNext are still alive. Comfy isn't the only game in town unless you bought into the cult.
>>
calm down nigbo, your blood pressure
>>
Comfy's workflow autism was always a trap. Now they’ll monetize the "ease of use" they forced everyone into. Should have stuck to Python scripts.
>>
Comfy looks like a normal human but at Jupiter gravity (2.5G)
>>
>new models are plastic shit
>comfyui sells out to saas
>local hardware stagnant
>chroma flopped
>im still genning noobai on reforge
feels based not buying into the snakeoil garbage. i mean it unironically when i say 2 more years of SDXL. your fotmslop will NEVER be finetuned
>>
comfy going corpo is the same story repeating itself, once the hype and userbase is massive, monetization pressure kills the community vibe.
>>
remove comfyui from the OP or have a shill UI section with invoke and comfy
>>
>>106596879
That's why you need a "stable baseline" like Forge. People underestimate how valuable an abandoned but maintained local UI is.
>>
>>106596727
Hearing whispers AniStudio devs want to keep it MIT and never touch SaaS. might be worth keeping an eye on it before comfy kills our workflows.
>>
>>106596731
that or just go pure python workflow. less shiny, but nobody can SaaSify your scripts.
>>
File: juli.jpg (157 KB, 800x800)
>>106596952
thanks Jewlien i'll stick with gradforgeudio
>>
>>106596669

If Comfy flips the switch to SaaS, 90% of this general is gonna fracture overnight. Some will cling to old Forge, some will fork, and some will migrate to /sdg/
>>
>>106596907
You can make a model from scratch for $1000. HDM 300m was $500, you could do HDM 600m for $1000 and it'll run laps around Illustrious because of the better architecture and text encoder.
>>
>>106596975
>migrate to /sdg/
loony bin no thanks nigbo
>>
>local tool
>starts nudging users toward cloud logins
we’ve seen this play before, saaspocalypse is real, trumpet sounding loud
>>
NeoForge gang checking in, simple UI, fast, no comfy shenanigans, the migration has already started, anons don’t realize it yet
>>
File: Chroma2k-test_00089_.jpg (655 KB, 1248x1824)
>>
File: 1749403570667381.png (1.08 MB, 1080x1005)
>>
The real general is still >>>/g/sdg/

/ldg/ is comfy damage control
/adt/ is 2dschizopilled but at least they are loyal
>>
this level of schizo, it's kind of impressive
>>
>>106597033
It doesn't matter how many times you parrot this, I'm not budging.
>>
comfyui is finished, face it, no one wants to load google account every time to generate a miku meme
>>
>>106597060
sorry but that would breach content guidelines
>>
>>106596907
The moment a project starts "partnering" with corpos it's already over.
>>
Seedream sails on
>>
>>106596952
NeoForge is already doing 90% of what people actually need without the bloat.
>>
>>106596981
And think, the 300m HDM model while not awesome was also *not* bad. So we're talking about degrees of scale from 300m, to 600m to 1.2B to 2B or even 4B. The fact is small models are completely underexplored despite likely being capable and achievable to train with a modest budget.
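As a rough back-of-the-envelope check on those sizes, the snippet below extrapolates linearly from the two figures quoted earlier in the thread ($500 for HDM 300m, $1000 for a 600m run). Linear scaling in parameter count is purely an assumption made for illustration; real training cost also depends on dataset size, resolution, and step count, so treat the output as a naive guess rather than a quote.

```python
# Anchor figures quoted in the thread: (parameters in millions, USD).
# Everything beyond these two points is a naive linear extrapolation.
QUOTED = [(300, 500), (600, 1000)]


def usd_per_million_params(points):
    """Average cost per million parameters across the quoted points."""
    return sum(cost / params for params, cost in points) / len(points)


def linear_estimate(params_millions, points=QUOTED):
    """Assumed-linear cost estimate in USD for a model of the given size."""
    return params_millions * usd_per_million_params(points)


for size in (300, 600, 1200, 2000, 4000):
    print(f"{size}M params: ~${linear_estimate(size):,.0f}")
```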
>>
File: akRgGXl9_700w_0.jpg (52 KB, 700x811)
Is there any chance of A1111/Forge/Neoforge ever coming back to relevance? If so then yes please make Noodleshit a saas and fuck off. Let's return to simple local applications. Like 90% of the shit Comfy offers you don't need anyhow. The only reason for Comfy is new models not the billions of noodle nodes.
>>
>>106597088
but python is bloat?
>>
>>106597088
AniStudio might actually save local if it doesn’t get dev-nuked. But comfybros are coping hard, they think their flowchart legos are immune to monetization. Spoiler: they aren’t.
>>
>>106597090
>as a faggot collectivist zoomer I must use a UI that is "popular"
>>
>trani seething over comfy, attempting to get people to jump ship
>they go to forge/invoke/neoforge instead
AHAHAHAHAHA
>>
>>106597090
neoforge has the best chance of all three, anistudio is the only long term option when it's ready. who honestly wants to be on pyshit forever?
>>
> comfy will never betray me
next update: “please log in with your comfy cloud account to unlock nodes”
Lmao, enjoy your SaaS pipeline slaves.
>>
if a mod banned that anti-comfy poster, i'm willing to bet 90% of this thread would be deleted. the amount of same fagging is insane.
>>
>retards pretending you can't just run the old comfy versions
>>
anons:
>there is no way comfy could make his UI any worse, it has to get better right?
comfy:
>hold my dildo
>>
>>106597130
Anyone who's ever even just looked at github knows you can just fork shit, roll back to old commits and, here's the kicker, CHANGE THE CODE! I know, it's crazy, right? Anyone who fuds this shit (normal comfy, not the cloud version) is a confirmed retard and should not be interacted with
>>
>>106597130
Why? Comfy cloud has nothing to do with local Comfy.
>>
anons:
>there is no way china would betray us
china:
>introducing seedream 4
>>
>>106597104
>I am a retard and I don't get the point
With the current way of things, if the UI is not popular, new models won't be supported because the creators of models only support the most popular UIs.
>>
>>106597156
>someone who isn't me should fork the code and change it
>>
>>106597110
>when it's ready
And there is your problem kiddo
>who honestly wants to be on pyshit forever?
Python is the best option right now to have it multiplatform.
>>
>>106597157
Which makes it an even bigger nothingburger
>>
>>106597165
what the fuck are you talking about? they always do diffusers first
>>
>>106597165
That's not how it works. They release the inference code, which uses the same Diffusers and Transformers libraries as every other model, and you implement the inference within your UI. This isn't rocket science.
>>
>>106597167
1. I never said that
2. People can make local/private forks (shocker, I know!), how do you know I'm not running one?
>>
>>106597125
it's happened a few times. I think the record is 80 posts deleted.
>>
>>106597125
Samefagging is the only thing keeping half the site alive. Why do you think Hiro refuses to implement per-thread poster IDs? If he did you'd realize 90% of the banter and shilling and interracial porn pandering is like 4 jannies and another 2 aspie retards.
>>
>>106597116
then someone will fork it. i hate nocoders so goddamn much
>>
>>106597125
And the seedreem shills
>>
>>106597201
I won't, will you?
>>
>>106597188
from diffusers import DiffusionPipeline
import torch

model_name = "Qwen/Qwen-Image"

# Load the pipeline: bfloat16 on GPU, float32 fallback on CPU
if torch.cuda.is_available():
    torch_dtype = torch.bfloat16
    device = "cuda"
else:
    torch_dtype = torch.float32
    device = "cpu"

pipe = DiffusionPipeline.from_pretrained(model_name, torch_dtype=torch_dtype)
pipe = pipe.to(device)

# Quality suffix appended to the prompt, keyed by prompt language
positive_magic = {
    "en": ", Ultra HD, 4K, cinematic composition.",  # for english prompt
    "zh": ", 超清,4K,电影级构图.",  # for chinese prompt
}

# Generate image
prompt = '''A coffee shop entrance features a chalkboard sign reading "Qwen Coffee $2 per cup," with a neon light beside it displaying "通义千问". Next to it hangs a poster showing a beautiful Chinese woman, and beneath the poster is written "π≈3.1415926-53589793-23846264-33832795-02384197".'''

negative_prompt = " "  # Recommended if you don't use a negative prompt.

# Generate with different aspect ratios: label -> (width, height)
aspect_ratios = {
    "1:1": (1328, 1328),
    "16:9": (1664, 928),
    "9:16": (928, 1664),
    "4:3": (1472, 1104),
    "3:4": (1104, 1472),
    "3:2": (1584, 1056),
    "2:3": (1056, 1584),
}

width, height = aspect_ratios["16:9"]

image = pipe(
    prompt=prompt + positive_magic["en"],
    negative_prompt=negative_prompt,
    width=width,
    height=height,
    num_inference_steps=50,
    true_cfg_scale=4.0,
    # Use the device selected above instead of hard-coding "cuda",
    # so the CPU fallback does not crash here.
    generator=torch.Generator(device=device).manual_seed(42),
).images[0]

image.save("example.png")


Guys how do you turn this into a UI?
>>
>>106597182
>>106597188
No one creates model support for unpopular UIs. And in img and vidgen it's not like in the LLM world where the UI (Sillytavern etc) is completely independent from the model backend (koboldcpp, llamacpp, LM-Studio etc).
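For comparison, the LLM-style split mentioned above (a UI that is independent of the model backend) can be sketched in a few lines: the UI only talks to an interface, and each model backend implements it. This is a minimal illustration, not how any existing UI is built; `GenRequest`, `ImageBackend`, `EchoBackend`, and `render_request` are all hypothetical names.

```python
from dataclasses import dataclass
from typing import Protocol


@dataclass
class GenRequest:
    """Model-agnostic request the UI sends to whatever backend is plugged in."""
    prompt: str
    width: int = 1024
    height: int = 1024
    steps: int = 30
    seed: int = 0


class ImageBackend(Protocol):
    """Hypothetical interface: anything with a name and a generate() method."""
    name: str

    def generate(self, req: GenRequest) -> bytes:
        ...


class EchoBackend:
    """Stand-in backend that returns a text description instead of real pixels."""
    name = "echo"

    def generate(self, req: GenRequest) -> bytes:
        return f"{req.prompt}|{req.width}x{req.height}|{req.steps}|{req.seed}".encode()


def render_request(backend: ImageBackend, req: GenRequest) -> bytes:
    # The UI layer never imports model code; it only calls the interface.
    return backend.generate(req)
```

A real backend would wrap a diffusers pipeline or a vendor script behind the same `generate()` method, so supporting a new model means writing one adapter class rather than touching the UI.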
>>
File: qwenimage_00029_.png (1.66 MB, 1328x1328)
>>
>>106597211
subscribe to Claude and generate a UI
>>
>>106597211
import gradio as gr
>>
>>106597212
>>106597211
python generate.py  --task t2v-A14B --size 1280*720 --ckpt_dir ./Wan2.2-T2V-A14B --offload_model True --convert_model_dtype --prompt "Two anthropomorphic cats in comfy boxing gear and bright gloves fight intensely on a spotlighted stage."


It's impossible to wrap their inference code in a UI.
>>
>>106597215
>how come 4chan keeps losing traffic?
>>
>>106597229
never change /g/
>>
>>106597229
you must be new to this
>>
>>106597199
One thread I used to visit had a resident schizo (many such cases), but because it's on /vg/, there are no IDs. When 4chan died for a few days, the (4+4)chan thread was literally 30% that one guy who sometimes forgot that thread IDs were a thing on that board. It's fucking crazy how starved for attention some people are, shame the (4+4)chan thread died after 4chan came back online
Also fucking kek you can't even mention the other site without tripping a filter, what is this, reddit?
>>
>>106597236
Are you saying it's impossible to wrap their inference code in a UI? This is amateur shit to do.
>>
>>106597237
he can pretty much be a node developer at least
>>
>>106597237
New to developing? You do realize you can just take their generate.py code and wrap it into a basic internal API that keeps the model loaded and then wrap that with a UI. This isn't fucking rocket science. You don't need ComfyUI to do this.
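The approach described above (load the model once, keep it resident, and put a small internal API in front of it) might look roughly like this using only the Python standard library. It is a sketch under stated assumptions: `get_model` is a placeholder that returns a stub instead of real inference code, and the `/generate` route and JSON fields are made up for illustration.

```python
import json
import threading
from functools import lru_cache
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer


@lru_cache(maxsize=1)
def get_model():
    """Load the model once and reuse it for every request.

    Placeholder: a real version would build the pipeline from the vendor's
    inference code here. This stub only echoes its inputs.
    """
    def fake_model(prompt: str, steps: int) -> dict:
        return {"prompt": prompt, "steps": steps, "status": "ok"}
    return fake_model


class GenerateHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        if self.path != "/generate":  # hypothetical route
            self.send_error(404)
            return
        length = int(self.headers.get("Content-Length", 0))
        try:
            params = json.loads(self.rfile.read(length) or b"{}")
            result = get_model()(params["prompt"], int(params.get("steps", 50)))
        except (ValueError, KeyError, TypeError):
            self.send_error(400, "bad request body")
            return
        body = json.dumps(result).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):
        pass  # keep the console quiet


def start_server(port: int = 0) -> ThreadingHTTPServer:
    """Serve on localhost in a background thread; port 0 picks a free port."""
    server = ThreadingHTTPServer(("127.0.0.1", port), GenerateHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server
```

Calling `start_server(8000)` and POSTing `{"prompt": "...", "steps": 20}` to `http://127.0.0.1:8000/generate` would return the stub's JSON; any front end (HTML/JS, Gradio, a CLI) can then sit on top of that endpoint.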
>>
File: ComfyUI_00477_.jpg (329 KB, 1336x1952)
>>
>>106597206
>I won't, will you?
If I need to, I will. I already had to dive into the code to fix an autism that Kijai overlooked. It's not actually that complicated of a codebase. You can understand the non-ML stuff in two hours digging through with Claude reading the files and explaining to you what they are
>inb4 hallucinations
seethe, nocoder
>>
>>106597258
ok
so
why is NO ONE DOING IT
>>
>>106597274
NTA but because we already have plenty of UIs that get the job done?
>>
File: Qwan_00038_.jpg (745 KB, 1984x2976)
Mercyful Fate is so cool.
>>
>>106597274
Because you people are needy whiny assholes so the people who did this already did and didn't make it public so they don't have to support some nigger with 8 GB of RAM whining about GGUF support. Why don't you do it is the real question.
>>
>>106597269
>If I need to, I will
so your not. ok. thanks hackerman faggot
>>
ooooooOOOOOOOOOHHHHHH NYOOOOOOOOOOOOOO~
>>
>>106597211
>Guys how do you turn this into a UI?
go on duck.ai and ask claude haiku 3.5 to turn this code into a gradio UI that exposes all variables with reasonable sliders and steps
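A hand-written version of that suggestion could look something like the sketch below, which wraps the Qwen-Image script posted earlier in the thread in a Gradio form. The slider ranges and defaults are guesses, `make_generate_fn`, `build_ui`, and `main` are hypothetical helper names, and the heavy imports are kept inside functions so the adapter can be exercised without a GPU.

```python
# Resolutions copied from the script posted earlier in the thread.
ASPECT_RATIOS = {
    "1:1": (1328, 1328),
    "16:9": (1664, 928),
    "9:16": (928, 1664),
    "4:3": (1472, 1104),
    "3:4": (1104, 1472),
    "3:2": (1584, 1056),
    "2:3": (1056, 1584),
}


def make_generate_fn(pipe, make_generator):
    """Adapt a diffusers-style pipeline callable to plain UI inputs."""
    def generate(prompt, aspect, steps, cfg, seed):
        width, height = ASPECT_RATIOS[aspect]
        result = pipe(
            prompt=prompt,
            negative_prompt=" ",
            width=width,
            height=height,
            num_inference_steps=int(steps),
            true_cfg_scale=float(cfg),
            generator=make_generator(int(seed)),
        )
        return result.images[0]
    return generate


def build_ui(generate):
    import gradio as gr  # imported lazily so the adapter above works without it
    return gr.Interface(
        fn=generate,
        inputs=[
            gr.Textbox(label="Prompt", lines=3),
            gr.Dropdown(choices=list(ASPECT_RATIOS), value="16:9", label="Aspect ratio"),
            gr.Slider(minimum=1, maximum=100, step=1, value=50, label="Steps"),
            gr.Slider(minimum=1.0, maximum=10.0, step=0.5, value=4.0, label="True CFG scale"),
            gr.Number(value=42, precision=0, label="Seed"),
        ],
        outputs=gr.Image(type="pil", label="Result"),
    )


def main():
    import torch
    from diffusers import DiffusionPipeline

    device = "cuda" if torch.cuda.is_available() else "cpu"
    dtype = torch.bfloat16 if device == "cuda" else torch.float32
    pipe = DiffusionPipeline.from_pretrained("Qwen/Qwen-Image", torch_dtype=dtype).to(device)

    def make_generator(seed):
        return torch.Generator(device=device).manual_seed(seed)

    build_ui(make_generate_fn(pipe, make_generator)).launch()
```

Running `main()` loads the model once and serves the form locally; the pipeline stays in memory between generations.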
>>
>>106597283
which of these "plenty UIs" support as many models/branches as Comfy?
>>
>>106597299
cool kids wrap it with custom html/css/js without a bloated library with hidden telemetry
>>
>>106597299
>sliders
shit taste
>>
>>106597305
>support as many models/branches as Comfy?
you don't use 90% of them. shocking, I know
>>
>>106597305
If they were alive, they would.
>>106597307
meds
>>
>>106597090
I have no doubt that there is a large 'market opportunity' for a cohesive streamlined inference application to compete with Comfy, but they need to stop trying to recycle A1111 endlessly and just start making a new UI from scratch, one that makes sense for the differences of newer models, including video.
>>
>>106597289
See and that's why they only support Comfy and we are where we are now.
>>
>>106597315
huh
>>
>>106597305
I don't know, reforge can handle all my t2i/i2i needs and I recently switched to swarmui and now its comfy backend because it offers more control and flexibility. Some workflows can't really be dumbed down to the level of a1111, that's just how it is
>>
>>106597325
They don't support Comfy, they support Diffusers, and Comfy steals their inference code just like everybody else. From there you have FOTM hacks people sometimes do but they're often complete shit and snakeoil. But that's not saying you can't just steal their code from their Comfy node too.
>>
>>106597289
Based.
All of the nunchakufags are on suicide watch
>>
>>106597342
yeah except thats bs
>>
>>106597307
>>106597308
>cool kids wrap it with custom html/css/js without a bloated library with hidden telemetry
yeah i know, i did that for self forcing when i briefly hosted an endpoint here
i was just giving the simplest solution for getting from A to B
>>
>>106597321
The fact is making a UI right now is more of a hobby and labor of love with almost no money-making opportunity. There really is no value in supporting every possible model, which is one of the reasons why doing these UIs is a chore.
>>
>>106597367
>doing these UIs is a chore
i'd argue its a good learning experience, and also helps you learn how to implement what you read in papers, but if that doesn't matter to you then i agree
>>
>>106597379
Again, if you're doing it for "learning" then there's no value in making it public so that the niggers can harass you.
>>
>>106597387
>there's no value in making it public
sure there is, people can make issues/PRs with improvements that you can learn from, and you can put it on your resume. are you the thread troll? do you want to see a beautiful child's feet?
>>
>>106597387
that's just open source in general
>>
>>106597404
lmao okay buddy you're delusional
where's your github?
>>
>>106597411
>lmao okay buddy
Concession accepted. I guess that's a no on the beautiful child's feet, oh well.
>>
>>106597407
surely at this point you've noticed AI has attracted some of the most obnoxious retards in software, literal 90 IQ brownoids who barely know how to use a computer, let alone know what a graphics card is, needing you to support their favela dirt PC with handholding tech support
>>
>all this shitfighting
i give up on local, your models suck anyway. time to play with api
>>
>>106597414
wow no github, knew it
it's crazy, shaming me to do work for you doesn't actually work, ask yourself why
>>
>>106597428
You can't gen cunny on api
>>
>>106597418
I think the worst nigger stench comes from the anistudio fudder. ani is at least proving he can into C which is admirable. nobody should be hating on that
>>
>he clearly has never used the bytedance api
>>
>>106597430
stfu and do work like a good code monkey boy
>>
It's kinda poetic, actually.
You had years to get off of gradio and noodles, and now look at the present: abandonware and SaaSshit. You only have yourselves to blame
>>
complete localtard meltdown
>>
I am having an absolute brainfart help me out.
Which node is this? Or whichever 16ch vae compatible empty latent node you know of that also lets you set framerate?
>>
And that, my sweet anons, is how you win an argument with a troll. With the power of beauty :)

I'm serious about forking comfy though, none of this SaaS shit worries me in the slightest.
>>
>>106597489
>Comfy
Yeah fuck off
>>
>>106597440
you can, but only if you enjoy going to prison afterwards.
>>
>>106597489
hunyuan latent or empty wan video or something like that
>>
i'm going to make a new ui i'll report back in a month
>>
why would you betray the snake
what has the snake done to you
>>
Nothing local does will change the fact that SaaS has achieved complete victory in the AI space
>>
I got bored with video gens, 3-5 seconds is just not interesting enough anymore.
What's the best current model for anime characters and creative stuff?
I'm using an llm to write a novel for fun and I want to be able to visualize the characters, preferably with a model that can also do character turnarounds and i2i so I can change their outfits if needed.
>>
>>106597535
>child nudity is legal in america.
That doesn't mean shit. If the Cloud provider doesn't want that type of content on their platform they're going to ban you and possibly send a tip to LEA for investigation. You've got to be a moron to even think of doing that shit on cloud shit.
>>
>>106597367
>why doing these UIs is a chore
I'd argue that since you already spend lots of time trying to fix A1111 and its derivatives to work with current day models, that time is better spent making a new UI which is actually built from scratch for the needs of today.

Comfy just offers nodes, which gives huge flexibility but makes certain workflows anything but streamlined, inpainting for example. InvokeAI is a decent solution but it's crippleware made to push you to their proprietary solution, and it's only for image generation.

We'll see if someone takes up the gauntlet, there is opportunity, if not then Comfy will remain the only real option.


