/g/ - Technology


Thread archived.
You cannot reply anymore.




Taste is Subjective Edition

Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107032422

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Neta Yume (Lumina 2)
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd
https://gumgum10.github.io/gumgum.github.io/
https://neta-lumina-style.tz03.xyz/
https://huggingface.co/neta-art/Neta-Lumina

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
first for fuck spam
>>
also no more 1girl
>>
we 2girls minimum now
>>
File: aMakoto.jpg (3.37 MB, 2128x1642)
>>
>>107040573
why don't u reduce the resolution then slowly increase until u oom?
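A minimal sketch of that approach, assuming a hypothetical `try_generate(width, height)` callback that runs one gen and returns False when it OOMs (it is not a real ComfyUI function, just a stand-in for however you launch a gen):

```python
def find_max_resolution(try_generate, start=512, step=64, limit=2048):
    """Raise a square resolution in fixed steps until a generation fails.

    try_generate is a hypothetical callback: it runs one generation at the
    given size and returns True on success, False on out-of-memory.
    Returns the largest size that succeeded, or None if even `start` failed.
    """
    best = None
    size = start
    while size <= limit:
        if not try_generate(size, size):
            break  # first failure: stop and keep the previous size
        best = size
        size += step
    return best


if __name__ == "__main__":
    # Fake backend for illustration: pretend anything above 1024x1024 OOMs.
    def fake_backend(width, height):
        return width * height <= 1024 * 1024

    print(find_max_resolution(fake_backend))
```

The start, step and limit values are placeholders; pick whatever matches your model's supported sizes.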
>>
Looks like the jeets sim card bandwidth got neutered, slowing down his bots.
So refreshing.

I've been testing the latent upscale for wan 2.2 for a few days now. It doesn't seem worth it, for me at least.
Having to run the entire steps over from 0 again takes so long to do.
>>
looks like the bot has arrived
>>
>>107040633
Depends, not sure how much good it will do with just say 4 steps, but if you use more steps it can be beneficial (the more the better). It can prevent "wasting" steps on the high noise model (overall composition, layout, and fundamental semantics) and leave more steps for the low noise model (details, textures, and video coherence) for the whole denoising process. Just gen videos and compare them side by side with something like GridPlayer and see the difference for yourself.
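Rough sketch of the step budgeting, assuming you hand the first part of the schedule to the high noise model and the rest to the low noise model (the function name and the fraction are made up for illustration; the returned pairs map onto start/end step fields like the ones on an advanced sampler node):

```python
def split_steps(total_steps, high_noise_fraction=0.5):
    """Split a denoising schedule between the high and low noise models.

    Returns two (start_step, end_step) pairs: the first for the high noise
    model, the second for the low noise model. The fraction is an assumption
    you tune by comparing outputs, not a recommended value.
    """
    if total_steps < 1:
        raise ValueError("total_steps must be at least 1")
    if not 0.0 <= high_noise_fraction <= 1.0:
        raise ValueError("high_noise_fraction must be between 0 and 1")
    switch = round(total_steps * high_noise_fraction)
    return (0, switch), (switch, total_steps)


if __name__ == "__main__":
    # With more total steps, a smaller fraction still leaves plenty for details.
    print(split_steps(20, 0.25))
```

Lowering the fraction is one way to "leave more steps" for the low noise model without changing the total.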
>>
>>107040636
ok this looks real.. aside from their angular faces i don't think i'd suspect this being generated
>>
>>107040636
Noice
>>
The bot needs posts to reply to. So if we don't post, it won't post.
Genius idea.
>>
>>107040648
to be fair, these gens were considered good back in 2023 when local was still competitive. but now saas is so far ahead that these gens look like shit in comparison
>>
when will they invent an ai that can moan
>>
>>107040653
Did not work, trying another gen without any loras to see if there's any weird interaction fucking up.
Or maybe I suck at prompting
>>
>>107040616
Because I wanted max scan lines after making the gen.
>>
>>107040653
It's already possible retard
>>
>>107040658
Don't know if you're trolling me or not but will try it out lol
>>
>>107040660
prompt in chinese. works 110%
>>
What's this max allocated memory?

>>107040658
Stop replying to the bot.
>>
>>107040665
Use NetaYume, not the original Neta Lumina, if you aren't already. They can do it decently enough, DPM++ 2S Ancestral Linear Quadratic seems to give the most consistently good results for it. Particularly long text support definitely isn't as strong as in e.g. Flux or Qwen though.
>>
>>107040660
nope
>>
What is his endgame?
>>
>>107040670
just check the default templates, retard.
do you know how to breathe?
>>
>>107040671
>Also its very good at everything NSFW with any NSFW lora
lol no. it's ok at best if you stack half a dozen loras and fuck around with strengths
>>
so this is the power of a bot...
>>
>>107040682
Chroma does the same if you look at it wrong. It literally was trained off of SD 1.5 hyperslop gens
>>
>>107040685
Hey I wrote this post a couple days ago
>>
Can i get a reply?
>>
>>107040697
whoops i meant unintuitive lol
>>
>>107040697
Sure heres a (you)
>>
make the bot moan
>>
>>107040706
Look at the details, everything comes out incredibly melted.
>>
>>107040708
I'm sorry bud, but it is a skill issue too. You can absolutely get by on 8gb for imagegen, especially if you can run sage attention.
Hey everyone starts somewhere.
>>
>>107040665
Copy that, I'll just say I made the gen and put it in GIMP to make the scan lines.
>>
test
>>
>>107040739
yeah it's slopped and there's minor flaws in places but it's generally coherent, like I said Qwen with no Lora will give you something fucking hilarious that's not even vaguely close to correct for the same prompt
>>
>>107040728
Nigga i'unno.
>>
>>107040747
Does xis rooting neofurgina smell that bad?
>>
>>107040748
What in the god damn hell is that?
>>
>>107040753
do you think only a single anon here likes realism?

...guess what that means when put together
>>
0459
Reminding to include four digits or an image when posting for easy filtering. Or continue eating shit.
>>
>>107040761
>Hatsune Miku appears on screen from the left and shakes hands with the blue-haired anime girl
you know what's sad, is that Wan is able to add a new character to the scene while keeping the same artistic style while Qwen Image Edit can't
>>
>>107040755
I have a realism SDXL checkpoint but I gotta do some stuff so it doesn't make fat chicks lol
>>
>>107040640
>ok this looks real
her eyes are shaking more than in an earthquake in japan lool
>>
>>107040785
no it doesn't
compare frames from the middle of both videos
you can't make long vids with such quality drop
>>
>>107040775
guess I am because I rarely use it lol
>>
>>107040799
have you ever heard of light lora? that is the entire point of it, you must be new as fuck
>>
Why would you guys post big booba bait and then stop reeee
>>
>>107040804
the only negative is very slightly slower gens, it only increases quality when you have a negative prompt with stuff like blurry, low res and stuff in it, not using NAG if using CFG 1 is just retarded
>>
File: 1750128155095051.mp4 (790 KB, 640x640)
I can't get Wan2.1 to do jack shit, only Wan2.2 just werks
>>
>>107040813
yea, always double up your negatives in chinese, I found that works way better, having them only in chinese or only in english works half the time
>>
>>107040813
What's your prompt?
>>
>>107040816
...what?
>>
>>107040826
can't you read?
>>
>>107040836
if only Qwen Image wasn't so slopped it would be an incredible model
>>
File: 1761147482448868.gif (1.19 MB, 439x360)
>>
File: ComfyUI_03834_.png (2.79 MB, 2560x2560)
>>107040801
I am new but not at the same time. I hated A1111 and moved to ComfyUI and it's worked a billion times better. Sorry if you hate it friend, it just works for me.
>>
>>107040844
it is, crushes details and slops the output. if you don't think so you need to get your fucking eyes checked
>>
>>107040849
I wonder how long it took him and with what hardware.
>>
.
>>
>>107040860
it's beautiful, and some people have the nerve to say that we don't need artist tags, we definitely do
>>
>>107040855
13th Gen Intel(R) Core(TM) i5-13400F (2.50 GHz)
48GB DDR5 RAM
NVIDIA RTX 4060 8GB GDDR6
~80s/gen
>>
>>107040866
Potato tier
>>
>>107040873
with 5 shift (default) with new loras, 1 strength

the anime girl gets up and runs out the door of the computer lab to her left very fast.
>>
>>107040872
Oh yeah, forgot to mention: I have a 512GB M.2 SSD
>>
>>107040875
shit, I'm retarded
>>
what's with all the nonsense replies?
>>
>>107040878
I have 16gb (4080) and am trying not to OOM but we'll see
>>
>>107040873
Potato? I agree. Let's see your gens me boyo
>>
>>107040880
>Not thrilled with downloading a bunch of single use loras

>coming from pony being able to do basically everything without needing a lora
>>
Stop posting in this thread he wants that.
>>
>>107040887
No, the method it uses for conditioning is shit. I want the GUI of the node to be put on something better.
>>
>>107040882
wat I always use IL mah niggeh. I will not deny the single use LoRA criticism though.
>>
>>107040890
Might be cool. I just assume it will destroy gen times completely
>>
File: makotobench.jpg (3.25 MB, 2432x2432)
>>107040891
Never. I get any gen ~80s
>>
File: 1755003560526095.mp4 (411 KB, 640x640)
>>107040816
I never needed to use negatives in 2.2
>>107040821
fixed camera, maid leans forward emphasizing cleavage, stormy orange and pink clouds in the sky move, building lights in background flicker

By contrast this is what I could do effortlessly with 2.2
>>
>>107040895
you are talking to a bot
>>
>>107040900
if you're using kijai it may not work, comfyui automatically offloads whatever doesn't fit on vram to sysram
>>
>>107040902
I'm no fucking bot
>>
>>107040902
It's beautiful. A tower of autism. Any reason you used hidream?
>>
>>107040906
three tests in, it's not looking hot. it's not following the prompt but the camera seems to be more dynamic?
>>
>>107040908
>These also cause slow motion.
fuck
>>
File: 1743768083817579.mp4 (974 KB, 640x640)
>>107040900
>>
>>107040896
It offloads the models to RAM so if you don't have a good VRAM card, it'll take care of you, but it might be slow.
>>
>>107040913
It's roughly on par with 11labs, minus the ability to direct it. No lora or training code, either, so no sexy SFX.
>>
>>107040914
I’m sorry, but I can’t generate that image. As an AI developed to follow strict content and safety policies, I can’t create or depict that type of visual. If you’d like, I can describe the image instead or help you find safe alternatives.
>>
can someone wish me a happy birthday
>>
>>107040924
Happy birthday
>>
>>107040935
I noticed less OOM but honestly it can be general updates or drivers or anything else.
>>
File: ComfyUI_07555_.png (3.49 MB, 2560x2560)
Makoto and I are going to bed. Have a good night guys!
>>
>>107040946
good to know, thx
has there been a speed improvement or anything?
>>
File: ComfyUI_07455_.png (3.64 MB, 2560x2560)
>>107040950
Nope.
>>
>>107040636
hot
>>
>>107040960
That's a bot
>>
File: ComfyUI_07363_.png (3.77 MB, 2560x2560)
>>107040971
bot's a'int got my gud shit.lol
>>
i need a flux or similar model for creating backgrounds/buildings with signage, like a movie theater with a sign and marquees. is flux dev enough or does anyone have better recs?
>>
>>107041054
We only do 1girl here.
>>
>>107040971
>expecting a braindead avatarfag that replies to ANY post you give him to have any critical thinking skill
LMAO dude, just lmao
>>
File: 1742416902601900.mp4 (1.27 MB, 832x480)
new kijai 2.2 high lora and shift 8 seems to work nice for wan
>>
>>107040459

Damn that image has a lot of symbology like "Roko's basilisk".. the horror of technology and madness of erotic.. Serial Experiments Lain on the background maybe represent that multiverse online.

"The game".
>>
File: 1738199869038040.mp4 (525 KB, 640x640)
>>107041079
>>
can anyone link any ok wai illust comfyui workflow?
>>
File: 00025-3004574862.png (1.1 MB, 896x1152)
>>107041068
shut up dumb faggot, here's a 1goodboy
anyway i grabbed this https://civitai.com/models/1931032?modelVersionId=2185585
>>
File: 1752536364057329.mp4 (1.56 MB, 480x832)
>>107041153
the camera pans out and the man with the beard is standing beside two very fat women in a black dress.
>>
File: tmpjujh5i03 2.mp4 (546 KB, 560x832)
>>
What if they made Chroma but good?
>>
File: chroma anatomy.png (159 KB, 340x213)
>>107041219
Ohh brother this guy STINKS!
>>
Ran waited until euro hours and is now trying to bait people here.
>>
File: 1756067235279031.mp4 (890 KB, 832x480)
>>
>>107041341
me when I came up with a name for a variable or a class
>>
>>107040636
is that locally generated?
>>
File: thisisfine.jpg (589 KB, 972x509)
not even chatgpt was able to save me from this
>>
>>107041585
yes with bouncy walk lora
>>
retard here, I'm mainly using forge and reforge right now for image gen, is there a way to do txt2vid or img2vid without having to touch comfyui?
Is it even worth it to try on a 3090? I saw wanGP in the rentry, maybe that's where I need to start?
>>
>>107041732
crazy tbqh
>>
>>107041742
>Is it even worth it to try on a 3090?
Yes

>without having to touch comfyui?
you will touch comfy and you will like it
>>
>>107041742
i think neoforge does wan videos as well? the other alternative is wan2gp. also, people with worse gpus gen wan videos so you don't have to worry about your 3090.
>>
nunchaku wan when bros? NUNCHAKU WAN WHEN!??
>>
>>107040946
>>107040960
>>107040982
don't forget to hang yourselff on the way out, worthess retard
>>
anyone tried HoloCine?
>>
>>107041881
thanks for calling out the mentally ill, but could you also post a gen? I want to see some juicy 1girls
>>
>>107041881
>he's the thread moderator
Time for your medicine.
>>
>>107041881
I'll do so when the opportunity presents itself! Sorry it's taking so long, I don't know how to tie knots...
>>
File: wan.jpg (39 KB, 658x657)
>>107041872
Probably never at this point, they're too busy with image models. Radial attention gets 1 update every month so... We have context nodes and, apparently, LongCat and SVI LoRAs are soon to be properly implemented in Comfy. If they can get these new LoRAs to work in Comfy natively, I wouldn't care too much about Nunchaku.
>>
Avatarfags like Ran ruined these threads.
>>
>>107041769
>you will touch comfy and you will like it
i really don't want to
I'm used to the A1111 UI and I'm not going to spend another week trying to make sense of whatever the fuck is going on in comfyui with its lines and boxes
>>107041809
ah right, i only have forge and reforge now
i couldn't really find anything on the internet about using wan there so I'll look into neoforge, thanks
>>
>>107041742
>Is it even worth it to try on a 3090
the fuck you mean even worth? a 3090 is quite capable
>>
>>107041978
fags like you ruined the thread
>>
>>107040459
> Neta Yume
Still being trained? Because unusable.
>>
File: image_00003_.jpg (687 KB, 1184x1560)
>>
THE JANNY DID IT
HOOOOLY
>>
>>107042742
>THE JANNY DID IT
doesnt really look like it...
>>
File: dmmg_0033.png (1.54 MB, 832x1216)
>>107040459
i got my own edition edition

>>107041175
very good boy

>>107042684
my heebies are jeebied
>>
File: image_00012_.jpg (858 KB, 1170x1560)
>>107042817
>my heebies are jeebied
Dario Argento made me do it
>>
death to spaghetti
>>
>>107040946
>>107040960
>>107040982
GN Makoto anon.
I actually don't hate you.
Why are you here though, did you get banned from /v/?
>>
>>107041668
How about installing it properly rather than using what's probably an outdated and buggy script?
>>
File: ComfyUI_00001.mp4 (1.29 MB, 480x640)
>>
Its over
>>
>update comfy like you did it a hundred times before
>it shits itself
reinstall time again
>>
No such thing as good Local model. Its 100% over since this shit is regulated harder than Marijuana
>>
>>107043046
what are you on about?
>>
File: image_00018_.jpg (678 KB, 1184x1560)
>chroma struggles with machete
5 billion dollars wasted on training
>>
File: file.png (2 KB, 296x36)
>unfucks your fucked update
I don't get why people are afraid of pulling in 2025
>>
File: ComfyUI_00004.mp4 (1.36 MB, 480x640)
>>
>>107043068
but then you still need to update? how do you get out of the cycle?
>>
File: prompting.jpg (22 KB, 459x414)
Do we have "prompt travel" for native comfy yet? Apparently using | along with its context node works for kijai wrapper nodes https://github.com/kijai/ComfyUI-WanVideoWrapper/issues/686

While we have 2 native context nodes, it doesn't seem to work with them unless there's something missing? Example prompt could be something like:

Man picks his nose |
He raises his head and notices camera |
He walks up and leaves out of shot
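
To make the `|` syntax concrete, here is a small sketch of how such a prompt could be cut into per-segment frame windows. This is an illustration only: the even split and the `split_prompt_travel` name are assumptions, not how the wrapper or any native node actually schedules prompts.

```python
def split_prompt_travel(prompt, total_frames):
    """Split a '|'-separated prompt into (start_frame, end_frame, text) windows.

    Assumption: frames are divided evenly between segments and the last
    segment absorbs any remainder. Real nodes may schedule differently.
    """
    segments = [part.strip() for part in prompt.split("|") if part.strip()]
    if not segments:
        raise ValueError("prompt contains no text")
    per_segment = total_frames // len(segments)
    windows = []
    start = 0
    for index, text in enumerate(segments):
        is_last = index == len(segments) - 1
        end = total_frames if is_last else start + per_segment
        windows.append((start, end, text))
        start = end
    return windows


if __name__ == "__main__":
    example = (
        "Man picks his nose | "
        "He raises his head and notices camera | "
        "He walks up and leaves out of shot"
    )
    for window in split_prompt_travel(example, 81):
        print(window)
```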
>>
Bro i just want to prompt. WTF with the nodes ?? I hate it
>>
>>107043109
doesn't work with t5. it's clip only
>>
>>107043132
I dont use wrapper, only native, wrapper for me is slow.
>>
Can I store both wan models in my ram? Or does it have to be in the vram for speed?
>>
>>107043163
yes
>>
>>107043163
they're used one at a time, why do you need both loaded?
>>
>>107043167
Great reply.

>>107043186
I was thinking if I can fit both of them in the ram, I'll save some time perhaps. But the ram vs vram speeds would probably be vastly different?
>>
File: image_00029_.jpg (655 KB, 1184x1560)
>>
>>107040946
>>107040960
>>107040982
>>107042951
>crossboarding avatartroons
disgusting
>>
>>107043248
I don't get
>>
>>107043028
You can easily rollback commits and install previous requirements. You don't even need some snapshot manager bs. It takes two command lines in comfys venv/python environment. Even a total retard can work out how to do it using an llm.
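For reference, the two steps look roughly like this when scripted. It's a sketch under assumptions: the commit hash is a placeholder you'd take from `git log`, and it assumes the repo has a `requirements.txt` at its root and that you run it with the same Python environment Comfy uses.

```python
import subprocess
import sys


def rollback_commands(commit, requirements="requirements.txt"):
    """Build the two commands: check out an older commit, then reinstall deps."""
    return [
        ["git", "checkout", commit],
        # sys.executable keeps pip tied to the interpreter running this script,
        # which should be the venv's Python.
        [sys.executable, "-m", "pip", "install", "-r", requirements],
    ]


def rollback(repo_dir, commit, dry_run=True):
    """Print each command; run it inside repo_dir only when dry_run is False."""
    for cmd in rollback_commands(commit):
        print(" ".join(cmd))
        if not dry_run:
            subprocess.run(cmd, cwd=repo_dir, check=True)


if __name__ == "__main__":
    # "abc1234" is a hypothetical commit hash, shown as a dry run only.
    rollback(".", "abc1234", dry_run=True)
```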
>>
>>107043222
I don't really understand where you're going with this. Loading the model takes 12 seconds for me with an nvme so I get why you need both loaded unless you're loading off an HDD which is insane and you're not loading both on vram unless you have a 6000 pro or some shit. Having ONE model fully loaded on vram is faster for inferencing vs offloading parts of it but not by a huge margin.
>>
>seems like you are butthurt
>>
>>107043296
>so I get why you need both
*I don't get
>>
>>107041974
>they're too busy with image models.
Lol, lmao even. Chroma was supposed to be released months ago. They prioritized Qwen on day 1. Chroma is just a modification to Flux Schnell, that should've been no issue. Qwen is a novel architecture. There's also practically no advantage to using Qwen unless you're Chinese, given that it's nothing but plastic and Chroma shows equal amounts of improvements to prompt adherence to Qwen while still being more important since it's completely uncensored.
>>
>>107043296
Yeah it's fast for me too. But if the speed of genning is just as fast as if it would be loaded into vram, then I'd do that. But I don't know if the ram makes it slower to gen.
>>
'ate videofags simpleas
>>
>>107043327
Chroma is a blurry 512x512 failbake, so bad the creator can’t even decide on a proper final version. They were right not to touch it
>>
>>107043381
>20B
>Can't do variety for shit
>Can't do realism for shit
>Only good at anime
>Still can't do styles that are not hardbaked
>Not much better than Flux

Qwen an even bigger failbake. Chinks should be smarter than that.
>>
File: image_00038_.jpg (581 KB, 1184x1560)
>>
gens are getting progressively worse each thread
have local models peaked?
>>
b8 is getting progressively worse each thread
have the trolls peaked?
>>
>>107043542
Yeah, in 2023 with SDXL.
>>
>>107043572
you gotta be blind not to notice
>>
File: ComfyUI_20557.png (3.55 MB, 1200x1800)
>>107043109
I prompt like that using periods and it just werks (you know, like it's a series of short sentences), what are they trying to accomplish here that a period doesn't do?

>>107043542
Well, show everyone how it's done.
>>
I like how some innocent offhand comment causes some schizo to fight persons that only exist in their head for 3 threads
>>
>>107043751
What do you mean?
>>
Damn, the amount of frames really makes a big difference in how much vram is used. Obviously it's going to be less, but I'm using 70% less vram, crazy.
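Back-of-envelope sketch of why frame count matters so much. It assumes the compression usually described for the Wan 14B models (4x in time, 8x in space for the VAE, then 2x2 spatial patches), so treat the defaults as assumptions rather than something read out of the model:

```python
def wan_token_count(frames, width, height, t_stride=4, s_stride=8, patch=2):
    """Estimate how many tokens the video transformer sees for one clip.

    Assumed layout: the first frame is kept and every further t_stride frames
    add one latent frame; width and height shrink by s_stride, then by patch.
    """
    latent_frames = (frames - 1) // t_stride + 1
    latent_h = height // s_stride
    latent_w = width // s_stride
    return latent_frames * (latent_h // patch) * (latent_w // patch)


if __name__ == "__main__":
    for frames in (41, 81):
        print(frames, wan_token_count(frames, 832, 480))
```

Token count grows linearly with latent frames, and attention cost grows faster than that, so cutting frames frees memory quickly. Actual VRAM use also depends on offloading and the attention implementation.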
>>
>>107043806
I've had WAN bust VRAM because I turned the resolution down once, no idea what's going on there.
>>
>>107043788
most /ldg/ threads
>>
>>107043892
?
>>
File: 00077-760576019.jpg (358 KB, 1248x1848)
>>
File: image_00052_.jpg (590 KB, 1170x1560)
>>
revving up
>>
File: image_00056_.jpg (516 KB, 1170x1560)
>>
>>107044114
where's all the blood and tears
>>
File: image_00059_.jpg (395 KB, 1170x1560)
>>
Noticing a severe lack of local progress… did something go wrong?
>>
>>107044217
do you not see the update folder?
>>
Seems like local fell off… what happened?
>>
Who here knows about Sora 2? It’s an insanely powerful state-of-the-art video model developed by OpenAI that can generate practically anything.
>>
>>107044231
what do nonlocal fags do when they want to gen tiddies
>>
>model drought
>bot spam
>revolving door of schizos, comfy fudders and saas shills
what a shithole
>>
This thread is so weird
>>
*yawn*
>>
I didn't, that's the github example. I had to piece together an sdxl workflow from part of that one and the sample sdxl workflow, because the sample sdxl isn't set up for 2 subjects.
>>
File: image_00060_.jpg (473 KB, 1184x1560)
>>
Tired of slow gens and plastic outputs? Click here for free ComfyAPI credits. Interface with the most powerful models in the world using ComfyUI’s API nodes.
>>
>>107044054
What upset you this time?
>>
>>107044288
fuckin tell me about it
>>
>>107044248
still better than the alternatives
>>
>>107043691
sex with jenny
>>
As a new-generation image creation model, Seedream 4.0 integrates image generation and image editing capabilities into a single, unified architecture.
>>
OMFG not again... i already have 10 versions of these
what are these new ones supposed to do better i wonder
>>
Reve: Reimagine reality. Create, edit, and remix images. Combine natural-language edits with a drag-and-drop image editor.
>>
Ok so uh, 1080p at 200frames is not worth genning.
(5090)
>>
File: image_00062_.jpg (592 KB, 1184x1560)
>>
Generate stunning images, explore creative ideas, and turn inspiration into reality with Ideogram.
>>
Midjourney is an independent research lab exploring new mediums of thought and expanding the imaginative powers of the human species.
>>
>>107044248
comfy fudding is fine because we need a better application than that grifty piece of shit
>>
>>107044362
wtf are you doing. wan doesn't support 1080p, you'll just get nonsense
>>
>>107044362
Only 46 minutes? Or 46 hours kek
>>
>>107044385
Not with i2v.
I go above 720p 99% of the time. The quality increase is huge.

>>107044391
46m yeah, I'll definitely do this for 81 frames, the context to parts of the image is too whacky.
>>
>>107044381
we don't even have a stable vibe voice implementation for comfy. they are all broken shit
>>
See it. Dream it. Seedream 4
>>
>>107044375
Why don't you contribute to the studio, then?
>>
thank you Julien for making us the KINGS of /g/
your raped bitch seethe will ensure /ldg/ lasts for eons
>>
>>107044436
I don't contribute to software like comfy because I think it's a waste of time
>>
>>107040813
What's the use case for wan 2.1? I jumped ship for 2.2 and never looked back. Loading two models back to back sucks but then I get to see if the video will turn dogshit earlier so I can restart faster.
>>
File: image_00067_.jpg (650 KB, 1170x1560)
>>
>>107044460
using SVI for longer generations until (if) they make a 14b 2.2 implementation. other than that I dunno. I remember just trying to get a girl to dance was painful, with gen after gen of body horror and arms ghosting out of existence. 2.1 was pretty ass desu
>>
Watching SaaS lead the ai race
>>
I don't smell Julien
Don't let the false flag work. Julien is more primal, he lacks tact and inhibition. He's a broken childhood rape victim that feels rejected and his desperation leaks in every post.
>>
>>107044246
Visit the Dalle general. That general unironically peaked at early Dalle 3. Now all they post is SD 1.5 tier gens. The recent SaaS models suck so hard that the only thing left for them is to troll local threads, because there are many trannies from such threads. (Which to be fair, you have to be an insane, deranged tranny to even consider using a SaaS model in the first place).
>>
>>107044520
12GB works theoretically, but my computer always shits itself when trying to run the full models, seems like 32GB RAM for swap isn't enough.
>>
>>107044536
How's Qwen and Chroma?
Can i run these checkpoints with 12gb VRAM?
I want to know if they are worth it, i am using flux nf4 gguf but it's pretty meh, you cannot get good landscapes without massive amounts of blur and it often ignores prompt.
It simply sucks at details.
>>
which card is faster 3090 24gb vram or 5080 16gb vram?
>>
why the short break between spam?
>>
>>107044362
>46 min
eeewwwwwww. isn't genning at a low res then upscaling the way to go?

>>107044460
fluid movement. 2.1 can do some reeeeally sloppy movement at the cost of consistency, where 2.2 requires lora stacking
>>
>>107044569
I gen with chroma or sdxl based local, then I animate with wan.
It's not one or the other.
>>
Just generated a 4k Seedream image in 10 seconds. API models are insanely powerful
>>
>>107044569
I dunno if you're a bot or not, post with a picture or repeat the question
>>
>>107044573
It's ironic how we went back to increasing pagefile size in the unholy year of 2025, how much ram is even enough at this point?
>>
>>107044583
I just wanna know anons preferences
Like I noticed here a lot of people using chroma (or qwen)
>>
>>107044586
forge couple on a1111/forge forks. works best for 2d anime/cartoon but issues of bleeding are present with 3dcg, cgi and realistic styles.
>>
>>107044595
its either regional prompting for sdxl based models, or just use a recent model and rawdog it (there might be some bleeding but with enough rolls you'll get decent results)
>>
>>107044262
what lora is this my man
>>
>they still think this is a bot
You underestimate him
>>
>>107044604
>this guy slaps your 1girl waifu's ass
what do you do?
>>
>>107044572
ip block evading is my guess
>>
File: QwenImg_00028_.png (1.35 MB, 1152x1440)
never subscribed to the nogen discrimination, but it serves an actual purpose here
>>
>>107044609
it means COMFY NUNCHAKU failed to load, check the fucking logs retard, and give them to chatgpt
>>
>>107044583
>ran being nasty as usual
i have never seen a single positive post from you
>>
>>107044610
>thats not a failgen

thank you for the encouragement.
>>
>>107044615
The thread is filled with browns whos primary fascination is not the tech because they are low iq, and given that in any online forum its always gonna be more likely to have those people post because they are terminally online and mentally ill, every forum will devolve into a clownshow of mostly those posters
>>
good job jannies
>>
>>107044619
you mean americans? you are not an exception to this, mutt
>>
>>107044624
this is generally why we're a bullied minority in these threads, the only subgroup unironically satisfied with the simpler things. Many such cases!
>gens on comfyui
>uses whatever model works
>doesn't complain unless its a skill issue
>>
>>107044569
5080 has slightly more cuda cores and is faster overall and even faster with fp4 optimizations, but 16gb means if you want to do video gen it might be slower because you'll have to offload more, not sure by how much.
>>
>>107044629
>i can't share the full image of because its a failgen that put her on top of the table with a gigantic hyper ass in focus
thats not a failgen
>>
>>107044634
Sounds like bullshit to me
>>
>>107044610
I don't think it's a big deal. More spam = more visible the lack of moderation. It will be clear for the advertisers if it's worth using this site or not.
>>
You are absolutely right —
>>
>>107044650
Some Nodes Are Missing
When loading the graph, the following node types were not found.
This may also happen if your installed version is lower and that node type can’t be found.

NunchakuQwenImageDiTLoader

Nothing I fucking do fixes this. Yes, I manually downloaded and installed the correct wheels. Please anons, by God, help me, I'm going to rip my hair.
>>
>>107044624
kek, yea /ldg/ seems to be the only infected board, welp time to try again tomorrow when the faggot spammer is asleep
>>
>>107044665
gets boring using "flower garden background", "forest", "jungle", "rocky area", and "grass field" background for majority of my gens. Building interior tend to be the weakness of sdxl.
>>
File: chroma___0085.png (1.49 MB, 832x1216)
>>
>>107044556
You forgot this IP Janny
>>
File: image_00068_.jpg (507 KB, 1170x1560)
>>107044600
Just some classic movies I compiled into dataset, works surprisingly well
>>
>>107044686
Have you tried donating another $200000? It might be enough to allow him to upgrade to 768x768 training!
>>
>>107044691
>remember in my tired brain you can just prompt any kind of eyes you want
>leave out the character tag and it's gonna wing it a bit
>get this

d'aaaawwww adowable eyes gen i can't share the full image of because its a failgen that put her on top of the table with a gigantic hyper ass in focus
>>
>debo reduced to this level of pissing himself
>>
>>107044650
Who advertises on 4chan though? I only remember some schizo woman posting ads here about getting watched.
>>
>>107044744
seedvr2>interpolate
>>
>>107044665
>yea /ldg/ seems to be the only infected
Very strange when you think about it. Who could despise this blessed thread?
>>
>>107044765
That's more a consequence of the shitmix he uses desu
>>
>>107044765
It's not like he's been attacking the general with his friend for over a year and didn't kick it into overdrive during the anniversary
We know who it is he always dog whistles to his nemesis so he can seethe at him.
>>
>>107044897
It's always about drama with you, thread schizo.
>>
>>107044410
i'd ask to see the gen but it's probably futa or something worse
>>
File: image_00078_.jpg (482 KB, 1368x1368)
>>
File: image_00085_.jpg (562 KB, 1336x1768)
>>
why does every general thread outside of this thread not like comfyui very much?
>>
File: ComfyUI_20564.png (3.01 MB, 1200x1800)
>>107044329
Not 'till after dinner!
>>
is this general still infested with a bot
>>
>>107045299
why are you lying? oh right, you're just spreading FUD like you explicitly said you would
>>
>>107045299
it's unstable, full of telemetry, poothon, waste of time and the devs won't bother adding better tooling, custom nodes that should just come with the fucking thing and grifts relentlessly. also it's reddit focused, it's not a channer ui
>>
>>107045299
You don't need comfy if all you use are XL mixes
>>
File: ChromaVersusKrea.jpg (1.87 MB, 2496x1824)
Tested Chroma DC 2K versus Flux Krea on the same prompt / seed for higher-res one shot gens. Both were 1248x1824 direct, just 25 steps. I think they both did pretty good coherence-wise overall, Chroma did come out with noticeable horizontal scan lines though.
>>
>>107045299
spaghetti
>>
>>107045323
all valid reasons as to why it's overvalued at 17 mil. one gui that gets rid of python will make it useless
>>
>>107045343
might get rid of the banding artifacts with different negative prompt
>>
>>107045352
spaghetti apps are pretty cancerous for anyone not ultra autistic
>>
>>107045299
I think that's mostly /h/ really, but they hate everyone and everything. /sdg/ I've never seen care, and definitely no one on the /b/ thread really talks about that kind of thing, that thread is almost purely images nowadays to the extent it's sort of boring to participate in IMO.
>>
>>107045299
It's a broken buggy piece of shit lacking basic functionality. The only people that "like" it just download other people's workflows and circumvent having to actually interact with the software beyond tweaking a couple parameters and pressing run
>>
Please work on your wrapper
>>
>>107045382
what are you working on anon?
>>
spaghetti holocaust when
>>
Read up the thread a bit and the spam is like, even sneakier today than yesterday, it's using comments from fairly old threads in a way that's even more confusing
>>
>>107045393
Is the spam in the same room with you right now, anon?
>>
if you really want comfyui to improve, remove GitHub stars
>>
>>107045393
if he was smart he'd train a LLM on the past threads for this general like the /vg/vn spammer did
>>
no one would care about spam or schizos if people posted more gens
>>
>implying other generals are worth lurking for anything other than 1girl huge bob in vagine slop
>>
File: image_00093_.jpg (474 KB, 1336x1768)
>>
>>107045449
By all means anon ladies first
>>
FIBO (new image gen model, json-based) is out:

https://huggingface.co/spaces/briaai/FIBO
https://huggingface.co/briaai/FIBO

From my tests it's a bit less aesthetically slopped than Flux and Qwen, but can't do anything "copyrighted"
>>
>>107045323
I mean I also use Comfy for lots of image-related processing stuff that isn't always directly genning, always with my own simple workflows that are just a straight line of the order I want shit to happen. Much of it has absolutely no equivalent in other UIs. There's just a massive amount of stuff you can do quite easily in the exact order you want in Comfy that would require me to be running probably multiple other pieces of software too if it wasn't all doable in Comfy. So for me it's the opposite of inconvenient
>>
>>107045480
>>
>>107045502
Needs more sharpening.
>>
>>107045480
My guess is, whatever Chroma does with the original Flux weight, it slowly has to be merged back, just enough that doesn't mess with Chroma's ability to be uncensored, and Flash HD is the closest to that. But since fixing it requires messing with networks we don't understand, it's really hard to replicate with full version.
>>
>>107045490
>image-related processing stuff that isn't always directly genning,
why? there are many more applications that do it better
>>
When ready

>>107045533
>>107045533
>>107045533
>>
>>107045490
are you retarded? comfy has to be run alongside other apps to be actually useful
>>
>>107045526

literal three pixel halo at sweater


