[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Reverse Rape Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106426678

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassic
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
AniStudio: https://github.com/FizzleDorf/AniStudio/tree/dev

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://tensor.art
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://rentry.org/wan22ldgguide
https://github.com/Wan-Video
https://alidocs.dingtalk.com/i/nodes/EpGBa2Lm8aZxe5myC99MelA2WgN7R35y

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
https://rentry.org/ldg-lazy-getting-started-guide#rentry-from-other-boards
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
"Booru tags" give much better results with WAN than your natural language anal discharge.
>>
File: 1754968499258906.mp4 (693 KB, 640x480)
693 KB
693 KB MP4
neat, it worked with my quen edit gen

an anime style Miku Hatsune walks into the picture and points at the yellow sign on the building.
>>
File: 1726073602472557.mp4 (883 KB, 640x480)
883 KB
883 KB MP4
an anime style Miku Hatsune walks into the picture and points at the yellow sign on the building. the camera pans down to show the street.

camera commands work a lot better with 2.2.
>>
>smoking
based
>>
>>106429605
also I just noticed the glass even reflects her, wan is such a neat model.
>>
>>106429609
ciggers are gonna be the first to be dragged out and shot
>>
>>106429545
that petting her parrot is extremely high quality. Especially with the mixed styles.
>>
>>106429589
People would have complained less with this logo
>>
Why does smokeules upset people?
>>
>>106429561
Yeah I tested other prompts as well.

>Show me a girl with no hair. Annotate it so that it's obvious she doesn't have hair. But behind her a reflection of the hair.
https://files.catbox.moe/ni2qsw.png

>Show me a lake with no water in it. Annotate it so that it's obvious there's no water.
https://files.catbox.moe/94c9sb.png

>Split a lake with no water in half. Show me what's in the gap. Ensure there's no ice either. Maybe add a dot at the center of the abyss. But what is it made of? The exterior? Cheese. Of course.
https://files.catbox.moe/3js6e1.png

I think it's even better at prompt following than 4o.
>>
File: 1727381615121523.mp4 (1.11 MB, 640x480)
1.11 MB
1.11 MB MP4
wan is amazing.

Miku Hatsune wearing a cowboy hat jumps out of the yellow logo and lands on the street.

actually did exactly what I asked.
>>
>>
>>106429661
wan 2.2, 2.1 lightx2 lora at 3 strength for high, and 1 for low. (works better than 2.2 lora)
>>
>>106429670
>lora at 3 strength for high
I beg everyone to stop doing this. Please.
Just don't use anything for high. You only need 4 steps.
>>
wait, qwen can do undress right off the bat?
based. But it doesn't draw bussy, do i need lora for that?
>>
>>106429674
kijai was doing it and he's basically AI jesus at this point
>>
>>106429686
Bussy and nips are really bad for qwen, but Wan does an okay job at nipples.
>>
>>106429688
Just try it. I promise. Kij usually only tinkers superficially before moving on to implementing the next meme feature.
>>
>>106429669
checked & kino
>>
File: 1752395474029691.mp4 (1.22 MB, 640x480)
1.22 MB
1.22 MB MP4
Miku Hatsune wearing a cowboy hat jumps out of the yellow logo and lands on a brown horse, that rides away down the street.

almost, got some floor effect as well though
>>
>>106429690
i mean qwen edit. I try nudify and it suprisingly can do that...but bussy is no go
>>
File: 1741847401518950.webm (3.36 MB, 720x1280)
3.36 MB
3.36 MB WEBM
Mikutroonsisters... BEAGHAHAHAHbros are laughing at us...
>>
File: 1741082494842326.mp4 (1.02 MB, 640x480)
1.02 MB
1.02 MB MP4
there we go.
>>
File: ComfyUI_00031_.png (3.1 MB, 1280x1920)
3.1 MB
3.1 MB PNG
>>106429612
jeez anon, next you'll tell me you hate fat bitches too
>>
>>106429716
https://huggingface.co/starsfriday/Qwen-Image-Edit-Remove-Clothes

try this, it's new. just found it by googling "qwen edit clothes remover".
>>
>>106429734
Liking fat bitches is based, unlike liking thin-kutroon
>>
>>106429650
Yh, seems Imagen 4 is an autoregressive model. Gemini 2.5 Flash is just Imagen 4 modified to have the edit functionality.
>>
>>106429734
I love fat women you don't even know.
>>
File: ComfyUI_05437_.png (708 KB, 1280x720)
708 KB
708 KB PNG
>>106428899
very nice
>>
>>106429669
Excellent.
>>
>>106428899
double checked. It's crazy how it mixes the anime style and 3d bird seamlessly. Only low quality areas are the petting.
>>
File: 1744995482964333.png (1.51 MB, 1176x880)
1.51 MB
1.51 MB PNG
>>
File: 1740430942990367.mp4 (596 KB, 640x480)
596 KB
596 KB MP4
neat, said "change the location to an island with a bridge" on an old pc game screenshot (kings quest)

told wan to make him walk on the bridge: if I added to make it pixel art it would prob be more authentic.
>>
>>106429739
works but bussy still badly drawn
image is slightly cooked, but maybe this is just qwen edit's fault
>>
>>106429748
Though it's still not as good at text as 4o.
>>
Protip for training Qwen edit LoRAs. Start with the result you want to see then edit backwards.
Want to make a women naked? Find a naked woman and edit clothes on her. Then when training you put the before pictures in the after folder so the model knows you want naked women.

This works with almost anything. Want a specific anime style? Edit the style you want to look realistic or in a generic style and use the inputs as the outputs during training.
>>
File: Katie Price Jordan 07.jpg (251 KB, 1600x1241)
251 KB
251 KB JPG
Can I make a request for one of you to animate this.
>>
>>106429792
Troons are not a natural thing to train in your model. It would just get confused and swap out vaginas for dicks. Based Chinks.
>>
File: 1749180014633196.mp4 (812 KB, 640x480)
812 KB
812 KB MP4
lmao

the girl on an airplane turns and flies it into a tall skyscraper like the world trade center, and explodes into fire and flames.
>>
>>106429829
also...how was there already smoke before?

what did china know?
>>
any tips for achieving extreme angles with SDXL? like from directly below??
>>
>>106429748
right, this was well known already
>>
File: 1746756447262135.mp4 (975 KB, 640x480)
975 KB
975 KB MP4
>>106429829
im very curious why there is already smoke on these gens before the hit.
>>
>WAN
>spread legs
>weird elongated labia appears in the pussy
>>
File: 1733898213881145.mp4 (789 KB, 640x640)
789 KB
789 KB MP4
cool, it worked. wan is really good.

the miku hatsune statue changes from stone to color, and miku dives in the fountain behind her.
>>
>>106429910
I can't hold this in any longer. ITS HATSUNE MIKU.
not Miku Hatsune.
Nobody calls her but her first name first.

HATSUNE MIKU.
>>
>>106429845
Nah, model isn't even autoregressive apparently https://archive.is/kxb7V

Though this seems like an AI article. That would explain why the model still sucks at text.
>>
>>106429916
I know. but what matters is what the model understands. miku hatsune works fine, and probably vice versa I guess.
>>
so it's safe to say chroma v1 HD was a nothingburger? lodestone is already moving to a pixel diffusion model and the anatomy issues with v1 never improved.
>>
Give me your S2V gens. I want to see something creative and funny.
>>
>>106429779
jesus it was hard to get down that cliff, fuck king's quest. cool animation though
>>
File: 1749925928857258.png (62 KB, 2560x1440)
62 KB
62 KB PNG
>>106429995
the location was just a qwen edit swap. the source image was this:

but it's cool that it can make pixel art based on pixel art.
>>
qwen nunchaku lora yet?
>>
>>106429734
>>106429742 (me)
Here's some proof of literally me with that italian slut
https://files.catbox.moe/1e4d36.mp4

Auto-continue workflow from https://civitai.com/models/1818841/wan-22-workflow-t2v-i2v-t2i-kijai-wrapper
>>
File: 1737191538797465.mp4 (765 KB, 672x480)
765 KB
765 KB MP4
action!
>>
>>106430161
this was removed from the theatrical release, I seen it
>>
do some of you use wan 2.2 locally? what generation time you get for what gpu?
>>
>>106430222
chimolog.co/bto-gpu-wan22-specs/
>>
File: 1730540174051412.mp4 (901 KB, 672x480)
901 KB
901 KB MP4
the man wearing the blue shirt and beige pants has a fist fight with a muscular masked white man wearing a black tank top, who walks in from the right.
>>
>>106430222
it's very fast with the lightx2 i2v lora (2.1). my last gen was like 120 seconds, could be less if I turned off interpolation.
>>
File: wan22benchmark.png (2.34 MB, 1344x2118)
2.34 MB
2.34 MB PNG
>>106430225
>>
File: 1739522586872928.webm (1.28 MB, 672x480)
1.28 MB
1.28 MB WEBM
better big guy
>>
>>106430243
>>106430232
ty
>>
>>106430243
3060 12gb hanging there like a cockroach
>>
>>106429917
it is autoregressively modeled, that's what the director part is. It's just not doing everything in one part, basically.
>>
File: 1737808192315850.webm (1.17 MB, 672x480)
1.17 MB
1.17 MB WEBM
a muscular masked white man wearing a black tank top walks in from the right and throws the man wearing the blue shirt and beige pants on top of the plane.
>>
File: WAN_00024.mp4 (766 KB, 736x568)
766 KB
766 KB MP4
>>106429822
>>
File: 1739287677668232.mp4 (755 KB, 672x480)
755 KB
755 KB MP4
almost a proper big guy...
>>
>>106430281
We never die!!
>>
>>106430295
Noice. Cheers for that. Could I trouble you for some more?
>>
>>106430295
This >>106430357 is why you never feed beggars.
>>
>>106430368
Piss off anon. The guy who did that for me is a star, and if he doesn't want to do anymore, that's entirely up to him. The fact that he even did one has done me a great service.
>>
>>106430243
Wow, my card sucks for video: 4060 Ti 16GB.
I am going to need an upgrade.
What's with the 5090 having three variant models?
>>
>>106430400
5090
5090 with PCI-E limited to x8
5090 with a power limit of 450W
>>
>>106430368
i wanted to see if i can make her walk like a horror movie ghoul
>>106430357
no, sorry
>>
>>106430243
I'm surprised to see my 3080 10GB so high up on the list. it's dogshite VRAM is its real killer but
>>
File: 1735689414299896.mp4 (754 KB, 672x480)
754 KB
754 KB MP4
big guy, big toss

tried to go over the plane but...not that far
>>
File: 1733481989803285.mp4 (822 KB, 672x480)
822 KB
822 KB MP4
>>106430478
now he got over it.
>>
File: ComfyUI_00001_.mp4 (1.16 MB, 1280x704)
1.16 MB
1.16 MB MP4
impressed by wan2.2 5B, this might not be much by other people's standards but it's the best I've been able to produce so far, definitely a big improvement over my SVD results
>>
File: 1729555257294616.mp4 (714 KB, 672x480)
714 KB
714 KB MP4
last one. I like the flip.
>>
>>106430449
>no, sorry
No worries. Thanks anyway.
>>
>>106429669
Bravo
>>
>>106430521
The quants of the big model are better.

I am trying the S2V model with comfy workflow.
I am using a wan i2v video's last frame as the image input but it seems to jump from that image and it is not used as the first frame.
https://files.catbox.moe/ms3emk.mp4
>>
>>106430521
have you tried LTXV?
>>
So it seems like forgeui development is ded. Should I switch to comfyui if I want to use newer models like qwen or is there hope that better support for flux, qwen etc is coming to forge?
>>
>>106430443
>5090 with a power limit of 450W
Should be the safest way to run it.
>>
>>106430624
nah, overclock to pull 1000W
>>
>>106430637
and gain 5% performance yay
>>
whatever it takes
>>
File: 1740419452718146.png (1.07 MB, 1360x768)
1.07 MB
1.07 MB PNG
the man is wearing a black suit and brown dress shoes, and is inside a McDonalds restaurant during the day. The McDonalds logo is visible in the background and there are McDonalds cashiers taking orders. He is sitting at a table with dozens of Mcdonalds cheeseburgers, and is giving the thumbs up.

qwen edit test
>>
can someone try a sigma shift of 1 on wan 2.2?
I get completely unusable blurry mess and I don't get why
>>
>>106430656
he's sitting on his ventriloquist dummy
>>
File: 1737713049915628.png (1.13 MB, 1360x768)
1.13 MB
1.13 MB PNG
>>106430702
slightly better brah:
>>
>>106430597
Looks great! I'll try the quantized versions.

>>106430616
Yes, but not to any great success. That might be me doing things wrong though.
>>
best seg model for replacing backgrounds?
>>
>>106430243
>3060 Ti 8GB vs 4060 Ti 8GB
Yet they still had the audacity to release that card...
>>
File: 1745070571829135.png (1.32 MB, 1360x768)
1.32 MB
1.32 MB PNG
the man is in a Walmart produce section wearing a black suit and brown dress shoes. The Walmart logo is visible in the background. He is beside a shopping cart full of vegetables, and is holding a few tomatoes.
>>
>>106430597
Comfy needs to look at this issue.
Transition between 2 Wan2.2 videos by extracting the last frame is flawless (at 5s). Transition between Wan2.2 video by extracting the last frame to Wan S2V jumps (at 10s).
https://files.catbox.moe/j24o0f.mp4
>>
File: overbake.png (293 KB, 1824x1308)
293 KB
293 KB PNG
>>106430777
The problem has to be here.
>>
File: 1753775791802156.png (967 KB, 1360x768)
967 KB
967 KB PNG
>>
File: 1750824234941407.png (995 KB, 1360x768)
995 KB
995 KB PNG
>>106430903
>>
File: 1749669051019205.png (1.03 MB, 1168x888)
1.03 MB
1.03 MB PNG
>>
File: 1725212747219028.png (1.03 MB, 1168x888)
1.03 MB
1.03 MB PNG
>>106430956
>>
File: file.jpg (287 KB, 2016x1104)
287 KB
287 KB JPG
>>106430656
>Absolutely cooked his likeness and adds a bigass ahoge
Are you using a reference latent and an empty one?
Guess I'll run some plots tonight. Hell yeah.
>>
Are there any benchmarks on RAM timings and the effect on RAM-offloaded performance?
>>
File: AD_00007.mp4 (1.75 MB, 568x840)
1.75 MB
1.75 MB MP4
finally fixed my vram memory leak, feels good
>>
>>106430998
it's just default qwen edit with the 8 step lora. what are you using for that and how are you getting a higher res?
>>
think it, dream it, do it.
>>
So is nunchaku the fastest way to run qwen edit?
>>
TO PEOPLE FIGHTING COMFY PLEASE READ!

I will share a personal story of mine, hope it helps to anyone having struggle with Comfy(spoiler: it's not comfy)

Story:Was malding for weeks with ComfyUI trying to make book covers. Absolute nightmare, nodes made no sense, workflows wouldn't cooperate, was about to drop AI art as slop.

Tried InvokeAI, holy shit, night and day difference. It's Photoshop for AI. (layers, working controlnet/IP adapters, no node autism. Regional guidance is god tier!)

REVELATION: SDXL is actually good when you're not fighting ComfyUI's interface

Not shilling, personal exprience, just one anon to another who escaped ComfyUI hell, finally have actual control without the headache.
>>
I'm still constantly in dependency hell. Why the does the Comfy manager find the repo for a missing node but then just refuse to install it? I have to manually git clone and pip install everything.
Is this broken or am I missing something?
>>
>>106431512
Comfy is so potent when a workflow actually runs. But I feel like I spend 90% of my time being a node fixer and only 10% actually making images.
>>
Comfy it's great when someone makes a runpod template that works out of the box. The tool has huge potential, but the setup filters out anyone who just wants to make or edit images, im waiting for the day it becomes more plug and play.
>>
>>106429545
Anistudio is not good enough to be listed here, needs more dev time.
>>
Comfy just flew above my house.
>>
>>106431486
This, ComfyUI is not comfy for actual work, maybe for tech enthusiasts, but no for actual people who has to work and bring solutions and needs a stable UI.
>>
>>106431571
You can probably easily create a working workflow for any model then write a pretty and simply web ui only disclosing some of the stuff like steps, available loras, samplers and schedulers, all interrogating comfy in api.
>>
>>106431486
Invoke UI is actually comfy and the devs are alright. But they're lagging on new tech, still no svdquant/nunchaku, their upscaler is also slow and kinda jank.
>>
>>106431551
>but the setup filters out anyone who just wants to make or edit images
eh, I dunno about that
initial setup is pretty easy to just get going, yeah it'll be rough but you can ramp up really quickly with the help of llms
it's not like back when you had to track down every little thing to random forum posts anymore
>>
>>106431486
Just use AniStudio
>>
>>106431595
Thanks for answering, >106431486 this anon.

The thing that Comfy may be good for is generating a single, sporadic sunday saturday gen, but it's impossible and not reliable to use like a proper stable work tool, like Photoshop.
If you actually need to work alone, or you are part of a working team, Comfy is not good.
>>
did they say when would they release nunchaku for qwen image edit?
>>
>>106431627
i don't have dick girls fetish
>>
Quick question:
How much do you use AI diffusion for actual work?
What is your take on Comfy? I'm eager to listen to your opinions.
Not hobbyists, only people who work and need to bring solutions and need to share things with other people, who may or may not be a tech related person.
>>
>>106431629
wan first and wan never so QIE never
>>
>>106431598
>their upscaler is also slow and kinda jank.
People taking about speed and Comfy are missing the point, these aren't the same thing at all.

The canvas is actual visual creation, you're not prompting for "girl on a kitchen watching a plane over the window" and hoping the model gets it. You're placing things where you want it, building the image yourself.
You can't get that with prompts or even 50 node workflows.
>>
>>106431627
It just crashes for me
Doesn't work on windows nor ubuntu
At this point i think it got added to OP just to waste my time
>>
>>106431561
Dropped a turd no doubt
>>
>>106431677
I don't generate 20 images and pick the best, I make the best one myself.

This is what most of /ldg/ don't get about Forge and AI genning. They think it's "prompt + seed + nodes = gen" because that's all they see,
>>
>>106431692
The 'AI isn't art' and the 'xxxUI it is not a real UI' crowd would shut up quick if they understood you can use AI as an actual creative tool instead of a slot machine.
>>
why the fuck qwen image create blurry shit , especially the background
if you prompt "simple green background", it creates very noticable jpeg_artifact background. Is this the best t2i around?
>>
>>106431677
Exactly this.
Anons sperging about speed are retarded because it's not about pressing the generate for 2 seconds, it's about creating at the speed of your imagination.
ComfyUI can be "powerful" all it wants but it doesn't let you work this way. No proper layers, no regional control, no asset management, just nodes.
The real game changer its an UI that let you do what you want, that's the difference between creating and gambling.
>>
>>106431677
>The canvas is actual visual creation, you're not prompting for "girl on a kitchen watching a plane over the window" and hoping the model gets it. You're placing things where you want it, building the image yourself.
>You can't get that with prompts or even 50 node workflows.
>>106431692
>I don't generate 20 images and pick the best, I make the best one myself.
Explain without sounding mad how is this any different from krita plugin
>>
>>106429611
Wan has such great reflections. Glass that is cut or patterned will also reflect correctly, curved surfaces reflect correctly, it also does subtle reflections on surfaces that aren't glossy. Movement of water has a long way to go but the refraction works. It understands physical reactions of things like fabric and cushions. I think it would take a while for image models to catch up on stuff like this and by then video models will be even better. Images have a resolution advantage but video models work perfectly with image upscalers because they don't have compression artifacts or noise like real video. I loaded a 720p screenshot in topaz gigapixel demo and was amazed. Some anon posted a guide on how to use topaz models in workflows but I lost it.
>>
I don't give a flying fuck how you use your models, it's a tech board, it should favour tech solutions instead of artfag dumbed down niggercattle software.
>>
File: ComfyUI_00320_.png (1.78 MB, 1328x1328)
1.78 MB
1.78 MB PNG
>>106431733
>>
>>106431653
Have the three UIs Invoke, Forge and Comfy installed.
Cons: Comfy has no canvas, masking guidance is bad,
Pros: Touching Comfy for video stuff and models like Qwen that neither Forge or Invoke doen't have it yet.

Invoke and Forge:
Invoke supports drawing tablets with pressure sensitivity so I can draw directly on layers and process it, important for anyone who actually draws.
For actual image work Invoke or Forge if you don't draw destroys it.
>>
>>106431742
Krita plugin is duct taping AI onto a drawing program, you're dealing with two separate programs trying to talk to each other. Eats massive amounts of space and memory for all the backend you need running.
Invoke is purpose built for AI art, everything's integrated in one UI, model switching, controlnets, IP adapters, regional guidance. No forcing between programs, no broken connections, no compatibility problems.
>>
>>106431753
Sometimes the 'dumbed down niggercattle softwar ' is the better tech choice for productivity.

Question for you, tech religious anon: If the 'technical' solution prevents you from using the technology effectively, is it really the better tech choice?
>>
>>106431486
Will give it a try. I HATE comfyui.
>>
>>106431681
it got added by some faggot who wants to be julien's boyfriend or something. he was rushing page 1 bakes so he can insert his gay shit
>>
>>106431836
Yes, because I'm interested in the tech side. Which is why I'm posting on the tech board.
>>
>>106431653
People here never worker with an Art Director
>>
>>106431873
People here never worked in first place
>>
File: 00025-2482994666.jpg (130 KB, 1824x1248)
130 KB
130 KB JPG
>>106429844
"from below view" and low angle view" + mention the sky but nothing much of the ground surface. work best with illustrious/noobai models imo
>>
>>106431653
I use ComfyUI for the control and built workflows I was proud of but after starting a job where I use generative AI daily alongside meetings and other projects, I no longer have time to troubleshoot dependencies or obscure errors. That experience shifted my priorities, for my situation, speed and reliability matter more than maximum control.

I still reach for ComfyUI when I need custom logic or very specific control at scale, but for day to day tasks I prefer other UIs that are turnkey.
>>
>>106431486
On my setup, Invoke is noticeably slower, with SDXL and identical settings, ComfyUI generates in about 6 seconds while Invoke takes around 20. I also don’t have an art background, so the Photoshop style, layer based UI isn’t intuitive for me.
>>
File: AnimateDiff_00246.mp4 (2.43 MB, 944x720)
2.43 MB
2.43 MB MP4
>>106430956
>>
>>106431907
the only problems i've ever had was installing sage. everything else just werks. you are so full of shit, probably an actual paid shill for invoke
>>
>>106431816
>space and memory
Ah yes space and memory is famously limited among /ldg/ers
>No forcing between programs
What the fuck does that matter if it just werks
>no broken connections, no compatibility problems
Name top 5 compatibility problems. You can't, because it's just comfyui with a node that sends stuff to and from Krita. It's as simple as that.
>>
>>106431860
Fair point anon, I get you're here for the tech, respect that.
But tech discussion it's also about efficiency and tool optimization, or I am wrong?
>>
Rookie question here!


I'm training a chroma character lora on onetrainer, and after a few trainings it does not seem like it's properly grabbing the likeness.

I have 13 images, running it at 1e-4 adamw constant on a 1 batch run for a 32 rank lora so about 1200 steps. On the tensorboard logs, I keep seeing my lowest loss at around 1134 out of the training.


So my question is what do I do in this case? Do I keep continuing to train past the 1.2K steps and hope it converges, or am I running it too long?
>>
>>106431653
90% of diffusion use is for work, rest is testing and fucking around. I think comfy needs too many custom nodes and the memory management is terrible. It's great for creating some specific thing like x amount of graphic assets, but I wouldn't use it for iterative work or inpainting.
>>
>>106431956
This is my opinion as a proffesional worker that uses editing software as a daily basis.

When you're delivering to clients or working with teams, you can't depend on a third party plugin that might break between updates or have different behavior across setups.
The Krita plugin adds another layer of potential failure, you're troubleshooting Krita, ComfyUI, AND the bridge between them and the other persons, each update to any component can break the workflow.
Also sharing projects with a team when everyone needs the exact same plugin version, comfy setup, and pray nothing breaks between their configs it is really hard it thinks it is the difference between hobby tooling and professional tooling.

TLDR: For personal risk free projects Krita and Comfy it's fine, but for professional work where you need to share files with teams, maintain consistent workflows, or guarantee you can open a project months later, having everything integrated in one application matters and a lot, trust me.
>>
>>106428377
the qwen edit remove clothes nude lora doesn't work on wan2gp?
>>
I feel that comfy is a good tool for working with workflows, but a bad tool for working with images. For exactly same reasons I've settled with forge. Inpainting sketches ftw. I can just make whatever I want with minimal prompting with sdxl. Forge does not have layers, but I'm used to giving small edits in external sw (krita in my case).
>>
Morning /ldg/ lads I havnt made slop in a few weeks, if i update comfy is it going to break all of my wan 2.1 & 2.2 workflows?
Is it safe to update or should I just leave it alone?
>>
>>106432059
update anon, i gained +0.5 seconds on genning
>>
>>106429545
>AniStudio: https://github.com/FizzleDorf/AniStudio/tree/dev
Please Remove from OP

>Chroma: https://huggingface.co/lodestones/Chroma1-Base Training: https://rentry.org/mvu52t46
Please Remove from OP
>>
>>106432017
It's not reliable enough for one man project that spans over several months, I can only imagine the nightmare of having to share everything with a larger team
>>
>>106431261
I can never get them to blush like that it is always a more subtle blush.
>>
why am I not seeing more itty bitty titty lola bun?
>>
This >>106432074 and this >>106432059 I don't understand why here in all this places relies so much in Comfy, I understand all the edge tech babble, but as an UI is the least reliable of all.
>>
File: roller coaster.png (14 KB, 795x151)
14 KB
14 KB PNG
what is this thing called?
>>
>>106432076
i described it as a deep red blush
>>
File: and this.png (37 KB, 887x461)
37 KB
37 KB PNG
>>106432111
With this. Also, remember to show your daily affection to Panchovix, he need us.
>>
>>106432111
Honeydicking
>>
>>106432111
Mental illness. Don't touch that shit
>>
>>106432072
>tranistudio
remove

>chroma
no
>>
Comfyui is like having GIMP installed but only for GEGL and nothing else
>>
>>106432059
just backup your folder if you're worried
>>
>>106432131
>Do not touch the best REAL community based txt2img img2img UI ever made

People like you should either die or live cucked enough to pay for your API nodes.
>>
>>106432155
>just backup your 500GB folder bro
>>
File: AnimateDiff_00247.mp4 (3.15 MB, 944x720)
3.15 MB
3.15 MB MP4
>>106431931
the only two decent girls i could get out of 20 gens. wan seems to think gingers are supposed to be butt ugly
>>
>>106432164
Don't cry once your favorite troon gets his feelings hurt or 41% himself and your project stays in limbo forever
>>
>>106432167
take out the models folder and link it with a junction
>>
>>106432195
No, because I show support to my favorite troon dev :3
>>
>>106432167
you forgot to do weekly 500bs backup because Comfy updates every week
>>
>>106432118
Don't care about reForge. ComfyUI is better.
>>
>>106429545
what a fucking terrible collage
>>
>>106432283
>>
>>106432283
Leaving aside the video thing and latest year tech, how it is better?
>>
>>106432310
Modularity, aesthetics & automation.A111/forge/reforge/classic(same shit) is fine if all you care about is one off simplistic 1girl anime gens.
>>
What the fuck is Chroma v1.0-HD-rev-0.1?
>>
>>106432340
>1girl anime gens.
you forgot about regional prompting and forge couple
>>
so is qwen-edit supposed to change the face of the image you're editing or is something going wrong with settings/quants?
>>
>>106432310
>Leaving everything current aside how is it better

just lmao
>>
I really don't understand why comfyui triggers midwits so much
like why do you give a fuck? just use whatever you want for your 1girl slop
>>
>>106431900
thank you, I'll give it a shot. Might have been the ground surface thing doing me in.
>>
>>106432346
just more autism. I'm sticking to 48d and 49 until I see a new version that's a clear and unambiguous improvement with no stupid workarounds like changing resolution
>>
>>106432380
Because it's a bad ui. Going from comfy to ue5 blueprints is such a massive difference in quality of life and usability it's insane.
>>
>>106432380
its the same anon that shits on comfy every single day. reforge development being cancelled has made them shitpost harder now.
>>
>>106429674
>Just don't use anything for high. You only need 4 steps.
You are utterly retarded, you can't get away with just 4 steps without lora, it will look like fucking dogshit because the refiner only have a mush of blobs as reference, at least for 720p
>>
>>106432310
>Leaving aside the video thing and latest year tech, how it is better?


>Living in 2023
>>
File: WAN 2.2 I2V_00001(1).mp4 (571 KB, 480x832)
571 KB
571 KB MP4
anons, anyone share a faster workflow for wan? this shit took me 18 mins, on a 5080.. this is the wf Im using btw

civitai. com/ models/ 1818841/wan-22-workflow-t2v-i2v-t2i-kijai-wrapper?modelVersionId=2058285
>>
>>106432473
>5080
shouldn't have fallen for the muh features meme instead of getting vram
>>
File: 1730806132669984.png (1.36 MB, 1120x1440)
1.36 MB
1.36 MB PNG
>qwen lightning 8 step + lenovo ultrareal lora
I detect zero slopping in this gen. qwen can easily be deslopped.
>>
File: 1753642270007091.png (1.21 MB, 896x1152)
1.21 MB
1.21 MB PNG
>>106432522
>chroma 49
lol...
>>
>>106432397
>ue5 blueprints
it is a diffusion UI?
>>
>>106432549
It's a node ui like comfy.
>>
>>106432556
but works with all the diffusion models like comfy?
>>
>>106432576
better, you make AAA Vidya and Hollywood productions with it
>>
File: 1749297325109781.jpg (526 KB, 1024x1024)
526 KB
526 KB JPG
Lumina can handle two separate, distinct subjects without loras or regional prompting of any kind.
Quite honestly, i'm surprised by what this model can do. It seems to have better prompt adherence than flux based models if you want to create some completely outlandish shit too.
>>
>>106432611
I'm very hopeful for this model if they can finish training the artist styles into it and polish up pure tag prompts and anatomy
>>
File: 1750698494902643.jpg (986 KB, 1344x768)
986 KB
986 KB JPG
>>106432621
Yeah, if it gets Noob tier artist knowledge and better aesthetics, it's easily going to eclipse any existing anime model.
>>
>>106432533
Looks like you fucked something up
>>
>>106432611
>>106432621
>>106432654
Let me guess, censored?
>>
>>106432380
There is "someone" who wants users to use his UI over comfy and donate free developer time
Notice how the comfy shitposting always goes through the roof when that "someone" "stealth"-shills his UI again
>>
>>106432692
yet qwen handled the prompt just fine.

give it a try if you want to prove me wrong:
>Amateur digital photo with flash lighting shot in iphone 10 in the year 2010.
>A friendly hook-nosed Jew Orthodox Jewish Rabbi smiling a wide grin wearing a white Kippah hat, Payot sidelocks, and a small golden Star of David necklace. The Jew holds out his hand to shake hands with the viewer. In the background, cast by the flash lighting on the wall behind the Jew, the Jew's shadow is in the shape of the Jew threateningly holding holding a long sharp knife.
>The background is a dimly-lit and ornately decorated room.

funnily, both models wrongly interpret "flash lighting" as light from a flashlight
>>
>>106432522
i use same settings. better satisfaction than with chroma
>>
>>106432756
Try specifying that it is a camera flash
>>
>>106432736
no, it actually does have artists. they're just half baked
>>
What do you anons use for video upscaling?
I need preferably 1.5x or at most 2x upscaling to upscale 480p gens.
I don't need maximum quality, I need something that gives passable output, and fast, wan 2.2 already runs at snail pace on my vramlet GPU.
I am fine with two separate models if needed for realistic and cartoon/anime gens.
>>
>>106432787
cracked topaz chronos fast
>>
>>106432800

This, but honestly the upscaling doesn't do much besides make it look extremely fake IMO.
>>
>>106432783
This reminds me that I wanted to dabble in comics creation
Which of the current slop machines is best at being artistically pleasing while having some consistency or being good for lora training?
>>
>>106432736
Nta but it also isn't censored if you meant sex. And if op is talking about Neta-lumina.
Agreed, really wish someone would finish it, it's less than half-baked but already nice enough to still believe.
>>
>>106432800
How fast are we talking? Also isn't chronos an interpolator?
I am okay with what Film VFI node does, I just want some upscaling beforehand.
>>106432821
How so?
Is it bad for cartoon/anime gens as well?
>>
File: ComfyUI_temp_qtppj_00007_.png (1.91 MB, 1024x1024)
1.91 MB
1.91 MB PNG
The models for Kino creation locally are all available:
Base images: pick your poison.
Videos without dialog: Wan2.2
Dialogs: ChatterboxTTS
Videos with dialog: Wan S2V
Foley: Hunyuan Foley
Music: ACE, (Is there something better?)
>>
>>106432893
a UI to make kino without pulling your hair out doesn't exist yet
>>
>>106430972
is this gwen edit?
why does it blurry af
>>
>>106432974
? wan2gp is there for retards that can't use comfy
>>
>>106432974
You are going to have to use a regular movie editor to put all the things together, it goes without saying. I am using Shotcut and it's good enough.
>>
>>106432851
>How fast are we talking? Also isn't chronos an interpolator?
right, thats the second part of the workflow u will be using in topaz, the default upscaler is otherwise good, proteus i think
>>
>>106432893
what about image edit?
>>
>>106433049
Kontext of Qwen Edit should suffice. Let's say you need to change a character's clothes, or their expression. For each recurring character you are going to train a Lora, as you need to have the characters look consistent.
>>
Is the 480p wan2.2 looking good enough or should I go for 720p?
>>
>>106433200
wan 2.2 does both 480p and 720p in one model
>>
>>106433200
Good enough. Everybody will know that you are a vramlet, cardlet and poor; but it's really good enough.
>>
Alright do we really need to start doing this? Do we need to start showing PC specs in this place?
>>
>>106433288
Yes it's important, above all to filter AntiSomething schizos.
>>
>i heckin love telemetry!!!
>>
anons when loading loras into wan 2.2 workflow in addition to the light2xv distill, do I need to load 1 of the same lora for both high and low noise?
>>
>>106433523
maybe, depends
if it doesn't come with instructions then try it with both
>>
is wan first+last frame producing darkened results for anyone else?
using kijai if that makes a difference
>>
File: ComfyUI_00027_.mp4 (556 KB, 640x640)
556 KB
556 KB MP4
>>
File: WAN_FINAL_00002.mp4 (1.45 MB, 480x576)
1.45 MB
1.45 MB MP4
>>
>>106433820
could have just been an image
>>
I'm trying to migrate flux loras so they are compatible with chroma, and it's so fucking slow.
I guess the script is very non optimized because it uses no system ressources.
>>
>>106434006
Are you converting the flux loras somehow or just retraining from the same datasets?
>>
>>
>>106434197
converting whatever layer is compatible, I don't have the datasets to retrain anything
>>
>>106434310
What are you using and what settings?
>>
File: ComfyUI_00005_.mp4 (513 KB, 640x640)
513 KB
513 KB MP4
>>106434248
lol

What games do you play while waiting for gens to resolve? I mostly play Dungeon Crawl Stone Soup.
>>
>>106432522
The slop is subtle but still there.

>>106432533
>>106432756
You're not using Chroma Flash which would probably give a better result out of the box. The most correct looking, polished image is not necessarily an advantage, considering Qwen's limitations.
>>
File: WAN2.2_00257.mp4 (1.34 MB, 832x480)
1.34 MB
1.34 MB MP4
I can't get rid of the jump when joining i2v and s2v videos.
https://files.catbox.moe/enlhb8.mp4
>>
>>106434321
I use the script here :
https://github.com/EnragedAntelope/Flux-ChromaLoraConversion

I just have a script I run periodically that looks in my flux lora folder and converts anything >70 to chroma
>>
>>106434401
>The slop is subtle but still there.
Not that anon, but what exactly?
>>106434380
Mahjong.
>>
>>106434414
Face looks plastic. Flux LoRAs give a similar "realism" aesthetic. Real flash photos (which can only be reproduced by something like Chroma so far) have a certain texture/detail on the skin.
>>
>>106434006
Just finished.
It took me about 7h to migrate my 30 loras.
>>
>>106434380
I am on 3060. AI hogs a lot of resources with that one. Not sure if I can play much. I scroll the internet or watch shit. I watched Internet Historian's Ever Given video while that one was being baked.
Maybe I can play Solitaire or something else super light though.
>>
>>106434458
>Maybe I can play Solitaire or something else super light though.
my go to is snes emulation
>>
>>106434457
Can you clarify what you did to "migrate"?
>>
>>106434475
The script does that :
>The conversion is a four-step process designed to translate a LoRA's influence from one model architecture to another:

>Apply LoRA: The original Flux LoRA is merged into the Flux base model at full strength.
>Extract Difference: The script calculates the precise difference (the "delta") between the original Flux model and the LoRA-merged version.
>Apply Difference: This "delta" is then applied to the Chroma base model, effectively transferring the LoRA's changes.
>Extract LoRA: A brand new, Chroma-native LoRA is extracted from the modified Chroma model using SVD (Singular Value Decomposition).
>>
>>106434412
Does it work with both dev and schnell?
>>
>>106434380
>What games do you play while waiting for gens to resolve? I mostly play Dungeon Crawl Stone Soup.
Baldurs Gate 2 or any similar old game that takes virtually zero resources to run.
>>
File: ComfyUI_00028_.mp4 (306 KB, 640x640)
306 KB
306 KB MP4
>>
best model to generate 1guy of the architect generating plump 1girl prototypes?
>>
>>106434412
>>106434483
Interesting. Makes sense.
Can't we apply the underlying idea of transfer delta from model A to model B to pretty much any model combo though?
Shouldn't work well with CLIP trained loras but regardless might have some use.
>Compatibility Pre-Scanner: Analyze your LoRAs before conversion to get a detailed report and a UNet compatibility score, saving you time and effort.
Fascinating, I should actually read what this code does.
>>
>>106434512
I only migrated from dev, sorry, dunno about schnell.
>>
>>106434604
You can probably do that but my guess is that the more the models are different, the less layers would be converted.
>>
>>106434602
kek
>>
>>106434604
>Shouldn't work well with CLIP trained loras
Which sucks, since I'd love to transfer the massive bulk of SDXL loras into a modern environment.
>>
>>106430538
>>106430515
>>106430478
KEK/10
>>
File: 4chan.png (8 KB, 675x84)
8 KB
8 KB PNG
>>
what can I use to makes videos longer than 81 frames avoid repetition in wan2.2?
>>
File: furkd.jpg (45 KB, 647x364)
45 KB
45 KB JPG
well?
>>
>>106434923
why is comfy a habitual liar now?
>>
>>106434923
Furk will save local.
>>
>>106435015
I hope that grifting faggot will drop dead as soon as possible.
>>
>>106435020
Furk is kind of based desu.
>>
File: ComfyUI_00021_.mp4 (2.75 MB, 640x640)
2.75 MB
2.75 MB MP4
>>106434414
>>106434458
>>106434554
Ok nice, I should see if Diablo 2 still holds up.
>>
>>106435061
He's dumb as shit.
>>
>>106435015
holy shit he photobombed a screenshot lmao
>>
Do you guys edge to taesd previews?
>>
>>106435147
Smart enough to grift his way to at 5090. Can you say the same?
>>
>>106435167
And he got it in Turkey too, which is a secondary market compared to US/Europe, so even harder to get.
I'll give him that as a cockroach he is very resilient.
>>
>>106434999
because UI monopoly???
>>
>>106435188
clearly something is wrong since I still see auto filenames, wan2gp screenshots and invoke mentions instead of using this bloated shit
>>
>>106435167
That would require me to grift in the first place.
>>
File: 1.mp4 (1020 KB, 640x640)
1020 KB
1020 KB MP4
>>
File: 2.mp4 (901 KB, 640x640)
901 KB
901 KB MP4
>>
>>106434923
no problem in my ubuntu
>>
>>106435194
https://github.com/Haoming02/sd-webui-forge-classic/issues/121#issuecomment-3239339814
well there's this too now
>>
>>106435194
UI monopoly, cope with it, he will do whatever he wants, he dont have competition, he dont have to be the best or better than anyone. Perfect example of what any corpo or company or small company does when he dont have to compete
>>
>>106432654
>if
please god make this happen please dear lord please hear my prayer
>>
>>106435238
>anime girl pic
We are back
>>
>>106435238
The only thing with Forge Classic is that he is using an old version of Gradio and everything, some ControlNet preprocessors and Adetailer isnt fully fully compatible with it
>>
>>106434923
ITS A PYTHORCH PROBLEM YOU STUPID FUCKING CHUDS ITS LITERALLY UNFIXABLE
>>
>>106432473
lol i get a gen of that caliber in 4 minutes on my modded rtx 3060 24gb
>>
File: wintoddlersbtfo.gif (38 KB, 220x216)
38 KB
38 KB GIF
>>106434923
WINTODDLERS BTFO
>>
File: 4634563.png (1.77 MB, 1920x1080)
1.77 MB
1.77 MB PNG
>>106431486
Thanks anon, really helpfull, now im learning for the youtube tutorials from here https://www.youtube.com/@invokeai,
it is a very complete UIfor txt2img and img2img, has regional guidance and controlnet as if it were the most easy thing in the world and if you are a hobbyst it lets you import and work with ComfyUI nodes
Thanks for letting me scape the Comfy hell!
>>
>>106435516
Very organic post.
>>
ah fuck okay ill try out invoke again but it better be good this time!
>>
>>106429922
anon i can guarantee you the model understands hatsune miku above anything else. you are retarded.
>>
>>106435509
thanks for reminding me we're in /g/
>>
>>106435526
as organic as the posts that supports Comfy
>>
>>106435535
use it with this in mind "txt2img and img2img" and with a photoshop mindset, if you go to press the generate button use Forge
>>
>>106435571
i will keep that in mind, rakesh
>>
>>106435562
meds
>>
>>106435516
WTF, I can compare two images with a slider and I also have all these options in two sidebars???
Do you know that in Comfy you need an 8k megapixel workflow with infinite problems between nodes and bugs??
>>
File: 00026-717767929.jpg (196 KB, 1824x1248)
196 KB
196 KB JPG
>>
>>106435610
I like that you, entirely undeterred, keep genning in that style but are these actually known characters? Or is it just random stuff? Do you think of the poses yourself or use wildcards?
>>
>using invoke
>voluntarily giving up your privacy
https://www.invoke.com/privacy
''User-generated content data, such as prompts, conversation text, comments, questions, messages, images, works of authorship, and other content or information that you generate, transmit, or otherwise make available on the Service, as well as associated metadata.''
>>
>>106435207
Integration of the shadow requires a high IQ.
>>
>>106435650
the same as Comfy, also this is for the paid cloud API version, but they have a free local
>>
>>106435593
I agree, this is clearly better, you can download it here : https://github.com/invoke-ai/InvokeAI and it's free!
>>
>>106435659
nice try glowie
>>
>>106435659
>the same as Comfy
Are you alright? Mentally?
>>
>>106435682
>>106435682
>>106435682
>>
>>106429545
Any AI and workflow i can use to generate perfect sprite sheets?



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.