[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Some Models Next Week Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106848716

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: IMG_2311.jpg (74 KB, 934x2000)
74 KB
74 KB JPG
>>106851472
thanks this is my life's work
>>
File: 00080-1245755773.png (795 KB, 1224x768)
795 KB
795 KB PNG
>>
debo world
>>
>>106851479
thanks for the Korn lady Nicholas
>>
>lightx2v/Wan2.1-T2V-14B-CausVid
wtf are they doing, they release everything except i2v
>>
Blessed thread of frenship
>>
Blessed thread of frenship
>>
File: 00000.jpg (850 KB, 1440x2000)
850 KB
850 KB JPG
>>
File: pic2.png (1.73 MB, 2872x1566)
1.73 MB
1.73 MB PNG
New to this. First time training a chroma Lora and it came out pretty good in the sampler (ai-toolkit), and I’ve been experimenting with a simple workflow based on the default Comfy chroma template, adding only my Lora to the graph (picrel). Results have been good but inconsistent. I’ve been using a fixed seed so that I have reproducibility between all experiments. The settings in picrel repeatably make a VERY good representation of the subject. Like so good I think I could use it to train with… HOWEVER:

>If I change only the seed, results are often inconsistent in both facial retention, body shape, and other details.

>Even with the “perfect seed” if I make what I think are minor changes to the prompt (eg to change her position etc) I get similarly inconsistent results.

>If I remove the inconsistent typo (“She's in a relaxed pose with her right arm on her hip. Her She”) I lose some facial retention and get weird arm artifacts.

I noticed during my training that by the final epoch some samples were spot on and some were meh. Does all this point to my LORA being shit? Should I go back and retrain until all 10 training samples are perfect? I’m using the default training prompts from ai-toolkit. Is my prompting shit? It’s just cobbled together crap I found.

Given the workflow I’ve made, if my LORA is actually good what consistency should I be expecting? What other variables are at play in terms of locking in a body shape? The body from the current settings is something I’d like to bake in somehow (re-training with the generated images?) I don’t know yet how to explain the wild inconsistency. Any advice appreciated, sorry I can’t post the actual pix or lora. Using a runpod L40S.
>>
>>106851533
maybe end you're life
>>
>>106851498
>our model excels at producing coherent long-form videos

Wait, so is this actually long video? Is it finally here? Is this their aim? I'm interested to know their choice of 2.1 and causvid
>>
>>106851503
>>106851505
blessed threads of frenly zones ;D
>>
>>106851549
Forgot link https://huggingface.co/lightx2v/Wan2.1-T2V-14B-CausVid
>>
>>106851552
go back
>>
>>106851533
from what I found chroma in general tends to be very inconsistent and you need to reroll many times to get the right result. doesnt matter whether you use a lora or not.
>>
did that sadamoto yoshiyuki lora ever get posted somewhere?
>>
what is the difference between DC-2k and chroma-HD?
>>
Is it me or does Img2Img just not work with Chroma? I always get really bad results. Is there a trick?
>>
File: 00085-2175000283.png (949 KB, 1224x768)
949 KB
949 KB PNG
>>
>>106851564
>>106851564
Interesting... have you tried qwen-image? I trained a qwen lora with the same dataset and steps, and pictures and consistency and detail is pretty good but I haven't figured out how to get it to do the nsfw stuff well at all (which I understand is absent from its training?)
>>
>>106851570
wasn't good enough, have to re-tag and re-train
>>
>>106851588
no, because qwen is censored therefore you will need many NSFW loras to make explicit content where as with chroma you only need a lora for your character/person.
>>
>>106851533
There is no way in hell that Chroma or any model for that matter understand hip and breast measurement sizes.
>>
>>106851571
Everything with HD in its name is ass.
>>
>>106851533
Why do you fucks keep genning basic fat bitches? Can you rotate an apple in your head?
>>
>>106851615
>Can you rotate an apple in your head?
use case?
>>
File: 00193-2148697109.png (1.31 MB, 768x960)
1.31 MB
1.31 MB PNG
>>
>>106851615
>Can you rotate an apple in your head
what does this mean
>>
File: IMG_1487.jpg (1.61 MB, 1808x3216)
1.61 MB
1.61 MB JPG
>>106851620
>>
I just imagined anons 1girl in my head, rotated her 180 degrees, took her clothes off, raped her, and cut off all her limbs.
>>
>>106851622
if you cant it means you're an npc with no imagination.
>>
>>106851593
i really liked the look of the example pics though.
if the new version ends up looking quite a bit different, would you consider uploading the older version as well?
>>
>>106851615
maybe that's anon fat wife and he wants to see her in sexual poses though i dont know why he doesnt just ask her i mean she's right there wtf are you doing
>>
>>106851605

lol yeah, if I use this

>breast size A, medium waist size, large hips.

I get similar proportions but a totally different stance and a decent face

If I actually remove that line above entirely, I get a literal alien body horror show. WTF?
>>
>>106851637
>and cut off all her limbs.
Speaking of it.
Are there any remarkable guro/gore Qwen Edit loras, or Wan Loras?
>>
>>106851656
how about you go back to /b/? hmmm?
>>
>>106851615
Post a SINGLE gen you've made for the class to see your superior taste.
>>
>>106851607
so DC-2k is good?
>>
>>106851682
yeah
>>
>>106851561
>"We couldn't detect valid metadata in this image.
>Outputs based on this image must be PG, PG-13, or they will be blocked and you will not be refunded...!"
>>
File: 00194-4256795667.png (994 KB, 768x960)
994 KB
994 KB PNG
>>
>>106851639
I can't visualize shit in my head
wish I had this superpower, but it has nothing to do with imagination
>>
>>106851479
>>106851627
who IS this man??? ;o
>>
>>106851641
it had like 70% fail rate, overcooked with too low resolution. next version will be same but better so no worries
>>
File: 00195-3829580780.png (957 KB, 768x960)
957 KB
957 KB PNG
>>
File: ComfyUI_00025_.png (3.86 MB, 1536x2688)
3.86 MB
3.86 MB PNG
>>
File: Inpainted_00026_.png (1.46 MB, 1144x912)
1.46 MB
1.46 MB PNG
I'm enjoying qwen image edit, but is it possible to use it with a mask ?
I had big trouble inpainting this picture, qwen kept messing up with the characters wings, i had to use flux with mask inpaint back again.
>>
File: 1752570636594461.mp4 (794 KB, 800x480)
794 KB
794 KB MP4
>>106851615
I have aphantasia, cool it with the anti p-zombie remarks
>>
>>106851734
its ani
>>
File: radiance.png (3.15 MB, 864x1488)
3.15 MB
3.15 MB PNG
>>106851827
https://github.com/scraed/LanPaint?tab=readme-ov-file#example-qwen-edit-2509-inpaint
>>
File: 00196-3606899199.png (545 KB, 960x768)
545 KB
545 KB PNG
>>
>>106851533
jesus christ, im gonna have to get into AI image gen now. There simply aren't enough smooth plapable bitches to crank my hog to, Ill have to generate them.
>>
>>106851803
cool style. catbox?
>>
Catpiss-anon is wreaking havoc again.
>>
>>106851852
aside from the water coming from the ceiling light, this is a really aesthetic gen
>>
File: NetaYume3vs35.jpg (3.36 MB, 2512x1712)
3.36 MB
3.36 MB JPG
The difference between NetaYume 3.0 and 3.5 is a bit harder to define than 2.0 Plus vs 3.0, but I do think 3.5 is another modest improvement overall. Main thing I've noticed is eye proportions for both male and female characters make a bit more sense in 3.5, and it adds some nice relevant details in appropriate contexts where 3.0 didn't, like the sword here. Prompt (sans boilerplate / neg) was just `masterpiece, best quality, very aesthetic, a 2d digital anime illustration of a samurai warrior in traditional armor, standing in a cherry blossom garden.`
>>
>>106851938
desu artists are better in 3.5 IMO as well
>>
File: radiance.png (3.12 MB, 864x1488)
3.12 MB
3.12 MB PNG
>>106851926
ceiling mounted "rain showers" with some or many integrated lights - it could literally be this way IRL

sure: it maybe just dreamt this up, hard to tell
>>
>>106851938
I don't think you understand how these models work. Your scientific comparison is useless.
>>
>>106851938
looks quite a bit better in my subjective opinion. not just the eyes but the whole face looks way less sloppy
sword also breaks some of the obnoxious symmetry, but still way too much of that in this pic imo
>>106851983
sure but shes still wasting water with the other shower head in that case, unless maybe theres a dwarf standing under it just out of sight
>>
>>106851984
>I don't think
Of course you don't.
>>
File: 00197-2296826522.png (1.21 MB, 768x960)
1.21 MB
1.21 MB PNG
>>
File: 00101-1579212411.png (2.6 MB, 1248x1848)
2.6 MB
2.6 MB PNG
>>
File: radiance.png (2.89 MB, 864x1488)
2.89 MB
2.89 MB PNG
>>106851996
>sure but shes still wasting water with the other shower head in that case
seen that IRL too. quite a few women even apply soap/shampoo without turning their shower off at all, yes.
>>
Gemini or JoyCaption for wan captions?
>>
>>106852027
janus
>>
>>106851938
>boilerplate
do you prepend your positive and negative prompts with "You are an assistant designed to generate anime images based on textual prompts. <Prompt Start>" like the examples?
>>
>>106852027
grok
>>
File: ComfyUI_03025_.png (1.61 MB, 1024x1024)
1.61 MB
1.61 MB PNG
>>
File: ComfyUI_03031_.png (1.68 MB, 1024x1024)
1.68 MB
1.68 MB PNG
>>
File: ComfyUI_03039_.png (1.54 MB, 1024x1024)
1.54 MB
1.54 MB PNG
>>
>>106851974
yeah I haven't tried a ton yet. I still don't really understand his explanation of what 3.5 actually is on the Civit page lol, but it doesn't seem to be that important, whatever it is its not worse than 3.0 so whatever
>>
>>106851938
Testing now, but your pic like you said is a small improvement. I think Yume being honest that it is 3.5 is fair
>>
>>106851984
what? it was the same seed and same prompt and same sampler / scheduler settings, how else would you compare two versions of the same model lol
>>
File: radiance.png (2.61 MB, 864x1488)
2.61 MB
2.61 MB PNG
>>106852051
aesthetically pleasing
>>
File: radiance.png (2.59 MB, 864x1488)
2.59 MB
2.59 MB PNG
>>
>>106852074
nice eyes
>>
>>106852069
I'd rather he be careful and make small improvements like this than just YOLO train like some people do and wind up with like enormous seed variance between versions, meaning there'd be more likely noticeable new deficiencies in some area
>>
File: ComfyUI_03049_.png (1.74 MB, 1024x1024)
1.74 MB
1.74 MB PNG
>>106852074
ty radiance chad. I'm not having a ton of luck with radiance atm

>Prompt executed in 14.46 seconds
Chroma1-HD-Flash again. I think aura flow shift 3 makes it slightly better
>>
>>106852063
i think hes saying 3.5 isnt DPO'd which IIRC is something anon really wants in a model. no dumbass "human preference".
>>
>>106852112
Not a dig, but man these images look cooked broski. As if the CFG is set high.
>>
File: ComfyUI_03051_.png (1003 KB, 1024x1024)
1003 KB
1003 KB PNG
>>106852129
MFW CFG 1
>>
File: 00198-1680127058.png (996 KB, 960x768)
996 KB
996 KB PNG
>>
>>106852098
Yeah I like it, at least he is active on discord and we are getting model updates pretty fast. Did you prompt the sword?
>>
>>106852112
>"Computer! Add a ring to the creature's middle finger. The ring is of a shiny silver material and should have a noticeably large blue engraving of the Star of Remphan"
>>
>>106851803
niceu
>>
File: 1742557929558802.jpg (1.14 MB, 1248x1824)
1.14 MB
1.14 MB JPG
>>106851938
Artist styles are more pronounced, anatomy is better (not perfect, but better), seems to retain the creativity and prompt adherence.
Also tried quite a few more erotic prompts and it seems to grasp NSFW better too.
>>
File: file.png (2.87 MB, 864x1488)
2.87 MB
2.87 MB PNG
>>106852091
ty

>>106852112
>ty radiance chad. I'm not having a ton of luck with radiance atm
2d/3dcg 1girls (often still with flawed hands) are clearly among the best trained concepts right now in case you're trying to do something else. it's not a direct continuation of chroma-1 base, it doesn't know some other stuff AS well as it worked on pre-radiance chroma

cool demon hand on a globe!
>>
File: radiance.png (2.92 MB, 864x1488)
2.92 MB
2.92 MB PNG
>>
File: radiance.png (2.47 MB, 864x1488)
2.47 MB
2.47 MB PNG
>>
File: ComfyUI_00064_.png (1.85 MB, 960x1328)
1.85 MB
1.85 MB PNG
>>106852159
sank yew
>>
Yume doesn't matter because NovelAI exists
>But saas
The animeland is Illustrious or novelAI. Everything else is a failed experiment.
>>
You can come up with something better than that, anon.
>>
File: ComfyUI_03057_.png (1.64 MB, 1024x1024)
1.64 MB
1.64 MB PNG
>>106852175
>2d/3dcg 1girls (often still with flawed hands) are clearly among the best trained concepts right now
It works with a LoRA trained on HD pretty well. Need to do a sampler/scheduler sweep on radiance. Wish it was native in Comfy to queue all the options in a menu
>>
Not worth the effort since only one person uses it
>>
File: ComfyUI_03069_.png (1.74 MB, 1024x1024)
1.74 MB
1.74 MB PNG
>>
File: ComfyUI_03074_.png (1.45 MB, 1024x1024)
1.45 MB
1.45 MB PNG
>>
File: file.png (2.5 MB, 1024x1024)
2.5 MB
2.5 MB PNG
>>106852154
how does a star of remphan look? i guess this is a transformed israeli instead.
>>
File: file.png (2.51 MB, 1024x1024)
2.51 MB
2.51 MB PNG
>>
File: 1739400369003840.jpg (21 KB, 222x293)
21 KB
21 KB JPG
>>106852257
holy shit its perfect
>>
File: ComfyUI_03096_.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>
File: ComfyUI_03121_.png (1.6 MB, 1024x1024)
1.6 MB
1.6 MB PNG
>>
File: 00200-2254264571.jpg (431 KB, 2304x960)
431 KB
431 KB JPG
>>
File: ComfyUI_03130_.png (1.64 MB, 1024x1024)
1.64 MB
1.64 MB PNG
>>
is pony v7 as good as everyone expected it to be?
>>
File: radiance.png (2.96 MB, 864x1488)
2.96 MB
2.96 MB PNG
>>
File: radiance.png (3.07 MB, 864x1488)
3.07 MB
3.07 MB PNG
>>
>>106852279
a masterpiece.
>>
>>106852387
That fact that it got token discussion and wasn't mentioned until you brought it up should say everything about it.
>>
>>106851472
>>
File: ComfyUI_03153_.png (1.65 MB, 1024x1024)
1.65 MB
1.65 MB PNG
>>
badger badger badger badger
>>
>>106852282
good

>>106852418
yes, it used neural
>>
File: media_1760144786.png (1.3 MB, 768x1280)
1.3 MB
1.3 MB PNG
Here is your favourite elf!
Make sexy animations with her and you will be rewarded with more spicy pics!
>>
File: 1737750451330959.png (2.09 MB, 1080x1624)
2.09 MB
2.09 MB PNG
>>
>>106852446
I don't want to.
>>
File: media_1760144711.png (1.29 MB, 768x1280)
1.29 MB
1.29 MB PNG
>>
>>106851734
>who IS this man??? ;o
he is a namefag who goes by tenta who was doxxed on /b/ by his crazy ex gf for being too much of a pedo (serious)
he left /b/ and has hung out in /g/ ever since
he's probably one of the best schizogenners of all time if we're being honest. recognizable style over the years but always some weird liminal madness that felt like you had to be mentally ill to conjure it up

>>106851722
>I can't visualize shit in my head
>wish I had this superpower, but it has nothing to do with imagination
It's not a superpower, its what 95% of the world is able to do. I have aphantasia too. Is there 3 anons itt with aphantasia right now??
>>
File: 00203-3594939031.png (952 KB, 768x960)
952 KB
952 KB PNG
>>
File: 00101-1686565988.png (1.3 MB, 1224x768)
1.3 MB
1.3 MB PNG
>>
File: 1734077471905417.png (2.42 MB, 1080x1624)
2.42 MB
2.42 MB PNG
>>
File: wetelf.mp4 (1.43 MB, 640x1064)
1.43 MB
1.43 MB MP4
>>106852446
>>
cozy
>>
>took the time to learn comfyai
Ngl its not as bad as the people said it would be to learn. Is it one of those things where it gets annoying when you dig into it more
>>
>>106852669
It makes sense to programmers but for anyone who wants simple and easy to understand, it sucks because it's a regression from A1111 in that regard for their smooth brains which I am not insulting, it's just fact.
Most people haven't gotten click to install easy to work because the tools need to be build fundamentally so that a normal person on Windows can click an .exe, install, and then off to the races with buttons. Even A1111 was not like this but people got quite close, the main issue is the complexity doesn't scale well, which ComfyUI once you understand does well with workflows and how it does management of models and LoRAs.
>>
File: 00204-4291804499.png (622 KB, 768x960)
622 KB
622 KB PNG
>>
File: ComfyUI_00203_.mp4 (1.98 MB, 792x1320)
1.98 MB
1.98 MB MP4
>>106852446
>>
>>106852669
same feeling for me. hated it at first, but once I got the hang of it so many things were unlocked. Some custom node stuff is annoying but overall it is fun being able to experiment. Doesn't hurt that gpt5-thinking and grok4 are good at looking at workflows and giving feedback lol
>>
File: somevideo_noaudio.webm (823 KB, 640x640)
823 KB
823 KB WEBM
MMAudio is criminally underrated.
It's a nice copium while we don't get a true local Veo 3 or Sora 2.

https://files.catbox.moe/ls6f60.mp4
https://files.catbox.moe/742l9a.mp4


Reminder that Wan S2V is also a thing (but it is its own dedicated model unfortunately)
>>
File: 1754318122369966.jpg (309 KB, 1125x843)
309 KB
309 KB JPG
>>106852669
you learned for nothing, it's over. no more new local models. the api era is now
>>
File: 1753035525717844.jpg (894 KB, 1456x2128)
894 KB
894 KB JPG
stableslop > uncomfy noodle
>>
>>106852669
>Is it one of those things where it gets annoying when you dig into it more
Yes, slightly. Once you realize how powerful it is, the things that do not work suddenly become maddening. Not in the way the trolls portray it, but definitely in a
>oh my goodness this is so close to working, what is this node bug/UI quirk/python weirdness etc
I would much rather engage in spaghetti weaving than going back to A1111 style gens, but there is genuine frustration in the noodles.
>>
>>106852842
>pircel
I have the same expression when I look out my bedroom window and see those fucking NPCs crossing the street.
>>
File: 1731490117376888.jpg (1.17 MB, 1456x2128)
1.17 MB
1.17 MB JPG
I often wonder if NPCs are even capable of having dreams.
Or if maybe they're just living in ours.
>>
File: 1733695645956330.png (3.79 MB, 3064x1758)
3.79 MB
3.79 MB PNG
https://huggingface.co/spaces/wcy1122/DreamOmni2-Edit
>another snakeoil
it won't stop these days, can't stop taking those Ls
>>
File: media_1760144805.png (1.24 MB, 768x1280)
1.24 MB
1.24 MB PNG
>>106852539
Nice! Here is another pic.
>>
File: media_1760144759.png (1.22 MB, 768x1280)
1.22 MB
1.22 MB PNG
>>106852726
Sweet! Here is another one.
>>
>>106852446
>>106852467
>>106852539
>>106852726
>>106852997
>>106853004
>another avatarfag
jesus, it never stops in this place, fortunately that one is easy enough to filter out
>>
is there a solution for color correcting two clips so you can seamlessly edit them together? i have tried the comfy color match nodes and multiple video editor but none can do it. there is no way i can do it manually
>>
>>106853016
are you pretending to be retarded?
>>
>>106852207
Is it radiance or your prompting that always leaves a noticeable texture to everything?
>>
>>106853034
It's radiance.
>>
>>106853034
that's just the iconic Chroma noise
>>
File: 1751211935200659.jpg (1.25 MB, 1792x1792)
1.25 MB
1.25 MB JPG
>>106853022
He doesn't need to pretend.
>>
>>106853043
Damn, he always gens good 1girls but the texture makes it a bit off.
>>
>>106853022
>>106853058
>another avatarfag protecting his fellow avatarfag
color me shocked
>>
File: 1730823668536884.jpg (84 KB, 600x600)
84 KB
84 KB JPG
>>106853097
>another no-gen response
We're generating what we feel like generating.
What would you prefer to see, anon?
Maybe some clowns so you won't feel so out of place?
>>
>>106853118
>no-gen
that answer is /sdg/ coded, you need to go back
>>
>>106852771
https://litter.catbox.moe/ov1h3rgen4mhlysd.webm
>>
File: 1740377914283769.png (2.48 MB, 1800x1800)
2.48 MB
2.48 MB PNG
>>106853128
>implying
and you need to tongue my anus
>>
>>106853135
>you need to tongue my anus
I'm not a faggot like you so I'll kindly refuse
>>
>>106853133
true, MMAudio is criminally undorighted
>>
File: 1733959367374628.jpg (722 KB, 1024x1024)
722 KB
722 KB JPG
>>106853139
Cool, let us know when you want to contribute something besides salt.
In the meantime I will avatarfag as a green frog.
>>
File: and, sent.png (27 KB, 1001x214)
27 KB
27 KB PNG
>>106853144
what are you contributing? broken rules?
>>
>>106852669
ngl, anistudio is the best of both world when it's out. easy to install and components open up a lot of unexplored implementations. game engine design is the future because it can give you an abstract of something simple and give you a deeper autism than nodes can provide. you can even make nodes out of components. comfy can't beat that
>>
File: 1748040194013864.jpg (103 KB, 1024x1024)
103 KB
103 KB JPG
>>106853151
>literally threatening anon
Yes, please report me for avatarfagging as a green frog, go ahead.
Then scurry back to plebbit.
>>
What is it about this thread that attracts such buttmad troons?
>>
File: ComfyUI_00275_.mp4 (677 KB, 720x720)
677 KB
677 KB MP4
>>106853165
>he missed
pepe is just too pure for this world
>>
>>106853180
/sdg/ avatarfags (those "peopl" are the main reason /sdg/ died) want to kill this general too, they're like viruses, if you let them spread, it's over
>>
>>106853162
lol
>>
File: 1750515868514293.png (764 KB, 880x1184)
764 KB
764 KB PNG
>>106853206
How many similar gens am I allowed to post before it's 'breaking the rules'?
Pray tell.
>>
>>106853221
just stop avatarfagging nigger. you dont have to post the same image 100s of times.
stop shitting up the thread or are you too retarded to understand this?
just dont be a faggot.
>>
>>106853162
>>106852669
>ngl
>>
>>106853227
this
>>
>>106853021
>multiple video editors
This doesn't tell anything especially if they are freeware shit.
You need to learn the correct workflow. Match the blacks and whites first.
>>
File: ComfyUI_05784_.png (959 KB, 864x1208)
959 KB
959 KB PNG
>>
>>106853273
Now animate it.
>>
>>106853227
>post 3 gens of an anime character
>react to a retard with 5 amphibious gens
So a maximum of 4 similar gens before the avatar police are called, got it.
>>
>>106853162
a game engine with this stuff built in sounds infinitely better than whatever webslop we are using
>>
>>
File: 1757642791368247.png (2.85 MB, 1080x1624)
2.85 MB
2.85 MB PNG
>>
>>106853227
What's wrong with Avatar? You have inreresting psychosis xD
>>
https://litter.catbox.moe/zpak3566ec5dhkd6.webm
>>
File: 1751026715625983.jpg (209 KB, 1024x1024)
209 KB
209 KB JPG
>>106853381
>pixai
Your non-localfag gen has been reported to the authorities.
Enjoy your b&
>>
File: 1757611869636266.png (2.5 MB, 1080x1624)
2.5 MB
2.5 MB PNG
>>
>>106853457
VibeVoice + Wan S2V?
>>
>>106852771
No, it's trash even for just a copium.
>>
>>106853501
It does an okay job for stuff that doesn't have voice/vocals in my opinion
https://files.catbox.moe/uqc7o4.mp4
>>
>>106853144
https://litter.catbox.moe/0zkw5it7owl4h3f0.webm
>>
>>106853521
KEK
>>
someone wake me up the day local can do fucking MAD videos like this >>>/wsg/5995145
>inb4 what is "MAD"
newfag!
https://www.youtube.com/watch?v=fz_KNTsP0cQ
>>
File: 1757722963428666.png (2.79 MB, 1080x1624)
2.79 MB
2.79 MB PNG
>>
>>106853501
whoever said that it works for nsfw must've been smoking crack
>>
File: ComfyUI_00420_.mp4 (702 KB, 1280x720)
702 KB
702 KB MP4
>>
comfy being decidedly uncomfy again
>>
File: ComfyUI_00419_.mp4 (638 KB, 1280x720)
638 KB
638 KB MP4
>>
File: ComfyUI_temp_orpez_00001_.png (2.78 MB, 1280x1680)
2.78 MB
2.78 MB PNG
>>
File: 1755068728607896.jpg (1.14 MB, 1456x2128)
1.14 MB
1.14 MB JPG
>>106853652
moar cyborgs plox
>>
>>106853475
https://litter.catbox.moe/z4kcqhpchizxa77m.webm
>>
>>106853743
Can it do "normal" sounds, like a person starting a car and driving it in high speeds?
>>
File: 1736217818631355.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
What's the SOTA local for masked-region prompting?
>>
>>106853806
I've seen Qwen Edit inpaint loras (where it replaces what is masked in a particular color), look it up
>>
>>
>>106853206
Are those people in the same room with you right now?
>>
>>106853814
Neta anon, do you prompt for artists?
>>
>>106853825
yes
>>
>>106853817
you are in the room so yes
>>
File: 00017-1761883909.png (1.55 MB, 1536x864)
1.55 MB
1.55 MB PNG
>>106853206
boo!
also, i disavow all of this. we just want peaceful coexistence.
>>
>>106853837
What do you mean?
>>
>>106853817
what about me :3
>>
File: 1733535416639089.png (2.45 MB, 1536x1536)
2.45 MB
2.45 MB PNG
>>106853837
>still bickering about avatarfaggotry
Not everyone is a terminal /g/fag concurrent with the entire history of board drama...
Might I suggest putting a disclaimer in an easy to read place?
Something like a maximum of 4 similar images, as we discussed.
>pic unrelated
>>
>>106853872
>jeez, I wonder why that anon is so weary about avatarfaggotery, it's not like those guys killed a general (/sdg/) or someth... oh...
>>
File: 1742452037409966.png (2.42 MB, 1040x1560)
2.42 MB
2.42 MB PNG
>>
File: 1742750935636390.jpg (342 KB, 1024x1024)
342 KB
342 KB JPG
>>106853880
At least tell me how avatarfags 'killed it'
because last I checked it was just less quality than this thread
...and I just checked again, and it's still there?
>>
File: 1743094917439100.png (993 KB, 1280x720)
993 KB
993 KB PNG
>>106853812
Thanks, it looks neat but not quite what I'm looking for. I want text to image, but where the prompt changes slightly in different parts of the canvas
>>
>>106853893
try to make your best guess on why avatarfagging is against the rules
>>
>>106853872
>>106853893
cooked garbage gens
>>
File: 1731826787642774.png (2.05 MB, 1568x1568)
2.05 MB
2.05 MB PNG
>>106853898
Because we must remain trapped in a maze of confusion and never knowing who we're speaking to or else we might escape the matrix?
I seriously don't know, I'm also not avatarfagging either.

>>106853909
Why yes, I'm running SD1.5 on a smart refrigerator.
It's all I have.
Thanks for noticing.
>>
File: 1732148601284272.png (2.56 MB, 1040x1560)
2.56 MB
2.56 MB PNG
>>
File: ComfyUI_06886_.png (3.28 MB, 2560x2560)
3.28 MB
3.28 MB PNG
>>
glowie melty?
>>
>>106853004
https://litter.catbox.moe/yfs54isq57bbo9nz.mp4
>>
>>106853933
omg its mi-.. wait this isnt migu
>>
>>106853951
Sorry to disappoint lol
>>
File: 1742493014959867.gif (3.94 MB, 480x434)
3.94 MB
3.94 MB GIF
>click on /sdg/
>58 posts containing variations of approximately 5 different images.
Okay, I see what you mean now.
Newfag standing down...
Have a blessed evening!
>>
https://www.reddit.com/r/StableDiffusion/comments/1o3o1ax/rcm_sota_diffusion_distillation_fewstep_video/
>RCM : SOTA Diffusion Distillation & Few-Step Video Generation
is it better than the self forcing method (lightvx)?
>>
>>106854001
Nobody knows until we get an implementation because all bechmarks are lies.
>>
>>106854025
>implementation
it's just a lora you can just run it just like that?
>>
File: bro....png (34 KB, 250x144)
34 KB
34 KB PNG
https://rectified-cfgpp.github.io/
trust me bro this time we'll replace cfg bro, I know there's like hundreds of attempts that turned out to be snakeoils but this one is the good one bro
>>
>>106854060
>no comparison with original cfg++
>>
>>106854060
You logged my IP. What's the catch?
>>
File: 00120-2721347063.png (1.68 MB, 1024x1528)
1.68 MB
1.68 MB PNG
>>
>>106854077
>127.0.0.1
bro ... you are in MY HOUSE?!?!?
>>
File: ComfyUI_temp_urpdv_00003_.png (3.5 MB, 1680x1280)
3.5 MB
3.5 MB PNG
>>
>>106854112
WHAT?! MY IP IS 127.0.0.2 WHAT HAVE YOU DONE?!
>>
man, ive also read that this vietnamese fag decide to just train tags instead of NL because 'uwaaah, i didnt have anyone to review nl captions... so I just dropped them lol!'. I mean having gemini/gpt or even joycaption caption your shit would be better than to caption at all.
I fucking hate these subhumans
>>
File: IMG_20251011_100951.png (1.42 MB, 1248x744)
1.42 MB
1.42 MB PNG
>>
>>106854194
>I mean having gemini/gpt or even joycaption caption your shit would be better than to caption at all.
You think those don't need QC?
>>
>>106854206
you can have another LLM do the QC. Point is, even without QC, don't you think that adding non QC'd natural language captions would be better than no NL captions at all? Modern models have very good performance, man even fucking GEMMA 27b is good at it (except it doesnt do NSFW), so you're telling me that 95% good/passable NL captions are worse than 0%?
>>
I will never prompt as if I'm a VLM. You can't make me.
>>
>>106854194
>train tags instead of NL
good. neta works just fine with tags + minimal nl. anybody who thinks having to write a novel of gpt slop for a prompt is a good idea is subhuman
>>
>>106854194
Considering I have yet to get a decent answer for my question, I can see why.
>>
>>106854222
the point being you want the model to be able to generalize, if your dataset isn't evenly captioned like in this case, NL becomes less effective, while tags will be more effective.
retard
>>
>>106854222
trips of truth
>>
1boy, anonymous, fellatio
>>
>>106854232
>NL becomes less effective, while tags will be more effective.
now go ahead and explain why this is bad
>>
give me new prompt ideas
>>
>>106854248
get her on her knees and make her suck a dick
>>
I'm okay with 2 more years of SDXL until the saas replacement is created
>>
>lobotomizing a language to a short list of words is actually good
>>
>>106854248
Okay, hear me out...
2girls
>>
>>106854248
white skin, glowing eyes, halo
>>
>>106854256
thats easy with oral insertion lora, next
>>
>>106854243
>model losing the ability to actually follow NL is good!
kys, it's one of the selling features of neta lumine, I'll just go back to ill/noob
>>
>neta
Failed model, I accept your concession
>>
>noo sir you need to write a 10 page novel describing the texture girth, and smell of her penis instead of just writing large penis, veiny penis, futanari
>>
>her
>>
redpill me on netayume
>>
>>106854286
no
>>
>>106854286
Finetune of a model that wasn't finished baking
>>
>>106854286
It's alright.
>>
the fuck is peft and NaDiT and how do you update it?
Could not import 'NaDiT' from any of the paths: ['custom_nodes.ComfyUI-SeedVR2_VideoUpscaler.src.models.dit_v2.nadit', 'ComfyUI.custom_nodes.ComfyUI-SeedVR2_VideoUpscaler.src.models.dit_v2.nadit', 'src.models.dit_v2.nadit']. Last error: peft>=0.17.0 is required for a normal functioning of this module, but found peft==0.15.2.
>>
>>
I accidentally updated my comfy and now I see a really shitty looking new ui.
how fucked am I?
>>
>>106854334
make her do some sit ups and push ups.
>>
File: ComfyUI_06916_.png (3.01 MB, 2560x2560)
3.01 MB
3.01 MB PNG
>>
>>106854343
its over, uninstall, low level format your drives, sell the pc and just buy a comfy cloud subscription
>>
File: ComfyUI_temp_nraqd_00002_.png (3.03 MB, 1680x1280)
3.03 MB
3.03 MB PNG
Is SATA ssd good enough for models? I don't want to waste my last M2 slot until 4TB drives drop in price.
>>
>>106854385
unless you're swapping your ssd speed should only affect your model load times
>>
File: 2323.gif (666 KB, 320x320)
666 KB
666 KB GIF
I might be too fucking dumb for this, I cant get an image to generate
I install stable diffusion
dl wai and put it in my models/stable-diffusion folder
I run stable difusion in cmd prompt window, it opens in my web browser
I choose the model in the drop down menu and then type my prompts and click start
the bar moves but no progress is ever made
AMD gpu

what is wrong?
>>
>>106854311
Peft is a hugging face lora lib. The most foolproof way is probably to update the requirements.txt of the last node you've installed with the relevant version number and then do requirements reinstall.
>>
File: ChromaSadamoto_00023_.jpg (853 KB, 1128x1648)
853 KB
853 KB JPG
>>
>>106854419
>amd gpu
bro....
>>
>>106854661
asuka a shit.
>>
File: ChromaSadamoto_00024_.jpg (1014 KB, 1128x1648)
1014 KB
1014 KB JPG
>>106854669
no u
>>
>>106854419
>I install stable diffusion
wut? do you mean stable diffusion webui by automatic1111? because that's a outdated piece of shiet. switch to either comfyui or whatever fork of reforge people use now
>AMD gpu
uhh for amd you'll have to do some googling to figure out how to get your backend of choice working
>>
File: 1713319706206.jpg (88 KB, 750x1000)
88 KB
88 KB JPG
>>106854419
>he buy boughted the amd graphics
>>
File: 0000001.mp4 (3.22 MB, 832x832)
3.22 MB
3.22 MB MP4
>>
File: 0000002.mp4 (2.99 MB, 928x640)
2.99 MB
2.99 MB MP4
>>
>>106854803
>>106854788
embarassing
>>
>>106854812
nsfw lora fucks it up, turn it off
>>
File: 0000003.mp4 (3.26 MB, 928x640)
3.26 MB
3.26 MB MP4
>>106854813
They won't give me even an inch!
>>
>>106854818
does conan hit this bitch or not
>>
File: 0000004.mp4 (3.57 MB, 640x832)
3.57 MB
3.57 MB MP4
>>
>>106854818
>the pole

KEK
>>
File: 0000005.mp4 (2.66 MB, 832x640)
2.66 MB
2.66 MB MP4
>>
File: 0000006.mp4 (3.37 MB, 928x640)
3.37 MB
3.37 MB MP4
>>
>>106854419
Follow some guide if necessary and download comfyui-zluda:
https://github.com/patientx/ComfyUI-Zluda
>>
File: 0000007.mp4 (2.74 MB, 928x640)
2.74 MB
2.74 MB MP4
>>
File: 0000008.mp4 (2.18 MB, 832x640)
2.18 MB
2.18 MB MP4
>>
File: 1736681514115962.gif (382 KB, 698x500)
382 KB
382 KB GIF
>>106854827
>>106854835
>>106854839
>>106854844
>>106854849
literally better than all Sora 2 threads combined
bravo
>>
>>106854812
nice
>>
File: ChromaSadamoto_00036_.jpg (960 KB, 1128x1648)
960 KB
960 KB JPG
>>
>>106854849
wtf why did lodestone do this
>>
>>106854891
She'll be pulverized and then sprinkled across next radiance weights randomly. He found that 1girl sacrifice is the best way to steer his checkpoints.
>>
File: 251011-174550-wan5s_00001.mp4 (3.37 MB, 1200x1792)
3.37 MB
3.37 MB MP4
>>
>>106855016
nice finally some squats
>>
File: ComfyUI_00002_.jpg (364 KB, 1536x2112)
364 KB
364 KB JPG
>>
File: file.png (1.37 MB, 864x1216)
1.37 MB
1.37 MB PNG
>>106855126
>>
>>106855016
never skip boob day
>>
>>106855016
Finally, wan 5B
>>
File: 1754585646888688.mp4 (1.45 MB, 480x832)
1.45 MB
1.45 MB MP4
>>106855294
>>
File: 251011-191753-Wan5s 00001.mp4 (2.69 MB, 2000x1488)
2.69 MB
2.69 MB MP4
>>
>>106855476
>where do you think you're going geek boy?
>>
Pony status?
>>
Is there a chance of doing something with 8GB vram and 16gb ram? I've been using some pony model about a year ago, and it was somehow working. Did the technology advance?
>>
>>106855599
>can I do something with shit hardware
no, youre stack with XL derived models
>>
File: flat_chest_1.webm (3.87 MB, 832x1248)
3.87 MB
3.87 MB WEBM
>>106855016
FTFY
>>
>>106855628
can we have a middleground? i dont like tumors girls
>>
File: tiktok.mp4 (3.33 MB, 640x864)
3.33 MB
3.33 MB MP4
>>106855126
>>
What are you guys doing videos with? Still just WAN?
barely works on my 2070S...
>>
>>106855621
What are the baseline requirements for doing something like that?>>106855016
>>
>>106855639
damn I wonder when they will find this poor being's bones... if they will be able to identify her?
>>
>>106850096
> Those who tried using Qwen-Omni and uploaded real songs for it to describe know what I am talking about.
2.5 or 3?
>>
>>106855642
>2070
the vramlets thread is that way >>>/sdg/
>>
>>106855650
I mean it does work. It just takes half an hour for 5 seconds
>>
>>106855643
16gb vram and 64gb ram minimum if you dont want to kill yourself waiting for gens to finish. it's around 3 minutes for 5s video with this setup
>>
File: flat_chest_2.webm (3.87 MB, 832x1248)
3.87 MB
3.87 MB WEBM
>>106855633
>>
>>106855666
holy balloony
>>
>>106854194
Qwen/Wan do capturing with LLM anyway.
>>
>>106854343
pip install -U comfyui-frontend==1.23.4
>>
File: flat_chest_3.webm (3.87 MB, 832x1248)
3.87 MB
3.87 MB WEBM
>>106855633
>>
>>106854385
Good, but not enough. Try to move the hottest models/loras to nvme.
>>
Desperately need the GTA VI chad to post a catbox of one of his gens for that workflow!

Captcha: PAWGX
>>
>>106855708
O_o ( . Y . )
>>
>>106855657
Use light2x loras.
>>
File: 1.mp4 (3.37 MB, 928x640)
3.37 MB
3.37 MB MP4
>>
what is this /adt/ leak?
>>
File: 00078-1389261569.png (1.68 MB, 1224x768)
1.68 MB
1.68 MB PNG
>>
>>106855721
I'll clean the workflow first, it's filled with embarrassing custom nodes
>>
File: bad.mp4 (3.73 MB, 928x640)
3.73 MB
3.73 MB MP4
>>
>>106855916

Thank you, I appreciate you!
>>
>>106855916
please a lot of anons want to use your workflow
>>
>>106855963
i don't
>>
>>106855963
chroma is broken, gta6 anon cherrypicks his gens
>>
Taken me 4 days of genning to finally get the results I wanted, and even then it's not close to perfect. I can 3d animate the result I wanted in a day.
I give it another year before it can do what I want with just a few minutes spent on it.
>>
>>106855916
basterd bicth redeem the workflow
>>
Is there any benefit in changing these values? I'm noticing a lot of performance drop when this is working on higher res gens.
>>
>>106856130
tiling can help reduce peak VRAM usage
>>
>>106856140
During the entire process of the gen? I guess it will produce visible lines from the tiles?
>>
new
>>106856149
>>106856149
>>106856149
>>
>>106856147
no, vae encode/decode only happen when transforming pixels to/from latents, meaning only at the beginning and end
>>
>>106856155
Ah, alright, thanks.
>>
>>106855639
holy shit haha
good one
>>
>>106854129
waow
>>
>>106855937
i like it



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.