[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106851472

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
Do not use the GTA6 Chroma workflow
>>
File: file.png (38 KB, 666x466)
38 KB
38 KB PNG
What went wrong?
>>
Cursed thread of foeboat
>>
File: 00000-4049137034.png (981 KB, 1224x768)
981 KB
981 KB PNG
>>
>yet another collague of 1girls and favouritism
no one ever complains when ranfaggot posts 10 identical gens but when someone else does that it's a big no-no
>>
>>106856169
Pro Copyright
Closed source
If Pony team had been honest from the start and said "sorry, pony7 project failed because of x y z, we are uploading the model to huggingface so you can learn from our mistakes" people would have reacted differently
>>
>>106856181
I agree, collages should be varied. Thankfully different topics are discussed here. I do not understand wanting to homogenize the collage
>>
>>106856181
>his gens didnt get in the collage
>>
>>106856208
This is schizo posting, change your gen design, almost spam
>>
>mfw made it effortlessly into the collage
>>
>crying because your slop wasn't chosen for the completely arbitrary fagollage
>>
File: file.png (2.16 MB, 1328x1328)
2.16 MB
2.16 MB PNG
>>106856181
>>
pip install -U comfyui-frontend-package==1.24.4
Rgthree for the status bar.
>>
>>106856169
Meanwhile I'd like a proper historical excursion not a meme answer. Was Animagine that bad or was it a nsfw issue that made Pony v6 win? It was the best of the worst due to blind luck and he didn't have any real knowledge or wit to make another banger, is this correct?
>>
What are you going to do when your boss is asking did you get represented in the /ldg/ collague? Going to be pretty uncomfortable situation on Monday.
>>
>>106856181
> >yet another collague of 1girls and favouritism
/adg/ hijack
>>
My Comfy gallery view dissapeared how doI activate?
>>
File: 1754005447906002.mp4 (1004 KB, 480x720)
1004 KB
1004 KB MP4
>>
>gen all day at 860p just fine
>swap to just first frame then pic the last frame of that as first frame and original as last frame
>ooming
>>
>>106856265
let me guess, kijai's nodes?
>>
>>106856238
They should split CumUI into two versions: basic, and 'cutting edge'. That could be easily doable because it's node-based and thus modular anyway.
Those who want microtransactions and handholding can use Cutting Edge and enthusiasts can use the Basic version. Basic means it's like the original version, no bullshit.
>>
>>106856284
im sure there are options to disable ai nodes, im not sure about templates
>>
>>106856225
based i kneel
>>
>>106856283
Yes. I'm also getting an error of it being unable to find an image file, which I'm not using.
>>
>>106856289
Yes but I didn't mean this at all.
>>
>>106856284
you product-think like a pure consumer. the source is freely available.
>>
WHY ARE YOU OOMING, CUNT
YOU LITERALLY GENNED 30+ 860P VIDEOS ALL DAY WHILE I WAS EVEN GAMING
>>
>>106856363
time to upgrade to rtx 6000 pro goy
>>
>>106856240
There were also, remarkably, artists in v6. They were obfuscated, and yet, they worked natively and conflicted with the rest of the prompt much less than loras did. This allowed for styles, including nsfw styles, because frankly, nsfw was rampant in aom shitmixes, too, but styles weren't, so it was mindbogglingly samey.
>>
how is upscaling 480 compared to genning native 720? i really want longer videos like 10+ seconds but i cant do that at 720
>>
File: 00001-2710591822.png (1.2 MB, 768x960)
1.2 MB
1.2 MB PNG
>>
is there a guide for video generation?
Whats the best model and method for a 16GB GPU?
>>
Give me the latest WAN Animate workflow

Why do hands get red after 20seconds in to the generation?
>>
>>106856377
>rtx 6000 pro
looks affordable to me with every passing day
>>
>>106856392
Sovl, hope you see you in the next collague
>>
File: 00002-1601625212.png (714 KB, 768x960)
714 KB
714 KB PNG
>>
Any method to get rid of jerky moves?

https://files.catbox.moe/21prxk.mp4
>>
>>106856284
They don't want it. Even with 1.24 there is a big warning in the log, twice.
>>
they were shilling chroma before...
now they shill radiance and neta
>>
>>106856489
perhaps fuck off to /sdg/ if you want to see old, stale models
>>
>>106856461
Based, sincerely hope you get in the collague.

And I think that /ldg/ needs /sdg/ anons posting here.
Your thread is stuck spamming 1girl gens and benchmark posting. /sdg/ could fill the gap with more artistic stuff.

Look at your collages, just 1girl center frame everywhere.
I get it is a male hobby and website but it shows how limited the vision is past benchmarks and coomer slop. /sdg/ anons would bring the variety this thread is missing.

Fix your Debo Lumi relationship please.
>>
>>106856559
Kill yourself
>>
>>106856535
In /sdg/ we are using Chroma 2k and Krea, not Qwen yet
>>
>>106856559
kys
>>
File: ComfyUI_05867_.png (1.19 MB, 808x1288)
1.19 MB
1.19 MB PNG
>>
>>106856616
>woodnt
belly looks strange
>>
File: file.png (1.89 MB, 1328x1328)
1.89 MB
1.89 MB PNG
>>106856571
>>
I WAS EVEN GENNING AT 960P WITHOUT OOM

DOES THE COMPLEXITY OF THE IMAGE MATTER??
>>
>>106856616
hmmm why isnt her figure brown like the box? kuro illya bros???
>>
File: file.png (2.09 MB, 1328x1328)
2.09 MB
2.09 MB PNG
>>106856633
>>
File: ComfyUI_05868_.png (1.23 MB, 864x1208)
1.23 MB
1.23 MB PNG
>>106856622
>>106856636
>>
>>106856633
>>
>>106856647
>>106856731
Thank you for the support, it is no longer OOMing.
>>
>>106856746
did you buy the rtx 6000 pro?
>>
>>106856746
for now
>>
>>106856707
better
>>
>>106856753
I have to.

>>106856758
It worked for one gen, then it stopped again.
>>
>>106856181
Go back
>>
Is there a lora for putting characters inside a car (viewed from the outside)?
>>
>>106855721
https://civitai.com/models/2035131/chroma-workflow-with-simple-upscale-sharpening
+ gta6 lora
https://civitai.com/models/2035113/chroma-lora-gta6-art-style
>>
File: Jalter sampler testing.jpg (3 MB, 6390x5983)
3 MB
3 MB JPG
Hello, I tested some samplers and schedulers for Neta Lumina v35, hope it helps! Thanks @/ldg/ neta lumina anon for sharing workflow with prompt structure.
>Prompt:
You are an assistant designed to generate anime images based on textual prompts. <Prompt Start>

1girl, solo, jeanne d'arc alter \(fate\), white hair, yellow eyes, short hair, small breasts, hair between eyes, armor, cape, black cape, fur trim, fur-trimmed cape, fur collar, gauntlets, disgust, frown, looking down, looking down at flower, holding flower, white flower, upper body, from side, profile, standing, fire, fire on hands, smoke, burning flower, outdoors, field, white flower field, sky, day, cloud, blue sky, cloudy sky, house, building, town, grass, mountain, castle, masterpiece, best quality, highly detailed
A side profile view of Jeanne d'Arc Alter standing in a vast white flower field on a clear day. She wears imposing black armor with a fur-trimmed cape flowing behind her. Her short white hair frames her yellow eyes as she gazes downward with disgust at a white flower in her gauntleted hand. Flames erupt from her fingers, engulfing the delicate bloom as smoke curls upward. The peaceful countryside stretches behind her with a distant town and castle nestled against mountains under a cloudy blue sky.

>Negative Prompt:
You are an assistant designed to generate low-quality images based on textual prompts <Prompt Start>
ai generated image.blurry, worst quality, low quality, watermark, twitter username,

Model:
netayumeLuminaNetaLumina_v35Pretrained, Seed: 755103215, Steps: 25, CFG Scale: 5 Res: 1024*1536

Conclusion: Karras Scheduler does not work, and DPM++ 3M SDE samplers are unstable.
Res Multistep and Euler sampler (not ancestral) gave the best quality across all schedulers
PNG: https://files.catbox.moe/3j2n5c.png
>>
File: 00005-940736407.png (1.12 MB, 1152x896)
1.12 MB
1.12 MB PNG
>>
File: ComfyUI_05869_.png (1.14 MB, 896x1160)
1.14 MB
1.14 MB PNG
>>
>>106856995
Basado!
>>
File: IMG_1487.jpg (1.61 MB, 1808x3216)
1.61 MB
1.61 MB JPG
>>106856181
>>106856454
>>106856392
>>106856461
week long spergout about nigbo getting repurcussions nick? same time next week??
>>
File: ComfyUI_05871_.png (1.04 MB, 1040x1000)
1.04 MB
1.04 MB PNG
this qwen shit is highly addicting I must say
>>
File: 00006-1257878491.png (1.57 MB, 896x1152)
1.57 MB
1.57 MB PNG
>>
File: ComfyUI_05873_.png (1.07 MB, 1232x848)
1.07 MB
1.07 MB PNG
>>
>>106857130
based
>>
File: 1722679514338.png (56 KB, 658x266)
56 KB
56 KB PNG
>>106856997
Nice, I was planning to test v3.5 today. Usually gen with these.
>>
>>106856257
damn that's pretty good, how'd you get it do be so coherent? controlnet?
>>
File: 00007-1211459248.png (2.02 MB, 1152x896)
2.02 MB
2.02 MB PNG
>>
>>106856860
?
>>
>ran is committed to moderating this thread
>he thinks he can order people on anonymous imageboard
>ironically he is the biggest namefaggot ever graced this website whatsoever
>>
>>106857237
This is really nice anon you've inspired me to try branch out from 1girl coombait, unironically
>>
File: ComfyUI_05876_.png (1015 KB, 792x1312)
1015 KB
1015 KB PNG
>>106857155
check this one out kek
>>
>>106857272
but 1 girl coombait is all I live for
>>
File: 00008-749540045.png (1.97 MB, 1152x896)
1.97 MB
1.97 MB PNG
>>
>720p, 121frames, 8steps, smoothmix
>50min to gen 4x videos
>5090

Doesn't seem right.
>>
>>106857351
>smoothbrainmix
lol
>>
I'm not liking v35, will gen some other in v4 test (which was comparable to v3) to see if maybe my expectation were too high or v35 is truly garbo as I suspect
>>
File: 00009-1143345746.png (1.65 MB, 896x1152)
1.65 MB
1.65 MB PNG
>>
>>106856477
how did you make this?
>>
>>106857322
I mean, same, it's been my only real hobby for the past couple years, kind a pathetic D E S U
>>
wan loras for t2v are actually kinda sick
>hailey rose body lora, 0%, 85%, 50%
>>
>>106857563
>me in the back
>>
File: 1167752492635496281-NEO.png (2.64 MB, 1344x1728)
2.64 MB
2.64 MB PNG
https://files.catbox.moe/a6e7a6.png
>>
>if the penis stroke ends at the bottom of the shaft and you start with the last frame the stroking is now going to move in reverse

REEE
>>
>>106857563
lora link?
>>
>>106857594
i have no idea how to share it, i just baked it
>>
>>106857592
>penis stroke
gay af
>>
>>106857612
bruh.. mediafire/catbox something man.. what else is in your bakery?
>>
>>106857580
needs to be hairier
>>
File: 1167752492635496287-NEO.png (3.07 MB, 1344x1728)
3.07 MB
3.07 MB PNG
>>106857580
>>106857639
My bush preference is light
>>
I'm not even using this shit but I have it in the folder, why is it pestering me about it?

>>106857617
>>>/gif/29608974
Say that to her face.
>>
>>106857622
mostly flux stuff, a qwen lora. i'm gonna see if i can get this uploaded somewhere, it's like 500mb
>>
File: 00010-846569074.png (1.95 MB, 1280x768)
1.95 MB
1.95 MB PNG
>>
>>106857656
nice! do try
>>
>>106857622
https://www.mediafire.com/file/jh4m8oeog4k8obk/WAN.zip/file

post some gens, keyword is psxhr/ohwx_woman
>>
File: 251012-005534-wan5s_00001.mp4 (2.56 MB, 1200x1792)
2.56 MB
2.56 MB MP4
>>
The people uploading stuff like this, I imagine them as a down syndrome poojeet gooner. Two braincells that has inbred themselves to a massive 25 braincells.
>>
File: 1754770157287493.png (653 KB, 1080x1080)
653 KB
653 KB PNG
>>106857754
I find it funny personally
>>
File: Video_00008.mp4 (759 KB, 544x704)
759 KB
759 KB MP4
>brought to you by underarmour
>>
hola buenos días tengo un problema se como descargar juegos rpg en mi computador Mac me cuesta bastante lograr que funcionen o nisiquiera puedo eh intentado meter software como linux y Microsoft pero no lo eh logrado aun cuando intento meter códigos en terminal y la verdad mi experiencia es nula en todo lo que se trate de programación quisiera saber que puedo hacer para lograr jugar juegos rpj como misao o yume Nikki espero que alguien vea esto y me ayude gracias por leer
>>
File: WAN2.2_00232.mp4 (1.89 MB, 544x960)
1.89 MB
1.89 MB MP4
>>106857781
whatcha think?
>>
File: 1663130674349.png (375 KB, 512x512)
375 KB
375 KB PNG
>>106856149
what site can i go to ai filter images where the shapes are still recognizable but the context is something else?
like how pic related has fruits but we all know it's daily dose
>>
File: WAN2.2_00233.mp4 (3.12 MB, 544x960)
3.12 MB
3.12 MB MP4
>>106857666
>>106857781
>>106857870
I like it!
>>
File: Jalterizing Chroma.jpg (1.42 MB, 5064x1120)
1.42 MB
1.42 MB JPG
>>106856997
Testing Chroma 2k + GTA6 lora with similar prompt! I attach img with workflow, seems good for 2d ilustration
Left: verbose tags with ClaudeAI
prompt
https://files.catbox.moe/zq6r53.txt
workflow
https://files.catbox.moe/gyx02x.png

Middle: Only Dan Booru Tags as used with Neta Lumina testing
prompt
https://files.catbox.moe/eo7n0o.txt
workflow
https://files.catbox.moe/feyip1.png

Right:Only Prose caption as used with Neta Lumina testing
prompt
https://files.catbox.moe/4on479.txt
workflow
https://files.catbox.moe/w7samx.png

Original Workflow and Lora >>106856995
Changed the resolution to 1536*1024
>>
File: chroma_flux__0074.png (1.48 MB, 832x1216)
1.48 MB
1.48 MB PNG
>>106857897
>>106857870
really nice, i should figure out how to post this shit to civit i guess
>>
File: Video_00010.mp4 (1.48 MB, 544x704)
1.48 MB
1.48 MB MP4
>>
File: WAN2.2_00235.mp4 (3.8 MB, 544x960)
3.8 MB
3.8 MB MP4
>>106857897
>>106857913

its really no effort desu.. really curious to know what else youve cooked up now! This lora works really well!
>>
>>106857913
Gross, Michael.
>>
https://github.com/leejet/stable-diffusion.cpp/pull/877
ani will be eating good soon. lumina when?
>>
>>106857910
Interesting
>>
>>106857945
theres always money there!
>>
>>106857948
Thanks! I think Chroma2k has better composition and image quality overall. Flowers, city background, and shaders are all cleaner. NetaYume knows anime characters better but for me Chroma2k wins on quality at least in my test.
>>
All advances in non-1girl slop will eventually employed to make even better 1girl slop. This is the fundamental nature of AI.
>>
>>106857666
>>106857897
What does the lora do? Is it only for t2v?
>>
Anyone tried this wan lora yet?


>Scales up continuous-time consistency distillation (e.g., sCM/MeanFlow) to 10B+ parameter video diffusion models.
>Provides open-sourced FlashAttention-2 Jacobian-vector product (JVP) kernel with support for parallelisms like FSDP/CP.
>Identifies the quality bottleneck of sCM and overcomes it via a forward-reverse divergence joint distillation framework.
>Delivers models that generate videos with both high quality and strong diversity in only 2~4 steps.


https://huggingface.co/Kijai/WanVideo_comfy/tree/main/LoRAs/rCM
>>
>>106857941
i made some flux loras for body types (hailey rose, angel youngs), a film style lora, a "Real Face" lora (still get chinmogged), a nipple lora which is iffy. then this week i made qwen and wan versions of some of those to experiment with.

https://files.catbox.moe/l2xzmb.png
>>
File: WAN2.2_00238.mp4 (2.61 MB, 544x960)
2.61 MB
2.61 MB MP4
>>106858032
it's a body type lora. Works on all I believe, I've only tested T2V & T2I
>>
>>106858032
it's a body type lora, so it should apply her body type to your images, her face was removed from the training set. at higher strengths it will apply her hairstyle as well. trained for t2v, but you're free to try it with i2v and see what happens. examples are here:

>>106857563
>>
File: 00080-3859159613.jpg (1.66 MB, 1728x2160)
1.66 MB
1.66 MB JPG
>>
File: WAN2.2_00240.mp4 (2.58 MB, 544x960)
2.58 MB
2.58 MB MP4
>>106858086
>>106858089

The catbox is damn good! That's Qwen I'm guessing?
>>
>>106858100
curious how you remove her face, crop it? erase it and leave a blank space?
>>
>>106856559
commit un-alive
>>
>>106857947
I am starting to see ani has his opening to catch up. there hasn't been anything close to dethroning alibaba and I doubt something will be by eoy. really hope people start bringing stuff over/optimizing ggml since contributing to comfy just feels like shitting on a pile of shit
>>
File: WAN2.2_00242.mp4 (2.52 MB, 514x946)
2.52 MB
2.52 MB MP4
>>106858133
>>
https://vocaroo.com/1niDlo3PQdpg
>>
File: qwen___0002.png (1.23 MB, 832x1216)
1.23 MB
1.23 MB PNG
>>106858133
catbox is wan, posted image is qwen (AY-lora). are you using any loras for your vids? they are really sharp

>>106858137
draw a black square over the face, then in the caption add "ohwx_woman with a black square over her face"
>>
File: WAN2.2_00244.mp4 (3.37 MB, 544x958)
3.37 MB
3.37 MB MP4
>>106858200
I use the lightx2v lora and have a sharpen node

this however is using a vhs lora
>>
File: x.png (1.46 MB, 1224x768)
1.46 MB
1.46 MB PNG
>>
You have made my day, sir. :)
>>
>>
>>106858234
amazing that you got them to stay still and not talk
>>
File: Video_00014.mp4 (2.86 MB, 544x960)
2.86 MB
2.86 MB MP4
>>106858234
i think i got the sharpener working. thx m8
>>
File: WAN2.2_00252.mp4 (3.46 MB, 544x958)
3.46 MB
3.46 MB MP4
>>106858400
hehe, this was the 2nd attempt, they did babble in the first one

>>106858416
>>106858234
Yeah, looks much better!
>>
File: Video_00015.mp4 (2.42 MB, 544x960)
2.42 MB
2.42 MB MP4
>>106858436
slow motion fixed too, gonna train some more loras
>>
Is there some massive image dataset I can download to train my model from scratch?
>>
>>106858538
yes, scrape the entire internet, godspeed
>>
File: WAN2.2_00260.mp4 (3.55 MB, 960x536)
3.55 MB
3.55 MB MP4
>>106858436
>>106858530
Looking forward to whatever you bake next! Only way to follow you would be on Civit, my suggestion would be you try and upload there!
>>
>>106858530
what a cutie
>>
File: ComfyUI_00427_.mp4 (1.7 MB, 832x832)
1.7 MB
1.7 MB MP4
can't seem to get the pumpkins to light up individually at random, it's either all or nothing
>>
File: chroma_flux__0034.png (1.07 MB, 832x1216)
1.07 MB
1.07 MB PNG
>>106858571
yeah i'll make some more and figure out how to get 'em on civit

>DirtMcGirt
>>
>>106858571
this is pretty cool
>>
File: Video_00018.mp4 (1.63 MB, 544x960)
1.63 MB
1.63 MB MP4
>>
>>106858584
cool. creepy but cool.
>>
File: WAN2.2_00263.mp4 (2.35 MB, 608x608)
2.35 MB
2.35 MB MP4
>>106858571
>>106858623
thanks, try out some creepy stuff

>>106858617
godspeed!

>>106858631
this is super sharp and clear!
>>
radiance bro(s), it's t5 like chroma, but I don't get how to prompt it. and why does it make my 4090 chirp like a time bomb countdown?
>>
ram isn't that important when you have 32gb vram right?
I should be good with 64gb ram?
>>
>>106858416
nice please more of her
>>
File: 1739778686575858.png (110 KB, 432x369)
110 KB
110 KB PNG
>>106858665
yes, as long as you allocate enough time in your day to sunlight and fresh air.
maybe a shower
>>
File: file.png (2.33 MB, 1328x1328)
2.33 MB
2.33 MB PNG
>>
File: WAN2.2_00268.mp4 (2.14 MB, 608x604)
2.14 MB
2.14 MB MP4
>>106858652
>>
File: 1735372157459180.jpg (771 KB, 1456x2128)
771 KB
771 KB JPG
>be me
>using swarmUI like a pleb
>'refine image'
>makes it worse every time
sigh
>>
>>106858748
i like it
>>
What is the most suitable custom attention setup for RTX 3060? Looking at Flash Attention, not sure if I want to bother with Sage/Triton.
>>
>>106858700
good.
I'm so excited finally upgrading from my shitty 1070 to a 5090.
I don't even know where to start.
>>
>>106858829
i can't imagine that leap.. i went from a 4090 to a 5090 and im pretty happy with it
>>
File: IMG_2311.jpg (74 KB, 934x2000)
74 KB
74 KB JPG
>>106858819
nick

>>106856149
>>
File: 00043-4160751386.png (2.02 MB, 896x1152)
2.02 MB
2.02 MB PNG
>>
File: ComfyUI_00432_.mp4 (345 KB, 832x832)
345 KB
345 KB MP4
>>
>>106857642
basedo ;3
>>
>>106858631
What sampler are you using?
>>
File: WAN2.2_00275.mp4 (3.59 MB, 544x960)
3.59 MB
3.59 MB MP4
>>106858748
back to 1girl posting
>>
File: 00044-1908066525.png (1.23 MB, 1152x896)
1.23 MB
1.23 MB PNG
>>
File: ComfyUI_00438_.mp4 (547 KB, 1280x720)
547 KB
547 KB MP4
>>
File: 00045-3516086549.png (1.15 MB, 1152x896)
1.15 MB
1.15 MB PNG
>>
File: 595019203544512_00001_F.jpg (2.9 MB, 2000x3000)
2.9 MB
2.9 MB JPG
>>
File: 1746590349231864.gif (613 KB, 498x394)
613 KB
613 KB GIF
>>106859255
dat filesize
>>
>>106859255
MILK
>>
how does SeedVR2 upscaling compare to Topaz?
>>
n
>>
File: ComfyUI_00440_.mp4 (471 KB, 1280x720)
471 KB
471 KB MP4
>>
>>106859314
Better for images. You need to use the fp16 model though, the fp8 is broken and creates seamlines.
Dunno for video, the VRAM requirements are too high.
>>
File: 00046-1949190322.png (1.32 MB, 1152x896)
1.32 MB
1.32 MB PNG
>>
>>106859339
You can run fp16 on a 3090 with block swapping and a suitable batch_size. It only takes maybe 5 minutes. There are also GGUF's available. So far I'm liking the results, but I need to compare it with Topaz video models.
>>
File: ComfyUI_00441_.mp4 (1.14 MB, 720x1280)
1.14 MB
1.14 MB MP4
>>
>>106857237
Underrated
>>
>>106859366
Not bouncy enough.
>>
>>106859350
d*bo's mom?
>>
>>106859274
Hiro can afford it
>>
>>106859339
>the fp8 is broken and creates seamlines
lolwut I gave up on seedvr2 because of those fucking lines
>>
>>106859388
Hiro still living in 2003 boards when all we could post is 1 image
>>
>>106859399
there's a giant note in the github that literally states fp8 is broken. not sure how you missed it
>>
>>106859409
>github
ain't got time for that nerd shit
>>
File: SadamotoYoshiyuki_00002_.jpg (1.06 MB, 1152x1712)
1.06 MB
1.06 MB JPG
>>
>>106859415
that nerd shit could've saved you time
>>
File: k3k.jpg (56 KB, 897x882)
56 KB
56 KB JPG
>>106857317
Convenience display platter, lul
>>
>>106857701
Yummmmn!
>>
File: WAN2.2_00291.mp4 (3.74 MB, 960x544)
3.74 MB
3.74 MB MP4
>>106859068
>>
File: Michelle T 4576.mp4 (3.46 MB, 1056x768)
3.46 MB
3.46 MB MP4
>>
File: 00047-587783274.png (1.98 MB, 1152x896)
1.98 MB
1.98 MB PNG
>>
>load old workflow
>both output and post processing operate differently
fffffffffuuuuuuuuuucccccccccckkkkkkkkk
>>
>>106859509
this obsession with sticking to destructive scaling development is fucking awful. comfyui breaks so many rules of media editing applications it's fucking retarded anyone claiming it to be good design
>>
>>106859509
So comfy, right?
>>
File: SadamotoYoshiyuki_00013_.jpg (1002 KB, 1152x1712)
1002 KB
1002 KB JPG
>>
>>106859541
If anyone else gives me noodles to connect in a more consistent way, I'm willing to listen.
>>
>>106859557
>"hey anon heres my workflow"
>posts forge catbox
>>
File: 1759709173938602.png (41 KB, 747x686)
41 KB
41 KB PNG
Guys, absolute idiot here.
With the new rocm I got A1111 to run locally on my AMD 9070xt. But I can't get forge to run. Is there/will there be some kind of fork for AMD GPUs, now that ROCM 6.4 is out?
>Use comfyui
I hate comfyui
>>
File: 1750461451033253.png (221 KB, 1633x948)
221 KB
221 KB PNG
https://xcancel.com/Ali_TongyiLab/status/1976864306353652217#m
ahahahah
>>
>>106859634
Unironically learn comfy. I used to hate it too, and refused to switch from forge/reforge for the longest time. I could never go back now.
>>
>>106859509
i dont think you know what you're doing
>>
does chroma dc 2k work with hd loras?
>>
>>106859652
>I used to hate it too
don't kid yourself, you still hate it
>>
File: SURFER-R0SA.gif (3.98 MB, 512x470)
3.98 MB
3.98 MB GIF
>>106859619
>"hey anon heres my workflow"
>posts salvaged frames\edits from ezgif
;3
>>
>>106859654
youre right i shouldve never pulled
>>
>>106859671
Why'd you want to taint it with the HD shit?
>>
comfy was the best a year ago but one griftchink is all it takes to destroy everything
>>
>>106859673
You don't know me.
>>
>>106859692
Comfy is the only UI still securing millions in funding and adopting all the latest models/tech while also partnering with every major AI provider. you can't cope any harder bro
>>
>>106859684
because there are loras that exist on civitai that i want to use for 2k that are made for hd, but i am unsure if they work
>>
>>106859634
>absolute idiot here
>AMD
>I hate comfyui
kek
>>
File: 00048-1962744031.png (1.22 MB, 1280x768)
1.22 MB
1.22 MB PNG
>>
>>106859704
well just try it yourself? seriously you could've gotten your answer by now
>>
>>106859671
>does chroma dc 2k work with hd loras?
yeah works no problem
>>
>>106859704
All chroma loras are interchangeable
>>
>>106859698
>redditor
it's pretty obvious

>>106859702
what does that have to do with all the enshitification other than they got the money by making the app much worse?
>>
>>106859720
Stop being a fucking retard
>>
>>106859478
How did he do it? And why isn't anon doing it as well?
>>
I don't mind comfy. The only thing I fucking despise is how you can't cancel mid step/during model loading/etc. If I hit Cancel, it should immediately stop
>>
>>106859650
So right now, the space is
1) Sora, censored as all hell but technically the most controllable and highest quality
2) Grok Imagine, much less censored (though prompts do get rejected inconsistently, did a few breast expansion gens earlier and now they all get blocked) and with less control but generally much more freedom
3) Wan, least control (requires loras etc) but least censorship due to opensource...except they want to make it subscription anyway

Not looking good. The town is not big enough for everyone, I don't think.
>>
>>106859702
>Comfy makes millions therefore his product is getting better
who. the. fuck. is. this. R E T A R D?
>>
>>106859743
https://gist.github.com/blepping/99aeb38d7b26a4dbbbbd5034dca8aca8
>Custom node for ComfyUI to make interrupting sampling more responsive
>>
>>106859754
It is getting better though, you're just seething and shitting your pants over optional api nodes
>>
>>106859769
Why does this need a custom node?
>>
>>106859769
so you're telling me that it's possible to make it stop right away but comfy doesn't do it? what the fuck is his problem? sometimes one single step takes a minute on Wan, I don't want to wait a mn if I want to interrupt the process, fuck this retard, jesus
>>
File: ComfyUI_00452_.mp4 (544 KB, 640x832)
544 KB
544 KB MP4
>>
Is there a way to make NAG work with the FaceDetailer from Impact Pack or do I have to manually hack something together?
>>
>>106859650
none of these videos are interesting to me
>>
>>106859650
didnt some anon post a screenshot a few threads back about "new models next week" or something from one of the devs? i forgot to save it
>>
>>106859796
comfy isn't some god that can implement everything right away. if it gracefully cancels gens without any other issues down the road then it will eventually get implemented into comfy.
>>
>>106859828
i regret uploading that image
>>
>>106859847
>comfy isn't some god that can implement everything right away
are you retarded or something? this is a really important thing to implement, he's more focused on adding API nodes than fucking this, lmao, stop sucking up his dick he won't date you fucker
>>
>>
File: 00049-2362339818.png (1.56 MB, 1280x768)
1.56 MB
1.56 MB PNG
>>
>>106859769
I just tested it, holy shit it stopped it right away during a Wan gen, this is actually good, thanks for the code anon
>>
>>106859872
this is the official repo that will be updated
https://github.com/blepping/ComfyUI-bleh
>>
>>106859769
I just tested it, holy shit it deleted my comfui directory, I'm finally free
>>
I just t
>>
>>106859883
It sends my dick pics straight to comfyorg wtf
>>
>>106859883
Just press Control+C.
>>
>>106859907
this, just remove the power cable of your computer and you'll see the process will be terminated right away
>>
>>106859907
or you can just click a button. this is useful also for mobile users.
>>
>>106859678
BRAPP!!!
>>
File: 1740109505120899.png (898 KB, 768x768)
898 KB
898 KB PNG
What's the SOTA local for region prompting?
>>
>>106859769
>>106859883
can't believe this has to be a custom node, why can't Comfy make this basic shit official?
>>
File: dmmg_0126.png (1.26 MB, 832x1216)
1.26 MB
1.26 MB PNG
>>106859034
euler/simple
>>
>>106859929
using a model with spacial understanding i.e. qwen flux chroma or lumina
>>
File: wtf.png (2.62 MB, 2048x1024)
2.62 MB
2.62 MB PNG
Did anyone ever investigate what the fuck actually happened with the SD 3.0 / 3.5 arch? I don't believe it has jack shit to do with "censorship", there's definitely some weirder underlying technical issue that resulted for example in 3.5 Large Turbo actually being MUCH more coherent than regular 3.5 Large for the majority of outputs, which is obviously the opposite of what you'd normally expect distillation to result in. Picrel is same seed / same prompt between 3.5 Large and 3.5 Large Turbo.
>>
>>106859956
>why can't Comfy make this basic shit official?
Saar you have to understand, Comfy needs several million more for its implementation Saar!
>>
>>106859973
who cares? it's a model that can't be saved, we moved past that
>>
File: SadamotoYoshiyuki_00030_.jpg (1.09 MB, 1152x1712)
1.09 MB
1.09 MB JPG
>>
>>106859966
damn that's good.. what model?
>>
>>106859929
I can think of at least three ways to achieve this and none are necessarily better than the others
>>
>>106860031
samefag
>>
>>106859772
>It is getting better though
was not the right time to spew that lie lol
>>106859743
>>106859769
>>
>>106859984
I mean I've had some degree of success training doras on 3.5 Medium. I think the fact that it had a wider resolution range kind of helped it. I think the whole situation is interesting, anyways.
>>
File: dmmg_0195.png (1.25 MB, 832x1216)
1.25 MB
1.25 MB PNG
>>106860031
flux
>>
>>106860069
her skin is more plastic than the inflatable buoy
>flux
oh...
>>
>>106860069
where's the buttchin? last i used flux it was total buttchin
>>
File: 00050-4287958449.png (1.16 MB, 1280x768)
1.16 MB
1.16 MB PNG
>>
File: dmmg_0089.png (1.41 MB, 832x1216)
1.41 MB
1.41 MB PNG
>>106860081
it's still there, just try to run facedetailers high with a second model, i think this is krea for facedetail
>>
File: ComfyUI_00464_.mp4 (549 KB, 640x832)
549 KB
549 KB MP4
>>
>>106860108
oh damn.. ok good to know, thanks
>>
File: 1741324195081560.png (622 KB, 1080x1285)
622 KB
622 KB PNG
upscalers are getting better and better, good to know
>>
>>106860152
gee upscaling 5 pixels into someone whose pictures are all over the entire internet? HOW DO?

retards
>>
File: 1754198200504354.png (1.4 MB, 1024x1024)
1.4 MB
1.4 MB PNG
>>106859971
I want slightly different text prompts in different regions of the gen. simply describing in only a text prompt how the image changes limits me to text-describable regions. and even with good models there's a risk of prompt cross-contamination as there is only one prompt for the whole image.
>>106860038
If you could kindly provide me with a searchable term then I could decide that for myself.
>>
>>106860152
>>106858189
>>
>>106860152
the skin is really plastic though, did they finetune Flux or something? lool
>>
>>106860152
Now do it of someone that isn't in the model's dataset (aka a normal person not a celebrity) and see how accurate it makes them.
>>
>>106860152
>skin color changes
same old problems
>>
File: 1743011008551093.png (215 KB, 672x450)
215 KB
215 KB PNG
>>106860217
it's a big improvement over this I guess
>>
File: ComfyUI_00101_.png (1.05 MB, 1280x800)
1.05 MB
1.05 MB PNG
>>106860152
wowww incredible how'd they do it
>>
I like plastic skin on 1girls because it makes them look more like impossibly perfect dolls than people. They are anime abstractions made into matter, not a depiction of flesh.
>>
>>106859929
>>106860179
i dont think its changed much from latent couple, regional prompting , etc. but i dont use them
there hasnt been any "big" breakthroughs unless you mean specific nodes / custom extensions which i can only assume still receive the occasional update
qwen edit can do "add [thing] inside [the area i marked with a circle]" but that doesnt seem much different than inpainting functionally
>>
File: dmmg_0043.png (1.24 MB, 832x1216)
1.24 MB
1.24 MB PNG
>>106860252
>>
>>106860223
this is the behavior I expect from a straightforward just-an-upscaler.
If I compress the resolution of the result image, then it would match the original much better than >>106860152
I want an upscaler that treats the task as a signal processing task, with any ai just being a vice captain providing context-dependent guidance
>>
>upscaling
even wan 2.1 does this? (& a mostly decent job)
im not sure what im missing here???
>>
>>106859650
>new element notification sound
lol
>>
>>106860263
I am not sure what this face is intending to convey but extremely cool gen, love the gold trim and the iridescence. Nice work anon.
>>
I often dream about being a millionaire and having a dedicated room for ai gen'ing. a bunch of h100's in a gpu cluster with dedicated A/C, backup power and 100tb of raid ssd storage over a 40gb fiber line. man the things i'd do.
>>
>>106860308

>bunch of h100's in a gpu cluster
>with dedicated A/C
>backup power
>100tb of raid ssd storage

>millionaire

Are you retarded?
>>
>>106860335
the fuck is the problem?
>>
File: 1747472316939041.jpg (339 KB, 1024x1024)
339 KB
339 KB JPG
>>106860308
Hey how do we know we're not living in a simulation crafted by some fat neckbeard who managed to achieve that already and went on to design the perfect augmented reality marketed as a videogame to trap our souls forever?
Exactly, we don't!
>>
>>106860308
that's like $330k from Dell, new
>>
>>106859650
>>106859744
they can't go back, it's too late. if they go back, it means they flopped, and their model can't be pro or something like that
>>
>>106860360
That's just upfront cost. The operation cost(electricity) of having multiple h100's + A/C running 24/7 would be enormous.

>>106860356
Even if it were true, we'd still be left with the chicken/egg problem. What is the 'real' reality?
>>
>>106860368
they can make a wan 3.0 that'll be actually good on the SaaS world and release wan 2.5
>>
File: 1740297803558215.jpg (37 KB, 640x396)
37 KB
37 KB JPG
>>106860386
>What is the 'real' reality?
If I told you... I'd have to unplug you!
>>
>>106860399
Likely won't happen, companies (almost) never release legacy closed source when they upgrade models.
>>
>>106860386
depends where you are, i guess. colo near me is like $100/kWh flat rate, which is going to take a while to rack up to $600k.
we have a few 4xH100 and 8xH100s, and they're not really any more power hungry than any other future ewaste. it doesn't really save any money to run them power limited, either.
>>
python was a mistake
>>
>>106860386
you could get a rack at QTS for 2k a month to run that lol
>>
>>106856149
>>
File: E85ZGg0JnSrrPjjw.mp4 (1.86 MB, 720x1280)
1.86 MB
1.86 MB MP4
>>
where are the good gens doe
>>
>>106860467
i like this
>>
File: ComfyUI_00023_.png (1.67 MB, 800x1280)
1.67 MB
1.67 MB PNG
>>106860475
on that guy's million dollar dream server i guess
>>
>>106860275
You missed the sign on the door that reads "no gay retards"
>>
>>106860467
This gave me diabetes, and Parkinson's
>>
File: 00000-2022559353.png (2.64 MB, 1152x2016)
2.64 MB
2.64 MB PNG
>>
new needed
>>
File: 00256-3247359317.png (2.43 MB, 1248x1848)
2.43 MB
2.43 MB PNG
>a toast, to /ldg/
>>
>>106860668
>>106860668
>>
>>106860419
they prefer to kill their old models, rather than share them. the chinese don't have a sharing culture. free models are only for quick popularity.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.