[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106736034

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
gibs mor Jebby
>>
Blessed thread of frenship
>>
is nunchaku wan 2.2's only hope to escape extremely destructive light loras while also keeping gens a sensible 3 minutes?
>>
>>106739455
let me refine that guy's question because I'm also interested
where can we find LORAs of people who do NOT look like a particular celebrity nowadays?
>>
>>106739695
yep
we'll get nunchaku wan but then wait another 6 months for lora support
>>
>>106739695
>while also keeping gens a sensible 3 minutes?
What make you think nunchaku will magically cut down gen times by 50%+?
>>
>>106739850
he's delusional. best scenario is 20 steps at q8 quality with q4 gen speeds
>>
File: bannersideways.gif (2.24 MB, 243x720)
2.24 MB
2.24 MB GIF
>>106739587
BLESSED SHIP OF THEADLY FRIENDS
>>
>>106739695
nunchaku is a dead end, no loras work with it
> b-but flux has lora supprt
lmao nobody cares about that model anymore.
>>
File: GwDmgLeWMAAKC03.jpg (234 KB, 1824x1248)
234 KB
234 KB JPG
while most of you are assholes and schizos, i would like to thank /ldg/ for being the sole source with information on how to use ComfyUI. been doing this since 2023 and I've learned so many little tricks. i havent seen this information anywhere else on the internet.
>>
>>106739695
>please my Q4_S quality meme
no thank you
>>
>anything sensual with wan requires a lora
>all loras are just HYPER SEX MEGA BLOWJOB
>>
File: 1734004473993745.png (1.31 MB, 1360x768)
1.31 MB
1.31 MB PNG
can use reactor in reforge/etc to fix any faces if you do a two character prompt and one face is off:

all these tools work together. that's why you have multiple models/etc.
>>
>>106739991
it would be a good model if it didn't make the humans plastic, that's the biggest difference between that one and nano banana, I hope they'll be able to fix that for the next versions
>>
>>106740008
it works perfectly fine if you say "keep their expression the same", the only time it can be off is if the pose changes and you use multiple characters. but that can be fixed with prompting or a face swap.
>>
>>106739985
>want slow romantic cowgirl lora
>ULTRA-PISTON FUCKMACHINE9000

RRrrrEEEEeee
even promptwrangling it into slow movements doesn't work with the overtrained and badly captioned loras
>>
>>106740035
just rent a runpod and do it yourself, chuddie
>>
>>106740049
> spending money
lmao, kek even
>>
File: dawnita42069gif.gif (2.4 MB, 330x480)
2.4 MB
2.4 MB GIF
>>106739985
ew ;c
2.1 had fun dancing lora atleast i was hoping things would be different
>>
File: 1731167256184121.png (2.24 MB, 1536x1536)
2.24 MB
2.24 MB PNG
>>
>>106740060
back to copeville then
>>
>>106739934
why is hip swaying so damn mesmerizing
>>
File: ComfyUI_18838.png (2.31 MB, 1920x800)
2.31 MB
2.31 MB PNG
>>106739631
>>
File: 1751107141961441.png (2.3 MB, 1536x1536)
2.3 MB
2.3 MB PNG
>>106740071
we've come a long way since pony desu

model is wai v15 (any noob/illustrious model will do, consistently good anatomy etc.)
>>
File: J0YFUL.gif (3.9 MB, 320x466)
3.9 MB
3.9 MB GIF
>>106740084
for dare i say it, FUN! ;D
>>
File: 1744460256212357.png (2.31 MB, 1536x1536)
2.31 MB
2.31 MB PNG
>>106740094
also, high kick is a neat tag.

for your anime gens I highly recommend using this extension with reforge:

https://github.com/DominikDoom/a1111-sd-webui-tagcomplete

infinitely easier to figure out a tag, just type and you get a list of stuff to select with the proper booru tag (that models are trained on).
>>
>>106740096
very good wan expressions wtf
>>
>>106740076
or i could just train it for free:
https://huggingface.co/lora-training-frenzi
https://huggingface.co/lora-training-frenzi
https://huggingface.co/lora-training-frenzi

let's you train a lora on whatever model for an entire week, so like 4 models a day
>>
>>106739985
>prompt "licking"
>subject just holds their tongue on something a couple of times
>write tongue kiss
>they just press their tongues together
wtf where's tongue swipes, flicking and swirls in the piece of shit. Wan 3.0 better be trained one explicit pornographic materials.
>>
File: 1738453577896667.png (2.38 MB, 1536x1536)
2.38 MB
2.38 MB PNG
>>106740112
>>
>>106740114
sure let me give some random guy my dataset
>>
>>106739985
lower the weight to 0.1 then
>>
File: ChromaRedline_00004_.jpg (706 KB, 1408x1952)
706 KB
706 KB JPG
>>
File: 1752219104413305.png (2.79 MB, 1536x1536)
2.79 MB
2.79 MB PNG
>>106740126
>>
>>106740134
counterargument: free
>>
>chineseium sloppa models and WANk
gee thanks
>>
File: 1747091681849575.png (13 KB, 655x115)
13 KB
13 KB PNG
>>106740203
>free
doesn't look free to me
>>
>>106740117
>prompt licking with 2.1 I2V lora
>they stick out their tongue like a snapchat filter
>use 2.2 i2v lora
>barely moves
no winning happening here
>>
>>106740215
i'm going to train porn. what are they going to do, ban me?
>>
>>106740114
>limited time
>shared resources
>can't do nsfw
>surveilled by huggingface as the icing on the cake
>>
>>106740113
its highly edited anon, with wan you have to salvage\cut frames hehe
>>106740126
>>106740112
i used to get banned for images like this ;3
good times
>>106740117
>her tongue is already out in the photo, now it looks like a b-movie prosthetic\strange as fuck heh...
>>
>>106740299
there are non-nsfw things you can train silly.
5000 steps should be more than enough for a concept or even a char.
>>
>>106739695
That's what I am waiting for at least.
>>
>>106740313
>concept or character
Why do you need the cloud for that, you can just do that right now lol.
You only need the cloud if you're going to do videos.
>>
>>106739956
What kind of tricks?
>>
File: 1751710348121999.png (774 KB, 928x1120)
774 KB
774 KB PNG
>old 2.1 lightx2v works better for high pass of wan 2.2
what a shame.
>>
https://files.catbox.moe/pq8nyg.webm
>>
File: 1732259415927978.png (779 KB, 928x1120)
779 KB
779 KB PNG
>>106740423
>>
>Your 5 credits will expire in 1 days.
>The content you uploaded appears to violate the community guidelines
>in order to promote a healthy community, we will impose certain restrictions on content that may appear in text, images, and videos; such as depictions of implications of sexual acts, exposed body parts, seductive content, detailed sexual situations, pornographic intentions, adult products, and the like.
>the scope and degree of content review may vary depending on the time\region.
>since the above judgement is based on Ai content\enforcement review, there MAY be a possibility of misjudgement...
if you need a human review, please contact our customer service Ai bots so they can redirect you to our homepage and ignore your request after stealing your real-world money ;D


accurate yet? hjahaha
>>
how much would it cost to do a small fine-tune of chroma
>>
File: 1748281748827951.mp4 (597 KB, 672x480)
597 KB
597 KB MP4
>>106740557
the man opens the box of pizza and eats a slice.

not bad. 2.1 lora for high pass at 3 str, 2.2 lora for low pass at 1 str.
>>
>>106740423
have you seen the deus ex remastered announcement? god it looks so fucking bad, please bring back dev studios consisting of only old white men
>>
>>106740607
yes. at least this will revive the original modding community and we might get some actual good stuff like gmdx/revision/etc.
>>
>>106740592
kek
>>
File: 1755446633902721.mp4 (1.12 MB, 832x480)
1.12 MB
1.12 MB MP4
the man on the left gets on his hands and knees, bowing to the man on the throne who is laughing.

from the qwen edit image. Billy won SO hard.
>>
>>106740607
>consisting of only old white men
glad you specified old white men because young white men are all trannies now. they don't usually grow old kek
>>
>>106740654
bow, peasant!
>>
File: 1758661798176733.mp4 (993 KB, 832x480)
993 KB
993 KB MP4
>>106740692
>>
>>106740592
literally the same situation for wan2.5 online. even some sfw prompts are denied kek
>>
File: 1733617852166006.mp4 (2 MB, 832x480)
2 MB
2 MB MP4
kek, from another qwen edit.

the man on the left is drinking his champagne and the yellow character on the right throws his 100 dollar bills into the air.
>>
File: 1758794893121184.mp4 (3.88 MB, 832x832)
3.88 MB
3.88 MB MP4
>>
anyone generate sound effects for their videos? i am looking into AudioX AI which apparently can make effect based on text and an input video
>>
>>106740771
all sound generation that's local sucks donkey ass. not worth it
>>
>>106740146
great movie taste
is that base chroma or with a lora?
>>
File: 1731211404210743.mp4 (1.52 MB, 480x832)
1.52 MB
1.52 MB MP4
the man in the white Saudi clothes flies an airplane into a tall skyscraper in new york, which causes a large explosion.

what did EA mean by this?
>>
>>106740750
I wonder if you can have that pantyshot without starting with it
>>
File: ChromaRedline_00018_.jpg (508 KB, 1760x1328)
508 KB
508 KB JPG
>>106740779
Lora. Testing to see if I can reduce concept bleed so chraracters dont get mixed. Chroma stripes pretty much ruin it anyways. Meh
>>
File: abyss.jpg (554 KB, 2014x1540)
554 KB
554 KB JPG
>>
File: p0tat0sadly.gif (3.9 MB, 386x580)
3.9 MB
3.9 MB GIF
>>106740898
i wonder if its the 360 lora for 2.1 or something newer\better\etc
:o
>>
I'm a promptboomer that's still using SDXL what should I be using instead?
>>
>>106740947
HD doesn't have the banding problem but I observed heavy detail loss during the second pass especially in the background. One of the many reasons why I'm pissed with how this model turned out
>>
>>106741015
>360 lora
It's probably that.
>>
>>106740750
prompt?
>>
File: 00184-2539883880.jpg (459 KB, 1440x1920)
459 KB
459 KB JPG
>>
File: 00187-2219899248.png (3.6 MB, 1920x1440)
3.6 MB
3.6 MB PNG
>>
File: 1733549345586802.mp4 (1.44 MB, 480x832)
1.44 MB
1.44 MB MP4
welcome to the saudi sims.
>>
>>106741160
which dude do all those wives belong to
>>
>>106741064
>HD doesn't have the banding problem
I get stripes with every model, there's no escape
>>
File: 00188-2751198210.png (2.82 MB, 1440x1920)
2.82 MB
2.82 MB PNG
>>
File: 1744595860260119.mp4 (1.5 MB, 640x640)
1.5 MB
1.5 MB MP4
the anime girl is doing a high kick and brings her leg to the floor, to stand in a neutral pose.
>>
File: 00189-2398877833.png (3.01 MB, 1440x1920)
3.01 MB
3.01 MB PNG
>>
File: 1746609173477018.mp4 (2.07 MB, 640x640)
2.07 MB
2.07 MB MP4
the anime girl is doing a high kick and spins 360 degrees.

it interpreted the speed lines as water or something...I guess.
>>
File: 1741088461580489.mp4 (1.89 MB, 640x640)
1.89 MB
1.89 MB MP4
the anime girl doing a high kick puts her feet back on the ground and waves hello.

better!

3 str high pass with 2.1 lora, 1 str low pass with 2.2 (low) lora.
>>
the anime girl bends over to touch her toes with both hands.

BFL wont let you gen a girl on grass btw, but China lets you do what you want.
>>
>>106741376
>BFL wont let you gen a girl on grass btw
that's not safe
>>
File: 1746821837484162.mp4 (1.73 MB, 640x640)
1.73 MB
1.73 MB MP4
>>106741376
helps if I add the wan clip.
>>
>>106741215
>>106741190
butiful 2.2 lightning super duper slow-mo
>>
File: 1738147097936768.mp4 (1.57 MB, 640x640)
1.57 MB
1.57 MB MP4
even the shadows change, wan is a neat model.
>>
File: 1747687204573237.mp4 (3.79 MB, 832x832)
3.79 MB
3.79 MB MP4
>>106741110
the image i used was from some anon from /ldg/

I used this workflow https://civitai.com/models/1911655/wan22-360-degree-orbitnsfw?modelVersionId=2163681
I switched the wan2.1 lora's with wan2.2 lightning
and added a interpolator at the end
>>
File: 1737581584763488.mp4 (1.17 MB, 640x640)
1.17 MB
1.17 MB MP4
was there ever a video of the 4koma guys? this is pretty close to what it'd probably look like.
>>
>>106741423
as the for the prompt I followed the instructions in the workflow but I got better results by adding the word "degrees" after "orbit 180"
>>
i like how i can tell if an image was genned with chroma based on whether it has flux lines or not. why does chroma do that?
>>
File: 1754280330845565.mp4 (776 KB, 640x640)
776 KB
776 KB MP4
warning: this video is ILLEGAL in China.
>>
File: dmmg_0286.png (1.7 MB, 832x1216)
1.7 MB
1.7 MB PNG
another day, another attempt at avoiding flux face
>>
>>106741423
prompt for the image. i will try later wan slop
>>
File: ComfyUI_temp_burgz_00008_.png (3.02 MB, 1192x1648)
3.02 MB
3.02 MB PNG
>>
File: RA_NBCM_00028.jpg (705 KB, 1872x2736)
705 KB
705 KB JPG
>>
>>106741492
the image was sourced from another anon from /ldg
>>
>>
File: 1746785452008716.mp4 (793 KB, 640x640)
793 KB
793 KB MP4
>>106741480
>>
>>
>>
File: ComfyUI_temp_pfsdp_00002_.png (2.32 MB, 1192x1648)
2.32 MB
2.32 MB PNG
>>
>>
File: 1739478761415742.mp4 (428 KB, 640x480)
428 KB
428 KB MP4
the blonde girl in the dress stands up and walks to the other side of the room.

no interpolation cause it makes more sense with the retro game edit.
>>
File: ComfyUI_temp_pfsdp_00004_.png (2.47 MB, 1192x1648)
2.47 MB
2.47 MB PNG
>>
File: ComfyUI_temp_pfsdp_00005_.png (2.42 MB, 1192x1648)
2.42 MB
2.42 MB PNG
>>
File: 00006-1576299817.png (1.27 MB, 1192x736)
1.27 MB
1.27 MB PNG
>>
>>106741164
Damn... This model is fucked anon. I had to hang in the towel.
>>
File: 1755610117386992.mp4 (553 KB, 640x480)
553 KB
553 KB MP4
>>
File: ComfyUI_temp_burgz_00015_.png (2.97 MB, 1192x1648)
2.97 MB
2.97 MB PNG
>>106741064
those fucking flash loras ruin the background, I finally got a decent Chroma wf after testing like 10 models, mixing up WF for two days, is such a mess, its like the model is incomplete and you gotta patch it out with several unet modifications to make it work
>>
>>106741776
top kek
>>
>>106741799
The creator decided to freestyle it, he made some really good models in the past and decided to just say fuck it and destroy the thing. I don't know who this model is for, people want basic shit like consistency and no banding during high res passes, something I have never seen before with a model and I'm sure it has to do with him fucking with the resolutions during training.
>>
File: 1739830601905417.mp4 (2.27 MB, 640x480)
2.27 MB
2.27 MB MP4
the man on the left pours his champagne over his head, as the yellow character throws 100 dollar bills in the air.
>>
File: ComfyUI_temp_pfsdp_00008_.png (2.55 MB, 1192x1648)
2.55 MB
2.55 MB PNG
>>106741824
One thing I noticed after testing several of the latest models, is that the noise convergence is fucked up real bad, thats why you get shitty noisy backgrounds cluttered with unwanted stuff with the most basic wf settings. also thats why the flash loras work well but they ruin the background, they go to far into convergence deleting small details, the author should focus on that instead of releasing those schizo models that no one will use
>>
File: 1743743946278379.png (2.66 MB, 1728x1344)
2.66 MB
2.66 MB PNG
>>
>>106741872
He could have had a great model but decided to be a fucking retard, the more people that try to use the model the more people realize how bad it is. Me and other anons with the hardware to actually run this shit decently have done extensive testing on this model and most of have already come to the conclusion this model is shit on a fundamental level and the creator is basically covering his ears screaming lalala.
We already learned that obfuscating tokens destroys model and he lied about it not being censored. I should have known something was up once he started talking to the retarded pony faggot.
>>
File: 1749447792471568.mp4 (2.63 MB, 640x480)
2.63 MB
2.63 MB MP4
>>106741855
>>
>>106741889
this anime was so funni
https://www.youtube.com/watch?v=mixzbaXx208&list=PLdE7sv4frbx-5WOBCGxgmkev_YTf-5gZ4&index=3
>>
>>106741914
sign in?

no thanks.
>>
File: 1747669800263910.mp4 (2.02 MB, 640x480)
2.02 MB
2.02 MB MP4
the yellow character takes off his costume to reveal Miku Hatsune under the yellow suit.
>>
>>106741933
wait you need to sign in to see this? that's bullshit...
>>
File: 1742938940142117.mp4 (1.94 MB, 640x480)
1.94 MB
1.94 MB MP4
the yellow character takes off his costume to reveal an anime style Miku Hatsune under the yellow suit.

that refined it a bit.
>>
>decide to finally upgrade from Forge to reForge2
>go for an entirely clean install and give a spin
>outputs don't match Forge outputs despite identical metadata
>decide it's not worth troubleshooting right then and close out for the day

>run webui-user.bat the next day
>hangs at loading models while randomly kicking up my CPU fans
>hangs every time I try to launch it

Is this thing just shit? Are any of the Forge forks not shit?
>>
File: output.webm (3.85 MB, 792x1320)
3.85 MB
3.85 MB WEBM
Messing around with a hades 2 lora I made for noobai back when the early access version first came out.
>>
File: dsaffsda.png (19 KB, 807x635)
19 KB
19 KB PNG
>>106741902
I mean look at the crap they share on their discord, how are you supposedly to know whats the best model to use
>>
>>106741902
>We already learned that obfuscating tokens destroys model and he lied about it not being censored.
he also lied about the model having artist tags, he's suck a sneaky bitch
>>
File: ComfyUI_temp_pfsdp_00010_.png (2.41 MB, 1192x1648)
2.41 MB
2.41 MB PNG
>>
>>106742024
fucking kekd
>>
>>106742024
It's all cope for them making a foundation of shit SAI did the same thing with it's multiple miscarriages
>>106742055
Yes and I wish more people would be on his ass about that. I still don't know what the fuck he was or is currently doing with these models, he's throwing shit at a wall praying that it sticks what he needs to do is go back to a older epoch and start fucking over instead of wasting time and money that people gave him.
>>
>>106742024
schizo scribbling lmao
>>
File: 1750713018445125.png (1.72 MB, 2336x880)
1.72 MB
1.72 MB PNG
Edit models are so great because you can put any obscure ass anime chararacters as an input and it'll just do it, no need to make loras anymore
>>
>>106742123
Why are you reposting other anons images? It's bad enough you were doing this pathetic impersonation shit earlier in the other thread trying to derail it. You can't even go a day without exposing yourself dude. You should be happy with this thread being the way it is, it's everything you asked for.
>>
>>106742024
lol wut, if they're going to make schizo diagrams at least use draw io or something.
>>
>>106742128
>Why are you reposting other anons images?
what?
>>
File: ComfyUI_temp_pfsdp_00012_.png (2.95 MB, 1192x1648)
2.95 MB
2.95 MB PNG
>>
File: RA_NBCM_00017.jpg (918 KB, 1872x2736)
918 KB
918 KB JPG
>>
File: 1749551026637143.png (788 KB, 928x1120)
788 KB
788 KB PNG
>>
>>106742138
Debo seething pay it no mind
>>
>>106741980
regular reforge is fine with all the extensions, havent had a need to update it. controlnet union, adetailer, reactor, any anime model, all work fine.
>>
chroma feels like lodestones read a bunch of arxiv papers and decided that he was a genius who could get all these experimental snake oil training techniques to work, and then he actually trained it and now we're finding out why nobody actually uses those papers
>>
>>106742158
Ran...
>>
File: chromaschizowf.png (192 KB, 2077x1029)
192 KB
192 KB PNG
>>106742024
what makes me laugh is that one of the the main guys from their discord got all uppity and mad when phr00t made an AIO model for Chroma, and this is the wf from the guy who gatekeeps Chroma lmao
>>
>>106742175
It does feel that way, I haven't felt this disappointed since SD3 desu.
>>
>>106742175
kek, the vae less paper is promising though
>>
File: ComfyUI_temp_pfsdp_00013_.png (2.88 MB, 1192x1648)
2.88 MB
2.88 MB PNG
>>106742070
>>106742142
I changed one small value and got 3 different types of images

Chroma is indeed weird but good nonetheless, good it can filter vramlets and aijeets
>>
>>106742189
Can you post some of his gens?
I look in the discord and all I see is typically dog shit compared to what anons have posted in this thread. They just post the slop and don't care about how fucking bad it looks or how inconsistent it is.
>>106742203
Perhaps he should make functioning model first before freestyling no?
>>
doesn't radiance take multiple times longer to gen with than normal chroma?
>>
>>106742209
Sure it's fine to 3D it seems to have a heavy bias to it. But for art styles it goes straight to hell because it can't stay on target. I can't think of a finished model with this issue outside of SD3 and it killed the company.
>>
>>106741528
i didn't find anything. people here talk too much anyway. i refuse to read everything
>>
File: ComfyUI_00019__6.png (1.4 MB, 896x1152)
1.4 MB
1.4 MB PNG
>>106742211
his gens are pretty fucking good tbqh but I wont ever use that schizo workflow, his civitai models are pretty good too but he has like 30 different wf, his HF is cluttered with lora experiments and models, I mean c'mon, commit with something at least lol
>>
>>106742158
larpy is up to good again? :D
>>
File: 1728299133394262.png (775 KB, 928x1120)
775 KB
775 KB PNG
the man is wearing a tinfoil hat in the shape of a triangle. At the bottom of the screen is a black bar with blue text saying "JC Denton: they say the frogs are being turned gay.". Keep his expression the same.

kek, actually got it, qwen edit v2 is a gold mine for editing/manipulation.
>>
>>106742230
>his gens are pretty fucking good tbqh
meh
>>
i still kek every time i remember that silveroxides puts his hardware info in his discord username
>>
>>106742230
So only 3D I guess, Many anons have already said the model does good at irl/3D but why would a furry want realistic porn when it's obvious his target was 2D?
Hell he shilled that it could do it well but instead lobotomized it's ability to do it by being a retard. So many tokens have a strong IRL 3D bias and it hurts.
>>
File: 1753238116452742.png (801 KB, 928x1120)
801 KB
801 KB PNG
>>106742247
>>
>>106742164
Just switched branches over to regular reForge, same exact shit. It refuses to even make it past the command window.
>>
>>106742230
Am I supposed to be impressed with the detail? Cause that's pretty much Flux, yeah.
>>
>>106742270
do a fresh install of reforge then move your models and loras over.
>>
>>106742271
Thank you for saying that, that is a basic bitch flux gen. I wouldn't be surprised if some of them were in the thread trying to do damage control. A few anons already posted images showing how the model can't stay on point in regards to style with the same exact prompt earlier today.
>>
File: 1757487303716526.jpg (55 KB, 424x512)
55 KB
55 KB JPG
the man is sitting at a desk in an office on a leather office chair. he is holding a yellow file folder. At the bottom of the screen is a black bar with blue text saying "JC Denton: why don't you get a job?". Keep his expression and pose the same.

looks better than the shitty "remaster" JC
>>
File: ComfyUI_temp_pfsdp_00015_.png (2.69 MB, 1192x1648)
2.69 MB
2.69 MB PNG
When it works it works
>>
File: 1739427080720498.png (901 KB, 928x1120)
901 KB
901 KB PNG
>>106742297
...if it would save the full image that is.
>>
File: 1751909979958487.png (1.84 MB, 1683x880)
1.84 MB
1.84 MB PNG
>>106742247
>qwen edit v2 is a gold mine for editing/manipulation.
for non realistic shit yeah it's really good
>>
File: 1735420488479582.png (622 KB, 1112x936)
622 KB
622 KB PNG
>>106742311
it's even better with the new one, all you have to say is "the character in image1 is holding a picture of image2."

no more concatenating/latent stitching stuff, just reference the node.
>>
File: 1753974946584838.png (968 KB, 1168x888)
968 KB
968 KB PNG
>>106742324
and for realistic stuff it works good too, just use "keep their expression the same" and it should retain the face. or else reactor can fix it if not.
>>
>>106742334
>and for realistic stuff it works good too
it works fine if the edit is minimal, but it shits the bed if you go for 2 humans (2 inputs) and ask for something really different to the original image, if they fix that on their next version it'll start to be a really great model
>>
File: ComfyUI_temp_pfsdp_00016_.png (2.71 MB, 1192x1648)
2.71 MB
2.71 MB PNG
>>
File: 1740089152305628.png (844 KB, 816x1264)
844 KB
844 KB PNG
>>
File: 1729890054137646.png (884 KB, 928x1120)
884 KB
884 KB PNG
the man in image1 wearing a black coat is holding a large picture of the man on the right in image2 with his right hand. At the bottom of the screen is a black bar with blue text saying "JC Denton: what a shame.". keep his expression the same.
>>
File: 1753198807961085.png (855 KB, 928x1120)
855 KB
855 KB PNG
>>106742432
>>
>>106742432
nice
>>
>>106742432
>>106742437
yeah I've heard about that remaster, never played Deus Ex but I feel ya, must be painful to see this incompetent shit kek
>>
>>106742445
the good news is it will get people who made good original mods to make new stuff or salvage the shit they make.
>>
File: ComfyUI_temp_pfsdp_00017_.png (2.5 MB, 1192x1648)
2.5 MB
2.5 MB PNG
>>
File: 1745508095843532.png (760 KB, 928x1120)
760 KB
760 KB PNG
even better:

the man in image1 wearing a black coat is holding a TV with the screen showing image2. At the bottom of the screen is a black bar with blue text saying "JC Denton: what a shame.". keep his expression the same.

I love how you can reference nodes now, before it was a pain in the ass cause if 2 characters look alike, how do you reference them properly? now it's simple.
>>
Any other local music Chads?

Audio to audio:
>https://xcancel.com/cocktailpeanut/status/1886456240156348674

Raw text2music:
>https://map-yue.github.io/music/this_is_my_life_rap.mp3

Revisiting Yue, this is insane... Like a little upgrade to sound quality and it's even better than Udio! There's nothing out there better than YuE right now.
>>
>>106740592
This is like a giant advertisment for why you always want local

Good work HF!
>>
File: 1737714616116117.png (802 KB, 928x1120)
802 KB
802 KB PNG
okay, NOW it's good cause I was more detailed and mentioned arms, so now he's actually grabbing it.

the man in image1 wearing a black coat is holding a TV with his arms, with the screen showing image2. At the bottom of the screen is a black bar with blue text saying "JC Denton: what a shame.". keep his expression the same.
>>
>>106742334
>and for realistic stuff it works good too
That plastic mannequin hand...
>>
File: 1746653583397721.png (868 KB, 928x1120)
868 KB
868 KB PNG
>>106742484
and with a simple image swap and text revision...

the original image had her arms cropped and it even fixed that. AI is so cool.
>>
>>106742468
It's utterly insane, and for that audio to audio clip, meant to link second gen (scroll down)
https://xcancel.com/cocktailpeanut/status/1886456240156348674#m

It may not be API tier yet, but it's damn close. I wonder what those devs are doing regarding a v2 for this model! I can safely say YuE is our only chance to be completely free from API overlords.
>>
>>106742468
>There's nothing out there better than YuE right now.
is this is a joke or something? Udio destroys everything that exist so far
https://www.youtube.com/watch?list=OLAK5uy_lxkPK9uaHgimCi2QbliQuMFQu-9E9jrMo&t=76&v=AJmuSruNun8&feature=youtu.be
>>
>>106742468
>>106742522
>Feb 3.
>>
File: 1758755498833895.jpg (1.19 MB, 2048x2048)
1.19 MB
1.19 MB JPG
Took me a bit too long to realize that this is Nurse Joy.
So, moar?
How about Whitney, Gardenia, Kamitsure, Koruni, or Viola?

>>106717426
Aren't these guys supposed to miss their targets constantly?
Too unrealistic!
>>
>>106742557
>Nurse Joy
wrong damn pic, whatever, the neat Nurse Joy gif not long ago
>>
>>106742524
Are you listening to the composition/instruments/lyrical arrangement anon? I know the audio quality isn't the best, that can be improved.
https://map-yue.github.io/music/step_back.metal.mp3

https://map-yue.github.io/music/%E5%AE%8C%E7%92%A7%E3%81%AA%E9%96%A2%E4%BF%82.mp3

(text2music, OG song: https://www.youtube.com/watch?v=v1NMaIQ58N0)

Yeah, this model is insane. It has potential. Picking and choosing bad models to scale is how local stagnates (E.G. HunyuanImage 3), but the Chinks should scale this instead.
>>
yep, it's a joke
>>
>>
File: 8654512158444.png (96 KB, 997x598)
96 KB
96 KB PNG
>>106742586
Though the general audio quality is higher, you're overrating Udio based on a few nitpicked songs anon-kun. You also have to reroll with Udio otherwise you get shit on par with Suno. Depending on the song this model solidly trades blows with both.

Anyways, does anyone here actually read the papers?
https://arxiv.org/abs/2503.08638
They've done blind tests. Again, not claiming it's better, but it's not that much worse.

https://www.udio.com/tags/music
Listen to the quality of songs that show up here, not some curated playlist...
Plus, Udio is getting sued anyways if I'm not mistaken. That would force them to make their model worse, and I think it's already showing.
>>
>>106742577
>>106742622
I know some anons love to throw "shill" for everything, but this one is an actual shill, and I don't get why he's shilling a product that is 7 months old lol
>>
File: 1732188758234307.png (894 KB, 1176x880)
894 KB
894 KB PNG
the man on the right pumping his fist in the black shirt in image1 is wearing a glass monocle and black top hat, and has a curly black moustache. He is wearing a black tuxedo.

classy 4koma
>>
>>106742630
>shilling a product
>the model is free to download
>it's 7 months old, why would they care?
>>
File: 1737093434115957.png (931 KB, 1176x880)
931 KB
931 KB PNG
>>106742635
>>
>>106742652
>noo you don't get it, no one cared about this model for 7 months because it's a hidden gem that needed to be brought up again
Sure.
>>
File: 00067-60425751.jpg (316 KB, 1344x1728)
316 KB
316 KB JPG
>>106742164
there several issues with multiple adetailer models, samplers and schedule types not working at all on reforge and reforge 2. Panchovix needs to serious patch this shit up.
>>
>>106742670
The only reason it did not catch much traction back then was because of long gen times. Everyone is a VRAMlet and this model is practically 24GB only if you want to generate at semi comfortable speeds. Now we have an exllamav2 implementation of it that helps, 10 mins down to 2 mins.
>>
File: ComfyUI_temp_pfsdp_00020_.png (2.49 MB, 1192x1648)
2.49 MB
2.49 MB PNG
>>
File: 1732750140072772.png (679 KB, 1112x936)
679 KB
679 KB PNG
the white cartoon character in image1 is sitting at a computer on a large desk, on the second monitor beside him on the desk is showing image2. the background is white.
>>
File: 1743146144983122.png (761 KB, 1112x936)
761 KB
761 KB PNG
>>106742743
>>
>>106742731
Now, don't get me wrong. ACE Step is good, and I can't wait for v1.5. But I want to see those guys iterate on their model. I'm not sure if ACE Step cucked its dataset or what, but for some reason their model doesn't generate music as high in quality.
>>
File: ComfyUI_temp_burgz_00030_.png (2.96 MB, 1192x1648)
2.96 MB
2.96 MB PNG
>>
>>106742763
>Road rash on her face.
>>
the absolute state of natural language models
>>
File: ComfyUI_temp_burgz_00031_.png (3.23 MB, 1192x1648)
3.23 MB
3.23 MB PNG
>>
File: 1754305537783476.png (958 KB, 1112x936)
958 KB
958 KB PNG
>>106742749
>tfw meeting miku
>>
>>
File: 1748306798474293.png (939 KB, 1112x936)
939 KB
939 KB PNG
>>106742787
keep the expression of the character in image1 the same

that makes qwen edit keep the same look (roughly)
>>
holy shit why is chroma so fucking nasty looking???
>>
>>
File: 00194-4129041132.png (2.35 MB, 1152x2016)
2.35 MB
2.35 MB PNG
>>106742781
despite sdxl limitations, it still gets the job gone with decent speed and quality results. illustrious still remains king for me.
>>
File: 1736578484249430.png (981 KB, 1112x936)
981 KB
981 KB PNG
>>
File: ComfyUI_temp_pfsdp_00024_.png (3.24 MB, 1192x1648)
3.24 MB
3.24 MB PNG
>>
File: ComfyUI_07225_.png (1.93 MB, 1152x1152)
1.93 MB
1.93 MB PNG
>>106740750
Nice

>>106741799
Based and creepshot pilled

>>106741423
It was sourced from me, here you go anon-kun
https://files.catbox.moe/g6iq7z.png
https://files.catbox.moe/wx1qx9.png
>>
>>106742866
>>106741492
>>
File: ComfyUI_temp_pfsdp_00025_.png (2.39 MB, 1192x1648)
2.39 MB
2.39 MB PNG
>>
File: 1730094636032409.png (829 KB, 928x1120)
829 KB
829 KB PNG
>>
>>106742775
that's the Chroma Blush
>>
>>106739956
that's cool and all but anistudio is going to be my daily driver whenever ani gets off his ass
>>
File: 1754798249081945.png (2.48 MB, 1728x1344)
2.48 MB
2.48 MB PNG
>>
>>106742866
thx!
>>
File: 1753128199663048.png (2.38 MB, 1536x1536)
2.38 MB
2.38 MB PNG
>>
>>106742966
damn this is good
>>
File: ComfyUI_05600_.png (984 KB, 720x1280)
984 KB
984 KB PNG
>>
>>106742971
it's just wai v15, pretty good all purpose anime model (like most anime models, noobai or illustrious)

what's neat is the model knows so many characters you dont even need a lora except for very specific stuff.

https://github.com/DominikDoom/a1111-sd-webui-tagcomplete

that extension is amazing for prompting cause it knows all the booru tags and you can fill them easily (no need to memorize exact tags).
>>
>>106742939
Based
>>
>>106742564
>>106742557
>>>/vp/napt
>>
>>
is there a ksampler that can alternate between two models? you know, kinda like refiner workflow, but each step it alternates between two models. i don't want to place 20 ksamplers for each individual step
>>
File: 1728766326057782.png (2.58 MB, 1536x1536)
2.58 MB
2.58 MB PNG
>>106742988
like so, defaults for the model plus some characters + location + outfits.

masterpiece,best quality,amazing quality,blue_archive, asuna \(bunny\) \(blue archive\), blue archive,blue sky, beach,karin \(bunny\) \(blue archive\), 2girls, waving, smile, fishnet pantyhose,
>>
>>106742947
i would point out how fucked up her pants look, but that blur... the blur makes it feel so real...
>>
>>106742939
same. sick of maintaining python for every update
>>
>>106741824
It's an experimental model. Base/HD are good as is for what they are. I think the creator should have made a bigger deal about Flash HD. I would say Flash HD weights are the best for the general public that doesn't like to tinker much with settings or with bad gens as it's closest to convergence. Downside is no negs with it, but most normies wouldn't care. Plebbit would be all over this model.
>>
File: 235654435.jpg (170 KB, 768x768)
170 KB
170 KB JPG
>>
File: 00205-2573023262.png (2.72 MB, 1824x1248)
2.72 MB
2.72 MB PNG
>>
File: ComfyUI_temp_maqfd_00040_.png (3.67 MB, 1192x1648)
3.67 MB
3.67 MB PNG
>>
File: ComfyUI_temp_pfsdp_00032_.png (3.66 MB, 1192x1648)
3.66 MB
3.66 MB PNG
>>
File: ComfyUI_01473_.png (3.85 MB, 1728x1248)
3.85 MB
3.85 MB PNG
>>
>>106743063
fellow chrome chad, thank god it filters vramlets and aijeets
>>
>>106743055
>Downside is no negs with it
iirc NAG fixes that.
is there any link to flash HD (ggufs?)
>>
>>106743114
there's the full model and an fp8 but i don't think there's any ggufs of flash. you'd have to quant it yourself
>>
File: ComfyUI_01474_.png (3.57 MB, 1728x1248)
3.57 MB
3.57 MB PNG
>>
File: ComfyUI_temp_maqfd_00044_.png (2.25 MB, 1192x1648)
2.25 MB
2.25 MB PNG
>>
File: 1748545268145220.png (2.75 MB, 1536x1536)
2.75 MB
2.75 MB PNG
>>
ffs every other post in this thread is miku you guy are suffocating me
>>
>>106743110
>>106743159
neat
>>
File: RA_NBCM_00020.jpg (792 KB, 1872x2736)
792 KB
792 KB JPG
>>
hm... what to gen what to gen...
>>
>>106743220
How about some Miku Hatsune?
>>
>>106742939
>whenever ani gets off his ass
so, never?
>>
>>106743230
why the fuck would i do that
>>
File: 1740274622203894.png (2.42 MB, 1536x1536)
2.42 MB
2.42 MB PNG
>>106743230
not him but sure, why not I had reforge open:
>>
>>106743234
he is completely capable of doing that otherwise anistudio would never had existed in the first place
>>
File: ComfyUI_temp_pfsdp_00035_.png (2.32 MB, 1192x1648)
2.32 MB
2.32 MB PNG
>>
>>106739587
I'll have you know I haven't fapped since september 20th.
>>
>>106740185
nice tits, all foids should look this good
>>
>>
>>
File: 1744380564923411.png (2.4 MB, 1536x1536)
2.4 MB
2.4 MB PNG
>>
>>106743212
nice, what model
>>
File: still not enough vram.png (98 KB, 1137x483)
98 KB
98 KB PNG
>>106743110
*ahem* FUCK VRAMLETS
>>
File: 1754580929930664.png (3.08 MB, 1728x1344)
3.08 MB
3.08 MB PNG
>>106743071
>>106743098
that is fucken sick lmao.

my attempt at this.
>>
File: 1740750652591428.png (2.36 MB, 1536x1536)
2.36 MB
2.36 MB PNG
>>106743338
>>
>>106741481
Avoid flux face by hagmaxxing, interesting technique
>>
>>106743356
do you gen or just buy?
>>
File: 1729208292942338.png (1.33 MB, 1079x1373)
1.33 MB
1.33 MB PNG
Is there a way, in anno domini 2026, for me to just hum into a microphone and have AI convert that into an instrument?
>>
File: 1751021063128293.png (2.52 MB, 1536x1536)
2.52 MB
2.52 MB PNG
>>106743376
>>
>>106743356
Still 5 years before this is useful for imagegen (outside training), and by then better GPUs will be out.
>>
>>106741980
>>106742164
Forge classic (the old classic not the Neo branch with the comfy tier memory problems) is also still great if you’re doing 1girl gacha with sdxl based models. Churning through some wildcard prompts as we speak lel.
>>
>>
>>
>another million auto forks
>>
>>106743384
I used to get ads for this very thing all the time but not recently. I wish I recalled the name so I could tell you and you could hum into a microphone and have AI convert that into an instrument in anno domini 2026
>>
>>106743380
yeah all the time anon. I've got the RTX 6000 and 5090 both going at once. Specing out a C210 now
>>
File: 1747088878888910.mp4 (3.42 MB, 560x768)
3.42 MB
3.42 MB MP4
>>106743071
>>
>>106743356
we get it anon
>>
>>106743449
anon please, prompt it a little more, not just "miku running thru war"
>>
>>106743356
I want to see what you can make with all that VRAM. Show me some brilliance anon
>>
i hate nag why does this shit ruin anatomy
>>
>>106743477
it would be cool if vram directly correlated with kinosovl but that is not the case. often its the opposite.
>>
File: Jentsune Nichu.webm (3.93 MB, 1280x720)
3.93 MB
3.93 MB WEBM
>>106743220
Animate some screenshots.

>>106743356
I'm planning a 6000 Pro build right now and was wondering if you've done any testing with RAID 0 Gen 5 NVMEs on that consumer mobo? I foresee loading to be a bitch on a 96GB/192GB system (it's so much to fill), but I don't want to add a PCIE card unless I absolutely have to.
>>
>>106743477
Uh, there isn't anything the 6000 can't do that the 5090 can't, just faster as I don't have to move around with normal ram.

I'm trying to find something that isn't people I know or NFSW...
>>
>>106743489
>I'm planning a 6000 Pro build right now and was wondering if you've done any testing with RAID 0 Gen 5 NVMEs on that consumer mobo? I foresee loading to be a bitch on a 96GB/192GB system (it's so much to fill), but I don't want to add a PCIE card unless I absolutely have to.

Are you actually loading and unloading models that much? Striping on an NVME would be insane. I guess I haven't seen the need or I don't mine the extra load. You might have some more serious workflows. I just setup a RAID 10 on another board, but it was 2.5 SSDs and not NVMEs.

Also on most boards share bandwidth with the PCIEx16 card slot so just putting an SSD in that slot will drop you down to 8x. I would think that memory bandwidth is actually useful and you wouldn't want to lose that.
>>
I'm new to this, how can I do this with my CPU and how can I gen more efficiently without frying my PC? I did try easydiffusion but it ate all my ram and took 10 minutes to gen. What about something that can be connected to sillytavern too?
>>
>>106743537
>CPU only
forget about it
>>
File: 1755768269082662.mp4 (3.62 MB, 560x560)
3.62 MB
3.62 MB MP4
>>106743169
>>
File: 00061-1909410372.png (2.32 MB, 1248x1824)
2.32 MB
2.32 MB PNG
>>
>>106743540
What about openvino?
>>
Speaking of nag, what does nag_sigma_end do? It's a float that goes from 0 to 20, but what exactly does that correspond to?
>>
>>106743560
snake oil value
>>
Wan has a built-in defense against genning porn of yourself with other real people—this is why I have never tried and will never try this—which is that it has a 20% chance at any given time of springing a trap on you, and making you watch a video of yourself popping YOUR tits out and sucking THEIR womanpenis (horrifying)
>>
cozy
>>
Diffusing Seedream locally with ComfyUI API Nodes
>>
>>106743537
anistudio unironically. if you are using igpu vulkan will probably help
>>
File: Jennyland_01.webm (3.92 MB, 1280x720)
3.92 MB
3.92 MB WEBM
>>106743519
>Are you actually loading and unloading models that much?
Kinda, sorta (I use LLMs for prompt work). Loading a 18-20GB LLM takes about 15s right now and I don't want to wait minutes once I can load a quant that's over 200GB.
>>
>>106743636
anistudio unironically cuts down the model loading by more than half
>>
>>106743489
>>106743636
I finally got around to watching parts of some of her videos for the first time. I'm going to make a supercut of every time she starts a video with "so...".
I want to be her husband.
>>
anistudio is actually good?
>>
>>106743537
Sorry man, but while CPU can work ok for LLMs, for image/video generation it is just too slow to be remotely practical at this point
>>
>>106743666
not really, it has it's advantages but still lacks qwen support, lumina support, conditioning modifiers and has to load the model every gen. it's the highest potential by being faster than everything else without snake oils
>>
>>106743666
If it worked it would be. Memory offloading does not seem to work either.
>>
File: 1732954314500999.png (1.04 MB, 768x1280)
1.04 MB
1.04 MB PNG
>>106743537
>with my CPU
>>
>genning new images for i2v
>forgot how fast the image gen is

The day we get videos genned at these speeds, bros..
>>
File: 00111-416324273.png (1.88 MB, 1344x1728)
1.88 MB
1.88 MB PNG
>>
are their any smut finetune gguf's of qwen_2.5_vl_7b that might be better for nsfw image gen? Im always worried the text encoder is spitting garbage out. Or is there a way to put in a system prompt in comfyui? I have some abliterated model but those are just retarded
>>
>tfw I scrapped some content curator social media grifter account I've hated for months, all the photos and videos of his account and trained several loras with it

thank you for the free dataset retard, only good thing that has come out of those grifters, now you can use their own media
>>
File: 1757732330436896.mp4 (1.76 MB, 640x640)
1.76 MB
1.76 MB MP4
>>106743476
hatsune miku runs through a warzone
>>
>>106743839

>>106743839

>>106743839
>>
>>106743830
sounds like you like their content so much you made a lora just to make more of it. but okay bro. You 'hate' him.
>>
>>106743853
I've never followed him, just know his account because he is a smug fuck, he just reposts boutinela photos and photos from other accounts, basically a content stealer douche, all he does is recycle his own posts, he got a pretty good following I gotta say, but those content curators accounts are just grifters, all of them
>>
>>106743356
Did u get the 6000 pro on msrp price? US$10000 ?
>>
>>106743930
I had to suck Jenson off. He's small so it's workable but.... smegma...
>>
>>106740750
which 360 lora was this?
>>
>>106739934
>>106740096
>>106740312
>>106741015
why does this tranny still soil our threads with its presence?



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.