[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107294974

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/kohya-ss/sd-scripts
https://github.com/tdrussell/diffusion-pipe
https://github.com/ostris/ai-toolkit

>WanX
https://rentry.org/wan22ldgguide
https://comfyanonymous.github.io/ComfyUI_examples/wan22/

>NetaYume
https://civitai.com/models/1790792?modelVersionId=2298660
https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd
https://gumgum10.github.io/gumgum.github.io/
https://huggingface.co/neta-art/Neta-Lumina

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: 00001-3342832187.png (2.7 MB, 1080x1920)
2.7 MB
2.7 MB PNG
>>107300315
>Would I be able to make hot blondes?
nah, if you tried genning hot blondes it'll give them little peckers. i'm one of the only people with the power to generate hot blondes.

>>107300319
understandable.
>>
>>107300360
What do you mean?
>>
Can someone redpill me on what the best checkpoints are for anime gen? I already have WAI_NSFW, Noob and novaAnimeXL. What else should I add to the collection?
>>
File: 00154-3302656621.png (1.85 MB, 1040x1520)
1.85 MB
1.85 MB PNG
>>107300371
i was gonna keep trolling you lightly but im genuinely too exhausted
if you really wanna get started i recommend downloading either wainsfw, or another suggested illustrious checkpoint non-anime related, follow a really simple prompting guide of subject, subject details (hair/eyes/clothes), subject action, then model recommended quality tags at the END.
euler a karras, 5/7cfg, 1024/1024. gen that in either forge or comfyui (forge for beginners) and you're off to the coomer races.
>>
>>107300380
Most anime checkpoints are regurgitated merges of existing models and as such, fried. Test your models and try using obvious booru tags but different mediums too.
Most models like Wain are lobotomized. Of course gens look good but not very flexible.
>>
>>107300380
https://civitai.com/models/1790792?modelVersionId=2298660
>>
>>107300389
shut up ranfaggot
is your 5090 holding you back?
>>
File: 00074-688102625.png (2.05 MB, 1080x1920)
2.05 MB
2.05 MB PNG
>>107300380
>>107300393
essentially this, but it's also all up to personal preference, some models (and merges) will just do certain concepts you like better, at which point that model is objectively best from your perspective.
example being i only autistically stuck to nova animal all this time because it has a really solid semi-real style i can flexibly push and pull in my own ways vs every other model and especially ones that are MEANT to be generating realistic humans to begin with.
>>
>>107300396
Is this what most people here use? How is it for contextual prompts? I'm guessing it doesn't support Illustrious loras?

>>107300380
Extending from this, does anyone have suggestions for a checkpoint/workflow specifically for anime that adheres to it's official style? /adt/ recommended this one to me but I'm yet to try it:

https://civitai.com/models/2120585/trueanime

It also strongly recommends the Megami Magazine Lora.
>>
>>107300449
this one i got recommended a week back, its now my main for anime styles. surprisingly good lora adherence.
fair warning, its vpred, so make sure your shit's prepared for it
https://civitai.com/models/1280186?modelVersionId=1444264
>>
>>107300407
Yeah das rite. It's a tool. Creating simple 1girls with bursting breasts doesn't take that much of an effort than some other subjects or even purposeful illustrations.
>>
>>107300449
>How is it for contextual prompts?
Elaborate?
>>
>>107300360
awooga
>>
File: ChromaLora_00208_.png (1.61 MB, 1264x1024)
1.61 MB
1.61 MB PNG
>>
I will reveal my latest art piece in 15 minutes.
>>
>>107300505
i trust this plan
>>
thanks for keeping us updated
>>
>>107300532
I just read your update.
>>
File: +1 don tuber.jpg (58 KB, 316x325)
58 KB
58 KB JPG
>5 different UI options in the OP
Alright /g/entlemen, tell me which one is the objectively correct choice so I don't need to try all 5
Also, are there any plugins/toots for giving you a built in list of tags used with the training data so you can autofill tags or do you just need to use dan/gelbooru and maybe e621 tags and pray that's where they trained from
Using
>Illustrious XL personal merge
at the moment from an anons suggestion
attention grabbing image
>>
File: 00067-372797941.png (2.25 MB, 1080x1920)
2.25 MB
2.25 MB PNG
>>107300549
>objectively correct choice so I don't need to try all 5
it comes down to if you have the autism to handle spaghetti nodes and tinkering (and are smart enough to just grab someone else's workflow) you use cumfartui, if you're a normal person you grab whichever version of SD forge fits what you want to do.
since you're just starting, i think most would suggest forge classic.
>>
>>107300460
>vpred
zoomer
>>
>mixes and merges
>gradio
Nostalgic for 2023
>>
File: 1_00003_.mp4 (1.56 MB, 832x1440)
1.56 MB
1.56 MB MP4
>>107300407
>>
>>107300549
>Also, are there any plugins/toots for giving you a built in list of tags used with the training data so you can autofill tags or do you just need to use dan/gelbooru and maybe e621 tags and pray that's where they trained from
There are various such plugins for both ComfyUI and Forge. Just look up danbooru tag autocomplete plugin. They don't use the checkpoint's training data though, just Danbooru and e621 tags. Anime model captions are usually closely gatekept. Except RouWei, whose creator recently shared a huge dataset.
I use a Comfy plugin that can do what you're describing for loras, though. rgthree's Power Lora Loader.
>>
>>107300575
cute. incredibly cute. i spiritually cummed to this.
>>
File: ChromaLora_00216_.png (2.23 MB, 1264x1024)
2.23 MB
2.23 MB PNG
>>
>>107300600
vs Nano Banana Pro
>>
>>107300559
Thanks ran
>>
File: 2_00002_.mp4 (1.27 MB, 992x1440)
1.27 MB
1.27 MB MP4
>>
>>107300576
>Anime model captions are usually closely gatekept.
Why the fuck would anyone do that
Are people really hoarding their special snowflake model sauce for ego reasons?
>I use a Comfy plugin that can do what you're describing for loras, though. rgthree's Power Lora Loader.
Sounds interesting, I assume you mean pulling tags from loras and not loading loras from some external source (having to go manually hunt and download loras compatible with whatever the new meme model is always annoyed me)
>>
File: 1732904312683871.png (556 KB, 945x942)
556 KB
556 KB PNG
>>107300647
>hoarding their special snowflake model sauce for ego reasons
It seems that way.
>loras
It can show you a lora's trained words
>>
File: sugi0087.webm (1.01 MB, 704x704)
1.01 MB
1.01 MB WEBM
>>
>>107300647
I want to know how my Science Computer can Quantize existing SDXL models.
>>
File: shit dick.png (214 KB, 1491x512)
214 KB
214 KB PNG
seems like my gens are all over the place. even with the color match node, sometimes gens do the exact opposite and actually shift halfway with blown out colors.
someone pls point me in the right direction here, some pics gen really well and some are fucked.

>>107300671
daaaaaaaamn
>>
File: 1754934318755370.png (1.68 MB, 900x1157)
1.68 MB
1.68 MB PNG
>>
Do any of you have experience with image to 3D model? How is it in it's current state? How much work is needed for retopology? Is it slow?
>>
File: Nano Banana Pro.png (1.62 MB, 1152x928)
1.62 MB
1.62 MB PNG
>>
>>107300724
If you use Houdini, you can convert the model to volume and then resurface it but if you want to any meaningful results you still need lots of manual work.
Image to 3d is still best used like photogrammetry or lidar scans, eg. for backgrounds and static stuff.
>>
>>107300676
*so far i lowered motion amplitude to 1.05 and that helped the schizoness..
>>
>>107300742
I mean, even if the topology is bad, if it can capture the shape accurately (which it seems capable of), then I would think that potentially renders the sculpting workflow surplus outside of maybe some auxiliary touchups.
>>
>>107299730
this is pretty cool
>>
>>107300761
It's never that simple. You can use it as a base in zbrush or modo but if you are already good in modeling you could get same result by starting from scratch.
But of course, you should experiment that's where the fun is!
>>
File: 3_00004_.mp4 (2.43 MB, 960x1440)
2.43 MB
2.43 MB MP4
>>107300610
Nano Banana Pro won!
Oh, wait...
>>
>>107300812
>>107300761
To add: I'm quite cynical but you could make a pipeline and then manually fix the result. It's probably a fun project for a weekend or two but you need proper tools for this. Forget about blender or freetard hobbyist tools right off the bat.
>>
File: Wan_00008-1.mp4 (3.88 MB, 864x720)
3.88 MB
3.88 MB MP4
Lol, tried to get elon musk to come in out of frame and chug a beer with hitler.

The new painteri2v longvideo is amazing.
runninghub.ai/post/1991738572403470338/?inviteCode=rh-v1152 (download on the right)
>>
>>107300816
this made my dih leak..
>>
>>107300449
>Megami Magazine Lora.
I thought I was over 2d but the previews on that do look nice. I might give that a shot
>>
>>107300855
zamn look at that spaghetti. Its like six twin towers. so much of this workflow could be easily consolidated
maybe i'll try later in the morning and see what i get out of it. does look pretty seamless.
>>
>>107300855
I still have lots of respect for the old man.
>>
>>107300855
Yeah the painter nodes are great. Only small nit pick is the mild jump between each stitch. I'm sure it'll improve but it is pretty impressive. https://github.com/princepainter/ComfyUI-PainterLongVideo
>>
>>107300936
Funny enough, they have their own ksampler where you can plug in both high and low noises, should clean things up abit. https://github.com/princepainter/Comfyui-PainterSampler
>>
>>107300966
catbox the uncensored pls boss
>>
>>107300972
I couldn't get that one to install.
>>
File: 3_00008_.mp4 (2.05 MB, 960x1440)
2.05 MB
2.05 MB MP4
>>107300816
>>107300914
Nothing to feel ashamed
>>
File: 3194792021.png (449 KB, 832x1216)
449 KB
449 KB PNG
>>
what happened to landscape diffusion thread?
>>
>>107301016
absolute cinema

by the way, how do some of you handle stripping prompts with wan? certain clothes are impossible for me to get right. full body dresses with cleavage out for example, cant get this fucker to do more than just have the girl rub her tits.
>>
>>107301061
I use percentages. Clothes: 75% and so on.
>>
File: 3_00010_.mp4 (2.9 MB, 960x1440)
2.9 MB
2.9 MB MP4
>>107301061
Lora used (deleted on civitai for understandable reasons):

https://civitaiarchive.com/models/1918035?modelVersionId=2170846

And I use smoothmix model which appear to be quite creative
>>
File: 4_00001_.mp4 (1.27 MB, 1440x1184)
1.27 MB
1.27 MB MP4
>>107300731
>>
File: 1741639992070403.jpg (564 KB, 1346x686)
564 KB
564 KB JPG
Any ideas how to fix color shift when doing vae decode -> upscale -> vae encode with WAI and many other anime checkpoints? All colors become much warmer. Sometimes, like on the attached pic, the difference is huge.
>>
File: nodes.jpg (99 KB, 1118x332)
99 KB
99 KB JPG
just got started with this tonight, its quite fun. I was curious about pic related. putting together the simple text2img workflow, I used what I assume were all core/included/base nodes of comfyui. I would have to go out of my way to download and nude custom nodes correct? Is there any danger in downloading models, loras, etc?
>>
>>107301171
Lowering cfg helps. Or you could do a separate color grading pass if you have the software.
>>
File: Wan_00011-1.mp4 (3.84 MB, 720x720)
3.84 MB
3.84 MB MP4
This is fun. Just using 4 steps too.
>>
>>107301186
lmaooooooooo
>>
>>107301172
Yeah there is but as long as you don't download them from shady links it's 99% fine.
Like if I google up my nordic bank customer service, second highest link is a scam website. Always read what you're doing.
>>
>>107301186
Teach me, Master!

Palingenesis Edition?
>>
>>107301186

workflow or else I will tell your mom!
>>
>>107300855
>>107301201
It's this one.
>>
>>107301186
workflow or I coll the poh lis
https://youtu.be/a2ZkQTecBwA?t=14
>>
>>107301201
>Palingenesis
Apparently that guy is pretty shifty https://www.reddit.com/r/comfyui/comments/1o1skhn/a_word_of_caution_against/
>>
>>107301229
Cool! I follow him too
>>
>>107301256

this >>107300855
>>
File: 1749023099339509.jpg (591 KB, 1147x842)
591 KB
591 KB JPG
>>107301184
Dropped cfg to 1 for hi-res pass. It didn't help.
>>
>>107301272
thanks anon
>>
>you actually really AREN'T supposed to tag character loras
>>
>>107301278
High denoise also changes the image and sort of accumulates colors too.
It's a balancing thing. Try 0.35 for example.
Or, do you have useless or additional vae encode/decode nodes somewhere?
>>
>>107301172
>I would have to go out of my way to download and nude custom nodes correct?
Correct.
>Is there any danger in downloading models, loras, etc?
.safetensors are as safe as anything, they only contain data and not code.
.pt (pickletensors) contain code so I wouldn't just run any random .pt file.
>>
>>107301278
I don't know why you're experiencing this, but if you share a workflow that has the issue you're describing I'll investigate. As an aside nowadays I run all my gens through manual color corrections, usually levels and color balance or color temperature.
>>
>>107300816
Is there a place in 4chan dedicated to posting videos like that? Or red boards like that?
>>
>>107301399
AI images are pretty baked in when it comes to black levels. Too much so in most cases.
>>
>>107300322
top right image looks like it could be in gagetown or wainright

I have spent so much time there

good times. I miss it
>>
imagine post processing colors instead of fixing your gen settings kek
>>
File: 1752390221292707.png (1.9 MB, 1248x848)
1.9 MB
1.9 MB PNG
>>107300600
>>107300610
now try to make that with chroma
>>
>>107301451
Try /asdg/ on /aco/
>>
File: 1745146669144660.jpg (708 KB, 1657x948)
708 KB
708 KB JPG
>>107301338
0.35 helps a little. but to get rid of color shift completely, I'd need to set denoising to 0.

>>107301399
Here is a minimal workflow. https://files.catbox.moe/9ejgru.png
>>
>>107301503
btw instead of using nearest exact which basically pixelates the image use lanczos instead. Never understood why cumfy used nearest in his examples.
>>
>>107301451
One can post videos "like this" and spicier as catbox on this board

Or are you just looking for more "like this"?
>>
>>107301503
The issue is your AI upscale model. It's the one that's changing the colors. Use another one or none.
You didn't crop your attachment enough on the right, FYI.
>>
>>107301479
haha. I never imagined I'd see anything like this
>>
>>107301484
Thanks! I'm unironically glad they allow these in a cartoon board. None are as good as these upskirt we have here, though.
>>
>>107301503
Just use a color match node ffs anon, how can you be so narrow minded
>>
>>107301515
I meant a thread for those without the static pictures. I got bored and quit these back in August 2024 because I was dead bored of static pictures, my interest resurfaced with AI video, but it was just slop, now the spice was solved (flawless videos of my favorite fetishes??) but most people still post the still pics anyway (I get it, since without the static girl the video wouldn't have been made, but the pics on the thread seem like old tech in comparison.)
We now have the technology to make any girl ever show her panties, and this girl is the only one I've seen doing it.
>>
>>107301586
It's not real and automatic grade is often bad
>>
File: 1761895244002688.jpg (359 KB, 1312x455)
359 KB
359 KB JPG
>>107301531
Nope. Previewing the image after the upscale pass shows that the colors are the same.

>>107301513
I thought for downscaling it shouldn't matter that much. Hi-res pass adds noise. But I compared two options side by side and it turns out nearest exact causes much higher color shift. The color shift is non-existent right after upscale. It appears only after sampling. Perhaps aliasing introduced by nearest exact is interpreted as additional noise for diffusion... Weird.

>>107301586
Of course I can use it. But it's a hack. It's more useful to find why exactly it's happening.
>>
>>107301620
>Nope. Previewing the image after the upscale pass shows that the colors are the same
I tried, and same. However, for some reason, the sampler stops changing the colors if I use another model or none. So you might still want to do that. I'll play around with it further though.
>>
>>107301620
Yeah, it should not matter that much, but the upscale model likes lanczos more because it is smoother and sharper. This is my bro-science.
>>
>>107301607
>We now have the technology to make any girl ever show her panties

What a time to be alive and kicking!

>and this girl is the only one I've seen doing it.

It's me being bored and pushing the limits a bit while testing stuff
>>
>>107301656
I prefer violence and no ai model is good at it.
>>
>>107301172
>would have to go out of my way to download
Use comfyui manager
> Is there any danger in downloading models, loras, etc?
.safetensors are as the name implies
I wouldn't really worry about models.
Extensions/custom nodes are the primary security concern here.
Run comfy under docker/podman if you care about security.
>>
>>107301694
There's also sandboxes, those can withstand anything without a sweatdrop.
>>
>>107301694
Why not cutetensor?
>>
So, Hunyuan 1.5 was a nothingburger?
Damn still no good video diffusion model for us VRAMlets.
>>
File: 1676772204312160.jpg (60 KB, 750x750)
60 KB
60 KB JPG
does anyone have a good qwen edit workflow? If possible, a 2D to reality workflow
>>
For anyone wanting more, /r/ has you covered (except people are making AI videos starting from a real picture instead of a generated one.)
>>
File: Wan_00021.mp4 (3.92 MB, 720x720)
3.92 MB
3.92 MB MP4
".. Did I just shoot myself..? Where am I? Surely heaven.. no.. wait..."
>>
>>107301451
/gif/vdg or the grokslop thread
>>107301722
from the examples anon posted it seems to have a tendency to break down very badly. not good. I wonder how well it does nsfw because let's be real, wan can't do benis or bagina acceptably
>>107301784
here's a standard workflow: https://files.catbox.moe/erp3i2.png
there's multiple loras for that around:
https://www.reddit.com/r/StableDiffusion/comments/1p4c1jd/test_images_of_the_new_version_of_alltoreal02/
https://civitai.com/models/1906441/qwen-edit-reality-transform-by-aldniki
>>
>>107294189
please boss, gibbe the prompt
>>
>>107301620

y u mad, bro?

https://files.catbox.moe/0ryv90.mp4
>>
>>107301535
with nano banana pro I can finally do the "fine I'll do the unlikely crossover myself" since no one is doing this kind of shit and it has a lot of potential lol
>>
>>107301874
>censoring books on a catbox
why? that's the point of a catbox, to show the goods
>>
File: Nano Banana Pro.png (2.03 MB, 1408x768)
2.03 MB
2.03 MB PNG
Come on Elon, bring this kino back
>>
>>107301885
I asked wan to generate it censored like this, and it did. And I do not want to risk a ban only because the censorship was not perfect))

https://files.catbox.moe/y54fzc.mp4
https://files.catbox.moe/x74goh.mp4
https://files.catbox.moe/xrn6a1.mp4
https://files.catbox.moe/8j49yb.mp4
>>
File: Wan_00036-1.mp4 (3.54 MB, 720x720)
3.54 MB
3.54 MB MP4
Highly recommend doing these multiple shots, I'm leveling up my prompting so much.
>>
>>107301813
thx bro
>>
>>107300407
You should try pornmaster-pro-illustrious-and-noob models v3-v5. The models very flexible in the 3dcg, semi realism and hyper realism.
>>
>>107302143
You are one of the most gifted posters in these threads. Even better than the other guy who always posts furry images.
>>
>>107300630
uooooh my dick go up!!
>>
File: 6_00002_.mp4 (1.46 MB, 992x1440)
1.46 MB
1.46 MB MP4
>>107302143
>>
File: sarcasm.png (202 KB, 403x402)
202 KB
202 KB PNG
>>107302189

???
>>
>>107302333
I'm not breaking your balls here
Sonny boy it was a genuine comment
>>
File: 00065-1987604997.png (2.72 MB, 1824x1248)
2.72 MB
2.72 MB PNG
>>107302189
there's other talented anons here besides me. I wish more people would post proper gens and not memes.
>>
File: flux_0001.jpg (937 KB, 2496x2432)
937 KB
937 KB JPG
anon in another thread was discussing split cfg for image gen and since i don't trust any of you, i set up the experiment which was pretty easy in comfy. in this context i'm using flux so instead of cfg i'm splitting the fluxguidance, i can repeat this with a different model. i did add a comparison with face detailers since anon said low cfg gen can mess up faces. high 5, low 2.5

high cfg | high->low | low cfg (no fd)
high cfg | high->low | low cfg (w/fd)
>>
File: flux_0002.jpg (950 KB, 2496x2432)
950 KB
950 KB JPG
>prompt: a professional photograph taken with a 50mm prime lens, a female wrestler, latina, black hair with red highlights in the front, center part, heavy eyeliner, a neon yellow and orange wrestling outfit with a cutout, flexing in the middle of a wrestling ring

high 6, low 1.9
>>
>>107302511
Oh hey, I'm the one who posted about this originally. The reason I tried doing this originally was because I wanted a low CFG for the last few steps of drawing-style gens to get smooth shading and fine linework, but it does seem to have other benefits.
Well, actually the real reason why I started doing this was because I was watching previews of dpmpp_2s_ancestral / linear_quadratic gens and I saw so many promising looking gens go completely to hell exactly on step 22/40 plus or minus one step, so I was looking for ways to prevent that from happening.
I haven't tried this approach with photorealistic gens under Flux or Chroma yet but I'll be interested to see what results you get. I'm still fairly new to all this, for what that's worth.
>>
File: AnimateDiff_00125-1.mp4 (3.77 MB, 486x330)
3.77 MB
3.77 MB MP4
Returning to old stuff I tried to i2v when I first started. Man what a leap both me and the tech has taken.
>>
>>107302629
toss me the model and high and low cfg
>>
>>107302775

I posted a catbox in the previous thread if that is of any use, and if cloudflare will ever let me post

https://files.catbox.moe/uoydz0.png
>>
File: chroma.jpg (856 KB, 2496x2432)
856 KB
856 KB JPG
chroma
>>
File: wai_illustrious.jpg (1.1 MB, 2496x2432)
1.1 MB
1.1 MB JPG
>wai

>random civit prompt
>>
File: netayume_0002.jpg (909 KB, 2304x2048)
909 KB
909 KB JPG
>>107302875
here's an example of 40 steps, 30 high cfg, 10 low. kind of a neat experiment.

https://files.catbox.moe/hqvsyz.png
>>
lightning bros have we found a final solution to the slow motion problem?
>>
File: netayume_0004.jpg (971 KB, 2304x2048)
971 KB
971 KB JPG
last post, 40 steps, 50% split, and a heavy face detailer. i don't watch anime (pig disgusting), i don't really know what's best but a low cfg + face detail seems easiest.
>>
>>107303151
>>107303237
Oh, nice. By the way, on the off chance that anybody is interested in the gen itself, that particular gen just happened to be a good seed in the midst of some crappy ones, but I tweaked the prompts a bit (particularly the neg, which was left over from something else I was doing) and got a much better and more consistent run of gens starting here.

https://files.catbox.moe/pk89fe.png
>>
Finally figured out how to publish an update to your own custom node. I have no idea why this information is so damn hidden, basically in pyproject.toml if you update the version field (and commit) then this is all it takes. Fucking magic.
>>
File: comfyuidevs.png (70 KB, 977x821)
70 KB
70 KB PNG
Why is comfy like this? another shit update that changes everything
>>
>>107303237
Garbage
>>
>>107303402
He's a cunt. Could no longer tolerate this shit and decided to finally switch. AniStudio might be a fresh UI but every update is an improvement, and the dev is as friendly and helpful as possible. I just like the vibe
>>
>reinstall Onetrainer
>redownload the ChromaHD repo
>use the Chroma 16gb LoRa preset
>load the lora in comfy
>still getting this error "Error while deserializing header: invalid JSON in header: EOF while parsing a value at line 1 column 0"

I don't know what i'm doing wrong here. Has anyone encountered this issue when making their Chroma loras?
>>
File: comfyuidevs2.png (116 KB, 1065x919)
116 KB
116 KB PNG
>>107303402
why are comfy devs like this?
>>
>>107303434
I've never used Onetrainer but from the error message it sounds like you're trying to load an empty file. Safetensors starts with a JSON-formatted table of contents doesn't it?
>>
File: 1746539675091799.png (2.85 MB, 1120x1440)
2.85 MB
2.85 MB PNG
>>
>>107303413
same. anistudio has most essential stuff anyway
>>
>>107300855
Just turns to grey after the first frame. The demo YT videeo is a completely different workflow from the one offered for dl in the video :/
>>
>>107303447
Huh, need to remember to not update.
>>
>>107303465
Copy the entire entire comfy install folder to make frequent backups. Use links to the models and outputs to external folders.
>>
File: comfyuidevs3.png (118 KB, 1073x1211)
118 KB
118 KB PNG
holy fuck the comfy-frontend devs are fucking dumb
>>
>>107303413
With comfy I had to be anxious about updating every time. Even worse with a1111
anistudio just works
>>
>>107303492
I have forgotten to use links. Need to change this asap.
>>
>>107303449
right, but im not sure why its being read as an empty file, ive made sure that the dataset is on the correct path so i'm not sure what's going on. im going to do some more testing as i'm sure this is user error but i cannot pinpoint where im messing up.

i do not have this issue with the sdxl presets so im confused.
>>
What's a prompt to keep 2d images 2d without turning into that plastic 3d look?
>>
>>107303584
Try putting "The image is in the style of a 3D render" in the negative prompt
>>
>>107303584
Add art medium tag.
>>
Let's say, hypothetically, someone wanted to photoshop pictures of their penis onto photos of their friends then have ai animate it so that the friends are sucking the penis. How would one go about this? Hypothetically.
>>
>>107303646
you need pictures of their penises first. if you can do that, then its all easy from there.
>>
>>107303646
pretty straight forward but this is illegal, fyi
>>
>>107303646
>>107303661
Yep, I'm calling the cops.
>>
File: 1736781652781956.jpg (1.01 MB, 1248x1824)
1.01 MB
1.01 MB JPG
>>
>>107303661
it's illegal to distribute the results
>>
>>107303681
I look like this
>>
I love this kind of comparison where the esl poster didn't even bother to fix their prompt's grammar, invalidating the comparison
https://redlib.catsarch.com/r/StableDiffusion/comments/1p4paqd/my_testing_of_hunyuanvideo_15_and_wan_22_on_i2v/
>>
>>107303737
wish reddit added the location feature like X did, anytime I see some indian posting stuff it makes me barf
>>
>>107303809
we need flags across the chan too.
>>
>>107303829
its true, I gotta admit Musk was pretty smart about revealing the location of X accounts, hopefully other social media platforms will follow, thats a pretty smart way to mitigate (AI) grifters accounts, it would be sooo funny to see if all those "ai influencers" accounts revealed where they are from, especially those indians pretending to be european ai girls lol
>>
>>107303809
lmao so scared of the brownies
>>
>>107303829
would be amazingly helpful considering how much shitposting comes from american residential proxies.
>>
File: 1752003725163078.mp4 (2.74 MB, 640x640)
2.74 MB
2.74 MB MP4
>>107303464
Got it working (lightx lora was off). I'm a moron.
>>
>>107303861
indian detected
>hallo saar this is my low quality post trying to get upboats in reddit sar, with my shitty workflow, bad accent, and I need to plaster my ugly indian face in the results

Why are they like that?
>>
>>107303900
Sure, sweetie, whatever helps you sleep at night.
>>
File: 1737516482929349.png (1.08 MB, 2861x4547)
1.08 MB
1.08 MB PNG
>>107303861
settle down dalit while brahmin is talking
>>
>>107303967
mustve taken some time and effort. A for effort buddy!
>>
>>107304001
thankfully learning what per capita means and that shitting on the streets is bad took no effort
>>
>>107301804
Holy fucking keeek
>>
>>107301804
kekw
>>
>>107303879
Good job!
It is not seamless though.

I can see the cut at 5 seconds
>>
is there a guide outside of the rentry in the OP for training with chroma that you'd recommend? i cant get this fucking lora to train correctly in onetrainer.
>>
>>107304211
Chroma is a bitch to deal with, but there aer some ppl here who have some great results. look for emma poster or some youtbe girl. ask them
>>
>>107304229
very well, i SUMMON THEE, EMMA POSTER. Provide me with the means to train a lora with chroma and my (YOU) is YOURS!!!
>>
File: chroma_00057_.png (2.9 MB, 1152x1536)
2.9 MB
2.9 MB PNG
>>
i like to gen women with huge dicks
>>
>>107304386
proof?
>>
>>107302510
ah yes, super proper 1girl 3dp gens. so exciting. wow.
>>
>>107303860
>its true, I gotta admit Musk was pretty smart about revealing the location of X accounts
but he rolled it back and I think it's not gonna be back again :(
>>
>>107304391
you post one nogen. enough talk.
>>
what best?
HOON YEN?
WANX?
KADINK?
LTT?
>>
>>107303646
>step 1: photoshop picture of your penis onto picture of your friend
>step 2: animate it
>>
Looks like normalfags finally see the rising prices of RAM. TY based Lodestone for Ramtorch.
>>
https://civitai.com/models/2150906?modelVersionId=2432765
grab it before it inevitably gets deleted
>>
>>107304683
I made a similar lora that was for changing outfits but people just use it to make people nude
that's still up, go figure
>>
File: 1755425132711703.png (2.21 MB, 1120x1440)
2.21 MB
2.21 MB PNG
SPARK chroma has a weird freckle/piercing artifact it keeps putting on belly buttons
>>
>>107304771
on civitai if the uploader gets banned for whatever reason, their loras get deleted as well.
>>
>>107304683
Great Lord!
>>
File: 8_00005_.mp4 (1.59 MB, 1088x1440)
1.59 MB
1.59 MB MP4
>>107304311
>>
>>107304683
what is it? cant see it
>>
>>107301016

can ai do vipstyle fails with same character
>>
>>107304869
figured it out. just say belly button and DON'T say navel, it associates that with navel piercings.
>>
>>107304683
overfit shitty lora
>>
>>107304894
video transition to nudity
>>
File: flux_69420.png (1.26 MB, 832x1216)
1.26 MB
1.26 MB PNG
>>
>>107304683
>https://civitai.com/models/2150906?modelVersionId=2432765

>Nude Scanner Wan 2.2
>https://civitai.com/user/Cyberfolk
prompt:video of all women get scanned,the clothes are removed and become nude

Mirror https://gofile.io/d/PbSiNV
>>
>>107302143
might give em a look then. still stuck in my habits thoughbeithowever.
>>
>>107304387
>>>/gif/29847978
>>
>>107305256
I tried this with smoothmix.

The cloths were removed, however, not by some "scanner magic", but mostly manually

Maybe, smoothmix overthinks too much

>>107305129
>without this lora though
https://files.catbox.moe/bmcgl6.mp4
>>
File: 1752366112134005.png (1.62 MB, 1408x768)
1.62 MB
1.62 MB PNG
Is Nano Banana Pro now the most realistic model of them all?
>>
>>107305601
>he posted it again
>still with no uncensored catbox
:(
>>
File: 81200035_.png (2.28 MB, 1152x1440)
2.28 MB
2.28 MB PNG
vera wang call me, I got ideas
>>
>>
>>107305691
is nsfw content possible? also, the bodyguards all have the same face.
>>
>check twitter
>viral post about women being irresponsible whores (same as any other day on twitter)
>ok lets check my overnight gens
>disgusted by all these stupid whores I generated
>into the trash they go
You need to be in the right headspace when you check the gens
>>
File: 1755310363224929.png (2.01 MB, 1408x768)
2.01 MB
2.01 MB PNG
>>107305870
>is nsfw content possible?
obviously not, but you can get something like this
>>
>>107305879
>he goes on twitter
>he lets others influence his emotions
>he lets his emotions influence his creative actions
brown feminine npc
>>
>>107305879
you sound deranged and low iq, it makes sense that all you gen are the whores you hate, retard.
>>
>>107305879
>>check twitter
that's your fucking problem, why are you wasting your time in this shit hole?
>>
>>107305879
it's axiomatic, why would you feelings ever change on it, that's like getting pissed off at gravity
>>
>>107305901
>>107305915
>>107305917
>>107305936
I am a white man and I'm proud of my sensitive feminine nature. My Sun is in Pisces.
>>
File: flux_0079.png (1.42 MB, 832x1216)
1.42 MB
1.42 MB PNG
what is going on with civit? it seems like there are a lot of people making terrible loras and then spamming them with 2-3 alt accounts all day. does this make money? are they just trying to get free gens?

>>107305620
i guess she painted all the way down
>>
>>107305945
Keep telling yourself that, it makes it easier. Trust me, I know.
>>
>use sdxl preset in onetrainer with booru tags
>lora works
>use chroma preset with same dataset but natural language txt files
>lora doesn't work
i'm about to admit defeat and stick to illustrious forevermore. this is like the 8th time i train with lora and keep getting the same fucking error.
>>
>>107305945
lol you're a fag
>>
>>107305620
>I tried this with smoothmix.
>The cloths were removed, however, not by some "scanner magic", but mostly manually
>Maybe, smoothmix overthinks too much

Indeed, after switching to a pair of "normal" Wan2.2_I2V_Q8 + lightx loras, the effect of the scan lora became visible

>>107305998
https://files.catbox.moe/dyyktu.mp4
>>
>>107306434
so how does the scan lora work to improve anatomy in normal nude gens? certain loras i've tried do that, but they're trained with really grainy videos that make nips/the whole gen look really shitty in turn.
>>
>>107306504
It does not improve the anatomy, it re-invents it and force upon the character

All complain that it give bigger boobs which I can confirm
>>
it wouldnt be smart to upgrade from a 3080 to a 5070 ti, for $800, now would it. its the only thing that would fit in my mid tower kek
>>
>>107306642
get a 5090 or shut the fuck up
>>
>>107306642
>PNY RTX PRO 4000 Blackwell SFF Edition 24 GB

on Santa's secret list
>>
>>107306642
It's not too late to get one of those chink modded 4090Ds
>>
>>107306642
You can upgrade to a ComfyCloud subscription and get better gens faster instead of wasting money on underpowered localslop
>>
>>107306639
fuck.
well besides that, if you or anyone else wants to take a look at my setup here, and suggest how to improve motion, i'd appreciate it. dildo riding lora never seems to work right for any images, always either goes halfway down the dildo or only the head, for every image. and every setup i've ever had.
https://files.catbox.moe/0u8ghu.mp4

>>107306748
i fucked your mum without a comfycloud subscription, what do you think of that donny?
>>
>>107306798
>https://files.catbox.moe/0u8ghu.mp4

very much appreciated indeed!

I came across PainterI2V a couple of time. A good opportunity to see it in action.

Thank you, kind anon!
>>
bruhhh >>107306252
>>
You guys think nano banana pro actually generates at 4k by 4k resolution, or is it a hidden upscaler?
>>
>>107306960
it's not that difficult to gen at 4k
>>
i know it's ldg but what happened to vibe voice? didn't they have a 7~B model?
>>
>>107307126
I THINK indextts2 is what replaced it. at least that's what i've been using.
also these things never get updated after v0.1 so. expect that.
>>
>>107306977
it's difficult to do it without creating repetitive artifacts on a model trained at half the resolution or lower.
>>
>>107307175
stop using models from 2023. oh wait, it's local!
>>
>>107307187
your mom's from 1923 but that doesn't stop me from hittin' it!
>>
>>107307126
There hasn't been a single open source TTS model that has come even close to ElevenLabs
>>
File: 1753250941456143.png (1.87 MB, 1298x1342)
1.87 MB
1.87 MB PNG
>>107306960
It's 4K native anon, generating at higher resolutions is only a problem for vramlets
>>
long live local diffusion
>>
long live illustrious
>>
>>107303450
Kek
>>
File: ComfyUI_07312_.png (1.68 MB, 912x1200)
1.68 MB
1.68 MB PNG
>>
File: ComfyUI_07280_.png (957 KB, 832x1248)
957 KB
957 KB PNG
>>
File: ComfyUI_07306_.png (1.83 MB, 912x1200)
1.83 MB
1.83 MB PNG
>>
File: ComfyUI_07158_.png (1.14 MB, 768x1360)
1.14 MB
1.14 MB PNG
>>
>>107307302
There is no model you can run that will generate at 4K properly regardless of how much vram you have.
>>
>>107300575
whoa unexpectedly cute
>>
Do most of you use Neta Yume for anime? How well does it perform for your purposes in your opinion?

How well does it work for generating specific anime characters? Particularly in their original style.
>>
File: ComfyUI_06761_.png (1.06 MB, 1200x896)
1.06 MB
1.06 MB PNG
>>107309239
>>
>>107309239
>Do most of you use Neta Yume for anime? How well does it perform for your purposes in your opinion?
no, the styles and anatomy suck, and probably has an ok knowledge of characters
>>
File: 1761236982749893.png (1.28 MB, 1440x1120)
1.28 MB
1.28 MB PNG
>>
>>107309271
Impressive. Very nice.
Let's see some cum splattered on her face.
>>
>>107309239
No, I still use Illustrious-based models. I haven't tried Neta Yume, but I was not impressed with Neta Lumina. And I hate natural language prompting when it barely supports concepts that tag-based prompting doesn't. It just makes genning less predictable/more blackboxy.
>>
File: 1745172496542911.png (1.63 MB, 1440x1120)
1.63 MB
1.63 MB PNG
>>107309271
>>
>>107309276
>And I hate natural language prompting when it barely supports concepts that tag-based prompting doesn't. It just makes genning less predictable/more blackboxy.
Interesting. I think I'm the opposite, I actually want to leverage randomness a bit, specifically for multi-character compositions.

I don't really like how Illustrious requires either region mapping or controlnet for any images with two or more characters, and controlnet is basically a requirement for anything involving the characters overlapping. I have to either find a reference image that depicts exactly what I want (which might not exist) or I have to create an image myself by posing characters in blender and configuring the camera.
Using natural language to articulate a series of multi-character compositions sounds like it would be fun. I used to do this with OpenAI's Sora, but the major problem there was censorship and the generic anime style that can't be changed.
>>
File: 1734567434840888.png (1.67 MB, 1440x1120)
1.67 MB
1.67 MB PNG
>>
File: 00095-880233606.png (2.06 MB, 1080x1920)
2.06 MB
2.06 MB PNG
>>107309301
how updated is neta yumine now? last i tried it, it had zero artist knowledge and it was a pain in the ass to use. but it has potential, i want it to succeed noobai.
>>
>>107309301
>I actually want to leverage randomness a bit
these local dit models barely have variation for the same prompt so I don't see this as a point supporting it

>I don't really like how Illustrious requires either region mapping or controlnet for any images with two or more characters
this is just a case of bad tooling. I wouldn't see this as a pain point if we just had built in tools for making it easy instead of a noodle nightmare or using krita
>>
Nano Banana Pro is actually fucking shit at editing photos of people. Original Nano Banana was lightyears ahead. It's DOA.
>>
>>107309340
Somehow, it's as bad Kontext Dev at maintaining ID. No idea how Jewgle could fuck something up this bad.
>>
The Yume author has a new version in the works fwiw
>>
>>107309340
>>107309344
>outrageous claim
>zero proof
wew
>>
>>107309339
>these local dit models barely have variation for the same prompt so I don't see this as a point supporting it
I mean, I'd be happy to vary the prompt to get different results each time. Could even delegate that duty to an LM.

>this is just a case of bad tooling. I wouldn't see this as a pain point if we just had built in tools for making it easy instead of a noodle nightmare or using krita
You mean there is actually some kind of noodle-infested implementation for posing rigged characters and positioning the camera in a 3D scene for a controlnet?
>krita
my drawing is beg level so even drawing a controlnet reference is out of the question for me lol.
drawfags who hate AI should realize that their skillset can actually be leveraged effectively in the diffusion pipeline.
>>
>>107309346
Can't post example, but I'm not the only one with the problem
https://www.reddit.com/r/Bard/comments/1mvhhh1/nanobanana_is_amazing_but_it_does_not_produce/

https://www.reddit.com/r/Bard/comments/1p2pdyd/faces_look_worse_in_editing_on_nano_banana_pro/
>>
>>107309301
>>107309301
>I don't really like how Illustrious requires either region mapping or controlnet for any images with two or more characters
My biggest problem with a lot of those local models that use natural language prompting is that they actually ignore a lot of complex instructions you wouldn't be able to prompt with tags. So they add blackboxiness without adding composition abilities.
>I have to create an image myself by posing characters in blender and configuring the camera.
I often create the ControlNet reference images using a model with good natural language prompting abilities, and then continue on Illustrious which is really fast (good for iterating), and has the best styles support. I do a ton of prompt tweaking, rerolling and inpainting, so staying on a next-gen model for the entire process would take me a very long time.
Maybe worth looking into, but the creator of RouWei has an early stage T5Gemma->CLIP adapter which enables natural language prompting for SDXL models that already seems to work pretty well. He also has a 16-channel VAE adapter. His stuff is optimized for RouWei but I've seen people have some success with other checkpoints. I haven't tried either yet though.
>>
>>107309421
>I often create the ControlNet reference images using a model with good natural language prompting abilities
Are you talking about cloud models like Sora? Do you not generate anything lewd or violent for your controlnets?
>>
File: 00007-3365253514.png (2.63 MB, 1080x1920)
2.63 MB
2.63 MB PNG
shut the fuck up about cloud models on /ldg/ you niggers
>>
>>107309488
I've used Chroma, Kontext, and Sora. I haven't genned custom NSFW ControlNet source images though.
>>
Nano Banana can't generate Messi playing Blitzball btw. Only Sam GODman's GPT models have such worldly knowledge. The new qwen will fail at it too because it's chinkshit.
>>
One of the best...

https://civitaiarchive.com/users/playtime_ai
>>
File: 888686.jpg (1.93 MB, 2348x3500)
1.93 MB
1.93 MB JPG
>>107305870
I think so? I saw some nsfw examples, but only with paid API. It definitely has some knowledge in regards to softcore stuff like this.
>>
Can anyone do a better job of explaining how prompts work in NetaYume than the official template in ComfyUI? Because the information here is different to the NetaLumina prompt book on their website.

What is the @ for? The comment implies artist, but it uses "comfyanonymous" as an example. There is no comfyanonymous tag on danbooru.
What is the "Prompt Start Tag"? Ctrl-F reveals no matches for "prompt start" in the prompt. Is it the first quotation mark? What about when I want to start off with Danbooru tags to establish a structure first? That's what the Neta Lumina prompt book recommends. Do I prefix the list of tags with quotations? Or is that only for the natural language?
>>
File: ComfyUI_08750_.jpg (993 KB, 2048x2048)
993 KB
993 KB JPG
>>107300600
>>107300610
Do you seriously think this garbage slopped to hell and back model is better than Chroma? Kek, don't be so retarded anon. Pic rel is HD Flash mixed with v50, and I don't even know what your prompt is, but don't delude yourself on what Chroma can or cannot do based on a LoRA. Now let's see how "good" this model you claim is good truly is.
>>
has anyone found out how to do nerds or NEETs in chroma yet?

it doesn't do bad teeth, for example
>>
File: ComfyUI_07632_.png (1.74 MB, 1152x1152)
1.74 MB
1.74 MB PNG
>>107309685
>Something went wrong with this response, please try again.

Of course there can't be a proper comparison. Chroma wins first pic

>Amateur photograph, a Japanese idol woman, performing an advanced contortion pose at a bench in a barn. She is sitting on a surface with her legs bent backward and extended over her shoulders, so that her feet are positioned and touching over her head, displaying an impressive level of flexibility.

>There's a rooftop rope attached to both of her ankles and duck tape on her mouth.

>A long white towel is draped over her entire front for modesty. She has straight black hair with bangs.
>>
>>107309642
#1, never ever use a comfy default prompt
#2, prompt like an llm or ask an llm to write a prompt for you by describing what you want
#3, nobody here knows how to prompt something that looks good because the model kind of sucks

hope this helps
>>
File: AnimateDiff_00168.mp4 (2.25 MB, 720x840)
2.25 MB
2.25 MB MP4
Trying out this lora combo some autist made. It pushes the lora weight to 2 which with a pusa lora it seems to turn out fine. It's as if you're doing t2v but with a starting image.
>>
File: 44541254454.jpg (540 KB, 2650x1712)
540 KB
540 KB JPG
>>107309685
Now, this is not bad, better than the Seedream result, but Chroma has better details. Though the fucked up teeth on the right can be seen as an artifact of a normal smartphone image, it's still a somewhat slopped result. I never asked for the phone in the selfie to be visible.
>>
>>107302761
i see joe rogan got a new hrt prescription.
>>
File: 551215114541.jpg (351 KB, 2522x1432)
351 KB
351 KB JPG
>>107309685
>>107309805
Aaaand the sloppiness reveals itself. It's interesting that the safety has been so hardly baked that it's not even properly following the prompt "from a low angle".
>>
File: AnimateDiff_00174.mp4 (2.15 MB, 720x856)
2.15 MB
2.15 MB MP4
>>107309792
I meant 5.
It's dogshit.
>>
File: 451544415441.jpg (1.05 MB, 3546x2541)
1.05 MB
1.05 MB JPG
>>107309685
Onto the next, the Pro result is about on par with Seedream, but that is a professional photo, still doesn't care that I'm asking for an amateur photograph. I will now show another Chroma seed for the same prompt.
>>
File: chroma_00073_.png (1.34 MB, 1152x1152)
1.34 MB
1.34 MB PNG
>>107309849
>>
File: G6Uki94WMAAMMIj.jpg (219 KB, 1024x1024)
219 KB
219 KB JPG
>Amateur photograph from 1998 of a middle-aged artist copying an image by hand from a computer screen to an oil painting on stretched canvas, but the image is itself the photo of the artist painting the recursive image.
google won
>>
File: 545412122154.jpg (1.21 MB, 3546x2541)
1.21 MB
1.21 MB JPG
>>107309900
Second seed, similar pose and angle hints to me a lack of variety on Nano Banana Pro side. I can continue, but you get the point. Don't think I can test anything with panties, or gore, so Chroma wins by a margin.
>>
File: ComfyUI_07795_.png (2.42 MB, 1152x1152)
2.42 MB
2.42 MB PNG
>>107309958
Though, if I were an APIcuck, instead of embarrassing myself I would just pour resources into getting Chroma to run from the cloud on stuff like Runpod.

Here's a challenge for you APIcucks. Not even showing graphic gore, just a normal Halloween themed picture.
>>
>Not even showing graphic gore
those hands would disagree, chromakek
>>
File: ComfyUI_08147_.png (1.83 MB, 1152x1152)
1.83 MB
1.83 MB PNG
>>107309998
Her hands are slightly bruised and cut. Chroma is a perceptive model you see.
>>
File: 0121.jpg (1.02 MB, 1536x2752)
1.02 MB
1.02 MB JPG
>>107309990
what made you so mad anon. I don't know if you know, but you can use both, api and local models at the same time. Both have their use cases. You can even mix and match!
>>
>>107310018
Because I expected the model to actually be good, and it sucks. Sad because it's quite evident that the ungated model is good, but it's too censored to consistently give good outputs, similar to Sora 2.
>>
>>107310037
I agree with >>107310018 but thank you for running comparisons anyway, always educational
>>
File: 61613541.jpg (1.86 MB, 2281x3400)
1.86 MB
1.86 MB JPG
>>107310037
>but it's too censored to consistently give good outputs
it is way less censored than the first one. NB1 would kill me for trying to generate stuff like this.
>>
local is simply outdated. WAN selling out was the final blow. i've comfortably migrated my entire workflow to comfy API and i'm happy to report i get results that are both faster and better
>>
File: 1746447487584380.png (304 KB, 2793x1398)
304 KB
304 KB PNG
>>107309420
on the gemini site nano banana pro is absolutely terrible, on llmarena it's pretty good though, and people seem to like it here
>>
>>107309685
>>107309958
Side note, mixing Chroma HD Flash delta with HD seems to correct the Chromatic aberration issues that normal HD Flash has, and also fixes consistency with text and other aspects like defaulting to overexposure in pics. Seems to be best of both worlds, since the fixes in small details from original HD Flash carries over, in addition to improved promot following from original Chroma HD.

Looking at some old kino images from prior early versions, it used to do really nice low quality type pics, but those earlier versions would destroy the small details and even HD would struggle a lot.

Would be nice to merge the models properly so I don't have to load them seperately, but I never figured out how.
>>
>>107310137
Also I wonder if LoRAs work well with this kind of setup, since HD is just so much better at raw aesthetic prowess.
>>
Is CFGZeroStar relevant these days?
>>
File: file.png (2.03 MB, 1408x768)
2.03 MB
2.03 MB PNG
>>107309420
I don't know for edit, but as its own image model, ans especially as a manga/comic creator this shit is fucking amazing
>>
I want to start learning to do t2i with upscaling and inpainting with comfy, coming from forge. Multidiffusion tiled upscaling using denoise, not some stock anime4x crap.
Can't seem to find a workflow that contains it all.
Does someone have one to share?
>>
File: 1759592555676602.png (891 KB, 1793x936)
891 KB
891 KB PNG
Subgraphs seem mature now, they have almost everything I'd want and no weird jank gotchas that I can detect yet.

This is the Wan 2.2 rentry's workflow, with the settings I don't use tucked away inside subgraphs + a tiny bit of cg-use-everywhere (controversial, but very much optional). If wants it: https://files.catbox.moe/0g9sm3.json
>>
>Comfy broke NAG yet again
>git asking for credentials for a fucking pull request
I hate open source so goddamn much...
>>
Probably not the right thread, but I'll ask anyway.
I've got some issues with DeepFaceLab.

Basically, it's not using my GPU at all when training.

I start training, my GPU stays at 0-1%, but my CPU jumps to 100% usage, so much that if I let it run for a couple of minutes, the temp gets high enough that my PC just shuts down.

Which is weird because I remember using DeepFaceLab a while ago on that same PC and it worked.
>>
>>107310503
How are you supposed to use loras on this? Specifically for the low-noise model where they make a bigger difference.
>>
File: 1762875025335199.png (69 KB, 701x484)
69 KB
69 KB PNG
>>107310820
See this screenshot.
>>
>>107310057
catbox?
>>
>>107310057
>>107310918 (me)
or prompt?
>>
>>107310722
You need to reinstall pytorch and ensure it's the one with gpu support.
>>
>>107310965
Haven't tried this yet, I'll check it out.
>>
checked
>>
>>107310077
enjoy your paid censorship, lmaooooooo
>>
>>107311297
>>107311297
>>107311297
samefag
>>
>>107300855
shape-shifting face



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.