[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Let it snow Edition

Previously on /sdg/: >>103489402

Beginner UI local install
EasyDiffusion: easydiffusion.github.io
Metastable: metastable.studio
SwarmUI: github.com/mcmonkeyprojects/SwarmUI

>Local install
Forge: github.com/lllyasviel/stable-diffusion-webui-forge
ComfyUI: github.com/comfyanonymous/ComfyUI
SD.Next: github.com/vladmandic/automatic
InvokeAI: github.com/invoke-ai/InvokeAI

>Use a VAE if your images look washed out
rentry.org/sdvae

>SD 3.5 info & download
rentry.org/sdg-link#sd35
civitai.com/models/896953/stable-diffusion-35-medium
huggingface.co/city96/stable-diffusion-3.5-medium-gguf
---
civitai.com/models/878387/stable-diffusion-35-large
huggingface.co/city96/stable-diffusion-3.5-large-gguf

>Try online without registration
sd3.5-medium: replicate.com/stability-ai/stable-diffusion-3.5-medium
sd3.5-large: replicate.com/stability-ai/stable-diffusion-3.5-large
sd3.5-turbo: replicate.com/stability-ai/stable-diffusion-3.5-large-turbo
flux-dev: huggingface.co/spaces/black-forest-labs/FLUX.1-dev
txt2img: www.mage.space

>Models, LoRAs & upscaling
civitai.com
huggingface.co
aitracker.art
openmodeldb.info

>Index of guides and other tools
rentry.org/sdg-link
rentry.org/rentrysd

>View and submit GPU performance data
vladmandic.github.io/sd-extension-system-info/pages/benchmark.html

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
rentry.org/hdgcb
catbox.moe

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>
File: fSDG_News_000133_.jpg (606 KB, 1024x1152)
606 KB
606 KB JPG
>mfw Resource news

12/12/2024

>Fast Prompt Alignment for Text-to-Image Generation
https://github.com/tiktok/fast_prompt_alignment

>FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models
https://matankleiner.github.io/flowedit

>TryOffAnyone: Tiled Cloth Generation from a Dressed Person
https://github.com/ixarchakos/try-off-anyone

>Leffa: Learning Flow Fields in Attention for Controllable Person Image Generation
https://github.com/franciszzj/Leffa

>InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models
https://github.com/Hundredl/InvDiff

>cc12m-4mp: 25k image 4mp dataset
https://www.reddit.com/r/StableDiffusion/comments/1hctvnz/25k_image_4mp_dataset

>3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation
https://github.com/KwaiVGI/3DTrajMaster

12/11/2024

>One Diffusion to Generate Them All [40GB lul]
https://lehduong.github.io/OneDiffusion-homepage

>ComfyUI-IF_MemoAvatar: Memory-Guided Diffusion for Expressive Talking Video Generation
https://github.com/if-ai/ComfyUI-IF_MemoAvatar

>DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
https://jianzongwu.github.io/projects/diffsensei

>StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization
https://github.com/Aria-Zhangjl/StoryWeaver

>Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
https://jianhongbai.github.io/SynCamMaster

>ObjCtrl-2.5D: Training-free Object Control with Camera Poses
https://wzhouxiff.github.io/projects/ObjCtrl-2.5D

>FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
https://fiva-dataset.github.io

>FireFlow: Fast Inversion of Rectified Flow for Image Semantic Editing
https://github.com/HolmesShuan/FireFlow-Fast-Inversion-of-Rectified-Flow-for-Image-Semantic-Editing

12/10/2024

>Sana-ComfyUI
https://github.com/NVlabs/Sana/blob/main/asset/docs/ComfyUI/comfyui.md
>>
File: fSDG_News_000132_.jpg (589 KB, 1024x1152)
589 KB
589 KB JPG
>mfw Research news

12/12/2024

>ObjectMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation
https://arxiv.org/abs/2412.08645

>BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation
https://research.nvidia.com/labs/amri/projects/blade

>DMin: Scalable Training Data Influence Estimation for Diffusion Models
https://arxiv.org/abs/2412.08637

>Multimodal Latent Language Modeling with Next-Token Diffusion
https://arxiv.org/abs/2412.08635

>Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning
https://arxiv.org/abs/2412.08614

>LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations
https://arxiv.org/abs/2412.08580

>StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements
https://arxiv.org/abs/2412.08503

>Video Summarization using Denoising Diffusion Probabilistic Model
https://arxiv.org/abs/2412.08357

>ALoRE: Efficient Visual Adaptation via Aggregating Low Rank Experts
https://arxiv.org/abs/2412.08341

>Generate Any Scene: Evaluating and Improving Text-to-Vision Generation with Scene Graph Programming
https://arxiv.org/abs/2412.08221

>TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning
https://arxiv.org/abs/2412.08176

>Analyzing and Improving Model Collapse in Rectified Flow Models
https://arxiv.org/abs/2412.08175

>Antelope: Potent and Concealed Jailbreak Attack Strategy
https://arxiv.org/abs/2412.08156

>AsyncDSB: Schedule-Asynchronous Diffusion Schrödinger Bridge for Image Inpainting
https://arxiv.org/abs/2412.08149

>Seeing Syntax: Uncovering Syntactic Learning Limitations in Vision-Language Models
https://arxiv.org/abs/2412.08111

>Generative Zoo
https://genzoo.is.tue.mpg.de

>Doubly-Universal Adversarial Perturbations: Deceiving Vision-Language Models Across Both Images and Text with a Single Perturbation
https://arxiv.org/abs/2412.08108
>>
>>
>>103499685
that looks like shit
>>
File: 1.jpg (129 KB, 1152x896)
129 KB
129 KB JPG
>>
>>
>>
>>103499697
>https://www.reddit.com/r/StableDiffusion/comments/1hctvnz/25k_image_4mp_dataset

HF link is below, kinda looks sloppy linking a reddit stub like that
https://huggingface.co/datasets/opendiffusionai/cc12m-4mp
>>
File: denp_00076_.png (3.28 MB, 1824x1248)
3.28 MB
3.28 MB PNG
>>103500016
whoops. good call
>>
>>103499775
wtf is this real
>>
File: 07873-1295920401.png (3.18 MB, 1432x1840)
3.18 MB
3.18 MB PNG
>>
>>103499775
the smallest NY rat
>>
>>
File: 07874-61669404.png (2.61 MB, 1432x1840)
2.61 MB
2.61 MB PNG
>>
this general absolutely fucking sucks
>>
File: file.png (162 KB, 475x839)
162 KB
162 KB PNG
>>103499697
Haven't seen poses like this before
>>
File: denp_00073_.png (3.72 MB, 1824x1248)
3.72 MB
3.72 MB PNG
>>103500556
those are the only poses possible. technology hasn't advanced enough yet
>>
File: 1711601316285862.png (1.62 MB, 946x946)
1.62 MB
1.62 MB PNG
>>103499685
What's the use case for parquet datasets?
>>
File: denp_00072_.png (3.41 MB, 1824x1248)
3.41 MB
3.41 MB PNG
>>103500788
in general or specific to AIML?
>>
File: 1714851359536056.png (1.56 MB, 946x946)
1.56 MB
1.56 MB PNG
>>103500824
Specific to Lora training
>>
File: dens_00014_.png (363 KB, 1824x1248)
363 KB
363 KB PNG
>>
>>103501054
When was the last time you had a job?
>>
File: dens_00016_.png (388 KB, 1824x1248)
388 KB
388 KB PNG
>>103501071
things were good for a while there before the board forced me to resign
>>
File: 323511587587411970.webm (896 KB, 1072x720)
896 KB
896 KB WEBM
>>103490295
>>
File: delux_sr_00024_.png (1.39 MB, 1536x1024)
1.39 MB
1.39 MB PNG
>>103501116
LMAO, thats awesome
>>
File: dens_00018_.png (336 KB, 1824x1248)
336 KB
336 KB PNG
>>
File: dens_00023_.png (344 KB, 1824x1248)
344 KB
344 KB PNG
>>
Animationfag here. I enjoy drawing and making those drawings do the squash and stretch, but shit like having to color and shade every frame is a colossal pain in the ass. That's the sort of thing I'd like to either hand off to a bunch of south korean slave laborers or an AI. And I can't afford korean slave laborers. So what AI would you recommend I look into for doing the tedious grunt work part of animation?
>>
File: dens_00027_.png (381 KB, 1824x1248)
381 KB
381 KB PNG
>>103501428
coloring animations isn't a common thing around here so idk if they're any sort of gold standard or commonly used tool people might be using. I've got this thing that might work
https://github.com/luckyhzt/LVCD

there was another tool I vaguely remember but I can't find it rn
>>
File: 1712372462648860.png (969 KB, 896x1152)
969 KB
969 KB PNG
>>
File: dens_00028_.png (368 KB, 1824x1248)
368 KB
368 KB PNG
>>103501428
>>103501499
OH, this was the thing I was thinking of

https://ykdai.github.io/projects/InclusionMatching
>>
File: 42262634.png (957 KB, 613x1024)
957 KB
957 KB PNG
>>
File: dens_00030_.png (391 KB, 1824x1248)
391 KB
391 KB PNG
>>103501623
is that a giant mutant pedobear attempting to sexually assault laura kinney in a school uniform? quite the gen
>>
>>103501683
yes yes
>>
File: dens_00040_.png (451 KB, 1824x1248)
451 KB
451 KB PNG
>>
File: 20_ppl.jpg (191 KB, 1850x1850)
191 KB
191 KB JPG
Its fast food friday
>>
What's the best GPU to get for AI? Just anything with high Vram? I'm eyeing either 4070 or 4080
>>
File: 15_ppl.jpg (248 KB, 1850x1850)
248 KB
248 KB JPG
>>
File: a.jpg (2.47 MB, 2048x2048)
2.47 MB
2.47 MB JPG
>>103502029
I use a 4080 Super, I would say a 4070 TI Super would be good too, really anything Nvidia with 16+ gigs of RAM.
>>
File: b.jpg (2.4 MB, 2048x2048)
2.4 MB
2.4 MB JPG
>>
>>103501700
Newfags are completely unaware of pedobear branding.
>>
I miss Schizoanon
>>
>>103501071
I have my own start-up. Been working on my own killer app for a while. Think about iPhone, social connectivity and high quality gens.
>>
>>103502389
Perhaps you should consult a medical professional. This obsession has been going on for over two years by now.
>>
>>103500788
>>103500885
None because community training scripts/UIs don't support them (they should)
>>
>>103502420
what do you mean?
>>
>>103502386
I havent seen pedobear in so long I had to think for a second if it was actually him. he's fallen off radar and zoomqueers dont know who he is
>>
>>103502438
If you can't read...
I'm going to be commercially successful but you are still here next year, repeating those same lines again and again. Enjoy, little buddy!
>>
>>103502386
Imagine gatekeeping a website
>>
>>103502456
Ok.
>>
>>103502530
Ok.
>>
>>103502456
Are you beans?
>>
>>103502641
I'm not. I keep things under wraps.
>>
>>103502753
You sound like a nigbophile
>>
>>103502961
What is that? I don't understand.
>>
File: 00000-1670799449.jpg (1.66 MB, 1536x2064)
1.66 MB
1.66 MB JPG
hoo morning
>>
Suggest zero-terminal XL models that actually work, I want to check out this technology.
I've been suggested "cosxl" but it only produces noise for me. Terminus can't be found in a single safetensors file.
>>
Finally got the name of unet to show-up in the filename. Confyui is a pain but it's definitively flexible.
>>
>>
File: 00002-2162758047.jpg (1.24 MB, 1536x2064)
1.24 MB
1.24 MB JPG
>>
Remember keep your pedophile in his containment
>>
>>103503603
Are you the said pedophile?
>>
>>103503603
Which one?
>>
File: 00004-4157839041.jpg (1.86 MB, 1536x2064)
1.86 MB
1.86 MB JPG
>>
>>103503347
You would know that for some models you'll need to set clip skip to -2 or otherwise it's just noise.
>>
gm
>>
>>103503851
Morning~!
>>
>>
>>103503717
I've never seen clipskip doing anything for XL even with the enabled setting.
>>
>gm
>>
File: 00005-3243897079.jpg (1.82 MB, 1536x2064)
1.82 MB
1.82 MB JPG
>>103503991
good morning
>>
>>103503989
Then you are wrong and clueless.
>>
>>
>>
File: forest river winter.webm (3.61 MB, 1920x960)
3.61 MB
3.61 MB WEBM
>>103503329
Good morning
>>
>>
>>
File: 00008-3135606506.jpg (1.76 MB, 1536x2064)
1.76 MB
1.76 MB JPG
>>103504413
always makes me want to try animatedif again, but last time it seemed not to work with forge
>>
>>
File: 00010-1433911647.jpg (1.8 MB, 2048x2752)
1.8 MB
1.8 MB JPG
>>
Flux your way to Fucking Somewhere else.

Take your artsy bullshit and Fuck Off.
>>
>>103504498
This shit
>>
>>
>>103504766
Is that a haiku?
>>
>>103504766
post a gen if you want anyone to take you seriously. this isn't ldg
>>
this general fucking sucks
>>
>>103505028
stop trolling
>>
>>103505028
why are you here every day?
>>
>>
>>103505056
I am not here everyday

>>103505049
Not trolling, this general really does fucking suck
>>
>>103505056
why dont you ask that to >>103501818 ?
>>
File: 00051-2418293898.jpg (808 KB, 1536x1968)
808 KB
808 KB JPG
>>103505139
>>
File: 1.jpg (106 KB, 1152x896)
106 KB
106 KB JPG
>>103500234
It's the Big Lust model which excels at realism and not just NSFW.
>>
>>
>>
getting colder
>>
File: dens_00041_.png (384 KB, 1824x1248)
384 KB
384 KB PNG
>>103504169
dragons: the musical. lol

>>103504413
I like the flowers vs withered trees contrast

>>103505139
where else would I be?

>>103505275
you've been really cooking lately. very artistic gens all week

>>103505276
big cute

>>103505290
me and my gf
>>
File: 00053-3648539969.jpg (1.54 MB, 1536x2064)
1.54 MB
1.54 MB JPG
>>103505407
cheated on that one, used an mspaint
https://files.catbox.moe/luvaju.png
burnt out a bit, had some nightmarish dreams, will have to move on to a new prompt type, or more likely revisit an old prompt
these have been with sd 1.5, i feel the most artistic of model types
>>
>>103505275
oh youre buttbuddies my apologies
>>
File: 00054-1479359960.jpg (1.25 MB, 1536x2064)
1.25 MB
1.25 MB JPG
>>103505516
you should try being friendly to other people, and not nasty, hateful, if whatever your mental illness/es are allows positive relations with others
>>
>>103505583
which part was not friendly?
>>
>>103505604
no need to play dumb/er than you already are
>>
>>103505621
that wasnt very nice
>>
>>103505637
well deserved
>>
>>103505650
why?
>>
File: 00056-4250338403.jpg (1.13 MB, 1536x2064)
1.13 MB
1.13 MB JPG
>>103505681
>>
>>103505583
Aren't you the one who's fresh out of a mental ward?
>>
>>103505696
and what point are you trying, and failing, to make?
>>
>>103505710
Your projection is pretty apparent.
>>
File: dens_00043_.png (267 KB, 1824x1248)
267 KB
267 KB PNG
>>103505725
you portray yourself as deeply mentally ill
>>
File: 00057-2865880023.jpg (1.6 MB, 1536x2064)
1.6 MB
1.6 MB JPG
>>103505725
you are very confused. i have made no secret of being mentally ill, the point, that you are apparently too dim to understand, is that that other anon/you spend your time being nasty to others on here, and that whatever your illnesses are might be why you are internet trolls, rather than posting art or whatever. but i've engaged too much with you today, so i'll try not to talk to you further, at least for today.
>>
>>103505741
What do you base that on?
>>103505781
I don't really spend time here anymore, actually.
>>
>>
File: dens_00045_.png (360 KB, 1824x1248)
360 KB
360 KB PNG
>>103505804
>I don't really spend time here anymore,
if only

>>103505853
is this nai4?
>>
>>103505865
illustrious :P
>>
File: dens_00046_.png (401 KB, 1824x1248)
401 KB
401 KB PNG
>>103505877
fantastic
>>
>>103505903
ty pixel-san *_*
>>
>>103505865
I'm not sure why you presume I do.
>>
>>
File: 00058-3538257084.jpg (1.41 MB, 1536x2064)
1.41 MB
1.41 MB JPG
>>
File: dens_00047_.png (279 KB, 1824x1248)
279 KB
279 KB PNG
>>
controlnet-depth ideas (nsfw) if people are curious *_*
https://files.catbox.moe/1xxh53.png


+ a funny bad video
>>
>>103506051
what video model?
>>
>>103505903
How goes the job search?
>>
>>103506062
kling on first try, but i dont have credits to spam attempts
>>
File: dens_00049_.png (384 KB, 1824x1248)
384 KB
384 KB PNG
>>103506064
what do you mean?
>>
>>103506084
I thought you were looking for a job in the ai space
>>
File: dens_00055_.png (361 KB, 1824x1248)
361 KB
361 KB PNG
>>103506093
being open to new opportunities isn't synonymous with a "job search"
>>
>>103506127
I'm sorry I misunderstood
What are you working on now?
>>
Morning anons
Happy Friday 13th
>>
File: dens_00058_.png (352 KB, 1824x1248)
352 KB
352 KB PNG
>>103506166
why do you ask?

>>103506186
lol, nice gen. I didn't even realize it was friday the 13th.
>>
>>103506186
spooky
>>
>>103506215
I remember you talking about building things for the general. Could I be mixing you up with another regular?
>>
>>103506186
gm
>>
File: 1726101563533112.png (11 KB, 660x411)
11 KB
11 KB PNG
holy fuck comfyui is the complete opposite of comfy. literally every thing i try and do it i run into some stupid fucking error. i was trying to get animatediff to work last night and it was a different problem but always a fucking problem.

what does this even mean?

i've updated fucking everything including the python shit, uninstalled fucking everything except this node, i cannot even begin to conceptualise what the problem is and nobody online seems to have had it so there's no fixes.

if someone knows the solution then great, but really i'm just posting because i fucking hate comfyui so much. who makes this shit?
>>
>>103506367
A very autistic man that used to fuck trani until his optics went to shit and now he has distanced himself from him publicly.
>>
File: 00061-694935780.jpg (1.4 MB, 1536x2064)
1.4 MB
1.4 MB JPG
>>
>>103505912
Domo... Kurisumasu ni wa Kentakkii~!
>>
File: 00783-3265116481.png (419 KB, 512x688)
419 KB
419 KB PNG
>>
File: dens_00064_.png (357 KB, 1824x1248)
357 KB
357 KB PNG
>>103506328
>building things for the general
you mean the news thing I was working on?
>>
File: 00069-3850224395.jpg (915 KB, 1536x2064)
915 KB
915 KB JPG
rediscovered this old weird prompt that gens 2d subjects in 3d settings
>>
>>103506620
Oh yeah that!
Can we expect a Christmas gift?
>>
File: 00074-2089109648.jpg (1.28 MB, 1536x2064)
1.28 MB
1.28 MB JPG
wandered into the weird part of the outback
https://youtu.be/7nc13m4xTYA?si=FvIccE33O_xfPww4
>>
>>103506367
i hate comfyui
i did a git pull yesterday on the comfyui folder and the ui is totally different and even more of a pain in the ass than it was. i dont even want to bother reverting back.
never again
i dont even care about what i might miss out on anymore
>>
File: dens_00065_.png (310 KB, 1824x1248)
310 KB
310 KB PNG
>>103506993
its pretty defunct at this point. I hope to find some time to work on it some through the holidays but I'm not gonna have too much time. I'm also severely backlogged on posting actual news content to it that I have no idea how I'd ever get caught up. I have ideas on tooling to help lower the barrier to getting content publishes, but that needs time too

I really needed to focus up and get something published before the summer cuz thats really where my available time dropped off a cliff
>>
>>
schizophrenic situation
>>
File: 120755-tmp.png (3.02 MB, 1536x1920)
3.02 MB
3.02 MB PNG
>>
File: dens_00067_.png (307 KB, 1824x1248)
307 KB
307 KB PNG
>>103507124
amending this: I'm generally using my free time to make sure I'm getting news posted here over getting work done on the news site. sifting through news represents a fair bit of time each day and I'd rather put that time towards posting up news here first, but that mostly doesn't leave me with dev time
>>
File: 1727124565904729.jpg (772 KB, 1248x1824)
772 KB
772 KB JPG
>>
>>103507124
What happens in the summer and how can I help?
>>
>>
File: dens_00070_.png (399 KB, 1824x1248)
399 KB
399 KB PNG
>>103507250
>how can I help?
give me 100 btc
>>
File: 120762-tmp.png (2.98 MB, 1536x1920)
2.98 MB
2.98 MB PNG
>>
Yo
>>
Is Swarm the best ui right now? Sounds like it has virtually everything you'd want to prompt, and even a built-in comfy for workflows.
>>
File: 1727258685919881.jpg (975 KB, 1248x1824)
975 KB
975 KB JPG
>>
>>103507442
Buy an ad
>>
File: dens_00072_.png (382 KB, 1824x1248)
382 KB
382 KB PNG
>>103507398
yo

>>103507442
I don't think a single person here uses swarm

>>103507453
love it. very RA-anon coded
>>
File: 120770-tmp.png (3.07 MB, 2208x1248)
3.07 MB
3.07 MB PNG
>>
>>103507281
I don't have that....
>>
File: 1728422177845481.jpg (1.8 MB, 2688x1536)
1.8 MB
1.8 MB JPG
>>
>>
Is flux still the only good model?
>>
>>103507296
>>103507596
Fran sexo
>>
>>103507932
it was never the only good model
>>
>>103507281
>nigbo begging for crypto
>>
>debo getting defensive
>>
>>103507148
do you use like masking to get these layouts or is this entirely prompts? What model and loras is this?
>>
>>
>>103508043
can sd3.5 do text yet?
>>
>>103506186
like others I had no idea it was friday the 13th. years ago I would have gone and spent the day watching the movies to celebrate.
>>
File: dens_00077_.png (352 KB, 1824x1248)
352 KB
352 KB PNG
>>103507647
me neither :(

>>103507932
models are just tools. whether they're good or bad depends on whether you can wield them well or not

>>103508109
it was a joke
....unless?

>>103508219
it can do a decent job with text. I think its much more about the t5 encoder than the model itself


>>103508358
>years ago I would have gone and spent the day watching the movies
now you're too old and jaded to enjoy the whimsy of youth?
>>
>>
File: dens_00079_.png (319 KB, 1824x1248)
319 KB
319 KB PNG
>>103508384
are the spiders friends or foes? sword fighting a legion of giant spiders would be pretty bad ass
>>
Debo is on disability newfags
>>
okay, I'm the anon that posted an orihime pic the other day, >>103442196
I slowly figured out that my VAE was causing the glitchy artifacts
what's a good VAE for anime images? I was using kl-f8-anime2.safetensors (i'm on SD 1.5)
>>
File: dens_00080_.png (411 KB, 1824x1248)
411 KB
411 KB PNG
>>103508480
I'd ask "what's my disability?" but that's too much of a layup
>>
>****
>>
>>103508503
For 2 years you have dedicated 15+ hours to this general
>>
File: dens_00082_.png (367 KB, 1824x1248)
367 KB
367 KB PNG
>>103508550
thanks. I am very dedicated
>>
>mfw Resource news

12/12/2024

>Nig Prompt Alignment for Text-to-Image Generation
https://github.com/tiktok/fast_prompt_alignment

>FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models
https://matankleiner.github.io/flowedit

>TryOffAnyone: Tiled Cloth Generation from a Dressed Person
https://github.com/ixarchakos/try-off-anyone

>Leffa: Learning Flow Fields in Attention for Controllable Person Image Generation
https://github.com/franciszzj/Leffa

>InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models
https://github.com/Hundredl/InvDiff

>cc12m-4mp: 25k image 4mp dataset
https://www.reddit.com/r/StableDiffusion/comments/1hctvnz/25k_image_4mp_dataset

>3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation
https://github.com/KwaiVGI/3DTrajMaster

12/11/2024

>One Diffusion to Generate Them All [40GB lul]
https://lehduong.github.io/OneDiffusion-homepage

>ComfyUI-IF_MemoAvatar: Memory-Guided Diffusion for Expressive Talking Video Generation
https://github.com/if-ai/ComfyUI-IF_MemoAvatar

>DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
https://jianzongwu.github.io/projects/diffsensei

>StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization
https://github.com/Aria-Zhangjl/StoryWeaver

>Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
https://jianhongbai.github.io/SynCamMaster

>ObjCtrl-2.5D: Training-free Object Control with Camera Poses
https://wzhouxiff.github.io/projects/ObjCtrl-2.5D

>FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
https://fiva-dataset.github.io

>FireFlow: Fast Inversion of Rectified Flow for Image Semantic Editing
https://github.com/HolmesShuan/FireFlow-Fast-Inversion-of-Rectified-Flow-for-Image-Semantic-Editing

12/10/2024

>Sana-ComfyUI
https://github.com/NVlabs/Sana/blob/main/asset/docs/ComfyUI/comfyui.md
>>
>mfw Research news

12/12/2024

>BallMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation
https://arxiv.org/abs/2412.08645

>BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation
https://research.nvidia.com/labs/amri/projects/blade

>DMin: Scalable Training Data Influence Estimation for Diffusion Models
https://arxiv.org/abs/2412.08637

>Multimodal Latent Language Modeling with Next-Token Diffusion
https://arxiv.org/abs/2412.08635

>Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning
https://arxiv.org/abs/2412.08614

>LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations
https://arxiv.org/abs/2412.08580

>StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements
https://arxiv.org/abs/2412.08503

>Video Summarization using Denoising Diffusion Probabilistic Model
https://arxiv.org/abs/2412.08357

>ALoRE: Efficient Visual Adaptation via Aggregating Low Rank Experts
https://arxiv.org/abs/2412.08341

>Generate Any Scene: Evaluating and Improving Text-to-Vision Generation with Scene Graph Programming
https://arxiv.org/abs/2412.08221

>TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning
https://arxiv.org/abs/2412.08176

>Analyzing and Improving Model Collapse in Rectified Flow Models
https://arxiv.org/abs/2412.08175

>Antelope: Potent and Concealed Jailbreak Attack Strategy
https://arxiv.org/abs/2412.08156

>AsyncDSB: Schedule-Asynchronous Diffusion Schrödinger Bridge for Image Inpainting
https://arxiv.org/abs/2412.08149

>Seeing Syntax: Uncovering Syntactic Learning Limitations in Vision-Language Models
https://arxiv.org/abs/2412.08111

>Generative Zoo
https://genzoo.is.tue.mpg.de

>Doubly-Universal Adversarial Perturbations: Deceiving Vision-Language Models Across Both Images and Text with a Single Perturbation
https://arxiv.org/abs/2412.08108
>>
>mfw Resource news

12/12/2024

>Nig Prompt Alignment for Text-to-Image Generation
https://github.com/tiktok/fast_prompt_alignment

>FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models
https://matankleiner.github.io/flowedit

>TryOffAnyone: Tiled Cloth Generation from a Dressed Person
https://github.com/ixarchakos/try-off-anyone

>Leffa: Learning Flow Fields in Attention for Controllable Person Image Generation
https://github.com/franciszzj/Leffa

>InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models
https://github.com/Hundredl/InvDiff

>cc12m-4mp: 25k image 4mp dataset
https://www.reddit.com/r/StableDiffusion/comments/1hctvnz/25k_image_4mp_dataset

>3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation
https://github.com/KwaiVGI/3DTrajMaster

12/11/2024

>One Diffusion to Generate Them All [40GB lul]
https://lehduong.github.io/OneDiffusion-homepage

>ComfyUI-IF_MemoAvatar: Memory-Guided Diffusion for Expressive Talking Video Generation
https://github.com/if-ai/ComfyUI-IF_MemoAvatar

>DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
https://jianzongwu.github.io/projects/diffsensei

>StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization
https://github.com/Aria-Zhangjl/StoryWeaver

>Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
https://jianhongbai.github.io/SynCamMaster

>ObjCtrl-2.5D: Training-free Object Control with Camera Poses
https://wzhouxiff.github.io/projects/ObjCtrl-2.5D

>FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
https://fiva-dataset.github.io

>FireFlow: Fast Inversion of Rectified Flow for Image Semantic Editing
https://github.com/HolmesShuan/FireFlow-Fast-Inversion-of-Rectified-Flow-for-Image-Semantic-Editing

12/10/2024

>Sna-ComfyUI
https://github.com/NVlabs/Sana/blob/main/asset/docs/ComfyUI/comfyui.md
>>
>mfw Research news

12/12/2024

>BallMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation
https://arxiv.org/abs/2412.08645

>BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation
https://research.nvidia.com/labs/amri/projects/blade

>DMin: Scalable Training Data Influence Estimation for Diffusion Models
https://arxiv.org/abs/2412.08637

>Multimodal Latent Language Modeling with Next-Token Diffusion
https://arxiv.org/abs/2412.08635

>Bitshchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning
https://arxiv.org/abs/2412.08614

>GAIION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations
https://arxiv.org/abs/2413.08580

>StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements
https://arxiv.org/abs/2412.08503

>Video Summarization using Denoising Diffusion Probabilistic Model
https://arxiv.org/abs/2412.08357

>ALoRE: Efficient Visual Adaptation via Aggregating Low Rank Experts
https://arxiv.org/abs/2412.08341

>Generate Any Scene: Evaluating and Improving Text-to-Vision Generation with Scene Graph Programming
https://arxiv.org/abs/2412.08221

>TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning
https://arxiv.org/abs/2412.08176

>Analyzing and Improving Model Collapse in Rectified Flow Models
https://arxiv.org/abs/2412.08175

>Antelope: Potent and Concealed Jailbreak Attack Strategy
https://arxiv.org/abs/2412.08156

>AsyncDSB: Schedule-Asynchronous Diffusion Schrödinger Bridge for Image Inpainting
https://arxiv.org/abs/2412.08149

>Seeing Syntax: Uncovering Syntactic Learning Limitations in Vision-Language Models
https://arxiv.org/abs/2412.08111

>Generative Zoo
https://genzoo.is.tue.mpg.de

>Doubly-Universal Adversarial Perturbations: Deceiving Vision-Language Models Across Both Images and Text with a Single Perturbation
https://arxiv.org/abs/2412.08108
>>
>>103508550
What about me? I'm here pretty often too.
>>
File: 000000_21997_.png (2.23 MB, 950x1690)
2.23 MB
2.23 MB PNG
>>103504766
>Take your artsy bullshit and FLUX Off
Ironic.
>>
>>103508429
i dont think they're friendly
>>
File: 00083-3127048840.jpg (1.19 MB, 1536x2064)
1.19 MB
1.19 MB JPG
https://youtu.be/dcDB9SHYW0w?si=GViF084PqUDmdHVk
>>
>>103508381
>now you're too old and jaded to enjoy the whimsy of youth?
I think its more ive seen the movies so many times now the though of watching them makes me roll my eyes with a smile. I love the movies but I dont really enjoy movies anymore and rather just sit and do other things with my time. And yeah im probably getting old and jaded about life also.
>>
File: 120804-tmp.jpg (574 KB, 1536x2016)
574 KB
574 KB JPG
>>103508175
>do you use like masking to get these layouts or is this entirely prompts?
It's just prompts. I se Multiple views, panels, and speech bubbles.
>>103508175
>What model and loras is this?
Right now I'm using waiNSFWIllustrious and the only lora I'm using is an add detailer.

Here's a catbox. https://files.catbox.moe/vi7b3i.png
>>
>>
File: dens_00093_.png (326 KB, 1824x1248)
326 KB
326 KB PNG
>>103508792
I wish I could figure out how to get suno to do anything even close to this sort of sound

>>103508819
too bad, there's supposedly a new wave of horror movie making happening that has all the film-heads gushing. I get your sentiment though cuz contemporary media is almost all product and zero passion. but 2025 is the year of AI video and the passionate visionaries will be unchained

it is a bit of a shame that all the vidgen moved into ldg. I guess the dream of sdg network is dead
>>
Nobody likes you or your pedo tranny friends
>>
all the imaggen did, too
>>
>>103508664
imposter
>>
>>103508985
What does this mean?
>>
File: 120810-tmp.jpg (496 KB, 1536x2016)
496 KB
496 KB JPG
>>
>>
Luigi was with me.
>>
File: 120813-tmp.png (3.78 MB, 1536x2016)
3.78 MB
3.78 MB PNG
>>
File: deme_000151_.jpg (845 KB, 1344x1024)
845 KB
845 KB JPG
>>103509086
my queen
>>
>deme
faglord more like
>>
File: 120816-tmp.jpg (482 KB, 2304x1536)
482 KB
482 KB JPG
>>
gottem'
>>
>>103508939
yeah all this ai art and video and chatbots is bringing in a whole era of being able to unleash the stories in you. I joke im going to make a comic book universe over the next 10 years and even been putting powers and lore to some of the art and charactors I make. And with the direction things are going who knows I just might put out a universe in ai that has thousands of charactors in it.

5 years ago if someone said it'd be the start of the era of being your own video maker and art maker in 5 years most would go sure whatever but now im able to make horrible goku vs tien videos and other stuff and characters in ai art all day. so its possible now but im also hopeful.
>>
>>103509182
>only two nipples
2/10 would not bang
>>
File: 120821-tmp.png (3.63 MB, 1536x1920)
3.63 MB
3.63 MB PNG
>>
>>103509132
kek
>>
>>103509315
Please do a Purple Witch page, that would annoy some anons and that's funny.
>>
@103509331
nig*bo
>>
File: 120825-tmp.png (3.61 MB, 1536x1920)
3.61 MB
3.61 MB PNG
>>103509381
>>
Are julien and cumfart not friends anymore?
>>
File: PW.jpg (169 KB, 1024x1024)
169 KB
169 KB JPG
>>
File: 120827-tmp.png (3.42 MB, 1536x1920)
3.42 MB
3.42 MB PNG
>>
>>103509458
>>103509519
Wow amazing! Going to whip out my iwn version later on..
>>
>>103509533
ok dweebs
>>
File: 1707034933108457.jpg (1.03 MB, 1248x1824)
1.03 MB
1.03 MB JPG
>>
>>103509467
It's just your typical gay drama. One day they are literally married, next day things can never even be reconciled again.
>>
File: 120829-tmp.png (3.39 MB, 1536x1920)
3.39 MB
3.39 MB PNG
>>
File: 00088-3127048845.png (3.9 MB, 1536x2064)
3.9 MB
3.9 MB PNG
>>
File: 120832-tmp.png (3.93 MB, 1536x1920)
3.93 MB
3.93 MB PNG
Enough smug witch
>>
>>103504020
In Comfy there's no reason whatsoever to even use the "Clip Set Last Layer" node at all for any XL model
>>
File: 00089-2589956620.jpg (1.39 MB, 1536x2064)
1.39 MB
1.39 MB JPG
really bored/boring
if there was a gen you liked from the past, i might try to make more, or a general idea, just really bored
>>
>>103509830
Gen artsy fish with that 2d/3d prompt.
>>
File: IMG_1487.jpg (1.61 MB, 1808x3216)
1.61 MB
1.61 MB JPG
big fan of whatever this is
>>
File: file.png (520 KB, 512x512)
520 KB
520 KB PNG
check out my cute 1boy
>>
@103509840
d*bo
>>
File: file.png (2.15 MB, 1024x1024)
2.15 MB
2.15 MB PNG
>>103509850
hang on I fixed it.
>>
i'm too high for this
>>
>>103509856
Thank you for the compliment. Love your dedication.
>>
>>103509877
Drunken genning is not advisable either at some point it becomes almost impossible to think about prompts and getting confused about different nodes and outputs.
>>
>>103509882
>nogen
>>
>>103509877
or you're not high enough
>>
>>103509893
oh i dont use [non]comfyui
also not drunk
>>103509903
this
>>
File: 120842-tmp.png (3.01 MB, 1920x1536)
3.01 MB
3.01 MB PNG
>>
>>103509905
I did never say you're drunk, retard.
>>
File: 000000_22000_.png (2.22 MB, 936x1664)
2.22 MB
2.22 MB PNG
>Alcohol is bad, bad I tell you! HUGE improvements in health after abstinence
>>
>>103509921
your comment is rude
>>
>it's literally the same four avatarfags spamming their garbage on here daily
>>
File: 1715702664607721.jpg (486 KB, 1248x590)
486 KB
486 KB JPG
>>
>>103509964
and me
singular schizo anon
>>
>fran actually thinks people look at his spam
little did he know nobody does
>>
>>
>>103509964
Found the upset 'Artist'.
>>
>>103509987
Found the avatarfag slopper
>>
>>103509910
Fran is one of the best genners here.
>>
>>103509979
I do
>>
i miss schizo anon
>>
File: download (1) (5).jpg (285 KB, 1024x1024)
285 KB
285 KB JPG
>>
File: 00096-301119683.jpg (1.02 MB, 1536x2304)
1.02 MB
1.02 MB JPG
fish thing didn't work out, resorted to maid women again
>>
Is anybody interested in posting anything besides cringe?
>>
File: download (1) (15).jpg (317 KB, 1024x1024)
317 KB
317 KB JPG
>>103510196
mmm?
>>
File: 1729901484983592.png (2.03 MB, 1536x1536)
2.03 MB
2.03 MB PNG
>>103510204
Niga you r cringe.
Yor post make as much sense at this gun.
>>
File: 00097-450738756.png (3.31 MB, 1280x1920)
3.31 MB
3.31 MB PNG
>>103510196
>>
>>103509478
>purple witch was a cadbury witch all along
I should have known!
>>
Friday night, time to gen
>>
File: 1725984518664963.jpg (190 KB, 1024x1024)
190 KB
190 KB JPG
Hey if anyone here makes political memes,
consider sharing!

>>>/pol/491347687
>>
>>
File: 120851-tmp.png (3.01 MB, 1536x1728)
3.01 MB
3.01 MB PNG
>>
>>103509964
I post on occasion when im not doing something
>>
File: chibi robo suit.jpg (56 KB, 512x768)
56 KB
56 KB JPG
>>103510196
you are one of those types arent you?
>>
File: download (3).jpg (241 KB, 1024x1024)
241 KB
241 KB JPG
>>103510213
no
>>
File: 1731369583877920.webm (1.2 MB, 720x720)
1.2 MB
1.2 MB WEBM
>>103510347
>>103510354
Thank you for the timely response.
I can post cringe too, most of mine was generated online though.
>>
File: file.jpg (97 KB, 1024x1024)
97 KB
97 KB JPG
>>103510335
same desu
>>
>>103510378
Hope the recent HF storage changes didn't impact you too much
>>
File: 00100-982645448.jpg (898 KB, 1280x1920)
898 KB
898 KB JPG
>>
File: file.png (31 KB, 889x214)
31 KB
31 KB PNG
>>103510409
looks like it changed again, there's no real limit on free accounts, just don't abuse it, uploads should be useful and it's unlimited for pro/enterprise
https://huggingface.co/docs/hub/storage-limits
>>
File: img (3) (1).jpg (476 KB, 1024x1024)
476 KB
476 KB JPG
>>103510372
I used krea :/
>>
File: 1733276982846611.png (1.53 MB, 1536x2048)
1.53 MB
1.53 MB PNG
>>103510422
Are these supposed to be anatomically real anime grills or something?
>>
>>103510476
just whatever the generator gives, i use lots of wildcards
>>
Next Thread

>>103510485
>>103510485
>>103510485
>>
>>103510378
Ahh it's pixel-san himself *_*
>>
File: download (1) (2).jpg (526 KB, 1024x1024)
526 KB
526 KB JPG
>>103510500
I'll fill it up
>>
File: download (1) (6).jpg (367 KB, 1024x1024)
367 KB
367 KB JPG
>>
File: 1734138635476_image.jpg (131 KB, 984x984)
131 KB
131 KB JPG
Filling
>>
File: poloraoidz.jpg (170 KB, 1308x682)
170 KB
170 KB JPG
>>103505963
im gonna marry that flux girl
>>103506127
love this


anyways, you guys are way too fast on \g\ here
i take a nap, come back, old thread is gone
byeeeee

>consider adding \NAPT\ @ \VP\ to your relevant boards+threads list
>>
>>103510608
>image limit reached
MY FUCKING POINT EXACTLY



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.