[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 1758088747542296.png (1.27 MB, 832x1216)
1.27 MB
1.27 MB PNG
Previous /sdg/ thread : >>107295961

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Early Preview UI
AniStudio: https://github.com/FizzleDorf/AniStudio

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
https://huggingface.co/QuantStack/Qwen-Image-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Distill-GGUF
https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF

>Flux.1 Krea
https://docs.comfy.org/tutorials/flux/flux1-krea-dev
https://huggingface.co/black-forest-labs/FLUX.1-Krea-dev
https://huggingface.co/QuantStack/FLUX.1-Krea-dev-GGUF

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
https://huggingface.co/QuantStack/Wan2.2-TI2V-5B-GGUF
https://huggingface.co/QuantStack/Wan2.2-T2V-A14B-GGUF
https://huggingface.co/QuantStack/Wan2.2-I2V-A14B-GGUF

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://github.com/maybleMyers/chromaforge
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Models, LoRAs & upscaling
https://civitai.com
https://tensor.art
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>
>mfw Resource news

11/24/2025

>cc12m-1mp_plus-realistic: Filtered CC12M dataset for 1mp+ realism
https://huggingface.co/datasets/opendiffusionai/cc12m-1mp_plus-realistic

>simpletuner v3.1.3 with Kandinsky5, ACE-Step music training, and a webUI
https://github.com/bghira/SimpleTuner/releases/tag/v3.1.3

>Hunyuan 1.5 step distilled loras
https://huggingface.co/Comfy-Org/HunyuanVideo_1.5_repackaged/tree/main

>MMT-ARD: Multimodal Multi-Teacher Adversarial Distillation for Robust Vision-Language Models
https://github.com/itsnotacie/MMT-ARD

>Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats
https://github.com/SooLab/AllPath

>PairHuman: A High-Fidelity Photographic Dataset for Customized Dual-Person Generation
https://github.com/annaoooo/PairHuman

11/22/2025

>FaceFusion ComfyUI
https://github.com/huygiatrng/Facefusion_comfyui

11/21/2025

>HunyuanVideo-1.5: A leading lightweight video generation model
https://huggingface.co/tencent/HunyuanVideo-1.5

>SAM 3D: 3Dfy Anything in Images
https://ai.meta.com/sam3d

>ComfyUI-SAM3DBody
https://github.com/PozzettiAndrea/ComfyUI-SAM3DBody

>NoPo-Avatar: Generalizable and Animatable Avatars from Sparse Inputs without Human Poses
https://wenj.github.io/NoPo-Avatar

>Dataset Distillation for Pre-Trained Self-Supervised Vision Models
https://linear-gradient-matching.github.io

>SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation
https://henghuiding.com/SceneDesigner

>ComfyUI Replace First & Last Frames
https://github.com/lovisdotio/ComfyUI-Replace-First-Frame-Last-Frame

>PartUV: Part-Based UV Unwrapping of 3D Meshes
https://www.zhaoningwang.com/PartUV

>Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers
https://github.com/OPPO-Mente-Lab/Qwen-Image-Pruning

>UniFit: Towards Universal Virtual Try-on with MLLM-Guided Semantic Alignment
https://github.com/zwplus/UniFit
>>
>mfw Research news

11/24/2025

>One-Step Diffusion Transformer for Controllable Real-World Image Super-Resolution
https://arxiv.org/abs/2511.17138

>Spanning Tree Autoregressive Visual Generation
https://arxiv.org/abs/2511.17089

>Diversity Has Always Been There in Your Visual Autoregressive Models
https://arxiv.org/abs/2511.17074

>Energy Scaling Laws for Diffusion Models: Quantifying Compute and Carbon Emissions in Image Generation
https://arxiv.org/abs/2511.17031

>Vision Language Models are Confused Tourists
https://arxiv.org/abs/2511.17004

>Neighbor GRPO: Contrastive ODE Policy Optimization Aligns Flow Models
https://arxiv.org/abs/2511.16955

>Loomis Painter: Reconstructing the Painting Process
https://arxiv.org/abs/2511.17344

>ATAC: Augmentation-Based Test-Time Adversarial Correction for CLIP
https://arxiv.org/abs/2511.17362

>DSeq-JEPA: Discriminative Sequential Joint-Embedding Predictive Architecture
https://arxiv.org/abs/2511.17354

>Where Culture Fades: Revealing the Cultural Gap in T2I Generation
https://arxiv.org/abs/2511.17282

>A Little More Like This: T2I Retrieval with Vision-Language Models Using Relevance Feedback
https://arxiv.org/abs/2511.17255

>PostCam: Camera-Controllable Novel-View Video Generation with Query-Shared Cross-Attention
https://cccqaq.github.io/PostCam.github.io

>SafeR-CLIP: Mitigating NSFW Content in Vision-Language Models While Preserving Pre-Trained Knowledge
https://arxiv.org/abs/2511.16743

>SAM 3: Segment Anything with Concepts
https://arxiv.org/abs/2511.16719

>Rethinking Diffusion Model-Based Video Super-Resolution: Leveraging Dense Guidance from Aligned Features
https://arxiv.org/abs/2511.16928

>UniModel: Visual-Only Framework for Unified Multimodal Understanding and Generation
https://arxiv.org/abs/2511.16917

>Q-REAL: Towards Realism and Plausibility Evaluation for AI-Generated Content
https://arxiv.org/abs/2511.16908

>Warm Diffusion: Recipe for Blur-Noise Mixture Diffusion Models
https://arxiv.org/abs/2511.16904
>>
File: deAR_cHD_00072_.png (2.04 MB, 1459x998)
2.04 MB
2.04 MB PNG
>>
>>
>>
>>
File: oldnew_00399_.png (2.59 MB, 1128x1432)
2.59 MB
2.59 MB PNG
>>107314383
What exactly a dieference between Krea and normal Flux?
>>
>>107313834
Not really but they tend to create so tunnels, Quokkas mainly live in bushy places
>>
>>107315015
less buttchin
>>
>>
>>107315015
>What exactly a dieference between Krea and normal Flux?

You can read about it here

https://bfl.ai/blog/flux-1-krea-dev
https://www.krea.ai/blog/flux-krea-open-source-release

Try both for yourself if you can and see if their claims match what you see in the images you generate. You might not see much difference depending on the subject matter.
>>
>>
>>107315015
Flux no bueno señor
https://suno.com/song/bc1afd4f-4cf8-4689-aa1a-49019b3fcb8c
>>
>>
>>107314872
Can't see prompt but damm man looks like the retrofuturism stuff I genned a while back I used keywords like 50s movie poster, pinup and epcot
>>
File: doge.jpg (359 KB, 640x640)
359 KB
359 KB JPG
>>107315125
>Big field. Many green.
>>
File: oldnew_00684_.png (1.65 MB, 800x1200)
1.65 MB
1.65 MB PNG
>>107315072
>Try both for yourself if you can and see if their claims match what you see in the images you generate. You might not see much difference depending on the subject matter.
It seems to be better. At the very least it's not worse. Thank you, anon.
>>
>>107315165
far out
>>
>>107315356
Kek it was a chatgpt experiment to make lyrics smarter then dumb them down.

"In this endless field of clover
How do we ascend
When the holders of the luck already own the four-leaves?
An endless war — and it starts again"

It was originally supposed to be about a field of clover in which all the 4 leaves have already been long since plundered, or those borne with a silver spoon in their mouths already hold all the cards and we can't do anything about it now.
>>
I have thousands of images in my output folder. Does anybody have sorting/viewing tips?
>>
>>
>>
>>107315615
Very nive
>>
>>107315500
Can you screenshot your folder so we know what we're dealing with
>>
>>107315500
Use a clip sorter
>>
>>
File: file.png (25 KB, 360x477)
25 KB
25 KB PNG
>>107315647
>>
PW cameo
>>
>>107315738
I meant the contents
>>
File: file.png (1.58 MB, 852x1971)
1.58 MB
1.58 MB PNG
>>107315859
mostly pictures itterations
>>
>>
File: deAR_cHD_00075_.png (1.77 MB, 1459x998)
1.77 MB
1.77 MB PNG
>>107315500
best strategy is to just organize from the start, using folders to group prompts and possibly attaching metadata into the file name, folder name, or into an adjacent txt file

if you're looking for an application, here's what I have in my archive. I can't speak to how well any of these would compliment your workflow, nor is this anywhere close to an exhaustive list

>Diffusion Toolkit
https://github.com/RupertAvery/DiffusionToolkit

>LeftRight - Fast Image Sorter
https://github.com/bewinxed/leftright

>SD-Categorizer-2000: Python script to organize all your images
https://github.com/LesPles/SD-Categorizer-2000/tree/main
>>
>>107315915
Wildcard trash. Please consider ending yourself?
>>
>>107315985
no u
>>
File: 00013-1406015025.jpg (1.66 MB, 2304x1792)
1.66 MB
1.66 MB JPG
>>
>>
File: deAR_cHD_00076_.png (1.68 MB, 1459x998)
1.68 MB
1.68 MB PNG
>>
File: 00014-1578091936.jpg (1.21 MB, 1792x2304)
1.21 MB
1.21 MB JPG
>>
>>107315500
large thumbnails, methamphetamine and self-loathing. lubricant helps as well.
>>
File: autumn river.webm (3.7 MB, 1920x960)
3.7 MB
3.7 MB WEBM
>>
i am not well
do not trust ((( doctors )))
>>
File: deAR_cHD_00081_.png (1.75 MB, 1459x998)
1.75 MB
1.75 MB PNG
>>107316896
hope things get better, anon
or at least you void the misery with some thanksgiving comfort foods
>>
File: 1764027726344.jpg (129 KB, 1152x768)
129 KB
129 KB JPG
>>
File: deAR_cHD_00082_.png (2.27 MB, 1459x998)
2.27 MB
2.27 MB PNG
>>
File: PW_146872_.png (2.26 MB, 1800x1136)
2.26 MB
2.26 MB PNG
Good evening, anons! I hope everyone is doing well :]
>>
File: deAR_cHD_00084_.png (1.82 MB, 1459x998)
1.82 MB
1.82 MB PNG
>>107317781
heyo, hows it going?
>>
File: PW_146900_.png (2.24 MB, 1280x1600)
2.24 MB
2.24 MB PNG
>>107317933
Pretty good! Just got home a little bit ago :]
How are you?
>>
File: oldnew_01012_.png (2.02 MB, 800x1200)
2.02 MB
2.02 MB PNG
>>107315975
>best strategy is to just organize from the start, using folders to group prompts and possibly attaching metadata into the file name, folder name, or into an adjacent txt file
I'm way too adhd for that
>>Diffusion Toolkit
>https://github.com/RupertAvery/DiffusionToolkit
Look perfect for what I want. Thank you, anon.
>>
File: PW_146929_.png (2.02 MB, 1024x1280)
2.02 MB
2.02 MB PNG
>>
File: deAR_cHD_00085_.png (1.52 MB, 1459x998)
1.52 MB
1.52 MB PNG
>>107317960
just enjoying turkey day eve eve eve
have to get some important stuff done tomorrow but then im max chill the rest of the week
do you have a favorite thanksgiving food (assuming you dont have to make it)?

>>107318011
happy to help
nice gens too
>>
File: PW_146915_.png (2.19 MB, 1280x1600)
2.19 MB
2.19 MB PNG
>>107318220
Hahaha!
Ohhhh nice well I hope that goes well!
Hmmmm it might sound dumb but I LOVE baked mac n cheese haha! It's the best!
I never have to cook for thanksgiving luckily
Though my mom wants me to show her how to make this pesto thing I make so that'll be about it probably
I might bake some treats to take over there too
>>
File: deAR_cHD_00087_.png (1.34 MB, 1459x998)
1.34 MB
1.34 MB PNG
>>107318437
>baked mac n cheese
fuck yeah. my family never did mac n cheese but I'd fuck with baked mnc any day
>I never have to cook for thanksgiving luckily
the one day of the year, lol. what treats are you thinking of making?
>>
File: PW_146972_.png (2.26 MB, 1280x1600)
2.26 MB
2.26 MB PNG
>>107318564
Right? It's SO good hahaha!! I always take some home
LOL ikr! Hmmm maybe brownies or something simple like that haha I haven't really given it much thought desu
They really like german chocolate cake, but I hate coconut haha!
What's your favorite?
>>
File: deAR_cHD_00093_.png (1.62 MB, 1459x998)
1.62 MB
1.62 MB PNG
>>107318671
>I hate coconut
coconut ruins everything it touches, except pina coladas and samoas
>What's your favorite?
mashed potatoes is king. oven baked turkey is pretty up there too. cant really be turkey day without turkey
for treats, my family used to do a variety of pies every year. some would say too much pie. I wouldn't, but some would
>>
File: PW_146984_.png (2.39 MB, 1280x1600)
2.39 MB
2.39 MB PNG
>>107318727
I agree with that first part, but it ruins literally everything LOL
Ohhhhhhhhhhh Mashed potatoes are a very close second for me!!
I'm not a huge turkey fan, but I like to mix it up in my mac and tatos! I really like honey glazed ham!
I'm really particular about pie haha I like apple and chocolate
Oh peach cobbler is good too!
>>
File: PW_147035_.png (2.23 MB, 1280x1600)
2.23 MB
2.23 MB PNG
Remix of Mouseanon's song Soul Star!
https://suno.com/s/SIAo07wIJwImB3CH
>>
i miss schizo anon
>>
File: autumn river 2.webm (3.75 MB, 1920x960)
3.75 MB
3.75 MB WEBM
>>107319230
The remix sounds great! How's your kitty doing?
>>
>>107316272
looks like Ai Shinozaki
>>
File: OJ_SEK_V_CLAM_4.jpg (973 KB, 3584x3956)
973 KB
973 KB JPG
>>
File: hifi.png (2.34 MB, 1536x1024)
2.34 MB
2.34 MB PNG
>>
>>107319949
The souuuuuund
Of silenceeee
>>
File: hifi.png (2.01 MB, 1536x1024)
2.01 MB
2.01 MB PNG
>>
File: PW_147031_.png (2.14 MB, 1280x1600)
2.14 MB
2.14 MB PNG
>>107319856
Heyy! Great to see you again! :D
She's doing great!! She slept with me last night haha
I was surprised at how fast she went from hiding to being super friendly :]
How are you doing?
Great anim!!
>>107319936
Heyy!
>>
File: OF_SEK_PROTEIN_04.jpg (1.01 MB, 3584x4608)
1.01 MB
1.01 MB JPG
>>107320133
Hey how are you
>>
File: PW_147051_.png (2.56 MB, 1280x1600)
2.56 MB
2.56 MB PNG
>>107320151
I'm doing great! Just relaxing haha
How are you?
>>
File: OF_SEK_PROTEIN_12.jpg (1011 KB, 3584x4608)
1011 KB
1011 KB JPG
>>107320167
I'm good, enjoying the cool weather. How's work?
>>
File: autumn river 3.webm (3.74 MB, 1920x960)
3.74 MB
3.74 MB WEBM
>>107320133
I'm glad to hear that she warmed up to you so quickly. It's really kind of you to have rescued her.
It's great to see you again too! I'm just fine, thanks, glad you like the animation.
>>
File: PW_147059_.png (2.44 MB, 1280x1600)
2.44 MB
2.44 MB PNG
>>107320172
I'm glad to hear it! :D
Yeah it's been nice and gloomy out hahaha! I love it!
Work has been good! Staying busy haha
Looking forward to thanksgiving?
>>107320173
Me too! The first night she was hiding under my dishwasher and meowing LOL
I was afraid to start it up haha
Aw thanks! She's super tiny so there's no way I could leave her outside
Especially since it's been really cold and rainy lately
It's always great to see you! I'm so glad you're doing well :]
Yeah! Your style is always so nice!
>>
>>107320217
Nice and gloomy, are you being sarcastic or are you one of those people who enjoys those overcast slightly down days? I personally am one of those people, but I'm just wondering. I am indeed looking forward to Thanksgiving. I love fresh cooked turkey. I've been asked to make my famous whipped sweet potatoes, so I'll be bringing that. Do you have any fancy or special dishes that you make for Thanksgiving?
>>
File: 0v_3.jpg (312 KB, 1808x2320)
312 KB
312 KB JPG
A man with two gold coins will want three gold coins, and a man with three gold coins will want four gold coins. But a man who lacks gold will find satisfaction in the world around him.
>>
File: PW_147087_.png (2.54 MB, 1280x1600)
2.54 MB
2.54 MB PNG
>>107320233
LOL I really do love it! Overcast weather is the best!
I'm not a huge fan of summer, too bright and hot hahaha
I love thanksgiving food! Ohhh that sounds good! Honestly I don't do much cooking if any on thanksgiving
My parents always make hella food! I might bring some sweets tho :]
>>107320246
Interesting gen and quote!
>>
File: PW_147155_.png (1.97 MB, 1280x1600)
1.97 MB
1.97 MB PNG
Good night, anons! I hope to see you all tomorrow! :]
>>
File: autumn river 4.webm (3.64 MB, 1920x960)
3.64 MB
3.64 MB WEBM
>>107320217
Good spotting! underneath a car or a running dishwasher is no place for a small kitten.
I suppose that's what I like to gen, usually. I like your style, too, though I wouldn't want you to feel constrained by me, I like your others as well.
>>107320511
Goodnight
>>
is local feeling a little mogged atm? short of hardcore stuff
>>
File: hifi.png (2.04 MB, 1536x1024)
2.04 MB
2.04 MB PNG
>>
>gm
>>
File: autumn river 5.webm (3.59 MB, 1920x960)
3.59 MB
3.59 MB WEBM
>>
>>
>>
>>
>>
>>
File: 00000-341334437.jpg (806 KB, 2048x2560)
806 KB
806 KB JPG
>>
>>
>>
>>107314383
>anistudio
>claim its c/cpp
>requires the uses of python
fake and gay
>>
>>107322708
who the fuck added it to OP
>>
>>107322708
Whats worse is its utilizing stable diffusion.cpp, whose main goal was to get the fuck away from python.
>>
>>
>>107322719
Why ask questions you know the answer to?
>>
>>107322775
but I dont know, I didnt even notice it
>>
>>107322792
If you didn't even notice it, how are you mentioning it? Stop being disingenuous.
>>
>influx of tourists
Have american schools already ended for the thanksgiving celebration? Asking as a non burger.
>>
>>107322795
>everyone I reply to is one person
who's being disingenuous now?
>>
>>107322811
>Things nobody said
>who's being disingenuous
Oof
>>
>>107322833
I accept your concession
>>
>>107322838
Whatever ends this cringe conversation, bro.
>>
File: deCA_cHD_00003_.png (1.33 MB, 1613x922)
1.33 MB
1.33 MB PNG
>>107322719
>who the fuck added it to OP
you don't even post here, so why do you care?

>>107322808
its not zoomies out of school, its 40 year olds who don't get invited to their family thanksgiving anymore
>>
>>107322719
Ani herself was exaczly once here in the last two years
She made debo add it to OP and since then moved to adt/ldg
Since debo is a good dog he of course did as she wished
>>
>>107314383
FLUX.2 [dev], 32B https://huggingface.co/black-forest-labs/FLUX.2-dev
>>
>>107323052
64GB at full FP16, ~32GB at Q8, ~16GB at Q4... damn, that's pushing a lot of gpu poor people out of the picture.
>>
File: deCA_cHD_00004_.png (1.02 MB, 1613x922)
1.02 MB
1.02 MB PNG
>>107323052
itshappening.gif

>>107323065
kinda surprised they released it instead of leaving it as an API model. I guess the 24gb havers get to have some fun at least
>>
>>107323129
>60gb safetensors
>50gb text encoder
ooof
yikes even
>>
Morning anons
>>
File: deCA_cHD_00005_.png (1.02 MB, 1613x922)
1.02 MB
1.02 MB PNG
>>107323178
gm albino quokka
>>
File: file.png (13 KB, 449x176)
13 KB
13 KB PNG
Comfy team already working on getting Flux.2 support going...
>>
File: deCA_cHD_00006_.png (972 KB, 1613x922)
972 KB
972 KB PNG
>>107323308
>doesn't have day0 support like they did with flux1
interesting. I guess BFL isn't working directly with the comfy team anymore like they had in the past
>>
File: file.png (65 KB, 767x350)
65 KB
65 KB PNG
Wtf it's on OpenRouter
>>
>>107323330

You can bet that the api support for flux2 will be done first before local. Either way I'll be waiting for the https://docs.comfy.org/ update before I add it to the op text.

>>107323129
>kinda surprised they released it instead of leaving it as an API model.

Was there rumours that this might happen leading up to this release? Or just speculation based on how large this model is?
>>
Only month left for Christmas!
>>
File: file.png (23 KB, 739x156)
23 KB
23 KB PNG
>>107323424
even more expensive than Banana Pro, kek
>>
>>107323424
>Wtf it's on OpenRouter

I don't use OR for image generation so that's the first time I've seen per TOKEN charging for images rather than per image cost. Looks like prompting a basic "1girl" is going to be a lot cheaper than "1girl, blonde hair, blue eyes, cleavage, from behind, looking back"
>>
File: deCA_cHD_00009_.png (570 KB, 1613x922)
570 KB
570 KB PNG
>>107323429
>Was there rumours that this might happen leading up to this release?
not rumors, but people were speculating that BFL was gonna give local the shaft
>Or just speculation based on how large this model is?
wasn't even based on model size cuz BFL said literally nothing about flux2. it was just the assumption that all companies trend towards fucking over local over time

>>107323465
holy fugg
>>
>>107323468
Cost is per million tokens. Difference between 1 word and 1 paragraph of text input is negligible.
Often images *are* counted by image but OR translates it to a number to represent the cost.
>>
Is XL worth finetuning for anime artstyles? Better than 1.5 or no?
I tried using "IllustriousXL" as a base, but when I run the model (without loras) the output is just pure noise.
>>
File: IMG_0455.png (1.01 MB, 1152x768)
1.01 MB
1.01 MB PNG
>>107323574
XL is a bit better for realism actually, it's weird that Illustrious isn't working for you, are you using a different vae? Maybe you have clip skip set to something else.
>>
File: file.png (138 KB, 898x572)
138 KB
138 KB PNG
>>107323465
Ah, $0.06 per megapixel both in and out. Normally input is counted less.
That's why it looked so expensive (I mean it is).
>>
downloading flux2 now
>>
>>107323493
>it was just the assumption that all companies trend towards fucking over local over time

I guess they're making enough money via api services that they don't mind the mild dent in revenue from other companies hosting the model for themselves and cutting BFL out of the picture.

>>107323688

Banana Pro is 6, 9 and 12 cents for 1024, 2048 and 4096 respectively so that means Flux2 is still more expensive.
>>
File: IMG_0459.png (662 KB, 1152x768)
662 KB
662 KB PNG
>>107323723
Careful anon, that thing is heavier than Elden Ring
>>
File: ComfyUI_00003_.png (2.35 MB, 1152x1536)
2.35 MB
2.35 MB PNG
>>107323816
it's constantly loading and offloading lol
not too slow tho
>>
File: ComfyUI_00004_.png (2.31 MB, 1152x1536)
2.31 MB
2.31 MB PNG
image style transfer
>>
File: ComfyUI_00005_.png (2.24 MB, 1152x1536)
2.24 MB
2.24 MB PNG
>>
yep
>>
Comfy is back? Last time I was here they said it was dead. I've been using something ridiculous ever since.

I have been having issues with inpaint though, maybe I'm out of practice and forgetting how this is done. marking the areas for change no matter the creativity strength is resulting in the whole image changing instead of just the hairclips like I wanted. I'm not at home atm or I'd show what I mean. any suggestions in the mean time?
>>
File: deCA_cHD_00010_.png (1.06 MB, 1613x922)
1.06 MB
1.06 MB PNG
>>107323940
lmao

>>107323973
comfy has never been dead or close to dead
>>
Comfy workflow and FP8 safetensor at 35.5GB!! up, no quants from QuantStack or City (rip) yet...

https://comfyanonymous.github.io/ComfyUI_examples/flux2
>>
File: deCA_cHD_00012_.png (1.12 MB, 1613x922)
1.12 MB
1.12 MB PNG
>>107323984
>or City (rip)
city is rip? qrd?
>>
File: ComfyUI_00008_.png (2.79 MB, 1152x1536)
2.79 MB
2.79 MB PNG
i guess it knows some celebs
>>
File: HLKY LIVES.jpg (92 KB, 1718x432)
92 KB
92 KB JPG
GUYS HLKY ISNT DEAD
>>
File: ComfyUI_00009_.png (2.63 MB, 1152x1536)
2.63 MB
2.63 MB PNG
>>
File: city-hf.png (14 KB, 408x120)
14 KB
14 KB PNG
>>107323990
>city is rip? qrd?

I was just going from his HF activity (picrel) but looking at his github https://github.com/city96 he did something a few weeks back.
>>
File: IMG_0460.png (716 KB, 1024x1024)
716 KB
716 KB PNG
>>107323940
LoL wut?
>>
flux2 censored btw
>>>/r/20102658
>>
>>107324110
>has tay but not her nips
so it's half censored, cos otherwise tay wouldn't be in there AT ALL.
>>
File: ComfyUI_00010_.png (2.26 MB, 1152x1536)
2.26 MB
2.26 MB PNG
>>107324120
yep, they fucked with it after training is my guess
>>
File: deCA_cHD_00013_.png (1 MB, 1613x922)
1 MB
1 MB PNG
>>107323995
>>107324011
>>107324146
weird how it makes close-but-not-really taylor swift
>>
oops meant for the other thread lol
>>
>>107324172
saved
>>
>>107324173
yeah, it knows celebs but changes them a bit
>>
File: IMG_0461.png (800 KB, 768x1152)
800 KB
800 KB PNG
>>107323995
>>107324011
Cool, maybe most celebs who are "Globally famous" try someone else
>>107324110
>>107324120
>>107324146
I guess there's a version of FLUX.2 that is the second coming of BigLust hidden on a hard drive at BFL.
>>107324172
Kek
>>
File: ComfyUI_00019_.png (2.6 MB, 1152x1536)
2.6 MB
2.6 MB PNG
flux2 is fun but eh
too much resources needed to run and it's not "there"
>>107324217
>I guess there's a version of FLUX.2 that is the second coming of BigLust hidden on a hard drive at BFL.
yeah they keep the good stuff for themselves lol
>try someone else
give me names, i dont keep up with celebs
>>107324205
lol
>>
chroma2 when
>>
>>107324236
>too much resources needed to run
what gpu are you on and how long does it take? I know it's going to be in the minutes range but still curious.
>>
>>107324264
not mine but a 5090 is this
>>
File: ComfyUI_00020_.png (2.77 MB, 1152x1536)
2.77 MB
2.77 MB PNG
>>
>>107324289
>seconds per iteration on a 5090

yeah he needs to wait for a lower quant like a Q6 and it will finish in around 10 seconds instead.
>>
File: IMG_0462.png (1.94 MB, 1096x1488)
1.94 MB
1.94 MB PNG
>>107324236
>names
>Female celebs
Sabrina Carpenter
Jenna Ortega
Dafne Keen
Emma Watson (?)
>Male Celebs
Timothee Chalamet
Tom Holland
Tom Cruise
Cristiano Ronaldo (?)
>>
does comfy ui still use cuda 12 like a caveman?
>>
File: ComfyUI_00026_.png (1.93 MB, 1152x1536)
1.93 MB
1.93 MB PNG
>>107324349
>Emma Watson
>>
File: ComfyUI_00027_.png (2.17 MB, 1152x1536)
2.17 MB
2.17 MB PNG
>>107324349
>Sabrina Carpenter
>>
File: ComfyUI_00032_.png (1.46 MB, 960x1280)
1.46 MB
1.46 MB PNG
>>107324349
>Dafne Keen
didnt know that one
>>
File: ComfyUI_00033_.png (1.44 MB, 960x1280)
1.44 MB
1.44 MB PNG
>>107324922
that is supposed to be "jenna ortega" btw
here's tom cruise
>>
File: ComfyUI_00034_.png (1.42 MB, 960x1280)
1.42 MB
1.42 MB PNG
>>107324349
>Timothee Chalamet
>>
File: ComfyUI_00035_.png (1.37 MB, 960x1280)
1.37 MB
1.37 MB PNG
>>107324349
so it probably knows really well known celebs like taylor swift/tom cruise (and ronaldo), or more current ones like tim
>Cristiano Ronaldo
>>
File: deCA_cHD_00014_.png (1.07 MB, 1613x922)
1.07 MB
1.07 MB PNG
>>
File: ComfyUI_00041_.png (2.48 MB, 1536x1152)
2.48 MB
2.48 MB PNG
>>
File: 55920681938466894.jpg (411 KB, 1772x2274)
411 KB
411 KB JPG
>>
File: deCA_cHD_00015_.png (895 KB, 1613x922)
895 KB
895 KB PNG
>>107325660
swap reyleia for sexy chewbacca
>>
File: ComfyUI_00050_.png (1.56 MB, 832x1216)
1.56 MB
1.56 MB PNG
>>107325755
it refused to at first (i think because the prompt said "slave leia outfit")
>>
>>107325755
Swap your model for one that doesn't create an abortion out of cartoon hands. It's 2026 almost. We've had this handled for ages.
>>
File: deCA_cHD_00017_.png (958 KB, 1613x922)
958 KB
958 KB PNG
>>107325890
lol, not as hairy as I would have expected
>>
File: ComfyUI_00052_.jpg (322 KB, 832x1216)
322 KB
322 KB JPG
>>107325937
>>
File: deCA_cHD_00018_.png (851 KB, 1613x922)
851 KB
851 KB PNG
hmm doesn't appear to be any flux2 hf spaces

>>107325947
now THAT is a sexy slave chewbacca
>>
File: 55920683938466894.jpg (422 KB, 1772x2274)
422 KB
422 KB JPG
>>
File: 55920684938466894.jpg (442 KB, 1756x2258)
442 KB
442 KB JPG
>>
File: 3treasure.jpg (310 KB, 1152x896)
310 KB
310 KB JPG
>>107325937
What do you call this style? Some sort of draft/cel sheet/character diagram?
>>
File: deCA_cHD_00020_.png (958 KB, 1613x922)
958 KB
958 KB PNG
>>107326389
its mostly (concept art:1.3) doing a lot of the work but I also have "rough draft character sheet, hand drawn" to roughen it up a bit. "hand-written notes and annotations" emphasizes the style, along with a slew of negatives that may or may not do anything
>>
File: 00061-564686606.jpg (1.61 MB, 2048x2560)
1.61 MB
1.61 MB JPG
>>
File: deCA_cHD_00021_.png (1.06 MB, 1613x922)
1.06 MB
1.06 MB PNG
>>107326741
he looks like he plays jazz flute
>>
>>107324928
>>107324938
>>107324946
Actually surprised how well the men turned out compared to the girls.
Maybe the censorship is more centered in women than men
>>
File: deCA_cHD_00024_.png (1.08 MB, 1613x922)
1.08 MB
1.08 MB PNG
>>107326741
https://suno.com/s/1WB2m2b0Vs3UTLxc
>>
File: 00062-3242746558.jpg (1.29 MB, 2048x2560)
1.29 MB
1.29 MB JPG
>>
File: Flux2_00029_.png (2.11 MB, 864x1536)
2.11 MB
2.11 MB PNG
behold
>>
Duplouokka
>>107327360
That looks really cool, thanks :)
>>
File: Flux2_00030_.png (2.05 MB, 864x1536)
2.05 MB
2.05 MB PNG
now with bun-chan
>>
File: Flux2_00032_.png (2.24 MB, 864x1536)
2.24 MB
2.24 MB PNG
>>
File: Flux2_00033_.png (2.32 MB, 864x1536)
2.32 MB
2.32 MB PNG
>>
File: deCA_cHD_00025_.png (876 KB, 1613x922)
876 KB
876 KB PNG
>>107327360
lol, really captures the essence of expression monke

>>107327393
bunchan included in the group photo! she turned out very asian for some reason, haha
how long do these gens take you?
>>
File: Flux2_00036_.png (2.53 MB, 864x1536)
2.53 MB
2.53 MB PNG
>>107327460
on a remote gpu (5090)
>100%|| 20/20 [00:27<00:00, 1.36s/it]

a 3090 runs it in like 3mins
>>
slopfest
even the linked threads from 3 years ago or something are better by far
>>
>>107327691
your mom is a slopest lol
>>
Is nVidia better than AMD for genning with Stable Diffusion? Is there a good GPU in the $600-750 range that will work well? Does CPU matter much?
>>
File: deCA_cHD_00026_.png (1.3 MB, 1613x922)
1.3 MB
1.3 MB PNG
>>107327796
>Is nVidia better than AMD for genning with Stable Diffusion?
yes, but the margin is shrinking
> Is there a good GPU in the $600-750 range that will work well?
5070ti is the best mid-budget option. if you're a gambler you can wait to see if the SUPER mid-cycles come out next year but rumors are they're not happening due to supply issues
>>
>>107327796
nvidia is better than anything for ai stuff lol
get a 5070 or whatever higher
or a used 3090/4090 if you find one
cpu+ram do help, especially if you get into llm
comfy also does memory offloading so more ram is best too
good luck with today's prices tho
>>
>>107327824
>>107327821
thanks for the info, anons
i've already resigned myself to the fact that i'll have to pay dearly; that seems to have been the name of the game ever since people started to want to mine crypto
i thought when a lot of the crypto mining moved off GPUs to ASICs, prices would come down, but along comes AI
such is life
>>
File: deCA_cHD_00027_.png (1.5 MB, 1613x922)
1.5 MB
1.5 MB PNG
>>107327886
>but along comes AI
google recently made a deal with meta on TPU supply, which has seemingly got nvidia nervous. the reason TPUs make sense for meta is because all they do is tensor processing, and they do it efficiently. this might be the pressure than nvidia needs to flesh out more specialized products, instead of thinking they can just sell blackwells to everyone and everything
none of this helps anyone in the market now, looking for an AI-capable card. though I hope 2-3 years down the line, it'll cause a dramatic shift in options and pricing for the home consumer market
>>
File: deCA_cHD_00028_.png (1.03 MB, 1613x922)
1.03 MB
1.03 MB PNG
>>
back to my beloved chroma
>>
File: deCA_cHD_00030_.png (1020 KB, 1613x922)
1020 KB
1020 KB PNG
>>107328254
any final thoughts?
>>
File: ComfyUI_temp_nkdmp_00007_.png (1.65 MB, 1008x1344)
1.65 MB
1.65 MB PNG
>>107328289
it's fun (although 50+GB space to play with it is excessive considering how much room other shit takes (llms, loras, venvs, etc)
it follows prompts really nicely
the image edit is hit or miss - if you say "do this style on that image" it mostly works, but "place this person from image1 in the place of that person in image2" is random, since there's no real way to tell it which image is "first" or "second" or however you want to order them
as you saw, it knows celebs/characters about 80-90% for the most part (except trump, every single model knows trump lol)
since it's brand new not a lot of custom nodes know what to do with it which kinda sucks
training loras will likely be super VRAM consooming so no go there, and "take chromagirl's face and place it in this prompt" didnt work for me, although i dont really know what would work
so yeah, fun to play with, and do interesting things like the manga/video game covers stuff since it follows prompt really well, but i leave that to others, since my only love is chromagirl (formerly known as fluxgirl) and chroma handles 90% of what i throw at it well enough.
if flux2 gets more reasonable/easier to run/etc i may add it to a workflow, but for now i'll just keep it like it keep the biglust/pony models, for very specific things lol
best edit model so far, although it's still censored and doesnt even bothr with the "remove clothes" prompt for those who want to know
>>
File: deCA_cHD_00031_.png (1.3 MB, 1613x922)
1.3 MB
1.3 MB PNG
>>107328348
I appreciate the write up and insights
>my only love is chromagirl
based
>>
File: ComfyUI_temp_jplcq_00004_.jpg (1.26 MB, 2160x2880)
1.26 MB
1.26 MB JPG
>>107328348
also it does >1024 resolutions fairly well and with not a lot of extra cost in time, without losing its mind or making doubles
also flux2 (and bfl for that matter) have too many evangelists out there that refuse to see any issues so the noise will get worse for a while
https://x.com/IanSharar/status/1993469586407129182
>>
File: ComfyUI_temp_jplcq_00005_.jpg (1.26 MB, 2160x2880)
1.26 MB
1.26 MB JPG
>>
>>107328369
in middle right if the shirt were to be tucked in it would stick out of the skirt
>>
File: deCA_cHD_00033_.png (1.49 MB, 1613x922)
1.49 MB
1.49 MB PNG
>>107328497
she tucks it into her underpants, ofc
>>
>>
File: panty tucked shirt.png (1.17 MB, 704x1488)
1.17 MB
1.17 MB PNG
>>107328528
yeah sure
>>
File: deCA_cHD_00034_.png (915 KB, 1613x922)
915 KB
915 KB PNG
>>107328621
lol
busted..
but she could easily roll up the shirt into the undies if she wanted
>>
>>
Next Thread

>>107328651
>>107328651
>>107328651

Flux.2 spaces are available...

https://huggingface.co/spaces?search=Flux%202

City has beaten Quantstack...

https://huggingface.co/city96/FLUX.2-dev-gguf
>>
File: deCA_cHD_00035_.png (1.02 MB, 1613x922)
1.02 MB
1.02 MB PNG
>>107328657
im not gonna get used to this 140 pop. I was thinking i had +5 to go
>>
File: to tuck or not to tuck.png (1.14 MB, 704x1488)
1.14 MB
1.14 MB PNG
>>107328621
>>
fillin
>>
>>
File: deCA_cHD_00037_.png (656 KB, 1613x922)
656 KB
656 KB PNG
>>
File: desd35_00017_.png (1.62 MB, 1152x896)
1.62 MB
1.62 MB PNG
>>
File: dedk_00093_.png (3.14 MB, 1920x1280)
3.14 MB
3.14 MB PNG
>>
>>
File: fSDG_News_000154_.jpg (302 KB, 896x512)
302 KB
302 KB JPG
>>
File: 1754468592276808.png (36 KB, 128x128)
36 KB
36 KB PNG
>>107328698

This was earlier again than the recent 142 to 143ish to get the Flux.2 change in. Getting closer to 140 will make those times when it's nearing midnight in your timezone less of a sweat if others further east have already clocked out.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.