/g/ - Technology




File: 1722213214673195.jpg (262 KB, 1536x1536)
262 KB
262 KB JPG
Previous /sdg/ thread : >>101664139

>Beginner UI local install
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SD.Next: https://github.com/vladmandic/automatic
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>SD3 info & download
https://rentry.org/sdg-link#sd3

>Try online without registration
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://aitracker.art
https://openmodeldb.info

>Animation
https://rentry.org/AnimAnon
https://rentry.org/AnimAnon-AnimDiff
https://rentry.org/AnimAnon-Deforum

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Discord
6wUwtcJsr2

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: FDG_News_00002_.jpg (116 KB, 1216x832)
116 KB
116 KB JPG
>mfw Resource news

08/01/2024

>Stable Fast 3D: Rapid 3D Asset Generation From Single Images
https://stability.ai/news/introducing-stable-fast-3d

>Announcing Black Forest Labs
https://blackforestlabs.ai/announcing-black-forest-labs

>Flux: The Next Leap in Text-to-Image Models
https://blog.fal.ai/flux-the-largest-open-sourced-text2img-model-now-available-on-fal


>Kolors ipadapter FaceID Plus
https://github.com/Kwai-Kolors/Kolors/tree/master/ipadapter_FaceID

>The EU’s AI Act is now in force
https://techcrunch.com/2024/08/01/the-eus-ai-act-is-now-in-force

>Video game performers picket over AI protections
https://apnews.com/article/sagaftra-strike-video-games-ai

>Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs
https://lalbj.github.io/projects/PAI

>Detecting, Explaining, and Mitigating Memorization in Diffusion Models
https://github.com/YuxinWenRick/diffusion_memorization

>Forgedit: Text Guided Image Editing via Learning and Forgetting
https://github.com/witcherofresearch/Forgedit/

>ControlMLLM: Training-Free Visual Prompt Learning for Multimodal LLMs
https://github.com/mrwu-mac/ControlMLLM

>Accelerating Image Super-Resolution Networks with Pixel-Level Classification
https://github.com/3587jjh/PCSR

>ComfyStereo: port of the stereoscopic script used in stable-diffusion-webui-depthmap-script
https://github.com/Dobidop/ComfyStereo

>MeGA: Hybrid Mesh-Gaussian Head Avatar for High-Fidelity Rendering and Head Editing
https://github.com/conallwang/MeGA

07/31/2024

>Bubble Prompter for Stable Diffusion WebUI
https://github.com/captainzero93/sd-webui-bubble-prompter

>Waifu Diffusion V public tests
https://huggingface.co/waifu-diffusion/wdv-tests

>ComfyUI_frontend v1.2.7
https://github.com/Comfy-Org/ComfyUI_frontend/releases/tag/v1.2.7
>>
>mfw Research news

08/01/2024

>Tora: Trajectory-oriented Diffusion Transformer for Video Generation
https://ali-videoai.github.io/tora_video/

>Dynamic Object Queries for Transformer-based Incremental Object Detection
https://arxiv.org/abs/2407.21687

>Expressive Whole-Body 3D Gaussian Avatar
https://mks0601.github.io/ExAvatar/

>MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
https://arxiv.org/abs/2407.21654

>Evaluating SAM2's Role in Camouflaged Object Detection: From SAM to SAM2
https://arxiv.org/abs/2407.21596

>A Simple Low-bit Quantization Framework for Video Snapshot Compressive Imaging
https://arxiv.org/abs/2407.21517

>Fine-gained Zero-shot Video Sampling
https://densechen.github.io/zss

>Generalized Tampered Scene Text Detection in the era of Generative AI
https://arxiv.org/abs/2407.21422

>Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model
https://arxiv.org/abs/2407.21408

>SmileyNet -- Towards the Prediction of the Lottery by Reading Tea Leaves with AI
https://arxiv.org/abs/2407.21385

>AI Safety in Practice: Enhancing Adversarial Robustness in Multimodal Image Captioning
https://arxiv.org/abs/2407.21174

>Optical Computing for Deep Neural Network Acceleration: Foundations, Recent Developments, and Emerging Directions
https://arxiv.org/abs/2407.21184

>Embedding Space Selection for Detecting Memorization and Fingerprinting in Generative Models
https://arxiv.org/abs/2407.21159

>Adding Multi-modal Controls to Whole-body Human Motion Generation
https://yxbian23.github.io/ControlMM/

>Direct Unlearning Optimization for Robust and Safe Text-to-Image Models
https://arxiv.org/abs/2407.21035

>Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion
https://arxiv.org/abs/2407.21032

>SuperVINS: A visual-inertial SLAM framework integrated deep learning features
https://arxiv.org/abs/2407.21348
>>
File: 00017-1799380269.jpg (317 KB, 1552x1200)
317 KB
317 KB JPG
First in thread. What's up chaps.
>>
File: file.jpg (377 KB, 768x1280)
377 KB
377 KB JPG
>>101670503
>Would berry my dick so far in her ass
>>
>>101670535
>Error: Our system thinks your post is spam. Please reformat and try again.
for some reason, I can't post this in the news because 4chan is flagging it as spam. lets see if it can be a standalone post

>ComfyUI: Basic Flux Schnell and Dev model implementation
github.com/comfyanonymous/ComfyUI/ [COMMIT ID GOES HERE BUT IS SPAM APPARENTLY]
>>
anyone tried flux yet? Should I care?

I have 24gb vram
>>
File: IMG_3364.jpg (75 KB, 984x984)
75 KB
75 KB JPG
>>
File: file.jpg (231 KB, 768x1280)
231 KB
231 KB JPG
>screenshot from a brazzers.com production
>>
>>101670570
if you have a 40GByte VRAM gpu you can go download a new 12b parameter model
>https://huggingface.co/black-forest-labs/FLUX.1-dev
>https://blackforestlabs.ai/announcing-black-forest-labs/
and run it on comfy

if not.. same old same old.. SAI still has their head in their ass
>>
FLUX-dev
https://replicate.com/black-forest-labs/flux-dev

FLUX-schnell (distilled)
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
>>
>>101670606
looks like the examples are from saas and local is resource heavy and not the same >>101670081 >>101670180
>>
>>101670606
no.. the model alone is 24GB
>>
File: 4.png (129 KB, 625x640)
129 KB
129 KB PNG
>>
File: 00107-3409093694.jpg (290 KB, 1552x1200)
290 KB
290 KB JPG
What's the deal with SD3? Last I checked in here, people were saying
>Garbage output
>Trash license
Still the case? Anyone using it?
>>
File: IMG_3365.png (1.18 MB, 1344x768)
1.18 MB
1.18 MB PNG
>>
>>101670582
Links which have too many slashes elicit automatic spam protection.
>>
>>101670669
2 more weeks for the fixed version
>>
>>101670668
What even is this
>>
>>101670669
the beta model is out but the 3.1 is still slated for release
>>
>>101670606
it's quite good >>101670319
>>
>>101670582
You can use a 6 characters commit id in the link instead.
https://github.com/comfyanonymous/ComfyUI/commit/1589b5
>>
>>101670683
Why even bother posting
>>
File: fgdfgdfdgfgd.jpg (419 KB, 1280x768)
419 KB
419 KB JPG
>>
>>101670616
Can it do celebrities?
>>
I miss PW, he is a tarento.
>>
File: de_fl_00005_..jpg (82 KB, 1216x832)
82 KB
82 KB JPG
>>101670701
great, thanks! I'll use that for the next news post
>>
>samurai bun
kek
>>
>>101670775
In Japan, this would mean the Samurai was in debt for someone. Bartender would cut off his bun and sell it back to him.
>>
File: ComfyUI_Flux_0001.jpg (53 KB, 1280x720)
53 KB
53 KB JPG
8 steps flux dev
206 seconds
>>
>>101670807
ouch.. how much vram and ram did it gobble?
>>
Can anyone please put the flux model in a torrent?
>>
File: 1722533531885_image.jpg (91 KB, 964x1446)
91 KB
91 KB JPG
>>101670723
Audrey Hepburn
8 steps
>>
This model is pretty good

https://files.catbox.moe/79tj08.png
>>
>>101670830
just make a huggingface account, the dev version is free
>>
>>101670807
stop the shilling, so far no "flux" gen has been better than any sd 1.5 or xl gen
>>
https://files.catbox.moe/gej765.jpg
>>
>>101670941
best ass ever. more

I want a gf who looks like that so bad.
>>
File: de_fl_00008_..jpg (89 KB, 1216x832)
89 KB
89 KB JPG
>>101670918
if you're not a sai employee, idk why you'd care what models people use
>>
File: 1722534562658_image.jpg (67 KB, 984x984)
67 KB
67 KB JPG
>>101670864
wtf those nipples are true mosquito bites lmao
>>
>>101670736
>he
>>
>>101670918
weirdo
>>
File: sample.jpg (322 KB, 1440x1440)
322 KB
322 KB JPG
https://replicate.com/black-forest-labs/flux-dev
>Great image quality
>Ok prompt understanding
>Can do NSFW
>>101670941
>>101670424
>>101670391
>Nice anatomy
>Apache 2.0 Licence
This is a great day, fuck you SAI
>>
>>101670642
>no.. the model alone is 24GB
looks like it can be run on a 24gb vram card >>101670797
>>
File: de_fl_00011_.jpg (95 KB, 1216x832)
95 KB
95 KB JPG
>>101671037
>too big to run locally unless you're GPU rich
forgot that bit
>>
>>101671114
you can't make a great model if it's not big enough, you also forgot that bit
>>
>>101671143
now local can make meme images and clipart like dalle!!111!!
>>
>>101671037
>Can't generate a guy doing a handstand
There is still room for improvement. It also can't generate broken objects, like a broken sink or window. I also tried the "woman stepping on a cake" test; the cake looks as hard as rock. When are we going to have a physics-aware diffusion model?
>>
File: 1722535171870.jpg (163 KB, 512x512)
163 KB
163 KB JPG
The Schnell demo has Fallen!
>>
File: de_fl_00012_.jpg (98 KB, 1216x832)
98 KB
98 KB JPG
>>101671143
I consider "can I run it" a component of whether a model is 'great' or not
>>
>>101671180
retarded faggot
>>
File: insane.jpg (389 KB, 3840x1751)
389 KB
389 KB JPG
>>101671158
it has great anatomy, great image quality, insane prompt understanding, the fuck do you want more?
>>
>>101671180
but it's possible if you quantize to 8bit, and DiT models are known to be really resilient to quantization because they're transformer models
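A rough back-of-the-envelope check of the numbers in this exchange, assuming only what the thread states (a 12B parameter model) plus the standard sizes of 2 bytes per weight at 16-bit and 1 byte per weight at 8-bit. It counts the diffusion model weights only; the text encoder, VAE and activations come on top, so treat it as a lower bound rather than a measurement.

```python
def weight_memory_gb(params_billion: float, bytes_per_param: float) -> float:
    """Approximate size of the weights alone, in GB (10**9 bytes)."""
    return params_billion * 1e9 * bytes_per_param / 1e9


FLUX_PARAMS_B = 12  # "12b parameter model", as stated in the thread

fp16_gb = weight_memory_gb(FLUX_PARAMS_B, 2)  # 16-bit: 2 bytes per weight
fp8_gb = weight_memory_gb(FLUX_PARAMS_B, 1)   # 8-bit: 1 byte per weight

print(f"fp16/bf16 weights: ~{fp16_gb:.0f} GB")
print(f"fp8 weights:       ~{fp8_gb:.0f} GB")
```

That lines up with "the model alone is 24GB" at 16-bit, and shows why 8-bit weights leave headroom on a 24GB card.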
>>
>>101671205
goes without saying
>>
>>101671205
you new?
>>
>>101671214
for it not to take forever? For it not to use the wasteful t5xxl instead of t5xl? it can only make images and there is no room for vid tech that isn't txt2vid boring garbage?
>>
The Schnell Hugging Face demo is back!
>>
>>101671258
>it can only make images and there is no room for vid tech that isn't txt2vid boring garbage?
Moving the goalpost I see.
>Yeah messi is great at football... BUT NOT SO GREAT AT BASKETBALL
we never asked this model to do that faggot
>>
File: de_fl_00014_.jpg (222 KB, 1216x832)
222 KB
222 KB JPG
>>101671231
sure, I'll look forward to that but for now its too big for me
>>
>>101671293
use that instead anon
https://replicate.com/black-forest-labs/flux-dev
>>
>>101671301
your uncle was too big for you too
>>
>>101671300
>Moving the goalpost I see.
these are problems with the model and you don't have a comeback for it
>clueless comparison
go sit in the corner retard
>>
>>101671329
>these are problems with the model and you don't have a comeback for it
I never said this model is perfect, why are you lying like that? I said that it's by far the best base local model we ever had, it's not even close
>>
>>101671312
Kek, Lego Quokka turned into duplo fox
>>
i miss schizo anon
>>
File: 1722535875355738.jpg (749 KB, 2048x2048)
749 KB
749 KB JPG
Flux is incredibly good at text, and real text, not the photoshopped comic--sans text like SD3 kek
>>
>>101671415
I'm right here.
>>
>>101671341
>lying
point out the lie
>>
>>101671416
please do a proper upscale of the bottom left one, very nice, or hand it over so I can do the dirty deed
>>
>>101671499
the image there is at its right resolution, you just have to crop it and upscale
>>
>>101671499
Is it even possible to upscale in comfy if the normal gen eats exactly 24gb of vram?
>>
>>101671558
it won't be 24gb of vram if you load everything on 8bit anon
>--fp8_e5m2-text-enc --fp8_e5m2-unet
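A minimal sketch of how those two flags might be passed when starting ComfyUI, assuming it is launched through `main.py` from the ComfyUI folder. The helper name `comfy_command` is made up for illustration; only the two flags come from the post above.

```python
import shlex

# Flags quoted in the post above; main.py is assumed to be the ComfyUI entry point.
FP8_FLAGS = ["--fp8_e5m2-text-enc", "--fp8_e5m2-unet"]


def comfy_command(python_exe: str = "python", extra_flags=()):
    """Build the launch command as an argument list (hypothetical helper)."""
    return [python_exe, "main.py", *FP8_FLAGS, *extra_flags]


cmd = comfy_command()
print(shlex.join(cmd))
```

The list form can be handed to `subprocess.run(cmd, cwd=...)` with the working directory pointing at the ComfyUI checkout.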
>>
>>101670535
Listen carefully, I'm not going to repeat:
DOWNLOAD FLUX SCHNELL FAST!!!!!!!!!! It's probably going to get taken down or some shit.

The niggas didn't filter the dataset properly, it can generate realistic looking ch*ld nudity. Right now Replicate/Fal don't filter the model outputs. They'll notice sooner or later.

IDK how they did such a bad job - the anime seems to be filtered from NSFW, but not real world fucking photos.
>>
>>101671114
>hardware for a 12B model
>gpu rich
/lmg/ with their 4x4090 rigs would laugh at you
>>
>>101671579
>DOWNLOAD FLUX SCHNELL FAST!!!!!!!!!!
more like flux dev, that's the better one
https://huggingface.co/camenduru/FLUX.1-dev/tree/main
>>
>>101671579
>The niggas didn't filter the dataset properly, it can generate realistic looking ch*ld nudity
retard fearmongering for no absolutely no reason
>>
>>101671626
it's d*bo
>>
>>101671616
yeah,, I checked, dev can generate it too, but schenll does it in literally 1 second on replicate
>>
>>101671579
Since when it's a problem, retard? All SD models could generate that. And SD 1.5 is especially great at it.
>>
download dev + schnell fast!!!!!!
>>
>>101671645
You haven't seen schnell outputs, they're complete level above SD 1.5, idk what dev could do given the right prompts.
>>
File: poolsclosed.png (352 KB, 1024x1024)
352 KB
352 KB PNG
>>
>>101671656
Niggy, that was the very first thing I genned and it wasn't as good and hot as what SD1.5 or SDXL produce.
>>
File: de_fl_00017_.jpg (320 KB, 896x1088)
320 KB
320 KB JPG
>>101671579
omg, its unsafe? I'm writing my congress persons as we speak

>>101671597
all that GPU just to make a miku chatbot

>>101671645
>Since when it's a problem
cuz they'll get sued and have to spend all their money on lawyers instead of training models
>>
is animatediff dead? i see the motion model for xl is 9 months old and is a beta. how are people making animations with pony or xl?
>>
>>101671656
>>101671647
>>101671579
that's cool, the fearmongering level is a good metric to see if a model is usually good or not, unironically
>>
File: pirate.jpg (117 KB, 1216x832)
117 KB
117 KB JPG
>>101671037
not bad
>>
File: 1699427414904338.png (2.28 MB, 832x1216)
2.28 MB
2.28 MB PNG
It passes the grass test alright
>>
So what's the soft ceiling for good quality with SDXL? It seems that when I start genning 1440pixel images, the quality starts dropping compared to like 1000pixel. Is there a sweet spot?
>>
File: HOLY-SHIT.jpg (597 KB, 3560x1740)
597 KB
597 KB JPG
>>101671726
It perfectly passes the grass test, insane quality
https://replicate.com/black-forest-labs/flux-dev
>>
File: file.png (19 KB, 637x233)
19 KB
19 KB PNG
fp8 loading just dropped in comfy
>>
>>101671774
>unet_name
so flux is a unet architecture?
>>
File: welcome.jpg (548 KB, 2048x2048)
548 KB
548 KB JPG
>>101671416
>>
>>101671788
it's insane how those humans look at far away distance, more parameters is really all you need after all
>>
File: out-0 (8).jpg (174 KB, 1024x1024)
174 KB
174 KB JPG
hmm... it's a good effort I guess
>>
>>101671037
>Apache 2.0 Licence
proprietary license. only the vae is apache 2.0
>>
File: out-0.png (845 KB, 1024x1024)
845 KB
845 KB PNG
the anime is pretty good. pro tip: object placement is always from the viewer's POV. I tried doing it from the character's but it always favors the viewer
>>
File: ComfyUI_Flux_0031.jpg (168 KB, 1024x1024)
168 KB
168 KB JPG
>>
For those who have at least 16gb of vram, there's a script that can run flux at 8bit
https://gist.github.com/AmericanPresidentJimmyCarter/873985638e1f3541ba8b00137e7dacd9
>>
>>101671823

Damn, even an American President is using Flux lol
>>
File: 1701270872717228.png (1.02 MB, 2192x547)
1.02 MB
1.02 MB PNG
>>101671799
ehm?
>>
>julien
>>
>>101671815
>proprietary license. only the vae is apache 2.0
is it worse than SDXL licence? I hope not, pony needs to work on this shit!!
>>
>>101671037
I'm sorry but in >>101670941 the girl's left hand is hairy.
>>
>>101671856
>implying a hobbyist can afford to train it
>>
>>101671853
it's way better than any other model at this far distance anon, not even close
>>
>lubimiv
>>
I showed flux to my gf and she said it's deeply concerning because people would use it to make nudes without consent
>>
>>101671876
hobbyists train 70B local LLMs, surely they can train this
>>
>>101671823
i thought you built that 4x3090 rig
>>
What is the point of SDG and LDG separation if everyone uses flux?
Asking for future.
>>
>>101671882
so she's a glass half empty type of person
>>
>>101671897
>everyone
>>
>>101671882

It really is disturbing that people make nudes of real people when you can make nudes of far hotter fictional people
>>
>>101671887
they are using 130b param models and have room to spare yet we are maxxing out at 12b in imagen... they shouldn't be treated the same
>>
>>101671906

Like the celeb nude creators that just ruin everything for everyone.
>>
>>101671897
I'm not getting a 3/4090 until they're £250~
maybe the bubble will have burst by then and this is the peak
>>
File: ComfyUI_Flux_0035.jpg (178 KB, 1344x768)
178 KB
178 KB JPG
>>
>>101671897
/ldg/ is full of losers who bet on lamer models than what sai shit out. most of the people from BF are ex-sai shitters too
>>
>>101671906
this, it's fucking retarded
>>
>>101671906
it's also a crime to make nudes of people without their explicit consent and downright repulsive
>>
>>101671799
dude thats an upscale with an nmkd model, an sdxl model resample and a facedetailer pass. adjusted the detection threshold so it picked them all up, even the 2 in the middle. not too much denoise so they dont stick out too much.
>>
>>101671906
>>101671882
nice, the fearmongering level is quite high, that means it's a great model >>101671626
>>
>>101671579
It's time someone simply stated: "we don't ban photoshop or cameras because they can be used to make illegal pictures, AI models can generate illegal pictures especially by a determined user, they are accepting that if they're caught with those illegal pictures they will go to prison"
>>
>>101671978
say that to the jews
>>
>>101671980
People only censor models because of the perceived social backlash.
>>
>>101671897
/ldg/ split off because they wanted a thread that was safe for trannies, not because they had any better models to use. /sdg/ primarily uses the best model, which until today was SD versions and their finetunes, but other models have always been discussed and occasionally posted with alongside them without issue.
>>
>>101671996
The split is because SDG attracts autistic weirdos that treat 4chan like a Discord server.
>>
File: ComfyUI_15069_.png (1.31 MB, 1920x800)
1.31 MB
1.31 MB PNG
cant quite do catgirls of my preference
>>
dall-e 4 when
>>
>>101671995
and abetting illegal counterfeiting.
>>
>>101672033
flux v2 will be dalle4
>>
File: redeem.jpg (842 KB, 2500x3333)
842 KB
842 KB JPG
>>
>>101672014
/sdg/ is where the cool kids hang out 8)
>>
File: de_fl_00027_.jpg (248 KB, 1344x768)
248 KB
248 KB JPG
>>101672019
I can't get anthropomorphic dog people competing at the olympics either

>>101672074
I wonder what hes thinking rn
>>
>>101672084
yeah /sdg/ loves kids
>>
>>101672019
>ComfyUI
are you using it on fp8 anon?
>>
>Debo still posting blurry gens even with a new model
Pottery
>>
>>101672019
10/10 would pat
>>
>>101672102
there's nothing wrong with liking kids, having a sexual attraction for them on the other hand....
>>
File: ComfyUI_15070_.png (1.43 MB, 1920x800)
1.43 MB
1.43 MB PNG
>>101672115
i just have the "weight dtype" set to default, didnt know what it did so didnt touch it
>>
File: sample.jpg (359 KB, 768x1440)
359 KB
359 KB JPG
howifeel
>>
File: kek.jpg (298 KB, 2976x1545)
298 KB
298 KB JPG
>>101672019
>cant quite do catgirls of my preference
skill issue, unironically
https://replicate.com/black-forest-labs/flux-dev
>>
>>101672188
How much VRAM do you have?
How much VRAM does it ask for the image model?
How much VRAM does it ask for the text encoder model?
Do you offload?
>>
File: ComfyUI_15079_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>101672217
i have a 4090, i dont do anything that i know of
im running on wsl if that matters

loading in lowvram mode 21557.27999973297
i get this log when running which i've not seen before
>>
File: 1698923926452472.png (2.61 MB, 1728x1344)
2.61 MB
2.61 MB PNG
>>101671726
FP8 might be degrading quality in the fine details?
dunno
>>
File: FLUX_00001_.png (1.29 MB, 1024x1024)
1.29 MB
1.29 MB PNG
Flux is great
You need a 3090/4090 for it though
With the basic workflow from comfy it fills up to 23.5G in lowvram mode
30 seconds/20 steps on my 4090

but quality is top
we gonna eat good
this feels like back when 1.4 was released
>>
File: ComfyUI_15085_.png (1.12 MB, 1024x1024)
1.12 MB
1.12 MB PNG
>>101672094
>>
File: FLUX_00002_.png (1 MB, 1024x1024)
1 MB
1 MB PNG
>>
>>101672303
>With the basic workflow from comfy it fills up to 23.5G in lowvram mode
what does lowvram do? does it offload the text encoder to the cpu?
>>
File: de_fl_00029_.jpg (265 KB, 1344x768)
265 KB
265 KB JPG
>>101672323
might be easier with cartoon styles. I'm trying to make it a real tv broadcast (but failing on the aesthetic too)
>>
>>101672303
>You need a 3090/4090 for it
it's over.
>>
File: FLX_00001_.png (1.91 MB, 1072x1072)
1.91 MB
1.91 MB PNG
>>101672303
yea just got it working too .. but let's test its limits, btw 1072x1072 just barely works on 24GB too
>>
File: out-0.jpg (229 KB, 1024x1024)
229 KB
229 KB JPG
>>101672367
it's still better than what the /lmg/ fags have to deal with, they need multiple 3090 to be able to run decent models kek
>>
File: FLUX_00005_.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>
>>101672375
>>101672303
>>101672279
>>101672263
can someone do a comparison between fp8 and fp16 to see if there's a lot of difference? if not you can save a lot of vram
>>
>>101672388
aren't you an /lmg/ fag?
>>
>>101672388
now you need it too, got back in the closet.
>>
File: FLUX_00007_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>
>>101672397
Looks pretty nice actually
>>
File: de_fl_00032_.jpg (322 KB, 1344x768)
322 KB
322 KB JPG
>>101672367
hang in there, quantized model in 2 weeks
>>
>>101672409
he is, /lmg/fags are known for their shit tastes & miku avatarfagging.
>>
File: out-0.png (1.21 MB, 1024x1024)
1.21 MB
1.21 MB PNG
>>101672409
>>101672411
I am and I have multiple gpus yeah, unfortunately having multiple gpus don't mean shit in image gen because you can only use one
>>
File: FLUX_00009_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>
File: FLUX_00010_.png (1.08 MB, 1024x1024)
1.08 MB
1.08 MB PNG
oh yeah
it's good
>>
https://huggingface.co/camenduru/FLUX.1-dev/tree/main
Do I have to rename .sft to .safetensor to make it work on comfy?
>>
File: FLX_00002_.png (1.8 MB, 1072x1072)
1.8 MB
1.8 MB PNG
>>101672404
can't, for fp16 you need something with more than 24GB of VRAM
>>
>>101672463
surprised what characters it actually knows. Seems like several were aligned out (asuka for one).
>>
File: FLUX_00014_.png (1.17 MB, 1024x1024)
1.17 MB
1.17 MB PNG
>>101672475
no need to rename it
just put it in models/unet
and put the ae.sft in models/vae
>>
>>101672479
>can't for fp16 you need something with more than 24GB of VRAM
some anon managed to do it on fp16 though? >>101672263
>>
File: FLX_00003_.png (1.43 MB, 1072x1072)
1.43 MB
1.43 MB PNG
>>
>>101672509
what about the text encoder?
>>
File: FLX_00004_.png (1.79 MB, 1072x1072)
1.79 MB
1.79 MB PNG
>>101672540
goes in clip
>>
>>101672560
I put the three of them?
clip_l.safetensors
t5xxl_fp16.safetensors
t5xxl_fp8_e4m3fn.safetensors
>>
>>101672019
adorable regardless
>>
Aw hell yeah!
>>
File: FLUX_00018_.png (750 KB, 640x896)
750 KB
750 KB PNG
wow
works well on resolutions lower than 1024x1024
I think they are using Unet for this type of versatility
DiT would usually shit the bed when trying to generate on resolutions different than what it was trained on
>>
File: aaaaa.jpg (257 KB, 1405x1737)
257 KB
257 KB JPG
>>101672581
>works well on resolutions lower than 1024x1024
indeed, the blog says that it should work fine on low resolution, they are so based they thought of everything
https://blackforestlabs.ai/announcing-black-forest-labs/
>>
File: FLX_00006_.png (1.25 MB, 1072x1072)
1.25 MB
1.25 MB PNG
>>101672574
they all go into
>/models/clip
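The folder layout described over the last few posts (diffusion model in models/unet, autoencoder in models/vae, text encoders in models/clip) could be sanity-checked with something like the sketch below. The file names in `EXPECTED` are assumptions for illustration, in particular `flux1-dev.sft` for the main model; adjust them to whatever was actually downloaded.

```python
from pathlib import Path

# Folder layout as described in the thread. File names are illustrative
# assumptions (flux1-dev.sft in particular), not a confirmed list.
EXPECTED = {
    "models/unet": ["flux1-dev.sft"],
    "models/vae": ["ae.sft"],
    "models/clip": ["clip_l.safetensors", "t5xxl_fp16.safetensors"],
}


def missing_files(comfy_root, expected=EXPECTED):
    """Return the relative paths of expected model files that are not present."""
    root = Path(comfy_root)
    return [
        f"{folder}/{name}"
        for folder, names in expected.items()
        for name in names
        if not (root / folder / name).is_file()
    ]


if __name__ == "__main__":
    for path in missing_files("ComfyUI"):
        print("missing:", path)
```

An empty list means every expected file was found under the given ComfyUI root.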
>>
File: FLUX_00022_.png (773 KB, 640x896)
773 KB
773 KB PNG
>>
File: 1710988900859851.png (2.61 MB, 1728x1344)
2.61 MB
2.61 MB PNG
>>101672279
>>101672404
okay FP8 is a tradeoff
the degradation is subtle though

also, someone needs to figure out how to hi rez with this model because the standard Img2Img is causing the colours to go whack

>>101672579
nice

>>101672581
Flux is a Mixed DiT model
>>
>>101672610
thanks a lot anon!
>>
File: FLX_00009_.png (1.04 MB, 1072x1072)
1.04 MB
1.04 MB PNG
can do goo, cant feet
>>101672610
>>
File: FLUX_00025_.png (759 KB, 640x896)
759 KB
759 KB PNG
>>
File: all.png (3.22 MB, 1024x3072)
3.22 MB
3.22 MB PNG
>>101672479
>>101672522
I have now swapped to fp8 as I was occasionally getting this, and it hasn't happened since

Loading 1 new model
Prompt executed in 35.02 seconds
got prompt
model_type FLUX
Killed

>>101672404
"a robot chicken"
>>
>>101672428
python is really shit trying to make that work while all the llm repos are in cpp
>>
File: FLUX_00027_.png (809 KB, 640x896)
809 KB
809 KB PNG
>>
File: 1701934184274075.png (2.6 MB, 1728x1344)
2.6 MB
2.6 MB PNG
>>101672623
okay fp8 TE was the culprit
fp16 TE with fp8 DiT is close enough
>>
File: ComfyUI_02085_.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
fp8 on a 4080, 1.21s/it
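To put the speed reports in this thread on the same scale, here is a small conversion to seconds per iteration. The inputs are the figures as posted (8 steps in 206 seconds, 20 steps in 30 seconds on a 4090, 1.21 s/it on a 4080 at fp8). Whether the first two totals include model loading is not stated, so the comparison is only indicative.

```python
def seconds_per_iteration(total_seconds: float, steps: int) -> float:
    """Average wall-clock seconds per sampling step."""
    return total_seconds / steps


reports = {
    "flux dev, 8 steps in 206 s": seconds_per_iteration(206, 8),
    "4090, 20 steps in 30 s": seconds_per_iteration(30, 20),
    "4080 fp8 (reported directly)": 1.21,
}

for label, s_it in reports.items():
    print(f"{label}: {s_it:.2f} s/it")
```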
>>
File: goo_00001_.png (1.02 MB, 1072x1072)
1.02 MB
1.02 MB PNG
>>101672623
for the time being you can hires with sdxl if needed.. first I have to find benefits this models offers other than extreme text accuracy and way more knowledge since 12b ofc .. gotta poke it with the go'ey stick

>>101672660
I tried fp16 and got a kernel panic and blue screen, glorious
>>
>>101672690
how do you set fp8 on comfy?
>>
File: FLUX_00032_.png (690 KB, 640x896)
690 KB
690 KB PNG
>>
make a huge titty bimbo in flux, right now.
>>
>>101672692
that's not bad
>>
File: FLUX_00039_.png (800 KB, 640x896)
800 KB
800 KB PNG
>>
File: FLUX_00043_.png (771 KB, 640x896)
771 KB
771 KB PNG
>>
>>101672743
woa have you read the license? violence against children in gens is forbidden
>>
>>101672708
git pull and get the latest Comfy
use these example workflows
https://comfyanonymous.github.io/ComfyUI_examples/flux/
set weight_dtype to one of the fp8 in Load Diffusion Model node

>>101672702
cute slimes
>>
File: Comparaison-FP8-16.gif (2.14 MB, 1728x1344)
2.14 MB
2.14 MB GIF
>>101672279
>>101672690
Thanks for the feedback anon
>>
File: FLUX_00044_.png (807 KB, 640x896)
807 KB
807 KB PNG
>>101672751
she's not a real person
>>
File: goo_00006_.png (1.09 MB, 1344x800)
1.09 MB
1.09 MB PNG
>>101672752
>cute slimes
thank you.. I prefer my sd slimes sofar.. also it doesnt understand "transparent slime"
>>
File: FLUX_00048_.png (496 KB, 640x896)
496 KB
496 KB PNG
the quality is extremely impressive
if you told me this was generated I would not believe you
>>
>>101672752
>set weight_dtype to one of the fp8 in Load Diffusion Model node
which one? fp8_e4m3fn or fp8_e5m2
>>
File: FLUX_00047_.png (600 KB, 640x896)
600 KB
600 KB PNG
>>
File: ComfyUI_02087_.png (3.41 MB, 2048x2048)
3.41 MB
3.41 MB PNG
upscales well
>>
File: FLUX_00051_.png (866 KB, 640x896)
866 KB
866 KB PNG
I just realized that it has no negative prompt
>>
>>101672755
uhh you got them wrong anon
please delete your image because it is misleading, I'll post a comparison pic later

>>101672802
either should be fine, I used fp8_e4m3fn
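On the e4m3fn versus e5m2 question: the names encode the bit split, with e4m3 using 4 exponent bits and 3 mantissa bits, and e5m2 using 5 exponent bits and 2 mantissa bits. The sketch below only illustrates the precision side of that tradeoff by rounding a value to a given number of mantissa bits. It is a simplified model that ignores exponent range, subnormals and saturation, so it is not a faithful fp8 implementation.

```python
import math

# Explicit mantissa bits per format, taken from the format names.
MANTISSA_BITS = {"fp8_e4m3fn": 3, "fp8_e5m2": 2}


def round_to_mantissa_bits(x: float, mantissa_bits: int) -> float:
    """Round x to the nearest value with the given number of explicit mantissa bits.

    Simplified: exponent range limits and subnormals are ignored.
    """
    if x == 0.0:
        return 0.0
    m, e = math.frexp(x)             # x == m * 2**e with 0.5 <= |m| < 1
    scale = 2 ** (mantissa_bits + 1)  # +1 for the implicit leading bit
    return math.ldexp(round(m * scale) / scale, e)


value = 0.7
for name, bits in MANTISSA_BITS.items():
    approx = round_to_mantissa_bits(value, bits)
    print(f"{name}: {value} -> {approx} (error {abs(value - approx):.4f})")
```

With one more mantissa bit, e4m3 lands closer to the original value, while e5m2 trades that precision for a wider exponent range.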
>>
File: 1722528357365492.png (594 KB, 1602x900)
594 KB
594 KB PNG
>>101672794
>the quality is extremely impressive
>if you told me this was generated I would not believe you
the very first flux gen I saw was this one and I thought it was just one of those random memes you found on the internet, that was a great first impression yeah
>>
File: FLUX_00057_.png (756 KB, 640x896)
756 KB
756 KB PNG
>>
>>101672829
what do you mean? I put his fp8 as a comparison? it's not the good one? >>101672690
>>
File: FLUX_00058_.png (713 KB, 640x896)
713 KB
713 KB PNG
>high quality movie poster of Elsa as John Wick
fuuuuck
it's so goood
>>
File: 1722456906567631.jpg (1 MB, 804x1430)
1 MB
1 MB JPG
Can any one replicate this style
>>
File: FLUX_00065_.png (769 KB, 640x896)
769 KB
769 KB PNG
>>
File: ComfyUI_Flux_0097.jpg (210 KB, 1344x768)
210 KB
210 KB JPG
>>
>>101672853
all those pics are mine
they are not a proper comparison because it is a mix of fp16 and fp8 TE as well as DiT

I'll create a proper comparison of bf16 vs fp8 DiT with TE fixed to fp16
>>
File: FLUX_00068_.png (803 KB, 640x896)
803 KB
803 KB PNG
>>
>>101672855
>>101672883
which model version? can you pls share, since flux has 3 different ones
>>
File: FLUX_00070_.png (700 KB, 640x896)
700 KB
700 KB PNG
>>101672870
gave it a try
>high quality sketch illustration of a badland female warrior with tattoos. her hair is a mess and she is eager to fight
not quite there, but with a few tweaks it can get there
>>
>>101672894
>it is a mix of fp16 and fp8 TE as well as DiT
flux is a DiT model?
>>
>come out of fucking nowhere and btfo everyone in an instant
how did they do it
>>
File: FLUX_00072_.png (773 KB, 640x896)
773 KB
773 KB PNG
>>101672904
I'm just on flux-dev for now
haven't tried the other one yet
>>
File: goo_00016_.png (1.65 MB, 1376x800)
1.65 MB
1.65 MB PNG
ya 12b is indeed capable of nice stuff
>a complex science fiction town drawn by Akira Toriyama, sunset .. now I wish you could get an anime nipple, wasnt able to yet
>>
File: styles nanachi_00002_.png (3.67 MB, 1344x1728)
3.67 MB
3.67 MB PNG
>>
uh, lewd huge tits?
>>
File: FLUX_00078_.png (781 KB, 640x896)
781 KB
781 KB PNG
>>
>>101672941
is that flux?
>>
File: rose_fire.jpg (143 KB, 1024x1024)
143 KB
143 KB JPG
sdxl, 1min47sex, gtx1060 6GB, 16GB ram
>>
>>101672953
paying her respects for his parents :(
>>
File: styles nanachi_00005_.png (3.69 MB, 1728x1344)
3.69 MB
3.69 MB PNG
>>101672941
ah, wrong one
>>101672956
sdxl
>>
File: 1693578401318859.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>101672918
>>
File: 1699412529931030.png (407 KB, 1080x1279)
407 KB
407 KB PNG
FUCKING KEK, STABILITY IS
D E A D
E
A
D
>>
File: FLUX_00090_.png (526 KB, 640x896)
526 KB
526 KB PNG
>D'Oh!
>>
File: hmm.jpg (296 KB, 3202x1573)
296 KB
296 KB JPG
how do I put the text encoder in fp8?
>>
File: FLUX_00095_.png (584 KB, 640x896)
584 KB
584 KB PNG
>>
File: FLUX_00100_.png (606 KB, 640x896)
606 KB
606 KB PNG
>>
File: rose_girl.jpg (196 KB, 1024x1024)
196 KB
196 KB JPG
>>101672961
same min same sex
>>
File: goo_00020_.png (1.4 MB, 1376x800)
1.4 MB
1.4 MB PNG
guess we will have to wait a few months till a capable anime fine tune of flux appears .. it's okayish in anime, nothing more .. or I haven't found the right way to prompt it into good anime yet
>>
>trani
>>
File: ComfyUI_02089_.png (3.33 MB, 2048x2048)
>>
File: wtf.jpg (16 KB, 799x211)
wtf, flux asks for a shit ton of ram during loading
>>
File: goo_00021_.png (1.25 MB, 1376x800)
>>101673001
top right corner, change "weight type" from default to fp8_e4... or fp8_e5...
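For anyone wondering what the two fp8 choices trade off, here is a rough sketch. It assumes the truncated option names refer to the two common 8-bit float layouts, E4M3 (the "fn" variant with no infinities) and E5M2. The helper name fp8_max is made up for illustration and is not part of ComfyUI.

```python
def fp8_max(exp_bits: int, man_bits: int, has_inf: bool) -> float:
    """Largest finite value of an 8-bit float with the given layout."""
    bias = 2 ** (exp_bits - 1) - 1
    if has_inf:
        # IEEE-like layout: the all-ones exponent is reserved for inf/NaN,
        # so the largest usable exponent field is one below it.
        max_exp = (2 ** exp_bits - 2) - bias
        max_mantissa = 2 - 2 ** -man_bits
    else:
        # "fn" layout: only the all-ones exponent with all-ones mantissa
        # is NaN, so the top exponent is usable with one mantissa fewer.
        max_exp = (2 ** exp_bits - 1) - bias
        max_mantissa = 2 - 2 ** -(man_bits - 1)
    return max_mantissa * 2.0 ** max_exp


# (exponent bits, mantissa bits, reserves inf) for each assumed layout
LAYOUTS = {"e4m3fn": (4, 3, False), "e5m2": (5, 2, True)}

for name, (e, m, has_inf) in LAYOUTS.items():
    # Relative spacing between neighbouring values is 2**-mantissa_bits.
    print(name, "max:", fp8_max(e, m, has_inf), "rel. step:", 2.0 ** -m)
```

Under these assumptions E4M3 has the finer steps and E5M2 has the wider range, which is the usual reason to prefer the former for weights.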
>>
File: 1716396560581528.png (1.23 MB, 832x1216)
>>101673001
set weight_dtype to one of the fp8 in Load Diffusion Model node

>>101673024
what are you looking for anon
SFW anime should be okay in Flux
>>
File: FLUX_00106_.png (1.03 MB, 640x896)
>>
>>101673044
>>101673045
but that will put fp8 on the image model and not the text encoder? or it does both? how do you get to choose only one of them?
>>
File: FLUX_00119_.png (512 KB, 640x896)
>>
File: file.jpg (366 KB, 1792x1024)
working on flux ait desu
i really need to fix up more of ait so its easier to write the IR. the rope stuff is just awkward atm
>>
File: goo_00022_.png (1.27 MB, 1376x800)
>>101673045
>SFW anime should be okay in Flux
hhhmmm that's not what I mean .. it doesn't understand some basic stuff I am used to .. like transparency in 2D .. or artists .. maybe I just need to learn to use t5xxl better

pic related for example is _not_ an
>anime girl by Studio Ghibli
even NAI got Studio Ghibli right .. probably we just need a finetune
>>
>>101671926
NINTENDO
HIRE THIS COMPUTER
>>
>>101672982
So Flux is actually the real SD3. We're so fucking back
>>
>>101672428
>multiple gpus don't mean shit in image gen because you can only use one
What, really?
There goes my 2x3090 idea on this.
>>
Cool
>>
File: FLUX_00123_.png (712 KB, 640x896)
>I'm sorry SAI. Nothing personal. It's just ... business.
>>
how much VRAM do I need for flux?
>>
File: ComfyUI_02092_.png (3.43 MB, 2048x2048)
>>101673070
you just download the fp8 t5 model and use that: https://huggingface.co/camenduru/FLUX.1-dev/tree/main
>>
>>101673122
32GB
>>
>>101673129
and I sleep again
>>
File: FLUX_00130_.png (701 KB, 640x896)
>>
>>101673135
works with 24GB in float8
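Back-of-envelope check on those numbers, weights only. The 12e9 figure is the "12b" mentioned in the thread. The T5-XXL encoder size (about 4.7e9 parameters) is an assumption, and activations, the VAE and CLIP-L are ignored, so real usage will be higher than what this prints.

```python
# Bytes needed to store one parameter in each precision.
BYTES_PER_PARAM = {"fp16": 2, "fp8": 1}

DIT_PARAMS = 12e9      # "12b" diffusion transformer, per the thread
T5XXL_PARAMS = 4.7e9   # assumed size of the T5-XXL text encoder


def weight_gib(params: float, dtype: str) -> float:
    """Memory taken by the weights alone, in GiB."""
    return params * BYTES_PER_PARAM[dtype] / 2 ** 30


for dit_dtype in ("fp16", "fp8"):
    for te_dtype in ("fp16", "fp8"):
        dit = weight_gib(DIT_PARAMS, dit_dtype)
        te = weight_gib(T5XXL_PARAMS, te_dtype)
        print(f"DiT {dit_dtype} + TE {te_dtype}: "
              f"{dit:.1f} + {te:.1f} = {dit + te:.1f} GiB")
```

The two models do not have to sit in VRAM at the same time, so the sum is a worst case rather than a hard requirement.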
>>
>>101673070
that should change only the diffusion model
for TE fp8, just load the fp8 t5xxl file or add the corresponding cmd line option when running comfy

>>101673096
artists are probably nuked, and 2D data seems to be cleaned, so transparency might be an issue
let me see what I can gen
>>
>>101673122
you'll never be able to run that model, anon. start working some overtime shifts and you might get a chance next year
>>
>>101673117
>>
File: FLUX_00135_.png (787 KB, 640x896)
>>
>>101673117
>>101673162
That's fucking crazy how accurate it gets the default XP wallpaper
>>
File: FLUX_00141_.png (739 KB, 640x896)
>>
File: asas.jpg (54 KB, 1102x564)
>>101673147
So basically I put fp8 for clip 1 and leave clip_l for clip 2? Is there a big difference between an fp16 text encoder and an fp8 text encoder?
>>
File: 1691821579792536.png (1.03 MB, 832x1216)
>>101673122
24 GB

>>101672774
try "transparent see-through"
picrel is "a transparent see-through cute slime in a foggy moody forest"
>>
File: de_fl_00034_.jpg (321 KB, 1344x768)
>>101673093
>ait
the more things change, the more they stay the same
>>
File: FLUX_00149_.png (564 KB, 640x896)
>>
File: goo_00024_.png (1.32 MB, 1024x1024)
>>101673070
you are right, just look at the leftmost nodes .. the top one is the model, you can choose to go fp8 on that. the lower one is the text encoder, you can go fp8 there too .. either will do, or both (maybe you can get higher res then)

also pic related.. somewhat studio ghiblish.. hmm .. I just dont like boomerprompting
>>
>>101673176
it's really good at making desktops, give it a try
>>101671850
>>101671903
>>101671959
>>
>>101672909
It's also the lighting and softness
>>
File: 00164-2761608105.jpg (1.33 MB, 1944x2576)
lots of dreaming last night. only bit i remember is at the end a sort of music video for 'school's out' starts playing but it devolves into a highschool of zombie people, and a young boy gets led around the corner and torn up. then i was awake.
>>
File: dda.jpg (303 KB, 3187x1593)
How do you turn on preview mode in Comfy?
>>
File: 1705825019457705.png (991 KB, 832x1216)
>>101673180
>>101673178
yes
fp8 TE is slightly jank, better to use fp8 for the DiT since it degrades much less compared to the TE
>>
File: goo_00025_.png (926 KB, 1024x1024)
>>101673180
ya that's great, but that's not anime style, it's more like a 3D render. you can achieve such things in realistic SDXL models too, but I am looking for something that's transparent in a 2D environment
>>
>>101673210
I meant within the context of it being in what is essentially the background.
It's an unnecessary but impressive detail.
>>
File: ComfyUI_02094_.png (3.75 MB, 2048x2048)
closer to Dall-e quality than anything we've had yet, crazy this dropped outta the blue
>>
File: goo_00026_.png (895 KB, 1024x1024)
I am probably just gobly that I have to learn new prompting rules .. it'll work out, 12b definitely is superior in many many ways
>>
File: de_fl_00035_.jpg (312 KB, 1344x768)
>>101673218
hop on the flux train and you'll forget all about it
>>
>>101673219
i assume you mean like a "simple ui" view?

drag the clip text encode above the picture and zoom in so it fills your screen.
boom ui engineer
>>
File: fp8-DiT.jpg (264 KB, 3219x1583)
>>101673221
>fp8 TE is slightly jank, better to use fp8 for the DiT since it degrades much less compared to the TE
and the text encoder is in VRAM, right? is there a way to keep it only in RAM so I can save memory for the DiT?

Btw fp8 DiT managed to do the "halo" meme fine
>>
>>101673196
at this point there's so much more in my fork of ait and meta's i might as well defork it
>>
>>101673258
no, like you can see the steps as it starts to generate, like on A1111
>>
File: 1701895244290691.png (977 KB, 832x1216)
>>101673219
double click an empty area
type preview and click the Preview Image node
connect the output of the VAE Decode node to that
toggle the save image node with Ctrl + M

>>101673228
I get transparency in 2D just fine though
picrel is "anime illustration of a woman wearing a transparent see-through dress"

>>101673265
Comfy will automatically shuffle things around between RAM and VRAM, don't worry about it
>>
>>101673279
open comfyUI manager menu and select a preview type in the dropdown box
>>
File: file.jpg (487 KB, 1792x1024)
>>101673270
than meta's*
>>
>>101671258
They stated the next model is a T2V based on Flux.
>>
>>101673239
holy i am coom, give prompt pls pls
>>
File: FLUX_00190_.png (942 KB, 640x896)
>>
File: file.png (2 KB, 213x62)
ram usage converting dev to diffusers lol
>>
File: FLUX_00193_.png (978 KB, 640x896)
>>
File: goo_00032_.png (959 KB, 1024x1024)
>>101673286
>I get transparency in 2D just fine though
>picrel is "anime illustration of a woman wearing a transparent see-through dress"
ya I guess transparent anime dresses are abundant in the dataset, but it seems neither oppai girls nor transparent slimes are
>an oppai girl is hugging a transparent see-through slime in a foggy moody forest at sunset, kawaii, anime style, behind them are sticks and stones
nothing to worry about, finetunes will happen if the license allows it I guess
>>
>>101673279
https://github.com/comfyanonymous/ComfyUI?tab=readme-ov-file#how-to-show-high-quality-previews

but I think someone will need to write a latent2rgb/train a taesd equivalent for this model
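For context, a latent2rgb preview is just a per-pixel linear projection from the latent channels to RGB. A minimal sketch of the idea follows. The factor values, the assumption that latents sit roughly in [-1, 1], and the function name latent_to_rgb are all hypothetical; real factors would have to be fitted for this model's latent space.

```python
def latent_to_rgb(latent, factors):
    """Project a C x H x W latent to an H x W grid of (r, g, b) tuples.

    latent:  list of C channels, each a list of H rows of W floats
    factors: list of C rows, each [r, g, b] weights for that channel
             (hypothetical values, normally fitted per model)
    """
    channels = len(latent)
    height, width = len(latent[0]), len(latent[0][0])
    image = []
    for y in range(height):
        row = []
        for x in range(width):
            pixel = []
            for k in range(3):
                # Weighted sum over latent channels for this colour.
                v = sum(latent[c][y][x] * factors[c][k] for c in range(channels))
                # Map the assumed [-1, 1] range to [0, 255] and clamp.
                v = max(0.0, min(1.0, (v + 1.0) / 2.0))
                pixel.append(int(v * 255))
            row.append(tuple(pixel))
        image.append(row)
    return image


# Toy 2-channel, 1x1 latent: channel 0 drives red, channel 1 drives green.
toy_latent = [[[1.0]], [[-1.0]]]
toy_factors = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
print(latent_to_rgb(toy_latent, toy_factors))
```

This is cheap enough to run every step, which is why it is used for live previews. A taesd-style preview would replace the matrix with a small trained decoder.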
>>
File: FLUX_00203_.png (675 KB, 640x896)
>>
File: 1695908622540457.png (1.31 MB, 832x1216)
>>101673286
it understands 2D transparency just fine

>>101673340
It can do isometric GBA style, nice

>>101673343
lemme see if I can get something similar
>>
File: FLUX_00207_.png (712 KB, 640x896)
>>
File: kek.jpg (350 KB, 3840x1635)
>>101673296
>open comfyUI manager menu
where is it?
>>
AAAAAAAAAAAAA FUCK ME replicate flux pro is broken
>>
stability wasnt even able to release the updated SD3 weights lmao
>>
File: FLUX_00212_.png (846 KB, 640x896)
>>
>>101673384
Whatever they release is gonna look like a pebble compared to the mountain Flux is
>>
File: (41).png (1.12 MB, 1024x576)
>>
File: mmm.jpg (58 KB, 1479x602)
For each gen, it loads and unloads on my RAM, which makes shit slow. How do I prevent that in ComfyUI?
>>
>>101673375
oh, you need to download ComfyUI Manager. You can enable previews with startup commands too, but idk what they are, you'd have to look it up.
ComfyUI Manager is incredibly useful tho, so if you want to actually start using comfy I'd recommend getting that: https://github.com/ltdrdata/ComfyUI-Manager
>>
>>101673384
instead they left stability and then released flux :^)
>>
File: goo_00033_.png (1.43 MB, 1024x1024)
>>101673367
>lemme see if I can get something similar
thank you .. I'll try to learn .. t5xxl gives me headaches, "retro anime style" kinda works tho
>>
File: ComfyUI_02105_.jpg (846 KB, 2048x2048)
>>
File: ComfyUI_15125_.png (204 KB, 512x512)
does that count as transparent
>>
>>101673384
It's because the people who were originally making SD3 left to make this kek
>>
>>101673399
>>101673399
>>101673399
>>
>>101673435
oh ok, thanks anon
>>
>>101673445
yea kinda.. but I guess I am just a perfectionist with transparent slime.. thanks for trying tho
>>
>>101673445
If the cactus is behind her, yes.
>>
>>101672982
Yeah so the issue really was Emad. They are doing perfectly fine now that this grifter is out of the place.


