[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 1697802239013434.jpg (307 KB, 1536x1536)
307 KB
307 KB JPG
Previous /sdg/ thread : >>101785853

>Beginner UI local install
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SD.Next: https://github.com/vladmandic/automatic
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>Try online without registration
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://aitracker.art
https://openmodeldb.info

>Black Forest Labs: Flux
https://huggingface.co/black-forest-labs/FLUX.1-schnell
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Discord
6wUwtcJsr2

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: da nooz 25.jpg (531 KB, 1344x960)
531 KB
531 KB JPG
>mfw Resource news

08/08/2024

>ml_mdm - Matryoshka Diffusion Models
https://github.com/apple/ml-mdm

>TinyVAE Training Repository
https://github.com/XmYx/tinyvae-flux

>InstantX / FLUX.1-dev-Controlnet-Canny-alpha
https://huggingface.co/InstantX/FLUX.1-dev-Controlnet-Canny-alpha

>Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO and SAM 2
https://github.com/IDEA-Research/Grounded-SAM-2

>ComfyUI xlabs Flux ControlNet implementation
https://github.com/comfyanonymous/ComfyUI/pull/4260

>"Forge is under a week of major revision. Backend Rewrite is 90% finished"
https://github.com/lllyasviel/stable-diffusion-webui-forge#under-construction

>Fast Sprite Decomposition from Animated Graphics
https://cyberagentailab.github.io/sprite-decompose

>Concept Conductor: Orchestrating Multiple Personalized Concepts in T2I Synthesis
https://github.com/Nihukat/Concept-Conductor

>D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods
https://github.com/Onkarsus13/D2Styler

>ComfyUI-EbSynth: Fast Example-based Image Synthesis and Style Transfer
https://github.com/FuouM/ComfyUI-EbSynth

08/07/2024

>ControlNet Canny FLUX.1-dev
https://huggingface.co/XLabs-AI/flux-controlnet-canny

>Diffusion1B: Dataset of 1.2 billion images generated by diffusion models
https://huggingface.co/datasets/bigdata-pw/Diffusion1B

>Diffusers v0.30.0: New Pipelines, New Methods, and New Refactors
https://github.com/huggingface/diffusers/releases/tag/v0.30.0

>Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
https://github.com/OpenGVLab/Diffree

>LLaVA-OneVision Easy Visual Task Transfer
https://llava-vl.github.io/blog/2024-08-05-llava-onevision

>IP Adapter Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts
https://unity-research.github.io/IP-Adapter-Instruct.github.io

>SiCo: A Size-Controllable Virtual Try-On Approach for Informed Decision-Making
https://github.com/SherryXTChen/SiCo
>>
File: scscsc.jpg (183 KB, 2492x746)
183 KB
183 KB JPG
/g/ bros... I need help
>>
>mfw Research news

08/08/2024

>How Well Can Vision Language Models See Image Details?
https://arxiv.org/abs/2408.03940

>Lightweight Video Denoising Using a Classic Bayesian Backbone
https://arxiv.org/abs/2408.03904

>Target Prompting for Information Extraction with Vision Language Model
https://arxiv.org/abs/2408.03834

>Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation
https://arxiv.org/abs/2408.03735

>Soft-Hard Attention U-Net Model and Benchmark Dataset for Multiscale Image Shadow Removal
https://arxiv.org/abs/2408.03734

>Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling
https://openstorypp.github.io/

>TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization
https://arxiv.org/abs/2408.03637

>A comparative study of generative adversarial networks for image recognition algorithms based on deep learning and traditional methods
https://arxiv.org/abs/2408.03568

>Hybrid diffusion models: combining supervised and generative pretraining for label-efficient fine-tuning of segmentation models
https://arxiv.org/abs/2408.03433

>Set2Seq Transformer: Learning Permutation Aware Set Representations of Artistic Sequences
https://arxiv.org/abs/2408.03404

>A Non-negative VAE:the Generalized Gamma Belief Network
https://arxiv.org/abs/2408.03388

>FastEdit: Fast Text-Guided Single-Image Editing via Semantic-Aware Diffusion Fine-Tuning
https://arxiv.org/abs/2408.03355

>An Empirical Comparison of Video Frame Sampling Methods for Multi-Modal RAG Retrieval
https://arxiv.org/abs/2408.03340

08/07/2024

>MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation
https://arxiv.org/abs/2408.03312

>DopQ-ViT: Towards Distribution-Friendly and Outlier-Aware Post-Training Quantization for Vision Transformers
https://arxiv.org/abs/2408.03291
>>
>Hello Anon! I hope you are doing well :]
>>
File: BMP_FLUX_04139_.png (1.67 MB, 1024x1024)
1.67 MB
1.67 MB PNG
>5'11"
>>101791958
Goes in the unet folder in models not checkpoints
>>
File: 0.jpg (239 KB, 1024x1024)
239 KB
239 KB JPG
>>
File: 0.jpg (117 KB, 1024x1024)
117 KB
117 KB JPG
>>
File: 363.jpg (24 KB, 1126x293)
24 KB
24 KB JPG
>>101791976
Now I get this... at least the message is smaller. Already tried updating the ComfyUI and pytorch.... nothing
>>
File: 00063-3224901288.jpg (639 KB, 1368x1776)
639 KB
639 KB JPG
get the flux out
>>
File: mc_xl_00275_.png (2.06 MB, 1328x1944)
2.06 MB
2.06 MB PNG
>>
>>101792098
desu anon it just looks like you haven't got the flux model downloaded in the right folder or aren't using the correct node to load it
>>
>>101792098
refresh the ui and reselect it
>>
is civit messed up right now? I can't login
>>
>>101792098
Is your model in unet?
Follow a simple guide anon, the basics have been explained in a lot of places.
>>
Does anybody know how I can get ControlNet to work with SDXL on Forge? I have the models but the results are fucked up. It works fine on SD 1.5 models (with the appropriate ControlNet model. I'm not using the same for SDXL)
>>
>>101792156
Have you tried reading the docs retard kun?
>>
>>101792113
>>101792128
I tried it in checkpoints folder and afterwards in unet folder, neither worked. I followed an youtube guide https://youtu.be/KTPLOqAMR0s?si=PGhl3AZJnbeOXHRT&t=178 he leaves it at the checkpoints folder

>>101792114
Same error message
>>
>>101792231
flux is not the same as a traditional stable diffusion checkpoint

https://comfyanonymous.github.io/ComfyUI_examples/flux/
>>
>>101792231
Follow another guide.
I literally put it in /models/unet/Flux/, clicked on the uner loader workflow node and selected it, it's super simple.
>>
>>101792231
just follow comfys simple guide and make sure your comfy is up to date, and go from there.
https://comfyanonymous.github.io/ComfyUI_examples/flux/
>>101792196
I still see ppl using sd15 embeddings in their sdxl gens, so there is that.
>>
>>101792018
Had to check long and hard to make sure this wasn't a merchant
>>
>>101792196
Have some self respect anon, don't do that.
It's mostly people bad habits.
Flux doesn't understand the weight stuff, and masterpiece is useless since at least SDXL.
>>
We need a guide for the setup for low-tier PCs
>>
File: file.jpg (222 KB, 1024x1792)
222 KB
222 KB JPG
>>101791896
occam's razor desu, the data simply isn't there in general and you're getting results sometimes with descriptions because of an odd few images with laion trash captions that are descriptions
they're going to have used what they already had from sai so laion and midjourney
afaik they haven't said "we recaptioned everything with vllm" that's just conjecture and more likely down to T5
>>
File: file.jpg (464 KB, 1024x1792)
464 KB
464 KB JPG
>>101792350
>more likely down to T5
to clarify, that's responding to long prompts because T5 is a llm
>>
>>101792350
Occam's razor for me would be for them to have money to get (more) complete datasets.
If they could pay for the compute, they could pay for a good dataset.
>>
>>101792267
>>101792278
I am going to follow the guide, thanks m8
>>
https://github.com/apple/ml-mdm
Wtf is this for? I'm a brainlet
>>
File: file.jpg (315 KB, 1024x1792)
315 KB
315 KB JPG
>>101792402
they know where i am :^)
>>
File: FLX_00042_.png (1.26 MB, 968x1088)
1.26 MB
1.26 MB PNG
>>101792457
That was the wrong one
>>
>>101792477
I bet she hates He-Man
>>
File: best_new_dataset.png (8 KB, 290x290)
8 KB
8 KB PNG
>>
>>101792453
yeah and they want nothing to do with you for a reason, faggot
>>
>>101792514
stay mad, datalet
>>
>>101792477

poor monk seems lost his meat
>>
>all that data
>no likes
>no job
>>
>>101792533
if your data was quality it'd be in use, sweetheart
>>
What's the future of local? Is SD3 a deadend?
>>
>>101792618
moar models. sd3 is dead.
>>
File: file.jpg (264 KB, 1536x1536)
264 KB
264 KB JPG
>>101792618
Looks like it. Stable diffusion wants to earn money now, so they'll only release gimped shit to the public and try to make money from their big models that are API only. But those models can't hold a candle to their proprietary competitors, so why would anyone use them?

Flux, which was recently released and came out of nowhere seems to be the next big thing. Takes a metric pisston of vram to run properly and people are still figuring out the best way to train loras for it and finetune it so it becomes more useful, but it blows all of stable diffusion's base models out of the water.
>>
>>101792618
The future of local are releases by companies/startups with local models understanding natural language but who never saw a penis/vagege/tit nor ever heard of any known character ever made.
Then some autist anon tries to add this stuff back with more or less success.
>>
File: 000000_16091_.png (2.99 MB, 1434x1075)
2.99 MB
2.99 MB PNG
Flux does a Beaver Tail, she passes, a bit hairy but nice.
>>
File: a.jpg (1.22 MB, 3223x3051)
1.22 MB
1.22 MB JPG
Wtf I think I discovered something interesting by accident, even if you put nothing in the negative prompt, if you increase the Guidance Negative, the model will respect the style more and won't overcook Miku style to the model!
>>
>>101792661
How much vram does Flux need?
>>
>>101792901
if you go for fp8 flux (~12gb + 2gb for the resolution) + fp16 clip (~9.6gb) + VAE (2gb spike at the very end)
>>
>>101792930
I "only" have 12...
>>
>>101792618
short term: better flux
long term: generative 3D model + simulator + LLM + 2D background, it's essentially the same as the structure of video games.
>>
>>101792292
I'm not the one who do it
https://civitai.com/images/23243276
>>
>>101792820
No seriously this is working, look!
https://imgsli.com/Mjg1Nzc0
>>
File: file.jpg (254 KB, 1536x1536)
254 KB
254 KB JPG
>>
>>101792901
>>101792941
12 is good enough if you optimize your pc vram / ram usage. but it's in the "barely" range. keep cfg at 1
>>
>>101792974
People on civitai always use retarded shizoprompts that do nothing. Nothing new.
>>
File: ComfyUI_00158_.png (1.39 MB, 1024x1024)
1.39 MB
1.39 MB PNG
>>101792618
SD shot themselves on the foot with SD3, and with a cool new kid on the block it's all downhill from here for them
>>
>>101792292
>>101793049
Can you prove it actually never helps? AI shit is perfect for cargo culting bullshit.
>>
>>101793100
Yes, push the number to 10-20 and you won't see any difference.
And make the same image seed without "masterpiece".
>>
>>101793100
Do you think Pony score tags would be anything but noise to Flux?
>>
why don't models have prompt feedback anyways? why can't it tell me it hasn't been trained on whatever gibberish I just typed in
>>
>>
>>101791935
4090 is en route, can't wait to train some cunny loras
>>
File: ComfyUI_00159_.png (2.01 MB, 1024x1024)
2.01 MB
2.01 MB PNG
>>101793270
almost looks like bonbi....
>>
File: mc_xl_00375_.png (1.91 MB, 1248x1824)
1.91 MB
1.91 MB PNG
>>101792986
ugh, i guess i'll unload the waifu workflow
>>
>>101793375
i dont know who that is
>>
File: ComfyUI_00160_.png (958 KB, 1024x1024)
958 KB
958 KB PNG
>>101793409
it's probably for the best you don't
/ttg/ was a wild place
>>
in comfyui, what fucking key press keeps making all my seeds fixed, and how the FUCK do i disable this?
>>
File: ComfyUI_01768_.png (2.06 MB, 1024x1024)
2.06 MB
2.06 MB PNG
>>101792820
Got another example
>Hatsune Miku, 80's anime drawing style
https://imgsli.com/Mjg1Nzg2
>>
>>101793523
how do i download the workflow for comfy?
>>
>>101793569
Take this anon: https://files.catbox.moe/m9qr6g.png
>>
>>101792820
>>101793523
show the left side of the spaghetti pls
>>101793583
is that the same workflow you posted earlier, with teh dynamic thrshholding and the disabled xyz?
>>
>>101793610
>is that the same workflow you posted earlier, with teh dynamic thrshholding and the disabled xyz?
yes, DynamicThresholding is mendatory if you want to use high CFG + Negative prompt/Guidance
>>
>>101793615
tips for realism?
the settings as they are on teh workflow tend to overburn even with no neg and high neg guidance
>>
File: file.jpg (306 KB, 1792x1024)
306 KB
306 KB JPG
flickr in progress. going through past years of 'explore' first which are curated best photos. then ill grab all the photos from every user of those. after that ill go through the groups
also getty pretrain set downloading images, and more scraping in progress, can probably get that dropped tomorrow
gn
>>
>>101793695
can you post a screen of comfy's workflow with the burnt image and the settings so I can see what's wrong with it?
>>
>>
File: Capture.jpg (52 KB, 597x600)
52 KB
52 KB JPG
>>101793695
Do you use those new settings on DynamicThresholdingFull? The old ones weren't optimal
>>
>>101793710
>>101793769
it's literaly the exact workflow posted here >>101793583
only difference is prompt, and i'm not using anything weird (or quality tags)
on this image i changed cfg on ksampler to 2, the previous image was 4, and the one before was 6
so threshholding seems to be making it worse vs just using cfg 1 on the ksampler
>>
File: ComfyUI_10510_.png (2.86 MB, 1600x1200)
2.86 MB
2.86 MB PNG
>>
File: BMP_FLUX_03135_.png (757 KB, 1024x1024)
757 KB
757 KB PNG
>>
>>101793775
>threshholding seems to be making it worse vs just using cfg 1 on the ksampler
higher CFG are meant to make the model understand prompts better, but if you don't use DynamicThresholding the image will collapse, it's something made to prevent the insane burning, but doesn't work at 100% yeah, seems to be working better on anime rather than realism
>>
>>101793775
>>101793803
oh and different w x h of course
maybe that's why
here's at cfg 1.5, i'm gnona play with the other settings then
>>
>>101793821
>6 fingers
flux is trash
>>
File: IMG_2117.png (2.23 MB, 1224x1224)
2.23 MB
2.23 MB PNG
Killing N’wahs and outlanders in the name of CHIM
>>
File: BMP_FLUX_03675_.png (1.59 MB, 1024x935)
1.59 MB
1.59 MB PNG
>>101793844
Works on my machine
>>101793878
Based
>>
File: ComfyUI_02700_.png (1023 KB, 1152x896)
1023 KB
1023 KB PNG
>>
File: Capture.jpg (797 KB, 3840x1360)
797 KB
797 KB JPG
>>101792820
Ok I did a XY plot (X = sampler I just took euler AND Y = GuidanceNeg scale) and... what the fuck, it looks like the sweet spot is at 10... how lucky was I? I was just puting this number randomly, if I decided to go for 7 for example I would've gotten something glitched and I would never consider this viable or look at it with more details... this is INSANE
https://files.catbox.moe/rpp8u2.jpg
>>
File: BMP_FLUX_03822_.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
I thought it would be a cool concept to have a heart with extending tree branches sucking the life out of stuff around it. But then I got bored/forgot and started making goblin girls roasting hearts over a campfire like marshmallows instead
>>
File: BMP_FLUX_03899_.png (1.48 MB, 1024x1024)
1.48 MB
1.48 MB PNG
Pic related
>>
>>101793997
you're in the news >>101791952
>>
File: ComfyUI_02714_.png (1015 KB, 1152x896)
1015 KB
1015 KB PNG
>>
File: BMP_FLUX_03943_.png (1007 KB, 1024x1024)
1007 KB
1007 KB PNG
>>101794037
Man, I wish. Weed and marathoning anime was the best, haven't done it for years. I miss it a lot.
>>
indeed
>>
>>101794200
So what do these mean. Is Comfy hard to use? I haven't found a reason to try Comfy yet, just looks more complicated.
>>
>>101794221
Yes absolutely nothing is explained for you in the UI no tooltips no nothing, and there are hundreds of nodes with sometimes 10+ different settings for each and you're expected to go out on your own and figure out what they do.
>>
File: ComfyUI_01801_.png (2.22 MB, 1024x1024)
2.22 MB
2.22 MB PNG
>>101792820
Watercolor works so well with that method
https://imgsli.com/Mjg1ODI5
>>
File: delux_tm_00001_.png (1.13 MB, 1152x896)
1.13 MB
1.13 MB PNG
>>101794200
what do you hate most about comfy rn?

>>101794221
some stuff is harder, some stuff is easier. its a very different UI which some people jive with and some dont
>>
>>101794221
>>101794282
for me the problem is just navigating the spaghetti when you just want to switch out one node
even in cleanish workflows it's a pain to switch things out and then changing settings and making sure your new node is not afecting other stuff negatively
it's not a fire and forget kind of thing with comfyui

i like genning to relax, and today i've spend like 8 hrs fucking around with nodes instead of just genning cute realistic waifu with animals ;_;

i'm about to call it a day on this pain in the ass lel
>>
File: delux_tm_00010_.png (1.05 MB, 896x1152)
1.05 MB
1.05 MB PNG
>>101794318
I feel ya. hopefully you get a workflow tuned into what you're aiming for soon and don't get caught in perpetual spaghetti hell.
>>
File: BMP_FLUX_05153_.png (930 KB, 648x1200)
930 KB
930 KB PNG
I was trying to get her to take a sippy out the Kool-aid man's head
>>
File: 1177801.jpg (11 KB, 681x408)
11 KB
11 KB JPG
What are your thoughts about AMD's "Amuse" tool?
>>
File: delux_tm_00018_.png (744 KB, 896x1152)
744 KB
744 KB PNG
>>101794423
I think I saw someone once comment about it saying "AMD won" but then never saw anyone else ever say a single thing about it
>>
File: flux_comfyui_mt_00015_.png (1.09 MB, 1280x1024)
1.09 MB
1.09 MB PNG
>>
File: BMP_FLUX_04350_.png (1.59 MB, 1024x1024)
1.59 MB
1.59 MB PNG
>>
>>101794423
What is it even used for? "Amuse" kinda sounds like a comedian chatbot that helps you while telling you a joke or two.
>>
>>101793797
What model is this? It looks so good!
>>
File: 33.png (63 KB, 831x705)
63 KB
63 KB PNG
>>101794423
>>101794613
It has these additions, and it's somewhat good for users with AMD hardware.
>>
>>101794613
>AMD issues a C&D for ZLUDA and launch Amuse
They're laughing in your face AMD bros
>>
>>101794718
wh-what..?
>>
horrors beyond your comprehension
>>
File: delux_tm_00020_.png (919 KB, 1216x832)
919 KB
919 KB PNG
>>101794733
now this is art
>>
>>101794493
So cute is this also flux?
>>
File: delux_tm_00021_.png (863 KB, 1216x832)
863 KB
863 KB PNG
>>101794781
yes but the style is pretty unreliable. it pops into hd half the time
>>
>>
File: ComfyUI_10504_.png (2.72 MB, 1600x1200)
2.72 MB
2.72 MB PNG
>>101794661
flux
>>
File: ComfyUI_00161_.png (1.65 MB, 1024x1024)
1.65 MB
1.65 MB PNG
Is there no easy way to prompt edit in ComfyUI like we used to do in Auto1111?
I mean the [start:end:step] syntax
>>
>>101794580
have you tried forge? it seems like it already supports flux
https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/964
>>
File: delux_tm_00023_.png (739 KB, 1216x832)
739 KB
739 KB PNG
>>101794887
https://github.com/ltdrdata/ComfyUI-extension-tutorials/blob/Main/ComfyUI-Impact-Pack/tutorial/ImpactWildcard.md
>>
File: delux_tm_00025_.png (761 KB, 896x1152)
761 KB
761 KB PNG
>>101794887
>>101794932
oh wait you said prompt editing
https://github.com/asagi4/comfyui-prompt-control/
>>
>>101793992
Damn that's cool
>>
>>101793878
Catbox?
>>
File: lAdtPj8zwTq62e_IRlBmC.png (772 KB, 1024x768)
772 KB
772 KB PNG
Why is AMD so slow?
>>
>>101794927
i've been wary of updating forge since last time (last week) when i ended up having to reset the venv and download everything again
>>101794945
is it finally working? i gave up on it months ago when i was giving comfyui it's quarterly "let's try this spaghetti shit again" shot
>>
File: 00079-3959637511.jpg (579 KB, 1368x1640)
579 KB
579 KB JPG
nite
>>
File: delux_tm_00031_.png (844 KB, 896x1152)
844 KB
844 KB PNG
>>101795184
gn
>>
>>101795163
just clone another forge a new folder and create a new venv, test shit with that, I used to run both webui and forge
>>
alright g'nite all
>poor kitty
>>
File: delux_tm_00029_.png (883 KB, 896x1152)
883 KB
883 KB PNG
>>101795225
gn
>>
>>101795224
space on the drive is the issue, not the risk of the update itself
already flux swallowed up another 30 or so gb (model, vae, updating comfy, updating deps, etc)
i'll check out the forge update one of these days
>>
File: delux_mp_00008_.png (1.17 MB, 896x1152)
1.17 MB
1.17 MB PNG
flux can't seem to do silly ms paint drawings
>>
>>101795328
>flux can't seem to do silly ms paint drawings
use this method anon, it'll help
https://reddit.com/r/StableDiffusion/comments/1enm9og/discovered_by_accident_a_trick_to_make_flux/
>>
File: delux_mp_00002_.png (1.43 MB, 896x1152)
1.43 MB
1.43 MB PNG
>>101795396
I'm waiting until it doesn't double gen times. shits already too slow :(
>>
File: FLUX_00056_.png (819 KB, 1024x1024)
819 KB
819 KB PNG
>>101795328
>>
Debo the retard
>>
>>101795409
>I'm waiting until it doesn't double gen times. shits already too slow :(
maybe our savior will be AD (Adaptive Guidance) >>101794987
>>
>>101795420
he's already seen it in the other thread he just sucks at imggen
>>
File: delux_mp_00013_.png (490 KB, 1216x832)
490 KB
490 KB PNG
>>101795414
not ms paint but closer than what I've been getting

>>101795420
reminds me of 1.5 days when people were trying to figure out solutions to all kinds of stuff. hope that strategy (or any other) works out

>>101795427
>he's everywhere and everyone
schizo
>>
coping retard kek
>>
Good to see the containment thread is keeping him here.
>>
File: delux_mp_00018_.png (954 KB, 1216x832)
954 KB
954 KB PNG
>>
File: ComfyUI_10513_.png (2.91 MB, 1600x1200)
2.91 MB
2.91 MB PNG
>>
File: delux_mp_00019_.png (1.02 MB, 1216x832)
1.02 MB
1.02 MB PNG
I'm in the totally random style dimension
>>
How do I do xy prompt substitution etc in Comfy?
>>
File: delux_mp_00023_.png (770 KB, 1216x832)
770 KB
770 KB PNG
vibe
>>
File: ComfyUI_10522_.png (2.19 MB, 1440x1120)
2.19 MB
2.19 MB PNG
>>
File: ComfyUI_10526_.png (2.31 MB, 1440x1120)
2.31 MB
2.31 MB PNG
>>
https://github.com/lllyasviel/stable-diffusion-webui-forge/blob/main/backend/nn/flux.py#L2

Lmao, illya the pedo is seething so much at A1111 he put this so they can't use the flux code from forge. What a huge faggot.
>>
File: delux_mp_00022_.png (1.29 MB, 1216x832)
1.29 MB
1.29 MB PNG
>>101795930
I love code-based drama
>>
>>101795930
good for him, A1111 is trash
>>
You can't get enough of your drama, can you
>>
>>101795930
A1111 is free so they can use it.
Stupid drama craving sperg
>>
>>
>>101795930
isn't this the comfy / forge wars arc? comfy is doing something similar in his new commits
>>
File: yuki1.png (2.37 MB, 2048x2048)
2.37 MB
2.37 MB PNG
>>
>>101795930
Is it even safe to run FLUX on any other UI that isn't comfy?
>>
File: yuki2.png (3.4 MB, 2048x2048)
3.4 MB
3.4 MB PNG
>>
File: 037.png (999 KB, 968x1088)
999 KB
999 KB PNG
>>
>>101795970
A1111 is far better for experimentation and throughput, Comfy is when you've only optimized conditions and want to refine
>>
File: yuki3.png (2.95 MB, 1792x2304)
2.95 MB
2.95 MB PNG
>>
>>101796050
>Forge* is far better for experimentation and throughput,
fixed 4 u :)
>>
>>101796074
What even is Forge? First time hearing of it
>>
>>101795930
So what does this mean exactly? Will it take longer for Forge to be compatible with FLUX? Am i understanding this right?
>>
>>101795930
>restricting use of GPL code
retard
>>
File: ComfyUI_10556_.png (2.27 MB, 1440x1120)
2.27 MB
2.27 MB PNG
>>101795930
>https://github.com/lllyasviel/stable-diffusion-webui-forge/blob/main/backend/nn/flux.py#L2

thats actually aimed at comfy, it will be the same when SD3.1 arrives
>>
I'm still using Forge version f0.0.17v1.8.0rc-latest-276-g29be1da7
It just werks really fast and doesn't have an autismal UI.
Uninterested in Flux until anime girl LORAs are posted for it regularly on Civitai.
>>
>>101796128
This but I'm more interested on celebs, Dafne Keen LoRA is a priority!
>>
>>
File: nagatorovomit.jpg (1.16 MB, 1859x2048)
1.16 MB
1.16 MB JPG
>>101796194
>3DPD
You do you, anon.
>>
>>101792986
Do you get the same results if you put your neg guidance back, and instead turn up positive guidance? I'm guessing that negative is a hack. I don't think it was conditioned on negative prompts?
>>
>>101796226
No, turning up the positive guidance will never have that effect, we've tried it before. And yeah, negative guidance has this weird effect of removing the model's bias, I found it completely by accident and I'm glad I did lol
>>
>>101796103
you can already test flux on forge, anon
>>
>>101796226
>I'm guessing that negative is a hack. I don't think it was conditioned on negative prompts?
it's basically an empty negative prompt + Negative Guidance at 10, you can see what I mean by looking at this workflow for example:
https://files.catbox.moe/xlxd00.png
>>
>>
>>101796248
Thx, I'll look into it.
>>
File: holo1.png (2.25 MB, 896x1152)
2.25 MB
2.25 MB PNG
>>101796288
I'll check it out when I gen next for sure
>>
File: FLX_00057_.png (1.38 MB, 968x1088)
1.38 MB
1.38 MB PNG
>>
>>101796353
Challenge: get it to show all six strings and not fuck them up.
>>
File: FLX_00060_.png (1.29 MB, 968x1088)
1.29 MB
1.29 MB PNG
>>101796371
I tried
>>
>>101796461
I guess not showing the thinner strings is better than previous models have done, which is completely fuck them up.
>>
File: FLX_00061_.png (1.51 MB, 968x1088)
1.51 MB
1.51 MB PNG
>human body
>>
>Debo cried himself to sleep
>pw dilated himself before crying
>>
File: Capture.jpg (955 KB, 3473x1309)
955 KB
955 KB JPG
>>101792820
>>101793992
https://files.catbox.moe/3bdsif.jpg
Ok I did a XY plot between GuidanceNegative (X) and CFG (Y) and... you're not gonna believe this, but the default parameter I've chosen before (CFG 6 + GuidanceNegative 10) seems to be the sweet spot... I think I never got this lucky in my life what the hell?
>>
>>
File: ComfyUI_temp_cfayp_00006_.png (1.76 MB, 1024x1024)
1.76 MB
1.76 MB PNG
>>
File: mc_xl_00535_.png (2.92 MB, 1296x2268)
2.92 MB
2.92 MB PNG
>>
>>101796774
Does this work for anything other than Miku?
>>
File: ComfyUI_00162_.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>
>>101796920
I have these
>>101795978
>>101794191
>>101795045
>>
File: file.jpg (203 KB, 1024x1792)
203 KB
203 KB JPG
>>101795930
based desu
>>
Can you keep your thread schizo debo in your containment thread? thanks.
>>
File: 00056-848456308.png (3.76 MB, 1536x1080)
3.76 MB
3.76 MB PNG
>>
File: Capture.jpg (396 KB, 2911x1442)
396 KB
396 KB JPG
https://civitai.com/models/625042?modelVersionId=706397
Ok I finally get what Adaptive Efficiency means, it's basically the same thing as CFG, but when we're reaching the end of the inference, when the pictures doesn't change much anymore, it reverts back to CFG = 1 -> 2x speed increase at that moment. It's cool but... DynamicThresholding doesn't work well with cfg = 1 and the result is this whiteish picture, I wish I could find a way to deactivate DynamicThresholding when the cfg = 1, is there a node that can do conditions and shit?
>>
File: Capture.jpg (108 KB, 3196x608)
108 KB
108 KB JPG
>>101797348
I put the workflow for those interested, if we can fix this we'll get a nice speed improvement
https://files.catbox.moe/w5jc8e.png
You need to download the adaptive node to make it work
>>
https://civitai.com/models/633553?modelVersionId=708301

the user said it took $1 total to train on an a100 (you can rent these online), so LORA training seems feasible, you only need one character lora for example and you can gen it forever.

their post:

Sure I used

https://github.com/XLabs-AI/x-flux

for training on a RunPod A100 SXM instance (80GB VRAM, but only 42 utilised with default settings).

If you don’t take into account that I wasted time setting up and used way too many steps (10,000 but 2,500 was enough) for the small number (700) of images I had it cost less than $1USD to train.

Note I had to convert the output to safetensors using huggingface https://huggingface.co/spaces/safetensors/convert then used this script https://huggingface.co/comfyanonymous/flux_RealismLora_converted_comfyui/blob/main/convert.py to make it compatible with comfy (and everything else).

https://github.com/jhc13/taggui for both natural language and tag style captions.

https://www.birme.net/ For cropping and resizing.

Going to try this with larger datasets since this LoRA wasn’t expensive. Takes about 2.2 hours to do 10k steps (can be improved) if you don’t save checkpoints too often (which adds like an hour with the default settings).
>>
File: 1698848611536325.png (1.01 MB, 832x1216)
1.01 MB
1.01 MB PNG
>>101797639
test output with "miku hatsune on a tv screen" and the lora in the provided workflow:
>>
File: 1699301760478201.png (1.37 MB, 832x1216)
1.37 MB
1.37 MB PNG
they said the instance prompt was anime art of a girl/woman, last image didnt have it: >>101797674
>>
File: file.jpg (530 KB, 1024x1792)
530 KB
530 KB JPG
new data who dis
https://huggingface.co/datasets/bigdata-pw/BIGstockimage2M
>>
I made a tutorial to improve the inference speed by 25% if you are using CFG > 1 + DynamicThresholding
https://reddit.com/r/StableDiffusion/comments/1enxcek/improve_the_inference_speed_by_25_at_cfg_1_for/
>>
>pedo and schizo den
>>
ldg is laughing at us again
>>
File: file.jpg (153 KB, 1024x1024)
153 KB
153 KB JPG
pitchfork reviews dropping later today
>>
File: ComfyUI_00985_.png (1.73 MB, 1280x960)
1.73 MB
1.73 MB PNG
>>
>>101798785
I liked your data.
>>
>>101798910
thanks! much appreciated
>>
File: end_gen.jpg (291 KB, 1448x1280)
291 KB
291 KB JPG
>>
>>101798676
this kinda works with dynthresh but it's slowwwwww
but at least it's not making a mess anymore. still looks burned for realism tho
>>
File: file.jpg (286 KB, 1024x1792)
286 KB
286 KB JPG
CLITFlickr dropping soon:tm:
Contrastive Language Image Training (data from) Flickr
unknown size atm because scrape still in progress, target is 50M+ though
open to suggestions for the vllm to use for captioning, preferably something fast so it won't take more than a few days on 4xA100 80GB, testing florence-2 atm
>>
>>101799643
Who are your favorite posters?
>>
>>101799801
Me, myself and I.
>>
>>101799801
every poster is my favourite desu
>>
>>
quick retard question
If I want to render something in 1920x1920 only shit comes out and it takes forever.
How do I fix this?
Go for a smaller size and use the Hires.fix?

Or is there another way to get the AI to render it properly
>>
File: genshins.jpg (3.12 MB, 5789x1913)
3.12 MB
3.12 MB JPG
multi character grid
>>
>>101799902
what model and what interface?

1024x1024 is guaranteed to always be good, higher res require you do other things yes, like hires fix.
>>
>>101799956
webui
Im a retard so I dont know that there are other interfaces
different models
I tried people with absolutereality and some wallpapers, but the wallpapers (I love forests) dont work. Always the same trees.
>>
>>101799801
The one that posts the blonde fox. Don't care for the poster, I like the fox, she is cute, and I want to fuck her
>>
>>101799999
checked, quints don't lie
>>
>>101799994
absolutereality says it's a 1.5 model on civitai. I don't even bother with 1.5 models, they're too finicky.
Make things easy on yourself and be sure the "Base Model" section says it's SDXL or Pony.
>>
>>101800063
>>101799956
Woah, what an utter nonsense advice.
>>
>>101800063
thanks man, Im an absolute novice with these things.

Open to any feedback, thx anons
>>
Ded general
>>
File: ComfyUI_01009_.png (1.27 MB, 1280x960)
1.27 MB
1.27 MB PNG
>>
>>101800364
she isn't here
>>
>>101800383
Hes always here
>>
>>101800445
She isn't here tho
>>
>>101800541
Ok debo
>>
>>101800364
It's actually working, he stays only here except for his mfw bs on /ldg/
>>
File: lynx by flux.png (426 KB, 512x1024)
426 KB
426 KB PNG
After 1.6 hours in FLUX, I present to you.... my masterpiece.

Seriously though WTF am I doing wrong? All the other models gave me much better results...

prompt: high quality digital painting portrait of a female rogue, she has a ponytail, brown hair, a rudimentary muffler over her shoulders, subtle gold ornaments on her hair and sharp beautiful yellow eyes
>>
File: ComfyUI_02763_.png (1.29 MB, 1152x896)
1.29 MB
1.29 MB PNG
it's not the prompt, some of your other settings must be fucked up. Either post a catbox or screenshot of your workflow
>>
>>101800716
Who are you talking to
>>
>>101800696
flux will randomly blur for no apparent reason. just hit gen again
>>
>>101800696
>/sdg/ - Stable Diffusion General
?
>>
>>101800790
here have a sd image
>>
It's dead, Jim.
>>
File: 1716303048065343.png (193 KB, 455x400)
193 KB
193 KB PNG
>install forge
>computer performance actively affected by just downloading a fucking model with the civit extension
>has yet to start downloading as i'm typing this
>never had this problem with regular sdui
>>
File: file.jpg (216 KB, 1024x1792)
216 KB
216 KB JPG
pitchfork dropped
https://huggingface.co/datasets/bigdata-pw/Pitchfork
>>
>>101801152
use comfyui
>>
Will you divulge your other pseudonym if you ever get an interview or will you lie so they don't find out? I doubt you'll ever get to that point anyway but I'm curious.
>>
>>101801369
huh?
>>
File: lynx by rpg_v5.png (1.11 MB, 656x1152)
1.11 MB
1.11 MB PNG
>>101800834
Good stuff
>>
>>101801415
I'm asking hlky who now goes by big data presumably because no one wants to hire him.
>>
>>101801369
gonna assume that's aimed at me, it's on all the datasets anyway
>>101801430
bigdata is just the name of a group that i'm the only member of. i haven't actually applied to that many positions and i have been hired before, i don't think anyone really cares, i'm probably just being filtered by ATS because i don't have a degree and i'm not in the US
>>
>>101801468
Ah. Decent cope desu.
>>
>>101801468
Can I join your group? You can call me the Formatter.
>>
I swear to god captioning is literally the worst part of this hobby
>>
>>101801488
a job related to data or funding would be nice, but i'll keep doing it anyway, data has always been a big interest of mine
>>101801513
maybe, do you have experience in beverage preparation, specifically coffee
>>
File: hot(23).jpg (89 KB, 984x984)
89 KB
89 KB JPG
Is FLUX on A1111 or forge easier to use? I'm having a hard time with those "split sigma" things that i still don't understand completely.
>>
File: delux_mp_00026_.png (746 KB, 1216x832)
746 KB
746 KB PNG
>>101801369
I could never bridge that gap because of how thoroughly brain-broken my haters are
>>
>>101801343
does it generate faster than base sdui? my specs are fine for generating but that shit takes forever
>>
File: ComfyUI_00165_.png (1.49 MB, 1024x1024)
1.49 MB
1.49 MB PNG
>>101798676
thanks anon, I thought you were a schizo but still decided to give the workflow a try
I've seen a non-zero improvement on the speeds with pretty satisfying results
I kneel
>>101801622
are you autistic
>>
>>101801662
Distracted.
>>
File: delux_mp_00005_.png (1.64 MB, 896x1152)
1.64 MB
1.64 MB PNG
>>101801622
its not empty. __init__.py has the node code. and then it links out to the project page for the paper. as far as specific usage w/ flux, you'll have to experiment or wait for anons to figure out what does or doesn't work. I think rn its just a theory that AG will work
>>
>>101801622
also disregard anyone with filename delux_
obvious plant
>>
File: hot(12).jpg (75 KB, 976x1158)
75 KB
75 KB JPG
>>
>>101801343
what the fuck is this ui, this is way more confusing
>>
>>101801101
>Jim
He's too prideful to admit he lost and let it go. Plus, he has nothing else going on IRL besides getting yelled at by mommy.
>>
File: adaptive_guidance_test.jpg (265 KB, 1535x665)
265 KB
265 KB JPG
>>101801714
Got it working, I tend to spazz out when distracted. It's an obvious brain damage issue.
The node seems to work at least. It's clearly faster than normal SDXL for me.
I don't have the luxury of testing out Flux because I'd need to update my machine.
>>
File: ComfyUI_00167_.png (1.51 MB, 1024x1024)
1.51 MB
1.51 MB PNG
>>101798676
on a second analysis, I've seen a 40% gen time improvement on flux dev
fantastic stuff
>>
>d*b* containment general
>>
>this thread is still up
>>
File: delux_mp_00006_.png (1.29 MB, 896x1152)
1.29 MB
1.29 MB PNG
>>101801837
ikr? jim is almost as bad as dave

>>101801859
>I don't have the luxury of testing out Flux because I'd need to update my machine.
I think that's what's created a fresh interest in AG. there's been a push to find ways to improve guidance and control for flux because it doesn't work with cfg. AG was recently raised as a potential route though I dunno if there's been anything conclusive
>>
>>101801751
He uses other filenames too because so many anons filter him. Luckily you can always spot his gens because they are trash.
>>
Is everything alright in here?
>>
File: hot(1).jpg (114 KB, 976x1158)
114 KB
114 KB JPG
>>
File: ComfyUI_01018_.png (1.4 MB, 1280x920)
1.4 MB
1.4 MB PNG
>>
File: trumpvance2025_~2.jpg (84 KB, 1000x1000)
84 KB
84 KB JPG
>>
>>101802147
who is this handsome beast?
>>
>>101801548
Didn't ask don't care
>>
Yo
>>
>>101802255
Hey yo! How have you been? Haven't seen you in a while.
>>
>>101802147
>this is whos replying to you
>>
>>101802271
Kek
>>
>>101802147
looks like a schizo
>>
>>
Why is this subreddit so schizophrenic?
>>
>Good morning/afternoon Anon! I hope you are all doing well :]
>>
>>101802355
>subreddit
>>
anon I'm confused.
what's the difference between/sdg/ and /ldg/?
which is more professional and can give me the most technical help?
>>
>>101802385
/ldg/
>>
>>101802385
>which is more professional
neither, /ldg/ is full of complete amateurs that just discovered Dynamic Thresholding and /sdg/ is a cesspit of avatarfags
>>
>>101802355
>nogen
Why are you such a nobody? LOL! :]
>>
>>101802385
/lmg/
>>
>>101802400
who are the good guys?
>>
File: delux_me_00029_.jpg (1.27 MB, 960x1344)
1.27 MB
1.27 MB JPG
>>101802355
picrel

>>101802385
there's no difference beyond arbitrary lines drawn between who's willing to post in which general. you can have the same conversations in either general, talk about the same tech, get the same help
>>
>>101802417
none, they exist as two sides of the same faggy coin
>>
cope
>>
>>101802417
ldg posters come here every day to shit the place up but I guess it is subjective whether you considering that bad behavior
>>
File: 00016-2905859876.jpg (520 KB, 1248x1864)
520 KB
520 KB JPG
mornin, 2 minutes before noon
>>
Niggerlas and debo like two peas in a pod
>>
>>101802417
the good guys don't post anymore, both generals are worthless dogshit.
>>
>>101802478
Good morning, i hope you're doing well friend
>>
>>101799481
>still looks burned for realism tho
I think for realism you don't need to put GuidanceNeg at 10, you can also decrease the AdaptiveGuidance threshold to get something less burned

>>101801662
>thanks anon, I thought you were a schizo but still decided to give the workflow a try
kek

>>101801945
>on a second analysis, I've seen a 40% gen time improvement on flux dev
that much?? what threshold value did you use anon?
>>
>>101802489
is your brain/mind the pod
>>
>>101802440
>get the same help
When's the last time you gave helpful advice that's deeper than surface level
>>
Is it possible that no one here knows what "schizophrenia" means?
>>
>>101802503
where can I post pics anonymously with other chill anons?
>>
>>101802440
False, anons wanted to get away from blatant rulebreaking, avatarfags, schizos, drama and especially you thread schizo
>>
File: 1723222939311_image.jpg (149 KB, 1216x974)
149 KB
149 KB JPG
Morning anons
Ears still too big.
>>
>>101802529
upto you really, just lurk on both generals for a while and pick your poison.
>>
File: delux_mp_00010_.png (975 KB, 896x1152)
975 KB
975 KB PNG
>>101802478
gm. happy friday. whatcha working on today?

>>101802518
post a gen

>>101802529
sdg is usually pretty chill when ldg shitposters aren't leaking in

>>101802542
>anons wanted to get away
then why do you come here every single day?

>>101802553
gm. looks kind of like a primitive ancestral quokka. give it some sabertooth fangs, lol
>>
File: ComfyUI_01022_.png (1.43 MB, 1280x960)
1.43 MB
1.43 MB PNG
>>
File: delux_mp_00011_.png (1.17 MB, 1216x832)
1.17 MB
1.17 MB PNG
>>101802595
requesting this lady with a sidecut/undercut
>>
>>101802542
false, pixart fags got salty over being made fun of and wanted a safe space.
>>
File: ComfyUI_00009_.png (516 KB, 656x1152)
516 KB
516 KB PNG
You lied to me /sdg/, flux results are always shit (at least to me)
>>
So the new cope is "they are all from /ldg/ shitting up the thread for no reason!" instead of "its a singular schizo anon who shits up the thread since 1.5 years everday!" ?
>>
File: ComfyUI_02644_.jpg (1.14 MB, 2048x2048)
1.14 MB
1.14 MB JPG
>>101802641
unironic skill issue, you're just retarded.
>>
>>101802644
simple answer is it is a group of trolls
>>
>>101802664
Nta but yeah they coordinate in the discord
>>
>>101802691
The one in this threads OP?
>>
>101802583
>post a gen
So the answer is "never", okay.
>>
>101802583
/sdg/ is chill because most people left for /ldg/ to get away from you
>>
>>101801714
>you'll have to experiment or wait for anons to figure out what does or doesn't work. I think rn its just a theory that AG will work
I think it's already working with this method -> >>101798676
>>
>>101802615
I tried both sidecut and undercut, neither of them is working
>>
File: delux_mp_00016_.png (1 MB, 1216x832)
1 MB
1 MB PNG
>>101802924
bummer. a persistent achilles heel of AI
>>
File: 1__~4.jpg (85 KB, 1000x1000)
85 KB
85 KB JPG
>>
>>101802996
do you still own the glasses in >>101803013 ?
>>
>>101803056
His glasses are weird pixel ones not the ones in that pic. That's clearly not even him
>>
File: 1723225366380_image.jpg (206 KB, 1188x974)
206 KB
206 KB JPG
>>101802641
Kek
>>
>>101803056
nah he wears minecraft glasses
>>
File: 1723225456359_image.jpg (217 KB, 1216x974)
217 KB
217 KB JPG
Quokka wild!
>>
>>101803178
>>101803178
>>101803178
>>
>>101803172
Damn, this nigga wild!
>>
can flux do proper nail polish?
>>
File: 1723225703297_image.jpg (196 KB, 1216x974)
196 KB
196 KB JPG
Filling time!
>>
File: delux_me_00031_.jpg (658 KB, 960x1344)
658 KB
658 KB JPG
>>101803214
idk whart counts as 'proper'
>>
File: ComfyUI_00974_.png (1.68 MB, 1280x896)
1.68 MB
1.68 MB PNG
>>
File: 1723225996259_image.jpg (138 KB, 984x984)
138 KB
138 KB JPG
>>
File: delux_mp_00017_.png (779 KB, 1216x832)
779 KB
779 KB PNG
>>
File: 1723226241270_image.jpg (131 KB, 1216x974)
131 KB
131 KB JPG
Filled!
>>
>>101802654
I get great results with all other models so flux is the retarded one
>>
>>101803389
>I can't use this thing, so ITS retarded, not me!
cope
>>
>>101803389
Ok vramlet
>>
>even comfy left /sdg/ for /ldg/
the absolute state of affairs



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.