[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


No Place like Home Edition

Previously on /sdg/: >>102224091

>SD3 info & download
https://rentry.org/sdg-link#sd3
https://education.civitai.com/quickstart-guide-to-stable-diffusion-3
https://aitracker.art/viewtopic.php?t=57

>Beginner UI local install
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Auto1111 forks
SD.Next: https://github.com/vladmandic/automatic
Anapnoe UX: https://github.com/anapnoe/stable-diffusion-webui-ux

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>Try online without registration
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://openmodeldb.info

>Black Forest Labs: Flux
huggingface.co/black-forest-labs/FLUX.1-schnell
comfyanonymous.github.io/ComfyUI_examples/flux

>Animation
https://rentry.org/AnimAnon
https://rentry.org/AnimAnon-AnimDiff
https://rentry.org/AnimAnon-Deforum

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: fSDG_News_000071_.jpg (395 KB, 896x512)
395 KB
395 KB JPG
>mfw Resource news

09/04/2024

>Stability AI’s Top 3 T2I Models Now Available in Amazon Bedrock
https://stability.ai/news/stability-ais-top-3-text-to-image-models-now-available-in-amazon-bedrock

>LinFusion: 1 GPU, 1 Minute, 16K Image
https://github.com/Huage001/LinFusion

V>iewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis
https://drexubery.github.io/ViewCrafter

>FluxMusic: Text-to-Music Generation with Rectified Flow Transformer
https://github.com/feizc/FluxMusic

>OpenAI co-founder's safety-focused AI startup raises $1 billion
https://www.reuters.com/technology/artificial-intelligence/openai-co-founder-sutskevers-new-safety-focused-ai-startup-ssi-raises-1-billion-2024-09-04

>OmniMotionGPT: Animal Motion Generation with Limited Data
https://zshyang.github.io/amg-website

>Study claims 57% of content on the internet is now AI-generated
https://www.windowscentral.com/software-apps/sam-altman-indicated-its-impossible-to-create-chatgpt-without-copyrighted-material

>DOJ subpoenas Nvidia in deepening AI antitrust probe
https://arstechnica.com/tech-policy/2024/09/doj-subpoenas-nvidia-in-deepening-ai-antitrust-probe-report-says/

>Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing
https://github.com/FusionBrainLab/Guide-and-Rescale

>SOOD-ImageNet: Large-Scale Dataset for Semantic Out-Of-Distribution Image Classification and Semantic Segmentation
https://github.com/bach05/SOODImageNet

>Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
https://follow-your-canvas.github.io

>How Senator Weiner wants to regulate AI
https://www.youtube.com/watch?v=tZCAvME-a98

09/03/2024

>ViT-L/14 / CLIP-L Text Encoder finetune for Flux.1
https://huggingface.co/zer0int/CLIP-GmP-ViT-L-14/tree/main

>ComfyUI v0.2.0 Release
https://blog.comfy.org/comfyui-v0-2-0-release/

>How much is AI hurting the planet? Big tech won't tell us
https://mashable.com/article/ai-environment-energy
>>
>mfw Research news

09/04/2024

>Generating Consistent Long Depth Sequences for Open-world Videos
https://depthcrafter.github.io

>Towards Generative Class Prompt Learning for Few-shot Visual Recognition
https://arxiv.org/abs/2409.01835

>Dreaming is All You Need
https://arxiv.org/abs/2409.01633

>Decompose the model: Mechanistic interpretability in image models with Generalized Integrated Gradients (GIG)
https://arxiv.org/abs/2409.01610

>DiVE: DiT-based Video Generation with Enhanced Control
https://arxiv.org/abs/2409.01595

>Dynamic Motion Synthesis: Masked Audio-Text Conditioned Spatio-Temporal Transformers
https://arxiv.org/abs/2409.01591

>Purification-Agnostic Proxy Learning for Agentic Copyright Watermarking against Adversarial Evidence Forgery
https://arxiv.org/abs/2409.01541

>Lagrangian Motion Fields for Long-term Motion Generation
https://plyfager.github.io/LaMoG

>DiffCSG: Differentiable CSG via Rasterization
https://yyyyyhc.github.io/DiffCSG

>PatternPaint: Generating Layout Patterns Using Generative AI and Inpainting Techniques
https://arxiv.org/abs/2409.01348

>Target-Driven Distillation: Consistency Distillation with Target Timestep Selection and Decoupled Guidance
https://arxiv.org/abs/2409.01347

>SPDiffusion: Semantic Protection Diffusion for Multi-concept Text-to-image Generation
https://arxiv.org/abs/2409.01327

>Disentangling Mean Embeddings for Better Diagnostics of Image Generators
https://arxiv.org/abs/2409.01314

>OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model
https://arxiv.org/abs/2409.01199

>TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval
https://arxiv.org/abs/2409.01156

>Balancing Performance and Efficiency: A Multimodal Large Language Model Pruning Method based Image Text Interaction
https://arxiv.org/abs/2409.01162

>DPDEdit: Detail-Preserved Diffusion Models for Multimodal Fashion Image Editing
https://arxiv.org/abs/2409.01086
>>
File: 00002-3178103715.png (995 KB, 896x1088)
995 KB
995 KB PNG
It kind of gave her a unibrow
>>
File: 00014-1717353855.png (1.61 MB, 1160x992)
1.61 MB
1.61 MB PNG
>>
>>102239170
nice skirt
>>
File: 00000-86507756.jpg (418 KB, 996x1209)
418 KB
418 KB JPG
>>
test
>>
File: 00003-3297536947.png (507 KB, 744x888)
507 KB
507 KB PNG
>>
File: 00004-1425661398.png (932 KB, 896x1088)
932 KB
932 KB PNG
>>
File: delux_dc_00022_.png (2.36 MB, 1152x1344)
2.36 MB
2.36 MB PNG
ty for helping me get the news up btw
>>
>>102239276
What do you call this style
>>
>>102239773
Pseudo-surrealism
>>
>>102240018
>Study claims 57% of content on the internet is now AI-generated
How is this clickbait or doomer? This is the future and you haven't seen anything yet.
>>
>>102239826
You dont know what either of those words mean do you
>>
>https://github.com/huggingface/diffusers/blob/main/examples/dreambooth/train_dreambooth_sd3.py
Finally, a real shame SDXL is stuck with the cuck version
>>
File: small pond.webm (2.18 MB, 1920x960)
2.18 MB
2.18 MB WEBM
>>102239460
Cool pixelart skeleton
>>
>>102240677
A buddy of mine is having issues where his gens are greyed out or bricked if he uses anything other than Euler A. I'm not too familiar with comfy. Can you guys spot anything fucked up with his workflow?
>>
>>102240703
For t2i, you want denoise at 1.0 like the other poster said. Try that and see. How good do the Euler A images look with 0.4 denoise?
>>
>>102240703
Why are you spamming the same post, Chang?
>>
>>102240760
I don't have any image on hand and he's asleep right now. As I remember them, they weren't artifacted or odd looking. I'll check back in tomorrow if the denoise doesn't fix it.
>>
with nooz you looz
>>
Hey /g/uise, I was looking for a good guide on making multiple views of characters.
I want to have multiple views of a character I drew.
>>
>>102238589
hf also allow direct image uploads, however there are limitations on the number of files and files per folder that make this impractical for large datasets, see
https://huggingface.co/docs/hub/repositories-recommendations
you can ignore the apparent 300gb limit, it's not enforced. zip files are also supported. tar files in this case are webdataset, this is a common format for ai training and supported directly by hf's datasets libraries
note that odc-by is not a non-commercial license, i state that the activities are conducted for non-commercial research purposes because the copyright law in my country specifically allows exceptions for non-commercial research with regards to data mining etc, similar laws probably apply in other countries, and i don't need to use a non-commercial only license for that to apply, it is simply not my problem if others decide to use the datasets for commercial purposes. the license does technically require attribution for reusage of the dataset and produced works however i do not actually care and have no way of enforcing it and/or proving anyone used the dataset
it is also important to note there is a distinction between the license of a dataset and the licenses of its content, as another example see laion, the datasets are licensed cc-by-4.0
>>102238623
the format doesn't matter, in your case the cc0 license applies to the dataset as a unique arrangement and not the contents, it's fine, don't worry
>>
>>102241233
You could try IPadapter to transfer the style from your original character, and use a controlnet to get different poses or views. I don't know of a specific guide but there are many out there for those tools.
>>
>>102241309
Will try, thanks!
>>
File: file.png (103 KB, 1960x786)
103 KB
103 KB PNG
Have you seen that? Seems like SAI made a 16b model they intend of keeping for themselves, again
https://xcancel.com/StabilityAI/status/1831381755057861056#m
https://aws.amazon.com/blogs/aws/stability-ais-best-image-generating-models-now-in-amazon-bedrock/
>>
File: jungle.webm (2.46 MB, 1920x960)
2.46 MB
2.46 MB WEBM
>>102241254
I tried animating one of the background of your matrix gens. I'll post it if you want to see it.
>>102241414
Not surprising they aren't releasing anything good for us, but 16 billion parameters probably means the full version couldn't be run on any consumer cards.
>>
>>102241602
>16 billion parameters probably means the full version couldn't be run on any consumer cards.
it can if we go for Q8_0 instead of fp16, and the quality is really close so it would be worth it
>>
File: jungle river.webm (2.28 MB, 1920x960)
2.28 MB
2.28 MB WEBM
>>102241606
That's true. It would be fun to see what such a large model can do. I wonder how safe and boring they made it.
>>
>>102241602
sure
>>
File: matrix.webm (3.56 MB, 1024x1024)
3.56 MB
3.56 MB WEBM
>>102242238
It kind of got the idea. Hope I didn't dissapoint.
Your gens were fun.
>>
>>102242332
nta but cool
>>
Fluxpro.art brought in a credit system which makes it unusable. Is huggingface the only other in-browser alternative for flux?
>>
File: tree house.webm (2.08 MB, 1920x960)
2.08 MB
2.08 MB WEBM
>>102242552
Thanks anon
>>
>>102242332
nice
>>102242543
>data is running out
data isn't running out, problem is that there's relatively few organizations actively collecting data, the rest are processing the same sources like common crawl, laion is just processed common crawl, HuggingFaceFW/fineweb is just processed common crawl, the largest component of The Pile is just processed common crawl. not saying common crawl is bad but they only cover a relatively small portion of all available websites and it's not target specific so much of the data on each individual site will be missed, for example, they're not crawling every product from a shop or every forum post
>>
File: Fox.jpg (207 KB, 1536x1536)
207 KB
207 KB JPG
>>
File: 00013-3311483528.png (591 KB, 616x808)
591 KB
591 KB PNG
>>102239773
idk, cute semi-realism. didnt prompt a style, it is from loras.
>>
>n*gbo and koff are the only avatars
>everyone else left
the absolute state of affairs
>>
>>102243207
Can you fuck off for a single day schizoanon?
>>
>melting
>>
>>102239069
When do you think we'll finally start seeing some decent optimizations? Instead of hurrrr just buy bigger card. Bro just download another 20gb file then make sure you have this $2000 card, minimum. Would like to see:

>much smaller model sizes
>faster render times
>>
File: Kagerou.jpg (231 KB, 1536x1536)
231 KB
231 KB JPG
>>102243440
I think everyone would like to see that.
But most optimizations that are found are tiny and usually require re-training from the ground up.
Quantization is the obvious one that at least brings down model sizes and thus VRAM use, but it also reduces quality.

I wonder if anything will ever come from those ternary models. They made this huge announcement about how efficient they are while being almost as accurate as non-quantized models and then... silence?
>>
Is it me or is the /r/stablediffusion subreddit down atm?
>>
File: 00147-2751409884.png (2.76 MB, 1344x1728)
2.76 MB
2.76 MB PNG
>>
File: 00154-3355822059.png (3.23 MB, 1344x1728)
3.23 MB
3.23 MB PNG
>>
>>102243650
Yes all r*dditors are here now in the schizo thread
>>
File: 00239-2740145625.png (1.1 MB, 1344x1728)
1.1 MB
1.1 MB PNG
>>
File: 00248-2435650797.png (3.5 MB, 1344x1728)
3.5 MB
3.5 MB PNG
>>
>>102243650
disregard that, must have been a cdn issue
>>
File: 00000-4140823725.png (977 KB, 920x1208)
977 KB
977 KB PNG
>>
File: 00386-1466562172.png (2.8 MB, 1728x1344)
2.8 MB
2.8 MB PNG
>>
File: 00259-1659284891.png (3.31 MB, 1344x1728)
3.31 MB
3.31 MB PNG
>>
File: 00395-2558152076.png (2.32 MB, 1344x1728)
2.32 MB
2.32 MB PNG
>>
>>102243719
Literally me since 519 days
>>
I only have 8GB VRAM.
Anyone tried the quantized Flux models? Are they worth using over SDXL, both in terms of image quality and more importantly, composition and prompt adherence?
Anyone made comparisons?
>>
>schizo containment thread
>>
File: 00001-2931440690.jpg (441 KB, 1400x1112)
441 KB
441 KB JPG
>>102243987
Use the nf4 model. Works great up to around 1500*1000 on an 8GB card.
>>
File: 00019-3491179388.png (757 KB, 616x888)
757 KB
757 KB PNG
>>102243987
i am using q4_0.gguf. i have an 8gb vram gpu. i was using the nf4 but q4_0 works better, not crashing my pc.
no idea how it compares to sdxl, skipped from 1.5 to flux.
>>
>>102244081
Is there any story behind your avatar? Seems oddly specific, genuinly asking
>>
File: 00014-1324824488.png (649 KB, 616x808)
649 KB
649 KB PNG
>>102244095
nope! thanks for genuinly asking.
>>
i am now 98% happy with my lora
it only took 5 versions this time
>>
File: Triple trouble.jpg (357 KB, 1536x1536)
357 KB
357 KB JPG
>>
>>102244081
Can you link the files needed for the gguf stuff? I kept trying to get it working but it needs more than the base model to work
>>
>>102244138
Are you saying this because you dont want schizo anon to know or is there really no story?
>>
File: Scarlet sisters.jpg (347 KB, 1536x1536)
347 KB
347 KB JPG
>>
>>102244081
what quantization do you use for the t5 encoder?
>>
File: 1702468865525454.png (2.71 MB, 946x1306)
2.71 MB
2.71 MB PNG
>>102239069
Why are artfags so over dramatic JESUS CHRIST?


x.com/adamtotscomix/status/1831391968649474214?t=2qC-HWH8nUw6sCFA0sWrqw&s=19

Someone explain this mental illness to me.
>>
>>102244293
Why are you reposting an old image?
>>
>>102244293
it's par for the course. Imagine if a tech came out that let almost anyone do your job better than you and you might be a bit upset too. Artists are highly strung and it has actually happened to them.
>>
File: 1694182292310385.png (153 KB, 294x220)
153 KB
153 KB PNG
>>102244326
>2 days old
>Old image
>>
File: hfghfh46t456546546.png (36 KB, 1697x169)
36 KB
36 KB PNG
>>102244236
https://huggingface.co/comfyanonymous/flux_text_encoders/tree/main
the clip_l and t5xxl_fp8_e4m3fn files go into the,
(i am using forge), text_encoder folder in the models folder.
https://huggingface.co/black-forest-labs/FLUX.1-dev/tree/main
the ae.safetensors goes into the vae folder.
pic relatable is my setup, it works for me.
>>
>>102244293
>Someone explain this mental illness to me.
First explain why you are on twitter
>>
>>
File: 1689680291536253.jpg (575 KB, 1078x1082)
575 KB
575 KB JPG
I havent touched anything AI in like 8 months
I still have the same shitty GPU as two years ago, so I used Auto1111 Forge since it ran slightly better
Any updates since then? Any new projects focused on optimization?
>read the op
no, I don't care about forks that make auto1111 look like modern web slop, only performance, i have shitposts to make
>>
>>102244372
Very grateful, can you link the quanted model you are using
>>
File: delux_ra_00057_.png (2.04 MB, 1536x1152)
2.04 MB
2.04 MB PNG
>>102244293
>too many people clicked a twitter button, my life is over

>>102244346
I once tried to prompt for this ultra closeup style but failed. flux couldn't help but draw the whole head
>>
File: 00022-3929885972.png (959 KB, 744x888)
959 KB
959 KB PNG
>>102244447
https://huggingface.co/lllyasviel/FLUX.1-dev-gguf/tree/main
as you see, there are others, no idea if others are better or not, for 8gb.
>>
>>
>>102244525
Thanks!
>>
>>102244493
I think you replied to the wrong person
>>
>>102240691
are there any local video models yet?
>>
>>102244525
>8gb flux model

How does the performance compare to the full version? I currently have the giant 20+ GB versions on my machine but if the performance is comparable or basically the same I'll switch to using this one instead.
>>
File: delux_ra_00059_.png (2.31 MB, 1536x1152)
2.31 MB
2.31 MB PNG
>>102244565
I think you replied to the wrong person
>>
File: file.png (57 KB, 1371x464)
57 KB
57 KB PNG
Did something update? I did the usual git fetch + pull from the Forge repo, and shit's broken now
>>
>>102244661
i run forget with
--disable-xformers
>>
>>102244717
*forge

i always type "forget" lel
>>
File: file.png (233 KB, 1840x1408)
233 KB
233 KB PNG
>>102244717
>>102244730
But it used to work just fine before
this was with --xformers --reinstall-xformers btw which I run just in case to be up to date

Do xformers just not work anymore?
...Also I should've posted a longer stack trace, I get this along with the xformers error (even with --disable-xformers)
>>
>>102244791
--cuda-malloc
and don't disable xformers.
>>
>>102244791
>>102244914
i dont know if xformers is working or not since i disabled it and everything runs for me well enough, but maybe try updating xformers on your venv if youre feeling brave (also update torch to 2.5 which is supposed to be slightly faster)

pip install --upgrade --pre xformers --prefer-binary --extra-index-url https://download.pytorch.org/whl/cu124 -f https://download.pytorch.org/whl/torch_nightly.html 


change "cu124" to whatever cuda version your venv is using, then to same command with torch torchaudio torchvision instead of xforerms with same command, taht worked for me
>>
>>
File: dumb_fairy.png (55 KB, 200x200)
55 KB
55 KB PNG
>>102244927
I ended up reinstalling (recloning the forge git) the whole thing since it was 2 years old and it seemed to be megafucked as im unable to fix the environment
I have cuda 121, should I look up to reinstall it (?) to upgrade its version since I'm setting it up all over again? I actually have no idea how it works but surely being on latest is ideal?

I'm also been reading a couple of posts itt, from the gist of it, flux seems way better in both generation and performance, so I guess I'm using that next. I assume it's able to do anime? all images seem to be realistic shit, i need my moeshit
>>
>>102245299
since you're reinstalling, might as well. be sure to remove your venv, then check the different requirements.txt files, set to cu124 any cuda packages, and torch to 2.5
ignore xformers needing 2.4 and see how it goes
>>
Morning anons
>>
>>102245378
gm
>>
Other version of the last image, bit more cursed
>>
File: 00031-2360199258.png (1.06 MB, 888x744)
1.06 MB
1.06 MB PNG
>>102245378
good, a few seconds before noon, morning
>>
me on the right
>>
File: file.png (35 KB, 690x711)
35 KB
35 KB PNG
>>102245340
I only see requirements_versions.txt which doesnt specify torch version and doesn't have any cuda stuff
Am I missing something?
>>
>>102245526
eh i was thinking of kohya_ss dozen requirements files lel
forge just has the one, so you're good
run the pip install manually first (after activating the new venv) for xformers command i posted before,
then rerun it for the latest torch torchvision torch audio with the same command just changing the package names
(xformers may show a message about wrong torch version, just ignore)
then just run webui and it should work (in theory lel)
>>
File: ReiTei.jpg (271 KB, 1536x1536)
271 KB
271 KB JPG
>>
>>
File: delux_ra_00061_.png (2.77 MB, 1536x1152)
2.77 MB
2.77 MB PNG
>>102245378
gm
>>
File: flux0310.jpg (3.03 MB, 2528x2000)
3.03 MB
3.03 MB JPG
>>
>>102245671
nice fingers
>>
>>102245681
are you using a lora? i've been seeing a bit of color "burn-in" in some flux1-dev gens (without loras) even when not specifying style
>>102245694
she's one of them nordic aliens
>>
File: 00003-2782281059.png (1.25 MB, 768x1104)
1.25 MB
1.25 MB PNG
with flux, have you found a better sampler/sched than euler simple
>>
File: delux_ra_00062_.png (1.53 MB, 1536x1152)
1.53 MB
1.53 MB PNG
>>102245716
nah, these are from a while ago where I was trying a wide range of guidances to basically limit test. I wanted these gens to be more of a 'candid iphone' aesthetic but it didn't really matter how much I cranked guidance cuz it would sooner start burning than get closer to what I wanted
>>
>>102245721
euler beta (tho i dont notice much of a diff)
>>102245766
i've kinda given up messing with cfg threshhold/guidance and all and just gone back to just using prompt experiments to try and get specific looks. flux seems to handle that best rather than keep dicking around with fractional changes in extensions/nodes
>>
>>
File: delux_ra_00073_.png (1.57 MB, 1536x1152)
1.57 MB
1.57 MB PNG
>>102245792
>i've kinda given up messing with cfg threshhold/guidance
same. I tried so many of the adaptive cfg workflows but none of them seem to give any better control. hopefully they'll give us a version of flux that fully supports cfg and negatives in the future
>>
File: file.png (33 KB, 1783x163)
33 KB
33 KB PNG
>>102245573
uhhuh
>>
>>102245934
well i told you the order lel
xformers
torch torchvision torchaudio
those are two separate ones
also it seems you didnt delete your venv and recreate it first
>>
>>102245979
gomen.... i'm retarded.....
>>
In one of the resources it still has google collab stuff
hasn't google banned using collab for stable diffusion?
>>
>>
File: 00038-956080621.png (729 KB, 616x888)
729 KB
729 KB PNG
>>
>>102246125
i like that one
>>
File: file.png (65 KB, 960x861)
65 KB
65 KB PNG
>>102245979
Okay seems to work, I do not have any model setup yet (I still have old SD SD1.5 and SDXL models iin my drive) so I guess I'll figure out flux now
>>
>>102244372
>need to login to download vae
>>
Is Lora training on pony with 8GB VRAM possible these days? That was the only thing holding me back from switching over when it came out, but maybe the technology has improved since then?
>>
>>102246414
not that anon but here is the same file under their schnell branch instead, no login required

https://huggingface.co/black-forest-labs/FLUX.1-schnell/blob/main/ae.safetensors
>>
>>102246313
good luck!
search for the flux1-dev gguf models and the t5-v1_1-xxl-encoder-Q8_0.gguf which does work on forge (i've been u sing it all day)
>>
>>
>>102239208
nice
>>
>>
experiments gone wrong
>>
>>
>>102244372
I'm using your settings there and it took me like 15 minutes to generate a single image, is that normal?
I also have 8GB VRAM, actually I believe that only 6GB dedicated, rest is shared...
>>
>>
>>
File: output.webm (218 KB, 380x380)
218 KB
218 KB WEBM
>>
>>
File: delux_ra_00074_.png (2.03 MB, 1536x1152)
2.03 MB
2.03 MB PNG
>>102247533
rare to see you on a weekday, I feel like
>>
>>
>>
File: output.webm (179 KB, 380x380)
179 KB
179 KB WEBM
>>102247595
i wish i had more time for playing around with things
stupid real life haha
>>
nooo waifu dont leave me
>i'm sorry i cheated on you with that slutty LLM the other day
>>
>>102247765
dont we all
>>
>>
File: output.webm (191 KB, 380x380)
191 KB
191 KB WEBM
>>
We should merge /ldg/ and /sdg/ together
>>
File: delux_ra_00075_.png (2.41 MB, 1536x1152)
2.41 MB
2.41 MB PNG
>>102248071
how do you propose achieving this?
>>
File: output.webm (188 KB, 380x380)
188 KB
188 KB WEBM
>>102248021
dont want to spam but that frame annoys me haha
>>
shit's about to break lel
>>
File: 00040-1259050629.png (713 KB, 616x888)
713 KB
713 KB PNG
mfw generate forever doesnt work? left and returned to just 1 pic
>>
>>102248181
you had a seed set didnt you
>>
>>102248195
nope
>>
>moved on to posting real kids
>mouth still stuffed with cottonballs
>>
>>102248207
check your logs then
i did a grid 5x4 and half the gens went OOM while switching models so i got a checkboard of results
>>
File: output.webm (199 KB, 380x380)
199 KB
199 KB WEBM
gn frens
>>
>gn
>>
File: 00047-731033013.png (856 KB, 616x1008)
856 KB
856 KB PNG
cant sneak one past the a.i. hall monitor.
>>
u gunna cri like last time or
>>
>>102248390
trippy
>>
>>102248439
what's the prompt on that lel
>>
File: _bing_.png (1.99 MB, 1011x1011)
1.99 MB
1.99 MB PNG
>>
>>102245766
Overbaked
>>
File: delux_ra_00078_.png (2.08 MB, 1536x1152)
2.08 MB
2.08 MB PNG
>>102248390
gn

>>102248396
gn

>>102248439
she looks practiced in the way of the sword

>>102248612
illiterate
>>
File: 00037-3668462268.jpg (107 KB, 1488x1008)
107 KB
107 KB JPG
>>
>>102248621
Sorry to have upset you it was just an observation
>>
>>102248510
https://files.catbox.moe/8rhh5n.png
the loras are to blame, or credit, for the weird look
>>
>>102248621
>buttchins
>>
File: delux_ra_00079_.png (1.94 MB, 1536x1152)
1.94 MB
1.94 MB PNG
>>102248633
>be retarded
>"you're retarded"
>OH I GUESS YOU'RE UPSET
retard
>>
>>102248691
I dont know why are are so Angry it was just an observation
>>
File: delux_ra_00080_.png (1.67 MB, 1536x1152)
1.67 MB
1.67 MB PNG
>>102248696
since you're too dumb to even understand why you're being called a retard: you replied to a post without reading any of the words. in the post, I describe why the image is burned, as apart of a conversation about cfg/guidance. your response was "this is burned". yes retard, that was the point.

me calling you a retard isn't me being 'upset' or 'angry', its me calling you a retard because you're a retard
>>
>>102248702
>>102248691
Added to pastebin
>>
used 3090 (24GB)
or
new 4070 Super (16GB)
>>
>hes crying again
>>
>>102248702
holy melty
go on!
>>
File: schizo-thread.jpg (234 KB, 1024x768)
234 KB
234 KB JPG
>>102248702
>>
>>102248717
With anything related to local AI generation, images, sound or text you want as much VRAM as you can afford.
>>
>why did anon migrate to /ldg/
>>
File: delux_ra_00081_.png (1.93 MB, 1536x1152)
1.93 MB
1.93 MB PNG
>>102248717
wait a month and get a 5090
>>
>>102248854
U ok now
>>
File: 00053-288269055.png (778 KB, 744x888)
778 KB
778 KB PNG
>>
>>102248702
The mask falls off (again)
>>
>>102248854
will it be $800?
>>
Guess he isn't as helpful as he claims to be???
>>
File: Cats.jpg (358 KB, 1536x1536)
358 KB
358 KB JPG
>>
>>
File: delux_ra_00082_.png (1.78 MB, 1536x1152)
1.78 MB
1.78 MB PNG
>>102249197
missed a zero, maybe
>>
Neeeeeeeerrrrrrrrrrddddddddsssssssssssssssss
>>
>>
File: delux_ra_00083_.png (2.42 MB, 1536x1152)
2.42 MB
2.42 MB PNG
>>102249396
no u
>>
File: 00056-1675913331.png (594 KB, 616x888)
594 KB
594 KB PNG
trouble prompting the 'puffing up cheeks' funny face thing.
>>
>>102248716
based basedbin manager anon
>>
>>102249411
shit's deep fried af
>>
File: nigbo.png (1.34 MB, 1192x792)
1.34 MB
1.34 MB PNG
>>
>>102249411
Do you realize you could end all of this at any point "notable poster"?
>>
File: 00049-2539875271.png (782 KB, 616x1008)
782 KB
782 KB PNG
>>102249600
he cant do anything for your mental problems
>>
File: nigbo2.png (1.14 MB, 1192x792)
1.14 MB
1.14 MB PNG
>>
>melty
>>
>>102249426
obese in negative if you're using any cfg modifier
otherwise, pointy face
>>
>>102249426
>>102249715
sorry, i'm high and completely misread what you wrote

>>
File: 000000_17333_.png (2.23 MB, 1032x1508)
2.23 MB
2.23 MB PNG
>>102249396
>>
File: Mike.jpg (218 KB, 1536x1536)
218 KB
218 KB JPG
>>
File: 00057-3246485037.png (1.12 MB, 888x616)
1.12 MB
1.12 MB PNG
>>
Anywhere where I can read up on prompting and whatnot for better results with Flux?
>>
>>
>>102249600
Notables were the first who called him out with a gen attached. He is just a lowly namefag and retard.
>>
>>102250280
Ran took everything from him...
>>
>>102249628
>psychward anon speaks up
>>
>>102249682
**** when he sees an ungroomed newfag
>>
>>102250600
its called sklunching
>>
>>102250600
Unironically >>>/trash/, they know pony the best and no one has ever had sex here before.
>>
>>102250600
It's called "getting some help"
>>
>>102250600
its called "doin' the stanky leg"
>>
Is there a specific term or tag for when a woman is nude but covered in jewelry to simulate clothing? Or alternatively metal clothes?
>>
>>102250710
"Nude woman adorned in jewelry"
>>
did someone got triggered by the no no word?
>>
>>102250783
I think making requests is against the site rules if I'm not mistaken
>>
File: 00008-504744535.jpg (836 KB, 1232x1776)
836 KB
836 KB JPG
>>
>it's cottonball time
>>
>>102250829
Someone needs a trip to the alps
>>
File: 00064-2813097906.png (833 KB, 887x520)
833 KB
833 KB PNG
>>102251077
mayhaps. ive never seen a mountain, irl, and never will, most likely.
>>
>>102251100
>ive never seen a mountain, irl
go out more then
>>
>>102251100
maybe because you're obviously obsessed with hills
>>
File: 00065-1725507783.png (951 KB, 888x616)
951 KB
951 KB PNG
>>102251119
amazing advice, thanks
>>102251139
uh, yeah, whatever that means.
>>
>>
>>102250807
What are you a rule following nerd?
>>
>>
>>102251100
you can't leave home?
>>
File: 00008-4067986864-g.jpg (2.05 MB, 2624x3520)
2.05 MB
2.05 MB JPG
will a 20gb vram be enough to run flux? been a while since i been here. i'll post the last image I made back in June.
>>
>>102251177
>pedo
>retard
checks out
>>
CAW CAW
>>
File: crowmerge.png (2.51 MB, 1024x1536)
2.51 MB
2.51 MB PNG
I forgot how much I love crowmerge. Shame the controlnet messes with faces enough to cause adetailer to fail.
>>
>>
File: delux_dc_00011_.png (2.39 MB, 1152x1344)
2.39 MB
2.39 MB PNG
>>102251215
we have a resident hall monitor schizo who's obsessed with site rules as a mechanism to call the teacher on posters he doesn't like.

>>102251517
I thought you said crowmage at first and now I want to make crow mages
>>
I HATE QUOKKA I HATE PURPLE WITCH.
FAKE ASS BITCHES
>>
File: crowmerge2.png (2.66 MB, 1024x1536)
2.66 MB
2.66 MB PNG
Using the muted parchment of the initial crowmerge gen combined with an ultra-saturated feminine IP-Adapter on the hiresfix on the armor makes for a fun look. Very important for creating and controlling contrast.

>>102251641
I keep making knights, but a crow mage would be fun too. Haven't gone back to make a horrory gen in a while.
>>
File: nigbo3.png (1.36 MB, 1192x792)
1.36 MB
1.36 MB PNG
>>
>>
>>102251641
Don't you mass report posts you don't like tho?
>>
File: delux_cm_00002_.png (2.46 MB, 1344x1152)
2.46 MB
2.46 MB PNG
>>102251913
you get mass reported because no one likes you, not because I'm all powerful. I mean, I AM all powerful, but you're also deeply unlikable
also, get new material. this shit is so fucking dull at this point. even joe rogan tries out a new joke or two each year
>>
File: 00009-1429186276.jpg (838 KB, 1776x1488)
838 KB
838 KB JPG
prob my fav gen ive ever made.
>>
>>102251995
what are you on about?
>>
File: delux_cm_00003_.png (2.64 MB, 1344x1152)
2.64 MB
2.64 MB PNG
>>102252000
checked and great gen. bird maid is in the same genre as crow mage
>>
>>102252000
it is nice but i'll never understand the preference for grainy images
i'm always trying to clean up noise (not details) from images and failing miserably
>>
>no one likes you
>you do the same thing over and over again
Oh le irony
But you wouldn't know anything about that
>>
>>102252026
the mention of crow, is why i added the finch, earlier mountains mentioned, same deal.
>>
>>102252042
ill blame the retroanimefluxv1 lora. it is at .5 strength. at .75 or higher... very noisy.
>>
File: delux_cm_00008_.png (2.59 MB, 1344x1152)
2.59 MB
2.59 MB PNG
erm... it got confused on whether it was drawing a crow or a hat

>>102252046
I'm jealous of how rich the scenery is. I'm trying to get some scenery in these gens but its mostly being overwritten by the painterly tokens
>>
>>102252087
describe the textures of the background before your character(s)
that's teh flux way, background, char, event, styles (even tho styles can go anywhere)
you must unlearn what you have learned
>>
File: delux_cm_00014_.png (2.67 MB, 1344x1152)
2.67 MB
2.67 MB PNG
>>102252106
I've tried shifting order around before with prompts but didn't notice much of a difference in attention tbdesu. maybe its just the difference between comfy and forge encoding; comfy has always used absolute weighting. I've never known definitely how weighting works with flux honestly
>>
>>102252087
the scene is described as "behind her, the majestic peaks of the Swiss Alps rise dramatically, their snow-capped summits cutting into the clear blue sky, adding a sense of grandeur and serenity to the scene." credit to chatgpt for giving free gpt-4o time.
>>
>>102252166
oh comfy is a pain in the ass to do weights when you come from forge. i still hate it. but forge is let's you pick in the settings what method to use for weights i believe
but the main point i've seen everywhere is to describe not just "mountains" but "peaks covered in ___" and other textures to bring out backgrounds (and as a bonus avoid bokeh)
unless your prompt is super short like "girl siting in moutains with bird" or something
like this
>>102252168
an llm to make a prompt from your original is a godsend for that
>>
File: crowmage.png (2.21 MB, 1024x1536)
2.21 MB
2.21 MB PNG
>>102252087
Happened on Crowmerge too. I guess the shapes are either really similar or something is weird in the dataset, but I can't think of any examples of this off the top of my head.
>>
he mad
>>
File: Giga-Copium.png (964 KB, 1022x500)
964 KB
964 KB PNG
>>102239069
>Disinfo-Copium-Machine go BRRRRRRRR

What a silly thing to lie about lol

https://x.com/halphelt/status/1831316915551137918?t=Q7enYETzZ5jvJzeufY6TIA&s=19
>>
File: delux_cm_00015_.png (2.72 MB, 1344x1152)
2.72 MB
2.72 MB PNG
>>102252232
>isn't a crow
>doesn't have a crow familiar
I give this a 0/10 on the crow mage scale. love it otherwise though

>>102252259
he's just flexin'

>>102252267
model collapse is real. there's studies showing 1) training on synthetic data is harmful to outputs and 2) the extreme proliferation of synthetic data is causing it to seep into training sets. thats part of why meta and openai are trying to find ways to tag/watermark ai content, so its easier to filter out of data
>>
This thread is now 21 hours old
>>
>>102252295
i am sure they can avoid sloppageddon. amusing to me because i am using flux loras trained partially from my 1.5 gens that mostly used loras trained from other older 1.5 gens, and leading back to my o.g. koff lora trained from my horrible sketch work.
>>102252327
too old
>>
File: ai pepe laughing.jpg (177 KB, 1024x1024)
177 KB
177 KB JPG
>>102252327
that's called "comfy" unlike a certain ui
>>
>>
>>
File: delux_cm_00023_.png (2.68 MB, 1344x1152)
2.68 MB
2.68 MB PNG
>>102252333
>i am sure they can avoid sloppageddon
yeah, its a problem but a problem with solutions. I've seen studies on the problem but I've seen studies exploring solutions too, either from better pruning synthetic data or increasing the value it can provide to training. and, of course, there's the holy grail of AGI that doesn't train from data but learns, decoupling it from raw data injestion
>>
>>102252455
>>102252455
>>102252455
>>
File: 00071-41877417.png (1.2 MB, 744x888)
1.2 MB
1.2 MB PNG
extra filler..
>>
>>
File: 00073-3418285013.png (1.12 MB, 744x888)
1.12 MB
1.12 MB PNG
>>
File: 00000-3234698016.png (826 KB, 744x888)
826 KB
826 KB PNG
from 5 past midnight last night, or morning or whatever
>>
File: r9k_1725173043331293.jpg (145 KB, 1378x746)
145 KB
145 KB JPG
>>102252295
>training on synthetic data is harmful to outputs and
No shit of course if you train on images that have fucked up fingers, weird looking eyeballs, extra toes, etc etc, the ai is going to learn those mistakes as being normal. That's why using REAL and high quality images along with high quality tagging is so important. "model collapse" it's just another buzzword they use because they don't even know what it means.

>the extreme proliferation of synthetic data is causing it to seep into training sets.
Not really. You don't even have proof of this happening because most of the people that train these models do not open source their data sets. Even the lazy companies and trainers that do these on a regular basis not realize the importance of the quality of images and tagging, so QC has been beefed up a lot compared to what they were doing back in 2022. Models like Flux would not be possible if they were lazy with QC. Same with SDXL and especially pony.
>thats part of why meta and openai are trying to find ways to tag/watermark ai content,
Trying to make sure they don't accidentally download and I image could be a reason why. But if they are fooled by the image into thinking it's real, that also makes a good enough to include in the data set because it looks real enough for a person,which means it won't negatively affect the data sets outputs at all because the picture looks real. Perceptual quality is what matters You will only get "model collapse" if you have very shitty or non-existent QC (just mass scraping without checking the images or tags and just throwing it into the blender. That's basically what they did with earlier versions of SD). Having good QC is also more cost-effective because it means you don't have to train your models as long.

https://www.researchgate.net/publication/350015537_AI_Efficiency_Index_Identifying_Regulatory_and_Policy_Constraints_for_Resilient_National_AI_Ecosystems

https://www.reddit.com/r/dataisbeautiful/s/6t76G1HJdT
>>
ded schizo general
>>
>>102252542
Is Synthetic Data all We Need? Benchmarking the Robustness of Models Trained with Synthetic Images
https://synbenchmark.github.io/SynCloneBenchmark/

The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better
https://arxiv.org/abs/2406.05184

SynthCLIP: Are We Ready for a Fully Synthetic CLIP Training?
https://arxiv.org/abs/2402.01832

Scaling Laws of Synthetic Images for Model Training ... for Now
https://arxiv.org/abs/2312.04567

Training on Synthetic Data Beats Real Data in Multimodal Relation Extraction
https://arxiv.org/abs/2312.03025

On the Limitation of Diffusion Models for Synthesizing Training Datasets
https://arxiv.org/abs/2311.13090
>>
>>102252561
>Is Synthetic Data all We Need?
Literally no one said that's all we need. Real data is much preferable like I said in my giant blog post. However if synthetic images are necessary, they need to make sure they at least look passable. Normies it even some of us were fooled by that Pope-in-a-white-coat gen that got posted to reddit a while back

https://www.cbsnews.com/news/pope-francis-puffer-jacket-fake-photos-deepfake-power-peril-of-ai/

This isn't even "haha the normies are so clueless!" Type shit. People that would typically know better or fooled too until they actually pixel peeped the image (something no one does on a regular basis when they are doom scrolling).
>>
>>102244927
do i put that in command line args or?
>>
>>102239069
how do i make money off this? start a pixiv account and spam anime armpits?



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.