[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: PW_80247_.png (64 KB, 256x256)
64 KB
64 KB PNG
Previous /sdg/ thread : >>101891401

>Beginner UI local install
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
SD.Next: https://github.com/vladmandic/automatic
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>Try online without registration
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://aitracker.art
https://openmodeldb.info

>Black Forest Labs: Flux
https://huggingface.co/black-forest-labs/FLUX.1-schnell
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Discord
6wUwtcJsr2

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: FDG_News_000017_.jpg (766 KB, 1344x960)
766 KB
766 KB JPG
>mfw Resource news

08/14/2024

>Flux.1-Dev NF4 Quant v2: flux1-dev-bnb-nf4-v2.safetensors
https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/1079

>ComfyUI-Lumina-mGPT-Wrapper
https://github.com/Excidos/ComfyUI-Lumina-mGPT-Wrapper

>bigdata-pw-Dataception: Dataset of datasets
https://huggingface.co/datasets/bigdata-pw/Dataception

>InstantX / FLUX.1-dev-Controlnet-Union-alpha
https://huggingface.co/InstantX/FLUX.1-dev-Controlnet-Union-alpha

>Judge Advances Copyright Lawsuit by Artists Against AI Art Generators
https://www.hollywoodreporter.com/business/business-news/artists-score-major-win-copyright-case-against-ai-art-generators-1235973601/

>Integrating Saliency Ranking and Reinforcement Learning for Enhanced Object Detection
https://github.com/mbar0075/SaRLVision

>ComfyUI nodes for ControlNext-SVD v2
https://github.com/kijai/ComfyUI-ControlNeXt-SVD

>ComfyUi UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
https://github.com/Isi-dev/ComfyUI-UniAnimate-W

>Stable Audio ControlNet
https://github.com/EmilianPostolache/stable-audio-controlnet

>ComfyUI_NAIDGenerator
https://github.com/bedovyy/ComfyUI_NAIDGenerator

08/13/2024

>Kohya training to enable Flux training with 12GB VRAM GPUs
https://github.com/kohya-ss/sd-scripts/pull/1374/files

>InstantX / FLUX.1-dev-Controlnet-Canny Updated
https://huggingface.co/InstantX/FLUX.1-dev-Controlnet-Canny/tree/main

>ClickAttention: Click Region Similarity Guided Interactive Segmentation
https://github.com/hahamyt/ClickAttention

>A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models
https://github.com/taehong-moon/ee-diffusion

>ComfyUI Dwpose TensorRT
https://github.com/yuvraj108c/ComfyUI-Dwpose-Tensorrt

>SSL: A Self-similarity Loss for Improving Generative Image Super-resolution
https://github.com/ChrisDud0257/SSL

>CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
https://github.com/THUDM/CogVideo
>>
>mfw Research news

08/14/2024

>Imagen 3
https://arxiv.org/abs/2408.07009

>Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models
https://arxiv.org/abs/2408.06995

>Prompt-Based Segmentation at Multiple Resolutions and Lighting Conditions using Segment Anything Model 2
https://arxiv.org/abs/2408.06970

>SceneGPT: A Language Model for 3D Scene Understanding
https://arxiv.org/abs/2408.06926

>Dynamic and Compressive Adaptation of Transformers From Images to Videos
https://arxiv.org/abs/2408.06840

>Token Compensator: Altering Inference Cost of Vision Transformer without Re-Tuning
https://arxiv.org/abs/2408.06798

>Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective
https://arxiv.org/abs/2408.06741

>DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion
https://arxiv.org/abs/2408.06740

>DC3DO: Diffusion Classifier for 3D Objects
https://arxiv.org/abs/2408.06693

>Masked Image Modeling: A Survey
https://arxiv.org/abs/2408.06687

>Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models
https://arxiv.org/abs/2408.06646

>EditScribe: Non-Visual Image Editing with Natural Language Verification Loops
https://arxiv.org/abs/2408.06632

>Prompt Recovery for Image Generation Models: A Comparative Study of Discrete Optimizers
https://arxiv.org/abs/2408.06502

>Synthetic Photography Detection: A Visual Guidance for Identifying Synthetic Images Created by AI
https://arxiv.org/abs/2408.06398

>Response Wide Shut: Surprising Observations in Basic Vision Language Model Capabilities
https://arxiv.org/abs/2408.06721

>Breaking Class Barriers: Efficient Dataset Distillation via Inter-Class Feature Compensator
https://arxiv.org/abs/2408.06927

>Do Vision-Language Foundational models show Robust Visual Perception?
https://arxiv.org/abs/2408.06781

>ViMo: Generating Motions from Casual Videos
https://arxiv.org/abs/2408.06614
>>
>we finally get a local model capable of complex scenes, fine details, etc..
>majority of images being spammed are shitty 1girls that are worse than 1.5
kek
>>
File: 2024-08-15_00035_.png (2.04 MB, 1024x1024)
2.04 MB
2.04 MB PNG
>>
>>101898130
welcome to /sdg/
simping
damn
(1)girls
>>
File: 1511238943617.jpg (14 KB, 320x320)
14 KB
14 KB JPG
>>101897949
>>101898050
Posting my question here, gonna sleep now, night night anons love you all
>>
File: PW_82483_.png (857 KB, 1024x768)
857 KB
857 KB PNG
>>
File: 2024-08-15_00036_.png (1.98 MB, 1024x1024)
1.98 MB
1.98 MB PNG
>>101898130
nogen

>>101898160
its what we do
>>
File: PW_82481_.png (845 KB, 1024x768)
845 KB
845 KB PNG
>>101898167
Good night, anon! Sleep well :]
I wish I could answer your question, but I have no clue unfortunately haha
>>
>>101898130
If you read through the discussions it seems like imagegen is progressing and big things are always right around the corner, but when you look at images from 18 months ago they look pretty much the same as the ones people are genning today. We've already hit diminishing returns lmfao.
>>
>>101898062
>>Flux.1-Dev NF4 Quant v2: flux1-dev-bnb-nf4-v2.safetensors
how does this compare to the fp8 checkpoint?
>>
Making low poly stuff on flux is pretty hard.
>>
>>101898181
>its what we do
indeed
>>
>>101898277
yeah, it's not really a do it all machine. so far.
>>
>>101898313
i am 5 inches from buying a retired mining card for flux dev.
>>
File: PW_82476_.png (911 KB, 1024x768)
911 KB
911 KB PNG
>>101898277
I think schnell might work pretty well for something like that! I haven't tried, but it does pixel stuff better than dev does
>>
>>101898329
it's fun stuff
>>
>>101898354
can flux alter a starting image?
>>
>>101898240
performance or quality?
>>
File: ComfyUI_00493_.png (2.64 MB, 1408x1408)
2.64 MB
2.64 MB PNG
>>
>>101898379
the 17.2 gb one:
https://huggingface.co/Comfy-Org/flux1-dev/blob/main/flux1-dev-fp8.safetensors
>>
>baby yoda in prompt
of course it knows baby yoda, but not a million celebrities lel

>>101898374
you mean img2img? i'm sure it can be used for that in some way
>>
>>101898408
I was asking what you wanted to compare. the quality is pretty much the same. nf4 is a lot faster if fp8 doesn't fit in your vram.
>>
File: deliggyliggy.webm (380 KB, 854x480)
380 KB
380 KB WEBM
can someone explain what supposed to be better about flux compared to SDXL or pony?
>>
>>
File: 1f2af95a.webm (524 KB, 864x1168)
524 KB
524 KB WEBM
oh luma..
>>
>>101898476
prompt comprehension
>>
File: 2024-08-15_00058_.png (1.45 MB, 1024x1024)
1.45 MB
1.45 MB PNG
>>
>>101898479
lol. scammed
>>
>>101898461
>nf4 is a lot faster if fp8 doesn't fit in your vram.
my card is just 16gb (6950xt), so it sounds smart.
>>
>>101898195
Delusional jeets are blind to aesthetics and cant tell that models have actually gone BACKWARDS style wise.
>>
>>101898479
Luma is useless and they pretend people pay for that shit
>>
>>
>>101898476
flux does two things.

One is the color range is more photographic. the other is it blurs backgrounds, often, giving a depth of field or bokeh effect. we don't know how to turn it off, it doesn't always show up, but often it is evident in early steps.

thr language model part is very good, it is pro grade, whereas sd is a real struggle, to get prompts to give you what you want.

However, we are still exporing both sd and flux. sd 1.5, even.

as an example, small image sizes in sd and flux generate very different results, for some prompts.

Also, certain seeds can seem to have a certain character across prompts.
>>
>>101898592
it's funny
>>
>>
>>
>>101898591
It's because people's neural networks are being trained by AI pictures(mere exposure effect), we are reaching a middle point between what is good and what is bad.
>>
File: ComfyUI_00494_.png (3.06 MB, 1408x1408)
3.06 MB
3.06 MB PNG
>>
>>101898591
I think mostly we just have people still setting their guidance too high.

But there have been some real setbacks. The dataset culling for 'aesthetic quality' has biased the models towards flickr-style garbage, excessive golden light, DoF, etc.
>>
>>
>>
File: delux_ci_00003_.png (1.32 MB, 1536x968)
1.32 MB
1.32 MB PNG
>laser gun
I guess I can't even be mad
>>
npc outta nowhere
>>
>>
>>101897973
is flux still broken on mac?
>>
>>
File: file.png (2.05 MB, 3840x2088)
2.05 MB
2.05 MB PNG
Is video upscaling there yet?
I am doubtful.
>>
File: tmpjcrrrdbw.png (999 KB, 768x1024)
999 KB
999 KB PNG
>>
gn all
>>
flux is the cummer's choice
>>
Sisters...nobody is posting
>>
File: 103026-tmp.png (3.11 MB, 1536x1728)
3.11 MB
3.11 MB PNG
>>
File: file.png (151 KB, 1884x995)
151 KB
151 KB PNG
i don't think my system can run flux dev on forge, most of the videos say to just grab the fp8 and drop it on the checkpoints folder, so i did that and whenever i try, i always get this error on the console, any tips?
>>
File: delux_ci_00006_.png (1.54 MB, 1536x968)
1.54 MB
1.54 MB PNG
>>101899234
gn

>>101899256
I'm trying to make a cool gun fight but not getting anywhere
>>
>>101899301
Catjak won
>>
>>101899337
won what?
>>
File: tmpuun60k5n.png (1.12 MB, 768x1024)
1.12 MB
1.12 MB PNG
>>
File: 103032-tmp.png (3.04 MB, 1536x1728)
3.04 MB
3.04 MB PNG
>>
File: tmpqgdhjsrb.png (1.1 MB, 768x1024)
1.1 MB
1.1 MB PNG
>>
>>101899420
Why are you posting in the containment thread?
Go to the new one
>>
File: delux_ci_00015_.png (1.73 MB, 1536x968)
1.73 MB
1.73 MB PNG
um, laser... pokers? idk what happened here
>>
File: 000000_16316_.png (2.82 MB, 1434x1075)
2.82 MB
2.82 MB PNG
>>
File: 103037-tmp.png (2.95 MB, 1536x1728)
2.95 MB
2.95 MB PNG
>>101899460
>Go to the new one
What other one? I only saw one thread.
>>
I think he has the word "debo" filtered
>>
>>101899498
/ldg/
It is 10x more active but it has a pastebin telling newfriends about the thread schizo in the OP so you might have it filtered, he's been attacking the general over the last few weeks.
>>
>>101899524
Also 10x more CP
>>
>>101899533
i see none
>>
>>101899538
Just fuck off then
>>
File: delux_ci_00016_.png (1.78 MB, 1536x968)
1.78 MB
1.78 MB PNG
>pew
>>
>>101899498
Debo was spamming CP on there earlier today if you want to believe it or not, he really has it out for /ldg/
>>
>>101899570
Don't bother replying, if he wants to pretend he just discovered both threads and hour ago let him rot in his own shit posting.
>>
File: tmpn4oq7dgu.png (1.13 MB, 768x1024)
1.13 MB
1.13 MB PNG
>>
File: delux_ci_00019_.png (1.63 MB, 1536x968)
1.63 MB
1.63 MB PNG
>>101899570
you will blame me for literally anything. you really do think every poster is me, lmao
>>
you better not get him angry >:(
>>
File: 000000_16320_.png (3.62 MB, 1434x1434)
3.62 MB
3.62 MB PNG
G'night Anons,
>>
>>101899681
you will never be japanese
>>
File: delux_ci_00020_.png (1.76 MB, 1536x968)
1.76 MB
1.76 MB PNG
>>101899704
gn
>>
>>101899723
He will never be accepted in his own thread he squatted in for 2 years straight either
>>
File: tmp0flp5ehc.png (1.09 MB, 768x1024)
1.09 MB
1.09 MB PNG
>>
>>101899739
i know. im sonic schizo anon from the old days. gotta check in on my personal lolcow from time to time.

oh I almost forgot... Hi :) Good morning :) hows the humidity today?
>>
File: 00797-1568426366.jpg (977 KB, 1560x1944)
977 KB
977 KB JPG
goo night
exit light...
dream of soft, cute, pleasant things.
>>
File: tmp5bgz_dow.png (1005 KB, 768x1024)
1005 KB
1005 KB PNG
>>
>fran moved to /ldg/
>>
File: delux_ci_00026_.png (1.79 MB, 1536x968)
1.79 MB
1.79 MB PNG
>>101899852
gn
>>
The dream has been realized
All I had to do was wait and do nothing
>>
File: 1723697011248_1.jpg (65 KB, 984x984)
65 KB
65 KB JPG
>>101899852
GN anon
>>
File: 1723697606191_3.jpg (62 KB, 984x984)
62 KB
62 KB JPG
>>
File: PW_82441_.png (857 KB, 1024x768)
857 KB
857 KB PNG
>>101899234
>>101899704
Good night, anons! Sleep well :]
>>101899286
Heya, Fran!! Good to see you again :]
>>
>>101900282
Fran is posting on /ldg/ and we encourage you to come there as well
>>
>>101900318
No the faggot can stay here
>>
>>101900323
Quiet debo
>>
File: PW_82446_.png (866 KB, 1024x768)
866 KB
866 KB PNG
>>101900318
No thanks, but thanks for the offer, anon! :]
>>
Thank god
>>
Dodged a fucking missile there, whew
>>
File: 00073-2024-08-12-cJak.png (2.05 MB, 1024x1344)
2.05 MB
2.05 MB PNG
Crazy how there's multiple anons keeping watch of the schizos
Good job guys I salute all of you
>>
File: 57683.png (3.26 MB, 1440x3120)
3.26 MB
3.26 MB PNG
/ldg/ never ever
>>
>>101900426
anon can easily tell when you nogen kek
>>
File: 57684.jpg (723 KB, 1440x3120)
723 KB
723 KB JPG
>>101900426
but i didn't nogen so what the fuck are you talking about?
>>
File: 57685.png (2.42 MB, 1440x3120)
2.42 MB
2.42 MB PNG
can't wait for these retards to call me debo fucking LOL. are you perchance a groyper, as well? spiritually brown
>>
File: 00009-2841367328.png (1.69 MB, 1152x896)
1.69 MB
1.69 MB PNG
>>101900470
>are you perchance a groyper
No doubt. They've tied their identity to an AI image generation thread and are under the delusion they hold some kind of power. I am going to restate my belief that these posters are like 12 years old.
>>
File: 57686.png (3.13 MB, 1440x3120)
3.13 MB
3.13 MB PNG
the slop must flow
>>
>>101900514
The sad reality is it's middle aged men with full blown autism.
>>
Bros malding
>>
>>101900381
Best image ITT
>>
File: 57678.png (2.61 MB, 1440x1440)
2.61 MB
2.61 MB PNG
>>101900572
you need to go back
>>
File: 00009-1466438136.png (863 KB, 896x1152)
863 KB
863 KB PNG
>>
File: file.png (2.15 MB, 1200x1200)
2.15 MB
2.15 MB PNG
#illuminatisowhite
>>
>>101901255
hes so cool
>>
>>101901270
yeah but no
>>
i really dont care
>>
>>101898479
Anyone tried posting an image with convenient censorship and tell luma to rotate the pov? I just tried it, but sometimes it takes days until my video is done.
>>
File: 00012-2897654696.png (938 KB, 896x1152)
938 KB
938 KB PNG
>>
File: 00014-2095082350.png (1.15 MB, 896x1152)
1.15 MB
1.15 MB PNG
>>
>>101898776
is this unedited? this is actually kino can i get a catbox please? really nailing the shitty chinese cellphone camera look of 2010 or so
>>
File: delux_ci_000007.jpg (107 KB, 578x578)
107 KB
107 KB JPG
>>
File: gen_tmp_08.jpg (122 KB, 1400x992)
122 KB
122 KB JPG
What is the thread theme
>>
>>101901860
1girls, quokkas, purple witches, soijaks, centaurs
>>
File: gadget0003.jpg (133 KB, 1304x1304)
133 KB
133 KB JPG
>>101902021
Hasn't that been the theme for like ever
>>
kinda staring to get the appeal of scraping, don't wanna do it properly and set up a db but it's kinda fun nonetheless
>>
>>101901760
Extremely powerful gen
>>
>>101900381
Doing my part
Never forget: trani is literal human garbage
>>
File: 00021-476882849.png (1.17 MB, 896x1152)
1.17 MB
1.17 MB PNG
>>101902037
Usually. Nice gadget.
>>101902072
Thank
>>
Is this Luma or something else?
How do I achieve this?

https://x.com/VApollyonAI/status/1823616380442194314
>>
finally got Flux working. are 30-ish second generations normal for a single image? I'm on a 3090 Ti
the generations are quite great, I'm getting the hype now
also, what's the status on people being able to finetune this model? last i heard it was impossible, but now I'm seeing people make loras on civitAI, so now I have no idea
>>
File: hq720 (1).jpg (37 KB, 686x386)
37 KB
37 KB JPG
What's your estimation date for Flux compatibility and Automatic1111?
>>
>>101902322
A lot faster than my 3080. There are people saying they are working on finetunes and it is possible. There's this, but I don't think it's a real finetune
https://civitai.com/models/645943/fluxunchained-artful-nsfw-capable-fluxd-tuned-model-by-socalguitarist
>>
>>101902340
I'm running flux on Forge right now, does that count?
>>
>>101902340
is SD3 even implemented yet?
>>
I've been trying to achieve that soft retro 80's painted look.
>>
File: 00022-1378769234.png (929 KB, 896x1152)
929 KB
929 KB PNG
>>101902340
It works with Forge, you can just use that
>>
>>101902358
Nobody uses sd3
>>
>>101902398
because it's not implemented
and other reasons, but that's a big one
>>
>>101902411
sd3 is implemented on A1111 2 months ago, no one care about that piece of gagbage
https://reddit.com/r/StableDiffusion/comments/1dh0z5v/sd3_support_has_been_added_to_automatic1111/
>>
File: 1girl_1_103.jpg (663 KB, 1376x1840)
663 KB
663 KB JPG
anyone try the gguf models yet?
>>
>>101902411
Its in A1111 for awhile. Nobody uses it because its trash
>>
File: gadget0009.jpg (136 KB, 1304x1304)
136 KB
136 KB JPG
Adios muchachos.
>>
File: Comparison_all_quants.jpg (3.84 MB, 7961x2897)
3.84 MB
3.84 MB JPG
>>101902434
yeah, I left fp8 for Q8_0, that one is way closer to fp16
>>
File: file.jpg (239 KB, 1024x1024)
239 KB
239 KB JPG
>>101902050
you don't have to use a database, here are some of the advantages for big scraping though
>you can easily avoid duplicates by adding a unique index, otherwise your script needs to keep track of the ids in a set or use a dict with the id as the key, that adds to memory requirements which is a problem if you're scraping on cheap low end nodes
>you avoid potential data loss if a script crashes before a save point, and you can just stop it then resume whenever
>you can keep track of additional stages like adding a 'downloaded' field, and distribute to different nodes with a 'worker_id' field
>you don't need extra processing to combine multiple files when distributing
i use mongodb so i don't need to define a schema, and i like the way it integrates in code compared to something like postgres x psycopg and writing all those queries, and you need to define the schema again with sqlalchemy. sometimes you can't know the schema in advance from one request because the service is actually using mongodb or similar so the schema can/will be different
i'll also say that 99.9% of the time you can do everything using requests, as in get/post not the python library, you should probably use curl-cffi, its faster and there are cases where python requests library is blocked due to ja3 fingerprinting. there are very few circumstances where i've had to use browser automation, recently the only one was automating civitai's registration because my captcha service kept failing on recaptcha v3 enterprise. same thing for parsing html, the majority of sites are CSR and thanks to nextjs even the SSR ones have a script tag with the json data in
>>
File: file.jpg (266 KB, 1024x1024)
266 KB
266 KB JPG
more advice
i dont recommend proxies for a few reasons
>your scripts will be more complex with threading/process pools and rotating logic
>if you're getting ip bans you're doing it wrong, be respectful in your work
>residential proxies are ridiculously expensive because the botnet owners realized they could appear more legitimate by selling bandwidth instead of lists, there are very few cases where you actually need residential ips. datacenter proxies are pointless when you can get higher reputation ips and unmetered/high limit bandwidth from the usual server providers
>proxy providers should not be trusted, like vpn providers there are cases where they are operated by bad actors or even foreign state actors
>you're not gaining any bandwidth capacity compared to using multiple nodes
>>
File: 1girl_1_111.jpg (821 KB, 1376x1840)
821 KB
821 KB JPG
>>101902462
nice good to know, i'll try Q8_0, been using nf4, seems like that's the best low bit quant rn
>>
>>101903067
>i'll try Q8_0, been using nf4
if you can't for a big boy like Q8_0 and want a quant at the same size as nf4, go for Q4_0, it's way better than nf4
>>
File: 1569332951.png (1.47 MB, 896x1152)
1.47 MB
1.47 MB PNG
>>
>>101903067
What's the difference between these models and nf4 etc.?
>>
>nigbo
>>
File: 1girl_1_116.jpg (757 KB, 1376x1840)
757 KB
757 KB JPG
>>101903076
i can run the full fat, but i need my gpu ram for LLMs too

>>101903220
see this post
>>101902462
>>
>>101902462
how do quants work with loras? do they need to be retrained?
>>
>>101903253
I saw the comparison, was just wondering what the advantage of a quantized model is?
>>101903320
Comment on civitai said they don't work with loras
>>
>>101903357
ok so you didn't see the comparison then because it details all the advantages/disadvantages, your comprehension is worse than claude opus baka
>>
>>101902651
>>101902863
Thanks for the info, good to know.
Not doing anything particularly intensive / being mindful not to potentially ruin it for everyone, can see how that could happen if you just yolo it however
>>
>>101903397
I see that nf4-v2 was 8.1/24.0GB vs Q4_0 was 8.3/24GB and gen time was a little longer. I do think the Q4_0 image is better, but am not sure what other information I'm not comprehending.
>>
File: 1639264423.png (1.52 MB, 896x1152)
1.52 MB
1.52 MB PNG
>>
>>101903710
very cool
>>
File: grid-0002.jpg (1.42 MB, 3200x3200)
1.42 MB
1.42 MB JPG
>>
Any version of Flux that works with LORAS?
>>
Does anyone use Stability Matrix? Does it hinder performance for the image generation?
>>
File: grid-0003.jpg (1.15 MB, 3200x3200)
1.15 MB
1.15 MB JPG
>>
https://github.com/intel/AI-Playground

intel arc bros...
>>
File: file.jpg (213 KB, 1024x1024)
213 KB
213 KB JPG
>>101903519
see anna's archive vs worldcat, they allegedly caused millions in damages by being reckless
what are you working on atm?
i've just put together a processing script for flickr to prepare the initial release
i use pyspark for this with mongodb connector, it can work with other formats like json
dataframes are nice to work with and pyspark is faster than something like pandas. i save to parquets, its best practice because the format is more efficient than others due to the way it stores data, built-in compression support and partitioning
btw jsonlines is a good alternative to plain json for scraping scripts, one advantage is that you can write data constantly rather than json.dump at the end or at intervals
>>
>>101904088
all of em if you have the vram
>>
>>101904366
what do you mean? sd 1.5 pony and sdxl loras work with flux?
>>
File: 2024-08-15_00218_.png (2.26 MB, 1280x1024)
2.26 MB
2.26 MB PNG
>>101904623
no they dont, but loras will add even more VRAM requirements on flux

pic related
>>
Why did you split the thread? Is it because of the influx of flux users?
>>
>>101904991
The hetero/nonhetero divide.
>>
File: 3784308148.png (1.33 MB, 896x1152)
1.33 MB
1.33 MB PNG
>>101903982
thanks
>>
>>101905049
Damn. They're essentially the same thread. Now I have to keep checking both threads constantly (no homo).
>>
>>101905058
You can safely ignore this thread for the most part, it is an unofficial containment thread for certain individuals
>>
Just FYI "file.jpg" is debo
>>
>>101905135
Good to know, because I have been posting with that filename in the other thread (copying and pasting from comfy).
>>
>>101905166
Happy to help
>>
File: 00076-431962997.jpg (106 KB, 832x1216)
106 KB
106 KB JPG
>>
File: file.jpg (397 KB, 1792x1024)
397 KB
397 KB JPG
preparing flickr release now. 269m rows for this first version
>>101905135
no that me
>>101905166
he's obsessed desu and thinks everyone is debo
>>
ComfyUI does some magic and allows me to work on higher resolutions that SDUI doesn't (tiled vae decoding for example). But I like the simpler UI in SDUI. Are there other ones that will let a vramlet (8 GB) like me upscale to 1024x1024 without running out of RAM?
I'm forced to use 32fp because of fucking AMD (might get resolved in ROCM 6.2, hopefully).
>>
>>101905277
Ok debo
>>
File: 1723729568422_image.jpg (159 KB, 970x1304)
159 KB
159 KB JPG
>>
is flux the real deal? night and day?
>>
>>101905483
はい
>>
>>
File: 1710011659565247.png (5 KB, 500x600)
5 KB
5 KB PNG
Why do I need to login to AI generator sites if I want to create something? I don't want to login.
>>
File: 00008-4042586485.jpg (954 KB, 1560x1944)
954 KB
954 KB JPG
goo morning. going to do laundry for the first time in over 2 years
>>
>>101905948
good luck
>>
now on flux gguf on forge
>>101899234
>>
>>101901812
Can't do catbox (please understand) but the relevant bits of the prompt are:

>grainy indistinct myspace pic, weird hair.
>an awkward normal young woman's low quality image on social media
>so random XD jess omgg ur so EMO!!

Then relevant gen settings are the size and the guidance which I can't exactly pinpoint because I generated it randomly within a range but it's somewhere between 1.1 and 1.5

Even with all that, FLUX still has a strong bias toward 'high quality' photos. Haven't found a perfect trick for getting around that yet.
>>
>>101906023
This looks incredible.
>>
File: ifx35.png (955 KB, 1024x1024)
955 KB
955 KB PNG
>>101906023
here's a fun one, trying ImageFX
>>
>>101905995
>gguf
whats that
>>
>>101906053
The sum of all my knowledge of FLUX basically amounts to: "set your guidance somewhere between 1.1 and 1.5"
>>
>>101906160
see /ldg/ (past 2-3 threads) or
https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/1050

i guess it's what LLM people use to fit their large models into commercial GPUs, so the flux dev model was 23+GB and is now around 11.8 on gguf q8 (whatever that means).

it seems to work well enough
>>
File: file.jpg (534 KB, 1792x1024)
534 KB
534 KB JPG
went to the shop and forgot the coffee >:(
flickr 270m dropping soon:tm: just waiting on repartition, after that i'm adding license name to accompany flickr id for the license, then i'll start the upload
>>
File: BMP_FLUX_00108_.png (1.49 MB, 904x994)
1.49 MB
1.49 MB PNG
>>101906648
I don't understand any of this but congrats
>Gimp crashes again for the 200th time after I'm done with it
Looks like I win again this time, universe
>>
>>
>>101906786
where have you been?
>>
>>101906826
wouldnt you like to know
>>
File: hot(06).jpg (93 KB, 984x984)
93 KB
93 KB JPG
So I learned that the Elon chatbot uses FLUX for it's image generator and on one hand I'm glad more people are getting exposed to FLUX but on other hand I hate that most people will think you need to pay Elon to run FLUX and now FLUX Will be forever be attached to those cryptobros Elon musk obsessed faggots.
>>
i miss schizo anon
>>
>>101906826
Playing old vidya with mods. Also demotivated by not being very good with Flux. Although I liked all the anons who used my We-have-Miku-at-home-robot-antenna-hugs-toaster whatever prompt thingy though.
https://files.catbox.moe/sdoxeu.jpg
>>
File: delux_ci_00022_.png (1.6 MB, 1536x968)
1.6 MB
1.6 MB PNG
>>101906901
the more people that pay twitter to gen pictures, the more funding BFL will have access to (hopefully). regardless of how the specifics of the funding pipeline shake out, it positions BFL into a healthy position for future development. we just have to hope that they remain interested in publishing open models going forward
>>
thank you, thread schizo
>>
>>101906901
I pay premium for my business so I can use unlimited. Unfortunately I can't try to do titty pics because I wouldn't be surprised if Elon somehow leaked everybody's prompts
>>
File: FLUX_00003_.png (1.22 MB, 896x1152)
1.22 MB
1.22 MB PNG
rebuild went great, thanks for asking
>>
>>101907216
whats your business?
>>
>>101907275
I won't go too specific but it's an app for couples I made which I promoted by posting lots of memes and growing on Twitter. We're a pretty silly account and make edgy jokes and all, but I still don't want logs out there of my desperately trying to generate lewds associated with the brand lel
>>
File: delux_ra_00030_.png (1.59 MB, 1536x1152)
1.59 MB
1.59 MB PNG
>>101907249
what parts did you replace? hopefully you snuck in some nice upgrades
also what aesthetic tokens/phrases are oyu using for this gen? I wanted to get something closer to what you're getting than what I got
>>
File: 1723151458944260.png (3.62 MB, 1344x1728)
3.62 MB
3.62 MB PNG
>>101906826
Also I wanted to make a thread with this image for the OP because I really liked it a lot but I kept getting beat to it so I'll just post it here.
>>101906901
I sold 5K worth for my AC bill at $250, should have dumped it all but I got greedy, should have known it would tank considering how volatile it's been for the past year, rip.
>>
>>101907389
prompt please?
>>
>>101907399
Not my image senpai
>>
File: file.png (1.53 MB, 1024x768)
1.53 MB
1.53 MB PNG
Oh yeah I wanted to get one of Trump mugging somebody and ending up getting this which made me giggle
>>
File: file.jpg (472 KB, 1792x1024)
472 KB
472 KB JPG
>>101906648
coffee is a drink. repartitioning is a detail of parquet file format, it splits the records between multiple files. flickr uses an id 0 to 10 for the license info, they have it as a string for some reason, i've cast it to an int so you can filter like license > 0, and i'm also adding the license name, but renamed to the common format e.g. `CC BY-NC-SA 2.0` instead of `Attribution-NonCommercial-ShareAlike License`
{0: 'All Rights Reserved',
1: 'CC BY-NC-SA 2.0',
2: 'CC BY-NC 2.0',
3: 'CC BY-NC-ND 2.0',
4: 'CC BY 2.0',
5: 'CC BY-SA 2.0',
6: 'CC BY-ND 2.0',
7: 'No known copyright restrictions',
8: 'United States Government Work',
9: 'CC0 1.0',
10: 'Public Domain Mark 1.0'}
>>
>>101906786
oops this >>101907479 was for you
>>
File: BMP_FLUX_05512_.png (762 KB, 1024x1024)
762 KB
762 KB PNG
>>101907497
I was already cool with you, no need to cater to my airhead fetish (no homo). Also the coffee explanation was funny
>>
can twitter flux do kamala?
>>
File: 2024-08-15_00215_.png (1.85 MB, 1280x1024)
1.85 MB
1.85 MB PNG
>>101907585
can twitter.. are you joking?
>>
File: 103065-tmp.png (2.85 MB, 1536x1728)
2.85 MB
2.85 MB PNG
>>
File: file.jpg (438 KB, 1792x1024)
438 KB
438 KB JPG
>>101907581
i don't mind explaining stuff. hopefully some people will take it on board and learn something. it would be cool to see more people doing data stuff. i'd love to have more members in the big data hf team desu
>>
eh, comfyui keeps going OOM but forge just werks
i'm done trying with comfyui
>>
File: 103072-tmp.png (2.57 MB, 1536x1728)
2.57 MB
2.57 MB PNG
>>
File: 103073-tmp.png (3.26 MB, 1536x1728)
3.26 MB
3.26 MB PNG
>>
File: 01087-1923652683.png (1.44 MB, 1024x1024)
1.44 MB
1.44 MB PNG
>>101907621
Whew lad. Also what's up with the halo?
>>101907698
My programming knowledge is minimal. I made an rpg once in Python where I had like 15 rooms you could navigate, and you could select which of the 3 goblins you wanted to target when an encounter started, that was enough math brain drain for me.
>>
>>
File: 103058-tmp.png (3.04 MB, 1536x1728)
3.04 MB
3.04 MB PNG
>>101907896
>what's up with the halo?
I think it looks cool
>>
>>101906901
you are sexually aligned, therefore submid
>>
File: gen_tmp_15.jpg (135 KB, 1400x992)
135 KB
135 KB JPG
The best way to predict the future is to create it
>>
what is the best automated captioning software for datasets?

I tried civitai one and one in a Collab, but their results are so bad that I'll have to recaption all by hand....
>>
Do you guys find generally that you get better results using fewer proompt terms? Like obviously you don;t have as much control but it's my experience that the shorter the proompt the better quality the result, kind of a Jesus Take The Wheel situation. But I am also running in lowvram mode so maybe that has something to do with it.
>>
File: delux_ci_00023_.png (1.87 MB, 1536x968)
1.87 MB
1.87 MB PNG
>>101908079
anons were discussing that a bit yesterday. maybe this conversation chain will be interesting for you: >>101896299

>>101908095
depends heavily on the model and prompt and goal
>But I am also running in lowvram mode
this doesn't affect prompt understanding. the only case where vram footprint would affect prompt understanding is if you were using different encoders, like t5-fp16 vs t5-f8. but in that case, you have to manually change it, its not automatic based on vram
>>
File: 00014-1959905581.jpg (521 KB, 1248x1560)
521 KB
521 KB JPG
>>101908095
yeah, or you can use some 1000 token prompt and get surprised
>>
File: florence-large.png (3.21 MB, 1718x3130)
3.21 MB
3.21 MB PNG
>>101908079
https://huggingface.co/microsoft/Florence-2-large is the best model atm imo
idk about software supporting it, i'll be putting something together for myself desu so i could release that when its done. what format is your dataset/how are you expecting it to work? just so i can make sure other's use cases are supported
>>
I'm getting some paintings I really like from Flux
>>
>>101908218
nice
>>
File: 103087-tmp.png (2.64 MB, 1536x1728)
2.64 MB
2.64 MB PNG
>>
File: delux_ci_00024_.png (1.71 MB, 1536x968)
1.71 MB
1.71 MB PNG
>>101908218
this is really good. I was trying for something similar to this but missed by a wide margin. willing to share the prompt? I'm curious how you got it
>>
>>101908218
>101908272
do not engage him
>>
>>101908293
I was literally just about to press enter with the prompt. Is there a reason I shouldn't engage him?
>>
>nigbo baiting newfags again
>>
>>101908313
A rentry exists for new posters to warn about him. He is not someone you want to engage with.

https://rentry.org/debo
>>
>hes literally too new
>>
>>101908214
I'm training a style based on Demons souls remake and Dark souls 3, starting to get some good results already, bit the captioning is holding progress
>>
File: delux_ci_00025_.png (1.84 MB, 1536x968)
1.84 MB
1.84 MB PNG
>>101908313
this schizo stalks and harasses me literally every single day. I'm just trying to talk about imggen
>>
>>101908165
thanks for the help, I'll look at it
>>
>>101908337
was literally just about to post his rentry
gg anon
>>
>singular schizo cope
>still
>>
>>101908337
It's a shame that pretty much every general these days has at least one dedicated schizo lol, I appreciate the warning!
>>
>harrassed
>on an anonymous mongolian basket weaving forum
>>
File: BMP_FLUX_05132__cleanup.png (860 KB, 648x1200)
860 KB
860 KB PNG
>>101908357
Cool always love gens from this game
>>
>>101908313
Oh, and he's also likely the CP spammer over on /ldg/, another general most of the posters here migrated to to get away from him. All the interesting tech discussions are there as well.

>>101908407
You are welcome.
>>
File: file.jpg (281 KB, 1792x1024)
281 KB
281 KB JPG
flickr finally uploading
>>101908407
you've been tricked. the dedicated schizo is the poster you're replying to
>>
File: gdfgfgd.jpg (1.4 MB, 4200x1253)
1.4 MB
1.4 MB JPG
debo did nothing wrong
zap and heaven seem to be the best ones, i agree
>>
File: 102749-tmp.png (2.96 MB, 1536x1728)
2.96 MB
2.96 MB PNG
>>
>>101908445
I'll be real with you, I don't really care
>>
>only friend is a pedo from /degen/
>>
>>101908375
I told you like a hundred times already: just be honest, be a man, stand behind what you did and be actually sorry about it. But no, "everyone else is the problem" and you never did anything wrong. Do you realize just how many anons are annoyed by your games? That you now (again) try to play the victim card (which doesnt work because of the rentry and pastebin kek)? /ldg/ was created just to avoid you, you specifically, thread schizo
>>
>>101908498
Iirc /ldg/ was created by the Pixart/Sigma gang, but it also works to get away from debo because he considers this general his home
>>
>>101908445
You haven't been in this thread in ages and you had to change your name because no one in the industry wants to work with you kek
>>
File: file.jpg (347 KB, 1792x1024)
347 KB
347 KB JPG
>>101908518
my name is still the same. i made a group so others can join in if they want to
get a grip desu
>>
File: delux_nf_00052_.png (2.36 MB, 1344x1152)
2.36 MB
2.36 MB PNG
>nogen hyper-obsessive schizo posting and drama dumping
day 500 of you not beating the allegations
>>
>>101908595
Who are you?
>>
Too bad no one (who knows) will want to stay once they realize KEK
>>
Morning anons
>>
>101908608
Cry more
>>
File: delux_ra_00029_.png (2.57 MB, 1536x1152)
2.57 MB
2.57 MB PNG
>>101908633
gm
>>
>>101908375
>I'm just trying to talk about imggen
What does >>101908670 have to do with image generation?
>>
>>101908608
No one mentioned you but thanks for inserting yourself into the convo as usual
>>
>>101908608
"Singular schizo anon" is not the one who tried to distribute malware for weeks to catch "singular schizo anon", thread schizo
>>
Did Fizzledorf get fried again or something?
>>
>>101908688
I remember the interior anon days lmfao
>>
>>101908734
God i hope so
>>
>>101908238
Angel MILKERS?
>>
File: 103074-tmp.png (2.73 MB, 1536x1728)
2.73 MB
2.73 MB PNG
>>101908783
Amazing. Here are some real angel milkers.
>>
>>101908793
those are some sinful boobs
>>
File: BMP_FLUX_05538_.png (980 KB, 1024x1024)
980 KB
980 KB PNG
Oh no
>>
nice blue archive halo
>>
File: delux_ra_00032_.png (1.87 MB, 1536x1152)
1.87 MB
1.87 MB PNG
>>101908921
RIP mouse
>>
>he still responds to anons who have him filtered
I didn't think it was possible for IQ to be that low
>>
>>101909021
>>101909021
>>101909021
>>
File: file.jpg (262 KB, 1024x1024)
262 KB
262 KB JPG
>>101908613
hlky
i used to develop a library for stable diffusion and other models, it powered stable horde. in collaboration with laion we were gathering aesthetic ratings using a system i developed. as part of that i was reviewing images generated via stable horde. i discovered the service had a huge pedo problem. i didn't take it well. i ended up deleting everything i developed and demanded stable horde stop using the library i develop. they wouldn't, rather couldn't because there wasn't really any other option. they decided to keep the library online and removed license notices, essentially denying me credit for my work, again i didn't take this well and used the license to force the repo to be taken offline for a couple weeks
in hindsight it was pretty stupid and i could have done more to fight the problem had i stayed
in my defense i had little professional experience at the time, i used to work in warehouses, i was also working at SAI at the time which was generally not a good experience for various reasons that don't even need explaining because it's SAI, and i think the csam problem understandably negatively affected my mental health.
there have been other disagreements with other projects i've worked on, for example, AIT node for comfyui. AITemplate as a whole was still largely experimental and had already been pretty much abandoned by Meta, so there were issues with the node, and i'd get annoyed with issue reports. that time instead of deleting the repo i transferred it first to comfy, who immediately made it private for some reason, then after i complained he transferred it to fizzledorf aka ani. they could not keep up with development of the node and abandoned it
all of this was like over a year to 18 months ago
since then i've got better, now i do data stuff and have continued development of an AITemplate fork with new supported kernels, modeling support etc, it's mainly just for fun though, i haven't got any interest in developing a node again
>>
>>101909066
ok
>>
holy tldr
>>
File: BMP_10019_.png (2.28 MB, 1328x1328)
2.28 MB
2.28 MB PNG
>>101909066
I've been at 10+ different jobs before, always remember you are a number and your opinion doesn't matter. That being said good job on staying sane and sticking to your guns. We'd probably both be millionaires at this point if we were smart and just followed what the corrupt uppers told us to do, it's always a tossup.
>>
>>101909066
I am also "first for (tr)ani is literal human garbage" poster.
AMA
>>
File: delux_il_00036_.png (2.19 MB, 1536x1152)
2.19 MB
2.19 MB PNG
>>
File: delux_tru_00043_.png (1.03 MB, 1344x768)
1.03 MB
1.03 MB PNG
>>
File: 1693418227088777.jpg (7 KB, 128x112)
7 KB
7 KB JPG
>>
>>101908214
moondream mogs florence

>>101909066
nice i think i used some of your stuff in the very early days when i was doing all my AI gens through a colab
>>
>>101908410
it's coming along well, I may post it here when it gets refined
>>
>>101908992
looks sweet, catbox?



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.