/g/ - Technology



File: long dick general.jpg (2.33 MB, 3264x2448)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>101879426

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: delux_flebo_00033_.png (1.53 MB, 1216x832)
>mfw
>>
I bet you kiss girls faggots
>>
>>101882089

new breadmaker?
>>
>>101882175
>>
File: 00075-1599418401.png (1.59 MB, 1152x896)
>>
File: ComfyUI_01363_.png (665 KB, 768x768)
Testing that owl prompt from the previous thread
>>
File: 00079-179768410.png (1.58 MB, 1152x896)
>>
N
>>
>>101882231
>N
endroid owner above ^
>>
File: 00080-2253546269.png (1.18 MB, 896x1152)
>>101882208
Nice, here's mine.
>>
File: Capture.png (1.07 MB, 1421x1690)
https://huggingface.co/spaces/fancyfeast/joy-caption-pre-alpha
It's pretty good indeed, still not at the level of GPT4V though
>>
>>101882247
gpt4v cant do porn and is not free, joycaption is based on llama 3.1 8B
>>
>>101882261
for SFW you could use GPT4 and for NSFW you could use joy caption, that way the model will see multiple prose styles instead of seeing only one kind of slop
>>
>>101882273
if booru then feed the captioner the tags and it performs at or better than gpt4 most of the time.
>>
I
>>
G
>>
G
>>
>>101882283
>if booru then feed the captioner the tags and it performs at or better than gpt4 most of the time.
gpt4 also can eat the booru tags though, so it's probably gpt4 + booru tags > joy captions + booru tags for the SFW department imo
>>
File: 00381-2823384128.png (1.54 MB, 1152x864)
This man blocks your path what do you do?
>>
E
>>
>>101882307
N
>>
>>101882320
I
>>
A
>>
>>101882329
C
>>
File: file.png (1.74 MB, 896x1152)
>damn, flux can do that?!
>>
File: 1723607657878269.jpg (354 KB, 2748x1286)
https://github.com/kohya-ss/sd-scripts/pull/1374#issuecomment-2287134623
24gb fp16 finetune? Sounds too good to be true, what's the catch?
>>
>>101882305
suck him off of course
>>
>>101882348
add a catbox anon, so that we can see the boobies in full and beautiful display
>>
>>101882261
Not bad.
>>
File: ComfyUI_01767_.png (1.55 MB, 1344x768)
>>101882348
>>101882369
https://files.catbox.moe/obzx64.png
(boobs, obviously)

ignore the rest of the autism in the workflow
>>
>>101882247
>>101882374
From the looks of it, it's better than Florence and it can do NSFW, maybe the new local SOTA, where can I download it?
>>
>>101882395
nice anon, nice looking boobs
>>
All I want is a caption model that stops calling asses bottoms and buttocks.
>>
>>101882351
just change them later
>>
>>101882374
>headband
>its a blindfold
>>
>>101882416
give in and start prompting for buttocks too
>>
File: ComfyUI_00432_.png (2.9 MB, 1536x1376)
>>101882305
>>
>>101882428
It's talking about her actual headband, seems like it didn't notice her blindfold.
>>
>>101882416
Those terms are far more common as labels than "asses". I know it sounds cheesy though
>>
>>101882416
>All I want is a caption model that stops calling asses bottoms and buttocks.
make a python script that changes the "bottoms" and "buttocks" words to "ass", you could even include a % of replacement so that you get diversity, post processing is a thing and I love that shit
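
A minimal sketch of that post-processing idea, assuming captions are plain strings. The function name `swap_terms` and the default `rate` are made up for illustration; only the word list and the replacement term come from the post.

```python
import random
import re
from typing import Optional

# Words the captioner tends to use (taken from the post above).
PATTERN = re.compile(r"\b(bottoms|buttocks)\b", flags=re.IGNORECASE)


def swap_terms(caption: str, replacement: str = "ass", rate: float = 0.7,
               rng: Optional[random.Random] = None) -> str:
    """Replace each matched word with `replacement` with probability `rate`."""
    rng = rng or random.Random()

    def swap(match: re.Match) -> str:
        # Keep the original word some of the time so the captions stay varied.
        return replacement if rng.random() < rate else match.group(0)

    return PATTERN.sub(swap, caption)


if __name__ == "__main__":
    print(swap_terms("A woman seen from behind, her buttocks visible.", rate=1.0))
```

Passing a seeded `random.Random` makes a run reproducible, which helps when the same dataset gets re-captioned later.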
>>
File: ComfyUI_00433_.png (3.54 MB, 1536x1376)
>>
File: tmp9jrwelh5.png (2.31 MB, 1440x1440)
>>
>>101882374
I get A LOT of art style consistency (the thing people say Flux doesn't have) when using a prompt from this (or chatgpt)
>>
>>101882455
kek
whats the artist tag?
>>
File: 00407-1150490355.png (1.38 MB, 1152x864)
>>101882455
YUGE MISTAKE PAL
>>
>>101882455
is this flux?
>>
>>101882443
not really, no one in a relaxed casual conversation would call it a buttocks or a bottom
>>
File: 00095-1421731025.png (959 KB, 896x1152)
>>
>>101882488
>american detected

this is a euro board
>>
>>101882465
I know people criticize language models for sounding "generic" but they can all understand each other really well.
>>
>>101882351
>Sounds too good to be true, what's the catch?
Nobody but the person claiming to have written the code has seen the code.
>>
>>101882488
>no one in a relaxed casual conversation
but classifiers that label the images do not
>>
>>101882488
"Ass" can mean "donkey," so language models prefer to use the less ambiguous terms.
>>
File: ComfyUI_00435_.png (2.67 MB, 1536x1376)
>>101882466
just 'a comic drawing'
>>101882483
yeh
>>101882475
GIT
>>
File: Capture.jpg (332 KB, 1635x1694)
>>101882500
I think he added more details in that reddit comment, I think he's too knowledgeable to be full of shit, time will tell
https://www.reddit.com/r/StableDiffusion/comments/1erj8a1/comment/li0hwmt/?utm_source=share&utm_medium=web2x&context=3
>>
File: ComfyUI_00921_.png (993 KB, 1288x848)
>>
>>101882522
Samefagging to add an illustrative example.
>>
LLMs will conform to MY WAY of prompting
I WILL NOT conform to THEIR way
>>
>>101882397
https://huggingface.co/spaces/fancyfeast/joy-caption-pre-alpha/tree/main

git clone, create a new venv for it, activate venv, pip install the requirements then run the file and it will launch the gradio. For batching you will need to edit the file.
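
For the batching part, a rough sketch of the loop you would wrap around the captioner. `caption_fn` is a hypothetical stand-in for whatever function in the demo's script turns an image into text; nothing here is the demo's actual code, and the sidecar `.txt` layout is an assumption about how you want the dataset stored.

```python
from pathlib import Path
from typing import Callable

IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".webp"}


def caption_folder(folder: str, caption_fn: Callable[[Path], str],
                   overwrite: bool = False) -> int:
    """Write a .txt caption next to every image in `folder`; return how many were written."""
    written = 0
    for image_path in sorted(Path(folder).iterdir()):
        if image_path.suffix.lower() not in IMAGE_SUFFIXES:
            continue  # not an image, ignore
        sidecar = image_path.with_suffix(".txt")
        if sidecar.exists() and not overwrite:
            continue  # already captioned in an earlier run
        sidecar.write_text(caption_fn(image_path).strip() + "\n", encoding="utf-8")
        written += 1
    return written
```

Skipping existing sidecar files means an interrupted run can be restarted without redoing finished images.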
>>
>>101882560
Did you download it? How big is it? How much VRAM does it ask compared to Florence?
>>
>>101882536
This would be awesome because I hate fucking around with LoRAs.
>>
>>101882374
Feed the tags with it like the character name and it will do even better. You can change the prompt in the .py file
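
Something like this for building the prompt, as a sketch only. The base instruction text and the function name are placeholders, not the prompt the demo actually ships with, so swap in whatever the .py file uses.

```python
from typing import Iterable, Optional


def build_caption_prompt(tags: Iterable[str], character: Optional[str] = None,
                         base: str = "Write a descriptive caption for this image.") -> str:
    """Append known booru tags (and an optional character name) to the captioner prompt."""
    seen = []
    for tag in tags:
        clean = tag.strip().replace("_", " ")  # booru tags use underscores
        if clean and clean not in seen:        # drop empties and duplicates, keep order
            seen.append(clean)
    parts = [base]
    if character:
        parts.append(f"The character is {character}.")
    if seen:
        parts.append("Known tags: " + ", ".join(seen) + ".")
    return " ".join(parts)
```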
>>
>>101882580
Don't you need way more images for a full finetune. If you just want a character doing a thing I think a LoRA is a much more time and effort saving task.
>>
>>101882576
its llama 3.1 8B + adapter so whatever that takes. For sure less than 24GB
>>
>>101882583
>Feed the tags with it
you can't on this demo unfortunately
>>
File: 00419-1150490356.png (1.31 MB, 1152x864)
>>
>>101882595

>local diffusion general

>>101882560
>>
>>101882595
just run it local >>101882560
>>
>>101882592
I'm surprised they went for L3.1, why not Gemma2-9b, that one has better benchmarks
>>
>>101882605
prob the license or the architecture
>>
Does anyone have a script for training a LoRA on 16gb VRAM?
>>
File: ComfyUI_01956_.png (1.5 MB, 1536x640)
goodnight /ldg/

flux development is moving too fast it consumes my entire day
>>
>>101882603
Using a demo of a local model is within the scope of "local diffusion general" bucko
>>
>>101882615
sounds like cloud diffusion to me, host it locally and do what this anon said >>101882560
>>
>>101882604
where do you download it for local?
>>
>>101882634
Has no one used huggingface before? Google it.
>>101882560
>>
>>101882615
you are correct
>>
I'm gonna go ahead and say it. I don't think we can train flux on 24gb of vram and this is all a hoax.
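
Back-of-envelope numbers for why the claim looks suspicious, assuming a naive setup: 12B parameters, fp16 weights and gradients, and AdamW keeping two fp32 states per parameter. Activations are ignored, and none of this describes what the linked PR actually does, since the thread doesn't say.

```python
def full_finetune_gb(params_billion: float, weight_bytes: int = 2,
                     grad_bytes: int = 2, optimizer_bytes: int = 8) -> float:
    """Rough memory in GB (1e9 bytes) for weights + gradients + optimizer state."""
    # billions of params * bytes per param = GB
    return float(params_billion * (weight_bytes + grad_bytes + optimizer_bytes))


if __name__ == "__main__":
    print("weights only:", full_finetune_gb(12, grad_bytes=0, optimizer_bytes=0), "GB")
    print("naive AdamW :", full_finetune_gb(12), "GB")
```

The weights alone already fill a 24 GB card under these assumptions, so a 24 GB finetune would have to cut or offload the gradient and optimizer terms somehow.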
>>
>>101882591
>Don't you need way more images for a full finetune. If you just want a character doing a thing I think a LoRA is a much more time and effort saving task.
I think a finetune of flux (I'm talking about a real finetune, with a shit ton of pictures in it) is a must for 3 reasons:

- NSFW, duh
- Flux is severely undertrained, it doesn't know many concepts, and for a 12b model it can probably eat several billion more pictures before saturating (we'll never reach that limit though kek)
- Flux-dev is a distilled model, finetuning it will transform it into a more natural model, dunno if that will improve anything but heh, let's see
>>
File: ComfyUI_00088_.png (1.33 MB, 1024x1024)
question, does LoRa training for flux work now?
is there a guide or something for it on how to do it?
>>
>>101882654
https://github.com/ostris/ai-toolkit

This is the most straight forward
>>
File: ComfyUI_00437_.png (3.29 MB, 1536x1376)
>>
>>101882650
>finetune
can someone explain to me what's the difference between a finetune and a pretraining? a finetune will use fewer layers of the model, right?
>>
File: 00024-317819023.jpg (419 KB, 1120x1440)
>>
File: ComfyUI_00928_.png (1.07 MB, 1288x848)
>>
>>101882654
>question, does LoRa training for flux work now?
>is there a guide or something for it on how to do it?
https://github.com/bghira/SimpleTuner/blob/main/documentation/quickstart/FLUX.md
>>
>>101882650
No I totally get why a finetune is necessary, but for personal use to get something very specific to your needs a LoRA seems way more appropriate than a full finetune.
>>
>>101882712
they both work, and a finetune can also add more concepts into it, so you'll need to download fewer loras for our use case, which is always good. I like when my model can output everything I have in mind without having to worry about looking for a lora on civitai... if that one ever exists that is
>>
>>101882686
Pre training = initial training run to get the thing working
Fine tune = training for a specific task or set of tasks for a model that has already been trained
Continued training = fully updating the model across the board

I think. Someone correct if wrong lol.
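
To put numbers on the full finetune vs LoRA side of it: a full finetune updates every weight of a layer, while a LoRA freezes the layer and trains two small low-rank matrices next to it. A quick count for one linear layer, with a made-up 3072x3072 size and rank 16 (not claiming these are Flux's real dimensions):

```python
def full_finetune_params(d_in: int, d_out: int) -> int:
    """Trainable weights when the whole d_out x d_in matrix is updated."""
    return d_in * d_out


def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Trainable weights for a LoRA: A is rank x d_in, B is d_out x rank."""
    return rank * (d_in + d_out)


if __name__ == "__main__":
    full = full_finetune_params(3072, 3072)
    lora = lora_params(3072, 3072, rank=16)
    print(full, lora, f"{100 * lora / full:.2f}% of the layer")
```

So it is the same number of layers either way; what changes is how many weights per layer get trained.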
>>
File: 00441-1150490357.png (1.53 MB, 864x1152)
>>101882670
cute
>>
File: 00026-2490054030.jpg (245 KB, 1120x1440)
>>
>>
>>101882779
Will this create a 3D image if I cross my eyes?
>>
>>101882764
Pretty close Marlboro logo plus Shell, based
>>
>>101882785
Try it sir
>>
>>101882764
impressive, it has the feel of the 90's anime, prompt?
>>
>>101882797
Mmm, kinda?
>>
>>101882645
It works, just wait :)
>>
>>101882785
i was just trying that myself.. didn't work for me.. dunno if its because it's too big or if its just not shifted
>>
>>101882824
Looks like you're the legit guy who claimed that shit, your ":)" was a dead giveaway >>101882536
So... when will you upload the code?
>>
File: 00033-3702294881.jpg (244 KB, 1120x1440)
>>101882800
I used, "a retro style, anime OVA, VHS cover art from 1988" then the rest
>>
>>101882853
can you give me the full prompt, I wanna see if that can be improved with CFG 6 + GuidanceNeg 10
>>
File: Capture.jpg (449 KB, 3243x1381)
Does Flux know what an adult looks like?? It always reverts back to chibi style when the prompt gets complex
>>
>>101882842
When it's ready. I promise it's real though. :)
>>
>larping
>>
>>101882817
That's stereo 3D friend
>>
File: ComfyUI_00898_.png (1.52 MB, 1024x1024)
>>
File: 4x.jpg (10 KB, 380x128)
>>101882872
We'll see anon, we'll see, will you end up as a hero or a clown is the question
>>
File: ComfyUI_00926_.png (1 MB, 1288x848)
>>
Test
>>
File: ComfyUI_00945_.png (1.14 MB, 1288x848)
>>
File: 00037-1858885819.jpg (255 KB, 1120x1440)
>>101882856
are you the guy from reddit?
>>
>>101882897
Congratulations on your ban expiring.
>>
>>101882901
yeah lol
>>
Guys I just tossed out my second 3090. Looks like vram means nothing anymore.
>>
File: 00041-879363358.jpg (235 KB, 1120x1440)
>>101882906
I fucking hate redditors
>>
File: temp_mzgpt.png (2.43 MB, 1120x1440)
>>
File: ComfyUI_31670_.png (1.34 MB, 1024x1024)
>>
File: file.png (1.04 MB, 1302x1200)
new grok model just dropped on x. apparently its better than midjourney
>>
File: ComfyUI_03882_.png (1.84 MB, 1184x1592)
>>
File: 00046-227087702.jpg (184 KB, 1120x1440)
>le women on the grass test
>>
>>101882982
it literally didn't gen what he asked for
>>
>>101882982
>not local
>>
>>101882996
I see cum on her belly
>>
>>101883010
shh... let them think that's SOTA
>>
>>101882982
can you provide link or something?
>>
>>101882982
It's literally using Flux nigger.
Also
>giving Flux access to every tech illiterate normie
I hate Elon.
>>
File: 00052-3654891057.jpg (168 KB, 1120x1440)
>>
>>101883037
>>giving Flux access to every tech illiterate normie
>I hate Elon.
what do you mean? Elon made a flux API or something? Can you give me a source or something?
>>
File: 00053-974218020.jpg (109 KB, 1120x1440)
>>
>>101883059
>>101883076
neat
>>101883074
>it uses flux
the voices told him this
>>
>>101883080
>the voices told him this
>>
>>101883074
It's a new subscription based service on X, shows up when you open up the app.
>>
>>101882779
Took me a bit, but this actually makes a pretty good 3D effect when you get your eyes crossed right. How'd you make it?
>>
File: 00055-3643169300.jpg (158 KB, 1120x1440)
bruh
>>
>>101883089
>>101883080
https://xcancel.com/nima_owji/status/1823388838279922166#m
>>
>>101883116
thanks anon
>It'll also generate images using the FLUX.1 model!
It's impressive how much FLUX has changed the landscape forever, and it's only been 10 days. People are focusing on quants (nf4), on more training optimisations, fucking Musk decided to use it as an API, that's how you know how good a model really is. When SD3M got released, the only thing it managed to add to the community was the "le funny women lying on grass meme" kek
>>
File: 00060-542790689.jpg (244 KB, 1120x1440)
New Quant: flux1-dev-bnb-nf4-v2.safetensors
https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/1079
>>
File: ComfyUI_00135_.png (1.45 MB, 1024x1024)
>>101882663
>>101882701
thanks bros, gonna read up on it a bit.
>>
Bigma next month
>>
>>101883150
I'm not sure if them working directly with X is good news for open source. I guess time will tell, but those guys want a competitor to MJ and Elon replied to the CEO all the time, so it could be bad.
>>
>>101883165
>I promise this time it's better than fp8!1!1!1!
Fool me once, shame on you. Fool me twice...
>>
How can we get rid of **** shitting up /sdg/
>>
>>101882884
kek :(
>>
>>101883201
not my problem, this is /ldg/ go back fixing your shit in your home
>>
>>101883101
more like this or like Simon Stalenhag art
>>
>>101883165
will this still work on comfy's nf4 node? I wanna try it out
>>
>>101883199
NF4 is pretty fast, i'm just waiting for lora support
>>
>>101883257
>NF4 is pretty fast
it's almost the same speed as fp8 if you have enough vram to run it, nf4 isn't fast because of optimized math calculation, but simply because nf4 will be less likely to end up into your ram
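
Rough arithmetic behind that, assuming a 12B model and counting only the weights. The 2 GB `overhead_gb` is a made-up placeholder for everything else that has to sit in VRAM, and the small per-block constants nf4 stores are ignored.

```python
BYTES_PER_PARAM = {"fp16": 2.0, "fp8": 1.0, "nf4": 0.5}


def weights_gb(params_billion: float, dtype: str) -> float:
    """Approximate weight size in GB (1e9 bytes) for a given storage format."""
    return params_billion * BYTES_PER_PARAM[dtype]


def fits_in_vram(params_billion: float, dtype: str, vram_gb: float,
                 overhead_gb: float = 2.0) -> bool:
    """True if the weights plus an assumed fixed overhead fit on the card."""
    return weights_gb(params_billion, dtype) + overhead_gb <= vram_gb


if __name__ == "__main__":
    for dtype in BYTES_PER_PARAM:
        print(dtype, weights_gb(12, dtype), "GB,",
              "fits in 12 GB" if fits_in_vram(12, dtype, 12) else "spills past 12 GB")
```

On a 24 GB card both fp8 and nf4 fit under these assumptions, which lines up with the two running at about the same speed there.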
>>
File: FLUX_00065_.png (1.2 MB, 896x1152)
pc rebuild today
probably, if I can be bothered
>>
File: temp_mzgpt.png (1.48 MB, 1120x1440)
>>
File: ComfyUI_00037_.png (805 KB, 832x1216)
>>
>>101883268
well it just werks
>>
>>101883116
Honestly this perfectly explains why Flux can't do artists. There would be an outrage.
>>
>>101883378
>Honestly this perfectly explains why Flux can't do artists. There would be an outrage.
what do you mean? SD1.5 and SDXL survived and they could do artists and celebrities well
>>
>>101882613
Does anyone have any kohya training script for flux at all? I can adapt it but I am very stupid and keep getting errors
>>
Does ComfyUI just let everything sit in RAM until it's like 90% full before freeing up unnecessary stuff (like the unet fully loaded in VRAM)?
>>
File: Capture.jpg (269 KB, 3088x1395)
>>101883165
Good news, nf4-v2 still works on ComfyUi nf4 node
https://github.com/comfyanonymous/ComfyUI_bitsandbytes_NF4
>>
File: 00088-4228999352.jpg (114 KB, 1120x1440)
>Anon, you're so funny...can't believe you're my coworker
>>
File: 1701741685098447.jpg (77 KB, 896x1152)
>>
File: ComfyUI_00040_.png (1.45 MB, 1344x768)
>>
>>101883523
When did it stop?
>>
>>101883565
Flux does frutiger aero???

Could you please share your prompt?
Thank you!!!!!
>>
>>101883677
he changed the architecture a bit on v2, so I supposed there would be an error or something
>>
File: ComfyUI_00153_.png (1.29 MB, 1024x1024)
>>
Looks like the ays/ays+ schedulers manage to remove the white effect on Dynamic Thresholding
https://imgsli.com/Mjg3NDM0/0/1
>>
>>101883689
sure fren, I just put a frutiger aero image into Joy Caption and fixed its mistakes for the prompt, it's long:
>This is a digitally created artwork depicting a cityscape on an island floating in the ocean. The scene is vibrant and detailed, featuring a blend of natural and man-made elements. The background is a clear, bright blue sky dotted with fluffy white clouds. In the foreground, the water is crystal clear, allowing for a view of the ocean floor. The water surface is rippled, and there are several small fish, including a yellow one with black and white markings, swimming near the bottom.

>To the left of the image, a large, blue, spherical globe is partially submerged, with water flowing out of it. The globe appears to be made of glass or a transparent material, and it is partially cracked, with water cascading out in a dynamic, realistic manner.

>On the right side, a futuristic cityscape rises from the water. The buildings are modern skyscrapers with sleek, glass exteriors in various shades of blue and white, reflecting the sky and water. The cityscape is detailed, with intricate architectural designs, suggesting a high-tech metropolis.

>The overall style is a mix of realism and fantasy. The artwork combines vibrant colors and detailed textures to create a visually striking and imaginative scene. The image evokes a feeling of hope for a better future like that depicted in the image, along with a refreshing feeling thanks to the bright colors and vibrant, healthy coexistence of nature and man.
>>
>>101883715
Thank you again! This is great news. Have not played with Flux yet so can't wait to use it to create inspiration gens to create in Photoshop. (Or shop gens)
>>
File: ComfyUI_00154_.png (2.28 MB, 1920x1080)
>>
File: ComfyUI_00023_.png (977 KB, 1216x832)
>>
>>101882351
>https://github.com/kohya-ss/sd-scripts/pull/1374#issuecomment-2287134623
Will this mean anything for potentially smaller vram requirements for loras too if it works for finetunes?
>>
File: FluxBeLike.png (240 KB, 640x476)
>>101883715
>>101883689
>>
>>101883755
kek
>>
File: ComfyUI_00155_.png (2.22 MB, 1920x1088)
>>
File: ComfyUI_00044_.png (1.3 MB, 1344x768)
I should make the prompt longer
>>
>big anime titty LoRA has more downloads than the base model
>>
>>101882536
Wait is quant lora training out now? What's vram reqs?
>>
>>101883793
>almost 60k downloads
Flux Won.
>>
how do you upscale with flux? Is it just like upscaling with other models?
>>
File: 00450-2616587780.png (829 KB, 896x1152)
>>
>nf4 on comfy breaks LoRAs

What's the fucking point then?
>>
File: 00453-985245280.png (830 KB, 896x1152)
>>
I wish there was a slightly smaller param flux that was more vramlet friendly. It doesn't really feel like flux truly needed as many params as it has. And I know vramlets can run it still with quantz but loras being limited to a 3090 minimum sucks ass
>>
>>101883880
How about instead of cucking to vramlets, we all hit Nvidia with an anti trust lawsuit and force them to make better cards and a reasonable price?
>>
>>101883880
Nvm just read this
https://github.com/kohya-ss/sd-scripts/pull/1374/files
Fuck you guys for making fun of me when I asked if 12gb lora training had any chance of being possible in the future
My salty poor fag ass is back baby
>>
>>101883902
Meds. NOW.
>>
Gm Eurobros
>>
File: ComparisonFp8_nf4-v2.jpg (1.36 MB, 3070x2372)
>>101883165
>>
>>101883845
the weights are too different, it's like being mad Pony loras don't work on XL
you're throwing a tard tantrum
>>101883964
GOOD MORNING SAR
>>
File: 00008-3658315987.png (1.07 MB, 896x1152)
>>
>>101883960
The fuck are you on about fag
>>
>>101883973
>You're being unreasonable for seeing an issue with the fact that half the community wont be able to use LoRAs with the only viable way to use Flux for them.

I'm not even a vramlet and can see this is an issue.
>>
>>101884003
Nta but isn't this fixed by just training the Loras on the nf4 version...?
>>
>>101883996
>Fuck you guys for making fun of me when I asked if 12gb lora training had any chance of being possible in the future
You're literally making up fake scenarios in your head to be upset with us. That's what crazy people do, anon.
>>
>>101883393
SD and SD 1.5 were never at the hands of normies.
>>
>>101884014
And now they don't work on the fp16 and fp8 version.
>>
>>101884039
This literally happened a few threads ago you retard, rope
>>
>>101884058
Well, yeah? You either train for both or the community ends up favoring majority support for one (most likely nf4 because the vast majority of users will be vramlets)
>>
>>101884055
>SD and SD 1.5 were never at the hands of normies.
you're joking? SD1.5 was a 0.75 model, everyone could run it on their potato PC, you can't make a more normie and accessible model than this one
>>
>>101884076
I know and that's a problem. I don't want to lower my standards just to please some poo in the loo who wants to make celebrity porn.
>>
>>101884092
Well don't train for nf4 or use nf4 Loras then, I don't see how this is an actual issue for you
>>
>>101883972
So... nf4-v2 is actually slower than fp8? damn...
>>
>>101884087
Right, they were at the hands of anyone who did their due diligence and knew how to download a file on their computer and run some commands. Not literal retards who are dumb enough to browse X.
>>
>>101884103
It's an issue for me that even vramlets make good LoRAs.
>>
>>101884111
maybe in some cases with a 3090 that doesn't need it anyways, for me it's way faster
>>
>>101884114
you think that flux will be uncensored on X's api? LOOOOOOL
>>
>>101884120
>makes good Loras
>poo in the loo who wants celeb porn
Pick one
Or buy me a 3090 and I'll make you every lora you've ever desired
- t. vramlet in question

Wouldn't be a problem if we could actually pool multi GPU memory for lora training though reee
>>
>>101884141
Not fully, but take a look at https://xcancel.com/SpaceX69_420/status/1823389265427939753#m

Of course they will censor it more after all this, but if Elon gets a stake in Flux then say goodbye to 2.0 being able to reproduce images like this once enough normies produce "harm".
>>
>>101884175
>Of course they will censor it more after all this, but if Elon gets a stake in Flux then say goodbye to 2.0 being able to reproduce images like this once enough normies produce "harm".
fuck... you got a point there...
>>
>>101877628
Thanks. You linked wrong but "posts" was the correct one.
>>
>>101883972
https://imgsli.com/Mjg3NDUy
>>
File: 25513123.png (587 KB, 776x607)
>>101884175
Also don't forget that SD 1.5/XL had seething Twitter trannies.

The entire fiasco is what caused the CEO to cave in (because he clearly wasn't acting in good faith) and pic rel.
>>
sooo what are we using to caption datasets for flux loras? running them through chatgpt?
>>
>>101884209
If you give them an inch they'll take the arm, if you kneel for pardon, they'll remove their pants for a succ. SAI just don't understand that the cucking they are doing to their models will never be enough to those twitter crazies, all they want is to kill AI, not just have a cucked AI.
>>
>>101884175
But that was always going to be the case, no? They get acquired by someone, and if they try to push the lid on a genie bottle like stability, the devs are just going to leave and do it all over.
>>
>>101884242
>sooo what are we using to caption datasets for flux loras?
>>101882247
>>101882261
>>
>>101884257
It still isn't clear they "defected", for all we know they just formed a rival company and are feeding off the open hype before going fully closed like Mistral and "Open"AI. For all we know Flux while brilliantly trained didn't include that many different artists in its training data after all.
>>
Flux paper when?
>>
>>101884312
>before going fully closed like Mistral
Isn't Mistral's best model right now free to download?
>>
>>101884261
thanks anon, wasn't sure if that was the general go to or just NSFW
>>
File: ComfyUI_00052_.png (3.01 MB, 1824x1248)
Heun makes these even better, too bad it's so slow
>>
>>101884387
well, you could probably get even better captions for SFW using a cloud model but then it's not local and you have to pay
>>
File: ComfyUI_03706_.png (1.7 MB, 1024x1024)
>>101884400
Go for deis, this shit is good and as fast as euler
>>
would clip_l + t5 not be capable of understanding booru tags if a lora was trained on them instead of natural language?
I'm under the impression it would but if no one knows I'll try a test run tomorrow
>>
>>101882247
Tried it with a recent gen and this about one of the subjects
>The man on the right is facing away from the camera, but his blond hair and attire suggest he is likely the same individual as the one in the previous photograph.
It is obviously Donald Trump to any human that has seen him even just once but the mention of a "previous photograph" really lowers my trust in this caption model.
>>
>>
>>101884445
can you instruct it to caption each image individually without reference to any other image etc?
>>
Is there anything in local which can mimic NAI's style? I haven't done any image gen since sd 1.5 tunes so i'm going to have to relearn everything.
>>
>>101884483
probably, I'd have to clone the space and run it locally to try
>>
>>101884485
animagine sdxl is probably your best bet unless you want NSFW, then you'll want to try some pony mix like autism confetti and abuse loras
still not quite on the same level as NAI but depending on what styles you're after it may be satisfactory. you're unironically better off looking at the /hdg/ or /e/ most likely, as everyone here is currently balls deep into flux which is kind of shit with styles (finetunes pending and loras in progress)
>>
>>101884485
T-ponynai3 is probably the closest, it's a pony mix designed specifically to look like NAI
>>
File: ComfyUI_00055_.png (2.78 MB, 1824x1248)
>>101884433
damn you were right
>>
>>101884385
I still remember when Medium wasn't released by them and large was API only, but it seems competition has made them release one of their best models.
>>
File: FluxDev_00959_.jpg (226 KB, 832x1248)
>>101884569
>>
>>101884185
np, just assumed the file named 'tags' was the right one for the tags you wanted
>>
>>
>>101884385
>Isn't Mistral's best model right now free to download?
they were forced to open source it due to competition, mistral's new 123b model was their answer to llama 3.1 405b. flux doesn't really have much competition right now so it's kind of like the sd era where only one company dominates.
>>
File: ComfyUI_00058_.png (3.11 MB, 2016x1152)
>>
>>101884647
Doesn't SAI have an 8B model that is supposedly pretty good?
>>
>>101884672
Oh, I like this. Might make it my background.
>>
>>101884647
Hopefully Hunyuan, Bigma and Lumina change that.
>>
>>101884678
>Doesn't SAI have an 8B model that is supposedly pretty good?
I tried SD3-8B on their api and this shit isn't even close to Flux, what an embarrassment
>>
>>101884678
SAI is in such a bad shape i doubt they could afford to release it without collapsing
>>101884694
yeah, image gen really really needs more players competing in the field
>>
https://github.com/comfyanonymous/ComfyUI/issues/4343#issuecomment-2287947547
>Can you try running it with: --disable-cuda-malloc to see if it improves things?
Comfy, it worked fine before with malloc, you shouldn't ask us to remove that feature, but to make it work again with malloc instead
>>
man trying to use the adaptive cfg workflow whatever with a lora and this shit is so fucking slow on a 3060
>>
File: ComfyUI_00061_.png (3.52 MB, 2016x1152)
>>101884690
Glad you like it anon, I think I finally found some settings to really get the aesthetic right.
>>
>>101884771
do you also have adaptive guidance? it makes shit faster
>>
>>101884699
Their 8B is 100% not as good as Flux. That's like saying theirs is like Dalle. It was never that good at prompt following, it was more on the level of Sigma at prompt following (a literal 600m model), that's how big their embarrassment was...
>>
>>101884784
>I think I finally found some settings to really get the aesthetic right.
Can you share it with us? I might learn something or two
>>
File: FLUX_00037_.png (1.26 MB, 896x1152)
it's really annoying that it's adequate with vapes but terrible with cigarettes
I get it, people don't usually hold things between their index and middle finger so it gets confused, but fuck
>>
>>101884785
Yea, it actually seemed like it was A LOT faster without the adaptive guidance when i was using a different workflow
>>
>>101884806
there's no way it's faster without adaptive guidance, maybe you should update ComfyUi and try again
>>
did the guy with the 4090 that took 3 minutes to gen one image ever figure out the issue?
>>
>>101884710
SAI just says whatever to keep the scam going. I bet the idiots who reinvested into SAI right before BFL released their model feel like idiots right now.
>>
>>101884799
Of course! Most of the prompting is just from putting frutiger aero images into Joy Caption. https://files.catbox.moe/pncyo6.png
>>
>>101884847
>Of course! Most of the prompting is just from putting frutiger aero images into Joy Caption.
I think you'll have more success if you put it through gpt4v, you can do that since it's SFW kek
>>
>>101884821
im getting like 145s/it. I've updated multiple times...it was way faster on an old version but i was getting the white grid.
>>
>>101884766
Comfy is a talentless hack who seethes and rages constantly while denying any fault, what more can you really expect? Mentally ill homo who has had multiple public melties and actively points people towards them because he lacks the depth of mind to recognize it's shameful and makes him look like more of a retard
>>
>>101884766
Even without malloc I still have OOM on a lora load, the fuck did he do?
>>
>>101884678
SAI is unironically going to go bankrupt. They don't have anything to offer. The most baffling part is that, while for users Flux came from nowhere, SAI was well aware of it, and they genuinely thought it was going to flop, so they didn't feel pressured to release the 8B version as SD3. Even after Flux came out and people were asking SAI to reconsider, they were still convinced Flux was going to flop. Now Flux runs on 8GB of vram, can be trained on 12GB, and it's overall much better even than SD 8B, so even if SAI released it to the public no one would bother with it, not to mention the horrible licenses they pushed for.
>>
>>101884766
I used to think Comfy was so cool, bro, but he's kind of a hack, bro. What if that Forge guy made his own node based UI, bros? Wouldn't that be great?
>>
File: ComfyUI_03725_.png (1.37 MB, 1024x1024)
>>101884923
they got what they fucking deserved, it's been more than a year that we've asked them to stop acting like a fucking cuck, I'm not gonna cry on their grave, fuck them, it's flux era now
>>
>>101884923
Even in the official Stable Diffusion discord people are demanding to turn it into a Flux server because SD is DOA. KEK
>>
File: FLUX_00040_.png (1.27 MB, 896x1152)
I hate mondays
>>
>>101884923
You don't have to convince the users that SAI is garbage. The problem is they have a pretty deep set of roots that lets them farm capital to keep themselves afloat whenever they need it, based on their previous so-so track record. It's going to take a few more rounds before people realize it's just an empty shell company full of resource-sucking safety staff and HR with no actual talent on board.
>>
>>101884935
>What if that Forge guy made his own node based UI, bros? Wouldn't that be great?
Holy kek the amount of seething and dilating from comfy if that happened. Please illya, for the hilarity alone
>>
File: ComfyUI_00067_.png (2.94 MB, 2016x1152)
>>101884864
Lol you're right, I usually just default to whatever is free and open. ChatGPT gives better captions tho, helped me with this one
>>
Are LoRAs in ComfyUI applied in parallel at the same time or in sequence? I'm using a node like Power Loras that allows loading multiple LoRAs.

The use case would be, for example, to first load a LoRA of a person, and then the realism LoRA. So that it "fixes" the lower quality of the person's LoRA?
>>
>>101885095
that's not how loras work in anything really, you can't really fix bad quality on one lora with another, the bad quality lora will just add bad quality any time it's used
>>
>>101884923
SAI is going to triumph in the end, because they are the only company that takes AI safety seriously. Have a good time prompting porn with your toy KEK. Meanwhile SD will be used by adults as a tool for our jobs.
>>
>>101885095
lora load order doesn't change the resulting weights
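A minimal sketch of why that holds, assuming each LoRA is merged as an additive low-rank delta on the base weight (W + strength * up @ down), which is the usual formulation. The matrices, strengths and helper names below are made up for illustration and are not ComfyUI's actual code:

```python
import random

def matmul(a, b):
    # Plain nested-list matrix product.
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

def apply_lora(w, lora):
    # Hypothetical helper: add strength * (up @ down) to the weight matrix.
    up, down, strength = lora
    delta = matmul(up, down)
    return [[wij + strength * dij for wij, dij in zip(wr, dr)]
            for wr, dr in zip(w, delta)]

rng = random.Random(0)
base = rand_matrix(4, 4, rng)
# Two rank-2 LoRAs with arbitrary strengths (illustrative values only).
person = (rand_matrix(4, 2, rng), rand_matrix(2, 4, rng), 0.8)
realism = (rand_matrix(4, 2, rng), rand_matrix(2, 4, rng), 0.5)

person_first = apply_lora(apply_lora(base, person), realism)
realism_first = apply_lora(apply_lora(base, realism), person)

# Addition commutes, so any difference is float rounding only.
max_diff = max(abs(x - y)
               for ra, rb in zip(person_first, realism_first)
               for x, y in zip(ra, rb))
print(max_diff)
```

Both deltas are just summed onto the same base weights, so the second LoRA never "sees" or corrects the first one, it only adds its own change on top.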
>>
The latest updates to forge fucked something up. I'm suddenly getting out of memory errors and crashes when using schnell with the exact same settings I was using yesterday. Dev still works fine, strangely.
>>
File: ComfyUI_00071_.png (1.88 MB, 1152x2016)
>>
Can i run flux on a 3060 12gb?
>>
>>101885195
yes, you can even train LoRAs on it now
>>
>>101885195
mostly, hope you have a good cpu
>>
>>101885195
Nigga, you can fine tune it with 12gb now.
>>
>>101885213
Which model should i get for that gpu?
>>
File: ComfyUI_00073_.png (1.88 MB, 1152x2016)
God I wish I could live here.
>>
>>101884400
heun 10 steps = euler 20 steps
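One way to read this: Heun evaluates the model twice per step (a predictor plus a corrector), so 10 Heun steps cost about the same as 20 Euler steps. A toy sketch on the ODE dy/dt = -y, which stands in for the sampler here and is not an actual diffusion model; real samplers may also skip the correction on the final step:

```python
import math

def integrate(f, y0, t0, t1, steps, method):
    # Fixed-step integrator that also counts function ("model") evaluations.
    h = (t1 - t0) / steps
    t, y, evals = t0, y0, 0
    for _ in range(steps):
        k1 = f(t, y)
        evals += 1
        if method == "euler":
            y = y + h * k1
        else:
            # Heun: take an Euler predictor, then average the two slopes.
            k2 = f(t + h, y + h * k1)
            evals += 1
            y = y + h * 0.5 * (k1 + k2)
        t += h
    return y, evals

def decay(t, y):
    # Toy ODE with known solution y(t) = exp(-t).
    return -y

exact = math.exp(-1.0)
y_euler, evals_euler = integrate(decay, 1.0, 0.0, 1.0, 20, "euler")
y_heun, evals_heun = integrate(decay, 1.0, 0.0, 1.0, 10, "heun")

print("euler:", evals_euler, "evals, error", abs(y_euler - exact))
print("heun: ", evals_heun, "evals, error", abs(y_heun - exact))
```

On this toy problem the two runs use the same number of evaluations and Heun ends up closer to the exact answer. Whether that carries over to image quality at a given step count depends on the model and the noise schedule.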
>>
>>101885298
Any recommendation on what to try? For realistic generations
>>
>>101885259
>the future we were promised
>>
>>101885077
>helped me with this one
you're welcome anon
>>
>>101885224
Iunno, I'm still using the day1 setup and workflows, the fp16 dev model in fp8 precision
>>
>>101885195
I am but it takes like 20 minutes to get 1 pic on cfg 6 with a lora. cfg 1 aint bad tho
>>
File: ComfyUI_Flux_8469.jpg (259 KB, 768x1344)
>>
File: ComfyUI_00077_.png (2.02 MB, 1152x2016)
>>101885403
thamks
>>
thread ded?
>>
site ded
what are the FBI investigating now
>>
>>101885591
posting was fubar'd from around 6am eastern until a few mins ago. check a few other generals and there will be big gaps there too
>>
>>101885591
no hiroshimoot and the feds feeding off our data can't handle keeping the site up for long periods of time throughout the years anymore

>just by sheer coincidence blacked porn was left up on /v/ this whole time
>>
File: asa005.jpg (136 KB, 635x1538)
>>101885591
4chan's captcha or cloudflare was dead
>>
File: FD_00045_.png (1.7 MB, 1024x1024)
I blame the pedo.

In other news, can we train LoRAs on 16gb vram yet?
Also nf4 v1 vs nf4 v2 vs fp8 vs fp16
https://imgsli.com/Mjg3NDg5/1/3


