/g/ - Technology

File: ComfyUI_00046_.png (1.18 MB, 1024x1024)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>101933401

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: ComfyUI_02896_.png (2.38 MB, 1224x1224)
>>
File: image.jpg (91 KB, 1536x1024)
>>
File: file.png (640 KB, 1834x783)
>>101935320
anyone know any good post processing nodes for comfy? this was my bootleg method
>>
File: delux_flebo_00046_.png (1.98 MB, 1152x1536)
>mfw
you guys never fill your threads
>>
>>101935327
>Welcome to the buttchin bus, stud.
>>
What models do you anons use to caption dataset for Flux? I normally used WDtagger Swinv2 on A1111 for dataset tagging. Is it really necessary to boomer prompt your captions for flux?
>>
File: delux_ci_00029_.png (1.62 MB, 1536x968)
asking again

what’s ldg’s current meta for guidance/cfg with gguf?
>>
>>101935334
Is Debo the first trans avatar fag that made the leap from /sdg/ to /ldg/?
>>
>>101935346
Seems to eat all my vram. I tend to avoid cfg all together unless I need something very specific.
>>
are flux loras for 24GB only, cause it's slow as shit on a 4080.
>>
>>101935344
>I normally used WDtagger Swinv2 on A1111 for dataset tagging. Is it really necessary to boomer prompt your captions for flux?
natural language will always be more accurate than just tags, what if you have "woman, table, chair, sitting", does that mean the woman is sitting on the table or the chair? only a natural english sentence has the answer
>>
>>101935359
*generating, not training.
>>
File: 00101-2747064153.jpg (625 KB, 1344x1728)
>>
File: delux_ci_00030_.png (1.85 MB, 1536x968)
>>101935357
thats what my default is, just nothing fancy. its faster but I wish I had more levers of control
>>
>>101935344
Some anons are locally running this and have edited the python script to batch. Might have to ask when the thread is more active or take a look at it yourself. https://huggingface.co/spaces/fancyfeast/joy-caption-pre-alpha
>>
File: image.jpg (93 KB, 1536x1024)
>>101935338
kek
im throwing old juggernaut gens into joycaption and throwing that into flux with minor adjustments to see the differences in style. really impressed with the prompt adherence so far

>>101935344
>Is it really necessary to boomer prompt your captions for flux?
if you want nuance then yes, i'd say. more information is always good. i also have no idea what im talking about
>>
File: ComfyUI_00038_.png (1.62 MB, 1248x1824)
>>
File: image22.png (1.01 MB, 1344x768)
picrel is the juggernautXL v8 reference image. we're 100% gonna go back to old models for the early AI aesthetic and "sovl" one day. maybe flux will be considered aesthetic and sovl too in 5 years
>>
So can we make epic porn or what. Someone post a pic of trump bumming Kamala please
>>
File: flux split.png (138 KB, 1511x515)
>>101935359

I clocked 16.3 gb + 9.7 = 26 gb total vram on 2x4090 on 1 LoRa. Generation only. Maybe you can manage 24GB or lower with nf4 or something.
>>
>>101935452
we're not that kind of general! chud!!
>>
File: images.jpg (14 KB, 290x174)
>>101935452
>Someone post a pic of trump bumming Kamala please
on a blue board?
>>
>>101935465
ok, now it makes more sense. I think the play is to use sdxl/pony for loras/characters then do inpainting in flux for stuff.
>>
>>101935480
*for characters that flux cant do natively, that is
>>
File: ComfyUI_00538_.png (2.53 MB, 1824x1248)
>>101935480
Mentioned this in an earlier thread but you can also do the opposite. Use Flux to create an advanced composition and then resample it in a Pony-based model for aesthetics. You can also use a mask to keep specific items such as text from being destroyed, and resample those items at a lower denoise later.
>>
>>101935465
is that normal for flux loras to take that much vram?
>>
File: ComfyUI_02903_.png (1.87 MB, 1840x768)
>plastic Flux tits

fuck this was with the realism LoRA as well
>>
>>101935519
I cannot find an official word on this anywhere, but I assume it's bugged. There's no reason I can think of why a LoRA that is only a hundred or so megabytes is eating up all the vram.
>>
File: image.jpg (81 KB, 1536x1024)
girl in the back is built like a nutcracker
>>
>>101935532
this shits cooked
>>
File: ComfyUI_00066_.png (1.91 MB, 1248x1824)
>jeets already spamming civit with dogshit flux loras
Incredible
>>
File: image23.png (1.14 MB, 1344x768)
>>101935532
the realism lora is snake oil
i also havent seen any good tunes that keep the realism at base flux levels. everything becomes plastic slop
>>
>>101935542
>a round chin in flux

you cracked the code
>>
>>101935542
very nice
>>
>>101935537
>I cannot find an official word on this anywhere, but I assume it's bugged.
there's an issue about it, and yes it's a bug, before I had no problem loading a lora now it's overflowing my 3090 even though I had 10gb of vram to spare
https://github.com/comfyanonymous/ComfyUI/issues/4343
>>
File: delux_ci_00031_.png (1.95 MB, 1536x968)
>>101935553
its cuz civit lets them train loras with their jizz credits. theres no quality control, its all to drive mindless consumption
>>
>>101935563
>the realism lora is snake oil
most tend to be
>>
File: image.jpg (89 KB, 1536x1024)
>>101935565
>you cracked the code
the secret is unironically youthful adjectives

>This is a high-resolution photograph featuring two young Russian girls seated in the back of a modern, luxury car. The image is bathed in a vivid, neon pink and purple light, creating a surreal and futuristic atmosphere. The young girls, both of whom have fair skin and straight, long auburn hair, are dressed in matching, high-gloss white latex outfits that resemble futuristic or sci-fi attire. The outfits have high collars and exaggerated, angular shoulders, adding to the futuristic aesthetic. The teen girl on the left has a neutral expression, while the teen girl on the right has a more intense, contemplative look. The car's interior is sleek and modern, with black leather seats and a minimalist design. The ceiling is lined with neon lights that emit the vibrant pink and purple hues, contributing to the overall otherworldly ambiance. The windows are tinted, and outside, the night sky is barely visible through the tint, adding to the mysterious, isolated feel of the scene. The photograph captures a moment of introspection and modernity, blending fashion and technology in a visually striking manner.
>>
File: ComfyUI_00006_ (4).png (1.88 MB, 1248x1824)
>>101935586
Civit adding on site lora training was a mistake, 90%+ of all loras being posted now across all models are objectively dogshit. Zero quality control on that site, just an overwhelming flood of slop.
>>
>>101935545
yeah it's overcooked a little bit. still honing in the numbers on my cfg/adaptivethreshold
>>
>>101935615
do you use tonemap? if yes you could decrease the multiplier value to uncook some of it as well
>>
>>101935563
this one feels very tron
>>
>>101935509
yeah, ponyXL does characters really well but sometimes the backgrounds are not detailed, so these models can work in tandem to create good results because there are TONS of 1.5/xl/pony loras.
>>
File: delux_ci_00033_.png (1.66 MB, 1536x968)
>>101935614
they're trying real hard to figure out how to drive revenue. they never cared about quality of content but they especially don't care now.
>>
>>101935614
Is there any filter or tag that indicates it was trained on site? Really want to filter all that crap.
>>
File: ComfyUI_00003_ (2).png (955 KB, 832x1216)
>>101935644
Just speedrunning enshittification

>>101935648
Not that I'm aware of
>>
File: 1706122415727926.png (1.07 MB, 1024x1024)
>>101935614
sort by downloads, the best stuff generally gets lots of downloads. and you only need one good character lora to make a billion gens.
>>
File: 1703879726656326.png (1 MB, 1024x1024)
>>101935667
flux makes a good inpainting tool too, even without controlnets it manages things well: i'd usually use openpose or canny/depth when making clothes edits.
>>
is 20 generally the sweet spot for steps in flux? (dev model)
>>
>>101935553
>need my celebrity slop! saaar!! redeem the emma watson lora!!
Hopefully when Kamala makes deepfakes illegal they scrub all celeb slop off of civitai
>>
>>101935709
It seems to be 30 actually for the most consistent results.
>>
>>101935709
20-25

Anyone used this?

--bf16-vae --fp16-text-enc --fp8_e4m3fn-unet
>>
>>101935709
I use 30+ these days personally. The lower, the less things the model will be able to get right from your prompt. In other words it will follow your prompt less. 20 feels a bit too low for me sometimes so I bumped my default.
>>
File: 00058-949262416.png (2.24 MB, 1120x1440)
>>
File: 1723362544684045.jpg (114 KB, 772x1128)
>>
>>101935732
"realism" in image models was always a mistake. I'm also on my knees praying mommy Kamala and the EU fuck all of the jeets in the ass.
>>
File: Pegasus1.jpg (792 KB, 1920x1200)
>>101935309
Which one is the best to run locally? I was recommended flux but I don't see it in the list.. is it comfyUI?
>>
File: 1700207971848600.png (1.03 MB, 1024x1024)
pixel art is a fun prompt to start things with:
>>
File: image_merged.jpg (142 KB, 1536x1901)
JoyCaption is really good at describing the angle of a shot, not so much esoteric outfits or cyborg stuff

>>101935631
thanks, im going for a gritty neon cyberpunk aesthetic. ive noticed flux tends to add too much light to images
>>
File: 1713674690763676.png (904 KB, 1024x1024)
>>101935732
you can just use something like reactor, in flux, to face swap a txt2img gen into her if you wanted to. no need for a lora.

ie: john wick on a beach, but ryan gosling (swap)

im having way more fun making mikus though
>>
File: ComfyUI_01675_.png (851 KB, 1344x832)
>>
>>101935806
*in forge, flux is a model. or with comfy. loras are good for characters but flux generates perfectly fine human females, all you need to do is a faceswap.
>>
>>101935741
that looks like command line for setting the checkpoint weight and text encoder weights, you can set that in comfy on the nodes
>>
thread quality always takes a dive during the weekends
>>
Would you guys rather:

>load CLIP to CPU, wait 20-25 secs for every prompt change

or

>use a quant to reduce the model size on your VRAM, but potentially get worse gens
>>
>>101935850
>wait 20-25 secs for every prompt change
hell no
>>
>>101935334
is this what japanese looks like to normies?
Is that why you keep posting NTR as perfect examples of cute and innocent girls?
why cant normies and AI just learn jap?
>>
File: 1713466368095218.png (1.04 MB, 1024x1024)
this one turned out cute
>part of prompt is holding a mic
>ends up being one of the old school radio mics
>>
File: Capture.png (330 KB, 333x425)
>>101935553
Here's your Flux lora bro
>>
>>101935850
i would not go below q4
>>
>>101935860
Why did he use Fifa '18 for the dataset, is he stupid?
>>
File: image.jpg (90 KB, 1536x1024)
>>101935751
its nice to see the aesthetic is nice enough to be worth replicating

>>101935850
quants only matter if you care about reproducibility or if you're doing text stuff imo
>>
>>101935852
This happens regardless right now, because of how LoRAs are busted.
It's actually quicker to just load the clip to cpu than to wait for it to load up again on vram, oom out then load again.
>>
File: ComfyUI_02925_.png (1.34 MB, 1536x640)
>>101935863
Don't worry I'm sitting on 24GB of VRAM I just start hitting the limit when I upscale and I'm sick of my machine freezing up.

Guess I could ditch windows and plug my monitors into on-board graphics
>>
>>101935842
It was in the comments here to speed up gen time

https://www.youtube.com/watch?v=chfUGCE0AVY
>>
>>101935452
There is an anal LoRA
https://civitai.com/models/649472/anal-art-for-flux?modelVersionId=726623
>>
>>101935309
>Beginner UI
i stopped visiting this place for a few months and now A1111 is not for beginners what happened?
>>
>>101935885
Interesting, I haven't tried using loras yet - Just exploring flux after taking a break since SD3 dropped. I remember this being a problem when I was playing with sigma (before comfy was updated so it stopped reloading the checkpoint whenever anything in the workflow changed).
>>
>>101935896
well for me what made things way faster was fp8 weight for the main model, left the text encoder at fp16. but if you have 24GB vram or 64gb ram then fp16 is fine. it will slow down if you max either.
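
The fp8 vs fp16 tradeoff above is mostly arithmetic on weight size. A rough sketch, assuming a 12B-parameter model (the commonly cited size for FLUX.1) and counting weights only; text encoders, VAE and activations come on top. The GGUF rows assume the Q8_0/Q4_0 block layout (32 weights plus one fp16 scale per block), other quant types will differ.

```python
# Weight-only memory estimate. PARAMS is an assumption (about 12B parameters);
# it ignores text encoders, VAE, activations and any framework overhead.
PARAMS = 12e9

# Effective bits per weight. The GGUF figures include the per-block fp16 scale:
# Q8_0 = (32*8 + 16) / 32 = 8.5, Q4_0 = (32*4 + 16) / 32 = 4.5.
BITS_PER_WEIGHT = {
    "fp16": 16.0,
    "fp8": 8.0,
    "Q8_0": 8.5,
    "Q4_0": 4.5,
}


def weight_gib(params: float, bits_per_weight: float) -> float:
    """Return the size of the weights in GiB."""
    return params * bits_per_weight / 8 / 2**30


def estimate_all(params: float = PARAMS) -> dict:
    """Map each format name to its estimated weight size in GiB."""
    return {name: weight_gib(params, bits) for name, bits in BITS_PER_WEIGHT.items()}


if __name__ == "__main__":
    for name, gib in estimate_all().items():
        print(f"{name:>5}: {gib:5.1f} GiB")
```

On these assumptions fp16 weights alone sit right around what a 24GB card holds, which would fit the slowdown described once either VRAM or system RAM is maxed.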
>>
>>101935854
are you japanese?
>>
>>101935334
>>101935854
Manga translator here. some of the phrases almost have meaning. it's like how SD used to be with english
>うし麦し半<>
>ちわいってーんい!
>土@んてっん上こまそい
ushimugihan<>
>cow wheat/master half<>
chiwaitteeeeeeni!
>CHWAI NEH I SAY!
tsuchi @ nteeeenjoukomasoi
>earth @ net roaf thing n deet (TL, could be a muffled 'roof thing thin & detailed')
>乃のーも毛のゐってに
>てース班いえ4っ母ら
nonooo moke noyi tte ni
>attempt to say no but spelling it two ways then mo also spelt 2 ways followed by sounds
teeesu sake ie 4 hahara
>>
Goodnight
>>
File: ComfyUI_Flux_02117_.png (1.71 MB, 768x1152)
So I swapped out .sft to .safetensors on ComfyUI and that seems to have helped performance after I rebooted my PC, not sure if placebo or just raw benefits of a new insane Adaptive Guider workflow.
>>
>>101935869
I think it's literally a lora of his custom fifa character judging by the description. Why he thinks anyone else would want this is anyone's guess
>>
>>101935920
.sft is just .safetensors, the file hash is the same
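
If you want to check the "same hash" claim on your own files, here is a minimal sketch using only the standard library. The file names in the usage note are placeholders, not real paths from this thread.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so multi-GB checkpoints don't need to fit in RAM."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_file_contents(path_a, path_b):
    """True if both files have identical bytes (same SHA-256)."""
    return sha256_of(path_a) == sha256_of(path_b)
```

Usage would look like `same_file_contents("model.sft", "model.safetensors")`. If it returns True, renaming the extension cannot change performance, since the bytes are identical.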
>>
>24gb vram
>Training at 512 resolution doesn't even really impact outputs
>can train at rank 32 LoRAs now

24gb chads, we're eating good.
>>
wew, there's a few controlnet models out already
what's the feedback?
>>
>>101935904
>i stopped visiting this place for a few months
they were added just after christmas so you were gone longer anon
>>
>>101935969
Meh
>>
>>101935904

Technical debt and bloat. The gens are waayy slower on A1111 for some reason and people eventually move on to other front ends.
>>
>>101935981
I should add to this. It's not that they don't work. It's more that the controlnets we're getting feel more like poorly trained publicity stunts than anything practical.
Sure, they confirmed that they worked, but the dataset and time it took to train them reflects the poor quality.

See xinxir for an example of how a good controlnet should operate.
>>
>>101935987
and A1111 doesn't support Flux right? Damn that guy is slow, I mean, I have a lot of respect for him when I learned SD in 2022, but the reality is here, forge and Comfy are better devs than him
>>
>>101935994
can they be used with 12Gb?
>>
>>101936011
I don't know, sorry.
I know a good way to find out!
>>
>>101936014
yeah I'm on it
>>
File: image.jpg (89 KB, 1536x1024)
>>101935940
>Training at 512 resolution doesn't even really impact outputs
i have only heard people say this is a bad idea to do
>>
File: ComfyUI_02930_.png (1.68 MB, 1536x640)
>>
forge defaults to 896x1152 even when ui-config.json has height/width set to 1024x1024?

why
>>
>>101935320
>photo of an Indian being forced to teach his replacement as a condition for receiving unemployment
>>
File: GGUF_Safetensors_LoRA3.png (1012 KB, 2277x929)
GGUF in ComfyUI now works with the native LoRA loader. No extra fuckery required.
https://github.com/city96/ComfyUI-GGUF/pull/29
>>
>>101936029
nm other people are having a similar issue, it's a bug
>>
>>101936045
>it's a bug
it's a pain in the ass is what it is, but you can easily load a png into the png info tab and send the parameters to txt2img
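
The PNG info workaround relies on the UI writing its generation settings into the image's text metadata. A minimal sketch of pulling those out with the standard library, assuming the settings are stored in an uncompressed `tEXt` chunk; the key name `parameters` in the usage note is an assumption about A1111-style UIs, not something confirmed in this thread.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data: bytes) -> dict:
    """Return {keyword: text} for every tEXt chunk in a PNG byte string.

    Compressed (zTXt) and international (iTXt) chunks are skipped in this sketch.
    """
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    found = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        # Each chunk: 4-byte big-endian length, 4-byte type, body, 4-byte CRC.
        length, chunk_type = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        if chunk_type == b"tEXt":
            keyword, _, text = body.partition(b"\x00")
            found[keyword.decode("latin-1")] = text.decode("latin-1")
        elif chunk_type == b"IEND":
            break
        pos += 12 + length
    return found
```

Usage would be something like `read_png_text_chunks(open("gen.png", "rb").read()).get("parameters")`, where `gen.png` is a placeholder file name.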
>>
File: 1723038923358582.jpg (85 KB, 1142x1153)
>>101936044
thank you for your service fren
>>
>>101936023
I've unironically heard the exact opposite.
>>
File: image.jpg (81 KB, 1536x1024)
>>
>>101936044
Based City.
>>
>>101936044
KING
How did you do it?
>>
>>101936044
this may be the way to use loras without 24gb, neat
>>
File: file.png (7 KB, 434x121)
>>101936044
wait what is this, you can load conditioning?
>>
>>101936044
dumb question, but how do you get the little denoising preview on the k-sampler?
>>
File: image.jpg (80 KB, 1536x1024)
>>101936061
the place i heard what i heard is here, so you probably shouldn't listen to me at all
>>
>>101936044
loadcond replaces the clip text encode nodes? how are you prompting? im a comfy noob
>>
>>101936120
Ill be back in a few hours and see how my megumin LoRA looks. (It's being trained at 512).

I've done 1024, 768 etc and they all turned out great. I wanted to see how those LoRAs looked at a higher rank, so I turn down the resolution.
>>
>>101936044
Speed is a bit shit since the calculations are done in FP32. Comfy added an option to do them in any dtype but that just got merged and I'm not gonna deal with 20 more reports of "it don't work :(((" by instantly using it kek
>>101936072
Instead of applying the LoRA weights to the quantized tensors I keep them on the CPU, start loading them asynchronously, do the dequant (by which time they're already on the GPU), do the math on the actual weights, then delete the copy on the GPU
>>101936095
>>101936121
Ignore that, that's just for testing so I don't have to load T5 each time lol. I just saved the outputs of the text encoder node to a .cond file.
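
A toy sketch of the idea described above (dequantize first, then add the LoRA delta to the actual weights, leaving the quantized tensor untouched). This is not the ComfyUI-GGUF code: the block format, function names and pure-Python math are all illustrative assumptions, and the async CPU to GPU transfer is not modeled.

```python
def dequantize(blocks):
    """blocks: list of (scale, [int values]) pairs -> flat list of floats."""
    flat = []
    for scale, quants in blocks:
        flat.extend(scale * q for q in quants)
    return flat


def matmul(a, b):
    """Plain nested-list matrix multiply: (n x k) @ (k x m) -> (n x m)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def apply_lora_on_the_fly(blocks, shape, lora_up, lora_down, strength=1.0):
    """Return dequantized weights plus strength * (lora_up @ lora_down).

    The quantized blocks are only read, never modified, so the LoRA can be
    swapped or re-weighted without requantizing anything.
    """
    rows, cols = shape
    flat = dequantize(blocks)
    weight = [flat[r * cols:(r + 1) * cols] for r in range(rows)]
    delta = matmul(lora_up, lora_down)  # (rows x rank) @ (rank x cols)
    return [
        [w + strength * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]
```

The cost of doing it this way is that the delta is recomputed whenever the layer is used, which would be consistent with the speed complaint above.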
>>
>>101936098
Launch with "--preview-method auto" or "--preview-method taesd" (if you have downloaded taesd)
>>
File: error.png (93 KB, 719x401)
Good morning sirs. Does anyone else get this error trying to apply the lora to gguf?
>>
>>101936044
THATS MY GOAT
>>
>>101936166
It is telling you that your failed the math exam, unfortunately you will have to repeat the grade.
>>
File: image.jpg (75 KB, 1536x1024)
>>101936130
>I wanted to see how those LoRAs looked at a higher rank
ah yes, the HoRA
>>
>>101936166
Which LoRA? I only tested on the ones I trained on ostris's trainer.
>>
File: ComfyUI_00032_.png (1.47 MB, 1024x1024)
>>
>>101936166
did you update?
>>
>>101936183
SDXL LoRAs were often 64 to 128.
32 isn't THAT high rank.
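
For a sense of scale, LoRA size grows linearly with rank: each adapted linear layer of shape (d_out, d_in) adds rank * (d_in + d_out) parameters. A rough sketch; the layer shapes in the usage note are made up for illustration, not Flux's real layer list.

```python
def lora_params(layer_shapes, rank):
    """Total LoRA parameters for a list of (d_out, d_in) linear layers.

    Each layer gets a down matrix (rank x d_in) and an up matrix (d_out x rank).
    """
    return sum(rank * (d_in + d_out) for d_out, d_in in layer_shapes)


def lora_size_mib(layer_shapes, rank, bytes_per_param=2):
    """Approximate file size in MiB, assuming 16-bit storage by default."""
    return lora_params(layer_shapes, rank) * bytes_per_param / 2**20
```

For example, `lora_size_mib([(3072, 3072)] * 100, rank=32)` uses 100 hypothetical square layers; doubling the rank to 64 doubles the result.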
>>
>>101936166
connect it at the start of the model loader rather than after anything else that modifies it, that's how i fixed that error
>>
>2 weeks
Look at where we are now. This shit is moving faster than any other local model since 1.5.
I want the niggers who spent 2 days arguing that it was impossible to train to speak up and accept defeat.
>>
>why is my speed so dogshit
>cfg 1.1
>>
>>101936157
This is also just an option in comfy manager.
>>
>>101936188
It's the official flux realism lora converted for comfyui
>>101936236
it don't work :(((
>>
File: ComfyUI_00121_.png (589 KB, 512x768)
>>101936189
Nice. How did you get the swastika?

For now the best gens I got with schnell were with the [*background] [(*face)style] prompt style
>>
>>101936257
>loras not possible
>finetunes not possible
apparently a lot can change in a week
>>
>>101936202
Yeah I updated the gguf custom node, didn't update comfyui, kinda afraid to after seeing the UI change
>>
File: 1703948945602284.png (1.23 MB, 1024x1024)
>ingame screenshot of world of warcraft starring miku hatsune as a wizard
>>
>>101936323
>kinda afraid to after seeing the UI change
There is an easy fix for this. Get your autism medicated
>>
File: ComfyUI_00123_.png (588 KB, 512x768)
>>101936294
From here?

https://huggingface.co/XLabs-AI/flux-lora-collection/tree/main
>>
File: 1717759457873719.png (1.17 MB, 1024x1024)
>>101936325
>>
>>101935542
>>101935563
>>101935595
These are sick

Everything else in this thread was gay and a waste of time
>>
>1000 steps into my 512x512 megumin test

It's starting to look like her.
>>
File: ComfyUI_00124_.png (569 KB, 512x768)
>>101936329
>>101936308
Why is it generating the same leather jacket for every gen? I noticed it does the same for some background, like it knows only one type of leather jacket or city
>>
>>101935470
Daaaw is the ameriturd scared of a banny wanny?
Pathetic
>>
File: ComfyUI_00035_.png (1.63 MB, 1024x1024)
>>101936308
Just ask for it desu, flux knows swastikas unlike past SD slop that would always just give me messed up checker patterns instead
>>101936329
Yeah, the one that was converted for comfyui
>>
>>101936344

First sample for reference
>>
File: LoRA_Test.png (38 KB, 1282x253)
>>101936294
Odd, I have the anime one and it seems to load.
>>101936323
I also hate it. Just make a copy of the "web" folder and update it, delete the new one and copy the old one back.
Then before each update you'll delete the web one again, do a "git reset --hard" then a git pull before copying the old one back kek.
>>
>>101936354
Thanks for the hack to keep the UI, I might resort to that as well o7
>>
>>101936354
Comfy really needs to add an autism option in the settings that changes it back to old. Not sure what motivated him to do a UI change when the primary people comfy appeals to are actual autists.
>>
File: image.jpg (77 KB, 1536x1024)
>>101936189
i wonder if flux devs being german will get any controversy for being able to generate swastikas so well

>>101936338
thanks anon ill definitely be using the first image in my future cool music video where i rap about fucking robots

>>101936257
>This shit is moving faster than any other local model since 1.5.
this is also because people have more compute on average, and techniques like lora/controlnet were already discovered/invented
>>
File: ComfyUI_00126_.png (557 KB, 512x768)
>>101936351
This works for me

https://huggingface.co/XLabs-AI/flux-lora-collection/blob/main/realism_lora.safetensors
>>
>>101936380
We had all those tools with XL too, but it took ages for it to be worthwhile. Then we've had pixart and hunyuan and nobody gave a shit about those either.
the primary motivator is how good the base model is.
>>
https://github.com/bmaltais/kohya_ss/issues/2701

Lurked this thread for flux kohya ss training configs. Running one from there right now with some old danbooru style dataset I have, at least it worked and started training.

!Watch your temp/ power limit!
>>
>>101936386
>https://huggingface.co/XLabs-AI/flux-lora-collection/blob/main/realism_lora.safetensors
In comfyui? The one that wasn't converted? That one gives me different errors and ends up not being applied at all
>>
>>101936397
Use this one
>https://github.com/bmaltais/kohya_ss/issues/2701#issuecomment-2294611159
It's what I'm training on right now, it's the only one I've had success with. Samples are promising at 3600 steps
>>
>>101936402
>That one gives me different errors and ends up not being applied at all
That was fixed last week I think https://github.com/comfyanonymous/ComfyUI/pull/4302
>>
File: 1695054741688234.png (1.16 MB, 1024x1024)
world of miku:
>>
File: ComfyUI_00129_.png (596 KB, 512x768)
>>101936402
Yes
>>
>>101936428
Thinking about making WoW classic LoRA to really force that aesthetic. Interested?
>>
>>101936420

I got invalid Additional parameters with that one.
>>
File: ComfyUI_00336_.png (1.4 MB, 1024x1024)
>>101936351
>>
File: ComfyUI_00130_.png (582 KB, 512x768)
>>101936444
Can you try it with leather jacket?
>>
File: Capture.png (10 KB, 1024x102)
>>101936440
Could be --highvram? Did you git pull? Every other config I tried from that thread trained to completion but the finished loras had little to no effect at all
>>
File: image30.png (933 KB, 1344x768)
>>101936393
>the primary motivator is how good the base model is.
you're right. SDXL would be half of what it was today without pony
>>
File: ComfyUI_172140_.png (1.33 MB, 1024x1024)
>>101936379
The default UI is still the old style one. Right now you have to go in the settings and manually set an option to get the new one.
People have been complaining about the UI since the beginning, the new UI is the beginning of one that's designed by actual UX people.
>>
>>101936439
im just testing a range of prompts, save your training time for a character/aesthetic you like. but ty
>>
>>101936490
I-its not like I'd be making it for YOU. I just like the aesthetic is all, BAKA.
>>
so if Q4 works with loras that means people with under 24GB can generate loras, right? Even Q8 would work. I tried the regular fp8 model on a 4080 and it took 3 minutes (16GB vram).

so this is very promising stuff
>>
File: ComfyUI_00341_.png (1.4 MB, 1024x1024)
>>101936460
>>
>>101936498
it'd be a neat style in general, would be popular on civitai
>>
>>101936474
That's weird because I didn't change anything, I just did a pull and it changed. I definitely didn't pick an option to get it. I assume you made a change between then and now.
I personally like the new UI but please understand that your audience is mostly people with varying levels of actual autism and change makes them angry.
>>
>>101936474
the specific design isn't what people have been complaining about and your userbase is predominantly autistics who hate change
>>
>>101936499
*generate images with loras

training is another thing entirely
>>
When will the Q2 quants drop?
>>
File: ComfyUI_172142_.png (1.23 MB, 1024x1024)
>>101936505
Did you already have the beta UI enabled? (the bar instead of the floating side menu).
>>
File: 1704116347050872.png (1.21 MB, 1024x1024)
now we're getting closer, now that I specified long twintails.
>>
File: image.jpg (90 KB, 1536x1024)
>>101936511
dont do that to yourself anon. buy a used 3090 or even a 3060
>>
>>101936509
>>101936509
?? I've been using loras on q8 just fine on a 4070ti Super 16gb. I even trained one locally
>>
>>101936518
>the bar instead of the floating side menu
Yeah I did. I guess that explains it.
>>
>>101936534
probably cause I used the default fp8, I was running out of vram, ill try the Q8/Q4
>>
>>101936523
Looks pretty cool Anon. I wonder if specifying her username, the map name in the minimap and the chat contents would replace the gibberish texts.
>>
>>101936534
how do you connect the gguf or lora loader nodes to a clip/prompt node? it just has model out, any screenshot of your workflow?
>>
>>101936380
Can we get nudes of the Russian girls? How/where?
>>
File: ComfyUI_00928_.png (1.5 MB, 1280x720)
Flux sure likes necklaces
>>
File: ComfyUI_00040_.png (1.43 MB, 1024x1024)
>>101936501
based
>>
>>101936534
>I've been using loras on q8 just fine
So we can just use any Flux lora?
I wonder if controlnets will be the same.
>>
So is nf4 old and busted with the GGUF being the new hotness?
>>
>>101936590
Is that what you think Russian women look like?
>>
File: ComfyUI_00927_.png (1.54 MB, 1280x720)
>>101936625
yeah
>>
>>101936590
definitely not on a blue board, also base flux sucks with nudity and nipples
>>
>>101936590
russian-nude-girls.com.exe
>>
>>101936474
is the old one going to get removed
>>
>>101935309
I like how even if the base model can do it, a LoRA can help keep the prompts shorter. Not fully accustomed to boomer prompts so that's another great use case for LoRAs besides adding style.
>>
>>101935614
>just an overwhelming flood of slop
Just like real life.
>>
File: image.jpg (82 KB, 1536x1024)
>>101936633
he really chose the most East Asian looking gen in the whole thread to ask that question too kek

>>101936695
>Just like real life.
"90% of everything is shit"
>>
>>101935850
>wait 20-25 secs for every prompt change
Every gen is several minutes, so whatever.
I do both and I'm still in pain.
>>
>>101935850
I have a 7700 and even a prompt with 3k characters is processed in under 10 seconds
>>
File: ComfyUI_172151_.png (1.23 MB, 1024x1024)
>>101936674
Maybe in a long time idk.
>>
>>101936723
what prompt is that anon
>>
File: FD_00380_.png (1.46 MB, 1024x1024)
>>101936702
To be fair, women from Vladivostok tend to have that look, sort of, due to the proximity to Mongolia, China, and North Korea. Russia is more ethnically diverse than people realise.
Flux seems to agree.
>Prompt: A woman from Vladivostok Russia wearing traditional clothes.
>>
>>101936729
a made up prompt to test the processing speed
>>
>>101936726
I think you should leave it in forever as an easter egg if someone uses the arg --autism
>>
>>101936741
I doubt the model has any specific information on what part of Russia the models come from.
>>
File: FD_00383_.png (1.42 MB, 1024x1024)
>>101936741
Wait this is the wrong pic, it was this one
>>
>>101936702
I was referring to the ones in white, not that stinking gook looking girl
>>
https://civitai.com/models/644109/pepe-flux?modelVersionId=720509

a new age of pepe
>>
>>101936756
Yeah the vlm wouldn't know, but I am curious now, gonna run some tests.
>>
>>101936741
>that watermarked chin
>>
>Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions.
what it be
>>
File: 1704160011984822.png (967 KB, 896x1152)
loras do in fact work, neat
>>
>>101936804
it checks if okay on GPU so don't have to go to CPU
>>
>>101936808
aight, what do
>>
File: rus grills.jpg (3.69 MB, 1999x1999)
>>101936791
It's all just vaguely Russian looking regardless of the region. None of them are correct.
>>
>>101936817
it be faster
>>
>>101936821
They are all sisters from the same chin.
>>
>>101936829
They are all gens from the same seed.
>>
>>101936826
What I'm 'posed to do?
>>
>>101936839
don't compile torch yourself
>>
>>101936726
yeah i can understand if it starts to cause trouble
>>
File: 1702808362592953.png (1.05 MB, 1024x1024)
>>
File: ComfyUI_temp_eusqs_00002_.png (1.11 MB, 1024x1024)
>>101936858
>>
>>101936858
Training LoRAs, cant gen right now.
>>
Anyone get joycaption running locally yet?
>>
>>101936878
yes
>>
>>101936889
Cool, how?
>>
>>101936804
that's usually preceded by indexing errors or something like that. it's an issue with the code, nothing you can do yourself, report the issue with the full traceback
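if you want a traceback that actually points at the failing op before you report it, one rough way is to relaunch with `CUDA_LAUNCH_BLOCKING=1`, which makes CUDA kernel launches synchronous. minimal sketch below; `main.py` is only a placeholder script name and the helper names are made up for illustration, nothing here is taken from any UI's code:

```python
import os
import subprocess
import sys


def debug_env(base=None):
    """Return a copy of the environment with synchronous CUDA launches enabled."""
    env = dict(os.environ if base is None else base)
    # CUDA_LAUNCH_BLOCKING=1 makes each kernel launch block until it finishes,
    # so the Python traceback lands on the op that tripped the assert.
    env["CUDA_LAUNCH_BLOCKING"] = "1"
    return env


def run_with_sync_cuda(script="main.py", *args):
    """Relaunch `script` (placeholder name) under the debug environment."""
    cmd = [sys.executable, script, *args]
    return subprocess.run(cmd, env=debug_env()).returncode
```

call `run_with_sync_cuda("main.py")` from the UI's folder with its venv active, then paste the full traceback into the issue. gens will be slower while this is on, so only use it for debugging.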
>>
File: 1718968464074337.png (1.01 MB, 1024x1024)
has science gone too far? this is what happened with "elf" by itself.
>>
>>101936891
I cloned and ran it.
>>
>>101936806
>loras do in fact work, neat
it's real
>>
>>101936846
>>101936900
I just updated my nvidia driver and now I get all kinds of shit, I don't know what these mean
>>
File: file.png (7 KB, 811x52)
>>101936858
I found a well priced Titan. Is it a good choice for those 24GB?
>>
>>101936937
I did that with 16gb, you dont need 24 but it's nice for training purposes.

try >>101936912 or try fp8/q8 in forge with the update
>>
>Goofgufs now working with LoRAs
This is so good.
>>
File: ComfyUI_00046_.png (1.41 MB, 1024x1024)
>>
>>101936474
>UX "people"
It's fucking owari
>>
>>101936973
That's almost a cameltoe. Is it learning?
>>
>>101936991
>Is it learning
No, unless he has a camel toe lora that is purely coincidental. It can't learn, it's a pre-trained model. We have to feed it additional training in the form of loras and fine tunes.
>>
>>101936998
B-but AI!
>>
File: FD_00397_.png (1.02 MB, 1024x1024)
>>
File: 1694943747312868.png (1.03 MB, 1024x1024)
<lora:FLUX-Pepe-1:1> Pepe riding a skateboard in summer

new age of memes begins now
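
for anyone scripting prompts: the tag above looks like the A1111/Forge-style `<lora:name:weight>` syntax. rough sketch of pulling those tags out of a prompt string, assuming that syntax; `split_prompt` is a made-up helper, and defaulting a missing weight to 1.0 is this sketch's own choice:

```python
import re

# Matches <lora:NAME:WEIGHT>; the weight part is optional in this sketch.
LORA_TAG = re.compile(r"<lora:([^:>]+)(?::([-+]?\d*\.?\d+))?>")


def split_prompt(prompt):
    """Return (prompt text without tags, list of (lora_name, weight))."""
    loras = [
        (name, float(weight) if weight else 1.0)
        for name, weight in LORA_TAG.findall(prompt)
    ]
    # Drop the tags, then collapse the leftover whitespace.
    text = " ".join(LORA_TAG.sub(" ", prompt).split())
    return text, loras
```

feeding it the prompt from this post should give back the plain text plus one `("FLUX-Pepe-1", 1.0)` entry.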
>>
>>101936991
pure luck just by prompting for tight high-waist shorts
>>
so how is gguf lora formed?
>>
>gguf+lora works but not with dynamic thresholding
owari da
>>
>>101936912
What all these stupid celeb loras show is that it's possible to delete the chin butt with a proper data finetune
>>
>>101937023
how is goof form?
>>
>>101936044
any plans for Q8 T5?
>>
>>101936998
You don't say? I thought it could communicate through the internet by interfacing with anon's GPU, altering its weights and syncing them via bittorrent protocol in real time.

Thanks for clearing that up. It's nice to have intelligent people like yourself here.
>>
File: FD_00398_.png (1013 KB, 1024x1024)
>>101937029
It also shows that Trump and Kamala had an affair
>>
>>101937023
looks like no conversion required
>>
File: 1704495039809965.png (1.14 MB, 1024x1024)
>>
>>101937040
>he doesn't know
>>
Flux werks but i dunno what to gen
>>
>>101937040
You joke, but it wouldn't be the most retarded question asked seriously in here, so I just assume everyone is an idiot.
>>
>>101937026
I moved away from dynamic thresh because it was tanking my performance and the visual gains were subjective at best. Unless you very specifically want a certain thing I'm usually fine without it.
>>
>>101936878
>git clone it from hugging face
>copy+paste venv from comfyui or whatever into the folder
>edit app.py and change the model path to a publicly accessible model, like the quantized llama one (check archives for link)
>If on windows, create batch file to run app.py (ask chatgpt if you don't know how)
>open UI in browser
>done

if you encounter any issues or errors ask chatgpt how to fix them
>>
I'm gonna post it
>>
>>101937051
sex gifs
>>
File: 1699238850424597.png (1.41 MB, 1024x1024)
prompt cartoon to get cartoonish pepe instead of RL pepe:
>>
>>101937055
dont
>>
>>101937040
you'll find anons on the DALL-E 3 threads firmly believing that the more they prompt a certain celebrity the better the model gets at reproducing them
stupidity is real
>>
>>101937055
DO IT FAGGOT
>>
>>101937066
DALL-E jeets aren't human
>>
>>101937060
I don't think the flexibility and successfulness of flux LoRAs is being discussed enough. Even at low rank they nail aesthetics. Go ahead and find an artist you like, take 50-100 of their images and just shove them into a trainer, don't even bother captioning it and it will ape their style almost perfectly.
>>
>>101937055
you won't
>>
>>101937055
Don't do it anon...
>>
File: FD_00396_.png (1.24 MB, 1024x1024)
>>101937066
Ask them if they can gen this.
>>
>>101936786
(((The creator of this asset requires you to be logged in to download it)))

How does no one seemingly have a problem with this?
>>
>>101937082
yeah, even for testing a SDXL lora I used 10 images from one piece and was able to make a nami that looked just like it. and that's with BASIC settings, not even tons of training.
>>
>>101937066
just remember the average IQ in India is literally the IQ standard for qualifying as mentally retarded in the first world, and everything makes horrifying sense
>>
File: file.png (456 KB, 512x512)
So, I found that a way to avoid the chins might be to prompt ethnicity and avoid writing "woman" or "girl". You can already see that with Asian. It's not 100% foolproof, but it might be worth looking into.

This is a "Jewish beauty".
>>
File: Capture.png (6 KB, 402x150)
>>101937088
There's literally paid loras on civitai now
>>
File: 1720205090285615.png (1.29 MB, 1024x1024)
>if only you knew how GOOD things are
>>
>>101937086
only the censorship around DALL-E would stop this from generating
if you played with it in the first week there should be no doubt it can gen that plus tongues and saliva (Flux can't into tongues and saliva)
>>
>>101937096
Mental retardation determination by IQ only makes sense after normalized to 100 in the test population.
>>
>>101937101
You could be exploiting people's need to cum and make your own LoRAs.

>>101937104
I like how trump's eyes are also vaguely pepeish.
>>
>>101937116
You know what DALL-E can't do that FLUX can? Run on my GPU.
>>
>Been like 4 days since the 24gb fine tuning guy said he could get it working on a single 3090.
>Even Kohya was impressed
>Nothing has come of it yet.

Where the fuck is he?
>>
>>101937101
Signs the site has hit rock bottom, we are in dire need of a new website




All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.