[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (1.24 MB, 3264x3264)
1.24 MB
1.24 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102046042

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
File: ComfyUI_02741_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>
File: ComfyUI_00918_.png (3.15 MB, 1376x1536)
3.15 MB
3.15 MB PNG
>>
File: ComfyUI_10315_.png (2.3 MB, 1400x800)
2.3 MB
2.3 MB PNG
>>
File: ifx173.jpg (217 KB, 1024x1024)
217 KB
217 KB JPG
>>
File: frst.jpg (198 KB, 896x1152)
198 KB
198 KB JPG
>>
File: ComfyUI_00758_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>
local lora training guide for flux? 16GB of VRAM
>>
File: ComfyUI_32775_.png (2.61 MB, 1536x1536)
2.61 MB
2.61 MB PNG
>>
>>102048342
watch this video and join my patreon
>>
File: ComfyUI_00996_.png (1.63 MB, 768x1280)
1.63 MB
1.63 MB PNG
is it true that nf4 "slaps"?
>>
>>102048342
>>
File: file.png (945 KB, 1024x1024)
945 KB
945 KB PNG
Training a little bit more
>>
File: 00037-2415769798.png (892 KB, 1344x720)
892 KB
892 KB PNG
>>102048342
Get 4090
>>
>>102048366
what the hell is going on in that image
>>
File: ComfyUI_01001_.png (1.75 MB, 768x1280)
1.75 MB
1.75 MB PNG
how could nf4 do this to my biracial daughter?
>>
File: ComfyUI_01002_.png (1.61 MB, 768x1280)
1.61 MB
1.61 MB PNG
>>102048392
she has been raped by compression format her father is very angry about it. it was in the local news yesterday
>>
File: ComfyUI_00922_.png (2.42 MB, 1536x1376)
2.42 MB
2.42 MB PNG
>>
File: 1715441946920481.png (782 KB, 824x824)
782 KB
782 KB PNG
Does anyone have a good prompt/negative prompt to get natural looking photos? Flux makes everything look so hollywood/instagram style.
>>
fligugigu
>>
>>102048437
use lora
>>
File: ComfyUI_00923_.png (1.74 MB, 960x1280)
1.74 MB
1.74 MB PNG
>>
>>102048482
good shit
>>
>>102048437
Put the prompt in a LLM, it needs lots of details

Use this lora

https://civitai.com/models/652699
>>
File: ComfyUI_04109_.png (1.43 MB, 832x1216)
1.43 MB
1.43 MB PNG
>>
Training for 1250 steps. I should re-caption the dataset with wd-tagger, though. So this might end up being a beta lora or something.
>>
>Using this option, you can even try SDXL in nf4 and see what will happen - in my case SDXL now really works like SD1.5 fast and images are spilling out!
So this is just complete bullshit?
>>
File: ComfyUI_32776_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>
File: 00025-3308483902.png (1.13 MB, 896x1152)
1.13 MB
1.13 MB PNG
>>
File: ComfyUI_Flux_10314.jpg (473 KB, 1024x1024)
473 KB
473 KB JPG
flux fp16 got extremely slow for some reason in my comfy even with --lowvram (24GB GPU)

I feel dirty using fp8
>>
File: 00028-1561240628.png (1.18 MB, 896x1152)
1.18 MB
1.18 MB PNG
>>
File: ComfyUI_32777_.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
>>
File: 1724447494689.png (95 KB, 436x536)
95 KB
95 KB PNG
>update forge
>everything is noticeable slower
>>
>>102048715
kek
>>
>>102048677
I don't even have the option to use fp8.
>>
>>102048234
>flux
but where are the Pony-level of furry finetunes? How long has it been, 2, 3 weeks?
>>
>>102048677
Something took up some vram and it overflowed? Have you checked?
--lowvram just makes sure to unload models after use, it might not be particularly helpful in certain situations
>>
>>102048715
Thanks for playing the forge lottery, better luck next time.
>>
File: ComfyUI_212326_.png (2.55 MB, 1920x1080)
2.55 MB
2.55 MB PNG
this is out of 1300/10000 steps so far, gonna post on civitai once complete. Even took pictures of my hot toys MMS466 matrix figure. How many steps is too much steps? letting it go for 10k and then save all outputs incrementally 100,200,300,etc
>>
>>102048743
Just 2 more weeks™
>>
File: _00006-4142853513.jpg (100 KB, 832x1216)
100 KB
100 KB JPG
>>102048437
amateur, noob, newbie, not so pro, no professional, low budget
>>
File: ComfyUI_00924_.png (1.66 MB, 960x1280)
1.66 MB
1.66 MB PNG
>>
>>102048342
Plug in values and pray to God it works
>>
Official prompt:
>This is a digital cartoon drawing of apustaja the antropomorphic frog. He is wearing a blue t-shirt and looks sad.

https://mega.nz/file/sEADSCgI#2q0RUTZPGxotB5sIEb8LwDJ-LGgno1kAR4X966eMu2I

Please experiment, distribute, and post results.
>>
File: ComfyUI_01023_.png (1.55 MB, 768x1280)
1.55 MB
1.55 MB PNG
>>102048482
>>
>>102048779
What are your settings for output resolution/network rank/network alpha/convolution rank/convolution alpha?
>>
>>102048815
sorry i only download from safety sites like civitai i dont want to be infected with a dangerous virus. also mega isn't this the site that was run by a anti-semite nazi criminal on new zealand that has been on the news recently. not ok
>>
File: 00032-1338252880.png (1.08 MB, 896x1152)
1.08 MB
1.08 MB PNG
>>
>>102048565
I'm a bit confused about people combining boomer captions and WDtags. WDtags gets separated by commas but normal sentences don't have the same effect
>>
File: ComfyUI_32779_.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>
File: file.png (897 KB, 1024x1024)
897 KB
897 KB PNG
>>102044360
>>102048815
>>
File: 1723024654108.png (957 KB, 1280x720)
957 KB
957 KB PNG
>>102048677
>>102048715
>>
>>102048677
I can use fp16 just fine when I start my PC fresh .. then if I use it a while and resume using comfy and fp16 its slow and sometimes even gives me bluescreen, even on a 4090 fp16 hits the limits of usage for comfy somehow.
>>
>>102048234
Imggen
>>
File: file.png (711 KB, 1024x1024)
711 KB
711 KB PNG
>>
>>102048921
me
>>
File: sailing.png (1.21 MB, 1024x1024)
1.21 MB
1.21 MB PNG
>>
File: file.png (894 KB, 1024x1024)
894 KB
894 KB PNG
>>
File: 1696860059228658.png (529 KB, 512x512)
529 KB
529 KB PNG
how do I create people with trunk noses?
>>
>>102048829

[
{
"id": "n3o",
"type": "local",
"crop": "true",
"crop_aspect": "square",
"crop_style": "center",
"resolution": 1.0,
"minimum_image_size": 0.25,
"maximum_image_size": 1.0,
"target_downsample_size": 1.0,
"resolution_type": "area",
"cache_dir_vae": "cache/vae/n3o",
"instance_data_dir": "datasets/n3o",
"disabled": false,
"skip_file_discovery": "",
"caption_strategy": "textfile",
"metadata_backend": "json"
},
{
"id": "text-embeds",
"type": "local",
"dataset_type": "text_embeds",
"default": true,
"cache_dir": "cache/text/n3o",
"disabled": false,
"write_batch_size": 128
}
]




any suggestions please let me know what to update
>>
>>102048838
https://civitai.com/models/679189
>>
>>
>>102048829

and here is the config file

https://pastebin.com/JyjiQVwr

any suggestions would be greatly appreciated.
>>
>>102048342
https://www.reddit.com/r/StableDiffusion/comments/1eyr9yx/flux_local_lora_training_in_16gb_vram_quick_guide/
this one worked for me. have to train 512x512 though
>>
File: 2024-08-23_00507_.jpg (1.09 MB, 2688x1728)
1.09 MB
1.09 MB JPG
damnit... I went for a bath and set some hires upscales of my scifi scenes on a q so I can have some after ..


and I set SD UltimateUpscale iteration mistekenly to 73 ... now I have have 1 .. here have it .. and Ill prompt something else now.

Also GGUF is absolutely unusable if you use 3 loras at the same time as in pic related. Damnit.
>>
>>102049096
Wait, he only used 10 images for the dataset? I thought people were using autocaptions because their datasets were large.
>>
File: file.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>
>>102049120
you barely need anything to train characters with lux. just throw 20 pics in and you get extremely good likeness.
>>
everything costs money unless you own a 4000 series and still have to pay for open ai prompting, otherwise its extremely tedious work. So pay to rent a fast gpu and then pay open ai for prompting
>>
>>102049133
then damn, i'll caption these better than the autocaption
>>
>>102049120
you use autocaptions because writing captions is extremely tedious
>>
File: ComfyUI_32780__cleanup.png (1022 KB, 1024x1024)
1022 KB
1022 KB PNG
>>
>>102049140
another reason is that Flux was trained on autocaptions, we don't know what model they used but having similar "LLM sounding" captions can't hurt
>>
File: fs_0350.jpg (74 KB, 1024x1280)
74 KB
74 KB JPG
>>
Flux takes more work but outcome is better where sdxl took less effort and got better results, am i wrong?
>>
>>102049160
This is correct. Nearly everything JoyCaption spits out is an instantly useable prompt for FLUX, so using JoyCaptions autocaption for learning will be inherent for FLUX
>>
>>102049212
work to set up?
I just downloaded it with the workflow. I'd prefer a1111 but eh.
>>
>>102049211
i see the slag lora is coming along well anon
>>
>>102049212
if you have a monster prosa prompt you can outdo anything that SDXL can do (bot coom cause finetunes) .. but yea in SDXL finetunes you can get exactly what you want with a few tokens if you know how that finetune wants em

>masterpiece by Greg Rutkowski trending on ......
>>
>>102049230

no creating loras
>>
>>102049211
10/10 slampigs, imagine the jiggle
>>
>>
File: 2024-08-23_00443_.png (1.46 MB, 1280x768)
1.46 MB
1.46 MB PNG
>>
File: file.png (1014 KB, 1024x1024)
1014 KB
1014 KB PNG
>>
File: 2024-08-23_00511_.jpg (701 KB, 2688x1728)
701 KB
701 KB JPG
>>
File: file.png (790 KB, 1024x1024)
790 KB
790 KB PNG
>>
File: IMG_9636.jpg (714 KB, 1125x1123)
714 KB
714 KB JPG
>>102048234
>omg bro fluxxxx bro it’s so good bro sd is doomed bro
>physically cannot make porn
>>
>>102048838
lol, and you are still browsing 4chan? xD
>>
>>
>>102049405
This poster is an underage transexual-loving indian.
>>
File: zonkey_test.jpg (65 KB, 512x640)
65 KB
65 KB JPG
does flux support BREAK? Is there a list of supported syntax that I am unaware of. Specifically interested in () and [X:Y:0.4]

Pic not related
>>
>>102049409
haha
imagine if she got stuck halfway though a wall
wouldn't that be funny
hahaha
>>
>>102049381
>the eternal coomer
>>
>>102049454
no, well maybe, could be, but if only in the CLIP part of the prompt, not in the T5 part.. that will see BREAK as a bold word .. maybe something to write on a sign or such
>>
Okay I need to uncuck flux.
Is there an existing porn image dataset or do I have to make one?
>>
>>102049490
You are in for a world of pain and defeat. It’s ununcuckable.
>>
>>102049381
The problem is trying to gen homosexual porn
Try normal porn and it'll work
>>
File: 2024-08-23_00515_.jpg (555 KB, 2688x1728)
555 KB
555 KB JPG
>>
File: ComfyUI_04121_.png (1.38 MB, 704x1408)
1.38 MB
1.38 MB PNG
>>
>LR 1.0
call the cops.
>>
>>102049381
now let's see what base SD1.5 or SDXL does with that prompt
>>
File deleted.
>>102049555
False. Half the time the women have no nipples in a fucked up way. There is no keyword or phrase to make it consistently have naked people; it gives them modesty briefs and bras constantly, or if it does have them naked it does a SFW side view like pic related.
99.9% of the time women don’t have labia or clitorises and the men don’t have penises or assholes.
There is clearly next to zero actual erotic imagery in the dataset, or it is unlabeled.
>>
time to generate 150 1girls while i go to the bathroom
>>
>>102049615
this is still a blue board
>>
File: 2024-08-24_00004_.png (1.28 MB, 1344x864)
1.28 MB
1.28 MB PNG
>>
>>102049381
>>102049615
search flux lora on civitai there are some to do porno
>>
>>102049212
It depends on what you're training. The techniques and settings I used for training SD/SDXL didn't work the way I thought they would for flux, and now I have to experiment again. The ease of training, creativity, and flexibility of the base models seem to become worse with each new release even though their prompt adherence became better.
>>
>>102049615
How close to that could you get with base XL or 1.5 I wonder
>>
>not enough vram to train loras
>can't even offload or anything, just fucked
IT'S OVER...
>>
>>102049627
The image is SAFE FOR WORK you can’t see SHIT
>>
https://civitai.com/models/656083/copycat-flux-testfp8fp16?modelVersionId=758738
is this sovl or sovlless?
>>
>>102049685
I said base models, anon
>>
>>102049664
>The ease of training, creativity, and flexibility of the base models seem to become worse with each new release even though their prompt adherence became better.
I noticed this too. I make up for it with wildcards. There's also a technique of shifting prompts halfway through. You begin with an unrelated prompt, and shift it to your real prompt, usually at 10% steps. I haven't tested it on flux yet.
>>
>>102049701
nsfw does not equal porn, it includes erotica damnit .. ah whatever
>>
File: 0.jpg (275 KB, 1024x1024)
275 KB
275 KB JPG
>>
>>102049686
You can use a cloud service.
>>
File: ComfyUI_Flux_10377.jpg (270 KB, 768x1344)
270 KB
270 KB JPG
>>
File: ComfyUI_32786_.png (977 KB, 1024x1024)
977 KB
977 KB PNG
>>
File: ComfyUI_32789_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>
File: fs_0370.jpg (284 KB, 1024x1280)
284 KB
284 KB JPG
>>
File: IMG_9641.jpg (888 KB, 1125x1121)
888 KB
888 KB JPG
>>102049615
Why would you want to be making porn when you could be making this
>>
File: 0.jpg (170 KB, 1024x1024)
170 KB
170 KB JPG
>>
What happens if you provide the same tag multiple times?
>>
>>102049886
Dali?
>>
File: 00141-4105126478.jpg (382 KB, 1000x1496)
382 KB
382 KB JPG
hej
>>
>>102049898
Base XL img2img on some random image.
Just effing around, throwing junk at SD to see what it does.

prompt:
by Homer Winslow
[ style of Ernesto Neto| style of Roberto Matta ] , (minimalism:1.9)
BREAK
[ style by Rob Gonsalves | style by Dan McPharlin | style by Igor Morski | style by Andrzej Grenda style by Agostino Arrivabene]
by Rothko
BREAK
minimalist , blank, few, sparse, honeycomb [ style of Joan Miro | style of Paul Cezanne | style of Jesus Raphael Soto | style of Georges Vantongerloo ] by Odilon Redon, chiaroscuro , impasto,Craquelure, hatching, Sfumato
BREAK
an infinite flatness stretching to a flat horizon, (minimalism:1.7) surreal,
style of Igor Morski, [ style by Rob Gonsalves | style by Dan McPharlin | style by Igor Morski | style by Andrzej Grenda | style by Agostino Arrivabene ], style by Fischinger, style by Hoyland]
abstract expressionism ,
graphic art, minimalism,

(Bauhaus:0.4) ,
BREAK
surreal, abstract, geometric, (Russian avant-garde) by Alberto Morrocco ,, Romantic, Rococo, intricate, amazing, minimalism, vector art,
BREAK
by Marijah Bac Cam , by Renee Johannes , by Gabriele Maurus, by Sumit Mehndiratta, by Ralph Paqui , by Kandinsky, by Miro, (minimalism:1.4)
(minimalism:1.6)
[by Simon Stalenhag

style by Franz Kline |
style by Jackson Pollock |
style by Agnes Martin |
style by Helen Frankenthaler]
>>
>>102049897
This apparently
>>
>>102049956
A SNAAAAAAAAAKE
>>
File: 0.jpg (112 KB, 1024x1024)
112 KB
112 KB JPG
>>
File: IMG_9649.jpg (904 KB, 1125x1722)
904 KB
904 KB JPG
>>102049975
Wow!
>>
File: ComfyUI_00930_.png (1.59 MB, 960x1280)
1.59 MB
1.59 MB PNG
>>
>new model/finetune/lora/etc
>absolutely compelled to prompt “trypophobia” ten times in a row
Why do I do this to myself
>>
File: tmpc4jbgath.png (3.27 MB, 1536x1536)
3.27 MB
3.27 MB PNG
badger:2
>>
>>102050063
add a bulge
>>
File: ComfyUI_04115_.png (1.28 MB, 768x1280)
1.28 MB
1.28 MB PNG
>>
>>102049615
Flux hates NSFW so much that it made both of them crying while they hug.
>>
File: ComfyUI_04135_.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>
File: ComfyUI_32791_.png (2.37 MB, 1920x1080)
2.37 MB
2.37 MB PNG
Which way, prompt man?
>>
File: ComfyUI_04140_.png (1.5 MB, 960x1088)
1.5 MB
1.5 MB PNG
>>102050105
continue prooompting
>>
File: ComfyUI_04146_.png (1.45 MB, 704x1408)
1.45 MB
1.45 MB PNG
>>
File: tmp5epvapm0.png (3.72 MB, 1536x1536)
3.72 MB
3.72 MB PNG
>>102050052
>>102050072
naw
>>102050099
whats wrong with you?
>>
File: ComfyUI_Flux_10411.jpg (369 KB, 768x1344)
369 KB
369 KB JPG
>>
File: ComfyUI_04148_.png (1.47 MB, 768x1344)
1.47 MB
1.47 MB PNG
>>
>>102049956
Honey badger
>>
File: ComfyUI_00933_.png (1.82 MB, 960x1280)
1.82 MB
1.82 MB PNG
>>
File: ComfyUI_04157_.png (1.37 MB, 896x1088)
1.37 MB
1.37 MB PNG
last one from the batch nightclub wildcard
>>
File: FLUX__00020_.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>>102049234
base flux is fine for slags
>>
File: tmpryz2t5hb.png (3.72 MB, 1536x1536)
3.72 MB
3.72 MB PNG
scopophobia:2
>>
File: IMG_9653.jpg (577 KB, 1125x1692)
577 KB
577 KB JPG
>>102050136
Can’t believe these assholes couldn’t train on a single crumb of porn, but could on every decades-old boomer meme
Shameful
>>
>>102050006
damn nice
>>
File: flux_00633_.png (1.08 MB, 1280x1024)
1.08 MB
1.08 MB PNG
>>
File: ComfyUI_00935_.png (1.4 MB, 1280x960)
1.4 MB
1.4 MB PNG
>>
File: ComfyUI_Flux_0151.jpg (926 KB, 1536x2688)
926 KB
926 KB JPG
>>
>Been using a1111 with some hacked up pile of kludge for torch because I'm on a 5700XT
>Oh hey some torch whls finally got pushed out for torch+rocm6.2 and my package manager is finally shipping 6.2 lets give it a spin
>Kernel panic on start
Christ its 2010 era catalyst all over again what was OpenCL really so bad?
>>
>>102050178
too thin, they are well fed now
>>
File: IMG_9654.jpg (921 KB, 1125x1737)
921 KB
921 KB JPG
>>102050178
Wtf
>>
>>102050237
If you’re going for 1960s sci fi pulp cover you’ve nailed it
>>
File: ComfyUI_Flux_0155.jpg (1.03 MB, 1536x2688)
1.03 MB
1.03 MB JPG
>>102050130
upscaled
>>
File: flux_00639_.png (1.05 MB, 1280x1024)
1.05 MB
1.05 MB PNG
>>
File: ComfyUI_Flux_0157.jpg (992 KB, 1536x2688)
992 KB
992 KB JPG
>>
File: ComfyUI_32797_.png (1.14 MB, 720x1280)
1.14 MB
1.14 MB PNG
>>
File: ComfyUI_Flux_0153.jpg (935 KB, 1536x2688)
935 KB
935 KB JPG
>>
File: 00233-1281223147.png (1.62 MB, 896x1152)
1.62 MB
1.62 MB PNG
>>102050391
>>102050346
>>102050340
>>102050178
Nice
Nice
Nice
>>
>>102050391
>>102050346
People that post images taking actual effort without posting workflow and shit should be lined up and shot
>>
File: 00235-1281223149.png (2.03 MB, 896x1152)
2.03 MB
2.03 MB PNG
BEHOLD!
>>
Does anybody know of a simple (not gradio) clip-interrogate api I can use to power the semantic search for my file picker? Tired of relying on bloated and unstable apis on stable diffusion webui implementations
>>
File: 00241-1281223155.png (2.01 MB, 896x1152)
2.01 MB
2.01 MB PNG
>>
File: 00244-1281223158.png (1.99 MB, 896x1152)
1.99 MB
1.99 MB PNG
>>
File: 00002-1281223162.png (1.7 MB, 896x1152)
1.7 MB
1.7 MB PNG
>>
>>102049701
idk about you, but my boss wouldn't be happy if he saw that on my monitor
>>
File: ComfyUI_Flux_10405.jpg (340 KB, 768x1344)
340 KB
340 KB JPG
>>102050425
weird catbox request
>>
>>102050215
thanks
>>
File: ComfyUI_32799_.png (1.25 MB, 1024x1024)
1.25 MB
1.25 MB PNG
>>
>>102050417
this is what a dog looks like when you're on acid
>>
>>102050237
great
>>
>>102050473
Having a boss that can see your monitor in 2024 is on you matey
>>
File: bComfyUI_108644_.jpg (809 KB, 2048x1024)
809 KB
809 KB JPG
>>
>>102050435
Yeah well some of us are (a) fucking retarded and (b) want to make a tiny tweak of exactly what was posted, not play guess and check to get something that sort of looks like it
>>
>>102050473
Let's be honest browsing 4chan at work is risky no matter what lol
>>
>>102050444
This is fucking sick
>>
File: ComfyUI_32803_.png (1011 KB, 1024x1024)
1011 KB
1011 KB PNG
>>
Okay, is it perhaps too easy to train flux LoRAs? Civit is full of garbage right now.
>>
File: flux_00650_.png (1.1 MB, 1280x1024)
1.1 MB
1.1 MB PNG
MIDRIFF!

Will hollywood PLEASE do their damned job so that I don't have to do this myself???
>>
>>102050684
>>
File: 00078-4003759954.png (2.22 MB, 1024x1440)
2.22 MB
2.22 MB PNG
>>
>>102050639
Prompt was Soul, and it had a movie portrait lora

But Soul without a lora also does some really random images. Sometimes it's a dog, sometimes it's a person and then sometimes it's images like that
>>
>>102050694
Love me some midriff anon, midriff has be a staple prompt from the early days.
>>
>>102050684
Basically Civit let's your train a lora on their site using Buzz, so yeah all kinds of people have easy access to make a random lora
>>
So basically for 99% of people flux is good enough for everything, and its two blind spots are:
-porn
-hentai
-intellectual property
So someone just needs to make a dataset of a few thousand images to cover those, and then it can be applied to any open sores models going forward to unpoz them.
>>
>>102050750
that's three things not two
>>
File: 00013-AYAKON_1248186.png (3.7 MB, 1536x2560)
3.7 MB
3.7 MB PNG
>>
>>102050767
Are you accusing me of not being able to count? I know my numbers!
>>
Dumb question but does the order of your words in prompts matter?
>>
>>102050787
Closer to the beginning of the prompt = more attention
>>
>>102050792
I see. I'll keep that in mind thanks.
>>
File: 00086-1840624089.png (2.2 MB, 1024x1440)
2.2 MB
2.2 MB PNG
>>
File: ComfyUI_32807_.png (2.59 MB, 1536x1536)
2.59 MB
2.59 MB PNG
>>
why do my loras only work at 1.5+ strength
every one I've downloaded just consumes every face in the gen, but mine has to be teased out, whether it's been baking for 2 or 20 hours
tell me why goddamit
>>
File: 00162-1299597972.jpg (736 KB, 1248x1864)
736 KB
736 KB JPG
my gf
>>
>>102050767
I added hentai as an afterthought
>>
File: bComfyUI_108688_.jpg (869 KB, 1920x1088)
869 KB
869 KB JPG
>>
>>
File: ComfyUI_32808_.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
>>
>>102050883
Okay now we need an ai to set everything up to let me buy this cup.
>>
File: 1717920938121114.png (845 KB, 768x1024)
845 KB
845 KB PNG
Anyone know what I'm doing wrong. I put "1 girl, single picture" in my prompt. I also put "multiple girls, 2 girls, 3 girls, background characters" in the negative prompt and I'm often getting stuff like this? Or two side by side pictures in one.
I just started today and I'm using Easy Iffusion with the perfect world model. I'm going to try new models in a sec.
>>
File: ComfyUI_32809_.png (1.12 MB, 1024x1024)
1.12 MB
1.12 MB PNG
>>
>>102050914
tank you
>>
>>102050907
Your prompt sucks. Copy some prompts from people that know what they are doing. Check the model page on civit.
>>
>>102050907
This looks like SD1.5. It frequently does things like that if you prompt larger than 512x512. You need to add more context like "sitting behind a desk" to reduce the likelihood of this appearing.
>>
>>102050899
>>
>>102050966
I see. I'll try something like that.
>>
>>102050828
The more familiar the model is with the lora's concept or the more generalized the lora is, the less strength is required. Too low of a learning rate can also cause this.
>>
>>102050997
the recommended was 0.0001, so I bumped it up to 0.0005, and same deal. There's hints that it's in there, but I need to dial the strength up to get it to show up
I'm just gonna try 0.001 and see what happens. I don't care if I overshoot, at least I'll know it's doing something
>>
>>102050985
Omg.
It’s so cute.
I need a gf so I can buy things like this “for her” instead of either having plain cups or seeming gay.
>>
File: 00017-4132157855.png (1.15 MB, 1216x832)
1.15 MB
1.15 MB PNG
so romantic
>>
>>102051020
You can try using Prodigy optimizer and enable the Tensorboard to see the learning rate used towards the end of training. The optimal learning rate would be around that value.
>>
>>102051064
idk why people don't just use the prodigy optimizer more often than not. Sure, it make take a few extra steps to find it's groove, but it's not gonna deep fry your shit.
>>
File: ComfyUI_32814_.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>
>>102051034
dude I have several cupboards filled with vintage ceramics & all sorts of things ceramic. (ex gf was ceramic artist). its very manly. no catgirl cup tho.
>>
File: 00312-AYAKON_12481768765.jpg (458 KB, 4000x1500)
458 KB
458 KB JPG
Dehya love, made this mixing pony for the character and flux for the background
>>
File: ComfyUI_02177_.png (1.66 MB, 1312x1024)
1.66 MB
1.66 MB PNG
>>
anyone got a nice local model recommendation that will LLM-ify my basic Flux prompts?
>>
File: ComfyUI_04165_.png (1.52 MB, 896x1088)
1.52 MB
1.52 MB PNG
>>
File: ComfyUI_32817_.png (999 KB, 1024x1024)
999 KB
999 KB PNG
>>
>>102051221
just run it through grok, its really good about cleaning up shit prompts
>>
>>102051205
STARTIN TO LIKE IT BRO. also, I keep thinking about burning my balls with a lighter. might be really nice.
>>102051221
I use ollama&Llama3.1, seems to work well.
>>
>>102051221
gemma-2-2b-it-abliterated-IQ4_XS works pretty well, I don't even have to unload it while using Q4KS flux and t5 models
>>
>>102051205
He is thinking about one of uncle Ben's teachings:
With a lot of subscribes comes a lot of money.
>>
File: ComfyUI_32819_.png (1.51 MB, 1024x1024)
1.51 MB
1.51 MB PNG
>>
>>102051151
if you told me this isnt ai i would have believed you
absolutely high quality stuff
>>
>>102051335
thanks, it's flux now letting me do nice backgrounds, SDXL based models including pony were terrible at them
>>
File: ComfyUI_32818_.png (1006 KB, 1024x1024)
1006 KB
1006 KB PNG
>>
File: ComfyUI_32821_.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>
>>102051379
the cat lol
>>
File: ComfyUI_32822_.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>
Trippin balls
>>
Why does every vlm out there insist on starting each caption with "This image depicts"?
>>
>>102051424
you'er asking it to describe/prompt an image, so it starts with "the image depicts/describes/shows/etc"
like "explain teh basis of a to me" will answer "the basis of a is..."
you have to tell it not to do that, or tell it to do something in different words
>>
>>102051424
turd-wrangle it into not doing it.
>>
grrrr.. I am fucking done with just 32GB system ram and stutters on loading models .. just ordered 64GB
>>
File: 00014-AYAKON_1248182.png (1.47 MB, 1280x1280)
1.47 MB
1.47 MB PNG
>>102051424
are you specifically asking it to do that lol? Many scripts such as joycaption ask it to start with that phrase to align the model instead of it saying something stupid like "that is a nice picture!"
>>
>>102051460
i'm on 128gb and flux+a lora has me at 90gb sometimes (via forge)
might be the dozens of tabs i have open too, tho
>>
File: Untitled.png (470 KB, 1201x554)
470 KB
470 KB PNG
>>102051469
>>102051456
>>102051453

Since the model is literally just completing the text, I found the best way to deal with it is to be a retard for the model and write out "This image depicts" for it so it continues from that point
>>
>>102051485
that works too lel
>>
File: 00112-1778854220.png (2.26 MB, 1024x1440)
2.26 MB
2.26 MB PNG
party rockers in the house
>>
>>102051335
>4 fingers on left hand
>believable
>>
File: ComfyUI_00940_.png (1.54 MB, 1152x1536)
1.54 MB
1.54 MB PNG
>>
>>102051460
bro did the same on thurs. my main image gen system is 3090 with dd4, saw 64gb ddr4 3600 was $110 and ordered that shit
>>
>>102051485
Those are pretty good results for local, what model did you use?
>>
>>102051485
nice lol.
>>102051460
>>102051485
wth. i forcethe fp16 clip into the ram, the model obviously occupies my vram and its all fine with 32gb ram
>>
>>102051472
64gb is max for me on this system, my next system ill go memory maxing regardless of any current "you only need" meme, but building a system now is out of the question with all the failure on release happening again and again, not early adopting 9000 series nor Ultra 2xx series .. I am not a beta tester for AMD or intel
>>
>>102051533
lol check task manager. Your pagefile is getting gangbanged when you switch loras.
>>
>>102051534
yeh last i built was on a 570 chipset and i've maxed it out on processor and ram, gonna be a while for me before doing a new build
>>
>>102051205
This guy is absolutely based.
>>
>>102051533
>all fine with 32gb ram
ya this >>102051544
>lol check task manager. Your pagefile is getting gangbanged when you switch loras.
my NVME is fast enough to not make my break down in tears, but if loading 2 loras running fp16 I hit max vram and system ram on loading, once it runs its fun it goes back to "just" 27 of 32GB .. but the loading and sorting process is insane
>>
>>102051205
when the world needed him most...
>>
>>102051547
got a b550 AMD platform .. that maxes out at 64GB .. and wrose its an mITX board .. so I have to get a single new kit of two .. well it will be here on monday .. and goting from 2800MT/s to 4400MT/s also will help alot I hope
>>
>>102051572
i went all out with a big case (6 ssd +2m2) and 3090ti, 128gb and 5950x, i'm set for the next 6 months at least the way things are going nowadays
>>
>>102051524
It's just joy caption. I've tried COGvlm, internvl and even gpt4 but unless I suddenly grew a second 3090, I think joy caption is best local option out there right now. Intern VL is a very close second, even at 8B, it would be my go to if it did a better job reading a complex system prompt and being consistent with it, but at 8B it's a bit too retarded. I suspect the 70B version is excellent.

My current method is I have a trap that takes batch processes the images and second tab where I can find and replace words like "A man", with the character's name.

automatic captions with cleanup seems to be most surefire way right now.
>>
>>102051601
isn't joycaption just a llama3 wrapper?
>>
>>102051601
I can't get 70b working on local with it because the joycaption author seems to have only trained the model for 4096 dimensions
>>
>>102051605
It is. But it works quite well.
>>
>>102051605
check the folder ... its a small 190mb finetune if ya wanna say a lora for llama3. So kinda you are right.
>>
>>102051101
What's optimum for prodigy? (Steps, epochs, etc)
>>
File: 00120-2461395209.png (2.2 MB, 1024x1440)
2.2 MB
2.2 MB PNG
i fucking love flux bros, this shit feels like 2 years ago when nai was leaked
>>
>>102051614
ya need 70b to describe pussy? cmon.. the mini llama is just fine for joycaption
>>
>>102051630
yaa.. that, got the same elusive addictiveness, I wonder when we see flux.berrymix
>>
>>102051631
if you have ever tried out 8b models vs 70b+ you would know the small models are fucking retarded and are probably making the captions much worse
>>
File: flux_00679_.png (1.17 MB, 1280x1024)
1.17 MB
1.17 MB PNG
>>
File: 1716295189785289.png (674 KB, 1792x1024)
674 KB
674 KB PNG
>>102051631
>>102051646
>>
>>102051631
that'd be some juicy pussy.
>>
File: FLUX__00100_.png (932 KB, 1024x1024)
932 KB
932 KB PNG
>>
>>102051623

1 epoch = one loop around all the images in your dataset
1 step = 1 image processed in your dataset being processed (I forget how this worked with batching but the math is the same)
You might want to to about 3 or 4 repeats of your images in a dataset.

But the general copium is 2000 - 4000 steps with 2-4 repeats per image.
The chances that your final product is the best iteration of your LoRA usually isn't likely, so you just pick the best performing one from all the steps saved along the way.
>>
>>102051620
I'm having trouble wrapping my head around how it works. So It's a LoRA that reads outputs from a clip model and has Llama 3 interpret them?

Can we use other clip models aside from the google one?
>>
>>102051663
Oops, I told a lie here, one epoch = the number of images in your dataset * the number of repeats.
So if you had 10 images with 2 repeats, 1 epoch would be 20 steps.
>>
File: ComfyUI_32823_.png (1.12 MB, 1024x1024)
1.12 MB
1.12 MB PNG
>>
>>102051674
heck if I knew .. better ask in /lmg/ JoyCaption was the first time ever I felt compelled to run an LLM locally, but I guess so? but I think what counts is which llama you use to make it "better" at describing. I catched it at describing some pixel art yesterday instantly as a screenshot from Zelda for the NES and than blabbered about what Link does instead of thinking of it as pixel art .. so I guess the 8b can be quite dumb as >>102051646 anon say but ... no idea if you can just throw in a bigger llama and have it be smart enought to discern pixel art from an 8bit video game screenshot, I don't think the google clip is the problem
>>
File: ComfyUI_32826_.png (1.13 MB, 1024x1024)
1.13 MB
1.13 MB PNG
>>
File: flux_00682_.png (1.21 MB, 1280x1024)
1.21 MB
1.21 MB PNG
>>
File: ComfyUI_32816_.png (937 KB, 1024x1024)
937 KB
937 KB PNG
>>
File: ComfyUI_32831__cleanup.png (809 KB, 1024x1024)
809 KB
809 KB PNG
>>
I am stuck genning girls with police hats, someone help
>>102051740
coming down is the hardest thing. (lol)
>>
File: FLUX__00010_.png (1.35 MB, 1024x1024)
1.35 MB
1.35 MB PNG
>>102051763
post em
>>
File: 00008-3177437783.png (661 KB, 1024x1024)
661 KB
661 KB PNG
>>
>>102051623
It largely depends on your dataset. In my experience, for a single concept or character, 1000 total steps is enough excluding regularization images. It's best not to pack too many things into a single lora because of potential bleeding issues, though I've see a lora that has hundreds of separate concepts. 100 max steps per image. You can go lower if other images have details that you want to keep for the character or concept.
>>
File: Untitled.png (29 KB, 422x643)
29 KB
29 KB PNG
>>102051736
Well, its training config is right there in the files, it specifically states it was trained on Llama 3.1 8B so I don't know if it's as simple as plugging a bigger Llama model would work, more layers and all that.
It looks like there's a dataset he's using to train it we also don't have eyes on, but I bet you could train your own captioning LoRA if you wanted to using this.
>>
>>102051797
>>
File: ComfyUI_32834_.png (3.04 MB, 1536x1536)
3.04 MB
3.04 MB PNG
>>
File: aseet.jpg (20 KB, 542x375)
20 KB
20 KB JPG
>>102051763
>>
>>102051811
ya sounds logical, guess I have to learn all that LLM stuff to for the sake of doing good prompts and captions.

I yesterday just let JoyCaption look at some of my pictures in my old wallpaper folder and put that into FLUX, and damn some of em it replicated very well. The whole epic boomer prompt writing art is something that is abit alien to me after writin long comma seperated tag lists fod Stable Diffusion models. But I can see why its very powerful cause t5 understands things like an llm
>>
File: ComfyUI_32835_.png (945 KB, 1024x1024)
945 KB
945 KB PNG
>>
>>102051850
love those!
>>102051655
>>102051863
>>
File: file.png (1.08 MB, 2560x1297)
1.08 MB
1.08 MB PNG
>fully automated luxury 1girls
>>
>>102051872
but you ease into it, no? we all learned to write stories. I feel weird letting an LLM write a prompt for me. its cool to get inspiration tho. I also did the same thing with joycaption on some older gens and its funny how close the result sometimes was after throwing the prompt into flux.
>>102051921
scary lol
>>
Mr. Baker...
>>
>>102051961
yea ofc, writing what you actually want to see in natural language is the far superior approach the whole tag system was always just cause it technological wasnt possible locally .. we all gotta move on .. well I guess the ponies will be still stuck with score_123 for a while
>>
woa.. img limit hit before msg limit.. thats rare, who will bake? collage baker is euro, probably deep asleep now
>>
>>102052036
https://www.befunky.com/create/collage/
>>
so, is flux with a GGUF flux model and a lora as fast as with the regular fp8 quant on comfy now? I 1.5s/it vs 1.05s/it for me and I can't pull. just wanted to know
>>
>>102052065
not in my experience, FP8 with --fast on a 4000 series GPU is league's ahead
>>
>>102052087
I wish I had an ada gpu. oh man. how fast can you go? 0.5s/it? npk0
>>
i'm training a lora right now so i cant check. can flux do gigachad?
>>
>>102052110
>>102052110
>>102052110
>>
>>102052121
It cannot
>>
been out of the game for a while and just saw that there's a new model called Flux. What's the quick rundown? Is it better than SD?
>>
File: 1695895945890016.jpg (77 KB, 896x512)
77 KB
77 KB JPG
>>
File: 1724092979930154.jpg (61 KB, 896x512)
61 KB
61 KB JPG



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.