/g/ - Technology


Thread archived.
You cannot reply anymore.



File: tmp.jpg (757 KB, 3264x2176)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102480758

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/kohya-ss/sd-scripts/tree/sd3

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
File: 1707951945531960.png (1.85 MB, 1024x1024)
>>
I'm trying to create a LoRA for Flux using ComfyUI and the custom node "ComfyUI Flux Trainer". I was able to get a dataset and start training but at 750 steps in (25%) I got an error saying:
RuntimeError: torch.cat(): expected a non-empty list of Tensors

It looks like it was trying to generate a sample image and was unable to do so. I don't really know why this happened and was wondering if anyone could give me some guidance to either (A) fix this problem or (B) suggest another way to locally create and train LoRA for Flux.
>>
File: 1723683402283590.png (1.48 MB, 1024x1024)
>>
File: 1695619181873168.png (1.01 MB, 1024x1024)
>it's over
>>
two qs
1) how can i make their boobs have a natural sag?
2) is there a model out there that specializes in nature houses and shit?
>>
File: 1701129504012043.png (1.37 MB, 1280x832)
>>
File: bComfyUI_118501_.jpg (298 KB, 1024x1024)
>>
File: 0.jpg (272 KB, 1024x1024)
>>
File: XL252024.jpg (207 KB, 1200x1520)
Morning all
>>
File: 1719240086429912.png (1.11 MB, 896x1152)
>>
>>102504336
the words natural and sagging both work as adjectives for breasts, as well as "breasts apart" and shit, just read this https://danbooru.donmai.us/wiki_pages/tag_groups
>>
>>102504449
good stuff
>>
File: 1167752492635494314-F1.png (1.9 MB, 896x1152)
>>
File: 1167752492635494374-SD.png (1.96 MB, 896x1152)
>>102504753
>>
File: 1167752492635494332-SD.png (1.72 MB, 896x1152)
>>102504761
>>
File: 1167752492635494355-SD.png (2.5 MB, 1128x1200)
>>102504778
>>
File: 1167752492635494377-SD.png (1.64 MB, 1152x896)
>>102504787
>>
>>102504858
>>102504787
>>102504778
>>102504761
>>102504753
too noisy
>>
>>102504858
pony? pretty good
>>
File: 1167752492635494388-SD.png (1.29 MB, 1152x896)
>>102504858
>>102505046
guess that lora is too strong
>>102505070
nope, flux
>>
>>102504858
either too bad or too good trigger disc
>>
>>102504495
The chud tranny
>>
Are Forge or reForge compatible with AMD GPUs on Linux? I don't see that they have install instructions for that, but I don't know what I'm doing.
>>
>>102505409
yea
>>
File: bComfyUI_118093_.jpg (291 KB, 768x1024)
>>
>>102498629
I have a theory that you could jailbreak it with a few hundred or a few thousand if you do certain things, but it would require you to write sexual words in l337sp34k for every prompt because bfs has essentially poisoned their dataset for all sexual words. It’s like how if you use “Emma Watson” for a character lora instead of “emmwat” it will be fucked up, because “Emma Watson” already means something.
“Penis” already means “shirtless headless male” in Flux. Training against that is a waste of time. Needs to be TRUP3N1S or something.
>>
>>102504296
the west has fallen :(
>>
>>102506085
>it would require you to write sexual words in l337sp34k for every prompt because bfs has essentially poisoned their dataset for all sexual words. It’s like how if you use “Emma Watson” for a character lora instead of “emmwat” it will be fucked up, because “Emma Watson” already means something.
>“Penis” already means “shirtless headless male” in Flux. Training against that is a waste of time. Needs to be TRUP3N1S or something.
the catastrophic forgetting of neural networks during finetuning could be used to our advantage: let's say Flux thinks "Penis" means "shirtless headless male"; if we add more pictures of penis with the right "Penis" prompt, maybe it'll help Flux forget that Penis means "shirtless headless male"?
>>
>>102506132
Yeah and the loras that use the normal words do work, but you need less images/get a better result if the word is new. At least in theory. I asked the other person that made a penis lora how many images he used. If it’s a lot more than one using l33t (which took 40) then I consider my POC a success and will keep going. I already consider it a slight success despite being burnt since mine makes zero instances of that “tarantula mouth balls” that every other penis Lora has that’s clearly from the poisoned original dataset.
I just want to make one perfect tiny lora that changes the base model the smallest amount possible, such that you can just put a regex between the prompt box and the model and it acts for all purposes like a “jailbroken” version of the base model.
>>
>only three imgs in the collage
It's over
>>
>>102506440
>I just want to make one perfect tiny lora that changes the base model the smallest amount possible, such that you can just put a regex between the prompt box and the model and it acts for all purposes like a “jailbroken” version of the base model.

It would help usage if there were a standard, but having a standard could make future poisonings easier.

is there a metadata slot that we could put a machine-readable replacement map into?

cravf intvan avccyr nahf pyvgbevf
>>
>>102504229
Use Kohya GUI.
>>
>>102507011
I’m sure there’s a comfy node for word replacement, or it could be made in 20 minutes. From playing with dev I think it would only need a few dozen words tops redefined.
>having a standard could make future poisonings easier.
That’s actually the best part. If they poison a new version of the model against the new word, you just change the word and retrain. Oh they poisoned TRUC0CK? A few hours on a 4090 and now it’s TRXP3NIS instead. It could be a randomly generated string. It doesn’t matter. It’s un-poisonable.
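
A minimal sketch of the "regex between the prompt box and the model" idea, in Python. The map entries and the names REPLACEMENTS and rewrite_prompt are made up for illustration; a real map would hold whatever rare tokens the lora was actually trained on, and this is not an existing ComfyUI node.

```python
import re

# Hypothetical map: ordinary word -> rare token the lora was trained on.
# Both the words and the tokens here are placeholders.
REPLACEMENTS = {
    "castle": "TRXC4STL3",
    "dragon": "TRXDR4G0N",
}

# One pattern for all words; longest first so longer entries win over prefixes.
_PATTERN = re.compile(
    r"\b("
    + "|".join(re.escape(w) for w in sorted(REPLACEMENTS, key=len, reverse=True))
    + r")\b",
    re.IGNORECASE,
)


def rewrite_prompt(prompt: str) -> str:
    """Swap every whole-word match for its trained token, ignoring case."""
    return _PATTERN.sub(lambda m: REPLACEMENTS[m.group(1).lower()], prompt)


print(rewrite_prompt("A Dragon above the castle"))
```

If the token ever has to change, only the map changes; the prompts people type stay the same.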
>>
>>102505103
Try adjusting lora block weights.
>>
>>102507011
>>102507373
Oh nvm I see you mean a standard for all loras. Yeah that would get instantly poisoned for flux 2 and not work. I’m just talking about one lora to rule them all since if it weren’t for the censorship I wouldn’t use loras at all.
>>
>>102507420
you don't use style loras?
>>
>>102507466
No, it seems like a stopgap thing to me. Like if the model is good enough you don’t need a lora. Midjourney, dalle etc don’t have them. You just describe it.
>>
>>102507821
lol
>>
Would anyone please share an example for a GGUF Q4 flux comfy workflow? I cannot figure out what I am doing wrong.
>>
>>102507821
uhhhh
>>
can you guys come back to sdg already? this is silly
>>
>>102507842
>>102508011
I mean, in the long term am I wrong? Endgame scenario is maybe adding a collage of images in the style as input for image2image. As time approaches infinity the number of loras will go to zero.
>>
I'm gonna ask again: do we have a local option for 3D model generation?
>>
>>102508089
Okay but that's not where we are now, dog.
>>
Does anyone know why I need to install rocm to make pytorch (thus comfyui) work with my amd card?

ollama doesn't need installation of rocm, but it uses rocm.
>>
>>102508089
That's false. It would imply the hypothetical future model would be able to fulfill all requests. It's not possible to include infinitely many possibilities into a single model.
>>
>>102508077
Go back
>>
Nevermind the prompt, what would you guys use to improve the quality of the image? I'm literally just bumbling my way through at the moment.

https://files.catbox.moe/yxv37w.png
>>
>>102508323
Yeah, tell flux dev to make a grid of different angles and plug the result into any photogrammetry or Gaussian splatting software. A separate direct-to-mesh model is unnecessary.
>>
File: IMG_9908.jpg (245 KB, 1258x1125)
>>102508920
Flux really likes that color for night forests huh
>>
Any new (cope) cfg techniques lately? Last time I tried autocfg and skimmed cfg but they both result in a somewhat noisy, messy image. Also some smaller details look like shit.
>>
>>102509003
the only real cfg technique is to turn it up to 4 and disable guidance and suffer
>>
>>102508323
SAI has a text to 3D model but that's it
>>
>>102509003
nope, it's a field that surely deserves to be explored more, have you tried Tonemap and DynamicThresholding as well?
>>
File: mermaid_0007f.jpg (1.39 MB, 2048x2048)
> NotImplementedError: Cannot copy out of meta tensor; no data! Please use torch.nn.Module.to_empty() instead of torch.nn.Module.to() when moving module from meta to a different device
Flux lora training is such agony
>>
File: file.png (1.82 MB, 1152x896)
https://civitai.com/models/787392/emad-mostaque?modelVersionId=880549
lawl
>>
File: 00012-2320925639.png (1.43 MB, 1280x1024)
>>102504155
>>102504233
>>102504296
>>102504449
nice chudjak gens, here's one of mine
>>
>gpu busy
>eh well, I can use huggingface and try out the new joycaption to do this dataset by hand in the meantime
>going smoothly
>joy caption 504 gateway timeouts
g-guess I'll wait to use my PC
>>
File: flux_piano_1.jpg (613 KB, 1920x1080)
>102498629
Basic nudity can be restored easily with lora. I've tried it for anime and it works just fine. You need a decently captioned dataset with close-ups and it would just work.
>>
>>102509694
Fix, mistyped >>102498629
>>
>>102509694
Could you please share catbox?
>>
>>102509645
Sooner or later all the joycaption spaces will dry up for good :(
>>
does anyone know what model was used to train all the FLUX training data?
>>
It's bullshit that we "need" style loras for a model as large as Flux
>>
>>102509798
it is, a giant model like that could've eaten all the concepts that exist in the history of humanity, but the BFL team is too cucked to do it
>>
>>102509798
i have a feeling that we can coax more out of it than what we've been able to thus far removing the need for many existing loras
>>
>>102509756
I made it with heavily modified krita-ai-diffusion plugin and I'm too lazy to make it save metadata properly, so I can't share the catbox, but I can describe the workflow.

Prompt: "High-quality anime screencap depicting a theatrical stage inside of a mansion. The seats are dark and empty, while the stage is well lit.

A grand piano is on stage. A young woman sits at the piano. She has black hair in twin braids and is dressed in long and intricate classic maid dress."

Workflow: my own style lora, BasicGuider at 3.5. Generated at 50% of the final resolution, after that highres fix, after that a targeted highres fix of the girl and the piano.
>>
>>102507877
Have you looked at this?
https://github.com/city96/ComfyUI-GGUF
>>
>>102509050
>quadruples your inference times
Heh, nothing personal
>>
>>102509908
CFG "only" doubles your inference time lol
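
For anyone wondering where the doubling comes from: classifier-free guidance runs the model twice per step, once with the prompt and once with the empty/negative prompt, then blends the two predictions. A toy sketch in Python; CountingModel and the fixed 0.1 step size are stand-ins made up for illustration, not a real sampler or Flux code.

```python
class CountingModel:
    """Fake denoiser that only counts how often it is called."""

    def __init__(self):
        self.calls = 0

    def __call__(self, x, prompt):
        self.calls += 1
        bias = 1.0 if prompt else 0.0  # pretend the prompt shifts the prediction
        return [v * 0.5 + bias for v in x]


def cfg_step(model, x, cond, uncond, scale):
    """One guided prediction: uncond + scale * (cond - uncond)."""
    pred_cond = model(x, cond)      # forward pass 1: with the prompt
    pred_uncond = model(x, uncond)  # forward pass 2: empty/negative prompt
    return [u + scale * (c - u) for c, u in zip(pred_cond, pred_uncond)]


def sample(model, x, cond, uncond, scale, steps):
    """Toy sampling loop; the update rule is a placeholder."""
    for _ in range(steps):
        pred = cfg_step(model, x, cond, uncond, scale)
        x = [v - 0.1 * p for v, p in zip(x, pred)]
    return x


model = CountingModel()
sample(model, [1.0, 2.0], "a cat", "", scale=4.0, steps=20)
print(model.calls)  # two model calls per step
```

With guidance off there is a single call per step, so the cost ratio is 2x, not 4x.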
>>
>>102509775
caption all the training data*
>>
>>102509775
>>102509997
we don't know anything about the details of the training, they just gave us the model and that's it
>>
File: file.png (3.87 MB, 1200x1799)
looks like more finetunes are coming to flux, for the moment it's not anything serious but it's always good news to see people doing it
https://civitai.com/models/785421/sdvn11-ghibli-flux?modelVersionId=878315
>>
>>102510087
are they actually finetunes or are people just merging loras to jeetmix like usual
>>
>>102510132
for that one it's not specified it was a lora merge so I guess he made a real finetune
>>
>>102510087
>entire "finetune" for a single style
>ghibli even
>>
>>102510166
yeah, I don't see the point either, if you want to make a single style, do a lora
>>
>>102509316
i hope you figure it out anon <3
>>
Is there a better version of this:
https://huggingface.co/Aitrepreneur/FLUX-Prompt-Generator somewhere?

My flux prompts suck and I think it is because I keep sticking to booru tags and stupid XL tricks.

I am going to use it as a base and convert it so you can do reasonable things with it (set host, port, not ollama). I want something standalone so I am not interested in the comfy node.
>>
>>102509876
nta but ty
>>
>WAS node suite giving me some gayass quote everytime I launch Comfy
>>
>>102510397
i haven't had much trouble prompting similar to before, it's just like, before the model would mostly ignore your prompts for the position of things and the details on them, it was pure rng, so you could omit those from the prompt. now you need to include them.
>>
File: elf_0089.jpg (1.29 MB, 1792x2304)
>>102510318
Thanks, I think I've got it working. Just run the kohya scripts from the command line, don't try to use any GUIs. It is training now, VRAM looks good. We'll see in a few hours.
>>
>>102510561
please post the script when you do. you're on the flux branch i presume?
>>
>>102510087
His website links to this Flux colab. Never heard of it before. https://colab.research.google.com/github/StableDiffusionVN/SDVN-training-colab-flux/blob/main/SDVN_Flux_Training.ipynb

>>102510561
Did you try that ai-toolkit?
>>
File: file.png (36 KB, 975x606)
>>102510664
So I guess that was a Lora all along?
>>
>>102509829
The only Lora that needs to exist is a single one with accurate anatomy, major celebrities/IP, and like twenty modern artists.
>>
>>102510561
i wish elf girls were real
>>
File: elf_0086.jpg (1.28 MB, 1792x2304)
>>102510635
The command line argument I'm using (on a 4090) is:

accelerate launch  --mixed_precision bf16 --num_cpu_threads_per_process 1 flux_train_network.py --pretrained_model_name_or_path C:/ai/ComfyUI/models/unet/flux1-dev.safetensors --clip_l C:/ai/ComfyUI/models/clip/clip_l.safetensors --t5xxl C:/ai/ComfyUI/models/clip/t5xxl_fp16.safetensors --ae C:/ai/ComfyUI/models/vae/ae.safetensors --cache_latents_to_disk --save_model_as safetensors --sdpa --persistent_data_loader_workers --max_data_loader_n_workers 2 --seed 666 --gradient_checkpointing --mixed_precision bf16 --save_precision bf16 --network_module networks.lora_flux --network_dim 32 --optimizer_type adamw8bit --learning_rate 1e-4 --cache_text_encoder_outputs --cache_text_encoder_outputs_to_disk --fp8_base --highvram --max_train_epochs 10 --save_every_n_epochs 1 --dataset_config d:/ai/lora/data/dataset.toml --output_dir d:/ai/lora/output --output_name luisroyo_flux --timestep_sampling shift --discrete_flow_shift 3.1582 --model_prediction_type raw --guidance_scale 1.0 


Basically kohya's recommended way to run it with a few little tweaks to the paths, the number of epochs, and the network_dim. As long as it runs and produces a lora, I can usually figure out how to wrangle the other stuff to get something usable. It uses about 18GB VRAM. I *think* you could probably update the command to use the fp8 versions without too much agony.
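
Since the one-liner is hard to read, here is the same command split into a Python list, one option per line. It is only a readability sketch: every flag and value is copied from the command above, while the names LAUNCHER, SCRIPT_ARGS and build_command are made up, and the script just prints the command instead of running it.

```python
import shlex

MODELS = "C:/ai/ComfyUI/models"  # paths copied from the command above
LORA = "d:/ai/lora"

# Launcher part: these options belong to `accelerate launch`, not the script.
LAUNCHER = [
    "accelerate", "launch",
    "--mixed_precision", "bf16",
    "--num_cpu_threads_per_process", "1",
    "flux_train_network.py",
]

# (flag, value) pairs in the original order; None marks an on/off switch.
SCRIPT_ARGS = [
    ("--pretrained_model_name_or_path", f"{MODELS}/unet/flux1-dev.safetensors"),
    ("--clip_l", f"{MODELS}/clip/clip_l.safetensors"),
    ("--t5xxl", f"{MODELS}/clip/t5xxl_fp16.safetensors"),
    ("--ae", f"{MODELS}/vae/ae.safetensors"),
    ("--cache_latents_to_disk", None),
    ("--save_model_as", "safetensors"),
    ("--sdpa", None),
    ("--persistent_data_loader_workers", None),
    ("--max_data_loader_n_workers", "2"),
    ("--seed", "666"),
    ("--gradient_checkpointing", None),
    ("--mixed_precision", "bf16"),
    ("--save_precision", "bf16"),
    ("--network_module", "networks.lora_flux"),
    ("--network_dim", "32"),
    ("--optimizer_type", "adamw8bit"),
    ("--learning_rate", "1e-4"),
    ("--cache_text_encoder_outputs", None),
    ("--cache_text_encoder_outputs_to_disk", None),
    ("--fp8_base", None),
    ("--highvram", None),
    ("--max_train_epochs", "10"),
    ("--save_every_n_epochs", "1"),
    ("--dataset_config", f"{LORA}/data/dataset.toml"),
    ("--output_dir", f"{LORA}/output"),
    ("--output_name", "luisroyo_flux"),
    ("--timestep_sampling", "shift"),
    ("--discrete_flow_shift", "3.1582"),
    ("--model_prediction_type", "raw"),
    ("--guidance_scale", "1.0"),
]


def build_command() -> list:
    """Flatten launcher + script options into one argument list."""
    cmd = list(LAUNCHER)
    for flag, value in SCRIPT_ARGS:
        cmd.append(flag)
        if value is not None:
            cmd.append(value)
    return cmd


print(shlex.join(build_command()))
```

Changing the epochs, network_dim or paths then means editing one line each instead of hunting through the one-liner.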
>>102510664
> Did you try that ai-toolkit?
I didn't try that one. I used the kohya-ss-gui I got good use out of for SDXL loras. Supposedly it supports flux but I just could not make it work.
>>102510725
Me too, friend.
>>
>>102510827
>>102510635
And the commit hash of sd-scripts is
1286e00bb0fc34c296f24b7057777f1c37cf8e11

I dunno what branch that is on. The kohya gui checks it out as a submodule and that is the revision it is using.
>>
>>102509332
It's been a number of months since I've thought about Emad.
>>
File: 00295.png (1.89 MB, 1216x832)
>>
Are there any tricks to training small loras?

I've noticed that disabling layers with the lora block loader isn't the same as not training those layers. Merging loras trained on different layers also doesn't replicate the lora block loader's effect. It seems that all layers are interdependent.
>>
File: 00300.png (2.02 MB, 1408x704)
>>
>>102510968
they are connected yes
personally i always train then resize until it breaks
>>
>>102509645
>>102509772
i cant find any joycaption spaces that aren't 504ing
>>
>>102510827
That command is going to end up paywalled behind the Turk's Patreon in 2 hours.
>>
File: catgirl_0217.jpg (854 KB, 2432x1664)
>>102511184
Well if he can goad people into paying him for publicly available info (from Kohya himself at that) then kudos to him for the hustle.
>>
What name for this style would flux recognize? I know in human language it's 'AI slop' or 'stable diffusion anime', but idk what the actual word is for this 'anime but with weird shiny pseudo-3D shading and giant eyes that is literally only done by AI' style. I can get everything right from base flux dev except that weird hair/face.
>>
>>102511298
Let's make an ai persona that teaches people how to make an ai educational virtual persona that cyber-begs. This has to be the #1 business idea of the present century.
>>
>>102511298
I love how Flux combines model looks with garbage porn clothes.
>>
>>102511570
Well that is not flux, that is pony realism. And the porn clothes is probably because I used a porn image and used it with wd14 tagging and controlnet to make a catgirl pic.
>>
>>102511570
You can't tell the difference between Flux and XL?
>>
After being stuck on a potato for months, I just got back into local via Forge since it looks like A1111, but I'm kinda wondering if I should try to use Comfy, as it seems like it has more tools available? Haven't had a chance to try either much, last time I tried Comfy I kept getting an error and couldn't get it to do anything.
>>
>>102511723
If you're used to A1111, then give forge a try, or I think reforge which is the current best version (not sure). If it gives you problems or you want to give comfyui a try you can do that too. Comfy does seem to be pulling ahead in terms of tooling but for how most people do image gen forge is still plenty useful and has a familiar interface.
>>
>>102511639
Would Flux look better?
>>
okay am I retarded, is it possible to use AMD's pytorch rocm docker with comfyui?
>>
>>102511723
unless you have really specific shit you want to automate forge is a lot comfier. comfy is very uncomfy
>>
>>102511779
The VAE itself gives Flux images higher detail but "looks better" is up to your own feelings. I appreciate both for different reasons.
>>
>>102511800
Does Forge have rocm built-in?

I want to avoid installing rocm proper if possible, because it likes to wreck Linux installs.
>>
>linux server running ComfyUI
>open webpage through Macbook
unironically the most comfy
>>
>>102511830
>have PC with dual GPUs
>never "use" it as anything but a --listen slave for my laptop from the lazyboy chair
comfy
>>
cozy thread
>>
>>102511455
If the default flux anime style doesn't do it for you, there are probably plenty of Loras trained on XL/1.5 slop on Civitai.
>>
File: 1705680686193564.jpg (528 KB, 1792x2304)
>>102511770
>>102511800
Appreciate the input, guess I'll stick with Forge for now while I give Flux a whirl.
>>
File: elf_0807.jpg (732 KB, 1792x2304)
>>102511862
Lol that's me except I only have one 4090 GPU
>>
is it just me or have a bunch of servers been getting slammed by DDoS in the past week or so? huggingface now too. who pissed off the botfarm
>>
File: big boss smoke.gif (1.94 MB, 160x200)
>just found out i didn't have cuda enabled after 2 months of genning
i am still new to local stuff, for both text and images
and every day i am finding small ways to speed things up

fuck
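
A quick way to check this from Python, assuming the UI runs on PyTorch. cuda_status is a made-up helper name; torch.cuda.is_available() and torch.cuda.get_device_name() are the actual PyTorch calls. Run it with the same Python environment the UI uses, otherwise it tells you nothing about that install.

```python
import importlib.util


def cuda_status() -> str:
    """Describe whether PyTorch in this environment can see a CUDA device."""
    if importlib.util.find_spec("torch") is None:
        return "torch is not installed in this environment"
    import torch

    if not torch.cuda.is_available():
        return "torch is installed but CUDA is not available (running on CPU)"
    return "CUDA available: " + torch.cuda.get_device_name(0)


print(cuda_status())
```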
>>
>>102512046
If she didn't have the knife ears this would be pretty good
>>
>>102512091
shut your mouth
>>
File: 1699753559928987.png (638 KB, 893x711)
i can run the normal flux-dev-fp8.safetensor using a checkpoint loader, however when i use the dev-fp8 model in pic related i cant without getting dtype_5 errors. apparently i have to use a diffusion loader and add my own clip and vae, but then i run out of vram. am i doing something wrong, or are merges just much more vram intensive?
>>
>>102512281
He'd be better off going to dedicated coomer threads on /aco/ and other places than here. We have our 1girls too but it's not an ideal venue.
>>
>>>/bant/21272174
>There was a request for AI abs and feet, but this was the best I can find on /g/ after going through 3 long threads edition
Holy kek
>>
>>102512290
>>102512294
there're some good 1girls posted here at least
>>
>>102512281
crosslinking is three meme arrows, silly

after examining this thread, their skill is even worse than /b/, and while their taste is normie it's not bad at all. also the thread is like 85% images which is pretty based
>>
File: hm.jpg (2.12 MB, 2112x1344)
>>
>>102512395
The one on the left is great.
>>
>day 22
>>
>>102512018
prompt?
>>
>>102512492
no
>>
>>102512492
With metadata:
https://litter.catbox.moe/4au49l.png
>>
>>102512546
Thanks!
>>
>>102511172
Is it over?
>>
>>102512471
 at night, the wind, digital photoshop collage acid trip, opalescent, long exposure shutter drag macro 
>>
>>102504144
>FLUX.1 "schnell"
>1 image takes over a minute on my 40xx
>>
File: bird.png (1.28 MB, 704x1344)
>>
>>102512546
nice anon, i thought it was a real picture
>>
>>102512736
kek is this because of my RAM?
>>
>>102512878
gens take me 7 minutes with negative prompting enabled, on my AMD card. You're doing fine.
>>
File: 2024-09-22_00002_.png (674 KB, 1280x720)
contribution.
>>
>>102512736
Are you doing 4 steps?
>>
File: 1725919076251292.jpg (346 KB, 1792x2304)
>>102512749
Thanks anon. Oddly enough, subsequent gens of the same prompt have tended to look a bit more plasticky for some weird reason.
>>
>>102512281
Creepy
>>
I'm trying to improve this personal LORA of mine by giving it more data of different background scenes/locations, anyone have tips on how i can force this little fidgety nigger to gen something its training probably won't allow? Like let's say a scene of a simple modern bedroom that doesn't have too much stuff in it, my LORA will force a TON of weird random psychedelic shit, is there a way around this? Any scene loras?

>i'm probably overthinking it again
>>
File: bComfyUI_120489_.jpg (686 KB, 1024x1024)
>>
>>102512996
Your lora might be undertrained or your dataset is too diverse.
>>
>>102513054
the dataset was intentionally a no fucks given slam dunk of over 200 gens from 1.5 done out of sheer curiosity when i had over 2k buzz on civitai
decided i should at least gen some shit in xl and make it consistently good this time
it overtrained on a TON of victorian/rustic looking furniture scenes so most indoor scenes end up having a lot of gold picture frames and wooden furniture in them kek
this time it'll be an actual attempt at a good dataset, about as much training on characters again but more diverse.
>>
>>102513042
Prompt?
>>
File: 1726992390.png (844 KB, 1024x1024)
>>
>>102511639
you can tell from the pixels.

t. the guy who knows what photoshop looks like.

>>102512061
I am getting 504 from HF. Weird if that is a DDoS

>>102512736
I get slightly faster times on my 3090. Might want to check some configuration settings.
>>
>>102513131
are you undervolting your 3090? How is it at genning multiple images in XL at once?
might hold out to buy one until they're like $500, which should be around the 5000 launch.
>>
>>102513145
I bought mine used. The price isn't dropping. When everyone doesn't move to the 2x price for 4GB VRAM it is going to be messy.

I don't track my XL batch times, I set it to 4 and then run multiple runs. It just works. It is worth the upgrade not to OOM when you want to run IOPaint/lama at the same time.
>>
um there are pornstar loras now for Flux.
>>
>>102513042
>>102513093
nice
>>
>>102513190
> It is worth the upgrade not to OOM when you want to run IOPaint/lama at the same time.

so fucking based god damn, guess that settles my decision for me.
>>
>>102513210
depending on your cash flow a [3|4]080 is dirt cheap if you want to kick the can down the road and run 2 gpus.
>>
>>102513232
How is it working out with AMD apus and lots of ram?
>>
>>102513236
no fucking clue. I got tired of troubleshooting AMD last year. The arch maintainer guy got overloaded and I was stuck on some old ass CUDA version.
>>
How do I use a Lora with a negative prompt workflow?
>>
File: bird.png (788 KB, 704x1344)
>>
>>102512736
You’re doing something wrong. It’s slow as FUCK but it’s like 20s.
>>
Okay so for LLMs, there’s stuff like vLLM and exllama and TensorRT that are basically designed to inference as quickly as possible, for like businesses.
Does that exist for flux? Or is however fast comfy is as good as it gets?
>>
>>102513405
>however fast comfy is as good as it gets
>fast comfy
>>>/sdg/
>>
>>102510166
I find it hilarious that they would use Ghibli for it, considering how that old coot feels about AI, or technology outside of planes and warmachina.
>>
File: clueless.gif (1 KB, 128x128)
>Hmm if a 3080 can run SDXL while genning batches i wonder if my 1080 can run it while playing God Hand on pcsx2..
>>
>>102513507
>*while running batches + an llm
why is my brain like this at 8pm every night lately
>>
>>102513503
that's the point, we can respect the artist but not the man, I genuinely think Miyazaki is a retarded man, but as an artist he's a completely different person
>>
Is there a Precious Moments lora?
>>
>>102513669
>Precious Moments
Like this?
https://civitai.com/models/288609/awkward-family-photo-meme-sd15xl-or-everyone-knows-that-guy?modelVersionId=328968
>>
>>102513933
No, it's figurines and cartoons where the heads are all almost exactly the same, with the same eyes on all of them. They're amazingly ugly.
>>
File: precious bowel movements.png (1.79 MB, 1835x907)
>>102513933
he means this
>>
File: 1697426208770.jpg (211 KB, 1024x1024)
>>102514041
weird. I was thinking bobble head after that description.

An old thread says you can just say style of precious moments.
https://desuarchive.org/g/thread/96704377/#96705075

pic related and it isn't mine.
>>
File: 2024-09-22_00015_.png (1.22 MB, 1280x720)
>>
fwiw I have no idea why it came out like that.
>>
you guys seen what the chinamen AI is producing , dayumn

>>>/aco/8523781
>>
>>102514213
>it finally got to the point of making actual porn now
holy fucking shit we are so fucked if we get this open source by the end of the year

kek that fucking WWE self suplex one
>>
>>102504144
Cotmake Ail Time!
>>
>>102514213
>>102514225
>look what a cloud resource can do
nobody here cares.
>>
>>102514432
can you READ you RETARDED ass NIGGER?
i hope you kill yourself via cum coma when cog 10b comes out
>>
flux was a monkey's paw model, chinaman our only hope now
>>
chinaman abandoned us
>>
no..
>>
>>102513093
*DING*!
>>
>>102514225
>if we get this open source by the end of the year
we wont but if we did no one would be able to run it
>>
hugging face is back up if anyone else was waiting for it
>>
>>102514659
I was. Thanks.
>>
File: sendhelp.png (794 KB, 1262x919)
my joycaption is possessed wtf
>>
File: mip cross.png (152 KB, 350x273)
>>102514833
run to your nearest orthodox or catholic church, acquire holy water, mist it gently on your PC (UNPLUGGED/TURNED OFF)
repeat if necessary.
>>
>>102514845
no lips, no ears, no neck, no shoulders, no arms, no legs, no body, no eyes, no eyebrows, no mouth, no teeth, no nose, no eyes, no eyebrows, no lips, no teeth, no nose, no mouth, no eyes, no eyebrows, no lips, no teeth, no mouth, no eyes, no
>>
>>102514833
Nah it just recognized the image as fujo art and accidentally associated her adjacent emo poetry with it
>>
>>102514935
I'm not sure which possibility is scarier
>>
File: joycaption aids.png (1.48 MB, 1836x891)
>>102514833
possession aside it looks to be broken in general
>>
>>102514833
descriptive works fine though
>This is a vibrant CGI image featuring Princess Peach from the Mario video game series. Princess Peach is depicted in a stylized, cartoonish manner, with exaggerated features such as large, expressive eyes and a wide, open mouth. She is wearing a red baseball cap with the word "PEACHES" in white letters, and a blue and white striped crop top with matching blue denim overalls. Her long, flowing blonde hair cascades down her back, and she has large, round blue earrings. Princess Peach is holding a fishing rod in her right hand, with a small fish in her left hand, which is raised up towards the camera. The background shows a serene, sunlit lake with gentle ripples, and lush greenery on the distant shore. The sun is bright and high in the sky, casting a warm glow over the scene. The image captures a playful, outdoor moment, with the character's expression suggesting excitement and joy. The CGI style is highly detailed, with smooth textures and vibrant colors, creating a dynamic and lively atmosphere.
>>
>>102514972
werking for me
>The image is a high-resolution photograph of a sleek, modern water dispenser. The dispenser is primarily white with a glossy finish, featuring a rectangular shape with rounded edges. The top of the dispenser houses a large, clear, blue plastic water cooler with multiple stacked layers. The cooler has a translucent blue tint that allows light to pass through, giving it a slightly cloudy appearance.
>Below the cooler, the dispenser has two water outlets with blue and white handles, indicating hot and cold water options. To the right of the outlets is a small red button, likely a dispensing control. The front of the dispenser has a small compartment door, which is closed in the image, possibly used for storing additional accessories or supplies.
>The dispenser stands on a flat, white base, providing stability. The overall design is clean and minimalist, with a focus on functionality and modern aesthetics. The background of the image is plain white, ensuring the focus remains on the dispenser itself, making it easy to discern its details and features. The photograph is well-lit, highlighting the smooth surfaces and glossy finish of the dispenser, giving it a polished and professional look.
>>
>>102514147
issokay
>>
>>102514972
the weird part is I was doing a 'training_prompt' which shouldn't have shit out booru tag shit to begin with. seems like it works now that I retry the same image, but it definitely gets confused
>>
>>102513072
>using ai images for training
DOA
>>
>>102516057
>yet it worked amazingly well anyway
get fuuuuuuuuuuuuukt
>>
>>102516255
you were the one who was having problems
>>
>>102516290
>he's trying desperately to argue with someone on the basket weaving forum of all places
>and he chose someone who loves arguing for the fun of it
retard
>>
New flux lore just dropped?
>>
>>102516449
what?
>>
>>102516449
what?
>>
>>102516589
>>102516505
what?
>>
>>102516723
hwat?
>>
is someone shilling that """""""""uncensored""""""""" lora again lol
>>
File: tmpq74tmqmx.png (1.03 MB, 896x1152)
>>
File: ComfyUI_01699_.png (1.09 MB, 1024x1024)
>>
>>102516449
>>102516782
i was genuinely asking i don't know what the fuck he meant by lore
>>
>>102514547
>we wont but if we did no one would be able to run it
really depends, we could use GGUFs to get something close
>>
File: file.png (59 KB, 1000x1000)
that's interesting, Flux is really slow so it could help
https://github.com/discus0434/comfyui-flux-accelerator
>>
File: tmpkh7gsacv.png (1.01 MB, 768x1024)
>>
Why did Ran leave this general to rot?
>>
>>102517184
because they're more occupied with shitposting around drama, OR they do post here without avatarfagging
>>
>>102517195
Great answer, appreciate this.
>>
>>102517062
thanks for sharing this globuloid, 4chan's gookfed servers are busted and it took 18 minutes for this to load.
>>
>>102517208
thought it's just my shitty internet connection, glad to know we're all suffering
>>
>>102511830
Literally me but Windows and an AMD card
I really should just dualboot at this stage
>>
>>102511830
or you could just run forge
>>
File: tmp30yvoj8e.png (872 KB, 768x1024)
>>
File: 1712321577210518.png (1.92 MB, 1024x1024)
what's the recommended node for stacking loras
>>
>>102517613
>mm, yes, that's a fine piece of wood
>>
>>102517613
>>102517625
Yep, that's wood.
>>
>>102504144
>24 hours old thread
>>
File: tmpcdc1fl6k.png (999 KB, 1024x768)
I'm not using flux much due to technical limitations, but I'm noticing that it lacks variety when generating with the same prompt. Seed by seed the images seem to differ less than what I'm used to from previous finetunes. Feels more stiff, even if more reliable prompt-wise.
>>
File: tmp3eiz_gz8.png (1.37 MB, 896x1152)
>>
>>102517613
https://github.com/Suzie1/ComfyUI_Comfyroll_CustomNodes
someone recommended the one from this to me but i ended up not using it and just chaining my loras together.
>>
What does chaining loras do/mean? Or at least how does it differ from what I guess is the default method to apply them.
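A rough sketch of what chaining amounts to, assuming plain LoRA where each one adds a scaled low-rank delta to the same base weight (the helper names and toy matrices are made up for illustration, and this is not any UI's actual code). Under that assumption, chaining two loaders is just adding both deltas, so the order of the chain doesn't change the result:

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def apply_lora(weight, down, up, strength):
    """Return weight + strength * (up @ down) without modifying the input."""
    delta = matmul(up, down)
    return [
        [w + strength * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


# Toy 2x2 base weight and two rank-1 LoRAs (hypothetical values).
base = [[1.0, 0.0], [0.0, 1.0]]
lora_a = {"up": [[1.0], [2.0]], "down": [[0.5, 0.0]]}
lora_b = {"up": [[0.0], [1.0]], "down": [[0.0, 2.0]]}

# Chain A then B: the second LoRA is applied to the output of the first.
a_then_b = apply_lora(
    apply_lora(base, lora_a["down"], lora_a["up"], 0.5),
    lora_b["down"], lora_b["up"], 0.25,
)

# Chain B then A: same deltas, opposite order.
b_then_a = apply_lora(
    apply_lora(base, lora_b["down"], lora_b["up"], 0.25),
    lora_a["down"], lora_a["up"], 0.5,
)

print(a_then_b)
print(b_then_a)
```

A stacker node is mostly a convenience wrapper around the same thing: one node with several slots instead of several nodes wired in a row.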
>>
>>102517613
rng
>>
>>102517908
https://github.com/rgthree/rgthree-comfy

Configure it with right click after you add it in comfyui, it can retrieve trigger words too and previews
>>
>I accidentally set cfg to 300 somehow
>only 5 of my queue finished last night
ngmi
>>
>>102517208
Globuloids are the distant relatives of garloids. Some less initiated can still easily get confused.
>>
File: bComfyUI_120722_.jpg (604 KB, 1024x1024)
>>
>>102517908
what's rng?
>>
>>102517830
>>102517908
>>102517927
alright thanks
>>
File: tmpm1v0skan.png (1.18 MB, 1152x896)
>>
>>102517036
Big if true
>>
File: bComfyUI_120920_.jpg (534 KB, 1024x1024)
>>
How do I view filenames in civitai?
>>
>>102517952
what do the gens look like tho
>>
>>102518306
> no prognathism
>>
>>102516928
Terrifying
>>
A good vagina dataset is 100x harder to get than a good penis dataset.
Partly because women aren’t pathologically hooked on uploading 12MP zoom lens portraits of their vaginas with a CC license, and partly because like 90% of vaginas are fucked looking and ugly with a flaccid/hidden clit
>>
>>102518446
>>102517952
lol I meant I set steps. cfg would be crazy
>>
>>102518551
ive maybe seen a small handful of perfect pussies in my days, and both were owned by big booty pale skinned latinas.
>>
>>102518627
Lucillexs and ceoofgothicc
just for context.
>>
Flux LORAs needed:
1. More famous men
2. Some ugly women
3. Some disfigured ones (Halloween!!!!)

imo hair color doesn't matter, doesn't Flux easily change hair color?
>>
>>102518691
You will use the gigantic model with no loras and you will be happy.
>>
>>102518430
i think the only way is to download the image
>>
File: 2024-09-23_00004_.png (862 KB, 720x1280)
>>102518446
>>102518578
>>102517952
cfg is capped at 100.

Here's before. After arrives in 3 minutes (I have a slow card).
>>
>>102518742
aha, makes sense, thanks!
>>
File: 2024-09-23_00005_.png (681 KB, 720x1280)
>>102518750
cfg 100
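For reference, a minimal sketch of what the cfg number does in standard classifier-free guidance, assuming the textbook formula (the function name and toy vectors are made up, and guidance-distilled models like Flux dev may handle guidance differently). The scale multiplies the gap between the conditional and unconditional predictions, which is why huge values blow the image out:

```python
def cfg_combine(uncond, cond, scale):
    """Classifier-free guidance: uncond + scale * (cond - uncond), elementwise."""
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]


# Toy predictions (hypothetical values): they differ only in the first component.
uncond = [0.0, 1.0]
cond = [1.0, 1.0]

print(cfg_combine(uncond, cond, 1.0))    # scale 1 just returns the conditional prediction
print(cfg_combine(uncond, cond, 100.0))  # the differing component is pushed 100x further
```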
>>
>>102518742
I fucking hate this so much. It really sucks that it is the preferred unique id for the site. The modelid and modelversionid fall apart quickly. If a user re-uploads something that isn't exactly the same except for the contents, everything in the API gets fucked. If there is a gap in my scraper between getting the metadata and getting the file, half the time everything falls apart for me.
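One way to catch that gap, as a sketch only: hash the downloaded bytes and compare against whatever SHA-256 the metadata reported at fetch time. This assumes the metadata carries a SHA-256 for each file (check the actual API response, I'm not vouching for the field names), and every name below is hypothetical:

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Lowercase hex SHA-256 of the downloaded bytes."""
    return hashlib.sha256(data).hexdigest()


def file_matches_metadata(data: bytes, expected_sha256: str) -> bool:
    """True if the file content matches the hash recorded with the metadata."""
    return sha256_hex(data) == expected_sha256.strip().lower()


def archive_key(model_id: int, version_id: int, data: bytes) -> str:
    """Archive key that includes a content hash, so a re-upload under the
    same ids gets a new key instead of silently overwriting the old file."""
    return f"{model_id}-{version_id}-{sha256_hex(data)[:16]}"


# Toy example: pretend these bytes are the downloaded file.
payload = b"abc"
recorded = sha256_hex(payload).upper()  # hash as it might appear in metadata

print(file_matches_metadata(payload, recorded))        # content matches
print(file_matches_metadata(b"changed", recorded))     # file changed after metadata fetch
print(archive_key(1234, 5678, payload))
```

If the check fails, refetch the metadata and retry instead of archiving a mismatched pair.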
>>
>>102517770
>>102518070
very cool
>>
What kinds of upscalers are used for anime images nowadays?
>>
>>102517184
>>102517195
>>102517201
Why do you care?
>>
>>102518884
*Grabs your nose and steals it*
Now what?
>>
One thing I don't like about Flux is how for anime style illustrations it still looks mostly lackluster. Hopefully over time it gets better
>>
>>102518863
what kind? Gotta know if you want those lines crisp.

>>102518887
it's 4chan. I suspect it will end up someplace horrible or sexual for a small group of people. I haven't seen nose/knee porn yet, but it is just a matter of time.
>>
>>102518828
For what purpose are you scraping shitty Civitai gens?
>>
File: 1167752492635494398-SD.png (1.66 MB, 896x1152)
>>
>>102518900
>what kind? Gotta know if you want those lines crisp.
I'm not sure yet, I just don't like the Lanczos-blur.
>>
flux really doesnt know body types I think..
what are some prompt tags that work?
>>
>>102518936
pudgy.

belly.
>>
>>102518891
That’s what I was trying to explain earlier. SD has this “pseudo-3D vibrancy” that is somehow both the core signature of AI slop and looks objectively better than real anime.
>>
>>102518939
anything for thick woman? muscular woman?
>>
>>102518918
Making the worst lora of all time
>>
File: flux_4_1.jpg (336 KB, 832x1216)
>>102518891
It learns styles well, so it's not a huge problem in the end.
>>
>>102518918
archiving. Loras are useless without documentation. Grabbing the sample images and other details is nice after something gets scrubbed.

>>102518932
try anime-6B (I am not looking up the exact name) then and come back with your results.

>>102518944
usually better to describe the person, amazon woman rather than thick. It is flux so you will mostly get Instagram selfie sized women no matter what you do.
>>
>>102517036
How much does it change the output?
>>
>>102519026
dunno I haven't tried it, I don't think it's working for GGUF quants
>>
Omg they stole anon's nose, how will anon recover
>>
>>102519000
>archiving
Oh you’re a hero. It’s only a matter of time before half of the site is purged. civitai.green will be what the entire site looks like in a year. I’ve talked to VCs that use the phrase “boiling the frog” without the faintest hint of hesitation or shame. This has been the plan from the beginning.
>>
File: upscale.png (2.66 MB, 2048x2048)
>>102519000
I guess that works. Thanks anon.
>>
File: ComfyUI_00056_.png (1.21 MB, 720x1280)
>>102518944
idk yet
>>
>>102519000
Thanks chad, any chance you'll have it as a torrent?
>>
File: ComfyUI_00058_.png (1.14 MB, 720x1280)
>>102518944
morbidly obese
>>
File: IMG_0122.jpg (477 KB, 1125x1115)
>>102518944
“Female bodybuilder”
>>
>>102519111
mama
>>
File: IMG_0125.jpg (846 KB, 981x996)
>>102519204
Wholesomely obese
>>
>>102519253
nice
>>
>>102519111
That torso looks dead-on like my mom when she had ascites requiring a permanent drain and was weeks from death
>>
File: IMG_0126.jpg (837 KB, 1010x1167)
What did Black Forest labs mean by this
>>
>>102519319
It says right there - Run!
>>
File: 00039-132621961.png (1.84 MB, 1024x1536)
>>102518943
eh, the signature look of "ai slop" is more applicable to 3d and disney-mimicking pictures in my opinion.
SD based models are good for anime, they look like the style of many good illustrators. You can make stuff that looks like something drawn by Saitom, Mataro, Tony Taka, Kurehito Misaki... Not thanks to SD of course, but because of the Novel AI leak.
Unfortunately it's limited because it's old stuff. This picture is with a SD 1.5 model from early 2023. It's pretty much ancient in terms of AI.
I see all the new exciting stuff coming only for comfy and Flux, but I still can't easily make pictures that look as good as I could with these old models... and that sucks, because I want to play with the new toys.
>>
>>102519272
you should make a lora of her (to keep her memory alive)
>>
>>102519358
>SD 1.5 model from early 2023. It's pretty much ancient in terms of AI.
I have kept sd 1.5, but I deleted sdxl. flux is better.
>>
File: ComfyUI_00059_.png (1.03 MB, 720x1280)
>>
File: bComfyUI_120817_.jpg (524 KB, 1024x1024)
>>102519253
how about obese granny bodybuilder?
>>
File: ComfyUI_00060_.png (991 KB, 720x1280)
>>102519424
Once this thing is realtime, and talking to me, I'll never leave the house.
>>
File: bComfyUI_120988_.jpg (149 KB, 1024x1024)
>>102519449
same bro
>>
>>102519128
possibly. It depends on how bad it gets. Half the loras have had their time and should disappear into the history of the internet. Most of the stuff that has been wiped has been styles and celeb stuff. I haven't seen much in the way of three way scat session involving bestiality so I am not too fussed about the censorship of civit. As far as the celeb stuff, I am fine with the bar being higher. If I see another low quality t. swift I might have to yell at the next starbucks guy I see.

>>102519449
start your scraping for your TTS. If anyone has an AI wav merge method I would love to hear it.
>>
File: IMG_0127.jpg (775 KB, 1001x1012)
>>102519435
>wholesomely obese elderly female bodybuilder
Me irl
>>
>>102519449
Looks underage and way too fat
>>
>>102519516
wasn't as horrid as i thought it was going to be
>>
>>102519546
I first put it as a joke, but “wholesomely” is weirdly good at making a fat person with dignity instead of the most disgusting swamp orb you’ve ever seen
>>
>>102519584
morbidly obese had some interesting results too i see lol
>>
Disgustingly Obese Women Collage
>>
>>102519319
shitty background reminds me of pony models kek
>>
>>102519424
>>102519449
holy fucking KINO what are you prompting to get this ((build))??
>>
mistral nemo is vramlet SOTA in the llm space and it was made in collaboration with nvidia. does that mean bigma will be our nemo equivalent since nvidia is involved with it?
>>
File: IMG_0128.jpg (976 KB, 1027x1130)
>>102519652
>>
>>102520026
i like the 1 random midget
>>
>>102520053
It has absorbed the “one random child to absolutely ruin the harem hentai” trope
>>
>>102519901
i sure hope thats the case
>>
seems like one of nai's models got leaked or something
>>>/h/8218392
>>
>>102520302
again? lmaooooo
>>
>>102520302
>>>/h/8218422
seems like the torrent isn't working, it probably was some troll or something
>>
>>102520342
try again, it woiks for me. Illustrious-XL-v0.1
>>
>>102520302
I’m not risking glowie shit to find out WTF that is
>>
>>102520363
but that model is already on huggingface?
https://huggingface.co/OnomaAIResearch/Illustrious-xl-early-release-v0/tree/main
>>
The next loaf of bread is here...
>>102520372
>>102520372
>>102520372
>>
>>102520363
>XL
yawn... I'm sure NAI finetuned Flux at this point, we need that model instead kek
>>
>>102519855
the word "obese", but with the Hayden Panettiere lora.
>>
>>102520469
>that's the gospel of obesity according to whatever model you're using
I see..


