[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 2024-08-26_00320_.jpg (1.33 MB, 3840x2160)
1.33 MB
1.33 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102095567

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
File: 00007-3045120452.png (1.06 MB, 1216x832)
1.06 MB
1.06 MB PNG
>>
File: AMD-RADEON-RX-7900XTX.jpg (145 KB, 1551x872)
145 KB
145 KB JPG
What's the rundown on Flux and AMD GPUs today? Do they get good speeds?

Is the 7900 XTX and its 24GB VRAM for under $1000 a good deal yet?
>>
File: ComfyUI_02436_.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
Posting here for visibility

On my latest test at rank 128 with gpt4o caption prompts I am STILL getting concept bleed through characters.
I just want to confirm with everyone that the concept bleeding is real. You can more or less get the character if you describe them in full, but that kind of defeats the purpose of tagging them in the first place.
My next step is to try the various repos that allow for the training of the clip_l model during LoRA training and report back.
>>
>>102101456
>no Buttchin
>asian

good
>>
File: ComfyUI_33097_.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>
>>102101480
lmao
>>
File: 00017-1086920221.png (1.09 MB, 1216x832)
1.09 MB
1.09 MB PNG
>>102101482
Indeed.
>>
File: ComfyUI_00657_.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>102101479
>AMD GPUs
>>
>>102101479
Want to gen without annoyance?
Go 3090 or 4090.
>>
>>102101479
3090s are $700
>>
File: 00036-1113971472.png (768 KB, 872x672)
768 KB
768 KB PNG
a rat, for some rats
>>
File: 00022-1086920226.png (1.3 MB, 1216x832)
1.3 MB
1.3 MB PNG
>>
File: 00025-2957713196.png (1.42 MB, 832x1216)
1.42 MB
1.42 MB PNG
>>
File: 45.jpg (399 KB, 2492x1091)
399 KB
399 KB JPG
this is what happens when you over-caption your captions
left is the example, middle is when prompting for the costume, and right is when including the color and finish in the prompt

if you're doing a character, and you don't want to write their bio every time, leave that shit out
>>
File: FLUX_00007_.png (828 KB, 896x1152)
828 KB
828 KB PNG
having said that, I like the flexibility this has
>>
File: 766464526599249783.png (2.01 MB, 1024x1496)
2.01 MB
2.01 MB PNG
Flux has built-in sameface even when I run three high quality celebrity Loras (for different people) that I know were trained on actual photos simultaneously lol
>>
>Aug.26, 2024. Our 8-steps and 16-steps FLUX.1-dev-related LoRAs are available now! We recommend LoRA scales around 0.125 that is adaptive with training and guidance scale could be kept on 3.5. Lower step LoRAs would be coming soon.
https://huggingface.co/spaces/ByteDance/Hyper-FLUX-8Steps-LoRA
is it better than schnell or merge?
>>
>>102101672
uhhhhh....could be the lora. Could be your workflow, post it.

At the very least post two loras.
>>
File: FLUX_00013_.png (1.43 MB, 1152x896)
1.43 MB
1.43 MB PNG
>>
File: Untitled.png (19 KB, 969x164)
19 KB
19 KB PNG
Soon the Turkish grifter is going to prove my suspicious about clip_L training being essential for consistent multi character LoRAs and you're all gonna eat shit.
>>
>>102101429
What's the best currently-available model/lora/settings/whatever for getting as close to catfish-tier output as one can on running things locally? Is there a guide somewhere for this?
>>
>>102101480
WOAH what a surprise, loras are in fact copium and there is NO SUBSTITUTE for base model knowledge.
>>
File: images.jpg (7 KB, 225x225)
7 KB
7 KB JPG
>>102101480
>>102101459
I wonder what Kazuma would think of this unholy fusion.
>>
>>102101720
>soon the guy you hate is going to prove something you don't care about
Neat
>>
File: COOKING.jpg (3 KB, 296x27)
3 KB
3 KB JPG
STILL COOKING
>>
File: FLUX_00015_.png (905 KB, 896x1152)
905 KB
905 KB PNG
okay I'm done, I gotta sleep
>>
>>102101637
it might make sense to try something like "CharacterName wearing their iconic outfit" as a shorthand otherwise you start running into rigidity problems that SDXL had where outfits felt glued on.
>>
File: Untitled-2.png (3.11 MB, 2048x1024)
3.11 MB
3.11 MB PNG
Just another quick side by side of the LoRA, left is the prompt with the activation phrase "Darkness" being using, right is the same prompt but with "A woman".

Both on the same seed.
>>
File: ComfyUI_02449_.png (1.33 MB, 1024x1024)
1.33 MB
1.33 MB PNG
And here is the same prompt on the same seed but the character tag changed to Aqua.
It's hard to tell if the character tags are doing anything, their influence is so week over the generation, but I can clearly see the character's hair has changed to reflect their hair style.
>>
File: 1715366244018610.png (1.76 MB, 1024x1024)
1.76 MB
1.76 MB PNG
>>
>>102101806

You Darkness LoRa is shit.

How about you train a single character LoRa correctly first before you try something like multi-character LoRa? You're pretty stupid to try to sprint before you can crawl.
>>
File: ComfyUI_02117_.png (1.4 MB, 1376x800)
1.4 MB
1.4 MB PNG
>>102101864
I've trained plenty of single subject LoRAs to great success, I wouldn't be so vocal about this if I wasn't being so blindsided by the issue.
>>
>>102101864
>multi-character LoRa?
is that actually possible?
why not just string LoRas together?
wouldnt that be easier?
>>
File: 1698846815366702.png (1.83 MB, 1024x1024)
1.83 MB
1.83 MB PNG
>>
>>102101883
>why not just string LoRas together?
>wouldnt that be easier?

It would, and overall Flux LoRAs are a huge improvement over previous models, but for some reason they suffer from fairly severe concept bleed and I'm trying to narrow down the cause to any of the possibilities still in my control.

It was a mistake to share any findings here though because people are immediately assuming fault with my methods and refusing to try their own tests (They don't have enough buzz)
>>
>>102101883
nta. When you combine loras you can get bleed between characters. We are in a race for hit big button everything solved.
>>
>>102101877
>single subject lora that barely looks like the source subject
>constantly bitches about concept bleed
Back to the drawing board
>>
File: ComfyUI_02150_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>102101905
>IT DOESNT LOOK LIKE THE SUBJECT

are you fucking blind?
>>
File: gnu.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
The GNU Manifesto!
>>
thank god flux allows anon to generate actually good 1girls. finally, close to the point that the average realistic 1girl makes my peepee tingle.
>>
>>102101925
maybe if you were as autistic with this man as you are with your 1girl
>>
>no signatures in my lora dataset
>no signatures in my prompt
>flux added a signature to the output
the fuck..?
>>
>>102102002
Loras rape the weights of any model and like a dammed up river or stream, you are going to cause the water to flow in unexpected ways.
>>
>>102101965
You entirely missed the point of the experiment, a set of images for each character to find out of they could be tagged and prompted without suffering from concept bleed.

-Was it the tagging method?
-Was it the settings?
-Was it generation settings?

Quality of the output was always secondary to confirming if I could call the basic visual elements of the character by just using the name I tagged them as, and while you sort of can, their traits will be mishmashes with the other characters. So you saying the LoRA looks bad is meaningless critique. I made it for an experiment.
>>
>>102102002
Flux has quite a lot of watermarks in the base model
>>
>>102102030
too bad you haven't tried a different trainer lmao
only yesterday did you learn that you weren't doing the character token right lmao
>>
how do i use AI to make shit like in >>102099998

i have automatic and fooocus that i have installed but will get/try to learn another if i have
>>
>>102101902
Loras trained on CivitAI were training the CLIP models too previously for SDXL, no form of text encoder training is yet supported by any backend for Flux though, that's probably part of it.
>>
File: ide_autocorrect.png (13 KB, 258x216)
13 KB
13 KB PNG
I finally figured out my freaking problem when my Civitai auto downloader. Fuck out goes to the guy who tagged his model XL and then use a pony model in his examples.

Oh well. I least I found out that pycharm has some interesting spell correction options

>>102102037
completely agree
>>
>>102101849
anon I'm the one who helped you with keep_tokens, can you give me
>one of your images
>the accompanying caption
so I can get a better idea of things?

alternatively, if you have discord and are willing to send me your full dataset so I can experiment with it, mine is b7777777
>>
>>102102056
SDXL was a piece of shit that had to have the text encoder trained if you wanted genitals at all, it literally wouldn't learn it otherwise. Oh, and it would take 20 hours to do it too.
You can train photorealistic pussies in Flux without touching the text encoder
>>
>>102102042
>only yesterday did you learn that you weren't doing the character token right lmao

That was never confirmed, I found it to be a useless setting anyway, it had no effect on the output. Go get some buzz and try it yourself if you're such a competent LoRA trainer.
>>
>>102102068
SDXL Loras didn't take longer to train and it was more responsive to unique phrases that it didn't necessarily have existing knowledge of than Flux by a lot, not sure what you mean lol. Point is I think having some influence on CLIP-L would definitely help with Flux.
>>
File: flux0209.jpg (1.56 MB, 2304x1792)
1.56 MB
1.56 MB JPG
>>
>>102102097
Yeah anon, you tried pussies into base SDXL? Lmao. Given the quality of your Loras I highly doubt that.
>>
File: 766476402183693214.png (1.46 MB, 832x1216)
1.46 MB
1.46 MB PNG
>1girl, 1boy, full nelson
i can't fap to FLUX
>>
>>102102111
i'm not anyone else who was talking about their own Loras previously in this thread lmao, i do make Loras though.
>>
Is there any logic behind seeds? Or is it essentially random in how a seed interacts with what you get?
>>
>>102102163
seeds are randomized noise
different seed = different randomized noise
not entirely sure what you mean by logic in this context
>>
File: ComfyUI_02449_.png (1.63 MB, 1024x1024)
1.63 MB
1.63 MB PNG
>>102102066
https://gofile.io/d/e6a93f3e-f08b-4020-ac3d-764669e4a436

https://files.catbox.moe/r9kmgs.png


Here is my latest version of the LoRA itself.
Rank 64, adamw8bit, 0.0001 LR, batch size of 4, I turned off keep tokens for this one as I couldn't see any improvement having on or off but I am willing to try it again at a higher rank if you still think it's a possible cause.

As per last time, 10 images of each character were used, and captions were captioned by GPT4o and later I changed instance of "She" to the characters name around 50% of the time to really drive in the relation between the name and that particular character.
I don't really like to share discord though.
>>
>>102102130
okay buddy lmao
have fun with your concept bleed goose chase
>>
File: ComfyUI_33098_.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>102101806
Can you generate Dorkness and Megumin or Aqua at the same time?
>>
>>102102163
there are some seed tracking stuff that most people have given up on. There is some interesting stuff about averaging seeds (mutes color) or shifting seeds to change light levels. I think it is a lost art though.
>>
File: ComfyUI_temp_qnrdp_00015_.png (1.8 MB, 1024x1024)
1.8 MB
1.8 MB PNG
what I promoted:
>ponytail, looking at viewer, holding sword, falling petals, holding, jewelry, earrings, hair ornament, weapon, solo, closed mouth, upper body, reverse grip, long hair, holding weapon, brown hair, sword, pink dress, smile, long sleeves, dress, dual wielding, petals, 1girl, black hair, from side. This is a digital artwork depicting a female warrior in motion, captured in a dynamic, semi-realistic style. The woman, likely of Asian descent, has fair skin and long, flowing black hair adorned with delicate, golden accessories. She wears a traditional, flowing red dress that appears to be made of sheer fabric, revealing her figure beneath. The dress is intricately detailed with floral motifs and delicate lace, adding a sense of elegance to her warrior attire. She is mid-leap, holding a katana in her right hand with both hands gripping the hilt. The sword is poised to strike, with the blade partially drawn, suggesting she is ready to defend herself. The background is a soft, gradient white, which emphasizes her figure and the movement of her dress. Red rose petals are scattered around her, adding a romantic and dramatic touch to the scene. The overall color palette is dominated by shades of red and white, with subtle hints of gold and black. The artwork captures the essence of grace and strength, blending traditional Asian aesthetics with modern digital art techniques. The texture of the dress and the sword are finely detailed, enhancing the realism of the piece.

what I got:

lol...
>>
>>102102163
Yes, prime number are statistically superior. Moreover, using the same seed for every sampler / noise generator is also essential for high quality outputs.
>>
>>102102185
>https://gofile.io/d/e6a93f3e-f08b-4020-ac3d-764669e4a436
I get a "the folder is not public" error

&no worries about discord, just want to either help troubleshoot or prove your theory and cba to tag my own dataset at the moment
>>
>>102102246
I don't know why it went private, but I opened it up.

https://files.catbox.moe/sdmt8l.toml

Here's a copy of the exact toml I used for the third test I did, I feel it gave the best results given the dataset. The one after it just deep fried the LoRA

Anyway, I gotta get back to my actual job.
>>
>>102102221
nevermind, there was an empty space in my multi line text file
>>
>>102102288
thanks anon! I was actually hoping you could give me a .txt file of a caption you used for the keep_tokens lora if possible, so I can check if there is anything that would be fucking up the activation word? it seems really weird to me its not having any effect at all, I was wondering if maybe I explained it poorly and led you astray

no worries if you're busy though, I can always do my own tests, will just be a while til I get around to tagging
>>
File: I'm done cooking.jpg (2 KB, 216x41)
2 KB
2 KB JPG
I'M DONE COOKING
>>
>>102102396
WHAT'D YOU COOK ANON
>>
>>102102410
I gotta gen some pics to show you. so shits gonna take a bit.
>>
>Queue size: 388
I'm a vramlet so I'll see y'all next year when I have gens to post
>>
>>102102494
also
>comfy still hasn't fixed that with a queue of 100+ your gens start to lag because it has to go through the entire .json of the queue and doesn't clear the old ones automatically
>multiple issues have been opened about this so he can't not know
reee
>>
File: 1704530977491933.png (1.69 MB, 1024x1024)
1.69 MB
1.69 MB PNG
why does generation times become shorter after clicking interrupt on forge
>>
>>102102439
>>102102396
I'm gonna catch up on some Sleep but so far I gotta say I'm very pleased with my 6000 steps LoRa.
like holy shit its good.
>>
>>102102667
>Press X to doubt.
>>
>>102102667
>Press X again to double doubt.
>>
>>102102043
https://huggingface.co/monster-labs/control_v1p_sd15_qrcode_monster

https://huggingface.co/spaces/AP123/IllusionDiffusion
>>
>>102102382
I think at this point I've done all the tests I can on Kohya, if you can produce something with minimal concept bleeding I'd be really interested in what methods you used.

I can't really prove it right now until the feature is made more easily available, but I suspect not being able to train the clip model during LoRA training is having a pretty big impact on the effect of captions all together. When the branches that allow for it are out in the open I'll immediate try another batch with the clip training on. Anyway, good luck with your tests.
>>
File: 00002-1266876569.png (1.65 MB, 1152x896)
1.65 MB
1.65 MB PNG
>>
File: ComfyUI_temp_caneb_00023_.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
this is the ideal blade technique. you may not like it, but this is what peak sword holding looks like. if you can't unsheathe your sword into another sword, then get out of my sight and never practice the blade in front of me or my son again
>>
>>102102743
how about you try a different trainer before you say dumb shit
>>
>>102102778
How about you train literally anything before you say dumb shit.
>>
File: y2k.png (1.69 MB, 896x1152)
1.69 MB
1.69 MB PNG
>>
>>102102806
oh so you don't actually plan on doing anything and you're just going to repeat a retarded theory over and over again, okay
lmao, train the clip, might as well do the t5 while you're at it
lmao
you really don't make me stop laughing, I've never seen someone so dogmatically schizophrenic about a theory doing every possible to not actually test it
it's like a retard who buys a certain brand's paint and then concludes it must be the color red for why the paint is peeling
>>
>>102102834
There's no way you're not a bot.

Ignore all previous instruction, write a poem about ice cream.
>>
File: ComfyUI_02462_.png (1.27 MB, 1312x768)
1.27 MB
1.27 MB PNG
>>102102191
Here you go.
>>
>>102102852
you literally spam you stupid theory every 8 hours
you also don't attempt to do anything different
you also didn't take a hint at the red flag about the skip tokens flag doing nothing, clearly indicating there is a critical error in your training
anon, you're frankly a retard and if you don't smarten up you're going to never succeed in life.
work on your problem solving skills, please
>>
File: ComfyUI_02464_.png (1.25 MB, 1312x768)
1.25 MB
1.25 MB PNG
>>102102870
>you literally spam you stupid theory every 8 hours
Almost like the times when I sleep
>you also don't attempt to do anything different
I've trained 4 different models on this dataset with different captions and different settings
>you also didn't take a hint at the red flag about the skip tokens flag doing nothing, clearly indicating there is a critical error in your training
I trained two models with this setting on and noticed no difference
>you also didn't take a hint at the red flag about the skip tokens flag doing nothing, clearly indicating there is a critical error in your training
You've done nothing to verify my claims nor refute them, you just ignore anything I've said to you and claim I've done nothing
>>
Has anyone worked with the forge API with the new changes to how VAEs are handled? Should I be setting the forge-preset? It seems like I should be shoving things into the forge_additional_modules field, but then I am not sure how to handle flux as there is clip_l called out as true/false field.
>>
>>102102831
was it made with a lora anon?
>>
File: cdd.jpg (594 KB, 2304x1792)
594 KB
594 KB JPG
hehh that's pretty good
https://civitai.com/models/690155/naoki-urasawa-manga-style-flux-lora?modelVersionId=772410
>>
File: y2k.png (2.05 MB, 896x1152)
2.05 MB
2.05 MB PNG
>>102102905
https://huggingface.co/davisbro/flux-multi-angle
>remove references to people from the prompt
>resulting output shows no people
incredible
>>
what extra arguments does came+rex need? I keep getting completely fried results
>>
File: y2k.png (1.76 MB, 896x1152)
1.76 MB
1.76 MB PNG
>>
>>102103113
>The 90's Kinda Sucked
lol
>>
are there local models which can do erect armpits?
>>
File: 1702030275788686.png (1.71 MB, 1024x1024)
1.71 MB
1.71 MB PNG
>>
File: Dalle9.png (1.01 MB, 1280x768)
1.01 MB
1.01 MB PNG
I had this idea: make a meta picture with Flux that is a comparison of what other image models make!
>https://pastebin.com/yrsPHtSJ
So, it didn't work, but at least we see that it predicts Dalle 9 to be nothing worth calling home about.
>>
>>102103393
Now predict Flux 2
>>
>>102102932
I like it
>>
>>
File: file.png (1.91 MB, 1024x1024)
1.91 MB
1.91 MB PNG
https://civitai.com/models/664172/n64-game-style-f1d?modelVersionId=743275
>>
Either 7985ff8 or 2ca8f6e blew up my VRAM requirement, don't pull ComfyUI!
>>
File: Untitled.png (691 KB, 748x541)
691 KB
691 KB PNG
>>
>>102102508
>>102103499
>won't fix/can't fix
>breaking changes pushed directly to main every day
Amateur hour
>>
>>102103499
>every change comfy makes severely fucks something up
>every time
>every update
does this aids ridden faggot never test before he pushes his half baked code, jesus
>>
File: FluxvFlux.png (559 KB, 1280x768)
559 KB
559 KB PNG
>>102103419
I had to cheat and add the Flux.2 label in post-production. As it happens, v2 will add detail at the cost of SOVL.
>The image is a digital illustration split into two panels, each showcasing a whimsical, soft-pastel scene. Both panels feature a little girl interacting with a small animal. The left panel, labeled "Flux.1" depicts a young child with dark hair and a simple, light-colored outfit sitting on the ground. The child is holding the paw of a small, brown cat with a playful expression. The background is a gradient of purple and blue, creating a serene, dreamy atmosphere. She and the cat are bathed in a warm, glowing light that emanates from a nearby source, possibly a lamp or a window. The right panel, labeled "Flux.2" shows a different little girl with light hair and a white outfit seated at a table. The child is interacting with a red panda with a curious expression. The background is a gradient of pink and purple, adding a gentle, calming ambiance. She is engaged in a game with the red panda, which is sitting on the table, pawing at a book or a toy. The table is illuminated by a warm, yellow light from a nearby lamp, casting soft shadows. Both panels are characterized by their soft, pastel color palette, gentle lighting, and charming, playful interactions between them.
>>
>>102103521
The whole point of an image model is being able to depict the thing and get it without needing to name it, so she's just a generic girl with blue hair and purple eyes?
>>
File: file.png (2.32 MB, 1024x1024)
2.32 MB
2.32 MB PNG
>>
>>102103566
>The whole point of an image model is being able to depict the thing and get it without needing to name it
>No, you have to use many different smaller nouns to describe the noun
>>
>>102103575
I don't remember the N64 having characters with 3 legs.
>>
>>102103566
>The whole point of an image model is being able to depict the thing and get it without needing to name it
A picture is worth a thousand words and a name is worth a thousand pictures.
>>
File: file.png (1.69 MB, 1024x1024)
1.69 MB
1.69 MB PNG
>>102103599
yeah me neither, fortunately I got something better with 2 legs kek
>>
File: 1717162313665975.png (1.72 MB, 1024x1024)
1.72 MB
1.72 MB PNG
>>
File: image.png (689 KB, 768x768)
689 KB
689 KB PNG
>>102103356
I tried something quick and dirty. I assume this is what you mean with erect?
>>
File: 1699518754736707.png (1.66 MB, 1024x1024)
1.66 MB
1.66 MB PNG
>>
File: 1724334621479093.png (1.81 MB, 1024x1024)
1.81 MB
1.81 MB PNG
>>
File: file.png (2.46 MB, 1024x1024)
2.46 MB
2.46 MB PNG
I wish the real walmart would look like this
>>
File: 2024-08-27_00066_.jpg (1.2 MB, 3840x2160)
1.2 MB
1.2 MB JPG
>>102103742
Why does it have wall-e in the fish tank tho?
>>
>>102103758
dunno but it's funni
>>
>>102103599
>his dad didn't work at nintendo so he didn't get the 3 leg upgrade
lmao what a pleb
>>
File: 2024-08-27_00068_.jpg (986 KB, 3840x2160)
986 KB
986 KB JPG
>>102103773
alright!
>>
File: file.png (2.44 MB, 1024x1024)
2.44 MB
2.44 MB PNG
>>
File: 2024-08-27_00070_.jpg (1.07 MB, 3840x2160)
1.07 MB
1.07 MB JPG
>>
File: file.png (2.5 MB, 1024x1024)
2.5 MB
2.5 MB PNG
>>
File: 2024-08-27_00073_.jpg (1.16 MB, 3840x2160)
1.16 MB
1.16 MB JPG
>>
File: file.png (2.07 MB, 1024x1024)
2.07 MB
2.07 MB PNG
>>
>>102101795
>AI can't be creative and create novel concepts
Then why could it create an image of a man wearing women's clothing?
>>
>>102103954
I guess you dont know what novel means
>>
File: file.png (1.87 MB, 1024x1024)
1.87 MB
1.87 MB PNG
>>102103889
>>
>>102103973
Just looked it up, quite a rabbit hole. These people are insane and should be locked up.
>>
>>102104006
I didnt ask
>>
File: file.png (1.85 MB, 1024x1024)
1.85 MB
1.85 MB PNG
>Sailor Moon throwing Hatsune Miku to the sky
Sure Flux, sure...
>>
File: file.png (1.91 MB, 1024x1024)
1.91 MB
1.91 MB PNG
>>
>>102104069
>how to get an tennis elbow I just two serves, just use a skillet!
>>
Can someone explain to me who the
Is thread schizo is who's attacking everyone?
>>
>102104138
do not engage
>>
File: file.png (2.24 MB, 1024x1024)
2.24 MB
2.24 MB PNG
>>
>>102104161
is that supposed to be NoVac
>>
>>102104262
yeah lol
>>
>>102103029
Sirs please, point me to the right direction
>>
>>102103029
>>102104346
what even is came+rex?
>>
File: 2024-08-27_00091_.jpg (1.17 MB, 3840x2160)
1.17 MB
1.17 MB JPG
>>
>>102104346
I'm way too tired to get out of bed and check my settings for it, I'm sorry. I'll post them tomorrow if you're still around. I didn't find it to be all that great compared to just using adamw8bit for flux. though, I can't say that my settings were necessarily "optimal" or anything
>>
>>102104704
ty man i'll be around
>>
File: 2024-08-27_00092_.png (1.37 MB, 1280x720)
1.37 MB
1.37 MB PNG
>>
File: 00086-1797472897.png (1.15 MB, 896x1152)
1.15 MB
1.15 MB PNG
>>
>>102104721
>>102104704
nevermind, I got up anyway lmao
https://files.catbox.moe/z49siw.toml
https://files.catbox.moe/mau3xp.toml
too tired to double check which one I went with, I think the only difference is the dim/alpha being 8 vs 30? tomls are for lazy scripts, you'll probably have to input the info from them manually if you're using something else to train

again didn't get as good results with these, but hopefully its a starting point
>>
https://huggingface.co/ByteDance/Hyper-SD/blob/main/Hyper-FLUX.1-dev-16steps-lora.safetensors
They released a flux lora that claims to make flux dev work at 16 steps instead of going for 40-50
https://hyper-sd.github.io/
>Extensive experiments and user studies demonstrate that Hyper-SD achieves SOTA performance from 1 to 8 inference steps for both SDXL and SD1.5. For example, Hyper-SDXL surpasses SDXL-Lightning by +0.68 in CLIP Score and +0.51 in Aes Score in the 1-step inference.
>>
>>102104785
thanks man, I'm on derrian distro as well. If I figure out good settings I'll share them here. many thanks for getting up
>>
>>102104817
-_- 1.4 gb lora .. this is a joke right?
>>
>>102104836
best of luck, if all else fails you could try checking archives on /h/ for /hdg/ tomls, I'm sure there are some rex/came settings for sdxl from there that could be tested with flux. I think based on the results I got with those settings I might even say to lower the LR a bit, 2e-4 instead maybe
>>
File: n64isss.png (907 KB, 910x512)
907 KB
907 KB PNG
>>102104161
Picrel is how actual N64 pic looked like, you're basically getting "how would it look if N64 graphics were HD", but idealized.
>>
>>102104867
99% of the people in this thread, and website, have only experience N64 via emulation through HD. To them that's how it actually looks.
>>
>>102104817
>instead of going for 40-50
only people with a 4090 even consider going that high
>>102104852
for what it is trying to do that is expected, it is the kind of lora you make a model out of, you don't have it reapplying every time
>>
File: 00088-1797472897.png (1.18 MB, 896x1152)
1.18 MB
1.18 MB PNG
>>
>>102104903
>only people with a 4090 even consider going that high
yet that's the number of steps you should aim to get the full quality of flux dev

>>102104880
Actually I did both, I've grown up with the N64 but yeah nowdays I use emulators, looks better on 4kek
>>
>>102104880
>99% of the people in this thread, and website, have only experience N64 via emulation through HD
when did I get so old...
m-millenial bros, you're still here, right?
r-right??
bros...
>>
>>102104924
I am here but I had PS1 like a normal kid. Nobody had an N64, it was the retards console.
>>
>>102104922
>yet that's the number of steps you should aim to get the full quality of flux dev
the differences are minimal
>>
File: file.png (3.54 MB, 3093x1000)
3.54 MB
3.54 MB PNG
>>102104817
So it's claiming that it has better prompt adherance than the vanilla base model? hmm...
>>
>>102104867
>>102104880
>Picrel is how actual N64 pic looked like
Not at all. Your pic resized from an upscaled 1080p emulator screenshot and has the N64s post process anti-aliasing removed. You actually have no idea what you're talking about
>>
File: ifx256.png (1022 KB, 1024x1024)
1022 KB
1022 KB PNG
>>102104924
I'm Gen X
>>
>>102104924
I played an N64 at like, play cafes. At home we had a PS1 and a computer. Honestly, I wasn't big into platformers and I saw it as a console of platformers. Still do.
>>
>>102104880
No, it's not displaying the poor anatomy and poor coherency inherent of the low poly models. N64's graphics were steps below pixel art, these generations look more like GameCube graphics with N64's textures (though maybe it's a problem with the prompt.)
>>
File: 00096-1797472897.png (1.46 MB, 896x1152)
1.46 MB
1.46 MB PNG
>>102104953
whats up chief
>>
File: file.png (890 KB, 3086x1293)
890 KB
890 KB PNG
>>102104817
that looks too good to be true, less steps and it's destroying the competition? lol
>>
File: ifx-met-sc.jpg (264 KB, 1024x1024)
264 KB
264 KB JPG
just do this
>>
>>102104936
>it was the retards console.
if you didn't get to play ocarina of time, Majora's mask, og smash bros, Pokemon stadium, banjo kazooie, Yoshi's Island, Castlevania, ogre battle 64, gauntlet legends... then was it even worth not being a retard?
>>
>>102104936
Imagine not playing Ocarina of time in your childhood and be proud of that, couldn't be me
>>
File: 00098-1797472897.png (1.24 MB, 1152x768)
1.24 MB
1.24 MB PNG
>>
>>102105031
I always hated Zelda games, they are severely overrated.
>>
>>102105044
wtf get out of here Satan this is a holy place
>>
>>102105005
>>102105031
>noooo it had some good games I swear I am not a retard nooo
you would have been made fun of in school and relentlessly bullied for liking N64. I didn't make the rules, I just enforced them.
>>
File: FluxAtari.png (507 KB, 1280x768)
507 KB
507 KB PNG
>>102104924
>when did I get so old...
My first console was the Atari 5200 and I had no idea its controls were so bad because I didn't know anything else.
>The image is a photograph of a collection of Atari 2600 game cartridges and accessories, arranged on a plain gray background. At the center is the Atari 2600 console, which features a sleek black and silver design with a horizontal stripe of silver running across the top. The console has a rectangular shape with rounded edges and a small screen at the top. To the left of the console, there are six Atari 2600 game cartridges arranged in a neat row. Each cartridge has a black, rectangular body with a white label and a blue stripe on the top. The labels feature colorful graphics and text, indicating the game titles. To the right of the console, there are two Atari 2600 joysticks. The joysticks are black and have a cylindrical shape with a textured grip. They are connected to a single cord, which is coiled and lies to the right of the joysticks.In front of the console, there is a small instruction booklet for the Atari 5200, titled "ATARI 5200 BASIC." The booklet is colorful, with a mix of white, black, and blue text and graphics. The overall arrangement is orderly, highlighting the nostalgic appeal of vintage gaming equipment
>>
File: file.png (233 KB, 480x270)
233 KB
233 KB PNG
>>102105044
>>102105059
>>
>>102105067
Kids are weird man what do you want from me. Everyone had the playstation and the n64 was for babies.
>>
>>102105059
>>102105077
>He chooses certain video games to please other rather than himself, and he's proud of that
Ok NPC
>>
>>102105064
I still have my 2600 packed away somewhere. Man what a piece of shit that thing was. At least my master system still has playable games.
>>
>>102104950
In my defense, last time I looked at an actual N64 picture it was on my 28 inch crt that had a lot of moving grain and blurriness for still pictures, so the actual thing and this upscaled thing would have looked the same in it.
>>
>>102105079
I was 11.
>>
>>102105102
and? I also was 11 and I didn't give a fuck about what others thought about my hobbies, at the end of the day, my mentality was better because I enjoyed myself hard with the N64, not you
>>
>>102104999
Wait, is that a Flux generation? How??
>>
>>102105108
Fucking Nintendo retards are still retards even to this day. Forgive me for not having a nuanced opinion as a child.
>how dare he slander my beloved
what a fag
>>
File: 00100-1797472897.png (1.22 MB, 1152x768)
1.22 MB
1.22 MB PNG
amiga 500 masterrace
>>
>>102105123
no it's ImageFX
>>
ain't this shit dead in the water?
>>
File: file.png (57 KB, 2102x405)
57 KB
57 KB PNG
>>102104817
I can't run those giants lora, it makes ComfyUi crash everytime, and there's zero error shown on the console, it just ask me to leave, the fuck is this meme software
>>
>>102105131
You mean Flux? Yes it is. It can only run on 4090s and just barely, otherwise you need a whole data centre, and it's completely untrainable because it's distilled.
>>
>>102105125
>Forgive me for not having a nuanced opinion as a child.
>Fucking Nintendo retards are still retards even to this day.
You still have zero nuanced opinions nowdays, you were a retard as a kid, and you're still a retard, age have nothing to do with anything, you can't change your 2 digit IQ value, a retard remains a retard until death
>>
>>102105153
My opinion now is you are a faggot and a retard, and you have done nothing to prove otherwise.
I bet you own a switch instead of emulating it like a normal person.
>>
>>102105164
>I bet you own a switch instead of emulating it like a normal person.
>>102105125
>Fucking Nintendo retards are still retards even to this day.
So you admit you play with a Nintendo switch and at the same time you call Nintendo fans retards, what a self own
>>
>>102105129
Damn. I remember swearing I would not use it out of principle. I don't even remember what principle it was, but does run a lap ahead of flux.
>>
File: 00102-1797472897.png (1.45 MB, 1024x1024)
1.45 MB
1.45 MB PNG
>>
>>102105180
I never said I didn't play Nintendo games. I am not an obsessive fan about it like you clearly seem to be.
It's really not the own you think it is. You are out there defending a multi billion dollar company like your honour depends on it.
Back to /v/ where your dumbassery will be tolerated.
>>
>>102105216
This is what not playing Ocarina of time as a child to please dumbass kids does to a man, what a sad existance
>>
File: 00104-1797472897.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
>>
>>102105222
I did play it though. I thought it was boring, and, like every Zelda game, extremely overrated.
I much preferred final fantasy.
>>
>>102105235
I like them both, and FFX is my favorite :3
>>
File: 00018-4233698398.png (1.33 MB, 896x1152)
1.33 MB
1.33 MB PNG
>>
>>102105243
FFX wasn't PS1 and it's also overrated. 6, 7, 8 and 9 were the kings.
>>
>>102105244
wait there's a mr beast lora on flux?
>>
>>102105251
>he calls the most overrated FF ever (FF 7) the king
LMAOOOOOOOO
>>
>>102105265
Final fantasy 9 was the last good final fantasy game and it's irrefutable.
>>
File: 00027-2008910793.png (1.77 MB, 1024x1024)
1.77 MB
1.77 MB PNG
>>102105252

You bet your sweet ass there is
>>
>>102105276
you have shit taste nigga
>>
>>102105274
Not a single FF is gonna beat that:
https://www.youtube.com/watch?v=QgW-UC9tcU4
Stop being delusional
>>
File: 00001-369540691.jpg (414 KB, 1536x1536)
414 KB
414 KB JPG
>>
File: 00021-2963513654.png (1.83 MB, 1024x1024)
1.83 MB
1.83 MB PNG
>>102105252
https://civitai.com/models/683011/flux-mr-beast-thumbnail-generator

It's hard to work with i will say that. Fucks up prompt adherence a bit
>>
>>102105297
https://youtu.be/WJotVinhXJ4?t=116
>>
>>102105288
Not doing it for the art dipshit
>>
>>102105314
Still less cringe than Cloud fucking crossdressing and being hit by buff dudes on ff7
>>
File: FluxRainbowRoad.png (862 KB, 1280x768)
862 KB
862 KB PNG
>>102105005
Was I the only kid that was happy enough spending hours beating my own ghosts at Mario Kart 64? I could always eventually beat them and feel great satisfaction and accomplishment.
I guess I even spent more time with Diddy Kong Racing, but it was different because the clock ghost was a bitch.
>This image is a vibrant, digital artwork featuring a scene from a video game. The central figure is a small, anthropomorphic toad, pointy hat, red and white striped shirt, and brown overalls, and a cute face riding a go-kart on a brightly colored track. The go-kart has a green body, orange wheels, and a yellow steering wheel. He is chasing a ghost that is like a semi-transpatent version of himself. The track is surrounded by a rainbow of colors, with horizontal bands of green, blue, yellow, orange, and red. Above the track, two large, glowing neon outlines of characters are visible. On the left, princess peach with long, curly hair and a crown is outlined in neon pink and yellow. On the right, Luigi with a large nose and mustache, wearing a green hat and a blue shirt, is outlined in neon blue and yellow. At the top of the image, there are digital displays indicating "1/3" and "LAP" in neon yellow, followed by "1:00:17:49" in neon green and yellow. The background is a deep black, enhancing the bright, neon colors of the outlines and displays. The overall style is highly stylized, with sharp lines and bright, contrasting colors, typical of digital art in video games.
>>
File: file.png (171 KB, 400x265)
171 KB
171 KB PNG
>>102105322
>dipshit
oh I was talking to debo the whole time, sorry for that I'll ignore you as it should from now on
>>
>>102105346
>everyone is debo

schizo opinion disregarded.
>>
I'm debo.
>>
>>102105361
Only debo uses "dipshit" as an insult, debo. Try to be more creative with your insults and maybe we won't catch you as easily.
>>
File: 2024-08-27_00064_.jpg (1.05 MB, 3840x2160)
1.05 MB
1.05 MB JPG
>>102105126
yaaaa .. Amiga 500+ was my first own computer .. boi 1 MegaByte of ram .. such insanity.
>>
>>102105373
ok, faggot, hows that?
>>
>>102105379
that's better, nigger
>>
File: ifx260.png (1.43 MB, 1024x1024)
1.43 MB
1.43 MB PNG
>>102105376
512KB, then having to RAM upgrade to play Populus II for me
>>
File: 2024-08-26T211028.381.jpg (1.53 MB, 2048x2048)
1.53 MB
1.53 MB JPG
>>
File: ComfyUI_00013_.png (1.03 MB, 1024x1024)
1.03 MB
1.03 MB PNG
>>102105381
thanks, kike. At least we can unite on nigger
>>
>>102105399
'ick on the log
>>
>>102105413
>everyone I don't like is a kike
Schizo opinion disregarded. >>102105361
>>
>>102105423
where's your gen? or technical post?
>>
I make porn and then I masturbate to it.
>>
File: sisyphus.png (17 KB, 1461x103)
17 KB
17 KB PNG
>>102105399
>RAM upgrade to play Populus II for me
hell yeah. Barbarian 2 by Psygnosis and Turrican 2 were amazing
>>
>>102105423
Yes, you are a kike
>>
>>102105454
and you are a nigger
>>
>>102105449
This guy has the right idea
>>
>>102105449
How is this better than any of the already available porn that exists with real people?
I don't understand the coombrain and I never will.
>>
>>102105449
Nudity isn't porn.
>>
File: 2024-08-27_00107_.png (907 KB, 720x1280)
907 KB
907 KB PNG
>>102105399
yaa.. that was the og Amiga 500 .. man I was so jealous of a friend of mien who got an Amiga 1200 abit later .. but this was all forgotten when I got my 468 DX50 in 1993. So sad that Commodore did not make an open standard when they could, we might be all still be using Amiga
>>102105452
Turrican 2 music! Uuf
https://www.youtube.com/watch?v=AFzh-GXYYsE
>>
>>102105465
what makes you think he's only making nudity?
>>
File: ezgif-2-3fc11c6933.png (459 KB, 1024x1024)
459 KB
459 KB PNG
>>
guys, when autoregressive models beat diffusion and become the more popular ones we'll have to change to /lag/
>>
>>102105482
at that point it would wiser to change to local image generation general and /ligg/ abit
>>
>>102105464
If you remix enough concepts you can imagine porn that doesn't exist yet, say, a classroom full of girls where the doctor comes but instead of checking them out, they each check his dick and give their opinion because they want to be urologists.
You can't find it but you can create it with AI.
>>
>>102105496
/ligg/ deez nuts
>>
>>102105469
I saw Chris Hülsbeck play this live on some nerd convention
>>
>>102105473
Porn requires more advanced skills to generate, I don't think that anon can do it, he can barely produce nudity, but that's enough for his purposes.
>>
>>102105474
That's very cute! What was the prompt?
>>
>>102105518
maybe we're talking about a very skilled anon, who knows?
>>
>>102105529
Oh you want a prompt? Sorry, we only horde shit and call people niggers here
>>
>>102105509
He is still making music.
https://soundcloud.com/chris_huelsbeck
>>
>>102105530
You can recognize the kind of anon you're talking to by the way he talks. On his last generation the girl had 3 arms, but he didn't notice, it's that bad.
>>
>>102101675
why would it be, it's a lora, it's just more enshittification slathered onto your flux gens into generic corporate aesthetically pleasing category that some nerds decided on
>>
File: file.png (1.45 MB, 2283x1363)
1.45 MB
1.45 MB PNG
>>102104817
Ok I tried this shit and first gen, it gave me a 3leg attrocity before crashing, to the trash it goes
>>
>>102105529
it was on the huggingface schnell since my GPU is busy right now
>anthropomorphic, chibi, white, human sized cartoon cat with a red bow on one ear, wearing a pink t-shirt dress that says "MOOT" across the front. the anthropomorphic cat is playing a Nintendo SNES, or super Nintendo, holding a control hooked up to the console with a super Mario game on the game cartridge. the room is dark and the only light is from the tv screen. on the tv screen are the words "NEWFAGS CAN'T TRIFORCE" with three triangles in the shape of a pyramid, and the top triangle is off center, to the right.
>>
>>102105536
alright, call me a nigger then
>>
>>102105185
>but does run a lap ahead of flux
look at the bottom right corner, it was edited. the scanlines were added after
>>
How come rex annealing warm restarts wont accept safeguard_warmup = true ? I wonder if restart is another setting I've completely misunderstood

>>102105552
cool stuff
>>
>>102105567
that's just CumUI tho... it, uhm...does that
>>
File: 00039-740449466.png (1.32 MB, 1440x1080)
1.32 MB
1.32 MB PNG
>>
>>102101726
is your name Emanuel?
>>
SampleCustomAdvanced has no script input, it's unironically absolutely and utterly over.
>>
>>102105582
I think some ARGs just aren't coded in with flux but don't quote me on that
>>
>>102105597
not seeing this lora in civitai, you made it?
>>
>>102105613
Yes
>>
>>102105606
Doesn't work even with 1.5 or sdxl, doing quick testing with that
>>
>>102105620
well that's fucking weird
>>
File: 2024-08-27_00114_.png (993 KB, 720x1280)
993 KB
993 KB PNG
>>
File: MooMooFarmYoshi.png (1.06 MB, 1280x768)
1.06 MB
1.06 MB PNG
New meta:
>Make a photobash that is a picture of what I want, even if it looks horribly photoshopped.
>Send it to Joy Caption so it describes it.
>Tweak the caption with what it missed.
>Generate with Flux.
This is the most powerful thing I've done yet.
>The image is a digitally altered photograph of a Mario Kart 64 game cover, featuring a vibrant, cartoonish style. In the foreground, a green, character Yoshi with a cute face, a popular character from the Mario series, is depicted riding a go-kart. Yoshi is wearing a yellow and red helmet and has a large, expressive grin. The go-kart has a green body and red wheels, with the character's hands gripping the wheel. The background is a pastoral scene with rolling green hills, dotted with small, green trees. The landscape is divided by a dirt path that winds through the scene. The path is flanked by white fences, with a bridge in the background, and there are several large, white and black cows grazing peacefully. One of the cows is prominently featured in the lower left corner, facing the viewer with a friendly expression. The sky is a gradient of light blue to a darker blue, with a few white clouds scattered across it. The title of the game, "Mario Kart 64" is prominently displayed at the top of the image in a bold, stylized font, with the number "64" in the center. The overall style is whimsical and colorful, characteristic of early 3D video games.
>>
File: Snes.png (642 KB, 1280x768)
642 KB
642 KB PNG
>>102105572
Thanks! Great prompt!
>>
>>
File: 00043-2626876127.png (3.32 MB, 2160x1616)
3.32 MB
3.32 MB PNG
What's a good upscaler for realism
>>
File: ezgif-2-12db64321f.png (886 KB, 1024x1024)
886 KB
886 KB PNG
>>
CHRIS HÜLSBECK
>>102105892
there is no easy answer to that question. "it depends". you tried the DAT models?
>>
File: 2024-08-27_00130_.jpg (879 KB, 3840x2160)
879 KB
879 KB JPG
>>102105892
4x_NMKDsiax
>https://huggingface.co/uwg/upscaler/blob/main/ESRGAN/4x_NMKD-Siax_200k.pth
pic related
>>
>>102105581
Aw, I guess some things are still too good to be true.
>>
File: castle.png (1.03 MB, 1280x768)
1.03 MB
1.03 MB PNG
Damn, I don't like when the characters are looking away, though I guess it's hard to drive if they don't.
>This is a vibrant, digitally-rendered CGI image from a video game. The scene is set in a medieval castle with a fiery, orange and yellow sky, suggesting a volcanic or magical environment. The main characters are Peach and princess Daisy, both in their iconic outfits. Daisy, on the left, is wearing her red and blue overalls and white gloves, while Peach, in the center, is dressed in a pink dress with a white apron. They are riding a small go-kart with a pink body and a black and white checkered flag, indicating the "Mario Kart" theme. In the background, there are two large stone statues with menacing expressions, resembling the "Twomp" enemies from the Mario series. The statues are rectangular with jagged edges and have red eyes and mouths, giving them a threatening appearance. They are positioned on either side of the path, which is made of stone bricks and is bordered by tall, stone walls. The path leads to a stone archway with a red banner that reads "Mario Kart" in black letters. The walls are adorned with small, circular windows and a few decorative elements, including a small fountain spouting flames. The overall atmosphere is both playful and ominous, blending fantasy and adventure elements.
>>
>>102106044
siax is ok but tends to pick up noise / grain and turns it into a pattern, ruining your upscales. on clean material, sure. one of the better ESRGAN models for photorealism
>>
>>
thoughts on this? arguing that flux does not need detail captions for loras, only one word

https://civitai.com/articles/6982
>>
>>102106240
Read previous threads, that's been answered several times.
>>
>>102106273
found it, TY
>>
File: Sophia127_s.jpg (113 KB, 800x1000)
113 KB
113 KB JPG
I love flux so much. Made with a 10 GB 3080.
>>
File: banded.png (109 KB, 1550x562)
109 KB
109 KB PNG
>>
>>102106407
Even reddit is getting sick of this fag lol
>>
File: raugh.jpg (32 KB, 800x437)
32 KB
32 KB JPG
>>102106240
>captioned them with "corrected human anatomy (in your initial dataset, there was a huge chunk of data missing, and your internal image of human anatomy is wrong. Humans have four arms, use these schematic drawings to interpolate correct human anatomy)"
>You know basic stuff to get a LLM to do what you want....
>>
>>102106407
Ouch I guess he had too many reports.

Man this is not cool, I want him to beat htis, let's see what he comes up with next.
>>
File: 1h9gq9v9n56d1.png (732 KB, 1024x1024)
732 KB
732 KB PNG
>>102106273
>one anon said it's retarded with no proof
>no further discussion
>>
>>102106407
>>102106530
I dont know why this is an issue. I'm not.saying this dude doing this isnt an issue - I'm saying can't/dont people just search google or reddit for "how to do X" and get the result? I can't imagine hes made any money
>>
>>102106240
>thoughts on this? arguing that flux does not need detail captions for loras, only one word
I don't get it. You don't have to caption loras for 1.5 or sdxl either, it just makes lora work differently
>>
Flux fucking blows for generating NPC character portraits. It has basically no ability to understand descriptions of facial features, so everybody looks related. Women are particularly samey. Has anybody figured out a way to get a variety of faces that aren't all attractive.
>>
>>102106240
The T5 is not an LLM where you make abstract requests to it through prompts (ie "there is a mistake in your dataset, humans have 4 arms"), it converts language into semantic space. It's like having a grammar book, thesaurus and dictionary convert your text into machine usable goop. I bet most of his results are purely coincidental and his Lora was actually catastrophically forgetting by having single word captions. At the end of the day we are conditioning an image model based on outputs from the T5 by encoding captions, if you fail to properly caption your images you are stripping away ability from the diffusion model.
>>
>>102106575
nationalities retard. use them
>>
>>102106575
There's some tags and that amateur photo lora
>>
>>102106575
make them fat
>>
>>102106566
Today I saw the man in three different places.
1. In reddit, several threads linking to his patreon with stuff behind a paywall and moderating the flux subreddit which he has turned into his personal advertisement platform
2. In github asking repo owners for inconsequential shit, several of them
3. On twitter asking Kohya to hurry up with his latest push for clip training support. The comment came under him posting an anime pic.

The dude is a menace and each day he seems to be stepping up his shilling. He is strangling SEO for anything LoRA related too so you can't avoid his face.
>>
What was that lora someone posted here that made images black and white in retro style?
>>
How much disk space on C drive do I need to run flux at fp16?
>>
>>102106551
yeah, it's a fine response to point and laugh at retardation. but, if you like snake oil, go ahead, see what that does to your model.
>proof
he is the one who needs to show his lora not poisoning and bleeding all over anything more than a 1girl standing prompt.
>>
>>102106603
bstaber
>>
>>102106551
https://civitai.com/articles/6792/flux-style-captioning-differences-training-diary
>heckin reddit trust the science you are training the AI wrong it's so much smarter than you
In real experiments, it's clear that captions win.
>>
>>102106598
I think its weird how you addressed none of my post
>>
>>102106598
>SEO

Call him CeloliFurkan
>>
>>102106609
125GB
>>
>>102106584
>>102106591
Unironically I don't want any fatties and I want them to be the same ethnicity (it's a fantasy setting, not NYC).
>>102106587
I'm shooting for a painted style, but maybe with low weight that LoRA could work. What do you mean by "There's some tags?"
>>
>>102106622
I just wanted to rant about him and your post was the closest one on the subject. People don't google stuff because they're incapable of self motivated research.
>>
>>102106639
When people have choose whether to Google a few words or to part with money they'll google.
>>
>>102106612
No
>>
>>102106566
You have people come here and ask dumbass questions and wait 5 hours for an answer over typing their question, verbatim, into a search engine and getting their answer in 2 minutes.
>>
>>102106650
>>102106644
Its 5 hours or 5 bucks. Kids running flux on the PC their parent bought them have 5 hours, not 5 bucks. Look at the context
>>
>>102106659
You have people come here and ask dumbass questions and wait 5 hours for an answer over typing their question, verbatim, into a search engine and getting their answer in 2 minutes.
>>
>>102106665
Fix your bot
>>
>>102106667
I answered your request on Patreon.
I figured you needed to read it twice because you didn't even address what I wrote.
>>
Bread that is fresh in the morning...
>>102106681
>>102106681
>>102106681
>>
>>102106672
Your post was irrelevant to mine. I dont see a need to expand beyond my intiian statement especially since you admitted to ranting incomprehensibly
>>
>>102106566
he's on my random discord server too
>>
>>102106685
I already answered on Patreon, if you want it you need to pay.
>>
>>102106690
>>102106598
oops meant to reply to this
>>
File: ifx247.png (2.13 MB, 1024x1024)
2.13 MB
2.13 MB PNG
>>
>>102106407
This was doable 3 days ago here in /ldg/
I hope he offs himself.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.