[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (853 KB, 3264x3264)
853 KB
853 KB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102021045

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
I want train a Lora on images of downies. How do I train with flux on 24gb vram?
>>
>>
File: ifx137.png (1.61 MB, 1024x1024)
1.61 MB
1.61 MB PNG
>>102024495
ok
>>
>>102024498
>we have tauren at home
>>
File: 2024-08-22_00298_.jpg (741 KB, 1536x2688)
741 KB
741 KB JPG
>>102024460
thank you baker
>>
>>102024520
the actual model shape and texturing is impressive though
>>
So, who am I going to start a retarded argument with about something stupid today?
>>
AutomaticCFG is a weird animal, why the images are so different for CFG 2 and 3, there's theorically a better shit than that with "Skimmed CFG" but it's not working on Flux unfortunately
https://github.com/Extraltodeus/Skimmed_CFG
>>
File: bComfyUI_107336_.jpg (234 KB, 768x1024)
234 KB
234 KB JPG
>>
>>102024538
gen couple of these, take a blurry photo of your monitor with them on it and post it on /v/ claiming blizz is working on new classic+ expansion kek
>>
>>102024549
flux is failure. all it does it create sd 1.5 tier dogshit slop, only this time you don't need to upscale and inpaint 10 thousand times to fix the fingers. lykon was right, we moved backwards.
>>
File: image.jpg (83 KB, 1536x1024)
83 KB
83 KB JPG
>>102024549
share a retarded opinion to get the ball rolling
>>
>>102024595
Someone already beat me to it >>102024586
>>
>>102024586
>lykon was right, we moved backwards.
we indeed moved backwards, we could lie women on grass on SDXL, but with its next iteration, SD3, it's not possible anymore kek
>>
>>102024549
Can you link me to a one-click GUI that downloads, captions and trains a lora for me on my 8GB AMD card in under an hour? I've searched but I can't find it.
>>
>>102024576
I plan to.
I don't have hopes it will ever get Tauren right.
>>
File: file.png (183 KB, 1082x504)
183 KB
183 KB PNG
World of horror anon here. I have unpacked the entire game for the sprites to make a lora for it

not sure if I should train this on Flux or SDXL

and if I train, what is the best approach to tag all these images and vet them (there are over 4000 sprites)? Do I just use the 1.4 tagger? Thanks in advance
>>
>>102024608
and with flux aesthetically pleasing images are no more. bravo. lykon tried to warn you all.
>>
File: Capture.jpg (1.69 MB, 3593x3993)
1.69 MB
1.69 MB JPG
>>102024549
Now that MJ allows us to use their site API (with 25 free images per week), I realize even more how Flux loves to gives generic slopped images instead of going for the kino
>>
>>102024617
it only learns quickly concepts it already knows, especially the ones that are not properly tagged by the T5. for new concepts, you need more samples for it to work.
>>
>>102024617
based.
>I don't have hopes it will ever get Tauren right.
does flux understand what a minotaur is?
>>
>>102024630
bait
>>
id slam my nose down her asshole like a drinking bird if ya know what i mean
>>
File: file.png (17 KB, 140x105)
17 KB
17 KB PNG
>>102024626
I have no idea, and I can't test stuff because I just started a 4 hour training sesh.
But for Flux I would try joycaption or florence2.
Joy caption described pic related like this:
>This image is a black-and-white digital drawing rendered in a pixelated, manga style. It features a close-up of a young woman with a striking, androgynous appearance. She has a pale complexion, short, straight black hair with blunt bangs that frame her face, and a small, triangular birthmark on her left cheek. Her eyes are heavily lined with dark eyeliner, giving them a dramatic, sultry look. Her lips are full and slightly parted, with a subtle, almost indifferent expression on her face.
>The background is a minimalist, abstract representation of a room, with a grid pattern on the left side and a dark, shadowy area on the right. The overall texture of the image is rough, with visible pixels and a high contrast between light and dark areas, typical of the manga style. The use of black and white, combined with the pixelated texture, adds to the gritty, edgy aesthetic of the artwork. The woman's attire is not visible, but the image focuses entirely on her face and the surrounding minimalist environment.
>>
>>102024460
what model do I need to make anime girls like in the upper left of that pic?

can I do it local?
>>
>>102024648
It know something >>102024498
>>102024647
Only on 3rd epoch so we will see. Maybe it will come together, maybe I need more Tauren in the dataset for V2.
>>
>>102024668
>The woman's attire is not visible
not sure if this works with flux
>>
>>102024644
>Now that MJ allows us to use their site API (with 25 free images per week)
It doesn't say it refreshes every week. It says you have one week to use those 25 gens.
>>
>>102024668
Florence2 said:
>The image is a black and white illustration of a woman's face. The woman appears to be in her late twenties or early thirties, with short, dark hair that is styled in a bob with bangs. She has a serious expression on her face, with her eyes looking off to the side and her lips slightly parted. Her hair is parted in the middle and falls over her shoulders. She is wearing a collared shirt with a collar and a necklace. The background is blurred, but it seems to be a room with a window and a door. The overall mood of the image is somber and contemplative.
>>
>>102024674
Pony and yes.
>>
>>102024691
oh yeah you're right, my b
>>
>>102024694
What should work well is: "World of Horror, horror game art," + <LONG DETAILED CAPTION > + <WDV3 TAGS>
>>
File: Capture.jpg (310 KB, 2405x1400)
310 KB
310 KB JPG
>>102024644
There's more kino on the dpmpp_2s_ancestral sampeler but this shit is SLOOOOOW, maybe it's the ancestral thing that can make it less slopped, damn I wish I could try euler A right now...
>>
>>102024668
I think I will upscale all of the images with nearest neighbor first so that tagging is easier first. Perhaps I'll just dump the image set here after I have cleaned it up so people can also try

Somehow I think SDXL would do a better job at this. But I can be wrong. Gonna try out both anyways
>>
>>102024728
this guy has absolutely no shame or whatsoever, it's insane
>>
>>102024754
The people who make money tend to be low inhibition shameless retards, you can tell he's a narcissist too because all he does is use his own pictures. Gremlin man who stares at walls guy doing videos is worse but I fucking hate both of them.
>>
>>102024734
>Somehow I think SDXL would do a better job at this. But I can be wrong. Gonna try out both anyways
Based. Post results.
>>
>>102024728
https://github.com/ImageOptim/gifski i beg you
>>
>>102024770
Gremlin man?
>>
File: 0.jpg (319 KB, 2048x1024)
319 KB
319 KB JPG
>>
File: FD_00007_.png (1.76 MB, 1024x1536)
1.76 MB
1.76 MB PNG
>>102024770
I mean, I trained a LoRA of myself too. I just don't go showing it to the world.
>>
>>102024788
It was a high quality gif from sharex, but I had to reduce it to 2mb to upload. I'll check that out though
>>
>>102024626
>what is the best approach to tag all these images and vet them (there are over 4000 sprites)
Manually. It's unlikely any of the caption models have game specific information. What's the point in captioning them with generic slop?
>>
>>102024796
He hasn't posted in a while but he likes to do rotoscope videos of him as a goblin slowly turning his head and he's like 50 years old hipster/poser. He looks like he smells drenched with $200 cologne.
>>
hmm
>>
>>102024818
I don't really need any game information tagged though
But I will do that.
>>
>>102024818
The only thing you need to capture from the game is "World of Horror style" in your caption. The goal isn't to recreate game states, the goal should be to make new, novel images in that style.
>>
>>102024644
bro what MJ produced there is absolute slop, it doesnt look like ghibli at all, its a weird invention of genres that dont exist and are completly out of focus
>>
>>102024858
Why not? Surely you want characters and locations tagged.
>>102024859
>style
Then just don't bother captioning them at all.
>>
File: Capture.jpg (320 KB, 3191x689)
320 KB
320 KB JPG
>>102024868
yeah, I never said it followed the prompt well (desu you have to go for MJ niji for that), but at least it's giving you varied styles of outputs, wheras flux not only doesn't listen to the style you asked for, but always wants to go for the most generic slop ever
>>
>>102024888
I know this is difficult for you to understand, but diffusion models don't have meta information.
>>
File: image.jpg (87 KB, 1536x1024)
87 KB
87 KB JPG
here in my garage
>>
>>102024770
>people who make money tend to be low inhibition shameless retards
Great mindset for success, anon. Passionate, autistic introverts are also types who make good money.
>>
>>102024893
>most generic slop ever
with that picture in your reply.. total brainlet detected
>>
File: file.png (1.8 MB, 1440x832)
1.8 MB
1.8 MB PNG
>Miku holding a crystal in the style of Ghibli
Then ask an LLM to turn it into a schizo prompt
>>
>>102024912
I found you anon >>102024549
>>
>>102024905
If you want more control from your lora, wouldn't you want to capture things such as unique expressions, unique clothing etc so that you can bring it up when you want.

Otherwise if the training has someone that often wears glasses or something then it might keep popping up in your gens when you didn't ask for it.
>>
>>102024911
I make money but I'm not shameless. Shameless eventually causes you to crash and burn, it only works if you get lucky and you grab as much money as possible before it comes crashing down. At the end of the day it requires constant grifting and finding new markets to grift in. Unironically this guy will wear out his welcome.
>>
>>102024859
The other anon has a point. If you don't caption, you can "activate the style" simply by attaching the lora. But that would hinder the ability of the model to understand what's going on in the image, and you would not be able to prompt for specific characteristics well. Generic (even automatic) captioning will do. Add "WoH" as a first tag if you don't want to pollute the rest of the model as much.
>>
let's make prompts using comments here and post the results
>>
>>102024911
dunno anon, I want to make money out of something I'm proud of, and out of something that will really help people, he's just producing slop and is just taking the works of thers and pretend he got those results by himselfs, he's the complete version of a grifter
>>
>>102024911
>Passionate, autistic introverts are also types who make good money.
they make good money for others: they usually get taken advantage of
for true success you have to be a mix of both, like zucc
>>
>>102024949
he just posted a new reddit thread with helpful information 5 mins ago!

What did YOU do in this time!?
>>
>>102024958
To not get taken advantage of you simply say no. You only get fucked if you refuse to move on and instead get complacent.
>>
File: 00025-3595585678.png (474 KB, 512x512)
474 KB
474 KB PNG
>>102024933
>I make money but I'm not shameless. Shameless eventually causes you to crash and burn, it only works if you get lucky and you grab as much money as possible before it comes crashing down. At the end of the day it requires constant grifting and finding new markets to grift in. Unironically this guy will wear out his welcome.

used this as a prompt
>>
>>102024644
If I had to choose which one is more closer to "retro aesthetic ghibli" I'd pick the bottom one.
>>
>>102024983
>you simply say no.
easier said than done
most people would rather be a small part of a large thing than a large part of a small thing. zuck turned down 1 billion for facebook. would you have?
>>
kek, I love JoyCaption, this is what it says MJ-loveAnons picture
>This is a vibrant, digital anime-style illustration depicting a young woman with long, flowing teal hair styled into twin ponytails, adorned with delicate white flowers. She has large, expressive blue eyes and a fair complexion. She is dressed in a white, sleeveless dress with intricate lace details, which gives her a delicate and ethereal appearance.
>The woman is holding a large, sparkling crystal in her hands, which reflects light in a mesmerizing way, adding to the magical atmosphere of the scene. She gazes at the crystal with a mix of awe and curiosity, her expression filled with wonder.
>The background is a lush, colorful garden filled with various flowers in shades of pink, purple, and yellow. The petals and leaves are vividly detailed, with soft, almost glowing hues. The garden is bathed in a soft, ethereal light that creates a dreamlike ambiance.
>The overall color palette is dominated by pastel shades of blue, pink, and white, with a hint of green and yellow from the foliage. The image has a soft, almost dreamy texture, with a focus on delicate details and a sense of tranquility and magic.
>>
>>102024958
That sounds like a copout from someone who would like to make money, but doesn't. So you turn your poverty into a virtue by saying that only narcissists make money.
I know two examples (and I won't tell you their trades because you will strawman them), of dedicated hard workers (always self-employed) who absolutely make bank and screw nobody over.
>>
>>102025006
whats joy caption?
>>
Junji Ito style. I used Junji Ito as a promot so many times in 1.5 days.

https://civitai.com/models/672535/junji-ito-style-f1d?modelVersionId=752866
>>
>>102024992
soul
>>
>>102025000
My controversial opinion is Zuck should have and now he's a prisoner to his job. He's completely locked in now, it'd be unbearable weight and constant scrutiny and fighting others at the top.
>>
>>102025000
>most people would rather be a small part of a large thing than a large part of a small thing
nta but your brain is full of useless maxims somebody else put in there.
I get the impression life is so much larger than you'll ever know
>>102025022
https://huggingface.co/spaces/fancyfeast/joy-caption-pre-alpha
>>
>>102025022
it's an auto tagger which has NSFW built in
>>
>>102024682
Where did you get your dataset?
>>
>>102025023
>The creator of this LoRA has set this version to Early Access and as such it is only available for people who purchase it. This LoRA will be available for free in 2 days, 22 hours, and 30 minutes or once the donation goal is met. If you want to know more, check out our article
Nah go fuck yourself
>>
>>102025054
That's rude, that's not even me! I'm just into Junji Ito, I'll wait until it unlocks.
>>
Why shouldn't we use higher dims and alpha when llms use stuff like 256 dim that's backed up by actual papers and research and flux is basically like a llm. Sure, for a simple face you don't need it, but styles would absolutely benefit from it
>>
>>102025065
You think I'm fucking made of vram
>>
What loader node is used to load the joycaption model?
I tried the joycaption loader from cxh_joy but it wont accept a path and has two options to download either meta lama 3.1 or the bnb version?
>>
File: 00029-4077346702.png (412 KB, 512x512)
412 KB
412 KB PNG
>>102024921
>I found you anon >>102024549

Used this as a prompt
>>
File: image.jpg (93 KB, 1536x1024)
93 KB
93 KB JPG
>>102025032
>fighting others at the top
what's there to fight? he owns over 50% of his company. no one else in big tech has that (the only comparable example is gaben but he's an order of magnitude poorer than zuck)

>>102025013
>dedicated hard workers (always self-employed) who absolutely make bank and screw nobody over
we get it anon, you fuck hookers

>>102025033
>nta but your brain is full of useless maxims somebody else put in there.
>I get the impression life is so much larger than you'll ever know
you could've just said "i disagree with that statement" and given a reason as to why instead of projecting while trying to insult me kek
>>
does Flux still suck at generating rain and making things look wet?
>>
File: 2024-08-22_00320_.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
>>102025006
makes good prompts
>>
>>102025113
thanks Teebs
>>
File: aa.png (2.65 MB, 2069x1933)
2.65 MB
2.65 MB PNG
>>102024644
Wtf MJ that's fucking dissapointing, now I don't feel so bad about Flux kek
>>
>>102025113
Yeah you're naive. It's always you debo with shit takes.
>>
>>102025042
Google.
>>
File: image.jpg (96 KB, 1536x1024)
96 KB
96 KB JPG
>>102025094
>So, who am I going to start a retarded argument with about something stupid today?
>generates a short haired girl
what did flux mean by this?
>>
>>102025113
oh no... can you please go back to your usual delux name file, it was better when I was able to filter your ass out
>>
File: file.jpg (28 KB, 768x768)
28 KB
28 KB JPG
>>
>>102025094
>>102025147
b-based
>>
File: 00030-3626875406.png (1.18 MB, 768x768)
1.18 MB
1.18 MB PNG
>>102025147
It was: I found you anon 102024549

With the >> too but as you can see it added confusion for you.

But still "I found you anon", just made girls, not sure why lol

>>102025026
>soul

Used this as a prompt
>>
Total comfyui retard here. I just set it up for florence2 captions. But how can I get it to display the output text in the workflow instead of having to read it from the terminal??
>>
File: 1721696301607022.jpg (64 KB, 984x984)
64 KB
64 KB JPG
>>
File: grid-0007.jpg (821 KB, 1536x1536)
821 KB
821 KB JPG
>>102025026
>>102025184

this soul prompt is pretty good
>>
File: ComfyUI_05266_.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>102025184
>Used this as a prompt
Used this a prompt
>>
Unintentional horror body anon from yesterday here. I got x3 more varied images and captioned them all with joycaption, and the results are already much better at 500 steps than they were with my previous attempt at 2000:
https://litter.catbox.moe/i5w41s.jpg
>>
File: 2024-08-22_00329_.png (1.36 MB, 768x1344)
1.36 MB
1.36 MB PNG
>>102025184
>>102025202
good stuff
>>
>>102025213
futa bros we back
>>
>>102025209
>So you sice mash?

Yes, yes I do.
>>
File: grid-0008.jpg (952 KB, 1536x1536)
952 KB
952 KB JPG
>>
>>102025280
lmao it's not supposed to be a futa model
Today I'm going up to 3500 steps with 122 images (1 batch), and I hope I will see it converge before the 4 hours are up.
>>
can i run flux dev on 1080ti? dont care if its slow, just dont want it to crash my computer and actually do some nice gens
>>
>>102025195
I figured it out I downloaded a show text custom node
>>
>>102025306
you used a lora right? that looks good
>>
>>102025343
you can even run it on just cpu if you have enough RAM .. it just will take ages
>>
File: 00039-1153933758.png (916 KB, 768x768)
916 KB
916 KB PNG
>>102025228
also good stuff

>>102025280
>futa bros we back

Used this as a prompt
>>
>>102025349
Nope, the prompt was simply: Junji Ito

Now of course it doesn't really look like Juni Ito, need a lora for that, but it's fun to see what happens with random prompts.
>>
https://civitai.com/images/24997852
Impressive
>>
File: 2024-08-22_00332_.png (1.76 MB, 768x1344)
1.76 MB
1.76 MB PNG
>>102025384
it knows more than ppl think .. I thought id need a Stalenhag lora, but it knows it .. (someone actually made one tho on civitai)

I wonder how many loras are pointless and how many are actually needed if you just prompt it correctly (like the Picasso whining the first few days)
>>
>>102025454
>if you just prompt it correctly
you can't make it more correct than saying "Picasso style"
>>
>https://github.com/kohya-ss/sd-scripts/blob/sd3/README.md#flux1-fine-tuning
>Please update PyTorch to 2.4.0. We have tested with torch==2.4.0 and torchvision==0.19.0 with CUDA 12.4.
Ok.
>https://github.com/bmaltais/kohya_ss/tree/sd3-flux.1
>Launching the GUI on Linux:
>gui.sh --listen 127.0.0.1 --server_port 7860 --inbrowser --share
10:39:42-557617 WARNING  Package wrong version: torch 2.4.0+cu124 required 2.1.2+cu118                                                                                                                              
10:39:42-558913 INFO Installing package: torch==2.1.2+cu118 torchvision==0.16.2+cu118 xformers==0.0.23.post1+cu118 --extra-index-url https://download.pytorch.org/whl/cu118

I see.
>>
File: 2024-08-22_00331_.png (1.85 MB, 768x1344)
1.85 MB
1.85 MB PNG
>>102025476
I had this discussion to often the last days .. this time Ill just skipt it.
>>
File: ComfyUI_05272_.png (1.25 MB, 1024x1024)
1.25 MB
1.25 MB PNG
>>
No we went too far!
>>
File: grid-0010.jpg (367 KB, 1536x1536)
367 KB
367 KB JPG
>>102025213
>Unintentional horror body anon from yesterday here. I got x3 more varied images and captioned them all with joycaption, and the results are already much better at 500 steps than they were with my previous attempt at 2000:
>https://litter.catbox.moe/i5w41s.jpg


Used this as a prompt, I have no idea why there is a white cat
>>
File: 2024-08-22_00338_.png (1.1 MB, 768x1344)
1.1 MB
1.1 MB PNG
>>102025505
overbaked you are not furry anymore, you went back to cow
>>
>>102025505
kek I need a mount like that
>>
>>102025454
yeah we should experiment more before getting too serious with loras, for a better idea what gaps to fill in
>>
File: file.png (652 KB, 1724x865)
652 KB
652 KB PNG
750/3500 girls already looking more normal and interesting
chin disappearing
>>102025519
That's interesting. I got white cats out of nowhere yesterday while testing the lora too.
>>
>>102025502
https://www.youtube.com/watch?v=3RMAPFH75AU
>Picasso style
>...
>A painting with geometric shapes and fragmented forms, characteristic of the Cubism era. Bold, vivid colors and abstract elements to break down the features into flat, two-dimensional planes. Multiple perspectives within the same image, emphasizing the flat surface and rejecting traditional techniques of perspective and modeling. The overall composition should reflect the innovative approach to form and structure, blending realism and abstraction.
>>
File: 00007-2237182431_cleanup.png (3.15 MB, 1280x1920)
3.15 MB
3.15 MB PNG
>>
File: fluxtest.jpg (2.91 MB, 5170x1581)
2.91 MB
2.91 MB JPG
some first attempts at making a flux lora for a character.
First 2 images are with the booru tags included, 2nd isn't, all captioned with joycaption.
The prompt this time around was just the list of tags, so it took the anime drawing style from the images without specific prompting for it
>>
Joy Caption will even recognise individual pieces of art and tell you the artist, but style descriptions are so vague that it is not enough to give you a somewhat consistent style (forget about a specific style). Style Loras hardly seem to work, the output is hopefully in the vicinity of the intended style at best and the deleterious effects of Loras on Flux are with a lot of Loras evident too.
For IPAdapter to become helpful with style, advanced control is required, basic control isn't enough.
>>
>>102025544
I didn't use a lora, but that's interest what lora was it? lol
>>
File: 2024-08-22_00341_.png (1.37 MB, 832x1216)
1.37 MB
1.37 MB PNG
>>102025535
I think its fine, even if pointless later, the more loras the better.. also there are always promptlets, for them loras are a great way the easy MJ like experience
>>102025559
qed
>>
>>102025559
kek
>>
>>102025574
Nice boobs and hands
>>
File: money.jpg (1.92 MB, 2048x3072)
1.92 MB
1.92 MB JPG
>That sounds like a copout from someone who would like to make money, but doesn't. So you turn your poverty into a virtue by saying that only narcissists make money. I know two examples (and I won't tell you their trades because you will strawman them), of dedicated hard workers (always self-employed) who absolutely make bank and screw nobody over.
used this as a prompt for all the images, didn't change it in the slightest
>>
>>102025591
I mean, the image you saw is from a lora I'm baking. I got white cats while testing it. You used my post talking about that lora, and got white cats. Coincidence? Yes.
>>
>>102025621
Oh I see...now I'm scared. I always had a feeling these diffusion models were gateways to the superntaural
>>
File: fluxmotorcycle.jpg (2.03 MB, 5170x1581)
2.03 MB
2.03 MB JPG
some obvious cool potential here, will need a lot of work just on workflows, prompts and lora training settings I guess
>>
File: image.jpg (91 KB, 1536x1024)
91 KB
91 KB JPG
>102025127
>102025134
>102025158
>no argument so they pull a name out of the thread schizo hat
icky

>>102025184
>It was: I found you anon 102024549
yeah i thought so but imagining it was the other reply was funnier
>>
File: 00046-3170250176.png (2.78 MB, 1536x1280)
2.78 MB
2.78 MB PNG
>>102025615
Good stuff
>>
File: file.png (479 KB, 652x643)
479 KB
479 KB PNG
>>102025636
>always had a feeling these diffusion models were gateways to the supernatural
Oh man you have no idea.
>>
>>102025610
Thanks
>>
File: -apzb5vnS0KUWuu4sSSxOQ.jpg (467 KB, 1024x1024)
467 KB
467 KB JPG
>>102025559
ideogram 2 output, not bad
>>
File: ComfyUI_05275_.png (1.33 MB, 1024x1024)
1.33 MB
1.33 MB PNG
>>
File: 2024-08-22_00352_.png (989 KB, 1024x1024)
989 KB
989 KB PNG
>>102025636
so this >>102025659
>>
>>102025718
>>102025559
This is because they probably avoided including "Picasso" or any other artist names for styles.
Just "cubism" will probably do it already.
>>
>>102025750
also, I say it again, Picasso did not just paint cubism, and FLUX isn't dumb and knows that
>>
>>102025744
The more I stare at this the more worrying it feels.
>>
>>102025760
>>102025750
where's your amazing cubism flux output?
>>
File: Bruh.jpg (156 KB, 1344x896)
156 KB
156 KB JPG
>>102025760
>Picasso did not just paint cubism
say what nigga?
>>
File: file.png (406 KB, 652x659)
406 KB
406 KB PNG
>>102025771
My lora is training. I'm trying to work on my laptop, but getting distracted thinking about how Thoth reaches out to us through language in so many ways, even in this godless age.
>>
File: 00048-3045364282.png (768 KB, 768x768)
768 KB
768 KB PNG
>>102025643
OK let's try it

>>102024549
>So, who am I going to start a retarded argument with about something stupid today?

Used this as a prompt
>>
What loss value are you looking for when training for Flux? Or is it arbitrary. Mine is currently around 0.4 and still decreasing.
>>
>The Boondocks Style

https://civitai.com/models/672741/the-boondocks-style?modelVersionId=753089

Finally.
>>
>>102025851
>a few seconds ago
buy an ad
>>
>>102025851
when the 3 cherry picked pictures look bad you know you won't have a good time ;'(
>>
File: file.png (399 KB, 1080x1208)
399 KB
399 KB PNG
I have cleaned up and cut the sprite count down to 1300. Time to train...

Don't think WD1.4 tagger is gonna cut it this time. I don't think these models are good at looking at 1-bit images at all bros
>>
File: image.jpg (90 KB, 1536x1024)
90 KB
90 KB JPG
>>102025793
>generates a wife
relatable
>>
>>102025884
what if you used img2img or controlnet to turn them into coloured images then use a tagger on them for more accurate results
>>
>>102025894
I think the horror is just beyond the AI's comprehension
>>
File: cola.jpg (2.77 MB, 1536x4096)
2.77 MB
2.77 MB JPG
>>102025652
I like this, thanks for the idea
>>
>>102025854
Every heard of sort by new?

>>102025878
Not always the case, sometimes lora makers aren't goof at prompting
>>
>>102025884
>don't think these models are good at looking at 1-bit images at all bros
Nonsense! Looking forward to seeing your results, though. Godspeed!
(please share your dataset if you give up :p)
>>
>>102025775
educate yourself
>>
>>102025927
I stole it from discord, but you are welcome

And lol nice images, what's up with the guys in the background
>>
>>102025775
He was a kid prodigy and was active for a very long time. He had a so called blue period before. Cubism, he was already famous then. "A painting by Picasso" could potentially give you very different outputs and you couldn't really say the model doesn't know what its doing.
>>
>>102025775
what that anon said >>102025939 or go and read the archives this was discussed over and over the past weeks but new brainlets stumble in and think they are art experts cause they saw two picasso pictures, this isnt fucking /art history general/
>>
>>102025973
Even if you write "Picasso style, cubism" to specify you want the cubism part of Picasso it won't work on flux
>>
>>102025944
it's the coke mafia
>>
File: ComfyUI_05281_.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>>
Sooooo close
>>
>>102026045
this reminds me of alpha screenshots
>>
>>102026000
Yeah, Flux and >>102025775 could use some Art History lessons.
>>
>>102026045
is your training data all separate subjects? will it learn the size differences at all?
>>
File: 00245-1989532304.png (1.95 MB, 1024x1440)
1.95 MB
1.95 MB PNG
>>
File: ComfyUI_00648_.png (855 KB, 1024x1024)
855 KB
855 KB PNG
>>
>>102026039
>Y2K style cover art with a low poly 3D render of: Hatsune Miku as a sleek, robotic samurai in chrome armor is slicing through waves of pixelated sushi rolls flying through the air. Each slice sends colorful sparks flying. Behind her, a giant koi fish swims through the sky as if it were water, creating ripples of light. Y2K style text at the bottom: "Sushi Master."
I'm a bit frustrated because I got the flying sushi only at CFG 7, that's too high for my taste it fries shit, I hope we'll get SkimmedCFG working on Flux soon, AutomaticCFG just doesn't cut it
https://imgsli.com/MjkwMTE2
>>
>>102026084
I hate this.
>>
>>102026062
It's a style LoRA so yes, but I think the issue is bad tagging on my part. I will run V2 with better tags.
>>
>>102026000
>>102026060
are you braindead? if you literally just write "cubism" in the prompt it spits out cubism .. if you add picasso it wont cause less than a few percent of its picasso info is cubism
>>
>>102026039
O0O
>>
>>102025884
I made a script that will crawl a directory and add joycaption captions for every PNG or JPG it finds:
https://pastebin.com/P0hHYBcZ

This and training with ai-toolkit has given me good results even without reviewing the captions for errors (which do happen).
>>
File: ComfyUI_00651_.png (1.17 MB, 1024x1024)
1.17 MB
1.17 MB PNG
>>
>>102026068
asuka lora is finally a thing?
>>
>>102026110
I was generalising: Flux sucks at art styles even if it can recognise some. I don't care about cubism, and I haven't really tested it, nor do I care to test it now. Give me a Pre-Raphaelite style painting and I'll eat my hat though.
>>
I heard SD3 is a disaster, but have been hearing positive things about flux. Is it tangibly better than SDXL? I'm assuming it's of course a censored model but is it at least good at adhering to the prompt?
>>
why didn't flux include artists names? are they like pussies?
>>
>>102026196
You pompous faggots sure do like to argue over idiotic shit. Fuck off of /g/ with your useless antagonism. Fucking boomer.
>>
>>102026219
>why didn't flux include artists names? are they like pussies?
yes
>>
>>102026172
there is a really shitty one on civitai that works ok with the retro anime lora. going to try and cook one myself tonight
>>
>>102025936
https://files.catbox.moe/gm7lut.zip
I have enlarged them with nearest neighbor and (mostly) deleted the useless sprites and cropped the UI. Have fun!

>>102026134
Based. Trying it now
>>
>>102026238
lol what fags if i did flux i would do artist names and styles and also i would do pussy. what are they going to do sue me? lmao i dont even pay taxes because i have no income. neets should take over ai and deliver the real shit
>>
>>102026256
>lol what fags if i did flux i would do artist names and styles
yeah, seems like only Midjourney had the balls to do that
>>
>>102026256
hilarious, anon, you should become a comedian but keep not paying taxes of course
>>
>>102026270
lmao'ng at americans having to pay taxes for telling jokes

>>102026268
midjourney even did avengers. they went after the big guys in hollywood and they don't care. real alphas
>>
>>102026298
>midjourney even did avengers. they went after the big guys in hollywood and they don't care. real alphas
so did DALL-E 3
>>
>>102026196
yea since Pre-raphaelits were basically romanticist imitating Renaissance it probably will mix .. as many of their works just look plain out like Renaissance paintings.. not their later famous ones like that drowning lady in the lake .. damn that brings back feels, my late mother was a big Pre-Raphaelit fan
>>
File: ComfyUI_00814_.png (1.55 MB, 1024x1024)
1.55 MB
1.55 MB PNG
does this looks good?
>>
>>102026313
but dalle-3 doesn't have Greg Rutkowski because he was crying too loud on twitter
>>
File: FD_00013_.png (391 KB, 512x512)
391 KB
391 KB PNG
>>
>>102026256
anon you can't afford the 8xH100s to train a model, and as always an appeal for "somebody (that isn't me) should do something"
maybe when you put your money on the line you'll understand some of the decisions make to avoid losing it all
>>
>>102026337
looks fried out with the vertical striping. You going for some kind of film grain effect?
>>
>>102026337
you need to clean your toner and print it again
>>
>>102026196
just spout bullshit and then claim that you wish to remain ignorant when proven wrong

kek
>>
>>102026347
>the 8xH100s to train a model
is that enough for a batch size of 4000?
>>
>>102026347
>maybe when you put your money on the line you'll understand some of the decisions make to avoid losing it all
Midjourney is making a shit ton of money out of the artist's tears though
>>
File: ComfyUI_00811_.png (1.43 MB, 1024x1024)
1.43 MB
1.43 MB PNG
>>102026356
i think the trick is to lower cfg so it looks all warped and blurry but good. now if you want to do sharp and not slop then you need real talent
>>
File: FD_00015_.png (1.45 MB, 1024x1024)
1.45 MB
1.45 MB PNG
What do we think? Good enough or should I go for V2?
>>
>>102026367
You should be able to train a 2B model from scratch in like 2-3 months.
>>
started a multi-res lora training last night before I went to sleep....windows update restarted my pc 2 hours later
fml my life
>>
File: ComfyUI_05285_.png (1.14 MB, 1024x1024)
1.14 MB
1.14 MB PNG
>>
>>102026369
Oh is there a local model of Midjourney that lets you generated naked versions of those artists? There's a lot of control when everything is hidden behind a paywall and API.
>>
>>102026347
there's literally no reason at all why flux can't do artist names. it's like people have to reinvent the wheel all over again because they're pussies. stop defending this copyright gay shit
>>
File: 2024-08-22_00370_.png (1.31 MB, 768x1024)
1.31 MB
1.31 MB PNG
>>102026331
nta but its called "Ophelia"

>>102026196
not quite their style, but ateast got the long wavy hair right
>>
File: ComfyUI_00655_.png (2.53 MB, 1920x1080)
2.53 MB
2.53 MB PNG
>>
File: 00063-3461634790.png (1.24 MB, 768x1280)
1.24 MB
1.24 MB PNG
>>
>>102026393
Actually there is a reason, being named a defendant in a copyright lawsuit. But I'm eagerly waiting for your model.
>>
>>102025587
config please
>>
>>102026391
Moving the goalpost I see, we were talking about including the artists on a model, and you seemed to imply that doing that would lead to a bankrupt, guess who's bleeding money, SAI (the ones removing the artists on the dataset) and guess who's making a lot of money, that's right, Midjourney
>>
>>102026399
sick
>>
File: ifx158.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
>>
File: 00062-3834384990.png (2.26 MB, 1024x1440)
2.26 MB
2.26 MB PNG
>>
>>102026399
cool
>>
>>102026373
kek she has the human female face
it's definitely more accurate than just prompting it
>>
>>102026405
>Actually there is a reason, being named a defendant in a copyright lawsuit.
then why Midjourney isn't dead then?
>>
>>102026409
SAI is a failure of a company. Flux is proof you can release a base model that doesn't have artist names. Midjourney can bankroll against lawsuit and they have absolute control over the images they generate.
>>
>>102026405
good luck suing some guy all the way over in germany. sorry but american courts don't have jurisdiction over other countries
>>
File: ComfyUI_00654_.png (1.78 MB, 1024x1024)
1.78 MB
1.78 MB PNG
>>102026399
>>
File: FD_00014_.png (532 KB, 512x768)
532 KB
532 KB PNG
>>
File: ComfyUI_05286_.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
>>
File: 00065-3461634792.png (1.28 MB, 768x1280)
1.28 MB
1.28 MB PNG
>>
File: creep_0_result_resultj.jpg (284 KB, 1365x1008)
284 KB
284 KB JPG
>The image is a digital drawing rendered in a pixelated, halftone dot style, reminiscent of early digital art or halftone printing techniques. The subject is a young man with light skin and short, dark hair, styled neatly. His face displays sharp, angular features and a wide, unsettling grin revealing a row of jagged, uneven teeth. His eyes are narrow and slightly sunken, giving a menacing appearance.

>He is wearing a light-colored, long-sleeved shirt under a V-neck sweater that is primarily gray with subtle white stripes near the collar. The sweater appears worn, with noticeable dark stains or smudges scattered across its surface, particularly around the chest area, adding to the overall grim aesthetic of the illustration.

>The background is entirely black, providing stark contrast and making the subject stand out sharply against the void. The black background and the halftone dot style create a high-contrast, somewhat eerie visual effect.

>There are no additional objects, people, or textures in the background, focusing all attention on the man's unsettling expression and the state of his clothing. The overall mood is dark and unsettling, likely aiming to evoke a sense of unease or horror.


This might turn out to be good, lads
>>
>>102026407
just the ai-toolkit default on 4k steps, desu I was not impressed yet
>>
>>102026424
>>102026399
what the fuck. how?

could you do one like that where they spotted skibidi in space?
>>
>>102026420
Midjourney is combating a lawsuit as we speak at considerable expense.

>>102026423
Okay, they can't do business in America now, let me know how that goes.
>>
File: ComfyUI_00653_.png (1.8 MB, 1024x1024)
1.8 MB
1.8 MB PNG
>>102026424
>>
>>102026421
>Flux is proof you can release a base model that doesn't have artist names.
no it's not, not having artists or celebrities is a huge downfall
>Midjourney can bankroll against lawsuit
Try to guess why? because they made their model interesting by actually having the balls of having the artists in there
>they have absolute control over the images they generate.
they could've choosen the cucked dalle API route, going as safe as possible and not letting the users have the artists, yet they decided to go the based route, you just need some balls, and I think not a lot of people have those, that's why I respect MJ so much
>>
File: GUIDANCE40_IPADAPTER_0.7.png (1.57 MB, 1024x1024)
1.57 MB
1.57 MB PNG
>>102026202
SD3 Medium was a big flop. The only redeeming quality was prompt understanding and you had to deal with an intentionally damaged model because of hard censorship.
Flux is better every aspect, and the advances to make every tool available to consume hardware have been enormous.
There is a single aspect where Flux doesn't surpass SDXL: Style due to artist name recognition.
>>102026235
Style is important, SD1.5 and SDXL had this ability and now it's gone, I'm working to see if there is any way to restore this ability. Maybe a checkpoint can, but for what I've seen Loras aren't good enough. IPAdapter just came out and it also works in that direction, but it isn't good enough
>>
File: ComfyUI_00657_.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>
>>102026456
I don't care anon, you're not making a model and your arguments are obtuse at best. If you really gave a shit you'd rent a Runpod and train the artists in (you won't).
>>
>>102026441
>Midjourney is combating a lawsuit as we speak at considerable expense.
but they're still there is it? and they haven't removed any artists because they got a lawsuit in their ass, they aren't kneeling to the retarded artists scream, they are fighting for freedom and fair use
>>
you basically need a 24gb card to use flux1 at a reasonable speed right? im guessing offloading layers means it will take minutes instead of seconds per image?
>>
>>102026474
it runs in under 8GB of VRAM now
>>
File: 1167752492635494112-SD.png (1.69 MB, 896x1152)
1.69 MB
1.69 MB PNG
>>
>>102026466
>I don't care anon
>>
>>102026478
wait what? how?
>>
>>102026473
Midjourney is an established company that can afford paying a million to fight a lawsuit. Where's your model again? Are you training a base model?
>>
File: FD_00019_.png (957 KB, 1024x1024)
957 KB
957 KB PNG
Trump is too powerful for the LoRA
>>
>>102026483
good
>>
>>102026484
If you care so much and you care so little for civil suits feel free to full finetune dev and make a new base model with all the things you think the model is missing. Go wild.
>>
File: ComfyUI_00658_.png (1.67 MB, 1024x1024)
1.67 MB
1.67 MB PNG
>>
>>102026488
>Midjourney is an established company
try to guess how they managed to be an established company? that's right, by making their model fun, thanks to the artists and celebrities
>>
>>102026460
>cuck thinks his opinion matters
kys cuck
>>
>>102026486
https://huggingface.co/city96/FLUX.1-dev-gguf/tree/main
and if you're a CPUlet/RAMlet that can't run T5 on the CPU
https://huggingface.co/city96/t5-v1_1-xxl-encoder-gguf/tree/main
>>
does comfyui support nesting dynamic prompts by now such as in `{a cute {girl|woman|loli}|an ugly {boy|man|shota}}` yet? If not, why?
>>
>This is a digital artwork featuring a pixelated style reminiscent of early video games or ASCII art. The image depicts a nighttime scene with a stark contrast between light and dark pixels. The upper part of the image showcases a sky filled with a pattern of white and black dots, creating a sense of a starry or cloudy night.

I am actually quite impressed with joy caption
>>
File: 133_81095647_p0.jpg (897 KB, 1600x916)
897 KB
897 KB JPG
what is the best prompt to get flux to do highly detailed, anime inspired art, for example something like this.
Putting "anime" in at all makes it all flat color shit looking
>>
>>102026505
No it's because they started at the same time as SD got popular so they flew under the radar for quite some time. But nice try.
>>
>>102026514
thanks
>>
File: ComfyUI_00659_.png (1.81 MB, 1024x1024)
1.81 MB
1.81 MB PNG
>>
File: ifx186.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>102026424
best one
>>
before I sink another 8 hours into this, what should the guidance scale for a character lora be
>>
File: 00068-2757218954.png (1.97 MB, 832x1216)
1.97 MB
1.97 MB PNG
>>
>>102026523
use a vlm to get what it thinks
that's not "anime inspired"
that's hyper realistic expressive digital art
>>
>>102026493
fucking kek
>>
>>102026243
Thank you for sharing it. My GPU will be occupied for a few more hours, but I think I will give it a try. However, I'm going to trim the transparent pixels before training. Flux should be able to handle it, and I think the empty space would cause unintended trouble. In reality I have no idea, but that's my conjecture right now.

Do report back if you use the script. It worked surprisingly well for porn images, but I don't know if it will work well for everything. I changed the text model from plain llama to hermes, btw. Simply because I didn't want to deal with asking Meta for download permission on Huggingface.
>>
File: NiceTry.png (274 KB, 1676x649)
274 KB
274 KB PNG
>>102026526
>they started at the same time as SD got popular so they flew under the radar for quite some time.
>flew under the radar
>>
File: 00069-2757218955.png (1.66 MB, 832x1216)
1.66 MB
1.66 MB PNG
>>
>>102026557
whos the jew shilling MIDjourney here?
>>
File: ComfyUI_temp_elmvr_00195_.png (3.43 MB, 1776x1296)
3.43 MB
3.43 MB PNG
>>102026488
Every tech company is interested in these 'artist rights lawsuits': Google, X, MS it isn't going to be just Midjourney or SAI or BFL. Big tech has to worry about reputation, but I doubt they will allow the courts to destroy the little guys without putting a word.
>>102026395
I have a long way ahead.
>>
>>102026434
ULTRABASED
>>
>>102026554
The script is good but it can only do jpegs (errors when using png.) no big deal though, I'll just tag with the jpeg and use the pngs for actual training

To showcase how good joy is:

>To the left side, there are various posters and notices affixed to the wall, some partially obscured. One poster features a circular emblem, possibly a logo or symbol. A vending machine is visible on the right side, with buttons and a display panel, labeled "COFFEE" in English.

I can barely see the word and joy can pick it up. Kinda shocked actually. Not entirely sure how it interprets horror elements, though.
>>
>>102026557
Yeah you bad faith faggot, just because something is popular within a niche doesn't mean it's reached public consciousness especially in regards to being a target.
>>
>>102026580
You're so fucking retard, Google, X, and MS want Midjourney to lose, they would love to have massive regulations on licensing AI datasets.
>>
File: ComfyUI_00099_.png (824 KB, 768x768)
824 KB
824 KB PNG
Teach me how to prompt flux properly.
I want to generate images that show body completely, ie shoes must be visible.

When I try something like (from flux example)

>cute anime girl with massive fluffy fennec ears and a big fluffy tail blonde messy long hair blue eyes wearing a maid outfit with a long black gold leaf pattern dress and a white apron feet have black shoes on them mouth open holding a fancy black forest cake with candles on top in the kitchen of an old dark Victorian mansion lit by candlelight with a bright window to the foggy forest and very expensive stuff everywhere


During first steps it's okayish(picrelated 4 steps) but at 20 steps model tends to leave the shoes out after 10 or steps
>>
>>102026441
>Okay, they can't do business in America now, let me know how that goes.

>tfw usa can't gen images of girls on grass
>tfw us gov desperately lobbies for eu and russia to hate each other
>tfw jewish tricks don't work anymore and all the girls on grass are for eu, russia, and china
Heh. Notting personnel.
>>
>>102026602
Sure thing retard, keep pretending that Midjourney is some niche company and that the anti-AI artists are somehow unaware about that software using their artists prompts
>>
>>102026622
increase image height
>>
>>102026622
wide shot
>>
File: 00071-282244568.png (1.92 MB, 896x1152)
1.92 MB
1.92 MB PNG
>>
>>102026531
Keke
>>
>kohya spitting errors
>One trainer spitting errors
Fuck me
>>
>>102026631
now you're moving the goal posts, MJ made a lot of money before they got noticed by people who would want AI regulated, in the process they have quite a lot of money to defend against law suits
last reply, go use Midjourney and go hang out on their Discord
>>
>>102026655
Flux just can't into kissing, it's always an awkward touching of the lips
>>
>>102026661
have you tried reading the errors, anon
>>
File: ComfyUI_00854_.png (1.28 MB, 1024x1024)
1.28 MB
1.28 MB PNG
ok so there maybe a valid reason not to do artists. they can get sued. but why they didn't do pornos? there's no reason for that. it's like they hate money

>>102026625
america is going to be left behind
>>
>>102026594
You're gonna cook a good lora.
>errors when using png
I was able to use PNG. Maybe there is some particularity in these images it can't handle. Since you uploaded them, I will test it later and see if I can improve it.
>>
>>102026670
Kissing lora please
>>
File: Capture.jpg (218 KB, 2758x1487)
218 KB
218 KB JPG
>>102026668
Uh oh...
https://trends.google.com/trends/explore?date=today%205-y&q=Midjourney,Stable%20Diffusion
>>
>>102026677
>america is going to be left behind
Unironically, but in many regards.
The empire moves to and fro throughout history.
>>
>>102026543
joycaption says
This digital artwork depicts a fantasy character in a dramatic, ethereal setting. The central figure is a young woman with pale skin and striking red hair, adorned with intricate, glowing red horns atop her head. She stands in a confident yet slightly defensive posture, with her right hand raised, fingers slightly spread, and her left hand resting by her side. Her attire is an elaborate, fantasy-inspired ensemble featuring a red, sleeveless bodysuit with a high slit revealing a thigh, and a black cape flowing behind her. The bodysuit is adorned with intricate, glowing designs that resemble a mix of feathers and flames, adding to the fantastical aura.

The background is a dark, mystical forest with tall, red tree trunks and a canopy of dark leaves. The scene is illuminated by a red light, creating a surreal, almost otherworldly atmosphere. Snow-like particles float around the character, enhancing the magical feel. The overall color palette is dominated by deep reds, blacks, and blues, with subtle hints of purple and green. The artwork is highly detailed, with a focus on texture and depth, showcasing the artist's mastery in digital medium. The image is signed by the artist "Nixel" and features a watermark linking to their Patreon account.
>>
File: 2024-08-22_00391_.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>
>>102026617
Big tech has created their models by ingesting every piece of media on the Internet. They haven't suppressed the use of protected IPs, meanwhile MJ just aped modern digital styles (styles cannot be copyrighted). A court decission about artist rights is more dangerous to Big Tech because they have money. Regulation will come when they don't need to scrape the Internet for their models if ever, as ChatGPT needs to be aware of the news.
>>
>>102026622
>comfy reaching out to /ldg/ for prompting advice
We're proud of you comfy
>>
>>102026720
king of the vramlets
>>
At 2000 steps, already generating the kind of image I wanted, but still have monstrosities in other cases. Going up to 3500. I'm seeing loss of 0.2, but still occasionally swinging all the way up to 0.6 (I don't really know what I'm doing).
https://litter.catbox.moe/fvk4js.png
>>
>>102026731
What's going to happen is there will be regulation requiring vetting of the images of a dataset for licensing which will necessitate licensing your dataset from someone like LAION for $100k.
>>
>>102026760
When loss swings up that means the model encountered an image/caption it has no idea how to do.
>>
>>102026760
handbra lora?
>>
File: ComfyUI_00112_.png (518 KB, 640x640)
518 KB
518 KB PNG
>>102026641
Thank you, that actually worked! Out of 5 images all 5 have shoes
>>
>>102026762
Yeah that's peanuts for Bigtech, but they don't need it. They can censor very effectively after gen. Why do you think they care? It isn't even an important source of income just an expense looking for a business model that might or might not exist.
Regulation of datasets for moral outrage reasons doesn't interest anybody.
>>
>>102026782
Mmh... Thank you for the information.
>>102026783
I just took three thots I really like and made a softcore dataset.
>>
File: ComfyUI_40120_.png (1.46 MB, 832x1216)
1.46 MB
1.46 MB PNG
>>102025023
>>102025054
Looks good. First time a Junji Ito lora looks convincing imo. First gen btw, a self-portrait.
>https://litter.catbox.moe/tgjx8r.safetensors
>>
>>102026803
It interests Big Tech you retard because it stops pesky small people from making competing models.
>>
File: FD_00031_.png (1.05 MB, 768x1344)
1.05 MB
1.05 MB PNG
>>
>>102026823
Loss is basically the accuracy of a prediction vs ground truth (closer to 0 is more accurate).
>>
>>102026824
This looks actually good.
>>102026782
Does this hurt the quality of the LoRA in general?
>>
>>102026824
Nice lol.It can also do colour like I did earlier, but maybe not as accurate, but still interesting to see what happens.

>>102026404
>>
>>102026846
Loras rape the weights so unlike a full fine tune where everything balances out the Lora will eventually turn the model into a corpse.
>>
>>102026824
>in monochrome jitostyle
I like that trigger phrase. Does it output normal flux-looking images if you don't use it?
>>
File: FD_00032_.png (896 KB, 768x1344)
896 KB
896 KB PNG
>>
>>102026861
this, 100% this, that's why a large finetune that will add more concepts to flux is the priority
>>
File: ComfyUI_00861_.png (1.63 MB, 1024x1024)
1.63 MB
1.63 MB PNG
>>102026710
france is going to take over ai. they don't care about copyright because they don't have hollywood and they are all pedophiles so they don't care about "safety". they already took over llms with claude, it's only a matter of time until they release their image model. their only problem is that they're smug assholes and they're going to make it proprietary to dunk on other nations.
>>
>>102026861
Speaking of finetunes. Is it feasible to fine tune the base model with a 3090?
>>
File: 3312587235.png (1.39 MB, 1152x896)
1.39 MB
1.39 MB PNG
>>
>>102026888
Wait french people are all pedos? I had no idea
>>
>>102026888
checked
2 more weeks
>>
File: FD_00033_.png (918 KB, 768x1344)
918 KB
918 KB PNG
>>
>>102026888
>they are all pedophiles so they don't care about "safety"
Anyone who has trained a lora on (regular) porn can tell you how eager Flux is to make them too young. I can't imagine the kind of lobotomy DALL-E has had done for example, to avoid that.
>>
Mon ami.....
>>
>>102026891
Kohya allegedly can full fine tune but it's fucking slow and they have zero sampling so it's all trust as bro. Realistically any real full fine tune still requires a massive dataset to prevent catastrophic forgetting. For example, if you wanted to a NSFW finetune you still need to balance it with SFW.
>>
>>102026891
yeah it is
>>
Bread is waiting for your collection right here...
>>102026895
>>102026895
>>102026895
>>
>>102026891
looks possible yeah
https://reddit.com/r/StableDiffusion/comments/1exgkqy/surprised_i_havent_seen_it_mentioned_that_you_can/
>>
File: 2024-08-22_00403_.png (1.42 MB, 1024x1024)
1.42 MB
1.42 MB PNG
>>
>>102026830
Big Tech isn't afraid of Flux or Midjourney. Imagen is a loss leader at best.
>>
>>102026839
>>102026874
>>102026908
love these lmao
>>
>>102024917
The overall mood of the image is somber, solemn and surreal
>>
File: Screenshot_24.png (717 KB, 1499x541)
717 KB
717 KB PNG
Using non-English language is fun. It works for common stuff, but tweak it a little(I added 4ะบ) and suddenly a kitten transforms into whatever the thing is on the left
>>
>>102025065
I use dim 4 for character Loras and 8 for styles.

Reason: I tried bigger and saw no advantage in direct comparisons
>>
>>102026720
the 1% of vramlets acording to steam
>>
File: 0eqbkouyn8kd1.jpg (71 KB, 1054x493)
71 KB
71 KB JPG
It's the 2 year anniversary of SD1.4, happy birthday to yuuuu...
>>
>>102027434
hopefully with flux they'll start releasing better models, competence is really good for the technology
>>
>>102027434
happy birthday SD1.4!
>>
im trying to use flux gguf, for some reason i'm getting only blurry images - i set up VAE and t5xxl, etc. i'm using Swarm UI. has anyone else run into this issue?
>>
>>102028052
new bread, anon >>102026925



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.