[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (1.16 MB, 3264x3264)
1.16 MB
1.16 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102149564

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
File: file.png (2.63 MB, 1024x1776)
2.63 MB
2.63 MB PNG
>>
File: 2024-08-30_00250_.jpg (1.23 MB, 3840x2160)
1.23 MB
1.23 MB JPG
>>102155129
thank you baker
>>
File: FD_00057_.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>
File: 1700395331554176.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>
What should I use to create logos? I need a logo to use as a YouTube avatar, with the letters T W
>>
Anyone got to use the new version of Forge on Windows? Whatever I try I always end up with an error about missing dependencies in c10_cuda.dll or some other torch dll... And it sucks because the new version of Layer Diffusion, which img2img mode works at last, only works with the new version of Forge, so I can't use it. :(
>>
Did I winner
>>
>>102155077
>can't use vit text encode thing, stuck with clip.i? get
clip l is CLIP ViT-L-14, you're saying you're stuck with CLIP ViT-L-14 and that can't use CLIP ViT-L-14.
fucking techlets
>>
Blessed thread of frenship
>>
>>102155176
Where did the marshmellows come from? They aren't on the box.
>>102155191
Use fluxart.pro for free without needing to set anything up.
>>
File: file.png (2.27 MB, 1024x1776)
2.27 MB
2.27 MB PNG
>>
>>102155216
link doesn't work
>>
>>102155216
the letters I guess
>>
>>102155223
oh it's backwards, fluxart.pro
>>
>>102155223
>>102155216
>>102155239
ffs fluxpro.art
>>
File: FD_00062_.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
>>102155199
Because I like cats. I don't care if they are mind controlling me.
>>
File: 1694337626963768.png (2.34 MB, 1280x1840)
2.34 MB
2.34 MB PNG
>>
>>102155301
They are mind controlling you to make you like cats.
>>
File: 00031-4148520278.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
OMG it's smol Mio!!!!
>>
>>102155206
that is not true, thought. someone said the other day that vit-l-14 was much better than clip.l. and if that was true it would work, being the same thing, wouldn't it, silly?
>>
File: FD_00065_.png (972 KB, 1024x1024)
972 KB
972 KB PNG
>>102155318
That's fine.
Besides you only get it from faeces and my cats shit outside.
Just stop eating cat shit and you'll be fine.
>>
So changing the value of distilled cfg (3.5 by default) is what people talk about when they say lower/increase cfg scale?
>>
File: 2024-08-30_00254_.jpg (2.06 MB, 3840x2160)
2.06 MB
2.06 MB JPG
>>
>>102155199
btw did you know toxiplasmosis makes your face more attractive and you more brave. that's why chads love cats.
>>
>>102155360
no, someone said ViT-L-14-BEST-smooth-GmP was much better (which is wrong, no one can tell them apart consistently in a side by side comparison) than clip.l (clip l IS CLIP ViT-L-14) so they're not "the same thing", retardie techlet anonie.
>>
File: 00046-1542787544.png (1.48 MB, 1024x1024)
1.48 MB
1.48 MB PNG
>>
>>102155448
>ViT-L-14-BEST-smooth-GmP
stop being pedantic. thats the one i meant, dummy
>>
File: 2024-08-30_00258_.jpg (2.3 MB, 3840x2160)
2.3 MB
2.3 MB JPG
>>
File: ComfyUI_01414_.jpg (937 KB, 1728x2304)
937 KB
937 KB JPG
>>
>>102155498
>stop being pedantic
stop being retarded
>>
File: 00013-2876974957.png (297 KB, 471x417)
297 KB
297 KB PNG
I'm training a flux lora and even after 2000 steps it still doesn't really look like the person, and all the images look like a mess with some grids and messed up hands etc

Anyone have any idea wtf is going on?
>>
>>102155570
>Anyone have any idea wtf is going on?
you're doing something wrong
>>
File: file.png (3.03 MB, 1024x1776)
3.03 MB
3.03 MB PNG
>>
>>102155593
I know..... :(
>>
File: 1702892045280380.png (2.57 MB, 1280x1840)
2.57 MB
2.57 MB PNG
>>
>>102155570
This looks like her, but you shouldn't really be doing this anyway.
>>
File: 1707671756654048.png (1.15 MB, 896x1088)
1.15 MB
1.15 MB PNG
getting 1.39s/it on a 3090. to the anon that was getting 1.50s/it on a 4090 you should switch to comfy.
>>
>>102155570
>I'm cooking something and even after 20 minutes it doesn't look like the recipe.
You left out basic information making it impossible to help you, why do you make people drag important details out of you?
>>
File: 2024-08-30_00244_.png (1.54 MB, 1024x1024)
1.54 MB
1.54 MB PNG
>>102155698
anon you read my post wrong >>102154797.. I am getting 1.5it/s .. not s/it
>>
>>102155698
>to the anon that was getting 1.50s/it on a 4090 you should switch to comfy.
NTA but you misread, the 4090 1.5 iterations per second
>>
File: 00041-3130506956.png (2.29 MB, 2352x1344)
2.29 MB
2.29 MB PNG
>>102155698
it is 1.5 it/s not the other way around anon...
>>
>>102155698
>getting 1.39s/it on a 3090
without CFG?
>>
File: 1708274096166494.png (1.32 MB, 896x1088)
1.32 MB
1.32 MB PNG
>>102155537
would you ask a dog to stop barking or a fish to stop swimming?

>>102155728
>>102155729
>>102155773
ok ok guise sorry. it was a indecent mistake
>>
>>102155773
Also Kita the SD1.5!
>>
>>102155709
I'm just wondering if anyone has had the same kind of result, they might know what it means.

>>102155665
What does this even mean?
>>
File: 2024-08-30_00264_.jpg (1.41 MB, 3840x2160)
1.41 MB
1.41 MB JPG
>>102155788
thanks for trying to be helpful, all is fine
>>
File: 1720902289588762.png (1.46 MB, 896x1088)
1.46 MB
1.46 MB PNG
>>102155787
with skimmed cfg
>>
>>102155811
1.39s/it with cfg > 1? that's pretty good
>>
>>
>>102155800
I can't tell you why the thing you're cooking doesn't look like the recipe because you have left out important details. Anyways, last reply, you clearly just want to be coy and obtuse. Your settings are wrong, that's why it looks bad.
>>
File: fs_0044.jpg (160 KB, 1280x752)
160 KB
160 KB JPG
>>
>>102155836
I can't post the link to the settings, but it's exactly the same as here. I doubt that info will help, I'm thinking someone might have had a similar result and the solution has nothing to do with settings but something like a torch version or something.

https://www.reddit.com/r/StableDiffusion/comments/1ezd23b/i_will_train_a_flux_lora_for_you_for_free_3/?sort=new
>>
>>102155874
you sure you didn't fuck up setting the learning rate?
>>
File: 2024-08-30_00266_.jpg (2.52 MB, 2160x3840)
2.52 MB
2.52 MB JPG
>>
File: 1710579298653989.png (958 KB, 1024x1024)
958 KB
958 KB PNG
n64 lora is kinda neat

<lora:n64graphics:1> Miku Hatsune in a dungeon with treasure chests from The Legend of Zelda, in the style of a nintendo64 game.
>>
>>102155881
Yeah I tried the one there and another one that people recommend 1e-4, tried three times now. The OP has a whole bunch of loras made with these settings.

Anyway I'll figure it out.
>>
>>102155934
>Miku Hatsune
>>
Interesting...

It seems that PONY has much more "Character related Loras" rather than "art style related Loras".

Is there a reason for that? I'm still waiting for Loras of fairly popular shows to be created. So odd.
>>
>>102155874
Okay, that wasn't very helpful. Here's how I've done it:
- use https://github.com/ostris/ai-toolkit
- caption 20-40 images doing this format: "<character name>, <style>, <long form caption>, <wdv3 tags>"
- use learning rate between 1e4 and 2e4
- sample every 250 steps with one of your training captions to verify it's learning, get a good result in 2500-5000 steps
>>
>>102155950
>Is there a reason for that?
People want to fuck characters more than they want to fuck styles.
>>
File: ComfyUI_00227_.png (1.38 MB, 896x1088)
1.38 MB
1.38 MB PNG
>>102155821
>Like comfy loading/unloading does when you OOM. Forge does have a memory slider to cut down on this.
it was almost at the max. comfy doesn't unload whenever you generate. right now not even genning, gpu mem is at 18.6 and system ram at 18.6, only when you switch your lora weights and shit...
>>
File: 1717911183707668.png (1023 KB, 1024x1024)
1023 KB
1023 KB PNG
>>102155934
>>
>>102155967
You mean when you don't gen then you don't OOM. Quick dude, get to reddit and post this shit. Everyone must know.
>>
File: kingsucre2.jpg (3.27 MB, 1792x2304)
3.27 MB
3.27 MB JPG
Afternoon
>>
File: 1719219804205609.png (1.22 MB, 896x1088)
1.22 MB
1.22 MB PNG
changed the lora and now it is generating gay boys instead of women, and the gay lora is not even loaded, i checked. clownworld
>>
>>102155952
Thanks, but yeah I can't use that, the one in that thread is for 12gb Vram

The issue is this werid grid noise no matter how many steps

Anyway I'm gonna go figure it out.
>>
>>102156042
yeah 12GB of VRAM is massive cope lmao
>>
File: 1703475436019759.png (1.21 MB, 896x1088)
1.21 MB
1.21 MB PNG
>>102156005
I don't OOM, I never OOM (at least not on comfy)
>>
File: 00325-1952569695.jpg (583 KB, 1536x1536)
583 KB
583 KB JPG
>>102155950
Character loras are super easy to train. Style loras are more complicated
>>
File: ComfyUI_01110_.png (882 KB, 1024x1024)
882 KB
882 KB PNG
>>102155698
>>102155788
>Buttchin
>>
>>102156051
No it's not, plenty of people have done it and I've gone over 2000 steps. So the issue isn't the VRAM.

If you think VRAM is casuing this then you don't understand anything lol
>>
File: Flux_01248_.jpg (366 KB, 1792x1024)
366 KB
366 KB JPG
i posted my Urushihara Satoshi flux loora to civit today, turned out okay but will try to make it better

https://civitai.com/models/7227?modelVersionId=782696
>>
>>102156146
you clearly don't understand how bit precision works
>>
>>102156127
why is there always a schizo who wants to put porn on this blue board? can't go a week without one doing that
>>
>>102156149
oh nice one dude, 90's anime aesthetic is the best!
>>
>>102155952
>caption 20-40 images doing this format: "<character name>, <style>, <long form caption>, <wdv3 tags>"
you dont have enough images and your captions are all wrong and in the wrong format.
>>
>>102156157
You clearly don't understand how hundreds of people have made working loras with 12GB vram.

Anyway this is a waste of time, you can't help at all. See ya
>>
File: ComfyUI_00234_.png (1.24 MB, 896x1088)
1.24 MB
1.24 MB PNG
>>102156121
I knew you would come. they are beautiful which ever little imperfections they may have. it's even better that way.

get educated fool
https://www.youtube.com/watch?v=7xYO-VMZUGo
>>
File: ComfyUI_01111_.png (1.06 MB, 1024x1024)
1.06 MB
1.06 MB PNG
>>102156196
>>
>>102156187
yeah sure buddy, keep training :%)
>>
>>102156149
Damn dude this is great. What kind of dataset are you using?
>>
>>102156218
I will, and I will figure this issue out like I always do.

The reddit thread has a person that has done pleny of 12GB vram loras, and the 12gb lora training is a known thing that everyone is using, I'm sorry that you are slow.

By the way my used Vram when training doesn't even go past 9gb
>>
File: 2024-08-30_00276_.jpg (640 KB, 2160x3840)
640 KB
640 KB JPG
>>
>>102156196
she's cute
>>
>>102156273
I don't have high hopes for you
>>
>>102156160
Angry sdg tranny trying to get this deleted
>>
>>102156260
there's plenty of other nfsw spaces than /hdg/ you could go, why going here, you know what? I'd much prefer you to go on /sdg/, this place is dead anyway, so... :^)
>>
>>102156260
>I got laughed out of /hdg/ for being a promptlet slop poster
>>
>>102156290
All he's doing is forcing the mods come up with a permanent solution to ban evasion. He could just as easily be spamming CP right now.
>>
File: 1709818810619373.png (1.37 MB, 896x1088)
1.37 MB
1.37 MB PNG
>>102156149
please upload png so i can STEAL your prompt

the great thing about AI anime is that it looks better than real anime ever could. its a the product is greater than the sum of it's parts type deal. you should make that pic the lora profile because it's like the best picture i've ever seen
>>
>>102156289
I especially don't have high hopes for you, small brain that can't handle things outside his own bubble. continue being stuck in the past.
>>
File: ComfyUI_01417_.jpg (894 KB, 1728x2304)
894 KB
894 KB JPG
>>
File: fs_0085.jpg (55 KB, 968x728)
55 KB
55 KB JPG
waiting for the morning calls to end so I can get back to my games
>>
File: file.png (21 KB, 100x103)
21 KB
21 KB PNG
>>102156319
>>
>>102156311
>forcing the mods come up with a permanent solution to ban evasion
at some point they're gonna run out of proxies right?
>>
File: 1713393638731528.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
An ingame screenshot of World of Warcraft, the main character is Miku Hatsune. She is wearing tier 2 paladin armor from world of warcraft. HUD and UI visible. The setting is Icecrown from World of Warcraft Wrath of the Lich King.
>>
>>102156341
Depends on how he's doing it and he's circumventing some other filters because his clearly nsfw images are staying up for hours so the mods are clearly asleep and relying on something to detect these images.
>>
>>102156341
he can just use VPNs .. nearly infinite IP addresses and ranges
>>
>>102156339
>what is the theory of evolution
https://www.youtube.com/watch?v=bx4TlfRtrjo
>>
>>102156231
about half is the usual color artbook scans and half screencaps from OVA's he worked on, 1286 images total

its the same set ive been using on SD 1.5 & XL but might try updating it with more recent material
>>
>>102156149
what's the trigger word? there's zero prompt examples on the images aswell, how am I supposed to prompt that?
>>
>>102156361
Most VPN IPS are banned
>>
>>102156431
because if I wanted to look at some hentai I wouldn't be there, what are you trying to achieve, fed? nuke this thread or something?
>>
>>102156431
You are a nuisance and an undesirable. I can't wait until people like you are permanently unable to post here, 4chan will increase in quality 10x.
>>
>>102156431
What interest do you have in derailing these threads?
>>
>>102156431
FUCK YOU
*punches your dick*
FUCK YOU
*punches your dick*
FUCK YOU
*punches your dick*
>>
>>102156409
>1286 images total
Very nice! I'm slowly completing Masamune Shirow dataset among others. Couldn't train his style on 1.5 without looking complete garbage.
>>
>>102156149
>585 mb
I appreciate the sentiment but that's too big it's making ComfyUi crash on GGUF's, and I don't want to go back to fp8 to make it work, I won't leave my Q8_0 waifu :(
https://github.com/city96/ComfyUI-GGUF/issues/84
>>
File: 1705333009922679.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
>>102156349
>>
>>102156469
Are you really that retarded kek
>>
>>102156469
probably just an edgy teenager.. should not even be on 4chan
>>
>>102156556
answer my question then
>>
>>102156506
Use Q6_K dingus
>>
>>102156149
YOU made that? I saw it on top downloads, high quality
>>
File: 2024-08-30_00285_.jpg (888 KB, 2160x3840)
888 KB
888 KB JPG
>>
>>102156570
that won't change anything bucko, I already have enough memory on Q8_0, it's only using 17gb out of my 24gb card when loading loras
>>
File: fs_0098.jpg (77 KB, 968x408)
77 KB
77 KB JPG
>>
File: 1722090993322300.webm (239 KB, 1208x1024)
239 KB
239 KB WEBM
>>102156469
I have no desire to derail these threads,
just ignore/hide.
Here have a bernie
>>
>>102156619
that's a high quality bernie
>>
anyone have an inpainting workflow that is as comfy as inpainting in A1111?
>>
File: 00359-1952569696.jpg (452 KB, 2048x1152)
452 KB
452 KB JPG
>>
>>102156713
Nightmare fuel
>>
File: Flux_01238_.jpg (329 KB, 1792x1024)
329 KB
329 KB JPG
>>102156319
>>102156339
kek yea the hands keep fucking up, or they go through the glass

its a boomer prompt:

soft cult anime illustration from japan; from a low exaggerated asymmetric perspective looking upward at a smiling Japanese woman, tilting her head to the side waving while removing her bulky technical flight helmet while she is nestled in the large cockpit of a futuristic fighter jet, prominently at bottom of image. the cockpit is full of retro-futuristic panels illuminating her face and body. Against a backdrop of darkness dotted with lights of runway and city. image uses creamy complimentary color scheme for graphic effect. Large text on aircraft wing reads "URUSHISATO-3" the aircraft is beautifully weathered like a scale model
>>
I don't even get way a Janny doesn't just babysit this thread so he can get some brownie points for all his successful bans. They'll double his pay for being a good worker.
>>
How much porn have you made this week?
>>
Just a reminder that Theres a contact button on the 4chan homepage where you can tell the mods that the Jannies aren't doing their job. Just reported the hentai spammer isnt being death with and what do you know his posts were deleted.
>>
File: 1720425788472227.png (696 KB, 672x672)
696 KB
696 KB PNG
>>102156727
>pay
anon... I..
>>
File: 1722522562591538.png (1.05 MB, 1024x1024)
1.05 MB
1.05 MB PNG
closer to the game artstyle:
>>
>>102156727
>>102156752
Being a janny gets you a free 4chan pass
>>
File: Flux_00008_.png (1.16 MB, 1024x768)
1.16 MB
1.16 MB PNG
>>102156722
wewlads
>>
>>102156727
It's a thankless job, but thank god we have them, or else this thread would turn into absolute shit, we can't let the schizos win
>>
File: 00366-1952569695.jpg (789 KB, 1210x2150)
789 KB
789 KB JPG
>>
File: Flux_01105_.png (1.68 MB, 1024x1024)
1.68 MB
1.68 MB PNG
>>102156426
it has long captions for each image, no specific trigger words

referencing animation, illustration, anime will help. OVA pushes it to look more like screencaps
>>
>>102156945
>it has long captions for each image
are the captions consistent in how they reference recurring details?
>>
>>102156925
more
>>
File: 00369-1952569696.jpg (871 KB, 1232x1859)
871 KB
871 KB JPG
>>102156983
>>
File: 2024-08-30_00291_.jpg (1.61 MB, 2160x3840)
1.61 MB
1.61 MB JPG
>>
>>102156149
dumb question you can't use that lora on pony/autismXL right?
>>
>>102156501
oh hell yea, thatd be a good one for flux, i think it could handle how he mashes all the panels together

the one shirow lora i saw was for XL as a lycoris format,i think whoever did it painted out all the backgrounds by hand to make it train better
>>
Anyone have a decent 12GB vram lora training setup?

I'd rather not pay the turkish guy
>>
>>102157083
nta, but no flux and SDXL are not compatible, but check that lora.. it has a SDXL version
>>
>>102156506
would ~256mb be good?
>>
>>102157112
nah, that bug must be fixed, it's not the fault of your lora, it should handle 500mb loras
>>
File: 106237-tmp.png (3.02 MB, 1536x1824)
3.02 MB
3.02 MB PNG
>>
File: grid-0137.jpg (225 KB, 1744x1312)
225 KB
225 KB JPG
>>102157094
>painted out all the backgrounds by hand
God damn that's lots of work. So far I've just skipped really messy images and cleaned clear ones if there's scanning artifacts etc.
>>
>>102157130
>it's not the fault of your lora
his lora is guilty of being obese regardless of Comfy bugs
>>
>>102157167
Succubus
>>
>>102157170
I prefer his lora to be big and good rather than being forced to be tinier and worse because there's some ComfyUi bugs preventing its loading
>>
>>102157197
big doesn't equal good
>>
>>102156969
i used joycaption & it shits out boomer prompts like this one for every image:

The image is a vibrant, colorful anime-style illustration. The scene depicts two characters, likely from a racing or competition setting, on a high-speed go-kart track. The background is a dark, gradient-filled, and textured representation of a racing track, with subtle shading and dynamic lines suggesting speed and movement.

The characters are the main focus. The first character, a young woman with short, spiky orange hair, is lying on the ground, seemingly injured or exhausted. She's dressed in a blue and white racing suit, with a white glove and a determined expression. Her eyes are wide open, conveying a sense of urgency or distress.

The second character, a young woman with long, purple hair tied in a ponytail, is seated on the go-kart, wearing a red and white racing suit, complete with a white glove and a confident, energetic expression. Her eyes are focused, and her posture suggests a sense of readiness or excitement.

The go-kart, with its sleek, high-tech design, is the central element, with the characters' dynamic poses and expressions capturing the intense, high-stakes atmosphere of a competitive racing scene. The illustration is rendered in a style reminiscent of classic anime and manga, with bold lines, vibrant colors, and dramatic lighting. The overall atmosphere is tense and fast-paced, emphasizing the thrill and urgency of the racing environment. The image is highly detailed, with intricate textures and shading, creating an immersive experience.
>>
>>102157209
that's fair, but there's a reason it's that big, he trained his shit on 1200 pictures
>>
File: Flux_00017_.png (1.11 MB, 1024x768)
1.11 MB
1.11 MB PNG
>DANGER DANGER DANGER WOO WOO WOO COLLISION WOO WOO WOO
women pilots amirite fellas?
>>
Got 8vram (rtx3070)
Want to try out flux, is my card good enough or should I just rent a cloud gpu for a couple hours until I get tired of it?
>>
File: 2024-08-30_00294_.jpg (1.55 MB, 2160x3840)
1.55 MB
1.55 MB JPG
>>
File: 00374-1952569695.jpg (708 KB, 1360x2050)
708 KB
708 KB JPG
>>102157186
yes!
>>
>>102157241
you can probably run a quantized model Q6 or NF4 .. but it won't be much fun
>>
File: ComfyUI_01419_.jpg (907 KB, 1728x2304)
907 KB
907 KB JPG
>>
>>102157247
rad
>>
>>102157263
how many proxies does this motherfucker have?
>>
>>102157217
>a high-speed go-kart track
pffft
>style reminiscent of classic anime and manga
this stuff annoys me, it's not reminiscent, it IS classic anime
>>
>>102157232
>RETARD RETARD RETARD
>PULL UP PULL UP
>>
File: 2024-08-30_00295_.png (1.44 MB, 720x1280)
1.44 MB
1.44 MB PNG
>>102157271
ty, have a free fish
>>
File: 106319-tmp.png (2.52 MB, 1536x1728)
2.52 MB
2.52 MB PNG
>>
File: Flux_00268_.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
>>
>>102157430
>nvm I am retarded
self realization is the first step of healing
>>
File: 2024-08-30_00300_.jpg (1.5 MB, 3840x2160)
1.5 MB
1.5 MB JPG
>>
File: aigrifter5.png (8 KB, 771x97)
8 KB
8 KB PNG
turkish ai grifter just got btfo on reddit
>>
File: Flux_00026_.png (1.06 MB, 1024x768)
1.06 MB
1.06 MB PNG
uchigatana style
>>
File: fs_0160.jpg (63 KB, 1280x640)
63 KB
63 KB JPG
>>
File: 00394-897245490.jpg (585 KB, 1440x960)
585 KB
585 KB JPG
>>
File: 2024-08-30_00301_.png (1.75 MB, 1280x720)
1.75 MB
1.75 MB PNG
>>
>>102155129
how about you locally diffuse some pussy on yo dick
>>
File: 106140-tmp.png (2.8 MB, 1536x1632)
2.8 MB
2.8 MB PNG
>>
>>102155934
>n64graphics
I can't find one with that specific name
>>
File: 1708383842454267.png (666 KB, 1024x1024)
666 KB
666 KB PNG
lineart, monochrome, Miku Hatsune, white background
>>102157630
search n64 then filter by the 2 flux types

https://civitai.com/models/660136
>>
File: 00410-897245491.jpg (430 KB, 1200x1800)
430 KB
430 KB JPG
>>
File: 00085-3542029600.png (1.21 MB, 1352x1080)
1.21 MB
1.21 MB PNG
>>
File: 106329-tmp.png (2.67 MB, 1536x1728)
2.67 MB
2.67 MB PNG
>>
File: 2024-08-30_00304_.jpg (1.02 MB, 3840x2160)
1.02 MB
1.02 MB JPG
>>
using the bnb nf4 v2 model, with 8 gb vram, with forge, i seem unable to use loras, idk why. switching the 'diffusion in low bits' setting, i either get black images, or my pc crashes. ultimate sadness. anyone know of other options for running flux with loras?
>>
>>102157861
>using the bnb nf4 v2 model, with 8 gb vram, with forge
go for Q4_0 anon, it's the same size and better quality
>>
File: 00424-897245489.jpg (366 KB, 1152x1728)
366 KB
366 KB JPG
>>
anyone have a good llm prompt for writing prompts for flux?
>>
>>102157987
I go for bing, chatgpt, claude... they're all good at writing verbose slop, that's what t5 likes
>>
File: 2024-08-30_00312_.jpg (1.53 MB, 6144x1536)
1.53 MB
1.53 MB JPG
>>
>>102158016
damn that's great.. war of the worlds?
>>
File: 2024-08-30_00314_.jpg (1.71 MB, 6144x1536)
1.71 MB
1.71 MB JPG
>>102158108
ya, but in the style of Qi Baishi
>>
File: file.png (1.98 MB, 1024x1024)
1.98 MB
1.98 MB PNG
>>102156149
Really impressive lora dude, the 90's had the best aesthetic that I can agree on
>>
File: ComfyUI_01427_.jpg (734 KB, 1728x2304)
734 KB
734 KB JPG
>>
File: 1701408280528367.png (2.37 MB, 1024x1536)
2.37 MB
2.37 MB PNG
>>
>>102158169
Nvidia Blackwell Troonix III! In the flesh!!!! Wowowowowowow
>>
>>102158228
die in a fire pedo
>>
File: 00453-897245487.jpg (453 KB, 1512x1008)
453 KB
453 KB JPG
>>
are photos of sketches suitable training material?
relative of mine is an artist and i wanna have a go at training it on some of the sketches for fun, only problem is they're on paper and not digital
not sure how lighting, angles and such would affect the training result
>>
File: aseet.jpg (20 KB, 542x375)
20 KB
20 KB JPG
>>102158335
>>
>>102158335
Use scanner if possible, but hq photos can work if you get correct lighting
>>
File: 1695507050618807.png (1.25 MB, 1024x768)
1.25 MB
1.25 MB PNG
>>102158335
dummy. how is the computer going to see the drawings on the paper? that would only work if they were on the computer.
>>
>>102157506
inb4 they still allow his "it's 80% free 10% paywalled!" crap
I wish GitHub repo owners would do something about him. I don't lurk Reddit, but I still end up seeing his retarded ass all over GitHub issues
>>
>>102152844
https://civitai.com/models/262924/june-shoe0nhead-lapine
>>
File: file.png (2.57 MB, 1024x1024)
2.57 MB
2.57 MB PNG
>>102156149
George Costanza's lora is too stronk for your style kek
>>
File: 00286-1275504474.png (1.79 MB, 1024x1440)
1.79 MB
1.79 MB PNG
>severed arm? in my boat? surely you must be mistaken officer
>>
>>102158423
Has anyone tuned an amputee lora?
>>
File: ComfyUI_01430_.jpg (725 KB, 1728x2304)
725 KB
725 KB JPG
>>
>>102158409
prompt?
>>
File: ComfyUI_00986_.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
Some guy asked me about my Aika LoRa:
>"What is the activation word for Forge? I load the Lora in but nothing changes with/without."

I only use ComfyUI so no idea what "activation word" he is talking about.
in comfy you just mention "Aika" and it will output Aika.
do LoRas work differently on forge?
or is the dude retarded?
>>
q8 model is comparable to fp16, and is better than fp8? or is that not quite true
>>
Extremely retarded brainlet question: is it possible to get 1.5 to use t5? like, could I train a massive 1.5 model on relatively low compute requirements and it uses t5 instead of clip?
>>
>>102158529
what you said is correct, Q8 is the closest quant to fp16 in terms of quality
>>
File: hg65546.png (1.36 MB, 1536x1024)
1.36 MB
1.36 MB PNG
preview pics from the civitai flux lora trainer. suppose you should hope i continue to be unable to get loras to work on my pc without errors or crashing
>>
File: ComfyUI_03886_.png (3.36 MB, 1440x1280)
3.36 MB
3.36 MB PNG
>>
>>102158558
I'm interested
>>
>>102158542
it has already been done in a way with ELLA
https://github.com/TencentQQGYLab/ELLA
>>
>>102158461
https://files.catbox.moe/1f3c9y.png

"A photo-based tarot card featuring Shoe0nHead: The Fool, number 0, is depicted as a Shoe0nHead standing at the edge of a cliff, with one foot extended in front of the other, as if she is about to take a step forward. She is dressed in a simple transparent white robe, with red panties and red bra showing, with a white rose in her left hand and a small bag slung over her shoulder. Her right hand is raised in a gesture of innocence and trust.\n\nThe Fool's face is serene and calm, with a gentle smile. She is surrounded by a halo of light, symbolizing her connection to the divine. Her eyes are cast upward, as if she is looking towards the heavens.\n\nAt her feet, a small dog is shown, looking up at her with a curious expression.\n\nIn the background, the sky is a bright blue, with a few white clouds scattered about. The cliff's edge is precipitous, with a great void below, symbolizing the unknown and the infinite possibilities that lie ahead.",

I used llama 70b and asked it to describe in detail The Fool in Golden Dawn. The description is probably not that accurate, but ai descriptions are often pretty nice.
>>
>>102158461
This is the link to the ai
https://huggingface.co/chat/

groq has the same one available in the menu
>>
>>102158572
Pretty cool
>>
>>102158504
really, you never heard of activation words aka trigger words?
in your case "Aika" is the word associated with the subject
you might have seen loras on civitai with stupid trigger words like "sc4rl3tj0h4ns0n" that they used instead of the name
>>
File: file.png (1.93 MB, 1024x1279)
1.93 MB
1.93 MB PNG
>>102158335
When you train it's like telling the model "I want more images like this". That means if you take pictures of a sketchbook and those pictures look like pictures from a sketchbook, that is going to be the output of the model after training. If you want them to look like cleaned up sketches you will have to do edits.
>>
>>102158589
Blessed chinese, thanks
>>
File: 00038-3137794238.png (510 KB, 512x688)
510 KB
510 KB PNG
>>102158670
stop that!
>>
>>102158670
Where do you work, sounds comfy.
>>
>>102158695
Cute maid
>>
>>102158730
not him, but I work from home so technically he's right, it's safe for me
>>
File: ComfyUI_00267_.png (1.32 MB, 1728x576)
1.32 MB
1.32 MB PNG
>>102158597
>>102158626
thanks, man. do you use a llm to do prompts for you? that is very long and I imagine it would be exhausting to prompt that out every time. like you have an image in your head and you describe it briefly to an llm giving some headers here and there and tell it to turn it into a detailed prompt?
>>
File: 000006.jpg (828 KB, 1408x2000)
828 KB
828 KB JPG
gm or guuj goornaj as we say in Columbus
almoost caturday teehee
>>
>>102158777
ok Julien
>>
File: 2024-08-30_00335_.jpg (782 KB, 2496x3648)
782 KB
782 KB JPG
>>
File: 1696737580354570.webm (132 KB, 768x768)
132 KB
132 KB WEBM
ok for real this time, this is a SFW image.
>>
>>102158795
Yeah, I am experimenting with llm descriptions.

long prompts aren't needed to just get tarot cards though.
>>
File: ComfyUI_00904_.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>102158642
Thats what I thought, so why can the guy not figure out how to use it?
is he tarded?
even when you click on the LoRa on civitai it says under Trigger Words "Aika".
>>
File: 00435-897245489.jpg (315 KB, 960x1440)
315 KB
315 KB JPG
>>102158910
is this the final fantasy art style? It looks great
>>
File: 2024-08-30_00337_.jpg (944 KB, 2496x3648)
944 KB
944 KB JPG
>>102158988
yea, Final Fantasy Tactics lora
>https://civitai.com/models/649575/final-fantasy-tactics-for-flux
>>
File: 00431-897245492.jpg (324 KB, 1152x1728)
324 KB
324 KB JPG
>>
File: 106317-tmp.png (2.93 MB, 1536x1824)
2.93 MB
2.93 MB PNG
>>
File: 00039-746139218.png (535 KB, 512x688)
535 KB
535 KB PNG
afternoon, not morning
>>
>>102158978
>Trigger Words
Important keyphrase.
>>
File: ComfyUI_00997_.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
>>102158879
>Buttchin
>>
File: 00433-897245487.jpg (338 KB, 960x1440)
338 KB
338 KB JPG
>>
>>102159071
She got a butt pussy too
>>
File: 1714535625147472.png (2.17 MB, 1024x1024)
2.17 MB
2.17 MB PNG
>>
File: mg3.jpg (559 KB, 1024x1024)
559 KB
559 KB JPG
>>102159030
>https://civitai.com/models/649575/final-fantasy-tactics-for-flux
cool I remember the SD1 one
>>
>>102159071
If you keep that up I'm gonna start generating birds with buttchins
>>
File: c55f.jpg (222 KB, 1024x1024)
222 KB
222 KB JPG
>>
What's this thing I keep reading about prodigy and how it's not good for large datasets? Is it part of some official documentation or just people using terrible settings?
>>
>>102159175
justasplanned.exe
>>
>>102159175
a bird with any type of chin would be interesting
>>
File: xyz_grid-0005-3350072032.png (2.81 MB, 2048x1184)
2.81 MB
2.81 MB PNG
>>102158978
I just tested it, the lora works but I don't think your trigger word does anything.
"Aika" can sometimes spit out random shit. Just using "a woman" gets better results. How did you caption your dataset?
>>
>>102159204
prodigy is shit compared to the goat, AdamW
>>
File: 2024-08-30_00342_.jpg (667 KB, 2496x3648)
667 KB
667 KB JPG
>>
File: fixed.png (506 KB, 664x581)
506 KB
506 KB PNG
>>102159071
Ironically, it's very easy to photoshop out.
>>
>get featured on civit home page
>I sleep
>get pic featured in the /ldg/ collage
>REAL SHIT

>>102159274
Okami style LoRA? I love it.
>>
File: ComfyUI_01112_.png (944 KB, 1024x1024)
944 KB
944 KB PNG
>>102159175
>>
File: FD_00074_.png (1.06 MB, 1024x1024)
1.06 MB
1.06 MB PNG
>>102159175
Do it you coward you won't
>>
File: ComfyUI_00940_.png (1.33 MB, 1024x1024)
1.33 MB
1.33 MB PNG
>>102159234
>How did you caption your dataset?
Put an image of Aika into joy caption, then just replaced words like "woman" with "Aika" and when it says "she wears...." I changed it to "Aike wears...." and then added it to the .txt
>>
File: 1721540485510057.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>
>>102159325
>real lilith fans are all fat
>>
File: ComfyUI_00950_.png (1.6 MB, 1024x1024)
1.6 MB
1.6 MB PNG
>>102159345
*Aika wears
>>
File: 2024-08-30_00347_.png (1.43 MB, 1280x720)
1.43 MB
1.43 MB PNG
>>102159292
yaa.. so many good style loras came out the past days
>https://civitai.com/models/701052/okami-style-f1d?modelVersionId=784410
>>
>>102159349
Are you able to get taking off denim_shorts, with one piece underneath?

idk, I get oddities so far.
>>
File: FD_00076_.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>102159366
Correct, but I have lost 12kg in the last 2 months. I am on a strict diet of eating nothing but your mums pussy.
>>
>>102159259
Both work great
>>
>>102159382
What's okami?
>>
File: 1704012033900153.jpg (23 KB, 655x468)
23 KB
23 KB JPG
>train model on jpegs
>gens are always full of DCT-like artefacts when zoomed in
why do they do this
>>
did Scarlet Witch anon ever share the lora?
>>
File: 2024-08-30_00348_.png (1.44 MB, 1280x720)
1.44 MB
1.44 MB PNG
>>102159404
2006 video game.. kinda a Zelda-like, but with a very distinctive art style and set in ancient Japan
>>
>>102159404
Game from Wii and PS2 where you play as Amaterasu. a wolf goddess, and need to use a magical paint brush to navigate the world and restore life, and it's in a heavily stylized Ukiyo-E art style. It's a really cool game that people would never make today.
>>
>>102159435
Does it have voice, or is it reading heavy?
>>
>>102159382
I need a Far Side lora
>>
File: wut_00027_.png (1.06 MB, 832x1216)
1.06 MB
1.06 MB PNG
>>102159307
>>102159325
I tried. No double chin, but he looks cool.
>>
File: ComfyUI_01114_.png (935 KB, 1024x1024)
935 KB
935 KB PNG
>>102159467
>>
File: 2024-08-30_00350_.png (1.47 MB, 1280x720)
1.47 MB
1.47 MB PNG
>>102159446
no voice, but not that text intensive, there was a HD remaster recently but I haven't played that version.. I played it back on the Wii
>>102159458
I want a Harada/Disgea lora
>>
File: Capture.png (4 KB, 197x84)
4 KB
4 KB PNG
>>102159345
Well, that seems fine. Your lora does work in forge just the trigger work is a little hit or miss. Picrel is a pretty important setting for forge loras I find, maybe that's the tards problem
>>
File: FD_00081_.png (1.03 MB, 1024x1024)
1.03 MB
1.03 MB PNG
>>102159467
He does look cool but I don't think you can get a bird with a butt chin.
>>
>>102159488
I have the HD remaster, it's identical and awesome because it's prettier.
>>
>>102159488
Small screens / canvasses are underappreciated as art targets. They encompass about a foveal area.
>>
>>102155950
>fairly popular shows
Style LoRAs are usually artist styles, because porn.
>>
File: sc.png (94 KB, 1258x497)
94 KB
94 KB PNG
>https://huggingface.co/TheLastBen/The_Hound
holy shiet
>>
File: 1710685968285046.png (1.25 MB, 1024x1024)
1.25 MB
1.25 MB PNG
>>
trying the q4_0, with the automatic fp16 lora low bits setting. at 164.14s/it.. not good. will try smaller image size.
>>
File: xyz_grid-0007-2652050878.png (3.18 MB, 2048x1184)
3.18 MB
3.18 MB PNG
>>102159491
Another example of it clearly not understanding what "Aika" is
>>
>>102159615
I don't understand what Aika is
>>
sugested topic: "anon, we came from the future to make u coom"
>>
>>102159509
Trying a new strat.
>>
>>102158978
holy fuck anon, she's pure coom fuel
>>
>>102159707
>she
>>
File: wut_00028_.png (1.4 MB, 832x1216)
1.4 MB
1.4 MB PNG
>>102159707
More birds in the sky, and fish in the sea lad.
>>
>>102159572
interesting
>>
File: ComfyUI_Flux_0302.jpg (469 KB, 1152x2048)
469 KB
469 KB JPG
>>
>>102159717
definitely a bee.
>>
File: FD_00094_.png (894 KB, 768x1344)
894 KB
894 KB PNG
>>102159654
That's been every day though
>>
>>102159728
Some things ai can do are extremely hard to paint, alternately very hard to photoshop. fwiw a background isn't, and this is an example where I would prefer the ai to be able to deliver layers. I know they aren't made like this yet, but I would like it. This is a similar example, this ideally would be layers:
>>102159723
>>
>>102159572
if you only have to train one layer it should mean lower hardware requirements for training right?
>>
>>102159762
she needs to beg for anal
>>
>>102159572
>>102159776
Also, how does this work in terms of the layers? To what extent can you get crazy with the layers? Can you reorder them? Skip some? Replace some outright with say sdxl? I really have no idea.
>>
>>102159615
how different is "Aika wearing a black dress" from "a woman wearing a black dress"? "wearing a dress" should move the "Aika" vector closer to "a woman" therefore making it trigger but still it clearly messed up
>>
File: ComfyUI_Flux_11640.jpg (229 KB, 672x1504)
229 KB
229 KB JPG
>>
File: 00040-3090140485.png (941 KB, 688x888)
941 KB
941 KB PNG
randomly thought 'why does anything exist' and spooked myself a bit
>>
Is there a way to train a lora for image-text-models? Let's say you trained a lora to generate a new concept, but then you also want the clip interrogator to recognize your new concept when the image is fed to it. Using the existing models, it would only give you generic descriptions but not recognize your specific concept.
>>
>>102159886
Considering we have different custom clip models, I would say yes. But I have never done this so I don't know how.
>>
File: 2024-08-30_00321_.jpg (1.03 MB, 2496x3648)
1.03 MB
1.03 MB JPG
>>102159871
>'why does anything exist'
to evolve the Universe
>>
>>102159846
>not wanting the purest most virgin girl to shily but hornily beg u for anal
weak 2bh pham
>>
>>102159931
What does a clip model do anyway?
>>
>>102159938
please don't engage the thread-hopping schizo
>>
File: FD_00103_.png (1.89 MB, 1024x1024)
1.89 MB
1.89 MB PNG
>>102159952
removes your foreskin
>>
File: wut_00029_.png (793 KB, 832x1216)
793 KB
793 KB PNG
>>102159483
>>102159509
It's hard, it definitely is not a simple matter. It's not consistent, it doesn't want to do it.
>>
File: _0169.png (1.52 MB, 1024x1024)
1.52 MB
1.52 MB PNG
>>102159938
this is pretty. is it with flux? lora?
>>
>>102159846
how is this guy able to keep posting? It cannot just be vpn/proxies?
>>
File: FD_00106_.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>102159981
Jesus christ anon that's terrifying.
>>102159952
Actual answer because I feel bad. It's basically a translation layer between the text input and the image generation output. It turns natural language into visual understanding the image model can use.
>>
File: 2024-08-30_00325_.jpg (860 KB, 2496x3648)
860 KB
860 KB JPG
>>102159952
it reads your prompt and tells the inference of the model what to make.. its a text encoder, FLUX used both CLIP and T5xxl .. both are text encoder. CLIP is very simple, t5 is as powerful as an LLM like chatGPT

>>102159994
flux, Final Fantasy Tactics lora.. see >>102159030
>>
>>102159955
why get butthurt over someone posting in different threads? how sad of a person are you
>>
>>102159769
There's this, it generates an umage with a transparent background:
https://github.com/lllyasviel/LayerDiffuse
It tends to have a bias towards certain poses though.
>>
File: file.jpg (394 KB, 1792x1024)
394 KB
394 KB JPG
fed up of the drama schizo so i'm gonna just post here instead
civit loras are uploading to hf now, ~190k of them
new art set dropping later maybe, waiting for the scrape to finish
>>
>>102160032
it's about sending a message, this nigga spams his corpse bride shit everywhere and goes off topic 99 times out of 100 then posts cp and gets banned xd, although you may know this already nicholas
>>
File: 105442-tmp.png (2.77 MB, 1536x1728)
2.77 MB
2.77 MB PNG
>>
>avatarposts
go away hlky you off topic fag kid
>>
>>102160065
you sound schizo
>>
Piping hot bread straight from the oven...
>>102160057
>>102160057
>>102160057
>>
File: _0233.png (1.08 MB, 1024x1024)
1.08 MB
1.08 MB PNG
>>102160024
oh wow, it looks great. thanks anon
>>
File: xyz_grid-0008-2059044321.png (1.84 MB, 2048x1284)
1.84 MB
1.84 MB PNG
>>102159802
Imagine the handjobs
>>
>>102160111
she could jerk it and blast your prostate with just one hand
>>
>>102160087
okay nicholas doesn't mean I'm not right
>>
>>102160022
Oh I am sorry, did I mention it's a real photo, why do you hate nature? (fingers crossed)

>clip
So basically we had to have Black Forest Labs give us the clip, otherwise Flux would have been only in daydream mode?
>>
>>102160168
even if you are right, i am better than you, in all ways.
>>
>>102156016
would
>>
>>102160024
oooo sheesh, I didn't know t5 is way better, this is extremely important info lmao
>>
>>102160203
homo
>>
>>102160198
better at what?
>>
>>102160071
I'll bite, why does it have vae errors?

>>102160081
Can you get anime to remove jean shorts to reveal a one piece bathing suit? I can't into prompt
t promptlet
>>
File: ifx278.jpg (263 KB, 1024x1024)
263 KB
263 KB JPG
>>
File: ifx279.jpg (226 KB, 1024x1024)
226 KB
226 KB JPG
>>
File: ifx285.jpg (305 KB, 1024x1024)
305 KB
305 KB JPG
>>
File: ix282.jpg (398 KB, 1024x1024)
398 KB
398 KB JPG
>>
Can I commission a person to make a lora?
How much does that usually run? I will provide several thousand photos.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.