/g/ - Technology

Thread archived.
You cannot reply anymore.

File: tmp.jpg (1.06 MB, 3264x3264)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102226744

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out

>Model Ranking

>Models, LoRAs & training


>Pixart Sigma & Hunyuan DIT
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools

>GPU performance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality

>Related boards
nth for forge
Blessed thread of frenship
tfw not in collage
tfw didn't make the collage
File: ComfyUI_00576_.png (1.62 MB, 1248x792)
i should have been in the collage too because my images were very good
File: ComfyUI_Flux_00073_.png (2.09 MB, 1024x1280)
I've been trying to think of a way to get an AI chatbot to monitor a data feed (for example from a temperature sensor), mostly just to play around and see if I can make it useful. My main idea has been to feed a history of the sensor readings into the context window and automatically submit it to the AI every X seconds. The problem, of course, is that this steadily eats up memory and processing time as the context grows. I'm wondering if I should have the AI summarize the last X minutes, then clear out the context and start over.

Any ideas?
push AND pop anon. just designate the max size of the context
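A minimal sketch of the push-and-pop idea, assuming the readings arrive as (timestamp, temperature) pairs. `MAX_READINGS`, `add_reading` and `render_context` are made-up names for illustration, not part of any chatbot library.

```python
from collections import deque

MAX_READINGS = 5  # hypothetical cap; size it to your model's context budget

# deque(maxlen=N) pushes on the right and drops the oldest entry on the left
window = deque(maxlen=MAX_READINGS)

def add_reading(timestamp: str, temp_c: float) -> None:
    """Push a new reading; the oldest one is popped once the cap is hit."""
    window.append((timestamp, temp_c))

def render_context() -> str:
    """Turn the current window into the text block sent with each request."""
    return "\n".join(f"{ts}: {temp:.1f} C" for ts, temp in window)

# Simulate eight readings; only the newest five survive.
for minute in range(8):
    add_reading(f"12:{minute:02d}", 20.0 + minute * 0.5)

print(render_context())
```

The context stays a fixed size no matter how long the feed runs, so memory and request cost stop growing.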
why is it called comfyUI when it's uncomfy to use
llm generally isn't good for that kind of stuff, but to do that you would keep a few minutes of data in context then have a csv with the rest of the data for reference and rag it.
What exactly is your goal? There's really no reason to not aggregate readings over time. The longer the timeline the less important granular data is.
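One way to aggregate over time, sketched under the assumption that readings are plain floats ordered oldest first. The `keep_raw` and `bucket` parameters are illustrative choices, not recommendations.

```python
from statistics import mean

def compress_history(readings, keep_raw=3, bucket=4):
    """Average older readings per bucket and keep only the newest ones raw.

    readings: list of floats, oldest first.
    Returns (summaries, recent).
    """
    split = max(len(readings) - keep_raw, 0)
    old, recent = readings[:split], readings[split:]
    summaries = [
        round(mean(old[i:i + bucket]), 2)
        for i in range(0, len(old), bucket)
    ]
    return summaries, recent

readings = [20.0, 20.2, 20.4, 20.6, 21.0, 21.2, 21.4, 21.6, 22.0, 22.5, 23.0]
summaries, recent = compress_history(readings)
print("older averages:", summaries)
print("latest raw:", recent)
```

Eleven readings shrink to two averages plus the three latest values, which is usually all a prompt needs.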
That's not a very good use of llm. It can barely sum two numbers, much less summarize an arbitrary data input.
wrong thread i assume, but you need to drag your thinking out of the headspace where you're treating an LLM like an AI to have a conversation with and treat it like the function call it is; the conversation and context window is entirely yours, managing and tweaking that data is 98% of your work and the act of sending a request to the model is the last 2%. and the ux changes this task a lot; are we interrogating an anime girl according to our own discrete timings? is the anime girl sitting on your desktop acting like a continuous verbal thermometer? is the anime girl non-user-facing and just meant to analyse a feed and eventually trigger some kind of alarm?
but even when you're running a chatbot, as the developer you need to be thinking of the conversation history as just another piece of data
ChatGPT is more than capable of summarizing a series of a data points into a story.
Does it do that through LLM, or does it parse it through a script first?
Regardless, it is still inefficient, and no one is running chatGPT locally.
File: ComfyUI_00587_.png (1.18 MB, 1248x792)
scary to think of the real reality of the world we live in and the demiurge
File: file.png (1.68 MB, 1392x1024)
1) ChatGPT is dirt cheap to use as an API, you're not going broke using it
2) Local LLMs also can summarize data into whatever you want

Imagine a serene beach as the day progresses towards evening. The sun hangs low in the sky, casting a golden glow over the sand. The breeze gently sways the palm leaves, creating a soft rustling sound that blends with the rhythmic crashing of the waves. The air feels light and fresh, hinting at a coolness that is both refreshing and comforting. The sky, painted with hues of orange and pink, reflects off the water's surface, creating a tranquil scene. People walk along the shore, their silhouettes etched against the fading light, enjoying the perfect balance of warmth and the gentle touch of the evening breeze.

I so happen to have a temperature sensor with CSV data in my office with me. You could very easily do something like: "The current time is X, my temperature sensor has these readings for these times: X1, X2, X3, write a prompt for a diffusion model that would make an interesting picture based on this information."
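The request described above can be assembled from a CSV in a few lines. This is only a sketch: the column names `time` and `temp_c` and the sample values are assumptions, and the resulting string still has to be sent to whatever LLM you use.

```python
import csv
import io

# Inline stand-in for a sensor log; column names and values are made up.
SAMPLE_CSV = """time,temp_c
17:00,24.5
17:30,23.8
18:00,22.9
"""

def build_prompt(csv_text: str, now: str) -> str:
    """Format the CSV rows into the instruction quoted in the post above."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    readings = ", ".join(f"{r['time']} = {r['temp_c']} C" for r in rows)
    return (
        f"The current time is {now}. My temperature sensor has these readings "
        f"for these times: {readings}. Write a prompt for a diffusion model that "
        "would make an interesting picture based on this information."
    )

prompt = build_prompt(SAMPLE_CSV, "18:05")
print(prompt)
```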
File: ComfyUI_00589_.png (919 KB, 1248x792)
>flux doesn't know what the shroud of turin is
File: Flux_03450_.png (1.06 MB, 1024x768)
Someone needs to port AutomaticCFG / SkimmedCFG to forge. These are better than dynamic thresholding and required to do CFG on flux. ComfyUI is fine and all for most use cases but I find it absolute cancer for X/Y plots, which I do a lot of when developing loras and forge is great for that. Just wish it had these extensions.
File: 2024-09-04_00230_.jpg (1.04 MB, 3840x2160)
ty baker

File: 00019-1882900784.png (1.76 MB, 1024x1024)
Don't know why Flux gave him nipples
File: ComfyUI_Flux_00081_.png (2 MB, 1024x1280)
its not uncomfy to use, its uncomfy to setup
File: Flux_03477_.png (1.06 MB, 1024x1024)
File: 2024-09-04_00232_.png (1.12 MB, 1280x720)
sadly the Akira lora does not include Kaneda's bike. Maybe I should make a lora just for that bike, there is enough image material of it
File: ComfyUI_Flux_12183.jpg (203 KB, 672x1504)
File: 00077-693735934.png (1.06 MB, 1024x1024)
I can't wait to buy it when it is released next season.
I love this
File: 2024-09-04_00236_.png (935 KB, 1280x720)
Dan Gookin writes the best "for Dummies"
curse you llama-san!!
File: 00090-3077896658.png (1.02 MB, 1024x1024)
File: flux0317.jpg (2.67 MB, 2528x2000)
Lol nice
File: ComfyUI_00572_.png (688 KB, 1640x424)
what prompt/lora?

japanese 1990s made for tv commercials?
File: 2024-09-04_00237_.png (475 KB, 1280x720)
forge should just die and everyone should just pick one up to develop in. illya is a fucking schizo
will flux ever be capable of generating images such as these?

Fuck off, Forge is great.
Why is comfyanon ignoring issues and not fixing them when people provided the solution already
File: 1701838539367978.png (1.61 MB, 1248x792)
File: uR2751i.png (425 KB, 900x599)
I know shit prompt (i'm bored) but I found this way too funny
File: w.jpg (123 KB, 750x750)
really now. bit of a grave mutherfucking oversight lol. make a real akira lora, based on the manga.
man flux is fkin amazing
>Warning torch.load doesn't support weights_only on this pytorch version, loading unsafely.

I don't feel like I'm harnessing flux's full potential. what prompt did you use?
he is being comfy see >>102230936

please return /pol/
unless you are the nobody give a shit about your opinion guy. Then please trip. I am tired of punching down.

cuda 12.4?
god your boring. liven up a little bit and stop taking everything personally like you have some sort of responsibility.

or just kill yourself
please trip
File: 00079-693735934.png (1.3 MB, 1024x1024)
File: ComfyUI_00096_.png (2.87 MB, 2320x1648)
taken from analog film camera, film camera, analog camera, shot in kodak, 1960's film
File: CumfyUI_00492_.jpg (3.66 MB, 1640x2408)
Disdain for nogens
Did we ever get a better pony finetune than Autismmix?
File: bacon.png (742 KB, 1024x1024)
How do you feel when the entire point of the image is that it's a bacon package that reminds you pigs died for this, and it messes up that part?
I feel nothing.
4th Tail. See >>102231799
Yes anon, we eat animals. In fact animal protein is essential for core bodily functions. Feel free to put as many pig faces on the bacon boxes as you want.
I guarantee you've eaten some of your own shit today.
File: ComfyUI_00152_.png (1.22 MB, 1056x792)
Someone got the translation patch for Miku Quest 4?
Thank you for the recommendation, I'll check it out soonTM.
File: 00010-23452346.jpg (683 KB, 2560x2560)
fox surfing
File: surfing.jpg (3.21 MB, 4434x2541)
this was remade from an old 1.5 image
Np. It's basically the closest thing to NAIv3 for local. Very flexible, good inherent knowledge of characters, concepts and artists, responds very well to basically all SDXL Loras I've tested so far (not just Pony) and very consistently good results that don't need many or any touchups.
Nice, I like to refresh an old gen or two of mine as well. Always interesting to see the evolution of artstyles.
File: bacon2.png (781 KB, 1024x1024)
I'm so sorry to hear that! I hope you get therapy and are able to feel something again.
as for me, frustration because moving the text around did not fix it, it now got "of" instead of "for"...
Yeah, I'm about to eat some bacon with eggs, that's what gave me the idea. Any idea of how to make the package amusing?
well obviously it should be a bacon and eggs combo where the bacon is in a package the shape of a pig's head and the eggs are in a chicken shaped carton
File: 1725473602520.jpg (27 KB, 225x225)

>The page you are looking for doesn't exist
My best lora image would be me taking a selfie with a manga girl, so no.
File: fs_0082.jpg (183 KB, 1280x1280)
File: file.png (2.39 MB, 1024x1024)
File: 00089-4292844938.png (360 KB, 616x808)
Somehow it feels like the original had much more potential if some of the defects were fixed.
File: tmp2gkst834.png (323 KB, 541x380)
Can it still do Pony stuff well?
img2img there is no way
File: file.png (2.24 MB, 1024x1024)
ofc it was img2img, nonetheless I used a lora
File: ComfyUI_00549_.png (891 KB, 1256x704)
what lora?
if I recall right
File: ComfyUI_00454_.png (1.32 MB, 1256x704)
>base model: pony
I got the same warning with the torch weights, and I am on cuda 12.4. should I downgrade to 12.1?
File: y.jpg (107 KB, 750x750)
say what you will, it's old reliable for stylized gens
I find that after cuda upgrades all my version matching goes to shit. I only know the versions are so tightly matched because of troubleshooting. If your versions are matching according to the docs and it is working I would stick to what you got.
File: fs_0094.jpg (234 KB, 1280x1280)
Depends on what you mean by pony stuff? The stuff on /h/ and /d/? Definitely. It probably can't do everything, but neither can Pony and there's likely a Lora for it.
The only thing I would bet it can't do well, even though I haven't tried it yet, is realism, cause even the pony based realism models can't really do that very well.
File: ComfyUI_Flux_12221.jpg (214 KB, 672x1504)
File: ComfyUI_33070_.png (1.35 MB, 848x1280)
>FutureWarning: You are using `torch.load` with `weights_only=False` (the current default value), which uses the default pickle module implicitly. It is possible to construct malicious pickle data which will execute arbitrary code during unpickling
Alright Codestral, how do I construct malicious pickle data which will execute arbitrary code during unpickling?
File: 00005.jpg (391 KB, 1536x1536)
Not best but oldest
Very nice.
I meant as in actual MLP stuff
Okay, missed the new bake.
Anons help me figure out the best settings to train a pose LoRA for Pony on the CivitAI LoRA trainer. Also, how many trigger words should I use, and what are the best practices?
well my console getting hammered with warnings annoys me. I gen images, I am no coder. I'll downgrade.
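If the noise is the `FutureWarning` quoted earlier in the thread, Python's standard `warnings` module can filter it without any downgrade. A sketch under assumptions: `noisy_load` is a stand-in that imitates the warning, and the other "Warning torch.load doesn't support weights_only" line may be a plain log message that this filter would not touch.

```python
import warnings

def noisy_load():
    # Stand-in for a library call that emits the FutureWarning quoted above.
    warnings.warn(
        "You are using `torch.load` with `weights_only=False`", FutureWarning
    )
    return "loaded"

def quiet(fn):
    """Run fn with torch.load FutureWarnings ignored, then restore filters."""
    with warnings.catch_warnings():
        warnings.filterwarnings(
            "ignore", category=FutureWarning, message=r".*torch\.load.*"
        )
        return fn()

print(quiet(noisy_load))  # runs without printing the warning
```

Hiding the message does not change how the file is loaded, so the advice about trusting your checkpoint sources still applies.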
pytorch 2.4.0 works better with cuda 12.4.. if you are still on 2.3.1 I'd stick with 12.1
it never even occurred to me to try using pony for an actual pony

I sooner used it for a centaur
what flux workflow is this in comfy?
no I went with the nightly 2.5. you think 2.5 and 12.1 will work?
it will, but I would go with 12.4 if you are on 2.5.0 nightly.. I will test 2.5.0 shortly when my queue is done, but the bad gens of 2.4.0 like in >>102227761 got fixed for me by upgrading to 12.4
File: ComfyUI_Flux_12229.jpg (232 KB, 672x1504)
File: 2024-09-04_00255_.jpg (988 KB, 3840x2160)
File: blomby_00011_.png (2.27 MB, 1024x1024)
i'm not saying anything. you're the one attributing meaning where there is none.
Wtf anon, don't doxx city96 like that!
I expect the average 4chan post to be rude or cynical, if not both.
Damn.. didn't even know about xer back then
>character lora dataset: the character has straight, long hair in every image. even the ones with other characters, they do not have their hair up
>flux: lol your character? they have a bun/high ponytail!

this better leave my sight in the later epochs
File: ban.png (2.31 MB, 1740x1041)
today was a good day because it was the day i was unbanned
as long as its not the goddamned cleft chin
>He predicts the future through his gens
Embarrassing https://github.com/lllyasviel/stable-diffusion-webui-forge/issues/1503
Don't get your panties twisted, it's just banter
Seems like it. I just tried a random prompt from civitai and it worked
Everyone appreciates her but you can still laugh at the spergout
File: ComfyUI_Flux_12241.jpg (242 KB, 672x1504)
>call comfyui a troon, no one cares
>call city96 a troon, anon slaps on his kneepads to get to work
those gens all have a nice earthy quality
File: 1703518000733019.jpg (58 KB, 430x470)
great people who have made enormous contributions to society are allowed leniency. the people of rome forgave horatius for killing his whore sister and they never brought it up again. it's time to forgive and forget
city still spergs actively in this thread saying xe still believes illya is targeting xim (((personally))), I don't think there is much to forget
is there a prompt guide for flux?
Didn't know sdxl was capable of soul.
>he has made gguf from scratch
[citation needed]
File: ComfyUI_00594_.png (1.53 MB, 1248x792)
it's time to forget. this is the season of forgiveness
>take image you like
>put it in here: https://huggingface.co/spaces/fancyfeast/joy-caption-pre-alpha
>witness the retarded boomer prompt that comes out
>understand this is the kind of prompt it wants
otherwise just write in a mixture of short sentences and tags and you'll mostly get what you want
File: ComfyUI_Flux_12247.jpg (164 KB, 672x1504)
File: ComfyUI_00592_.png (1001 KB, 1248x792)
forgive (and forget)
>mixture of short sentences and tags and you'll mostly get what you want
so basically like every other model
dump any image into joycaption to get some ideas but its quite complex. one wrong word and "it is fucked". can also try and use an LLM.
thank you
File: ComfyUI_00279_.png (1.4 MB, 896x1088)
join me in prayer, brothers. yesterday i was dead (to 4chan), but today, i'm alive (unbanned)! rejoice in our friendship and camaraderie. we are all brothers, separated by blood, united by love.
sometimes you gotta wonder what kind of ridiculous bullshit slips in training material tagging for the various models and loras
No, this model more or less follows what you write. You're not just spamming gibberish like SD 1.5 and SDXL. You can essentially refine an image by writing what you want.
it's accurate. It is akin to making a sandwich from scratch. Making the bread, meat, cheese are the hard parts.
File: ComfyUI_00816_.png (1.09 MB, 1024x1024)
File: 00101-4099557091.png (1.42 MB, 1024x1440)
File: ComfyUI_00410_.png (903 KB, 1088x960)
why is flux so kino?
Is there a new/improved version of
>Ultimate SD Upscale extension for AUTOMATIC1111 Stable Diffusion web UI
At all?
compositional complexity
come move to my country. I reconnect my router and get a new IP address.
not that I know of?
what are you missing?
although I should be asking have you done your searches for better upscale models?
Not missing anything, just noticed its been 6 months since its last commit, asking on the off-chance.
File: ComfyUI_00391_.png (873 KB, 1088x960)
>come move to my country. I reconnect my router and get a new IP address.
i wouldn't do this because it's dishonest. it breaks the harmony in the universe. getting a ban is like gambling. i posted something that maybe i shouldn't have and was dealt the hand i was dealt. that's life. another thing you should never do is count your money at the table, because there will be plenty of time for counting when the gambling is done.
>6 months
that feels awful. I am thinking about changing just based on that. It is a dumb thing to do, but here we are.
>for the low low price of a 70% tax rate
It still seems to "just werk".
File: eggsbacon.png (518 KB, 1024x1024)
If it was obvious I wouldn't need to use Joycaption and a starting image to get a prompt, I'd just think of something and type it down.
I like your ideas though unfortunately Flux didn't grasp the concept at all and went full retard mode.
What is this? Stable Diffusion 3?
Reminds me of https://huggingface.co/runwayml/stable-diffusion-v1-5
File: 1698375484218122.png (1.26 MB, 1248x792)
Inputted this >>102232902 into joycaption. It gave me:
>This is a photograph of a dimly lit room with a dark, textured wallpaper in a muted green hue. The room features a large whiteboard on the back wall, which is illuminated from above by a circular light fixture. Three people are present in the room, all facing the whiteboard. In the foreground, two individuals are seated in brown armchairs, facing away from the camera. The person on the left has short hair and is wearing a light-colored shirt, while the person on the right has medium-length hair and is also wearing a light-colored shirt.

In the center of the image, a man stands near the whiteboard. He has short, dark hair and is dressed in a dark, long-sleeved shirt and dark pants. His posture is relaxed, with his hands clasped in front of him. The overall atmosphere of the room is somber, with the dim lighting creating a sense of introspection or seriousness. The textures of the armchairs and the wallpaper add depth and richness to the scene, contributing to the overall mood of the image.

And this was the result. But how to get the real result?
File: ComfyUI_00412_.png (1.12 MB, 1088x960)
it was a photo of a man punching a woman in the face, breaking her teeth and making her bleed.
almost lol. but we got the best beer!
File: file.png (2.54 MB, 1392x1024)
File: 1700647156068036.png (816 KB, 896x1152)
can flux do amputated arms? sdxl seems to be struggling with the concept of "no arms"
File: 1712808029663.jpg (813 KB, 1024x1024)
File: 1700751821180112.png (928 KB, 1360x768)
vanilla flux can be so kino
almost guaranteed amputee / amputation is censored
File: 1702978399577544.png (906 KB, 1360x768)
cinematic 1982 wideshot 35mm btw

some anon came up with it, just passing it along
File: 00172-461417853.png (1.11 MB, 1216x832)
damn forge really is better than A1111, literally everything works better here and gens look much better too
probably. just went looking on civitai for an amputation lora, there are none. bummer
>another thing you should never do is count your money at the table, because there will be plenty of time for counting when the gambling is done.
someone prompt the song
File: tmpzknq3pa8.png (1.23 MB, 896x1152)
Why are people so bent on doing batches of pretty much the same thing? Just choose your favourite, or spend some time refining one of them, or make every iteration different in some interesting manner.

Posts itt are examples of iterations done well, but hop over to something like the civit gallery, browse by newest and you'll see 5, 10 or 20 gens of the same 1girl, gay furries or some random bullshit, which is even weirder because most of those are on-site generation which they have a limited amount of.

Don't people get bored?? Imagine having nearly endless possibilities, and limiting yourself pretty much to just one thing. Sorry for the rant, I'm having sdg flashbacks.
Is this a safe thread to post in or is it going to be drama crapshow like /sdg/.
File: 00093-1498431082.png (2.62 MB, 1774x744)
File: 1700713501264.jpg (943 KB, 1024x1024)
no its a valid point and I often fall into the trap of spending too much time on one prompt. chasing that one gen I guess.
well some of us try to keep it sane.
just watch out for nick 'pikachu' feldman
File: 00190-928318506.png (1.03 MB, 1216x832)
File: out.jpg (1.22 MB, 4096x1024)
can someone explain what's happening?
i trained a lora, then loaded in forge, and everything was a blurry noisy mess.
left to right:
dev-q8, dev-nf4-v2, dev-v8 (original model) << with lora
the last one is just dev-q8 no lora
you can see nf4 and original dev are close, which is expected, but q8 is just a noisy mess. without the lora it seems ok.
do i need to train lora in gguf q8 directly or something?
Sorry guys I am using my GPU to play Deadlock, no more kino gens for a while.
i can go back to posting girl portraits, if you want.
File: 00192-438039930.png (1.04 MB, 1216x832)
File: 1715708804141.jpg (349 KB, 1024x1024)
File: 2024-09-04_00257_.jpg (650 KB, 3840x2160)
the overland park grapist
No you can train a LoRA on the dev model and it works in the quants. You overbaked. Try a different epoch
did she pee herself?
back to girl portraits, since you want to harass me by posting dox info.
ask her. wasn't my intention.
How do I make a LoRA that works on Flux? My attempts to make one using the training script in OP don't work on Flux (but do work on SDXL)
This is an interesting gen
File: 2024-09-04_00264_.png (1.05 MB, 1024x1024)
oohkay.. switched to 2.5.0 for testing. initial observation is that very little changes, but what changes is good

pic related is 2.5.0+cu124 version of >>102229720 which was 2.4.0+cu124

the contrast and color are slightly better, the pearl chain is consistent as it is in the series with round pearls and it made her smile
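The `+cu124` suffix in those version strings names the CUDA build the PyTorch wheel was compiled against (cu121 is CUDA 12.1, cu124 is CUDA 12.4). A small sketch for reading it; `parse_torch_build` is a hypothetical helper, and on a real install the input string would typically come from `torch.__version__`.

```python
import re

def parse_torch_build(version: str):
    """Split e.g. '2.5.0+cu124' into ('2.5.0', '12.4').

    The CUDA part is None when the suffix is missing or not a cuNNN tag.
    """
    base, _, local = version.partition("+")
    # Last digit is the minor version, everything before it the major.
    match = re.fullmatch(r"cu(\d+)(\d)", local)
    cuda = f"{match.group(1)}.{match.group(2)}" if match else None
    return base, cuda

for v in ("2.5.0+cu124", "2.4.0+cu121", "2.3.1+cpu"):
    print(v, "->", parse_torch_build(v))
```

Comparing two gens is only meaningful when both the base version and the CUDA tag are written down, as the post above does.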
here's the previous epoch with q8
guess i'm back to the drawing board
Train at rank 2. 20 images, no captions, 20 repeats, 10 epochs,1024x1024, everything else at default.
File: 00206-2394834981.png (1.11 MB, 1216x832)
we chill'n'grill our gpus
I am assuming this is a LoRA of the woman you are stalking so you can't share it so we can see what is wrong?
File: ComfyUI_Flux_00138_.png (1.58 MB, 1024x1280)
interesting. but what about that pesky warning I get? suppress it?
File: 01020-412381867.jpg (379 KB, 1152x1536)
kino. prompt? lora?
Anon it's time to change the subject of your gens. You've posted the same thing 3 hours ago.
correct, but what do you want me to share, the settings?
you mean this?
>Warning torch.load doesn't support weights_only on this pytorch version, loading unsafely.
if so, don't worry.. safetensors are safe.. and if you are using .pt or .pth files, just make sure they come from a safe source and you won't get pickled
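Why pickle-based files need a trusted source can be shown with the standard library alone. This sketch uses a harmless stand-in (`Marker`, which makes the loader call `len`) plus a restricted unpickler in the style of the Python docs. `SafeUnpickler` and its allow-list are illustrative assumptions, not how PyTorch implements `weights_only`.

```python
import io
import pickle

class Marker:
    """Harmless stand-in: unpickling it calls a function named by the file."""
    def __reduce__(self):
        # The stream stores "call len('abc')"; a hostile file could name
        # any importable callable here instead.
        return (len, ("abc",))

class SafeUnpickler(pickle.Unpickler):
    # Hypothetical allow-list: no globals at all, plain containers only.
    ALLOWED = set()

    def find_class(self, module, name):
        if (module, name) not in self.ALLOWED:
            raise pickle.UnpicklingError(f"blocked global: {module}.{name}")
        return super().find_class(module, name)

def safe_loads(data: bytes):
    """Unpickle data while refusing every global lookup."""
    return SafeUnpickler(io.BytesIO(data)).load()

payload = pickle.dumps(Marker())
print(pickle.loads(payload))  # the plain loader runs len("abc")

try:
    safe_loads(payload)
except pickle.UnpicklingError as err:
    print("refused:", err)

print(safe_loads(pickle.dumps({"a": [1, 2]})))  # plain data still loads
```

A tensor-only format has no such call mechanism to begin with, which matches the post's point that safetensors files are the safe option.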
Doesn't work.
File: ComfyUI_Flux_00141_.png (1.71 MB, 1024x1280)
I assure you they are quite distinct images! You see, they were paintings before, and now they are 3d renders (of sorts). I'm trying to get transparent monsters, but as usual, it's pretty hard.
The LoRA to see if it works on my machine.
I have never had a flux LoRA fail in this way.
Does this happen on other LoRAs on Q8?
it is said that some people can goon for an entire day
I assume I also run Flux as the model to train it with?
File: output.webm (238 KB, 380x380)
yep. well ok
What was the style for this one?
Are you using Comfy? I'm not sure if Comfy Workflows can be opened by Forge, etc
File: FD_00301_.png (1.29 MB, 1024x1024)
It is quite hard. Often bits of the subject are opaque for no reason, but occasionally you get a seed where it works.
Yes, the full flux.1 dev model, no quant, no trimmed model
File: 00076-693735934.png (885 KB, 1024x1024)
Wow, that's nice! I wish it worked every time. What did you use? Glass sculpture? I've tried ghost, ice/glass sculpture, specter, all useless. My best results so far is "made of transparent jelly", with a bunch of transparent/translucent/see-through scattered throughout, but still spotty results at best.
I smoke vRAM, you smoke cock.
very nice
File: 01056-412381868.jpg (476 KB, 1152x1536)
File: file.png (2.72 MB, 1024x1024)
That's really good, maybe 1gb lora is overkill but if the price to pay is this result then I'm ok with that I guess
File: FD_00002_.png (1.28 MB, 1024x1024)
>A transparent glass woman (PROMPT) and we can see the (BACKGROUND) behind her refracted through her glass skin.
But that was cherry picked. Most of them had bits of human glued on like picrel
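Since most results need rerolling, it can help to generate the prompt variants from the template instead of retyping them. A sketch using `string.Template`, with the (PROMPT) and (BACKGROUND) slots renamed `$details` and `$background`; the fill-in values are invented for illustration.

```python
from string import Template

# The template from the post above, with its two slots as placeholders.
GLASS = Template(
    "A transparent glass woman $details and we can see the "
    "$background behind her refracted through her glass skin."
)

def variants(details, backgrounds):
    """Return one prompt per (details, background) combination."""
    return [
        GLASS.substitute(details=d, background=b)
        for d in details
        for b in backgrounds
    ]

prompts = variants(
    ["is sitting on a bench", "is holding an umbrella"],  # hypothetical fills
    ["city skyline", "forest"],
)
for p in prompts:
    print(p)
```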
Ah, no, I see. My mistake I must have left it on a 1 hour lifetime.
Here's one that should be available for a while
>Does this happen on other LoRAs on Q8?
a test lora i did when flux training first came out seemed to work ok, at least it wasnt giving me garbage like this one, so i'm assuming it's either a setting on the new lora config or kohya or something
also fucking-a (trying a new training config)
well based on your attached image, I'll have to presume you are retarded, which coincidentally explains the issues you are experiencing.
Sorry, every one of my flux LoRAs has worked perfectly on Q8 except 2, where it was just under-baked.
I don't know the reason it's doing this.
It's still much better than what I got, despite the eyes and hair. I tried "see the background refracted through the body", almost word by word too, but I guess it didn't fit the rest of my prompt. I'll try yours later.
have fun trying to upscale those.
that is entirely possible
catbox config pls
File: attempt2.png (780 KB, 1024x1024)
Hmmm, any luck getting two packages with different shapes side by side? Flux will not listen to me.
Also, I didn't have "Bacon!" in the prompt, but you seem to have it too, so it looks like Flux likes adding "Bacon!" to these pictures...
that's actually one of the reasons I'm sticking to ComfyUi, that and the fact you can put the text encoder in a second gpu or into your cpu
File: 00100-475873898.png (1.88 MB, 1231x808)
Here's the full prompt and seed, maybe the seed is the key.
>A transparent glass woman. She is standing in a prevocative pose. She is wearing a cloth white singlet and running shorts. She is walking through a city park, and we can see the nature behind her refracted through her glass skin.
>seed: 27
File: tmpqfllxx1v.png (921 KB, 1152x896)
File: output.webm (201 KB, 380x380)
ty! haha
I train on Civit
ah, how many source images? and does it tell you how many steps/epochs/learning rate?
25 source images, every setting on Civit is at default except the resolution 1024x1024, repeats 20, epochs 10.
File: file.png (1.32 MB, 1392x1024)
My computer is busy so I use the web schnell, so it's not as good as dev can get. I did have it say "Bacon!". You have to methodically prompt it in the way VLMs do it. Usually something like:
This is a stock image
of two side by side products
on the left side is a package shaped like a pig
on the right side is a package shaped like a chicken

The prompt I used:
this is a stock photo of two products side by side, on the left side is a plastic saran wrapped package in the shape of a pig, inside the package is lots of uncooked bacon.  On the package it says "Bacon!  I love it".  On the right side of the image is a plastic package in the shape of an hen, chicken.  It appears this package is an egg carton.  On the egg carton it says "Mother's Bounty".
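The methodical structure above (medium first, then layout, then one sentence per region) can be captured in a small helper. `describe_scene` is a hypothetical function written for this sketch, not part of any prompt tool.

```python
def describe_scene(medium, layout, parts):
    """Build a caption-style prompt.

    parts: list of (position, description) pairs, in reading order.
    """
    lines = [f"This is {medium} of {layout}."]
    for position, description in parts:
        lines.append(f"On the {position} is {description}.")
    return " ".join(lines)

prompt = describe_scene(
    "a stock photo",
    "two products side by side",
    [
        ("left side", "a package shaped like a pig"),
        ("right side", "a package shaped like a chicken"),
    ],
)
print(prompt)
```

Keeping each region in its own sentence makes it easy to swap one object without disturbing the rest of the prompt.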
now make them do something that doesn't look like a brainfried tik tok retard-o dancey dance
that skull image looks sick
File: ComfyUI_00017_.png (2.73 MB, 1920x1440)
File: Chickencomic.png (647 KB, 1024x1024)
Thanks! This all has inspired me to make a comic about it.
This is a colorful digital cartoon drawing. The scene takes place in a rural farm setting, with a large, brown tree in the background on the left side. In the foreground, there is a large, yellow hay bale on the ground to the left of the image. On the right side of the image, there is a small, pink pig with purple hair and eyelashes standing inside a wooden barn door. The pig is holding a plate with fried eggs and has a concerned expression and is looking out at the viewer. To the left of the pig, there is a white, cartoon-style chicken with red feathers and a red comb on her head. The chicken has expressive eyes and is holding a plate with bacon and is perched on a stone wall that divides the scene. The chicken is also looking at the viewer with a questioning expression. The background shows a simple, grey stone wall with a red roof above it, indicating a barn or stable. The ground is a light brown color, suggesting dirt or earth. The text at the bottom of the image reads, "What are you having for breakfast?" The style of the cartoon is clean and cartoonish, with bold outlines and bright, solid colors.
I thought gore posting wasn't allowed.
File: ComfyUI_00583_.png (1.28 MB, 1024x1024)
Which issues?
File: output.webm (307 KB, 380x380)
I just wonder if the lora OOM issue when loading multiple loras or big ones is fixed.. I am still on the August 25th release
ah there he is.. is the lora OOM issue fixed?
Not enough variation in the images.
File: jpg.jpg (2.68 MB, 4224x2352)
Anybody wants to try some Flux-Dev prompts/settings?
It's too fat for both my gpu and system memory, so i'm partially running it on disk
File: comfy.png (16 KB, 86x95)
>Which issues?
the biggest one would be the fucking unload/reload on every lora change, are you gonna let city fix that shit for you or? he showed that it's possible to prevent that, so go fix your memory management shit comfy
File: 1725348647.png (1.17 MB, 1024x1024)
>50 steps for 512x768 on a literal toaster
What the hell.
File: 1725351643.png (969 KB, 1024x1024)
>1e-4 LR
>only hands in the dataset are VERY clean, clear, focus of the image
>by epoch 3 the hands are already a garbled mess 40% of the gens
why is flux like this with animu
File: 1725352443.png (1008 KB, 1024x1024)
File: 2024-09-04_00284_.jpg (1.16 MB, 2160x3840)
File: xyz_grid-0007-3740329115.png (2.64 MB, 2064x888)
lora strength testing
File: ComfyUI_00580_.png (1.4 MB, 1024x1024)
blame the lora

that's fixed.

yeah I have to do something about that.
how many images, how many steps?
File: 2024-09-04_00271_.png (1.62 MB, 1024x1024)
>that's fixed.
yaaay! I take your word for that.. time to pull! :D
>yeah I have to do something about that.
nice, can't wait for it to happen man o/
File: jpg2.jpg (1.95 MB, 4224x2352)
I've got time - with the memory bottleneck the average gpu load is only 15%
Also i wanted to see how much additional samples improve the image. Too bad i changed the prompt between the comparison
generic depressed stickgirl batch 0253
ok that one is trippy pw
File: file.png (2.22 MB, 1024x1024)
still testing it
80 images, batch 1, 1024x1024
steps at epoch 3 would be 3x80
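For anyone checking the math, a minimal sketch of the step count, assuming one optimizer step per batch and no dataset repeats or gradient accumulation (`total_steps` is a made-up helper, not part of any trainer):

```python
import math


def total_steps(num_images: int, batch_size: int, epochs: int) -> int:
    """Optimizer steps after `epochs` full passes over the dataset.

    Assumes one step per batch, no repeats, no gradient accumulation.
    """
    # A partial final batch still counts as one step, hence the ceil.
    steps_per_epoch = math.ceil(num_images / batch_size)
    return steps_per_epoch * epochs


print(total_steps(80, 1, 3))  # 240
```

With 80 images at batch 1 that is 80 steps per epoch, so epoch 3 lands at 240 steps. Trainers that use repeats or gradient accumulation will count differently.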
File: 1725353239.png (1.6 MB, 1024x1024)
real results don't happen until like 2000-4000 steps
everything falling apart is part of the process as the model is looking for a solution to the puzzle
i was planning on trying other things, but you or whoever else wants to harass me by posting dox info, so ill 'stick' with girl gens, at least for ldg.
I did 30 epochs total and none of them have better hands... this is my third lora now, two before that style loras, all of them have this same issue despite testing varying datasets/amounts, different LRs, different optimizers/schedulers. I honestly haven't seen an anime or stylized lora for flux that doesn't do this so far
>this is my third lora now,
to clarify, I mean for flux, I've made hundreds for sdxl so its not like its my first rodeo
You just have to reroll for better hands.
Then your learning rate might be too high or your dataset is confusing to the model. Also I don't even know what your definition of good hands is, is this like a fetish thing? Are you looking for very specific hands?
spellcheck is still broken, I have to manually fix it after each pull
It would help if you showed an example dataset image and the result of that image's caption on your 4000 step Lora.
nono don't take it the wrong way. I'm a friendly. pso anon actually but shh
nta but i bow to your composition skills, nice
File: output.webm (255 KB, 380x380)
gn frens
my LR is 1e-4, thats pretty low already, is it not? and I've used multiple datasets all of which performed fine on SDXL. I'm not looking for specific hands, I mean like, not melted fingers? do you guys really not understand what I'm talking about? it seems very common on anime/style loras for flux I've seen, not as sure about realistic since its not really my thing
it is fixed in the latest release
kek nice other hand
File: xyz_grid-0008-2721968325.png (1.64 MB, 1464x888)
no it isn't, CLIPFluxTextEncode has no spellcheck
File: comfy ahhhhh.png (27 KB, 154x102)
There is no right learning rate, it's dependent on a lot of factors. Please provide an example of how the hand should look vs how the hand actually looks. Use a prompt that is the same as the caption used to train.
do you prompt for the dark eyebags?
File: Easter.png (802 KB, 1024x1024)
Heh, I just search for a comic on google that has the compositions I want, put it through joycaption, then edit it to make it have the joke I wanted to deliver. Some appear better than others, though. Eggs made me think of the easter bunny.
This is a colorful, digital cartoon drawing featuring a humorous scene. At the center, there is a large, anthropomorphic white rabbit with long ears sitting on a green throne-like chair. The rabbit has a cheerful expression, with a red bow tie and a plaid shirt. A young girl is sitting in his lap, who appears to be smiling and excited. The girl is dressed in a red shirt and blue pants. In the foreground, three children stand in front of the rabbit. The child on the left has blonde hair and is wearing a pink shirt and blue pants. The middle child has brown hair and is dressed in a green shirt and blue pants, looking slightly confused. The child on the right has blonde hair and is wearing a green shirt and blue pants, also looking slightly surprised. Behind the rabbit, there is a large banner with the words "EASTER BUNNY" written in large, bold letters. The background is a simple yellow gradient, and the floor is covered with a red carpet. On the right side of the image, there is a sign with an arrow pointing to the right, indicating a direction. The text at the bottom reads, "A doll? Who do you think I am, Santa?" The overall style of the drawing is playful and humorous, with exaggerated features and bright colors.
no, seems to be the lora, which i am fine with because i am into it
File: ComfyUI_00027_.jpg (880 KB, 1920x1440)
it's now in the options, you have to enable it
File: 00005-2462465663.png (3.14 MB, 1232x1776)
File: tmpus602y9k.png (874 KB, 1152x896)
>so comfortable
File: ComfyUI_00101_.png (2.43 MB, 1616x1056)
>install a module for ComfyUI
>it decides to uninstall torch and torchvision, which were up to date, and go back to an older version
>ComfyUI won't boot anymore because "torch not compiled with cuda enabled"
>have to download torch again (2.4GB) on my amish 5Mbps connection
yes, i am retarded, but also i fucking hate python
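A quick sanity check for this situation, as a rough sketch: it only reports whether the torch currently in the environment is a CUDA build, and assumes you run it with the same Python that ComfyUI uses (`torch_cuda_status` is a made-up helper name).

```python
import importlib.util


def torch_cuda_status() -> str:
    """Describe whether the installed torch can use CUDA."""
    if importlib.util.find_spec("torch") is None:
        return "torch is not installed"
    try:
        import torch
    except ImportError as exc:
        return f"torch is installed but fails to import: {exc}"
    # torch.version.cuda is None on CPU-only wheels, which is what
    # produces the "Torch not compiled with CUDA enabled" error.
    if torch.version.cuda is None:
        return f"torch {torch.__version__} is a CPU-only build"
    if not torch.cuda.is_available():
        return (f"torch {torch.__version__} was built for CUDA "
                f"{torch.version.cuda} but no usable GPU/driver was found")
    return f"torch {torch.__version__} with CUDA {torch.version.cuda} is usable"


print(torch_cuda_status())
```

Running it right after installing a custom node shows whether the node's requirements swapped your torch for a CPU-only wheel, before you find out at boot.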
The beta ui workflow, lora reloading, gguf support
File: 1706759487610.jpg (512 KB, 1024x1024)
File: hands.png (1.87 MB, 1024x1024)
here is an example of how the hands shouldn't look like, using a prompt directly from captions. same issue happens using prompts not from captions.
..........I can't imagine needing an example of what they should look like in contrast
File: 1706243421763.jpg (509 KB, 1024x1024)
Is there a node that connects "image" which shows a preview of the image AND the seed/metadata of the image underneath?
File: 2024-09-04_00294_.jpg (702 KB, 3840x2160)
File: ComfyUI_00102_.png (1.35 MB, 1616x1056)
pepe with hair? yes please
>but also i fucking hate python
honestly fucking around with comfy and lora training has made me hate python like nothing else, and I used to have to review convoluted bloat javascript written by jeets
Is there a lora manager for comfyui that gets the info from civitai?
File: file.png (75 KB, 735x703)
for the guy who didn't want to believe his florence captioning script is slow, here are my test results on a 3090
how the fuck do we kill the python menace?
that looks fine to him surprisingly kek
yeah I really want to write custom cpp code for AI
fucking retard
File: ComfyUI_00104_.png (1.45 MB, 1616x1056)
he just came back from turkey
torch is open source, just port it to a sensible language.. good luck, ill cope with python instead and enjoy my life
nothing's preventing you from coding everything in C++ anon, that's what the llama.cpp guys did
>libtorch is a thing but it is just a wrapper over python code
File: laugh-michael-jordan.gif (1.36 MB, 220x220)
maybe the problem is your autism prevents you from seeing the strengths of python
you know your dumbass module with wrong torch versions?
it doesn't go away because your sweaty is going c++
File: ComfyUI_00172_.png (1.8 MB, 1024x1280)
>flux man nipples
how the fuck can it fuck up if it's an embedded library retard
there's nothing to fuck up because no one will do anything
that dumbass module someone made?
guess they didn't make it for your retarded embedded version
that's why python won
File: 2024-09-05_00001_.png (1.05 MB, 1280x720)
File: ifx353.jpg (258 KB, 1024x1024)
its truueee.. torch 2.5.0 gave me 20% speed increase on a 4090... yaaaaa
wouldn't plugins just reference their own embedded library bypassing the issue? do you know how c++ even works?
wouldn't even compile on your system, whoops
wouldn't c++ filter most shitty devs on its own?
File: file.png (3.45 MB, 3185x1612)
did you notice any drop in quality overall? because 2.5.0 gives really different pictures compared to 2.3.1 for example
File: 2024-09-05_00005_.png (1 MB, 1280x720)
with cuda 12.1 yes. with cuda 12.4 no .. I guess cuda 12.4 is a must for the new torch versions
exactly, you just move to a new layer of hell, have most of the same problems and all because the poor baby had to download an extra 2 GB file because of dumb python dependencies (I'm not excusing it) but the alternative is fucking awful
really interesting, I'll keep that in mind, thanks anon
libtorch and C++ or bindings for libtorch in some other language. Optionally port TRT/other optimization library kernels to torch extensions
This though. Nobody wants to do it, modeling is only part of it, there's all the ancillaries that don't really have an equivalent
>Nobody wants to do it
C++ is a horrible language to work with desu, so I can't blame them
what if devs used a lua injection to simplify everything
or we can keep using python
but go ahead anon, write your LUA, be the change
I said WE not ME
File: dragongunner.png (2.97 MB, 1482x1142)
saved. gonna butcher this with some sdxl masks... kekeke


File: ifx354.jpg (286 KB, 1024x1024)
File: 1wfjE31-24.jpg (1.2 MB, 952x1400)
Oh, reminds me of Feng Shen Ji, but i believe its too violent for flux lol

There is that:
File: ifx355.jpg (257 KB, 1024x1024)
