/g/ - Technology




File: collagen (1).jpg (3.03 MB, 3264x3264)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102230647

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>
Blessed thread of frenship
>>
File: tmpgi72sd_i.png (1.06 MB, 1152x896)
>>
>adding more words to the prompt messes up the LORA on next gens

Anyone else have this issue?
>>
File: 2024-09-05_00014_.png (747 KB, 1280x720)
>>102235413
thanks for bread
>>
>>102235477
lmao dude's prompt is leaking
>>
>>102235477
what lora? maybe it was trained with wonky captions?
>>
>>102235440
SOVL
>>
File: 1.png (805 KB, 1024x1024)
>>102235523
All loras

Example image, 1st gen with original prompt and lora loaded
>>
File: 2.png (744 KB, 1024x1024)
>>102235578
And this is 4th gen with same seed and prompt after changing the prompt on 3rd gen
>>
File: Cooking.png (757 KB, 1280x768)
This is a highly detailed digital illustration depicting a vibrant and colorful kitchen scene. The central focus is a blue-tiled kitchen counter with various cooking utensils and food items. In the foreground, there are two black frying pans on a red electric stove. The left pan contains several sausages and a small bowl of mashed potatoes, while the right pan has a pile of golden-brown French fries. To the left of the stove, there is a large bowl filled with instant noodles, garnished with green onions and a boiled egg with bakon.  On the counter to the right, there is a plate with a grilled steak, a glass of orange juice with a slice of lemon, and a small bowl of salad. In the background, a pink kettle steams with a cup of tea beside it. The counter also features a red toaster with two slices of bread inside. The background wall is a soft, pastel blue, and the floor tiles are a light blue and white pattern. The overall style of the image is highly detailed and realistic, with a focus on textures and vibrant colors, giving it a modern, inviting feel.
>>
>>102235477
What UI?
>>
>>102235615
comfy, updated it to latest version
>>
>>102235578
>>102235594
spooky.. how much did you change the prompt?

>>102235621
you not using --fast? cause that is non deterministic and causes big trouble
>>
>>102235413
Why is the news dump never included anymore?
>>
File: file.png (940 KB, 854x480)
>>102235621
>comfy, updated it to latest version
he pulled?
>>
train for flux with 512x512, or 1024x1024? or use buckets and fuck it?
>>
File: 1.png (1.36 MB, 1024x1024)
>>102235632
Added one word at the end of the prompt

Here is this on 1st gen

>This is a highly detailed digital illustration depicting a vibrant and colorful kitchen scene. The central focus is a blue-tiled kitchen counter with various cooking utensils and food items. In the foreground, there are two black frying pans on a red electric stove. The left pan contains several sausages and a small bowl of mashed potatoes, while the right pan has a pile of golden-brown French fries. To the left of the stove, there is a large bowl filled with instant noodles, garnished with green onions and a boiled egg with bakon. On the counter to the right, there is a plate with a grilled steak, a glass of orange juice with a slice of lemon, and a small bowl of salad. In the background, a pink kettle steams with a cup of tea beside it. The counter also features a red toaster with two slices of bread inside. The background wall is a soft, pastel blue, and the floor tiles are a light blue and white pattern. The overall style of the image is highly detailed and realistic, with a focus on textures and vibrant colors, giving it a modern, inviting feel.
>>
>>102235681
512x512 with buckets, should take 1/4 of the time.
>>
>>102235702
Does it return to normal output when you remove that word? If so.. I guess it's just extremely bad luck. Or T5 just threw a fit ..
>>
File: 2024-09-05_00021_.jpg (743 KB, 3840x2160)
>>
File: 2.png (1.49 MB, 1024x1024)
>>102235702
Here is same prompt and seed but 3rd gen after changing one word in the prompt on 2nd gen

>>102235726
No, the gen gets worse and worse until I load another lora

Looks like it does not unload the previous gen and combines them both if the prompt changes but lora stays the same

Using the GGUF unet too
>>
>>102235681
512x512 is a lot faster but 1024x1024 is worth it in my opinion. I notice the quality/detail difference immediately in my own loras and everyone who trains at 512 and says they see no difference, I can see it when they post pictures. you bucket either way, though. if you have images that are smaller, just use multi resolution (some 512x512, some 1024x1024, etc)
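
A rough sketch of what bucketing does and where the "1/4 of the time" figure comes from. The function names, the 64px step and the 2:1 aspect cap are assumptions for illustration, not what any particular trainer does, and the cost ratio assumes step time scales linearly with pixel count (attention layers scale worse than that).

```python
def make_buckets(base: int, step: int = 64, max_ratio: float = 2.0) -> list[tuple[int, int]]:
    """Enumerate (width, height) buckets whose area stays at or below base*base."""
    buckets = set()
    width = step
    while width <= base * max_ratio:
        # Largest height (a multiple of step) that keeps the area within budget.
        height = (base * base // width) // step * step
        if height >= step and max(width, height) / min(width, height) <= max_ratio:
            buckets.add((width, height))
        width += step
    return sorted(buckets)


def nearest_bucket(width: int, height: int, buckets: list[tuple[int, int]]) -> tuple[int, int]:
    """Pick the bucket whose aspect ratio is closest to the source image."""
    ratio = width / height
    return min(buckets, key=lambda b: abs(b[0] / b[1] - ratio))


def pixel_cost_ratio(low: int, high: int) -> float:
    """Pixels per square sample at `low` resolution relative to `high`."""
    return (low * low) / (high * high)


if __name__ == "__main__":
    buckets = make_buckets(512)
    print(buckets)
    print(nearest_bucket(1280, 768, buckets))
    print(pixel_cost_ratio(512, 1024))
```

Mixing resolutions, as suggested above, would amount to building bucket lists for more than one `base` and assigning each image to the list that matches its native size.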
>>
>>102235413
I saw some sort of "Flux for retards" guide a couple of weeks ago, but lost it.
Does anyone have a link?
I remember seeing info about low VRAM solutions, which is relevant to me and my ancient dinosaur 1080.
In SD1.5 I have to run several scripts to even get it to do anything, and several workarounds for upscaling.
>>
File: Newyear.png (842 KB, 1280x768)
>>102235702
Oh no, I misspelled bacon as bakon and now people copying the prompt will misspell it for eternity.
This image is a digitally created illustration in a cartoon style, depicting a festive New Year's Eve celebration in a cozy, warmly lit restaurant. The background features a blue wall with a small, ornate fireplace and a window with a yellow curtain. Above the window, a string of colorful fairy lights is draped, adding to the festive atmosphere.  In the foreground, there are five persons, all with simplified, cartoonish features, celebrating. The central figure, a child with a red hat and skirt, is holding a small gift box and waving enthusiastically. To the left, someone in a red uniform is standing with arms raised, while a girl in a black suit and top hat is seated at a nearby table. Another young woman in a blue shirt and red tie is standing next to the seated person, and a fourth figure in a red shirt and white pants is standing near the central child. The fifth figure is seated at a table, wearing a white shirt and black pants. The tables are round with teal-colored stools, and each table has a small, decorated cake with a lit candle. Confetti and streamers are scattered throughout the scene, enhancing the celebratory mood. The floor is wooden, and a red railing runs along the top edge of the image. 
>>
>>102235754
>No, the gen gets worse and worse until I load another lora
okay thats a bug, not a feature.. if you can reproduce it, the best is probably to report it.
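
For a bug report, a minimal repro along these lines might help: run prompt A, run prompt B, run prompt A again with the same seed, and compare hashes. Everything here is a stand-in. `CleanBackend` and `LeakyBackend` are hypothetical fake generators, not ComfyUI code, and the "leak" only imitates the reported symptom of an earlier gen bleeding into later ones.

```python
import hashlib
import random


def digest(data: bytes) -> str:
    """Hash raw output so two gens can be compared exactly."""
    return hashlib.sha256(data).hexdigest()


def find_state_leak(generate, prompt_a: str, prompt_b: str, seed: int) -> bool:
    """Return True if running prompt_b in between changes the output of prompt_a."""
    first = digest(generate(prompt_a, seed))
    generate(prompt_b, seed)  # the "changed prompt" gen in between
    again = digest(generate(prompt_a, seed))
    return first != again


class CleanBackend:
    """Hypothetical generator whose output depends only on prompt and seed."""

    def __call__(self, prompt: str, seed: int) -> bytes:
        rng = random.Random(f"{prompt}|{seed}")
        return bytes(rng.randrange(256) for _ in range(64))


class LeakyBackend:
    """Hypothetical generator that carries residue over from earlier runs."""

    def __init__(self) -> None:
        self.residue = 0

    def __call__(self, prompt: str, seed: int) -> bytes:
        rng = random.Random(f"{prompt}|{seed}")
        out = bytes((rng.randrange(256) + self.residue) % 256 for _ in range(64))
        # State that should have been reset between runs but is not.
        self.residue = (self.residue + len(prompt)) % 256
        return out


if __name__ == "__main__":
    print(find_state_leak(CleanBackend(), "kitchen", "kitchen, bacon", 42))
    print(find_state_leak(LeakyBackend(), "kitchen", "kitchen, bacon", 42))
```

With a real UI the `generate` callable would wrap whatever produces the image bytes. Comparing decoded pixels rather than file bytes is probably safer, since saved PNGs can embed metadata.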
>>
Me on the left.
Fuck flux by the way.
>>
>>102235440
I'm intrigued
>>
>>102235760
>https://comfyanonymous.github.io/ComfyUI_examples/flux
but use
>https://huggingface.co/city96/FLUX.1-dev-gguf
>https://huggingface.co/city96/t5-v1_1-xxl-encoder-gguf
use the largest versions your gpu can handle, it won't be a speed increase but quality changes
>>
>>102235819
Not quite what I was looking for, but thanks
>>
>>102235807
>masterpiece, best quality, absurdres
>>
>guy who owns a social media claims to own a cluster of 100,000 H100s
Are we prepared for the SaaS trolls?
>>
File: 2024-09-05_00027_.jpg (733 KB, 3840x2160)
>>
>>102235855
>looks like kino
yeah
>>
>>102235855
what ever happened to
 incredibly_absurdres 
?
>>
File: hotelf.png (1.36 MB, 1280x768)
Wow, I sent an image to joycaption and got a prompt, then I hit the spacebar by accident and it disappeared, losing it forever, but I got a new one that was completely different, so just like with image generations and seeds, you may have bad luck and get a bad prompt.
Also, it doesn't detect isometric perspective, I've been having to add it manually because otherwise the compositions aren't this cool.
This image is a colorful, highly detailed digital rendering of an isometric, indoor, luxurious hotel lobby, likely in a tropical or subtropical setting. The lobby features a grand, spiral staircase with ornate railings leading to a second floor balcony. The floor is a light beige with various patterns and textures. In the center of the lobby, there is a circular, stone-lined fountain with a small waterfall, surrounded by lush green potted plants and a small seating area with a couch and armchair. On the right side of the image, there is a large, circular reception desk with a green and white counter, flanked by more potted plants and a seating area with a couple of chairs and a table. Above the reception desk, a large, round screen displays the hotel logo and a counter with a green background showing the current time and a total of 2,704 guests. The top right corner features a small, circular pool with a lounge chair and an umbrella, and a large, green, tropical plant. The left side of the image includes a small shop with a red and white striped awning and a variety of merchandise displayed on shelves. The background is filled with decorative elements like ornate chandeliers, paintings, and plush seating areas. The overall ambiance is vibrant.
>>
>>102235799
I think the new comfy update broke the unet loader (gguf).

The lora prompt changing has no issue with the "load diffusion model" node.
>>
File: 2024-09-05_00029_.jpg (759 KB, 3840x2160)
>>102235861
~100 megawatt energy draw .. 3.7 exaflops (faster than the fastest machine on the Top500 supercomputer list)

he mad.. and what does Elon do with it? Train Grok.. absolute madman
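
Sanity check on those numbers, as a sketch. The per-GPU values (700 W board power, 34 TFLOPS peak FP64) are assumptions taken from memory of the H100 SXM spec sheet, and the 1.4 overhead factor for CPUs, networking and cooling is a guess. This is peak arithmetic, not a measured benchmark like the ones Top500 ranks on.

```python
GPUS = 100_000
GPU_TDP_W = 700    # assumption: H100 SXM board power in watts
FP64_TFLOPS = 34   # assumption: H100 SXM peak FP64, non-tensor
OVERHEAD = 1.4     # assumption: hypothetical factor for CPUs, network, cooling


def cluster_power_mw(gpus: int = GPUS, tdp_w: float = GPU_TDP_W,
                     overhead: float = OVERHEAD) -> float:
    """Estimated total draw in megawatts."""
    return gpus * tdp_w * overhead / 1e6


def cluster_exaflops(gpus: int = GPUS, tflops: float = FP64_TFLOPS) -> float:
    """Aggregate peak throughput in exaflops (1 EFLOPS = 1e6 TFLOPS)."""
    return gpus * tflops / 1e6


if __name__ == "__main__":
    print(f"{cluster_power_mw():.0f} MW, {cluster_exaflops():.1f} EFLOPS peak")
```

Under these assumptions the estimate lands in the same ballpark as the ~100 MW and 3.7 exaflops quoted above.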
>>
File: 00117-1847719189.png (622 KB, 808x616)
>>
File: ComfyUI_Flux_12379.jpg (156 KB, 672x1504)
>>
>>102235942
nice
>>
File: 2024-09-05_00031_.jpg (858 KB, 3840x2160)
>>
File: Fluxgirls.png (908 KB, 768x1280)
>>102235807
This is Flux's version.
This is a digital illustration in an anime style, depicting two voluptuous girls dressed in provocative bunny costumes. Both have fair skin and are posed confidently with one hand on their hips. The woman on the left has long, blonde hair tied into twin tails, and she is wearing a black bunny costume with a high-cut bodysuit that accentuates her breasts and curvaceous hips. She has a playful expression with a slight smirk and a cigarette dangling from her lips. The woman on the right has short, silver hair and is also dressed in a black bunny costume with a similar high-cut bodysuit. She has a more serious expression, looking directly at the viewer. Both costumes feature black bow ties, white cuffs, and black bunny ears. The background is a dimly lit room with a blurred, urban cityscape visible through the windows, suggesting an indoor setting. The overall atmosphere is sensual and slightly suggestive, with a focus on their curves and the allure of their costumes. The image is highly detailed with a glossy texture to the costumes, enhancing the sense of realism and depth.
>>
>>102235886
joy caption just runs on whatever LLM you feed it, so what it does is up to the LLM you use and the base caption/instruction you give that LLM (and yes, since it's an LLM, varying outputs are a thing)
>>
>>102236002
they look fat
>>
>>102235895
more like city96's bad coding broke it when he tried to fix the unloading lora thing, not considering why it was there to begin with. I wish illya really had made fun of him, he deserved it
>>
File: tmpijs456ji.png (937 KB, 1152x896)
>>102235574
>>102235808
Appreciated.
>>
>>102236065
>bad coding
Where is your pull request genius?
>>
>>102236077
why would I fix something I don't use/can be resolved by reverting to the previous commit? sorry you're in the double digits for IQ and think what city is doing is some miracle, I swear none of this thread is actually into tech
>>
File: ignored.png (995 KB, 1280x768)
>>102236056
I'm just using this one:
https://huggingface.co/spaces/fancyfeast/joy-caption-pre-alpha
And have no idea what it's using under the hood.
Also, I asked Flux for a chicken and a pig and it just ignored them.
This is a high-resolution photograph of a neatly arranged breakfast plate. The plate is round, white, and placed on a light gray wooden surface with subtle grain texture. On the plate, there are two sunny-side-up eggs with bright yellow yolks and slightly crispy edges, sprinkled with a light dusting of black pepper. To the left of the eggs there's a chicken. There are three slices of crispy, golden-brown bacon, adding a rich, savory contrast to the dish. To the right of the bacon, there's a pig. There are several small, round cherry tomatoes, charred and slightly blistered, adding a vibrant red color and a hint of sweetness. Fresh, green parsley leaves are scattered around the plate, adding a pop of color and freshness. To the bottom right of the plate, there is a fork with a sleek, modern design, its handle made of metal with a wooden grip. In the background, there is a clear glass mug filled with a dark brown liquid, likely coffee, and a wooden pepper grinder, adding a rustic touch to the scene. The overall composition is well-balanced, with a focus on the vibrant colors and textures of the food, making the plate look appetizing and inviting.
>>
File: ComfyUI_Flux_12405.jpg (236 KB, 672x1504)
>>
does flux have good controlnet yet?
>>
File: 1716006235363.jpg (374 KB, 1024x1024)
>>
File: _bing_.png (1.99 MB, 1011x1011)
>>
>>102236097
I'm a contributor too and just think that you are disrespectful and toxic. Your initial comment brings nothing.
>>
>>102236164
yes, have not tried it yet tho
>https://huggingface.co/XLabs-AI/flux-controlnet-collections
>>
>>102236182
really, my initial comment pointing out what the issue is and how to resolve it brings nothing? you must be a jeet
>>
File: 1701143195864.jpg (279 KB, 1024x1024)
>>
File: ezgif-7-5ada7ae038.webm (2.48 MB, 490x718)
>>
>>102236182
>Status: Dilating.
>>
my improvement for forge got ignored and I am never contributing to open source again because of it
>>
Ever notice how whenever someone brings up City96 doing something wrong there are replies as if the "anon" is taking it really personally?

Maybe stop lurking here if you don't want to hear people speaking honestly about you, City. Your fragile ego apparently can't even handle other people implementing your code in their repos, let alone 4chan.
>>
File: 1724052069201.jpg (1.73 MB, 1024x1024)
>>
File: 1695406601221.jpg (372 KB, 1024x1024)
>>
File: 1699296991200.jpg (305 KB, 1024x1024)
>>
Pill me on finishing sketches with img2img
Any tips for the sketches themselves?
What software do you use?
What model/settings? Do you describe the input image or just let the model guess based on the sketch?
>>
>>102236243
kek I've noticed it too. if it weren't for xirs melty I'd think it was a troll, but its not like he hasn't posted here before. kinda sad if xe's just seething all the time desu
>>
File: ComfyUI_Flux_12423.jpg (116 KB, 672x1504)
>>
File: 1724367280294.jpg (1.47 MB, 1024x1024)
>>
>>102235885
and my favorite:
>1woman
>>
>>102236243
>>102236182
>>102236065
I think it's comfy lack of support for gguf, I reverted back to the old city gguf commit and the issue is still there
>>
>>102235895
Not just the gguf, it also broke the impactpack components. The detailer node completely fucks up the face after the first generation.
>>
File: 00047-720600542.png (328 KB, 512x512)
>>102236243
I'm not city96 but I think you should stop being mean, what have you contributed? exactly, I think we should lift people like city up instead of bringing them down, idk what code in repos you're on about but if we keep bullying people then you'll have no code in your repo or whatever
>>
File: abnormal.png (627 KB, 1280x768)
I ran out of joycaptions, so I had to use https://aichatonline.org/gpts-2OToA97Vhr-Describe-Image , which gives prompts so long they have to be pastebinned:
https://pastebin.com/HMkC64vg
The whole point of the original picture is that she was morbidly obese, which was completely missed. AI still has a long way to go.
>>
File: 2024-09-05_00045_.jpg (1.21 MB, 3840x2160)
>>
File: tmp1j255ivn.png (1.12 MB, 1152x896)
>>
>>102236274
>Any tips for the sketches themselves?
Yeah, you have to pre-color them before sending them or you're going to get different versions of the same sketches.
>>
File: 2024-09-05_00050_.jpg (1.07 MB, 3840x2160)
>>
File: nowwithapig.png (1 MB, 1280x768)
>>102236109
Getting closer...
This is a high-resolution photograph of tow animals, a chicken and a pig, and a neatly arranged breakfast plate. The plate is round, white, and placed on a light gray wooden surface with subtle grain texture. On the plate, there are two sunny-side-up eggs with bright yellow yolks and slightly crispy edges, sprinkled with a light dusting of black pepper. To the left of the eggs there's a chicken. There are three slices of crispy, golden-brown bacon, adding a rich, savory contrast to the dish. To the right of the bacon, there's a pig. There are several small, round cherry tomatoes, charred and slightly blistered, adding a vibrant red color and a hint of sweetness. Fresh, green parsley leaves are scattered around the plate, adding a pop of color and freshness. To the bottom right of the plate, there is a fork with a sleek, modern design, its handle made of metal with a wooden grip. In the background, there is a clear glass mug filled with a dark brown liquid, likely coffee, and a wooden pepper grinder, adding a rustic touch to the scene. The overall composition is well-balanced, with a focus on the vibrant colors and textures of the food, making the plate look appetizing and inviting.
>>
>>102236337
city is doing good work but he's too sensitive to criticism, having that meltdown made people question his actions, he's kind of right tho, must be frustrating to add support for gguf then next update comfy or forge breaks it
>>
>>102236243
I'm not City96. You're just obviously obnoxious. You're also an obvious retard. You're why open source contributors may not want to make the effort to share anything. You're a pile of shit and your conspiracy theory to explain why people may not agree with your tardiness is enough to show how much of a waste of cum you were.
>>
>>102236418
>tow
>>
File: 2024-09-05_00054_.png (1.05 MB, 1280x720)
owow .. "tow" animals .. this is a problem for T5 .. fix that, maybe it works then

also I would start with a rough description first like
>This is a high-resolution photograph of two animals, a chicken and a pig having breakfast.
then go into details .. but it did not work for me either, probably because 90% of the prompt is describing the plate and no mention of the animals again but the very front
>>
>>102236337
He's the one throwing melties and attacking other repo owners for imagined slights. Then coming here and getting upset that people are laughing at him for it. Illya even implemented his code into forge and in exchange city threw an autistic spazz over nothing:
https://github.com/lllyasviel/stable-diffusion-webui-forge/issues/1503
>>
File: fs_0158.jpg (83 KB, 728x728)
>>
File: 00119-318994461.png (1.03 MB, 744x888)
>>
>>102236439
>post is not quoting anyone
>I'M NOT CITY96!! STOP ACCUSING ME OF BEING CITY96!!!
uh, ok... sus
>>
>>102236439
>You can't criticize anyone or they will never contribute anything to open source ever again!
I recognize this must be difficult in your estrogen heightened emotional state to comprehend, but normal, functional people don't completely shut down over comments on 4chan. Yikes.
>>
File: fs_0162.jpg (198 KB, 1280x1280)
>>
File: 2024-09-05_00057_.png (1.08 MB, 1280x720)
>>102236418
This is a high-resolution photograph of two animals, a chicken and a pig having breakfast. The chicken is a hen sitting next to he plate on the table looking shocked. The pig is large and pink and seems to be hungry. The pig leans over the plate. They sit at a table and between them is a plate a neatly arranged breakfast plate. The plate is round, white, and placed on a light gray wooden surface with subtle grain texture. On the plate, there are two sunny-side-up eggs with bright yellow yolks and slightly crispy edges, sprinkled with a light dusting of black pepper. To the left of the eggs there's a chicken. There are three slices of crispy, golden-brown bacon, adding a rich, savory contrast to the dish. To the right of the bacon, there's a pig. There are several small, round cherry tomatoes, charred and slightly blistered, adding a vibrant red color and a hint of sweetness. Fresh, green parsley leaves are scattered around the plate, adding a pop of color and freshness. To the bottom right of the plate, there is a fork with a sleek, modern design, its handle made of metal with a wooden grip. 

In the background, there is a clear glass mug filled with a dark brown liquid, likely coffee, and a wooden pepper grinder, adding a rustic touch to the scene. The overall composition is well-balanced, with a focus on the vibrant colors and textures of the food, making the plate look appetizing and inviting.

is that what you wanted?
>>
>>102236243
>Insult Illyasviel
>No one cares
>Insult Comfy
>No one cares
>Insult City96
>NOOO YOU ARE PREVENTING OPEN SOURCE ANON YOU ARE A BIG MEANIE !
Yup, there is no way it's not a troll baiting if he's not samefagging, kek
>>
File: fs_0168.jpg (244 KB, 1280x1280)
>>
>>102236530
I just saw this, not the rest you're mentioning.
>>
>>102236478
oh wow I didn't know about this. I appreciate gguf but that is pretty cringe behavior, especially how he tries to get an autistic 'last word' in after Illyasviel responds so calmly.
>>
>>102236522
Yes! Nailed it, thanks.
>>
>>102236560
Also, see why I don't write prompts myself and rely on joycaption? When I try to edit a prompt I can't even spell 2!
>>
>>102236555
Yeah, after reading that I don't really have any respect left for City anymore. I can still respect his contribution, but him as a person is clowntier.
>>
look in the last thread, all the posts defending city were deleted. it's definitely a troll. I'm pretty sure we can all agree that xe's melty was hilarious no matter if we use gguf or not
>>
File: 2024-09-05_00061_.jpg (645 KB, 3840x2160)
>>102236560
you are welcome
>>102236573
kek, happens.. also tow is a legit English word, so spellcheck won't spot it
>>
>>102235660
I like this
>>
File: 00121-2635594846.png (812 KB, 744x888)
>>
>>102236478
tl;dr version?
>>
File: 00218-862505742.png (1.92 MB, 1024x1440)
>a toast ... to /ldg/
>>
>>102236538
stone age KINGS....
>>
File: ComfyUI_Flux_12455.jpg (244 KB, 672x1504)
>>
>>102236478
damn.. now I am doubly glad I have a 4090 and don't have to use that troublemaker's code .. especially after it also seems to be broken >>102235895
>>
File: reallife.png (970 KB, 1280x768)
And with this magic I can finally have a real live version of my joke!
>>102234281
Well, sorta.
Just ran out of Flux Dev quotas, moving to schnell... I should remember to use schnell unless I'm drawing text as it gives 7 times as many quotas...
>>
File: ComfyUI_00260_.png (1.52 MB, 1024x1280)
>>
File: 00319-412381868.jpg (1.09 MB, 1620x2160)
>>
>>102236692
typical /g/ poster
>>
>>102236703
thanks it's a self portrait
>>
>>102236478
kek illyasviel replies so perfectly that it almost makes me want to switch back to forge. maybe one day..
>>
File: 2024-09-05_00072_.png (1.22 MB, 1280x720)
>>102236674
>Well, sorta.
the prompt is very long.. probably still too long for T5 to follow.. I added
>Below the image is a large caption text in bold capital white letters: "What are you having for breakfast?".
and it forgets grammar half the way.. still a very funny result
>>
I don't even care that it's NSFW, his images are just super boring and ugly. It'd be an insult to my eyes even if this was a red board
>>
>>102236778
I've noticed a lot of the text output is very jeety when it fucks up. whatever decides the text in flux was absolutely reviewed by third party jeet labour, once you start noticing it, it's impossible to ignore
>>
>>102236753
>>102236768
sir, /h/ is in another castle.
>>
>>102236809
text works nearly perfectly when the prompt isn't too long.. I would remove a lot of the flavor text in your prompt to slim it down
>>
>>102236821
he's a regular spammer here, report and ignore. he literally thinks the letterboxing and artifacts on his images protect him from some nefarious autojanny out to hunt him down, lol
>>
File: 00338-2000385030.jpg (865 KB, 1620x2160)
>what is it about fat ugly guys fucking cute girls?
>>
File: ComfyUI_00263_.png (1.34 MB, 1024x1280)
>>
>>102236832
I deserve a hot woman. She does not deserve a hot man. Simple
>>
File: fs_0174.jpg (150 KB, 1280x1280)
>>
File: full.png (36 KB, 908x902)
Why do all the stylized image hands in flux look like they were trained on the finger crystal meme art?
>>
File: 2024-09-05_00078_.png (1.2 MB, 1280x720)
>>102236674
>>102236809
I changed your prompt to
This is a photograph of two animals, a chicken and a pig having breakfast.  The chicken is a hen sitting next to he plate on the table. The pig is large and pink. The pig leans over the plate. They sit on a table and between them is a plate a neatly arranged breakfast plate.

Below the image is a large caption text in bold capital white letters: "WHAT ARE YOU HAVING FOR BREAKFAST?".

The plate is round, white, and placed on a light gray wooden surface with subtle grain texture. On the plate, there are two sunny-side-up eggs with slightly crispy edges, sprinkled with black pepper. To the left of the eggs there's a chicken. There are three slices of crispy, golden-brown bacon. To the right of the bacon, there's a pig. There are several small, round cherry tomatoes. Fresh, green parsley leaves are scattered around the plate. To the bottom right of the plate, there is a fork with a sleek, modern design, its handle made of metal with a wooden grip.

In the background, there is a clear glass mug filled with a dark brown liquid and a wooden pepper grinder. The overall composition is well-balanced, with a focus on the vibrant colors and textures of the food, making the plate look appetizing and inviting.

removing a lot of the joycaption flavor text .. you don't need to describe to FLUX for every food item how savory and delicious it is.. it's a neat breakfast with bacon, egg, tomato, pepper and parsley.. then it has enough braincells left to care for the caption
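
The manual trimming above could be roughed out in code like this. It is a naive sketch: the list of "flavor" verbs is an assumption picked from the captions in this thread, the regex will happily eat a clause that actually mattered, and word count is only a crude proxy for however many tokens the text encoder ends up seeing.

```python
import re

# Trailing participle clauses such as ", adding a rich, savory contrast to the dish".
# The verb list is a guess based on the captions posted here.
FLAVOR = re.compile(r",\s*(?:adding|making|giving|enhancing)\b[^.]*")


def slim_prompt(prompt: str) -> str:
    """Drop flavor clauses, keeping the factual part of each sentence."""
    return FLAVOR.sub("", prompt)


def rough_length(prompt: str) -> int:
    """Whitespace word count, a crude stand-in for token count."""
    return len(prompt.split())


if __name__ == "__main__":
    caption = (
        "There are three slices of crispy, golden-brown bacon, "
        "adding a rich, savory contrast to the dish. "
        "Fresh, green parsley leaves are scattered around the plate, "
        "adding a pop of color and freshness."
    )
    slim = slim_prompt(caption)
    print(rough_length(caption), "->", rough_length(slim))
    print(slim)
```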
>>
File: ComfyUI_00265_.png (1.33 MB, 1024x1280)
>>
>>102236875
I'm the second poster and I have nothing to do with the other guy you're quoting. I was mentioning what I've noticed over time using flux and it turning into jeet text
>>102236869
>>102236849
fuck I can't unsee it
>>
File: 2024-09-05_00080_.png (1.25 MB, 1280x720)
>>102236894
>I'm the second poster
ow sorry.. and ya.. you should prompt concisely and logically in flux, otherwise it gets lost in flavor text
>>
>>102236875
the mini chicken and pig are really cute in this one desu
>>
>>102236894
>The pic
Kek'd, it really does look like that..
>>
>>102236674
>quotas refilled
>submits to schnell
>No available GPU for you after 60 seconds
>Tries again
>YOU RAN OUT OF QUOTAS, WAIT 15 MINUTES
What did they mean when they called it the democratization of AI?
>>
>>102236936
Nta but I'm still assmad paying for a hf sub doesn't change the quota problem. You can just clear cookies or use a different browser and it resets if you aren't logged in though, I think. at least I'm pretty sure it's not based on ip, weirdly
>>
>>102236936
they are paying replicate for your gens.. we can be happy they even offer the service
>>
File: 2024-09-05_00083_.jpg (738 KB, 3840x2160)
>>
File: 00012-3659008742.jpg (752 KB, 1344x1728)
>>
>>102236956
Technically the repo owner pays HF for the service via a monthly sub. Having said sub doesn't stop you from getting hit with the same limits everyone else gets, though. It's literally better to not login and just proxy chain. It's kind of shitty.
>>
File: 00364-1540613866.jpg (951 KB, 2160x1620)
>>102236969
amazing gen, flux?
>>
File: ComfyUI_00267_.png (1.58 MB, 1024x1280)
>>
>>102236955
>Nta but I'm still assmad paying for a hf sub doesn't change the quota problem
I think you're supposed to duplicate the space, which is free, in your new space the quotas will have reset, that's why there's many cloned versions of the main spaces, because, once they are created you can set them to public and allow other people to use them.
>>
>>102237000
I still get slapped with the limits even on my cloned repos.. unless you mean you hit limit > clone > hit limit > clone again infinitely ?
>>
>>102236990
>flux1_d_Q8_0
ty
>>
>>102236887
baaabhabhiat pls gurl open bob i am very lover com com i take u home
>>
>>102236973
>It's literally better to not login and just proxy chain
At least you can do that. neural.love has the best online outpaint/uncrop on the web, but they only allow 5 per account, and no matter what I've tried (VPN, different browser, different everything), they know it's me and new accounts start at 0 credits.
It's impressive they found a way to fingerprint users, I just hope they don't tell anybody how they do it.
I think the secret is to create many accounts and get 5 credits on each of them before you start using them, though no matter how many you make, eventually you will run out of accounts and credits.
>>
File: 00014-3659008744.jpg (694 KB, 1344x1728)
694 KB
694 KB JPG
>>
>>102237010
>unless you mean you hit limit > clone > hit limit > clone again infinitely ?
In theory. I have yet to hear from someone who has tried that.
But if it works the first time, then yes, each new space resets the quotas for it.
>>
>>102237037
so great, gj dude
>>
>>102237031
HW-based fingerprints like the ones from webgl exist
>>
File: ComfyUI_13638_.jpg (610 KB, 1024x1024)
610 KB
610 KB JPG
>>102236243
>imagined slights
I'll add the chatgpt commit to the issue just for you kek. He was being petty as shit.
>coming here and getting upset
I'll always attach my name and waifu gens to my shitty opinions.
Anyone defending my autistic spergout should find something better to do.
>Illya even implemented his code into forge
And replaced the license bit by bit with AGPL to make sure any improvements he makes can't be used with comfy, hence why we're stuck with my shitty cobbled together code.
>>
File: 000000_17306_.png (2 MB, 1032x1508)
2 MB
2 MB PNG
>>
>>102236478
>>102237077
Wrong post, was meant for the one that links to the issue.
>>
>>102237077
I talked to the very same guy who brought it up here a few days ago. used "meltie" that time as well, kek. and he referred to me as "chitty" because I disagreed with him, which was entertaining
>>
File: schnell.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
And I'm back! Um... yeah, schnell is kind of not the same thing... or maybe it's CLIP interrogator's prompts. In any case, with a quantity over quality thing, I guess I'll just wait.
Also, quotas seem to work in a weird way: if you generate slowly, you seem to have lots of quota, but if you generate fast and go over what they allow, they make you wait for a refill. So there's a rate at which you can generate images and never run out of quota, which produces more than running out and having to wait. They're weird.
a screenshot of a house with lots of furniture, fashion gameplay screenshot, in game screenshot, interior gameplay screenshot, in-game screenshot, in - game screenshot, screenshot from the game, photograph of 3d ios room, lots of decoration and furniture, screenshot, gameplay screenshot with ui, a screenshot, home page screenshot, official screenshot, treasure room, game screenshot
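The quota behavior described above (slow genning never runs dry, fast genning gets you a forced wait) looks a lot like a token bucket with a lockout penalty. Here's a rough sketch of that idea. Nobody outside HF knows how the real quota works, so every number here (5 requests of burst, one request regenerating per 60 seconds, a 15 minute lockout) is an assumption picked for illustration, and the class and function names are made up.

```python
class QuotaBucket:
    """Hypothetical quota model: a token bucket plus a lockout when you overdraw it."""

    def __init__(self, capacity: int, seconds_per_token: int, lockout: int):
        # Credit is tracked in whole seconds so the arithmetic stays exact.
        self.cost = seconds_per_token
        self.capacity_units = capacity * seconds_per_token
        self.units = self.capacity_units  # start with a full bucket
        self.lockout = lockout
        self.locked_until = 0
        self.last = 0

    def try_request(self, now: int) -> bool:
        if now < self.locked_until:
            return False  # still serving the penalty wait
        # Refill by the time elapsed since the last processed request.
        self.units = min(self.capacity_units, self.units + (now - self.last))
        self.last = now
        if self.units >= self.cost:
            self.units -= self.cost
            return True
        # Overdrawn: start the penalty wait.
        self.locked_until = now + self.lockout
        return False


def successful_gens(interval: int, duration: int) -> int:
    """Count accepted requests when asking once every `interval` seconds."""
    bucket = QuotaBucket(capacity=5, seconds_per_token=60, lockout=900)
    return sum(bucket.try_request(t) for t in range(0, duration, interval))


if __name__ == "__main__":
    print("one request per 60s for an hour:", successful_gens(60, 3600))
    print("one request per 10s for an hour:", successful_gens(10, 3600))
```

Under these made-up numbers, pacing yourself at the refill rate gets every request through, while hammering the button burns the burst allowance and then spends most of the hour locked out.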
>>
File: 0.jpg (299 KB, 1024x1024)
299 KB
299 KB JPG
>>
File: 00122-574775698.png (701 KB, 616x808)
701 KB
701 KB PNG
>>
File: 00381-1540613867.jpg (990 KB, 1620x2160)
990 KB
990 KB JPG
>>
File: 2024-09-05_00089_.jpg (667 KB, 3840x2160)
667 KB
667 KB JPG
>>
>>102237076
It's weird they're the only ones I've seen using them, though. It's as if all the other sites didn't mind people creating multiple accounts and were actually hoping users would get fed up with it and just buy an account.
Also, did we lose the war or something? How come there's not a way to fake a different HW-based signature? It was the first time I was defeated, and I even considered buying an account, though, I would have bought an account for Jasper Outcrop out of spite, anyway.
But then I found out about Adobe Firefly's outpainting, and it was decent at 25 outpaints per month per account. I still can't believe how close I was to spending money on a hobby I'm supposedly doing for fun, though.
>>
File: 2024-09-05_00095_.jpg (610 KB, 3840x2160)
610 KB
610 KB JPG
>>
File: 1725499564180.jpg (105 KB, 1080x630)
105 KB
105 KB JPG
>>102237077
>yfw city actually is samefag defending himself on here
awkward when the schizos are right
>>
File: 1725499564320.jpg (120 KB, 1024x1024)
120 KB
120 KB JPG
>>
>>102237077
Why does changing or adding words to the prompt with a lora loaded make the generated image worse each time? To fix it you need to turn off the lora, generate one image, then turn it back on.
>>
File: 00004-1540613865.jpg (541 KB, 1150x1510)
541 KB
541 KB JPG
>>
>>102237204
Actually kind of hilarious he ignored the posts from his users experiencing this issue and focused on avatarfagging and stroking his ego. Kind of proved them right.
>>
File: 1721116111545.jpg (214 KB, 1024x1024)
214 KB
214 KB JPG
>tfw noticed my pip cache is 48GB
>>
File: ComfyUI_13644_.png (268 KB, 768x768)
268 KB
268 KB PNG
>>102237189
I was gonna sperg at him about the >implementing your code part but deleted it kek. I'm starting to think people don't give a shit about software licenses.
>>102237204
I was trying to reproduce that hence the random shit I kept adding to the image but it works for me with both the unet and T5 being the gguf loader, probably because it reloads the T5 each time since I don't have enough vram for both. Can you post your workflow?
>>
File: 00102-1826007368.png (998 KB, 744x888)
998 KB
998 KB PNG
>>102237205
amusing
>>
File: 2024-09-05_00097_.jpg (553 KB, 3840x2160)
553 KB
553 KB JPG
>>102237243
yaaa.. python has no sanity when it comes to cache and multiple version installs.. reminds me I need to delete python 3.10 .. probably hogging ~30GB on my ssd
>>
>>102237243
73.35GB on my install.. fuck it
>>
>>102237243
>>102237286
its time anons:
>pip cache purge
>>
>>102237244
>I'm starting to think people don't give a shit about software licenses.
Is this as close as you get to a moment of self awareness that everyone else is cringing at your hyperfixation against illya's license used
>>
File: 1696510496382182.gif (786 KB, 153x165)
786 KB
786 KB GIF
>>102235413
Newfag here. Do you guys mind if I ask a question regarding AI art training? I want to know more about its legality before I go further.
>>
>>102237305
obviously.. if there is no lawyer involved even big companies give a shit about licenses, and the common man is a pirate anyway
>>
>>102237243
>>102237286
wtf I'm scared to check mine now... I might be better off purging and not looking for my sanity.
>>
>>102237321
do it
>pip cache info
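If you'd rather script the check than eyeball it, here's a small sketch that asks pip where its cache lives and adds up the file sizes. `pip cache dir` is a real pip subcommand; the helper names (`dir_size_bytes`, `human`, `pip_cache_dir`) are just made up for this example. It only reads, it never purges anything.

```python
import subprocess
import sys
from pathlib import Path
from typing import Optional


def dir_size_bytes(root: Path) -> int:
    """Sum the sizes of all regular files under root (0 if root is missing)."""
    if not root.is_dir():
        return 0
    total = 0
    for path in root.rglob("*"):
        try:
            if path.is_file() and not path.is_symlink():
                total += path.stat().st_size
        except OSError:
            pass  # file vanished or is unreadable; skip it
    return total


def human(n: float) -> str:
    """Format a byte count using 1024-based units."""
    for unit in ("B", "KB", "MB", "GB"):
        if n < 1024:
            return f"{n:.2f} {unit}"
        n /= 1024
    return f"{n:.2f} TB"


def pip_cache_dir() -> Optional[Path]:
    """Return pip's cache directory, or None if pip errors out (e.g. cache disabled)."""
    result = subprocess.run(
        [sys.executable, "-m", "pip", "cache", "dir"],
        capture_output=True,
        text=True,
    )
    if result.returncode != 0:
        return None
    return Path(result.stdout.strip())


if __name__ == "__main__":
    cache = pip_cache_dir()
    if cache is None:
        print("pip cache unavailable (disabled or pip not found)")
    else:
        print(f"{cache}: {human(dir_size_bytes(cache))}")
```

Run it with the same Python your UI's venv uses, since each interpreter can report a different cache location.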
>>
>>102237313
No legal issues unless you're making cp/based on real children/using it for blackmail
>>
>>102237161
>>102237031
can't they just use your MAC address?
try VMs
>>
>>102237330
The thing is, according to my quick research, apparently anything uploaded online is automatically the IP of whoever uploaded it. For example, if an artist uploads their shit onto Instagram, that IP is automatically theirs unless otherwise stated, if my sources are correct. Does this mean someone using that art to train a base model is technically copyright infringement? Or am I misunderstanding something?
>>
File: 00080-2737269228.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
>>102237243
>10.5 gbs
what do??
>>
>>102237343
>Does this mean someone using that art to train a base model is technically copyright infringement?
No
>Or am I misunderstanding something?
Yes, that nobody cares
>>
>>102237343
unless it's 1:1 duplication output it's not copyright infringement and sits under some fan art editing grey area. though realistically if someone was getting sued for this first it's a big corpo like oai, not some random nobody making a lora. the literal millions of models online are not all trained with "ethically sourced consensual images"
>>
>>102237356
just purge.. its just for reinstalls
>>
>>102237343
1. train locally
2. dont post it online
3. ???
4. profit!
>>
>>102237343
scraping isn't copyright infringement, that is already case law
AI law is going to settle on already established copyright interpretation, copyright is based on the final output. An artist being inspired by Disney artwork is not committing copyright infringement making an "inspired" painting in the Disney style.
>>
>>102235807
AnimagineXL3.1 is still SOTA
>>
>>102237244
What unet model are you using?
>>
File: ComfyUI_00273_.png (1.88 MB, 1024x1280)
1.88 MB
1.88 MB PNG
>>
File: Final.png (528 KB, 854x593)
528 KB
528 KB PNG
>>102236875
Thanks, when I tried that, it didn't give me any text :( - so I just cheated, mirrored the image, and added the caption in post-production. This is the final version, at last!
This is a photograph of two animals, a chicken and a pig having breakfast each with their own plate. The chicken is a hen sitting next to the plate with bacon on the table. The pig is large and pink. The pig leans over the plate with fried eggs They sit on a table and the plates have neatly arranged breakfast. Below the image is a large movie yellow subtitle caption text in bold capital letters: "- WHAT ARE YOU HAVING FOR BREAKFAST?" and below it says "- YOUR MOM." The plate are round, white, and placed on a light gray wooden surface with subtle grain texture. On the plate plate on the right, there are two sunny-side-up eggs with slightly crispy edges, sprinkled with black pepper. On the plate of the right there are three slices of crispy, golden-brown bacon. There are several small, round cherry tomatoes. Fresh, green parsley leaves are scattered around the plates. To the bottom right, there is a fork with a sleek, modern design, its handle made of metal with a wooden grip. In the background, there is a clear glass mug filled with a dark brown liquid and a wooden pepper grinder. The overall composition is well-balanced, with a focus on the vibrant colors and textures of the food, making the plate look appetizing and inviting.
>>
>>102237334
Do they get to see that through a VPN?
>>
>>102237329
is this just base flux or also a lora?
>>
File: 2024-09-05_00099_.jpg (394 KB, 3840x2160)
394 KB
394 KB JPG
>>102237465
awesome! I also love the peppermill tea cup
>>
>>102237343
1) What you do locally will never be known unless you share it.
2) Simply "feeding" the model a picture isn't an act of copyright infringement
but most importantly
3) If you think you're doing copyright infringement and posting generated work under the guise of someone else's art, what are they going to do about it.. (if they even know)? Most artists are too poor to sue.
>>
>>102237434
Q2_K_S with Q3_K_S T5 to fit it on 10GBs lol. One thing I noticed is that it completely dies and leaves the T5 model on the CPU if I force cuda-devices=1 and disable cuda malloc, though that sounds like a different issue.
>>
>>102237370
Midjourney just bypassed all that, they trained a model with copyrighted material, then they used that model to produce new pictures that, according to law, hold no copyright protection because they were generated by an AI, and then they trained the AI people get to use with those.
So much talk about model collapse and how training with AI pics is bad, and yet before flux Midjourney was the most beloved model outranking Dalle 3 in blind tests.
>>
File: 000000_17312_.png (2.13 MB, 1032x1508)
2.13 MB
2.13 MB PNG
>>
File: Final1280.png (1010 KB, 1280x890)
1010 KB
1010 KB PNG
>>102237504
Thanks! So much for the final version, as I uploaded a downscaled one for some reason! XDD Here's the correct one!
>>
File: pegasus.png (2.87 MB, 1568x1568)
2.87 MB
2.87 MB PNG
>>
>>102237571
So are you color blind and don't notice this is a blue board?
>>
File: 1.jpg (168 KB, 1500x927)
168 KB
168 KB JPG
>>102237526
This is the workflow, if I replace "unet loader gguf" with "load diffusion model" it works fine. Does not matter what model I put in the "unet loader gguf", if the prompt is modified with the same lora loaded the image becomes more blurry each gen
>>
>>102237612
Sdg tranny trying to nuke this thread
>>
File: fs_0180.jpg (122 KB, 1280x944)
122 KB
122 KB JPG
>>
>>102237621
>the image becomes more blurry each gen
i get that in forge when using t5-v1_1-xxl-encoder-f16.gguf, but that was a couple of weeks ago, so i dont know if it's been fixed in forge
>dont ask
>>
File: 00010-1184624313.png (1.07 MB, 712x936)
1.07 MB
1.07 MB PNG
made, mostly, right again. img2img is fun.
>>
>>102237657
what's your workflow?
>>
File: fs_0182.jpg (209 KB, 1920x1280)
209 KB
209 KB JPG
>>
>>102237666
i dont use comfyui, i use forge, so no workflow, unless workflow means something more than whatever
>>
is this the avatar discord?
>>
>>102237670
I like this
>>
>>102237686
what if i told you it always was
>>
>>102237621
The rgthree loader calls the regular LoRA loader internally so it should be fine.
Are you using the Q8 and fp16 T5 with lowvram/swapping them between gens or do they both fit into VRAM? I assume the former.
>>
>>102237683
ah i use forge too, so just img2img with the same prompt or what?
>>
>>102237079
how does one drink from a cup with no lips
>>
>>102237716
They fit (I think)

Requested to load FluxClipModel_
Loading 1 new model
loaded completely 0.0 9319.23095703125 True
Requested to load Flux
Loading 1 new model
loaded partially 4801.2591796875 4800.8555908203125 0
Attempting to release mmap (233)

...


got prompt
Unloading models for lowram load.
0 models unloaded.

Is this normal? There should be a message to confirm the release of mmap?
>>
File: miss.png (458 KB, 1280x256)
458 KB
458 KB PNG
Genning at 1280x256 by mistake...

This image is a detailed digital illustration of a three-story apartment with a whimsical, cartoonish style. The apartment is situated in a large, multi-colored building with green and blue vertical stripes. The top floor features a spacious bedroom with a large bed, a wooden nightstand, and a small table with a lamp. The middle floor has a kitchen with a green and white tiled floor, a small dining table, and a stove. The bottom floor includes a living room with a couch, a coffee table, and a large potted plant. The basement level contains a laundry room with a washing machine and a dryer, a small desk, and a bookshelf. The apartment is cluttered with various items, including a magnifying glass, a globe, a pair of sunglasses, and a broom. The background features a lush green lawn with a small pond and a few trees. The bottom right corner displays a cartoon character in a red and white outfit holding a sign reading "SUNGLASSES." The top of the image shows a menu bar with options such as "CLOSET," "CHECK," and "CHROME." The overall style is vibrant and playful, with a focus on detail and a sense of depth.
>>
File: fs_0189.jpg (558 KB, 2560x1696)
558 KB
558 KB JPG
>>
>>102237761
>Is this normal? There should be a message to confirm the release of mmap?
That part is normal when lowvram is triggered, yeah. Without that it was using 2x system RAM.
Looks like T5 fits but Q8_0 loads in lowvram mode ("loaded partially", probably due to being larger + needing headroom for inference). Will test mixing T5 + quantized unet like that.
>>
File: full.png (1.23 MB, 1280x768)
1.23 MB
1.23 MB PNG
>>102237769
And at 768 height.
Somehow I feel that genning at 256 height and then outpainting the bottom and top would have given better results...
>>
I used q5 and q6 and same problem with the lora

>>102237810
What about this when starting it?

custom_nodes\ComfyUI-GGUF\nodes.py:79: UserWarning: The given NumPy array is not writable, and PyTorch does not support non-writable tensors. This means writing to this tensor will result in undefined behavior. You may want to copy the array to protect its data or make it writable before converting it to a tensor. This type of warning will be suppressed for the rest of this program. (Triggered internally at ..\torch\csrc\utils\tensor_numpy.cpp:212.)
torch_tensor = torch.from_numpy(tensor.data) # mmap

ggml_sd_loader:
GGMLQuantizationType.F16 476
GGMLQuantizationType.Q8_0 304
model weight dtype torch.float16, manual cast: None
model_type FLUX
>>
>>102237396
>>102237505
>>102237358
>>102237370
>>102237330
>>102237343
Newfag still here. Would there be an issue if I trained a Lora but decided to post the dataset used on a site like Civitai or huggingface?
>>
File: 1717429281384.jpg (463 KB, 1024x1024)
463 KB
463 KB JPG
>>
>>102237833
Why post the dataset?
>>
>>102237830
>UserWarning
That one's fine, we copy it later on (that's what the releasing mmap message is about lol)
>model weight dtype torch.float16, manual cast: None
On RTX 30XX and up it's bfloat16 by default and fp16 has a workaround, but it runs on pascal so probably not it either.
>>
>>102237799
Pretty
>>
File: 1711200078718.jpg (1.43 MB, 1024x1024)
1.43 MB
1.43 MB JPG
>>
File: 1725504299233.jpg (157 KB, 1024x1024)
157 KB
157 KB JPG
>This image is a close-up photograph of a collection of seashells and starfish arranged in a random, overlapping pattern. The shells and starfish are predominantly in shades of pastel pink, cream, and white, creating a soft, serene, and delicate aesthetic. The seashells are a mix of scallop shells, conch shells, and spiral shells, each with a glossy, smooth texture that reflects light subtly. The starfish are of various sizes and shapes, some with intricate patterns and others with simpler, rounded features. The background is a soft, white fabric, likely a satin or silk material, which provides a subtle contrast to the pale colors of the seashells and starfish. The overall composition is rich in texture, with the glossy, smooth surfaces of the seashells and starfish contrasting with the soft, silky texture of the fabric. The image evokes a sense of tranquility and marine beauty, ideal for a beach-themed decor or a coastal-inspired setting.
>>
File: fs_0193.jpg (570 KB, 2560x1696)
570 KB
570 KB JPG
added the word 'swarovski', guess you mean the moon is a giant diamond now, k
>>
>>102237878
Looking at output folder, it started messing up the gen with prompt change on September 2, so maybe an update from then
>>
File: maphouse.png (971 KB, 1280x768)
971 KB
971 KB PNG
This is a high-resolution digital screenshot of a computer game, likely a simulation or management game, showcasing a detailed, 3D-rendered scene of a luxurious mansion. The mansion, situated in a lush green lawn, is a large, two-story building with a red tiled roof, white walls, and ornate architectural features, including a balcony and columns. The mansion is surrounded by a neatly manicured garden with vibrant flowers, and a pathway leads up to its front entrance. The image is presented from an overhead perspective, allowing for a clear view of the mansion's surroundings. To the left, there is a partially constructed wooden tower, suggesting ongoing construction or maintenance work. To the right, there is a rocky cliff with a small waterfall, adding a natural element to the scene. The game interface is visible at the top of the image, displaying various statistics and controls. The interface includes a large number counter, "2863," indicating the game's progress, and a gold coin symbol, suggesting a currency system. There are also two green buttons labeled "+150" and "+225," possibly for adding resources or upgrading the mansion. The game controls are also visible, including a play button and a pause button. The overall style is vibrant and detailed, indicative of modern gaming aesthetics
>>
File: 1702364411702.jpg (382 KB, 1024x1024)
382 KB
382 KB JPG
>>
File: 1725504736214.jpg (135 KB, 1024x1024)
135 KB
135 KB JPG
>This is a high-resolution photograph of an ice formation, likely taken outdoors during winter. The image showcases a dense cluster of icicles hanging vertically from a dark, rocky surface. The icicles vary in color, ranging from a clear, translucent blue to a pastel pink and a soft orange. The colors are vibrant but subtle, creating a mesmerizing visual effect. The icicles are thick and appear to be formed from layers of frozen water, with some showing a slightly rough texture where the ice has been formed around twigs or small rocks. Snow covers the ground and the base of the icicles, adding a layer of white to the scene. The snow is fluffy and pristine, contrasting with the dark background. The overall atmosphere of the image is cold and serene, capturing the beauty of natural ice formations. The photograph emphasizes the intricate details of the ice, with light reflecting off the surfaces, creating a glistening effect. The colors and textures of the ice and snow are vividly captured, making the image both visually striking and scientifically informative.
>>
>>102237871
Why not?
>>
>>102237933
The commit before that for the gguf repo would be 69f0daf but that has the issues where lowvram LoRAs are only partially loaded, so those wouldn't be reproducible 1:1 on current version.
>>
>>102238006
commit 3bad1b4 seems to be working fine, I'll check 929d154 too
>>
File: 1711747022625.jpg (955 KB, 1024x1024)
955 KB
955 KB JPG
>>
File: KOLORS.png (1.72 MB, 1024x1024)
1.72 MB
1.72 MB PNG
>>102237908
Kolors with just this part:
>This image is a close-up photograph of a collection of seashells and starfish arranged in a random, overlapping pattern. The shells and starfish are predominantly in shades of pastel pink, cream, and white, creating a soft, serene, and delicate aesthetic.
>>
File: 1696263183823.jpg (816 KB, 1024x1024)
816 KB
816 KB JPG
>>102237937
>>
File: 1721373930001.jpg (484 KB, 1024x1024)
484 KB
484 KB JPG
>>
>>102238028
I like the colors (no pun intended) of kolors more but 4 step schnell had less deformations I think
>>
File: 0.jpg (38 KB, 1024x1024)
38 KB
38 KB JPG
>>
>>102238022
commit 929d154 has the bug

commit df3378d has the bug

testing commit fa1671d

commit e19fe8e works fine
>>
>>102238157
Ah, managed to reproduce it. Yea it's fucked. It's like the same patches (LoRA weights) get applied over and over again for the lowvram weights.
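For anyone wondering why "the same patches get applied over and over" would make each gen look worse: here's a toy illustration, not the actual ComfyUI-GGUF code. The class and method names are invented for this sketch. If a patch is added onto the already-patched weights every time the model is re-prepared, the delta stacks up; if it's always computed from an untouched backup, it doesn't.

```python
def apply_patch(weights, delta, strength=1.0):
    """Return weights + strength * delta, element by element."""
    return [w + strength * d for w, d in zip(weights, delta)]


class PatchedWeight:
    """Toy stand-in for one weight tensor with a LoRA-style delta."""

    def __init__(self, base):
        self.base = list(base)      # pristine copy, never modified
        self.current = list(base)   # what inference would actually use

    def patch_in_place(self, delta):
        # Buggy pattern: patches whatever is current, so repeated calls accumulate.
        self.current = apply_patch(self.current, delta)

    def patch_from_base(self, delta):
        # Safe pattern: always start from the pristine weights, so calls are idempotent.
        self.current = apply_patch(self.base, delta)


if __name__ == "__main__":
    delta = [0.5, -0.25]

    buggy = PatchedWeight([1.0, 2.0])
    safe = PatchedWeight([1.0, 2.0])
    for _ in range(3):  # pretend the prompt changed three times
        buggy.patch_in_place(delta)
        safe.patch_from_base(delta)

    print("in place :", buggy.current)
    print("from base:", safe.current)
```

That would also line up with the workaround mentioned earlier in the thread (turn the lora off, gen once, turn it back on), since that presumably forces a fresh start from unpatched weights.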
>>
File: miku_shocked.jpg (183 KB, 640x931)
183 KB
183 KB JPG
>have to keep hardware acceleration off to make sure I don't oom
>new comfy ui lags like hell without acceleration
>queue menu literally freezes for 20+ seconds if there's more than 5 images generated
Seething so hard I accidentally posted in the wrong thread
>>
You know how for sdxl and sd1.5 people made those Loras that'd be shit like "ruby style" then it'd randomly add rubies to parts of the output image, like turning the clothes to rubies, or hair or eyes made of ruby, etc?

How the fuck did they make those? Just a dataset with tons of randomly inpainted images where they inpainted rubies over random sections or?
>>
>>102238189
whats with the red frame?
>>
File: 0.jpg (92 KB, 1024x1024)
92 KB
92 KB JPG
>>
File: 0.jpg (106 KB, 1024x1024)
106 KB
106 KB JPG
>>
File: lol.png (867 KB, 1024x1024)
867 KB
867 KB PNG
>>102238157
commit fa1671d has the bug

commit ba8ce44 has the bug

So whatever was added after e19fe8e caused it
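The commit-by-commit testing above is basically a manual bisect. Git has this built in as `git bisect`, and the idea behind it is a plain binary search, sketched here. The commit labels and the `is_bad` check are placeholders; the real hashes from this thread aren't used because their order in the repo history isn't stated here.

```python
from typing import Callable, Sequence


def first_bad_commit(commits: Sequence[str], is_bad: Callable[[str], bool]) -> str:
    """Find the first bad commit by binary search.

    Assumes commits are ordered oldest to newest, commits[0] is known good,
    commits[-1] is known bad, and the bug never disappears once introduced.
    """
    lo, hi = 0, len(commits) - 1  # lo is always good, hi is always bad
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if is_bad(commits[mid]):
            hi = mid
        else:
            lo = mid
    return commits[hi]


if __name__ == "__main__":
    # Hypothetical history: the bug was introduced in "c5".
    history = ["c0", "c1", "c2", "c3", "c4", "c5", "c6", "c7"]
    tested = []

    def is_bad(commit: str) -> bool:
        tested.append(commit)  # in real life: check out the commit and run a gen
        return history.index(commit) >= 5

    print("first bad:", first_bad_commit(history, is_bad))
    print("commits tested:", tested)
```

With 8 commits between a known good and a known bad one this needs 3 tests instead of checking each commit in turn.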
>>
>>102238260
Thanks, pushed a possible fix if you want to give it a try.
>>
File: 1719574131440.jpg (487 KB, 1024x1024)
487 KB
487 KB JPG
>>
File: 1695099369754.jpg (485 KB, 1024x1024)
485 KB
485 KB JPG
>>
File: 00132-1571426639.png (1.99 MB, 1230x808)
1.99 MB
1.99 MB PNG
https://civitai.com/models/721039/retro-anime-flux-style
on left
https://civitai.com/models/7227?modelVersionId=782696
on right
>>
>>102238245
asshat is evading bans. Report and ignore.
>>
File: ComfyUI_temp_zgmjl_00005_.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>
File: 1717676794928.jpg (507 KB, 1024x1024)
507 KB
507 KB JPG
>>
File: nice.png (983 KB, 1024x1024)
983 KB
983 KB PNG
>>102238315

lllyasviel who? city96 best dev

Thanks, works good now
>>
>>102238426
See, if you're not being a pedo you can share good things.
>>
What are the signs that you've gone too many steps when training a lora? Is it that things like backgrounds are hard baked in and difficult to prompt out? Or is it that anatomy becomes completely fucked? Or does the whole thing just not even function?
>>
>>102238479
things start to fall apart, colors start to burn
>>
File: 00135-4011525463.png (601 KB, 616x808)
601 KB
601 KB PNG
>>102238474
i prefer without the animu filters. thanks.
>>
>>102238470
If I was I would've read the patch logic right the first time around instead of assuming it only runs once kek. Pretty sure the forge codebase isn't this jank.
>>
>>102238514
it is still jank. Just not in a way it looks like 12 year olds barfing up snippets from chatGPT.

It is just really sad how insecure everyone is about the "competition".
>>
>>102238479
all of the above
>stiff, can't prompt things out, starts hallucinating things like accessories, hands and fingers extra fucked, eyes get weird
then eventually it turns into a literal melted mess and blend of colors
then the final stage it is just a block of color vomit
>>
File: donkey.png (1.88 MB, 1024x1024)
1.88 MB
1.88 MB PNG
same epoch, different prompt
>1st prompt, get this:
>>102238441
>next prompt, get this
>pic rel
oh.. oh no...
>>
File: 1713012114430.jpg (198 KB, 1024x1024)
198 KB
198 KB JPG
>>
>>102238541
It's all experimental, but city96 took the time to fix it directly, other githubs have tons of unsolved issues
>>
>>102238673
see
>>102238541
>It is just really sad how insecure everyone is about the "competition".

Why the fuck would it matter at all if other githubs have other issues. I am super happy for city96. He has a successful repository. It doesn't mean his shit is pure as winter snow. I called out some just stupid lack of optimization a few days ago from an accepted commit. He should have caught it. He also works hard and deserves credit for that. If he feels sad he can look at his hundreds of github stars. Fuck off with this white knight shit.
>>
File: r.png (349 KB, 279x504)
349 KB
349 KB PNG
>>102238596
>>
>>102238673
I agree with >>102238701
Dickriding on 4chan just looks like you're trying to instigate shit 99% of the time, even if it's meant well.
>>
>>102238701
>Fuck off with this white knight shit.
legitimately, that anon is out here sucking city's cock like he's getting paid in GBs of vram on his next gpu card
>>
File: 1705070621492640.png (1.45 MB, 852x1199)
1.45 MB
1.45 MB PNG
>Flagged for review
>This image won't be visible to other users until it's reviewed by our moderators.
civitai...
>>
File: 1696424852887.jpg (219 KB, 1024x1024)
219 KB
219 KB JPG
>>
>>102238849
filthy, scandalous, indecent belly button
>>
File: 1725435456.png (807 KB, 1024x1024)
807 KB
807 KB PNG
>>
File: 00140-726041125.png (686 KB, 688x888)
686 KB
686 KB PNG
lot of progress made today,
>>
File: 1719957620682.jpg (195 KB, 1024x1024)
195 KB
195 KB JPG
>>
File: 1725435846.png (773 KB, 1024x1024)
773 KB
773 KB PNG
>>
File: fs_0211.jpg (855 KB, 3072x2048)
855 KB
855 KB JPG
>>
File: 00050-1478667309 copy.png (300 KB, 380x538)
300 KB
300 KB PNG
>>
File: 1724048593610.jpg (465 KB, 1024x1024)
465 KB
465 KB JPG
fuck yeah finally
>>
File: 00004-630435315.png (779 KB, 744x888)
779 KB
779 KB PNG
>>
File: 1695206756026.jpg (479 KB, 1024x1024)
479 KB
479 KB JPG
>>
File: 1717583455650.jpg (499 KB, 1024x1024)
499 KB
499 KB JPG
>>
File: 1705716119794.jpg (429 KB, 1024x1024)
429 KB
429 KB JPG
>>
File: 1709487127584.jpg (423 KB, 1024x1024)
423 KB
423 KB JPG
>>
File: 1713175169373.jpg (326 KB, 1024x1024)
326 KB
326 KB JPG
>>
File: 00030-1996159423 copy.png (185 KB, 482x340)
185 KB
185 KB PNG
lowres https://files.catbox.moe/bcc12t.png
upscale https://files.catbox.moe/m8j8hf.png
lora
https://www.mediafire.com/file/e5oednckssbf7sg/mysterysep04-step00000435.safetensors/file
upscale for increased detail>take to photoshop or something else and then downscale while using nearest neighbor to retain jaggies for best effect
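On the nearest neighbor part: the reason it keeps the jaggies is that each output pixel is copied from a single source pixel instead of being averaged with its neighbors. A tiny pure-Python sketch on a grayscale grid shows the difference. This is a simplified version (integer factors only, top-left pixel of each block) and not what Photoshop does internally; the function names are made up.

```python
def downscale_nearest(pixels, factor):
    """Point-sample: keep one source pixel (top-left) per factor x factor block."""
    if factor < 1:
        raise ValueError("factor must be >= 1")
    return [row[::factor] for row in pixels[::factor]]


def downscale_box(pixels, factor):
    """Average each factor x factor block, which smooths hard edges."""
    if factor < 1:
        raise ValueError("factor must be >= 1")
    height = len(pixels) // factor
    width = len(pixels[0]) // factor
    out = []
    for y in range(height):
        row = []
        for x in range(width):
            block = [
                pixels[y * factor + dy][x * factor + dx]
                for dy in range(factor)
                for dx in range(factor)
            ]
            row.append(sum(block) / len(block))
        out.append(row)
    return out


if __name__ == "__main__":
    # A hard diagonal edge: 0 is black, 255 is white.
    image = [
        [0, 0, 0, 255],
        [0, 0, 255, 255],
        [0, 255, 255, 255],
        [255, 255, 255, 255],
    ]
    print("nearest:", downscale_nearest(image, 2))
    print("box    :", downscale_box(image, 2))
```

The nearest version only ever outputs values that were already in the image, so edges stay hard. The averaged version invents in-between grays along the edge, which is the softening you're trying to avoid for that pixel look.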
>>
File: 1725516967339960.webm (2.52 MB, 1280x720)
2.52 MB
2.52 MB WEBM
>>102235413
>>>/pol/thread/480659309
/pol/ is having a blast with hailuoai
Let's add it to the bread.
>>
File: 1725515838414826.webm (1.79 MB, 1280x720)
1.79 MB
1.79 MB WEBM
>>102239453
>>>>/pol/480659309
>>
File: 1712569108159.jpg (487 KB, 1024x1024)
487 KB
487 KB JPG
nostalgia for 90's dance music videos
>>
>>
File: FLUX_00013_.png (2.21 MB, 1440x1120)
2.21 MB
2.21 MB PNG
hammer time
>>
>>
>>102238474
>nick posts 12 year old girl in you're path
>>
File: ComfyUI_01520_.png (1.13 MB, 768x1360)
1.13 MB
1.13 MB PNG
New fresh bread from the oven
>>102239549
>>102239549
>>102239549
>>102239549
Come grab some bread!
>>
>>102239466
chill with the antisemitism
>>
Made my first lora yesterday, character lora for flux. rank/alpha 32/32, learning rate 0.0004, adafactor, split_mode, 1600 steps and spitting out wips at 400/800/1200 too. trained at 512x512, ~25 fairly high quality high res varied aspect ratio images but no preprocessing beyond whatever bucketing/cleaning the comfyui flux training node does for you automatically.

I assume my dim 32/32 was huge overkill and I should go to 8/8 as a rule of thumb default. Do I need to toggle any dependent variables alongside that? Past that, I assume going to 768 or 1024 training is worth it if my 10GB is enough, especially for e.g. characters with detail, and probably lowering the learning rate or using prodigy if I wanna be lazy (even the 300 steps lora was basically complete, using 0.0004). But I'm guessing the next lowest hanging fruit is a higher quality dataset and preprocessing it more.
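On the rank/alpha question, a quick back-of-the-envelope sketch. In the standard LoRA formulation the update is scaled by alpha / rank, and each adapted layer adds rank * (d_in + d_out) parameters. Whether a particular trainer follows exactly this convention is an assumption, so check its docs. The 3072 x 3072 layer size below is a hypothetical example, not a claim about any specific layer in Flux.

```python
def lora_param_count(d_in: int, d_out: int, rank: int) -> int:
    """Parameters added to one linear layer: A is (rank x d_in), B is (d_out x rank)."""
    return rank * (d_in + d_out)


def lora_scale(alpha: float, rank: int) -> float:
    """Multiplier on the LoRA update in the standard formulation."""
    return alpha / rank


if __name__ == "__main__":
    d_in = d_out = 3072  # hypothetical layer size, for illustration only

    for rank, alpha in [(32, 32), (8, 8), (8, 32)]:
        params = lora_param_count(d_in, d_out, rank)
        scale = lora_scale(alpha, rank)
        print(f"rank={rank:>2} alpha={alpha:>2} params/layer={params} scale={scale}")
```

So going from 32/32 to 8/8 cuts the added parameters per layer to a quarter while keeping the scale at 1.0. If you dropped rank to 8 but left alpha at 32, the scale would jump to 4.0, which acts a bit like raising the learning rate, so alpha is the dependent variable to watch.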
>>
File: 000000_17325_.png (1.87 MB, 1032x1508)
1.87 MB
1.87 MB PNG
>>102237745
>Chuds it back,
>>
>>102237243
>ERROR: pip cache commands can not function since cache is disabled.
like a boss



All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.