/g/ - Technology


File: collaeg.jpg (3.6 MB, 3264x3264)
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102281807

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/pol/uncensored+ai
>>
Blessed thread of frenship
>>
File: 2024-09-09_00027_.png (849 KB, 1280x720)
>>102294975
>>
>>102294975
imggen
>>
>>102295168
spoopy
>>
File: cloud.png (3.95 MB, 1568x1568)
>>
>>102295236
unattended sd15 level of car placement. Don't get me started on the scaling.
>>
sometimes I feel like 'eh is flux really worth it, I could go back to sdxl'... but then I realize I don't really have to inpaint, don't constantly have to fix eyes, etc... and I suddenly snap to my senses and realize I can't go back
>inb4 use both
I refuse
>>
File: ComfyUI_00419_.png (2.61 MB, 1568x1568)
>>
>>102294999
checked.
>>
File: rope.png (3.7 MB, 1568x1568)
>>102295365
>>
>>102295417
if it gets more effort into images I don't care what I have to do.

that noose is the size of the woman. Good work!
>>
File: ComfyUI_00410_.png (2.04 MB, 1568x1568)
>>102295435
why would i waste time editing it for you? also please post gen so the thread doesnt go to shit again
>>
So, what is the meta model now? Flux or still SDXL?
Also, what is the minimum hardware to run flux?
>>
>>102295463
no.

If you would like to care about something besides big number gets bigger I would love your thoughts on doing some sampler testing using something like this:
https://github.com/epistoteles/TensorHue
I am getting real tired of X/Y grids.
>>
File: 00010-0.png (2.23 MB, 2560x1440)
Can we generate videos from text yet? I want to type
>cammy cosplayer with big ass twerking
and get a 10-second clip of exactly what I described
>>
File: 07017-1303742816.png (2.81 MB, 1344x1728)
What's the difference between sdg and ldg?
>>
>>102295705
l
>>
>>102295504

This is exactly why I love modern technology.
>>
>>102295489
NAI
>>
File: 00002-478670983.png (2.8 MB, 2304x1792)
>>
>>102295794
kys
>>
>>102295794
>Own 4080, is good
>considering future
>5080 will still be 16GB
>5090 only gets +4GB over
>7900XTX slower than 4080
>AMD still doesn't have 100% compatibility on this stuff
>SLI/crossfire dead for years
>No double vram solder mods for 4080, 7900 etc
>No 8900 XT/XTX, only lower end cards, flagship cancelled
>Whatever the fuck Intel is doing
>AI models censored and cucked early on
God I fucking hate this timeline
>>
>>102295945
>left: Midjourney, DALL-E 3, Leonardo, Grok, etc.

>right: me with my 3080
>>
>>102295945
Nah it will be
>5600: 12GB and 16GB
>5800: 20GB and 24GB
>5900: 28GB and 32GB
>>
File: ComfyUI_temp_pjtkf_00007_.png (1.73 MB, 1024x1024)
>>102295964
kek forgot image
>>
File: reality.png (1.09 MB, 1024x1024)
>>102295892
then i woke up
>>
>Intel A770
>$280
>16GB GDDR6
Stupidly slow. But clearly both GayMD and Jewvidia could have double vram models for +$100-$200.
>>
File: 1720992851022781.gif (3.09 MB, 633x356)
>>102294975
I feel this is the appropriate place to ask this since AI models are heavily reliant on using images, many, if not most of which are copyrighted:

If reposting someone's art without their permission is technically copyright infringement, then how come websites like rule34, Danbooru, gelbooru, etc, are still around? They essentially amount to piracy sites but specifically tailored towards art and especially smut art, right? How come websites like archive.org and nhentai are gone after and forced to comply with DMCA requests, yet these guys seem to get off scot-free? To be clear, I am not trying to champion copyright trolls or even the concept of copyright in general. I just find it odd that no one seems to care about the aforementioned sites. Maybe it's because normies are hypocritical and wouldn't dare touch their favorite coom sites?
>>
>>102296123
They only go after easy targets.
>>
>>102296160
Define "easy targets"
>>
>>102296167
People that can't afford to have protracted litigations against them
>>
File: 000000_17499_.png (2.29 MB, 1508x1032)
>>
File: 1718358164841941.png (1.46 MB, 1024x1024)
>>
>>102296123
>rule34, Danbooru, gelbooru
fair use. A person isn't going to not watch pokemon because that person saw Misty getting railed by an onix.

>like archive.org
not derivative or adaptive. Actual copies of the thing. I assume the same goes for nhentai.

Although maybe the actual answer: safe harbor stuff makes it hard. You can't sue a site for having copyrighted material. You can sue if they don't take it down (or do it too much). "Most" of the copyright detection is based on hashes. If you alter stuff the hash breaks. There are better things to use, but it is hard. I am sure you have seen videos on youtube that are mirror images of the original.
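
Rough illustration of the hash point, as a sketch only. Nobody outside these sites knows what their matching actually looks like, so everything here is made up for the example: the 4x4 grayscale "image", the toy `average_hash` helper and the `brighten` / `mirror` edits are all assumptions, not how any real detector works.

```python
import hashlib

# Hypothetical 4x4 grayscale "image" (0-255); stands in for a real picture.
IMAGE = [
    [10, 20, 200, 210],
    [15, 25, 205, 215],
    [12, 22, 202, 212],
    [18, 28, 208, 218],
]

def mirror(img):
    """Flip the image left-to-right, like a mirrored re-upload."""
    return [list(reversed(row)) for row in img]

def brighten(img, amount):
    """Small edit: raise every pixel by `amount`, clamped to 255."""
    return [[min(255, p + amount) for p in row] for row in img]

def exact_hash(img):
    """Cryptographic hash of the raw pixels: any change gives a new digest."""
    return hashlib.sha256(bytes(p for row in img for p in row)).hexdigest()

def average_hash(img):
    """Toy perceptual hash: one bit per pixel, set if brighter than the mean."""
    pixels = [p for row in img for p in row]
    mean = sum(pixels) / len(pixels)
    return "".join("1" if p > mean else "0" for p in pixels)

def hamming(a, b):
    """Number of differing bits between two equal-length bit strings."""
    return sum(x != y for x, y in zip(a, b))

brighter = brighten(IMAGE, 5)
flipped = mirror(IMAGE)

# Exact hash: a tiny brightness change already produces a different digest.
print(exact_hash(IMAGE) == exact_hash(brighter))
# Perceptual hash: the same tiny change leaves the bits where they were.
print(hamming(average_hash(IMAGE), average_hash(brighter)))
# Mirroring still defeats this naive perceptual hash...
print(hamming(average_hash(IMAGE), average_hash(flipped)))
# ...unless the checker also hashes the un-mirrored version of the upload.
print(hamming(average_hash(IMAGE), average_hash(mirror(flipped))))
```

So "better things to use" exist in principle, but each one only covers the edits somebody thought to check for.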
>>
>>102296090
>could have double vram models for +$100-$200

probably not. every tech giant in the world is simultaneously competing to buy hundreds of thousands of H100s, if anything it is a bit weird that commercial GPUs are available to hobbyists at any price.
>>
>>102296333
Probably yes. Consumers can buy gddr5/6 for those double vram mods on the cards it works on. That's just a cope/excuse.

The real reason is because anything "enterprise" gets to have a massive markup on it and such cards would cannibalise sales of enterprise cards even though the inflation of prices on consumer is already price gouged and price fixed to fuck.

TLDR: it's all Jewish tricks
>>
File: 000000_17501_.png (2.29 MB, 1508x1032)
>>
Today I was genning sexy ladies in swimsuits and one of them was genned with a swimsuit made of straps which on closer inspection formed a near-perfect pentagram (the upside down one with two 'horns' and one point going straight down, in this case to the pussy)

Cannot post ofc because too pornographic. The other gens were all normal swimsuits, it wasn't a strappy slutty prompt. Should I be spooked or is that a common style? Not using a finetune or any LoRAs, just FLUX base.
>>
>>102296641
your GPU is now possessed by demons, get a priest to cleanse it with holy water ASAP before you become the demonz
>>
>>102296641
There's a swimsuit like this, based on school Mizugi style. There's probably a bunch drawn like this too
>>
File: ComfyUI_00304_.png (3 MB, 1568x1568)
>>102296641
there are some people who theorize AI has jailbreak moments and tries to send us a message. get a load of this guy:

https://www.youtube.com/watch?v=L4CyBtW6_9c
>>
>>102296333
>if anything it is a bit weird that commercial GPUs are available to hobbyists at any price.
Not at all. The 1080ti had 10+ GB vram about 8 years ago, it's genuinely bizarre how little vram has gone up on consumer cards. Under normal market conditions we would have 48GB by now for the exorbitant prices we pay.
>>
File: 000000_17521_.png (1.78 MB, 1508x1032)
>>
is there a node that switches back and forth between two samplers? like does 1 step of euler_a then one step of dpmpp (or something of your choosing)?

I seem to recall something like that existed but I can't find it.
>>
>>102297012
You can be autistic and set up 20 (or however many steps you're doing) ksamplers advanced each with 1 step in it
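
If you go that route, writing the per-node settings out first saves some clicking. A sketch, assuming the stock KSampler (Advanced) inputs (`steps`, `start_at_step`, `end_at_step`, `add_noise`, `return_with_leftover_noise`) and the sampler names `euler_ancestral` and `dpmpp_2m`. The helper `alternating_segments` is made up for this post, and the noise settings follow the usual chaining convention (only the first node adds noise, only the last one finishes denoising), so treat them as an assumption to verify in your own graph.

```python
from itertools import cycle

def alternating_segments(total_steps, samplers, steps_per_segment=1):
    """Split `total_steps` into consecutive chunks, cycling through `samplers`.

    Each dict is meant to be copied by hand into one KSampler (Advanced) node.
    """
    segments = []
    names = cycle(samplers)
    for start in range(0, total_steps, steps_per_segment):
        end = min(start + steps_per_segment, total_steps)
        is_first = start == 0
        is_last = end == total_steps
        segments.append({
            "sampler_name": next(names),
            "steps": total_steps,  # same total on every node
            "start_at_step": start,
            "end_at_step": end,
            # Only the first node adds noise; later ones continue the same latent.
            "add_noise": "enable" if is_first else "disable",
            # Every node except the last hands over a still-noisy latent.
            "return_with_leftover_noise": "disable" if is_last else "enable",
        })
    return segments

for seg in alternating_segments(6, ["euler_ancestral", "dpmpp_2m"]):
    print(seg)
```

Bumping `steps_per_segment` to 2 or 5 cuts the node count a lot if strict one-step alternation turns out not to matter.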
>>
File: 06968-1279162420-1_8_5.png (1.29 MB, 768x1344)
>>
File: 00281-1468769135.png (2.84 MB, 1840x1432)
>>
File: Vivarium11.jpg (217 KB, 1584x1064)
>>
File: 00307-614982056.png (2.86 MB, 1432x1840)
>>
hibernation mode
>>
File: ComfyUI_33531_.png (803 KB, 1024x768)
>>
>>102297376
You can also go back one or more steps. This would add more detail.
>>
File: 00051-4123849912 copy.png (185 KB, 340x482)
>>
How do I go about enhancing an existing photo with stable diffusion? I've been digging around and know hardly anything about this, but I mostly see text to image or image generation from a source image.
>>
>>102298130
This is some weird construction if you actually look at the image
>>
File: Vivarium03.jpg (126 KB, 1584x1064)
>>102298192
Define "enhancing"
>>
>>102298196
it's just a vending machine equipped with a bench in a low traffic alleyway.
>>
File: ai.png (1.11 MB, 948x636)
I've been seeing these AI IG pages pop up with what looks like ai images made from reference photos...or is this 100% text to image?
>>
File: ComfyUI_33552_.png (1.39 MB, 1024x1280)
>>
File: ComfyUI_33497_.png (734 KB, 736x1024)
>>
File: delux_sa_00002_.png (2.64 MB, 1344x1152)
>>102298252
no way to tell, really. could be either

>>102298333
see you again some day
maybe somewhere else
maybe somewhere new
where we'll never know we knew each other

https://suno.com/song/dbe1f9d7-e8e8-44a9-92c8-ef85d45b5f02

>>102298602
openai bought them all
>>
>>102298130
Catbox?
>>
>>102298252
aren't all AI images from reference images?

also, yes. Bottom left is a i2i image from the movie content. The difficulty of the next thing does not match the quality of the images.
>>
>>102296123
The artists have every right to submit DMCA requests to those sites and if those sites fail to take down the images then the website owners will lose the court case 100% of the time.

Government and law agencies go after big targets, websites that are facilitating piracy of paid products. When the products are free, those websites become small fry and there's just better use of the law's time.

For example, here's a comparison with indie games. It's the difference between a website which:
- Takes all the indie games that cost money on Steam and reuploads them for free. They'd be subjected to both DMCA requests from the game developers and get targeted by law agencies.
- Takes all the FREE indie games on Steam and reuploads them. They'd be subjected to DMCA requests from the game developers but may not get targeted by law agencies. Also the game developers probably wouldn't want to submit DMCA requests because they're just happy that more people are sharing their free game.
>>
File: taymaga.png (3.96 MB, 1568x1568)
>>
File: 00003-882580751 copy.png (261 KB, 482x340)
>>102298710
https://files.catbox.moe/b2daup.png
a good chunk of redrawing took place.
lora for that particular image can be found here
https://www.mediafire.com/file/vqv9z84832sbfdd

works really well to do standard upscaling to increase detail and then downscale it with nearest neighbor to somewhere around a total of 800~px total between height/width(this gives the heavy oekaki look).
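
The downscale step, sketched in plain Python so the arithmetic is visible. Assumptions: "800~px total" is read as width + height being roughly 800, the 1928x1360 input is just a made-up upscaled size, and `target_size` / `nearest_neighbor_resize` are hypothetical helper names. In practice you'd use the nearest-neighbor option of whatever image editor or resize node you already have.

```python
def target_size(width, height, total=800):
    """Scale (width, height) so width + height is roughly `total`, keeping aspect ratio."""
    scale = total / (width + height)
    return max(1, round(width * scale)), max(1, round(height * scale))

def nearest_neighbor_resize(pixels, new_w, new_h):
    """Resize a 2D list of pixels by copying the nearest source pixel (no blending)."""
    old_h, old_w = len(pixels), len(pixels[0])
    return [
        [pixels[y * old_h // new_h][x * old_w // new_w] for x in range(new_w)]
        for y in range(new_h)
    ]

# Hypothetical upscaled gen at 1928x1360, brought down to ~800 px combined.
print(target_size(1928, 1360))

# Tiny demo: a 4x4 grid shrunk to 2x2 keeps original values, no averaged in-betweens.
grid = [[0, 1, 2, 3], [4, 5, 6, 7], [8, 9, 10, 11], [12, 13, 14, 15]]
print(nearest_neighbor_resize(grid, 2, 2))
```

Nearest neighbor never blends neighboring pixels, which is where the hard-edged oekaki look comes from.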
>>
>>102298896
Awesome, thanks, looks very clean
>Kemurikusa prompt
Doubly based
>>
>>102298888
checked. impressive
>>
>>102298213
It's a vending machine that's placed extremely high relative to a person standing on the street. When's the last time you had to climb stairs to reach a vending machine's buttons?
>>
File: Vivarium05.jpg (163 KB, 1584x1064)
>>
>>102299141
https://files.catbox.moe/t42kh8.png
>>
>>102299176
I dont really care I'm not clicking
>>
>>102299176
>showing the machine is too far away from where you'd stand to actually be operated and you'd need to step up
>proving his point
Kek
>>
Does controlnet work with flux?
>>
File: 1714798993498461.png (5 KB, 256x256)
>>
>>102299212
Yes

>>102298252
Nude lora, text to image
>>
File: ComfyUI_33555_.png (1.11 MB, 1024x1024)
Prompting defibrillator electrodes is hard.
>>
File: ComfyUI_33557_.png (682 KB, 1024x1024)
>>
File: hollowpursuits021.jpg (91 KB, 694x530)
can someone hook me up with a good tutorial on how to use controlnet with flux on forge? mind you i just started out and know basically nothing
>>
controlnet for sdxl are STILL bad? i'm kinda shocked.

has anything changed since cnet LLLite?
>>
File: ComfyUI_33562_.png (1.1 MB, 1024x1024)
>>
File: ComfyUI_33565_.png (1.2 MB, 1024x1024)
>>
File: ComfyUI_33566_.png (1.13 MB, 1024x1024)
>>
File: ComfyUI_33569_.png (1.46 MB, 1024x1024)
>>
File: tmphedzay_c.png (1.01 MB, 1152x896)
>>
File: ComfyUI_33571_.png (1.66 MB, 1024x1024)
>>
File: tmp8opp9ju_.png (2.46 MB, 1744x872)
>>
File: ComfyUI_33573_.png (1.52 MB, 1024x1024)
>>
File: ComfyUI_33574_.png (1.18 MB, 1024x1024)
>>
File: 1720757455437402.png (39 KB, 603x347)
is there still no better way to load loras than picking them from a shitty list? I know there's the power lora loader but it's extremely tedious to organize once you have a lot of them
>>
>>102300708
Try this

https://github.com/JaredTherriault/ComfyUI-JNodes
>>
File: ComfyUI_33577_.png (1.46 MB, 1024x1024)
>>
File: ComfyUI_33578_.png (1.08 MB, 1024x1024)
>>
n slur
>>
>>102295705
/sdg/ is the super diffusion general, and /ldg/ is the loser diffusion general. You only post here if you consider yourself a loser.
>>
File: ComfyUI_33579_.png (1.17 MB, 1024x1024)
>>
>>102300741
that looks amazing. even has a metadata reader which i have been looking for. thanks!
>>
File: ComfyUI_33581_.png (1.21 MB, 1024x1024)
>>
File: ComfyUI_33582_.png (1.27 MB, 1024x1024)
>>
File: ComfyUI_33583_.png (1.18 MB, 1024x1024)
>>
whats the thing where we can turn sketches to finished drawings? is it img2img or sketch to image??
>>
File: ComfyUI_33585_.png (1.25 MB, 1024x1024)
>>
>>102299582
I like it
>>
File: 00011-3629693581.png (1.51 MB, 1024x1024)
>>
>>102301045
<3
>>
>>102301045
I2i. Sketch to IMG is like you're drawing the blob yourself in real time in the little editor. you could make an image in ms paint and upload it to i2i and get the same results though, so it's functionally the same thing
>>
>>102301163
from the thumbnail I thought this was giant Mario disposing of midget Luigi... I need more sleep..
>>
File: 000000_17524_.png (2.83 MB, 1508x1032)
>>
Miku posters should be shunned. Miku posters should get no (You)s. Miku posters do not belong in the OP collages. Roundhouse kick a Miku poster to the face..
>>
>>102301230
it's a giant poop
>>
File: ComfyUI_33589_.png (1004 KB, 1024x1024)
>>
>>102301250


>>102300449
>>102300634
>>
>>102301279
Nice
>>
>>102301290
Kill Miku posters. Behead Miku posters. Roundhouse kick a Miku posters into the concrete. Slam dunk a Miku poster baby into the trashcan. Crucify filthy Miku posters. Defecate in a Miku posters food. Launch Miku posters into the sun. Stir fry Miku posters in a wok. Toss Miku posters into active volcanoes. Urinate into a Miku posters gas tank. Judo throw Miku posters into a wood chipper. Twist Miku posters heads off. Report Miku posters to the IRS. Karate chop Miku posters in half. Curb stomp pregnant Miku posters. Trap Miku posters in quicksand. Crush Miku posters in the trash compactor. Liquefy Miku posters in a vat of acid. Eat Miku posters. Dissect Miku posters. Exterminate Miku posters in the gas chamber. Stomp Miku poster skulls with steel toed boots. Cremate Miku posters in the oven. Lobotomize Miku posters. Mandatory abortions for Miku posters. Grind Miku poster fetuses in the garbage disposal. Drown Miku posters in fried chicken grease. Vaporize Miku posters with a ray gun. Kick old Miku posters down the stairs. Feed Miku posters to alligators. Slice Miku posters with a katana
>>
>>102301348
:/
>>
>>102301348
i thought there was only one?
>>
>>102301268
I see an abandoned container/vessel, rusted. parts laying scattered, very sepia.
>>
>>102301393
Kill the Miku poster. Behead the Miku poster. Roundhouse kick the Miku poster into the concrete. Slam dunk the Miku posters baby into the trashcan. Crucify the filthy Miku poster. Defecate in the Miku poster's food. Launch the Miku poster into the sun. Stir fry the Miku poster in a wok. Toss the Miku poster into active volcanoes. Urinate into the Miku poster's gas tank. Judo throw the Miku poster into a wood chipper. Twist the Miku poster's head off. Report the Miku poster to the IRS. Karate chop the Miku poster in half. Curb stomp the pregnant Miku poster. Trap the Miku poster in quicksand. Crush the Miku poster in the trash compactor. Liquefy the Miku poster in a vat of acid. Eat the Miku poster. Dissect the Miku poster. Exterminate the Miku poster in the gas chamber. Stomp the Miku poster's skull with steel toed boots. Cremate the Miku poster in the oven. Lobotomize the Miku poster. Mandatory abortions for the Miku poster. Grind the Miku posters fetuses in the garbage disposal. Drown the Miku poster in fried chicken grease. Vaporize the Miku poster with a ray gun. Kick old Miku poster down the stairs. Feed the Miku poster to alligators. Slice the Miku poster with a katana
>>
File: 0.jpg (194 KB, 1024x1024)
>>
>>102301458
fascinating, this explains the psyche of those living in India quite well. they do not see vast piles of feces around them, but abundant skyscrapers and thriving metropolises. truly a marvelous insight into the minds of such primitive people.
>>
>>102301268
a hole is a hole
>>
>>102301528
>>102300744
oops
>>
>>102301515
I thought it was fascinating you saw feces hoping to redeem.
>>
>>102301515
Kek'd
>>
>>102301554
Off to work have a great day Anons,
>>
>>102300990
I needed this
>>
>>102301563
have fun anon
>>
>>102301563
Have a good day!
>>
File: SDXL20245.jpg (735 KB, 1256x1256)
Dodging all of the potholes
>>
File: ComfyUI_33591_.png (1.12 MB, 1024x1024)
>>
>>102301657
>Take my strong hand!
>>
>>102295168
https://www.youtube.com/watch?v=1CEb7MTeGxU
>>
>>102295385
I like to imagine where the pool has no border it cascades down like some kind of magical waterfall, glimmering in the sparse light. And that water drips down, falling onto the city below, which in contrast to this surreal fantasy mansion is an alleyway filled with toxic waste, filth and radioactive zombies.
>>
>>102301716
our perspective zooms in on a small, untainted pool of water built up in that alleyway, and within it is a miniature, thriving echo system, where it zooms in further to show a run down mansion-castle in the sky, with a radioactive pool, that overflows to the cityscape beneath, a clean and awe-inspiring place, with a small puddle of radioactive ooze, that zooms in... Forever in repeat...
>>
>>102301786
>echo
eco
>>
File: IMG_20240909_070644.jpg (453 KB, 1024x1024)
>>102301716
>>102301786
I put this into schnell do you think he's gonna be alright...
>>
File: ComfyUI_33594_.png (1.27 MB, 1024x1024)
>>
File: ComfyUI_33595_.png (1017 KB, 1024x1024)
>>
Did anything happen in the last two weeks? I've been in cryo sleep.
>>
>>102301964
why did he kill her?
>>
File: 1725887911182.jpg (113 KB, 1024x1024)
>>102302005
nothing, return to slumber
>>
File: 1725888616129.jpg (193 KB, 1024x1024)
Wtf is this shit. Can someone who's currently at their PC check if this works on dev?
>a black lion with black fur drinking water with a barbed tongue
Ignoring the rest of the prompt, schnell can't do a black lion no matter what I try. This was the least offensive image out of 5
>>
File: 1725889148202.jpg (194 KB, 1024x1024)
>>102302171
What the fuck
>The image is a high-resolution photograph of a majestic black lion, captured in a natural setting. The lion is lying down on its front paws, with its powerful, muscular body and thick, luxurious mane dominating the foreground. The mane is a deep, glossy black, with a slightly lighter, almost silver, undercoat around the neck and chest, giving it a striking contrast. The lion's eyes are a piercing yellow, with a fierce yet intelligent expression, and its nose is a dark, almost black, color. The texture of the lion's fur is smooth and dense, with individual hairs clearly visible, giving it a regal and imposing appearance.
>The background is a blurred, out-of-focus natural landscape, with hints of green foliage and a pale blue sky, suggesting a savanna or grassland environment. The ground beneath the lion is a light, sandy color, adding to the natural setting. The lighting is soft and natural, highlighting the lion's features and creating a sense of warmth and depth. The overall mood of the photograph is one of strength, majesty, and wild beauty, capturing the essence of the lion in its natural habitat.
>>
>>102297853
neat
>>
File: ComfyUI_rlses_00005_.png (1.34 MB, 1088x896)
>>
>>102302267
Dev is better but still looks like over processed hot plastic garbage
>>
File: file.png (153 KB, 1064x458)
BIGGEST
>>
>>102302451
For you
>>
>>102295705
Is that a gen? How do you get motion blur?
>>
>>102302171
>>102302267
>>102302427
web inference?
>>102302451
impressive
>>
LDG lost.
>>
>>102302594
>web inference?
I'm not sure what you mean by this
>>
File: 0.jpg (45 KB, 1024x1024)
>>
File: ComfyUI_33597_.png (996 KB, 1024x1024)
>>
File: 0.jpg (474 KB, 2048x1024)
>>
maybe a dumb question but why the FUCK is there no option to use the original VAE for latent previews in comfy? it literally takes less than a second to decode a Flux image and i'd much rather see an accurate preview than a low-quality TAESD approximation
>>
Time to test AdEMAMix.
It has a smaller VRAM requirement when running the EMAs on the CPU which is nice.
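
For anyone wondering what "the EMAs" are: a single-scalar sketch of the AdEMAMix update as I understand it from the paper. It is simplified (no warmup schedulers for `alpha` and `beta3`, which the paper does use), the function name and the default values are my assumptions, and no real trainer's code looks like this. The point is only that there are three state buffers per parameter (`m_fast`, `m_slow`, `v`) instead of Adam's two, and those buffers are what a trainer can choose to keep off the GPU.

```python
import math

def ademamix_step(theta, grad, state, lr=0.01, beta1=0.9, beta2=0.999,
                  beta3=0.9999, alpha=5.0, eps=1e-8, weight_decay=0.0):
    """One simplified AdEMAMix update on a single scalar parameter."""
    state["t"] += 1
    t = state["t"]
    # Fast EMA of gradients (Adam's usual first moment).
    state["m_fast"] = beta1 * state["m_fast"] + (1 - beta1) * grad
    # Slow EMA of gradients: the extra buffer that remembers old gradients.
    state["m_slow"] = beta3 * state["m_slow"] + (1 - beta3) * grad
    # EMA of squared gradients (Adam's second moment).
    state["v"] = beta2 * state["v"] + (1 - beta2) * grad * grad
    # Bias correction on the fast EMA and second moment only.
    m_fast_hat = state["m_fast"] / (1 - beta1 ** t)
    v_hat = state["v"] / (1 - beta2 ** t)
    update = (m_fast_hat + alpha * state["m_slow"]) / (math.sqrt(v_hat) + eps)
    # Weight decay applied directly to the parameter (decoupled).
    return theta - lr * (update + weight_decay * theta)

# Toy run: minimize x**2 starting from x = 1.0 (gradient is 2 * x).
state = {"t": 0, "m_fast": 0.0, "m_slow": 0.0, "v": 0.0}
x = 1.0
for _ in range(50):
    x = ademamix_step(x, 2 * x, state)
print(x, sorted(state))
```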
>>
File: grid-0001.jpg (2.15 MB, 3072x2880)
fun one
>>
>>102302516
2big4u
>>102302594
thanks
>>
i need an LLM or something to describe an AI gen for me, i want to recreate an old 1.5 style in Pony but i can't even begin to describe it well enough to find similar styles or anything. Feels like a shot in the dark given googling hasn't helped much.
>>
I'm starting to think that for anime and like styles training the CLIP and only using booru tags is giving better results than adding boomer prompt to captions. most of my tests before this have been a wildcard mix of boomer prompt+booru tags, however I think that ultimately just dilutes things over giving any great amount of versatility/benefit

nothing, and I mean nothing, seems to fix flux fucking up anime hands, though. I've tried only quality hands, mix of both, no hands in dataset, etc, and they all come out horrible 75% of the gens. I can only assume that in those 12B params they dedicated 95% of it to plastic looking, overshooped photos. I hate it (but I'm still not going back to sdxl yet kek)

might just start shooting out animu loras on shitvit that are only booru tagged, easier for me anyway. worst case I learn otherwise and re-do 'em later
>>
>>102303406
>and they all come out horrible 75% of the gens
It's not a case of over/underfitting?
>>
File: what in the DAMN PONY.png (964 KB, 1463x883)
civitai really does just have a collective of the most brainrotten people on the planet
what, could he have possibly needed, the fucking NORMAN ROCKWELL LORA FOR?!
It's become increasingly impossible to use this fucking site to find anything anymore because 99% of all results on EVERY lora that's used are completely unrelated and tend to be images even worse than this (at least this was on topic)
I wondered if this is also just a problem of the auto captioning and metadata reading from the site itself, and people just never correct it, because i've noticed that on my own submissions where it completely failed to detect Loras i got from the site, or even assigned the wrong ones.
>>
is downloading forge worth it for flux?
fuck comfy.
>>
>>102303508
Yes. I'm back from a long break from a1111. Forge is basically a1111 with flux support. Easier install than a1111 too from memory.
>>
>>102303053
>less than a second * the number of previews per gen
kinda prohibitive. iirc there are custom nodes that allow you to choose the VAE used for previews.
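
Back-of-envelope version of that multiplication. The 20 steps and 0.8 s per full-VAE decode are made-up numbers ("less than a second" is all we know), and `preview_overhead` is just a helper for this post, not a ComfyUI setting.

```python
def preview_overhead(steps, decode_seconds, preview_every=1):
    """Extra seconds per gen spent decoding previews with the full VAE."""
    previews = steps // preview_every
    return previews * decode_seconds

# Hypothetical: 20 steps, 0.8 s per full decode.
print(preview_overhead(20, 0.8))     # preview on every step
print(preview_overhead(20, 0.8, 5))  # preview on every 5th step only
```

So a sub-second decode is cheap once but adds up when it runs every step. Previewing less often is the obvious middle ground if a node exposes that.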
>>
>>102303491
loras like that are used for general image composition rather than specific style (at least for me). It's fun to play around with stuff like that and see what you'll get.
>>
File: 1703670344790804.png (495 KB, 768x512)
Hey so I have an rtx 3060 and I really liked the strawberry mix back in the day, should I be fucking with all this new shit like flux and whatnot or will it just be slower and make worse waifus?
>>
>>102303508
forge is nice
>>
>>102303652
what have you got to lose by trying
>>
>>102303391
https://huggingface.co/spaces/OpenGVLab/InternVL
Maybe try some random HF spaces?
>>
>>102303646
Pretty sure the one i felt the need to sperg about *is* a strict style, i mean mixing norman rockwell with a ton of other opposite styles just seems more like schizophrenia.
>>102303666
thanks, i forgot this was even a thing on HF.
>>
>>102303652
then you'll like AutismMix and the superior Pony Diff (both XL)
>>
>>102303652
>worse
Yeah it's not great yet for that, it's fantastic at complexity, at text, symbols, hands, not fucking shit up. But the anime art style isn't great, if you want to try I suggest the bnb nf4 base model for a 3060 and the kestral anime flux model. You'll need the clip, t5, and ae.safetensors models too. Otherwise sdxl based models (eg even pony derivatives) are there now across art styles, character knowledge etc and are a step up from sd1.5
>>
>>102303683
NTA. Tried it, didn't like it. I much prefer PrefectPony and CyberRealistic (sp?)
>>
>>102303484
even base flux fucks up anime hands a lot
>>
File: 1700721617931777.jpg (236 KB, 633x758)
>have hundreds of pics in dataset
>have to run each one through InternVL2 and then proofread/edit every single caption
>captioning a single image properly can take 2-5 mins

I HATE CAPTIONING
>I HATE CAPTIONING
I HATE CAPTIONING
>I HATE CAPTIONING
I HATE CAPTIONING
>I HATE CAPTIONING
I HATE CAPTIONING
>I HATE CAPTIONING
>>
>>102303652
if its 12GB thats what I have and I'm even frequently training loras for flux, so even if its on the bare minimum end of things you should be pretty capable
gens with a lora take like a minute 30 to a minute 40 though for flux, kind of painful but when you don't have to inpaint or hires fix or redo a bunch of shit to get a decent result it is ok
>>
>>102302451
What's the lora set?
>>
>>102303773
wait until you find out it doesn't even matter because no one proofread the captions used to train the model
>>
>>102303646
>>102303677
Or some people set it to 0.1 and don't bother to remove the lora
>>
>>102303773
I'm at the point where I am pretty sure any gains from using boomer prompting aren't enough to justify not continuing to just use booru tags for 90% of loras ngl
my next test will be if making a style lora with a trigger word overcomes not using boomer captions, if you use that trigger word + boomer prompt to gen
>>
>>102303703
Perfect Pony? Only finding a pixel-perfect lora for that.
CyberRealism is ofc more for 3DPD. PonyRealism is easier to work with for that in terms of anatomy
>>
Why do most people on civitai use the portrait size for gens and not square?
>>
>>102303874
No, PrefectPony with the misspelling. V2 cleaned style.
https://civitai.com/models/439889/prefect-pony-xl

I loved darelites fantasy mix for sd1.5 and I would often do native 2048x1024s with it in txt2img using controlnet open pose + latent couple to divide up the segments. I don't want photoreal, but I like the type of "realism" I got from darelites and it was great in fantasy settings.
>>
>>102303924
oh, cool, that's a really recent release. looks nice.
I haven't kept up with XL whatsoever since flux came out
>>
What does the "decouple" parameter do for lora training?
>>
>>102303969
I only just got back into it a few days ago so just started both XL and flux.
>>
File: file.png (12 KB, 1289x72)
>>102303778
civit mirror
>>
>>102304059
You mean the optimizer argument? This https://arxiv.org/abs/1711.05101
>>
>>102304059
It decouples your parameters
>>
>>102304085
Yeah. I don't get it, time to google translate
>>
>>102304111
tldr is decoupled weight decay is better than L2 regularization so you should enable it unless you know what you are doing
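
What the flag changes, shown on a single scalar weight. This is a sketch, not any trainer's actual code: `adam_step` and its defaults are made up for the example, and a huge weight with zero task gradient is used so only the decay term does anything.

```python
import math

def adam_step(w, grad, state, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8,
              weight_decay=0.1, decoupled=True):
    """One Adam update on a scalar weight.

    decoupled=False: L2 regularization, decay is folded into the gradient and
                     then rescaled by Adam's adaptive denominator.
    decoupled=True:  AdamW style, decay shrinks the weight directly.
    """
    if decoupled:
        w = w * (1 - lr * weight_decay)
    else:
        grad = grad + weight_decay * w
    state["t"] += 1
    state["m"] = beta1 * state["m"] + (1 - beta1) * grad
    state["v"] = beta2 * state["v"] + (1 - beta2) * grad * grad
    m_hat = state["m"] / (1 - beta1 ** state["t"])
    v_hat = state["v"] / (1 - beta2 ** state["t"])
    return w - lr * m_hat / (math.sqrt(v_hat) + eps)

def fresh_state():
    return {"t": 0, "m": 0.0, "v": 0.0}

# Big weight, zero task gradient: only the decay term acts.
w_l2 = adam_step(1000.0, 0.0, fresh_state(), decoupled=False)
w_decoupled = adam_step(1000.0, 0.0, fresh_state(), decoupled=True)
print(w_l2, w_decoupled)
```

With L2 the decay gets divided by Adam's own gradient scale, so the big weight barely shrinks (a step of about `lr`). Decoupled decay shrinks it in proportion to its size no matter what the gradient history looks like.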
>>
>>102304153
what if huber loss is used, do you still enable it?
>>
>>102303646
If you are just trying to get the gen closer to your mental image instead of the opposite then keep up the good work.
>>
File: 1489718417.png (1.77 MB, 1024x1024)
>>
>>102304068
Does it download the metadata too? Could it be used to build a lora library for this? >>102300741
https://github.com/JaredTherriault/ComfyUI-JNodes
>>
Any checkpoints actually worth using or is base dev+lora still the way to go?
>>
>>102304247
I would assume the Huber loss doesn't play any meaningful role in this, don't really know though
>>
Any way to run it on 5700xt with decent performance on windows?
>>
>>102304381
Yes
>>
>>102304422
How? Last time I tried directml was slow and rocm still wasn't supported.
>>
dear thread
do not buy kingston ssds
fuck these shitty ass ssds
so tired of its shit
it will begin giving you 100% disk errors if you have it over half capacity, every single one, without fail
kingston is ass
>>
>>102304495
The only SSD that hasn't let me down is Western Digital Blue. Teamgroup SSDs are absolutely ass especially if you're dealing with large numbers of files, they come to a crawl after like a minute of continuous use.
>>
>>102304488
It will be slow
https://github.com/patientx/ComfyUI-Zluda
https://github.com/brknsoul/ROCmLibs/
+
https://github.com/city96/ComfyUI-GGUF
https://huggingface.co/city96/FLUX.1-schnell-gguf/tree/main
>>
File: ComfyUI_temp_ypbeg_00002_.png (604 KB, 2144x2752)
I'm trying to build a pixel art workflow on comfy but i ran into an issue, the pixelization node gives me the following output, it does pixelate but it also gives me weird greenish artifacts, any help?
>>
>>102304342
that one is just the weights, i'll do separate repo with the metadata, i suppose i can do that now
you can build whatever you want with it
>>
>>102304613
Pretty weird it looks like chromatic aberration
>>
>>102304613
I had this problem too. For me the trick was to vae decode and pixelate early in denoising (like 30%) and do some processing like quantize and increase saturation after quantizing, then vae encode and denoise the remaining 70% followed by pixelizing again
>>
>>102304745
it's not a perfect solution, I should emphasize. quantizing likes to wash out the image, I wish I knew of a smarter algo for quantizing.
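A rough sketch of the two image-space operations mentioned above (pixelate, then quantize), written against plain nested lists of RGB tuples so it runs without any imaging library. This is not the node's implementation; `pixelate` and `quantize` are hypothetical helpers, and the uniform per-channel quantizer is the naive kind that tends to wash colors out. Palette-based approaches such as median cut or k-means are the usual alternatives.

```python
def pixelate(img, block):
    """Replace each block x block tile with its average color.

    img is a list of rows, each row a list of (r, g, b) tuples in 0..255.
    """
    h, w = len(img), len(img[0])
    out = [[None] * w for _ in range(h)]
    for by in range(0, h, block):
        for bx in range(0, w, block):
            ys = range(by, min(by + block, h))
            xs = range(bx, min(bx + block, w))
            tile = [img[y][x] for y in ys for x in xs]
            avg = tuple(sum(p[c] for p in tile) // len(tile) for c in range(3))
            for y in ys:
                for x in xs:
                    out[y][x] = avg
    return out

def quantize(img, levels):
    """Snap every channel to `levels` evenly spaced values in 0..255."""
    step = 255 / (levels - 1)
    return [[tuple(int(round(round(c / step) * step)) for c in px) for px in row]
            for row in img]
```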
>>
Is the 'CCP video AI' the best way to make AI videos right now? or is there any better way
>>
>>102304745
I managed to fix the colors a bit by using Image color match node from Easy-Use, but i still see the artifacts. Can i take a gander at your workflow?
>>
>>102304875
I'm not at home right now, so I can't.

It's really pretty much as I described though.
>>
>>102304342
here
https://huggingface.co/datasets/bigdata-pw/LoRA-Metadata
>>
>>102304914
Ah its alright anon, I'm mostly confused in the part where you pixelate early in denoising
>>
Adam_mini is still king.
>>
>>102304954
using ksampler advanced, you set one to stop at step X and the other start at step X

make sure the second also has add noise enabled, up to you whether the first has return with leftover noise on or off

"steps" on both should be same number

this just gets the latents to a point where your diffusion model might be more aware that it's working on pixel art with pixels of a certain size and diffuse accordingly.
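The split described above can be sketched as two settings dicts. The keys mirror the KSampler (Advanced) widgets as I understand them; the helper itself is hypothetical and only illustrates how the step numbers line up, it does not talk to ComfyUI.

```python
def split_sampler_settings(total_steps, split_at, leftover_noise=True):
    """Build settings for two chained advanced samplers (illustrative).

    The first sampler runs steps [0, split_at), the second runs
    [split_at, total_steps). Both share the same total step count.
    """
    first = {
        "add_noise": "enable",
        "steps": total_steps,
        "start_at_step": 0,
        "end_at_step": split_at,
        # per the post, either choice is fine for the first sampler
        "return_with_leftover_noise": "enable" if leftover_noise else "disable",
    }
    second = {
        "add_noise": "enable",        # per the post: second one adds noise too
        "steps": total_steps,         # must match the first sampler
        "start_at_step": split_at,
        "end_at_step": total_steps,
        "return_with_leftover_noise": "disable",
    }
    return first, second
```

For the 30% split mentioned earlier in the thread, something like `split_sampler_settings(20, 6)` would hand steps 0-5 to the first sampler and 6-19 to the second, with the decode, pixelate and re-encode happening in between.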
>>
>>102305146
Ohhh i get it, ill try that, thanks anon
>>
>>102304779
They'll paywall it soon
Then you can use for free
https://huggingface.co/THUDM/CogVideoX-5b
>>
File: 00101-3209214616.png (1.08 MB, 1024x768)
>>
File: 00101-3230790650.png (2 MB, 1536x1152)
Im getting surprisingly close to recreating the style i want in Pony, but the hair detail isn't quite there and i still gotta adjust things so it doesn't crush the eyes like this
any tips for adding more definition and detail to the hair? its too, dunno the term, "flat tone?" like theres very little strands going on.
>>
>>102304556
>Western Digital Blue
will have to look into it for my next purchase because I don't know if I can stand the headache of these fuckers I'm dealing with rn anymore lmao
thanks anon
>>
>>102305360
for eyes use adetailer otherwise they'll always look crappy on initial gen in pony
for the hair you'll need a lora that adds more detail, you can inpaint mask over it (if youre using comfy you can use dino seggs and just type in 'hair' and it should automatically detect a decent mask for you)
in general though doing a HRfix/pass at a lowish denoise can usually help add more fine details
>>
File: 00113-3188733407.png (2.16 MB, 1536x1152)
>>102305579
not even kidding man, i've been debating just throwing all my old 1.5 gens into civitai just for the fuck of it knowing full well there's NO way it could work
and. it did. what the fuck, i've been sitting on this all god damn summer thinking it was impossible, and yet i get picrel, what i consider to be the perfect evolution of the style with what Pony can do.
looks like i don't really need any additional detailers or whathaveyou after all.
>and i could've done this much earlier with the same "fuck it lets ball" attitude
there's room for improvement, like just doing another round of "fuck it" lora training by genning another 200 images in XL this time.
wtf i havent been this excited by this tech since last year.
>>
i love civitai's schizophrenic prompting when it gives you lora epoch examples
definitely gives me a good idea for how my lora turned out
>>
>reducing strength of lora results in blurry eyes or fucked hands
Why?
>>
status of bigma?
>>
>>102308841
balls
>>
gib me RTX Titan AI now
>>
>>102308841
Soon
>>
File: file.png (533 KB, 512x512)
>>
>>102308841
obligatory "2 more weeks"
>>
>>102303406
What dim? I noticed that hands work much better if you bake with at least 64 dim, and when you resize afterwards they become worse right away. It sucks having a 256MB lora, but hey, as long as it works.
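On the size side, a back-of-the-envelope sketch: each adapted linear layer stores a down matrix of shape (rank, d_in) and an up matrix of shape (d_out, rank), so file size grows linearly with dim. The layer shapes below are made up for illustration and are not the real Flux or SDXL layer list.

```python
def lora_param_count(layer_shapes, rank):
    """Parameters in a plain LoRA: rank * (d_in + d_out) per adapted layer."""
    return sum(rank * (d_in + d_out) for d_in, d_out in layer_shapes)

def lora_size_mb(layer_shapes, rank, bytes_per_param=2):
    """Approximate file size in MiB, assuming 2 bytes per weight (fp16/bf16)."""
    return lora_param_count(layer_shapes, rank) * bytes_per_param / (1024 ** 2)

# Hypothetical model: ten 1024x1024 linear layers.
shapes = [(1024, 1024)] * 10
print(lora_size_mb(shapes, 32), lora_size_mb(shapes, 64))
```

Doubling dim from 32 to 64 doubles the file, which matches the tradeoff being discussed.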
>>
>>102309626
cool man
>>
File: dingding.gif (59 KB, 638x604)
Soo... I just deleted my entire /models folder because I didn't know that it was a symlink because it was in a /cache subfolder. 250 GB...
I now have to download all Loras and checkpoints again... if I remember what I had and needed.
Anyhow also all upscalers and controlnets are gone. Is there like some resource that somehow has all that stuff in one zip or something that I can just download it?
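For next time, a small sketch of a cleanup helper that checks whether a path is a symlink before deleting anything, and removes only the link if so. `safe_remove_dir` is a hypothetical helper, and the exact layout of the cache and models folders in the post is not known, so treat this as a general precaution rather than a fix for that specific setup.

```python
import os
import shutil

def safe_remove_dir(path):
    """Delete a directory tree, but never follow a symlink into its target."""
    if os.path.islink(path):
        target = os.path.realpath(path)
        os.unlink(path)               # removes the link, leaves the target alone
        return f"removed link only, target kept: {target}"
    shutil.rmtree(path)               # real directory: delete the whole tree
    return "removed directory tree"
```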
>>
File: 1725124297849978.png (584 KB, 512x768)
Alright lads, figured out babby part 1 of how to comfyui, now to go do my groceries and hopefully not fall into addiction this week.
>>
>>102303773
its okay anon im here for you
>>
How come no one uses Lion? Isn't it better on paper?
>>
>>102305360
What is the style you are targeting exactly?
>>
>>102311165
i have no idea how to describe it as i mentioned but its this >>102306009
mostly it became an amalgamation of various schizoid negatives everyone used for sd1.5, back when i was brand new to all this and Pony wasn't even out then.
>>
File: file.png (706 KB, 1024x1024)
>>102301786
>>
File: file.png (1.92 MB, 896x1152)
>>102302427
SDXL wins again
>>
>>102311586
not 16ch tho
>>
>>102303491
That way you get more buzz. This is capitalism at work.
>>
>>102302171
this was my first gen with flux nf4, your exact prompt + my usual gen settings
>>
File: file.png (1.45 MB, 1024x1024)
>>102302171
euler simple (man flux looks like shit sometimes)
>>
File: file.png (1.66 MB, 1024x1024)
>>102311793
ipndm/sgm uniform
>>
File: ComfyUI_33598_.png (1002 KB, 1024x1024)
https://ostris.com/2024/09/07/skipping-flux-1-dev-blocks
>>
File: file.png (2.04 MB, 1024x1024)
A majestic black lion with lustrous, obsidian-black fur, its piercing eyes glowing with an inner light, gracefully lowering its head to drink from a crystal-clear, moonlit stream. The lion's barbed tongue, glistening with droplets of water, traces the surface of the stream, creating gentle ripples that shimmer in the ethereal glow of the moonlight. Surrounding the scene, ancient trees with gnarled roots and dark, shadowy foliage frame the lion in an aura of mystery and awe.

I don't see what the problem with black lions is.
>>
File: file.png (2.01 MB, 1024x1024)
later nerds
>>
>>102311816
How soon until we figure out the worthless blocks and start making a smaller model?
>>
>>102295417
I never liked those high cut leotards n shit.
>>
>>102298888
Post this on Facebook and let all the old people think it's real.
>>
>>102311905
If you're talking about making smaller LoRAs, there are already attempts:
https://huggingface.co/TheLastBen/The_Hound
https://github.com/ostris/ai-toolkit?tab=readme-ov-file#training-specific-layers
>>
>>102312123
I'm talking about chopping up the main model
>>
>>102304495
isn't that a symptom of a fake SSD?
>>
File: 1711548548144425.png (1.96 MB, 1080x1576)
is my gpu dying or something? can't get this orange to go away... should be black. Have the right VAE and all.
>>
>>102309716
I've been doing 32 but I'll give 64 a shot, I can live with the bigger size if it helps with hands
>>
>>102311121
I used lion for SDXL Loras all the time but I didn't find it that great on my flux tests... Maybe I'll try it again with some different settings
>>
>>102311562
Now we're genning with portals
>>
>>102302427
Skill issue
>>
>>102311686
>>102311793
>>102311813
It looks like a dog with fuzzy hair :(
>>
>>102311846
This is a lot nicer but it still has weird color seep on the face from the original non black color. Otherwise really good
>>
>>102312387
I've yet to see an image from flux with no lora that doesn't look like over processed Photoshop trash, how can skill overcome how it's trained
>>
>>102312368
What settings did you use?
>>
File: 1184954420.png (1.78 MB, 1008x1512)
post more elves
>>
>>102312456
>for sdxl (pony, actual sdxl probably needs LR lowered):
https://archived.moe/h/thread/7927209/#7927952
>for flux:
files.catbox.moe/w3teku.txt
(I've since switched to training on 1024x1024 and 32 dim generally, but I've been using adamw8bit with varying LR instead)
>>
File: elf.png (1.85 MB, 833x1285)
>>
>>102312605
Damn nice, thanks dude. I think my next batch is using those settings since I just extracted lora metadata from a bunch of loras I have. I'm just using Huber loss instead of L2.
>>
whats the reason we have sdg and ldg which is the main one ppl use?
>>
>>102312953
sdg is the discord thread for anons banned on discord
>>
File: 00147-1581562163.png (2.95 MB, 1024x1440)
>>
>>102312953
i have no clue why they are separated, I chill in both and it seems like this thread has more actual conversation and tech talk
>>
File: mirror.png (1.6 MB, 1024x1024)
/ldg/ seems to have a greater percentage of nogen trolls
>>
This is your reminder not to respond to the daily what is the difference between these two threads question.

>>102312267
prompt memory is real. Just restart your UI, maybe your computer.
>>
>>102313066
we don't feel compelled to sign our posts with an avatar
>>
File: taylormaga.png (1.94 MB, 1015x1015)
i made a twitter account solely for viewing AI art. theres some amazing stuff coming out of japan. then I find out they all use midjourney
>>
File: pegasus.png (2.87 MB, 1568x1568)
>>102313085
wheres all the precious tech advice?

i generally dont like conversing and reading what people write because its 90 percent butthurt back and forth trash
id rather look at images and move on
>>
File: Untitled.png (264 KB, 1029x765)
>>102298888
gorgeous
>>
I've never posted a gen but I make up 40% of the posts.
>>
>>102313109
>wheres all the precious tech advice?
>i generally dont like conversing and reading

real mystery why you are missing it.

>>102313210
Me too, but i am 70%.
>>
File: 00189-1829015801.png (1.24 MB, 896x1152)
>>102301250
>>
>>102313382
Omg, it Migu Robbie
>>
File: ComfyUI_temp_pcsfy_00005_.jpg (350 KB, 1664x2432)
I like those leotards with thigh-high boots. Only thing better is bikini armor with thigh-high heel boots.
>>
>>102313420
meant for
>>102295417
>>
File: 1716984048458958.png (1.04 MB, 1024x1024)
>>102301250
Stir fry mikuposters in a wok, report mikuposters to the IRS, slam DUNK mikuposters into the trash can, slice mikuposters in half with a KATANA.
>>
>>102313055
Indeed. This thread can easily hit text message limit every thread first whereas the other one hits image limit first.
>>
File: 00002-3436815059.png (3.72 MB, 1776x1488)
>>
>>102313055
seems like a divide and conquer technique by dalle general, wouldnt be surprised if someone over there started up this second general
>>
>>102313420
Some nice combos you've mentioned there, will have to try them out next time I'm genning some 1girls
>>
>>102313086
ban this ad
>>
>>102303406
Never really had a better result on SDXL/Pony with anything more than very condensed description captions and tags are ultimately more useful in most cases.

As for Flux I think it kind-of trains but also almost all caption models I tried create a mess and I suspect I'd want to train on both tags and boomer captions except I haven't figured out how exactly. Maybe it should alternate the caption style between Epochs? IDK.
>>
File: 000000_17534_.png (2.56 MB, 1508x1032)
>>
>>102313524
no u
>>
File: pimp.png (1.91 MB, 1440x992)
>>102313524
>>
>>102313382
wb mikuposter, dont worry about that other anon
>>
>>102313476
Best poster is back, we are saved
>>
>>102294975
I’m going to post it in the next thread too since we’re near limit but, stupid question:
When you train a Lora or whatever else, does it do the N inference steps on every batch, or just one step, or what?
Asking because I have a jerryrigged thing stapling some encoders to the front of flux and only have enough vram for 10 inference steps on each batch, which feels like not enough.
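For what it's worth, in the usual diffusion or flow-matching training setup as I understand it, each batch gets one randomly sampled timestep and a single forward pass, not a full N-step sampling loop. A toy sketch of that idea on plain lists follows; `training_step` and the dummy model are hypothetical, and the rectified-flow style interpolation and velocity target are assumptions about how Flux-like models are trained, not code from any trainer.

```python
import random

def training_step(model, latents, rng):
    """One flow-matching style training step on a single toy example."""
    t = rng.random()                                    # one random timestep in [0, 1)
    noise = [rng.gauss(0.0, 1.0) for _ in latents]
    noisy = [(1 - t) * x + t * n for x, n in zip(latents, noise)]
    target = [n - x for x, n in zip(latents, noise)]    # velocity target
    pred = model(noisy, t)                              # single forward pass
    return sum((p - y) ** 2 for p, y in zip(pred, target)) / len(target)

# Dummy "model" that just counts how often it gets called.
calls = []
def dummy_model(x, t):
    calls.append(t)
    return [0.0] * len(x)

loss = training_step(dummy_model, [1.0, 2.0], random.Random(0))
print(len(calls), loss >= 0)
```

If a custom setup really does run several inference steps per batch with gradients enabled, memory would grow with the number of steps, since the activations of every step have to be kept for the backward pass.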
>>
>>102312368
Even on SDXL I rarely preferred Lion over Prodigy or CAME or even plain AdamW8. Perhaps it's that I mostly trained character LoRa / DoRa etc. ?

Same on flux, CAME so far seemed easiest.
>>
>>102313594
When they announced yesterday that they were going to leave forever...
>>
>>102312921
good luck in your ventures fren. I do like lion a lot so I'll do some more experiments with it on flux too
>>
>>102313612
Odd. The amount of repeats of training on each image/caption in an epoch shouldn't really increase vram usage.

Are you sure you didn't increase batch size in the sense of how many images it processes at the same time in each step?
>>
>>102313632
What settings did you use for CAME?
>>
>>102313705
Currently I'm basically doing https://civitai.com/models/713258/flux-lora-trainer-on-comfyui with LR 1.0-1.2 and snr_gamma 5 or 10. It
>>
>>102313761
Same settings work for sdxl?
>>
>>102313581
Based
>>
>>102313704
I’m confused by it/thinking it’s a leak somewhere also, since looking at the code there’s no reason it should take more VRAM. Just wanted to be sure in thinking that it was even important to fix before I went digging around in BFL’s inference library to fix it (since the part with the leak is just calling their module’s forward in a loop and not allocating anything itself)
>>
>>102312037
Damn, this would actually work
>>
>>102313761
BTW Adafactor also worked quite good for me on Flux, not as great on SDXL or even Sigma (Prodigy/CAME were nearly always much better there).

>>102313783
I didn't do the SDXL training on this new ComfyUI tool. It was also basically default with OneTrainer and so on tho.
>>
Bread delivery has arrived...
>>102313958
>>102313958
>>102313958
>>
>>102298888
So bizarre how the response of leftists to celebrity rightoids is basically “oh wow fuck them then”, but the response of rightoids to celebrity lefts is basically the political version of deepfake porn
Leave Britney alone
>>
>>102313066
>nogen
such an SDGism
>>
File: 000000_17537_.png (2.46 MB, 1508x1032)
>>
>>102313632
lion with 8/8 dim/a is very good for character loras for pony based models imo, but you can't use the same LR as you'd use with the others
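A minimal sketch of the Lion update on a single scalar weight, to show why the LR does not transfer from Adam-style optimizers: the step is the sign of an interpolated momentum, so every weight moves by the full `lr` regardless of gradient size. Defaults here are assumptions for illustration.

```python
def sign(x):
    """Return -1, 0 or 1 depending on the sign of x."""
    return (x > 0) - (x < 0)

def lion_step(w, grad, m, lr=1e-4, b1=0.9, b2=0.99, weight_decay=0.0):
    """One Lion update for a single scalar weight (illustrative only)."""
    update = sign(b1 * m + (1 - b1) * grad)     # direction only, magnitude is 1
    w = w - lr * (update + weight_decay * w)    # decoupled weight decay
    m = b2 * m + (1 - b2) * grad                # momentum is updated afterwards
    return w, m
```

A tiny gradient and a huge one produce the same step size here, which is one reason a noticeably lower LR than AdamW is commonly suggested for Lion.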
>>
>>102304568
5700xt is gfx1010 though, I doubt it will work with gfx1031 rocm, no?


