[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (1.25 MB, 3264x3264)
1.25 MB
1.25 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102178647

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
File: FD_00273_.png (1.87 MB, 832x1216)
1.87 MB
1.87 MB PNG
>>
File: 1722903277074969.png (609 KB, 712x480)
609 KB
609 KB PNG
>>
File: 1706387634983995.png (1.07 MB, 1024x1024)
1.07 MB
1.07 MB PNG
Anon working at the coroner's office:
>>
File: ComfyUI_01119_.png (1.42 MB, 1024x1024)
1.42 MB
1.42 MB PNG
>did the joycaption shit locally in comfyUI for my next LoRa dataset
>it created 180 text files so far so good
>edited all descriptions in a long and tedious process
>time to finally run ai-toolkit feelsgoodman
>always results in failure after it runs like 20 - 50 steps
>"error UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe9 in position 846: invalid continuation byte"
>means that when Python tried to read and decode the contents of a (text) file as UTF-8 text, it found a byte (0xe9) that is not valid in UTF-8 encoding.
>apparently there are some invisible "problematic characters" in the text files that nobody can see but this shitty program
>created a powershell script to remove them all (thanks chatgpt you fuck)
>now it works

I feel like I'm the hackerman, but also my question is what went wrong with joycaption?
>>
>>102181733
Shinji looking fuckable
>>
File: 00009-2428755409.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
Can love bloom between space marines and eldar?
>>
>>102181766
Goddam that's one tall bitch, the dude should be >2 meters tall
>>
File: 1700809173459325.png (574 KB, 712x480)
574 KB
574 KB PNG
>>
File: FLUX00005.png (1.5 MB, 1536x1248)
1.5 MB
1.5 MB PNG
>>102181766
Personally I wouldn't marry someone if my last name were to be come Flinker
>>
File: 00023-1804073867.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>102181781
Better?
>>
>>102181837
No, take it back, I like tall women. Besides, the helmet looks nothing like what spacemarines use
>>
>>102181733
I am significantly fatter than that.
>>
>>102181845
Default Flux with no LoRA struggles with space marine helmets.
>>
File: Untitled-2.jpg (1.82 MB, 4096x4096)
1.82 MB
1.82 MB JPG
https://gofile.io/d/tXflpP

Okay, I completely my initial experiment with block 7 and 20 training only. I included a link to the file, it's only 10mb in size despite being at 128 dim. Total training time was a little over 2 hours, but it could be done much faster if I reduced the batch size to 1 (I used 4). Not sure how 'good' the output is. The subject is senjogahara hitagi. Feel free to download and play with it and give your thoughts on the results. I might try a different character next and see what comes out.
>>
File: 1699748149526404.gif (910 KB, 112x112)
910 KB
910 KB GIF
Are there any good online generators / generation services that are good and don't have limitations? Ideogram fulfilled this purpose before and now it's became nogger, unsure if Flux is a meme (I've been using Flux Pro too); any recommendations to use?
>>
>>102181739
é
>>
>>102181906
>but it could be done much faster if I reduced the batch size to 1 (I used 4)
How does this even work? Is training a specific block slower with a high batch size?
>>
Can someone explain how civitai is able to commercialise inference on flux dev? Like I can pay buzz to generate flux dev loras. How is that possible with the non commercial licencing everyone is complaining about?
>>
>>102181938
Well if you set it to train for 2000 steps and set the batch size to 4, each step takes longer, which makes me thing I probably "overtrained" this model. I'll take another stab shortly.
>>
>>102181941
Probably some bullshit about you're paying for the GPU time rather than paying for Flux
>>
>>102181739
sanitise your outputs, it will save you a lot of headaches in the future
I just write them all to a huge json file so I can edit them all at once, then export them as named txt captions when it's all good.
>>
is there any reason to train a lora for flux more than 512x512? it will just downscale all my big images right? or will the outputs be shit?
>>
>>102181958
interesting...so why does the pony guy keep bitching about flux? Can't he just do the same thing as civitai
>>
>>102181941
>>102181958
They probably have a license. You're allowed to commercialise Flux with permission from BFL.
>>
Is there a list of characters that Flux can recognize without a Lora? If not there should be
>>
File: FD_00061_.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>102181983
I don't know, but I always train at 1024x1024 and I have had almost every one turn out good. No idea what will happen at smaller res.
>>102182003
Sounds like a project. I saw someone on Reddit was doing a test with 4000 artists. Could do the same thing with characters.
>>
File: ComfyUI_05887_.png (981 KB, 1024x1024)
981 KB
981 KB PNG
>>102182003
>Is there a list of characters that Flux can recognize without a Lora? If not there should be
Here's the list anon:
- Hatsune Miku
- Donald Trump
>>
File: 2024-09-01_00053_.jpg (1.06 MB, 2496x3648)
1.06 MB
1.06 MB JPG
>>102181739
Do a comparison of both sets of text files, then you will find the changed characters and see which one JoyCaption created that is not UTF-8 .. tho its very strange most be some weird shit, since even Emoticons and Chinese is UTF-8
>>
File: FD_00105_.png (1.41 MB, 1536x1024)
1.41 MB
1.41 MB PNG
>>102182014
It also knows Queen Elizabeth and Meghan Markle. Also Sonic and Goku.
>>
File: ComfyUI_02147_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>*rapid knocking*
>Hello? Kohya? HELLO?
>Why haven't you put specific block training on Kohya? Hello?
>I messaged you on twitter, github, discord and the other github but you didn't pickup.
>My patreons and I are very worried.
>KOHYA?
>HELLO?
>>
File: ComfyUI_00870_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>102181936
I didnt see this shit in any of the files, I edited all of them manually and read through them and there was nothing wrong with them.
>>102181978
I have a script now that can sanitize them. its like one extra step and it doesnt take all that long, for 180 text files it was like not even 10 seconds.
>>
>>102182053
I have a hard time to believe that you could put all the characters/celebrities that exist on a single layer
>>
>>102182014
Does Mickey Mouse just fine. Also most iconic Star Wars bros and not to forget Pikachu and Sailor Moon
>>
>>102182075
I doubt anyone ever seriously entertains that idea, but it does make for very lightweight and targeted LoRA training.
>>
>>102182038
knows Vegeta to.. I should try Bulma.
>>
>>102182013
>Sounds like a project. I saw someone on Reddit was doing a test with 4000 artists. Could do the same thing with characters.
I wouldn't mind doing this.
Can I do it with this in Comfy if I set it to increment or does that only change the seed?
>>
File: 1713803267697469.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>
File: ComfyUI_02814_.png (1.01 MB, 1024x1024)
1.01 MB
1.01 MB PNG
So after doing some more tests on the layer 7 training I did, I feel it raped the weights fairly substantially despite the limited training. This is even at 1500 steps.

I will try again at a smaller batch size then one more time at a smaller dimension (currently at 128 dim), but it sure creates a lightweight model that trains fast!
>>
File: 2024-09-01_00054_.png (1.02 MB, 832x1216)
1.02 MB
1.02 MB PNG
>>102182122
ow your Monogatari lora is making progress! definitly Senjogahara looks now.. but I think 128 is not needed.. I ran a Harada/Disgea style lora at Dim 32/Alpha32 over night and it turned out perfect (but it only has 88 pictures, not your 1000+)
>>
>>102181678
Nice pose but man, the feet are bad. And the person overall looks more like sd1.5 early days.
>>
File: 1712881690593852.png (561 KB, 712x480)
561 KB
561 KB PNG
>>
>>102182111
seed only.
>>
>>102182142
You're a little bit behind on the update. I'm just testing the new target block training. I'm only using 60-ish images per character.
The idea being you can train on just two blocks and avoid ruining the weights in the process. I'm not so sure . It could just be my bad training data though.
>>
>>102182159
Dang. How can I do it then? Trying WAS Text Load Line From File but keep getting permission denied.
>>
>>102182199
Never mind I figured it out.
>>
File: 2024-09-01_00070_.jpg (1.13 MB, 3840x2160)
1.13 MB
1.13 MB JPG
>>102182168
I see.. three questions then:

Does your training data include alot of text? Cause if I remember correctly Monogatari has insane amounts of Japanese characters in some scenes. This might confuse the training.

Did you caption the dats? If so in what way. I used JoyCaption and then edited each file.. sounds like alot of work for such a huge dataset.

How big is a dim 128 lora? My 4000 step dim32 lora is already 335mb
>>
>>102182261
I cut a good deal of the text with that in mind. It might be picking up on the new newspaper wallpaper in hitagi's house though.

Caption is joycaption with the AI toolkit trigger word edited in.

Lora is 10mb
>>
>>102182168
is target block training faster? how long does 1000 steps take?
>>
File: 2024-09-01_00071_.png (567 KB, 1280x720)
567 KB
567 KB PNG
>>102182269
>Lora is 10mb
woa thats insanely small.. my harada lora is the first flux lora I made so I don't know the details on all of that but how did you get it so small? also won't it loose much data if you make such a small lora out of such a big data set?
>>
File: Untitled.png (4 KB, 598x33)
4 KB
4 KB PNG
>>102182291
This is at a batch size of 1 with 512 and 1024
batch size of 4 at 512 was about 4.5~ish it/s
>>
File: 2024-09-01_00065_.jpg (1.26 MB, 3840x2160)
1.26 MB
1.26 MB JPG
>>
>>102182305
Well, it's only training 2 blocks in the entire LoRA, that being the character output and... I'm not sure what the other one is doing.
>>
OK got this shit working. Who do we want to know about?
>>
>>102182356
Bulma! Also try iconic Disney characters
>>
>>102182321
>that being the character output
there is no "character output" layer or "face" layer
these few layer loras will have issues we can't predict
>>
File: Bulma-1.png (383 KB, 512x512)
383 KB
383 KB PNG
>>102182407
I mean more like a list of characters I can put into a text file. For example Naruto, One Piece, DBZ etc
Also it doesn't know Bulma
>>
File: 2024-09-01_00077_.jpg (1.23 MB, 3840x2160)
1.23 MB
1.23 MB JPG
>>
>>102182421
Therefore I should not test them? What are you trying to say here?
>>
>>102182426
no bulma ;_;
>>
File: 2024-09-01_00079_.png (1.05 MB, 1280x720)
1.05 MB
1.05 MB PNG
>>
>>102182431
I'm saying there is no "character output" layer or "face" layer
>>
>>102182431
they're being autistic, ignore them as god intended
>>
>>102182480
>they
>>
>>102182484
multiple layers.. therefore plural they .. not everything is about US gender wars
>>
>>102182484
you're being autistic, I will ignore you now as god intended
>>
>>102182490
the...layers...are being autistic?
>>
>>102182447
At some point during inference there is are layers that change the output. Call them what you want, but training only those layers seems to be something. No need to be a dismissive autist about it.
>>
>>102182509
>Layers
I mean blocks

Damn it, I've been getting that mixed up all day.
>>
>>102182501
they are digital neurons.. they sure could be
>>
>>102182509
all layers affect the output
>>
OK I don't know how long this will take or if I will get bored but here's where I'll upload the character tests for anyone who is interested
https://mega.nz/folder/a2Ri0b4Z#SNKrUAChFFeovXJZT3V5SA
>>
>102182523
PSA: This is Debo, ignore and do not be baited into engaging.
>>
>>102182539
Impossible for I am debo.
>>
>>102182426
Try Bleach
>>
>>102182536
Lovely! Ill follow your research ..

also damnit Trunks really got it bad eh?
>>
>>102182523
Not all blocks affect the output in the same way.
>>
>>102182585
No block affects just faces.
>>
File: file.png (602 KB, 634x651)
602 KB
602 KB PNG
>>102182536
I knew this would be bad but I didn't expect this bad
>>
>>102182595
*or just characters, or just whatever you're trying to train
>>
>>102182600
Lmao stupid AI, that's piccolo
>>
File: 2024-09-01_00087_.jpg (729 KB, 2160x3840)
729 KB
729 KB JPG
>>
>>102182611
this bothers me on a subconscious level and I can't explain why
and no it's not the signs
>>
>>102182600
at this point to save flux you have to give it to it the whole gelbooru dataset...
>>
>>102182621
good finetunes never ever
>>
File: file.png (628 KB, 625x618)
628 KB
628 KB PNG
>>102182536
lmaoooooo
>>
File: ComfyUI_00128_.png (1.02 MB, 1024x1024)
1.02 MB
1.02 MB PNG
>>
>>102182595
>>102182605
If training two blocks successfully trains the character I was trying to train then what does that imply?
>>
>>102182679
that you're not stress testing the lora enough
>>
>>102182611
it's that girl from Disgaea right?
>>
>>102182689
I'm done replying to debo.
>>
File: 2024-09-01_00088_.png (1.13 MB, 1280x720)
1.13 MB
1.13 MB PNG
>>102182704
ya, but I didn't train it on specific characters .. just a general Harada style lora
>>
>>102182673
Hi Comfy can you give city a handjob so he can fix the prompt switching issue with gguf?
>>
>>102182679
It means it is enough for your needs, it does not mean the blocks are "character" blocks.
Can it generate the character in a recognizable way at a distance, does it keep smaller details when you prompt for changes like a different outfit, does it artifact when stacked with other loras?
>>102182710
rent free
>>
File: ComfyUI_00161_.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
>>102182726
what's the prompt fixing issue?
>>
>>102182710
that's the good move, there's no point in educating that retard, you'll just waste your time
>>
>>102182563
>>102182631
Trunks is a pretty unfortunate name.
>>
>>102182741
sorry you can't keep up intellectually
>>
>>102182742
it should work though if you say "Trunks from Dragon Ball Z"
>>
>>102182738
When using GGUF if I change the prompt without manually unloading the model it OOMs and I have to kill ComfyUI and restart.
>>
>>102182738
>>102182757
Doesn't happen with fp
>>
Some random Disney characters.
Is there any specific IPs you want to know about?
>>
>>102182777
>Dalmatian Cruella
>>
>>102182777
>Kala
kek
>>
File: 2024-09-01_00095_.png (699 KB, 720x1280)
699 KB
699 KB PNG
>>102182620
I wonder why. The Disgea/Harada style is very dividing, you love it or you hate it. Never heard anyone get uneasy by it tho.
>>
File: 1702415926705078.png (1.94 MB, 1024x1024)
1.94 MB
1.94 MB PNG
>>
deus ex style LoRA going poorly
>>
File: 2024-09-01_00098_.png (672 KB, 1280x1280)
672 KB
672 KB PNG
>>102182820
what did you use as data? artwork or game screenshots?
>>
>>102182834
Screenshots. This is epoch 5 of 15 so will see what happens.
>>
>>
>>102182777
I see that you're working on One Piece now, that's a good idea to add the series names after the character name, it will help the model a little, especially after the Trunk fiasco kek
>>
Everyone in One Piece is Luffy
>>
>>102182892
Yeah after seeing Trunks done so dirty I thought it would be a good idea. I might re-run DBZ with this change.
>>
File: 2024-09-01_00101_.png (2.32 MB, 1280x1216)
2.32 MB
2.32 MB PNG
>>102182900
tony tony chopper and Nami seem to be alright tho
>>
File: 000000_17162_.png (2.31 MB, 1508x1032)
2.31 MB
2.31 MB PNG
>>102182673
Mornin' Comfy, cute,,
>>
>>102182673
Are you actually comfy? Why is your gen number so low? Shouldn't it be like 50k by now?
>>
>>102182053
lol
>>
Thread dead
>>
File: ComfyUI_hgdf_00051_.png (1.14 MB, 1280x720)
1.14 MB
1.14 MB PNG
>>102183076
lies
>>
File: 1.jpg (112 KB, 1280x1280)
112 KB
112 KB JPG
>>
>>102183095
no debo, we really don't need you
>>
>>102182842
First Deus Ex?
>>
>>102182900
does it know star trek characters? true blood? dexter?
>>
>>102182757
werks for me
t. 8gb vramlet, using Q4_K_S flux and Q8 t5xxl, both loading into VRAM one after another then unloading upon completion.
>>
File: 2024-09-01_00111_.png (1.24 MB, 1280x720)
1.24 MB
1.24 MB PNG
>>
>>102183162
For tv show and movies it's better just to prompt the actors name, no?
>>102183156
Yeah. I think it's starting to shittify on epoch 8. I don't have hopes on it though.
>>
File: 1694267362271250.png (1.39 MB, 1024x1024)
1.39 MB
1.39 MB PNG
>yakuza in prompt
>dude missing his pinky finger
there's something about flux that surprises me
>>
>>102183170
Works if I use the Q6 quants but I don't want to do that when I can use FP8 for better quality just fine. It was working on Q8 last week, then something changed and now it's fucky.
>>
>>102183247
Better to prompt character, actor and tvshow all
>>
>>102183274
Why? If it knows the actor it knows the show character
>>
File: 2024-09-01_00106_.jpg (1.23 MB, 2496x3648)
1.23 MB
1.23 MB JPG
>>
>>102181739
and now my LoRa finished backing.
>>
>>102183162
It mixes Leonard Nimoy and Zachary Quinto when you prompt for Spock
>>
File: 2024-09-01_00122_.jpg (1.4 MB, 2496x3648)
1.4 MB
1.4 MB JPG
>>
Flux knows a lot of super heroes
>>
>>102183347
cool.. Aunt May tho lol
>>
>>102183347
it only knows what the llm knew how to tag, so mostly western trivia, and not a lot more.
>>
>>102183306
that's pretty neat

>>102183347
Judge Dredd? Lobo?
>>
File: file2.png (1.11 MB, 1368x1024)
1.11 MB
1.11 MB PNG
>>
>>102183369
You have the power to test specific characters I missed yourself. But yes it knows Dredd and Lobo.
>>
File: bComfyUI_114629_.jpg (1.48 MB, 3072x1536)
1.48 MB
1.48 MB JPG
>>
https://drive.google.com/drive/folders/1eGrbstWLGOlinNL_d7WzaYcOzpSp5a9x
So basically flux dev knows 4 styles max kek
>>
>>102183632
You are visually retarded
>>
>>102183649
oh hi debo
>>
File: 2024-09-01_00130_.png (957 KB, 1280x720)
957 KB
957 KB PNG
>>
>>102183657
>everyone who disagrees with me is debo
get help
>>
>>102183657
>>102183689
guys you have your own thread, no one is fooled by this
>>
this whole debo thing is poisoning the thread in a very unfortunate manner. was gonna post my gens but why?
>>102183347
very much appreciated.
>>
>>102183730
You're welcome. Doing random cartoon characters next then going to bed. Might do more tomorrow
>>
Trying to get chat gpt to generate a list of characters to test was a mistake
>>
>>102183825
the spider-ham unlimited. this is art
>>
File: file.png (1.34 MB, 1696x857)
1.34 MB
1.34 MB PNG
>1.8m
nice
>>
Johnny Bravo looking slick.
>>
flux winning on all levels. feet > dashboard, 100% awesome. how much win can a man take
>>102183898
slick glasses
>>
File: 2024-09-01_00140_.png (865 KB, 1280x720)
865 KB
865 KB PNG
>>
File: FD_00001_.png (959 KB, 1024x1024)
959 KB
959 KB PNG
I think my sample prompt was just shit. Deus Ex success.
>>
>>102184029
>I think my sample prompt was just shit.
Never trust samples, they lie. They always lie.
>>
File: 1708942273823.jpg (242 KB, 1024x1024)
242 KB
242 KB JPG
>>
File: 2024-09-01_00124_.png (1.38 MB, 832x1216)
1.38 MB
1.38 MB PNG
>>102184029
niice
>>
File: 1697806189755.jpg (496 KB, 1024x1024)
496 KB
496 KB JPG
>>
>>102184040
>>102184064
Not finished yet but I will upload it tomorrow and post link.
>>
>>102182013
>>102182111
>list of characters
>test with 4000 artists
I never understand why people do this. *Every* character/artist a model knows? Why not test every noun in the dictionary while you're at it? The output is generally not a helpful resource (maybe styles/artists, but certainly not "characters it knows"), or at least not in the web form they're usually presented.

What I can potentially see as being a helpful resource is a ui extension similar to booru tag autocomplete where you'd pick the model and load in that database, and when you type "squ" will show you an autocomplete dropdown with "squirtle (pokemon)" in yellow to indicate he's unreliably understood, "squidward (spongebob squarepants)" in green to indicate the model knows him well, "squirt (finding nemo)" in red to indicate it has no idea. Making the database would be a pain, though, although you could definitely fully automate the process by feeding in a list to generate, joycaptioning the output, and searching the caption for the input character name. do X images per char to test for success, also do Y joycaptions per image if doing so is cheap, overall score out of X*Y is the model's knowledge of that character. Really only saves you 1 gen though, by letting you just abort trying to gen unknown characters and look
>>
>>102183849
to think i could make $140k by generating sd text
>>
>>102184118
Because it's currently unknown information. Where are you going to get your extension from?
>>
>>102184118
>joycaptioning the output, and searching the caption for the input character name
That's great except joycaption doesn't know most characters lol
>>
>>102184189
it also says paintings and sketches are digital artwork
>>
>>102184169
i didn't mean creating the database to input into the extension, i mean presenting that information in the form of some unprocessable website or rentry guide which is invariably how it ends up
>>102184189
ah, damn, i tested on a couple of gen 1 pokemon to check that it at least wasn't averse to naming IPs and it did fine, but I guess if that's the case it's not automatable yet. and yeah i just tested a couple more and it can't do anything remotely obscure, failed at gill from finding nemo and tokuchi from one outs. bit of a bottleneck. we'll probably have something within a year, but actually I suppose reverse google image search might do a better job with its advantages of checking similarity against real tagged images, don't know what's usable in terms of api for that though
>>
>>102184291
OK but how do we know what characters and art styles it knows? How will this extension you suggest going to know this information?
>>
It's pretty funny everyone in the FSM has decided that it's just a little too free, with ai art.
>>
File: 1697073191300754.png (1.75 MB, 1152x896)
1.75 MB
1.75 MB PNG
>>
File: 000000_17167_.png (2.64 MB, 1508x1032)
2.64 MB
2.64 MB PNG
>>102184075
Nice.
>>
File: 1699045104493216.png (1.34 MB, 1152x896)
1.34 MB
1.34 MB PNG
>>
>>102184410
add fantasy wildlife
>>
Casual user here. I feel like I'm having way more trouble getting Flux to generate a specific art style or architecture style than Stable Diffusion. I wanted to generate ancient Greek architecture and cityscapes for inspiration, but all my prompting just spits out more recent generic Mediterranean or Italian architecture.
>>
>>102184419
ets?
>>
>>102183670
Overcooked
>>
File: FLUX~5.jpg (2.67 MB, 2992x2336)
2.67 MB
2.67 MB JPG
>>102184439
Add a year and something like
archaic, vintage, retro, historical
>>
>>102184210
This is a digital artwork, likely created in a computer-generated imagery (CGI) style.
This is a digital artwork, likely created in a digital painting style.
The image is a digital artwork, likely created using computer software.
>>
>>102184440
https://civitai.com/models/659794?modelVersionId=738279
>>
>>102184425
>Great idea.
>>
File: 1696543950236.jpg (509 KB, 1024x1024)
509 KB
509 KB JPG
>>
File: Untitled-1.jpg (730 KB, 1920x2160)
730 KB
730 KB JPG
So I tried training again with only targeting 2 blocks but this time at a batch size of one, much less aggressive training and I got what I think a better, if slightly weaker results

The activation keyword is arararagi karen. Once again, the total size of the LoRA file was only 10MB and did a pretty good job a reproducing the character.

https://gofile.io/d/wpVjPJ
>>
>>102184485
how well does it generalize
>>
>>102184491
>generalize

as in?
>>
File: FD_00018_.png (713 KB, 1024x1024)
713 KB
713 KB PNG
https://civitai.com/models/709157?modelVersionId=793219
>>
>>102184510
How about Father Time scolds a koala.
>>
File: mirror.png (1.6 MB, 1024x1024)
1.6 MB
1.6 MB PNG
>>
>>102184526
I literally do not understand. It's a LoRA of a specific character. You type in the character's name and what you want to see them doing.
>>
File: 1719712500775788.png (1.84 MB, 1152x896)
1.84 MB
1.84 MB PNG
>>
>finetunes and loras for XL and pony are coming out daily by the dozens
>only one or two slop loras for flux that barely manage to summon recognizable characters from the depths of the uncanny valley
What went wrong?
>>
>>102184522
Only 18mb nice
>>
If someone posts images using your model on civitai you get buzz for each image or only once for each user?
>>
>>102184583
this is bait, probably debo trying a new angle
>>
File: 1724744278715791.png (1.85 MB, 1152x896)
1.85 MB
1.85 MB PNG
>>
>>102184583
>softcore loras for flux STILL mostly a hit and miss struggle
>porn nowhere even near what 1.5 or xl can do
>1.5 still king when it comes to porn inpaint
>>
>>102184363
FSM?
>>
File: 1698710622655.jpg (817 KB, 1024x1024)
817 KB
817 KB JPG
>>
File: 1709513629088898.png (1.58 MB, 1152x896)
1.58 MB
1.58 MB PNG
>>
File: 1703094294700.jpg (498 KB, 1024x1024)
498 KB
498 KB JPG
>>
File: 1717426188002385.png (1.69 MB, 1152x896)
1.69 MB
1.69 MB PNG
>>
Are there some good pony based models yet other than the months old original pony base and autism mixes?
>>
how prevent boob armour
>>
File: 00_14.jpg (202 KB, 1552x1200)
202 KB
202 KB JPG
Any anons have plans for the fall?
>>
>>102184708
What is this, Discord?
>>
File: FD_00040_.png (1.17 MB, 1024x1024)
1.17 MB
1.17 MB PNG
>>
>>102184589
>If someone posts images using your model on civitai you get buzz for each image or only once for each user?
>>
File: 00046-3174201157.png (1.12 MB, 808x1008)
1.12 MB
1.12 MB PNG
>>
File: file.png (633 KB, 512x512)
633 KB
633 KB PNG
>>
>>102184797
How many epochs left?
>>
>>102184817
We're on a journey. I still haven't added all the pop culture and now I'm adding screengrabs from various movies.
>>
>>102184522
What do you know about Operation Warp Speed?
>>
>>102184583
The hype has always been a lie.
>>
File: 00050-3672676105.png (821 KB, 688x1008)
821 KB
821 KB PNG
>>
What exciting discoveries in LoRA training did you make today?
>>
File: 1721356322023948.png (1.97 MB, 1152x896)
1.97 MB
1.97 MB PNG
>>
More Deus Ex please
>>
flickr set reached 1B. processing to parquet now then uploading
>>
File: ComfyUI_01538_.png (1.72 MB, 1440x1920)
1.72 MB
1.72 MB PNG
>>
File: 1694083679547214.png (2.01 MB, 1152x896)
2.01 MB
2.01 MB PNG
>>
>>102184851
Awesome, godspeed
>>
File: file.png (633 KB, 512x512)
633 KB
633 KB PNG
>>
>>102184544
ok, [character] dressed as Father Time, scolds a koala
>>
>>102185048
kekked
>>
>>102185175
Dude, I went to bed ages ago. If you want to test it, the Lora is like 9.5 megabytes.
>>
>>102185187
bad sleep hygiene
>>
>>102184725
unpossible
>>
>>102184601
free speech movement, on the Berkeley campus mainly.
>>
>>102185200
I known
>>
>>102185219
free speech for me, not for thee
turns out when people have free speech they can say uncomfortable things, we can't have that
>>
File: 1.jpg (223 KB, 1080x1080)
223 KB
223 KB JPG
>>102185219
>>102185232

Why even bother posting on civitai or any cucked ai generating site with all the current and future rules and laws and instead not pretend you are an artist and created the art yourself and post it on tumblr or deviantart? Not like they could tell you used AI or not for artistic images

https://www.reddit.com/r/StableDiffusion/comments/1f5dtmf/california_bill_set_to_ban_civitai_huggingface/
>>
>>102185288
California is just committing suicide. They have some of the worst laws and their government is extremely unpopular and their governor is only in office because of extreme corruption (he's been legitimately recalled at least once -- but don't worry they found 1 million votes behind a dumpster). AI is here and here to stay, it's like trying to ban digital art tools because they threaten traditional media artists. But where the fuck was California when Disney was outsourcing all those Californian, American jobs in animation to foreign countries, something still ongoing today?

None of it matters, the AI laws will be repealed, AI is the most free speech thing possible and protected under the 1st amendment. We have centuries of precedent that show that we don't give a fuck if speech has potential harm. We don't ban .mp4s because someone can pirate a movie.
>>
File: 00054-2663734528.png (857 KB, 688x1008)
857 KB
857 KB PNG
face/body mismatch
>>
File: ComfyUI_01545_.png (1.98 MB, 1152x1536)
1.98 MB
1.98 MB PNG
>>
>>102185356
>We have centuries of precedent that show that we don't give a fuck if speech has potential harm.
You seem to be forgetting that most people, especially politicians, are clowns that don't consider images and videos to be part of free speech.
>>
>>102185356
They seem to want to put a watermark on each generated image by online ai. Looks like they want to make it more expensive to generate ai content locally and not on a paid site too.

There's a double standard too where if you draw a cartoon of mocking Trump or Kamala it's freedom of press but if you make a similar cartoon using AI tools they want it to be illegal.
>>
>>102185417
Good fucking thing we have a supreme court and 50 states. Let me know if you're confused.
>>
>>102185427
>>102185356
But AI is different from other stuff because it looks so real
And it takes zero effort to make an AI image
It's just different... Okay? Stop. It's just different, anon!
>>
So how the FUCK do I use Union ControlNets without OOMing? Using the usual "Apply ControlNet (Advanced)" node throws an error about being unable to multiply matrices, and all other suggested workflows use the "ControlNetApply SD3 and HunyuanDiT" which results in an OOM error.
XLabs ControlNets work fine with the former node but I need posing control which is only available on the latter.
Running on a 3090 in low VRAM mode if that matters, and no I won't use a shitty low quality quant of Flux either.
>>
>>102185427
It's just double dip nonsense, there's already laws on the books that cover all the potential harm you can already do with AI. And it really doesn't matter if we watermark or don't watermark these AI images, legitimate images are going to get watermarks/encrypted signatures just to prove they're not digitally altered or were created with a real device
>>
>>102185475
>and no I won't use a shitty low quality quant of Flux either
buy a 48GB card then
>>
>>102185488
What happens if people figure out how to make false watermarks? Could just print the AI image and take a photo of it with your phone which will generate a watermark
>>
File: ComfyUI_01547_.jpg (1.35 MB, 2304x1728)
1.35 MB
1.35 MB JPG
>>
File: 2024-09-01_00182_.png (1.46 MB, 832x1216)
1.46 MB
1.46 MB PNG
>>
>>102185592
very cool
>>
>>102185498
Flux has enough trouble being accurate as is, I'm not introducing further loss into the gens. Also not my fault the retarded devs thought it'd be a brilliant idea to shove every mode into a single massive file that has to be loaded all at once. Or that NJudea is stingy with VRAM even on its flagship cards.
Worst case I'll just manually draw out some pseudo-depth map and plug that into the XLabs ControlNet instead, but it's absolutely retarded that they just HAD to pack everything into one file.
>>
File: 00003-4221760617.png (1.09 MB, 896x1152)
1.09 MB
1.09 MB PNG
>>
File: ComfyUI_01544_.png (1.29 MB, 960x1280)
1.29 MB
1.29 MB PNG
>>102185604
sank yew
>>
>>102185522
The main point of this is to prove authenticity and have a chain of trust. Spoofing a signature really doesn't achieve either and you likely would not be able to spoof a real device signature which will probably be something signed like <Manufacturer> <Device Model> <Serial Number> <Time Stamp> + <Image metadata>.

Modern cameras already have depth sensors, so if you took a picture of a photograph it would reveal it was a fake because the signed image would include depth information.

Even if you refuse to watermark AI images, real pictures will absolutely watermark their images, because those people have a vested interest to not have their images called AI. This will all be done without any laws being passed. Camera manufacturers will do this because they don't want to be supplanted by AI, creators will do this because they don't want to be called AI users.
>>
File: ComfyUI_01000_.png (956 KB, 1024x1024)
956 KB
956 KB PNG
>>102185626
>Buttchin
>>
File: 1712014399932333.png (1.86 MB, 1152x896)
1.86 MB
1.86 MB PNG
>>
File: 1697285228086239.png (8 KB, 239x429)
8 KB
8 KB PNG
Jesus, these Union CNs are unusable even on Q8 Flux. 24 GB memory overrun? What the FUCK is going on here exactly?
>>
>>102181685
is that a stack of crab cakes?
>>
>model has seen millions, maybe billions of examples.of most things in the world
>Anon thinks his shitty captions made in joy caption that are full of hallucinations and of a completely different style to the way the base model was captioned is going to do anything but confused and ruin the model.

Why caption anything but names of people? Seems like it does more harm than good.
>>
File: 2024-09-01_00208_.png (1.81 MB, 832x1216)
1.81 MB
1.81 MB PNG
>>
>>102185829
I'm almost envious that you can't get it to run because you'll spare yourself the disappointment of knowing they're complete shit.
>>
>>102185907
I'd love to see the results of your theory anon.
>>
File: 2024-09-01_00177_.png (1.67 MB, 832x1216)
1.67 MB
1.67 MB PNG
>>
File: ComfyUI_Flux_34.png (967 KB, 1280x720)
967 KB
967 KB PNG
>>
>>102185970
ha, realizing rudd was the o.g. proompter.
>>
>>102185934
Plenty of people besides myself say the same
>>
>>102185970
Kek
>>
File: file.png (726 KB, 1280x896)
726 KB
726 KB PNG
>>102186000
>>
>>102185847
it's a dripping shortstack
>>
File: 170259_00005_.png (920 KB, 1024x768)
920 KB
920 KB PNG
Latest local img2video news?
Cog seems ghey in this respect, so that's another dead on arrival project. (Oh no I have to run a llm on the first image to feed to the rest and then constrain the videocheckpoints output to adhere to the structure of the initial image! This is too technical a problem ahhhh! I will commit Chinese Hari-kiri (eating Chinese street food made with sewer oil))

Anything else in the pipeline?
>>
File: 00000_17174_.png (2.53 MB, 1508x1032)
2.53 MB
2.53 MB PNG
>>
>>102186038
Flux is working on a video model. I can't wait to make PG-13 smut with it.
>>
>>102186054
Flux Video will prosper if it can make toes flex realistically
>>
File: 2024-09-01_00217_.png (731 KB, 832x1216)
731 KB
731 KB PNG
>>
File: 1708363099210.jpg (1.19 MB, 1024x1024)
1.19 MB
1.19 MB JPG
>>
File: 2024-09-01_00174_.png (1.54 MB, 832x1216)
1.54 MB
1.54 MB PNG
damn making this lora was so easy and successful, FLUX is very easy to train on .. what LoRA should I do next?
>>
File: 1709819101832.jpg (208 KB, 1024x1024)
208 KB
208 KB JPG
>>
File: 1694090733808.jpg (1.47 MB, 1024x1024)
1.47 MB
1.47 MB JPG
>>
File: 2024-09-01_00228_.png (1.53 MB, 832x1216)
1.53 MB
1.53 MB PNG
>>102186443
nice prompt
>>
File: xyz_grid-0015-2801078985.png (2.38 MB, 1848x1008)
2.38 MB
2.38 MB PNG
laughing. going through 15 epochs, 3 at a time, to narrow down to like 4 epochs/versions of this lora. here, 1st is kept.
>>
File: file.png (1.3 MB, 1280x896)
1.3 MB
1.3 MB PNG
>>
File: 00143-4221760618.jpg (695 KB, 1440x1920)
695 KB
695 KB JPG
>>
File: 1701894334887.jpg (1.43 MB, 1024x1024)
1.43 MB
1.43 MB JPG
>>
File: ComfyUI_Flux_38.jpg (2.02 MB, 2048x2048)
2.02 MB
2.02 MB JPG
>>102185992
>>102186011
Comic panels are hit or miss, but still pretty damn impressive.
>>
>>102186593
Flux is probably smart enough to be trained on proper comic panels as long as the total written word count like less than 20.
>>
File: 00152-4221760623.jpg (1 MB, 1613x2150)
1 MB
1 MB JPG
>>
>>
File: delux_sg_00101_.png (1.99 MB, 1536x968)
1.99 MB
1.99 MB PNG
>>102186443
can't believe I heard it here first. rip in peace

>>102186594
that sounds awful

>>102186666
checked
>>
File: 1721959715586.jpg (448 KB, 1024x1024)
448 KB
448 KB JPG
>>
File: file.png (1.3 MB, 1280x896)
1.3 MB
1.3 MB PNG
>>
File: FluxDev_04262_.jpg (166 KB, 832x1216)
166 KB
166 KB JPG
>>
File: 1724549917121.jpg (748 KB, 1024x1024)
748 KB
748 KB JPG
>>
>>102186750
that's great
>>
https://github.com/THUDM/CogVideo/issues/88#issuecomment-2273572339
>Yes, the above reply means that we do not plan to open source the image-generated video model in the near future. Please pay attention and look forward to it.
fuck off, image to video is what would bring some hype to their model
>>
>>102186054
Don't have much hope that'll fit in consumer cards, looking back, video models are larger than image generating models.
Maybe they will have learned some equivalent gguf magic before they release it, it's possible I suppose.
>>
File: 00162-4221760618.jpg (1.1 MB, 1613x2150)
1.1 MB
1.1 MB JPG
>>
Thinking about using my 500 buzz to train a lora. Is it a good idea? How long does it take?
>>
File: 00001-507899290.jpg (3.41 MB, 2576x2576)
3.41 MB
3.41 MB JPG
>>
File: GAqbyENboAA07kF.png (74 KB, 227x245)
74 KB
74 KB PNG
Any decent way to access Forge locally other than through a browser?
Is there an app with a good UI I can use?
I want to gen shit from my tablet.
>>
File: 1725067849277.jpg (852 KB, 1024x1024)
852 KB
852 KB JPG
>>
>>102186813
>we do not plan to
>but look forward to it
Getting mixed messages here.
>>
>>102186931
I thought you needed 2000 buzz, but maybe that's just for flux and you are doing something else?

Anyway people seem to think it's pretty good
>>
File: 2024-09-01_00230_.jpg (1.34 MB, 2496x3648)
1.34 MB
1.34 MB JPG
>>102186946
>Is there an app with a good UI I can use?
don't think so
>>102186946
>I want to gen shit from my tablet.
does your tablet not have a browser?
>>
File: delux_sg_00102_.png (1.84 MB, 1536x968)
1.84 MB
1.84 MB PNG
>>102186931
buy a coffee instead
>>
>>102186962
desu it doesn't matter much, Black Forest Lab will deliver with some good video model shit like they did with Flux
>>
File: 1723987264857.jpg (792 KB, 1024x1024)
792 KB
792 KB JPG
>>
>>102186946
>/g/
>>
>>102186943
left eye looks weird, but cool style, I guess this is Silent Hill 2 inspired.
>>
File: ComfyUI_01555_.png (2.96 MB, 1440x1920)
2.96 MB
2.96 MB PNG
>>
>>102186983
Thank you for your valuable opinion.
>>
File: 1696577598398.jpg (283 KB, 1024x1024)
283 KB
283 KB JPG
>>
File: 1710247204004.jpg (356 KB, 1024x1024)
356 KB
356 KB JPG
>>
File: 00006-529038626.jpg (3.24 MB, 2576x2576)
3.24 MB
3.24 MB JPG
>>102186991
> I guess this is Silent Hill 2 inspired
Nope
>>
>>102187011
nice one anon, lora?
>>
>>102187073
reminds me of nu metal etc music videos
>>
It's here, the next loaf of...
>>102187084
>>102187084
>>102187084
>>
File: 1723403046125.jpg (680 KB, 1024x1024)
680 KB
680 KB JPG
>>
>>102186978
Can't you read?
>>102186990
bot
>>
Trying to run FLUX controlnet for the first time on ComfyUI.


I do not understand in which folder should I put my controlent safternsors (flux-depth-controlnet-v3).
>>
File: ifx337.jpg (432 KB, 1024x1024)
432 KB
432 KB JPG
>>
File: ifx335.jpg (428 KB, 1024x1024)
428 KB
428 KB JPG
>>
>>102183849
dude that's ART.

It's on FIIIILM
>>
File: ComfyUI_00199_.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
>>
My jihad for the booba continues.
>>
TIDDY AKBAR

(yes it had the blobs)
>>
File: ifx340.png (1.57 MB, 1024x1024)
1.57 MB
1.57 MB PNG
>>
>>102187414
safetensor doesn't tell you what folder.

it'll be in one of the models folders.

I have some under models/checkpoint

might be where yours goes
>>
>>102188752
checkpoints*
>>
File: ifx341.jpg (368 KB, 1024x1024)
368 KB
368 KB JPG



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.