[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (1.18 MB, 3264x3264)
1.18 MB
1.18 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>101779850

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Kolors
https://gokaygokay-kolors.hf.space
Nodes: https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

>AuraFlow
https://fal.ai/models/fal-ai/aura-flow
https://huggingface.co/fal/AuraFlows

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: 2024-08-08_00310_.png (1.92 MB, 768x1680)
1.92 MB
1.92 MB PNG
>>
>>
i didn't know ldg was the child abuser derivative of sdg
>>
>>101784258
ignore the shitstain, he just cries for attention, the less you give him the better
>>
>>101784126
that's some ugly knees. even mine look better
>>
>>101784292
those fashion girls be anorexic, they are basically skeletons
>>
File: ComfyUI_temp_sprbe_00167_.png (1.4 MB, 1024x1144)
1.4 MB
1.4 MB PNG
>>>101783080
had to learn comfy for flux and now im hooked
>>
>>
>>101784284
this.
>>
File: FD_00001_.jpg (224 KB, 1024x1024)
224 KB
224 KB JPG
I want my GPU to play Diablo but I can't stop proooompting
>>
File: up_0004.jpg (703 KB, 3616x5120)
703 KB
703 KB JPG
>>
File: 2024-08-08_00332_.png (2.16 MB, 1680x1024)
2.16 MB
2.16 MB PNG
>>
File: 1715555328596266.png (1.56 MB, 1024x1024)
1.56 MB
1.56 MB PNG
>>
File: 2024-08-08_00335_.png (2.46 MB, 1680x1024)
2.46 MB
2.46 MB PNG
>>
File: ComfyUI_temp_sprbe_00179_.png (1.47 MB, 1024x1144)
1.47 MB
1.47 MB PNG
>>101784356
i want to code but i cant stop proompting goblin sluts
>>
File: download (35).jpg (258 KB, 1024x1024)
258 KB
258 KB JPG
>>101784325
That's how he gets ya.
I started using comfy for some shit A1111 couldn't do at the time and now it's all I use because of how autistic I can make it.
>>
File: ComfyUI_temp_sprbe_00117_.png (1.32 MB, 1024x1144)
1.32 MB
1.32 MB PNG
>>101784467
so true, the autism has caught me
>>
File: Flux_00433_.png (927 KB, 1024x1024)
927 KB
927 KB PNG
[spoiler]Rakamakafon
>>
>>101784638
i got that reference.
>>
File: 2024-08-08_00349_.png (1.71 MB, 768x1280)
1.71 MB
1.71 MB PNG
>>
File: Flux_00441_.png (1.16 MB, 1024x1024)
1.16 MB
1.16 MB PNG
>>101784652
>>
File: 2024-08-08_00352_.png (1.93 MB, 768x1280)
1.93 MB
1.93 MB PNG
>>
File: FD_00003_.jpg (261 KB, 1024x1024)
261 KB
261 KB JPG
>>101784638
I hate that song. It was played 100x a day on the radio for months.
>>
File: file.png (1.43 MB, 1024x1024)
1.43 MB
1.43 MB PNG
>>
File: Flux_00434_.png (951 KB, 1024x1024)
951 KB
951 KB PNG
>>101784696
>>
>>101784696
i mean, turn on the radio now lol, think you still gonna hate it?
>>
File: ComfyUI_02650_.png (1.35 MB, 1152x896)
1.35 MB
1.35 MB PNG
>>
File: 2024-08-08_00360_.png (1.59 MB, 768x1280)
1.59 MB
1.59 MB PNG
>>
File: file.png (1.54 MB, 1024x1024)
1.54 MB
1.54 MB PNG
>>101784735
your boobs are cute!
>>
File: Flux_00451_.png (1.39 MB, 1024x1024)
1.39 MB
1.39 MB PNG
>mfw woke up for another 16 hour prompting session
>>
>>101784779
me fr fr
>>
File: ComfyUI_02644_.jpg (1.14 MB, 2048x2048)
1.14 MB
1.14 MB JPG
>>101784761
ty i grew them myself
>>
File: ComfyUI_Flux_5731.jpg (153 KB, 1392x512)
153 KB
153 KB JPG
>>
>>101784725
yes, you "le wrong generation" retard
>>
File: Flux_00350_.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>101784735
can you share your prompt pls
>>
>>101784495
Gonna need this proompt if this is flux
>>
>>101784811
I should've been born in 10,000BC, no one even unga bunga anymore...no one even thinks about the invention of fire or anything
>>
>>
File: image.jpg (144 KB, 1024x768)
144 KB
144 KB JPG
>>
>>101784779
He should be emaciated from having aids like debo
>>
File: ComfyUI_02485_.png (3.35 MB, 1536x1536)
3.35 MB
3.35 MB PNG
>>101784812
no, because you prompt little kids
>>
>>
File: ComfyUI_02655_.png (1.34 MB, 1152x896)
1.34 MB
1.34 MB PNG
>>
File: 1707536050845710.jpg (36 KB, 720x712)
36 KB
36 KB JPG
>>101784811
yes, the waves being filled with nigger animal screeching is completely fucking normal and in no way subpar when compared to the diversity of the preceding 100 years since music radio became a thing.
you dumb fucking cunt.
>>
File: 2024-08-08_00377_.png (1.11 MB, 768x1280)
1.11 MB
1.11 MB PNG
>>
why can't the falseflagging avatartroons go to the cesspool they came from?
>>
>>101784875
>diversity
lol
lmao even
/pol/ is that way
>>
File: IMG_0267.png (1.11 MB, 1152x768)
1.11 MB
1.11 MB PNG
>>
File: 67.png (739 KB, 1344x768)
739 KB
739 KB PNG
>>
>>101784912
you have mental issues that trigger at code words or a Really Fucking Poor grasp of the english language.
>>
>>
>>101784935
you tried your best and failed, again /pol/ is that way, bye bye
>>
File: 2024-08-08_00380_.png (1.38 MB, 768x1280)
1.38 MB
1.38 MB PNG
>>
File: 1692356990015839.jpg (37 KB, 960x952)
37 KB
37 KB JPG
>>101784946
>>
>>
File: ComfyUI_Flux_32.png (1.03 MB, 1344x768)
1.03 MB
1.03 MB PNG
>>101784638
>>
>>101784957
you know you can just change to stations that don't run rap 24/7, right?
>>
File: file.png (988 KB, 1024x768)
988 KB
988 KB PNG
>>101784957
almost
>>
File: 2024-08-08_00394_.png (2.01 MB, 1280x1024)
2.01 MB
2.01 MB PNG
>>
File: FD_00009_.jpg (206 KB, 1024x1024)
206 KB
206 KB JPG
>>101784725
I stopped listening to music after I learned how to play guitar. Now I just play it.
>>
File: FD_00005_.jpg (229 KB, 1024x1024)
229 KB
229 KB JPG
>>101784779
For actual years now you have impressed me with how absolutely fucking disgusting you can gen shit.
It's really quite admirable.
>>
File: file.png (843 KB, 1024x768)
843 KB
843 KB PNG
>>101785025
>>
>>101784858
>because you prompt little kids
nothing wrong with that
>>
>>101785128
imagine ABUSING abstractions of little girls... you ruin their conceptualized life in the human consciousness
>>
>>
File: FD_00015_.jpg (240 KB, 1024x1024)
240 KB
240 KB JPG
>>
>>101785173
>>101785176
I never said anything about sexualizing them, but whats the harm in a picture of a girl holding a Graphics card?
Americans and their pedophilia obsession is really getting out of hand that they begin to rage at any picture of a little girl.
>>
>>101785255
>when the barbarian multi-classes into bard
>>
>>101785256
by posting a little girl in this thread, you subject her to sexualization by evil pedos and you're an evil monster
>>
>>101785256
of course this guy is german
>>
>>101785285
its not my problem what twisted shit goes on inside your head.
>>101785290
how did you know?
>>
>>101785304
>how did you know?
you're a pedophile. that's 75% of german males
>>
>>101785319
Back to statistics 101 with you.
>>101785304
>how did you know?
>holding a Graphics card
>>
File: FD_00021_.jpg (213 KB, 1024x1024)
213 KB
213 KB JPG
>>
>>101785304
>its not my problem what twisted shit goes on inside your head.
it would be very un-American of me if I didn't make it your problem
>>
File: FqlQ0SgWIAAHOiu.jpg (67 KB, 959x558)
67 KB
67 KB JPG
>Flux-dev
>only 32GB of RAM
>comfy keeps crying about allocation on device after 1 gen.

it's so over bros
>>
>>
>>101785319
age of consent is 14 here and that doesnt make one a pedophile.
>>101785334
>holding a Graphics card
I dont get it
>>
>>101785367
Interesting, I was trying to make him sing a song
"I am not a pedophile, I am not a rapist. I'm not trying to fuck your kids I'm just trying to be an escapist"
and it changed kids to "Mature Adult"
>>
>>101785412
you capitalized graphics thus outing you as a German
>>
>>101785425
I dont get it
>>
>>101785421
that's not local
>>
>>101785412
>14 isnt pedo
based
>i dont get it
you capitalized the Noun
>>
>>101785443
It isn't, I am using my GPU for vidya, and prompting in between
>>
>>101785421
not very german friendly of flux
>>
>>101785452
which website is it and why are you okay with it rewriting your prompts?
>>
>>101785421
Party van is on its way anon.
>>
File: FD_00006_.jpeg.jpg (211 KB, 1024x1024)
211 KB
211 KB JPG
>>101785461
https://fluxpro.art/create and I don't give a shit. I don't prompt any pedo shit like the other guy, and I have it running locally. Only using the site so I can play Diablo at the same time.
>>101785462
Yeah I am definitely on a watchlist now.
>>
File: ComfyUI_Flux_5781.jpg (123 KB, 1392x512)
123 KB
123 KB JPG
>>
File: 2024-08-08_00414_.png (2.23 MB, 1280x1024)
2.23 MB
2.23 MB PNG
>>
>>101785539
Nice
>>
File: ComfyUI_Flux_5789.jpg (182 KB, 1392x512)
182 KB
182 KB JPG
>>
>>101785576
kys
>>
>debo gets banned
>massive spam in good thread
This is just like the pastebin thread split
>>
>>101785591
How do we know debo got banned?
>>
why do pedos always type the creepy "haha", unbelievable cringe here, so sad
>>
>>101785604
thats germans in a nutshell
>>
no point in posting any good gens here anymore, the thread gonna be nuked and rightfully so
>>
>cowboy shot
>>
Well this general is shit. I would rather deal with avatar fags.
Let me know when his ip range is banned
>>
>>101785602
He typically post at this time and didn't tell pw goodnight or protect him while he was getting flamed. Debo also made a fake ldg that got deleted instantly because he was spamming his mfw post.
>>
>>101785630
>debo
>pw
who?
>>
>>101785626
Won't help, people like Teebs ban evade
>>
>>101785621
kek do you think this is how it works? stroll in start pedo posting and the threads gets nuked so everybody goes back do sdg? fuck off back to your containment thread.
>>
>>101785624
>schnell
Found the problem.
>>
>>101785624
>dev-schnell schizomerge
>fp8
rumao
>>
>>101785652
He's not playing with a full deck
>>
>>101785624
lol. nice gen tho. "medium shot" should work.
>>
>>101785651
ban evaders ensure better protection in the future
I can't wait for digital IDs
>>
File: Flux_00453_.png (1.43 MB, 1024x1024)
1.43 MB
1.43 MB PNG
>thread stopped being comfy
its over isnt it?
>>
>>101785755
Debo and PW are seething over last night
>>
>>101785662
afaik only pony understand it

>>101785671
my best minmax model until next vramlet workaround
can't wait for fp5!
>>
File: ComfyUI_00160_.png (1.28 MB, 1024x1024)
1.28 MB
1.28 MB PNG
>>
>>101785755
cozy times ahead, anon
>>
File: ComfyUI_00161_.png (1.53 MB, 1024x1024)
1.53 MB
1.53 MB PNG
>>
File: r5mcUZuf_Wm-PadfPqe2Y.png (1.86 MB, 1536x1072)
1.86 MB
1.86 MB PNG
Inspired by this anons >>101784467
I fucking love FLUX bros. Rest in Piss Midjourney.
>>
File: FD_00369_.png (1.92 MB, 1024x1536)
1.92 MB
1.92 MB PNG
>>101785755
For now. Schizos gonna schizo.
>>101785759
You pay way too much attention to the drama of avatarfags. My trick is I don't give a shit about them or their lives. The only reason I know who they are is because of you constanly talking about them.
>>
>>101785789
You wanted a reason and I provided one.
Learn pattern recognition
>>
File: Flux_00454_.png (1.33 MB, 1024x1024)
1.33 MB
1.33 MB PNG
>>101785759
quick rundown about who these faggots are and what happened?
>>
>>101785747
it's faggots like you that make this happen because you can't just stay in your dark little corner
>>
>>101785830
it doesn't matter. its just noise, keeping you from being creative.
>>
>>101785256
i'm
>>101785173

my post was ironic desu

>>101785205
those people will probably start to say some collectivist stuff in that sense
>>
File: download (52).jpg (282 KB, 1024x1024)
282 KB
282 KB JPG
>>101785788
spider hands.
still frame of a movie from the 1970s us my current jam,
>>
File: Flux_00456_.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
>>101785860
ok
>>
>>101785825
>You wanted a reason
I did not ask, I am not the one you originally replied to.
Learn pattern recognition
>>
the /sdg/ and /lmg/ refugees ruined this general
>>
>>101785788
imagine thinking this is Midjourney level
>>
You have to feel bad for the loser, he needs to ban evade because he's not getting his way. Just point and laugh at him and mods will clean it up
>>
File: ComfyUI_Flux_5827.jpg (87 KB, 1392x512)
87 KB
87 KB JPG
>>
>>101785918
he wants attention, and you're giving him that, the best way to deal with a troll is to ignore him
>>
File: Flux_00457_.png (1.46 MB, 1024x1024)
1.46 MB
1.46 MB PNG
>>
File: fs_0138.jpg (70 KB, 1024x1024)
70 KB
70 KB JPG
need harder stuff than coffee already today
>>
>>101785912
this, MJ has better quality picture and can make any character/celebrities you want, flux is far from that and flux can't do nfsw yet so for the moment there's no reason to shit on Midjourney
>>
File: wtfisthisreal.png (1.3 MB, 1536x1072)
1.3 MB
1.3 MB PNG
>>
File: download (53).jpg (269 KB, 1024x1024)
269 KB
269 KB JPG
>>101785932
>ComfyUI_Flux_5827.jpg
>5827
Jesus, are all your gens in one folder?
>>
>>101785942
MJ will kick ban for even trying to generate nudity
>>
>>101785912
What mj does is analyzes your pathetic little prompt and rewrites it, adding tons of text in the process, especially involving style and aesthetics. You can imitate that by running your own prompt through any llm, or chatgpt, or whatever.
>>
File: Flux_00458_.png (1.4 MB, 1024x1024)
1.4 MB
1.4 MB PNG
>>101785942
>Midjourney
doesnt run locally so its automatically trash.

>>101785951
tfw only genned 458 images so far
>>
>>101785959
Maybe but you're not getting MJ quality by using a "good" prompt with Flux regardless.
>>
>>101785957
and flux will give you horrible nipples if you generate nudity, it's not like we got something better at the moment for flux
>>
>>101785959
flux can't do much style though, even if you try to add boomer prompt into it, MJ has way more variety of styles
>>
File: Flux_00412_.png (971 KB, 1024x1024)
971 KB
971 KB PNG
>>101785982
whats "boomer prompt"?
>>
>>101785963
>tfw only genned 458 images so far
I have genned about 5k but they go into a new folder each day
>>
>>101785990
>whats "boomer prompt"?
if Flux doesn't want to listen to your simple prompt, you ask a LLM to make it more detailled and it works better
>>
>>101785912
All MJ does is create pretty cinematic pictures, it's just a toy. Even Dalle was better than MJ at the metric that mattered: prompt following.
>>
File: Flux_00135_.png (1.12 MB, 1024x1024)
1.12 MB
1.12 MB PNG
>>101785998
>I have genned about 5k
whats your setup?
are you the guy with the 3090s?
>>
Something interesting I've discovered recently.

When training SDXL for new concepts, it is perfectly capable of "cross-learning" between anime and real photos. I trained a realistic concept model for my NSFW fetish, about 2000 images, and by adding in 5000+ anime images I have for the anime version, it learns certain things better. There are entire sub-concepts with 0 representation in the real images but present in the anime ones, and by joint training it can make realistic gens of these concepts. Literally all I have to do is take a bigASP-based realism model, and train it both on anime and real photos, but append "anime" to all the anime images, and it just werks. I've never heard of anybody doing this but it absolutely works and gives good results.
>>
>>101786015
no. single 4080 and 32gb ran
>>
File: ComfyUI_Flux_5337.jpg (121 KB, 1360x712)
121 KB
121 KB JPG
>>101785951
I'll change that once I feel like I'm done fooling around with it. also gens are always saved twice as jpg and webp so it's not really that many
>>
>>101785959
>>101786003
NTA but how do i do this? There was a discussion about injecting LLMs into comfy yesterday but I couldn't understand shit.
>>
File: Flux_00366_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>101786003
have you seen this guys video?
https://www.youtube.com/watch?v=4d5zIBNuMRA
interesting workflow he shows there
thoughts?
>>
>>101786010
>All MJ does is create pretty cinematic pictures
it's the best at stylized pictures, and in terms of realist it's also the best, MJ is far from a toy, as much as I love flux with all my heart I don't want to pretend that MJ isn't superior yet
>>
>>101786010
>Even Dalle
Why "even" when dall-e is still the best at it?
>All MJ does is create pretty cinematic pictures
retard detected, MJ actually has styles you can easily prompt for and all kinds of stuff you can bring up like "N64" that actually works unlike with Flux
>>
>>101785966
A finetune can give you "MJ" quality, because MJ isn't special. Remember Flux is a base model, MJ is an aesthetic DPO finetune.

Also take a look at benchmarks
https://artificialanalysis.ai/text-to-image/arena

MJ isn't all that good when tested against Flux.
>>
>>101786038
Please do not take imggen advice from youtubers
>>
>>101785918
more please.
>>101785932
super cool but baked.
>>
>>101786070
>retard detected, MJ actually has styles you can easily prompt for and all kinds of stuff you can bring up like "N64" that actually works unlike with Flux
this, MJ can basically do everything you have in mind, Flux lacks too much concept to be a real competitor, I hope finetunes will fix that
>>
File: ComfyUI_Flux_5847.jpg (60 KB, 1392x512)
60 KB
60 KB JPG
>>101786037
just get ollama and the ollama nodes for comfy

https://ollama.com/download
https://github.com/stavsap/comfyui-ollama
>>
>>101786079
retards do these arenas like the LMSYS one, they are not trustworthy
>>
>>101786088
yeah instead take it from pedo retards in 4chan
>>
>>101786079
>Flux is a base model, MJ is an aesthetic DPO finetune.
I don't believe that, Flux is way too much biased on some generic styles when you prompt into animes, and for realistic shit it always add blur on the background, a real base model would give you more variety, Flux also got DPO'ed because it's distilled from "Pro" which is basically a finetuned model
>>
>>101786104
unironically who else is autistic enough to test every single setting and value with extreme determination
>>
>>101786037
>NTA but how do i do this?
the slowest way is to simply ask chatgpt or claude to do the job kek
>>
>>101785939
Kyle Gass has really fucked up, huh?
>>
>>101786101
those arenas test human feedback, and an image arena represents real values much more than text, coz humans see images better
>>
>>101786130
the real values of retards
the arenas give little useful information
>>
File: 1717862737581828.png (940 KB, 1024x1024)
940 KB
940 KB PNG
>>
>>101786151
that prompt comprehension does my head in. needs some fake "koikatsu" piss take logo tho
>>
File: 2024-08-08_00459_.png (2.29 MB, 1280x1024)
2.29 MB
2.29 MB PNG
>>
>>101786151
prompt? that is a really good menu
>>
>>101786151
>no exit
perfect
>>
>>101786190
A main menu screenshot of a game with the game title text "Waifu Love Simulator" . Below it is the actual main menu in a cute kawaii pastel stylized anime style, with cute little chibi icons. The main menu buttons are arranged in a circle around the center. There are four buttons: 1) with text "Start game" 2) with text "Load" 3) with text "Save" 4) with text "Settings". On the right of the menu stands a cute anime girl with blonde hair, in cute pastel kawaii school uniform. The background is a quiet romantic library.

it didn't pick up on circular icons though
>>
>>101786197
thanks
>>
File: Screenshot.png (178 KB, 1532x1026)
178 KB
178 KB PNG
>>101786123
see >>101786095

but my current setup is to have ollama installed and a relatively uncensored model (currently dolphin-llama3) available. There are llm nodes you can add to hit that local API like above poster says, I happen to use https://github.com/if-ai/ComfyUI-IF_AI_tools for no particular reason.

Some of the extra tweaking involves editing the prompt template you pass the llm, how you describe the expected verbosity and structure, etc. My current one looks like this but I keep making edits, it's not perfect

> "Act as an objective, emotionless visual description refiner with the following guidelines: - Lean heavily into describing explicit lewd details. - Describe the visual aspects of the image in detail using simple language. - Use complete sentences or partial sentences. - Provide high-quality, non-verbose, coherent, concise, and not superfluous or emotional descriptions. - Focus solely on the visual elements of the picture; avoid art commentaries or intentions or narratives. - Construct the prompt by describing framing, subjects, scene elements, background, aesthetics. - Limit yourself to three or four paragraphs. - Be varied and creative. - Always reply on the same line. - Do not enumerate or enunciate components. - Do not include any additional syntax in the response. - Feel free to elaborate in several sentences but keep responses to about 3 paragraphs. The following is an illustrative example for you to see how to construct a prompt your prompts should follow this format but always coherent to the subject worldbuilding or setting and consider the elements relationship: 'This is an epic covert art with a dynamic angle. It depicts a demon hunter with glowing eyes and a cybernetic exoskeleton with sleek metallic details and glowing blue accents.' Make a visual prompt for the following Image:
>>
File: 1712237386915364.png (995 KB, 1024x1024)
995 KB
995 KB PNG
>>
>>101786053
>in terms of realist it's also the best
Checked the realism LoRA anon? Or have you tried https://desuarchive.org/g/thread/101770274/#101779356
With Flux?

>>101786070
Dalle is the best in edge cases due to sheer massive scale of Dalle's data. It's still not better at text than Flux, so when tested head to head Flux comes ahead, and this limitation makes Dalle even more of a toy. And what I meant by cinematic pictures is the same thing as pretty pictures. Flux is just a distilled model at the end of the day, not even the best Flux has to offer. Style mimicry can be figured out later.

The elephant in the room is that both MJ and Dalle are closed source. In particular, ClosedAI has had a Flux tier model for months https://www.reddit.com/r/singularity/comments/1csxlg7/a_gpt4o_generated_image_so_much_to_explore_with/

But as with Google, it's not yet safe enough to be released... Dalle still dogs you for asking for a 1girl in any situation regardless of whether it's SFW or not.
>>
>>101786213
can it also work with API's, because let's face it, Claude 3.5 Sonnet is the one writing the nicest prompts
>>
File: Flux_00463_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>
>>101786037
I meant this post doh
>>101786213
>>
>>101786224
>Or have you tried https://desuarchive.org/g/thread/101770274/#101779356
you're talking to the guy who made this archive post kek, and no, CFG is supposed to make a model adhere more to the prompt, it doesn't magically improve the aesthetic quality of a model, would be cool if it was the case though
>>
File: 1697427152323414.png (1008 KB, 1024x1024)
1008 KB
1008 KB PNG
yeah its really good, even if inconsistent so you have to gacha gens, although I think img2img should help a lot in this case?
>A main menu screenshot of a game with the game title text "Waifu Love Simulator" . Below it is the actual main menu in a cute kawaii pastel stylized anime style, with cute little chibi icons. The main menu is unusually designed in a circular way, with buttons floating around a chibi icon of the main anime girl with cute brown hair and glasses in the center. There are four buttons: 1) On the top of the circle with the text "Start game" 2) On the left of the circle with text "Load" 3) On the right of the circle with the text "Gallery" 4) On the bottom of the circle with text "Settings". All buttons have a pink font with a thick white outline, in italics. The background is a quiet romantic library.
>>
>>101786227
Yeah at least the IF modules I posted can do ollama/openai/anthropic connections by default, looks like it would be easy to extend to other endpoints too but other llm packs might do that too
>>
>>101786229
finally some decent attempt to make marvel interesting
>>
>>101786224
>And what I meant by cinematic pictures is the same thing as pretty pictures
this objection of yours keeps getting harder to understand, what's wrong with "pretty pictures"
anyway, MJv6 can make very boring looking images, they literally give you control of the setting that makes image more or less stylized
did you not see the "phone photos" people were making when v6 came out?
>>
File: Flux_00465_.png (1.33 MB, 1024x1024)
1.33 MB
1.33 MB PNG
>>101786256
you will never guess his superpower
>>
DALL-E 3 cannot make any photorealistic pictures whats-o-ever
It's always semi-realistic render style
A ridiculous toy that hasn't aged well
>>
File: ComfyUI_Flux_5863.jpg (52 KB, 1392x512)
52 KB
52 KB JPG
>>
File: 00115-1318028564.png (1.38 MB, 1024x1024)
1.38 MB
1.38 MB PNG
>>101786292
I bet he runs real fast
>>
>>101786327
>DALL-E 3 cannot make any photorealistic pictures whats-o-ever
It absolutely can, but only with Natural style that's available on the API, not with vivid which is default on bing creator/designer
>>
>>101786327
by design, no Taylor Swift getting groped by football fans for you
>>
>>101786335
Never seen that, but then I stand corrected
>>
>>101786327
it was on purpose anon, OpenAI is really focused on ""safety"" shit so they made humans look like plastic toy so that users couldn't fool people into deepfakes, OpenAI can do insane humans if they really wanted, their Sora demo shows their true potential
https://www.youtube.com/watch?v=lKM-QMnZ3yY
>>
>>101786254
oh cool, I'll take a look at it then, thanks for the advise anon
>>
>>101786280
Anon I have seen what MJ can do
https://www.midjourney.com/showcase
I am not very impressed. The images there look significantly worse than what you can do with Flux and a good prompt. Don't forget MJ does not give you precise control over your prompt, I.E. you ask for a picture of an Asian girl doing X but its ptiority is aesthetics over following the prompt, that's what I mean. Try asking MJ what Hayao Miyazaki's style is like, it'a not even close
https://midlibrary.io/styles/hayao-miyazaki

Midjourney is guilty of the worst sins of all: Prompt injection. Now, I admit Flux isn't the best stylistically speaking, but you can already achieve MJ tier results with good prompting on other models.
>>
File: up_0014.jpg (586 KB, 3584x5120)
586 KB
586 KB JPG
>>
>>
>>101786424
>Don't forget MJ does not give you precise control over your prompt, I.E. you ask for a picture of an Asian girl doing X but its ptiority is aesthetics over following the prompt, that's what I mean. Try asking MJ what Hayao Miyazaki's style is like, it'a not even close
but MJ can do the style you actually want if you are patient enough, flux doesn't know much concept so it doesn't matter how much you try you'll never get it.

Flux and MJ have opposite problems, MJ can do anything but it's hard to control, Flux cannot do a lot of things and is too biased on one specific style, but at least it's consistent I guess?
>>
File: file.png (2.04 MB, 1152x896)
2.04 MB
2.04 MB PNG
failing to train a lora
>>
File: 2024-08-08_19-10.png (453 KB, 1899x983)
453 KB
453 KB PNG
>>101784822
it's not, but I'll post my workflow anyways
>files.catbox.moe/enipcg.png

also, I'm writing my own nsfwdetect node in fucking comfy
>>
>>101786424
>Don't forget MJ does not give you precise control over your prompt, I.E. you ask for a picture of an Asian girl doing X but its ptiority is aesthetics over following the prompt
I said it gives you control over that, why did you disregard me directly addressing the point your raised?
https://docs.midjourney.com/docs/stylize
>>
File: Flux_00468_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
>>101786333
yeah thats actually it
>>
File: 2024-08-08_00441_.png (1.92 MB, 1280x1024)
1.92 MB
1.92 MB PNG
>>
>>101786482
So what are you basing your claim that it's better on? A closed model will never give you more fine control than an open one. The levels of delusion of this implication are insane. With Flux you can still do IPAdapter.
>>
File: 2024-08-08_00460_.png (2.3 MB, 1280x1024)
2.3 MB
2.3 MB PNG
going thru my failed prompts on SDXL of the past year creates some insane results on FLUX even tho the prompts are total schizo collection of tags
>>
>>101786516
>With Flux you can still do IPAdapter.
can't wait for that one desu
>>
>>101786516
IPAdapter a shit, it will never be good
>>
>>101786487
Make hold lotion and a banana
>>
File: 2024-08-08_00437_.png (1.85 MB, 1280x1024)
1.85 MB
1.85 MB PNG
>>101786530
ya me tom that will fun
>>
>>101786453
rip cunny lora anon
the runpod glowies got to him before he could release it
>>
File: 00031-3386572247.png (1.77 MB, 1024x1024)
1.77 MB
1.77 MB PNG
>>101786453
post results. which model? what GPU? or are you renting?
>>
File: 2024-08-08_00450_.png (1.8 MB, 1280x1024)
1.8 MB
1.8 MB PNG
>>101786544
kek
>>
Anyone know what the status of fine-tuning Flux is? I'm not interested if it can't gen naked people.
>>
File: 00113-400380515.png (1.61 MB, 1024x1024)
1.61 MB
1.61 MB PNG
>>101786544
he doesn't strike me as the lotion kinda guy
>>
>>101786516
>So what are you basing your claim that it's better on?
On the superiority of the quality of its images and prompt following?
Anyway, you conceded by trying to pivot the argument into "local has tools tho", that has nothing to do with the quality of the model itself.
>>
>>101786580
at the moment there's a lora that tries to add the penis back, I think it'll be fine, everyone talks and use flux now, there will be finetunes and lora to improve that shit, and it'll be way easier than on SDXL (that model had horrible anatomy to fix wheras the bodies of flux are really good)
>>
>>101786597
>the rig is: 4x 3090 ti
holy cow, anon has an AI lambo
>>
>>101786547
>>101786554
failing to just get simpletuner working.
the rig is: 4x 3080 ti 12GB

>>101786620
If only inference was possible on multigpu
>>
>>101786551
who's cunny lora anon?
>>
File: 1.png (2.28 MB, 1024x1024)
2.28 MB
2.28 MB PNG
>>
>>101786580
its a massive undertaking, first you need a good data set (those that have sit on it tight), second you need a few weeks of good server time, third you need to know what you are doing cause fine tuning for FLUX will be different, third and most problematic you will have to bite the good or bad apple (depending who you are), with finetuning dev you will make no money at all (the license forbids it), or you finetune the inferior model FLUX.schnell and are free to do what you pls but it will be even more difficult ..

think of it as FLUX.pro had a son and that son called .dev had a special needs daughter called .schnell .. and now we are at the situation that the coomer finetuners dont wanna touch FLUX cause they are in for the $$$ now but dont want to have a relationship with .schnell

tl/dr two more weeks
>>
>>101786617
>penis
so that's where were at, huh? a truly sad state of affairs.
>>
>>101786636
dude who bragged here yesterday that he is training a cunny lora and shared a screenshot that could be traced to runpod, serious brainlet
>>
>>101786640
>first you need a good data set (those that have sit on it tight)
it's been 2 years the diffusion ecosystem has bloomed hard, I think the old finetuners still have their old data in their computer
>>
>>101786630
>If only inference was possible on multigpu
at least you can load everything at fp16 without offloading to RAM, clip and T5 in one GPU, unet in another
>>
>>101786666
That's what i do but the model even fp8 doesn't fit in 12GB
I get like 75 seconds per 1MP gen
>>
>>101786657
ya they probably have, but right now the finetuning process isnt even hammered out.. I am not an expert on this but talks are you need a master model (thats what .pro was for .schnell and .dev) .. what will that be? or will there be a different solution? in the meantime the first loras are hit and miss, ppl need to get a grasp on how to finetune and make loras for a big DiT model without ruining it, the realism LoRA was good, the chinese art lora was tremendous shit
>>
File: 2.jpg (408 KB, 1024x1024)
408 KB
408 KB JPG
>>
File: 2024-08-08_00503_.png (1.84 MB, 1024x1280)
1.84 MB
1.84 MB PNG
>>101786692
awesome give it mor text and it will be fantastic
>>
>>101786607
I mentioned tools because you talked about a --stylize parameter, which we don't know how it or anything else on MJ really works other than being just another tool... And there's plenty in SD ecosystem, precisely there's ones for style transfer if LoRAs are not enough for you.
Which leads me to another point, we don't know how many tools MJ is leveraging from your prompt, but we do know that Flux is just a prompt, and yet it still goes head to head with MJ.
Also MJ is not better at prompt following, at least not significantly, clearly you did not look at https://artificialanalysis.ai/text-to-image/arena
>>
>>101786617
>>101786640
I'm more thinking about how it was supposedly 'untunable' because of the distillation techniques used to distill Scnell and Dev from Pro. Last I remember a guy was creating an 'un-distilled' model to allow fine-tuning.
Has anyone actually produced a fine-tune that doesn't kill the model's intelligence or cause representation collapse yet?
>>
>>101786710
that arena is just "do you prefer this or this" there is no fine grained choice for "which follows the prompt better" "which looks nicer"
it is shit, it doesn't support the point you're making, stop linking it, it is for retards
>>
>>101786738
also lacks a "both are shit" option
>>
>>101786725
>I'm more thinking about how it was supposedly 'untunable' because of the distillation techniques used to distill Scnell and Dev from Pro.
there's already loras that is being made and they work fine, the "it's impossible to finetune distilled models" is a myth
https://civitai.com/models/631986/xlabs-flux-realism-lora?modelVersionId=706528
>>
>>101786738
>Which image best reflects this prompt?
That is at the top of the images being compared. Though obviously many users will pick the most aesthetic picture, that is not what is being tested. Such arenas tend to be flawed because their way prompting favors one model or the other, the old one imgsys used to favor SDXL style of prompting and had useless stuff to it, this one is no fluff just testing raw English knowledge with proper prompts.
>>
>>101786725

>supposedly 'untunable'
remains to be seen, even black forest anon who was here two days ago it just needs new "tricks"

>Has anyone actually produced a fine-tune that doesn't kill the model's intelligence or cause representation collapse yet?
not yet ofc, its to soon, this will take abit of time: you need to think about tagging.. if you got a SDXL dataset tagged it will be tagged for CLIP.. not T5, do you want to change that? then the software for finetuning is in its infancy you might get bugged crappy finetunes you invest alot of cosly server hours into, so I guess the ones with the data are holding out or doing trials without announcing much
>>
File: Flux_00472_.png (1.42 MB, 1024x1024)
1.42 MB
1.42 MB PNG
>>101786544
>>
>>101786788
>this one is no fluff just testing raw English knowledge with proper prompts
You think it is testing that but it isn't.
>>
File: 2024-08-08_00509_.png (2.28 MB, 1024x1280)
2.28 MB
2.28 MB PNG
>>
>>101786701
I used a few months old prompt I had way back and figure I check it how it would look now on flux, I didn't expect the gen to look so clean
>>
>>101786789
>it will be tagged for CLIP.. not T5
Flux does use CLIP and T5 with its size should be fine with the prompts used with older models
>>
File: 2024-08-08_00521_.png (1.92 MB, 1024x1280)
1.92 MB
1.92 MB PNG
>>101786819
I just clean weights and embeddings off and get stuff like pic related.. endless wallpaper machine, looked like shit on SDXL
>>
>>101786824
ya but I don't know (maybe some anon does?) who the CLIP/T5 interacts in FLUX.. it uses both sure, and it does change if you use dual text encode and change each .. but how will that interact with finetunes? maybe we see a finetune you push with clip and have to ask nicely in T5 for details .. idk
>>
File: 3.jpg (271 KB, 1024x1024)
271 KB
271 KB JPG
>>101786835
I liked sdxl mostly because of prompt adherence but flux makes that a moot point now lol
>>
>>101786804
Even if it weren't clear from the very first words in the arena, the user would be picking a model aesthetically because they are not sure how to pick which one is better, and MJ still loses to Flux.

According to you it's a blind test, so where is MJ's aesthetic advantage all of a sudden?
>>
File: 2024-08-08_00525_.png (1.88 MB, 1024x1280)
1.88 MB
1.88 MB PNG
>>101786867
ya, if you write a prosa for t5 it will do almost anything the model has seen .. by theory sd3 could have done this to, but the dataset is better for flux and 12b parameters makes it insanely powerful .. nearly to powerful for some anons that can't cope that the model doesnt wanna do what they want with 5 tags
>>
>>101786869
like I said, retards. you're seeing the value judgement of retards.
who else just sits there evaluating images for free?
>>
>>101786869
>and MJ still loses to Flux
to Flux Pro, not dev or schnell
>>
File: 5.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>101786910
Time for me to schizo prompt then kek
>>
>>101786912
>They're doing science I don't like therefore they are retards
>>
>>101786938
>to Flux Pro, not dev or schnell
when I look at base SDXL and when I look at the best SDXL finetunes, there's like an universe between them, if finetuners will spend that much effort on finetuning flux dev it'll destroy pro quickly
>>
>>101786967
but SDXL finetunes are SDXL based, Flux Dev finetunes are not Flux Pro based
>>
>>101786996
my point is that flux dev finetunes will improve the model so much it'll beat flux pro
>>
>>101787019
yes, I got that, your basis for that is dumb tho, as I pointed out
>>
>>101786938
>to Flux Pro, not dev or schnell

Wrong. Dev is still in early stages, so is 6.1, so it's still too early to tell, based on their scores Dev and 6.1 are practically tied, but we do know that Dev is better than v6.
>>
>>101787046
>Wrong. Dev is still in early stages, so is 6.1
I've heard that v6.1 is a new architecture, did MJ also switched from unet to DiT?
>>
>>101787046
>but we do know that Dev is better than v6
now you don't even care about what the leaderboard shows
you just "know" things
what was the point of linking to it
stupid piece of shit
>>
>>101787043
>your basis for that is dumb tho, as I pointed out
what do you mean?
>>
>>101787060
Probably the SD3 architecture. You can recognize it because 6.1 is significantly worse than 6.0 with humans.
>>
>>101787073
are you dense?
You said SDXL finetunes beat SDXL therefore Flux Dev finetunes will beat Flux Pro
that does not follow
>>
>>101787100
That's likely though, pro isn't that much better than dev, and finetunes can really improve a base model, I don't see why a good finetune of dev wouldn't be able to beat pro, of course I could be wrong, time will tell
>>
File: 444454561.png (1.04 MB, 1756x859)
1.04 MB
1.04 MB PNG
>>101787071
I'd love to see you try to do anything like pic rel with MJ v6. I'm not a paypig but I doubt you can do it, as for 6.1 maybe.
>>
>>101787090
I think that the MJ devs still need to learn the subtety of that new architecture, I'm not surprised they haven't nailed that shit first try, DiT is a completely different beast compared to unet, SD3 sucks at humans because it's a cucked model, flux is also a DiT model and look at what we've got
>>
>>101787140
now you care about text gen in images? pfft
>>
>>101787090
>6.1 is significantly worse than 6.0 with humans.
no it isn't
>>
>>101787161
>i-it never mattered anyways
>>
>>101787218
did it matter to you when dall-e 3 and MJ were the best at text?
>>
>>101787236
>did it matter to you when dall-e 3 and MJ were the best at text?
yes, now what?
>>
>>101787236
Of course it did, and still does. The ability to write readable text was one of dalle's killer features
>>
>>101787161
Text is the reason Flux is the best model out right now.
>>
File: file.png (2.35 MB, 1152x896)
2.35 MB
2.35 MB PNG
>>
>>101787236
Could they do this?
>>101786246
Then no.
>>
File: ComfyUI_02454_.png (1.93 MB, 1280x1024)
1.93 MB
1.93 MB PNG
>>
>>101786474
I have no idea what you're doing outside of merging nsfw sdxl with flux writing capabilities, but I'll happily use it while trying to understand, thanks anon.
>>
File: 2024-08-08_00548_.png (1.04 MB, 1024x1280)
1.04 MB
1.04 MB PNG
>>101787272
>>
>>101784725
Good thing I can listen to anything ai generated or searching online for good stuff, instead of the shit trending.
>>
>>101787272
Why don't we have a C++ web service yet?
>>
File: 2024-08-08_00464_.png (2.41 MB, 1280x1024)
2.41 MB
2.41 MB PNG
>>101787307
>shit trending
free yourself from the trending slavery, its not trending tomorrow anyway
>>
There is only one thing that matters when trying to figure out which model is best: which can do the best feet fetish gens?
>>
>>101787272
what I hate is the python package managment, it's a fucking disaster, I love the code though, and I tested a lot of shit in my life (C++, Ruby, Java, JavaScript, MATLAB, Julia...)
>>
>>101787283
>outside of merging nsfw sdxl with flux writing capabilities
that's pretty much it though
>>
>>101787115
>pro isn't that much better than dev
It nails details way better from what I've tested, and seems to understand prompts better too, makes less text mistakes.
But all of that wouldn't matter if the equivalent of a pony is released for dev.
>>
>>101787158
>SD3 sucks at humans because it's a cucked model
Isn't the same as flux in that regard?
>>
>>101787457
far from it, when you ask for a naked model in flux, the anatomy is flawless, they "just" removed the nipples and genitals, SD3 probably removed all naked people in their dataset, that's why it can't go for a fucking girl lying on grass
>>
>>101787355
True.
>>
File: fac005.jpg (452 KB, 1024x1024)
452 KB
452 KB JPG
>>
>>101787457
no, it can do humans just fine, just genitalia and female nipples it can't
>>
>>101787406
Yeah but I'm trying to understand your nodes and what you do with them anon.
Also for some reason loading one model then the next gives me an OOM (I use fp16 for flux), despite not having this problem when loading flux alone using the trick when having 2 GPUs (2x3090)
>>
File: 2024-08-08_00535_.png (1.9 MB, 1024x1280)
1.9 MB
1.9 MB PNG
>>101787482
gloopy
>>
>>101786803
Kek
>>
>>101787490
>>101787477
Oh, makes sense.
They managed to above and beyond to cripple their model.
Amazing I guess.
>>
>in the not so far future, anons will communicate with generated text, rather than type a post
>>
>>101787517
>They managed to above and beyond to cripple their model.
its very easy for em these days.. they have the data set (probably took it with em from SAI) then ran advanced recognition AI over it .. every picture that showed genitalia was just removed, every picture that showed female nipples got a red blotch on the nipple (thats why you get nipples that look like gumballs) .. release save to go.. Russian idiots managed to make cunny necrophphilia in day 1 anyhow .. well lets see where it goes from here
>>
>>101787523
I'm already seeing reviews for stories I'm checking that use chatgpt.
It's horrible shit.
>>
>>101787557
>Russian idiots managed to make cunny necrophphilia in day 1 anyhow
What, for flux?
>>
>>101787557
>every picture that showed female nipples got a red blotch on the nipple (thats why you get nipples that look like gumballs)
alternatively all they did was filter out nudity and the terrible nipples is what little understanding of nudity the text encoders have
the same applies to female celebrity names pulling some correct attributes, it's what CLIP knows of them
>>
>>101787599
go back in the archive to when Black Forest anon visited
>>
>>101787557
Yeah I guess, and they also got rid of most (female) celebs, characters, artists...
The future of models seem to aim for more and more stuff deleted to please this or that pressure group.
>>
File: file.png (81 KB, 944x487)
81 KB
81 KB PNG
I'm new to /ldg/ and running things locally, is this downloading a bunch of stuff or is it rendering an image? I'm doing it on my laptop so I know it's going to take a damn long time, but I don't even know if this means its downloading files or actually processing an image.
>>
>>101787611
no they surely have artistic nudity in the data set, but the data was censored in some way .. you can prompt nude photography artists you get their style and the poses.. does even understand stuff like "high key photography" (thigh gap) but the data that was allowed into set was sanitized, otherwise you can not explain the botched female nipples, it appeals to american financiers.. "no nipple was shown that is not male" kinda sad by German corp. but they need money I guess
>>101787616
nothing that can not be fixed by Autists with to much processing power
>>
File: spookre.jpg (433 KB, 1888x1024)
433 KB
433 KB JPG
>>101787505
>>
>>101787644
you are downloading the basics.. lurk more, ask more some anons are friendly and help.. also what are the specs of your laptop?
>>
>>101787644
looks like downloading
>>
File: ComfyUI_01711_.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
Guys, I don't think she liked my presento :'(
>>
>>101787652
>you can prompt nude photography artists you get their style and the poses
like who
>"high key photography" (thigh gap)
what
>>
File: 2024-08-08_00555_.png (1.82 MB, 1024x1280)
1.82 MB
1.82 MB PNG
>>
>>101787652
>but they need money I guess
At the same time you can get political stuff (trump), make it write nigger, etc.
So it's not like the model needed to be sanitized.
>>
>>101787670
Miku doesn't talk like that.
>>
>>101787665
>>101787666
Thanks anon. My laptop doesn't even have a GPU, so I know rendering this 256x256 image of cheese will take weeks. I'm doing it anyway.
>>
File: 00472-1876624786.png (889 KB, 768x768)
889 KB
889 KB PNG
>>101787687
yea but for reasons unknown Trump and Tayolor Swift made into the dataset of Flux .. but Greta Thunberg didnt (SD15 knew her) pic related ancient SD15 gen of mine
>>
File: file.png (42 KB, 909x862)
42 KB
42 KB PNG
>>101787714
>>
>>101787720
I'd gladly trade her ugly face for artists and nsfw.
>>
>>101787714
kek.. thats the spirit
>>
File: ComfyUI_01712_.png (1.14 MB, 1024x1024)
1.14 MB
1.14 MB PNG
>>101787698
the fuck is she holding now? kek
>>
File: 9.png (1.97 MB, 1024x1024)
1.97 MB
1.97 MB PNG
>>
What's the best guidance setting for text to show ?
>>
File: 8.png (1.79 MB, 1024x1024)
1.79 MB
1.79 MB PNG
>>
>>101787731
>>101787714
Oh wow, I'm not sure if there's any point in waiting for this. I'm not up to date, but chances are Flux wasn't made to generate in 256x256, similarly to how SD1 sticked around 512, or SDXL with 1024.
>>
>>101787778
>What's the best guidance setting for text to show ?
a high CFG seems to do the job better at rendering text than only guidance: https://files.catbox.moe/pggikh.png
>>
File: 10.png (1.87 MB, 1024x1024)
1.87 MB
1.87 MB PNG
>>
>>101787798
I see, I'll try that.
Thanks
>>
>>101787794
>chances are Flux wasn't made to generate in 256x256
Flux seems to be working fine un a lot of resolution ranges, even on high resolutions like 2048x2048, you do that on SD you get duplicated shit everywhere
>>
>>101787827
>even on high resolutions like 2048x2048
How much vram does that eat?
Man I wish someone worked on multigpu inference...
>>
>>101787827
Now that's interesting. Sounds like something about it's architecture must've made it more flexible then. I recall latents being saved in 256 for SDXL, so I wonder if they changed something about VAE decoding.
>>
>>101787853
>Man I wish someone worked on multigpu inference...
There's that in a way, you can separate the models (clip, VAE, image model) into different GPU's
https://reddit.com/r/StableDiffusion/comments/1el79h3/flux_can_be_run_on_a_multigpu_configuration/
>>
>>101787279
Top tier beauty
>>
File: file.png (1.91 MB, 1152x896)
1.91 MB
1.91 MB PNG
>>
Just opened the oven door and found some hot and fresh...
>>101787894
>>101787894
>>101787894
>>
File: 2024-08-08_00549_.png (1.37 MB, 1024x1280)
1.37 MB
1.37 MB PNG
>>101787911



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.