[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (756 KB, 3264x3264)
756 KB
756 KB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>101678250

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Kolors
https://gokaygokay-kolors.hf.space
Nodes: https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

>AuraFlow
https://fal.ai/models/fal-ai/aura-flow
https://huggingface.co/fal/AuraFlows

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>GPU performance
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
bless thread of frenship
>>
>>101681371
I only use 12GB when genning in comfyui.
>>
bros, is it over for the faggots over at SAI???
Unironic question
>>
File: ComfyUI_00036_.png (293 KB, 512x512)
293 KB
293 KB PNG
at 512x512 speed is comparable to SD and is still pretty good
>>
So much fresh bred in a simgle day
>>
>>101681400
I feel mildly sorry for the people that didn't want to fuck up but got "overruled" by the leadership which also wasted tons of money (not on their wages).

But yes, the alternatives are better, and I don't see much of anything SAI has going for it - the competition is currently better.
>>
>>101681428
>don't see much of anything SAI has going for it
an image to 3d model no one asked for kek
>>
>>101681383
>bless
blessed
>>
>>
File: ComfyUI_00165_.png (1018 KB, 1024x1024)
1018 KB
1018 KB PNG
>>101681535
>>
File: 1722579110155603 (1).png (96 KB, 1536x743)
96 KB
96 KB PNG
Looks like they will cooke a finetune of flux dev, nice
>>
>>101681583
Pixartfags on suicide watch
>>
the speed of schnell is quite impressive
I shat on it before but I think I was giving it too many steps and getting deep fried images as a result

I'm used to SDXL turbo models lying about only needing 4 steps when they actually need more like 6-8
This one actually needs only 4
>>
File: flux.png (1.57 MB, 1024x1024)
1.57 MB
1.57 MB PNG
>>101681440
Maybe a future better variant, but I don't particularly expect it to be coming from SAI.

Even then I wonder if it's not still an ultimately 2D-ish imagegen component (to re-texture the same generated object) that is going to be just as important as generating the 3D object itself.
>>
>>101681583
It can't be finetuned retard, it's a dead end for a lot of reason
>>
>>101681598
>This one actually needs only 4
but the quality is worse than flux dev so...
>>
>>101681598
Two (2) steps
>>
https://reddit.com/r/StableDiffusion/comments/1ehotwi/it_seems_fluxdev_is_another_model_i_have_been/
I wonder what is his prompts, he nailed the different styles
>>
Im going to kill myself
>>
me too
>>
>>101681594
I don't see the issue for Pixart at all if this can't be finetuned on consumer hardware.
>>
File: Capture.jpg (622 KB, 2689x1678)
622 KB
622 KB JPG
Guys, I think I found a sampler that can replicate styles well, Euler Karras
>>
File: flux_041.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>101681684 >>101681702
But why?
>>
>>101681711
huh, karras does seem to bring out a more painterly style
this makes no sense, the noise scheduler shouldn't have any effect on that
>>
File: Untitled.png (191 KB, 2724x1052)
191 KB
191 KB PNG
friendly reminder to attach the sound effect node so it dings when it's done

that way you can switch tabs until you hear the ding
>>
>>101681765
https://github.com/kijai/ComfyUI-KJNodes
add status icon to tab, as an alternative
>>
>>101681583
will donate next month
>>
All the people that was training before this scale, were prepare only for 8B, so this suddenly movement of flux, will define the scenario. There are many companies, that will pay for improve a open source model that is in the tier level of Dalle3, and there is more room to enhance. Regard the license, you can monetize trough KOFI, or similar just like those are doing right now, and therefore, a company could donate to your project without many announcement. Is just a question of time, to many loras and finetunes will work with this.
>>
>>101681583
why do furries have so much money
>>
>>101681601
SEETHE COPE DIAL AN 8
>>
File: ComfyUI_01791_.png (792 KB, 1456x720)
792 KB
792 KB PNG
Finally a model that can oneshot this prompt (besides Dalle)
>A color photograph of a young Japanese woman in a cropped top typing on a retro computer in a dimly lit room. She is holding a gun while typing. The image looks like a VHS still, slightly faded with film grain. The room has a nostalgic feel with vintage decor and low lighting, casting shadows that create a moody atmosphere. The overall scene is retro and slightly mysterious, capturing a unique blend of past and present elements.
>>
>>101681789
It took great focus to muster through this post but after doing so, I have come to agree with you.
>>
where's the 5000 series nvidia?...
>>
>>101681583
what have they done previously ive never heard of these guys before
>>
>>101681806
Elon just bought all. We wont get any
>>
Flux has officially revived the image generation scene after being stale for such a long time (half a year)
Can't wait for some finetunes to come out.
>>
File: ComfyUI_01792_.png (769 KB, 1456x720)
769 KB
769 KB PNG
>>101681798
Slightly better but lost VHS feel kek
>>
>>101681817
Totally this. Any idea what hardware requirements are for finetunes tho?
>>
File: 00_10.jpg (256 KB, 1552x1200)
256 KB
256 KB JPG
Yo
>>
>>101681826
Seems like it's pretty high, so it's going to take a while to see anything worthwhile probably.
>>
>>101681733
vramlet :(
>>
>>101681840
yea I thought so, well at least the autists already have their finetune datasets, all they need is to rent some server time to get it done, 2-3 months for the first nice ones to appear?
>>
File: ComfyUI_01797_.png (883 KB, 1456x720)
883 KB
883 KB PNG
>A 1990s anime still featuring a woman with an 1980s hairstyle and look. She is pouring water from a bottle into a glass in a hotel room. Behind her, a blue light shines at the room during the day, creating an interesting contrast. The scene captures the essence of 1990s anime with its distinctive art style, vibrant colors, and detailed backgrounds.

Nice

>>101681817
We have been waiting for someone more sensible than SAI to step in for years now, first the Chinks (and we know we have their dedication as they will release models on par with Flux Pro soon) and now Flux, it truly is a relief.
>>
>>101681866
How much vram and ram do you have?
>>
File: flux.png (1.92 MB, 1024x1024)
1.92 MB
1.92 MB PNG
>>101681817
i think it's a very nice model, but there was constant progress either way

>>101681866
download more vram? maybe there'll be other solutions later
>>
>>101681888
>1990s
>2000s style
model is cooked stylistically
>>
>>101681866
Give it some time.
Remember when SDXL took 14gb vram minimum to run? You can now run it on a 4gb potato just a bit slower.
>>
>>101681900
i hope sooner rather than later
>>101681910
>Remember when SDXL
my thoughts exactly. the only question is when
>>
>>101681907
nta, but whatever ya do, anime styles are not well understood by flux
>>
>>101681910
>Remember when SDXL took 14gb vram minimum to run
that was for the full FP32 model.
to run this model that way would require 48gb+ vram
>>
It's kinda disappointing that it ends up being based around 1024x1024 again (with quality drop offs at higher res). Are we ever going to move to a higher base res?
>>
>>101681922
strangely I experience better results on schnell than on dev in res above 1024x1024 (schouldn't they have the same data set?)
>>
>>101681922
Once consumer GPUs catch up to datacenter GPUs in terms of vram we'll move on from 1024x1024.
For now it's simply more practical to gen at low resolution and upscale.
>>
>>101681922
I think that would shrink the dataset considerably, the number of 2k or 4k images is much smaller than the number of 1 megapixel images
You'd probably end up having to cheat by having a lot of the dataset be 1024 images upscaled with ESRGAN or something, in which case what would be the point
>>
File: flux_051.png (1.81 MB, 1024x1024)
1.81 MB
1.81 MB PNG
>>101681916
I think flux-schnell at least is easily fast enough that people will let you use this for cheap/free or via ai horde "trades" or w/e even if nothing changes.
>>
File: ComfyUI_00002_.png (1 MB, 1456x720)
1 MB
1 MB PNG
>>101681907
True, is training code released? If someone takes it and finetunes it on thousands of different styles then we might be back.
>>
File: ComfyUI_00001_.png (1.07 MB, 1456x720)
1.07 MB
1.07 MB PNG
>>
>>101681888
>interesting
Why the fuck do tech inept people add this to their prompts? Do they think he AI can do something meaningful with an adjective like ”interesting“?
>>
File: ComfyUI_00003_.png (1.29 MB, 1456x720)
1.29 MB
1.29 MB PNG
>1980s, retro, vintage_anime, anime_(1980s), takahashi_rumiko, adachi_mitsuru, hojo_tsukasa, hagiwara_kazushi, buronson, kurumada_masami, toriyama_akira, takahashi_yoichi, 1girl, a fantastical scene depicting a woman in swimsuit sitting on a hill overlooking an airport runway on a rocky outcropping in the middle of the ocean. A violent and powerful storm is visible in the background, and a giant wave is crashing over the rocks at the runway, 1980s anime tv episode still
>>
>>101681947
I also got the feeling that some data subjects are more focused on hires, if you prompt nature scenes, trees and the like, you can get incredible detail, but I tried night skies yesterday and got horrible rasterization effects on hires. Also ofc 2D was not their focus
>>
>>101681976
Might as well use "bizarre" instead because it actually does something interesting
>>
why does karras work for schnell but not for dev
they must be quite architecturally different
>>
post ghibli inspired miku
>>
>>101681977
in most of my trials it just borks on booru tagging, 1girl and artistname_name will not work well.. in the style of Akira Toriyama better than toriyama_akira
>>
File: flux.png (1.82 MB, 1024x1024)
1.82 MB
1.82 MB PNG
>>
File: ComfyUI_00005_.png (1.11 MB, 1456x720)
1.11 MB
1.11 MB PNG
>>101681976
I mean this is exactly how a VLM talks
>>
File: ComfyUI_00033_.png (1.29 MB, 1024x1024)
1.29 MB
1.29 MB PNG
>photo realistic spooky castle at night on a mountain from a distance red light emanating from the windows a narrow winding road runs upwards towards the castle moon light shines bright illuminating the scene

I'm glad I learned about this model, I'm having a blast with it.
>>
>>101681998
Tero isn't bossed around by some lapland burners
>>
File: ComfyUI_00007_.png (1.3 MB, 1456x720)
1.3 MB
1.3 MB PNG
>>101682006
I like to see what it does (by adding booru tags you add creativity to the model)
>>
>>101681998
karras does work for dev, it just requires way more steps to converge
you can't get away with 20 like you can when you're using simple or sgm_uniform, it has to be 50 minimum with karras

(it took me a while to discover this because that's the opposite of how it works for SDXL, where karras is the 'faster' scheduler)

also that other anon seems to have been correct that when you use karras, you get way better art styling, which is fucking WEIRD. the scheduler should not control that
>>
File: flux.png (1.2 MB, 1024x1024)
1.2 MB
1.2 MB PNG
>>
ow nice, finally diorama views with individually individually recognizable features and not blurry mess

>>101682046
ya I get you, doing that still to, just to see how much of booru tagging sneaked into it, or confure the model abit .. still the lack of recognized anime artists is sad, but I guess finetunes will arrive. God bless its apache2.0, no one can take it away from us again
>>
File: ComfyUI_00009_.png (1.28 MB, 1456x720)
1.28 MB
1.28 MB PNG
>>101682046
>1980s, retro, vintage_anime, anime_(1980s), takahashi_rumiko, adachi_mitsuru, hojo_tsukasa, hagiwara_kazushi, buronson, kurumada_masami, toriyama_akira, takahashi_yoichi, 1girl, pink_hair, space_suit, floating_in_zero_gravity, surrounded_by_aliens, playing_chess_with_a_robot, holographic_chessboard, neon_lights, space_station_window, asteroids_passing_by, distant_planets, glowing_computer_screens, shiny_metal_surfaces, high-tech_gadgetry, futuristic_headset, sparkling_star
>>
File: flux.png (1.08 MB, 1024x1024)
1.08 MB
1.08 MB PNG
>>
File: sgm vs karras.jpg (808 KB, 2304x768)
808 KB
808 KB JPG
>>101682056
>acrylic painting of a cat
left: euler sgm_uniform 50 steps
right: euler karras 50 steps
>>
File: flux.png (1010 KB, 1024x1024)
1010 KB
1010 KB PNG
>>
File: flux.png (819 KB, 1024x1024)
819 KB
819 KB PNG
>>
File: ComfyUI_00011_.png (1.28 MB, 1456x720)
1.28 MB
1.28 MB PNG
>>
File: flux.png (878 KB, 1024x1024)
878 KB
878 KB PNG
love that it has anonymous with the v for vendetta mask
>>
>Even people in the background are perfectly generated.
Finally
>>
>>101682084
can you change scheduler mid generation? or alter with every step.
>>
File: Flux_00195_.png (1.04 MB, 1024x1024)
1.04 MB
1.04 MB PNG
>>
>>101682084
I think we unlocked the styles at flux-dev, just use karras bro! kek
>>
>>101682067
how many steps anon?
>>
File: flux.png (1.03 MB, 1024x1024)
1.03 MB
1.03 MB PNG
>>
File: flux.png (774 KB, 1024x1024)
774 KB
774 KB PNG
ai pandering
>>
>>101681831
moar
>>
>>101682134
10 on schnell
>>
>>101682192
Does 6 work? I was finding schnell tended to stop improving at that point.
>>
File: 00_09.jpg (320 KB, 1552x1200)
320 KB
320 KB JPG
>>101682188
>>
>>101682211
same on 6 .. maybe less details? also my PC just crashed.. not getting the same result on 10 either, grr
>>
>>101682261
for the science .. this on 10
>>
>>101682261
>>101682192
what kind of prompt are you using? I'll do it on 50, i'm curious if it will fix the fucked up cars and other oddities
>>
File: 00042-1355492164.jpg (343 KB, 1552x1200)
343 KB
343 KB JPG
>>101682272
>>101682261
VERY cool
>>
>>101682287
would be:
>utopian science fiction city, diorama isometric view, in the style of Akira Toriyama, anime, capsule corp.
euler, normal scheduler, cfg 1.0
>>
>>101681353
My one grip with flux (aside from the artist issue) is that it also produces SD 1.5 faces unintentionally, just like all SDXL mixtures and like SD3 did on release (almost as if Lykon had his hands on the model). In comparison, Pixart doesn't do that
>>
File: ComfyUI_00035_.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>101682299
Ok will try that, this gen got the view wrong.
>>
>>101682309
ya you need to combine isometric view with diorama, or you will get a random perspective, also pic related is Toriyama replaced with Giger
>>101682289
thx
>>
File: ComfyUI_00019_.png (855 KB, 720x1080)
855 KB
855 KB PNG
>>101682306
Supposedly flux was made by same team that made SD3 (that left when SAI went collapsed). Now, this is not necessarily a bad thing, since it can do both SD-slop and SD-nonslop as it should, though right now it can do only do so its own. It just means we need to tune the alignment away, after all this is a DiT model.
>>
File: ComfyUI_00020_.png (1.29 MB, 720x1080)
1.29 MB
1.29 MB PNG
>>
I am just amazed that even tho the resolution isnt that great it gets every individual stair step correct
>>
>>101682406
Yeah, the model is extremely good with fine detail and stuff in the background
>>
>>
>>101682430
very nice
>>
Is there a certain way to prompt with flux, or is it the same as SDXL with just keywords seperated by commas?
>>
>>101682406
Can you add some utility poles and power lines? SD usually shits the bed with these
>>
File: ComfyUI_temp_ufsvg_00035_.png (1.19 MB, 1024x1024)
1.19 MB
1.19 MB PNG
>>101682306
That won't be an issue for long if it's easy to finetune.
>>
>>101682439
100% natural text, describe how you want the image to look. It's using a fully featured language model though, so it is pretty flexible and can incorporate tags if need be.
>>
>>101682459
I see. Thanks!
>>
What artists does flux know.
I tried rembrandt and it doesn't seem to work.
I'm using "In the style of Rembrandt van Rijn", should I word it differently?
>>
File: tris_9.jpg (936 KB, 1552x1200)
936 KB
936 KB JPG
Nice gens this morning. Lotta creativity.
>>
>>
>>101682445
does it reasonably well, I guess I have to remove some of the artists to make less of a spider web, the prompt became a mess now
>utopian science fiction city, diorama isometric view, in the style of Moebius, Jodorowsky, style HR Giger, style Zdzisław Beksiński, anime, in the desert, dune, Arrakis, brutalist architecture, Sandwurm, style Lynch, style Kubrick, overland power lines,
>>
File: ComfyUI_00023_.png (779 KB, 720x1080)
779 KB
779 KB PNG
>>101682449
>Finetune in progress
I hope. Once the code releases for Kohya a simple LoRA or 2 might do.
>>
>>101682445
cleaned up, ya impressive
>utopian science fiction city, diorama isometric view, in the style of Moebius, tyle HR Giger, anime, in the desert, dune, Arrakis, brutalist architecture overland power lines, utility poles
the place basically became a dune power plant now
>>
File: ComfyUI_temp_ufsvg_00071_.png (2.17 MB, 1024x1024)
2.17 MB
2.17 MB PNG
>>
File: EXJ6wb3XYAE0agw.jpg (88 KB, 785x1000)
88 KB
88 KB JPG
>tfw 12GB Vramlet
It's over for me isn't it
>>
File: ComfyUI_00034_ .png (1.53 MB, 1024x1024)
1.53 MB
1.53 MB PNG
>A beautiful acrylic painting of batman and spider-man sitting at a bar having a drink. Wolverine walks through the door while holding a watermelon. Wolverine yells "Wubalubadubdub!!"
Didn't do the speech bubble unfortunately, but this model is going to revolutionize memes. Especially once some good finetunes come out.
>>
>>101682550
yes
>>101682551
add speech bubble and he might actually do it
>>
>>
>>101682550
You can run it without 24gb vram as long as you have a lot of ram.
I'm running it with 10gb vram and using about 20gb of my 32gb ram.
>>
>>101682563
>add speech bubble
I didn't in my previous prompts and it worked fine, but I'll try.
>>
>>101682550
maybe fp8 works? maybe if you load t5 to system ram?
https://huggingface.co/camenduru/FLUX.1-dev/tree/main
>>
File: ComfyUI_00038_.png (1.81 MB, 1024x1024)
1.81 MB
1.81 MB PNG
>>101682574
I'm not having any look with the view. Perhaps i just need keep going through seeds, or perhaps its because I'm using 50 steps.
>>
>>101682550
just make a big swap partition or swap file. Itr works on my 12GB VRAM and only 16GB system RAM, but I've got a ridiculous swap partition setup on an old SSD, and its working fine. The only time I get issues with RAM on this system is when I start loading up like 3 SDXL models along with everything else with multiple ksamplers. Then I usually only have to close everything down and start fresh with just the one Comfy tab open.
>>
>>101682550
>vramlet
lol, im stuck on my 6gb laptop for months without reasonable funds to upgrade so I can only play with this shit on their free demo thing. it doesn't feel very cucked though to b fair.
>>
File: ComfyUI_00039_.png (1.66 MB, 1024x1024)
1.66 MB
1.66 MB PNG
>>101682670
>he only time I get issues with RAM on this system is when I start loading up like 3 SDXL models along with everything else with multiple ksamplers. Then I usually only have to close everything down and start fresh with just the one Comfy tab open.
In fact I think I eliminated that problem entirely when I set my fstab to contain discard

UUID=xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx none swap discard 0 0

That enables the SSD to do its trim routine properly on swap partitions
>>
File: flux.jpg (134 KB, 1024x1024)
134 KB
134 KB JPG
>>
Holy shit, I figured it out. It's T5 that doesn't know styles and artist names. But CLIP knows them! It was always good for that (the whole Greg Rutkowski thing from the 1.5 days was due to CLIP).

So, you put your artist name and/or desired art style in the CLIP prompt box.
And in the T5 box, you only put your desired CONTENT. Don't say anything about the style or the artist to T5, because it doesn't know what you're talking about and it's just gonna make some shit up instead. Overwhelming CLIP's contribution and giving you the wrong style or the "slopped" look.

CLIP box: Style/artist name
T5 box: desired image content only

This almost completely eliminates the inability to understand styles or recognize artist names in my testing so far. It was never the model itself that didn't know, it was just T5. With the above method it's now recognizing all kinds of names and styles that it seemed to ignore before.
>>
File: ComfyUI_00038_.png (861 KB, 720x1080)
861 KB
861 KB PNG
>>101682551
Yep, this is the moment we have all been waiting for. Isn't quite Dalle tier yet in terms of total concept knowledge but 80% of the way there. Crazy how the model just dropped casually kek, looking forward to v2 of this model.
>>
File: Flux_0027.jpg (81 KB, 1024x1024)
81 KB
81 KB JPG
>>101682740
it's better in terms of prompt following and stuff
>>
File: ComfyUI_Flux_0357.jpg (157 KB, 1024x1024)
157 KB
157 KB JPG
>>
File: Capture.jpg (347 KB, 2721x1265)
347 KB
347 KB JPG
>>101682736
Meh
>>
>>101682509
cute
>>
>>101682639
you using dev? I tried it there .. will work sporadically, on schnell it works 90% of the time

pic related 10 steps, euler on schnell
>>
>>101682736
Big if true. Share workflow in ComfyUI
>>
>>101682639
>>101682785
while this is 40 steps euler, same seed, same prompt on dev
>>
File: sneed.jpg (258 KB, 1024x1024)
258 KB
258 KB JPG
>>
File: file.png (1012 KB, 1024x1024)
1012 KB
1012 KB PNG
>Gameplay screenshot of a 2D pokemon game. It's a battle between two creatures, one on the left is a large blue alligator and the other one is a large red anthropomorphic chicken with one leg in the air
Kek
>>
File: ComfyUI_Flux_0359.jpg (169 KB, 1024x1024)
169 KB
169 KB JPG
>>
>>101682795
yeah i'm using dev
>>
File: Capture.jpg (312 KB, 2603x1392)
312 KB
312 KB JPG
>>101682805
What settings are you using? can't seem to get that style at all
>>
>>101682521
>>101682541
huge improvement, ty for testing
>>
File: ComfyUI_00032_.png (979 KB, 1024x1024)
979 KB
979 KB PNG
>>101682795
I'm gonna try reordering the prompt, since I had to do that to get it to understand from behind views. I had to use 'from behind' for image related.
>>
>>101682808
+1 for lotus attempt
really cool gens, I am hyped.
>>
File: photo_2024-08-01_20-47-05.jpg (190 KB, 1024x1024)
190 KB
190 KB JPG
>>101682824
I think I just fucked up with the settings, because is my 2º time with Comfy
https://files.catbox.moe/ol9h9k.png
You capture resembles better the output of pro
>>
>>101682736
heck you are onto something there, I am sure knows some artists, but this way you can force it on the whole prompt and txxl5 doesnt just think you are rambling on how Van Gogh is portraited in atmospheric ligh.. 10/10 important post

pic related, Giger in CLIP
>>
File: ComfyUI_00046_.png (1.08 MB, 832x1216)
1.08 MB
1.08 MB PNG
>>
File: Capture.jpg (597 KB, 3105x1684)
597 KB
597 KB JPG
>>101682736
>Using clip for styles
>Euler Karras 50 steps
we're not there but we're getting close
>>
>>101682232
please share workflow or prompt?
>>
Can someone catbox their comfy workflow with the seperate clip artist prompt?
I'm new to comfy and have absolutely no clue how to get another clip box.
>>
>>101682851
holy fuck you use t5xxl fp8? this text encoder gets really bad at 8bit, I'm really surprised you got this gen with that lol
>>
File: ComfyUI_temp_ozzud_00033_.png (1.49 MB, 1024x1024)
1.49 MB
1.49 MB PNG
>>101682551
>Wubalubadubdub
>>
File: ComfyUI_Flux_0385.jpg (162 KB, 1024x1024)
162 KB
162 KB JPG
has anyone tested if it can do upscales? is it a SD3 situation where only ultimate SD upscale is viable?
>>
>>101682881
Here anon: https://files.catbox.moe/efb6nn.png
>>
with "woman by da vinci" in txxl5 I get vaguely rennaince woman, but if do both: "woman by da vinci" in txxl5 and add "by da vinci" in clip I basically get the mona lisa instantly, I guess flux wants you to dual text encode for best effect
>>
>>101682851
yeah I just tried your settings I'm not getting what I want, I guess you got really lucky with that seed kek
>>
>>101682881
Not that anone but I messing around with it now, double click anywhere empty and search cliptextencodeflux top box is the style, second box is what you want in the image. Connect clip to clip and conditioning to positive on the Ksampler. I think that is how they are doing it.
>>
>>101682938
Thanks!
>>
>>101682913
you can render at very high resolution without duplication issues on flan, it's making upscale obsolete I guess
>>
BOFT seems to require 20gigs of vram for training (or it just doesnt work well with prodigy)
>>
For those with 24gb vram cards and are doing a 8bit DiT + 16bit text encoder, I highly suggest you to go for the --highvram command, it has enough room to load the both of them and it makes everythng way faster
>>
File: ComfyUI_Flux_0403.jpg (122 KB, 1024x1024)
122 KB
122 KB JPG
tattoos aren't a complete mess. I love this
>>
>>101682957
I see. I'll have to mess around with it for a bit to get a grip on how it works. Thanks!
>>
File: Capture.jpg (436 KB, 2603x1420)
436 KB
436 KB JPG
>>101682871
What have I done?? kek
>>
>>101683000
>--highvram command

How do I do that? Just as I load comfy?
>>
>>101682574
updated this to dual text encoder with styles enforced in CLIP

>>101682736
thank you so much anon you just made my day thank you very very .. this very important information on handling flux!
>>
>>101683000
nice tip thanks, no change to step speed but much faster prompt processing
>>
What step speeds are people at 1024x1024? It's not great on a p40, about 20 seconds per iteration.
>>
File: aa.jpg (203 KB, 1852x1144)
203 KB
203 KB JPG
>>101683024
you do that if you're using the "fast" method
>>
>>101683042
about 1.4 -> 1.7 secondes per iteration on my rtx 3090
>>
>>101683042
1.15 it/s on my undervolted underclocked 3090
>>
>>101682871
You could also maybe try these useful nodes?
>>
Is the quality difference between schnell and dev very obvious or is it relatively small?
>>
File: ComfyUI_temp_ozzud_00065_.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>101682752
>>
>>101683069
I let you try it anon, you can download my workflow here >>101682938
>>
So what models do i download if i have a 3090? Just the 23gb dev file and run with it?
>>
>>101683087
still debating on that, some stuff seems to work even better on schnell as the isometric stuff posted above, as schnell is the one on apache2.0 I guess we will see the finetunes on that?
>>
>>101682752
But can it do girls wearing bikini or underwear without huge boobs? I haven't managed to, so far.
>>
File: ComfyUI_Flux_0413.jpg (269 KB, 1920x1080)
269 KB
269 KB JPG
1080p 168s on a 3090
>>
>>101683115
follow this guide, you need quite a few files
>https://comfyanonymous.github.io/ComfyUI_examples/flux/
>>
File: ComfyUI_15184_.png (2.39 MB, 1024x1024)
2.39 MB
2.39 MB PNG
>>
File: image.png (2.39 MB, 1024x1024)
2.39 MB
2.39 MB PNG
I still cannot believe we got dalle3 at home just like that
>>
>>101683135
ashton kutchers money was holding us back all this time
>>
File: ComfyUI_15185_.png (2.32 MB, 1024x1024)
2.32 MB
2.32 MB PNG
>>
File: ComfyUI_Flux_0415.jpg (205 KB, 1920x1080)
205 KB
205 KB JPG
do we know if flux comes with a 16ch vae?
>>
>>101683164
says it in the release, yes it does.
>>
Guidance down to 2.5 (not cfg, the guidance dial on the text encoder) seems to help reduce the slopped look for art gens too.
Only seems to work on dev, changing the guidance does nothing on schnell.
>>
File: ComfyUI_Flux_0409.jpg (118 KB, 768x1344)
118 KB
118 KB JPG
>>101683171
ty anon
>>
Flux looking hot
>>
File: flux.jpg (92 KB, 1024x1024)
92 KB
92 KB JPG
>>101683214
absolutely so
>>
I hope finetuning/lora training isn't too bad so we can maybe get some good stuff this month.
>>
File: ComfyUI_Flux_0419.jpg (181 KB, 1920x1080)
181 KB
181 KB JPG
1080p 25 steps deis sgm_uniform 86seconds
>>
>>101683246
i have a feeling the t5xxl is gonna make shit really painful to train in
>>
File: flux.jpg (95 KB, 1024x1024)
95 KB
95 KB JPG
>>
File: flux.jpg (86 KB, 1024x1024)
86 KB
86 KB JPG
>>101683257
>i have a feeling the t5xxl is gonna make shit really painful to train in
>>
File: ComfyUI_00217_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>A woman doing breakdance at Paris
I prompted that in french and it understands that well, nice
>>
File: flux.jpg (63 KB, 1024x1024)
63 KB
63 KB JPG
>>
>>101683257
You can process the prompts and then never touch T5 again for the remainder of the training run.
>>
File: ComfyUI_05351_.png (1.45 MB, 1024x1024)
1.45 MB
1.45 MB PNG
>>101682803
>>
File: ComfyUI_Flux_0431.jpg (278 KB, 1920x1080)
278 KB
278 KB JPG
>>
File: ComfyUI_00183_.png (714 KB, 1024x1024)
714 KB
714 KB PNG
How do I fix the blurriness in some gens with flux?
>>
>>101683409
dont use cfg at all, flux doesnt support it set it to 1.0, use euler and normal scheduler
>>
>>101683383
I remember shitposting done on sdg with this prompt when dalle released, retards were coping with muh controlnet
flux really is great even if it has styling issues.
>>
How are we gonna cope with the fact that we'll never be able to use negative prompt on flux-dev?
>>
File: ComfyUI_Flux_0435.jpg (334 KB, 1920x1080)
334 KB
334 KB JPG
flux team suffering from success
>>
File: ComfyUI_Flux_01802_.png (1.44 MB, 1024x1024)
1.44 MB
1.44 MB PNG
My Hunyuan style selfies still work with Flux, just gotta find the proper realism keywords or perhaps tune it, nice.
>>
>>101682760
has the saem issue as Dalle where the skin seems too smooth. Otherwise this is an amazing model, can't wait for people to figure out how to finetune it for more skin details etc
>>
File: ComfyUI_Flux_01803_.png (1.39 MB, 1024x1024)
1.39 MB
1.39 MB PNG
>>101683488
Not quite as good but if I find the right keywords I will get there
>>
File: ComfyUI_Flux_01800_.png (1.63 MB, 1024x1024)
1.63 MB
1.63 MB PNG
>>101683509
So far it gives me what I want but it's anime, guess it still somewhat works out
>>
File: ComfyUI_00199_.png (1.24 MB, 1024x1024)
1.24 MB
1.24 MB PNG
>>101683409
>>101683425
Thanks! That and using the clip_l as the style prompt and t5 as the content prompt made all the difference, with upping the steps to 50.
>>
>>101683455
by using the positive prompt more intelligently? I mean to exclude things from the image, words like 'alone' or 'uncrowded'. Be more specific in your prompts. I rarely use negative prompts these days since using pony models, and this thing. It well... it shits all over pony, I don't have to keep spamming queue for imperfections.
>>
>>101683455
is this true, or overexaggerated doombait? because i don't think it's good enough to do away with negative prompts entirely. already there's a bunch of ugly slopstylization i'd like to try and neg out.
>>
File: ComfyUI_Flux_0449.jpg (200 KB, 1080x1920)
200 KB
200 KB JPG
>>
File: ComfyUI_Flux_0457.jpg (235 KB, 1080x1920)
235 KB
235 KB JPG
>>
>>101683586
no it's true, this model can't make good pictures at other cfg than 1, and cfg = 1 also means you cdan't use negative prompts
>>
File: file.png (18 KB, 359x273)
18 KB
18 KB PNG
My comfy just crashes when I attempt to run the workflow
>12 GB VRAM 32 GB RAM
>using fp8 vae and weights
Please help before I shill out 800 bucks for a 3090
>>
>>101683711
>My comfy just crashes when I attempt to run the workflow
make sure you have 24 or so gb free on your C: drive
>>
>>101683683
cfg works, it just needs to be figured out
see the web demo
>>
>>101683683
>cfg = 1 also means you cdan't use negative prompts
you're pulling this out of your ass?
>>
>>101683717
Wait what, why? There's no way I can make that happen, I have like 10 there that I keep having to free every now and then
>>
>>101683726
comfyui uses it while inferencing. it will not work without said space
>>
>>101683725
you don't know how CFG works clearly
>>
>>101683725
no, that been confirmed, just try it for yourself, negative tags have zero impact
>>
>>101683736
Huh, I wasn't aware. I'll see what I can do. Or just boot into my linux repo.
>>
>>101683742
neither do you. one of the devs claim that cfg is not even actually used for flux.
>>
>>101683751
that's what CFG 1.0 means you absolute retard
>>
File: ComfyUI_00047_.png (1.68 MB, 1024x1024)
1.68 MB
1.68 MB PNG
>>101683099
I just tried the conditioning (concat) node and it seems to work decent. The image is not exactly what i want because the dev model does not seem to understand isometric or Birdseye view but it seems to get the art style correct. So the way it works is you connect the image prompt to the conditioning_from connection and then the art style to the conditioning_to connection. This way the image is first considered then the art style is applied second, basically how concat works.
>>
>>101683766
give us one of your outputs through a catbox anon, so that we'll be able to get your workflow
>>
>>101683751
setting a cfg to zero will force it to use the negative prompt which in this case won't work so that is why cfg is set to 1
>>
>>101683817
give me a second, i'm testing it on this example to see if its working as intending >>101682871
>>
>>101683817
ok its not working, damn it, its the style but not the right character, maybe i need to tweak some things. but the pic is the basic connections i've done, which I think would be the correct way if I was using an SDXL model, perhaps i need to flip them around or use a different method like average or combine instead of concat.
>>
>>101683671
>>101683605
Nice, those look professional though, fake skin. Best I've gotten so far is >>101683488 everything else looks a bit unrealistic I guess, some higher failure rate but I'm sure it's due to censorship E.G. a simple LoRA can fix this,
>>
Are there any loras or models that allow finer control over facial features, or is that new flux thing better at it? I want to create characters but SD and XL just do sameface
>>
>>101683953
prompt facial features, try a different model, prompt some obscure famours person and use that as your personal sameface..

and no for FLUX there is nothing like loras or anything yet .. sameface there? hm remains to be seen once anon wrangled the model enough, but I guess not as much as SDXL finetunes
>>
Is it possible to combine flux with tensorRT for performance gains?
It works on 12 GB, but yeah it's pretty fucking slow, uses like 8 GB of system ram at fp8.
>>
File: ComfyUI_15261_.png (1.04 MB, 1024x1024)
1.04 MB
1.04 MB PNG
dual wielding
>>
File: file.png (1.04 MB, 1024x1024)
1.04 MB
1.04 MB PNG
>>
>>101683983
Maybe it's just the models I've tried but none of them offered much control about facial features and didn't seem to know the terms required. Fiddling around with celebrity faces does a bit more but it's still just a crutch at best
>>
>>101681601
like sdxl was?
>>
>>101683098
>>101682752
how do you get it to write something?
>>
>>101684057
nta but you need to explicitly state that there's a speech bubble
>a speech bubble above her head says "pixart was here"
>>
File: ComfyUI_00050_.png (1.81 MB, 1024x1024)
1.81 MB
1.81 MB PNG
>>101683817
ok the ConditioningAverage node is definitely working at 0.5 strength. doing it at 0.3 strength now, will post that image on catbox once its done so you can see the difference, the concat node does nothing to the image, no change when removed.
>>
File: goo_00041_.png (1.03 MB, 1024x1024)
1.03 MB
1.03 MB PNG
>>101684057
this >>101684091
or simply
>saying "Some text."
or
>caption "THIS AND THAT"
>>
File: ComfyUI_00049_.png (1.42 MB, 1024x1024)
1.42 MB
1.42 MB PNG
pic related is base image
https://files.catbox.moe/ae4ovc.png
link is 0.3 strength for averaging
>>
File: ComfyUI_00051_.png (1.66 MB, 1024x1024)
1.66 MB
1.66 MB PNG
>>
File: ComfyUI_temp_ozzud_00210_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>101684057
I prompt it like
>neon sign with text "
>speech bubble with text "
>title with text "
>etc
>>
I thought CFG is supposed to be set to 1, should I keep it at 3.5 instead?
>>
Just cancelled my ideogram sub. Flux is so good I don't need cloud shit anymore.
>>
>>101684402
no, cfg is pointless, keep it on 1
>>
>>101681353
>local image gen that is literally better than the best closed source corporate models
Leftist journalists are going to crush Flux once this gets out. This will be over before it even begins.
>>
File: ComfyUI_00020_.png (934 KB, 1024x1024)
934 KB
934 KB PNG
>>
This thing does not like to denoise until 0.95 when routing to a second ksampler with an entirely different prompt. Why would that be? This is odd.
>>
>>101681840
3xA100 server by the minimum just for a fine-tune

A Lora requires a single A100
>>
File: ComfyUI_00053_.png (882 KB, 1024x1024)
882 KB
882 KB PNG
>>
File: hovercar.jpg (760 KB, 1344x1536)
760 KB
760 KB JPG
Top is Flux Pro on Replicate, bottom is DALL-E 3 on Microsoft Designer, this is the prompt:
>Detailed, realistic oil painting, 1990s science fiction illustration, waist up view of a female science fiction adventurer wearing futuristic clothes, looking at the camera, posing in front of a hovering sleek futuristic concept hovercar, on a street with futuristic pedestrians in a science fiction cityscape with sleek futuristic architecture

DALL-E still has more of the styles I want from non-photorealistic images. I think Flux was intentionally trained not to replicate traditional art drawn by humans: it's either photorealistic or it has a very smooth digital art style.
>>
>>101684478
Why? Artist names and styles don't work well at all and it doesn't know celebrities.
>>
>>101684679
nta but it absolutely does know at least the few big political ones. agree on styles though.
>>
File: 2024-08-02_00074_.png (1.26 MB, 1024x1024)
1.26 MB
1.26 MB PNG
>>101684665
>I think Flux was intentionally trained not to replicate traditional art drawn by humans:
I think blackforest learnt alot from SAI scandals and problems.. so they are leaving the dirty work for the finetunes. Smart move.
>>
>>101684708
meanwhile midjourney still making hundreds of millions per year off artist tears
>>
>>101684723
yea spent well on lawyers I guess
>>
>tfw 4080
>tfw automatically converted to low vram mode
wew lad, flux is a bit mental
>>
>>101684767
praise NVidia for being greedy on VRAM!

if the 5090 really only has 28GB I gonna nerdrage
>>
File: ComfyUI_00068_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
>>
>>101684679
Being open means it will eventually know much more (granted it is easy to train). Instead of pony types what we focus on first is artist finetunes, both Japanese and Western ones, also art/photo movements and photography. Ideally such tunes do not decide or converge on a specific style since their purpose is to teach. Then we do the pony.
>>
>>101681583
Poor furries scamming rich furries
>>
File: 1790653.png (2.97 MB, 1472x1472)
2.97 MB
2.97 MB PNG
>>101684767
same on 4090 though
>>
File: ComfyUI_temp_ozzud_00273_.png (1.42 MB, 1024x1024)
1.42 MB
1.42 MB PNG
>>101684478
There is nothing "journalists" and politicians can do, the weights are out, distributed over tens of thousands of machines all around the world. Even if they can drum up a big enough moral panic for HF and other mainstream providers to ban it, it's gonna continue underground.
>>
>>101684783
>>101684812
model is fucking sick though. After months of stagnation and SAI cuckery we finally have something that mogs DallE.
Just need to make it run on consumer hardware though
>>
>>101684478
this >>101684818
also.. Apache 2.0 .. its joever
>>
>>101684818
Can't steal artist styles out of the box, the only argument they have left is cunny, but it's not a strong one.
>>
>>101684834
>consumer hardware though
wont happen now, this will be reverse, the hardware you need to run it will be consumer in a year or three from now
>>
its good, but not 12b good. i shall remain waiting for bigma.
>>
>>101684854
all it needs is a severely autistic person to optimise with wizardry it and we're golden.
>>
File: 129615147.png (1.69 MB, 1024x1024)
1.69 MB
1.69 MB PNG
>>101684834
which shows how shitty SAI is, still, the better is always to come, this model is bad as a foundational model, it barely knows any art, its heavily tuned on "aesthetic" crap because its intended to be a consumer product for their generation services
>>
>>101684880
wah wah wah that's what fine tunes and loras are for.
>>
>>101684878
yea will probably happen, and for 1024x1024 FLUX.schnell meme production that the masses crave you will probably see it sooner rather than later
>>
>>101684880
>it barely knows any art
Ipdapter exists though. It technically knows all, so as a tool this is the best model, and I'm sure someone will figure out something for artists that is more efficient than IPAdapter or LoRAs, I mean especially with this model it's become a problem for this and every DiT model going forward.
>>
File: ComfyUI_00008_.png (1.94 MB, 1024x1024)
1.94 MB
1.94 MB PNG
>>101684919
there is no implementation because T5 conditioning has no visual understanding of anything it's trained on. clip is the only way to do it proper but clip l is very small
>>
https://huggingface.co/camenduru/FLUX.1-dev/tree/main

fp8 of flux dev
>>
>>101684892
>that's what fine tunes and loras are for.
Why even wait for companies to release a model then if we can just train one? lol
>>
>>101684996
Go ahead and make a 12B model from scratch
>>
>>101684991
What does this mean
>>
>>101685007
Well, your answer to "this model lacks the knowledge of basic public domain art" is "thats was fine tunes are for" then a model missing everything also just needs to be fine-tuned
>>
>>101685026
>nogen
Seriously, move on. There's plenty of models that know your esoteric art styles.
>>
>>101685020
lower precision and smaller model size, therefore more gpus can run it. we have a fp8 loader node so it doesnt really matter
>>
>>101685042
>>nogen
I accept you concession, thank you very much
>>
>>101685055
You're here in bad faith, you'll always find something to bitch about because you're negatively biased and want models to fail.
>>
>>101685066
I can see it in 6 months
>The urethras on my cockgens are a couple of mm off what I expect, dogshit model
>>
File: ComfyUI_01557_.png (1.13 MB, 1152x896)
1.13 MB
1.13 MB PNG
>>101685094
>>
>>101685042
>>101685055
>nogen
I need more time!
>>
cozy thread today
>>
>>101685053
It doesn't work.
>>
File: 2619579.png (1.5 MB, 768x1024)
1.5 MB
1.5 MB PNG
Here, a image so the niggers can understand it visually
>>101685133
cute
>>
>>101685219
>retarded poster
The best part of seeing gens is you can tell when someone is autistic and retarded. It's like if I saw you IRL and seeing you're an ungroomed 400 pound man I can disregard everything you have to say.
>>
>>101685219
>not foundational
can't say for certain just yet, we'll just have to wait and see what happens. fingers crossed it's not another sdxl.
>>
>>101685133
>>101685243
After all that time I can't even post it because nipples
>>
File: ComfyUI_temp_ozzud_00350_.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>101685112
>>
>>101685266
I can say. If it's easily tuneable it's the new SD, 100%. Reddit is shitting and cumming over it.
>>
File: Y3WhuTEp87zykAaQOohBq.jpg (147 KB, 1024x768)
147 KB
147 KB JPG
Which anime was this from again?
>>
File: 3076.png (1.3 MB, 512x1568)
1.3 MB
1.3 MB PNG
>>101685243
cope lmao
>>101685266
>can't say for certain just yet
Its incapable of making heavily abstract or classical paintings because its turned to hell on aesthetic images, yeah if you train anime or porn on it will work, but thats very far from my point, though this is the only thing retards care about so its not like it makes sense to talk about anything else it seems
>>101685243
here, a (you) to make your day a little more brighter
>>
>>101685305
>wah it doesn't know my fetish art, here's an image with my lack of taste and creativity
Thanks sweaty 400 pound man for your input
>>
File: 6.png (719 KB, 1128x528)
719 KB
719 KB PNG
>>101685314
>impasto, piet mondrian, or any fucking painting style that isnt hyper ultra realistic slop
>my fetish art
ok then, you win from mental retardation
>>
File: 2024-08-02_00131_.png (2.05 MB, 1280x1280)
2.05 MB
2.05 MB PNG
>>101685314
nta, but atleast it knows my fetish art
>>
File: J8CNdBG5HKywhtrY38p1o.jpg (183 KB, 1024x768)
183 KB
183 KB JPG
>>
File: FLUX-Schnell_00008_.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
>>101685305
>>
File: ComfyUI_temp_ozzud_00388_.png (1.13 MB, 1024x1024)
1.13 MB
1.13 MB PNG
>>101685288
>>
>>101684880
try dropping guidance on -dev to 1.0-1.5 if you're having issues with style/prompt adherence
>>
File: file.png (2.33 MB, 1280x800)
2.33 MB
2.33 MB PNG
>>101685351
>it doesn't know fine art styles but here's my generic anime girl slop showing I don't try or care I just bitch because really my motive is I want the model to fail
>>
File: catbox_6bckyr.png (2.01 MB, 1024x1024)
2.01 MB
2.01 MB PNG
>>101684185
Not sure why but the conditioning node doesn't want to play nice but using the "split between clip_l and t5" method works really well. But I am also running it on schnell with 6 steps for quick gen purposes.
>>
File: 77383284622.png (1.15 MB, 1024x1024)
1.15 MB
1.15 MB PNG
>>101685367
now try making a heavily abstracted person/portrait like 1.5 and SDXL can do
>>
File: 1268928582345035848_1.jpg (307 KB, 1024x1024)
307 KB
307 KB JPG
:eye:
>>
File: FLUX-Schnell_00010_.png (1.57 MB, 1024x1024)
1.57 MB
1.57 MB PNG
I keep getting signatures that look like real signatures. I wonder if they are.
>>101685405
try licking my ass
>>
File: ComfyUI_00046_.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>101685405
>>
>>101685400
Yeah its not working right, completely loses the character. using another Ksampler for second pass different prompt does not work either. I might try IPadapter to transfer style if it even works, after some rest.
>>
Morning bread ready to eat...

>>101685374
>>101685374
>>101685374
>>
>>101685398
I already tried, it can't make impastos or classical art, if I hadn't tried I wouldn't have said it, its extremelly easy to achieve a painterly, textured with heavy brushstrokes image on 1.5/XL and a uphill battle on FLUX
>>101685421
Thats not a person, also doesn't work with animals, or any of those popular subjects, its always heavily shaded and defined
>>
File: 2024-08-02_00174_.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
Yea! Beksinki signed this.
>>
>>101681891
4 GB VRAM, 16 GB RAM
>>
>>101685266
>can't say for certain just yet, we'll just have to wait and see what happens. fingers crossed it's not another sdxl.
this model lacks a lot of styles, artists and characters, fortunately that can be salvaged with more training, they did the hard part, good anatomy, good prompt understanding, good textures, this model has so much potential



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.