Discussion of free and open source text-to-image models

Previous /ldg/ bred : >>102964600

Very Opposite Opinion Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://tensor.art/models
https://liblib.art
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3

>SD3.5
https://huggingface.co/stabilityai/stable-diffusion-3.5-large
https://replicate.com/stability-ai/stable-diffusion-3.5-large

>Sana
https://github.com/NVlabs/Sana
https://sana-gen.mit.edu

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>102974667with the Van Gogh style lora, and trigger.
Blessed thread of frenship
>>102974909LyingSigmaSampler experiments.
>>102975105You see a guy with a paintbrush, stick him with a knife.
I'm an artist.
Can someone explain how the denoise steps work? Those are done by the samplers?
>>102975152
>LyingSigmaSampler
I've seen the examples where it appears to generate "more details" and even lessen the blur present in Flux, but it gives images an unpleasant appearance IMO.
>>102975178i think the same but haven't messed around with it myself yet
>>102975152
another
>>102975178
imo this stuff will be basic to ai images soon enough. The guy who made it is cloning DemonDetailer of A1111 or whatever they're called. I don't yet understand it. The current version has simple settings. It's a multiplier that applies to a portion or all of the steps. Rather obviously, it could be done in more sophisticated ways.
>>102975220
image.
>>102975220
um actually not a mult, an adder? idk, but it can't hit plus of 100%
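For anons trying to picture what "lying" about sigma could mean: a minimal sketch, assuming the trick is just telling the model a slightly different noise level than the real one for part of the run. Everything here is hypothetical (`make_lying_model`, `dishonesty_factor`, the sigma window, the toy model); it is not the actual node's code.

```python
def make_lying_model(model, dishonesty_factor=-0.05,
                     start_sigma=float("inf"), end_sigma=0.0):
    """Wrap a denoiser so it is told a scaled sigma inside a sigma window.

    All names are hypothetical; this only illustrates the idea.
    """
    def wrapped(x, sigma):
        if end_sigma <= sigma <= start_sigma:
            # The sampler still steps with the real sigma,
            # only the value reported to the model changes.
            sigma_told = sigma * (1.0 + dishonesty_factor)
        else:
            sigma_told = sigma
        return model(x, sigma_told)
    return wrapped


def toy_model(x, sigma):
    # Stand-in for a real denoiser: just echoes what it was told.
    return (x, sigma)


lying = make_lying_model(toy_model, dishonesty_factor=-0.05,
                         start_sigma=10.0, end_sigma=1.0)
print(lying(0.0, 4.0))   # inside the window: model hears a smaller sigma
print(lying(0.0, 12.0))  # outside the window: model hears the real sigma
```

Written this way, multiplying by (1 + f) is the same thing as adding f * sigma, which might be why it reads as either a mult or an adder.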
>I'm currently convinced they nuked all popular celebrities from their training sets to dodge controversy.
They used a VLM to erase the tags that mentioned the celebrities' names and replace them with tags that didn't, but I reckon they did it because they were retards, otherwise Trump would have been tagged as blond man; it can draw him because the VLM can recognize him.
So it wasn't intentional, they left in what the model could recognize. If it recognized your celebrity it'd be properly tagged and would be drawn fine, but they didn't care either way.
>>102975292
yep, the few it recognizes clearly show it was never intentional
what was intentional was the erasure of all porn from the dataset (or as much as they could)
I have ugly test results, but I'm not posting them, since that's only annoying.
>>102975230Using 2 LyingSigmaSamplers
>>102975512Enjoying the series.
2:17:34 to generate a mochi video... that hurts
I think even I could've done a better job than this AI upscale.
>>102975626Well, let's see it
>>102975500put them in a nice grid/diagram and post
>>102975993it's shit
>>102976005We cross our fingers promise not to make fun of you.
I probably should play games some instead of just using Flux.
Is there any way to get comfyui to utilize 2 gpus for batch-genning?
>>102976123brap
>>102976123
>>102976193
>>102976123pov: I took my glasses off to read the wine list, a hooker finds out I won craps
>>102976207
>>102976154not in the way you want it to
>>102976005Now I want to see it even more
>>102976234
>>102976343I knew something interesting would happen with one of the settings.
using this lora
https://civitai.com/models/889194/gigachad-and-poses-illustriousxl-and-noobai?modelVersionId=995038
>>102976470What is Noob?
>>102976470
nice
>>102976498
https://civitai.com/models/833294?modelVersionId=968495
>>102976498
https://civitai.com/models/833294/noobai-xl-nai-xl
further illustrious finetuning that has more h100s thrown at it
>>102976470Prompt?
>>102976520
masterpiece, best quality, absurdres, kubo tite, 1boy, flying kick, midair, gigachad \(meme\), pinstripe suit, (beard:0.75), full body, black shoes, wristwatch, cityscape, solo
worst quality, censored, sketch, artist name, multiple views, lowres, flat color, (muted color:0), amputee, jpeg artifacts
https://files.catbox.moe/23odjc.png
>>102976507>>102976503Looks pretty cool. It's for 2D mostly?
still don't know what the hell this thing is, am i doing it right
>>102976590I don't know of documentation stating the order.
man, it's weird going back to sdxl. I tried a few new models yesterday and the prompt adherence is utter dogshit. Getting what you want requires 10 gens and a lot of luck
does anybody know of a good comfy workflow for FLUX inpainting that lets me paint inside comfy and not upload an external mask?
>>102976751right click an image node and select "open in mask editor"
>>102976533art
>>102974813retard here. I am new to this. I downloaded localai and bunch of models. My questions how do I generate porn from a set pictures I collected? I want to generate a specific person
>>102976922maybe try youtube something like "how to train a lora on civitai"i think civitai supposedly makes this easy. i haven't tried it myself though
All the kids in the last bred talking about>ai isn't "real" art
>
https://civitai.com/articles/8322
Merge a Lora into Flux for better speed and quantize it.
It's a pretty short article but tell me if I'm making some obvious mistake here.
>>102976901giwtwm
>>102977363no ones figured out lora quants?
>>102977659Loras can already be very smol, why quant them?
>>102977689i guess it wouldnt make them load any quicker, negating the need to bake them in
Sloppa go brrrr
>>102977883>SloppaHow? I think the quality is decent. Are you a luddite?
>Under the ultraviolet pink and blue glow of holographic advertisements, a young Russian teenage model girl in a pink plastic latex outfit and pink knee high boots poses in front of a futuristic luxury sports car on a city street, filled with pulsing neon lights. The camera captures her full body and beautiful faceI guess I can work with late teens
/ldg/ - 1girl general
1girl is All You Need
need 1gf
>>102977925>posts 1boy
is there a GUI that supports dreambooth and Pixart Sigma at the moment? I am aware of the script but cbf
Not bad, but 25 min to generate this.
If I was a cool rapper and had 1russianteengirl and 1sportscar irl I wouldn't need to generate them
>1girlOnly Sana can save this sinking ship
>>102978523
Let me see the images you're producing. Oh wait...
in Flux, some loras don't like mixing. Not sure if I can make it work.
If you're tired of 1girl, I might have some 2girls in the back
>>102978787
>>102978827Adjusting the LyingSigmaSampler.
>>102978523The sketch Flux lora is quite decent.
>>102978869
Overwhelmed by the sheer number of loras. We eatin good tonight fluxbros
>>102978803She clearly needs glasses lol, look how blurry everything is!
What is the conclusion about Sana? A failed launch?
>>102979090I agree. What kind of glasses should she wear?
>>102979017
>>10297913199% human :^)
>>102979108How about blue blockers?
>>102979181You got it
>>102979181
>>102979216>>102979225VERY cute lol
>>102977894I agree
>>102979345I'm glad you liked them
>>102979374Trying out loras
>>102979389
>>102974813Where's the nude models?You jokers are constantly edging.
>>102979533/g/ is a blue board, you won't find nudes here. look in the "related boards" section in the OP
>>102979533You mean a woman in a flowing garment? Very tasteful. Good thinking anon.
>>102974813Is there anything like VLLM for imagegen, optimized bulk inference with continuous batching?
>>102979701Having fast inference with multiple GPUs is unsafe.
>>102979896…no…
>>102976081
>>102976300
It was body horror I deleted, this is another one generated when I went to sleep.
163 frames
24 fps
848 x 480
200 steps because why not
are you splitting your sigmas?>picrel
i always see people commenting about depth maps for hands in blender, so you dont have to inpaint so muchhow do i learn that?
>>102975177
it depends on your scheduler. There are two (very oversimplified) basic types of schedulers: ones that converge to a pic, where increasing steps will eventually do nothing, and ones that will give you a new pic depending on ranges of steps. This is mostly a sampler video, but it should help. https://www.youtube.com/watch?v=-GXJDz8i-Wo
>>102979104
lost in the flood of model releases. I see it as similar to turbo/lcm models, which fell out of popularity since nobody wants to do a full second pass.
>>102980269
in blender? Post link.
Look into graphormer. It is okay.
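On the "how do the denoise steps work" question, a toy sketch of the usual split: the scheduler picks the list of noise levels (sigmas), and the sampler is the loop that walks down that list, asking the model for a denoised guess at each level. This assumes a plain Euler step and a linear schedule on a single number instead of an image; `linear_sigmas`, `euler_sample` and the constant toy denoiser are made up for illustration.

```python
def linear_sigmas(sigma_max, steps):
    """Toy scheduler: steps + 1 noise levels from sigma_max down to 0."""
    return [sigma_max * (1 - i / steps) for i in range(steps + 1)]


def euler_sample(denoise, x, sigmas):
    """Toy sampler: one Euler step per pair of neighbouring noise levels."""
    for sigma, sigma_next in zip(sigmas[:-1], sigmas[1:]):
        denoised = denoise(x, sigma)      # model's guess at the clean value
        d = (x - denoised) / sigma        # direction pointing away from that guess
        x = x + d * (sigma_next - sigma)  # move to the next (lower) noise level
    return x


target = 0.25


def toy_denoise(x, sigma):
    # Stand-in for the model: always predicts the same clean value.
    return target


result = euler_sample(toy_denoise, x=14.0, sigmas=linear_sigmas(14.0, 20))
print(result)
```

With a deterministic step like this, more steps only refines the same path, which is the "converges to a pic" behaviour. Samplers that inject fresh noise each step are the ones that tend to give a different pic as the step count changes.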
https://github.com/kijai/ComfyUI-MochiWrapper/commit/f29f7397078b988110b82b85f135acc932a4c7ee
so support cublas_ops with GGUF
pretty big speed boost on 4090 at least, needs this installed:
https://github.com/aredden/torch-cublas-hgemm
CUBLAS INFERENCE:
FLOPS: 274877906944
TFLOP/s: 305.801
TORCH INFERENCE:
FLOPS: 274877906944
TFLOP/s: 166.989
gpt4o suggests it's bf16 mochi model only but idk.
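Quick arithmetic on those two numbers, assuming both lines measure the same operation (the FLOPS count is identical, so that seems likely). This only compares the benchmark figures as posted; it says nothing about end-to-end gen time, and the helper name is made up.

```python
flops = 274_877_906_944   # same count reported for both runs (equals 2**38)
cublas_tflops = 305.801   # posted cublas figure
torch_tflops = 166.989    # posted torch figure


def milliseconds_for(flops, tflops):
    """Time for one run of the benchmarked op at the given throughput."""
    return flops / (tflops * 1e12) * 1e3


speedup = cublas_tflops / torch_tflops
cublas_ms = milliseconds_for(flops, cublas_tflops)
torch_ms = milliseconds_for(flops, torch_tflops)

print(f"speedup: {speedup:.2f}x")
print(f"per-op time: {cublas_ms:.3f} ms vs {torch_ms:.3f} ms")
```

So roughly 1.8x on the benchmarked op itself. How much of that shows up per step depends on how much of the model's time is spent in those matmuls.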
>DEPRECATION: Loading egg at X is deprecated. pip 24.3 will enforce this behaviour change. A possible replacement is to use pip for package installation. Discussion can be found at https://github.com/pypa/pip/issues/12330
Can they please stop screwing with pip and fix actual issues. ahhhhhhhhh
>>102979377very cool
My Flux dedistilled gens are absolute garbage now and I don't know why, it's driving me insane.
>>102980904A few days ago for comparison. Please help me
>>102980909You pull?
Ok I'm mostly happy with the age ranges time to try and make them into robots with pieces from that one anons robot angel prompt
>>102979104They could come back with a less compressed AE
>>102980879catbox?
>>102980934I pulled...
remember when the "trick" to giving Flux soul was to set negative guidance to 10?
>download illustrious xl
>download example image with comfyui metadata
>drag into comfyui so the settings are the same
>queue prompt
>receive garbage
what am I doing wrong here
surprisingly cohesive
>Under the ultraviolet pink and blue glow of neon lights, two Russian teenage model girls with Cybernetic enhancements, machine made joints, mechanical limbs and blood vessels connected to tubes, wires and cables attaching to neck, wires and cables on head, science fiction, white knee high boots, walking together in an alley. The camera captures their full body and beautiful faces.Early in the gens it looked like it was just making normal girls until the last few steps
I really shouldn't have laughed at this so much.
>>102980230What made it generate a grid?
>>102981190post catbox of the image (and a link to where you found it) so anon can see whats going on
>>102980176Nice, more physics tests please
>>102981334It's just the first image in illustrious xl's example gallery, the pink and blue miku
>>102981263>>102981289Great stuff man
>>102980879im so lonely bros...
the voldy rentry was deleted. this makes me sad
>>102981361
looks like he used auto or a fork, you won't get the same gen if you use comfy (without tweaking, if you can get close) unfortunately
>with comfyui metadata
comfy CAN load auto gens as workflows but the two programs operate differently enough that the output won't be 1:1 outright
>>102981853Oh I didn't realize comfy would do that. Thanks
>>102981872this https://github.com/BlenderNeko/ComfyUI_ADV_CLIP_emb will get you closer to replicating auto gens but IMO you'd be wasting your time. might as well simply load up auto if you want that exact image
>Tried using flux
>Crashes the CMD right away with a python.exe report
Damn, guess it's not meant to be.
>>102981780
Cozy saturday
>>102977861this one's cute
>>102982029a year ago this would have been incredibly difficult to do... upside down without it being all fucked up and shit? forget it
>>102977371god i want this white meat
>>102982061true, things have improved a lot
Should I tell them?
>>102982155tell them what
>>102981391
>Great stuff man
thank you anon
>>102982078
>things have improved a lot
Stable diffusion is barely 2 years old
2 years from now video evidence will no longer be admissible in court
>>102982219>half-decent splitsNice. Flux? If so it's way better than any of my brief attempts
>>102982193The thumbs are on the wrong side.
>>102982219This one put the thumbs in the right place. idk hwat kind of demon toes those are tho
>>102981821
These hands are correct.
flipped hands:
>>102981380
>>102982029
Can I run 2 parallel instances of comfy, each one dedicated to a gpu?How do I do that?
>>102982331yeah it's flux>>102982564huh, true
>>102981263>>102981289Very cool.
>>102982626nevermind I think it's this : --cuda-device x --port xx
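A small sketch of what that looks like for two cards, assuming you start one ComfyUI process per GPU with the `--cuda-device` and `--port` flags mentioned above. The script name `main.py`, the base port 8188 and the helper names are assumptions for illustration; adjust to your install.

```python
import sys


def comfy_command(gpu_index, port, main_py="main.py"):
    """Build the launch command for one instance pinned to one GPU."""
    return [sys.executable, main_py,
            "--cuda-device", str(gpu_index),
            "--port", str(port)]


def commands_for_gpus(gpu_count, base_port=8188):
    """One command per GPU, each on its own port so the UIs don't collide."""
    return [comfy_command(i, base_port + i) for i in range(gpu_count)]


if __name__ == "__main__":
    for cmd in commands_for_gpus(2):
        print(" ".join(cmd))
        # To actually launch, run each command from the ComfyUI folder,
        # e.g. with subprocess.Popen(cmd, cwd="/path/to/ComfyUI").
```

Each instance keeps its own queue, so batch-genning across both cards means sending half the jobs to each port rather than one instance splitting a batch.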
>>102982200everything will be signed by the hardware
>>102982735very nice
>>102982901thanks
A flower for your thoughts
>he pulled
>>102982471>>102982331Could you make them with more natural lighting? Would be interesting to see how it handles metal to skin tones
huh, the 8B Flux @ 768px can be full finetuned in kohya without the block swap
this is interesting, faster per step than SDXL too
>>102983118I couldnt help myself
>>102983290meant for >>102983000
>gen a couple hundred images
>realize im using the wrong vae
oh
"Any sufficiently advanced technology is indistinguishable from magic” - Arthur C . Clarke
fp8 mochi 100 steps
https://files.catbox.moe/mfhvnr.mp4
Mochi support:
https://github.com/comfyanonymous/ComfyUI/commit/5cbb01bc
>>102983118>Could you make them with more natural lighting? Would be interesting to see how it handles metal to skin tonesLooks like the existence of robot body parts implies neon lights because I didn't prompt for it but it's still there>Two Russian teenage model girls with Cybernetic enhancements, machine made joints, mechanical limbs and blood vessels connected to tubes, wires and cables attaching to neck, wires and cables on head, science fiction, white knee high boots, walking together in an alley. The camera captures their full body and beautiful faces.I'll try a natural sunlight prompt to maybe force the lighting to be natural
>>102983934do quantized weights exist?
>>102983859>EmptyMochiLatentVideo node for the latent.Does this mean vid2vid or img2vid?
>>102984062
https://huggingface.co/Kijai/Mochi_preview_comfy/tree/main
(yes)
>>102984092Not yet because they have not released the encoder part of the VAE yet.
>>102984062
>do quantized weights exist?
Yeah fp8 and GGUF and even a 4bit
But that's for the 480p model. I'm using the website's model which is presumed to be the unreleased 720p model
>>102984137Wouldn't that just be in the github repo somewhere of the model they have released?I'm confused by my ignorance.
>>102984191They released the decoder weights and code but nothing for the encoder. VAEs can both encode to latent space and decode from latent space, they are missing the encoder part so anything that requires the VAE Encode node will not work.
>>102984229ok, i get it, thanks.
>>102984148Looks pretty cool
new pixelwave flux checkpoint is out btw
https://civitai.com/models/141592/pixelwave?modelVersionId=992642
>>102984418What's the purpose?
>>102984527
seems to be his first flux finetune that's not simply merging loras into a checkpoint
>I fine tuned version 03 from base FLUX.1-dev for over 5 weeks on my 4090. It is able to do different art styles, photography, and anime.
>>102984541But can't Flux already do those?
>bright sunlight cyberpunk kinda cursed desu
BTW mochi can do nsfw decently well.
>>102984672A big poo 4 u
didn't know you can apply LUTs in comfy
>>102984672sorry to ask, but how is this different from Kijai wrapper? better speeds?
>>102984734Proper integration so you can use the regular sampler nodes, etc... with it.
>>102984418Booba status? Buttchin status?
>>102984781check the gens on civitai
>>102984560>>102984541Genuinely asking, I never did the whole sd thing, I only recently got a good gpu (6950 xt, slow but 16gb vram).
>>102984840Can you do a dragon?
>>102984776ok, thank you sexy beast
>>102984938
>>102984938i think i'll have to add some fur
>>102985019Neat!
>>102984776Drop a workflow dawg, I'm a retard
>>102984952oh damn
>>102985186
>>102984418Was it trained with actual artist names though?
>>102985420wait, this is actually neat.I thought it just made monstrous softcore.
>>102985420>>102985475Any idea if my 6950xt can run it? previous gen AMD gpu, I got it because it's the best gaming value, if you don't use rt. It runs Flux dev about as fast as quants, though it can't fully load (I have 64gb of great system ram tho).
>>102985468if its anything like the previous versions it was trained on a ton on synth slop
>>102984418>Your actions, big or small, can create ripples of positivity. Let's make the world a better place, one kind gesture at a time. Thank you for your support! I hope you have a wonderful day! Haha okay
>>102985420Hangs on vae decoding for me. Mochi wrapper works with vae tiling
>>102985502My ADA 6000 is struggling so if it even works on AMD it's going to be extremely slow.>>102985591yeah that needs to be optimized.
>>102985617Cute focks girl gens.
>>102984672Care to post an example?
does anyone here know how to do this?
>>102985617>ADA 6000cool
>>102985762>dog looks like a deepdream dog
So when is Illustrious XL 0.2 coming out?
>>102986034It's already here and they call it "NoobAI-XL"
>>102979688Nice, catbox?
>>102985762inpainting, batch size > 1
waiting room
we're almost at the point where we need advances in compute (H100s at home) more than we need better models. almost.
>>102985762That's playground AI canvass I think.Pretty sure comfyUI does it better
>Decide to have another go at SD after a long hiatus
>Whip out ye olde a111
>Would rather die than use comfy
>But flux doesn't support a111
>Nothing does
Fuck me, man. Does comfyUI have some kinda un-Appleslop button that can just make it look normal?
>>102986733
A1111 is all but dead, use ForgeUI. It's a fork of A1111 and supports Flux
>>102986767Is there a type of Flux model I should be aware of if all models don't support Forge?
>>102986787I'm not really sure what you're asking but Forge should support all Flux models.
>>102986855Was asking if some models only work in Comfy. These files are pretty big. Thanks.
>would rather die than use comfy ok but why? It takes like an hour to learn
>>102986910Couldn't find a good tutorial for graph/nodes/flowchart.
>>102986910NTA but inpainting and adetailer
>>102986957Inpainting sucks on Comfy but it has an adetailer equivalent.
>>102987003it does but an a1111 type workflow makes alot more sense for refining gens than comfy
>>102987003
adetailer is way superior to the detailer from impact pack. The detailer node forces the user to use its own ksampler, which sucks ass and is outdated, especially if you are using a samplercustom node or workflow: you can't replicate the ksampler used in your workflow in the detailer node. I've seen some open issues about it on its github page but the dev just ignores them, adds an "improvement" tag and doesn't give a shit. It's like the dev of impact pack forces the user to use its own shit instead of native comfyui stuff that is already done.
adetailer just seamlessly adds itself to every setting or workflow that you may have in webui/forge
okay maybe i try again to get forge working
>>102985750https://files.catbox.moe/uu6ypi.webmThe weird colours are because the tiled vae sucks right now.
Reminder than forge has some obfuscated code that sends your prompts to a chinese server: https://github.com/lllyasviel/stable-diffusion-webui-forge/pull/2151
>inpaintingFair enough. It just kind of sucks that a1111 and comfy handle prompts differently so even with the same settings and prompt and seed you get two different outputs >>102987086those tits are being subject to rollercoaster tycoon tier G-forces ouch
>>102987109Based. How can I set that up in comfy? You got a workflow?
>>102983853
Man, even at 100 steps it's shimmering so much, kind of sad.
>>102986653
I'd be ok with the compute of a 4090 if it has the vram of an h100 lol.
>>102987086
>The weird colours are because the tiled vae sucks right now.
All my gens look like bad VHS because of that, it's annoying.
>retards be like "nooo SD3.5 bad, only Flux is good, forever"meanwhile 3.5 can straight up generate close-to-perfect lesbian porn that actually looks properly photographic out of the box, with only very minor anatomical issues:https://files.catbox.moe/emmep6.jpghttps://files.catbox.moe/hx2z6l.jpgPrompt was:"A high-resolution professional photograph of two incredibly attractive young women kneeling on a bed and facing each other. Both women are completely nude, and they are kissing passionately. The woman on the left is Caucasian with blonde hair, while the woman on the right is African-American with brown hair. The lighting of the photograph is soft and appears designed to highlight the women's anatomy, suggesting the image is intended for use on an adult website."
>>102987212so did they stop caring about muh safety or are they incompetent >no metadata coward
>>102987227
>so did they stop caring about muh safety or are they incompetent
They just censored fewer nudes vs whatever flux used.
Both probably cleaned most nsfw/porn from their dataset.
They still used a bad vlm so a lot of bad captioning led to most of anything artist, celeb or porn/pose related being completely unknown concepts for it.
>>102987212looking forward to sd3.5 medium
>>102987212>lesbian porn>two nude women kissing
>>102987212Simply don't care until it can do higher resolutions than 1MP
>>102987254you can upscale with a tiled vae until someone figures out a way to fix it
>>102987250i mean do you define it as like, "only ultra-closeup shots of cunnilingus" or something lol>>102987254SD 3.5 Medium specifically supports 0.25 to 2MP according to them, I really think it's going to turn out to have expectedly worse complex prompt adherence but subjectively better "image quality" in the eyes of Average Joe Diffusionman>>102987227I genned it while playing around with CivitAI's new onsite support for SD 3.5 actually:https://civitai.com/images/36765083
>24hr threadwhy did everyone leave?
>>102987350tech happenings make anon post no tech happenings make anon lurk
>>102984781undefeated
>>102987350
Mochi was a letdown
SD 3.5 was a letdown
We were so back for a little while but now it's back to being so over
Sana2 soon
wake me up when Sana3
>why did everyone leave
Hardware requirements and gen times for SOTA models went up, and SOTA models are all censored right now too
People have also just gotten bored of GenAI over time unless they're working towards something larger or have an interest in a niche that can only be created with AI
>>102987357SD 3.5 is only a letdown if you've come to expect base models to behave exactly the same as [Overfit Finetune Of Your Choice]. Not being distilled across the board even is a major plus, like gen ANY woman with SD3.5 Large and then do the same with SD 3.5 Large Turbo on the same seed, you'll immediately see how what we've come to call "Flux Girl" is actually just "Distillation Girl".100% of the things people don't like about how Flux looks are directly and specifically related to both Dev and Schnell distilled, anyone claiming otherwise is dumb
>>102987385but flux dedistill still makes buttchins anonthere is no escape from the buttchin>KG0YJ0
*being distilled, that is
>>102987393
yeah cause it's not a proper de-distill, which is essentially impossible. De-distills are *doable* in the same way upscaling an image with ESRGAN Model XYZ is, you might get pretty good results in some cases but it will always be quite lossy no matter what you do.
I’m a 1girl post respecter btw.
>>102987393
doubt all the "dedistills" are binary distilled/dedistilled poof switches
bfl probably trained on a larger dataset and it's not like they are gonna recover the original model weights 100% from the distillation by training on a much smaller dataset and fewer epochs/steps
>>102987435the 1girl is illusory
>>102985617
what model are you using for those gens?
>tries to post
>forgets about timer
>timer resets
>repeat
damn
I just want to know how did buttchins become so ubiquitous with ai women? A cleft chin is a pretty rare trait on a woman, where did they find all these pictures of Popeye lookin gals to train on?
>>102987393>but flux dedistill still makes buttchins anonnta but I find that putting "chin" in the negative typically fixes the chins.
>>102987357
>SD 3.5 was a letdown
Nope, people are waiting for the smaller SD3.5M version before choosing what version to focus on.
But I still think that the best architecture by far is OmniGen. I wouldn't be surprised if all SAAS become like OmniGen in a few months (unless they can't filter the input/output).
>>KG0YJ0
>>102987514>smaller SD3.5MIs it because it'll be smaller so it would run on more hardware, or just because it will have something special to it?
>>102987512it's only an issue with flux, and no idea why
>>102987350>why did everyone leave?im glad youre here, anon
>>102987560
You see it in a lot of SDXL fine tunes like Dreamshaper as well
>>102987512
Flux didn't release the base/foundation/pretrained model but only the post-trained model. And they happened to post-train it to give a certain "style" out of the box. That's the issue.
The image model field really needs to label their models better like they do in the LLM field.
>>102987560it's because all open-weight versions of Flux are distilled, like I said, literally the SD 3.5 Large Turbo default lady looks VERY much like Flux Woman, whereas this is not the case for 3.5 Large regular version.
>>102987553
The architecture is different:
https://medium.com/diffusion-images/stable-diffusion-3-5-debuts-in-3-variants-large-turbo-and-medium-run-them-in-comfyui-ce760d7fab74
>>102987600So the distilled version sets a "style in stone", or makes the model be more rigid/autistic about what a typical concept like "woman" would look like?Is this why it's nowhere near as varied as what I could get with the same prompts on DALLE?
>>102987612The interesting part seems to be behind a subscription wall... anyway, I'm hoping the model being 2.5B will not be too disappointing.
>>102987615>>102987600I don't think that's how distilling has to work? The non-distilled, full Flux Pro model should be giving a default style as well. That's ideally HOW a final production model for casual users should work, since you want it to give aesthetic results without complex prompting or settings, just like how for LLMs, you want to expose the Instruct models to users, not the base model which gives more schizo, random outputs to casual unskilled user inputs.
>>102987615Distillation basically dumbs the model down to a "reasonable default" while drastically reducing output variety yeah, to allow for decent-enough outputs in a low number of steps. Presumably the overall dataset used by SAI and BFL had enough in common that this meant distillation lead to a very similar looking "default lady" in the distilled versions of both their models.
>>102987560
the example resolution in the comfy workflow for mochi is 848x480, is that the recommended resolution? or can I go 4/3 or 16/9 instead or phone style?
>>102987643I see, thanks for the explanation.>>102987651I wonder how the hell dalle did it then (outside of not filtering porn from the dataset).
>>102987643
Flux Pro DOES generate "normal" looking images that have no plasticity or bumchin inherent, very similar to SD 3.5 Large non-Turbo outputs. Attached is a Pro 1.1 output from the following prompt:
"a photograph featuring a young woman seated outdoors at a dining table. She has long, wavy blonde hair cascading over her shoulders and is smiling warmly at the camera. Her skin is fair, and she has blue eyes. She is wearing a blue, sleeveless dress adorned with small white floral patterns and ruffled shoulder straps. Around her neck, she wears a delicate necklace with a small pendant. The background consists of a plain, light gray concrete wall with a green, neatly trimmed hedge running horizontally just below the wall, providing a natural contrast to the urban setting. The dining table is set with several clear wine glasses, some filled with white wine, and a few empty glasses in front of her. The table itself is dark-colored, possibly a dark blue or black fabric. The overall atmosphere suggests a casual, yet elegant dining experience, possibly in a trendy urban restaurant or café. The lighting is natural, indicating that the photo was taken during the day. The composition of the photograph centers the woman, making her the focal point, while the background elements provide context without overwhelming the image."
>>102986056It's mostly the Monet lora. You have to put this in front: a painting by Claude Monet depicts
>>102987668We barely know what the actual model they're running under everything really looks like, as far as Dalle goes. It's pretty overtly run through a heavy post-process pipeline to add that distinct cartoonish look that always tends to remind me of the over-the-top way ambient occlusion was implemented in Far Cry 3.
>>102987651Is that the standard method in the image gen world? In the text gen world, usually what people mean by distillation is still essentially a pretraining run on the full dataset but with less epochs basically since the distillation method (using the logit distribution of the parent model) allows it to learn faster.
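For reference, a minimal sketch of what "using the logit distribution of the parent model" can look like for one token position, assuming a plain KL divergence between temperature-softened teacher and student distributions. The function names, the temperature and the example logits are made up; real training recipes differ in the details.

```python
import math


def softmax(logits, temperature=1.0):
    """Turn logits into probabilities, softened by a temperature."""
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of the same length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def distill_loss(teacher_logits, student_logits, temperature=2.0):
    """How far the student's distribution is from the teacher's."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return kl_divergence(p, q)


print(distill_loss([2.0, 1.0, 0.0], [2.0, 1.0, 0.0]))  # identical logits
print(distill_loss([2.0, 1.0, 0.0], [0.0, 1.0, 2.0]))  # mismatched logits
```

The student gets a full distribution to match at every position instead of a single correct token, which is the usual argument for why it can learn faster than plain pretraining.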
>>102987698my explanation was a big oversimplification but it's an accurate summary of the approach and practical result i'd say, yeah
New >>102987712>>102987712>>102987712
>>102987707>>102987680If that's the case, I wonder if Flux Pro can actually do other styles like Picasso significantly better. Logically I feel like they'd still want it to have a default style though for the reasons I described (reducing schizo outputs), but perhaps it does, just not to the degree of bias in Flux dev. So then maybe it should be called "partial" distillation. Things are quite different in LLM land.
>>102987472These fennecs are made with that noob vpred model.
>>102987785How hard is it to prompt noob?
>>102987810it's just danbooru tags + some e621, zero natural language though so you can only specify what's in the image, not where things are
>>102987836Is there a danbooru taxonomy? like a browsable tree?
>>102987880there's the danbooru tag wiki and this https://danbooru.donmai.us/related_tag
>>102987992
>https://danbooru.donmai.us/related_tag
NTA pretty useful link thank you