/g/ - Technology



File: the longest dick general.jpg (2.66 MB, 3264x1322)
Discussion of free and open source text-to-image models

Previous /ldg/ bred : >>102949176

Two Hundred Steps Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://tensor.art/models
https://liblib.art
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3

>SD3.5
https://huggingface.co/stabilityai/stable-diffusion-3.5-large
https://replicate.com/stability-ai/stable-diffusion-3.5-large

>Sana
https://github.com/NVlabs/Sana
https://8876bd28ee2da4b909.gradio.live

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
the skinning alive of all 1girl posters
>>
Blessed thread of frenship
>>
>>102956911
Oof, those thots are on point
>>
>>102956931
what, you don't like girls?
>>
>>102956971
men only hobby
>>
posting in the blessed thread
>>
File: 00089-2293008652.png (1.16 MB, 832x1216)
>>
>slops up a 1girl
>doesn't give her armpit hair
>>
File: file.png (2.51 MB, 1024x1024)
>>
>>
File: file.png (101 KB, 1024x1024)
>>
File: file.png (1.87 MB, 1024x1024)
>>
>>102957113
what model is that?
>>
>>102957162
Everyone's favorite unreleased model
>>
>>102957175
had a feeling, it looks awesome, mind if i have the prompt?
>>
sana-samas will rise a-gain
>>
File: file.png (2.01 MB, 1024x1024)
>surreal picasso painting, surrealism, post modernism, "The Land is the Lie", Planet Earth
>>
File: file.png (1.56 MB, 1024x1024)
>>
>>102956516
ran another comparison.
fp8
https://files.catbox.moe/drqbg5.mp4
ggufq4
https://files.catbox.moe/mgr9xg.mp4
q4 is just struggling with clarity at this resolution
Others with more experience may get better results.
>>
>>102957237
anon, if your videos are in slow mo, that's because you set 8 fps on the node at the far right. Mochi is a 24 fps model, I say that just in case you don't know
>>
>>102957257
you're right but i was just testing for clarity, I did forget to adjust the speed in the combine.
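For reference, the slowdown is just the ratio of the two frame rates. A quick sketch of the arithmetic; the 73-frame count is only an illustrative number, not something read from the workflow above, and the function names are made up for the example:

```python
def playback_seconds(frame_count: int, fps: float) -> float:
    """Duration of a clip when frame_count frames are played back at fps."""
    return frame_count / fps


def slowdown_factor(native_fps: float, export_fps: float) -> float:
    """How many times slower a clip plays when exported below its native rate."""
    return native_fps / export_fps


frames = 73  # illustrative frame count (assumption)
print(playback_seconds(frames, 24))  # duration at the native 24 fps
print(playback_seconds(frames, 8))   # duration if the combine node is left at 8 fps
print(slowdown_factor(24, 8))        # ratio between the two
```

So a clip exported at 8 fps from a 24 fps model plays three times slower than intended, which is what shows up as slow mo.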
>>
File: file.png (1.55 MB, 1408x1024)
>>
any1 know some good chinese artists? maybe sana has lots of that in the dataset and would look really cool
>>
File: file.png (1.9 MB, 1408x1024)
>>102957320
I'm pretty sure the model isn't Asian biased, Pixart wasn't.
>>
File: file.png (968 KB, 1408x1024)
>>
>>102957160
nice
>>
File: 00109-3137549724.png (1.41 MB, 832x1216)
>>
File: 00118-3955540451.png (1.11 MB, 832x1216)
>>
>>102956911
cool collage
>>
File: 00123-2442433168.png (1.13 MB, 832x1216)
>>
File: dc-ae_mix_f32c32.png (1.92 MB, 1024x1024)
here is your "not the be-all-end-all of a model" and a "very superficial requirement" bro
https://slow.pics/s/DIDxrbQx
>>
File: 00010-3453933645.png (1.08 MB, 1440x1440)
>>
>>102957943
yikes, none of the VAEs show significant differences from the original except the ultra compressed one, weird huh? :^)
>>
>>102957943
actually seething
were they replacing Flux with Sana or something?
>>
>>102957948
great dark tones
>>
>>102957943
Flux's VAE is fucking amazing, it's almost like there's zero difference with the original
>>
File: 00126-3781302863.png (1.26 MB, 832x1216)
>>
File: file.png (2.4 MB, 1408x1280)
>>
File: file.png (1.82 MB, 1024x1024)
>>
>>102957943
damn wtf
>>
File: file.png (1.87 MB, 1024x1024)
>>
File: file.png (1.93 MB, 1024x1024)
>>
File: file.png (2.04 MB, 1024x1024)
>>
>>102958208
Have you tried describing the background more accurately? Might remove some dof
>>
i hope they read my email :3
>>
File: file.png (2.02 MB, 1024x1024)
>>
File: file.png (1.53 MB, 1024x1024)
>>
File: file.png (1.51 MB, 1024x1024)
>>
File: rebuilded that shieeete.jpg (503 KB, 3418x1887)
>>102957337
>I suggest you use a workflow that works for you and then reconstruct everything from it to get it working
I did it and it works

also any of you dudes got any idea how can I improve on these settings to get a better image?
>>
>>102958375
wait what? guidance at 0? cfg at 2? mimic_scale at 3?? What the fuck is going on? lmao
>>
File: 00137-3672626443.png (1.19 MB, 832x1216)
>>
gentle reminder to send the sana team an email with any criticisms/advice you'd like them to read >>102956249
if you are a 1girl slopper please ignore this post, an iq above 100 is required to email them.
>>
So Mochi was a big fuckin let down huh
>>
>>102958410
>So Mochi was a big fuckin let down huh
I still suspect they used the HD model for the demo. Once we get that we can draw the real conclusion, 2 weeks
https://www.genmo.ai/blog
>Today, we are releasing our 480p base model, with Mochi 1 HD coming later this year.
>>
>>102958409
doubt they would seriously consider feedback regardless
>>
File: file.png (1.79 MB, 1024x1024)
>>
>>102958409
there's no way that will go well
>>102958437
are you a bot?
>>
>>102958454
>are you a bot?
tf you talk about?
>>
File: ComfyUI_02516_.png (1.27 MB, 1024x1024)
>>102958387
I dunno lol
>>
>>102958438
>>102958454
wouldn't hurt to try, just don't be an asshole like this guy >>102958275 who sent them erp chatlogs
>>
>>102958499
the guidance must be at 3.5, the mimic scale must be at 1, and the cfg must be higher than the mimic scale, c'mon anon lol
>>
File: rebuilded that shieeete2.jpg (506 KB, 3313x1925)
>>102958515
ok how about this?
>>
>>102958606
why do you have guidance negative at 10?
>>
anyone have tips on SDXL models and its refiner?
should I just not use it?
is there a secret sauce behind it to make it work well?
>>
>>102958641
>refiner
nobody uses it
>>
File: 3141454719.png (1.76 MB, 896x1152)
>>
>>102957943
>expressive artistic model with a subpar VAE vs. rigid behemoth model with an excellent VAE
Why must the world be this way
>>
>>102958805
Gen higher resolution and downsample, fixes the problem. It's worse at low resolutions.
>>
File: 54191.png (672 KB, 665x1335)
>>
File: file.png (1.74 MB, 1024x1024)
>>
File: file.png (1.8 MB, 1024x1024)
>>
File: 00716-1221462262.jpg (660 KB, 1296x1728)
>>
File: ComfyUI_SD35L_0478.jpg (211 KB, 896x1152)
>>
File: file.png (1.96 MB, 1024x1024)
>>
File: giguballtorture.png (1.87 MB, 1552x1160)
>>
File: file.png (1.44 MB, 1024x1024)
>>
File: file.png (976 KB, 1024x1024)
>>
File: file.png (1.39 MB, 1024x1024)
>>
File: 00587-1256361378.png (1.61 MB, 1024x1024)
Is it over or are we back?
>>
File: file.png (1.31 MB, 1024x1024)
We were always back.
>>
>>102958641
The SDXL refiner fucks up your gen's composition. You should just do an upscale instead.
>>
>>102958910
nice, model?
>>
File: 1728789095902351.jpg (48 KB, 1067x946)
I am once again asking if there is a local music generator yet.
>>
Androgynous failed 1girl gen. Is it an ugly woman? Or did FLUX fuck up and give me a guy? Is it trans?

What do you guys think, male or female?
>>
>>102958635
No clue dude
>>
mochi gguf_q4_v2:
https://files.catbox.moe/ybtzi8.mp4

Noticeable improvement in clarity over v1.
https://huggingface.co/Kijai/Mochi_preview_comfy/tree/main
>>
File: ComfyUI_02499_.png (1.33 MB, 1024x1024)
>>
File: ComfyUI_SD35L_0556.jpg (166 KB, 672x1496)
>>
File: file.png (2.1 MB, 1024x1024)
>>
>>102958410
>So Mochi was a big fuckin let down huh
After doing around 70-80 gens on the website, yeah. Using mochi after trying kling and minimax feels like I'm gaslighting myself that something is wrong with my prompts but no it's just the model being shit.
We're gonna have to wait a bit longer for local video I'm afraid. I'm on vacation rn so I'll keep scumming gens every 6 hours during downtime until I hit the monthly limit on all accounts but I'm not expecting to generate even 1 usable video for the art project I want to make
>>
File: file.png (55 KB, 634x778)
>>102959543
well I've just been testing a bit with 32 steps on euler a to gen initially with 32 steps of DPM 3 and it doesn't seem to destroy the composition with that combo...
just not sure if its worth the config or not cause I was just using pure euler a before lol
basically just hunting for a way to sharpen up some details that euler a tends to muddy up once it gets a decent image ready
would try adetailer and such but I can't find anything for invokeai regarding that sadly
I will say that different settings on this refiner definitely like to fuck things up, so this setup might just be placebo at this point
>>
>>102959815
Noose is missing
>>
File: ComfyUI_SD35L_0578.jpg (284 KB, 672x1496)
>>
>>102960017
What did you prompt for the robot girl stuff, or is it a Lora?
>>
File: file.png (1.63 MB, 1024x1024)
>>
File: ComfyUI_SD35L_0580.jpg (229 KB, 672x1496)
>>102960069
>photo of a woman with cybernetic enhancements is sitting in a dark abandoned workshop, she has metal wings, machine made joints, mechanical limbs and blood vessels connected to tubes, wires and cables attaching to neck, wires and cables on head, science fiction
>>
>>102960017
>>102960127
Amazing
>>
>>102959693
Flux with a LoRA trained on game screenshots
>>
File: file.png (1.85 MB, 1024x1024)
>>
File: ComfyUI_04481_.png (1.59 MB, 1024x1024)
Oh shi...
>>
File: 00118-1853968631.png (1.3 MB, 1152x896)
>>
File: 1698558925156242.png (3.66 MB, 1344x1344)
>>102959798 (me)
>I am once again asking if there is a local music generator yet.
>>
>young tween girls
>anywhere from 6 to 24 with inverted bell curve probability
>young tween women
>always over the age of 20
At this point I want to keep using this shitty video model just to figure out how to wrangle ages consistently. I refuse to let it win

>>102960378
There is no good local music generator. Use Suno or udio
>>
File: 00015-4187201829.png (1.27 MB, 1152x896)
>>
File: ComfyUI_04495_.png (1.88 MB, 1024x1024)
Isekai'd by Truck Kun.
>>
File: 00136-1803988203.png (1.11 MB, 1152x896)
>>
File: 1721805764190029.jpg (91 KB, 1024x1024)
Can we get some more troons hanging themselves to counteract the outward-thinking pedophiles ITT?
>>
You Will Never Get A Perfect (or even good) Image Model
>>
no such thing as perfect
>>
File: 00006-2970898615.png (1.05 MB, 1152x896)
>>
>>102960740
I have several
>>
>>102959945
Nice. I guess Q8's going to be pretty good.
>>
File: ComfyUI_04515_.png (1.52 MB, 1024x1024)
>>
File: file.png (1.46 MB, 832x1376)
>>102960127
nta but nice
>>
>>102960437
>There is no good local music generator.
ty, maybe some day
>>
SD3 seems to be... Alright... Not huge leaps in quality, but the ability to finetune it seems like it will hopefully lead to some actually decent photorealism at some point. Too bad Astralite swore it off entirely
>>
>>102960878
Pony isn't going to release anything again.
>>
File: ComfyUI_SDXL_0193.jpg (2.21 MB, 2048x2048)
>>102960842
one of the default prompts i run on every checkpoint
>>
File: ComfyUI_04522_.png (1.56 MB, 1024x1024)
There is
a house
in New Orleans
they call
the Rising Sun...
>>
File: 1700983013418353.webm (917 KB, 1280x720)
Is Cog worth trying on a 3070ti or should I just pay China $10?
>>
>>102958851
If only we had it locally so we could do that...
>>
>>102960962
The demo works at 4K now, no more repeating.
>>
>>102960973
So it seems. Nice.
Still, release the model already.
>>
File: ohgodwhathaveIdone.webm (3.69 MB, 1280x720)
>>102960953
I wonder what kinda hardware this chinese video AI runs on
>>
File: 00027-2093664288.png (1.03 MB, 1152x896)
>>
File: file.png (1.8 MB, 1024x1024)
>>102960989
Yeah I want it
>>
I had to install the dev version of xformers to resolve dependency issues between mochi and cog. Anyone know what I am in for.

>>102960953
cog is nowhere close to this quality. Worth it? depends on your goals.

>>102960991
There is no way they are doing this stuff from scratch. It is i2i off a massive stolen clip database. As such hardware requirements probably aren't that bad.
>>
>>102960991
>I wonder what kinda hardware this chinese video AI runs on
H100s or something equivalent. Anything else would be even more minutes per gen.

>>102960953
>Is Cog worth trying on a 3070ti or should I just pay China $10?
CogVideo can't be used for any actual art project so the answer is maybe? Like the other anon said it depends on your goals. Remember to use the right tool for the job
>>
File: 1728818526785051.jpg (152 KB, 1024x1024)
>>102961005
>>102961017
I'm the baker from /mwg/ on /pol/
My goal is to generate propaganda
Thank you for the input
>>
File: 00032-3468706293.png (1.15 MB, 1152x896)
>>
File: 00041-4129377850.png (1.12 MB, 896x1152)
>>
>>102961056
catbox?
>>
>>102961056
Cool
>>
File: 00067-551555187.jpg (156 KB, 1024x1376)
>>
>>102957237
The ggufq4 is sadly too low.
>>
File: ComfyUI_02558_.png (1.3 MB, 1024x1024)
>>
>>102961524
trigger discipline in a generated image. I thought I would never see the day.
>>
>>102956987

ur gay
>>
>>102960991
How well does the minimax model generate tribal girls?
>>
File: LOLpasta44.webm (1.39 MB, 1280x720)
>>102961562
dunno man, the benchmark is will smith and the spaghetti, that's all I know
>>
>>102961513
Late here but mochi_preview_dit_GGUF_Q8_0.safetensors
was added an hour ago,
https://huggingface.co/Kijai/Mochi_preview_comfy/tree/main
>>
File: Mochi_00002.webm (180 KB, 856x480)
20 minutes...
>>
>>102960378
No, and even if there was, nothing would come even close to whatever udio has.
Their model is insane, they probably fed it every song under the sun, properly tagged.
>>
File: 1706298781701474.jpg (141 KB, 1024x1024)
>>102961726
>they probably fed it every song under the sun, properly tagged
Can confirm.
>>
>>102961608
That's...not bad.
What was the prompt? What version of the model did you use?
>>
>>102960953
>should I just pay China $10?
I'd pay if it wasn't censored crap.
>>
>>102961596
will try thanks
>>
File: 1724063511364942.jpg (144 KB, 1024x1024)
>>102961726
Are you on honkFM or 8/mu?
We have a memetic audio warfare thread there

> Sauce a material (thread, article, memoryhole)
> ctrl-c, ctrl-v into deepai.org, claude or ChatGPT
> "Please generate lyrics for a song about this"
> Adapt/shorten the prompt as needed; make it moar catchy/edgy
> Use [Chorus], [Bridge] and [Verse: <meme singer name>] for songs with multiple vocalists
> Try adding ex. [Banjo solo] for epic solos
> 3000 characters max for Suno, 1500 char for Udio
> ?????
> Archive bangers on Honk FM or 8/mu/
>>
>>102961579
my benchmark is bouncing boobs
>>
>>102961752
Nope I don't know about these, I'm a simple man, I just want 80s/90s/00s pop catchy/melodic songs while I work, and this does exactly that. It's so nice.
I'm not using 10% of what's available like custom lyrics, is there a general around where tricks are shared, like how to enhance their awful slow website?
>>
>>102961737
Two women wearing towels in a steamy sauna sharing a passionate kiss
fp8, 65 steps, 73 frames
>>
>>102961836
Thanks, I can go to bed now.
>>
File: 1724886810395543.jpg (105 KB, 1024x1024)
>>102961804
>is there a general around where tricks are shared, like how to enhance their awful slow website?
Just the dead 8kun /mu/ board that I 'own'
Post in the /mawg/ thread
We can share thoughts
>8kun/mu/catalog
>>
>8kun/mu/catalog
.top
>>
>>102961845
is this a groomer?
>>
>>102961918
yeah just ignore
>>
File: ComfyUI_02565_.png (1.3 MB, 1024x1024)
>>
File: ComfyUI_02567_.png (1.48 MB, 1024x1024)
>>
File: 00084-600704932.jpg (300 KB, 1024x1376)
>>
File: ComfyUI_02569_.png (1.47 MB, 1024x1024)
>>
Is there a list of danbooru tags compatible with Illustrious somewhere?

I swear I saw an anon compiling them some time ago.
>>
File: ComfyUI_02571_.png (1.49 MB, 1024x1024)
>>
>>102962177
Is this real?
>>
File: ComfyUI_02584_.png (1.6 MB, 1024x1024)
>>102962195
yea
>>
Can't make cunny loli with new models, why fucking bother
>>
>>102962749
might as well an hero
>>
>>102962749
skill issue
>>
>>102958775
Very detailed
>>
is there any sort of big analysis actually comparing adamw, prodigy, adafactor, lion, etc? I've finally arrived at good results using the finetune-extraction schizo's config, slightly tweaked
>>
>>102962749
A true cunnyseur should be able to enjoy even something as innocuous as a fully clothed loli licking an ice cream, which new models can absolutely do
>>
File: Image.png (2.48 MB, 2250x2436)
reminder to other photo 1girlers: we are not the same.
>>
>>102963050
I put a few Flux gens in this and it reads them as Midjourney kek
>>
>>102961608
Absolutely savage that one of them has a hook nose.
>>
>>102963050
just make a buttcheek detector
>>
File: 1725316800676688.png (2.1 MB, 1024x1024)
For some reason, on flux, using the multiply sigma thing (for details) at 0.95 makes every generation come out as a cartoon, I have no idea why.
Anyone else got the same thing?
>>
File: 987.png (1.65 MB, 1300x1671)
>>
>>102963169
What's your prompt?
>>
>>102963050
Well who cares if it's "likely AI generated"?
>>
>>102963225
If their retarded code can see it, your eyes can see it too.
>>
>>102959945
hehh that looks good! can't wait for Q8_0
>>
>>102960888
>Pony isn't going to release anything again.
this, he has no incentive to do anything, he's making a shit ton of money from pony-v6 now
>>
>>102960953
>Is Cog worth trying on a 3070ti or should I just pay China $10?
Cog isn't close, but we got something better now
https://reddit.com/r/StableDiffusion/comments/1gb07vj/how_to_run_mochi_1_on_a_single_24gb_vram_card/
>>
>>102963257
I think he's going to try to run out the clock waiting for someone else to make a Booru model and then disappear.
>>
>>102963238
I mean if your goal is to try and make AI gens that pass their detector you're doing it wrong mate.
>>
>>102963169
>multiply sigma thing
what
>>
>>102961596
>GGUF.safetensors
I wonder how he makes those, I wanna try the same format for flux dev, I have a feeling it could speed things up
>>
>>102961726
>Their model is insane, they probably fed it every song under the sun, properly tagged.
indeed, suno is the goat and can make really niche genres I enjoy like shibuya-kei, they probably went all in with all the good music that exists, as they fucking should
>>
>>102963266
Right, and a lot of us wouldn't mind if there were a way to add stego that basically said "I'm fake". I don't like watermarks.
>>
>>102961596
This is slower and uses more VRAM than fp8 for me
>>
File: file.png (32 KB, 1037x345)
>>102963268
he means picrel
>>
https://x.com/OpenAI/status/1849139783362347293
>We are sharing a new approach, called sCM, which simplifies the theoretical formulation of continuous-time consistency models, allowing us to stabilize and scale their training for large scale datasets. This approach achieves comparable sample quality to leading diffusion models, while using only two sampling steps.
Really interesting, that's a new sampler or something?
>>
>>102963283
Literally makes no sense what you're saying. Nobody outside this general really cares except for the starving artists.
>>
>>102963225
>>102963266
Surely you're not trying to imply that left looks more realistic than right.
>>
>>102963305
Just another turbo / lightning distillation method.
>>
>>102963324
Left looks better than the right, however. What right has going for it is its extremely high blur to cover any deformities.
>>
>>102963335
Not talking about subjective "better" but rather objective "looks like a real image"
>>
File: file.png (351 KB, 1991x1492)
>>102963332
but the results look way better on that one, like it could be good enough to make 2 steps the norm, just imagine that kino
>>
>>102963342
Dunno anon, left looks pretty real to me, and left is objectively better.
>>
>>102963360
Don't let me stop you from enjoying slop
>>
>>102963353
I'll believe it when I see it in production, many companies have game changers that don't get used and at the end of the day, lightning models are a non-starter because people like to use Loras so it's just something you might use on a generation server.
>>
>>102963374
It's quality slop though. Right pic is just far too blurry.
>>
>>102963268
>>102963302
use this instead, it's better and doesn't change the overall vanilla composition
https://imgsli.com/MzExNjYx
https://imgsli.com/MzExNjQ2
https://www.reddit.com/r/comfyui/comments/1g9wfbq/comment/lte0rdg/?utm_source=share&utm_medium=web2x&context=3
you can use this modified script to get more decimals and go for that -0.05 value
https://files.catbox.moe/4gxohm.py
>>
>>102963382
>quality
>slop
Pick one
>>
>>102963377
>many companies have game changers that don't get used and at the end of the day
the turbo distillation is definitely a thing though, it's been used by SAI and BFL (Schnell), so improving that turbo method means that Schnell could've been closer to Flux Dev for example. Imagine running something like Flux dev at 2 steps, the fucking dream man
>>
>>102963390
Dunno why you're seething so hard anon, just enjoy the pic for what it is
>>
>>102963407
Okay
>>
>>102963413
Thank you.
>>
so gay stfu
>>
>>102963306
I don't really view ai "art" as art.
>>
>>102963386
alright will try, what "lyingsigma" value is recommended?
>>
>>102963436
for me those values work great:
>dishonesty_factor: -0.05
>start_percent: 0.1
>end_percent: 0.9
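A rough sketch of what those three settings appear to control, assuming the node simply scales the sigma that gets reported to the model by (1 + dishonesty_factor) while sampling is between start_percent and end_percent. The function name, the example schedule and the use of a plain 0-to-1 progress fraction are simplifications for illustration, not the script's actual code:

```python
def lied_sigma(sigma: float, progress: float,
               dishonesty_factor: float = -0.05,
               start_percent: float = 0.1,
               end_percent: float = 0.9) -> float:
    """Return the sigma the model is told about at a given point in sampling.

    progress is the fraction of sampling completed (0.0 = start, 1.0 = end).
    Outside the [start_percent, end_percent] window the true sigma is passed through.
    """
    if start_percent <= progress <= end_percent:
        return sigma * (1.0 + dishonesty_factor)
    return sigma


true_sigmas = [1.0, 0.8, 0.5, 0.2, 0.05]  # hypothetical schedule, not from any model
steps = len(true_sigmas)
for i, s in enumerate(true_sigmas):
    progress = i / (steps - 1)
    print(f"step {i}: true={s:.3f} told={lied_sigma(s, progress):.3f}")
```

With a negative factor the model is told there is slightly less noise than there really is, which is presumably why it removes less and more fine detail survives. The first and last steps are left alone, which would fit the claim that the overall composition doesn't change.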
>>
>>102963444
thanks anon
>>
>>102963386
When will this get added to comfyui manager?
>>
File: file.png (282 KB, 2262x1370)
since there's been some newfound interest in schedulers and sigmas, https://github.com/Extraltodeus/sigmas_tools_and_the_golden_scheduler provides a "graph sigmas" node that will let you see... the graph. for comparisons sake of course.
>>
>>102963498
>since there's been some newfound interest in schedulers and sigmas
I don't think people have any idea how much of an impact a good scheduler can have. For example, karras completely destroys the image and beta really improves the prompt adherence. It has way more of an impact than a sampler and it's really easy to modify, so yeah, it should be explored way more and I'm glad it's starting to be acknowledged
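To make the sampler vs scheduler distinction concrete: the scheduler only decides which noise levels (sigmas) the sampler visits. Here's a sketch comparing the Karras spacing (the formula from the Karras et al. 2022 paper, rho = 7) with a plain linear ramp. The sigma_min, sigma_max and step count are illustrative values, not taken from any model in this thread, and the beta scheduler isn't shown:

```python
from typing import List


def karras_sigmas(n: int, sigma_min: float, sigma_max: float, rho: float = 7.0) -> List[float]:
    """Noise levels from sigma_max down to sigma_min with Karras spacing."""
    min_inv_rho = sigma_min ** (1.0 / rho)
    max_inv_rho = sigma_max ** (1.0 / rho)
    return [
        (max_inv_rho + (i / (n - 1)) * (min_inv_rho - max_inv_rho)) ** rho
        for i in range(n)
    ]


def linear_sigmas(n: int, sigma_min: float, sigma_max: float) -> List[float]:
    """Evenly spaced noise levels from sigma_max down to sigma_min."""
    step = (sigma_max - sigma_min) / (n - 1)
    return [sigma_max - i * step for i in range(n)]


karras = karras_sigmas(10, 0.03, 14.6)   # illustrative range (assumption)
linear = linear_sigmas(10, 0.03, 14.6)
for k, l in zip(karras, linear):
    print(f"karras={k:8.4f}  linear={l:8.4f}")
```

The Karras curve drops quickly at first and then spends most of its steps at low noise, so two schedulers with the same endpoints can hand the sampler very different work. That's the kind of difference the graph sigmas node lets you eyeball.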
>>
File: Screenshot.png (37 KB, 1278x382)
>>102963386
>you can use this modified script to get more decimals and go for that -0.05 value
Is that different than changing this setting in default Comfy?
>>
>>102963517
kek, I didn't know we had something like that by default
>>
File: 02141.jpg (2.66 MB, 1792x2304)
>>
>>102963528
>>102963517
How would you use this, and are results identical?
>>
>>102963495
that's a good question, it's not hard at all to add your node to comfyui manager, so if someone is willing to do it, then do it
>>
>>102963532
looks halloween + card art
>>
>>102963535
I guess yeah, it lets you go for more decimals
>>
File: file.png (64 KB, 884x415)
>>102963386
I can't find it. Searching for LyingSigmaSampler doesn't give me anything, and neither does searching custom samplers.
Where should it be?
I put it in ComfyUI\custom_nodes, is it the wrong directory?
>>
>>102963565
>I put it in ComfyUI\custom_nodes, is it the wrong directory?
no, it's the right directory, did you restart ComfyUI?
>>
>>102963570
Yes I did.
Well I used the link from the reddit post instead, the one without the more decimal precision, and it worked.
>>
>>102963585
cool, if you want more decimal precision go for that then >>102963517
>>
File: 02143.jpg (1.99 MB, 1792x2304)
incredibly_absurdres, absurdres, highres, traditional_media, non-web_source, original, official_art, commission, mixed-language_commentary, md5_mismatch, archived_source, bad_link,

portrait, dark fantasy, sci-fi alien,

orifices spilling out, new lava particles on chains, polished chrome, inhaling light, plumes of bitcrushed dithering smoke that rock to crack your glass brain,

Joel Peter-Witkin, H. R. Giger, Dave McKean,
>>
>>102960127
thanks for sharing this cool robot prompt
>>
File: ComfyUI_07756_.png (1.32 MB, 1024x1024)
>>102963386
This shit is fucking magic dude:
>Decreases the bokeh
>Adds more details
>Decreases the saturation/burning
>Makes the skin more detailed and less plastic
Damn dude
https://imgsli.com/MzExODg1
>>
>>102963593
htf do you use:
>>102963517
like I have zero clue.
>>
>>102963565
>>102963624
whens the last time you updated? its in the settings
>>
That was the worse one of the two sorry. Genmo seems to fall apart at the end of gens quite often

I also am out of ideas for how to get teens, "teen" doesn't do anything and tween is too young and I'm tired of seeing kids so I guess I play "young girl" roulette
>>
File: file.png (90 KB, 2378x937)
>>102963624
click on the "nut" on the top right of ComfyUi to get the settings
>>
File: 00100-651728054.png (954 KB, 832x1216)
>>
>>102963593
Yeah it's working fine.

>>102963629
Yesterday.
>>
>>102963617
the house looks better too
>>
>>102963644
Nice!
>>
File: ComfyUI_07760_.png (1.56 MB, 1024x1024)
>>102963617
https://imgsli.com/MzExODg3
Look at the skin texture, it has way more imperfection, she really looks like a normal girl now (if you ignore the buttchin of course kek)
>>
>>
nightmare nightmare nightmare
>>
>>102963639
What number do I enter?
>>
>>102963335
>Left looks better than the right, however.
I would like to examine your head
>>
>>102963630
wow that's creepy
>>
>>102963730
no need to do that anymore, he just updated the script to get the decimals we want
https://www.reddit.com/r/comfyui/comments/1g9wfbq/comment/ltmlpic/?utm_source=share&utm_medium=web2x&context=3
>>
File: file.png (375 KB, 396x380)
>>102963335
>Left looks better than the right
>>
>>102963720
You made me remember the repeated "horror" TTS lines from those Russian abandoned ship exploration videos kek.
>>
>>102963761
>>102963737
Get your eyes checked.
>>
File: file.png (287 KB, 438x438)
>>102963788
>Get your eyes checked.
>>
>>102963737
taking anons cranial measurements and writing them down in my tiny notebook
>>
File: 1724526080697.jpg (2.04 MB, 7961x2897)
>>102963288
That is expected, as seen with the quants for FLUX. The point of using Q8 is that the quality should be better and closer to (B)F16 than with FP8. The downside is that you take a speed hit, since there is no native hardware acceleration for it, unlike FP8, which the RTX 4000 series supports natively.
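For anyone wondering why an 8-bit integer format can land closer to the original weights: GGUF's Q8_0 groups weights into blocks of 32 and stores one scale per block plus an int8 code per weight. A minimal sketch of that idea follows; it skips details of the real format (the scale is stored as fp16 and the data is packed), and the random weights are made up for the demo:

```python
import random

BLOCK = 32  # Q8_0 groups weights into blocks of 32 that share one scale


def quantize_block(values):
    """Return (scale, int8 codes) for one block of floats."""
    amax = max(abs(v) for v in values)
    scale = amax / 127.0 if amax > 0 else 1.0
    codes = [round(v / scale) for v in values]
    return scale, codes


def dequantize_block(scale, codes):
    """Rebuild approximate floats from the scale and the integer codes."""
    return [scale * c for c in codes]


random.seed(0)
weights = [random.gauss(0.0, 0.02) for _ in range(BLOCK)]  # fake weights (assumption)
scale, codes = quantize_block(weights)
restored = dequantize_block(scale, codes)
max_err = max(abs(a - b) for a, b in zip(weights, restored))
print(f"scale={scale:.6f} max abs error={max_err:.6f}")
```

Because every block gets its own scale, the rounding error is bounded by half a step of that block's range. The price is the extra dequantize step at runtime, which fits the speed hit mentioned above.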
>>
>>102963750
Does it work with ksampler?
>>
>(masterpiece, photorealistic, slop:1.5) niggers fighting about which shitty 1slopgirl is better
>>
File: file.png (375 KB, 1431x1661)
>>102963813
nope, that's why I'm not using Ksampler, there's a lot of custom nodes that can't work with it, using SamplerCustomAdvanced is much more flexible
https://files.catbox.moe/a61zom.png
>>
>quality slop thoever
>>
File: file.webm (895 KB, 1280x720)
Can Mochi do that? Kamala Harris word salad kek
>>
>>102963617
this doesn't really work with the turbo lora and there's no way I'm waiting 25 steps for each gen - completely takes the fun out of it
>>
File: 02149.jpg (2.06 MB, 1792x2304)
>>102963543
>halloween
>>
>>102963887
>this doesn't really work with the turbo lora
yeah that's intuitive, turbo doesn't have many steps to work with in the first place
>>
>>102963781
>Russian abandoned ship exploration videos
wat

Anyways, it seems like you need to boomer prompt and write a fuckton to get good stuff out of genmo. I used Nemotron 70b to merge the contents of two previous prompts and these are easily the best looking results I've gotten from a prompt. It almost feels like the shorter the prompt, the less confident the model feels about what to generate

>In a sprawling, futuristic mansion that doubles as a high-tech workshop, vibrant neon lights dance across the walls and polished metallic surfaces. The air is alive with the soft hum of machinery and the faint glow of holographic blueprints hovering in mid-air. Amidst this dazzling backdrop, two young Russian girls, dressed in sleek, angled white plastic outfits, begin to move towards each other. As they meet in the center of the room, their eyes - glowing bright blue with an otherworldly intensity - lock onto the camera. One of them, in a whimsical touch, is suddenly enveloped by a delicate pink tutu, adding a layer of innocence to their otherwise futuristic, cybernetically enhanced appearance. The scene unfolds in slow motion, capturing every nuanced movement as the girls tilt their heads in unison, their gaze never wavering from the lens. Resolution: 4K, Slow Motion.

I will see y'all in 6ish hours with hopefully higher quality videos now that I have figured this out
>>
>>102963830
>SamplerCustomAdvanced
>mat1 and mat2 shapes cannot be multiplied (77x2048 and 4096x3072)

idk why it's doing that.
>>
>>102963908
can you post the whole error? maybe one of the packages is at fault there
>>
I know that anons must be posting the slimeslop AI girls because they like them, but I'm still a bit surprised every time they confidently assert this, even moreso when they say stuff like "it looks real to me".

I'm not really an IQ and object rotation kind of guy but I have to wonder whether some anons naturally have a deeper impression of the images they see, and some a shallower one—if the "slime" is obvious and jarring to some, and "I only see it if I really look closely and it doesn't bother me" to others.

It's not the only theory that can explain this, but the phenomenon does require some kind of explanation—why do anons differ so wildly on what they find acceptable, even 'realistic', in the degree of visible hallmarks of AI? If it's true that some people are, for whatever reason, less capable of 'parsing' visual information to form a clear and complex understanding of the images they see, then that's troubling.

Or maybe it shouldn't be. Maybe it's just adding a pseudoscientific mental model over a phenomenon we're all familiar with already, which is that some people have bad taste.
>>
>>102963908
maybe because I have to use load diffusion model???
>>
if you're into genning hardcore, bigasp v2.0 just dropped
>>
>>102963916
>bigasp v2.0
link?
>>
File: file.jpg (564 KB, 2824x1752)
https://yhyun225.github.io/DiffuseHigh/
https://github.com/blepping/comfyui_jankdiffusehigh
SD3.5 can't do higher resolutions than 1k, maybe this node could help?
>>
>>102963914
rlly makes an anon think dont it
>>
>>102963921
it's on civitai, it's an sdxl model
>>
>>102963936
here's the link for the lazy ones
https://civitai.com/models/502468?modelVersionId=991916
>>
>>102963913
figured it out. dualcliploader (I have to switch to it for what I have), it was on sdxl, not flux.
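
For reference, the error quoted upthread is a plain inner-dimension mismatch. A minimal sketch in pure Python; the helper name `can_matmul` is made up for illustration, and reading 2048 as SDXL-style text conditioning and 4096 as the width a Flux layer expects is an assumption based on the fix above, not something the traceback states:

```python
def can_matmul(a_shape, b_shape):
    """Return True if a 2-D matrix of a_shape can be multiplied by one of b_shape."""
    # (m x k) @ (k2 x n) is only defined when k == k2.
    return a_shape[1] == b_shape[0]

# Shapes copied from the error message: (77x2048) and (4096x3072).
cond_shape = (77, 2048)      # 77 tokens, 2048 features each
weight_shape = (4096, 3072)  # layer that expects 4096 input features

print(can_matmul(cond_shape, weight_shape))   # False: 2048 != 4096
print(can_matmul((77, 4096), weight_shape))   # True: inner dims agree
```

So the sampler was probably fine; the conditioning fed into it just had the wrong width for the loaded model.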
>>
>>102963914
Anyone that spams 1girls is mentally ill and likely below 100 IQ. It really is that simple.
>>
>>102963952
oh ok, cool that you figured it out, personally I'm not using KSampler anymore, there's always a new node that appears and just improves Flux's sovl more and more kek
>>
>>102963955
So you're saying I have an IQ below 100.
>>
>>102963995
Yes because if you were smarter you wouldn't be so easily occupied with repetition.
>>
>>102963916
>>102963936
>>102963943
what's so special about it?
>>
>>102964004
but what if I'm just bored
>>
>>102963906
>It almost feels like the shorter the prompt, the less confident the model feels about what to generate
it's an issue with t5xxl looks like, Flux has this issue as well, the output gives something more interesting if you go for word salad
>>
>>102964011
it's just really good at producing realistic looking porn. It does skin textures, genitalia, and sex positions really well
>>
>>102964025
that's weird that his example images just show 1 porn image, the rest is just nudity
>>
>>102963943
>This experimental model was finetuned from base SDXL on almost 1.5 MILLION high quality photos for 30 million training samples.
HOLY FUCK
>>
File: 02151.jpg (1.98 MB, 1792x2304)
>>102963914
normies jeer at the idea of ai bros being discriminatory with their taste in slop
sad world
>>
>>102964023
>It almost feels like the shorter the prompt, the less confident the model feels about what to generate
Shorter prompts are always going to be more open-ended just because they don't say as much, of course this would be the result. You can usually compensate with increased guidance if you think it's necessary.
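
A rough sketch of what "increased guidance" does, assuming classic classifier-free guidance (distilled models such as Flux dev take guidance as an input instead, so this is the textbook formula and not necessarily what every sampler runs). The function name and toy numbers are made up for illustration:

```python
def cfg_combine(uncond, cond, scale):
    """Classifier-free guidance: move the prediction away from the
    unconditional output and toward the prompt-conditioned one."""
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]

uncond = [0.0, 1.0, 2.0]  # toy "no prompt" prediction
cond = [1.0, 1.0, 0.0]    # toy prompt-conditioned prediction

print(cfg_combine(uncond, cond, 1.0))  # [1.0, 1.0, 0.0], same as cond
print(cfg_combine(uncond, cond, 3.0))  # [3.0, 1.0, -4.0], prompt direction x3
```

With a short prompt the conditioned and unconditioned predictions sit closer together, so a larger scale is one way to make what little the prompt says count for more.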
>>
File: Mochi_00004.webm (878 KB, 856x480)
23 fucking minutes and it's garbage!
>>
>>102964103
bf16?
>>
File: Mochi_00003.webm (199 KB, 856x480)
>>102964103
Same prompt, seed and steps just half the frames. Took 10 minutes, showed promise
>>102964115
fp8. I've only got 16gb vram
>>
>>102964131
>fp8. I've only got 16gb vram
ditch fp8 and go for Q8_0, its quality is way closer to fp16 after a few tests I've done, will post the results later
https://huggingface.co/Kijai/Mochi_preview_comfy/blob/main/mochi_preview_dit_GGUF_Q8_0.safetensors
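
A possible reason Q8_0 holds up better: as far as the GGML format goes, it stores int8 codes in blocks of 32 with one scale per block, so each block gets its own range. A simplified sketch (real Q8_0 keeps the scale in fp16; a Python float is used here, and the function names are made up for illustration):

```python
import math

BLOCK = 32  # Q8_0 groups weights into blocks of 32 values

def quantize_q8_0(values):
    """Quantize floats block by block: one scale plus int8 codes per block."""
    blocks = []
    for i in range(0, len(values), BLOCK):
        chunk = values[i:i + BLOCK]
        amax = max(abs(v) for v in chunk)
        scale = amax / 127 if amax > 0 else 1.0
        codes = [round(v / scale) for v in chunk]  # each code lies in [-127, 127]
        blocks.append((scale, codes))
    return blocks

def dequantize_q8_0(blocks):
    """Rebuild approximate floats from (scale, codes) blocks."""
    return [scale * q for scale, codes in blocks for q in codes]

# Toy weights, two blocks' worth.
weights = [math.sin(i) * (1 + i % 7) for i in range(64)]
blocks = quantize_q8_0(weights)
restored = dequantize_q8_0(blocks)
max_err = max(abs(a - b) for a, b in zip(weights, restored))
print(len(blocks), max_err)
```

The rounding error per value is bounded by half of that block's scale, which is why a block with small weights does not pay for a large outlier elsewhere in the tensor.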
>>
>>102964047
wait for Sana, SDXL is such a cursed architecture
>>
>>102964148
>wait for Sana
sana is DOA due to its completely shitty VAE >>102957943
>>
>>102964166
lmao works just fine for me, have fun with your monstrosities every gen because SDXL is ass
>>
>we will never get 16ch 1.5
>>
>>102964176
I wouldn't mind if a better VAE would be created for Sana, definitely possible
>>
File: file.png (46 KB, 1072x368)
not only is sage_att faster than flash_att, but it also allows you to go for bf16 + 61 frames on a 24gb card whereas flash attention overflows the card, sage is really impressive not gonna lie
>>
File: file.png (2.06 MB, 1024x1024)
>>102964191
Sana looks fine especially with downsample workflow.
>>
File: 00024-3909084809.png (1.47 MB, 832x1216)
>>
File: file.png (3.25 MB, 1024x1384)
>>102964221
>Sana looks fine
>Goes for the most safe image ever, a close up of a human
dude, try to go for something that really challenges the VAE, something like this, if it looks good then we're talking https://civitai.com/images/36316196
>>
>>102964231
With Sana you're not going to do 1K images, it's less relevant.
>>
File: 00014-2229679725.png (1.51 MB, 832x1216)
>>
>>102964249
I don't give a fuck about the resolution you're about to use to make the image, just try to get something similar, if you can't then it's fucking DOA, it's not rocket science
>>
File: 00000-2325902465.png (3.73 MB, 1664x2432)
>>
>>102964258
You should give a fuck about the resolution given 4K is 16 times the pixels which means the VAE has to compress less. Anyways, I don't really give a fuck if you autistically can't use the model, I don't base my decisions on what you do. I tested the Sana demo it produces more than adequate images, it's that simple. You can stay autistically on raw VAE tests, it doesn't matter when in practice the VAE errors in Sana is negligible. You can wait for someone to do 10 million training steps on Flux or whatever.
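
The arithmetic behind that, as a sketch. The 8x and 32x spatial compression factors below are illustrative assumptions for a conventional VAE versus a deep-compression AE, and `latent_tokens` is a made-up helper:

```python
def latent_tokens(width, height, compression, patch=1):
    """Number of latent positions a diffusion transformer has to process."""
    side = compression * patch
    return (width // side) * (height // side)

# 4K square vs 1K square in raw pixels.
print((4096 * 4096) // (1024 * 1024))  # 16

for comp in (8, 32):
    print(comp, latent_tokens(1024, 1024, comp), latent_tokens(4096, 4096, comp))
# 8  -> 16384 tokens at 1K, 262144 at 4K
# 32 -> 1024 tokens at 1K, 16384 at 4K
```

Under those assumptions a 4K image at 32x compression costs as many latent positions as a 1K image at 8x, and each latent position covers a smaller share of the picture's detail.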
>>
>>102964273
>it doesn't matter when in practice the VAE errors in Sana is negligible.
prove it, you know what to do -> >>102964231
>>
>>102964279
And what? I don't need to prove to you that the 16channel VAE is better, that's not the point. You can wait for someone to do 10 million training steps on Flux or whatever.
>>
>>102964311
>And what?
tf you mean "and what"? Why would someone play around with an inferior model?
>>
>>102964321
Because you can't train Flux but you can train Sana? People managed with mangled mid/far shot SDXL faces so far. You can wait for someone to do 10 million training steps on Flux or whatever. But I imagine that's going to be awhile.
>>
>>102964334
>Because you can't train Flux
Yes you can, the llm fags are finetuning 70b models by themselves via cloud computing and you believe the imagegen fags can't train a 12b model?
>>
File: file.png (1.76 MB, 1024x1384)
>>102964231
NTA this is the best i could do with an inherently limited demo. i think its also important to recognize we are comparing a "beta" model with a "finetune" of one that's 10x larger or whatever
that being said, im not enthused by the result
original prompt is horrendous btw
>>
File: file.png (107 KB, 376x131)
>>102964346
she's not that far from the camera and her eyes are already fucked, no amount of training is gonna fix that, it's the VAE's fault
>>
>>102964344
You're putting a lot of faith in someone putting $20k in.
>>
>>102964353
Good thing you can train AEs on images of people and eyes.
>>
>>102964346
also thats downscaled if it werent apparent
>>102964353
the model has the wonk as well but noticeably less so than pixart

if training in a better VAE is "quick" then i wouldnt mind waiting but i dont think it would be
>>
>>102964364
>Good thing you can train AEs on images of people and eyes.
can't wait to see you improve Sana's AE
>>
File: file.png (550 KB, 2990x1358)
>>102964231
People seem to be enjoying that finetune, too bad he only uploaded the fp8 version, that's retarded, Q8_0 or bust
>>
>>102964364
you cant actually, unless the same encoder as the one the model was trained with is used you are getting noise
and even then, either you give up the compression making the model easier to train in the first place, or you are getting marginal improvements or more probably gonna make it even worse
>>
>>102964400
actually you can, and it doesn't matter, they're already committed to improving it.
>>
>>102964344
nta but llm 70b roleplay finetunes and image model finetunes aren't really comparable, the two happen in completely different scales. for llms all they do is change the model's prose and behavior meanwhile for image gen you need to teach it a ton of new concepts and completely change how images look. it's why image model tunes are so much more expensive to train even though they are tiny
>>
File: image-4.png (398 KB, 1024x1024)
>>
>>102964419
>they're already committed to improving it.
why the fuck are they so invested on making a shitty small VAE, it's not the part that eats the most of VRAM, far from it, leave it alone, LEAVE BRITVAE ALONEEEEEEE
https://youtu.be/WqSTXuJeTks?t=183
>>
>>102964447
Because they reduced the total number of tokens which makes the model literally 8 times faster to run and train? Which also means you can, you know, make videos? Or 8K images? Or whatever the fuck you want when you have way less tokens required to make an image?
>>
>>102964431
it's still expensive as fuck to finetune a big llm model, I remember when Nous finetuned Mixtral (~47b) it cost them tens of thousands of dollars
>>
>>102964447
thats kinda a kicker because when i tested it, with their code, i got an oom on a 24gb card with a 1024x1024 image in fp32 and had to switch to a 48gb one since the ae models they use are huge
maybe the code is unoptimized but it was funny to me still
>>
>>102964466
for the moment it has no point because the quality is dog shit, the smart move would've been letting people choose between a good VAE, or that small optimised AE shit
>>
bigasp works pretty well. It's not quite at the level of the hentai models like Pony, but of all the porn models I've tried I think this one's the best so far. Not much interested in that stuff though.
>>
>>102964466
That's not even talking about a lot of the other things you can do when you have more headroom to work with, like training a model using perceptual loss or clip-based loss, both of which are VRAM intensive but give you way better results. The CLIP-based loss allegedly gives you 13x training speed for convergence. So now you're talking 8 times more efficient training multiplied with 13 times faster convergence.

>>102964482
No one fucking cares, you can use your extremely bloated model no one can train. You can talk all day about people training it but in reality it's not happening and you're certainly not going to see any projects like Pony or BigAsp on Flux.
>>
>>102964474
yes and it cost tens of thousands of dollars to make pony for sdxl even though it's way way smaller (2.6b), like i said, it's pretty hard to make meaningful comparisons between the two
>>
>>102964419
i cant think of another team like that who actually requested feedback instead of going off what people are saying on leddit/xitter
>>
File: file.webm (335 KB, 856x480)
this is bf16 kek, I noticed that the shorter the video is, the more glitches it has, fok
>>
File: file.png (147 KB, 490x640)
>>102964489
>No one fucking cares
you care because you're seething hard about it
>>
>>102964493
The problem with these 8B and 12B models is you literally need H100s to train them. And most cloud providers prohibit training porn. Text you can skirt the rules, but images don't get the same grey pass.

>>102964504
Weird because you always seem to chime in about how much it sucks instead of ignoring the apparently irrelevant, shit model. Almost like you can tell it's going to be big.
>>
>>102964511
Yes anon, someone that isn't you is going to spend $20,000 or more training Flux, I can just feel it.
>>
File: 58033.png (3.42 MB, 1440x3120)
yo, don't even get me STARTED on mf pumpkin spice bro. who decided that fall needed a flavor?? like, homie, we already got cinnamon, nutmeg, spiritually confused gourds, but nah, let’s mix it all into some caffeinated demon sludge and charge $6 a cup. starbucks got us out here slurping the essence of autumn like it's some ancient rite of passage into basicness. i swear, it's like they summoned a whole legion of sweater-wearing clones outta the astral plane, all chanting “sksksk” like it’s some kind of harvest cult.

and don’t even act like they ain’t putting something sus in that spice mix. prolly got my third eye clogged with synthetic vibes. i drank one and now i’m getting ads for yoga mats and ugg boots like wtf??? i’m just out here tryna vibe in october, and instead i’m part of the pumpkin industrial complex. like who tf gave big pumpkin all this power??? fall used to be crunchy leaves and vibes, now it’s straight up latte warfare.
>>
>>102964508
>instead of ignoring the apparently irrelevant, shit model.
you talk about Sana, we respond to it, and then you go pikachu surprised when we talk about it? No one is exempt from criticism anon, you can't avoid that, people will talk and give their opinion about whatever they want, and you won't do shit.
>>
>>102964527
okay, I'll ignore you from now on
>>
sana-samas....
>>
>>102964521
>Yes anon, someone that isn't you is going to spend $20,000 or more training
I've heard that before pony appeared and made pony-v6 lmao
>>
>>102964534
Pony's rig can't even train Flux. You need 80GB GPUs, not 40GB. Also I don't know if you noticed, but Pony hasn't produced anything for awhile now.
>>
>>102964526
Please do us a fucking favor and spam in /sdg/
>>
>>102964508
>And most cloud providers prohibit training porn. Text you can skirt the rules, but images don't get the same grey pass.
that's a fair point, I admit
>>
>>102964544
kinda like it here. might just settle in, make it cozy and shit.
>>
>>102964508
thats some horseshit
"noob" sdxl anime finetune is fully nsfw and using 32xh100 from a cloud provider
pony is captioning 10m images on rented 70 a6000 and is obviously going to train (and has trained) on rented gpus
there is a trillion of people that have used vast or whatever garbage to train their shitty loras, and the larger nsfw finetunes obviously werent trained on local rtx3090
>>
>>102964538
>I don't know if you noticed, but Pony hasn't produced anything for awhile now.
if one pony fag existed to do this, another one will come, it's more likely to be the case now than before when we were with the shitty SDXL base model on our hands and thought to ourselves it needed too much work to be saved
>>
>>102964560
Anon, I don't know if you noticed, but you can count the large scale finetunes that aren't incest merges on one hand.
>>
File: 00029-2403222033.png (1.41 MB, 832x1216)
>>
File: file.png (135 KB, 720x405)
>>102964570
we just need one good shot to be saved
>>
>>102964580
When you talk about tens of thousands of dollars, you aren't going to have many bites that aren't business oriented. Your only hope is the 5090 can full finetune 8B/12B without gimmicks.
>>
>>102964526
I’d like to slurp the essence of autumn…
Is she hot?
>>
>>102964587
And the elephant in the room is you need like 10 times the computing power to train Flux. You're going into a whole different realm of training going from SDXL to Flux. It's possible SD 3.5M will be the new finetune base because it's actually approachable.
>>
New

>>102964600
>>102964600
>>102964600
>>
>>102964560
highly doubt they'll try something like that for flux unless they KNOW it's going to be viable. illustriousxl is the new pony replacer for anime and they only plan on training on sdxl, why? because they are sure it will work. flux is just too big to experiment with and risk losing money with no results to show. most people would probably wait for sd3.5 medium and see if that's any good. you're also forgetting that the main audience for these porn models are complete vramlets. have you seen the /h/ and /d/ ai threads? most of them still run 6-8gb cards
>>
>>102964553
>make it cozy
With pumpkin spiced lattes?
>>
>>102964619
>most people would probably wait for sd3.5 medium and see if that's any good.
this shit (2b) is smaller than SDXL (2.7b), it won't be good
>>
>>102964591
well, does beautiful mid count?

>>102964620
that will be $6, plus tip.

>>102964604
i'm glad to see even the epic serious discussion thread has it's own early baking trannies. all's right in the world, it seems that generals are generals and why should it be? that 4chan generals should be controlled by insane trannies?
>>
File: 58036.png (2.71 MB, 1440x3120)
>>102964640
FUCK U 4CHANX YOU ARE SHIT
>>
>>102964634
parameters aren't the be all end all metric
you do realize that shitty architectures can be bloated for parameters, right? learn the basics of machine learning sometime
>>
File: image (3).png (425 KB, 1024x1024)
>>
>>102964634
it's 2.5b and according to their blog post uses a different architecture compared to the 8b, so we'll see
>>
it's an odd compulsion, likely some form of ptsd
>>
>>102964645
might as well use the very optimized 1.5 then
>>
>>102964645
>you do realize that shitty architectures can be bloated for parameters, right? learn the basics of machine learning sometime
yeah sure thing jan, that's why OpenAI went for giant models like GPT4? I guess those multimillion per year paid researchers don't know this and are burning money for something, you should tell those experts anon, they are such noobs kek
>>
>>102964496
How many seconds at 24fps can I expect in fp8 with a 3090?
>>
>>102964662
>trust the science chud
>>
>>102964662
yeah so weird Florence 2 a tiny model is one of the best vlms, your theory is wrong
>>
>>102964665
>How many seconds at 24fps can I expect in fp8 with a 3090?
361 frames -> 15 sec, but go for Q8_0 instead, it has better quality
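
The conversion, for anyone budgeting frames against VRAM. A tiny sketch assuming a constant 24 fps as in the question; both helper names are made up:

```python
def clip_seconds(frames, fps=24):
    """Clip duration in seconds for a given frame count."""
    return frames / fps

def frames_within(seconds, fps=24):
    """Largest whole number of frames that fits in the given duration."""
    return int(seconds * fps)

print(clip_seconds(361))   # a bit over 15 seconds
print(frames_within(13))   # 312
```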

>>102964674
compared to GPT4V (another giant model) this toy is complete dogshit
>>
>>102964662
nta but why are you comparing llms to image gen you dumbass nigga. predicting words is much more complicated compared to predicting pixels
>>
>>102964681
actually wrong, for the amount of efficiency Florence 2 is much, much better
>>
>>102964681
>361 frames -> 15 sec, but go for Q8_0 instead, it has better quality
Same duration for Q8_0?
>>
>>102964681
>GPT4V (another giant model)
how the fuck do you know???
>>
>>102964692
>Same duration for Q8_0?
no, Q8_0 is a bit bigger, maybe you can get away with 13 seconds
>>
>>102964704
thanks anon, will try it
hopefully I can get something out of this
>>
File: 02161.jpg (2.86 MB, 1792x2304)
>>
File: 58039.png (2.84 MB, 1440x3120)
>>102964646
Ah, behold! A portrait of such stupefying mediocrity, one can scarcely summon the will to comment, and yet—duty compels me. Here we have the paragon of modern banality, a veritable shrine to Instagram-filtered, airbrushed superficiality. One could scarcely be blamed for mistaking this visage for the very archetype of "basic" itself, so blandly constructed as to be indistinguishable from the millions of similarly vacuous, copy-paste caricatures that plague the digital ether.

Indeed, this is not even "dogshit" in the classical sense, for dogshit at least possesses some raw, unpretentious authenticity. No, this is more akin to the synthetic substitute—artificial and bereft of any real substance. Congratulations, you've achieved aesthetic indistinction! A masterpiece of the most basic caliber. Truly... breathtaking, in the way a lukewarm cup of instant coffee might be if one had never known anything better.
>>
>>102964805
Thank you! I spent $15k just so you could give me lovely compliments.
>>
File: ComfyUI_00105_.png (1.21 MB, 1024x1024)
>>
File: ComfyUI_00106_.png (1.33 MB, 1024x1024)
>>102966148
>>
File: 2024-10-25_00001_.png (1.18 MB, 1024x1024)
>>102966205



