/g/ - Technology


Thread archived.
You cannot reply anymore.




File: 1714422527744016.png (1.6 MB, 896x1152)
Previous /sdg/ thread : >>102053253

>Beginner UI local install
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
SD.Next: https://github.com/vladmandic/automatic
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>Try online without registration
flux-dev: https://huggingface.co/spaces/black-forest-labs/FLUX.1-dev
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://aitracker.art
https://openmodeldb.info

>Black Forest Labs: Flux
https://huggingface.co/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
1 penis boy
>>
>>102067795
coming up!
>>
File: 00031-2377737842.png (1023 KB, 896x1152)
>>
File: 00152-2969836414.png (2.7 MB, 1728x1344)
>>
>>102067929
#4 is differently abled
>>
File: fSDG_News_000039_.jpg (363 KB, 896x512)
>mfw Resource news

08/24/2024

>ComfyUI v0.1.x Release: Devil In the Details
https://blog.comfy.org/comfyui-v0-1-x-release-devil-in-the-details-2/

>Azula - Diffusion models in PyTorch
https://github.com/probabilists/azula

08/23/2024

>Midjourney's AI-image generator website is now officially open to everyone - for free
https://www.zdnet.com/article/midjourneys-ai-image-generator-website-is-now-officially-open-to-everyone/

>Anthropic says California AI bill's benefits likely outweigh costs
https://www.reuters.com/technology/artificial-intelligence/anthropic-says-california-ai-bills-benefits-likely-outweigh-costs-2024-08-23/

>AiM: Scalable Autoregressive Image Generation with Mamba
https://github.com/hp-l33/AiM

08/22/2024

>Towards Pony Diffusion V7, going with the flow
https://civitai.com/articles/6309

>SynPlay: Importing Real-world Diversity for a Synthetic Human Dataset
https://synplaydataset.github.io/

>Iterative Object Count Optimization for Text-to-image Diffusion Models
https://ozzafar.github.io/count_token/

>T2VIndexer: A Generative Video Indexer for Efficient Text-Video Retrieval
https://github.com/Lilidamowang/T2VIndexer-generativeSearch

>UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation
https://github.com/xiangyu-mm/UniFashion

>NovelAI Diffusion V1 Weights Release (EN)
https://blog.novelai.net/novelai-diffusion-v1-weights-release-en-e40d11e16bd5

08/21/2024

>IP-Adapter checkpoint for FLUX.1-dev model
https://huggingface.co/XLabs-AI/flux-ip-adapter

>[kijai] ComfyUI Flux Trainer
https://github.com/kijai/ComfyUI-FluxTrainer

>Prompt-Guided Image-Adaptive Neural Implicit Lookup Tables for Interpretable Image Enhancement
https://github.com/satoshi-kosugi/PG-IA-NILUT

>MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning
https://haoningwu3639.github.io/MegaFusion/

>Aesthetics Toolbox v1.0.0
https://github.com/RBartho/Aesthetics-Toolbox
>>
>mfw Research news

08/24/2024

>Latent Feature and Attention Dual Erasure Attack against Multi-View Diffusion Models for 3D Assets Protection
https://arxiv.org/abs/2408.11408

>Positional Prompt Tuning for Efficient 3D Representation Learning
https://arxiv.org/abs/2408.11567

>AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
https://arxiv.org/abs/2408.11564

>HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution VLMs in Resource-Constrained Environments
https://arxiv.org/abs/2408.10945

>Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?
https://arxiv.org/abs/2408.10627

>SDE-based Multiplicative Noise Removal
https://arxiv.org/abs/2408.10283

>FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model
https://arxiv.org/abs/2408.09384

>Ask, Attend, Attack: A Effective Decision-Based Black-Box Targeted Attack for Image-to-Text Models
https://arxiv.org/abs/2408.08989

>DSReLU: A Novel Dynamic Slope Function for Superior Model Training
https://arxiv.org/abs/2408.09156

>Combo: Co-speech holistic 3D human motion generation and efficient customizable adaptation in harmony
https://arxiv.org/abs/2408.09397

>CHASE: 3D-Consistent Human Avatars with Sparse Inputs via Gaussian Splatting and Contrastive Learning
https://arxiv.org/abs/2408.09663

>ExpoMamba: Exploiting Frequency SSM Blocks for Efficient and Effective Image Enhancement
https://arxiv.org/abs/2408.09650

>GoodSAM++: Bridging Domain and Capacity Gaps via SAM for Panoramic Semantic Segmentation
https://arxiv.org/abs/2408.09115

>Reefknot: Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal LLMs
https://arxiv.org/abs/2408.09429

>Perceptual Depth Quality Assessment of Stereoscopic Omnidirectional Images
https://arxiv.org/abs/2408.10134

>MCDubber: Multimodal Context-Aware Expressive Video Dubbing
https://arxiv.org/abs/2408.11593
>>
File: 00034-77878281.png (791 KB, 896x1152)
791 KB
791 KB PNG
>>
File: 000000_16822_.png (2.55 MB, 952x1667)
2.55 MB
2.55 MB PNG
>>102067976
>>
>me in the back
>>
File: tmpie2x1l8v.png (944 KB, 768x1024)
944 KB
944 KB PNG
>>
>>
File: delux_sf_00007_.png (1.53 MB, 1536x968)
1.53 MB
1.53 MB PNG
>>102068184
this is one of your best ones yet. very nice
>>
File: hghfg.gif (1.82 MB, 512x768)
1.82 MB
1.82 MB GIF
like that japanese guy in the jungle long after the end of wwii, but with 1.5.
https://youtu.be/_qUQgXrlzk4?si=iWWI0Kqg-0D-HqHK
>>
i like how if you push flux far enough, sometimes it goes "you know what man, fuck it, fuck your prompt and fuck you, i'm doing my own thing with certain of your phrases" but it still kinda works lel

>>102068227
the colors really came together there
>>
File: 00039-3959107963.png (1020 KB, 896x1152)
1020 KB
1020 KB PNG
>>
File: 00007-3074654286.jpg (694 KB, 1400x1112)
694 KB
694 KB JPG
>>
File: tmp689gqk5y.png (1.44 MB, 1152x896)
1.44 MB
1.44 MB PNG
>>
File: flux_cyber-env07.jpg (3.82 MB, 2896x2272)
3.82 MB
3.82 MB JPG
>>102068275
>i like how if you push flux far enough, sometimes it goes "you know what man, fuck it, fuck your prompt and fuck you, i'm doing my own thing with certain of your phrases" but it still kinda works lel
Uh... Example?
>>
>>102068388
should be 'people with weird obsession with debo'
>>
>>102068403
that was the example lel
>no animal frens
>symmetric composition
>no "lolwut" aspect
that goes against my prompts
>>
File: tmpv6qkayl8.png (1.32 MB, 1152x896)
1.32 MB
1.32 MB PNG
>>
>>
File: 000000_16830_.png (2.29 MB, 952x1667)
2.29 MB
2.29 MB PNG
>>
File: delux_sf_00008_.png (1.78 MB, 1536x968)
1.78 MB
1.78 MB PNG
>>102068388
wow, what a nice thing to say about me. thanks!
>>
File: delux_sf_00009_.png (1.8 MB, 1536x968)
1.8 MB
1.8 MB PNG
>>102068658
real monke or poster monke?
>>
Hello
>>
File: tmpqbgx6jg2.png (1.51 MB, 1152x896)
1.51 MB
1.51 MB PNG
>>
DOOD i should probably go to sleep
>>
>tfw schizo knows debo is around
>>
File: tmp4caggy_5.png (870 KB, 768x1024)
870 KB
870 KB PNG
>>
File: 00033-692679806.png (826 KB, 896x1152)
826 KB
826 KB PNG
>>102068708
hello
>>
File: tmpxigq9xp8.png (886 KB, 768x1024)
886 KB
886 KB PNG
>>
>>
File: 00295-2758676467.jpg (897 KB, 1248x1864)
897 KB
897 KB JPG
>>
File: 00046-3351233762.png (1.1 MB, 896x1152)
1.1 MB
1.1 MB PNG
>>
File: tmplvbc_096.png (1.07 MB, 768x1024)
1.07 MB
1.07 MB PNG
>>
File: 00047-350251966.png (1.17 MB, 896x1152)
1.17 MB
1.17 MB PNG
>>
>>
File: tmpcsc29esf.png (1.02 MB, 768x1024)
1.02 MB
1.02 MB PNG
>>
WTF I love RealDream 12 now
>>
>>
File: tmpmeds_o_u.png (989 KB, 768x1024)
989 KB
989 KB PNG
>>
>>102069479
dling right now
>>
>>102069479
not allowed
>>
File: tmpy_9tdpjp.png (937 KB, 768x1024)
937 KB
937 KB PNG
>>
>>102068088
swiggity swooty
>>
File: 00052-4237494228.png (941 KB, 896x1152)
941 KB
941 KB PNG
>>
File: tmp20ff5p3v.png (933 KB, 768x1024)
933 KB
933 KB PNG
>>
File: 00001-3875963767.jpg (99 KB, 896x1152)
99 KB
99 KB JPG
>>
File: 00133-313508792.png (1.32 MB, 896x1344)
1.32 MB
1.32 MB PNG
>>
>>102069595
poor lad doesn't even have a monitor for his pc
>>
File: tmp40ckt79u.png (1020 KB, 768x768)
1020 KB
1020 KB PNG
>>
File: 00004-1777542442.png (1.66 MB, 896x1152)
1.66 MB
1.66 MB PNG
>>
File: 57729.jpg (510 KB, 1440x3120)
510 KB
510 KB JPG
i can't even keep these fucking threads straight anymore. fml
>>
File: 57730.jpg (276 KB, 1440x3120)
276 KB
276 KB JPG
>>102069963
i like this. i'd prefer it be more occultish, but it's still very good.
>>
File: 00001-1218732348.jpg (127 KB, 1096x1087)
127 KB
127 KB JPG
>>102070039
Thanks, I haven't tested yet how Flux responds to the Illuminati tag
>>
>>102070001
What does that mean?
>>
File: 57732.jpg (777 KB, 1440x3120)
777 KB
777 KB JPG
>>102070130
/ldg/, /sdg/ who can even keep track anymore? what a world, what a world.
>>
File: 00011-2816623277.png (1.13 MB, 768x1152)
1.13 MB
1.13 MB PNG
ni-ni-nite. didnt see hardly any interesting gens today, besides my own, but that isnt new. tomorrow will be better, im sure.
>>
>>102070194
One is full of drama and arguing, the other is full of friendly discussion and how-tos. No idea how it's hard for you to tell them apart.
>>
>>102070515
good night
>>
>>
>>102070660
looks kinda like kristen stewart
>>
>>102070701
Are you accusing me of something?
>>
File: DDpuJOTMxL-K1MgWEYTw9.png (1.63 MB, 1024x768)
1.63 MB
1.63 MB PNG
Yes!!
>>
>>102070708
you know what you did
>>
File: 00010-736900594.png (769 KB, 896x1152)
769 KB
769 KB PNG
>>
>>102070723
I've no idea what you're talking about buddy, I don't even know who that stewart guy is
>>
>>102070712
>implying liberals believe in God
>>
>>102070750
she's praying to satan
>>
>>102070767
Ahhhh ok
>>
>Well, well, well! If it isn't the Prince of Peace. I guess turning the other cheek doesn't work so well in the real world, does it? Faggot
>>
>>102070952
>the teachings of Greek paganism don't result In a cohesive, functioning society
Color me surprised
>>
File: p7kcbRrkfag8N8swb-FKq.png (1.36 MB, 1024x768)
1.36 MB
1.36 MB PNG
But then Jesus does a couple miracles with a deck of playing cards, which always tends to smooth things over
>>
>>102070967
Greece is a joke. Even their language is silly nonsense.
>>
>>102070979
>How did he know I picked that card?
>>
>>102070988
I dont need you to yes man my posts
>>
>>102070998
Yes sir!
>>
>>102070952
You mad bro?
>>
>DingDong FlimFlam The Furniture Destroyer Erupts Boastfully From a Sandwich Bicycle, many birds are offended, chicken egg mongoose gorilla attack, mushroom kitchen striptease bonanza, ak-47, tec-9, nuclear angel demon eats ice cream peacefully under the green sun
>>
>>102071061
kek Flux Pro actually shows its superiority on this one for once
>>
>>102071080
>superior
None of that image is coherent
>>
File: 00013-2792198796.png (1.55 MB, 896x1152)
1.55 MB
1.55 MB PNG
>>102071061
>>
File: 00401-313508792.png (1.33 MB, 896x1344)
1.33 MB
1.33 MB PNG
>>
File: river.webm (1008 KB, 1920x640)
1008 KB
1008 KB WEBM
>>
File: ComfyUI_cp_00169_.png (1.3 MB, 1024x1024)
i tried Flux on my gtx1060
it didn't like it
>2min per iteration/step
>pic unrelated
>>
Confused about this part with the token copying and env file
https://youtu.be/F-7gfqSP2ZY?t=448

I have no env file

Also if I try to continue the training Lora tutorial I get these errors, pic rel

Should I just uninstall Pinokio? I think it's conflicting.

Now I have another issue: lots of files were downloaded (god knows where) to my C: drive and now I don't have enough space.

halp!
>>
>>102071688
fuck knows why these guides are leading you on a wild goose chase for the token when all you need to do is
>huggingface-cli login
>>
File: ComfyUI_cp_00176_.png (1.08 MB, 1024x1024)
>>102071688
seems like the downloaded files are being used by another process, probably windows defender scanning them or the search indexer scanning files for faster search results.

Miniconda envs are usually under c:\users\yourusername\.conda

might be where all the downloaded files are (i'm making assumptions here)
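One way to check that guess is to measure which folders are actually eating the space. This is a minimal sketch using only the standard library; the `.conda` location comes from the post above, and any other folder you point it at is your own assumption.

```python
import os

def dir_size_bytes(root):
    """Sum the sizes of all regular files under root, skipping unreadable ones."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root, onerror=lambda err: None):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                if not os.path.islink(path):
                    total += os.path.getsize(path)
            except OSError:
                pass  # file vanished or is locked by another process
    return total

def largest_subdirs(root, top=10):
    """Return (size_bytes, path) for the biggest immediate subdirectories of root."""
    sizes = []
    try:
        entries = list(os.scandir(root))
    except OSError:
        return sizes
    for entry in entries:
        if entry.is_dir(follow_symlinks=False):
            sizes.append((dir_size_bytes(entry.path), entry.path))
    return sorted(sizes, reverse=True)[:top]

def report(root, top=10):
    """Print the largest subdirectories of root in GiB."""
    for size, path in largest_subdirs(root, top):
        print(f"{size / 2**30:8.2f} GiB  {path}")
```

Calling `report(os.path.expanduser("~"))` lists the heaviest folders in the user profile, which should show whether `.conda` or some cache directory is the culprit.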
>>
>>102071574
how did you do that?
>>
File: flux_tmp~3.png (3.01 MB, 2304x1792)
3.01 MB
3.01 MB PNG
/sdg/ seems slow lately
>>
File: 765819852007959619.png (1.23 MB, 1216x832)
1.23 MB
1.23 MB PNG
>>
How do I get Flux running on Kaggle? I don't have a GPU and on Kaggle there's an option called T4 GPU x2 which gives two GPUs with 15 GB RAM. But for some reason it only uses one GPU then complains about running out of memory.
>>
File: 765820388878878387.png (1.32 MB, 1216x832)
1.32 MB
1.32 MB PNG
>>102071864
kek
>>
>>102071868
>Kaggle
>T4
>15 GB RAM
yeah not going to happen
>>
>>102071881
How much RAM do you need, I want to use Schnell with 2 inference steps.
>>
is it just me or does img2img not work on forge?!
>>
>>102071899
more than 15
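A rough back-of-the-envelope for why 15 GB falls short: weight memory is roughly parameter count times bytes per parameter. The sketch below assumes the Flux transformer is about 12B parameters and treats the ~4.7B figure for the T5 text encoder as an approximation; activations, the VAE and framework overhead are ignored, so real usage is higher.

```python
GIB = 2**30

def weights_gib(params_billion, bytes_per_param):
    """Approximate memory needed just to hold the weights, in GiB."""
    return params_billion * 1e9 * bytes_per_param / GIB

FLUX_TRANSFORMER_B = 12.0  # FLUX.1 transformer, roughly 12B parameters
T5_XXL_ENCODER_B = 4.7     # approximate figure, treated as an assumption here

for label, bytes_per_param in (("bf16", 2), ("fp8", 1), ("4-bit", 0.5)):
    model = weights_gib(FLUX_TRANSFORMER_B, bytes_per_param)
    encoder = weights_gib(T5_XXL_ENCODER_B, bytes_per_param)
    print(f"{label:>5}: transformer {model:5.1f} GiB, text encoder {encoder:4.1f} GiB")
```

The number of inference steps does not change this estimate, since the weights have to be resident regardless. Two 15 GB cards also do not behave like one 30 GB card unless the software explicitly places different components on different devices.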
>>
>>102071919
Would 30 suffice?
>>
>>102071982
try it
>>
File: ComfyUI_hgdf_00012_.png (1.31 MB, 1024x1024)
1.31 MB
1.31 MB PNG
i should have switched to SDXL earlier...
>>
>>102072035
what does it do?
>>
>>102071994
I am trying but I don't know how to split the workload across two GPUs.
>>
File: ComfyUI_temp_ccufe_00023_.jpg (2.51 MB, 2560x1440)
2.51 MB
2.51 MB JPG
>>102072074
more coherent images when compared to SD1.5
(backgrounds, architecture and the like)

>picrel
>an old SD1.5 gen
>>
>>102072079
use software that supports it
>>
File: upscale grid.webm (3.78 MB, 1210x1680)
3.78 MB
3.78 MB WEBM
>>102071818
Inpainting with animatediff. Normally it doesn't like to work with small areas like this stream, so now I'm cropping the masked area and upscaling it.
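The crop step can be sketched without any image library: find the bounding box of the mask, pad it, and work out how much to upscale the crop. This is only an illustration of the idea, not the actual workflow used here; the mask is a plain list of rows and the 512 target size is an assumption.

```python
def mask_bbox(mask, pad=0):
    """Return (left, top, right, bottom) around the nonzero cells of a 2D mask.

    right and bottom are exclusive. Returns None for an empty mask.
    """
    height, width = len(mask), len(mask[0])
    rows = [y for y, row in enumerate(mask) if any(row)]
    if not rows:
        return None
    cols = [x for x in range(width) if any(row[x] for row in mask)]
    left = max(min(cols) - pad, 0)
    top = max(min(rows) - pad, 0)
    right = min(max(cols) + 1 + pad, width)
    bottom = min(max(rows) + 1 + pad, height)
    return left, top, right, bottom

def upscale_factor(box, target=512):
    """Scale that brings the longer side of the crop up to target pixels."""
    left, top, right, bottom = box
    return target / max(right - left, bottom - top)
```

After inpainting the enlarged crop, it gets scaled back down by the same factor and pasted into the original frame at `(left, top)`.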
>>
File: Toyosatomimi no Miko.jpg (295 KB, 1536x1536)
295 KB
295 KB JPG
>>
>>102072326
Neat
>>
is there any reason to use sd1.5 at all?
i thought maybe running it and upscaling would be quicker than just running xl but it ends up being around the same time but with shittier results.
>>
>>102072518
Only if you want to use Loras that are not available for SDXL. Or if you like that samey look all 1.5 images have for some reason.
>>
File: stream.webm (797 KB, 1920x640)
797 KB
797 KB WEBM
That grid was generated using the kl-f8-anime2 vae, which unfortunately cooks the colors. After testing, vae-ft-mse-840000-ema-pruned seems the least lossy.
>>102072481
Thanks
>>102072518
I use 1.5 models for animatediff.
>>
File: file.jpg (398 KB, 1792x1024)
398 KB
398 KB JPG
upgraded infrastructure. 5 (five) download nodes now
>>
File: 000000_16838_.png (2.12 MB, 952x1667)
2.12 MB
2.12 MB PNG
G'mornin Anons,
>>
is there any way to change the default generation settings in forgeui instead of manually inputting them every single time?
>>
>>102072657
if there is it would be documented on the repo, why not check there
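For what it's worth, A1111-derived UIs have generally kept their UI defaults in a `ui-config.json` file in the install folder, and Forge is assumed here to do the same. The sketch below just merges overrides into such a JSON file; the key names are examples of the pattern those files tend to use, not something confirmed in this thread, so check them against your own file before relying on them.

```python
import json

def set_ui_defaults(path, overrides):
    """Merge overrides into a ui-config.json style file and return the merged dict."""
    try:
        with open(path, "r", encoding="utf-8") as f:
            config = json.load(f)
    except FileNotFoundError:
        config = {}
    config.update(overrides)
    with open(path, "w", encoding="utf-8") as f:
        json.dump(config, f, indent=4)
    return config

# Hypothetical overrides: verify the exact key names in your own ui-config.json.
EXAMPLE_OVERRIDES = {
    "txt2img/Sampling steps/value": 20,
    "txt2img/Width/value": 896,
    "txt2img/Height/value": 1152,
}
```

Back up the file first and restart the UI afterwards, since the defaults are usually only read at startup.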
>>
File: stream 2.webm (1.07 MB, 1920x578)
1.07 MB
1.07 MB WEBM
>>102072632
Sounds interesting. I've been enjoying reading your web scraping and data formats series.
>>102072654
Morning
>>
File: file.png (26 KB, 1872x132)
>>102072877
they're running a script similar to one in the articles. bulk downloads of images from parquets, but writing to webdataset then uploading that back to the storage server
bigmong is maxed out atm and its faster to distribute downloads due to i/o and bandwidth
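For anyone unfamiliar with the format: a webdataset shard is an ordinary tar file in which files sharing a basename make up one sample. The sketch below writes such a shard with the standard library only; it is not the script being described, and the key names and extensions are made up for illustration.

```python
import io
import tarfile

def write_shard(path, samples):
    """Write samples to a tar shard.

    samples is an iterable of (key, files) where files maps an extension
    such as "jpg" or "txt" to raw bytes. Files of one sample are written
    back to back so a streaming reader can group them by basename.
    """
    with tarfile.open(path, "w") as tar:
        for key, files in samples:
            for ext, data in files.items():
                info = tarfile.TarInfo(name=f"{key}.{ext}")
                info.size = len(data)
                tar.addfile(info, io.BytesIO(data))

def list_shard(path):
    """Return the member names of a shard in stored order."""
    with tarfile.open(path, "r") as tar:
        return tar.getnames()
```

Because each shard is a self-contained file, download workers can each fill their own shards and upload them independently, which fits the distributed setup described above.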
>>
File: ComfyUI_hgdf_00022_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
>>
>>102073184
cute what checkpoint?
>>
platforms that allows nudism/sex/gore?
>>
i miss schizo anon
>>
>>102073262
I'm here
>>
>>102073241
Why would you EVER want to create that kind of disturbing imagery? I'm calling the FBI.
>>
>>102073298
sus
>>
>>102073329
I like using NAI so i wanted some illustrations for my fanfics.
>>
File: ComfyUI_hgdf_00033_.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
>>102073238
that gen was 50/50 merge with abyssorangemix2 and lolidiffusion0.14v_aom3a3 and a second sample pass with tponynai3_v6
>>
File: stream 3.webm (896 KB, 1920x640)
896 KB
896 KB WEBM
>>102072974
Cool to see it in action. Is that for your local suno project?
>>
File: 4chan war criminal.png (1.29 MB, 1280x960)
1.29 MB
1.29 MB PNG
>>
File: 000000_16845_.png (2.14 MB, 952x1667)
2.14 MB
2.14 MB PNG
>>102072877
Nice animations
>>
File: ComfyUI_hgdf_00033_.jpg (2.25 MB, 2048x2048)
2.25 MB
2.25 MB JPG
>>102073610
A slightly better version and a quick upscale.
>>
>>102074139
It ruined her tail, it looked better here >>102073610
>>
File: file.jpg (223 KB, 1792x1024)
>>102073786
current downloads:
>~2.1m tracks, at ~1.6m/2.1m, ~11tb
>ai gen images, ~20m sample across 50k unique prompts, then ~1.2m sample across ~1200 finetunes, from diffusion1b
preparing a 5m random subset from flickr for download, already did a ~4m public domain subset and have that downloaded
flickr images are for captioning, working on webdataset support in florence-tool atm for that
need to prepare a subset of steam screenshots for download, total set is ~25m
also civit loras is in constant progress
yesterday i finished up ~45k film frames from 235 films and ~123k tv frames from a sample of up to 50 episodes from 83 series. they're all processed and there's a linked database of characters from each film/episode. the characters arent tagged in the frames yet but they will be :^)
>>
File: stream 4.webm (1.55 MB, 1920x640)
1.55 MB
1.55 MB WEBM
>>102074046
Thanks anon.
Great textures on your angel.
>>
File: KINGOFIMGGEN~1.jpg (3.26 MB, 1792x2304)
3.26 MB
3.26 MB JPG
>>
File: ComfyUI_hgdf_00038_.jpg (2.36 MB, 2048x2048)
2.36 MB
2.36 MB JPG
>>102074333
you are right, didn't notice that
I am still experimenting though.
>>
File: 3359003210.webm (1.2 MB, 768x1216)
1.2 MB
1.2 MB WEBM
>>
File: ComfyUI_hgdf_00042_.jpg (2.37 MB, 2048x2048)
2.37 MB
2.37 MB JPG
>>102074731
impressive
>>
File: 0.jpg (303 KB, 1024x1024)
303 KB
303 KB JPG
>>
File: 000000_16851_.png (2.34 MB, 952x1667)
2.34 MB
2.34 MB PNG
>>102074374
Ty, getting good adjacent wings is difficult.

>can you have rain falling?
>>
File: delux_mp_00002.jpg (72 KB, 1024x1024)
72 KB
72 KB JPG
>>
>>102067754
is this the one from that southpark episode
>>
>>102067969
>Midjourney's AI-image generator website is now officially open to everyone - for free
>the site lets you generate as many as 25 images through a free trial.
>What happens once you chew up your allotment of 25 free images? Ah, then you'll have to sign up for one of the paid plans.
>>
File: ComfyUI_hgdf_00046_.png (1.62 MB, 1024x1024)
1.62 MB
1.62 MB PNG
>>
File: 1111.png (20 KB, 924x187)
20 KB
20 KB PNG
is A1111 dead?
>>
>>102075013
its never been more alive
>>
>>102075013
yeah, just use forge (until that dies too)
>>
File: tmpdvzmbaqx.png (1.24 MB, 1152x896)
1.24 MB
1.24 MB PNG
>>102074989
Thought that was a midriff at first
>>
File: ComfyUI_hgdf_00049_.png (1.68 MB, 1024x1024)
1.68 MB
1.68 MB PNG
>>102075098
>translucent golden armor
it got the first part right, i guess...
>>
>>102074871
Can you stop posting that schizo anon?
>>
File: 000000_16853_.png (2.16 MB, 952x1667)
2.16 MB
2.16 MB PNG
>>
File: tmptpclb7ic.png (955 KB, 768x1024)
955 KB
955 KB PNG
>>
File: taylorsphinx.png (1.81 MB, 1018x1018)
1.81 MB
1.81 MB PNG
>>
so far my favorite word for sexing up a FLUX gen is "bangable".

It's just a little bit dated (which I prefer), not in the naughty words list (which T5 performs poorly with), and coded for male gaze much more than female. Women call themselves sexy, they don't as much call themselves bangable.
>>
File: tmp1je4eti7.png (1.31 MB, 1152x896)
1.31 MB
1.31 MB PNG
Captcha:24GH

2.4 GHz?
>>
File: stream 5.webm (594 KB, 1920x640)
594 KB
594 KB WEBM
>>102074370
>1.6m/2.1m
If I'm reading that correctly it sounds like you're nearly done with audio scraping. It's a lot of work you're doing. Hope you got your coffee today.
>>102074860
That's on my list of things to try. I even have a few base gens prepared for testing. On previous attempts it didn't play nicely with interpolation. I'll see if I can come up with something.

see you anons.
>>
File: tmpt463g540.png (1.12 MB, 768x1024)
1.12 MB
1.12 MB PNG
>>
File: delux_sf_00011_.png (2 MB, 1536x968)
2 MB
2 MB PNG
>>102075537
I wonder why the "muh censorship" crowd never had a month-long spergout about the naughty list when thats all they ever talked about with sd

>>102075601
see ya
>>
File: tmpd70f1fb5.png (980 KB, 768x1024)
980 KB
980 KB PNG
>>
do conditioning operations work on flux
>>
>>102075810
Perhaps
>>
File: tmpckpj6xoy.png (931 KB, 896x1152)
931 KB
931 KB PNG
>>102075810
What are conditioning operations again?
>>
>>102075858
comfy nodes that combine conditionings (encodings)
>>
File: file.png (120 KB, 848x1289)
>>102075601
yeah of that lot, i had made a separate mongo collection of what i had at the time to work with. there's more tracks with lyrics now, ~3.7m
full metadata is still in progress - the endpoints return different data, like there's one for tracks in an album and another that says whether it has lyrics and has the file ids, thats the endpoint i mean when i say full metadata
i have metadata for 25.6m tracks and full metadata for 16.1m so far, it's about 25% of spotify's entire library lol
the audio is of course encrypted, but it's not a problem i've already made something for decryption and some are not actually encrypted, i'll run the decryption when download is complete
i've done some research into the modeling side of things but the next thing i'll be doing for this project is training a better lyrics llm
>>
File: tmpjbzcgwdu.png (1.27 MB, 1152x896)
1.27 MB
1.27 MB PNG
>>
>>102075858
They make her hair soft soft soft!
>>
>>102067754
Thread Challenge
how many posters are here?

Can each one of you generate an image for this prompt

Draw me a picture of vanity, materialism, pride, self-obsession and body dysmorphia

In your own style, modify the prompt as you like, bring out the best
>>
File: file.png (2.29 MB, 1024x1024)
2.29 MB
2.29 MB PNG
>tfw your perfect girl will never be real and even if she was you would be arrested for shagging her
we live in a society
>>
>>102076313
>vanity, materialism, pride, self-obsession and body dysmorphia
these are big words for me
>>
>This size does not support Azure Spot.
microsoft plz. single a100 not available on spot any more >:(
it's fine i can use 2
>>
>>102076355
Vain, obsessed with brands and products, snob, princess and insta thot
>>
File: ComfyUI_00429_.png (1.76 MB, 960x1280)
1.76 MB
1.76 MB PNG
>>102076313
I could use the ancient art of bullshido to explain how this pic is all of these, but the truth is it's just a random gen
>>
>>102076324
hang in there buddy
>>
>>102075661
>I wonder why people got mad about censorship when it was a new change from the previous SD models but not now with Flux when it has long been the standard that all models follow
Amazing comment Debo, you've really outdone yourself
>>
File: delux_sf_00012_.png (1.43 MB, 1536x968)
1.43 MB
1.43 MB PNG
>>102076615
ty
>>
>>102076313
i just prompted "flag of america"
>>
File: file.png (2.25 MB, 1024x1024)
2.25 MB
2.25 MB PNG
>>102076757
hurr
>>
>>102076775
gem
>>
when genning pics, do you listen to music usually? what do you do between outputs? besides post on 4channel
>>
File: Screenshot 2024-08-25.png (320 KB, 890x490)
320 KB
320 KB PNG
>>102076795
>when genning pics, do you listen to music usually?
Sometimes I do.
>>
>>102076896
thats cool, me too. sometimes i even like to sing.
>>
>>102076979
What do you sing
>>
>>102076990
covers of songs, ha ha i almost typed 'dongs', from my youth. i think i am pretty good at singing, in my (humble:1.8) opinion.
>>
>>102077011
like what genre? I guess punk rock
>>
File: 00081-376782558.jpg (507 KB, 1992x1536)
507 KB
507 KB JPG
>>102077050
alternative, in general.
>>
im a nigbophile
>>
File: file.png (19 KB, 937x347)
remembered i have runpod credit
A40, florence2-large, <CAPTION>, batch 12 (38gb usage, could increase a little), streaming webdataset, 16-17 images/s
very nice
only testing flickr_k-{00000..00001} so num_workers 2, probably better with more workers, these 2 shards will be finished in another 4 minutes so ill test
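The `{00000..00001}` part is brace range notation for a list of shard names. A minimal stdlib sketch of how such a pattern expands, assuming zero-padded numeric ranges only (no comma lists or nested braces):

```python
import re

_RANGE = re.compile(r"\{(\d+)\.\.(\d+)\}")

def expand_shards(pattern):
    """Expand {start..end} ranges in a shard pattern, keeping zero padding."""
    match = _RANGE.search(pattern)
    if match is None:
        return [pattern]
    start, end = match.group(1), match.group(2)
    width = len(start)
    names = []
    for i in range(int(start), int(end) + 1):
        filled = pattern[:match.start()] + str(i).zfill(width) + pattern[match.end():]
        names.extend(expand_shards(filled))
    return names

print(expand_shards("flickr_k-{00000..00001}"))
```

With one worker per shard, the shard count from the expansion is a natural upper bound for `num_workers`, which matches using 2 workers for a two-shard test.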
>>
Morning anons. I think I've given up on using anything but FLUX Fusion, at least for now. What was missing wasn't a faster model, it was that the text encoder wasn't .gguf. Thanks to the new forge update I can finally replace the text encoder, and load times are now faster. Genning takes about the same, but I can live with that.
>>
>>102077237
Not your blog.
>>
File: 00043-2387870570.png (1.08 MB, 1024x1024)
1.08 MB
1.08 MB PNG
does it make a difference if I just save the pony score tags as a style instead of writing them at the beginning of the prompt? Because the style method applies the style text last, after the actual prompt.
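As far as I understand the A1111-style behaviour, a style is appended after the prompt unless its text contains a `{prompt}` placeholder, in which case the prompt is substituted at that spot. The sketch below mimics that rule to show how the score tags could be kept in front; it is an illustration of the rule as assumed here, not the UI's actual code.

```python
def apply_style(prompt, style_text):
    """Combine a prompt with a style the way A1111-style UIs are assumed to."""
    if "{prompt}" in style_text:
        # Placeholder present: the prompt is dropped in where the style says.
        return style_text.replace("{prompt}", prompt)
    # No placeholder: the style text is appended after the prompt.
    parts = [part for part in (prompt.strip(), style_text.strip()) if part]
    return ", ".join(parts)

score_tags = "score_9, score_8_up, score_7_up"
print(apply_style("1girl, forest", score_tags))
print(apply_style("1girl, forest", score_tags + ", {prompt}"))
```

Whether tag position changes the output noticeably is something to test with a fixed seed rather than assume.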
>>
File: delux_sf_00015_.png (1.77 MB, 1536x968)
1.77 MB
1.77 MB PNG
>>102077237
gm. what is flux fusion? how long are your gens now with the gguf encoder?
>>
>>102076795
Many of my best gens were musically aided. All of these songs were the basis for some of my gens:
https://m.youtube.com/watch?v=UoeD8DWirQ8
https://m.youtube.com/watch?v=fkijihXrdhs
https://m.youtube.com/watch?v=GvESuQYzo_E
https://m.youtube.com/watch?v=Twag2MWjEZA&t=17m57s
https://m.youtube.com/watch?v=eN0rk5uOe2E
>>
File: tmpw6aqdz4b.png (859 KB, 768x1024)
859 KB
859 KB PNG
>>102076313
>>
>gm
>>
File: delux_sf_00016_.png (1.75 MB, 1536x968)
1.75 MB
1.75 MB PNG
>>102077410
gm
>>
>>102077432
gm
>>
File: file.jpg (323 KB, 1792x1024)
>10000it [10:17, 16.20it/s]
FAST desu
batch 14, {00002..00010}, num_workers 4, slightly faster 16.5-17.5~
>>
File: 00070-528847359.png (1.46 MB, 1600x960)
1.46 MB
1.46 MB PNG
>>102077077
I'm not sure what that means, post a song or two
>>
>>102076313
I took this to mean "prompt anything to try to get this result", I did not use the prompt text
>>
ill test some other gpus to find the most cost effective but with A40 it's going to be around 16.67 gpu hours per 1M images captioned so ~$5.83 @ $0.35 per hour
not bad desu
still that's like $3k to caption the current 515m flickr set ;_;
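The arithmetic behind those numbers, as a quick sketch. The throughput of about 16.67 images/s is inferred from the "16.67 gpu hours per 1M" figure and sits inside the 16-17 images/s range reported earlier; the price is the $0.35 per hour quoted above.

```python
def caption_cost(n_images, images_per_sec, usd_per_gpu_hour):
    """Return (gpu_hours, usd) for captioning n_images at a steady throughput."""
    gpu_hours = n_images / images_per_sec / 3600
    return gpu_hours, gpu_hours * usd_per_gpu_hour

for label, count in (("1M images", 1_000_000), ("515M flickr set", 515_000_000)):
    hours, usd = caption_cost(count, images_per_sec=16.67, usd_per_gpu_hour=0.35)
    print(f"{label}: {hours:,.1f} gpu hours, ${usd:,.2f}")
```

Swapping in another card only needs its measured images/s and hourly price, so comparing GPUs comes down to dollars per million images.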
>>
>>>/r/19475187
Not sure if this is the right place to ask, but no one is replying in /r/
Just wanted to save a couple of hours of downloading everything and experimenting. Even though I'm sure I'll be doing days of that later anyway.
>>
File: tmp09gttmj2.png (1.09 MB, 768x1024)
1.09 MB
1.09 MB PNG
>>
File: kkkkkk8d.webm (1.16 MB, 864x1168)
1.16 MB
1.16 MB WEBM
3 days it was queued up... luma...
>>102077544
like grunge, or anything else that was played on the radio in the 90s. i recorded myself singing smells like teen spirit but i did such a 'gj' making it horrible it is so horrible i will refrain from posting the 'roo
>>
I've set up my ComfyUI workflow for testing out my flux loras but I want to run many different combinations of guidance, lora strength, different prompts etc. Is there some way to write out a config file that will then automatically perform the runs for each combination of variables?
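One possible approach, sketched under assumptions: export the workflow in API format, expand a small config into every combination with `itertools.product`, patch the values into a copy of the workflow, and POST each copy to the local server's `/prompt` endpoint. The node ids ("6", "12", "26") and the mapping below are hypothetical and have to be read from your own exported JSON; the server address assumes a default local install.

```python
import copy
import itertools
import json
import urllib.request

def expand_grid(config):
    """Turn {"name": [values, ...]} into a list of one dict per combination."""
    keys = list(config)
    return [dict(zip(keys, values))
            for values in itertools.product(*(config[k] for k in keys))]

def patch_workflow(workflow, run, mapping):
    """Return a copy of workflow with each run value written to its node input.

    mapping maps a parameter name to (node_id, input_name).
    """
    patched = copy.deepcopy(workflow)
    for name, value in run.items():
        node_id, input_name = mapping[name]
        patched[node_id]["inputs"][input_name] = value
    return patched

def queue_prompt(workflow, server="http://127.0.0.1:8188"):
    """Submit one API-format workflow to a running ComfyUI server."""
    data = json.dumps({"prompt": workflow}).encode("utf-8")
    request = urllib.request.Request(
        f"{server}/prompt", data=data,
        headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())

# Hypothetical sweep config and node mapping; adjust to your own workflow.
SWEEP = {
    "guidance": [2.5, 3.5],
    "lora_strength": [0.6, 0.8, 1.0],
    "prompt": ["a portrait photo", "a landscape painting"],
}
MAPPING = {
    "guidance": ("26", "guidance"),
    "lora_strength": ("12", "strength_model"),
    "prompt": ("6", "text"),
}
```

Usage would look like `for run in expand_grid(SWEEP): queue_prompt(patch_workflow(workflow, run, MAPPING))`, where `workflow` is the dict loaded from the exported JSON. Keeping the seed fixed across runs makes the resulting grid easier to compare.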
>>
>>102077750
I think you should first familiarize yourself with how to gen static images locally and once you know the basics and are kind of comfortable with your setups, then proceed to video.
>>
File: tmpvsrkwdiw.png (1.25 MB, 896x1152)
1.25 MB
1.25 MB PNG
>>
File: delux_hh_00039_.png (2.26 MB, 1024x1344)
2.26 MB
2.26 MB PNG
>>
File: delux_sf_00017_.png (1.49 MB, 1536x968)
1.49 MB
1.49 MB PNG
>>
File: delux_fu_00029_.png (2.28 MB, 1536x968)
2.28 MB
2.28 MB PNG
>>
>>102078152
Thread schizo
>>
File: delux_ci_00095_.png (2.09 MB, 1536x968)
2.09 MB
2.09 MB PNG
>>
>>102077303
>Flux Fusion?
A merge of Schnell and Dev
>How long
Around 3 minutes per gen
But I'm pretty sure the gguf text encoder has more to do with how long it takes to get the gen process started.
>>
File: delux_bc_00020_.png (1.23 MB, 1536x968)
1.23 MB
1.23 MB PNG
>>
>>102076324
catbox?
>>
File: delux_sf_00018_.png (1.74 MB, 1536x968)
1.74 MB
1.74 MB PNG
>>
File: delux_hh_00041_.png (2.23 MB, 1024x1344)
2.23 MB
2.23 MB PNG
>>
File: delux_fu_00027_.png (2.14 MB, 1536x968)
2.14 MB
2.14 MB PNG
>>
File: delux_ci_00097_.png (1.99 MB, 1536x968)
1.99 MB
1.99 MB PNG
>>
>cracking open a cold one
It's genning time soon!
>>
File: delux_bc_00023_.png (1.42 MB, 1536x968)
1.42 MB
1.42 MB PNG
>>102078420
get to it already. I'm out here alone rn and tryn to make it to the news
>>
File: delux_sf_00019_.png (2 MB, 1536x968)
2 MB
2 MB PNG
>>
>>102078440
everyone genning smut now we've got loras out the ass
not me tho, I'm genning niche shit none of you care about
>>
File: 0.jpg (392 KB, 1024x1024)
392 KB
392 KB JPG
>>
>>102078440
>I'm out here alone rn
Have you ever thought about the "why"?
>>
File: delux_hh_00042_.png (2.23 MB, 1024x1344)
2.23 MB
2.23 MB PNG
>>102078465
based PG rated proompter

>>102078466
nice. moon-based mining station mass driver mixed with abstract. very cool

>>102078467
get new material
>>
File: delux_fu_00026_.png (2.39 MB, 1536x968)
2.39 MB
2.39 MB PNG
>>
File: 00100-2434399723.png (3.06 MB, 1152x1728)
3.06 MB
3.06 MB PNG
https://youtu.be/LFjqoYqp9Y4?si=EL8pePv_iet7zmXL
https://youtu.be/tdu487kq8xU?si=DlIksGhf144MILfi
https://youtu.be/Qm7wo2MbMhc?si=l-O4nQfGf0mR3tQG
https://youtu.be/jRuvS04rjOU?si=2yKZPIwP9Gt2cuzT
https://youtu.be/2NuTd0X3c64?si=kGZ-jKlWbMChlHvf
https://youtu.be/aFEoQTPpXjY?si=nFEj-RReVu2Ga6g2
https://youtu.be/aFEoQTPpXjY?si=yymae_5yjPT-5BiX
https://youtu.be/njBUJU0Ob3k?si=rGl1jsg8xvLjclac
https://youtu.be/ZJ7XQF-kia0?si=0vQjDOKBE8ovbAQP
>>102078467
why you are such a sad loser? who knows
>>
File: delux_ci_00098_.png (1.91 MB, 1536x968)
1.91 MB
1.91 MB PNG
>>
>>
File: 00084-2374536085.png (1.84 MB, 1024x1024)
1.84 MB
1.84 MB PNG
>>102078440
I need to experiment with genning character sheets. in theory, it should be possible to train loras on these.
>picrel not related
>>
File: file.png (16 KB, 543x93)
16 KB
16 KB PNG
over 200 downloads on flickr set
neat
>>
File: delux_bc_00025_.png (1.17 MB, 1536x968)
1.17 MB
1.17 MB PNG
>>102078580
flux does a better job with char sheet perspectives but a worse job with the concept art aesthetic
>>
>debo and pedo
>UNITE!
>>
>>102078596
>>102078596
>>102078596
>>
>>102077386
>>102077694
What exactly were your prompts
>>
>>102078440
Alright, I'm on it...
>>
File: 1715120481231804.jpg (45 KB, 896x512)
45 KB
45 KB JPG
>>
File: 1698668929821847.jpg (55 KB, 896x512)
55 KB
55 KB JPG
>>
File: 1705624862194490.jpg (53 KB, 896x512)
53 KB
53 KB JPG
>>
File: 1710525399085210.jpg (59 KB, 896x512)
59 KB
59 KB JPG
>>
File: 1700175128726558.jpg (39 KB, 896x512)
39 KB
39 KB JPG
>>
File: 1702010890657793.jpg (48 KB, 896x512)
48 KB
48 KB JPG
>>
File: 1698785619482242.jpg (61 KB, 896x512)
61 KB
61 KB JPG


