/g/ - Technology






File: 1699080569241649.png (3.45 MB, 1536x2560)
3.45 MB
3.45 MB PNG
Previous /sdg/ thread : >>101673399

>Beginner UI local install
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SD.Next: https://github.com/vladmandic/automatic
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>SD3 info & download
https://rentry.org/sdg-link#sd3

>Try online without registration
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://aitracker.art
https://openmodeldb.info

>Animation
https://rentry.org/AnimAnon
https://rentry.org/AnimAnon-AnimDiff
https://rentry.org/AnimAnon-Deforum

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Discord
6wUwtcJsr2

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: FDG_News_00003_.jpg (944 KB, 1344x768)
944 KB
944 KB JPG
>mfw Resource news

08/01/2024

>Stable Fast 3D: Rapid 3D Asset Generation From Single Images
https://stability.ai/news/introducing-stable-fast-3d

>Announcing Black Forest Labs
https://blackforestlabs.ai/announcing-black-forest-labs

>Flux: The Next Leap in Text-to-Image Models
https://blog.fal.ai/flux-the-largest-open-sourced-text2img-model-now-available-on-fal

>ComfyUI: Basic Flux Schnell and Dev model implementation
https://github.com/comfyanonymous/ComfyUI/commit/1589b5

>Kolors ipadapter FaceID Plus
https://github.com/Kwai-Kolors/Kolors/tree/master/ipadapter_FaceID

>The EU’s AI Act is now in force
https://techcrunch.com/2024/08/01/the-eus-ai-act-is-now-in-force

>Video game performers picket over AI protections
https://apnews.com/article/sagaftra-strike-video-games-ai-f3f18ad01c5b8f4d525a836aeb531447

>Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs
https://lalbj.github.io/projects/PAI

>Detecting, Explaining, and Mitigating Memorization in Diffusion Models
https://github.com/YuxinWenRick/diffusion_memorization

>Forgedit: Text Guided Image Editing via Learning and Forgetting
https://github.com/witcherofresearch/Forgedit/

>ControlMLLM: Training-Free Visual Prompt Learning for Multimodal LLMs
https://github.com/mrwu-mac/ControlMLLM

>Accelerating Image Super-Resolution Networks with Pixel-Level Classification
https://github.com/3587jjh/PCSR

>ComfyStereo: port of the stereoscopic script used in stable-diffusion-webui-depthmap-script
https://github.com/Dobidop/ComfyStereo

>MeGA: Hybrid Mesh-Gaussian Head Avatar for High-Fidelity Rendering and Head Editing
https://github.com/conallwang/MeGA

07/31/2024

>Bubble Prompter for Stable Diffusion WebUI
https://github.com/captainzero93/sd-webui-bubble-prompter

>Waifu Diffusion V public tests
https://huggingface.co/waifu-diffusion/wdv-tests

>ComfyUI_frontend v1.2.7
https://github.com/Comfy-Org/ComfyUI_frontend/releases/tag/v1.2.7
>>
>mfw Research news

08/01/2024

>Tora: Trajectory-oriented Diffusion Transformer for Video Generation
https://ali-videoai.github.io/tora_video/

>Dynamic Object Queries for Transformer-based Incremental Object Detection
https://arxiv.org/abs/2407.21687

>Expressive Whole-Body 3D Gaussian Avatar
https://mks0601.github.io/ExAvatar/

>MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
https://arxiv.org/abs/2407.21654

>Evaluating SAM2's Role in Camouflaged Object Detection: From SAM to SAM2
https://arxiv.org/abs/2407.21596

>A Simple Low-bit Quantization Framework for Video Snapshot Compressive Imaging
https://arxiv.org/abs/2407.21517

>Fine-gained Zero-shot Video Sampling
https://densechen.github.io/zss

>Generalized Tampered Scene Text Detection in the era of Generative AI
https://arxiv.org/abs/2407.21422

>Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model
https://arxiv.org/abs/2407.21408

>SmileyNet -- Towards the Prediction of the Lottery by Reading Tea Leaves with AI
https://arxiv.org/abs/2407.21385

>AI Safety in Practice: Enhancing Adversarial Robustness in Multimodal Image Captioning
https://arxiv.org/abs/2407.21174

>Optical Computing for Deep Neural Network Acceleration: Foundations, Recent Developments, and Emerging Directions
https://arxiv.org/abs/2407.21184

>Embedding Space Selection for Detecting Memorization and Fingerprinting in Generative Models
https://arxiv.org/abs/2407.21159

>Adding Multi-modal Controls to Whole-body Human Motion Generation
https://yxbian23.github.io/ControlMM/

>Direct Unlearning Optimization for Robust and Safe Text-to-Image Models
https://arxiv.org/abs/2407.21035

>Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion
https://arxiv.org/abs/2407.21032

>SuperVINS: A visual-inertial SLAM framework integrated deep learning features
https://arxiv.org/abs/2407.21348
>>
File: ComfyUI_01440_.png (1.06 MB, 1024x768)
1.06 MB
1.06 MB PNG
>>
imagine if SAI wasn't so incompetent lol
>>
File: 00146-TFT_12403066.png (1.08 MB, 768x1280)
1.08 MB
1.08 MB PNG
oh that's my cat, neat
>>
>>101676775
I have a pretty powerful imagination but thats a hard one
>>
so how do I use this new model, I am a brainlet who doesn't code.
Is there a comfy workflow I can download or something? I have the 23gb model downloaded
>>
File: ComfyUI_01451_.png (949 KB, 1024x704)
949 KB
949 KB PNG
>>101676825
https://comfyanonymous.github.io/ComfyUI_examples/flux/
>>
File: de_fl_00076_.jpg (596 KB, 1344x768)
596 KB
596 KB JPG
its kinda crazy that the 5090 is rumored to have only 28gb. the nextgen flagship gpu won't be able to fit current gen models

>>101676825
you can burn through your free gens on replicate if you wanna avoid local setup
https://replicate.com/black-forest-labs/flux-dev
>>
has anyone trained a lora yet for this black forest stuff? If I can't generate my waifu it is difficult
>>
File: 000000_15686_.png (2.73 MB, 1680x960)
2.73 MB
2.73 MB PNG
>>101676916
Dang. another new good model, I'm liking Kolors+ipadapter, can't get the FaceID to work yet in Cubiq/Matteo nodes.
>>
>>101676825
Sorry for being this annoying guy again, but if you hit Replicate ratelimits, or just wanna also try DALL-E together with Flux, I started a temp Telegram bot with DALL-E 3/Flux Schnell (fast)/Flux dev: @imgfun_bot

Just send /start and it'll show usage. I'll shut it down in a couple of days. You can get up to 4 concurrent images with any of the models.

I swear i'm not glowing, i just wanna help out people since I have some extra replicate access that can be used for all the gens.
>>
File: de_fl_00077_.jpg (623 KB, 1344x768)
623 KB
623 KB JPG
>>101676966
>Kolors+ipadapter
thats a cool setup. I've been wondering if people have been playing around with that combo yet. how do you feel about kolors in general? I think of all the new models, that one slipped under the radar the most
>>
File: ComfyUI_01458_.png (2.2 MB, 1728x1152)
2.2 MB
2.2 MB PNG
>>101676961
I'm not waiting a whole day of not using my computer for a bake. if inference is this close to the edge on a 4090 training must be hellish
>>
>>101677039
Agreed, I was using SD3M until I found Kolors and it is amazing. Very good at following prompts and quality is top notch. most LoRas (sdxl) work.
>>
File: de_fl_00079_.jpg (640 KB, 1344x768)
640 KB
640 KB JPG
>>101677187
>most LoRas (sdxl) work
oh really? thats very interesting. I wouldn't have assumed they were compatible
>>
>>101677229
you'll notice your cmd window throwing errors with some.
>>
File: sliders.png (16 KB, 238x528)
16 KB
16 KB PNG
>>101677229
>these all work without throwing errors.
>>
>>101677286
what is the actual output?
>>
>>101677286
https://sliders.baulab.info/weights/xl_sliders/
>>
>>101677308
What do you mean? It pretty much does what you see.
>>
File: 000000_15689_.png (2.41 MB, 2028x788)
2.41 MB
2.41 MB PNG
>>101677308
smiling.pt LoRa
-4 vs +4
>>
>>101677457
lol
>>
File: file.jpg (227 KB, 1024x1792)
227 KB
227 KB JPG
uhh
>fatal error C1128: number of sections exceeded object file format limit: compile with /bigobj
appropriate though
BIG OBJ
going to work on sequential weight loading and workspace stuff tomorrow probably. maybe some stuff to make writing IR easier too
also, 1B
>>
hi
>>
File: 00222-1714589075.jpg (1.44 MB, 2576x1944)
1.44 MB
1.44 MB JPG
>>
File: 000000_15700_.png (1.78 MB, 1440x1120)
1.78 MB
1.78 MB PNG
>>
File: 1697588549927882.png (706 KB, 1024x1024)
706 KB
706 KB PNG
>>
File: ComfyUI_temp_golca_00013_.jpg (2.84 MB, 2560x1440)
2.84 MB
2.84 MB JPG
old gen while I wait for the current one to finish
>>
File: file.jpg (507 KB, 1792x1024)
507 KB
507 KB JPG
big data gang
big data gang
big data gang
>>
>>101677876
Why do your gens look like dalle and have the same exact resolution as dalle?
>>
>>101677839
Why do you keep generating this Addams family-ass bitch?
>>
File: file.jpg (376 KB, 1792x1024)
376 KB
376 KB JPG
>>101677923
because im using dalle
>>
>>101677943
my waifu
>>
File: file.png (16 KB, 743x91)
16 KB
16 KB PNG
uh... replicate is suddenly requiring login, what happened?
>>
>>101677950
why not use the new model that shits on dalle?

>>101677991
they remembered they can sell analytics
>>
>>101677991
Obviously because they're getting a shitton of people trying the new Flux model.
>>
>>101678002
>>101678010
So I didn't get my IP flagged or something? Thank goodness. This happened exactly when I started to try to generate some spicy pics
>>
>>101678079
the good thing of this model is that it's local, you can run it on fp8 and the quality is still great, only needs 12-13gb of vram though
>>
>>101678079
You can't gen NSFW pics from their website now too, they enabled the safety checker on the model, so you can only gen spicy pics with the API.
>>
File: file.jpg (269 KB, 1024x1792)
269 KB
269 KB JPG
>>101678002
i have more important things to do than wait minutes for each image, like developing ait to make the new model run faster, also big data gang
>>
File: ComfyUI_temp_zjpls_00021_.png (1.29 MB, 1024x1024)
1.29 MB
1.29 MB PNG
>>101678187
comfy and I make random shit all the time while we dev, git gud kek
>>
File: file.jpg (242 KB, 1792x1024)
242 KB
242 KB JPG
>>101678202
i am too lol. don't be a dick
>>
>>101678187
>>101678224
just use flux on the api instead of dalle then? dalle sucks, flux is literally better
>>
>>101678234
why are you so upset about what model im using
>>
>>101678267
Check what this general is about again, it's not about dalle, it's about local models and SD/Flux in particular. There's a de3 thread on /g/, go there.
>>
File: out-0.png (845 KB, 1024x1024)
845 KB
845 KB PNG
>>101678224
I'm not, just odd that you can just do the same thing locally now or use the api like anon said. dalle is dead dude and openai can rot. we will probably get leaks faster when it dies
>>
File: file.jpg (275 KB, 1792x1024)
275 KB
275 KB JPG
>>101678278
not my problem
>>101678280
idk man you seem pretty bothered about it. whats the big deal. its all the same at the end of the day
>>
>>101678323
>not my problem
It is your problem though, as you must follow the rules of the general and the board you're on.
>>
>>101678280
>dalle is dead
real clown hours kek
>>
>>101678328
He's right though, flux is literally cheaper and is not extremely prompt filter cucked like dalle (even uncensored endpoints have it btw)
>>
>>101678325
is this coming from an off-topic avatar blogposter?
>>
>>101678346
No, I posted plenty of on-topic Flux gens in the thread.
>>
>>101678346
If you mean >>101678280, that's not me, that's some other dude. I don't agree with him about OpenAI dying - GPT-4o can do image gen but they just didn't release the feature, and they can be cooking dalle4 for all we know.
>>
>>101678344
it has very limited copyright and artist knowledge in comparison to dalle
>>
File: file.jpg (472 KB, 1024x1792)
472 KB
472 KB JPG
>>101678325
get a grip desu
>>101678344
dalle is free on bing. i really don't care
>>
>>101678323
>idk man you seem pretty bothered about it. whats the big deal
this is a local general

>>101678328
https://www.asiafinancial.com/openais-8-5-billion-bills-spark-bankruptcy-speculation
>>
File: de_fl_00080_.jpg (613 KB, 1344x768)
613 KB
613 KB JPG
>>101677864
this is peak ballet performance

>>101677876
>big data gang
>big data gang
>big data gang
https://suno.com/song/68597cfa-38c0-4fb7-a52e-7458bc67d9b8

>>101677991
too much CSAM. the story is always the same

>>101678234
post gens if you wanna be uptight about other people's gens. you're barking at other people for what they post without having anything to post of your own
>>
>>101678379
>post gens if you wanna be uptight about other people's gens
I posted multiple in this thread
>>
File: file.jpg (381 KB, 1024x1792)
381 KB
381 KB JPG
>>101678379
nice
i made this one earlier, i really like it
https://suno.com/song/26817194-cf0f-4d03-82a9-c81c863f3a1a
>>
File: 000000_15707_.png (2.5 MB, 1082x1581)
2.5 MB
2.5 MB PNG
>>
File: blue_alien.jpg (199 KB, 1536x514)
199 KB
199 KB JPG
>>
File: de_fl_00081_.jpg (825 KB, 1344x960)
825 KB
825 KB JPG
>>101678435
thats great. I love when suno hits that chillstep/ambient vibe. it does that dreamy atmosphere really well.
I put together something similar recently, maybe you'd dig it
https://suno.com/song/e1bba736-c831-4c69-b772-cb0ce54b1870

while you're out there mining data, can you mine me some extra suno coins? I'm so poor...
>>
File: ComfyUI_02133_.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>
anyone have success running Flux Dev on a 24 GB card?
mine keeps running out of memory and crashing. on linux + Comfy. Tried a bunch of different stuff
Should I try Schnell instead?
>>
>>101678627
can you make her younger?
>>
>>101678633
>Should I try Schnell instead?
Anon, it's the same size, it just gens faster (and worse quality)
>>
>>101678633
>You can set the weight_dtype in the “Load Diffusion Model” node to fp8 which will lower the memory usage by half but might reduce quality a tiny bit.
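Rough arithmetic behind the "lower the memory usage by half" part, as a minimal sketch. It assumes the model has roughly 12 billion parameters (an assumption, though it lines up with the ~23.8 GB file size mentioned in the thread) and counts only the weights, not activations, text encoders or the VAE. The function name is made up for illustration.

```python
def weight_memory_gb(num_params, bytes_per_param):
    """Approximate memory needed just to hold the weights, in GB (10^9 bytes)."""
    return num_params * bytes_per_param / 1e9


# Assumption: ~12 billion parameters for the diffusion model.
FLUX_PARAMS = 12e9

# 16-bit weights take 2 bytes each, fp8 weights take 1 byte each.
for name, nbytes in (("fp16/bf16", 2), ("fp8", 1)):
    print(f"{name}: ~{weight_memory_gb(FLUX_PARAMS, nbytes):.0f} GB of weights")
```

So on a 24 GB card the 16-bit weights alone leave basically no headroom, while fp8 leaves about half the card free for everything else. Actual usage will be higher than these numbers.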
>>
File: ComfyUI_02135_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>
>>101678649
Yeah, did that. I get a memory allocation error. It's trying to allocate vram for the whole unet file, so the full 23.8 GB. I think it's a bug.
>>
>>101678656
make her even younger
>>
>>101678656
can you forcefully rape and impregnate her?
>>
File: ComfyUI_02140_.png (1.09 MB, 896x1152)
1.09 MB
1.09 MB PNG
>>
Happy national gf day!
>>
>>101678731
can you make her a loli
>>
File: file.jpg (404 KB, 1792x1024)
404 KB
404 KB JPG
there, pushed flux
>>101678585
ive noticed the lyrics are better when you describe a song instead of generating in custom mode
will be cool to try with my lyrics llm. waiting until i have a BIG amount of lyrics scraped. i know it will upset some posters if i use the gpt2 based training code from the last one, not my problem tho and ngl its funny how mad they get
>>
File: 1698013025329888.png (590 KB, 1024x1024)
590 KB
590 KB PNG
>>
>>101678367
Maybe this can be added back in
>>
File: 1713071050618101.png (680 KB, 1024x1024)
680 KB
680 KB PNG
>>
>>101678375
I'm sure Microsoft will just let them go like that.
>>
>>101678323
Sorry man, nothing against your waifus but once you have been politely told you are offtopic you should oblige. Creating Dalle and creating SD-based generations are a different set of skills, which is why we have a dall-e general down the aisle as well.
>>
File: 1711817332395549.png (960 KB, 1024x1024)
960 KB
960 KB PNG
>>
>>101678899
Friendlier windows times...
>>
File: 00264-2611210882.jpg (430 KB, 1000x1328)
430 KB
430 KB JPG
modelism is worse than racism
good night
>>
>>101678887
>they just buy black forest instead
>>
>>
File: 1701016760746895.png (1.17 MB, 1024x1024)
1.17 MB
1.17 MB PNG
>>
File: 1693433733732863.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>
>>101678877
>making loras of a 24gb model
there is enough vramlet settings cope as is with XL/pony loras. can't imagine what kind of coping there will be for a model this big.
>>
>>101678888
>told you are offtopic you should oblige
the avatars never do, and diary entries are as off-topic as it gets
>>
>>101678976
People whined at any model size anyway, even back from nai leak days.
>>
File: tmp668lnl0q.png (802 KB, 768x1024)
802 KB
802 KB PNG
I don't even know how this happened

https://litter.catbox.moe/nx8jpg.png
>>
>>101678679
I got it to work by running with --gpu-only.
It was overloading my system ram. I should buy more LOL
>>
File: de_fl_00082_.jpg (916 KB, 1344x960)
916 KB
916 KB JPG
>>101678787
>ive noticed the lyrics are better when you describe a song instead of generating in custom mode
I think the custom lyrics are just a gpt dip. I notice a ton of the same lyrical tropes between the two
>neon nights
>silent realms
>digital skies
>unraveling tales
etc etc
but maybe suno asks for a more specific structure so there's a better rhythmic meter to their auto-lyrics which makes it fit to music better

>>101678862
>>101678878
>>101678899
these are trippy. perfect UI + content if I were stroking out. wait, I'm not stroking out, am I?

>>101678912
says the guy who will never try new models
gn

>>101678913
not even a crazy idea. why out-compete your competitors when you can just buy them out and shutter them instead. its usually cheaper too
>>
>>101678998
I think the avatarfags and the waifu fag who literally like an ape just replied "my waifu" to a question from me are also AIDS. But it doesn't help the quality if more noise is added. If you post dalle here, you will never engage with a post asking about a ComfyUI workflow or what some good Webui settings are. You see how Dalle fags posting here shit it up, then.
So kindly, just piss off.
>>
File: file.jpg (356 KB, 1792x1024)
356 KB
356 KB JPG
12T models/ about half of them
720m + 274m + 4m + 7m = 1005M!!
i sleep
>>101678888
i don't care. stay mad
>>101679055
yeah the custom lyrics follow silly patterns
>>101679072
i've contributed more than you ever will desu
>>
>>101679072
that anon is talking about datasets, meanwhile blogposters are wasting space or spamming/flooding the thread.
keep moving goalposts, it only makes it look worse for you.
>>
File: de_fl_00084_.jpg (790 KB, 1344x960)
790 KB
790 KB JPG
>>101679072
why are there so many shitlord nogens trying to gatekeep today? no one wants you faux-policing what people do and don't do here. post your gens, post your workflows, post your insights. if you feel compelled to tell other people how to behave or what to post, just don't instead
>>
so far, i feel lux needs negatives, i think that would mitigate the quality inconsistency
>>
>>101679181
>says this
>posts a shit gen
>>
>>101679181
>policing
>>101678379
>post gens if you wanna be uptight about other people's gens

Look inward debotard
>>
This model feels like it was trained with sdxl juggernaut, dalle and mj images
>>
>>
File: de_fl_00085_.jpg (957 KB, 1344x960)
957 KB
957 KB JPG
>>101679184
someone was saying the dev model can accept negative prompts. I haven't tried it myself

>>101679186
I'm running low on flux gens and you weren't worth a good one

>>101679201
I'm not policing peoples gens or what they want to contribute to this space. I'm policing shitlord nogens who offer nothing yet feel justified in barking at posters. if you don't wanna put up your own content, stop shitting on others. and even if you wanna put up your own content, there's plenty of space available for other people to do what they want too.
>>
File: tmpt2kld5nl.png (749 KB, 768x1024)
749 KB
749 KB PNG
>>
>>
>>101679256
who is ipvau
>>
is that gook \ 1.5 AI face baked in or are you just chaining models together
>>
>>101679280
>gook
Please don't be racist.
>>
File: tmp744el0js.png (1.06 MB, 768x1024)
1.06 MB
1.06 MB PNG
>>
I have a 3080 and a 4080, can I assign part of the generation to one, then the rest to the other?
>>
>>101679286
we've had the same ai face for the past 2 years, I'm tired of seeing it desu
>>
File: 1713516112473959.jpg (249 KB, 1344x768)
249 KB
249 KB JPG
>>
wen will replicate let us generate pics of up 2 mp :((
>>
nb4 mass reply
>>
File: ComfyUI_temp_ipvau_00022_.png (3.07 MB, 1120x1440)
3.07 MB
3.07 MB PNG
>>101679324
thats because of inbred models, most finetuners train on gens since gens dont have/need licenses
>>
File: 1715656484718703.png (78 KB, 221x257)
78 KB
78 KB PNG
>>101679366
where nipel?
>>
File: ComfyUI_10077_.png (991 KB, 896x1152)
991 KB
991 KB PNG
>>
MEDS
>>
>>101679366
grim, habsburg mode collapse soon
>>
File: tmpbqulcyol.png (758 KB, 768x1024)
758 KB
758 KB PNG
>>
>>101679235
>someone was saying the dev model can accept negative prompts. I haven't tried it myself
both dev and schnell can't accept negative prompts because to use neg you need to have a cfg > 1, and when you do that it just hurts the output, that's a shame
>>
File: cfgguider.png (15 KB, 447x285)
15 KB
15 KB PNG
>>101679475
you can use this node and play with the values, and yeah there are no negatives
>>
>no negative prompt
DoA
>>
File: ComfyUI_01491_.png (989 KB, 1024x1024)
989 KB
989 KB PNG
>>101679582
>>101679594
you can use the regular ksampler to do it just keep in mind what >>101679475 is saying
>>
>>101679582
>>101679594
if Scheduled CFGGuider doesn't hurt the model at decent CFG, then we'll be able to use negatives
>>
File: 1711972357555914.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>
File: ComfyUI_09901_.png (567 KB, 1024x1024)
567 KB
567 KB PNG
>>101679614
for example in this image, i was wondering why it was blurry and it was because the cfg made it closer to the negative prompt than the positive prompt, after testing it, imho for now its like using a double positive(?) which is the same as using t5 and clip l i think
>>
at least the reactor users will have a blast
>>
>>101679679
alien feet
>>
>>101679679
>perfect hands
>horrible feet
:(
>>
File: ComfyUI_02149_.png (1.08 MB, 1152x896)
1.08 MB
1.08 MB PNG
>>
>>101679669
show me your nodes anon, maybe you did something wrong
>>
File: ComfyUI_00139_.jpg (454 KB, 2224x1431)
454 KB
454 KB JPG
>>101679669
Negative prompt works, just be careful to not fry your model with a too big cfg, I used this metadata >>101679222
>>
>>101679794
>just be careful to not fry your model with a too big cfg, I used this metadata
I've heard of another CFG that is less prone to fry outputs, CFG++ no? can it be used on ComfyUI?
>>
>>101679794
>>101679869
can you use pag or dynamic thresholding with it?
>>
File: luxnegs.png (691 KB, 1179x867)
691 KB
691 KB PNG
>>101679794
thanks bro, but it didnt work for me, I believe comfy, there are no negatives...for now, if you use >>10167958 and play with the ranges you can see which prompt gets picked up or even see if they blend, but i think is just the same as cliptextencodeflux
>>
>>101679912
what cfg did you use? it must be over 1 to work, for me it worked at cfg = 1.5
>>
File: luxcliptext.png (443 KB, 1303x815)
443 KB
443 KB PNG
>>101679937
i just changed the prompt in the workflow that >>101679794 posted

this is a dumb example but you can see how it kinda works, from_cfg = 0 means it ignored the upper text prompt and to_cfg = 0.5 it picked up the text prompt from below
>>
File: luxcliptext2.png (515 KB, 1225x751)
515 KB
515 KB PNG
>>101680006
both same value
>>
>>101680006
>>101680023
why such a low cfg? I'm not sure if a cfg < 1 makes the negative prompt work
>>
File: luxcliptext3.png (866 KB, 1597x1039)
866 KB
866 KB PNG
>>101680023
upper text prompt value > lower text prompt value

so yeah, it works like a double positive for me and yeah there is no negative because of maybe a negative cfg value doesnt work? i dont know lol ahaha
>>
>>101680072
maybe when cfg < 1, the negative prompt acts like a positive prompt, try with a cfg > 1
>>
>>101680037
>why such a low cfg?

The basic update rule for a diffusion process can be represented as:
$$x_{t+1} = x_t + f(x_t, t)\,\Delta t + g(x_t, t)\,\Delta W_t$$

With classifier-free guidance, the update incorporates a scaling factor $s$ (the CFG scale):
$$x_{t+1} = x_t + \bigl(s\,f(x_t, t) + (1 - s)\,f_{\text{uncond}}(x_t, t)\bigr)\,\Delta t + g(x_t, t)\,\Delta W_t$$

Differentiating with respect to the CFG scale $s$ gives:
$$\frac{dx_{t+1}}{ds} = \bigl(f(x_t, t) - f_{\text{uncond}}(x_t, t)\bigr)\,\Delta t$$

From there, the cfg scale should make more sense.
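
A toy sketch of the guidance term, assuming the standard classifier-free guidance combination (uncond + scale * (cond - uncond)), with the negative prompt standing in for the unconditional branch. The function names and numbers are made up for illustration, and this is not how any particular ComfyUI node is implemented.

```python
def cfg_combine(cond, uncond, scale):
    """Blend conditional and unconditional (negative-prompt) predictions.

    Standard CFG: uncond + scale * (cond - uncond),
    which is the same as scale * cond + (1 - scale) * uncond.
    """
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


def cfg_weights(scale):
    """Effective weights on (cond, uncond) for a given scale."""
    return scale, 1.0 - scale


# Made-up toy predictions, not real model outputs.
cond = [1.0, 2.0]    # stands in for the positive-prompt prediction
uncond = [0.0, 4.0]  # stands in for the negative-prompt prediction

for s in (0.5, 1.0, 1.5):
    print(s, cfg_weights(s), cfg_combine(cond, uncond, s))
```

Under this formula the weight on the negative branch is 1 - scale. At scale 1 it is zero, so the negative prompt has no effect. Between 0 and 1 both weights are positive, so the two prompts just blend. Only above 1 does the negative weight go below zero and push the result away from the negative prompt.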
>>
File: ComfyUI_temp_gjjlz_00001_.jpg (697 KB, 2304x1792)
697 KB
697 KB JPG
>>
File: luxcliptext5.png (1.19 MB, 2021x1641)
1.19 MB
1.19 MB PNG
>>101680037
>>101680088
heres what i want to illustrate:
from_cfg = "positive" prompt value
to_cfg = "negative" prompt value

but in reality there is no negative, only two positive prompts that blend together depending of the values of the from_cfg and to_cfg ,
>>
File: ComfyUI_01499_.png (1 MB, 1024x1024)
1 MB
1 MB PNG
>>101680088
that's how it works on every model
>>
Purp
>>
File: aatu ruskettunu.png (660 KB, 826x826)
660 KB
660 KB PNG
>>
>>101680229
so cfg < 1 really makes the negative prompt act like a positive prompt?
>>
>>101680250
It flips the cond to the opposite effect. positive becomes negative etc.
>>
>>101680278
see, you got your answer, stop using cfg < 1 it messes up the negative prompt effect >>101680205
>>
File: tmpm7y4vo67.png (964 KB, 768x1024)
964 KB
964 KB PNG
>>
File: ComfyUI_temp_daapt_00085_.png (1.52 MB, 1024x1024)
1.52 MB
1.52 MB PNG
It's Schizo Time. Also 50 credits is never enough to finish songs in a day ; ;
https://suno.com/song/b263ee92-d8c6-4674-aec3-b698af03d0e9
>>
>>101680298
nah, there is no negative prompt period, if there is post some proof then
>>
IMGGEN WILL NEVER BE THE SAME AGAINNNNN
AYYEYYEEEEEEEEEEEEEEEEEEEEEEEEEE
>>
File: de_fl_00087_.jpg (950 KB, 1344x960)
950 KB
950 KB JPG
>>101680317
>50 credits is never enough
I feel that

>>101680343
imggen is totally different until we run out of replicate credits, then its back to the same old
>>
Does anyone know of a segmentation (seg) control net model that works well with sdxl?
>>
>>101680321
>if there is post some proof then
what's this then? >>101679794
>>
>>101680317
How do you keep turning shitposts into club bangers?
>>
File: ComfyUI_02160_.png (1015 KB, 896x1152)
1015 KB
1015 KB PNG
>>
>>101680565
why aren't her eyes red
>>
File: FLUX_00476_.png (1.15 MB, 768x1024)
1.15 MB
1.15 MB PNG
>>
File: BMP_05057_.png (2.35 MB, 1328x1328)
2.35 MB
2.35 MB PNG
>>101680498
I dunno lol, I guess having 5 accounts to fuck with things helps
https://suno.com/song/4a714800-2096-429e-93fa-e7470ede058c
>>
File: 57636.png (2.6 MB, 1440x1440)
2.6 MB
2.6 MB PNG
abloo bloo bloo
>>
File: FLUX_00523_.png (820 KB, 768x1024)
820 KB
820 KB PNG
>>
File: FLUX_00526_.png (815 KB, 768x1024)
815 KB
815 KB PNG
>>
>>
File: de_fl_00088_.jpg (705 KB, 1344x960)
705 KB
705 KB JPG
>>101680608
have you tried out udio1.5 yet? you can do 2min+ gens with it now
have you tried flux either? its the new hotness
https://suno.com/song/2eb1d12b-2241-4f0c-9c01-46b6d85eda2e
>>
File: FLUX_00571_.png (858 KB, 768x1024)
858 KB
858 KB PNG
>>
>>101680861
it's a shame it knows so few characters/artists, i blame these llms being used to tag everything as generically as possible
>>
>>101680883
dalle3 knows a shitton despite being trained on 95% synthetic prompts generated with an older worse version of gpt4 with vision
>>
>>101680907
>95%
source?
>>
File: FLUX_00611_.png (878 KB, 768x1024)
878 KB
878 KB PNG
it's amazing how good it is with text.
SD3 could never
>>
>>101680883
Is it possible to add the artists/characters in a finetune?
>>
File: 1722570895361240.png (96 KB, 1536x743)
96 KB
96 KB PNG
>>101681006
unlikely
>>
>>101680987
https://cdn.openai.com/papers/dall-e-3.pdf
>To test our synthetic captions at scale, we train DALL-E 3, a new state of the art text to image generator. To train this model, we use a mixture of 95% synthetic captions and 5% ground truth captions. The model itself is a scaled-up version of the model we used in the above ablations, with several other improvements.
>>
>>101681018
Im not reading all that
>>
>>101681030
TLDR: any hopes of being able to make loras on this thing rely entirely on jewvidia allowing us to have more vram in this next gen.
>>
>>101681047
>in this next gen.
the 50 series caps out at 28gb. they couldn't change that if they wanted to at this point
>>
File: FLUX_00641_.png (1.04 MB, 768x1024)
1.04 MB
1.04 MB PNG
>>
File: PW_78814_.jpg (588 KB, 2048x1536)
588 KB
588 KB JPG
Good evening, anons! I hope everyone is doing well :]
>>
is this company planning on releasing an 8B/6B model? they could wipe stability from the map forever
>>
>>101681083
the fuck are you getting on so late for purple bitch?
>>
File: PW_78819_.jpg (569 KB, 2048x1536)
569 KB
569 KB JPG
>>101681092
LOL just got home so I figured why not hahaha!
>>
>>101681083
>>101681119
I want to do unforgivable things to your purple witch
>>
File: PW_78087_.jpg (339 KB, 1536x2048)
339 KB
339 KB JPG
>>101681165
LMAO.
Uhhh I dunno how to respond to that, anon
>>
>>101681119
get in on the current hype train, new checkpoint with day 1 support from comfy

https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
>>
>>101681193
oops, checkpoint page also useful too...

https://huggingface.co/black-forest-labs/FLUX.1-schnell
>>
>>101681018
>>101681047
I see the screenshot as "it's possible but requires a shitton more of VRAM so it'll be costly and we need funding", not "impossible".
>>
>>101681208
when was "impossible" mentioned?
>>
File: PW_78843_.jpg (319 KB, 2048x1536)
319 KB
319 KB JPG
>>101681193
Ohhh fun! I love new things!
Thanks, anon! :D
I'll dl it now!
>>
File: de_fl_00089_.jpg (935 KB, 1344x960)
935 KB
935 KB JPG
>>101681063
my dick is out

>>101681083
hi pw. there's a new hotness model out that has captivated everyone. highly recommend playing with it
>>
File: PW_78862_.jpg (648 KB, 2048x1536)
648 KB
648 KB JPG
>>101681260
I just heard! Uhm, do I just DL the flux1-schnell.sft file and put it in my checkpoints folder?
>>
>>101681302
comfy has the usual instructions for where to put stuff here...

https://comfyanonymous.github.io/ComfyUI_examples/flux/
>>
File: de_fl_00090_.jpg (893 KB, 1344x960)
893 KB
893 KB JPG
>>101681302
comfy posted a writeup on it
https://comfyanonymous.github.io/ComfyUI_examples/flux/
>>
File: PW_78826_.jpg (555 KB, 2048x1536)
555 KB
555 KB JPG
>>101681323
>>101681326
Thank you both so much!! I'm gonna get it now! :D
>>
File: 1702765122869821.png (1.16 MB, 896x1152)
1.16 MB
1.16 MB PNG
>>101680321
this is the wolf running prompt I posted last thread with red jersey in neg

neg works fine in dev
>>
File: PW_78871_.png (1.27 MB, 1024x1024)
1.27 MB
1.27 MB PNG
I like the way this came out!! I'm gonna try a more simple proompt next!
>>
why can't we quantize flux like the lmg chads do to run their hundred billion+ parameter bullshit
>>
>>101681654
because images have way more information density than text
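
For context, the kind of weight quantization the question refers to can be sketched as plain absmax rounding. This is a toy illustration with made-up weights and hypothetical helper names (`quantize_absmax`, `dequantize`); it is not the scheme any particular LLM or Flux quant format uses, and it says nothing by itself about how much image quality survives.

```python
def quantize_absmax(weights, bits=8):
    """Map floats to signed integers using a single scale for the whole list."""
    qmax = 2 ** (bits - 1) - 1                      # 127 for 8 bits, 7 for 4 bits
    scale = max(abs(w) for w in weights) / qmax or 1.0  # fall back to 1.0 if all zeros
    q = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate floats from the stored integers."""
    return [v * scale for v in q]


# Made-up example weights, purely for illustration.
weights = [0.12, -0.5, 0.33, 0.9, -0.07]

q8, s8 = quantize_absmax(weights, bits=8)
q4, s4 = quantize_absmax(weights, bits=4)

# Worst-case reconstruction error at each bit width.
err8 = max(abs(a - b) for a, b in zip(weights, dequantize(q8, s8)))
err4 = max(abs(a - b) for a, b in zip(weights, dequantize(q4, s4)))
print(f"8-bit max error: {err8:.5f}, 4-bit max error: {err4:.5f}")
```

Fewer bits means a coarser grid and a larger rounding error per weight; whether a given model tolerates that is an empirical question.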
>>
File: ._00001_.png (2.89 MB, 1920x1088)
ola
>>
File: PW_78872_.png (1019 KB, 1024x1024)
>>101681745
Hello, anon :]
>>
File: gjo_00005_.png (3.09 MB, 1920x1088)
amazing you can actually gen 1920x1088 on flux with a 4090 if you set both model and text encoder to fp8

>>101681757
gyo morning!
>>
File: download.jpg (336 KB, 1536x1536)
>>
File: PW_78874_.png (1 MB, 1024x1024)
Oh nice! I just reset my pc cause it was going kinda slow haha
It's been on for hella days
>>
File: collage.jpg (123 KB, 800x400)
Stylization in flux is subpar, but you can use img2img to add style. It's not perfect, e.g. it can easily genderbend characters. Left: styled with SDXL. Right: the original from Flux.
>>
File: PW_78873_.png (1.01 MB, 1024x1024)
>>101681797
>>101681832
Oops forgot to reply hahaha!
>>
>>101681837
ya true, I think it's wonderful tech, but a subpar data set.. I want finetunes
>>
File: FLUX_00663_.png (1.09 MB, 768x1024)
>>
File: FLUX_00664_.png (1.04 MB, 768x1024)
I'm gonna cum
>>
>>101681860
yea I agree that guitar is fucking sexy
>>
File: FLUX_00674_.png (1001 KB, 768x1024)
>>
How do I stop masturbating /sdg/
>>
>>101681912
stop looking at sexy men/women/ponies (whatever you are into)
>>
File: PW_78876_.png (1.2 MB, 1024x1024)
>>
File: FLUX_00704_.png (1.02 MB, 768x1024)
>>101681912
Condition yourself to not want to masturbate.
Try this: after every gooning session, start hitting yourself in the balls as hard as you can. Soon, you will develop a psychological reaction and you start associating masturbation with pain. Or you will develop a taste for cock and balls torture.
>>
File: 1704548533951831.png (909 KB, 749x753)
Give me a video inpainting model so I can impregnate women
>>
File: FLUX_00720_.png (1.05 MB, 1024x768)
>>
>wonder why 6 minutes per image feels so much worse than 2 minutes
>look at my thrice as large folder of candidate images that just need blowing up
SDXL is a waste of time for anime and normie usage in general. You need to be an actual graphic designer for it to be worthwhile
>>
this is reasonably fast with the schnell model
>>
File: PW_78891_.png (1.15 MB, 1024x1024)
>>
File: FLUX_00744_.png (889 KB, 1024x768)
>>
File: FLUX_00750_.png (1.12 MB, 1024x768)
>>
>>101682009
ya I am getting 15s gens of this quality in full hd with schnell
>>
File: FLUX_00777_.png (920 KB, 1024x768)
>>
File: PW_78904_.png (1.19 MB, 1024x1024)
>>
File: FLUX_00793_.png (688 KB, 1024x768)
>>
>>101682047
around 50s for this, 20 steps, full model. still ok I'd say.
>>101681912
you don't. keep going until your dick falls off. lesson learned, boy. mwhaha
>>101681926
morning!
>>
>>101682082
>Mr. Einstein we don't have an appointment!
>>
File: FLUX_00809_.png (1.09 MB, 768x1024)
>>
File: FLUX_00819_.png (977 KB, 768x1024)
>>
For those trying to make styles work on flux-dev, try the karras scheduler, somehow it just makes it work
>>101681711
>>101682084
>>
>>101682196
nice, thanks
this is sgm uniform, 20 steps
>>
>>101682212
for karras you need way more steps or else it won't work, 40-50 seems to do the trick
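
For anyone wondering what the karras schedule actually changes, here is a minimal sketch of the noise-level spacing from Karras et al. (2022). The `sigma_min` and `sigma_max` defaults are placeholder values chosen for illustration, not what ComfyUI uses for Flux, and the function name `karras_sigmas` is just for this example.

```python
def karras_sigmas(n, sigma_min=0.03, sigma_max=14.6, rho=7.0):
    """Return n noise levels from sigma_max down to sigma_min.

    Levels are spaced uniformly in sigma**(1/rho), then raised back to rho.
    sigma_min / sigma_max are illustrative placeholders (assumption).
    Requires n >= 2.
    """
    min_inv = sigma_min ** (1 / rho)
    max_inv = sigma_max ** (1 / rho)
    return [
        (max_inv + i / (n - 1) * (min_inv - max_inv)) ** rho
        for i in range(n)
    ]


sigmas = karras_sigmas(40)
# Compare the size of the first step (high noise) with the last step (low noise).
print(f"first step: {sigmas[0] - sigmas[1]:.4f}, last step: {sigmas[-2] - sigmas[-1]:.6f}")
```

With rho=7 the steps at high noise are large and the steps near low noise are small, so a low step count leaves only a few coarse jumps at the start. That may be part of why it wants more steps here, though that is a guess rather than something verified on Flux.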
>>
File: PW_78931_.png (1.26 MB, 1024x1024)
It does duos really well!!
>>
>>101682214
ahh I see above 40 steps it seems to produce something coherent. 77 secs for 40 steps / 83s for 50 on my machine, "yikes"
here, 50 steps euler karras
>>
same prompt, 20 steps sgm uniform, that's a remarkable change in.. everything. hm
>>
>>101682285
>>101682249
yeah it really makes things really different compared to the other schedulers
>>
File: PW_78959_.png (904 KB, 1024x1024)
>>
i don't need it...
>>
File: goo_00040_.png (1014 KB, 1024x1024)
>>101682334
ya text recognition is the bomb with FLUX
>>
File: ComfyUI_jyhg_00001_.jpg (3.5 MB, 2560x1440)
>>
File: PW_78977_.png (1.3 MB, 1024x1024)
>>101682346
It really is!!! I'm really impressed with it!
LOL that's amazing!
>>
>>101682378
good day. made the sign I posted earlier extra for you lol !

this is already pretty sloppy, oops
>>
File: FLUX_00916_.png (746 KB, 1024x768)
>>
can someone post me a basic flux comfy workflow catbox please? i'm downloading the model and updating comfy rn
>>
>>101682467
https://comfyanonymous.github.io/ComfyUI_examples/flux/
>>
File: FLUX_00939_.png (1.23 MB, 1024x768)
>>
>>101682480
oh nice durr, thanks

how much vram to realistically run all this shit? does comfy support splitting across gpus? the text encoder is huge right?
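
A rough answer to the VRAM question, as weights-only arithmetic. The parameter counts are assumptions (about 12 billion for the Flux transformer, about 4.7 billion for the T5-XXL text encoder), and the helper name `weight_gib` is made up for this sketch. Activations, the VAE, CLIP and framework overhead are ignored, so real usage will be higher.

```python
def weight_gib(params_billion, bytes_per_param):
    """Memory needed just to hold the weights, in GiB."""
    return params_billion * 1e9 * bytes_per_param / 2**30


FLUX_PARAMS_B = 12.0  # assumption: approximate size of the Flux transformer
T5_PARAMS_B = 4.7     # assumption: approximate size of the T5-XXL encoder

for name, bytes_per_param in (("fp16", 2), ("fp8", 1)):
    flux = weight_gib(FLUX_PARAMS_B, bytes_per_param)
    t5 = weight_gib(T5_PARAMS_B, bytes_per_param)
    print(f"{name}: flux ~{flux:.1f} GiB + t5 ~{t5:.1f} GiB = ~{flux + t5:.1f} GiB")
```

Halving the bytes per parameter halves the weight footprint, which is consistent with the earlier post about fitting 1920x1088 on a 24 GB card only after setting both the model and the text encoder to fp8.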
>>
>>101682249
what's your gpu?
>>
File: PW_79016_.png (1.37 MB, 1024x1280)
>>
>>101682545
3090 @ 80% power.
>>
File: ComfyUI_jyhg_00003_.jpg (3.28 MB, 2560x1440)
>>101682415
I want to upgrade.
But I'm poor and the GTX1060 is the best gpu I have, both in my laptop and my main rig (which has 2).
>>
>>101682597
yeah you mentioned it. ugh I got a 2080 collecting dust here, I'd give it to you.
>>
>>101682662
lol my gf would hate this so much, she's an artist who draws like 40% birds, like she's better than this for sure

i guess we still care about humans playing chess so it'll probably work itself out

are you using the full size t5? the fuck gpu do you have?
>>
File: PW_79054_.png (1.64 MB, 1024x1280)
>>
>>101682727
oops one of the lines in my comment got deleted, i meant to say she's better than this for sure but like it's only a matter of time, what a weird time to be alive
>>
>>101682727
>guess we still care about humans playing chess so it'll probably work itself out
markets don't work that way sadly
better comparison would be ikea mass-produced furniture vs manually crafted
>>
File: FLUX_00002_.png (919 KB, 768x1024)
>>101682763
i mean same shit, like she's above the cutoff and good enough at marketing that it doesn't matter, in your analogy ikea does not impact her ability to sell $5000 epoxy river tables

like people think "oh nooooo AI art is going to collapse the art market" as if the art market is only based on supply and demand as a function of the quality of art, i mean *some of it is* but i think there's a lot that's based on like, money laundering, status seeking and effective marketing and is kinda just vibes based and that stuff is somewhat insulated from AI art

flux seems good, but this shit took 2 minutes T__T
>>
File: FLUX_01020_.png (708 KB, 1024x768)
>>
>>101682812
AI fucks up the entry and mid level, the best specialists are more likely to benefit from all this or remain unaffected
with shit like gacha splash arts, stock pics, porn arts etc etc I don't think most consumers give a fuck about "vibes", they want their product quick and passable, and that affects significant amount of artfags who drew slop for a living.
>flux seems good, but this shit took 2 minutes T__T
it's magic we even got this at all, pretty much dalle level comprehension (with some caveats but still)
>>
File: FLUX_00004_.png (766 KB, 768x1024)
i'm not even mad, i like that there's diversity, easy to prompt out and makes me feel like the model is creative? 2 minute gen times are killing me tho
>>
>>101682592
>>101682592
>>101682592
>>
File: FLUX_01040_.png (964 KB, 768x1024)
>>
File: ComfyUI_hgsh_00003_.jpg (2.44 MB, 2560x1440)
>>101682662
>got a 2080 collecting dust here, I'd give it to you.

If only...
It's ok though, kinda like working with limitations. I end up being more efficient.
>>
File: file.jpg (206 KB, 1536x1536)
>>
File: PW_79055_.png (1.75 MB, 1024x1280)
>>
File: PW_79063_.png (1.36 MB, 1024x1280)
>>
File: PW_79062_.png (1.46 MB, 1024x1280)
>>
>>101681222
This is a remark not a rebuttal.
>>
>>101682949
CUTE!!!




All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.