/g/ - Technology


Thread archived.
You cannot reply anymore.




File: 1702992784153215.jpg (2.74 MB, 2560x1440)
Previous /sdg/ thread : >>101700396

>Beginner UI local install
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SD.Next: https://github.com/vladmandic/automatic
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>Try online without registration
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://aitracker.art
https://openmodeldb.info

>Black Forest Labs: Flux
https://huggingface.co/black-forest-labs/FLUX.1-schnell
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Discord
6wUwtcJsr2

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
File: tmpa_fcq1n7.png (1.17 MB, 896x1152)
>>
File: file.png (3.36 MB, 1024x1792)
float8 weights
nice
this is just the FluxSingleTransformerBlock but FluxTransformerBlock will be about the same speed
so it's already twice as fast as before, smaller weights to load with each module makes that much difference
workspace changes will help more

inb4 hateful people moaning
new models always bring them out
>>
File: FLUX__00162_.png (1.03 MB, 896x1152)
>>
>>101708609
It's hilarious hubris honestly, everything can be reversed. The question is how difficult it will be to get the compute but honestly when you have a very obvious end goal and target it makes research easy because the research simply is
>we want to train Schnell on a 24 GB GPU
Many people will be taking a crack at it and we already know the power of boners and there are many PhDs who literally have access to H100s provided by their universities.
As far as I know, to reverse Schnell all you need to do is be able to fully fine tune the weights. This is a target that can be achieved because after all, they had to be able to train it themselves.
Watch as someone figures out how to shard it into bite sized 8 GB pieces.
>>
File: ComfyUI_Flux_2009.jpg (205 KB, 1152x864)
>>
File: FLUX__00163_.png (1.04 MB, 896x1152)
>>
File: 1717504297739260.png (1.07 MB, 1024x1024)
>>
>>101708688
link?
>>
File: FDG_News_00006_.jpg (913 KB, 1344x960)
>mfw Resource news

08/03/2024

>TryOnDiffusion: A Tale of Two UNets
https://github.com/fashn-AI/tryondiffusion

>Nvidia reportedly delays its next AI chip due to a design flaw
https://www.theverge.com/2024/8/3/24212518

>ComfyUI Frontend Modernization: Transitioning to a New Era on August 15, 2024
https://github.com/comfyanonymous/ComfyUI/issues/4169

>CEO of Invoke says Flux fine tunes are not going to happen
https://www.reddit.com/r/StableDiffusion/comments/1eiuxps

>ComfyUI-FLUX-fal-API
https://github.com/gokayfem/ComfyUI-FLUX-fal-API

08/02/2024

>Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation
https://yixiaowang7.github.io/OptTrajDiff_Page

>UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model
https://github.com/X-niper/UniTalker

>Smoothed Energy Guidance for SDXL
https://github.com/SusungHong/SEG-SDXL

>Mitigating Multilingual Hallucination in Large Vision-Language Models
https://github.com/ssmisya/MHR

>GalleryGPT: Analyzing Paintings with Large Multimodal Models
https://github.com/steven640pixel/GalleryGPT

>The Manga Whisperer: Automatically Generating Transcriptions for Comics
https://github.com/ragavsachdeva/magi

08/01/2024

>Stable Fast 3D: Rapid 3D Asset Generation From Single Images
https://stability.ai/news/introducing-stable-fast-3d

>Announcing Black Forest Labs
https://blackforestlabs.ai/announcing-black-forest-labs

>Flux: The Next Leap in T2I Models
https://blog.fal.ai/flux-the-largest-open-sourced-text2img-model-now-available-on-fal

>ComfyUI: Basic Flux Schnell and Dev implementation
https://github.com/comfyanonymous/ComfyUI/commit/1589b5

>Kolors ipadapter FaceID Plus
https://github.com/Kwai-Kolors/Kolors/tree/master/ipadapter_FaceID

>The EU’s AI Act is now in force
https://techcrunch.com/2024/08/01/the-eus-ai-act-is-now-in-force

>Video game performers picket over AI protections
https://apnews.com/article/sagaftra-strike-video-games-ai-f3f18ad01c5b8f4d525a836aeb531447
>>
>mfw Research news

08/03/2024

>Image Super-Resolution with Taylor Expansion Approximation and Large Field Reception
https://arxiv.org/abs/2408.00470

>Localized Gaussian Splatting Editing with Contextual Awareness
https://arxiv.org/abs/2408.00083

>Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for Studying Species Evolution
https://arxiv.org/abs/2408.00160

>SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models
https://arxiv.org/abs/2407.20756

>Real Face Video Animation Platform
https://arxiv.org/abs/2407.18955

>ObjectCarver: Semi-automatic segmentation, reconstruction and separation of 3D objects
https://arxiv.org/abs/2407.19108

>Multi-Expert Adaptive Selection: Task-Balancing for All-in-One Image Restoration
https://arxiv.org/abs/2407.19139

>Exploring the Adversarial Robustness of CLIP for AI-generated Image Detection
https://arxiv.org/abs/2407.19553

>Advancing Prompt Learning through an External Layer
https://arxiv.org/abs/2407.19674

>VolDoGer: LLM-assisted Datasets for Domain Generalization in Vision-Language Tasks
https://arxiv.org/abs/2407.19795

>Mixture of Nested Experts: Adaptive Processing of Visual Tokens
https://arxiv.org/abs/2407.19985

>Perm: A Parametric Representation for Multi-Style 3D Hair Modeling
https://cs.yale.edu/homes/che/projects/perm/

>ML-Mamba: Efficient Multi-Modal Large Language Model Utilizing Mamba-2
https://arxiv.org/abs/2407.19832

>Adversarial Robustness in RGB-Skeleton Action Recognition: Leveraging Attention Modality Reweighter
https://arxiv.org/abs/2407.19981

>Exploring Robust Face-Voice Matching in Multilingual Environments
https://arxiv.org/abs/2407.19875

>MaskInversion: Localized Embeddings via Optimization of Explainability Maps
https://walidbousselham.com/MaskInversion/

>Task-Adapter: Task-specific Adaptation of Image Models for Few-shot Action Recognition
https://arxiv.org/abs/2408.00249
>>
>>101708688
oh is this AIT, you're compiling each layer to its own program and then you can run them sequentially? is that how that works?

that's great i was just smoking weed and thinking about how you could probably ring buffer the weights and swap em out of gpu as they get processed to save on vram
>>
>>101708758
when it's ready, it's for developers only atm
>>101708771
yes, as a workaround for the model being so big i've split it, each layer runs separately. atm i'm binding the constants to the module but i will also test it using the set_constants function which would then only have 1 module per block type
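
A toy sketch of the "each layer runs separately" idea, in plain Python rather than the actual AITemplate code. The block math, `make_loader`, and the sizes are all made up for illustration; the only point is that weights for one block are resident at a time, so the peak is the largest block rather than the sum of all blocks.

```python
# Hypothetical stand-in for a transformer block: y = x * scale + shift.
def make_loader(scale, shift, size_mb):
    def load():
        # Pretend this reads one block's weights from disk or system RAM.
        return {"scale": scale, "shift": shift, "size_mb": size_mb}
    return load

def apply_block(weights, x):
    return [v * weights["scale"] + weights["shift"] for v in x]

def run_streamed(x, loaders):
    """Run blocks in order, keeping only one block's weights resident."""
    peak_mb = 0
    for load in loaders:
        weights = load()                      # bring this block's weights in
        peak_mb = max(peak_mb, weights["size_mb"])
        x = apply_block(weights, x)
        del weights                           # drop them before the next block
    return x, peak_mb

loaders = [
    make_loader(2.0, 1.0, 300),
    make_loader(0.5, 0.0, 300),
    make_loader(1.0, -1.0, 150),
]
out, peak = run_streamed([1.0, 2.0], loaders)
total = 300 + 300 + 150
print(out, peak, total)
```

The trade-off is the one mentioned in the thread: every block pays a load/unload cost on each pass, so smaller weights per block directly cut that overhead.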
>>
>>101708744
You do realize when SD came out there were zero tools to train it right? You realize textual inversion did not exist when SD came out, right? Fucking tourist.
>>
File: ComfyUI_Flux_2031.jpg (133 KB, 1152x864)
flux sampler/scheduler comparison grid when
>>
>>101708771
>i was just smoking weed and thinking
that's cool. when I smoke weed, I watch cartoons
>>
Installed flux last night and generated a single 1026x1026 image and all it output after 10 minutes was a blank black image. I'm guessing a 1070 Ti isn't enough to use this model, yes I'm poor.
>>
>>101708830
>You realize textual inversion did not exist when SD came out
it did though
>>
>>101708930
Then you are a bigger retard than I thought. You can shut up now. No one asked someone with 90 IQ what they think could happen.
>>
>>101708935
https://github.com/rinongal/textual_inversion
check for yourself
>>
File: tmpmk0do3we.png (1.16 MB, 896x1152)
>>
File: ComfyUI_Flux_2049.jpg (181 KB, 1152x864)
>>
File: 1711792114568919.png (1.25 MB, 1024x1024)
I feel like we deserve flux after the sd3 disaster.
>>
>>101708906
Wikipedia, Blender, Mozilla, Linux, Python and so many others are nonprofits.
I believe that truly revolutionary local AI shit will come when we get a nonprofit org that gets funding all over the world to deliver something good.
It's what SAI tried to become, but failed miserably.
>>
>>101708971
Damn I wish I could use it, what are your pc specs?
>>
best site for doing videos?
>>
>>101708943
Gee anon I wonder which image model they're customizing. Did they make this code before Stable Diffusion existed? Are you stupid? Are you a tourist? Help me understand.
>>
I'm trying out the NAI v3 gen on sd-webui with the extension, anyone know why I'm getting washed out colors all the time? Sorta like the ones without VAE but I'm not sure about setting that up with it
>>
>>101709002
4080/7800x3d/32gb ddr5, but this model will run on any 10-12gb+ card

there are also online options too I think
>>
>>101709022
luma
>>
File: 1722330408384935.png (1.14 MB, 1024x1024)
this one turned out better
>>
File: tmpwq6vkoie.png (1.4 MB, 896x1152)
>>
>>101709024
any latent diffusion model. textual inversion was released before sd was released. ironically you're the tourist, don't try to pretend you were around at release.
>>
>>101709072
that's not very safe
>>
File: grifting_money_machine.png (2.29 MB, 1024x1024)
Could make some good money off tshirts this season
>>
>>101709088
Actually I was, I ran SD from the command line. That's how I know you're a fucking moron because you think we were training customized models day one. Fucking retarded stolen valor zoomer probably was in middle school when SD came out.
>>
File: scribble_is_great.jpg (348 KB, 1980x1671)
it's a pain to fix the pose image
takes 5 - 10 minutes
>>
>>101709103
my first embed was released 25th aug 2022 though, not quite day 1 but close enough
>>
>>101709144
then you would know it was leaked
retard
>>
File: FLUX__00173_.png (1.08 MB, 896x1152)
gotta stop 1girling
>>
First time comfy user.
I load the flux workflow, choose the model files, hit queue prompt, the entire thing crashes no error logged.
Terminal just says pause.
What the fuck am I supposed to do?
I have 4090
>>
File: segs.jpg (76 KB, 1360x450)
I'm generating scenes with 2+ people and using pic related to try to detect the people. It's working well and properly detects each person with a proper silhouette. The issue I'm having is that they come out as one entire segment rather than separate segments. Is there anything that can be done to either separate the people into individual segments or just have the detector work on a specified region of the image?
>>
File: 1712968625784375.png (1.18 MB, 1024x1024)
>>
https://litter.catbox.moe/640z6x.png
>>
>>101709160
whats that got to do with anything
>>
>>101709252
You're saying TI was there day one, which it wasn't, not even close. This in relation to a new SOTA model which just barely emerged from the womb which you're now doomposting about as if that's relevant to anything. My point is SD spurred a shit ton of research and things rapidly changed and you're in here probably as an employee of Flux acting like your model can't be touched.

Your model will be raped and any hopes you have with Pro will be tossed when everyone realizes show bad faith you were all along.
>>
File: tmp_7vad8j6.png (1.25 MB, 896x1152)
>>
>>101709192
>the entire thing crashes no error logged.
I'm not sure that's possible
>>
File: 1717158250585438.png (1.19 MB, 1024x1024)
the text working consistently well is what makes this model unique. SDXL/ponyXL can do good characters but not text like this (yet)
>>
File: 000000_15778_.png (1.93 MB, 1434x1075)
>>
>>101709192
>I have 4090
You need at least a 5080 Ti to run FLUX. Cheap cards won't make it.
>>
>>101709270
all these schizo /pol/ tourists. its so tiresome
>>
File: FLUX__00176_.png (1.16 MB, 1152x896)
>>
>>101709270
I'm not sure leaning into schizo posting is gonna help you make your argument
>>
>>101708830
Textual Inversion and LoRAs existed before Stable Diffusion. They were designed for use in LLMs, not in latent diffusion.
We were using TIs to "train" SD since it was leaked.
>>
>>101709306
I say this and look like this
>>
>>101709311
Prove it, show the TIs with a post number. I was there, it was just Emma posting.
>>
>>101709310
>>101709305
wow discord faggotry in sdg noo, definitely no collusion here
anyways, enjoy flux boys, it's going to be trained
>>
people are already finding ways to get it working on 8gb, just be patient anons
>>
>>101709346
>it's going to be trained
Where and by who?
>>
>>101709365
No one was going to full fine tune SD, impossible they said.
>>
>>101709355
>people are already finding ways to get it working on 8gb, just be patient anons
And patient you will learn to be, because you will wait 5-10 minutes per image.
>>
File: 00218-856208543.jpg (320 KB, 1245x1371)
>>
File: tmpbvixlb32.png (1.12 MB, 896x1152)
>>
File: Untitled.png (378 KB, 2325x1280)
>>101709280
It just dies
>>
>>101709327
Oh OK, I will go back in time and think to take a screenshot of a post because some day in the future a retard will be a faggot
>>
>>101709378
the schnell model can generate decent stuff in only 1-4 steps. but if you have the vram, dev works best (so far)
>>
>>101709391
can't wait to see a time stamp a week or more after the initial leak
>>
File: hehe.jpg (22 KB, 453x500)
>>101709391
>because some day in the future a retard will be a faggot
>>
>>101709391
you're in luck anon, every post on /g/ is archived
that means you can go and look at what was posted on August 19, 2022
>>
can you explain to me this particular one? the winner of that match was just a normal woman with high t right, not actually trans
>>
File: 1697773340562156.png (1.18 MB, 1024x1024)
it is neat though how sd3 was a bit of a huge letdown (cause it was censored, thus the training was shit) and now we get this, which is basically mini dalle-3.

open source is the way.
>>
>>101709389
https://openart.ai/workflows/maitruclam/comfyui-workflow-for-flux-simple/iuRdGnfzmTbOOzONIiVV

I used this guide, also make sure the unet/clip files have the same filenames or use the arrows to select them if different

if it acts up, download the comfy portable .zip and do a fresh install of that, then move the files over.
>>
>>101709451
I hope all this AI shit gets axed the worst way possible
>>
>>101709451
>open source is the way
Marketing gimmick. They have zero interest in supporting the scene in creating the next base for porn finetunes. In fact, they are betting that it will never happen, gloating.

Remains to be seen if autists can bruteforce and build on top of these models.
>>
File: 1714363137246773.png (1.34 MB, 1024x1024)
>>101709493
the meme stuff is just to test functionality, it's capable of really neat stuff in terms of art styles. It can make manga panels and even do japanese characters. the text outputs are very good so this is a big step forward for open source models.
>>
>>101709449
yes
>>
>>101709451
The upside to Flux is that it's local, so normies get filtered and there won't be a mass flood of these types of images. A small number of quality ones is better than what happened with dall-e.
>>
>>101709517
that's not what i mean
mark my words the few degenerate scumbags are going to ruin AI for everyone else
but you already know that
>>
>>101709525
>The upside to Flux is that it's local
Normies use Flux Pro, which you can't (locally).
>>
File: 1707822002130955.png (1.33 MB, 1024x1024)
>>101709525
and normies can't figure out ComfyUI even though you can just click and drag a jpg to copy a workflow.

>>101709545
nah, the opposite: pony models were meant for making hardcore porn, right? yet they can make amazing anime gens, even though you CAN make porn doesn't mean the model is ONLY good at porn.
>>
is sai kill?
>>
>>101709565
this is also why sd3 fucked up, they censored the model, so it's shit at anatomy other than faces. if you don't teach an AI model how figures work, it can't make them properly.
>>
File: FD_00503_.png (1.23 MB, 1024x1024)
>>101709493
You can hope for all you want but it's not going away. Settle in and enjoy the ride.
>>
File: yotsubestest.jpg (741 KB, 1374x1518)
>>101709565
Yotsubest!
>>
File: Puddi.jpg (237 KB, 1536x1536)
>>
is there a comfy node that shows the image before it's complete, like every x number of steps it updates
>>
>>101709640
you can enable preview in the settings
>>
>>101709545
Pony already can do degenerate shit and we've survived.
>>
File: savedalt.png (2.89 MB, 1536x1536)
>>101709098
>>
File: file.png (96 KB, 703x217)
It's over.
>>
>>101709668
good
>>
>>101709652
and it's a great model BECAUSE it can do degenerate shit. if you can render a body doing almost anything, you can do the same for fully clothed characters too. without proper anatomy training, gens don't look as good.
>>
>>101709677
ai gens blobs of color, it does not understand we're naked under clothes
>>
>>101709652
Pony sucks balls. Oversaturated colors. Fucked up hands and faces.
>>
File: mikuskill.jpg (767 KB, 1536x1536)
>>101709687
>>
>>101709686
even this font is just dots, like noise, that forms text. latent noise is the same, it forms an image. it knows what boobs are because it learns that from training data and concepts.
>>
>>101709687
use autismmix-confetti, it's much better for anime gens than the default model.
>>
>>101709787
no, when you say "clothes" it hallucinates blobs of colors associated with clothes
never once does it generate a naked person under those clothes
no, it generates a plausible image of blobs of color associated with "clothes"
>>
>>101709687
base pony is shit but good for training loras
and then using the loras on other pony based checkpoints
there's so many style loras that i'm sure you'd find what you're looking for
>>
File: osakaanyaface.jpg (886 KB, 1792x2304)
>>101709819
>blobs of colors
As opposed to the blob of fat that is your mother?
>>
>>101709819
yeah, it doesn't know what naked is, that's a human concept. to the model it's just an array of dots.
>>
>>101709686
yes it doesn't "understand" that we're naked underneath out clothes but it doesn't need to, please don't be retarded

how do you think the weight dimensions happen? when training, pictures of people adjust the weights that are related to people, and pictures of naked people adjust the weights related to both naked and people, so clothed anatomy improves with nudes in the dataset
>>
>>101709842
it also doesn't know balls bounce
>>
>>101709855
That's cute but flux has no genitals or really any porn yet somehow manages which destroys your premise
>>
>>101709819
well, I have to disagree.
When I'm genning clothed 1girls, and observing the preview, I'd see the outline of a naked girl's body, and then the dress gets genned over the naked silhouette.
>>
>>101709211
Impact has a 'SEGS to Mask List' node. Maybe try that.
>>
File: 00235-745409368.jpg (985 KB, 1944x2576)
lost, promptwise, gonna try to play nomanssky again
>>
>>101709773
Is your picture generated from Pony?
>>
>>101709882
prove it, you can't
>>
>>101709903
Autismmix, a Pony derivative.
When people here say Pony they typically mean Pony and its derivatives, not just base Pony.
>>
>>101709904
im busy baking a lora rn so you're right i cant prove it (right now) :^)
>>
File: delux_chi_00055_.jpg (519 KB, 1344x960)
>>101709893
one of my favorites. I was just building a base at the center of the galaxy
>>
File: file.jpg (286 KB, 1536x1536)
>>
File: 1697715732480896.png (1.11 MB, 1024x1024)
now this is podracing.
>>
File: tmp1mikzst0.png (1.28 MB, 1152x896)
>>
>>101709889
Wouldn't help, the issue itself is the SEGS not coming out as individual ones. It's always one that has all my characters.
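
One generic way to break a combined mask into per-person masks is connected-component labeling. The sketch below is plain Python on a list-of-rows binary mask, not an Impact Pack node or its API; `split_mask` is a made-up helper name. It only helps when the silhouettes don't touch each other, since touching people still form one blob.

```python
from collections import deque

def split_mask(mask):
    """Split a binary mask (rows of 0/1) into one mask per 4-connected blob."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    parts = []
    for sy in range(h):
        for sx in range(w):
            if not mask[sy][sx] or seen[sy][sx]:
                continue
            # Flood-fill from this unvisited foreground pixel.
            part = [[0] * w for _ in range(h)]
            queue = deque([(sy, sx)])
            seen[sy][sx] = True
            while queue:
                y, x = queue.popleft()
                part[y][x] = 1
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if 0 <= ny < h and 0 <= nx < w and mask[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            parts.append(part)
    return parts

# Two separate silhouettes inside one combined mask.
combined = [
    [1, 1, 0, 0, 1],
    [1, 1, 0, 0, 1],
    [0, 0, 0, 0, 1],
]
people = split_mask(combined)
print(len(people))
```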
>>
File: tmpqyslws89.png (1.21 MB, 1152x896)
>>
File: 1722310604761834.webm (3.42 MB, 1280x768)
Hopefully the black forest labs text to video can at least come close to Runway Gen 3. That would be the real game changer.
>>
File: 00016_.jpg (374 KB, 1024x1024)
>>101709976
Anon, she asked you to stop this >>101702717
>>
>>101710079
not a footfag but wowwww that's really good
>>
File: tmp921pssj5.png (1.32 MB, 896x1152)
>>
>>101710102
i'd complain about zoomer meme mikus too
>>
File: tmp0azgj1nb.png (1.34 MB, 896x1152)
>>
>>101710114
The caveat is that 90% of generations in runway Gen 3 devolve into unusable junk. Only 10% are really good, so it's still a lot of trial and error.
>>
File: tmpe8acss4t.png (1.45 MB, 896x1152)
>>
I'm worried about my cpu, it's been at 85 degrees for about 18 hours of genning now
>>
>>101710233
new paste
>>
File: hard_mode.png (645 KB, 1327x802)
>>101709178
I challenged this ultra hard pose
>>
>>101710247
i'm very tempted to get an AIO cooler. it's not like the £50 I'd spend on it comes anywhere close to the price of a 3090 which is a much better investment
>>
File: file.png (3.71 MB, 1792x1024)
weird cutlass bug when add_*_proj is cast from float8, idk, doesn't make much difference to keep those in float16
anyway this is both FluxTransformerBlock and FluxSingleTransformerBlock, so yeah about twice as fast as it was earlier
loading/unloading is still slowing it down so i'll try the set_constant method next, but this method is extremely beneficial for ramlets as well as vramlets
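
Rough arithmetic on why float8 helps, as a sketch. The 12B parameter count is an assumption brought in from outside this thread, and the numbers cover weights only: activations, text encoders, and the VAE come on top.

```python
# Bytes needed to store one parameter in each dtype.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "float8": 1}

def weight_gib(n_params, dtype):
    """Size of the weights alone, in GiB."""
    return n_params * BYTES_PER_PARAM[dtype] / 2**30

N_PARAMS = 12_000_000_000  # assumption: roughly 12B parameters

for dtype in ("float16", "float8"):
    print(f"{dtype}: {weight_gib(N_PARAMS, dtype):.1f} GiB")
```

Under that assumption the weights drop from roughly 22 GiB to roughly 11 GiB, which also halves what has to be moved on every load/unload.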
>>
File: 000000_15782_.png (1.97 MB, 1075x1434)
>>
File: tmpm889dzvz.png (761 KB, 768x1024)
>a centaur girl wearing one-shoulder clothing, carrying a long rifle, single bare shoulder
>>
File: Ayanami.jpg (302 KB, 1536x1536)
>>
File: de_fl_00108_.jpg (357 KB, 1344x960)
>>101710298
>weird cutlass bug
>>
File: tmpwye2g2p9.png (879 KB, 768x1024)
Dig the shirt here
>>
File: 1702131352645240.png (1.09 MB, 1024x1024)
>>
File: file.jpg (481 KB, 1792x1024)
>>101710336
nice
>>
File: tmpa4jgyr6m.png (559 KB, 768x1024)
I really like the color combo here, but I've been struggling to replicate it since. The hard part is getting the right shade of brown
>>
File: Migu.jpg (269 KB, 1536x1536)
>>
>>101709668
Random pineapple opinion.
>>
File: tmproj091fu.png (688 KB, 768x1024)
>>
>>101710336
Cool
>>
>>101710452
thats the ceo...
>>
>>101710471
well what the fuck does he know, he's a suit
>>
File: tmpof38jh97.png (639 KB, 768x1024)
>>
File: 00252-745025470.jpg (1.26 MB, 1944x2576)
>>
>>101710485
>>>/pol/
>>
File: tmpqwlmagae.png (795 KB, 768x1024)
>>
>>101710510
that's right, nothing
>>
File: tmpxvl0kw16.png (863 KB, 768x1024)
>>
>>101710429
omg it migu
>>
File: tmps432s31l.png (550 KB, 768x1024)
>>
>>101710530
das rite
>>
File: image.jpg (148 KB, 1024x768)
I'm currently testing out Flux Pro, and have an account I can spend some money on, pls ask for prompts. This here is:

a giant billboard at night that reads "sex problems? not with Xera!"
>>
>>101709668
>he's directly challenging the autism of the pony community
>>
File: tmpayuyuhyd.png (1011 KB, 768x1024)
>>
>>101710617
>billboard alongside a small dirt road
>>
>>101710654
i think you don't understand how advanced models work, "at night" means that you want this in a setting where it's clear that it's night. How else would you logically show it's night?
>>
File: tmp2_qe3rre.png (880 KB, 768x1024)
>>
File: mikuquestion.jpg (817 KB, 1749x1524)
>>101710694
What the FUCK does a dirt road have to do with night?
>>
File: delux_chi_00056_.jpg (746 KB, 1344x960)
>>101710617
>goku powerbombs hatsune miku through the tournament floor at the tenkaichi budokai
>>
>>101710741
roads only turn to dirt at night
>>
File: tmp42a2d1m_.png (523 KB, 768x1024)
>>
File: 1714299451265490.png (1.13 MB, 1024x1024)
>>
File: ComfyUI_00012_.png (1016 KB, 1024x1024)
Flux seems to be really good at specific styles
>>
File: tmpv3mrx0eo.png (838 KB, 768x1024)
>dat fur on the forearms
>>
Couch potato
>>
>>101703851
>>101707455
I've implemented SEG for ComfyUI/Forge in https://github.com/pamparamm/sd-perturbed-attention, not sure if it's better than PAG tho
>>
File: FLUX__00006_.png (944 KB, 896x1152)
>>
>>101710999
Kek literally me
>>
File: 2.png (1.19 MB, 1024x1024)
>>
>>101710957
nta, but thanks
>>
File: 1712741273439750.png (1.24 MB, 1024x1024)
>>
File: FLUX__00007_.png (1.1 MB, 896x1152)
>>
File: delux_chi_00057_.jpg (511 KB, 1344x960)
>>101710957
errm, it's Smoothed Energy Guidance but you named the repo Perturbed-Attention Guidance
>>
File: forU1.png (1.02 MB, 768x1152)
>>
File: forU2.png (1.37 MB, 768x1152)
>>
File: ComfyUI_00030_.png (1.63 MB, 1024x1024)
Prompt: 'faggot'
>>
>>101711171
It's the same thing basically. SEG is a variation of PAG
>>
File: file.png (4 KB, 509x68)
might defork it when i get to 100
>>
>>101711200
>>101711186
Try to prompt the reflection in the glasses to something specific.
>>
Is there a repository of information about SD techniques that are a bit more intermediate/advanced/obscure? For example, I recently learned about Differential Diffusion that basically converts any SD model into a model capable of inpainting. And now I see folks discussing something called Smoothed Energy Guidance, and I'm getting imposter syndrome kek.
>>
File: delux_chi_00060_.jpg (1.08 MB, 1344x960)
>>101711244
nah, too much stuff happens all the time for there to be any sort of central repository of information. most information is spread ad hoc through social media like twitter, reddit, discord, and here. all you can really do is search around and try to dig up conversations on things you want to learn more about
>>
File: tmprnaha7xm.png (536 KB, 768x1024)
>>
File: forU3.png (1.19 MB, 768x1152)
>>101711221
>>
File: tmpudhybz_1.png (537 KB, 768x1024)
>>
File: ComfyUI_Flux_2147.jpg (206 KB, 1152x864)
>>
File: tmpqpv9301a.png (890 KB, 768x768)
>>
File: 1712518486676486.png (1.39 MB, 1024x1024)
Depict a dynamic and charismatic Hatsune Miku dancing on a stage. The image is in the style of a comic book from the 1930s. Miku is saying "Hello, SDG!" in a white speech bubble.
>>
File: tmp9x30pq34.png (721 KB, 768x1024)
>>
>>101711535
Hello Miku
>>
File: 1691521653103002.png (1.6 MB, 1024x1024)
>>101711535
>>
File: 1721969594036167.png (1.44 MB, 1024x1024)
>>101711574
>>
File: delux_chi_00062_.jpg (1.32 MB, 1344x960)
>>101711535
now make goku powerbomb her through the floor
>>
File: ComfyUI_00034_.png (1.08 MB, 1024x1024)
>tfw flux doesn't know celebrities
>>
File: 274754.jpg (1017 KB, 2449x1320)
>>
File: 1713823915791356.png (1.22 MB, 1024x1024)
>>101711607
one more, but a diff style prompt
>>
File: ComfyUI_Flux_2151.jpg (185 KB, 1152x864)
>>
File: FLUX__00017_.png (1.08 MB, 896x1152)
>>
File: de_fl_00006_.jpg (728 KB, 1344x960)
>>101711639
yeah I was trying to get a Missy Elliott rap video. this is Missy Elliott
>>
File: FLUX__00018_.png (1020 KB, 896x1152)
>>
File: 3941110155.png (2.39 MB, 1536x1536)
COME VISIT NEW YORK
>>
Anyone else disappointed that Flux appeared so great but turned out to be dead on arrival because it can't be trained?
>>
>>101711718
I think it's not impossible to train. Just potentially very hard, especially considering the ungodly amount of compute power and VRAM required.
>>
>>101711718
sdg is waiting for ldg's apology
>>
>>101711734
people can rent super high end GPUs, just give it time
>>
>>101711696
>>101711708
sexo
>>
>>101711718
Releasing the models to the public was a publicity stunt, yeah. They made a great model though. That must be admitted.
>>
>>101711696
>>101711708
WTF how did her coat become transparent?
>>
>>101711718
I think personal loras are no-go, but we'll probably see finetunes eventually
I can live with it
>>
Nice, so we're back to using 1.5 and SDXL as our models again now that Flux can't be trained. It's over for us localchads.
>>
>>101711734
Even loras and controlnets would be enough. is it even possible?
>>
any reason why my bake is slowing to a crawl?
first time this is happening
>>
>>101711777
attempting a pony pass on a flux gen and it's just not comparable. Every time it makes it worse, even if it can attach some normal looking nipples.
If Flux is trainable it's over for SD.
>b-but some random faggot said
nobody cares
>>
i miss schizo anon
>>
>>101711813
Right is original pony?
>>
>>101711836
Left is pony, right is original flux
>>
File: FLUX__00020_.png (1006 KB, 896x1152)
struggling to get a wet look
could be much wetter
>>
>>101711777
>>101711786
Some guy already came up with an experimental training script to finetune Flux Dev. I think it's only a matter of time.

Besides compute power, another potential factor that might hinder finetuning is the way Flux dev and Flux schnell were distilled from Flux pro. I don't have the full picture but apparently it's got something to do with the negative prompt and the guidance scale being hardcoded during the distillation process. However folks are already throwing out ideas to get around this limitation.
>>
>>101711813
>some random faggot
The CEO of black forest?
>>
File: 1653712429.png (2.84 MB, 1536x1536)
Yep. This is going to my insta account.
>>
>>101711846
Yeah, but it takes 75gb of VRAM to train a lora at the moment. Let's see if it can go down to 24.
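Rough back-of-envelope for why 24 is tight. This is only a sketch: the ~12B parameter count for Flux dev is an assumption (it isn't stated in this thread), and it counts the base weights alone, ignoring activations, gradients, optimizer state and the text encoders.

```python
def weights_gib(n_params: float, bytes_per_param: float) -> float:
    """Memory needed just to hold the model weights, in GiB."""
    return n_params * bytes_per_param / 1024**3

# Assumption: roughly 12 billion parameters for Flux dev.
N_PARAMS = 12e9

for name, bytes_per_param in [("fp32", 4), ("bf16", 2), ("8-bit", 1)]:
    print(f"{name}: {weights_gib(N_PARAMS, bytes_per_param):.1f} GiB for weights alone")
```

So under that assumption the bf16 weights by themselves already sit close to a 24gb card, and anything that fits would likely need quantized weights or offloading.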
>>
>>101711861
catbox?
>>
File: 1706170516368444.png (1.05 MB, 1024x1024)
Depict a dynamic and charismatic Hatsune Miku standing in a Pokemon stadium. She is shaking the hand of Ash Ketchum from the anime Pokemon. The image is in the style of the anime Pokemon. The scene is dynamic and features a Pokemon battle arena.
>>
File: 1698271487845107.png (1.12 MB, 1024x1024)
>>101711878
I love how powerful the prompt "in the style of ____" is in flux
>>
>>
>>101711869
I don't think that's gonna be possible without significant concessions (which would be detrimental to the final LoRA quality), or a new and significantly optimized LoRA training method.
>>
File: magicbox2.jpg (6 KB, 225x225)
>>101711875
https://files.catbox.moe/xivwep.png
gotta use controlnet on this bad boy
>>
>>101711857
You are dumb. That was Invoke CEO
>>
File: FLUX__00022_.png (1.42 MB, 896x1152)
>>
>>101711930
baka I thought you were using flux
>>
>>101711932
That nigger also said that Inpainting was going to be impossible in Flux and a few hours later it was already working KEK!
>>
File: 0.jpg (233 KB, 1024x1024)
I'm using the dev version of FLUX, but every several gens, it gets REALLY slow. I have to restart ComfyUI to get it working again. What could be the issue?
>>
>>101711936
I'm really impressed at how good Flux is at knowing when something should or should not be symmetrical.
>>
Does anyone know if Differential Diffusion amplifies Flux's inpainting ability? Or is it an SD-only method?
>>
>>101711951
of course not, fuck new things.
>>
>>
File: FLUX__00024_.png (1.41 MB, 896x1152)
>>
>>101712006
based
>>
>>101711963
probably a leak in one of those shitty python packages
>>
File: ComfyUI_Flux_2169.jpg (209 KB, 1152x864)
>>
File: 1701670879992677.png (1.11 MB, 1024x1024)
>>
Anyone else recall when SDXL was released and people were saying it would be impossible to train a LORA on a 24gb gpu?
>>
I am 100% convinced it's over for womancels. 5 more years of llm development + embodiment and it's over. If you think the current birth rates are low, it'll be fucking all over in 5 years.
>>
File: ComfyUI_Flux_2171.jpg (173 KB, 1152x864)
>>
File: Flux_1722732920_0001.jpg (1.14 MB, 2432x1664)
>>
>>101712095
Yes, and I ended up training a bunch for XL on my 2060S
>>
>>101711891
but this looks like the generic flux anime style and not pokemon.
>>
Black Forest is promising text to video now. Get hyped.
>>
File: 1709362930154862.png (1.11 MB, 1024x1024)
>>101712161
if you specify pixar it's an entirely different result, it knows styles
>>
>>101711535
Thanks for the prompt anon
>>
>>101712172
why, I doubt I have the hardware to use it
>>
>>101712174
eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee
>>
>>101711718
Very disappointed, yes, but on the bright side I can create training data for SD using Flux gens.
>>
>>
File: ComfyUI_temp_phjap_00079_.png (3.7 MB, 1584x1232)
has anyone tried img2img with flux?
>>
File: tmpwigrtke0.png (1.23 MB, 768x1024)
>>
File: Flux_00270_.png (829 KB, 768x1024)
>>
File: redeem.jpg (842 KB, 2500x3333)
>>
>>101712239
It can mix styles? That's impressive. Prompt?
>>
>>101712255
credit goes to the guy in the /lmg/ thread, but you can specify different styles by saying the thing is "edited in"

>circa 2015 poor quality mirror selfie taken with iPhone in bathroom of overweight man with a greasy shirt and thick glasses with his arm over the shoulder of a cutout of Sailor Moon which has been edited into the photograph, Sailor Moon is blushing and the man is looking into the camera with a smug grin
>>
>>101712253
what's SAI doing right now anyway? haven't followed anything, are they still begging for money?
>>
>>101712312
They said they won't be releasing any other SD 3 model and they are still confident SD3 medium will become the standard and Flux will flop because it cannot be fine tuned.
>>
File: tmph8lrbxdq.png (1.13 MB, 768x1024)
>>
>>101712312
they are just searching for a small loan of 30m dollars, as this is the only thing preventing them from releasing the totally existing Stable Diffusion (NON BETA) Version 3.1: Awesome Edition, which will bring world peace and answer the question of whether any odd perfect numbers exist (an unsolved mathematical question)
>>
File: 00239-2294070856.jpg (194 KB, 640x856)
verify i am human.. but how....
>>
File: 2392714125.png (3.87 MB, 1248x2624)
Are those pants even legal?
>>
>>101712431
How do you folks get such cool poses?
>>
>>101712431
How did you get those pants, anon? I want to buy some for my gf.
>>
>>101712103
womancels?
>>
>>101712469
I was trying to be funny as if women were getting replaced soon, but came across as incoherent sincere schizo
>>
>>101712448
controlnet depth midas with sdxl. Lower the control weight to about 7 (gotta experiment with those).

>>101712452
after doing what I said above, you send it to img2img and just roll with cfg at around 0.5
>>
Between Pony models and SDXL anime models, which ones are better for prompt adherence? Let's say I want to prompt a specific pose without using controlnets. Which models would give me a better chance of nailing it in my gens?
>>
>>101712387
i dont care as long as trani suffers
lets hope the company dies really slow and that at any point there is a slight hope for it to be redeemed (which it will never be)
that is really good for your mental health
>>
>>101712494
>controlnet depth midas with sdxl. Lower the control weight to about 7 (gotta experiment with those).
Ah, so I imagine you would be using a reference image for the pose right?
>>
File: ComfyUI_temp_uvzbi_00009_.png (1.85 MB, 1120x1440)
>>
File: Flux_1722734816_0001.jpg (1.69 MB, 1792x2304)
>>
>>101712499
pony anime checkpoints usually don't need controlnets, but you can get any pose you want if you use a reference photo and use controlnet openpose on it.

controlnet union is an all in one SDXL controlnet model btw, it works for canny/depth/openpose, works really well

https://huggingface.co/xinsir/controlnet-union-sdxl-1.0/tree/main

just rename the promax model to controlnetUnion.safetensors and put it in the controlnet model folder.
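
The rename-and-move step can be scripted. A minimal sketch, assuming the promax weights are already downloaded; the function name `install_controlnet` and every path in the usage comment are hypothetical, and your UI's controlnet folder may live somewhere else.

```python
import shutil
from pathlib import Path

def install_controlnet(downloaded: Path, models_dir: Path,
                       new_name: str = "controlnetUnion.safetensors") -> Path:
    """Copy a downloaded controlnet file into models_dir under new_name."""
    models_dir.mkdir(parents=True, exist_ok=True)  # create the folder if missing
    target = models_dir / new_name
    shutil.copy2(downloaded, target)  # copy, so the original download is kept
    return target

# Hypothetical usage (adjust both paths to your own setup):
# install_controlnet(Path("~/Downloads/promax.safetensors").expanduser(),
#                    Path("path/to/your/controlnet/models"))
```

It copies rather than moves, so a botched path doesn't lose the download.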
>>
>>101712514
Yes. Jojo's bizarre adventure is your source for cringe poses.
>>
>>101712539
Thanks. Let's say I want to save a bunch of pose depth maps and openpose keypoints as images for later use. When I use these reference images, do I need to resize them to be the same size as my latent?
>>
>>101712572
if you want to save the images of the openpose stuff you can use something like this:

https://github.com/fkunn1326/openpose-editor

or, just save the image that has the pose you want and drop it into controlnet when you want that pose.
>>
>>101712600
Yeah but let's say I downloaded a depth map from somewhere and it's some random resolution like 1118x956 and my latent resolution is 896x1152. Do I need to resize the depth map manually before passing it to the Apply Controlnet node? (I use ComfyUI)
>>
>>101712629
i'd use a new depth map made in controlnet every time, otherwise the info might not work well for the new image: controlnet can make it fast anyway
>>
File: de_fl_00007_.jpg (873 KB, 1344x960)
>>101712312
they basically got omega cucked by BFL releasing flux like they did. BFL released an 'open' model to drum up hype and attention for their 'pro' saas offering. that was basically the exact play that SAI was angling for. they were aiming to release SD3.1M with a goal of driving people towards their saas ecosystem, which would have their SD3L plus all their other random stuff
so what happens when SAI drops SD3.1M and it looks retarded compared to flux? they're basically competing for the same audience (if there is one)
>>
>>101712638
That's not my question tho. I'm asking, if I don't have the original image and just the depth map, would I need to resize it manually to be the same size as my latent?

The reason I'm asking this is cuz certain reference images may have other artefacts (like a table in the background, or another subject) which I may not want. So I might choose to generate the depth map, manually remove the unneeded artefacts from the depth map in an image editing software, and save the depth map with only the subject to be used later.

Now, when I import this depth map later, do I need to resize it manually to be the same size as my latent?
>>
>>101712660
nta but you don't have to resize if it's not much bigger than 1024x1024 (I wouldn't resize from the example you gave). I have 16gb vram and haven't had any problems with those dimensions.
>>
>>101712660
I don't think you need to resize it, whatever you generate will be based on that map, the output is whatever size you like. So it's up to you if you want it a rectangle shape, square, or whatever: but the initial output will be 1:1 with the depth map you pick, how it appears (skewed or not) depends on your output size.
>>
File: de_fl_00008_.jpg (920 KB, 1344x960)
https://suno.com/song/bbc7032a-93a5-445b-a0d5-c91563c5214d
>>
>>101712685
I see, thanks
>>
File: Flux_1722735658_0001.jpg (1.67 MB, 1792x2304)
>>
>>101712710
like if I did a canny controlnet on a 500x900 image, and the output was 250x250, the output will be fine but the output size will squish it. or you could also try the "resize and fit" option.
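
To put numbers on the squish: a small sketch comparing a plain stretch with an aspect-preserving fit. Both helper names are made up for illustration, and the fit is only a guess at what a "resize and fit" style option does; no claim that any UI computes it exactly this way.

```python
def stretch(src, dst):
    """Plain resize that ignores aspect ratio; returns (x_scale, y_scale)."""
    (sw, sh), (dw, dh) = src, dst
    return dw / sw, dh / sh

def fit_inside(src, dst):
    """Aspect-preserving resize so src fits inside dst.

    Returns the resized (width, height) and the leftover (x, y) padding.
    """
    (sw, sh), (dw, dh) = src, dst
    scale = min(dw / sw, dh / sh)
    new_w, new_h = round(sw * scale), round(sh * scale)
    return (new_w, new_h), (dw - new_w, dh - new_h)

# 500x900 control image into a 250x250 output: the two scales differ, so it squishes.
print(stretch((500, 900), (250, 250)))

# 1118x956 depth map into an 896x1152 latent, keeping the aspect ratio.
print(fit_inside((1118, 956), (896, 1152)))
```

When the two scale factors from `stretch` differ, the control image gets distorted; the fit keeps the proportions and leaves padding (or needs a crop) instead.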
>>
File: 1710728591332273.png (1.14 MB, 1024x1024)
"cutout of _____" works great to mix real and anime, good job prompt anon
>>
File: 1722735969016_image.jpg (62 KB, 984x984)
>>101712739
Amazing.
>>
File: Flux_00295_.png (814 KB, 1024x768)
>>
File: 1696973716558723.png (1.14 MB, 1024x1024)
>>101712755
it's neat how dalle-3 is closed source and now we got an open source dalle, essentially

or 1.5/sdxl/ponyxl checkpoints and loras for anime/realism gens as well
>>
File: Flux_00282_.png (1.02 MB, 1024x1024)
>>
File: 1722736315171_out-0.jpg (113 KB, 984x984)
>>101712796
>>101712739
NINTENDO!!!!
>>
>>101712801
how did you get a realistic body but still get anime miku?
>>
>>101712790
prompt?
>>
File: delux_con_00020_.png (1.5 MB, 1216x832)
>>101712796
are there /v/ threads for flux?
>>
File: 1714405647136201.png (1.17 MB, 1024x1024)
>>101712844
not sure, but i'd prefer to see new flux/SD gens only in one thread so this is still the best spot.
>>
>>101712844
The dall-e thread is the designated Ai thread. Most anons won't care what model you use.
>>
File: tmp7vnixpy4.png (750 KB, 768x1024)
>>
>>101712864
Lol
Are you doing something like cardboard cutout/Promotional cardboard cutout?
>>
Does anyone have an install guide?
>>
File: delux_con_00019_.png (1.24 MB, 1216x832)
>>101712868
I'm kind of surprised. 4chan is rife with in-fighting regardless of which board you go to. I figured /v/ would be pretty quick to jump on the "anyone still using dalle is a m$ dick sucker"

>>101712908
https://comfyanonymous.github.io/ComfyUI_examples/flux/
>>
>>101712908
Be more vague please.
>>
File: tmp47gab9mt.png (1020 KB, 768x1024)
>>
>>101712903
another anon noted a cutout worked for putting anime into real life photos, so this includes "a cutout of Hatsune Miku wearing her original outfit which has been edited into the photograph"
>>
File: cumrag.jpg (34 KB, 901x182)
>update comfy
>fails
>fresh install
>install missing nodes
>works
>restart comfy
>this error every time
Wow really fucking cool
>>
>>101712925
Update Impact Pack
>>
File: 1722736814372_image.jpg (68 KB, 984x984)
Filled!
>>
>>101712925
>comfy
>it's not comfy
>>
>>101712935
Altho, I was going through the Impact-Pack main branch earlier today and I literally could not find a definition for the UltralyticsDetectorProvider class. Not 100% sure about this but it's probably been renamed to mmdet or something.
>>
>>101712935
Same thing, after update works but same error once I restart
>>
>>101712913
/v/ is full of people who aren't tech savvy, so things like online image gen get far more use than local from what I've seen. Also plenty of people that have consoles and maybe a laptop but no dedicated pc.
You can always make your own thread and see how it goes. If there was a thread with instructions on how to install local I bet it would catch on more.
>>
Please someone bake
>>
>>101712995
>You can always make your own thread and see how it goes
oh I don't actually care at all. I was just curious
>>
>>101713010
no, u
>>
Next thread
>>101713099
>>101713099
>>101713099
Have fun ;)
>>
>>101710957
Has anyone tested this yet? Not at computer but the GitHub sample images don't explain much to me, would be nice to see a normal user xy
>>
>>101711806
if your buckets are different resolutions with different amounts of pixels (width*height) then buckets with more pixels will take longer
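
Quick way to see it. The bucket list below is hypothetical (not from any real training config), and treating step time as proportional to pixel count is a simplifying assumption; attention cost can grow faster than that.

```python
# Hypothetical aspect-ratio buckets as (width, height).
buckets = [(512, 512), (768, 1024), (896, 1152), (1024, 1024)]

def relative_cost(bucket, baseline):
    """Pixel count of a bucket relative to the baseline bucket."""
    w, h = bucket
    bw, bh = baseline
    return (w * h) / (bw * bh)

# Use the bucket with the fewest pixels as the reference point.
baseline = min(buckets, key=lambda b: b[0] * b[1])

for w, h in sorted(buckets, key=lambda b: b[0] * b[1]):
    ratio = relative_cost((w, h), baseline)
    print(f"{w}x{h}: {w * h} px, ~{ratio:.2f}x the smallest bucket")
```

If the slowdown lines up with the bake hitting its biggest buckets, that's probably all it is.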




All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.