[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: 1721057839095516.png (1.23 MB, 928x1432)
1.23 MB
1.23 MB PNG
Previous /sdg/ thread : >>102083549

>Beginner UI local install
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Local install
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
SD.Next: https://github.com/vladmandic/automatic
AMD GPU: https://rentry.org/sdg-link#amd-gpu
Intel GPU: https://rentry.org/sdg-link#intel-gpu

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Run cloud hosted instance
https://rentry.org/sdg-link#run-cloud-hosted-instance

>Try online without registration
flux-dev: https://huggingface.co/spaces/black-forest-labs/FLUX.1-dev
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://aitracker.art
https://openmodeldb.info

>Black Forest Labs: Flux
https://huggingface.co/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/trash/sdg
>>
First for (tr)ani is literal human garbage
>>
>>102091354
hi hlky
>>
>>
>>102091456
nothing to do with me desu
>>
File: tmpecu56_bh.jpg (1011 KB, 1920x1080)
1011 KB
1011 KB JPG
happy? monday
>>
File: 00374-2145910606.png (1.77 MB, 1600x896)
1.77 MB
1.77 MB PNG
>>102091885
why are they all so sad?
>>
>>102091958
idk, but i changed the tag to 'happy' for a bit
>>
>>102091301
this looks like a new Nintendo Metroid game
>>
>>102091861
It's the schizo narcissist attaching to any frequent poster now that his ban evasion is not noted any longer. Just don't reply.
>>
>102092199
>hlky
>>
>one day i wont have to even start
>>
>mfw Resource news

08/26/2024

>FLUX.1-dev-ControlNet-Union-Pro
https://huggingface.co/Shakker-Labs/FLUX.1-dev-ControlNet-Union-Pro

>FLUX.1-dev-ControlNet-Depth
https://huggingface.co/Shakker-Labs/FLUX.1-dev-ControlNet-Depth

>Photoshop | Regional prompt support for ComfyUI by sd-ppp
https://github.com/zombieyang/sd-ppp/wiki/Tutorial:-regional-prompting-in-Photoshop-by-SD%E2%80%90PPP

>T3M: Text Guided 3D Human Motion Synthesis from Speech
https://github.com/Gloria2tt/T3M

>Semantic Alignment for Multimodal Large Language Models
https://mccartney01.github.io/SAM/

>joy-caption-pre-alpha: Image Captioning App
https://huggingface.co/Wi-zz/joy-caption-pre-alpha

>sdxl-controlnet-lineart-promeai
https://huggingface.co/promeai/sdxl-controlnet-lineart-promeai

>Diffusion Toolkit v1.7: Metadata-indexer and Viewer for AI-generated images
https://github.com/RupertAvery/DiffusionToolkit/releases/tag/v1.7

>facebook/Sapiens: family of vision transformers
https://huggingface.co/facebook/sapiens

08/25/2024

>image-gallery-comfyui: Image Gallery and Carousel
https://github.com/wogam/image-gallery-comfyui

>LabelPair: booru-style tagging tool in browser
https://github.com/IceSandwich/LabelPair

>diffusers V0.30.1: CogVideoX-5B & Bug fixes
https://github.com/huggingface/diffusers/releases/tag/v0.30.1

08/24/2024

>ComfyUI v0.1.x Release: Devil In the Details
https://blog.comfy.org/comfyui-v0-1-x-release-devil-in-the-details-2/

>Azula - Diffusion models in PyTorch
https://github.com/probabilists/azula

08/23/2024

>Midjourney's AI-image generator website is now officially open
https://www.zdnet.com/article/midjourneys-ai-image-generator-website-is-now-officially-open-to-everyone/

>Anthropic says California AI bill's benefits likely outweigh costs
https://www.reuters.com/technology/artificial-intelligence/anthropic-says-california-ai-bills-benefits-likely-outweigh-costs-2024-08-23/

>AiM: Scalable Autoregressive Image Generation with Mamba
https://github.com/hp-l33/AiM
>>
>mfw Research news

08/26/2024

>How Diffusion Models Learn to Factorize and Compose
https://arxiv.org/abs/2408.13256

>LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
https://ys-imtech.github.io/projects/LayerPano3D/

>CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
https://customcrafter.github.io/

>Focus on Neighbors and Know the Whole: Towards Consistent Dense Multiview Text-to-Image Generator for 3D Creation
https://arxiv.org/abs/2408.13149

>Atlas Gaussians Diffusion for 3D Generation with Infinite Number of Points
https://arxiv.org/abs/2408.13055

>G3FA: Geometry-guided GAN for Face Animation
https://arxiv.org/abs/2408.13049

>EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation
https://arxiv.org/abs/2408.13005

>Image Segmentation in Foundation Model Era: A Survey
https://arxiv.org/abs/2408.12957

>Context-Aware Temporal Embedding of Objects in Video Data
https://arxiv.org/abs/2408.12789

>Building and better understanding vision-language models: insights and future directions
https://arxiv.org/abs/2408.12637

>Can GPT-4 Models Detect Misleading Visualizations?
https://arxiv.org/abs/2408.12617

08/25/2024

>METR: Image Watermarking with Large Number of Unique Messages
https://arxiv.org/abs/2408.08340

>Knowledge Prompting: How Knowledge Engineers Use LLMs
https://arxiv.org/abs/2408.08878

>Barbie: Text to Barbie-Style 3D Avatars
https://arxiv.org/abs/2408.09126

>Depth-guided Texture Diffusion for Image Semantic Segmentation
https://arxiv.org/abs/2408.09097

>Realistic Extreme Image Rescaling via Generative Latent Space Learning
https://arxiv.org/abs/2408.09151

>Re-boosting Self-Collaboration Parallel Prompt GAN for Unsupervised Image Restoration
https://arxiv.org/abs/2408.09241

>Implicit Grid Convolution for Multi-Scale Image Super-Resolution
https://arxiv.org/abs/2408.09674
>>
File: delux_op_00019.jpg (71 KB, 1024x1024)
71 KB
71 KB JPG
>>
Do not post tranis shota catbox i warn you
>>
File: 1724684322237.jpg (429 KB, 1024x1024)
429 KB
429 KB JPG
>>
>>102092199
True
What a disgrace
>>
File: 1724684340785.jpg (408 KB, 1024x1024)
408 KB
408 KB JPG
>>
File: 1724684352699.jpg (444 KB, 1024x1024)
444 KB
444 KB JPG
>>
File: delux_sf_00023_.png (2.08 MB, 1536x968)
2.08 MB
2.08 MB PNG
gm
>>
File: 1724684360954.jpg (355 KB, 1024x1024)
355 KB
355 KB JPG
>>
>gm
>>
File: 00066-957060233.png (2.4 MB, 1376x1024)
2.4 MB
2.4 MB PNG
>>
>>102093216
Why the fuck do you never bother koff but always based ani?
>>
>>102093285
ani is successful and it makes me mad
>>
Flux users!

flux1-dev-bnb-nf4-v2.safetensors
flux1-dev-fp8.safetensors
flux1-dev.safetensors

Let me get this straight, the last version is the best?
I can't use it in Forge, gave me an error about needing Clip-L
where do I get this? where do I put it?
>>
>>102093311
Q6_K.gguf
>>
File: 00378-1232682073.png (1.78 MB, 1600x896)
1.78 MB
1.78 MB PNG
>>
>>102093298
pathetic "stealth" pedo
>>
>>102093311
I followed these instructions:
https://www.patreon.com/posts/110007661
>>
>>102093412
>he didn't deny it
>>
So I want to train a local LoRA of a person.

What rank should I use?


The civitai default rank 2 gives godo results.

i have seen LoRAs with rank 8 perform worse then rank 2.

Is there any consensus yet?

Does it depend on the dtaset size? I have 35 pics.


Also people talk about not captioned datased performing better than Joy Cpation datasets.


What is your take?
>>
>>102093466
Who do you want to train a lora of?
>>
>>102093482
A low key celebrity. Just the face
>>
File: 00130-420.png (1.47 MB, 1024x1152)
1.47 MB
1.47 MB PNG
>>
SAI coke fund status?
>>
>>102093216
>if you break the rules you will get janny'd
>this is somehow hard to understand for some "anons"
>>
>>102093338
>Q6_K.gguf
wut dis and how does it help? where do I put it?
>>
File: delux_op_00020.jpg (72 KB, 1024x1024)
72 KB
72 KB JPG
Gm
>>
File: 00078-216478323.png (1.34 MB, 1376x1024)
1.34 MB
1.34 MB PNG
trying to figure out how to use controlnet in i2i.. neat result i guess but not what i wanted
>>
Morning anons
The word "Quokka" on its own has officially been deemed "too explicit" for FLUX pro API
>>
File: delux_sf_00025_.png (2.14 MB, 1536x968)
2.14 MB
2.14 MB PNG
>>102093898
gm
>"Quokka" on its own has officially been deemed "too explicit"
sorry for your loss. might have to train your own lora
>>
>>102093898
try the scientific name, maybe. setonix brachyurus. i totally knew that name and didnt look it up on wikipedia just now
>>
File: delux_me_00062_.jpg (534 KB, 896x512)
534 KB
534 KB JPG
>>102093928
>setonix brachyurus
erm.. I dont think it works
>>
>>102093959
well, it is a neat critter
>>
>>102093928
>setonix
>sonix
>sonic
how have we never made this connection before? quokka is sonic
>>
File: delux_xp_00001.jpg (213 KB, 1024x1024)
213 KB
213 KB JPG
>>102093898
gm
>"Quokka" on its own has officially been deemed "too explicit"
sorry for your loss. might have to train your own lora
>>
File: 1724691215248_1.jpg (96 KB, 1170x800)
96 KB
96 KB JPG
>>102093927
>>102094003
Quokka FLUX LoRA sounds cool, i should definitely get a Civitai subscription
>>102093928
That's what I did, i typed in quokkasetonix and that worked
>>
File: delux_sf_00026_.png (2.3 MB, 1536x968)
2.3 MB
2.3 MB PNG
>>102094027
lmao why did you reply to the troll poster
>>
File: 02697.png (65 KB, 640x896)
65 KB
65 KB PNG
>>
>>102093311
>>102093413
Still getting the
AssertionError: You do not have CLIP state dict!
>>
>>102093311
they're just different levels of truncated to suit vramlets of different levels. Flux1-dev is the original, but if you have tiny VRAM like me you'll want nf4 version, or if you're somewhere in the middle you'll want fp8
>>
>>102093999
Gotta go fast
>>
File: 00101-3293567602.png (2.33 MB, 1376x1024)
2.33 MB
2.33 MB PNG
https://youtu.be/q6hjysbKNH0?si=cpr_PFKJyQq--1gc
>>
File: 02698.png (76 KB, 640x896)
76 KB
76 KB PNG
>>
File: quokka.png (889 KB, 660x788)
889 KB
889 KB PNG
>>102094146
>>102094027
>>102093898
Why are the real life ones more "memeable"? LOL
>>
>>102094422
I think vanilla 1.5 is the only model that has gotten close to make realistic Quokkas
>>
>>102093754
down to the last eighth
>>
>>102093803
https://github.com/city96/ComfyUI-GGUF
>>
>>102094184
gem
>>
https://github.com/city96/ComfyUI-GGUF/issues/48#issuecomment-2308413117

>3.85 s/it on a rx 6900xt

Whats your excuse for not getting a second hand rx 6800?
>>
File: delux_sf_00028_.png (2.51 MB, 1536x968)
2.51 MB
2.51 MB PNG
>>
>>102094115
I presume you did do a "git pull". That error is usually because you're not using the right model.
>>
>>102094967
>What's your excuse
I'm using a laptop
>>
>>102094967
>from 8s/it to 2s/it on my 7800 XT using these flags and flash attention
One guy got 4x speed, this reminds me of when some softwares were much slower for AMD than intel because of intel's shady business.
>>
>>102095133
You look tasty. Don't move.
>>
File: delux_sf_00029_.png (1.77 MB, 1536x968)
1.77 MB
1.77 MB PNG
>>102095430
I doubt quokka meat tastes very good
>>
File: 00114.jpg (917 KB, 1536x2048)
917 KB
917 KB JPG
>>
File: 00133-999.png (1.72 MB, 1600x896)
1.72 MB
1.72 MB PNG
>>
File: 00166-2556162586.png (1.87 MB, 1024x1280)
1.87 MB
1.87 MB PNG
dled forge, and will attempt to use flux
>>102095133
https://x.com/berenyi_miki/status/1050897067214811137
possible slight interest, singer of Lush with some quoks.
>>
File: 00381-999.png (1.74 MB, 1600x896)
1.74 MB
1.74 MB PNG
>>
File: delux_sf_00030_.png (2.2 MB, 1536x968)
2.2 MB
2.2 MB PNG
>>102095912
>dled forge, and will attempt to use flux
woah
a new era is upon us (maybe)
>>
>>102095985
dont get your hopes, or fears, up, im so used to the reliability of auto/1.5, low tolerance for errors and troubleshooting
>>
File: file.jpg (395 KB, 1792x1024)
395 KB
395 KB JPG
good progress today desu
soon:tm:
not even two more weeks, maybe two more days
>>
File: 00000-3421120106.png (1.26 MB, 896x1152)
1.26 MB
1.26 MB PNG
first forge pic, took 6 minutes, 50 secs.... i just have a gtx 1080, so expected? using flux1-dev-bnb-nf4-v2
https://files.catbox.moe/ace1au.png
if any1 can offer tips on settings.
>>
>>102096222
Did you use flux? If so, which one and how many steps?
>>
File: 00001-958756901.png (1.22 MB, 896x1152)
1.22 MB
1.22 MB PNG
>>102096522
ha. nevermind.
>>
https://www.reddit.com/r/StableDiffusion/comments/1f1lhyo/the_1girl_phenomenon_is_flux_next/
>I observed that many NSFW LoRas are generating faces and body parts that look almost identical to those produced by Pony Realism and SDXL models.
time is a circle
>>
should I be able to see some improvement in time/gen from batch gens vs singles with SDXL?
>>
File: basegen_00433_.png (1.15 MB, 1280x768)
1.15 MB
1.15 MB PNG
testing a flux schnell lora i made based off a small dataset of 2D platformer images with a retro-style filtered pixel art appearance. it seems to work pretty well so far but i need to do more work on it
>>
>>102096602
Yeah, batch size should increase the time of it/s (or s/it in case you have a slow machine) since it's generating many images at the same time and batch count shoud give you an amount of total time spent on all processes done In a row, although at first when genning with a batch count higher than 1 since it's individual processes (as if you were doing a single gen)
>>
>>102096590
>I've noticed something interesting about the Flux model—one of its standout features for me is the way it produces unique faces and anatomies, aside from the occasional cleft chin. In the past, it was easy to identify AI-generated images at a glance, even before scrutinizing the hands or other imperfections, just by recognizing the distinct "1girl" face. Fortunately, with Flux, this issue seems to be partly resolved.
Lmfao. Redditors never actually tried prompting base SDXL or SD1.5 I guess.
>>
>Julien
>>
>>102096708
What kind of improvement are we talking here?
It seems I can get some 10-20% improvement by genning in batches of 4-6 compared to singles.
That's forge, 24GB, 1024x1024 size so it should fit in VRAM with no problem.
>>
File: delux_sf_00034_.png (2.48 MB, 1536x968)
2.48 MB
2.48 MB PNG
>>102096107
I'm looking forward to seeing the end result of hlky webdev

>>102096590
I think any generalized lora discourse is completely dead because of civit allowing any sort of retard to train and publish loras. there's just an ocean of pure trash now that it'll be impossible to make any general assessment on this generation of loras

>>102096602
batches to a certain point are more efficient, somewhere between 5-15% more efficient per gen. you can just calculate out the average to see if batching is saving you any time or not
>>
File: delux_sf_00035_.png (2.36 MB, 1536x968)
2.36 MB
2.36 MB PNG
>>102096638
I definitely see donkey kong in your dataset. how malleable is it with themes? would be cool to see how it does with different specific games
>>
File: BMP_12142_.png (1.86 MB, 1680x1120)
1.86 MB
1.86 MB PNG
>>102096107
What's the endgame?
>>
File: 00050-1114063323.png (2.04 MB, 896x1152)
2.04 MB
2.04 MB PNG
>>
File: delux_op_00025.jpg (72 KB, 1024x1024)
72 KB
72 KB JPG
>>102096107
I'm looking forward to seeing the end result of hlky webdev

>>102096590
I think any generalized lora discourse is completely dead because of civit allowing any sort of retard to train and publish loras. there's just an ocean of pure trash now that it'll be impossible to make any general assessment on this generation of loras

>>102096602
batches to a certain point are more efficient, somewhere between 5-15% more efficient per gen. you can just calculate out the average to see if batching is saving you any time or not
>>
tired of trying to produce good images, I'll just post bullshit in the prompt window and see what garbage I get. pic related, this is the most beautiful woman in the world
>a perfect picture of the most beautiful woman in the whole world, she is in the most haunting and enchanted place in the whole world, it is the best photo ever taken

what else should I add to this? masterpiece? 4k? greg rutkowski?
>>
,
>>102097226
You are being incredibly vague what do you expect you give it vitrually no details
>>
File: delux_me_00064_.jpg (387 KB, 896x512)
387 KB
387 KB JPG
>>102097139
https://www.youtube.com/watch?v=OLF6FHa2IOU
>>
File: file.jpg (559 KB, 1792x1024)
559 KB
559 KB JPG
>>102097083
not much webdev left now then it's just down to the data
>>102097139
end game is a song by taylor swift
https://open.spotify.com/track/2x0WlnmfG39ZuDmstl9xfX
fr though its something BIG, you'll have to wait and see but you can probably figure it out if you read between the lines of everything ive been working on recently and if you know my past projects
>>
>>102097264
I tried adding more detail, I'm still just getting flickrslop, this is bullshit. FLUX doesn't work. I want my money back.
>a perfect picture of the most beautiful woman in the whole world, she is in the most haunting and enchanted place in the whole world, it is the best photo ever taken. she's wearing the most expensive dress ever fashioned, she has the prettiest eyelashes of any girl who ever lived, throughout the heavens her beauty is envied, it so far surpasses the regions of the stars, it is beyond the beauty of Aphrodite, it exceeds the comeliness of the angels, it is the ever-burning all-consuming fire, a conflagration devouring worlds, the perfect beauty of God
>>
File: BMP_FLUX_10442_.png (1.97 MB, 1024x1024)
1.97 MB
1.97 MB PNG
>>102097311
I knew it. It was the world's greatest coconut lora all along
>>
>>102097412
Ok ok you are trolling
>>
File: 00006-1928866874.png (1.3 MB, 896x1152)
1.3 MB
1.3 MB PNG
neat, i guess, 7 mins...
>>
File: 00016-3250318489.png (881 KB, 896x1152)
881 KB
881 KB PNG
>>
>>102097417
Doesn't anyone else consider it a failure of prompt comprehension that I asked for the most beautiful woman conceivable and it gave me instaslop garbage? What is a prompt supposed to do? What should a good prompt look like in an ideal model?
>>
>>102097412
You forgot masterpiece, trending on artstation
>>
>>102097483
Idk why you are still replying to me
>>
File: delux_sf_00039_.png (2.41 MB, 1536x968)
2.41 MB
2.41 MB PNG
>>102097412
can you describe what you're hoping to get that you're not seeing in the images you've been genning? hard for us to know what you're aiming for

>>102097444
>7 mins
oof. idk how technically minded you are but you can try watching the console outputs and your taskman graphs to see what is eating up time. might be model swapping happening thats hitting a pagefile on a slow drive or something

>>102097477
so heckin smug
>>
>>102097483
'most beautiful woman in the word' is still vague anon, you're not describing her at all.
> a female posing in front of a pink background. She has long, blonde hair styled with a top knot bun, leaving the rest of her hair down. She is wearing a white, sleeveless dress with a flared skirt and a belt that has circular metal eyelets. She also accessorizes with a silver choker necklace and is slightly leaning to one side, giving a playful yet stylish pose. Her makeup is subtle, with a focus on her eyes and pink lips, complementing the overall soft and feminine look. Her head is tilted slightly up and she is looking at the viewer.
There, I gave you a proompt.
>>
>>102097501
Because YOU think it is trolling to point out that I'm NOT getting what I'm asking for. I am inviting YOU to think about what a prompt is supposed to be. In the SD1.x days, we had to pretend we were writing internet alt text, everyone said this was bad, we should be able to describe what we wanted to see. T5 supposedly was a step in that direction. But we're so far from being there that it worries me about where prompt comprehension will be at in 10 years. Is this an intractable problem? Will "beautiful" always continue to mean "internet whores that Indians like"? How do we fix this?

>>102097541
Read the prompt Debo. That's what I want. I want the thing in the prompt.

>>102097552
If I ask for the tastiest meal in the world, a great chef can bring it to me. A good chef can bring me something that, while not quite what I wanted, is suggestive of the thing, gesturing toward it, and I know he knows what I wanted and tried to provide it. A retarded chef brings me slop. "Here you go, the tastiest meal in the world, kebab plate with garlic sauce"

So far we are at the "prompt comprehension is retarded" stage. It has no idea what beautiful means. It thinks this picture is what I'm asking for.
>>
File: 00007-2678770084.png (1.13 MB, 896x1152)
1.13 MB
1.13 MB PNG
>>102097541
not technically minded at all, (surprise). it's just taking awhile to get through 20 steps.
>>
>>102097600
>>102097501
>>
>>102097600
You have no idea how these image generators work do you
>>
>>102097600
If I went into a restaurant and said I want the tastiest meal in the world the server would get pissed off because I'm not telling her if I want a burger or tendies, etc.
>>
File: delux_sf_00040_.png (2.01 MB, 1536x968)
2.01 MB
2.01 MB PNG
>>102097608
there's an nf4 version of schnell available that might be a better option for you because schnell only needs 4 steps to converge.

https://huggingface.co/duuuuuuuden/flux1-nf4-unet
>>
>>102097645
I understand that when I type "dog", it is supposed to show me a dog.

When I type "woman", it shows me a woman. When I type "blue woman", maybe it can do it, or maybe it just shows me a bluish picture of a woman. But it sort of goes in the right direction.

When I type "beautiful image", it shows me embarrassingly stupid images. What gives? What is happening?

>>102097667
yes waitresses are what I would class as "retarded", that's why they get paid very little
>>
>>102097681
How's this different from https://huggingface.co/lllyasviel/flux1-dev-bnb-nf4
>>
>>102097722
it's schnell not dev
>>
>>102097737
What does that mean
>>
>>102097444
FLUX can do smaller resolutions, I'd just lower your gen size. Try e.g. 512x640
>>
>>102097600
Thank you for your service, Sir.
>>
>>102097740
schnell is trained to work in a small number of steps for rapid image generation, it's a different model
>>
>>102097762
Is the quality different? The speed of generation? What does this mean in real world application.
>>
>>102097770
4 steps takes 1/5 as long as 20 steps. the quality is not as good but far better than flux dev at 4 steps.
>>
>>102097779
Thank you. Would you recommend it over the Dev nf4 version fro low end GPU users?
>>
>>102097814
I've never seen the point of the fast models, myself. I don't like them. I'd rather use nf4 dev and turn down the image size.
>>
>>102097839
So no then?
>>
File: delux_sf_00041_.png (2.33 MB, 1536x968)
2.33 MB
2.33 MB PNG
>>102097740
you know how there was sdxl and then sdxl-turbo? its the same deal. there's the "full" model thats slow then the "turbo" model thats architectured for super fast convergence. you get a minor quality hit and slightly worse prompt adherence for a massive speed increase. the downsides aren't too noticable so its definitely worth if the fatter models aren't fitting into your vram
>>
>>102097852
I'm not the one who told you to try schnell, I am just telling you what it is and what the "point" of it is. If you want me to tell you what to do, I say try gennng in dev nf4 at 512x512. Obviously 7 minute gens aren't acceptable, it's up to you to decide what trade-offs you want to make
>>
>newfags still taking advice from n*gbo
>>
>>102097667
8 billion people in the world all with different tastebuds that grew up in different environments, what are you on about?
>>
File: 00009-352299812.png (598 KB, 504x688)
598 KB
598 KB PNG
tried smaller image size. i think this is the sorts deal where i will run it, at proper image size, while im out running errands or sleeping. hard to go from 1.5 and being able to batch like 30 pics in the same time as 1 flux pic. things will keep progressing anyway, modelwise.
>>
>>102097881
thank you for your valuable opinion.
>>
File: 02701.png (47 KB, 640x896)
47 KB
47 KB PNG
>>
File: 00059-1480192349.jpg (101 KB, 511x623)
101 KB
101 KB JPG
>>102097913
>what are you on about?
I was trying to tell him that "tastiest meal,' like 'beautiful woman,' is way too vague and gives them no idea what he wants. What are you on about?
>>
>>102097881
Ok but wasnt turbo universally panned as trash
>>
>>102097957
No you were in the right, also
>food analogy
>>
>>102097901
Hes white
>>
File: 00012-154594992.png (577 KB, 504x688)
577 KB
577 KB PNG
>>
>>102098079
literally me
>>
>>102097957
he's a retard who can't even pick the right post to reply to, ignore him

"beautiful woman" is not vague unless you are retarded. if you are retarded, then you will have no idea what beautiful means. so when asked, you'll say, "what do you mean beautiful? you want an instagram girl? be more specific." Most people are in fact retarded. AI is still trained on the worthless utterances of the retarded masses. Hence its total stupidity on questions of beauty. Can that ever be fixed?
>>
File: 20240720_204915.jpg (866 KB, 1156x1600)
866 KB
866 KB JPG
sadly, still unable to prompt the kind of wiggliness, messyness, of my (bad) sketch art.
>>
File: delux_sf_00042_.png (2.1 MB, 1536x968)
2.1 MB
2.1 MB PNG
>>102097958
I think turbo/flash had a much more noticeable negative effect but they still had people using them and training for them. there was a demographic that got good use from them I think. with flux, the distance between dev/schnell seems a lot smaller in terms of output quality
>>
File: 00014-3942589694.png (592 KB, 504x688)
592 KB
592 KB PNG
my internal state
>>
>>102091958
a reflection of nic's tortured faggot soul
>>
>>102098178
why are you always this mean?
>>
>>102098178
that's not very nice. apologize
>>
>>102098178
how would you describe your soul?
>>
File: 00016-3945238407.png (596 KB, 504x688)
596 KB
596 KB PNG
>>102098211
unrequited love.
>>
>>102094698
>https://github.com/city96/ComfyUI-GGUF
Using Forge mate
>>
>>102098099
Yeah, it really comes down to images in the dataset tagged as 'beautiful woman.' The haveibeentrained on website didn't actually return many results for that tag. Most the images wouldn't display though
https://haveibeentrained.com/search/TEXT?search_text=beautiful%20woman
>>
>>102098301
my condolences
>>
File: delux_sf_00043_.png (2.43 MB, 1536x968)
2.43 MB
2.43 MB PNG
my internal state
>>
a demonic faggot
>>
File: 000000_16900_.png (1.91 MB, 1229x1229)
1.91 MB
1.91 MB PNG
>>102091301
Nice Samus.
>>
>>102098889
no need to announce yourself.
>>
gottem
>>
File: 000000_16908_.png (2.1 MB, 1270x1270)
2.1 MB
2.1 MB PNG
>>102097412
>Nice prompt Anon.
Beautiful.
>>
File: 00021-2547294890.png (671 KB, 512x688)
671 KB
671 KB PNG
>>
>>102098141
You need to make a LoRa of your sketches, you then can get that effect.
>>
>>102099187
at some point i suppose ill try to make a flux lora of them. but it seems id be unable to use my own lora with the version of flux i am able to run. at least for now. everything changes day by day, week by etc..
>>
>>102098141
There's this thing
https://civitai.com/models/12910/shitty-oekaki-jaggy-lines-style
>>
>>102099274
Understood. GL.
>>
File: delux_sf_00045_.png (2.54 MB, 1536x968)
2.54 MB
2.54 MB PNG
>>
File: file.png (17 KB, 476x317)
17 KB
17 KB PNG
>>
File: output.webm (230 KB, 380x380)
230 KB
230 KB WEBM
>>
>>102099570
Nice welcome back
>>
>>102099291
i have known of that lora for awhile, a good one. sadly, it is in no way useful for recreating what i was talking about. désolée
>>
File: file.png (17 KB, 494x316)
17 KB
17 KB PNG
pretty fast desu
half the cost too
gpt-4o-mini lyric descriptions
>a song about coping with loss and the chaotic lifestyle surrounding fame, friendship, and substance use
>>
>>102099570
been trying out any new tech or techniques?
>>
File: output.webm (342 KB, 380x380)
342 KB
342 KB WEBM
>>102099576
ty haha

>>102099594
unfortunatly not to be honest (still mimicmotion), wish i had more time for it but its still fun haha
>>
File: 00023-1151100104.png (822 KB, 888x608)
822 KB
822 KB PNG
queued up full size, flux, gen, while i went to get food, had it queued up to run 'forever', return and see it crashed at the end of the first gen..
>>
File: 000000_16912_.png (1.97 MB, 1270x1270)
1.97 MB
1.97 MB PNG
>>
File: file.png (186 KB, 880x849)
186 KB
186 KB PNG
nice
only cost 18c for 10k descriptions
>>
>>102099853
:(
>>
File: 1724679655277703.png (1.36 MB, 928x1432)
1.36 MB
1.36 MB PNG
>>102091301
I woulda cleaned this up a bit if I knew it was going to be the OP
>>
File: centaur.png (2.37 MB, 1000x1696)
2.37 MB
2.37 MB PNG
>>
File: output.webm (271 KB, 380x380)
271 KB
271 KB WEBM
>>
File: 1.jpg (319 KB, 1496x1168)
319 KB
319 KB JPG
I remember when sdg threads would fill up in like an hour
>>
File: file.jpg (422 KB, 1792x1024)
422 KB
422 KB JPG
also got 1.7m images captioned with florence-2 so far for like $10
now i sleeep
>>
File: tmpgoxbpq08.png (878 KB, 768x1024)
878 KB
878 KB PNG
>>
File: 000000_16914_.png (2.28 MB, 1270x1270)
2.28 MB
2.28 MB PNG
>>102099942
Nice.
>>
File: output.webm (225 KB, 380x380)
225 KB
225 KB WEBM
gn frens
>>
File: tmpw5lc5gig.png (482 KB, 768x1024)
482 KB
482 KB PNG
>>
>>102091301
Is there stable diffusion style programs for ai voices?
>>
>>102100171
like lip syncing?
>>
File: tmpbozpo5zi.png (601 KB, 768x1024)
601 KB
601 KB PNG
>>
File: flux0333.jpg (1.87 MB, 2304x1792)
1.87 MB
1.87 MB JPG
>>102100023
I really enjoy your gens
>>
>>102100171
Check out StyleTTS2
>>
File: tmpmncz737f.png (527 KB, 768x1024)
527 KB
527 KB PNG
>>
File: 000000_16916_.png (2.07 MB, 1270x1270)
2.07 MB
2.07 MB PNG
>>
>>102098168
Same
>>
>>
File: pepe_awoken.png (3.99 MB, 1424x1424)
3.99 MB
3.99 MB PNG
every night, i go to bed, burnt out from genning and wondering what the fuck im doing with my life

every day, i wake up inspired and full of new ideas
>>
File: tmpq2xc1pan.png (654 KB, 768x1024)
654 KB
654 KB PNG
>>
>>102100489
I just wanna make money, do mind altering drugs, fap to loli and die a peaceful life. Fuck the police, fuck the government, fuck everything. Just give me a peaceful neet life.
>>
File: tmpa8eigmlv.png (842 KB, 768x1024)
842 KB
842 KB PNG
You know that thing where prosthetic arms have a strap that you put around the other side of the body? That's what I was trying to do here.
>>
so can flux dev nf4 use loras or no?
>>
ole
>>
File: 00026-4197643495.png (362 KB, 512x688)
362 KB
362 KB PNG
>>102100650
gonna try rn, ill keep you posted
>>
File: tmp7is0wzbj.png (763 KB, 768x1024)
763 KB
763 KB PNG
>>
>>
File: 00000_16923_.png (1.93 MB, 1270x1270)
1.93 MB
1.93 MB PNG
>>
File: tmps32inmyh.png (858 KB, 768x1024)
858 KB
858 KB PNG
>>
File: 00019-2684071944.jpg (207 KB, 996x1280)
207 KB
207 KB JPG
>>
File: tmpivsnq97l.png (1.02 MB, 768x1024)
1.02 MB
1.02 MB PNG
>>
File: 00028-2140771437.png (1.85 MB, 1536x688)
1.85 MB
1.85 MB PNG
so i guess loras work for nf4, but a inconvenient, taking a long time to load. i prob should have picked a lora more unusual than this mjanine lora, well..
left to right, lora at :1, then without lora but still with 'anime style' in prompt, then last with lora at :1.3.
so idk
>>
File: delux_sf_00043_.png (1.29 MB, 1536x968)
1.29 MB
1.29 MB PNG
>>102100489
this is called passion. and someday, when you open your own ai gallery in the middle of some swanky city district, you'll know it was all worth it

>>102100542
better to wish for the abolishment of money instead. so long as there is profit, there is waging, and so long as there is waging, you'll always be paying for the privelage to stay alive. also, we need to government to fight against the super-intelligent AI once it emerges

>>102100650
I think the loras have to be trained for that specific model. don't quote me tho

>>102101175
the lora does *something* but is it doing what its supposed to?
>>
File: tmps2tvcqjj.png (984 KB, 768x1024)
984 KB
984 KB PNG
>>
File: 00021-3357035525.png (1.03 MB, 896x1152)
1.03 MB
1.03 MB PNG
>>
File: flux0332.jpg (1.72 MB, 2304x1792)
1.72 MB
1.72 MB JPG
>>
File: 00033-421704216.png (786 KB, 672x872)
786 KB
786 KB PNG
>>102101199
i guess, idk. hardly need that particular lora anyway.
unrelated but the token merging ratio setting helps speed things up. i have it at .5
>>
File: flux0104.jpg (3.33 MB, 2800x2272)
3.33 MB
3.33 MB JPG
>>
>>102101175
interesting, I couldn't load them at all when I tried, maybe it's different now
>>
>>102091301
Samus sexo
>>
>>102101325
took like 5 or more mins to load that lora. not convenient. ill try again, since i started using the token ratio setting
>>
File: tmp9sjc6qeb.png (1.07 MB, 768x1024)
1.07 MB
1.07 MB PNG
>>
File: 00024-3139500729.png (1.09 MB, 896x1152)
1.09 MB
1.09 MB PNG
Chibi yelling at sunflower 1920s cartoon version
>>
File: 000000_16927_.png (1.91 MB, 1270x1270)
1.91 MB
1.91 MB PNG
>Parasite Banker Hologram Failed
>>
File: tmpganrdpae.png (1.18 MB, 768x1024)
1.18 MB
1.18 MB PNG
>>
File: chibi paper.png (400 KB, 800x450)
400 KB
400 KB PNG
>>102101454
I shoulda saved the photoshop file so I could just swap out the chibi image. I didn't know this was gonna be a thing
>>
File: 00025-882709954.png (999 KB, 896x1152)
999 KB
999 KB PNG
>>102101498
Haha
>I didn't know this was gonna be a thing
me either
>>
File: 00035-3183592829.png (787 KB, 672x872)
787 KB
787 KB PNG
état interne
>>
File: tmpfa_3dys2.png (1.19 MB, 768x1024)
1.19 MB
1.19 MB PNG
>>
File: 00027-608423957.png (1.04 MB, 896x1152)
1.04 MB
1.04 MB PNG
>>
mfw glance through the dalle thread and the pics greatly mog s/ldg pics
>>
>>102101731
The good posters have long left
>>
File: 00041-460970725.png (889 KB, 872x672)
889 KB
889 KB PNG
>>102101734
cant imagine why
>>102101723
buono.
trying to gen a renaissance fair centaur woman but i am goofing it up. idk why
>>
with flux, are you using 'distilled cfg scale' at 3.5 or?
>>
>>
File: basegen_00321_.png (1.22 MB, 1024x1024)
1.22 MB
1.22 MB PNG
>>102097129
This is funny because I didn't use any Donkey Kong there - actually, it was entirely images from Gunvolt 3 and Luminous Avenger iX 2, because they have a similar (likely the same?) pixel size and type of filtering. (incidentally I highly recommend GV3)

The issue I have is that it's difficult to cultivate a dataset with a consistent pixel size and style. As such it's currently only good with reproducing the specific kinds of images from the dataset though, sometimes, it can produce images of scenes that are sorta in the right style, like the attached

I hope to manipulate it into working better, eventually
>>
File: 00029-4196714435.png (1.02 MB, 896x1152)
1.02 MB
1.02 MB PNG
>>102101758
>renaissance fair centaur
interesting
>>
>>
File: 766476681356570552.png (683 KB, 832x1216)
683 KB
683 KB PNG
Flux output for Joycaption output of a CGI image of Tifa getting full nelsoned lmao (blur added by me of course)
>>
File: 00047-2378199244.png (1.51 MB, 1152x896)
1.51 MB
1.51 MB PNG
new toy, oh joy
>>
>>102102167
2 tails?
>>
File: 00048-2378199244.png (1.39 MB, 1152x896)
1.39 MB
1.39 MB PNG
>>102102179
ha, honestly didnt notice. that is at 3.5, and this is at 4.5, distilled cfg. surprised by the difference. same seed. but it seems fried. maybe an issue of a very basic prompt.
>>
SD3 is DOOMED
>>
>>102102233
your mouse is a gay boy. ever consider???? lol i wish you good gens in future though
-fag lord XL
>>
File: 00049-259400757.png (705 KB, 1152x896)
705 KB
705 KB PNG
chatgpt'd the prompt to make it more detailed, but got blurry result. je ne sais pas
https://youtu.be/zSTuUKWCmww?si=DvLG87_blBnLqeMI
>>
>>102100489
I have never seep pepe so happy.
>>
File: delux_sf_00036_.png (1.84 MB, 1536x968)
1.84 MB
1.84 MB PNG
>>102102261
wow so you're the legendary fag lord XL?
>>
File: 00050-259400757.png (641 KB, 1152x896)
641 KB
641 KB PNG
how i feel
what is your fav cheese? for me, its gouda
>>
File: tmpdrfu6v11.png (1.01 MB, 896x1152)
1.01 MB
1.01 MB PNG
>>
what happened to our thread? very quiet tonight
>>
File: 00032-3763484706.png (834 KB, 896x1152)
834 KB
834 KB PNG
>>102102426
I don't know, it's not really good by itself. So, it depends on what you are eating
>>
>>102102542
I didn't get any (You)s so I reported everyone
>>
File: 00000-2543973863.png (1.86 MB, 1152x896)
1.86 MB
1.86 MB PNG
>>102102552
>not good by itself
we are not on the same cheese page
>>
File: 00375-2007999.png (1.42 MB, 896x1344)
1.42 MB
1.42 MB PNG
>>102102426
white goat cheese and cheddar
>>
File: BigComparison.jpg (3.55 MB, 4160x1248)
3.55 MB
3.55 MB JPG
SD3, Ideogram 2.0, Leonardo Phoenix, Base SDXL + Loras, Flux, except the pictures are in a totally different order than I listed the models. Can you figure out which is which? Same seed and prompt for all.
>>
>>102102571
based
>>
File: 00001-901487988.png (1.43 MB, 1152x896)
1.43 MB
1.43 MB PNG
remembering now, as a young kid, pretty much white trash, snacking on slices of american cheese, or slices of boloney with mustard, folded like a taco. goodolddays
>>
File: delux_sf_00034_.png (1.86 MB, 1536x968)
1.86 MB
1.86 MB PNG
>>102102426
cheddar goes with everything

>>102102542
do you want quiet or do you want hyper-sperging? we don't get any in-between

>>102102668
relatable, apart from the mustard
>>
File: 00003-3530120532.png (773 KB, 896x1152)
773 KB
773 KB PNG
>>
>>102102698
>do you want quiet or do you want hyper-sperging?
I want action
https://www.youtube.com/watch?v=UfBrhl566zQ
>>
File: delux_me_00069_.jpg (289 KB, 896x512)
289 KB
289 KB JPG
cheese
>>
>>102102817
>>102102817
>>102102817
>>
File: delux_sf_00037_.png (1.87 MB, 1536x968)
1.87 MB
1.87 MB PNG
>>
File: delux_sf_00038_.png (2.11 MB, 1536x968)
2.11 MB
2.11 MB PNG
>harry potter 2049
>>
File: delux_ci_00042_.png (2.03 MB, 1536x968)
2.03 MB
2.03 MB PNG
>>
File: delux_ca_00025_.png (1.37 MB, 1344x768)
1.37 MB
1.37 MB PNG
>>
File: delux_me_00071_.jpg (228 KB, 896x512)
228 KB
228 KB JPG
egg
>>
File: 1716841758133099.jpg (7 KB, 128x112)
7 KB
7 KB JPG



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.