[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: PW_100173_.png (2.34 MB, 1600x2000)
2.34 MB
2.34 MB PNG
Previous /sdg/ thread : >>103546064

>Beginner UI local install
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Local install
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SD.Next: https://github.com/vladmandic/automatic
InvokeAI: https://github.com/invoke-ai/InvokeAI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>SD 3.5 info & download
https://rentry.org/sdg-link#sd35
https://civitai.com/models/896953/stable-diffusion-35-medium
https://huggingface.co/city96/stable-diffusion-3.5-medium-gguf
---
https://civitai.com/models/878387/stable-diffusion-35-large
https://huggingface.co/city96/stable-diffusion-3.5-large-gguf

>Try online without registration
sd3.5-medium: https://replicate.com/stability-ai/stable-diffusion-3.5-medium
sd3.5-large: https://replicate.com/stability-ai/stable-diffusion-3.5-large
sd3.5-turbo: https://replicate.com/stability-ai/stable-diffusion-3.5-large-turbo
flux-dev: https://huggingface.co/spaces/black-forest-labs/FLUX.1-dev
txt2img: https://www.mage.space

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://aitracker.art
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html

>Share image prompt info
4chan removes prompt info from images, share them with the following guide/site...
https://rentry.org/hdgcb
https://catbox.moe

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/vp/napt
>>
>>
File: ComfyUI_03081_.jpg (816 KB, 1792x2304)
816 KB
816 KB JPG
>>
File: armor.jpg (149 KB, 1536x1152)
149 KB
149 KB JPG
>>
File: 121351-tmp.png (2.61 MB, 1536x1824)
2.61 MB
2.61 MB PNG
>>
File: GdLj9-hawAAWgsB.jpg (1007 KB, 2536x1432)
1007 KB
1007 KB JPG
there's also >>>/jp/2huai btw
>>
>mfw Resource news

12/17/2024

>HunyuanVideo GP: Large Video Generation Model - GPU Poor version
https://github.com/deepbeepmeep/HunyuanVideoGP

>HunyuanVideo GGUF Models
https://huggingface.co/city96/HunyuanVideo-gguf

>Ruyi-Mini-7B: Image-to-video model by CreateAI
https://huggingface.co/IamCreateAI/Ruyi-Mini-7B

>SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion Models
https://github.com/Snowfallingplum/SHMT

>Exploring Enhanced Contextual Information for Video-Level Object Tracking
https://github.com/kangben258/MCITrack

>Video Diffusion Transformers are In-Context Learners
https://github.com/feizc/Video-In-Context

>GridShow: Omni Visual Generation
https://github.com/Should-AI-Lab/GRID

>NVIDIA Unveils Its Most Affordable Generative AI Supercomputer
https://blogs.nvidia.com/blog/jetson-generative-ai-supercomputer

>EvalGIM: A Library for Evaluating Generative Image Models
https://github.com/facebookresearch/EvalGIM

>How Donald Trump will affect AI development
https://www.thenationalnews.com/future/technology/2024/12/16/donald-trump-ai-regulation

12/16/2024

>ColorFlow: Retrieval-Augmented Image Sequence Colorization
https://zhuang2002.github.io/ColorFlow

>Generative Inbetweening through Frame-wise Conditions-Driven Video Generation
https://fcvg-inbetween.github.io

>New Report Reveals How Anime & Manga Industry Is Using Generative AI
https://animehunch.com/new-report-reveals-how-anime-manga-industry-is-using-generative-ai

>Google Announces Veo 2: Their State-of-the-art Video Model
https://deepmind.google/technologies/veo/veo-2

>ComfyUI-HunyuanVideo-Nyan: Scale CLIP & LLM influence
https://github.com/zer0int/ComfyUI-HunyuanVideo-Nyan

>Will AI Make Universal Basic Income Inevitable?
https://www.forbes.com/sites/bernardmarr/2024/12/12/will-ai-make-universal-basic-income-inevitable

>Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation
https://github.com/nhw649/SCSD
>>
>mfw Research news

12/17/2024

>DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes
https://dynamic-scaler.pages.dev/

>Exploring Diffusion and Flow Matching Under Generator Matching
https://arxiv.org/abs/2412.11024

>SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer
https://arxiv.org/abs/2412.10958

>Progressive Compression with Universally Quantized Diffusion Models
https://arxiv.org/abs/2412.10935

>Do large language vision models understand 3D shapes?
https://arxiv.org/abs/2412.10908

>PEARL: Input-Agnostic Prompt Enhancement with Negative Feedback Regulation for Class-Incremental Learning
https://arxiv.org/abs/2412.10900

>Zigzag Diffusion Sampling: The Path to Success Is Zigzag
https://arxiv.org/abs/2412.10891

>Unbiased General Annotated Dataset Generation
https://arxiv.org/abs/2412.10831

>Diffusion Model from Scratch
https://arxiv.org/abs/2412.10824

>Optimizing Few-Step Sampler for Diffusion Probabilistic Model
https://arxiv.org/abs/2412.10786

>StyleDiT: A Unified Framework for Diverse Child and Partner Faces Synthesis with Style Latent Diffusion Transformer
https://arxiv.org/abs/2412.10785

>One Pixel is All I Need
https://arxiv.org/abs/2412.10681

>Memory-Efficient 4-bit Preconditioned Stochastic Optimization
https://arxiv.org/abs/2412.10663

>SUGAR: Subject-Driven Video Customization in a Zero-Shot Manner
https://yufanzhou.com/SUGAR

>Automated Image Captioning with CNNs and Transformers
https://arxiv.org/abs/2412.10511

>SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device
https://snap-research.github.io/snapgen-v

>SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation
https://safetydpo.github.io

>SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers
https://svgbuilder.github.io

>SVGFusion: Scalable Text-to-SVG Generation via Vector Space Diffusion
https://ximinng.github.io/SVGFusionProject
>>
File: PW_HV_00010.mp4 (334 KB, 848x480)
334 KB
334 KB MP4
>>
File: Christmas_0002.jpg (711 KB, 1664x2432)
711 KB
711 KB JPG
>>103556076
>>
File: flux1-dev-Q2_K.gguf.jpg (221 KB, 966x2454)
221 KB
221 KB JPG
>>103555845
Asking anyone who has experienced using .gguf models: I don't be original flux dev model is like 23 GB in size but people have been able to create "quanted" versions that are fractions of the original size. Currently the smallest one I could find was four gigabytes. How do the quantited versions compare to the full thing? Are they worse in quality and performance? Is it okay in one area but worse in another? If I were to use pic rel instead of the original full fp16 version, What differences should I expect assuming I use the exact same prompt and settings?

https://huggingface.co/city96/FLUX.1-dev-gguf/tree/main
>>
File: armor2.jpg (114 KB, 1536x1152)
114 KB
114 KB JPG
>>
File: 00049-1192218191.png (765 KB, 1024x1024)
765 KB
765 KB PNG
>>
File: 121354-tmp.png (3.27 MB, 1536x1920)
3.27 MB
3.27 MB PNG
>>103556250
>>
File: PW_HV_00008.mp4 (301 KB, 480x848)
301 KB
301 KB MP4
>>103556302
Cute! I love the little witch near the bottom LOL
You should do my catgirl too hahaha! She has white hair and green eyes
I wonder if I can get duos in an animation
I'm gonna try!
>>
File: 121303-tmp.png (3.15 MB, 1536x1824)
3.15 MB
3.15 MB PNG
>>103556336
>You should do my catgirl too hahaha! She has white hair and green eyes
What clothes?
>>
File: demr_00130_.png (1.99 MB, 1824x1248)
1.99 MB
1.99 MB PNG
>>103556250
nice, you finally got hunyuan up and running. looks great. she sure is yappin', lol
>>
File: 00051-1672377942.png (1.61 MB, 1024x1024)
1.61 MB
1.61 MB PNG
>>
File: Christmas_0004.jpg (616 KB, 1664x2432)
616 KB
616 KB JPG
>>
File: 121358-tmp.png (2.9 MB, 1536x1824)
2.9 MB
2.9 MB PNG
>>
File: 121360-tmp.png (2.84 MB, 1536x1824)
2.84 MB
2.84 MB PNG
>>
File: 00043-609622639835242.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>103556267
Someone made a big chart comparing the different quants with a prompt of Migu. Don't have it on hand, sorry, but I'm sure someone here does. Many of them were actually very close in quality to the full version with minor worsening in text readability, but there was one that had a totally different composition and got the perspective of the prompt wrong. It may have been Q4_K_S, but I do not remember at all, sorry.
If the only reason you want to use the smallest one is because you're a VRAMlet, I invite you to use Forge instead of ComfyUI. It has black magic that supports loading part of the model in VRAM and the rest of it in normal RAM. I'm running flux1-dev-fp8 on a [spoiler]laptop[/spoiler] 4060 that has just 6GB of VRAM.
>>
File: decs_00001_.png (2.35 MB, 2016x1152)
2.35 MB
2.35 MB PNG
>>103556267
>How do the quantited versions compare to the full thing?
depends on which ones you use because they all are crunched down into different degrees of precision. Q8_0 is basically equivalent to the fp16 full model

>>103556302
suprise secret mini-witch

>>103556404
>punk rock christmas album covers
hmm, thats kind of a fun idea for a prompt

>>103556415
how do you prompt for those cat eyes?
>>
File: 1723702585589.jpg (3.84 MB, 7961x2897)
3.84 MB
3.84 MB JPG
>>103556267
>>103556470
>comparing the different quants with a prompt of Migu
black migu, actually
>>
File: 00052-3153117390.png (1.5 MB, 1376x1024)
1.5 MB
1.5 MB PNG
>>103556485
earlier this year, i, or civitai, trained a flux lora off of old punk album covers.
pic unrelated, using a lora trained off of old horror/experimental film
>>
File: PW_99237_.png (1.51 MB, 1280x768)
1.51 MB
1.51 MB PNG
>>103556381
Like a long white fur jacket, green skirt and long white boots haha
>>103556402
Yeah!! It's so fun!
LMAO ikr idk why she talks so much hahaha
I put "silently sipping tea earlier" but that didn't work haha!
>>
What do people use to train image LoRAs these days? I'm still using a pretty old version of koyha_ss, and was thinking I should give OneTrainer a go. It's for PonyXL-based model LoRAs.
>>
File: 00065-2207324615.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
>>103556491
Wow, I got the details completely wrong, huh? Thanks for sharing it.
>>
File: 121365-tmp.png (2.63 MB, 1536x1824)
2.63 MB
2.63 MB PNG
>>103556485
>how do you prompt for those cat eyes?
slit pupils
>>
File: PW_99320_.png (1.36 MB, 1280x768)
1.36 MB
1.36 MB PNG
>>103556522
This one is probably a better example hahaha!
>>103556415
>>103556466
These are so cute hahaha
>>
How do you get more dramatic lighting like with SD 1.5 with Pony?
>>
File: 121367-tmp.png (2.72 MB, 1536x1824)
2.72 MB
2.72 MB PNG
>>103556522
>Like a long white fur jacket, green skirt and long white boots
>>
File: 00053-1977366249.png (1.64 MB, 1376x1024)
1.64 MB
1.64 MB PNG
>>
File: decs_00003_.png (2.06 MB, 2016x1152)
2.06 MB
2.06 MB PNG
>>103556567
high contrast, {ambient|rim|dark|dramatic} lighting, deep shadows
HDR in negatives

>>103556542
catgirl fishing vids would be fun. how fast have video gens been for you? there's some new quantized models out that might make it faster
>>
File: 121369-tmp.png (2.45 MB, 1536x1824)
2.45 MB
2.45 MB PNG
>>
>>103556250
is this img2vid or a lora
is there even img2vid yet
>>
File: PW_HV_00013.mp4 (211 KB, 848x480)
211 KB
211 KB MP4
>>103556567
With pony I use (dynamic lighting) in my proompt! You can also upscale a mix of two images and use "multiply" in the image blend!
Also this LoRA works really well!
>https://civitai.com/models/372763/cinematic-anime-scenery-enhance-your-scenes-pony-xl
>>103556576
This is so cute!!
Oh yea I like to add a little bell choker sometimes too hahaha!
I guess it'd just be a collar since shes a cat LOL
>>103556584
Ohhhh I should try that!! It takes about 3 mins each gen!
>>103556604
This is hella cute too! I love the pose!
Here's my first attempt! I need to work on the proompt haha
>>
File: 00017-1.png (944 KB, 1024x768)
944 KB
944 KB PNG
>>103556584
>high contrast, {ambient|rim|dark|dramatic} lighting, deep shadows
>HDR in negatives
Seems to help somewhat, thanks.
>>
File: PW_HV_00002.mp4 (178 KB, 848x480)
178 KB
178 KB MP4
>>103556623
This is txt2img! Uhmm i'm not sure, there might be haha!
I gotta look into that!
>>
File: 121370-tmp.png (2.67 MB, 1536x1824)
2.67 MB
2.67 MB PNG
>>103556624
>Fran speaks
>PW leaves in disgust
>>
File: 00029-2735910962.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
Is there a way to tell Flux the exact colour you want for something? I've tried every permutation of "azure", "cobalt" and "blue", but this balloon always turns out a light blue. "Sonic-colored" or "color of Sonic the Hedgehog" do the same, even if Sonic himself is also in the gen with his right colour. "Dark blue" usually goes for navy blue, like the PJs here. I even tried giving a hex code, but Flux doesn't understand that and uses whatever colour it wants instead.
>>
howdy
>>103556542
>>103556522
>pw using [catgirl] [cute outfit] [rando animal] [weapon] now
idk how to feel about this
>>
File: 121372-tmp.png (2.58 MB, 1536x1824)
2.58 MB
2.58 MB PNG
>>
File: PW_HV_00014.mp4 (427 KB, 848x480)
427 KB
427 KB MP4
>>103556640
LOL Fran is angry in this one
>>103556657
Good evening!! It's great to see you again :D
I hope you've had a great day so far!
LOOL she's my anti-vacation cat
>>
File: decs_00005_.png (2.13 MB, 2016x1152)
2.13 MB
2.13 MB PNG
>>103556649
just say "dark blue ball, dark blue ball, dark blue ball" over and over again until it works. persistence is key
>>
File: 00054-2964995847.png (2.48 MB, 1024x1376)
2.48 MB
2.48 MB PNG
>>
>>103556691
the more the merrier i say
>>103556649
try (-)
"cobalt-blue"
but "(cobalt-blue)" or higher emphasis may lead to success
>>
File: beefy.jpg (119 KB, 1536x1152)
119 KB
119 KB JPG
>>
File: decs_00007_.png (2.52 MB, 2016x1152)
2.52 MB
2.52 MB PNG
>>103556766
who's the ranking officer?
>>
>>103556649
try >]:-|_=}$~
>>
>>103556780
Situation is more complicated than it seems.
>>
>>
File: myhouse.png (2.43 MB, 1288x968)
2.43 MB
2.43 MB PNG
>>
>>103556822
nice
>>
File: eldritch_abomination.jpg (144 KB, 1536x1152)
144 KB
144 KB JPG
>>
File: decs_00008_.png (2.24 MB, 2016x1152)
2.24 MB
2.24 MB PNG
>>103556822
something about the color scheme and the mushrooms make me think of terraria
>>
File: desd35_00002_.png (2.87 MB, 1344x1344)
2.87 MB
2.87 MB PNG
sd35 doesn't want to do text
>>
>>103556925
you cant go wrong with eldritch
>>
File: delux_me_00001_.jpg (1.2 MB, 1024x1024)
1.2 MB
1.2 MB JPG
flux gets it first try but the image is very flux
>>
>>103556972
Text is incredibly dangerous therefore it's censored.
>>
File: 121378-tmp.png (3.67 MB, 1688x2000)
3.67 MB
3.67 MB PNG
>>103556691
Nice
>>
File: desd35_00005_.png (3.72 MB, 1344x1344)
3.72 MB
3.72 MB PNG
>>103557012
probably true. when you think about it, every slur is spelled with text
>>
>>
Comfy, if you're in here with the other anime trannies, can you confirm whether or not sage attention is working for hyvid ggufs?
>>
File: PW_HV_00026.mp4 (191 KB, 848x480)
191 KB
191 KB MP4
>>103556972
Did you do something like (text: "blah blah blah")?
That usually works for me haha
>>103557016
Ty! :D
I'm trying to do more but they're not coming out how I want em to hahaha
>>
File: desd35_00006_.png (2.79 MB, 1344x1344)
2.79 MB
2.79 MB PNG
>>103557046
>the album is titled "Red and Green Wasteland" in large font text
yields not even a bit of text
>>
File: delux_00002_.png (2.94 MB, 1344x1344)
2.94 MB
2.94 MB PNG
>>103557046
this image reminded me of the whole opening sequence I imagined for Infinite Burger 2: The Search for Quokka: The Movie. after the high octane finale of the original series, I thought a fun contrast would be to randomly open in a non-descript diner. there's a lot to the construction of the scene that acts as a narrative device to skim over the previous events, but the jist is that its casually revealed that the diner sits at the nexus of all latent space where every universe intersects. this sets the premise of the movie, where the cast travels through the sdg multiverse trying to find quokka
>>
File: boy.jpg (103 KB, 1536x1152)
103 KB
103 KB JPG
>>
File: 121380-tmp.png (3.58 MB, 1688x2000)
3.58 MB
3.58 MB PNG
>>103557046
That ones pretty good. Are you only able to make them 3 to 5 seconds long?
>>
File: PW_00026_.png (1.64 MB, 1280x768)
1.64 MB
1.64 MB PNG
>>103557052
Oh wow weird!
It should work that way haha!
>>103557094
Looks like you got the text to work! Nice!
Hahaha! It really does look like a real anime huh
>>103557105
Ty! I've only tried 3 so far, i'll try to make it longer now!
>>
File: delux_00003_.png (3.04 MB, 1344x1344)
3.04 MB
3.04 MB PNG
>>103557100
Angel Maikeru: Be Safe When Crossing Into Heaven! (2024)

>>103557131
>It should work that way haha!
maybe there's too much other stuff in the prompt so it gets confused. flux handles it fine
>>
File: PW_00028_.png (1.57 MB, 1280x768)
1.57 MB
1.57 MB PNG
>>103557140
I should have waited, this one came out way better hahaha!
Ohh probably! I would put it near the top of the proompt
>>
What is the easiest/fastest gui module for Python? I installed PySimpleGUI but it displayed some nag screen and started to beg money, wtf. I don't want to use Gradio either because it's tied to web browser.
>>
>>103557167
What's the hang-up about using a browser based gui?
>>
>>
>>103557177
I don't want to run things inside my browser.
>>
File: desd35_00007_.png (2.73 MB, 1344x1344)
2.73 MB
2.73 MB PNG
tried reworking the prompt a bit. sd35 always has cooler aesthetics but it just wont do text
>>
Tkinter it is then.
>>
>>103557211
I got that much, but why?
>>
File: Christmas_0005.jpg (471 KB, 1664x2432)
471 KB
471 KB JPG
Good night
>>
>>103557267
If you need to ask, you don't need to know.
>>
File: delux_00007_.png (2.94 MB, 1344x1344)
2.94 MB
2.94 MB PNG
>>103557274
gn
>>
>>103557290
You think your outputs being on a web browser means its somehow exposed to the internet and thus your gens aren't private.

I wanted to hear you say it yourself.
>>
>>103556023
https://files.catbox.moe/vjxfjs.json
i just tried upscaling a little bit twice that time, seems to help with seams, but still doubles things once they're too big. i know i can reduce the denoise, but then it won't draw new details. seems like a rock and a hard place type situation
the goal naturally is to make something that looks like it was generated directly in a higher resolution, but trying that results in distorted features, like really long torsos, which i assume is because it wasn't trained at that resolution

oh, any other tip that stand out are welcome, i'm mostly just changing things to see what effect they have atm, like for example i don't really know the differences between different samplers/schedulers
>>
File: 08028-935967822.jpg (233 KB, 1432x1840)
233 KB
233 KB JPG
>>
>>103557313
Go back to /ldg/, retard.
>>
>>103557322
Running through a browser doesn't mean your outputs can be seen lol.
>>
>>103556470
>I'm running flux1-dev-fp8 on a [spoiler]laptop[/spoiler] 4060 that has just 6GB of VRAM.
Gaming laptop or a normal one?
>>
File: rollerskate.jpg (120 KB, 1536x1152)
120 KB
120 KB JPG
>>
>>
File: desd35_00014_.png (3.08 MB, 1344x1344)
3.08 MB
3.08 MB PNG
>>
File: PW_HV_00027.mp4 (324 KB, 848x480)
324 KB
324 KB MP4
LMAO I waited 13 mins for a gen and went oom
I guess ima stick to short anims for now hahaha
>>
>>
File: desd35_00013_.png (2.53 MB, 1344x1344)
2.53 MB
2.53 MB PNG
>>103557455
https://suno.com/song/71f262fa-c74d-453f-b72f-f79f4c07ba2c

>>103557508
lmao that pw expression is hilarious
>>
>>
File: PW_HV_00028.mp4 (431 KB, 848x480)
431 KB
431 KB MP4
>>103557577
LOOL it is hahaha
Literally mfw my comfy crashed trying to load a long anim LMAO
I forgot to specify a lake in this gen lol whoops
>>
File: grid-0005.jpg (1.04 MB, 3072x3072)
1.04 MB
1.04 MB JPG
>>103556706
>>103556750
Thanks for the tips. Definitely got warmer in my attempts, but no success yet. I forgot (emphasis) works, I misremembered it as not working with Flux when it's probably that it didn't work with ComfyUI. It's annoying to experiment when gens take long, but I should be grateful that Forge is double the speed of Comfy.
>>103556787
This has some very interesting effects on both the balloon itself and the drawing on it. Is this some encoding of something and that's where the symbols and wild gradients come from?
>>103557341
Gaming. pls no bully :(
>>
>>103557317
https://files.catbox.moe/0rsjf8.json
rushed job but should do the trick, left some notes
>>
>>103557639
Stumbled on it in my Pixart craze. Works on all models.
>>
File: PW_HV_00029.mp4 (326 KB, 848x480)
326 KB
326 KB MP4
>>
>>103557317
>>103557691
Use this one as a complete one >>103553966
Doesn't have notes but I'm sure you'll figure out once you compare the two.
>>
File: desd35_00010_.png (3.95 MB, 1344x1344)
3.95 MB
3.95 MB PNG
>>
File: xeno_nurse.jpg (403 KB, 1536x1152)
403 KB
403 KB JPG
>>
File: desd35_00009_.png (3.87 MB, 1344x1344)
3.87 MB
3.87 MB PNG
everyone died. we had a good thing going for a bit
>>
File: 1722085004118611.png (144 KB, 400x712)
144 KB
144 KB PNG
>>
File: 121409-tmp.png (3.34 MB, 1536x2016)
3.34 MB
3.34 MB PNG
>>103557721
This one's really good, she's cute.
>>
File: 121411-tmp.png (3.72 MB, 1536x2016)
3.72 MB
3.72 MB PNG
>>
File: desd35_00008_.png (2.52 MB, 1344x1344)
2.52 MB
2.52 MB PNG
>>103557944
what do the reject gens look like?
>>
File: PW_HV_00034.mp4 (366 KB, 848x480)
366 KB
366 KB MP4
>>103557931
It just takes hella long for me to gen now LOL
>>103557952
Thanks so much! :D
I have another pretty good one, but idk LOL her skirt is a bit too short haha
>>
File: 1708479301785854.png (23 KB, 512x512)
23 KB
23 KB PNG
>>103557990
It's a variety
>>
File: desd35_00017_.png (2.94 MB, 1344x1344)
2.94 MB
2.94 MB PNG
>>103558009
was listening to this and the way pw is yappin fit the song and made me chuckle
https://soundcloud.com/amari-o/dirty-stick-w-roland-jones

>It just takes hella long for me to gen now LOL
did you test out a quant model yet?
>>
>>103553966
>>103557691
thanks, i'll spend some time on it later figuring out what you've added. i like to understand what it's doing
>>
File: desd35_00018_.png (3.39 MB, 1344x1344)
3.39 MB
3.39 MB PNG
>>
File: space.jpg (139 KB, 1536x1152)
139 KB
139 KB JPG
>>
File: PW_HV_00037.mp4 (215 KB, 848x480)
215 KB
215 KB MP4
>>103558036
LMAOO why does that fit so well?!
Uhmmm i'm not quite sure haha i'm doing like fp8 or something idk
>>
File: 121417-tmp.png (3.67 MB, 1536x2016)
3.67 MB
3.67 MB PNG
>>103558207
I like the mischievous grin she has in this. Are you using a particular artist in your prompt? This style looks familiar.
>>
File: PW_HV_00036.mp4 (135 KB, 848x480)
135 KB
135 KB MP4
>>103558236
Ty hahaha!
This one definitely overdid the grin LOOL
I'm not, but I should totally try!! What artist were you thinking? I'll try throwing it in!
Cute Nurse Fran!!
>>
>>103557639
>Gaming. pls no bully :(
How long does a typical Flux dev image take to gen? What pc do you have? I'm considering getting a gaming PC specifically for SD. Not being reliant on the cloud and being able to run it on my own hardware would be nice.
>>
>>103557721
looks pretty good. how long does it take on your 4090? I know it's in the region of minutes but still curious to see how it compares to numbers other anons have posted.
>>
>>103557691
trying to figure out which lora you have listed on yours, and it seems someone made a pony miyako lora 8 days ago, right after i made my own because i couldn't find one. what a coincidence
pic related, taking screenshots for training 4 days before that one on civit was released
it was a good opportunity/reason to learn to train a lora though. plus mine does the red hoodie outfit more accurately than those examples
>>
File: desd35_00020_.png (2.39 MB, 1344x1344)
2.39 MB
2.39 MB PNG
>>103558269
she looks like she's about to challenge me to a yugioh duel and she's gonna cheat
>>
File: PW_HV_00050.mp4 (163 KB, 848x480)
163 KB
163 KB MP4
>>103558285
Thanks so much, anon! They take about 3mins and 10secs or so!
>>103558325
LMAOOO like that bug guy with the big glasses!!
>>
File: 121421-tmp.png (3.02 MB, 1536x2016)
3.02 MB
3.02 MB PNG
>>103558373
Pretty good. As for the artist question I suppose it just has an early 2000's vibe.
>>
File: PW_HV_00051.mp4 (102 KB, 848x480)
102 KB
102 KB MP4
>>103558402
Ty haha
Ohhh gotcha! I kinda wanna try an artist now!
I dunno how to get the hair to look like yours with this model haha
It kinda just does what it wants
>>
>>103558373
>Thanks so much, anon! They take about 3mins and 10secs or so!
Nice, that seems relatively quick vs one anon in the other thread on his 3090. He's hitting 8mins+ and that's at a lower resolution than what you're at too. Which size of the hunyuan checkpoint are you using?
>>
File: PW_HV_00054.mp4 (123 KB, 848x480)
123 KB
123 KB MP4
>>103558465
I'm using the 720 fp8 one! fp16 makes my comfy die instantly LOL
>>
>>103558441
>I dunno how to get the hair to look like yours with this model
Maybe go with shoulder length curly hair?
>>
File: PW_HV_00056.mp4 (441 KB, 480x848)
441 KB
441 KB MP4
>>103558528
I'll give that a try haha I was doing (short curly hair) then (short curly ringlet hair)
Idk why this makes the mouth move so much LOL
I need to figure out how to stop that! I haven't played with the settings all that much
>>
>>103558479
thanks for this info, might give it a go on the weekend!
>>
File: PW_HV_00059.mp4 (576 KB, 480x848)
576 KB
576 KB MP4
LOL I have no idea how I got this style
>>103558594
I hope it works well for you as well, anon!! Good luck :D
>>
File: PW_HV_00060.mp4 (475 KB, 480x848)
475 KB
475 KB MP4
>>
File: 00201-2118785944.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>103558284
>How long does a typical Flux dev image take to gen?
With the Euler sampler and CFG Scale set to 1, a 1024x1024 gen takes ~1:24-1:27. Setting CFG any higher doubles it. Everyone advises not to do this because Flux doesn't support a negative prompt, but it still affects the way the gen comes out, so it can be worth messing with. Note that Forge has both a "Distilled CFG Scale" and "CFG Scale", and I'm talking about the latter. The former is called something like "Guidance" in ComfyUI.
>What pc do you have?
It's an Acer Nitro 5 AN515-58-79Q1 laptop which I upgraded to 64GB of RAM. I would strongly advise looking into desktop options instead of laptops because the laptop versions of cards are inherently weaker than desktops first of all, and because they're usually much more annoying to upgrade in any way - mine is an exception, and only for RAM and SSDs. I'm considering putting together a desktop with an RX 7900 XTX because it has 24GB of VRAM, though I'll have to do more research on how stuff like ZLUDA works.
>>
File: 5678-8.jpg (168 KB, 793x1122)
168 KB
168 KB JPG
>the entire thread was excellent
>>
File: 123412345678.jpg (196 KB, 793x1122)
196 KB
196 KB JPG
>>103556250
>>103556302
so cute
>>
File: 121427-tmp.png (2.8 MB, 1536x1728)
2.8 MB
2.8 MB PNG
>>103558693
Needs thicker eyebrows
>>
File: 123412341234.jpg (196 KB, 793x1122)
196 KB
196 KB JPG
>>103558009
lil lawbreaker ;3
>>
Is there any update on flux? Still using the nf4 model
>>
File: PW_HV_00064.mp4 (618 KB, 480x848)
618 KB
618 KB MP4
>>103558724
>>103558738
Hey you!! Welcome back :D
Great to see you again!
>>103558747
Omg how did I forget the thick eyebrows!! LOL
>>103558766
Always! Hahaha!
>>
File: PW_HV_00066.mp4 (433 KB, 480x848)
433 KB
433 KB MP4
>proompt (massive thick eyebrows)
>get this
LMAO wtf
>>
File: 5678-888.jpg (179 KB, 793x1122)
179 KB
179 KB JPG
>>103558786
~<3
>>
>>103558766
whoops lost a finger kek
>>
File: PW_HV_00045.mp4 (182 KB, 848x480)
182 KB
182 KB MP4
>>103558836
Also thank you for the kind words!! <3
Your gens are so cute as always!!!
I like the calendar idea haha I haven't seen anyone do that yet!!
>>
File: PW_HV_00071.mp4 (398 KB, 480x848)
398 KB
398 KB MP4
These eyebrows aren't getting any thicker unfortunately :[
>>
File: ComfyUI_00860_.png (1.21 MB, 1024x1024)
1.21 MB
1.21 MB PNG
I have been playing around with WAI-NSFW-illustrious-SDXL.
It seems to always have this "washed up" feel, almost low quality with lack of detail.
Even when I use the same prompt and settings from the examples, I cant replicate the original feel that's in the civitai page.
>>
In ComfyUI is there a way to have two or more models and vaes loaded at the same time? I'm trying a workflow that does a pass using one model, and then another pass using a different model.
>>
File: 1706519794872551.jpg (1.84 MB, 1664x2432)
1.84 MB
1.84 MB JPG
>>103559372
All illustrious derivatives but Noob are total shit in my experience.
>>
>>103559578
Do you have any recommendation anon?
I used to play with AnimagineXL, but it's not that good either
>>
File: PW_HV_00074.mp4 (359 KB, 480x848)
359 KB
359 KB MP4
>>103559372
Your gen doesn't look washed up, anon! Perhaps they used certain things in their proompt or added some extra stuff n LoRAs
>>103559537
Yup! You can absolutely use multiple models and vaes at the same time! You're more than welcome to pick apart my workflows if you'd like :]
>https://civitai.com/user/PWAnon
I hope they're useful to you! :]
>>
seeing that forge/a1111 has made some perf improvements since the last time I tested it, what is the best sdxl 1girl photorealistic model + loras right nao?
>>
File: file.png (3.07 MB, 2048x1024)
3.07 MB
3.07 MB PNG
>>103559605
but it looks really bad.
I tried the two passes thing, using a sd 1.5 based model on the second pass, and it looks so much better... But the sdxl model is best at making coherent pictures so I will use it to ensure decent hands.

Your workflows all look fantastic anon, I will experiment with them later, thanks for t he recommendation!
>>
File: 1705915897908493.jpg (1.87 MB, 1664x2432)
1.87 MB
1.87 MB JPG
>>103559593
As i said, noobai. Animagine is like mammoth shit old and pony is completely eclipsed by illustrious.
But for better experience with illustrious you need to experiment with artist and medium prompts. This gen for example is a combination of pixel art and alternation between two different artists.
>>
I miss Schizoanon
>>
File: PW_HV_00075.mp4 (434 KB, 480x848)
434 KB
434 KB MP4
>>103559664
Aw anon, I don't think it looks bad!
Hmm what models are you using? I wanna try to help! If you wanna give me a catbox that'd be super useful too!
Thank you so much! I hope you find some use from them! I have more that I really need to upload as these are kinda outdated hahaha!
>>
File: 1732955359458758.jpg (1.2 MB, 1664x2432)
1.2 MB
1.2 MB JPG
>>
Some really amazing gens this thread
>>
>>
File: PW_HV_00108.mp4 (386 KB, 848x480)
386 KB
386 KB MP4
>>
>>103555845
So what is the best overall local video model out there? In terms of gen time, quality and how well it follows a prompt?

>Mochi
>CogVideoX (and all of its variants)
>Hunyuan
>Hunyuan GGUF
>LTXvideo

I havnt had a chance to test every single variation of workflow, model type, etc. So far, I found cog to have better video to video, Mochi for text to video, LTX for speed (semi-decent results, video to video is complete turd though). A part from LTX, everything else just runs way to slow

t. 16gb ti super
>>
File: PW_HV_00111.mp4 (410 KB, 848x480)
410 KB
410 KB MP4
>>103559992
I've used the first three and Hunyuan is hella fun and pretty quick!
>>
>>103560036
>Hunyuan
I would need a 4090 for that. Might see how the GGUF version goes, then again I hear its slow as shit too.
>>
>>103560098
in a previous reply that anon mention it's about 3mins-ish on a 4090 so lower tiered cards will be progressively slower as you move down the xx80, xx70 etc line.
>>
File: PW_HV_00092.mp4 (552 KB, 480x848)
552 KB
552 KB MP4
>>103560098
Ohhh! I hope that version works! The other models were pretty fun too, but aren't really as fluid unfortunately imo!
>>
gm
>>
>>103557944
>>103558023
How do youmake such clean pixelart?
>>
>gm
>>
File: ComfyUI.webm (3.1 MB, 848x480)
3.1 MB
3.1 MB WEBM
>>
File: PW_HV_00089.mp4 (322 KB, 480x848)
322 KB
322 KB MP4
>>103560240
Good morning, anon! :]
I hope you've slept well!
>>103560318
Comfy!!! hello!! :D
Good morning!
>>
File: 00000-2582418762.png (1.63 MB, 1376x1024)
1.63 MB
1.63 MB PNG
>>
File: ComfyUI_.webm (3.52 MB, 848x480)
3.52 MB
3.52 MB WEBM
>>103560360
Hello, shit I need to go to sleep lol.
>>
File: 00001-2582418775.png (1.45 MB, 1376x1024)
1.45 MB
1.45 MB PNG
>>
File: PW_HV_00046.mp4 (101 KB, 848x480)
101 KB
101 KB MP4
>>103560444
LOOL saaaaame desu!
Ugh i'm not tired tho!
I'm just proompting and making music right now
It's totally your fault btw hahaha!! I'm having way too much fun with this!
>>
>>103560360
not enough ;_;
>>103560316
gm
>>
>>103560444
Really cute! Can you make her get hit by a truck?
>>
File: PW_HV_00087.mp4 (399 KB, 480x848)
399 KB
399 KB MP4
>>103560506
It's never enough! ;A;
That's what coffee is for! Hahaha!
>>
>>103560561
heh
>>
File: ComfyUI__.webm (2.96 MB, 848x480)
2.96 MB
2.96 MB WEBM
>>103560497
Yeah it's pretty fun.
>>
File: 00076-1989136047.png (3.09 MB, 1344x1728)
3.09 MB
3.09 MB PNG
I trained a thing, please try it out

I trained a thing, please try it out.

https://civitai.com/models/1049159?modelVersionId=1177214
>>
>>103560561
lel
>>103560584
aint dat the troot
>>
>>103560667
I can make a generic anime girl in a generic forest on my own though, so why?
>>
File: PW_HV_00134.mp4 (228 KB, 480x848)
228 KB
228 KB MP4
>>103560637
It really is haha!
>>103560667
I haven't actually tried the illustrious thing yet, i'll give it a try!
>>103560683
It sure is hahaha!
>>
File: 00112-2314887449.png (3.74 MB, 1344x1728)
3.74 MB
3.74 MB PNG
>>103560694
didn't want to post the unhinged, stylistically overcooked stuff since the average pleb would go "ew"

>>103560695

So far, I've had better things come out of illustrious than ponyXL, noobai had been a total wash in comparison, though.
>>
File: PW_HV_00133.mp4 (211 KB, 480x848)
211 KB
211 KB MP4
>>103560740
Ohh nice! I'm sometimes late to things cause I disappear every here n there hahaha! I'm excited to try it! :]
I'll totally post my results!
>>
>>103560740
....Sooooo, are you just plugging your generic waifu or what?
>>
File: file.mp4 (367 KB, 640x480)
367 KB
367 KB MP4
>>
>>103560782
The style, anon, not the waifu. At the risk of sounding masturbatory or pretentious, have a sample from training. It's all about that rich colour, the whole dark fantasy 90s DND flavour slop, the kind of thing you see in an old TTRPG rulebook, that kind of thing.
>>
>>103560807
Well yeah that's kinda cool. You shoulda just said that in the first place
>>
File: 00002-2370943577.png (1.79 MB, 1024x1376)
1.79 MB
1.79 MB PNG
>>
File: PW_HV_00085.mp4 (395 KB, 480x848)
395 KB
395 KB MP4
>>103560740
Hey btw what vae should I use? I'm getting some weird stuff haha
>>
>>103560852
now make a v for cringedetta anon mask meme edit of it
>>
File: 00118-1722887448.png (3.3 MB, 1344x1728)
3.3 MB
3.3 MB PNG
>>103560827
I wanted something presentable people would get/understand easier, and the good old 1girl generic anime girl seemed like a good idea instead of trying to explain something like that and coming across as a doochey AI bro/pretentious art critic.

>>103560878
Personally I use the generic SDXL VAE, but I had good results with https://civitai.com/models/855441/aaah-vibrant-vae-i-guess

Really depends on the model and such, Illustrious has been a lot more sensitive to sampler and CFG than pony ever was so far though.
>>
File: snowfall forest.webm (1.97 MB, 1920x640)
1.97 MB
1.97 MB WEBM
>>
>>
File: ComfyUI_01040_.jpg (1.16 MB, 4096x1024)
1.16 MB
1.16 MB JPG
In comfyUI is there a way to loopback, make a img2img pass happen twice or more times? Like in auto1111 we can use the loopback custom script in the img2img2 tab, and make the img2img pass several times...
I want to test passing this img2img picture a few times with low denoise to test a way to refine hands.
>>
Next Thread

>>103560815
>>103560815
>>103560815
>>
>>103560807
So you trained it with somewhat melted AI images? That's not a good look.
>>
>>103560903
Well I can tell you right now you are retarded, anyone can make your "style" you showcased without any loras whatsoever, now the DND thing now that's a cool thing and an actual hook. There is a saying if you try to please everyone you please no one.
>>
File: 00316.jpg (132 KB, 1240x1560)
132 KB
132 KB JPG
>>103551784
current mood, no dice
>>
>>103560955
Trained with these kinds of images, I saved off Pinterest over a few lonely evenings. Then I captioned them with CLIP iirc, and then promptly forgot about it until this morning.

>>103560976
That is fair enough
>>
File: snowfall forest 2.webm (1.87 MB, 1920x640)
1.87 MB
1.87 MB WEBM
>better without the foreground snowflakes
>>
File: PW_101335_.png (2.05 MB, 1800x960)
2.05 MB
2.05 MB PNG
>>103560903
Here's what I got from it so far, but it doesn't wanna work with my upscale haha! I'll have to play with it some more!
>>
>>103561038
Fucking nice, I had most visual power come off at around weight 2-ish, but your mileage may wary.
>>
File: PW_101336_.png (2.48 MB, 1800x960)
2.48 MB
2.48 MB PNG
>>103561072
I'm so glad you like it :D
I'll try more when I come back haha i'm super tired!
I hella like how the hat came out in this one! It's huge hahaha
>>
>>103561009
Yeah, these are AI. Please learn about why training AI with AI images is not a good thing if you care about image quality.
>>
>>103561122
I'm aware, and I have been around the block since SD1 was the new hotness. Still, I couldn't find where/how/with what those were made, and so I made something that replicates it.

Once again; it's about the vibe, yknow?
>>
>>103560944
Thanks
>>
>>103560852
when banana ring, u accept the call



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.