[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology

Name
Options
Comment
Verification
4chan Pass users can bypass this verification. [Learn More] [Login]
File
  • Please read the Rules and FAQ before posting.
  • You may highlight syntax and preserve whitespace by using [code] tags.

08/21/20New boards added: /vrpg/, /vmg/, /vst/ and /vm/
05/04/17New trial board added: /bant/ - International/Random
10/04/16New board for 4chan Pass users: /vip/ - Very Important Posts
[Hide] [Show All]


New anti-spam measures have been applied to all boards.

Please see the Frequently Asked Questions page for details.

[Advertise on 4chan]


File: the longest dick general.jpg (2.32 MB, 3264x1472)
2.32 MB
2.32 MB JPG
Discussion of free and open source text-to-image models

Previously baked bread : >>103024144

Tasteless Retards Edition

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
reForge: https://github.com/Panchovix/stable-diffusion-webui-reForge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://aitracker.art
https://huggingface.co
https://civitai.com
https://tensor.art/models
https://liblib.art
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3

>SD3.5L/M
https://huggingface.co/stabilityai/stable-diffusion-3.5-large
https://replicate.com/stability-ai/stable-diffusion-3.5-large
https://huggingface.co/stabilityai/stable-diffusion-3.5-medium
https://huggingface.co/spaces/stabilityai/stable-diffusion-3.5-medium

>Sana
https://github.com/NVlabs/Sana
https://sana-gen.mit.edu

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux
DeDistilled Quants: https://huggingface.co/TheYuriLover/flux-dev-de-distill-GGUF/tree/main

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
>>103027007
That looks legit, how did you do that?
>>
File: 2024-10-30_00012_.png (1.48 MB, 720x1280)
1.48 MB
1.48 MB PNG
>>103035670
>something important with pixelwave 3, make sure to use dpmpp_2m for the sampler and sgm_uniform for the scheduler, do at least 25 steps
Interesting. Why is that? Here is a prompt with PixelWave, some LyingSigma too. And DynamicThresholdingFull.

just euler, beta, otherwise.
>>
File: 2024-10-31_00002_.png (1.04 MB, 720x1280)
1.04 MB
1.04 MB PNG
Adding some earlier the lyingsigma un-detail.
>>
File: 002491.jpg (2.74 MB, 1536x2560)
2.74 MB
2.74 MB JPG
>>
>>103029174
Looks neat!
>>
>>103029288
This reminds me of Facing Worlds.
>>
>>103035784
nonono bro I'm doing dedistilled cfg=1 lmao

Two models at once, pixelwave highly realistic. Let me try it.
>>
>>103035714
c-can I t-take y-your p-picture? (the chines are very demure)
>>
>>103035758
>that image
that's the issue with PixelWave, the greats details of Flux are gone, now it looks like a SDXL image with its shitty VAE, maybe because he insisted on training the model with only 1k resolution pictures
>>
>>103035798
yeah my b I was about to respond to another post not you kek
>>
>>103035758
Those are the parameters the creator specified and i find it gives the best results by far
>>
File: 2024-10-31_00003_.png (820 KB, 720x1280)
820 KB
820 KB PNG
>>103035806
It seems like more adherence = worse details, it's a tradeoff.

>>103035776
The same exact settings but with PixelWave.
>>
File: 2024-10-31_00005_.png (1.27 MB, 720x1280)
1.27 MB
1.27 MB PNG
>>103035835
Same, but bypassed dynamic threshholding, and lyingsigmas, and I'm using the correct dpmpp_2m, sgm_uniform.

It's "better".

One problem with this whole topic is how it intersects the idea of image, culture, class, etc etc. photography, artwork, architecture, pragmatism.

ai is the ultimate social battleground.
>>
>desert storm was before the DJI Phantom

Tech continues to advance at a rapid pace. We just have stopped recognizing that our lives are altered over and over in ways that are not predictable, but which are mostly negative.
>>
File: 20241031T055303Z_00001_.jpg (1.86 MB, 2560x1440)
1.86 MB
1.86 MB JPG
>>
SOMEONE FIX THE OUT OF FOCUS BOKEH BLUR FLUX BULLSHIT!
Dedistilled, Pixelwave, whatever, it's still there I can't escape it!
>>
>>103036088
the only way I found to remove the bokeh is to go for that is by using the Lying Sigma Sampler node
https://github.com/Jonseed/ComfyUI-Detail-Daemon
>>
>>
>>
File: 2024-10-31_00007_.png (1.45 MB, 720x1280)
1.45 MB
1.45 MB PNG
Ariana Grande lora
>>
>>103036096
I really want to know how that's happening anyway - how can it know which parts to blur?
>>
>>103036249
>how can it know which parts to blur?
anon, models are deep learning neurons, they implicitly knows how the world works, so they probably finetuned it in a way that it should only make bokeh pictures
>>
File: wtf.png (395 KB, 1566x1166)
395 KB
395 KB PNG
wtf
>>
File: 00076-1639706402.png (375 KB, 496x496)
375 KB
375 KB PNG
from an unreleased GeoCities image lora
>>
File: 00046-3614240975.png (255 KB, 344x496)
255 KB
255 KB PNG
>>103036278
>>
>unreleased
Awh, man.
>>
File: 00034-4277168806.png (257 KB, 600x400)
257 KB
257 KB PNG
>>103036283
>>
File: 00067-1257821327.png (234 KB, 496x496)
234 KB
234 KB PNG
>>103036288
I'll release it eventually, just gotta generate some good examples images and get a better feel for it. I could upload it to catbox right now, I think I'm gonna do that right now in fact.
>>
File: esoteric_knowledge.jpg (92 KB, 900x669)
92 KB
92 KB JPG
>>103036267
Seems like this image will never not be relevant.
>>
File: 00017-3628111074.png (136 KB, 384x384)
136 KB
136 KB PNG
>>103036298
https://files.catbox.moe/qqy5s0.safetensors
Trigger is "Geocities image." at start of prompt. I recommend using a weight of 1.5 since 1.0 still has that Flux DoF blur.
>>
>>103036298
Flux? XL? 3.5?
>>
>>103036313
Flux, trained locally on a 3060. It takes hours but thankfully I have the time.
>>
>>103036312
TY anon, when I break out flux again I'll try it. The examples you're posting look so good.
>>
File: 00005-3902850286.png (410 KB, 768x384)
410 KB
410 KB PNG
>>103036312
Also I recommend generating at 512 resolution or lower, but text sometimes breaks down at around 300px. I tried song lyrics and it always spit out "Are a jaded?" instead of "Are you jaded?".
>>103036324
Do post your results, I'm proud of how this turned out.
>>
>>103036310
true, the line is really thin kek
>>
>>103036332
i almost dont believe this is AI
>>
>>103036350
Flux is incredible at producing low-quality looking images if you train it on them. I attribute it to the VAE personally, but even in that image you can see the generic blur in the back. Again, using 1.5 weight is probably better but I didn't bother when I generated that one.
>>
File: ComfyUI_00007_.png (1013 KB, 1024x1024)
1013 KB
1013 KB PNG
>>103036350
>i almost dont believe this is AI
that's the Flux effect, I had the same reaction for that picture until I was able to make it myself on flux dev
>>
File: 00115-3436883531.png (2.03 MB, 1024x1024)
2.03 MB
2.03 MB PNG
Here's a different experiment from a while back, using a weirdcore aesthetic lora without a prompt (only using the activation).
I generated a metric fuckton of these, and they look nothing like what the weirdcore aesthetic is supposed to be.
>>
File: 00181-2114428040.png (1.88 MB, 1024x1024)
1.88 MB
1.88 MB PNG
>>103036385
There's way too many to reasonably post, so here's two more
>>
File: 00140-2816918863.png (1.08 MB, 1024x1024)
1.08 MB
1.08 MB PNG
>>103036389
This one's closer, at least with the text.
>>
File: 00144-3930504532.png (1.59 MB, 1024x1024)
1.59 MB
1.59 MB PNG
>>103036389
I lied.
>>
File: 00174-3930504562.png (2.27 MB, 1024x1024)
2.27 MB
2.27 MB PNG
>>103036427
Alright, NOW this is the last one. I have to stop now or else I'll get carried away.
>>
>>
>>
File: 2024-10-31_00009_.png (1.21 MB, 720x1280)
1.21 MB
1.21 MB PNG
>>103036246
dedistilled and lyingsigmas
>>
>>103036443
>1girl
>>
File: 00015-3930504503.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
>>103036479
>>
>>103036312
>>103036385
>>103036389
>>103036397
>>103036483
I'd take a whole thread of them. Incredible.
>>
File: 00030-3141513107.png (1.52 MB, 1024x1024)
1.52 MB
1.52 MB PNG
>>103036486
I dumped an absolute SHITLOAD of them in a dickscord server, I'll try just compiling them and making a catbox (or other file service if it's too big)
Glad you appreciate them! This is the last one I'll post in the thread, for realsies this time.
>>
>>103036310
real
>>
>>103036514
Sorry, doesn't look like it's happening. The zip is 1.4GB (granted i did absolutely no filtering) and I can't find a good file sharing site. There's one I specifically remember that was very similar to catbox but had an upload limit of 1gb but now I can't find it. If anyone knows what I'm talking about please post it here.
>>
>>103036560
>upload limit of 1gb
https://litterbox.catbox.moe
>>
>>103036571
I meant a base upload of 1GB, where it doesn't expire, and presumably their equivalent of litterbox would support larger sizes.
>>
>>103036578
everything expires eventually, catbox is a major exception to this
maybe use pixel drain or mega
>>
File: file.webm (1.3 MB, 1280x768)
1.3 MB
1.3 MB WEBM
https://github.com/jy0205/Pyramid-Flow
Babe wake up, they released their promised base model trained from scratch
>We have switched the model structure from SD3 to a mini FLUX to fix human structure issues, please try our 1024p image checkpoint and 384p video checkpoint (up to 5s). The new miniflux model shows great improvement on human structure and motion stability. We will release 768p video checkpoint in a few days.
>>
>>103036583
My upload speed is fucked unfortunately, so I'm currently sitting my ass down and saving everything I posted to discord, and I'll zip that instead.
>>
>>103036591
>please try our 1024p image checkpoint
oh nice a new local image model
>>
File: 002514.jpg (2.14 MB, 1536x2560)
2.14 MB
2.14 MB JPG
>>
>>103036623
It's still gonna take a bit. Will post when done.
>>
>>103036591
>they released their promised base model trained from scratch
Not quite, they released the 384p video sure, but not the 768p yet, so I guess we'll wait for some more before testing it seriously again. I hope that one will be as good as Mochi, at least those chinks made image2video possible
>>
>>103036676
Here it finally is: pixeldrain 7Za5iN4D
>>
>>103036718
God fucking dammit, I left out a bunch of images. Oh well, nobody gives a shit, it can wait.
>>
>>
File: 1071461689.png (1.33 MB, 768x1344)
1.33 MB
1.33 MB PNG
>>
>>103036591
The last time they said "a few days" that meant 3 weeks, I'm not going to install and test it for 384p.
While i understand this is primarily to get some sppedups and vram reductions from devs to use on the 768p model I don't apprecitate being told "a few days" when it is more likely to be several weeks.
I am grateful to all devs for the toys they provide.
>>
File: 2024-10-31_00010_.png (1.31 MB, 720x1280)
1.31 MB
1.31 MB PNG
>>103036473
>>
>>103036726
FINALLY it's done: https://files.catbox.moe/fmsoan.7z
>>
>>103036891
I didn't believe their "few days" bullshit, you can't pretrain a model in a few days lawl
>>
File: 2024-10-31_00012_.png (1.44 MB, 720x1280)
1.44 MB
1.44 MB PNG
>>103036934
"desert" storm. Time to find out the TRUTH.
>>
File: 00088-3930504576.png (1.45 MB, 1024x1024)
1.45 MB
1.45 MB PNG
>>
File: 2024-10-31_00013_.png (1.51 MB, 720x1280)
1.51 MB
1.51 MB PNG
>>103037164
What do they know that they aren't telling us?
>>
>>103037061
the thing is, once they've got the parameters set for the dataset at lower resolution, which probably took iteration and tests, they can just plug that into the larger model if it's the same dataset just at a higher resolution and get a comparable result in terms of what the model learns, but with finer fidelity.
But admittedly, "a few days" is kinda sus.
>>
File: 2024-10-31_00014_.png (965 KB, 720x1280)
965 KB
965 KB PNG
>>103037209
>>
File: 2024-10-31_00015_.png (446 KB, 720x1280)
446 KB
446 KB PNG
Data.
>>
File: grid-0327.jpg (336 KB, 1792x2304)
336 KB
336 KB JPG
>>
there are literally 20+ flux loras out there that are meant to do female nudity. Which is a reliable one worth using?
>>
File: altrevy.jpg (194 KB, 1024x1024)
194 KB
194 KB JPG
>>
>>103037440
IOPaint would have fixed that faster than whatever you used.
>>
>>103035779
Catbox please
>>
File: 0.jpg (441 KB, 1376x896)
441 KB
441 KB JPG



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.