/g/ - Technology

File: the longest dick general.jpg (2.31 MB, 3264x1895)
Discussion of free and open source text-to-image models

Undistilled Edition

Previous /ldg/ bread : >>102646216

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/kohya-ss/sd-scripts/tree/sd3

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/aco/sdg
>>>/aco/aivg
>>>/b/degen
>>>/c/kdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vt/vtai
>>
File: 00132-3280264361.png (1.27 MB, 704x1664)
>>
File: 00133-277872321.png (1.61 MB, 704x1664)
>>
File: 00136-1418937834.png (2.46 MB, 1216x1600)
>>
File: 00135-502578135.png (1.55 MB, 704x1664)
>>
>puts the trypophobia pic in OP
I hate you
I fucking hate you
I hate you so much
fuck you
fuck
you
FUCK YOU AAAAAAAAAA
>>
File: 00142-1547560536.png (1.57 MB, 896x1344)
>>
File: 00144-3649137082.png (1.74 MB, 896x1344)
>>
>>102661427
pussy
>>
File: 00152-3335402968.png (953 KB, 576x1024)
>>
>>102651788
I guess this is a technical question - how is it known that it is now undistilled? what if it's only 80% undistilled? how do you verify this?
>>
File: 00161-3036414018.png (1020 KB, 576x1024)
>>
File: 00164-2384723655.png (789 KB, 576x1024)
>>
File: 00166-2721046942.png (834 KB, 576x1024)
get rekt bruh
>>
File: file.jpg (1.73 MB, 9111x1796)
>>102661447
>how is it known that it is now undistilled?
it doesn't burn at high CFG values, that's how you know it's undistilled
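for anyone wondering what "high cfg" means mechanically, here's a minimal sketch of the standard classifier-free guidance mix. Assumption on my part: a guidance-distilled model like dev takes the guidance strength as an input instead of doing this two-pass mix, which is the usual explanation for why real CFG on top of it overshoots and burns. Nothing in this thread confirms the mechanism, and the numbers below are toy values, not real model outputs.

```python
def cfg_combine(uncond, cond, scale):
    """Classifier-free guidance: start from the unconditional prediction
    and move toward (and past) the conditional one by `scale`."""
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]


# Toy 2-value "noise predictions" (made-up numbers, not from a real model).
uncond = [0.0, 1.0]
cond = [1.0, 1.0]

# Larger scales push the result further from the unconditional prediction.
for scale in (1.0, 3.5, 7.0):
    print(scale, cfg_combine(uncond, cond, scale))
```

at scale 1.0 you just get the conditional prediction back; anything above that extrapolates past it, which is where the overcooked look can come from.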
>>
File: 00172-2695188538.png (873 KB, 576x1024)
>>
File: 00183-1104493318.png (884 KB, 576x1024)
>>
File: 00182-2049000285.png (1.01 MB, 576x1024)
>>
File: 00188-4198240280.png (933 KB, 576x1024)
>>
File: 00192-3985904370.png (949 KB, 576x1024)
>>
File: 00193-1537400177.png (915 KB, 576x1024)
>>
File: 00195-3419126569.png (737 KB, 576x1024)
>>
File: 00197-1455805134.png (889 KB, 576x1024)
>>
File: 00194-2615945741.png (952 KB, 576x1024)
>>
>>102661327
Google loves Temu and AliExpress
>>
File: 00204-291631721.png (846 KB, 576x1024)
>>
File: 00201-3790433226.png (827 KB, 576x1024)
>>
File: file.png (3.57 MB, 896x1536)
remember the previous thread when it was said that we couldn't finetune Flux? Well... the guy that previously made Realistic Vision took it personally lol
https://civitai.com/models/788550/realflux-10b
>>
File: 00203-1467414437.png (852 KB, 576x1024)
>>
>>102661564
I don't see any information on the training he did other than Civitai's "Checkpoint Trained" tag...
>>
File: file.jpg (2.22 MB, 9999x1186)
https://huggingface.co/nyanko7/flux-dev-de-distill
bruh it converges at 60 steps? oh man :(
>>
File: file.png (310 KB, 1843x1536)
>>102661581
>I don't see any information on the training he did other than Civitai's "Checkpoint Trained" tag...
https://civitai.com/models/788550/realflux-10b?dialog=commentThread&commentId=551820
I think he's making a real finetune because he's seeing some collapse and is talking about the undistilled flux model
>>
>>102661606
>we will hope for the release of the non-dustilled version of the model
what? un-distilled flux dev was released 20 days ago
>>
File: file.png (83 KB, 1668x306)
>>102661606
>I think he's making a real finetune
he is
>>
File: 00225-4124722864.png (1.26 MB, 768x1344)
>>
File: 00232-2651050876.png (1.12 MB, 768x1344)
>>
File: 00234-2343208380.png (1.49 MB, 768x1344)
>>
Is there a discussion anywhere from the pony guy on how many H100 GPU-hours he would need to do it for either of these undistilled ones?
Or discussion of his GPU-hours at all for any model so I can extrapolate.
>>
>>102661732
>Is there a discussion anywhere from the pony guy on how many H100 GPU-hours he would need to do it for either of these undistilled ones?
to be honest I don't really care. Schnell's quality is so bad compared to dev that even if he made an insane finetune out of it, the best-case scenario would be reaching dev's quality. I think a "normal" finetune on dev, made by a guy who doesn't care about money, will turn out better than pony's insane one on schnell
>>
>>102661636
How many images did he use for his dataset?
>>
>>102661743
Would you like to play a game?
Which is which, and which is SDXL?
>>
File: file.png (94 KB, 690x948)
>>102661636
Ugh... he's giving up on flux because it's "distilled", I think he doesn't know un-distilled versions of flux already exist. I'm kinda curious what this "Verus Vision" is though
https://huggingface.co/SG161222/Verus_Vision_1.0b
>>
>>102661802
you already did this a few threads ago, I don't want to sound rude, but you're the only human on earth that hasn't noticed the quality difference between schnell and dev
>>
>>102661816
That was SDXL vs schnell.
I can see it I’m just coping hard.
>>
>>102661824
>I can see it I’m just coping hard.
fair enough :v
>>
>Distilled Model
The only logical explanation was to tempt anon into purchasing a sub for pro, right?
>>
>>102661806
The model isn't even trained yet, lol. It could be a vaporware scam.
>>
File: file.png (220 KB, 631x513)
>>102661840
It was more so releasing a model good enough to get the hype and publicity, and then destroy any competition by making their "open" models impossible to finetune, such an evil genius move if you ask me
>>
Bigma status?
>>
File: file.png (47 KB, 1074x224)
>>102661873
>The model isn't even trained yet, lol. It could be a vaporware scam.
looks like Verus Vision is a continuation of the Flux undistilled model, they intend on improving over it, I like that path, those guys are definitely gonna save flux
>>
>>102661840
It’s to poison the well. Remember these are people that fled stability with everything learned there (if not outright their codebase). The reason stability is fucked financially is because their free models are almost as good as the paid one, and have been tuned to be way better. BFL made dev specifically to be at the same level as Midjourney in benchmarks with a cuck license, and schnell just a hair above SDXL to fuck up open source, and clipped both of their wings for tuning to try to make it so that they couldn’t be tuned to be as good as the next model up.
>>
>>102661840
i havent seen a flux pro image that doesnt look like hyperslop tho
desu you cant desloppify it through prompt alone
>>
>>102661960
>i havent seen a flux pro image that doesnt look like hyperslop tho
true, pro isn't that much better than dev, at least it should've gotten some sovl like Midjourney Niji, but I think they don't care that much, they got a partnership with Twitter, they are in a good place right now
>>
File: file.png (1.34 MB, 1024x1024)
>>102661899
https://huggingface.co/nyanko7/flux-dev-de-distill
can't believe it took us 20 days to realize a trainable flux model existed, goddam...
>>
is the untrained model good or bad
>>
Can either of the undistilled Dev or Schnell models be used in ComfyUI?
>>
>>102662054
Has anyone even figured out if it being distilled was the only issue preventing training? From my experience with loras I think there's more fuckery up BFL's sleeve. This could sadly still be a nothing burger
>>
File: 57069.jpg (346 KB, 2560x2560)
epic posting in the /loser diffusion general/ ftw.
>>
>>102661888
It's apparently done, but they are writing and releasing a paper about it first
>>
waiting
>>
https://huggingface.co/SG161222/RealFlux_1.0b_Dev
>>
>>102663039
none of the examples I've seen other people post look like they couldn't have been done with base Dev
nothingburger
>>
>>102661597
Is there a gguf version of this?

>>102661564
What's the difference to this?
>>
File: 1699093161939375.png (1.99 MB, 1024x1024)
>>
Is there a good AI lyrics editing software yet?

Because there is so much good music with stupid fucking lyrics. I would make this shit into a fucking global no. 1.
>>
If I wanted to make pixel art for a game what would be the best local solution to do that?
>>
i'm usually running the normal flux fp8 model which has baked in vae and clip, and the model runs at 1.5it/s. but whenever i try the lightweight versions like gguf, it becomes slow as shit. are there some settings i can use to get similar speeds or is this just how it works? there seems to be no reason to run the models with external clip and vae due to this.
>>
Is it possible to use flux with only 8gb of vram? I assume no but wanted to check.
>>
>>102663615
>Is it possible to use flux with only 8gb of vram?
yeah, you can run flux quanted
>https://huggingface.co/city96/FLUX.1-dev-gguf
pick one with a file size that's smaller than your max vram, leaving some extra room for stuff like image resolution and loras. i think you should be able to use the Q4_0 quant with 8gb
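if you want that rule of thumb spelled out, here's a rough sketch: pick the biggest quant whose file size still fits in vram after you reserve some headroom. The function name, the 1.5 GB default headroom and the sizes in the example are all placeholders I made up for illustration; check the actual file sizes on the repo page and tune the headroom to your resolution and loras.

```python
def pick_quant(quant_sizes_gb, vram_gb, headroom_gb=1.5):
    """Return the name of the largest quant whose file fits in
    VRAM minus headroom, or None if nothing fits."""
    budget = vram_gb - headroom_gb
    fitting = {name: size for name, size in quant_sizes_gb.items() if size <= budget}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


# Placeholder names and sizes for illustration only, NOT the real repo files.
example_sizes_gb = {"quant_small": 5.0, "quant_medium": 7.0, "quant_large": 12.0}

print(pick_quant(example_sizes_gb, vram_gb=8.0))
```

file size is only a proxy for actual memory use, so treat the result as a starting point and drop down one quant if you hit out-of-memory errors.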
>>
>>102663593
Happened to be reading others stuff and saw this.
>GGUF is a pure compression tech, which means it is smaller but also slower because it has extra steps to decompress tensors and computation is still pytorch
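a toy sketch of what that "extra step" looks like. This is a simplified 4-bit scheme I wrote for illustration (one scale per block, small ints), not the actual GGUF block layout or ComfyUI's loader code; all function names here are made up.

```python
def quantize_block(weights):
    """Toy 4-bit quantization: one float scale plus small ints in [-8, 7]."""
    peak = max(abs(w) for w in weights)
    scale = peak / 7 if peak > 0 else 1.0
    qs = [max(-8, min(7, round(w / scale))) for w in weights]
    return scale, qs


def dequantize_block(scale, qs):
    """The extra step: expand the ints back to floats before any math."""
    return [q * scale for q in qs]


def dot_quantized(scale, qs, xs):
    """Dot product against quantized weights: dequantize first, then multiply."""
    weights = dequantize_block(scale, qs)
    return sum(w * x for w, x in zip(weights, xs))


weights = [0.5, -1.0, 0.25, 0.0]
scale, qs = quantize_block(weights)
print(qs, dequantize_block(scale, qs))
```

the stored ints are small, which is where the file size saving comes from, but every use of the weights pays for `dequantize_block` first, and the multiply itself still runs on regular floats.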
>>
>>102663675
Does a1111 support this format or do I need to use comfy?
>>
>>102663682
>Does a1111 support
not sure but forge does
>>
>>102663682
see it as an opportunity to start using comfy instead.
>>
>>102663726
It's too annoying. Maybe once you get it set up with a pipeline it'll be "better" but I just don't care to do any of the fine tuning that comfy allows. Being able to select a model and type prompts immediately is good enough. I'm open to trying again because I still have it installed but there's just no usecase for me right now.
>>
>>102663593
Are you on a 40XX card? I believe they have some speed hack that makes fp8 run much faster.
>>
>>102663997
yeah i am. the compression also makes a lot of sense.
>>
File: bComfyUI_124038_.jpg (782 KB, 1440x1024)
>>102661427
it wasn't that bad but thanks for giving me something to gen for later
>>
File: bComfyUI_124112_.jpg (699 KB, 1440x1080)
>>
>>102663223
Clawsome!
>>
>>102662461
can you go fatter
>>
File: 0.jpg (263 KB, 1024x1024)
>>
File: 0.jpg (254 KB, 1024x1024)
>>
File: 0.jpg (26 KB, 169x542)
>>102664598
If you for some reason can't or don't want to use Photoshop's content-aware fill, here's a nice tool https://github.com/Sanster/IOPaint
>>
File: 0.jpg (243 KB, 1024x1024)
yeah, messed up. posted wrong image.
>>
File: bComfyUI_123627_.jpg (642 KB, 1280x1024)
>>
>>102664770
I like this style
>>
>>102664802
Halloween
mystery , Halloween , by Simon Stalenhag by Frazetta, surreal, by Andre Kohn,


