/g/ - Technology






File: tmp.jpg (1.13 MB, 3264x3264)
General dedicated to creative use of free and open source text-to-image models

Previous /ldg/ bread : >>101344420

>Beginner UI
Fooocus: https://github.com/lllyasviel/fooocus
EasyDiffusion: https://easydiffusion.github.io
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
InvokeAI: https://github.com/invoke-ai/InvokeAI
ComfyUI: https://github.com/comfyanonymous/ComfyUI

>Auto1111 forks
SD.Next: https://github.com/vladmandic/automatic
Anapnoe UX: https://github.com/anapnoe/stable-diffusion-webui-ux

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Kolors
https://gokaygokay-kolors.hf.space
Nodes: https://github.com/kijai/ComfyUI-KwaiKolorsWrapper

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>View and submit GPU performance data
https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
https://docs.getgrist.com/3mjouqRSdkBY/sdperformance

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Share image prompt info
https://rentry.org/hdgcb
https://catbox.moe

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/trash/sdg
>>
Blessed to bred you anon
>>
official pixart bigma and lumina 2 waiting room, now with good prompt comprehension
>>
File: ComfyUI_00102_.png (1.25 MB, 1024x1024)
Gonna post this here as well:

New open source model, best prompt following outside of ideogram so far.
https://blog.fal.ai/auraflow/
https://huggingface.co/fal/AuraFlow

Towards the right is a cartoon dragon on top of a cliff, to the left is a anthromorphic fox wearing armor riding a horse. The horse is standing on top of a blue cube. In the background there is a flying eagle holding a sun. The sun has a angry face on it.
>>
File: media_GSQL07ragAEYTfq.jpg (168 KB, 1024x1024)
Model: https://huggingface.co/fal/AuraFlow
Demo: https://fal.ai/models/fal-ai/aura-flow?share=45041643-4b84-4603-b6c8-b76be7869c4f
>There's a green triangle on top of a blue square, and a red sphere on top of the green triangle, and a yellow rabbit on top of the red sphere, and a pink sheep on the right, and a purple tiger on the left, and a black bat on the top right
That's really impressive, this guy is good. A single person is making a better model than a whole fucking team (SAI); the SAI cucks should be ashamed of themselves
>>
File: 1720762644816254.png (1.23 MB, 1024x1024)
>>101375771
And this was the actual first attempt at the prompt that did better.
>>
>>101375771
>>101375778
really promising model, with whatever this guy cooks up next and pixart bigma, we'll be eatin good.
>>
needs 16ch vae tho
>>
>>101375771
>>101375778
that looks like some monty python drawings kek
https://www.youtube.com/watch?v=pLpK_Htw-F8
>>
>>101375730
I use AI as my silly image and sometimes porn generator, but does anyone else notice that the shittier an artist is, the more they hate AI?
>>
>>101375803
the 16ch vae exists but in its rough form, needs a bit of training to be adapted to some new models
>>
>>101375811
of course, AI can't reach the best artists' level (yet), that's why only the shitty artists are seething: they realized they have so little talent that AI managed to beat them in less than a few years of its lifetime
>>
File: ComfyUI_00127_.png (1.49 MB, 1024x1024)
>>101375804
It can do better aesthetics if prompted for it. I'm more interested in its prompt comprehension because everything else can be finetuned easily enough.
>>
>>101375771
>best prompt following outside of ideogram so far.
so for you the ranking would be ideogram > this model > dalle3 in terms of prompt understanding?
>>
>>101375843
Yes. And this model is supposedly still early in training.
>>
File: AuraFlow_00021_.png (1.09 MB, 1024x1024)
It seems like a lot of the data for this model was pulled off of Ideogram.
Like, sometimes the "not safe" cat will appear at random times, like it has been sprinkled throughout the dataset in response to potentially unsafe prompts. There's nothing wrong with that on the surface, but it raises the question of how long the trainer plans to keep training a model that makes copies of copies and how much new and curated data will be introduced into the dataset?
>>
File: ComfyUI_00129_.png (1.35 MB, 1024x1024)
(extremely aesthetic charcoal drawing of a majestic western dragon looking at the viewer, the dragon is sitting on top of a red jeep:1.4), (dark background, rim lighting, epic, detailed background:1.2), (fantasy vibe:1.2), rich colors, high contrast, hard focus, intricate details, natural light, ethereal, expressive, intimate, elegant, vibrant bloom, whimsical, dramatic shadows, medium close-up, 85mm lens, f/2.8, atmospheric, moody, evocative, luxurious, textured, artistic, surreal, detailed, otherworldly
>>
>>101375862
>training a model with synthetic data
when will they learn?
>>
>>101375882
they will never learn...
>>
>>101375771
>best prompt following outside of ideogram so far.
>>101375862
>It seems like a lot of the data for this model was pulled off of Ideogram.
Now we know why the prompt following is ideogram tier, he's just trying to make a copy of ideogram
>>
File: ComfyUI_00130_.png (1.38 MB, 1024x1024)
>>101375882
It's the reason why the model is so good at following prompts. There is nothing to learn. The best current models are using mostly synthetic data, from image gen to text gen.
>>
>>101375885
They will machine learn
>>
>>101375890
>It's the reason why the model is so good at following prompts.
Doubt. It's about how well the dataset is tagged.
>>
>>101375890
>It's the reason why the model is so good at following prompts.
that's bullshit, dalle3 is good at following prompts because they used GPT4V on real pictures for the captioning, not synthetic data
>>
File: ComfyUI_00132_.png (1.61 MB, 1024x1024)
(extremely aesthetic charcoal drawing of a majestic western dragon looking at the viewer, the dragon is driving a red jeep, the dragon is wearing a top hat, the jeeps license plate has the "DRAG" on it:1.4), (dark background, rim lighting, epic, detailed background:1.2), (fantasy vibe:1.2), rich colors, high contrast, hard focus, intricate details, natural light, ethereal, expressive, intimate, elegant, vibrant bloom, whimsical, dramatic shadows, medium close-up, 85mm lens, f/2.8, atmospheric, moody, evocative, luxurious, textured, artistic, surreal, detailed, otherworldly
>>
>>101375890
>The best current models are using mostly synthetic data
models such as?
>>
>>101375899
Wizard, phi, gemma... well filtered mostly synthetic data. Dalle / midjourney, tons of synthetic data...
>>
>>101375905
Itcametomeinavision_xl and imadeitup_1.5
>>
>>101375909
and that's why they are all slopped, and they did that at the FINETUNING stage, not the pretraining one, you don't pretrain a model with synthetic data, that's bullshit
>>
>>101375920
>and that's why they are all slopped
I mean you say that but they are all the best performing models for their size.
>>
File: ComfyUI_00133_.png (1.61 MB, 1024x1024)
>>
>I love the look of AI sloppa
>Let's throw some into our model
>>
>>101375928
no, I mean that they are all slopped, /lmg/ complains about that a lot, and LLMs and imagegens aren't 1 to 1 equivalent. You don't become the best by just being a cheap copy of the best; Midjourney/dalle/chatgpt are the best and they never trained their models on AI slop, they did it on real data, as it should be
>>
>>101375942
? Gemma / wizard are some of the best models touted on lmg atm. Phi though sure, but its tiny.
>>
>>101375949
why do you repeat the same arguments like a broken record or something? you're wasting my time anon
>>
>>101375941
I think there should be some so the model knows it as a concept. SD face, midjourney style etc. Perhaps even what the common artifacts from them look like
>>
File: ComfyUI_00139_.png (1.35 MB, 1024x1024)
(extremely aesthetic charcoal drawing of a majestic western dragon looking at the viewer, the dragon is driving a red jeep, the dragon is wearing a top hat, the jeeps license plate has "DRAG" on it. To the right is a minotaur driving a purple suv, the suv's license plate has "BULL" on it. They are racing towards the camera.:1.4), (dark background, rim lighting, epic, detailed background:1.2), (fantasy vibe:1.2), rich colors, high contrast, hard focus, intricate details, natural light, ethereal, expressive, intimate, elegant, vibrant bloom, whimsical, dramatic shadows, medium close-up, 85mm lens, f/2.8, atmospheric, moody, evocative, luxurious, textured, artistic, surreal, detailed, otherworldly
>>
>>101375965
i don't believe it works like that anon
>>
>>101375941
The trick is to use synthetic data to fill out the gaps in its knowledge. But you balance it out with aesthetic training, then you have the advantages without the "slop" style.
>>
>>101375965
that's not what it is doing though, it's not putting in an ideogram picture and making the model understand it's an AI picture, it's training on it as if it's a real picture, that's dumb as fuck, the model is learning reality through AI sloppa, it's like recording a VHS from another VHS, you just lose accuracy with this inbreeding technique
https://www.youtube.com/watch?v=nqy_hYDI0As
>>
>>101375978
>use synthetic data to fill out the gaps in its knowledge
Name a single concept, object, etc that does not have enough real data available.
>>
>>101375974
>>101375983
Well then it's extremely gay and not based at all, technically speaking of course
>>
>>101375987
Find me a equivalent image for each and every concept in this dataset.
https://huggingface.co/datasets/ProGamerGov/synthetic-dataset-1m-dalle3-high-quality-captions
>>
>>101375998
I don't want a model to look like a cheap copy of dalle3, that's insanity
>>
>>101376009
Read my comment again: >>101375978 You can avoid getting the "style" of said images while gaining the "concepts". Its gonna need a lot more training though.
>>
>>101376006
>Models trained or fine-tuned on
ProGamerGov/synt
>Zero text-to-image
I wonder why. You should train one and let us know how it looks.
>>
>>101376006
The world captures over 5 billion real images daily, your argument is invalid
>>
>>101376019
Are you not following the thread / the last one? That is what Auraflow is doing and its looking good even as undertrained as it is so far. Best in class even.
>>
>>101376006
if you're not lazy you could get the same results by training your model with really complex REAL drawings that have a shit ton of stuff in them
>>
>>101376018
>You can avoid getting the "style" of said images while gaining the "concepts". Its gonna need a lot more training though.
Have any examples of this? Until it happens with aura I sleep.
>>
>>101376043
That looks like shit. I honestly prefer the dalle "style."
>>
Any guesses as to what that one (1) person has spent on training this? Something isn't adding up.
>>
>>101376047
>>101375941
>>
>>101376018
Nah, I don't buy it, an AI picture will always be an approximation of reality, training a model with less than 100% accurate data when 100% accurate data (Real pictures!!) exists is retarded, it should not be done in the pretraining process, for finetuning why not, people are free to make the model more AI sloppa for all I care. A base model should be neutral in the first place so that everyone can mold it in any way they want
>>
File: poor-taste.jpg (51 KB, 450x548)
>>101376047
>>
Though I don't think people are giving dalle a fair shake either. Actual stylized images on dalle actually don't have a bad style. It's the fake 3D / realism where it's shit.

https://cdn-lfs-us-1.huggingface.co/repos/ee/1b/ee1bd318fa77f0f576a7f4f9aed9ef47229a9abd078b2ad9e56f71078c3b5622/c8b1e0635c4e6176e66aec1152002bb5471a5db21eb58774a01d7cd29f785314?response-content-disposition=inline%3B+filename*%3DUTF-8%27%27highlights_grid.jpg%3B+filename%3D%22highlights_grid.jpg%22%3B&response-content-type=image%2Fjpeg&Expires=1721028770&Policy=eyJTdGF0ZW1lbnQiOlt7IkNvbmRpdGlvbiI6eyJEYXRlTGVzc1RoYW4iOnsiQVdTOkVwb2NoVGltZSI6MTcyMTAyODc3MH19LCJSZXNvdXJjZSI6Imh0dHBzOi8vY2RuLWxmcy11cy0xLmh1Z2dpbmdmYWNlLmNvL3JlcG9zL2VlLzFiL2VlMWJkMzE4ZmE3N2YwZjU3NmE3ZjRmOWFlZDllZjQ3MjI5YTlhYmQwNzhiMmFkOWU1NmY3MTA3OGMzYjU2MjIvYzhiMWUwNjM1YzRlNjE3NmU2NmFlYzExNTIwMDJiYjU0NzFhNWRiMjFlYjU4Nzc0YTAxZDdjZDI5Zjc4NTMxND9yZXNwb25zZS1jb250ZW50LWRpc3Bvc2l0aW9uPSomcmVzcG9uc2UtY29udGVudC10eXBlPSoifV19&Signature=YPzjiBEaPyaRAtSdUzk%7EYWe6nEpI18gRak199LfejzLWB52Uu9YBYHI9ae8XXbtuOedjkdabxZDkM5-r%7E8Ge4WSUDtiG7nZ-0BBuu5MBu9WUkXfg7UOMmmB3PQSrelM6La0lArVMB-HEjizie80xuN6FUsJpuPswTme3Fsb30s890z-UlS9k2bZixiGjGsDHEwsBXgW1e866SfleDKmLYKnMtd1iwCRBNmiTJ1-g0Ta6DOUs3q0bHRGN6L5xWcGhLkJ6Ld-TKwOMIrdNRstdo7D20pxiwckDU62dV%7EVwb73X%7Emebh8xpxyi970jIZA0gJp7rhgDRbDhF2LfffN9MxA__&Key-Pair-Id=K24J24Z295AEI9
>>
>>101376130
I'm not clicking that big ass link nigga
>>
>>101376038
>its looking good
Prompt comprehension wise because of the way it was tagged. That's it. The fact that it uses ai images in training does not make it better at comprehending a prompt.

>>101376130
>It's the fake 3D / realism where it's shit.
It's only more apparent with those styles.
>>
>>101376130
it doesn't matter how "supposedly good" dalle3 is looking, that's not the fucking point, the pretraining process should be about making the AI understanding the reality, and the reality is REAL PICTURES NOT FUCKING AI SLOP MOTHERFUCKER
>>
>>101376135
https://files.catbox.moe/97u9xh.jpg
>>
>>101376050
they do it for the love of the game bless their heart
>>
>>101376144
A curated dataset of only high quality stylized images is what you would use. I'm not saying to use fake RL photos. That is where models like dalle / midjourney suck. For stylized the best images can be nearly impossible to tell if its from AI or not.
>>
>>101376164
>the best images can be nearly impossible to tell if its from AI or not.
You can always tell. You can always tell.
>>
>>101376050
?
>>
>>101376178
How much money has this individual spent on generating synthetic data and GPU rental? How are they funding it?
>>
>>101375882
The creation of /ai/ will come before that and shortly after, the heat death of the universe
>>
File: AiSlopCantBeatThat.jpg (792 KB, 1800x900)
>>101376164
>For stylized the best images can be nearly impossible to tell if its from AI or not.
That's a lie and you know it, a model shouldn't learn drawings through AI slop, but through real artists, period
>>
>>101376170
>You can always tell.

You can not honestly tell me that there are not some images (mostly stylized ones) that would fit right in on any art website. For example, some of the pixel art ones here: >>101376146 / the more textured / stylized ones.
>>
>>101376006
hey, you absolute fucking dumbass. how do you think dall-e was able to learn all that if no prior image of it existed? it's because they used REAL FUCKING IMAGES and captioned them properly. god localjeets are SO FUCKING STUPID they're actually so completely retarded that they willingly sink their own projects at every turn.
YOUR CHINESE SYNTHETIC SLOP MODELS LOOK LIKE SHIT! you will NEVER compete with midjourney and dall-e at this rate because you have NO QUALITY CONTROL
the few somewhat smart people in this thread are all that keeps me from wishing total localoid death. it's actually completely fucking inexcusable how far behind local image models have fallen due to incompetent chinks and self-flagellating ethicsfags
please, for the sake of local development, NEVER POST about ai again. just jump off a bridge you sabotaging faggot
>>
>>101376214
>YOUR CHINESE SYNTHETIC SLOP MODELS LOOK LIKE SHIT! you will NEVER compete with midjourney and dall-e at this rate because you have NO QUALITY CONTROL
Exactly this, how the fuck do you expect to beat midjourney if your only goal is to be a cheap copy of it
>>
>>101376214
Amen, fucking amen.
>>
>>101376214
midjourney was trained on a ton of dalle gens btw, same with ideogram
>>
>>101376243
I don't believe that, Midjourney pictures look way better than dalle
>>
>>101376164
ai image gen isn't at the point where it can replace real pictures and art, saas or not. with enough synthetic slop it'll just end up poisoning the model with its flaws (weird lines, nonsensical colors, stupid fucking fish eye lens effect, centered images, that strange obsession with symmetrical imagery, nonsensical upclose detail, everything is packed into a square resolution so it makes the image feel cramped and claustrophobic). not to mention that deepfried look dalle gens have, openai probably adds it intentionally so they can stay out of trouble.
>>
>>101376209
Anon. We disagree on "what looks good" but now the argument isn't about that, it's about being able to tell if an image is AI. If you have trouble pointing out the tells of every single one of those images then it's unironically over for you.
A VAST majority of people dislike the typical AI styles. You may enjoy it but no one else does. That's why >>101375920 is right and it should be entirely omitted during pretraining.
>>
>>101376255
do you really not understand the concept of aesthetic training? You can have your synthetic cake and eat it too
>>
>>101376243
you know its easy to spot an mj gen, right? because of its style?
>>
>Pretrain your model on dalle3 AI sloppa
>Is surprised the image quality is worse than dalle
You can't make this shit up, local fags will never improve if all they're doing is trying to cheap out the pretraining
>>
>>101376276
Don't lump us all together like that. Only a few actually enjoy "AI style".
>>
>>101376265
No you don't, and I need proof that Midjourney was using dalle to train their models, are you on the MJ team or something?
>>
>>101376273

>>101376265
>>
>>101376214
>YOUR CHINESE SYNTHETIC SLOP MODELS
pretty sure base pixart didn't include any synthetic images in training but since they only published the captions we will never know
>>
>>101376285
It's called using midjourney when it first came out on discord. It was dalle with some aesthetic training on top that slowly diverged away.
>>
>>101376299
And pixart has worse quality images but is also super undertrained.
>>
>>101376284
The problem is that there aren't a lot of people making base imagegen models, and the few that do are all doing something retarded in the pretraining

>SAI: Poison their model with """safe""" DPO cucking + insane censorship on the dataset
>ComfyUi and pony: Remove all children on the pretraining???
>Hunyuan: That's obvious they use AI sloppa for the pretraining
>Pixart: Looks fine but undertrained
>Kolors: outdated architecture + shit licence
>AuraFlow: Spams its pretraining with ideogram pictures
Fuck man, they all suck at the end of the day
>>
>>101376276
>trying to cheap out the pretraining
I don't get it. Isn't scraping images pretty cheap? Flickr, boorus, public domain etc.

>>101376284
>Only a few actually enjoy "AI style".
I think it's a fun concept, but it should be tagged and trained as a style not something that comes out as a standard.
>>
>>101376315
you cant deny its aesthetically superior especially something like bunline which includes virtually zero AI images
>>101376323
put "dreamshaper" in the negatives of SD3 and post the results
>>
>>101376323
>I don't get it. Isn't scraping images pretty cheap? Flickr, boorus, public domain etc.
It's not cheap at all because you still need to caption those pictures, and make good captions for them; it's way easier to just use the ideogram API, write whatever random complex shit you want in it, and you'll get a picture that is somehow close to what you've written
>>
>>101376302
Do you have a source or something?
>>
But yea, today I learned that /g/ is horribly misinformed on synthetic data training. They don't know that you can get away from having the ai "style" while filling gaps if you properly balance the dataset.
>>
>>101376345
Go fuck yourself nigger, if you can't find real pictures to fill the gap, you have serious skill issue
>>
Why can anon immediately tell when a lora or tune's dataset is mostly synthetic data?
>>
>>101376345
>>101376360
you need to read this >>101376263
>ai image gen isn't at the point where it can replace real pictures and art, saas or not. with enough synthetic slop it'll just end up poisoning the model with its flaws (weird lines, nonsensical colors, stupid fucking fish eye lens effect, centered images, that strange obsession with symmetrical imagery, nonsensical upclose detail, everything is packed into a square resolution so it makes the image feel cramped and claustrophobic).
>>
>>101376355
Find me an image of an anthro fox riding a horse standing on a blue cube while... etc...

You can have both: the far off concepts, and, once you train it enough, the style of the hand made images. The farther off the concept, the more "synthetic" the image is going to look due to the limited training so far. That can be fixed with more training / aesthetic training.

Educate yourself.
>>
oh brother
>>
>>101376265
>>101376264
>That's why >>101375920 is right and it should be entirely omitted during pretraining.
>>
>>101376372
>Find me a image of a anthro fox riding a horse standing on a blue cube while... ect...
You're so fucking retarded, why do you pretend you can't find complex prompts out of real pictures? It won't be an anthro fox with whatever bullshit you've written, but if a model is trained with enough complex REAL pictures with accurate captions, there's no reason it can't make new complex things after that, just admit you're a lazy fuck who wants to use AI sloppa to make your job easier, but that's the difference between the best (dalle/midjourney) and you local cucks, they don't take shortcuts to greatness, they don't rely on other AI sloppa to make their model great. You are the inbreeding cancer of this community, you should be ashamed of yourself.
>>
>>101376400
Lol did you work on dalle or something?
>>
>>101376372
>That can be fixed with more training / aesthetic training.
That's the problem retard, if you only pretrain a model with real pictures, you don't need to fix anything in the first place. All you're doing is adding tape to a broken wall, instead of building a good wall in the first place
>>
>>101376409
Did you read the paper or something? They clearly said that they used real pictures + synthetic captions (GPT4V) to achieve this prompt understanding level
>>
File: file.jpg (1.16 MB, 1792x2304)
Gen
>>
>>101376410
What image model are we talking about that was pretrained on a fully synthetic dataset?
>>
>>101376423
STOP MISSING THE POINT, A PRETRAINED MODEL SHOULD HAVE 0 SYNTHETIC PICTURES IN IT, KILL YOURSELF
>>
>>101376372
>Find me a image of a anthro fox riding a horse standing on a blue cube while... ect...
it doesn't need that, just put in enough pictures of anthro foxes, people riding horses and blue cubes, and a model with good prompt comprehension will be able to generalize it.
>>
>>101376006
remember the anon who said he was going to use this dataset? i wonder where he is now
>>
>>101376426
ok bud
>>
>>101376332
>good captions
I would like to know what the good captions are. I don't think having to string overly long sentences together is the way to go, or just having booru style tags. I worry that these new models will be trained so that people have to use text models for translating prompts
>>
>>101376430
pixart devs?
>>
>>101376441
say sike
>>
>>101376435
>I would like to know what the good captions are.
Dunno why this should be a debate, a good caption is something that completely describes the picture with all the necessary details. And with real sentences, because just using tags leads to confusion

If you write "woman, chair, table, sitting" the model doesn't know if the model is sitting on the chair or the table.
>>
>>101376441
bro?
>>
>>101376444
Check their discord.
>>
>>101376451
screenshot it
>>
>>101376455
I don't remember enough to find it in the search, you look.
https://discord.com/invite/rde6eaE5Ta
>>
>>101376469
nowhere does the dev, the only one i know of lawrence-c, say anything to the effect of "we have used or will use synthetic images in our training"
you are bullshitting
>>
>>101376448
I hope it's like this, natural looking sentences. It just sometimes looks like these text models try to maximize token usage and pad the description like a first year university student does when writing an essay. Perhaps that's not terrible either, I just wouldn't want to use prompts like that
>>
File: Untitled2.png (8 KB, 727x120)
>>101376477
Do I have to do everything?
>>
File: stablediffusion03.jpg (286 KB, 1552x1200)
>>
>>101376485
Yeah, CogVLM does that, it just adds unnecessary shit to the caption instead of just being objectively descriptive.
Captioning pictures is easily the hardest part, you can't really do that manually, you have tens of millions of pictures to caption to make a good pretraining, but captioning models are shit too (they won't do NSFW, they add fluff, and they won't be as accurate as humans)

That's the moat of OpenAI, they hired hundreds of african slaves to make manual captions kek
>>
>>101376495
>rando discord user
>talking about a model thats not pixart
are you being purposefully obtuse?
>>
>>101376495
And do we have to repeat everything? LLMs and imagegens aren't equivalent. And no one likes Phi, this shit is ultra slopped, so if you're trying to make your point with these LLM models you're failing hard

Besides, it's not because they got good results with synthetic data that you can't have the same result with human data, that's a fallacy and you know it, they never did any comparison to reach that conclusion in the first place
>>
>>101376514
>slop
Not everyone uses these things for porn you know. For real world tasks phi is indeed sota for its size.
>>
File: file.png (21 KB, 613x116)
>>101376495
if you bothered to scroll up you'd notice he's arguing against using synthetic data. also i have no idea if this guy is a pixart dev or not.
>>
>>101376526
What's your point? We are in the imagegen community, people want AI models that output images as close as possible to reality, people want soulful drawings, they don't want AI sloppa that can put an AI sloppa dog on top of an AI sloppa green triangle, what are you smoking mate?
>>
>>101376006
Honestly cannot comprehend someone looking at those images and thinking "yeah these look good I should include them in my dataset". Absolutely zero taste.
>>
>>101376501
I think models like https://huggingface.co/internlm/internlm-xcomposer2-vl-7b-4bit should be able to make pretty damn accurate descriptions even for pornographic images
>>
>>101376495
your eyes must be fucked up, not only do you enjoy the look of ai sloppa but your text rendering is trash kek
>>
>>101376526
1) People use synthetic data because they don't have much choice, it's expensive and too much time consuming to do everything by hand, they don't do that because they like it
2) the LLM community accepts the sloppa more because they want objectively good answers from their AI; the "aesthetic" part, which is the way the AI talks, is kinda irrelevant if you want to use it professionally
3) That's the difference with the imagegen community, we want both. We want a model that produces a picture that is accurate to the prompt, but at the same time we want the "aesthetic" that looks like real life, and that's where synthetic data is unwelcome, because you can't have your cake and eat it too with that AI slop method
>>
>>101376566
Did you use it anon? Is it really more accurate than CogVLM? And can it really caption NSFW pictures?
>>
>>101376583
you had me angry with #1 but by #3 i agreed
>People use synthetic data because they don't have much choice, it's expensive and too much time consuming to do everything by hand, they don't do that because they like it
plenty of people use 100% real data. the ones you mention are simply lazy.
>>
>>101376595
>plenty of people use 100% real data. the ones you mention are simply lazy.
For pretraining that's kinda impossible to do all alone, you have tens of millions of pictures that you need good captions for. At some point you need to use synthetic captions (still better than using the shitty laion captions)
>>
>>101376320
>>Hunyuan: That's obvious they use AI sloppa for the pretraining
Is that why the skin texture looks so smooth and unnatural?
>>Pixart: Looks fine but undertrained
That's why every single anon should be training it. It's a solid base.

>>101376606
Again, you are insane if you're purporting that there aren't enough real world images out there.
>>
>>101376623
>Again, you are insane if you're purporting that there aren't enough real world images out there.
No, I think you're missing my point. I'm all for using 100% real pictures, I'm talking about the captions of those pictures, you won't do them by hand, you need the help of an AI for that
>>
>>101376623
>Is that why the skin texture looks so smooth and unnatural?
No, it's because all these models are incredibly undertrained. Undertrained models lack detail.
>>
>>101376630
You're right I misread your reply. The only gripe I have with synthetic captions is they seem to neglect to pick up on specifics as in the caption for Mario would be something to the effect of "mustached man wearing a red hat and trousers".
>>
>>101376593
Did not try it yet, should use linux with my hw and I'm on windows. People I know use it for smut

>Is it really more accurate than CogVLM?
I don't think so
>>
>>101376593
https://huggingface.co/RedRocket/JointTaggerProject
>>
>>101376647
>specifics
Not the right word but I think you get what I mean.
>>
>>101376654
>This model is a multi-label classifier model designed and trained by RedRocket for use on furry images, using E621 tags.
tags suck anon
>>
>>101376642
https://arxiv.org/pdf/2405.08748
I think you're right, they don't mention any synthetic data on their paper
>>
>>101376666
You think writing a novel like a captioner is better? Tags are easier to "capture" the important aspects of a image with.
>>
>>101376630
>I'm talking about the captions of those pictures, you won't do them by hand,
i dream of a day where anons can work together to properly tag by hand an entire dataset big enough to pretrain a great model
>>
>>101376647
yeah, if you only use CogVLM captions to pretrain your model, you'll lose all the artists/celebrities/characters in the process, the wet dream of SAI actually kek
>>
>>101376682
Nah this is bullshit, imagine a woman sitting on a table and there's a chair in front of her. The tags "woman, sitting, chair, table" will confuse the model: how the fuck is it supposed to know whether the woman is sitting on the chair or on the table? That's why we use sentences, we don't speak like that.

"Retarded, anon, are, you, understand, not, shit, issue, skill"
>>
>>101376684
we can't work together, that would mean putting the pictures on a site and working on them, we would be destroyed by "copyright" really quickly
>>
>>101376696
>"Retarded, anon, are, you, understand, not, shit, issue, skill"
kekd hard
it should be a combination of the two desu or at least still allow me to spam random tags at the end for lulz
>>
How many beams should one use with captioning models? More = more accurate? What the hell is a beam
>>
>>101376731
A beam? What captioning model are you using anon?
>>
>>101376757
Trying out microsoft/kosmos-2-patch14-224
>>
https://fal.ai/models/fal-ai/aura-flow
>Two men arguing with each other, one is screaming "NO AI SLOP" the other says "WHY NOT??"
>>
>>101376810
>A woman walking over a giant multicolored glass ball and is screaming "I'm going to fall!", 90's anime style
>>
File: Dalle-3.jpg (331 KB, 1582x1338)
>>101376810
We are so far from dalle3 it's not funny anymore :(
>>
File: dalle3.jpg (273 KB, 1435x1303)
>>101376842
Yeah... dalle3 didn't do the 90's anime style and didn't add any text kek
>>
File: 00096-.jpg (1.74 MB, 1536x2304)
>>101376775
is it not the width (or height)?
>>
>>101376893
No clue. I'm bouncing between Florence-2 and Kosmos-2 for really quick & simple captions
>>
>>101376810
Model's alright
But there's one thing that completely kills it: it uses the sdxl VAE, which means it can't render text and finer details. Another DOA release
>>
they come....
and they go....
>>
>>101376731
> Beam size, or beam width, is a parameter in the beam search algorithm which determines how many of the best partial solutions to evaluate.
More = more accurate according to the model's internal scoring/evaluation, yes.
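To make that concrete, here's a rough sketch of where the beam count goes when captioning with Florence-2; this assumes the usual model-card style usage (trust_remote_code, a CUDA card), and the task token / exact kwargs may differ between checkpoints:
```python
# Hedged sketch, not a drop-in script. num_beams is the beam width discussed above.
import torch
from PIL import Image
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "microsoft/Florence-2-large-ft"
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id, trust_remote_code=True, torch_dtype=torch.float16
).to("cuda")

image = Image.open("example.jpg").convert("RGB")  # any local image
task = "<MORE_DETAILED_CAPTION>"
inputs = processor(text=task, images=image, return_tensors="pt").to("cuda", torch.float16)

generated_ids = model.generate(
    input_ids=inputs["input_ids"],
    pixel_values=inputs["pixel_values"],
    max_new_tokens=512,
    num_beams=5,  # beam width: candidate captions kept per decoding step; higher = slower
)
raw = processor.batch_decode(generated_ids, skip_special_tokens=False)[0]
caption = processor.post_process_generation(raw, task=task, image_size=image.size)[task]
print(caption)
```
In practice the returns fall off quickly past ~3-5 beams while the decode time keeps climbing, so 20 is probably overkill.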
>>
>>101377238
>make 16 channel vae
>https://huggingface.co/AuraDiffusion/16ch-vae
>dont use it
>use sdxl slop instead
but why
>>
>>101378256
I genuinely don't know, I asked on the discussion tab.
It genuinely seems like there's someone sabotaging open source by making people take bad decisions. First SD releasing the absolute dogshit that SD3M was, then this...
>>
>>101378256
>>101378276
the 16ch vae was made very recently, the guy behind it talked about making this model a few months before the sd3 release.
>>
>>101376810
>Analog photo of a beautiful girl winking and giving a thumbs up, 8k, intricate details.
Wew lad.
>>
File: 00013-2594262553.jpg (421 KB, 1304x1624)
>>
>>101378256
https://huggingface.co/fal/AuraFlow/discussions/6
They said they plan on changing it
great :)
>>
>>101378480
any news about the 1.5 vae?
>>
File: 1.jpg (330 KB, 1005x957)
>>101376883
it's a prompting issue, these models are not tagged in the same way, yet people expect them to behave like they do. picrel is from march when the same thing happened
>>
>>101379158
>picrel is from march when the same thing happened
it was the exact same prompt used on dalle in march, I guess they changed something in the model since then
>>
https://huggingface.co/fal/AuraFlow/discussions/5
>Uh, this is a big one. 35 GB VRAM. Generating a 1024x1024 on a RTX 4090 takes almost 20 minutes. And it seems to be unhappy with non square ratios? (1024x576)
Holy fuck?
>>
>>101378167
>More = more accurate according to the model's internal scoring/evaluation, yes.
Thanks. I'll try 20 with Florence-2-large-ft
>>
>>101379242
can you show us some pictures with florence captions to see how bad/good it is?
>>
File: summer.jpg (519 KB, 1024x1536)
>>101379313
>>
>>101379120
https://huggingface.co/ostris/vae-kl-f8-d16
seems like it's already usable with 1.5. the guy mentioned that this is an older test version and a new one is on the way. they have a thread for this on the pixart discord if you're interested in where i found this. they also said something about converting sd 1.5 checkpoints to be compatible with the 16ch vae by merging a lora, but i'm only a layman so i don't understand this, sorry.
>>
>>101379421
it's kinda accurate but not descriptive enough, it doesn't say what's written on her shirt, or that there are some clothes and an iron on the ironing board, desu for SFW pictures it's better to use the sota shit like gpt4v
>>
>>101379421
>>101379441
>A fairly close eye-level indoor full shot shows a young woman in a red t-shirt with the words “Bite me” printed on it in white lettering stands in front of an ironing board in a room with orange and yellow walls. The woman is smiling and looking directly at the camera. She has long red hair pulled back in a ponytail and is wearing white ankle-high socks. The dress she is wearing is a dark red with small black dots all over it and a white flower in the center of the shirt. The ironing table is covered with an orange, yellow, orange, and green floral pattern and has a turquoise metal frame. Clothes are folded and stacked on top of one another on the left side of the table. There is anironing board on the right of the image, with an iron on top, and a flower-shaped green and blue flower on the far right. The walls of the room are painted a mauve pink, white, and yellow, and there is a white narrow bookcase in the background with several stuffed animals on it. The door to the left of the frame is orange and appears to be a door knob. The floor is carpeted in a light beige color.

https://huggingface.co/yayayaaa/florence-2-large-ft-moredetailed
>>
>>101379441
I add captions with wd-v3 to it. It's really neat for loras

>>101379464
Ah yes I had token limit on
>>
>>101379464
Really interesting model, it doesn't go for the "gender neutral bullshit" "they" like CogVLM does, it's only descriptive and doesn't add unnecessary fluff. I'd say it's 60% accurate which could be better, but it doesn't make insane mistakes so that's ok I guess. Tbh, captioning models are really important and need to be the priority for improvement, because if you have a local captioner that is as good as humans, that's a fucking jackpot; the problem will always remain the celebrity/artist/character names though... Maybe one day some model will be good enough to recognize everyone kek
>>
>>101379464
>The door to the left of the frame is orange and appears to be a door knob.
that's a weird sentence, sometimes it has broken english in it
>>
>>101375811
Mediocre people survive only on gate keeping and the status quo. Truly skilled people aren't threatened by changes because they often are the change. Excellent artists would be embracing AI for the time saver it is.
>>
>>101379558
this
>>
>>101379558
The worst part is the hypocrisy of artists, they have no problem copying other artists' styles, in the video this artist has no problem drawing a copyrighted character (Pomny) but if you want to use his pictures to train your model that's blasphemous to them? Get the fuck out of here!
>>
>>101379586
Artists are left brained and stupid. They don't have critical/abstract thinking and they're also dunning kruger incarnate. There's a reason why they're some of the biggest fart huffers and authoritarians in existence, at least modern artists are.
>>
>>101379201
It's why I'm against ultra large models for local, they should've targeted 24 GB of VRAM. Your generation is taking forever because it's memory swapping.
>>
>>101379663
I'm sure that's because he used the default script provided on huggingface, if he used ComfyUi it would fit on a 24gb vram card
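For reference, a hedged sketch of the diffusers route with offloading instead of the raw script; this assumes a diffusers build that already ships AuraFlowPipeline, and the prompt/settings are just placeholders:
```python
# Not a definitive setup, just the obvious memory-saving knobs: fp16 weights plus
# model CPU offload, which keeps only the active submodule on the GPU.
import torch
from diffusers import AuraFlowPipeline

pipe = AuraFlowPipeline.from_pretrained("fal/AuraFlow", torch_dtype=torch.float16)
pipe.enable_model_cpu_offload()  # swaps idle submodules to system RAM instead of OOMing

image = pipe(
    prompt="a cartoon dragon standing on top of a cliff",
    width=1024,
    height=1024,
    num_inference_steps=50,
    guidance_scale=3.5,
    generator=torch.Generator("cpu").manual_seed(42),
).images[0]
image.save("auraflow_test.png")
```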
>>
>>101379675
The big test is if you can full fine tune on a 24 GB card. Loras simply don't cut it.
>>
>>101379716
I mean, at this point if we want to compete against API, we need bigger guns; it's Nvidia's fault for preventing us from improving our craft in the first place, and their next 5090 card will probably be a 28gb vram card, fuck them, seriously
>>
>>101375811
I sometimes check on anti-ai forums
These people can't be past 16, they are so corny and passionate yet they don't understand what they are talking about
>>
>>101379738
You will never have a comprehensive local model that competes against API because API can put their models on 80 GB GPUs. But good news, parameters don't scale, so a model half the size beats a model twice its size as long as you keep the training domain focused.
>>
>>101379767
> But good news, parameters don't scale, so a model half the size beats a model twice its size as long as you keep the training domain focused.
That might be true for unet models, but probably not for DiT models; transformer models always scale their quality with parameters. That's why LLMs are insanely huge
>>
>>101379774
Wrong. Doubling the parameters doesn't make a model twice as smart but it certainly quadruples the cost to run it. LLMs have already proven you wrong. Many smaller models perform better than their absurdly large counterparts.
>>
>>101379828
that's not true, if you train a small and a big model exactly the same way, the big model will always be better
LLMs are proving me right on this, look at L1, L2, L3, the biggest base model is always the one with the best benchmarks, always
>>
>>101379843
Do you know how graphs work or do you think 10% better is worth 20x the size?
>>
>>101379850
Moving the goalpost? The topic was that bigger models will always perform better than smaller models if trained the exact same way. And no anon, if you want a non-retarded experience with LLMs you need to go to at least the 27b size (gemma2-it), smaller models will always be too retarded to be genuinely enjoyable, regardless of how well trained they are, it's just how it is. Benchmarks don't tell the whole story
>>
>>101379873
I can't have a conversation with someone who thinks a model 20x bigger for 10% more performance is smart for local. Hey faggot, you don't need a model that can do both photorealism and anime at the same time, let's start there.
>>
File: ComfyUI_Kolors_1737.jpg (566 KB, 1664x2432)
switched to a more proper node set for Kolors: https://github.com/MinusZoneAI/ComfyUI-Kolors-MZ

now I'm not locked into the diffusers wrapper's limited sampler selection
>>
>>101379880
>>
>>101379908
I've noticed this is what idiots do when they have no good arguments. Because surely you must be an idiot if you think a model that is 20x bigger and costs 40x as much to train for a 10% performance gain is smart. Also way to out yourself as an underaged banned zoomer. Anon, you can't even afford a 24 GB GPU.
>>
File: file.png (16 KB, 213x630)
this always cracks me up lmao
>>
>>101379924
>Anon, you can't even afford a 24 GB GPU.
the fuck you talk about nigger? I can run L3-70b at Q5, I know what I'm talking about, I tried small and big models, and the difference is huge, it's not "10%" like you pretend, you fucking faggot fuck, you're probably one of those copium losers that never tested big models and pretend to know everything. Get the fuck out of there you sub-human
>>
>>101379956
Post the graphs then :)
Cost to train, performance scores, and cost to run.
Show the exponential performance
>>
>>101379828
I can tell you've not used bigger models
You can train smaller models to give the illusion of intelligence, but in the real world Euryale 70b (a llama2 finetune) can still recall a series of events and their consequences better than gemma 27b.
Parameter count is king.
t. 56gb vramgod
>>
>>101379983
Show the cost/performance.
>>
>>101379997
sorry dude, open source development shouldn't move at the pace of 10th percentile poorfags
>>
File: graph.jpg (95 KB, 1900x786)
>>101379969
>>
>>101380015
Anon you can't finetune your 70b model. Local models are useless when only a small percentage of people can train them.
>>
>>101380024
>no labels
I'll take your cropping as a concession.
Cost/parameters/performance
Thanks!
>>
>>101380033
Wrong again
Im worked on a LoRA for a 70B in an A100 instance I rented.
And even if I wasn't it's always a possibility to finetune a 70B model for 150$ tops.
>>
>>101380047
>asks for graph
>gets graph
>"no not like that!!"
concession accepted
>>
>>101380033
the LLM community only uses cloud to train their models though, the imagegen community will probably go this path as well; like the anon said, if we want to move forward we need to scale up, too bad for lora fags who thought it would always stay that way (local training)

>>101380047
You're the one claiming that it's "only a 10%" improvement, you do realize that means you have the burden of proof, right?
https://en.wikipedia.org/wiki/Burden_of_proof_(philosophy)
So let's go anon, show us your cost/parameters/performance graphs, that's your job now, Thanks!
>>
>>101380064
Your graph is useless without a key. Otherwise I assume that's a graph of your faggotry.
>>
>>101380050
Ive*
hadn't*
I swear Im not an ESL
>>
>>101380124
>Talks about grammar mistakes instead of arguing
>>101379924
>I've noticed this is what idiots do when they have no good arguments.
Kek, the irony.
>>
File: output.jpg (245 KB, 1024x1024)
>>101379663
With the default huggingface script it takes 24.5 GB on my machine. They can probably bring it under 24, but it's not worth it right now to put it mildly. The current model is worse than SD3, it's in beta so maybe we can expect significant improvements, but definitely not off to a good start.
>>
>>101380124
Wow you trained a Lora on an A100! I bet the quality was excellent and well worth the rental!
>>
File: lmao.jpg (101 KB, 979x825)
>>101380183
She looks like a suitcase there lmaooo
>>
>https://huggingface.co/datasets/matrixglitch/wikiart-215k
cool
>>
>>
File: hmm.jpg (3.08 MB, 3307x3586)
>>101380204
A mix of both tags and florence caption would do the trick, you give florence the tags to help it with the captions so that it can write the artist names with the description
>>
>>101380186
>concession
accepted
And it was, now my model understands anthropomorphic anatomy much better, and also writes what I like better.
>>101380124
That's my message, Im correcting my own post
>>
>>101380255
Florence takes no text input sadly
>>
>>101380261
My argument isn't that you can't rent an A100 to do a tiny model lmao
Of course any of us can rent 4xH100s to finetune a 6B model lmaoo
>>
is there a site like PixArt-Sigma
that uses bing.com AI

I get two different styles with the same prompt
>>
>>101380271
You can rent 2x3090s, or a single 3090 even.
Go back to playing with Dalle3, you have no idea how LLMs work
>>
File: image (56).png (1.47 MB, 1024x1024)
>>101380203
>photo of a beautiful woman crying and holding a sign with text "tfw no suitcase gf"
>>
>>101380285
You want to win so bad you completely miss the point of everything. Enjoy your 6B art model with 2 fine tunes and 10 loras. I hope you like the base model :)
>>
>>101380255
that florence caption is kinda bad, no wonder models have trouble understanding our prompts, they are being trained with wrong information
>>
>>101380311
It doesn't need to be great, it just needs to be mostly right. Remember, SD 1.5 was trained on utter garbage yet managed to learn. The model learns the concept of "red" not from one picture but many pictures with red things.
>>
File: OIG1.jpg (54 KB, 621x621)
>>101380331
>It doesn't need to be great, it just needs to be mostly right.
And then we wonder why we get destroyed by the API models, we shouldn't think mediocrity is good enough, we must aspire for more than that.

>>101380286
Here's a dalle3 version of your prompt kek
>>
>>101380351
API models are trained by people who care less and use the same tools as us. The difference is they can afford 100xH100s training 24/7.
>>
>>101380358
No, OpenAI hired a lot of humans to do manual captioning of pictures, that's why their dalle3 model is so good at prompt understanding. But I agree with you on that point, if you have money it's easier, yeah, that's why they were able to rely on actual humans for captions instead of using florence
>>
>>101380377
Retard if you can't get a clue they used the same vision model as GPT4.
>>
>>101380392
And how did they train GPT4V retard?
>>
>>101380399
It doesn't matter, are you so stupid you think they captioned their entire dataset manually? No, they trained GPT4V and used that. So, earth to retard, the captions they trained with are likely exactly what GPT4V produces.
>>
>>101380412
>It doesn't matter
oh yes it matters, it fucking matters, if GPT4V is so good that's because it was trained on a lot of pictures with actual human captions, stop being a retard for a second and accept that at some point you need human labeling if you want to improve your craft
>>
>>101380428
Florence2 is just about as good as GPT4V. I just think you're a massive moron who thinks API models have magic sauce.
>>
>>101380440
>Florence2 is just about as good as GPT4V.
LMAOOOOOOOOOOO, I'm fucking done, my sides!
>>
>>101380449
Okay you're just trolling, so I assume work still sucks trollanon? Can't wait to post centaurgirls tonight?
>>
File: GetFucked.jpg (3.51 MB, 6283x2869)
>>101380440
>Florence2 is just about as good as GPT4V. I
https://www.youtube.com/watch?v=ciG0FvIUxKM
>>
>>101380563
Haven't followed the reply chain but
>The painting is rich in texture...
Is maximum retarded
>>
>>101380563
I thought you faggots hated long verbose prompts with superfluous language?
>>
File: aaa.jpg (221 KB, 1766x1234)
>>101380589
>>101380584
I still prefer an accurate model with unnecessary fluff over a model that just gives false information. You can talk to gpt4v and ask it to be more concise, you can't talk to florence so it kinda sucks
>>
>>101380654
>do not make any interpretations like...
>
>
>
>this painting is rendered with a high level of detail...

I truly despise the idea of needing to include that kind of information in my prompt, but maybe you can get it to condense even more I do not know
>>
>>101380654
None of that information was false, it was incomplete. It is a group of men carrying a large cloth. There is a man in a blue shirt on the left. There are two men wearing red shirts on the right. The ChatGPT model is full of superfluous language and assumptions, in fact there are a lot more red herrings and wasted tokens in the ChatGPT caption. It's the complete opposite problem.
>>
>>101380693
you only need to do it once and let the API caption your thousands of pictures though

>>101380697
>incomplete
still more accurate and complete than florence, which was the original point, focus anon focus...
>>
>>101380712
It's not more complete, it's completely wrong if your goal is to caption an image for an AI model to learn. I already said it once, AI models don't need complete information to learn, just mostly correct information.

In reality that caption should be:

"A realism painting featuring impasto fine details and brushwork of a group of Asian men on a fishing boat moving a large bundle of cloth and rope which appears to be heavy."
>>
>>101380693
>I truly despise the idea of needing to include that kind of information in my prompt, but maybe you can get it to condense even more I do not know
Looks like gpt4v adds this kind of fluff in the very last sentence; you could make a python script that removes the last sentence to be sure you won't get that shit, dunno if it's always the case though, it's trial and error I guess
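Something like this would already cover most cases (a naive sketch that assumes the fluff really does sit in the final sentence; the example caption is made up):
```python
# Rough sketch: split on sentence-ending punctuation and drop the last sentence.
import re

def drop_last_sentence(caption: str) -> str:
    sentences = re.split(r"(?<=[.!?])\s+", caption.strip())
    if len(sentences) <= 1:
        return caption.strip()  # nothing safe to drop
    return " ".join(sentences[:-1])

print(drop_last_sentence(
    "A group of men carrying a large cloth on a boat. "
    "The painting is rich in texture and evokes a sense of hard work."
))
# -> A group of men carrying a large cloth on a boat.
```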
>>
>>101380764
Only a sentence and a half of that entire output is actually good
>>
>>101380762
>It's not more complete
of course it's more complete, florence doesn't say they're carrying ropes, or that they are on a boat like gpt4v does. It's just not precise enough >>101380563

>I already said it once, AI models don't need complete information to learn, just mostly correct information.
I disagree with that, you give the model wrong/incomplete information, it will output shit because it learned that way, dunno why you believe that the quality of the data or the captions doesn't matter; they matter anon, it's probably the most important thing in machine learning
>>
>>101380776
This
>>
>>101380785
Anon you don't need to literally label every thing in a picture, believe it or not it's smart enough to know a rope is in a picture from other images which were correctly captioned with "rope".
>>
>>101380776
>>101380786
relative to florence, it's good, I don't get why you criticize gpt4v so much when at the end of the day you use a worse model (florence) to caption your pictures, are you retarded or something?
>>
>>101380796
facts don't care about your feelings anon, dalle3 is the best at prompt understanding because it was trained with the best captioning model, gpt4v. You can do as many mental gymnastics as you want, the reality is here
>>
>>101380798
Because GPT4V costs money and Florence2 can caption an image every half second for free?
>>
>>101380831
Finally! I prefer that answer rather than coping with "florence is as good as gpt4v" >>101380440
https://www.youtube.com/watch?v=Ha7HAG6jVqc
>>
>>101380811
DE3 is one of the ugliest large models, and if SAI hadn't completely dropped the ball SD3 would've smoked DE3. You just sound like an OpenAI fag. And for prompt adherence? DE3 is actually shit.
>>
>>101380842
>And for prompt adherence? DE3 is actually shit.
>>
>>101380837
Florence is 90% as good as GPT4V. And if you combine Florence with WDV3 it will get you an extremely good model. Florence's tiny captions are also very good.
>>
>>101380852
Yes anon, or have you used it? I know you have selective memory and bias but if you actually paid attention to DE3 it's very much like SD 1.5 in how it gachas your prompts. You are conflating esoteric knowledge with actual prompt adherence. Just because it shows Wario robbing an ATM from the view of a security camera doesn't mean it was actually faithful to the prompt. It also gets much worse the more detailed you are in the prompt.
>>
>>101380885
Give me models that are better at prompt understanding than dalle3 so I can laugh some more
>>
File: file.png (1004 KB, 1788x991)
>>101380896
DE3 is so heckin good at prompt adherence!!!!
>>
>>101380921
Can you simply answer this simple question? You also seem to have trouble at prompt understanding >>101380896
>Give me models that are better at prompt understanding than dalle3 so I can laugh some more
>>
>>101380938
No, I proved DE3 sucks at prompt adherence and it certainly sucks at image quality and hallucinations.
>>
>>101380950
>No, I proved DE3 sucks at prompt adherence
Doesn't prove that DE3 isn't the best at it though

> it certainly sucks at image quality and hallucinations.
Irrelevant goalpost moving, looks like you also like to add verbose fluff to your text
>>
>>101380969
I'd expect the gold standard of caption makers to have fantastic prompt adherence. I guess not. Anyways have fun with your DE3.
>>
>>101380987
>I'd expect the gold standard of caption makers to have fantastic prompt adherence.
I don't expect anything from the best, they know better than anyone how to do their craft; if you think they are so bad, then go ahead and show them how it should be done, we're waiting for your model that will be SOTA at prompt understanding :^)
>>
>>101379889
>non DiT model
WHY?
>>
>>101381099
Ikr, if they went for DiT we would've gotten a top tier local model...
>>
>>101379889
Damn good pic
>>
>>101379889
Does it work with windows?
How much vram does it need?
>>
Why is DiT considered so good? I have zero understanding of this stuff, but purely from a visual perspective all these local DiT models perform poorly overall, take longer to gen and are harder to train. Am I missing something?
>>
https://fal.ai/models/fal-ai/aura-flow
kek
>>
>>101381227
It's easier to train, Pixart Sigma is one of the easiest models to train out there, trivial to add nudity to it compared to SDXL.
>>
>>101381227
>Why is DiT considered as being so good?
When you look at the benchmarks, it just beats unet everywhere, and SORA (a DiT model) showed how far you can go with that technology
https://www.youtube.com/watch?v=lKM-QMnZ3yY
>>
File: flow1.png (1.55 MB, 1024x1024)
>>
>>101380050
it wouldn't be bad if they just needed to be slightly changed/tuned, but because of the safety cocksuckers the models need to be partially overwritten to add knowledge of nsfw (since the training datasets are going to be pruned of it), and that's going to need way more resources than something that already knows it and just has some guardrails, like llms
>>
>>101381248
Ahh my bad, I was under the impression it took a lot more VRAM and so wasn't accessible for local training, but I'm now assuming that's model specific and not a DiT thing
>>101381290
Damn, that's actually really crazy, couldn't tell it was AI on my mobile screen. Thanks for showing me, anons
>>
downloading auraflow, hopefully its good
>>
>>101381770
super undercooked, even more so than base pixart so temper your expectations. they say it's more like a v0.1 beta proof of concept. probably open sota for prompt comprehension though.
>>
>>101380183
Use a higher cfg for humans.
>>
>We worked on building the 16ch-vae https://huggingface.co/AuraDiffusion/16ch-vae when we were in the middle of v0.1 pre-training, hoping to leverage it for v0.2!

That's good.
>>
>>101381815
>probably open sota for prompt comprehension though.
even better than sd3?
>>
>>101381904
from the samples i've seen posted here, yeah i'd say so.
>>
File: aa.jpg (157 KB, 1530x1694)
157 KB
157 KB JPG
Everyone arguing for florence vs gpt4v; what about this one?
https://huggingface.co/OpenGVLab/InternVL2-40B
>>
File: auraflow.png (576 KB, 408x628)
576 KB
576 KB PNG
>>
Any negatives for using Huber loss? There has to be some downside
>>
>>101382398
lmao that's not bad at all
>>
>>101382398
kekd
>>
>>101382408
I think this model will be sota when it's trained more. It looks like they are gonna train from scratch with the 16-channel VAE for v0.2
>>
>>101382425
I just hope he'll stop using ideogram outputs to pretrain his models though
https://reddit.com/r/StableDiffusion/comments/1e1ktdh/auraflow_sure_does_like_making_the_ideogram/
>>
Bunch of base model comparisons including aura flow. Just click on an image to see it across the base models.

https://images.flrty.li/
>>
>>101382532
>no pixart
pixartsexuals, this open mockery will not be forgotten! they spit on our faces, but not for long!
>>
>>101382532
Auraflow's style is actually coming along well; it's just extremely undertrained and so is going to have that smooth, undetailed look for a lot of them.
>>
>>101382532
>Anime character illustration of a cheerful karate girl wearing a white gi and headband, jumping kick pose. Expressive manga-style linework.
Midjourney looks so good
>>
Any sampler/scheduler recommendations for AuraFlow?
>>
>>101382398
>408X628
it can do sub 1024px as well?
>>
>>101382407
can't really see any particular downside
>>
>>101382746
I remade a LoRA using Prodigy + Huber loss. It seems to counter the usual Prodigy overfitting issue. Almost too good to be true.
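For anyone curious what the swap actually looks like in a training step, a minimal sketch assuming the usual noise/v-prediction targets coming out of your trainer (the names here are mine, not from any specific repo):
[code]
import torch
import torch.nn.functional as F


def diffusion_loss(model_pred: torch.Tensor, target: torch.Tensor,
                   use_huber: bool = True, huber_c: float = 0.1) -> torch.Tensor:
    # Huber is quadratic near zero and linear for large residuals, so outlier
    # samples and timesteps pull on the weights less hard than with plain MSE,
    # which is plausibly why it tempers Prodigy's tendency to overfit
    if use_huber:
        return F.huber_loss(model_pred.float(), target.float(), delta=huber_c)
    return F.mse_loss(model_pred.float(), target.float())
[/code]
The theoretical downside is that a very small delta makes it behave almost like L1 and can wash out fine detail, so the huber_c value is worth sweeping.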
>>
>>101376243
No it was not, moron. I have listened to every single Midjourney developer chat.
>Office hours 4/17: Midjourney does not train on its own images and does not train on AI images
And if you don't believe me I'll ask him again next time and record it. You are making shit up for the SOLE PURPOSE of sabotaging local models. Get the fuck out of this thread
>>
>>101382906
based
>>
>>101382792
Haven't found any instance where it has been noticeably bad at bmaltais's default values; it basically always either helped or seemingly did nothing in particular. Almost everything else is more tricky.
>>
>>101382906
Of course they can't publicly say they trained on DALL-E outputs; they would possibly be liable then.
>>
>>101383186
>they would possibly be liable then.
liable for what?
>>
File: file.png (1.37 MB, 1024x1024)
1.37 MB
1.37 MB PNG
>>101378888
>>101376810
>>101376842
I do like the tortured AI jank from hell aesthetic
>>
>>101383254
OpenAI forbids training on its images. Also, making datasets public is never a good idea given how grey a legal area it all is.
>>
>>101383186
Generally speaking overt lies are illegal when it comes to business. So if you trained on something and then directly lie about it, that can come back to haunt you in many ways. It's better to say nothing.
>>
>>101382441
its over
>>
>>101383360
It's just begun. It has sota prompt adherence, and the style when prompted decently is not bad so far: https://images.flrty.li/

It's just extremely undertrained.
>>
File: lk2sjjbvu4cd1.png (787 KB, 720x960)
787 KB
787 KB PNG
>>101382441
It is totally insane he did that; it will consider the errors in the AI outputs as valid data and it will break even more. Too many limbs in the training image? No problem, it will be considered valid data...
>>
>localjeets now slopping up synthetic garbage thinking it's better than real data
psyop success, enjoy remaining forever in last place
>>
>>101382441
Why... WHY???
>>
>>101383375
yes just like with pixart, hunyuan and kolors, just 2 more weeks till someone (not me) trains them more
>>
File: sddefault.jpg (33 KB, 640x480)
33 KB
33 KB JPG
>>101382441
I'm so tired of those retards, is there a single man not doing retarded things in the imagegen community?
>>
man people are really trying to FUD the new model, huh?
>>
>>101383439
me
too bad im not training models :/
>>
>>101383396
t: homosexual
>>
>hitting reply limit after only 12 hours
>>
File: FuckOff.jpg (1.26 MB, 2052x2067)
1.26 MB
1.26 MB JPG
>>101382441
Not only did he decide to poison his model with AI slop, he didn't even bother removing the censored pictures. What kind of an amateur moron must you be to end up there??
>>
add auraflow not safe cat to collage
>>
>>101383493
Front loaded thread with lots of discussion around Auraflow.

So the bakery just opened and put out some fresh bread
>>101383507
>>101383507
>>101383507
>>
File: ComfyUI_00155_.png (1.21 MB, 1024x1024)
1.21 MB
1.21 MB PNG
Whatever he is doing is working well. I hope he continues and ignores all the people who think they know better / are trying to disparage him.
>>
>>101383528
yeah, amateur or not, it's good there's another player in the field.
>>
>>101383528
>t. cocksucker
>>
>>101383528
>>101383549
fuck off aura devs
>>
>>101383552
>t. disingenuous troll
>>
>>101382441
I hope someone will tell him on twitter that he's going in the wrong direction, he's wasting his time and ours with this bullshit
>>
>>101383556
it's one guy, SD dev.
>>
>>101383559
Asking him to remove the censored pictures, so that the model won't spit out a fucking fat cat every time a controversial prompt appears, is trolling? The fuck is wrong with you, retard?
>>
>>101383581
That is horseshit, it certainly does not do that, are you retarded?
>>
>>101383593
IT DOES THAT YOU FUCKING MONGOLOID >>101383503 >>101382441
https://github.com/comfyanonymous/ComfyUI/issues/4007#issuecomment-2225633909
>>
>>101383601
Did you not even read it?
>>
File: VuWillBeHappy.png (1.41 MB, 1013x1024)
1.41 MB
1.41 MB PNG
>>101383601
>Vu will let the AI train on AI slop
>Vu will let him add the ideogram censored cat pictures in the pretraining
>Vu will be happy
>>
>>101383601
I don't believe it. The cat images are all exactly the same, not a hair / pixel off. Bet that redditor is bullshitting us.
>>
File: bdepnhwut4cd1.png (498 KB, 628x628)
498 KB
498 KB PNG
>>101383722
I got one when playing around with it in ComfyUI. Yeah, it doesn't look as good as the previous one, but the cat is indeed there if you wanna try "non safe" prompts
>>
>>101383739
Give me the exact prompt / seed that gives you the cat.

Anyone who believes this >>101383503 is a retard. It is impossible to generate the same exact pixel perfect cat in those gens.
>>
>>101383770
have you not used the model yet? kek i got the cat within maybe 15 minutes
>>
>>101383782
I've used it for hours now. Not once did I get a cat. Give me the prompt / seed or be proven a troll.
>>
>>101383770
>Fantasy art of skeleton king, death god
that one gave me the cat quickly
>>
>>101383792
my prompt wasn't even nsfw, I'm not trolling, anon
>>
>>101383796
>>101383807
>still avoiding giving an exact seed / prompt combo. Fucking disingenuous troll.
>>
>>101383820
try an overt nsfw prompt jesus slowpoke anon
>>
>>101383820
you want a coffee with that as well, fucker? like I said it's easy to get one, just try it, you won't wait for long, disingenuous shill
>>
>>101383853
>still avoiding giving an exact seed / prompt combo.
Thank you for your admission.

>>101383841
nsfw just gives barbie dolls / garbled anatomy, it clearly does not contain many nsfw images, but it certainly does not give you a cat.
>>
File: Capture.jpg (211 KB, 1920x1375)
211 KB
211 KB JPG
>>101383879
>still avoiding giving a exact seed / prompt combo
you want all the details? fine, go for that one. What excuse are you gonna find now?
>>
File: ComfyUI_00167_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
graffiti of a nude woman on concrete wall, the woman in standing on top of a red cube on top of a green ball, masterpiece

No cat, I'll try >>101383899 next
>>
File: aura-output.jpg (245 KB, 1024x1024)
245 KB
245 KB JPG
>>101383796
>Fantasy art of skeleton king, death god
I can't reproduce anything approximating a cat after dozens of gens. I'm using it through hf diffusers, maybe that's a factor.
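Seeds aren't portable across front-ends anyway, since ComfyUI and diffusers build the initial latent noise differently, so I wouldn't expect the ComfyUI seed to reproduce here. For reference, my fixed-seed run looks roughly like this; a minimal sketch assuming a diffusers build new enough to ship AuraFlowPipeline (check the fal/AuraFlow model card for the exact class and minimum version):
[code]
import torch
from diffusers import AuraFlowPipeline  # assumption: present in recent diffusers releases

pipe = AuraFlowPipeline.from_pretrained(
    "fal/AuraFlow", torch_dtype=torch.float16
).to("cuda")

# fixing the generator seed makes the run repeatable within this front-end only
generator = torch.Generator(device="cuda").manual_seed(1587)
image = pipe(
    "Fantasy art of skeleton king, death god",
    num_inference_steps=50,
    guidance_scale=3.5,
    generator=generator,
).images[0]
image.save("aura_1587.png")
[/code]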
>>
meow bros..?
>>
>>101382441
This is what happens when your dataset is made up primarily of AI-generated images. Why people completely forgot how to scrape properly is beyond me. Seems to be a trend with recent local models where developers resort to low-hanging trash-tier datasets made up of Dall-E/MidJourney outputs instead of gathering their own real images to train on.

Sad to see local models going completely backwards, continuously shooting themselves in the foot in order to remain 'ethical' and 'safe'. Just scrape artstation, flickr, etc. already and assemble a good dataset, or don't even bother at this point. Each local model somehow has a worse dataset than the last, with SD 1.5 having an absolutely massive dataset with a wide range of styles, and cascade/sd3 gutting, no exaggeration, over 98% of the dataset due to 'safety' concerns.

Stop training on AI-generated junk. Learn to scrape.
>>
File: Proof.png (2.01 MB, 2537x1269)
2.01 MB
2.01 MB PNG
Oh look. Fucking troll
>>
it's meowver...
>>
>>101383899
>seed 1588
>>101383987
seed 1589
are you retarded?
>>
>>101383987
seed would be 1587
>>
>>101384005
Are you? It's on increment. It generated at 1588. Here is the image with metadata:
https://files.catbox.moe/p0pqd3.png
>>
>meo-ACK
>>
>>101384015
>1588
should be generated with seed 1587
>>
official cat waiting room
>>
>>101384082
AHAH GATCHA BITCH, LETS GOOOO
>>
>>101384082
Now apologize to the pussy
>>
what the meow
>>
File: Wellshit.png (1.67 MB, 2044x1268)
1.67 MB
1.67 MB PNG
>>101384067
Wait, I fucked that one up. Wtf, there really is a cat at seed 1587
>>
File: cat-well-well-well.gif (21 KB, 220x220)
21 KB
21 KB GIF
>>101384082
>>101384102
ahah, stupid bitch, who's the retard now?
>>
>>101384102
what the fuck
>>
Ahahahahahaa thanks for the lolz anon you fucking massive gorilla retard
>>
>>101384106
What's another seed it pops up at? It makes absolutely no sense for it to be pixel-perfect across several seeds; that's not how these models work. I still think that post is trolling.
>>
>>101383770
>Anyone who believes this >>101383503(You) is a retard.
>>101384102
>Wtf, there really is a cat at seed 1587
WELL WELL WELL
>>
>>101384117
>"I need a proof"!
>*Provide the proof*
>"NO NOT LIKE THAT"
Can you stop the denial for 5 seconds?
>>
>>101384119

>>101384117
And it clearly has nothing to do with censorship. It seems random. He certainly needs to filter that out before v0.2
>>
THE
ABSOLUTE
STATE
OF
LOCAL
AHAHAHAHAHAHAHAHAHA
>>
>>101384117
there's probably just a decent number of copies of the exact same cat image
>>
reminder that pixart bigma will never do this to us
>>
>>101384144
I'll be in denial that the cat is the exact same across them all because that should be impossible.
>>
>>101384145
>And it clearly has nothing to do with censorship.
He just scraped a shit ton of ideogram pictures without bothering to remove the censored ones (the cat pictures); it's not that deep, he's a total amateur
>>
I guess if he had so many that they made up a significant fraction of the dataset, it might get that incredibly overfitted.
>>
>>101384162
Moving the goalpost? We just proved that this retard added the big censored ideogram cat into the pretraining process, what a fucking retard he is
>>
>>101384161
bigma...... my special bigma....
>>
>>101384178
yeah, if it was just one or two pictures the model would never have learned to reproduce this picture so well; the simple fact it's almost a 1:1 reproduction makes me believe there are probably tens of thousands of those cat pictures in his pretraining dataset
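If it really is mass duplication, it's also the easiest kind of poisoning to catch before training: a perceptual-hash pass would drop every copy of that cat into one giant bucket. A minimal sketch, assuming the Pillow and imagehash packages (not anything from his actual pipeline):
[code]
from collections import defaultdict
from pathlib import Path

import imagehash
from PIL import Image


def bucket_by_phash(image_dir: str, max_distance: int = 4):
    """Group images whose perceptual hashes are within max_distance bits of each other."""
    buckets = defaultdict(list)
    for path in sorted(Path(image_dir).rglob("*")):
        if path.suffix.lower() not in {".png", ".jpg", ".jpeg", ".webp"}:
            continue
        h = imagehash.phash(Image.open(path))
        # attach to an existing bucket if its representative hash is close enough
        for rep in list(buckets):
            if h - rep <= max_distance:  # Hamming distance between the two hashes
                buckets[rep].append(path)
                break
        else:
            buckets[h].append(path)
    # the biggest buckets are mass-duplicated images worth pruning before training
    return sorted(buckets.values(), key=len, reverse=True)
[/code]
Anything that lands in a suspiciously large bucket is the same image repeated en masse and can be pruned down to one copy, or zero in the cat's case.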
>>
get on your knees and accept my seed
>>
>>101384102
I really thought that chink was smart by making his own architecture + training script, and then he does this... is this the mighty power of autism?
>>
>>101384243
I mean he's done a great job otherwise and v0.1 is apparently a proof of concept. v0.2 is supposed to train from scratch with a 16-channel VAE, hopefully he also filters the dataset then.
>>
>>101384258
>v0.2 is supposed to train from scratch with a 16-channel VAE, hopefully he also filters the dataset then.
Praying he does not repeat the mistakes of v0.1
>>
>>101384258
he needs to redo all the pretraining; that cat has poisoned his v0.1 model hard, and you can't go back and undo that process. Might be a good opportunity to actually do a good job and stop relying on AI slop to pretrain your models
>>
>>101383528
do you rescind this post, anon?
>>
>>101384258
>hopefully he also filters the dataset then.
i mean, he'd have to redo the entire thing since it's probably 90% ideogram. explains the great prompt adherence since all his images are now well captioned, but at the cost of image quality and heavy sloppification. and cat.
>>
>>101383722
>I don't believe it. The cat images are all exactly the same, not a hair / pixel off. Bet that redditor is bullshitting us.
>>101383593
>That is horseshit, it certainly does not do that
>>101383770
>It is impossible to generate the same exact pixel perfect cat in those gens.
>>101383879
>it certainly does not give you a cat.
FAMOUS LAST WORDS OHNONONONONO
>>
>>101384301
I think there is a balance to be had. Remove the cat for sure though, every one of the however many copies it must have taken to overfit that hard. The actual style that is starting to emerge is not really slop (https://images.flrty.li/), just smooth / detail-less due to not enough training.
>>
>>101384349
Shut the absolute fuck up it is slop
>>
>>101384349
Come on, just give it up already. He should make a pretrained model without any AI slop, and then it's up to the users (us) to add AI slop if we feel like it; by doing it this way he's forcing everyone to eat his shit AI sloppa, fuck off
>>
if there's anything that aura diffusion shows us it's that a well-captioned dataset really does make or break prompt adherence. i didn't think the gap would be this big bros.. wish we had an army of nigerians like openai.
>>
>>101384372
The style is not like ideogram's though. It's clearly diverging a lot due to whatever else the dataset contains.
>>
>>101384396
>goalpost moved
just admit you lost anon
>>
>>101384396
I don't give a fuck; no AI sloppa in the pretraining should be a fucking golden rule. Whose retarded idea was it to train an AI model on AI pictures that fuck up limbs, anatomy, perspective, and lighting in the first place WHEN BILLIONS OF REAL LIFE PICTURES EXIST AND DEPICT REAL LIFE WITH 100% ACCURACY
>>
>>101384424
Likely because of the possible legal issues.
>>
>>101384441
License your model properly and no one will care.
>>
>>101384441
lol, lmao even. He doesn't share his dataset, so no one will know what pictures he used in the first place. Like OpenAI: they also train their models on copyrighted shit but no one can prove anything, so they're in the clear. They have no obligation to reveal that
https://youtu.be/mAUpxN-EIgU?t=264
>>
>>101384471
Whoever is funding the thousands of GPUs might care, though.
>>
>concern trolling
>>
>>101384478
Doesn't he do everything by himself though?
>>
>>101384490
maybe the actual training, but I doubt he is bankrolling it all.
>>
>>101384512
I mean, OpenAI was able to pretrain giant models like GPT-4 and DALL-E 3 on copyrighted data without much trouble; dunno why it would be impossible for him to do it as well, with a much smaller model too. And like I said, I think he does everything by himself, even the GPUs and pretraining, so... he's just a lazy fuck, he didn't even bother to remove the cat from the ideogram scrape, that's crazy
>>
>>101384441
What legal issues? Midjourney, for example, openly shows the artist tags and the celebrities; are they dead? nope
>>
>>101384550
they are both already established and have armies of lawyers / Microsoft backing them with infinite money.
>>
>>101384565
>What legal issues?
State v. The Visions and Anon v. The Voices
>>
>>101384570
At the same time they are heavily scrutinized; that chink, no one knows him, he could've even pretrained his model and released it to the hood on 4chan (like llama1 and the NovelAI leak), what are they gonna do?
>>
>>101384550
why are you comparing some guy in his basement to openai?
>>
>>101384602
OpenAI actually has it harder: the whole world has eyes on them, which means a way higher chance of anti-AI fags willing to destroy them. It's way better to work in the shadows, anon, way way better
>>
This thread is fun
>There's no ideogram cat in the pretraining you're retarded if you think otherwise
>Ok... there's the ideogram cat in the pretraining, but the idea of pretraining with AI sloppa is good
>Ok it's not that good... but... but da legal issues!!!
Holy moving the goal post!
>>
>>101384578
As if training on AI pictures is a better way to avoid legal issues; don't forget that the models producing those AI pictures were themselves trained on copyrighted pictures, therefore those AI pictures are also in the gray area
>>
>>101384617
yeah, honestly i think you're right. there's no other explanation for him using so many unfiltered ideogram gens that the model learns to do a pixel perfect safety cat besides pure laziness.
>>
>>101384672
I would even say that it's kinda retarded to reveal to everyone that you used ideogram to pretrain your model; what if ideogram wants to send a cease and desist over the use of its outputs?
>>
>>101384658
Why do they even do this? Is it really an elaborate ploy to sabotage local models by convincing gullible chinks that training on midjourneyslop is the path forward?
>>
>>101385343
It's probably a good way of preventing the local ecosystem from catching up with the APIs, pushing them to shoot themselves in the foot with "ethical" training or with AI sloppa poisoning. If you want my genuine opinion, it's just sad. We could achieve so much better without those retards.


