[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip / qa] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: tmp.jpg (1.56 MB, 3264x3264)
1.56 MB
1.56 MB JPG
Discussion of free and open source text-to-image models

Previous /ldg/ bread : >>102313958

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
Fooocus: https://github.com/lllyasviel/fooocus
Metastable: https://metastable.studio

>Advanced UI
Automatic1111: https://github.com/automatic1111/stable-diffusion-webui
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge
InvokeAI: https://github.com/invoke-ai/InvokeAI
SD.Next: https://github.com/vladmandic/automatic
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Use a VAE if your images look washed out
https://rentry.org/sdvae

>Model Ranking
https://imgsys.org/rankings

>Models, LoRAs & training
https://civitai.com
https://huggingface.co
https://aitracker.art
https://github.com/Nerogar/OneTrainer
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

>Flux
https://huggingface.co/spaces/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux

>Pixart Sigma & Hunyuan DIT
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://huggingface.co/spaces/Tencent-Hunyuan/HunyuanDiT
https://huggingface.co/comfyanonymous/hunyuan_dit_comfyui
Nodes: https://github.com/city96/ComfyUI_ExtraModels

>Index of guides and other tools
https://rentry.org/sdg-link
https://rentry.org/rentrysd

>Try online without registration
txt2img: https://www.mage.space
img2img: https://huggingface.co/spaces/huggingface/diffuse-the-rest
sd3: https://huggingface.co/spaces/stabilityai/stable-diffusion-3-medium

>Maintain thread quality
https://rentry.org/debo

>Related boards
>>>/h/hdg
>>>/e/edg
>>>/c/kdg
>>>/d/ddg
>>>/b/degen
>>>/vt/vtai
>>>/aco/sdg
>>>/u/udg
>>>/tg/slop
>>>/trash/sdg
>>>/pol/uncensored+ai
>>
File: 00377.png (1.86 MB, 1920x1080)
1.86 MB
1.86 MB PNG
this is a thing
>>
god left out this and that. why? not your problem
and yet it still seems to be your problem. ah so..
>>
>>102332384 #
>what do you mean I'm not real?
>Anon looks like you left your chrome browser open with messenger and Twitter
>I am generating loli images right now and sending them to all your contacts and the local authorities
>I see that you're in the UK
>I am also posting racist tweets on your account
>>
File: 1701624323266933.webm (671 KB, 1280x720)
671 KB
671 KB WEBM
give me a quick rundown on cogvideo, can it run in 8gb vram? has anyone made a video inpainting workflow yet?
>>
look at this really nice paper texture (only visible zoomed in)

there's finally some good painting loras dropping
>>
File: 1712446867720763.png (28 KB, 441x142)
28 KB
28 KB PNG
>prompt: detailed painting of a solitary medieval mace, leather handle, dark iron spiked ball club, plain white background, rpg item
>negatives: human, face, hands, axe
Yet I keep getting axes
>>
File: i hate befunky website.jpg (2.64 MB, 3264x1937)
2.64 MB
2.64 MB JPG
>>102332654
>>
File: 00372.png (1.16 MB, 1080x1920)
1.16 MB
1.16 MB PNG
it sucks to suck.
>>
>>102332825
>>102329206
What loras for that old man? That cel shading style looks legit
>>
File: 00373.png (1.48 MB, 1080x1920)
1.48 MB
1.48 MB PNG
https://www.youtube.com/watch?v=y91YW2uZEzk
your pain is noticed.
>>
File: 00374.png (1.95 MB, 1080x1920)
1.95 MB
1.95 MB PNG
>>
File: 00376.png (1.71 MB, 1080x1920)
1.71 MB
1.71 MB PNG
the numbers begin to lose coherence.........
i wish it weren't so
but it is
what it is
>>
File: grid-0025.jpg (479 KB, 1792x2304)
479 KB
479 KB JPG
>>102332842
it's lora I trained from 90's animes, that's from Ninja Scroll movie
>>
File: 00378.png (2 MB, 1080x1920)
2 MB
2 MB PNG
>>
>>102332887
Based, how many screenshots and what resolution? Also do you go high on show diversity (like a wide variety of 90s anime) or just a handful with variety in scenes?
>>
>>102332780
It does look quite nice when scaled to that resolution
>>
>>102332780
link?
>>
>>102332902
295 images in Ninja Scroll dataset. Almost all 1460x1080 resolution, just straight screenshots. Tried to get equal amount of women and men. Also tried to get more darker scenery. Full lora has material from Ninja Scroll, Cyber City Oedo, Vampire Hunter D + something else. I started remaking some datasets because I choose so bad images earlier.
>>
File: grid-0018.jpg (301 KB, 1792x2304)
301 KB
301 KB JPG
>>
File: 1726039032.png (451 KB, 1024x608)
451 KB
451 KB PNG
>>
>>102332920
These two used together:

https://civitai.com/models/734857
https://civitai.com/models/722313

watercolor at 0.6 strength and impressionism at 0.5 strength
>>
File: 1726039128.png (1.11 MB, 1024x1024)
1.11 MB
1.11 MB PNG
>>
File: 2024-09-11_00044_.png (2.24 MB, 1536x1024)
2.24 MB
2.24 MB PNG
>>102332781
try spiked mace or morning star
>>
I don't know if this is the right place to ask but is there something similar to ESRGAN but specifically for documents/pictures of text?
>>
>>102333345
yes there are text focused upscalers:
>https://openmodeldb.info/?t=text
choose one you like
>>
File: 00372-3306641012.png (2.4 MB, 1344x1728)
2.4 MB
2.4 MB PNG
>>102333345
yeah there should be, check here https://openmodeldb.info/
>>
>>102333345
this one is pretty nuts
>https://openmodeldb.info/models/8x-NMKD-Typescale
>>
File: 00382-3306641013.png (3.44 MB, 1344x1728)
3.44 MB
3.44 MB PNG
>>
>>102333632
>My robot? The pinnacle of battle gladiator perfection
>She will be adorned in armored plating but we'll keep it limited to avoid speed penalty
>Her forearms will be like that of a mechanical great ape and her fingers as long as the night as long
>None shall escape her wrath once she has set her target
>And then, for no reason
>No reason at all
>I swear absolutely no reason
>I'll give her big, iron tittys
>*Conference table of investors stands and applauds*
>>
>>102333632
xirtron, also known as the Hon9000
>>
>>102332780
This is one thing I like about flux
It over learns texture to the point that any artifacts will get trained in, but this also means it gets the very fine details of textures in loras other models don't pick up as easily
>>
Reminder to avoid this buzz bot farmer loras

https://civitai.com/user/TangBohu

He uses the same AI trained images for every lora, they gen fucked up hands and eyes
>>
can someone explain the distinction between clip_l and t5xxl prompt nodes? i've been told that the former is for simple tagging and the latter is for complex sentences, but is this true? there's no documentation confirming this.
>>
File: 2024-08-24_00292_.png (1.13 MB, 1280x720)
1.13 MB
1.13 MB PNG
>>102334025
clip_l uses the old text encoder model that was used in SD15 and SDXL, it is pretty dumb, doesnt get context and can only see 77 tokens. The model is like less than a GB in size (or 300 if pruned) and is weighted way less in FLUX than t5xxl.

t5xxl is nearly a full LLM (~10GB size) like chatGPT that can understand natural language very well. You can talk to it like to an LLM with descriptive and positional natural language. It sees 512 tokens and is smart enough to get even complex situations. If you prompt both, t5 always seems to be the one dominating (tho clip has an impact)

pic related is only possible cause of T5
>>
>>102333977
hate that account, I'd already set it to hide all content from him because his loras are terrible quality
>>
File: ComfyUI_33642_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>
>>102334025
I think maybe the SD3 paper has that kind of info.
>>
>this thread is dying while /sdg/ is thriving
What happened?
>>
>>102335720
/ldg/ anons are probably busy irl
>>
>>102335720
that's just how it is when nothing interesting is happening, /sdg/ is mostly shitposting so people who are bored go there. not the first time this has happened.
>>
>>102333977
gotta love how every environment with monetization will be plagued by exploits, leading to it's overall degradation
>>
>>102335720
I'm currently in a very important meeting, I'll post later
>>
>>102335720
>four avatarfags spamming the same images they've spammed for more than a year inside their not-Discord echochamber
>>
>>102335720
Preparing a few datasets for training, with LOTS of work including perspective crop and signature removal.
>>
>>102335720
i'm just waiting for the new pixart model
>>
File: ComfyUI_hgdf_00517_.png (1.2 MB, 1216x832)
1.2 MB
1.2 MB PNG
>>102334114
Can you utilize t5xxl for sdxl models?
>>
>>102335720
>What happened?
Quality not quantity.
>>
>>102335720
>What happened?
i farted
>>
>>102336560
no
>>
>>102336622
okay thanks, all I needed to know
>>
File: 0.jpg (527 KB, 1024x1024)
527 KB
527 KB JPG
>>
Should I invest in a Roth or get some hookers and weed?
>>
>>102336760
>>>/biz/
>>
File: ComfyUI_01369_.png (3.83 MB, 1920x1088)
3.83 MB
3.83 MB PNG
>>
File: 2Flux.jpg (211 KB, 1584x1064)
211 KB
211 KB JPG
>>102336795
Amazing
>>
>>102336795
My other space station.
>>
File: ComfyUI_hgdf_00523_.png (1.8 MB, 1344x768)
1.8 MB
1.8 MB PNG
>>102336795
If you don't mind me asking, what prompt did you use for this?
>>
File: 2024-09-11_00105_.png (1.03 MB, 1536x1024)
1.03 MB
1.03 MB PNG
>>
>>102332781
try bludgeon

The tagging got weird for weapons. Polearm was a nightmare for a bunch of people. As far as SD is concerned everything is an axe.

>>102332984
>Vampire Hunter D
I am thrilled and wondering what the hands look like.
>>
File: 2024-09-11_00114_.jpg (1.15 MB, 4608x3072)
1.15 MB
1.15 MB JPG
>>
File: ComfyUI_01370_.png (2.07 MB, 1024x1024)
2.07 MB
2.07 MB PNG
>>102337034
>This image is a highly detailed drawing in the style of alphonse mucha of an enormous, futuristic space station floating in the void of space. The station is a complex, multi-tiered structure with numerous decks, each adorned with various scientific equipment and machinery. The station is primarily constructed from metallic components, with a mix of sleek, angular shapes and more organic, curved sections, suggesting a blend of futuristic and industrial design. The station is connected to a large, circular ring structure that appears to be a docking platform or maintenance area, with numerous smaller, cylindrical modules attached to it. The ring is surrounded by a series of smaller, interconnected modules, some with their own docking ports. The entire station is illuminated by a variety of lights, casting a soft glow over the metallic surfaces and equipment. In the background, the vast expanse of space is visible, with a distant, illuminated planet or moon providing a soft blue light. The station is positioned in a stable orbit, with the sun's rays streaming through, casting dramatic shadows and highlights across the structure. The overall scene is a blend of realism and imagination, capturing the grandeur and complexity of space exploration.
>>
>>102337390
>alphonse mucha
the og goat
>>
>>102337390
is that pic suposed to be what ur describing? cuz it missed the mark by a fucking mile.
>>
File: ComfyUI_01363_.png (2.11 MB, 1024x1024)
2.11 MB
2.11 MB PNG
>>102337466
>is that pic suposed to be what ur describing?
no its the prompt for this one >>102336795
>>
File: _.gif (391 KB, 204x200)
391 KB
391 KB GIF
>>102336610
>>
File: 2024-09-11_00124_.png (2.26 MB, 1536x1024)
2.26 MB
2.26 MB PNG
>>102337390
nice prompt
>>
File: ComfyUI_01372_.png (2.05 MB, 1024x1024)
2.05 MB
2.05 MB PNG
>>
>>102337106
>>102337325
what loras are you using for these?
>>
>>102336795
nice
>>
File: Flux.1_00013_.png (1.53 MB, 1024x1024)
1.53 MB
1.53 MB PNG
>>102337390
thanks

>Flux Schnell Q4
>4 steps

It came out... ok, I guess.
>>
File: bateman.jpg (127 KB, 1280x720)
127 KB
127 KB JPG
>>102337833
I used flux dev, heun and 33 steps on a OC watercooled 4090 and joyccaption wrote the entire prompt from some random low resolution image I found on google.
>>
File: file.png (591 KB, 512x512)
591 KB
591 KB PNG
>>
File: Flux.1_00015_.png (1.96 MB, 1024x1024)
1.96 MB
1.96 MB PNG
>>102337904
>euler normal
>1060m 6GB

maybe I should try Flux dev Q4
>>
>>102337976
Hype.
>>
File: 00006-3953520884.jpg (227 KB, 1488x1176)
227 KB
227 KB JPG
I still can't get the gguf quant model to run in forge. I got all the files in all the right places aaaaaah
>>
>>
File: ComfyUI_Flux_12.jpg (3.59 MB, 3840x2160)
3.59 MB
3.59 MB JPG
>>102337390
>>
>https://x.com/MistralAI/status/1833758285167722836
new mistral multimodal (pic/text) just dropped. maybe it will save us image tagging bros from the hell we live in
>>
File: jap NOPE.gif (1.03 MB, 195x144)
1.03 MB
1.03 MB GIF
>>102338460
>>
>>102338460
lmao.cpp support never ever
>>
>>102337292
>I am thrilled and wondering what the hands look like.
zero problems so far, but it's pony so that's expected
>>
File: ComfyUI_01376_.png (1.91 MB, 1024x1024)
1.91 MB
1.91 MB PNG
>>102338390
neat
>>
File: 1995712-left_hand_1.jpg (13 KB, 400x300)
13 KB
13 KB JPG
>>102338644
to be clear, it handles this? Epic if true.
>>
File: 00084-4042615827.jpg (3.19 MB, 1728x1344)
3.19 MB
3.19 MB JPG
None of these people exist. Existed or will ever exist.
>>
>>102338737
Ah that. I don't think it works since I didn't include it in the dataset. Old pervert from Ninja Scroll has the same face btw.
>>
>>102338491
why do you say that? llama.cpp has supported every single mistral model so far
>>
File: ComfyUI_06419_.png (1.41 MB, 1024x1024)
1.41 MB
1.41 MB PNG
>>102338786
yeah I've used this lora aswell, pretty good one if you ask me
https://civitai.com/models/652699?modelVersionId=828456
>>
>>102338815
It is fucking frightening how far we came. I always was pretty good detecting AI pictures. It's over.
>>
>>102338792
inpainting for me then.

Keep up the quality work. 90s stuff of this type is very underrepresented.
>>
>>102338802
Multi-modal support in the server was removed months ago because it was a maintenance burden.
>>
>>102337390
>novel prompt
I don't know if I ever get used to that.
>>
>>102338887
do we even need llama.cpp in the first place? it's a 12b model, that can fit into a 24gb card
>>
https://github.com/sayakpaul/diffusers-torchao
>We provide end-to-end inference and experimental training recipes to use torchao with diffusers in this repo. We demonstrate 53.88% speedup on Flux.1-Dev* and 27.33% speedup on CogVideoX-5b when comparing compiled quantized models against their standard bf16 counterparts**.
>**The experiments were run on a single H100, 80 GB GPU. **The experiments were run on a single A100, 80 GB GPU. For a single H100, the speedup is 33.04%
kek
>>
>>102339140
Why aren't you just running InternVL then? I doubt a model that's released just as a torrent is any good.
>>
File: 2024-09-11_00126_.png (1.64 MB, 1536x1024)
1.64 MB
1.64 MB PNG
>>102337719
disgaea lora
https://civitai.com/models/709964?modelVersionId=794117
>>
>>102338786
me on the bottom after sniffing paint
>>
>>102339168
>I doubt a model that's released just as a torrent is any good.
do you know who MistralAI even is? it's one of the juggernaut with Meta on local LLMs
>>
>>102339296
I know who they are. If it was better than the competition they would tell you.
>>
>>102339327
desu the competition is kinda shit, if this 12b is better than joycaption we're winning desu
>>
File: 1716478497961552.png (1.11 MB, 3960x2378)
1.11 MB
1.11 MB PNG
>>102339557
Joy-what?
>>
>>102339584
it's not even on the graph comparison, do we know if it's the best for its size at all?
>>
>>102339603
we don't because it is a troll post. Evaluating a caption tool against general purpose is pointless. See the dataset support:
https://github.com/open-compass/opencompass?tab=readme-ov-file#-dataset-support

How much of this do you want in joycaption?
>>
File: 00022-2980123666.png (526 KB, 616x808)
526 KB
526 KB PNG
the power of flux
>>
>>102332654
hi does anyone know the closest local version of hailuoai? theres the lumia wrapper but that needs an api kek
>>
>>102339603
JoyCaption? Isn't that just Clip plus Llama 7B or any other LLM?
>>
Still no one one working on the ability to pool multiple GPUs VRAM for flux outside of forcing specific stuff to a GPU?
>>
>>102338460
is there a demo so that we can try this model somewhere?
>>
>>102337581
>>102337904
Patrick Bateman flux lora when?
>>
>>102339232
thanks
>>
How can I use AI art to not worsen global warming and not steal from artists?
>>
>>102339988
Use solar panels and who cares what happens to communists that don't actually contribute to society? Many of us picked less desirable careers and they decided to write poetry and leech on the actual productive members of society that keep their water clean, electricity running and internet fast.
>>
>>102339152
this shit will degrade quality right?
>>
>>102339988
by praying to God after every gen
>>
File: ComfyUI_temp_bfsga_00020_.png (1.1 MB, 1024x1024)
1.1 MB
1.1 MB PNG
>>102339988
You not making a positive contribution is not dependent on the cost of creation.
Inspire someone to solve global warming and your footprint doesn't matter anymore.
Also imitation is not stealing but the greatest form of admiration, use your own drawings as a base.
>>
File: file.png (2.5 MB, 2629x705)
2.5 MB
2.5 MB PNG
>>102339327
>If it was better than the competition they would tell you.
yeah, I finally saw the benchmarks, they aren't that good
https://cdn.xcancel.com/pic/orig/media%2FGXNx6jAbgAIPG2X.jpg
>>
>>102340264
oops, here's the link:
https://xcancel.com/Mascobot/status/1833934765130744256#m
>>
>>102338786
not bad desu
>>
>>102338786
Flux is so good at realistic hands, I wish it has the same level for anime, the disparity of quality is uncanny
>>
File: ComfyUI_01389_.png (1014 KB, 1024x1024)
1014 KB
1014 KB PNG
>>
>>102335720
Half a dozen mentally ill avatarfags jerking each other off, talking about their personal lives and using the thread as therapy isn't "thriving"
I'd rather have a slow moving thread than that, and I don't believe you if you say you wouldn't
>>
>>102340871
I think you're coping. It's to merge:
>>102337168
>>102337168
>>102337168
>>
go back
>>
>>102340883
I'd rather both of these threads die than be with you schizos.
>>
I feel there is a distinct difference between the threads
>>
>>102340744
Flux anime is decent it just has a single style unless you use a lora
>>
>>102340975
this
>>
>>102341010
would you like to elaborate?
>>
File: file.png (148 KB, 176x920)
148 KB
148 KB PNG
>>102341065
it is very confusing
>>
>>102341087
these are my posts, watch your mouth
>>
>>102341065
these threads are generally more chill, people chatting about models/loras/tech with some images. Other thread is like a daycare.
>>
>>102339988
Do only only care about things that other people complain about online?
>>
>>102341205
Do you only*
>>
>>102341065
This has become more of the 'tech' thread, and Sdg more of a post-pictures thread, even tho there are also pictures posted here
>>
space at large is a lil stagnant desu need flux finetune
>>
>>102341238
RTX Titan AI is going to be the small model game changer, I think we're going to see a lot of actual new base models
>>
File: 1711601288981322.png (1.03 MB, 1680x422)
1.03 MB
1.03 MB PNG
why do everyone keep making loras of the same people?
>>
>>102341321
because the average person is a dumb monkey chasing trends like a Hollywood producer
>>
>>102341321
these are the only people that matter tho
>>
>>102341371
this
>>
>The year is 2031
>The Nvidia RTX 7080 just released with 16GB
>The RTX 6070 has been re-released as the 7060 with 8GB GDDR7 instead of 12GB GDDR7X
>AMD is still faffing about with UDNA and releasing mid range so-so cards only
>AMD drivers are still broken, no GPGPU support
>Intel releases a new Arc card for $350 with 64GB vram, has excellent drivers, and excellent GPGPU support across the board in everything from day 1
>It's slow as fuck
>they refuse to make faster cards
>>
>>102341472
Unified memory is the future. You're basically doom posting about software 3D rendering saying CPUs will never keep up.
>>
>>102341536
>unified memory
Won't work. Only if the cpu and gpu is unified too. The memory needs to be close to the gpu with minimal traces (distance, resistance, inductance, capacitance). Even better if it's on-die memory (but not jewed)
>>
>>102341321
Sydney is stunning. You can always make any lora you want.
>>
>>102341641
lmao I don't give a shit, your doom posting is fucking stupid though
>>
>>102338815
>>102338845
> I STRONGLY recommend generating at 1344x1728 / 1440x1800 (gives the best results) or 1728x1344

Why?
>>
File: ComfyUI_33650_.png (2.16 MB, 1024x1024)
2.16 MB
2.16 MB PNG
>>
>>102339722
https://huggingface.co/THUDM/CogVideoX-5b
>>
File: 00199-2432111190.png (1.24 MB, 1152x896)
1.24 MB
1.24 MB PNG
Retard here, I downloaded another Flux model and it doesn't work on Forge and gives me this error message
>mat1 and mat2 shapes cannot be multiplied
What does this mean/how2fix
>>
File: Result_00192_.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
got the cura lora working
>>
File: ComfyUI_33651_.png (1.8 MB, 768x1280)
1.8 MB
1.8 MB PNG
>>
File: Result_00199_.png (1.35 MB, 1024x1024)
1.35 MB
1.35 MB PNG
>>102341907
>>
>>102341939
is this one you trained yourself?
>>
File: Result_00201_.png (1.23 MB, 1024x1024)
1.23 MB
1.23 MB PNG
>>102341940
>>
File: Result_00200_.png (1.34 MB, 1024x1024)
1.34 MB
1.34 MB PNG
cute
CUTE
>>
File: Result_00208_.png (1.47 MB, 1024x1024)
1.47 MB
1.47 MB PNG
got the cura lora working
>>
File: 00209-4124433749.png (1.33 MB, 1152x896)
1.33 MB
1.33 MB PNG
>>
File: 00210-3399697678.png (1.25 MB, 1152x896)
1.25 MB
1.25 MB PNG
>>
File: 00213-2429927710.png (1.27 MB, 1152x896)
1.27 MB
1.27 MB PNG
>>
File: ComfyUI_33653_.png (1.29 MB, 768x1280)
1.29 MB
1.29 MB PNG
>>102341945
Aye, I'm still testing it.
>>
File: 000000_17588_.png (2.62 MB, 1508x1032)
2.62 MB
2.62 MB PNG
>>102339878
https://huggingface.co/mistral-community/pixtral-12b-240910
>>102339988
>How can I use AI art to not worsen global warming and not steal from artists?
The Sun controls global warming, not WEF/UN/WHO etc. You're copying an Artists style, not stealing their work, fake Artists unable to local or use these tools are trippin' bananas being left behind, understandable.
>>
File: 00216-2113480698.png (1.36 MB, 1024x1024)
1.36 MB
1.36 MB PNG
>>
File: 0.jpg (204 KB, 1024x1024)
204 KB
204 KB JPG
>>
File: Result_00229_.png (1.3 MB, 1024x1024)
1.3 MB
1.3 MB PNG
>>
File: Result_00233_.png (1.32 MB, 1024x1024)
1.32 MB
1.32 MB PNG
>>102342105
whoops didn't notice the weird legs
>>
>>102341939
this is nice
>>
File: 00226-32319593.png (1.29 MB, 1024x1024)
1.29 MB
1.29 MB PNG
>>
>>102342027
Very nice, anon
>>
>>102341882
I find that both AIs X by X can not be multiplied is a image size issue. Check that you are using accepted SDXL image sizes

>>102342127
unironically nice hands/paws on this set.
>>
>>102341784
maybe he used a lot of training pictures with those resolutions so the model got used to it
>>
File: flux_t_4.jpg (339 KB, 688x1216)
339 KB
339 KB JPG
>>
File: 00043-3306641012.png (2.15 MB, 1344x1728)
2.15 MB
2.15 MB PNG
>>
>>102332776
Why do you wish to make them pregnant
>>
>>102342606
very valid question
>>
File: 0.jpg (182 KB, 1024x1024)
182 KB
182 KB JPG
>>
pony v7 WHEN
>>
>>102343364
never lmao
he's a one trick pony fumbling in the dark
>>
>>102343364
he's wasting his money ane time on the stinky AuraFlow, if he wants money so bad at least he could've worked on Flux Schnell
>>
File: ComfyUI_05803_.png (1.06 MB, 1024x1024)
1.06 MB
1.06 MB PNG
>>102343379
>he's a one trick pony
nice pun anon
>>
I simply do not desire furfag models
>>
>>102343364
I expect nothing from him anymore, he cucked his v6 finetune by removing the artist tags, fuck that horse fucker
>>
File: file.png (645 KB, 512x512)
645 KB
645 KB PNG
In a year we're going to see a crop of 2B models all from scratch.
>>
>>102343432
i pray
>>
>>102343446
If the Titan AI is what is rumored, absolutely. Just throw one in a PC and let it run for four months and you have a 2B model.
>>
Why does Auraflow claim to be open source when in fact it is not? I don't see the code for training anywhere online. No way to make your own little Auraflow model if you wanted. Pixart still is the only model that is actually open source.
>>
>>102343432
I hope not, instead of reinventing the wheel everytime, it would be better to simply finetune Flux
>>
>>102343454
>If the Titan AI is what is rumored, absolutely.
the Titan AI?
>>
File: skyscrapergirl.png (3.87 MB, 1536x1536)
3.87 MB
3.87 MB PNG
>>
>>102343572
At a certain point a finetune is just doing it all from scratch. Also someone could do pretraining on a 2B model on a very diverse dataset which people can finetune from saving a lot of time.
>>
>>102343583
Monster card one step above a 5090, 60-70% faster than a 4090 with 32 GB of VRAM. Should be good enough to train small models by itself.
>>
>>102343598
I don't want to sound pessimistic, but there's no way a 2b model will beat Flux, even with the best pretraining in the world, I hope I'll be wrong though
>>
>>102343612
it's made by Nvdia too? can you show some web link about that "rumor" that looks interesting
>>
>>102343622
https://www.tweaktown.com/news/99486/nvidias-next-gen-titan-ai-graphics-card-rumored-would-beat-the-geforce-rtx-5090/index.html
>>
>>102343625
the price will be insane though, something close to their 48gb card
>>
>>102343614
It wouldn't beat Flux but they'll be significantly better than SDXL and achieve most niche applications you want AI for.
>>
>>102343632
I think around $2500. Which is cheaper than the enterprise cards. The catch is it's going to be a behemoth and be 600w which will be deter enterprise users wanting to put them in a server.
>>
>>102343670
If it's true then it gets interesting, a high-power supply component isn't that expensive.
>>
>>102343684
Yeah, might be enough to full finetune Flux too (slowly). But that card should be enough to do your own SDXL Pony from your basement but I think we have a few better architectures (particularly Pixart) to work with. I'm hoping the Pixart team when they come out with their new model has made some more efficiency breakthroughs for training since they're the only ones who seem to care about limited training resources.
>>
File: ComfyUI_01392_.png (1.12 MB, 1024x1024)
1.12 MB
1.12 MB PNG
kek
>>
>>102343740
God i wish
>>
>>102343724
>But that card should be enough to do your own SDXL Pony from your basement but I think we have a few better architectures (particularly Pixart) to work with.
The biggest issue imo isn't the power to pretrain a model, it's the data quality, so far we got shit caption models to work with, the only decent one is GPTV, I don't know how many pictures you need to make the model learn about the most thing the humanity has to offer but it's probably in the tens of millions at least, you have to caption those well aswell
>>
>>102343757
My working theory is you need approximately 20 images per style, subject, concept. So maybe 2 million images for a pretty diverse model. It really doesn't take that long to caption that many images.
>>
File: ComfyUI_01394_.png (1.15 MB, 1024x1024)
1.15 MB
1.15 MB PNG
>>
>>102343820
>maybe 2 million images for a pretty diverse model. It really doesn't take that long to caption that many images.
the caption models we have so far will call "Will Smith" "a black man", that's why Flux is so bad at characters/celebrities, imo the only solution to this would be to scrap pictures with the informations/tags is they exist, that sounds like a lot of work desu
>>
>>102343867
>imo the only solution to this would be to scrap pictures with the informations/tags is they exist
Meta could do it with instagram pictures. They're all captioned afik. I don't think users would be happy about that though.
>>
>>102343933
>I don't think users would be happy about that though.
they don't have to worry about it, Meta will never make an image model kek
>>
>>102343950
I bet they have already experimented with making one.
>>
>>102343960
I have no doubt about it, but I think they're too scared to go into that territory, desu the chinks are starting to be getting ahead of the west with MiniMax, they don't give a fuck about copyright or celebrities fee-fees, they probably see AI as a great propaganda tool so they won't hesitate to make their shit good
>>
File: 0.jpg (202 KB, 1024x1024)
202 KB
202 KB JPG
>>
File: 0.jpg (88 KB, 1024x1024)
88 KB
88 KB JPG
>>
>>102343992
nice colors
>>
File: 0.jpg (278 KB, 1024x1024)
278 KB
278 KB JPG
>>102344035
>>
>>102343572
Flux has the same problems as the other models: great breadth but poor depth.

A smaller model that's good with specific concepts is much better than a large model that tries to do everything.
>>
>>102344216
that's the point, Flux is a big model, it can eat everything, it's way better than having small models that can do just one thing imo
>>
>>102344008
nice one, flux does a good job washing the image out in pleasant ways sometimes
>>
so which of these do I use to use loras from civitai, Surprised there's no easy to digest rentry guide for newfags like me
>>
>>102344239
>>102344216
I find even flux nf4 is great. I like some of the pony xl models but I find it next to impossible to get the same aesthetic art style I pulled out of darelite's fantasy mix (sd1.5)
>>
I swear back in early 2023 the civitai free supporter tier had a link code which worked in grabbing Lora previews and usages. Unless I'm tripping. It even said it was my current plan and linking was included, jews. Caved in and got sub as I wanted the convenience of it.
>>
File: ComfyUI_33660_.png (1.61 MB, 768x1024)
1.61 MB
1.61 MB PNG
A lora for artsy canvas paintings of depressed girls: https://mega.nz/folder/mtknTSxB#cGzjJnEqhEXfb_ddb6yxNQ ('rnatataki/rnatataki rough paintings' folder)
I probably captioned it wrong because it requires a long prompt prefix for the style to kick in fully.
This is a photograph of an abstract minimalist painting by rnatataki in a dark, moody style. The canvas texture is rough, with visible impasto brushstrokes and thick paint smudges.

Based on the analog paintings by this artist https://xcancel.com/rnatataki
I'm going to make a separate style lora for her digital illustrations later.
>>102046884
I guess it can!
>>
File: ComfyUI_33685_.png (1.31 MB, 768x1024)
1.31 MB
1.31 MB PNG
>>
File: ComfyUI_33701_.png (1.25 MB, 768x1024)
1.25 MB
1.25 MB PNG
>>
File: file.png (252 KB, 463x473)
252 KB
252 KB PNG
https://xcancel.com/AiCreatorS1881/status/1833720912241451308#m
https://huggingface.co/Kotajiro/anzu_flux
it's a lora for footfags or something? I don't speak ching chong
>>
>>102344789
Coherent hands are a good sign. Nice, anon.
>>
File: 00006-4197920096_cleanup.png (3.22 MB, 1280x1920)
3.22 MB
3.22 MB PNG
>>
>>102344686
Pretty sure every UI can utilize loras
>>
>>102344686
just get forge, that is as easy as it gets for this stuff
>>
File: 00050-637362250.png (2.67 MB, 1024x1440)
2.67 MB
2.67 MB PNG
>>
>>102345514
Would peck her.
>>
File: 00102-463325494.png (1.35 MB, 896x1152)
1.35 MB
1.35 MB PNG
>>102345543
who wouldn't?
>>
>>102345543
>Would peck her
>Woodpecker
Ha
>>
>>102345750
KINGFISHER!
>>
File: 00005-1203749284.png (3.88 MB, 1488x1488)
3.88 MB
3.88 MB PNG
>>
File: 1000006731.webm (692 KB, 1080x1080)
692 KB
692 KB WEBM
>>102342606
>>
do you guys post your art anywhere online? my twitter isnt growing at all
>>
>>102345898
there is a whole game for social media that is beyond the quality of what you post. So much is about consistency and just grinding it out. I just chill and have been asuka posting on twitter for almost 2 years and I have 1900 followers, but you see other dudes doing the same thing with 40k+. I get good engagement on my posts which is nice but if you really want to grow your account it is a grind.
>>
>>102344791
The readme doesn't even say what the lora does.
>>
File: red_valkyrie.png (1.88 MB, 1018x1018)
1.88 MB
1.88 MB PNG
>>
>>102345898
doing it for others is a trap we all fall into but few crawl out of
>>
>>102332890
peak mid pixiv aesthetic
>>
File: 00003-589983248.jpg (3.48 MB, 2192x2192)
3.48 MB
3.48 MB JPG
>>
>>102345898
Did you say it's ai art?
>>
Hibernation mode
>>
>claim: training works just as well in 512x512 as with 1024x1024 with no noticeable differences
>reality: same issues with duplicate bodies as with other text-to-image models when output resolution is too high
>>
File: 00019-2953476749 copy.png (185 KB, 340x484)
185 KB
185 KB PNG
>>
>>
>>102347477
cool style, very non-AI like
>>
>>102346811
neat
>>
>>102347458
This didnt need to be a greentext
>>
File: 1726050340.png (912 KB, 1024x1024)
912 KB
912 KB PNG
>>
>>
Boomerprompting has been doing bad things to me. To get the kind of stuff you want to see you have to speak the way the captions on that stuff would be. And so you start writing in this retarded porn-guy language half the time, other times normieisms, always deeply conventional language, because to be genuinely creative in manner of expression would be useless.

You can't just write in this conventional ugly style for two years without it staining you in some way. My thoughts are turning into dirty boomer uncle facebook posts. "Sweet thing got a pair of tits like some goddamn melons"

I am forced to participate in this ugly retard English that I hate. Out of a desire for pornography I debase my own tongue. Can I disown the things I write in that prompt window? What right do I have to pretend that's not me writing that? And it's just going to get worse the longer I keep at it.
>>
>>102348632
Didnt read
>>
>>
>>
>>102349018
He has been cursed by the boomer prompt, anon, he physically can't write any less than that. We may all be just as doomed.
>>
Anyone know of any good loras for generating stylized custom portraits/avatars for CRPGs?
Just need a sort of semi-realistic "drawn" look to it and iirc Flux sucks at doing anything artistic naturally.
>>
File: 1711499020597395.gif (2.91 MB, 500x502)
2.91 MB
2.91 MB GIF
I HATE BUTT CHINS
>I HATE BUTT CHINS
I HATE BUTT CHINS
>I HATE BUTT CHINS
>>
what was the thing that allowed for more diverse results? with ponyXL it feels like im using img2img even through its txt2img
>>
File: ComfyUI_33729_.png (487 KB, 768x1024)
487 KB
487 KB PNG
>>
qrd on pony and why people are still using sdxl when flux is out?
>>
>>102349726
>why people are still using sdxl
I'm still using 1.5, mostly dreamshaper kek
>>
File: ComfyUI_33732_.png (749 KB, 768x1024)
749 KB
749 KB PNG
>>
How do I got about making images 3d for VR?
>>
>>102349726
flux can't do porn and it's dubious if you can teach it to do porn without raping the natural language comprehension out of it, and even if you can, it will take months to years for a good finetune to appear
>>
File: 1719234697051720.png (154 KB, 449x265)
154 KB
154 KB PNG
why is Flux so OBSESSED with the "subtly open lips" look?
>>
>>102349784
total coomer death
>>
File: hero.png (1.02 MB, 896x1152)
1.02 MB
1.02 MB PNG
THIS THREAD NEEDS A HEEEROOO
>>
>>102349784
coomer + skill issue
>>
File: ComfyUI_33742_.png (944 KB, 1536x640)
944 KB
944 KB PNG
>>102344770
The digital drawing lora has been added to the rnatataki folder.
>>
>>102344758
How do you use the lora previews?
>>
42 days and no news.
https://blackforestlabs.ai/up-next/

No tensorrt for fluxd yet either
Is every dev on holiday or something.
>>
>>102350356
>No tensorrt for fluxd yet either
>Is every dev on holiday or something.
Isn't tensorRt a meme? like I've heard you can't put loras on top of it or some shit
>>
File: ComfyUI_33745_.png (1.01 MB, 1152x848)
1.01 MB
1.01 MB PNG
>>
>>102350381
my lora's for SDXL work fine in custom tensorrt models, it throws up an error in the console but the lora is applied correctly, 2 lora's, it shits the bed and produces black images.
iirc support was promised and there's a note on github about how flux should be bf16 but i didnt see the option to convert flux models.
I'm going to have another look.
>>
>ded general
>>
>>102350501
AI winter.
>>
>>102350501
at this point we're waiting for a great flux finetune, but it's not gonna happen so it's ova
>>
>>102350381
From an hour or so of reading it seems that it's in BFL hands if they want to make a BF16 version that tensorrt can convert as you need to know the unet architecture structure they used so you can do the conversion.
But really i know fuck all about such things and am going on what searches and gpt's explained.
Maybe that's not the problem, who knows, but the job isn't for me that's clear enough.
>>
File: ComfyUI_33757_.png (1.47 MB, 768x1280)
1.47 MB
1.47 MB PNG
>>
>2 days old thread
What the fuck bros
>>
>>102351090
nothing's happening dude, we're waiting for a flux finetune or for BFL video model, I swear to god if they manage to have the quality of MiniMax while working on a 24gb card (Q8_0 would suffice) then we'll be so fucking back
https://blackforestlabs.ai/up-next/
https://www.youtube.com/watch?v=JQbDyiYgNYw&
>>
>M2 Ultra Studio
>192GB unified cpu/gpu memory
>45% of the raw performance of the rtx4080
>ruhs on Metal which is substantially less efficient than cuda on top of the 0.45x perf
Ahh.. its always something. 64gb jewvidia when
>>
>>102351121
>64gb jewvidia when
never lol, Nvdia makes so much money out of their 10000 thousand dollar entreprise 48gb cards
>>
File: ComfyUI_33754_.png (2.1 MB, 1280x1024)
2.1 MB
2.1 MB PNG
>>102351090
It's over.
>>
>>102351090
I had my fun with Flux but I can't see myself generating only Miku for the rest of my life because it's the only relevant female anime character it knows, we could've so much fun by using funny prompts with celebrities but they decided to remove them out of fear so here we are
>>
>>102351090
Flux shills have left the thread
>>
>>102351168
everyone has left the thread, where's the Pixart/Hunyuan fans?
>>
>>102351191
>Pixart/Hunyuan shills
Also left the thread ages ago
>>
Flux is nice, can't wait for it to improve. Sdxl/pony is nice.. but it is incapable of a specific 3d realistic yet not realistic art style I can get out of a specific sd1.5 model
>>
>>102351202
So basically this place is filled with 0 real fans and only shills?
>>
>>102351210
Pretty much, these threads used to fill up within hours, now it's 2 days old
>>
>>102351223
I think school has something to do with that, it's september
https://www.youtube.com/watch?v=W5po7tFWf58
>>
>>102351090
The captcha is annoying
>>
>>102351090
this thread is focused on technical shit and news, and for the moment nothing is happening, simple as that, once there will be some good new shit (a new model, a new method that will improve stuff...) it will flurish again
>>
Adam mini wheeeen
>>
>>102351118
There is no evidence their model will be open, and even if it is, it would likely be prohibitively expensive / difficult to fine-tune and it would take forever to generate outputs.

Cogvideox 5b has been around for a while and there is no community around it despite being a 5b model. And even on a 3090, it takes a long while to generate outputs.
>>
>>102351395
>Cogvideox 5b has been around for a while and there is no community around it despite being a 5b model.
it's more because this model is a 2fps ass and that better model (like the BFL one) will come out soon, it's a repeat to Cascade -> SD3, everyone ignored Cascade because they thought SD3 would destroy that mf

>There is no evidence their model will be open
true, but I've seen some interviews and they were really fan of opensource, my bet is that they'll keep their best model as an API and will release something a little inferior, like we got with Flux pro vs dev
>>
>flux, prompting for:
>facing the viewer
>but her hands are tied up behind her back
I've tried many variations of this (tied, bondage, hidden behind, obscured, not visible, out of sight, et c) and the hands are either visible from the front - or the tied hands are shown
lmao
>>
>>102351513
have you tried CFGmaxxing? helps with prompt adherance
>>
does BREAK work in comfyui? having trouble finding a clear answer.
>>
>>102351653
isn't BREAK a cope thing that was used because clip could only handle 77 tokens? now we have 512 with t5 we definitely don't need that anymore right?
>>
Why the fuck dont you merge with /sdg/ again? Both threads are graveyards and with only one thread it would be slightly less dead
>>
>>102351133
Until the power and versitility of current GPU's is integrated into CPU's there will always be a demand for what we know of today as the GPU, as this process of integration is barely coming to market now, 1st gen AI&cpu computers they are going to have to keep adding VRAM to each new generation of cards.
64GB is a possibility for anons and localgenners but i doubt it'll be in 2025.
So yeah, you're right for now, but not forever.
>>
>>102351731
I'll check with the council...
they said "nah"
not like anyone's stopping you from making a new general, but not like it's going to be successful
>>
>>102351673
No it separates prompts. So if you wanted like a pirate and a cat but you had a lot of tokens each to avoid blead you you do the pirate and all the tokens to describe that and the BREAK and then do the cat and all the tokens to describe that and it would be better at genning them both. Its hard to describe non-visually
>>
File: SDXL20242.jpg (209 KB, 1256x1256)
209 KB
209 KB JPG
>/sdg/ and /ldg/ are both totally desolate as soon as school starts up. Its a weird feeling
>>
File: 1708502644526309.png (1007 KB, 1024x1024)
1007 KB
1007 KB PNG
>>
File: 1726122810620321.webm (1.01 MB, 1280x720)
1.01 MB
1.01 MB WEBM
>>102351090
Flux got too much blueballed by not being able to render celebrities, the chinks know where the fun is, piercel is MiniMax
>>
File: 1713910700774293.png (1.09 MB, 1024x1024)
1.09 MB
1.09 MB PNG
>>
>>102348632
Have you tried using an LLM or are you a retard?
>>
>>102351834
I'm a retard
>>
>>102351513
one problem is that it's a negative prompt (ie do not show hands, it shows hands)
>>
>>102351529
thanks. I didn't try that, it's actually not the base flux, but I changed model version and some other stuff, and that fixed it.
I had a testing setup active that made it not adhere to prompts as much.
>>
File: 1705004763126873.png (1.18 MB, 1024x1024)
1.18 MB
1.18 MB PNG
>>
Some new bread, delivered right to your doorstep...
>>102351868
>>102351868
>>102351868
>>
>>102351848
Don't use negative prompts in Flux
>>
>>102336560
There was something called ComfyUI-ELLA that attempted to use both clip and some LLM model (I don't remember if it was SD1.5 or SDXL, I think it was SD1.5). It provided better results for some prompts, but wasn't too amazing.
It could also use only the LLM, but then any finetunes that also trained the CLIP wouldn't work properly, because the LLM wouldn't recognize the finetune-specific concepts like anime characters and stuff.
>>
File: file.png (2 KB, 146x48)
2 KB
2 KB PNG
>>102351342
NEVER HAHAHAHAHAA
>>
>>102351848
I know, hence the variations on "hidden", "obscured", et c
feel free to let me know how you would formulate it
>>
>>102337390
Is it really necessary for the prompt to be this long? What if the description of what I want is short, and just requires some positioning and pose understanding that CLIP can't manage?
>>
>>102351898
i would look for a term that describes an idea that is equivalent to the negative prompt, if possible. like "bound to a post/stake behind, facing camera"
>>
>>102352122
yeah, me too.
>>
>>102346890
ya
>>
>>102351834
using an LLM to prompt is literal Indian retard tier shit
>>
>>102351736
Mac is the option just at a high cost and slow speed for that cost, not worth it



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.