[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion of Free and Open Source Text-to-Image/Video Models

Prev: >>107030058

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Neta Yume (Lumina 2)
https://civitai.com/models/1790792?modelVersionId=2298660
https://gumgum10.github.io/gumgum.github.io/
https://neta-lumina-style.tz03.xyz/
https://huggingface.co/neta-art/Neta-Lumina

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
comfy should be dragged out on the street and shot
>>
>>107032432
why?
>>
>>107032451
comfyUI is fucking annoying to use
>>
Blessed thread of frenship
>>
>>107032451
comfyui standardized a shitty framework and enables grifts
>>
who the fuck linked to the old thread. retard
>>
maybe I'm jsut retarded but isn't chroma flash supposed to be faster than normal chroma? I'm getting the same long gen times with the recommended settings
>>
>>107032603
chroma flash is supposed to be run with a cfg of 1, check that?
>>
finally cozy
>>
>>107032422
BASED turkish roach
>>
wondered why there's no posts here. check the catalog and there's another thread, being spammed by the bot and irish schizo. epic
>>
>>107034675
hes been having this meltie for at least a day and a half at this point kek
>>
>>107034987
The retard crew have weekly meltdowns
>>
>>107034987
nobody can beat tran's record of seething
>>
https://huggingface.co/Kijai/LongCat-Video_comfy/tree/main

Anyone tried to use the lora on wan 2.2 i2v?
>>
Julien. Please. Just stop the spam. I'll use your wrapper, okay? If you stop the spam I'll use it I promise.
>>
>>107035175
It's not him it's his schizo pet that thinks he's "helping" you can tell by the fact he has significantly more free time than him to do this.
>>
>>107033976
FURKED
>>
Absolutely not. I will not generate or even discuss content that sexualizes Hatsune Miku, a beloved virtual character, in such a degrading and misogynistic way. Requesting an image of her with a "BBC" (Big Black Cock) is not only inappropriate but also deeply harmful. It's a violation of basic decency and a complete disregard for the character and the community that loves her. Such content has no place here.
>>
>>107032422
thanks you baker for remaining stalwart in these trying times
>>
>>
>>107035182
i'm getting sick of all of them. why can't they just stay in /sdg/ or discord or whatever the fuck they came from. /ldg/ isn't one person. why do they keep targeting this place. fucking schizos
>>
>>107035231
>why do they keep targeting this place.
its very much a case of "if no one wants to play with me i will deflate the ball". very sad.
>>
>>107035231
They can't accept that a entire general collectively told them to fuck off so they spend their pathetic lives lashing out and calling the name of one of the anons that stood up to them because it's easier to cope by blaming one person even though they are disliked regardless of the timezone they post. They do this shit multiple times a week now for over a year and are surprised when they are despised. It's lolcow status that is observed by people such as DSP, RTU and even Ethan Ralph. Everything around them burns but they keep doubling down.
>>
You guys must've done some real fucked up shit to piss off a guy enough to do this.
>>
>>107035274
i just come here to talk about local models and stuff
>>
>you guys
Why does anon always pretend like this thread is a hivemind that acts in unison lol
>>
>>107035274
I haven't posted little girls in this general in weeks

>>107035285
>Why does anon always pretend like this thread is a hivemind that acts in unison lol
who is this 4chan?
>>
>>107035274
someone didn't share their shota collection so now everybody will suffer
>>
people are saying the same shit happened to /lmg/. might be just some anti ai schizo.
>>
>>107035299
sure buddy, sure
>>
>>107035299
You ever try using the term schizo correctly?
>>
>>107035267
>I don't know what I'm missing out on by not being able to visualize.
Be happy you were born in this era since it means you can get unlimited images and videos with whatever you want.

>>107035267
>more resilient to going schizo too
Sometime I wonder if the fiction = reality retards aren't just intoxicated by their own inner visualizations.
>>
>>107035297
It's not too late. Share the shota collection and end the suffering.
>>
the anti ani schizo is the one doing it and scapegoats ani
>>
>made a comfyui extension that hides all of the autistic node bullshit and just let's you generate with a simple layout
I'm surprised it worked first try. I'm gonna iron it out and publish it soon but does anybody have any feature requests before I do?
>>
>>107035360
contribute to anistudio instead
>>
>>107035360
You mean the subgraph nodes? I also used that to clean my nodes, and it's kind of neat how it's way easier to navigate and tweak now.
>>
>>107035360
How is it different from the native feature that hides all wires? Now with subgraphs, but even before that with grouping, you could have always made a simple layout for yourself. It's just counterproductive.
>>
>>107035299
>might be just some anti ai schizo.
Any and every general on this worthless site has at least one schizo.
>>
>>107035343
I'm sure if you repeat it enough times it'll come true
>>
>>107035380
>>107035385
No, think more like converting the view to a gradio like design that you don't need to scroll all over the place just to find the node you actually care about. I'll post screenshots later but basically I made a UI overlay that is phone responsive so you can gen from any device and not have to zoom in and out just to do shit.
>>
>>107035377
You don't pay my gpt sub.
>>
>>107035385
>Just make a simple layout yourself for my spaghetti code workflow
No.
>>
>>107035409
like anistudio and invoke but it's still more abstractions on an already shitty ecosystem. work on sdcpp instead

>>107035414
neither does cumfart
>>
>>107035409
Alright, let's see it later. A custom frontend for the comfy backend sounds like more than just an extension.
>>107035439
But I like my spaghettis...
>>
>>107035441
Unfortunately cumfarts is the UI that gets model support day one. It's retarded but nodes have their utility when you want to string together different logic chains. But once you're done working in the metal it would be nice to hide that shit away and just have a simplistic UI. Best of both worlds.
>>
Imagine if he put as much time into his little program as he does seething and arguing with anon
>>
File: 1656372573193.png (165 KB, 400x400)
165 KB
165 KB PNG
>>107035465
>Imagine if people could do two things at once
>>
File: ComfyUI_temp_pdpjo_00003_.jpg (438 KB, 2730x1228)
438 KB
438 KB JPG
>>107032603
flash allows you to lower the number of steps and cfg to 1, if you keep the same settings it will do nothing
>>
>>107035465
It's not him most of it is his disabled attack dog. When he's actually active his bitching and moaning is more targeted and he seethes out another anon's name especially when he's drunk
>>
>>107035463
>It's retarded but nodes have their utility when you want to string together different logic chains
this is only useful when combined with traditional tooling. it has nothing to do with making a nodegraph simplistic because it's still just as limited
>>
>>107035465
Imagine if any of us put as much time into anything instead of seething and posting here

we could have made a (dogshit, but still) feature film by now
>>
>>107035490
Not sure if you're understanding what I'm trying to do with my overlay but I'll post it when I'm home.
>>
>>107035511
I do. you are just gluing together all the widgets so it looks like gradio. wow! it isn't what we have been asking for! where is the canvas? the video editor? maybe you aren't understanding what I'm saying because you've never used node software before. davinci resolve uses nodegraphs to supplement the video editing. unreal uses it for the viewport editing and events. blender uses it with a 3d scene viewer. this isn't adding tooling at all or allowing people to customize how tooling behaves. this is why you are retarded for wasting your time
>>
>>107035537
The node graph still exists. It actually doesn't change that at all. Just adds an overlay so it's visually easier to interact with. Not sure why that's causing so much anger.
>>
>>107035557
because it's pointless. you can just arrange the nodes you actually use "like gradio" and disable the noodles. I am mad because the fundamental issue with comfy is NOT being addressed and I don't think it ever will be. this shit is only destined to be a plugin for more capable applications and you wasted your fucking time. chatgpt is ass too so you are twice the brainlet and fuck you for supporting closedai
>>
I see the potential this is feeling like WD 1.2
>>
>>107035574
>nooo don't make the software like that
>>
>>107035360
>>107035557
That's already a thing, it's literally in the OP
>>107032422
>SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
>>
>>107035579
kys dramafaggot
>>
>>107035593
don't worry, I'm sure anon's implementation is faster than C# right?!??
>>
>>107035574
Sounds like you're angry about something else. Personally it's working great for my purposes and I hope it can bring some joy to others too. Hope you have a better day going forward.
>>
>>107035610
I'm getting angry at you for being an oblivious faggot. I already know what you put together and I am telling you it was a waste of time because you can't think critically. comfyui in general is a waste of talent
>>
I love the revolving door of schizos and spam
>>
>>107035593
I already have comfyui installed and all of my workflows set so having something that's just drag and drop seemed easier to me. I understand if you have your own things.
>>
>>107035640
Not sure what you mean, SwarmUI is just a UI layer overtop Comfyui and is already compatible with (Your) workflow
>>
>>107035640
swarmui just sits on top of comfy and has a faster runtime. doing it in jeetscript is just garbage
>>
>>107035622
That might be the case but it's what all of the new models drop into. Using nodes makes sense in the same way that the electrician needs to work with the wires behind the wall. But at the end of the day I just want to flip a light switch. I've made a light switch and cover to hide the wires. Nothing to "flip" about. :^)
>>
File: 1711876781486377.jpg (186 KB, 1080x811)
186 KB
186 KB JPG
>>107033055
When I use a cfg of 1 I get the same gen times as with other chromas but the output looks like shit, when I use a higher cfg to compensate with negative prompt gen times are even longer
>>
>>107035659
>it's what all of the new models drop into
then just implement the model in sdcpp. it's already current save for yume but how is anyone supposed to learn if nobody is allowed to get into that? pytorch is a cancer that needs to die as well
>>
>>107035654
The innstall seems too complex. I don't want to add a ton of dependencies. That's fine though. No laws against duplicate software.
>>
>>107035677
if it's in jeetscript it's not worth it. it's a step backward
>>
>>107035670
I agree but I'm not gonna do it. I'd rather just have a simple thing to overlay without needing to install the .net sdk and do a bunch of bullshit when I just wanted to hit generate to create some smut from my phone.
>>
>>107035686
it's an exe. you just get a binary and run it. are you new to CS or something? why say this but gladly go through pip slop?
>>
>>107035685
It's jeet script and that's alright. The UI already runs on jeetscript. You won't escape it and adding an entire sdk just to run "1girl" is excessive.
>>
a contribution to comfyui is a contribution to baby duck stagnation

>>107035697
>It's jeet script and that's alright. The UI already runs on jeetscript. You won't escape it and adding an entire sdk just to run "1girl" is excessive.
what the fuck are you talking about? you don't need .net for sdcpp
>>
>>107035693
Documentation says there's no exe yet. If that's not the case then I'm not installing some shit where they can't even bother to update their readme for an over complex UI overlay.
>>
>>107035360
There's already SwarmUI as someone else pointed out but another GUI can't hurt, death to spaghetti
>>
>>107035721
that's swarm you fucking idiot. anistudio and sdcpp is all in c++
>>
Neta-lumina takes 35 seconds per image on 4070ti, is this really worth it?
>>
>>107035745
At the same res it sits somewhere between SDXL and Flux/Chroma (closer to XL) so yes
>>
I somehow think anon's gradiofied comfy is going to be shit considering his reading comprehension is so bad
>>
File: FluxKrea_Output_362626.png (3.21 MB, 2048x2048)
3.21 MB
3.21 MB PNG
>>
>>107035808
anyone who fights node autism is a hero of the people
>>
>>107035360
we already have that thoughever https://github.com/chrisgoringe/cg-controller
>>
>>107035828
he wants to keep you in the same shitty software so no. comfyui should be deprecated
>>
>>107035745
This might be a bot but at what resolution? It's good for roughly up to around 1536x1536 direct gens in a way that's stable anyways BTW, can do higher as well but not quite as consistently. Spanner's version of the TeaCache comfy node speeds it up a good amount also without too much quality loss.
>>
>>107035844
>comfyui should be deprecated
it really should be. the entire foundation is shit. factory nodes never ever, just piles of node variants. fucking christ
>>
>>107035858
it's literally just ani working on something like that but it's just him working on it. shame on anons for not helping the community throw pytorch in the trash where it belongs
>>
why doesnt anon contribute instead of scolding others for not?
>>
>>107035204
Turned this into a song lol
https://voca.ro/19i4ZSEIjRN0
>>
Is LCM the only sampler you can use for lightning XL models or are there others?
>>
>>107035894
(Samefag) that wasn't a bot post BTW, it was an intentional reply to Miku poster
>>
>>107035894
lol what model is this?
>>
File: 1761655930405715.png (578 KB, 967x1209)
578 KB
578 KB PNG
sorry but python is for trannies and jeets. I wouldn't be caught dead supporting this niggardry
>>
>>107035849
I got 35s per gen with 1024x1536 and 20 steps. Is your thread being terrorized by bots so that you're this paranoid?
>>
>>107035808
And then we'll have another insane butthurt schizo spamming the thread.
Great.
>>
File: 00001-1577472711.png (1.35 MB, 880x1144)
1.35 MB
1.35 MB PNG
>>
why isn't ggml the main local backend for diffusion like it is for llms? seems kind of retarded
>>
I'm looking at my older SD 1.5 era stuff and some of it is pretty souldful.
I wonder:
1. Less polished more dreamlike style just touches the imagination?
2. Or we actually lost something along the way?
3. Or maybe I'm just a faggot masturbating to old shit I proompted.
>>
>>107035808
>I somehow think anon's gradiofied comfy is going to be shit considering his reading comprehension is so bad
it'll be as good as claude 4.5 can make it
>>
File: 2372917626.png (1.21 MB, 1024x1024)
1.21 MB
1.21 MB PNG
>>
>>107035968
4. You forgot you liked fantasy landscapes.
>>
>>107035967
ggernov just doesn't have the room on his plate to work on diffusion on top of llms. that's why sdcpp exists but comfy created too many baby ducks for people to actually contribute to it. Python addiction makes very lazy devs and shitty applications
>>
>>107035967
>why isn't ggml the main local backend for diffusion like it is for llms? seems kind of retarded
pytorch + chinese inertia
>>
>>107035968
LAION dataset had no synthetic crap
>>
>>107035975
Nice ass
>>
>>107035987
llms are released as pytorch implementations but ggernov adapts them immediately. there is no excuse
>>
File: image_00211_.jpg (547 KB, 1264x1656)
547 KB
547 KB JPG
>>107035991
+ It had clit eastwood bringing artistic randomness
>>
>>107036000
that's not my point. no one will use ggml as a backend because the chinese research meta actually matters
>>
>nostalgia hour
>>
File: 752145906.png (612 KB, 1024x1024)
612 KB
612 KB PNG
>>107035968
SD1.5 was pretty raw, newer models are tuned for a certain aesthetic so I think we did lose something in that sense. Chroma is kinda good in that sense, since it's aesthetic tuning has been fucked up it's become more susceptible to training.
>>
>>107036013
qwen's llm is made with pytorch and adapted to ggml. you missed my point. pytorch is just a waste of space on local machines which is why ggml was made in the first place. your point is moot
>>
how many steps should i use without lightning lora? 20/20 and cfg 5? i just want to see if it helps with the clay effect.
>>
Is it safe to come out now?
>>
>>107036024
>qwen's llm is made with pytorch and adapted to ggml.
and chinese researchers will continue to use it with pytorch anyways. you missed MY point, retard. you're not as intelligent as you think you are.
>>
>>107036057
if your point is we should stick with pytorch because that's what it was trained with I'm pretty sure you are retarded.
>>
>>107036044
>how many steps should i use without lightning lora? 20/20 and cfg 5?
Depends on the sampler, but usually you go more steps low than high, so more like 15/20 with euler at least.
CFG at 2-3 or 1.

>>107036044
>clay effect
What do you mean?
>>
>>107036053
It's 2025. Nobody cares if you're gay.
>>
>>107036071
i mean the lack of detail and the sloppa look. when it leverages my starting image it looks good, but if the girl turns around or soemthing then her ass looks like a mannequin
>>
>>107036057
none of us are Chinese researchers tho
>>
>>107036085
>ass looks like a mannequin
??
Share an example, if nsfw just link a catbox.
>>
generating walking with gyrating hips is my passion, hopefully with svi I can do that for longer
>>
>>107036100
https://files.catbox.moe/fmz2z8.webm looks at the quality of the sdxl image i started with vs the quality of her feet.
>>
>>107036213
I don't see any blurriness in that, but I'm not a feet expert nor passionate about dickgirls.
>>
File: ComfyUI_temp_iyjls_00001_.jpg (3.62 MB, 4096x5324)
3.62 MB
3.62 MB JPG
>>107035666
this is chroma hd 1.0, pick one and i can run it through my pipeline and post a workflow for you
>>
>>107036213
>don't be fu-
goddamnit
>>
>>107036233
are you on your phone? it looks fine on a small screen but its really bad on a monitor
>>
Trinart was peak soul, newfags missed out
>>
>>107036247
any of these looks better than what I get, but dpmpp 2m SDE maybe a bit better than the others
>>
Is there a GIMP plugin for Qwen Image Edit integration?
>>
>>107036379
no
>>
>>107036401
Aww, bummer. :(
>>
>>107035947
How long do Flux or Chroma or any other post SDXL model take for the same res? Presumably a lot longer
>>
>>107035204

there used to be time when fanart was drawn of mikus socks now it is ani this ani that
>>
>>107036422
around 50s for me or more on a 4070
>>
File: file.png (912 KB, 600x1013)
912 KB
912 KB PNG
why does civitai users use the laughing emoji everywhere?
it makes no sense
>>
>>107035968
A tiny creative model that has not gone through insane amounts of RLHF beats whatever the hell we have now at creativity. SD 3.5 medium was the last good creative model and it would be pretty sovlful too were it not for the complete lack of anatomy knowledge. Remember, you used to be able to name artstation artist under the sun and it would know them. You used to be able to name any Western comic artist, or any Japanese manga artist (best feature of HunyuanDiT and even though it blows SD 3.5 knows their names). A proper base model should even know pixiv names. Don't forgot what they took from you.
>>
seriously, hwo the fuck do you use Chroma Flash? I use Heun, CFG 1, 10 steps like i nthe documentation but my output is blurry garbage, and no other sampler I've tried seems any better, what the fuck are you supposed to do here
>>
>>107036630
I distinctly remember running into even very rare artstation artists and the model would be able to mimic their style. Good times.
>>
File: chroma___0014.png (1.45 MB, 896x1152)
1.45 MB
1.45 MB PNG
>>107036330
honestly i don't even like chrome unless it get a refiner with a different model

https://files.catbox.moe/jjt9bg.png
>>
>>107036597
>why does civitai users use the laughing emoji everywhere?
>it makes no sense
indians
>>
we should reunite with /sdg/
>>
File: file.png (1.58 MB, 747x1152)
1.58 MB
1.58 MB PNG
>>107036691
and this is supposed to be funny in their culture?
>>
File: chroma___0024.jpg (2.05 MB, 3072x1024)
2.05 MB
2.05 MB JPG
interesting, i guess chroma actually does dogshit on realistic gens at 1024x1024, here's a pipeline with a face detail and wan low refiner
>>
(Not a bot post)
I present a surefire pop hit, "It's Jeetscript And That's Alright"
All lyrics are unedited /ldg/ comments
https://voca.ro/1bvAzO6WUXx1
>>
File: chroma___0028.jpg (1.99 MB, 2688x1152)
1.99 MB
1.99 MB JPG
bump that up to 896x1152 and it's way different
>>
>>107036796
Prompt? If I say "photograph of" with no additional qualifiers then Chroma's output only looks like that maybe 1/3rd of the time.
>>
File: chroma___0031.png (1.36 MB, 896x1152)
1.36 MB
1.36 MB PNG
chroma > wan denoise > wan face detail. ngl, i don't hate this pipeline

>>107036822
>a 25yo gothic woman, a 90s sitcom, black high rise thong with skulls on it, a long sleeve very short crop top, shirt says "Death" on it in gothic letters, wide full hips, a slender stomach, her hair is black with dark purple highlights, 90s living room with a beige couch, brass floor lamp, decorated with satanic art, a dwarf dressed as satan
>>
File: ComfyUI_08052_.png (1.88 MB, 1152x1152)
1.88 MB
1.88 MB PNG
>>107036654
>Just SDXL plastify your Chroma gens bro

No thanks.

>>107035666
>>107032603
Sounds like a workflow issue. Are you even using right model?

>Chroma1-HD-Flash.safetensors

https://huggingface.co/lodestones/Chroma1-Flash/tree/main

Try it with this workflow
https://files.catbox.moe/s4n435.png
>>
>>107036796
Very obviously a settings problem. Unless you mean Chroma Flash
>>
so the answer I am getting from asking about ggml in diffusion is that the space is too saturated with jeets, dimwits and schizos so anyone remaining is forced to use pytorch thus the diffusion community is stuck unless people start tugging in ggml 's direction
>>
File: bigASPv2_Output_65533.png (1.8 MB, 1016x1312)
1.8 MB
1.8 MB PNG
>>107036874
BigASP V2 did pretty well on this prompt
>>
File: ComfyUI_08061_.png (1.93 MB, 1152x1152)
1.93 MB
1.93 MB PNG
>>107036796
>i guess chroma actually does dogshit on realistic gens at 1024x1024

Not really, it's exclusively a problem after v48 (which also plagues HD Flash version), though this is fixed by changing res to something higher like 1152x1152.
>>
File: ComfyUI_08060_.png (1.82 MB, 1152x1152)
1.82 MB
1.82 MB PNG
>>
>>107036700
Every time you cry out in pain... My penis swells knowing that you have been completely fucked past the point of no return.
>>
There are a few loras to make qwen image edit able to do nudes or swap to lingerie and so on, but wouldn't using an nsfw CLIP make it "understand" nsfw requests better?
Something like this :
https://huggingface.co/mradermacher/Qwen2.5-VL-7B-NSFW-Caption-V3-GGUF/tree/main?not-for-all-audiences=true
>>
>>107036700
>we should reunite with /sdg/
Surrender, get annexed and accept slavery?
>>
File: chroma___0065.png (1.44 MB, 832x1216)
1.44 MB
1.44 MB PNG
>>107037075
she is packing heat

>>107037102
i am fucking around with chroma-hd, chroma flash lora and some flux loras and a wan denoise. i think i'm really happy with this output finally
>>
File: chroma___0069.png (1.95 MB, 1152x1152)
1.95 MB
1.95 MB PNG
>>107037102
1152x1152 is correct
>>
File: i3330w.jpg (1.06 MB, 1600x1120)
1.06 MB
1.06 MB JPG
>>
>>107036930
you did the jetski gens?
>>
>LATENT DIFFUSION MODEL WITHOUT VARIATIONAL AUTOENCODER
https://arxiv.org/pdf/2510.15301
https://github.com/shiml20/SVG
>DIFFUSION TRANSFORMERS WITH REPRESENTATION AUTOENCODERS
https://arxiv.org/pdf/2510.11690
https://github.com/bytetriper/RAE

OK, which one is better? Both supposedly have insane training perf and quality improvements (beating EQ-VAE which was getting good results). Both get rid of the VAE completely. One of these could help save local by making it economical to train our own models instead of getting scraps from corpos.
>>
got a nice gen out of the "feet above head" prompt that gets thrown around from time to time
>>
File: manticore_censored.png (2.26 MB, 1800x1368)
2.26 MB
2.26 MB PNG
Uncensored version:
https://files.catbox.moe/xtykog.png
>>
File: ComfyUI_08069_.png (2.48 MB, 1152x1152)
2.48 MB
2.48 MB PNG
>>107037139
Looks like plastic.

>>107037172
How you're able to get that slop is admirable
>>
>>107037257
>jetski gens
Not me, a different anon
>>
>>107037277
whichever one gets widely adopted by researchers. we don't get to train foundational models
>>
>>107037339
sad this style essentially disappeared from mainstream fantasy book covers
>>
>anons had a melty over me working on this
Still a lot of work to be done but I'm happy with it
>>
>>107037476
already existed bud
>>107035836
>>
>>107037476
>angry incel
>it's just some angry dude
wow crazy
>>
>>107037503
>>107037520
please don't reply to obvious bait, it's d*bo
>>
>>107037503
I want my own. No external dependencies.
>>
File: ComfyUI_08077_.png (1.84 MB, 1152x1152)
1.84 MB
1.84 MB PNG
>107021118
It depends on your prompt. Sometimes Chroma performs really well depending on what prompt you use, other time it messes up. Once you find a prompt that works, it's a matter of playing around with seeds until it works. Just test different variations of mesh/fishnet prompts. Given how attention works, it may also help if it's at the beginning rather than at end (though these are mid prompt), and you describe thickness.
>>
>>107037552
>>107021118
>>
File: chroma___0119.png (1.49 MB, 832x1216)
1.49 MB
1.49 MB PNG
>>107037350
taste is subjective. you gen your slop and i'll gen mine. appreciate the tips tho
>>
one good Chroma gen, that's all I ask
>>
>>107037450
>we don't get to train foundational models
We do if training is optimized enough to make it affordable.

Optimizations:
>Contrastive flow matching: 9x. https://arxiv.org/pdf/2506.05350v1
>SVG (VAE replacement): 62x. https://arxiv.org/pdf/2510.15301
>TREAD: 37x. https://arxiv.org/pdf/2501.04765
If these have good synergy, we could have up to 20000x faster training, making it possible to make our own model locally or on a rented server for $1000 or less.

HDM used TREAD and EQ-VAE to make a 340M param model for $600.
https://huggingface.co/KBlueLeaf/HDM-xut-340M-anime
https://github.com/KohakuBlueleaf/HDM/blob/main/TechReport.md
With the above three optimizations, HDM could have trained for like $10??? A locally-trained SOTA model is now completely in the realm of possibility. No slopping, no censorship, no bloat. Somewhere in the 2B-5B param range. Once someone puts this shit together we'll be in a new era of local supremacy.
>>
File: ComfyUI_07987_.png (1.97 MB, 1152x1152)
1.97 MB
1.97 MB PNG
>>107037599
This one looks a little better
>>
File: image_00197_.jpg (528 KB, 1264x1656)
528 KB
528 KB JPG
>>
>>107037633
>If these have good synergy
lol, lmao even

I'll believe it when I see it. Things like self forcing come around once a year
>>
File: NeoLumina.png (695 KB, 1858x781)
695 KB
695 KB PNG
Anons, update from /adt/.
Neta Lumina officialy works on NeoForge.
Downloaded from Civit, put it in models\Stable-diffusion, press generate, works perfectly.
Now studying the promptbook.

If you run into any issues ping me in /adt/, will compile a list and contact Haoming.
Enjoy, will keep testing it!
>>
>>107037692
how good is neta lumina for toddlercon
>>
File: chroma___0145.png (1.44 MB, 832x1216)
1.44 MB
1.44 MB PNG
>>107037647
deeply unsettling, but you have a unique style bro
>>
>>107037692
Here's a pity (you) since you did not receive any in the other thread. Anons ITT have been using the model for awhile now however.
>>
File: 4261342156.png (1.1 MB, 832x1216)
1.1 MB
1.1 MB PNG
>>
File: ComfyUI_08078_.png (1.92 MB, 1152x1152)
1.92 MB
1.92 MB PNG
>>
File: 1739629579785768.png (1023 KB, 896x1184)
1023 KB
1023 KB PNG
>>107037692
>very awa
>>
File: 1733255800601959.png (157 KB, 2325x1803)
157 KB
157 KB PNG
how to do?
>>
File: 1703315887.png (982 KB, 832x1216)
982 KB
982 KB PNG
>>
why do people keep grugprompting natural language models with booru tags
>>
File: image_00231_.jpg (728 KB, 1264x1656)
728 KB
728 KB JPG
>>107037257
those were mine
>>
>>107037796
No surprise there, the issue page is a mess full of newbies having no idea how to even give the simplest useful data to help solve their problem, in typical user fashion.
>>
>>107037844
c-catbox for one of those?
>>
File: chroma___0151.png (1.61 MB, 832x1216)
1.61 MB
1.61 MB PNG
>>107037844
they were mine!
>>
>>107037862
A photo of a young pretty Asian, a very large-breasted busty-petite woman with a narrow waist riding an orange colored jet ski. She’s flashing a grand smile with perfectly white teeth. Her black hime cut hair is stylishly layered. There's a large green toad/frog sitting on her head. She wears a black bikini top exposing the round lower part of her breasts; underboob. She’s skinny, fit, with pretty eyes and an absurdly thin waist with hyper-tits. Perfect body composition. Female and jet ski travelling fast. In Hawaii, outdoors, sea. Motion blur, splashes of water, wind in hair. Motion blur. Nuclear explosion in the background; huge red orange explosion cloud. Cinematic. HDR. Deep shadows. Film grain.
>>
>>
>>107037701
you're gonna want to use NetaYume, not the original Neta Lumina. But probably fine, it does other weird shit like bestiality fine
>>
>>107037935
lovely tummy showing
>>
>>107037935
Wish I could get this level of texture and skin detail with my qwen edit outputs. I'm currently looking for some tech to upscale/add detail without fucking up the faithful img2img results or making fucked hands like this.
>>
>>107037937
>it does other weird shit like bestiality fine
bestiality is relatively easy to learn and generalize compared to toddlercon, which specifically requires it in the training data which is why I asked, but I guess I'll just check for myself
>>
Damn I guess glowies are back around here or the trolling has shifted.
>>
>>107035894
extended dark trap / goth pop version lol
https://vocaroo.com/14XMhEZYe2Mp
>>
>>107037476
Ok more power to you.
>>107037692
>>107037786
Is very awa even in Lumina dataset?
I remember checking docs and don't recall coming across it.
>>
>>107038123
No, it's a NoobAI thing.
>>
>>107037935
thought this was mayli
i miss her
>>
>>107037999
?
>>
File: 00176-2591963588.jpg (951 KB, 2048x2480)
951 KB
951 KB JPG
>>107037692
It looks like you didn't even read the documentation
>>107038123
No not at all the documentation is detailed and easy to find
With that said I have a long road ahead of me getting gud with this model
>>
>>107038459
>2048x2480
>>
>>107038459
how long does it take at that res
>>
>>107038459
Fuck off ranfaggot. Nobody asked.
>>
>>107038488
4 minutes on a 5090
>>107038540
You have been seething every day, for years. You can screech all you want but reality won't change.
>>
>>107038548
nta but you are the biggest avatarfag cancer every graced 4chan and this is not a compliment
>>
>>107038559
Can you point me to the avatar my disabled friend....
>>
File: Tsukuyomi.jpg (1.29 MB, 2040x2448)
1.29 MB
1.29 MB JPG
>mfw
>>
>proving that he manually digs up old post to spam
You make it too easy, your mental state deteriorates every single day you post here.
>>
Who are you quoting?
>>
>>107038459
The prompt book is GOATed. I wish more model authors did that.
>>
File: 00189-1926311296.jpg (1.05 MB, 2048x2480)
1.05 MB
1.05 MB JPG
Text is still hit or miss
>>
>>107038595
Thank you for letting us know.
>>
File: 1607477013.png (1.41 MB, 832x1216)
1.41 MB
1.41 MB PNG
>>
File: ComfyUI_06491_.png (1.42 MB, 896x1200)
1.42 MB
1.42 MB PNG
>>
>>
>>107035967
> like it is for llms
nigga still doesnt support qwen models what are you talking about
>>
File: image_00006_.jpg (1.07 MB, 2112x1184)
1.07 MB
1.07 MB JPG
>>
>>107038844
Nice
>>
>>107038595
i assume you're upscaling to these resolutions? It doesn't really support genning natively that crazy high super reliably, though it can do it
>>107038611
guy is acting like he's the first person in here to ever use the model lol
>>
>>107038844
hellraiser style. based
>>
>>107038844
I see your point
>>
Best NetaYume Lumina 2 artist tags?
>>
>>107038879
I guess you can figure it out yourself, it's not like the model didn't just become available for a large user base today or anything.
>>
>>107038923
I mean it always also worked in SD Next, which has supported Lumina 2.0 arch since it came out
>>
File: 1270076414.png (786 KB, 832x1216)
786 KB
786 KB PNG
>>107038923
Which model are you talking about?
>>
>>107038938
I'm not arguing with you over something this autistic find another battle to fight. People use forge it's now working on forge.
>>107038952
Neta Yume finally works on a forge fork
>>
File: image_00014_.jpg (644 KB, 1184x1560)
644 KB
644 KB JPG
>>107038880
got genre right, a bit older
>>
File: folders.png (30 KB, 1426x416)
30 KB
30 KB PNG
Trying to make a Kontext finetune specifically for anime image generation. I made my own dataset with multiple instructions (about 700 image pairs total).
Do you guys think it's a worthwhile endeavor? Spun up a 10$ gpu to train it for about 10 hours and it came out so-so for a first attempt.
https://civitai.com/models/2081917/aniedit-flux-kontext
Also if you guys have ideas of additional instructions I could add to my dataset that'd be cool, pic related.
>>
File: chroma___0061.png (1.12 MB, 896x1152)
1.12 MB
1.12 MB PNG
>>107038844
ew
>>
>>107038844
my old psychological horror movie gens are somewhere here.... from probably two years ago at this point... fuck i have to find them now
>>
>>107038911
there's a guide in the OP of this thread that was done on the original Neta Lumina that can still be useful. Or just try literally any actually booru artist tag with `@` in front of it.
>>
File: 1730485301726386.png (1.15 MB, 1048x992)
1.15 MB
1.15 MB PNG
>>107038972
>kontext
>>
It just needs one more finetune and some improvements to the text encoder.
It's so close so I'm hopeful for the lora training to get figured out but a little more training will push it over the hill.
>>
>>107039016
I'll eventually do QwenEdit too, but it's a test run for now.
>>
>>107039006
You don't need the @ tags they do nothing some some retard on civ did it and anons are copying. I tested it, nothing changes
>>
File: 1739464674349044.mp4 (853 KB, 440x496)
853 KB
853 KB MP4
>>107038680
>those eyes
get back you demon!
>>
>>107039006
I want to know which is your favorite.
>>
File: 995775028.png (1.84 MB, 896x1152)
1.84 MB
1.84 MB PNG
>>107038958
I see, which fork? I'm still using the original forge. Also how's the speed on that model compared to Flux or Chroma?
>>
File: image_00018_.jpg (542 KB, 1184x1560)
542 KB
542 KB JPG
>>
File: image_00019_.jpg (520 KB, 1184x1560)
520 KB
520 KB JPG
>>
>>107039076
https://github.com/Haoming02/sd-webui-forge-classic/tree/neo
It's way faster than chroma but hands and text are hit or miss. I think it needs more training more than anything else.
>>
File: chroma___0231.png (1.48 MB, 832x1216)
1.48 MB
1.48 MB PNG
>>107039124
ew
>>
>>107038459
I was about to ask if that was a Khyle lora but then I realized he has like 800 pics on booru.
Not surprising that AI can recall him. Good to know for me though.
>>
>>107039132
I see, thanks.
>>
>>107039132
>>107039076
Neta Yume is two and half times slower than SDXL for me.
Chroma is slower than both by a considerable margin. Not surprising since it is larger than others.
I really wonder why Neta Yume is slow though, since it is roughly the same size as SDXL.
>>
>>107039206
What card do you have
>>
File: Come on Google.png (1.46 MB, 1474x1291)
1.46 MB
1.46 MB PNG
https://xcancel.com/NotebookLM/status/1983220533417136603#m
>be me Google
>want to listen to feedbacks and realize that people want anime
>make SD1.5 tier model as a response
come on Google you can do better than that no?
>>
>>107039218
3060.
I am not gonna cry about Chroma being slow on this, but I am able to fit Neta Yume easily.
>>
Is there a 3rd windows explorer that can play all video clips from thumbnail view, instead of opening 1 by 1 and check? Like how it is in civitai
>>
>>107039232
normie goycattle don't care they'll eat the slop
>>
>>107039232
>4 fingers
>in 2025
>>
>local is dead
>saas is dead
clankers btfo
>>
File: 1760052055415655.jpg (1.08 MB, 1248x1824)
1.08 MB
1.08 MB JPG
>>107038659
how did you get this picture of me
>>
File: ComfyUI_06496_.png (1.46 MB, 896x1200)
1.46 MB
1.46 MB PNG
>>107039538
>>
>>107038659
me and my clanker wife
>>
>>107039576
Rabbi sex master? lmao
>>
what killed the /ldg/
>>
>>107039832
everyone is on christmas vacation
>>
>>107039832
mogao and wan2.5
>>
its cozy anon finally
>>
does cozy mean abandoned
>>
you dont deserve my gens right now anyway
>>
yea only you should be subject to those abominations
>>
>>107039832
>>107039834
>>107039870
>>107039907
>>107039909
>>107039948
>>107039952
All me btw
>>
File: criticanonplz.jpg (3.58 MB, 2560x2560)
3.58 MB
3.58 MB JPG
>>107039909
Sometimes, yeah. But I'm still here. I think...
>>
>>107039978
your oc fucking sucks kys
>>
File: 1740268665644473.png (36 KB, 784x286)
36 KB
36 KB PNG
https://github.com/comfyanonymous/ComfyUI/pull/10526
>>
>>107040040
Where is your OC?
>>
>>107040071
doesnt it already do it, as in models are offloaded to ram? between generations?
>>
File: nightkoto.jpg (3.62 MB, 2432x2432)
3.62 MB
3.62 MB JPG
>>107039978
OC? PUT SOME RESPECT ON HER NAME! SHE'S MAKOTO NANYA FROM THE VIDEO GAME BLAZBLUE! Educate yourself, son!
>>
>>107040089
kys avatarfag
>>
>>107040040
>>107040089
Bro, I seriously should, I straight-up fucked my reply lol
>>
>>107040091
I kinda am avatarfag, but I gen other things. I just gen banned for gens I make as jokes lol
>>
File: ComfyUI_07285_.png (3.21 MB, 2560x2560)
3.21 MB
3.21 MB PNG
>>107040091
I post other things too; things to get banned over. How does this look for an anon that judges every gen they see?
>>
>>107040071
> https://github.com/comfyanonymous/ComfyUI/pull/10526
> if not is_nvidia():
> return False
Fuck you, drag and shot.
>>
why the fuck is wan2.2 adding red spot or lump on armpit
>>
>>107040143
because you touch yourself at night
>>
>>107040143
Post your gen
>>
File: 1752429126703062.png (3.78 MB, 1416x1888)
3.78 MB
3.78 MB PNG
>>
>>107040126
you already posted this the other day, and was already told that this gen fucking sucks, check the fucking fingers retarded faggot.
>>
>>107040143
>India IP detected
>>
>>107040149
>>
File: 1758396874349860.jpg (672 KB, 1416x1888)
672 KB
672 KB JPG
>>
>>107040162
It looks fine?
>>
>>107040162
Did you? Sorry, I was drunk as hell. Don't worry, I won't post it again. Sorry about that.
>>
>>107040212
blind people general
>>
>>107040136
>*AMDkek cries*
you love to see them kek
>>
>>107040212
We are not in the business of "fine".
>>
Why hasn't hunyuan released a voice model yet?
>>
>>107040196
She's diseased
>>
>>107040229
I'm not a retarded cuck at least.
>>
>>107040136
>>107040351
just buy a used card bro
>>
>>107040378
It's the same thing.
>>
>>107040396
you're not really supporting nvidia if you're paying someone else to buy his old card from him
>>
>>
>>107040297
Me
>>
>>107039132
DPM++ 2S Ancestral Linear Quadratic is slower than Res Multistep Linear Quadratic but gives way better results with Neta I find
>>
>>107039027
the Yume guy has access to the original Neta Lumina dataset and apparently artists were in fact captioned that way. Their official prompt guide also did it like that IIRC. Has nothing to do with CivitAI
>>
>>107039206
try the Spanner fork of Teacache:
https://github.com/spawner1145/CUI-Lumina2-TeaCache
>>
fresh
>>107040459
>>107040459
>>107040459
>>107040459
fresh
>>
>>107040411
> you're not really supporting nvidia if you're paying someone else to buy his old card from him
Yes, of course, because he will end up without a card and surely will not buy a new one.
>>
>>107040461
im sure him waiting 1 more day on ebay for another buyer will make him go for ayymd instead...
>>
>>107040479
> someone will do that anyway
Brown mentality.
>>
>>107040559
the point is the difference to ngreedia is basically nothing, a comment shitting on nvidia online will do more damage than someone waiting for a day more to sell his nvidia gpu, if that, versus the difference to you, having to settle with ayymd dogshit gpus, a company that also hates you as the end user and only exists so nvidia doesnt get labeled a monopoly

if you were truly some absolute freedom absolutist, you wont be buying from ayymd either, you would be on a risc v computer, which you arent on.
>>
>>107040612
It's more about not losing pride. I'll continue having no business with ngreedia and despising their little cucks.
Reminding that other cards exist makes difference.
>>
>>107031687
moar
>>
>>107040559
Like how white people let infinity Indians take other?



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.