[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Full Abstract, My Forte Edition

Discussion and Development of Local Image, Video, and Music Models

Previous: >>108996927

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
Why does OPcollageTranny hate zimage gens?
>>
Blessed thread of frenship
>>
>>109001724
too much of a raped retard that needs to include schizo walls in the OP
>>
>>109001724
>mfw Resource news

06/06/2026

>HugginFace VFS Plugin: Native Total Commander file system for Hugging Face models
https://github.com/mikinko/HuggingFace_WFX

>ComfyUI Lance AIO: Custom nodes to run Lance-3B
https://github.com/SteveImmanuel/comfyui-lance-aio

>Cube: Generative AI System for 3D
https://github.com/Roblox/cube

>The token bill comes due: Inside the industry scramble to manage AI’s runaway costs
https://techcrunch.com/2026/06/05/the-token-bill-comes-due-inside-the-industry-scramble-to-manage-ais-runaway-costs

06/05/2026

>RhymeFlow: Training-Free Acceleration for Video Generation with Asynchronous Denoising Flow Scheduling
https://simon-dcs.github.io/Website-of-RhymeFlow

>Complexity-Balanced Diffusion Splitting
https://noamissachar.github.io/CBS

>Can We Predict The Human Preference For Text-to-Image Content Prior To Generation And Is It Even Useful To Do So?
https://github.com/LSU-ATHENA/HPM-Predict

>SAM-Flow: Source-Anchored Masked Flow for Training-Free Image Editing
https://github.com/chwbob/Sam-Flow

>Geometry-Aware Dataset Condensation for Diffusion Model Training
https://github.com/2018cx/GADC

>StoryVideoQA: Scaling Deep Video Understanding with a Large-Scale, Multi-Genre and Auto-Generated Dataset
https://github.com/nercms-mmap/StoryVideoQA

>Lightricks to split into two companies as it cuts 75 jobs
https://www.calcalistech.com/ctechnews/article/r1dgjt5gmg

>Akium Sampler: Custom k-diffusion sampler for Stable Diffusion Forge / A1111
https://github.com/AkiumAI/akium-sampler

>When AI builds itself: Our progress toward recursive self-improvement, and its implications
https://www.anthropic.com/institute/recursive-self-improvement

>U.S. Government Officials In Talks To Acquire Shares In AI giants
https://www.notus.org/technology/trump-ai-stake-openai

06/04/2026

>Echo-Infinity: Learning Evolving Memory for Real-Time Infinite Video Generation
https://echo-team-joy-future-academy-jd.github.io/Echo-Infinity
>>
>caring about the faggollage
kek
>>
>>109001724
Circlestonelab owns /ldg/
>>
why are all zit checkpoints so asian coded
Is there any nsfw capable one that isnt?
>>
File: ComfyUI_00029_.png (2.82 MB, 1536x1536)
2.82 MB PNG
>>
>>109001578
It's okay anon, I still love you and I don't think any less of you.
>>
>>109001793
Just prompt the race retard
>>
File: 970122053045319.png (2.44 MB, 1088x1600)
2.44 MB PNG
>>
some anon genuinely like the aesthetics of slopmixes? really?
>>
How slow is the new Ideogram model on good GPUs?
>>
>>
>>109001866
should've stopped at 0
>>
>>109001866
I prefer 40 steps because it gives me a free hamburger. Also, what is Ayakon in your filename? Is it some kind of sampler or scheduler? An Ayakon scheduler? I've never heard of it, could you tell us? ;)
>>
>>109001866
Yeah, I'm curious too. I searched for Ayakon in custom nodes, but I can't find anything either. :S
>>
File: 1764276958994561.png (106 KB, 1619x788)
106 KB PNG
i just trained a 1024px klein 9b character lora for almost 14000 steps, and it looks like i could probably go even further. damn i love this model
>>
>>109001926
shouldn't you stop training once the model adequately represents what it's supposed to?
use case for infinite training?
>>
>>109001926
>bro saves only every 700 steps
diabolical, cant imagine the rest of the settings
>>
>>109001866
the point of using more steps is for higher resolution dumbass
>>
File: kx59ir.png (815 KB, 1216x832)
815 KB PNG
>>
>>109001937
i first stopped at 6000, and decided to keep going to see if it improved even further, which it did.

>>109001939
my config is set to save every 5 epochs. i usually save at way less steps but this was an experimental model that i threw 140 images at.
>>
>>109001976
I guess one could make a case for slow training as it kinda makes sense that slow training could improve quality. I don't know if I have the patience for it though. I generally want to be done training after 30 epochs
>>
File: 346569720558861.png (1.21 MB, 1728x1024)
1.21 MB PNG
>>
Prompt me.
>>
>>109002062
undo your life
>>
>>109001937
None. Once model can reproduce training image, it's overfit.
>>
>>109002036
>no background
>no details
>simple colors
>slop colors
guess the model...
>>
>>108998170
is this guy still here? I need to know more about this process
>>
>>109002092
>>no background
>>no details
>>simple colors
>>slop colors
You just described the average Danbooru post tho
>>
Ideogram is still not an edit model so it's shit compared to Gemini etc..., unless they're planning on releasing one.
>>
>>109002108
thats why we need noob/illust tune of anima, to actually have a model ready for genning and not just training
>>
>>109002125
WE don't need anything but YOU do need to hone your skills a bit further
>>
>>109002136
im not the one that genned that image
>>
I need to talk to Catjack, how can I reach him?
>>
>>109002139
I know but you are complaining about a model trained on Danbooru looking like Danbooru
>>
>>109002144
just say something he would accuse you of being his boogeyman. I guess you already did
>>
>>109002152
>>109002125
>>
>>109002167
>>109002136
>>
>>109002175
telling me to "hone my skills" while defending an image with no background, simple colors, and slop colors despite this being fixable with also a prompt doesnt work retard
>>
you can discern anons skill level by how they refer to the names of various model desu
>>
>>109002144
you need to have a github project with more than 50 stars
>>
>>109002190
>defending an image
I'm not defending the image I'm defending the model because you blamed the model and not the genner >>109002092
>despite this being fixable
Then why did you claim it's the models fault?
>>
How are people using Ideogram?
I see a template for Ideogram 3 on comfy but for Ideogram 4? Are they using the online web interface?
>>
>>109002193
I guess he does if you responded
>>
>>109002206
you have to sign up for cloud and let them send all your workflows to the mothership
>>
>>109001724
>zimg
the average zit gen got stale about a week after its release and only one or two anons use loras with it
zib is more flexible in that regard
>>
Is there anyone here who has had sex? Does anyone know how horizontal nipples are created? Is it something genetic, or does it happen because of external factors like having babies or hormonal changes that affect the body for baby making?
>>
>>109002226
>zit gen got stale
unlike unique and novel masterpieces like white background 5 colors generic tranime girl >>109002036
>>
>>109002202
>I'm not defending the image I'm defending the model
if the model needs to be told to do the most basic shit like generate a background, any background fitting an image, otherwise it leans heavily to generating an empty background despite it costing the same compute, then it means its a shit model that doesnt cater to what basically any end-user wants while using it.

thats why it needs a finetune.
>>
>>109002230
>>109002250
>then it means it a shit model not madel that doesnt cater to what any end user wants while using it.
Why would you want a model to generate things you dont prompt for? I don't need the model to handhold my prompts.

Once a finetune of Anima based on e621 "like Noob" (not Noob/Illust as was incorrectly claimed) does release you will encounter the same problems you have with Anima. You don't want actual Noob, you want a slopmix that makes up for you lack of prompting abilities.
>>
File: 1764751425726978.png (189 KB, 844x369)
189 KB PNG
>>109002238
no, sorry, what i meant was, unlike a loss graph without a label of what its a loss graph of
>>
>>109002261
>he doeesnt know
holy newfaggotry
>>
>>109002231
i have had sex but i dont know the answer to your question :-(
>>
>>109001708
Catbox on the 3d hair please?
>>
>>109002257
>Why would you want a model to generate things you dont prompt for? I don't need the model to handhold my prompts.
do you also want to have to tell the model that a character should be alive instead of dead, to always prompt a pose otherwise the character is in T pose, to prompt where the character is looking otherwise it has dead eyes staring blankly, to prompt art style line thickness otherwise the lines are actually invisible and theres nothing in the image?

imagine being so retarded you have to defend a model wasting the same amount of compute on generating a white background instead of literally anything else for free.
>>
>>109002295
That's a lot of words to agree that you are looking for a slopmix kek
>>
>>109002306
>couldnt answer
concession accepted.
>>
>>109002314
Why would I answer a rhetorical question?
>>
local is dead
>>
>>109002323
its not rhetorical. do you want a model to hold your hand or do you want to prompt art style line thickness otherwise the lines are actually invisible and theres nothing in the image?
>>
>>109002337
I already told you I don't want a model that inserts things into the output that I didn't ask for. If the artist I'm using is known for both thick and thin lines, for example, and if I don't specify line thickness then it should be an amalgamation of both extremes. That's why if you don't prompt for a background you're most likely to get a simple white background.

This is just how raw finetunes work anon. It was the same with base Illust and base Noob, you were just using mixes and merges.
>>
>>109002206
anon posted a catbox the other day >>108996611
>>
>>109002356
and if I don't prompt a background it should be an amalgamation of all backgrounds that fit the image.

>It was the same with base Illust and base Noob, you were just using mixes and merges
noob most definitely does not give an empty background and simple as shit colors by default
>>
For the anons who say that Anima needs more steps at higher resolutions If 30 steps are necessary for 1024px, how many would be needed for 1536px? No, I'm not doing a rule of three calculation.
>>
>>109002380
>it should be an amalgamation of all backgrounds that fit the image.
It should be an amalgamation of the average Danbooru background which is white, simple.
>noob most definitely does not give an empty background and simple as shit colors by default
Base Noob requires you to be incredibly autistic with the prompt, just like Anima. You'd know this already if you used it a lot.
>>
File: gob wife video.webm (2.24 MB, 704x1280)
2.24 MB
2.24 MB WEBM
what is the best lightning lora that retains detail and texture?
>>
File: 06194-789303376.png (573 KB, 1024x576)
573 KB PNG
>>109002222
>you have to sign up for cloud and let them send all your workflows to the mothership
>>
File: anima_00007_.png (1.31 MB, 1024x1024)
1.31 MB PNG
Generic 1girl
>>
post your favorite 1girl barbequing at the beach
>>
>>109002398
wtf? there's more than 1?
>>
>>
So is Ideogram 4 edit-capable or not? Locally that is. I can't find a solid answer anywhere. The Ideogram website has editing but it's not clear what model is actually used for it.
>>
File: 1769996619767530.png (3.52 MB, 1728x1728)
3.52 MB PNG
>>109002423
>>
>>109002261
its the loss graph from lodestone training/testing crap and changing stuff midway training
>>
>>109002502
>I can't find a solid answer anywhere.
go to the official ideogram 4 github repository and comfy pages, the answer will be there (as in, if they don't mention it it can't)
>>
why no ideogram spam here?
>>
>>109002375
Thanks chief
>>
>>109002571
/ldg/ is split into two groups, for 2 extremes of the iq bell curve
low iq retards are too dumb to set it up yet and/or cant run it as fast as the other vramlet shit they are running.
high iq kinnoisseurs know its just a slopped censored model with regional prompting and some more world knowledge thats not worth setting up.
>>
LTX is very good at synthesising real people's voices based on tiny samples. Unfortunately their jew magic also turns you into a woman hating incel sexist misogynist

https://files.catbox.moe/k2wn4y.mp4
https://files.catbox.moe/1ppuxv.mp4
>>
tfw no midwits in ldg
feelsgoodman
>>
>>109002622
ltx 3 never ever
local seedance 2.0 maybe next year
>>
File: llmadapeter.png (2 KB, 367x99)
2 KB PNG
>>109002626
Yeah, me.

For lora training in Anima, is this the famous text encoder? I have it set to 0 because the model tends to forget things, right?. I also have Train UNet Only enabled.
>>
>>109001866
this isnt really useful if you dont mention the native generation resolution as well as the sampler (some wont meaningfully converge even at 1,000,000 steps) anon
>>
File: 1762343324333425.png (3.54 MB, 1728x1728)
3.54 MB PNG
>>109002514
closer
>>
i get really disappointed when my posts dont make it into the collage.
judge me all you like but thats the only reach i can hope to achieve with my gens
>>
>>109002656
collagetranny is too retarded to just use all images
>>
>>109002665
that would make the collage less special though see adt
>>
>>109002571
I'm not using because... I can't use it, I need to feed a online chatbot something for it to make a json for me to give to the model, and its slow af, its just... annoying, if it at least produced ultra high quality images
>>
>>109002666
theres not that many images anyway and its more interesting when you can actually just quickly glance over everything instead of more than half of the gens being removed. there is enough pixels of space, no reason to not include everything.
>>
Where is your Faggollage?
>>
File: ideogram4_00022_.jpg (702 KB, 1184x1776)
702 KB JPG
>>109002671
>I need to feed a online chatbot something for it to make a json for me to give to the model
use Kijais ideogram prompt builder
>>
>>109002684
dont worry anon youll be included next time im sure of it
>>
File: 06197-2042841946.png (1.21 MB, 1152x896)
1.21 MB PNG
>>
>>109002700
conceeeeeession status:...accepted.
>>
>>109002705
okay have fun seething at the collage bake i guess
>>
>>109002715
im having fun with retards who when cornered in a basic discussion cant engage anymore and then try to derail to cope
>>
lol
>>
>>109002692
from ComfyUI-KJNodes? will give it a try
>>
File: Ideogram4_00023_.jpg (521 KB, 1184x1776)
521 KB JPG
>>109002375
>>109002739
take that workflow
>>
>>109002728
are you really getting this heated over the collage kek
>>
i am a kinnosovlisseur
>>
>>109002746
see? cant engage -> derails by picking one:
1. why do you care
2. you are mad
3. random insult
bonus if its a vaguepost in the thread without directly quoting too. every time lol
>>
>>109002762
what does any of that have to do with your seething tho
>>
>>109002769
you picked option 2. three times in a row now, at least mix it up
>>
Anima seems very sensitive (negatively) to "slop prompts" where you spam duplicate shit. I think because it's better at following the prompt, this means you get worse images because it's trying to follow everything.
Every word matters now.
>>
>>109002774
why would i when you keep seethe replying regardless
>>
>>109002788
you can't, thats the point. npcs have to pick between the 3 points every time, same reason you did so 4 times in a row and will again to this comment
>>
>>109002796
>>109002684
>not that mant images
92 in the last thread
>when you can actually just quickly glance over everything
4chanXT
>no reason to not include everything.
see >>109002666 and hdg highlights


you will still seethepost tho
>>
>>109002408
based. tanned bitches make pp go boing boing
>>
why would you use anime for real when zit runs on toasters?
>>
File: 06198-3831131916.png (1.06 MB, 1152x896)
1.06 MB PNG
>>
File: anima_00032_.png (1.03 MB, 1024x1024)
1.03 MB PNG
I see 98% of the rest of humanity is still fucking retarded, arguing over fucking nothing. All day every day pure seethe, should just fucking kys desu.
>>
>>109002803
isnt the 4chan limit 10k by 10k pixels or something? 92 images is nothing

also arguing that
>that would make the collage less special
meanwhile like 1/4 of the fagollage is currently a loss graph screenshot despite only 10% of the images from the previous thread being handpicked to be in it, lmao
>>
>>109002838
hot
>>109002840
too many images makes it appear as noise simpleas
>>
>>109002832
cause of the NSFW capabilities mostly
>>
>not using Ideogram 4 + LLM guidance + prompt builder to make depraved comic-style fanfics
NGMI
>>
File: Ideogram4_00029_.jpg (665 KB, 1936x1088)
665 KB JPG
>>
File: ComfyUI_00313.jpg (3.58 MB, 1500x1920)
3.58 MB JPG
>>109002571
Advertising doesn't really work on 4chan because nobody likes anything here... and as bad as the Ideogram spam is, it's nothing compared to the Nanobanana launch. Google was paying everybody!

>>109002656
>doesn't post a gen to help make his case
I think OP made the right choice.
>>
>>109002846
>didnt address what was said
im glad 1/4 of the fagollage being a loss graph screenshot isnt noise though
>>
>>109002855
can you convince me to try with an output of yours?
>>
>>109002855
post some samples
>>
>>109002864
youre right its not noise its a loss graph but keep seething
>>
>>109002873
>plays dumb
>returns to npc programming instantly picking the 2. option again
zoo
>>
File: anima_00039_.png (1 MB, 1024x1024)
1 MB PNG
I'm loving these realistic anima checkpoints, so fast, so flexible no bullshit like with klein.
>>
>>109002877
>>109002788
>>
>>109002881
>>109002762
>>
>>109002884
>>109002700
>>
>>109002860
Pretty cool.
>>
>flux
>qwen
>zit
>ideogram
dead-end waste-of-time models
>>
>>109002880
Still look plastic as hell...
>>
>>109002888
>>109002705
>>
>>109002903
>>109002715
>>
>>109002905
>>109002762
is it like the 10th time i forced this npc to pick the same option now? kek
>>
>>109002897
what's the GOAT according to >>109002897?
>>
>>109002902
i don't mind that desu since the video models i use alter the skin regardless. I just need something that works and gives decent enough results that the video model can latch onto. But klein is good for image to image edit mode for make it more realistic sure and i often use it for that.
>>
I turned a screenshot of spy x family into a photo from the 1960's
>>
>there are still fags that use cumfart over sdcpp
lol. lmao even
>>
what happened to based china? i just remembered ernie image and hidream pixel-space exist, but nobody is using them?
>>
>>109002912
>>109002846
>>
File: IMG_3557.jpg (203 KB, 1472x828)
203 KB JPG
>>109002921
original
>>
File: Ideogram4_00030_.jpg (608 KB, 1184x1776)
608 KB JPG
>>109002890
Can be decent, but not worth using until filter is realiably bypassed. It's like having baked in reddit moderator.
>>
>>109002408
went for a similar one on Klein
>>
>>109002924
I can't remember the last time I thought China was based.
>>
>>109002921
Fix the hair colors. Also, this is local diffusion general, not API general.
>>
>>109002902
>>109002920
in fact my workflow uses klein 9b edit on last frame for each clip I gen using wan, but it changes the skin to much and does not always behave.
>>109002932
yeah its always going to make their skin better and everything overall. But anima is a good little model that even has a controlnet model for it now, it would make a nice little last WAN frame restore model imo
>>
>>109002922
desu id use diffusers before sdcpp
>>
>>109002944
the hair colors should remain unchanged because the ai made them look like normal japanese people from that period
>>
Why did we never get z-image edit btw
>>
>>109002953
oh yeah my Klein one wasn't an edit of your Anima one to be clear, I just prompted T2I for a new, similar one
>>
i tought zit was a smart model, why is it as soon i mention booba size it gives me booba instead of booba under cloth?
>>
>>109002959
China gonna china
>>
File: 1749608285106984.png (2.81 MB, 1256x1672)
2.81 MB PNG
>>109002902
it can get pretty good with the right samplers. and pid for qwen came out just this week too
>>109002932
what the fuck is that tanline though
>>
>>109002959
they probably didnt think it could beat klein since it never even released via api
>>
>>109002921
Grainy shit gen. GPT is so ugly and bad
>>
>>109002965
I guess it's like, she wears a low-cut curved top a lot in summer that isn't the same shape as the bikini?
>>
>>109002924
they will be waiting for the next commercial western hype model i think, then they will crush them. This is the Chinese's plan all along, they will destroy competition before they make any money from their own models.
>>
File: Ideogram4_00032_.jpg (693 KB, 1936x1088)
693 KB JPG
just fuck off
>>
>>109002965
lil too much chin fuzz on Emma here I think lol
>>
How do I get hired for red teaming?
>>
>>109002961
are you sure about that? because that tanline is looking fake like my own image. The lines are too sharp to be realistic.
>>
>>109002976
noooo you have to like the model with baked-in safety block images and no edit capabilities, that requires JSON input exclusively
>>
File: 1756257231441416.png (3.18 MB, 1728x1728)
3.18 MB PNG
>>109002654
as good as i care to get it before resorting to inpainting
>>
>>109002981
idk man lol, I prompted Klein for 3D though, not a photo exactly, cause I was trying to match the style of yours kinda
>>
>>109002921
another
>>
>>109002985
did you use some sort of tagger? It very accurate, i find prompt klein a bit too hard or i can't be bothered because it takes an essay of just the subject to get right.

I know Wan video prompting perfectly, i can create entire minutes long videos with that thing because it easy once you know 81 frame 5 seconds action limit. But klein is awful to prompt for i hate it.
>>
diffusion-pipe now supports Ideogram4. The model is very good for training actually. I gave early access to the code to a trusted acquaintance and he informs me that it's trivially uncensored just by training it. "it's basically less censored than z-image or klein when you train it" - him
>>
File: IMG_3558.jpg (100 KB, 1280x720)
100 KB JPG
>>109003009
original
>>
>>109002932
ann colter?
>>
>>109003014
thanks russ
>he informs me that it's trivially uncensored just by training it. "it's basically less censored than z-image or klein when you train it" - him
big if true
>>
>>109003009
not bad but the text on the books should be german
>>
File: Ideogram4_00034_.jpg (606 KB, 1264x1680)
606 KB JPG
>>
>>109003014
>less censored than z-image
But what exactly do you mean by this
>>
File: animaHighres_00011_.png (1.26 MB, 1024x1024)
1.26 MB PNG
this was genned at 2048 x 2048 i think i need more steps.
>>
File: HFanima.png (19 KB, 953x255)
19 KB PNG
https://huggingface.co/circlestone-labs/Anima
>>
how come anima is better at understanding natural language than zit, wtf?
>>
>>109003014
are people just training on the fp8 checkpoints?
does training work well with NL or do captions have to be json? any reports on it?
>>
>>109002965
That's a man.
>>
>>109003034
from my own experience with euler you tend to have to go up to 70 to 90 for convergence
also try my yarat lora if you havent merged it in already, it was partially trained on 2048px
>>
>>109003038
probably the text encoder, its great because it has tags but when they don't work you can literally just tell it in your own words and it usually gets the idea. They done an amazing job here.
>>
>>109003029
train lora on 500 NSFW images and it gives much better results than z-image or klein 9b can achieve on the same dataset. even after just a few hundred steps you will never get "image blocked by safety filter" anymore
>>109003042
With the latest commit, diffusion-pipe can train directly on the native comfy quantized weights, like fp8_scaled. I doubt any other training script can do this currently. For example, I believe AI-Toolkit requires the original checkpoint format released by Ideogram. It works very well with basic NL. JSON might be better idk, I can't be arsed to format a dataset like that.
>>
File: 8605.png (32 KB, 379x499)
32 KB PNG
how do I get control after generate form random noise to show here as an option? My flux subgraph has that but I have no clue how to achieve this
>>
File: Ideogram4_00035_.jpg (440 KB, 1264x1680)
440 KB JPG
>>109003073
why dont you just release the lora
>>
>>109003073
can you make a trainer using ggml? I hate all this python shit
>>
>>109003082
ok think I found out, right click it -> promote, somehow it was already promoted but didnt show up, unpromoting and promoting again fixed
>>
>>109003073
sounds pretty fucking neat. thanks a lot
>JSON might be better idk, I can't be arsed to format a dataset like that.
i happen to have a larger data set with people based bounding boxes floating around anyway, just need to find the motivation to burn money on training in spite of the shitty license kek
>>
just give up lilbro
>>
>>109003094
>in spite of the shitty license kek
what's up with all these shitty licences anyways? anima and this i4? What the hell is up with these companies?
>>
>>109002972
they will wait for the next google video model and then BASED Bytedance will mog them with Seedream 6.0 and Seedance 3.0 API while Qwen releases a shitty happyhorse 2.0 that by all accounts should be local but they too put it behind API. Then Hunyuan comes along 6 months later with an LTX-tier video model with 4 text encoders that require 140GB VRAM to run train on seedream outputs. meanwhile kekstone discovered a revolutionary new way to train at 128x128 pixels to maximize flux.1 schnell training speed
>>
just saw a "child ai-influencer" made by some pajeet, don't know what to think about that

>>109002983
reddit be like
>>
>>109003020
lol I can see the resemblance now that you mention it honestly
>>
>>109003107
Can you explain why the Anima licence is bad without sounding like a jeet?
>>
>>109003083
>>109002867
>>109002865
this is what i mean when i asked yesterday: >>108998101
what's with the shitty ideogram shilling? it's the same thing again and again: "this model is NUTS if you know how to bypass the censor, just a little training and it gets real wild! with the right workflow you can generate ANYTHING!" but when asked to show NSFW results, nobody can deliver.
>>
>>
>>109003029
Klein is unironically better than Z Image out of the box at booba. Neither can do PP at all out of the box though. Hunyuan 2.1 and 3.0 can / could, though.
>>
>>109003135
cant do private lora commissions
>>
>>109003135
it's not just open source so nobody complains
>>
>>109003147
any fee would be for image generation not training try again
>>
>>109003145
Z base is way better than klein 9b
>>
File: debo_i_fia_00049_.png (2.89 MB, 1433x1792)
2.89 MB PNG
>>
>>109003153
in what way?
>>
>>109003151
what do you mean? private lora commissions is literally one of the examples of what you cant do by tdrussell himself on the HF thread where he explains the license
yes you obviously can circumvent it, doesnt change the fact its objectively worse than z-image legally speaking
>>
>>109003159
Fuck off
>>
>>109003143
hai i am carnivore can i eat your pussy?
>>
>>109003139
>"this model is NUTS if you know how to bypass the censor, just a little training and it gets real wild! with the right workflow you can generate ANYTHING!" but when asked to show NSFW results, nobody can deliver.
i know youre more describing the average plebbitor since no one here really cares _that_ much about i4, but this is just the way in which all free download models are spoken about when theyre new
>>
>>109003162
There's no plausible reason he'd ever know about an, uh, private, lora commission though, so why does it matter
>>
File: debo_i_fia_00051_.png (3.39 MB, 1433x1792)
3.39 MB PNG
>>109003163
>>
File: animaHighres_00014_.png (1.35 MB, 1024x1024)
1.35 MB PNG
eular_a works best it seems for this checkpoint at least, it smooths things out.
>>
>>109003147
>without sounding like a jeet
>"private lora commisions"
mission failed
>>
>>109003162
>private lora commissions is literally one of the examples of what you cant do by tdrussell himself on the HF thread where he explains the license
i see he mentions providers like civitai and fal not individual bakers getting commissioned
>>
lilbro please you already gave up on apache 2 anima just let it go
>>
>>109003186
people need to eat my guy
>>
>>109003091
now I cant change the order of the options... subgraphs are buck broken
>>
>>109003188
he needs a different lawyer then because that's not what the licence states. I don't need to get sued
>>
>>109003195
though its funny, on the klein workflow it just works, for whatever reason its only on the ideogram workflow
>>
>>109003204
wait... first you said he mentioned it in the HF thread but now youre saying he didnt but it says it in the licence? anon...
>>
>>109003177
the difference is that tdrussell could probably enforce this. he wont but you have to rely on what comes down to a pinkie promise. not really a good thing if you want to actually make money from this
also dont get me wrong i fully understand why it is this way, its still objectively worse than MIT or whatever and theres no reason to pretend otherwise
>>109003188
https://huggingface.co/circlestone-labs/Anima/discussions/37#69af0cad85dd88d442416f99
>If you are locking down the lora and selling access to it, that is commercial use of a derivative model and isn't allowed. If someone commissions a lora that you then post publicly, I would say that it simply someone donating to you to help with training costs, and isn't commercial use of the model itself.
>>
>>109003149
"Open source needs to change" is a much different argument or point than "This one model needs to change".
>>
>>109003214
a hugging face reply isn't as legally binding as a licence anon
>>
>>109003225
the original complaint is about recent models being these commercial dogshit licences
>>
>>109003224
>https://huggingface.co/circlestone-labs/Anima/discussions/37#69af0cad85dd88d442416f99
damn...
>>
wait, tdrussel is going to personally HUNT DOWN and DESTROY all the browns who try to sell loras?? fucking based
>>
File: animaHighres_00016_.png (1.48 MB, 1024x1024)
1.48 MB PNG
time for some Miku, just because.
>>
>>109003224
I guess the upside is that this doesn't deter giant finetunes since most of those bakers only ask for non-model-specific donations.
>>
what should i gen?
>>
>>109003224
>If you are locking down the lora and selling access to it, that is commercial use of a derivative model and isn't allowed. If someone commissions a lora that you then post publicly, I would say that it simply someone donating to you to help with training costs, and isn't commercial use of the model itself.
I'm not gonna release every lora I train. People who get them can upload them, I couldn't care less.
>>
>>109003243
Turbo Dick Russell
>>
>>109003269
the right to sell them belongs to Russ anyways
>>
>already made 76k on Anima LoRA commissions
Uh... how fucked am I?
>>
>>109003278
how much did you make?
>>
>>109003278
No one cares about your zimbabwe dollars
>>
are any of these good?
https://civitai.red/models/897413/big-love

It looks like he's doing it for like every conceivable arch, he's got Ernie, Klein 9B, Qwen 2511, Z, etc
>>
Hello, returning from a 4-month break here. Can we get an update on AnimAnon's OpenAnima project he said he was starting alongside LAX from Laxhar Labs, the creator of noob? I heard they were working on an Apache licensed cosmos-based model just like Anima, but it was going to be more "open for game developers to use". I looked around and couldn't find any info on it. It seems like LAX has joined ComfyOrg to work on Noob 2, and AnimAnon's UI hasn't been updated yet either.
Can someone update me on the status of AnimAnon and what he is currently working on? I am looking forward to seeing his 4 months of progress towards an open diffusion model for all.
>>
no you will never be rich and famous, catjak. just alone, schizophrenic and retarded from getting raped
>>
>>109003294
OpenAnima definitely doesn't exist in any way
>>
What did happen to ani anyways?
>>
>>109003287
76k USD
>>
>>109003312
yeah you are in deep shit
>>
>>109003294
no idea on animanon but lax announced at one point they would use glm image for noob 2, then spparently pivoted to klein 9b recently
>>
File: animaHighres_00019_.png (1.65 MB, 1024x1024)
1.65 MB PNG
Love this model or Miku will come to your house with klashnekoff blyat.

WTF is lil kleiner gonna do?
>>
File: z_00448_.jpg (756 KB, 1264x1800)
756 KB JPG
>>
>>109003320
I give 0 shits about noob 2 since it's saas only
>>
>>109003320
there's no way they're actually doing GLM lmao, would be like dumber than AuraFlow was as a choice for Pony V7
>>
>>109002238
>>109002261
what does this have to do with zit being too rigid compared to zib
>>
>>109003321
I'm an enjoyer of both Anima and Klein DESU
>>
>>109003320
they pivoted again to some shit called sensenova. noob is dead. like most models, it was a one-hit wonder. the next big thing after anima will be developed and released by someone nobody has ever heard of, as was the case for every relevant finetune.
>>
>>109003354
>the next big thing after anima will be developed and released by someone nobody has ever heard of, as was the case for every relevant finetune.
this anon speaks true
>>
>>109003354
pony only got better with v7 though and v8 will make saas pointless
>>
>>109003340
>>
>>109003367
this is true, right now pony v7 is ranked second in the safety arena, right behind ideogram.
>>
File: ldg guy.jpg (350 KB, 768x1344)
350 KB JPG
>>
>>109003354
wow wtf, why are they so obsessed with rando architectures lmao, this is even less well known than GLM Image
https://github.com/OpenSenseNova/SenseNova-U1
>>
>>109003369
Absolute garbage parody, doesn't match the original meter at all.
>>
>>109003376
real and straight
>>
>>109003376
ldgfren should be shorter, fatter, balding, and much much uglier
>>
>>109003392
speak for yourself anon
>>
>>109003291
No.
>>
>>109003183
wich model? there are so many now
>>
>>109003389
blame gemini
>>
Whats with all the sameface?
>>
>>109003147
That's it? Is there anything actually damning in the licence?
>>
>>109003401
https://civitai.red/models/2409949/samanima

i downloaded this one and another today, the above link imo is best. no lora used in that image eular_a simple 2048 squared 50 steps. 1 cfg because its a turbo model but it can easily do 4 cfg with 30 steps.

author states 1 cfg 16 steps max, but then you never really listen you just gonna test settings.

using some other detail or realistic lora's might give better results but i've not testing it fully.
>>
File: animaHighres_00017_.png (1.5 MB, 1024x1024)
1.5 MB PNG
>>
>>109001724
everyone with taste hates z-image gens

t. not the collage guy
>>
>>109003439
everyone with homo tastes that is
>>
>>109003461
on opposite day
>>
File: animaHighres_00020_.png (1.68 MB, 1024x1024)
1.68 MB PNG
>>109003401
trust me on that model, it is insane for what that little model can do. If lodestone was not such a kek he would just do a fine tune of this thing it would be mental.
>>
>>109003371
>pony v7
Should be number one, nothing is safer than a model nobody uses
>>
>>
>>109003312
Are you running a custom LoRA commission service on patreon or something else?
>>
>>109003469
epic rowing technique
>>
File: z_00456_.jpg (591 KB, 1264x1800)
591 KB JPG
>>
cozy bread
>>
>>109003502
in the future this how they row, they won't even notice anon. everything will be fake and they won't even notice, prove me wrong.
>>
File: fluxNiggermix_0024.png (1.99 MB, 1538x1022)
1.99 MB PNG
i love ai surrealism. it really activates the creative part of your brain.
also its important not to gen 1girls over and over. you risk falling into coomer mode, and it's hard to get out once you're in
>>
>>109003502
its probably not even in its dataset as realistic photo that will be why, however if i did in anime style first and then forward to controlnet (it exists google it) and second stage to realistic i think it will work just fine.
>>
>>109003514
>fluxNiggermix
whats in the mix, anon?
>>
>>109003520
a photo album of you growing up
>>
>>109003514
nice
>>
File: I love mayonnaise.png (393 KB, 506x474)
393 KB PNG
Russell should have developed AniRealism first, then finetuned it for anime with Anima, and later created a UI called AniStudio.
He would have made much more money, and the anime model would have had much better anatomical consistency.
>>
>>109003521
Kek
>>
>>109003503
Is this what modern smartphone pics look like? I haven't seen one in so long I don't remember anymore.
>>
>>109003503
I will CUM all over her freckled face and booba
>>
catjak. what the fuck
>>
catjack desu
>>
why did /ldg/ memoryhole catjack?
>>
I hate how stagnant everything is again. new models are barely better than what we had for a year already
>>
>>109003503
now make a collage with her pictures side by side, each one 15 years appart until 90
>>
>>109003551
>>109003561
>>109003569
>>109003571
holy schizo spergout batman
>>
>>109003569
could have been cool during the Victorian cat girl age but became a tranny schizo that has to hijak the OP to spite people she doesn't like. that started like 4 years ago and she still does it
>>
File: animaHighres_00024_.png (1.49 MB, 1024x1024)
1.49 MB PNG
>>109003503
That's nice i guess but how long did that take?

>stupid old cherry faced drunk man angry in his kitchen, threating the camera, propper nutcase. very short hair, scruffy beard.

vs your what? 500 word fucking essay... get lost
>>
Never forget catjack's 500 IQ Anima tag comparisons from Feb Mar.
>>
>>109003582
PDXL?
>>
File: cloud noise.jpg (1.13 MB, 1920x1920)
1.13 MB JPG
>>109003503
why do recent 'realism' models all have this ugly noise pattern overlaid on top of them? zit is especially terrible with this
>>
>>109003586
are you literally mentally ill or a bot of some kind, talking about a nameposter that doesn't exist
>>
File: animaHighres_00026_.png (1.37 MB, 1024x1024)
1.37 MB PNG
>>109003592
anima, now i add tags.

stupid old cherry faced drunk man angry in his kitchen, threating the camera, propper nutcase. very short hair, scruffy beard. flushed face, angry red face, blood shot eyes, deranged, phycopath

4 cfg is interesting but too extreme, i will post that next, here is 2.5 cfg 16 steps

so easy to prompt.

but realism it can't do unless you prompt it right, so i need red skin not flushed face
>>
who could can forget catjack who left their dark shitstain on the thread forever?
>>
File: animaHighres_00028_.png (1.31 MB, 1024x1024)
1.31 MB PNG
>stupid old drunk man angry in his kitchen, threating the camera, propper nutcase. very short hair, scruffy beard, angry face, blood shot eyes, deranged, phycopath, (pale red skin:3)

heheh

it can't do it, or i'm not prompting it right.
>>
"catjak" does not exist, take your meds
>>
File: animaHighres_00029_.png (1.33 MB, 1024x1024)
1.33 MB PNG
>>109003664
>stupid old drunk man angry in his kitchen, threating the camera, propper nutcase. very short hair, scruffy beard, angry face, blood shot eyes, deranged, phycopath, (red skin:3)

did it, removed the pale which i used to anticipate the overly read face, so now i just need to lower the cfg. yeah i spelt threatening wrong so fuck.
>>
>>109003664
>>109003688
looks worse than SDXL
>>
it is schizo hour i see
>>
>>109003693
it does but you don't know how i do my images...
>>
File: animaHighres_00031_.png (1.4 MB, 1024x1024)
1.4 MB PNG
>>109003693
GET OUT FAGGOT!
>>
File: animaHighres_00032_.png (1.38 MB, 1024x1024)
1.38 MB PNG
>stupid old drunk man angry in his kitchen, threating the camera, propper nutcase. very short hair, scruffy beard, angry face, blood shot eyes, deranged, phycopath, (red skin:3), mouth wide open, irate, shouting loudly, teeth showing

closer 1.5 cfg lowering to find sweet spot
>>
he fucking mentions this "catjak" from his imagination every thread, with no one ever showing up that actually is that
>>
klein can't do this shit and neither can wan so what ever, its too angry, anima does not care lol
>>
File: animaHighres_00034_.png (1.42 MB, 1024x1024)
1.42 MB PNG
>stupid old drunk man angry in his kitchen, threating the camera, propper nutcase. very short hair, scruffy beard, angry face, blood shot eyes, deranged, phycopath, (red skin:2), mouth wide open, irate, shouting loudly, teeth showing

lowered strength from :3 to :2 on (red skin) done it, this is how most real British men feel about their politicians right now.
>>
>>109003575
she looks like the type who will still have huge sexy booba at like 75 desu
>>
>>109003730
\> inventing a fictional Klein / Anima rivalry tha nobody else has ever mentioned or implied
>>
now do that with baby klein the safety model lol fuck off. I will be surprised if anon can do that same angry expression in the time i did with klein 9b. I think it could do it but it would look fake and cringe, like a pathetic cherry picked stock photo cringe you can find using google images. But what ever keep using that cucked model if you want, we want the real deal though.
>>
okay anon
>>
>>109003761
bruh are you drunk
>>
>>109003764
yeah anon has a tendency to drunkenly sperg out at no one in particular
he needs help desu
>>
File: animaHighres_00035_.png (1.41 MB, 1024x1024)
1.41 MB PNG
>>109003764
>>109003758
you are full of seethe a 2 billion model mogs you to no end and will continue to do so.

seethe

I am enjoying.
>>
the absolute state of local
>>
give me any idea or expression and i can create it no problems, klein will give you body horror, your shills are so pathetic, so desperate.

So ripe.
>>
File: image.jpg (1.05 MB, 2048x1536)
1.05 MB JPG
>made lora in anima
>1560 steps
>0.00002 LR
>text encoder blocked
>its forgetti
mom's spaghetti
>>
>post 1girl, red hair, cute
>anon spergs out over it for nearly an hour
kek
>>
>>109003785
Why aren't you using the authors default training params? It works fine.
>>
File: animaHighres_00036_.png (1.6 MB, 1024x1024)
1.6 MB PNG
lol like any of you cunts have any creative ability over the right?
>>
File: 00004-4149886489.jpg (986 KB, 2880x2496)
986 KB JPG
>>109003425
I got curious about how close this anima merge could get to realism. So I hit it with my choice of anime settings and lora enhancers and upscaled it. It came out better than I expected.
>>
you don't like us because our memes are linguistic and compact way to compress ideas, concepts etc. all you jerk offs create are cartoons because your fucking heads are full of cartoons, you = turd do not fucking (you) me faggot.

get out of our thread you do not own it...
>>
https://www.reddit.com/r/StableDiffusion/comments/1tzq7js/psa_a_possible_malware_disguised_as_comfyui/
more comfy malware I see
>>
>>109003827
cute elf gril
would smooch
>>
its the styles and etc. dead heads with no creativity care about such things because they want to copy like a child does with tracing paper.

surely you massive bloaty workflow will make you better...
>>
glad to see youre still here, ivan, and you were able to walk the 10 miles round trip to the liquor store
>>
but we can teach you, but you attack us always you always do do not say i came into this thread to attack you because i didn't.
>>
Sir, this is an API Nodes thread.
>>
>>109003848
>retard downloading and executing suspicious code
A tale as old as time
>>
i can teach so much about wan prompting for example because that is what i mostly worked with over the last year or so. 81 frames no more because it will loop back on it self and mess shit up, that is given and should already be know, the official prompt guide explains the prompting but it does not explain the 5 second limit. and one action, you can only usually get away with 1 action per clip it looks natural but its the best we got without it producing body horror.

people take little time to understand limitations of models, they all have limitations.
>>
>>109003827
bingo, what settings? if you care to share that is.
>>
>>109003827
that is a really clean gen btw, the detail is superior to pdxl, close to klein. I noticed anima does well with detailed prompts, but has a lot more flexibility. Shorter prompts produce less detailed results.
>>
>>109003827
how is anima with futa?
>>
>>109003889
Before upscale.
https://files.catbox.moe/v4anaq.png
After.
https://files.catbox.moe/nuam9e.png
Also, I know the negs aren't doing anything. I have the quality prompts set up to auto add themselves for every gen.
>>
Fresh

>>109003927
>>109003927
>>109003927
>>109003927
>>
>>109003929
it is impressive actually, i'm trying to at least get that level of detail on the first pass as you have done. My images look a bit blurry when you zoom in, you're has skin detail still when zooming in. The upscale looks bang on, what upscale method did you use? I suspect some upscale model on tiled or? It does not like anima was used on each step of the image gen.
>>
>>109003929
>>109003954
and do note i try to avoid massive workflows but if you have one i can look at it and probably strip out 10 or more nodes because mostly its not needed.
>>
>>109003954
For the upscaler I used nvidia-vfx. A damn near perfect upscaler. You can also use SeedVR2 but it way more slow. To get around Anima's limits, I also use MultiDiffusion in img2img. I don't have a workflow to share since I use Forge Neo.
>>
>>109003870
remember when APIKeks were trying to insist shit like this was only possible with Gippity 2
>>
File: z-image-turbo_00042_.png (1.81 MB, 1344x1344)
1.81 MB PNG
1girl.
>>
>>109003990
kek but also how did comfyui become a glorified API wrapper? I thought comfy was open source?
>>
>>109003291
No. Worst scam since merge exist.

You can find many on Hugging face using sha256... And that's really shitty lazy merge.
SubtleShader is probably one of the worst piece of shit in the creator community.
This scammer delete almost any negative comment and black list users saying how bad his merge are bad.
>>
https://www.youtube.com/watch?v=BzcQE8Jrry0
>>
>>109003439
if you hate the most realistic model that just means you hate it since it reminds you of your shit life irl, sounds like a you problem



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.