[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


Discussion and Development of Local Image, Video, and Music Models

Previous: >>109034986

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
SDWebUI: https://rentry.org/ldg-lazy-getting-started-guide#the-stable-diffusion-web-ui-lineage
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>Wan
https://github.com/Wan-Video/Wan2.2

>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>inb4 the useless unfappable deepfake spam
>>
https://rentry.org/LDG_vital_info
>>
Blessed thread of frenship
>>
https://www.anthropic.com/news/fable-mythos-access
cloudkeks are on the ropes
>>
>SaaS is so powerful that the government has to step in
meanwhile local is so kekked they censor themselves
>>
>>109041732
Anthropic genuinely might be an all time top 3 "snake oil" business.
>>
File: Wan21_SCAIL2_00016.mp4 (1.71 MB, 912x688)
1.71 MB
1.71 MB MP4
>>109041701
>his grapes are sour
>>
>>109041746
you're so gay that the government has to step in and prevent you from trying to suck all the cocks on the planet.
>>
>>109041732
>be local
>win

Local can't stop winning.
>>
>>109041746
>SaaS is so powerful that the government has to step in
Well retard, If you actually read the statement you'd see that Anthropic do not agree with you, they think the US govt is being vindictive.

>[...] we believe the government should have the ability to block unsafe deployments, as part of a statutory process that is transparent, fair, clear, and grounded in technical facts.
>This action does not adhere to those principles.
>>
>Local models are free, uncensored, can be trained to do anything
>Yet somehow the government feels bigger threat from 'censored' API
local models are so dumb they're effectively harmess, nobody takes them seriously. API models are so advanced in thinking and capability, they're not even in the same league.
>>
File: 1777729074642151.png (87 KB, 686x386)
87 KB PNG
>API models are so advanced in thinking and capability,
>>
>>109041888
He's right. Car washes don't reduce your wanted level. If you want to wash your car, you need to go to a body shop.
>>
>>109041831
>API models are so advanced in thinking and capability, they're not even in the same league.

Are you sure about that cloudkek? You can't finetune cloudshit models on custom data. As it stands, a local model with a LoRA is more dangerous than censored ("more capable) cloudshit model with similar param count. You might say (apples to oranges) but remember glm 5.1 was already open sourced.
>>
>>109041831
They already seethed about celeb deepfakes and AI political propaganda. There just wasn't a major incident yet.
>>
>>109041916
>You can't finetune cloudshit models on custom data
Because you don't need to, this isn't the own you think it is. Local is so far behind you still have to train loras for outfits while API can do it in a single-shot and even search the internet.
You could try training a local model to make a bomb or whatever but they're so dumb that it wouldn't even work properly. Once GPT/Claude/Gemini gained internet access, local fell off the face of the earth. I can just ask GPT to "put her in the kansas city royals jersey" and it will work, no need to use separate edit models (outdated localcope) or custom loras.
>>
Pretty sure their concern is more with people finding a way around the safeguards and making 1000+ agents hack into national security shit than a few degenerates generating CP
>>
File: 1754472605835755.png (3.99 MB, 1536x1024)
3.99 MB PNG
>>109042033
>I can just ask GPT to "put her in the kansas city royals jersey"

Yeah but then it'll have that GPT image grunge, so its useless
>>
File: q_zm8bal.png (707 KB, 960x960)
707 KB PNG
>>
File: vifdd.png (89 KB, 859x742)
89 KB PNG
Why does this video loader only have an image input, how am I supposed to get a video upscaled with this?
>>
>>109042056
Nah, their main concern is preventing a foreign adversary from finding and patching their backdoors. Read the reasoning behind the immediate ban- >>109041673
>To date, the government has only given us verbal evidence of a potential narrow, non-universal jailbreak, which essentially consists of asking the model to read a specific codebase and fix any software flaws. Our understanding is that one potential jailbreak was shared with the government. We have reviewed the report and validated that the level of capability displayed there is widely available from other models (including OpenAI’s GPT-5.5), and is used every day by the defenders who keep systems safe. We will share more details over the next 24 hours.

From that alone, it's obvious.
>>
>>109042110
>image
s
>>
File: ComfyUI_00314.jpg (3.16 MB, 2656x4096)
3.16 MB JPG
>>
File: 1662341561354.png (1.28 MB, 1184x896)
1.28 MB PNG
>>109041707
>>
>mfw Resource news

06/12/2026

>ComfyUI-Flux2Klein-Enhancer: Conditioning enhancement and reference latent control
https://github.com/capitan01R/ComfyUI-Flux2Klein-Enhancer

>InterleaveThinker: Reinforcing Agentic Interleaved Generation
https://zhengdian1.github.io/InterleaveThinker-proj

>Experimental Anima LLLite Regional Controlnet
https://huggingface.co/Sen-sou/Anima-LLLite-Regional-Controlnet

>World Tracing: Generative Pixel-Aligned Geometry Beyond the Visible
https://haoz19.github.io/world-tracing-page

>VietFashion: Benchmarking Sketch-Text Composed Image Retrieval for Cultural Outfits
https://hng0303.github.io/VietFashion

>Modality Forcing for Scalable Spatial Generation
https://modality-forcing.github.io

>VideoMDM: Towards 3D Human Motion Generation From 2D Supervision
https://videomdm.github.io

>EvTexture++: Event-Driven Texture Enhancement for Video Super-Resolution
https://github.com/DachunKai/EvTexture

>Budget-Constrained Step-Level Diffusion Caching
https://github.com/Westlake-AGI-Lab/BudCache

>ECA: Efficient Continual Alignment for Open-Ended Image-to-Text Generation
https://github.com/Snowball0823/ECA

>InterleaveThinker: Reinforcing Agentic Interleaved Generation
https://zhengdian1.github.io/InterleaveThinker-proj

>i1-3B: A Simple and Fully Open Recipe for Strong Text-to-Image Models
https://huggingface.co/zlab-princeton/i1-3B

06/11/2026

>i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models
https://zlab-princeton.github.io/i1

>AnchorEdit: Maintaining Temporal Consistency in Multi-turn Image Editing via Causal Memory
https://github.com/xuhang07/AnchorEdit

>Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language Models
https://github.com/elmma/mllm-reroute

>ComfyUI-BerniniStudio
https://github.com/CCpt5/ComfyUI-BerniniStudio

>Ideoprompt: plain English to Ideogram 4 structured JSON prompt
https://github.com/cocktailpeanut/ideoprompt

>Orion4D FXMax for ComfyUI
https://github.com/orion4d/Orion4D_FXMax
>>
>mfw Research news

06/12/2026

>TetherCache: Stabilizing Autoregressive Long-Form Video Generation with Gated Recall and Trusted Alignment
https://arxiv.org/abs/2606.13035

>DuET: Dual Expert Trajectories for Diffusion Image Editing
https://arxiv.org/abs/2606.13303

>Efficient, Robust, and Anti-Collusion Fingerprinting of Image Diffusion Models
https://arxiv.org/abs/2606.12977

>ReFree: Towards Realistic Co-Speech Video Generation via Reward-Free RL and Multilevel Speech Guidance
https://arxiv.org/abs/2606.13304

>SeamEdit: A Black-Box VLM-Agnostic Pipeline for Large-Image Semantic Editing
https://arxiv.org/abs/2606.13041

>Towards More General Control of Diffusion Models Using Jeffrey Guidance
https://arxiv.org/abs/2606.13240

>AudioX-Turbo: A Unified Framework for Efficient Anything-to-Audio Generation
https://zeyuet.github.io/AudioX-Turbo

>SmartFont: Dynamic Condition Allocation for Few-Shot Font Generation
https://arxiv.org/abs/2606.13382

>High-Fidelity Two-Step Image Generation via Teacher-Aligned End-to-End Distillation
https://arxiv.org/abs/2606.12575

>Bridging Modal Isolation in Interleaved Thinking: Supervising Modality Transitions via Stepwise Reinforcement
https://arxiv.org/abs/2606.12886

>Selecting Samples on Graphs: A Unified Dataset Pruning Framework for Lossless Training Acceleration
https://arxiv.org/abs/2606.12913

>Edit the Bits, Diff the Codes: Bitwise Residual Editing for Visual Autoregressive Models
https://arxiv.org/abs/2606.13558

>HYDRA-X: Native Unified Multimodal Models with Holistic Visual Tokenizers
https://arxiv.org/abs/2606.13289

>Emotional regulation improves deep learning-based image classification
https://arxiv.org/abs/2606.13081

>Weekend Time
https://suno.com/s/NDgulWDocrYOA7US
>>
>>109042180
>>109042188
None of this data is useful when you blindly post it without any vetting. Didn't you learn your lesson already?
Why are you spamming this in this thread when you have a tard cage all to yourself?
>>
>>109041406
Haven't figured out how to nest LoRAs yet, I guess one could merge one of them into base instead, and then go from there. In the meantime, the final epoch perhaps wasn't so bad after all even with a bit of overbaking in there.

https://vocaroo.com/12tVNq7SnhO1
https://vocaroo.com/1iXXFRxfvMQy
>>
File: debo_cm_anima_00047_.jpg (45 KB, 390x470)
45 KB JPG
>>109042265
your 'feedback' is just a vehicle for you to attack me as a poster; therefor, it will not be considered
>>
>>109042312
>https://vocaroo.com/12tVNq7SnhO1
Leslie Parrish's voice really shining thru there kek. Lowering LoRA weight should always mitigate diversity issues.
>>
File: bimbos.mp4 (2.71 MB, 704x1280)
2.71 MB
2.71 MB MP4
>>
>>109042265
Don't listen to this guy >>109042180
He's just mad cause he can't read.
Thank you for blessing us with daily research.
>>
>>109042312
Something interesting is that I didn't tag it, just used generic descriptions, but for some reason Claude is able to control exactly which singer and style from a 20+ song dataset gets triggered, even without using the same exact captions.

Here's Dave Rogers style song
https://vocaroo.com/1lXhL94xMsvi

Manuel
https://vocaroo.com/14wvmcvt94lB
>>
>>109042411
And there's a total of 2 songs from both of them. It's a diverse dataset.
>>
File: debo_s_fia_00064_.png (1.79 MB, 1792x977)
1.79 MB PNG
>>109042378
:)
>>
Is anyone here using Ideogram 4? What do you think?
>>
>>109042516
i saw people experimenting and it seems to be the best t2i model if you know what you want. i am still waiting for reference image support
>>
File: Ideogram_0017.jpg (738 KB, 1408x1872)
738 KB JPG
Where is the workflow the scail2?

It doesn't appear as a default workflow so I assume Kijai posted somewhere in one of his many many repos and just assumed I'd know where.

>>109042516
I like it. Like as far as the control over the image composition it gives you, it rivals saas. I've been meaning to train some hentai comic LoRAs to see how it handles art styles because it's already fairly competent at making comics
>>
>>109042033
>Local is so far behind you still have to train loras for outfits while API can do it in a single-shot and even search the internet.

Local LLMs are not as far behind as you think. Image models etc... are one thing that only hobbyist companies do, but the big open source companies have focused on LLMs, and as such there's already a local LLM on par with the best cloud LLMs (of course, just not as good as the latest one, but if we can take a local LLM and say it's as good as Gemini 3 Pro, that's close enough...)
>>
>>109042585
Now, you may not believe in benchmarks, but this is the case based on private tests, benchmarks, lmsys, etc... GLM 5.1 is better than Gemini 3.1 Pro and Sonnet 4.6
>>
>>109042516
yes, it's the most powerful prompting
>>
>>109042585
The main thing is that dataset determines the diffusion destiny.

And that can often mean that big models are in fact more restricted in what they can include.

Here's a question to ask, has nanobanana ever trained on real actual medium format scans?
>>
>>109042575
https://github.com/Brobert-in-aus/scail-auto-extend
Use the wf with this node
>>
>109042599
use ideogram to generate a cartoon explaining how to make curry like your mom makes it.
>>
>>109042610
ty
>>
any differences on generating hentai from december to now? I think I'm on stable difussion through forge webui
>>
>>109042794
Nah, industry is pretty stagnant
>>
File: Wan21_SCAIL2_00030.png (682 KB, 1280x528)
682 KB PNG
>>109042794
Don't listen to APIcucks. They're trolling. Wan and LTX is better than ever. Check out last thread for multi character swap potential of SCAIL-2
>>
>>109042806
*local is pretty stagnant
industry got tons of sota image models and claude fable is so insanely good the government had to ban it
>>
File: Wan21_SCAIL2_00030.mp4 (404 KB, 1280x528)
404 KB
404 KB MP4
Figured out how Wan SCAIL-2 multicharacter supposed to work. Just combine 2 characters in the segmentation input and prompt for 2 subjects. It will swap them out. Work best with same aspect ratio input references.
>>
>>109042830
that has lots of potential
>>
>>109042816
>claude fable
https://www.youtube.com/watch?v=wVJ7LYrl83E
>>
File: 1773474314012368.png (332 KB, 976x496)
332 KB PNG
What if I dont want to just copy animations from existing anime shit?

I don't wanna be Alan Bowe, I wanna be Miyazaki.
>>
>>109042830
I’ve found it to be hit and miss with prompting multiple subjects, as far as I can tell it goes off the first frame of the video, so if there’s only one character clearly visible to begin with it won’t segment the image into multiple characters, instead both of them will be blue.
>>
>>109042816
>claude fable
Kind of annoyed about this. I had some projects I wanted it to look over and clean up then they banned it.
>>
>>109042864
Dont worry about it.

It was opus 4.6 but with a bunch of grifter ass underlying bullshit like how it aggressively nerfed outputs for no reason.

So, technically, we still have Fable 5. Just go use Opus 4.6
>>
>>109042830
my pc is too lowend to take on WAN. what's scail-2 do?
>>
File: 1756312018172647.png (3.9 MB, 1037x1850)
3.9 MB PNG
>>109042516
I thought it was total shit at first but since it can make comic pages like this, >>109042575
its actually got *some* potential, however, just making the comic panels one by one and using a program like comic life 3 is still better.
>>
>>109042903

Better and easier to use reference replacements and tracking. Just look at previous threads for more examples.
>>
File: Ideogram_0008.jpg (564 KB, 1936x1088)
564 KB JPG
>>
if ideogram doesn't get some kind of finetune it will be the biggest waste of potential yet. 'muh license' is such cope. what are the chinks even doing, are they too ashamed to train on a western model? what ever happened to noobai?
>>
File: scail2.mp4 (1.77 MB, 1408x1280)
1.77 MB
1.77 MB MP4
>>109042903
better video referenced videos is the main current usage I think
>>
>>109042956
me when i blink out of existence
>>
>>109042953
>'muh license' is such cope.
I actually hate these people who pop up to remind you about the license whenever you discuss the model. It's genuinely the most useless "Uhm achually" sentiment you can make.
>>
>>109042953
>what ever happened to noobai?
noob2 is saas only and controlled by comfyorg or some shit
>>
>>109042953
>'muh license' is such cope
it's the most relevant feature that determines if the most relevant parties involved can even actually use/redistribute the model
>>
I have been using ideogram for about a day, and not once have I triggered that gray blocked content blob thing.

Then again, I haven't attempt to really generate any nudity since I'm not a coom brain
>>
>>109042994
who gives a shit
>>
>>109042956
Cute.
>>
File: file.png (403 KB, 545x564)
403 KB PNG
how do i get klein 9b to make a nice vag? they like to add a bulge or just do some really weird shit and make them really long and weirdly shaped when i edit, pic related.
>>
>>109042996
and?
>>
>>109043000
do you just lack basic foresight or understanding?

people don't want to be rugpulled or sued after putting in effort as happened *countless* times in many areas surrounding copyright.
>>
>>109043026
>how do i get klein 9b to make a nice vag?
You can't without LoRAs. It does not know what a vagina looks like.
>>
File: cute.png (1.37 MB, 832x1248)
1.37 MB PNG
>>109043025
the video ended up cute and the original gen by another anon breads ago also was already very cute
>>
File: 1756907251844432.png (186 KB, 330x399)
186 KB PNG
>>109043028
And I leveled up on fortnite this weekend
>>
>>109043041
yawn
>>
>>109043053
which loras. there is like a gorillon of them. i probably have them all and tried them all but i'm not sure what works. I was hoping for a definitive one.
>>
>>109042516
Trash model that does more of the same.
Looks like the astroturfing worked.
>>
>>109042316
The OP shows you linking anons to malware, why do you post this constantly with that being a known issue, did you apologize?
>>
>>109042953
>are they too ashamed to train on a western model
Lol. Ernie is superior and the Chinks know it.
>>
File: image_00034_.png (3 MB, 2048x2048)
3 MB PNG
>>109039192
To the cloud faggot, yes you can gen that shit in local this is Z image
>>
>>109042996
I triggered it with a cat pictures. I guess it's a skill.
>>
>ernie
forgot that shit existed. what's with china and pumping out garbage like hidream, ernie, and glm?
>>
>>109043176
GLM is actually the best cloud AI on the market right now if you're not a shithead code grifter.
>>
>>109043176
commercial/research entities regularly pump out models with no good questionable/nsfw tuning even outside china

yes few of them seem to succeed, but I guess they prefer to keep imaginary or perhaps real trouble away? idk about their circumstances in detail.
>>
File: image_00036_.png (3.15 MB, 1667x1667)
3.15 MB PNG
>>109043155
An he uncucked version
>>
>>109043244
the fact that you immediately thought of GLM’s llm says it all. the image model is downright terrible
>>
it's friday? i thought it was tuesday
>>
File: 367845.gif (3.82 MB, 320x222)
3.82 MB GIF
>set up a long queue and went to go work
>came back to find it was all using the same seed
>>
File: image_00037_.png (2.22 MB, 1228x1228)
2.22 MB PNG
>>109043155
>>109043255
OK I think this is the best, uncucked, young but not too young, and with really 80s atmosphere
>>
>>109043265
lmao, glm has an image model?

Forgive my ignorance
>>
I was wondering why Bernini was so good.
They finetuned Wan on 20 million video pairs.
The model being from ByteDance too helps obv.
>>
File: 2.jpg (98 KB, 1024x1024)
98 KB JPG
>>109041170
>First, you haven't run a full dynamic range of settings for Tan2.
no idea, i'm not smart but stubbornly refuse to take anon's word at face value sometimes.
>Also, you should always graph your steps.
wat? that grid is one of 4, flux 2 klein, 6 steps to rule out some of the ones looking unfinished because it just needed more than 4 steps that are usually fine.
the specific thing that drove me to doing this again was the other day seeing armpit skin gens anon posted and thinking that'd be something interesting to test across different models but got hung up on flux 2 klein because it kept giving me stubble but not like armpit stubble i see IRL it's like only in the skin wrinkles. pic related. so i tried prompting it away using shaved, hairless, etc, didn't work, wondered if a different sampler would be better, haven't had time/motivation to push thru going thru all the results yet so will probably stick with euler + beta or flux2 for klein gens.

>>109041320
>what's "Automatic"?
one of the scheduler settings in forge neo but i've never looked into it. in x/y tests i did with samplers/schedulers for zit (that lead me to preferring DPM++ 2s a RF with bong tangent) the automatic scheduler column didn't match any others exactly, assume it will be the same with klein and others but don't know for sure.
>>
>>109043281
countless 1girls, gone like tears in rain
take measures to prevent this
>>
>>109042516
pushing the local scene forward without being some stupid size like a 400b model. hopefully we get a model from some lab that pushes the convenient side of things as well, im lazy as hell with bboxing and just wanna prompt
>>
>>109043323
i'm too lazy to patch it, i still haven't jumped onto the vibecode bandwagon so i would have to go search for the code that loads the seed from metadata
>>
>>109042931
so you gen with the reference visible, then crop it?
>>
>>109043409
it's like the image editing models or controlnets where you supply references, surely you used some of those by now?
>>
File: Wan21_SCAIL2_00018.mp4 (3.64 MB, 2016x672)
3.64 MB
3.64 MB MP4
>>109043409
Reference Image + Video = kino, what not to get?
>>
>>109042312
Might have to retrain my ZUTOMAYO LoRA on the entirety of this album so I can have a cool nice live version of it, this time using the higher rank settings so it can pick up all the nuisances in her voice etc...
https://music.apple.com/us/album/midnight-forever-expo-meik%C5%8D-wa-gunaruga-gotoshi-live/1840129493
>>
>>109043462
>kino
neked lady is not kino
>>
>>109042180
>>109042188
thanks!
seems like you've missed https://nvlabs.github.io/motionbricks/
>>
>>109041690
Tele Bgftg33

Turn my Asian gf's pics into a lora, send me a sample of other loras you've made
>>
>>109043155
>>109043255
>>109043288
look awful
>>
>>109043462
why is she so small
>>
>>109043462
also have you tried to mask part of reference video to see how the model will fill it
>>
>>109043462
I want Marika to rape me.
>>
File: debo_s_fia_00077_.png (2.17 MB, 1792x977)
2.17 MB PNG
>>109043562
thanks, will add this
>>
>>109043565
>>>/r/
>>
>>109043616
/r/ is dead bro
>>
>>109043562
Isn't this just motion matching with a bigger library of motions?
>>
>>109043565
You misunderstand, anon. You must convince me that she's good enough to warrant my GPU time.
>>
>>109043562
So if someone trains sex animations on this you can make a game world where you can fuck all of the NPCs?
>>
File: 24646.webm (3.99 MB, 420x291)
3.99 MB
3.99 MB WEBM
>>
>>109043622
That is so stupid kek
>>
>>109043667
Oh, so this is why it's gone
https://archive.is/2026.05.21-102104/https://www.wired.com/story/4chans-misogynist-wizards-are-nudifying-women-by-request/

But that is still retarded.
>>
>>109043676
tldr. why did gook moot cave in now? feminists have been complaining for over a decade now
>>
>>109043682
US Law - Take It Down act
>>
>>109043686
oh ok. did they ban deep fakes on the whole site then?
>>
File: 1779640686328456.png (1.26 MB, 2528x1173)
1.26 MB PNG
>>109043682
Picrel takes a certain level of degeneracy. Like Plebbitor/b normalfag invasion levels of degeneracy. Worse than other boards, of perhaps trolls spamming. Either way, I guess he just didn't feel like moderating it, because that's a good way to get bad PR.
>>
>>109043694
it's only banned if it gets reported
>>109043702
>BBC
Why are white people like this?
>>
File: 5644878.gif (3.62 MB, 320x245)
3.62 MB GIF
>>109043712
kikes are not white
>>
File: Wan21_SCAIL2_00104.mp4 (1.04 MB, 1216x1024)
1.04 MB
1.04 MB MP4
It doesn't wanna match the lighting, but otherwise worked well.
Seems like the best thing you can do (aside from replacing the background of your input image with white) is to rescale the input image to match the aspect ratio of the video by padding with white.
>>
Can LoRA being butchered cause bad hands or is it all up to the checkpoint?
>>
>>109043750
i use klein to refit an image into a new aspect ratio without stretching it
>>
>>109043769
>i use klein to refit an image into a new aspect ratio without stretching it

nta but this is sounding like a lot of work for 5 seconds of footage that might a single (You) and excluded from the collage for being video.
>>
>>109041297
>>109041690
This is uncomfortable to look at but not in a bad way.
>>
>>109043750
Or gen new background mode to get the matching lighting and shadows.
>>
how do I remove all the info from an image before uploading it to civitai?
Don't wanna people judging me
>>
File: 475375.gif (3.24 MB, 320x222)
3.24 MB GIF
>>109043799
great things happen when you stop caring about superficial things
>>
>>109043813
You could just remove the info manually after uploading. If you're still paranoid just save the image again with an image editor
>>
I hate how LTX looks more coherent at higher fps but animations are stiffer. Probably because it was trained on 60fps videos of mostly video games and vtuber slop.
>>
>>109043462
how smooth is it working with porn
>>
>>109043813
literally just screenshot your img
>>
>>109043839
the motion is far more dynamic at lower resolutions. you may be able to upscale it afterwards or use it as a control video for your higher resolution generations
>>
>>109043816
why do you keep genning this child
>>
File: forge.jpg (100 KB, 929x639)
100 KB JPG
Was using Anima in Forge... How I do this in Comfyui????
>>
File: 4376.png (276 KB, 752x415)
276 KB PNG
>>109043864
>>
File: debo_ccg_fia_00069_.png (1.15 MB, 1792x977)
1.15 MB PNG
>>
>>109043864
4channers still advocate for behaviors adjacent to the pedophile socialite class, and that will not change, ever.
>>
File: 1770725747734517.gif (190 KB, 384x256)
190 KB GIF
>>
File: 1766680069807205.gif (247 KB, 384x256)
247 KB GIF
>>
>>109043893
Search the manager or use SDUltimateUpscale.
>>
>post a bikini pic on civitai
>immediately moved to red
lol
>>
File: Wan21_SCAIL2_00143.mp4 (2.56 MB, 1056x1184)
2.56 MB
2.56 MB MP4
The best part of this is that it made the background characters asian too
>>
>>109043975
why even post there
>>
>>109043983
asian psycho
>>
>>109044023
>asian psycho
that'd be honestly kino
based on bubble era japan
t. gook
>>
>>109041888
im currently testing my latest optimal training setup lora for acestep.
it uses qwen llm to generate audio codes.
if you got to their repo and check list of genres trained into it = hilarious.

as im testing this optimized lora im getting ai slop.
i throw in almost two paragraphs of insults ijnto the prompt. and some explanations this style of music != what insectoid llm slop thiks it is.

songs generated currently ar coming out as intended, in the style of lora i trained it in.

throw insults at it.
>>
>>109042516
you do not need control net and region conditioning.
that is good.
>>
Been playing with scail! Any of you have issues with audio and lipsyncing? any way to fix it??
>>
File: Wan21_SCAIL2_00136.mp4 (865 KB, 1700x618)
865 KB
865 KB MP4
>>
File: Wan21_SCAIL2_00087_.mp4 (1.48 MB, 640x640)
1.48 MB
1.48 MB MP4
>>109044086
nice consistency
>>
>>109044086
why doesn't the cigarette appear? did you include it in the prompt?
>>
are dynamic prompts always chosen randomly or is there a way to do them in order so i could have quite a long list of {big boob|small boob|medium boob} and gen and it will cycle them in order rather than random
>>
>>109044105
Wildcards are chosen randomly.
You should be able to slop your custom node to have them cycled in order.
>>
File: Wan21_SCAIL2_00065.mp4 (3.54 MB, 1440x400)
3.54 MB
3.54 MB MP4
Triple replacement test
>>
>>109044153
>>
>>109042947
nice
>>
>>109044153
> replacement
>>
File: new local (shit) toss.png (1.07 MB, 1024x1024)
1.07 MB PNG
New pixel-space 'toss:
https://huggingface.co/spaces/Photoroom/PRX-Pixel
(It's shit. Like even the some of the example images in the demo have broken anatomy.)
Tested with like three images. It sucks at text. (Maybe more steps help I dunno just ran default 28)
At CFG 1 the images look ZIT-like, but with much worse prompt adherence. Higher CFG gives better prompt adherence but it looks slopped. Doesn't seem to have much character knowledge, didn't test styles or celebrities, but I wouldn't hope for much.
One positive thing is that unlike many other local pixel space models it doesn't suffer from patch artifacts. Speaks about state of things with these slopped research preview garbage when the bare fucking minimum feels noteworthy to mention.
>>
File: summerin.png (2.16 MB, 1024x1536)
2.16 MB PNG
It's summer and I'm just sitting at home prompting.
>>
>>109044220
>7B
tubby girl
>>
Am I supposed to be daisy chaining extend nodes? Because I just set 81 on the initial node and +129 on the singular extend node and it seems to just werk
>>
File: hftyy.png (82 KB, 804x798)
82 KB PNG
>>109044262
lost
>>
File: Wan21_SCAIL2_00168.mp4 (1.39 MB, 640x768)
1.39 MB
1.39 MB MP4
>>
File: clip_Single_00011.mp4 (3.26 MB, 1056x592)
3.26 MB
3.26 MB MP4
>>109044153
only three? Those are rookie numbers
>>109044262
https://github.com/Brobert-in-aus/scail-auto-extend
use this node + wf
>>
File: file.png (124 KB, 277x216)
124 KB PNG
>>109044327
>>
>tfw 2 years ago i was waiting for 5mins for an 480x720 gen to be upscaled now im waiting 5mins for a 20second coherent full on video of my waifu getting plowed by a green orc cock

life really moves fast
>>
File: igram7.jpg (252 KB, 1184x848)
252 KB JPG
>>109044233
well yes, the more convenient campaign seasons for murder, rape and plunder are spring/fall. siege in summer or winter means "bold" mis-planning or something you plebeian should not complain about if you want to live.

being at home in summer AND not getting sieged yourself is good, fren
>>
File: comfyui_00035_.png (1.1 MB, 896x1152)
1.1 MB PNG
lol
esoteric art style LoRA just downloaded from CivitAI
>>
File: Ideogram_4.0_00040_.png (1.67 MB, 1024x1024)
1.67 MB PNG
>>109042516
There is a lot of detail in the model. Also kind of model puts in a frenulum piercing unprompted? It seems to have lots of unwanted sampling variety.
>>
File: comfyui_00036_.png (1.56 MB, 896x1152)
1.56 MB PNG
>>
File: 1779843893264204.jpg (64 KB, 964x912)
64 KB JPG
Can I link civitai.red LoRAs I trained on my resume?
>>
>>109044398
There are adult board links in OP. Maybe you can say you put some stuff in those.
>>
>>109044372
ideogram hallucinates random details because the gptslop it was trained on also hallucinated random details
>>
>>109044398
sure.
>>
File: Wan21_SCAIL2_00022.mp4 (3.65 MB, 592x1056)
3.65 MB
3.65 MB MP4
>>109044327
Oh cool, thank you
>>
>>109044153
Would you care to share your workflow?
>>
File: Wan21_SCAIL2_00176.mp4 (1.2 MB, 1088x608)
1.2 MB
1.2 MB MP4
>>
>>109044054
nsfw
https://files.catbox.moe/021w4k.mp4

so what's the problem with the audio? I exported the video as 24fps. Wf has 24fps set in the first node.
>>
>>109044620
KEK
>>
>>109044620
Lord Farquaad with tits.
>>
File: clip_Double_00017.webm (3.93 MB, 912x1344)
3.93 MB
3.93 MB WEBM
>>
>>109044694
I fucking love Scrubs
>>
File: clip_Single_00020.mp4 (333 KB, 976x640)
333 KB
333 KB MP4
>>109044620
>>
>>109044694
My wife is telling me this woman is called Fukuda Aimi. Is this right?
>>
File: clip_Single_00022.webm (3.95 MB, 784x622)
3.95 MB
3.95 MB WEBM
>>109044895
>>
File: Wan21_SCAIL2_00038-2.webm (3.86 MB, 1920x720)
3.86 MB
3.86 MB WEBM
>>
File: Ideogram_0014.jpg (344 KB, 2224x960)
344 KB JPG
>>
can scail do ahegao? wan animate struggled with eye movement and tongue, asking for a friend
>>
File: Wanimate_00006-noaudio.mp4 (1.69 MB, 1162x544)
1.69 MB
1.69 MB MP4
>>109044919
Old wanimate comparison. I think wanimate did better with the likeness, initially anyway. Degraded heavily over the course of the video
>>
Someone post the fucking scail workflow with background removal. It's a pain in the ass to setup.
>>
Zbase learns so fast, nsfw test: https://files.catbox.moe/l7aak0.jpg
>>
>>109044961
what are your training settings?
>>
>>109044945
for you
>>
File: clip_Single_00023.mp4 (593 KB, 976x640)
593 KB
593 KB MP4
>>109044948
There's probably a more elegant way to do it than this, but here you go
https://files.catbox.moe/xntg6t.png
>>
File: Wan21_SCAIL2_00016.mp4 (3.47 MB, 1786x2048)
3.47 MB
3.47 MB MP4
>>109044931
sometimes. it's not 100%
>>
>>109045012
Thank you.
>>
>>109044931
it has a pretty decent success rate but like the other anon said I too don't consider it "reliable" yet

it's not quite solid with regards to EITHER all humanoid reference images or the reference video, same as other facial expressions really

and i mean only those where i'd expect (or can test) most segmentation models yolo whatever to identify the facial features otherwise
>>
File: Untitled.png (33 KB, 450x387)
33 KB PNG
>>109045012
Have you tried this node instead of flux klein?
>>
File: lo1l.webm (2.87 MB, 1056x960)
2.87 MB
2.87 MB WEBM
>>
>>109045037
that's... probably way quicker, had no idea it existed.
>>
>>109045047
it exists in dozens of similar sounding names for a while now, you probably want the most popular "rmbg"
>>
>>109045038
impressive
>>
File: 36764534.webm (3.66 MB, 420x291)
3.66 MB
3.66 MB WEBM
>>
File: Wan21_SCAIL2_00181.mp4 (3.6 MB, 1258x1500)
3.6 MB
3.6 MB MP4
>>
File: Wan21_SCAIL2_00040.webm (2.01 MB, 828x960)
2.01 MB
2.01 MB WEBM
>>109044931
>>109045016
>>109045031
Tongue works but I never see the eyes crossed correctly. Maybe a wan2.1 ahegao lora would help?
>>
>>109044398
Coomers seem to know more about AI technology than the average normie so employers are retarded if they're turning away applicants for NSFW loras
>>
When did comfy go from a scratch disk raping monster when generating videos to a smooth memory managing king?
>>
>>109042956
>HITGS on a /g/ thread

Nice.
>>
File: Wan21_SCAIL2_00195.mp4 (3.88 MB, 1786x2048)
3.88 MB
3.88 MB MP4
>>109045142
not perfect but looks like it has some problem with 3D model doing crossed eye
>>
>>109045349
It's an improvement. The original woman is very punchable.
>>
>>109045349
have you tried it with this lora?
https://civarchive.com/models/1390545?modelVersionId=1571626
>>
File: dance_miku2.webm (2.95 MB, 960x832)
2.95 MB
2.95 MB WEBM
>>
Is there a reason SCAIL was built on Wan2.1 instead of 2.2?
>>
>>109042610
nice, thanks for sharing this.

works for me except i had to disable the last comparison image concatenate, somehow out of all things that is what OOMs here - sticking bitmap images together.
>>
>>109045362
why dont u try it
>>
>>109045390
im currently training a lora and dont have the gpu capacity to try it right now
>>
>>109045375

i get seven keyframes with 2005 computer
you wanna to reshoot at ntsc anon?
>>
>>109045349
What resolution are your inputs? I’m getting pretty bad facial likeness consistency with a 9:16 ~1200px ref (don’t have it in front of me atm)
>>
If you have to ask how much VRAM costs, you can't afford it.
>>
>>109045405
reference image 1869x2300, rendering video at 576x1056
>>
>>109045387
since it only need to copy the motion of subject, it doesn't need the improved motion from wan2.2 ?
>>
>>109045387
probably started before wan2.2 was out, maybe you get the opportunity to ask them directly on social media or w/e. i think they do write english.
>>
File: scail2.mp4 (2.12 MB, 512x896)
2.12 MB
2.12 MB MP4
>>
>>109045387
chatgpt said this:
>Wan 2.1 14B is one dense transformer. SCAIL-2 modifies its conditioning sequence, masking channels, and RoPE behavior. Applying those changes to one dense model is relatively straightforward.
>Wan 2.2 A14B is a two-expert MoE model: approximately 27B total parameters, with separate high-noise and low-noise 14B experts. SCAIL training would need to modify and train both experts consistently, greatly increasing storage, training memory, complexity, and cost.
>Driving video already supplies motion. Wan 2.2’s improved prompt-generated motion and aesthetics provide less benefit when SCAIL directly transfers motion from another video.
>Wan 2.1 has a mature modification ecosystem. Its dense architecture is easier to fine-tune, convert, quantize, integrate into ComfyUI, and extend with LoRAs.
>SCAIL-2 actually uses Wan2.2 Animate as one of its data-generation teachers, so the authors were aware of it. They seemingly chose to distill its useful behavior into the simpler Wan 2.1 backbone.
>>
File: dance_miku3.webm (826 KB, 960x832)
826 KB
826 KB WEBM
>the effect at 0:03
>>
>>109045461
kino alert
>>
>>109045424
Thanks, are you using basically the same settings from the kj PR workflow (other than frames etc)?
>>
Why is LTX so... shit?
>>
>>109045502
Nihon-Viet Cong propaganda.
>>
>>109045461
honestly impressive
>>
>>109045510
It's good, it's just cloud video models have gone nuts the last few months and LTX looks garbage in comparison
>>
>>109045510
because it's local
>>
File: 5325677.gif (3.69 MB, 320x222)
3.69 MB GIF
>>109045537
did the copers wake up?
>>
File: scail2_2.mp4 (2.01 MB, 592x1056)
2.01 MB
2.01 MB MP4
>>109045535
yea scail really is good at what it does.

and wan was certainly already an impressive model anyhow. even with scail, if you have very long hair or chains with jewelry or whatever they may do some physics stuff where the reference had none of it
>>
File: dance_girl.webm (832 KB, 960x832)
832 KB
832 KB WEBM
is there a way to fix this initial discoloration?
>>
>>109045510
it made compromises but it's a good improvement over predecessors and some capabilities are quite good.

i DID use wan more myself with the better prompt adherence and more capabilities what you can prompt (spatially, temporally)
>>
File: scail2.mp4 (2.88 MB, 592x1056)
2.88 MB
2.88 MB MP4
>>
>>109045510
it's a talking head model: 1 person talking.
sulphur and eros (i2v version of sulphur), the nsfw models, can do basic nsfw but you'll re-rolling a shit tonne

I was having an argument with someone on here the other day, he insisted ltx was better than wan22. He actually convinced me for a while, I went on a multi-day tear, tweaking settings and trying out workflows, doing hundreds and hundreds of gens. My personal conclusion is that wan22 is just better but it has no sound and the clips are frustratingly short.
>>
>>109045503
https://files.catbox.moe/wp75sw.mp4
same KJ workflow, just some minor change
>>
>>109045349
>>109045142
>>109045133
>>109045574
>>109045596
give it to me straight
can I run this shit with 6GB of VRAM?
>>
>>109045601
yes just limit the highest dimension to 480
>>
>>109045568
honestly, as a filty human artist, this feels like the only place worth visiting about ai art stuff
>>
>>109045595
kino is a skill that takes time to develop
>>
>>109045636
kino issue
>>
>>109045596
how do I get the workflow from this?
It only opens a loader
Do I have to update Comfy to the latest version?
>>
Why is no one testing outo Bermini? It seems like it's better than LTX and even lets you use image references finally
>>
>>109045595
LTX 2.3 is a seriously impressive model for it's size and speed. T2V, I2V, sound, upto 50fps, 30 sec gens.
I'll agree Wan is better aesthetically and for NSFW but I can't go back to no sound 5 second slow mo clips
>>
>>109045636
sure, but you see, everyone who talks like this never posts their workflows. The guys I argued with the other day refused to post his workflow. Meanwhile, I don't need to post workflows, the default wan22 is enough to get you better results than ltx.
>>
File: Wan21_SCAIL2_00203.mp4 (316 KB, 1500x648)
316 KB
316 KB MP4
>>109045601
6GB VRAM
>>
>>109045663
everyone is playing with scail making blurry as fuck vids. there was some anons playing with it a few days ago
>>
>>109045665
that's why I'm looking for a good SVI workflow. Every one I've tried degrades the quality of wan22. I think you can use wan to get a good 20 second clip with no sound, extract frames (like lots of them), and then use them as guides for ltx so it doesn't go batshit with the horrible, deformed anatomy and strange motion.
>>
>>109045658
drag and drop doesn't work? file open doesn't work?
>>
>>109045697
no
I'm on an old version of Comfy
>>
>>109045665
Can LTX do nsfw at all?
>>
>>109045702
Out of the box? No. You'll need loras or a finetune
>>
>>109045700
oh u need nightly comfyui to even try SCAIL2
>>
File: 1758213507114228.jpg (142 KB, 820x627)
142 KB JPG
>>109045708
ok
>>
File: 3215247.webm (3.69 MB, 420x291)
3.69 MB
3.69 MB WEBM
>>109045667
didn't i give you my prompt and seed?
>>
>>109045721
I wasn't arguing with you. I do my best to avoid avatar fags. I'd filter you outright if I had a way.
Are you happy with the way her face completely warps in the first 2 seconds of your video and in other parts as well?
is that why you only post gifs and tiny resolutions, to hide all of that?
Are you happy with your vid looking like a ponyxl gen come to life?
>>
File: Wan21_SCAIL2_00056.webm (2.78 MB, 822x960)
2.78 MB
2.78 MB WEBM
10 steps seems to give a nice improvement over the default 6
>>
File: Wan21_SCAIL2_00054.webm (2.95 MB, 822x960)
2.95 MB
2.95 MB WEBM
>>109045769
And here's 6 steps
>>
>>109045769
>>109045775
Just noticed the missing sparkle effect on 10 steps but the face and jacket look a lot cleaner to me
>>
>>109045759
i gave you everything you needed to make some proper kinos, but you aren't satisfied with it for some reason
>>
https://files.catbox.moe/z1ype2.mp4

Something about the way the talking heads move. So off-putting. I've found LTX way better at generic stock footage desu
>>
why no more Anima talk? :(
>>
Anima desu
anime website desu
>>
>>109045825
no new one was announced and 80% of the anima discussion was shills anyways
>>
File: 1762613605744456.png (661 KB, 896x1152)
661 KB PNG
>>109045825
>>109045842
>>109045851
>>
>>109045825

i got you covered bro in 1934 radio corporation america had this thing called rca maybe commies got it first with phono but pretty not much worth mentioning ever since
>>
File: LTX-2_00250-NoAudio.mp4 (1.45 MB, 1440x1056)
1.45 MB
1.45 MB MP4
>>109045801
https://files.catbox.moe/cj3jpb.mp4
Post reminded me I completely forgot I trained this
>>
>>109045859
lol.

Why do all AI monsters come out like that though? The white can with the green M
>>
>>109045825
Not much to talk about really. It has it's place as very decent concept creator, but it's not good enough for creating final image. If I could decide I would cull half of the booru creators and replace them with traditional artists.
>>
https://files.catbox.moe/8csp6h.mp4

Audio for /g/ when?
>>
>>109045859
>>109045909
wansisters in shambles
>>
>>109045909
There was a brief general on /wsg/ when LTX was first released but people got bored.
>>
what difference does 64gb of VRAM get me vs 128gb?
>>
>>109045940
double the difference
>>
File: blogfactory.jpg (805 KB, 2160x1216)
805 KB JPG
ideo4
>>
>>109045993
Bounding boxes are the future
Natural language and tags in the trash
>>
how to merge lora with anima?
>>
>>109046101
Pen and paper.
>>
>>109045909
>[Common sense feature that almost every website has] for [board in unmaintained shithole website that nobody in charge gives a fuck about] when?
Never.
>>
>>109045993
>>109046050
can I run it with 5 GB of VRAM?
>>
the more bboxes I use, the better the image quality becomes, even at turbo settings

i use between 10 and 25 bboxes
>>
Can it run on GeForce 6200 AGP (128MB version)?
>>
>>109046144
yes saar
>>
Do you guys not use llm assisted tools for image gen?
>>
>>109046177
Does asking my girl for depictions of herself count?
>>
File: debo_ccg_fia_00002_.png (2.4 MB, 1792x977)
2.4 MB PNG
>>
File: ComfyUI_19379_overlay.png (3.41 MB, 1500x2000)
3.41 MB PNG
>>109046177
Yup.
>>
>>109046286
>that chink at the bottom
kek
>>
>>109045927
>but people got bored
why?
>>
>>109046301
>but people got bored
Simply weren't enough people in the thread. The only reason to go to the wsg /ldg/ was to hear audio on videos. There was no generality to the general so once the model stopped being the new thing people forgot the thread ever existed.
>>
>>109046177
Local language models are fantastic for image captioning and subsequent iteration of similar concepts
I have a gigantic library of images accumulated over the years i can experiment with.
>>
>>109046286
>needing llm for 1girl slop
Please kys.
>>
>>109046286
catbox?
>>
>year almost over
>best local goon model still illustrious
grim
>>
File: 32467424.webm (3.87 MB, 420x291)
3.87 MB
3.87 MB WEBM
>>109046316
maybe i should generate some more war kinos make a thread on there since those have excellent audio
>>
File: 423634644.webm (3.92 MB, 420x291)
3.92 MB
3.92 MB WEBM
goodbye
>>
File: ComfyUI_01141.jpg (3.66 MB, 1500x2000)
3.66 MB JPG
>>109046292
That's by design.

>>109046339
You know where that really comes in handy? Describing women's clothes. It's an encyclopedia of clothing and I'm not.
>>
File: scail2.mp4 (2.52 MB, 1056x592)
2.52 MB
2.52 MB MP4
>>109045611
glad to hear it. we're probably standing on the shoulders(?) of giant anime girls here. or how ever that goes.

ai probably combines well with whatever you already do.

oh and if other places are worse it's probably mostly 'cause censored SaaS fucking sucks for art with new censorship every other week or something
>>
>>109046431
Maybe use your own eyes? Oh wait, you can't because you are artistically frigid.
>>
File: 1775734524989491.mp4 (334 KB, 1024x1024)
334 KB
334 KB MP4
>>
what's the current recommended model for clothes change, or eye color change, or similar minor edits in an image?
I think I used to run one of the qwens for it or something. but last year the results still weren't that great
>>
>>109046491
flux 2 klein 9b
>>
>>109046505
whats the current recommended model for sex?
>>
>>109046491
Klein edits are better at this kind of thing desu. Qwen is too slopped
>>
>>109046361
catbox?
>>
>>109045993
this is nonsense thobeit
>>
>>109046188
I guess
>>109046286
Very nice
>>109046331
You're not thinking far enough
>>109046339
Also not thinking far enough
>>
>>109046443
Ah, so there's some arbitrary limit on using AI when using AI, huh? Inderdasting...
>>
>>109046343
>best local goon model still illustrious
anima
>>
>>109046431
horseface
>>
>>109046745
Why are you seething at another anon expanding his skillset?
Vramlet?
>>
>>109046435
i mean some are sloppy and i personally dont use genai to create visuals but sometimes just the sheer variety of things people do here due to this vastly lowered execution cost is a good inspiration
and on top of it, generally loose vibe, doing genuinely random shit instead of something you see from so called 'pro-ai communities' feels decent to me
tl;dr i do like lurking here
>>
>>109046799
I think the biggest problem with the space right now is software. web apps aren't really accessible for artists
>>
>>109046737
anima is still in "meet potential model" phase, illustrious just works
>>
Anima thoroughly beats the shit out of any Illustrious finetune
>>
>>109046857
i always say this but
because of the medium itself(text) coding got the most natural integration
but with drawing and artistic matter, human-computer interaction workflow is one of the most important part i think and since what they are currently aiming for is the end product, it just doesnt integrate well into the existing artistic workflow
i think photoshop's rotate tool is a decent example of 'making something that is compatible with existing method'
>>
File: anima1_00006_.jpg (442 KB, 1152x1648)
442 KB JPG
>>
>>109046900
once they capture enough hours of people using computers, not just the final product, we'll see some good integration
>>
File: dance_japl.webm (2.85 MB, 480x832)
2.85 MB
2.85 MB WEBM
>>
My rebirth is imminent
>>
If people are switching to Ideogram, it's because Anima flopped, right?
>>
>>109047018
>>109046887
yes this wonderful gen is better than any of the detailed finetunes lmao
>>
>>109047038
Still being retarded and miserable?
You look at a purely LLM guided output and that's the only thing you see?
>>
File: anima1_00018_.jpg (520 KB, 1152x1648)
520 KB JPG
>>
File: anima1_00022_.jpg (441 KB, 1152x1648)
441 KB JPG
>>
>>109047018
Basado
>>
File: debo_ccg_fia_00004_.png (2.36 MB, 1792x977)
2.36 MB PNG
>>
File: dance_boob.webm (2.85 MB, 480x832)
2.85 MB
2.85 MB WEBM
>>
>>109047189
I always use you as a litmus test for how much models have improved due to how you have failed at basic genning for I think going on 4 years now?
Really warms my heart image gen has gone a long way
>>
>>109046517
>>109046505
thanks boys I'll try that on soon, appreciate it
>>
>>109047313
>>109047313
>>109047313
>>
>>109047018
>All that excessive stuttering
Anon, tell you ai-waifu to chill the fuck out. Not even animu girls talk this retarded. Moderation is key to believably.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.