[a / b / c / d / e / f / g / gif / h / hr / k / m / o / p / r / s / t / u / v / vg / vm / vmg / vr / vrpg / vst / w / wg] [i / ic] [r9k / s4s / vip] [cm / hm / lgbt / y] [3 / aco / adv / an / bant / biz / cgl / ck / co / diy / fa / fit / gd / hc / his / int / jp / lit / mlp / mu / n / news / out / po / pol / pw / qst / sci / soc / sp / tg / toy / trv / tv / vp / vt / wsg / wsr / x / xs] [Settings] [Search] [Mobile] [Home]
Board
Settings Mobile Home
/g/ - Technology


Thread archived.
You cannot reply anymore.


[Advertise on 4chan]


File: WanVideo2_1_T2V_00211.mp4 (1.63 MB, 1872x1088)
1.63 MB
1.63 MB MP4
Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106723624

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: 00183-3281631945.png (2.1 MB, 1248x1824)
2.1 MB
2.1 MB PNG
GIDDYUP COWBOY YYYYYYEEEUUUUP

(which bakers let the thread die? confess your sins.)
>>
File: 1731076145011227.png (886 KB, 1024x1024)
886 KB
886 KB PNG
the anime girl is using the pose of image2. keep her appearance the same. the background is white.

aio aux preprocessor to get an openpose model then just use it as a source, cause the model understands openpose/depth/etc.
>>
File: 00027-852257677.png (2.04 MB, 1344x1728)
2.04 MB
2.04 MB PNG
>>
>>106727127
model?
>>
File: dmmg_0024.png (1.27 MB, 832x1216)
1.27 MB
1.27 MB PNG
>prompt for american woman
>get this

what did flux mean by that
>>
Blessed thread of frenship
>>
>>106727372
it means BUTTCHIN
>>
File: 00220-699741467.png (2.02 MB, 1248x1824)
2.02 MB
2.02 MB PNG
>>106727347
https://civitai.com/models/913998/noobaicyberfix?modelVersionId=1122850

>>106727388
i like the cut of your jib
>>
File: ComfyUI_temp_sqyup_00001_.png (1.87 MB, 1680x1120)
1.87 MB
1.87 MB PNG
What the fuck did my tile upscale do??
>>
>gm
>>
>>106721558
>>106721465
mode/prompt/catbox?
>>
File: Video_00006.mp4 (1.88 MB, 720x920)
1.88 MB
1.88 MB MP4
>>106727394
no need to be rude
>>
File: 1744914778434427.png (1.3 MB, 824x1256)
1.3 MB
1.3 MB PNG
the anime girl is on multiple huge billboards wrapping around tall buildings in Akihabara, Japan.

neat
>>
>>106727127
>>106727395
>sept 24th
>train TDI style
>post outputs and lora catbox link to 4ch
>few days later
>someone posts TDI style to civ https://civitai.com/images/102646224 https://civitai.com/models/1988844?modelVersionId=2251318
i know its just a coincidence because that style is so well known but still... i feel ill at ease...
>>
File: 00180-1669126009.png (2.17 MB, 1248x1824)
2.17 MB
2.17 MB PNG
>>106727468
were you the guy who made the miku OP for lmg? you're my direct inspo for the stocking gens. crazy coincidences.
i knew i was autistic, but i must be like ultra endgame giga autistic for popping a boner to clothed TDI gens kek
>>
With 16GB VRAM, what quant should I use for Wan 2.2?
I'm thinking of starting at q4 then go up but if anyone already has it running I'm open to suggestions. I suppose fp8 is out of range.
>>
>>106727537
q8 and swap to ram if you have 64GB
>>
>>106727548
32GB only unfortunately
>>
>>106727557
coming from a 16gb vram user, get 64 gigs if you have an inexpensive kit, thatll reaaaalllllyyy help so you don't swap to your nvme.
>>
Just returned from the getting started guide. I'm curious about starting a self hosted project.


I just want a self hosted AI box where I can generate funny ideas and concepts, given prompts from my phone. How censored are these models? I'm not making porn, just curious how they are weighed?
>>
are there particular FPS rates and prompts you can use to make wan animate anime really authentically? looking at those weird 16+fps animated clips is disorienting. like how i imagine the hobbit movies at 30fps but worse.
>>
>>106727537
Lately, anything below Q5~Q6 is pure shit, so you are better off going with bigger quants if you can.
>>
>>106727577
depends on the model, usually the user finetuned models are less censored than the commercial base models

try chroma/noob/illustrious for example
>>
>>106727537
get 64GB of ram anon, it's not that expensive and you can play around fp8/q8
>>
>>106727565
I might get an additional 32GB but I didn't want to fill 4 RAM channels
>>
>>106727619
>I didn't want to fill 4 RAM channels
wut duh, you literally want to do that though
>>
>>106727619
in ddr4 it should be fine
ddr5 is probably be a bad idea
>>
I watched the gamers nexus Nvidia investigation, shops in China can make a 48GB 4090.

meanwhile Nvidia insists on being stingy with VRAM so they can sell their 100k AI gpus.
>>
File: 1752209007290036.png (584 KB, 768x1360)
584 KB
584 KB PNG
>>106727507
yeah kek glad i could inspire
>but i must be like ultra endgame giga autistic for popping a boner to clothed TDI gens kek
nah thats normal, the proportions are insane
>>
>>106727678
Start soldering bro
>>
File: 190976871.png (1.3 MB, 832x1216)
1.3 MB
1.3 MB PNG
>>
File: ComfyUI_VFI_00010_.mp4 (504 KB, 720x1280)
504 KB
504 KB MP4
why does wan2.2 love slo-mo so much
>>
File: 00325-341218390.png (2.02 MB, 1824x1248)
2.02 MB
2.02 MB PNG
>>106727710
so true

>>106727678
that documentary is so good. i appreciate they just let people talk and be themselves too, the guy casually smoking in that shop full of computer parts and shit, nicely offering them tea, pure kino.
>>
File: 1736582519718443.png (687 KB, 768x1360)
687 KB
687 KB PNG
>>
>>106727577
Open local models are the most uncensored in terms of NSFW/celebs/pol stuff/etc
>>
Qwen 2509 is a bit shitty isn't it?
>>
>>106727579
no since anime is animated at different frame rates per animation layer
>>
>>106727655
Works fine on my ddr5 system, but I had to sacrifice XMP and latency, which sucks but not too much.
>>
File: 4195682229.png (1.32 MB, 768x1344)
1.32 MB
1.32 MB PNG
>>
File: ComfyUI_temp_eibkv_00004_.png (1.88 MB, 1152x1728)
1.88 MB
1.88 MB PNG
>>
>>106727448
It's clearly Chroma.
>>
>>106727468
You can check the hash of the file, might wanna call him out if he's stealing your shit.
>>
>>106727925
if you have no idea it's okay to remain silent
>>
File: 00376-1892681263.png (2.14 MB, 1824x1248)
2.14 MB
2.14 MB PNG
>>106727710
oh that reminds me if you wanna link the catbox ill use your model instead of the civitai fella because >>106727996
if he stole your shit then i'd prefer to delete the stolen lora.
>>
File: 00373-4114764627.png (2.15 MB, 1824x1248)
2.15 MB
2.15 MB PNG
>>106728012
lmao i posted the failgen my bad, its a good one though. never seen a tongue inside a hand before.
>>
https://github.com/EnragedAntelope/Flux-ChromaLoraConversion
Anyone managed to get this shit working?
>>
>nyooo he stole the lora i made with assets that someone else made nyyooooooooooooo
>>
>>106728007
I will not after years of working with the animation industry and own production animation cels. TGS was cool this year btw but I didn't get a cool ananta bag
>>
>>106727925
thank you for the response and >>106728007 that was not me

>>106728038
coool
>>
>>106727996
>>106728012
nah, looks like he trained it on base il while mine is noobvp. hashes are obviously different
the catbox is >>106702561 though if you really want. i only used ~40 imgs tho from r34 so its kinda meh
>>106728033
i just thought it was a funny coincidence isall
>>
>>106727814
it's easier to handle slow movement so they trained it on slow movement to hide flaws
>>
>>106728038
Ani, I love you.
>>
>>106728055
thanks. i mean yours looks pretty legit. what tags did you train it on by the way? forge is shit and doesnt show me.
>>
>>106728111
love u 2 nonie
>>
>>106728154
I would post here but they are too hostile.
It is because of you and others that who introduced me to satanism.
I don't care about image gen that much, llms are more interesting because they are more interactive and and I can affect them - writing python shit etc.
>>
>>106728126
https://files.catbox.moe/bjd6rg.txt
https://files.catbox.moe/9hsvvw.txt
are some captions from the set.
 total drama island, digital art, cartoon style, flat shading, stylized, simplified shapes, animated style 
should lock it in. with base vpred you should probably also do something like https://pastebin.com/S8tGwv2c in the negatives as well
>>
>>106728154
I'm learning slowly. Python is the biggest issue - it's the biggest pile of shit known to man.
Tuple vs array?
Doesn't make any sense.
>>
>>106727402
you denoised too much
tile upscale sucks anyway
>>
>>106728195
>v49
OOF
>>
File: PLX-EXPRESSH_01.png (1.38 MB, 1024x1024)
1.38 MB
1.38 MB PNG
>>106727372
Add keyword Ozempic
>>
>>106728169
>llms are more interesting because they are more interactive
well, the good news is a lot of companies are interested in switching over to ggml so adding llama.cpp is on the list for me. they great python upheaval is nigh but c++ not as hard as you think, if I can do it any retard can. I really want you guys to give game dev a shot with llm and/or diffusion integration a shot. even if it is something simple like a vn. I think that is the future and it's something that has the potential to revive the waning interest.
>>
>>106728154
>anistudio
GET OUT OF HERE! FUCK OFF YOU DONT HAVE FRIENDS;NOBODY LIKES YOU; GET OUT!
>>
>>106728243
>if I can do it any retard can. I really want you guys to give game dev a shot with llm and/or diffusion integration
based and filled with determination
>>
>>106728243
I really don't care about gaymes I make 1girl big bob
>>
>>106728243
KILL YOURSELF! I HATE YOU GET OUT GET OOOOOOUTTT
>>
>>106728243
I can't even vibecode a custom node so games are out
>>
>>106728268
yes but what about flirting with her and she has a personality and a need to garble your cock!
>>
>>106728243
That's way out of my circle. Even back 20 years ago when Nokia was a thing, person asked me "can I do cpp" I said no I can't I only know rudimentary C. I'm not a developer.
>>
>>106728243
talentless pedo
>>
>>106728272
you can spaghetti so it's enough
>>
>>106728243
Your UI doesn't work, why you are here?
>>
Daily reminder to use uv instead of pip, it's 100x faster

uv venv --python 3.13
uv python pin 3.13
uv pip install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu128
uv pip install -r requirements.txt
uv run main.py
>>
>>106728243
You need to have the autistic mind set.
>>
>>106728243
Fuck off fagot
>>
>>106728295
uv is the only thing that keeps me from killing myself
>>
File: 00400-2987547018.png (1.79 MB, 1824x1248)
1.79 MB
1.79 MB PNG
>>106728177
coool, good stuff. your lora's solid.
>>106728243
i honestly had no idea i was interacting with you until someone called you out, i really don't see the hype in reeing about cause ya seem alright in my books.

>>106728283
>>106728289
>>106728290
>>106728302
at least space your posts by a minute bud
>>
>>106728243
STOP SPAMMING THE GENERAL
>>
>>106728026
it only works on flux loras that already were compatible with chroma. it's also ancient, flux loras are no longer really compatible with base and hd chroma
>>
>>106728221
49 is ok though. It's way better at certain prompts than 48 or base.

chroma is such a mess that even way older versions give better results on a prompt+seed.
>>
>>106728279
open source is also about learning and teaching people how anon. I want it to be fun and for everyone to be better at it and maybe make a living creating things you love. open source isn't about pandering to corpos or looking for a free ride. if you take on the hardships of wanting to improve, the world is your oyster. THAT is the FOSS I believe in
>>
>>106728243
No one cares
You will never get revenge on comfy
>>
>>106728275
I have a real life hole for that
>>
>>106728313
>at least space your posts by a minute bud
singular schizo anon theory proven wrong yet again
>inb4 trani cope
>>
4step > 8step qwen light lora
>>
>>106728332
I have never been prouder of being some loser on a Mongolian basket weaving forum. you really are something else ani
>>
>>106727940
>black
>afro
Can you put her in bellbottoms and general 70s attire I love NigLi
>>
>>106728313
ty anon! I enjoy your spicy stocking gens! very hot!
>>
>>106728332
I guess so but I'm not a logical person. I need to regiment that thinking, even after all these years working at x company. Etc. Thank you for replying.
>>
>julien
>>
>>106728359
I work in a business where they want the most violent graphic artists. It's like from a movie.
>>
>debo
>>
>>106728332
s-shut up! you are making me tear up!!! i-it's not like what you said made me feel or anything!
>>
File: 00412-3701560761.png (1.94 MB, 1824x1248)
1.94 MB
1.94 MB PNG
>>106728358
gonna make you a nice catbox so you can judge how im using the lora and were it can improve, it definitely has trouble not twisting here and there but other than that its really nice.
>couldn't share some of them here because her tits just POPPED out and there was some stripping down to pussy
>>
>>106727923
it's not ideal but I also don't think it's bad
>>
>>106728313
>your lora's solid.
ty. glad you enjoy it. ive only made a couple dozen thus far
>>106728379
yeah i think itd be better if i used highres stills from the show but i wasnt able to find any. i have to seed hunt heavy with it so
>>
>>106728379
I can give it a go over a remote connection to my workstation. I really need to add a metadata viewer so it's easy to set components desu
>>
>schizo melting again
lmao seethe
>>
>>106728358
y pretend to be me tho
>>
>>106728243
>he's back
Fuck off anistudio dev your ui is dog shit and you spend more time shilling on 4chan than fixing it
>>
>guise guise look at my crashing sd.cpp wrapper
>>
>>106728243
>adding llama.cpp is on the list for me
MY BROTHER IN CHRIST FIX THE BASIC IMAGE GENERATION FIRST
I CAN'T EVEN LOAD A SAFETENSORS FILE WITHOUT IT OOOMING
>>
>>106728329
use hd
>>
>>106727127
>bakers
It's one guy 90% of the time
>>
File: QwenEdit4StepsQ4.png (2.95 MB, 2352x1328)
2.95 MB
2.95 MB PNG
Okay, this is sad
>>
>>106728415
I should have quoted the relevant part whoops! I do want to try the Lora but it's almost 4:00am in Tokyo so maybe I should just pass out. I just want to be supportive because anon is proud of his Lora and wants to make it better. I do want it for my collection desu. anyways, gn all
>>
>trani on his discord call with @mfw.debo to do gay advertisement again.
We doing this again?
Your comic book shop doesn't even like you we saw how they viewed in that video retard.
>>106728485
>not wanted
>still shill
>make no progress
Say the line again trani
>>
>>106728254
Based and schizo-pilled.
Also >>106728485 for the love of god just fork Forge and stop trying to make your own broken shit from scratch.
You're not him. YOU ARE NOT FUKING COMFY
>>
>>106728396
and here ya go https://catbox.moe/c/npfefe
even when it kinda style twists, still looks good.
>>
>>106728485
gn king
>>
>>106728485
Ani if you're reading this, just contribute to NeoForge or ReForge.
Your "vision" is a hallucination, your UI it's trash
>>
>trani advertizing his commercially licensed product again on 4chinz
>>
Meanwhile in a parallel universe where AniStudio actually works...

(it doesn't)
>>
>>106728485
ani, you should ask the reforge devs to help you out so we don't have to deal with python anymore
>>
>>106728471
the qwen lightning loras all cause a weird pixelation artifact

>>106728456
fucking hell yet another version? fine I'll try it
>>
(He's still here)
>>
AniStudio Crash Report
Error: Everything is broken.
Solution: Use ComfyUI.
>>
>>106728485
>I do want it for my collection desu
:D grab it from >>106702561
>>106728503
impressive. this is the first time im seeing how good that cyberfix tune is
>>
>>106728456
>>106728329
don't use hd
1hd is hot garbage
use chroma dc-2k instead, it's a sidegrade from 48 and has exceptional realism if you direct the photo right
threadly reminder that all flux loras work fine on it as well
>>
>>106728485
SPEAKING OF YOUR SHIT UI
I tried installing it again for the 5th time and it still throws a error on launch. FIX YOUR SPAGHETTI CODE YOU TALENTLESS HACK.
>>
>anistudio: "I really want you guys to give game dev a shot"
>also anistudio: can't even make a functional image generator without it shitting its pants on a fresh install.
>>
>>106728379
>>106728503
>so you can judge how im using the lora
i load up my noob gen negatives with a bunch of furry and troon shit but your gens look great
>>
File: caramel.jpg (1.44 MB, 1296x1728)
1.44 MB
1.44 MB JPG
1girl, huge_breasts
>>
>>106728485
>mfw AniStudio shills his broken UI for the 900th time
>>
did ani finally fix the text clipping issues? i can't test it because the ui crashes most of the time
>>
Daily reminder that ComfyUI just works. You load a workflow, and you don't have to deal with schizo devs having a mental breakdown in the general.
>>
>>106728243
>I really want you guys to give game dev a shot with llm
I tried making a platformer with ChatGPT and gamemaker, but progress stalled heavily when I encountered a bug I couldn't fix. The code is too complicated for both me and the LLM to know exactly where it's going wrong despite a lot of debugging. Still, within a year or two the context sizes and reasoning will have caught up, so for now I'm working on the VN side.

IdeaGuy Dev Soon(TM)
>>
>>106728485
Ahhh AniStudio dev...
Kek. The man, the myth, the malfunction....
Still shilling the broken dream....
At this point just contribute to Forge instead of reinventing the wheel FAGGOT!
>>
File: 00475-4005446379.png (1.82 MB, 1824x1248)
1.82 MB
1.82 MB PNG
>>106728546
its honestly a bit schizophrenic, you REALLY need to balance positives and negatives + cfg scale with it. >>106728574
thanks, like i said, balancing act. i was wondering why my shit was frying earlier but this model's a bit sensitive. + loras and how they're trained.
>>
ani enters thread , immediately 12 posts appear defending him in broken ESL, ”you’re just jealous, not comfy, my ui will save wan!!!”
>>
Timeline of TraniStudio™:

2023: “I’m making the ultimate comfy killer UI, expect beta next week”
2024: “just a few memory leaks guys trust the plan”
2025: “I NEED YOU TO DO GAME DEV WITH ME WE’LL KILL COMFY WITH VN PROJECTS AND CPP”
Present: still can’t txt2img without oooming

Pure kino.
>>
i am trying to do lip syncing with a wan video i already have but i am getting oom. the subject's head is only in the top half of the video, is it possible to cut a square out of the video for the lipsyncing the glue it back together?
>>
File: 80b parameters.png (1.38 MB, 896x1152)
1.38 MB
1.38 MB PNG
>>
>>106728620
>12 posts appear defending him in broken ESL
that is an odd way of saying schizo spam that has been fudding nonstop
>>
>>106728539
You don’t get it anon, crashing on startup is a feature, it’s avant-garde performance art. AniStudio is the first conceptual UI, runs better in your imagination than on a GPU.
>>
>>106728620
im not sure why i thought vpred loras wouldnt work on eps models (like cyberfix) but thats probably also contributing to the frying
happily surprised mine does work
>>
>>106727923
>Qwen 2509 is a bit shitty isn't it?
it's losing concepts due to the finetuning (can't do styles as easily as the previous version) and it's only getting worse, plus the zoom in effect and the plastic skin hasn't been fixed, at this point they need to restart this shit from scratch with a better dataset if they really want to be "ThE nExT nAnO bAnAnA"
>>
how long can schizo keep this up?
>>
ComfyUI: drops a stable update every week, entire community thriving.
TraniStudio dev: spends 8 hours on /ldg/ explaining how society wronged him instead of fixing a null pointer that’s been breaking his app for six months straight.

choose your fighter.
>>
File: 00484-1272320856.png (1.72 MB, 1248x1824)
1.72 MB
1.72 MB PNG
>>106728660
vpred and vpred loras are very very strange, its a headache.

for reference to what i was mentioning earlier about the balancing act of cfg and how a model/lora was trained, that last image and its settings but in wainsfw140 with nothing changed;
>>
>>106728597
AniStudio dev is like that one guy in every friend group who insists his “startup project” is gonna flip the industry
>>
>>106728700
then why is he going to big events in Japan and living his best life?
>>
File: image.jpg (31 KB, 500x500)
31 KB
31 KB JPG
STOP THIS MADNESS!

EVERY TIME HE CRAWLS BACK HERE EVERYONE GETS BAITED AGAIN. STOP FEEDING HIM HIS SCHIZO RECYCLOPS ENERGY.
WE ALREADY HAVE COMFY.
WE ALREADY HAVE FORGE.
WE ALREADY HAVE SWARM.
ANI IS JUST SOME DERANGED NIGGER
>>
File: 00490-276291084.png (1.19 MB, 1248x1824)
1.19 MB
1.19 MB PNG
>>106728692
>nothing changed, but 7 CFG
>>
>>106728485
dude just fork Forge and call it a day, nobody will clown you then
>>
>>106728725
>still hasn't filtered "ani" or "anistudio" on 4chanX
ngmi
>>
I am scared schizo might just crash out and kill himself
>>
File: 00493-4187053863.png (2.06 MB, 1248x1824)
2.06 MB
2.06 MB PNG
>>106728731
aaaaand 7 CFG + anon's lora at 1.5 lora strength

honestly, this looks cool as fuck when you let wainsfw's heavily trained styles add a bit more flair.
but it really is a lot about how you know a checkpoint behaves if its trained on styles already.
>>
>>106728644
Can it do opened bobs?
>>
>He comes back into the thread again trying to convince everyone to abandon Comfy for his startup
>>
>Shills 24/7
>Declares vendetta against Comfy
>UI doesn’t work, even his own fans can’t install it
Peak comedy
>>
https://www.youtube.com/watch?v=S6LL5iA6y9o
>>
the only comedy here is the schizo going bananas over someone that wants to make something cool for everyone to use. what a clown
>>
does every ui dev have its dedicated melting schizo in this thread or something
>>
>>106728780
I posted this because I know how to play this song's guitar solo.
>>
>>106728797
pretty cool stuff anon but maybe a gen to go along with it next time
>>
>>106728797
Proof?
>>
Thread Status:
Ani spotted
Samefagging detected >

Comfy still undefeated
>>
File: 00502-4149328524.png (3.18 MB, 1824x1248)
3.18 MB
3.18 MB PNG
>>106728780
>>106728797
cool dude

>>106728806
i don't blame him for wanting to show off a lil
>>
File: ComfyUI_00131_.png (925 KB, 768x1024)
925 KB
925 KB PNG
>>106728806
>>
>derail thread with schizo
>uses same one anon cope
Gee I wonder why everyone hates you
>lies about software
>for profit
>tries to hijack thread by working with mfw.debo on discord to change the OP
You have your shitty program in two OP and you still can't get traction.
>>106728628
>>106728636
preach
>>106728654
>>106728796
>>106728781
>Obvious @mfw.debo post
Go back to your dead thread
>>
Honestly kind of admire the raw schizophrenia it takes dedication to be THIS chronically wrong on the internet.
>>
>>106728807
I have played guitar for 30 years now. There is no proof.
>>
This thread has reached levels of cozy previously believed by China's top scientists to be unachievable
>>
>>106728329
try both hd and dc-2k
>>
>>106728485
>>106728243
Imagine being so terminally online you SAMEFAG against yourself while your UI can’t even load a PNG without going OOM.
Ani, fix your meme app before you try leading the Great Gaming Revolution™.
>>
>>106728295
>uv instead of pip
I tried it and it is indeed very fast but it's not like I use I pip all the time anyway.
>>
>>106728840
I really like how ani is returning to his speeches about doing what you love, anons sharing advice and stocking anon wanting to make better models when his Lora is already kino.
>>
Everyone ITT: generating 1girl, boobs and catboxing LoRAs
AniStudio: "hey guys what if we made Visual Novels with C++ integrations and LLM driven dialogue"
Nobody asked. Absolutely NOBODY asked.
KILL YOUR SELF
I HATE YOU!
>>
>>106728750
thank you for playing around with my lora, anon :D i gotta think of another one to do soon maybe another kino zillenial cartoon
>>
>>106728865
Just drink
Instant Coffee Type 2
>>
>>106728847
hd has the problem of flux textures for some reason and everything looks like it is overbaked flux
dc-2k is out of the box a little bit more cinematic
i found that dc-2k tends to switch to illustrations on long booru tags more often than hd but the overall quality and anatomy and lighting seems better
>>
File: 00517-2927283176.png (2.48 MB, 1824x1248)
2.48 MB
2.48 MB PNG
>>106728870
>maybe another kino zillenial cartoon

god you are so fucking based, please do.
That said, i don't often lurk the threads like i'm doing now (or recently) that often, do you post your stuff anywhere else?
>>
I M GOING TO BED SON OF A BITCH YOU RUINED MY DAY OFF!
>>
damn chroma rapid aio got removed
>>
File: Your Girlfriend.png (971 KB, 768x1024)
971 KB
971 KB PNG
>>
>>106728889
no but if i ever do post a lora itll be here https://civitai.com/user/souleoj
>>
File: 00529-1630938710.png (2.34 MB, 1824x1248)
2.34 MB
2.34 MB PNG
>>106728920
cooooool will keep an eye out
>>
>>106728644
Not bad, but highly inefficient. Those titan models won't be useful before we can put our hands on cheap high-VRAM GPUs.
>>
File: ChromaHD_00002_.jpg (1004 KB, 1248x1728)
1004 KB
1004 KB JPG
>>106728880
I prefer HD for non-realistic styles
>>
>>106728870
my suggestion? Powerpuff girls. would mesh well with the styles you have already
>>
>>106728026
Yes worked for the last tests I did.
>>
>Look in thread see this shit
I just want to highlight the general was peaceful and to nobody's surprise /sdg/ was at a complete standstill. You know what this guy is all about and you should ignore him and his other loser friend and only reply with gens involving wheelchairs.
He has nothing, stop giving him engagement his only reason for living is to do the shit you see right now. In regards to ani, his life is hell you should post gens with drunken losers because it will be more new gens then either of them have posted in this thread in over a year.
Don't even bother them in the other thread leave them the fuck alone and that hurts them the most.
>>
File: ComfyUI_VFI_00016_.mp4 (1.59 MB, 736x960)
1.59 MB
1.59 MB MP4
>>106728072
think i got it sorted m8
>>
File: file.png (57 KB, 1159x246)
57 KB
57 KB PNG
>>106728900
silveroxides in the chroma discord decided to act like a little bitch and pointed out some licensing stuff with the 2k model and then just kept going on about people always using wrong settings so the guy straight up just removed the entire repo and left.

fucking ironic that he complains about people using the "wrong" stuff when there are a trillion workflows and he himself uploads new ones basically daily.
idk what crawled up his ass and died. i've been using v6 aio without any issues since he uploaded it.
for someone wanting chroma to get more adoption they are doing their absolute damndest to halt it.
no concise documentation, no real info on what's happening or where chroma is headed etc. you have to be part of their discord and even then still piece shit together.

i like chroma but this is getting pathetic.
>>
What's the current best way to swap a face?
>>
>>106728978
What happened to your balls when you was a teenager?
>>
>>106728988
a very good scalpel
>>
>>106728988
a very sharp knife and a good surgeon.
>>
>>106728978
man I'm so tired of every technical discussion being behind some closed off discord, this is a such a disastrous idea
>>
>>106728978
discord and its consequences have been a disaster for these autistic generations
>>
>>106728978
I don't get it. Why care if someone whines
>>
>>106728988
Has it changed? I thought it was IpAdapter for SD1.5 and Insightface for SSXL and related.
>>
>>106728978
I really want to like Chroma and had high hopes for it but seeing the people involved act like miserable cunts who gatekeep all knowledge and plans for it on their gay little Discord server makes me hope that it does actually fade into obscurity.
>>
>>106729015
I was wondering if there was something new. Maybe Qwen Image Edit
>>
>>106729020
From my research and it was extensive the model was made wrong. Other anons pointed it out and the model is destroyed fundamentally with how it uses tokens. The main reason why it will see no traction is due to his retarded 512x512 training on top of obfuscating token which he said he wouldn't do. I feel bad for people that gave this guy money especially once he decided to freestyle it.
If your inherent tokens can overpower loras at full strength on random seeds your model is worthless. Not only that common phrases will cause a style change even with a lora which is a full on deal breaker
>>
>debo still mass reporting anons
>>
>>106728642
i tested this with infinite talk and wan2.1 and the lipsynced video had its color changed too much, will try to find a wan 2.2 flow
>>
File: 00542-2678112676.png (2.76 MB, 1824x1248)
2.76 MB
2.76 MB PNG
I don't like the samefagging meltdowns, couldn't care less about poopdickschizo thread lore. I just wanna swap gens and learn to make better ones.
>>
>>106729065
I'll help you and post wheelchairs when he starts up again. The general was so nice yesterday. I'm starting to think he argues with himself to goad anons into his bullshit.
>>106729057
Which is why you post gens that hurt him like new wheelchair gens
>>
nigli anon come back
>>
>>106729023
2509, at least from my personal experience and testing so far, is perfectly fine at replacing a face with a generic face using only one image, but trying to use a second reference image and replace a face with that specific face makes it absolutely awful at the task.

I'm not sure if it's just struggling to understand the prompts or what, but it will either give you an identical output to your input with virtually no change to the face, or it'll copy-paste one face onto the other regardless of orientation or lighting or sometimes even sizing.
>>
>>106728978
>hardware in his screen name
how 2 spot a faggot from a mile away
>>
>>106729084
Oh wow what a lovely new gen.
>>
>>106728978
>>106729107
>hardware in his screen name and his real face as the pfp
>>
>ran took it personally
>>
>>106729118
baking loras on 5090
>>
>>106728978
>silveroxides
he sounds like the regular passive agreesive reddit bitch lol
>>
>>106729135
I didn't pay $3,000 just to spam a thread on 4chan. If you don't have a job by now, you are not going to make it.
No one likes a narcissist.
>>
>>106728295
>instead of pip, it's 100x faster

I won't even double my download speed from 30MB/s, stop lying to me
>>
File: 00560-3372395684.png (2.73 MB, 1824x1248)
2.73 MB
2.73 MB PNG
>>106729107
>>106729126
He's handing out the points anyone can use to bully him, like he wants it to happen.
>>
This is a good example where you ignore, of not for lora baking he would be drowned in wheelchairs.
>>106728295
This makes python management brain dead so I use it. I want to go back to noob just wanted to cook something I have a good idea on the next lora but I'm going to see the new models. I might use chroma for composition but latent couple seems to give me a similar quality at higher steps with Euler which is childs play to my current card. I still really want built in text
>>
>with people like you it's always about dick measuring contest
>you want to rank up others because you think you are so much better than them
>well if you are so much better than them, just include your idbm here
>>
>>106729183
can you elaborate pls
>>
>>106729182
What do you mean?
>>
File: 00566-775382619.png (2.8 MB, 1824x1248)
2.8 MB
2.8 MB PNG
>i'm realizing in realtime that all i had to do was have a singular (1) (one) positive tag to get good 3d realism in nova animal

>\(realistic\)
>with the \
you know what? Ain't even mad. Also ignore the broken fork.

>>106729135
>>106729182
jesus these gens are good dude
>>
>>106729195
Get a job = then get the talk
>>
>ran won
based
>>
>>106729208
No one will hire a bitch like it.
His only positive attribute is that he's been spamming the same image for 3 years now.
>>
where do i get chroma 2k?
>>
>>106729046
imagine giving someone money to train a new base model and they wind up releasing some experimental jungle juice model with issues that should have been easily avoidable. and then you watch the guy's discord pals get angry at people for trying to do anything with the model outside of their discord hugbox
>>
>>106729239
Huh yeah i wonder why nobody's trusted with the money to train new models!
>>
another day of no qwen nunchaku loras
another day of pain
>>
>>106729235
There are loras for it on Huggingface and Civitai. Don't do anything with it though or silveroxides will get angry at you.
>>
File: ChromaHD_00013_.jpg (949 KB, 1296x1800)
949 KB
949 KB JPG
>>
>>106729102
Yeah, I've been experimenting with 2509. It can't turn her head around, apparently. Also, it slightly crops the image, and the VAE or model seems to add some tint to it.
>>
>>106729254
i dont want loras, i want the checkpoint and the page is gone
>>
I want a WAN SEX FINETUNE
>>
I'm going to buy a new computer and I was curious about the generation of videos. Is a 5080 enough or a is a 5090 needed? I mean, I can afford it but speeding more than 2,500€ hurts.
>>
File: 64875105.mp4 (3.95 MB, 960x848)
3.95 MB
3.95 MB MP4
Hmm. 30fps does look better, but I don't know if it's worth the extra time.
>>
>>106728847
>>106728456
>>106728547
alright, HD seems to work a bit better actually.

>dc-2k
wtf is that??
>>
>>106729342
Don't get a 5080 now, wait for a 24GB card SUPER card in January.
The 5090 is worth it because it's fast and has a lot of vram for a consumer card.
>>
File: 1723223008336074.png (275 KB, 960x485)
275 KB
275 KB PNG
anons i need your insight on this
have there been any other (broader) comparisons since picrel that imply some pytorch versions being superior, or was it just another seedism?
reason im asking is because im regenning some old gens and they look worse, but im not sure if thats because of selection bias or if i really need to return to an older version
>>
>>106729352
can you explain what you did anon
>>
File: 00046-2536846749.png (3.9 MB, 1344x1728)
3.9 MB
3.9 MB PNG
>>
>>106729352
of course its worth the extra time, just use it selectively
>>
File: squat.webm (3.84 MB, 832x1248)
3.84 MB
3.84 MB WEBM
Ugh, can't get the phone to levitate or something. MY IMMERSION
>>
File: 00602-3638112769.png (3.37 MB, 1944x1328)
3.37 MB
3.37 MB PNG
>>106729389
aww i remember the cute brown haired girl with this same facial expression from last year.
>>
>>106729357
You have to ask on lodestone's Discord.
>>
File: 79452648.mp4 (3.71 MB, 928x928)
3.71 MB
3.71 MB MP4
>>106729369
It's just one video rendered at 30fps and the other at 16fps and then interpolated to 32fps using RIFE.
>>106729401
I wonder.
>>
>>106729358
Yeah, that's something that prevents me from buying the 5080. However, my situation is that I need to spend a lot of money before January (complex to explain) so I cannot wait for it. But still, the prices of the 5090 hurt. The founders is quite "cheap", but I'm not sure if it is going to last for long compared to other models.
>>
God it feels so good when your generated bespoke scraping script works the first time
>>
File: goblin girl.png (63 KB, 300x300)
63 KB
63 KB PNG
https://vocaroo.com/1ms10IJ06ICA
>>
>>106729482
>he has the golden opportunity to make the most cummable little green onahole lines possible
>gens gay cuckboy shit instead
why are you like this? (last one was funny though)
>>
>>106729450
>It's just one video rendered at 30fps and the other at 16fps and then interpolated to 32fps using RIFE.
Sure but 14*30=420 you rendered a 420 frames video?
how?
>>
>>106729451
I have the FE, I'm quite happy with it, worth it for my use (AI and games)
>>
>>106729409
me on the left
>>
>>106729359
>have there been any other (broader) comparisons since picrel
no
>>106729359
>but im not sure if thats because of selection bias or if i really need to return to an older version
pytorch does affect the output but the answer to your question is unknown
>>
>>106729352
>2022
>I don't want to waste time on my gf

>2025
>I don't want to waste time on my AI gf

way to go, anon
>>
File: ChromaDC-2K_00018_.jpg (1.27 MB, 1296x1800)
1.27 MB
1.27 MB JPG
>>106729449
>dc-2k
I like it, it works just fine, but it would be nice to know how exactly it's different. Takes few seconds to copy paste info to model card, don't know why he doesn't do it.
>>
File: 00130-2849161446.jpg (1.07 MB, 2048x2480)
1.07 MB
1.07 MB JPG
>>106729239
It's mind boggling, I really tried with multiple loras but it's a phenomenon I have never seen before.
>>106729199
I'm getting back into things found some new models. I'm just disappointed with chorma and it's not going to get better. I personally don't even think it's worth finetuning with how fucked up the ratios are. I hope he does see this thread and just go back to whatever epoch he started fucking with resolutions and tokens and fix it. I think the worst part is it mostly apears the moment you make any gen that's complex outside of one girl standing doing nothing. You make something more interesting and the rate where the prompt adheres to the lora drops and it's on a per seed basis which is unacceptable. This could have been easily caught during training and I do remember anons ringing the alarm bells and I was stupid enough to think he would fix it, but all I saw him do was post that cringe emoji of his bird fursona sticking it's tongue out.
He did such a good job with easyfluff what the fuck happened?
>>
>>106729389
cute
>>
>>106729499
What I mean is if they will be more prone to fail than other editions because of the lack of features these brands give (with a significant increase of the price).
>>
>>106729569
No, the only reason the FE is cheaper is because nvidia doesn't have to buy its own dies.
>>
>use wan oral insertion LoRA
>img2vid an image of a woman kneeling next to a man's penis, use the proper trigger words
>this oughta be good
>wait 30 minutes while it gens
>it's done

> the woman stands up, the man pops off his cock and puts it in his own mouth, then both of them share it Lady-and-the-Tramp-style
I hate video gen.
>>
File: ChromaDC-2K_00021_.jpg (1.07 MB, 1408x1952)
1.07 MB
1.07 MB JPG
>>
>>106729598
>the man pops off his cock and puts it in his own mouth
I've seen quite a few nsfw loras do that, it's a training issue, especially with so many morons using trigger words instead of natural language and building on wan basic understanding.
>>
API nodes are insanely powerful, probably the greatest addition to ComfyUI
>>
https://www.reddit.com/r/StableDiffusion/comments/1nsyqls/qwen_image_vs_hunyuan_80b/
that 80b model doesn't look much better than Qwen Image
>>
>>106729610
I don't think it's a training issue. I think it's an inherent deficiency of the way this all works: you can never be certain that any action or description will be applied to the correct 'actor' in the scene. Sometimes it makes both of them suck the cock
>>
>>106729625
nice try alibaba
>>
File: myuck.gif (1.91 MB, 600x477)
1.91 MB
1.91 MB GIF
>>106727118
you guys seen the movie 'naked lunch' ?
what a trip OP video kek
>>>106724447
s a v e d
>benis
i approve this message
>>
>>106729598
>wait 30 minutes
time to use lightx bro
>>
time to use API nodes
>>
>>106729491
It's rendered in 129 frame chunks using sliding window
>>
File: Hunyan3Output.png (1.95 MB, 1024x1024)
1.95 MB
1.95 MB PNG
dude Hunyuan Image 3.0 is ASTONISHINGLY bad for what it ostensibly is architecturally
- limited to 1 megapixel (so much worse than both Qwen AND Hunyuan 2.1 in that regard)
- prompt adherence is approximately equivalent to both Qwen and Hunyuan 2.1 ish, no clear improvements at all as far as complex prompts or text coherency
- aesthetically it's a giant mixed bag, some stuff looks ok, other times it comes out turbo slopped

Like I've been testing it extensively trying to be as fair as possible on Fal.AI but it's just legit not that good
>>
>>106729758
>- limited to 1 megapixel
holy kek
>>
>>106727118
We're reaching a point where there's diminishing returns I feel like, it would be better for them to optimize these models so the community can improve them like other models.
>>
File: 1750372990825408.png (153 KB, 498x498)
153 KB
153 KB PNG
>>106729758
>limited to 1 megapixel
LMAOOOOOOOO
>>
>>106729767
what i'm hearing is that you want a 200b model that isn't discernibly better than anything we have now despite being many times larger
>>
>>106729778
We are already there
>>
why hasn't anyone made t5xxl work with sdxl
>>
>>106729790
Very good question, I'm guessing people want better models but I think that alone would be the buff people need.
>>
>>106729740
>sliding window
ah it's t2v then
>>
File: hilarous!.png (372 KB, 715x319)
372 KB
372 KB PNG
>>106729758
>80b model
>still can't write words
top kek
>>
>>106729758
>- limited to 1 megapixel
Can you go lower or it's like SDXL where it HAS to be 1mp?
>>
File: 00033-2664893731.jpg (82 KB, 450x450)
82 KB
82 KB JPG
Very happy with Chroma
>>
File: 00037-1149459993.jpg (77 KB, 450x450)
77 KB
77 KB JPG
Handles 2 characters very well.
>>
>>106729758
>limited to 1 megapixel
I really, really, do not get it. i can't square it. Where do they go making that particular decision in the sea of tasks that go into making a model?
>>
>>106729794
what would you need to do to actually make that work? just train it further using t5xxl as the encoder instead of clip?
>>
>>106729694
>naked lunch
based
>>
File: ChromaDC-2K_00028_.jpg (807 KB, 1408x1952)
807 KB
807 KB JPG
>>
File: 00047-2117906690.jpg (74 KB, 450x375)
74 KB
74 KB JPG
Sorry, this was the photo, different characters, different color of clothes, different body composition. Chroma has potential.
>>
>>106729516
and here I thought this general had already painstakingly compared everything there is to compare...
guess I'll have to give it a shot and see if it makes a subjective difference then

>>106729605
this goes pretty hard. catbox?
>>
>2025
>Chroma has potential.
>2026
>Chroma has potential.
>2027
>Chroma has potential.
Why do you keep necro shilling a dead model
>>
File: ChromaDC-2K_00029_.jpg (862 KB, 1408x1952)
862 KB
862 KB JPG
>>
>>106729815
Fal overrides to 1024x1024 if I try to do 512x512, so I assume the model isn't officially multi-res trained for below 1 MP like e.g. Flux or SD 3.5 Medium were
>>
So is genning at a 1280x720 base a nay or a yay on sdxl based models? its always a hot topic of debate and i've just opted for 1216x832 but to properly upscale to 1080p and then downscale again to 720p perfectly for wan seems like the best route, but i am curious if i'm potentially raping the quality of the output by genning at that particular resolution.
>>
why does comfy not use vram for the model merge nodes
>>
>>106729888
We already have a schizo meltdown today, you came late.
>>
>>106729846
they really went full "stack moar layers bro" meme and thought it would be enough to fix everything lmao
>>
>>106729914
>schizo is when you're suspicious of the capabilities of a model
>>
>>106729897
>ChromaDC-2K
what's the difference with that model?
>>
File: fairy-blue-pink-cartoon.jpg (1.28 MB, 1512x2080)
1.28 MB
1.28 MB JPG
>>106729906
That resolution is fine for SDXL. The rule of thumb is around 1 megapixel. I've used 1280x720 plenty of times in the past. I remember the original SDXL recommended these resolutions:

1024 x 1024
1152 x 896
896 x 1152
1216 x 832
832 x 1216
1344 x 768
768 x 1344
1536 x 640
640 x 1536


Don't know if that really matters anymore. Illustrious can do noticeably higher resolutions.
>>
>>106729851
>>106729838
>>106729828
Remember if you don't add any of these images to the next collage you're fatphobic and have a distorted view of the natural human body
>>
you thought comfy would realize this can actually get more users to subscribe to comfy cloud yet he threw away this money grabbing opportunity
>>
>>106729944
Nobody knows, you have to ask on Discord.
>>
File: 00734-3756665920.png (3.08 MB, 1920x1080)
3.08 MB
3.08 MB PNG
>>106729947
thanks Illustrious seems to handle this particular resolution perfectly fine.

though i still have lots to learn about avoiding funnyness.
>>
File: ChromaDC-2K_00032_.jpg (972 KB, 1296x1800)
972 KB
972 KB JPG
>>106729944
No clue
>>
>>106729952
he's right, it's a fucking 80b model, no one can run that and the outputs are pretty mid
>>
File: Screenshot_665.png (99 KB, 659x581)
99 KB
99 KB PNG
>Spend so much time redrawing segments of AI art that I become okay at actual art
Anyone else experience this sorcerous fuckery? I had resigned myself to never being able to draw, but when I tried again recently, it made so much more sense.
>>
File: HunyuanImage21Output.jpg (3.24 MB, 2048x2048)
3.24 MB
3.24 MB JPG
>>106729758
Hunyuan Image 2.1 on the same seed and prompt produces this BTW. Still pretty slopped but it actually gets the text right and it's also twice the resolution of Hunyuan 3.0. So yeah I just generally don't understand what happened here lol, like HOW did they wind up with an 80B model this bad.a
>>
>>106729984
>no one can run that
they can run with comfy could
>>
>>106729985
I am still at the stick figure level with a pencil but I have gotten pretty good at modifying and tweaking images in GIMP without having to inpaint over the years.
>>
>>106729986
that looks even worse solely because it's barely a salvageable attempt at pseudo realism
2.1 isn't a high parameter model is it? never ran/seen the hunyuan image models.
>>
>>106729994
>2.1 isn't a high parameter model is it?
it's a 17b model
>>
>>106729956
i dont have discord so i guess im fucked
>>106729981
aight, still gonna download it
>>
>>106729990
I'm stuck in paint-like programs, as you can see by the pic, but... it's still way better than it was. I'd wholeheartedly rec giving drawing in paint in a simple style like that a try, I think you'll find that it's way smoother than you think. Especially with the granularity of control it gives you and the lack of need to keep your hand steady, it's super nice. Especially on a trackpad or really low DPI mouse.
>>
>>106730007
>>106730007
>>106730007
>>106730007
>>106730007
>>
>>106729947
>Illustrious can do noticeably higher resolutions.
i get grey bars on the edges of my image when i do 1032x1416
>>
>>106730012
Early ass bitch
>>
>>106729758
let this be the beginning of the end for chinese bloatmaxxing and benchmaxxing. anything larger than 3B is bloat.
>>
>>106729947
yeah those are the standard XL buckets. They definitely still "matter" unless you're specifically using a checkpoint that has been trained at higher resolution buckets (and generally you'll know if that's the case, it will say as such on the CivitAI page or whatever).
>>
>>106730002
ZAAAMNN THATS A SEVENTEEN BILLION PARAMETER MODEL?

>>106730025
starting to unironically believe this
>>
>>106729952
I don't care about that as much as the fact he is contradicting what he said before about implementing the best image models out there regardless from Stability or not. He should've gave a different more convincing explanation for this and this just makes everything look worse in the face of the whole local vs API shift controversy going on.



[Advertise on 4chan]

Delete Post: [File Only] Style:
[Disable Mobile View / Use Desktop Site]

[Enable Mobile View / Use Mobile Site]

All trademarks and copyrights on this page are owned by their respective parties. Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.