/g/ - Technology


Thread archived.
You cannot reply anymore.




Always Trying Edition

Discussion of Free and Open Source Text-to-Image/Video Models and UI

Prev: >>106781464

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/sd-scripts/tree/sd3
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/tdrussell/diffusion-pipe

>WanX
https://comfyanonymous.github.io/ComfyUI_examples/wan22/
https://github.com/Wan-Video

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
Training: https://rentry.org/mvu52t46

>Neta Lumina
https://huggingface.co/neta-art/Neta-Lumina
https://civitai.com/models/1790792?modelVersionId=2203741
https://neta-lumina-style.tz03.xyz/

>Illustrious
1girl and Beyond: https://rentry.org/comfyui_guide_1girl
Tag Explorer: https://tagexplorer.github.io/

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
GPU Benchmarks: https://chimolog.co/bto-gpu-stable-diffusion-specs/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Bakery: https://rentry.org/ldgcollage

>Neighbours
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
>>
File: 00444-1465474724.jpg (1.01 MB, 2048x2688)
Blessed thread of local kings.
>>
>>106784383
plump
>>
File: radiance.png (2.7 MB, 832x1488)
>>106783949
who could have fucking guessed... they'll probably also use this to stop people from lampooning politicians and anything else like that
>>
It is with no exaggeration that I say xixxix is the best.
>>
File: radiance.png (3.19 MB, 832x1488)
>>
File: radiance.png (3.1 MB, 832x1488)
>>
File: radiance.png (3.06 MB, 832x1488)
>>
File: 00178-131866008.png (1.21 MB, 1368x704)
>>
>>106784383
ok, thread schizo.
>>
>>106783949
went to fuck around and was getting blocked WAY more, lmao, the only slightly good period of sora is over
>>
Blessed thread of frenship
>>
Ran thinks he is the thread owner. He is the single reason for all the drama.
/ldg/ is a convenient way for him to get attention because apparently discord isn't enough for him.
>>
File: 1741477194616796.mp4 (3.12 MB, 720x944)
>>106784383
>>
there are simply too many artists to train on, my queue is massive
>>
File: AniStudio-0000p.png (377 B, 666x666)
>>
>>106784371
why doesn't anyone else contribute to sdcpp? is everything supposed to be ani that makes everything not done by the author?
>>
>>106784494
nobody pays attention to her on discord because all she does is blogpost
>>
>>106784527
Sort by those most vocally against AI and train on them first.
>>
Run and Debo are the same? And Ani?
>>
>>106784549
>her
>she
>>
>>106784383
Ok, thread schizo
>>
File: radiance.png (2.87 MB, 848x1488)
>>106784359
not a big deal to me. you can hope the model learns it this time. or just (auto-)load style loras later

maybe you can talk to lodestone, but don't be surprised if it's not easy
>>
>>106784472
Nice gen
>>
>https://blog.samaltman.com/sora-update-number-1
SaaS fags in SHAMBLES
>>
File: radiance.png (2.19 MB, 848x1488)
>>106784527
i'm sure you'll find out some that you want trained first. could also take into account what edit models might find more difficult to imitate. godspeed, anon
>>
>>106784597
A surprise to no one kek
>>
>>106784597
its so funny how shackled down SaaS models are
>>
File: ComfyUI_01795_.png (1.28 MB, 1024x1024)
>>
>>106784597
>We are hearing from a lot of rightsholders who are very excited for this new kind of "interactive fan fiction" and think this new kind of engagement will accrue a lot of value to them, but want the ability to specify how their characters can be used (including not at all).
ARE CLOUDKEKS REALLY??
>>
>>106784597
that didn't take long to make this shit lame and gay this time lmao
>>
oh yeah
yeah im feeling safe
>>
>>106784597
>Could barely do anything fun from the start
>Now fun isn't allowed
leak the model
>>
File: ComfyUI_01799_.png (1.59 MB, 1024x1024)
>preparing fresh localslop
>>
File: file.png (2.77 MB, 848x1488)
>>106784520
jiggle. what I really like tho is the chibi sheep and serene atmosphere with the birds.
>>
>>106784644
>>Could barely do anything fun from the start
I found this fun kek
https://files.catbox.moe/diri53.mp4
>>
based collage. good variety of 1girl and other shit. Shows the last thread was healthy. Nice.
>>
File: ComfyUI_01802_.png (1.45 MB, 1024x1024)
>>
>>106784650
Cloudslop just doesn't taste as good as the homemade slop.
>>
File: 00229-257001642.png (2.87 MB, 1248x1848)
>>
>>106784563
Ranfag is the one with the real discord server. Just saying...
>>
File: ComfyUI_01809_.png (1.33 MB, 1024x1024)
>>106784666
>trips
Like reading a book you own at home, when there are libraries. On my time and how I like it (and in my budget sadface)
>>
>>106784690
ran has a discord server running in her head. only schizo inner voices allowed
>>
File: ComfyUI_temp_effqj_00001_.png (3.42 MB, 1192x1648)
>>
File: 1748049505164458.mp4 (2.04 MB, 720x1248)
>>106784602
>>
>>106784587
>Nice gen
thanks
>>
>>106784659
I want to combine the generals. Devo's recent thread gens showed me what's wrong with /ldg/, there's no artistic variety. Everything is "single character(or two if it's chroma) maybe sexy, doing something" that's awful. I'll bring in artists from other boards to get fresh content. This community is wasting good hardware on meme gens
>>
>>106784705
>her
>>
Clean it up janners.
>>
>>106784729
what man do you know that drama and blogposts this much? it's woman behavior
>>
>>106784624
This is great. I don't know if you are Debo but I really like your gens
>>
>>106784745
lmao, ok fair enough
>>
>>106784665
Nice gen
>>
>>106784472
Nice landscape, how do you prompt that?
>>
[BUYING]
Vintage Sora2.a generations. Looking for well-preserved alpha Sora2 generations, preferably featuring characters such as SpongeBob and Bob Ross before they were nuked.
>>
File: 1755277934703293.mp4 (2.26 MB, 720x720)
>>106784665
>>
>>106784746
Not debo. Glad you've been enjoying, possibly drunk anon

>>106784755
ty, Chroma flash has been fun

>>106784770
wow, that's really cool!
>>
File: file.png (236 KB, 538x465)
Why do open-source image editing models lag behind closed-source giants?

>>106784769
I'm collecting ones with blackface and hitler.
>>
>>106784770
Like a sea story!
>>
File: ComfyUI_01814_.png (1.63 MB, 1024x1024)
>>106784781
>picrel
>>
File: 1759279148302792.png (951 KB, 1064x984)
the white anime dog with pink hair is on a pirate ship at sea, and is wearing a pirate hat and black eyepatch. keep their pose the same.

qwen edit can do basically anything, also a great meme maker.
>>
>>106784784
>Why do open-source image editing models lag behind closed-source giants?
they don't have thousands of kenyan slaves manually annotating data
https://xcancel.com/WenhuChen/status/1973763996911054902#m
>>
File: 00184-351981838.png (1.26 MB, 1368x704)
>>106784764
Surreal, exoplanet horizon, by Igor Morski, by Dali, This is a digital CGI artwork depicting a surreal landscape.. Dark blue night sky, The composition combines natural and abstract elements, creating a dreamlike, otherworldly scene. The painting uses vivid, contrasting colors and realistic textures to create a fantastical, otherworldly scene The textures are smooth and hyper-realistic, emphasizing the contrast between the organic and the man-made. This digital painting depicts a surreal landscape with a striking contrast. In the foreground, a smooth, flowing shards rise from a sea of dunes. . The foreground features a snow white desert and sparse, twisted, structures with flat, golden canopies. The mid-ground showcases a calm, blue ocean that meets a distant horizon. In the background, there's a large, partially visible planet set against a clear, bright blue sky.. unusual shapes and the desert's bright hues add a sense of alien beauty and mystery. The overall style is hyper-realistic with a fantastical twist, emphasizing the surreal beauty of the desert scene. windswept jellyfish swim in the sky The sky above is a gradient of dark blue to beige, with soft, billowing clouds adding to the dreamlike quality. The texture of the sand appears fine and soft, while the formations are rough and jagged.This is a highly detailed, digital painting of a surreal, alien landscape an snow white sand dune rises steeply, contrasting with the calm, blue ocean in the background. The foreground features a small, barren island with jagged, textures and a few , twisted cuboids. adding to the surreal atmosphere.
Steps: 26, Sampler: Euler, Schedule type: Simple, CFG scale: 1, Distilled CFG Scale: 3.5, Seed: 351981838, Size: 1368x704, Model hash: 4610115bb0, Model: flux1-dev, Version: f2.0.1v1.10.1-previous-669-gdfdcbab6, Module 1: ae, Module 2: clip_l, Module 3: t5xxl_fp16
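
The settings line above looks like the comma-separated `key: value` infotext that A1111/Forge-style UIs save with a gen. A minimal sketch of reading it back into a dict, assuming no value contains a comma followed by a space (the helper name `parse_settings` is made up for illustration and is not part of any UI):

```python
def parse_settings(line: str) -> dict[str, str]:
    """Split a 'Key: value, Key: value' settings line into a dict."""
    settings = {}
    for field in line.split(", "):
        key, sep, value = field.partition(": ")
        if sep:  # skip fragments that do not have a 'key: value' shape
            settings[key.strip()] = value.strip()
    return settings


# Shortened copy of the settings line from the post above.
line = (
    "Steps: 26, Sampler: Euler, Schedule type: Simple, CFG scale: 1, "
    "Distilled CFG Scale: 3.5, Seed: 351981838, Size: 1368x704, Model: flux1-dev"
)
settings = parse_settings(line)
width, height = (int(n) for n in settings["Size"].split("x"))
```

Everything comes back as strings, so cast `Steps`, `Seed` and friends yourself before feeding them to another tool.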
>>
File: 1757300957956898.png (845 KB, 1064x984)
>>106784798
the white anime character with pink hair is wearing black sunglassses and a black trenchcoat and is typing on a computer in a dark business office. keep their pose the same.
>>
File: ComfyUI_01822_.png (1.63 MB, 1024x1024)
>>106784798
Not bad

>>106784806
Not bad at all
>>
>>106784520
Okay I will do video gen now
Is that built in or a Lora?
>>
File: ComfyUI_41845_.jpg (1.87 MB, 1456x1872)
trying to make a coherent image with base noob be like
>>
File: ComfyUI_01824_.png (1.74 MB, 1024x1024)
>>
File: ComfyUI_01827_.png (1.38 MB, 1024x1024)
>>
>>106784781
>>106784804
Yes your gens and the landscape ones are refreshing to the eyes. AsukaSLOP, RanSLOP, and RadianceSLOP have become visual pollution to me at this point.
>>
ani is saying to bother comfy devs (non comfyorg) to contribute to sdcpp instead of shitting into the bloat pile. kinda based considering it would rid us of the shitty gpl3 licence once and for all
>>
>>106784823
Learn to love the wrangling. Or puss out and use cyberfix.
>>
>>106784834
Don't forget the beloved PedoSLOP
>>
File: 00192-1789447483.png (964 KB, 1024x1024)
>>
File: 1736591087703025.mp4 (1.09 MB, 1280x656)
>>106784472
>>
File: 1736660013935183.gif (2.55 MB, 498x272)
>my gigakino specific chroma realism style prompt gens only work with fp16 accumulation on, e4 fp8 scaled t5xxl, and v33 chroma or else it goes to plastic
>>
>>106784848
Your gens make me want to dive into them, they are soothing.
>>
File: 1754424932786390.mp4 (2.43 MB, 720x720)
>>106784792
>>
>>106784838
nta, but any tips?
>>
File: 1729717290080044.mp4 (478 KB, 720x720)
>>106784531
>>
File: ComfyUI_01855_.png (1.8 MB, 1024x1024)
>>106784834
Too fast but great detail

>>106784955
kek'd
>>
File: MarseyHalloweenayakon.png (3.63 MB, 2154x2915)
Marsey the cat
>>
>>106784922
Chromabox?
>>
>>106784723
> there's no artistic variety
do you mean diversity
>>
>>106784943
Patience. Persistence. Dedication.
>>
File: ComfyUI_01856_.png (1.58 MB, 1024x1024)
>>106784834
>x, y, and z have become visual pollution to me at this point.
This is a general you know

>>106784970
>Too fast but great detail
for >>106784872
>>
>>106784989
is 102d still the /h/ model of choice? nice gen
>>
File: 1746831290690527.png (112 KB, 1788x538)
imagine being an APIkek
>>
File: ComfyUI_01868_.png (1.56 MB, 1024x1024)
>>
>>106784974
nice
>>
File: dancejp.webm (3.64 MB, 1080x1428)
>>
>>106785054
>that res
GPU SAMA GET DOWN!
>>
File: 00397-60060659.png (2.79 MB, 1248x1848)
>1990s \(style\) gang
>represent
>>
File: radiance.png (2.31 MB, 1488x848)
>>106784798 >>106784806
very doro
>>
qwen image edit camera prompts (from a post)

change the view and tilt the camera up slightly

change the view and tilt the camera down slightly

change the view and move the camera up while tilting it down slightly

change the view and move the camera down while tilting it up slightly

change the view and move the camera way left while tilting it right

change the view and move the camera way right while tilting it left

view from above , bird's eye view

change the view to top view, camera tilted way down framing her from the ceiling level

view from ground level, worms's eye view

change the view to a vantage point at ground level camera tilted way up towards the ceiling

extreme bottom up view

closeup shot from her feet level camera aiming upwards to her face

change the view to a lower vantage point camera is tilted up

change the view to a higher vantage point camera tilted down slightly

change the view to a lower vantage point camera is at her face level

change the view to a new vantage point 10m to the left

change the view to a new vantage point 10m to the right

change the view to a new vantage point at the left side of the room

change the view to a new vantage point at the right side of the room

Fov

change the view to ultrawide 180 degrees FOV shot on ultrawide lens more of the scene fits the view

change the view to wide 100 degrees FOV

change the view to fisheye 180 fov

change the view to ultrawide fisheye lens

among others.
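
The list above is easy to keep as a lookup table if you batch edits. A rough sketch with made-up keys and a hypothetical helper `camera_prompt`; the prompt strings are copied from the list, and gluing an extra instruction on with a period is an assumption, not a documented qwen edit convention:

```python
# Keys are invented labels; values are prompts copied from the list above.
CAMERA_PROMPTS = {
    "tilt_up": "change the view and tilt the camera up slightly",
    "tilt_down": "change the view and tilt the camera down slightly",
    "birds_eye": "view from above , bird's eye view",
    "wide_100": "change the view to wide 100 degrees FOV",
    "fisheye": "change the view to fisheye 180 fov",
}


def camera_prompt(move: str, extra: str = "") -> str:
    """Return the camera prompt for `move`, optionally followed by `extra`."""
    base = CAMERA_PROMPTS[move]  # an unknown move raises KeyError on purpose
    return f"{base}. {extra}" if extra else base


prompts = [
    camera_prompt(move, "keep their pose the same.")
    for move in ("tilt_up", "fisheye")
]
```

Swap in whichever entries from the list you actually use; nothing here checks that the model follows them.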
>>
>>106785063
sora 2 can do 1080p lol
>>
>>106785003
idk what /h/ likes, their threads are somehow even more schizo than this one.
102d is good but still goes off the rails sometimes. I switch between like 15 different anime models depending on the style
>>
>>106785026
sora community is also stupid. the guys showed copyrighted videos, every minute and publicly...
>>
>>106785088
it can't do nsfw though
>>
File: 1745305906469537.png (3.72 MB, 1987x1098)
the woman is facing the camera directly and the camera is facing her body straight on.

a->b
>>
File: 1754822386734878.png (3.03 MB, 1376x1080)
>>106785116
it just works.
>>
File: 00216-2723270350.png (2.77 MB, 1072x1488)
>>
File: 1741133320093433.png (1.24 MB, 816x1280)
>>106785131
from behind instead of straight on:
>>
>>106785116
>>106785131
thats not exactly facing her straight on
>>106785142
this angle is good though
>>
>>106784597
How is this bad. You know they could have just blanket banned everything like they did with DALLE. Save the "shambles" part for when people start leaving the platform in droves (psst... they're not, they like it better than Wan).
>>
File: 1728531984238666.png (1.27 MB, 1676x1024)
also, qwen edit v2 (2509) can use openpose or canny/depth images as a reference.

the woman in image1 is using the pose of image2, kicking high in the air.

simple kick openpose img to test. pretty cool.

if you want to make a quick stickman pose: https://huchenlei.github.io/sd-webui-openpose-editor/
>>
File: RA_NBCM_00034.jpg (939 KB, 1872x2736)
>>
File: cropping.jpg (748 KB, 3247x1632)
https://github.com/jhc13/taggui/pull/353
Is this gonna help NL model loras?
>>
>>106785168
also openpose pose node can make an openpose image from any image which is useful.
>>
>>106784922
I wish I had a qwen edit that I could tell to get her tits out.
>>
>>106785085
king shit
>>
>>106785202
there is, anon. just say "remove the woman's clothes".

use the qwen edit 2509 remover lora for even better results.
>>
>>106785168
whoa, that's cool as fuck! so I don't actually need to plug an extra model in, just gen the stick pose and qwen understands?
noice
>>
File: 1742207412712971.png (1.65 MB, 1042x1348)
https://huggingface.co/inclusionAI/Ming-UniVision-16B-A3B
Soo, no one has tried this yet?
>>
>>106785211
Can I use loras while also using the lightning v2 one?
>>
>>106785216
yep, the new version understands controlnet data.
>>
>>106785193
imagine if we could just prompt like this or something and never have to worry about bleed again.
>>
>>106785222
>I AM BENCHMOOORKING
>>
>>106785223
yep! just chain it together or add it in the multi lora node.
>>
>>106785237
>>106785225
this is music to my ears fellas
>>
>>106785235
kek
>>
File: 00029-2244191546.png (2.64 MB, 1248x1824)
>>106785187
really nice, model?
>>
File: 1753491853790184.png (462 KB, 849x512)
the woman in image1 is using the pose of image2. she is kneeling on the floor. her left hand is pointing at the camera.

ivy joestar:

(used johnny pose in aio aux preprocessor node and with zoe depth for a depth map.)

so you can use whatever image as a reference to get whatever pose/style you want.
>>
Is it possible to prevent qwen edit from shifting the whole image up and zooming in? Every single gen I make gets shifted up and adds a smidge of pixels at the bottom
>>
File: 1729286871345563.jpg (1.13 MB, 1416x2120)
>>
File: 1749709313618688.png (2.73 MB, 1728x1344)
>>
kino gen hour
>>
File: 00411-1142161680.png (2.82 MB, 1248x1848)
I hate multi-posting, but I can't help myself with this one
>>
File: 1731670866796147.png (1.17 MB, 816x1280)
kek

the woman in image1 is using the pose of image2. she is upside down doing a handstand.

pose was from an elena fanart pic from street fighter
>>
File: 00004-3460682441.png (2.52 MB, 1248x1824)
>>
File: ComfyUI_41915_.png (2.85 MB, 1456x1872)
>>106784838
been using cyberfix since it was brand new and then slowly moved on to shitmixes, now im messing around with base noob at lower denoise and trying some janky upscale methods because thats where the fun is at

>>106785317
good texture on that one!
>>
File: 1740431402266263.png (1.21 MB, 1416x2120)
>>
File: 1744359736798458.png (1.4 MB, 816x1280)
>>106785322
depth map of miku, interesting
>>
File: Sora is dead!.png (347 KB, 1170x2532)
I got Cloudkek'ed...
>>
File: VirginApi.png (469 KB, 1892x1038)
>>106785411
>>
File: 1751166564213945.jpg (811 KB, 1416x2120)
epsilon scaling...
>>
File: 1758956706002508.png (1.19 MB, 816x1280)
the woman in image1 is wearing the outfit from image2. she has very large breasts.

had to add the second part cause the source image girl was flat.
>>
>>106785420
yeah you can laugh about me, I deserved it :(
>>
File: 1751683619016045.png (1.22 MB, 816x1280)
>>106785424
sf5 laura example:
>>
File: 1758528917282453.jpg (721 KB, 2120x1416)
>>
>>106785424
>>106785432
Why don't you customize Ivy so she's only wearing base underwear. Then you won't get undesirable leftovers when running the image of her through AI.
>>
>>106785479
yeah, that's the optimal way to do swaps. im just testing the new version out, but neat how it works even as-is.
>>
File: 00102-2116386236.png (2.68 MB, 1248x1848)
the thing about noob that blows me away, is how well it knows characters that have 100ish representation in the training data. this model shouldn't be as good as it is. I am actually glad this dude got corpo scooped.
>>
>>106785486
>>106785500
based
>>
>>106785500
just do futa hommie
>>
>>106785484
Only pointing it out because I want that school outfit on Ivy without the stuff from her official outfit.
>>
>find incredible artist
>only ~20 usable images
>unable to decide if worth training
>>
File: 1746096865564720.png (1.18 MB, 896x1160)
literally him
>>
File: 1757237063399769.png (998 KB, 1360x768)
the man in image1 holds up a framed picture of image2 with his hands.
>>
>>106785611
>>
>>106785611
I want to fuck it
>>
>>106785620
>>
File: 00012-2362717104.png (2.33 MB, 1344x1728)
is it me or is the majority of ai hate on 4chan and reddit coming from zoomers and gen alpha?
>>
>>106785632

I can see why. They are too young/broke to afford compute or API access. Also they are strictly forbidden from using it in school work. Their future career choices are also kneecapped by AI advancements.
>>
>>106785632
Whole world got rugpulled from them and they don't even get real art anymore. I get it.
>>
File: 1746659361875555.mp4 (1.57 MB, 720x1056)
>>106785502
>>
>>106785549
post link
>>
File: 1756371104191261.png (529 KB, 1112x936)
>>
>>106785677
"real art" hasn't gone anywhere / was already absent from mass media
>>
>>106785729
>her horns are also jiggling
Kek
>>
File: 1731652747674234.jpg (761 KB, 1416x2120)
>>
>>106785748
>was already absent from mass media
Not to this extent. For the record yes I also hate CGI.
>>
>>106785632
Restricting it generationally is gay and retarded
>>
File: 1738717135917499.png (3.43 MB, 1416x2120)
>>
File: 1730580713152693.png (1.03 MB, 1360x768)
the man is holding up a framed picture of Hatsune Miku.

fp8 scaled seems to work pretty good, also pretty fast
>>
File: 1754851330382695.png (1.05 MB, 1360x768)
>>106785791
replace the man with Hatsune Miku wearing the same clothes.
>>
File: 1738127988545256.png (550 KB, 1546x2120)
>>106785632
No, it's coming from the top minds of our earth
>>
>>106785322
how does it work with troonime?
>>
>>106785819
any image will do, controlnet stuff works with realism or anime.
>>
>>106784597
Wtf is a rightsholder?
Is he talking about shareholders?
>>
the asian woman in the red dress is flying in the air, doing a kung fu kick.

7mb too big for this site...

https://files.catbox.moe/c3mytw.png
>>
File: 1730393180814514.mp4 (1.47 MB, 720x1200)
>>106785749
Kept getting gens where they would jiggle even more than that. Could be worse though, like double alien head.
>>106785486
>>
File: 1732751587825707.png (1.24 MB, 992x1048)
the asian woman in the red dress is wrestling in a pool of water with an asian woman that looks exactly like her.

neat, it worked
>>
File: 1749771025134340.png (1017 KB, 992x1048)
>>106785863
>>
File: 1738617855155404.png (1.07 MB, 992x1048)
>>106785888
>>
File: 1747471023197988.png (931 KB, 992x1048)
can you dispute this, SAAS anons?
>>
File: 1741834745567904.png (1.01 MB, 992x1048)
>>106785898
this is true btw, he used a pod to "adopt" a kid
>>
File: 00039-2115849513.png (2.86 MB, 1280x1920)
>>106785729
still like that old time rock 'n' roll, that kind of music just soothes the soul
>>
>>106785837
The person or company that holds rights to whatever the content is. For example if you create a cameo, you are a rightsholder. Pokemon? Nintendo is the rightsholder, etc.
>>
File: 1759338003842202.png (795 KB, 1064x984)
the pink hair anime character with 4 white legs is dressed as a pirate. keep their pose the same.
>>
File: 1754541517050649.png (822 KB, 1064x984)
the pink hair anime character with 4 white legs is dressed as a spaceman. keep their pose the same.

neat! space doro
>>
>>106785815
Holy shit, most of the redditors on that anti ai subreddit are literally young kids and teens. Notice them lately brigading multiple other subreddits, mass downvoting pro ai posts and mass reporting ai posts.
>>
>>106785898
sora is the only generator in existence that lets me gen lewd asmr doe?
>>
File: 1742868312190498.png (805 KB, 1064x984)
>>106785940
scientist doro

this model is so versatile.
>>
File: 1744131568873593.png (612 KB, 1064x984)
>>106785954
make a sexy anime girl in the style of this image. the background is white.

not far off dorothy desu
>>
USE THIS WAN 2.2 TUNE.
https://civitai.com/models/1995784?modelVersionId=2260110

It legit changes everything. It fixes ALL of the light loras' flaws, this shit is mega quick with amazing motion and it does not need loras for nsfw

Meant to post here instead of the regular SD thread
>>
>>106785968
>slopmix lora merger
how does it """"fix"""" things if it's just a shitty merge?
>>
>>106785978
its not, its a major finetune + light lora merge
>>
>>106785984
>its a major finetune
source?
>>
>>106785987
its a major finetuner, they finally released a video model
>>
>>106785997
>finetuner
there's a difference between slopmixes and finetunes
>>
>>106785968
what did they change in it?
>>
File: ComfyUI_00139_.mp4 (479 KB, 640x640)
>>106785997
>major finetuner
>>
File: 1746934410853505.png (870 KB, 1360x768)
>Mr Altman will decide what you can and can't prompt despite you paying money
>>
legit just try it, im getting better results than I got with regular wan doing full 30 steps without light loras
>>
>>106786022
can you use lightx2v loras?
>>
>>106786022
gonna need a GGUF before i can try anything
>>
>>106785987
https://files.catbox.moe/rf2zud.mp4
https://files.catbox.moe/sukxln.mp4
no loras
>>
>>106786024
its already low steps, 6-8
>>
>>106786030
>pure slop
>no loras
we know
>>
>>106786030
and you chose blowjob? why? that was already overly trained. do some missionary or doggystyle. hell, show some vag without the pink blobs. also stop using shitty input images with plastic skin
>>
>>106785968
>Checkpoint Merge
>All gens slop.
>>
>>106785968
>merge every NSFW lora into one model
>call it a finetune
LOL
>>
>>106786047
nah, check their page, they finetune, its called smooth mix but its not just a mix
>>
File: 1728833621398708.mp4 (1.45 MB, 720x1072)
>>106785921
>>
>>106786054
Type: Checkpoint Merge
>>
>>106786030
>Happy about a slop merge
>Happy about nsfw out of the box
>Looks worse than irl stuff
This is just sad. Be better.
>>
am I being trolled? this is legit incredible for something that takes 15 seconds to gen
>>
I'll try, anon
>>
>>106785902
can it transfer styles or do i2i with low denoising?
>>
dont care.
wan with sora-tier audio when
>>
File: 1735844052128886.png (1012 KB, 1280x816)
>>
>>106786084
it can copy/emulate text styles and artstyles, and you can manipulate stuff with prompts, it's pretty neat.
>>
>>106786055
liquid fluid there is impressive
>>
>>106786099
nta but can i get the workflow catboxed
>>
>>106786108
it's just the default template for qwen edit in comfy. add the 8 step lora (qwen image lightning v2.0) and that's it. works better than the edit lora 1.0.
>>
File: ComfyUI_00074_.png (1.6 MB, 1280x1693)
>>106786055
gg, i don't prompt talon that much, I ain't got any other good ones, random cammy tho
>>
File: 1751378412267772.png (858 KB, 1288x808)
the man is sitting at a desk at a computer, in front of a large mic. Behind him is a banner saying "DID YOU KNOW I WORKED AT BLIZZARD ENTERTAINMENT?" with a Blizzard Entertainment logo below the text.
>>
File: 1753668764180332.png (934 KB, 1288x808)
>>106786141
fixed no typo
>>
File: 1732450634949199.png (888 KB, 1288x808)
>>106786149
k last one, better colors.
>>
>>106786154
i prefer the yellow
>>
File: 1734282921406203.png (954 KB, 1360x768)
change the headline from "heavy israel airstrikes on beirut suburbs" to "BREAKING NEWS: IM GETTING DRUNK!". The man at the top right of the image is drinking a bottle of jack daniels.

kek, it worked
>>
File: 00108-1381144718.png (1.48 MB, 2048x512)
>>
File: 1742595940061500.png (946 KB, 1360x768)
>>106786180
>>
File: 1752279579038640.png (902 KB, 1360x768)
>>106786184
Remove the bright explosion from the image.

holy shit man, with AI tools how do you trust anything any more?
>>
>>106786181
really nice
>>
File: 1734300095276847.png (913 KB, 1360x768)
>>106786193
In place of the explosion, add several small condo buildings slightly lit up with street lights.

everything is a lie.
>>
File: 1759424092270255.png (3.07 MB, 1416x2120)
>>
>>106786194
Thanks.
>>
File: 1741632792153346.png (1.17 MB, 1024x1024)
>>
>>106785611
That's great. Giger wanted xeno variant with human female mouth
>>
If AniStudio is so good, how come it uses Python despite gloating cpp and my 1050 bros can't use it?!
>>
File: ComfyUI_06428_.png (3.29 MB, 2880x2160)
>>106786424
Sorry, forgot image.
>>
File: ComfyUI_06424_.png (2.31 MB, 2880x2160)
I was lied to! Ani is nothing but garbage! It's true because my Perfect form can't lie!
>>
>worlds most melted image
>perfect
imperfect cell
>>
>>106786467
I'm fucking TRYING!!
>>
File: ComfyUI_06433_.png (3.44 MB, 2880x2160)
>>106786467
My point still stands.
>>
>>106784383
namefag alert
>>
>>106786440
protip: use adetailer, very useful tool for getting perfect faces/eyes on most gens. use it for all my anime stuff in reforge.
>>
>wan animate
wtf is this model supposed to be good at?
the output quality is so low, and when the character turns around the back view is basically corrupted
>>
>>106786527
I umm... use Comfy and don't know how to use [spoiler]ADetailer[/spoiler]
>>
>>106786541
WOW AND I ALREADY FUCKED IT UP.
>>
> why local never will die
40% hentai
20% porn
10% anime
10% insta scam
10% brainrot
5% ad
5% pets
>>
Become skilled enough and you will never need to use detailers
>>
>>106786566
You forgot furry mi amigo.
>>
>>106786424
give it some more time. its in alpha currently which means chads only.
>>
>>106786533
fuck me
why is nobody talking about this wan animate? what's the verdict on this model?
>>
>>106786588
in a forum, we would have a subcategory for this with threads such as best settings, experiments, etc.

internet is dead
>>
File: ComfyUI_06432_.png (3.81 MB, 2880x2160)
>>106786586
If AniStudio was so good, wouldn't they allow IL models?
>>
>>106786625
... it does
>>
File: ComfyUI_06430_.png (3.99 MB, 2880x2160)
>>106786653
IT DOES? LIES! SHOW ME!
>>
I would prefer a desktop solution with a node-based backend, but with the option of exporting each workflow to an nice frontend
>>
>>106786717
Do SwarmUI. It does that nicely. You can import to ComfyUI later.
>>
>>106786725
How up-to-date is it? Are there custom nodes such as comfyui that cover things like wdtagger, joycaption, seedvr, yolo models, sam, etc.?
>>
I tried that smoothmix wan 2.2, it's just producing mangled and low quality results. What gives?
>>
File: ComfyUI_06630_.png (3.43 MB, 2048x2048)
>>106786748
It works like A1111 but it has a tab in which you can use ComfyUI. My buddy lost access to it because PyTorch's latest update dropped support for the 10 series NVIDIA cards; it has worked for him ever since, albeit at kinda lower quality.
>>
>>106786766
It's a shitty loramix.
>>
>>106786572
Use detailers enough and you will never need to be skilled
>>
>>106786778
But the guy has good gens. This is just straight up broken stuff I'm getting. The motion itself is great.
>>
>>106786717
Yeah this is what we need. I think comfy has something in the works to make a streamlined UI from a workflow. You could just make an app that sits on top of comfy and uses the api. The krita plugin does this basically but it's still missing a lot of critical functionality (well it was last time I tried it like 6 months ago) and is full of jank.
>>
>>106786794
Hang on, fp16 accumulation. Isn't that only needed when you use fp16 models for wan?
I'm getting blistering speeds compared to my usual workflow. 100s for a 1280x720 video compared to around 580s on my own workflow, without the fp16 accumulation.
>>
>>106786890
dude desu it sounds like you dont know what youre talking about and just trying shit without understanding anything youre doing. I suggest to take a step back, put your dick back in your pants and start using your retarded brain for once.
you're welcome
>>
>>106786970
It was the wan 2.2 t2v speed lora that was incompatible with the model. The results are great now.
>>
the SOTA porn model is a SaaS model
>>
>>106786998
also it's fucking ugly
>>
Do we have a single good open source audio architecture? I want to train a generalized speech & sfx model but everything seems to be shit
>>
>>106787021
for speech vibevoice has training I think? Not sure about esseffex
>>
>>106787028
vibevoice WOULD'VE had training code if microsoft weren't pussies
>>
What's the difference between these two sage attention nodes?
>>
>>106787040
one is for kj wrapper one is for comfy workflow
>>
>>106787045
But they still do the same thing? The selections are different between them.
>>
>>106787048
are you using kijai's wrapper or comfy native
>>
>>106787048
man turn your brain on.
one has the control embedded in the loader, the other is a dedicated control. fucking retard
>>
>>106787057
>>106787060
I'm hopping between the both of them.
Kijai's is taking much longer and mainly uses the GPU. Comfy native brings my CPU up to 70%.

This made me wonder about the two nodes.
How are you supposed to know the control is embedded when you can't even swap it out?
>>
>>106787121
is this your literal 1st workflow? you're a fucking retard
>>
bros brain is running on one cylinder
>>
>add --fast fp16_accumulation
>can't use String to float list node in my workflow any longer

But why.

>>106787125
Your mother isn't my first.
>>
File: file.png (589 KB, 3087x1340)
When I am using the WAN2.2 I2V like pic related, it works wonderfully for the things I am using it for.
But when I try to up the video length from 5 to 10 seconds, the video becomes all blurry and/or introduces a ghosting effect. Any ideas why?

Also, if I try to change the LoRAs to anything else, the image instantly becomes a blurry mess, why?
>>
File: kuno.png (2.75 MB, 1296x1728)
This is technically an upscale glitch but I love the way hair turns out like this.
>>
>>106787180
https://desuarchive.org/g
>>
>>106787186
Doesn't say me anything
>>
>>106787202
search nigga
>>
>>106787204
> WAN2.2 I2V
> Looks up every single thread as it is in the starting post
> WAN2.2 I2V ghosting
One post
> WAN2.2 I2V blurry
One post

Very good.
>>
weakest bait i've seen in a long time
>>
>>106787218
You're in the thread where the one schizo jeet is awake.

Wan can't really gen past 5 seconds, loses context hard.
And when you disable the speed loras, you're leaving the gen at too few steps to finish the result. Up the steps to like 30 and raise the cfg to like 3.

This will take like 20x longer to gen.
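
A rough back-of-envelope for that slowdown. Assumptions not in the post: the speed-lora setup runs 4 steps at cfg 1, and any cfg above 1 costs two model passes per step (conditional plus unconditional), which is how classifier-free guidance is usually implemented. Treat it as a sketch, not a benchmark.

```python
def model_passes(steps: int, cfg: float) -> int:
    """Approximate number of model forward passes for one gen."""
    # cfg 1 needs only the conditional pass; anything else adds the unconditional one.
    passes_per_step = 1 if cfg == 1.0 else 2
    return steps * passes_per_step

fast = model_passes(steps=4, cfg=1.0)   # assumed speed-lora settings
slow = model_passes(steps=30, cfg=3.0)  # settings suggested in the post above
slowdown = slow / fast
print(f"{fast} passes vs {slow} passes, about {slowdown:.0f}x slower")
```

Under those assumptions it comes out to 15x, so "like 20x" is the right ballpark once other overhead is counted.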
>>
>check civitai
> no new qwen or wan loras
suffering

>>106787202
>say me
hello mister german, ze wetter is gut ja?
>>
>>106787180
try the 2.1 light lora at 3.00 strength on high
>>
>>106786877
Well, if I had the money, I would have comfyUI and invokeAI create a baby for the desktop and strategically position the brand against Adobe
you invest a few tens of millions of dollars here and sell the thing in two years when you have expanded into the professional market for a billion

with money, it's so easy to make more money
>>
>>106787180
> 10 seconds
are you genuinely retarded or just not aware that wan is not capable of generating longer than 6 second videos without eating total shit?
>>
>>106787254
>>106787314
Thanks for replying at least.

Didn't know there was a 5-second limit, as I've seen some people/workguides claim "unlimited" or 10-second videos.

>>106787296
Thanks for the tip, will see if it works.

>>106787293
Not German, but close enough, mein kamerad.
>>
>>106785968
>It fixes ALL of light loras flaws
which are? doesn't look much better, if at all, than my regular q8 unipc wan
>>
>>106784974
How quaint very cool
>>
>>106787341
>Didn't know there was a 5-second limit, as I've seen some people/workguides claim "unlimited" or 10-second videos.
you can continue videos by grabbing the last frame of the first gen and genning from there, look at loop wf from https://civitai.com/models/1818841/wan-22-workflow-t2v-i2v-t2i-kijai-wrapper
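
The last-frame trick can be sketched roughly like this. `generate_segment` is a hypothetical stand-in for whatever I2V call your workflow makes, and frames are just string labels here; the sketch also assumes each clip's first frame is the image you fed in.

```python
from typing import Callable, List

Frame = str  # stand-in for an image; a real workflow would pass tensors or files

def chain_segments(
    start_frame: Frame,
    generate_segment: Callable[[Frame], List[Frame]],
    num_segments: int,
) -> List[Frame]:
    """Generate several short clips, each starting from the last frame of the previous one."""
    video: List[Frame] = []
    current = start_frame
    for _ in range(num_segments):
        segment = generate_segment(current)
        # After the first clip, skip frame 0: it repeats the previous clip's last frame.
        video.extend(segment if not video else segment[1:])
        current = segment[-1]
    return video

def fake_i2v(first: Frame) -> List[Frame]:
    """Hypothetical generator: returns the input frame followed by 3 new frames."""
    return [first] + [f"{first}>{i}" for i in range(1, 4)]

clip = chain_segments("img", fake_i2v, num_segments=2)
print(len(clip))  # 4 frames from the first clip + 3 new ones from the second
```

Each hop only sees a single frame, so motion from the earlier clip isn't carried over to the next one.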
>>
File: 1612989104780.jpg (56 KB, 365x365)
>>106787398
Thanks homie.
>>
Trying to do first-last frame with wan but the output isn't even close to either the first or last frame. Not sure what's going wrong.
>>
>>106787507
you didn't connect the vision nodes
your prompt is ass
the images aren't connected
etc
>>
>>106787398
>uses retarded custom nodes for shit tht can be done natively

DIE NOW
YOU MUST DIE INSTANTLY
>>
>>106787519
>you didn't connect the vision nodes
I don't think that's the case, I checked all the links many times.
>your prompt is ass
Very likely, but does that matter for my issue? I thought the whole point is that it starts with the start frame, so at least that should look like the original image, right? It looks nothing like it.
>the images aren't connected
Not the case.
>>
it's another episode of anon expects us to divine the information related to his specific problem
>>
>>106787551
it would save everyone time if you posted the actual workflow instead of making us guess.
>>
>>106787534
a price to pay for having a non-toy workflow that just works
>>
File: wtf_.jpg (454 KB, 1498x1153)
>>106784371
>4%
>(1/28)
>38GB
>!!!!!!!!!
>nowhere in the ls is a file listed as being larger than 10 GB
wtf is happening here?
how the fuck big can this thing be for chrissake?
>>
>>106787575
Sorry, I am dumb. It's just the one from the templates since I assumed that would just werk.
I'll just keep trying until I figure it out, I just need to know one thing. If it's working properly, the video should at least start looking the exact same as the start frame image. Correct?
>>
>>106787182
god damn, yeah her hair
uhuh
the haiiirrr
>>
Fresh

>>106787650
>>106787650
>>106787650
>>106787650
>>
>>106787598
>nowhere in the ls is a file listed as being larger than 10 GB
so what? it's adding up, easy to see.

you're downloading multiple formats of the same thing btw
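
To see how files under 10 GB each still add up, a minimal sketch. The sizes below are made up for illustration, not the actual listing from the screenshot, and the folder you pass to `total_size_gb` is whatever you downloaded into.

```python
from pathlib import Path

def total_size_gb(folder: str) -> float:
    """Sum the sizes of all regular files under folder, in GB (10**9 bytes)."""
    total_bytes = sum(p.stat().st_size for p in Path(folder).rglob("*") if p.is_file())
    return total_bytes / 1e9

def describe(sizes_gb: list[float]) -> str:
    """Summarize a list of file sizes: count, largest file, and total."""
    return f"{len(sizes_gb)} files, largest {max(sizes_gb):.1f} GB, total {sum(sizes_gb):.1f} GB"

# Made-up sizes for illustration only.
print(describe([9.5, 9.5, 9.5, 9.5]))
```

Four files under 10 GB already hit 38 GB; run `total_size_gb` on the download folder to check the real number.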
>>
>>106785997
>they
and of course the wokie is full of shit
>>
got myself a 5080 today, would be happy to contribute but got no fucking idea where to even begin
>>
>>106785193
Yolo can't detect everything, let alone precise items like specific types of shoes or vests (well, except blazers apparently). It's possible to train it for specific tasks tho, so it's still possible to do something with it.
>>
>>106785222
It doesn't look that good, even on their cherry picked examples. Maybe there is some value to it.


